In this thesis, we address the problem of two-dimensional human pose estimation (HPE) from a single viewpoint. While many approaches to estimate the 2D human pose from a single viewpoint exist, the estimated joints' locations with respect to the viewpoint are often disregarded. This limits the overall accuracy of localizing the human body parts. To address this limitation, we define a novel problem in 2D HPE: the Confusion of Body Sides (CBS). We show the CBS problem in many 2D HPE approaches as well as in the state-of-the-art methods. In order to overcome the CBS problem, we introduce SHAPE: Smart Human Articulated Pose Estimation. We demonstrate how SHAPE can be plugged into a 2D HPE algorithm to solve the CBS problem. We report our qualitative and quantitative results on our proposed challenging dataset: "Humans AUC" as well as on two popular HPE benchmark datasets: "KTH Multiview Football dataset II" [1] and "Image Parsing" [2]. Our approach is shown to make a notable 2D HPE approach [3] viewpoint-invariant and enhance the accuracy by 20% on average.


Computer Science & Engineering Department

Degree Name

MS in Computer Science

Graduation Date


Submission Date

May 2017

First Advisor

Moustafa, Mohamed N.

Committee Member 1

Mikhail, Mikhail

Committee Member 2

Mahmoud, Khalil


110 p.

Document Type

Master's Thesis


The author retains all rights with regard to copyright. The author certifies that written permission from the owner(s) of third-party copyrighted matter included in the thesis, dissertation, paper, or record of study has been obtained. The author further certifies that IRB approval has been obtained for this thesis, or that IRB approval is not necessary for this thesis. Insofar as this thesis, dissertation, paper, or record of study is an educational record as defined in the Family Educational Rights and Privacy Act (FERPA) (20 USC 1232g), the author has granted consent to disclosure of it to anyone who requests a copy. The author has granted the American University in Cairo or its agents a non-exclusive license to archive this thesis, dissertation, paper, or record of study, and to make it accessible, in whole or in part, in all forms of media, now or hereafter known.

Institutional Review Board (IRB) Approval

Approval has been obtained for this item


I would also like to recognize SAFRAN-France (MORPHO) for sponsoring a fundamental part of this work under the Research Award Program.