# Human Pose Estimation: 2D/3D Keypoint Detection and Transformer-Based Body Tracking Status: public Confidence: medium (0.78) (verified) Last verified: 2026-05-28 Generation: ai_structured ## TL;DR Human pose estimation predicts body keypoints from images or video. Modern systems moved from single-person coordinate regression toward real-time multi-person parsing and high-resolution feature representations. ## Core Explanation The task usually asks a model to locate joints such as shoulders, elbows, wrists, hips, knees, and ankles. Some systems estimate a single person's pose; others first detect body parts and then associate those parts with multiple people in a scene. ## Detailed Analysis Reliable evidence should stay anchored to benchmark methods and architecture changes rather than broad claims about surveillance, sports, or health outcomes. DeepPose, OpenPose, and HRNet give a compact, well-sourced progression from direct regression to multi-person association and high-resolution representation learning. ## Related Articles - [3D Human Modeling: Parametric Body Models, Mesh Recovery, and Digital Avatars](../3d-human-modeling.md) - [AI for Augmented Reality: Real-Time Object Detection, Depth Estimation, and Scene Understanding](../ai-for-augmented-reality-real-time-object-detection-depth-estimation-and-scene-understanding.md) - [AI for Ocean Monitoring: Marine Life Detection, Plastic Pollution Tracking, and Oceanographic AI](../ai-for-ocean-monitoring.md)