Human3.6M | 2D/3D Pose Estimation | Videos from 4 camera views with poses from motion capture | Human (single-agent) |
MS COCO | 2D Pose Estimation | Images from uncontrolled settings with annotated poses | Human (multi-agent) |
PoseTrack | 2D Pose Estimation & Tracking | Videos from crowded scenes with annotated poses | Human (multi-agent) |
AP-10K | 2D Pose Estimation | Images of diverse animal species with annotated poses | Diverse species (single & multi-agent) |
MARS | 2D Pose Estimation | Videos from 2 camera views with annotated poses | Mouse (multi-agent) |
3D-ZEF | 2D/3D Pose Estimation & Tracking | Videos from 2 camera views with annotated poses | Zebrafish (multi-agent) |
OpenMonkeyStudio | 2D/3D Pose Estimation | Images with annotated poses from a 62-camera setup | Monkey (single-agent) |
PAIR-R24M | 2D/3D Pose Estimation & Tracking | Videos from 12 camera views with poses from motion capture | Rat (multi-agent) |
3DPW | 2D/3D Pose Estimation & Tracking | Videos from a moving phone camera in challenging outdoor settings | Human (multi-agent) |
3DHP | 2D/3D Pose Estimation | Videos from 14 camera views with poses from motion capture | Human (single-agent) |
Rat 7M | 2D/3D Pose Estimation | Videos from 12 camera views with poses from motion capture | Rat (single-agent) |
Kinetics | Video-level Action Classification | Videos from uncontrolled settings that cover 700 human actions | Human (single & multi-agent, may interact with other organisms/objects) |
NTU-RGBD | Video-level Action Classification (also has 3D poses) | Videos from 80 views and depth with 60 human actions | Human (single & multi-agent) |
MultiTHUMOS | Frame-level Action Classification | Videos from uncontrolled settings with 65 action classes | Human (single & multi-agent) |
CRIM13 | Frame-level Behavior Classification | Videos from 2 views, with 13 annotated social behaviors | Mouse (multi-agent) |
Fly vs. Fly | Frame-level Behavior Classification (also has 2D poses) | Videos & trajectory, with 10 annotated social behaviors | Fly (multi-agent) |
CalMS21 | Frame-level Behavior Classification (also has 2D poses) | Videos & trajectory, with 10 annotated social behaviors | Mouse (multi-agent) |
MABe | Frame-level Behavior Classification (also has 2D poses) | Top-down views, 7 annotated keypoints, hundreds of videos | Mouse (multi-agent) |