Home
Benchmarks
Simulation benchmarks for embodied AI evaluation
Tasks
Manip
Manipulation
Nav
Navigation
WBC
Whole-Body Control
Observation modalities
RGB
Color camera
RGB-D
RGB + Depth
S
Semantic segmentation
PC
Point cloud
Pose
Joint/end-effector pose
Benchmark
Task
Scenes / Objects
Observation
Physics
Built Upon
Embodiments
Description
robosuite
Manip
— / 10
RGB-D
S
MuJoCo
—
Fixed Base
Panda
PandaDexRH
PandaDexLH
Sawyer
IIWA
Jaco
UR5e
Baxter
Kinova3
XArm7
Wheeled
PandaOmron
Tiago
Legged
SpotWithArm
SpotWithArmFloating
GR1
GR1FixedLowerBody
GR1ArmsOnly
GR1FloatingBody
Modular framework, 11 tasks
RoboCasa
Manip
120 / 2.5K
RGB
MuJoCo
robosuite
Franka Panda (mobile)
100 kitchen tasks, photorealistic
LIBERO
Manip
— / —
RGB
MuJoCo
robosuite
Franka Panda
130 tasks in 4 task suites
Meta-World
Manip
1 / 80
Pose
MuJoCo
—
Sawyer
50 Manip tasks for Meta-RL
LeVERB-Bench
Nav
WBC
4 / —
RGB
PhysX (Isaac Sim)
Isaac Sim
Unitree H1
Humanoid control
ManiSkill
Manip
— / 162
RGB-D
PC
S
PhysX
SAPIEN
Franka Panda
XArm
4 tasks, 36K demos
ManiSkill 2
Manip
— / 2.1K
RGB-D
PC
PhysX
ManiSkill
Franka Panda
XArm
Extended task diversity
ManiSkill 3
Nav
Manip
WBC
— / —
RGB-D
PC
S
PhysX
ManiSkill 2
Franka Panda
Unitree H1
Fetch
GPU-parallelized simulation
ManiSkill-HAB
Manip
105 / 92
RGB-D
PhysX
ManiSkill 3, Habitat 2.0
Fetch
HAB tasks from Habitat 2.0
RoboTwin
Manip
— / 731
RGB-D
PhysX
SAPIEN
AgileX AgiBot G1 (dual-arm)
Dual-arm tasks
Ravens
Manip
— / —
RGB-D
PyBullet
—
UR5
10 tabletop tasks
VIMA-BENCH
Manip
— / 29
RGB
S
PyBullet
Ravens
UR5
17 multimodal prompt tasks
LoHoRavens
Manip
1 / 3
RGB-D
PyBullet
Ravens
UR5
Long-horizon planning
CALVIN
Manip
4 / 7
RGB-D
PyBullet
—
Franka Panda
Long-horizon lang-cond tasks
Habitat
Nav
185 / —
RGB-D
S
Bullet
—
LoCoBot
Fast, Nav only
Habitat 2.0
Nav
Manip
105 / 92
RGB-D
Bullet
Habitat
Fetch
Spot
Mobile manipulation (HAB)
Habitat 3.0
Nav
Manip
211 / 18K
RGB-D
Bullet
Habitat 2.0
Fetch
Spot
Human avatars support
RLBench
Manip
1 / 28
RGB-D
S
PyBullet
V-REP
Franka Panda
Tiered task difficulty
THE COLOSSEUM
Manip
1 / 107
RGB-D
PyBullet
RLBench
Franka Panda
20 tasks, 14 env variations
AI2-THOR
Nav
Manip
— / 118
RGB-D
S
Unity
—
LoCoBot
Object states, task planning
CHORES
Nav
191K / 40K
RGB
Unity
AI2-THOR
LoCoBot
Shortest-path planning
SIMPLER
Manip
4 / 17
RGB
PhysX
SAPIEN, Isaac Sim
Google Everyday Robot
WidowX 250
Real-to-sim evaluation
RoboArena
Manip
— / —
RGB
Real
—
Real-world (unspecified)
Distributed real-world evaluation