Benchmark results
Checkpoint: nvidia/GR00T-N1.6-3B (zero-shot evaluation)

| Task | Success rate |
|---|---|
| CoffeeSetupMug | 31.0% |
| CoffeeServeMug | 63.5% |
| CoffeePressButton | 98.5% |
| OpenSingleDoor | 81.5% |
| OpenDoubleDoor | 39.0% |
| CloseSingleDoor | 96.0% |
| CloseDoubleDoor | 88.5% |
| OpenDrawer | 81.1% |
| CloseDrawer | 100.0% |
| TurnOnMicrowave | 91.5% |
| TurnOffMicrowave | 96.0% |
| PnPCounterToCab | 47.5% |
| PnPCabToCounter | 41.0% |
| PnPCounterToSink | 46.0% |
| PnPSinkToCounter | 50.0% |
| PnPCounterToMicrowave | 19.0% |
| PnPMicrowaveToCounter | 24.5% |
| PnPCounterToStove | 63.2% |
| PnPStoveToCounter | 54.5% |
| TurnOnSinkFaucet | 89.0% |
| TurnOffSinkFaucet | 93.5% |
| TurnSinkSpout | 87.0% |
| TurnOnStove | 76.5% |
| TurnOffStove | 31.0% |
| Average | 66.22% |
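
The Average row can be reproduced from the per-task numbers. The sketch below assumes the average is an unweighted mean over the 24 tasks listed in the table; the dictionary name `success_rates` is only illustrative and is not part of the evaluation code.

```python
# Success rates (%) copied from the table above, in table order.
success_rates = {
    "CoffeeSetupMug": 31.0, "CoffeeServeMug": 63.5, "CoffeePressButton": 98.5,
    "OpenSingleDoor": 81.5, "OpenDoubleDoor": 39.0, "CloseSingleDoor": 96.0,
    "CloseDoubleDoor": 88.5, "OpenDrawer": 81.1, "CloseDrawer": 100.0,
    "TurnOnMicrowave": 91.5, "TurnOffMicrowave": 96.0,
    "PnPCounterToCab": 47.5, "PnPCabToCounter": 41.0,
    "PnPCounterToSink": 46.0, "PnPSinkToCounter": 50.0,
    "PnPCounterToMicrowave": 19.0, "PnPMicrowaveToCounter": 24.5,
    "PnPCounterToStove": 63.2, "PnPStoveToCounter": 54.5,
    "TurnOnSinkFaucet": 89.0, "TurnOffSinkFaucet": 93.5,
    "TurnSinkSpout": 87.0, "TurnOnStove": 76.5, "TurnOffStove": 31.0,
}

# Unweighted mean across all tasks (assumption: every task counts equally).
average = sum(success_rates.values()) / len(success_rates)
print(f"{len(success_rates)} tasks, average {average:.2f}%")
```

Under that assumption the result rounds to the 66.22% reported in the table.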
Evaluation
Setup environment
Install the required dependencies (only needs to be done once):

Run evaluation
Available tasks
All tasks use the `robocasa_panda_omron/` prefix with the `_PandaOmron_Env` suffix:
Coffee station tasks
- `CoffeeSetupMug_PandaOmron_Env` - Set up mug for coffee
- `CoffeeServeMug_PandaOmron_Env` - Serve coffee in mug
- `CoffeePressButton_PandaOmron_Env` - Press coffee machine button
Door and drawer tasks
- `OpenSingleDoor_PandaOmron_Env` - Open single cabinet door
- `OpenDoubleDoor_PandaOmron_Env` - Open double cabinet doors
- `CloseSingleDoor_PandaOmron_Env` - Close single cabinet door
- `CloseDoubleDoor_PandaOmron_Env` - Close double cabinet doors
- `OpenDrawer_PandaOmron_Env` - Open drawer
- `CloseDrawer_PandaOmron_Env` - Close drawer
Appliance tasks
- `TurnOnMicrowave_PandaOmron_Env` - Turn on microwave
- `TurnOffMicrowave_PandaOmron_Env` - Turn off microwave
- `TurnOnSinkFaucet_PandaOmron_Env` - Turn on faucet
- `TurnOffSinkFaucet_PandaOmron_Env` - Turn off faucet
- `TurnSinkSpout_PandaOmron_Env` - Rotate sink spout
- `TurnOnStove_PandaOmron_Env` - Turn on stove burner
- `TurnOffStove_PandaOmron_Env` - Turn off stove burner
Pick and place tasks
- `PnPCounterToCab_PandaOmron_Env` - Counter to cabinet
- `PnPCabToCounter_PandaOmron_Env` - Cabinet to counter
- `PnPCounterToSink_PandaOmron_Env` - Counter to sink
- `PnPSinkToCounter_PandaOmron_Env` - Sink to counter
- `PnPCounterToMicrowave_PandaOmron_Env` - Counter to microwave
- `PnPMicrowaveToCounter_PandaOmron_Env` - Microwave to counter
- `PnPCounterToStove_PandaOmron_Env` - Counter to stove
- `PnPStoveToCounter_PandaOmron_Env` - Stove to counter
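
The naming convention makes it easy to convert between the short task names used in the results table and the full environment names listed here. The helpers below are a minimal sketch: `full_env_name` and `short_task_name` are hypothetical names chosen for illustration, not functions from the repository, and how the resulting string is passed to the evaluation script is not shown.

```python
ENV_PREFIX = "robocasa_panda_omron/"
ENV_SUFFIX = "_PandaOmron_Env"


def full_env_name(task: str) -> str:
    """Build the full environment name from a short task name (hypothetical helper)."""
    return f"{ENV_PREFIX}{task}{ENV_SUFFIX}"


def short_task_name(env_name: str) -> str:
    """Strip the prefix and suffix to recover the short task name (hypothetical helper)."""
    if not (env_name.startswith(ENV_PREFIX) and env_name.endswith(ENV_SUFFIX)):
        raise ValueError(f"unexpected environment name: {env_name!r}")
    return env_name[len(ENV_PREFIX):-len(ENV_SUFFIX)]


print(full_env_name("CloseDrawer"))
print(short_task_name("robocasa_panda_omron/OpenDrawer_PandaOmron_Env"))
```

Keeping the prefix and suffix as constants means a batch run can loop over the short names from the results table without repeating the boilerplate in each entry.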