Models

We open-source a total of 324 TD-MPC2 model checkpoints, including 12 multi-task models (ranging from 1M to 317M parameters) trained on 80, 70, and 30 tasks, respectively. We are excited to see what the community will do with these models, and hope that our release will encourage other research labs to open-source their checkpoints as well.

Multi-task Checkpoints

We recommend starting out with the 48M checkpoints, which strike a good balance between model size and performance. The 317M checkpoints are the largest models we trained, and achieve the best performance on both task sets. However, they are also the most expensive to train and evaluate. Note that "Score" is not comparable across task sets.

Tasks Domains Params Score Link
80 DMControl + Meta-World 1M 15.98 Download
80 DMControl + Meta-World 5M 49.45 Download
80 DMControl + Meta-World 19M 57.13 Download
80 DMControl + Meta-World 48M 68.03 Download
80 DMControl + Meta-World 317M 70.63 Download
70 DMControl + Meta-World 5M 49.3 Download
70 DMControl + Meta-World 19M 67.0 Download
30 DMControl 1M 18.93 Download
30 DMControl 5M 28.32 Download
30 DMControl 19M 54.22 Download
30 DMControl 48M 59.43 Download
30 DMControl 317M 71.41 Download

Single-task Checkpoints

We organize single-task checkpoints by task domain, totalling 312 checkpoints for 104 tasks across 4 domains. Most, but not all, models were trained to convergence and produce expert behavior.

Tasks Domain Params Checkpoints Total size Link
39 DMControl 5M 117 3.6GB Download
50 Meta-World 5M 150 4.9GB Download
5 ManiSkill2 5M 15 505MB Download
10 MyoSuite 5M 30 1.0GB Download

We also provide raw per-task and per-seed scores for all 104 tasks. They can be downloaded here.