D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI