Elizabeth April 3D 4D 5D

TesserAct: Learning 4D Embodied World Models

We propose TesserAct, the first open-source and generalized 4D World Model for robotics, which takes input images and text instructions to generate RGB, depth, and normal videos, reconstructing a 4D ...

IEEE

L4D-Track: Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream

Abstract: 3D visual language multi-modal modeling plays an important role in actual human-computer interaction. However, the inaccessibility of large-scale 3D-language pairs restricts their ...

