Breaking the 2D Dependency: What Limits 3D-Only Open-Vocabulary Scene Understanding
ViSketch-GPT: Collaborative Multi-Scale Feature Extraction for Hand-Drawn Sketch Retrieval
Learning Egocentric In-Hand Object Segmentation through Weak Supervision from Human Narrations