Return to Issue Details Real-Time 3D Scene Understanding with Vision-Language Models Download Download PDF