[1]
“Real-Time 3D Scene Understanding with Vision-Language Models”, IJRAI, vol. 8, no. 3, pp. 12255 – 12257, May 2025, doi: 10.15662/IJRAI.2025.0803001.