Abstract: Recent advancements in Vision-Language Models (VLMs) have demonstrated strong potential for autonomous driving tasks. However, their spatial understanding and reasoning-key capabilities for ...
Abstract: We investigate on potential improvements to reasoning methods for topological and directional spatial information in OWL. Building upon path consistency, the new reasoner design, referred to ...