Skip to content

wufeim/SpatialReasonerDataGen

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SpatialReasonerDataGen

Synthetic VQA data generation code for SpatialReasoner from the following paper:

SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning.
Wufei Ma, Yu-Cheng Chou, Qihao Liu, Xingrui Wang, Celso de Melo, Jianwen Xie, and Alan Yuille
Johns Hopkins University
[arXiv] [Project Page]

Installation

Please check INSTALL.md for installation instructions. See Troubleshooting for known issues.

TODO

  • Release visualization code.
  • Visualize step-by-step generation results.

License

This project is released under the CC-BY-4.0 license. Please see the LICENSE file for more information.

Acknowledgements

Citation

If you find this repository helpful, please consider citing:

@article{ma2025spatialreasoner,
  title={SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning},
  author={Ma, Wufei and Chou, Yu-Cheng and Liu, Qihao and Wang, Xingrui and de Melo, Celso and Xie, Jianwen and Yuille, Alan},
  journal={arXiv preprint arXiv:2504.20024},
  year={2025}
}

@inproceedings{ma2025spatialllm,
  title={Spatialllm: A compound 3d-informed design towards spatially-intelligent large multimodal models},
  author={Ma, Wufei and Ye, Luoxin and de Melo, Celso M and Yuille, Alan and Chen, Jieneng},
  booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference},
  pages={17249--17260},
  year={2025}
}

About

Synthetic VQA data generation code for SpatialReasoner.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages