Skip to content

shenqq377/IDOW

Repository files navigation

Solving Instance Detection from an Open-World Perspective

Qianqian Shen1 · Yunhan Zhao2 · Nahyun Kwon3 · Jeeeun Kim3 · Yanan Li4 · Shu Kong5,6

1Zhejiang University 2UC Irvine 3Texas A&M University 4Zhejiang Lab 5University of Macau 6Institute of Collaborative

Paper PDF Project Challenge

idow

Movitated by the InsDet's open-world nature, we exploit diverse open data and foundation models to solve InsDet in the open world (IDOW). To better adapt FM for instance-level feature matching, we introduce distractor sampling to sample patches of random background images as universal negative data to all object instances, and novel-view synthesisgenerate more visual references not only training but for testing. Our IDOW outperforms prior works by >10 AP in both conventional and novel instance detection settings.

Code

The project is built on detectron2, SAM, GroundingDINO, and DINOv2.

The Jupyter notebooks files demonstrate our IDOW on HR-InsDet dataset and RoboTools dataset.

Citation

If you find our project useful, please consider citing:

@inproceedings{shen2025solving,
  title={Solving Instance Detection from an Open-World Perspective},
  author={Shen, Qianqian and Zhao, Yunhan and Kwon, Nahyun and Kim, Jeeeun and Li, Yanan and Kong, Shu},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2025}

About

[CVPR 2025] Solving Instance Detection from an Open-World Perspective

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published