- I am currently a Ph.D candidate at SCUT, supervised by Lei Zhang.
- I love coding and making video demos.
- Email 📫: mountchicken@outlook.com
- Rex-Omni
- Detect Anything via Next Point Prediction
- Rex-Thinker
- Grounded Object Referring via Chain-of-Thought Reasoning
- ChatRex
- Taming Multimodal LLM for Joint Perception and Understanding
- RexSeek
- Referring to Any Person
- T-Rex2
- Towards Generic Object Detection via Text-Visual Prompt Synergy
- DINO-X
- A Unified Vision Model for Open-World Object Detection and Understanding
- Grounding DINO 1.5
- Advance the "Edge" of Open-Set Object Detection
- T-Rex
- Counting by Visual Prompting
- Resophy
- Agentic Paper Reading Tool
- CodeCookbook
- Cookbook to Craft Good Code
- MMOCR
- OpenMMLab OCR Toolbox
- Scene Text Recognition Recommendations
- Latest papers, datasets, and SOTA methods about OCR




