CASIA-IVA-Lab
Popular repositories Loading
-
ChatBridge
ChatBridge PublicChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without relying on all combinations of paired data.
Repositories
- ChatSearch Public
ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval
CASIA-IVA-Lab/ChatSearch’s past year of commit activity - VRoPE Public
[EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.
CASIA-IVA-Lab/VRoPE’s past year of commit activity - COSA Public
[ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
CASIA-IVA-Lab/COSA’s past year of commit activity - VALOR Public
[TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
CASIA-IVA-Lab/VALOR’s past year of commit activity - MRES Public
This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation", accepted by CVPR 2024.
CASIA-IVA-Lab/MRES’s past year of commit activity - SC-Tune Public
Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"
CASIA-IVA-Lab/SC-Tune’s past year of commit activity - VAST Public
[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
CASIA-IVA-Lab/VAST’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…