Skip to content
Change the repository type filter

All

    Repositories list

    • F1-VLA

      Public
      F1: A Vision Language Action Model Bridging Understanding and Generation to Actions
      Python
      1015140Updated Jan 2, 2026Jan 2, 2026
    • The webpage of InternVLA-A1
      HTML
      0000Updated Jan 1, 2026Jan 1, 2026
    • InternNav

      Public
      InternRobotics' open platform for building generalized navigation foundation models.
      Jupyter Notebook
      6357052Updated Dec 31, 2025Dec 31, 2025
    • VL-LN

      Public
      VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs
      Python
      0900Updated Dec 30, 2025Dec 30, 2025
    • [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
      Python
      06700Updated Dec 29, 2025Dec 29, 2025
    • MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
      Python
      04200Updated Dec 26, 2025Dec 26, 2025
    • NavDP

      Public
      Official implementation of the paper: "NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance"
      Python
      2742370Updated Dec 25, 2025Dec 25, 2025
    • Documentation of Intern Robotics Platform & Toolkits
      Python
      6203Updated Dec 25, 2025Dec 25, 2025
    • GenManip

      Public
      [CVPR 2025] Official implementation of "GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation"
      Python
      313160Updated Dec 23, 2025Dec 23, 2025
    • AnySplat

      Public
      [SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views
      Python
      32660320Updated Dec 22, 2025Dec 22, 2025
    • CronusVLA

      Public
      [AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
      Python
      26700Updated Dec 21, 2025Dec 21, 2025
    • InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
      Python
      1932430Updated Dec 17, 2025Dec 17, 2025
    • JavaScript
      0000Updated Dec 10, 2025Dec 10, 2025
    • MeshCoder

      Public
      Jupyter Notebook
      2142280Updated Dec 8, 2025Dec 8, 2025
    • G2VLM

      Public
      G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
      Python
      424150Updated Nov 27, 2025Nov 27, 2025
    • Official implementation of EgoThinker at NIPS 2025
      Python
      02220Updated Nov 25, 2025Nov 25, 2025
    • EgoHOD

      Public
      Official implementation of EgoHOD at ICLR 2025; 14 EgoVis Challenge Winners in CVPR 2024
      Python
      23110Updated Nov 25, 2025Nov 25, 2025
    • [NIPS 2025] MV-CoLight: Efficient Object Compositing with Consistent Lighting and Shadow Generation
      Python
      21420Updated Nov 21, 2025Nov 21, 2025
    • HTML
      0100Updated Nov 20, 2025Nov 20, 2025
    • StreamVLN

      Public
      Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
      Python
      24364161Updated Nov 2, 2025Nov 2, 2025
    • Aether

      Public
      [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
      Python
      655500Updated Oct 26, 2025Oct 26, 2025
    • Astro
      0100Updated Oct 23, 2025Oct 23, 2025
    • [arxiv 2025] Official implementation of "Humanoid Goalkeeper: Learning from Position Conditioned Task-Motion Constraints"
      Python
      712900Updated Oct 22, 2025Oct 22, 2025
    • [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.
      Python
      620640Updated Oct 17, 2025Oct 17, 2025
    • AdaMimic

      Public
      [arxiv 2025] Official implementation of "Towards Adaptable Humanoid Control via Adaptive Motion Tracking"
      Python
      1118870Updated Oct 17, 2025Oct 17, 2025
    • .github

      Public
      4000Updated Oct 16, 2025Oct 16, 2025
    • An All-in-one robot manipulation learning suite for policy models training and evaluation on various datasets and benchmarks.
      Python
      1016760Updated Oct 15, 2025Oct 15, 2025
    • PhysHSI

      Public
      Official implementation of the paper: "PhysHSI: Towards a Real-World Generalizable and Natural Humanoid-Scene Interaction System"
      Python
      1422352Updated Oct 14, 2025Oct 14, 2025
    • A versatile, all-in-one toolbox for whole-body humanoid robot control.
      Python
      315510Updated Oct 10, 2025Oct 10, 2025
    • ARTDECO

      Public
      ARTDECO unifies 3D foundation priors with structured scene representations, enabling robust and generalizable 3D reconstruction of diverse real-world scenes using only monocular video.
      813900Updated Oct 10, 2025Oct 10, 2025