arXiv: https://arxiv.org/abs/2511.19426
- Release SAM3Agent + SAM3D code
- Release quantitative evaluation code and results for Ref-SAM3D
- Update the paper
conda env create -f environments/default.yml
conda activate ref-sam3d
# for pytorch/cuda dependencies
export PIP_EXTRA_INDEX_URL="https://pypi.ngc.nvidia.com https://download.pytorch.org/whl/cu121"
# install sam3d-objects and core dependencies
pip install -e '.[dev]'
pip install -e '.[p3d]' # pytorch3d's dependency on pytorch is broken; this two-step install works around it
# for inference
export PIP_FIND_LINKS="https://nvidia-kaolin.s3.us-east-2.amazonaws.com/torch-2.5.1_cu121.html"
pip install -e '.[inference]'
# patch things that aren't yet in official pip packages
./patching/hydra # https://github.com/facebookresearch/hydra/pull/2863
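A quick sanity check that the core dependencies are importable (this check is our own suggestion, not part of the official setup):
# verify that torch, pytorch3d and kaolin import and that CUDA is visible
python -c "import torch, pytorch3d, kaolin; print(torch.__version__, torch.cuda.is_available())"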
Download the SAM 3D Objects checkpoints from Hugging Face (log in with hf auth login after generating an access token).
pip install 'huggingface-hub[cli]<1.0'
TAG=hf
hf download \
--repo-type model \
--local-dir checkpoints/${TAG}-download \
--max-workers 1 \
facebook/sam-3d-objects
mv checkpoints/${TAG}-download/checkpoints checkpoints/${TAG}
rm -rf checkpoints/${TAG}-download
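The checkpoints should now be under checkpoints/hf; a quick listing to confirm (purely a sanity check):
# list the downloaded SAM 3D Objects checkpoint files
ls checkpoints/${TAG}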
Next, install SAM 3 (this may also require logging in with hf auth login after generating an access token).
git clone https://github.com/facebookresearch/sam3.git
cd sam3
pip install -e .
pip install -e ".[notebooks]"
This step is only required if you are serving the model with vLLM; skip it if you are calling the LLM through an API such as Gemini or GPT.
- Install vLLM (in a separate conda env from SAM 3 to avoid dependency conflicts).
conda create -n vllm python=3.12
pip install vllm --extra-index-url https://download.pytorch.org/whl/cu128
- Start the vLLM server on the same machine.
conda activate vllm # activate vllm conda env in another terminal
vllm serve Qwen/Qwen3-VL-4B-Instruct-FP8 --tensor-parallel-size 1 --allowed-local-media-path / --max-model-len 65536 --enforce-eager --port 8001 --gpu-memory-utilization 0.4
You can control vLLM's GPU memory usage via the max-model-len and gpu-memory-utilization parameters.
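For example, a lower-memory configuration could look like the following; the reduced context length and memory fraction are illustrative values, not recommendations:
# smaller KV-cache budget and GPU memory fraction than the default command above
vllm serve Qwen/Qwen3-VL-4B-Instruct-FP8 --tensor-parallel-size 1 --allowed-local-media-path / --max-model-len 32768 --enforce-eager --port 8001 --gpu-memory-utilization 0.3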
python inference_pipeline.py --image_path <path_to_image_file> --prompt "<your_text_prompt>" --model <llm_model_name> --llm_server_url <llm_server_url>
The model's predictions and 3D reconstruction results will be saved in the output/ directory with the following structure:
output/
└── <image_name>/
    └── <prompt_text>/
        ├── splat.ply
        ├── mask_xxx.png
        ├── agent_debug_out/
        ├── none_out/
        ├── sam_out/
        └── ...
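For instance, a concrete run against the vLLM server started above might look like this; the image path, prompt, and exact server URL format are illustrative assumptions:
# example invocation with a local image and a free-form text prompt
python inference_pipeline.py --image_path examples/living_room.jpg --prompt "the red chair next to the window" --model Qwen/Qwen3-VL-4B-Instruct-FP8 --llm_server_url http://localhost:8001/v1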
To visualize the results, run the following command. This will launch a local Gradio interface where you can view the 3D model:
python vis.py --ply_path <path_to_ply_file>
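For example, pointing it at the splat from the output tree above:
# visualize the reconstructed 3D splat for a given image/prompt pair
python vis.py --ply_path output/<image_name>/<prompt_text>/splat.ply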
If you find this work useful, we would greatly appreciate your citation:
@article{zhou2025ref,
title={Ref-SAM3D: Bridging SAM3D with Text for Reference 3D Reconstruction},
author={Zhou, Yun and Wang, Yaoting and Jie, Guangquan and Liu, Jinyu and Ding, Henghui},
journal={arXiv preprint arXiv:2511.19426},
year={2025}
}