feat(deploy): add DeepSeek R1 MI300 data-parallel inference preset #12
Conversation
Only InferenceServiceTemplate resources will go into moai-inference-preset.
Pull request overview
This PR adds DeepSeek R1 inference presets for MI300X GPUs with data-parallel inference support, along with a collection of base vLLM data-parallel templates for the Moreh inference framework.
- Introduces DeepSeek R1 model configuration for both prefill and decode workers with MI300X-specific optimizations
- Adds reusable base templates for vLLM data-parallel deployments including core functionality, metadata labeling, offline HuggingFace hub support, and decode proxy configuration
- Organizes templates in a subdirectory structure (`base/` and `deepseek-r1/`) for better maintainability (see the launch sketch after this list)
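
As a rough illustration of what these presets wire together, here is a minimal, hypothetical sketch of a vLLM data-parallel worker container. The image reference, flag values, and resource names are assumptions inferred from the `dp8ep`/MI300X naming, not taken from the PR's actual templates:

```yaml
# Minimal sketch of a vLLM data-parallel worker container (assumed,
# not the PR's actual template). "dp8ep" is read here as data-parallel
# size 8 with expert parallelism enabled.
containers:
  - name: vllm-worker
    image: vllm/vllm-openai:latest        # hypothetical image reference
    command: ["vllm", "serve", "deepseek-ai/DeepSeek-R1"]
    args:
      - --data-parallel-size=8            # the "dp8" in the preset name
      - --enable-expert-parallel          # the "ep" in the preset name
    env:
      - name: HF_HUB_OFFLINE              # offline HuggingFace Hub mode
        value: "1"
    resources:
      limits:
        amd.com/gpu: 8                    # MI300X GPUs via the AMD device plugin
```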
Reviewed changes
Copilot reviewed 7 out of 7 changed files in this pull request and generated 7 comments.
| File | Description |
|---|---|
| deploy/helm/moai-inference-preset/templates/deepseek-r1/vllm-deepseek-r1-prefill-mi300x-dp8ep.helm.yaml | DeepSeek R1 prefill worker configuration with MI300X-specific environment variables and vLLM settings |
| deploy/helm/moai-inference-preset/templates/deepseek-r1/vllm-deepseek-r1-decode-mi300x-dp8ep.helm.yaml | DeepSeek R1 decode worker configuration with MI300X-specific environment variables and vLLM settings |
| deploy/helm/moai-inference-preset/templates/base/vllm-dp.helm.yaml | Base vLLM data-parallel template with complete pod specification, including container setup, health checks, and volume configuration |
| deploy/helm/moai-inference-preset/templates/base/vllm-dp-prefill-meta.helm.yaml | Metadata template for labeling prefill workers |
| deploy/helm/moai-inference-preset/templates/base/vllm-dp-hf-hub-offline.helm.yaml | Configuration for offline HuggingFace Hub usage with persistent volume mount |
| deploy/helm/moai-inference-preset/templates/base/vllm-dp-decode-proxy.helm.yaml | Decode worker proxy configuration with initialization container and port offset settings |
| deploy/helm/moai-inference-preset/templates/base/vllm-dp-decode-meta.helm.yaml | Metadata template for labeling decode workers |
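
For a sense of how the small base templates might compose, the following is a hypothetical sketch of a named template in the style of vllm-dp-prefill-meta.helm.yaml. The label keys are invented for illustration and do not come from this PR:

```yaml
{{/* Hypothetical sketch of a labeling helper in the style of
     vllm-dp-prefill-meta.helm.yaml; the label keys are assumptions. */}}
{{- define "moai-inference-preset.vllm-dp-prefill-meta" -}}
metadata:
  labels:
    app.kubernetes.io/component: prefill   # marks the worker role
    moreh.io/parallelism: data-parallel    # assumed label for DP workers
{{- end -}}
```

A model-specific preset could then pull such a helper in with `{{ include "moai-inference-preset.vllm-dp-prefill-meta" . }}`, keeping the role labeling in one place across all prefill workers.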
…eepseek-r1-prefill-mi300x-dp8ep.helm.yaml
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
For now, I've created folders inside the preset and put the files there; if any structural changes are needed, please leave a comment.