Skip to content

Pull requests: vllm-project/tpu-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP] [Torchax] fp8 quantization skeleton
#1307 opened Dec 14, 2025 by xingliu14 Loading…
Allow pytest to correctly discover all tests
#1303 opened Dec 12, 2025 by wdhongtw Loading…
Add Quantized Weights Support for MoE Layers ready ONLY add when PR is ready to merge/full CI is needed
#1300 opened Dec 12, 2025 by kyuyeunk Loading…
[do not merge ]Get all change files instead of last commit when bootstrap. ready ONLY add when PR is ready to merge/full CI is needed
#1299 opened Dec 12, 2025 by QiliangCui Loading…
[test do not review] ready ONLY add when PR is ready to merge/full CI is needed
#1298 opened Dec 12, 2025 by QiliangCui Loading…
Add dummy placeholder for unsupported models in the support matrix ready ONLY add when PR is ready to merge/full CI is needed
#1291 opened Dec 12, 2025 by boe20211 Loading…
[DRAFT] [DP][Bugfix] Fix bad sharding in non_dp case.
#1288 opened Dec 12, 2025 by py4 Loading…
[Kernel][Misc] Remove jax.named_scope ready ONLY add when PR is ready to merge/full CI is needed
#1278 opened Dec 10, 2025 by kyuyeunk Loading…
[do not review][do not submit] ready ONLY add when PR is ready to merge/full CI is needed
#1277 opened Dec 10, 2025 by QiliangCui Loading…
Move the If nightly==1 check out of command.
#1276 opened Dec 10, 2025 by QiliangCui Loading…
add new kernel and quantization support matrices
#1275 opened Dec 10, 2025 by boe20211 Loading…
docs: update support matrices and improve visuals
#1250 opened Dec 5, 2025 by RobMulla Loading…
Avoid installing CUDA related stuff ready ONLY add when PR is ready to merge/full CI is needed
#1246 opened Dec 4, 2025 by wdhongtw Loading…
Add workflow to build vLLM-TPU wheel using PyPI tpu-inference ready ONLY add when PR is ready to merge/full CI is needed
#1241 opened Dec 4, 2025 by ylangtsou Draft
[CI] Fix awq dtype ready ONLY add when PR is ready to merge/full CI is needed
#1220 opened Dec 2, 2025 by kyuyeunk Loading…
[Oncall] update the SchedulerConfig interface
#1219 opened Dec 2, 2025 by bzgoogle Loading…
Add a SP e2e test.
#1209 opened Dec 2, 2025 by vanbasten23 Loading…
Save size in scalar scratch for bo and bq ready ONLY add when PR is ready to merge/full CI is needed
#1201 opened Dec 1, 2025 by rupengliu-meta Loading…
ProTip! Updated in the last three days: updated:>2025-12-10.