Pull requests: SemiAnalysisAI/InferenceX
feat: MiniMax M2.5 PD Disaggregation Recipe (1P2D, MoRI-EP + MoRI-IO)
#999
opened Apr 2, 2026 by ChuanLi1101 • Draft
4 tasks
feat: MI300X disaggregated inference with Broadcom IBGDA (#982)
sweep-enabled
#998
opened Apr 2, 2026 by JordanNanos
Add Qwen3.5 MXFP4 single-node MI355X SGLang benchmark (TP4)
#994
opened Apr 1, 2026 by adibarra
[code not in mergable state yet] Add MI325X DeepSeek-R1 FP8 disaggregated inference with Broadcom Thor 2 IBGDA
#985
opened Mar 31, 2026 by JordanNanos • Draft
2 of 8 tasks
[WIP] B200 Minimax FP8 vllm upgrade
NVIDIA
sweep-enabled
#947
opened Mar 26, 2026 by kedarpotdar-nv
fix: multi-turn benchmark hangs after all clients finish
#908
opened Mar 13, 2026 by lishicheng1996-nv
3 of 4 tasks
[NV - WIP] Qwen3.5 B200 SGLang FP4 configs
NVIDIA
sweep-enabled
#820
opened Feb 27, 2026 by kedarpotdar-nv
Performance Improvements for MI300X with GEMM and FP8 Enhancements
#811
opened Feb 26, 2026 by chunfangamd