Skip to content

Pull requests: aws-neuron/neuronx-distributed-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add YOLO26 object detection contrib model
#151 opened Apr 29, 2026 by jimburtoft Contributor Loading…
[contrib] Add MiMo-V2.5-Pro (Xiaomi, 384 experts MoE, FP8 on Trn2)
#150 opened Apr 29, 2026 by whn09 Loading…
13 of 14 tasks
Contrib: S3Diff one-step 4x super-resolution
#149 opened Apr 28, 2026 by jimburtoft Contributor Loading…
contrib: add MiMo-V2.5 (FP8 on Trn2)
#148 opened Apr 28, 2026 by whn09 Loading…
14 tasks done
Contrib: FLUX.1-lite-8B-alpha (native FLUX.1 compatibility)
#147 opened Apr 28, 2026 by jimburtoft Contributor Loading…
11 of 12 tasks
Add FLUX.2-klein-base-9B contrib model
#146 opened Apr 26, 2026 by jimburtoft Contributor Loading…
8 tasks done
Add Kimi-K2.5 multimodal contrib (1T MoE + MoonViT vision encoder)
#145 opened Apr 26, 2026 by jimburtoft Contributor Loading…
Add Sarvam-30B (sarvam_moe) contrib model
#144 opened Apr 24, 2026 by jimburtoft Contributor Loading…
11 of 14 tasks
Add GLM-5 (754B MoE) contrib model for trn2.48xlarge
#143 opened Apr 24, 2026 by jimburtoft Contributor Loading…
Add Shrutam-2 contrib model: multilingual Indic ASR on Neuron
#142 opened Apr 24, 2026 by jimburtoft Contributor Loading…
11 of 15 tasks
Add Qwen3.5-2B contrib model
#141 opened Apr 24, 2026 by jimburtoft Contributor Loading…
12 of 14 tasks
Contrib: Add Qwen3.6-27B (post-training update of Qwen3.5-27B)
#140 opened Apr 23, 2026 by jimburtoft Contributor Loading…
7 tasks done
Add sarvam-m contrib model (Mistral head_dim fix)
#139 opened Apr 23, 2026 by jimburtoft Contributor Loading…
13 of 14 tasks
[contrib] Add MiniMax-M2 (229B / ~10B active MoE, TP=64, EP=64)
#138 opened Apr 22, 2026 by whn09 Loading…
14 tasks done
[contrib] Add MiMo-V2-Flash (Xiaomi, TP=64, EP=64 MoE)
#137 opened Apr 22, 2026 by whn09 Loading…
16 tasks done
Fix NameError in HuggingFaceGenerationAdapter.prepare_inputs_for_generation
#136 opened Apr 22, 2026 by whn09 Loading…
1 task done
Add Ministral-3-14B-Instruct-2512 (Leanstral) contrib model
#134 opened Apr 21, 2026 by jimburtoft Contributor Loading…
contrib: Mixtral MoE (SDK 2.29) + Mistral-Small-4-119B-2603
#133 opened Apr 20, 2026 by jimburtoft Contributor Loading…
Add TKG-optimized contribs for 4 dense Mistral models
#132 opened Apr 18, 2026 by jimburtoft Contributor Loading…
Add Kimi-K2-Instruct-0905 contrib model (1T MoE on trn2.48xlarge)
#131 opened Apr 17, 2026 by jimburtoft Contributor Loading…
Add HunyuanVideo-1.5 contrib model
#130 opened Apr 13, 2026 by jimburtoft Contributor Loading…
11 of 14 tasks
Contrib: Add Qwen3.5-27B with hybrid DeltaNet + GQA architecture
#128 opened Apr 12, 2026 by jimburtoft Contributor Loading…
6 tasks done
Add ZAYA1-base contrib model (MoE with CCA attention)
#127 opened Apr 12, 2026 by jimburtoft Contributor Loading…
ProTip! Add no:assignee to see everything that’s not assigned.