Commits
Commit History
Commits on Sep 25, 2025
a9fd59a Add mistral small3 24B config and recipe (#14784)
5d586b3 Add community label bot (#14796)
Commits on Sep 24, 2025
aff66ae Disable blank Issues (#14788)
13fe7cb OneLogger Integration (#13437)
a35d2af fix: Use shutil.copy fallback to handle file metadata permission errors (#14639)
7d3df0f MXFP8 must only use E4M3 as dtype (#14793)
Commits on Sep 23, 2025
638d299 Update gpt_oss.py (#14706)
1fb69ac Remove export-deploy, automodel, and eval tutorials (#14790)
21a5bc4 ci: Automodel deprecation warning (#14787)
431fd11 Update prune-distill notebooks to Qwen3 + simplify + mmlu eval (#14785)
991e376 Replace texterrors with kaldialign library (#14775)
Commits on Sep 22, 2025
709da78 Adding bf16 Sortformer train and inference (#14627)
eb5426e Add transducer timestamps without alignments, timestamps to streaming (#14766)
Commits on Sep 20, 2025
8cfedd7 [Flux] Add cuda_graph_scope and cache images ids for full iteration cuda graph. (#14744)
d9a1c0a [Flux] Remove redundant host & device sync. (#14711)
Commits on Sep 19, 2025
20ed590 Support additional Slurm parameters (#14742)
57dc705 Fix Some Failures (#14763)
de90351 Data prediction objective for flow matching speech enhancement models (#14749)
7fc5144 Randomized shard slicing for tarred data (#14558)
Commits on Sep 18, 2025
a035e05 Use lhotse dataloader for ASR models to support in-manifest channel selection for multichannel recordings (#14586)
Commits on Sep 17, 2025
a961bf1 detach arg option for run scripts (#14722)
8f93234 remove env var (#14739)
910236f cast SE weights and activations to fp32 (#14743)
350ec2d imported get_moe_layer_wise_logging_tracker from megatron core moe_utils (#14694)
Commits on Sep 16, 2025
cf17ca0 feat: Compatibility modification of megatron-fsdp (#14593)
Commits on Sep 15, 2025
52bfd8a drop speech_llm example suite (#14683)
d2067cb Update ModelCommPGs API from megatron-core (#14578)
Commits on Sep 12, 2025
129573b Replace MegatronTokenizer with MegatronLegacyTokenizer (#14721)
Commits on Sep 11, 2025
8c6fd8b Remove artificial block to vortex fp8 TP (#14684)
Commits on Sep 10, 2025
2f7dc67 Update Reasoning-SFT.ipynb (#14716)
4e1a835 fp4 support (#14625)
6217032 add load-in-4bit param (#14636)
91dbc17 Add option for LoRA with Transformer Engine op fuser (#14411)
87f7882 Tutorial fix (#14699)
fea44d3 Bump modelopt to 0.35.0 and remove `safe_import("modelopt")` in llm collection (#14656)