ultranationalism

ultranationalism synced commits to model-test-refactor at ultranationalism/diffusers from mirror

7b3ef42a01 update

23 hours ago

ultranationalism synced commits to pull-request/2738 at ultranationalism/Megatron-LM from mirror

9f3ad3f8af Merge remote-tracking branch 'main/main' into evict-from-context
4439396123 clean up.
0b8806e2d7 minimized evicted/active request swap.
72b925375c reverted default max_requests to total block count.
49e82b1c06 converted evict_block_count to evict_request_count.
Compare 14 commits »

1 day ago

ultranationalism synced commits to pull-request/2716 at ultranationalism/Megatron-LM from mirror

37e383d43c revert mtp tie output change that might have lead to error

1 day ago

ultranationalism synced commits to pull-request/2482 at ultranationalism/Megatron-LM from mirror

bdbe48792f Merge branch 'main' into robinz/capture_time_main
3cf7a63fe9 Update oncall for next few weeks (#2748)
ede9ae4e8f chore: rotate oncall schedule
876a04689b feat: manual registration mode for nccl-ub option when using megatron-fsdp (#2661) Signed-off-by: Youngeun Kwon <youngeunk@nvidia.com>
0a771229e4 Update sequence packing case when dummy PackedSeqParams are used (#2743)
Compare 28 commits »

1 day ago

ultranationalism synced commits to main at ultranationalism/Megatron-LM from mirror

dd7c9f4f69 Prep work for migrating to types from ModuleSpec (#2668)
3cf7a63fe9 Update oncall for next few weeks (#2748)
ede9ae4e8f chore: rotate oncall schedule
876a04689b feat: manual registration mode for nccl-ub option when using megatron-fsdp (#2661) Signed-off-by: Youngeun Kwon <youngeunk@nvidia.com>
0a771229e4 Update sequence packing case when dummy PackedSeqParams are used (#2743)
Compare 9 commits »

1 day ago

ultranationalism synced commits to dev at ultranationalism/Megatron-LM from mirror

1068d775d6 Revert "[Dev] Remove calculation of padding token in moe routing loss (#2121)" (#2747) Signed-off-by: Charlie Truong <chtruong@nvidia.com>
0b6714ec87 [Dev] Remove calculation of padding token in moe routing loss (#2121) Co-authored-by: Li Tao <lit@nvidia.com>
Compare 2 commits »

1 day ago

ultranationalism synced commits to main at ultranationalism/diffusers from mirror

f6b6a7181e Add z-image-omni-base implementation (#12857) * Add z-image-omni-base implementation * Merged into one transformer for Z-Image. * Fix bugs for controlnet after merging the main branch new feature. * Fix for auto_pipeline, Add Styling. * Refactor noise handling and modulation - Add select_per_token function for per-token value selection - Separate adaptive modulation logic - Cleanify t_noisy/clean variable naming - Move image_noise_mask handler from forward to pipeline * Styling & Formatting. * Rewrite code with more non-forward func & clean forward. 1.Change to one forward with shorter code with omni code (None). 2.Split out non-forward funcs: _build_unified_sequence, _prepare_sequence, patchify, pad. * Styling & Formatting. * Manual check fix-copies in controlnet, Add select_per_token, _patchify_image, _pad_with_ids; Styling. * Add Import in pipeline __init__.py. --------- Co-authored-by: Jerry Qilong Wu <xinglong.wql@alibaba-inc.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>

2 days ago

ultranationalism synced commits to pull-request/2685 at ultranationalism/Megatron-LM from mirror

592644541e Merge branch 'main' into stop_words
5b1ef07031 Add oncall to all new PRs (#2734)
8ea3b8d04a ci: Fix copy-pr-bot update (#2736) Signed-off-by: oliver könig <okoenig@nvidia.com>
1c67e7e093 Update copy-pr-bot.yaml [skip ci]
cfd980b4e7 Add options to improve data loader initialization time, especially at scale (#2445) Signed-off-by: Antoni-Joan Solergibert <asolergibert@nvidia.com> Co-authored-by: Deepak Narayanan <2724038+deepakn94@users.noreply.github.com>
Compare 40 commits »

3 days ago

ultranationalism synced commits to pull-request/2121 at ultranationalism/Megatron-LM from mirror

13676a9f7e fix linting
cae40958ae add UT in 1f1b & SP-S
Compare 2 commits »

3 days ago

ultranationalism synced commits to main at ultranationalism/Megatron-LM from mirror

5b1ef07031 Add oncall to all new PRs (#2734)

3 days ago

ultranationalism synced commits to dev at ultranationalism/Megatron-LM from mirror

46b550591a [Dev] Fix ep overlap missing final layernorm (#2691)

3 days ago

ultranationalism synced commits to main at ultranationalism/diffusers from mirror

52766e6a69 Use `T5Tokenizer` instead of `MT5Tokenizer` (removed in Transformers v5.0+) (#12877) Use `T5Tokenizer` instead of `MT5Tokenizer` Given that the `MT5Tokenizer` in `transformers` is just a "re-export" of `T5Tokenizer` as per https://github.com/huggingface/transformers/blob/v4.57.3/src/transformers/models/mt5/tokenization_mt5.py )on latest available stable Transformers i.e., v4.57.3), this commit updates the imports to point to `T5Tokenizer` instead, so that those still work with Transformers v5.0.0rc0 onwards.

3 days ago

ultranationalism synced commits to device-map-direct at ultranationalism/diffusers from mirror

c61e455ce7 Merge branch 'main' into device-map-direct
973a077c6a Cosmos Predict2.5 14b Conversion (#12863) 14b conversion
0c4f6c9cff Add `OvisImagePipeline` in `AUTO_TEXT2IMAGE_PIPELINES_MAPPING` (#12876)
262ce19bff Feature: Add Mambo-G Guidance as Guider (#12862) * Feature: Add Mambo-G Guidance to Qwen-Image Pipeline * change to guider implementation * fix copied code residual * Update src/diffusers/guiders/magnitude_aware_guidance.py * Apply style fixes --------- Co-authored-by: Pscgylotti <pscgylotti@github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
f7753b1bc8 more update in modular (#12560) * move node registry to mellon * up * fix * modula rpipeline update: filter out none for input_names, fix default blocks for pipe.init() and allow user pass additional kwargs_type in a dict * qwen modular refactor, unpack before decode * update mellon node config, adding* to required_inputs and required_model_inputs * modularpipeline.from_pretrained: error out if no config found * add a component_names property to modular blocks to be consistent! * flux image_encoder -> vae_encoder * controlnet_bundle * refator MellonNodeConfig MellonPipelineConfig * refactor & simplify mellon utils * vae_image_encoder -> vae_encoder * mellon config save keep key order * style + copies * add kwargs input for zimage
Compare 18 commits »

3 days ago

ultranationalism synced commits to master at ultranationalism/apex from mirror

3c98f93d1a [pre-commit.ci] pre-commit autoupdate (#1973)

3 days ago

ultranationalism synced commits to pull-request/2724 at ultranationalism/Megatron-LM from mirror

d1b016de44 Merge branch 'main' of github.com:wdykas/Megatron-LM into inference-memory-test
4193f3aa9b remove fp16 assert in moe_grouped_gemm & EP (#2495) Co-authored-by: Philip Petrakian <pgpetrak@gmail.com>
ddf691df57 [dcos] Update docs title to Megatron Core (#2729)
7f471d720b Fix UVM argument for RL (#2722)
703bc36727 [docs] Add developer section to docs (#2717)
Compare 6 commits »

3 days ago

ultranationalism synced commits to pull-request/2716 at ultranationalism/Megatron-LM from mirror

e25faa6361 Revert "[Dev] Use the latest Hybrid-EP (#2424)" This reverts commit e8a927578d0fdeb98db5d40ab7bdc81d123795f7.

3 days ago

ultranationalism synced commits to pull-request/2474 at ultranationalism/Megatron-LM from mirror

db50940d2c Fix copyright Signed-off-by: Keshav Santhanam <ksanthanam@nvidia.com>
ea0858e4b8 Address comments Signed-off-by: Keshav Santhanam <ksanthanam@nvidia.com>
ecb53d165b Merge remote-tracking branch 'upstream/main' into mamba_cuda_graphs
8ea3b8d04a ci: Fix copy-pr-bot update (#2736) Signed-off-by: oliver könig <okoenig@nvidia.com>
1c67e7e093 Update copy-pr-bot.yaml [skip ci]
Compare 11 commits »

3 days ago

ultranationalism synced commits to pull-request/2363 at ultranationalism/Megatron-LM from mirror

9d07525a47 fixed linting errors
c66109df1f solved merge conflicts
d732548e39 fixed some lintingand small bugs
d5a8dd2fea updated
4896a80db8 fixed more issues
Compare 14 commits »

3 days ago

ultranationalism synced commits to main at ultranationalism/Megatron-LM from mirror

8ea3b8d04a ci: Fix copy-pr-bot update (#2736) Signed-off-by: oliver könig <okoenig@nvidia.com>
1c67e7e093 Update copy-pr-bot.yaml [skip ci]
cfd980b4e7 Add options to improve data loader initialization time, especially at scale (#2445) Signed-off-by: Antoni-Joan Solergibert <asolergibert@nvidia.com> Co-authored-by: Deepak Narayanan <2724038+deepakn94@users.noreply.github.com>
a05766227d Improve ModelOpt paths & add more Nemotron/hybrid model support (#2131) Signed-off-by: Jennifer Chen <jennifchen@nvidia.com> Co-authored-by: Jared Casper <155158+jaredcasper@users.noreply.github.com>
4193f3aa9b remove fp16 assert in moe_grouped_gemm & EP (#2495) Co-authored-by: Philip Petrakian <pgpetrak@gmail.com>
Compare 5 commits »

3 days ago

ultranationalism synced commits to dev at ultranationalism/Megatron-LM from mirror

4665be4dec Revert "[Dev] Use the latest Hybrid-EP (#2424)" (#2732)
2b1fc70891 Revert "[dev] Add assertion for mxfp8 params without dp overlap (#2270)" This reverts commit 7968d5f98f8457297d4a73f96d8a086d84a8fa67.
Compare 2 commits »

3 days ago