[docker, CI] use radixark/Megatron-LM' and allow specify sglang/megatron PR in CI by yueming-yuan · Pull Request #620 · radixark/miles

yueming-yuan · 2026-02-19T04:58:37Z

ci-megatron-pr: #11
ci-sglang-pr: sglang-miles

gemini-code-assist · 2026-02-19T04:58:56Z

Summary of Changes

Hello @yueming-yuan, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request refactors the Dockerfile responsible for setting up the development environment. The primary goal is to move away from maintaining and applying a custom patch for Megatron-LM, instead opting for direct integration with a specific branch of an external repository. This change simplifies dependency management by utilizing pre-built wheels and standardizing the installation of various deep learning libraries, leading to a more robust and easier-to-maintain build process.

Highlights

Megatron-LM Integration Strategy: The integration of Megatron-LM has been updated to directly use the miles-main branch from the radixark/Megatron-LM repository, replacing the previous method of applying a local megatron.patch file.
Streamlined Dependency Management: The Dockerfile now leverages pre-built Python wheels from a specified GitHub repository (yueming-yuan/miles-wheels), simplifying the installation process for several dependencies like flash-attn, TransformerEngine, and apex.
Improved CUDA 13 Support: Conditional logic for CUDA 13 installations has been refined, particularly for TransformerEngine and nvidia-cudnn, ensuring more straightforward and version-specific dependency handling.
Dockerfile Clean-up: Removed outdated patch application steps and conditional build logic, resulting in a cleaner and more maintainable Dockerfile.

Changelog

docker/Dockerfile.dev
- Added comments detailing different image build configurations for radixark/miles.
- Updated build arguments to specify MEGATRON_REPO, MEGATRON_BRANCH, and WHEELS_REPO.
- Introduced a new step to download pre-built Python wheels from the WHEELS_REPO.
- Simplified installation commands for flash-attn, flash_attn_3, TransformerEngine, and apex to use the newly downloaded pre-built wheels.
- Modified the Megatron-LM cloning process to use the specified MEGATRON_REPO and MEGATRON_BRANCH.
- Removed the explicit megatron.patch application logic.
- Removed conditional Triton and sgl-kernel installations.
- Adjusted nvidia-cudnn installation to be conditional on ENABLE_CUDA_13.
docker/patch/dev/megatron.patch
- Removed the megatron.patch file, as its functionality is now integrated via direct repository usage.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request significantly simplifies the Dockerfile by replacing the megatron.patch with a direct dependency on a forked radixark/Megatron-LM repository. This is a great improvement for maintainability. The changes also refactor the installation of Python dependencies, using pre-built wheels more effectively and cleaning up the logic for different CUDA versions. The new method for downloading all wheels from a GitHub release in one step is particularly clever. I have one suggestion to improve the readability and robustness of the Python script used for downloading the wheels.

gemini-code-assist · 2026-02-19T05:05:42Z

docker/Dockerfile.dev

+    | python3 -c "import sys, json, subprocess; \
+[subprocess.run(['curl', '-fSL', '-o', '/tmp/wheels/' + a['name'], a['browser_download_url']], check=True) \
+ for a in json.load(sys.stdin)['assets'] if a['name'].endswith('.whl')]" && \


This Python one-liner is a bit dense and has a couple of potential issues:

It will fail with a KeyError if the GitHub API response for the release does not contain an assets key. Using .get('assets', []) would be more robust.

Using a list comprehension for its side effects (calling subprocess.run) is not idiomatic Python. A for loop is more explicit and readable.

Consider refactoring this into a multi-line script within the RUN command for better readability and maintainability, for example:

import sys, json, subprocess release_data = json.load(sys.stdin) for asset in release_data.get('assets', []): if asset['name'].endswith('.whl'): url = asset['browser_download_url'] filename = '/tmp/wheels/' + asset['name'] print(f"Downloading {url} to {filename}") subprocess.run(['curl', '-fSL', '-o', filename, url], check=True)

# Conflicts: # docker/Dockerfile.dev # docker/patch/dev/megatron.patch

yueming-yuan added 5 commits February 18, 2026 12:24

update dockerfile to support gb300 wheels

9b4242f

add comment annotation

6a7c900

update cmt

f59b5a6

update

0985960

use megatron fork to replace megatron patch

6a949d0

gemini-code-assist bot reviewed Feb 19, 2026

View reviewed changes

[ci] allow specify megatron/sglang branch in ci test

aeaecf6

yueming-yuan requested a review from yushengsu-thu as a code owner February 19, 2026 05:24

yueming-yuan changed the title ~~[docker] use radixark/Megatron-LM's miles-main to replace megatron.patch~~ [docker, CI] use radixark/Megatron-LM' and allow specify sglang/megatron branch in CI Feb 19, 2026

yueming-yuan changed the title ~~[docker, CI] use radixark/Megatron-LM' and allow specify sglang/megatron branch in CI~~ [docker, CI] use radixark/Megatron-LM' and allow specify sglang/megatron PR in CI Feb 19, 2026

[ci] support use pr#

6680fad

yueming-yuan added the run-ci-short label Feb 19, 2026

yushengsu-thu approved these changes Feb 20, 2026

View reviewed changes

fix

536c3b0

yueming-yuan added run-ci-short and removed run-ci-short labels Feb 23, 2026

yueming-yuan added 3 commits February 22, 2026 16:12

Merge remote-tracking branch 'origin/main' into docker/megatron-fork

3e062a8

# Conflicts: # docker/Dockerfile.dev # docker/patch/dev/megatron.patch

minor

ab0b6a6

fix

982f621

yueming-yuan added run-ci-short and removed run-ci-short labels Feb 24, 2026

Merge branch 'main' into docker/megatron-fork

338ca8c

yueming-yuan added run-ci-image and removed run-ci-short run-ci-image labels Feb 24, 2026

Merge branch 'main' into docker/megatron-fork

c3b0138

yueming-yuan merged commit 4ebc75e into main Feb 25, 2026
63 of 65 checks passed

yueming-yuan deleted the docker/megatron-fork branch February 25, 2026 04:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

[docker, CI] use radixark/Megatron-LM' and allow specify sglang/megatron PR in CI#620

[docker, CI] use radixark/Megatron-LM' and allow specify sglang/megatron PR in CI#620
yueming-yuan merged 13 commits intomainfrom
docker/megatron-fork

yueming-yuan commented Feb 19, 2026 •

edited

Loading

Uh oh!

gemini-code-assist bot commented Feb 19, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Feb 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

yueming-yuan commented Feb 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot commented Feb 19, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yueming-yuan commented Feb 19, 2026 •

edited

Loading