[misc] fix: harden parser validation and stabilize cross-platform test behavior by FightingZhen · Pull Request #56 · verl-project/rl-insight

FightingZhen · 2026-05-16T08:37:54Z

Summary

This PR improves path handling, parser robustness, and test coverage across data validation and profiling parsers. It also updates the quickstart profiling link to the current VeRL docs path.

Changes

rl_insight/data/rules.py: added shared path coercion, accepted Path inputs, and improved MSTX/NVTX validation errors for missing paths, empty files, and JSON parse failures.
rl_insight/parser/gmm_parser.py: normalized path parsing for rank/step extraction so GMM parsing works on Windows-style paths too.
rl_insight/parser/mstx_parser.py: extracted the legacy timestamp key helper and kept directory sorting stable.
rl_insight/parser/parser.py: tracked failed ranks during parallel parsing and added clearer logging when all ranks fail.
docs/overview/RL_Timeline_quickstart.md: fixed the NPU profiling tutorial link.
tests/data/test_paths.py, tests/data/test_data_checker.py, tests/data/test_rules.py: centralized shared test paths.
tests/doc/test_docs_urls.py: made URL checks more resilient by falling back to GET for blocked HEAD responses.
tests/parser/test_cluster_analysis.py: added coverage for legacy underscore-based MSTX directory ordering.
tests/parser/test_gmm_parser.py: added a cross-platform path parsing test for GMM inputs.

Tests

Added/updated unit tests for path coercion, parser path extraction, MSTX ordering, and docs URL validation.

…t stability

gemini-code-assist

Code Review

This pull request introduces cross-platform path handling for GMM parsing, improves error reporting for parallel processing, and transitions the GMM heatmap visualization from Matplotlib to Pillow. While the Pillow implementation addresses stability, feedback highlights that it currently lacks essential visualization elements such as axes labels, titles, and colorbars, and uses fixed cell dimensions that may cause memory issues with large traces. Additionally, a suggestion was made to reduce code duplication in the GMM parser by reusing helper methods for step extraction.

FightingZhen · 2026-05-16T09:17:40Z

/gemini review

gemini-code-assist

Code Review

This pull request enhances cross-platform path handling, improves error reporting for missing or malformed profiling data, and replaces matplotlib with PIL in the GmmVisualizer to better manage large heatmap outputs. Test infrastructure was also updated with centralized paths and improved URL validation. Reviewer feedback identifies an opportunity to remove redundant exception handling in path coercion and suggests dynamically calculating text buffer sizes in the visualizer to avoid potential clipping issues.

FightingZhen · 2026-05-16T10:31:35Z

/gemini review

gemini-code-assist

Code Review

This pull request enhances path handling, validation rules, and cross-platform compatibility across the rl_insight package. Key improvements include refactoring validation rules to support both string and Path objects, adding robust error tracking for parallel processing, and standardizing path extraction in the GMM parser. Test infrastructure was also reorganized for better maintainability. Feedback highlights a need to refine timestamp sorting for multi-day logs in the MSTX parser and to use context managers for streamed HTTP requests in documentation tests to avoid potential resource leaks.

FightingZhen · 2026-05-16T17:03:18Z

/gemini review

gemini-code-assist

Code Review

This pull request improves path handling robustness and cross-platform compatibility across the codebase. Key changes include the introduction of path coercion and normalization helpers in the validation rules and GMM parser, enhanced error reporting for parallel processing, and more resilient URL validation in tests. Feedback suggests expanding the use of the new path coercion helper to remaining validation rules and adopting more idiomatic pathlib methods, such as using .glob(), .stat().st_size, and .as_posix() for file operations and path formatting.

tardis-key · 2026-05-18T08:13:25Z

It seems the tests/doc/test_docs_urls.py failed to detect https://github.com/verl-project/verl/blob/main/docs/ascend_tutorial/profiling/ascend_profiling_zh.rst is invalid.

tardis-key

lgtm. Please pass all checks before merging

FightingZhen · 2026-05-18T12:58:19Z

/genimi review

FightingZhen · 2026-05-19T08:59:30Z

PR is ready for review @tardis-key

FightingZhen added 2 commits May 16, 2026 15:47

fix: improve parser robustness, cross-platform path handling, and tes…

5a5ed2f

…t stability

docs: refine RL timeline quickstart profiling link text

f294699

gemini-code-assist Bot reviewed May 16, 2026

View reviewed changes

Comment thread rl_insight/visualizer/gmm_visualizer.py Outdated

Comment thread rl_insight/visualizer/gmm_visualizer.py Outdated

Comment thread rl_insight/visualizer/gmm_visualizer.py Outdated

Comment thread rl_insight/parser/gmm_parser.py Outdated

fix: improve GMM parsing and restore scalable heatmap metadata

0d531fb

gemini-code-assist Bot reviewed May 16, 2026

View reviewed changes

Comment thread rl_insight/data/rules.py Outdated

Comment thread rl_insight/visualizer/gmm_visualizer.py Outdated

FightingZhen added 2 commits May 16, 2026 18:17

fix: restore matplotlib gmm visualizer

caa3bf4

fix: restore original gmm visualizer

4331ffa

gemini-code-assist Bot reviewed May 16, 2026

View reviewed changes

Comment thread rl_insight/parser/mstx_parser.py

Comment thread tests/doc/test_docs_urls.py

fix: improve MSTX ordering and harden docs URL validation

c0bbe16

gemini-code-assist Bot reviewed May 16, 2026

View reviewed changes

Comment thread rl_insight/data/rules.py

Comment thread rl_insight/data/rules.py Outdated

Comment thread rl_insight/data/rules.py Outdated

Comment thread rl_insight/parser/gmm_parser.py

fix: accept Path inputs in validators and normalize GMM paths

4202f91

tardis-key approved these changes May 18, 2026

View reviewed changes

FightingZhen changed the title ~~fix: harden parser validation and stabilize cross-platform test behavior~~ [misc] fix: harden parser validation and stabilize cross-platform test behavior May 18, 2026

fix: apply pre-commit cleanup for validator tests

e0d214a

FightingZhen added 2 commits May 19, 2026 15:44

update requirements.txt

4955015

fix ut error & update requirements

e820d0d

tardis-key mentioned this pull request May 19, 2026

tests/doc/test_docs_urls.py failed to detect https://github.com/verl-project/verl/blob/main/docs/ascend_tutorial/profiling/ascend_profiling_zh.rst is invalid. #57

Closed

tardis-key merged commit 70494be into verl-project:main May 19, 2026
5 checks passed

Conversation

FightingZhen commented May 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Tests

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

FightingZhen commented May 16, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

FightingZhen commented May 16, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

FightingZhen commented May 16, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tardis-key commented May 18, 2026

Uh oh!

tardis-key left a comment

Choose a reason for hiding this comment

Uh oh!

FightingZhen commented May 18, 2026

Uh oh!

FightingZhen commented May 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

FightingZhen commented May 16, 2026 •

edited

Loading