Report ZGC individual collections instead of cycles (#445) by akiselev98 · Pull Request #447 · microsoft/gctoolkit

akiselev98 · 2025-06-11T17:41:25Z

Previously, the ZGC parser would attempt to construct a single event for each minor/major cycle, with each minor cycle containing a single young collection, and each major cycle containing a single young and a single old collection.

This created a few problems.

First, the implementation of the parser assumed that only one cycle could be active at any given time. As a result, any time a minor cycle started before the previous major cycle was finished, all data collected about the major cycle would be lost.

Second, the data model for a MajorZGCCycle assumed there could be only one young collection per cycle. This is not the case. There can be (and frequently are) multiple young collections in a major cycle.

Even if the implementation and datamodel were fixed to account for the concurrent nature of generational ZGC, grouping collections by cycle would still create API usability problems. Individual collections within a cycle independently report various JVM/heap level metrics at specific points in time. Within a single major cycle, those points in time may be very far apart, and overlap with a large number of minor cycles. If an end user wished to construct a timeline for a particular metric (for example, heap occupancy), they would need to traverse all the reported cycles, extract the metrics from the individual collections, and put them in order.

To address the above issues, the parser and data models are refactored to report each individual collection (young, old, or full) immediately once they are completed instead of waiting until the parent cycle is done. The parser maintains 3 independent forward references (one for each collection type: old, young, and full). As a result, the parser will not lose state even when lines from an old collection are intermixed with lines from a young collection.

Previously, the ZGC parser would attempt to construct a single event for each minor/major cycle, with each minor cycle containing a single young collection, and each major cycle containing a single young and a single old collection. This created a few problems. First, the implementation of the parser assumed that only one cycle could be active at any given time. As a result, any time a minor cycle started before the previous major cycle was finished, all data collected about the major cycle would be lost. Second, the data model for a MajorZGCCycle assumed there could be only one young collection per cycle. This is not the case. There can be (and frequently are) multiple young collections in a major cycle. Even if the implementation and datamodel were fixed to account for the concurrent nature of generational ZGC, grouping collections by cycle would still create API usability problems. Individual collections within a cycle independently report various JVM/heap level metrics at specific points in time. Within a single major cycle, those points in time may be very far apart, and overlap with a large number of minor cycles. If an end user wished to construct a timeline for a particular metric (for example, heap occupancy), they would need to traverse all the reported cycles, extract the metrics from the individual collections, and put them in order. To address the above issues, the parser and data models are refactored to report each individual collection (young, old, or full) immediately once they are completed instead of waiting until the parent cycle is done. The parser maintains 3 independent forward references (one for each collection type: old, young, and full). As a result, the parser will not lose state even when lines from an old collection are intermixed with lines from a young collection.

akiselev98 · 2025-06-11T17:44:08Z

@microsoft-github-policy-service agree company="IMC Markets N.A."

akiselev98 · 2025-06-11T17:47:24Z

There's still a few minor naming inconsistencies I'd like to resolve (i.e. ZGCCollectionType could be more accurately described as ZGCCycleType, and ZGCPhase could be more accurately described as ZGCCollectionType), but figured I could save that for a second pass.

karianna

This looks clean to me overall

karianna · 2025-06-16T22:37:39Z

@dsgrieve / @kcpeppe - in case you wanted to comment - else I'll merge this in 42-48 hours.

dsgrieve

LGTM. This change does break API.

karianna · 2025-06-16T23:30:43Z

LGTM. This change does break API.

Fair, would require version bump if we're being semantic

Fix mismatched GC type label

b206ade

karianna previously approved these changes Jun 13, 2025

View reviewed changes

Comment thread api/src/main/java/com/microsoft/gctoolkit/jvm/SupportedFlags.java Outdated

akiselev98 added 2 commits June 16, 2025 13:48

Rename ZGCCollectionType -> ZGCCycleType

a89f4ef

Fix inconsistent whitespace

fb58a04

akiselev98 dismissed karianna’s stale review via fb58a04 June 16, 2025 18:49

dsgrieve reviewed Jun 16, 2025

View reviewed changes

karianna previously approved these changes Jun 16, 2025

View reviewed changes

Comment thread parser/src/test/java/com/microsoft/gctoolkit/parser/ZGCParserTest.java Outdated

Use explicit imports in ZGCParserTest

96bbb9a

akiselev98 dismissed karianna’s stale review via 96bbb9a June 17, 2025 13:41

karianna approved these changes Jun 23, 2025

View reviewed changes

karianna merged commit 54e339e into microsoft:main Jun 23, 2025
8 checks passed

karianna mentioned this pull request Jul 23, 2025

updates deps 23 jul 2025 #450

Closed

fthevenet mentioned this pull request Oct 7, 2025

Refactored parts of the new ZGC parsing API to improve consistency #455

Merged

This was referenced Feb 26, 2026

ZGC parsing refactoring #439

Open

gen-z parser #278

Open

parse rules for gen-Z support #276

Open

gen-z support in GCToolKit #280

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Report ZGC individual collections instead of cycles (#445)#447

Report ZGC individual collections instead of cycles (#445)#447
karianna merged 5 commits intomicrosoft:mainfrom
akiselev98:main

akiselev98 commented Jun 11, 2025

Uh oh!

akiselev98 commented Jun 11, 2025

Uh oh!

akiselev98 commented Jun 11, 2025

Uh oh!

karianna left a comment

Uh oh!

Uh oh!

karianna commented Jun 16, 2025

Uh oh!

dsgrieve left a comment

Uh oh!

karianna commented Jun 16, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

akiselev98 commented Jun 11, 2025

Uh oh!

akiselev98 commented Jun 11, 2025

Uh oh!

akiselev98 commented Jun 11, 2025

Uh oh!

karianna left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

karianna commented Jun 16, 2025

Uh oh!

dsgrieve left a comment

Choose a reason for hiding this comment

Uh oh!

karianna commented Jun 16, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants