feat(profiling): add memory theory comparison and Mosaic analysis by Gklajer · Pull Request #4 · the-tuning-machine/StelLA_Project

Gklajer · 2026-03-22T21:11:50Z

Summary

This PR adds a reproducible workflow for comparing theoretical memory costs with measured runtime behavior for dense and frozen-LoRA linear layers.

It introduces a shared memory experiment module, wires that logic into the profiling script, and adds Mosaic-based visualization output plus report updates so the results can be inspected and documented end to end.

Changes

add stellatscale.memory_experiment for experiment configuration, theoretical summaries, tolerance checks, and comparison utilities
update the profiling workflow to generate theory-vs-measurement reports from the same experiment definitions
add Mosaic memory analysis and plotting support for the single-layer LoRA study
expose the new memory experiment surface through the package API
add tests covering the memory experiment logic
update the report and output layout to reflect the new profiling artifacts
update dependency lockfiles and tooling config needed by the profiling workflow

Why

The branch moves the memory analysis from one-off profiling code to a codified experiment pipeline. That makes the comparisons easier to reproduce, validate, and reuse in the report.

Validation

added automated tests for the memory experiment module
exercised the profiling/report generation workflow on the branch outputs

Copilot

Pull request overview

Adds a reusable “memory experiment” module that formalizes theoretical GPU-memory accounting and explicit theory-vs-measurement comparisons, then refactors the LoRA profiling script to use it and emit a dedicated comparison report.

Changes:

Introduces stellatscale.memory_experiment with shared config, theoretical summaries, measured-summary parsing, and comparison report objects.
Adds a profiling script (scripts/lora_memory_analysis.py) that runs dense vs frozen-LoRA profiling, parses Mosaic output, and writes comparison + theory comparison reports.
Adds focused tests covering dense/frozen-LoRA accounting and comparison-report behavior; updates dependency groups/lockfile for Mosaic + profiling.

Reviewed changes

Copilot reviewed 5 out of 6 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
uv.lock	Adds Mosaic/profiling dependency resolutions and markers.
pyproject.toml	Pins Mosaic source and defines `mosaic` / `profiling` dependency groups.
src/stellatscale/memory_experiment.py	New reusable theory + measurement parsing + comparison-report module.
src/stellatscale/init.py	Re-exports memory experiment public API at package top-level.
scripts/lora_memory_analysis.py	New end-to-end profiling + Mosaic analysis + theory comparison report generator.
tests/test_memory_experiment.py	New tests for accounting correctness, frozen optimizer scoping, and comparison output behavior.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-22T21:15:11Z

+    schedule_factory = cast("Any", schedule)
+    schedule_phase_key = "warm" + chr(117) + "p"
+    profile_schedule = schedule_factory(
+        wait=0, active=EXPERIMENT_CONFIG.steps, repeat=1, **{schedule_phase_key: 0}


The profiler schedule setup is intentionally obfuscated (schedule_phase_key = "warm" + chr(117) + "p") and schedule is cast to Any to accept that key. This makes the profiling behavior hard to audit and removes type safety. Prefer calling torch.profiler.schedule directly with the explicit warmup=0 argument (and remove the Any cast) so future readers can understand the schedule and static checks still apply.

Suggested change

schedule_factory = cast("Any", schedule)

schedule_phase_key = "warm" + chr(117) + "p"

profile_schedule = schedule_factory(

wait=0, active=EXPERIMENT_CONFIG.steps, repeat=1, **{schedule_phase_key: 0}

profile_schedule = schedule(

wait=0,

warmup=0,

active=EXPERIMENT_CONFIG.steps,

repeat=1,

Copilot · 2026-03-22T21:15:11Z

+        for _ in range(EXPERIMENT_CONFIG.steps):
+            profiler.step()
+


profiler.step() is called at the start of each iteration. In typical torch.profiler.profile usage, step() should be called at the end of the iteration to delimit the just-recorded work. As written, the first step may be empty and the last forward/backward/optimizer block may never be closed/recorded, skewing traces and memory attribution.

Copilot AI review requested due to automatic review settings March 22, 2026 21:11

Copilot started reviewing on behalf of Gklajer March 22, 2026 21:12 View session

Copilot AI reviewed Mar 22, 2026

View reviewed changes

Gklajer force-pushed the feat/memory-theory-comparison branch 2 times, most recently from 067b7d2 to c62e3e6 Compare March 26, 2026 08:52

Gklajer added 4 commits March 28, 2026 13:38

feat(profiling): add Mosaic memory analysis workflow

a3ea1a2

feat(profiling): codify memory theory comparisons

8eb6c09

refactor(profilling): reorganize output directory structure

41dbe2e

feat(profiling): add Mosaic memory analysis and report figure

97463f1

Gklajer force-pushed the feat/memory-theory-comparison branch from fa71c54 to 97463f1 Compare March 28, 2026 13:39

Gklajer changed the title ~~feat(profiling): codify memory theory comparisons~~ feat(profiling): add memory theory comparison and Mosaic analysis Mar 28, 2026

Gklajer merged commit 6394bf3 into the-tuning-machine:main Mar 28, 2026
5 checks passed

Gklajer deleted the feat/memory-theory-comparison branch March 28, 2026 13:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(profiling): add memory theory comparison and Mosaic analysis#4

feat(profiling): add memory theory comparison and Mosaic analysis#4
Gklajer merged 4 commits into
the-tuning-machine:mainfrom
Gklajer:feat/memory-theory-comparison

Gklajer commented Mar 22, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 22, 2026

Uh oh!

Copilot AI Mar 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Gklajer commented Mar 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Why

Validation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 22, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 22, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Gklajer commented Mar 22, 2026 •

edited

Loading