perf: More efficient calling frame detection by maximilianruesch · Pull Request #1884 · Quantinuum/guppylang

maximilianruesch · 2026-06-18T09:04:53Z

Using inspect.getmodule is overkill when all you need is the name of the module, which is also available through the module-level global __name__. In addition, inspect.getmodule accesses the file cache of the Python interpreter, potentially doing a lot of I/O. During tracing workloads, calling frame detection runs for most expressions so that error messages can point at the right place, which means inspect.getmodule is called in a rather hot loop. Any improvement here is vital for tracing performance.
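To illustrate the idea described above, here is a minimal sketch of the two ways to obtain a frame's module name. It is not the actual diff of this PR: the function names are hypothetical, and the only assumption is that the module name is read from the frame's globals instead of going through inspect.getmodule.

```python
import inspect
import sys
from types import FrameType
from typing import Optional


def module_name_via_inspect(frame: FrameType) -> Optional[str]:
    """Slower path: resolve the module object first, then read its name."""
    module = inspect.getmodule(frame)
    return module.__name__ if module is not None else None


def module_name_via_globals(frame: FrameType) -> Optional[str]:
    """Cheaper path: read `__name__` directly from the frame's globals dict."""
    return frame.f_globals.get("__name__")


def caller_module_name() -> Optional[str]:
    # Hypothetical helper: look at the frame of whoever called this function.
    # sys._getframe is CPython-specific; inspect.currentframe() is an alternative.
    frame = sys._getframe(1)
    return module_name_via_globals(frame)
```

The globals lookup is a single dictionary access on an object the frame already holds, so it avoids the module resolution work entirely. The two paths are not guaranteed to agree in every case: inspect.getmodule can return None for frames whose code is not backed by a file of a loaded module (for example, code run through exec), while the globals lookup only depends on `__name__` being present in the frame's globals.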

github-actions · 2026-06-18T09:06:22Z

Bencher Report

Branch	mr/perf/calling-frame-detection
Testbed	Linux

Benchmark	hugr_bytes result, bytes x 1e3 (Result Δ%)	hugr_bytes upper boundary, bytes x 1e3 (Limit %)	hugr_nodes result, nodes (Result Δ%)	hugr_nodes upper boundary, nodes (Limit %)
tests/benchmarks/test_big_array.py::test_big_array_compile	154.02 x 1e3 (0.00%) Baseline: 154.02 x 1e3	155.56 x 1e3 (99.01%)	6,630.00 (0.00%) Baseline: 6,630.00	6,696.30 (99.01%)
tests/benchmarks/test_ctrl_flow.py::test_many_ctrl_flow_compile	27.71 x 1e3 (0.00%) Baseline: 27.71 x 1e3	27.99 x 1e3 (99.01%)	1,051.00 (0.00%) Baseline: 1,051.00	1,061.51 (99.01%)
tests/benchmarks/test_queue_push_pop.py::test_queue_push_benchmark_compile	10.09 x 1e3 (0.00%) Baseline: 10.09 x 1e3	10.19 x 1e3 (99.01%)	301.00 (0.00%) Baseline: 301.00	304.01 (99.01%)
tests/benchmarks/test_queue_push_pop.py::test_queue_push_pop_benchmark_compile	13.69 x 1e3 (-0.01%) Baseline: 13.70 x 1e3	13.83 x 1e3 (99.00%)	420.00 (0.00%) Baseline: 420.00	424.20 (99.01%)

codecov-commenter · 2026-06-18T09:09:36Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 92.84%. Comparing base (e2c7014) to head (2c2ecb3).

@@           Coverage Diff           @@
##             main    #1884   +/-   ##
=======================================
  Coverage   92.84%   92.84%           
=======================================
  Files         146      146           
  Lines       13822    13822           
=======================================
  Hits        12833    12833           
  Misses        989      989

codspeed-hq · 2026-06-18T09:25:43Z

Merging this PR will improve performance by 17.56%

⚠️ Different runtime environments detected

Some benchmarks with significant performance changes were compared across different runtime environments, which may affect the accuracy of the results.

⚡ 1 improved benchmark
✅ 10 untouched benchmarks

Performance Changes

	Benchmark	`BASE`	`HEAD`	Efficiency
⚡	`test_circuit_comptime_compile`	1.7 s	1.5 s	+17.56%

Comparing mr/perf/calling-frame-detection (2c2ecb3) with main (e2c7014)

nicolaassolini-qntm

17% faster, nice

More efficient frame detection

2c2ecb3

maximilianruesch marked this pull request as ready for review June 18, 2026 09:27

maximilianruesch requested a review from a team as a code owner June 18, 2026 09:27

maximilianruesch requested review from acl-cqc and nicolaassolini-qntm and removed request for acl-cqc June 18, 2026 09:27

nicolaassolini-qntm approved these changes Jun 18, 2026

maximilianruesch added this pull request to the merge queue Jun 18, 2026

Merged via the queue into main with commit 487a8ac Jun 18, 2026
13 checks passed

maximilianruesch deleted the mr/perf/calling-frame-detection branch June 18, 2026 10:44

This was referenced Jun 18, 2026

chore: release guppylang-internals 1.0.0-a6 #1873 (Draft)

chore: release guppylang 1.0.0-a6 (internals 1.0) #1888 (Closed)

chore: release guppylang 1.0.0-a6 (internals 1.0) #1889 (Draft)

maximilianruesch merged 1 commit into main from mr/perf/calling-frame-detection

3 participants
