Use GPUArrays caching allocator in train! by CarloLucibello · Pull Request #2665 · FluxML/Flux.jl

CarloLucibello · 2026-03-25T22:21:03Z

Summary

Wraps each `train!` iteration in `GPUArrays.@cached` so that temporary GPU allocations (gradients, intermediate activations) are pooled and reused across steps rather than freed and reallocated each time.
Creates one `AllocCache` per `train!` call and calls `unsafe_free!` on it after the loop for deterministic cleanup.
Adds GPUArrays as a direct dependency with compat "11.2" (the version that introduced `AllocCache`).
On CPU this is a no-op, because CPU arrays don't go through GPUArrays' allocation path.
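
The pattern described in the summary can be sketched roughly as follows. This is not the code merged in this PR: `train_cached!` is a hypothetical stand-alone helper, the loop body is a simplified guess at what `train!` does per iteration, and the `try`/`finally` is an illustrative choice (the summary only says `unsafe_free!` is called after the loop). The sketch assumes Flux and GPUArrays 11.2 or later are installed.

```julia
using Flux, GPUArrays

# Hypothetical helper illustrating the caching pattern; not Flux's actual train!.
function train_cached!(loss, model, data, opt_state)
    # One cache per call, shared by every iteration of the loop.
    cache = GPUArrays.AllocCache()
    try
        for (i, d) in enumerate(data)
            args = d isa Tuple ? d : (d,)
            # GPU arrays allocated inside this block are taken from, and
            # returned to, `cache` instead of being left to the GC.
            GPUArrays.@cached cache begin
                l, gs = Flux.withgradient(m -> loss(m, args...), model)
                isfinite(l) || throw(DomainError(l, "Loss is $l on data item $i, stopping training"))
                Flux.update!(opt_state, model, gs[1])
            end
        end
    finally
        # Release the pooled memory deterministically, even if the loop throws.
        GPUArrays.unsafe_free!(cache)
    end
    return model
end

# Usage on CPU, where the cache is never populated and the wrapper has no effect.
model = Dense(2 => 1)
opt_state = Flux.setup(Adam(1f-2), model)
data = [(rand(Float32, 2, 8), rand(Float32, 1, 8)) for _ in 1:4]
train_cached!((m, x, y) -> Flux.mse(m(x), y), model, data, opt_state)
```

Arrays that must outlive an iteration (the model parameters and optimiser state) are created outside the `@cached` block, so only per-step temporaries end up in the pool.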

Test plan

Existing train! tests pass on CPU
GPU training shows stable memory usage across iterations (no GC spikes)
DomainError on non-finite loss still works correctly

🤖 Generated with Claude Code

Use GPUArrays caching allocator in train!

Wraps each training iteration in `GPUArrays.@cached` so temporary GPU allocations (gradients, activations) are pooled and reused across steps instead of being freed and reallocated each iteration, reducing GC pressure.

Closes #2636

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

cleanup

cleanup

CarloLucibello force-pushed the cl/caching-allocator branch from 8ab3fa7 to e4c6bed Compare March 25, 2026 22:23

CarloLucibello merged commit cec0db7 into master Mar 26, 2026
4 of 9 checks passed

CarloLucibello merged 1 commit into master from cl/caching-allocator
