Update tensor.py -> fix severe memory leaking issue! by HectorHHZ · Pull Request #4 · llmsystem/llmsys_s24_hw3

HectorHHZ · 2025-02-16T03:53:01Z

The requires_grad_ function does not respect the input flag x. This means no matter I intend to require gradients or not, histories are preserved and gradients will be computed. This can lead to severe memory "leak" over the training process since the system never releases unnecessary tensors. bug fixed

HectorHHZ · 2025-02-16T03:58:25Z

the bug leads to the severe memory usage in assignment 3 and assignment 2, the RAM usage will easily get hundreds of gigabytes during training.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update tensor.py -> fix severe memory leaking issue!#4

Update tensor.py -> fix severe memory leaking issue!#4
HectorHHZ wants to merge 1 commit into
llmsystem:mainfrom
HectorHHZ:patch-5

HectorHHZ commented Feb 16, 2025

Uh oh!

HectorHHZ commented Feb 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

HectorHHZ commented Feb 16, 2025

Uh oh!

HectorHHZ commented Feb 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant