Open
Conversation
distmat() minimized memory allocation, added option to remove grad kernelmat() minimized mem alloc by using in place ops, fixed devices, tensor on input device == output device, hsic_normalized_cca() reuse temp tensors
Owner
|
oh T-H-A-N-K-S! |
Author
|
no worry. I only touched 3 functions - that I needed. if you really want to
speed it up you could write it in cuda and save yourself allocation of all
those eye matrices, and since much of the code results in symmetric
matrices, probably need half or even less computation.
alternatively, I just saw a sparse cuda matrix library -
https://github.com/huggingface/pytorch_block_sparse
last but not least as a general speed optimization with pytorch, therse a
nice tutorial from nvidia, https://www.youtube.com/watch?v=9mS1fIYj1So
but thats a general reference. Im more interested in the idea that hsic()
or another type of information measure could replace end to end backprop. -
OR be used in combination with... thanks for the work.
…On Tue, Sep 8, 2020 at 7:14 PM Leopad ***@***.***> wrote:
oh T-H-A-N-K-S!
I'll spend days after work to test your code. I'll confirm with you
shortly.
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#6 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AJYHPOWML7CG5XYD5UKSILLSE3QJXANCNFSM4Q42PU2A>
.
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Ok, this is the correct branch to create pull request -- same coment as the one i closed. this one has only one file.
Hi Kurt, I used 3 functions from hsic and noticed a few things