You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Aug 1, 2024. It is now read-only.
Dear authors, thanks for this work! It is indeed a very efficient and simple architecture. If you could help me understand the following question, it would be immensely useful.
You have not provided motivations for why there is no class token. Is this because you want to use the masking of only the image patches, and therefore do not want to introduce class (or even register) tokens?
(I'm currently adapting some of this code to my workflow, and in my case, I am using a class token and four register tokens. The training is running so we shall see if there is any sense to the calculations.)
Dear authors, thanks for this work! It is indeed a very efficient and simple architecture. If you could help me understand the following question, it would be immensely useful.
You have not provided motivations for why there is no class token. Is this because you want to use the masking of only the image patches, and therefore do not want to introduce class (or even register) tokens?
(I'm currently adapting some of this code to my workflow, and in my case, I am using a class token and four register tokens. The training is running so we shall see if there is any sense to the calculations.)