I'm having a hard time understanding how a simple model like ResNet(18) is created.
I think in general for a library like this it would be good to favor
- simplicity
- readability
- locality
at the cost of some code duplication and verbosity. This is basically the philosophy adopted by hugging face transformers.
So each model should have its own file and define its own submodules.
I'm having a hard time understanding how a simple model like
ResNet(18)is created.I think in general for a library like this it would be good to favor
at the cost of some code duplication and verbosity. This is basically the philosophy adopted by hugging face transformers.
So each model should have its own file and define its own submodules.