- trained resnet18 seems slower than base resnet18? is the torch.tanh slowing it down? wasn't it properly fused? - or is it the preprocessing? - need to have a way to quickly debug that