Hello, I am trying to run the YOLO topology with an 8 MB SRAM buffer each for Input, Output, and Filter, i.e. a total of 24 MB of SRAM. Simulating just the first layer takes a long time.
I have the following questions:
- Is there a way I can estimate the runtime memory required?
- Can we run the simulation on multiple cores? I see a comment on line 69 of Simulator.py saying that running each layer is parallelizable:
# 2. Run each layer
# TODO: This is parallelizable
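To make the second question concrete, here is a rough sketch of the kind of per-layer parallelism I have in mind. It is not taken from Simulator.py: `run_layer` is a hypothetical stand-in for whatever the simulator does for one layer, and the sketch assumes the layers can be simulated independently of each other, which I have not verified.

```python
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor


def run_layer(layer_id):
    # Hypothetical placeholder for the per-layer simulation.
    # Here it just returns the layer id and a dummy "result".
    return layer_id, layer_id * layer_id


def run_all_layers(num_layers, max_workers=2, executor_cls=ProcessPoolExecutor):
    # Assumption: layers do not depend on each other's results,
    # so each one can be handed to a separate worker.
    with executor_cls(max_workers=max_workers) as pool:
        # map() returns the results in layer order.
        return list(pool.map(run_layer, range(num_layers)))


if __name__ == "__main__":
    for layer_id, result in run_all_layers(4):
        print(layer_id, result)
```

Separate processes are used by default because the work is CPU-bound; each worker would hold its own copy of the simulation state, so peak memory would grow with `max_workers`.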