tensorrt 7.2.2.3版本

对于大通道，比如1024个channels的网络，比如最简单的 单层网络
        
class conv_1024_1024_33_d1(torch.nn.Module):
    def __init__(self):
        self.in_channel = 1024
        self.out_channel = 1024
        self.kernel_w = 3
        self.kernel_h = 3
        super(conv_1024_1024_33_d1, self).__init__()
        self.conv = torch.nn.Sequential(
            OrderedDict([("conv1", torch.nn.Conv2d(self.in_channel, self.out_channel, self.kernel_w, 1, 1))])
        )
    
    def forward(self, x):
        return self.conv(x)

用这个pytorch转onnx，然后调用trt的execute接口跑出来的耗时结合这层的计算量计算出来的算力（模型计算量/耗时）大于平台标称算力，这是由于trt加速的原因还是什么？有大神知道的么？

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tensorrt 7.2.2.3版本 #12

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

tensorrt 7.2.2.3版本 #12

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions