PTX ISA 7.2 now supports `.loc` with `inlined_at`: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#debugging-directives-loc. This is huge: without inlining information, we currently attribute a lot of code to inlined helpers such as LLVM.jl's `call_function` instead of to the code that called them.
We should look into making LLVM emit this.
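
For reference, a rough sketch of what the extended directive looks like, based on the syntax in the linked PTX documentation. The file names, line numbers, label, and instructions are made up for illustration; this is a fragment, not a complete module, and not output from any actual compiler:

```ptx
.file 1 "kernel.jl"    // hypothetical caller source file
.file 2 "helpers.jl"   // hypothetical file containing the inlined helper

// Ordinary location: instruction comes from kernel.jl:21:3
.loc 1 21 3
add.s32 %r2, %r1, 1;

// Instruction comes from helpers.jl:9:5, in a function that was
// inlined into the call site at kernel.jl:21:3
.loc 2 9 5, function_name $L__info_string0, inlined_at 1 21 3
mul.lo.s32 %r3, %r2, %r2;

// function_name refers to a label in the .debug_str section
.section .debug_str
{
$L__info_string0:
.b8 104, 101, 108, 112, 101, 114, 0   // "helper\0"
}
```

With this, a profiler or debugger can attribute the `mul` both to the helper and to the call site in `kernel.jl`, instead of only to the helper's own source location.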