Skip to content

Pull requests: JuliaGPU/CUDA.jl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Expose directed rounding for Float64 WMMA tensor cores
#3143 opened May 16, 2026 by orkolorko Contributor Draft
4 tasks
Add support for family- and architecture-specific features
#3124 opened Apr 30, 2026 by AntonOresten Contributor Loading…
Added Base.similar methods for CuSparseMatrixCOO and BSR
#3114 opened Apr 21, 2026 by rainerrodrigues Contributor Loading…
2
9
Use Republic.jl to forward subpackage APIs
#3103 opened Apr 15, 2026 by AntonOresten Contributor Draft
CUPTI Profiler Host, Range Profiler, and PM Sampling APIs
#3059 opened Mar 23, 2026 by gbaraldi Member Draft
1 of 2 tasks
[Do not merge] Test KernelIntrinsics
#2944 opened Oct 22, 2025 by christiangnrd Member Loading…
Add AnyCuDeviceArray variations and CuScalar cuda array Stuff about CuArray. speculative Not sure about this one yet.
#2849 opened Aug 22, 2025 by moukle Loading…
fixes the kron implementation for sparse + diagonal matrix
#2804 opened Jun 27, 2025 by tam724 Contributor Loading…
add fastmath flag
#2732 opened Apr 9, 2025 by vchuravy Member Draft
Try fast linear indexes for KA enhancement New feature or request needs changes Changes are needed. performance How fast can we go?
#2612 opened Jan 9, 2025 by vchuravy Member Draft
Allow disabling the linking of libdevice in CUDACompilerParams enhancement New feature or request needs changes Changes are needed. speculative Not sure about this one yet.
#2611 opened Jan 8, 2025 by gbaraldi Member Draft
make CUDA randn work with Zygote enhancement New feature or request needs changes Changes are needed.
#2581 opened Dec 9, 2024 by bgctw Draft
WIP: Native I/O. cuda kernels Stuff about writing CUDA kernels. speculative Not sure about this one yet.
#2485 opened Sep 5, 2024 by maleadt Member Draft
High Level Wrapper for Fused Matmul + Bias + Activation cuda libraries Stuff about CUDA library wrappers. enhancement New feature or request
#2360 opened May 4, 2024 by avik-pal Draft
Use TaskLocalValues enhancement New feature or request speculative Not sure about this one yet.
#2075 opened Sep 8, 2023 by vchuravy Member Draft
Support FFT adjoint plans and test cuda libraries Stuff about CUDA library wrappers. enhancement New feature or request
#2073 opened Sep 4, 2023 by gaurav-arya Draft
Add contract through FastmathOverlays.jl cuda kernels Stuff about writing CUDA kernels. enhancement New feature or request
#2037 opened Aug 16, 2023 by vchuravy Member Draft
WIP: Add an index typevar to CuDeviceArray. enhancement New feature or request help wanted Extra attention is needed performance How fast can we go?
#1895 opened May 3, 2023 by maleadt Member Draft
Add an experimental opaque closure type. cuda kernels Stuff about writing CUDA kernels. enhancement New feature or request speculative Not sure about this one yet.
#1853 opened Apr 4, 2023 by maleadt Member Draft
Add wrappers for NVPERF
#1823 opened Mar 22, 2023 by vchuravy Member Draft
Use Atomix
#1790 opened Mar 10, 2023 by vchuravy Member Draft
4 tasks
ProTip! Exclude everything labeled bug with -label:bug.