AMD RAD's Triton-based framework for seamless multi-GPU programming
Updated Oct 6, 2025 - Python
Implementation of the FusedMM method from the IPDPS 2021 paper "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks"
Contents of my Master's thesis, "Optimized Block-Level Matrix Inversion Kernels for Small, Batched Matrices on GPUs"