Add Composable Kernel examples #332

j-stephan · 2025-10-20T15:50:37Z

Motivation

This PR adds Composable Kernel's ck_tile examples to this repository.

Technical Details

This PR only targets ROCm + Linux; Windows and CUDA are not supported by Composable Kernel.

Test Plan

Test Result

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

j-stephan · 2025-10-29T16:57:16Z

The failing markdown linter will be resolved once ROCm/rocm-docs-core#1449 is merged.

j-stephan · 2025-11-10T14:23:55Z

This PR requires ROCm 7.1. Once #341 is merged the build errors should disappear.

Copilot

Pull Request Overview

This PR adds Composable Kernel's ck_tile examples to the ROCm Examples repository, focusing exclusively on ROCm and Linux platforms (CUDA and Windows are not supported). The examples demonstrate various GPU operations using CK Tile's programming model, including GEMM operations, convolutions, and basic tensor operations.

Key Changes

Added comprehensive examples for GEMM operations (batched, block-scale, flatmm, multi-d, grouped)
Introduced grouped convolution examples (forward and backward weight)
Implemented basic operations (elementwise, reduce, permute, img2col)
Provided build infrastructure through CMake and Makefiles with architecture-specific support checks

Reviewed Changes

Copilot reviewed 111 out of 281 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
Libraries/ComposableKernel/gemm/flatmm/flatmm_basic.cpp	Implements FLATMM GEMM kernel with tile partitioning and pipeline configuration
Libraries/ComposableKernel/gemm/block_scale_gemm/gemm_aquant_basic.cpp	Implements block-scale quantized GEMM with group quantization support
Libraries/ComposableKernel/gemm/batched_gemm/batched_gemm.cpp	Implements batched GEMM operations with configurable pipeline strategies
Libraries/ComposableKernel/convolution/grouped_convolution/grouped_convolution_forward.cpp	Implements grouped convolution forward pass
Libraries/ComposableKernel/basic/reduce/reduce.cpp	Demonstrates 2D reduction operations with block tiling
Libraries/ComposableKernel/basic/permute/permute.cpp	Generic tensor permutation with matrix-core optimized alternative
CMakeLists.txt and Makefile files	Build configuration with architecture checks for gfx908/gfx90a/gfx942/gfx950

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

zichguan-amd

LGTM

vidyasagar-amd

Thanks for the addition

Signed-off-by: Jan Stephan <[email protected]>

adeljo-amd

LGTM

j-stephan self-assigned this Oct 20, 2025

j-stephan force-pushed the ComposableKernel branch 2 times, most recently from bc1a295 to 05d8ec8 Compare October 28, 2025 13:06

j-stephan marked this pull request as ready for review October 28, 2025 13:14

j-stephan requested review from a team as code owners October 28, 2025 13:14

j-stephan requested review from adeljo-amd and zichguan-amd October 29, 2025 16:57

j-stephan force-pushed the ComposableKernel branch from bc4819e to 60c923a Compare November 10, 2025 17:06

j-stephan requested a review from Copilot November 12, 2025 13:33

Copilot AI reviewed Nov 12, 2025

View reviewed changes

amitkumar-amd requested a review from vidyasagar-amd November 12, 2025 16:23

zichguan-amd approved these changes Nov 12, 2025

View reviewed changes

vidyasagar-amd approved these changes Nov 12, 2025

View reviewed changes

j-stephan added 14 commits November 14, 2025 04:25

Add basic Composable Kernel examples

fa46b56

Use examples from stable branch

8896d55

Add attention examples

8a1ac4a

Enable C++17 everywhere

ebe4175

Typo

2d49581

Add GEMM examples

216e9ff

Add MoE examples

fce96bf

Add normalization examples

e690823

Add quantization examples

422648a

Add .gitignore files

0433450

Add Makefiles

6404161

CMake fixes

fd72179

Markdown fixes

efd274e

Update to latest version

d433de6

j-stephan added 17 commits November 14, 2025 04:25

Add convolution examples

7e3159d

Add gemm_multi_d example

8cd6f9f

README updates

dfa7847

First round of fixes

c5bb32b

Second round of fixes

9445ba2

Add check for supported architectures

0b18f6d

Update basic examples to ROCm 7.1

5373298

Signed-off-by: Jan Stephan <[email protected]>

Update convolution examples to ROCm 7.1

299be7b

Signed-off-by: Jan Stephan <[email protected]>

Update GEMM examples to ROCm 7.1

28ccd0b

Signed-off-by: Jan Stephan <[email protected]>

Update attention examples to ROCm 7.1

63e794d

Signed-off-by: Jan Stephan <[email protected]>

Update Moe examples to ROCm 7.1

1b92c28

Signed-off-by: Jan Stephan <[email protected]>

Update integer types

9238c34

Signed-off-by: Jan Stephan <[email protected]>

Fix MoE issues and add supported arch info

061c7e8

Signed-off-by: Jan Stephan <[email protected]>

Reduce generated FMHA files

ddd5a28

Signed-off-by: Jan Stephan <[email protected]>

Update normalization examples to ROCm 7.1

8cd1657

Signed-off-by: Jan Stephan <[email protected]>

Update quantization examples to ROCm 7.1

1d5e94d

Signed-off-by: Jan Stephan <[email protected]>

Add per-example gfx arch checks

f7b0481

Signed-off-by: Jan Stephan <[email protected]>

j-stephan force-pushed the ComposableKernel branch from 60c923a to f7b0481 Compare November 14, 2025 09:25

adeljo-amd approved these changes Nov 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Composable Kernel examples #332

Add Composable Kernel examples #332

Uh oh!

j-stephan commented Oct 20, 2025

Uh oh!

j-stephan commented Oct 29, 2025

Uh oh!

j-stephan commented Nov 10, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

zichguan-amd left a comment

Uh oh!

vidyasagar-amd left a comment

Uh oh!

adeljo-amd left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add Composable Kernel examples #332

Are you sure you want to change the base?

Add Composable Kernel examples #332

Uh oh!

Conversation

j-stephan commented Oct 20, 2025

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

j-stephan commented Oct 29, 2025

Uh oh!

j-stephan commented Nov 10, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Key Changes

Reviewed Changes

Uh oh!

zichguan-amd left a comment

Choose a reason for hiding this comment

Uh oh!

vidyasagar-amd left a comment

Choose a reason for hiding this comment

Uh oh!

adeljo-amd left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants