[LoadStoreOpToLLVM] Enable the block store for tensor pointer #4666
I am cleaning up the LIT test. Will add it soon.
Pull Request Overview
This PR enables block store functionality for tensor pointer operations by extending the existing block I/O conversion infrastructure to support store operations in addition to the previously supported prefetch and load operations.
- Extends the `isMemoryRowMajor` template function to include `triton::StoreOp` in addition to existing operations
- Adds a comprehensive `getBlockIOTileSize` template function to determine optimal tile dimensions for block I/O operations
- Implements block store conversion logic in `StoreOpToBlockIOConversion` with hardware constraints validation and 2D block store operation generation
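To illustrate the first point, here is a minimal, self-contained sketch of how a template helper can be widened to accept one more op type. The op structs below are hypothetical stand-ins, not the real `triton` classes, and `isBlockIOOp` is an illustrative name that does not come from the PR.

```cpp
#include <type_traits>

// Hypothetical stand-ins for the real op classes; only the names echo the PR.
struct PrefetchOp {};
struct LoadOp {};
struct StoreOp {};
struct AddOp {}; // an op that the block I/O path does not handle

// True when T is exactly one of the Allowed types (C++17 fold expression).
template <typename T, typename... Allowed>
inline constexpr bool isOneOf = (std::is_same_v<T, Allowed> || ...);

// Assumed constraint: the set of ops a block I/O helper accepts.
// Supporting stores amounts to adding StoreOp to this list.
template <typename OpTy>
inline constexpr bool isBlockIOOp =
    isOneOf<OpTy, PrefetchOp, LoadOp, StoreOp>;

// Compile-time checks documenting the intended behaviour of the sketch.
static_assert(isBlockIOOp<StoreOp>, "stores are accepted");
static_assert(!isBlockIOOp<AddOp>, "unrelated ops are rejected");
```

A helper template can then open with `static_assert(isBlockIOOp<OpTy>)`, so an unsupported op fails at compile time rather than at run time.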
Reviewed Changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| LoadStoreOpToLLVM.cpp | Main implementation adding block store support with tile size calculation and store operation conversion |
| GetEnv.hpp | Adds new environment variable for enabling block store on regular pointers |
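
As a rough illustration of gating a feature behind an environment variable, the sketch below reads a boolean flag with the standard library only. The variable name `EXAMPLE_ENABLE_BLOCK_STORE` and the helper `readBoolEnv` are assumptions made for this example; the PR summary does not name the actual variable, and the project's own helpers in `GetEnv.hpp` may behave differently.

```cpp
#include <algorithm>
#include <cctype>
#include <cstdlib>
#include <string>

// Hypothetical helper: returns true when the variable is set to
// "1", "true", or "on" (case-insensitive); false when unset or anything else.
inline bool readBoolEnv(const char *name) {
  const char *raw = std::getenv(name);
  if (raw == nullptr)
    return false;
  std::string value(raw);
  // Lower-case the value so "TRUE" and "On" are accepted as well.
  std::transform(value.begin(), value.end(), value.begin(),
                 [](unsigned char c) {
                   return static_cast<char>(std::tolower(c));
                 });
  return value == "1" || value == "true" || value == "on";
}

// Hypothetical gate: the variable name is illustrative only.
inline bool blockStoreEnabledForRegularPointers() {
  return readBoolEnv("EXAMPLE_ENABLE_BLOCK_STORE");
}
```

Defaulting to `false` when the variable is unset keeps the existing code path unchanged unless a user opts in.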
Some of the changes are under review in a separate PR, #4704.
@whitneywhtsang, @etiotto, @LiyangLingIntel
Signed-off-by: Lu,Chengjun <[email protected]>
Enable the block store for tensor pointer.