- Microsoft Research Asia
- Beijing, China
Pinned
- flex_head_fa (public, forked from Dao-AILab/flash-attention): Fast and memory-efficient exact attention
- microsoft/nnfusion (public): A flexible and efficient deep neural network (DNN) compiler that generates high-performance executables from a DNN model description.
- tile-ai/tilelang (public): Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels