xiayuqing0622

Yuqing Xia xiayuqing0622

Achievements

flex_head_fa flex_head_fa Public

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 70 11
microsoft/nnfusion microsoft/nnfusion Public

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

C++ 994 165
tile-ai/tilelang tile-ai/tilelang Public

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 1.7k 162
tile-ai/AttentionEngine tile-ai/AttentionEngine Public

Python 50 2