Commit 192090b
llamafile : improve sgemm.cpp (#6796)
* llamafile : improve sgemm.cpp
- Re-enable by default
- Fix issue described in #6716
- Make code more abstract, elegant, and maintainable
- Faster handling of weirdly shaped `m` and `n` edge cases
* Address review comments
* Help clang produce fma instructions
* Address review comments

1 parent e931888 commit 192090b
4 files changed, +412 -573 lines changed