Lowering `arith::TruncFOp` from fp32 to bf16 is harder than it should be.

`arith::TruncFOp` is [currently lowered](https://github.com/intel/intel-xpu-backend-for-triton/blob/9c6423658a5168638907caebc7b16b4c482b5cfe/third_party/intel/lib/TritonIntelGPUToLLVM/ElementwiseOpToLLVM.cpp#L1300) to a number of bitwise operations since the `llvm-spirv` translation of
```
fptrunc float %21 to bfloat, !dbg !22
```
is
```
OpFConvert %half %53
```
with half defined as 
```
%half = OpTypeFloat 16
```

This is partially solved with PR #1074, but the Intel SPIR-V intrinsics only supports round-to-nearest-even so there are still some bit operations for truncation with round-to-zero. 
One of the big barriers for the SPIR-V Translation Tools (`llvm-spirv`) is that there isn't even a `bf16` type in base SPIR-V, we just have the intel extensions to truncate to a `bf16` format but the type is `i16`.
Currently we don't see patching `llvm-spirv` as a solution for internal reasons.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Lowering `arith::TruncFOp` from fp32 to bf16 is harder than it should be. #1111

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Lowering arith::TruncFOp from fp32 to bf16 is harder than it should be. #1111

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Lowering `arith::TruncFOp` from fp32 to bf16 is harder than it should be. #1111