[CodeGen] Use extendhfsf2 and truncsfhf2 by default #126880

nikic · 2025-02-12T10:04:34Z

The standard libcalls for half to float and float to half conversion are __extendhfsf2 and __truncsfhf2. However, LLVM currently uses __gnu_h2f_ieee and __gnu_f2h_ieee instead. As far as I can tell, these libcalls are an ARM-ism and only provided by libgcc on that platform. compiler-rt always provides both libcalls.

Use the standard libcalls by default, and only use the __gnu libcalls on ARM.

llvmbot · 2025-02-12T10:05:15Z

@llvm/pr-subscribers-lld-wasm
@llvm/pr-subscribers-llvm-globalisel
@llvm/pr-subscribers-llvm-ir
@llvm/pr-subscribers-backend-aarch64
@llvm/pr-subscribers-backend-x86
@llvm/pr-subscribers-backend-hexagon

@llvm/pr-subscribers-backend-arm

Author: Nikita Popov (nikic)

Changes

The standard libcalls for half to float and float to half conversion are __extendhfsf2 and __truncsfhf2. However, LLVM currently uses __gnu_h2f_ieee and __gnu_f2h_ieee instead. As far as I can tell, these libcalls are ARM-ism and only provided by libgcc on that platform. compiler-rt always provides both libcalls.

Use the standard libcalls by default, and only use the __gnu libcalls on ARM to improve libgcc compatibility.

We encountered this issue with MLIR execution engine test failures on Power 8.

Patch is 223.28 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/126880.diff

35 Files Affected:

(modified) llvm/include/llvm/IR/RuntimeLibcalls.def (+2-2)
(modified) llvm/lib/IR/RuntimeLibcalls.cpp (-3)
(modified) llvm/lib/Target/ARM/ARMISelLowering.cpp (+3)
(modified) llvm/lib/Target/Hexagon/HexagonISelLowering.cpp (-5)
(modified) llvm/lib/Target/RISCV/RISCVISelLowering.cpp (-3)
(modified) llvm/lib/Target/WebAssembly/WebAssemblyISelLowering.cpp (-5)
(modified) llvm/lib/Target/WebAssembly/WebAssemblyRuntimeLibcallSignatures.cpp (-4)
(modified) llvm/lib/Target/X86/X86ISelLowering.cpp (-3)
(modified) llvm/test/CodeGen/AArch64/16bit-float-promotion-with-nofp.ll (+1-1)
(modified) llvm/test/CodeGen/AArch64/atomicrmw-fadd.ll (+12-12)
(modified) llvm/test/CodeGen/AArch64/atomicrmw-fmax.ll (+12-12)
(modified) llvm/test/CodeGen/AArch64/atomicrmw-fmin.ll (+12-12)
(modified) llvm/test/CodeGen/AArch64/atomicrmw-fsub.ll (+12-12)
(modified) llvm/test/CodeGen/AArch64/strictfp_f16_abi_promote.ll (+20-20)
(modified) llvm/test/CodeGen/LoongArch/fp16-promote.ll (+46-46)
(modified) llvm/test/CodeGen/Mips/fp16-promote.ll (+48-48)
(modified) llvm/test/CodeGen/Mips/ldexp.ll (+2-2)
(modified) llvm/test/CodeGen/PowerPC/atomics.ll (+4-4)
(modified) llvm/test/CodeGen/PowerPC/handle-f16-storage-type.ll (+84-84)
(modified) llvm/test/CodeGen/PowerPC/pr48519.ll (+7-7)
(modified) llvm/test/CodeGen/PowerPC/pr49092.ll (+1-1)
(modified) llvm/test/CodeGen/PowerPC/vector-llrint.ll (+378-378)
(modified) llvm/test/CodeGen/PowerPC/vector-lrint.ll (+378-378)
(modified) llvm/test/CodeGen/SPARC/fp16-promote.ll (+38-38)
(modified) llvm/test/CodeGen/VE/Scalar/fp_extload_truncstore.ll (+14-14)
(modified) llvm/test/CodeGen/X86/cvt16.ll (+5-5)
(modified) llvm/test/CodeGen/X86/fmf-flags.ll (+3-3)
(modified) llvm/test/CodeGen/X86/fp-i129.ll (+2-2)
(modified) llvm/test/CodeGen/X86/fp128-cast-strict.ll (+2-2)
(modified) llvm/test/CodeGen/X86/fptosi-sat-scalar.ll (+10-10)
(modified) llvm/test/CodeGen/X86/fptoui-sat-scalar.ll (+10-10)
(modified) llvm/test/CodeGen/X86/frem.ll (+1-1)
(modified) llvm/test/CodeGen/X86/half-constrained.ll (+7-7)
(modified) llvm/test/CodeGen/X86/ldexp.ll (+2-2)
(modified) llvm/test/CodeGen/X86/llvm.frexp.ll (+5-5)

diff --git a/llvm/include/llvm/IR/RuntimeLibcalls.def b/llvm/include/llvm/IR/RuntimeLibcalls.def
index a7963543c4350..c6ac341d71a20 100644
--- a/llvm/include/llvm/IR/RuntimeLibcalls.def
+++ b/llvm/include/llvm/IR/RuntimeLibcalls.def
@@ -384,8 +384,8 @@ HANDLE_LIBCALL(FPEXT_F16_F128, "__extendhftf2")
 HANDLE_LIBCALL(FPEXT_F16_F80, "__extendhfxf2")
 HANDLE_LIBCALL(FPEXT_F32_F64, "__extendsfdf2")
 HANDLE_LIBCALL(FPEXT_F16_F64, "__extendhfdf2")
-HANDLE_LIBCALL(FPEXT_F16_F32, "__gnu_h2f_ieee")
-HANDLE_LIBCALL(FPROUND_F32_F16, "__gnu_f2h_ieee")
+HANDLE_LIBCALL(FPEXT_F16_F32, "__extendhfsf2")
+HANDLE_LIBCALL(FPROUND_F32_F16, "__truncsfhf2")
 HANDLE_LIBCALL(FPROUND_F64_F16, "__truncdfhf2")
 HANDLE_LIBCALL(FPROUND_F80_F16, "__truncxfhf2")
 HANDLE_LIBCALL(FPROUND_F128_F16, "__trunctfhf2")
diff --git a/llvm/lib/IR/RuntimeLibcalls.cpp b/llvm/lib/IR/RuntimeLibcalls.cpp
index e38fce764b640..1f94400f7c088 100644
--- a/llvm/lib/IR/RuntimeLibcalls.cpp
+++ b/llvm/lib/IR/RuntimeLibcalls.cpp
@@ -170,9 +170,6 @@ void RuntimeLibcallsInfo::initLibcalls(const Triple &TT) {
     // TODO: BridgeOS should be included in isOSDarwin.
     setLibcallName(RTLIB::EXP10_F32, "__exp10f");
     setLibcallName(RTLIB::EXP10_F64, "__exp10");
-  } else {
-    setLibcallName(RTLIB::FPEXT_F16_F32, "__gnu_h2f_ieee");
-    setLibcallName(RTLIB::FPROUND_F32_F16, "__gnu_f2h_ieee");
   }
 
   if (TT.isGNUEnvironment() || TT.isOSFuchsia() ||
diff --git a/llvm/lib/Target/ARM/ARMISelLowering.cpp b/llvm/lib/Target/ARM/ARMISelLowering.cpp
index 5c4fe9d922f4c..3a975e5db0e5e 100644
--- a/llvm/lib/Target/ARM/ARMISelLowering.cpp
+++ b/llvm/lib/Target/ARM/ARMISelLowering.cpp
@@ -767,6 +767,9 @@ ARMTargetLowering::ARMTargetLowering(const TargetMachine &TM,
       setLibcallName(LC.Op, LC.Name);
       setLibcallCallingConv(LC.Op, LC.CC);
     }
+  } else if (!Subtarget->isTargetMachO()) {
+    setLibcallName(RTLIB::FPROUND_F32_F16, "__gnu_f2h_ieee");
+    setLibcallName(RTLIB::FPEXT_F16_F32, "__gnu_h2f_ieee");
   }
 
   if (Subtarget->isThumb1Only())
diff --git a/llvm/lib/Target/Hexagon/HexagonISelLowering.cpp b/llvm/lib/Target/Hexagon/HexagonISelLowering.cpp
index 1a7667fe42fbc..be1960db41479 100644
--- a/llvm/lib/Target/Hexagon/HexagonISelLowering.cpp
+++ b/llvm/lib/Target/Hexagon/HexagonISelLowering.cpp
@@ -1886,11 +1886,6 @@ HexagonTargetLowering::HexagonTargetLowering(const TargetMachine &TM,
     setLibcallName(RTLIB::SQRT_F32, "__hexagon_fast2_sqrtf");
   else
     setLibcallName(RTLIB::SQRT_F32, "__hexagon_sqrtf");
-
-  // Routines to handle fp16 storage type.
-  setLibcallName(RTLIB::FPROUND_F32_F16, "__truncsfhf2");
-  setLibcallName(RTLIB::FPROUND_F64_F16, "__truncdfhf2");
-  setLibcallName(RTLIB::FPEXT_F16_F32, "__extendhfsf2");
 }
 
 const char* HexagonTargetLowering::getTargetNodeName(unsigned Opcode) const {
diff --git a/llvm/lib/Target/RISCV/RISCVISelLowering.cpp b/llvm/lib/Target/RISCV/RISCVISelLowering.cpp
index 7ca8482149eb9..1e06215fa79e6 100644
--- a/llvm/lib/Target/RISCV/RISCVISelLowering.cpp
+++ b/llvm/lib/Target/RISCV/RISCVISelLowering.cpp
@@ -1549,9 +1549,6 @@ RISCVTargetLowering::RISCVTargetLowering(const TargetMachine &TM,
   if (Subtarget.useRVVForFixedLengthVectors())
     setTargetDAGCombine(ISD::BITCAST);
 
-  setLibcallName(RTLIB::FPEXT_F16_F32, "__extendhfsf2");
-  setLibcallName(RTLIB::FPROUND_F32_F16, "__truncsfhf2");
-
   // Disable strict node mutation.
   IsStrictFPEnabled = true;
   EnableExtLdPromotion = true;
diff --git a/llvm/lib/Target/WebAssembly/WebAssemblyISelLowering.cpp b/llvm/lib/Target/WebAssembly/WebAssemblyISelLowering.cpp
index fedad25c775e2..3c918e8b675f0 100644
--- a/llvm/lib/Target/WebAssembly/WebAssemblyISelLowering.cpp
+++ b/llvm/lib/Target/WebAssembly/WebAssemblyISelLowering.cpp
@@ -377,11 +377,6 @@ WebAssemblyTargetLowering::WebAssemblyTargetLowering(
 
   setMaxAtomicSizeInBitsSupported(64);
 
-  // Override the __gnu_f2h_ieee/__gnu_h2f_ieee names so that the f32 name is
-  // consistent with the f64 and f128 names.
-  setLibcallName(RTLIB::FPEXT_F16_F32, "__extendhfsf2");
-  setLibcallName(RTLIB::FPROUND_F32_F16, "__truncsfhf2");
-
   // Define the emscripten name for return address helper.
   // TODO: when implementing other Wasm backends, make this generic or only do
   // this on emscripten depending on what they end up doing.
diff --git a/llvm/lib/Target/WebAssembly/WebAssemblyRuntimeLibcallSignatures.cpp b/llvm/lib/Target/WebAssembly/WebAssemblyRuntimeLibcallSignatures.cpp
index b20a06b238c88..1fe0b1f2e0591 100644
--- a/llvm/lib/Target/WebAssembly/WebAssemblyRuntimeLibcallSignatures.cpp
+++ b/llvm/lib/Target/WebAssembly/WebAssemblyRuntimeLibcallSignatures.cpp
@@ -537,10 +537,6 @@ struct StaticLibcallNameMap {
         Map[NameLibcall.first] = NameLibcall.second;
       }
     }
-    // Override the __gnu_f2h_ieee/__gnu_h2f_ieee names so that the f32 name is
-    // consistent with the f64 and f128 names.
-    Map["__extendhfsf2"] = RTLIB::FPEXT_F16_F32;
-    Map["__truncsfhf2"] = RTLIB::FPROUND_F32_F16;
 
     Map["emscripten_return_address"] = RTLIB::RETURN_ADDRESS;
   }
diff --git a/llvm/lib/Target/X86/X86ISelLowering.cpp b/llvm/lib/Target/X86/X86ISelLowering.cpp
index 91249f0bb009f..839c28ef643ef 100644
--- a/llvm/lib/Target/X86/X86ISelLowering.cpp
+++ b/llvm/lib/Target/X86/X86ISelLowering.cpp
@@ -736,9 +736,6 @@ X86TargetLowering::X86TargetLowering(const X86TargetMachine &TM,
     setOperationAction(ISD::STRICT_FP_EXTEND, MVT::f32, Custom);
     setOperationAction(ISD::STRICT_FP_EXTEND, MVT::f64, Custom);
 
-    setLibcallName(RTLIB::FPROUND_F32_F16, "__truncsfhf2");
-    setLibcallName(RTLIB::FPEXT_F16_F32, "__extendhfsf2");
-
     // Lower this to MOVMSK plus an AND.
     setOperationAction(ISD::FGETSIGN, MVT::i64, Custom);
     setOperationAction(ISD::FGETSIGN, MVT::i32, Custom);
diff --git a/llvm/test/CodeGen/AArch64/16bit-float-promotion-with-nofp.ll b/llvm/test/CodeGen/AArch64/16bit-float-promotion-with-nofp.ll
index bfe9ab8424bb0..0bd7c1b10b123 100644
--- a/llvm/test/CodeGen/AArch64/16bit-float-promotion-with-nofp.ll
+++ b/llvm/test/CodeGen/AArch64/16bit-float-promotion-with-nofp.ll
@@ -7,7 +7,7 @@ define half @f2h(float %a) {
 ; CHECK-NEXT:    str x30, [sp, #-16]! // 8-byte Folded Spill
 ; CHECK-NEXT:    .cfi_def_cfa_offset 16
 ; CHECK-NEXT:    .cfi_offset w30, -16
-; CHECK-NEXT:    bl __gnu_f2h_ieee
+; CHECK-NEXT:    bl __truncsfhf2
 ; CHECK-NEXT:    ldr x30, [sp], #16 // 8-byte Folded Reload
 ; CHECK-NEXT:    ret
 entry:
diff --git a/llvm/test/CodeGen/AArch64/atomicrmw-fadd.ll b/llvm/test/CodeGen/AArch64/atomicrmw-fadd.ll
index 0c3a40d93d640..21729b9dfd101 100644
--- a/llvm/test/CodeGen/AArch64/atomicrmw-fadd.ll
+++ b/llvm/test/CodeGen/AArch64/atomicrmw-fadd.ll
@@ -60,13 +60,13 @@ define half @test_atomicrmw_fadd_f16_seq_cst_align2(ptr %ptr, half %value) #0 {
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB0_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    mov w22, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w20, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w21, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w21
 ; SOFTFP-NOLSE-NEXT:    bl __addsf3
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w0
 ; SOFTFP-NOLSE-NEXT:  .LBB0_3: // %cmpxchg.start
 ; SOFTFP-NOLSE-NEXT:    // Parent Loop BB0_2 Depth=1
@@ -148,13 +148,13 @@ define half @test_atomicrmw_fadd_f16_seq_cst_align4(ptr %ptr, half %value) #0 {
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB1_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    mov w22, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w20, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w21, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w21
 ; SOFTFP-NOLSE-NEXT:    bl __addsf3
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w0
 ; SOFTFP-NOLSE-NEXT:  .LBB1_3: // %cmpxchg.start
 ; SOFTFP-NOLSE-NEXT:    // Parent Loop BB1_2 Depth=1
@@ -712,22 +712,22 @@ define <2 x half> @test_atomicrmw_fadd_v2f16_seq_cst_align4(ptr %ptr, <2 x half>
 ; SOFTFP-NOLSE-NEXT:    // =>This Loop Header: Depth=1
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB7_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    and w0, w19, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w24, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w23, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w24
 ; SOFTFP-NOLSE-NEXT:    bl __addsf3
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w24, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w21, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w25, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w25
 ; SOFTFP-NOLSE-NEXT:    bl __addsf3
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w22
 ; SOFTFP-NOLSE-NEXT:    bfi w0, w24, #16, #16
 ; SOFTFP-NOLSE-NEXT:    bfi w8, w23, #16, #16
diff --git a/llvm/test/CodeGen/AArch64/atomicrmw-fmax.ll b/llvm/test/CodeGen/AArch64/atomicrmw-fmax.ll
index 24088998f36d1..9b5e48d2b4217 100644
--- a/llvm/test/CodeGen/AArch64/atomicrmw-fmax.ll
+++ b/llvm/test/CodeGen/AArch64/atomicrmw-fmax.ll
@@ -62,13 +62,13 @@ define half @test_atomicrmw_fmax_f16_seq_cst_align2(ptr %ptr, half %value) #0 {
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB0_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    mov w22, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w20, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w21, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w21
 ; SOFTFP-NOLSE-NEXT:    bl fmaxf
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w0
 ; SOFTFP-NOLSE-NEXT:  .LBB0_3: // %cmpxchg.start
 ; SOFTFP-NOLSE-NEXT:    // Parent Loop BB0_2 Depth=1
@@ -150,13 +150,13 @@ define half @test_atomicrmw_fmax_f16_seq_cst_align4(ptr %ptr, half %value) #0 {
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB1_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    mov w22, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w20, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w21, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w21
 ; SOFTFP-NOLSE-NEXT:    bl fmaxf
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w0
 ; SOFTFP-NOLSE-NEXT:  .LBB1_3: // %cmpxchg.start
 ; SOFTFP-NOLSE-NEXT:    // Parent Loop BB1_2 Depth=1
@@ -592,22 +592,22 @@ define <2 x half> @test_atomicrmw_fmax_v2f16_seq_cst_align4(ptr %ptr, <2 x half>
 ; SOFTFP-NOLSE-NEXT:    // =>This Loop Header: Depth=1
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB6_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    and w0, w19, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w24, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w23, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w24
 ; SOFTFP-NOLSE-NEXT:    bl fmaxf
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w24, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w21, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w25, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w25
 ; SOFTFP-NOLSE-NEXT:    bl fmaxf
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w22
 ; SOFTFP-NOLSE-NEXT:    bfi w0, w24, #16, #16
 ; SOFTFP-NOLSE-NEXT:    bfi w8, w23, #16, #16
diff --git a/llvm/test/CodeGen/AArch64/atomicrmw-fmin.ll b/llvm/test/CodeGen/AArch64/atomicrmw-fmin.ll
index 65f1f4863c173..f6c542fe7d407 100644
--- a/llvm/test/CodeGen/AArch64/atomicrmw-fmin.ll
+++ b/llvm/test/CodeGen/AArch64/atomicrmw-fmin.ll
@@ -62,13 +62,13 @@ define half @test_atomicrmw_fmin_f16_seq_cst_align2(ptr %ptr, half %value) #0 {
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB0_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    mov w22, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w20, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w21, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w21
 ; SOFTFP-NOLSE-NEXT:    bl fminf
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w0
 ; SOFTFP-NOLSE-NEXT:  .LBB0_3: // %cmpxchg.start
 ; SOFTFP-NOLSE-NEXT:    // Parent Loop BB0_2 Depth=1
@@ -150,13 +150,13 @@ define half @test_atomicrmw_fmin_f16_seq_cst_align4(ptr %ptr, half %value) #0 {
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB1_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    mov w22, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w20, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w21, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w21
 ; SOFTFP-NOLSE-NEXT:    bl fminf
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w0
 ; SOFTFP-NOLSE-NEXT:  .LBB1_3: // %cmpxchg.start
 ; SOFTFP-NOLSE-NEXT:    // Parent Loop BB1_2 Depth=1
@@ -592,22 +592,22 @@ define <2 x half> @test_atomicrmw_fmin_v2f16_seq_cst_align4(ptr %ptr, <2 x half>
 ; SOFTFP-NOLSE-NEXT:    // =>This Loop Header: Depth=1
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB6_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    and w0, w19, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w24, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w23, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w24
 ; SOFTFP-NOLSE-NEXT:    bl fminf
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w24, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w21, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w25, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w25
 ; SOFTFP-NOLSE-NEXT:    bl fminf
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w22
 ; SOFTFP-NOLSE-NEXT:    bfi w0, w24, #16, #16
 ; SOFTFP-NOLSE-NEXT:    bfi w8, w23, #16, #16
diff --git a/llvm/test/CodeGen/AArch64/atomicrmw-fsub.ll b/llvm/test/CodeGen/AArch64/atomicrmw-fsub.ll
index 0f1a2f03c98c3..82e0f14e68e26 100644
--- a/llvm/test/CodeGen/AArch64/atomicrmw-fsub.ll
+++ b/llvm/test/CodeGen/AArch64/atomicrmw-fsub.ll
@@ -60,13 +60,13 @@ define half @test_atomicrmw_fsub_f16_seq_cst_align2(ptr %ptr, half %value) #0 {
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB0_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    mov w22, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w20, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w21, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w21
 ; SOFTFP-NOLSE-NEXT:    bl __subsf3
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w0
 ; SOFTFP-NOLSE-NEXT:  .LBB0_3: // %cmpxchg.start
 ; SOFTFP-NOLSE-NEXT:    // Parent Loop BB0_2 Depth=1
@@ -148,13 +148,13 @@ define half @test_atomicrmw_fsub_f16_seq_cst_align4(ptr %ptr, half %value) #0 {
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB1_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    mov w22, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w20, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w21, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w21
 ; SOFTFP-NOLSE-NEXT:    bl __subsf3
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w0
 ; SOFTFP-NOLSE-NEXT:  .LBB1_3: // %cmpxchg.start
 ; SOFTFP-NOLSE-NEXT:    // Parent Loop BB1_2 Depth=1
@@ -712,22 +712,22 @@ define <2 x half> @test_atomicrmw_fsub_v2f16_seq_cst_align4(ptr %ptr, <2 x half>
 ; SOFTFP-NOLSE-NEXT:    // =>This Loop Header: Depth=1
 ; SOFTFP-NOLSE-NEXT:    // Child Loop BB7_3 Depth 2
 ; SOFTFP-NOLSE-NEXT:    and w0, w19, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w24, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w23, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w24
 ; SOFTFP-NOLSE-NEXT:    bl __subsf3
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w24, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w21, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w25, w0
 ; SOFTFP-NOLSE-NEXT:    and w0, w22, #0xffff
-; SOFTFP-NOLSE-NEXT:    bl __gnu_h2f_ieee
+; SOFTFP-NOLSE-NEXT:    bl __extendhfsf2
 ; SOFTFP-NOLSE-NEXT:    mov w1, w25
 ; SOFTFP-NOLSE-NEXT:    bl __subsf3
-; SOFTFP-NOLSE-NEXT:    bl __gnu_f2h_ieee
+; SOFTFP-NOLSE-NEXT:    bl __truncsfhf2
 ; SOFTFP-NOLSE-NEXT:    mov w8, w22
 ; SOFTFP-NOLSE-NEXT:    bfi w0, w24, #16, #16
 ; SOFTFP-NOLSE-NEXT:    bfi w8, w23, #16, #16
diff --git a/llvm/test/CodeGen/AArch64/strictfp_f16_abi_promote.ll b/llvm/test/CodeGen/AArch64/strictfp_f16_abi_promote.ll
index 3db802a2bc355..63b8a1cee27ae 100644
--- a/llvm/test/CodeGen/AArch64/strictfp_f16_abi_promote.ll
+++ b/llvm/test/CodeGen/AArch64/strictfp_f16_abi_promote.ll
@@ -22,7 +22,7 @@ define void @f16_arg(half %arg, ptr %ptr) #0 {
 ; NOFP16-NEXT:    .cfi_offset w30, -16
 ; NOFP16-NEXT:    and w0, w0, #0xffff
 ; NOFP16-NEXT:    mov x19, x1
-; NOFP16-NEXT:    bl __gnu_h2f_ieee
+; NOFP16-NEXT:    bl __extendhfsf2
 ; NOFP16-NEXT:    str w0, [x19]
 ; NOFP16-NEXT:    ldp x30, x19, [sp], #16 // 16-byte Folded Reload
 ; NOFP16-NEXT:    ret
@@ -44,10 +44,10 @@ define void @v2f16_arg(<2 x half> %arg, ptr %ptr) #0 {
 ; NOFP16-NEXT:    and w0, w0, #0xffff
 ; NOFP16-NEXT:    mov x19, x2
 ; NOFP16-NEXT:    mov w20, w1
-; NOFP16-NEXT:    bl __gnu_h2f_ieee
+; NOFP16-NEXT:    bl __extendhfsf2
 ; NOFP16-NEXT:    mov w21, w0
 ; NOFP16-NEXT:    and w0, w20, #0xffff
-; NOFP16-NEXT:    bl __gnu_h2f_ieee
+; NOFP16-NEXT:    bl __extendhfsf2
 ; NOFP16-NEXT:    stp w21, w0, [x19]
 ; NOFP16-NEXT:    ldp x20, x19, [sp, #16] // 16-byte Folded Reload
 ; NOFP16-NEXT:    ldp x30, x21, [sp], #32 // 16-byte Folded Reload
@@ -73,14 +73,14 @@ define void @v3f16_arg(<3 x half> %arg, ptr %ptr) #0 {
 ; NOFP16-NEXT:    and w0, w1, #0xffff
 ; NOFP16-NEXT:    mov x19, x3
 ; NOFP16-NEXT:    mov w20, w2
-; NOFP16-NEXT:    bl __gnu_h2f_ieee
+; NOFP16-NEXT:    bl __extendhfsf2
 ; NOFP16-NEXT:    mov w22, w0
 ; NOFP16-NEXT:    and w0, w21, #0xffff
-; NOFP16-NEXT:    bl __gnu_h2f_i...
[truncated]

nikic · 2025-02-12T13:50:17Z

Okay, looking a bit more deeply, libgcc only seems to actually enable __extendhfsf2 and __truncsfhf2 for i386 and riscv, if I'm reading the sources correctly.

So while I still think it makes sense to use these symbols instead of the ARM ones, it probably makes little practical difference...

The standard libcalls for half to float and float to half conversion are __extendhfsf2 and __truncsfhf2. However, LLVM currently uses __gnu_h2f_ieee and __gnu_f2h_ieee instead. As far as I can tell, these libcalls are ARM-ism and only provided by libgcc on that platform. compiler-rt always provides both libcalls. Use the standard libcalls by default, and only use the __gnu libcalls on ARM to improve libgcc compatibilty. We encounted this issue with MLIR execution engine test failures on Power 8.

The issue this testing is now fixed for the half float intrinsics. Use a different one that still has the issue.

nikic · 2025-02-17T12:06:34Z

Adjusted the lld test to use a different builtin, because the issue it is testing is fixed for half float builtins now.

efriedma-quic

Assuming this matches the defaults for libgcc, LGTM.

The standard libcalls for half to float and float to half conversion are __extendhfsf2 and __truncsfhf2. However, LLVM currently uses __gnu_h2f_ieee and __gnu_f2h_ieee instead. As far as I can tell, these libcalls are an ARM-ism and only provided by libgcc on that platform. compiler-rt always provides both libcalls. Use the standard libcalls by default, and only use the __gnu libcalls on ARM.

mgorny · 2025-02-22T10:37:14Z

This change broke CSKY CodeGen tests:

FAIL: LLVM :: CodeGen/CSKY/fpu/fp16-promote.ll (1 of 46)                                                                               
******************** TEST 'LLVM :: CodeGen/CSKY/fpu/fp16-promote.ll' FAILED ********************                                       
Exit Code: 1                                                                                                                           
                                                                                                                                       
Command Output (stderr):                                                                                                               
--                                                                                                                                     
RUN: at line 2: /home/mgorny/llvm-project/build/bin/llc -verify-machineinstrs -csky-no-aliases < /home/mgorny/llvm-project/llvm/test/CodeGen/CSKY/fpu/fp16-promote.ll -mtriple=csky -float-abi=hard -mattr=+hard-float -mattr=+2e3 -mattr=+fpuv2_sf,+fpuv2_df | /home/mgorny/llvm-project/build/bin/FileCheck /home/mgorny/llvm-project/llvm/test/CodeGen/CSKY/fpu/fp16-promote.ll --check-prefix=CHECK-FPUV2
+ /home/mgorny/llvm-project/build/bin/llc -verify-machineinstrs -csky-no-aliases -mtriple=csky -float-abi=hard -mattr=+hard-float -mattr=+2e3 -mattr=+fpuv2_sf,+fpuv2_df
+ /home/mgorny/llvm-project/build/bin/FileCheck /home/mgorny/llvm-project/llvm/test/CodeGen/CSKY/fpu/fp16-promote.ll --check-prefix=CHECK-FPUV2
/home/mgorny/llvm-project/llvm/test/CodeGen/CSKY/fpu/fp16-promote.ll:36:21: error: CHECK-FPUV2-NEXT: expected string not found in input
; CHECK-FPUV2-NEXT: .long __gnu_h2f_ieee                                                                                               
                    ^                                                                                                                  
<stdin>:29:10: note: scanning from here                                                                                                
.LCPI1_0:                                                                                                                              
         ^                                                                                                                             
<stdin>:30:2: note: possible intended match here                                                                                       
 .long __extendhfsf2                                                                                                                   
 ^                                                                                                                                     
/home/mgorny/llvm-project/llvm/test/CodeGen/CSKY/fpu/fp16-promote.ll:72:21: error: CHECK-FPUV2-NEXT: expected string not found in input
; CHECK-FPUV2-NEXT: .long __gnu_h2f_ieee                                                                                               
                    ^                                                                                                                  
<stdin>:50:10: note: scanning from here                                                                                                
.LCPI2_0:                                                                                                                              
         ^                                                                                                                             
<stdin>:51:2: note: possible intended match here                                                                                       
 .long __extendhfsf2                                                                                                                   
 ^                                                                                                                                     
/home/mgorny/llvm-project/llvm/test/CodeGen/CSKY/fpu/fp16-promote.ll:111:21: error: CHECK-FPUV2-NEXT: expected string not found in input
; CHECK-FPUV2-NEXT: .long __gnu_f2h_ieee                                                                                               
                    ^                                                                                                                  
<stdin>:73:10: note: scanning from here                                                                                                
.LCPI3_0:                                                                                                                              
         ^                                                                                                                             
<stdin>:74:2: note: possible intended match here                                                                                       
 .long __truncsfhf2                                                                                                                    
 ^                                                                                                                                     
/home/mgorny/llvm-project/llvm/test/CodeGen/CSKY/fpu/fp16-promote.ll:204:21: error: CHECK-FPUV2-NEXT: expected string not found in input
; CHECK-FPUV2-NEXT: .long __gnu_h2f_ieee                                                                                               
                    ^                                                                                                                  
<stdin>:130:10: note: scanning from here                                                                                               
.LCPI5_0:                                                                                                                              
         ^                                                                                                                             
<stdin>:133:2: note: possible intended match here                                                                                      
 .long __truncsfhf2                                                                                                                    
 ^                                                                                                                                     
/home/mgorny/llvm-project/llvm/test/CodeGen/CSKY/fpu/fp16-promote.ll:273:21: error: CHECK-FPUV2-NEXT: expected string not found in input
; CHECK-FPUV2-NEXT: .long __gnu_h2f_ieee                                                                                               
                    ^                                                                                                                  
<stdin>:166:10: note: scanning from here                                                                                               
.LCPI6_0:                                                                                                                              
         ^                                                                                                                             
<stdin>:169:2: note: possible intended match here                                                                                      
 .long __truncsfhf2                                                                                                                    
 ^                                                                                                                                     
                                                                                                                                       
Input file: <stdin>                                                                                                                    
Check file: /home/mgorny/llvm-project/llvm/test/CodeGen/CSKY/fpu/fp16-promote.ll                                                       
                                                                                                                                       
-dump-input=help explains the following input dump.                                                                                    
                                                                                                                                       
Input was:                                                                                                                             
<<<<<<                                                                                                                                 
            .                                                                                                                          
            .                                                                                                                          
            .                                                                                                                          
           24:  addi16 sp, sp, 4                                                                                                       
           25:  rts16                                                                                                                  
           26:  .p2align 1                                                                                                             
           27: # %bb.1:                                                                                                                
           28:  .p2align 2, 0x0                                                                                                        
           29: .LCPI1_0:                                                                                                               
next:36'0               X error: no match found                                                                                        
           30:  .long __extendhfsf2                                                                                                    
next:36'0      ~~~~~~~~~~~~~~~~~~~~~                                                                                                   
next:36'1       ?                    possible intended match                                                                           
           31: .Lfunc_end1:                                                                                                            
next:36'0      ~~~~~~~~~~~~~                                                                                                           
           32:  .size test_fpextend_float, .Lfunc_end1-test_fpextend_float                                                             
next:36'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~                                                            
           33:  # -- End function                                                                                                      
next:36'0      ~~~~~~~~~~~~~~~~~~~                                                                                                     
           34:  .globl test_fpextend_double # -- Begin function test_fpextend_double                                                   
next:36'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~                                                  
           35:  .p2align 1                                                                                                             
next:36'0      ~~~~~~~~~~~~                                                                                                            
            .                                                                                                                          
            .                                                                                                                          
            .                                                                                                                          
           45:  addi16 sp, sp, 4                                                                                                       
           46:  rts16                                                                                                                  
           47:  .p2align 1                                                                                                             
           48: # %bb.1:                                                                                                                
           49:  .p2align 2, 0x0                                                                                                        
           50: .LCPI2_0:                                                                                                               
next:72'0               X error: no match found                                                                                        
           51:  .long __extendhfsf2                                                                                                    
next:72'0      ~~~~~~~~~~~~~~~~~~~~~                                                                                                   
next:72'1       ?                    possible intended match                                                                           
           52: .Lfunc_end2:                                                                                                            
next:72'0      ~~~~~~~~~~~~~                                                                                                           
           53:  .size test_fpextend_double, .Lfunc_end2-test_fpextend_double                                                           
next:72'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~                                                          
           54:  # -- End function                                                                                                      
next:72'0      ~~~~~~~~~~~~~~~~~~~                                                                                                     
           55:  .globl test_fptrunc_float # -- Begin function test_fptrunc_float                                                       
next:72'0      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~                                                      
           56:  .p2align 1                                                                                                             
next:72'0      ~~~~~~~~~~~~                                                                                                            
            .                                                                                                                          
            .                                                                                                                          
            .                                                                                                                          
           68:  addi16 sp, sp, 8                                                                                                       
           69:  rts16                                                                                                                  
           70:  .p2align 1                                                                                                             
           71: # %bb.1:                                                                                                                
           72:  .p2align 2, 0x0                                                                                                        
           73: .LCPI3_0:                                                                                                               
next:111'0              X error: no match found                                                                                        
           74:  .long __truncsfhf2                                                                                                     
next:111'0     ~~~~~~~~~~~~~~~~~~~~                                                                                                    
next:111'1      ?                   possible intended match                                                                            
           75: .Lfunc_end3:                                                                                                            
next:111'0     ~~~~~~~~~~~~~                                                                                                           
           76:  .size test_fptrunc_float, .Lfunc_end3-test_fptrunc_float                                                               
next:111'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~                                                              
           77:  # -- End function                                                                                                      
next:111'0     ~~~~~~~~~~~~~~~~~~~                                                                                                     
           78:  .globl test_fptrunc_double # -- Begin function test_fptrunc_double                                                     
next:111'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~                                                    
           79:  .p2align 1                                                                                                             
next:111'0     ~~~~~~~~~~~~                                                                                                            
            .                                                                                                                          
            .                                                                                                                          
            .                                                                                                                          
          125:  addi16 sp, sp, 20                                                                                                      
          126:  rts16                                                                                                                  
          127:  .p2align 1                                                                                                             
          128: # %bb.1:                                                                                                                
          129:  .p2align 2, 0x0                                                                                                        
          130: .LCPI5_0:                                                                                                               
next:204'0              X error: no match found                                                                                        
          131:  .long __extendhfsf2                                                                                                    
next:204'0     ~~~~~~~~~~~~~~~~~~~~~                                                                                                   
          132: .LCPI5_1:                                                                                                               
next:204'0     ~~~~~~~~~~                                                                                                              
          133:  .long __truncsfhf2                                                                                                     
next:204'0     ~~~~~~~~~~~~~~~~~~~~                                                                                                    
next:204'1      ?                   possible intended match                                                                            
          134: .Lfunc_end5:                                                                                                            
next:204'0     ~~~~~~~~~~~~~                                                                                                           
          135:  .size test_fadd, .Lfunc_end5-test_fadd                                                                                 
next:204'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~                                                                                
          136:  # -- End function                                                                                                      
next:204'0     ~~~~~~~~~~~~~~~~~~~                                                                                                     
          137:  .globl test_fmul # -- Begin function test_fmul                                                                         
next:204'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~                                                                        
          138:  .p2align 1                                                                                                             
next:204'0     ~~~~~~~~~~~~                                                                                                            
            .                                                                                                                          
            .                                                                                                                          
            .                                                                                                                          
          161:  addi16 sp, sp, 20                                                                                                      
          162:  rts16                                                                                                                  
          163:  .p2align 1                                                                                                             
          164: # %bb.1:                                                                                                                
          165:  .p2align 2, 0x0                                                                                                        
          166: .LCPI6_0:                                                                                                               
next:273'0              X error: no match found                                                                                        
          167:  .long __extendhfsf2                                                                                                    
next:273'0     ~~~~~~~~~~~~~~~~~~~~~                                                                                                   
          168: .LCPI6_1:                                                                                                               
next:273'0     ~~~~~~~~~~                                                                                                              
          169:  .long __truncsfhf2                                                                                                     
next:273'0     ~~~~~~~~~~~~~~~~~~~~                                                                                                    
next:273'1      ?                   possible intended match                                                                            
          170: .Lfunc_end6:                                                                                                            
next:273'0     ~~~~~~~~~~~~~                                                                                                           
          171:  .size test_fmul, .Lfunc_end6-test_fmul                                                                                 
next:273'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~                                                                                
          172:  # -- End function                                                                                                      
next:273'0     ~~~~~~~~~~~~~~~~~~~                                                                                                     
          173:  .section ".note.GNU-stack","",@progbits                                                                                
next:273'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~                                                                               
>>>>>>                                                                                                                                 
                                                                                                                                       
--                                                                                                                                     
                                                                                                                                       
********************                                                                                                                   
********************                                                                                                                   
Failed Tests (1):                                                                                                                      
  LLVM :: CodeGen/CSKY/fpu/fp16-promote.ll                                                                                             
                                                                                                                                       
                                                                                                                                       
Testing Time: 0.54s                                                                                                                    
                                                                                                                                       
Total Discovered Tests: 46                                                                                                             
  Passed: 45 (97.83%)                                                                                                                  
  Failed:  1 (2.17%)                                                                                                                   
FAILED: test/CMakeFiles/check-llvm-codegen-csky /home/mgorny/llvm-project/build/test/CMakeFiles/check-llvm-codegen-csky

efriedma-quic · 2025-02-24T21:57:52Z

CSKY is experimental; please don't ask non-CSKY developers to fix regression test failures.

mgorny · 2025-02-25T07:50:56Z

I figured it wouldn't have hurt to have asked, given that this change altered a fair number of targets already.

llvmbot added backend:ARM backend:AArch64 backend:Hexagon backend:RISC-V backend:PowerPC backend:WebAssembly backend:X86 backend:loongarch llvm:ir labels Feb 12, 2025

nikic requested review from arsenm, topperc and efriedma-quic February 12, 2025 10:09

nikic force-pushed the f16-conv-builtins branch from 087f019 to 4c582d1 Compare February 13, 2025 10:59

llvmbot added the llvm:globalisel label Feb 13, 2025

arsenm requested a review from jcranmer-intel February 13, 2025 12:55

nikic force-pushed the f16-conv-builtins branch from 4c582d1 to 756b302 Compare February 17, 2025 11:59

llvmbot added lld lld:wasm labels Feb 17, 2025

Use different intrinsic in wasm lld test

e0caefe

The issue this testing is now fixed for the half float intrinsics. Use a different one that still has the issue.

nikic force-pushed the f16-conv-builtins branch from 756b302 to e0caefe Compare February 17, 2025 12:05

efriedma-quic approved these changes Feb 18, 2025

View reviewed changes

nikic merged commit cc53913 into llvm:main Feb 19, 2025
8 checks passed

nikic deleted the f16-conv-builtins branch February 19, 2025 09:17

mgorny mentioned this pull request Feb 25, 2025

[LLVM] [CSKY] Test regression in CodeGen/CSKY/fpu/fp16-promote.ll #128652

Open

joaosaffran mentioned this pull request Feb 26, 2025

[libc++] Set feature-test macro __cpp_lib_atomic_float (#127559) joaosaffran/llvm-project#1

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CodeGen] Use extendhfsf2 and truncsfhf2 by default #126880

[CodeGen] Use extendhfsf2 and truncsfhf2 by default #126880

nikic commented Feb 12, 2025 •

edited

Loading

llvmbot commented Feb 12, 2025 •

edited

Loading

nikic commented Feb 12, 2025

nikic commented Feb 17, 2025

efriedma-quic left a comment

mgorny commented Feb 22, 2025

efriedma-quic commented Feb 24, 2025

mgorny commented Feb 25, 2025

[CodeGen] Use __extendhfsf2 and __truncsfhf2 by default #126880

[CodeGen] Use __extendhfsf2 and __truncsfhf2 by default #126880

Conversation

nikic commented Feb 12, 2025 • edited Loading

llvmbot commented Feb 12, 2025 • edited Loading

nikic commented Feb 12, 2025

nikic commented Feb 17, 2025

efriedma-quic left a comment

Choose a reason for hiding this comment

mgorny commented Feb 22, 2025

efriedma-quic commented Feb 24, 2025

mgorny commented Feb 25, 2025

[CodeGen] Use extendhfsf2 and truncsfhf2 by default #126880

[CodeGen] Use extendhfsf2 and truncsfhf2 by default #126880

nikic commented Feb 12, 2025 •

edited

Loading

llvmbot commented Feb 12, 2025 •

edited

Loading