add neon instruction vmaxnm_f* vpmaxnm_f* vminnm_f* vpminnm_f* #1105

surechen · 2021-04-02T04:07:15Z

vmaxnm_f* : Floating-point Maximum Number (vector). This instruction compares corresponding vector elements in the two source SIMD&FP registers, writes the larger of the two floating-point values into a vector, and writes the vector to the destination SIMD&FP register.
vpmaxnm_f* : Floating-point Maximum Number Pairwise (vector). This instruction creates a vector by concatenating the vector elements of the first source SIMD&FP register after the vector elements of the second source SIMD&FP register, reads each pair of adjacent vector elements in the two source SIMD&FP registers, writes the largest of each pair of values into a vector, and writes the vector to the destination SIMD&FP register. All the values in this instruction are floating-point values.
vminnm_f* : Floating-point Minimum Number (vector). This instruction compares corresponding vector elements in the two source SIMD&FP registers, writes the smaller of the two floating-point values into a vector, and writes the vector to the destination SIMD&FP register.
vpminnm_f* : Floating-point Minimum Number Pairwise (vector). This instruction creates a vector by concatenating the vector elements of the first source SIMD&FP register after the vector elements of the second source SIMD&FP register, reads each pair of adjacent vector elements in the two source SIMD&FP registers, writes the smallest of each pair of floating-point values into a vector, and writes the vector to the destination SIMD&FP register. All the values in this instruction are floating-point values.

Amanieu · 2021-04-02T05:10:07Z

vmaxnm_f32/vmaxnmq_f32 (and the min equivalents) are also available on ARM.

surechen · 2021-04-02T06:32:46Z

vmaxnm_f32/vmaxnmq_f32 (and the min equivalents) are also available on ARM.

Hi, thanks for reviewing. I check these instructions in https://godbolt.org/ and get Compilation failed.

https://godbolt.org/z/1nM7eYvrz
https://godbolt.org/z/nMo63G89W

Amanieu · 2021-04-02T06:33:45Z

These instruction were added in ARMv8 for both 32-bit and 64-bit mode. If you change armv7 to armv8 in godbolt then it will be accepted.

surechen · 2021-04-02T06:52:45Z

These instruction were added in ARMv8 for both 32-bit and 64-bit mode. If you change armv7 to armv8 in godbolt then it will be accepted.

Hi, Thank you for your guidance.
If I remove line #[cfg_attr(target_arch = "arm", target_feature(enable = "v7"))] . Is the following code right？

#[inline]
#[target_feature(enable = "neon")]
#[cfg_attr(all(test, target_arch = "arm"), assert_instr("vmaxnm"))]
#[cfg_attr(all(test, target_arch = "aarch64"), assert_instr(fmaxnm))]
pub unsafe fn vmaxnm_f32(a: float32x2_t, b: float32x2_t) -> float32x2_t {
#[allow(improper_ctypes)]
extern "C" {
#[cfg_attr(target_arch = "arm", link_name = "llvm.arm.neon.vmaxnm.v2f32")]
#[cfg_attr(target_arch = "aarch64", link_name = "llvm.aarch64.neon.fmaxnm.v2f32")]
fn vmaxnm_f32_(a: float32x2_t, b: float32x2_t) -> float32x2_t;
}
vmaxnm_f32_(a, b)
}

Amanieu · 2021-04-02T07:01:51Z

I think you need to enable the v8 target feature for this instruction.

surechen · 2021-04-02T07:04:13Z

I think you need to enable the v8 target feature for this instruction.

Ok, Thank you very much.

bors · 2021-04-02T21:03:17Z

☔ The latest upstream changes (presumably 15babf5) made this pull request unmergeable. Please resolve the merge conflicts.

Amanieu · 2021-04-02T21:53:04Z

I looked into the compiler crash. You need to enable the fp-armv8 feature. But first you need to update rustc to expose this feature on ARM. This can be done by adding it to compiler/rustc_codegen_ssa/src/target_features.rs.

surechen · 2021-04-03T01:44:54Z

I looked into the compiler crash. You need to enable the fp-armv8 feature. But first you need to update rustc to expose this feature on ARM. This can be done by adding it to compiler/rustc_codegen_ssa/src/target_features.rs.

Hi, Thank you very much. I'll try

…ochenkov add fp-armv8 for ARM_ALLOWED_FEATURES For fixing err in rust-lang/stdarch#1105.

…d_vpmaxnm # Conflicts: # crates/stdarch-gen/src/main.rs

Amanieu · 2021-04-05T12:48:57Z

You need both v8 and fp-armv8.

add neon instruction vmaxnm_f* vpmaxnm_f* vminnm_f* vpminnm_f*

c7a099c

edit for v8

73c5a58

surechen mentioned this pull request Apr 3, 2021

add fp-armv8 for ARM_ALLOWED_FEATURES rust-lang/rust#83803

Merged

JohnTitor added a commit to JohnTitor/rust that referenced this pull request Apr 3, 2021

Rollup merge of rust-lang#83803 - surechen:add_target_feature, r=petr…

d0266e3

…ochenkov add fp-armv8 for ARM_ALLOWED_FEATURES For fixing err in rust-lang/stdarch#1105.

Merge branch 'master' of https://github.com/rust-lang/stdarch into ad…

cb5a99d

…d_vpmaxnm # Conflicts: # crates/stdarch-gen/src/main.rs

edit target

5428d5d

Amanieu merged commit daae8f8 into rust-lang:master Apr 6, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add neon instruction vmaxnm_f* vpmaxnm_f* vminnm_f* vpminnm_f* #1105

add neon instruction vmaxnm_f* vpmaxnm_f* vminnm_f* vpminnm_f* #1105

surechen commented Apr 2, 2021

Amanieu commented Apr 2, 2021

surechen commented Apr 2, 2021 •

edited

Loading

Amanieu commented Apr 2, 2021

surechen commented Apr 2, 2021 •

edited

Loading

Amanieu commented Apr 2, 2021

surechen commented Apr 2, 2021

bors commented Apr 2, 2021

Amanieu commented Apr 2, 2021

surechen commented Apr 3, 2021

Amanieu commented Apr 5, 2021

add neon instruction vmaxnm_f* vpmaxnm_f* vminnm_f* vpminnm_f* #1105

add neon instruction vmaxnm_f* vpmaxnm_f* vminnm_f* vpminnm_f* #1105

Conversation

surechen commented Apr 2, 2021

Amanieu commented Apr 2, 2021

surechen commented Apr 2, 2021 • edited Loading

Amanieu commented Apr 2, 2021

surechen commented Apr 2, 2021 • edited Loading

Amanieu commented Apr 2, 2021

surechen commented Apr 2, 2021

bors commented Apr 2, 2021

Amanieu commented Apr 2, 2021

surechen commented Apr 3, 2021

Amanieu commented Apr 5, 2021

surechen commented Apr 2, 2021 •

edited

Loading

surechen commented Apr 2, 2021 •

edited

Loading