Tracking Issue for RFC 2948: Portable SIMD #86656

calebzulawski · 2021-06-27T03:57:51Z

Feature gate: #![feature(portable_simd)]

This is a tracking issue for the future feature chartered in RFC 2977, with the intent of creating something akin to the design in RFC 2948 (rust-lang/rfcs#2948): a portable SIMD library (std::simd).

Portable SIMD project group: https://github.com/rust-lang/project-portable-simd
Implementation: https://github.com/rust-lang/portable-simd

More discussion can be found in the #project-portable-simd zulip stream.

Steps

Implement the experimental feature behind a feature gate:
- pub use core::simd; #89167
Define the semantics explicitly.
Write up an RFC detailing the design and defined semantics.
Adjust documentation (see instructions on rustc-dev-guide)
Stabilization PR (see instructions on rustc-dev-guide)

Unresolved Questions

What will the overall design be?
What are the ideal semantics for Masks?
Are there any limits or vector sizes we should not support?
How should these types interop with types like Saturating, NonZero, etc.?

Implementation History

The text was updated successfully, but these errors were encountered:

This enables programmers to use a safe alternative to the current `extern "platform-intrinsics"` API for writing portable SIMD code. This is `#![feature(portable_simd)]` as tracked in rust-lang#86656

…crum pub use core::simd; A portable abstraction over SIMD has been a major pursuit in recent years for several programming languages. In Rust, `std::arch` offers explicit SIMD acceleration via compiler intrinsics, but it does so at the cost of having to individually maintain each and every single such API, and is almost completely `unsafe` to use. `core::simd` offers safe abstractions that are resolved to the appropriate SIMD instructions by LLVM during compilation, including scalar instructions if that is all that is available. `core::simd` is enabled by the `#![portable_simd]` nightly feature tracked in rust-lang#86656 and is introduced here by pulling in the https://github.com/rust-lang/portable-simd repository as a subtree. We built the repository out-of-tree to allow faster compilation and a stochastic test suite backed by the proptest crate to verify that different targets, features, and optimizations produce the same result, so that using this library does not introduce any surprises. As these tests are technically non-deterministic, and thus can introduce overly interesting Heisenbugs if included in the rustc CI, they are visible in the commit history of the subtree but do nothing here. Some tests **are** introduced via the documentation, but these use deterministic asserts. There are multiple unsolved problems with the library at the current moment, including a want for better documentation, technical issues with LLVM scalarizing and lowering to libm, room for improvement for the APIs, and so far I have not added the necessary plumbing for allowing the more experimental or libm-dependent APIs to be used. However, I thought it would be prudent to open this for review in its current condition, as it is both usable and it is likely I am going to learn something else needs to be fixed when bors tries this out. The major types are - `core::simd::Simd<T, N>` - `core::simd::Mask<T, N>` There is also the `LaneCount` struct, which, together with the SimdElement and SupportedLaneCount traits, limit the implementation's maximum support to vectors we know will actually compile and provide supporting logic for bitmasks. I'm hoping to simplify at least some of these out of the way as the compiler and library evolve.

HannesGitH · 2023-02-21T15:07:52Z

Feature gate: #![feature(portable_simd)]

i'm sorry if this is the wrong place to ask but im rather new to rust and stumbled upon this issues as my compiler told me to

if i want to use this feature as soon as my compiler supports it can i gate it like:

#[cfg(feature = "portable_simd")]
use std::simd::Simd;

or is that only for feautures regarding my package (set in toml or passed to cargo?) if so what would be the appropriate way to use simd as soon as this issue is resolved?

Lokathor · 2023-02-21T15:23:10Z

The #![feature(portable_simd)] part goes at the top of a binary or library.

It's a language feature not a cargo feature so it works a little differently.

It's unfortunate that they're both just "feature". Rust is often too terse when it counts.

HannesGitH · 2023-02-21T17:00:03Z

ok thanks a lot!

just to make sure this means there is no (easy*) way to use this language feature if my compiler supports it and fall back to a custom implementation otherwise?

_{*easy as in compile time guards / attribute-like macros or creating a custom wrapper module that either provides rusts simd or my own fallback or something else in that level of skill}

for anyone else stumbling upon this:

language features are (unstable) features you can opt-in when using nightly rust (by putting the specified flag in your library root, the whole project will then be compiled with a compiler that uses this feature)

CarlKCarlK · 2023-11-22T15:15:23Z

@agausmann
To enable the experimental feature flag on nightly,
#![rustversion::attr(nightly, feature(portable_simd))]

@safinaskar

Unfortunately, this particular code doesn't work

This worked for me:

#![cfg_attr(feature = "from_slice", feature(portable_simd))]

where "from_slice" is the name of my the-other-kind-of-feature, defined in Cargo.toml, that uses portable_simd.

[features]
from_slice = []

So, I run tests, for example, via cargo test --features=from_slice.

GlenDC · 2023-12-07T18:04:07Z

Is this on the 2024 edition roadmap, or will it be only for after that? I know it’s not related, but gives me a timeline range.

calebzulawski · 2023-12-07T18:11:04Z

I don't think anyone has a specific timeline, but we still need to draft a new RFC and go through the approval process, which can take some time.

jhpratt · 2024-03-08T10:46:48Z

Is there a particular reason that Simd does not implement Deref and DerefMut? I don't see any reason the impls would restrict the ability to do anything.

Lokathor · 2024-03-08T15:34:32Z

Like deref into a slice? Usually that's not done because it's a huge performance footgun.

Firstyear · 2024-03-08T22:58:39Z

It may be good to document what that footgun is and why the choice was made because people will ask this again in future.

Lokathor · 2024-03-08T23:11:24Z

So, to add more detail: the problem is that (depending on SIMD used) you can't in general index to a particular lane of a SIMD register. So if you view the SIMD data as a slice and operate on an element of the slice, what the hardware must do is have the CPU stop the current SIMD processing, write the register to the stack, work on the stack value (however the slice is adjusted), and then load that back into a SIMD register. This is, in general, a performance disaster. As usual, the optimizer might be able to cut out this stall in the pipeline, in some cases, depending on circumstances, etc etc. But you should expect that the SIMD handling is totally stalled when trying to treat the data as a slice.

jhpratt · 2024-03-09T05:25:23Z

I figured there was a reason, but I'm not familiar with how SIMD works under the hood. Given that indexing is the problem, why implement Index and IndexMut then?

Lokathor · 2024-03-09T05:52:43Z

Oh, uh, well I haven't looked in a while! I guess I'm out of the loop on the current API details.

I'm surprised that Index is in if Deref is out. Either both should be in or both should be out, would be my expectation.

calebzulawski · 2024-03-09T06:03:20Z

The basic idea is that we want a clear marker of the boundary between SIMD and non-SIMD operations. When using Index (vector[i]) there is an obvious sign that you are no longer using SIMD operations. Likewise with arrays and slices, we implement AsRef and the to_array function because these are explicit. The concern with Deref is that the automatic inclusion of all slice functions makes it harder to tell which operations are SIMD. For example, you may expect is_ascii to be vectorized, but instead it is simply a scalar implementation inherited from slices.

Lokathor · 2024-03-09T06:08:09Z

vector[i] isn't particularly more obvious, I would say.

Maybe we should just always make people convert to an array to index elements?

calebzulawski · 2024-03-09T06:11:37Z

A while ago we didn't implement Index and we got requests for it, but this is the first time Deref has come up, so I think it's a good compromise. Maybe it's not particularly obvious that Index is the boundary, but Deref is completely invisible without consulting the docs.

ZagButNoZig · 2024-04-22T13:03:11Z

There are certain types of instructions where the output data type is different from the input data type like: _mm256_maddubs_epi16. I don't think there is a way to do that in portable simd without casting first which is slower? Are there any plans to support these instructions. Similar instructions also exist on arch: vdotq_s32

abysssol · 2024-04-25T19:07:40Z

Hi, I was wondering if there had been any discussion or consideration of making a dynamically sized api for vector operations. The current api seems to be analogous to arrays, but perhaps a more elegant and convenient solution would be analogous to slices.

I learned about this idea when researching risc-v's vector extension. Both this article and this one (fully rendered here) are good references on the motivation, from the perspective of an ISA.

While the current api is already much better than traditional simd instructions, it seems to me that the logical conclusion is a runtime sized type; maybe a wrapper around &mut [T], or a type like Vec<T>, or perhaps a modification to Vec<T> that guarantees simd optimization if T is a numeric primitive.

Hopefully this can spark a useful discussion on the best design of simd/vector types and operations. Thank you for your consideration.

Lokathor · 2024-04-25T19:38:06Z

That could be some additional API that lives along aside the fixed sized SIMD types, but for the main CPU arches a fixed sized simd type is what generally works best with optimizations.

as_simd: fix doc comment to be in line with align_to In rust-lang#121201, the guarantees about `align_offset` and `align_to` were changed. This PR aims to correct the doc comment of `as_simd` to be in line with the new `align_to`. Tagging rust-lang#86656 for good measure.

Rollup merge of rust-lang#127422 - greaka:master, r=workingjubilee as_simd: fix doc comment to be in line with align_to In rust-lang#121201, the guarantees about `align_offset` and `align_to` were changed. This PR aims to correct the doc comment of `as_simd` to be in line with the new `align_to`. Tagging rust-lang#86656 for good measure.

colejohnson66 · 2024-07-15T12:12:03Z

Curious how ARM SVE and RISC-V V are meant to be used in Rust. The fixed-length abstraction is a nice one, and it's what .NET is going with in .NET 9 (Vector<T> for SVE is 128-bit, at least for now), but variable-length vectors are here to stay.

dead-claudia · 2024-07-20T00:11:09Z

Curious how ARM SVE and RISC-V V are meant to be used in Rust. The fixed-length abstraction is a nice one, and it's what .NET is going with in .NET 9 (Vector<T> for SVE is 128-bit, at least for now), but variable-length vectors are here to stay.

RISC-V offers extensions like Zvl128b that provide hard guarantees on minimum vector size. It should be possible to leverage this in the interim while RISC-V figures out their P extension (which isn't very far along).

Edit: fix extension name

Salabar · 2024-08-04T11:52:51Z

Would it it make sense to add a family of functions like "load_base_*" that take a slice and an isize index? It would account for buffer underflow as well as overflow. With this you can write things such as convolution with nice clean loops that don't have account for edge cases.

for i in 0..image.len(){
   let mut result = 0.;

  for j in 1..kernel.radius() / N {
    let left = Simd::<N>::load_base_or(image, i - j * N, splat(image[0]);
   //... 
 }
  for j in 0..kernel.radius() / N {
    let right= Simd::<N>::load_base_or(image, i + j * N, splat(image.last());
   //... 
}
  image[i] = result;
}

DXist · 2024-08-11T18:50:42Z

Is it possible to move Mask inherent methods into a trait like SimdMask and add this trait as a bound to associated type Mask of other traits, e.g. SimdPartialEq?

This will help to write generic code that works for different primitive types.

Got this idea while writing a fixed index map data structure that is expected to work with unsigned integer keys regardless of the width.

Without the trait bound for Mask associated type I have to wrap my implementation into macros and explicitly apply it to u8, u16, u32, u64 and usize.

calebzulawski · 2024-08-12T01:02:43Z

I think you should probably be able to do what you want:

fn generic<T>(v: Simd<T, 4>, m: Mask<T::Mask, 4>) -> bool
where
    T: SimdElement + Default,
    Simd<T, 4>: SimdPartialEq<Mask = Mask<T::Mask, 4>>,
{
    (v.simd_eq(Simd::splat(Default::default())) ^ m).all()
}

However, it would be nice if there were an easier way to do this without requiring that extra bound.

DXist · 2024-08-12T12:49:12Z

@calebzulawski , thank you!

It worked along with a couple of bounds from num-traits crate.

Maybe an example with generic code will be a useful demo of bounds usage.

calebzulawski added C-tracking-issue Category: A tracking issue for an RFC or an unstable feature. T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. labels Jun 27, 2021

jonas-schievink added A-simd Area: SIMD (Single Instruction Multiple Data) PG-portable-simd Project group: Portable SIMD (https://github.com/rust-lang/project-portable-simd) labels Jun 27, 2021

workingjubilee mentioned this issue Sep 22, 2021

pub use core::simd; #89167

Merged

workingjubilee added the needs-rfc This change is large or controversial enough that it should have an RFC accepted before doing it. label Dec 1, 2021

Kerollmops mentioned this issue Jan 24, 2022

Will simd be introduced？ RoaringBitmap/roaring-rs#60

Closed

jorgecarleitao mentioned this issue Apr 2, 2022

packed_simd requires nightly apache/arrow-rs#54

Closed

akhilles mentioned this issue Feb 4, 2023

Add SIMD implementations if/when std::simd is stable mrhooray/crc-rs#83

Open

dlaehnemann mentioned this issue Oct 19, 2023

Coverage CI test failing due to compiler bug (being addressed upstream) rust-bio/rust-bio#543

Open

sinui0 mentioned this issue Nov 7, 2023

Improve Block with SIMD privacy-scaling-explorations/mpz#84

Open

Dylan-DPC mentioned this issue Mar 4, 2024

Tracking issues for unstable library features used by std #94971

Open

32 tasks

TheKindlerofWildfires mentioned this issue Jun 1, 2024

Integrate perf enhancers as they mature TheKindlerofWildfires/nonphysical#2

Open

Cryptex-github mentioned this issue Jun 23, 2024

SIMD implementations jay3332/ril#10

Draft

1 task

greaka mentioned this issue Jul 6, 2024

as_simd: fix doc comment to be in line with align_to #127422

Merged

ChristopherRabotin mentioned this issue Aug 26, 2024

[DRAFT] Durations are now zeptosecond counters (1e-21 second) nyx-space/hifitime#326

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracking Issue for RFC 2948: Portable SIMD #86656

Tracking Issue for RFC 2948: Portable SIMD #86656

calebzulawski commented Jun 27, 2021 •

edited by workingjubilee

Loading

HannesGitH commented Feb 21, 2023

Lokathor commented Feb 21, 2023

HannesGitH commented Feb 21, 2023 •

edited

Loading

CarlKCarlK commented Nov 22, 2023 •

edited

Loading

GlenDC commented Dec 7, 2023

calebzulawski commented Dec 7, 2023

jhpratt commented Mar 8, 2024

Lokathor commented Mar 8, 2024 •

edited

Loading

Firstyear commented Mar 8, 2024

Lokathor commented Mar 8, 2024

jhpratt commented Mar 9, 2024

Lokathor commented Mar 9, 2024

calebzulawski commented Mar 9, 2024 •

edited

Loading

Lokathor commented Mar 9, 2024

calebzulawski commented Mar 9, 2024

ZagButNoZig commented Apr 22, 2024 •

edited

Loading

abysssol commented Apr 25, 2024

Lokathor commented Apr 25, 2024

colejohnson66 commented Jul 15, 2024

dead-claudia commented Jul 20, 2024 •

edited

Loading

Salabar commented Aug 4, 2024 •

edited

Loading

DXist commented Aug 11, 2024

calebzulawski commented Aug 12, 2024

DXist commented Aug 12, 2024

Tracking Issue for RFC 2948: Portable SIMD #86656

Tracking Issue for RFC 2948: Portable SIMD #86656

Comments

calebzulawski commented Jun 27, 2021 • edited by workingjubilee Loading

Steps

Unresolved Questions

Implementation History

HannesGitH commented Feb 21, 2023

Lokathor commented Feb 21, 2023

HannesGitH commented Feb 21, 2023 • edited Loading

CarlKCarlK commented Nov 22, 2023 • edited Loading

GlenDC commented Dec 7, 2023

calebzulawski commented Dec 7, 2023

jhpratt commented Mar 8, 2024

Lokathor commented Mar 8, 2024 • edited Loading

Firstyear commented Mar 8, 2024

Lokathor commented Mar 8, 2024

jhpratt commented Mar 9, 2024

Lokathor commented Mar 9, 2024

calebzulawski commented Mar 9, 2024 • edited Loading

Lokathor commented Mar 9, 2024

calebzulawski commented Mar 9, 2024

ZagButNoZig commented Apr 22, 2024 • edited Loading

abysssol commented Apr 25, 2024

Lokathor commented Apr 25, 2024

colejohnson66 commented Jul 15, 2024

dead-claudia commented Jul 20, 2024 • edited Loading

Salabar commented Aug 4, 2024 • edited Loading

DXist commented Aug 11, 2024

calebzulawski commented Aug 12, 2024

DXist commented Aug 12, 2024

calebzulawski commented Jun 27, 2021 •

edited by workingjubilee

Loading

HannesGitH commented Feb 21, 2023 •

edited

Loading

CarlKCarlK commented Nov 22, 2023 •

edited

Loading

Lokathor commented Mar 8, 2024 •

edited

Loading

calebzulawski commented Mar 9, 2024 •

edited

Loading

ZagButNoZig commented Apr 22, 2024 •

edited

Loading

dead-claudia commented Jul 20, 2024 •

edited

Loading

Salabar commented Aug 4, 2024 •

edited

Loading