pub unsafe fn _mm256_mask_permutexvar_ps(
    src: __m256,
    k: __mmask8,
    idx: __m256i,
    a: __m256
) -> __m256
🔬 This is a nightly-only experimental API. (stdsimd #48556)
Available on x86-64 and target feature avx512f,avx512vl only.
Expand description

Shuffle single-precision (32-bit) floating-point elements in a across lanes using the corresponding index in idx, and store the results in dst using writemask k (elements are copied from src when the corresponding mask bit is not set).

Intel’s documentation