This patch has been committed to the master branch: 6d0b7b69d14 — Emit cvtne2ps2bf16 for odd increasing perm in __builtin_shufflevector This patch teaches GCC’s x86 backend to recognize a specific BF16 vector shuffle pattern — selecting every odd element from two concatenated vectors — and lower it to the vcvtne2ps2bf16 instruction instead of a general-purpose byte… Read more x86: Emit vcvtne2ps2bf16 for odd-element BF16 shuffle