About

Enablement of auto-vectorization in LLVM for RISC-V, targeting the V extension version 1.0. While the long term goal is to focus on vector length agnostic (VLA) approaches to vectorization, some of LLVM's vectorizer may still be biased towards fixed vector sizes. Thus we expect to find cases that are not well handled using VLA approaches and we expect to support VLS approaches to vectorization a stop-gap alternatives.

LLVM's support for auto-vectorization on RISC-V appears to be improving regularly, but it is sensitive to having reasonable micro-architectural data available. Thus it may be necessary to stub-out values for these key parameters when enabling auto-vectorization on a new micro-architecture, or to disable the costing model.

Stakeholders/Partners

RISE:

Ventana: 1 FTE focused on getting necessary uarch data ready

Ventana: 1 FTE Reference/target implementation of key x264 loops, breakdown of tasks that need to be solved to achieve desired code generation

SiFive: Craig Topper, Alexey Bataev

Rivos:

External:

Alex Bradbury

Dependencies

The most pressing upstream dependencies are:

PSABI specification for vector argument passing and return values
Kernel support to enable discovery of the V extension
glibc support for libmvec to enable vector API for key math library functions such as sin, cos, sqrt, etc (does LLVM support libmvec calls?)

Status

Development	IN PROGRESS
Development Timeline	NA
Upstreaming	IN PROGRESS
Upstream Version	Development Trunk
Contacts	Jeff Law (Ventana) Alexey Bataev (SiFive)
Dependencies	PSABI vector spec Kernel discovery glibc libmvec

Updates

12 Feb 2025

Moved to 1H2025
EVL vectorization with tail folding showing good gains on 525.x264_r in spec2017 on Banana Pi F3.
EVL-based vectorizer is currently stable. Lacks support for multi-exit loops and first-order recurrences. SLP vectorizer support segmented loads/stores, strided loads and partially strided stores (-1 stride). Expand/compress for SLP still WIP.

11 Jul 2024

Moved to 2H2024
Patch for using VP intrinsics for unary and binary operators https://github.com/llvm/llvm-project/pull/93854

11 Apr 2024

First VP intrinsic vectorizer patch merged https://github.com/llvm/llvm-project/pull/76172. First step to vectorizing using strip mined loops like examples in the vector spec.

30 Aug 2023

Improving vectorization split off as distinct project

Home

CT_01_008 - Autovectorization -- Improvements (LLVM)