/
CT_00_049 -- New instruction fusions
CT_00_049 -- New instruction fusions
About
Instruction fusion is an important technique to improve performance of RISC-V systems. While GCC has support for a common set of instruction fusions, it's believed that additional cases will become more important as new designs come to market.
- Store pair fusions. Right now GCC only supports fusion of two 64 bit stores to aligned addresses. Newer designs are expected to be able to fuse pairs of same sized stores fairly aggressively.
- Address calculations with memory references. Right now GCC supports a limited set fusions of address arithmetic with a memory reference. Newer designs are expected to be able to fuse nearly every add/shadd with a subsequent memory reference.
- Zero extended bitfield extractions. Right now GCC supports a small subset of bitfield extractions when implemented via shifts. The shift counts are fairly restrictive.
- Newer designs are expected to fuse these operations very aggressively.
- The compiler should rewrite a right shift + masking off upper bits as a left shift + right logical shift to facilitate fusion.
- Left shift + add (shift count > 3) are expected to be fusible in the near future. As are shifts followed by other non-shift ALU ops, including 'w' variants.
- Many bugs exist in the current fusion code. Those all need to be fixed as they mark instructions as fusible when they are not which in turn restricts scheduling
- Addition of instrumentation to fusion code so we know which ones are triggering allowing us to evaluate.....
- Synopsys engineers have changes which change GCC's model from "these are fusible, keep them consecutive" to "find fusible cases, bring them together and keep them together". Maybe as much as a 10% increase in real fusion opportunities.
Ventana has code to support these cases using the general infrastructure already in the RISC-V GCC backend. This code just needs to be benchmarked and upstreamed once gcc-16 is open for development.
Stakeholders/Partners
RISE:
Ventana: Jeff Law. Oversight/guidance
Ventana: Daniel Barboza: Implementation
External:
Samsung: Artemiy Volkov
Dependencies
Status
Updates
- Upstreaming efforts broken out as new 2025 project
, multiple selections available,
Related content
CT_00_033 -- New instruction fusions
CT_00_033 -- New instruction fusions
More like this
CT_00_040 - Improve Instruction Length Computations (GCC)
CT_00_040 - Improve Instruction Length Computations (GCC)
More like this
CT_01_006 - Stack Clash Support (LLVM)
CT_01_006 - Stack Clash Support (LLVM)
More like this
CT_00_039 - Extension Elimination (GCC)
CT_00_039 - Extension Elimination (GCC)
More like this
2024-2H - Compilers and Toolchains Priorities
2024-2H - Compilers and Toolchains Priorities
Read with this
2025-1H - Compilers and Toolchains Priorities
2025-1H - Compilers and Toolchains Priorities
Read with this