...
Beyond fixing the regression, vector is not expected to provide much benefit for CactuBSSN. Testing on the k1 chip shows that 1.2% improvement in dynamic instruction counts using vector, but a 1% performance regression. As noted in the cam4 work item, we believe this is due to weaknesses in the k1 vector unit design.
Stakeholders/Partners
RISE:
...
Page Properties | ||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
Updates
- Add data from run on the k1 (BPI-F3 board)
- Regression has been fixed on the GCC trunk
- Competitive data shows no significant improvement with vector
- The 1% reduction in instruction counts we see with vector enabled is not likely to move the needle in any significant way performane-wise
- Considering this done/closed.
...