Open
Description
I see patterns such as this after reduction:
%i3.i = tail call <3 x float> @llvm.fmuladd.v3f32(<3 x float> %extractVec278.i.i, <3 x float> zeroinitializer, <3 x float> %extractVec3.i.i.i)
InstCombine will fold this to fadd
, but only with nsz
. I'm not sure the best way to force this simplification. We could either start introducing fast math flags, or repeat the same kind of folds in the reducer.