13ee34b33549b4115c9e2308bbbdd2a34de23503 - platform_external_llvm80

commit	13ee34b33549b4115c9e2308bbbdd2a34de23503
author	Craig Topper <craig.topper@intel.com>	Sun Jul 22 19:44:35 2018 +0000
committer	Craig Topper <craig.topper@intel.com>	Sun Jul 22 19:44:35 2018 +0000
tree	b3cd78dd0c03d57d2b3c1f2f60646260cd5e5d01
parent	a747d6134aada23da1df97e526005d0c1a8a4155

[X86] Remove the max vector width restriction from combineLoopMAddPattern and rely on splitOpsAndApply to handle splitting.

This seems to be a net improvement. There's still an issue under avx512f where we have a 512-bit vpaddd, but no 512-bit vpmaddwd, so we end up doing two 256-bit vpmaddwds and inserting the results into a 512-bit vector before a 512-bit vpaddd. It might be better to do two 512-bit vpaddds with zeros in the upper half. Same number of instructions, but breaks a dependency.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@337656 91177308-0d34-0410-b5e6-96231b3b80d8
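
As an illustration of the splitting described in the message, here is a minimal scalar sketch of the idea: an operation that is too wide for the target is cut into legal-width pieces, the narrow operation is applied to each piece, and the results are concatenated. This is not the LLVM implementation. The names `pmaddwd` and `splitAndApply` and the `maxLanes` parameter are hypothetical stand-ins chosen for this example, and vectors are modelled as plain `std::vector` values rather than SelectionDAG nodes.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Scalar model of (v)pmaddwd: multiply corresponding signed 16-bit lanes,
// then add each adjacent pair of products into one 32-bit lane.
// Hypothetical helper for illustration only.
std::vector<int32_t> pmaddwd(const std::vector<int16_t> &a,
                             const std::vector<int16_t> &b) {
  assert(a.size() == b.size() && a.size() % 2 == 0);
  std::vector<int32_t> out(a.size() / 2);
  for (std::size_t i = 0; i < out.size(); ++i) {
    int64_t sum = int64_t(a[2 * i]) * b[2 * i] +
                  int64_t(a[2 * i + 1]) * b[2 * i + 1];
    // Wrap to 32 bits, as the hardware instruction does.
    out[i] = static_cast<int32_t>(static_cast<uint32_t>(sum));
  }
  return out;
}

// Hypothetical model of the splitting step: cut both operands into chunks
// of at most maxLanes 16-bit lanes (maxLanes assumed even and nonzero),
// apply the narrow operation to each chunk, and concatenate the results.
std::vector<int32_t> splitAndApply(const std::vector<int16_t> &a,
                                   const std::vector<int16_t> &b,
                                   std::size_t maxLanes) {
  assert(maxLanes != 0 && maxLanes % 2 == 0);
  std::vector<int32_t> out;
  for (std::size_t off = 0; off < a.size(); off += maxLanes) {
    std::size_t end = std::min(off + maxLanes, a.size());
    std::vector<int16_t> subA(a.begin() + off, a.begin() + end);
    std::vector<int16_t> subB(b.begin() + off, b.begin() + end);
    std::vector<int32_t> part = pmaddwd(subA, subB);
    out.insert(out.end(), part.begin(), part.end());
  }
  return out;
}
```

Because each 32-bit result depends only on one adjacent pair of 16-bit lanes, splitting at an even lane boundary gives the same result as a single wide operation. With 32 input lanes and `maxLanes` of 16, the sketch mirrors the avx512f case above: two 256-bit multiply-adds whose results are joined into one 512-bit vector.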

lib/Target/X86/X86ISelLowering.cpp
test/CodeGen/X86/madd.ll
test/CodeGen/X86/required-vector-width.ll

3 files changed
