[X86] Use BWI instructions to improve lowering of v32i8 MULHU/S

Summary: If we have BWI instructions we can widen to v32i16 to do the multiply instead of splitting.

Reviewers: RKSimon, spatel, zvi

Reviewed By: zvi

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D38305

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@314432 91177308-0d34-0410-b5e6-96231b3b80d8
3 files changed