3a7e7531706ed43543b1fafe419a95853f36b899 - platform_external_llvm80

commit	3a7e7531706ed43543b1fafe419a95853f36b899
author	Sanjay Patel <spatel@rotateright.com>	Mon Feb 29 23:16:48 2016 +0000
committer	Sanjay Patel <spatel@rotateright.com>	Mon Feb 29 23:16:48 2016 +0000
tree	7e36efe181e841f2e478aa9414c5a6a3ab8dbe93
parent	87e4278b8cd22aeb95e206c940a1b675d712b65f

[x86, InstCombine] transform x86 AVX masked loads to LLVM intrinsics

The intended effect of this patch in conjunction with:
http://reviews.llvm.org/rL259392
http://reviews.llvm.org/rL260145

is that customers using the AVX intrinsics in C will benefit from combines when
the load mask is constant:

__m128 mload_zeros(float *f) {
  return _mm_maskload_ps(f, _mm_set1_epi32(0));
}

__m128 mload_fakeones(float *f) {
  return _mm_maskload_ps(f, _mm_set1_epi32(1));
}

__m128 mload_ones(float *f) {
  return _mm_maskload_ps(f, _mm_set1_epi32(0x80000000));
}

__m128 mload_oneset(float *f) {
  return _mm_maskload_ps(f, _mm_set_epi32(0x80000000, 0, 0, 0));
}

...so none of the above will actually generate a masked load for optimized code.
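
As a rough illustration of why a constant mask makes these cases foldable, here
is a scalar reference model of the 4 x float masked load. It is a sketch, not
code from this patch: the name maskload_ps_model and the array-based signature
are hypothetical. It assumes the documented behavior of the instruction: only
the most significant bit of each 32-bit mask element enables a lane, and
disabled lanes read as zero.

```c
#include <stdint.h>

/* Hypothetical scalar model of _mm_maskload_ps (not part of this patch).
 * mask[] and out[] are in memory order: index 0 is the lowest lane. */
static void maskload_ps_model(const float *src, const int32_t mask[4],
                              float out[4]) {
  for (int i = 0; i < 4; ++i) {
    /* Only bit 31 (the sign bit) of each mask element selects the lane. */
    int lane_enabled = (int)(((uint32_t)mask[i] >> 31) & 1u);
    /* Disabled lanes are zeroed and the source element is never read. */
    out[i] = lane_enabled ? src[i] : 0.0f;
  }
}
```

Under this model, mload_zeros and mload_fakeones enable no lanes (the value 1
has a clear sign bit), mload_ones enables every lane, and mload_oneset enables
only the highest lane, because _mm_set_epi32 takes its arguments from the
highest element down to the lowest.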

This is the masked load counterpart to:
http://reviews.llvm.org/rL262064



git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@262269 91177308-0d34-0410-b5e6-96231b3b80d8

2 files changed
