129e49689153433fe3471fc3f196b5bbc760a20d - platform_external_llvm

commit	129e49689153433fe3471fc3f196b5bbc760a20d	[log] [tgz]
author	Stanislav Mekhanoshin <Stanislav.Mekhanoshin@amd.com>	Mon Nov 12 18:48:17 2018 +0000
committer	Stanislav Mekhanoshin <Stanislav.Mekhanoshin@amd.com>	Mon Nov 12 18:48:17 2018 +0000
tree	37095fe116b980d0a73b97c15ebf8b3529f3ca20
parent	7ffaeb4030081f5c25bae77b1b43818195259e7e [diff]

[AMDGPU] Optimize S_CBRANCH_VCC[N]Z -> S_CBRANCH_EXEC[N]Z

Sometimes after basic block placement we end up with a code like:

  sreg = s_mov_b64 -1
  vcc = s_and_b64 exec, sreg
  s_cbranch_vccz

This happens as a join of a block assigning -1 to a saved mask and
another block which consumes that saved mask with s_and_b64 and a
branch.

This is essentially a single s_cbranch_execz instruction when moved
into a single new basic block.

Differential Revision: https://reviews.llvm.org/D54164

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@346690 91177308-0d34-0410-b5e6-96231b3b80d8

lib/Target/AMDGPU/SIInsertSkips.cpp[diff]
test/CodeGen/AMDGPU/branch-relaxation.ll[diff]
test/CodeGen/AMDGPU/infinite-loop.ll[diff]
test/CodeGen/AMDGPU/insert-skip-from-vcc.mir[Added - diff]

4 files changed

tree: 37095fe116b980d0a73b97c15ebf8b3529f3ca20