fb66864554f3d4b1696afa2e15b6ef4ab0b2e794 - platform_external_llvm

commit	fb66864554f3d4b1696afa2e15b6ef4ab0b2e794	[log] [tgz]
author	Andrea Di Biagio <Andrea_DiBiagio@sn.scee.net>	Wed May 16 12:33:09 2018 +0000
committer	Andrea Di Biagio <Andrea_DiBiagio@sn.scee.net>	Wed May 16 12:33:09 2018 +0000
tree	d07bcfd668bd0bf51a9db8d32d07ae2b57110ab1
parent	f392447b194cc62e6dc1ad6c60b9a0a76aa10409 [diff]

[llvm-mca] Fix perf regression after r332390.

Revision 332390 introduced a FetchStage class in llvm-mca.
By design, FetchStage owns all the instructions in-flight in the OoO Backend.

Before this change, new instructions were added to a DenseMap indexed by
instruction id. The problem with using a DenseMap is that elements are not
ordered by key. This was causing a massive slow down in method
FetchStage::postExecute(), which searches for instructions retired that can be
deleted.

This patch replaces the DenseMap with a std::map ordered by instruction index.
At the end of every cycle, we search for the first instruction which is not
marked as "retired", and we remove all the previous instructions before it.
This works well because instructions are retired in-order.

Before this patch, a debug build of llvm-mca (on my Ryzen linux machine) took
~8.0 seconds to simulate 3000 iterations of a x86 dot-product (a `vmulps,
vpermilps, vaddps, vpermilps, vaddps` sequence). With this patch, it now takes
~0.8s to run all the 3000 iterations.


git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@332461 91177308-0d34-0410-b5e6-96231b3b80d8

2 files changed

tree: d07bcfd668bd0bf51a9db8d32d07ae2b57110ab1