blob: d63ff5a84394808d50299b8b2785121b9e18bbc2 [file] [log] [blame]
Sanjoy Das1991e2a2015-06-15 18:44:08 +00001==============================
2FaultMaps and implicit checks
3==============================
4
5.. contents::
6 :local:
7 :depth: 2
8
9Motivation
10==========
11
12Code generated by managed language runtimes tend to have checks that
13are required for safety but never fail in practice. In such cases, it
14is profitable to make the non-failing case cheaper even if it makes
15the failing case significantly more expensive. This asymmetry can be
16exploited by folding such safety checks into operations that can be
17made to fault reliably if the check would have failed, and recovering
18from such a fault by using a signal handler.
19
20For example, Java requires null checks on objects before they are read
21from or written to. If the object is ``null`` then a
22``NullPointerException`` has to be thrown, interrupting normal
23execution. In practice, however, dereferencing a ``null`` pointer is
24extremely rare in well-behaved Java programs, and typically the null
25check can be folded into a nearby memory operation that operates on
26the same memory location.
27
28The Fault Map Section
29=====================
30
31Information about implicit checks generated by LLVM are put in a
32special "fault map" section. On Darwin this section is named
33``__llvm_faultmaps``.
34
35The format of this section is
36
37.. code-block:: none
38
39 Header {
40 uint8 : Fault Map Version (current version is 1)
41 uint8 : Reserved (expected to be 0)
42 uint16 : Reserved (expected to be 0)
43 }
44 uint32 : NumFunctions
45 FunctionInfo[NumFunctions] {
46 uint64 : FunctionAddress
47 uint32 : NumFaultingPCs
48 uint32 : Reserved (expected to be 0)
49 FunctionFaultInfo[NumFaultingPCs] {
Sanjoy Dasa0be0b12017-02-07 19:19:49 +000050 uint32 : FaultKind
Sanjoy Das1991e2a2015-06-15 18:44:08 +000051 uint32 : FaultingPCOffset
Sanjoy Das5a94e2a2015-06-22 18:02:55 +000052 uint32 : HandlerPCOffset
Sanjoy Das1991e2a2015-06-15 18:44:08 +000053 }
54 }
Sanjoy Dasb3709592015-06-29 22:00:30 +000055
Sanjoy Das9ac774d2017-02-07 20:36:03 +000056FailtKind describes the reason of expected fault. Currently three kind
57of faults are supported:
58
59 1. ``FaultMaps::FaultingLoad`` - fault due to load from memory.
60 2. ``FaultMaps::FaultingLoadStore`` - fault due to instruction load and store.
61 3. ``FaultMaps::FaultingStore`` - fault due to store to memory.
Sanjoy Dasb3709592015-06-29 22:00:30 +000062
63The ``ImplicitNullChecks`` pass
64===============================
65
66The ``ImplicitNullChecks`` pass transforms explicit control flow for
67checking if a pointer is ``null``, like:
68
69.. code-block:: llvm
70
71 %ptr = call i32* @get_ptr()
72 %ptr_is_null = icmp i32* %ptr, null
Sanjoy Dase1e95c12015-06-30 21:22:32 +000073 br i1 %ptr_is_null, label %is_null, label %not_null, !make.implicit !0
Sanjoy Dasb3709592015-06-29 22:00:30 +000074
75 not_null:
76 %t = load i32, i32* %ptr
77 br label %do_something_with_t
78
79 is_null:
80 call void @HFC()
81 unreachable
Sanjoy Dase1e95c12015-06-30 21:22:32 +000082
83 !0 = !{}
Sanjoy Dasb3709592015-06-29 22:00:30 +000084
85to control flow implicit in the instruction loading or storing through
86the pointer being null checked:
87
88.. code-block:: llvm
89
90 %ptr = call i32* @get_ptr()
91 %t = load i32, i32* %ptr ;; handler-pc = label %is_null
92 br label %do_something_with_t
93
94 is_null:
95 call void @HFC()
96 unreachable
97
98This transform happens at the ``MachineInstr`` level, not the LLVM IR
99level (so the above example is only representative, not literal). The
100``ImplicitNullChecks`` pass runs during codegen, if
101``-enable-implicit-null-checks`` is passed to ``llc``.
102
103The ``ImplicitNullChecks`` pass adds entries to the
104``__llvm_faultmaps`` section described above as needed.
Sanjoy Dase1e95c12015-06-30 21:22:32 +0000105
106``make.implicit`` metadata
107--------------------------
108
109Making null checks implicit is an aggressive optimization, and it can
110be a net performance pessimization if too many memory operations end
111up faulting because of it. A language runtime typically needs to
112ensure that only a negligible number of implicit null checks actually
113fault once the application has reached a steady state. A standard way
114of doing this is by healing failed implicit null checks into explicit
115null checks via code patching or recompilation. It follows that there
116are two requirements an explicit null check needs to satisfy for it to
117be profitable to convert it to an implicit null check:
118
119 1. The case where the pointer is actually null (i.e. the "failing"
120 case) is extremely rare.
121
122 2. The failing path heals the implicit null check into an explicit
123 null check so that the application does not repeatedly page
124 fault.
125
126The frontend is expected to mark branches that satisfy (1) and (2)
127using a ``!make.implicit`` metadata node (the actual content of the
128metadata node is ignored). Only branches that are marked with
129``!make.implicit`` metadata are considered as candidates for
130conversion into implicit null checks.
131
132(Note that while we could deal with (1) using profiling data, dealing
133with (2) requires some information not present in branch profiles.)