Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 1 | ========================== |
| 2 | Exception Handling in LLVM |
| 3 | ========================== |
| 4 | |
| 5 | .. contents:: |
| 6 | :local: |
| 7 | |
| 8 | Introduction |
| 9 | ============ |
| 10 | |
| 11 | This document is the central repository for all information pertaining to |
| 12 | exception handling in LLVM. It describes the format that LLVM exception |
| 13 | handling information takes, which is useful for those interested in creating |
| 14 | front-ends or dealing directly with the information. Further, this document |
| 15 | provides specific examples of what exception handling information is used for in |
| 16 | C and C++. |
| 17 | |
| 18 | Itanium ABI Zero-cost Exception Handling |
| 19 | ---------------------------------------- |
| 20 | |
| 21 | Exception handling for most programming languages is designed to recover from |
| 22 | conditions that rarely occur during general use of an application. To that end, |
| 23 | exception handling should not interfere with the main flow of an application's |
| 24 | algorithm by performing checkpointing tasks, such as saving the current pc or |
| 25 | register state. |
| 26 | |
| 27 | The Itanium ABI Exception Handling Specification defines a methodology for |
| 28 | providing outlying data in the form of exception tables without inlining |
| 29 | speculative exception handling code in the flow of an application's main |
| 30 | algorithm. Thus, the specification is said to add "zero-cost" to the normal |
| 31 | execution of an application. |
| 32 | |
| 33 | A more complete description of the Itanium ABI exception handling runtime |
| 34 | support of can be found at `Itanium C++ ABI: Exception Handling |
Vlad Tsyrklevich | 30b4738 | 2017-09-12 00:19:11 +0000 | [diff] [blame] | 35 | <http://itanium-cxx-abi.github.io/cxx-abi/abi-eh.html>`_. A description of the |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 36 | exception frame format can be found at `Exception Frames |
Tim Northover | 7aa3080 | 2013-01-12 12:38:54 +0000 | [diff] [blame] | 37 | <http://refspecs.linuxfoundation.org/LSB_3.0.0/LSB-Core-generic/LSB-Core-generic/ehframechpt.html>`_, |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 38 | with details of the DWARF 4 specification at `DWARF 4 Standard |
| 39 | <http://dwarfstd.org/Dwarf4Std.php>`_. A description for the C++ exception |
| 40 | table formats can be found at `Exception Handling Tables |
Vlad Tsyrklevich | 30b4738 | 2017-09-12 00:19:11 +0000 | [diff] [blame] | 41 | <http://itanium-cxx-abi.github.io/cxx-abi/exceptions.pdf>`_. |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 42 | |
| 43 | Setjmp/Longjmp Exception Handling |
| 44 | --------------------------------- |
| 45 | |
| 46 | Setjmp/Longjmp (SJLJ) based exception handling uses LLVM intrinsics |
| 47 | `llvm.eh.sjlj.setjmp`_ and `llvm.eh.sjlj.longjmp`_ to handle control flow for |
| 48 | exception handling. |
| 49 | |
| 50 | For each function which does exception processing --- be it ``try``/``catch`` |
| 51 | blocks or cleanups --- that function registers itself on a global frame |
| 52 | list. When exceptions are unwinding, the runtime uses this list to identify |
| 53 | which functions need processing. |
| 54 | |
| 55 | Landing pad selection is encoded in the call site entry of the function |
| 56 | context. The runtime returns to the function via `llvm.eh.sjlj.longjmp`_, where |
| 57 | a switch table transfers control to the appropriate landing pad based on the |
| 58 | index stored in the function context. |
| 59 | |
| 60 | In contrast to DWARF exception handling, which encodes exception regions and |
| 61 | frame information in out-of-line tables, SJLJ exception handling builds and |
| 62 | removes the unwind frame context at runtime. This results in faster exception |
| 63 | handling at the expense of slower execution when no exceptions are thrown. As |
| 64 | exceptions are, by their nature, intended for uncommon code paths, DWARF |
| 65 | exception handling is generally preferred to SJLJ. |
| 66 | |
Andrew Kaylor | 7741851 | 2015-02-10 19:52:43 +0000 | [diff] [blame] | 67 | Windows Runtime Exception Handling |
| 68 | ----------------------------------- |
| 69 | |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 70 | LLVM supports handling exceptions produced by the Windows runtime, but it |
| 71 | requires a very different intermediate representation. It is not based on the |
| 72 | ":ref:`landingpad <i_landingpad>`" instruction like the other two models, and is |
| 73 | described later in this document under :ref:`wineh`. |
Andrew Kaylor | 7741851 | 2015-02-10 19:52:43 +0000 | [diff] [blame] | 74 | |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 75 | Overview |
| 76 | -------- |
| 77 | |
| 78 | When an exception is thrown in LLVM code, the runtime does its best to find a |
| 79 | handler suited to processing the circumstance. |
| 80 | |
| 81 | The runtime first attempts to find an *exception frame* corresponding to the |
| 82 | function where the exception was thrown. If the programming language supports |
| 83 | exception handling (e.g. C++), the exception frame contains a reference to an |
| 84 | exception table describing how to process the exception. If the language does |
| 85 | not support exception handling (e.g. C), or if the exception needs to be |
| 86 | forwarded to a prior activation, the exception frame contains information about |
| 87 | how to unwind the current activation and restore the state of the prior |
| 88 | activation. This process is repeated until the exception is handled. If the |
| 89 | exception is not handled and no activations remain, then the application is |
| 90 | terminated with an appropriate error message. |
| 91 | |
| 92 | Because different programming languages have different behaviors when handling |
| 93 | exceptions, the exception handling ABI provides a mechanism for |
| 94 | supplying *personalities*. An exception handling personality is defined by |
| 95 | way of a *personality function* (e.g. ``__gxx_personality_v0`` in C++), |
| 96 | which receives the context of the exception, an *exception structure* |
| 97 | containing the exception object type and value, and a reference to the exception |
| 98 | table for the current function. The personality function for the current |
| 99 | compile unit is specified in a *common exception frame*. |
| 100 | |
| 101 | The organization of an exception table is language dependent. For C++, an |
| 102 | exception table is organized as a series of code ranges defining what to do if |
| 103 | an exception occurs in that range. Typically, the information associated with a |
| 104 | range defines which types of exception objects (using C++ *type info*) that are |
| 105 | handled in that range, and an associated action that should take place. Actions |
| 106 | typically pass control to a *landing pad*. |
| 107 | |
| 108 | A landing pad corresponds roughly to the code found in the ``catch`` portion of |
| 109 | a ``try``/``catch`` sequence. When execution resumes at a landing pad, it |
| 110 | receives an *exception structure* and a *selector value* corresponding to the |
| 111 | *type* of exception thrown. The selector is then used to determine which *catch* |
| 112 | should actually process the exception. |
| 113 | |
| 114 | LLVM Code Generation |
| 115 | ==================== |
| 116 | |
| 117 | From a C++ developer's perspective, exceptions are defined in terms of the |
| 118 | ``throw`` and ``try``/``catch`` statements. In this section we will describe the |
| 119 | implementation of LLVM exception handling in terms of C++ examples. |
| 120 | |
| 121 | Throw |
| 122 | ----- |
| 123 | |
| 124 | Languages that support exception handling typically provide a ``throw`` |
| 125 | operation to initiate the exception process. Internally, a ``throw`` operation |
| 126 | breaks down into two steps. |
| 127 | |
| 128 | #. A request is made to allocate exception space for an exception structure. |
| 129 | This structure needs to survive beyond the current activation. This structure |
| 130 | will contain the type and value of the object being thrown. |
| 131 | |
| 132 | #. A call is made to the runtime to raise the exception, passing the exception |
| 133 | structure as an argument. |
| 134 | |
| 135 | In C++, the allocation of the exception structure is done by the |
| 136 | ``__cxa_allocate_exception`` runtime function. The exception raising is handled |
| 137 | by ``__cxa_throw``. The type of the exception is represented using a C++ RTTI |
| 138 | structure. |
| 139 | |
| 140 | Try/Catch |
| 141 | --------- |
| 142 | |
| 143 | A call within the scope of a *try* statement can potentially raise an |
| 144 | exception. In those circumstances, the LLVM C++ front-end replaces the call with |
| 145 | an ``invoke`` instruction. Unlike a call, the ``invoke`` has two potential |
| 146 | continuation points: |
| 147 | |
| 148 | #. where to continue when the call succeeds as per normal, and |
| 149 | |
| 150 | #. where to continue if the call raises an exception, either by a throw or the |
| 151 | unwinding of a throw |
| 152 | |
Mark Seaborn | 1c30a35 | 2014-02-27 06:54:04 +0000 | [diff] [blame] | 153 | The term used to define the place where an ``invoke`` continues after an |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 154 | exception is called a *landing pad*. LLVM landing pads are conceptually |
| 155 | alternative function entry points where an exception structure reference and a |
| 156 | type info index are passed in as arguments. The landing pad saves the exception |
| 157 | structure reference and then proceeds to select the catch block that corresponds |
| 158 | to the type info of the exception object. |
| 159 | |
Dmitri Gribenko | 74401c8 | 2013-01-13 16:06:11 +0000 | [diff] [blame] | 160 | The LLVM :ref:`i_landingpad` is used to convey information about the landing |
| 161 | pad to the back end. For C++, the ``landingpad`` instruction returns a pointer |
| 162 | and integer pair corresponding to the pointer to the *exception structure* and |
| 163 | the *selector value* respectively. |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 164 | |
Vedant Kumar | b74d92d | 2015-09-08 20:16:35 +0000 | [diff] [blame] | 165 | The ``landingpad`` instruction looks for a reference to the personality |
| 166 | function to be used for this ``try``/``catch`` sequence in the parent |
| 167 | function's attribute list. The instruction contains a list of *cleanup*, |
| 168 | *catch*, and *filter* clauses. The exception is tested against the clauses |
| 169 | sequentially from first to last. The clauses have the following meanings: |
Mark Seaborn | 95e9730 | 2014-02-25 23:48:59 +0000 | [diff] [blame] | 170 | |
| 171 | - ``catch <type> @ExcType`` |
| 172 | |
| 173 | - This clause means that the landingpad block should be entered if the |
| 174 | exception being thrown is of type ``@ExcType`` or a subtype of |
| 175 | ``@ExcType``. For C++, ``@ExcType`` is a pointer to the ``std::type_info`` |
| 176 | object (an RTTI object) representing the C++ exception type. |
| 177 | |
| 178 | - If ``@ExcType`` is ``null``, any exception matches, so the landingpad |
| 179 | should always be entered. This is used for C++ catch-all blocks ("``catch |
| 180 | (...)``"). |
| 181 | |
| 182 | - When this clause is matched, the selector value will be equal to the value |
| 183 | returned by "``@llvm.eh.typeid.for(i8* @ExcType)``". This will always be a |
| 184 | positive value. |
| 185 | |
| 186 | - ``filter <type> [<type> @ExcType1, ..., <type> @ExcTypeN]`` |
| 187 | |
| 188 | - This clause means that the landingpad should be entered if the exception |
| 189 | being thrown does *not* match any of the types in the list (which, for C++, |
| 190 | are again specified as ``std::type_info`` pointers). |
| 191 | |
| 192 | - C++ front-ends use this to implement C++ exception specifications, such as |
| 193 | "``void foo() throw (ExcType1, ..., ExcTypeN) { ... }``". |
| 194 | |
| 195 | - When this clause is matched, the selector value will be negative. |
| 196 | |
| 197 | - The array argument to ``filter`` may be empty; for example, "``[0 x i8**] |
| 198 | undef``". This means that the landingpad should always be entered. (Note |
| 199 | that such a ``filter`` would not be equivalent to "``catch i8* null``", |
| 200 | because ``filter`` and ``catch`` produce negative and positive selector |
| 201 | values respectively.) |
| 202 | |
| 203 | - ``cleanup`` |
| 204 | |
| 205 | - This clause means that the landingpad should always be entered. |
| 206 | |
| 207 | - C++ front-ends use this for calling objects' destructors. |
| 208 | |
| 209 | - When this clause is matched, the selector value will be zero. |
| 210 | |
| 211 | - The runtime may treat "``cleanup``" differently from "``catch <type> |
| 212 | null``". |
| 213 | |
| 214 | In C++, if an unhandled exception occurs, the language runtime will call |
| 215 | ``std::terminate()``, but it is implementation-defined whether the runtime |
| 216 | unwinds the stack and calls object destructors first. For example, the GNU |
| 217 | C++ unwinder does not call object destructors when an unhandled exception |
| 218 | occurs. The reason for this is to improve debuggability: it ensures that |
| 219 | ``std::terminate()`` is called from the context of the ``throw``, so that |
| 220 | this context is not lost by unwinding the stack. A runtime will typically |
| 221 | implement this by searching for a matching non-``cleanup`` clause, and |
| 222 | aborting if it does not find one, before entering any landingpad blocks. |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 223 | |
| 224 | Once the landing pad has the type info selector, the code branches to the code |
| 225 | for the first catch. The catch then checks the value of the type info selector |
| 226 | against the index of type info for that catch. Since the type info index is not |
| 227 | known until all the type infos have been gathered in the backend, the catch code |
| 228 | must call the `llvm.eh.typeid.for`_ intrinsic to determine the index for a given |
| 229 | type info. If the catch fails to match the selector then control is passed on to |
| 230 | the next catch. |
| 231 | |
| 232 | Finally, the entry and exit of catch code is bracketed with calls to |
| 233 | ``__cxa_begin_catch`` and ``__cxa_end_catch``. |
| 234 | |
| 235 | * ``__cxa_begin_catch`` takes an exception structure reference as an argument |
| 236 | and returns the value of the exception object. |
| 237 | |
| 238 | * ``__cxa_end_catch`` takes no arguments. This function: |
| 239 | |
| 240 | #. Locates the most recently caught exception and decrements its handler |
| 241 | count, |
| 242 | |
| 243 | #. Removes the exception from the *caught* stack if the handler count goes to |
| 244 | zero, and |
| 245 | |
| 246 | #. Destroys the exception if the handler count goes to zero and the exception |
| 247 | was not re-thrown by throw. |
| 248 | |
| 249 | .. note:: |
| 250 | |
| 251 | a rethrow from within the catch may replace this call with a |
| 252 | ``__cxa_rethrow``. |
| 253 | |
| 254 | Cleanups |
| 255 | -------- |
| 256 | |
| 257 | A cleanup is extra code which needs to be run as part of unwinding a scope. C++ |
| 258 | destructors are a typical example, but other languages and language extensions |
| 259 | provide a variety of different kinds of cleanups. In general, a landing pad may |
| 260 | need to run arbitrary amounts of cleanup code before actually entering a catch |
Dmitri Gribenko | 74401c8 | 2013-01-13 16:06:11 +0000 | [diff] [blame] | 261 | block. To indicate the presence of cleanups, a :ref:`i_landingpad` should have |
| 262 | a *cleanup* clause. Otherwise, the unwinder will not stop at the landing pad if |
| 263 | there are no catches or filters that require it to. |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 264 | |
| 265 | .. note:: |
| 266 | |
| 267 | Do not allow a new exception to propagate out of the execution of a |
| 268 | cleanup. This can corrupt the internal state of the unwinder. Different |
| 269 | languages describe different high-level semantics for these situations: for |
| 270 | example, C++ requires that the process be terminated, whereas Ada cancels both |
| 271 | exceptions and throws a third. |
| 272 | |
| 273 | When all cleanups are finished, if the exception is not handled by the current |
Nico Weber | 176a288 | 2015-02-26 19:48:43 +0000 | [diff] [blame] | 274 | function, resume unwinding by calling the :ref:`resume instruction <i_resume>`, |
| 275 | passing in the result of the ``landingpad`` instruction for the original |
| 276 | landing pad. |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 277 | |
| 278 | Throw Filters |
| 279 | ------------- |
| 280 | |
| 281 | C++ allows the specification of which exception types may be thrown from a |
| 282 | function. To represent this, a top level landing pad may exist to filter out |
Dmitri Gribenko | 74401c8 | 2013-01-13 16:06:11 +0000 | [diff] [blame] | 283 | invalid types. To express this in LLVM code the :ref:`i_landingpad` will have a |
| 284 | filter clause. The clause consists of an array of type infos. |
| 285 | ``landingpad`` will return a negative value |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 286 | if the exception does not match any of the type infos. If no match is found then |
| 287 | a call to ``__cxa_call_unexpected`` should be made, otherwise |
| 288 | ``_Unwind_Resume``. Each of these functions requires a reference to the |
| 289 | exception structure. Note that the most general form of a ``landingpad`` |
| 290 | instruction can have any number of catch, cleanup, and filter clauses (though |
| 291 | having more than one cleanup is pointless). The LLVM C++ front-end can generate |
| 292 | such ``landingpad`` instructions due to inlining creating nested exception |
| 293 | handling scopes. |
| 294 | |
| 295 | .. _undefined: |
| 296 | |
| 297 | Restrictions |
| 298 | ------------ |
| 299 | |
| 300 | The unwinder delegates the decision of whether to stop in a call frame to that |
| 301 | call frame's language-specific personality function. Not all unwinders guarantee |
| 302 | that they will stop to perform cleanups. For example, the GNU C++ unwinder |
| 303 | doesn't do so unless the exception is actually caught somewhere further up the |
| 304 | stack. |
| 305 | |
| 306 | In order for inlining to behave correctly, landing pads must be prepared to |
| 307 | handle selector results that they did not originally advertise. Suppose that a |
| 308 | function catches exceptions of type ``A``, and it's inlined into a function that |
| 309 | catches exceptions of type ``B``. The inliner will update the ``landingpad`` |
| 310 | instruction for the inlined landing pad to include the fact that ``B`` is also |
| 311 | caught. If that landing pad assumes that it will only be entered to catch an |
| 312 | ``A``, it's in for a rude awakening. Consequently, landing pads must test for |
| 313 | the selector results they understand and then resume exception propagation with |
| 314 | the `resume instruction <LangRef.html#i_resume>`_ if none of the conditions |
| 315 | match. |
| 316 | |
| 317 | Exception Handling Intrinsics |
| 318 | ============================= |
| 319 | |
| 320 | In addition to the ``landingpad`` and ``resume`` instructions, LLVM uses several |
| 321 | intrinsic functions (name prefixed with ``llvm.eh``) to provide exception |
| 322 | handling information at various points in generated code. |
| 323 | |
| 324 | .. _llvm.eh.typeid.for: |
| 325 | |
Dmitri Gribenko | 135174d | 2013-01-13 16:07:49 +0000 | [diff] [blame] | 326 | ``llvm.eh.typeid.for`` |
| 327 | ---------------------- |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 328 | |
| 329 | .. code-block:: llvm |
| 330 | |
| 331 | i32 @llvm.eh.typeid.for(i8* %type_info) |
| 332 | |
| 333 | |
| 334 | This intrinsic returns the type info index in the exception table of the current |
| 335 | function. This value can be used to compare against the result of |
| 336 | ``landingpad`` instruction. The single argument is a reference to a type info. |
| 337 | |
Mark Seaborn | e259168 | 2014-03-28 17:08:57 +0000 | [diff] [blame] | 338 | Uses of this intrinsic are generated by the C++ front-end. |
| 339 | |
Andrew Kaylor | 7741851 | 2015-02-10 19:52:43 +0000 | [diff] [blame] | 340 | .. _llvm.eh.begincatch: |
| 341 | |
| 342 | ``llvm.eh.begincatch`` |
| 343 | ---------------------- |
| 344 | |
| 345 | .. code-block:: llvm |
| 346 | |
Reid Kleckner | 9c28314 | 2015-03-03 17:41:09 +0000 | [diff] [blame] | 347 | void @llvm.eh.begincatch(i8* %ehptr, i8* %ehobj) |
Andrew Kaylor | 7741851 | 2015-02-10 19:52:43 +0000 | [diff] [blame] | 348 | |
| 349 | |
| 350 | This intrinsic marks the beginning of catch handling code within the blocks |
| 351 | following a ``landingpad`` instruction. The exact behavior of this function |
| 352 | depends on the compilation target and the personality function associated |
| 353 | with the ``landingpad`` instruction. |
| 354 | |
Reid Kleckner | 9c28314 | 2015-03-03 17:41:09 +0000 | [diff] [blame] | 355 | The first argument to this intrinsic is a pointer that was previously extracted |
| 356 | from the aggregate return value of the ``landingpad`` instruction. The second |
| 357 | argument to the intrinsic is a pointer to stack space where the exception object |
| 358 | should be stored. The runtime handles the details of copying the exception |
| 359 | object into the slot. If the second parameter is null, no copy occurs. |
Andrew Kaylor | 7741851 | 2015-02-10 19:52:43 +0000 | [diff] [blame] | 360 | |
| 361 | Uses of this intrinsic are generated by the C++ front-end. Many targets will |
| 362 | use implementation-specific functions (such as ``__cxa_begin_catch``) instead |
| 363 | of this intrinsic. The intrinsic is provided for targets that require a more |
| 364 | abstract interface. |
| 365 | |
| 366 | When used in the native Windows C++ exception handling implementation, this |
| 367 | intrinsic serves as a placeholder to delimit code before a catch handler is |
Hiroshi Inoue | ef1bc2d | 2018-04-12 05:53:20 +0000 | [diff] [blame] | 368 | outlined. When the handler is outlined, this intrinsic will be replaced |
Andrew Kaylor | 7741851 | 2015-02-10 19:52:43 +0000 | [diff] [blame] | 369 | by instructions that retrieve the exception object pointer from the frame |
| 370 | allocation block. |
| 371 | |
| 372 | |
| 373 | .. _llvm.eh.endcatch: |
| 374 | |
| 375 | ``llvm.eh.endcatch`` |
| 376 | ---------------------- |
| 377 | |
| 378 | .. code-block:: llvm |
| 379 | |
| 380 | void @llvm.eh.endcatch() |
| 381 | |
| 382 | |
| 383 | This intrinsic marks the end of catch handling code within the current block, |
| 384 | which will be a successor of a block which called ``llvm.eh.begincatch''. |
| 385 | The exact behavior of this function depends on the compilation target and the |
| 386 | personality function associated with the corresponding ``landingpad`` |
| 387 | instruction. |
| 388 | |
| 389 | There may be more than one call to ``llvm.eh.endcatch`` for any given call to |
| 390 | ``llvm.eh.begincatch`` with each ``llvm.eh.endcatch`` call corresponding to the |
| 391 | end of a different control path. All control paths following a call to |
| 392 | ``llvm.eh.begincatch`` must reach a call to ``llvm.eh.endcatch``. |
| 393 | |
| 394 | Uses of this intrinsic are generated by the C++ front-end. Many targets will |
| 395 | use implementation-specific functions (such as ``__cxa_begin_catch``) instead |
| 396 | of this intrinsic. The intrinsic is provided for targets that require a more |
| 397 | abstract interface. |
| 398 | |
| 399 | When used in the native Windows C++ exception handling implementation, this |
| 400 | intrinsic serves as a placeholder to delimit code before a catch handler is |
| 401 | outlined. After the handler is outlined, this intrinsic is simply removed. |
| 402 | |
| 403 | |
Joseph Tremoulet | 8f3f5c3 | 2015-09-03 09:15:32 +0000 | [diff] [blame] | 404 | .. _llvm.eh.exceptionpointer: |
| 405 | |
| 406 | ``llvm.eh.exceptionpointer`` |
Joseph Tremoulet | 2467a4c | 2015-09-03 09:33:54 +0000 | [diff] [blame] | 407 | ---------------------------- |
Joseph Tremoulet | 8f3f5c3 | 2015-09-03 09:15:32 +0000 | [diff] [blame] | 408 | |
Renato Golin | 88ea57f | 2016-07-20 12:16:38 +0000 | [diff] [blame] | 409 | .. code-block:: text |
Joseph Tremoulet | 8f3f5c3 | 2015-09-03 09:15:32 +0000 | [diff] [blame] | 410 | |
| 411 | i8 addrspace(N)* @llvm.eh.padparam.pNi8(token %catchpad) |
| 412 | |
| 413 | |
| 414 | This intrinsic retrieves a pointer to the exception caught by the given |
| 415 | ``catchpad``. |
| 416 | |
| 417 | |
Mark Seaborn | e259168 | 2014-03-28 17:08:57 +0000 | [diff] [blame] | 418 | SJLJ Intrinsics |
| 419 | --------------- |
| 420 | |
| 421 | The ``llvm.eh.sjlj`` intrinsics are used internally within LLVM's |
| 422 | backend. Uses of them are generated by the backend's |
| 423 | ``SjLjEHPrepare`` pass. |
| 424 | |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 425 | .. _llvm.eh.sjlj.setjmp: |
| 426 | |
Dmitri Gribenko | 135174d | 2013-01-13 16:07:49 +0000 | [diff] [blame] | 427 | ``llvm.eh.sjlj.setjmp`` |
Mark Seaborn | e259168 | 2014-03-28 17:08:57 +0000 | [diff] [blame] | 428 | ~~~~~~~~~~~~~~~~~~~~~~~ |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 429 | |
Renato Golin | 88ea57f | 2016-07-20 12:16:38 +0000 | [diff] [blame] | 430 | .. code-block:: text |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 431 | |
| 432 | i32 @llvm.eh.sjlj.setjmp(i8* %setjmp_buf) |
| 433 | |
| 434 | For SJLJ based exception handling, this intrinsic forces register saving for the |
| 435 | current function and stores the address of the following instruction for use as |
| 436 | a destination address by `llvm.eh.sjlj.longjmp`_. The buffer format and the |
| 437 | overall functioning of this intrinsic is compatible with the GCC |
| 438 | ``__builtin_setjmp`` implementation allowing code built with the clang and GCC |
| 439 | to interoperate. |
| 440 | |
| 441 | The single parameter is a pointer to a five word buffer in which the calling |
| 442 | context is saved. The front end places the frame pointer in the first word, and |
| 443 | the target implementation of this intrinsic should place the destination address |
| 444 | for a `llvm.eh.sjlj.longjmp`_ in the second word. The following three words are |
| 445 | available for use in a target-specific manner. |
| 446 | |
| 447 | .. _llvm.eh.sjlj.longjmp: |
| 448 | |
Dmitri Gribenko | 135174d | 2013-01-13 16:07:49 +0000 | [diff] [blame] | 449 | ``llvm.eh.sjlj.longjmp`` |
Mark Seaborn | e259168 | 2014-03-28 17:08:57 +0000 | [diff] [blame] | 450 | ~~~~~~~~~~~~~~~~~~~~~~~~ |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 451 | |
| 452 | .. code-block:: llvm |
| 453 | |
| 454 | void @llvm.eh.sjlj.longjmp(i8* %setjmp_buf) |
| 455 | |
| 456 | For SJLJ based exception handling, the ``llvm.eh.sjlj.longjmp`` intrinsic is |
| 457 | used to implement ``__builtin_longjmp()``. The single parameter is a pointer to |
| 458 | a buffer populated by `llvm.eh.sjlj.setjmp`_. The frame pointer and stack |
| 459 | pointer are restored from the buffer, then control is transferred to the |
| 460 | destination address. |
| 461 | |
Dmitri Gribenko | 135174d | 2013-01-13 16:07:49 +0000 | [diff] [blame] | 462 | ``llvm.eh.sjlj.lsda`` |
Mark Seaborn | e259168 | 2014-03-28 17:08:57 +0000 | [diff] [blame] | 463 | ~~~~~~~~~~~~~~~~~~~~~ |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 464 | |
| 465 | .. code-block:: llvm |
| 466 | |
| 467 | i8* @llvm.eh.sjlj.lsda() |
| 468 | |
| 469 | For SJLJ based exception handling, the ``llvm.eh.sjlj.lsda`` intrinsic returns |
| 470 | the address of the Language Specific Data Area (LSDA) for the current |
| 471 | function. The SJLJ front-end code stores this address in the exception handling |
| 472 | function context for use by the runtime. |
| 473 | |
Dmitri Gribenko | 135174d | 2013-01-13 16:07:49 +0000 | [diff] [blame] | 474 | ``llvm.eh.sjlj.callsite`` |
Mark Seaborn | e259168 | 2014-03-28 17:08:57 +0000 | [diff] [blame] | 475 | ~~~~~~~~~~~~~~~~~~~~~~~~~ |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 476 | |
| 477 | .. code-block:: llvm |
| 478 | |
| 479 | void @llvm.eh.sjlj.callsite(i32 %call_site_num) |
| 480 | |
| 481 | For SJLJ based exception handling, the ``llvm.eh.sjlj.callsite`` intrinsic |
| 482 | identifies the callsite value associated with the following ``invoke`` |
| 483 | instruction. This is used to ensure that landing pad entries in the LSDA are |
| 484 | generated in matching order. |
| 485 | |
| 486 | Asm Table Formats |
| 487 | ================= |
| 488 | |
| 489 | There are two tables that are used by the exception handling runtime to |
| 490 | determine which actions should be taken when an exception is thrown. |
| 491 | |
| 492 | Exception Handling Frame |
| 493 | ------------------------ |
| 494 | |
| 495 | An exception handling frame ``eh_frame`` is very similar to the unwind frame |
| 496 | used by DWARF debug info. The frame contains all the information necessary to |
| 497 | tear down the current frame and restore the state of the prior frame. There is |
| 498 | an exception handling frame for each function in a compile unit, plus a common |
| 499 | exception handling frame that defines information common to all functions in the |
| 500 | unit. |
| 501 | |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 502 | The format of this call frame information (CFI) is often platform-dependent, |
| 503 | however. ARM, for example, defines their own format. Apple has their own compact |
| 504 | unwind info format. On Windows, another format is used for all architectures |
| 505 | since 32-bit x86. LLVM will emit whatever information is required by the |
| 506 | target. |
| 507 | |
Bill Wendling | 7d85f87 | 2012-06-27 07:20:57 +0000 | [diff] [blame] | 508 | Exception Tables |
| 509 | ---------------- |
| 510 | |
| 511 | An exception table contains information about what actions to take when an |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 512 | exception is thrown in a particular part of a function's code. This is typically |
| 513 | referred to as the language-specific data area (LSDA). The format of the LSDA |
| 514 | table is specific to the personality function, but the majority of personalities |
| 515 | out there use a variation of the tables consumed by ``__gxx_personality_v0``. |
| 516 | There is one exception table per function, except leaf functions and functions |
| 517 | that have calls only to non-throwing functions. They do not need an exception |
| 518 | table. |
| 519 | |
| 520 | .. _wineh: |
| 521 | |
| 522 | Exception Handling using the Windows Runtime |
| 523 | ================================================= |
| 524 | |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 525 | Background on Windows exceptions |
| 526 | --------------------------------- |
| 527 | |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 528 | Interacting with exceptions on Windows is significantly more complicated than |
| 529 | on Itanium C++ ABI platforms. The fundamental difference between the two models |
| 530 | is that Itanium EH is designed around the idea of "successive unwinding," while |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 531 | Windows EH is not. |
| 532 | |
| 533 | Under Itanium, throwing an exception typically involes allocating thread local |
| 534 | memory to hold the exception, and calling into the EH runtime. The runtime |
| 535 | identifies frames with appropriate exception handling actions, and successively |
| 536 | resets the register context of the current thread to the most recently active |
| 537 | frame with actions to run. In LLVM, execution resumes at a ``landingpad`` |
| 538 | instruction, which produces register values provided by the runtime. If a |
| 539 | function is only cleaning up allocated resources, the function is responsible |
| 540 | for calling ``_Unwind_Resume`` to transition to the next most recently active |
| 541 | frame after it is finished cleaning up. Eventually, the frame responsible for |
| 542 | handling the exception calls ``__cxa_end_catch`` to destroy the exception, |
| 543 | release its memory, and resume normal control flow. |
| 544 | |
| 545 | The Windows EH model does not use these successive register context resets. |
| 546 | Instead, the active exception is typically described by a frame on the stack. |
| 547 | In the case of C++ exceptions, the exception object is allocated in stack memory |
| 548 | and its address is passed to ``__CxxThrowException``. General purpose structured |
| 549 | exceptions (SEH) are more analogous to Linux signals, and they are dispatched by |
| 550 | userspace DLLs provided with Windows. Each frame on the stack has an assigned EH |
| 551 | personality routine, which decides what actions to take to handle the exception. |
| 552 | There are a few major personalities for C and C++ code: the C++ personality |
| 553 | (``__CxxFrameHandler3``) and the SEH personalities (``_except_handler3``, |
| 554 | ``_except_handler4``, and ``__C_specific_handler``). All of them implement |
| 555 | cleanups by calling back into a "funclet" contained in the parent function. |
| 556 | |
| 557 | Funclets, in this context, are regions of the parent function that can be called |
| 558 | as though they were a function pointer with a very special calling convention. |
| 559 | The frame pointer of the parent frame is passed into the funclet either using |
| 560 | the standard EBP register or as the first parameter register, depending on the |
| 561 | architecture. The funclet implements the EH action by accessing local variables |
| 562 | in memory through the frame pointer, and returning some appropriate value, |
| 563 | continuing the EH process. No variables live in to or out of the funclet can be |
| 564 | allocated in registers. |
| 565 | |
| 566 | The C++ personality also uses funclets to contain the code for catch blocks |
| 567 | (i.e. all user code between the braces in ``catch (Type obj) { ... }``). The |
| 568 | runtime must use funclets for catch bodies because the C++ exception object is |
| 569 | allocated in a child stack frame of the function handling the exception. If the |
| 570 | runtime rewound the stack back to frame of the catch, the memory holding the |
| 571 | exception would be overwritten quickly by subsequent function calls. The use of |
| 572 | funclets also allows ``__CxxFrameHandler3`` to implement rethrow without |
| 573 | resorting to TLS. Instead, the runtime throws a special exception, and then uses |
| 574 | SEH (``__try / __except``) to resume execution with new information in the child |
| 575 | frame. |
| 576 | |
| 577 | In other words, the successive unwinding approach is incompatible with Visual |
| 578 | C++ exceptions and general purpose Windows exception handling. Because the C++ |
| 579 | exception object lives in stack memory, LLVM cannot provide a custom personality |
| 580 | function that uses landingpads. Similarly, SEH does not provide any mechanism |
| 581 | to rethrow an exception or continue unwinding. Therefore, LLVM must use the IR |
| 582 | constructs described later in this document to implement compatible exception |
| 583 | handling. |
| 584 | |
| 585 | SEH filter expressions |
| 586 | ----------------------- |
| 587 | |
| 588 | The SEH personality functions also use funclets to implement filter expressions, |
| 589 | which allow executing arbitrary user code to decide which exceptions to catch. |
| 590 | Filter expressions should not be confused with the ``filter`` clause of the LLVM |
| 591 | ``landingpad`` instruction. Typically filter expressions are used to determine |
| 592 | if the exception came from a particular DLL or code region, or if code faulted |
| 593 | while accessing a particular memory address range. LLVM does not currently have |
| 594 | IR to represent filter expressions because it is difficult to represent their |
| 595 | control dependencies. Filter expressions run during the first phase of EH, |
| 596 | before cleanups run, making it very difficult to build a faithful control flow |
| 597 | graph. For now, the new EH instructions cannot represent SEH filter |
| 598 | expressions, and frontends must outline them ahead of time. Local variables of |
| 599 | the parent function can be escaped and accessed using the ``llvm.localescape`` |
| 600 | and ``llvm.localrecover`` intrinsics. |
| 601 | |
| 602 | New exception handling instructions |
| 603 | ------------------------------------ |
| 604 | |
| 605 | The primary design goal of the new EH instructions is to support funclet |
| 606 | generation while preserving information about the CFG so that SSA formation |
| 607 | still works. As a secondary goal, they are designed to be generic across MSVC |
| 608 | and Itanium C++ exceptions. They make very few assumptions about the data |
| 609 | required by the personality, so long as it uses the familiar core EH actions: |
| 610 | catch, cleanup, and terminate. However, the new instructions are hard to modify |
| 611 | without knowing details of the EH personality. While they can be used to |
| 612 | represent Itanium EH, the landingpad model is strictly better for optimization |
| 613 | purposes. |
| 614 | |
| 615 | The following new instructions are considered "exception handling pads", in that |
| 616 | they must be the first non-phi instruction of a basic block that may be the |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 617 | unwind destination of an EH flow edge: |
David Majnemer | 868145e | 2015-12-14 18:34:23 +0000 | [diff] [blame] | 618 | ``catchswitch``, ``catchpad``, and ``cleanuppad``. |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 619 | As with landingpads, when entering a try scope, if the |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 620 | frontend encounters a call site that may throw an exception, it should emit an |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 621 | invoke that unwinds to a ``catchswitch`` block. Similarly, inside the scope of a |
David Majnemer | 868145e | 2015-12-14 18:34:23 +0000 | [diff] [blame] | 622 | C++ object with a destructor, invokes should unwind to a ``cleanuppad``. |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 623 | |
Joseph Tremoulet | 226889e | 2015-09-03 09:09:43 +0000 | [diff] [blame] | 624 | New instructions are also used to mark the points where control is transferred |
| 625 | out of a catch/cleanup handler (which will correspond to exits from the |
| 626 | generated funclet). A catch handler which reaches its end by normal execution |
| 627 | executes a ``catchret`` instruction, which is a terminator indicating where in |
| 628 | the function control is returned to. A cleanup handler which reaches its end |
| 629 | by normal execution executes a ``cleanupret`` instruction, which is a terminator |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 630 | indicating where the active exception will unwind to next. |
Joseph Tremoulet | 226889e | 2015-09-03 09:09:43 +0000 | [diff] [blame] | 631 | |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 632 | Each of these new EH pad instructions has a way to identify which action should |
David Majnemer | 868145e | 2015-12-14 18:34:23 +0000 | [diff] [blame] | 633 | be considered after this action. The ``catchswitch`` instruction is a terminator |
| 634 | and has an unwind destination operand analogous to the unwind destination of an |
| 635 | invoke. The ``cleanuppad`` instruction is not |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 636 | a terminator, so the unwind destination is stored on the ``cleanupret`` |
| 637 | instruction instead. Successfully executing a catch handler should resume |
| 638 | normal control flow, so neither ``catchpad`` nor ``catchret`` instructions can |
| 639 | unwind. All of these "unwind edges" may refer to a basic block that contains an |
| 640 | EH pad instruction, or they may unwind to the caller. Unwinding to the caller |
| 641 | has roughly the same semantics as the ``resume`` instruction in the landingpad |
| 642 | model. When inlining through an invoke, instructions that unwind to the caller |
| 643 | are hooked up to unwind to the unwind destination of the call site. |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 644 | |
| 645 | Putting things together, here is a hypothetical lowering of some C++ that uses |
| 646 | all of the new IR instructions: |
| 647 | |
| 648 | .. code-block:: c |
| 649 | |
| 650 | struct Cleanup { |
| 651 | Cleanup(); |
| 652 | ~Cleanup(); |
| 653 | int m; |
| 654 | }; |
| 655 | void may_throw(); |
| 656 | int f() noexcept { |
| 657 | try { |
| 658 | Cleanup obj; |
| 659 | may_throw(); |
| 660 | } catch (int e) { |
Joseph Tremoulet | 226889e | 2015-09-03 09:09:43 +0000 | [diff] [blame] | 661 | may_throw(); |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 662 | return e; |
| 663 | } |
| 664 | return 0; |
| 665 | } |
| 666 | |
Renato Golin | 88ea57f | 2016-07-20 12:16:38 +0000 | [diff] [blame] | 667 | .. code-block:: text |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 668 | |
| 669 | define i32 @f() nounwind personality i32 (...)* @__CxxFrameHandler3 { |
| 670 | entry: |
| 671 | %obj = alloca %struct.Cleanup, align 4 |
| 672 | %e = alloca i32, align 4 |
| 673 | %call = invoke %struct.Cleanup* @"\01??0Cleanup@@QEAA@XZ"(%struct.Cleanup* nonnull %obj) |
| 674 | to label %invoke.cont unwind label %lpad.catch |
| 675 | |
| 676 | invoke.cont: ; preds = %entry |
| 677 | invoke void @"\01?may_throw@@YAXXZ"() |
| 678 | to label %invoke.cont.2 unwind label %lpad.cleanup |
| 679 | |
| 680 | invoke.cont.2: ; preds = %invoke.cont |
| 681 | call void @"\01??_DCleanup@@QEAA@XZ"(%struct.Cleanup* nonnull %obj) nounwind |
| 682 | br label %return |
| 683 | |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 684 | return: ; preds = %invoke.cont.3, %invoke.cont.2 |
| 685 | %retval.0 = phi i32 [ 0, %invoke.cont.2 ], [ %3, %invoke.cont.3 ] |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 686 | ret i32 %retval.0 |
| 687 | |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 688 | lpad.cleanup: ; preds = %invoke.cont.2 |
| 689 | %0 = cleanuppad within none [] |
| 690 | call void @"\01??1Cleanup@@QEAA@XZ"(%struct.Cleanup* nonnull %obj) nounwind |
| 691 | cleanupret %0 unwind label %lpad.catch |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 692 | |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 693 | lpad.catch: ; preds = %lpad.cleanup, %entry |
| 694 | %1 = catchswitch within none [label %catch.body] unwind label %lpad.terminate |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 695 | |
Joseph Tremoulet | 226889e | 2015-09-03 09:09:43 +0000 | [diff] [blame] | 696 | catch.body: ; preds = %lpad.catch |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 697 | %catch = catchpad within %1 [%rtti.TypeDescriptor2* @"\01??_R0H@8", i32 0, i32* %e] |
Joseph Tremoulet | 226889e | 2015-09-03 09:09:43 +0000 | [diff] [blame] | 698 | invoke void @"\01?may_throw@@YAXXZ"() |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 699 | to label %invoke.cont.3 unwind label %lpad.terminate |
Joseph Tremoulet | 226889e | 2015-09-03 09:09:43 +0000 | [diff] [blame] | 700 | |
| 701 | invoke.cont.3: ; preds = %catch.body |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 702 | %3 = load i32, i32* %e, align 4 |
| 703 | catchret from %catch to label %return |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 704 | |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 705 | lpad.terminate: ; preds = %catch.body, %lpad.catch |
David Majnemer | 868145e | 2015-12-14 18:34:23 +0000 | [diff] [blame] | 706 | cleanuppad within none [] |
| 707 | call void @"\01?terminate@@YAXXZ" |
| 708 | unreachable |
Reid Kleckner | cf16656 | 2015-08-06 21:01:32 +0000 | [diff] [blame] | 709 | } |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 710 | |
| 711 | Funclet parent tokens |
| 712 | ----------------------- |
| 713 | |
| 714 | In order to produce tables for EH personalities that use funclets, it is |
| 715 | necessary to recover the nesting that was present in the source. This funclet |
| 716 | parent relationship is encoded in the IR using tokens produced by the new "pad" |
| 717 | instructions. The token operand of a "pad" or "ret" instruction indicates which |
| 718 | funclet it is in, or "none" if it is not nested within another funclet. |
| 719 | |
| 720 | The ``catchpad`` and ``cleanuppad`` instructions establish new funclets, and |
| 721 | their tokens are consumed by other "pad" instructions to establish membership. |
| 722 | The ``catchswitch`` instruction does not create a funclet, but it produces a |
| 723 | token that is always consumed by its immediate successor ``catchpad`` |
| 724 | instructions. This ensures that every catch handler modelled by a ``catchpad`` |
| 725 | belongs to exactly one ``catchswitch``, which models the dispatch point after a |
David Majnemer | 868145e | 2015-12-14 18:34:23 +0000 | [diff] [blame] | 726 | C++ try. |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 727 | |
| 728 | Here is an example of what this nesting looks like using some hypothetical |
| 729 | C++ code: |
| 730 | |
| 731 | .. code-block:: c |
| 732 | |
| 733 | void f() { |
| 734 | try { |
| 735 | throw; |
| 736 | } catch (...) { |
| 737 | try { |
| 738 | throw; |
| 739 | } catch (...) { |
| 740 | } |
| 741 | } |
| 742 | } |
| 743 | |
Renato Golin | 88ea57f | 2016-07-20 12:16:38 +0000 | [diff] [blame] | 744 | .. code-block:: text |
David Majnemer | 7197ef7 | 2015-12-12 06:56:02 +0000 | [diff] [blame] | 745 | |
David Majnemer | 8cec2f2 | 2015-12-12 05:38:55 +0000 | [diff] [blame] | 746 | define void @f() #0 personality i8* bitcast (i32 (...)* @__CxxFrameHandler3 to i8*) { |
| 747 | entry: |
| 748 | invoke void @_CxxThrowException(i8* null, %eh.ThrowInfo* null) #1 |
| 749 | to label %unreachable unwind label %catch.dispatch |
| 750 | |
| 751 | catch.dispatch: ; preds = %entry |
| 752 | %0 = catchswitch within none [label %catch] unwind to caller |
| 753 | |
| 754 | catch: ; preds = %catch.dispatch |
| 755 | %1 = catchpad within %0 [i8* null, i32 64, i8* null] |
| 756 | invoke void @_CxxThrowException(i8* null, %eh.ThrowInfo* null) #1 |
| 757 | to label %unreachable unwind label %catch.dispatch2 |
| 758 | |
| 759 | catch.dispatch2: ; preds = %catch |
| 760 | %2 = catchswitch within %1 [label %catch3] unwind to caller |
| 761 | |
| 762 | catch3: ; preds = %catch.dispatch2 |
| 763 | %3 = catchpad within %2 [i8* null, i32 64, i8* null] |
| 764 | catchret from %3 to label %try.cont |
| 765 | |
| 766 | try.cont: ; preds = %catch3 |
| 767 | catchret from %1 to label %try.cont6 |
| 768 | |
| 769 | try.cont6: ; preds = %try.cont |
| 770 | ret void |
| 771 | |
| 772 | unreachable: ; preds = %catch, %entry |
| 773 | unreachable |
| 774 | } |
| 775 | |
| 776 | The "inner" ``catchswitch`` consumes ``%1`` which is produced by the outer |
| 777 | catchswitch. |
Joseph Tremoulet | fe9953a | 2016-01-10 04:28:38 +0000 | [diff] [blame] | 778 | |
| 779 | .. _wineh-constraints: |
| 780 | |
| 781 | Funclet transitions |
| 782 | ----------------------- |
| 783 | |
| 784 | The EH tables for personalities that use funclets make implicit use of the |
| 785 | funclet nesting relationship to encode unwind destinations, and so are |
| 786 | constrained in the set of funclet transitions they can represent. The related |
| 787 | LLVM IR instructions accordingly have constraints that ensure encodability of |
| 788 | the EH edges in the flow graph. |
| 789 | |
| 790 | A ``catchswitch``, ``catchpad``, or ``cleanuppad`` is said to be "entered" |
| 791 | when it executes. It may subsequently be "exited" by any of the following |
| 792 | means: |
| 793 | |
| 794 | * A ``catchswitch`` is immediately exited when none of its constituent |
| 795 | ``catchpad``\ s are appropriate for the in-flight exception and it unwinds |
| 796 | to its unwind destination or the caller. |
| 797 | * A ``catchpad`` and its parent ``catchswitch`` are both exited when a |
| 798 | ``catchret`` from the ``catchpad`` is executed. |
| 799 | * A ``cleanuppad`` is exited when a ``cleanupret`` from it is executed. |
| 800 | * Any of these pads is exited when control unwinds to the function's caller, |
| 801 | either by a ``call`` which unwinds all the way to the function's caller, |
| 802 | a nested ``catchswitch`` marked "``unwinds to caller``", or a nested |
| 803 | ``cleanuppad``\ 's ``cleanupret`` marked "``unwinds to caller"``. |
| 804 | * Any of these pads is exited when an unwind edge (from an ``invoke``, |
| 805 | nested ``catchswitch``, or nested ``cleanuppad``\ 's ``cleanupret``) |
| 806 | unwinds to a destination pad that is not a descendant of the given pad. |
| 807 | |
| 808 | Note that the ``ret`` instruction is *not* a valid way to exit a funclet pad; |
| 809 | it is undefined behavior to execute a ``ret`` when a pad has been entered but |
| 810 | not exited. |
| 811 | |
| 812 | A single unwind edge may exit any number of pads (with the restrictions that |
| 813 | the edge from a ``catchswitch`` must exit at least itself, and the edge from |
| 814 | a ``cleanupret`` must exit at least its ``cleanuppad``), and then must enter |
| 815 | exactly one pad, which must be distinct from all the exited pads. The parent |
| 816 | of the pad that an unwind edge enters must be the most-recently-entered |
| 817 | not-yet-exited pad (after exiting from any pads that the unwind edge exits), |
| 818 | or "none" if there is no such pad. This ensures that the stack of executing |
| 819 | funclets at run-time always corresponds to some path in the funclet pad tree |
| 820 | that the parent tokens encode. |
Joseph Tremoulet | c2d8241 | 2016-01-10 04:30:02 +0000 | [diff] [blame] | 821 | |
| 822 | All unwind edges which exit any given funclet pad (including ``cleanupret`` |
| 823 | edges exiting their ``cleanuppad`` and ``catchswitch`` edges exiting their |
| 824 | ``catchswitch``) must share the same unwind destination. Similarly, any |
| 825 | funclet pad which may be exited by unwind to caller must not be exited by |
| 826 | any exception edges which unwind anywhere other than the caller. This |
| 827 | ensures that each funclet as a whole has only one unwind destination, which |
| 828 | EH tables for funclet personalities may require. Note that any unwind edge |
| 829 | which exits a ``catchpad`` also exits its parent ``catchswitch``, so this |
| 830 | implies that for any given ``catchswitch``, its unwind destination must also |
| 831 | be the unwind destination of any unwind edge that exits any of its constituent |
| 832 | ``catchpad``\s. Because ``catchswitch`` has no ``nounwind`` variant, and |
| 833 | because IR producers are not *required* to annotate calls which will not |
| 834 | unwind as ``nounwind``, it is legal to nest a ``call`` or an "``unwind to |
| 835 | caller``\ " ``catchswitch`` within a funclet pad that has an unwind |
| 836 | destination other than caller; it is undefined behavior for such a ``call`` |
| 837 | or ``catchswitch`` to unwind. |
Joseph Tremoulet | 12b6cd2 | 2016-01-10 04:31:05 +0000 | [diff] [blame] | 838 | |
| 839 | Finally, the funclet pads' unwind destinations cannot form a cycle. This |
| 840 | ensures that EH lowering can construct "try regions" with a tree-like |
| 841 | structure, which funclet-based personalities may require. |
David Chisnall | e8a15e2f | 2018-01-24 09:53:01 +0000 | [diff] [blame] | 842 | |
| 843 | Exception Handling support on the target |
| 844 | ================================================= |
| 845 | |
| 846 | In order to support exception handling on particular target, there are a few |
| 847 | items need to be implemented. |
| 848 | |
| 849 | * CFI directives |
| 850 | |
| 851 | First, you have to assign each target register with a unique DWARF number. |
| 852 | Then in ``TargetFrameLowering``'s ``emitPrologue``, you have to emit `CFI |
| 853 | directives <https://sourceware.org/binutils/docs/as/CFI-directives.html>`_ |
| 854 | to specify how to calculate the CFA (Canonical Frame Address) and how register |
| 855 | is restored from the address pointed by the CFA with an offset. The assembler |
| 856 | is instructed by CFI directives to build ``.eh_frame`` section, which is used |
| 857 | by th unwinder to unwind stack during exception handling. |
| 858 | |
| 859 | * ``getExceptionPointerRegister`` and ``getExceptionSelectorRegister`` |
| 860 | |
| 861 | ``TargetLowering`` must implement both functions. The *personality function* |
| 862 | passes the *exception structure* (a pointer) and *selector value* (an integer) |
| 863 | to the landing pad through the registers specified by ``getExceptionPointerRegister`` |
| 864 | and ``getExceptionSelectorRegister`` respectively. On most platforms, they |
| 865 | will be GPRs and will be the same as the ones specified in the calling convention. |
| 866 | |
| 867 | * ``EH_RETURN`` |
| 868 | |
| 869 | The ISD node represents the undocumented GCC extension ``__builtin_eh_return (offset, handler)``, |
| 870 | which adjusts the stack by offset and then jumps to the handler. ``__builtin_eh_return`` |
| 871 | is used in GCC unwinder (`libgcc <https://gcc.gnu.org/onlinedocs/gccint/Libgcc.html>`_), |
| 872 | but not in LLVM unwinder (`libunwind <https://clang.llvm.org/docs/Toolchain.html#unwind-library>`_). |
| 873 | If you are on the top of ``libgcc`` and have particular requirement on your target, |
| 874 | you have to handle ``EH_RETURN`` in ``TargetLowering``. |
| 875 | |
| 876 | If you don't leverage the existing runtime (``libstdc++`` and ``libgcc``), |
| 877 | you have to take a look on `libc++ <https://libcxx.llvm.org/>`_ and |
| 878 | `libunwind <https://clang.llvm.org/docs/Toolchain.html#unwind-library>`_ |
| 879 | to see what have to be done there. For ``libunwind``, you have to do the following |
| 880 | |
| 881 | * ``__libunwind_config.h`` |
| 882 | |
| 883 | Define macros for your target. |
| 884 | |
| 885 | * ``include/libunwind.h`` |
| 886 | |
| 887 | Define enum for the target registers. |
| 888 | |
| 889 | * ``src/Registers.hpp`` |
| 890 | |
| 891 | Define ``Registers`` class for your target, implement setter and getter functions. |
| 892 | |
| 893 | * ``src/UnwindCursor.hpp`` |
| 894 | |
| 895 | Define ``dwarfEncoding`` and ``stepWithCompactEncoding`` for your ``Registers`` |
| 896 | class. |
| 897 | |
| 898 | * ``src/UnwindRegistersRestore.S`` |
| 899 | |
| 900 | Write an assembly function to restore all your target registers from the memory. |
| 901 | |
| 902 | * ``src/UnwindRegistersSave.S`` |
| 903 | |
| 904 | Write an assembly function to save all your target registers on the memory. |