doc/manual/compiler.texinfo

   1 @node Compiler
   2 @comment  node-name,  next,  previous,  up
   3 @chapter Compiler
   4
   5 This chapter will discuss most compiler issues other than efficiency,
   6 including compiler error messages, the SBCL compiler's unusual
   7 approach to type safety in the presence of type declarations, the
   8 effects of various compiler optimization policies, and the way that
   9 inlining and open coding may cause optimized code to differ from a
  10 naive translation. Efficiency issues are sufficiently varied and
  11 separate that they have their own chapter, @ref{Efficiency}.
  12
  13 @menu
  14 * Diagnostic Messages::
  15 * Handling of Types::
  16 * Compiler Policy::
  17 * Compiler Errors::
  18 * Open Coding and Inline Expansion::
  19 * Interpreter::
  20 * Advanced Compiler Use and Efficiency Hints::
  21 @end menu
  22
  23 @node  Diagnostic Messages
  24 @comment  node-name,  next,  previous,  up
  25 @section Diagnostic Messages
  26 @cindex Messages, Compiler
  27 @cindex Compiler messages
  28
  29 @menu
  30 * Controlling Verbosity::
  31 * Diagnostic Severity::
  32 * Understanding Compiler Diagnostics::
  33 @end menu
  34
  35 @node Controlling Verbosity
  36 @comment  node-name,  next,  previous,  up
  37 @subsection Controlling Verbosity
  38
  39 The compiler can be quite verbose in its diagnostic reporting, rather
  40 more then some users would prefer -- the amount of noise emitted can
  41 be controlled, however.
  42
  43 To control emission of compiler diagnostics (of any severity other
  44 than @code{error}: @pxref{Diagnostic Severity}) use the
  45 @code{sb-ext:muffle-conditions} and @code{sb-ext:unmuffle-conditions}
  46 declarations, specifying the type of condition that is to be muffled
  47 (the muffling is done using an associated @code{muffle-warning} restart).
  48
  49 Global control:
  50 @lisp
  51 ;;; Muffle compiler-notes globally
  52 (declaim (sb-ext:muffle-conditions sb-ext:compiler-note))
  53 @end lisp
  54
  55 Local control:
  56 @lisp
  57 ;;; Muffle compiler-notes based on lexical scope
  58 (defun foo (x)
  59   (declare (optimize speed) (fixnum x)
  60            (sb-ext:muffle-conditions sb-ext:compiler-note))
  61   (values (* x 5) ; no compiler note from this
  62     (locally
  63       (declare (sb-ext:unmuffle-conditions sb-ext:compiler-note))
  64       ;; this one gives a compiler note
  65       (* x -5))))
  66 @end lisp
  67
  68 @deffn {Declaration} @sbext{muffle-conditions}
  69 Syntax: type*
  70
  71 Muffles the diagnostic messages that would be caused by compile-time
  72 signals of given types.
  73 @end deffn
  74
  75 @deffn {Declaration} @sbext{unmuffle-conditions}
  76 Syntax: type*
  77
  78 Cancels the effect of a previous @code{sb-ext:muffle-conditions}
  79 declaration.
  80 @end deffn
  81
  82 Various details of @emph{how} the compiler messages are printed can be
  83 controlled via the alist
  84 @code{sb-ext:*compiler-print-variable-alist*}.
  85
  86 @include var-sb-ext-star-compiler-print-variable-alist-star.texinfo
  87
  88 For information about muffling warnings signaled outside of the
  89 compiler, see @ref{Customization Hooks for Users}.
  90
  91 @c <!-- FIXME: How much control over error messages is in SBCL?
  92 @c      _     How much should be? How much of this documentation should
  93 @c      _     we save or adapt?
  94 @c      _
  95 @c      _ %%\node Error Message Parameterization,  , Read Errors, Interpreting Error Messages
  96 @c      _ \subsection{Error Message Parameterization}
  97 @c      _ \cpsubindex{error messages}{verbosity}
  98 @c      _ \cpsubindex{verbosity}{of error messages}
  99 @c      _
 100 @c      _ There is some control over the verbosity of error messages.  See also
 101 @c      _ \varref{undefined-warning-limit}, \code{*efficiency-note-limit*} and
 102 @c      _ \varref{efficiency-note-cost-threshold}.
 103 @c      _
 104 @c      _ \begin{defvar}{}{enclosing-source-cutoff}
 105 @c      _
 106 @c      _   This variable specifies the number of enclosing actual source forms
 107 @c      _   that are printed in full, rather than in the abbreviated processing
 108 @c      _   path format.  Increasing the value from its default of \code{1}
 109 @c      _   allows you to see more of the guts of the macroexpanded source,
 110 @c      _   which is useful when debugging macros.
 111 @c      _ \end{defvar}
 112 @c      _
 113 @c      _ \begin{defmac}{extensions:}{define-source-context}{%
 114 @c      _     \args{\var{name} \var{lambda-list} \mstar{form}}}
 115 @c      _
 116 @c      _   This macro defines how to extract an abbreviated source context from
 117 @c      _   the \var{name}d form when it appears in the compiler input.
 118 @c      _   \var{lambda-list} is a \code{defmacro} style lambda-list used to
 119 @c      _   parse the arguments.  The \var{body} should return a list of
 120 @c      _   subforms that can be printed on about one line.  There are
 121 @c      _   predefined methods for \code{defstruct}, \code{defmethod}, etc.  If
 122 @c      _   no method is defined, then the first two subforms are returned.
 123 @c      _   Note that this facility implicitly determines the string name
 124 @c      _   associated with anonymous functions.
 125 @c      _ \end{defmac}
 126 @c      _
 127 @c      _ -->
 128
 129 @node  Diagnostic Severity
 130 @comment  node-name,  next,  previous,  up
 131 @subsection Diagnostic Severity
 132 @cindex Severity of compiler messages
 133 @cindex Compiler Diagnostic Severity
 134 @tindex @cl{error}
 135 @tindex @cl{warning}
 136 @tindex @cl{style-warning}
 137 @tindex @sbext{compiler-note}
 138 @tindex @sbext{code-deletion-note}
 139
 140 There are four levels of compiler diagnostic severity:
 141
 142 @enumerate 1
 143 @item error
 144 @item warning
 145 @item style warning
 146 @item note
 147 @end enumerate
 148
 149 The first three levels correspond to condition classes which are
 150 defined in the ANSI standard for Common Lisp and which have special
 151 significance to the @code{compile} and @code{compile-file} functions.
 152 These levels of compiler error severity occur when the compiler
 153 handles conditions of these classes.
 154
 155 The fourth level of compiler error severity, @emph{note}, corresponds
 156 to the @code{sb-ext:compiler-note}, and is used for problems which are
 157 too mild for the standard condition classes, typically hints about how
 158 efficiency might be improved. The @code{sb-ext:code-deletion-note}, a
 159 subtype of @code{compiler-note}, is signalled when the compiler
 160 deletes user-supplied code after proving that the code in question is
 161 unreachable.
 162
 163 Future work for SBCL includes expanding this hierarchy of types to
 164 allow more fine-grained control over emission of diagnostic messages.
 165
 166 @include condition-sb-ext-compiler-note.texinfo
 167 @include condition-sb-ext-code-deletion-note.texinfo
 168
 169
 170 @node Understanding Compiler Diagnostics
 171 @comment  node-name,  next,  previous,  up
 172 @subsection Understanding Compile Diagnostics
 173
 174 The messages emitted by the compiler contain a lot of detail in a
 175 terse format, so they may be confusing at first. The messages will be
 176 illustrated using this example program:
 177
 178 @lisp
 179 (defmacro zoq (x)
 180   `(roq (ploq (+ ,x 3))))
 181
 182 (defun foo (y)
 183   (declare (symbol y))
 184   (zoq y))
 185 @end lisp
 186
 187 The main problem with this program is that it is trying to add
 188 @code{3} to a symbol. Note also that the functions @code{roq} and
 189 @code{ploq} aren't defined anywhere.
 190
 191 @menu
 192 * The Parts of a Compiler Diagnostic::
 193 * The Original and Actual Source::
 194 * The Processing Path::
 195 @end menu
 196
 197 @node  The Parts of a Compiler Diagnostic
 198 @comment  node-name,  next,  previous,  up
 199 @subsubsection The Parts of a Compiler Diagnostic
 200
 201 When processing this program, the compiler will produce this warning:
 202
 203 @example
 204 ; file: /tmp/foo.lisp
 205 ; in: DEFUN FOO
 206 ;     (ZOQ Y)
 207 ; --> ROQ PLOQ
 208 ; ==>
 209 ;   (+ Y 3)
 210 ;
 211 ; caught WARNING:
 212 ;   Asserted type NUMBER conflicts with derived type (VALUES SYMBOL &OPTIONAL).
 213 @end example
 214
 215 In this example we see each of the six possible parts of a compiler
 216 diagnostic:
 217
 218 @enumerate
 219
 220 @item
 221 @findex @cl{with-compilation-unit}
 222 @samp{file: /tmp/foo.lisp} This is the name of the file that the
 223 compiler read the relevant code from.  The file name is displayed
 224 because it may not be immediately obvious when there is an error
 225 during compilation of a large system, especially when
 226 @code{with-compilation-unit} is used to delay undefined warnings.
 227
 228 @item
 229 @samp{in: DEFUN FOO} This is the definition top level form responsible
 230 for the diagnostic. It is obtained by taking the first two elements of
 231 the enclosing form whose first element is a symbol beginning with
 232 ``@samp{def}''. If there is no such enclosing ``@samp{def}'' form,
 233 then the outermost form is used. If there are multiple @samp{def}
 234 forms, then they are all printed from the outside in, separated by
 235 @samp{=>}'s. In this example, the problem was in the @code{defun} for
 236 @code{foo}.
 237
 238 @item
 239 @cindex Original Source
 240 @samp{(ZOQ Y)} This is the @dfn{original source} form responsible for
 241 the diagnostic. Original source means that the form directly appeared
 242 in the original input to the compiler, i.e. in the lambda passed to
 243 @code{compile} or in the top level form read from the source file. In
 244 this example, the expansion of the @code{zoq} macro was responsible
 245 for the message.
 246
 247 @item
 248 @cindex Processing Path
 249 @samp{--> ROQ PLOQ} This is the @dfn{processing path} that the
 250 compiler used to produce the code that caused the message to be
 251 emitted. The processing path is a representation of the evaluated
 252 forms enclosing the actual source that the compiler encountered when
 253 processing the original source. The path is the first element of each
 254 form, or the form itself if the form is not a list. These forms result
 255 from the expansion of macros or source-to-source transformation done
 256 by the compiler. In this example, the enclosing evaluated forms are
 257 the calls to @code{roq} and @code{ploq}. These calls resulted from the
 258 expansion of the @code{zoq} macro.
 259
 260 @item
 261 @cindex Actual Source
 262 @samp{==> (+ Y 3)} This is the @dfn{actual source} responsible for the
 263 diagnostic. If the actual source appears in the explanation, then we
 264 print the next enclosing evaluated form, instead of printing the
 265 actual source twice. (This is the form that would otherwise have been
 266 the last form of the processing path.) In this example, the problem is
 267 with the evaluation of the reference to the variable @code{y}.
 268
 269 @item
 270 @samp{caught WARNING: Asserted type NUMBER conflicts with derived type
 271 (VALUES SYMBOL &OPTIONAL).}  This is the @dfn{explanation} of the
 272 problem. In this example, the problem is that, while the call to
 273 @code{+} requires that its arguments are all of type @code{number},
 274 the compiler has derived that @code{y} will evaluate to a
 275 @code{symbol}.  Note that @samp{(VALUES SYMBOL &OPTIONAL)} expresses
 276 that @code{y} evaluates to precisely one value.
 277
 278 @end enumerate
 279
 280 Note that each part of the message is distinctively marked:
 281
 282 @itemize
 283
 284 @item
 285  @samp{file:} and @samp{in:} mark the file and definition,
 286 respectively.
 287
 288 @item
 289 The original source is an indented form with no prefix.
 290
 291 @item
 292 Each line of the processing path is prefixed with @samp{-->}
 293
 294 @item
 295 The actual source form is indented like the original source, but is
 296 marked by a preceding @samp{==>} line.
 297 @comment no it isn't.
 298
 299 @item
 300 The explanation is prefixed with the diagnostic severity, which can be
 301 @samp{caught ERROR:}, @samp{caught WARNING:}, @samp{caught
 302 STYLE-WARNING:}, or @samp{note:}.
 303
 304 @end itemize
 305
 306 Each part of the message is more specific than the preceding one. If
 307 consecutive messages are for nearby locations, then the front part of
 308 the messages would be the same. In this case, the compiler omits as
 309 much of the second message as in common with the first. For example:
 310
 311 @example
 312 ; file: /tmp/foo.lisp
 313 ; in: DEFUN FOO
 314 ;     (ZOQ Y)
 315 ; --> ROQ
 316 ; ==>
 317 ;   (PLOQ (+ Y 3))
 318 ;
 319 ; caught STYLE-WARNING:
 320 ;   undefined function: PLOQ
 321
 322 ; ==>
 323 ;   (ROQ (PLOQ (+ Y 3)))
 324 ;
 325 ; caught STYLE-WARNING:
 326 ;   undefined function: ROQ
 327 @end example
 328 @comment fixing that weird blank line might be good
 329
 330 In this example, the file, definition and original source are
 331 identical for the two messages, so the compiler omits them in the
 332 second message. If consecutive messages are entirely identical, then
 333 the compiler prints only the first message, followed by: @samp{[Last
 334 message occurs @var{repeats} times]} where @var{repeats} is the number
 335 of times the message was given.
 336
 337 If the source was not from a file, then no file line is printed. If
 338 the actual source is the same as the original source, then the
 339 processing path and actual source will be omitted. If no forms
 340 intervene between the original source and the actual source, then the
 341 processing path will also be omitted.
 342
 343
 344 @node  The Original and Actual Source
 345 @comment  node-name,  next,  previous,  up
 346 @subsubsection The Original and Actual Source
 347 @cindex Original Source
 348 @cindex Actual Source
 349
 350 The @emph{original source} displayed will almost always be a list. If
 351 the actual source for an message is a symbol, the original source will
 352 be the immediately enclosing evaluated list form. So even if the
 353 offending symbol does appear in the original source, the compiler will
 354 print the enclosing list and then print the symbol as the actual
 355 source (as though the symbol were introduced by a macro.)
 356
 357 When the @emph{actual source} is displayed (and is not a symbol), it
 358 will always be code that resulted from the expansion of a macro or a
 359 source-to-source compiler optimization. This is code that did not
 360 appear in the original source program; it was introduced by the
 361 compiler.
 362
 363 Keep in mind that when the compiler displays a source form in an
 364 diagnostic message, it always displays the most specific (innermost)
 365 responsible form. For example, compiling this function
 366
 367 @lisp
 368 (defun bar (x)
 369   (let (a)
 370     (declare (fixnum a))
 371     (setq a (foo x))
 372     a))
 373 @end lisp
 374
 375 gives this error message
 376
 377 @example
 378 ; file: /tmp/foo.lisp
 379 ; in: DEFUN BAR
 380 ;     (LET (A)
 381 ;     (DECLARE (FIXNUM A))
 382 ;     (SETQ A (FOO X))
 383 ;     A)
 384 ;
 385 ; caught WARNING:
 386 ;   Asserted type FIXNUM conflicts with derived type (VALUES NULL &OPTIONAL).
 387 @end example
 388
 389 This message is not saying ``there is a problem somewhere in this
 390 @code{let}'' -- it is saying that there is a problem with the
 391 @code{let} itself. In this example, the problem is that @code{a}'s
 392 @code{nil} initial value is not a @code{fixnum}.
 393
 394 @node The Processing Path
 395 @comment  node-name,  next,  previous,  up
 396 @subsubsection The Processing Path
 397 @cindex Processing Path
 398 @cindex Macroexpansion
 399 @cindex Source-to-source transformation
 400
 401 The processing path is mainly useful for debugging macros, so if you
 402 don't write macros, you can probably ignore it. Consider this example:
 403
 404 @lisp
 405 (defun foo (n)
 406   (dotimes (i n *undefined*)))
 407 @end lisp
 408
 409 Compiling results in this error message:
 410
 411 @example
 412 ; in: DEFUN FOO
 413 ;     (DOTIMES (I N *UNDEFINED*))
 414 ; --> DO BLOCK LET TAGBODY RETURN-FROM
 415 ; ==>
 416 ;   (PROGN *UNDEFINED*)
 417 ;
 418 ; caught WARNING:
 419 ;   undefined variable: *UNDEFINED*
 420 @end example
 421
 422 Note that @code{do} appears in the processing path. This is because
 423 @code{dotimes} expands into:
 424
 425 @lisp
 426 (do ((i 0 (1+ i)) (#:g1 n))
 427     ((>= i #:g1) *undefined*)
 428   (declare (type unsigned-byte i)))
 429 @end lisp
 430
 431 The rest of the processing path results from the expansion of
 432 @code{do}:
 433
 434 @lisp
 435 (block nil
 436   (let ((i 0) (#:g1 n))
 437     (declare (type unsigned-byte i))
 438     (tagbody (go #:g3)
 439       #:g2    (psetq i (1+ i))
 440       #:g3    (unless (>= i #:g1) (go #:g2))
 441       (return-from nil (progn *undefined*)))))
 442 @end lisp
 443
 444 In this example, the compiler descended into the @code{block},
 445 @code{let}, @code{tagbody} and @code{return-from} to reach the
 446 @code{progn} printed as the actual source. This is a place where the
 447 ``actual source appears in explanation'' rule was applied. The
 448 innermost actual source form was the symbol @code{*undefined*} itself,
 449 but that also appeared in the explanation, so the compiler backed out
 450 one level.
 451
 452
 453
 454
 455
 456 @node  Handling of Types
 457 @comment  node-name,  next,  previous,  up
 458 @section Handling of Types
 459
 460 One of the most important features of the SBCL compiler (similar to
 461 the original CMUCL compiler, also known as @dfn{Python}) is its fairly
 462 sophisticated understanding of the Common Lisp type system and its
 463 conservative approach to the implementation of type declarations.
 464
 465 These two features reward the use of type declarations throughout
 466 development, even when high performance is not a concern. Also, as
 467 discussed in the chapter on performance (@pxref{Efficiency}), the use
 468 of appropriate type declarations can be very important for performance
 469 as well.
 470
 471 @findex @cl{satisfies}
 472 The SBCL compiler also has a greater knowledge of the Common Lisp
 473 type system than other compilers. Support is incomplete only for types
 474 involving the @code{satisfies} type specifier.
 475
 476 @c <!-- FIXME: See also sections \ref{advanced-type-stuff}
 477 @c      and \ref{type-inference}, once we snarf them from the
 478 @c      CMU CL manual. -->
 479 @c
 480 @c Also see my paper on improving Baker, when I get round to it.
 481 @c
 482 @c Whose paper?
 483
 484 @menu
 485 * Declarations as Assertions::
 486 * Precise Type Checking::
 487 * Getting Existing Programs to Run::
 488 * Implementation Limitations::
 489 @end menu
 490
 491 @node Declarations as Assertions
 492 @comment  node-name,  next,  previous,  up
 493 @subsection Declarations as Assertions
 494 @cindex Safety optimization quality
 495
 496 The SBCL compiler treats type declarations differently from most other
 497 Lisp compilers. Under default compilation policy the compiler doesn't
 498 blindly believe type declarations, but considers them assertions about
 499 the program that should be checked: all type declarations that have
 500 not been proven to always hold are asserted at runtime.
 501
 502 @quotation
 503 @emph{Remaining bugs in the compiler's handling of types unfortunately
 504 provide some exceptions to this rule, see @ref{Implementation
 505 Limitations}.}
 506 @end quotation
 507
 508 CLOS slot types form a notable exception. Types declared using the
 509 @code{:type} slot option in @code{defclass} are asserted if and only
 510 if the class was defined in @emph{safe code} and the slot access
 511 location is in @emph{safe code} as well. This laxness does not pose
 512 any internal consistency issues, as the CLOS slot types are not
 513 available for the type inferencer, nor do CLOS slot types provide any
 514 efficiency benefits.
 515
 516 There are three type checking policies available in SBCL, selectable
 517 via @code{optimize} declarations.
 518
 519 @table @strong
 520
 521 @c FIXME: This should be properly integrated with general policy
 522 @c stuff, once that gets cleaned up.
 523
 524 @item Full Type Checks
 525 All declarations are considered assertions to be checked at runtime,
 526 and all type checks are precise. The default compilation policy
 527 provides full type checks.
 528
 529 Used when @code{(or (>= safety 2) (>= safety speed 1))}.
 530
 531 @item Weak Type Checks
 532 Declared types may be simplified into faster to check supertypes: for
 533 example, @code{(or (integer -17 -7) (integer 7 17))} is simplified
 534 into @code{(integer -17 17)}.
 535
 536 @strong{Note}: it is relatively easy to corrupt the heap when weak
 537 type checks are used if the program contains type-errors.
 538
 539 Used when @code{(and (< safety 2) (< safety speed))}
 540
 541 @item No Type Checks
 542 All declarations are believed without assertions. Also disables
 543 argument count and array bounds checking.
 544
 545 @strong{Note}: any type errors in code where type checks are not
 546 performed are liable to corrupt the heap.
 547
 548 Used when @code{(= safety 0)}.
 549
 550 @end table
 551
 552 @node  Precise Type Checking
 553 @comment  node-name,  next,  previous,  up
 554 @subsection Precise Type Checking
 555 @cindex Precise type checking
 556 @cindex Type checking, precise
 557
 558 Precise checking means that the check is done as though @code{typep}
 559 had been called with the exact type specifier that appeared in the
 560 declaration.
 561
 562 If a variable is declared to be @code{(integer 3 17)} then its value
 563 must always be an integer between @code{3} and @code{17}. If multiple
 564 type declarations apply to a single variable, then all the
 565 declarations must be correct; it is as though all the types were
 566 intersected producing a single @code{and} type specifier.
 567
 568 To gain maximum benefit from the compiler's type checking, you should
 569 always declare the types of function arguments and structure slots as
 570 precisely as possible. This often involves the use of @code{or},
 571 @code{member}, and other list-style type specifiers.
 572
 573
 574 @node  Getting Existing Programs to Run
 575 @comment  node-name,  next,  previous,  up
 576 @subsection Getting Existing Programs to Run
 577 @cindex Existing programs, to run
 578 @cindex Types, portability
 579 @cindex Compatibility with other Lisps
 580 @c     (should also have an entry in the non-ANSI-isms section)-->
 581
 582 Since SBCL's compiler does much more comprehensive type checking than
 583 most Lisp compilers, SBCL may detect type errors in programs that have
 584 been debugged using other compilers. These errors are mostly incorrect
 585 declarations, although compile-time type errors can find actual bugs
 586 if parts of the program have never been tested.
 587
 588 Some incorrect declarations can only be detected by run-time type
 589 checking. It is very important to initially compile a program with
 590 full type checks (high @code{safety} optimization) and then test this
 591 safe version. After the checking version has been tested, then you can
 592 consider weakening or eliminating type checks.  @emph{This applies
 593 even to previously debugged programs,} because the SBCL compiler does
 594 much more type inference than other Common Lisp compilers, so an
 595 incorrect declaration can do more damage.
 596
 597 The most common problem is with variables whose constant initial value
 598 doesn't match the type declaration. Incorrect constant initial values
 599 will always be flagged by a compile-time type error, and they are
 600 simple to fix once located. Consider this code fragment:
 601
 602 @lisp
 603 (prog (foo)
 604   (declare (fixnum foo))
 605   (setq foo ...)
 606   ...)
 607 @end lisp
 608
 609 Here @code{foo} is given an initial value of @code{nil}, but is
 610 declared to be a @code{fixnum}.  Even if it is never read, the initial
 611 value of a variable must match the declared type.  There are two ways
 612 to fix this problem. Change the declaration
 613
 614 @lisp
 615 (prog (foo)
 616   (declare (type (or fixnum null) foo))
 617   (setq foo ...)
 618   ...)
 619 @end lisp
 620
 621 or change the initial value
 622
 623 @lisp
 624 (prog ((foo 0))
 625   (declare (fixnum foo))
 626   (setq foo ...)
 627   ...)
 628 @end lisp
 629
 630 It is generally preferable to change to a legal initial value rather
 631 than to weaken the declaration, but sometimes it is simpler to weaken
 632 the declaration than to try to make an initial value of the
 633 appropriate type.
 634
 635 Another declaration problem occasionally encountered is incorrect
 636 declarations on @code{defmacro} arguments. This can happen when a
 637 function is converted into a macro. Consider this macro:
 638
 639 @lisp
 640 (defmacro my-1+ (x)
 641   (declare (fixnum x))
 642   `(the fixnum (1+ ,x)))
 643 @end lisp
 644
 645 Although legal and well-defined Common Lisp code, this meaning of this
 646 definition is almost certainly not what the writer intended. For
 647 example, this call is illegal:
 648
 649 @lisp
 650 (my-1+ (+ 4 5))
 651 @end lisp
 652
 653 This call is illegal because the argument to the macro is @code{(+ 4
 654 5)}, which is a @code{list}, not a @code{fixnum}.  Because of macro
 655 semantics, it is hardly ever useful to declare the types of macro
 656 arguments.  If you really want to assert something about the type of
 657 the result of evaluating a macro argument, then put a @code{the} in
 658 the expansion:
 659
 660 @lisp
 661 (defmacro my-1+ (x)
 662   `(the fixnum (1+ (the fixnum ,x))))
 663 @end lisp
 664
 665 In this case, it would be stylistically preferable to change this
 666 macro back to a function and declare it inline.
 667 @c <!--FIXME: <xref>inline-expansion, once we crib the
 668 @c      relevant text from the CMU CL manual.-->
 669
 670 Some more subtle problems are caused by incorrect declarations that
 671 can't be detected at compile time.  Consider this code:
 672
 673 @lisp
 674 (do ((pos 0 (position #\a string :start (1+ pos))))
 675   ((null pos))
 676   (declare (fixnum pos))
 677   ...)
 678 @end lisp
 679
 680 Although @code{pos} is almost always a @code{fixnum}, it is @code{nil}
 681 at the end of the loop. If this example is compiled with full type
 682 checks (the default), then running it will signal a type error at the
 683 end of the loop. If compiled without type checks, the program will go
 684 into an infinite loop (or perhaps @code{position} will complain
 685 because @code{(1+ nil)} isn't a sensible start.) Why? Because if you
 686 compile without type checks, the compiler just quietly believes the
 687 type declaration. Since the compiler believes that @code{pos} is
 688 always a @code{fixnum}, it believes that @code{pos} is never
 689 @code{nil}, so @code{(null pos)} is never true, and the loop exit test
 690 is optimized away. Such errors are sometimes flagged by unreachable
 691 code notes, but it is still important to initially compile and test
 692 any system with full type checks, even if the system works fine when
 693 compiled using other compilers.
 694
 695 In this case, the fix is to weaken the type declaration to @code{(or
 696 fixnum null)} @footnote{Actually, this declaration is unnecessary in
 697 SBCL, since it already knows that @code{position} returns a
 698 non-negative @code{fixnum} or @code{nil}.}.
 699
 700 Note that there is usually little performance penalty for weakening a
 701 declaration in this way. Any numeric operations in the body can still
 702 assume that the variable is a @code{fixnum}, since @code{nil} is not a
 703 legal numeric argument. Another possible fix would be to say:
 704
 705 @lisp
 706 (do ((pos 0 (position #\a string :start (1+ pos))))
 707     ((null pos))
 708   (let ((pos pos))
 709     (declare (fixnum pos))
 710     ...))
 711 @end lisp
 712
 713 This would be preferable in some circumstances, since it would allow a
 714 non-standard representation to be used for the local @code{pos}
 715 variable in the loop body.
 716 @c <!-- FIXME: <xref>ND-variables, once we crib the text from the
 717 @c      CMU CL manual. -->
 718
 719 @node  Implementation Limitations
 720 @comment  node-name,  next,  previous,  up
 721 @subsection Implementation Limitations
 722
 723 Ideally, the compiler would consider @emph{all} type declarations to
 724 be assertions, so that adding type declarations to a program, no
 725 matter how incorrect they might be, would @emph{never} cause undefined
 726 behavior. However, the compiler is known to fall short of this goal in
 727 two areas:
 728
 729 @itemize
 730
 731 @item
 732 @emph{Proclaimed} constraints on argument and result types of a
 733 function are supposed to be checked by the function. If the function
 734 type is proclaimed before function definition, type checks are
 735 inserted by the compiler, but the standard allows the reversed order,
 736 in which case the compiler will trust the declaration.
 737
 738 @item
 739 The compiler cannot check types of an unknown number of values; if the
 740 number of generated values is unknown, but the number of consumed is
 741 known, only consumed values are checked.
 742
 743 For example,
 744
 745 @lisp
 746 (defun foo (x)
 747   (the integer (bar x)))
 748 @end lisp
 749
 750 causes the following compiler diagnostic to be emitted:
 751
 752 @example
 753 ; note: type assertion too complex to check:
 754 ;  (VALUES INTEGER &REST T).
 755 @end example
 756
 757 A partial workaround is instead write:
 758
 759 @lisp
 760 (defun foo (x)
 761   (the (values integer &optional) (bar x)))
 762 @end lisp
 763
 764 @end itemize
 765
 766 These are important issues, but are not necessarily easy to fix, so
 767 they may, alas, remain in the system for a while.
 768
 769 @node Compiler Policy
 770 @comment  node-name,  next,  previous,  up
 771 @section Compiler Policy
 772
 773 Compiler policy is controlled by the @code{optimize} declaration,
 774 supporting all ANSI optimization qualities (@code{debug},
 775 @code{safety}, @code{space}, and @code{speed}).@footnote{A deprecated
 776 extension @code{sb-ext:inhibit-warnings} is still supported, but
 777 liable to go away at any time.}
 778
 779 For effects of various optimization qualities on type-safety and
 780 debuggability see @ref{Declarations as Assertions} and @ref{Debugger
 781 Policy Control}.
 782
 783 Ordinarily, when the @code{speed} quality is high, the compiler emits
 784 notes to notify the programmer about its inability to apply various
 785 optimizations. For selective muffling of these notes @xref{Controlling
 786 Verbosity}.
 787
 788 The value of @code{space} mostly influences the compiler's decision
 789 whether to inline operations, which tend to increase the size of
 790 programs. Use the value @code{0} with caution, since it can cause the
 791 compiler to inline operations so indiscriminately that the net effect
 792 is to slow the program by causing cache misses or even swapping.
 793
 794 @c <!-- FIXME: old CMU CL compiler policy, should perhaps be adapted
 795 @c      _    for SBCL. (Unfortunately, the CMU CL docs are out of sync with the
 796 @c      _    CMU CL code, so adapting this requires not only reformatting
 797 @c      _    the documentation, but rooting out code rot.)
 798 @c      _
 799 @c      _<sect2 id="compiler-policy"><title>Compiler Policy</1000
 800 @c      _  INDEX {policy}{compiler}
 801 @c      _  INDEX compiler policy
 802 @c      _
 803 @c      _<para>The policy is what tells the compiler <emphasis>how</emphasis> to
 804 @c      _compile a program. This is logically (and often textually) distinct
 805 @c      _from the program itself. Broad control of policy is provided by the
 806 @c      _<parameter>optimize</parameter> declaration; other declarations and variables
 807 @c      _control more specific aspects of compilation.
 808 @c      _
 809 @c      _\begin{comment}
 810 @c      _* The Optimize Declaration::
 811 @c      _* The Optimize-Interface Declaration::
 812 @c      _\end{comment}
 813 @c      _
 814 @c      _%%\node The Optimize Declaration, The Optimize-Interface Declaration, Compiler Policy, Compiler Policy
 815 @c      _\subsection{The Optimize Declaration}
 816 @c      _\label{optimize-declaration}
 817 @c      _\cindex{optimize declaration}
 818 @c      _\cpsubindex{declarations}{\code{optimize}}
 819 @c      _
 820 @c      _The \code{optimize} declaration recognizes six different
 821 @c      _\var{qualities}.  The qualities are conceptually independent aspects
 822 @c      _of program performance.  In reality, increasing one quality tends to
 823 @c      _have adverse effects on other qualities.  The compiler compares the
 824 @c      _relative values of qualities when it needs to make a trade-off; i.e.,
 825 @c      _if \code{speed} is greater than \code{safety}, then improve speed at
 826 @c      _the cost of safety.
 827 @c      _
 828 @c      _The default for all qualities (except \code{debug}) is \code{1}.
 829 @c      _Whenever qualities are equal, ties are broken according to a broad
 830 @c      _idea of what a good default environment is supposed to be.  Generally
 831 @c      _this downplays \code{speed}, \code{compile-speed} and \code{space} in
 832 @c      _favor of \code{safety} and \code{debug}.  Novice and casual users
 833 @c      _should stick to the default policy.  Advanced users often want to
 834 @c      _improve speed and memory usage at the cost of safety and
 835 @c      _debuggability.
 836 @c      _
 837 @c      _If the value for a quality is \code{0} or \code{3}, then it may have a
 838 @c      _special interpretation.  A value of \code{0} means ``totally
 839 @c      _unimportant'', and a \code{3} means ``ultimately important.''  These
 840 @c      _extreme optimization values enable ``heroic'' compilation strategies
 841 @c      _that are not always desirable and sometimes self-defeating.
 842 @c      _Specifying more than one quality as \code{3} is not desirable, since
 843 @c      _it doesn't tell the compiler which quality is most important.
 844 @c      _
 845 @c      _
 846 @c      _These are the optimization qualities:
 847 @c      _\begin{Lentry}
 848 @c      _
 849 @c      _\item[\code{speed}] \cindex{speed optimization quality}How fast the
 850 @c      _  program should is run.  \code{speed 3} enables some optimizations
 851 @c      _  that hurt debuggability.
 852 @c      _
 853 @c      _\item[\code{compilation-speed}] \cindex{compilation-speed optimization
 854 @c      _    quality}How fast the compiler should run.  Note that increasing
 855 @c      _  this above \code{safety} weakens type checking.
 856 @c      _
 857 @c      _\item[\code{space}] \cindex{space optimization quality}How much space
 858 @c      _  the compiled code should take up.  Inline expansion is mostly
 859 @c      _  inhibited when \code{space} is greater than \code{speed}.  A value
 860 @c      _  of \code{0} enables indiscriminate inline expansion.  Wide use of a
 861 @c      _  \code{0} value is not recommended, as it may waste so much space
 862 @c      _  that run time is slowed.  \xlref{inline-expansion} for a discussion
 863 @c      _  of inline expansion.
 864 @c      _
 865 @c      _\item[\code{debug}] \cindex{debug optimization quality}How debuggable
 866 @c      _  the program should be.  The quality is treated differently from the
 867 @c      _  other qualities: each value indicates a particular level of debugger
 868 @c      _  information; it is not compared with the other qualities.
 869 @c      _  \xlref{debugger-policy} for more details.
 870 @c      _
 871 @c      _\item[\code{safety}] \cindex{safety optimization quality}How much
 872 @c      _  error checking should be done.  If \code{speed}, \code{space} or
 873 @c      _  \code{compilation-speed} is more important than \code{safety}, then
 874 @c      _  type checking is weakened (\pxlref{weakened-type-checks}).  If
 875 @c      _  \code{safety} if \code{0}, then no run time error checking is done.
 876 @c      _  In addition to suppressing type checks, \code{0} also suppresses
 877 @c      _  argument count checking, unbound-symbol checking and array bounds
 878 @c      _  checks.
 879 @c      _  ... and checking of tag existence in RETURN-FROM and GO.
 880 @c      _
 881 @c      _\item[\code{extensions:inhibit-warnings}] \cindex{inhibit-warnings
 882 @c      _    optimization quality}This is a CMU extension that determines how
 883 @c      _  little (or how much) diagnostic output should be printed during
 884 @c      _  compilation.  This quality is compared to other qualities to
 885 @c      _  determine whether to print style notes and warnings concerning those
 886 @c      _  qualities.  If \code{speed} is greater than \code{inhibit-warnings},
 887 @c      _  then notes about how to improve speed will be printed, etc.  The
 888 @c      _  default value is \code{1}, so raising the value for any standard
 889 @c      _  quality above its default enables notes for that quality.  If
 890 @c      _  \code{inhibit-warnings} is \code{3}, then all notes and most
 891 @c      _  non-serious warnings are inhibited.  This is useful with
 892 @c      _  \code{declare} to suppress warnings about unavoidable problems.
 893 @c      _\end{Lentry}
 894 @c      _
 895 @c      _%%\node The Optimize-Interface Declaration,  , The Optimize Declaration, Compiler Policy
 896 @c      _\subsection{The Optimize-Interface Declaration}
 897 @c      _\label{optimize-interface-declaration}
 898 @c      _\cindex{optimize-interface declaration}
 899 @c      _\cpsubindex{declarations}{\code{optimize-interface}}
 900 @c      _
 901 @c      _The \code{extensions:optimize-interface} declaration is identical in
 902 @c      _syntax to the \code{optimize} declaration, but it specifies the policy
 903 @c      _used during compilation of code the compiler automatically generates
 904 @c      _to check the number and type of arguments supplied to a function.  It
 905 @c      _is useful to specify this policy separately, since even thoroughly
 906 @c      _debugged functions are vulnerable to being passed the wrong arguments.
 907 @c      _The \code{optimize-interface} declaration can specify that arguments
 908 @c      _should be checked even when the general \code{optimize} policy is
 909 @c      _unsafe.
 910 @c      _
 911 @c      _Note that this argument checking is the checking of user-supplied
 912 @c      _arguments to any functions defined within the scope of the
 913 @c      _declaration, \code{not} the checking of arguments to \llisp{}
 914 @c      _primitives that appear in those definitions.
 915 @c      _
 916 @c      _The idea behind this declaration is that it allows the definition of
 917 @c      _functions that appear fully safe to other callers, but that do no
 918 @c      _internal error checking.  Of course, it is possible that arguments may
 919 @c      _be invalid in ways other than having incorrect type.  Functions
 920 @c      _compiled unsafely must still protect themselves against things like
 921 @c      _user-supplied array indices that are out of bounds and improper lists.
 922 @c      _See also the \kwd{context-declarations} option to
 923 @c      _\macref{with-compilation-unit}.
 924 @c      _
 925 @c      _(end of section on compiler policy)
 926 @c      _-->
 927
 928 @include fun-sb-ext-describe-compiler-policy.texinfo
 929 @include fun-sb-ext-restrict-compiler-policy.texinfo
 930 @include macro-common-lisp-with-compilation-unit.texinfo
 931
 932 @node Compiler Errors
 933 @comment  node-name,  next,  previous,  up
 934 @section Compiler Errors
 935
 936 @menu
 937 * Type Errors at Compile Time::
 938 * Errors During Macroexpansion::
 939 * Read Errors::
 940 @end menu
 941
 942 @node  Type Errors at Compile Time
 943 @comment  node-name,  next,  previous,  up
 944 @subsection Type Errors at Compile Time
 945 @cindex Compile time type errors
 946 @cindex Type checking, at compile time
 947
 948 If the compiler can prove at compile time that some portion of the
 949 program cannot be executed without a type error, then it will give a
 950 warning at compile time.
 951
 952 It is possible that the offending code would never actually be
 953 executed at run-time due to some higher level consistency constraint
 954 unknown to the compiler, so a type warning doesn't always indicate an
 955 incorrect program.
 956
 957 For example, consider this code fragment:
 958
 959 @lisp
 960 (defun raz (foo)
 961   (let ((x (case foo
 962               (:this 13)
 963               (:that 9)
 964               (:the-other 42))))
 965     (declare (fixnum x))
 966     (foo x)))
 967 @end lisp
 968
 969 Compilation produces this warning:
 970
 971 @example
 972 ; in: DEFUN RAZ
 973 ;     (CASE FOO (:THIS 13) (:THAT 9) (:THE-OTHER 42))
 974 ; --> LET COND IF COND IF COND IF
 975 ; ==>
 976 ;   (COND)
 977 ;
 978 ; caught WARNING:
 979 ;   This is not a FIXNUM:
 980 ;   NIL
 981 @end example
 982
 983 In this case, the warning means that if @code{foo} isn't any of
 984 @code{:this}, @code{:that} or @code{:the-other}, then @code{x} will be
 985 initialized to @code{nil}, which the @code{fixnum} declaration makes
 986 illegal. The warning will go away if @code{ecase} is used instead of
 987 @code{case}, or if @code{:the-other} is changed to @code{t}.
 988
 989 This sort of spurious type warning happens moderately often in the
 990 expansion of complex macros and in inline functions. In such cases,
 991 there may be dead code that is impossible to correctly execute. The
 992 compiler can't always prove this code is dead (could never be
 993 executed), so it compiles the erroneous code (which will always signal
 994 an error if it is executed) and gives a warning.
 995
 996 @node  Errors During Macroexpansion
 997 @comment  node-name,  next,  previous,  up
 998 @subsection Errors During Macroexpansion
 999 @cindex Macroexpansion, errors during
1000
1001 The compiler handles errors that happen during macroexpansion, turning
1002 them into compiler errors. If you want to debug the error (to debug a
1003 macro), you can set @code{*break-on-signals*} to @code{error}. For
1004 example, this definition:
1005
1006 @lisp
1007 (defun foo (e l)
1008   (do ((current l (cdr current))
1009        ((atom current) nil))
1010       (when (eq (car current) e) (return current))))
1011 @end lisp
1012
1013 gives this error:
1014
1015 @example
1016 ; in: DEFUN FOO
1017 ;     (DO ((CURRENT L (CDR CURRENT))
1018 ;        ((ATOM CURRENT) NIL))
1019 ;       (WHEN (EQ (CAR CURRENT) E) (RETURN CURRENT)))
1020 ;
1021 ; caught ERROR:
1022 ;   (in macroexpansion of (DO # #))
1023 ;   (hint: For more precise location, try *BREAK-ON-SIGNALS*.)
1024 ;   DO step variable is not a symbol: (ATOM CURRENT)
1025 @end example
1026
1027
1028 @node  Read Errors
1029 @comment  node-name,  next,  previous,  up
1030 @subsection Read Errors
1031 @cindex Read errors, compiler
1032
1033 SBCL's compiler does not attempt to recover from read errors when
1034 reading a source file, but instead just reports the offending
1035 character position and gives up on the entire source file.
1036
1037 @node  Open Coding and Inline Expansion
1038 @comment  node-name,  next,  previous,  up
1039 @section Open Coding and Inline Expansion
1040 @cindex Open-coding
1041 @cindex Inline expansion
1042 @cindex Static functions
1043
1044 Since Common Lisp forbids the redefinition of standard functions, the
1045 compiler can have special knowledge of these standard functions
1046 embedded in it. This special knowledge is used in various ways (open
1047 coding, inline expansion, source transformation), but the implications
1048 to the user are basically the same:
1049
1050 @itemize
1051
1052 @item
1053 Attempts to redefine standard functions may be frustrated, since the
1054 function may never be called. Although it is technically illegal to
1055 redefine standard functions, users sometimes want to implicitly
1056 redefine these functions when they are debugging using the
1057 @code{trace} macro.  Special-casing of standard functions can be
1058 inhibited using the @code{notinline} declaration, but even then some
1059 phases of analysis such as type inferencing are applied by the
1060 compiler.
1061
1062 @item
1063 The compiler can have multiple alternate implementations of standard
1064 functions that implement different trade-offs of speed, space and
1065 safety.  This selection is based on the compiler policy, @ref{Compiler
1066 Policy}.
1067
1068 @end itemize
1069
1070 When a function call is @emph{open coded}, inline code whose effect is
1071 equivalent to the function call is substituted for that function
1072 call. When a function call is @emph{closed coded}, it is usually left
1073 as is, although it might be turned into a call to a different function
1074 with different arguments. As an example, if @code{nthcdr} were to be
1075 open coded, then
1076
1077 @lisp
1078 (nthcdr 4 foobar)
1079 @end lisp
1080
1081 might turn into
1082
1083 @lisp
1084 (cdr (cdr (cdr (cdr foobar))))
1085 @end lisp
1086
1087 or even
1088
1089 @lisp
1090 (do ((i 0 (1+ i))
1091   (list foobar (cdr foobar)))
1092   ((= i 4) list))
1093 @end lisp
1094
1095 If @code{nth} is closed coded, then
1096
1097 @lisp
1098 (nth x l)
1099 @end lisp
1100
1101 might stay the same, or turn into something like
1102
1103 @lisp
1104 (car (nthcdr x l))
1105 @end lisp
1106
1107 In general, open coding sacrifices space for speed, but some functions
1108 (such as @code{car}) are so simple that they are always
1109 open-coded. Even when not open-coded, a call to a standard function
1110 may be transformed into a different function call (as in the last
1111 example) or compiled as @emph{static call}. Static function call uses
1112 a more efficient calling convention that forbids redefinition.
1113
1114 @node  Interpreter
1115 @comment  node-name,  next,  previous,  up
1116 @section Interpreter
1117 @cindex Interpreter
1118 @findex @cl{eval}
1119 @vindex @sbext{@earmuffs{evaluator-mode}}
1120
1121 By default SBCL implements @code{eval} by calling the native code
1122 compiler.
1123
1124 SBCL also includes an interpreter for use in special cases where using
1125 the compiler is undesirable, for example due to compilation overhead.
1126 Unlike in some other Lisp implementations, in SBCL interpreted code is
1127 not safer or more debuggable than compiled code.
1128
1129 @include var-sb-ext-star-evaluator-mode-star.texinfo
1130
1131 @node  Advanced Compiler Use and Efficiency Hints
1132 @comment  node-name,  next,  previous,  up
1133 @section Advanced Compiler Use and Efficiency Hints
1134 For more advanced usages of the compiler, please see the chapter of the
1135 same name in the CMUCL manual. Many aspects of the compiler have stayed
1136 exactly the same, and there is a much more detailed explanation of the
1137 compiler's behavior and how to maximally optimize code in their
1138 manual. In particular, while SBCL no longer supports byte-code
1139 compilation, it does support CMUCL's block compilation facility allowing
1140 whole program optimization and increased use of the local call
1141 convention.
1142
1143 Unlike CMUCL, SBCL is able to open-code forward-referenced type tests
1144 while block compiling. This helps for mutually referential
1145 @code{defstruct}s in particular.