doc/misc/ert.texi

   1 \input texinfo
   2 @c %**start of header
   3 @setfilename ../../info/ert.info
   4 @settitle Emacs Lisp Regression Testing
   5 @include docstyle.texi
   6 @syncodeindex fn cp
   7 @syncodeindex vr cp
   8 @syncodeindex pg cp
   9 @syncodeindex ky cp
  10 @c %**end of header
  11
  12 @dircategory Emacs misc features
  13 @direntry
  14 * ERT: (ert).                   Emacs Lisp regression testing tool.
  15 @end direntry
  16
  17 @copying
  18 Copyright @copyright{} 2008, 2010--2017 Free Software Foundation, Inc.
  19
  20 @quotation
  21 Permission is granted to copy, distribute and/or modify this document
  22 under the terms of the GNU Free Documentation License, Version 1.3 or
  23 any later version published by the Free Software Foundation; with no
  24 Invariant Sections, with the Front-Cover Texts being ``A GNU Manual,''
  25 and with the Back-Cover Texts as in (a) below.  A copy of the license
  26 is included in the section entitled ``GNU Free Documentation License''.
  27
  28 (a) The FSF's Back-Cover Text is: ``You have the freedom to copy and
  29 modify this GNU manual.''
  30 @end quotation
  31 @end copying
  32
  33 @titlepage
  34 @title Emacs Lisp Regression Testing
  35 @page
  36 @vskip 0pt plus 1filll
  37 @insertcopying
  38 @end titlepage
  39
  40 @contents
  41
  42 @ifnottex
  43 @node Top
  44 @top ERT: Emacs Lisp Regression Testing
  45
  46 @insertcopying
  47
  48 ERT is a tool for automated testing in Emacs Lisp.  Its main features
  49 are facilities for defining tests, running them and reporting the
  50 results, and for debugging test failures interactively.
  51
  52 ERT is similar to tools for other environments such as JUnit, but has
  53 unique features that take advantage of the dynamic and interactive
  54 nature of Emacs.  Despite its name, it works well both for test-driven
  55 development (see
  56 @url{http://en.wikipedia.org/wiki/Test-driven_development}) and for
  57 traditional software development methods.
  58
  59 @menu
  60 * Introduction::                A simple example of an ERT test.
  61 * How to Run Tests::            Run tests in Emacs or from the command line.
  62 * How to Write Tests::          How to add tests to your Emacs Lisp code.
  63 * How to Debug Tests::          What to do if a test fails.
  64 * Extending ERT::               ERT is extensible in several ways.
  65 * Other Testing Concepts::      Features not in ERT.
  66 * Index::                       Concept, Function and Variable Index
  67 * GNU Free Documentation License::  The license for this documentation.
  68
  69 @detailmenu
  70  --- The Detailed Node Listing ---
  71
  72 How to Run Tests
  73
  74 * Running Tests Interactively::  Run tests in your current Emacs.
  75 * Running Tests in Batch Mode::  Run tests in emacs -Q.
  76 * Test Selectors::               Choose which tests to run.
  77
  78 How to Write Tests
  79
  80 * The @code{should} Macro::          A powerful way to express assertions.
  81 * Expected Failures::           Tests for known bugs.
  82 * Tests and Their Environment:: Don't depend on customizations; no side effects.
  83 * Useful Techniques::           Some examples.
  84
  85 How to Debug Tests
  86
  87 * Understanding Explanations::  How ERT gives details on why an assertion failed.
  88 * Interactive Debugging::       Tools available in the ERT results buffer.
  89
  90 Extending ERT
  91
  92 * Defining Explanation Functions::  Teach ERT about more predicates.
  93 * Low-Level Functions for Working with Tests::  Use ERT's data for your purposes.
  94
  95 Other Testing Concepts
  96
  97 * Mocks and Stubs::           Stubbing out code that is irrelevant to the test.
  98 * Fixtures and Test Suites::  How ERT differs from tools for other languages.
  99
 100 Index
 101
 102 * Index::                       Concept, Function and Variable Index
 103
 104 Appendix
 105
 106 * GNU Free Documentation License:: The license for this documentation.
 107
 108 @end detailmenu
 109 @end menu
 110 @end ifnottex
 111
 112 @node Introduction
 113 @chapter Introduction
 114 @cindex introduction to ERT
 115
 116 ERT allows you to define @emph{tests} in addition to functions,
 117 macros, variables, and the other usual Lisp constructs.  Tests are
 118 simply Lisp code: code that invokes other code and checks whether
 119 it behaves as expected.
 120
 121 ERT keeps track of the tests that are defined and provides convenient
 122 commands to run them to verify whether the definitions that are
 123 currently loaded in Emacs pass the tests.
 124
 125 Some Lisp files have comments like the following (adapted from the
 126 package @code{pp.el}):
 127
 128 @lisp
 129 ;; (pp-to-string '(quote quote))          ; expected: "'quote"
 130 ;; (pp-to-string '((quote a) (quote b)))  ; expected: "('a 'b)\n"
 131 ;; (pp-to-string '('a 'b))                ; same as above
 132 @end lisp
 133
 134 The code contained in these comments can be evaluated from time to
 135 time to compare the output with the expected output.  ERT formalizes
 136 this and introduces a common convention, which simplifies Emacs
 137 development, since programmers no longer have to manually find and
 138 evaluate such comments.
 139
 140 An ERT test definition equivalent to the above comments is this:
 141
 142 @lisp
 143 (ert-deftest pp-test-quote ()
 144   "Tests the rendering of `quote' symbols in `pp-to-string'."
 145   (should (equal (pp-to-string '(quote quote)) "'quote"))
 146   (should (equal (pp-to-string '((quote a) (quote b))) "('a 'b)\n"))
 147   (should (equal (pp-to-string '('a 'b)) "('a 'b)\n")))
 148 @end lisp
 149
 150 If you know @code{defun}, the syntax of @code{ert-deftest} should look
 151 familiar: This example defines a test named @code{pp-test-quote} that
 152 will pass if the three calls to @code{equal} all return non-@code{nil}.
 153
 154 @code{should} is a macro with the same meaning as @code{cl-assert} but
 155 better error reporting.  @xref{The @code{should} Macro}.
 156
 157 Each test should have a name that describes what functionality it tests.
 158 Test names can be chosen arbitrarily---they are in a
 159 namespace separate from functions and variables---but should follow
 160 the usual Emacs Lisp convention of having a prefix that indicates
 161 which package they belong to.  Test names are displayed by ERT when
 162 reporting failures and can be used when selecting which tests to run.
 163
 164 The empty parentheses @code{()} in the first line don't currently have
 165 any meaning and are reserved for future extension.  They also make
 166 the syntax of @code{ert-deftest} more similar to that of @code{defun}.
 167
 168 The docstring describes what feature this test tests.  When running
 169 tests interactively, the first line of the docstring is displayed for
 170 tests that fail, so it is good if the first line makes sense on its
 171 own.
 172
 173 The body of a test can be arbitrary Lisp code.  It should have as few
 174 side effects as possible; each test should be written to clean up
 175 after itself, leaving Emacs in the same state as it was before the
 176 test.  Tests should clean up even if they fail.  @xref{Tests and Their
 177 Environment}.
 178
 179
 180 @node  How to Run Tests
 181 @chapter How to Run Tests
 182 @cindex how to run ert tests
 183
 184 You can run tests either in the Emacs you are working in, or on the
 185 command line in a separate Emacs process in batch mode (i.e., with no
 186 user interface).  The former mode is convenient during interactive
 187 development, the latter is useful to make sure that tests pass
 188 independently of your customizations; and it allows you to invoke
 189 tests from makefiles, and to write scripts that run tests in several
 190 different Emacs versions.
 191
 192 @menu
 193 * Running Tests Interactively::  Run tests in your current Emacs.
 194 * Running Tests in Batch Mode::  Run tests in emacs -Q.
 195 * Test Selectors::               Choose which tests to run.
 196 @end menu
 197
 198
 199 @node Running Tests Interactively
 200 @section Running Tests Interactively
 201 @cindex running tests interactively
 202 @cindex interactive testing
 203
 204 @findex ert
 205 You can run the tests that are currently defined in your Emacs with
 206 the command @kbd{@kbd{M-x} ert @kbd{RET} t @kbd{RET}}.  (For an
 207 explanation of the @code{t} argument, @pxref{Test Selectors}.) ERT will pop
 208 up a new buffer, the ERT results buffer, showing the results of the
 209 tests run.  It looks like this:
 210
 211 @example
 212 Selector: t
 213 Passed:  31
 214 Skipped: 0
 215 Failed:  2 (2 unexpected)
 216 Total:   33/33
 217
 218 Started at:   2008-09-11 08:39:25-0700
 219 Finished.
 220 Finished at:  2008-09-11 08:39:27-0700
 221
 222 FF...............................
 223
 224 F addition-test
 225     (ert-test-failed
 226      ((should
 227        (=
 228         (+ 1 2)
 229         4))
 230       :form
 231       (= 3 4)
 232       :value nil))
 233
 234 F list-test
 235     (ert-test-failed
 236      ((should
 237        (equal
 238         (list 'a 'b 'c)
 239         '(a b d)))
 240       :form
 241       (equal
 242        (a b c)
 243        (a b d))
 244       :value nil :explanation
 245       (list-elt 2
 246                 (different-atoms c d))))
 247 @end example
 248
 249 @cindex test results buffer
 250 At the top, there is a summary of the results: we ran all tests defined
 251 in the current Emacs (@code{Selector: t}), 31 of them passed, and 2
 252 failed unexpectedly.  @xref{Expected Failures}, for an explanation of
 253 the term @emph{unexpected} in this context.
 254
 255 The line of dots and @code{F}s is a progress bar where each character
 256 represents one test; it fills while the tests are running.  A dot
 257 means that the test passed, an @code{F} means that it failed.  Below
 258 the progress bar, ERT shows details about each test that had an
 259 unexpected result.  In the example above, there are two failures, both
 260 due to failed @code{should} forms.  @xref{Understanding Explanations},
 261 for more details.
 262
 263 @kindex TAB@r{, in ert results buffer}
 264 @kindex S-TAB@r{, in ert results buffer}
 265 In the ERT results buffer, @kbd{TAB} and @kbd{S-TAB} cycle between
 266 buttons.  Each name of a function or macro in this buffer is a button;
 267 moving point to it and typing @kbd{RET} jumps to its definition.
 268
 269 @kindex r@r{, in ert results buffer}
 270 @kindex d@r{, in ert results buffer}
 271 @kindex .@r{, in ert results buffer}
 272 @kindex b@r{, in ert results buffer}
 273 @cindex backtrace of a failed test
 274 Pressing @kbd{r} re-runs the test near point on its own.  Pressing
 275 @kbd{d} re-runs it with the debugger enabled.  @kbd{.} jumps to the
 276 definition of the test near point (@kbd{RET} has the same effect if
 277 point is on the name of the test).  On a failed test, @kbd{b} shows
 278 the backtrace of the failure.
 279
 280 @kindex l@r{, in ert results buffer}
 281 @kbd{l} shows the list of @code{should} forms executed in the test.
 282 If any messages were generated (with the Lisp function @code{message})
 283 in a test or any of the code that it invoked, @kbd{m} will show them.
 284
 285 @kindex L@r{, in ert results buffer}
 286 By default, long expressions in the failure details are abbreviated
 287 using @code{print-length} and @code{print-level}.  Pressing @kbd{L}
 288 while point is on a test failure will increase the limits to show more
 289 of the expression.
 290
 291
 292 @node Running Tests in Batch Mode
 293 @section Running Tests in Batch Mode
 294 @cindex running tests in batch mode
 295 @cindex batch-mode testing
 296
 297 @findex ert-run-tests-batch
 298 @findex ert-run-tests-batch-and-exit
 299 ERT supports automated invocations from the command line or from
 300 scripts or makefiles.  There are two functions for this purpose,
 301 @code{ert-run-tests-batch} and @code{ert-run-tests-batch-and-exit}.
 302 They can be used like this:
 303
 304 @example
 305 emacs -batch -l ert -l my-tests.el -f ert-run-tests-batch-and-exit
 306 @end example
 307
 308 This command will start up Emacs in batch mode, load ERT, load
 309 @code{my-tests.el}, and run all tests defined in it.  It will exit
 310 with a zero exit status if all tests passed, or nonzero if any tests
 311 failed or if anything else went wrong.  It will also print progress
 312 messages and error diagnostics to standard output.
 313
 314 @findex ert-summarize-tests-batch-and-exit
 315 You can also redirect the above output to a log file, say
 316 @file{output.log}, and use the
 317 @code{ert-summarize-tests-batch-and-exit} function to produce a neat
 318 summary as shown below:
 319
 320 @example
 321 emacs -batch -l ert -f ert-summarize-tests-batch-and-exit output.log
 322 @end example
 323
 324 @vindex ert-quiet
 325 By default, ERT in batch mode is quite verbose, printing a line with
 326 result after each test.  This gives you progress information: how many
 327 tests have been executed and how many there are.  However, in some
 328 cases this much output may be undesirable.  In this case, set
 329 @code{ert-quiet} variable to a non-nil value:
 330
 331 @example
 332 emacs -batch -l ert -l my-tests.el \
 333       --eval "(let ((ert-quiet t)) (ert-run-tests-batch-and-exit))"
 334 @end example
 335
 336 In quiet mode ERT prints only unexpected results and summary.
 337
 338 If ERT is not part of your Emacs distribution, you may need to use
 339 @code{-L /path/to/ert/} so that Emacs can find it.  You may need
 340 additional @code{-L} flags to ensure that @code{my-tests.el} and all the
 341 files that it requires are on your @code{load-path}.
 342
 343
 344 @node Test Selectors
 345 @section Test Selectors
 346 @cindex test selector
 347 @cindex selecting tests
 348
 349 Functions like @code{ert} accept a @emph{test selector}, a Lisp
 350 expression specifying a set of tests.  Test selector syntax is similar
 351 to Common Lisp's type specifier syntax:
 352
 353 @itemize
 354 @item @code{nil} selects no tests.
 355 @item @code{t} selects all tests.
 356 @item @code{:new} selects all tests that have not been run yet.
 357 @item @code{:failed} and @code{:passed} select tests according to their most recent result.
 358 @item @code{:expected}, @code{:unexpected} select tests according to their most recent result.
 359 @item A string is a regular expression that selects all tests with matching names.
 360 @item A test (i.e., an object of @code{ert-test} data type) selects that test.
 361 @item A symbol selects the test that the symbol names.
 362 @item @code{(member @var{tests}...)} selects the elements of
 363 @var{tests}, a list of tests or symbols naming tests.
 364 @item @code{(eql @var{test})} selects @var{test}, a test or a symbol
 365 naming a test.
 366 @item @code{(and @var{selectors}@dots{})} selects the tests that match
 367 all @var{selectors}.
 368 @item @code{(or @var{selectors}@dots{})} selects the tests that match
 369 any of the @var{selectors}.
 370 @item @code{(not @var{selector})} selects all tests that do not match
 371 @var{selector}.
 372 @item @code{(tag @var{tag})} selects all tests that have @var{tag} on
 373 their tags list.
 374 (Tags are optional labels you can apply to tests when you define them.)
 375 @item @code{(satisfies @var{predicate})} selects all tests that
 376 satisfy @var{predicate}, a function that takes a test as argument and
 377 returns non-@code{nil} if it is selected.
 378 @end itemize
 379
 380 Selectors that are frequently useful when selecting tests to run
 381 include @code{t} to run all tests that are currently defined in Emacs,
 382 @code{"^foo-"} to run all tests in package @code{foo} (this assumes
 383 that package @code{foo} uses the prefix @code{foo-} for its test names),
 384 result-based selectors such as @code{(or :new :unexpected)} to
 385 run all tests that have either not run yet or that had an unexpected
 386 result in the last run, and tag-based selectors such as @code{(not
 387 (tag :causes-redisplay))} to run all tests that are not tagged
 388 @code{:causes-redisplay}.
 389
 390
 391 @node How to Write Tests
 392 @chapter How to Write Tests
 393 @cindex how to write tests
 394
 395 @findex ert-deftest
 396 ERT lets you define tests in the same way you define functions.  You
 397 can type @code{ert-deftest} forms in a buffer and evaluate them there
 398 with @code{eval-defun} or @code{compile-defun}, or you can save the
 399 file and load it, optionally byte-compiling it first.
 400
 401 Just like @code{find-function} is only able to find where a function
 402 was defined if the function was loaded from a file, ERT is only able
 403 to find where a test was defined if the test was loaded from a file.
 404
 405
 406 @menu
 407 * The @code{should} Macro::          A powerful way to express assertions.
 408 * Expected Failures::           Tests for known bugs.
 409 * Tests and Their Environment:: Don't depend on customizations; no side effects.
 410 * Useful Techniques::           Some examples.
 411 @end menu
 412
 413 @node The @code{should} Macro
 414 @section The @code{should} Macro
 415
 416 @findex should@r{, ert macro}
 417 Test bodies can include arbitrary code; but to be useful, they need to
 418 check whether the code being tested (or @emph{code under test})
 419 does what it is supposed to do.  The macro @code{should} is similar to
 420 @code{cl-assert} from the cl package
 421 (@pxref{Assertions,,, cl, Common Lisp Extensions}),
 422 but analyzes its argument form and records information that ERT can
 423 display to help debugging.
 424
 425 This test definition
 426
 427 @lisp
 428 (ert-deftest addition-test ()
 429   (should (= (+ 1 2) 4)))
 430 @end lisp
 431
 432 will produce this output when run via @kbd{M-x ert}:
 433
 434 @example
 435 F addition-test
 436     (ert-test-failed
 437      ((should
 438        (=
 439         (+ 1 2)
 440         4))
 441       :form
 442       (= 3 4)
 443       :value nil))
 444 @end example
 445
 446 In this example, @code{should} recorded the fact that (= (+ 1 2) 4)
 447 reduced to (= 3 4) before it reduced to @code{nil}.  When debugging why the
 448 test failed, it helps to know that the function @code{+} returned 3
 449 here.  ERT records the return value for any predicate called directly
 450 within @code{should}.
 451
 452 @findex should-not@r{, ert macro}
 453 @findex should-error@r{, ert macro}
 454 In addition to @code{should}, ERT provides @code{should-not}, which
 455 checks that the predicate returns @code{nil}, and @code{should-error}, which
 456 checks that the form called within it signals an error.  An example
 457 use of @code{should-error}:
 458
 459 @lisp
 460 (ert-deftest test-divide-by-zero ()
 461   (should-error (/ 1 0)
 462                 :type 'arith-error))
 463 @end lisp
 464
 465 This checks that dividing one by zero signals an error of type
 466 @code{arith-error}.  The @code{:type} argument to @code{should-error}
 467 is optional; if absent, any type of error is accepted.
 468 @code{should-error} returns an error description of the error that was
 469 signaled, to allow additional checks to be made.  The error
 470 description has the format @code{(ERROR-SYMBOL . DATA)}.
 471
 472 There is no @code{should-not-error} macro since tests that signal an
 473 error fail anyway, so @code{should-not-error} is effectively the
 474 default.
 475
 476 @xref{Understanding Explanations}, for more details on what
 477 @code{should} reports.
 478
 479
 480 @node Expected Failures
 481 @section Expected Failures
 482 @cindex expected failures
 483 @cindex known bugs
 484
 485 @vindex :expected-result
 486 Some bugs are complicated to fix, or not very important, and are left as
 487 @emph{known bugs}.  If there is a test case that triggers the bug and
 488 fails, ERT will alert you of this failure every time you run all
 489 tests.  For known bugs, this alert is a distraction.  The way to
 490 suppress it is to add @code{:expected-result :failed} to the test
 491 definition:
 492
 493 @lisp
 494 (ert-deftest future-bug ()
 495   "Test `time-forward' with negative arguments.
 496 Since this functionality isn't implemented, the test is known to fail."
 497   :expected-result :failed
 498   (time-forward -1))
 499 @end lisp
 500
 501 ERT will still display a small @code{f} in the progress bar as a
 502 reminder that there is a known bug, and will count the test as failed,
 503 but it will be quiet about it otherwise.
 504
 505 An alternative to marking the test as a known failure this way is to
 506 delete the test.  This is a good idea if there is no intent to fix it,
 507 i.e., if the behavior that was formerly considered a bug has become an
 508 accepted feature.
 509
 510 In general, however, it can be useful to keep tests that are known to
 511 fail.  If someone wants to fix the bug, they will have a very good
 512 starting point: an automated test case that reproduces the bug.  This
 513 makes it much easier to fix the bug, demonstrate that it is fixed, and
 514 prevent future regressions.
 515
 516 ERT displays the same kind of alerts for tests that pass unexpectedly
 517 as it displays for unexpected failures.  This way, if you make code
 518 changes that happen to fix a bug that you weren't aware of, you will
 519 know to remove the @code{:expected-result} clause of that test and
 520 close the corresponding bug report, if any.
 521
 522 Since @code{:expected-result} evaluates its argument when the test is
 523 loaded, tests can be marked as known failures only on certain Emacs
 524 versions, specific architectures, etc.:
 525
 526 @lisp
 527 (ert-deftest foo ()
 528   "A test that is expected to fail on Emacs 23 but succeed elsewhere."
 529   :expected-result (if (string-match "GNU Emacs 23[.]" (emacs-version))
 530                        :failed
 531                      :passed)
 532   ...)
 533 @end lisp
 534
 535
 536 @node Tests and Their Environment
 537 @section Tests and Their Environment
 538
 539 @cindex skipping tests
 540 @cindex test preconditions
 541 @cindex preconditions of a test
 542 Sometimes, it doesn't make sense to run a test due to missing
 543 preconditions.  A required Emacs feature might not be compiled in, the
 544 function to be tested could call an external binary which might not be
 545 available on the test machine, you name it.  In this case, the macro
 546 @code{skip-unless} could be used to skip the test:
 547
 548 @lisp
 549 (ert-deftest test-dbus ()
 550   "A test that checks D-BUS functionality."
 551   (skip-unless (featurep 'dbusbind))
 552   ...)
 553 @end lisp
 554
 555 @cindex tests and their environment
 556 The outcome of running a test should not depend on the current state
 557 of the environment, and each test should leave its environment in the
 558 same state it found it in.  In particular, a test should not depend on
 559 any Emacs customization variables or hooks, and if it has to make any
 560 changes to Emacs's state or state external to Emacs (such as the file
 561 system), it should undo these changes before it returns, regardless of
 562 whether it passed or failed.
 563
 564 Tests should not depend on the environment because any such
 565 dependencies can make the test brittle or lead to failures that occur
 566 only under certain circumstances and are hard to reproduce.  Of
 567 course, the code under test may have settings that affect its
 568 behavior.  In that case, it is best to make the test @code{let}-bind
 569 all such setting variables to set up a specific configuration for the
 570 duration of the test.  The test can also set up a number of different
 571 configurations and run the code under test with each.
 572
 573 Tests that have side effects on their environment should restore it to
 574 its original state because any side effects that persist after the
 575 test can disrupt the workflow of the programmer running the tests.  If
 576 the code under test has side effects on Emacs's current state, such as
 577 on the current buffer or window configuration, the test should create
 578 a temporary buffer for the code to manipulate (using
 579 @code{with-temp-buffer}), or save and restore the window configuration
 580 (using @code{save-window-excursion}), respectively.  For aspects of
 581 the state that can not be preserved with such macros, cleanup should
 582 be performed with @code{unwind-protect}, to ensure that the cleanup
 583 occurs even if the test fails.
 584
 585 An exception to this are messages that the code under test prints with
 586 @code{message} and similar logging; tests should not bother restoring
 587 the @file{*Message*} buffer to its original state.
 588
 589 The above guidelines imply that tests should avoid calling highly
 590 customizable commands such as @code{find-file}, except, of course, if
 591 such commands are what they want to test.  The exact behavior of
 592 @code{find-file} depends on many settings such as
 593 @code{find-file-wildcards}, @code{enable-local-variables}, and
 594 @code{auto-mode-alist}.  It is difficult to write a meaningful test if
 595 its behavior can be affected by so many external factors.  Also,
 596 @code{find-file} has side effects that are hard to predict and thus
 597 hard to undo: It may create a new buffer or reuse an existing
 598 buffer if one is already visiting the requested file; and it runs
 599 @code{find-file-hook}, which can have arbitrary side effects.
 600
 601 Instead, it is better to use lower-level mechanisms with simple and
 602 predictable semantics like @code{with-temp-buffer}, @code{insert} or
 603 @code{insert-file-contents-literally}, and to activate any desired mode
 604 by calling the corresponding function directly, after binding the
 605 hook variables to @code{nil}.  This avoids the above problems.
 606
 607
 608 @node Useful Techniques
 609 @section Useful Techniques when Writing Tests
 610 @cindex useful techniques
 611 @cindex tips and tricks
 612
 613 Testing simple functions that have no side effects and no dependencies
 614 on their environment is easy.  Such tests often look like this:
 615
 616 @lisp
 617 (ert-deftest ert-test-mismatch ()
 618   (should (eql (cl-mismatch "" "") nil))
 619   (should (eql (cl-mismatch "" "a") 0))
 620   (should (eql (cl-mismatch "a" "a") nil))
 621   (should (eql (cl-mismatch "ab" "a") 1))
 622   (should (eql (cl-mismatch "Aa" "aA") 0))
 623   (should (eql (cl-mismatch '(a b c) '(a b d)) 2)))
 624 @end lisp
 625
 626 This test calls the function @code{cl-mismatch} several times with
 627 various combinations of arguments and compares the return value to the
 628 expected return value.  (Some programmers prefer @code{(should (eql
 629 EXPECTED ACTUAL))} over the @code{(should (eql ACTUAL EXPECTED))}
 630 shown here.  ERT works either way.)
 631
 632 Here's a more complicated test:
 633
 634 @lisp
 635 (ert-deftest ert-test-record-backtrace ()
 636   (let ((test (make-ert-test :body (lambda () (ert-fail "foo")))))
 637     (let ((result (ert-run-test test)))
 638       (should (ert-test-failed-p result))
 639       (with-temp-buffer
 640         (ert--print-backtrace (ert-test-failed-backtrace result))
 641         (goto-char (point-min))
 642         (end-of-line)
 643         (let ((first-line (buffer-substring-no-properties
 644                            (point-min) (point))))
 645           (should (equal first-line
 646                          "  signal(ert-test-failed (\"foo\"))")))))))
 647 @end lisp
 648
 649 @findex make-ert-test
 650 @findex ert-equal-including-properties
 651 This test creates a test object using @code{make-ert-test} whose body
 652 will immediately signal failure.  It then runs that test and asserts
 653 that it fails.  Then, it creates a temporary buffer and invokes
 654 @code{ert--print-backtrace} to print the backtrace of the failed test
 655 to the current buffer.  Finally, it extracts the first line from the
 656 buffer and asserts that it matches what we expect.  It uses
 657 @code{buffer-substring-no-properties} and @code{equal} to ignore text
 658 properties; for a test that takes properties into account,
 659 @code{buffer-substring} and @code{ert-equal-including-properties}
 660 could be used instead.
 661
 662 The reason why this test only checks the first line of the backtrace
 663 is that the remainder of the backtrace is dependent on ERT's internals
 664 as well as whether the code is running interpreted or compiled.  By
 665 looking only at the first line, the test checks a useful property---that
 666 the backtrace correctly captures the call to @code{signal} that
 667 results from the call to @code{ert-fail}---without being brittle.
 668
 669 This example also shows that writing tests is much easier if the code
 670 under test was structured with testing in mind.
 671
 672 For example, if @code{ert-run-test} accepted only symbols that name
 673 tests rather than test objects, the test would need a name for the
 674 failing test, which would have to be a temporary symbol generated with
 675 @code{make-symbol}, to avoid side effects on Emacs's state.  Choosing
 676 the right interface for @code{ert-run-tests} allows the test to be
 677 simpler.
 678
 679 Similarly, if @code{ert--print-backtrace} printed the backtrace to a
 680 buffer with a fixed name rather than the current buffer, it would be
 681 much harder for the test to undo the side effect.  Of course, some
 682 code somewhere needs to pick the buffer name.  But that logic is
 683 independent of the logic that prints backtraces, and keeping them in
 684 separate functions allows us to test them independently.
 685
 686 A lot of code that you will encounter in Emacs was not written with
 687 testing in mind.  Sometimes, the easiest way to write tests for such
 688 code is to restructure the code slightly to provide better interfaces
 689 for testing.  Usually, this makes the interfaces easier to use as
 690 well.
 691
 692
 693 @node How to Debug Tests
 694 @chapter How to Debug Tests
 695
 696 This section describes how to use ERT's features to understand why
 697 a test failed.
 698
 699
 700 @menu
 701 * Understanding Explanations::  How ERT gives details on why an assertion failed.
 702 * Interactive Debugging::       Tools available in the ERT results buffer.
 703 @end menu
 704
 705
 706 @node Understanding Explanations
 707 @section Understanding Explanations
 708 @cindex understanding explanations
 709 @cindex explanations, understanding
 710
 711 Failed @code{should} forms are reported like this:
 712
 713 @example
 714 F addition-test
 715     (ert-test-failed
 716      ((should
 717        (=
 718         (+ 1 2)
 719         4))
 720       :form
 721       (= 3 4)
 722       :value nil))
 723 @end example
 724
 725 ERT shows what the @code{should} expression looked like and what
 726 values its subexpressions had: The source code of the assertion was
 727 @code{(should (= (+ 1 2) 4))}, which applied the function @code{=} to
 728 the arguments @code{3} and @code{4}, resulting in the value
 729 @code{nil}.  In this case, the test is wrong; it should expect 3
 730 rather than 4.
 731
 732 If a predicate like @code{equal} is used with @code{should}, ERT
 733 provides a so-called @emph{explanation}:
 734
 735 @example
 736 F list-test
 737     (ert-test-failed
 738      ((should
 739        (equal
 740         (list 'a 'b 'c)
 741         '(a b d)))
 742       :form
 743       (equal
 744        (a b c)
 745        (a b d))
 746       :value nil :explanation
 747       (list-elt 2
 748                 (different-atoms c d))))
 749 @end example
 750
 751 In this case, the function @code{equal} was applied to the arguments
 752 @code{(a b c)} and @code{(a b d)}.  ERT's explanation shows that
 753 the item at index 2 differs between the two lists; in one list, it is
 754 the atom c, in the other, it is the atom d.
 755
 756 In simple examples like the above, the explanation is unnecessary.
 757 But in cases where the difference is not immediately apparent, it can
 758 save time:
 759
 760 @example
 761 F test1
 762     (ert-test-failed
 763      ((should
 764        (equal x y))
 765       :form
 766       (equal a a)
 767       :value nil :explanation
 768       (different-symbols-with-the-same-name a a)))
 769 @end example
 770
 771 ERT only provides explanations for predicates that have an explanation
 772 function registered.  @xref{Defining Explanation Functions}.
 773
 774
 775 @node Interactive Debugging
 776 @section Interactive Debugging
 777 @cindex interactive debugging
 778 @cindex debugging failed tests
 779
 780 Debugging failed tests essentially works the same way as debugging any
 781 other problems with Lisp code.  Here are a few tricks specific to
 782 tests:
 783
 784 @itemize
 785 @cindex re-running a failed test
 786 @item
 787 Re-run the failed test a few times to see if it fails in the same way
 788 each time.  It's good to find out whether the behavior is
 789 deterministic before spending any time looking for a cause.  In the
 790 ERT results buffer, @kbd{r} re-runs the selected test.
 791
 792 @cindex jump to the test source code
 793 @item
 794 Use @kbd{.} to jump to the source code of the test to find out exactly
 795 what it does.  Perhaps the test is broken rather than the code
 796 under test.
 797
 798 @item
 799 If the test contains a series of @code{should} forms and you can't
 800 tell which one failed, use @kbd{l}, which shows you the list of all
 801 @code{should} forms executed during the test before it failed.
 802
 803 @cindex show backtrace of failed test
 804 @item
 805 Use @kbd{b} to view the backtrace.  You can also use @kbd{d} to re-run
 806 the test with debugging enabled, this will enter the debugger and show
 807 the backtrace as well; but the top few frames shown there will not be
 808 relevant to you since they are ERT's own debugger hook.  @kbd{b}
 809 strips them out, so it is more convenient.
 810
 811 @item
 812 If the test or the code under testing prints messages using
 813 @code{message}, use @kbd{m} to see what messages it printed before it
 814 failed.  This can be useful to figure out how far it got.
 815
 816 @cindex instrumenting test for Edebug
 817 @item
 818 You can instrument tests for debugging the same way you instrument
 819 @code{defun}s for debugging: go to the source code of the test and
 820 type @kbd{@kbd{C-u} @kbd{C-M-x}}.  Then, go back to the ERT buffer and
 821 re-run the test with @kbd{r} or @kbd{d}.
 822
 823 @cindex discard obsolete test results
 824 @item
 825 If you have been editing and rearranging tests, it is possible that
 826 ERT remembers an old test that you have since renamed or removed:
 827 renamings or removals of definitions in the source code leave around a
 828 stray definition under the old name in the running process (this is a
 829 common problem in Lisp).  In such a situation, hit @kbd{D} to let ERT
 830 forget about the obsolete test.
 831 @end itemize
 832
 833
 834 @node Extending ERT
 835 @chapter Extending ERT
 836 @cindex extending ert
 837
 838 There are several ways to add functionality to ERT.
 839
 840 @menu
 841 * Defining Explanation Functions::  Teach ERT about more predicates.
 842 * Low-Level Functions for Working with Tests::  Use ERT's data for your purposes.
 843 @end menu
 844
 845
 846 @node Defining Explanation Functions
 847 @section Defining Explanation Functions
 848 @cindex defining explanation functions
 849
 850 The explanation function for a predicate is a function that takes the
 851 same arguments as the predicate and returns an @emph{explanation}.
 852 The explanation should explain why the predicate, when invoked with
 853 the arguments given to the explanation function, returns the value
 854 that it returns.  The explanation can be any object but should have a
 855 comprehensible printed representation.  If the return value of the
 856 predicate needs no explanation for a given list of arguments, the
 857 explanation function should return @code{nil}.
 858
 859 @vindex ert-explainer@r{, property}
 860 To associate an explanation function with a predicate, add the
 861 property @code{ert-explainer} to the symbol that names the predicate.
 862 The value of the property should be the symbol that names the
 863 explanation function.
 864
 865
 866 @node Low-Level Functions for Working with Tests
 867 @section Low-Level Functions for Working with Tests
 868 @cindex low-level functions
 869
 870 Both @code{ert-run-tests-interactively} and @code{ert-run-tests-batch}
 871 are implemented on top of the lower-level test handling code in the
 872 sections of @file{ert.el} labeled ``Facilities for running a single test'',
 873 ``Test selectors'', and ``Facilities for running a whole set of tests''.
 874
 875 If you want to write code that works with ERT tests, you should take a
 876 look at this lower-level code.  Symbols that start with @code{ert--}
 877 are internal to ERT, whereas those that start with @code{ert-} are
 878 meant to be usable by other code.  But there is no mature API yet.
 879
 880 Contributions to ERT are welcome.
 881
 882
 883 @node Other Testing Concepts
 884 @chapter Other Testing Concepts
 885
 886 For information on mocks, stubs, fixtures, or test suites, see below.
 887
 888
 889 @menu
 890 * Mocks and Stubs::           Stubbing out code that is irrelevant to the test.
 891 * Fixtures and Test Suites::  How ERT differs from tools for other languages.
 892 @end menu
 893
 894 @node Mocks and Stubs
 895 @section Other Tools for Emacs Lisp
 896 @cindex mocks and stubs
 897
 898 Stubbing out functions or using so-called @emph{mocks} can make it
 899 easier to write tests.  See
 900 @url{http://en.wikipedia.org/wiki/Mock_object} for an explanation of
 901 the corresponding concepts in object-oriented languages.
 902
 903 ERT does not have built-in support for mocks or stubs.  The package
 904 @code{el-mock} (see @url{http://www.emacswiki.org/emacs/el-mock.el})
 905 offers mocks for Emacs Lisp and can be used in conjunction with ERT.
 906
 907
 908 @node Fixtures and Test Suites
 909 @section Fixtures and Test Suites
 910 @cindex fixtures
 911
 912 In many ways, ERT is similar to frameworks for other languages like
 913 SUnit or JUnit.  However, two features commonly found in such
 914 frameworks are notably absent from ERT: fixtures and test suites.
 915
 916 Fixtures are mainly used (e.g., in SUnit or JUnit) to provide an
 917 environment for a set of tests, and consist of set-up and tear-down
 918 functions.
 919
 920 While fixtures are a useful syntactic simplification in other
 921 languages, this does not apply to Lisp, where higher-order functions
 922 and @code{unwind-protect} are available.  One way to implement and use a
 923 fixture in ERT is
 924
 925 @lisp
 926 (defun my-fixture (body)
 927   (unwind-protect
 928       (progn [set up]
 929              (funcall body))
 930     [tear down]))
 931
 932 (ert-deftest my-test ()
 933   (my-fixture
 934    (lambda ()
 935      [test code])))
 936 @end lisp
 937
 938 (Another way would be a @code{with-my-fixture} macro.)  This solves
 939 the set-up and tear-down part, and additionally allows any test
 940 to use any combination of fixtures, so it is more flexible than what
 941 other tools typically allow.
 942
 943 If the test needs access to the environment the fixture sets up, the
 944 fixture can be modified to pass arguments to the body.
 945
 946 These are well-known Lisp techniques.  Special syntax for them could
 947 be added but would provide only a minor simplification.
 948
 949 (If you are interested in such syntax, note that splitting set-up and
 950 tear-down into separate functions, like *Unit tools usually do, makes
 951 it impossible to establish dynamic @code{let} bindings as part of the
 952 fixture.  So, blindly imitating the way fixtures are implemented in
 953 other languages would be counter-productive in Lisp.)
 954
 955 The purpose of test suites is to group related tests together.
 956
 957 The most common use of this is to run just the tests for one
 958 particular module.  Since symbol prefixes are the usual way of
 959 separating module namespaces in Emacs Lisp, test selectors already
 960 solve this by allowing regexp matching on test names; e.g., the
 961 selector @code{"^ert-"} selects ERT's self-tests.
 962
 963 Other uses include grouping tests by their expected execution time,
 964 e.g., to run quick tests during interactive development and slow tests less
 965 often.  This can be achieved with the @code{:tag} argument to
 966 @code{ert-deftest} and @code{tag} test selectors.
 967
 968 @node Index
 969 @unnumbered Index
 970
 971 @printindex cp
 972
 973 @node GNU Free Documentation License
 974 @appendix GNU Free Documentation License
 975 @include doclicense.texi
 976
 977 @bye
 978
 979 @c  LocalWords:  ERT JUnit namespace docstring ERT's
 980 @c  LocalWords:  backtrace makefiles workflow backtraces API SUnit
 981 @c  LocalWords:  subexpressions