1 <?xml version="1.0" encoding="iso-8859-1"?>
2 <!DOCTYPE book PUBLIC "-//OASIS//DTD DocBook XML V4.1//EN"
3 "http://www.oasis-open.org/docbook/xml/4.1/docbookx.dtd" [
4 <!ENTITY % myents SYSTEM "entities.inc">
8 <chapter id="ffi"><title>The Foreign Function Interface</title>
10 <para>This chapter describes &SBCL;'s interface to C programs and
11 libraries (and, since C interfaces are a sort of <foreignphrase>lingua
12 franca</foreignphrase> of the Unix world, to other programs and libraries in
15 <note><para>In the modern Lisp world, the usual term for this
16 functionality is Foreign Function Interface, or <acronym>FFI</acronym>, where
17 despite the mention of <quote>function</quote> in this term, <acronym>FFI</acronym> also
18 refers to direct manipulation of C data structures as well as
19 functions. The traditional &CMUCL; terminology is Alien Interface, and
20 while that older terminology is no longer used much in the system
21 documentation, it still reflected in names in the
22 implementation, notably in the name of the <literal>SB-ALIEN</literal>
23 package.</para></note>
25 <sect1 id="ffi-intro"><title>Introduction to the Foreign Function Interface</title>
26 <!-- AKA "Introduction to Aliens" in the CMU CL manual -->
29 Because of Lisp's emphasis on dynamic memory allocation and garbage
30 collection, Lisp implementations use non-C-like memory representations
31 for objects. This representation mismatch creates friction when a Lisp
32 program must share objects with programs which expect C data. There
33 are three common approaches to establishing communication:
35 <listitem><para>The burden can be placed on the foreign program
36 (and programmer) by requiring the knowledge and use of the
37 representations used internally by the Lisp implementation.
38 This can require a considerable amount of <quote>glue</quote> code on the
39 C side, and that code tends to be sensitively dependent on the
40 internal implementation details of the Lisp system.</para></listitem>
41 <listitem><para>The Lisp system can automatically convert objects
42 back and forth between the Lisp and foreign representations.
43 This is convenient, but translation becomes prohibitively slow
44 when large or complex data structures must be shared. This approach
45 is supported by the &SBCL; <acronym>FFI</acronym>, and used automatically
46 by the when passing integers and strings.</para></listitem>
47 <listitem><para>The Lisp program can directly manipulate foreign
48 objects through the use of extensions to the Lisp language.
53 <para>&SBCL;, like &CMUCL; before it, relies primarily on the
54 automatic conversion and direct manipulation approaches. The SB-ALIEN
55 package provices a facility wherein foreign values of simple scalar
56 types are automatically converted and complex types are directly
57 manipulated in their foreign representation. Additionally the
58 lower-level System Area Pointers (or SAPs) can be used where
59 necessary to provide untyped access to foreign memory.</para>
61 <para>Any foreign objects that can't automatically be converted into
62 Lisp values are represented by objects of type <type>alien-value</type>.
63 Since Lisp is a dynamically typed language, even foreign objects must
64 have a run-time type; this type information is provided by
65 encapsulating the raw pointer to the foreign data within an
66 <type>alien-value</type> object.</para>
68 <para>The type language and operations on foreign types are
69 intentionally similar to those of the C language.</para>
73 <sect1 id="foreign-types"><title>Foreign Types</title>
74 <!-- AKA "Alien Types" in the CMU CL manual -->
76 <para>Alien types have a description language based on nested list
77 structure. For example the C type
78 <programlisting>struct foo {
82 has the corresponding &SBCL; FFI type
83 <programlisting>(struct foo
85 (b (array (* (struct foo)) 100)))</programlisting>
88 <sect2><title>Defining Foreign Types</title>
91 Types may be either named or anonymous. With structure and union
92 types, the name is part of the type specifier, allowing recursively
93 defined types such as:
94 <programlisting>(struct foo (a (* (struct foo))))</programlisting>
95 An anonymous structure or union type is specified by using the name
96 <literal>nil</literal>. The <function>with-alien</function> macro defines a local
97 scope which <quote>captures</quote> any named type definitions. Other types
98 are not inherently named, but can be given named abbreviations using
99 the <function>define-alien-type</function> macro.
104 <sect2><title>Foreign Types and Lisp Types</title>
107 The foreign types form a subsystem of the &SBCL; type system. An
108 <type>alien</type> type specifier provides a way to use any foreign type as a
109 Lisp type specifier. For example,
110 <programlisting>(typep foo '(alien (* int)))</programlisting>
111 can be used to determine whether <varname>foo</varname> is a pointer to a foreign
112 <type>int</type>. <type>alien</type> type specifiers can be used in the same ways
113 as ordinary Lisp type specifiers (like <type>string</type>.) Alien type
114 declarations are subject to the same
115 precise type checking <!-- FIXME: should be linked to id="precisetypechecking" -->
116 as any other declaration.
120 Note that the type identifiers used in the
121 foreign type system overlap with native Lisp type
122 specifiers in some cases. For example, the type specifier
123 <type>(alien single-float)</type> is identical to <type>single-float</type>, since
124 foreign floats are automatically converted to Lisp floats. When
125 <function>type-of</function> is called on an alien value that is not automatically
126 converted to a Lisp value, then it will return an <type>alien</type> type
132 <sect2><title>Foreign Type Specifiers</title>
135 All foreign type names are exported from the <literal>sb-alien</literal>
136 package. Some foreign type names are also symbols in
137 the <literal>common-lisp</literal> package, in which case they are
138 reexported from the <literal>sb-alien</literal> package, so that
139 e.g. it is legal to refer to <type>sb-alien:single-float</type>.
143 These are the basic foreign type specifiers:
144 <!-- FIXME: There must be some better way of formatting definitions
145 in DocBook than this. I haven't found it yet, but suggestions
146 or patches would be welcome. -->
150 The foreign type specifier <type>(* foo)</type> describes a
151 pointer to an object of type <type>foo</type>. A pointed-to type
152 <type>foo</type> of <type>t</type> indicates a pointer to anything,
153 similar to <type>void *</type> in ANSI C. A null alien pointer can
154 be detected with the <function>sb-alien:null-alien</function>
160 The foreign type specifier <type>(array foo &optional dimensions)</type>
161 describes array of the specified <literal>dimensions</literal>, holding
162 elements of type <type>foo</type>. Note that (unlike in C) <type>(* foo)</type>
163 <type>(array foo)}</type> are considered to be different types when
164 type checking is done. If equivalence of pointer and array types
165 is desired, it may be explicitly coerced using
166 <function>sb-alien:cast</function>.
169 Arrays are accessed using <function>sb-alien:deref</function>, passing
170 the indices as additional arguments. Elements are stored in
171 column-major order (as in C), so the first dimension determines
172 only the size of the memory block, and not the layout of the
173 higher dimensions. An array whose first dimension is variable
174 may be specified by using <literal>nil</literal> as the first dimension.
175 Fixed-size arrays can be allocated as array elements, structure
176 slots or <function>sb-alien:with-alien</function> variables. Dynamic
177 arrays can only be allocated using <function>sb-alien:make-alien</function>.
182 The foreign type specifier
183 <type>(sb-alien:struct name &rest fields)</type>
184 describes a structure type with the specified <varname>name</varname> and
185 <varname>fields</varname>. Fields are allocated at the same offsets
186 used by the implementation's C compiler. If <varname>name</varname>
187 is <literal>nil</literal> then the structure is anonymous.
190 If a named foreign <type>struct</type> specifier is passed to
191 <function>define-alien-type</function> or <function>with-alien</function>,
192 then this defines, respectively, a new global or local foreign
193 structure type. If no <varname>fields</varname> are specified, then
194 the fields are taken from the current (local or global) alien
195 structure type definition of <varname>name</varname>.
200 The foreign type specifier
201 <type>(sb-alien:union name &rest fields)</type>
202 is similar to <type>sb-alien:struct</type>, but describes a union type.
203 All fields are allocated at the same offset, and the size of the
204 union is the size of the largest field. The programmer must
205 determine which field is active from context.
210 The foreign type specifier <type>(sb-alien:enum name &rest specs)</type>
211 describes an enumeration type that maps between integer values
212 and keywords. If <varname>name</varname> is <literal>nil</literal>, then the
213 type is anonymous. Each element of the <varname>specs</varname>
214 list is either a Lisp keyword, or a list <literal>(keyword value)</literal>.
215 <varname>value</varname> is an integer. If <varname>value</varname> is not
216 supplied, then it defaults to one greater than the value for
217 the preceding spec (or to zero if it is the first spec.)
222 The foreign type specifier <type>(sb-alien:signed &optional bits)</type>
223 specifies a signed integer with the specified number of
224 <varname>bits</varname> precision. The upper limit on integer
225 precision is determined by the machine's word
226 size. If <varname>bits</varname> is not specified, the maximum
232 The foreign type specifier <type>(integer &optional bits)</type> is
233 equivalent to the corresponding type specifier using
234 <type>sb-alien:signed</type> instead of <type>integer</type>.
239 The foreign type specifier
240 <type>(sb-alien:unsigned &optional bits)</type>
241 is like corresponding type specifier using <type>sb-alien:signed</type>
242 except that the variable is treated as an unsigned integer.
247 The foreign type specifier <type>(boolean &optional bits)</type> is
248 similar to an enumeration type, but maps from Lisp <literal>nil</literal>
249 and <literal>t</literal> to C <literal>0</literal> and <literal>1</literal>
250 respectively. <varname>bits</varname> determines the amount of
251 storage allocated to hold the truth value.
256 The foreign type specifier <type>single-float</type> describes a
257 floating-point number in IEEE single-precision format.
262 The foreign type specifier <type>double-float</type> describes a
263 floating-point number in IEEE double-precision format.
268 The foreign type specifier
269 <type>(function result-type &rest arg-types)</type>
270 describes a foreign function that takes arguments of the specified
271 <varname>arg-types</varname> and returns a result of type <type>result-type</type>.
272 Note that the only context where a foreign <type>function</type> type
273 is directly specified is in the argument to
274 <function>sb-alien:alien-funcall</function>.
275 In all other contexts, foreign functions are represented by
276 foreign function pointer types: <type>(* (function ...))</type>.
281 The foreign type specifier <type>sb-alien:system-area-pointer</type>
282 describes a pointer which is represented in Lisp as a
283 <type>system-area-pointer</type> object. &SBCL; exports this type from
284 <literal>sb-alien</literal> because &CMUCL; did, but tentatively (as of
285 the first draft of this section of the manual, &SBCL; 0.7.6) it is
286 deprecated, since it doesn't seem to be required by user code.
291 The foreign type specifier <type>sb-alien:void</type> is
292 used in function types to declare that no useful value
293 is returned. Using <function>alien-funcall</function>
294 to call a <type>void</type> foreign function will return
300 The foreign type specifier <type>sb-alien:c-string</type>
301 is similar to <type>(* char)</type>, but is interpreted as a
302 null-terminated string, and is automatically converted into a
303 Lisp string when accessed; or if the pointer is C <literal>NULL</literal>
304 or <literal>0</literal>, then accessing it gives Lisp <literal>nil</literal>.
305 Lisp strings are stored with a trailing NUL termination, so no
306 copying (either by the user or the implementation) is necessary
307 when passing them to foreign code.
310 Assigning a Lisp string to a <type>c-string</type> structure field or
311 variable stores the contents of the string to the memory already
312 pointed to by that variable. When a foreign object of type
313 <type>(* char)</type> is assigned to a <type>c-string</type>, then the
314 <type>c-string</type> pointer is assigned to. This allows
315 <type>c-string</type> pointers to be initialized. For example:
316 <programlisting>(cl:in-package "CL-USER") ; which USEs package "SB-ALIEN"
317 (define-alien-type nil (struct foo (str c-string)))
318 (defun make-foo (str) (let ((my-foo (make-alien (struct foo))))
319 (setf (slot my-foo 'str) (make-alien char (length str))
320 (slot my-foo 'str) str) my-foo))</programlisting>
321 Storing Lisp <literal>NIL</literal> in a <type>c-string</type> writes C
322 <literal>NULL</literal> to the variable.
327 <literal>sb-alien</literal> also exports translations of these C type
328 specifiers as foreign type specifiers:
329 <type>sb-alien:char</type>,
330 <type>sb-alien:short</type>,
331 <type>sb-alien:int</type>,
332 <type>sb-alien:long</type>,
333 <type>sb-alien:unsigned-char</type>,
334 <type>sb-alien:unsigned-short</type>,
335 <type>sb-alien:unsigned-int</type>,
336 <type>sb-alien:unsigned-long</type>,
337 <type>sb-alien:float</type>, and
338 <type>sb-alien:double</type>.
350 <sect1 id="foreign-values"><title>Operations On Foreign Values</title>
351 <!-- AKA "Alien Operations" in the CMU CL manual -->
353 <para>This section describes how to read foreign values as Lisp
354 values, how to coerce foreign values to different kinds of foreign values, and
355 how to dynamically allocate and free foreign variables.</para>
357 <sect2><title>Accessing Foreign Values</title>
359 <synopsis>(sb-alien:deref pointer-or-array &rest indices)</synopsis>
361 <para>The <function>sb-alien:deref</function> function returns the value pointed to by
362 a foreign pointer, or the value of a foreign array element. When
363 dereferencing a pointer, an optional single index can be specified to
364 give the equivalent of C pointer arithmetic; this index is scaled by
365 the size of the type pointed to. When dereferencing an array, the
366 number of indices must be the same as the number of dimensions in the
367 array type. <function>deref</function> can be set with <function>setf</function> to
368 assign a new value.</para>
370 <synopsis>(sb-alien:slot struct-or-union &rest slot-names)</synopsis>
372 <para>The <function>sb-alien:slot</function> function extracts the value of
373 the slot named <varname>slot-name</varname> from a foreign <type>struct</type> or
374 <type>union</type>. If <varname>struct-or-union</varname> is a pointer to a
375 structure or union, then it is automatically dereferenced.
376 <function>sb-alien:slot</function> can be set with <function>setf</function> to assign
377 a new value. Note that <varname>slot-name</varname> is evaluated, and need
378 not be a compile-time constant (but only constant slot accesses are
379 efficiently compiled.)</para>
381 <sect3><title>Untyped memory</title>
383 <para>As noted at the beginning of the chapter, the System Area
384 Pointer facilities allow untyped access to foreign memory. SAPs can
385 be converted to and from the usual typed foreign values using
386 <function>sap-alien</function> and <function>alien-sap</function>
387 (described elsewhere), and also to and from integers - raw machine
388 addresses. They should thus be used with caution; corrupting the Lisp
389 heap or other memory with SAPs is trivial.</para>
391 <synopsis>(sb-sys:int-sap machine-address)</synopsis>
393 <para>Creates a SAP pointing at the virtual address
394 <varname>machine-address</varname>. </para>
396 <synopsis>(sb-sys:sap-ref-32 sap offset)</synopsis>
398 <para>Access the value of the memory location at
399 <varname>offset</varname> bytes from <varname>sap</varname>. This form
400 may also be used with <function>setf</function> to alter the memory at
401 that location.</para>
403 <synopsis>(sb-sys:sap= sap1 sap2)</synopsis>
405 <para>Compare <varname>sap1</varname> and <varname>sap2</varname> for
408 <para>Similarly named functions exist for accessing other sizes of
409 word, other comparisons, and other conversions. The reader is invited
410 to use <function>apropos</function> and <function>describe</function>
411 for more details</para>
413 (apropos "sap" :sb-sys)
417 <sect2><title>Coercing Foreign Values</title>
419 <synopsis>(sb-alien:addr alien-expr)</synopsis>
422 The <function>sb-alien:addr</function> macro
423 returns a pointer to the location specified by
424 <varname>alien-expr</varname>, which must be either a foreign variable, a use of
425 <function>sb-alien:deref</function>, a use of <function>sb-alien:slot</function>, or a use of
426 <function>sb-alien:extern-alien</function>.
429 <synopsis>(sb-alien:cast foreign-value new-type)</synopsis>
431 <para>The <function>sb-alien:cast</function> macro
432 converts <varname>foreign-value</varname> to a new foreign value with the specified
433 <varname>new-type</varname>. Both types, old and new, must be foreign pointer,
434 array or function types. Note that the resulting Lisp
435 foreign variable object
436 is not <function>eq</function> to the
437 argument, but it does refer to the same foreign data bits.</para>
439 <synopsis>(sb-alien:sap-alien sap type)</synopsis>
441 <para>The <function>sb-alien:sap-alien</function> function converts <varname>sap</varname>
442 (a system area pointer) to a foreign value with the specified
443 <varname>type</varname>. <varname>type</varname> is not evaluated.
446 <para>The <varname>type</varname> must be some foreign pointer, array, or
449 <synopsis>(sb-alien:alien-sap foreign-value type)</synopsis>
451 <para>The <function>sb-alien:alien-sap</function> function
452 returns the SAP which points to <varname>alien-value</varname>'s data.
455 <para>The <varname>foreign-value</varname> must be of some foreign pointer,
456 array, or record type.</para>
460 <sect2><title>Foreign Dynamic Allocation</title>
462 <para>Lisp code can call the C standard library functions
463 <function>malloc</function> and <function>free</function> to dynamically allocate and
464 deallocate foreign variables. The Lisp code shares the same allocator
465 with foreign C code, so it's OK for foreign code to call
466 <function>free</function> on the result of Lisp
467 <function>sb-alien:make-alien</function>, or for Lisp code to call
468 <function>sb-alien:free-alien</function> on foreign objects allocated by C
471 <synopsis>(sb-alien:make-alien type size)</synopsis>
473 <para>The <function>sb-alien:make-alien</function> macro
474 returns a dynamically allocated foreign value of the specified
475 <varname>type</varname> (which is not evaluated.) The allocated memory is not
476 initialized, and may contain arbitrary junk. If supplied,
477 <varname>size</varname> is an expression to evaluate to compute the size of the
478 allocated object. There are two major cases:
481 <para>When <varname>type</varname> is a foreign array type, an array of
482 that type is allocated and a pointer to it is returned. Note that you
483 must use <function>deref</function> to change the result to an array before you
484 can use <function>deref</function> to read or write elements:
486 (cl:in-package "CL-USER") ; which USEs package "SB-ALIEN"
487 (defvar *foo* (make-alien (array char 10)))
488 (type-of *foo*) => (alien (* (array (signed 8) 10)))
489 (setf (deref (deref foo) 0) 10) => 10</programlisting>
490 If supplied, <varname>size</varname> is used as the first dimension for the
494 <para>When <varname>type</varname> is any other foreign type, then an
495 object for that type is allocated, and a pointer to it is
496 returned. So <function>(make-alien int)</function> returns a <type>(* int)</type>.
497 If <varname>size</varname> is specified, then a block of that many
498 objects is allocated, with the result pointing to the first one.</para>
503 <synopsis>(sb-alien:free-alien foreign-value)</synopsis>
505 <para>The <function>sb-alien:free-alien</function> function
506 frees the storage for <varname>foreign-value</varname>,
507 which must have been allocated with Lisp <function>make-alien</function>
508 or C <function>malloc</function>.</para>
510 <para>See also the <function>sb-alien:with-alien</function> macro, which
511 allocates foreign values on the stack.</para>
517 <sect1 id="foreign-variables"><title>Foreign Variables</title>
518 <!-- AKA "Alien Variables" in the CMU CL manual -->
521 Both local (stack allocated) and external (C global) foreign variables are
525 <sect2><title>Local Foreign Variables</title>
527 <synopsis>(sb-alien:with-alien var-definitions &body body)</synopsis>
529 <para>The <function>with-alien</function>
530 macro establishes local
533 alien types and names.
534 This form is analogous to defining a local variable in C: additional
535 storage is allocated, and the initial value is copied.
537 analogous to LET-allocated Lisp variables, since the variables
538 can't be captured in closures: they live only for the dynamic extent
539 of the body, and referring to them outside is a gruesome error.
542 <para>The <varname>var-definitions</varname> argument is a list of
543 variable definitions, each of the form
544 <programlisting>(name type &optional initial-value)</programlisting>
545 The names of the variables are established as symbol-macros; the bindings have
546 lexical scope, and may be assigned with <function>setq</function>
547 or <function>setf</function>.
550 <para>The <function>with-alien</function> macro also establishes
551 a new scope for named structures
552 and unions. Any <varname>type</varname> specified for a variable may contain
553 named structure or union types with the slots specified. Within the
554 lexical scope of the binding specifiers and body, a locally defined
555 foreign structure type <type>foo</type> can be referenced by its name using
556 <type>(struct foo)</type>.
561 <sect2><title>External Foreign Variables</title>
564 External foreign names are strings, and Lisp names are symbols. When
565 an external foreign value is represented using a Lisp variable, there
566 must be a way to convert from one name syntax into the other. The
567 macros <function>extern-alien</function>, <function>define-alien-variable</function> and
568 <function>define-alien-routine</function> use this conversion heuristic:
570 <listitem><para>Alien names are converted to Lisp names by uppercasing and
571 replacing underscores with hyphens.</para></listitem>
572 <listitem><para>Conversely, Lisp names are converted to alien names by
573 lowercasing and replacing hyphens with underscores.</para></listitem>
574 <listitem><para>Both the Lisp symbol and alien string names may be
575 separately specified by using a list of the form
576 <programlisting>(alien-string lisp-symbol)</programlisting></para></listitem>
580 <synopsis>(sb-alien:define-alien-variable name type)</synopsis>
583 The <function>define-alien-variable</function> macro
584 defines <varname>name</varname> as an external foreign variable of the
585 specified foreign <type>type</type>. <varname>name</varname> and <type>type</type> are not
586 evaluated. The Lisp name of the variable (see above) becomes a
587 global alien variable. Global alien variables
588 are effectively ``global symbol macros''; a reference to the
589 variable fetches the contents of the external variable. Similarly,
590 setting the variable stores new contents---the new contents must be
591 of the declared <type>type</type>. Someday, they may well be implemented
592 using the &ANSI; <function>define-symbol-macro</function> mechanism, but
593 as of &SBCL; 0.7.5, they are still implemented using an older
594 more-or-less parallel mechanism inherited from &CMUCL;.
598 For example, to access a C-level counter <varname>foo</varname>, one could
601 (define-alien-variable "foo" int)
602 ;; Now it is possible to get the value of the C variable foo simply by
603 ;; referencing that Lisp variable:
606 (incf foo)</programlisting>
609 <synopsis>(sb-alien:get-errno)</synopsis>
612 Since in modern C libraries, the <varname>errno</varname> "variable" is typically
613 no longer a variable, but some bizarre artificial construct
614 which behaves superficially like a variable within a given thread,
615 it can no longer reliably be accessed through the ordinary
616 <varname>define-alien-variable</varname> mechanism. Instead, &SBCL; provides
617 the operator <function>sb-alien:get-errno</function> to allow Lisp code to read it.
620 <synopsis>(sb-alien:extern-alien name type)</synopsis>
623 The <function>extern-alien</function> macro
624 returns an alien with the specified <type>type</type> which
625 points to an externally defined value.
626 <varname>name</varname> is not evaluated,
627 and may be either a string or a symbol.
628 <type>type</type> is an unevaluated alien type specifier.
635 <sect1 id="foreign-data-structure"><title>Foreign Data Structure Examples</title>
636 <!-- AKA "Alien Data Structure Example" in the CMU CL manual -->
639 Now that we have alien types, operations and variables, we can manipulate
640 foreign data structures. This C declaration
646 can be translated into the following alien type:
647 <programlisting>(define-alien-type nil
650 (b (array (* (struct foo)) 100))))</programlisting>
654 Once the <type>foo</type> alien type has been defined as above,
658 f.b[7].a</programlisting>
659 can be translated in this way:
661 (with-alien ((f (struct foo)))
662 (slot (deref (slot f 'b) 7) 'a)
664 ;; Do something with f...
669 Or consider this example of an external C variable and some accesses:
677 extern struct c_struct *my_struct;
680 my_struct = my_struct->n;</programlisting>
681 which can be manipulated in Lisp like this:
683 (define-alien-type nil
691 (define-alien-variable "my_struct" (* c-struct))
692 (incf (slot my-struct 'x))
693 (setf (slot my-struct 'a) 5)
694 (setq my-struct (slot my-struct 'n))</programlisting>
699 <sect1 id="load-object"><title>Loading Unix Object Files</title>
702 Foreign object files can be loaded into the running Lisp process by
703 calling the functions <function>load-foreign</function> or
704 <function>load-1-foreign</function>.
707 <para> The <function>sb-alien:load-1-foreign</function> function is the more
708 primitive of the two operations. It loads a single object file. into
709 the currently running Lisp. The external symbols defining routines and
710 variables are made available for future external references (e.g. by
711 <function>extern-alien</function>). Forward references to foreign symbols
712 aren't supported: <function>load-1-foreign</function> must be run before any
713 of the defined symbols are referenced.
716 <para><function>sb-alien:load-foreign</function> is built in terms of
717 <function>load-1-foreign</function> and some other machinery
718 like <function>sb-ext:run-program</function>.
719 It accepts a list of files and libraries,
720 and runs the linker on the files and
721 libraries, creating an absolute Unix object file which is then
722 processed by <function>load-1-foreign</function>.</para>
724 <note><para>As of &SBCL; 0.7.5, all foreign code (code loaded
725 with <function>load-1-function</function> or <function>load-function</function>) is
726 lost when a Lisp core is saved with
727 <function>sb-ext:save-lisp-and-die</function>, and no attempt is made to
728 restore it when the core is loaded. Historically this has been an
729 annoyance both for &SBCL; users and for &CMUCL; users.
730 It's hard to solve this problem completely cleanly, but some
731 generally-reliable partial solution might be useful. Once someone in
732 either camp gets sufficiently annoyed to create it, &SBCL; is
733 likely to adopt some mechanism for automatically restoring foreign
734 code when a saved core is loaded.</para></note>
738 <sect1 id="foreign-function-calls"><title>Foreign Function Calls</title>
741 The foreign function call interface allows a Lisp program to call
742 many functions written in languages that use the C calling convention.
746 Lisp sets up various signal handling routines and other environment
747 information when it first starts up, and expects these to be in place
748 at all times. The C functions called by Lisp should not change the
749 environment, especially the signal handlers: the signal handlers
750 installed by Lisp typically have interesting flags set (e.g to request
751 machine context information, or for signal delivery on an alternate
752 stack) which the Lisp runtime relies on for correct operation.
753 Precise details of how this works may change without notice between
754 versions; the source, or the brain of a friendly &SBCL; developer,
755 is the only documentation. Users of a Lisp built with the :sb-thread
756 feature should also read the Threading section
757 <!-- FIXME I'm sure docbook has some syntax for internal links -->
758 of this manual</para>
760 <sect2><title>The <function>alien-funcall</function> Primitive</title>
762 <synopsis>(sb-alien:alien-funcall alien-function &rest arguments)</synopsis>
765 The <function>alien-funcall</function> function is the foreign function call
766 primitive: <varname>alien-function</varname> is called with the supplied
767 <varname>arguments</varname> and its C return value is returned as a Lisp value.
768 The <varname>alien-function</varname> is an arbitrary
769 run-time expression; to refer to a constant function, use
770 <function>extern-alien</function> or a value defined by
771 <function>define-alien-routine</function>.
775 The type of <function>alien-function</function>
776 must be <type>(alien (function ...))</type>
777 or <type>(alien (* (function ...)))</type>.
778 The function type is used to
779 determine how to call the function (as though it was declared with
780 a prototype.) The type need not be known at compile time, but only
781 known-type calls are efficiently compiled. Limitations:
783 <listitem><para>Structure type return values are not implemented.</para></listitem>
784 <listitem><para>Passing of structures by value is not implemented.</para></listitem>
789 Here is an example which allocates a <type>(struct foo)</type>, calls a foreign
790 function to initialize it, then returns a Lisp vector of all the
791 <type>(* (struct foo))</type> objects filled in by the foreign call:
793 ;; Allocate a foo on the stack.
794 (with-alien ((f (struct foo)))
795 ;; Call some C function to fill in foo fields.
796 (alien-funcall (extern-alien "mangle_foo" (function void (* foo)))
798 ;; Find how many foos to use by getting the A field.
799 (let* ((num (slot f 'a))
800 (result (make-array num)))
801 ;; Get a pointer to the array so that we don't have to keep extracting it:
802 (with-alien ((a (* (array (* (struct foo)) 100)) (addr (slot f 'b))))
803 ;; Loop over the first N elements and stash them in the result vector.
805 (setf (svref result i) (deref (deref a) i)))
807 result)))</programlisting>
812 <sect2><title>The <function>define-alien-routine</function> Macro</title>
814 <synopsis>(sb-alien:define-alien-routine} name result-type &rest arg-specifiers)</synopsis>
817 The <function>define-alien-routine</function> macro is a convenience
818 for automatically generating Lisp
819 interfaces to simple foreign functions. The primary feature is the
820 parameter style specification, which translates the C
821 pass-by-reference idiom into additional return values.
825 <varname>name</varname> is usually a string external symbol, but may also be a
826 symbol Lisp name or a list of the foreign name and the Lisp name.
827 If only one name is specified, the other is automatically derived
828 as for <function>extern-alien</function>.
829 <varname>result-type</varname> is the alien type of the return value.
833 Each element of the <varname>arg-specifiers</varname> list
834 specifies an argument to the foreign function, and is
836 <programlisting>(aname atype &optional style)</programlisting>
837 <varname>aname</varname> is the symbol name of the argument to the constructed
838 function (for documentation). <varname>atype</varname> is the alien type of
839 corresponding foreign argument. The semantics of the actual call
840 are the same as for <function>alien-funcall</function>. <varname>style</varname>
841 specifies how this argument should be handled at call and return time,
842 and should be one of the following
844 <listitem><para><varname>:in</varname>specifies that the argument is
845 passed by value. This is the default. <varname>:in</varname> arguments
846 have no corresponding return value from the Lisp function.
848 <listitem><para><varname>:copy</varname> is similar to <varname>:in</varname>,
849 but the argument is copied
850 to a pre-allocated object and a pointer to this object is passed
851 to the foreign routine.</para></listitem>
852 <listitem><para><varname>:out</varname> specifies a pass-by-reference
853 output value. The type of the argument must be a pointer to
854 a fixed-sized object (such as an integer or pointer).
855 <varname>:out</varname> and <varname>:in-out</varname> style cannot
856 be used with pointers to arrays, records or functions. An
857 object of the correct size is allocated on the stack, and
858 its address is passed to the foreign function. When the
859 function returns, the contents
860 of this location are returned as one of the values of the Lisp
861 function (and the location is automatically deallocated).
863 <listitem><para><varname>:in-out</varname> is a combination of
864 <varname>:copy</varname> and <varname>:out</varname>.
865 The argument is copied to a pre-allocated object and a pointer to
866 this object is passed to the foreign routine. On return, the
867 contents of this location is returned as an additional value.
874 Any efficiency-critical foreign interface function should be inline
875 expanded, which can be done by preceding the
876 <function>define-alien-routine</function> call with:
877 <programlisting>(declaim (inline lisp-name))</programlisting>
878 In addition to avoiding the Lisp call overhead, this allows
879 pointers, word-integers and floats to be passed using non-descriptor
880 representations, avoiding consing.)
886 <sect2><title><function>define-alien-routine</function> Example</title>
889 Consider the C function <function>cfoo</function>
890 with the following calling convention:
895 char *a; /* update */
898 /* body of cfoo(...) */
900 This can be described by the following call to
901 <function>define-alien-routine</function>:
903 (define-alien-routine "cfoo" void
906 (i int :out))</programlisting>
907 The Lisp function <function>cfoo</function> will have
908 two arguments (<varname>str</varname> and <varname>a</varname>)
909 and two return values (<varname>a</varname> and <varname>i</varname>).
914 <sect2><title>Calling Lisp From C</title>
917 Calling Lisp functions from C is sometimes possible, but is extremely
918 hackish and poorly supported as of &SBCL; 0.7.5.
919 See <function>funcall0</function> ... <function>funcall3</function> in
920 the runtime system. The
921 arguments must be valid &SBCL; object descriptors (so that
923 left-shifted by 2.) As of &SBCL; 0.7.5, the format
924 of object descriptors is documented only by the source code and, in parts,
925 by the old &CMUCL; "INTERNALS" documentation.</para>
927 <para> Note that the garbage collector moves objects, and won't be
928 able to fix up any references in C variables. There are three
929 mechanisms for coping with this:
932 <listitem><para>The <function>sb-ext:purify</function> moves all live Lisp
933 data into static or read-only areas such that it will never be moved
934 (or freed) again in the life of the Lisp session</para></listitem>
936 <listitem><para><function>sb-sys:with-pinned-objects</function> is a
937 macro which arranges for some set of objects to be pinned in memory
938 for the dynamic extent of its body forms. On ports which use the
939 generational garbage collector (as of &SBCL; 0.8.3, only the x86) this
940 has a page granularity - i.e. the entire 4k page or pages containing
941 the objects will be locked down. On other ports it is implemented by
942 turning off GC for the duration (so could be said to have a
943 whole-world granularity). </para></listitem>
945 <listitem><para>Disable GC, using the <function>without-gcing</function>
946 macro or <function>gc-off</function> call.</para></listitem>
951 <!-- FIXME: This is a "changebar" section from the CMU CL manual.
952 I (WHN 2002-07-14) am not very familiar with this content, so
953 I'm not immediately prepared to try to update it for SBCL, and
954 I'm not feeling masochistic enough to work to encourage this
955 kind of low-level hack anyway. However, I acknowledge that callbacks
956 are sometimes really really necessary, so I include the original
957 text in case someone is hard-core enough to benefit from it. If
958 anyone brings the information up to date for SBCL, it belong
959 either in the main manual or on a CLiki SBCL Internals page.
960 LaTeX \subsection{Accessing Lisp Arrays}
962 LaTeX Due to the way \cmucl{} manages memory, the amount of memory that can
963 LaTeX be dynamically allocated by \code{malloc} or \funref{make-alien} is
964 LaTeX limited\footnote{\cmucl{} mmaps a large piece of memory for it's own
965 LaTeX use and this memory is typically about 8 MB above the start of the C
966 LaTeX heap. Thus, only about 8 MB of memory can be dynamically
969 Empirically determined to be considerably >8Mb on this x86 linux
970 machine, but I don't know what the actual values are - dan 2003.09.01
972 Note that this technique is used in SB-GROVEL in the SBCL contrib
975 LaTeX To overcome this limitation, it is possible to access the content of
976 LaTeX Lisp arrays which are limited only by the amount of physical memory
977 LaTeX and swap space available. However, this technique is only useful if
978 LaTeX the foreign function takes pointers to memory instead of allocating
979 LaTeX memory for itself. In latter case, you will have to modify the
980 LaTeX foreign functions.
982 LaTeX This technique takes advantage of the fact that \cmucl{} has
983 LaTeX specialized array types (\pxlref{specialized-array-types}) that match
984 LaTeX a typical C array. For example, a \code{(simple-array double-float
985 LaTeX (100))} is stored in memory in essentially the same way as the C
986 LaTeX array \code{double x[100]} would be. The following function allows us
987 LaTeX to get the physical address of such a Lisp array:
988 LaTeX \begin{example}
989 LaTeX (defun array-data-address (array)
990 LaTeX "Return the physical address of where the actual data of an array is
993 LaTeX ARRAY must be a specialized array type in CMU Lisp. This means ARRAY
994 LaTeX must be an array of one of the following types:
998 LaTeX (unsigned-byte 32)
999 LaTeX (unsigned-byte 16)
1000 LaTeX (unsigned-byte 8)
1001 LaTeX (signed-byte 32)
1002 LaTeX (signed-byte 16)
1003 LaTeX (signed-byte 8)
1005 LaTeX (declare (type (or #+signed-array (array (signed-byte 8))
1006 LaTeX #+signed-array (array (signed-byte 16))
1007 LaTeX #+signed-array (array (signed-byte 32))
1008 LaTeX (array (unsigned-byte 8))
1009 LaTeX (array (unsigned-byte 16))
1010 LaTeX (array (unsigned-byte 32))
1011 LaTeX (array single-float)
1012 LaTeX (array double-float))
1014 LaTeX (optimize (speed 3) (safety 0))
1015 LaTeX (ext:optimize-interface (safety 3)))
1016 LaTeX ;; with-array-data will get us to the actual data. However, because
1017 LaTeX ;; the array could have been displaced, we need to know where the
1018 LaTeX ;; data starts.
1019 LaTeX (lisp::with-array-data ((data array)
1022 LaTeX (declare (ignore end))
1023 LaTeX ;; DATA is a specialized simple-array. Memory is laid out like this:
1025 LaTeX ;; byte offset Value
1026 LaTeX ;; 0 type code (should be 70 for double-float vector)
1027 LaTeX ;; 4 4 * number of elements in vector
1028 LaTeX ;; 8 1st element of vector
1031 LaTeX (let ((addr (+ 8 (logandc1 7 (kernel:get-lisp-obj-address data))))
1032 LaTeX (type-size (let ((type (array-element-type data)))
1033 LaTeX (cond ((or (equal type '(signed-byte 8))
1034 LaTeX (equal type '(unsigned-byte 8)))
1036 LaTeX ((or (equal type '(signed-byte 16))
1037 LaTeX (equal type '(unsigned-byte 16)))
1039 LaTeX ((or (equal type '(signed-byte 32))
1040 LaTeX (equal type '(unsigned-byte 32)))
1042 LaTeX ((equal type 'single-float)
1044 LaTeX ((equal type 'double-float)
1047 LaTeX (error "Unknown specialized array element type"))))))
1048 LaTeX (declare (type (unsigned-byte 32) addr)
1049 LaTeX (optimize (speed 3) (safety 0) (ext:inhibit-warnings 3)))
1050 LaTeX (system:int-sap (the (unsigned-byte 32)
1051 LaTeX (+ addr (* type-size start)))))))
1054 LaTeX Assume we have the C function below that we wish to use:
1055 LaTeX \begin{example}
1056 LaTeX double dotprod(double* x, double* y, int n)
1059 LaTeX double sum = 0;
1061 LaTeX for (k = 0; k < n; ++k) \{
1062 LaTeX sum += x[k] * y[k];
1066 LaTeX The following example generates two large arrays in Lisp, and calls the C
1067 LaTeX function to do the desired computation. This would not have been
1068 LaTeX possible using \code{malloc} or \code{make-alien} since we need about
1069 LaTeX 16 MB of memory to hold the two arrays.
1070 LaTeX \begin{example}
1071 LaTeX (define-alien-routine "dotprod" double
1072 LaTeX (x (* double-float) :in)
1073 LaTeX (y (* double-float) :in)
1076 LaTeX (let ((x (make-array 1000000 :element-type 'double-float))
1077 LaTeX (y (make-array 1000000 :element-type 'double-float)))
1078 LaTeX ;; Initialize X and Y somehow
1079 LaTeX (let ((x-addr (system:int-sap (array-data-address x)))
1080 LaTeX (y-addr (system:int-sap (array-data-address y))))
1081 LaTeX (dotprod x-addr y-addr 1000000)))
1083 LaTeX In this example, it may be useful to wrap the inner \code{let}
1084 LaTeX expression in an \code{unwind-protect} that first turns off garbage
1085 LaTeX collection and then turns garbage collection on afterwards. This will
1086 LaTeX prevent garbage collection from moving \code{x} and \code{y} after we
1087 LaTeX have obtained the (now erroneous) addresses but before the call to
1088 LaTeX \code{dotprod} is made.
1096 <sect1 id="ffi-example"><title>Step-By-Step Example of the Foreign Function Interface</title>
1099 This section presents a complete example of an interface to a somewhat
1100 complicated C function.
1104 Suppose you have the following C function which you want to be able to
1105 call from Lisp in the file <filename>test.c</filename>
1113 struct c_struct *c_function (i, s, r, a)
1120 struct c_struct *r2;
1122 printf("i = %d\n", i);
1123 printf("s = %s\n", s);
1124 printf("r->x = %d\n", r->x);
1125 printf("r->s = %s\n", r->s);
1126 for (j = 0; j < 10; j++) printf("a[%d] = %d.\n", j, a[j]);
1127 r2 = (struct c_struct *) malloc (sizeof(struct c_struct));
1129 r2->s = "a C string";
1135 It is possible to call this C function from Lisp using the file
1136 <filename>test.lisp</filename> containing
1138 (cl:defpackage "TEST-C-CALL" (:use "CL" "SB-ALIEN" "SB-C-CALL"))
1139 (cl:in-package "TEST-C-CALL")
1141 ;;; Define the record C-STRUCT in Lisp.
1142 (define-alien-type nil
1147 ;;; Define the Lisp function interface to the C routine. It returns a
1148 ;;; pointer to a record of type C-STRUCT. It accepts four parameters:
1149 ;;; I, an int; S, a pointer to a string; R, a pointer to a C-STRUCT
1150 ;;; record; and A, a pointer to the array of 10 ints.
1152 ;;; The INLINE declaration eliminates some efficiency notes about heap
1153 ;;; allocation of alien values.
1154 (declaim (inline c-function))
1155 (define-alien-routine c-function
1156 (* (struct c-struct))
1159 (r (* (struct c-struct)))
1162 ;;; a function which sets up the parameters to the C function and
1163 ;;; actually calls it
1165 (with-alien ((ar (array int 10))
1166 (c-struct (struct c-struct)))
1167 (dotimes (i 10) ; Fill array.
1168 (setf (deref ar i) i))
1169 (setf (slot c-struct 'x) 20)
1170 (setf (slot c-struct 's) "a Lisp string")
1172 (with-alien ((res (* (struct c-struct))
1173 (c-function 5 "another Lisp string" (addr c-struct) ar)))
1174 (format t "~&back from C function~%")
1175 (multiple-value-prog1
1176 (values (slot res 'x)
1179 ;; Deallocate result. (after we are done referring to it:
1180 ;; "Pillage, *then* burn.")
1181 (free-alien res)))))</programlisting>
1185 To execute the above example, it is necessary to compile the C routine,
1187 <userinput>cc -c test.c</userinput>
1188 (In order to enable incremental loading with some linkers, you may need
1190 <userinput>cc -G 0 -c test.c</userinput>)
1194 Once the C code has been compiled, you can start up Lisp and load it in:
1195 <userinput>sbcl</userinput>.
1196 Lisp should start up with its normal prompt.</para>
1200 compile the Lisp file. (This step can be done separately. You don't
1201 have to recompile every time.)
1202 <userinput>(compile-file "test.lisp")</userinput>
1206 Within Lisp, load the foreign object file to define the necessary
1208 <userinput>(load-foreign "test.o")</userinput>.
1209 This must be done before loading any code that refers
1214 Now you can load the compiled Lisp ("fasl") file into Lisp:
1215 <userinput>(load "test.fasl")</userinput>
1216 And once the Lisp file is loaded, you can call the
1217 Lisp routine that sets up the parameters and calls the C
1219 <userinput>(test-c-call::call-cfun)</userinput>
1223 The C routine should print the following information to standard output:
1224 <!-- FIXME: What should be here is a verbatim environment for computer
1225 output, but since I don't know one in DocBook, I made do with
1226 PROGRAMLISTING for now... -->
1227 <programlisting>i = 5
1228 s = another Lisp string
1230 r->s = a Lisp string
1240 a[9] = 9.</programlisting>
1241 After return from the C function,
1242 the Lisp wrapper function should print the following output:
1243 <programlisting>back from C function</programlisting>
1244 And upon return from the Lisp wrapper function,
1245 before the next prompt is printed, the
1246 Lisp read-eval-print loop should print the following return values:
1247 <!-- FIXME: As above, it's not a program listing, but computer output... -->