2 @setfilename ../info/url
3 @settitle URL Programmer's Manual
8 @c @setchapternewpage odd
13 %\global\baselineskip 30pt % for printing in double space
15 @dircategory World Wide Web
16 @dircategory GNU Emacs Lisp
18 * URL: (url). URL loading package.
22 This file documents the URL loading package.
24 Copyright (C) 1996, 1997, 1998, 1999, 2002, 2004 Free Software Foundation
25 Copyright (C) 1993, 1994, 1995, 1996 William M. Perry
27 Permission is granted to copy, distribute and/or modify this document
28 under the terms of the GNU Free Documentation License, Version 1.1 or
29 any later version published by the Free Software Foundation; with the
30 Invariant Sections being
31 ``GNU GENERAL PUBLIC LICENSE''. A copy of the
32 license is included in the section entitled ``GNU Free Documentation
39 @center @titlefont{URL}
40 @center @titlefont{Programmer's Manual}
42 @center First Edition, URL Version 2.0
44 @c @center December 1999
46 @center William M. Perry
47 @center @email{wmperry@@gnu.org}
49 @center @email{fx@@gnu.org}
51 @vskip 0pt plus 1filll
52 Copyright @copyright{} 1993, 1994, 1995, 1996 William M. Perry@*
53 Copyright @copyright{} 1996, 1997, 1998, 1999, 2002 Free Software Foundation
55 Permission is granted to copy, distribute and/or modify this document
56 under the terms of the GNU Free Documentation License, Version 1.1 or
57 any later version published by the Free Software Foundation; with the
58 Invariant Sections being
59 ``GNU GENERAL PUBLIC LICENSE''. A copy of the
60 license is included in the section entitled ``GNU Free Documentation
70 * Getting Started:: Preparing your program to use URLs.
71 * Retrieving URLs:: How to use this package to retrieve a URL.
72 * Supported URL Types:: Descriptions of URL types currently supported.
73 * Defining New URLs:: How to define a URL loader for a new protocol.
74 * General Facilities:: URLs can be cached, accessed via a gateway
75 and tracked in a history list.
76 * Customization:: Variables you can alter.
83 @chapter Getting Started
84 @cindex URLs, definition
87 @dfn{Uniform Resource Locators} (URLs) are a specific form of
88 @dfn{Uniform Resource Identifiers} (URI) described in RFC 2396 which
89 updates RFC 1738 and RFC 1808. RFC 2016 defines uniform resource
92 URIs have the form @var{scheme}:@var{scheme-specific-part}, where the
93 @var{scheme}s supported by this library are described below.
94 @xref{Supported URL Types}.
96 FTP NFS, HTTP, HTTPS, @code{rlogin}, @code{telnet}, tn3270,
97 IRC and gopher URLs all have the form
100 @var{scheme}://@r{[}@var{userinfo}@@@r{]}@var{hostname}@r{[}:@var{port}@r{]}@r{[}/@var{path}@r{]}
103 where @samp{@r{[}} and @samp{@r{]}} delimit optional parts.
104 @var{userinfo} sometimes takes the form @var{username}:@var{password}
105 but you should beware of the security risks of sending cleartext
106 passwords. @var{hostname} may be a domain name or a dotted decimal
107 address. If the @samp{:@var{port}} is omitted then the library will
108 use the `well known' port for that service when accessing URLs. With
109 the possible exception of @code{telnet}, it is rare for ports to be
110 specified, and it is possible using a non-standard port may have
111 undesired consequences if a different service is listening on that
112 port (e.g., an HTTP URL specifying the SMTP port can cause mail to be
113 sent).@c , but @xref{Other Variables, url-bad-port-list}.
115 the @var{path} component depends on the service.
119 * Parsed URLs:: URLs are parsed into vector structures.
123 @section Configuration
125 @defvar url-configuration-directory
126 @cindex @file{~/.url}
127 @cindex configuration files
128 The directory in which URL configuration files, the cache etc.,
129 reside. Default @file{~/.url}.
135 The library functions typically operate on @dfn{parsed} versions of
136 URLs. These are actually vectors of the form:
139 [@var{type} @var{user} @var{password} @var{host} @var{port} @var{file} @var{target} @var{attributes} @var{full}]
145 is the type of the URL scheme, e.g., @code{http}
147 is the username associated with it, or @code{nil};
149 is the user password associated with it, or @code{nil};
151 is the host name associated with it, or @code{nil};
153 is the port number associated with it, or @code{nil};
155 is the `file' part of it, or @code{nil}. This doesn't necessarily
156 actually refer to a file;
158 is the target part, or @code{nil};
160 is the attributes associated with it, or @code{nil};
162 is @code{t} for a fully-specified URL, with a host part indicated by
163 @samp{//} after the scheme part.
173 @findex url-attributes
177 @findex url-set-password
181 @findex url-set-target
182 @findex url-set-attributes
184 These attributes have accessors named @code{url-@var{part}}, where
185 @var{part} is the name of one of the elements above, e.g.,
186 @code{url-host}. Similarly, there are setters of the form
187 @code{url-set-@var{part}}.
189 There are functions for parsing and unparsing between the string and
192 @defun url-generic-parse-url url
193 Return a parsed version of the string @var{url}.
196 @defun url-recreate-url url
197 @cindex unparsing URLs
198 Recreates a URL string from the parsed @var{url}.
201 @node Retrieving URLs
202 @chapter Retrieving URLs
204 @defun url-retrieve-synchronously url
205 Retrieve @var{url} synchronously and return a buffer containing the
206 data. @var{url} is either a string or a parsed URL structure. Return
207 @code{nil} if there are no data associated with it (the case for dired,
208 info, or mailto URLs that need no further processing).
211 @defun url-retrieve url callback &optional cbargs
212 Retrieve @var{url} asynchronously and call @var{callback} with args
213 @var{cbargs} when finished. The callback is called when the object
214 has been completely retrieved, with the current buffer containing the
215 object and any MIME headers associated with it. @var{url} is either a
216 string or a parsed URL structure. Returns the buffer @var{url} will
217 load into, or @code{nil} if the process has already completed.
220 @node Supported URL Types
221 @chapter Supported URL Types
224 * http/https:: Hypertext Transfer Protocol.
225 * file/ftp:: Local files and FTP archives.
226 * info:: Emacs `Info' pages.
227 * mailto:: Sending email.
228 * news/nntp/snews:: Usenet news.
229 * rlogin/telnet/tn3270:: Remote host connectivity.
230 * irc:: Internet Relay Chat.
231 * data:: Embedded data URLs.
232 * nfs:: Networked File System
239 * ldap:: Lightweight Directory Access Protocol
240 * imap:: IMAP mailboxes.
241 * man:: Unix man pages.
245 @section @code{http} and @code{https}
247 The scheme @code{http} is Hypertext Transfer Protocol. The library
248 supports version 1.1, specified in RFC 2616. (This supersedes 1.0,
249 defined in RFC 1945) HTTP URLs have the following form, where most of
250 the parts are optional:
252 http://@var{user}:@var{password}@var{host}:@var{port}/@var{path}?@var{searchpart}#@var{fragment}
254 @c The @code{:@var{port}} part is optional, and @var{port} defaults to
255 @c 80. The @code{/@var{path}} part, if present, is a slash-separated
256 @c series elements. The @code{?@var{searchpart}}, if present, is the
257 @c query for a search or the content of a form submission. The
258 @c @code{#fragment} part, if present, is a location in the document.
260 The scheme @code{https} is a secure version of @code{http}, with
261 transmission via SSL. It is defined in RFC 2069. Its default port is
262 443. This scheme depends on SSL support in Emacs via the
263 @file{ssl.el} library and is actually implemented by forcing the
264 @code{ssl} gateway method to be used. @xref{Gateways in general}.
266 @defopt url-honor-refresh-requests
267 This controls honouring of HTTP @samp{Refresh} headers by which
268 servers can direct clients to reload documents from the same URL or a
269 or different one. @code{nil} means they will not be honoured,
270 @code{t} (the default) means they will always be honoured, and
271 otherwise the user will be asked on each request.
277 * HTTP language/coding::
279 * Dealing with HTTP documents::
285 @defopt url-cookie-file
286 The file in which cookies are stored, defaulting to @file{cookies} in
287 the directory specified by @code{url-configuration-directory}.
290 @defopt url-cookie-confirmation
291 Specifies whether confirmation is require to accept cookies.
294 @defopt url-cookie-multiple-line
295 Specifies whether to put all cookies for the server on one line in the
296 HTTP request to satisfy broken servers like
297 @url{http://www.hotmail.com}.
300 @defopt url-cookie-trusted-urls
301 A list of regular expressions matching URLs from which to accept
305 @defopt url-cookie-untrusted-urls
306 A list of regular expressions matching URLs from which to reject
310 @defopt url-cookie-save-interval
311 The number of seconds between automatic saves of cookies to disk.
316 @node HTTP language/coding
317 @subsection Language and Encoding Preferences
319 HTTP allows clients to express preferences for the language and
320 encoding of documents which servers may honour. For each of these
321 variables, the value is a string; it can specify a single choice, or
322 it can be a comma-separated list.
324 Normally this list ordered by descending preference. However, each
325 element can be followed by @samp{;q=@var{priority}} to specify its
326 preference level, a decimal number from 0 to 1; e.g., for
327 @code{url-mime-language-string}, @w{@code{"de, en-gb;q=0.8,
328 en;q=0.7"}}. An element that has no @samp{;q} specification has
331 @defopt url-mime-charset-string
332 @cindex character sets
333 @cindex coding systems
334 This variable specifies a preference for character sets when documents
335 can be served in more than one encoding.
337 HTTP allows specifying a series of MIME charsets which indicate your
338 preferred character set encodings, e.g., Latin-9 or Big5, and these
339 can be weighted. The default series is generated automatically from
340 the associated MIME types of all defined coding systems, sorted by the
341 coding system priority specified in Emacs. @xref{Recognize Coding, ,
342 Recognizing Coding Systems, emacs, The GNU Emacs Manual}.
345 @defopt url-mime-language-string
346 @cindex language preferences
347 A string specifying the preferred language when servers can serve
348 files in several languages. Use RFC 1766 abbreviations, e.g.,
349 @samp{en} for English, @samp{de} for German.
351 The string can be @code{"*"} to get the first available language (as
352 opposed to the default).
355 @node HTTP URL Options
356 @subsection HTTP URL Options
358 HTTP supports an @samp{OPTIONS} method describing things supported by
361 @defun url-http-options url
362 Returns a property list describing options available for URL. The
363 property list members are:
367 A list of symbols specifying what HTTP methods the resource
372 A list of numbers specifying what DAV protocol/schema versions are
377 A list of supported DASL search types supported (string form).
380 A list of the units available for use in partial document fetches.
384 The @dfn{Platform For Privacy Protection} description for the resource.
385 Currently this is just the raw header contents.
390 @node Dealing with HTTP documents
391 @subsection Dealing with HTTP documents
393 HTTP URLs are retrieved into a buffer containing the HTTP headers
394 followed by the body. Since the headers are quasi-MIME, they may be
395 processed using the MIME library. @xref{Top,, Emacs MIME,
396 emacs-mime, The Emacs MIME Manual}. The URL package provides a
397 function to do this in general:
399 @defun url-decode-text-part handle &optional coding
400 This function decodes charset-encoded text in the current buffer. In
401 Emacs, the buffer is expected to be unibyte initially and is set to
402 multibyte after decoding.
403 HANDLE is the MIME handle of the original part. CODING is an explicit
404 coding to use, overriding what the MIME headers specify.
405 The coding system used for the decoding is returned.
407 Note that this function doesn't deal with @samp{http-equiv} charset
408 specifications in HTML @samp{<meta>} elements.
412 @section file and ftp
415 @cindex File Transfer Protocol
416 @cindex compressed files
420 ftp://@var{user}:@var{password}@@@var{host}:@var{port}/@var{file}
421 file://@var{user}:@var{password}@@@var{host}:@var{port}/@var{file}
424 These schemes are defined in RFC 1808.
425 @samp{ftp:} and @samp{file:} are synonymous in this library. They
426 allow reading arbitrary files from hosts. Either @samp{ange-ftp}
427 (Emacs) or @samp{efs} (XEmacs) is used to retrieve them from remote
428 hosts. Local files are accessed directly.
430 Compressed files are handled, but support is hard-coded so that
431 @code{jka-compr-compression-info-list} and so on have no affect.
432 Suffixes recognized are @samp{.z}, @samp{.gz}, @samp{.Z} and
435 @defopt url-directory-index-file
436 The filename to look for when indexing a directory, default
437 @samp{"index.html"}. If this file exists, and is readable, then it
438 will be viewed instead of using @code{dired} to view the directory.
445 @findex Info-goto-node
448 info:@var{file}#@var{node}
451 Info URLs are not officially defined. They invoke
452 @code{Info-goto-node} with argument @samp{(@var{file})@var{node}}.
453 @samp{#@var{node}} is optional, defaulting to @samp{Top}.
460 A mailto URL will send an email message to the address in the
461 URL, for example @samp{mailto:foo@@bar.com} would compose a
462 message to @samp{foo@@bar.com}.
464 @defopt url-mail-command
465 @vindex mail-user-agent
466 The function called whenever url needs to send mail. This should
467 normally be left to default from @var{mail-user-agent}. @xref{Mail
468 Methods, , Mail-Composition Methods, emacs, The GNU Emacs Manual}.
471 An @samp{X-Url-From} header field containing the URL of the document
472 that contained the mailto URL is added if that URL is known.
474 RFC 2368 extends the definition of mailto URLs in RFC 1738.
475 The form of a mailto URL is
477 @samp{mailto:@var{mailbox}[?@var{header}=@var{contents}[&@var{header}=@var{contents}]]}
479 @noindent where an arbitrary number of @var{header}s can be added. If the
480 @var{header} is @samp{body}, then @var{contents} is put in the body
481 otherwise a @var{header} header field is created with @var{contents}
482 as its contents. Note that the URL library does not consider any
483 headers `dangerous' so you should check them before sending the
487 Email messages are defined in @sc{rfc}822.
489 @node news/nntp/snews
490 @section @code{news}, @code{nntp} and @code{snews}
497 @c draft-gilman-news-url-01
498 The network news URL scheme take the following forms following RFC
499 1738 except that for compatibility with other clients, host and port
500 fields may be included in news URLs though they are properly only
501 allowed for nntp an snews.
504 @item news:@var{newsgroup}
505 Retrieves a list of messages in @var{newsgroup};
506 @item news:@var{message-id}
507 Retrieves the message with the given @var{message-id};
509 Retrieves a list of all available newsgroups;
510 @item nntp://@var{host}:@var{port}/@var{newsgroup}
511 @itemx nntp://@var{host}:@var{port}/@var{message-id}
512 @itemx nntp://@var{host}:@var{port}/*
513 Similar to the @samp{news} versions.
516 @samp{:@var{port}} is optional and defaults to :119.
518 @samp{snews} is the same as @samp{nntp} except that the default port
521 (It is tunneled through SSL.)
523 An @samp{nntp} URL is the same as a news URL, except that the URL may
524 specify an article by its number.
526 @defopt url-news-server
527 This variable can be used to override the default news server.
528 Usually this will be set by the Gnus package, which is used to fetch
530 @cindex environment variable
532 It may be set from the conventional environment variable
536 @node rlogin/telnet/tn3270
537 @section rlogin, telnet and tn3270
541 @cindex terminal emulation
542 @findex terminal-emulator
544 These URL schemes from RFC 1738 for logon via a terminal emulator have
547 telnet://@var{user}:@var{password}@@@var{host}:@var{port}
549 but the @code{:@var{password}} component is ignored.
551 To handle rlogin, telnet and tn3270 URLs, a @code{rlogin},
552 @code{telnet} or @code{tn3270} (the program names and arguments are
553 hardcoded) session is run in a @code{terminal-emulator} buffer.
554 Well-known ports are used if the URL does not specify a port.
559 @cindex Internet Relay Chat
561 @c Fixme: reference (was http://www.w3.org/Addressing/draft-mirashi-url-irc-01.txt)
562 @dfn{Internet Relay Chat} (IRC) is handled by handing off the @sc{irc}
563 session to a function named in @code{url-irc-function}.
565 @defopt url-irc-function
566 A function to actually open an IRC connection.
568 must take five arguments, @var{host}, @var{port}, @var{channel},
569 @var{user} and @var{password}. The @var{channel} argument specifies the
570 channel to join immediately, this can be @code{nil}. By default this is
571 @code{url-irc-zenirc}.
573 @defun url-irc-zenirc host port channel user password
574 Processes the arguments and lets @code{zenirc} handle the session.
582 data:@r{[}@var{media-type}@r{]}@r{[};@var{base64}@r{]},@var{data}
585 Data URLs contain MIME data in the URL itself. They are defined in
588 @var{media-type} is a MIME @samp{Content-Type} string, possibly
589 including parameters. It defaults to
590 @samp{text/plain;charset=US-ASCII}. The @samp{text/plain} can be
591 omitted but the charset parameter supplied. If @samp{;base64} is
592 present, the @var{data} are base64-encoded.
597 @cindex Network File System
601 nfs://@var{user}:@var{password}@@@var{host}:@var{port}/@var{file}
604 The @samp{nfs:} scheme is defined in RFC 2224. It is similar to
605 @samp{ftp:} except that it points to a file on a remote host that is
606 handled by the automounter on the local host.
608 @defvar url-nfs-automounter-directory-spec
610 A string saying how to invoke the NFS automounter. Certain @samp{%}
611 sequences are recognized:
615 The hostname of the NFS server;
617 The port number of the NFS server;
619 The username to use to authenticate;
621 The password to use to authenticate;
623 The filename on the remote server;
628 Each can be used any number of times.
642 @cindex Lightweight Directory Access Protocol
644 The LDAP scheme is defined in RFC 2255.
654 @cindex @command{man}
655 @cindex Unix man pages
659 @samp{man:@var{page-spec}}
662 This is a non-standard scheme. @var{page-spec} is passed directly to
663 the Lisp @code{man} function.
665 @node Defining New URLs
666 @chapter Defining New URLs
669 * Naming conventions::
670 * Required functions::
671 * Optional functions::
672 * Asynchronous fetching::
673 * Supporting file-name-handlers::
676 @node Naming conventions
677 @section Naming conventions
679 @node Required functions
680 @section Required functions
682 @node Optional functions
683 @section Optional functions
685 @node Asynchronous fetching
686 @section Asynchronous fetching
688 @node Supporting file-name-handlers
689 @section Supporting file-name-handlers
691 @node General Facilities
692 @chapter General Facilities
697 * Gateways in general::
702 @section Disk Caching
704 @cindex Persistent Cache
707 The disk cache stores retrieved documents locally, whence they can be
708 retrieved more quickly. When requesting a URL that is in the cache,
709 the library checks to see if the page has changed since it was last
710 retrieved from the remote machine. If not, the local copy is used,
711 saving the transmission over the network.
712 @cindex Cleaning the cache
713 @cindex Clearing the cache
714 @cindex Cache cleaning
715 Currently the cache isn't cleared automatically.
716 @c Running the @code{clean-cache} shell script
717 @c fist is recommended, to allow for future cleaning of the cache. This
718 @c shell script will remove all files that have not been accessed since it
719 @c was last run. To keep the cache pared down, it is recommended that this
720 @c script be run from @i{at} or @i{cron} (see the manual pages for
721 @c crontab(5) or at(1) for more information)
723 @defopt url-automatic-caching
724 Setting this variable non-@code{nil} causes documents to be cached
728 @defopt url-cache-directory
729 This variable specifies the
730 directory to store the cache files. It defaults to sub-directory
731 @file{cache} of @code{url-configuration-directory}.
734 @c Fixme: function v. option, but neither used.
735 @c @findex url-cache-expired
736 @c @defopt url-cache-expired
737 @c This is a function to decide whether or not a cache entry has expired.
738 @c It takes two times as it parameters and returns non-@code{nil} if the
739 @c second time is ``too old'' when compared with the first time.
742 @defopt url-cache-creation-function
743 The cache relies on a scheme for mapping URLs to files in the cache.
744 This variable names a function which sets the type of cache to use.
745 It takes a URL as argument and returns the absolute file name of the
746 corresponding cache file. The two supplied possibilities are
747 @code{url-cache-create-filename-using-md5} and
748 @code{url-cache-create-filename-human-readable}.
751 @defun url-cache-create-filename-using-md5 url
752 Creates a cache file name from @var{url} using MD5 hashing.
754 This is creates entries with very few cache collisions and is fast if
755 you have the @code{md5} function as a primitive (Emacs 21 and XEmacs).
757 (url-cache-create-filename-using-md5 "http://www.example.com/foo/bar")
758 @result{} "/home/fx/.url/cache/fx/http/com/example/www/b8a35774ad20db71c7c3409a5410e74f"
762 @defun url-cache-create-filename-human-readable url
763 Creates a cache file name from @var{url} more obviously connected to
764 @var{url} than for @code{url-cache-create-filename-using-md5}, but
765 more likely to conflict with other files.
767 (url-cache-create-filename-human-readable "http://www.example.com/foo/bar")
768 @result{} "/home/fx/.url/cache/fx/http/com/example/www/foo/bar"
772 @c Fixme: never actually used currently?
773 @c @defopt url-standalone-mode
774 @c @cindex Relying on cache
775 @c @cindex Cache only mode
776 @c @cindex Standalone mode
777 @c If this variable is non-@code{nil}, the library relies solely on the
778 @c cache for fetching documents and avoids checking if they have changed
779 @c on remote servers.
782 @c With a large cache of documents on the local disk, it can be very handy
783 @c when traveling, or any other time the network connection is not active
784 @c (a laptop with a dial-on-demand PPP connection, etc). Emacs/W3 can rely
785 @c solely on its cache, and avoid checking to see if the page has changed
786 @c on the remote server. In the case of a dial-on-demand PPP connection,
787 @c this will keep the phone line free as long as possible, only bringing up
788 @c the PPP connection when asking for a page that is not located in the
789 @c cache. This is very useful for demonstrations as well.
792 @section Proxies and Gatewaying
794 @c fixme: check/document url-ns stuff
795 @cindex proxy servers
797 @cindex environment variables
799 Proxy servers are commonly used to provide gateways through firewalls
800 or as caches serving some more-or-less local network. Each protocol
801 (HTTP, FTP, etc.)@: can have a different gateway server. Proxying is
802 conventionally configured commonly amongst different programs through
803 environment variables of the form @code{@var{protocol}_proxy}, where
804 @var{protocol} is one of the supported network protocols (@code{http},
805 @code{ftp} etc.). The library recognizes such variables in either
806 upper or lower case. Their values are of one of the forms:
808 @item @code{@var{host}:@var{port}}
810 @item Simply a host name.
814 The @code{NO_PROXY} environment variable specifies URLs that should be
815 excluded from proxying (on servers that should be contacted directly).
816 This should be a comma-separated list of hostnames, domain names, or a
817 mixture of both. Asterisks can be used as wildcards, but other
818 clients may not support that. Domain names may be indicated by a
819 leading dot. For example:
821 NO_PROXY="*.aventail.com,home.com,.seanet.com"
823 @noindent says to contact all machines in the @samp{aventail.com} and
824 @samp{seanet.com} domains directly, as well as the machine named
825 @samp{home.com}. If @code{NO_PROXY} isn't defined, @code{no_PROXY}
826 and @code{no_proxy} are also tried, in that order.
828 Proxies may also be specified directly in Lisp.
830 @defopt url-proxy-services
831 This variable is an alist of URL schemes and proxy servers that
832 gateway them. The items are of the form @w{@code{(@var{scheme}
833 . @var{host}:@var{portnumber})}}, says that the URL @var{scheme} is
834 gatewayed through @var{portnumber} on the specified @var{host}. An
835 exception is the pseudo scheme @code{"no_proxy"}, which is paired with
836 a regexp matching host names not to be proxied. This variable is
837 initialized from the environment as above.
840 (setq url-proxy-services
841 '(("http" . "proxy.aventail.com:80")
842 ("no_proxy" . "^.*\\(aventail\\|seanet\\)\\.com")))
846 @node Gateways in general
847 @section Gateways in General
851 The library provides a general gateway layer through which all
852 networking passes. It can both control access to the network and
853 provide access through gateways in firewalls. This may make direct
854 connexions in some cases and pass through some sort of gateway in
855 others.@footnote{Proxies (which only operate over HTTP) are
856 implemented using this.} The library's basic function responsible for
857 making connexions is @code{url-open-stream}.
859 @defun url-open-stream name buffer host service
860 @cindex opening a stream
861 @cindex stream, opening
862 Open a stream to @var{host}, possibly via a gateway. The other
863 arguments are as for @code{open-network-stream}. This will not make a
864 connexion if @code{url-gateway-unplugged} is non-@code{nil}.
867 @defvar url-gateway-local-host-regexp
868 This is a regular expression that matches local hosts that do not
869 require the use of a gateway. If @code{nil}, all connexions are made
873 @defvar url-gateway-method
874 This variable controls which gateway method is used. It may be useful
875 to bind it temporarily in some applications. It has values taken from
876 a list of symbols. Possible values are:
880 @cindex @command{telnet}
881 Use this method if you must first telnet and log into a gateway host,
882 and then run telnet from that host to connect to outside machines.
885 @cindex @command{rlogin}
886 This method is identical to @code{telnet}, but uses @command{rlogin}
887 to log into the remote machine without having to send the username and
888 password over the wire every time.
892 Use if the firewall has a @sc{socks} gateway running on it. The
893 @sc{socks} v5 protocol is defined in RFC 1928.
896 @c This probably shouldn't be documented
897 @c Fixme: why not? -- fx
900 This method uses Emacs's builtin networking directly. This is the
901 default. It can be used only if there is no firewall blocking access.
905 The following variables control the gateway methods.
907 @defopt url-gateway-telnet-host
908 The gateway host to telnet to. Once logged in there, you then telnet
909 out to the hosts you want to connect to.
911 @defopt url-gateway-telnet-parameters
912 This should be a list of parameters to pass to the @command{telnet} program.
914 @defopt url-gateway-telnet-password-prompt
915 This is a regular expression that matches the password prompt when
918 @defopt url-gateway-telnet-login-prompt
919 This is a regular expression that matches the username prompt when
922 @defopt url-gateway-telnet-user-name
923 The username to log in with.
925 @defopt url-gateway-telnet-password
926 The password to send when logging in.
928 @defopt url-gateway-prompt-pattern
929 This is a regular expression that matches the shell prompt.
932 @defopt url-gateway-rlogin-host
933 Host to @samp{rlogin} to before telnetting out.
935 @defopt url-gateway-rlogin-parameters
936 Parametres to pass to @samp{rsh}.
938 @defopt url-gateway-rlogin-user-name
939 User name to use when logging in to the gateway.
941 @defopt url-gateway-prompt-pattern
942 This is a regular expression that matches the shell prompt.
946 This specifies the default server, it takes the form
947 @w{@code{("Default server" @var{server} @var{port} @var{version})}}
948 where @var{version} can be either 4 or 5.
950 @defvar socks-password
951 If this is @code{nil} then you will be asked for the password,
952 otherwise it will be used as the password for authenticating you to
953 the @sc{socks} server.
955 @defvar socks-username
956 This is the username to use when authenticating yourself to the
957 @sc{socks} server. By default this is your login name.
959 @defvar socks-timeout
960 This controls how long, in seconds, to wait for responses from the
961 @sc{socks} server; it is 5 by default.
963 @c fixme: these have been effectively commented-out in the code
964 @c @defopt socks-server-aliases
965 @c This a list of server aliases. It is a list of aliases of the form
966 @c @var{(alias hostname port version)}.
968 @c @defopt socks-network-aliases
969 @c This a list of network aliases. Each entry in the list takes the form
970 @c @var{(alias (network))} where @var{alias} is a string that names the
971 @c @var{network}. The networks can contain a pair (not a dotted pair) of
972 @c @sc{ip} addresses which specify a range of @sc{ip} addresses, an @sc{ip}
973 @c address and a netmask, a domain name or a unique hostname or @sc{ip}
976 @c @defopt socks-redirection-rules
977 @c This a list of redirection rules. Each rule take the form
978 @c @var{(Destination network Connection type)} where @var{Destination
979 @c network} is a network alias from @code{socks-network-aliases} and
980 @c @var{Connection type} can be @code{nil} in which case a direct
981 @c connection is used, or it can be an alias from
982 @c @code{socks-server-aliases} in which case that server is used as a
985 @defopt socks-nslookup-program
986 @cindex @command{nslookup}
987 This the @samp{nslookup} program. It is @code{"nslookup"} by default.
991 * Suppressing network connexions::
993 @c * Broken hostname resolution::
995 @node Suppressing network connexions
996 @subsection Suppressing Network Connexions
998 @cindex network connexions, suppressing
999 @cindex suppressing network connexions
1002 In some circumstances it is desirable to suppress making network
1003 connexions. A typical case is when rendering HTML in a mail user
1004 agent, when external URLs should not be activated, particularly to
1005 avoid `bugs' which `call home' by fetch single-pixel images and the
1006 like. To arrange this, bind the following variable for the duration
1009 @defvar url-gateway-unplugged
1010 If this variable is non-@code{nil} new network connexions are never
1011 opened by the URL library.
1014 @c @node Broken hostname resolution
1015 @c @subsection Broken Hostname Resolution
1017 @c @cindex hostname resolver
1018 @c @cindex resolver, hostname
1019 @c Some C libraries do not include the hostname resolver routines in
1020 @c their static libraries. If Emacs was linked statically, and was not
1021 @c linked with the resolver libraries, it will not be able to get to any
1022 @c machines off the local network. This is characterized by being able
1023 @c to reach someplace with a raw ip number, but not its hostname
1024 @c (@url{http://129.79.254.191/} works, but
1025 @c @url{http://www.cs.indiana.edu/} doesn't). This used to happen on
1026 @c SunOS4 and Ultrix, but is now probably now rare. If Emacs can't be
1027 @c rebuilt linked against the resolver library, it can use the external
1028 @c @command{nslookup} program instead.
1030 @c @defopt url-gateway-broken-resolution
1031 @c @cindex @code{nslookup} program
1032 @c @cindex program, @code{nslookup}
1033 @c If non-@code{nil}, this variable says to use the program specified by
1034 @c @code{url-gateway-nslookup-program} program to do hostname resolution.
1037 @c @defopt url-gateway-nslookup-program
1038 @c The name of the program to do hostname lookup if Emacs can't do it
1039 @c directly. This program should expect a single argument on the command
1040 @c line---the hostname to resolve---and should produce output similar to
1041 @c the standard Unix @command{nslookup} program:
1043 @c Name: www.cs.indiana.edu
1044 @c Address: 129.79.254.191
1051 The library can maintain a global history list tracking URLs accessed.
1052 URL completion can be done from it. The history mechanism is set up
1053 @findex url-do-setup
1054 automatically via @code{url-do-setup} when it is configured to be on.
1055 Note that the size of the history list is currently not limited.
1057 @vindex url-history-hash-table
1058 The history `list' is actually a hash table,
1059 @code{url-history-hash-table}. It contains access times keyed by URL
1060 strings. The times are in the format returned by @code{current-time}.
1062 @defun url-history-update-url url time
1063 This function updates the history table with an entry for @var{url}
1064 accessed at the given @var{time}.
1067 @defopt url-history-track
1068 If non-@code{nil}, the library will keep track of all the URLs
1069 accessed. If is is @code{t}, the list is saved to disk at the end of
1070 each Emacs session. The default is @code{nil}.
1073 @defopt url-history-file
1074 The file storing the history list between sessions. It defaults to
1075 @file{history} in @code{url-configuration-directory}.
1078 @defopt url-history-save-interval
1079 @findex url-history-setup-save-timer
1080 The number of seconds between automatic saves of the history list.
1081 Default is one hour. Note that if you change this variable directly,
1082 rather than using Custom, after @code{url-do-setup} has been run, you
1083 need to run the function @code{url-history-setup-save-timer}.
1086 @defun url-history-parse-history &optional fname
1087 Parses the history file @var{fname} (default @code{url-history-file})
1088 and sets up the history list.
1091 @defun url-history-save-history &optional fname
1092 Saves the current history to file @var{fname} (default
1093 @code{url-history-file}).
1096 @defun url-completion-function string predicate function
1097 You can use this function to do completion of URLs from the history.
1101 @chapter Customization
1103 @section Environment Variables
1105 @cindex environment variables
1106 The following environment variables affect the library's operation at
1112 @vindex url-temporary-directory
1113 If this is defined, @var{url-temporary-directory} is initialized from
1117 @section General User Options
1119 The following user options, settable with Customize, affect the
1120 general operation of the package.
1124 Specifies the types of debug messages the library which are logged to
1125 the @code{*URL-DEBUG*} buffer.
1126 @code{t} means log all messages.
1127 A number means log all messages and show them with @code{message}.
1128 If may also be a list of the types of messages to be logged.
1130 @defopt url-personal-mail-address
1132 @defopt url-privacy-level
1134 @defopt url-uncompressor-alist
1136 @defopt url-passwd-entry-func
1138 @defopt url-standalone-mode
1140 @defopt url-bad-port-list
1142 @defopt url-max-password-attempts
1144 @defopt url-temporary-directory
1146 @defopt url-show-status
1148 @defopt url-confirmation-func
1149 The function to use for asking yes or no functions. This is normally
1150 either @code{y-or-n-p} or @code{yes-or-no-p}, but could be another
1151 function taking a single argument (the prompt) and returning @code{t}
1152 only if an affirmative answer is given.
1154 @defopt url-gateway-method
1155 @c fixme: describe gatewaying
1156 A symbol specifying the type of gateway support to use fro connexions
1157 from the local machine. The supported methods are:
1161 Run telnet in a subprocess to connect;
1163 Rlogin to another machine to connect;
1165 Connect through a socks server;
1173 @node Function Index
1174 @unnumbered Command and Function Index
1177 @node Variable Index
1178 @unnumbered Variable Index
1182 @unnumbered Concept Index
1185 @setchapternewpage odd
1190 arch-tag: c96be356-7e2d-4196-bcda-b13246c5c3f0