Bug #1627575: Added _open() method to FileHandler which can be used to reopen files...
[python.git] / Doc / lib / libsocksvr.tex
blobc7b28ea0dc4c212b50085c91b9e406c79db36ff6
1 \section{\module{SocketServer} ---
2 A framework for network servers}
4 \declaremodule{standard}{SocketServer}
5 \modulesynopsis{A framework for network servers.}
8 The \module{SocketServer} module simplifies the task of writing network
9 servers.
11 There are four basic server classes: \class{TCPServer} uses the
12 Internet TCP protocol, which provides for continuous streams of data
13 between the client and server. \class{UDPServer} uses datagrams, which
14 are discrete packets of information that may arrive out of order or be
15 lost while in transit. The more infrequently used
16 \class{UnixStreamServer} and \class{UnixDatagramServer} classes are
17 similar, but use \UNIX{} domain sockets; they're not available on
18 non-\UNIX{} platforms. For more details on network programming, consult
19 a book such as W. Richard Steven's \citetitle{UNIX Network Programming}
20 or Ralph Davis's \citetitle{Win32 Network Programming}.
22 These four classes process requests \dfn{synchronously}; each request
23 must be completed before the next request can be started. This isn't
24 suitable if each request takes a long time to complete, because it
25 requires a lot of computation, or because it returns a lot of data
26 which the client is slow to process. The solution is to create a
27 separate process or thread to handle each request; the
28 \class{ForkingMixIn} and \class{ThreadingMixIn} mix-in classes can be
29 used to support asynchronous behaviour.
31 Creating a server requires several steps. First, you must create a
32 request handler class by subclassing the \class{BaseRequestHandler}
33 class and overriding its \method{handle()} method; this method will
34 process incoming requests. Second, you must instantiate one of the
35 server classes, passing it the server's address and the request
36 handler class. Finally, call the \method{handle_request()} or
37 \method{serve_forever()} method of the server object to process one or
38 many requests.
40 When inheriting from \class{ThreadingMixIn} for threaded connection
41 behavior, you should explicitly declare how you want your threads
42 to behave on an abrupt shutdown. The \class{ThreadingMixIn} class
43 defines an attribute \var{daemon_threads}, which indicates whether
44 or not the server should wait for thread termination. You should
45 set the flag explicitly if you would like threads to behave
46 autonomously; the default is \constant{False}, meaning that Python
47 will not exit until all threads created by \class{ThreadingMixIn} have
48 exited.
50 Server classes have the same external methods and attributes, no
51 matter what network protocol they use:
53 \setindexsubitem{(SocketServer protocol)}
55 \subsection{Server Creation Notes}
57 There are five classes in an inheritance diagram, four of which represent
58 synchronous servers of four types:
60 \begin{verbatim}
61 +------------+
62 | BaseServer |
63 +------------+
66 +-----------+ +------------------+
67 | TCPServer |------->| UnixStreamServer |
68 +-----------+ +------------------+
71 +-----------+ +--------------------+
72 | UDPServer |------->| UnixDatagramServer |
73 +-----------+ +--------------------+
74 \end{verbatim}
76 Note that \class{UnixDatagramServer} derives from \class{UDPServer}, not
77 from \class{UnixStreamServer} --- the only difference between an IP and a
78 \UNIX{} stream server is the address family, which is simply repeated in both
79 \UNIX{} server classes.
81 Forking and threading versions of each type of server can be created using
82 the \class{ForkingMixIn} and \class{ThreadingMixIn} mix-in classes. For
83 instance, a threading UDP server class is created as follows:
85 \begin{verbatim}
86 class ThreadingUDPServer(ThreadingMixIn, UDPServer): pass
87 \end{verbatim}
89 The mix-in class must come first, since it overrides a method defined in
90 \class{UDPServer}. Setting the various member variables also changes the
91 behavior of the underlying server mechanism.
93 To implement a service, you must derive a class from
94 \class{BaseRequestHandler} and redefine its \method{handle()} method. You
95 can then run various versions of the service by combining one of the server
96 classes with your request handler class. The request handler class must be
97 different for datagram or stream services. This can be hidden by using the
98 handler subclasses \class{StreamRequestHandler} or \class{DatagramRequestHandler}.
100 Of course, you still have to use your head! For instance, it makes no sense
101 to use a forking server if the service contains state in memory that can be
102 modified by different requests, since the modifications in the child process
103 would never reach the initial state kept in the parent process and passed to
104 each child. In this case, you can use a threading server, but you will
105 probably have to use locks to protect the integrity of the shared data.
107 On the other hand, if you are building an HTTP server where all data is
108 stored externally (for instance, in the file system), a synchronous class
109 will essentially render the service "deaf" while one request is being
110 handled -- which may be for a very long time if a client is slow to receive
111 all the data it has requested. Here a threading or forking server is
112 appropriate.
114 In some cases, it may be appropriate to process part of a request
115 synchronously, but to finish processing in a forked child depending on the
116 request data. This can be implemented by using a synchronous server and
117 doing an explicit fork in the request handler class \method{handle()}
118 method.
120 Another approach to handling multiple simultaneous requests in an
121 environment that supports neither threads nor \function{fork()} (or where
122 these are too expensive or inappropriate for the service) is to maintain an
123 explicit table of partially finished requests and to use \function{select()}
124 to decide which request to work on next (or whether to handle a new incoming
125 request). This is particularly important for stream services where each
126 client can potentially be connected for a long time (if threads or
127 subprocesses cannot be used).
129 %XXX should data and methods be intermingled, or separate?
130 % how should the distinction between class and instance variables be
131 % drawn?
133 \subsection{Server Objects}
135 \begin{funcdesc}{fileno}{}
136 Return an integer file descriptor for the socket on which the server
137 is listening. This function is most commonly passed to
138 \function{select.select()}, to allow monitoring multiple servers in the
139 same process.
140 \end{funcdesc}
142 \begin{funcdesc}{handle_request}{}
143 Process a single request. This function calls the following methods
144 in order: \method{get_request()}, \method{verify_request()}, and
145 \method{process_request()}. If the user-provided \method{handle()}
146 method of the handler class raises an exception, the server's
147 \method{handle_error()} method will be called.
148 \end{funcdesc}
150 \begin{funcdesc}{serve_forever}{}
151 Handle an infinite number of requests. This simply calls
152 \method{handle_request()} inside an infinite loop.
153 \end{funcdesc}
155 \begin{datadesc}{address_family}
156 The family of protocols to which the server's socket belongs.
157 \constant{socket.AF_INET} and \constant{socket.AF_UNIX} are two
158 possible values.
159 \end{datadesc}
161 \begin{datadesc}{RequestHandlerClass}
162 The user-provided request handler class; an instance of this class is
163 created for each request.
164 \end{datadesc}
166 \begin{datadesc}{server_address}
167 The address on which the server is listening. The format of addresses
168 varies depending on the protocol family; see the documentation for the
169 socket module for details. For Internet protocols, this is a tuple
170 containing a string giving the address, and an integer port number:
171 \code{('127.0.0.1', 80)}, for example.
172 \end{datadesc}
174 \begin{datadesc}{socket}
175 The socket object on which the server will listen for incoming requests.
176 \end{datadesc}
178 % XXX should class variables be covered before instance variables, or
179 % vice versa?
181 The server classes support the following class variables:
183 \begin{datadesc}{allow_reuse_address}
184 Whether the server will allow the reuse of an address. This defaults
185 to \constant{False}, and can be set in subclasses to change the policy.
186 \end{datadesc}
188 \begin{datadesc}{request_queue_size}
189 The size of the request queue. If it takes a long time to process a
190 single request, any requests that arrive while the server is busy are
191 placed into a queue, up to \member{request_queue_size} requests. Once
192 the queue is full, further requests from clients will get a
193 ``Connection denied'' error. The default value is usually 5, but this
194 can be overridden by subclasses.
195 \end{datadesc}
197 \begin{datadesc}{socket_type}
198 The type of socket used by the server; \constant{socket.SOCK_STREAM}
199 and \constant{socket.SOCK_DGRAM} are two possible values.
200 \end{datadesc}
202 There are various server methods that can be overridden by subclasses
203 of base server classes like \class{TCPServer}; these methods aren't
204 useful to external users of the server object.
206 % should the default implementations of these be documented, or should
207 % it be assumed that the user will look at SocketServer.py?
209 \begin{funcdesc}{finish_request}{}
210 Actually processes the request by instantiating
211 \member{RequestHandlerClass} and calling its \method{handle()} method.
212 \end{funcdesc}
214 \begin{funcdesc}{get_request}{}
215 Must accept a request from the socket, and return a 2-tuple containing
216 the \emph{new} socket object to be used to communicate with the
217 client, and the client's address.
218 \end{funcdesc}
220 \begin{funcdesc}{handle_error}{request, client_address}
221 This function is called if the \member{RequestHandlerClass}'s
222 \method{handle()} method raises an exception. The default action is
223 to print the traceback to standard output and continue handling
224 further requests.
225 \end{funcdesc}
227 \begin{funcdesc}{process_request}{request, client_address}
228 Calls \method{finish_request()} to create an instance of the
229 \member{RequestHandlerClass}. If desired, this function can create a
230 new process or thread to handle the request; the \class{ForkingMixIn}
231 and \class{ThreadingMixIn} classes do this.
232 \end{funcdesc}
234 % Is there any point in documenting the following two functions?
235 % What would the purpose of overriding them be: initializing server
236 % instance variables, adding new network families?
238 \begin{funcdesc}{server_activate}{}
239 Called by the server's constructor to activate the server. The default
240 behavior just \method{listen}s to the server's socket.
241 May be overridden.
242 \end{funcdesc}
244 \begin{funcdesc}{server_bind}{}
245 Called by the server's constructor to bind the socket to the desired
246 address. May be overridden.
247 \end{funcdesc}
249 \begin{funcdesc}{verify_request}{request, client_address}
250 Must return a Boolean value; if the value is \constant{True}, the request will be
251 processed, and if it's \constant{False}, the request will be denied.
252 This function can be overridden to implement access controls for a server.
253 The default implementation always returns \constant{True}.
254 \end{funcdesc}
256 \subsection{RequestHandler Objects}
258 The request handler class must define a new \method{handle()} method,
259 and can override any of the following methods. A new instance is
260 created for each request.
262 \begin{funcdesc}{finish}{}
263 Called after the \method{handle()} method to perform any clean-up
264 actions required. The default implementation does nothing. If
265 \method{setup()} or \method{handle()} raise an exception, this
266 function will not be called.
267 \end{funcdesc}
269 \begin{funcdesc}{handle}{}
270 This function must do all the work required to service a request.
271 The default implementation does nothing.
272 Several instance attributes are available to it; the request is
273 available as \member{self.request}; the client address as
274 \member{self.client_address}; and the server instance as
275 \member{self.server}, in case it needs access to per-server
276 information.
278 The type of \member{self.request} is different for datagram or stream
279 services. For stream services, \member{self.request} is a socket
280 object; for datagram services, \member{self.request} is a string.
281 However, this can be hidden by using the request handler subclasses
282 \class{StreamRequestHandler} or \class{DatagramRequestHandler}, which
283 override the \method{setup()} and \method{finish()} methods, and
284 provide \member{self.rfile} and \member{self.wfile} attributes.
285 \member{self.rfile} and \member{self.wfile} can be read or written,
286 respectively, to get the request data or return data to the client.
287 \end{funcdesc}
289 \begin{funcdesc}{setup}{}
290 Called before the \method{handle()} method to perform any
291 initialization actions required. The default implementation does
292 nothing.
293 \end{funcdesc}