1 @node Name Service Switch, Users and Groups, Job Control, Top
2 @chapter System Databases and Name Service Switch
3 @c %MENU% Accessing system databases
4 @cindex Name Service Switch
8 Various functions in the C Library need to be configured to work
9 correctly in the local environment. Traditionally, this was done by
10 using files (e.g., @file{/etc/passwd}), but other nameservices (like the
11 Network Information Service (NIS) and the Domain Name Service (DNS))
12 became popular, and were hacked into the C library, usually with a fixed
15 @Theglibc{} contains a cleaner solution to this problem. It is
16 designed after a method used by Sun Microsystems in the C library of
17 @w{Solaris 2}. @Theglibc{} follows their name and calls this
18 scheme @dfn{Name Service Switch} (NSS).
20 Though the interface might be similar to Sun's version there is no
21 common code. We never saw any source code of Sun's implementation and
22 so the internal interface is incompatible. This also manifests in the
23 file names we use as we will see later.
27 * NSS Basics:: What is this NSS good for.
28 * NSS Configuration File:: Configuring NSS.
29 * NSS Module Internals:: How does it work internally.
30 * Extending NSS:: What to do to add services or databases.
33 @node NSS Basics, NSS Configuration File, Name Service Switch, Name Service Switch
36 The basic idea is to put the implementation of the different services
37 offered to access the databases in separate modules. This has some
42 Contributors can add new services without adding them to @theglibc{}.
44 The modules can be updated separately.
46 The C library image is smaller.
49 To fulfill the first goal above, the ABI of the modules will be described
50 below. For getting the implementation of a new service right it is
51 important to understand how the functions in the modules get called.
52 They are in no way designed to be used by the programmer directly.
53 Instead the programmer should only use the documented and standardized
54 functions to access the databases.
57 The databases available in the NSS are
76 @comment @pxref{Mail Aliases}.
79 @comment @pxref{Ethernet Numbers}.
81 Groups of users, @pxref{Group Database}.
83 Group passphrase hashes and related information.
85 Host names and numbers, @pxref{Host Names}.
87 Supplementary group access list.
89 Network wide list of host and users, @pxref{Netgroup Database}.
91 Network names and numbers, @pxref{Networks Database}.
93 User identities, @pxref{User Database}.
95 Network protocols, @pxref{Protocols Database}.
97 Public keys for Secure RPC.
99 Remote procedure call names and numbers.
100 @comment @pxref{RPC Database}.
102 Network services, @pxref{Services Database}.
104 User passphrase hashes and related information.
105 @comment @pxref{Shadow Passphrase Database}.
109 @c We currently don't implement automount, netmasks, or bootparams.
110 More databases may be added later.
112 @node NSS Configuration File, NSS Module Internals, NSS Basics, Name Service Switch
113 @section The NSS Configuration File
115 @cindex @file{/etc/nsswitch.conf}
116 @cindex @file{nsswitch.conf}
117 Somehow the NSS code must be told about the wishes of the user. For
118 this reason there is the file @file{/etc/nsswitch.conf}. For each
119 database, this file contains a specification of how the lookup process should
120 work. The file could look like this:
123 @include nsswitch.texi
126 The first column is the database as you can guess from the table above.
127 The rest of the line specifies how the lookup process works. Please
128 note that you specify the way it works for each database individually.
129 This cannot be done with the old way of a monolithic implementation.
131 The configuration specification for each database can contain two
136 the service specification like @code{files}, @code{db}, or @code{nis}.
138 the reaction on lookup result like @code{[NOTFOUND=return]}.
142 * Services in the NSS configuration:: Service names in the NSS configuration.
143 * Actions in the NSS configuration:: React appropriately to the lookup result.
144 * Notes on NSS Configuration File:: Things to take care about while
148 @node Services in the NSS configuration, Actions in the NSS configuration, NSS Configuration File, NSS Configuration File
149 @subsection Services in the NSS configuration File
151 The above example file mentions five different services: @code{files},
152 @code{db}, @code{dns}, @code{nis}, and @code{nisplus}. This does not
154 services are available on all sites and neither does it mean these are
155 all the services which will ever be available.
157 In fact, these names are simply strings which the NSS code uses to find
158 the implicitly addressed functions. The internal interface will be
159 described later. Visible to the user are the modules which implement an
162 Assume the service @var{name} shall be used for a lookup. The code for
163 this service is implemented in a module called @file{libnss_@var{name}}.
164 On a system supporting shared libraries this is in fact a shared library
165 with the name (for example) @file{libnss_@var{name}.so.2}. The number
166 at the end is the currently used version of the interface which will not
167 change frequently. Normally the user should not have to be cognizant of
168 these files since they should be placed in a directory where they are
169 found automatically. Only the names of all available services are
172 Lastly, some system software may make use of the NSS configuration file
173 to store their own configuration for similar purposes. Examples of this
174 include the @code{automount} service which is used by @code{autofs}.
176 @node Actions in the NSS configuration, Notes on NSS Configuration File, Services in the NSS configuration, NSS Configuration File
177 @subsection Actions in the NSS configuration
179 The second item in the specification gives the user much finer control
180 on the lookup process. Action items are placed between two service
181 names and are written within brackets. The general form is
184 @code{[} ( @code{!}? @var{status} @code{=} @var{action} )+ @code{]}
191 @var{status} @result{} success | notfound | unavail | tryagain
192 @var{action} @result{} return | continue
195 The case of the keywords is insignificant. The @var{status}
196 values are the results of a call to a lookup function of a specific
201 No error occurred and the wanted entry is returned. The default action
202 for this is @code{return}.
205 The lookup process works ok but the needed value was not found. The
206 default action is @code{continue}.
209 @cindex DNS server unavailable
210 The service is permanently unavailable. This can either mean the needed
211 file is not available, or, for DNS, the server is not available or does
212 not allow queries. The default action is @code{continue}.
215 The service is temporarily unavailable. This could mean a file is
216 locked or a server currently cannot accept more connections. The
217 default action is @code{continue}.
221 The @var{action} values mean:
226 If the status matches, stop the lookup process at this service
227 specification. If an entry is available, provide it to the application.
228 If an error occurred, report it to the application. In case of a prior
229 @samp{merge} action, the data is combined with previous lookup results,
234 If the status matches, proceed with the lookup process at the next
235 entry, discarding the result of the current lookup (and any merged
236 data). An exception is the @samp{initgroups} database and the
237 @samp{success} status, where @samp{continue} acts like @code{merge}
242 Proceed with the lookup process, retaining the current lookup result.
243 This action is useful only with the @samp{success} status. If a
244 subsequent service lookup succeeds and has a matching @samp{return}
245 specification, the results are merged, the lookup process ends, and the
246 merged results are returned to the application. If the following service
247 has a matching @samp{merge} action, the lookup process continues,
248 retaining the combined data from this and any previous lookups.
250 After a @code{merge} action, errors from subsequent lookups are ignored,
251 and the data gathered so far will be returned.
253 The @samp{merge} only applies to the @samp{success} status. It is
254 currently implemented for the @samp{group} database and its group
255 members field, @samp{gr_mem}. If specified for other databases, it
256 causes the lookup to fail (if the @var{status} matches).
258 When processing @samp{merge} for @samp{group} membership, the group GID
259 and name must be identical for both entries. If only one or the other is
260 a match, the behavior is undefined.
265 If we have a line like
268 ethers: nisplus [NOTFOUND=return] db files
272 this is equivalent to
275 ethers: nisplus [SUCCESS=return NOTFOUND=return UNAVAIL=continue
277 db [SUCCESS=return NOTFOUND=continue UNAVAIL=continue
283 (except that it would have to be written on one line). The default
284 value for the actions are normally what you want, and only need to be
285 changed in exceptional cases.
287 If the optional @code{!} is placed before the @var{status} this means
288 the following action is used for all statuses but @var{status} itself.
289 I.e., @code{!} is negation as in the C language (and others).
291 Before we explain the exception which makes this action item necessary
292 one more remark: obviously it makes no sense to add another action
293 item after the @code{files} service. Since there is no other service
294 following the action @emph{always} is @code{return}.
296 @cindex nisplus, and completeness
297 Now, why is this @code{[NOTFOUND=return]} action useful? To understand
298 this we should know that the @code{nisplus} service is often
299 complete; i.e., if an entry is not available in the NIS+ tables it is
300 not available anywhere else. This is what is expressed by this action
301 item: it is useless to examine further services since they will not give
304 @cindex nisplus, and booting
305 @cindex bootstrapping, and services
306 The situation would be different if the NIS+ service is not available
307 because the machine is booting. In this case the return value of the
308 lookup function is not @code{notfound} but instead @code{unavail}. And
309 as you can see in the complete form above: in this situation the
310 @code{db} and @code{files} services are used. Neat, isn't it? The
311 system administrator need not pay special care for the time the system
312 is not completely ready to work (while booting or shutdown or
316 @node Notes on NSS Configuration File, , Actions in the NSS configuration, NSS Configuration File
317 @subsection Notes on the NSS Configuration File
319 Finally a few more hints. The NSS implementation is not completely
320 helpless if @file{/etc/nsswitch.conf} does not exist. For
321 all supported databases there is a default value so it should normally
322 be possible to get the system running even if the file is corrupted or
325 @cindex default value, and NSS
326 For the @code{hosts} and @code{networks} databases the default value is
327 @code{files dns}. I.e., local configuration will override the contents
328 of the domain name system (DNS).
330 The @code{passwd}, @code{group}, and @code{shadow} databases was
331 traditionally handled in a special way. The appropriate files in the
332 @file{/etc} directory were read but if an entry with a name starting
333 with a @code{+} character was found NIS was used. This kind of lookup
334 was removed and now the default value for the services is @code{files}.
335 libnss_compat no longer depends on libnsl and can be used without NIS.
337 For all other databases the default value is @code{files}.
339 @cindex optimizing NSS
340 A second point is that the user should try to optimize the lookup
341 process. The different service have different response times.
342 A simple file look up on a local file could be fast, but if the file
343 is long and the needed entry is near the end of the file this may take
344 quite some time. In this case it might be better to use the @code{db}
345 service which allows fast local access to large data sets.
347 Often the situation is that some global information like NIS must be
348 used. So it is unavoidable to use service entries like @code{nis} etc.
349 But one should avoid slow services like this if possible.
352 @node NSS Module Internals, Extending NSS, NSS Configuration File, Name Service Switch
353 @section NSS Module Internals
355 Now it is time to describe what the modules look like. The functions
356 contained in a module are identified by their names. I.e., there is no
357 jump table or the like. How this is done is of no interest here; those
358 interested in this topic should read about Dynamic Linking.
359 @comment @ref{Dynamic Linking}.
363 * NSS Module Names:: Construction of the interface function of
365 * NSS Modules Interface:: Programming interface in the NSS module
369 @node NSS Module Names, NSS Modules Interface, NSS Module Internals, NSS Module Internals
370 @subsection The Naming Scheme of the NSS Modules
373 The name of each function consists of various parts:
376 _nss_@var{service}_@var{function}
379 @var{service} of course corresponds to the name of the module this
380 function is found in.@footnote{Now you might ask why this information is
381 duplicated. The answer is that we want to make it possible to link
382 directly with these shared objects.} The @var{function} part is derived
383 from the interface function in the C library itself. If the user calls
384 the function @code{gethostbyname} and the service used is @code{files}
388 _nss_files_gethostbyname_r
399 @cindex reentrant NSS functions
400 is used. You see, what is explained above in not the whole truth. In
401 fact the NSS modules only contain reentrant versions of the lookup
402 functions. I.e., if the user would call the @code{gethostbyname_r}
403 function this also would end in the above function. For all user
404 interface functions the C library maps this call to a call to the
405 reentrant function. For reentrant functions this is trivial since the
406 interface is (nearly) the same. For the non-reentrant version the
407 library keeps internal buffers which are used to replace the user
410 I.e., the reentrant functions @emph{can} have counterparts. No service
411 module is forced to have functions for all databases and all kinds to
412 access them. If a function is not available it is simply treated as if
413 the function would return @code{unavail}
414 (@pxref{Actions in the NSS configuration}).
416 The file name @file{libnss_files.so.2} would be on a @w{Solaris 2}
417 system @file{nss_files.so.2}. This is the difference mentioned above.
418 Sun's NSS modules are usable as modules which get indirectly loaded
421 The NSS modules in @theglibc{} are prepared to be used as normal
422 libraries themselves. This is @emph{not} true at the moment, though.
423 However, the organization of the name space in the modules does not make it
424 impossible like it is for Solaris. Now you can see why the modules are
425 still libraries.@footnote{There is a second explanation: we were too
426 lazy to change the Makefiles to allow the generation of shared objects
427 not starting with @file{lib} but don't tell this to anybody.}
430 @node NSS Modules Interface, , NSS Module Names, NSS Module Internals
431 @subsection The Interface of the Function in NSS Modules
433 Now we know about the functions contained in the modules. It is now
434 time to describe the types. When we mentioned the reentrant versions of
435 the functions above, this means there are some additional arguments
436 (compared with the standard, non-reentrant versions). The prototypes for
437 the non-reentrant and reentrant versions of our function above are:
440 struct hostent *gethostbyname (const char *name)
442 int gethostbyname_r (const char *name, struct hostent *result_buf,
443 char *buf, size_t buflen, struct hostent **result,
448 The actual prototype of the function in the NSS modules in this case is
451 enum nss_status _nss_files_gethostbyname_r (const char *name,
452 struct hostent *result_buf,
453 char *buf, size_t buflen,
454 int *errnop, int *h_errnop)
457 I.e., the interface function is in fact the reentrant function with the
458 change of the return value, the omission of the @var{result} parameter,
459 and the addition of the @var{errnop} parameter. While the user-level
460 function returns a pointer to the result the reentrant function return
461 an @code{enum nss_status} value:
464 @item NSS_STATUS_TRYAGAIN
465 numeric value @code{-2}
467 @item NSS_STATUS_UNAVAIL
468 numeric value @code{-1}
470 @item NSS_STATUS_NOTFOUND
471 numeric value @code{0}
473 @item NSS_STATUS_SUCCESS
474 numeric value @code{1}
478 Now you see where the action items of the @file{/etc/nsswitch.conf} file
481 If you study the source code you will find there is a fifth value:
482 @code{NSS_STATUS_RETURN}. This is an internal use only value, used by a
483 few functions in places where none of the above value can be used. If
484 necessary the source code should be examined to learn about the details.
486 In case the interface function has to return an error it is important
487 that the correct error code is stored in @code{*@var{errnop}}. Some
488 return status values have only one associated error code, others have
491 @multitable @columnfractions .3 .2 .50
493 @code{NSS_STATUS_TRYAGAIN} @tab
494 @code{EAGAIN} @tab One of the functions used ran temporarily out of
495 resources or a service is currently not available.
498 @code{ERANGE} @tab The provided buffer is not large enough.
499 The function should be called again with a larger buffer.
501 @code{NSS_STATUS_UNAVAIL} @tab
502 @code{ENOENT} @tab A necessary input file cannot be found.
504 @code{NSS_STATUS_NOTFOUND} @tab
505 @code{ENOENT} @tab The requested entry is not available.
508 @code{NSS_STATUS_NOTFOUND} @tab
509 @code{SUCCESS} @tab There are no entries.
510 Use this to avoid returning errors for inactive services which may
511 be enabled at a later time. This is not the same as the service
512 being temporarily unavailable.
515 These are proposed values. There can be other error codes and the
516 described error codes can have different meaning. @strong{With one
517 exception:} when returning @code{NSS_STATUS_TRYAGAIN} the error code
518 @code{ERANGE} @emph{must} mean that the user provided buffer is too
519 small. Everything else is non-critical.
521 In statically linked programs, the main application and NSS modules do
522 not share the same thread-local variable @code{errno}, which is the
523 reason why there is an explicit @var{errnop} function argument.
525 The above function has something special which is missing for almost all
526 the other module functions. There is an argument @var{h_errnop}. This
527 points to a variable which will be filled with the error code in case
528 the execution of the function fails for some reason. (In statically
529 linked programs, the thread-local variable @code{h_errno} is not shared
530 with the main application.)
532 The @code{get@var{XXX}by@var{YYY}} functions are the most important
533 functions in the NSS modules. But there are others which implement
534 the other ways to access system databases (say for the
535 user database, there are @code{setpwent}, @code{getpwent}, and
536 @code{endpwent}). These will be described in more detail later.
537 Here we give a general way to determine the
538 signature of the module function:
542 the return value is @code{enum nss_status};
544 the name (@pxref{NSS Module Names});
546 the first arguments are identical to the arguments of the non-reentrant
549 the next four arguments are:
552 @item STRUCT_TYPE *result_buf
553 pointer to buffer where the result is stored. @code{STRUCT_TYPE} is
554 normally a struct which corresponds to the database.
556 pointer to a buffer where the function can store additional data for
559 length of the buffer pointed to by @var{buffer}.
561 the low-level error code to return to the application. If the return
562 value is not @code{NSS_STATUS_SUCCESS}, @code{*@var{errnop}} needs to be
563 set to a non-zero value. An NSS module should never set
564 @code{*@var{errnop}} to zero. The value @code{ERANGE} is special, as
569 possibly a last argument @var{h_errnop}, for the host name and network
570 name lookup functions. If the return value is not
571 @code{NSS_STATUS_SUCCESS}, @code{*@var{h_errnop}} needs to be set to a
572 non-zero value. A generic error code is @code{NETDB_INTERNAL}, which
573 instructs the caller to examine @code{*@var{errnop}} for further
574 details. (This includes the @code{ERANGE} special case.)
578 This table is correct for all functions but the @code{set@dots{}ent}
579 and @code{end@dots{}ent} functions.
582 @node Extending NSS, , NSS Module Internals, Name Service Switch
583 @section Extending NSS
585 One of the advantages of NSS mentioned above is that it can be extended
586 quite easily. There are two ways in which the extension can happen:
587 adding another database or adding another service. The former is
588 normally done only by the C library developers. It is
589 here only important to remember that adding another database is
590 independent from adding another service because a service need not
591 support all databases or lookup functions.
593 A designer/implementer of a new service is therefore free to choose the
594 databases s/he is interested in and leave the rest for later (or
598 * Adding another Service to NSS:: What is to do to add a new service.
599 * NSS Module Function Internals:: Guidelines for writing new NSS
603 @node Adding another Service to NSS, NSS Module Function Internals, Extending NSS, Extending NSS
604 @subsection Adding another Service to NSS
606 The sources for a new service need not (and should not) be part of @theglibc{}
607 itself. The developer retains complete control over the
608 sources and its development. The links between the C library and the
609 new service module consists solely of the interface functions.
611 Each module is designed following a specific interface specification.
612 For now the version is 2 (the interface in version 1 was not adequate)
613 and this manifests in the version number of the shared library object of
614 the NSS modules: they have the extension @code{.2}. If the interface
615 changes again in an incompatible way, this number will be increased.
616 Modules using the old interface will still be usable.
618 Developers of a new service will have to make sure that their module is
619 created using the correct interface number. This means the file itself
620 must have the correct name and on ELF systems the @dfn{soname} (Shared
621 Object Name) must also have this number. Building a module from a bunch
622 of object files on an ELF system using GNU CC could be done like this:
625 gcc -shared -o libnss_NAME.so.2 -Wl,-soname,libnss_NAME.so.2 OBJECTS
629 @ref{Link Options, Options for Linking, , gcc, GNU CC}, to learn
630 more about this command line.
632 To use the new module the library must be able to find it. This can be
633 achieved by using options for the dynamic linker so that it will search
634 the directory where the binary is placed. For an ELF system this could be
635 done by adding the wanted directory to the value of
636 @code{LD_LIBRARY_PATH}.
638 But this is not always possible since some programs (those which run
639 under IDs which do not belong to the user) ignore this variable.
640 Therefore the stable version of the module should be placed into a
641 directory which is searched by the dynamic linker. Normally this should
642 be the directory @file{$prefix/lib}, where @file{$prefix} corresponds to
643 the value given to configure using the @code{--prefix} option. But be
644 careful: this should only be done if it is clear the module does not
645 cause any harm. System administrators should be careful.
648 @node NSS Module Function Internals, , Adding another Service to NSS, Extending NSS
649 @subsection Internals of the NSS Module Functions
651 Until now we only provided the syntactic interface for the functions in
652 the NSS module. In fact there is not much more we can say since the
653 implementation obviously is different for each function. But a few
654 general rules must be followed by all functions.
656 In fact there are four kinds of different functions which may appear in
657 the interface. All derive from the traditional ones for system databases.
658 @var{db} in the following table is normally an abbreviation for the
659 database (e.g., it is @code{pw} for the user database).
662 @item enum nss_status _nss_@var{database}_set@var{db}ent (void)
663 This function prepares the service for following operations. For a
664 simple file based lookup this means files could be opened, for other
665 services this function simply is a noop.
667 One special case for this function is that it takes an additional
668 argument for some @var{database}s (i.e., the interface is
669 @code{int set@var{db}ent (int)}). @ref{Host Names}, which describes the
670 @code{sethostent} function.
672 The return value should be @var{NSS_STATUS_SUCCESS} or according to the
673 table above in case of an error (@pxref{NSS Modules Interface}).
675 @item enum nss_status _nss_@var{database}_end@var{db}ent (void)
676 This function simply closes all files which are still open or removes
677 buffer caches. If there are no files or buffers to remove this is again
680 There normally is no return value other than @var{NSS_STATUS_SUCCESS}.
682 @item enum nss_status _nss_@var{database}_get@var{db}ent_r (@var{STRUCTURE} *result, char *buffer, size_t buflen, int *errnop)
683 Since this function will be called several times in a row to retrieve
684 one entry after the other it must keep some kind of state. But this
685 also means the functions are not really reentrant. They are reentrant
686 only in that simultaneous calls to this function will not try to
687 write the retrieved data in the same place (as it would be the case for
688 the non-reentrant functions); instead, it writes to the structure
689 pointed to by the @var{result} parameter. But the calls share a common
690 state and in the case of a file access this means they return neighboring
693 The buffer of length @var{buflen} pointed to by @var{buffer} can be used
694 for storing some additional data for the result. It is @emph{not}
695 guaranteed that the same buffer will be passed for the next call of this
696 function. Therefore one must not misuse this buffer to save some state
697 information from one call to another.
699 Before the function returns with a failure code, the implementation
700 should store the value of the local @code{errno} variable in the variable
701 pointed to be @var{errnop}. This is important to guarantee the module
702 working in statically linked programs. The stored value must not be
705 As explained above this function could also have an additional last
706 argument. This depends on the database used; it happens only for
707 @code{host} and @code{networks}.
709 The function shall return @code{NSS_STATUS_SUCCESS} as long as there are
710 more entries. When the last entry was read it should return
711 @code{NSS_STATUS_NOTFOUND}. When the buffer given as an argument is too
712 small for the data to be returned @code{NSS_STATUS_TRYAGAIN} should be
713 returned. When the service was not formerly initialized by a call to
714 @code{_nss_@var{DATABASE}_set@var{db}ent} all return values allowed for
715 this function can also be returned here.
717 @item enum nss_status _nss_@var{DATABASE}_get@var{db}by@var{XX}_r (@var{PARAMS}, @var{STRUCTURE} *result, char *buffer, size_t buflen, int *errnop)
718 This function shall return the entry from the database which is
719 addressed by the @var{PARAMS}. The type and number of these arguments
720 vary. It must be individually determined by looking to the user-level
721 interface functions. All arguments given to the non-reentrant version
722 are here described by @var{PARAMS}.
724 The result must be stored in the structure pointed to by @var{result}.
725 If there are additional data to return (say strings, where the
726 @var{result} structure only contains pointers) the function must use the
727 @var{buffer} of length @var{buflen}. There must not be any references
728 to non-constant global data.
730 The implementation of this function should honor the @var{stayopen}
731 flag set by the @code{set@var{DB}ent} function whenever this makes sense.
733 Before the function returns, the implementation should store the value of
734 the local @code{errno} variable in the variable pointed to by
735 @var{errnop}. This is important to guarantee the module works in
736 statically linked programs.
738 Again, this function takes an additional last argument for the
739 @code{host} and @code{networks} database.
741 The return value should as always follow the rules given above
742 (@pxref{NSS Modules Interface}).