10 .. sectionauthor:: Vladimir Marangozov <Vladimir.Marangozov@inrialpes.fr>
19 Memory management in Python involves a private heap containing all Python
20 objects and data structures. The management of this private heap is ensured
21 internally by the *Python memory manager*. The Python memory manager has
22 different components which deal with various dynamic storage management aspects,
23 like sharing, segmentation, preallocation or caching.
25 At the lowest level, a raw memory allocator ensures that there is enough room in
26 the private heap for storing all Python-related data by interacting with the
27 memory manager of the operating system. On top of the raw memory allocator,
28 several object-specific allocators operate on the same heap and implement
29 distinct memory management policies adapted to the peculiarities of every object
30 type. For example, integer objects are managed differently within the heap than
31 strings, tuples or dictionaries because integers imply different storage
32 requirements and speed/space tradeoffs. The Python memory manager thus delegates
33 some of the work to the object-specific allocators, but ensures that the latter
34 operate within the bounds of the private heap.
36 It is important to understand that the management of the Python heap is
37 performed by the interpreter itself and that the user has no control over it,
38 even if she regularly manipulates object pointers to memory blocks inside that
39 heap. The allocation of heap space for Python objects and other internal
40 buffers is performed on demand by the Python memory manager through the Python/C
41 API functions listed in this document.
49 To avoid memory corruption, extension writers should never try to operate on
50 Python objects with the functions exported by the C library: :cfunc:`malloc`,
51 :cfunc:`calloc`, :cfunc:`realloc` and :cfunc:`free`. This will result in mixed
52 calls between the C allocator and the Python memory manager with fatal
53 consequences, because they implement different algorithms and operate on
54 different heaps. However, one may safely allocate and release memory blocks
55 with the C library allocator for individual purposes, as shown in the following
59 char *buf = (char *) malloc(BUFSIZ); /* for I/O */
62 return PyErr_NoMemory();
63 ...Do some I/O operation involving buf...
64 res = PyString_FromString(buf);
65 free(buf); /* malloc'ed */
68 In this example, the memory request for the I/O buffer is handled by the C
69 library allocator. The Python memory manager is involved only in the allocation
70 of the string object returned as a result.
72 In most situations, however, it is recommended to allocate memory from the
73 Python heap specifically because the latter is under control of the Python
74 memory manager. For example, this is required when the interpreter is extended
75 with new object types written in C. Another reason for using the Python heap is
76 the desire to *inform* the Python memory manager about the memory needs of the
77 extension module. Even when the requested memory is used exclusively for
78 internal, highly-specific purposes, delegating all memory requests to the Python
79 memory manager causes the interpreter to have a more accurate image of its
80 memory footprint as a whole. Consequently, under certain circumstances, the
81 Python memory manager may or may not trigger appropriate actions, like garbage
82 collection, memory compaction or other preventive procedures. Note that by using
83 the C library allocator as shown in the previous example, the allocated memory
84 for the I/O buffer escapes completely the Python memory manager.
92 The following function sets, modeled after the ANSI C standard, but specifying
93 behavior when requesting zero bytes, are available for allocating and releasing
94 memory from the Python heap:
97 .. cfunction:: void* PyMem_Malloc(size_t n)
99 Allocates *n* bytes and returns a pointer of type :ctype:`void\*` to the
100 allocated memory, or *NULL* if the request fails. Requesting zero bytes returns
101 a distinct non-*NULL* pointer if possible, as if :cfunc:`PyMem_Malloc(1)` had
102 been called instead. The memory will not have been initialized in any way.
105 .. cfunction:: void* PyMem_Realloc(void *p, size_t n)
107 Resizes the memory block pointed to by *p* to *n* bytes. The contents will be
108 unchanged to the minimum of the old and the new sizes. If *p* is *NULL*, the
109 call is equivalent to :cfunc:`PyMem_Malloc(n)`; else if *n* is equal to zero,
110 the memory block is resized but is not freed, and the returned pointer is
111 non-*NULL*. Unless *p* is *NULL*, it must have been returned by a previous call
112 to :cfunc:`PyMem_Malloc` or :cfunc:`PyMem_Realloc`. If the request fails,
113 :cfunc:`PyMem_Realloc` returns *NULL* and *p* remains a valid pointer to the
114 previous memory area.
117 .. cfunction:: void PyMem_Free(void *p)
119 Frees the memory block pointed to by *p*, which must have been returned by a
120 previous call to :cfunc:`PyMem_Malloc` or :cfunc:`PyMem_Realloc`. Otherwise, or
121 if :cfunc:`PyMem_Free(p)` has been called before, undefined behavior occurs. If
122 *p* is *NULL*, no operation is performed.
124 The following type-oriented macros are provided for convenience. Note that
125 *TYPE* refers to any C type.
128 .. cfunction:: TYPE* PyMem_New(TYPE, size_t n)
130 Same as :cfunc:`PyMem_Malloc`, but allocates ``(n * sizeof(TYPE))`` bytes of
131 memory. Returns a pointer cast to :ctype:`TYPE\*`. The memory will not have
132 been initialized in any way.
135 .. cfunction:: TYPE* PyMem_Resize(void *p, TYPE, size_t n)
137 Same as :cfunc:`PyMem_Realloc`, but the memory block is resized to ``(n *
138 sizeof(TYPE))`` bytes. Returns a pointer cast to :ctype:`TYPE\*`. On return,
139 *p* will be a pointer to the new memory area, or *NULL* in the event of failure.
142 .. cfunction:: void PyMem_Del(void *p)
144 Same as :cfunc:`PyMem_Free`.
146 In addition, the following macro sets are provided for calling the Python memory
147 allocator directly, without involving the C API functions listed above. However,
148 note that their use does not preserve binary compatibility across Python
149 versions and is therefore deprecated in extension modules.
151 :cfunc:`PyMem_MALLOC`, :cfunc:`PyMem_REALLOC`, :cfunc:`PyMem_FREE`.
153 :cfunc:`PyMem_NEW`, :cfunc:`PyMem_RESIZE`, :cfunc:`PyMem_DEL`.
161 Here is the example from section :ref:`memoryoverview`, rewritten so that the
162 I/O buffer is allocated from the Python heap by using the first function set::
165 char *buf = (char *) PyMem_Malloc(BUFSIZ); /* for I/O */
168 return PyErr_NoMemory();
169 /* ...Do some I/O operation involving buf... */
170 res = PyString_FromString(buf);
171 PyMem_Free(buf); /* allocated with PyMem_Malloc */
174 The same code using the type-oriented function set::
177 char *buf = PyMem_New(char, BUFSIZ); /* for I/O */
180 return PyErr_NoMemory();
181 /* ...Do some I/O operation involving buf... */
182 res = PyString_FromString(buf);
183 PyMem_Del(buf); /* allocated with PyMem_New */
186 Note that in the two examples above, the buffer is always manipulated via
187 functions belonging to the same set. Indeed, it is required to use the same
188 memory API family for a given memory block, so that the risk of mixing different
189 allocators is reduced to a minimum. The following code sequence contains two
190 errors, one of which is labeled as *fatal* because it mixes two different
191 allocators operating on different heaps. ::
193 char *buf1 = PyMem_New(char, BUFSIZ);
194 char *buf2 = (char *) malloc(BUFSIZ);
195 char *buf3 = (char *) PyMem_Malloc(BUFSIZ);
197 PyMem_Del(buf3); /* Wrong -- should be PyMem_Free() */
198 free(buf2); /* Right -- allocated via malloc() */
199 free(buf1); /* Fatal -- should be PyMem_Del() */
201 In addition to the functions aimed at handling raw memory blocks from the Python
202 heap, objects in Python are allocated and released with :cfunc:`PyObject_New`,
203 :cfunc:`PyObject_NewVar` and :cfunc:`PyObject_Del`.
205 These will be explained in the next chapter on defining and implementing new