2008-05-07 Kai Tietz <kai,tietz@onevision.com>
[official-gcc.git] / gcc / ada / exp_dbug.ads
blob9a9d5377d838b70c0cb0e834b3a76956f8c666c3
1 ------------------------------------------------------------------------------
2 -- --
3 -- GNAT COMPILER COMPONENTS --
4 -- --
5 -- E X P _ D B U G --
6 -- --
7 -- S p e c --
8 -- --
9 -- Copyright (C) 1996-2007, Free Software Foundation, Inc. --
10 -- --
11 -- GNAT is free software; you can redistribute it and/or modify it under --
12 -- terms of the GNU General Public License as published by the Free Soft- --
13 -- ware Foundation; either version 3, or (at your option) any later ver- --
14 -- sion. GNAT is distributed in the hope that it will be useful, but WITH- --
15 -- OUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY --
16 -- or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License --
17 -- for more details. You should have received a copy of the GNU General --
18 -- Public License distributed with GNAT; see file COPYING3. If not, go to --
19 -- http://www.gnu.org/licenses for a complete copy of the license. --
20 -- --
21 -- GNAT was originally developed by the GNAT team at New York University. --
22 -- Extensive contributions were provided by Ada Core Technologies Inc. --
23 -- --
24 ------------------------------------------------------------------------------
26 -- Expand routines for generation of special declarations used by the
27 -- debugger. In accordance with the Dwarf 2.2 specification, certain
28 -- type names are encoded to provide information to the debugger.
30 with Namet; use Namet;
31 with Types; use Types;
32 with Uintp; use Uintp;
34 package Exp_Dbug is
36 -----------------------------------------------------
37 -- Encoding and Qualification of Names of Entities --
38 -----------------------------------------------------
40 -- This section describes how the names of entities are encoded in
41 -- the generated debugging information.
43 -- An entity in Ada has a name of the form X.Y.Z ... E where X,Y,Z
44 -- are the enclosing scopes (not including Standard at the start).
46 -- The encoding of the name follows this basic qualified naming scheme,
47 -- where the encoding of individual entity names is as described in Namet
48 -- (i.e. in particular names present in the original source are folded to
49 -- all lower case, with upper half and wide characters encoded as described
50 -- in Namet). Upper case letters are used only for entities generated by
51 -- the compiler.
53 -- There are two cases, global entities, and local entities. In more formal
54 -- terms, local entities are those which have a dynamic enclosing scope,
55 -- and global entities are at the library level, except that we always
56 -- consider procedures to be global entities, even if they are nested
57 -- (that's because at the debugger level a procedure name refers to the
58 -- code, and the code is indeed a global entity, including the case of
59 -- nested procedures.) In addition, we also consider all types to be global
60 -- entities, even if they are defined within a procedure.
62 -- The reason for treating all type names as global entities is that a
63 -- number of our type encodings work by having related type names, and we
64 -- need the full qualification to keep this unique.
66 -- For global entities, the encoded name includes all components of the
67 -- fully expanded name (but omitting Standard at the start). For example,
68 -- if a library level child package P.Q has an embedded package R, and
69 -- there is an entity in this embedded package whose name is S, the encoded
70 -- name will include the components p.q.r.s.
72 -- For local entities, the encoded name only includes the components up to
73 -- the enclosing dynamic scope (other than a block). At run time, such a
74 -- dynamic scope is a subprogram, and the debugging formats know about
75 -- local variables of procedures, so it is not necessary to have full
76 -- qualification for such entities. In particular this means that direct
77 -- local variables of a procedure are not qualified.
79 -- As an example of the local name convention, consider a procedure V.W
80 -- with a local variable X, and a nested block Y containing an entity Z.
81 -- The fully qualified names of the entities X and Z are:
83 -- V.W.X
84 -- V.W.Y.Z
86 -- but since V.W is a subprogram, the encoded names will end up
87 -- encoding only
89 -- x
90 -- y.z
92 -- The separating dots are translated into double underscores
94 -----------------------------
95 -- Handling of Overloading --
96 -----------------------------
98 -- The above scheme is incomplete for overloaded subprograms, since
99 -- overloading can legitimately result in case of two entities with
100 -- exactly the same fully qualified names. To distinguish between
101 -- entries in a set of overloaded subprograms, the encoded names are
102 -- serialized by adding the suffix:
104 -- __nn (two underscores)
106 -- where nn is a serial number (2 for the second overloaded function,
107 -- 3 for the third, etc.). A suffix of __1 is always omitted (i.e. no
108 -- suffix implies the first instance).
110 -- These names are prefixed by the normal full qualification. So for
111 -- example, the third instance of the subprogram qrs in package yz
112 -- would have the name:
114 -- yz__qrs__3
116 -- A more subtle case arises with entities declared within overloaded
117 -- subprograms. If we have two overloaded subprograms, and both declare
118 -- an entity xyz, then the fully expanded name of the two xyz's is the
119 -- same. To distinguish these, we add the same __n suffix at the end of
120 -- the inner entity names.
122 -- In more complex cases, we can have multiple levels of overloading,
123 -- and we must make sure to distinguish which final declarative region
124 -- we are talking about. For this purpose, we use a more complex suffix
125 -- which has the form:
127 -- __nn_nn_nn ...
129 -- where the nn values are the homonym numbers as needed for any of the
130 -- qualifying entities, separated by a single underscore. If all the nn
131 -- values are 1, the suffix is omitted, Otherwise the suffix is present
132 -- (including any values of 1). The following example shows how this
133 -- suffixing works.
135 -- package body Yz is
136 -- procedure Qrs is -- Name is yz__qrs
137 -- procedure Tuv is ... end; -- Name is yz__qrs__tuv
138 -- begin ... end Qrs;
140 -- procedure Qrs (X: Int) is -- Name is yz__qrs__2
141 -- procedure Tuv is ... end; -- Name is yz__qrs__tuv__2_1
142 -- procedure Tuv (X: Int) is -- Name is yz__qrs__tuv__2_2
143 -- begin ... end Tuv;
145 -- procedure Tuv (X: Float) is -- Name is yz__qrs__tuv__2_3
146 -- type m is new float; -- Name is yz__qrs__tuv__m__2_3
147 -- begin ... end Tuv;
148 -- begin ... end Qrs;
149 -- end Yz;
151 --------------------
152 -- Operator Names --
153 --------------------
155 -- The above rules applied to operator names would result in names with
156 -- quotation marks, which are not typically allowed by assemblers and
157 -- linkers, and even if allowed would be odd and hard to deal with. To
158 -- avoid this problem, operator names are encoded as follows:
160 -- Oabs abs
161 -- Oand and
162 -- Omod mod
163 -- Onot not
164 -- Oor or
165 -- Orem rem
166 -- Oxor xor
167 -- Oeq =
168 -- One /=
169 -- Olt <
170 -- Ole <=
171 -- Ogt >
172 -- Oge >=
173 -- Oadd +
174 -- Osubtract -
175 -- Oconcat &
176 -- Omultiply *
177 -- Odivide /
178 -- Oexpon **
180 -- These names are prefixed by the normal full qualification, and
181 -- suffixed by the overloading identification. So for example, the
182 -- second operator "=" defined in package Extra.Messages would have
183 -- the name:
185 -- extra__messages__Oeq__2
187 ----------------------------------
188 -- Resolving Other Name Clashes --
189 ----------------------------------
191 -- It might be thought that the above scheme is complete, but in Ada 95,
192 -- full qualification is insufficient to uniquely identify an entity in
193 -- the program, even if it is not an overloaded subprogram. There are
194 -- two possible confusions:
196 -- a.b
198 -- interpretation 1: entity b in body of package a
199 -- interpretation 2: child procedure b of package a
201 -- a.b.c
203 -- interpretation 1: entity c in child package a.b
204 -- interpretation 2: entity c in nested package b in body of a
206 -- It is perfectly legal in both cases for both interpretations to be
207 -- valid within a single program. This is a bit of a surprise since
208 -- certainly in Ada 83, full qualification was sufficient, but not in
209 -- Ada 95. The result is that the above scheme can result in duplicate
210 -- names. This would not be so bad if the effect were just restricted
211 -- to debugging information, but in fact in both the above cases, it
212 -- is possible for both symbols to be external names, and so we have
213 -- a real problem of name clashes.
215 -- To deal with this situation, we provide two additional encoding
216 -- rules for names:
218 -- First: all library subprogram names are preceded by the string
219 -- _ada_ (which causes no duplications, since normal Ada names can
220 -- never start with an underscore. This not only solves the first
221 -- case of duplication, but also solves another pragmatic problem
222 -- which is that otherwise Ada procedures can generate names that
223 -- clash with existing system function names. Most notably, we can
224 -- have clashes in the case of procedure Main with the C main that
225 -- in some systems is always present.
227 -- Second, for the case where nested packages declared in package
228 -- bodies can cause trouble, we add a suffix which shows which
229 -- entities in the list are body-nested packages, i.e. packages
230 -- whose spec is within a package body. The rules are as follows,
231 -- given a list of names in a qualified name name1.name2....
233 -- If none are body-nested package entities, then there is no suffix
235 -- If at least one is a body-nested package entity, then the suffix
236 -- is X followed by a string of b's and n's (b = body-nested package
237 -- entity, n = not a body-nested package).
239 -- There is one element in this string for each entity in the encoded
240 -- expanded name except the first (the rules are such that the first
241 -- entity of the encoded expanded name can never be a body-nested'
242 -- package. Trailing n's are omitted, as is the last b (there must
243 -- be at least one b, or we would not be generating a suffix at all).
245 -- For example, suppose we have
247 -- package x is
248 -- pragma Elaborate_Body;
249 -- m1 : integer; -- #1
250 -- end x;
252 -- package body x is
253 -- package y is m2 : integer; end y; -- #2
254 -- package body y is
255 -- package z is r : integer; end z; -- #3
256 -- end;
257 -- m3 : integer; -- #4
258 -- end x;
260 -- package x.y is
261 -- pragma Elaborate_Body;
262 -- m2 : integer; -- #5
263 -- end x.y;
265 -- package body x.y is
266 -- m3 : integer; -- #6
267 -- procedure j is -- #7
268 -- package k is
269 -- z : integer; -- #8
270 -- end k;
271 -- begin
272 -- null;
273 -- end j;
274 -- end x.y;
276 -- procedure x.m3 is begin null; end; -- #9
278 -- Then the encodings would be:
280 -- #1. x__m1 (no BNPE's in sight)
281 -- #2. x__y__m2X (y is a BNPE)
282 -- #3. x__y__z__rXb (y is a BNPE, so is z)
283 -- #4. x__m3 (no BNPE's in sight)
284 -- #5. x__y__m2 (no BNPE's in sight)
285 -- #6. x__y__m3 (no BNPE's in signt)
286 -- #7. x__y__j (no BNPE's in sight)
287 -- #8. k__z (no BNPE's, only up to procedure)
288 -- #9 _ada_x__m3 (library level subprogram)
290 -- Note that we have instances here of both kind of potential name
291 -- clashes, and the above examples show how the encodings avoid the
292 -- clash as follows:
294 -- Lines #4 and #9 both refer to the entity x.m3, but #9 is a library
295 -- level subprogram, so it is preceded by the string _ada_ which acts
296 -- to distinguish it from the package body entity.
298 -- Lines #2 and #5 both refer to the entity x.y.m2, but the first
299 -- instance is inside the body-nested package y, so there is an X
300 -- suffix to distinguish it from the child library entity.
302 -- Note that enumeration literals never need Xb type suffixes, since
303 -- they are never referenced using global external names.
305 ---------------------
306 -- Interface Names --
307 ---------------------
309 -- Note: if an interface name is present, then the external name
310 -- is taken from the specified interface name. Given the current
311 -- limitations of the gcc backend, this means that the debugging
312 -- name is also set to the interface name, but conceptually, it
313 -- would be possible (and indeed desirable) to have the debugging
314 -- information still use the Ada name as qualified above, so we
315 -- still fully qualify the name in the front end.
317 -------------------------------------
318 -- Encodings Related to Task Types --
319 -------------------------------------
321 -- Each task object defined by a single task declaration is associated
322 -- with a prefix that is used to qualify procedures defined in that
323 -- task. Given
325 -- package body P is
326 -- task body TaskObj is
327 -- procedure F1 is ... end;
328 -- begin
329 -- B;
330 -- end TaskObj;
331 -- end P;
333 -- The name of subprogram TaskObj.F1 is encoded as p__taskobjTK__f1,
334 -- The body, B, is contained in a subprogram whose name is
335 -- p__taskobjTKB.
337 ------------------------------------------
338 -- Encodings Related to Protected Types --
339 ------------------------------------------
341 -- Each protected type has an associated record type, that describes
342 -- the actual layout of the private data. In addition to the private
343 -- components of the type, the Corresponding_Record_Type includes one
344 -- component of type Protection, which is the actual lock structure.
345 -- The run-time size of the protected type is the size of the corres-
346 -- ponding record.
348 -- For a protected type prot, the Corresponding_Record_Type is encoded
349 -- as protV.
351 -- The operations of a protected type are encoded as follows: each
352 -- operation results in two subprograms, a locking one that is called
353 -- from outside of the object, and a non-locking one that is used for
354 -- calls from other operations on the same object. The locking operation
355 -- simply acquires the lock, and then calls the non-locking version.
356 -- The names of all of these have a prefix constructed from the name of
357 -- the type, and a suffix which is P or N, depending on whether this is
358 -- the protected/non-locking version of the operation.
360 -- Operations generated for protected entries follow the same encoding.
361 -- Each entry results in two suprograms: a procedure that holds the
362 -- entry body, and a function that holds the evaluation of the barrier.
363 -- The names of these subprograms include the prefix '_E' or '_B' res-
364 -- pectively. The names also include a numeric suffix to render them
365 -- unique in the presence of overloaded entries.
367 -- Given the declaration:
369 -- protected type Lock is
370 -- function Get return Integer;
371 -- procedure Set (X: Integer);
372 -- entry Update (Val : Integer);
373 -- private
374 -- Value : Integer := 0;
375 -- end Lock;
377 -- the following operations are created:
379 -- lock_getN
380 -- lock_getP,
382 -- lock_setN
383 -- lock_setP
385 -- lock_update_E1s
386 -- lock_udpate_B2s
388 -- If the protected type implements at least one interface, the
389 -- following additional operations are created:
391 -- lock_get
393 -- lock_set
395 -- These operations are used to ensure overriding of interface level
396 -- subprograms and proper dispatching on interface class-wide objects.
397 -- The bodies of these operations contain calls to their respective
398 -- protected versions:
400 -- function lock_get return Integer is
401 -- begin
402 -- return lock_getP;
403 -- end lock_get;
405 -- procedure lock_set (X : Integer) is
406 -- begin
407 -- lock_setP (X);
408 -- end lock_set;
410 ----------------------------------------------------
411 -- Conversion between Entities and External Names --
412 ----------------------------------------------------
414 No_Dollar_In_Label : constant Boolean := True;
415 -- True iff the target does not allow dollar signs ("$") in external names
416 -- ??? We want to migrate all platforms to use the same convention.
417 -- As a first step, we force this constant to always be True. This
418 -- constant will eventually be deleted after we have verified that
419 -- the migration does not cause any unforseen adverse impact.
420 -- We chose "__" because it is supported on all platforms, which is
421 -- not the case of "$".
423 procedure Get_External_Name
424 (Entity : Entity_Id;
425 Has_Suffix : Boolean);
426 -- Set Name_Buffer and Name_Len to the external name of entity E.
427 -- The external name is the Interface_Name, if specified, unless
428 -- the entity has an address clause or a suffix.
430 -- If the Interface is not present, or not used, the external name
431 -- is the concatenation of:
433 -- - the string "_ada_", if the entity is a library subprogram,
434 -- - the names of any enclosing scopes, each followed by "__",
435 -- or "X_" if the next entity is a subunit)
436 -- - the name of the entity
437 -- - the string "$" (or "__" if target does not allow "$"), followed
438 -- by homonym suffix, if the entity is an overloaded subprogram
439 -- or is defined within an overloaded subprogram.
441 procedure Get_External_Name_With_Suffix
442 (Entity : Entity_Id;
443 Suffix : String);
444 -- Set Name_Buffer and Name_Len to the external name of entity E.
445 -- If Suffix is the empty string the external name is as above,
446 -- otherwise the external name is the concatenation of:
448 -- - the string "_ada_", if the entity is a library subprogram,
449 -- - the names of any enclosing scopes, each followed by "__",
450 -- or "X_" if the next entity is a subunit)
451 -- - the name of the entity
452 -- - the string "$" (or "__" if target does not allow "$"), followed
453 -- by homonym suffix, if the entity is an overloaded subprogram
454 -- or is defined within an overloaded subprogram.
455 -- - the string "___" followed by Suffix
457 -- Note that a call to this procedure has no effect if we are not
458 -- generating code, since the necessary information for computing the
459 -- proper encoded name is not available in this case.
461 --------------------------------------------
462 -- Subprograms for Handling Qualification --
463 --------------------------------------------
465 procedure Qualify_Entity_Names (N : Node_Id);
466 -- Given a node N, that represents a block, subprogram body, or package
467 -- body or spec, or protected or task type, sets a fully qualified name
468 -- for the defining entity of given construct, and also sets fully
469 -- qualified names for all enclosed entities of the construct (using
470 -- First_Entity/Next_Entity). Note that the actual modifications of the
471 -- names is postponed till a subsequent call to Qualify_All_Entity_Names.
472 -- Note: this routine does not deal with prepending _ada_ to library
473 -- subprogram names. The reason for this is that we only prepend _ada_
474 -- to the library entity itself, and not to names built from this name.
476 procedure Qualify_All_Entity_Names;
477 -- When Qualify_Entity_Names is called, no actual name changes are made,
478 -- i.e. the actual calls to Qualify_Entity_Name are deferred until a call
479 -- is made to this procedure. The reason for this deferral is that when
480 -- names are changed semantic processing may be affected. By deferring
481 -- the changes till just before gigi is called, we avoid any concerns
482 -- about such effects. Gigi itself does not use the names except for
483 -- output of names for debugging purposes (which is why we are doing
484 -- the name changes in the first place.
486 -- Note: the routines Get_Unqualified_[Decoded]_Name_String in Namet
487 -- are useful to remove qualification from a name qualified by the
488 -- call to Qualify_All_Entity_Names.
490 --------------------------------
491 -- Handling of Numeric Values --
492 --------------------------------
494 -- All numeric values here are encoded as strings of decimal digits.
495 -- Only integer values need to be encoded. A negative value is encoded
496 -- as the corresponding positive value followed by a lower case m for
497 -- minus to indicate that the value is negative (e.g. 2m for -2).
499 -------------------------
500 -- Type Name Encodings --
501 -------------------------
503 -- In the following typ is the name of the type as normally encoded by
504 -- the debugger rules, i.e. a non-qualified name, all in lower case,
505 -- with standard encoding of upper half and wide characters
507 ------------------------
508 -- Encapsulated Types --
509 ------------------------
511 -- In some cases, the compiler encapsulates a type by wrapping it in
512 -- a structure. For example, this is used when a size or alignment
513 -- specification requires a larger type. Consider:
515 -- type y is mod 2 ** 64;
516 -- for y'size use 256;
518 -- In this case the compile generates a structure type y___PAD, which
519 -- has a single field whose name is F. This single field is 64 bits
520 -- long and contains the actual value. This kind of padding is used
521 -- when the logical value to be stored is shorter than the object in
522 -- which it is allocated. For example if a size clause is used to set
523 -- a size of 256 for a signed integer value, then a typical choice is
524 -- to wrap a 64-bit integer in a 256 bit PAD structure.
526 -- A similar encapsulation is done for some packed array types,
527 -- in which case the structure type is y___JM and the field name
528 -- is OBJECT. This is used in the case of a packed array stored
529 -- in modular representation (see section on representation of
530 -- packed array objects). In this case the JM wrapping is used to
531 -- achieve correct positioning of the packed array value (left or
532 -- right justified in its field depending on endianness.
534 -- When the debugger sees an object of a type whose name has a
535 -- suffix of ___PAD or ___JM, the type will be a record containing
536 -- a single field, and the name of that field will be all upper case.
537 -- In this case, it should look inside to get the value of the inner
538 -- field, and neither the outer structure name, nor the field name
539 -- should appear when the value is printed.
541 -- When the debugger sees a record named REP being a field inside
542 -- another record, it should treat the fields inside REP as being
543 -- part of the outer record (this REP field is only present for
544 -- code generation purposes). The REP record should not appear in
545 -- the values printed by the debugger.
547 -----------------------
548 -- Fixed-Point Types --
549 -----------------------
551 -- Fixed-point types are encoded using a suffix that indicates the
552 -- delta and small values. The actual type itself is a normal
553 -- integer type.
555 -- typ___XF_nn_dd
556 -- typ___XF_nn_dd_nn_dd
558 -- The first form is used when small = delta. The value of delta (and
559 -- small) is given by the rational nn/dd, where nn and dd are decimal
560 -- integers.
562 -- The second form is used if the small value is different from the
563 -- delta. In this case, the first nn/dd rational value is for delta,
564 -- and the second value is for small.
566 ------------------------------
567 -- VAX Floating-Point Types --
568 ------------------------------
570 -- Vax floating-point types are represented at run time as integer
571 -- types, which are treated specially by the code generator. Their
572 -- type names are encoded with the following suffix:
574 -- typ___XFF
575 -- typ___XFD
576 -- typ___XFG
578 -- representing the Vax F Float, D Float, and G Float types. The
579 -- debugger must treat these specially. In particular, printing
580 -- these values can be achieved using the debug procedures that
581 -- are provided in package System.Vax_Float_Operations:
583 -- procedure Debug_Output_D (Arg : D);
584 -- procedure Debug_Output_F (Arg : F);
585 -- procedure Debug_Output_G (Arg : G);
587 -- These three procedures take a Vax floating-point argument, and
588 -- output a corresponding decimal representation to standard output
589 -- with no terminating line return.
591 --------------------
592 -- Discrete Types --
593 --------------------
595 -- Discrete types are coded with a suffix indicating the range in
596 -- the case where one or both of the bounds are discriminants or
597 -- variable.
599 -- Note: at the current time, we also encode compile time known
600 -- bounds if they do not match the natural machine type bounds,
601 -- but this may be removed in the future, since it is redundant
602 -- for most debugging formats. However, we do not ever need XD
603 -- encoding for enumeration base types, since here it is always
604 -- clear what the bounds are from the total number of enumeration
605 -- literals.
607 -- typ___XD
608 -- typ___XDL_lowerbound
609 -- typ___XDU_upperbound
610 -- typ___XDLU_lowerbound__upperbound
612 -- If a discrete type is a natural machine type (i.e. its bounds
613 -- correspond in a natural manner to its size), then it is left
614 -- unencoded. The above encoding forms are used when there is a
615 -- constrained range that does not correspond to the size or that
616 -- has discriminant references or other compile time known bounds.
618 -- The first form is used if both bounds are dynamic, in which case
619 -- two constant objects are present whose names are typ___L and
620 -- typ___U in the same scope as typ, and the values of these constants
621 -- indicate the bounds. As far as the debugger is concerned, these
622 -- are simply variables that can be accessed like any other variables.
623 -- In the enumeration case, these values correspond to the Enum_Rep
624 -- values for the lower and upper bounds.
626 -- The second form is used if the upper bound is dynamic, but the
627 -- lower bound is either constant or depends on a discriminant of
628 -- the record with which the type is associated. The upper bound
629 -- is stored in a constant object of name typ___U as previously
630 -- described, but the lower bound is encoded directly into the
631 -- name as either a decimal integer, or as the discriminant name.
633 -- The third form is similarly used if the lower bound is dynamic,
634 -- but the upper bound is compile time known or a discriminant
635 -- reference, in which case the lower bound is stored in a constant
636 -- object of name typ___L, and the upper bound is encoded directly
637 -- into the name as either a decimal integer, or as the discriminant
638 -- name.
640 -- The fourth form is used if both bounds are discriminant references
641 -- or compile time known values, with the encoding first for the lower
642 -- bound, then for the upper bound, as previously described.
644 -------------------
645 -- Modular Types --
646 -------------------
648 -- A type declared
650 -- type x is mod N;
652 -- Is encoded as a subrange of an unsigned base type with lower bound
653 -- 0 and upper bound N. That is, there is no name encoding. We use
654 -- the standard encodings provided by the debugging format. Thus
655 -- we give these types a non-standard interpretation: the standard
656 -- interpretation of our encoding would not, in general, imply that
657 -- arithmetic on type x was to be performed modulo N (especially not
658 -- when N is not a power of 2).
660 ------------------
661 -- Biased Types --
662 ------------------
664 -- Only discrete types can be biased, and the fact that they are
665 -- biased is indicated by a suffix of the form:
667 -- typ___XB_lowerbound__upperbound
669 -- Here lowerbound and upperbound are decimal integers, with the
670 -- usual (postfix "m") encoding for negative numbers. Biased
671 -- types are only possible where the bounds are compile time
672 -- known, and the values are represented as unsigned offsets
673 -- from the lower bound given. For example:
675 -- type Q is range 10 .. 15;
676 -- for Q'size use 3;
678 -- The size clause will force values of type Q in memory to be
679 -- stored in biased form (e.g. 11 will be represented by the
680 -- bit pattern 001).
682 ----------------------------------------------
683 -- Record Types with Variable-Length Fields --
684 ----------------------------------------------
686 -- The debugging formats do not fully support these types, and indeed
687 -- some formats simply generate no useful information at all for such
688 -- types. In order to provide information for the debugger, gigi creates
689 -- a parallel type in the same scope with one of the names
691 -- type___XVE
692 -- type___XVU
694 -- The former name is used for a record and the latter for the union
695 -- that is made for a variant record (see below) if that record or
696 -- union has a field of variable size or if the record or union itself
697 -- has a variable size. These encodings suffix any other encodings that
698 -- that might be suffixed to the type name.
700 -- The idea here is to provide all the needed information to interpret
701 -- objects of the original type in the form of a "fixed up" type, which
702 -- is representable using the normal debugging information.
704 -- There are three cases to be dealt with. First, some fields may have
705 -- variable positions because they appear after variable-length fields.
706 -- To deal with this, we encode *all* the field bit positions of the
707 -- special ___XV type in a non-standard manner.
709 -- The idea is to encode not the position, but rather information
710 -- that allows computing the position of a field from the position
711 -- of the previous field. The algorithm for computing the actual
712 -- positions of all fields and the length of the record is as
713 -- follows. In this description, let P represent the current
714 -- bit position in the record.
716 -- 1. Initialize P to 0
718 -- 2. For each field in the record:
720 -- 2a. If an alignment is given (see below), then round P
721 -- up, if needed, to the next multiple of that alignment.
723 -- 2b. If a bit position is given, then increment P by that
724 -- amount (that is, treat it as an offset from the end of the
725 -- preceding record).
727 -- 2c. Assign P as the actual position of the field
729 -- 2d. Compute the length, L, of the represented field (see below)
730 -- and compute P'=P+L. Unless the field represents a variant part
731 -- (see below and also Variant Record Encoding), set P to P'.
733 -- The alignment, if present, is encoded in the field name of the
734 -- record, which has a suffix:
736 -- fieldname___XVAnn
738 -- where the nn after the XVA indicates the alignment value in storage
739 -- units. This encoding is present only if an alignment is present.
741 -- The size of the record described by an XVE-encoded type (in bits)
742 -- is generally the maximum value attained by P' in step 2d above,
743 -- rounded up according to the record's alignment.
745 -- Second, the variable-length fields themselves are represented by
746 -- replacing the type by a special access type. The designated type
747 -- of this access type is the original variable-length type, and the
748 -- fact that this field has been transformed in this way is signalled
749 -- by encoding the field name as:
751 -- field___XVL
753 -- where field is the original field name. If a field is both
754 -- variable-length and also needs an alignment encoding, then the
755 -- encodings are combined using:
757 -- field___XVLnn
759 -- Note: the reason that we change the type is so that the resulting
760 -- type has no variable-length fields. At least some of the formats
761 -- used for debugging information simply cannot tolerate variable-
762 -- length fields, so the encoded information would get lost.
764 -- Third, in the case of a variant record, the special union
765 -- that contains the variants is replaced by a normal C union.
766 -- In this case, the positions are all zero.
768 -- Discriminants appear before any variable-length fields that depend
769 -- on them, with one exception. In some cases, a discriminant
770 -- governing the choice of a variant clause may appear in the list
771 -- of fields of an XVE type after the entry for the variant clause
772 -- itself (this can happen in the presence of a representation clause
773 -- for the record type in the source program). However, when this
774 -- happens, the discriminant's position may be determined by first
775 -- applying the rules described in this section, ignoring the variant
776 -- clause. As a result, discriminants can always be located
777 -- independently of the variable-length fields that depend on them.
779 -- The size of the ___XVE or ___XVU record or union is set to the
780 -- alignment (in bytes) of the original object so that the debugger
781 -- can calculate the size of the original type.
783 -- As an example of this encoding, consider the declarations:
785 -- type Q is array (1 .. V1) of Float; -- alignment 4
786 -- type R is array (1 .. V2) of Long_Float; -- alignment 8
788 -- type X is record
789 -- A : Character;
790 -- B : Float;
791 -- C : String (1 .. V3);
792 -- D : Float;
793 -- E : Q;
794 -- F : R;
795 -- G : Float;
796 -- end record;
798 -- The encoded type looks like:
800 -- type anonymousQ is access Q;
801 -- type anonymousR is access R;
803 -- type X___XVE is record
804 -- A : Character; -- position contains 0
805 -- B : Float; -- position contains 24
806 -- C___XVL : access String (1 .. V3); -- position contains 0
807 -- D___XVA4 : Float; -- position contains 0
808 -- E___XVL4 : anonymousQ; -- position contains 0
809 -- F___XVL8 : anonymousR; -- position contains 0
810 -- G : Float; -- position contains 0
811 -- end record;
813 -- Any bit sizes recorded for fields other than dynamic fields and
814 -- variants are honored as for ordinary records.
816 -- Notes:
818 -- 1) The B field could also have been encoded by using a position
819 -- of zero, and an alignment of 4, but in such a case, the coding by
820 -- position is preferred (since it takes up less space). We have used
821 -- the (illegal) notation access xxx as field types in the example
822 -- above.
824 -- 2) The E field does not actually need the alignment indication
825 -- but this may not be detected in this case by the conversion
826 -- routines.
828 -- 3) Our conventions do not cover all XVE-encoded records in which
829 -- some, but not all, fields have representation clauses. Such
830 -- records may, therefore, be displayed incorrectly by debuggers.
831 -- This situation is not common.
833 -----------------------
834 -- Base Record Types --
835 -----------------------
837 -- Under certain circumstances, debuggers need two descriptions of a
838 -- record type, one that gives the actual details of the base type's
839 -- structure (as described elsewhere in these comments) and one that may
840 -- be used to obtain information about the particular subtype and the
841 -- size of the objects being typed. In such cases the compiler will
842 -- substitute type whose name is typically compiler-generated and
843 -- irrelevant except as a key for obtaining the actual type.
845 -- Specifically, if this name is x, then we produce a record type named
846 -- x___XVS consisting of one field. The name of this field is that of
847 -- the actual type being encoded, which we'll call y (the type of this
848 -- single field is arbitrary). Both x and y may have corresponding
849 -- ___XVE types.
851 -- The size of the objects typed as x should be obtained from the
852 -- structure of x (and x___XVE, if applicable) as for ordinary types
853 -- unless there is a variable named x___XVZ, which, if present, will
854 -- hold the size (in bits) of x.
856 -- The type x will either be a subtype of y (see also Subtypes of
857 -- Variant Records, below) or will contain no fields at all. The layout,
858 -- types, and positions of these fields will be accurate, if present.
859 -- (Currently, however, the GDB debugger makes no use of x except to
860 -- determine its size).
862 -- Among other uses, XVS types are sometimes used to encode
863 -- unconstrained types. For example, given
865 -- subtype Int is INTEGER range 0..10;
866 -- type T1 (N: Int := 0) is record
867 -- F1: String (1 .. N);
868 -- end record;
869 -- type AT1 is array (INTEGER range <>) of T1;
871 -- the element type for AT1 might have a type defined as if it had
872 -- been written:
874 -- type at1___C_PAD is record null; end record;
875 -- for at1___C_PAD'Size use 16 * 8;
877 -- and there would also be
879 -- type at1___C_PAD___XVS is record t1: Integer; end record;
880 -- type t1 is ...
882 -- Had the subtype Int been dynamic:
884 -- subtype Int is INTEGER range 0 .. M; -- M a variable
886 -- Then the compiler would also generate a declaration whose effect
887 -- would be
889 -- at1___C_PAD___XVZ: constant Integer := 32 + M * 8 + padding term;
891 -- Not all unconstrained types are so encoded; the XVS convention may be
892 -- unnecessary for unconstrained types of fixed size. However, this
893 -- encoding is always necessary when a subcomponent type (array
894 -- element's type or record field's type) is an unconstrained record
895 -- type some of whose components depend on discriminant values.
897 -----------------
898 -- Array Types --
899 -----------------
901 -- Since there is no way for the debugger to obtain the index subtypes
902 -- for an array type, we produce a type that has the name of the
903 -- array type followed by "___XA" and is a record whose field names
904 -- are the names of the types for the bounds. The types of these
905 -- fields is an integer type which is meaningless.
907 -- To conserve space, we do not produce this type unless one of the
908 -- index types is either an enumeration type, has a variable upper
909 -- bound, has a lower bound different from the constant 1, is a biased
910 -- type, or is wider than "sizetype".
912 -- Given the full encoding of these types (see above description for
913 -- the encoding of discrete types), this means that all necessary
914 -- information for addressing arrays is available. In some debugging
915 -- formats, some or all of the bounds information may be available
916 -- redundantly, particularly in the fixed-point case, but this
917 -- information can in any case be ignored by the debugger.
919 ----------------------------
920 -- Note on Implicit Types --
921 ----------------------------
923 -- The compiler creates implicit type names in many situations where a
924 -- type is present semantically, but no specific name is present. For
925 -- example:
927 -- S : Integer range M .. N;
929 -- Here the subtype of S is not integer, but rather an anonymous subtype
930 -- of Integer. Where possible, the compiler generates names for such
931 -- anonymous types that are related to the type from which the subtype
932 -- is obtained as follows:
934 -- T name suffix
936 -- where name is the name from which the subtype is obtained, using
937 -- lower case letters and underscores, and suffix starts with an upper
938 -- case letter. For example the name for the above declaration might be:
940 -- TintegerS4b
942 -- If the debugger is asked to give the type of an entity and the type
943 -- has the form T name suffix, it is probably appropriate to just use
944 -- "name" in the response since this is what is meaningful to the
945 -- programmer.
947 -------------------------------------------------
948 -- Subprograms for Handling Encoded Type Names --
949 -------------------------------------------------
951 procedure Get_Encoded_Name (E : Entity_Id);
952 -- If the entity is a typename, store the external name of the entity as in
953 -- Get_External_Name, followed by three underscores plus the type encoding
954 -- in Name_Buffer with the length in Name_Len, and an ASCII.NUL character
955 -- stored following the name. Otherwise set Name_Buffer and Name_Len to
956 -- hold the entity name. Note that a call to this procedure has no effect
957 -- if we are not generating code, since the necessary information for
958 -- computing the proper encoded name is not available in this case.
960 --------------
961 -- Renaming --
962 --------------
964 -- Debugging information is generated for exception, object, package,
965 -- and subprogram renaming (generic renamings are not significant, since
966 -- generic templates are not relevant at debugging time).
968 -- Consider a renaming declaration of the form
970 -- x : typ renames y;
972 -- There is one case in which no special debugging information is required,
973 -- namely the case of an object renaming where the back end allocates a
974 -- reference for the renamed variable, and the entity x is this reference.
975 -- The debugger can handle this case without any special processing or
976 -- encoding (it won't know it was a renaming, but that does not matter).
978 -- All other cases of renaming generate a dummy variable for an entity
979 -- whose name is of the form:
981 -- x___XR_... for an object renaming
982 -- x___XRE_... for an exception renaming
983 -- x___XRP_... for a package renaming
985 -- and where the "..." represents a suffix that describes the structure of
986 -- the object name given in the renaming (see details below).
988 -- The name is fully qualified in the usual manner, i.e. qualified in the
989 -- same manner as the entity x would be. In the case of a package renaming
990 -- where x is a child unit, the qualification includes the name of the
991 -- parent unit, to disambiguate child units with the same simple name and
992 -- (of necessity) different parents.
994 -- Note: subprogram renamings are not encoded at the present time
996 -- The suffix of the variable name describing the renamed object is
997 -- defined to use the following encoding:
999 -- For the simple entity case, where y is just an entity name, the suffix
1000 -- is of the form:
1002 -- y___XE
1004 -- i.e. the suffix has a single field, the first part matching the
1005 -- name y, followed by a "___" separator, ending with sequence XE.
1006 -- The entity name portion is fully qualified in the usual manner.
1007 -- This same naming scheme is followed for all forms of encoded
1008 -- renamings that rename a simple entity.
1010 -- For the object renaming case where y is a selected component or an
1011 -- indexed component, the variable name is suffixed by additional fields
1012 -- that give details of the components. The name starts as above with a
1013 -- y___XE name indicating the outer level object entity. Then a series of
1014 -- selections and indexing operations can be specified as follows:
1016 -- Indexed component
1018 -- A series of subscript values appear in sequence, the number
1019 -- corresponds to the number of dimensions of the array. The
1020 -- subscripts have one of the following two forms:
1022 -- XSnnn
1024 -- Here nnn is a constant value, encoded as a decimal integer
1025 -- (pos value for enumeration type case). Negative values have
1026 -- a trailing 'm' as usual.
1028 -- XSe
1030 -- Here e is the (unqualified) name of a constant entity in the
1031 -- same scope as the renaming which contains the subscript value.
1033 -- Slice
1035 -- For the slice case, we have two entries. The first is for the
1036 -- lower bound of the slice, and has the form:
1038 -- XLnnn
1039 -- XLe
1041 -- Specifies the lower bound, using exactly the same encoding as
1042 -- for an XS subscript as described above.
1044 -- Then the upper bound appears in the usual XSnnn/XSe form
1046 -- Selected component
1048 -- For a selected component, we have a single entry
1050 -- XRf
1052 -- Here f is the field name for the selection
1054 -- For an explicit deference (.all), we have a single entry
1056 -- XA
1058 -- As an example, consider the declarations:
1060 -- package p is
1061 -- type q is record
1062 -- m : string (2 .. 5);
1063 -- end record;
1065 -- type r is array (1 .. 10, 1 .. 20) of q;
1067 -- g : r;
1069 -- z : string renames g (1,5).m(2 ..3)
1070 -- end p;
1072 -- The generated variable entity would appear as
1074 -- p__z___XR_p__g___XEXS1XS5XRmXL2XS3 : _renaming_type;
1075 -- p__g___XE--------------------outer entity is g
1076 -- XS1-----------------first subscript for g
1077 -- XS5--------------second subscript for g
1078 -- XRm-----------select field m
1079 -- XL2--------lower bound of slice
1080 -- XS3-----upper bound of slice
1082 -- Note that the type of the variable is a special internal type named
1083 -- _renaming_type. This type is an arbitrary type of zero size created
1084 -- in package Standard (see cstand.adb) and is ignored by the debugger.
1086 function Debug_Renaming_Declaration (N : Node_Id) return Node_Id;
1087 -- The argument N is a renaming declaration. The result is a variable
1088 -- declaration as described in the above paragraphs. If N is not a special
1089 -- debug declaration, then Empty is returned.
1091 ---------------------------
1092 -- Packed Array Encoding --
1093 ---------------------------
1095 -- For every packed array, two types are created, and both appear in
1096 -- the debugging output.
1098 -- The original declared array type is a perfectly normal array type,
1099 -- and its index bounds indicate the original bounds of the array.
1101 -- The corresponding packed array type, which may be a modular type, or
1102 -- may be an array of bytes type (see Exp_Pakd for full details). This
1103 -- is the type that is actually used in the generated code and for
1104 -- debugging information for all objects of the packed type.
1106 -- The name of the corresponding packed array type is:
1108 -- ttt___XPnnn
1110 -- where
1111 -- ttt is the name of the original declared array
1112 -- nnn is the component size in bits (1-31)
1114 -- When the debugger sees that an object is of a type that is encoded
1115 -- in this manner, it can use the original type to determine the bounds,
1116 -- and the component size to determine the packing details.
1118 -------------------------------------------
1119 -- Packed Array Representation in Memory --
1120 -------------------------------------------
1122 -- Packed arrays are represented in tightly packed form, with no extra
1123 -- bits between components. This is true even when the component size
1124 -- is not a factor of the storage unit size, so that as a result it is
1125 -- possible for components to cross storage unit boundaries.
1127 -- The layout in storage is identical, regardless of whether the
1128 -- implementation type is a modular type or an array-of-bytes type.
1129 -- See Exp_Pakd for details of how these implementation types are used,
1130 -- but for the purpose of the debugger, only the starting address of
1131 -- the object in memory is significant.
1133 -- The following example should show clearly how the packing works in
1134 -- the little-endian and big-endian cases:
1136 -- type B is range 0 .. 7;
1137 -- for B'Size use 3;
1139 -- type BA is array (0 .. 5) of B;
1140 -- pragma Pack (BA);
1142 -- BV : constant BA := (1,2,3,4,5,6);
1144 -- Little endian case
1146 -- BV'Address + 2 BV'Address + 1 BV'Address + 0
1147 -- +-----------------+-----------------+-----------------+
1148 -- | ? ? ? ? ? ? 1 1 | 0 1 0 1 1 0 0 0 | 1 1 0 1 0 0 0 1 |
1149 -- +-----------------+-----------------+-----------------+
1150 -- <---------> <-----> <---> <---> <-----> <---> <--->
1151 -- unused bits BV(5) BV(4) BV(3) BV(2) BV(1) BV(0)
1153 -- Big endian case
1155 -- BV'Address + 0 BV'Address + 1 BV'Address + 2
1156 -- +-----------------+-----------------+-----------------+
1157 -- | 0 0 1 0 1 0 0 1 | 1 1 0 0 1 0 1 1 | 1 0 ? ? ? ? ? ? |
1158 -- +-----------------+-----------------+-----------------+
1159 -- <---> <---> <-----> <---> <---> <-----> <--------->
1160 -- BV(0) BV(1) BV(2) BV(3) BV(4) BV(5) unused bits
1162 -- Note that if a modular type is used to represent the array, the
1163 -- allocation in memory is not the same as a normal modular type. The
1164 -- difference occurs when the allocated object is larger than the size of
1165 -- the array. For a normal modular type, we extend the value on the left
1166 -- with zeroes.
1168 -- For example, in the normal modular case, if we have a 6-bit modular
1169 -- type, declared as mod 2**6, and we allocate an 8-bit object for this
1170 -- type, then we extend the value with two bits on the most significant
1171 -- end, and in either the little-endian or big-endian case, the value 63 is
1172 -- represented as 00111111 in binary in memory.
1174 -- For a modular type used to represent a packed array, the rule is
1175 -- different. In this case, if we have to extend the value, then we do it
1176 -- with undefined bits (which are not initialized and whose value is
1177 -- irrelevant to any generated code). Furthermore these bits are on the
1178 -- right (least significant bits) in the big-endian case, and on the left
1179 -- (most significant bits) in the little-endian case.
1181 -- For example, if we have a packed boolean array of 6 bits, all set to
1182 -- True, stored in an 8-bit object, then the value in memory in binary is
1183 -- ??111111 in the little-endian case, and 111111?? in the big-endian case.
1185 -- This is done so that the representation of packed arrays does not
1186 -- depend on whether we use a modular representation or array of bytes
1187 -- as previously described. This ensures that we can pass such values by
1188 -- reference in the case where a subprogram has to be able to handle values
1189 -- stored in either form.
1191 -- Note that when we extract the value of such a modular packed array, we
1192 -- expect to retrieve only the relevant bits, so in this same example, when
1193 -- we extract the value we get 111111 in both cases, and the code generated
1194 -- by the front end assumes this although it does not assume that any high
1195 -- order bits are defined.
1197 -- There are opportunities for optimization based on the knowledge that the
1198 -- unused bits are irrelevant for these type of packed arrays. For example
1199 -- if we have two such 6-bit-in-8-bit values and we do an assignment:
1201 -- a := b;
1203 -- Then logically, we extract the 6 bits and store only 6 bits in the
1204 -- result, but the back end is free to simply assign the entire 8-bits in
1205 -- this case, since we don't actually care about the undefined bits.
1206 -- However, in the equality case, it is important to ensure that the
1207 -- undefined bits do not participate in an equality test.
1209 -- If a modular packed array value is assigned to a register, then
1210 -- logically it could always be held right justified, to avoid any need to
1211 -- shift, e.g. when doing comparisons. But probably this is a bad choice,
1212 -- as it would mean that an assignment such as a := above would require
1213 -- shifts when one value is in a register and the other value is in memory.
1215 ------------------------------------------------------
1216 -- Subprograms for Handling Packed Array Type Names --
1217 ------------------------------------------------------
1219 function Make_Packed_Array_Type_Name
1220 (Typ : Entity_Id;
1221 Csize : Uint)
1222 return Name_Id;
1223 -- This function is used in Exp_Pakd to create the name that is encoded as
1224 -- described above. The entity Typ provides the name ttt, and the value
1225 -- Csize is the component size that provides the nnn value.
1227 --------------------------------------
1228 -- Pointers to Unconstrained Arrays --
1229 --------------------------------------
1231 -- There are two kinds of pointers to arrays. The debugger can tell which
1232 -- format is in use by the form of the type of the pointer.
1234 -- Fat Pointers
1236 -- Fat pointers are represented as a struct with two fields. This
1237 -- struct has two distinguished field names:
1239 -- P_ARRAY is a pointer to the array type. The name of this type is
1240 -- the unconstrained type followed by "___XUA". This array will have
1241 -- bounds which are the discriminants, and hence are unparsable, but
1242 -- will give the number of subscripts and the component type.
1244 -- P_BOUNDS is a pointer to a struct, the name of whose type is the
1245 -- unconstrained array name followed by "___XUB" and which has
1246 -- fields of the form
1248 -- LBn (n a decimal integer) lower bound of n'th dimension
1249 -- UBn (n a decimal integer) upper bound of n'th dimension
1251 -- The bounds may be any integral type. In the case of an enumeration
1252 -- type, Enum_Rep values are used.
1254 -- For a given unconstrained array type, the compiler will generate one
1255 -- fat-pointer type whose name is "arr___XUP", where "arr" is the name
1256 -- of the array type, and use it to represent the array type itself in
1257 -- the debugging information.
1258 -- For each pointer to this unconstrained array type, the compiler will
1259 -- generate a typedef that points to the above "arr___XUP" fat-pointer
1260 -- type. As a consequence, when it comes to fat-pointer types:
1262 -- 1. The type name is given by the typedef
1264 -- 2. If the debugger is asked to output the type, the appropriate
1265 -- form is "access arr", except if the type name is "arr___XUP"
1266 -- for which it is the array definition.
1268 -- Thin Pointers
1270 -- The value of a thin pointer is a pointer to the second field of a
1271 -- structure with two fields. The name of this structure's type is
1272 -- "arr___XUT", where "arr" is the name of the unconstrained array
1273 -- type. Even though it actually points into middle of this structure,
1274 -- the thin pointer's type in debugging information is
1275 -- pointer-to-arr___XUT.
1277 -- The first field of arr___XUT is named BOUNDS, and has a type named
1278 -- arr___XUB, with the structure described for such types in fat
1279 -- pointers, as described above.
1281 -- The second field of arr___XUT is named ARRAY, and contains the
1282 -- actual array. Because this array has a dynamic size, determined by
1283 -- the BOUNDS field that precedes it, all of the information about
1284 -- arr___XUT is encoded in a parallel type named arr___XUT___XVE, with
1285 -- fields BOUNDS and ARRAY___XVL. As for previously described ___XVE
1286 -- types, ARRAY___XVL has a pointer-to-array type. However, the array
1287 -- type in this case is named arr___XUA and only its element type is
1288 -- meaningful, just as described for fat pointers.
1290 --------------------------------------
1291 -- Tagged Types and Type Extensions --
1292 --------------------------------------
1294 -- A type C derived from a tagged type P has a field named "_parent" of
1295 -- type P that contains its inherited fields. The type of this field is
1296 -- usually P (encoded as usual if it has a dynamic size), but may be a more
1297 -- distant ancestor, if P is a null extension of that type.
1299 -- The type tag of a tagged type is a field named _tag, of type void*. If
1300 -- the type is derived from another tagged type, its _tag field is found in
1301 -- its _parent field.
1303 -----------------------------
1304 -- Variant Record Encoding --
1305 -----------------------------
1307 -- The variant part of a variant record is encoded as a single field in the
1308 -- enclosing record, whose name is:
1310 -- discrim___XVN
1312 -- where discrim is the unqualified name of the variant. This field name is
1313 -- built by gigi (not by code in this unit). For Unchecked_Union record,
1314 -- this discriminant will not appear in the record, and the debugger must
1315 -- proceed accordingly (basically it can treat this case as it would a C
1316 -- union).
1318 -- The type corresponding to this field has a name that is obtained by
1319 -- concatenating the type name with the above string and is similar to a C
1320 -- union, in which each member of the union corresponds to one variant.
1321 -- However, unlike a C union, the size of the type may be variable even if
1322 -- each of the components are fixed size, since it includes a computation
1323 -- of which variant is present. In that case, it will be encoded as above
1324 -- and a type with the suffix "___XVN___XVU" will be present.
1326 -- The name of the union member is encoded to indicate the choices, and
1327 -- is a string given by the following grammar:
1329 -- union_name ::= {choice} | others_choice
1330 -- choice ::= simple_choice | range_choice
1331 -- simple_choice ::= S number
1332 -- range_choice ::= R number T number
1333 -- number ::= {decimal_digit} [m]
1334 -- others_choice ::= O (upper case letter O)
1336 -- The m in a number indicates a negative value. As an example of this
1337 -- encoding scheme, the choice 1 .. 4 | 7 | -10 would be represented by
1339 -- R1T4S7S10m
1341 -- In the case of enumeration values, the values used are the actual
1342 -- representation values in the case where an enumeration type has an
1343 -- enumeration representation spec (i.e. they are values that correspond
1344 -- to the use of the Enum_Rep attribute).
1346 -- The type of the inner record is given by the name of the union type (as
1347 -- above) concatenated with the above string. Since that type may itself be
1348 -- variable-sized, it may also be encoded as above with a new type with a
1349 -- further suffix of "___XVU".
1351 -- As an example, consider:
1353 -- type Var (Disc : Boolean := True) is record
1354 -- M : Integer;
1356 -- case Disc is
1357 -- when True =>
1358 -- R : Integer;
1359 -- S : Integer;
1361 -- when False =>
1362 -- T : Integer;
1363 -- end case;
1364 -- end record;
1366 -- V1 : Var;
1368 -- In this case, the type var is represented as a struct with three fields,
1369 -- the first two are "disc" and "m", representing the values of these
1370 -- record components.
1372 -- The third field is a union of two types, with field names S1 and O. S1
1373 -- is a struct with fields "r" and "s", and O is a struct with fields "t".
1375 ------------------------------------------------
1376 -- Subprograms for Handling Variant Encodings --
1377 ------------------------------------------------
1379 procedure Get_Variant_Encoding (V : Node_Id);
1380 -- This procedure is called by Gigi with V being the variant node. The
1381 -- corresponding encoding string is returned in Name_Buffer with the length
1382 -- of the string in Name_Len, and an ASCII.NUL character stored following
1383 -- the name.
1385 ---------------------------------
1386 -- Subtypes of Variant Records --
1387 ---------------------------------
1389 -- A subtype of a variant record is represented by a type in which the
1390 -- union field from the base type is replaced by one of the possible
1391 -- values. For example, if we have:
1393 -- type Var (Disc : Boolean := True) is record
1394 -- M : Integer;
1396 -- case Disc is
1397 -- when True =>
1398 -- R : Integer;
1399 -- S : Integer;
1401 -- when False =>
1402 -- T : Integer;
1403 -- end case;
1405 -- end record;
1406 -- V1 : Var;
1407 -- V2 : Var (True);
1408 -- V3 : Var (False);
1410 -- Here V2, for example, is represented with a subtype whose name is
1411 -- something like TvarS3b, which is a struct with three fields. The first
1412 -- two fields are "disc" and "m" as for the base type, and the third field
1413 -- is S1, which contains the fields "r" and "s".
1415 -- The debugger should simply ignore structs with names of the form
1416 -- corresponding to variants, and consider the fields inside as belonging
1417 -- to the containing record.
1419 -------------------------------------------
1420 -- Character literals in Character Types --
1421 -------------------------------------------
1423 -- Character types are enumeration types at least one of whose enumeration
1424 -- literals is a character literal. Enumeration literals are usually simply
1425 -- represented using their identifier names. If the enumeration literal is
1426 -- a character literal, the name is encoded as described in the following
1427 -- paragraph.
1429 -- A name QUhh, where each 'h' is a lower-case hexadecimal digit, stands
1430 -- for a character whose Unicode encoding is hh, and QWhhhh likewise stands
1431 -- for a wide character whose encoding is hhhh. The representation values
1432 -- are encoded as for ordinary enumeration literals (and have no necessary
1433 -- relationship to the values encoded in the names).
1435 -- For example, given the type declaration
1437 -- type x is (A, 'C', B);
1439 -- the second enumeration literal would be named QU43 and the value
1440 -- assigned to it would be 1.
1442 -----------------------------------------------
1443 -- Secondary Dispatch tables of tagged types --
1444 -----------------------------------------------
1446 procedure Get_Secondary_DT_External_Name
1447 (Typ : Entity_Id;
1448 Ancestor_Typ : Entity_Id;
1449 Suffix_Index : Int);
1450 -- Set Name_Buffer and Name_Len to the external name of one secondary
1451 -- dispatch table of Typ. If the interface has been inherited from some
1452 -- ancestor then Ancestor_Typ is such node (in this case the secondary DT
1453 -- is needed to handle overridden primitives); if there is no such ancestor
1454 -- then Ancestor_Typ is equal to Typ.
1456 -- Internal rule followed for the generation of the external name:
1458 -- Case 1. If the secondary dispatch has not been inherited from some
1459 -- ancestor of Typ then the external name is composed as
1460 -- follows:
1461 -- External_Name (Typ) + Suffix_Number + 'P'
1463 -- Case 2. if the secondary dispatch table has been inherited from some
1464 -- ancestor then the external name is composed as follows:
1465 -- External_Name (Typ) + '_' + External_Name (Ancestor_Typ)
1466 -- + Suffix_Number + 'P'
1468 -- Note: We have to use the external names (instead of simply their names)
1469 -- to protect the frontend against programs that give the same name to all
1470 -- the interfaces and use the expanded name to reference them. The
1471 -- Suffix_Number is used to differentiate all the secondary dispatch
1472 -- tables of a given type.
1474 -- Examples:
1476 -- package Pkg1 is | package Pkg2 is | package Pkg3 is
1477 -- type Typ is | type Typ is | type Typ is
1478 -- interface; | interface; | interface;
1479 -- end Pkg1; | end Pkg; | end Pkg3;
1481 -- with Pkg1, Pkg2, Pkg3;
1482 -- package Case_1 is
1483 -- type Typ is new Pkg1.Typ and Pkg2.Typ and Pkg3.Typ with ...
1484 -- end Case_1;
1486 -- with Case_1;
1487 -- package Case_2 is
1488 -- type Typ is new Case_1.Typ with ...
1489 -- end Case_2;
1491 -- These are the external names generated for Case_1.Typ (note that
1492 -- Pkg1.Typ is associated with the Primary Dispatch Table, because it
1493 -- is the the parent of this type, and hence no external name is
1494 -- generated for it).
1495 -- case_1__typ0P (associated with Pkg2.Typ)
1496 -- case_1__typ1P (associated with Pkg3.Typ)
1498 -- These are the external names generated for Case_2.Typ:
1499 -- case_2__typ_case_1__typ0P
1500 -- case_2__typ_case_1__typ1P
1502 ----------------------------
1503 -- Effect of Optimization --
1504 ----------------------------
1506 -- If the program is compiled with optimization on (e.g. -O1 switch
1507 -- specified), then there may be variations in the output from the above
1508 -- specification. In particular, objects may disappear from the output.
1509 -- This includes not only constants and variables that the program declares
1510 -- at the source level, but also the x___L and x___U constants created to
1511 -- describe the lower and upper bounds of subtypes with dynamic bounds.
1512 -- This means for example, that array bounds may disappear if optimization
1513 -- is turned on. The debugger is expected to recognize that these constants
1514 -- are missing and deal as best as it can with the limited information
1515 -- available.
1517 ---------------------------------
1518 -- GNAT Extensions to DWARF2/3 --
1519 ---------------------------------
1521 -- If the compiler switch "-gdwarf+" is specified, GNAT Vendor extensions
1522 -- to DWARF2/3 are generated, with the following variations from the above
1523 -- specification.
1525 -- Change in the contents of the DW_AT_name attribute.
1526 -- The operators are represented in their natural form. (Ie, the addition
1527 -- operator is written as "+" instead of "Oadd").
1528 -- The component separation string is "." instead of "__"
1530 -- Introduction of DW_AT_GNAT_encoding, encoded with value 0x2301.
1531 -- Any debugging information entry representing a program entity, named
1532 -- or implicit, may have a DW_AT_GNAT_encoding attribute. The value of
1533 -- this attribute is a string representing the suffix internally added
1534 -- by GNAT for various purposes, mainly for representing debug
1535 -- information compatible with other formats.
1537 -- If a debugging information entry has multiple encodings, all of them
1538 -- will be listed in DW_AT_GNAT_encoding. The separator for this list
1539 -- is ':'.
1541 -- Introduction of DW_AT_GNAT_descriptive_type, encoded with value 0x2302
1542 -- Any debugging information entry representing a type may have a
1543 -- DW_AT_GNAT_descriptive_type attribute whose value is a reference,
1544 -- pointing to a debugging information entry representing another type
1545 -- associated to the type.
1547 -- Modification of the contents of the DW_AT_producer string.
1548 -- When emitting full GNAT Vendor extensions to DWARF2/3, "-gdwarf+"
1549 -- is appended to the DW_AT_producer string.
1551 -- When emitting only DW_AT_GNAT_descriptive_type, "-gdwarf+-" is
1552 -- appended to the DW_AT_producer string.
1554 end Exp_Dbug;