[RS6000] TOC refs generated during reload
[official-gcc.git] / gcc / cgraphunit.c
blobd8f79036b407e85eeb5be09d5eedac9f5f8d6140
1 /* Driver of optimization process
2 Copyright (C) 2003-2016 Free Software Foundation, Inc.
3 Contributed by Jan Hubicka
5 This file is part of GCC.
7 GCC is free software; you can redistribute it and/or modify it under
8 the terms of the GNU General Public License as published by the Free
9 Software Foundation; either version 3, or (at your option) any later
10 version.
12 GCC is distributed in the hope that it will be useful, but WITHOUT ANY
13 WARRANTY; without even the implied warranty of MERCHANTABILITY or
14 FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
15 for more details.
17 You should have received a copy of the GNU General Public License
18 along with GCC; see the file COPYING3. If not see
19 <http://www.gnu.org/licenses/>. */
21 /* This module implements main driver of compilation process.
23 The main scope of this file is to act as an interface in between
24 tree based frontends and the backend.
26 The front-end is supposed to use following functionality:
28 - finalize_function
30 This function is called once front-end has parsed whole body of function
31 and it is certain that the function body nor the declaration will change.
33 (There is one exception needed for implementing GCC extern inline
34 function.)
36 - varpool_finalize_decl
38 This function has same behavior as the above but is used for static
39 variables.
41 - add_asm_node
43 Insert new toplevel ASM statement
45 - finalize_compilation_unit
47 This function is called once (source level) compilation unit is finalized
48 and it will no longer change.
50 The symbol table is constructed starting from the trivially needed
51 symbols finalized by the frontend. Functions are lowered into
52 GIMPLE representation and callgraph/reference lists are constructed.
53 Those are used to discover other necessary functions and variables.
55 At the end the bodies of unreachable functions are removed.
57 The function can be called multiple times when multiple source level
58 compilation units are combined.
60 - compile
62 This passes control to the back-end. Optimizations are performed and
63 final assembler is generated. This is done in the following way. Note
64 that with link time optimization the process is split into three
65 stages (compile time, linktime analysis and parallel linktime as
66 indicated bellow).
68 Compile time:
70 1) Inter-procedural optimization.
71 (ipa_passes)
73 This part is further split into:
75 a) early optimizations. These are local passes executed in
76 the topological order on the callgraph.
78 The purpose of early optimiations is to optimize away simple
79 things that may otherwise confuse IP analysis. Very simple
80 propagation across the callgraph is done i.e. to discover
81 functions without side effects and simple inlining is performed.
83 b) early small interprocedural passes.
85 Those are interprocedural passes executed only at compilation
86 time. These include, for example, transational memory lowering,
87 unreachable code removal and other simple transformations.
89 c) IP analysis stage. All interprocedural passes do their
90 analysis.
92 Interprocedural passes differ from small interprocedural
93 passes by their ability to operate across whole program
94 at linktime. Their analysis stage is performed early to
95 both reduce linking times and linktime memory usage by
96 not having to represent whole program in memory.
98 d) LTO sreaming. When doing LTO, everything important gets
99 streamed into the object file.
101 Compile time and or linktime analysis stage (WPA):
103 At linktime units gets streamed back and symbol table is
104 merged. Function bodies are not streamed in and not
105 available.
106 e) IP propagation stage. All IP passes execute their
107 IP propagation. This is done based on the earlier analysis
108 without having function bodies at hand.
109 f) Ltrans streaming. When doing WHOPR LTO, the program
110 is partitioned and streamed into multple object files.
112 Compile time and/or parallel linktime stage (ltrans)
114 Each of the object files is streamed back and compiled
115 separately. Now the function bodies becomes available
116 again.
118 2) Virtual clone materialization
119 (cgraph_materialize_clone)
121 IP passes can produce copies of existing functoins (such
122 as versioned clones or inline clones) without actually
123 manipulating their bodies by creating virtual clones in
124 the callgraph. At this time the virtual clones are
125 turned into real functions
126 3) IP transformation
128 All IP passes transform function bodies based on earlier
129 decision of the IP propagation.
131 4) late small IP passes
133 Simple IP passes working within single program partition.
135 5) Expansion
136 (expand_all_functions)
138 At this stage functions that needs to be output into
139 assembler are identified and compiled in topological order
140 6) Output of variables and aliases
141 Now it is known what variable references was not optimized
142 out and thus all variables are output to the file.
144 Note that with -fno-toplevel-reorder passes 5 and 6
145 are combined together in cgraph_output_in_order.
147 Finally there are functions to manipulate the callgraph from
148 backend.
149 - cgraph_add_new_function is used to add backend produced
150 functions introduced after the unit is finalized.
151 The functions are enqueue for later processing and inserted
152 into callgraph with cgraph_process_new_functions.
154 - cgraph_function_versioning
156 produces a copy of function into new one (a version)
157 and apply simple transformations
160 #include "config.h"
161 #include "system.h"
162 #include "coretypes.h"
163 #include "backend.h"
164 #include "target.h"
165 #include "rtl.h"
166 #include "tree.h"
167 #include "gimple.h"
168 #include "cfghooks.h"
169 #include "regset.h" /* FIXME: For reg_obstack. */
170 #include "alloc-pool.h"
171 #include "tree-pass.h"
172 #include "stringpool.h"
173 #include "gimple-ssa.h"
174 #include "cgraph.h"
175 #include "coverage.h"
176 #include "lto-streamer.h"
177 #include "fold-const.h"
178 #include "varasm.h"
179 #include "stor-layout.h"
180 #include "output.h"
181 #include "cfgcleanup.h"
182 #include "gimple-fold.h"
183 #include "gimplify.h"
184 #include "gimple-iterator.h"
185 #include "gimplify-me.h"
186 #include "tree-cfg.h"
187 #include "tree-into-ssa.h"
188 #include "tree-ssa.h"
189 #include "langhooks.h"
190 #include "toplev.h"
191 #include "debug.h"
192 #include "symbol-summary.h"
193 #include "ipa-prop.h"
194 #include "gimple-pretty-print.h"
195 #include "plugin.h"
196 #include "ipa-inline.h"
197 #include "ipa-utils.h"
198 #include "except.h"
199 #include "cfgloop.h"
200 #include "context.h"
201 #include "pass_manager.h"
202 #include "tree-nested.h"
203 #include "dbgcnt.h"
204 #include "tree-chkp.h"
205 #include "lto-section-names.h"
207 /* Queue of cgraph nodes scheduled to be added into cgraph. This is a
208 secondary queue used during optimization to accommodate passes that
209 may generate new functions that need to be optimized and expanded. */
210 vec<cgraph_node *> cgraph_new_nodes;
212 static void expand_all_functions (void);
213 static void mark_functions_to_output (void);
214 static void handle_alias_pairs (void);
216 /* Used for vtable lookup in thunk adjusting. */
217 static GTY (()) tree vtable_entry_type;
219 /* Determine if symbol declaration is needed. That is, visible to something
220 either outside this translation unit, something magic in the system
221 configury */
222 bool
223 symtab_node::needed_p (void)
225 /* Double check that no one output the function into assembly file
226 early. */
227 gcc_checking_assert (!DECL_ASSEMBLER_NAME_SET_P (decl)
228 || !TREE_SYMBOL_REFERENCED (DECL_ASSEMBLER_NAME (decl)));
230 if (!definition)
231 return false;
233 if (DECL_EXTERNAL (decl))
234 return false;
236 /* If the user told us it is used, then it must be so. */
237 if (force_output)
238 return true;
240 /* ABI forced symbols are needed when they are external. */
241 if (forced_by_abi && TREE_PUBLIC (decl))
242 return true;
244 /* Keep constructors, destructors and virtual functions. */
245 if (TREE_CODE (decl) == FUNCTION_DECL
246 && (DECL_STATIC_CONSTRUCTOR (decl) || DECL_STATIC_DESTRUCTOR (decl)))
247 return true;
249 /* Externally visible variables must be output. The exception is
250 COMDAT variables that must be output only when they are needed. */
251 if (TREE_PUBLIC (decl) && !DECL_COMDAT (decl))
252 return true;
254 return false;
257 /* Head and terminator of the queue of nodes to be processed while building
258 callgraph. */
260 static symtab_node symtab_terminator;
261 static symtab_node *queued_nodes = &symtab_terminator;
263 /* Add NODE to queue starting at QUEUED_NODES.
264 The queue is linked via AUX pointers and terminated by pointer to 1. */
266 static void
267 enqueue_node (symtab_node *node)
269 if (node->aux)
270 return;
271 gcc_checking_assert (queued_nodes);
272 node->aux = queued_nodes;
273 queued_nodes = node;
276 /* Process CGRAPH_NEW_FUNCTIONS and perform actions necessary to add these
277 functions into callgraph in a way so they look like ordinary reachable
278 functions inserted into callgraph already at construction time. */
280 void
281 symbol_table::process_new_functions (void)
283 tree fndecl;
285 if (!cgraph_new_nodes.exists ())
286 return;
288 handle_alias_pairs ();
289 /* Note that this queue may grow as its being processed, as the new
290 functions may generate new ones. */
291 for (unsigned i = 0; i < cgraph_new_nodes.length (); i++)
293 cgraph_node *node = cgraph_new_nodes[i];
294 fndecl = node->decl;
295 switch (state)
297 case CONSTRUCTION:
298 /* At construction time we just need to finalize function and move
299 it into reachable functions list. */
301 cgraph_node::finalize_function (fndecl, false);
302 call_cgraph_insertion_hooks (node);
303 enqueue_node (node);
304 break;
306 case IPA:
307 case IPA_SSA:
308 case IPA_SSA_AFTER_INLINING:
309 /* When IPA optimization already started, do all essential
310 transformations that has been already performed on the whole
311 cgraph but not on this function. */
313 gimple_register_cfg_hooks ();
314 if (!node->analyzed)
315 node->analyze ();
316 push_cfun (DECL_STRUCT_FUNCTION (fndecl));
317 if ((state == IPA_SSA || state == IPA_SSA_AFTER_INLINING)
318 && !gimple_in_ssa_p (DECL_STRUCT_FUNCTION (fndecl)))
319 g->get_passes ()->execute_early_local_passes ();
320 else if (inline_summaries != NULL)
321 compute_inline_parameters (node, true);
322 free_dominance_info (CDI_POST_DOMINATORS);
323 free_dominance_info (CDI_DOMINATORS);
324 pop_cfun ();
325 call_cgraph_insertion_hooks (node);
326 break;
328 case EXPANSION:
329 /* Functions created during expansion shall be compiled
330 directly. */
331 node->process = 0;
332 call_cgraph_insertion_hooks (node);
333 node->expand ();
334 break;
336 default:
337 gcc_unreachable ();
338 break;
342 cgraph_new_nodes.release ();
345 /* As an GCC extension we allow redefinition of the function. The
346 semantics when both copies of bodies differ is not well defined.
347 We replace the old body with new body so in unit at a time mode
348 we always use new body, while in normal mode we may end up with
349 old body inlined into some functions and new body expanded and
350 inlined in others.
352 ??? It may make more sense to use one body for inlining and other
353 body for expanding the function but this is difficult to do. */
355 void
356 cgraph_node::reset (void)
358 /* If process is set, then we have already begun whole-unit analysis.
359 This is *not* testing for whether we've already emitted the function.
360 That case can be sort-of legitimately seen with real function redefinition
361 errors. I would argue that the front end should never present us with
362 such a case, but don't enforce that for now. */
363 gcc_assert (!process);
365 /* Reset our data structures so we can analyze the function again. */
366 memset (&local, 0, sizeof (local));
367 memset (&global, 0, sizeof (global));
368 memset (&rtl, 0, sizeof (rtl));
369 analyzed = false;
370 definition = false;
371 alias = false;
372 transparent_alias = false;
373 weakref = false;
374 cpp_implicit_alias = false;
376 remove_callees ();
377 remove_all_references ();
380 /* Return true when there are references to the node. INCLUDE_SELF is
381 true if a self reference counts as a reference. */
383 bool
384 symtab_node::referred_to_p (bool include_self)
386 ipa_ref *ref = NULL;
388 /* See if there are any references at all. */
389 if (iterate_referring (0, ref))
390 return true;
391 /* For functions check also calls. */
392 cgraph_node *cn = dyn_cast <cgraph_node *> (this);
393 if (cn && cn->callers)
395 if (include_self)
396 return true;
397 for (cgraph_edge *e = cn->callers; e; e = e->next_caller)
398 if (e->caller != this)
399 return true;
401 return false;
404 /* DECL has been parsed. Take it, queue it, compile it at the whim of the
405 logic in effect. If NO_COLLECT is true, then our caller cannot stand to have
406 the garbage collector run at the moment. We would need to either create
407 a new GC context, or just not compile right now. */
409 void
410 cgraph_node::finalize_function (tree decl, bool no_collect)
412 cgraph_node *node = cgraph_node::get_create (decl);
414 if (node->definition)
416 /* Nested functions should only be defined once. */
417 gcc_assert (!DECL_CONTEXT (decl)
418 || TREE_CODE (DECL_CONTEXT (decl)) != FUNCTION_DECL);
419 node->reset ();
420 node->local.redefined_extern_inline = true;
423 /* Set definition first before calling notice_global_symbol so that
424 it is available to notice_global_symbol. */
425 node->definition = true;
426 notice_global_symbol (decl);
427 node->lowered = DECL_STRUCT_FUNCTION (decl)->cfg != NULL;
429 /* With -fkeep-inline-functions we are keeping all inline functions except
430 for extern inline ones. */
431 if (flag_keep_inline_functions
432 && DECL_DECLARED_INLINE_P (decl)
433 && !DECL_EXTERNAL (decl)
434 && !DECL_DISREGARD_INLINE_LIMITS (decl))
435 node->force_output = 1;
437 /* When not optimizing, also output the static functions. (see
438 PR24561), but don't do so for always_inline functions, functions
439 declared inline and nested functions. These were optimized out
440 in the original implementation and it is unclear whether we want
441 to change the behavior here. */
442 if (((!opt_for_fn (decl, optimize) || flag_keep_static_functions)
443 && !node->cpp_implicit_alias
444 && !DECL_DISREGARD_INLINE_LIMITS (decl)
445 && !DECL_DECLARED_INLINE_P (decl)
446 && !(DECL_CONTEXT (decl)
447 && TREE_CODE (DECL_CONTEXT (decl)) == FUNCTION_DECL))
448 && !DECL_COMDAT (decl) && !DECL_EXTERNAL (decl))
449 node->force_output = 1;
451 /* If we've not yet emitted decl, tell the debug info about it. */
452 if (!TREE_ASM_WRITTEN (decl))
453 (*debug_hooks->deferred_inline_function) (decl);
455 if (!no_collect)
456 ggc_collect ();
458 if (symtab->state == CONSTRUCTION
459 && (node->needed_p () || node->referred_to_p ()))
460 enqueue_node (node);
463 /* Add the function FNDECL to the call graph.
464 Unlike finalize_function, this function is intended to be used
465 by middle end and allows insertion of new function at arbitrary point
466 of compilation. The function can be either in high, low or SSA form
467 GIMPLE.
469 The function is assumed to be reachable and have address taken (so no
470 API breaking optimizations are performed on it).
472 Main work done by this function is to enqueue the function for later
473 processing to avoid need the passes to be re-entrant. */
475 void
476 cgraph_node::add_new_function (tree fndecl, bool lowered)
478 gcc::pass_manager *passes = g->get_passes ();
479 cgraph_node *node;
481 if (dump_file)
483 struct function *fn = DECL_STRUCT_FUNCTION (fndecl);
484 const char *function_type = ((gimple_has_body_p (fndecl))
485 ? (lowered
486 ? (gimple_in_ssa_p (fn)
487 ? "ssa gimple"
488 : "low gimple")
489 : "high gimple")
490 : "to-be-gimplified");
491 fprintf (dump_file,
492 "Added new %s function %s to callgraph\n",
493 function_type,
494 fndecl_name (fndecl));
497 switch (symtab->state)
499 case PARSING:
500 cgraph_node::finalize_function (fndecl, false);
501 break;
502 case CONSTRUCTION:
503 /* Just enqueue function to be processed at nearest occurrence. */
504 node = cgraph_node::get_create (fndecl);
505 if (lowered)
506 node->lowered = true;
507 cgraph_new_nodes.safe_push (node);
508 break;
510 case IPA:
511 case IPA_SSA:
512 case IPA_SSA_AFTER_INLINING:
513 case EXPANSION:
514 /* Bring the function into finalized state and enqueue for later
515 analyzing and compilation. */
516 node = cgraph_node::get_create (fndecl);
517 node->local.local = false;
518 node->definition = true;
519 node->force_output = true;
520 if (!lowered && symtab->state == EXPANSION)
522 push_cfun (DECL_STRUCT_FUNCTION (fndecl));
523 gimple_register_cfg_hooks ();
524 bitmap_obstack_initialize (NULL);
525 execute_pass_list (cfun, passes->all_lowering_passes);
526 passes->execute_early_local_passes ();
527 bitmap_obstack_release (NULL);
528 pop_cfun ();
530 lowered = true;
532 if (lowered)
533 node->lowered = true;
534 cgraph_new_nodes.safe_push (node);
535 break;
537 case FINISHED:
538 /* At the very end of compilation we have to do all the work up
539 to expansion. */
540 node = cgraph_node::create (fndecl);
541 if (lowered)
542 node->lowered = true;
543 node->definition = true;
544 node->analyze ();
545 push_cfun (DECL_STRUCT_FUNCTION (fndecl));
546 gimple_register_cfg_hooks ();
547 bitmap_obstack_initialize (NULL);
548 if (!gimple_in_ssa_p (DECL_STRUCT_FUNCTION (fndecl)))
549 g->get_passes ()->execute_early_local_passes ();
550 bitmap_obstack_release (NULL);
551 pop_cfun ();
552 node->expand ();
553 break;
555 default:
556 gcc_unreachable ();
559 /* Set a personality if required and we already passed EH lowering. */
560 if (lowered
561 && (function_needs_eh_personality (DECL_STRUCT_FUNCTION (fndecl))
562 == eh_personality_lang))
563 DECL_FUNCTION_PERSONALITY (fndecl) = lang_hooks.eh_personality ();
566 /* Analyze the function scheduled to be output. */
567 void
568 cgraph_node::analyze (void)
570 tree decl = this->decl;
571 location_t saved_loc = input_location;
572 input_location = DECL_SOURCE_LOCATION (decl);
574 if (thunk.thunk_p)
576 cgraph_node *t = cgraph_node::get (thunk.alias);
578 create_edge (t, NULL, 0, CGRAPH_FREQ_BASE);
579 callees->can_throw_external = !TREE_NOTHROW (t->decl);
580 /* Target code in expand_thunk may need the thunk's target
581 to be analyzed, so recurse here. */
582 if (!t->analyzed)
583 t->analyze ();
584 if (t->alias)
586 t = t->get_alias_target ();
587 if (!t->analyzed)
588 t->analyze ();
590 if (!expand_thunk (false, false))
592 thunk.alias = NULL;
593 return;
595 thunk.alias = NULL;
597 if (alias)
598 resolve_alias (cgraph_node::get (alias_target), transparent_alias);
599 else if (dispatcher_function)
601 /* Generate the dispatcher body of multi-versioned functions. */
602 cgraph_function_version_info *dispatcher_version_info
603 = function_version ();
604 if (dispatcher_version_info != NULL
605 && (dispatcher_version_info->dispatcher_resolver
606 == NULL_TREE))
608 tree resolver = NULL_TREE;
609 gcc_assert (targetm.generate_version_dispatcher_body);
610 resolver = targetm.generate_version_dispatcher_body (this);
611 gcc_assert (resolver != NULL_TREE);
614 else
616 push_cfun (DECL_STRUCT_FUNCTION (decl));
618 assign_assembler_name_if_neeeded (decl);
620 /* Make sure to gimplify bodies only once. During analyzing a
621 function we lower it, which will require gimplified nested
622 functions, so we can end up here with an already gimplified
623 body. */
624 if (!gimple_has_body_p (decl))
625 gimplify_function_tree (decl);
627 /* Lower the function. */
628 if (!lowered)
630 if (nested)
631 lower_nested_functions (decl);
632 gcc_assert (!nested);
634 gimple_register_cfg_hooks ();
635 bitmap_obstack_initialize (NULL);
636 execute_pass_list (cfun, g->get_passes ()->all_lowering_passes);
637 free_dominance_info (CDI_POST_DOMINATORS);
638 free_dominance_info (CDI_DOMINATORS);
639 compact_blocks ();
640 bitmap_obstack_release (NULL);
641 lowered = true;
644 pop_cfun ();
646 analyzed = true;
648 input_location = saved_loc;
651 /* C++ frontend produce same body aliases all over the place, even before PCH
652 gets streamed out. It relies on us linking the aliases with their function
653 in order to do the fixups, but ipa-ref is not PCH safe. Consequentely we
654 first produce aliases without links, but once C++ FE is sure he won't sream
655 PCH we build the links via this function. */
657 void
658 symbol_table::process_same_body_aliases (void)
660 symtab_node *node;
661 FOR_EACH_SYMBOL (node)
662 if (node->cpp_implicit_alias && !node->analyzed)
663 node->resolve_alias
664 (TREE_CODE (node->alias_target) == VAR_DECL
665 ? (symtab_node *)varpool_node::get_create (node->alias_target)
666 : (symtab_node *)cgraph_node::get_create (node->alias_target));
667 cpp_implicit_aliases_done = true;
670 /* Process attributes common for vars and functions. */
672 static void
673 process_common_attributes (symtab_node *node, tree decl)
675 tree weakref = lookup_attribute ("weakref", DECL_ATTRIBUTES (decl));
677 if (weakref && !lookup_attribute ("alias", DECL_ATTRIBUTES (decl)))
679 warning_at (DECL_SOURCE_LOCATION (decl), OPT_Wattributes,
680 "%<weakref%> attribute should be accompanied with"
681 " an %<alias%> attribute");
682 DECL_WEAK (decl) = 0;
683 DECL_ATTRIBUTES (decl) = remove_attribute ("weakref",
684 DECL_ATTRIBUTES (decl));
687 if (lookup_attribute ("no_reorder", DECL_ATTRIBUTES (decl)))
688 node->no_reorder = 1;
691 /* Look for externally_visible and used attributes and mark cgraph nodes
692 accordingly.
694 We cannot mark the nodes at the point the attributes are processed (in
695 handle_*_attribute) because the copy of the declarations available at that
696 point may not be canonical. For example, in:
698 void f();
699 void f() __attribute__((used));
701 the declaration we see in handle_used_attribute will be the second
702 declaration -- but the front end will subsequently merge that declaration
703 with the original declaration and discard the second declaration.
705 Furthermore, we can't mark these nodes in finalize_function because:
707 void f() {}
708 void f() __attribute__((externally_visible));
710 is valid.
712 So, we walk the nodes at the end of the translation unit, applying the
713 attributes at that point. */
715 static void
716 process_function_and_variable_attributes (cgraph_node *first,
717 varpool_node *first_var)
719 cgraph_node *node;
720 varpool_node *vnode;
722 for (node = symtab->first_function (); node != first;
723 node = symtab->next_function (node))
725 tree decl = node->decl;
726 if (DECL_PRESERVE_P (decl))
727 node->mark_force_output ();
728 else if (lookup_attribute ("externally_visible", DECL_ATTRIBUTES (decl)))
730 if (! TREE_PUBLIC (node->decl))
731 warning_at (DECL_SOURCE_LOCATION (node->decl), OPT_Wattributes,
732 "%<externally_visible%>"
733 " attribute have effect only on public objects");
735 if (lookup_attribute ("weakref", DECL_ATTRIBUTES (decl))
736 && (node->definition && !node->alias))
738 warning_at (DECL_SOURCE_LOCATION (node->decl), OPT_Wattributes,
739 "%<weakref%> attribute ignored"
740 " because function is defined");
741 DECL_WEAK (decl) = 0;
742 DECL_ATTRIBUTES (decl) = remove_attribute ("weakref",
743 DECL_ATTRIBUTES (decl));
746 if (lookup_attribute ("always_inline", DECL_ATTRIBUTES (decl))
747 && !DECL_DECLARED_INLINE_P (decl)
748 /* redefining extern inline function makes it DECL_UNINLINABLE. */
749 && !DECL_UNINLINABLE (decl))
750 warning_at (DECL_SOURCE_LOCATION (decl), OPT_Wattributes,
751 "always_inline function might not be inlinable");
753 process_common_attributes (node, decl);
755 for (vnode = symtab->first_variable (); vnode != first_var;
756 vnode = symtab->next_variable (vnode))
758 tree decl = vnode->decl;
759 if (DECL_EXTERNAL (decl)
760 && DECL_INITIAL (decl))
761 varpool_node::finalize_decl (decl);
762 if (DECL_PRESERVE_P (decl))
763 vnode->force_output = true;
764 else if (lookup_attribute ("externally_visible", DECL_ATTRIBUTES (decl)))
766 if (! TREE_PUBLIC (vnode->decl))
767 warning_at (DECL_SOURCE_LOCATION (vnode->decl), OPT_Wattributes,
768 "%<externally_visible%>"
769 " attribute have effect only on public objects");
771 if (lookup_attribute ("weakref", DECL_ATTRIBUTES (decl))
772 && vnode->definition
773 && DECL_INITIAL (decl))
775 warning_at (DECL_SOURCE_LOCATION (vnode->decl), OPT_Wattributes,
776 "%<weakref%> attribute ignored"
777 " because variable is initialized");
778 DECL_WEAK (decl) = 0;
779 DECL_ATTRIBUTES (decl) = remove_attribute ("weakref",
780 DECL_ATTRIBUTES (decl));
782 process_common_attributes (vnode, decl);
786 /* Mark DECL as finalized. By finalizing the declaration, frontend instruct the
787 middle end to output the variable to asm file, if needed or externally
788 visible. */
790 void
791 varpool_node::finalize_decl (tree decl)
793 varpool_node *node = varpool_node::get_create (decl);
795 gcc_assert (TREE_STATIC (decl) || DECL_EXTERNAL (decl));
797 if (node->definition)
798 return;
799 /* Set definition first before calling notice_global_symbol so that
800 it is available to notice_global_symbol. */
801 node->definition = true;
802 notice_global_symbol (decl);
803 if (TREE_THIS_VOLATILE (decl) || DECL_PRESERVE_P (decl)
804 /* Traditionally we do not eliminate static variables when not
805 optimizing and when not doing toplevel reoder. */
806 || node->no_reorder
807 || ((!flag_toplevel_reorder
808 && !DECL_COMDAT (node->decl)
809 && !DECL_ARTIFICIAL (node->decl))))
810 node->force_output = true;
812 if (symtab->state == CONSTRUCTION
813 && (node->needed_p () || node->referred_to_p ()))
814 enqueue_node (node);
815 if (symtab->state >= IPA_SSA)
816 node->analyze ();
817 /* Some frontends produce various interface variables after compilation
818 finished. */
819 if (symtab->state == FINISHED
820 || (!flag_toplevel_reorder
821 && symtab->state == EXPANSION))
822 node->assemble_decl ();
824 if (DECL_INITIAL (decl))
825 chkp_register_var_initializer (decl);
828 /* EDGE is an polymorphic call. Mark all possible targets as reachable
829 and if there is only one target, perform trivial devirtualization.
830 REACHABLE_CALL_TARGETS collects target lists we already walked to
831 avoid udplicate work. */
833 static void
834 walk_polymorphic_call_targets (hash_set<void *> *reachable_call_targets,
835 cgraph_edge *edge)
837 unsigned int i;
838 void *cache_token;
839 bool final;
840 vec <cgraph_node *>targets
841 = possible_polymorphic_call_targets
842 (edge, &final, &cache_token);
844 if (!reachable_call_targets->add (cache_token))
846 if (symtab->dump_file)
847 dump_possible_polymorphic_call_targets
848 (symtab->dump_file, edge);
850 for (i = 0; i < targets.length (); i++)
852 /* Do not bother to mark virtual methods in anonymous namespace;
853 either we will find use of virtual table defining it, or it is
854 unused. */
855 if (targets[i]->definition
856 && TREE_CODE
857 (TREE_TYPE (targets[i]->decl))
858 == METHOD_TYPE
859 && !type_in_anonymous_namespace_p
860 (TYPE_METHOD_BASETYPE (TREE_TYPE (targets[i]->decl))))
861 enqueue_node (targets[i]);
865 /* Very trivial devirtualization; when the type is
866 final or anonymous (so we know all its derivation)
867 and there is only one possible virtual call target,
868 make the edge direct. */
869 if (final)
871 if (targets.length () <= 1 && dbg_cnt (devirt))
873 cgraph_node *target;
874 if (targets.length () == 1)
875 target = targets[0];
876 else
877 target = cgraph_node::create
878 (builtin_decl_implicit (BUILT_IN_UNREACHABLE));
880 if (symtab->dump_file)
882 fprintf (symtab->dump_file,
883 "Devirtualizing call: ");
884 print_gimple_stmt (symtab->dump_file,
885 edge->call_stmt, 0,
886 TDF_SLIM);
888 if (dump_enabled_p ())
890 location_t locus = gimple_location_safe (edge->call_stmt);
891 dump_printf_loc (MSG_OPTIMIZED_LOCATIONS, locus,
892 "devirtualizing call in %s to %s\n",
893 edge->caller->name (), target->name ());
896 edge->make_direct (target);
897 edge->redirect_call_stmt_to_callee ();
899 /* Call to __builtin_unreachable shouldn't be instrumented. */
900 if (!targets.length ())
901 gimple_call_set_with_bounds (edge->call_stmt, false);
903 if (symtab->dump_file)
905 fprintf (symtab->dump_file,
906 "Devirtualized as: ");
907 print_gimple_stmt (symtab->dump_file,
908 edge->call_stmt, 0,
909 TDF_SLIM);
915 /* Issue appropriate warnings for the global declaration DECL. */
917 static void
918 check_global_declaration (symtab_node *snode)
920 const char *decl_file;
921 tree decl = snode->decl;
923 /* Warn about any function declared static but not defined. We don't
924 warn about variables, because many programs have static variables
925 that exist only to get some text into the object file. */
926 if (TREE_CODE (decl) == FUNCTION_DECL
927 && DECL_INITIAL (decl) == 0
928 && DECL_EXTERNAL (decl)
929 && ! DECL_ARTIFICIAL (decl)
930 && ! TREE_NO_WARNING (decl)
931 && ! TREE_PUBLIC (decl)
932 && (warn_unused_function
933 || snode->referred_to_p (/*include_self=*/false)))
935 if (snode->referred_to_p (/*include_self=*/false))
936 pedwarn (input_location, 0, "%q+F used but never defined", decl);
937 else
938 warning (OPT_Wunused_function, "%q+F declared %<static%> but never defined", decl);
939 /* This symbol is effectively an "extern" declaration now. */
940 TREE_PUBLIC (decl) = 1;
943 /* Warn about static fns or vars defined but not used. */
944 if (((warn_unused_function && TREE_CODE (decl) == FUNCTION_DECL)
945 || (((warn_unused_variable && ! TREE_READONLY (decl))
946 || (warn_unused_const_variable > 0 && TREE_READONLY (decl)
947 && (warn_unused_const_variable == 2
948 || (main_input_filename != NULL
949 && (decl_file = DECL_SOURCE_FILE (decl)) != NULL
950 && filename_cmp (main_input_filename,
951 decl_file) == 0))))
952 && TREE_CODE (decl) == VAR_DECL))
953 && ! DECL_IN_SYSTEM_HEADER (decl)
954 && ! snode->referred_to_p (/*include_self=*/false)
955 /* This TREE_USED check is needed in addition to referred_to_p
956 above, because the `__unused__' attribute is not being
957 considered for referred_to_p. */
958 && ! TREE_USED (decl)
959 /* The TREE_USED bit for file-scope decls is kept in the identifier,
960 to handle multiple external decls in different scopes. */
961 && ! (DECL_NAME (decl) && TREE_USED (DECL_NAME (decl)))
962 && ! DECL_EXTERNAL (decl)
963 && ! DECL_ARTIFICIAL (decl)
964 && ! DECL_ABSTRACT_ORIGIN (decl)
965 && ! TREE_PUBLIC (decl)
966 /* A volatile variable might be used in some non-obvious way. */
967 && (! VAR_P (decl) || ! TREE_THIS_VOLATILE (decl))
968 /* Global register variables must be declared to reserve them. */
969 && ! (TREE_CODE (decl) == VAR_DECL && DECL_REGISTER (decl))
970 /* Global ctors and dtors are called by the runtime. */
971 && (TREE_CODE (decl) != FUNCTION_DECL
972 || (!DECL_STATIC_CONSTRUCTOR (decl)
973 && !DECL_STATIC_DESTRUCTOR (decl)))
974 /* Otherwise, ask the language. */
975 && lang_hooks.decls.warn_unused_global (decl))
976 warning_at (DECL_SOURCE_LOCATION (decl),
977 (TREE_CODE (decl) == FUNCTION_DECL)
978 ? OPT_Wunused_function
979 : (TREE_READONLY (decl)
980 ? OPT_Wunused_const_variable_
981 : OPT_Wunused_variable),
982 "%qD defined but not used", decl);
985 /* Discover all functions and variables that are trivially needed, analyze
986 them as well as all functions and variables referred by them */
987 static cgraph_node *first_analyzed;
988 static varpool_node *first_analyzed_var;
990 /* FIRST_TIME is set to TRUE for the first time we are called for a
991 translation unit from finalize_compilation_unit() or false
992 otherwise. */
994 static void
995 analyze_functions (bool first_time)
997 /* Keep track of already processed nodes when called multiple times for
998 intermodule optimization. */
999 cgraph_node *first_handled = first_analyzed;
1000 varpool_node *first_handled_var = first_analyzed_var;
1001 hash_set<void *> reachable_call_targets;
1003 symtab_node *node;
1004 symtab_node *next;
1005 int i;
1006 ipa_ref *ref;
1007 bool changed = true;
1008 location_t saved_loc = input_location;
1010 bitmap_obstack_initialize (NULL);
1011 symtab->state = CONSTRUCTION;
1012 input_location = UNKNOWN_LOCATION;
1014 /* Ugly, but the fixup can not happen at a time same body alias is created;
1015 C++ FE is confused about the COMDAT groups being right. */
1016 if (symtab->cpp_implicit_aliases_done)
1017 FOR_EACH_SYMBOL (node)
1018 if (node->cpp_implicit_alias)
1019 node->fixup_same_cpp_alias_visibility (node->get_alias_target ());
1020 build_type_inheritance_graph ();
1022 /* Analysis adds static variables that in turn adds references to new functions.
1023 So we need to iterate the process until it stabilize. */
1024 while (changed)
1026 changed = false;
1027 process_function_and_variable_attributes (first_analyzed,
1028 first_analyzed_var);
1030 /* First identify the trivially needed symbols. */
1031 for (node = symtab->first_symbol ();
1032 node != first_analyzed
1033 && node != first_analyzed_var; node = node->next)
1035 /* Convert COMDAT group designators to IDENTIFIER_NODEs. */
1036 node->get_comdat_group_id ();
1037 if (node->needed_p ())
1039 enqueue_node (node);
1040 if (!changed && symtab->dump_file)
1041 fprintf (symtab->dump_file, "Trivially needed symbols:");
1042 changed = true;
1043 if (symtab->dump_file)
1044 fprintf (symtab->dump_file, " %s", node->asm_name ());
1045 if (!changed && symtab->dump_file)
1046 fprintf (symtab->dump_file, "\n");
1048 if (node == first_analyzed
1049 || node == first_analyzed_var)
1050 break;
1052 symtab->process_new_functions ();
1053 first_analyzed_var = symtab->first_variable ();
1054 first_analyzed = symtab->first_function ();
1056 if (changed && symtab->dump_file)
1057 fprintf (symtab->dump_file, "\n");
1059 /* Lower representation, build callgraph edges and references for all trivially
1060 needed symbols and all symbols referred by them. */
1061 while (queued_nodes != &symtab_terminator)
1063 changed = true;
1064 node = queued_nodes;
1065 queued_nodes = (symtab_node *)queued_nodes->aux;
1066 cgraph_node *cnode = dyn_cast <cgraph_node *> (node);
1067 if (cnode && cnode->definition)
1069 cgraph_edge *edge;
1070 tree decl = cnode->decl;
1072 /* ??? It is possible to create extern inline function
1073 and later using weak alias attribute to kill its body.
1074 See gcc.c-torture/compile/20011119-1.c */
1075 if (!DECL_STRUCT_FUNCTION (decl)
1076 && !cnode->alias
1077 && !cnode->thunk.thunk_p
1078 && !cnode->dispatcher_function)
1080 cnode->reset ();
1081 cnode->local.redefined_extern_inline = true;
1082 continue;
1085 if (!cnode->analyzed)
1086 cnode->analyze ();
1088 for (edge = cnode->callees; edge; edge = edge->next_callee)
1089 if (edge->callee->definition
1090 && (!DECL_EXTERNAL (edge->callee->decl)
1091 /* When not optimizing, do not try to analyze extern
1092 inline functions. Doing so is pointless. */
1093 || opt_for_fn (edge->callee->decl, optimize)
1094 /* Weakrefs needs to be preserved. */
1095 || edge->callee->alias
1096 /* always_inline functions are inlined aven at -O0. */
1097 || lookup_attribute
1098 ("always_inline",
1099 DECL_ATTRIBUTES (edge->callee->decl))
1100 /* Multiversioned functions needs the dispatcher to
1101 be produced locally even for extern functions. */
1102 || edge->callee->function_version ()))
1103 enqueue_node (edge->callee);
1104 if (opt_for_fn (cnode->decl, optimize)
1105 && opt_for_fn (cnode->decl, flag_devirtualize))
1107 cgraph_edge *next;
1109 for (edge = cnode->indirect_calls; edge; edge = next)
1111 next = edge->next_callee;
1112 if (edge->indirect_info->polymorphic)
1113 walk_polymorphic_call_targets (&reachable_call_targets,
1114 edge);
1118 /* If decl is a clone of an abstract function,
1119 mark that abstract function so that we don't release its body.
1120 The DECL_INITIAL() of that abstract function declaration
1121 will be later needed to output debug info. */
1122 if (DECL_ABSTRACT_ORIGIN (decl))
1124 cgraph_node *origin_node
1125 = cgraph_node::get_create (DECL_ABSTRACT_ORIGIN (decl));
1126 origin_node->used_as_abstract_origin = true;
1129 else
1131 varpool_node *vnode = dyn_cast <varpool_node *> (node);
1132 if (vnode && vnode->definition && !vnode->analyzed)
1133 vnode->analyze ();
1136 if (node->same_comdat_group)
1138 symtab_node *next;
1139 for (next = node->same_comdat_group;
1140 next != node;
1141 next = next->same_comdat_group)
1142 if (!next->comdat_local_p ())
1143 enqueue_node (next);
1145 for (i = 0; node->iterate_reference (i, ref); i++)
1146 if (ref->referred->definition
1147 && (!DECL_EXTERNAL (ref->referred->decl)
1148 || ((TREE_CODE (ref->referred->decl) != FUNCTION_DECL
1149 && optimize)
1150 || (TREE_CODE (ref->referred->decl) == FUNCTION_DECL
1151 && opt_for_fn (ref->referred->decl, optimize))
1152 || node->alias
1153 || ref->referred->alias)))
1154 enqueue_node (ref->referred);
1155 symtab->process_new_functions ();
1158 update_type_inheritance_graph ();
1160 /* Collect entry points to the unit. */
1161 if (symtab->dump_file)
1163 fprintf (symtab->dump_file, "\n\nInitial ");
1164 symtab_node::dump_table (symtab->dump_file);
1167 if (first_time)
1169 symtab_node *snode;
1170 FOR_EACH_SYMBOL (snode)
1171 check_global_declaration (snode);
1174 if (symtab->dump_file)
1175 fprintf (symtab->dump_file, "\nRemoving unused symbols:");
1177 for (node = symtab->first_symbol ();
1178 node != first_handled
1179 && node != first_handled_var; node = next)
1181 next = node->next;
1182 if (!node->aux && !node->referred_to_p ())
1184 if (symtab->dump_file)
1185 fprintf (symtab->dump_file, " %s", node->name ());
1187 /* See if the debugger can use anything before the DECL
1188 passes away. Perhaps it can notice a DECL that is now a
1189 constant and can tag the early DIE with an appropriate
1190 attribute.
1192 Otherwise, this is the last chance the debug_hooks have
1193 at looking at optimized away DECLs, since
1194 late_global_decl will subsequently be called from the
1195 contents of the now pruned symbol table. */
1196 if (!decl_function_context (node->decl))
1197 (*debug_hooks->late_global_decl) (node->decl);
1199 node->remove ();
1200 continue;
1202 if (cgraph_node *cnode = dyn_cast <cgraph_node *> (node))
1204 tree decl = node->decl;
1206 if (cnode->definition && !gimple_has_body_p (decl)
1207 && !cnode->alias
1208 && !cnode->thunk.thunk_p)
1209 cnode->reset ();
1211 gcc_assert (!cnode->definition || cnode->thunk.thunk_p
1212 || cnode->alias
1213 || gimple_has_body_p (decl));
1214 gcc_assert (cnode->analyzed == cnode->definition);
1216 node->aux = NULL;
1218 for (;node; node = node->next)
1219 node->aux = NULL;
1220 first_analyzed = symtab->first_function ();
1221 first_analyzed_var = symtab->first_variable ();
1222 if (symtab->dump_file)
1224 fprintf (symtab->dump_file, "\n\nReclaimed ");
1225 symtab_node::dump_table (symtab->dump_file);
1227 bitmap_obstack_release (NULL);
1228 ggc_collect ();
1229 /* Initialize assembler name hash, in particular we want to trigger C++
1230 mangling and same body alias creation before we free DECL_ARGUMENTS
1231 used by it. */
1232 if (!seen_error ())
1233 symtab->symtab_initialize_asm_name_hash ();
1235 input_location = saved_loc;
1238 /* Translate the ugly representation of aliases as alias pairs into nice
1239 representation in callgraph. We don't handle all cases yet,
1240 unfortunately. */
1242 static void
1243 handle_alias_pairs (void)
1245 alias_pair *p;
1246 unsigned i;
1248 for (i = 0; alias_pairs && alias_pairs->iterate (i, &p);)
1250 symtab_node *target_node = symtab_node::get_for_asmname (p->target);
1252 /* Weakrefs with target not defined in current unit are easy to handle:
1253 they behave just as external variables except we need to note the
1254 alias flag to later output the weakref pseudo op into asm file. */
1255 if (!target_node
1256 && lookup_attribute ("weakref", DECL_ATTRIBUTES (p->decl)) != NULL)
1258 symtab_node *node = symtab_node::get (p->decl);
1259 if (node)
1261 node->alias_target = p->target;
1262 node->weakref = true;
1263 node->alias = true;
1264 node->transparent_alias = true;
1266 alias_pairs->unordered_remove (i);
1267 continue;
1269 else if (!target_node)
1271 error ("%q+D aliased to undefined symbol %qE", p->decl, p->target);
1272 symtab_node *node = symtab_node::get (p->decl);
1273 if (node)
1274 node->alias = false;
1275 alias_pairs->unordered_remove (i);
1276 continue;
1279 if (DECL_EXTERNAL (target_node->decl)
1280 /* We use local aliases for C++ thunks to force the tailcall
1281 to bind locally. This is a hack - to keep it working do
1282 the following (which is not strictly correct). */
1283 && (TREE_CODE (target_node->decl) != FUNCTION_DECL
1284 || ! DECL_VIRTUAL_P (target_node->decl))
1285 && ! lookup_attribute ("weakref", DECL_ATTRIBUTES (p->decl)))
1287 error ("%q+D aliased to external symbol %qE",
1288 p->decl, p->target);
1291 if (TREE_CODE (p->decl) == FUNCTION_DECL
1292 && target_node && is_a <cgraph_node *> (target_node))
1294 cgraph_node *src_node = cgraph_node::get (p->decl);
1295 if (src_node && src_node->definition)
1296 src_node->reset ();
1297 cgraph_node::create_alias (p->decl, target_node->decl);
1298 alias_pairs->unordered_remove (i);
1300 else if (TREE_CODE (p->decl) == VAR_DECL
1301 && target_node && is_a <varpool_node *> (target_node))
1303 varpool_node::create_alias (p->decl, target_node->decl);
1304 alias_pairs->unordered_remove (i);
1306 else
1308 error ("%q+D alias in between function and variable is not supported",
1309 p->decl);
1310 warning (0, "%q+D aliased declaration",
1311 target_node->decl);
1312 alias_pairs->unordered_remove (i);
1315 vec_free (alias_pairs);
1319 /* Figure out what functions we want to assemble. */
1321 static void
1322 mark_functions_to_output (void)
1324 bool check_same_comdat_groups = false;
1325 cgraph_node *node;
1327 if (flag_checking)
1328 FOR_EACH_FUNCTION (node)
1329 gcc_assert (!node->process);
1331 FOR_EACH_FUNCTION (node)
1333 tree decl = node->decl;
1335 gcc_assert (!node->process || node->same_comdat_group);
1336 if (node->process)
1337 continue;
1339 /* We need to output all local functions that are used and not
1340 always inlined, as well as those that are reachable from
1341 outside the current compilation unit. */
1342 if (node->analyzed
1343 && !node->thunk.thunk_p
1344 && !node->alias
1345 && !node->global.inlined_to
1346 && !TREE_ASM_WRITTEN (decl)
1347 && !DECL_EXTERNAL (decl))
1349 node->process = 1;
1350 if (node->same_comdat_group)
1352 cgraph_node *next;
1353 for (next = dyn_cast<cgraph_node *> (node->same_comdat_group);
1354 next != node;
1355 next = dyn_cast<cgraph_node *> (next->same_comdat_group))
1356 if (!next->thunk.thunk_p && !next->alias
1357 && !next->comdat_local_p ())
1358 next->process = 1;
1361 else if (node->same_comdat_group)
1363 if (flag_checking)
1364 check_same_comdat_groups = true;
1366 else
1368 /* We should've reclaimed all functions that are not needed. */
1369 if (flag_checking
1370 && !node->global.inlined_to
1371 && gimple_has_body_p (decl)
1372 /* FIXME: in ltrans unit when offline copy is outside partition but inline copies
1373 are inside partition, we can end up not removing the body since we no longer
1374 have analyzed node pointing to it. */
1375 && !node->in_other_partition
1376 && !node->alias
1377 && !node->clones
1378 && !DECL_EXTERNAL (decl))
1380 node->debug ();
1381 internal_error ("failed to reclaim unneeded function");
1383 gcc_assert (node->global.inlined_to
1384 || !gimple_has_body_p (decl)
1385 || node->in_other_partition
1386 || node->clones
1387 || DECL_ARTIFICIAL (decl)
1388 || DECL_EXTERNAL (decl));
1393 if (flag_checking && check_same_comdat_groups)
1394 FOR_EACH_FUNCTION (node)
1395 if (node->same_comdat_group && !node->process)
1397 tree decl = node->decl;
1398 if (!node->global.inlined_to
1399 && gimple_has_body_p (decl)
1400 /* FIXME: in an ltrans unit when the offline copy is outside a
1401 partition but inline copies are inside a partition, we can
1402 end up not removing the body since we no longer have an
1403 analyzed node pointing to it. */
1404 && !node->in_other_partition
1405 && !node->clones
1406 && !DECL_EXTERNAL (decl))
1408 node->debug ();
1409 internal_error ("failed to reclaim unneeded function in same "
1410 "comdat group");
1415 /* DECL is FUNCTION_DECL. Initialize datastructures so DECL is a function
1416 in lowered gimple form. IN_SSA is true if the gimple is in SSA.
1418 Set current_function_decl and cfun to newly constructed empty function body.
1419 return basic block in the function body. */
1421 basic_block
1422 init_lowered_empty_function (tree decl, bool in_ssa, gcov_type count)
1424 basic_block bb;
1425 edge e;
1427 current_function_decl = decl;
1428 allocate_struct_function (decl, false);
1429 gimple_register_cfg_hooks ();
1430 init_empty_tree_cfg ();
1431 init_tree_ssa (cfun);
1433 if (in_ssa)
1435 init_ssa_operands (cfun);
1436 cfun->gimple_df->in_ssa_p = true;
1437 cfun->curr_properties |= PROP_ssa;
1440 DECL_INITIAL (decl) = make_node (BLOCK);
1441 BLOCK_SUPERCONTEXT (DECL_INITIAL (decl)) = decl;
1443 DECL_SAVED_TREE (decl) = error_mark_node;
1444 cfun->curr_properties |= (PROP_gimple_lcf | PROP_gimple_leh | PROP_gimple_any
1445 | PROP_cfg | PROP_loops);
1447 set_loops_for_fn (cfun, ggc_cleared_alloc<loops> ());
1448 init_loops_structure (cfun, loops_for_fn (cfun), 1);
1449 loops_for_fn (cfun)->state |= LOOPS_MAY_HAVE_MULTIPLE_LATCHES;
1451 /* Create BB for body of the function and connect it properly. */
1452 ENTRY_BLOCK_PTR_FOR_FN (cfun)->count = count;
1453 ENTRY_BLOCK_PTR_FOR_FN (cfun)->frequency = REG_BR_PROB_BASE;
1454 EXIT_BLOCK_PTR_FOR_FN (cfun)->count = count;
1455 EXIT_BLOCK_PTR_FOR_FN (cfun)->frequency = REG_BR_PROB_BASE;
1456 bb = create_basic_block (NULL, ENTRY_BLOCK_PTR_FOR_FN (cfun));
1457 bb->count = count;
1458 bb->frequency = BB_FREQ_MAX;
1459 e = make_edge (ENTRY_BLOCK_PTR_FOR_FN (cfun), bb, EDGE_FALLTHRU);
1460 e->count = count;
1461 e->probability = REG_BR_PROB_BASE;
1462 e = make_edge (bb, EXIT_BLOCK_PTR_FOR_FN (cfun), 0);
1463 e->count = count;
1464 e->probability = REG_BR_PROB_BASE;
1465 add_bb_to_loop (bb, ENTRY_BLOCK_PTR_FOR_FN (cfun)->loop_father);
1467 return bb;
1470 /* Adjust PTR by the constant FIXED_OFFSET, and by the vtable
1471 offset indicated by VIRTUAL_OFFSET, if that is
1472 non-null. THIS_ADJUSTING is nonzero for a this adjusting thunk and
1473 zero for a result adjusting thunk. */
1475 tree
1476 thunk_adjust (gimple_stmt_iterator * bsi,
1477 tree ptr, bool this_adjusting,
1478 HOST_WIDE_INT fixed_offset, tree virtual_offset)
1480 gassign *stmt;
1481 tree ret;
1483 if (this_adjusting
1484 && fixed_offset != 0)
1486 stmt = gimple_build_assign
1487 (ptr, fold_build_pointer_plus_hwi_loc (input_location,
1488 ptr,
1489 fixed_offset));
1490 gsi_insert_after (bsi, stmt, GSI_NEW_STMT);
1493 /* If there's a virtual offset, look up that value in the vtable and
1494 adjust the pointer again. */
1495 if (virtual_offset)
1497 tree vtabletmp;
1498 tree vtabletmp2;
1499 tree vtabletmp3;
1501 if (!vtable_entry_type)
1503 tree vfunc_type = make_node (FUNCTION_TYPE);
1504 TREE_TYPE (vfunc_type) = integer_type_node;
1505 TYPE_ARG_TYPES (vfunc_type) = NULL_TREE;
1506 layout_type (vfunc_type);
1508 vtable_entry_type = build_pointer_type (vfunc_type);
1511 vtabletmp =
1512 create_tmp_reg (build_pointer_type
1513 (build_pointer_type (vtable_entry_type)), "vptr");
1515 /* The vptr is always at offset zero in the object. */
1516 stmt = gimple_build_assign (vtabletmp,
1517 build1 (NOP_EXPR, TREE_TYPE (vtabletmp),
1518 ptr));
1519 gsi_insert_after (bsi, stmt, GSI_NEW_STMT);
1521 /* Form the vtable address. */
1522 vtabletmp2 = create_tmp_reg (TREE_TYPE (TREE_TYPE (vtabletmp)),
1523 "vtableaddr");
1524 stmt = gimple_build_assign (vtabletmp2,
1525 build_simple_mem_ref (vtabletmp));
1526 gsi_insert_after (bsi, stmt, GSI_NEW_STMT);
1528 /* Find the entry with the vcall offset. */
1529 stmt = gimple_build_assign (vtabletmp2,
1530 fold_build_pointer_plus_loc (input_location,
1531 vtabletmp2,
1532 virtual_offset));
1533 gsi_insert_after (bsi, stmt, GSI_NEW_STMT);
1535 /* Get the offset itself. */
1536 vtabletmp3 = create_tmp_reg (TREE_TYPE (TREE_TYPE (vtabletmp2)),
1537 "vcalloffset");
1538 stmt = gimple_build_assign (vtabletmp3,
1539 build_simple_mem_ref (vtabletmp2));
1540 gsi_insert_after (bsi, stmt, GSI_NEW_STMT);
1542 /* Adjust the `this' pointer. */
1543 ptr = fold_build_pointer_plus_loc (input_location, ptr, vtabletmp3);
1544 ptr = force_gimple_operand_gsi (bsi, ptr, true, NULL_TREE, false,
1545 GSI_CONTINUE_LINKING);
1548 if (!this_adjusting
1549 && fixed_offset != 0)
1550 /* Adjust the pointer by the constant. */
1552 tree ptrtmp;
1554 if (TREE_CODE (ptr) == VAR_DECL)
1555 ptrtmp = ptr;
1556 else
1558 ptrtmp = create_tmp_reg (TREE_TYPE (ptr), "ptr");
1559 stmt = gimple_build_assign (ptrtmp, ptr);
1560 gsi_insert_after (bsi, stmt, GSI_NEW_STMT);
1562 ptr = fold_build_pointer_plus_hwi_loc (input_location,
1563 ptrtmp, fixed_offset);
1566 /* Emit the statement and gimplify the adjustment expression. */
1567 ret = create_tmp_reg (TREE_TYPE (ptr), "adjusted_this");
1568 stmt = gimple_build_assign (ret, ptr);
1569 gsi_insert_after (bsi, stmt, GSI_NEW_STMT);
1571 return ret;
1574 /* Expand thunk NODE to gimple if possible.
1575 When FORCE_GIMPLE_THUNK is true, gimple thunk is created and
1576 no assembler is produced.
1577 When OUTPUT_ASM_THUNK is true, also produce assembler for
1578 thunks that are not lowered. */
1580 bool
1581 cgraph_node::expand_thunk (bool output_asm_thunks, bool force_gimple_thunk)
1583 bool this_adjusting = thunk.this_adjusting;
1584 HOST_WIDE_INT fixed_offset = thunk.fixed_offset;
1585 HOST_WIDE_INT virtual_value = thunk.virtual_value;
1586 tree virtual_offset = NULL;
1587 tree alias = callees->callee->decl;
1588 tree thunk_fndecl = decl;
1589 tree a;
1591 /* Instrumentation thunk is the same function with
1592 a different signature. Never need to expand it. */
1593 if (thunk.add_pointer_bounds_args)
1594 return false;
1596 if (!force_gimple_thunk && this_adjusting
1597 && targetm.asm_out.can_output_mi_thunk (thunk_fndecl, fixed_offset,
1598 virtual_value, alias))
1600 const char *fnname;
1601 tree fn_block;
1602 tree restype = TREE_TYPE (TREE_TYPE (thunk_fndecl));
1604 if (!output_asm_thunks)
1606 analyzed = true;
1607 return false;
1610 if (in_lto_p)
1611 get_untransformed_body ();
1612 a = DECL_ARGUMENTS (thunk_fndecl);
1614 current_function_decl = thunk_fndecl;
1616 /* Ensure thunks are emitted in their correct sections. */
1617 resolve_unique_section (thunk_fndecl, 0,
1618 flag_function_sections);
1620 DECL_RESULT (thunk_fndecl)
1621 = build_decl (DECL_SOURCE_LOCATION (thunk_fndecl),
1622 RESULT_DECL, 0, restype);
1623 DECL_CONTEXT (DECL_RESULT (thunk_fndecl)) = thunk_fndecl;
1624 fnname = IDENTIFIER_POINTER (DECL_ASSEMBLER_NAME (thunk_fndecl));
1626 /* The back end expects DECL_INITIAL to contain a BLOCK, so we
1627 create one. */
1628 fn_block = make_node (BLOCK);
1629 BLOCK_VARS (fn_block) = a;
1630 DECL_INITIAL (thunk_fndecl) = fn_block;
1631 BLOCK_SUPERCONTEXT (fn_block) = thunk_fndecl;
1632 allocate_struct_function (thunk_fndecl, false);
1633 init_function_start (thunk_fndecl);
1634 cfun->is_thunk = 1;
1635 insn_locations_init ();
1636 set_curr_insn_location (DECL_SOURCE_LOCATION (thunk_fndecl));
1637 prologue_location = curr_insn_location ();
1638 assemble_start_function (thunk_fndecl, fnname);
1640 targetm.asm_out.output_mi_thunk (asm_out_file, thunk_fndecl,
1641 fixed_offset, virtual_value, alias);
1643 assemble_end_function (thunk_fndecl, fnname);
1644 insn_locations_finalize ();
1645 init_insn_lengths ();
1646 free_after_compilation (cfun);
1647 TREE_ASM_WRITTEN (thunk_fndecl) = 1;
1648 thunk.thunk_p = false;
1649 analyzed = false;
1651 else if (stdarg_p (TREE_TYPE (thunk_fndecl)))
1653 error ("generic thunk code fails for method %qD which uses %<...%>",
1654 thunk_fndecl);
1655 TREE_ASM_WRITTEN (thunk_fndecl) = 1;
1656 analyzed = true;
1657 return false;
1659 else
1661 tree restype;
1662 basic_block bb, then_bb, else_bb, return_bb;
1663 gimple_stmt_iterator bsi;
1664 int nargs = 0;
1665 tree arg;
1666 int i;
1667 tree resdecl;
1668 tree restmp = NULL;
1669 tree resbnd = NULL;
1671 gcall *call;
1672 greturn *ret;
1673 bool alias_is_noreturn = TREE_THIS_VOLATILE (alias);
1675 /* We may be called from expand_thunk that releses body except for
1676 DECL_ARGUMENTS. In this case force_gimple_thunk is true. */
1677 if (in_lto_p && !force_gimple_thunk)
1678 get_untransformed_body ();
1679 a = DECL_ARGUMENTS (thunk_fndecl);
1681 current_function_decl = thunk_fndecl;
1683 /* Ensure thunks are emitted in their correct sections. */
1684 resolve_unique_section (thunk_fndecl, 0,
1685 flag_function_sections);
1687 DECL_IGNORED_P (thunk_fndecl) = 1;
1688 bitmap_obstack_initialize (NULL);
1690 if (thunk.virtual_offset_p)
1691 virtual_offset = size_int (virtual_value);
1693 /* Build the return declaration for the function. */
1694 restype = TREE_TYPE (TREE_TYPE (thunk_fndecl));
1695 if (DECL_RESULT (thunk_fndecl) == NULL_TREE)
1697 resdecl = build_decl (input_location, RESULT_DECL, 0, restype);
1698 DECL_ARTIFICIAL (resdecl) = 1;
1699 DECL_IGNORED_P (resdecl) = 1;
1700 DECL_RESULT (thunk_fndecl) = resdecl;
1701 DECL_CONTEXT (DECL_RESULT (thunk_fndecl)) = thunk_fndecl;
1703 else
1704 resdecl = DECL_RESULT (thunk_fndecl);
1706 bb = then_bb = else_bb = return_bb
1707 = init_lowered_empty_function (thunk_fndecl, true, count);
1709 bsi = gsi_start_bb (bb);
1711 /* Build call to the function being thunked. */
1712 if (!VOID_TYPE_P (restype)
1713 && (!alias_is_noreturn
1714 || TREE_ADDRESSABLE (restype)
1715 || TREE_CODE (TYPE_SIZE_UNIT (restype)) != INTEGER_CST))
1717 if (DECL_BY_REFERENCE (resdecl))
1719 restmp = gimple_fold_indirect_ref (resdecl);
1720 if (!restmp)
1721 restmp = build2 (MEM_REF,
1722 TREE_TYPE (TREE_TYPE (DECL_RESULT (alias))),
1723 resdecl,
1724 build_int_cst (TREE_TYPE
1725 (DECL_RESULT (alias)), 0));
1727 else if (!is_gimple_reg_type (restype))
1729 if (aggregate_value_p (resdecl, TREE_TYPE (thunk_fndecl)))
1731 restmp = resdecl;
1733 if (TREE_CODE (restmp) == VAR_DECL)
1734 add_local_decl (cfun, restmp);
1735 BLOCK_VARS (DECL_INITIAL (current_function_decl)) = restmp;
1737 else
1738 restmp = create_tmp_var (restype, "retval");
1740 else
1741 restmp = create_tmp_reg (restype, "retval");
1744 for (arg = a; arg; arg = DECL_CHAIN (arg))
1745 nargs++;
1746 auto_vec<tree> vargs (nargs);
1747 i = 0;
1748 arg = a;
1749 if (this_adjusting)
1751 vargs.quick_push (thunk_adjust (&bsi, a, 1, fixed_offset,
1752 virtual_offset));
1753 arg = DECL_CHAIN (a);
1754 i = 1;
1757 if (nargs)
1758 for (; i < nargs; i++, arg = DECL_CHAIN (arg))
1760 tree tmp = arg;
1761 if (!is_gimple_val (arg))
1763 tmp = create_tmp_reg (TYPE_MAIN_VARIANT
1764 (TREE_TYPE (arg)), "arg");
1765 gimple *stmt = gimple_build_assign (tmp, arg);
1766 gsi_insert_after (&bsi, stmt, GSI_NEW_STMT);
1768 vargs.quick_push (tmp);
1770 call = gimple_build_call_vec (build_fold_addr_expr_loc (0, alias), vargs);
1771 callees->call_stmt = call;
1772 gimple_call_set_from_thunk (call, true);
1773 gimple_call_set_with_bounds (call, instrumentation_clone);
1775 /* Return slot optimization is always possible and in fact requred to
1776 return values with DECL_BY_REFERENCE. */
1777 if (aggregate_value_p (resdecl, TREE_TYPE (thunk_fndecl))
1778 && (!is_gimple_reg_type (TREE_TYPE (resdecl))
1779 || DECL_BY_REFERENCE (resdecl)))
1780 gimple_call_set_return_slot_opt (call, true);
1782 if (restmp)
1784 gimple_call_set_lhs (call, restmp);
1785 gcc_assert (useless_type_conversion_p (TREE_TYPE (restmp),
1786 TREE_TYPE (TREE_TYPE (alias))));
1788 gsi_insert_after (&bsi, call, GSI_NEW_STMT);
1789 if (!alias_is_noreturn)
1791 if (instrumentation_clone
1792 && !DECL_BY_REFERENCE (resdecl)
1793 && restmp
1794 && BOUNDED_P (restmp))
1796 resbnd = chkp_insert_retbnd_call (NULL, restmp, &bsi);
1797 create_edge (get_create (gimple_call_fndecl (gsi_stmt (bsi))),
1798 as_a <gcall *> (gsi_stmt (bsi)),
1799 callees->count, callees->frequency);
1802 if (restmp && !this_adjusting
1803 && (fixed_offset || virtual_offset))
1805 tree true_label = NULL_TREE;
1807 if (TREE_CODE (TREE_TYPE (restmp)) == POINTER_TYPE)
1809 gimple *stmt;
1810 edge e;
1811 /* If the return type is a pointer, we need to
1812 protect against NULL. We know there will be an
1813 adjustment, because that's why we're emitting a
1814 thunk. */
1815 then_bb = create_basic_block (NULL, bb);
1816 then_bb->count = count - count / 16;
1817 then_bb->frequency = BB_FREQ_MAX - BB_FREQ_MAX / 16;
1818 return_bb = create_basic_block (NULL, then_bb);
1819 return_bb->count = count;
1820 return_bb->frequency = BB_FREQ_MAX;
1821 else_bb = create_basic_block (NULL, else_bb);
1822 then_bb->count = count / 16;
1823 then_bb->frequency = BB_FREQ_MAX / 16;
1824 add_bb_to_loop (then_bb, bb->loop_father);
1825 add_bb_to_loop (return_bb, bb->loop_father);
1826 add_bb_to_loop (else_bb, bb->loop_father);
1827 remove_edge (single_succ_edge (bb));
1828 true_label = gimple_block_label (then_bb);
1829 stmt = gimple_build_cond (NE_EXPR, restmp,
1830 build_zero_cst (TREE_TYPE (restmp)),
1831 NULL_TREE, NULL_TREE);
1832 gsi_insert_after (&bsi, stmt, GSI_NEW_STMT);
1833 e = make_edge (bb, then_bb, EDGE_TRUE_VALUE);
1834 e->probability = REG_BR_PROB_BASE - REG_BR_PROB_BASE / 16;
1835 e->count = count - count / 16;
1836 e = make_edge (bb, else_bb, EDGE_FALSE_VALUE);
1837 e->probability = REG_BR_PROB_BASE / 16;
1838 e->count = count / 16;
1839 e = make_edge (return_bb, EXIT_BLOCK_PTR_FOR_FN (cfun), 0);
1840 e->probability = REG_BR_PROB_BASE;
1841 e->count = count;
1842 e = make_edge (then_bb, return_bb, EDGE_FALLTHRU);
1843 e->probability = REG_BR_PROB_BASE;
1844 e->count = count - count / 16;
1845 e = make_edge (else_bb, return_bb, EDGE_FALLTHRU);
1846 e->probability = REG_BR_PROB_BASE;
1847 e->count = count / 16;
1848 bsi = gsi_last_bb (then_bb);
1851 restmp = thunk_adjust (&bsi, restmp, /*this_adjusting=*/0,
1852 fixed_offset, virtual_offset);
1853 if (true_label)
1855 gimple *stmt;
1856 bsi = gsi_last_bb (else_bb);
1857 stmt = gimple_build_assign (restmp,
1858 build_zero_cst (TREE_TYPE (restmp)));
1859 gsi_insert_after (&bsi, stmt, GSI_NEW_STMT);
1860 bsi = gsi_last_bb (return_bb);
1863 else
1864 gimple_call_set_tail (call, true);
1866 /* Build return value. */
1867 if (!DECL_BY_REFERENCE (resdecl))
1868 ret = gimple_build_return (restmp);
1869 else
1870 ret = gimple_build_return (resdecl);
1871 gimple_return_set_retbnd (ret, resbnd);
1873 gsi_insert_after (&bsi, ret, GSI_NEW_STMT);
1875 else
1877 gimple_call_set_tail (call, true);
1878 remove_edge (single_succ_edge (bb));
1881 cfun->gimple_df->in_ssa_p = true;
1882 profile_status_for_fn (cfun)
1883 = count ? PROFILE_READ : PROFILE_GUESSED;
1884 /* FIXME: C++ FE should stop setting TREE_ASM_WRITTEN on thunks. */
1885 TREE_ASM_WRITTEN (thunk_fndecl) = false;
1886 delete_unreachable_blocks ();
1887 update_ssa (TODO_update_ssa);
1888 checking_verify_flow_info ();
1889 free_dominance_info (CDI_DOMINATORS);
1891 /* Since we want to emit the thunk, we explicitly mark its name as
1892 referenced. */
1893 thunk.thunk_p = false;
1894 lowered = true;
1895 bitmap_obstack_release (NULL);
1897 current_function_decl = NULL;
1898 set_cfun (NULL);
1899 return true;
1902 /* Assemble thunks and aliases associated to node. */
1904 void
1905 cgraph_node::assemble_thunks_and_aliases (void)
1907 cgraph_edge *e;
1908 ipa_ref *ref;
1910 for (e = callers; e;)
1911 if (e->caller->thunk.thunk_p
1912 && !e->caller->global.inlined_to
1913 && !e->caller->thunk.add_pointer_bounds_args)
1915 cgraph_node *thunk = e->caller;
1917 e = e->next_caller;
1918 thunk->expand_thunk (true, false);
1919 thunk->assemble_thunks_and_aliases ();
1921 else
1922 e = e->next_caller;
1924 FOR_EACH_ALIAS (this, ref)
1926 cgraph_node *alias = dyn_cast <cgraph_node *> (ref->referring);
1927 if (!alias->transparent_alias)
1929 bool saved_written = TREE_ASM_WRITTEN (decl);
1931 /* Force assemble_alias to really output the alias this time instead
1932 of buffering it in same alias pairs. */
1933 TREE_ASM_WRITTEN (decl) = 1;
1934 do_assemble_alias (alias->decl,
1935 DECL_ASSEMBLER_NAME (decl));
1936 alias->assemble_thunks_and_aliases ();
1937 TREE_ASM_WRITTEN (decl) = saved_written;
1942 /* Expand function specified by node. */
1944 void
1945 cgraph_node::expand (void)
1947 location_t saved_loc;
1949 /* We ought to not compile any inline clones. */
1950 gcc_assert (!global.inlined_to);
1952 announce_function (decl);
1953 process = 0;
1954 gcc_assert (lowered);
1955 get_untransformed_body ();
1957 /* Generate RTL for the body of DECL. */
1959 timevar_push (TV_REST_OF_COMPILATION);
1961 gcc_assert (symtab->global_info_ready);
1963 /* Initialize the default bitmap obstack. */
1964 bitmap_obstack_initialize (NULL);
1966 /* Initialize the RTL code for the function. */
1967 saved_loc = input_location;
1968 input_location = DECL_SOURCE_LOCATION (decl);
1970 gcc_assert (DECL_STRUCT_FUNCTION (decl));
1971 push_cfun (DECL_STRUCT_FUNCTION (decl));
1972 init_function_start (decl);
1974 gimple_register_cfg_hooks ();
1976 bitmap_obstack_initialize (&reg_obstack); /* FIXME, only at RTL generation*/
1978 execute_all_ipa_transforms ();
1980 /* Perform all tree transforms and optimizations. */
1982 /* Signal the start of passes. */
1983 invoke_plugin_callbacks (PLUGIN_ALL_PASSES_START, NULL);
1985 execute_pass_list (cfun, g->get_passes ()->all_passes);
1987 /* Signal the end of passes. */
1988 invoke_plugin_callbacks (PLUGIN_ALL_PASSES_END, NULL);
1990 bitmap_obstack_release (&reg_obstack);
1992 /* Release the default bitmap obstack. */
1993 bitmap_obstack_release (NULL);
1995 /* If requested, warn about function definitions where the function will
1996 return a value (usually of some struct or union type) which itself will
1997 take up a lot of stack space. */
1998 if (warn_larger_than && !DECL_EXTERNAL (decl) && TREE_TYPE (decl))
2000 tree ret_type = TREE_TYPE (TREE_TYPE (decl));
2002 if (ret_type && TYPE_SIZE_UNIT (ret_type)
2003 && TREE_CODE (TYPE_SIZE_UNIT (ret_type)) == INTEGER_CST
2004 && 0 < compare_tree_int (TYPE_SIZE_UNIT (ret_type),
2005 larger_than_size))
2007 unsigned int size_as_int
2008 = TREE_INT_CST_LOW (TYPE_SIZE_UNIT (ret_type));
2010 if (compare_tree_int (TYPE_SIZE_UNIT (ret_type), size_as_int) == 0)
2011 warning (OPT_Wlarger_than_, "size of return value of %q+D is %u bytes",
2012 decl, size_as_int);
2013 else
2014 warning (OPT_Wlarger_than_, "size of return value of %q+D is larger than %wd bytes",
2015 decl, larger_than_size);
2019 gimple_set_body (decl, NULL);
2020 if (DECL_STRUCT_FUNCTION (decl) == 0
2021 && !cgraph_node::get (decl)->origin)
2023 /* Stop pointing to the local nodes about to be freed.
2024 But DECL_INITIAL must remain nonzero so we know this
2025 was an actual function definition.
2026 For a nested function, this is done in c_pop_function_context.
2027 If rest_of_compilation set this to 0, leave it 0. */
2028 if (DECL_INITIAL (decl) != 0)
2029 DECL_INITIAL (decl) = error_mark_node;
2032 input_location = saved_loc;
2034 ggc_collect ();
2035 timevar_pop (TV_REST_OF_COMPILATION);
2037 /* Make sure that BE didn't give up on compiling. */
2038 gcc_assert (TREE_ASM_WRITTEN (decl));
2039 if (cfun)
2040 pop_cfun ();
2042 /* It would make a lot more sense to output thunks before function body to get more
2043 forward and lest backwarding jumps. This however would need solving problem
2044 with comdats. See PR48668. Also aliases must come after function itself to
2045 make one pass assemblers, like one on AIX, happy. See PR 50689.
2046 FIXME: Perhaps thunks should be move before function IFF they are not in comdat
2047 groups. */
2048 assemble_thunks_and_aliases ();
2049 release_body ();
2050 /* Eliminate all call edges. This is important so the GIMPLE_CALL no longer
2051 points to the dead function body. */
2052 remove_callees ();
2053 remove_all_references ();
2056 /* Node comparer that is responsible for the order that corresponds
2057 to time when a function was launched for the first time. */
2059 static int
2060 node_cmp (const void *pa, const void *pb)
2062 const cgraph_node *a = *(const cgraph_node * const *) pa;
2063 const cgraph_node *b = *(const cgraph_node * const *) pb;
2065 /* Functions with time profile must be before these without profile. */
2066 if (!a->tp_first_run || !b->tp_first_run)
2067 return a->tp_first_run - b->tp_first_run;
2069 return a->tp_first_run != b->tp_first_run
2070 ? b->tp_first_run - a->tp_first_run
2071 : b->order - a->order;
2074 /* Expand all functions that must be output.
2076 Attempt to topologically sort the nodes so function is output when
2077 all called functions are already assembled to allow data to be
2078 propagated across the callgraph. Use a stack to get smaller distance
2079 between a function and its callees (later we may choose to use a more
2080 sophisticated algorithm for function reordering; we will likely want
2081 to use subsections to make the output functions appear in top-down
2082 order). */
2084 static void
2085 expand_all_functions (void)
2087 cgraph_node *node;
2088 cgraph_node **order = XCNEWVEC (cgraph_node *,
2089 symtab->cgraph_count);
2090 unsigned int expanded_func_count = 0, profiled_func_count = 0;
2091 int order_pos, new_order_pos = 0;
2092 int i;
2094 order_pos = ipa_reverse_postorder (order);
2095 gcc_assert (order_pos == symtab->cgraph_count);
2097 /* Garbage collector may remove inline clones we eliminate during
2098 optimization. So we must be sure to not reference them. */
2099 for (i = 0; i < order_pos; i++)
2100 if (order[i]->process)
2101 order[new_order_pos++] = order[i];
2103 if (flag_profile_reorder_functions)
2104 qsort (order, new_order_pos, sizeof (cgraph_node *), node_cmp);
2106 for (i = new_order_pos - 1; i >= 0; i--)
2108 node = order[i];
2110 if (node->process)
2112 expanded_func_count++;
2113 if(node->tp_first_run)
2114 profiled_func_count++;
2116 if (symtab->dump_file)
2117 fprintf (symtab->dump_file,
2118 "Time profile order in expand_all_functions:%s:%d\n",
2119 node->asm_name (), node->tp_first_run);
2120 node->process = 0;
2121 node->expand ();
2125 if (dump_file)
2126 fprintf (dump_file, "Expanded functions with time profile (%s):%u/%u\n",
2127 main_input_filename, profiled_func_count, expanded_func_count);
2129 if (symtab->dump_file && flag_profile_reorder_functions)
2130 fprintf (symtab->dump_file, "Expanded functions with time profile:%u/%u\n",
2131 profiled_func_count, expanded_func_count);
2133 symtab->process_new_functions ();
2134 free_gimplify_stack ();
2136 free (order);
2139 /* This is used to sort the node types by the cgraph order number. */
2141 enum cgraph_order_sort_kind
2143 ORDER_UNDEFINED = 0,
2144 ORDER_FUNCTION,
2145 ORDER_VAR,
2146 ORDER_VAR_UNDEF,
2147 ORDER_ASM
2150 struct cgraph_order_sort
2152 enum cgraph_order_sort_kind kind;
2153 union
2155 cgraph_node *f;
2156 varpool_node *v;
2157 asm_node *a;
2158 } u;
2161 /* Output all functions, variables, and asm statements in the order
2162 according to their order fields, which is the order in which they
2163 appeared in the file. This implements -fno-toplevel-reorder. In
2164 this mode we may output functions and variables which don't really
2165 need to be output.
2166 When NO_REORDER is true only do this for symbols marked no reorder. */
2168 static void
2169 output_in_order (bool no_reorder)
2171 int max;
2172 cgraph_order_sort *nodes;
2173 int i;
2174 cgraph_node *pf;
2175 varpool_node *pv;
2176 asm_node *pa;
2177 max = symtab->order;
2178 nodes = XCNEWVEC (cgraph_order_sort, max);
2180 FOR_EACH_DEFINED_FUNCTION (pf)
2182 if (pf->process && !pf->thunk.thunk_p && !pf->alias)
2184 if (no_reorder && !pf->no_reorder)
2185 continue;
2186 i = pf->order;
2187 gcc_assert (nodes[i].kind == ORDER_UNDEFINED);
2188 nodes[i].kind = ORDER_FUNCTION;
2189 nodes[i].u.f = pf;
2193 /* There is a similar loop in symbol_table::output_variables.
2194 Please keep them in sync. */
2195 FOR_EACH_VARIABLE (pv)
2197 if (no_reorder && !pv->no_reorder)
2198 continue;
2199 if (DECL_HARD_REGISTER (pv->decl)
2200 || DECL_HAS_VALUE_EXPR_P (pv->decl))
2201 continue;
2202 i = pv->order;
2203 gcc_assert (nodes[i].kind == ORDER_UNDEFINED);
2204 nodes[i].kind = pv->definition ? ORDER_VAR : ORDER_VAR_UNDEF;
2205 nodes[i].u.v = pv;
2208 for (pa = symtab->first_asm_symbol (); pa; pa = pa->next)
2210 i = pa->order;
2211 gcc_assert (nodes[i].kind == ORDER_UNDEFINED);
2212 nodes[i].kind = ORDER_ASM;
2213 nodes[i].u.a = pa;
2216 /* In toplevel reorder mode we output all statics; mark them as needed. */
2218 for (i = 0; i < max; ++i)
2219 if (nodes[i].kind == ORDER_VAR)
2220 nodes[i].u.v->finalize_named_section_flags ();
2222 for (i = 0; i < max; ++i)
2224 switch (nodes[i].kind)
2226 case ORDER_FUNCTION:
2227 nodes[i].u.f->process = 0;
2228 nodes[i].u.f->expand ();
2229 break;
2231 case ORDER_VAR:
2232 nodes[i].u.v->assemble_decl ();
2233 break;
2235 case ORDER_VAR_UNDEF:
2236 assemble_undefined_decl (nodes[i].u.v->decl);
2237 break;
2239 case ORDER_ASM:
2240 assemble_asm (nodes[i].u.a->asm_str);
2241 break;
2243 case ORDER_UNDEFINED:
2244 break;
2246 default:
2247 gcc_unreachable ();
2251 symtab->clear_asm_symbols ();
2253 free (nodes);
2256 static void
2257 ipa_passes (void)
2259 gcc::pass_manager *passes = g->get_passes ();
2261 set_cfun (NULL);
2262 current_function_decl = NULL;
2263 gimple_register_cfg_hooks ();
2264 bitmap_obstack_initialize (NULL);
2266 invoke_plugin_callbacks (PLUGIN_ALL_IPA_PASSES_START, NULL);
2268 if (!in_lto_p)
2270 execute_ipa_pass_list (passes->all_small_ipa_passes);
2271 if (seen_error ())
2272 return;
2275 /* This extra symtab_remove_unreachable_nodes pass tends to catch some
2276 devirtualization and other changes where removal iterate. */
2277 symtab->remove_unreachable_nodes (symtab->dump_file);
2279 /* If pass_all_early_optimizations was not scheduled, the state of
2280 the cgraph will not be properly updated. Update it now. */
2281 if (symtab->state < IPA_SSA)
2282 symtab->state = IPA_SSA;
2284 if (!in_lto_p)
2286 /* Generate coverage variables and constructors. */
2287 coverage_finish ();
2289 /* Process new functions added. */
2290 set_cfun (NULL);
2291 current_function_decl = NULL;
2292 symtab->process_new_functions ();
2294 execute_ipa_summary_passes
2295 ((ipa_opt_pass_d *) passes->all_regular_ipa_passes);
2298 /* Some targets need to handle LTO assembler output specially. */
2299 if (flag_generate_lto || flag_generate_offload)
2300 targetm.asm_out.lto_start ();
2302 if (!in_lto_p)
2304 if (g->have_offload)
2306 section_name_prefix = OFFLOAD_SECTION_NAME_PREFIX;
2307 lto_stream_offload_p = true;
2308 ipa_write_summaries ();
2309 lto_stream_offload_p = false;
2311 if (flag_lto)
2313 section_name_prefix = LTO_SECTION_NAME_PREFIX;
2314 lto_stream_offload_p = false;
2315 ipa_write_summaries ();
2319 if (flag_generate_lto || flag_generate_offload)
2320 targetm.asm_out.lto_end ();
2322 if (!flag_ltrans && (in_lto_p || !flag_lto || flag_fat_lto_objects))
2323 execute_ipa_pass_list (passes->all_regular_ipa_passes);
2324 invoke_plugin_callbacks (PLUGIN_ALL_IPA_PASSES_END, NULL);
2326 bitmap_obstack_release (NULL);
2330 /* Return string alias is alias of. */
2332 static tree
2333 get_alias_symbol (tree decl)
2335 tree alias = lookup_attribute ("alias", DECL_ATTRIBUTES (decl));
2336 return get_identifier (TREE_STRING_POINTER
2337 (TREE_VALUE (TREE_VALUE (alias))));
2341 /* Weakrefs may be associated to external decls and thus not output
2342 at expansion time. Emit all necessary aliases. */
2344 void
2345 symbol_table::output_weakrefs (void)
2347 symtab_node *node;
2348 cgraph_node *cnode;
2349 FOR_EACH_SYMBOL (node)
2350 if (node->alias
2351 && !TREE_ASM_WRITTEN (node->decl)
2352 && (!(cnode = dyn_cast <cgraph_node *> (node))
2353 || !cnode->instrumented_version
2354 || !TREE_ASM_WRITTEN (cnode->instrumented_version->decl))
2355 && node->weakref)
2357 tree target;
2359 /* Weakrefs are special by not requiring target definition in current
2360 compilation unit. It is thus bit hard to work out what we want to
2361 alias.
2362 When alias target is defined, we need to fetch it from symtab reference,
2363 otherwise it is pointed to by alias_target. */
2364 if (node->alias_target)
2365 target = (DECL_P (node->alias_target)
2366 ? DECL_ASSEMBLER_NAME (node->alias_target)
2367 : node->alias_target);
2368 else if (node->analyzed)
2369 target = DECL_ASSEMBLER_NAME (node->get_alias_target ()->decl);
2370 else
2372 gcc_unreachable ();
2373 target = get_alias_symbol (node->decl);
2375 do_assemble_alias (node->decl, target);
2379 /* Perform simple optimizations based on callgraph. */
2381 void
2382 symbol_table::compile (void)
2384 if (seen_error ())
2385 return;
2387 symtab_node::checking_verify_symtab_nodes ();
2389 timevar_push (TV_CGRAPHOPT);
2390 if (pre_ipa_mem_report)
2392 fprintf (stderr, "Memory consumption before IPA\n");
2393 dump_memory_report (false);
2395 if (!quiet_flag)
2396 fprintf (stderr, "Performing interprocedural optimizations\n");
2397 state = IPA;
2399 /* Offloading requires LTO infrastructure. */
2400 if (!in_lto_p && g->have_offload)
2401 flag_generate_offload = 1;
2403 /* If LTO is enabled, initialize the streamer hooks needed by GIMPLE. */
2404 if (flag_generate_lto || flag_generate_offload)
2405 lto_streamer_hooks_init ();
2407 /* Don't run the IPA passes if there was any error or sorry messages. */
2408 if (!seen_error ())
2409 ipa_passes ();
2411 /* Do nothing else if any IPA pass found errors or if we are just streaming LTO. */
2412 if (seen_error ()
2413 || (!in_lto_p && flag_lto && !flag_fat_lto_objects))
2415 timevar_pop (TV_CGRAPHOPT);
2416 return;
2419 global_info_ready = true;
2420 if (dump_file)
2422 fprintf (dump_file, "Optimized ");
2423 symtab_node:: dump_table (dump_file);
2425 if (post_ipa_mem_report)
2427 fprintf (stderr, "Memory consumption after IPA\n");
2428 dump_memory_report (false);
2430 timevar_pop (TV_CGRAPHOPT);
2432 /* Output everything. */
2433 (*debug_hooks->assembly_start) ();
2434 if (!quiet_flag)
2435 fprintf (stderr, "Assembling functions:\n");
2436 symtab_node::checking_verify_symtab_nodes ();
2438 materialize_all_clones ();
2439 bitmap_obstack_initialize (NULL);
2440 execute_ipa_pass_list (g->get_passes ()->all_late_ipa_passes);
2441 bitmap_obstack_release (NULL);
2442 mark_functions_to_output ();
2444 /* When weakref support is missing, we autmatically translate all
2445 references to NODE to references to its ultimate alias target.
2446 The renaming mechanizm uses flag IDENTIFIER_TRANSPARENT_ALIAS and
2447 TREE_CHAIN.
2449 Set up this mapping before we output any assembler but once we are sure
2450 that all symbol renaming is done.
2452 FIXME: All this uglyness can go away if we just do renaming at gimple
2453 level by physically rewritting the IL. At the moment we can only redirect
2454 calls, so we need infrastructure for renaming references as well. */
2455 #ifndef ASM_OUTPUT_WEAKREF
2456 symtab_node *node;
2458 FOR_EACH_SYMBOL (node)
2459 if (node->alias
2460 && lookup_attribute ("weakref", DECL_ATTRIBUTES (node->decl)))
2462 IDENTIFIER_TRANSPARENT_ALIAS
2463 (DECL_ASSEMBLER_NAME (node->decl)) = 1;
2464 TREE_CHAIN (DECL_ASSEMBLER_NAME (node->decl))
2465 = (node->alias_target ? node->alias_target
2466 : DECL_ASSEMBLER_NAME (node->get_alias_target ()->decl));
2468 #endif
2470 state = EXPANSION;
2472 if (!flag_toplevel_reorder)
2473 output_in_order (false);
2474 else
2476 /* Output first asm statements and anything ordered. The process
2477 flag is cleared for these nodes, so we skip them later. */
2478 output_in_order (true);
2479 expand_all_functions ();
2480 output_variables ();
2483 process_new_functions ();
2484 state = FINISHED;
2485 output_weakrefs ();
2487 if (dump_file)
2489 fprintf (dump_file, "\nFinal ");
2490 symtab_node::dump_table (dump_file);
2492 if (!flag_checking)
2493 return;
2494 symtab_node::verify_symtab_nodes ();
2495 /* Double check that all inline clones are gone and that all
2496 function bodies have been released from memory. */
2497 if (!seen_error ())
2499 cgraph_node *node;
2500 bool error_found = false;
2502 FOR_EACH_DEFINED_FUNCTION (node)
2503 if (node->global.inlined_to
2504 || gimple_has_body_p (node->decl))
2506 error_found = true;
2507 node->debug ();
2509 if (error_found)
2510 internal_error ("nodes with unreleased memory found");
2515 /* Analyze the whole compilation unit once it is parsed completely. */
2517 void
2518 symbol_table::finalize_compilation_unit (void)
2520 timevar_push (TV_CGRAPH);
2522 /* If we're here there's no current function anymore. Some frontends
2523 are lazy in clearing these. */
2524 current_function_decl = NULL;
2525 set_cfun (NULL);
2527 /* Do not skip analyzing the functions if there were errors, we
2528 miss diagnostics for following functions otherwise. */
2530 /* Emit size functions we didn't inline. */
2531 finalize_size_functions ();
2533 /* Mark alias targets necessary and emit diagnostics. */
2534 handle_alias_pairs ();
2536 if (!quiet_flag)
2538 fprintf (stderr, "\nAnalyzing compilation unit\n");
2539 fflush (stderr);
2542 if (flag_dump_passes)
2543 dump_passes ();
2545 /* Gimplify and lower all functions, compute reachability and
2546 remove unreachable nodes. */
2547 analyze_functions (/*first_time=*/true);
2549 /* Mark alias targets necessary and emit diagnostics. */
2550 handle_alias_pairs ();
2552 /* Gimplify and lower thunks. */
2553 analyze_functions (/*first_time=*/false);
2555 if (!seen_error ())
2557 /* Emit early debug for reachable functions, and by consequence,
2558 locally scoped symbols. */
2559 struct cgraph_node *cnode;
2560 FOR_EACH_FUNCTION_WITH_GIMPLE_BODY (cnode)
2561 (*debug_hooks->early_global_decl) (cnode->decl);
2563 /* Clean up anything that needs cleaning up after initial debug
2564 generation. */
2565 (*debug_hooks->early_finish) ();
2568 /* Finally drive the pass manager. */
2569 compile ();
2571 timevar_pop (TV_CGRAPH);
2574 /* Reset all state within cgraphunit.c so that we can rerun the compiler
2575 within the same process. For use by toplev::finalize. */
2577 void
2578 cgraphunit_c_finalize (void)
2580 gcc_assert (cgraph_new_nodes.length () == 0);
2581 cgraph_new_nodes.truncate (0);
2583 vtable_entry_type = NULL;
2584 queued_nodes = &symtab_terminator;
2586 first_analyzed = NULL;
2587 first_analyzed_var = NULL;
2590 /* Creates a wrapper from cgraph_node to TARGET node. Thunk is used for this
2591 kind of wrapper method. */
2593 void
2594 cgraph_node::create_wrapper (cgraph_node *target)
2596 /* Preserve DECL_RESULT so we get right by reference flag. */
2597 tree decl_result = DECL_RESULT (decl);
2599 /* Remove the function's body but keep arguments to be reused
2600 for thunk. */
2601 release_body (true);
2602 reset ();
2604 DECL_UNINLINABLE (decl) = false;
2605 DECL_RESULT (decl) = decl_result;
2606 DECL_INITIAL (decl) = NULL;
2607 allocate_struct_function (decl, false);
2608 set_cfun (NULL);
2610 /* Turn alias into thunk and expand it into GIMPLE representation. */
2611 definition = true;
2613 memset (&thunk, 0, sizeof (cgraph_thunk_info));
2614 thunk.thunk_p = true;
2615 create_edge (target, NULL, count, CGRAPH_FREQ_BASE);
2616 callees->can_throw_external = !TREE_NOTHROW (target->decl);
2618 tree arguments = DECL_ARGUMENTS (decl);
2620 while (arguments)
2622 TREE_ADDRESSABLE (arguments) = false;
2623 arguments = TREE_CHAIN (arguments);
2626 expand_thunk (false, true);
2628 /* Inline summary set-up. */
2629 analyze ();
2630 inline_analyze_function (this);
2633 #include "gt-cgraphunit.h"