2006-08-07 Andrew John Hughes <gnu_andrew@member.fsf.org>
[official-gcc.git] / gcc / tree-ssa-ccp.c
blob53050e68429ab9a75177e209feb5487a542c41cd
1 /* Conditional constant propagation pass for the GNU compiler.
2 Copyright (C) 2000, 2001, 2002, 2003, 2004, 2005, 2006
3 Free Software Foundation, Inc.
4 Adapted from original RTL SSA-CCP by Daniel Berlin <dberlin@dberlin.org>
5 Adapted to GIMPLE trees by Diego Novillo <dnovillo@redhat.com>
7 This file is part of GCC.
9 GCC is free software; you can redistribute it and/or modify it
10 under the terms of the GNU General Public License as published by the
11 Free Software Foundation; either version 2, or (at your option) any
12 later version.
14 GCC is distributed in the hope that it will be useful, but WITHOUT
15 ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
16 FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
17 for more details.
19 You should have received a copy of the GNU General Public License
20 along with GCC; see the file COPYING. If not, write to the Free
21 Software Foundation, 51 Franklin Street, Fifth Floor, Boston, MA
22 02110-1301, USA. */
24 /* Conditional constant propagation (CCP) is based on the SSA
25 propagation engine (tree-ssa-propagate.c). Constant assignments of
26 the form VAR = CST are propagated from the assignments into uses of
27 VAR, which in turn may generate new constants. The simulation uses
28 a four level lattice to keep track of constant values associated
29 with SSA names. Given an SSA name V_i, it may take one of the
30 following values:
32 UNINITIALIZED -> This is the default starting value. V_i
33 has not been processed yet.
35 UNDEFINED -> V_i is a local variable whose definition
36 has not been processed yet. Therefore we
37 don't yet know if its value is a constant
38 or not.
40 CONSTANT -> V_i has been found to hold a constant
41 value C.
43 VARYING -> V_i cannot take a constant value, or if it
44 does, it is not possible to determine it
45 at compile time.
47 The core of SSA-CCP is in ccp_visit_stmt and ccp_visit_phi_node:
49 1- In ccp_visit_stmt, we are interested in assignments whose RHS
50 evaluates into a constant and conditional jumps whose predicate
51 evaluates into a boolean true or false. When an assignment of
52 the form V_i = CONST is found, V_i's lattice value is set to
53 CONSTANT and CONST is associated with it. This causes the
54 propagation engine to add all the SSA edges coming out the
55 assignment into the worklists, so that statements that use V_i
56 can be visited.
58 If the statement is a conditional with a constant predicate, we
59 mark the outgoing edges as executable or not executable
60 depending on the predicate's value. This is then used when
61 visiting PHI nodes to know when a PHI argument can be ignored.
64 2- In ccp_visit_phi_node, if all the PHI arguments evaluate to the
65 same constant C, then the LHS of the PHI is set to C. This
66 evaluation is known as the "meet operation". Since one of the
67 goals of this evaluation is to optimistically return constant
68 values as often as possible, it uses two main short cuts:
70 - If an argument is flowing in through a non-executable edge, it
71 is ignored. This is useful in cases like this:
73 if (PRED)
74 a_9 = 3;
75 else
76 a_10 = 100;
77 a_11 = PHI (a_9, a_10)
79 If PRED is known to always evaluate to false, then we can
80 assume that a_11 will always take its value from a_10, meaning
81 that instead of consider it VARYING (a_9 and a_10 have
82 different values), we can consider it CONSTANT 100.
84 - If an argument has an UNDEFINED value, then it does not affect
85 the outcome of the meet operation. If a variable V_i has an
86 UNDEFINED value, it means that either its defining statement
87 hasn't been visited yet or V_i has no defining statement, in
88 which case the original symbol 'V' is being used
89 uninitialized. Since 'V' is a local variable, the compiler
90 may assume any initial value for it.
93 After propagation, every variable V_i that ends up with a lattice
94 value of CONSTANT will have the associated constant value in the
95 array CONST_VAL[i].VALUE. That is fed into substitute_and_fold for
96 final substitution and folding.
99 Constant propagation in stores and loads (STORE-CCP)
100 ----------------------------------------------------
102 While CCP has all the logic to propagate constants in GIMPLE
103 registers, it is missing the ability to associate constants with
104 stores and loads (i.e., pointer dereferences, structures and
105 global/aliased variables). We don't keep loads and stores in
106 SSA, but we do build a factored use-def web for them (in the
107 virtual operands).
109 For instance, consider the following code fragment:
111 struct A a;
112 const int B = 42;
114 void foo (int i)
116 if (i > 10)
117 a.a = 42;
118 else
120 a.b = 21;
121 a.a = a.b + 21;
124 if (a.a != B)
125 never_executed ();
128 We should be able to deduce that the predicate 'a.a != B' is always
129 false. To achieve this, we associate constant values to the SSA
130 names in the V_MAY_DEF and V_MUST_DEF operands for each store.
131 Additionally, since we also glob partial loads/stores with the base
132 symbol, we also keep track of the memory reference where the
133 constant value was stored (in the MEM_REF field of PROP_VALUE_T).
134 For instance,
136 # a_5 = V_MAY_DEF <a_4>
137 a.a = 2;
139 # VUSE <a_5>
140 x_3 = a.b;
142 In the example above, CCP will associate value '2' with 'a_5', but
143 it would be wrong to replace the load from 'a.b' with '2', because
144 '2' had been stored into a.a.
146 To support STORE-CCP, it is necessary to add a new value to the
147 constant propagation lattice. When evaluating a load for a memory
148 reference we can no longer assume a value of UNDEFINED if we
149 haven't seen a preceding store to the same memory location.
150 Consider, for instance global variables:
152 int A;
154 foo (int i)
156 if (i_3 > 10)
157 A_4 = 3;
158 # A_5 = PHI (A_4, A_2);
160 # VUSE <A_5>
161 A.0_6 = A;
163 return A.0_6;
166 The value of A_2 cannot be assumed to be UNDEFINED, as it may have
167 been defined outside of foo. If we were to assume it UNDEFINED, we
168 would erroneously optimize the above into 'return 3;'. Therefore,
169 when doing STORE-CCP, we introduce a fifth lattice value
170 (UNKNOWN_VAL), which overrides any other value when computing the
171 meet operation in PHI nodes.
173 Though STORE-CCP is not too expensive, it does have to do more work
174 than regular CCP, so it is only enabled at -O2. Both regular CCP
175 and STORE-CCP use the exact same algorithm. The only distinction
176 is that when doing STORE-CCP, the boolean variable DO_STORE_CCP is
177 set to true. This affects the evaluation of statements and PHI
178 nodes.
180 References:
182 Constant propagation with conditional branches,
183 Wegman and Zadeck, ACM TOPLAS 13(2):181-210.
185 Building an Optimizing Compiler,
186 Robert Morgan, Butterworth-Heinemann, 1998, Section 8.9.
188 Advanced Compiler Design and Implementation,
189 Steven Muchnick, Morgan Kaufmann, 1997, Section 12.6 */
191 #include "config.h"
192 #include "system.h"
193 #include "coretypes.h"
194 #include "tm.h"
195 #include "tree.h"
196 #include "flags.h"
197 #include "rtl.h"
198 #include "tm_p.h"
199 #include "ggc.h"
200 #include "basic-block.h"
201 #include "output.h"
202 #include "expr.h"
203 #include "function.h"
204 #include "diagnostic.h"
205 #include "timevar.h"
206 #include "tree-dump.h"
207 #include "tree-flow.h"
208 #include "tree-pass.h"
209 #include "tree-ssa-propagate.h"
210 #include "langhooks.h"
211 #include "target.h"
214 /* Possible lattice values. */
215 typedef enum
217 UNINITIALIZED = 0,
218 UNDEFINED,
219 UNKNOWN_VAL,
220 CONSTANT,
221 VARYING
222 } ccp_lattice_t;
224 /* Array of propagated constant values. After propagation,
225 CONST_VAL[I].VALUE holds the constant value for SSA_NAME(I). If
226 the constant is held in an SSA name representing a memory store
227 (i.e., a V_MAY_DEF or V_MUST_DEF), CONST_VAL[I].MEM_REF will
228 contain the actual memory reference used to store (i.e., the LHS of
229 the assignment doing the store). */
230 static prop_value_t *const_val;
232 /* True if we are also propagating constants in stores and loads. */
233 static bool do_store_ccp;
235 /* Dump constant propagation value VAL to file OUTF prefixed by PREFIX. */
237 static void
238 dump_lattice_value (FILE *outf, const char *prefix, prop_value_t val)
240 switch (val.lattice_val)
242 case UNINITIALIZED:
243 fprintf (outf, "%sUNINITIALIZED", prefix);
244 break;
245 case UNDEFINED:
246 fprintf (outf, "%sUNDEFINED", prefix);
247 break;
248 case VARYING:
249 fprintf (outf, "%sVARYING", prefix);
250 break;
251 case UNKNOWN_VAL:
252 fprintf (outf, "%sUNKNOWN_VAL", prefix);
253 break;
254 case CONSTANT:
255 fprintf (outf, "%sCONSTANT ", prefix);
256 print_generic_expr (outf, val.value, dump_flags);
257 break;
258 default:
259 gcc_unreachable ();
264 /* Print lattice value VAL to stderr. */
266 void debug_lattice_value (prop_value_t val);
268 void
269 debug_lattice_value (prop_value_t val)
271 dump_lattice_value (stderr, "", val);
272 fprintf (stderr, "\n");
276 /* The regular is_gimple_min_invariant does a shallow test of the object.
277 It assumes that full gimplification has happened, or will happen on the
278 object. For a value coming from DECL_INITIAL, this is not true, so we
279 have to be more strict ourselves. */
281 static bool
282 ccp_decl_initial_min_invariant (tree t)
284 if (!is_gimple_min_invariant (t))
285 return false;
286 if (TREE_CODE (t) == ADDR_EXPR)
288 /* Inline and unroll is_gimple_addressable. */
289 while (1)
291 t = TREE_OPERAND (t, 0);
292 if (is_gimple_id (t))
293 return true;
294 if (!handled_component_p (t))
295 return false;
298 return true;
302 /* Compute a default value for variable VAR and store it in the
303 CONST_VAL array. The following rules are used to get default
304 values:
306 1- Global and static variables that are declared constant are
307 considered CONSTANT.
309 2- Any other value is considered UNDEFINED. This is useful when
310 considering PHI nodes. PHI arguments that are undefined do not
311 change the constant value of the PHI node, which allows for more
312 constants to be propagated.
314 3- If SSA_NAME_VALUE is set and it is a constant, its value is
315 used.
317 4- Variables defined by statements other than assignments and PHI
318 nodes are considered VARYING.
320 5- Variables that are not GIMPLE registers are considered
321 UNKNOWN_VAL, which is really a stronger version of UNDEFINED.
322 It's used to avoid the short circuit evaluation implied by
323 UNDEFINED in ccp_lattice_meet. */
325 static prop_value_t
326 get_default_value (tree var)
328 tree sym = SSA_NAME_VAR (var);
329 prop_value_t val = { UNINITIALIZED, NULL_TREE, NULL_TREE };
331 if (!do_store_ccp && !is_gimple_reg (var))
333 /* Short circuit for regular CCP. We are not interested in any
334 non-register when DO_STORE_CCP is false. */
335 val.lattice_val = VARYING;
337 else if (SSA_NAME_VALUE (var)
338 && is_gimple_min_invariant (SSA_NAME_VALUE (var)))
340 val.lattice_val = CONSTANT;
341 val.value = SSA_NAME_VALUE (var);
343 else if (TREE_STATIC (sym)
344 && TREE_READONLY (sym)
345 && !MTAG_P (sym)
346 && DECL_INITIAL (sym)
347 && ccp_decl_initial_min_invariant (DECL_INITIAL (sym)))
349 /* Globals and static variables declared 'const' take their
350 initial value. */
351 val.lattice_val = CONSTANT;
352 val.value = DECL_INITIAL (sym);
353 val.mem_ref = sym;
355 else
357 tree stmt = SSA_NAME_DEF_STMT (var);
359 if (IS_EMPTY_STMT (stmt))
361 /* Variables defined by an empty statement are those used
362 before being initialized. If VAR is a local variable, we
363 can assume initially that it is UNDEFINED. If we are
364 doing STORE-CCP, function arguments and non-register
365 variables are initially UNKNOWN_VAL, because we cannot
366 discard the value incoming from outside of this function
367 (see ccp_lattice_meet for details). */
368 if (is_gimple_reg (sym) && TREE_CODE (sym) != PARM_DECL)
369 val.lattice_val = UNDEFINED;
370 else if (do_store_ccp)
371 val.lattice_val = UNKNOWN_VAL;
372 else
373 val.lattice_val = VARYING;
375 else if (TREE_CODE (stmt) == MODIFY_EXPR
376 || TREE_CODE (stmt) == PHI_NODE)
378 /* Any other variable defined by an assignment or a PHI node
379 is considered UNDEFINED (or UNKNOWN_VAL if VAR is not a
380 GIMPLE register). */
381 val.lattice_val = is_gimple_reg (sym) ? UNDEFINED : UNKNOWN_VAL;
383 else
385 /* Otherwise, VAR will never take on a constant value. */
386 val.lattice_val = VARYING;
390 return val;
394 /* Get the constant value associated with variable VAR. If
395 MAY_USE_DEFAULT_P is true, call get_default_value on variables that
396 have the lattice value UNINITIALIZED. */
398 static prop_value_t *
399 get_value (tree var, bool may_use_default_p)
401 prop_value_t *val = &const_val[SSA_NAME_VERSION (var)];
402 if (may_use_default_p && val->lattice_val == UNINITIALIZED)
403 *val = get_default_value (var);
405 return val;
409 /* Set the value for variable VAR to NEW_VAL. Return true if the new
410 value is different from VAR's previous value. */
412 static bool
413 set_lattice_value (tree var, prop_value_t new_val)
415 prop_value_t *old_val = get_value (var, false);
417 /* Lattice transitions must always be monotonically increasing in
418 value. We allow two exceptions:
420 1- If *OLD_VAL and NEW_VAL are the same, return false to
421 inform the caller that this was a non-transition.
423 2- If we are doing store-ccp (i.e., DOING_STORE_CCP is true),
424 allow CONSTANT->UNKNOWN_VAL. The UNKNOWN_VAL state is a
425 special type of UNDEFINED state which prevents the short
426 circuit evaluation of PHI arguments (see ccp_visit_phi_node
427 and ccp_lattice_meet). */
428 gcc_assert (old_val->lattice_val <= new_val.lattice_val
429 || (old_val->lattice_val == new_val.lattice_val
430 && old_val->value == new_val.value
431 && old_val->mem_ref == new_val.mem_ref)
432 || (do_store_ccp
433 && old_val->lattice_val == CONSTANT
434 && new_val.lattice_val == UNKNOWN_VAL));
436 if (old_val->lattice_val != new_val.lattice_val)
438 if (dump_file && (dump_flags & TDF_DETAILS))
440 dump_lattice_value (dump_file, "Lattice value changed to ", new_val);
441 fprintf (dump_file, ". %sdding SSA edges to worklist.\n",
442 new_val.lattice_val != UNDEFINED ? "A" : "Not a");
445 *old_val = new_val;
447 /* Transitions UNINITIALIZED -> UNDEFINED are never interesting
448 for propagation purposes. In these cases return false to
449 avoid doing useless work. */
450 return (new_val.lattice_val != UNDEFINED);
453 return false;
457 /* Return the likely CCP lattice value for STMT.
459 If STMT has no operands, then return CONSTANT.
461 Else if any operands of STMT are undefined, then return UNDEFINED.
463 Else if any operands of STMT are constants, then return CONSTANT.
465 Else return VARYING. */
467 static ccp_lattice_t
468 likely_value (tree stmt)
470 bool found_constant;
471 stmt_ann_t ann;
472 tree use;
473 ssa_op_iter iter;
475 ann = stmt_ann (stmt);
477 /* If the statement has volatile operands, it won't fold to a
478 constant value. */
479 if (ann->has_volatile_ops)
480 return VARYING;
482 /* If we are not doing store-ccp, statements with loads
483 and/or stores will never fold into a constant. */
484 if (!do_store_ccp
485 && !ZERO_SSA_OPERANDS (stmt, SSA_OP_ALL_VIRTUALS))
486 return VARYING;
489 /* A CALL_EXPR is assumed to be varying. NOTE: This may be overly
490 conservative, in the presence of const and pure calls. */
491 if (get_call_expr_in (stmt) != NULL_TREE)
492 return VARYING;
494 /* Anything other than assignments and conditional jumps are not
495 interesting for CCP. */
496 if (TREE_CODE (stmt) != MODIFY_EXPR
497 && TREE_CODE (stmt) != COND_EXPR
498 && TREE_CODE (stmt) != SWITCH_EXPR)
499 return VARYING;
501 if (is_gimple_min_invariant (get_rhs (stmt)))
502 return CONSTANT;
504 found_constant = false;
505 FOR_EACH_SSA_TREE_OPERAND (use, stmt, iter, SSA_OP_USE|SSA_OP_VUSE)
507 prop_value_t *val = get_value (use, true);
509 if (val->lattice_val == VARYING)
510 return VARYING;
512 if (val->lattice_val == UNKNOWN_VAL)
514 /* UNKNOWN_VAL is invalid when not doing STORE-CCP. */
515 gcc_assert (do_store_ccp);
516 return UNKNOWN_VAL;
519 if (val->lattice_val == CONSTANT)
520 found_constant = true;
523 if (found_constant
524 || ZERO_SSA_OPERANDS (stmt, SSA_OP_USE)
525 || ZERO_SSA_OPERANDS (stmt, SSA_OP_VUSE))
526 return CONSTANT;
528 return UNDEFINED;
532 /* Initialize local data structures for CCP. */
534 static void
535 ccp_initialize (void)
537 basic_block bb;
539 const_val = XNEWVEC (prop_value_t, num_ssa_names);
540 memset (const_val, 0, num_ssa_names * sizeof (*const_val));
542 /* Initialize simulation flags for PHI nodes and statements. */
543 FOR_EACH_BB (bb)
545 block_stmt_iterator i;
547 for (i = bsi_start (bb); !bsi_end_p (i); bsi_next (&i))
549 bool is_varying = false;
550 tree stmt = bsi_stmt (i);
552 if (likely_value (stmt) == VARYING)
555 tree def;
556 ssa_op_iter iter;
558 /* If the statement will not produce a constant, mark
559 all its outputs VARYING. */
560 FOR_EACH_SSA_TREE_OPERAND (def, stmt, iter, SSA_OP_ALL_DEFS)
561 get_value (def, false)->lattice_val = VARYING;
563 /* Never mark conditional jumps with DONT_SIMULATE_AGAIN,
564 otherwise the propagator will never add the outgoing
565 control edges. */
566 if (TREE_CODE (stmt) != COND_EXPR
567 && TREE_CODE (stmt) != SWITCH_EXPR)
568 is_varying = true;
571 DONT_SIMULATE_AGAIN (stmt) = is_varying;
575 /* Now process PHI nodes. */
576 FOR_EACH_BB (bb)
578 tree phi;
580 for (phi = phi_nodes (bb); phi; phi = PHI_CHAIN (phi))
582 int i;
583 tree arg;
584 prop_value_t *val = get_value (PHI_RESULT (phi), false);
586 for (i = 0; i < PHI_NUM_ARGS (phi); i++)
588 arg = PHI_ARG_DEF (phi, i);
590 if (TREE_CODE (arg) == SSA_NAME
591 && get_value (arg, false)->lattice_val == VARYING)
593 val->lattice_val = VARYING;
594 break;
598 DONT_SIMULATE_AGAIN (phi) = (val->lattice_val == VARYING);
604 /* Do final substitution of propagated values, cleanup the flowgraph and
605 free allocated storage. */
607 static void
608 ccp_finalize (void)
610 /* Perform substitutions based on the known constant values. */
611 substitute_and_fold (const_val, false);
613 free (const_val);
617 /* Compute the meet operator between *VAL1 and *VAL2. Store the result
618 in VAL1.
620 any M UNDEFINED = any
621 any M UNKNOWN_VAL = UNKNOWN_VAL
622 any M VARYING = VARYING
623 Ci M Cj = Ci if (i == j)
624 Ci M Cj = VARYING if (i != j)
626 Lattice values UNKNOWN_VAL and UNDEFINED are similar but have
627 different semantics at PHI nodes. Both values imply that we don't
628 know whether the variable is constant or not. However, UNKNOWN_VAL
629 values override all others. For instance, suppose that A is a
630 global variable:
632 +------+
634 | / \
635 | / \
636 | | A_1 = 4
637 | \ /
638 | \ /
639 | A_3 = PHI (A_2, A_1)
640 | ... = A_3
642 +----+
644 If the edge into A_2 is not executable, the first visit to A_3 will
645 yield the constant 4. But the second visit to A_3 will be with A_2
646 in state UNKNOWN_VAL. We can no longer conclude that A_3 is 4
647 because A_2 may have been set in another function. If we had used
648 the lattice value UNDEFINED, we would have had wrongly concluded
649 that A_3 is 4. */
652 static void
653 ccp_lattice_meet (prop_value_t *val1, prop_value_t *val2)
655 if (val1->lattice_val == UNDEFINED)
657 /* UNDEFINED M any = any */
658 *val1 = *val2;
660 else if (val2->lattice_val == UNDEFINED)
662 /* any M UNDEFINED = any
663 Nothing to do. VAL1 already contains the value we want. */
666 else if (val1->lattice_val == UNKNOWN_VAL
667 || val2->lattice_val == UNKNOWN_VAL)
669 /* UNKNOWN_VAL values are invalid if we are not doing STORE-CCP. */
670 gcc_assert (do_store_ccp);
672 /* any M UNKNOWN_VAL = UNKNOWN_VAL. */
673 val1->lattice_val = UNKNOWN_VAL;
674 val1->value = NULL_TREE;
675 val1->mem_ref = NULL_TREE;
677 else if (val1->lattice_val == VARYING
678 || val2->lattice_val == VARYING)
680 /* any M VARYING = VARYING. */
681 val1->lattice_val = VARYING;
682 val1->value = NULL_TREE;
683 val1->mem_ref = NULL_TREE;
685 else if (val1->lattice_val == CONSTANT
686 && val2->lattice_val == CONSTANT
687 && simple_cst_equal (val1->value, val2->value) == 1
688 && (!do_store_ccp
689 || (val1->mem_ref && val2->mem_ref
690 && operand_equal_p (val1->mem_ref, val2->mem_ref, 0))))
692 /* Ci M Cj = Ci if (i == j)
693 Ci M Cj = VARYING if (i != j)
695 If these two values come from memory stores, make sure that
696 they come from the same memory reference. */
697 val1->lattice_val = CONSTANT;
698 val1->value = val1->value;
699 val1->mem_ref = val1->mem_ref;
701 else
703 /* Any other combination is VARYING. */
704 val1->lattice_val = VARYING;
705 val1->value = NULL_TREE;
706 val1->mem_ref = NULL_TREE;
711 /* Loop through the PHI_NODE's parameters for BLOCK and compare their
712 lattice values to determine PHI_NODE's lattice value. The value of a
713 PHI node is determined calling ccp_lattice_meet with all the arguments
714 of the PHI node that are incoming via executable edges. */
716 static enum ssa_prop_result
717 ccp_visit_phi_node (tree phi)
719 int i;
720 prop_value_t *old_val, new_val;
722 if (dump_file && (dump_flags & TDF_DETAILS))
724 fprintf (dump_file, "\nVisiting PHI node: ");
725 print_generic_expr (dump_file, phi, dump_flags);
728 old_val = get_value (PHI_RESULT (phi), false);
729 switch (old_val->lattice_val)
731 case VARYING:
732 return SSA_PROP_VARYING;
734 case CONSTANT:
735 new_val = *old_val;
736 break;
738 case UNKNOWN_VAL:
739 /* To avoid the default value of UNKNOWN_VAL overriding
740 that of its possible constant arguments, temporarily
741 set the PHI node's default lattice value to be
742 UNDEFINED. If the PHI node's old value was UNKNOWN_VAL and
743 the new value is UNDEFINED, then we prevent the invalid
744 transition by not calling set_lattice_value. */
745 gcc_assert (do_store_ccp);
747 /* FALLTHRU */
749 case UNDEFINED:
750 case UNINITIALIZED:
751 new_val.lattice_val = UNDEFINED;
752 new_val.value = NULL_TREE;
753 new_val.mem_ref = NULL_TREE;
754 break;
756 default:
757 gcc_unreachable ();
760 for (i = 0; i < PHI_NUM_ARGS (phi); i++)
762 /* Compute the meet operator over all the PHI arguments flowing
763 through executable edges. */
764 edge e = PHI_ARG_EDGE (phi, i);
766 if (dump_file && (dump_flags & TDF_DETAILS))
768 fprintf (dump_file,
769 "\n Argument #%d (%d -> %d %sexecutable)\n",
770 i, e->src->index, e->dest->index,
771 (e->flags & EDGE_EXECUTABLE) ? "" : "not ");
774 /* If the incoming edge is executable, Compute the meet operator for
775 the existing value of the PHI node and the current PHI argument. */
776 if (e->flags & EDGE_EXECUTABLE)
778 tree arg = PHI_ARG_DEF (phi, i);
779 prop_value_t arg_val;
781 if (is_gimple_min_invariant (arg))
783 arg_val.lattice_val = CONSTANT;
784 arg_val.value = arg;
785 arg_val.mem_ref = NULL_TREE;
787 else
788 arg_val = *(get_value (arg, true));
790 ccp_lattice_meet (&new_val, &arg_val);
792 if (dump_file && (dump_flags & TDF_DETAILS))
794 fprintf (dump_file, "\t");
795 print_generic_expr (dump_file, arg, dump_flags);
796 dump_lattice_value (dump_file, "\tValue: ", arg_val);
797 fprintf (dump_file, "\n");
800 if (new_val.lattice_val == VARYING)
801 break;
805 if (dump_file && (dump_flags & TDF_DETAILS))
807 dump_lattice_value (dump_file, "\n PHI node value: ", new_val);
808 fprintf (dump_file, "\n\n");
811 /* Check for an invalid change from UNKNOWN_VAL to UNDEFINED. */
812 if (do_store_ccp
813 && old_val->lattice_val == UNKNOWN_VAL
814 && new_val.lattice_val == UNDEFINED)
815 return SSA_PROP_NOT_INTERESTING;
817 /* Otherwise, make the transition to the new value. */
818 if (set_lattice_value (PHI_RESULT (phi), new_val))
820 if (new_val.lattice_val == VARYING)
821 return SSA_PROP_VARYING;
822 else
823 return SSA_PROP_INTERESTING;
825 else
826 return SSA_PROP_NOT_INTERESTING;
830 /* CCP specific front-end to the non-destructive constant folding
831 routines.
833 Attempt to simplify the RHS of STMT knowing that one or more
834 operands are constants.
836 If simplification is possible, return the simplified RHS,
837 otherwise return the original RHS. */
839 static tree
840 ccp_fold (tree stmt)
842 tree rhs = get_rhs (stmt);
843 enum tree_code code = TREE_CODE (rhs);
844 enum tree_code_class kind = TREE_CODE_CLASS (code);
845 tree retval = NULL_TREE;
847 if (TREE_CODE (rhs) == SSA_NAME)
849 /* If the RHS is an SSA_NAME, return its known constant value,
850 if any. */
851 return get_value (rhs, true)->value;
853 else if (do_store_ccp && stmt_makes_single_load (stmt))
855 /* If the RHS is a memory load, see if the VUSEs associated with
856 it are a valid constant for that memory load. */
857 prop_value_t *val = get_value_loaded_by (stmt, const_val);
858 if (val && val->mem_ref)
860 if (operand_equal_p (val->mem_ref, rhs, 0))
861 return val->value;
863 /* If RHS is extracting REALPART_EXPR or IMAGPART_EXPR of a
864 complex type with a known constant value, return it. */
865 if ((TREE_CODE (rhs) == REALPART_EXPR
866 || TREE_CODE (rhs) == IMAGPART_EXPR)
867 && operand_equal_p (val->mem_ref, TREE_OPERAND (rhs, 0), 0))
868 return fold_build1 (TREE_CODE (rhs), TREE_TYPE (rhs), val->value);
870 return NULL_TREE;
873 /* Unary operators. Note that we know the single operand must
874 be a constant. So this should almost always return a
875 simplified RHS. */
876 if (kind == tcc_unary)
878 /* Handle unary operators which can appear in GIMPLE form. */
879 tree op0 = TREE_OPERAND (rhs, 0);
881 /* Simplify the operand down to a constant. */
882 if (TREE_CODE (op0) == SSA_NAME)
884 prop_value_t *val = get_value (op0, true);
885 if (val->lattice_val == CONSTANT)
886 op0 = get_value (op0, true)->value;
889 if ((code == NOP_EXPR || code == CONVERT_EXPR)
890 && tree_ssa_useless_type_conversion_1 (TREE_TYPE (rhs),
891 TREE_TYPE (op0)))
892 return op0;
893 return fold_unary (code, TREE_TYPE (rhs), op0);
896 /* Binary and comparison operators. We know one or both of the
897 operands are constants. */
898 else if (kind == tcc_binary
899 || kind == tcc_comparison
900 || code == TRUTH_AND_EXPR
901 || code == TRUTH_OR_EXPR
902 || code == TRUTH_XOR_EXPR)
904 /* Handle binary and comparison operators that can appear in
905 GIMPLE form. */
906 tree op0 = TREE_OPERAND (rhs, 0);
907 tree op1 = TREE_OPERAND (rhs, 1);
909 /* Simplify the operands down to constants when appropriate. */
910 if (TREE_CODE (op0) == SSA_NAME)
912 prop_value_t *val = get_value (op0, true);
913 if (val->lattice_val == CONSTANT)
914 op0 = val->value;
917 if (TREE_CODE (op1) == SSA_NAME)
919 prop_value_t *val = get_value (op1, true);
920 if (val->lattice_val == CONSTANT)
921 op1 = val->value;
924 return fold_binary (code, TREE_TYPE (rhs), op0, op1);
927 /* We may be able to fold away calls to builtin functions if their
928 arguments are constants. */
929 else if (code == CALL_EXPR
930 && TREE_CODE (TREE_OPERAND (rhs, 0)) == ADDR_EXPR
931 && (TREE_CODE (TREE_OPERAND (TREE_OPERAND (rhs, 0), 0))
932 == FUNCTION_DECL)
933 && DECL_BUILT_IN (TREE_OPERAND (TREE_OPERAND (rhs, 0), 0)))
935 if (!ZERO_SSA_OPERANDS (stmt, SSA_OP_USE))
937 tree *orig, var;
938 tree fndecl, arglist;
939 size_t i = 0;
940 ssa_op_iter iter;
941 use_operand_p var_p;
943 /* Preserve the original values of every operand. */
944 orig = XNEWVEC (tree, NUM_SSA_OPERANDS (stmt, SSA_OP_USE));
945 FOR_EACH_SSA_TREE_OPERAND (var, stmt, iter, SSA_OP_USE)
946 orig[i++] = var;
948 /* Substitute operands with their values and try to fold. */
949 replace_uses_in (stmt, NULL, const_val);
950 fndecl = get_callee_fndecl (rhs);
951 arglist = TREE_OPERAND (rhs, 1);
952 retval = fold_builtin (fndecl, arglist, false);
954 /* Restore operands to their original form. */
955 i = 0;
956 FOR_EACH_SSA_USE_OPERAND (var_p, stmt, iter, SSA_OP_USE)
957 SET_USE (var_p, orig[i++]);
958 free (orig);
961 else
962 return rhs;
964 /* If we got a simplified form, see if we need to convert its type. */
965 if (retval)
966 return fold_convert (TREE_TYPE (rhs), retval);
968 /* No simplification was possible. */
969 return rhs;
973 /* Return the tree representing the element referenced by T if T is an
974 ARRAY_REF or COMPONENT_REF into constant aggregates. Return
975 NULL_TREE otherwise. */
977 static tree
978 fold_const_aggregate_ref (tree t)
980 prop_value_t *value;
981 tree base, ctor, idx, field;
982 unsigned HOST_WIDE_INT cnt;
983 tree cfield, cval;
985 switch (TREE_CODE (t))
987 case ARRAY_REF:
988 /* Get a CONSTRUCTOR. If BASE is a VAR_DECL, get its
989 DECL_INITIAL. If BASE is a nested reference into another
990 ARRAY_REF or COMPONENT_REF, make a recursive call to resolve
991 the inner reference. */
992 base = TREE_OPERAND (t, 0);
993 switch (TREE_CODE (base))
995 case VAR_DECL:
996 if (!TREE_READONLY (base)
997 || TREE_CODE (TREE_TYPE (base)) != ARRAY_TYPE
998 || !targetm.binds_local_p (base))
999 return NULL_TREE;
1001 ctor = DECL_INITIAL (base);
1002 break;
1004 case ARRAY_REF:
1005 case COMPONENT_REF:
1006 ctor = fold_const_aggregate_ref (base);
1007 break;
1009 default:
1010 return NULL_TREE;
1013 if (ctor == NULL_TREE
1014 || (TREE_CODE (ctor) != CONSTRUCTOR
1015 && TREE_CODE (ctor) != STRING_CST)
1016 || !TREE_STATIC (ctor))
1017 return NULL_TREE;
1019 /* Get the index. If we have an SSA_NAME, try to resolve it
1020 with the current lattice value for the SSA_NAME. */
1021 idx = TREE_OPERAND (t, 1);
1022 switch (TREE_CODE (idx))
1024 case SSA_NAME:
1025 if ((value = get_value (idx, true))
1026 && value->lattice_val == CONSTANT
1027 && TREE_CODE (value->value) == INTEGER_CST)
1028 idx = value->value;
1029 else
1030 return NULL_TREE;
1031 break;
1033 case INTEGER_CST:
1034 break;
1036 default:
1037 return NULL_TREE;
1040 /* Fold read from constant string. */
1041 if (TREE_CODE (ctor) == STRING_CST)
1043 if ((TYPE_MODE (TREE_TYPE (t))
1044 == TYPE_MODE (TREE_TYPE (TREE_TYPE (ctor))))
1045 && (GET_MODE_CLASS (TYPE_MODE (TREE_TYPE (TREE_TYPE (ctor))))
1046 == MODE_INT)
1047 && GET_MODE_SIZE (TYPE_MODE (TREE_TYPE (TREE_TYPE (ctor)))) == 1
1048 && compare_tree_int (idx, TREE_STRING_LENGTH (ctor)) < 0)
1049 return build_int_cst (TREE_TYPE (t), (TREE_STRING_POINTER (ctor)
1050 [TREE_INT_CST_LOW (idx)]));
1051 return NULL_TREE;
1054 /* Whoo-hoo! I'll fold ya baby. Yeah! */
1055 FOR_EACH_CONSTRUCTOR_ELT (CONSTRUCTOR_ELTS (ctor), cnt, cfield, cval)
1056 if (tree_int_cst_equal (cfield, idx))
1057 return cval;
1058 break;
1060 case COMPONENT_REF:
1061 /* Get a CONSTRUCTOR. If BASE is a VAR_DECL, get its
1062 DECL_INITIAL. If BASE is a nested reference into another
1063 ARRAY_REF or COMPONENT_REF, make a recursive call to resolve
1064 the inner reference. */
1065 base = TREE_OPERAND (t, 0);
1066 switch (TREE_CODE (base))
1068 case VAR_DECL:
1069 if (!TREE_READONLY (base)
1070 || TREE_CODE (TREE_TYPE (base)) != RECORD_TYPE
1071 || !targetm.binds_local_p (base))
1072 return NULL_TREE;
1074 ctor = DECL_INITIAL (base);
1075 break;
1077 case ARRAY_REF:
1078 case COMPONENT_REF:
1079 ctor = fold_const_aggregate_ref (base);
1080 break;
1082 default:
1083 return NULL_TREE;
1086 if (ctor == NULL_TREE
1087 || TREE_CODE (ctor) != CONSTRUCTOR
1088 || !TREE_STATIC (ctor))
1089 return NULL_TREE;
1091 field = TREE_OPERAND (t, 1);
1093 FOR_EACH_CONSTRUCTOR_ELT (CONSTRUCTOR_ELTS (ctor), cnt, cfield, cval)
1094 if (cfield == field
1095 /* FIXME: Handle bit-fields. */
1096 && ! DECL_BIT_FIELD (cfield))
1097 return cval;
1098 break;
1100 case REALPART_EXPR:
1101 case IMAGPART_EXPR:
1103 tree c = fold_const_aggregate_ref (TREE_OPERAND (t, 0));
1104 if (c && TREE_CODE (c) == COMPLEX_CST)
1105 return fold_build1 (TREE_CODE (t), TREE_TYPE (t), c);
1106 break;
1109 default:
1110 break;
1113 return NULL_TREE;
1116 /* Evaluate statement STMT. */
1118 static prop_value_t
1119 evaluate_stmt (tree stmt)
1121 prop_value_t val;
1122 tree simplified = NULL_TREE;
1123 ccp_lattice_t likelyvalue = likely_value (stmt);
1125 val.mem_ref = NULL_TREE;
1127 /* If the statement is likely to have a CONSTANT result, then try
1128 to fold the statement to determine the constant value. */
1129 if (likelyvalue == CONSTANT)
1130 simplified = ccp_fold (stmt);
1131 /* If the statement is likely to have a VARYING result, then do not
1132 bother folding the statement. */
1133 if (likelyvalue == VARYING)
1134 simplified = get_rhs (stmt);
1135 /* If the statement is an ARRAY_REF or COMPONENT_REF into constant
1136 aggregates, extract the referenced constant. Otherwise the
1137 statement is likely to have an UNDEFINED value, and there will be
1138 nothing to do. Note that fold_const_aggregate_ref returns
1139 NULL_TREE if the first case does not match. */
1140 else if (!simplified)
1141 simplified = fold_const_aggregate_ref (get_rhs (stmt));
1143 if (simplified && is_gimple_min_invariant (simplified))
1145 /* The statement produced a constant value. */
1146 val.lattice_val = CONSTANT;
1147 val.value = simplified;
1149 else
1151 /* The statement produced a nonconstant value. If the statement
1152 had UNDEFINED operands, then the result of the statement
1153 should be UNDEFINED. Otherwise, the statement is VARYING. */
1154 if (likelyvalue == UNDEFINED || likelyvalue == UNKNOWN_VAL)
1155 val.lattice_val = likelyvalue;
1156 else
1157 val.lattice_val = VARYING;
1159 val.value = NULL_TREE;
1162 return val;
1166 /* Visit the assignment statement STMT. Set the value of its LHS to the
1167 value computed by the RHS and store LHS in *OUTPUT_P. If STMT
1168 creates virtual definitions, set the value of each new name to that
1169 of the RHS (if we can derive a constant out of the RHS). */
1171 static enum ssa_prop_result
1172 visit_assignment (tree stmt, tree *output_p)
1174 prop_value_t val;
1175 tree lhs, rhs;
1176 enum ssa_prop_result retval;
1178 lhs = TREE_OPERAND (stmt, 0);
1179 rhs = TREE_OPERAND (stmt, 1);
1181 if (TREE_CODE (rhs) == SSA_NAME)
1183 /* For a simple copy operation, we copy the lattice values. */
1184 prop_value_t *nval = get_value (rhs, true);
1185 val = *nval;
1187 else if (do_store_ccp && stmt_makes_single_load (stmt))
1189 /* Same as above, but the RHS is not a gimple register and yet
1190 has a known VUSE. If STMT is loading from the same memory
1191 location that created the SSA_NAMEs for the virtual operands,
1192 we can propagate the value on the RHS. */
1193 prop_value_t *nval = get_value_loaded_by (stmt, const_val);
1195 if (nval && nval->mem_ref
1196 && operand_equal_p (nval->mem_ref, rhs, 0))
1197 val = *nval;
1198 else
1199 val = evaluate_stmt (stmt);
1201 else
1202 /* Evaluate the statement. */
1203 val = evaluate_stmt (stmt);
1205 /* If the original LHS was a VIEW_CONVERT_EXPR, modify the constant
1206 value to be a VIEW_CONVERT_EXPR of the old constant value.
1208 ??? Also, if this was a definition of a bitfield, we need to widen
1209 the constant value into the type of the destination variable. This
1210 should not be necessary if GCC represented bitfields properly. */
1212 tree orig_lhs = TREE_OPERAND (stmt, 0);
1214 if (TREE_CODE (orig_lhs) == VIEW_CONVERT_EXPR
1215 && val.lattice_val == CONSTANT)
1217 tree w = fold_unary (VIEW_CONVERT_EXPR,
1218 TREE_TYPE (TREE_OPERAND (orig_lhs, 0)),
1219 val.value);
1221 orig_lhs = TREE_OPERAND (orig_lhs, 0);
1222 if (w && is_gimple_min_invariant (w))
1223 val.value = w;
1224 else
1226 val.lattice_val = VARYING;
1227 val.value = NULL;
1231 if (val.lattice_val == CONSTANT
1232 && TREE_CODE (orig_lhs) == COMPONENT_REF
1233 && DECL_BIT_FIELD (TREE_OPERAND (orig_lhs, 1)))
1235 tree w = widen_bitfield (val.value, TREE_OPERAND (orig_lhs, 1),
1236 orig_lhs);
1238 if (w && is_gimple_min_invariant (w))
1239 val.value = w;
1240 else
1242 val.lattice_val = VARYING;
1243 val.value = NULL_TREE;
1244 val.mem_ref = NULL_TREE;
1249 retval = SSA_PROP_NOT_INTERESTING;
1251 /* Set the lattice value of the statement's output. */
1252 if (TREE_CODE (lhs) == SSA_NAME)
1254 /* If STMT is an assignment to an SSA_NAME, we only have one
1255 value to set. */
1256 if (set_lattice_value (lhs, val))
1258 *output_p = lhs;
1259 if (val.lattice_val == VARYING)
1260 retval = SSA_PROP_VARYING;
1261 else
1262 retval = SSA_PROP_INTERESTING;
1265 else if (do_store_ccp && stmt_makes_single_store (stmt))
1267 /* Otherwise, set the names in V_MAY_DEF/V_MUST_DEF operands
1268 to the new constant value and mark the LHS as the memory
1269 reference associated with VAL. */
1270 ssa_op_iter i;
1271 tree vdef;
1272 bool changed;
1274 /* Stores cannot take on an UNDEFINED value. */
1275 if (val.lattice_val == UNDEFINED)
1276 val.lattice_val = UNKNOWN_VAL;
1278 /* Mark VAL as stored in the LHS of this assignment. */
1279 val.mem_ref = lhs;
1281 /* Set the value of every VDEF to VAL. */
1282 changed = false;
1283 FOR_EACH_SSA_TREE_OPERAND (vdef, stmt, i, SSA_OP_VIRTUAL_DEFS)
1284 changed |= set_lattice_value (vdef, val);
1286 /* Note that for propagation purposes, we are only interested in
1287 visiting statements that load the exact same memory reference
1288 stored here. Those statements will have the exact same list
1289 of virtual uses, so it is enough to set the output of this
1290 statement to be its first virtual definition. */
1291 *output_p = first_vdef (stmt);
1292 if (changed)
1294 if (val.lattice_val == VARYING)
1295 retval = SSA_PROP_VARYING;
1296 else
1297 retval = SSA_PROP_INTERESTING;
1301 return retval;
1305 /* Visit the conditional statement STMT. Return SSA_PROP_INTERESTING
1306 if it can determine which edge will be taken. Otherwise, return
1307 SSA_PROP_VARYING. */
1309 static enum ssa_prop_result
1310 visit_cond_stmt (tree stmt, edge *taken_edge_p)
1312 prop_value_t val;
1313 basic_block block;
1315 block = bb_for_stmt (stmt);
1316 val = evaluate_stmt (stmt);
1318 /* Find which edge out of the conditional block will be taken and add it
1319 to the worklist. If no single edge can be determined statically,
1320 return SSA_PROP_VARYING to feed all the outgoing edges to the
1321 propagation engine. */
1322 *taken_edge_p = val.value ? find_taken_edge (block, val.value) : 0;
1323 if (*taken_edge_p)
1324 return SSA_PROP_INTERESTING;
1325 else
1326 return SSA_PROP_VARYING;
1330 /* Evaluate statement STMT. If the statement produces an output value and
1331 its evaluation changes the lattice value of its output, return
1332 SSA_PROP_INTERESTING and set *OUTPUT_P to the SSA_NAME holding the
1333 output value.
1335 If STMT is a conditional branch and we can determine its truth
1336 value, set *TAKEN_EDGE_P accordingly. If STMT produces a varying
1337 value, return SSA_PROP_VARYING. */
1339 static enum ssa_prop_result
1340 ccp_visit_stmt (tree stmt, edge *taken_edge_p, tree *output_p)
1342 tree def;
1343 ssa_op_iter iter;
1345 if (dump_file && (dump_flags & TDF_DETAILS))
1347 fprintf (dump_file, "\nVisiting statement:\n");
1348 print_generic_stmt (dump_file, stmt, dump_flags);
1349 fprintf (dump_file, "\n");
1352 if (TREE_CODE (stmt) == MODIFY_EXPR)
1354 /* If the statement is an assignment that produces a single
1355 output value, evaluate its RHS to see if the lattice value of
1356 its output has changed. */
1357 return visit_assignment (stmt, output_p);
1359 else if (TREE_CODE (stmt) == COND_EXPR || TREE_CODE (stmt) == SWITCH_EXPR)
1361 /* If STMT is a conditional branch, see if we can determine
1362 which branch will be taken. */
1363 return visit_cond_stmt (stmt, taken_edge_p);
1366 /* Any other kind of statement is not interesting for constant
1367 propagation and, therefore, not worth simulating. */
1368 if (dump_file && (dump_flags & TDF_DETAILS))
1369 fprintf (dump_file, "No interesting values produced. Marked VARYING.\n");
1371 /* Definitions made by statements other than assignments to
1372 SSA_NAMEs represent unknown modifications to their outputs.
1373 Mark them VARYING. */
1374 FOR_EACH_SSA_TREE_OPERAND (def, stmt, iter, SSA_OP_ALL_DEFS)
1376 prop_value_t v = { VARYING, NULL_TREE, NULL_TREE };
1377 set_lattice_value (def, v);
1380 return SSA_PROP_VARYING;
1384 /* Main entry point for SSA Conditional Constant Propagation. */
1386 static void
1387 execute_ssa_ccp (bool store_ccp)
1389 do_store_ccp = store_ccp;
1390 ccp_initialize ();
1391 ssa_propagate (ccp_visit_stmt, ccp_visit_phi_node);
1392 ccp_finalize ();
1396 static unsigned int
1397 do_ssa_ccp (void)
1399 execute_ssa_ccp (false);
1400 return 0;
1404 static bool
1405 gate_ccp (void)
1407 return flag_tree_ccp != 0;
1411 struct tree_opt_pass pass_ccp =
1413 "ccp", /* name */
1414 gate_ccp, /* gate */
1415 do_ssa_ccp, /* execute */
1416 NULL, /* sub */
1417 NULL, /* next */
1418 0, /* static_pass_number */
1419 TV_TREE_CCP, /* tv_id */
1420 PROP_cfg | PROP_ssa | PROP_alias, /* properties_required */
1421 0, /* properties_provided */
1422 PROP_smt_usage, /* properties_destroyed */
1423 0, /* todo_flags_start */
1424 TODO_cleanup_cfg | TODO_dump_func | TODO_update_ssa
1425 | TODO_ggc_collect | TODO_verify_ssa
1426 | TODO_verify_stmts | TODO_update_smt_usage, /* todo_flags_finish */
1427 0 /* letter */
1431 static unsigned int
1432 do_ssa_store_ccp (void)
1434 /* If STORE-CCP is not enabled, we just run regular CCP. */
1435 execute_ssa_ccp (flag_tree_store_ccp != 0);
1436 return 0;
1439 static bool
1440 gate_store_ccp (void)
1442 /* STORE-CCP is enabled only with -ftree-store-ccp, but when
1443 -fno-tree-store-ccp is specified, we should run regular CCP.
1444 That's why the pass is enabled with either flag. */
1445 return flag_tree_store_ccp != 0 || flag_tree_ccp != 0;
1449 struct tree_opt_pass pass_store_ccp =
1451 "store_ccp", /* name */
1452 gate_store_ccp, /* gate */
1453 do_ssa_store_ccp, /* execute */
1454 NULL, /* sub */
1455 NULL, /* next */
1456 0, /* static_pass_number */
1457 TV_TREE_STORE_CCP, /* tv_id */
1458 PROP_cfg | PROP_ssa | PROP_alias, /* properties_required */
1459 0, /* properties_provided */
1460 PROP_smt_usage, /* properties_destroyed */
1461 0, /* todo_flags_start */
1462 TODO_dump_func | TODO_update_ssa
1463 | TODO_ggc_collect | TODO_verify_ssa
1464 | TODO_cleanup_cfg
1465 | TODO_verify_stmts | TODO_update_smt_usage, /* todo_flags_finish */
1466 0 /* letter */
1469 /* Given a constant value VAL for bitfield FIELD, and a destination
1470 variable VAR, return VAL appropriately widened to fit into VAR. If
1471 FIELD is wider than HOST_WIDE_INT, NULL is returned. */
1473 tree
1474 widen_bitfield (tree val, tree field, tree var)
1476 unsigned HOST_WIDE_INT var_size, field_size;
1477 tree wide_val;
1478 unsigned HOST_WIDE_INT mask;
1479 unsigned int i;
1481 /* We can only do this if the size of the type and field and VAL are
1482 all constants representable in HOST_WIDE_INT. */
1483 if (!host_integerp (TYPE_SIZE (TREE_TYPE (var)), 1)
1484 || !host_integerp (DECL_SIZE (field), 1)
1485 || !host_integerp (val, 0))
1486 return NULL_TREE;
1488 var_size = tree_low_cst (TYPE_SIZE (TREE_TYPE (var)), 1);
1489 field_size = tree_low_cst (DECL_SIZE (field), 1);
1491 /* Give up if either the bitfield or the variable are too wide. */
1492 if (field_size > HOST_BITS_PER_WIDE_INT || var_size > HOST_BITS_PER_WIDE_INT)
1493 return NULL_TREE;
1495 gcc_assert (var_size >= field_size);
1497 /* If the sign bit of the value is not set or the field's type is unsigned,
1498 just mask off the high order bits of the value. */
1499 if (DECL_UNSIGNED (field)
1500 || !(tree_low_cst (val, 0) & (((HOST_WIDE_INT)1) << (field_size - 1))))
1502 /* Zero extension. Build a mask with the lower 'field_size' bits
1503 set and a BIT_AND_EXPR node to clear the high order bits of
1504 the value. */
1505 for (i = 0, mask = 0; i < field_size; i++)
1506 mask |= ((HOST_WIDE_INT) 1) << i;
1508 wide_val = fold_build2 (BIT_AND_EXPR, TREE_TYPE (var), val,
1509 build_int_cst (TREE_TYPE (var), mask));
1511 else
1513 /* Sign extension. Create a mask with the upper 'field_size'
1514 bits set and a BIT_IOR_EXPR to set the high order bits of the
1515 value. */
1516 for (i = 0, mask = 0; i < (var_size - field_size); i++)
1517 mask |= ((HOST_WIDE_INT) 1) << (var_size - i - 1);
1519 wide_val = fold_build2 (BIT_IOR_EXPR, TREE_TYPE (var), val,
1520 build_int_cst (TREE_TYPE (var), mask));
1523 return wide_val;
1527 /* A subroutine of fold_stmt_r. Attempts to fold *(A+O) to A[X].
1528 BASE is an array type. OFFSET is a byte displacement. ORIG_TYPE
1529 is the desired result type. */
1531 static tree
1532 maybe_fold_offset_to_array_ref (tree base, tree offset, tree orig_type)
1534 tree min_idx, idx, elt_offset = integer_zero_node;
1535 tree array_type, elt_type, elt_size;
1537 /* If BASE is an ARRAY_REF, we can pick up another offset (this time
1538 measured in units of the size of elements type) from that ARRAY_REF).
1539 We can't do anything if either is variable.
1541 The case we handle here is *(&A[N]+O). */
1542 if (TREE_CODE (base) == ARRAY_REF)
1544 tree low_bound = array_ref_low_bound (base);
1546 elt_offset = TREE_OPERAND (base, 1);
1547 if (TREE_CODE (low_bound) != INTEGER_CST
1548 || TREE_CODE (elt_offset) != INTEGER_CST)
1549 return NULL_TREE;
1551 elt_offset = int_const_binop (MINUS_EXPR, elt_offset, low_bound, 0);
1552 base = TREE_OPERAND (base, 0);
1555 /* Ignore stupid user tricks of indexing non-array variables. */
1556 array_type = TREE_TYPE (base);
1557 if (TREE_CODE (array_type) != ARRAY_TYPE)
1558 return NULL_TREE;
1559 elt_type = TREE_TYPE (array_type);
1560 if (!lang_hooks.types_compatible_p (orig_type, elt_type))
1561 return NULL_TREE;
1563 /* If OFFSET and ELT_OFFSET are zero, we don't care about the size of the
1564 element type (so we can use the alignment if it's not constant).
1565 Otherwise, compute the offset as an index by using a division. If the
1566 division isn't exact, then don't do anything. */
1567 elt_size = TYPE_SIZE_UNIT (elt_type);
1568 if (integer_zerop (offset))
1570 if (TREE_CODE (elt_size) != INTEGER_CST)
1571 elt_size = size_int (TYPE_ALIGN (elt_type));
1573 idx = integer_zero_node;
1575 else
1577 unsigned HOST_WIDE_INT lquo, lrem;
1578 HOST_WIDE_INT hquo, hrem;
1580 if (TREE_CODE (elt_size) != INTEGER_CST
1581 || div_and_round_double (TRUNC_DIV_EXPR, 1,
1582 TREE_INT_CST_LOW (offset),
1583 TREE_INT_CST_HIGH (offset),
1584 TREE_INT_CST_LOW (elt_size),
1585 TREE_INT_CST_HIGH (elt_size),
1586 &lquo, &hquo, &lrem, &hrem)
1587 || lrem || hrem)
1588 return NULL_TREE;
1590 idx = build_int_cst_wide (NULL_TREE, lquo, hquo);
1593 /* Assume the low bound is zero. If there is a domain type, get the
1594 low bound, if any, convert the index into that type, and add the
1595 low bound. */
1596 min_idx = integer_zero_node;
1597 if (TYPE_DOMAIN (array_type))
1599 if (TYPE_MIN_VALUE (TYPE_DOMAIN (array_type)))
1600 min_idx = TYPE_MIN_VALUE (TYPE_DOMAIN (array_type));
1601 else
1602 min_idx = fold_convert (TYPE_DOMAIN (array_type), min_idx);
1604 if (TREE_CODE (min_idx) != INTEGER_CST)
1605 return NULL_TREE;
1607 idx = fold_convert (TYPE_DOMAIN (array_type), idx);
1608 elt_offset = fold_convert (TYPE_DOMAIN (array_type), elt_offset);
1611 if (!integer_zerop (min_idx))
1612 idx = int_const_binop (PLUS_EXPR, idx, min_idx, 0);
1613 if (!integer_zerop (elt_offset))
1614 idx = int_const_binop (PLUS_EXPR, idx, elt_offset, 0);
1616 return build4 (ARRAY_REF, orig_type, base, idx, min_idx,
1617 size_int (tree_low_cst (elt_size, 1)
1618 / (TYPE_ALIGN_UNIT (elt_type))));
1622 /* A subroutine of fold_stmt_r. Attempts to fold *(S+O) to S.X.
1623 BASE is a record type. OFFSET is a byte displacement. ORIG_TYPE
1624 is the desired result type. */
1625 /* ??? This doesn't handle class inheritance. */
1627 static tree
1628 maybe_fold_offset_to_component_ref (tree record_type, tree base, tree offset,
1629 tree orig_type, bool base_is_ptr)
1631 tree f, t, field_type, tail_array_field, field_offset;
1633 if (TREE_CODE (record_type) != RECORD_TYPE
1634 && TREE_CODE (record_type) != UNION_TYPE
1635 && TREE_CODE (record_type) != QUAL_UNION_TYPE)
1636 return NULL_TREE;
1638 /* Short-circuit silly cases. */
1639 if (lang_hooks.types_compatible_p (record_type, orig_type))
1640 return NULL_TREE;
1642 tail_array_field = NULL_TREE;
1643 for (f = TYPE_FIELDS (record_type); f ; f = TREE_CHAIN (f))
1645 int cmp;
1647 if (TREE_CODE (f) != FIELD_DECL)
1648 continue;
1649 if (DECL_BIT_FIELD (f))
1650 continue;
1652 field_offset = byte_position (f);
1653 if (TREE_CODE (field_offset) != INTEGER_CST)
1654 continue;
1656 /* ??? Java creates "interesting" fields for representing base classes.
1657 They have no name, and have no context. With no context, we get into
1658 trouble with nonoverlapping_component_refs_p. Skip them. */
1659 if (!DECL_FIELD_CONTEXT (f))
1660 continue;
1662 /* The previous array field isn't at the end. */
1663 tail_array_field = NULL_TREE;
1665 /* Check to see if this offset overlaps with the field. */
1666 cmp = tree_int_cst_compare (field_offset, offset);
1667 if (cmp > 0)
1668 continue;
1670 field_type = TREE_TYPE (f);
1672 /* Here we exactly match the offset being checked. If the types match,
1673 then we can return that field. */
1674 if (cmp == 0
1675 && lang_hooks.types_compatible_p (orig_type, field_type))
1677 if (base_is_ptr)
1678 base = build1 (INDIRECT_REF, record_type, base);
1679 t = build3 (COMPONENT_REF, field_type, base, f, NULL_TREE);
1680 return t;
1683 /* Don't care about offsets into the middle of scalars. */
1684 if (!AGGREGATE_TYPE_P (field_type))
1685 continue;
1687 /* Check for array at the end of the struct. This is often
1688 used as for flexible array members. We should be able to
1689 turn this into an array access anyway. */
1690 if (TREE_CODE (field_type) == ARRAY_TYPE)
1691 tail_array_field = f;
1693 /* Check the end of the field against the offset. */
1694 if (!DECL_SIZE_UNIT (f)
1695 || TREE_CODE (DECL_SIZE_UNIT (f)) != INTEGER_CST)
1696 continue;
1697 t = int_const_binop (MINUS_EXPR, offset, field_offset, 1);
1698 if (!tree_int_cst_lt (t, DECL_SIZE_UNIT (f)))
1699 continue;
1701 /* If we matched, then set offset to the displacement into
1702 this field. */
1703 offset = t;
1704 goto found;
1707 if (!tail_array_field)
1708 return NULL_TREE;
1710 f = tail_array_field;
1711 field_type = TREE_TYPE (f);
1712 offset = int_const_binop (MINUS_EXPR, offset, byte_position (f), 1);
1714 found:
1715 /* If we get here, we've got an aggregate field, and a possibly
1716 nonzero offset into them. Recurse and hope for a valid match. */
1717 if (base_is_ptr)
1718 base = build1 (INDIRECT_REF, record_type, base);
1719 base = build3 (COMPONENT_REF, field_type, base, f, NULL_TREE);
1721 t = maybe_fold_offset_to_array_ref (base, offset, orig_type);
1722 if (t)
1723 return t;
1724 return maybe_fold_offset_to_component_ref (field_type, base, offset,
1725 orig_type, false);
1729 /* A subroutine of fold_stmt_r. Attempt to simplify *(BASE+OFFSET).
1730 Return the simplified expression, or NULL if nothing could be done. */
1732 static tree
1733 maybe_fold_stmt_indirect (tree expr, tree base, tree offset)
1735 tree t;
1737 /* We may well have constructed a double-nested PLUS_EXPR via multiple
1738 substitutions. Fold that down to one. Remove NON_LVALUE_EXPRs that
1739 are sometimes added. */
1740 base = fold (base);
1741 STRIP_TYPE_NOPS (base);
1742 TREE_OPERAND (expr, 0) = base;
1744 /* One possibility is that the address reduces to a string constant. */
1745 t = fold_read_from_constant_string (expr);
1746 if (t)
1747 return t;
1749 /* Add in any offset from a PLUS_EXPR. */
1750 if (TREE_CODE (base) == PLUS_EXPR)
1752 tree offset2;
1754 offset2 = TREE_OPERAND (base, 1);
1755 if (TREE_CODE (offset2) != INTEGER_CST)
1756 return NULL_TREE;
1757 base = TREE_OPERAND (base, 0);
1759 offset = int_const_binop (PLUS_EXPR, offset, offset2, 1);
1762 if (TREE_CODE (base) == ADDR_EXPR)
1764 /* Strip the ADDR_EXPR. */
1765 base = TREE_OPERAND (base, 0);
1767 /* Fold away CONST_DECL to its value, if the type is scalar. */
1768 if (TREE_CODE (base) == CONST_DECL
1769 && ccp_decl_initial_min_invariant (DECL_INITIAL (base)))
1770 return DECL_INITIAL (base);
1772 /* Try folding *(&B+O) to B[X]. */
1773 t = maybe_fold_offset_to_array_ref (base, offset, TREE_TYPE (expr));
1774 if (t)
1775 return t;
1777 /* Try folding *(&B+O) to B.X. */
1778 t = maybe_fold_offset_to_component_ref (TREE_TYPE (base), base, offset,
1779 TREE_TYPE (expr), false);
1780 if (t)
1781 return t;
1783 /* Fold *&B to B. We can only do this if EXPR is the same type
1784 as BASE. We can't do this if EXPR is the element type of an array
1785 and BASE is the array. */
1786 if (integer_zerop (offset)
1787 && lang_hooks.types_compatible_p (TREE_TYPE (base),
1788 TREE_TYPE (expr)))
1789 return base;
1791 else
1793 /* We can get here for out-of-range string constant accesses,
1794 such as "_"[3]. Bail out of the entire substitution search
1795 and arrange for the entire statement to be replaced by a
1796 call to __builtin_trap. In all likelihood this will all be
1797 constant-folded away, but in the meantime we can't leave with
1798 something that get_expr_operands can't understand. */
1800 t = base;
1801 STRIP_NOPS (t);
1802 if (TREE_CODE (t) == ADDR_EXPR
1803 && TREE_CODE (TREE_OPERAND (t, 0)) == STRING_CST)
1805 /* FIXME: Except that this causes problems elsewhere with dead
1806 code not being deleted, and we die in the rtl expanders
1807 because we failed to remove some ssa_name. In the meantime,
1808 just return zero. */
1809 /* FIXME2: This condition should be signaled by
1810 fold_read_from_constant_string directly, rather than
1811 re-checking for it here. */
1812 return integer_zero_node;
1815 /* Try folding *(B+O) to B->X. Still an improvement. */
1816 if (POINTER_TYPE_P (TREE_TYPE (base)))
1818 t = maybe_fold_offset_to_component_ref (TREE_TYPE (TREE_TYPE (base)),
1819 base, offset,
1820 TREE_TYPE (expr), true);
1821 if (t)
1822 return t;
1826 /* Otherwise we had an offset that we could not simplify. */
1827 return NULL_TREE;
1831 /* A subroutine of fold_stmt_r. EXPR is a PLUS_EXPR.
1833 A quaint feature extant in our address arithmetic is that there
1834 can be hidden type changes here. The type of the result need
1835 not be the same as the type of the input pointer.
1837 What we're after here is an expression of the form
1838 (T *)(&array + const)
1839 where the cast doesn't actually exist, but is implicit in the
1840 type of the PLUS_EXPR. We'd like to turn this into
1841 &array[x]
1842 which may be able to propagate further. */
1844 static tree
1845 maybe_fold_stmt_addition (tree expr)
1847 tree op0 = TREE_OPERAND (expr, 0);
1848 tree op1 = TREE_OPERAND (expr, 1);
1849 tree ptr_type = TREE_TYPE (expr);
1850 tree ptd_type;
1851 tree t;
1852 bool subtract = (TREE_CODE (expr) == MINUS_EXPR);
1854 /* We're only interested in pointer arithmetic. */
1855 if (!POINTER_TYPE_P (ptr_type))
1856 return NULL_TREE;
1857 /* Canonicalize the integral operand to op1. */
1858 if (INTEGRAL_TYPE_P (TREE_TYPE (op0)))
1860 if (subtract)
1861 return NULL_TREE;
1862 t = op0, op0 = op1, op1 = t;
1864 /* It had better be a constant. */
1865 if (TREE_CODE (op1) != INTEGER_CST)
1866 return NULL_TREE;
1867 /* The first operand should be an ADDR_EXPR. */
1868 if (TREE_CODE (op0) != ADDR_EXPR)
1869 return NULL_TREE;
1870 op0 = TREE_OPERAND (op0, 0);
1872 /* If the first operand is an ARRAY_REF, expand it so that we can fold
1873 the offset into it. */
1874 while (TREE_CODE (op0) == ARRAY_REF)
1876 tree array_obj = TREE_OPERAND (op0, 0);
1877 tree array_idx = TREE_OPERAND (op0, 1);
1878 tree elt_type = TREE_TYPE (op0);
1879 tree elt_size = TYPE_SIZE_UNIT (elt_type);
1880 tree min_idx;
1882 if (TREE_CODE (array_idx) != INTEGER_CST)
1883 break;
1884 if (TREE_CODE (elt_size) != INTEGER_CST)
1885 break;
1887 /* Un-bias the index by the min index of the array type. */
1888 min_idx = TYPE_DOMAIN (TREE_TYPE (array_obj));
1889 if (min_idx)
1891 min_idx = TYPE_MIN_VALUE (min_idx);
1892 if (min_idx)
1894 if (TREE_CODE (min_idx) != INTEGER_CST)
1895 break;
1897 array_idx = fold_convert (TREE_TYPE (min_idx), array_idx);
1898 if (!integer_zerop (min_idx))
1899 array_idx = int_const_binop (MINUS_EXPR, array_idx,
1900 min_idx, 0);
1904 /* Convert the index to a byte offset. */
1905 array_idx = fold_convert (sizetype, array_idx);
1906 array_idx = int_const_binop (MULT_EXPR, array_idx, elt_size, 0);
1908 /* Update the operands for the next round, or for folding. */
1909 /* If we're manipulating unsigned types, then folding into negative
1910 values can produce incorrect results. Particularly if the type
1911 is smaller than the width of the pointer. */
1912 if (subtract
1913 && TYPE_UNSIGNED (TREE_TYPE (op1))
1914 && tree_int_cst_lt (array_idx, op1))
1915 return NULL;
1916 op1 = int_const_binop (subtract ? MINUS_EXPR : PLUS_EXPR,
1917 array_idx, op1, 0);
1918 subtract = false;
1919 op0 = array_obj;
1922 /* If we weren't able to fold the subtraction into another array reference,
1923 canonicalize the integer for passing to the array and component ref
1924 simplification functions. */
1925 if (subtract)
1927 if (TYPE_UNSIGNED (TREE_TYPE (op1)))
1928 return NULL;
1929 op1 = fold_unary (NEGATE_EXPR, TREE_TYPE (op1), op1);
1930 /* ??? In theory fold should always produce another integer. */
1931 if (op1 == NULL || TREE_CODE (op1) != INTEGER_CST)
1932 return NULL;
1935 ptd_type = TREE_TYPE (ptr_type);
1937 /* At which point we can try some of the same things as for indirects. */
1938 t = maybe_fold_offset_to_array_ref (op0, op1, ptd_type);
1939 if (!t)
1940 t = maybe_fold_offset_to_component_ref (TREE_TYPE (op0), op0, op1,
1941 ptd_type, false);
1942 if (t)
1943 t = build1 (ADDR_EXPR, ptr_type, t);
1945 return t;
1948 /* For passing state through walk_tree into fold_stmt_r and its
1949 children. */
1951 struct fold_stmt_r_data
1953 bool *changed_p;
1954 bool *inside_addr_expr_p;
1957 /* Subroutine of fold_stmt called via walk_tree. We perform several
1958 simplifications of EXPR_P, mostly having to do with pointer arithmetic. */
1960 static tree
1961 fold_stmt_r (tree *expr_p, int *walk_subtrees, void *data)
1963 struct fold_stmt_r_data *fold_stmt_r_data = (struct fold_stmt_r_data *) data;
1964 bool *inside_addr_expr_p = fold_stmt_r_data->inside_addr_expr_p;
1965 bool *changed_p = fold_stmt_r_data->changed_p;
1966 tree expr = *expr_p, t;
1968 /* ??? It'd be nice if walk_tree had a pre-order option. */
1969 switch (TREE_CODE (expr))
1971 case INDIRECT_REF:
1972 t = walk_tree (&TREE_OPERAND (expr, 0), fold_stmt_r, data, NULL);
1973 if (t)
1974 return t;
1975 *walk_subtrees = 0;
1977 t = maybe_fold_stmt_indirect (expr, TREE_OPERAND (expr, 0),
1978 integer_zero_node);
1979 break;
1981 /* ??? Could handle more ARRAY_REFs here, as a variant of INDIRECT_REF.
1982 We'd only want to bother decomposing an existing ARRAY_REF if
1983 the base array is found to have another offset contained within.
1984 Otherwise we'd be wasting time. */
1985 case ARRAY_REF:
1986 /* If we are not processing expressions found within an
1987 ADDR_EXPR, then we can fold constant array references. */
1988 if (!*inside_addr_expr_p)
1989 t = fold_read_from_constant_string (expr);
1990 else
1991 t = NULL;
1992 break;
1994 case ADDR_EXPR:
1995 *inside_addr_expr_p = true;
1996 t = walk_tree (&TREE_OPERAND (expr, 0), fold_stmt_r, data, NULL);
1997 *inside_addr_expr_p = false;
1998 if (t)
1999 return t;
2000 *walk_subtrees = 0;
2002 /* Set TREE_INVARIANT properly so that the value is properly
2003 considered constant, and so gets propagated as expected. */
2004 if (*changed_p)
2005 recompute_tree_invariant_for_addr_expr (expr);
2006 return NULL_TREE;
2008 case PLUS_EXPR:
2009 case MINUS_EXPR:
2010 t = walk_tree (&TREE_OPERAND (expr, 0), fold_stmt_r, data, NULL);
2011 if (t)
2012 return t;
2013 t = walk_tree (&TREE_OPERAND (expr, 1), fold_stmt_r, data, NULL);
2014 if (t)
2015 return t;
2016 *walk_subtrees = 0;
2018 t = maybe_fold_stmt_addition (expr);
2019 break;
2021 case COMPONENT_REF:
2022 t = walk_tree (&TREE_OPERAND (expr, 0), fold_stmt_r, data, NULL);
2023 if (t)
2024 return t;
2025 *walk_subtrees = 0;
2027 /* Make sure the FIELD_DECL is actually a field in the type on the lhs.
2028 We've already checked that the records are compatible, so we should
2029 come up with a set of compatible fields. */
2031 tree expr_record = TREE_TYPE (TREE_OPERAND (expr, 0));
2032 tree expr_field = TREE_OPERAND (expr, 1);
2034 if (DECL_FIELD_CONTEXT (expr_field) != TYPE_MAIN_VARIANT (expr_record))
2036 expr_field = find_compatible_field (expr_record, expr_field);
2037 TREE_OPERAND (expr, 1) = expr_field;
2040 break;
2042 case TARGET_MEM_REF:
2043 t = maybe_fold_tmr (expr);
2044 break;
2046 default:
2047 return NULL_TREE;
2050 if (t)
2052 *expr_p = t;
2053 *changed_p = true;
2056 return NULL_TREE;
2060 /* Return the string length, maximum string length or maximum value of
2061 ARG in LENGTH.
2062 If ARG is an SSA name variable, follow its use-def chains. If LENGTH
2063 is not NULL and, for TYPE == 0, its value is not equal to the length
2064 we determine or if we are unable to determine the length or value,
2065 return false. VISITED is a bitmap of visited variables.
2066 TYPE is 0 if string length should be returned, 1 for maximum string
2067 length and 2 for maximum value ARG can have. */
2069 static bool
2070 get_maxval_strlen (tree arg, tree *length, bitmap visited, int type)
2072 tree var, def_stmt, val;
2074 if (TREE_CODE (arg) != SSA_NAME)
2076 if (type == 2)
2078 val = arg;
2079 if (TREE_CODE (val) != INTEGER_CST
2080 || tree_int_cst_sgn (val) < 0)
2081 return false;
2083 else
2084 val = c_strlen (arg, 1);
2085 if (!val)
2086 return false;
2088 if (*length)
2090 if (type > 0)
2092 if (TREE_CODE (*length) != INTEGER_CST
2093 || TREE_CODE (val) != INTEGER_CST)
2094 return false;
2096 if (tree_int_cst_lt (*length, val))
2097 *length = val;
2098 return true;
2100 else if (simple_cst_equal (val, *length) != 1)
2101 return false;
2104 *length = val;
2105 return true;
2108 /* If we were already here, break the infinite cycle. */
2109 if (bitmap_bit_p (visited, SSA_NAME_VERSION (arg)))
2110 return true;
2111 bitmap_set_bit (visited, SSA_NAME_VERSION (arg));
2113 var = arg;
2114 def_stmt = SSA_NAME_DEF_STMT (var);
2116 switch (TREE_CODE (def_stmt))
2118 case MODIFY_EXPR:
2120 tree rhs;
2122 /* The RHS of the statement defining VAR must either have a
2123 constant length or come from another SSA_NAME with a constant
2124 length. */
2125 rhs = TREE_OPERAND (def_stmt, 1);
2126 STRIP_NOPS (rhs);
2127 return get_maxval_strlen (rhs, length, visited, type);
2130 case PHI_NODE:
2132 /* All the arguments of the PHI node must have the same constant
2133 length. */
2134 int i;
2136 for (i = 0; i < PHI_NUM_ARGS (def_stmt); i++)
2138 tree arg = PHI_ARG_DEF (def_stmt, i);
2140 /* If this PHI has itself as an argument, we cannot
2141 determine the string length of this argument. However,
2142 if we can find a constant string length for the other
2143 PHI args then we can still be sure that this is a
2144 constant string length. So be optimistic and just
2145 continue with the next argument. */
2146 if (arg == PHI_RESULT (def_stmt))
2147 continue;
2149 if (!get_maxval_strlen (arg, length, visited, type))
2150 return false;
2153 return true;
2156 default:
2157 break;
2161 return false;
2165 /* Fold builtin call FN in statement STMT. If it cannot be folded into a
2166 constant, return NULL_TREE. Otherwise, return its constant value. */
2168 static tree
2169 ccp_fold_builtin (tree stmt, tree fn)
2171 tree result, val[3];
2172 tree callee, arglist, a;
2173 int arg_mask, i, type;
2174 bitmap visited;
2175 bool ignore;
2177 ignore = TREE_CODE (stmt) != MODIFY_EXPR;
2179 /* First try the generic builtin folder. If that succeeds, return the
2180 result directly. */
2181 callee = get_callee_fndecl (fn);
2182 arglist = TREE_OPERAND (fn, 1);
2183 result = fold_builtin (callee, arglist, ignore);
2184 if (result)
2186 if (ignore)
2187 STRIP_NOPS (result);
2188 return result;
2191 /* Ignore MD builtins. */
2192 if (DECL_BUILT_IN_CLASS (callee) == BUILT_IN_MD)
2193 return NULL_TREE;
2195 /* If the builtin could not be folded, and it has no argument list,
2196 we're done. */
2197 if (!arglist)
2198 return NULL_TREE;
2200 /* Limit the work only for builtins we know how to simplify. */
2201 switch (DECL_FUNCTION_CODE (callee))
2203 case BUILT_IN_STRLEN:
2204 case BUILT_IN_FPUTS:
2205 case BUILT_IN_FPUTS_UNLOCKED:
2206 arg_mask = 1;
2207 type = 0;
2208 break;
2209 case BUILT_IN_STRCPY:
2210 case BUILT_IN_STRNCPY:
2211 arg_mask = 2;
2212 type = 0;
2213 break;
2214 case BUILT_IN_MEMCPY_CHK:
2215 case BUILT_IN_MEMPCPY_CHK:
2216 case BUILT_IN_MEMMOVE_CHK:
2217 case BUILT_IN_MEMSET_CHK:
2218 case BUILT_IN_STRNCPY_CHK:
2219 arg_mask = 4;
2220 type = 2;
2221 break;
2222 case BUILT_IN_STRCPY_CHK:
2223 case BUILT_IN_STPCPY_CHK:
2224 arg_mask = 2;
2225 type = 1;
2226 break;
2227 case BUILT_IN_SNPRINTF_CHK:
2228 case BUILT_IN_VSNPRINTF_CHK:
2229 arg_mask = 2;
2230 type = 2;
2231 break;
2232 default:
2233 return NULL_TREE;
2236 /* Try to use the dataflow information gathered by the CCP process. */
2237 visited = BITMAP_ALLOC (NULL);
2239 memset (val, 0, sizeof (val));
2240 for (i = 0, a = arglist;
2241 arg_mask;
2242 i++, arg_mask >>= 1, a = TREE_CHAIN (a))
2243 if (arg_mask & 1)
2245 bitmap_clear (visited);
2246 if (!get_maxval_strlen (TREE_VALUE (a), &val[i], visited, type))
2247 val[i] = NULL_TREE;
2250 BITMAP_FREE (visited);
2252 result = NULL_TREE;
2253 switch (DECL_FUNCTION_CODE (callee))
2255 case BUILT_IN_STRLEN:
2256 if (val[0])
2258 tree new = fold_convert (TREE_TYPE (fn), val[0]);
2260 /* If the result is not a valid gimple value, or not a cast
2261 of a valid gimple value, then we can not use the result. */
2262 if (is_gimple_val (new)
2263 || (is_gimple_cast (new)
2264 && is_gimple_val (TREE_OPERAND (new, 0))))
2265 return new;
2267 break;
2269 case BUILT_IN_STRCPY:
2270 if (val[1] && is_gimple_val (val[1]))
2271 result = fold_builtin_strcpy (callee, arglist, val[1]);
2272 break;
2274 case BUILT_IN_STRNCPY:
2275 if (val[1] && is_gimple_val (val[1]))
2276 result = fold_builtin_strncpy (callee, arglist, val[1]);
2277 break;
2279 case BUILT_IN_FPUTS:
2280 result = fold_builtin_fputs (arglist,
2281 TREE_CODE (stmt) != MODIFY_EXPR, 0,
2282 val[0]);
2283 break;
2285 case BUILT_IN_FPUTS_UNLOCKED:
2286 result = fold_builtin_fputs (arglist,
2287 TREE_CODE (stmt) != MODIFY_EXPR, 1,
2288 val[0]);
2289 break;
2291 case BUILT_IN_MEMCPY_CHK:
2292 case BUILT_IN_MEMPCPY_CHK:
2293 case BUILT_IN_MEMMOVE_CHK:
2294 case BUILT_IN_MEMSET_CHK:
2295 if (val[2] && is_gimple_val (val[2]))
2296 result = fold_builtin_memory_chk (callee, arglist, val[2], ignore,
2297 DECL_FUNCTION_CODE (callee));
2298 break;
2300 case BUILT_IN_STRCPY_CHK:
2301 case BUILT_IN_STPCPY_CHK:
2302 if (val[1] && is_gimple_val (val[1]))
2303 result = fold_builtin_stxcpy_chk (callee, arglist, val[1], ignore,
2304 DECL_FUNCTION_CODE (callee));
2305 break;
2307 case BUILT_IN_STRNCPY_CHK:
2308 if (val[2] && is_gimple_val (val[2]))
2309 result = fold_builtin_strncpy_chk (arglist, val[2]);
2310 break;
2312 case BUILT_IN_SNPRINTF_CHK:
2313 case BUILT_IN_VSNPRINTF_CHK:
2314 if (val[1] && is_gimple_val (val[1]))
2315 result = fold_builtin_snprintf_chk (arglist, val[1],
2316 DECL_FUNCTION_CODE (callee));
2317 break;
2319 default:
2320 gcc_unreachable ();
2323 if (result && ignore)
2324 result = fold_ignored_result (result);
2325 return result;
2329 /* Fold the statement pointed to by STMT_P. In some cases, this function may
2330 replace the whole statement with a new one. Returns true iff folding
2331 makes any changes. */
2333 bool
2334 fold_stmt (tree *stmt_p)
2336 tree rhs, result, stmt;
2337 struct fold_stmt_r_data fold_stmt_r_data;
2338 bool changed = false;
2339 bool inside_addr_expr = false;
2341 fold_stmt_r_data.changed_p = &changed;
2342 fold_stmt_r_data.inside_addr_expr_p = &inside_addr_expr;
2344 stmt = *stmt_p;
2346 /* If we replaced constants and the statement makes pointer dereferences,
2347 then we may need to fold instances of *&VAR into VAR, etc. */
2348 if (walk_tree (stmt_p, fold_stmt_r, &fold_stmt_r_data, NULL))
2350 *stmt_p
2351 = build_function_call_expr (implicit_built_in_decls[BUILT_IN_TRAP],
2352 NULL);
2353 return true;
2356 rhs = get_rhs (stmt);
2357 if (!rhs)
2358 return changed;
2359 result = NULL_TREE;
2361 if (TREE_CODE (rhs) == CALL_EXPR)
2363 tree callee;
2365 /* Check for builtins that CCP can handle using information not
2366 available in the generic fold routines. */
2367 callee = get_callee_fndecl (rhs);
2368 if (callee && DECL_BUILT_IN (callee))
2369 result = ccp_fold_builtin (stmt, rhs);
2370 else
2372 /* Check for resolvable OBJ_TYPE_REF. The only sorts we can resolve
2373 here are when we've propagated the address of a decl into the
2374 object slot. */
2375 /* ??? Should perhaps do this in fold proper. However, doing it
2376 there requires that we create a new CALL_EXPR, and that requires
2377 copying EH region info to the new node. Easier to just do it
2378 here where we can just smash the call operand. Also
2379 CALL_EXPR_RETURN_SLOT_OPT needs to be handled correctly and
2380 copied, fold_ternary does not have not information. */
2381 callee = TREE_OPERAND (rhs, 0);
2382 if (TREE_CODE (callee) == OBJ_TYPE_REF
2383 && lang_hooks.fold_obj_type_ref
2384 && TREE_CODE (OBJ_TYPE_REF_OBJECT (callee)) == ADDR_EXPR
2385 && DECL_P (TREE_OPERAND
2386 (OBJ_TYPE_REF_OBJECT (callee), 0)))
2388 tree t;
2390 /* ??? Caution: Broken ADDR_EXPR semantics means that
2391 looking at the type of the operand of the addr_expr
2392 can yield an array type. See silly exception in
2393 check_pointer_types_r. */
2395 t = TREE_TYPE (TREE_TYPE (OBJ_TYPE_REF_OBJECT (callee)));
2396 t = lang_hooks.fold_obj_type_ref (callee, t);
2397 if (t)
2399 TREE_OPERAND (rhs, 0) = t;
2400 changed = true;
2406 /* If we couldn't fold the RHS, hand over to the generic fold routines. */
2407 if (result == NULL_TREE)
2408 result = fold (rhs);
2410 /* Strip away useless type conversions. Both the NON_LVALUE_EXPR that
2411 may have been added by fold, and "useless" type conversions that might
2412 now be apparent due to propagation. */
2413 STRIP_USELESS_TYPE_CONVERSION (result);
2415 if (result != rhs)
2416 changed |= set_rhs (stmt_p, result);
2418 return changed;
2421 /* Perform the minimal folding on statement STMT. Only operations like
2422 *&x created by constant propagation are handled. The statement cannot
2423 be replaced with a new one. */
2425 bool
2426 fold_stmt_inplace (tree stmt)
2428 tree old_stmt = stmt, rhs, new_rhs;
2429 struct fold_stmt_r_data fold_stmt_r_data;
2430 bool changed = false;
2431 bool inside_addr_expr = false;
2433 fold_stmt_r_data.changed_p = &changed;
2434 fold_stmt_r_data.inside_addr_expr_p = &inside_addr_expr;
2436 walk_tree (&stmt, fold_stmt_r, &fold_stmt_r_data, NULL);
2437 gcc_assert (stmt == old_stmt);
2439 rhs = get_rhs (stmt);
2440 if (!rhs || rhs == stmt)
2441 return changed;
2443 new_rhs = fold (rhs);
2444 STRIP_USELESS_TYPE_CONVERSION (new_rhs);
2445 if (new_rhs == rhs)
2446 return changed;
2448 changed |= set_rhs (&stmt, new_rhs);
2449 gcc_assert (stmt == old_stmt);
2451 return changed;
2454 /* Convert EXPR into a GIMPLE value suitable for substitution on the
2455 RHS of an assignment. Insert the necessary statements before
2456 iterator *SI_P. */
2458 static tree
2459 convert_to_gimple_builtin (block_stmt_iterator *si_p, tree expr)
2461 tree_stmt_iterator ti;
2462 tree stmt = bsi_stmt (*si_p);
2463 tree tmp, stmts = NULL;
2465 push_gimplify_context ();
2466 tmp = get_initialized_tmp_var (expr, &stmts, NULL);
2467 pop_gimplify_context (NULL);
2469 if (EXPR_HAS_LOCATION (stmt))
2470 annotate_all_with_locus (&stmts, EXPR_LOCATION (stmt));
2472 /* The replacement can expose previously unreferenced variables. */
2473 for (ti = tsi_start (stmts); !tsi_end_p (ti); tsi_next (&ti))
2475 tree new_stmt = tsi_stmt (ti);
2476 find_new_referenced_vars (tsi_stmt_ptr (ti));
2477 bsi_insert_before (si_p, new_stmt, BSI_NEW_STMT);
2478 mark_new_vars_to_rename (bsi_stmt (*si_p));
2479 bsi_next (si_p);
2482 return tmp;
2486 /* A simple pass that attempts to fold all builtin functions. This pass
2487 is run after we've propagated as many constants as we can. */
2489 static unsigned int
2490 execute_fold_all_builtins (void)
2492 bool cfg_changed = false;
2493 basic_block bb;
2494 FOR_EACH_BB (bb)
2496 block_stmt_iterator i;
2497 for (i = bsi_start (bb); !bsi_end_p (i); )
2499 tree *stmtp = bsi_stmt_ptr (i);
2500 tree old_stmt = *stmtp;
2501 tree call = get_rhs (*stmtp);
2502 tree callee, result;
2503 enum built_in_function fcode;
2505 if (!call || TREE_CODE (call) != CALL_EXPR)
2507 bsi_next (&i);
2508 continue;
2510 callee = get_callee_fndecl (call);
2511 if (!callee || DECL_BUILT_IN_CLASS (callee) != BUILT_IN_NORMAL)
2513 bsi_next (&i);
2514 continue;
2516 fcode = DECL_FUNCTION_CODE (callee);
2518 result = ccp_fold_builtin (*stmtp, call);
2519 if (!result)
2520 switch (DECL_FUNCTION_CODE (callee))
2522 case BUILT_IN_CONSTANT_P:
2523 /* Resolve __builtin_constant_p. If it hasn't been
2524 folded to integer_one_node by now, it's fairly
2525 certain that the value simply isn't constant. */
2526 result = integer_zero_node;
2527 break;
2529 default:
2530 bsi_next (&i);
2531 continue;
2534 if (dump_file && (dump_flags & TDF_DETAILS))
2536 fprintf (dump_file, "Simplified\n ");
2537 print_generic_stmt (dump_file, *stmtp, dump_flags);
2540 if (!set_rhs (stmtp, result))
2542 result = convert_to_gimple_builtin (&i, result);
2543 if (result)
2545 bool ok = set_rhs (stmtp, result);
2547 gcc_assert (ok);
2550 mark_new_vars_to_rename (*stmtp);
2551 if (maybe_clean_or_replace_eh_stmt (old_stmt, *stmtp)
2552 && tree_purge_dead_eh_edges (bb))
2553 cfg_changed = true;
2555 if (dump_file && (dump_flags & TDF_DETAILS))
2557 fprintf (dump_file, "to\n ");
2558 print_generic_stmt (dump_file, *stmtp, dump_flags);
2559 fprintf (dump_file, "\n");
2562 /* Retry the same statement if it changed into another
2563 builtin, there might be new opportunities now. */
2564 call = get_rhs (*stmtp);
2565 if (!call || TREE_CODE (call) != CALL_EXPR)
2567 bsi_next (&i);
2568 continue;
2570 callee = get_callee_fndecl (call);
2571 if (!callee
2572 || DECL_BUILT_IN_CLASS (callee) != BUILT_IN_NORMAL
2573 || DECL_FUNCTION_CODE (callee) == fcode)
2574 bsi_next (&i);
2578 /* Delete unreachable blocks. */
2579 if (cfg_changed)
2580 cleanup_tree_cfg ();
2581 return 0;
2585 struct tree_opt_pass pass_fold_builtins =
2587 "fab", /* name */
2588 NULL, /* gate */
2589 execute_fold_all_builtins, /* execute */
2590 NULL, /* sub */
2591 NULL, /* next */
2592 0, /* static_pass_number */
2593 0, /* tv_id */
2594 PROP_cfg | PROP_ssa | PROP_alias, /* properties_required */
2595 0, /* properties_provided */
2596 0, /* properties_destroyed */
2597 0, /* todo_flags_start */
2598 TODO_dump_func
2599 | TODO_verify_ssa
2600 | TODO_update_ssa, /* todo_flags_finish */
2601 0 /* letter */