2008-05-07 Kai Tietz <kai.tietz@onevision.com>
[official-gcc.git] / gcc / haifa-sched.c
/* Instruction scheduling pass.
   Copyright (C) 1992, 1993, 1994, 1995, 1996, 1997, 1998, 1999, 2000,
   2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008
   Free Software Foundation, Inc.
   Contributed by Michael Tiemann (tiemann@cygnus.com) Enhanced by,
   and currently maintained by, Jim Wilson (wilson@cygnus.com)

This file is part of GCC.

GCC is free software; you can redistribute it and/or modify it under
the terms of the GNU General Public License as published by the Free
Software Foundation; either version 3, or (at your option) any later
version.

GCC is distributed in the hope that it will be useful, but WITHOUT ANY
WARRANTY; without even the implied warranty of MERCHANTABILITY or
FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
for more details.

You should have received a copy of the GNU General Public License
along with GCC; see the file COPYING3. If not see
<http://www.gnu.org/licenses/>. */

/* Instruction scheduling pass. This file, along with sched-deps.c,
   contains the generic parts. The actual entry point for the
   normal instruction scheduling pass is found in sched-rgn.c.

   We compute insn priorities based on data dependencies. Flow
   analysis only creates a fraction of the data-dependencies we must
   observe: namely, only those dependencies which the combiner can be
   expected to use. For this pass, we must therefore create the
   remaining dependencies we need to observe. Register dependencies,
   memory dependencies, dependencies to keep function calls in order,
   and the dependence between a conditional branch and the setting of
   condition codes are all dealt with here.

   The scheduler first traverses the data flow graph, starting with
   the last instruction, and proceeding to the first, assigning values
   to insn_priority as it goes. This sorts the instructions
   topologically by data dependence.

   Once priorities have been established, we order the insns using
   list scheduling. This works as follows: starting with a list of
   all the ready insns, sorted according to priority number, we
   schedule the insn from the end of the list by placing its
   predecessors in the list according to their priority order. We
   consider this insn scheduled by setting the pointer to the "end" of
   the list to point to the previous insn. When an insn has no
   predecessors, we either queue it until sufficient time has elapsed
   or add it to the ready list. As the instructions are scheduled or
   when stalls are introduced, the queue advances and dumps insns into
   the ready list. When all insns down to the lowest priority have
   been scheduled, the critical path of the basic block has been made
   as short as possible. The remaining insns are then scheduled in
   remaining slots.

   The following list shows the order in which we want to break ties
   among insns in the ready list:

   1. choose insn with the longest path to end of bb, ties
   broken by
   2. choose insn with least contribution to register pressure,
   ties broken by
   3. prefer in-block upon interblock motion, ties broken by
   4. prefer useful upon speculative motion, ties broken by
   5. choose insn with largest control flow probability, ties
   broken by
   6. choose insn with the least dependences upon the previously
   scheduled insn, ties broken by
   7. choose the insn which has the most insns dependent on it,
   or finally
   8. choose insn with lowest UID.

   Memory references complicate matters. Only if we can be certain
   that memory references are not part of the data dependency graph
   (via true, anti, or output dependence), can we move operations past
   memory references. To first approximation, reads can be done
   independently, while writes introduce dependencies. Better
   approximations will yield fewer dependencies.

   Before reload, an extended analysis of interblock data dependences
   is required for interblock scheduling. This is performed in
   compute_block_backward_dependences ().

   Dependencies set up by memory references are treated in exactly the
   same way as other dependencies, by using insn backward dependences
   INSN_BACK_DEPS. INSN_BACK_DEPS are translated into forward dependences
   INSN_FORW_DEPS for the purpose of forward list scheduling.

   Having optimized the critical path, we may have also unduly
   extended the lifetimes of some registers. If an operation requires
   that constants be loaded into registers, it is certainly desirable
   to load those constants as early as necessary, but no earlier.
   I.e., it will not do to load up a bunch of registers at the
   beginning of a basic block only to use them at the end, if they
   could be loaded later, since this may result in excessive register
   utilization.

   Note that since branches are never in basic blocks, but only end
   basic blocks, this pass will not move branches. But that is ok,
   since we can use GNU's delayed branch scheduling pass to take care
   of this case.

   Also note that no further optimizations based on algebraic
   identities are performed, so this pass would be a good one to
   perform instruction splitting, such as breaking up a multiply
   instruction into shifts and adds where that is profitable.

   Given the memory aliasing analysis that this pass should perform,
   it should be possible to remove redundant stores to memory, and to
   load values from registers instead of hitting memory.

   Before reload, speculative insns are moved only if a 'proof' exists
   that no exception will be caused by this, and if no live registers
   exist that inhibit the motion (live registers constraints are not
   represented by data dependence edges).

   This pass must update information that subsequent passes expect to
   be correct. Namely: reg_n_refs, reg_n_sets, reg_n_deaths,
   reg_n_calls_crossed, and reg_live_length. Also, BB_HEAD, BB_END.

   The information in the line number notes is carefully retained by
   this pass. Notes that refer to the starting and ending of
   exception regions are also carefully retained by this pass. All
   other NOTE insns are grouped in their same relative order at the
   beginning of basic blocks and regions that have been scheduled. */
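
/* Illustration only, not part of the scheduler. The backward priority
   traversal described in the comment above can be sketched on a tiny
   dependence graph. The names toy_insn and toy_priority are invented
   for this example, and the sketch assumes that the cost of every
   dependence equals the latency of the producing insn, which is a
   simplification of what the pass itself does.

```c
#include <assert.h>

#define TOY_MAX_SUCCS 4

// Hypothetical stand-in for an insn: a latency plus the insns that
// depend on it.
struct toy_insn
{
  int cost;                   // cycles from issue to result
  int n_succs;                // number of dependent insns
  int succs[TOY_MAX_SUCCS];   // indices of the dependent insns
  int priority;               // cached result, -1 while unknown
};

// Priority of insn I: the length of the longest latency-weighted path
// from I to the end of the block.  Results are cached in the insn.
static int
toy_priority (struct toy_insn *insns, int i)
{
  struct toy_insn *insn = &insns[i];
  int k;

  assert (insn->n_succs >= 0 && insn->n_succs <= TOY_MAX_SUCCS);

  if (insn->priority >= 0)
    return insn->priority;

  if (insn->n_succs == 0)
    insn->priority = insn->cost;
  else
    {
      int best = 0;

      for (k = 0; k < insn->n_succs; k++)
        {
          int p = insn->cost + toy_priority (insns, insn->succs[k]);

          if (p > best)
            best = p;
        }
      insn->priority = best;
    }

  return insn->priority;
}
```

   An insn with no dependents gets its own latency as priority; every
   other insn gets its latency plus the largest priority among its
   dependents, so insns on the critical path sort highest. */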

#include "config.h"
#include "system.h"
#include "coretypes.h"
#include "tm.h"
#include "toplev.h"
#include "rtl.h"
#include "tm_p.h"
#include "hard-reg-set.h"
#include "regs.h"
#include "function.h"
#include "flags.h"
#include "insn-config.h"
#include "insn-attr.h"
#include "except.h"
#include "toplev.h"
#include "recog.h"
#include "sched-int.h"
#include "target.h"
#include "output.h"
#include "params.h"
#include "dbgcnt.h"

#ifdef INSN_SCHEDULING

/* issue_rate is the number of insns that can be scheduled in the same
   machine cycle. It can be defined in the config/mach/mach.h file,
   otherwise we set it to 1. */

static int issue_rate;

/* sched-verbose controls the amount of debugging output the
   scheduler prints. It is controlled by -fsched-verbose=N:
   N>0 and no -dSR : the output is directed to stderr.
   N>=10 will direct the printouts to stderr (regardless of -dSR).
   N=1: same as -dSR.
   N=2: bb's probabilities, detailed ready list info, unit/insn info.
   N=3: rtl at abort point, control-flow, regions info.
   N=5: dependences info. */

static int sched_verbose_param = 0;
int sched_verbose = 0;

/* Debugging file. All printouts are sent to dump, which is always set,
   either to stderr, or to the dump listing file (-dRS). */
FILE *sched_dump = 0;

/* Highest uid before scheduling. */
static int old_max_uid;

/* fix_sched_param() is called from toplev.c upon detection
   of the -fsched-verbose=N option. */

void
fix_sched_param (const char *param, const char *val)
{
  if (!strcmp (param, "verbose"))
    sched_verbose_param = atoi (val);
  else
    warning (0, "fix_sched_param: unknown param: %s", param);
}

struct haifa_insn_data *h_i_d;

#define INSN_TICK(INSN) (h_i_d[INSN_UID (INSN)].tick)
#define INTER_TICK(INSN) (h_i_d[INSN_UID (INSN)].inter_tick)

/* If INSN_TICK of an instruction is equal to INVALID_TICK,
   then it should be recalculated from scratch. */
#define INVALID_TICK (-(max_insn_queue_index + 1))
/* The minimal value of the INSN_TICK of an instruction. */
#define MIN_TICK (-max_insn_queue_index)

/* Issue points are used to distinguish between instructions in max_issue ().
   For now, all instructions are equally good. */
#define ISSUE_POINTS(INSN) 1

/* List of important notes we must keep around. This is a pointer to the
   last element in the list. */
static rtx note_list;

static struct spec_info_def spec_info_var;
/* Description of the speculative part of the scheduling.
   If NULL - no speculation. */
spec_info_t spec_info;

/* True, if recovery block was added during scheduling of current block.
   Used to determine, if we need to fix INSN_TICKs. */
static bool haifa_recovery_bb_recently_added_p;

/* True, if recovery block was added during this scheduling pass.
   Used to determine if we should have empty memory pools of dependencies
   after finishing current region. */
bool haifa_recovery_bb_ever_added_p;

/* Counters of different types of speculative instructions. */
static int nr_begin_data, nr_be_in_data, nr_begin_control, nr_be_in_control;

/* Array used in {unlink, restore}_bb_notes. */
static rtx *bb_header = 0;

/* Number of basic_blocks. */
static int old_last_basic_block;

/* Basic block after which recovery blocks will be created. */
static basic_block before_recovery;

/* Queues, etc. */

/* An instruction is ready to be scheduled when all insns preceding it
   have already been scheduled. It is important to ensure that all
   insns which use its result will not be executed until its result
   has been computed. An insn is maintained in one of four structures:

   (P) the "Pending" set of insns which cannot be scheduled until
   their dependencies have been satisfied.
   (Q) the "Queued" set of insns that can be scheduled when sufficient
   time has passed.
   (R) the "Ready" list of unscheduled, uncommitted insns.
   (S) the "Scheduled" list of insns.

   Initially, all insns are either "Pending" or "Ready" depending on
   whether their dependencies are satisfied.

   Insns move from the "Ready" list to the "Scheduled" list as they
   are committed to the schedule. As this occurs, the insns in the
   "Pending" list have their dependencies satisfied and move to either
   the "Ready" list or the "Queued" set depending on whether
   sufficient time has passed to make them ready. As time passes,
   insns move from the "Queued" set to the "Ready" list.

   The "Pending" list (P) consists of the insns in the INSN_FORW_DEPS
   of the unscheduled insns, i.e., those that are ready, queued, and
   pending.
   The "Queued" set (Q) is implemented by the variable `insn_queue'.
   The "Ready" list (R) is implemented by the variables `ready' and
   `n_ready'.
   The "Scheduled" list (S) is the new insn chain built by this pass.

   The transition (R->S) is implemented in the scheduling loop in
   `schedule_block' when the best insn to schedule is chosen.
   The transitions (P->R and P->Q) are implemented in `schedule_insn' as
   insns move from the ready list to the scheduled list.
   The transition (Q->R) is implemented in `queue_to_ready' as time
   passes or stalls are introduced. */

/* Implement a circular buffer to delay instructions until sufficient
   time has passed. For the new pipeline description interface,
   MAX_INSN_QUEUE_INDEX is a power of two minus one which is not less
   than the maximal instruction execution time, computed by genattr.c
   from the maximal time of functional unit reservations and of getting
   a result. This is the longest time an insn may be queued. */

static rtx *insn_queue;
static int q_ptr = 0;
static int q_size = 0;
#define NEXT_Q(X) (((X)+1) & max_insn_queue_index)
#define NEXT_Q_AFTER(X, C) (((X)+(C)) & max_insn_queue_index)
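
/* Illustration only, not part of the scheduler. Because the queue size
   is a power of two, the wrap-around in NEXT_Q and NEXT_Q_AFTER is a
   single bitwise AND. The sketch below assumes a hypothetical queue of
   8 buckets that only counts waiting insns; toy_queue, toy_queue_insn
   and toy_advance_cycle are invented names.

```c
#include <assert.h>

// The mask plays the role of max_insn_queue_index and must be a
// power of two minus one.
#define TOY_QUEUE_MASK 7
#define TOY_NEXT_Q(X) (((X) + 1) & TOY_QUEUE_MASK)
#define TOY_NEXT_Q_AFTER(X, C) (((X) + (C)) & TOY_QUEUE_MASK)

struct toy_queue
{
  int count[TOY_QUEUE_MASK + 1];   // insns waiting in each bucket
  int q_ptr;                       // bucket of the current cycle
};

// Queue one insn so that it becomes ready N_CYCLES from now.
// Returns the bucket it was placed in.
static int
toy_queue_insn (struct toy_queue *q, int n_cycles)
{
  int slot;

  assert (n_cycles >= 0 && n_cycles <= TOY_QUEUE_MASK);
  slot = TOY_NEXT_Q_AFTER (q->q_ptr, n_cycles);
  q->count[slot]++;
  return slot;
}

// Advance the clock by one cycle.  Returns the number of insns whose
// delay has expired and which would now move to the ready list.
static int
toy_advance_cycle (struct toy_queue *q)
{
  int released;

  q->q_ptr = TOY_NEXT_Q (q->q_ptr);
  released = q->count[q->q_ptr];
  q->count[q->q_ptr] = 0;
  return released;
}
```

   With q_ptr at 6, an insn queued for 3 cycles lands in bucket
   (6 + 3) & 7 == 1 and is released on the third call to
   toy_advance_cycle. */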

#define QUEUE_SCHEDULED (-3)
#define QUEUE_NOWHERE (-2)
#define QUEUE_READY (-1)
/* QUEUE_SCHEDULED - INSN is scheduled.
   QUEUE_NOWHERE - INSN isn't scheduled yet and is neither in
   queue or ready list.
   QUEUE_READY - INSN is in ready list.
   N >= 0 - INSN queued for X [where NEXT_Q_AFTER (q_ptr, X) == N] cycles. */

#define QUEUE_INDEX(INSN) (h_i_d[INSN_UID (INSN)].queue_index)

/* The following variable value refers to all current and future
   reservations of the processor units. */
state_t curr_state;

/* The following variable value is size of memory representing all
   current and future reservations of the processor units. */
static size_t dfa_state_size;

/* The following array is used to find the best insn from ready when
   the automaton pipeline interface is used. */
static char *ready_try;

/* Describe the ready list of the scheduler.
   VEC holds space enough for all insns in the current region. VECLEN
   says how many exactly.
   FIRST is the index of the element with the highest priority; i.e. the
   last one in the ready list, since elements are ordered by ascending
   priority.
   N_READY determines how many insns are on the ready list. */

struct ready_list
{
  rtx *vec;
  int veclen;
  int first;
  int n_ready;
};

/* The pointer to the ready list. */
static struct ready_list *readyp;
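
/* Illustration only, not part of the scheduler. The layout described
   for struct ready_list can be sketched with plain ints standing in for
   insns: the occupied slots are vec[first - n_ready + 1 .. first], with
   the highest-priority element at vec[first]. All toy_* names are
   invented, and the shifting policy here is a simplified assumption
   rather than a copy of the real ready_add ().

```c
#include <assert.h>
#include <string.h>

#define TOY_VECLEN 8

struct toy_ready
{
  int vec[TOY_VECLEN];
  int first;     // index of the highest-priority element
  int n_ready;   // number of occupied slots
};

static void
toy_ready_init (struct toy_ready *r)
{
  r->first = TOY_VECLEN - 1;
  r->n_ready = 0;
}

// Add V as the new lowest-priority element, just below the others.
static void
toy_ready_add_last (struct toy_ready *r, int v)
{
  assert (r->n_ready < TOY_VECLEN);
  if (r->first - r->n_ready < 0)
    {
      // No room below: slide the occupied slots to the top of VEC.
      memmove (r->vec + TOY_VECLEN - r->n_ready,
               r->vec + r->first - r->n_ready + 1,
               r->n_ready * sizeof (int));
      r->first = TOY_VECLEN - 1;
    }
  r->vec[r->first - r->n_ready] = v;
  r->n_ready++;
}

// Remove and return the highest-priority element.
static int
toy_ready_remove_first (struct toy_ready *r)
{
  int v;

  assert (r->n_ready > 0);
  v = r->vec[r->first--];
  r->n_ready--;
  // If the list becomes empty, reset it.
  if (r->n_ready == 0)
    r->first = TOY_VECLEN - 1;
  return v;
}
```

   Removing from the top only decrements FIRST, so no elements move on
   the common path; a memmove is needed only when the occupied region
   has drifted down to index 0. */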

/* Scheduling clock. */
static int clock_var;

/* Number of instructions in current scheduling region. */
static int rgn_n_insns;

static int may_trap_exp (const_rtx, int);

/* Nonzero iff the address is comprised of at most 1 register. */
#define CONST_BASED_ADDRESS_P(x) \
  (REG_P (x) \
   || ((GET_CODE (x) == PLUS || GET_CODE (x) == MINUS \
        || (GET_CODE (x) == LO_SUM)) \
       && (CONSTANT_P (XEXP (x, 0)) \
           || CONSTANT_P (XEXP (x, 1)))))

/* Returns a class that insn with GET_DEST(insn)=x may belong to,
   as found by analyzing insn's expression. */

static int
may_trap_exp (const_rtx x, int is_store)
{
  enum rtx_code code;

  if (x == 0)
    return TRAP_FREE;
  code = GET_CODE (x);
  if (is_store)
    {
      if (code == MEM && may_trap_p (x))
        return TRAP_RISKY;
      else
        return TRAP_FREE;
    }
  if (code == MEM)
    {
      /* The insn uses memory: a volatile load. */
      if (MEM_VOLATILE_P (x))
        return IRISKY;
      /* An exception-free load. */
      if (!may_trap_p (x))
        return IFREE;
      /* A load with 1 base register, to be further checked. */
      if (CONST_BASED_ADDRESS_P (XEXP (x, 0)))
        return PFREE_CANDIDATE;
      /* No info on the load, to be further checked. */
      return PRISKY_CANDIDATE;
    }
  else
    {
      const char *fmt;
      int i, insn_class = TRAP_FREE;

      /* Neither store nor load, check if it may cause a trap. */
      if (may_trap_p (x))
        return TRAP_RISKY;
      /* Recursive step: walk the insn... */
      fmt = GET_RTX_FORMAT (code);
      for (i = GET_RTX_LENGTH (code) - 1; i >= 0; i--)
        {
          if (fmt[i] == 'e')
            {
              int tmp_class = may_trap_exp (XEXP (x, i), is_store);
              insn_class = WORST_CLASS (insn_class, tmp_class);
            }
          else if (fmt[i] == 'E')
            {
              int j;
              for (j = 0; j < XVECLEN (x, i); j++)
                {
                  int tmp_class = may_trap_exp (XVECEXP (x, i, j), is_store);
                  insn_class = WORST_CLASS (insn_class, tmp_class);
                  if (insn_class == TRAP_RISKY || insn_class == IRISKY)
                    break;
                }
            }
          if (insn_class == TRAP_RISKY || insn_class == IRISKY)
            break;
        }
      return insn_class;
    }
}

/* Classifies rtx X of an insn for the purpose of verifying that X can be
   executed speculatively (and consequently the insn can be moved
   speculatively), by examining X, returning:
   TRAP_RISKY: store, or risky non-load insn (e.g. division by variable).
   TRAP_FREE: non-load insn.
   IFREE: load from a globally safe location.
   IRISKY: volatile load.
   PFREE_CANDIDATE, PRISKY_CANDIDATE: loads that need to be checked for
   being either PFREE or PRISKY. */

static int
haifa_classify_rtx (const_rtx x)
{
  int tmp_class = TRAP_FREE;
  int insn_class = TRAP_FREE;
  enum rtx_code code;

  if (GET_CODE (x) == PARALLEL)
    {
      int i, len = XVECLEN (x, 0);

      for (i = len - 1; i >= 0; i--)
        {
          tmp_class = haifa_classify_rtx (XVECEXP (x, 0, i));
          insn_class = WORST_CLASS (insn_class, tmp_class);
          if (insn_class == TRAP_RISKY || insn_class == IRISKY)
            break;
        }
    }
  else
    {
      code = GET_CODE (x);
      switch (code)
        {
        case CLOBBER:
          /* Test if it is a 'store'. */
          tmp_class = may_trap_exp (XEXP (x, 0), 1);
          break;
        case SET:
          /* Test if it is a store. */
          tmp_class = may_trap_exp (SET_DEST (x), 1);
          if (tmp_class == TRAP_RISKY)
            break;
          /* Test if it is a load. */
          tmp_class =
            WORST_CLASS (tmp_class,
                         may_trap_exp (SET_SRC (x), 0));
          break;
        case COND_EXEC:
          tmp_class = haifa_classify_rtx (COND_EXEC_CODE (x));
          if (tmp_class == TRAP_RISKY)
            break;
          tmp_class = WORST_CLASS (tmp_class,
                                   may_trap_exp (COND_EXEC_TEST (x), 0));
          break;
        case TRAP_IF:
          tmp_class = TRAP_RISKY;
          break;
        default:;
        }
      insn_class = tmp_class;
    }

  return insn_class;
}

int
haifa_classify_insn (const_rtx insn)
{
  return haifa_classify_rtx (PATTERN (insn));
}

/* A typedef for rtx vector. */
typedef VEC(rtx, heap) *rtx_vec_t;

/* Forward declarations. */

static int priority (rtx);
static int rank_for_schedule (const void *, const void *);
static void swap_sort (rtx *, int);
static void queue_insn (rtx, int);
static int schedule_insn (rtx);
static int find_set_reg_weight (const_rtx);
static void find_insn_reg_weight (basic_block);
static void find_insn_reg_weight1 (rtx);
static void adjust_priority (rtx);
static void advance_one_cycle (void);

/* Notes handling mechanism:
   =========================
   Generally, NOTES are saved before scheduling and restored after scheduling.
   The scheduler distinguishes between two types of notes:

   (1) LOOP_BEGIN, LOOP_END, SETJMP, EHREGION_BEG, EHREGION_END notes:
   Before scheduling a region, a pointer to the note is added to the insn
   that follows or precedes it. (This happens as part of the data dependence
   computation). After scheduling an insn, the pointer contained in it is
   used for regenerating the corresponding note (in reemit_notes).

   (2) All other notes (e.g. INSN_DELETED): Before scheduling a block,
   these notes are put in a list (in rm_other_notes() and
   unlink_other_notes ()). After scheduling the block, these notes are
   inserted at the beginning of the block (in schedule_block()). */

static rtx unlink_other_notes (rtx, rtx);
static void reemit_notes (rtx);

static rtx *ready_lastpos (struct ready_list *);
static void ready_add (struct ready_list *, rtx, bool);
static void ready_sort (struct ready_list *);
static rtx ready_remove_first (struct ready_list *);

static void queue_to_ready (struct ready_list *);
static int early_queue_to_ready (state_t, struct ready_list *);

static void debug_ready_list (struct ready_list *);

static void move_insn (rtx);

/* The following functions are used to implement multi-pass scheduling
   on the first cycle. */
static rtx ready_element (struct ready_list *, int);
static rtx ready_remove (struct ready_list *, int);
static void ready_remove_insn (rtx);
static int max_issue (struct ready_list *, int *, int);

static int choose_ready (struct ready_list *, rtx *);

static void fix_inter_tick (rtx, rtx);
static int fix_tick_ready (rtx);
static void change_queue_index (rtx, int);

/* The following functions are used to implement scheduling of data/control
   speculative instructions. */

static void extend_h_i_d (void);
static void extend_ready (int);
static void extend_global (rtx);
static void extend_all (rtx);
static void init_h_i_d (rtx);
static void generate_recovery_code (rtx);
static void process_insn_forw_deps_be_in_spec (rtx, rtx, ds_t);
static void begin_speculative_block (rtx);
static void add_to_speculative_block (rtx);
static dw_t dep_weak (ds_t);
static edge find_fallthru_edge (basic_block);
static void init_before_recovery (void);
static basic_block create_recovery_block (void);
static void create_check_block_twin (rtx, bool);
static void fix_recovery_deps (basic_block);
static void change_pattern (rtx, rtx);
static int speculate_insn (rtx, ds_t, rtx *);
static void dump_new_block_header (int, basic_block, rtx, rtx);
static void restore_bb_notes (basic_block);
static void extend_bb (void);
static void fix_jump_move (rtx);
static void move_block_after_check (rtx);
static void move_succs (VEC(edge,gc) **, basic_block);
static void sched_remove_insn (rtx);
static void clear_priorities (rtx, rtx_vec_t *);
static void calc_priorities (rtx_vec_t);
static void add_jump_dependencies (rtx, rtx);
#ifdef ENABLE_CHECKING
static int has_edge_p (VEC(edge,gc) *, int);
static void check_cfg (rtx, rtx);
#endif

#endif /* INSN_SCHEDULING */

/* Point to state used for the current scheduling pass. */
struct sched_info *current_sched_info;

#ifndef INSN_SCHEDULING
void
schedule_insns (void)
{
}
#else

/* Working copy of frontend's sched_info variable. */
static struct sched_info current_sched_info_var;

/* Pointer to the last instruction scheduled. Used by rank_for_schedule,
   so that insns independent of the last scheduled insn will be preferred
   over dependent instructions. */

static rtx last_scheduled_insn;

/* Cached cost of the instruction. Use below function to get cost of the
   insn. -1 here means that the field is not initialized. */
#define INSN_COST(INSN) (h_i_d[INSN_UID (INSN)].cost)

/* Compute cost of executing INSN.
   This is the number of cycles between instruction issue and
   instruction results. */
HAIFA_INLINE int
insn_cost (rtx insn)
{
  int cost = INSN_COST (insn);

  if (cost < 0)
    {
      /* A USE insn, or something else we don't need to
         understand. We can't pass these directly to
         result_ready_cost or insn_default_latency because it will
         trigger a fatal error for unrecognizable insns. */
      if (recog_memoized (insn) < 0)
        {
          INSN_COST (insn) = 0;
          return 0;
        }
      else
        {
          cost = insn_default_latency (insn);
          if (cost < 0)
            cost = 0;

          INSN_COST (insn) = cost;
        }
    }

  return cost;
}

/* Compute cost of dependence LINK.
   This is the number of cycles between instruction issue and
   instruction results. */
int
dep_cost (dep_t link)
{
  rtx used = DEP_CON (link);
  int cost;

  /* A USE insn should never require the value used to be computed.
     This allows the computation of a function's result and parameter
     values to overlap the return and call. */
  if (recog_memoized (used) < 0)
    cost = 0;
  else
    {
      rtx insn = DEP_PRO (link);
      enum reg_note dep_type = DEP_TYPE (link);

      cost = insn_cost (insn);

      if (INSN_CODE (insn) >= 0)
        {
          if (dep_type == REG_DEP_ANTI)
            cost = 0;
          else if (dep_type == REG_DEP_OUTPUT)
            {
              cost = (insn_default_latency (insn)
                      - insn_default_latency (used));
              if (cost <= 0)
                cost = 1;
            }
          else if (bypass_p (insn))
            cost = insn_latency (insn, used);
        }

      if (targetm.sched.adjust_cost != NULL)
        {
          /* This variable is used for backward compatibility with the
             targets. */
          rtx dep_cost_rtx_link = alloc_INSN_LIST (NULL_RTX, NULL_RTX);

          /* Make it self-cycled, so that if someone tries to walk over
             this incomplete list he/she will be caught in an endless
             loop. */
          XEXP (dep_cost_rtx_link, 1) = dep_cost_rtx_link;

          /* Targets use only REG_NOTE_KIND of the link. */
          PUT_REG_NOTE_KIND (dep_cost_rtx_link, DEP_TYPE (link));

          cost = targetm.sched.adjust_cost (used, dep_cost_rtx_link,
                                            insn, cost);

          free_INSN_LIST_node (dep_cost_rtx_link);
        }

      if (cost < 0)
        cost = 0;
    }

  return cost;
}

/* Return 'true' if DEP should be included in priority calculations. */
static bool
contributes_to_priority_p (dep_t dep)
{
  /* Critical path is meaningful in block boundaries only. */
  if (!current_sched_info->contributes_to_priority (DEP_CON (dep),
                                                    DEP_PRO (dep)))
    return false;

  /* If flag COUNT_SPEC_IN_CRITICAL_PATH is set,
     then speculative instructions will less likely be
     scheduled. That is because the priority of
     their producers will increase, and, thus, the
     producers will more likely be scheduled, thus,
     resolving the dependence. */
  if ((current_sched_info->flags & DO_SPECULATION)
      && !(spec_info->flags & COUNT_SPEC_IN_CRITICAL_PATH)
      && (DEP_STATUS (dep) & SPECULATIVE))
    return false;

  return true;
}

/* Compute the priority number for INSN. */
static int
priority (rtx insn)
{
  if (! INSN_P (insn))
    return 0;

  /* We should not be interested in priority of an already scheduled insn. */
  gcc_assert (QUEUE_INDEX (insn) != QUEUE_SCHEDULED);

  if (!INSN_PRIORITY_KNOWN (insn))
    {
      int this_priority = 0;

      if (sd_lists_empty_p (insn, SD_LIST_FORW))
        /* ??? We should set INSN_PRIORITY to insn_cost when an insn has
           some forward deps but all of them are ignored by
           contributes_to_priority hook. At the moment we set priority of
           such insn to 0. */
        this_priority = insn_cost (insn);
      else
        {
          rtx prev_first, twin;
          basic_block rec;

          /* For recovery check instructions we calculate priority slightly
             different than that of normal instructions. Instead of walking
             through INSN_FORW_DEPS (check) list, we walk through
             INSN_FORW_DEPS list of each instruction in the corresponding
             recovery block. */

          rec = RECOVERY_BLOCK (insn);
          if (!rec || rec == EXIT_BLOCK_PTR)
            {
              prev_first = PREV_INSN (insn);
              twin = insn;
            }
          else
            {
              prev_first = NEXT_INSN (BB_HEAD (rec));
              twin = PREV_INSN (BB_END (rec));
            }

          do
            {
              sd_iterator_def sd_it;
              dep_t dep;

              FOR_EACH_DEP (twin, SD_LIST_FORW, sd_it, dep)
                {
                  rtx next;
                  int next_priority;

                  next = DEP_CON (dep);

                  if (BLOCK_FOR_INSN (next) != rec)
                    {
                      int cost;

                      if (!contributes_to_priority_p (dep))
                        continue;

                      if (twin == insn)
                        cost = dep_cost (dep);
                      else
                        {
                          struct _dep _dep1, *dep1 = &_dep1;

                          init_dep (dep1, insn, next, REG_DEP_ANTI);

                          cost = dep_cost (dep1);
                        }

                      next_priority = cost + priority (next);

                      if (next_priority > this_priority)
                        this_priority = next_priority;
                    }
                }

              twin = PREV_INSN (twin);
            }
          while (twin != prev_first);
        }
      INSN_PRIORITY (insn) = this_priority;
      INSN_PRIORITY_STATUS (insn) = 1;
    }

  return INSN_PRIORITY (insn);
}

/* Macros and functions for keeping the priority queue sorted, and
   dealing with queuing and dequeuing of instructions. */

#define SCHED_SORT(READY, N_READY) \
do { if ((N_READY) == 2) \
       swap_sort (READY, N_READY); \
     else if ((N_READY) > 2) \
       qsort (READY, N_READY, sizeof (rtx), rank_for_schedule); } \
while (0)
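
/* Illustration only, not part of the scheduler. SCHED_SORT hands the
   ready array to qsort, which makes no promise about the order of
   elements that compare equal, so the comparator has to break every
   tie itself. The sketch below assumes hypothetical toy_item records
   with a priority and a unique original position; toy_rank and
   toy_sort are invented names.

```c
#include <assert.h>
#include <stdlib.h>

struct toy_item
{
  int priority;   // larger means more urgent
  int luid;       // original position, assumed unique per item
};

// Sort ascending by priority so the best candidate ends up last.
// On a tie, the item that came earlier in the original order counts
// as better and therefore sorts later.
static int
toy_rank (const void *x, const void *y)
{
  const struct toy_item *a = (const struct toy_item *) x;
  const struct toy_item *b = (const struct toy_item *) y;

  if (a->priority != b->priority)
    return a->priority < b->priority ? -1 : 1;

  // Returns 0 only when both arguments have the same luid.
  return (a->luid < b->luid) - (a->luid > b->luid);
}

static void
toy_sort (struct toy_item *items, int n)
{
  assert (n >= 0);
  qsort (items, (size_t) n, sizeof (struct toy_item), toy_rank);
}
```

   Because distinct items never compare equal, the result is the same
   whatever algorithm qsort uses internally. */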

/* Returns a positive value if x is preferred; returns a negative value if
   y is preferred. Should never return 0, since that will make the sort
   unstable. */

static int
rank_for_schedule (const void *x, const void *y)
{
  rtx tmp = *(const rtx *) y;
  rtx tmp2 = *(const rtx *) x;
  int tmp_class, tmp2_class;
  int val, priority_val, weight_val, info_val;

  /* The insn in a schedule group should be issued the first. */
  if (SCHED_GROUP_P (tmp) != SCHED_GROUP_P (tmp2))
    return SCHED_GROUP_P (tmp2) ? 1 : -1;

  /* Make sure that priority of TMP and TMP2 are initialized. */
  gcc_assert (INSN_PRIORITY_KNOWN (tmp) && INSN_PRIORITY_KNOWN (tmp2));

  /* Prefer insn with higher priority. */
  priority_val = INSN_PRIORITY (tmp2) - INSN_PRIORITY (tmp);

  if (priority_val)
    return priority_val;

  /* Prefer speculative insn with greater dependencies weakness. */
  if (spec_info)
    {
      ds_t ds1, ds2;
      dw_t dw1, dw2;
      int dw;

      ds1 = TODO_SPEC (tmp) & SPECULATIVE;
      if (ds1)
        dw1 = dep_weak (ds1);
      else
        dw1 = NO_DEP_WEAK;

      ds2 = TODO_SPEC (tmp2) & SPECULATIVE;
      if (ds2)
        dw2 = dep_weak (ds2);
      else
        dw2 = NO_DEP_WEAK;

      dw = dw2 - dw1;
      if (dw > (NO_DEP_WEAK / 8) || dw < -(NO_DEP_WEAK / 8))
        return dw;
    }

  /* Prefer an insn with smaller contribution to registers-pressure. */
  if (!reload_completed &&
      (weight_val = INSN_REG_WEIGHT (tmp) - INSN_REG_WEIGHT (tmp2)))
    return weight_val;

  info_val = (*current_sched_info->rank) (tmp, tmp2);
  if (info_val)
    return info_val;

  /* Compare insns based on their relation to the last-scheduled-insn. */
  if (INSN_P (last_scheduled_insn))
    {
      dep_t dep1;
      dep_t dep2;

      /* Classify the instructions into three classes:
         1) Data dependent on last schedule insn.
         2) Anti/Output dependent on last scheduled insn.
         3) Independent of last scheduled insn, or has latency of one.
         Choose the insn from the highest numbered class if different. */
      dep1 = sd_find_dep_between (last_scheduled_insn, tmp, true);

      if (dep1 == NULL || dep_cost (dep1) == 1)
        tmp_class = 3;
      else if (/* Data dependence. */
               DEP_TYPE (dep1) == REG_DEP_TRUE)
        tmp_class = 1;
      else
        tmp_class = 2;

      dep2 = sd_find_dep_between (last_scheduled_insn, tmp2, true);

      if (dep2 == NULL || dep_cost (dep2) == 1)
        tmp2_class = 3;
      else if (/* Data dependence. */
               DEP_TYPE (dep2) == REG_DEP_TRUE)
        tmp2_class = 1;
      else
        tmp2_class = 2;

      if ((val = tmp2_class - tmp_class))
        return val;
    }

  /* Prefer the insn which has more later insns that depend on it.
     This gives the scheduler more freedom when scheduling later
     instructions at the expense of added register pressure. */

  val = (sd_lists_size (tmp2, SD_LIST_FORW)
         - sd_lists_size (tmp, SD_LIST_FORW));

  if (val != 0)
    return val;

  /* If insns are equally good, sort by INSN_LUID (original insn order),
     so that we make the sort stable. This minimizes instruction movement,
     thus minimizing sched's effect on debugging and cross-jumping. */
  return INSN_LUID (tmp) - INSN_LUID (tmp2);
}

/* Resort the array A of N elements, in which only the last element,
   A[N - 1], may be out of order. */

HAIFA_INLINE static void
swap_sort (rtx *a, int n)
{
  rtx insn = a[n - 1];
  int i = n - 2;

  while (i >= 0 && rank_for_schedule (a + i, &insn) >= 0)
    {
      a[i + 1] = a[i];
      i -= 1;
    }
  a[i + 1] = insn;
}

/* Add INSN to the insn queue so that it can be executed at least
   N_CYCLES after the currently executing insn. Preserve insns
   chain for debugging purposes. */

HAIFA_INLINE static void
queue_insn (rtx insn, int n_cycles)
{
  int next_q = NEXT_Q_AFTER (q_ptr, n_cycles);
  rtx link = alloc_INSN_LIST (insn, insn_queue[next_q]);

  gcc_assert (n_cycles <= max_insn_queue_index);

  insn_queue[next_q] = link;
  q_size += 1;

  if (sched_verbose >= 2)
    {
      fprintf (sched_dump, ";;\t\tReady-->Q: insn %s: ",
               (*current_sched_info->print_insn) (insn, 0));

      fprintf (sched_dump, "queued for %d cycles.\n", n_cycles);
    }

  QUEUE_INDEX (insn) = next_q;
}

/* Remove INSN from queue. */
static void
queue_remove (rtx insn)
{
  gcc_assert (QUEUE_INDEX (insn) >= 0);
  remove_free_INSN_LIST_elem (insn, &insn_queue[QUEUE_INDEX (insn)]);
  q_size--;
  QUEUE_INDEX (insn) = QUEUE_NOWHERE;
}
/* Return a pointer to the bottom of the ready list, i.e. the insn
   with the lowest priority.  */

HAIFA_INLINE static rtx *
ready_lastpos (struct ready_list *ready)
{
  gcc_assert (ready->n_ready >= 1);
  return ready->vec + ready->first - ready->n_ready + 1;
}

/* Add an element INSN to the ready list so that it ends up with the
   lowest/highest priority depending on FIRST_P.  */

HAIFA_INLINE static void
ready_add (struct ready_list *ready, rtx insn, bool first_p)
{
  if (!first_p)
    {
      if (ready->first == ready->n_ready)
        {
          memmove (ready->vec + ready->veclen - ready->n_ready,
                   ready_lastpos (ready),
                   ready->n_ready * sizeof (rtx));
          ready->first = ready->veclen - 1;
        }
      ready->vec[ready->first - ready->n_ready] = insn;
    }
  else
    {
      if (ready->first == ready->veclen - 1)
        {
          if (ready->n_ready)
            /* ready_lastpos() fails when called with (ready->n_ready == 0).  */
            memmove (ready->vec + ready->veclen - ready->n_ready - 1,
                     ready_lastpos (ready),
                     ready->n_ready * sizeof (rtx));
          ready->first = ready->veclen - 2;
        }
      ready->vec[++(ready->first)] = insn;
    }

  ready->n_ready++;

  gcc_assert (QUEUE_INDEX (insn) != QUEUE_READY);
  QUEUE_INDEX (insn) = QUEUE_READY;
}

/* Remove the element with the highest priority from the ready list and
   return it.  */

HAIFA_INLINE static rtx
ready_remove_first (struct ready_list *ready)
{
  rtx t;

  gcc_assert (ready->n_ready);
  t = ready->vec[ready->first--];
  ready->n_ready--;
  /* If the queue becomes empty, reset it.  */
  if (ready->n_ready == 0)
    ready->first = ready->veclen - 1;

  gcc_assert (QUEUE_INDEX (t) == QUEUE_READY);
  QUEUE_INDEX (t) = QUEUE_NOWHERE;

  return t;
}

/* The following code implements multi-pass scheduling for the first
   cycle.  In other words, we try to choose the ready insn that permits
   the maximum number of insns to start on the same cycle.  */

/* Return the element INDEX of the ready list READY.  INDEX is 0 for the
   insn with the highest priority and N_READY - 1 for the insn with the
   lowest priority.  */

HAIFA_INLINE static rtx
ready_element (struct ready_list *ready, int index)
{
  gcc_assert (ready->n_ready && index < ready->n_ready);

  return ready->vec[ready->first - index];
}

/* Remove the element INDEX from the ready list and return it.  INDEX is
   0 for the insn with the highest priority and N_READY - 1 for the insn
   with the lowest priority.  */

HAIFA_INLINE static rtx
ready_remove (struct ready_list *ready, int index)
{
  rtx t;
  int i;

  if (index == 0)
    return ready_remove_first (ready);
  gcc_assert (ready->n_ready && index < ready->n_ready);
  t = ready->vec[ready->first - index];
  ready->n_ready--;
  /* Close the gap by moving each lower-priority element up one slot.  */
  for (i = index; i < ready->n_ready; i++)
    ready->vec[ready->first - i] = ready->vec[ready->first - i - 1];
  QUEUE_INDEX (t) = QUEUE_NOWHERE;
  return t;
}

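/* Illustration only: the ready list is stored as a run of array slots
   that ends at index FIRST and grows downward, so element INDEX lives at
   vec[first - index].  The sketch below models that layout with ints.
   struct demo_ready and the demo_* helpers are hypothetical; the sketch
   leaves out the memmove compaction done by ready_add and the
   ready_remove_first shortcut taken by ready_remove.  */

```c
#include <assert.h>

#define DEMO_VECLEN 8

/* Hypothetical miniature of struct ready_list, holding ints.  */
struct demo_ready
{
  int vec[DEMO_VECLEN];
  int first;    /* Index of the highest-priority element.  */
  int n_ready;  /* Number of elements currently in the list.  */
};

static void
demo_init (struct demo_ready *r)
{
  r->first = DEMO_VECLEN - 1;
  r->n_ready = 0;
}

/* Return element INDEX, counted from the highest-priority end.  */
static int
demo_element (const struct demo_ready *r, int index)
{
  assert (r->n_ready && index >= 0 && index < r->n_ready);
  return r->vec[r->first - index];
}

/* Append VALUE at the lowest-priority end.  The sketch assumes a free
   slot exists below the occupied run.  */
static void
demo_add_last (struct demo_ready *r, int value)
{
  assert (r->first - r->n_ready >= 0);
  r->vec[r->first - r->n_ready] = value;
  r->n_ready++;
}

/* Remove and return element INDEX, moving every lower-priority element
   up one slot to close the gap.  */
static int
demo_remove (struct demo_ready *r, int index)
{
  int i, t;

  assert (r->n_ready && index >= 0 && index < r->n_ready);
  t = r->vec[r->first - index];
  r->n_ready--;
  for (i = index; i < r->n_ready; i++)
    r->vec[r->first - i] = r->vec[r->first - i - 1];
  return t;
}
```

/* Adding 10, 20 and 30 in that order places them at vec[7], vec[6] and
   vec[5].  Removing index 1 returns 20 and leaves 30 at index 1.  */
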
/* Remove INSN from the ready list.  */
static void
ready_remove_insn (rtx insn)
{
  int i;

  for (i = 0; i < readyp->n_ready; i++)
    if (ready_element (readyp, i) == insn)
      {
        ready_remove (readyp, i);
        return;
      }
  gcc_unreachable ();
}

/* Sort the ready list READY by ascending priority, using the SCHED_SORT
   macro.  */

HAIFA_INLINE static void
ready_sort (struct ready_list *ready)
{
  rtx *first = ready_lastpos (ready);
  SCHED_SORT (first, ready->n_ready);
}

/* PREV is an insn that is ready to execute.  Adjust its priority if that
   will help shorten or lengthen register lifetimes as appropriate.  Also
   provide a hook for the target to tweak the priority itself.  */

HAIFA_INLINE static void
adjust_priority (rtx prev)
{
  /* ??? There used to be code here to try and estimate how an insn
     affected register lifetimes, but it did it by looking at REG_DEAD
     notes, which we removed in schedule_region.  Nor did it try to
     take into account register pressure or anything useful like that.

     Revisit when we have a machine model to work with and not before.  */

  if (targetm.sched.adjust_priority)
    INSN_PRIORITY (prev) =
      targetm.sched.adjust_priority (prev, INSN_PRIORITY (prev));
}

/* Advance time by one cycle.  */
HAIFA_INLINE static void
advance_one_cycle (void)
{
  if (targetm.sched.dfa_pre_advance_cycle)
    targetm.sched.dfa_pre_advance_cycle ();

  if (targetm.sched.dfa_pre_cycle_insn)
    state_transition (curr_state,
                      targetm.sched.dfa_pre_cycle_insn ());

  state_transition (curr_state, NULL);

  if (targetm.sched.dfa_post_cycle_insn)
    state_transition (curr_state,
                      targetm.sched.dfa_post_cycle_insn ());

  if (targetm.sched.dfa_post_advance_cycle)
    targetm.sched.dfa_post_advance_cycle ();
}

/* Clock at which the previous instruction was issued.  */
static int last_clock_var;

/* INSN is the "currently executing insn".  Launch each insn which was
   waiting on INSN.  The current cycle is taken from clock_var.  The
   function returns the necessary cycle advance after issuing the insn
   (it is nonzero only for insns in a schedule group).  */

static int
schedule_insn (rtx insn)
{
  sd_iterator_def sd_it;
  dep_t dep;
  int advance = 0;

  if (sched_verbose >= 1)
    {
      char buf[2048];

      print_insn (buf, insn, 0);
      buf[40] = 0;
      fprintf (sched_dump, ";;\t%3i--> %-40s:", clock_var, buf);

      if (recog_memoized (insn) < 0)
        fprintf (sched_dump, "nothing");
      else
        print_reservation (sched_dump, insn);
      fputc ('\n', sched_dump);
    }

  /* The instruction being scheduled should have all its dependencies
     resolved and should have been removed from the ready list.  */
  gcc_assert (sd_lists_empty_p (insn, SD_LIST_BACK));

  gcc_assert (QUEUE_INDEX (insn) == QUEUE_NOWHERE);
  QUEUE_INDEX (insn) = QUEUE_SCHEDULED;

  gcc_assert (INSN_TICK (insn) >= MIN_TICK);
  if (INSN_TICK (insn) > clock_var)
    /* INSN has been prematurely moved from the queue to the ready list.
       This is possible only if the following flag is set.  */
    gcc_assert (flag_sched_stalled_insns);

  /* ??? Probably, if INSN is scheduled prematurely, we should leave
     INSN_TICK untouched.  This is a machine-dependent issue, actually.  */
  INSN_TICK (insn) = clock_var;

  /* Update dependent instructions.  */
  for (sd_it = sd_iterator_start (insn, SD_LIST_FORW);
       sd_iterator_cond (&sd_it, &dep);)
    {
      rtx next = DEP_CON (dep);

      /* Resolve the dependence between INSN and NEXT.
         sd_resolve_dep () moves the current dep to another list, thus
         advancing the iterator.  */
      sd_resolve_dep (sd_it);

      if (!IS_SPECULATION_BRANCHY_CHECK_P (insn))
        {
          int effective_cost;

          effective_cost = try_ready (next);

          if (effective_cost >= 0
              && SCHED_GROUP_P (next)
              && advance < effective_cost)
            advance = effective_cost;
        }
      else
        /* A check always has only one forward dependence (to the first insn
           in the recovery block), so this will be executed only once.  */
        {
          gcc_assert (sd_lists_empty_p (insn, SD_LIST_FORW));
          fix_recovery_deps (RECOVERY_BLOCK (insn));
        }
    }

  /* This is the place where the scheduler doesn't *basically* need backward
     and forward dependencies for INSN anymore.  Nevertheless they are used
     in heuristics in rank_for_schedule (), early_queue_to_ready () and in
     some targets (e.g. rs6000).  Thus the earliest place where we *can*
     remove dependencies is after the targetm.sched.md_finish () call in
     schedule_block ().  But, on the other hand, the safest place to remove
     dependencies is when we finish scheduling the entire region.  As we
     don't generate [many] dependencies during scheduling itself, we won't
     need the memory until the beginning of the next region.
     Bottom line: dependencies are removed for all insns at the end of
     scheduling the region.  */

  /* Annotate the instruction with issue information -- TImode
     indicates that the instruction is expected not to be able
     to issue on the same cycle as the previous insn.  A machine
     may use this information to decide how the instruction should
     be aligned.  */
  if (issue_rate > 1
      && GET_CODE (PATTERN (insn)) != USE
      && GET_CODE (PATTERN (insn)) != CLOBBER)
    {
      if (reload_completed)
        PUT_MODE (insn, clock_var > last_clock_var ? TImode : VOIDmode);
      last_clock_var = clock_var;
    }

  return advance;
}

/* Functions for handling of notes.  */

/* Delete notes beginning with INSN and put them in the chain
   of notes ended by NOTE_LIST.
   Returns the insn following the notes.  */

static rtx
unlink_other_notes (rtx insn, rtx tail)
{
  rtx prev = PREV_INSN (insn);

  while (insn != tail && NOTE_NOT_BB_P (insn))
    {
      rtx next = NEXT_INSN (insn);
      basic_block bb = BLOCK_FOR_INSN (insn);

      /* Delete the note from its current position.  */
      if (prev)
        NEXT_INSN (prev) = next;
      if (next)
        PREV_INSN (next) = prev;

      if (bb)
        {
          /* Basic block can begin with either LABEL or
             NOTE_INSN_BASIC_BLOCK.  */
          gcc_assert (BB_HEAD (bb) != insn);

          /* Check if we are removing last insn in the BB.  */
          if (BB_END (bb) == insn)
            BB_END (bb) = prev;
        }

      /* See sched_analyze to see how these are handled.  */
      if (NOTE_KIND (insn) != NOTE_INSN_EH_REGION_BEG
          && NOTE_KIND (insn) != NOTE_INSN_EH_REGION_END)
        {
          /* Insert the note at the end of the notes list.  */
          PREV_INSN (insn) = note_list;
          if (note_list)
            NEXT_INSN (note_list) = insn;
          note_list = insn;
        }

      insn = next;
    }
  return insn;
}

/* Return the head and tail pointers of ebb starting at BEG and ending
   at END.  */

void
get_ebb_head_tail (basic_block beg, basic_block end, rtx *headp, rtx *tailp)
{
  rtx beg_head = BB_HEAD (beg);
  rtx beg_tail = BB_END (beg);
  rtx end_head = BB_HEAD (end);
  rtx end_tail = BB_END (end);

  /* Don't include any notes or labels at the beginning of the BEG
     basic block, or notes at the end of the END basic blocks.  */

  if (LABEL_P (beg_head))
    beg_head = NEXT_INSN (beg_head);

  while (beg_head != beg_tail)
    if (NOTE_P (beg_head))
      beg_head = NEXT_INSN (beg_head);
    else
      break;

  *headp = beg_head;

  if (beg == end)
    end_head = beg_head;
  else if (LABEL_P (end_head))
    end_head = NEXT_INSN (end_head);

  while (end_head != end_tail)
    if (NOTE_P (end_tail))
      end_tail = PREV_INSN (end_tail);
    else
      break;

  *tailp = end_tail;
}

/* Return nonzero if there are no real insns in the range [ HEAD, TAIL ].  */

int
no_real_insns_p (const_rtx head, const_rtx tail)
{
  while (head != NEXT_INSN (tail))
    {
      if (!NOTE_P (head) && !LABEL_P (head))
        return 0;
      head = NEXT_INSN (head);
    }
  return 1;
}

/* Delete notes between HEAD and TAIL and put them in the chain
   of notes ended by NOTE_LIST.  */

void
rm_other_notes (rtx head, rtx tail)
{
  rtx next_tail;
  rtx insn;

  note_list = 0;
  if (head == tail && (! INSN_P (head)))
    return;

  next_tail = NEXT_INSN (tail);
  for (insn = head; insn != next_tail; insn = NEXT_INSN (insn))
    {
      rtx prev;

      /* Farm out notes, and maybe save them in NOTE_LIST.
         This is needed to keep the debugger from
         getting completely deranged.  */
      if (NOTE_NOT_BB_P (insn))
        {
          prev = insn;

          insn = unlink_other_notes (insn, next_tail);

          gcc_assert (prev != tail && prev != head && insn != next_tail);
        }
    }
}

/* Functions for computation of registers live/usage info.  */

/* This function looks for a new register being defined.
   If the destination register is already used by the source,
   a new register is not needed.  */

static int
find_set_reg_weight (const_rtx x)
{
  if (GET_CODE (x) == CLOBBER
      && register_operand (SET_DEST (x), VOIDmode))
    return 1;
  if (GET_CODE (x) == SET
      && register_operand (SET_DEST (x), VOIDmode))
    {
      if (REG_P (SET_DEST (x)))
        {
          if (!reg_mentioned_p (SET_DEST (x), SET_SRC (x)))
            return 1;
          else
            return 0;
        }
      return 1;
    }
  return 0;
}

/* Calculate INSN_REG_WEIGHT for all insns of a block.  */

static void
find_insn_reg_weight (basic_block bb)
{
  rtx insn, next_tail, head, tail;

  get_ebb_head_tail (bb, bb, &head, &tail);
  next_tail = NEXT_INSN (tail);

  for (insn = head; insn != next_tail; insn = NEXT_INSN (insn))
    find_insn_reg_weight1 (insn);
}

/* Calculate INSN_REG_WEIGHT for a single instruction.
   Separated from find_insn_reg_weight because new instructions
   must be initialized in generate_recovery_code.  */
static void
find_insn_reg_weight1 (rtx insn)
{
  int reg_weight = 0;
  rtx x;

  /* Handle register life information.  */
  if (! INSN_P (insn))
    return;

  /* Increment weight for each register born here.  */
  x = PATTERN (insn);
  reg_weight += find_set_reg_weight (x);
  if (GET_CODE (x) == PARALLEL)
    {
      int j;
      for (j = XVECLEN (x, 0) - 1; j >= 0; j--)
        {
          x = XVECEXP (PATTERN (insn), 0, j);
          reg_weight += find_set_reg_weight (x);
        }
    }
  /* Decrement weight for each register that dies here.  */
  for (x = REG_NOTES (insn); x; x = XEXP (x, 1))
    {
      if (REG_NOTE_KIND (x) == REG_DEAD
          || REG_NOTE_KIND (x) == REG_UNUSED)
        reg_weight--;
    }

  INSN_REG_WEIGHT (insn) = reg_weight;
}

/* Move insns that became ready to fire from queue to ready list.  */

static void
queue_to_ready (struct ready_list *ready)
{
  rtx insn;
  rtx link;
  rtx skip_insn;

  q_ptr = NEXT_Q (q_ptr);

  if (dbg_cnt (sched_insn) == false)
    /* If the debug counter is activated, do not requeue the insn right
       after last_scheduled_insn.  */
    skip_insn = next_nonnote_insn (last_scheduled_insn);
  else
    skip_insn = NULL_RTX;

  /* Add all pending insns that can be scheduled without stalls to the
     ready list.  */
  for (link = insn_queue[q_ptr]; link; link = XEXP (link, 1))
    {
      insn = XEXP (link, 0);
      q_size -= 1;

      if (sched_verbose >= 2)
        fprintf (sched_dump, ";;\t\tQ-->Ready: insn %s: ",
                 (*current_sched_info->print_insn) (insn, 0));

      /* If the ready list is full, delay the insn for 1 cycle.
         See the comment in schedule_block for the rationale.  */
      if (!reload_completed
          && ready->n_ready > MAX_SCHED_READY_INSNS
          && !SCHED_GROUP_P (insn)
          && insn != skip_insn)
        {
          if (sched_verbose >= 2)
            fprintf (sched_dump, "requeued because ready full\n");
          queue_insn (insn, 1);
        }
      else
        {
          ready_add (ready, insn, false);
          if (sched_verbose >= 2)
            fprintf (sched_dump, "moving to ready without stalls\n");
        }
    }
  free_INSN_LIST_list (&insn_queue[q_ptr]);

  /* If there are no ready insns, stall until one is ready and add all
     of the pending insns at that point to the ready list.  */
  if (ready->n_ready == 0)
    {
      int stalls;

      for (stalls = 1; stalls <= max_insn_queue_index; stalls++)
        {
          if ((link = insn_queue[NEXT_Q_AFTER (q_ptr, stalls)]))
            {
              for (; link; link = XEXP (link, 1))
                {
                  insn = XEXP (link, 0);
                  q_size -= 1;

                  if (sched_verbose >= 2)
                    fprintf (sched_dump, ";;\t\tQ-->Ready: insn %s: ",
                             (*current_sched_info->print_insn) (insn, 0));

                  ready_add (ready, insn, false);
                  if (sched_verbose >= 2)
                    fprintf (sched_dump, "moving to ready with %d stalls\n", stalls);
                }
              free_INSN_LIST_list (&insn_queue[NEXT_Q_AFTER (q_ptr, stalls)]);

              advance_one_cycle ();

              break;
            }

          advance_one_cycle ();
        }

      q_ptr = NEXT_Q_AFTER (q_ptr, stalls);
      clock_var += stalls;
    }
}

/* Used by early_queue_to_ready.  Determines whether it is "ok" to
   prematurely move INSN from the queue to the ready list.  Currently,
   if a target defines the hook 'is_costly_dependence', this function
   uses the hook to check whether there exist any dependences which are
   considered costly by the target, between INSN and other insns that
   have already been scheduled.  Dependences are checked up to Y cycles
   back, with default Y=1; the flag -fsched-stalled-insns-dep=Y controls
   this value.
   (Other considerations could be taken into account instead, or in
   addition, depending on user flags and target hooks.)  */

static bool
ok_for_early_queue_removal (rtx insn)
{
  int n_cycles;
  rtx prev_insn = last_scheduled_insn;

  if (targetm.sched.is_costly_dependence)
    {
      for (n_cycles = flag_sched_stalled_insns_dep; n_cycles; n_cycles--)
        {
          for ( ; prev_insn; prev_insn = PREV_INSN (prev_insn))
            {
              int cost;

              if (prev_insn == current_sched_info->prev_head)
                {
                  prev_insn = NULL;
                  break;
                }

              if (!NOTE_P (prev_insn))
                {
                  dep_t dep;

                  dep = sd_find_dep_between (prev_insn, insn, true);

                  if (dep != NULL)
                    {
                      cost = dep_cost (dep);

                      if (targetm.sched.is_costly_dependence (dep, cost,
                                flag_sched_stalled_insns_dep - n_cycles))
                        return false;
                    }
                }

              if (GET_MODE (prev_insn) == TImode) /* end of dispatch group */
                break;
            }

          if (!prev_insn)
            break;
          prev_insn = PREV_INSN (prev_insn);
        }
    }

  return true;
}

/* Remove insns from the queue before they become "ready" with respect
   to FU latency considerations.  */

static int
early_queue_to_ready (state_t state, struct ready_list *ready)
{
  rtx insn;
  rtx link;
  rtx next_link;
  rtx prev_link;
  bool move_to_ready;
  int cost;
  state_t temp_state = alloca (dfa_state_size);
  int stalls;
  int insns_removed = 0;

  /*
     Flag '-fsched-stalled-insns=X' determines the aggressiveness of this
     function:

     X == 0: There is no limit on how many queued insns can be removed
             prematurely.  (flag_sched_stalled_insns = -1).

     X >= 1: Only X queued insns can be removed prematurely in each
             invocation.  (flag_sched_stalled_insns = X).

     Otherwise: Early queue removal is disabled.
         (flag_sched_stalled_insns = 0)
  */

  if (! flag_sched_stalled_insns)
    return 0;

  for (stalls = 0; stalls <= max_insn_queue_index; stalls++)
    {
      if ((link = insn_queue[NEXT_Q_AFTER (q_ptr, stalls)]))
        {
          if (sched_verbose > 6)
            fprintf (sched_dump, ";; look at index %d + %d\n", q_ptr, stalls);

          prev_link = 0;
          while (link)
            {
              next_link = XEXP (link, 1);
              insn = XEXP (link, 0);
              if (insn && sched_verbose > 6)
                print_rtl_single (sched_dump, insn);

              memcpy (temp_state, state, dfa_state_size);
              if (recog_memoized (insn) < 0)
                /* Use a non-negative cost to indicate that the insn is
                   not ready, to avoid infinite Q->R->Q->R...  */
                cost = 0;
              else
                cost = state_transition (temp_state, insn);

              if (sched_verbose >= 6)
                fprintf (sched_dump, "transition cost = %d\n", cost);

              move_to_ready = false;
              if (cost < 0)
                {
                  move_to_ready = ok_for_early_queue_removal (insn);
                  if (move_to_ready == true)
                    {
                      /* move from Q to R */
                      q_size -= 1;
                      ready_add (ready, insn, false);

                      if (prev_link)
                        XEXP (prev_link, 1) = next_link;
                      else
                        insn_queue[NEXT_Q_AFTER (q_ptr, stalls)] = next_link;

                      free_INSN_LIST_node (link);

                      if (sched_verbose >= 2)
                        fprintf (sched_dump, ";;\t\tEarly Q-->Ready: insn %s\n",
                                 (*current_sched_info->print_insn) (insn, 0));

                      insns_removed++;
                      if (insns_removed == flag_sched_stalled_insns)
                        /* Remove no more than flag_sched_stalled_insns insns
                           from Q at a time.  */
                        return insns_removed;
                    }
                }

              if (move_to_ready == false)
                prev_link = link;

              link = next_link;
            } /* while link */
        } /* if link */

    } /* for stalls.. */

  return insns_removed;
}

/* Print the ready list for debugging purposes.  Callable from debugger.  */

static void
debug_ready_list (struct ready_list *ready)
{
  rtx *p;
  int i;

  if (ready->n_ready == 0)
    {
      fprintf (sched_dump, "\n");
      return;
    }

  p = ready_lastpos (ready);
  for (i = 0; i < ready->n_ready; i++)
    fprintf (sched_dump, " %s", (*current_sched_info->print_insn) (p[i], 0));
  fprintf (sched_dump, "\n");
}

/* Search INSN for REG_SAVE_NOTE note pairs for
   NOTE_INSN_EH_REGION_{BEG,END}, and convert them back into
   NOTEs.  The REG_SAVE_NOTE note following the first one contains the
   saved value for NOTE_BLOCK_NUMBER, which is useful for
   NOTE_INSN_EH_REGION_{BEG,END} NOTEs.  */

static void
reemit_notes (rtx insn)
{
  rtx note, last = insn;

  for (note = REG_NOTES (insn); note; note = XEXP (note, 1))
    {
      if (REG_NOTE_KIND (note) == REG_SAVE_NOTE)
        {
          enum insn_note note_type = INTVAL (XEXP (note, 0));

          last = emit_note_before (note_type, last);
          remove_note (insn, note);
        }
    }
}

/* Move INSN.  Reemit notes if needed.  Update CFG, if needed.  */
static void
move_insn (rtx insn)
{
  rtx last = last_scheduled_insn;

  if (PREV_INSN (insn) != last)
    {
      basic_block bb;
      rtx note;
      int jump_p = 0;

      bb = BLOCK_FOR_INSN (insn);

      /* BB_HEAD is either LABEL or NOTE.  */
      gcc_assert (BB_HEAD (bb) != insn);

      if (BB_END (bb) == insn)
        /* If this is last instruction in BB, move end marker one
           instruction up.  */
        {
          /* Jumps are always placed at the end of basic block.  */
          jump_p = control_flow_insn_p (insn);

          gcc_assert (!jump_p
                      || ((current_sched_info->flags & SCHED_RGN)
                          && IS_SPECULATION_BRANCHY_CHECK_P (insn))
                      || (current_sched_info->flags & SCHED_EBB));

          gcc_assert (BLOCK_FOR_INSN (PREV_INSN (insn)) == bb);

          BB_END (bb) = PREV_INSN (insn);
        }

      gcc_assert (BB_END (bb) != last);

      if (jump_p)
        /* We move the block note along with jump.  */
        {
          /* NT is needed for assertion below.  */
          rtx nt = current_sched_info->next_tail;

          note = NEXT_INSN (insn);
          while (NOTE_NOT_BB_P (note) && note != nt)
            note = NEXT_INSN (note);

          if (note != nt
              && (LABEL_P (note)
                  || BARRIER_P (note)))
            note = NEXT_INSN (note);

          gcc_assert (NOTE_INSN_BASIC_BLOCK_P (note));
        }
      else
        note = insn;

      NEXT_INSN (PREV_INSN (insn)) = NEXT_INSN (note);
      PREV_INSN (NEXT_INSN (note)) = PREV_INSN (insn);

      NEXT_INSN (note) = NEXT_INSN (last);
      PREV_INSN (NEXT_INSN (last)) = note;

      NEXT_INSN (last) = insn;
      PREV_INSN (insn) = last;

      bb = BLOCK_FOR_INSN (last);

      if (jump_p)
        {
          fix_jump_move (insn);

          if (BLOCK_FOR_INSN (insn) != bb)
            move_block_after_check (insn);

          gcc_assert (BB_END (bb) == last);
        }

      df_insn_change_bb (insn, bb);

      /* Update BB_END, if needed.  */
      if (BB_END (bb) == last)
        BB_END (bb) = insn;
    }

  reemit_notes (insn);

  SCHED_GROUP_P (insn) = 0;
}

/* The following structure describes an entry of the stack of choices.  */
struct choice_entry
{
  /* Ordinal number of the issued insn in the ready queue.  */
  int index;
  /* The number of remaining insns whose issue we should try.  */
  int rest;
  /* The number of issued essential insns.  */
  int n;
  /* State after issuing the insn.  */
  state_t state;
};

/* The following array is used to implement a stack of choices used in
   function max_issue.  */
static struct choice_entry *choice_stack;

/* The following variable is the number of essential insns issued on
   the current cycle.  An insn is essential if it changes the
   processor's state.  */
static int cycle_issued_insns;

/* The following variable is the maximal number of tries of issuing
   insns for the first cycle multipass insn scheduling.  We define
   this value as constant * (DFA_LOOKAHEAD ** ISSUE_RATE).  We would not
   need this constraint if all real insns (with non-negative codes)
   had reservations, because in that case the algorithm complexity is
   O(DFA_LOOKAHEAD ** ISSUE_RATE).  Unfortunately, the dfa descriptions
   might be incomplete and such insns might occur.  For such
   descriptions, the complexity of the algorithm (without the constraint)
   could reach DFA_LOOKAHEAD ** N, where N is the queue length.  */
static int max_lookahead_tries;

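/* Illustration only: the bound described above is
   constant * (DFA_LOOKAHEAD ** ISSUE_RATE).  The sketch below computes it
   with repeated multiplication, using the constant 100 that choose_ready
   starts from.  The function name demo_max_lookahead_tries is
   hypothetical, and the sketch ignores integer overflow for large
   arguments.  */

```c
#include <assert.h>

/* Return 100 * LOOKAHEAD ** ISSUE_RATE as the budget of issue attempts.  */
static int
demo_max_lookahead_tries (int lookahead, int issue_rate)
{
  int tries = 100;
  int i;

  assert (lookahead > 0 && issue_rate >= 0);
  for (i = 0; i < issue_rate; i++)
    tries *= lookahead;
  return tries;
}
```

/* For a lookahead of 4 and an issue rate of 2 the budget is 1600 tries.
   Once max_issue exceeds the budget it stops searching and keeps the
   best choice found so far.  */
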
/* The following is the value of the hook
   `first_cycle_multipass_dfa_lookahead' at the last call of
   `max_issue'.  */
static int cached_first_cycle_multipass_dfa_lookahead = 0;

/* The following is the value of `issue_rate' at the last call of
   `sched_init'.  */
static int cached_issue_rate = 0;

/* The following function returns the maximal (or close to maximal) number
   of insns that can be issued on the same cycle, one of which is the
   insn with the best rank (the first insn in READY).  To do this, the
   function tries different samples of ready insns.  READY is the
   current ready list.  The global array READY_TRY reflects which
   insns are already issued in this try.  MAX_POINTS is the sum of points
   of all instructions in READY.  The function stops immediately
   once it reaches a solution in which all instructions can be issued.
   INDEX will contain the index of the best insn in READY.  This
   function is used only for first cycle multipass scheduling.  */
static int
max_issue (struct ready_list *ready, int *index, int max_points)
{
  int n, i, all, n_ready, best, delay, tries_num, points = -1;
  struct choice_entry *top;
  rtx insn;

  best = 0;
  memcpy (choice_stack->state, curr_state, dfa_state_size);
  top = choice_stack;
  top->rest = cached_first_cycle_multipass_dfa_lookahead;
  top->n = 0;
  n_ready = ready->n_ready;
  for (all = i = 0; i < n_ready; i++)
    if (!ready_try [i])
      all++;
  i = 0;
  tries_num = 0;
  for (;;)
    {
      if (top->rest == 0 || i >= n_ready)
        {
          if (top == choice_stack)
            break;
          if (best < top - choice_stack && ready_try [0])
            {
              best = top - choice_stack;
              *index = choice_stack [1].index;
              points = top->n;
              if (top->n == max_points || best == all)
                break;
            }
          i = top->index;
          ready_try [i] = 0;
          top--;
          memcpy (curr_state, top->state, dfa_state_size);
        }
      else if (!ready_try [i])
        {
          tries_num++;
          if (tries_num > max_lookahead_tries)
            break;
          insn = ready_element (ready, i);
          delay = state_transition (curr_state, insn);
          if (delay < 0)
            {
              if (state_dead_lock_p (curr_state))
                top->rest = 0;
              else
                top->rest--;
              n = top->n;
              if (memcmp (top->state, curr_state, dfa_state_size) != 0)
                n += ISSUE_POINTS (insn);
              top++;
              top->rest = cached_first_cycle_multipass_dfa_lookahead;
              top->index = i;
              top->n = n;
              memcpy (top->state, curr_state, dfa_state_size);
              ready_try [i] = 1;
              i = -1;
            }
        }
      i++;
    }
  while (top != choice_stack)
    {
      ready_try [top->index] = 0;
      top--;
    }
  memcpy (curr_state, choice_stack->state, dfa_state_size);

  if (sched_verbose >= 4)
    fprintf (sched_dump, ";;\t\tChoosed insn : %s; points: %d/%d\n",
             (*current_sched_info->print_insn) (ready_element (ready, *index),
                                                0),
             points, max_points);

  return best;
}

/* The following function chooses an insn from READY and modifies
   READY accordingly.  It is used only for first cycle multipass
   scheduling.
   Return:
   -1 if the cycle should be advanced,
   0 if INSN_PTR is set to point to the desirable insn,
   1 if choose_ready () should be restarted without advancing the cycle.  */
static int
choose_ready (struct ready_list *ready, rtx *insn_ptr)
{
  int lookahead;

  if (dbg_cnt (sched_insn) == false)
    {
      rtx insn;

      insn = next_nonnote_insn (last_scheduled_insn);

      if (QUEUE_INDEX (insn) == QUEUE_READY)
        /* INSN is in the ready_list.  */
        {
          ready_remove_insn (insn);
          *insn_ptr = insn;
          return 0;
        }

      /* INSN is in the queue.  Advance cycle to move it to the ready list.  */
      return -1;
    }

  lookahead = 0;

  if (targetm.sched.first_cycle_multipass_dfa_lookahead)
    lookahead = targetm.sched.first_cycle_multipass_dfa_lookahead ();
  if (lookahead <= 0 || SCHED_GROUP_P (ready_element (ready, 0)))
    {
      *insn_ptr = ready_remove_first (ready);
      return 0;
    }
  else
    {
      /* Try to choose the better insn.  */
      int index = 0, i, n;
      rtx insn;
      int more_issue, max_points, try_data = 1, try_control = 1;

      if (cached_first_cycle_multipass_dfa_lookahead != lookahead)
        {
          cached_first_cycle_multipass_dfa_lookahead = lookahead;
          max_lookahead_tries = 100;
          for (i = 0; i < issue_rate; i++)
            max_lookahead_tries *= lookahead;
        }
      insn = ready_element (ready, 0);
      if (INSN_CODE (insn) < 0)
        {
          *insn_ptr = ready_remove_first (ready);
          return 0;
        }

      if (spec_info
          && spec_info->flags & (PREFER_NON_DATA_SPEC
                                 | PREFER_NON_CONTROL_SPEC))
        {
          for (i = 0, n = ready->n_ready; i < n; i++)
            {
              rtx x;
              ds_t s;

              x = ready_element (ready, i);
              s = TODO_SPEC (x);

              if (spec_info->flags & PREFER_NON_DATA_SPEC
                  && !(s & DATA_SPEC))
                {
                  try_data = 0;
                  if (!(spec_info->flags & PREFER_NON_CONTROL_SPEC)
                      || !try_control)
                    break;
                }

              if (spec_info->flags & PREFER_NON_CONTROL_SPEC
                  && !(s & CONTROL_SPEC))
                {
                  try_control = 0;
                  if (!(spec_info->flags & PREFER_NON_DATA_SPEC) || !try_data)
                    break;
                }
            }
        }

      if ((!try_data && (TODO_SPEC (insn) & DATA_SPEC))
          || (!try_control && (TODO_SPEC (insn) & CONTROL_SPEC))
          || (targetm.sched.first_cycle_multipass_dfa_lookahead_guard_spec
              && !targetm.sched.first_cycle_multipass_dfa_lookahead_guard_spec
              (insn)))
        /* Discard the speculative instruction that stands first in the
           ready list.  */
        {
          change_queue_index (insn, 1);
          return 1;
        }

      max_points = ISSUE_POINTS (insn);
      more_issue = issue_rate - cycle_issued_insns - 1;

      for (i = 1; i < ready->n_ready; i++)
        {
          insn = ready_element (ready, i);
          ready_try [i]
            = (INSN_CODE (insn) < 0
               || (!try_data && (TODO_SPEC (insn) & DATA_SPEC))
               || (!try_control && (TODO_SPEC (insn) & CONTROL_SPEC))
               || (targetm.sched.first_cycle_multipass_dfa_lookahead_guard
                   && !targetm.sched.first_cycle_multipass_dfa_lookahead_guard
                   (insn)));

          if (!ready_try [i] && more_issue-- > 0)
            max_points += ISSUE_POINTS (insn);
        }

      if (max_issue (ready, &index, max_points) == 0)
        {
          *insn_ptr = ready_remove_first (ready);
          return 0;
        }
      else
        {
          *insn_ptr = ready_remove (ready, index);
          return 0;
        }
    }
}

2117 /* Use forward list scheduling to rearrange insns of block pointed to by
2118 TARGET_BB, possibly bringing insns from subsequent blocks in the same
2119 region. */
2121 void
2122 schedule_block (basic_block *target_bb, int rgn_n_insns1)
2124 struct ready_list ready;
2125 int i, first_cycle_insn_p;
2126 int can_issue_more;
2127 state_t temp_state = NULL; /* It is used for multipass scheduling. */
2128 int sort_p, advance, start_clock_var;
2130 /* Head/tail info for this block. */
2131 rtx prev_head = current_sched_info->prev_head;
2132 rtx next_tail = current_sched_info->next_tail;
2133 rtx head = NEXT_INSN (prev_head);
2134 rtx tail = PREV_INSN (next_tail);
2136 /* We used to have code to avoid getting parameters moved from hard
2137 argument registers into pseudos.
2139 However, it was removed when it proved to be of marginal benefit
2140 and caused problems because schedule_block and compute_forward_dependences
2141 had different notions of what the "head" insn was. */
2143 gcc_assert (head != tail || INSN_P (head));
2145 haifa_recovery_bb_recently_added_p = false;
2147 /* Debug info. */
2148 if (sched_verbose)
2149 dump_new_block_header (0, *target_bb, head, tail);
2151 state_reset (curr_state);
2153 /* Allocate the ready list. */
2154 readyp = &ready;
2155 ready.vec = NULL;
2156 ready_try = NULL;
2157 choice_stack = NULL;
2159 rgn_n_insns = -1;
2160 extend_ready (rgn_n_insns1 + 1);
2162 ready.first = ready.veclen - 1;
2163 ready.n_ready = 0;
2165 /* It is used for first cycle multipass scheduling. */
2166 temp_state = alloca (dfa_state_size);
2168 if (targetm.sched.md_init)
2169 targetm.sched.md_init (sched_dump, sched_verbose, ready.veclen);
2171 /* We start inserting insns after PREV_HEAD. */
2172 last_scheduled_insn = prev_head;
2174 gcc_assert (NOTE_P (last_scheduled_insn)
2175 && BLOCK_FOR_INSN (last_scheduled_insn) == *target_bb);
2177 /* Initialize INSN_QUEUE. Q_SIZE is the total number of insns in the
2178 queue. */
2179 q_ptr = 0;
2180 q_size = 0;
2182 insn_queue = alloca ((max_insn_queue_index + 1) * sizeof (rtx));
2183 memset (insn_queue, 0, (max_insn_queue_index + 1) * sizeof (rtx));
2185 /* Start just before the beginning of time. */
2186 clock_var = -1;
  /* The queue, the ready list and clock_var must be initialized before
     try_ready () runs (it is called through init_ready_list ()).  */
2190 (*current_sched_info->init_ready_list) ();
2192 /* The algorithm is O(n^2) in the number of ready insns at any given
2193 time in the worst case. Before reload we are more likely to have
2194 big lists so truncate them to a reasonable size. */
2195 if (!reload_completed && ready.n_ready > MAX_SCHED_READY_INSNS)
2197 ready_sort (&ready);
2199 /* Find first free-standing insn past MAX_SCHED_READY_INSNS. */
2200 for (i = MAX_SCHED_READY_INSNS; i < ready.n_ready; i++)
2201 if (!SCHED_GROUP_P (ready_element (&ready, i)))
2202 break;
2204 if (sched_verbose >= 2)
2206 fprintf (sched_dump,
2207 ";;\t\tReady list on entry: %d insns\n", ready.n_ready);
2208 fprintf (sched_dump,
2209 ";;\t\t before reload => truncated to %d insns\n", i);
2212 /* Delay all insns past it for 1 cycle. If debug counter is
2213 activated make an exception for the insn right after
2214 last_scheduled_insn. */
2216 rtx skip_insn;
2218 if (dbg_cnt (sched_insn) == false)
2219 skip_insn = next_nonnote_insn (last_scheduled_insn);
2220 else
2221 skip_insn = NULL_RTX;
2223 while (i < ready.n_ready)
2225 rtx insn;
2227 insn = ready_remove (&ready, i);
2229 if (insn != skip_insn)
2230 queue_insn (insn, 1);
2235 /* Now we can restore basic block notes and maintain precise cfg. */
2236 restore_bb_notes (*target_bb);
2238 last_clock_var = -1;
2240 advance = 0;
2242 sort_p = TRUE;
2243 /* Loop until all the insns in BB are scheduled. */
2244 while ((*current_sched_info->schedule_more_p) ())
      do
        {
          start_clock_var = clock_var;

          clock_var++;

          advance_one_cycle ();

          /* Add to the ready list all pending insns that can be issued now.
             If there are no ready insns, increment clock until one
             is ready and add all pending insns at that point to the ready
             list.  */
          queue_to_ready (&ready);

          gcc_assert (ready.n_ready);

          if (sched_verbose >= 2)
            {
              fprintf (sched_dump, ";;\t\tReady list after queue_to_ready: ");
              debug_ready_list (&ready);
            }
          advance -= clock_var - start_clock_var;
        }
      while (advance > 0);
2271 if (sort_p)
2273 /* Sort the ready list based on priority. */
2274 ready_sort (&ready);
2276 if (sched_verbose >= 2)
2278 fprintf (sched_dump, ";;\t\tReady list after ready_sort: ");
2279 debug_ready_list (&ready);
2283 /* Allow the target to reorder the list, typically for
2284 better instruction bundling. */
2285 if (sort_p && targetm.sched.reorder
2286 && (ready.n_ready == 0
2287 || !SCHED_GROUP_P (ready_element (&ready, 0))))
2288 can_issue_more =
2289 targetm.sched.reorder (sched_dump, sched_verbose,
2290 ready_lastpos (&ready),
2291 &ready.n_ready, clock_var);
2292 else
2293 can_issue_more = issue_rate;
2295 first_cycle_insn_p = 1;
2296 cycle_issued_insns = 0;
2297 for (;;)
2299 rtx insn;
2300 int cost;
2301 bool asm_p = false;
2303 if (sched_verbose >= 2)
2305 fprintf (sched_dump, ";;\tReady list (t = %3d): ",
2306 clock_var);
2307 debug_ready_list (&ready);
2310 if (ready.n_ready == 0
2311 && can_issue_more
2312 && reload_completed)
2314 /* Allow scheduling insns directly from the queue in case
2315 there's nothing better to do (ready list is empty) but
2316 there are still vacant dispatch slots in the current cycle. */
2317 if (sched_verbose >= 6)
2318 fprintf (sched_dump,";;\t\tSecond chance\n");
2319 memcpy (temp_state, curr_state, dfa_state_size);
2320 if (early_queue_to_ready (temp_state, &ready))
2321 ready_sort (&ready);
2324 if (ready.n_ready == 0 || !can_issue_more
2325 || state_dead_lock_p (curr_state)
2326 || !(*current_sched_info->schedule_more_p) ())
2327 break;
2329 /* Select and remove the insn from the ready list. */
2330 if (sort_p)
2332 int res;
2334 insn = NULL_RTX;
2335 res = choose_ready (&ready, &insn);
2337 if (res < 0)
2338 /* Finish cycle. */
2339 break;
2340 if (res > 0)
2341 /* Restart choose_ready (). */
2342 continue;
2344 gcc_assert (insn != NULL_RTX);
2346 else
2347 insn = ready_remove_first (&ready);
2349 if (targetm.sched.dfa_new_cycle
2350 && targetm.sched.dfa_new_cycle (sched_dump, sched_verbose,
2351 insn, last_clock_var,
2352 clock_var, &sort_p))
2353 /* SORT_P is used by the target to override sorting
2354 of the ready list. This is needed when the target
2355 has modified its internal structures expecting that
2356 the insn will be issued next. As we need the insn
2357 to have the highest priority (so it will be returned by
2358 the ready_remove_first call above), we invoke
2359 ready_add (&ready, insn, true).
2360 But, still, there is one issue: INSN can be later
2361 discarded by scheduler's front end through
2362 current_sched_info->can_schedule_ready_p, hence, won't
2363 be issued next. */
2365 ready_add (&ready, insn, true);
2366 break;
2369 sort_p = TRUE;
2370 memcpy (temp_state, curr_state, dfa_state_size);
2371 if (recog_memoized (insn) < 0)
2373 asm_p = (GET_CODE (PATTERN (insn)) == ASM_INPUT
2374 || asm_noperands (PATTERN (insn)) >= 0);
2375 if (!first_cycle_insn_p && asm_p)
                /* This is an asm insn that we are trying to issue on a
                   cycle other than the first.  Issue it on the next cycle.  */
2378 cost = 1;
2379 else
2380 /* A USE insn, or something else we don't need to
2381 understand. We can't pass these directly to
2382 state_transition because it will trigger a
2383 fatal error for unrecognizable insns. */
2384 cost = 0;
2386 else
2388 cost = state_transition (temp_state, insn);
2389 if (cost < 0)
2390 cost = 0;
2391 else if (cost == 0)
2392 cost = 1;
2395 if (cost >= 1)
2397 queue_insn (insn, cost);
2398 if (SCHED_GROUP_P (insn))
2400 advance = cost;
2401 break;
2404 continue;
2407 if (current_sched_info->can_schedule_ready_p
2408 && ! (*current_sched_info->can_schedule_ready_p) (insn))
2409 /* We normally get here only if we don't want to move
2410 insn from the split block. */
2412 TODO_SPEC (insn) = (TODO_SPEC (insn) & ~SPECULATIVE) | HARD_DEP;
2413 continue;
2416 /* DECISION is made. */
2418 if (TODO_SPEC (insn) & SPECULATIVE)
2419 generate_recovery_code (insn);
2421 if (control_flow_insn_p (last_scheduled_insn)
              /* This is used to switch basic blocks by request
                 from scheduler front-end (actually, sched-ebb.c only).
                 This is used to process blocks with a single fallthru
                 edge.  If the succeeding block has a jump, the jump will
                 try to move to the end of the current bb, thus corrupting
                 the CFG.  */
2427 || current_sched_info->advance_target_bb (*target_bb, insn))
2429 *target_bb = current_sched_info->advance_target_bb
2430 (*target_bb, 0);
2432 if (sched_verbose)
2434 rtx x;
2436 x = next_real_insn (last_scheduled_insn);
2437 gcc_assert (x);
2438 dump_new_block_header (1, *target_bb, x, tail);
2441 last_scheduled_insn = bb_note (*target_bb);
2444 /* Update counters, etc in the scheduler's front end. */
2445 (*current_sched_info->begin_schedule_ready) (insn,
2446 last_scheduled_insn);
2448 move_insn (insn);
2449 last_scheduled_insn = insn;
2451 if (memcmp (curr_state, temp_state, dfa_state_size) != 0)
2453 cycle_issued_insns++;
2454 memcpy (curr_state, temp_state, dfa_state_size);
2457 if (targetm.sched.variable_issue)
2458 can_issue_more =
2459 targetm.sched.variable_issue (sched_dump, sched_verbose,
2460 insn, can_issue_more);
2461 /* A naked CLOBBER or USE generates no instruction, so do
2462 not count them against the issue rate. */
2463 else if (GET_CODE (PATTERN (insn)) != USE
2464 && GET_CODE (PATTERN (insn)) != CLOBBER)
2465 can_issue_more--;
2467 advance = schedule_insn (insn);
2469 /* After issuing an asm insn we should start a new cycle. */
2470 if (advance == 0 && asm_p)
2471 advance = 1;
2472 if (advance != 0)
2473 break;
2475 first_cycle_insn_p = 0;
2477 /* Sort the ready list based on priority. This must be
2478 redone here, as schedule_insn may have readied additional
2479 insns that will not be sorted correctly. */
2480 if (ready.n_ready > 0)
2481 ready_sort (&ready);
2483 if (targetm.sched.reorder2
2484 && (ready.n_ready == 0
2485 || !SCHED_GROUP_P (ready_element (&ready, 0))))
2487 can_issue_more =
2488 targetm.sched.reorder2 (sched_dump, sched_verbose,
2489 ready.n_ready
2490 ? ready_lastpos (&ready) : NULL,
2491 &ready.n_ready, clock_var);
2496 /* Debug info. */
2497 if (sched_verbose)
2499 fprintf (sched_dump, ";;\tReady list (final): ");
2500 debug_ready_list (&ready);
2503 if (current_sched_info->queue_must_finish_empty)
2504 /* Sanity check -- queue must be empty now. Meaningless if region has
2505 multiple bbs. */
2506 gcc_assert (!q_size && !ready.n_ready);
2507 else
2509 /* We must maintain QUEUE_INDEX between blocks in region. */
2510 for (i = ready.n_ready - 1; i >= 0; i--)
2512 rtx x;
2514 x = ready_element (&ready, i);
2515 QUEUE_INDEX (x) = QUEUE_NOWHERE;
2516 TODO_SPEC (x) = (TODO_SPEC (x) & ~SPECULATIVE) | HARD_DEP;
2519 if (q_size)
2520 for (i = 0; i <= max_insn_queue_index; i++)
2522 rtx link;
2523 for (link = insn_queue[i]; link; link = XEXP (link, 1))
2525 rtx x;
2527 x = XEXP (link, 0);
2528 QUEUE_INDEX (x) = QUEUE_NOWHERE;
2529 TODO_SPEC (x) = (TODO_SPEC (x) & ~SPECULATIVE) | HARD_DEP;
2531 free_INSN_LIST_list (&insn_queue[i]);
2535 if (!current_sched_info->queue_must_finish_empty
2536 || haifa_recovery_bb_recently_added_p)
2538 /* INSN_TICK (minimum clock tick at which the insn becomes
2539 ready) may be not correct for the insn in the subsequent
2540 blocks of the region. We should use a correct value of
2541 `clock_var' or modify INSN_TICK. It is better to keep
2542 clock_var value equal to 0 at the start of a basic block.
2543 Therefore we modify INSN_TICK here. */
2544 fix_inter_tick (NEXT_INSN (prev_head), last_scheduled_insn);
2547 if (targetm.sched.md_finish)
2549 targetm.sched.md_finish (sched_dump, sched_verbose);
      /* The target might have added some instructions to the scheduled
         block in its md_finish () hook.  These new insns don't have any
         data initialized, and to identify them we extend h_i_d so that
         they'll get zero luids.  */
2555 extend_h_i_d ();
2558 /* Update head/tail boundaries. */
2559 head = NEXT_INSN (prev_head);
2560 tail = last_scheduled_insn;
2562 /* Restore-other-notes: NOTE_LIST is the end of a chain of notes
2563 previously found among the insns. Insert them at the beginning
2564 of the insns. */
2565 if (note_list != 0)
2567 basic_block head_bb = BLOCK_FOR_INSN (head);
2568 rtx note_head = note_list;
2570 while (PREV_INSN (note_head))
2572 set_block_for_insn (note_head, head_bb);
2573 note_head = PREV_INSN (note_head);
      /* The loop above missed this note: */
2576 set_block_for_insn (note_head, head_bb);
2578 PREV_INSN (note_head) = PREV_INSN (head);
2579 NEXT_INSN (PREV_INSN (head)) = note_head;
2580 PREV_INSN (head) = note_list;
2581 NEXT_INSN (note_list) = head;
2582 head = note_head;
2585 /* Debugging. */
2586 if (sched_verbose)
2588 fprintf (sched_dump, ";; total time = %d\n;; new head = %d\n",
2589 clock_var, INSN_UID (head));
2590 fprintf (sched_dump, ";; new tail = %d\n\n",
2591 INSN_UID (tail));
2594 current_sched_info->head = head;
2595 current_sched_info->tail = tail;
2597 free (ready.vec);
2599 free (ready_try);
2600 for (i = 0; i <= rgn_n_insns; i++)
2601 free (choice_stack [i].state);
2602 free (choice_stack);
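
/* Illustration only, not part of haifa-sched.c: a minimal sketch of how
   the inner loop of schedule_block turns the result of state_transition ()
   into a queue delay.  The name toy_queue_delay and its parameter are
   hypothetical; the reading of a negative result as "fits in the current
   cycle" is taken from the way the code above resets such a cost to 0.  */

```c
#include <assert.h>

/* Toy version of the cost handling around state_transition ().
   ISSUE_RESULT is a hypothetical stand-in for the automaton's answer:
   negative is treated as "issue in the current cycle", zero as
   "retry on the next cycle", and a positive value as a wait in cycles.  */
static int
toy_queue_delay (int issue_result)
{
  if (issue_result < 0)
    return 0;   /* Issue now; nothing to queue.  */
  if (issue_result == 0)
    return 1;   /* Retry one cycle later.  */
  return issue_result;
}
```

/* A result of 0 from this sketch corresponds to the path that falls
   through to move_insn (); any result >= 1 corresponds to the
   queue_insn (insn, cost) path.  */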
/* Set_priorities: compute priority of each insn in the block.
   Return the number of insns whose priority was computed.  */

int
set_priorities (rtx head, rtx tail)
{
  rtx insn;
  int n_insn;
  int sched_max_insns_priority =
    current_sched_info->sched_max_insns_priority;
  rtx prev_head;

  if (head == tail && (! INSN_P (head)))
    return 0;

  n_insn = 0;

  prev_head = PREV_INSN (head);
  for (insn = tail; insn != prev_head; insn = PREV_INSN (insn))
    {
      if (!INSN_P (insn))
        continue;

      n_insn++;
      (void) priority (insn);

      gcc_assert (INSN_PRIORITY_KNOWN (insn));

      sched_max_insns_priority = MAX (sched_max_insns_priority,
                                      INSN_PRIORITY (insn));
    }

  current_sched_info->sched_max_insns_priority = sched_max_insns_priority;

  return n_insn;
}
2641 /* Next LUID to assign to an instruction. */
2642 static int luid;
2644 /* Initialize some global state for the scheduler. */
2646 void
2647 sched_init (void)
2649 basic_block b;
2650 rtx insn;
2651 int i;
2653 /* Switch to working copy of sched_info. */
2654 memcpy (&current_sched_info_var, current_sched_info,
2655 sizeof (current_sched_info_var));
2656 current_sched_info = &current_sched_info_var;
2658 /* Disable speculative loads in their presence if cc0 defined. */
2659 #ifdef HAVE_cc0
2660 flag_schedule_speculative_load = 0;
2661 #endif
2663 /* Set dump and sched_verbose for the desired debugging output. If no
2664 dump-file was specified, but -fsched-verbose=N (any N), print to stderr.
2665 For -fsched-verbose=N, N>=10, print everything to stderr. */
2666 sched_verbose = sched_verbose_param;
2667 if (sched_verbose_param == 0 && dump_file)
2668 sched_verbose = 1;
2669 sched_dump = ((sched_verbose_param >= 10 || !dump_file)
2670 ? stderr : dump_file);
2672 /* Initialize SPEC_INFO. */
2673 if (targetm.sched.set_sched_flags)
2675 spec_info = &spec_info_var;
2676 targetm.sched.set_sched_flags (spec_info);
2677 if (current_sched_info->flags & DO_SPECULATION)
2678 spec_info->weakness_cutoff =
2679 (PARAM_VALUE (PARAM_SCHED_SPEC_PROB_CUTOFF) * MAX_DEP_WEAK) / 100;
2680 else
2681 /* So we won't read anything accidentally. */
2682 spec_info = 0;
2684 else
2685 /* So we won't read anything accidentally. */
2686 spec_info = 0;
2688 /* Initialize issue_rate. */
2689 if (targetm.sched.issue_rate)
2690 issue_rate = targetm.sched.issue_rate ();
2691 else
2692 issue_rate = 1;
2694 if (cached_issue_rate != issue_rate)
2696 cached_issue_rate = issue_rate;
2697 /* To invalidate max_lookahead_tries: */
2698 cached_first_cycle_multipass_dfa_lookahead = 0;
2701 old_max_uid = 0;
2702 h_i_d = 0;
2703 extend_h_i_d ();
2705 for (i = 0; i < old_max_uid; i++)
2707 h_i_d[i].cost = -1;
2708 h_i_d[i].todo_spec = HARD_DEP;
2709 h_i_d[i].queue_index = QUEUE_NOWHERE;
2710 h_i_d[i].tick = INVALID_TICK;
2711 h_i_d[i].inter_tick = INVALID_TICK;
2714 if (targetm.sched.init_dfa_pre_cycle_insn)
2715 targetm.sched.init_dfa_pre_cycle_insn ();
2717 if (targetm.sched.init_dfa_post_cycle_insn)
2718 targetm.sched.init_dfa_post_cycle_insn ();
2720 dfa_start ();
2721 dfa_state_size = state_size ();
2722 curr_state = xmalloc (dfa_state_size);
2724 h_i_d[0].luid = 0;
2725 luid = 1;
2726 FOR_EACH_BB (b)
2727 for (insn = BB_HEAD (b); ; insn = NEXT_INSN (insn))
2729 INSN_LUID (insn) = luid;
2731 /* Increment the next luid, unless this is a note. We don't
2732 really need separate IDs for notes and we don't want to
2733 schedule differently depending on whether or not there are
2734 line-number notes, i.e., depending on whether or not we're
2735 generating debugging information. */
2736 if (!NOTE_P (insn))
2737 ++luid;
2739 if (insn == BB_END (b))
2740 break;
2743 init_dependency_caches (luid);
2745 init_alias_analysis ();
2747 old_last_basic_block = 0;
2748 extend_bb ();
2750 /* Compute INSN_REG_WEIGHT for all blocks. We must do this before
2751 removing death notes. */
2752 FOR_EACH_BB_REVERSE (b)
2753 find_insn_reg_weight (b);
2755 if (targetm.sched.md_init_global)
2756 targetm.sched.md_init_global (sched_dump, sched_verbose, old_max_uid);
2758 nr_begin_data = nr_begin_control = nr_be_in_data = nr_be_in_control = 0;
2759 before_recovery = 0;
2761 haifa_recovery_bb_ever_added_p = false;
2763 #ifdef ENABLE_CHECKING
2764 /* This is used preferably for finding bugs in check_cfg () itself. */
2765 check_cfg (0, 0);
2766 #endif
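
/* Illustration only, not part of haifa-sched.c: a minimal sketch of the
   LUID numbering done in sched_init, where the counter advances only on
   non-note insns so that notes do not change the numbering of real insns.
   The function toy_assign_luids and its array-based interface are
   hypothetical stand-ins for the walk over the insn chain.  */

```c
#include <assert.h>
#include <stddef.h>

/* Assign a LUID to each of N stream elements.  IS_NOTE[i] is nonzero
   when element i is a note.  Every element receives the current counter
   value, but only non-notes advance it, so a note shares its LUID with
   the next non-note element.  Returns the next unused LUID.  */
static int
toy_assign_luids (const int *is_note, int *luid_out, size_t n)
{
  int luid = 1;   /* LUID 0 is reserved, as in sched_init.  */
  size_t i;

  assert (n == 0 || (is_note != NULL && luid_out != NULL));

  for (i = 0; i < n; i++)
    {
      luid_out[i] = luid;
      if (!is_note[i])
        ++luid;
    }
  return luid;
}
```

/* With the stream insn, note, note, insn, insn the assigned LUIDs are
   1, 2, 2, 2, 3 and the returned value is 4, which is the kind of count
   that sched_init passes on to init_dependency_caches ().  */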
2769 /* Free global data used during insn scheduling. */
2771 void
2772 sched_finish (void)
2774 free (h_i_d);
2775 free (curr_state);
2776 dfa_finish ();
2777 free_dependency_caches ();
2778 end_alias_analysis ();
2780 if (targetm.sched.md_finish_global)
2781 targetm.sched.md_finish_global (sched_dump, sched_verbose);
2783 if (spec_info && spec_info->dump)
2785 char c = reload_completed ? 'a' : 'b';
2787 fprintf (spec_info->dump,
2788 ";; %s:\n", current_function_name ());
2790 fprintf (spec_info->dump,
2791 ";; Procedure %cr-begin-data-spec motions == %d\n",
2792 c, nr_begin_data);
2793 fprintf (spec_info->dump,
2794 ";; Procedure %cr-be-in-data-spec motions == %d\n",
2795 c, nr_be_in_data);
2796 fprintf (spec_info->dump,
2797 ";; Procedure %cr-begin-control-spec motions == %d\n",
2798 c, nr_begin_control);
2799 fprintf (spec_info->dump,
2800 ";; Procedure %cr-be-in-control-spec motions == %d\n",
2801 c, nr_be_in_control);
2804 #ifdef ENABLE_CHECKING
2805 /* After reload ia64 backend clobbers CFG, so can't check anything. */
2806 if (!reload_completed)
2807 check_cfg (0, 0);
2808 #endif
2810 current_sched_info = NULL;
2813 /* Fix INSN_TICKs of the instructions in the current block as well as
2814 INSN_TICKs of their dependents.
2815 HEAD and TAIL are the begin and the end of the current scheduled block. */
2816 static void
2817 fix_inter_tick (rtx head, rtx tail)
2819 /* Set of instructions with corrected INSN_TICK. */
2820 bitmap_head processed;
  /* ??? It is doubtful if we should assume that cycle advance happens on
     basic block boundaries.  Basically insns that are unconditionally ready
     at the start of the block are preferable to those which have
     a one cycle dependency on an insn from the previous block.  */
2825 int next_clock = clock_var + 1;
2827 bitmap_initialize (&processed, 0);
  /* Iterate over the scheduled instructions and fix their INSN_TICKs and
     the INSN_TICKs of dependent instructions, so that INSN_TICKs are
     consistent across different blocks.  */
2832 for (tail = NEXT_INSN (tail); head != tail; head = NEXT_INSN (head))
2834 if (INSN_P (head))
2836 int tick;
2837 sd_iterator_def sd_it;
2838 dep_t dep;
2840 tick = INSN_TICK (head);
2841 gcc_assert (tick >= MIN_TICK);
2843 /* Fix INSN_TICK of instruction from just scheduled block. */
2844 if (!bitmap_bit_p (&processed, INSN_LUID (head)))
2846 bitmap_set_bit (&processed, INSN_LUID (head));
2847 tick -= next_clock;
2849 if (tick < MIN_TICK)
2850 tick = MIN_TICK;
2852 INSN_TICK (head) = tick;
2855 FOR_EACH_DEP (head, SD_LIST_RES_FORW, sd_it, dep)
2857 rtx next;
2859 next = DEP_CON (dep);
2860 tick = INSN_TICK (next);
2862 if (tick != INVALID_TICK
2863 /* If NEXT has its INSN_TICK calculated, fix it.
2864 If not - it will be properly calculated from
2865 scratch later in fix_tick_ready. */
2866 && !bitmap_bit_p (&processed, INSN_LUID (next)))
2868 bitmap_set_bit (&processed, INSN_LUID (next));
2869 tick -= next_clock;
2871 if (tick < MIN_TICK)
2872 tick = MIN_TICK;
2874 if (tick > INTER_TICK (next))
2875 INTER_TICK (next) = tick;
2876 else
2877 tick = INTER_TICK (next);
2879 INSN_TICK (next) = tick;
2884 bitmap_clear (&processed);
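
/* Illustration only, not part of haifa-sched.c: a minimal sketch of the
   rebasing that fix_inter_tick applies to each tick.  A tick measured on
   the clock of the block just scheduled is shifted so that the next block
   can start again at clock 0, and is clamped from below.  TOY_MIN_TICK
   and toy_rebase_tick are hypothetical; the real MIN_TICK is defined
   outside this section.  */

```c
#include <assert.h>

/* Hypothetical lower bound for a tick value.  */
enum { TOY_MIN_TICK = -8 };

/* Rebase TICK, expressed on the clock of a block that finished at cycle
   BLOCK_END_CLOCK, onto the clock of the following block.  */
static int
toy_rebase_tick (int tick, int block_end_clock)
{
  int next_clock = block_end_clock + 1;

  assert (tick >= TOY_MIN_TICK);

  tick -= next_clock;
  return tick < TOY_MIN_TICK ? TOY_MIN_TICK : tick;
}
```

/* For a block that ended at cycle 9, a tick of 12 becomes 2 (the insn is
   ready two cycles into the next block), while a tick of 0 would become
   -10 and is clamped to the lower bound.  */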
/* Check if NEXT is ready to be added to the ready or queue list.
   If "yes", add it to the proper list.
   Returns:
      -1 - is not ready yet,
       0 - added to the ready list,
   0 < N - queued for N cycles.  */
int
try_ready (rtx next)
2896 ds_t old_ts, *ts;
2898 ts = &TODO_SPEC (next);
2899 old_ts = *ts;
2901 gcc_assert (!(old_ts & ~(SPECULATIVE | HARD_DEP))
2902 && ((old_ts & HARD_DEP)
2903 || (old_ts & SPECULATIVE)));
2905 if (sd_lists_empty_p (next, SD_LIST_BACK))
2906 /* NEXT has all its dependencies resolved. */
2908 /* Remove HARD_DEP bit from NEXT's status. */
2909 *ts &= ~HARD_DEP;
2911 if (current_sched_info->flags & DO_SPECULATION)
2912 /* Remove all speculative bits from NEXT's status. */
2913 *ts &= ~SPECULATIVE;
2915 else
2917 /* One of the NEXT's dependencies has been resolved.
2918 Recalculate NEXT's status. */
2920 *ts &= ~SPECULATIVE & ~HARD_DEP;
2922 if (sd_lists_empty_p (next, SD_LIST_HARD_BACK))
2923 /* Now we've got NEXT with speculative deps only.
2924 1. Look at the deps to see what we have to do.
2925 2. Check if we can do 'todo'. */
2927 sd_iterator_def sd_it;
2928 dep_t dep;
2929 bool first_p = true;
2931 FOR_EACH_DEP (next, SD_LIST_BACK, sd_it, dep)
2933 ds_t ds = DEP_STATUS (dep) & SPECULATIVE;
2935 if (first_p)
2937 first_p = false;
2939 *ts = ds;
2941 else
2942 *ts = ds_merge (*ts, ds);
2945 if (dep_weak (*ts) < spec_info->weakness_cutoff)
2946 /* Too few points. */
2947 *ts = (*ts & ~SPECULATIVE) | HARD_DEP;
2949 else
2950 *ts |= HARD_DEP;
2953 if (*ts & HARD_DEP)
2954 gcc_assert (*ts == old_ts
2955 && QUEUE_INDEX (next) == QUEUE_NOWHERE);
2956 else if (current_sched_info->new_ready)
2957 *ts = current_sched_info->new_ready (next, *ts);
2959 /* * if !(old_ts & SPECULATIVE) (e.g. HARD_DEP or 0), then insn might
2960 have its original pattern or changed (speculative) one. This is due
2961 to changing ebb in region scheduling.
2962 * But if (old_ts & SPECULATIVE), then we are pretty sure that insn
2963 has speculative pattern.
2965 We can't assert (!(*ts & HARD_DEP) || *ts == old_ts) here because
2966 control-speculative NEXT could have been discarded by sched-rgn.c
2967 (the same case as when discarded by can_schedule_ready_p ()). */
2969 if ((*ts & SPECULATIVE)
2970 /* If (old_ts == *ts), then (old_ts & SPECULATIVE) and we don't
2971 need to change anything. */
2972 && *ts != old_ts)
2974 int res;
2975 rtx new_pat;
2977 gcc_assert ((*ts & SPECULATIVE) && !(*ts & ~SPECULATIVE));
2979 res = speculate_insn (next, *ts, &new_pat);
2981 switch (res)
2983 case -1:
2984 /* It would be nice to change DEP_STATUS of all dependences,
2985 which have ((DEP_STATUS & SPECULATIVE) == *ts) to HARD_DEP,
2986 so we won't reanalyze anything. */
2987 *ts = (*ts & ~SPECULATIVE) | HARD_DEP;
2988 break;
2990 case 0:
2991 /* We follow the rule, that every speculative insn
2992 has non-null ORIG_PAT. */
2993 if (!ORIG_PAT (next))
2994 ORIG_PAT (next) = PATTERN (next);
2995 break;
2997 case 1:
2998 if (!ORIG_PAT (next))
            /* If we are going to overwrite the original pattern of the
               insn, save it.  */
3001 ORIG_PAT (next) = PATTERN (next);
3003 change_pattern (next, new_pat);
3004 break;
3006 default:
3007 gcc_unreachable ();
3011 /* We need to restore pattern only if (*ts == 0), because otherwise it is
3012 either correct (*ts & SPECULATIVE),
3013 or we simply don't care (*ts & HARD_DEP). */
3015 gcc_assert (!ORIG_PAT (next)
3016 || !IS_SPECULATION_BRANCHY_CHECK_P (next));
3018 if (*ts & HARD_DEP)
3020 /* We can't assert (QUEUE_INDEX (next) == QUEUE_NOWHERE) here because
3021 control-speculative NEXT could have been discarded by sched-rgn.c
3022 (the same case as when discarded by can_schedule_ready_p ()). */
3023 /*gcc_assert (QUEUE_INDEX (next) == QUEUE_NOWHERE);*/
3025 change_queue_index (next, QUEUE_NOWHERE);
3026 return -1;
3028 else if (!(*ts & BEGIN_SPEC) && ORIG_PAT (next) && !IS_SPECULATION_CHECK_P (next))
    /* We should restore the pattern of every previously speculative
       instruction; we determine whether NEXT was speculative by checking
       its ORIG_PAT field.  There is one exception: speculation checks
       have ORIG_PAT set too, so skip them.  */
3034 change_pattern (next, ORIG_PAT (next));
3035 ORIG_PAT (next) = 0;
3038 if (sched_verbose >= 2)
3040 int s = TODO_SPEC (next);
3042 fprintf (sched_dump, ";;\t\tdependencies resolved: insn %s",
3043 (*current_sched_info->print_insn) (next, 0));
3045 if (spec_info && spec_info->dump)
3047 if (s & BEGIN_DATA)
3048 fprintf (spec_info->dump, "; data-spec;");
3049 if (s & BEGIN_CONTROL)
3050 fprintf (spec_info->dump, "; control-spec;");
3051 if (s & BE_IN_CONTROL)
3052 fprintf (spec_info->dump, "; in-control-spec;");
3055 fprintf (sched_dump, "\n");
3058 adjust_priority (next);
3060 return fix_tick_ready (next);
/* Calculate INSN_TICK of NEXT and add it to either ready or queue list.  */
static int
fix_tick_ready (rtx next)
{
  int tick, delay;

  if (!sd_lists_empty_p (next, SD_LIST_RES_BACK))
    {
      int full_p;
      sd_iterator_def sd_it;
      dep_t dep;

      tick = INSN_TICK (next);
      /* If tick is not equal to INVALID_TICK, then update
         INSN_TICK of NEXT with the most recent resolved dependence
         cost.  Otherwise, recalculate from scratch.  */
      full_p = (tick == INVALID_TICK);

      FOR_EACH_DEP (next, SD_LIST_RES_BACK, sd_it, dep)
        {
          rtx pro = DEP_PRO (dep);
          int tick1;

          gcc_assert (INSN_TICK (pro) >= MIN_TICK);

          tick1 = INSN_TICK (pro) + dep_cost (dep);
          if (tick1 > tick)
            tick = tick1;

          if (!full_p)
            break;
        }
    }
  else
    tick = -1;

  INSN_TICK (next) = tick;

  delay = tick - clock_var;
  if (delay <= 0)
    delay = QUEUE_READY;

  change_queue_index (next, delay);

  return delay;
}
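
/* Illustration only, not part of haifa-sched.c: a minimal sketch of the
   full recalculation done by fix_tick_ready.  The earliest issue cycle of
   a consumer is the maximum of (producer tick + dependence cost) over its
   resolved backward dependences, and the delay is that tick minus the
   current clock.  struct toy_dep and both functions are hypothetical, and
   the sketch returns 0 where the real code returns QUEUE_READY.  */

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for one resolved backward dependence.  */
struct toy_dep
{
  int producer_tick;  /* Cycle at which the producer was issued.  */
  int cost;           /* Latency from producer to consumer.  */
};

/* Return the maximum of producer_tick + cost over the N dependences in
   DEPS, or -1 when there are none (mirroring the "tick = -1" branch).
   For simplicity the sketch requires non-negative producer ticks.  */
static int
toy_earliest_tick (const struct toy_dep *deps, size_t n)
{
  int tick = -1;
  size_t i;

  for (i = 0; i < n; i++)
    {
      int t;

      assert (deps[i].producer_tick >= 0);
      t = deps[i].producer_tick + deps[i].cost;
      if (t > tick)
        tick = t;
    }
  return tick;
}

/* Return 0 when an insn with earliest cycle TICK can go straight to the
   ready list at cycle CLOCK, otherwise the number of cycles to wait.  */
static int
toy_delay (int tick, int clock)
{
  int delay = tick - clock;

  return delay <= 0 ? 0 : delay;
}
```

/* With producers issued at cycles 3 and 1 and latencies 2 and 5, the
   consumer's tick is 6; at clock 4 it is queued for 2 cycles, and at
   clock 6 it goes to the ready list.  */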
/* Move NEXT to the proper queue list with (DELAY >= 1),
   or add it to the ready list (DELAY == QUEUE_READY),
   or remove it from the ready and queue lists altogether
   (DELAY == QUEUE_NOWHERE).  */
static void
change_queue_index (rtx next, int delay)
{
  int i = QUEUE_INDEX (next);

  gcc_assert (QUEUE_NOWHERE <= delay && delay <= max_insn_queue_index
              && delay != 0);
  gcc_assert (i != QUEUE_SCHEDULED);

  if ((delay > 0 && NEXT_Q_AFTER (q_ptr, delay) == i)
      || (delay < 0 && delay == i))
    /* We have nothing to do.  */
    return;

  /* Remove NEXT from wherever it is now.  */
  if (i == QUEUE_READY)
    ready_remove_insn (next);
  else if (i >= 0)
    queue_remove (next);

  /* Add it to the proper place.  */
  if (delay == QUEUE_READY)
    ready_add (readyp, next, false);
  else if (delay >= 1)
    queue_insn (next, delay);

  if (sched_verbose >= 2)
    {
      fprintf (sched_dump, ";;\t\ttick updated: insn %s",
               (*current_sched_info->print_insn) (next, 0));

      if (delay == QUEUE_READY)
        fprintf (sched_dump, " into ready\n");
      else if (delay >= 1)
        fprintf (sched_dump, " into queue with cost=%d\n", delay);
      else
        fprintf (sched_dump, " removed from ready or queue lists\n");
    }
}
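
/* Illustration only, not part of haifa-sched.c: a minimal sketch of how a
   slot in a circular insn queue can be found from the current queue
   pointer and a delay.  NEXT_Q_AFTER is defined outside this section; the
   masking form used below is an assumption, and it only wraps correctly
   when the maximum index is one less than a power of two.
   TOY_MAX_QUEUE_INDEX and toy_slot_after are hypothetical names.  */

```c
#include <assert.h>

/* Hypothetical largest queue index; must be 2^k - 1 so that "&" wraps.  */
enum { TOY_MAX_QUEUE_INDEX = 7 };

/* Return the queue slot that lies DELAY cycles after slot Q_PTR.  */
static int
toy_slot_after (int q_ptr, int delay)
{
  assert (q_ptr >= 0 && q_ptr <= TOY_MAX_QUEUE_INDEX);
  assert (delay >= 1 && delay <= TOY_MAX_QUEUE_INDEX);

  return (q_ptr + delay) & TOY_MAX_QUEUE_INDEX;
}
```

/* With eight slots, an insn queued for 3 cycles while the pointer is at
   slot 6 lands in slot 1.  The early return in change_queue_index
   corresponds to the case where this slot equals the insn's current
   QUEUE_INDEX.  */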
3153 /* Extend H_I_D data. */
3154 static void
3155 extend_h_i_d (void)
3157 /* We use LUID 0 for the fake insn (UID 0) which holds dependencies for
3158 pseudos which do not cross calls. */
3159 int new_max_uid = get_max_uid () + 1;
3161 h_i_d = xrecalloc (h_i_d, new_max_uid, old_max_uid, sizeof (*h_i_d));
3162 old_max_uid = new_max_uid;
3164 if (targetm.sched.h_i_d_extended)
3165 targetm.sched.h_i_d_extended ();
3168 /* Extend READY, READY_TRY and CHOICE_STACK arrays.
3169 N_NEW_INSNS is the number of additional elements to allocate. */
3170 static void
3171 extend_ready (int n_new_insns)
3173 int i;
3175 readyp->veclen = rgn_n_insns + n_new_insns + 1 + issue_rate;
3176 readyp->vec = XRESIZEVEC (rtx, readyp->vec, readyp->veclen);
3178 ready_try = xrecalloc (ready_try, rgn_n_insns + n_new_insns + 1,
3179 rgn_n_insns + 1, sizeof (char));
3181 rgn_n_insns += n_new_insns;
3183 choice_stack = XRESIZEVEC (struct choice_entry, choice_stack,
3184 rgn_n_insns + 1);
3186 for (i = rgn_n_insns; n_new_insns--; i--)
3187 choice_stack[i].state = xmalloc (dfa_state_size);
3190 /* Extend global scheduler structures (those, that live across calls to
3191 schedule_block) to include information about just emitted INSN. */
3192 static void
3193 extend_global (rtx insn)
3195 gcc_assert (INSN_P (insn));
3197 /* These structures have scheduler scope. */
3199 /* Init h_i_d. */
3200 extend_h_i_d ();
3201 init_h_i_d (insn);
3203 /* Init data handled in sched-deps.c. */
3204 sd_init_insn (insn);
3206 /* Extend dependency caches by one element. */
3207 extend_dependency_caches (1, false);
3210 /* Extends global and local scheduler structures to include information
3211 about just emitted INSN. */
3212 static void
3213 extend_all (rtx insn)
3215 extend_global (insn);
3217 /* These structures have block scope. */
3218 extend_ready (1);
3220 (*current_sched_info->add_remove_insn) (insn, 0);
3223 /* Initialize h_i_d entry of the new INSN with default values.
3224 Values, that are not explicitly initialized here, hold zero. */
3225 static void
3226 init_h_i_d (rtx insn)
3228 INSN_LUID (insn) = luid++;
3229 INSN_COST (insn) = -1;
3230 TODO_SPEC (insn) = HARD_DEP;
3231 QUEUE_INDEX (insn) = QUEUE_NOWHERE;
3232 INSN_TICK (insn) = INVALID_TICK;
3233 INTER_TICK (insn) = INVALID_TICK;
3234 find_insn_reg_weight1 (insn);
/* Generates recovery code for INSN.  */
static void
generate_recovery_code (rtx insn)
{
  if (TODO_SPEC (insn) & BEGIN_SPEC)
    begin_speculative_block (insn);

  /* Here we have insn with no dependencies to
     instructions other than CHECK_SPEC ones.  */

  if (TODO_SPEC (insn) & BE_IN_SPEC)
    add_to_speculative_block (insn);
}
/* Helper function.
   Tries to add speculative dependencies of type FS between TWIN and the
   consumers in the forward dependence list of INSN.  */
static void
process_insn_forw_deps_be_in_spec (rtx insn, rtx twin, ds_t fs)
{
  sd_iterator_def sd_it;
  dep_t dep;

  FOR_EACH_DEP (insn, SD_LIST_FORW, sd_it, dep)
    {
      ds_t ds;
      rtx consumer;

      consumer = DEP_CON (dep);

      ds = DEP_STATUS (dep);

      if (/* If we want to create speculative dep.  */
          fs
          /* And we can do that because this is a true dep.  */
          && (ds & DEP_TYPES) == DEP_TRUE)
        {
          gcc_assert (!(ds & BE_IN_SPEC));

          if (/* If this dep can be overcome with 'begin speculation'.  */
              ds & BEGIN_SPEC)
            /* Then we have a choice: keep the dep 'begin speculative'
               or transform it into 'be in speculative'.  */
            {
              if (/* In try_ready we assert that once an insn has become
                     ready it can be removed from the ready (or queue) list
                     only due to a backend decision.  Hence we can't let
                     the probability of the speculative dep decrease.  */
                  dep_weak (ds) <= dep_weak (fs))
                {
                  ds_t new_ds;

                  new_ds = (ds & ~BEGIN_SPEC) | fs;

                  if (/* consumer can 'be in speculative'.  */
                      sched_insn_is_legitimate_for_speculation_p (consumer,
                                                                  new_ds))
                    /* Transform it to be in speculative.  */
                    ds = new_ds;
                }
            }
          else
            /* Mark the dep as 'be in speculative'.  */
            ds |= fs;
        }

      {
        dep_def _new_dep, *new_dep = &_new_dep;

        init_dep_1 (new_dep, twin, consumer, DEP_TYPE (dep), ds);
        sd_add_dep (new_dep, false);
      }
    }
}
3312 /* Generates recovery code for BEGIN speculative INSN. */
3313 static void
3314 begin_speculative_block (rtx insn)
3316 if (TODO_SPEC (insn) & BEGIN_DATA)
3317 nr_begin_data++;
3318 if (TODO_SPEC (insn) & BEGIN_CONTROL)
3319 nr_begin_control++;
3321 create_check_block_twin (insn, false);
3323 TODO_SPEC (insn) &= ~BEGIN_SPEC;
3326 /* Generates recovery code for BE_IN speculative INSN. */
3327 static void
3328 add_to_speculative_block (rtx insn)
3330 ds_t ts;
3331 sd_iterator_def sd_it;
3332 dep_t dep;
3333 rtx twins = NULL;
3334 rtx_vec_t priorities_roots;
3336 ts = TODO_SPEC (insn);
3337 gcc_assert (!(ts & ~BE_IN_SPEC));
3339 if (ts & BE_IN_DATA)
3340 nr_be_in_data++;
3341 if (ts & BE_IN_CONTROL)
3342 nr_be_in_control++;
3344 TODO_SPEC (insn) &= ~BE_IN_SPEC;
3345 gcc_assert (!TODO_SPEC (insn));
3347 DONE_SPEC (insn) |= ts;
3349 /* First we convert all simple checks to branchy. */
3350 for (sd_it = sd_iterator_start (insn, SD_LIST_SPEC_BACK);
3351 sd_iterator_cond (&sd_it, &dep);)
3353 rtx check = DEP_PRO (dep);
3355 if (IS_SPECULATION_SIMPLE_CHECK_P (check))
3357 create_check_block_twin (check, true);
3359 /* Restart search. */
3360 sd_it = sd_iterator_start (insn, SD_LIST_SPEC_BACK);
3362 else
3363 /* Continue search. */
3364 sd_iterator_next (&sd_it);
3367 priorities_roots = NULL;
3368 clear_priorities (insn, &priorities_roots);
3370 while (1)
3372 rtx check, twin;
3373 basic_block rec;
3375 /* Get the first backward dependency of INSN. */
3376 sd_it = sd_iterator_start (insn, SD_LIST_SPEC_BACK);
3377 if (!sd_iterator_cond (&sd_it, &dep))
3378 /* INSN has no backward dependencies left. */
3379 break;
3381 gcc_assert ((DEP_STATUS (dep) & BEGIN_SPEC) == 0
3382 && (DEP_STATUS (dep) & BE_IN_SPEC) != 0
3383 && (DEP_STATUS (dep) & DEP_TYPES) == DEP_TRUE);
3385 check = DEP_PRO (dep);
3387 gcc_assert (!IS_SPECULATION_CHECK_P (check) && !ORIG_PAT (check)
3388 && QUEUE_INDEX (check) == QUEUE_NOWHERE);
3390 rec = BLOCK_FOR_INSN (check);
3392 twin = emit_insn_before (copy_insn (PATTERN (insn)), BB_END (rec));
3393 extend_global (twin);
3395 sd_copy_back_deps (twin, insn, true);
3397 if (sched_verbose && spec_info->dump)
3398 /* INSN_BB (insn) isn't determined for twin insns yet.
3399 So we can't use current_sched_info->print_insn. */
3400 fprintf (spec_info->dump, ";;\t\tGenerated twin insn : %d/rec%d\n",
3401 INSN_UID (twin), rec->index);
3403 twins = alloc_INSN_LIST (twin, twins);
3405 /* Add dependences between TWIN and all appropriate
3406 instructions from REC. */
3407 FOR_EACH_DEP (insn, SD_LIST_SPEC_BACK, sd_it, dep)
3409 rtx pro = DEP_PRO (dep);
3411 gcc_assert (DEP_TYPE (dep) == REG_DEP_TRUE);
3413 /* INSN might depend on instructions from several recovery
3414 blocks. At this iteration we process those producers that
3415 reside in REC. */
3416 if (BLOCK_FOR_INSN (pro) == rec)
3418 dep_def _new_dep, *new_dep = &_new_dep;
3420 init_dep (new_dep, pro, twin, REG_DEP_TRUE);
3421 sd_add_dep (new_dep, false);
3425 process_insn_forw_deps_be_in_spec (insn, twin, ts);
3427 /* Remove all dependencies between INSN and insns in REC. */
3428 for (sd_it = sd_iterator_start (insn, SD_LIST_SPEC_BACK);
3429 sd_iterator_cond (&sd_it, &dep);)
3431 rtx pro = DEP_PRO (dep);
3433 if (BLOCK_FOR_INSN (pro) == rec)
3434 sd_delete_dep (sd_it);
3435 else
3436 sd_iterator_next (&sd_it);
3440 /* We couldn't have added the dependencies between INSN and TWINS earlier
3441 because that would make TWINS appear in the INSN_BACK_DEPS (INSN). */
3442 while (twins)
3444 rtx twin;
3446 twin = XEXP (twins, 0);
3449 dep_def _new_dep, *new_dep = &_new_dep;
3451 init_dep (new_dep, insn, twin, REG_DEP_OUTPUT);
3452 sd_add_dep (new_dep, false);
3455 twin = XEXP (twins, 1);
3456 free_INSN_LIST_node (twins);
3457 twins = twin;
3460 calc_priorities (priorities_roots);
3461 VEC_free (rtx, heap, priorities_roots);
3464 /* Extends the array pointed to by P and fills only the new part with zeros. */
3465 void *
3466 xrecalloc (void *p, size_t new_nmemb, size_t old_nmemb, size_t size)
3468 gcc_assert (new_nmemb >= old_nmemb);
3469 p = XRESIZEVAR (void, p, new_nmemb * size);
3470 memset (((char *) p) + old_nmemb * size, 0, (new_nmemb - old_nmemb) * size);
3471 return p;
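/* Worked example for xrecalloc (the numbers are illustrative and not taken
   from any caller): with OLD_NMEMB == 4, NEW_NMEMB == 10 and SIZE == 8,
   the block is resized to 10 * 8 == 80 bytes and the memset clears
   (10 - 4) * 8 == 48 bytes starting at byte offset 4 * 8 == 32. The first
   32 bytes keep their old contents, assuming XRESIZEVAR preserves them the
   way realloc does. A call would look like
     vec = xrecalloc (vec, new_n, old_n, sizeof (*vec));
   where VEC, NEW_N and OLD_N are hypothetical names. */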
3474 /* Return the probability of speculation success for the speculation
3475 status DS. */
3476 static dw_t
3477 dep_weak (ds_t ds)
3479 ds_t res = 1, dt;
3480 int n = 0;
3482 dt = FIRST_SPEC_TYPE;
3483 do
3485 if (ds & dt)
3487 res *= (ds_t) get_dep_weak (ds, dt);
3488 n++;
3491 if (dt == LAST_SPEC_TYPE)
3492 break;
3493 dt <<= SPEC_TYPE_SHIFT;
3495 while (1);
3497 gcc_assert (n);
3498 while (--n)
3499 res /= MAX_DEP_WEAK;
3501 if (res < MIN_DEP_WEAK)
3502 res = MIN_DEP_WEAK;
3504 gcc_assert (res <= MAX_DEP_WEAK);
3506 return (dw_t) res;
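/* Worked example for dep_weak, using hypothetical values that are not the
   real constants: suppose MAX_DEP_WEAK were 100 and DS carried two
   speculation types whose weaknesses, as returned by get_dep_weak, were
   80 and 50. The loop multiplies them, giving res == 4000 and n == 2;
   the `while (--n)' loop then divides once by MAX_DEP_WEAK, giving 40,
   i.e. 0.8 * 0.5 == 0.4 on the same scale. With a single speculation
   type, n == 1, no division happens, and the weakness is returned
   unchanged (subject to the MIN_DEP_WEAK clamp). */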
3509 /* Helper function.
3510 Find fallthru edge from PRED. */
3511 static edge
3512 find_fallthru_edge (basic_block pred)
3514 edge e;
3515 edge_iterator ei;
3516 basic_block succ;
3518 succ = pred->next_bb;
3519 gcc_assert (succ->prev_bb == pred);
3521 if (EDGE_COUNT (pred->succs) <= EDGE_COUNT (succ->preds))
3523 FOR_EACH_EDGE (e, ei, pred->succs)
3524 if (e->flags & EDGE_FALLTHRU)
3526 gcc_assert (e->dest == succ);
3527 return e;
3530 else
3532 FOR_EACH_EDGE (e, ei, succ->preds)
3533 if (e->flags & EDGE_FALLTHRU)
3535 gcc_assert (e->src == pred);
3536 return e;
3540 return NULL;
3543 /* Initialize BEFORE_RECOVERY variable. */
3544 static void
3545 init_before_recovery (void)
3547 basic_block last;
3548 edge e;
3550 last = EXIT_BLOCK_PTR->prev_bb;
3551 e = find_fallthru_edge (last);
3553 if (e)
3555 /* We create two basic blocks:
3556 1. Single instruction block is inserted right after E->SRC
3557 and has jump to
3558 2. Empty block right before EXIT_BLOCK.
3559 Between these two blocks recovery blocks will be emitted. */
3561 basic_block single, empty;
3562 rtx x, label;
3564 single = create_empty_bb (last);
3565 empty = create_empty_bb (single);
3567 single->count = last->count;
3568 empty->count = last->count;
3569 single->frequency = last->frequency;
3570 empty->frequency = last->frequency;
3571 BB_COPY_PARTITION (single, last);
3572 BB_COPY_PARTITION (empty, last);
3574 redirect_edge_succ (e, single);
3575 make_single_succ_edge (single, empty, 0);
3576 make_single_succ_edge (empty, EXIT_BLOCK_PTR,
3577 EDGE_FALLTHRU | EDGE_CAN_FALLTHRU);
3579 label = block_label (empty);
3580 x = emit_jump_insn_after (gen_jump (label), BB_END (single));
3581 JUMP_LABEL (x) = label;
3582 LABEL_NUSES (label)++;
3583 extend_global (x);
3585 emit_barrier_after (x);
3587 add_block (empty, 0);
3588 add_block (single, 0);
3590 before_recovery = single;
3592 if (sched_verbose >= 2 && spec_info->dump)
3593 fprintf (spec_info->dump,
3594 ";;\t\tFixed fallthru to EXIT : %d->>%d->%d->>EXIT\n",
3595 last->index, single->index, empty->index);
3597 else
3598 before_recovery = last;
3601 /* Returns new recovery block. */
3602 static basic_block
3603 create_recovery_block (void)
3605 rtx label;
3606 rtx barrier;
3607 basic_block rec;
3609 haifa_recovery_bb_recently_added_p = true;
3610 haifa_recovery_bb_ever_added_p = true;
3612 if (!before_recovery)
3613 init_before_recovery ();
3615 barrier = get_last_bb_insn (before_recovery);
3616 gcc_assert (BARRIER_P (barrier));
3618 label = emit_label_after (gen_label_rtx (), barrier);
3620 rec = create_basic_block (label, label, before_recovery);
3622 /* A recovery block always ends with an unconditional jump. */
3623 emit_barrier_after (BB_END (rec));
3625 if (BB_PARTITION (before_recovery) != BB_UNPARTITIONED)
3626 BB_SET_PARTITION (rec, BB_COLD_PARTITION);
3628 if (sched_verbose && spec_info->dump)
3629 fprintf (spec_info->dump, ";;\t\tGenerated recovery block rec%d\n",
3630 rec->index);
3632 before_recovery = rec;
3634 return rec;
3637 /* This function creates recovery code for INSN. If MUTATE_P is nonzero,
3638 INSN is a simple check that should be converted to a branchy one. */
3639 static void
3640 create_check_block_twin (rtx insn, bool mutate_p)
3642 basic_block rec;
3643 rtx label, check, twin;
3644 ds_t fs;
3645 sd_iterator_def sd_it;
3646 dep_t dep;
3647 dep_def _new_dep, *new_dep = &_new_dep;
3649 gcc_assert (ORIG_PAT (insn)
3650 && (!mutate_p
3651 || (IS_SPECULATION_SIMPLE_CHECK_P (insn)
3652 && !(TODO_SPEC (insn) & SPECULATIVE))));
3654 /* Create recovery block. */
3655 if (mutate_p || targetm.sched.needs_block_p (insn))
3657 rec = create_recovery_block ();
3658 label = BB_HEAD (rec);
3660 else
3662 rec = EXIT_BLOCK_PTR;
3663 label = 0;
3666 /* Emit CHECK. */
3667 check = targetm.sched.gen_check (insn, label, mutate_p);
3669 if (rec != EXIT_BLOCK_PTR)
3671 /* To have mem_reg alive at the beginning of second_bb,
3672 we emit check BEFORE insn, so that after splitting,
3673 insn will be at the beginning of second_bb, which will
3674 provide us with the correct life information. */
3675 check = emit_jump_insn_before (check, insn);
3676 JUMP_LABEL (check) = label;
3677 LABEL_NUSES (label)++;
3679 else
3680 check = emit_insn_before (check, insn);
3682 /* Extend data structures. */
3683 extend_all (check);
3684 RECOVERY_BLOCK (check) = rec;
3686 if (sched_verbose && spec_info->dump)
3687 fprintf (spec_info->dump, ";;\t\tGenerated check insn : %s\n",
3688 (*current_sched_info->print_insn) (check, 0));
3690 gcc_assert (ORIG_PAT (insn));
3692 /* Initialize TWIN (twin is a duplicate of original instruction
3693 in the recovery block). */
3694 if (rec != EXIT_BLOCK_PTR)
3696 sd_iterator_def sd_it;
3697 dep_t dep;
3699 FOR_EACH_DEP (insn, SD_LIST_RES_BACK, sd_it, dep)
3700 if ((DEP_STATUS (dep) & DEP_OUTPUT) != 0)
3702 struct _dep _dep2, *dep2 = &_dep2;
3704 init_dep (dep2, DEP_PRO (dep), check, REG_DEP_TRUE);
3706 sd_add_dep (dep2, true);
3709 twin = emit_insn_after (ORIG_PAT (insn), BB_END (rec));
3710 extend_global (twin);
3712 if (sched_verbose && spec_info->dump)
3713 /* INSN_BB (insn) isn't determined for twin insns yet.
3714 So we can't use current_sched_info->print_insn. */
3715 fprintf (spec_info->dump, ";;\t\tGenerated twin insn : %d/rec%d\n",
3716 INSN_UID (twin), rec->index);
3718 else
3720 ORIG_PAT (check) = ORIG_PAT (insn);
3721 HAS_INTERNAL_DEP (check) = 1;
3722 twin = check;
3723 /* ??? We probably should change all OUTPUT dependencies to
3724 (TRUE | OUTPUT). */
3727 /* Copy all resolved back dependencies of INSN to TWIN. This will
3728 provide correct value for INSN_TICK (TWIN). */
3729 sd_copy_back_deps (twin, insn, true);
3731 if (rec != EXIT_BLOCK_PTR)
3732 /* In case of branchy check, fix CFG. */
3734 basic_block first_bb, second_bb;
3735 rtx jump;
3736 edge e;
3737 int edge_flags;
3739 first_bb = BLOCK_FOR_INSN (check);
3740 e = split_block (first_bb, check);
3741 /* split_block emits note if *check == BB_END. Probably it
3742 is better to rip that note off. */
3743 gcc_assert (e->src == first_bb);
3744 second_bb = e->dest;
3746 /* Fix the incoming edge. */
3747 /* ??? Which other flags should be specified? */
3748 if (BB_PARTITION (first_bb) != BB_PARTITION (rec))
3749 /* Partition type is the same, if it is "unpartitioned". */
3750 edge_flags = EDGE_CROSSING;
3751 else
3752 edge_flags = 0;
3754 e = make_edge (first_bb, rec, edge_flags);
3756 add_block (second_bb, first_bb);
3758 gcc_assert (NOTE_INSN_BASIC_BLOCK_P (BB_HEAD (second_bb)));
3759 label = block_label (second_bb);
3760 jump = emit_jump_insn_after (gen_jump (label), BB_END (rec));
3761 JUMP_LABEL (jump) = label;
3762 LABEL_NUSES (label)++;
3763 extend_global (jump);
3765 if (BB_PARTITION (second_bb) != BB_PARTITION (rec))
3766 /* Partition type is the same, if it is "unpartitioned". */
3768 /* Rewritten from cfgrtl.c. */
3769 if (flag_reorder_blocks_and_partition
3770 && targetm.have_named_sections
3771 /*&& !any_condjump_p (jump)*/)
3772 /* any_condjump_p (jump) == false.
3773 We don't need the same note for the check because
3774 any_condjump_p (check) == true. */
3776 REG_NOTES (jump) = gen_rtx_EXPR_LIST (REG_CROSSING_JUMP,
3777 NULL_RTX,
3778 REG_NOTES (jump));
3780 edge_flags = EDGE_CROSSING;
3782 else
3783 edge_flags = 0;
3785 make_single_succ_edge (rec, second_bb, edge_flags);
3787 add_block (rec, EXIT_BLOCK_PTR);
3790 /* Move backward dependences from INSN to CHECK and
3791 move forward dependences from INSN to TWIN. */
3793 /* First, create dependencies between INSN's producers and CHECK & TWIN. */
3794 FOR_EACH_DEP (insn, SD_LIST_BACK, sd_it, dep)
3796 rtx pro = DEP_PRO (dep);
3797 ds_t ds;
3799 /* If BEGIN_DATA: [insn ~~TRUE~~> producer]:
3800 check --TRUE--> producer ??? or ANTI ???
3801 twin --TRUE--> producer
3802 twin --ANTI--> check
3804 If BEGIN_CONTROL: [insn ~~ANTI~~> producer]:
3805 check --ANTI--> producer
3806 twin --ANTI--> producer
3807 twin --ANTI--> check
3809 If BE_IN_SPEC: [insn ~~TRUE~~> producer]:
3810 check ~~TRUE~~> producer
3811 twin ~~TRUE~~> producer
3812 twin --ANTI--> check */
3814 ds = DEP_STATUS (dep);
3816 if (ds & BEGIN_SPEC)
3818 gcc_assert (!mutate_p);
3819 ds &= ~BEGIN_SPEC;
3822 init_dep_1 (new_dep, pro, check, DEP_TYPE (dep), ds);
3823 sd_add_dep (new_dep, false);
3825 if (rec != EXIT_BLOCK_PTR)
3827 DEP_CON (new_dep) = twin;
3828 sd_add_dep (new_dep, false);
3832 /* Second, remove backward dependencies of INSN. */
3833 for (sd_it = sd_iterator_start (insn, SD_LIST_SPEC_BACK);
3834 sd_iterator_cond (&sd_it, &dep);)
3836 if ((DEP_STATUS (dep) & BEGIN_SPEC)
3837 || mutate_p)
3838 /* We can delete this dep because we overcome it with
3839 BEGIN_SPECULATION. */
3840 sd_delete_dep (sd_it);
3841 else
3842 sd_iterator_next (&sd_it);
3845 /* Future Speculations. Determine what BE_IN speculations will be like. */
3846 fs = 0;
3848 /* Fields (DONE_SPEC (x) & BEGIN_SPEC) and CHECK_SPEC (x) are set only
3849 here. */
3851 gcc_assert (!DONE_SPEC (insn));
3853 if (!mutate_p)
3855 ds_t ts = TODO_SPEC (insn);
3857 DONE_SPEC (insn) = ts & BEGIN_SPEC;
3858 CHECK_SPEC (check) = ts & BEGIN_SPEC;
3860 /* Luckiness of future speculations solely depends upon initial
3861 BEGIN speculation. */
3862 if (ts & BEGIN_DATA)
3863 fs = set_dep_weak (fs, BE_IN_DATA, get_dep_weak (ts, BEGIN_DATA));
3864 if (ts & BEGIN_CONTROL)
3865 fs = set_dep_weak (fs, BE_IN_CONTROL,
3866 get_dep_weak (ts, BEGIN_CONTROL));
3868 else
3869 CHECK_SPEC (check) = CHECK_SPEC (insn);
3871 /* Future speculations: call the helper. */
3872 process_insn_forw_deps_be_in_spec (insn, twin, fs);
3874 if (rec != EXIT_BLOCK_PTR)
3876 /* Which types of dependencies we should use here is,
3877 in general, a machine-dependent question... But, for now,
3878 it is not. */
3880 if (!mutate_p)
3882 init_dep (new_dep, insn, check, REG_DEP_TRUE);
3883 sd_add_dep (new_dep, false);
3885 init_dep (new_dep, insn, twin, REG_DEP_OUTPUT);
3886 sd_add_dep (new_dep, false);
3888 else
3890 if (spec_info->dump)
3891 fprintf (spec_info->dump, ";;\t\tRemoved simple check : %s\n",
3892 (*current_sched_info->print_insn) (insn, 0));
3894 /* Remove all dependencies of the INSN. */
3896 sd_it = sd_iterator_start (insn, (SD_LIST_FORW
3897 | SD_LIST_BACK
3898 | SD_LIST_RES_BACK));
3899 while (sd_iterator_cond (&sd_it, &dep))
3900 sd_delete_dep (sd_it);
3903 /* If former check (INSN) already was moved to the ready (or queue)
3904 list, add new check (CHECK) there too. */
3905 if (QUEUE_INDEX (insn) != QUEUE_NOWHERE)
3906 try_ready (check);
3908 /* Remove old check from instruction stream and free its
3909 data. */
3910 sched_remove_insn (insn);
3913 init_dep (new_dep, check, twin, REG_DEP_ANTI);
3914 sd_add_dep (new_dep, false);
3916 else
3918 init_dep_1 (new_dep, insn, check, REG_DEP_TRUE, DEP_TRUE | DEP_OUTPUT);
3919 sd_add_dep (new_dep, false);
3922 if (!mutate_p)
3923 /* Fix priorities. If MUTATE_P is nonzero, this is not necessary,
3924 because it'll be done later in add_to_speculative_block. */
3926 rtx_vec_t priorities_roots = NULL;
3928 clear_priorities (twin, &priorities_roots);
3929 calc_priorities (priorities_roots);
3930 VEC_free (rtx, heap, priorities_roots);
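/* A sketch of the CFG that the branchy-check path of
   create_check_block_twin produces, as read from its calls to
   split_block, make_edge and make_single_succ_edge:

       first_bb:  ... ; CHECK ------- branch to LABEL ------>  rec: TWIN ; jump
          |                                                          |
          | fallthru edge from split_block                           |
          v                                                          |
       second_bb: INSN ; ...  <--------------------------------------+

   CHECK ends FIRST_BB and REC rejoins SECOND_BB through the unconditional
   jump emitted at BB_END (rec). INSN starts SECOND_BB only in the
   !MUTATE_P case; when MUTATE_P is set, INSN is the old simple check and
   is removed from the instruction stream. */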
3934 /* Removes dependencies between instructions in the recovery block REC
3935 and usual region instructions. It keeps inner dependences so it
3936 won't be necessary to recompute them. */
3937 static void
3938 fix_recovery_deps (basic_block rec)
3940 rtx note, insn, jump, ready_list = 0;
3941 bitmap_head in_ready;
3942 rtx link;
3944 bitmap_initialize (&in_ready, 0);
3946 /* NOTE - a basic block note. */
3947 note = NEXT_INSN (BB_HEAD (rec));
3948 gcc_assert (NOTE_INSN_BASIC_BLOCK_P (note));
3949 insn = BB_END (rec);
3950 gcc_assert (JUMP_P (insn));
3951 insn = PREV_INSN (insn);
3953 do
3955 sd_iterator_def sd_it;
3956 dep_t dep;
3958 for (sd_it = sd_iterator_start (insn, SD_LIST_FORW);
3959 sd_iterator_cond (&sd_it, &dep);)
3961 rtx consumer = DEP_CON (dep);
3963 if (BLOCK_FOR_INSN (consumer) != rec)
3965 sd_delete_dep (sd_it);
3967 if (!bitmap_bit_p (&in_ready, INSN_LUID (consumer)))
3969 ready_list = alloc_INSN_LIST (consumer, ready_list);
3970 bitmap_set_bit (&in_ready, INSN_LUID (consumer));
3973 else
3975 gcc_assert ((DEP_STATUS (dep) & DEP_TYPES) == DEP_TRUE);
3977 sd_iterator_next (&sd_it);
3981 insn = PREV_INSN (insn);
3983 while (insn != note);
3985 bitmap_clear (&in_ready);
3987 /* Try to add instructions to the ready or queue list. */
3988 for (link = ready_list; link; link = XEXP (link, 1))
3989 try_ready (XEXP (link, 0));
3990 free_INSN_LIST_list (&ready_list);
3992 /* Fixing jump's dependences. */
3993 insn = BB_HEAD (rec);
3994 jump = BB_END (rec);
3996 gcc_assert (LABEL_P (insn));
3997 insn = NEXT_INSN (insn);
3999 gcc_assert (NOTE_INSN_BASIC_BLOCK_P (insn));
4000 add_jump_dependencies (insn, jump);
4003 /* Changes pattern of the INSN to NEW_PAT. */
4004 static void
4005 change_pattern (rtx insn, rtx new_pat)
4007 int t;
4009 t = validate_change (insn, &PATTERN (insn), new_pat, 0);
4010 gcc_assert (t);
4011 /* Invalidate INSN_COST, so it'll be recalculated. */
4012 INSN_COST (insn) = -1;
4013 /* Invalidate INSN_TICK, so it'll be recalculated. */
4014 INSN_TICK (insn) = INVALID_TICK;
4015 dfa_clear_single_insn_cache (insn);
4018 /* Return true if INSN can potentially be speculated with type DS. */
4019 bool
4020 sched_insn_is_legitimate_for_speculation_p (const_rtx insn, ds_t ds)
4022 if (HAS_INTERNAL_DEP (insn))
4023 return false;
4025 if (!NONJUMP_INSN_P (insn))
4026 return false;
4028 if (SCHED_GROUP_P (insn))
4029 return false;
4031 if (IS_SPECULATION_CHECK_P (insn))
4032 return false;
4034 if (side_effects_p (PATTERN (insn)))
4035 return false;
4037 if ((ds & BE_IN_SPEC)
4038 && may_trap_p (PATTERN (insn)))
4039 return false;
4041 return true;
4044 /* -1 - can't speculate,
4045 0 - for speculation with REQUEST mode it is OK to use
4046 current instruction pattern,
4047 1 - need to change pattern for *NEW_PAT to be speculative. */
4048 static int
4049 speculate_insn (rtx insn, ds_t request, rtx *new_pat)
4051 gcc_assert (current_sched_info->flags & DO_SPECULATION
4052 && (request & SPECULATIVE)
4053 && sched_insn_is_legitimate_for_speculation_p (insn, request));
4055 if ((request & spec_info->mask) != request)
4056 return -1;
4058 if (request & BE_IN_SPEC
4059 && !(request & BEGIN_SPEC))
4060 return 0;
4062 return targetm.sched.speculate_insn (insn, request & BEGIN_SPEC, new_pat);
4065 /* Print some information about block BB, which starts with HEAD and
4066 ends with TAIL, before scheduling it.
4067 I is zero, if scheduler is about to start with the fresh ebb. */
4068 static void
4069 dump_new_block_header (int i, basic_block bb, rtx head, rtx tail)
4071 if (!i)
4072 fprintf (sched_dump,
4073 ";; ======================================================\n");
4074 else
4075 fprintf (sched_dump,
4076 ";; =====================ADVANCING TO=====================\n");
4077 fprintf (sched_dump,
4078 ";; -- basic block %d from %d to %d -- %s reload\n",
4079 bb->index, INSN_UID (head), INSN_UID (tail),
4080 (reload_completed ? "after" : "before"));
4081 fprintf (sched_dump,
4082 ";; ======================================================\n");
4083 fprintf (sched_dump, "\n");
4086 /* Unlink basic block notes and labels and save them, so they
4087 can be easily restored. We unlink basic block notes in EBB to
4088 provide back-compatibility with the previous code, as target backends
4089 assume that there will be only instructions between
4090 current_sched_info->{head and tail}. We restore these notes as soon
4091 as we can.
4092 FIRST (LAST) is the first (last) basic block in the ebb.
4093 NB: In the usual case (FIRST == LAST) nothing is really done. */
4094 void
4095 unlink_bb_notes (basic_block first, basic_block last)
4097 /* We DON'T unlink basic block notes of the first block in the ebb. */
4098 if (first == last)
4099 return;
4101 bb_header = xmalloc (last_basic_block * sizeof (*bb_header));
4103 /* Make a sentinel. */
4104 if (last->next_bb != EXIT_BLOCK_PTR)
4105 bb_header[last->next_bb->index] = 0;
4107 first = first->next_bb;
4108 do
4110 rtx prev, label, note, next;
4112 label = BB_HEAD (last);
4113 if (LABEL_P (label))
4114 note = NEXT_INSN (label);
4115 else
4116 note = label;
4117 gcc_assert (NOTE_INSN_BASIC_BLOCK_P (note));
4119 prev = PREV_INSN (label);
4120 next = NEXT_INSN (note);
4121 gcc_assert (prev && next);
4123 NEXT_INSN (prev) = next;
4124 PREV_INSN (next) = prev;
4126 bb_header[last->index] = label;
4128 if (last == first)
4129 break;
4131 last = last->prev_bb;
4133 while (1);
4136 /* Restore basic block notes.
4137 FIRST is the first basic block in the ebb. */
4138 static void
4139 restore_bb_notes (basic_block first)
4141 if (!bb_header)
4142 return;
4144 /* We DON'T unlink basic block notes of the first block in the ebb. */
4145 first = first->next_bb;
4146 /* Remember: FIRST is actually the second basic block in the ebb. */
4148 while (first != EXIT_BLOCK_PTR
4149 && bb_header[first->index])
4151 rtx prev, label, note, next;
4153 label = bb_header[first->index];
4154 prev = PREV_INSN (label);
4155 next = NEXT_INSN (prev);
4157 if (LABEL_P (label))
4158 note = NEXT_INSN (label);
4159 else
4160 note = label;
4161 gcc_assert (NOTE_INSN_BASIC_BLOCK_P (note));
4163 bb_header[first->index] = 0;
4165 NEXT_INSN (prev) = label;
4166 NEXT_INSN (note) = next;
4167 PREV_INSN (next) = note;
4169 first = first->next_bb;
4172 free (bb_header);
4173 bb_header = 0;
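/* Illustration of the splice that unlink_bb_notes performs for one block
   and that restore_bb_notes undoes:

       before:    prev <-> label <-> note <-> next
       unlinked:  prev <-> next          (LABEL saved in bb_header[])
       restored:  prev <-> label <-> note <-> next'

   When the block has no label, LABEL and NOTE are the same insn.
   unlink_bb_notes leaves PREV_INSN (label) untouched, which is how
   restore_bb_notes recovers PREV from the saved LABEL. NEXT' is whatever
   insn follows PREV at restore time, so it need not be the original
   NEXT. */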
4176 /* Extend per basic block data structures of the scheduler.
4177 If BB is NULL, initialize structures for the whole CFG.
4178 Otherwise, initialize them for the just created BB. */
4179 static void
4180 extend_bb (void)
4182 rtx insn;
4184 old_last_basic_block = last_basic_block;
4186 /* The following is done to keep current_sched_info->next_tail non null. */
4188 insn = BB_END (EXIT_BLOCK_PTR->prev_bb);
4189 if (NEXT_INSN (insn) == 0
4190 || (!NOTE_P (insn)
4191 && !LABEL_P (insn)
4192 /* Don't emit a NOTE if it would end up before a BARRIER. */
4193 && !BARRIER_P (NEXT_INSN (insn))))
4195 rtx note = emit_note_after (NOTE_INSN_DELETED, insn);
4196 /* Make insn appear outside BB. */
4197 set_block_for_insn (note, NULL);
4198 BB_END (EXIT_BLOCK_PTR->prev_bb) = insn;
4202 /* Add a basic block BB to extended basic block EBB.
4203 If EBB is EXIT_BLOCK_PTR, then BB is a recovery block.
4204 If EBB is NULL, then BB should be a new region. */
4205 void
4206 add_block (basic_block bb, basic_block ebb)
4208 gcc_assert (current_sched_info->flags & NEW_BBS);
4210 extend_bb ();
4212 if (current_sched_info->add_block)
4213 /* This changes only data structures of the front-end. */
4214 current_sched_info->add_block (bb, ebb);
4217 /* Helper function.
4218 Fix CFG after both in- and inter-block movement of
4219 control_flow_insn_p JUMP. */
4220 static void
4221 fix_jump_move (rtx jump)
4223 basic_block bb, jump_bb, jump_bb_next;
4225 bb = BLOCK_FOR_INSN (PREV_INSN (jump));
4226 jump_bb = BLOCK_FOR_INSN (jump);
4227 jump_bb_next = jump_bb->next_bb;
4229 gcc_assert (current_sched_info->flags & SCHED_EBB
4230 || IS_SPECULATION_BRANCHY_CHECK_P (jump));
4232 if (!NOTE_INSN_BASIC_BLOCK_P (BB_END (jump_bb_next)))
4233 /* If jump_bb_next is not empty. */
4234 BB_END (jump_bb) = BB_END (jump_bb_next);
4236 if (BB_END (bb) != PREV_INSN (jump))
4237 /* Then there are instructions after jump that should be placed
4238 in jump_bb_next. */
4239 BB_END (jump_bb_next) = BB_END (bb);
4240 else
4241 /* Otherwise jump_bb_next is empty. */
4242 BB_END (jump_bb_next) = NEXT_INSN (BB_HEAD (jump_bb_next));
4244 /* To make assertion in move_insn happy. */
4245 BB_END (bb) = PREV_INSN (jump);
4247 update_bb_for_insn (jump_bb_next);
4250 /* Fix CFG after interblock movement of control_flow_insn_p JUMP. */
4251 static void
4252 move_block_after_check (rtx jump)
4254 basic_block bb, jump_bb, jump_bb_next;
4255 VEC(edge,gc) *t;
4257 bb = BLOCK_FOR_INSN (PREV_INSN (jump));
4258 jump_bb = BLOCK_FOR_INSN (jump);
4259 jump_bb_next = jump_bb->next_bb;
4261 update_bb_for_insn (jump_bb);
4263 gcc_assert (IS_SPECULATION_CHECK_P (jump)
4264 || IS_SPECULATION_CHECK_P (BB_END (jump_bb_next)));
4266 unlink_block (jump_bb_next);
4267 link_block (jump_bb_next, bb);
4269 t = bb->succs;
4270 bb->succs = 0;
4271 move_succs (&(jump_bb->succs), bb);
4272 move_succs (&(jump_bb_next->succs), jump_bb);
4273 move_succs (&t, jump_bb_next);
4275 df_mark_solutions_dirty ();
4277 if (current_sched_info->fix_recovery_cfg)
4278 current_sched_info->fix_recovery_cfg
4279 (bb->index, jump_bb->index, jump_bb_next->index);
4282 /* Helper function for move_block_after_check.
4283 This function attaches the edge vector pointed to by SUCCSP to
4284 block TO. */
4285 static void
4286 move_succs (VEC(edge,gc) **succsp, basic_block to)
4288 edge e;
4289 edge_iterator ei;
4291 gcc_assert (to->succs == 0);
4293 to->succs = *succsp;
4295 FOR_EACH_EDGE (e, ei, to->succs)
4296 e->src = to;
4298 *succsp = 0;
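/* The three move_succs calls in move_block_after_check rotate the
   successor vectors. Writing S(x) for the successor vector that block x
   had on entry:

       bb->succs           = S(jump_bb)
       jump_bb->succs      = S(jump_bb_next)
       jump_bb_next->succs = S(bb)

   The temporary T holds S(bb) while bb->succs is cleared, and each call
   empties its source vector, so the to->succs == 0 assertion in
   move_succs holds for every call in turn. */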
4301 /* Remove INSN from the instruction stream.
4302 INSN should not have any dependencies. */
4303 static void
4304 sched_remove_insn (rtx insn)
4306 sd_finish_insn (insn);
4308 change_queue_index (insn, QUEUE_NOWHERE);
4309 current_sched_info->add_remove_insn (insn, 1);
4310 remove_insn (insn);
4313 /* Clear priorities of all instructions, that are forward dependent on INSN.
4314 Store in vector pointed to by ROOTS_PTR insns on which priority () should
4315 be invoked to initialize all cleared priorities. */
4316 static void
4317 clear_priorities (rtx insn, rtx_vec_t *roots_ptr)
4319 sd_iterator_def sd_it;
4320 dep_t dep;
4321 bool insn_is_root_p = true;
4323 gcc_assert (QUEUE_INDEX (insn) != QUEUE_SCHEDULED);
4325 FOR_EACH_DEP (insn, SD_LIST_BACK, sd_it, dep)
4327 rtx pro = DEP_PRO (dep);
4329 if (INSN_PRIORITY_STATUS (pro) >= 0
4330 && QUEUE_INDEX (insn) != QUEUE_SCHEDULED)
4332 /* If DEP doesn't contribute to priority then INSN itself should
4333 be added to priority roots. */
4334 if (contributes_to_priority_p (dep))
4335 insn_is_root_p = false;
4337 INSN_PRIORITY_STATUS (pro) = -1;
4338 clear_priorities (pro, roots_ptr);
4342 if (insn_is_root_p)
4343 VEC_safe_push (rtx, heap, *roots_ptr, insn);
4346 /* Recompute priorities of instructions, whose priorities might have been
4347 changed. ROOTS is a vector of instructions whose priority computation will
4348 trigger initialization of all cleared priorities. */
4349 static void
4350 calc_priorities (rtx_vec_t roots)
4352 int i;
4353 rtx insn;
4355 for (i = 0; VEC_iterate (rtx, roots, i, insn); i++)
4356 priority (insn);
4360 /* Add dependences between JUMP and other instructions in the recovery
4361 block. INSN is the first insn of the recovery block. */
4362 static void
4363 add_jump_dependencies (rtx insn, rtx jump)
4365 do
4367 insn = NEXT_INSN (insn);
4368 if (insn == jump)
4369 break;
4371 if (sd_lists_empty_p (insn, SD_LIST_FORW))
4373 dep_def _new_dep, *new_dep = &_new_dep;
4375 init_dep (new_dep, insn, jump, REG_DEP_ANTI);
4376 sd_add_dep (new_dep, false);
4379 while (1);
4381 gcc_assert (!sd_lists_empty_p (jump, SD_LIST_BACK));
4384 /* Return the NOTE_INSN_BASIC_BLOCK of BB. */
4385 rtx
4386 bb_note (basic_block bb)
4388 rtx note;
4390 note = BB_HEAD (bb);
4391 if (LABEL_P (note))
4392 note = NEXT_INSN (note);
4394 gcc_assert (NOTE_INSN_BASIC_BLOCK_P (note));
4395 return note;
4398 #ifdef ENABLE_CHECKING
4399 /* Helper function for check_cfg.
4400 Return nonzero if the edge vector pointed to by EL has an edge with
4401 TYPE in its flags. */
4402 static int
4403 has_edge_p (VEC(edge,gc) *el, int type)
4405 edge e;
4406 edge_iterator ei;
4408 FOR_EACH_EDGE (e, ei, el)
4409 if (e->flags & type)
4410 return 1;
4411 return 0;
4414 /* Check a few properties of the CFG between HEAD and TAIL.
4415 If HEAD (TAIL) is NULL, check from the beginning (till the end) of the
4416 instruction stream. */
4417 static void
4418 check_cfg (rtx head, rtx tail)
4420 rtx next_tail;
4421 basic_block bb = 0;
4422 int not_first = 0, not_last;
4424 if (head == NULL)
4425 head = get_insns ();
4426 if (tail == NULL)
4427 tail = get_last_insn ();
4428 next_tail = NEXT_INSN (tail);
4430 do
4432 not_last = head != tail;
4434 if (not_first)
4435 gcc_assert (NEXT_INSN (PREV_INSN (head)) == head);
4436 if (not_last)
4437 gcc_assert (PREV_INSN (NEXT_INSN (head)) == head);
4439 if (LABEL_P (head)
4440 || (NOTE_INSN_BASIC_BLOCK_P (head)
4441 && (!not_first
4442 || (not_first && !LABEL_P (PREV_INSN (head))))))
4444 gcc_assert (bb == 0);
4445 bb = BLOCK_FOR_INSN (head);
4446 if (bb != 0)
4447 gcc_assert (BB_HEAD (bb) == head);
4448 else
4449 /* This is the case of a jump table. See inside_basic_block_p (). */
4450 gcc_assert (LABEL_P (head) && !inside_basic_block_p (head));
4453 if (bb == 0)
4455 gcc_assert (!inside_basic_block_p (head));
4456 head = NEXT_INSN (head);
4458 else
4460 gcc_assert (inside_basic_block_p (head)
4461 || NOTE_P (head));
4462 gcc_assert (BLOCK_FOR_INSN (head) == bb);
4464 if (LABEL_P (head))
4466 head = NEXT_INSN (head);
4467 gcc_assert (NOTE_INSN_BASIC_BLOCK_P (head));
4469 else
4471 if (control_flow_insn_p (head))
4473 gcc_assert (BB_END (bb) == head);
4475 if (any_uncondjump_p (head))
4476 gcc_assert (EDGE_COUNT (bb->succs) == 1
4477 && BARRIER_P (NEXT_INSN (head)));
4478 else if (any_condjump_p (head))
4479 gcc_assert (/* Usual case. */
4480 (EDGE_COUNT (bb->succs) > 1
4481 && !BARRIER_P (NEXT_INSN (head)))
4482 /* Or jump to the next instruction. */
4483 || (EDGE_COUNT (bb->succs) == 1
4484 && (BB_HEAD (EDGE_I (bb->succs, 0)->dest)
4485 == JUMP_LABEL (head))));
4487 if (BB_END (bb) == head)
4489 if (EDGE_COUNT (bb->succs) > 1)
4490 gcc_assert (control_flow_insn_p (head)
4491 || has_edge_p (bb->succs, EDGE_COMPLEX));
4492 bb = 0;
4495 head = NEXT_INSN (head);
4499 not_first = 1;
4501 while (head != next_tail);
4503 gcc_assert (bb == 0);
4505 #endif /* ENABLE_CHECKING */
4507 #endif /* INSN_SCHEDULING */