1 2021-11-29 Uroš Bizjak <ubizjak@gmail.com>
4 * config/i386/i386.md (*movhi_internal): Introduce (*v,C) alternative.
5 Do not allocate non-GPR registers. Optimize xmm->xmm moves when
6 optimizing for size. Fix vpinsrw insn template.
7 (*movhf_internal): Fix pinsrw and pextrw insn templates for
8 AVX targets. Use sselog1 type instead of sselog. Optimize GPR moves.
9 Optimize xmm->xmm moves for TARGET_SSE_PARTIAL_REG_DEPENDENCY
10 and TARGET_SSE_SPLIT_REGS targets.
12 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
14 * configure: Re-regenerate.
16 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
19 * Makefile.in: Substitute CTAGS, ETAGS, and CSCOPE
20 variables. Use ETAGS variable in TAGS target.
21 * configure: Regenerate.
22 * configure.ac: Allow CTAGS, ETAGS, and CSCOPE
23 variables to be overridden.
25 2021-11-29 Paul A. Clarke <pc@us.ibm.com>
27 * config/rs6000/xmmintrin.h (_mm_movemask_ps): Use vec_extractm
29 * config/rs6000/emmintrin.h (_mm_movemask_pd): Likewise.
30 (_mm_movemask_epi8): Likewise.
32 2021-11-29 Richard Biener <rguenther@suse.de>
34 * read-rtl-function.c (function_reader::read_rtx_operand):
35 Return only after resetting m_in_call_function_usage.
37 2021-11-29 Roger Sayle <roger@nextmovesoftware.com>
39 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti): Perform the
40 conversion via V2DImode using vec_extractv2didi on TARGET_SSE2.
41 * config/i386/sse.md (rotlv1ti3, rotrv1ti3): Change constraint
42 on QImode shift amounts from const_int_operand to general_operand.
44 2021-11-29 Richard Biener <rguenther@suse.de>
46 * tree.h (reverse_storage_order_for_component_p): Remove
47 spurious gcc_unreachable.
48 * cfganal.c (dfs_find_deadend): Likewise.
49 * fold-const-call.c (fold_const_logb): Likewise.
50 (fold_const_significand): Likewise.
51 * gimple-ssa-store-merging.c (lhs_valid_for_store_merging_p):
54 2021-11-29 Richard Biener <rguenther@suse.de>
56 * vec.c (qsort_chk): Do not return the void return value
57 from the noreturn qsort_chk_error.
58 * ccmp.c (expand_ccmp_expr_1): Remove unreachable return.
59 * df-scan.c (df_ref_equal_p): Likewise.
60 * dwarf2out.c (is_base_type): Likewise.
61 (add_const_value_attribute): Likewise.
62 * fixed-value.c (fixed_arithmetic): Likewise.
63 * gimple-fold.c (gimple_fold_builtin_fputs): Likewise.
64 * gimple-ssa-strength-reduction.c (stmt_cost): Likewise.
65 * graphite-isl-ast-to-gimple.c
66 (gcc_expression_from_isl_expr_op): Likewise.
67 (gcc_expression_from_isl_expression): Likewise.
68 * ipa-fnsummary.c (will_be_nonconstant_expr_predicate):
70 * lto-streamer-in.c (lto_input_mode_table): Likewise.
72 2021-11-29 liuhongt <hongtao.liu@intel.com>
75 * config/i386/i386.c (inline_secondary_memory_needed): HImode
76 move between GPR and SSE registers is supported under
77 TARGET_SSE2 and above.
78 * config/i386/i386.md (extendhfsf2): Optimize expander.
80 * config/i386/sse.md (sse2p4_1): Adjust attr for V8HFmode to
83 2021-11-29 liuhongt <hongtao.liu@intel.com>
87 * config/i386/i386.c (ix86_secondary_reload): Without
88 TARGET_SSE4_1, General register is needed to move HImode from
89 sse register to memory.
90 * config/i386/sse.md (*vec_extrachf): Use %vpextrw instead of
91 pextrw in output templates.
92 * config/i386/i386.md (movhi_internal): Ditto, also fix typo of
93 MEM_P (operands[1]) and adjust mode/prefix/type attribute for
94 alternatives related to sse register.
96 2021-11-29 Richard Biener <rguenther@suse.de>
98 PR tree-optimization/103458
99 * tree-ssa-dce.c (make_forwarders_with_degenerate_phis): Do not
100 create forwarders for blocks with abnormal predecessors.
102 2021-11-29 Richard Biener <rguenther@suse.de>
104 * gimple-predicate-analysis.cc (can_be_invalidated_p):
105 Restore semantics to the one before the split from
108 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
110 * profile.c: Include sreal.h
111 (struct bb_stats): New.
112 (cmp_stats): New function.
113 (compute_branch_probabilities): Output bb stats.
115 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
117 * cfghooks.c: Include sreal.h, profile.h.
118 (profile_record_check_consistency): Fix checking of count counsistency;
119 record also dynamic mismatches.
120 * cfgrtl.c (rtl_account_profile_record): Similarly.
121 * tree-cfg.c (gimple_account_profile_record): Likewise.
122 * cfghooks.h (struct profile_record): Remove num_mismatched_freq_in,
123 num_mismatched_freq_out, turn time to double, add
124 dyn_mismatched_prob_out, dyn_mismatched_count_in,
125 num_mismatched_prob_out; remove num_mismatched_count_out.
126 * passes.c (account_profile_1): New function.
127 (account_profile_in_list): New function.
128 (pass_manager::dump_profile_report): Rewrite.
129 (execute_one_ipa_transform_pass): Check profile consistency after
131 (execute_all_ipa_transforms): Remove cfun test; record all transform
133 (execute_one_pass): Fix collecting of profile stats.
135 2021-11-27 Jakub Jelinek <jakub@redhat.com>
138 * config/i386/i386.md (x86_64_shld, x86_shld, x86_64_shrd, x86_shrd):
139 Change insn pattern to accurately describe the instructions.
140 (*x86_64_shld_1, *x86_shld_1, *x86_64_shrd_1, *x86_shrd_1): New
141 define_insn patterns.
142 (*x86_64_shld_2, *x86_shld_2, *x86_64_shrd_2, *x86_shrd_2): New
143 define_insn_and_split patterns.
144 (*ashl<dwi>3_doubleword_mask, *ashl<dwi>3_doubleword_mask_1,
145 *<insn><dwi>3_doubleword_mask, *<insn><dwi>3_doubleword_mask_1,
146 ix86_rotl<dwi>3_doubleword, ix86_rotr<dwi>3_doubleword): Adjust
147 splitters for x86_{,64_}sh{l,r}d pattern changes.
149 2021-11-27 Jakub Jelinek <jakub@redhat.com>
151 PR tree-optimization/103435
152 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Avoid UB if
153 n->range - rsize == 8, just clear both *cmpnop and *cmpxchg in that
156 2021-11-27 Martin Jambor <mjambor@suse.cz>
159 * ipa-prop.c (ipcp_transform_function): Call
160 delete_unreachable_blocks_update_callgraph instead of returning
163 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
165 * tree-object-size.c (object_sizes_grow, object_sizes_release,
166 object_sizes_unknown_p, object_sizes_get, object_size_set_force,
167 object_sizes_set): New functions.
168 (addr_object_size, compute_builtin_object_size,
169 expr_object_size, call_object_size, unknown_object_size,
170 merge_object_sizes, plus_stmt_object_size,
171 cond_expr_object_size, collect_object_sizes_for,
172 check_for_plus_in_loops_1, init_object_sizes,
173 fini_object_sizes): Adjust.
175 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
177 * tree-object-size.c: New enum.
178 (object_sizes, computed, addr_object_size,
179 compute_builtin_object_size, expr_object_size, call_object_size,
180 merge_object_sizes, plus_stmt_object_size,
181 collect_object_sizes_for, init_object_sizes, fini_object_sizes,
182 object_sizes_execute): Replace magic numbers with enums.
184 2021-11-26 Roger Sayle <roger@nextmovesoftware.com>
186 * tree-ssa-loop-ivopts.c (cand_value_at): Take a class
187 tree_niter_desc* argument instead of just a tree for NITER.
188 If we require the iv candidate value at the end of the final
189 loop iteration, try using the original loop bound as the
190 NITER for sufficiently simple loops.
191 (may_eliminate_iv): Update (only) call to cand_value_at.
193 2021-11-26 Jan Hubicka <hubicka@ucw.cz>
195 * ipa-modref.c (analyze_function): Drop parameter F and dump
196 cgraph node name rather than cfun name.
197 (modref_generate): Update.
198 (modref_summaries::insert):Update.
199 (modref_summaries_lto::insert):Update.
200 (pass_modref::execute):Update.
201 (ipa_merge_modref_summary_after_inlining): Improve combining of
204 2021-11-26 Jan Hubicka <jh@suse.cz>
207 * ipa-modref.c (update_escape_summary_1): Fix handling of min_flags.
209 2021-11-26 konglin1 <lingling.kong@intel.com>
212 * config/i386/i386.c (ix86_can_change_mode_class): Allow 16 bit data in XMM register
214 * config/i386/i386.md (extendhfsf2): Add extenndhfsf2 for TARGET_F16C.
215 (extendhfdf2): Restrict extendhfdf for TARGET_AVX512FP16 only.
216 (*extendhf<mode>2): Rename from extendhf<mode>2.
217 (truncsfhf2): Likewise.
218 (truncdfhf2): Likewise.
219 (*trunc<mode>2): Likewise.
221 2021-11-26 liuhongt <hongtao.liu@intel.com>
224 * match.pd: Fix typo, use the type of second parameter, not
227 2021-11-25 Jan Hubicka <jh@suse.cz>
229 * ipa-cp.c (ipa_value_range_from_jfunc): Remove forgotten early return.
231 2021-11-25 Roger Sayle <roger@nextmovesoftware.com>
234 * match.pd (minus @0 @0): Check tree_expr_maybe_infinite_p.
236 2021-11-25 Martin Jambor <mjambor@suse.cz>
239 * ipa-prop.h (ipa_get_param): New overload. Move bits of the existing
241 * ipa-param-manipulation.h (ipa_param_adjustments): New member
242 function get_updated_index_or_split.
243 * ipa-param-manipulation.c
244 (ipa_param_adjustments::get_updated_index_or_split): New function.
245 * ipa-prop.c (adjust_agg_replacement_values): Reimplement, add
246 capability to identify scalarized parameters and perform substitution
248 (ipcp_transform_function): Create descriptors earlier, handle new
249 return values of adjust_agg_replacement_values.
251 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
253 * gimple-range-path.cc (path_range_query::compute_ranges_defined): Remove.
254 (path_range_query::compute_ranges_in_block): Revert to bitmap order.
255 * gimple-range-path.h: Remove compute_ranges_defined.
257 2021-11-25 Andrew Stubbs <ams@codesourcery.com>
260 * config/gcn/gcn.c (move_callee_saved_registers): Ensure that the
261 number of spilled registers is counted correctly.
263 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
265 * ipa-prop.h (ipa_node_params::ipa_node_params): Initialize
268 2021-11-25 Andrew MacLeod <amacleod@redhat.com>
270 PR tree-optimization/103359
271 * gimple-range-fold.cc (fold_using_range::range_of_phi): If arg is
272 equivalent to def, don't initially include it's range.
274 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
276 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not check
279 2021-11-25 Richard Biener <rguenther@suse.de>
281 * cgraphunit.c (symbol_table::output_weakrefs): Remove
283 (get_alias_symbol): Remove now unused function.
285 2021-11-25 Richard Biener <rguenther@suse.de>
287 * cfgrtl.c (rtl_verify_fallthru): Do not stop verifying
289 (skip_insns_after_block): Remove unreachable break and continue.
291 2021-11-25 Richard Biener <rguenther@suse.de>
293 * cfgexpand.c (label_rtx_for_bb): Remove dead loop construct.
295 2021-11-25 Richard Biener <rguenther@suse.de>
297 * regset.h (REG_SET_EMPTY_P): New macro.
298 * cfgcleanup.c (thread_jump): Use REG_SET_EMPTY_P.
300 2021-11-25 Martin Liska <mliska@suse.cz>
302 * doc/invoke.texi: Use @option for -Wuninitialized.
304 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
306 PR tree-optimization/103254
307 * gimple-range-path.cc (path_range_query::compute_ranges): Move
308 exported boolean code...
309 (path_range_query::compute_imports): ...here.
311 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
313 PR tree-optimization/103254
314 * gimple-range-path.cc (path_range_query::compute_ranges_defined): New
315 (path_range_query::compute_ranges_in_block): Move to
316 compute_ranges_defined.
317 * gimple-range-path.h (compute_ranges_defined): New.
319 2021-11-25 Jakub Jelinek <jakub@redhat.com>
321 PR tree-optimization/103417
322 * match.pd ((X & Y) CMP 0): Only handle eq and ne. Commonalize
325 2021-11-25 Jakub Jelinek <jakub@redhat.com>
327 PR tree-optimization/103376
328 * gimple-ssa-store-merging.c (perform_symbolic_merge): For
329 BIT_IOR_EXPR, if masked1 && masked2 && masked1 != masked2, don't
330 punt, but set the corresponding result byte to MARKER_BYTE_UNKNOWN.
331 For BIT_XOR_EXPR similarly and if masked1 == masked2 and the
332 byte isn't MARKER_BYTE_UNKNOWN, set the corresponding result byte to
335 2021-11-25 Jakub Jelinek <jakub@redhat.com>
338 * doc/invoke.texi (-Wcomma-subscript): Document that for
339 -std=c++20 the option isn't enabled by default with -Wno-deprecated
340 but for -std=c++23 it is.
342 2021-11-24 Jan Hubicka <hubicka@ucw.cz>
344 * ipa-modref.c (implicit_eaf_flags_for_edge_and_arg): Break out from...
345 (modref_merge_call_site_flags): ... here.
346 (ipa_merge_modref_summary_after_inlining): Use it.
348 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
350 * cfgloop.c (verify_loop_structure): Reduce scope of
351 'class loop *loop' variable.
352 * ipa-fnsummary.c (analyze_function_body): Likewise.
353 * loop-init.c (fix_loop_structure): Likewise.
354 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
355 * predict.c (predict_loops): Likewise.
356 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
357 * tree-vectorizer.c (pass_vectorize::execute): Likewise.
359 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
361 PR tree-optimization/103231
362 * gimple-range.cc (gimple_ranger::gimple_ranger): Create stmt stack.
363 (gimple_ranger::gimple_ranger): Delete stmt stack.
364 (gimple_ranger::range_of_stmt): Process depenedencies if they have no
366 (gimple_ranger::prefill_name): New.
367 (gimple_ranger::prefill_stmt_dependencies): New.
368 * gimple-range.h (class gimple_ranger): Add prototypes.
370 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
372 * gimple-range-cache.cc (ranger_cache::get_global_range): Always
373 return a range, return if it came from the cache or not.
374 (get_non_stale_global_range): Rename to get_global_range, and return
375 the temporal state in a flag.
376 * gimple-range-cache.h (get_non_stale_global_range): Rename and adjust.
377 * gimple-range.cc (gimple_ranger::range_of_expr): No need to query
379 (gimple_ranger::range_of_stmt): Adjust for global cache temporal state
382 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
384 * gimple-range.cc (gimple_ranger::range_on_edge): Call trailer when
385 a constant is encountered to terminate the trace.
387 2021-11-24 Maciej W. Rozycki <macro@embecosm.com>
390 * reload.c (find_reloads_address_1): Also accept the ASHIFT form
391 of indexed addressing.
392 (find_reloads): Adjust accordingly.
394 2021-11-24 Richard Biener <rguenther@suse.de>
395 Jan Hubicka <jh@suse.cz>
397 PR tree-optimization/103168
398 * ipa-modref.h (struct modref_summary): Add load_accesses.
399 * ipa-modref.c (modref_summary::finalize): Initialize load_accesses.
400 * tree-ssa-sccvn.c (visit_reference_op_call): Use modref
401 info to walk the virtual use->def chain to CSE const/pure
402 function calls possibly reading from memory.
404 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
407 * gimplify.c (gimplify_adjust_omp_clauses): Restore previous
410 2021-11-24 Richard Biener <rguenther@suse.de>
413 * match.pd: Avoid canonicalizing (le/ge @0 @0) to (eq @0 @0)
414 with NaNs and -ftrapping-math.
416 2021-11-24 Jakub Jelinek <jakub@redhat.com>
419 * omp-general.c (omp_context_selector_matches): For ACCEL_COMPILER,
420 return 0 for kind(host) and continue for kind(nohost).
422 2021-11-24 Jakub Jelinek <jakub@redhat.com>
425 * attribs.h (lookup_attribute): Allow attr_name to start with
426 underscore, as long as canonicalize_attr_name returns false.
427 (lookup_attribute_by_prefix): Don't call get_attribute_name twice.
428 * attribs.c (extract_attribute_substring): Reimplement using
429 canonicalize_attr_name.
430 (register_scoped_attribute): Change gcc_assert into
431 gcc_checking_assert, verify !canonicalize_attr_name rather than
432 that str.str doesn't start with '_'.
434 2021-11-24 Jakub Jelinek <jakub@redhat.com>
436 PR tree-optimization/103376
437 * gimple-ssa-store-merging.c (perform_symbolic_merge): Add CODE
438 argument. If CODE is not BIT_IOR_EXPR, ensure that one of masked1
440 (find_bswap_or_nop_1, find_bswap_or_nop,
441 imm_store_chain_info::try_coalesce_bswap): Adjust
442 perform_symbolic_merge callers.
444 2021-11-24 Richard Biener <rguenther@suse.de>
446 * tree-ssa-loop-ivopts.c (find_givs): Take loop body as
447 argument instead of re-computing it.
448 (find_interesting_uses): Likewise.
449 (find_induction_variables): Pass through loop body.
450 (tree_ssa_iv_optimize_loop): Pass down loop body.
452 2021-11-24 Tamar Christina <tamar.christina@arm.com>
454 * tree-ssa-phiopt.c (spaceship_replacement): Handle new canonical
457 2021-11-24 Tamar Christina <tamar.christina@arm.com>
459 * tree.c (bitmask_inv_cst_vector_p): New.
460 * tree.h (bitmask_inv_cst_vector_p): New.
461 * match.pd: Use it in new bitmask compare pattern.
463 2021-11-24 Jason Merrill <jason@redhat.com>
465 * timevar.h (class auto_cond_timevar): New.
467 2021-11-24 Hongtao Liu <hongtao.liu@intel.com>
468 H.J. Lu <hongjiu.lu@intel.com>
470 PR tree-optimization/103194
471 * match.pd (gimple_nop_atomic_bit_test_and_p): Extended to
473 * tree-ssa-ccp.c (gimple_nop_convert): Declare.
474 (optimize_atomic_bit_test_and): Enhance
475 optimize_atomic_bit_test_and to handle truncation.
477 2021-11-23 Martin Sebor <msebor@redhat.com>
480 * Makefile.in (OBJS): Add gimple-warn-recursion.o.
481 * common.opt: Add -Winfinite-recursion.
482 * doc/invoke.texi (-Winfinite-recursion): Document.
483 * passes.def (pass_warn_recursion): Schedule a new pass.
484 * tree-pass.h (make_pass_warn_recursion): Declare.
485 * gimple-warn-recursion.c: New file.
487 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
489 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Change
490 error messages for ENB_P8V and ENB_P9V.
492 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
494 * config/rs6000/rs6000-builtin-new.def: Add power6-64 stanza. Move
495 CMPB to power6-64 stanza.
496 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Handle
498 (rs6000_new_builtin_is_supported): Likewise.
499 (rs6000_expand_new_builtin): Likewise. Clean up formatting.
500 (rs6000_init_builtins): Handle ENB_P6_64 case.
501 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): Add BSTZ_P6_64.
502 (stanza_map): Add entry mapping power6-64 to BSTZ_P6_64.
503 (enable_string): Add "ENB_P6_64".
504 (write_decls): Add ENB_P6_64 to bif_enable enum.
506 2021-11-23 Jan Hubicka <jh@suse.cz>
508 * ipa-modref-tree.h (struct modref_tree): Remove max_bases, max_refs
510 (modref_tree::modref_tree): Remove parametr.
511 (modref_tree::insert_base): Add max_bases parameter.
512 (modref_tree::insert): Add max_bases, max_refs, max_accesses
514 (modref_tree::insert): New member function.
515 (modref_tree::merge): Add max_bases, max_refs, max_accesses
517 (modref_tree::insert): New member function.
518 * ipa-modref-tree.c (test_insert_search_collapse): Update.
519 (test_merge): Update.
520 * ipa-modref.c (dump_records): Don't dump max_refs and max_bases.
521 (dump_lto_records): Likewise.
522 (modref_summary::finalize): Fix whitespace.
523 (get_modref_function_summary): Likewise.
524 (modref_access_analysis::record_access): Update.
525 (modref_access_analysis::record_access_lto): Update.
526 (modref_access_analysis::process_fnspec): Update.
527 (analyze_function): Update.
528 (modref_summaries::duplicate): Update.
529 (modref_summaries_lto::duplicate): Update.
530 (write_modref_records): Update.
531 (read_modref_records): Update.
532 (read_section): Update.
533 (propagate_unknown_call): Update.
534 (modref_propagate_in_scc): Update.
535 (ipa_merge_modref_summary_after_inlining): Update.
537 2021-11-23 Martin Liska <mliska@suse.cz>
539 * doc/invoke.texi: Remove 2 more duplicite param descriptions.
541 2021-11-23 Richard Biener <rguenther@suse.de>
543 PR tree-optimization/103361
544 * gimple-loop-jam.c (adjust_unroll_factor): Use lambda_int
545 for the dependence distance.
546 * tree-data-ref.c (print_lambda_vector): Properly print a lambda_int.
548 2021-11-23 Jakub Jelinek <jakub@redhat.com>
550 * tree-inline.h (struct copy_body_data): Remove
551 transform_lang_insert_block member.
552 * tree-inline.c (remap_block): Don't call
553 id->transform_lang_insert_block.
554 (optimize_inline_calls, copy_gimple_seq_and_replace_locals,
555 tree_function_versioning, maybe_inline_call_in_expr,
556 copy_fn): Don't initialize id.transform_lang_insert_block.
557 * gimplify.c (gimplify_omp_loop): Likewise.
559 2021-11-23 Jan Hubicka <hubicka@ucw.cz>
561 PR tree-optimization/103335
562 * tree-ssa-dse.c (valid_ao_ref_for_dse): Rename to ...
563 (valid_ao_ref_kill_for_dse): ... this; do not check that boundaries
564 are divisible by BITS_PER_UNIT.
565 (get_byte_aligned_range_containing_ref): New function.
566 (get_byte_aligned_range_contained_in_ref): New function.
567 (normalize_ref): Rename to ...
568 (get_byte_range): ... this one; handle accesses not aligned to byte
569 boundary; return range in bytes rater than updating ao_ref.
570 (clear_live_bytes_for_ref): Take write ref by reference; simplify using
572 (setup_live_bytes_from_ref): Likewise.
573 (clear_bytes_written_by): Update.
574 (live_bytes_read): Update.
575 (dse_classify_store): Simplify tech before live_bytes_read checks.
577 2021-11-23 Andrew Pinski <apinski@marvell.com>
579 PR tree-optimization/102216
580 * gimple-fold.c (fold_stmt_1): Add canonicalization
581 of "&MEM[ssa_n, CST]" to "ssa_n p+ CST", note this
582 can only be done if !in_place.
584 2021-11-23 Jakub Jelinek <jakub@redhat.com>
587 * gimplify.c (replace_reduction_placeholders): Remove.
588 (note_no_context_vars): New function.
589 (gimplify_omp_loop): For OMP_PARALLEL's BIND_EXPR create a new
590 BLOCK. Use copy_tree_body_r with walk_tree instead of unshare_expr
591 and replace_reduction_placeholders for duplication of
592 OMP_CLAUSE_REDUCTION_{INIT,MERGE} expressions. Ensure all mentioned
593 automatic vars have DECL_CONTEXT set to non-NULL before doing so
594 and reset it afterwards for those vars and their corresponding
597 2021-11-23 Haochen Gui <guihaoc@gcc.gnu.org>
600 * config/rs6000/altivec.md (altivec_vreve<mode>2 for VEC_K): Use
601 xxbrq for v16qi, xxbrq + xxbrh for v8hi and xxbrq + xxbrw for v4si
602 or v4sf when p9_vector is set.
603 (altivec_vreve<mode>2 for VEC_64): Defined. Implemented by xxswapd.
605 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
607 PR tree-optimization/102232
608 * match.pd (x * (1 + y / x) - y) -> (x - y % x): New optimization.
610 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
612 PR tree-optimization/96779
613 * match.pd (-x == x) -> (x == 0): New optimization.
615 2021-11-22 Roger Sayle <roger@nextmovesoftware.com>
617 PR tree-optimization/98953
618 PR tree-optimization/103345
619 * gimple-ssa-store-merging.c (find_bswap_or_nop_1): Handle
620 BIT_XOR_EXPR and PLUS_EXPR the same as BIT_IOR_EXPR.
621 (pass_optimize_bswap::execute): Likewise.
623 2021-11-22 Martin Liska <mliska@suse.cz>
625 * doc/invoke.texi: Remove duplicate documentation for 3 params.
627 2021-11-22 Richard Biener <rguenther@suse.de>
629 PR tree-optimization/103351
630 * tree-ssa-dce.c (sort_phi_args): Sort after e->dest_idx as
633 2021-11-22 liuhongt <hongtao.liu@intel.com>
636 * config/i386/constraints.md (Bk): New
637 define_memory_constraint.
638 * config/i386/i386-protos.h (ix86_gpr_tls_address_pattern_p):
640 * config/i386/i386.c (ix86_gpr_tls_address_pattern_p): New
642 * config/i386/i386.md (*movsi_internal): Don't allow
643 mask/sse/mmx move in TLS code sequences.
644 (*movdi_internal): Ditto.
646 2021-11-22 Kewen Lin <linkw@linux.ibm.com>
648 * config/xtensa/xtensa.md (movdi_internal, movdf_internal): Fix split
651 2021-11-21 Jakub Jelinek <jakub@redhat.com>
654 * config/i386/i386-options.c (ix86_valid_target_attribute_p): If
655 fndecl already has DECL_FUNCTION_SPECIFIC_TARGET, use that as base
656 instead of target_option_default_node.
658 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
661 * ipa-modref.c (parm_map_for_arg): Rename to ...
662 (parm_map_for_ptr): .. this one; handle static chain and calls to
664 (modref_access_analysis::get_access): Use parm_map_for_ptr.
665 (modref_access_analysis::process_fnspec): Update.
666 (modref_access_analysis::analyze_load): Update.
667 (modref_access_analysis::analyze_store): Update.
669 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
671 * ipa-modref.c (ignore_nondeterminism_p): Move earlier in source
673 (ignore_retval_p): Likewise.
674 (ignore_stores_p): Likewise.
675 (parm_map_for_arg): Likewise.
676 (class modref_access_analysis): New class.
677 (modref_access_analysis::set_side_effects): New member function.
678 (modref_access_analysis::set_nondeterministic): New member function.
679 (get_access): Turn to ...
680 (modref_access_analysis::get_access): ... this one.
681 (record_access): Turn to ...
682 (modref_access_analysis::record_access): ... this one.
683 (record_access_lto): Turn to ...
684 (modref_access_analysis::record_access_lto): ... This one.
685 (record_access_p): Turn to ...
686 (modref_access_analysis::record_access_p): ... This one
687 (modref_access_analysis::record_unknown_load): New member function.
688 (modref_access_analysis::record_unknown_store): New member function.
689 (get_access_for_fnspec): Turn to ...
690 (modref_access_analysis::get_access_for_fnspec): ... this one.
691 (merge_call_side_effects): Turn to ...
692 (moderf_access_analysis::merge_call_side_effects): Turn to ...
693 (collapse_loads): Move later in source code.
694 (collapse_stores): Move later in source code.
695 (process_fnspec): Turn to ...
696 (modref_access_analysis::process_fnspec): ... this one.
697 (analyze_call): Turn to ...
698 (modref_access_analysis::analyze_call): ... this one.
699 (struct summary_ptrs): Remove.
700 (analyze_load): Turn to ...
701 (modref_access_analysis::analyze_load): ... this one.
702 (analyze_store): Turn to ...
703 (modref_access_analysis::analyze_store): ... this one.
704 (analyze_stmt): Turn to ...
705 (modref_access_analysis::analyze_stmt): ... This one.
706 (remove_summary): Remove.
707 (modref_access_analysis::propagate): Break out from ...
708 (modref_access_analysis::analyze): Break out from ...
709 (analyze_function): ... here.
711 2021-11-21 Roger Sayle <roger@nextmovesoftware.com>
712 Robin Dapp <rdapp@linux.ibm.com>
715 * tree-ssa-math-opts.c (convert_mult_to_widen): Recognize
716 signed WIDEN_MULT_EXPR if the target supports umul_widen_optab.
718 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
721 * ipa-modref.c (ignore_nondeterminism_p): Allow looping pure/cont.
722 (merge_call_side_effects): Improve debug output.
724 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
727 * ipa-pure-const.c (propagate_pure_const): Fix merging of loping flag.
729 2021-11-20 Jeff Law <jeffreyalaw@gmail.com>
731 PR tree-optimization/103226
732 * config/bfin/bfin.md (doloop pattern, splitter and expander): Clobber
735 2021-11-20 Andrew Pinski <apinski@marvell.com>
737 PR tree-optimization/103220
738 * match.pd ((type) X bitop CST): Don't check if CST
739 fits into the type if only the sign changes.
741 2021-11-20 Alexandre Oliva <oliva@adacore.com>
743 PR tree-optimization/102988
744 * gimple-harden-conditionals.cc (detach_value): Copy SSA_NAME
745 without decl sharing.
747 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
749 * doc/invoke.texi: Remove whitespace after an @option.
751 2021-11-19 Paul A. Clarke <pc@us.ibm.com>
753 * config/rs6000/emmintrin.h (_mm_sad_epu8): Use vec_absd when
754 _ARCH_PWR9, optimize vec_sum2s when LE.
756 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
759 * config/darwin-driver.c (darwin_driver_init): Handle exported
760 symbols and symbol lists (suppress automatic export of the TLS
762 * config/darwin.c (darwin_rename_builtins): Remove workaround.
763 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): Likewise.
764 (REAL_LIBGCC_SPEC): Handle revised library uses.
765 * config/darwin.opt (nodefaultexport): New.
766 * config/i386/darwin.h (PR80556_WORKAROUND): Remove.
767 * config/i386/darwin32-biarch.h (PR80556_WORKAROUND): Likewise.
768 * config/i386/darwin64-biarch.h (PR80556_WORKAROUND): Likewise.
770 2021-11-19 Martin Jambor <mjambor@suse.cz>
772 * opts.c (default_options_table): Switch off
773 flag_semantic_interposition at Ofast.
774 * doc/invoke.texi (Optimize Options): Document that Ofast switches off
775 -fsemantic-interposition.
777 2021-11-19 Jan Hubicka <hubicka@ucw.cz>
779 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not guard modref
780 by !gimple_call_chain.
782 2021-11-19 Martin Sebor <msebor@redhat.com>
786 * doc/invoke.texi (-Waddress): Update.
788 2021-11-19 Andrew MacLeod <amacleod@redhat.com>
790 PR tree-optimization/103254
791 * gimple-range-gori.cc (range_def_chain::get_def_chain): Limit the
792 depth for all statements with multple ssa names.
794 2021-11-19 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
796 * config/s390/s390.md (define_peephole2): Variable insn points
797 to the first matched insn. Use peep2_next_insn(1) to refer to
798 the second matched insn.
800 2021-11-19 Tamar Christina <tamar.christina@arm.com>
802 PR tree-optimization/103311
804 * tree-vect-slp-patterns.c (vect_validate_multiplication): Fix CONJ
806 (complex_mul_pattern::matches): Move check downwards.
808 2021-11-19 Martin Liska <mliska@suse.cz>
811 2021-11-19 Martin Liska <mliska@suse.cz>
813 * cfgexpand.c (pass_expand::execute): Use option directly.
814 * function.c (allocate_struct_function): Likewise.
815 * gimple-low.c (lower_function_body): Likewise.
816 (lower_stmt): Likewise.
817 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
818 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
819 * ipa-split.c (split_function): Likewise.
820 * lto-streamer-in.c (input_function): Likewise.
821 * sese.c (sese_insert_phis_for_liveouts): Likewise.
822 * ssa-iterators.h (num_imm_uses): Likewise.
823 * tree-cfg.c (make_blocks): Likewise.
824 (gimple_merge_blocks): Likewise.
825 * tree-inline.c (tree_function_versioning): Likewise.
826 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
827 * tree-sra.c (analyze_access_subtree): Likewise.
828 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
829 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
830 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
831 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
832 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
833 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
834 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
835 (target_for_debug_bind): Likewise.
836 (insert_debug_temp_for_var_def): Likewise.
837 (insert_debug_temps_for_defs): Likewise.
838 (reset_debug_uses): Likewise.
839 * tree-ssanames.c (release_ssa_name_fn): Likewise.
840 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
841 (adjust_debug_stmts): Likewise.
842 (adjust_phi_and_debug_stmts): Likewise.
843 (vect_do_peeling): Likewise.
844 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
845 (vect_transform_loop): Likewise.
846 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
847 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
848 (MAY_HAVE_DEBUG_STMTS): Use options directly.
850 2021-11-19 Giuliano Belinassi <gbelinassi@suse.de>
852 * gcc.c (process_command): Skip dumpdir override if file is a
854 * doc/invoke.texi: Update -dumpdir documentation.
856 2021-11-19 Andrew Pinski <apinski@marvell.com>
858 PR tree-optimization/103314
859 * match.pd ((type) X op CST): Restrict the equal
860 TYPE_PRECISION case to GIMPLE only.
862 2021-11-19 Martin Liska <mliska@suse.cz>
865 * ipa-modref-tree.h (struct modref_parm_map): Add default
867 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Use it.
869 2021-11-19 Richard Biener <rguenther@suse.de>
872 * tree-eh.c (operation_could_trap_helper_p): Properly handle
873 fixed-point RDIV_EXPR.
875 2021-11-19 Richard Biener <rguenther@suse.de>
877 PR tree-optimization/102436
878 * tree-ssa-loop-im.c (execute_sm_if_changed): Add mode
879 to just create the if structure and return the then block.
880 (execute_sm): Add flag to indicate the var will re-use
882 (hoist_memory_references): Support a single conditional
883 block with all stores as special case.
885 2021-11-19 Andrew Pinski <apinski@marvell.com>
887 PR tree-optimization/103317
888 * tree-ssa-phiopt.c (minmax_replacement): For the non empty
889 middle bb case, check to make sure it has a single predecessor.
891 2021-11-19 Andrew Pinski <apinski@marvell.com>
893 PR tree-optimization/103257
895 ((m1 >/</>=/<= m2) * d -> (m1 >/</>=/<= m2) ? d : 0):
896 Disable until !canonicalize_math_p ().
898 2021-11-19 Marek Polacek <polacek@redhat.com>
902 * doc/invoke.texi: Update documentation for -Wuninitialized.
903 * tree.c (stabilize_reference): Set location.
905 2021-11-19 liuhongt <hongtao.liu@intel.com>
908 * config/i386/x86-tune-costs.h (skylake_cost): Reduce cost of
909 storing 256/512-bit SSE register to be equal to cost of
910 unaligned store to avoid odd alignment peeling.
911 (icelake_cost): Ditto.
913 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
915 * config/rs6000/predicates.md (current_file_function_operand):
916 Add flag_semantic_interposition to call of decl_replaceable_p.
918 2021-11-18 Martin Liska <mliska@suse.cz>
920 * ipa-modref.c (analyze_function): Do not execute the code
921 only if dump_file != NULL.
923 2021-11-18 Martin Liska <mliska@suse.cz>
925 * ipa-modref.c (analyze_function): Use fnode instead of repeated
926 cgraph_node::get (current_function_decl).
928 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
930 * cgraph.c (cgraph_node::get_availability): Update call of
932 (cgraph_node::verify_node): Verify that semantic_interposition flag
934 * cgraph.h: (symtab_node): Add semantic_interposition flag.
935 * cgraphclones.c (set_new_clone_decl_and_node_flags): Clear
936 semantic_interposition flag.
937 * cgraphunit.c (cgraph_node::finalize_function): Set
938 semantic_interposition flag.
939 (cgraph_node::add_new_function): Likewise.
940 (varpool_node::finalize_decl): Likewise.
941 (cgraph_node::create_wrapper): Likewise.
942 * common.opt (fsemantic-interposition): Turn to optimization node.
943 * lto-cgraph.c (lto_output_node): Stream semantic_interposition.
944 (lto_output_varpool_node): Likewise.
945 (input_overwrite_node): Likewise.
946 (input_varpool_node): Likewise.
947 * symtab.c (symtab_node::dump_base): Dump new flag.
948 * varasm.c (decl_replaceable_p): Add semantic_interposition_p
950 * varasm.h (decl_replaceable_p): Update declaration.
951 * varpool.c (varpool_node::ctor_useable_for_folding_p):
952 Use semantic_interposition flag.
953 (varpool_node::get_availability): Likewise.
954 (varpool_node::create_alias): Copy semantic_interposition flag.
956 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
959 * ipa-modref.c (modref_eaf_analysis::merge_call_lhs_flags): Unused
960 parameter may still be returned.
961 (modref_eaf_analysis::analyze_ssa_name): Call merge_call_lhs_flags
962 even for unused function args.
964 2021-11-18 Tamar Christina <tamar.christina@arm.com>
966 PR tree-optimization/103311
967 * tree-vect-slp-patterns.c (complex_mul_pattern::matches,
968 complex_fms_pattern::matches): Check for multiplications.
970 2021-11-18 H.J. Lu <hjl.tools@gmail.com>
973 * config/i386/i386.c (ix86_output_jmp_thunk_or_indirect): Emit
974 CS prefix for -mindirect-branch-cs-prefix.
975 (ix86_output_indirect_branch_via_reg): Likewise.
976 * config/i386/i386.opt: Add -mindirect-branch-cs-prefix.
977 * doc/invoke.texi: Document -mindirect-branch-cs-prefix.
979 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
981 * config/rs6000/driver-rs6000.c (detect_processor_aix): Add
984 2021-11-18 Tamar Christina <tamar.christina@arm.com>
986 PR tree-optimization/103253
987 * tree-ssa-math-opts.c (convert_mult_to_fma): Check for LHS.
989 2021-11-18 Matthias Kretz <m.kretz@gsi.de>
991 * doc/extend.texi: Document __builtin_assoc_barrier.
993 2021-11-18 Martin Liska <mliska@suse.cz>
995 * cfgexpand.c (pass_expand::execute): Use option directly.
996 * function.c (allocate_struct_function): Likewise.
997 * gimple-low.c (lower_function_body): Likewise.
998 (lower_stmt): Likewise.
999 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
1000 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
1001 * ipa-split.c (split_function): Likewise.
1002 * lto-streamer-in.c (input_function): Likewise.
1003 * sese.c (sese_insert_phis_for_liveouts): Likewise.
1004 * ssa-iterators.h (num_imm_uses): Likewise.
1005 * tree-cfg.c (make_blocks): Likewise.
1006 (gimple_merge_blocks): Likewise.
1007 * tree-inline.c (tree_function_versioning): Likewise.
1008 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
1009 * tree-sra.c (analyze_access_subtree): Likewise.
1010 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
1011 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1012 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
1013 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
1014 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
1015 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
1016 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
1017 (target_for_debug_bind): Likewise.
1018 (insert_debug_temp_for_var_def): Likewise.
1019 (insert_debug_temps_for_defs): Likewise.
1020 (reset_debug_uses): Likewise.
1021 * tree-ssanames.c (release_ssa_name_fn): Likewise.
1022 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
1023 (adjust_debug_stmts): Likewise.
1024 (adjust_phi_and_debug_stmts): Likewise.
1025 (vect_do_peeling): Likewise.
1026 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
1027 (vect_transform_loop): Likewise.
1028 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
1029 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
1030 (MAY_HAVE_DEBUG_STMTS): Use options directly.
1032 2021-11-18 Richard Biener <rguenther@suse.de>
1034 PR tree-optimization/103277
1035 * tree-ssa-dse.c (need_ab_cleanup): New.
1036 (dse_optimize_redundant_stores): Adjust.
1037 (delete_dead_or_redundant_assignment): Get extra
1038 need_ab_cleanup argument and set when abnormal cleanup is
1040 (dse_optimize_call): Adjust.
1041 (dse_optimize_stmt): Likewise.
1042 (pass_dse::execute): Allocate and deallocate need_ab_cleanup.
1043 Perform abnormal cleanup.
1044 * tree-ssa-dse.h (delete_dead_or_redundant_assignment): Adjust.
1046 2021-11-18 Hongyu Wang <hongyu.wang@intel.com>
1048 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
1049 Adjust generated cfg to avoid infinite loop.
1051 2021-11-18 konglin1 <lingling.kong@intel.com>
1053 * config/i386/avx512fp16intrin.h (_mm512_mul_pch): Add alias for _mm512_fmul_pch.
1054 (_mm512_mask_mul_pch): Likewise.
1055 (_mm512_maskz_mul_pch): Likewise.
1056 (_mm512_mul_round_pch): Likewise.
1057 (_mm512_mask_mul_round_pch): Likewise.
1058 (_mm512_maskz_mul_round_pch): Likewise.
1059 (_mm512_cmul_pch): Likewise.
1060 (_mm512_mask_cmul_pch): Likewise.
1061 (_mm512_maskz_cmul_pch): Likewise.
1062 (_mm512_cmul_round_pch): Likewise.
1063 (_mm512_mask_cmul_round_pch): Likewise.
1064 (_mm512_maskz_cmul_round_pch): Likewise.
1065 (_mm_mul_sch): Likewise.
1066 (_mm_mask_mul_sch): Likewise.
1067 (_mm_maskz_mul_sch): Likewise.
1068 (_mm_mul_round_sch): Likewise.
1069 (_mm_mask_mul_round_sch): Likewise.
1070 (_mm_maskz_mul_round_sch): Likewise.
1071 (_mm_cmul_sch): Likewise.
1072 (_mm_mask_cmul_sch): Likewise.
1073 (_mm_maskz_cmul_sch): Likewise.
1074 (_mm_cmul_round_sch): Likewise.
1075 (_mm_mask_cmul_round_sch): Likewise.
1076 (_mm_maskz_cmul_round_sch): Likewise.
1077 * config/i386/avx512fp16vlintrin.h (_mm_mul_pch): Likewise.
1078 (_mm_mask_mul_pch): Likewise.
1079 (_mm_maskz_mul_pch): Likewise.
1080 (_mm256_mul_pch): Likewise.
1081 (_mm256_mask_mul_pch): Likewise.
1082 (_mm256_maskz_mul_pch): Likewise.
1083 (_mm_cmul_pch): Likewise.
1084 (_mm_mask_cmul_pch): Likewise.
1085 (_mm_maskz_cmul_pch): Likewise.
1086 (_mm256_cmul_pch): Likewise.
1087 (_mm256_mask_cmul_pch): Likewise.
1088 (_mm256_maskz_cmul_pch): Likewise.
1090 2021-11-17 Andrew Pinski <apinski@marvell.com>
1092 PR tree-optimization/103228
1093 PR tree-optimization/55177
1094 * match.pd ((type) X bitop CST): Also do this
1095 transformation for nop conversions.
1097 2021-11-17 Martin Sebor <msebor@redhat.com>
1099 PR tree-optimization/102759
1100 * gimple-array-bounds.cc (build_printable_array_type): Move...
1101 * gimple-ssa-warn-access.cc (build_printable_array_type): Avoid
1102 pathological function redeclarations that remove a previously
1104 Improve formatting of function arguments in informational notes.
1105 * pointer-query.cc (build_printable_array_type): ...to here.
1106 * pointer-query.h (build_printable_array_type): Declared.
1108 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
1111 * config/i386/i386-opts.h (harden_sls): New enum.
1112 * config/i386/i386.c (output_indirect_thunk): Mitigate against
1113 SLS for function return.
1114 (ix86_output_function_return): Likewise.
1115 (ix86_output_jmp_thunk_or_indirect): Mitigate against indirect
1117 (ix86_output_indirect_jmp): Likewise.
1118 (ix86_output_call_insn): Likewise.
1119 * config/i386/i386.opt: Add -mharden-sls=.
1120 * doc/invoke.texi: Document -mharden-sls=.
1122 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
1125 * config/i386/i386.c (ix86_code_end): Remove "%!" before ret.
1126 (ix86_output_function_return): Likewise.
1127 * config/i386/i386.md (simple_return_pop_internal): Likewise.
1129 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
1132 * ipa-modref.c (read_modref_records): Fix streaminig in of every_access
1135 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
1137 * config/i386/i386.c (indirect_thunks_used): Redefine as HARD_REG_SET.
1138 (ix86_code_end): Use TEST_HARD_REG_BIT on indirect_thunks_used.
1139 (ix86_output_indirect_branch_via_reg): Use SET_HARD_REG_BIT
1140 on indirect_thunks_used.
1141 (ix86_output_indirect_function_return): Ditto.
1143 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
1145 * ipa-modref-tree.c: Include cgraph.h and tree-streamer.h.
1146 (modref_access_node::stream_out): New member function.
1147 (modref_access_node::stream_in): New member function.
1148 * ipa-modref-tree.h (modref_access_node::stream_out,
1149 modref_access_node::stream_in): Declare.
1150 * ipa-modref.c (modref_summary_lto::useful_p): Free useless kills.
1151 (modref_summary_lto::dump): Dump kills.
1152 (analyze_store): Record kills for LTO
1153 (analyze_stmt): Likewise.
1154 (modref_summaries_lto::duplicate): Duplicate kills.
1155 (write_modref_records): Use new stream_out member function.
1156 (read_modref_records): Likewise.
1157 (modref_write): Stream out kills.
1158 (read_section): Stream in kills
1159 (remap_kills): New function.
1160 (update_signature): Use it.
1162 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
1164 * config/i386/i386.h (LEGACY_SSE_REGNO_P): New predicate.
1165 (SSE_REGNO_P): Use LEGACY_SSE_REGNO_P predicate.
1166 * config/i386/i386.c (zero_all_vector_registers):
1167 Use LEGACY_SSE_REGNO_P predicate.
1168 (ix86_register_priority): Use REX_INT_REGNO_P, REX_SSE_REGNO_P
1169 and EXT_REG_SSE_REGNO_P predicates.
1170 (ix86_hard_regno_call_part_clobbered): Use REX_SSE_REGNO_P
1171 and LEGACY_SSE_REGNO_P predicates.
1173 2021-11-17 Jason Merrill <jason@redhat.com>
1175 * doc/invoke.texi (C++ Dialect Options): Document
1176 -fimplicit-constexpr.
1178 2021-11-17 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
1180 * config/aarch64/aarch64-modes.def (VECTOR_MODE): New V8DI mode.
1181 * config/aarch64/aarch64.c (aarch64_hard_regno_mode_ok): Handle
1183 * config/aarch64/iterators.md (define_mode_attr nunits): Add entry
1186 2021-11-17 Martin Uecker <uecker@gcc.gnu.org>
1190 * gimplify.c (gimplify_var_or_parm_decl): Update comment.
1191 (gimplify_compound_lval): Gimplify base expression first.
1192 (gimplify_target_expr): Add comment.
1194 2021-11-17 Jakub Jelinek <jakub@redhat.com>
1196 PR tree-optimization/103192
1197 * tree-ssa-loop-im.c (move_computations_worker): Use
1198 reset_flow_sensitive_info instead of manually clearing
1199 SSA_NAME_RANGE_INFO and do it for all SSA_NAMEs, not just ones
1200 with integral types.
1202 2021-11-17 Jakub Jelinek <jakub@redhat.com>
1204 PR tree-optimization/103255
1205 * gimple-range-fold.cc (fold_using_range::range_of_address): Return
1206 range_nonzero rather than unadjusted base's range. Formatting fixes.
1208 2021-11-17 Richard Sandiford <richard.sandiford@arm.com>
1210 * doc/md.texi (cond_fmin@var{mode}, cond_fmax@var{mode}): Document.
1211 * optabs.def (cond_fmin_optab, cond_fmax_optab): New optabs.
1212 * internal-fn.def (COND_FMIN, COND_FMAX): New functions.
1213 * internal-fn.c (first_commutative_argument): Handle them.
1214 (FOR_EACH_COND_FN_PAIR): Likewise.
1215 * match.pd (UNCOND_BINARY, COND_BINARY): Likewise.
1216 * config/aarch64/aarch64-sve.md (cond_<fmaxmin><mode>): New
1219 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
1221 * config/i386/i386.md (*add<dwi>3_doubleword, *addv<dwi>4_doubleword,
1222 *addv<dwi>4_doubleword_1, *sub<dwi>3_doubleword,
1223 *subv<dwi>4_doubleword, *subv<dwi>4_doubleword_1,
1224 *add<dwi>3_doubleword_cc_overflow_1, *divmodsi4_const,
1225 *neg<dwi>2_doubleword, *tls_dynamic_gnu2_combine_64_<mode>): Fix split
1228 2021-11-17 Andrew Pinski <apinski@marvell.com>
1230 PR tree-optimization/103288
1231 * tree-ssa-phiopt.c (value_replacement): Return early if middle
1232 block has more than one pred.
1234 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
1236 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
1237 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
1238 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
1239 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
1240 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
1241 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
1242 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
1243 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
1245 2021-11-17 Marek Polacek <polacek@redhat.com>
1247 PR preprocessor/103026
1248 * doc/invoke.texi: Document -Wbidi-chars.
1250 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
1253 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Fix clearing
1256 2021-11-16 Jan Hubicka <jh@suse.cz>
1258 * ipa-modref.c (get_modref_function_summary): Declare.
1259 * ipa-modref.h (get_modref_function_summary): New function.
1260 * tree-ssa-dse.c (clear_live_bytes_for_ref): Break out from ...
1261 (clear_bytes_written_by): ... here; also clear memory killed by
1264 2021-11-16 Iain Sandoe <iain@sandoe.co.uk>
1266 * ggc-common.c (gt_pch_save): If we cannot find a suitable
1267 memory segment for save, then error-out, do not try to
1269 (gt_pch_restore): Save the existing line table, and when
1270 the replacement is being read, use that when constructing
1273 2021-11-16 Peter Bergner <bergner@linux.ibm.com>
1276 * config/rs6000/mma.md (*vsx_assemble_pair): Add early-clobber for
1278 (*mma_assemble_acc): Likewise.
1280 2021-11-16 Martin Sebor <msebor@redhat.com>
1282 PR tree-optimization/102960
1283 * gimple-fold.c (get_range_strlen): Take bitmap as an argument rather
1284 than a pointer to it.
1285 (get_range_strlen_tree): Same. Remove bitmap allocation. Use
1287 (get_maxval_strlen): Use an auto_bitmap.
1288 * tree-ssa-strlen.c (get_range_strlen_dynamic): Factor out PHI
1290 (get_range_strlen_phi): ...into this function.
1291 Avoid assuming maximum string length is constant
1292 (printf_strlen_execute): Dump pointer query cache contents when
1293 details are requisted.
1295 2021-11-16 Jason Merrill <jason@redhat.com>
1297 * langhooks.h (struct lang_hooks): Adjust comment.
1298 * print-tree.c (print_node): Also call print_xnode hook for
1301 2021-11-16 Andrew Pinski <apinski@marvell.com>
1303 PR tree-optimization/103218
1304 * match.pd: New pattern for "((type)(a<0)) << SIGNBITOFA".
1306 2021-11-16 Claudiu Zissulescu <claziss@synopsys.com>
1308 * config/arc/arc.md (maddhisi4): Use a single move to accumulator.
1309 (umaddhisi4): Likewise.
1310 (machi): Update pattern.
1313 2021-11-16 Richard Biener <rguenther@suse.de>
1315 PR tree-optimization/102880
1316 * tree-ssa-dce.c (sort_phi_args): New function.
1317 (make_forwarders_with_degenerate_phis): Likewise.
1318 (perform_tree_ssa_dce): Call
1319 make_forwarders_with_degenerate_phis.
1321 2021-11-16 Richard Biener <rguenther@suse.de>
1323 PR tree-optimization/102880
1324 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Push
1325 single_pred (bb1) condition to places that really need it.
1326 (match_simplify_replacement): Likewise.
1327 (value_replacement): Likewise.
1328 (replace_phi_edge_with_variable): Deal with extra edges
1331 2021-11-16 Martin Jambor <mjambor@suse.cz>
1333 * cfgexpand.c (expand_gimple_basic_block): Use build_debug_expr_decl,
1334 add a fixme note about the mode assignment perhaps being unnecessary.
1335 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
1337 (ipa_param_body_adjustments::mark_dead_statements): Likewise.
1338 (ipa_param_body_adjustments::reset_debug_stmts): Likewise.
1339 * tree-inline.c (remap_ssa_name): Likewise.
1340 (tree_function_versioning): Likewise.
1341 * tree-into-ssa.c (rewrite_debug_stmt_uses): Likewise.
1342 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1343 * tree-ssa.c (insert_debug_temp_for_var_def): Likewise.
1345 2021-11-16 Jakub Jelinek <jakub@redhat.com>
1347 PR tree-optimization/103208
1348 * omp-expand.c (expand_omp_build_cond): New function.
1349 (expand_omp_for_init_counts, expand_omp_for_init_vars,
1350 expand_omp_for_static_nochunk, expand_omp_for_static_chunk): Use it.
1352 2021-11-16 Jakub Jelinek <jakub@redhat.com>
1354 PR tree-optimization/102009
1355 * gimple-ssa-warn-access.cc (pass_waccess::check_alloc_size_call):
1356 Punt if any of alloc_size arguments is out of bounds vs. number of
1359 2021-11-16 Roger Sayle <roger@nextmovesoftware.com>
1361 * config/i386/i386.md (*bmi2_rorx<mode3>_1): Make conditional
1362 on !optimize_function_for_size_p.
1363 (*<any_rotate><mode>3_1): Add preferred_for_size attribute.
1364 (define_splits): Conditionalize on !optimize_function_for_size_p.
1365 (*bmi2_rorxsi3_1_zext): Likewise.
1366 (*<any_rotate>si2_1_zext): Add preferred_for_size attribute.
1367 (define_splits): Conditionalize on !optimize_function_for_size_p.
1369 2021-11-16 Jan Hubicka <jh@suse.cz>
1372 * ipa-modref.c (merge_call_side_effects): Fix uninitialized
1375 2021-11-16 Andrew Pinski <apinski@marvell.com>
1377 PR tree-optimization/103245
1378 * match.pd: Combine the abs pattern matching using multiplication.
1379 Adding optional nop_convert too.
1381 2021-11-16 H.J. Lu <hjl.tools@gmail.com>
1383 PR middle-end/103268
1384 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Add a missing
1387 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
1389 * gimple-fold.c (gimple_fold_builtin_strncat): Use ranges to
1390 determine if it is safe to transform to strcat.
1391 (gimple_fold_builtin_snprintf): Likewise.
1393 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
1395 * gimple-fold.c (known_lower): New function.
1396 (gimple_fold_builtin_strncat_chk,
1397 gimple_fold_builtin_memory_chk, gimple_fold_builtin_stxcpy_chk,
1398 gimple_fold_builtin_stxncpy_chk,
1399 gimple_fold_builtin_snprintf_chk,
1400 gimple_fold_builtin_sprintf_chk): Use it.
1402 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
1404 * gimple-fold.c (dump_transformation): New function.
1405 (gimple_fold_builtin_stxcpy_chk,
1406 gimple_fold_builtin_stxncpy_chk): Use it. Simplify to
1407 BUILT_IN_STRNCPY if return value is not used.
1409 2021-11-15 H.J. Lu <hjl.tools@gmail.com>
1411 PR middle-end/103184
1412 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Check optab
1413 before transforming equivalent, but slighly different cases to
1414 their canonical forms.
1416 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
1419 * config/darwin.h (TARGET_DTORS_FROM_CXA_ATEXIT): New.
1420 * doc/tm.texi: Regenerated.
1421 * doc/tm.texi.in: Add TARGET_DTORS_FROM_CXA_ATEXIT hook.
1422 * ipa.c (cgraph_build_static_cdtor_1): Return the built
1424 (build_cxa_atexit_decl): New.
1425 (build_dso_handle_decl): New.
1426 (build_cxa_dtor_registrations): New.
1427 (compare_cdtor_tu_order): New.
1428 (build_cxa_atexit_fns): New.
1429 (ipa_cdtor_merge): If dtors_from_cxa_atexit is set,
1430 process the DTORs/CTORs accordingly.
1431 (pass_ipa_cdtor_merge::gate): Also run if
1432 dtors_from_cxa_atexit is set.
1433 * target.def (dtors_from_cxa_atexit): New hook.
1435 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
1437 * config.in: Regenerate.
1438 * configure: Regenerate.
1439 * configure.ac: Test ld64 for -platform-version support.
1441 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
1443 * diagnostic-spec.h (typedef xint_hash_t)
1444 (typedef xint_hash_map_t): Replace with...
1445 (typedef nowarn_map_t): ... this.
1446 (nowarn_map): Adjust.
1447 * diagnostic-spec.c (nowarn_map, suppress_warning_at): Likewise.
1449 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
1451 * profile.c (branch_prob): Use 'location_hash' for
1454 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
1456 PR tree-optimization/103207
1457 * value-range.cc (irange::set): Drop overflow.
1459 2021-11-15 Jakub Jelinek <jakub@redhat.com>
1461 * gimplify.c (optimize_target_teams): Only add OMP_CLAUSE_THREAD_LIMIT
1462 to OMP_TARGET_CLAUSES if it isn't there already.
1464 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
1466 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
1467 Remove useless code.
1468 (path_range_query::ssa_defined_in_bb): New.
1469 (path_range_query::ssa_range_in_phi): Avoid fold_range call that
1470 could trigger additional lookups.
1471 Do not use the cache for ARGs defined in this block.
1472 (path_range_query::compute_ranges_in_block): Use ssa_defined_in_bb.
1473 (path_range_query::maybe_register_phi_relation): Same.
1474 (path_range_query::range_of_stmt): Adjust comment.
1475 * gimple-range-path.h (ssa_defined_in_bb): New.
1477 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
1479 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
1480 Default to global range if nothing found.
1482 2021-11-15 Richard Biener <rguenther@suse.de>
1484 PR tree-optimization/103237
1485 * tree-vect-loop.c (vect_is_simple_reduction): Fail for
1486 double reductions with multiple inner loop LC PHI nodes.
1488 2021-11-15 Hongyu Wang <hongyu.wang@intel.com>
1491 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
1492 New expand function.
1493 * config/i386/i386-options.c (ix86_target_string): Add
1494 -mrelax-cmpxchg-loop flag.
1495 (ix86_valid_target_attribute_inner_p): Likewise.
1496 * config/i386/i386-protos.h (ix86_expand_atomic_fetch_op_loop):
1497 New expand function prototype.
1498 * config/i386/i386.opt: Add -mrelax-cmpxchg-loop.
1499 * config/i386/sync.md (atomic_fetch_<logic><mode>): New expander
1501 (atomic_<logic>_fetch<mode>): Likewise.
1502 (atomic_fetch_nand<mode>): Likewise.
1503 (atomic_nand_fetch<mode>): Likewise.
1504 (atomic_fetch_<logic><mode>): New expander for DI,TI modes.
1505 (atomic_<logic>_fetch<mode>): Likewise.
1506 (atomic_fetch_nand<mode>): Likewise.
1507 (atomic_nand_fetch<mode>): Likewise.
1508 * doc/invoke.texi: Document -mrelax-cmpxchg-loop.
1510 2021-11-15 Richard Biener <rguenther@suse.de>
1512 PR tree-optimization/103219
1513 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Use single_exit
1514 to determine the exit for the VN region.
1516 2021-11-15 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
1518 * tree-ssa-loop.c (pass_vectorize): Move to tree-vectorizer.c.
1519 (pass_data_vectorize): Likewise.
1520 (make_pass_vectorize): Likewise.
1521 * tree-vectorizer.c (vectorize_loops): Merge with
1522 pass_vectorize::execute and replace cfun occurences with fun param.
1523 (adjust_simduid_builtins): Add fun param, replace cfun occurences with
1524 fun, and adjust callers approrpiately.
1525 (note_simd_array_uses): Likewise.
1526 (vect_loop_dist_alias_call): Likewise.
1527 (set_uid_loop_bbs): Likewise.
1528 (vect_transform_loops): Likewise.
1529 (try_vectorize_loop_1): Likewise.
1530 (try_vectorize_loop): Likewise.
1532 2021-11-15 Jakub Jelinek <jakub@redhat.com>
1535 * config/i386/sync.md (atomic_bit_test_and_set<mode>,
1536 atomic_bit_test_and_complement<mode>,
1537 atomic_bit_test_and_reset<mode>): Use OPTAB_WIDEN instead of
1540 2021-11-15 Maciej W. Rozycki <macro@embecosm.com>
1542 * config/vax/vax.c (vax_lra_p): New prototype and function.
1543 (TARGET_LRA_P): Wire it.
1544 * config/vax/vax.opt (mlra): New option.
1545 * doc/invoke.texi (Option Summary, VAX Options): Document the
1548 2021-11-14 Jan Hubicka <jh@suse.cz>
1550 * ipa-modref.h (struct modref_summary): Add nondeterministic
1551 and calls_interposable flags.
1552 * ipa-modref.c (modref_summary::modref_summary): Initialize new flags.
1553 (modref_summary::useful_p): Check new flags.
1554 (struct modref_summary_lto): Add nondeterministic and
1555 calls_interposable flags.
1556 (modref_summary_lto::modref_summary_lto): Initialize new flags.
1557 (modref_summary_lto::useful_p): Check new flags.
1558 (modref_summary::dump): Dump new flags.
1559 (modref_summary_lto::dump): Dump new flags.
1560 (ignore_nondeterminism_p): New function.
1561 (merge_call_side_effects): Merge new flags.
1562 (process_fnspec): Likewise.
1563 (analyze_load): Volatile access is nondeterministic.
1564 (analyze_store): Liekwise.
1565 (analyze_stmt): Volatile ASM is nondeterministic.
1566 (analyze_function): Clear new flags.
1567 (modref_summaries::duplicate): Duplicate new flags.
1568 (modref_summaries_lto::duplicate): Duplicate new flags.
1569 (modref_write): Stream new flags.
1570 (read_section): Stream new flags.
1571 (propagate_unknown_call): Update new flags.
1572 (modref_propagate_in_scc): Propagate new flags.
1573 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Check
1575 * tree-ssa-structalias.c (determine_global_memory_access):
1578 2021-11-14 Maciej W. Rozycki <macro@embecosm.com>
1580 * config/vax/vax.h (SET_RATIO): New macro.
1581 * config/vax/vax.md (UNSPEC_SETMEM_FILL): New constant.
1582 (setmemhi): New expander.
1583 (setmemhi1): New insn and splitter.
1584 (*setmemhi1): New insn.
1586 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
1588 * ipa-modref-tree.c (modref_access_node::update_for_kills): New
1590 (modref_access_node::merge_for_kills): Likewise.
1591 (modref_access_node::insert_kill): Likewise.
1592 * ipa-modref-tree.h (modref_access_node::update_for_kills,
1593 modref_access_node::merge_for_kills, modref_access_node::insert_kill):
1595 (modref_access_node::useful_for_kill): New member function.
1596 * ipa-modref.c (modref_summary::useful_p): Release useless kills.
1597 (lto_modref_summary): Add kills.
1598 (modref_summary::dump): Dump kills.
1599 (record_access): Add mdoref_access_node parameter.
1600 (record_access_lto): Likewise.
1601 (merge_call_side_effects): Merge kills.
1602 (analyze_call): Add ALWAYS_EXECUTED param and pass it around.
1603 (struct summary_ptrs): Add always_executed filed.
1604 (analyze_load): Update.
1605 (analyze_store): Update; record kills.
1606 (analyze_stmt): Add always_executed; record kills in clobbers.
1607 (analyze_function): Track always_executed.
1608 (modref_summaries::duplicate): Duplicate kills.
1609 (update_signature): Release kills.
1610 * ipa-modref.h (struct modref_summary): Add kills.
1611 * tree-ssa-alias.c (alias_stats): Add kill stats.
1612 (dump_alias_stats): Dump kill stats.
1613 (store_kills_ref_p): Break out from ...
1614 (stmt_kills_ref_p): Use it; handle modref info based kills.
1616 2021-11-14 Aldy Hernandez <aldyh@redhat.com>
1618 PR tree-optimization/103229
1619 * gimple-range-cache.cc (ssa_global_cache::clear): Do not pass
1620 null value to memset.
1622 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
1624 * ipa-modref-tree.c (modref_access_node::get_call_arg): New member
1626 (modref_access_node::get_ao_ref): Likewise.
1627 * ipa-modref-tree.h (modref_access_node::get_call_arg): Declare.
1628 (modref_access_node::get_ao_ref): Declare.
1629 * tree-ssa-alias.c (modref_may_conflict): Use new accessors.
1630 * tree-ssa-dse.c (dse_optimize_call): Use new accessors.
1632 2021-11-13 Jan Hubicka <jh@suse.cz>
1635 * dbgcnt.def (ipa_attr): New counters.
1636 * ipa-pure-const.c: Include dbgcnt.c
1637 (ipa_make_function_const): Use debug counter.
1638 (ipa_make_function_pure): Likewise.
1639 (propagate_pure_const): Fix bug in my previous change.
1641 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
1643 * ipa-modref-tree.c (modref_access_node::range_info_useful_p):
1644 Offline from ipa-modref-tree.h.
1645 (modref_access_node::dump): Move from ipa-modref.c; make member
1647 * ipa-modref-tree.h (modref_access_node::range_info_useful_p.
1648 modref_access_node::dump): Declare.
1649 * ipa-modref.c (dump_access): Remove.
1650 (dump_records): Update.
1651 (dump_lto_records): Update.
1652 (record_access): Update.
1653 (record_access_lto): Update.
1655 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
1657 * ipa-modref.c (modref_summary::modref_summary): Clear new flags.
1658 (modref_summary::dump): Dump try_dse.
1659 (modref_summary::finalize): Add FUN attribute; compute try-dse.
1660 (analyze_function): Update.
1661 (read_section): Update.
1662 (update_signature): Update.
1663 (pass_ipa_modref::execute): Update.
1664 * ipa-modref.h (struct modref_summary):
1665 * tree-ssa-alias.c (ao_ref_init_from_ptr_and_range): Export.
1666 * tree-ssa-alias.h (ao_ref_init_from_ptr_and_range): Declare.
1667 * tree-ssa-dse.c (dse_optimize_call): New function.
1668 (dse_optimize_stmt): Use it.
1670 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
1672 * ipa-modref-tree.c: Move #if CHECKING_P to proper place.
1674 2021-11-13 Jan Hubicka <jh@suse.cz>
1677 (struct modref_access_node): Move longer member functions to
1679 (modref_ref_node::try_merge_with): Turn into modreef_acces_node member
1681 * ipa-modref-tree.c (modref_access_node::contains): Move here
1682 from ipa-modref-tree.h.
1683 (modref_access_node::update): Likewise.
1684 (modref_access_node::merge): Likewise.
1685 (modref_access_node::closer_pair_p): Likewise.
1686 (modref_access_node::forced_merge): Likewise.
1687 (modref_access_node::update2): Likewise.
1688 (modref_access_node::combined_offsets): Likewise.
1689 (modref_access_node::try_merge_with): Likewise.
1690 (modref_access_node::insert): Likewise.
1692 2021-11-13 Jan Hubicka <jh@suse.cz>
1694 * ipa-modref.c (modref_summary::global_memory_read_p): Remove.
1695 (modref_summary::global_memory_written_p): Remove.
1696 (modref_summary::dump): Dump new flags.
1697 (modref_summary::finalize): New member function.
1698 (analyze_function): Call it.
1699 (read_section): Call it.
1700 (update_signature): Call it.
1701 (pass_ipa_modref::execute): Call it.
1702 * ipa-modref.h (struct modref_summary): Remove
1703 global_memory_read_p and global_memory_written_p.
1704 Add global_memory_read, global_memory_written.
1705 * tree-ssa-structalias.c (determine_global_memory_access):
1708 2021-11-13 Jan Hubicka <jh@suse.cz>
1710 * ipa-fnsummary.c (compute_fn_summary): Use type_attribut_allowed_p
1711 * ipa-param-manipulation.c
1712 (ipa_param_adjustments::type_attribute_allowed_p):
1713 New member function.
1714 (drop_type_attribute_if_params_changed_p): New function.
1715 (build_adjusted_function_type): Use it.
1716 * ipa-param-manipulation.h: Add type_attribute_allowed_p.
1718 2021-11-13 David Malcolm <dmalcolm@redhat.com>
1720 * doc/invoke.texi (Static Analyzer Options): Add
1721 -Wno-analyzer-tainted-allocation-size,
1722 -Wno-analyzer-tainted-divisor, -Wno-analyzer-tainted-offset, and
1723 -Wno-analyzer-tainted-size to list. Add
1724 -Wanalyzer-tainted-allocation-size, -Wanalyzer-tainted-divisor,
1725 -Wanalyzer-tainted-offset, and -Wanalyzer-tainted-size to list
1726 of options effectively enabled by -fanalyzer.
1727 (-Wanalyzer-tainted-allocation-size): New.
1728 (-Wanalyzer-tainted-array-index): Tweak wording; add link to CWE.
1729 (-Wanalyzer-tainted-divisor): New.
1730 (-Wanalyzer-tainted-offset): New.
1731 (-Wanalyzer-tainted-size): New.
1733 2021-11-13 Jan Hubicka <jh@suse.cz>
1735 * attr-fnspec.h (attr_fnspec::arg_eaf_flags): Break out from ...
1736 * gimple.c (gimple_call_arg_flags): ... here.
1737 * ipa-modref.c (analyze_parms): Record flags known from fnspec.
1738 (modref_merge_call_site_flags): Use arg_eaf_flags.
1740 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
1742 PR tree-optimization/103222
1743 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
1745 (path_range_query::compute_ranges_in_block): Call
1746 compute_ranges_in_phis.
1747 * gimple-range-path.h (path_range_query::compute_ranges_in_phis):
1750 2021-11-13 Jan Hubicka <jh@suse.cz>
1752 * ipa-fnsummary.c (compute_fn_summary): Do not give up on signature
1753 changes on "fn spec" attribute; give up on varadic types.
1754 * ipa-param-manipulation.c: Include attribs.h.
1755 (build_adjusted_function_type): New parameter ARG_MODIFIED; if it is
1756 true remove "fn spec" attribute.
1757 (ipa_param_adjustments::build_new_function_type): Update.
1758 (ipa_param_body_adjustments::modify_formal_parameters): update.
1759 * ipa-sra.c: Include attribs.h.
1760 (ipa_sra_preliminary_function_checks): Do not check for TYPE_ATTRIBUTES.
1762 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
1764 * gimple-range-path.cc (path_range_query::path_range_query): Merge
1766 (path_range_query::import_p): Move from header file.
1767 (path_range_query::~path_range_query): Adjust for combined ctors.
1768 * gimple-range-path.h: Merge ctors.
1769 (path_range_query::import_p): Move to .cc file.
1771 2021-11-13 Jan Hubicka <jh@suse.cz>
1773 PR tree-optimization/103182
1774 * ipa-modref.c (callee_to_caller_flags): Fix merging of flags.
1775 (modref_eaf_analysis::analyze_ssa_name): Fix merging of flags.
1777 2021-11-12 Stafford Horne <shorne@gmail.com>
1779 * config/or1k/or1k-protos.h (or1k_profile_hook): New function.
1780 * config/or1k/or1k.h (PROFILE_HOOK): Change macro to reference
1781 new function or1k_profile_hook.
1782 * config/or1k/or1k.c (struct machine_function): Add new field
1783 set_mcount_arg_insn.
1784 (or1k_profile_hook): New function.
1785 (or1k_init_pic_reg): Update to inject pic rtx after _mcount arg
1787 (or1k_frame_pointer_required): Frame pointer no longer needed
1790 2021-11-12 Jan Hubicka <jh@suse.cz>
1792 PR tree-optimization/103209
1793 * tree-ssa-structalias.c (find_func_aliases_for_call): Fix
1794 use of handle_rhs_call
1796 2021-11-12 Aldy Hernandez <aldyh@redhat.com>
1798 PR tree-optimization/103202
1799 * gimple-range-path.cc
1800 (path_range_query::compute_ranges_in_block): Solve PHI imports first.
1802 2021-11-12 Jan Hubicka <jh@suse.cz>
1804 * ipa-pure-const.c (propagate_pure_const): Remove redundant check;
1805 fix call of ipa_make_function_const and ipa_make_function_pure.
1807 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1809 * config/aarch64/aarch64.c (aarch64_vector_op::n_advsimd_ops): Delete.
1810 (aarch64_vector_op::m_seen_loads): Likewise.
1811 (aarch64_vector_costs::aarch64_vector_costs): Don't push to
1813 (aarch64_vector_op::count_ops): Remove vectype and factor parameters.
1814 Remove code that tries to predict different vec_flags from the
1816 (aarch64_vector_costs::add_stmt_cost): Update accordingly.
1817 Remove m_advsimd_ops handling.
1819 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1821 * config/aarch64/aarch64.c (aarch64_vector_costs::m_saw_sve_only_op)
1822 (aarch64_sve_only_stmt_p): Delete.
1823 (aarch64_vector_costs::prefer_unrolled_loop): New function,
1824 extracted from adjust_body_cost.
1825 (aarch64_vector_costs::better_main_loop_than_p): New function,
1826 using heuristics extracted from adjust_body_cost and
1827 adjust_body_cost_sve.
1828 (aarch64_vector_costs::adjust_body_cost_sve): Remove
1829 advsimd_cycles_per_iter and could_use_advsimd parameters.
1830 Update after changes above.
1831 (aarch64_vector_costs::adjust_body_cost): Update after changes above.
1833 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1835 * config/aarch64/aarch64.c (aarch64_vec_op_count::m_vf_factor):
1836 New member variable.
1837 (aarch64_vec_op_count::aarch64_vec_op_count): Add a parameter for it.
1838 (aarch64_vec_op_count::vf_factor): New function.
1839 (aarch64_vector_costs::aarch64_vector_costs): When costing for
1840 neoverse-512tvb, pass a vf_factor of 2 for the Neoverse V1 version
1842 (aarch64_vector_costs::adjust_body_cost): Read the vf factor
1843 instead of hard-coding 2.
1845 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1847 * config/aarch64/aarch64.c
1848 (aarch64_vec_op_count::rename_cycles_per_iter): New function.
1849 (aarch64_vec_op_count::min_nonpred_cycles_per_iter): Likewise.
1850 (aarch64_vec_op_count::min_pred_cycles_per_iter): Likewise.
1851 (aarch64_vec_op_count::min_cycles_per_iter): Likewise.
1852 (aarch64_vec_op_count::dump): Move earlier in file. Dump the
1853 above properties too.
1854 (aarch64_estimate_min_cycles_per_iter): Delete.
1855 (adjust_body_cost): Use aarch64_vec_op_count::min_cycles_per_iter
1856 instead of aarch64_estimate_min_cycles_per_iter. Rely on the dump
1857 routine to print CPI estimates.
1858 (adjust_body_cost_sve): Likewise. Use the other functions above
1859 instead of doing the work inline.
1861 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1863 * config/aarch64/aarch64.c (aarch64_vec_op_count): Allow default
1865 (aarch64_vec_op_count::base_issue_info): Remove handling of null
1867 (aarch64_vec_op_count::simd_issue_info): Likewise.
1868 (aarch64_vec_op_count::sve_issue_info): Likewise.
1869 (aarch64_vector_costs::m_ops): Turn into a vector.
1870 (aarch64_vector_costs::m_advsimd_ops): Likewise.
1871 (aarch64_vector_costs::aarch64_vector_costs): Add entries to
1872 the vectors based on aarch64_tune_params.
1873 (aarch64_vector_costs::analyze_loop_vinfo): Update the pred_ops
1874 of all entries in m_ops.
1875 (aarch64_vector_costs::add_stmt_cost): Call count_ops for all
1877 (aarch64_estimate_min_cycles_per_iter): Remove issue_info
1878 parameter and get the information from the ops instead.
1879 (aarch64_vector_costs::adjust_body_cost_sve): Take a
1880 aarch64_vec_issue_info instead of a aarch64_vec_op_count.
1881 (aarch64_vector_costs::adjust_body_cost): Update call accordingly.
1882 Exit earlier if m_ops is empty for either cost structure.
1884 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1886 * config/aarch64/aarch64.c (aarch64_vector_costs::m_scalar_ops)
1887 (aarch64_vector_costs::m_sve_ops): Replace with...
1888 (aarch64_vector_costs::m_ops): ...this.
1889 (aarch64_vector_costs::analyze_loop_vinfo): Update accordingly.
1890 (aarch64_vector_costs::adjust_body_cost_sve): Likewise.
1891 (aarch64_vector_costs::aarch64_vector_costs): Likewise.
1892 Initialize m_vec_flags here rather than in add_stmt_cost.
1893 (aarch64_vector_costs::count_ops): Test for scalar reductions too.
1894 Allow vectype to be null.
1895 (aarch64_vector_costs::add_stmt_cost): Call count_ops for scalar
1896 code too. Don't require vectype to be nonnull.
1897 (aarch64_vector_costs::adjust_body_cost): Take the loop_vec_info
1898 and scalar costs as parameters. Use the scalar costs to determine
1899 the cycles per iteration of the scalar loop, then multiply it
1900 by the estimated VF.
1901 (aarch64_vector_costs::finish_cost): Update call accordingly.
1903 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1905 * config/aarch64/aarch64.c (aarch64_dr_type): New function.
1906 (aarch64_vector_costs::count_ops): Use it rather than the
1907 vectype to determine floatness.
1909 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1911 * config/aarch64/aarch64.c (aarch64_sve_in_loop_reduction_latency):
1912 Remove vectype parameter and get floatness from the type of the
1914 (arch64_in_loop_reduction_latency): Likewise.
1915 (aarch64_detect_vector_stmt_subtype): Update caller.
1916 (aarch64_vector_costs::count_ops): Likewise.
1918 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1920 * config/aarch64/aarch64.c (aarch64_sve_op_count): Fold into...
1921 (aarch64_vec_op_count): ...this. Add a constructor.
1922 (aarch64_vec_op_count::vec_flags): New function.
1923 (aarch64_vec_op_count::base_issue_info): Likewise.
1924 (aarch64_vec_op_count::simd_issue_info): Likewise.
1925 (aarch64_vec_op_count::sve_issue_info): Likewise.
1926 (aarch64_vec_op_count::m_issue_info): New member variable.
1927 (aarch64_vec_op_count::m_vec_flags): Likewise.
1928 (aarch64_vector_costs): Add a constructor.
1929 (aarch64_vector_costs::m_sve_ops): Change type to aarch64_vec_op_count.
1930 (aarch64_vector_costs::aarch64_vector_costs): New function.
1931 Initialize m_scalar_ops, m_advsimd_ops and m_sve_ops.
1932 (aarch64_vector_costs::count_ops): Remove vec_flags and
1933 issue_info parameters, using the new aarch64_vec_op_count
1935 (aarch64_vector_costs::add_stmt_cost): Update call accordingly.
1936 (aarch64_sve_op_count::dump): Fold into...
1937 (aarch64_vec_op_count::dump): ..here.
1939 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1941 * config/aarch64/aarch64.c: Include tree-dfa.h.
1942 (aarch64_check_consecutive_mems): New function that takes MEM_EXPR
1943 and MEM_OFFSET into account.
1944 (aarch64_swap_ldrstr_operands): Use it.
1945 (aarch64_operands_ok_for_ldpstp): Likewise. Check that the
1946 address of the second memory doesn't depend on the result of
1949 2021-11-12 Jan Hubicka <hubicka@ucw.cz>
1951 * ipa-pure-const.c (ipa_make_function_pure): Fix exit condition.
1953 2021-11-12 Jan Hubicka <jh@suse.cz>
1955 PR tree-optimization/103175
1956 * ipa-modref.c (modref_lattice::merge): Add sanity check.
1957 (callee_to_caller_flags): Make flags adjustment sane.
1958 (modref_eaf_analysis::analyze_ssa_name): Likewise.
1960 2021-11-12 Jan Hubicka <jh@suse.cz>
1963 * ipa-modref.c (analyze_function, modref_propagate_in_scc): Do
1964 not mark pure/const function if there are side-effects.
1966 2021-11-12 Chung-Lin Tang <cltang@codesourcery.com>
1968 * tree.h (OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P): New access macro for
1969 'implicit' bit, using 'base.deprecated_flag' field of tree_node.
1970 * tree-pretty-print.c (dump_omp_clause): Add support for printing
1971 implicit attribute in tree dumping.
1972 * gimplify.c (gimplify_adjust_omp_clauses_1):
1973 Set OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P to 1 if map clause is implicitly
1975 (gimplify_adjust_omp_clauses): Adjust place of adding implicitly created
1976 clauses, from simple append, to starting of list, after non-map clauses.
1977 * omp-low.c (lower_omp_target): Add GOMP_MAP_IMPLICIT bits into kind
1978 values passed to libgomp for implicit maps.
1980 2021-11-12 Jakub Jelinek <jakub@redhat.com>
1982 * omp-builtins.def (BUILT_IN_GOMP_TEAMS): Remove.
1983 (BUILT_IN_GOMP_TEAMS4): New.
1984 * builtin-types.def (BT_FN_VOID_UINT_UINT): Remove.
1985 (BT_FN_BOOL_UINT_UINT_UINT_BOOL): New.
1986 * omp-low.c (lower_omp_teams): Use GOMP_teams4 instead of
1987 GOMP_teams, pass to it also num_teams lower-bound expression
1988 or a dup of upper-bound if it is missing and a flag whether
1989 it is the first call or not.
1991 2021-11-12 Martin Liska <mliska@suse.cz>
1993 PR tree-optimization/102497
1994 * gimple-predicate-analysis.cc (add_pred): Remove unused
1997 2021-11-12 Richard Biener <rguenther@suse.de>
1999 PR tree-optimization/103204
2000 * tree-ssa-sccvn.c (valueize_refs_1): Re-valueize the
2001 top operand after folding in an address.
2003 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2005 * gimple-range-path.cc (path_range_query::path_range_query): New
2006 ctor without a ranger.
2007 (path_range_query::~path_range_query): Free ranger if necessary.
2008 (path_range_query::range_on_path_entry): Adjust m_ranger for pointer.
2009 (path_range_query::ssa_range_in_phi): Same.
2010 (path_range_query::compute_ranges_in_block): Same.
2011 (path_range_query::compute_imports): Same.
2012 (path_range_query::compute_ranges): Same.
2013 (path_range_query::range_of_stmt): Same.
2014 (path_range_query::compute_outgoing_relations): Same.
2015 * gimple-range-path.h (class path_range_query): New ctor.
2016 * tree-ssa-loop-ch.c (ch_base::copy_headers): Remove gimple_ranger
2017 as path_range_query allocates one.
2018 * tree-ssa-threadbackward.c (class back_threader): Remove m_ranger.
2019 (back_threader::~back_threader): Same.
2021 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2023 * tree-ssa-threadbackward.c
2024 (back_threader_profitability::profitable_path_p): Remove loop
2025 crossing restriction.
2027 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2029 * ipa-modref.c (analyze_function): Do pure/const discovery, return
2031 (pass_modref::execute): If pure/const is discovered fixup cfg.
2032 (ignore_edge): Do not ignore pure/const edges.
2033 (modref_propagate_in_scc): Do pure/const discovery, return true if
2034 cdtor was promoted pure/const.
2035 (pass_ipa_modref::execute): If needed remove unreachable functions.
2036 * ipa-pure-const.c (warn_function_noreturn): Fix whitespace.
2037 (warn_function_cold): Likewise.
2038 (skip_function_for_local_pure_const): Move earlier.
2039 (ipa_make_function_const): Break out from ...
2040 (ipa_make_function_pure): Break out from ...
2041 (propagate_pure_const): ... here.
2042 (pass_local_pure_const::execute): Use it.
2043 * ipa-utils.h (ipa_make_function_const): Declare.
2044 (ipa_make_function_pure): Declare.
2045 * passes.def: Move early modref after pure-const.
2047 2021-11-11 David Malcolm <dmalcolm@redhat.com>
2050 * diagnostic-show-locus.c (def_policy): Use def_tabstop.
2052 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2054 * config/aarch64/aarch64-builtins.c (TYPES_COMBINE): Delete.
2055 (TYPES_COMBINEP): Delete.
2056 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2057 qualified builtins for vcombine_* intrinsics.
2058 * config/aarch64/arm_neon.h (vcombine_s8): Remove unnecessary
2060 (vcombine_s16): Likewise.
2061 (vcombine_s32): Likewise.
2062 (vcombine_f32): Likewise.
2063 (vcombine_u8): Use type-qualified builtin and remove casts.
2064 (vcombine_u16): Likewise.
2065 (vcombine_u32): Likewise.
2066 (vcombine_u64): Likewise.
2067 (vcombine_p8): Likewise.
2068 (vcombine_p16): Likewise.
2069 (vcombine_p64): Likewise.
2070 (vcombine_bf16): Remove unnecessary cast.
2071 * config/aarch64/iterators.md (VD_I): New mode iterator.
2072 (VDC_P): New mode iterator.
2074 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2076 * config/aarch64/aarch64-builtins.c (TYPES_LOAD1_U): Define.
2077 (TYPES_LOAD1_P): Define.
2078 (TYPES_STORE1_U): Define.
2079 (TYPES_STORE1P): Rename to...
2080 (TYPES_STORE1_P): This.
2081 (get_mem_type_for_load_store): Add unsigned and poly types.
2082 (aarch64_general_gimple_fold_builtin): Add unsigned and poly
2083 type-qualified builtin declarations.
2084 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2085 qualified builtins for LD1/ST1.
2086 * config/aarch64/arm_neon.h (vld1_p8): Use type-qualified
2087 builtin and remove cast.
2088 (vld1_p16): Likewise.
2089 (vld1_u8): Likewise.
2090 (vld1_u16): Likewise.
2091 (vld1_u32): Likewise.
2092 (vld1q_p8): Likewise.
2093 (vld1q_p16): Likewise.
2094 (vld1q_p64): Likewise.
2095 (vld1q_u8): Likewise.
2096 (vld1q_u16): Likewise.
2097 (vld1q_u32): Likewise.
2098 (vld1q_u64): Likewise.
2099 (vst1_p8): Likewise.
2100 (vst1_p16): Likewise.
2101 (vst1_u8): Likewise.
2102 (vst1_u16): Likewise.
2103 (vst1_u32): Likewise.
2104 (vst1q_p8): Likewise.
2105 (vst1q_p16): Likewise.
2106 (vst1q_p64): Likewise.
2107 (vst1q_u8): Likewise.
2108 (vst1q_u16): Likewise.
2109 (vst1q_u32): Likewise.
2110 (vst1q_u64): Likewise.
2111 * config/aarch64/iterators.md (VALLP_NO_DI): New iterator.
2113 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2115 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2116 builtins for vector reduction.
2117 * config/aarch64/arm_neon.h (vaddv_u8): Use type-qualified
2118 builtin and remove casts.
2119 (vaddv_u16): Likewise.
2120 (vaddv_u32): Likewise.
2121 (vaddvq_u8): Likewise.
2122 (vaddvq_u16): Likewise.
2123 (vaddvq_u32): Likewise.
2124 (vaddvq_u64): Likewise.
2126 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2128 * config/aarch64/aarch64-simd-builtins.def:
2129 * config/aarch64/arm_neon.h (vpaddq_u8): Use type-qualified
2130 builtin and remove casts.
2131 (vpaddq_u16): Likewise.
2132 (vpaddq_u32): Likewise.
2133 (vpaddq_u64): Likewise.
2134 (vpadd_u8): Likewise.
2135 (vpadd_u16): Likewise.
2136 (vpadd_u32): Likewise.
2137 (vpaddd_u64): Likewise.
2139 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2141 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2142 builtins for [r]subhn[2].
2143 * config/aarch64/arm_neon.h (vsubhn_s16): Remove unnecessary
2145 (vsubhn_s32): Likewise.
2146 (vsubhn_s64): Likewise.
2147 (vsubhn_u16): Use type-qualified builtin and remove casts.
2148 (vsubhn_u32): Likewise.
2149 (vsubhn_u64): Likewise.
2150 (vrsubhn_s16): Remove unnecessary cast.
2151 (vrsubhn_s32): Likewise.
2152 (vrsubhn_s64): Likewise.
2153 (vrsubhn_u16): Use type-qualified builtin and remove casts.
2154 (vrsubhn_u32): Likewise.
2155 (vrsubhn_u64): Likewise.
2156 (vrsubhn_high_s16): Remove unnecessary cast.
2157 (vrsubhn_high_s32): Likewise.
2158 (vrsubhn_high_s64): Likewise.
2159 (vrsubhn_high_u16): Use type-qualified builtin and remove
2161 (vrsubhn_high_u32): Likewise.
2162 (vrsubhn_high_u64): Likewise.
2163 (vsubhn_high_s16): Remove unnecessary cast.
2164 (vsubhn_high_s32): Likewise.
2165 (vsubhn_high_s64): Likewise.
2166 (vsubhn_high_u16): Use type-qualified builtin and remove
2168 (vsubhn_high_u32): Likewise.
2169 (vsubhn_high_u64): Likewise.
2171 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2173 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2174 builtins for [r]addhn[2].
2175 * config/aarch64/arm_neon.h (vaddhn_s16): Remove unnecessary
2177 (vaddhn_s32): Likewise.
2178 (vaddhn_s64): Likewise.
2179 (vaddhn_u16): Use type-qualified builtin and remove casts.
2180 (vaddhn_u32): Likewise.
2181 (vaddhn_u64): Likewise.
2182 (vraddhn_s16): Remove unnecessary cast.
2183 (vraddhn_s32): Likewise.
2184 (vraddhn_s64): Likewise.
2185 (vraddhn_u16): Use type-qualified builtin and remove casts.
2186 (vraddhn_u32): Likewise.
2187 (vraddhn_u64): Likewise.
2188 (vaddhn_high_s16): Remove unnecessary cast.
2189 (vaddhn_high_s32): Likewise.
2190 (vaddhn_high_s64): Likewise.
2191 (vaddhn_high_u16): Use type-qualified builtin and remove
2193 (vaddhn_high_u32): Likewise.
2194 (vaddhn_high_u64): Likewise.
2195 (vraddhn_high_s16): Remove unnecessary cast.
2196 (vraddhn_high_s32): Likewise.
2197 (vraddhn_high_s64): Likewise.
2198 (vraddhn_high_u16): Use type-qualified builtin and remove
2200 (vraddhn_high_u32): Likewise.
2201 (vraddhn_high_u64): Likewise.
2203 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2205 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2206 qualifiers in generator macros for uhsub builtins.
2207 * config/aarch64/arm_neon.h (vhsub_s8): Remove unnecessary
2209 (vhsub_s16): Likewise.
2210 (vhsub_s32): Likewise.
2211 (vhsub_u8): Use type-qualified builtin and remove casts.
2212 (vhsub_u16): Likewise.
2213 (vhsub_u32): Likewise.
2214 (vhsubq_s8): Remove unnecessary cast.
2215 (vhsubq_s16): Likewise.
2216 (vhsubq_s32): Likewise.
2217 (vhsubq_u8): Use type-qualified builtin and remove casts.
2218 (vhsubq_u16): Likewise.
2219 (vhsubq_u32): Likewise.
2221 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2223 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2224 qualifiers in generator macros for u[r]hadd builtins.
2225 * config/aarch64/arm_neon.h (vhadd_s8): Remove unnecessary
2227 (vhadd_s16): Likewise.
2228 (vhadd_s32): Likewise.
2229 (vhadd_u8): Use type-qualified builtin and remove casts.
2230 (vhadd_u16): Likewise.
2231 (vhadd_u32): Likewise.
2232 (vhaddq_s8): Remove unnecessary cast.
2233 (vhaddq_s16): Likewise.
2234 (vhaddq_s32): Likewise.
2235 (vhaddq_u8): Use type-qualified builtin and remove casts.
2236 (vhaddq_u16): Likewise.
2237 (vhaddq_u32): Likewise.
2238 (vrhadd_s8): Remove unnecessary cast.
2239 (vrhadd_s16): Likewise.
2240 (vrhadd_s32): Likewise.
2241 (vrhadd_u8): Use type-qualified builtin and remove casts.
2242 (vrhadd_u16): Likewise.
2243 (vrhadd_u32): Likewise.
2244 (vrhaddq_s8): Remove unnecessary cast.
2245 (vrhaddq_s16): Likewise.
2246 (vrhaddq_s32): Likewise.
2247 (vrhaddq_u8): Use type-wualified builtin and remove casts.
2248 (vrhaddq_u16): Likewise.
2249 (vrhaddq_u32): Likewise.
2251 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2253 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2254 qualifiers in generator macros for usub[lw][2] builtins.
2255 * config/aarch64/arm_neon.h (vsubl_s8): Remove unnecessary
2257 (vsubl_s16): Likewise.
2258 (vsubl_s32): Likewise.
2259 (vsubl_u8): Use type-qualified builtin and remove casts.
2260 (vsubl_u16): Likewise.
2261 (vsubl_u32): Likewise.
2262 (vsubl_high_s8): Remove unnecessary cast.
2263 (vsubl_high_s16): Likewise.
2264 (vsubl_high_s32): Likewise.
2265 (vsubl_high_u8): Use type-qualified builtin and remove casts.
2266 (vsubl_high_u16): Likewise.
2267 (vsubl_high_u32): Likewise.
2268 (vsubw_s8): Remove unnecessary casts.
2269 (vsubw_s16): Likewise.
2270 (vsubw_s32): Likewise.
2271 (vsubw_u8): Use type-qualified builtin and remove casts.
2272 (vsubw_u16): Likewise.
2273 (vsubw_u32): Likewise.
2274 (vsubw_high_s8): Remove unnecessary cast.
2275 (vsubw_high_s16): Likewise.
2276 (vsubw_high_s32): Likewise.
2277 (vsubw_high_u8): Use type-qualified builtin and remove casts.
2278 (vsubw_high_u16): Likewise.
2279 (vsubw_high_u32): Likewise.
2281 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2283 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2284 qualifiers in generator macros for uadd[lw][2] builtins.
2285 * config/aarch64/arm_neon.h (vaddl_s8): Remove unnecessary
2287 (vaddl_s16): Likewise.
2288 (vaddl_s32): Likewise.
2289 (vaddl_u8): Use type-qualified builtin and remove casts.
2290 (vaddl_u16): Likewise.
2291 (vaddl_u32): Likewise.
2292 (vaddl_high_s8): Remove unnecessary cast.
2293 (vaddl_high_s16): Likewise.
2294 (vaddl_high_s32): Likewise.
2295 (vaddl_high_u8): Use type-qualified builtin and remove casts.
2296 (vaddl_high_u16): Likewise.
2297 (vaddl_high_u32): Likewise.
2298 (vaddw_s8): Remove unnecessary cast.
2299 (vaddw_s16): Likewise.
2300 (vaddw_s32): Likewise.
2301 (vaddw_u8): Use type-qualified builtin and remove casts.
2302 (vaddw_u16): Likewise.
2303 (vaddw_u32): Likewise.
2304 (vaddw_high_s8): Remove unnecessary cast.
2305 (vaddw_high_s16): Likewise.
2306 (vaddw_high_s32): Likewise.
2307 (vaddw_high_u8): Use type-qualified builtin and remove casts.
2308 (vaddw_high_u16): Likewise.
2309 (vaddw_high_u32): Likewise.
2311 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2313 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2314 qualified builtins for [R]SHRN[2].
2315 * config/aarch64/arm_neon.h (vshrn_n_u16): Use type-qualified
2316 builtin and remove casts.
2317 (vshrn_n_u32): Likewise.
2318 (vshrn_n_u64): Likewise.
2319 (vrshrn_high_n_u16): Likewise.
2320 (vrshrn_high_n_u32): Likewise.
2321 (vrshrn_high_n_u64): Likewise.
2322 (vrshrn_n_u16): Likewise.
2323 (vrshrn_n_u32): Likewise.
2324 (vrshrn_n_u64): Likewise.
2325 (vshrn_high_n_u16): Likewise.
2326 (vshrn_high_n_u32): Likewise.
2327 (vshrn_high_n_u64): Likewise.
2329 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2331 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2332 type-qualified builtins for XTN[2].
2333 * config/aarch64/arm_neon.h (vmovn_high_u16): Use type-
2334 qualified builtin and remove casts.
2335 (vmovn_high_u32): Likewise.
2336 (vmovn_high_u64): Likewise.
2337 (vmovn_u16): Likewise.
2338 (vmovn_u32): Likewise.
2339 (vmovn_u64): Likewise.
2341 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2343 * config/aarch64/aarch64-simd-builtins.def: Use poly type
2344 qualifier in builtin generator macros.
2345 * config/aarch64/arm_neon.h (vmul_p8): Use type-qualified
2346 builtin and remove casts.
2347 (vmulq_p8): Likewise.
2348 (vmull_high_p8): Likewise.
2349 (vmull_p8): Likewise.
2351 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2353 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2354 qualified builtin generators for unsigned MLA/MLS intrinsics.
2355 * config/aarch64/arm_neon.h (vmla_n_u16): Use type-qualified
2357 (vmla_n_u32): Likewise.
2358 (vmla_u8): Likewise.
2359 (vmla_u16): Likewise.
2360 (vmla_u32): Likewise.
2361 (vmlaq_n_u16): Likewise.
2362 (vmlaq_n_u32): Likewise.
2363 (vmlaq_u8): Likewise.
2364 (vmlaq_u16): Likewise.
2365 (vmlaq_u32): Likewise.
2366 (vmls_n_u16): Likewise.
2367 (vmls_n_u32): Likewise.
2368 (vmls_u8): Likewise.
2369 (vmls_u16): Likewise.
2370 (vmls_u32): Likewise.
2371 (vmlsq_n_u16): Likewise.
2372 (vmlsq_n_u32): Likewise.
2373 (vmlsq_u8): Likewise.
2374 (vmlsq_u16): Likewise.
2375 (vmlsq_u32): Likewise.
2377 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2379 * ipa-modref.c (modref_summary::useful_p): Check also for side-effects
2380 with looping const/pure.
2381 (modref_summary_lto::useful_p): Likewise.
2382 (merge_call_side_effects): Merge side effects before early exit
2384 (process_fnspec): Also handle pure functions.
2385 (analyze_call): Do not early exit on looping pure const.
2386 (propagate_unknown_call): Also handle nontrivial SCC as side-effect.
2387 (modref_propagate_in_scc): Update.
2389 2021-11-11 Richard Biener <rguenther@suse.de>
2391 PR tree-optimization/103190
2392 * tree-ssa-reassoc.c (insert_stmt_after): Only assert on asm goto.
2394 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2396 * gimple-range-path.cc (path_range_query::add_copies_to_imports):
2398 (path_range_query::compute_imports): ...this. Adapt it so it can
2399 be passed the imports bitmap instead of working on m_imports.
2400 (path_range_query::compute_ranges): Call compute_imports in all
2401 cases unless an imports bitmap is passed.
2402 * gimple-range-path.h (path_range_query::compute_imports): New.
2403 (path_range_query::add_copies_to_imports): Remove.
2404 * tree-ssa-threadbackward.c (back_threader::resolve_def): Remove.
2405 (back_threader::find_paths_to_names): Inline resolve_def.
2406 (back_threader::find_paths): Call compute_imports.
2407 (back_threader::resolve_phi): Adjust comment.
2409 2021-11-11 Richard Biener <rguenther@suse.de>
2411 PR tree-optimization/103188
2412 * tree-ssa-loop-ch.c (should_duplicate_loop_header_p):
2413 Remove query parameter, split out check for size
2415 (ch_base::m_ranger, cb_base::m_query): Remove.
2416 (ch_base::copy_headers): Split processing loop into
2417 analysis around which we allocate and use ranger and
2418 transform where we do not.
2419 (pass_ch::execute): Do not allocate/free ranger here.
2420 (pass_ch_vect::execute): Likewise.
2422 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2424 * ipa-pure-const.c (propagate_pure_const): Self recursion is
2427 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2429 * cgraph.c (set_noreturn_flag_1): New function.
2430 (cgraph_node::set_noreturn_flag): New member function
2431 * cgraph.h (cgraph_node::set_noreturn_flags): Declare.
2432 * ipa-pure-const.c (pass_local_pure_const::execute): Use it.
2434 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2435 Richard Biener <rguenther@suse.de>
2437 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): Resolve
2438 statically to the edge remaining in the loop.
2440 2021-11-11 Richard Biener <rguenther@suse.de>
2442 PR middle-end/103181
2443 * tree-eh.c (operation_could_trap_helper_p): Properly
2444 check vector constants for a zero element for integer
2445 division. Separate floating point and integer division code.
2447 2021-11-11 Jakub Jelinek <jakub@redhat.com>
2450 * dwarf2out.c (field_byte_offset): Do the PCC_BITFIELD_TYPE_MATTERS
2451 handling only for DECL_BIT_FIELD_TYPE decls.
2453 2021-11-11 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
2456 * config/aarch64/aarch64.c (aarch64_process_target_attr): Check if
2457 token is arch extension without leading '+' and emit appropriate
2458 diagnostic for the same.
2460 2021-11-11 Jakub Jelinek <jakub@redhat.com>
2462 * tree.h (OMP_CLAUSE_NUM_TEAMS_EXPR): Rename to ...
2463 (OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR): ... this.
2464 (OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR): Define.
2465 * tree.c (omp_clause_num_ops): Increase num ops for
2466 OMP_CLAUSE_NUM_TEAMS to 2.
2467 * tree-pretty-print.c (dump_omp_clause): Print optional lower bound
2468 for OMP_CLAUSE_NUM_TEAMS.
2469 * gimplify.c (gimplify_scan_omp_clauses): Gimplify
2470 OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR if non-NULL.
2471 (optimize_target_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR instead
2472 of OMP_CLAUSE_NUM_TEAMS_EXPR. Handle OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR.
2473 * omp-low.c (lower_omp_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR
2474 instead of OMP_CLAUSE_NUM_TEAMS_EXPR.
2475 * omp-expand.c (expand_teams_call, get_target_arguments): Likewise.
2477 2021-11-11 Richard Biener <rguenther@suse.de>
2479 * cfganal.c (find_pdom): Remove.
2480 (control_dependences::find_control_dependence): Remove
2481 special-casing of entry block, call get_immediate_dominator
2483 * gimple-predicate-analysis.cc (find_pdom): Remove.
2484 (find_dom): Likewise.
2485 (find_control_equiv_block): Call get_immediate_dominator
2487 (compute_control_dep_chain): Likewise.
2488 (predicate::init_from_phi_def): Likewise.
2490 2021-11-11 Richard Biener <rguenther@suse.de>
2492 * cfganal.h (control_dependences::control_dependence_map):
2494 (control_dependences::m_bitmaps): New.
2495 * cfganal.c (control_dependences::set_control_dependence_map_bit):
2497 (control_dependences::clear_control_dependence_bitmap):
2499 (control_dependences::find_control_dependence): Do not
2500 find_edge for the abnormal edge test.
2501 (control_dependences::control_dependences): Instead do not
2502 add abnormal edges to the edge list. Adjust.
2503 (control_dependences::~control_dependences): Likewise.
2504 (control_dependences::get_edges_dependent_on): Likewise.
2505 * function-tests.c: Include bitmap.h.
2507 2021-11-11 Kewen Lin <linkw@linux.ibm.com>
2509 * doc/invoke.texi: Change references to "future cpu" to "power10",
2510 "-mcpu=future" to "-mcpu=power10". Adjust words for float128.
2512 2021-11-11 Cui,Lili <lili.cui@intel.com>
2514 * config/i386/i386-options.c (m_CORE_AVX2): Remove Alderlake
2516 (processor_cost_table): Use alderlake_cost for Alderlake.
2517 * config/i386/i386.c (ix86_sched_init_global): Handle Alderlake.
2518 * config/i386/x86-tune-costs.h (struct processor_costs): Add alderlake
2520 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Alderlake
2522 (ix86_adjust_cost): Handle Alderlake.
2523 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE): Enable for Alderlake.
2524 (X86_TUNE_PARTIAL_REG_DEPENDENCY): Likewise.
2525 (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY): Likewise.
2526 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): Likewise.
2527 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
2528 (X86_TUNE_MEMORY_MISMATCH_STALL): Likewise.
2529 (X86_TUNE_USE_LEAVE): Likewise.
2530 (X86_TUNE_PUSH_MEMORY): Likewise.
2531 (X86_TUNE_USE_INCDEC): Likewise.
2532 (X86_TUNE_INTEGER_DFMODE_MOVES): Likewise.
2533 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
2534 (X86_TUNE_USE_SAHF): Likewise.
2535 (X86_TUNE_USE_BT): Likewise.
2536 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
2537 (X86_TUNE_ONE_IF_CONV_INSN): Likewise.
2538 (X86_TUNE_AVOID_MFENCE): Likewise.
2539 (X86_TUNE_USE_SIMODE_FIOP): Likewise.
2540 (X86_TUNE_EXT_80387_CONSTANTS): Likewise.
2541 (X86_TUNE_SSE_UNALIGNED_LOAD_OPTIMAL): Likewise.
2542 (X86_TUNE_SSE_UNALIGNED_STORE_OPTIMAL): Likewise.
2543 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
2544 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
2545 (X86_TUNE_AVOID_4BYTE_PREFIXES): Likewise.
2546 (X86_TUNE_USE_GATHER): Disable for Alderlake.
2547 (X86_TUNE_AVX256_MOVE_BY_PIECES): Likewise.
2548 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
2550 2021-11-11 liuhongt <hongtao.liu@intel.com>
2553 * config/i386/sse.md (V_128_256): Extend to V8HF/V16HF.
2554 (avxsizesuffix): Ditto.
2556 2021-11-11 Kito Cheng <kito.cheng@sifive.com>
2558 * common/config/riscv/riscv-common.c (riscv_subset_list::to_string): Fix
2559 wrong marco checking.
2561 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
2563 PR tree-optimization/102906
2564 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): New.
2565 (should_duplicate_loop_header_p): Call entry_loop_condition_is_static.
2566 (class ch_base): Add m_ranger and m_query.
2567 (ch_base::copy_headers): Pass m_query to
2568 entry_loop_condition_is_static.
2569 (pass_ch::execute): Allocate and deallocate m_ranger and
2571 (pass_ch_vect::execute): Same.
2573 2021-11-10 Andrew Pinski <apinski@marvell.com>
2576 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>):
2577 Use vwcore iterator for the r constraint output string.
2579 2021-11-10 qing zhao <qing.zhao@oracle.com>
2581 * internal-fn.c (expand_DEFERRED_INIT): Apply pattern initialization
2582 only when have_insn_for return true for the mode. Fix a memory leak.
2584 2021-11-10 Christophe Lyon <christophe.lyon@foss.st.com>
2586 * config/arm/arm.c (cortexa9_extra_costs, cortexa8_extra_costs,
2587 cortexa5_extra_costs, cortexa7_extra_costs,
2588 cortexa12_extra_costs, cortexa15_extra_costs, v7m_extra_costs):
2589 Initialize movi, dup and extract costing fields.
2591 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
2593 * gimple-range-path.cc (path_range_query::path_range_query): Do
2595 (path_range_query::dump): Change m_path uses to non-pointer.
2596 (path_range_query::defined_outside_path): Same.
2597 (path_range_query::set_path): Same.
2598 (path_range_query::add_copies_to_imports): Same.
2599 (path_range_query::range_of_stmt): Same.
2600 (path_range_query::compute_outgoing_relations): Same.
2601 (path_range_query::compute_ranges): Imports are now optional.
2602 Implement overload that takes an edge.
2603 * gimple-range-path.h (class path_range_query): Make imports
2604 optional for compute_ranges. Add compute_ranges(edge) overload.
2605 Make m_path an auto_vec instead of a pointer and adjust
2608 2021-11-10 Tamar Christina <tamar.christina@arm.com>
2610 * tree-vectorizer.h (struct scalar_cond_masked_key): Add inverted_p.
2611 (default_hash_traits<scalar_conf_masked_key>): Likewise.
2612 * tree-vect-stmts.c (vectorizable_condition): Check if inverse of mask
2614 * tree-vectorizer.c (scalar_cond_masked_key::get_cond_ops_from_tree):
2615 Register mask inverses.
2617 2021-11-10 Tamar Christina <tamar.christina@arm.com>
2619 * tree-vectorizer.c (vectorize_loops): Do local CSE through RPVN upon
2620 successful vectorization.
2622 2021-11-10 Andrew MacLeod <amacleod@redhat.com>
2624 * gimple-range-cache.cc (sbr_vector::grow): New.
2625 (sbr_vector::set_bb_range): Call grow.
2626 (sbr_vector::get_bb_range): Same.
2627 (sbr_vector::bb_range_p): Remove assert.
2629 2021-11-10 Tamar Christina <tamar.christina@arm.com>
2631 * config/aarch64/aarch64-simd.md (*aarch64_topbits_shuffle<mode>_le
2632 ,*aarch64_topbits_shuffle<mode>_be): Remove.
2634 2021-11-10 Jan Hubicka <jh@suse.cz>
2636 * ipa-modref.c: Include tree-eh.h
2637 (modref_summary::modref_summary): Initialize side_effects.
2638 (struct modref_summary_lto): New bool field side_effects.
2639 (modref_summary_lto::modref_summary_lto): Initialize side_effects.
2640 (modref_summary::dump): Dump side_effects.
2641 (modref_summary_lto::dump): Dump side_effects.
2642 (merge_call_side_effects): Merge side effects.
2643 (process_fnspec): Calls to non-const/pure or looping
2644 function is a side effect.
2645 (analyze_call): Self-recursion is a side-effect; handle
2647 (analyze_load): Watch for volatile and throwing memory.
2648 (analyze_store): Likewise.
2649 (analyze_stmt): Watch for volatitle asm.
2650 (analyze_function): Handle side_effects.
2651 (modref_summaries::duplicate): Duplicate side_effects.
2652 (modref_summaries_lto::duplicate): Likewise.
2653 (modref_write): Stream side_effects.
2654 (read_section): Likewise.
2655 (update_signature): Update.
2656 (propagate_unknown_call): Handle side_effects.
2657 (modref_propagate_in_scc): Likewise.
2658 * ipa-modref.h (struct modref_summary): Add side_effects.
2659 * ipa-pure-const.c (special_builtin_state): Rename to ...
2660 (builtin_safe_for_const_function_p): ... this one.
2661 (check_call): Update.
2662 (finite_function_p): Break out from ...
2663 (propagate_pure_const): ... here
2664 * ipa-utils.h (finite_function): Declare.
2666 2021-11-10 Lucas A. M. Magalhães <lamm@linux.ibm.com>
2668 * config.gcc (powerpc*-*-*): Remove -rpath from
2669 --with-advance-toolchain.
2671 2021-11-10 Marek Polacek <polacek@redhat.com>
2674 * attribs.c (struct scoped_attributes): Add a bool member.
2675 (lookup_scoped_attribute_spec): Forward declare.
2676 (register_scoped_attributes): New bool parameter, defaulted to
2678 (handle_ignored_attributes_option): New function.
2679 (free_attr_data): New function.
2680 (init_attributes): Call handle_ignored_attributes_option.
2681 (attr_namespace_ignored_p): New function.
2682 (decl_attributes): Check attr_namespace_ignored_p before
2684 * attribs.h (free_attr_data): Declare.
2685 (register_scoped_attributes): Adjust declaration.
2686 (handle_ignored_attributes_option): Declare.
2687 (canonicalize_attr_name): New function template.
2688 (canonicalize_attr_name): Use it.
2689 * common.opt (Wattributes=): New option with a variable.
2690 * doc/extend.texi: Document #pragma GCC diagnostic ignored_attributes.
2691 * doc/invoke.texi: Document -Wno-attributes=.
2692 * opts.c (common_handle_option) <case OPT_Wattributes_>: Handle.
2693 * plugin.h (register_scoped_attributes): Adjust declaration.
2694 * toplev.c (compile_file): Call free_attr_data.
2696 2021-11-10 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2698 * config/arm/arm-cpus.in (cortex-a710): New CPU.
2699 * config/arm/arm-tables.opt: Regenerate.
2700 * config/arm/arm-tune.md: Regenerate.
2701 * doc/invoke.texi: Update docs.
2703 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
2705 * config/aarch64/aarch64-builtins.c
2706 (aarch64_general_gimple_fold_builtin): Mark argument as unused.
2708 2021-11-10 Martin Liska <mliska@suse.cz>
2710 * lto-wrapper.c (merge_and_complain): Make the first argument
2713 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2715 * config/aarch64/iterators.md (optab): Use fmax_nan instead of
2716 smax_nan and fmin_nan instead of smin_nan.
2717 (maxmin_uns): Rename to...
2718 (fmaxmin): ...this and make the same changes. Remove entries
2719 unrelated to fmax* and fmin*.
2720 * config/aarch64/aarch64.md (<maxmin_uns><mode>3): Rename to...
2721 (<fmaxmin><mode>3): ...this.
2722 * config/aarch64/aarch64-simd.md (aarch64_<maxmin_uns>p<mode>):
2724 (aarch64_<optab>p<mode>): ...this.
2725 (<maxmin_uns><mode>3): Rename to...
2726 (<fmaxmin><mode>3): ...this.
2727 (reduc_<maxmin_uns>_scal_<mode>): Rename to...
2728 (reduc_<optab>_scal_<mode>): ...this and update gen* call.
2729 (aarch64_reduc_<maxmin_uns>_internal<mode>): Rename to...
2730 (aarch64_reduc_<optab>_internal<mode>): ...this.
2731 (aarch64_reduc_<maxmin_uns>_internalv2si): Rename to...
2732 (aarch64_reduc_<optab>_internalv2si): ...this.
2733 * config/aarch64/aarch64-sve.md (<maxmin_uns><mode>3): Rename to...
2734 (<fmaxmin><mode>3): ...this.
2735 * config/aarch64/aarch64-simd-builtins.def (smax_nan, smin_nan)
2737 (fmax_nan, fmin_nan): ...this.
2738 * config/aarch64/arm_neon.h (vmax_f32, vmax_f64, vmaxq_f32, vmaxq_f64)
2739 (vmin_f32, vmin_f64, vminq_f32, vminq_f64, vmax_f16, vmaxq_f16)
2740 (vmin_f16, vminq_f16): Update accordingly.
2742 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2744 * tree-vectorizer.h (vector_costs::finish_cost): Take the
2745 corresponding scalar costs as a parameter.
2746 (finish_cost): Likewise.
2747 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost)
2748 (vect_estimate_min_profitable_iters): Update accordingly.
2749 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Likewise.
2750 * tree-vectorizer.c (vector_costs::finish_cost): Likewise.
2751 * config/aarch64/aarch64.c (aarch64_vector_costs::finish_cost):
2753 * config/rs6000/rs6000.c (rs6000_cost_data::finish_cost): Likewise.
2755 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2757 * tree-vectorizer.h (_loop_vec_info::scalar_costs): New member
2759 (_loop_vec_info::single_scalar_iteration_cost): Delete.
2760 (LOOP_VINFO_SINGLE_SCALAR_ITERATION_COST): Delete.
2761 (vector_costs::total_cost): New function.
2762 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
2763 after above changes.
2764 (_loop_vec_info::~_loop_vec_info): Delete scalar_costs.
2765 (vect_compute_single_scalar_iteration_cost): Store the costs
2766 in loop_vinfo->scalar_costs.
2767 (vect_estimate_min_profitable_iters): Get the scalar cost from
2768 loop_vinfo->scalar_costs.
2770 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2772 * tree-vectorizer.h (vector_costs::better_main_loop_than_p)
2773 (vector_costs::better_epilogue_loop_than_p)
2774 (vector_costs::compare_inside_loop_cost)
2775 (vector_costs::compare_outside_loop_cost): Likewise.
2776 * tree-vectorizer.c (vector_costs::better_main_loop_than_p)
2777 (vector_costs::better_epilogue_loop_than_p)
2778 (vector_costs::compare_inside_loop_cost)
2779 (vector_costs::compare_outside_loop_cost): New functions,
2780 containing code moved from...
2781 * tree-vect-loop.c (vect_better_loop_vinfo_p): ...here.
2783 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2785 * tree-vectorizer.h (_loop_vec_info): Remove vec_outside_cost
2786 and vec_inside_cost.
2787 (vector_costs::outside_cost): New function.
2788 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
2790 (vect_estimate_min_profitable_iters): Likewise.
2791 (vect_better_loop_vinfo_p): Get the inside and outside costs
2792 from the loop_vec_infos' vector_costs.
2794 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2796 * tree-vectorizer.h (vec_info::target_cost_data): Replace with...
2797 (_loop_vec_info::vector_costs): ...this.
2798 (LOOP_VINFO_TARGET_COST_DATA): Delete.
2799 * tree-vectorizer.c (vec_info::vec_info): Remove target_cost_data
2801 (vec_info::~vec_info): Remove corresponding delete.
2802 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Initialize
2803 vector_costs to null.
2804 (_loop_vec_info::~_loop_vec_info): Delete vector_costs.
2805 (vect_analyze_loop_operations): Update after above changes.
2806 (vect_analyze_loop_2): Likewise.
2807 (vect_estimate_min_profitable_iters): Likewise.
2808 * tree-vect-slp.c (vect_slp_analyze_operations): Likewise.
2810 2021-11-10 Jan Hubicka <hubicka@ucw.cz>
2812 * tree-core.h (EAF_DIRECT): Remove.
2813 (EAF_NOCLOBBER): Remove.
2814 (EAF_UNUSED): Remove.
2815 (EAF_NOESCAPE): Remove.
2816 (EAF_NO_DIRECT_CLOBBER): New.
2817 (EAF_NO_INDIRECT_CLOBBER): New.
2818 (EAF_NODIRECTESCAPE): Remove.
2819 (EAF_NO_DIRECT_ESCAPE): New.
2820 (EAF_NO_INDIRECT_ESCAPE): New.
2821 (EAF_NOT_RETURNED): Remove.
2822 (EAF_NOT_RETURNED_INDIRECTLY): New.
2823 (EAF_NOREAD): Remove.
2824 (EAF_NO_DIRECT_READ): New.
2825 (EAF_NO_INDIRECT_READ): New.
2826 * gimple.c (gimple_call_arg_flags): Update for new flags.
2827 (gimple_call_retslot_flags): Update for new flags.
2828 * ipa-modref.c (dump_eaf_flags): Likewise.
2829 (remove_useless_eaf_flags): Likewise.
2830 (deref_flags): Likewise.
2831 (modref_lattice::init): Likewise.
2832 (modref_lattice::merge): Likewise.
2833 (modref_lattice::merge_direct_load): Likewise.
2834 (modref_lattice::merge_direct_store): Likewise.
2835 (modref_eaf_analysis::merge_call_lhs_flags): Likewise.
2836 (callee_to_caller_flags): Likewise.
2837 (modref_eaf_analysis::analyze_ssa_name): Likewise.
2838 (modref_eaf_analysis::propagate): Likewise.
2839 (modref_merge_call_site_flags): Likewise.
2840 * ipa-modref.h (interposable_eaf_flags): Likewise.
2841 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1) Likewise.
2842 * tree-ssa-structalias.c (handle_call_arg): Likewise.
2843 (handle_rhs_call): Likewise.
2844 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Likewise.
2846 2021-11-10 Tamar Christina <tamar.christina@arm.com>
2848 * tree-vect-slp-patterns.c (complex_mul_pattern::matches): Remove l1node.
2850 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
2852 * config/aarch64/aarch64-builtins.c
2853 (aarch64_general_gimple_fold_builtin): Change pointer alignment and
2856 2021-11-10 Jan Hubicka <jh@suse.cz>
2858 * ipa-modref-tree.h (modref_tree::remap_params): Fix off-by-one error.
2860 2021-11-10 H.J. Lu <hongjiu.lu@intel.com>
2861 Hongtao Liu <hongtao.liu@intel.com>
2863 PR middle-end/102566
2864 * match.pd (nop_atomic_bit_test_and_p): New match.
2865 * tree-ssa-ccp.c (convert_atomic_bit_not): New function.
2866 (gimple_nop_atomic_bit_test_and_p): New prototype.
2867 (optimize_atomic_bit_test_and): Transform equivalent, but slighly
2868 different cases to their canonical forms.
2870 2021-11-10 liuhongt <hongtao.liu@intel.com>
2872 PR tree-optimization/103126
2873 * tree-vect-loop.c (neutral_op_for_reduction): Remove static.
2874 * tree-vectorizer.h (neutral_op_for_reduction): Declare.
2875 * tree-if-conv.c : Include tree-vectorizer.h.
2876 (is_cond_scalar_reduction): Handle
2877 BIT_XOR_EXPR/BIT_IOR_EXPR/BIT_AND_EXPR.
2878 (convert_scalar_cond_reduction): Ditto.
2880 2021-11-10 konglin1 <lingling.kong@intel.com>
2882 * config/i386/sse.md (cmul<conj_op><mode>3): add new define_expand.
2883 (cmla<conj_op><mode>4): Likewise
2885 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
2887 * tree-ssa-threadedge.c: Do not include
2888 gimple-ssa-evrp-analyze.h.
2889 * value-pointer-equiv.cc: Same.
2891 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
2893 * tree-ssa-threadbackward.c
2894 (back_threader::maybe_register_path_dump): Abstract path dumping...
2895 (dump_path): ...here.
2896 (back_threader::resolve_phi): Call dump_path.
2899 2021-11-10 konglin1 <lingling.kong@intel.com>
2901 * config/i386/sse.md (fma_<complexpairopname>_<mode>_pair):
2902 Add new define_insn.
2903 (fma_<mode>_fmaddc_bcst): Add new define_insn_and_split.
2904 (fma_<mode>_fcmaddc_bcst): Likewise
2906 2021-11-10 liuhongt <hongtao.liu@intel.com>
2909 * match.pd: Simplify (trunc)fmax/fmin((extend)a, (extend)b) to
2912 2021-11-10 Andrew Pinski <apinski@marvell.com>
2915 * config/aarch64/aarch64.c (aarch64_evpc_ins): Don't use target
2916 as an input, use original one.
2918 2021-11-10 Sandra Loosemore <sandra@codesourcery.com>
2920 * config/nios2/nios2.c (nios2_can_inline_p): New.
2921 (TARGET_CAN_INLINE_P): Define.
2923 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
2925 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin):
2926 Disable gimple fold for RS6000_BIF_{XVMINDP,XVMINSP,VMINFP} and
2927 RS6000_BIF_{XVMAXDP,XVMAXSP,VMAXFP} when fast-math is not set.
2928 (lxvrse_expand_builtin): Modify the expansion for sign extension.
2929 All extensions are done within VSX registers.
2931 2021-11-09 Andrew Pinski <apinski@marvell.com>
2933 PR tree-optimization/10352
2934 * match.pd: Remove check of TYPE_PRECISION for
2937 2021-11-09 Andrew MacLeod <amacleod@redhat.com>
2939 * function.c (allocate_struct_function): Don't set x_range_query.
2940 * function.h (get_range_query): Move to value-query.h.
2941 * gimple-range.cc (enable_ranger): Check that query is currently NULL.
2942 (disable_ranger): Clear function current query field.
2943 * value-query.cc (get_global_range_query): Relocate to:
2944 * value-query.h (get_global_range_query): Here and inline.
2945 (get_range_query): Relocate here from function.h.
2947 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
2949 * tree-ssa-threadbackward.c
2950 (back_threader::maybe_register_path_dump): New.
2951 (back_threader::maybe_register_path): Call maybe_register_path_dump.
2953 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
2955 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
2956 Return NULL when unprofitable.
2958 2021-11-09 Martin Jambor <mjambor@suse.cz>
2960 * tree.h (build_debug_expr_decl): Declare.
2961 * tree.c (build_debug_expr_decl): New function.
2962 * cfgexpand.c (avoid_deep_ter_for_debug): Use build_debug_expr_decl
2963 instead of building a DEBUG_EXPR_DECL.
2964 * ipa-param-manipulation.c
2965 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
2966 * omp-simd-clone.c (ipa_simd_modify_stmt_ops): Likewise.
2967 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Likewise.
2968 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
2969 * tree-ssa-reassoc.c (make_new_ssa_for_def): Likewise.
2971 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
2973 * config/rs6000/rs6000-call.c (rs6000_type_string): New function.
2974 (def_builtin): Change debug formatting for easier parsing and
2975 include more information.
2976 (rs6000_init_builtins): Add dump of autogenerated builtins.
2977 (altivec_init_builtins): Dump __builtin_altivec_mask_for_load for
2980 2021-11-09 Marek Polacek <polacek@redhat.com>
2982 * ipa-modref.c (analyze_parms): Fix a typo.
2984 2021-11-09 Kito Cheng <kito.cheng@sifive.com>
2987 * common/config/riscv/riscv-common.c (multi_letter_subset_rank): Remove
2988 assertion for Z*-ext.
2990 2021-11-09 Jan Hubicka <hubicka@ucw.cz>
2992 * ipa-modref.c (analyze_parms): Add past_flags, past_retslot_flags
2993 and past_static_chain; merge past summary with current one.
2994 (analyze_function): Update.
2996 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
2998 * doc/invoke.texi (Invoking GCC): Document --param=threader-debug.
3000 2021-11-09 Martin Liska <mliska@suse.cz>
3002 * print-rtl.c (rtx_writer::rtx_writer): Make the compilation
3003 conditional based on
3004 * print-rtl.h (class rtx_writer): Likewise.
3006 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
3008 * input.h (location_hash): Use 'BUILTINS_LOCATION' as spare value
3009 for 'Deleted'. Turn into a '#define'.
3011 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3013 * gimple-range-path.cc (path_range_query::dump): Clean up.
3014 (path_range_query::compute_ranges): Same.
3015 * value-relation.cc (path_oracle::dump): Same.
3017 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3019 * dumpfile.c (dump_options): Remove TDF_THREADING entry.
3020 * dumpfile.h (enum dump_flag): Remove TDF_THREADING and adjust
3022 * flag-types.h (enum threader_debug): New.
3023 * gimple-range-path.cc (DEBUG_SOLVER): Use param_threader_debug.
3024 * params.opt: Add entry for --param=threader-debug=.
3026 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
3028 PR middle-end/101204
3030 * diagnostic-spec.h (typedef xint_hash_t): Turn into...
3031 (struct xint_hash_t): ... this.
3032 * doc/gty.texi: Update.
3034 2021-11-09 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
3036 * config/arm/arm-cpus.in (armv9): New define.
3037 (ARMv9a): New group.
3038 (armv9-a): New arch definition.
3039 * config/arm/arm-tables.opt: Regenerate.
3040 * config/arm/arm.h (BASE_ARCH_9A): New arch enum value.
3041 * config/arm/t-aprofile: Added armv9-a and armv9+simd.
3042 * config/arm/t-arm-elf: Added arm9-a, v9_fps and all_v9_archs
3043 to MULTILIB_MATCHES.
3044 * config/arm/t-multilib: Added v9_a_nosimd_variants and
3045 v9_a_simd_variants to MULTILIB_MATCHES.
3046 * doc/invoke.texi: Update docs.
3048 2021-11-09 Martin Jambor <mjambor@suse.cz>
3051 * ipa-param-manipulation.c (replace_with_mapped_expr): Early
3052 return with error_mark_mode when part of expression is mapped to
3054 (ipa_param_body_adjustments::remap_with_debug_expressions): Set
3055 mapped value to NULL if walk_tree returns error_mark_mode.
3057 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3059 * tree-ssa-strlen.c (compare_nonzero_chars): Pass statement
3061 (get_addr_stridx): Same.
3063 (get_range_strlen_dynamic): Same.
3064 (handle_builtin_strlen): Same.
3065 (handle_builtin_strchr): Same.
3066 (handle_builtin_strcpy): Same.
3067 (maybe_diag_stxncpy_trunc): Same.
3068 (handle_builtin_stxncpy_strncat): Same.
3069 (handle_builtin_memcpy): Same.
3070 (handle_builtin_strcat): Same.
3071 (handle_alloc_call): Same.
3072 (handle_builtin_memset): Same.
3073 (handle_builtin_string_cmp): Same.
3074 (handle_pointer_plus): Same.
3075 (count_nonzero_bytes_addr): Same.
3076 (count_nonzero_bytes): Same.
3077 (handle_store): Same.
3078 (fold_strstr_to_strncmp): Same.
3079 (handle_integral_assign): Same.
3080 (check_and_optimize_stmt): Same.
3081 (class strlen_dom_walker): Replace evrp with ranger.
3082 (strlen_dom_walker::before_dom_children): Remove evrp.
3083 (strlen_dom_walker::after_dom_children): Remove evrp.
3084 * gimple-ssa-warn-access.cc (maybe_check_access_sizes):
3085 Restrict sprintf output.
3087 2021-11-09 Martin Liska <mliska@suse.cz>
3089 * genconditions.c (write_one_condition): Add const qualifier
3092 2021-11-09 Jeff Law <jeffreyalaw@gmail.com>
3094 * match.pd: New pattern to simplify (1 << n) & M ==/!= 0 for M
3097 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
3099 * config/rs6000/rs6000-builtin-new.def (VEC_INIT_V16QI): Use
3100 escape-newline support.
3101 (VEC_INIT_V4SI): Likewise.
3102 (VEC_INIT_V8HI): Likewise.
3103 (PACK_V1TI): Likewise.
3105 (VFIRSTMISMATCHOREOSINDEX_V16QI): Likewise.
3106 (VFIRSTMISMATCHOREOSINDEX_V8HI): Likewise.
3107 (VFIRSTMISMATCHOREOSINDEX_V4SI): Likewise.
3111 (FMAF128_ODD): Likewise.
3112 (VSCEQPUO): Likewise.
3114 (VSIEQPF): Likewise.
3115 (VSTDCQP): Likewise.
3116 (PACK_TD): Likewise.
3117 (TABORTDC): Likewise.
3118 (TABORTDCI): Likewise.
3119 (SE_LXVRBX): Likewise.
3120 (SE_LXVRHX): Likewise.
3121 (SE_LXVRWX): Likewise.
3122 (SE_LXVRDX): Likewise.
3123 (VREPLACE_UN_UV2DI): Likewise.
3124 (VREPLACE_UN_UV4SI): Likewise.
3125 (VREPLACE_UN_V2DI): Likewise.
3126 (VREPLACE_ELT_UV2DI): Likewise.
3127 (VREPLACE_ELT_V2DI): Likewise.
3128 (ZE_LXVRBX): Likewise.
3129 (ZE_LXVRHX): Likewise.
3130 (ZE_LXVRWX): Likewise.
3131 (ZE_LXVRDX): Likewise.
3133 (CNTLZDM): Likewise.
3134 (CNTTZDM): Likewise.
3137 (PMXVBF16GER2): Likewise.
3138 (PMXVBF16GER2_INTERNAL): Likewise.
3139 (PMXVBF16GER2NN): Likewise.
3140 (PMXVBF16GER2NN_INTERNAL): Likewise.
3141 (PMXVBF16GER2NP): Likewise.
3142 (PMXVBF16GER2NP_INTERNAL): Likewise.
3143 (PMXVBF16GER2PN): Likewise.
3144 (PMXVBF16GER2PN_INTERNAL): Likewise.
3145 (PMXVBF16GER2PP): Likewise.
3146 (PMXVBF16GER2PP_INTERNAL): Likewise.
3147 (PMXVF16GER2): Likewise.
3148 (PMXVF16GER2_INTERNAL): Likewise.
3149 (PMXVF16GER2NN): Likewise.
3150 (PMXVF16GER2NN_INTERNAL): Likewise.
3151 (PMXVF16GER2NP): Likewise.
3152 (PMXVF16GER2NP_INTERNAL): Likewise.
3153 (PMXVF16GER2PN): Likewise.
3154 (PMXVF16GER2PN_INTERNAL): Likewise.
3155 (PMXVF16GER2PP): Likewise.
3156 (PMXVF16GER2PP_INTERNAL): Likewise.
3157 (PMXVF32GER_INTERNAL): Likewise.
3158 (PMXVF32GERNN): Likewise.
3159 (PMXVF32GERNN_INTERNAL): Likewise.
3160 (PMXVF32GERNP): Likewise.
3161 (PMXVF32GERNP_INTERNAL): Likewise.
3162 (PMXVF32GERPN): Likewise.
3163 (PMXVF32GERPN_INTERNAL): Likewise.
3164 (PMXVF32GERPP): Likewise.
3165 (PMXVF32GERPP_INTERNAL): Likewise.
3166 (PMXVF64GER): Likewise.
3167 (PMXVF64GER_INTERNAL): Likewise.
3168 (PMXVF64GERNN): Likewise.
3169 (PMXVF64GERNN_INTERNAL): Likewise.
3170 (PMXVF64GERNP): Likewise.
3171 (PMXVF64GERNP_INTERNAL): Likewise.
3172 (PMXVF64GERPN): Likewise.
3173 (PMXVF64GERPN_INTERNAL): Likewise.
3174 (PMXVF64GERPP): Likewise.
3175 (PMXVF64GERPP_INTERNAL): Likewise.
3176 (PMXVI16GER2): Likewise.
3177 (PMXVI16GER2_INTERNAL): Likewise.
3178 (PMXVI16GER2PP): Likewise.
3179 (PMXVI16GER2PP_INTERNAL): Likewise.
3180 (PMXVI16GER2S): Likewise.
3181 (PMXVI16GER2S_INTERNAL): Likewise.
3182 (PMXVI16GER2SPP): Likewise.
3183 (PMXVI16GER2SPP_INTERNAL): Likewise.
3184 (PMXVI4GER8): Likewise.
3185 (PMXVI4GER8_INTERNAL): Likewise.
3186 (PMXVI4GER8PP): Likewise.
3187 (PMXVI4GER8PP_INTERNAL): Likewise.
3188 (PMXVI8GER4): Likewise.
3189 (PMXVI8GER4_INTERNAL): Likewise.
3190 (PMXVI8GER4PP): Likewise.
3191 (PMXVI8GER4PP_INTERNAL): Likewise.
3192 (PMXVI8GER4SPP): Likewise.
3193 (PMXVI8GER4SPP_INTERNAL): Likewise.
3194 * config/rs6000/rs6000-gen-builtins.c (MAXLINES): New macro.
3195 (linebuf): Increase size.
3196 (lines): New variable.
3197 (lastline): Likewise.
3198 (real_line_pos): New function.
3199 (diag): Change signature.
3200 (bif_diag): Change signature; support escape-newline handling.
3201 (ovld_diag): Likewise.
3202 (fatal): Move earlier.
3203 (consume_whitespace): Adjust diag call.
3204 (advance_line): Add escape-newline handling; call fatal.
3205 (safe_inc_pos): Adjust diag call.
3206 (match_identifier): Likewise.
3207 (match_integer): Likewise.
3208 (match_to_right_bracket): Call fatal instead of diag; adjust diag
3210 (match_basetype): Adjust diag calls.
3211 (match_bracketed_pair): Likewise.
3212 (match_const_restriction): Likewise.
3213 (match_type): Likewise.
3214 (parse_args): Likewise.
3215 (parse_bif_attrs): Likewise.
3216 (complete_vector_type): Likewise.
3217 (complete_base_type): Likewise.
3218 (parse_prototype): Likewise.
3219 (parse_bif_entry): Likewise.
3220 (parse_bif_stanza): Likewise.
3221 (parse_ovld_entry): Likewise.
3222 (parse_ovld_stanza): Likewise.
3223 (main): Allocate buffers for lines[].
3225 2021-11-08 Eric Botcazou <ebotcazou@adacore.com>
3227 * config/i386/i386.c (legitimize_pic_address): Adjust comment and
3228 use the REG argument on the CM_LARGE_PIC code path as well.
3229 * config/i386/predicates.md (gotoff_operand): Do not treat VxWorks
3230 specially with the large code models.
3232 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3234 * gimple.c (gimple_call_static_chain_flags): Revert the workaround
3235 allowing interposition since issues with binds_to_local_def were
3238 2021-11-08 Andrew MacLeod <amacleod@redhat.com>
3240 PR tree-optimization/103122
3241 * gimple-range.cc (gimple_ranger::range_of_expr): Request the cache
3242 entry with "calulate new values" set to false.
3244 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3246 * builtins.c (is_simple_builtin): Add builitin_dwarf_cfa
3247 and builtin_return_address.
3248 (builtin_fnspec): Annotate builtin_return,
3249 bulitin_eh_pointer, builtin_eh_filter, builtin_unwind_resume,
3250 builtin_cxa_end_cleanup, builtin_eh_copy_values,
3251 builtin_frame_address, builtin_apply_args,
3252 builtin_asan_before_dynamic_init, builtin_asan_after_dynamic_init,
3253 builtin_prefetch, builtin_dwarf_cfa, builtin_return_addrss
3255 * ipa-pure-const.c (special_builtin_state): Add builtin_dwarf_cfa
3256 and builtin_return_address.
3258 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3260 PR tree-optimization/103177
3261 * passes.def: Move uncprop after pure/const and modref.
3263 2021-11-08 Martin Jambor <mjambor@suse.cz>
3267 * tree-inline.c (remap_gimple_stmt): Unshare the expression without
3268 location before invoking remap_with_debug_expressions on it.
3269 * ipa-param-manipulation.c
3270 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
3272 2021-11-08 David Edelsohn <dje.gcc@gmail.com>
3274 * config/rs6000/vsx.md (vsx_splat_v4si_di): Revert "wa"
3277 2021-11-08 Richard Biener <rguenther@suse.de>
3279 * cfgloop.c (verify_loop_structure): Use a temporary BB flag
3280 instead of an sbitmap to cache irreducible state.
3282 2021-11-08 Aldy Hernandez <aldyh@redhat.com>
3284 PR tree-optimization/103120
3285 * gimple-range-path.cc (path_range_query::range_defined_in_block):
3286 Bail if there's a cache entry.
3288 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
3290 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Use
3291 rs6000_builtin_decls_x when appropriate.
3292 (add_condition_to_bb): Likewise.
3293 (rs6000_atomic_assign_expand_fenv): Likewise.
3295 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
3297 * config/rs6000/rs6000-call.c (rs6000_new_builtin_decl): New function.
3298 (rs6000_builtin_decl): Call it.
3300 2021-11-08 Martin Liska <mliska@suse.cz>
3302 * cgraph.c (cgraph_node::dump): Dump it from decl.
3304 2021-11-08 Martin Liska <mliska@suse.cz>
3306 PR gcov-profile/100520
3307 * coverage.c (coverage_compute_profile_id): Strip .gk when
3308 compare debug is used.
3309 * system.h (endswith): New function.
3311 2021-11-08 Martin Liska <mliska@suse.cz>
3313 * cgraph.c (cgraph_node::dump): Dump static_chain_decl.
3315 2021-11-08 Thomas Schwinge <thomas@codesourcery.com>
3317 * config/rs6000/rbtree.c: Fix 'Copyright (C) 2020-21' into '2020-2021'
3318 * config/rs6000/rbtree.h: Likewise.
3319 * config/rs6000/rs6000-builtin-new.def: Likewise.
3320 * config/rs6000/rs6000-gen-builtins.c: Likewise.
3321 * config/rs6000/rs6000-overload.def: Likewise.
3323 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
3325 * config/aarch64/aarch64.c (aarch64_ldn_stn_vectors): New function.
3326 (aarch64_address_cost): Use it instead of testing for CImode and
3329 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
3331 * genmodes.c (emit_insn_modes_h): Define NUM_MODE_* macros.
3332 * expmed.h (NUM_MODE_INT): Delete in favor of genmodes definitions.
3333 (NUM_MODE_PARTIAL_INT, NUM_MODE_VECTOR_INT): Likewise.
3334 * real.h (real_format_for_mode): Use NUM_MODE_FLOAT and
3335 NUM_MODE_DECIMAL_FLOAT.
3336 (REAL_MODE_FORMAT): Likewise.
3338 2021-11-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
3340 * tree-vect-loop.c (vect_better_loop_vinfo_p): Change how epilogue loop
3343 2021-11-08 Richard Biener <rguenther@suse.de>
3345 * tree-vectorizer.h (vect_create_loop_vinfo): Add main_loop_info
3347 * tree-vect-loop.c (vect_create_loop_vinfo): Likewise. Set
3348 LOOP_VINFO_ORIG_LOOP_INFO and conditionalize set of
3349 LOOP_VINFO_NITERS_ASSUMPTIONS.
3350 (vect_analyze_loop_1): Adjust.
3351 (vect_analyze_loop): Move loop constraint setting and
3352 SCEV/niter reset here from vect_create_loop_vinfo to perform
3354 (vect_analyze_loop_form): Move dumping of symbolic niters
3355 here from vect_create_loop_vinfo.
3357 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3359 * ipa-modref-tree.h (enum modref_special_parms): New enum.
3360 (struct modref_access_node): update for special parms.
3361 (struct modref_ref_node): Likewise.
3362 (struct modref_parm_map): Likewise.
3363 (struct modref_tree): Likewise.
3364 * ipa-modref.c (dump_access): Likewise.
3365 (get_access): Detect static chain.
3366 (parm_map_for_arg): Take tree as arg instead of
3368 (merge_call_side_effects): Compute map for static chain.
3369 (process_fnspec): Update.
3370 (struct escape_point): Remove retslot_arg and static_chain_arg.
3371 (analyze_parms): Update.
3372 (compute_parm_map): Update.
3373 (propagate_unknown_call): Update.
3374 (modref_propagate_in_scc): Update.
3375 (modref_merge_call_site_flags): Update.
3376 (ipa_merge_modref_summary_after_inlining): Update.
3377 * tree-ssa-alias.c (modref_may_conflict): Handle static chain.
3378 * ipa-modref-tree.c (test_merge): Update.
3380 2021-11-08 Haochen Gui <guihaoc@gcc.gnu.org>
3382 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin): Disable
3383 gimple fold for VSX_BUILTIN_XVMINDP, ALTIVEC_BUILTIN_VMINFP,
3384 VSX_BUILTIN_XVMAXDP, ALTIVEC_BUILTIN_VMAXFP when fast-math is not
3387 2021-11-08 liuhongt <hongtao.liu@intel.com>
3389 PR tree-optimization/103077
3390 * doc/invoke.texi (Options That Control Optimization):
3391 Update documentation for -ftree-loop-vectorize and
3392 -ftree-slp-vectorize which are enabled by default at -02.
3394 2021-11-08 liuhongt <hongtao.liu@intel.com>
3397 * match.pd (Simplifcation (trunc)copysign((extend)a, (extend)b)
3398 to .COPYSIGN (a, b)): Add !HONOR_SNANS.
3400 2021-11-08 liuhongt <hongtao.liu@intel.com>
3403 * match.pd: Simplify
3404 (trunc)fma ((extend)a, (extend)b, (extend)c) to IFN_FMA (a, b,
3405 c) under flag_unsafe_math_optimizations.
3407 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
3410 * params.opt (modref-max-depth): Add range.
3411 (modref-max-adjustments): Fix range.
3413 2021-11-07 Aldy Hernandez <aldyh@redhat.com>
3415 * tree-pass.h (make_pass_vrp_threader): Remove.
3416 * tree-ssa-threadbackward.c
3417 (back_threader_profitability::profitable_path_p): Remove
3418 ASSERT_EXPR references.
3419 * tree-ssa-threadedge.c (jt_state::register_equivs_stmt): Same.
3420 * tree-vrp.c (vrp_folder::simplify_casted_conds): Same.
3421 (execute_vrp): Same.
3422 (class hybrid_threader): Remove.
3423 (hybrid_threader::hybrid_threader): Remove.
3424 (hybrid_threader::~hybrid_threader): Remove.
3425 (hybrid_threader::before_dom_children): Remove.
3426 (hybrid_threader::after_dom_children): Remove.
3427 (execute_vrp_threader): Remove.
3428 (class pass_vrp_threader): Remove.
3429 (make_pass_vrp_threader): Remove.
3431 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
3433 * gimple.c (gimple_call_arg_flags): Use interposable_eaf_flags.
3434 (gimple_call_retslot_flags): Likewise.
3435 (gimple_call_static_chain_flags): Likewise.
3436 * ipa-modref.c (remove_useless_eaf_flags): Do not remove everything for
3438 (modref_summary::useful_p): Likewise.
3439 (modref_summary_lto::useful_p): Likewise.
3440 (analyze_parms): Do not give up on NOVOPS.
3441 (analyze_function): When dumping report chnages in EAF flags
3442 between IPA and local pass.
3443 (modref_merge_call_site_flags): Compute implicit eaf flags
3444 based on callee ecf_flags and fnspec; if the function does not
3445 bind to current defs use interposable_eaf_flags.
3446 (modref_propagate_flags_in_scc): Update.
3447 * ipa-modref.h (interposable_eaf_flags): New function.
3449 2021-11-07 Bill Schmidt <wschmidt@linux.ibm.com>
3451 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): New
3453 (rs6000_invalid_new_builtin): New function.
3454 (rs6000_expand_builtin): Call rs6000_expand_new_builtin.
3455 (rs6000_expand_ldst_mask): New function.
3456 (new_cpu_expand_builtin): Likewise.
3457 (elemrev_icode): Likewise.
3458 (ldv_expand_builtin): Likewise.
3459 (lxvrse_expand_builtin): Likewise.
3460 (lxvrze_expand_builtin): Likewise.
3461 (stv_expand_builtin): Likewise.
3462 (new_mma_expand_builtin): Likewise.
3463 (new_htm_spr_num): Likewise.
3464 (new_htm_expand_builtin): Likewise.
3465 (rs6000_expand_new_builtin): Likewise.
3466 (rs6000_init_builtins): Initialize altivec_builtin_mask_for_load.
3468 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
3470 * ipa-modref.c (modref_lattice): Add do_dataflow,
3471 changed and propagate_to fields.
3472 (modref_lattice::release): Free propagate_to
3473 (modref_lattice::merge): Do not give up early on unknown
3475 (modref_lattice::merge_deref): Likewise.
3476 (modref_eaf_analysis): Update toplevel comment.
3477 (modref_eaf_analysis::analyze_ssa_name): Record postponned ssa names;
3478 do optimistic dataflow initialization.
3479 (modref_eaf_analysis::merge_with_ssa_name): Build dataflow graph.
3480 (modref_eaf_analysis::propagate): New member function.
3481 (analyze_parms): Update to new API of modref_eaf_analysis.
3483 2021-11-06 Jan Hubicka <hubicka@ucw.cz>
3485 * cgraph.h (cgraph_node::can_be_discarded_p): Do not
3486 return true on functions from other partition.
3488 2021-11-06 David Edelsohn <dje.gcc@gmail.com>
3490 * config/rs6000/vsx.md (vsx_splat_v4si): Change constraints to "wa".
3491 (vsx_splat_v4si_di): Change constraint to "wa".
3493 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
3495 PR tree-optimization/103061
3496 * value-relation.cc (path_oracle::path_oracle): Initialize
3498 (path_oracle::killing_def): Set m_killed_defs.
3499 (path_oracle::query_relation): Do not look at the root oracle for
3501 * value-relation.h (class path_oracle): Add m_killed_defs.
3503 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
3505 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
3506 Remove gotos and other cleanups.
3508 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
3511 * ipa-modref-tree.h (modref_tree::insert): Do nothing for
3512 paradoxical and zero sized accesses.
3514 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
3517 * ipa-modref-tree.h (struct modref_access_node): Avoid left shift
3520 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
3522 * config/darwin.h (ASM_GENERATE_INTERNAL_LABEL): Add LTRAMP
3523 to the list of symbol prefixes that must be made linker-
3526 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
3528 * config.host: Add support for aarch64-*-darwin.
3529 * config/aarch64/host-aarch64-darwin.c: New file.
3530 * config/aarch64/x-darwin: New file.
3532 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
3534 PR tree-optimization/103093
3535 * gimple-range-gori.cc (range_def_chain::get_imports): Remove assert.
3537 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
3539 PR tree-optimization/102943
3540 * gimple-range-cache.cc (class update_list): New.
3541 (update_list::add): Replace add_to_update.
3542 (update_list::pop): New.
3543 (ranger_cache::ranger_cache): Adjust.
3544 (ranger_cache::~ranger_cache): Adjust.
3545 (ranger_cache::add_to_update): Delete.
3546 (ranger_cache::propagate_cache): Adjust to new class.
3547 (ranger_cache::propagate_updated_value): Ditto.
3548 (ranger_cache::fill_block_cache): Ditto.
3549 * gimple-range-cache.h (class ranger_cache): Adjust to update class.
3551 2021-11-05 Richard Biener <rguenther@suse.de>
3553 * tree-vect-loop.c (vect_analyze_loop): Remove obsolete
3554 comment and expand on another one. Combine nested if.
3556 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
3559 * config/pa/pa-modes.def: Add OImode integer type.
3560 * config/pa/pa.c (pa_scalar_mode_supported_p): Allow TImode
3562 * config/pa/pa.h (MIN_UNITS_PER_WORD) Define to MIN_UNITS_PER_WORD
3563 to UNITS_PER_WORD if IN_LIBGCC2.
3564 * config/pa/pa.md (addti3, addvti3, subti3, subvti3, negti2,
3565 negvti2, ashlti3, shrpd_internal): New patterns.
3566 Change some multi instruction types to multi.
3568 2021-11-05 Jakub Jelinek <jakub@redhat.com>
3571 * config/i386/i386.h
3572 (stringop_algs::stringop_strategy::stringop_strategy): Make the ctor
3575 2021-11-05 Wilco Dijkstra <wdijkstr@arm.com>
3578 * config/aarch64/aarch64.c (aarch64_mov_operand_p): Strip the salt
3580 * config/aarch64/constraints.md: Support const in Usw.
3582 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
3584 * config/pa/pa.h (PREFERRED_DEBUGGING_TYPE): Define to DWARF2_DEBUG.
3585 * config/pa/pa64-hpux.h (PREFERRED_DEBUGGING_TYPE): Remove define.
3587 2021-11-05 Richard Biener <rguenther@suse.de>
3589 * tree-vectorizer.h (vec_info_shared::n_stmts): Add.
3590 (LOOP_VINFO_N_STMTS): Likewise.
3591 (vec_info_for_bb): Remove unused function.
3592 * tree-vectorizer.c (vec_info_shared::vec_info_shared):
3593 Initialize n_stmts member.
3594 * tree-vect-loop.c: Remove INCLUDE_FUNCTIONAL.
3595 (vect_create_loop_vinfo): Do not set loop->aux.
3596 (vect_analyze_loop_2): Do not get n_stmts as argument,
3597 instead use LOOP_VINFO_N_STMTS. Set LOOP_VINFO_VECTORIZABLE_P
3599 (vect_analyze_loop_1): Remove callback, get the mode iterator
3600 and autodetected_vector_mode as argument, advancing the
3601 iterator and initializing autodetected_vector_mode here.
3602 (vect_analyze_loop): Split analysis loop into two, first
3603 processing main loops only and then epilogues.
3605 2021-11-05 Martin Jambor <mjambor@suse.cz>
3607 * ipa-prop.c (compute_complex_assign_jump_func): Remove
3608 unnecessary check for RECORD_TYPE.
3610 2021-11-05 Gerald Pfeifer <gerald@pfeifer.com>
3612 * doc/install.texi2html: Do not generate old.html any longer.
3614 2021-11-05 Martin Liska <mliska@suse.cz>
3617 * opts.c (finish_options): Reset flag_gtoggle when it is used.
3619 2021-11-05 Jakub Jelinek <jakub@redhat.com>
3622 * dwarf2out.c (add_const_value_attribute): Add MODE argument, use it
3623 in CONST_WIDE_INT handling. Adjust recursive calls.
3624 (add_location_or_const_value_attribute): Pass DECL_MODE (decl) to
3625 new add_const_value_attribute argument.
3626 (tree_add_const_value_attribute): Pass TYPE_MODE (type) to new
3627 add_const_value_attribute argument.
3629 2021-11-05 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
3631 * config/vx-common.h: Test value of TARGET_VXWORKS7 rather
3634 2021-11-05 Richard Biener <rguenther@suse.de>
3636 * tree-vectorizer.h (struct vect_loop_form_info): New.
3637 (vect_analyze_loop_form): Adjust.
3638 (vect_create_loop_vinfo): New.
3639 * tree-parloops.c (gather_scalar_reductions): Adjust for
3640 vect_analyze_loop_form API change.
3641 * tree-vect-loop.c: Include <functional>.
3642 (vect_analyze_loop_form_1): Rename to vect_analyze_loop_form,
3643 take struct vect_loop_form_info as output parameter and adjust.
3644 (vect_analyze_loop_form): Rename to vect_create_loop_vinfo and
3645 split out call to the original vect_analyze_loop_form_1.
3646 (vect_reanalyze_as_main_loop): Rename to...
3647 (vect_analyze_loop_1): ... this, factor out the call to
3648 vect_analyze_loop_form and generalize to be able to use it twice ...
3649 (vect_analyze_loop): ... here. Perform vect_analyze_loop_form
3652 2021-11-05 Xionghu Luo <luoxhu@linux.ibm.com>
3655 * config/rs6000/fusion.md: Regenerate.
3656 * config/rs6000/genfusion.pl: Fix incorrect clobber constraint.
3658 2021-11-04 Andreas Krebbel <krebbel@linux.ibm.com>
3660 * config/s390/s390.h (STACK_CHECK_MOVING_SP): New macro
3663 2021-11-04 Tamar Christina <tamar.christina@arm.com>
3665 * config/aarch64/aarch64-builtins.c
3666 (aarch64_general_gimple_fold_builtin): Add ashl, sshl, ushl, ashr,
3667 ashr_simd, lshr, lshr_simd.
3668 * config/aarch64/aarch64-simd-builtins.def (lshr): Use USHIFTIMM.
3669 * config/aarch64/arm_neon.h (vshr_n_u8, vshr_n_u16, vshr_n_u32,
3670 vshrq_n_u8, vshrq_n_u16, vshrq_n_u32, vshrq_n_u64): Fix type hack.
3672 2021-11-04 Tamar Christina <tamar.christina@arm.com>
3674 * match.pd: New negate+shift pattern.
3676 2021-11-04 Andrew MacLeod <amacleod@redhat.com>
3678 PR tree-optimization/103079
3679 * gimple-range-gori.cc (gimple_range_calc_op1): Treat undefined as
3681 (gimple_range_calc_op2): Ditto.
3683 2021-11-04 Martin Jambor <mjambor@suse.cz>
3686 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
3687 members remap_with_debug_expressions, m_dead_ssa_debug_equiv,
3688 m_dead_stmt_debug_equiv and prepare_debug_expressions. Added
3689 parameter to mark_dead_statements.
3690 * ipa-param-manipulation.c: Include tree-phinodes.h and cfgexpand.h.
3691 (ipa_param_body_adjustments::mark_dead_statements): New parameter
3692 debugstack, push into it all SSA names used in debug statements,
3693 produce m_dead_ssa_debug_equiv mapping for the removed param.
3694 (replace_with_mapped_expr): New function.
3695 (ipa_param_body_adjustments::remap_with_debug_expressions): Likewise.
3696 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
3697 (ipa_param_body_adjustments::common_initialization): Gather and
3698 procecc SSA which will be removed but are in debug statements. Simplify.
3699 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
3701 * tree-inline.c (remap_gimple_stmt): Create a debug bind when possible
3702 when avoiding a copy of an unnecessary statement. Remap removed SSA
3703 names in existing debug statements.
3704 (tree_function_versioning): Do not create DEBUG_EXPR_DECL for removed
3705 parameters if we have already done so.
3707 2021-11-04 Jan Hubicka <hubicka@ucw.cz>
3710 * gimple.c (gimple_call_static_chain_flags): Handle case when
3711 nested function does not bind locally.
3713 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
3715 * config/aarch64/aarch64.c (aarch64_function_value): Generate
3716 a register rtx for Neon vector-tuple modes.
3717 (aarch64_layout_arg): Likewise.
3719 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
3721 * lower-subreg.c (simple_move): Prevent decomposition if
3722 modes are not tieable.
3724 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
3725 Richard Sandiford <richard.sandiford@arm.com>
3727 * config/aarch64/aarch64-builtins.c (v2x8qi_UP): Define.
3728 (v2x4hi_UP): Likewise.
3729 (v2x4hf_UP): Likewise.
3730 (v2x4bf_UP): Likewise.
3731 (v2x2si_UP): Likewise.
3732 (v2x2sf_UP): Likewise.
3733 (v2x1di_UP): Likewise.
3734 (v2x1df_UP): Likewise.
3735 (v2x16qi_UP): Likewise.
3736 (v2x8hi_UP): Likewise.
3737 (v2x8hf_UP): Likewise.
3738 (v2x8bf_UP): Likewise.
3739 (v2x4si_UP): Likewise.
3740 (v2x4sf_UP): Likewise.
3741 (v2x2di_UP): Likewise.
3742 (v2x2df_UP): Likewise.
3743 (v3x8qi_UP): Likewise.
3744 (v3x4hi_UP): Likewise.
3745 (v3x4hf_UP): Likewise.
3746 (v3x4bf_UP): Likewise.
3747 (v3x2si_UP): Likewise.
3748 (v3x2sf_UP): Likewise.
3749 (v3x1di_UP): Likewise.
3750 (v3x1df_UP): Likewise.
3751 (v3x16qi_UP): Likewise.
3752 (v3x8hi_UP): Likewise.
3753 (v3x8hf_UP): Likewise.
3754 (v3x8bf_UP): Likewise.
3755 (v3x4si_UP): Likewise.
3756 (v3x4sf_UP): Likewise.
3757 (v3x2di_UP): Likewise.
3758 (v3x2df_UP): Likewise.
3759 (v4x8qi_UP): Likewise.
3760 (v4x4hi_UP): Likewise.
3761 (v4x4hf_UP): Likewise.
3762 (v4x4bf_UP): Likewise.
3763 (v4x2si_UP): Likewise.
3764 (v4x2sf_UP): Likewise.
3765 (v4x1di_UP): Likewise.
3766 (v4x1df_UP): Likewise.
3767 (v4x16qi_UP): Likewise.
3768 (v4x8hi_UP): Likewise.
3769 (v4x8hf_UP): Likewise.
3770 (v4x8bf_UP): Likewise.
3771 (v4x4si_UP): Likewise.
3772 (v4x4sf_UP): Likewise.
3773 (v4x2di_UP): Likewise.
3774 (v4x2df_UP): Likewise.
3775 (TYPES_GETREGP): Delete.
3776 (TYPES_SETREGP): Likewise.
3777 (TYPES_LOADSTRUCT_U): Define.
3778 (TYPES_LOADSTRUCT_P): Likewise.
3779 (TYPES_LOADSTRUCT_LANE_U): Likewise.
3780 (TYPES_LOADSTRUCT_LANE_P): Likewise.
3781 (TYPES_STORE1P): Move for consistency.
3782 (TYPES_STORESTRUCT_U): Define.
3783 (TYPES_STORESTRUCT_P): Likewise.
3784 (TYPES_STORESTRUCT_LANE_U): Likewise.
3785 (TYPES_STORESTRUCT_LANE_P): Likewise.
3786 (aarch64_simd_tuple_types): Define.
3787 (aarch64_lookup_simd_builtin_type): Handle tuple type lookup.
3788 (aarch64_init_simd_builtin_functions): Update frontend lookup
3789 for builtin functions after handling arm_neon.h pragma.
3790 (register_tuple_type): Manually set modes of single-integer
3791 tuple types. Record tuple types.
3792 * config/aarch64/aarch64-modes.def
3793 (ADV_SIMD_D_REG_STRUCT_MODES): Define D-register tuple modes.
3794 (ADV_SIMD_Q_REG_STRUCT_MODES): Define Q-register tuple modes.
3795 (SVE_MODES): Give single-vector modes priority over vector-
3797 (VECTOR_MODES_WITH_PREFIX): Set partial-vector mode order to
3798 be after all single-vector modes.
3799 * config/aarch64/aarch64-simd-builtins.def: Update builtin
3800 generator macros to reflect modifications to the backend
3802 * config/aarch64/aarch64-simd.md (aarch64_simd_ld2<mode>):
3803 Use vector-tuple mode iterator and rename to...
3804 (aarch64_simd_ld2<vstruct_elt>): This.
3805 (aarch64_simd_ld2r<mode>): Use vector-tuple mode iterator and
3807 (aarch64_simd_ld2r<vstruct_elt>): This.
3808 (aarch64_vec_load_lanesoi_lane<mode>): Use vector-tuple mode
3809 iterator and rename to...
3810 (aarch64_vec_load_lanes<mode>_lane<vstruct_elt>): This.
3811 (vec_load_lanesoi<mode>): Use vector-tuple mode iterator and
3813 (vec_load_lanes<mode><vstruct_elt>): This.
3814 (aarch64_simd_st2<mode>): Use vector-tuple mode iterator and
3816 (aarch64_simd_st2<vstruct_elt>): This.
3817 (aarch64_vec_store_lanesoi_lane<mode>): Use vector-tuple mode
3818 iterator and rename to...
3819 (aarch64_vec_store_lanes<mode>_lane<vstruct_elt>): This.
3820 (vec_store_lanesoi<mode>): Use vector-tuple mode iterator and
3822 (vec_store_lanes<mode><vstruct_elt>): This.
3823 (aarch64_simd_ld3<mode>): Use vector-tuple mode iterator and
3825 (aarch64_simd_ld3<vstruct_elt>): This.
3826 (aarch64_simd_ld3r<mode>): Use vector-tuple mode iterator and
3828 (aarch64_simd_ld3r<vstruct_elt>): This.
3829 (aarch64_vec_load_lanesci_lane<mode>): Use vector-tuple mode
3830 iterator and rename to...
3831 (vec_load_lanesci<mode>): This.
3832 (aarch64_simd_st3<mode>): Use vector-tuple mode iterator and
3834 (aarch64_simd_st3<vstruct_elt>): This.
3835 (aarch64_vec_store_lanesci_lane<mode>): Use vector-tuple mode
3836 iterator and rename to...
3837 (vec_store_lanesci<mode>): This.
3838 (aarch64_simd_ld4<mode>): Use vector-tuple mode iterator and
3840 (aarch64_simd_ld4<vstruct_elt>): This.
3841 (aarch64_simd_ld4r<mode>): Use vector-tuple mode iterator and
3843 (aarch64_simd_ld4r<vstruct_elt>): This.
3844 (aarch64_vec_load_lanesxi_lane<mode>): Use vector-tuple mode
3845 iterator and rename to...
3846 (vec_load_lanesxi<mode>): This.
3847 (aarch64_simd_st4<mode>): Use vector-tuple mode iterator and
3849 (aarch64_simd_st4<vstruct_elt>): This.
3850 (aarch64_vec_store_lanesxi_lane<mode>): Use vector-tuple mode
3851 iterator and rename to...
3852 (vec_store_lanesxi<mode>): This.
3853 (mov<mode>): Define for Neon vector-tuple modes.
3854 (aarch64_ld1x3<VALLDIF:mode>): Use vector-tuple mode iterator
3856 (aarch64_ld1x3<vstruct_elt>): This.
3857 (aarch64_ld1_x3_<mode>): Use vector-tuple mode iterator and
3859 (aarch64_ld1_x3_<vstruct_elt>): This.
3860 (aarch64_ld1x4<VALLDIF:mode>): Use vector-tuple mode iterator
3862 (aarch64_ld1x4<vstruct_elt>): This.
3863 (aarch64_ld1_x4_<mode>): Use vector-tuple mode iterator and
3865 (aarch64_ld1_x4_<vstruct_elt>): This.
3866 (aarch64_st1x2<VALLDIF:mode>): Use vector-tuple mode iterator
3868 (aarch64_st1x2<vstruct_elt>): This.
3869 (aarch64_st1_x2_<mode>): Use vector-tuple mode iterator and
3871 (aarch64_st1_x2_<vstruct_elt>): This.
3872 (aarch64_st1x3<VALLDIF:mode>): Use vector-tuple mode iterator
3874 (aarch64_st1x3<vstruct_elt>): This.
3875 (aarch64_st1_x3_<mode>): Use vector-tuple mode iterator and
3877 (aarch64_st1_x3_<vstruct_elt>): This.
3878 (aarch64_st1x4<VALLDIF:mode>): Use vector-tuple mode iterator
3880 (aarch64_st1x4<vstruct_elt>): This.
3881 (aarch64_st1_x4_<mode>): Use vector-tuple mode iterator and
3883 (aarch64_st1_x4_<vstruct_elt>): This.
3884 (*aarch64_mov<mode>): Define for vector-tuple modes.
3885 (*aarch64_be_mov<mode>): Likewise.
3886 (aarch64_ld<VSTRUCT:nregs>r<VALLDIF:mode>): Use vector-tuple
3887 mode iterator and rename to...
3888 (aarch64_ld<nregs>r<vstruct_elt>): This.
3889 (aarch64_ld2<mode>_dreg): Use vector-tuple mode iterator and
3891 (aarch64_ld2<vstruct_elt>_dreg): This.
3892 (aarch64_ld3<mode>_dreg): Use vector-tuple mode iterator and
3894 (aarch64_ld3<vstruct_elt>_dreg): This.
3895 (aarch64_ld4<mode>_dreg): Use vector-tuple mode iterator and
3897 (aarch64_ld4<vstruct_elt>_dreg): This.
3898 (aarch64_ld<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
3899 iterator and rename to...
3900 (aarch64_ld<nregs><vstruct_elt>): Use vector-tuple mode
3901 iterator and rename to...
3902 (aarch64_ld<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
3903 (aarch64_ld1x2<VQ:mode>): Delete.
3904 (aarch64_ld1x2<VDC:mode>): Use vector-tuple mode iterator and
3906 (aarch64_ld1x2<vstruct_elt>): This.
3907 (aarch64_ld<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
3908 tuple mode iterator and rename to...
3909 (aarch64_ld<nregs>_lane<vstruct_elt>): This.
3910 (aarch64_get_dreg<VSTRUCT:mode><VDC:mode>): Delete.
3911 (aarch64_get_qreg<VSTRUCT:mode><VQ:mode>): Likewise.
3912 (aarch64_st2<mode>_dreg): Use vector-tuple mode iterator and
3914 (aarch64_st2<vstruct_elt>_dreg): This.
3915 (aarch64_st3<mode>_dreg): Use vector-tuple mode iterator and
3917 (aarch64_st3<vstruct_elt>_dreg): This.
3918 (aarch64_st4<mode>_dreg): Use vector-tuple mode iterator and
3920 (aarch64_st4<vstruct_elt>_dreg): This.
3921 (aarch64_st<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
3922 iterator and rename to...
3923 (aarch64_st<nregs><vstruct_elt>): This.
3924 (aarch64_st<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
3925 iterator and rename to aarch64_st<nregs><vstruct_elt>.
3926 (aarch64_st<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
3927 tuple mode iterator and rename to...
3928 (aarch64_st<nregs>_lane<vstruct_elt>): This.
3929 (aarch64_set_qreg<VSTRUCT:mode><VQ:mode>): Delete.
3930 (aarch64_simd_ld1<mode>_x2): Use vector-tuple mode iterator
3932 (aarch64_simd_ld1<vstruct_elt>_x2): This.
3933 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
3934 Refactor to include new vector-tuple modes.
3935 (aarch64_classify_vector_mode): Add cases for new vector-
3937 (aarch64_advsimd_partial_struct_mode_p): Define.
3938 (aarch64_advsimd_full_struct_mode_p): Likewise.
3939 (aarch64_advsimd_vector_array_mode): Likewise.
3940 (aarch64_sve_data_mode): Change location in file.
3941 (aarch64_array_mode): Handle case of Neon vector-tuple modes.
3942 (aarch64_hard_regno_nregs): Handle case of partial Neon
3944 (aarch64_classify_address): Refactor to include handling of
3945 Neon vector-tuple modes.
3946 (aarch64_print_operand): Print "d" for "%R" for a partial
3947 Neon vector structure.
3948 (aarch64_expand_vec_perm_1): Use new vector-tuple mode.
3949 (aarch64_modes_tieable_p): Prevent tieing Neon partial struct
3950 modes with scalar machines modes larger than 8 bytes.
3951 (aarch64_can_change_mode_class): Don't allow changes between
3952 partial and full Neon vector-structure modes.
3953 * config/aarch64/arm_neon.h (vst2_lane_f16): Use updated
3954 builtin and remove boiler-plate code for opaque mode.
3955 (vst2_lane_f32): Likewise.
3956 (vst2_lane_f64): Likewise.
3957 (vst2_lane_p8): Likewise.
3958 (vst2_lane_p16): Likewise.
3959 (vst2_lane_p64): Likewise.
3960 (vst2_lane_s8): Likewise.
3961 (vst2_lane_s16): Likewise.
3962 (vst2_lane_s32): Likewise.
3963 (vst2_lane_s64): Likewise.
3964 (vst2_lane_u8): Likewise.
3965 (vst2_lane_u16): Likewise.
3966 (vst2_lane_u32): Likewise.
3967 (vst2_lane_u64): Likewise.
3968 (vst2q_lane_f16): Likewise.
3969 (vst2q_lane_f32): Likewise.
3970 (vst2q_lane_f64): Likewise.
3971 (vst2q_lane_p8): Likewise.
3972 (vst2q_lane_p16): Likewise.
3973 (vst2q_lane_p64): Likewise.
3974 (vst2q_lane_s8): Likewise.
3975 (vst2q_lane_s16): Likewise.
3976 (vst2q_lane_s32): Likewise.
3977 (vst2q_lane_s64): Likewise.
3978 (vst2q_lane_u8): Likewise.
3979 (vst2q_lane_u16): Likewise.
3980 (vst2q_lane_u32): Likewise.
3981 (vst2q_lane_u64): Likewise.
3982 (vst3_lane_f16): Likewise.
3983 (vst3_lane_f32): Likewise.
3984 (vst3_lane_f64): Likewise.
3985 (vst3_lane_p8): Likewise.
3986 (vst3_lane_p16): Likewise.
3987 (vst3_lane_p64): Likewise.
3988 (vst3_lane_s8): Likewise.
3989 (vst3_lane_s16): Likewise.
3990 (vst3_lane_s32): Likewise.
3991 (vst3_lane_s64): Likewise.
3992 (vst3_lane_u8): Likewise.
3993 (vst3_lane_u16): Likewise.
3994 (vst3_lane_u32): Likewise.
3995 (vst3_lane_u64): Likewise.
3996 (vst3q_lane_f16): Likewise.
3997 (vst3q_lane_f32): Likewise.
3998 (vst3q_lane_f64): Likewise.
3999 (vst3q_lane_p8): Likewise.
4000 (vst3q_lane_p16): Likewise.
4001 (vst3q_lane_p64): Likewise.
4002 (vst3q_lane_s8): Likewise.
4003 (vst3q_lane_s16): Likewise.
4004 (vst3q_lane_s32): Likewise.
4005 (vst3q_lane_s64): Likewise.
4006 (vst3q_lane_u8): Likewise.
4007 (vst3q_lane_u16): Likewise.
4008 (vst3q_lane_u32): Likewise.
4009 (vst3q_lane_u64): Likewise.
4010 (vst4_lane_f16): Likewise.
4011 (vst4_lane_f32): Likewise.
4012 (vst4_lane_f64): Likewise.
4013 (vst4_lane_p8): Likewise.
4014 (vst4_lane_p16): Likewise.
4015 (vst4_lane_p64): Likewise.
4016 (vst4_lane_s8): Likewise.
4017 (vst4_lane_s16): Likewise.
4018 (vst4_lane_s32): Likewise.
4019 (vst4_lane_s64): Likewise.
4020 (vst4_lane_u8): Likewise.
4021 (vst4_lane_u16): Likewise.
4022 (vst4_lane_u32): Likewise.
4023 (vst4_lane_u64): Likewise.
4024 (vst4q_lane_f16): Likewise.
4025 (vst4q_lane_f32): Likewise.
4026 (vst4q_lane_f64): Likewise.
4027 (vst4q_lane_p8): Likewise.
4028 (vst4q_lane_p16): Likewise.
4029 (vst4q_lane_p64): Likewise.
4030 (vst4q_lane_s8): Likewise.
4031 (vst4q_lane_s16): Likewise.
4032 (vst4q_lane_s32): Likewise.
4033 (vst4q_lane_s64): Likewise.
4034 (vst4q_lane_u8): Likewise.
4035 (vst4q_lane_u16): Likewise.
4036 (vst4q_lane_u32): Likewise.
4037 (vst4q_lane_u64): Likewise.
4038 (vtbl3_s8): Likewise.
4039 (vtbl3_u8): Likewise.
4040 (vtbl3_p8): Likewise.
4041 (vtbl4_s8): Likewise.
4042 (vtbl4_u8): Likewise.
4043 (vtbl4_p8): Likewise.
4044 (vld1_u8_x3): Likewise.
4045 (vld1_s8_x3): Likewise.
4046 (vld1_u16_x3): Likewise.
4047 (vld1_s16_x3): Likewise.
4048 (vld1_u32_x3): Likewise.
4049 (vld1_s32_x3): Likewise.
4050 (vld1_u64_x3): Likewise.
4051 (vld1_s64_x3): Likewise.
4052 (vld1_f16_x3): Likewise.
4053 (vld1_f32_x3): Likewise.
4054 (vld1_f64_x3): Likewise.
4055 (vld1_p8_x3): Likewise.
4056 (vld1_p16_x3): Likewise.
4057 (vld1_p64_x3): Likewise.
4058 (vld1q_u8_x3): Likewise.
4059 (vld1q_s8_x3): Likewise.
4060 (vld1q_u16_x3): Likewise.
4061 (vld1q_s16_x3): Likewise.
4062 (vld1q_u32_x3): Likewise.
4063 (vld1q_s32_x3): Likewise.
4064 (vld1q_u64_x3): Likewise.
4065 (vld1q_s64_x3): Likewise.
4066 (vld1q_f16_x3): Likewise.
4067 (vld1q_f32_x3): Likewise.
4068 (vld1q_f64_x3): Likewise.
4069 (vld1q_p8_x3): Likewise.
4070 (vld1q_p16_x3): Likewise.
4071 (vld1q_p64_x3): Likewise.
4072 (vld1_u8_x2): Likewise.
4073 (vld1_s8_x2): Likewise.
4074 (vld1_u16_x2): Likewise.
4075 (vld1_s16_x2): Likewise.
4076 (vld1_u32_x2): Likewise.
4077 (vld1_s32_x2): Likewise.
4078 (vld1_u64_x2): Likewise.
4079 (vld1_s64_x2): Likewise.
4080 (vld1_f16_x2): Likewise.
4081 (vld1_f32_x2): Likewise.
4082 (vld1_f64_x2): Likewise.
4083 (vld1_p8_x2): Likewise.
4084 (vld1_p16_x2): Likewise.
4085 (vld1_p64_x2): Likewise.
4086 (vld1q_u8_x2): Likewise.
4087 (vld1q_s8_x2): Likewise.
4088 (vld1q_u16_x2): Likewise.
4089 (vld1q_s16_x2): Likewise.
4090 (vld1q_u32_x2): Likewise.
4091 (vld1q_s32_x2): Likewise.
4092 (vld1q_u64_x2): Likewise.
4093 (vld1q_s64_x2): Likewise.
4094 (vld1q_f16_x2): Likewise.
4095 (vld1q_f32_x2): Likewise.
4096 (vld1q_f64_x2): Likewise.
4097 (vld1q_p8_x2): Likewise.
4098 (vld1q_p16_x2): Likewise.
4099 (vld1q_p64_x2): Likewise.
4100 (vld1_s8_x4): Likewise.
4101 (vld1q_s8_x4): Likewise.
4102 (vld1_s16_x4): Likewise.
4103 (vld1q_s16_x4): Likewise.
4104 (vld1_s32_x4): Likewise.
4105 (vld1q_s32_x4): Likewise.
4106 (vld1_u8_x4): Likewise.
4107 (vld1q_u8_x4): Likewise.
4108 (vld1_u16_x4): Likewise.
4109 (vld1q_u16_x4): Likewise.
4110 (vld1_u32_x4): Likewise.
4111 (vld1q_u32_x4): Likewise.
4112 (vld1_f16_x4): Likewise.
4113 (vld1q_f16_x4): Likewise.
4114 (vld1_f32_x4): Likewise.
4115 (vld1q_f32_x4): Likewise.
4116 (vld1_p8_x4): Likewise.
4117 (vld1q_p8_x4): Likewise.
4118 (vld1_p16_x4): Likewise.
4119 (vld1q_p16_x4): Likewise.
4120 (vld1_s64_x4): Likewise.
4121 (vld1_u64_x4): Likewise.
4122 (vld1_p64_x4): Likewise.
4123 (vld1q_s64_x4): Likewise.
4124 (vld1q_u64_x4): Likewise.
4125 (vld1q_p64_x4): Likewise.
4126 (vld1_f64_x4): Likewise.
4127 (vld1q_f64_x4): Likewise.
4128 (vld2_s64): Likewise.
4129 (vld2_u64): Likewise.
4130 (vld2_f64): Likewise.
4131 (vld2_s8): Likewise.
4132 (vld2_p8): Likewise.
4133 (vld2_p64): Likewise.
4134 (vld2_s16): Likewise.
4135 (vld2_p16): Likewise.
4136 (vld2_s32): Likewise.
4137 (vld2_u8): Likewise.
4138 (vld2_u16): Likewise.
4139 (vld2_u32): Likewise.
4140 (vld2_f16): Likewise.
4141 (vld2_f32): Likewise.
4142 (vld2q_s8): Likewise.
4143 (vld2q_p8): Likewise.
4144 (vld2q_s16): Likewise.
4145 (vld2q_p16): Likewise.
4146 (vld2q_p64): Likewise.
4147 (vld2q_s32): Likewise.
4148 (vld2q_s64): Likewise.
4149 (vld2q_u8): Likewise.
4150 (vld2q_u16): Likewise.
4151 (vld2q_u32): Likewise.
4152 (vld2q_u64): Likewise.
4153 (vld2q_f16): Likewise.
4154 (vld2q_f32): Likewise.
4155 (vld2q_f64): Likewise.
4156 (vld3_s64): Likewise.
4157 (vld3_u64): Likewise.
4158 (vld3_f64): Likewise.
4159 (vld3_s8): Likewise.
4160 (vld3_p8): Likewise.
4161 (vld3_s16): Likewise.
4162 (vld3_p16): Likewise.
4163 (vld3_s32): Likewise.
4164 (vld3_u8): Likewise.
4165 (vld3_u16): Likewise.
4166 (vld3_u32): Likewise.
4167 (vld3_f16): Likewise.
4168 (vld3_f32): Likewise.
4169 (vld3_p64): Likewise.
4170 (vld3q_s8): Likewise.
4171 (vld3q_p8): Likewise.
4172 (vld3q_s16): Likewise.
4173 (vld3q_p16): Likewise.
4174 (vld3q_s32): Likewise.
4175 (vld3q_s64): Likewise.
4176 (vld3q_u8): Likewise.
4177 (vld3q_u16): Likewise.
4178 (vld3q_u32): Likewise.
4179 (vld3q_u64): Likewise.
4180 (vld3q_f16): Likewise.
4181 (vld3q_f32): Likewise.
4182 (vld3q_f64): Likewise.
4183 (vld3q_p64): Likewise.
4184 (vld4_s64): Likewise.
4185 (vld4_u64): Likewise.
4186 (vld4_f64): Likewise.
4187 (vld4_s8): Likewise.
4188 (vld4_p8): Likewise.
4189 (vld4_s16): Likewise.
4190 (vld4_p16): Likewise.
4191 (vld4_s32): Likewise.
4192 (vld4_u8): Likewise.
4193 (vld4_u16): Likewise.
4194 (vld4_u32): Likewise.
4195 (vld4_f16): Likewise.
4196 (vld4_f32): Likewise.
4197 (vld4_p64): Likewise.
4198 (vld4q_s8): Likewise.
4199 (vld4q_p8): Likewise.
4200 (vld4q_s16): Likewise.
4201 (vld4q_p16): Likewise.
4202 (vld4q_s32): Likewise.
4203 (vld4q_s64): Likewise.
4204 (vld4q_u8): Likewise.
4205 (vld4q_u16): Likewise.
4206 (vld4q_u32): Likewise.
4207 (vld4q_u64): Likewise.
4208 (vld4q_f16): Likewise.
4209 (vld4q_f32): Likewise.
4210 (vld4q_f64): Likewise.
4211 (vld4q_p64): Likewise.
4212 (vld2_dup_s8): Likewise.
4213 (vld2_dup_s16): Likewise.
4214 (vld2_dup_s32): Likewise.
4215 (vld2_dup_f16): Likewise.
4216 (vld2_dup_f32): Likewise.
4217 (vld2_dup_f64): Likewise.
4218 (vld2_dup_u8): Likewise.
4219 (vld2_dup_u16): Likewise.
4220 (vld2_dup_u32): Likewise.
4221 (vld2_dup_p8): Likewise.
4222 (vld2_dup_p16): Likewise.
4223 (vld2_dup_p64): Likewise.
4224 (vld2_dup_s64): Likewise.
4225 (vld2_dup_u64): Likewise.
4226 (vld2q_dup_s8): Likewise.
4227 (vld2q_dup_p8): Likewise.
4228 (vld2q_dup_s16): Likewise.
4229 (vld2q_dup_p16): Likewise.
4230 (vld2q_dup_s32): Likewise.
4231 (vld2q_dup_s64): Likewise.
4232 (vld2q_dup_u8): Likewise.
4233 (vld2q_dup_u16): Likewise.
4234 (vld2q_dup_u32): Likewise.
4235 (vld2q_dup_u64): Likewise.
4236 (vld2q_dup_f16): Likewise.
4237 (vld2q_dup_f32): Likewise.
4238 (vld2q_dup_f64): Likewise.
4239 (vld2q_dup_p64): Likewise.
4240 (vld3_dup_s64): Likewise.
4241 (vld3_dup_u64): Likewise.
4242 (vld3_dup_f64): Likewise.
4243 (vld3_dup_s8): Likewise.
4244 (vld3_dup_p8): Likewise.
4245 (vld3_dup_s16): Likewise.
4246 (vld3_dup_p16): Likewise.
4247 (vld3_dup_s32): Likewise.
4248 (vld3_dup_u8): Likewise.
4249 (vld3_dup_u16): Likewise.
4250 (vld3_dup_u32): Likewise.
4251 (vld3_dup_f16): Likewise.
4252 (vld3_dup_f32): Likewise.
4253 (vld3_dup_p64): Likewise.
4254 (vld3q_dup_s8): Likewise.
4255 (vld3q_dup_p8): Likewise.
4256 (vld3q_dup_s16): Likewise.
4257 (vld3q_dup_p16): Likewise.
4258 (vld3q_dup_s32): Likewise.
4259 (vld3q_dup_s64): Likewise.
4260 (vld3q_dup_u8): Likewise.
4261 (vld3q_dup_u16): Likewise.
4262 (vld3q_dup_u32): Likewise.
4263 (vld3q_dup_u64): Likewise.
4264 (vld3q_dup_f16): Likewise.
4265 (vld3q_dup_f32): Likewise.
4266 (vld3q_dup_f64): Likewise.
4267 (vld3q_dup_p64): Likewise.
4268 (vld4_dup_s64): Likewise.
4269 (vld4_dup_u64): Likewise.
4270 (vld4_dup_f64): Likewise.
4271 (vld4_dup_s8): Likewise.
4272 (vld4_dup_p8): Likewise.
4273 (vld4_dup_s16): Likewise.
4274 (vld4_dup_p16): Likewise.
4275 (vld4_dup_s32): Likewise.
4276 (vld4_dup_u8): Likewise.
4277 (vld4_dup_u16): Likewise.
4278 (vld4_dup_u32): Likewise.
4279 (vld4_dup_f16): Likewise.
4280 (vld4_dup_f32): Likewise.
4281 (vld4_dup_p64): Likewise.
4282 (vld4q_dup_s8): Likewise.
4283 (vld4q_dup_p8): Likewise.
4284 (vld4q_dup_s16): Likewise.
4285 (vld4q_dup_p16): Likewise.
4286 (vld4q_dup_s32): Likewise.
4287 (vld4q_dup_s64): Likewise.
4288 (vld4q_dup_u8): Likewise.
4289 (vld4q_dup_u16): Likewise.
4290 (vld4q_dup_u32): Likewise.
4291 (vld4q_dup_u64): Likewise.
4292 (vld4q_dup_f16): Likewise.
4293 (vld4q_dup_f32): Likewise.
4294 (vld4q_dup_f64): Likewise.
4295 (vld4q_dup_p64): Likewise.
4296 (vld2_lane_u8): Likewise.
4297 (vld2_lane_u16): Likewise.
4298 (vld2_lane_u32): Likewise.
4299 (vld2_lane_u64): Likewise.
4300 (vld2_lane_s8): Likewise.
4301 (vld2_lane_s16): Likewise.
4302 (vld2_lane_s32): Likewise.
4303 (vld2_lane_s64): Likewise.
4304 (vld2_lane_f16): Likewise.
4305 (vld2_lane_f32): Likewise.
4306 (vld2_lane_f64): Likewise.
4307 (vld2_lane_p8): Likewise.
4308 (vld2_lane_p16): Likewise.
4309 (vld2_lane_p64): Likewise.
4310 (vld2q_lane_u8): Likewise.
4311 (vld2q_lane_u16): Likewise.
4312 (vld2q_lane_u32): Likewise.
4313 (vld2q_lane_u64): Likewise.
4314 (vld2q_lane_s8): Likewise.
4315 (vld2q_lane_s16): Likewise.
4316 (vld2q_lane_s32): Likewise.
4317 (vld2q_lane_s64): Likewise.
4318 (vld2q_lane_f16): Likewise.
4319 (vld2q_lane_f32): Likewise.
4320 (vld2q_lane_f64): Likewise.
4321 (vld2q_lane_p8): Likewise.
4322 (vld2q_lane_p16): Likewise.
4323 (vld2q_lane_p64): Likewise.
4324 (vld3_lane_u8): Likewise.
4325 (vld3_lane_u16): Likewise.
4326 (vld3_lane_u32): Likewise.
4327 (vld3_lane_u64): Likewise.
4328 (vld3_lane_s8): Likewise.
4329 (vld3_lane_s16): Likewise.
4330 (vld3_lane_s32): Likewise.
4331 (vld3_lane_s64): Likewise.
4332 (vld3_lane_f16): Likewise.
4333 (vld3_lane_f32): Likewise.
4334 (vld3_lane_f64): Likewise.
4335 (vld3_lane_p8): Likewise.
4336 (vld3_lane_p16): Likewise.
4337 (vld3_lane_p64): Likewise.
4338 (vld3q_lane_u8): Likewise.
4339 (vld3q_lane_u16): Likewise.
4340 (vld3q_lane_u32): Likewise.
4341 (vld3q_lane_u64): Likewise.
4342 (vld3q_lane_s8): Likewise.
4343 (vld3q_lane_s16): Likewise.
4344 (vld3q_lane_s32): Likewise.
4345 (vld3q_lane_s64): Likewise.
4346 (vld3q_lane_f16): Likewise.
4347 (vld3q_lane_f32): Likewise.
4348 (vld3q_lane_f64): Likewise.
4349 (vld3q_lane_p8): Likewise.
4350 (vld3q_lane_p16): Likewise.
4351 (vld3q_lane_p64): Likewise.
4352 (vld4_lane_u8): Likewise.
4353 (vld4_lane_u16): Likewise.
4354 (vld4_lane_u32): Likewise.
4355 (vld4_lane_u64): Likewise.
4356 (vld4_lane_s8): Likewise.
4357 (vld4_lane_s16): Likewise.
4358 (vld4_lane_s32): Likewise.
4359 (vld4_lane_s64): Likewise.
4360 (vld4_lane_f16): Likewise.
4361 (vld4_lane_f32): Likewise.
4362 (vld4_lane_f64): Likewise.
4363 (vld4_lane_p8): Likewise.
4364 (vld4_lane_p16): Likewise.
4365 (vld4_lane_p64): Likewise.
4366 (vld4q_lane_u8): Likewise.
4367 (vld4q_lane_u16): Likewise.
4368 (vld4q_lane_u32): Likewise.
4369 (vld4q_lane_u64): Likewise.
4370 (vld4q_lane_s8): Likewise.
4371 (vld4q_lane_s16): Likewise.
4372 (vld4q_lane_s32): Likewise.
4373 (vld4q_lane_s64): Likewise.
4374 (vld4q_lane_f16): Likewise.
4375 (vld4q_lane_f32): Likewise.
4376 (vld4q_lane_f64): Likewise.
4377 (vld4q_lane_p8): Likewise.
4378 (vld4q_lane_p16): Likewise.
4379 (vld4q_lane_p64): Likewise.
4380 (vqtbl2_s8): Likewise.
4381 (vqtbl2_u8): Likewise.
4382 (vqtbl2_p8): Likewise.
4383 (vqtbl2q_s8): Likewise.
4384 (vqtbl2q_u8): Likewise.
4385 (vqtbl2q_p8): Likewise.
4386 (vqtbl3_s8): Likewise.
4387 (vqtbl3_u8): Likewise.
4388 (vqtbl3_p8): Likewise.
4389 (vqtbl3q_s8): Likewise.
4390 (vqtbl3q_u8): Likewise.
4391 (vqtbl3q_p8): Likewise.
4392 (vqtbl4_s8): Likewise.
4393 (vqtbl4_u8): Likewise.
4394 (vqtbl4_p8): Likewise.
4395 (vqtbl4q_s8): Likewise.
4396 (vqtbl4q_u8): Likewise.
4397 (vqtbl4q_p8): Likewise.
4398 (vqtbx2_s8): Likewise.
4399 (vqtbx2_u8): Likewise.
4400 (vqtbx2_p8): Likewise.
4401 (vqtbx2q_s8): Likewise.
4402 (vqtbx2q_u8): Likewise.
4403 (vqtbx2q_p8): Likewise.
4404 (vqtbx3_s8): Likewise.
4405 (vqtbx3_u8): Likewise.
4406 (vqtbx3_p8): Likewise.
4407 (vqtbx3q_s8): Likewise.
4408 (vqtbx3q_u8): Likewise.
4409 (vqtbx3q_p8): Likewise.
4410 (vqtbx4_s8): Likewise.
4411 (vqtbx4_u8): Likewise.
4412 (vqtbx4_p8): Likewise.
4413 (vqtbx4q_s8): Likewise.
4414 (vqtbx4q_u8): Likewise.
4415 (vqtbx4q_p8): Likewise.
4416 (vst1_s64_x2): Likewise.
4417 (vst1_u64_x2): Likewise.
4418 (vst1_f64_x2): Likewise.
4419 (vst1_s8_x2): Likewise.
4420 (vst1_p8_x2): Likewise.
4421 (vst1_s16_x2): Likewise.
4422 (vst1_p16_x2): Likewise.
4423 (vst1_s32_x2): Likewise.
4424 (vst1_u8_x2): Likewise.
4425 (vst1_u16_x2): Likewise.
4426 (vst1_u32_x2): Likewise.
4427 (vst1_f16_x2): Likewise.
4428 (vst1_f32_x2): Likewise.
4429 (vst1_p64_x2): Likewise.
4430 (vst1q_s8_x2): Likewise.
4431 (vst1q_p8_x2): Likewise.
4432 (vst1q_s16_x2): Likewise.
4433 (vst1q_p16_x2): Likewise.
4434 (vst1q_s32_x2): Likewise.
4435 (vst1q_s64_x2): Likewise.
4436 (vst1q_u8_x2): Likewise.
4437 (vst1q_u16_x2): Likewise.
4438 (vst1q_u32_x2): Likewise.
4439 (vst1q_u64_x2): Likewise.
4440 (vst1q_f16_x2): Likewise.
4441 (vst1q_f32_x2): Likewise.
4442 (vst1q_f64_x2): Likewise.
4443 (vst1q_p64_x2): Likewise.
4444 (vst1_s64_x3): Likewise.
4445 (vst1_u64_x3): Likewise.
4446 (vst1_f64_x3): Likewise.
4447 (vst1_s8_x3): Likewise.
4448 (vst1_p8_x3): Likewise.
4449 (vst1_s16_x3): Likewise.
4450 (vst1_p16_x3): Likewise.
4451 (vst1_s32_x3): Likewise.
4452 (vst1_u8_x3): Likewise.
4453 (vst1_u16_x3): Likewise.
4454 (vst1_u32_x3): Likewise.
4455 (vst1_f16_x3): Likewise.
4456 (vst1_f32_x3): Likewise.
4457 (vst1_p64_x3): Likewise.
4458 (vst1q_s8_x3): Likewise.
4459 (vst1q_p8_x3): Likewise.
4460 (vst1q_s16_x3): Likewise.
4461 (vst1q_p16_x3): Likewise.
4462 (vst1q_s32_x3): Likewise.
4463 (vst1q_s64_x3): Likewise.
4464 (vst1q_u8_x3): Likewise.
4465 (vst1q_u16_x3): Likewise.
4466 (vst1q_u32_x3): Likewise.
4467 (vst1q_u64_x3): Likewise.
4468 (vst1q_f16_x3): Likewise.
4469 (vst1q_f32_x3): Likewise.
4470 (vst1q_f64_x3): Likewise.
4471 (vst1q_p64_x3): Likewise.
4472 (vst1_s8_x4): Likewise.
4473 (vst1q_s8_x4): Likewise.
4474 (vst1_s16_x4): Likewise.
4475 (vst1q_s16_x4): Likewise.
4476 (vst1_s32_x4): Likewise.
4477 (vst1q_s32_x4): Likewise.
4478 (vst1_u8_x4): Likewise.
4479 (vst1q_u8_x4): Likewise.
4480 (vst1_u16_x4): Likewise.
4481 (vst1q_u16_x4): Likewise.
4482 (vst1_u32_x4): Likewise.
4483 (vst1q_u32_x4): Likewise.
4484 (vst1_f16_x4): Likewise.
4485 (vst1q_f16_x4): Likewise.
4486 (vst1_f32_x4): Likewise.
4487 (vst1q_f32_x4): Likewise.
4488 (vst1_p8_x4): Likewise.
4489 (vst1q_p8_x4): Likewise.
4490 (vst1_p16_x4): Likewise.
4491 (vst1q_p16_x4): Likewise.
4492 (vst1_s64_x4): Likewise.
4493 (vst1_u64_x4): Likewise.
4494 (vst1_p64_x4): Likewise.
4495 (vst1q_s64_x4): Likewise.
4496 (vst1q_u64_x4): Likewise.
4497 (vst1q_p64_x4): Likewise.
4498 (vst1_f64_x4): Likewise.
4499 (vst1q_f64_x4): Likewise.
4500 (vst2_s64): Likewise.
4501 (vst2_u64): Likewise.
4502 (vst2_f64): Likewise.
4503 (vst2_s8): Likewise.
4504 (vst2_p8): Likewise.
4505 (vst2_s16): Likewise.
4506 (vst2_p16): Likewise.
4507 (vst2_s32): Likewise.
4508 (vst2_u8): Likewise.
4509 (vst2_u16): Likewise.
4510 (vst2_u32): Likewise.
4511 (vst2_f16): Likewise.
4512 (vst2_f32): Likewise.
4513 (vst2_p64): Likewise.
4514 (vst2q_s8): Likewise.
4515 (vst2q_p8): Likewise.
4516 (vst2q_s16): Likewise.
4517 (vst2q_p16): Likewise.
4518 (vst2q_s32): Likewise.
4519 (vst2q_s64): Likewise.
4520 (vst2q_u8): Likewise.
4521 (vst2q_u16): Likewise.
4522 (vst2q_u32): Likewise.
4523 (vst2q_u64): Likewise.
4524 (vst2q_f16): Likewise.
4525 (vst2q_f32): Likewise.
4526 (vst2q_f64): Likewise.
4527 (vst2q_p64): Likewise.
4528 (vst3_s64): Likewise.
4529 (vst3_u64): Likewise.
4530 (vst3_f64): Likewise.
4531 (vst3_s8): Likewise.
4532 (vst3_p8): Likewise.
4533 (vst3_s16): Likewise.
4534 (vst3_p16): Likewise.
4535 (vst3_s32): Likewise.
4536 (vst3_u8): Likewise.
4537 (vst3_u16): Likewise.
4538 (vst3_u32): Likewise.
4539 (vst3_f16): Likewise.
4540 (vst3_f32): Likewise.
4541 (vst3_p64): Likewise.
4542 (vst3q_s8): Likewise.
4543 (vst3q_p8): Likewise.
4544 (vst3q_s16): Likewise.
4545 (vst3q_p16): Likewise.
4546 (vst3q_s32): Likewise.
4547 (vst3q_s64): Likewise.
4548 (vst3q_u8): Likewise.
4549 (vst3q_u16): Likewise.
4550 (vst3q_u32): Likewise.
4551 (vst3q_u64): Likewise.
4552 (vst3q_f16): Likewise.
4553 (vst3q_f32): Likewise.
4554 (vst3q_f64): Likewise.
4555 (vst3q_p64): Likewise.
4556 (vst4_s64): Likewise.
4557 (vst4_u64): Likewise.
4558 (vst4_f64): Likewise.
4559 (vst4_s8): Likewise.
4560 (vst4_p8): Likewise.
4561 (vst4_s16): Likewise.
4562 (vst4_p16): Likewise.
4563 (vst4_s32): Likewise.
4564 (vst4_u8): Likewise.
4565 (vst4_u16): Likewise.
4566 (vst4_u32): Likewise.
4567 (vst4_f16): Likewise.
4568 (vst4_f32): Likewise.
4569 (vst4_p64): Likewise.
4570 (vst4q_s8): Likewise.
4571 (vst4q_p8): Likewise.
4572 (vst4q_s16): Likewise.
4573 (vst4q_p16): Likewise.
4574 (vst4q_s32): Likewise.
4575 (vst4q_s64): Likewise.
4576 (vst4q_u8): Likewise.
4577 (vst4q_u16): Likewise.
4578 (vst4q_u32): Likewise.
4579 (vst4q_u64): Likewise.
4580 (vst4q_f16): Likewise.
4581 (vst4q_f32): Likewise.
4582 (vst4q_f64): Likewise.
4583 (vst4q_p64): Likewise.
4584 (vtbx4_s8): Likewise.
4585 (vtbx4_u8): Likewise.
4586 (vtbx4_p8): Likewise.
4587 (vld1_bf16_x2): Likewise.
4588 (vld1q_bf16_x2): Likewise.
4589 (vld1_bf16_x3): Likewise.
4590 (vld1q_bf16_x3): Likewise.
4591 (vld1_bf16_x4): Likewise.
4592 (vld1q_bf16_x4): Likewise.
4593 (vld2_bf16): Likewise.
4594 (vld2q_bf16): Likewise.
4595 (vld2_dup_bf16): Likewise.
4596 (vld2q_dup_bf16): Likewise.
4597 (vld3_bf16): Likewise.
4598 (vld3q_bf16): Likewise.
4599 (vld3_dup_bf16): Likewise.
4600 (vld3q_dup_bf16): Likewise.
4601 (vld4_bf16): Likewise.
4602 (vld4q_bf16): Likewise.
4603 (vld4_dup_bf16): Likewise.
4604 (vld4q_dup_bf16): Likewise.
4605 (vst1_bf16_x2): Likewise.
4606 (vst1q_bf16_x2): Likewise.
4607 (vst1_bf16_x3): Likewise.
4608 (vst1q_bf16_x3): Likewise.
4609 (vst1_bf16_x4): Likewise.
4610 (vst1q_bf16_x4): Likewise.
4611 (vst2_bf16): Likewise.
4612 (vst2q_bf16): Likewise.
4613 (vst3_bf16): Likewise.
4614 (vst3q_bf16): Likewise.
4615 (vst4_bf16): Likewise.
4616 (vst4q_bf16): Likewise.
4617 (vld2_lane_bf16): Likewise.
4618 (vld2q_lane_bf16): Likewise.
4619 (vld3_lane_bf16): Likewise.
4620 (vld3q_lane_bf16): Likewise.
4621 (vld4_lane_bf16): Likewise.
4622 (vld4q_lane_bf16): Likewise.
4623 (vst2_lane_bf16): Likewise.
4624 (vst2q_lane_bf16): Likewise.
4625 (vst3_lane_bf16): Likewise.
4626 (vst3q_lane_bf16): Likewise.
4627 (vst4_lane_bf16): Likewise.
4628 (vst4q_lane_bf16): Likewise.
4629 * config/aarch64/geniterators.sh: Modify iterator regex to
4630 match new vector-tuple modes.
4631 * config/aarch64/iterators.md (insn_count): Extend mode
4632 attribute with vector-tuple type information.
4634 (Vendreg): Likewise.
4637 (VSTRUCT_2D): New mode iterator.
4638 (VSTRUCT_2DNX): Likewise.
4639 (VSTRUCT_2DX): Likewise.
4640 (VSTRUCT_2Q): Likewise.
4641 (VSTRUCT_2QD): Likewise.
4642 (VSTRUCT_3D): Likewise.
4643 (VSTRUCT_3DNX): Likewise.
4644 (VSTRUCT_3DX): Likewise.
4645 (VSTRUCT_3Q): Likewise.
4646 (VSTRUCT_3QD): Likewise.
4647 (VSTRUCT_4D): Likewise.
4648 (VSTRUCT_4DNX): Likewise.
4649 (VSTRUCT_4DX): Likewise.
4650 (VSTRUCT_4Q): Likewise.
4651 (VSTRUCT_4QD): Likewise.
4652 (VSTRUCT_D): Likewise.
4653 (VSTRUCT_Q): Likewise.
4654 (VSTRUCT_QD): Likewise.
4655 (VSTRUCT_ELT): New mode attribute.
4656 (vstruct_elt): Likewise.
4657 * genmodes.c (VECTOR_MODE): Add default prefix and order
4659 (VECTOR_MODE_WITH_PREFIX): Define.
4660 (make_vector_mode): Add mode prefix and order parameters.
4662 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4664 * expmed.c (extract_bit_field_1): Ensure modes are tieable.
4666 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4668 * expr.c (emit_group_load_1): Remove historic workaround.
4670 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4672 * config/aarch64/aarch64-builtins.c (aarch64_init_simd_builtins):
4673 Factor out main loop to...
4674 (aarch64_init_simd_builtin_functions): This new function.
4675 (register_tuple_type): Define.
4676 (aarch64_scalar_builtin_type_p): Define.
4677 (handle_arm_neon_h): Define.
4678 * config/aarch64/aarch64-c.c (aarch64_pragma_aarch64): Handle
4679 pragma for arm_neon.h.
4680 * config/aarch64/aarch64-protos.h (aarch64_advsimd_struct_mode_p):
4682 (handle_arm_neon_h): Likewise.
4683 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
4684 Remove static modifier.
4685 * config/aarch64/arm_neon.h (target): Remove Neon vector
4686 structure type definitions.
4688 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
4690 PR tree-optimization/102943
4691 * gimple-range-path.cc (path_range_query::range_on_path_entry):
4692 Prefer range_of_expr unless there are no statements in the BB.
4694 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
4696 PR tree-optimization/102943
4697 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
4698 Avoid duplicate calculation of paths.
4700 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
4702 PR tree-optimization/102943
4703 * gimple-range-path.cc (path_range_query::compute_phi_relations):
4704 Only compute relations for SSA names in the import list.
4705 (path_range_query::compute_outgoing_relations): Same.
4706 * gimple-range-path.h (path_range_query::import_p): New.
4708 2021-11-04 Richard Biener <rguenther@suse.de>
4710 PR rtl-optimization/103075
4711 * simplify-rtx.c (exact_int_to_float_conversion_p): Return
4712 false for a VOIDmode operand.
4714 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
4716 * config/aarch64/aarch64.c (aarch64_vector_costs): Make member
4717 variables private and add "m_" to their names. Remove is_loop.
4718 (aarch64_record_potential_advsimd_unrolling): Replace with...
4719 (aarch64_vector_costs::record_potential_advsimd_unrolling): ...this.
4720 (aarch64_analyze_loop_vinfo): Replace with...
4721 (aarch64_vector_costs::analyze_loop_vinfo): ...this.
4722 Move initialization of (m_)vec_flags to add_stmt_cost.
4723 (aarch64_analyze_bb_vinfo): Delete.
4724 (aarch64_count_ops): Replace with...
4725 (aarch64_vector_costs::count_ops): ...this.
4726 (aarch64_vector_costs::add_stmt_cost): Set m_vec_flags,
4727 using m_costing_for_scalar to test whether we're costing
4728 scalar or vector code.
4729 (aarch64_adjust_body_cost_sve): Replace with...
4730 (aarch64_vector_costs::adjust_body_cost_sve): ...this.
4731 (aarch64_adjust_body_cost): Replace with...
4732 (aarch64_vector_costs::adjust_body_cost): ...this.
4733 (aarch64_vector_costs::finish_cost): Use m_vinfo instead of is_loop.
4735 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
4737 * target.def (targetm.vectorize.init_cost): Replace with...
4738 (targetm.vectorize.create_costs): ...this.
4739 (targetm.vectorize.add_stmt_cost): Delete.
4740 (targetm.vectorize.finish_cost): Likewise.
4741 (targetm.vectorize.destroy_cost_data): Likewise.
4742 * doc/tm.texi.in (TARGET_VECTORIZE_INIT_COST): Replace with...
4743 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
4744 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
4745 (TARGET_VECTORIZE_FINISH_COST): Likewise.
4746 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
4747 * doc/tm.texi: Regenerate.
4748 * tree-vectorizer.h (vec_info::vec_info): Remove target_cost_data
4750 (vec_info::target_cost_data): Change from a void * to a vector_costs *.
4751 (vector_costs): New class.
4752 (init_cost): Take a vec_info and return a vector_costs.
4753 (dump_stmt_cost): Remove data parameter.
4754 (add_stmt_cost): Replace vinfo and data parameters with a vector_costs.
4755 (add_stmt_costs): Likewise.
4756 (finish_cost): Replace data parameter with a vector_costs.
4757 (destroy_cost_data): Delete.
4758 * tree-vectorizer.c (dump_stmt_cost): Remove data argument and
4760 (vec_info::vec_info): Remove the target_cost_data parameter and
4761 initialize the member variable to null instead.
4762 (vec_info::~vec_info): Delete target_cost_data instead of calling
4764 (vector_costs::add_stmt_cost): New function.
4765 (vector_costs::finish_cost): Likewise.
4766 (vector_costs::record_stmt_cost): Likewise.
4767 (vector_costs::adjust_cost_for_freq): Likewise.
4768 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
4769 call to vec_info::vec_info.
4770 (vect_compute_single_scalar_iteration_cost): Update after above
4771 changes to costing interface.
4772 (vect_analyze_loop_operations): Likewise.
4773 (vect_estimate_min_profitable_iters): Likewise.
4774 (vect_analyze_loop_2): Initialize LOOP_VINFO_TARGET_COST_DATA
4775 at the start_over point, where it needs to be recreated after
4776 trying without slp. Update retry code accordingly.
4777 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Update call
4778 to vec_info::vec_info.
4779 (vect_slp_analyze_operation): Update after above changes to costing
4781 (vect_bb_vectorization_profitable_p): Likewise.
4782 * targhooks.h (default_init_cost): Replace with...
4783 (default_vectorize_create_costs): ...this.
4784 (default_add_stmt_cost): Delete.
4785 (default_finish_cost, default_destroy_cost_data): Likewise.
4786 * targhooks.c (default_init_cost): Replace with...
4787 (default_vectorize_create_costs): ...this.
4788 (default_add_stmt_cost): Delete, moving logic to vector_costs instead.
4789 (default_finish_cost, default_destroy_cost_data): Delete.
4790 * config/aarch64/aarch64.c (aarch64_vector_costs): Inherit from
4791 vector_costs. Add a constructor.
4792 (aarch64_init_cost): Replace with...
4793 (aarch64_vectorize_create_costs): ...this.
4794 (aarch64_add_stmt_cost): Replace with...
4795 (aarch64_vector_costs::add_stmt_cost): ...this. Use record_stmt_cost
4796 to adjust the cost for inner loops.
4797 (aarch64_finish_cost): Replace with...
4798 (aarch64_vector_costs::finish_cost): ...this.
4799 (aarch64_destroy_cost_data): Delete.
4800 (TARGET_VECTORIZE_INIT_COST): Replace with...
4801 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
4802 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
4803 (TARGET_VECTORIZE_FINISH_COST): Likewise.
4804 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
4805 * config/i386/i386.c (ix86_vector_costs): New structure.
4806 (ix86_init_cost): Replace with...
4807 (ix86_vectorize_create_costs): ...this.
4808 (ix86_add_stmt_cost): Replace with...
4809 (ix86_vector_costs::add_stmt_cost): ...this. Use adjust_cost_for_freq
4810 to adjust the cost for inner loops.
4811 (ix86_finish_cost, ix86_destroy_cost_data): Delete.
4812 (TARGET_VECTORIZE_INIT_COST): Replace with...
4813 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
4814 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
4815 (TARGET_VECTORIZE_FINISH_COST): Likewise.
4816 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
4817 * config/rs6000/rs6000.c (TARGET_VECTORIZE_INIT_COST): Replace with...
4818 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
4819 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
4820 (TARGET_VECTORIZE_FINISH_COST): Likewise.
4821 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
4822 (rs6000_cost_data): Inherit from vector_costs.
4823 Add a constructor. Drop loop_info, cost and costing_for_scalar
4824 in favor of the corresponding vector_costs member variables.
4825 Add "m_" to the names of the remaining member variables and
4827 (rs6000_density_test): Replace with...
4828 (rs6000_cost_data::density_test): ...this.
4829 (rs6000_init_cost): Replace with...
4830 (rs6000_vectorize_create_costs): ...this.
4831 (rs6000_update_target_cost_per_stmt): Replace with...
4832 (rs6000_cost_data::update_target_cost_per_stmt): ...this.
4833 (rs6000_add_stmt_cost): Replace with...
4834 (rs6000_cost_data::add_stmt_cost): ...this. Use adjust_cost_for_freq
4835 to adjust the cost for inner loops.
4836 (rs6000_adjust_vect_cost_per_loop): Replace with...
4837 (rs6000_cost_data::adjust_vect_cost_per_loop): ...this.
4838 (rs6000_finish_cost): Replace with...
4839 (rs6000_cost_data::finish_cost): ...this. Group loop code
4840 into a single if statement and pass the loop_vinfo down to
4842 (rs6000_destroy_cost_data): Delete.
4844 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
4846 PR tree-optimization/103062
4847 PR tree-optimization/103062
4848 * value-pointer-equiv.cc (ssa_equiv_stack::ssa_equiv_stack):
4849 Increase size of allocation by 1.
4850 (ssa_equiv_stack::push_replacement): Grow as needed.
4851 (ssa_equiv_stack::get_replacement): Same.
4852 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
4853 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Remove delete.
4854 (pointer_equiv_analyzer::set_global_equiv): Grow as needed.
4855 (pointer_equiv_analyzer::get_equiv): Same.
4856 (pointer_equiv_analyzer::get_equiv_expr): Remove const.
4857 * value-pointer-equiv.h (class pointer_equiv_analyzer): Remove
4858 const markers. Use auto_vec instead of tree *.
4860 2021-11-04 Richard Biener <rguenther@suse.de>
4862 * tree-ssa-sccvn.c (vn_nary_op_insert_into): Remove always
4863 true parameter and inline valueization.
4864 (vn_nary_op_lookup_1): Inline valueization from ...
4865 (vn_nary_op_compute_hash): ... here and remove it here.
4866 * tree-ssa-pre.c (phi_translate_1): Do not valueize
4867 before vn_nary_lookup_pieces.
4868 (get_representative_for): Mark created SSA representatives
4871 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
4873 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): Assert
4874 that the operand has a vector mode. Use subreg_lowpart_offset
4875 to test whether an index corresponds to the low part.
4877 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
4879 * read-rtl.c: Remove dead !GENERATOR_FILE block.
4880 * read-rtl-function.c (function_reader::consolidate_singletons):
4881 Generate canonical CONST_VECTORs.
4883 2021-11-04 liuhongt <hongtao.liu@intel.com>
4886 * config/i386/predicates.md (reg_or_notreg_operand): Rename to ..
4887 (regmem_or_bitnot_regmem_operand): .. and extend to handle
4889 * config/i386/sse.md (*<avx512>_vpternlog<mode>_1): Force_reg
4890 the operands which are required to be register_operand.
4891 (*<avx512>_vpternlog<mode>_2): Ditto.
4892 (*<avx512>_vpternlog<mode>_3): Ditto.
4893 (*<avx512>_vternlog<mode>_all): Disallow embeded broadcast for
4894 vector HFmodes since it's not a real AVX512FP16 instruction.
4896 2021-11-04 liuhongt <hongtao.liu@intel.com>
4899 * match.pd: simplify (trunc)copysign((extend)a, (extend)b) to
4900 .COPYSIGN (a,b) when a and b are same type as the truncation
4901 type and has less precision than extend type.
4903 2021-11-04 Richard Biener <rguenther@suse.de>
4905 * doc/generic.texi: Update TARGET_MEM_REF and MEM_REF
4908 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
4910 * config/i386/sse.md (VI2_AVX512VNNIBW): New mode iterator.
4911 (VI1_AVX512VNNI): Likewise.
4912 (SDOT_VPDP_SUF): New mode_attr.
4915 (sdot_prod<mode>): Use VI2_AVX512F iterator, expand to
4916 vpdpwssd when VNNI targets available.
4917 (usdot_prod<mode>): New expander for vector QImode.
4919 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
4921 * config/i386/amxtileintrin.h (_tile_loadd_internal): Add
4922 parentheses to base and stride.
4923 (_tile_stream_loadd_internal): Likewise.
4924 (_tile_stored_internal): Likewise.
4926 2021-11-03 Maciej W. Rozycki <macro@embecosm.com>
4928 * config/riscv/riscv.c (riscv_class_max_nregs): Swap the
4929 arguments to `reg_class_subset_p'.
4931 2021-11-03 Joseph Myers <joseph@codesourcery.com>
4934 * fold-const.c (fold_init): New function.
4935 * fold-const.h (fold_init): New prototype.
4937 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
4939 * params.opt (param_vrp2_mode): Make ranger the default for VRP2.
4941 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
4943 * gimple-range-fold.cc (fold_using_range::range_of_phi): Don't import
4944 a range from edge if arg == phidef.
4946 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
4948 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Test
4949 for constant before any other processing.
4951 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
4953 * flag-types.h (RANGER_DEBUG_ALL): Fix values.
4955 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
4957 * gimple-range.cc (gimple_ranger::gimple_ranger): Initialize current_bb.
4958 (gimple_ranger::range_of_expr): Pick up range_on_entry when there is
4959 no explcit context and current_bb is set.
4960 (gimple_ranger::fold_stmt): New.
4961 * gimple-range.h (current_bb, fold_stmt): New.
4962 * tree-vrp.c (rvrp_folder::fold_stmt): Call ranger's fold_stmt.
4964 2021-11-03 Richard Biener <rguenther@suse.de>
4966 PR tree-optimization/102970
4967 * tree-ssa-pre.c (phi_translate_1): Drop clique and base
4968 when translating a MEM_REF over a backedge.
4970 2021-11-03 Philipp Tomsich <philipp.tomsich@vrull.eu>
4972 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New Ampere-1 core.
4973 * config/aarch64/aarch64-tune.md: Regenerate.
4974 * config/aarch64/aarch64-cost-tables.h: Add extra costs for Ampere-1.
4975 * config/aarch64/aarch64.c: Add tuning structures for Ampere-1.
4976 * doc/invoke.texi: Add documentation for Ampere-1 core.
4978 2021-11-03 Wilco Dijkstra <wdijkstr@arm.com>
4980 * config/aarch64/aarch64.md (movsi): Add alternative for GOT accesses.
4982 (ldr_got_small_<mode>): Remove pattern.
4983 (ldr_got_small_sidi): Likewise.
4984 * config/aarch64/aarch64.c (aarch64_load_symref_appropriately): Keep
4985 GOT accesses as moves.
4986 (aarch64_print_operand): Correctly print got_lo12 in L specifier.
4987 (aarch64_mov_operand_p): Make GOT accesses valid move operands.
4988 * config/aarch64/constraints.md: Add new constraint Usw for GOT access.
4990 2021-11-03 Martin Liska <mliska@suse.cz>
4992 * gcov.c (read_line): Remove dead variable.
4994 2021-11-03 Martin Liska <mliska@suse.cz>
4997 * ipa-fnsummary.c (edge_predicate_pool): Rename predicate class to ipa_predicate.
4998 (ipa_fn_summary::account_size_time): Likewise.
4999 (edge_set_predicate): Likewise.
5000 (set_hint_predicate): Likewise.
5001 (add_freqcounting_predicate): Likewise.
5002 (evaluate_conditions_for_known_args): Likewise.
5003 (evaluate_properties_for_edge): Likewise.
5004 (remap_freqcounting_preds_after_dup): Likewise.
5005 (ipa_fn_summary_t::duplicate): Likewise.
5006 (set_cond_stmt_execution_predicate): Likewise.
5007 (set_switch_stmt_execution_predicate): Likewise.
5008 (compute_bb_predicates): Likewise.
5009 (will_be_nonconstant_expr_predicate): Likewise.
5010 (will_be_nonconstant_predicate): Likewise.
5011 (phi_result_unknown_predicate): Likewise.
5012 (predicate_for_phi_result): Likewise.
5013 (analyze_function_body): Likewise.
5014 (compute_fn_summary): Likewise.
5015 (summarize_calls_size_and_time): Likewise.
5016 (estimate_calls_size_and_time): Likewise.
5017 (ipa_call_context::estimate_size_and_time): Likewise.
5018 (remap_edge_summaries): Likewise.
5019 (remap_freqcounting_predicate): Likewise.
5020 (ipa_merge_fn_summary_after_inlining): Likewise.
5021 (ipa_update_overall_fn_summary): Likewise.
5022 (read_ipa_call_summary): Likewise.
5023 (inline_read_section): Likewise.
5024 * ipa-fnsummary.h (struct ipa_freqcounting_predicate): Likewise.
5025 * ipa-predicate.c (predicate::add_clause): Likewise.
5026 (ipa_predicate::add_clause): Likewise.
5027 (predicate::or_with): Likewise.
5028 (ipa_predicate::or_with): Likewise.
5029 (predicate::evaluate): Likewise.
5030 (ipa_predicate::evaluate): Likewise.
5031 (predicate::probability): Likewise.
5032 (ipa_predicate::probability): Likewise.
5033 (dump_condition): Likewise.
5034 (dump_clause): Likewise.
5035 (predicate::dump): Likewise.
5036 (ipa_predicate::dump): Likewise.
5037 (predicate::debug): Likewise.
5038 (ipa_predicate::debug): Likewise.
5039 (predicate::remap_after_duplication): Likewise.
5040 (ipa_predicate::remap_after_duplication): Likewise.
5041 (predicate::remap_after_inlining): Likewise.
5042 (ipa_predicate::remap_after_inlining): Likewise.
5043 (predicate::stream_in): Likewise.
5044 (ipa_predicate::stream_in): Likewise.
5045 (predicate::stream_out): Likewise.
5046 (ipa_predicate::stream_out): Likewise.
5047 (add_condition): Likewise.
5048 * ipa-predicate.h (class predicate): Likewise.
5049 (class ipa_predicate): Likewise.
5050 (add_condition): Likewise.
5052 2021-11-03 Richard Biener <rguenther@suse.de>
5054 * bitmap.h (bitmap_bit_p): Change the return type to bool.
5055 * bitmap.c (bitmap_bit_p): Likewise.
5056 * sbitmap.h (bitmap_bit_p): Likewise.
5057 (bitmap_set_bit): Return whether the bit changed.
5058 (bitmap_clear_bit): Likewise.
5059 * tree-ssa.c (verify_vssa): Make use of the changed state
5060 from bitmap_set_bit.
5062 2021-11-03 Richard Biener <rguenther@suse.de>
5064 PR middle-end/103033
5065 * internal-fn.c (expand_DEFERRED_INIT): Elide the
5066 native_interpret_expr path in favor of folding the
5067 VIEW_CONVERT_EXPR generated when punning the RHS.
5069 2021-11-03 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
5071 * config/s390/s390.c (s390_loop_unroll_adjust): In case of early
5074 2021-11-03 Jan Hubicka <hubicka@ucw.cz>
5077 * ipa-modref.c (callee_to_caller_flags): New function.
5078 (modref_eaf_analysis::analyze_ssa_name): Use it.
5079 (ipa_merge_modref_summary_after_inlining): Fix whitespace.
5081 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5083 * ipa-modref.c (modref_eaf_analysis::analyze_ssa_name): Revert
5086 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
5087 Uroš Bizjak <ubizjak@gmail.com>
5089 * config/i386/i386.md (<any_rotate>ti3): Provide expansion for
5090 rotations by non-constant amounts.
5092 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5094 * ipa-modref.c: Fix anonymous namespace placement.
5095 (class modref_eaf_analysis): New class.
5096 (analyze_ssa_name_flags): Turn to ...
5097 (modref_eaf_analysis::analyze_ssa_name): ... this one.
5098 (merge_call_lhs_flags): Turn to ...
5099 (modref_eaf_analysis::merge_call_lhs_flags): .. this one
5100 (modref_eaf_analysis::merge_with_ssa_name): New member function.
5101 (record_escape_points): Turn to ...
5102 (modref_eaf_analysis::record_escape_points): ... this one.
5103 (analyze_parms): Updat
5104 (ipa_merge_modref_summary_after_inlining): Move to the end of file.
5106 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5108 * gimple.c (gimple_call_static_chain_flags): New function.
5109 * gimple.h (gimple_call_static_chain_flags): Declare
5110 * ipa-modref.c (modref_summary::modref_summary): Initialize
5112 (modref_summary_lto::modref_summary_lto): Likewise.
5113 (modref_summary::useful_p): Test static_chain_flags.
5114 (modref_summary_lto::useful_p): Likewise.
5115 (struct modref_summary_lto): Add static_chain_flags.
5116 (modref_summary::dump): Dump static_chain_flags.
5117 (modref_summary_lto::dump): Likewise.
5118 (struct escape_point): Add static_cahin_arg.
5119 (analyze_ssa_name_flags): Use gimple_call_static_chain_flags.
5120 (analyze_parms): Handle static chains.
5121 (modref_summaries::duplicate): Duplicate static_chain_flags.
5122 (modref_summaries_lto::duplicate): Likewise.
5123 (modref_write): Stream static_chain_flags.
5124 (read_section): Likewise.
5125 (modref_merge_call_site_flags): Handle static_chain_flags.
5126 * ipa-modref.h (struct modref_summary): Add static_chain_flags.
5127 * tree-ssa-structalias.c (handle_rhs_call): Use
5128 gimple_static_chain_flags.
5130 2021-11-02 Richard Biener <rguenther@suse.de>
5132 PR tree-optimization/103029
5133 * tree-vect-loop-manip.c (vect_loop_versioning): Ensure
5134 the PHI nodes in the loop maintain their original operand
5137 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5139 * tree-core.h (EAF_NOT_RETURNED_DIRECTLY): New flag.
5140 (EAF_NOREAD): Renumber.
5141 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOT_RETURNED_DIRECTLY.
5142 (remove_useless_eaf_flags): Handle EAF_NOT_RETURNED_DIRECTLY
5143 (deref_flags): Likewise.
5144 (modref_lattice::init): Likewise.
5145 (modref_lattice::merge): Likewise.
5146 (merge_call_lhs_flags): Likewise.
5147 (analyze_ssa_name_flags): Likewise.
5148 (modref_merge_call_site_flags): Likewise.
5149 * tree-ssa-structalias.c (handle_call_arg): Likewise.
5151 2021-11-02 Maciej W. Rozycki <macro@embecosm.com>
5153 * config/riscv/riscv.c (riscv_rtx_costs): Correct a CONST_INT_P
5154 check and remove an unused local variable with shNadd/shNadd.uw
5157 2021-11-02 Tamar Christina <tamar.christina@arm.com>
5159 PR tree-optimization/103007
5160 * tree-vect-slp-patterns.c (complex_fms_pattern::matches): Add elem
5163 2021-11-02 Richard Biener <rguenther@suse.de>
5165 PR middle-end/103038
5166 * fold-const.c (native_interpret_expr): Handle OFFSET_TYPE.
5167 (can_native_interpret_type_p): Likewise.
5168 * internal-fn.c (expand_DEFERRED_INIT): View-convert the
5169 RHS if the LHS is an SSA name.
5171 2021-11-02 Richard Sandiford <richard.sandiford@arm.com>
5173 * langhooks.h (lang_hooks_for_types::simulate_record_decl): New hook.
5174 * langhooks-def.h (lhd_simulate_record_decl): Declare.
5175 (LANG_HOOKS_SIMULATE_RECORD_DECL): Define.
5176 (LANG_HOOKS_FOR_TYPES_INITIALIZER): Include it.
5177 * langhooks.c (lhd_simulate_record_decl): New function.
5179 2021-11-02 Jakub Jelinek <jakub@redhat.com>
5182 * config/i386/i386.c (ix86_vector_mode_supported_p): Reject vector
5183 modes with TImode inner mode if 32-bit.
5185 2021-11-02 liuhongt <hongtao.liu@intel.com>
5187 * doc/sourcebuild.texi (vect_slp_v4qi_store_unalign,
5188 vect_slp_v2hi_store_unalign, vect_slp_v4hi_store_unalign,
5189 vect_slp_v4si_store_unalign): Document efficient target.
5190 (vect_slp_v4qi_store_unalign_1, vect_slp_v8qi_store_unalign_1,
5191 vect_slp_v16qi_store_unalign_1): Ditto.
5192 (vect_slp_v2hi_store_align,vect_slp_v2qi_store_align,
5193 vect_slp_v2si_store_align, vect_slp_v4qi_store_align): Ditto.
5194 (struct_4char_block_move, struct_8char_block_move,
5195 struct_16char_block_move): Ditto.
5197 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
5198 Jakub Jelinek <jakub@redhat.com>
5201 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti,
5202 ix86_expand_ti_to_v1ti): New helper functions.
5203 (ix86_expand_v1ti_shift): Check if the amount operand is an
5204 integer constant, and expand as a TImode shift if it isn't.
5205 (ix86_expand_v1ti_rotate): Check if the amount operand is an
5206 integer constant, and expand as a TImode rotate if it isn't.
5207 (ix86_expand_v1ti_ashiftrt): New function to expand arithmetic
5208 right shifts of V1TImode quantities.
5209 * config/i386/i386-protos.h (ix86_expand_v1ti_ashift): Prototype.
5210 * config/i386/sse.md (ashlv1ti3, lshrv1ti3): Change constraints
5211 to QImode general_operand, and let the helper functions lower
5212 shifts by non-constant operands, as TImode shifts. Make
5213 conditional on TARGET_64BIT.
5214 (ashrv1ti3): New expander calling ix86_expand_v1ti_ashiftrt.
5215 (rotlv1ti3, rotrv1ti3): Change shift operand to QImode.
5216 Make conditional on TARGET_64BIT.
5218 2021-11-02 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
5220 * config/s390/s390.md ("*cc_to_int", "tabort", "*tabort_1",
5221 "*tabort_1_plus"): Remove operands non-null check.
5223 2021-11-01 Martin Liska <mliska@suse.cz>
5225 * opt-functions.awk: Add new sanity checking.
5226 * optc-gen.awk: Add new argument to integer_range_info.
5227 * params.opt: Update 2 params which have negative IntegerRange.
5229 2021-11-01 qing zhao <qing.zhao@oracle.com>
5231 * gimplify.c (gimplify_decl_expr): Do not add call to
5232 __builtin_clear_padding when a variable is a gimple register
5233 or it might not have padding.
5234 (gimplify_init_constructor): Likewise.
5236 2021-11-01 Tamar Christina <tamar.christina@arm.com>
5238 * config/arm/aarch-common-protos.h (struct vector_cost_table): Add
5239 movi, dup and extract costing fields.
5240 * config/aarch64/aarch64-cost-tables.h (qdf24xx_extra_costs,
5241 thunderx_extra_costs, thunderx2t99_extra_costs,
5242 thunderx3t110_extra_costs, tsv110_extra_costs, a64fx_extra_costs): Use
5244 * config/arm/aarch-cost-tables.h (generic_extra_costs,
5245 cortexa53_extra_costs, cortexa57_extra_costs, cortexa76_extra_costs,
5246 exynosm1_extra_costs, xgene1_extra_costs): Likewise
5247 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>): Add r->w dup.
5248 * config/aarch64/aarch64.c (aarch64_rtx_costs): Add extra costs.
5250 2021-11-01 Tamar Christina <tamar.christina@arm.com>
5252 * cse.c (add_to_set): New.
5253 (find_sets_in_insn): Register constants in sets.
5254 (canonicalize_insn): Use auto_vec instead.
5255 (cse_insn): Try materializing using vec_dup.
5256 * rtl.h (simplify_context::simplify_gen_vec_select,
5257 simplify_gen_vec_select): New.
5258 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): New.
5260 2021-11-01 David Malcolm <dmalcolm@redhat.com>
5262 * common.opt (fdiagnostics-escape-format=): New.
5263 (diagnostics_escape_format): New enum.
5264 (DIAGNOSTICS_ESCAPE_FORMAT_UNICODE): New enum value.
5265 (DIAGNOSTICS_ESCAPE_FORMAT_BYTES): Likewise.
5266 * diagnostic-format-json.cc (json_end_diagnostic): Add
5267 "escape-source" attribute.
5268 * diagnostic-show-locus.c
5269 (exploc_with_display_col::exploc_with_display_col): Replace
5270 "tabstop" param with a cpp_char_column_policy and add an "aspect"
5271 param. Use these to compute m_display_col accordingly.
5272 (struct char_display_policy): New struct.
5273 (layout::m_policy): New field.
5274 (layout::m_escape_on_output): New field.
5275 (def_policy): New function.
5276 (make_range): Update for changes to exploc_with_display_col ctor.
5277 (default_print_decoded_ch): New.
5278 (width_per_escaped_byte): New.
5279 (escape_as_bytes_width): New.
5280 (escape_as_bytes_print): New.
5281 (escape_as_unicode_width): New.
5282 (escape_as_unicode_print): New.
5284 (layout::layout): Initialize new fields. Update m_exploc ctor
5285 call for above change to ctor.
5286 (layout::maybe_add_location_range): Update for changes to
5287 exploc_with_display_col ctor.
5288 (layout::calculate_x_offset_display): Update for change to
5290 (layout::print_source_line): Pass policy
5291 to cpp_display_width_computation. Capture cpp_decoded_char when
5292 calling process_next_codepoint. Move printing of source code to
5293 m_policy.m_print_cb.
5294 (line_label::line_label): Pass in policy rather than context.
5295 (layout::print_any_labels): Update for change to line_label ctor.
5296 (get_affected_range): Pass in policy rather than context, updating
5297 calls to location_compute_display_column accordingly.
5298 (get_printed_columns): Likewise, also for cpp_display_width.
5299 (correction::correction): Pass in policy rather than tabstop.
5300 (correction::compute_display_cols): Pass m_policy rather than
5301 m_tabstop to cpp_display_width.
5302 (correction::m_tabstop): Replace with...
5303 (correction::m_policy): ...this.
5304 (line_corrections::line_corrections): Pass in policy rather than
5306 (line_corrections::m_context): Replace with...
5307 (line_corrections::m_policy): ...this.
5308 (line_corrections::add_hint): Update to use m_policy rather than
5310 (line_corrections::add_hint): Likewise.
5311 (layout::print_trailing_fixits): Likewise.
5312 (selftest::test_display_widths): New.
5313 (selftest::test_layout_x_offset_display_utf8): Update to use
5314 policy rather than tabstop.
5315 (selftest::test_one_liner_labels_utf8): Add test of escaping
5317 (selftest::test_diagnostic_show_locus_one_liner_utf8): Update to
5318 use policy rather than tabstop.
5319 (selftest::test_overlapped_fixit_printing): Likewise.
5320 (selftest::test_overlapped_fixit_printing_utf8): Likewise.
5321 (selftest::test_overlapped_fixit_printing_2): Likewise.
5322 (selftest::test_tab_expansion): Likewise.
5323 (selftest::test_escaping_bytes_1): New.
5324 (selftest::test_escaping_bytes_2): New.
5325 (selftest::diagnostic_show_locus_c_tests): Call the new tests.
5326 * diagnostic.c (diagnostic_initialize): Initialize
5327 context->escape_format.
5328 (convert_column_unit): Update to use default character width policy.
5329 (selftest::test_diagnostic_get_location_text): Likewise.
5330 * diagnostic.h (enum diagnostics_escape_format): New enum.
5331 (diagnostic_context::escape_format): New field.
5332 * doc/invoke.texi (-fdiagnostics-escape-format=): New option.
5333 (-fdiagnostics-format=): Add "escape-source" attribute to examples
5334 of JSON output, and document it.
5335 * input.c (location_compute_display_column): Pass in "policy"
5336 rather than "tabstop", passing to
5337 cpp_byte_column_to_display_column.
5338 (selftest::test_cpp_utf8): Update to use cpp_char_column_policy.
5339 * input.h (class cpp_char_column_policy): New forward decl.
5340 (location_compute_display_column): Pass in "policy" rather than
5342 * opts.c (common_handle_option): Handle
5343 OPT_fdiagnostics_escape_format_.
5344 * selftest.c (temp_source_file::temp_source_file): New ctor
5345 overload taking a size_t.
5346 * selftest.h (temp_source_file::temp_source_file): Likewise.
5348 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
5350 * dbgcnt.def: Add debug counter for back_thread[12] and
5351 back_threadfull[12].
5352 * passes.def: Pass "first" argument to each back threading pass.
5353 * tree-ssa-threadbackward.c (back_threader::back_threader): Add
5355 (back_threader::debug_counter): New.
5356 (back_threader::maybe_register_path): Call debug_counter.
5358 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
5360 * tree-ssa-threadbackward.c (BT_NONE): New.
5363 (back_threader::back_threader): Add flags.
5364 Move loop initialization here.
5365 (back_threader::~back_threader): New.
5366 (back_threader::find_taken_edge_switch): Change solver and ranger
5368 (back_threader::find_taken_edge_cond): Same.
5369 (back_threader::find_paths_to_names): Same.
5370 (back_threader::find_paths): Same.
5371 (back_threader::dump): Same.
5372 (try_thread_blocks): Merge into thread_blocks.
5373 (back_threader::thread_blocks): New.
5374 (do_early_thread_jumps): Merge into thread_blocks.
5375 (do_thread_jumps): Merge into thread_blocks.
5376 (back_threader::thread_through_all_blocks): Remove.
5378 2021-11-01 Andrew MacLeod <amacleod@redhat.com>
5380 PR tree-optimization/103003
5381 * value-relation.cc (dom_oracle::register_relation): If the 2
5382 ssa names are the same, don't register any relation.
5384 2021-11-01 Dan Li <ashimida@linux.alibaba.com>
5386 * config/aarch64/aarch64.c (aarch64_expand_epilogue): Remove
5387 redundant check for calls_eh_return.
5388 * config/aarch64/aarch64.md (*do_return): Likewise.
5390 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
5392 * cfghooks.c (cfg_hook_duplicate_loop_to_header_edge): Rename
5393 duplicate_loop_to_header_edge to
5394 duplicate_loop_body_to_header_edge.
5395 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
5396 * cfghooks.h (struct cfg_hooks): Likewise.
5397 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
5398 * cfgloopmanip.c (duplicate_loop_body_to_header_edge): Likewise.
5399 (clone_loop_to_header_edge): Likewise.
5400 * cfgloopmanip.h (duplicate_loop_body_to_header_edge): Likewise.
5401 * cfgrtl.c (struct cfg_hooks): Likewise.
5402 * doc/loop.texi: Likewise.
5403 * loop-unroll.c (unroll_loop_constant_iterations): Likewise.
5404 (unroll_loop_runtime_iterations): Likewise.
5405 (unroll_loop_stupid): Likewise.
5406 (apply_opt_in_copies): Likewise.
5407 * tree-cfg.c (struct cfg_hooks): Likewise.
5408 * tree-ssa-loop-ivcanon.c (try_unroll_loop_completely): Likewise.
5409 (try_peel_loop): Likewise.
5410 * tree-ssa-loop-manip.c (copy_phi_node_args): Likewise.
5411 (gimple_duplicate_loop_body_to_header_edge): Likewise.
5412 (tree_transform_and_unroll_loop): Likewise.
5413 * tree-ssa-loop-manip.h (gimple_duplicate_loop_body_to_header_edge):
5416 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
5418 * cfgloopmanip.c (loop_version): Refactor loopify to
5419 loop_version. Move condition generation after loopify.
5421 * cfgloopmanip.h (loopify): Delete.
5423 2021-10-31 Jan Hubicka <hubicka@ucw.cz>
5425 * ipa-fnsummary.c: Include tree-dfa.h.
5426 (points_to_local_or_readonly_memory_p): Return true on return
5428 * ipa-modref.c (analyze_ssa_name_flags): Fix handling of copy
5431 2021-10-30 Tobias Burnus <tobias@codesourcery.com>
5433 PR middle-end/102972
5434 * omp-low.c (omp_runtime_api_call): Use DECL_ASSEMBLER_NAME to get
5435 internal Fortran name; new permit_num_teams arg to permit
5436 omp_get_num_teams and omp_get_team_num.
5437 (scan_omp_1_stmt): Update call to it, add missing call for
5438 reverse offload, and check for strictly nested API calls in teams.
5440 2021-10-30 Jakub Jelinek <jakub@redhat.com>
5442 * gimplify.c (gimplify_omp_for): Diagnose threadprivate iterators.
5444 2021-10-29 David Malcolm <dmalcolm@redhat.com>
5446 * selftest.c (assert_streq): Add newlines when emitting non-equal
5449 2021-10-29 David Malcolm <dmalcolm@redhat.com>
5451 * Makefile.in: Fix syntax for reference to LIBDEPS in
5454 2021-10-29 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
5456 * doc/install.texi: Bump required minimum DejaGnu version.
5458 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
5459 Andrew MacLeod <amacleod@redhat.com>
5461 * value-relation.cc (path_oracle::killing_def): Add a
5462 self-equivalence so we don't look to the root oracle.
5464 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
5466 * passes.def: Replace the pass_thread_jumps before VRP* with
5467 pass_thread_jumps_full. Remove all pass_vrp_threader instances.
5468 * tree-ssa-threadbackward.c (pass_data_thread_jumps_full):
5469 Remove hyphen from "thread-full" name.
5471 2021-10-29 Andrew MacLeod <amacleod@redhat.com>
5473 PR tree-optimization/102983
5474 * gimple-range-cache.h (propagate_updated_value): Make public.
5475 * gimple-range.cc (gimple_ranger::range_of_stmt): Propagate exports
5476 when processing gcond stmts.
5478 2021-10-29 Jan Hubicka <hubicka@ucw.cz>
5480 * gimple.c (gimple_call_retslot_flags): New function.
5481 * gimple.h (gimple_call_retslot_flags): Declare.
5482 * ipa-modref.c: Include tree-cfg.h.
5483 (struct escape_entry): Turn parm_index to signed.
5484 (modref_summary_lto::modref_summary_lto): Add retslot_flags.
5485 (modref_summary::modref_summary): Initialize retslot_flags.
5486 (struct modref_summary_lto): Likewise.
5487 (modref_summary::useful_p): Check retslot_flags.
5488 (modref_summary_lto::useful_p): Likewise.
5489 (modref_summary::dump): Dump retslot_flags.
5490 (modref_summary_lto::dump): Likewise.
5491 (struct escape_point): Add hidden_args enum.
5492 (analyze_ssa_name_flags): Ignore return slot return;
5493 use gimple_call_retslot_flags.
5494 (record_escape_points): Break out from ...
5495 (analyze_parms): ... here; handle retslot_flags.
5496 (modref_summaries::duplicate): Duplicate retslot_flags.
5497 (modref_summaries_lto::duplicate): Likewise.
5498 (modref_write_escape_summary): Stream parm_index as signed.
5499 (modref_read_escape_summary): Likewise.
5500 (modref_write): Stream retslot_flags.
5501 (read_section): Likewise.
5502 (struct escape_map): Fix typo in comment.
5503 (update_escape_summary_1): Fix whitespace.
5504 (ipa_merge_modref_summary_after_inlining): Drop retslot_flags.
5505 (modref_merge_call_site_flags): Merge retslot_flags.
5506 * ipa-modref.h (struct modref_summary): Add retslot_flags.
5507 * tree-ssa-structalias.c (handle_rhs_call): Handle retslot_flags.
5509 2021-10-29 Tamar Christina <tamar.christina@arm.com>
5511 PR tree-optimization/102977
5512 * tree-vect-slp-patterns.c (vect_match_call_p): Remove.
5513 (vect_detect_pair_op): Add crosslane check.
5514 (vect_match_call_complex_mla): Remove.
5515 (class complex_mul_pattern): Update comment.
5516 (complex_mul_pattern::matches): Update detection.
5517 (class complex_fma_pattern): Remove.
5518 (complex_fma_pattern::matches): Remove.
5519 (complex_fma_pattern::recognize): Remove.
5520 (complex_fma_pattern::build): Remove.
5521 (class complex_fms_pattern): Update comment.
5522 (complex_fms_pattern::matches): Remove.
5523 (complex_operations_pattern::recognize): Remove complex_fma_pattern
5525 2021-10-29 Jakub Jelinek <jakub@redhat.com>
5527 * gimple-fold.c (gimple_fold_builtin_memset): Copy over location from
5530 2021-10-29 Xionghu Luo <luoxhu@linux.ibm.com>
5533 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const): Add
5534 patterns match and emit for VSX xxpermdi.
5536 2021-10-29 liuhongt <hongtao.liu@intel.com>
5539 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF): New
5541 (V16HF_FTYPE_V16HF): Ditto.
5542 (V32HF_FTYPE_V32HF): Ditto.
5543 (V8HF_FTYPE_V8HF_ROUND): Ditto.
5544 (V16HF_FTYPE_V16HF_ROUND): Ditto.
5545 (V32HF_FTYPE_V32HF_ROUND): Ditto.
5546 * config/i386/i386-builtin.def ( IX86_BUILTIN_FLOORPH,
5547 IX86_BUILTIN_CEILPH, IX86_BUILTIN_TRUNCPH,
5548 IX86_BUILTIN_FLOORPH256, IX86_BUILTIN_CEILPH256,
5549 IX86_BUILTIN_TRUNCPH256, IX86_BUILTIN_FLOORPH512,
5550 IX86_BUILTIN_CEILPH512, IX86_BUILTIN_TRUNCPH512): New builtin.
5551 * config/i386/i386-builtins.c
5552 (ix86_builtin_vectorized_function): Enable vectorization for
5553 HFmode FLOOR/CEIL/TRUNC operation.
5554 * config/i386/i386-expand.c (ix86_expand_args_builtin): Handle
5556 * config/i386/sse.md (rint<mode>2, nearbyint<mode>2): Extend
5559 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
5560 Andrew MacLeod <amacleod@redhat.com>
5562 * value-relation.cc (path_oracle::killing_def): Walk the
5563 equivalency list and remove SSA from any equivalencies.
5565 2021-10-28 Stafford Horne <shorne@gmail.com>
5567 * config/or1k/or1k.h (PROFILE_HOOK): Add return address argument
5570 2021-10-28 Jakub Jelinek <jakub@redhat.com>
5572 PR tree-optimization/102951
5573 * fold-const.h (address_compare): Declare.
5574 * fold-const.c (address_compare): New function.
5575 * match.pd (cmp (convert1?@2 addr@0) (convert2? addr@1)): Use
5576 address_compare helper.
5577 (minmax cmp (convert1?@2 addr@0) (convert2?@3 addr@1)): New
5580 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
5582 * vr-values.c (simplify_using_ranges::fold_cond): Change fold message.
5584 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
5586 PR tree-optimization/102940
5587 * tree-vrp.c (execute_ranger_vrp): Reset scev.
5589 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
5591 * config/nios2/linux.h (MUSL_DYNAMIC_LINKER): Add musl linker
5593 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
5595 * configure: Regenerate.
5596 * configure.ac: Use CPPFLAGS_FOR_BUILD for GMPINC
5598 2021-10-28 Eric Botcazou <ebotcazou@adacore.com>
5600 * doc/invoke.texi (%X): Remove obsolete reference to -Wl.
5602 2021-10-28 Richard Biener <rguenther@suse.de>
5605 * fold-const.c (fold_convert_const): Avoid int to float
5606 constant folding with -frounding-math and inexact result.
5607 * simplify-rtx.c (simplify_const_unary_operation): Likewise
5608 for both float and unsigned_float.
5610 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
5612 * tree-ssa-threadbackward.c
5613 (back_threader::find_taken_edge_switch): Use find_case_label_range
5614 instead of find_taken_edge.
5616 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
5618 * tree-ssa-threadbackward.c (class back_threader_registry):
5619 Inherit from back_jt_path_registry.
5620 (back_threader_registry::thread_through_all_blocks): Remove.
5621 (back_threader_registry::register_path): Remove
5622 m_lowlevel_registry prefix.
5624 2021-10-28 Richard Biener <rguenther@suse.de>
5627 * fold-const.c (fold_convert_const_real_from_real): Honor
5628 -frounding-math if the conversion is not exact.
5629 * simplify-rtx.c (simplify_const_unary_operation): Do not
5630 simplify FLOAT_TRUNCATE with sign dependent rounding.
5632 2021-10-28 Richard Biener <rguenther@suse.de>
5634 PR tree-optimization/102949
5635 * tree-vect-stmts.c (ensure_base_align): Look at the
5636 dr_info of a group leader and assert we are looking at
5637 one with analyzed alignment.
5639 2021-10-28 Kewen Lin <linkw@linux.ibm.com>
5642 * config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Consider
5643 V1T1 mode for unaligned load and store.
5645 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
5647 * config/riscv/riscv.md (zero_extendsidi2_internal): Allow ZBB
5650 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
5652 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add `i` to
5655 2021-10-28 Alexandre Oliva <oliva@adacore.com>
5657 * common.opt (fharden-compares): New.
5658 (fharden-conditional-branches): New.
5659 * doc/invoke.texi: Document new options.
5660 * gimple-harden-conditionals.cc: New.
5661 * Makefile.in (OBJS): Build it.
5662 * passes.def: Add new passes.
5663 * tree-pass.h (make_pass_harden_compares): Declare.
5664 (make_pass_harden_conditional_branches): Declare.
5666 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
5669 * config/rs6000/altivec.md: Add vsx register constraints.
5670 * config/rs6000/vsx.md (vsx_xxsel<mode>): Delete.
5671 (vsx_xxsel<mode>2): Likewise.
5672 (vsx_xxsel<mode>3): Likewise.
5673 (vsx_xxsel<mode>4): Likewise.
5675 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
5678 * config/rs6000/altivec.md (*altivec_vsel<mode>): Change to ...
5679 (altivec_vsel<mode>): ... this and update define.
5680 (*altivec_vsel<mode>_uns): Delete.
5681 (altivec_vsel<mode>2): New define_insn.
5682 (altivec_vsel<mode>3): Likewise.
5683 (altivec_vsel<mode>4): Likewise.
5684 * config/rs6000/rs6000-call.c (altivec_expand_vec_sel_builtin): New.
5685 (altivec_expand_builtin): Call altivec_expand_vec_sel_builtin to expand
5687 * config/rs6000/rs6000.c (rs6000_emit_vector_cond_expr): Use bit-wise
5688 selection instead of per element.
5689 * config/rs6000/vector.md:
5690 * config/rs6000/vsx.md (*vsx_xxsel<mode>): Change to ...
5691 (vsx_xxsel<mode>): ... this and update define.
5692 (*vsx_xxsel<mode>_uns): Delete.
5693 (vsx_xxsel<mode>2): New define_insn.
5694 (vsx_xxsel<mode>3): Likewise.
5695 (vsx_xxsel<mode>4): Likewise.
5697 2021-10-28 Hongyu Wang <hongyu.wang@intel.com>
5699 * config/i386/i386.c (use_rsqrt_p): Add mode parameter, enable
5700 HFmode rsqrt without TARGET_SSE_MATH.
5701 (ix86_optab_supported_p): Refactor rint, adjust floor, ceil,
5702 btrunc condition to be restricted by -ftrapping-math, adjust
5703 use_rsqrt_p function call.
5704 * config/i386/i386.md (rcphf2): New define_insn.
5705 (rsqrthf2): Likewise.
5706 * config/i386/sse.md (div<mode>3): Change VF2H to VF2.
5707 (div<mode>3): New expander for HF mode.
5708 (rsqrt<mode>2): Likewise.
5709 (*avx512fp16_vmrcpv8hf2): New define_insn for rpad pass.
5710 (*avx512fp16_vmrsqrtv8hf2): Likewise.
5712 2021-10-27 Saagar Jha <saagar@saagarjha.com>
5714 * config.gcc: Adjust for Darwin21.
5715 * config/darwin-c.c (macosx_version_as_macro): Likewise.
5716 * config/darwin-driver.c (validate_macosx_version_min):
5718 (darwin_find_version_from_kernel): Likewise.
5720 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
5722 * gimple-range-path.cc
5723 (path_range_query::range_defined_in_block): Call killing_def.
5725 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
5727 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Dump
5728 operands as well as relation.
5729 * gimple-range-path.cc
5730 (path_range_query::compute_ranges_in_block): Compute PHI relations
5731 first. Compute outgoing relations at the end.
5732 (path_range_query::compute_ranges): Remove call to compute_relations.
5733 (path_range_query::compute_relations): Remove.
5734 (path_range_query::maybe_register_phi_relation): New.
5735 (path_range_query::compute_phi_relations): Abstract out
5736 registering one PHI relation to...
5737 (path_range_query::compute_outgoing_relations): ...here.
5738 * gimple-range-path.h (class path_range_query): Remove
5740 Add maybe_register_phi_relation.
5742 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
5743 Andrew MacLeod <amacleod@redhat.com>
5745 * value-relation.cc (path_oracle::killing_def): Kill second
5748 2021-10-27 Martin Jambor <mjambor@suse.cz>
5750 * ipa-cp.c (good_cloning_opportunity_p): Decide whether to use
5751 profile feedback depending on their local availability.
5753 2021-10-27 Martin Jambor <mjambor@suse.cz>
5755 * params.opt (param_ipa_cp_profile_count_base): New parameter.
5756 * doc/invoke.texi (Optimize Options): Add entry for
5757 ipa-cp-profile-count-base.
5758 * ipa-cp.c (max_count): Replace with base_count, replace all
5759 occurrences too, unless otherwise stated.
5760 (ipcp_cloning_candidate_p): identify mostly-directly called
5761 functions based on their counts, not max_count.
5762 (compare_edge_profile_counts): New function.
5763 (ipcp_propagate_stage): Instead of setting max_count, find the
5764 appropriate edge count in a sorted vector of counts of eligible
5765 edges and make it the base_count.
5767 2021-10-27 Martin Jambor <mjambor@suse.cz>
5769 * ipa-cp.c (struct caller_statistics): New fields rec_count_sum,
5770 n_nonrec_calls and itself, document all fields.
5771 (init_caller_stats): Initialize the above new fields.
5772 (gather_caller_stats): Gather self-recursive counts and calls number.
5773 (get_info_about_necessary_edges): Gather counts of self-recursive and
5774 other edges bringing in the requested value separately.
5775 (dump_profile_updates): Rework to dump info about a single node only.
5776 (lenient_count_portion_handling): New function.
5777 (struct gather_other_count_struct): New type.
5778 (gather_count_of_non_rec_edges): New function.
5779 (struct desc_incoming_count_struct): New type.
5780 (analyze_clone_icoming_counts): New function.
5781 (adjust_clone_incoming_counts): Likewise.
5782 (update_counts_for_self_gen_clones): Likewise.
5783 (update_profiling_info): Rewritten.
5784 (update_specialized_profile): Adjust call to dump_profile_updates.
5785 (create_specialized_node): Do not update profiling info.
5786 (decide_about_value): New parameter self_gen_clones, either push new
5787 clones into it or updat their profile counts. For self-recursively
5788 generated values, use a portion of the node count instead of count
5789 from self-recursive edges to estimate goodness.
5790 (decide_whether_version_node): Gather clones for self-generated values
5791 in a new vector, update their profiles at once at the end.
5793 2021-10-27 Richard Biener <rguenther@suse.de>
5795 * tree-vectorizer.c (vect_transform_loops): New function,
5797 (try_vectorize_loop_1): ... here. Simplify as epilogues
5798 are now fully handled in the split part.
5800 2021-10-27 Jakub Jelinek <jakub@redhat.com>
5802 * omp-expand.c (expand_omp_for_init_counts): Handle non-rectangular
5803 iterators with pointer types.
5804 (expand_omp_for_init_vars, extract_omp_for_update_vars): Likewise.
5806 2021-10-26 Martin Sebor <msebor@redhat.com>
5808 PR tree-optimization/102238
5809 PR tree-optimization/102919
5810 * gimple-ssa-sprintf.c (get_string_length): Add an argument.
5811 (array_elt_at_offset): Move to pointer-query.
5812 (set_aggregate_size_and_offset): New function.
5813 (field_at_offset): Move to pointer-query.
5814 (get_origin_and_offset): Rename...
5815 (get_origin_and_offset_r): this. Add an argument. Make aggregate
5816 handling more robust.
5817 (get_origin_and_offset): New.
5818 (alias_offset): Add an argument.
5819 (format_string): Use subobject size determined by get_origin_and_offset.
5820 * pointer-query.cc (field_at_offset): Move from gimple-ssa-sprintf.c.
5821 Improve/correct handling of aggregates.
5822 (array_elt_at_offset): Same.
5823 * pointer-query.h (field_at_offset): Declare.
5824 (array_elt_at_offset): Declare.
5826 2021-10-26 Martin Sebor <msebor@redhat.com>
5828 * builtins.c (check_strncat_sizes): Pass access_data ctor additional
5830 (expand_builtin_memcmp): Move code to gimple-ssa-warn-access.cc.
5831 (expand_builtin_fork_or_exec): Same.
5832 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Pass
5833 compute_objsize additional arguments.
5834 (inbounds_memaccess_p): Same.
5835 (array_bounds_checker::check_array_bounds): Add an assert. Stash
5836 statement in a member.
5837 (check_array_bounds_dom_walker::before_dom_children): Same.
5838 * gimple-array-bounds.h (array_bounds_checker::m_stmt): New member.
5839 * gimple-ssa-sprintf.c (get_destination_size): Add an argument.
5840 (handle_printf_call): Pass a new argument.
5841 * gimple-ssa-warn-access.cc (get_size_range): Add an argument.
5842 (check_access): Add an argument and pass it along to callees.
5843 (check_read_access): Make a member function.
5844 (pass_waccess::check_strcat): Pass access_data ctor additional
5846 (pass_waccess::check_strncat): Same.
5847 (pass_waccess::check_stxcpy): Same.
5848 (pass_waccess::check_stxncpy): Same.
5849 (pass_waccess::check_strncmp): Same.
5850 (pass_waccess::check_read_access): Same.
5851 (pass_waccess::check_builtin): Same.
5852 (pass_waccess::maybe_check_access_sizes): Same.
5853 (pass_waccess::maybe_check_dealloc_call): Same.
5854 * gimple-ssa-warn-access.h (check_read_access): Declare a new
5856 * pointer-query.cc (compute_objsize_r): Add an argument.
5857 (gimple_call_return_array): Same.
5858 (gimple_call_alloc_size): Same.
5859 (access_ref::access_ref): Same.
5860 (access_ref::get_ref): Same.
5861 (pointer_query::get_ref): Same.
5862 (handle_min_max_size): Pass an arguments to callees.
5863 (handle_array_ref): Add an argument.
5864 (handle_mem_ref): Same.
5865 (compute_objsize): Same.
5866 * pointer-query.h (struct access_ref): Adjust signatures.
5867 (struct access_data): Same.
5868 (gimple_call_alloc_size): Add an argument.
5869 (gimple_parm_array_size): Same.
5870 (compute_objsize): Same.
5871 * tree-ssa-strlen.c (strlen_pass::adjust_last_stmt): Pass an additional
5872 argument to compute_objsize.
5873 (strlen_pass::maybe_warn_overflow): Same.
5874 (maybe_diag_stxncpy_trunc): Same.
5876 2021-10-26 Martin Sebor <msebor@redhat.com>
5878 PR middle-end/102453
5879 * gimple-ssa-warn-access.cc (pass_waccess::check_atomic_builtin): New.
5880 (pass_waccess::check_atomic_builtin): Call it.
5882 2021-10-26 Vladimir N. Makarov <vmakarov@redhat.com>
5884 PR rtl-optimization/102842
5885 * lra-constraints.c (match_reload): Ignore out in checking values
5887 (curr_insn_transform): Collect outputs before doing reloads of operands.
5889 2021-10-26 Paul A. Clarke <pc@us.ibm.com>
5892 * config/rs6000/x86intrin.h: Move some included headers to new
5893 headers. Include new immintrin.h instead of those headers.
5894 * config/rs6000/immintrin.h: New.
5895 * config/rs6000/x86gprintrin.h: New.
5896 * config.gcc (powerpc*-*-*): Add new headers to extra_headers.
5898 2021-10-26 Richard Biener <rguenther@suse.de>
5900 * tree-vectorizer.h (vect_create_addr_base_for_vector_ref):
5901 Remove byte_offset parameter.
5902 (vect_create_data_ref_ptr): Likewise.
5903 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
5905 (vect_create_data_ref_ptr): Likewise.
5906 * tree-vect-stmts.c (vectorizable_store): Adjust.
5907 (vectorizable_load): Likewise.
5909 2021-10-26 Richard Biener <rguenther@suse.de>
5911 PR tree-optimization/96109
5912 * tree-vectorizer.h (dr_misalignment): Add optional offset
5914 * tree-vect-data-refs.c (dr_misalignment): Likewise. Remove
5915 offset applied for negative stride accesses.
5916 (vect_enhance_data_refs_alignment): Compute negative stride
5917 access offset and pass it to dr_misalignment.
5918 * tree-vect-stmts.c (get_negative_load_store_type): Pass
5919 negative offset to dr_misalignment.
5920 (get_group_load_store_type): Likewise.
5921 (get_load_store_type): Likewise.
5922 (vectorizable_store): Remove asserts about alignment.
5923 (vectorizable_load): Likewise.
5925 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
5927 PR tree-optimization/102897
5928 * tree-ssa-forwprop.c (simplify_permutation): Remove a wrong assertion.
5930 2021-10-26 Richard Biener <rguenther@suse.de>
5932 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
5933 Take offset in bytes.
5934 (vect_create_data_ref_ptr): Likewise.
5935 * tree-vect-loop-manip.c (get_misalign_in_elems): Multiply
5936 offset by element size.
5937 (vect_create_cond_for_align_checks): Likewise.
5938 * tree-vect-stmts.c (get_negative_load_store_type): Likewise.
5939 (vectorizable_load): Remove duplicate leftover from merge
5942 2021-10-26 Roger Sayle <roger@nextmovesoftware.com>
5944 * config/i386/i386-expand.c (ix86_expand_v1ti_shift): New helper
5945 function to expand V1TI mode logical shifts by integer constants.
5946 (ix86_expand_v1ti_rotate): New helper function to expand V1TI
5947 mode rotations by integer constants.
5948 * config/i386/i386-protos.h (ix86_expand_v1ti_shift,
5949 ix86_expand_v1ti_rotate): Prototype new functions here.
5950 * config/i386/sse.md (ashlv1ti3, lshrv1ti3, rotlv1ti3, rotrv1ti3):
5951 New TARGET_SSE2 expanders to implement V1TI shifts and rotations.
5953 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
5955 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
5956 Avoid threading circular paths.
5958 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
5960 * tree-ssa-threadbackward.c (back_threader::resolve_phi):
5961 Attempt to resolve all incoming paths to a PHI.
5962 (back_threader::resolve_def): Always return true for PHIs.
5964 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
5966 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
5967 Always try to resolve path without looking back.
5968 * tree-ssa-threadupdate.c (dump_jump_thread): Indidicate whether
5969 edge is a back edge.
5971 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
5973 PR tree-optimization/102789
5974 * tree-vect-loop-manip.c (vect_update_inits_of_drs): Do not
5975 update inits of simd_lane_access.
5977 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
5979 * tree-vrp.c (vrp_simplify_cond_using_ranges): Add return type and
5980 move to vr-values.c.
5981 (simplify_casted_conds): Move to vrp_folder class.
5982 (execute_vrp): Call via vrp_folder now.
5983 * vr-values.c (simplify_cond_using_ranges_1): Call simplify_casted_cond.
5984 (simplify_using_ranges::simplify_casted_cond): Relocate from tree-vrp.c.
5985 * vr-values.h (simplify_casted_cond): Add prototype.
5987 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
5989 * tree-vrp.c (rvrp_folder::fold_stmt): If simplification fails, try
5992 2021-10-25 Paul A. Clarke <pc@us.ibm.com>
5994 * config/rs6000/smmintrin.h (_mm_testz_si128): Add "extern" to
5996 (_mm_testc_si128): Likewise.
5997 (_mm_testnzc_si128): Likewise.
5998 (_mm_blend_ps): Likewise.
5999 (_mm_blendv_ps): Likewise.
6000 (_mm_blend_pd): Likewise.
6001 (_mm_blendv_pd): Likewise.
6002 (_mm_ceil_pd): Likewise.
6003 (_mm_ceil_sd): Likewise.
6004 (_mm_ceil_ps): Likewise.
6005 (_mm_ceil_ss): Likewise.
6006 (_mm_floor_pd): Likewise.
6007 (_mm_floor_sd): Likewise.
6008 (_mm_floor_ps): Likewise.
6009 (_mm_floor_ss): Likewise.
6010 (_mm_minpos_epu16): Likewise.
6011 (_mm_mul_epi32): Likewise.
6012 (_mm_cvtepi8_epi16): Likewise.
6013 (_mm_packus_epi32): Likewise.
6014 (_mm_cmpgt_epi64): Likewise.
6016 2021-10-25 Roger Sayle <roger@nextmovesoftware.com>
6018 * simplify-rtx.c (simplify_binary_operation_1) [SS_ASHIFT]: Simplify
6019 shifts of the mode's smin_value and smax_value when the bit count
6020 operand doesn't have side-effects.
6021 [US_ASHIFT]: Likewise, simplify shifts of the mode's umax_value
6022 when the bit count operand doesn't have side-effects.
6023 (simplify_const_binary_operation) [SS_ASHIFT, US_ASHIFT]: Perform
6024 compile-time evaluation of saturating left shifts with constant
6027 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6029 * gimple-range.cc (gimple_ranger::export_global_ranges): Remove check
6032 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6034 * flag-types.h (enum ranger_debug): Adjust values.
6035 * params.opt (ranger_debug): Ditto.
6037 2021-10-25 Martin Jambor <mjambor@suse.cz>
6039 PR tree-optimization/102886
6040 * tree-sra.c (totally_scalarize_subtree): Fix the out of
6043 2021-10-25 Andrew Pinski <apinski@marvell.com>
6045 * tree-ssa-dce.c (simple_dce_from_worklist):
6046 Check stmt_unremovable_because_of_non_call_eh_p also
6047 before removing the statement.
6049 2021-10-25 Richard Biener <rguenther@suse.de>
6051 PR tree-optimization/102905
6052 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
6053 Use vect_supportable_dr_alignment again to determine whether
6054 an access is supported when not aligned.
6056 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6058 * config/riscv/riscv.c (riscv_rtx_costs): Handle cost model
6061 2021-10-25 Jim Wilson <jimw@sifive.com>
6062 Kito Cheng <kito.cheng@sifive.com>
6063 Jia-Wei Chen <jiawei@iscas.ac.cn>
6064 Shi-Hua Liao <shihua@iscas.ac.cn>
6066 * config/riscv/bitmanip.md (shiftm1): New.
6067 (*bset<mode>): Ditto.
6068 (*bset<mode>_mask): Ditto.
6069 (*bset<mode>_1): Ditto.
6070 (*bset<mode>_1_mask): Ditto.
6071 (*bseti<mode>): Ditto.
6072 (*bclr<mode>): Ditto.
6073 (*bclri<mode>): Ditto.
6074 (*binv<mode>): Ditto.
6075 (*binvi<mode>): Ditto.
6076 (*bext<mode>): Ditto.
6078 * config/riscv/predicates.md (splittable_const_int_operand):
6080 (single_bit_mask_operand): New.
6081 (not_single_bit_mask_operand): Ditto.
6082 (const31_operand): Ditto.
6083 (const63_operand): Ditto.
6084 * config/riscv/riscv.c (riscv_build_integer_1): Handle bseti.
6085 (riscv_output_move): Ditto.
6086 (riscv_print_operand): Handle new operand type: T and S.
6087 * config/riscv/riscv.h (SINGLE_BIT_MASK_OPERAND): New.
6089 2021-10-25 Jim Wilson <jimw@sifive.com>
6091 * config/riscv/riscv.c (riscv_build_integer_1): Build integer
6094 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6096 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
6098 (riscv_rtx_costs): Ditto.
6100 2021-10-25 Jim Wilson <jimw@sifive.com>
6101 Kito Cheng <kito.cheng@sifive.com>
6102 Jia-Wei Chen <jiawei@iscas.ac.cn>
6104 * config/riscv/bitmanip.md (bitmanip_bitwise): New.
6105 (bitmanip_minmax): New.
6106 (clz_ctz_pcnt): New.
6107 (bitmanip_optab): New.
6108 (bitmanip_insn): New.
6109 (*<optab>_not<mode>): New.
6110 (*xor_not<mode>): New.
6111 (<bitmanip_optab>si2): New.
6112 (*<bitmanip_optab>disi2): New.
6113 (<bitmanip_optab>di2): New.
6114 (*zero_extendhi<GPR:mode>2_bitmanip): New.
6115 (*extend<SHORT:mode><SUPERQI:mode>2_zbb): New.
6116 (*zero_extendhi<GPR:mode>2_zbb): New.
6119 (rotrsi3_sext): New.
6122 (rotlsi3_sext): New.
6123 (bswap<mode>2): New.
6124 (<bitmanip_optab><mode>3): New.
6125 * config/riscv/riscv.md (type): Add rotate.
6126 (zero_extendhi<GPR:mode>2): Change to define_expand pattern.
6127 (*zero_extendhi<GPR:mode>2): New.
6128 (extend<SHORT:mode><SUPERQI:mode>2): Change to define_expand pattern.
6129 (*extend<SHORT:mode><SUPERQI:mode>2): New.
6131 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6133 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
6135 (riscv_rtx_costs): Ditto.
6137 2021-10-25 Jim Wilson <jimw@sifive.com>
6138 Kito Cheng <kito.cheng@sifive.com>
6139 Jia-Wei Chen <jiawei@iscas.ac.cn>
6141 * config/riscv/bitmanip.md (*zero_extendsidi2_bitmanip): New.
6146 (riscv_rtx_costs): Ditto.
6147 * config/riscv/riscv.md: Include bitmanip.md
6148 (type): Add bitmanip bype.
6149 (zero_extendsidi2): Change to define_expand pattern.
6150 (*zero_extendsidi2_internal): New.
6151 (zero_extendsidi2_shifted): Disable for ZBA.
6153 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6155 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
6156 Add zba, zbb, zbc and zbs.
6157 (riscv_ext_flag_table): Ditto.
6158 * config/riscv/riscv-opts.h (MASK_ZBA): New.
6162 (TARGET_ZBA): Ditto.
6163 (TARGET_ZBB): Ditto.
6164 (TARGET_ZBC): Ditto.
6165 (TARGET_ZBS): Ditto.
6166 * config/riscv/riscv.opt (riscv_zb_subext): New.
6168 2021-10-25 liuhongt <hongtao.liu@intel.com>
6171 * match.pd: Simplify (_Float16) sqrtf((float) a) to .SQRT(a)
6172 when direct_internal_fn_supported_p, similar for sqrt/sqrtl.
6174 2021-10-25 Richard Biener <rguenther@suse.de>
6176 PR tree-optimization/102920
6177 * tree-ssa-sccvn.h (expressions_equal_p): Add argument
6178 controlling VN_TOP matching behavior.
6179 * tree-ssa-sccvn.c (expressions_equal_p): Likewise.
6180 (vn_phi_eq): Do not optimistically match VN_TOP.
6182 2021-10-25 konglin1 <lingling.kong@intel.com>
6184 * config/i386/sse.md (fma_<mode>_fadd_fmul): Add new
6185 define_insn_and_split.
6186 (fma_<mode>_fadd_fcmul):Likewise
6187 (fma_<complexopname>_<mode>_fma_zero):Likewise
6189 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
6191 * config/pa/pa-d.c (pa_d_handle_target_float_abi): Don't check
6192 TARGET_DISABLE_FPREGS.
6193 * config/pa/pa.c (fix_range): Use MASK_SOFT_FLOAT instead of
6194 MASK_DISABLE_FPREGS.
6195 (hppa_rtx_costs): Don't check TARGET_DISABLE_FPREGS. Adjust
6196 cost of hardware integer multiplication.
6197 (pa_conditional_register_usage): Don't check TARGET_DISABLE_FPREGS.
6198 * config/pa/pa.h (INT14_OK_STRICT): Likewise.
6199 * config/pa/pa.md: Don't check TARGET_DISABLE_FPREGS. Check
6200 TARGET_SOFT_FLOAT in patterns that use xmpyu instruction.
6201 * config/pa/pa.opt (mdisable-fpregs): Change target mask to
6202 SOFT_FLOAT. Revise comment.
6203 (msoft-float): New option.
6205 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
6207 * config/pa/pa.md: Don't use 'G' constraint in integer move patterns.
6209 2021-10-24 Gerald Pfeifer <gerald@pfeifer.com>
6211 * doc/install.texi (Specific): Remove obsolete details
6212 around GNU/Linux on Itanium.
6213 (Specific): Remove reference to Windows for Itanium.
6215 2021-10-23 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
6217 * config/i386/x86-tune-sched-bd.c (dispatch_group): Commentary
6220 2021-10-23 Jan Hubicka <hubicka@ucw.cz>
6222 * tree-ssa-structalias.c (compute_points_to_sets): Cleanup.
6224 2021-10-23 Roger Sayle <roger@nextmovesoftware.com>
6226 * config/i386/sse.md (<any_logic>v1ti3): New define_insn to
6227 implement V1TImode AND, IOR and XOR on TARGET_SSE2 (and above).
6228 (one_cmplv1ti2): New define expand.
6230 2021-10-22 Eric Gallager <egallager@gcc.gnu.org>
6233 * Makefile.in: Handle dvidir and install-dvi target.
6234 * configure: Regenerate.
6235 * configure.ac: Add install-dvi to target_list.
6237 2021-10-22 Gerald Pfeifer <gerald@pfeifer.com>
6239 * doc/install.texi (Binaries): Convert mingw-w64.org to https.
6242 2021-10-22 Richard Biener <rguenther@suse.de>
6244 PR tree-optimization/102893
6245 * tree-ssa-dce.c (find_obviously_necessary_stmts): Fix the
6246 test for an exit edge.
6248 2021-10-22 Aldy Hernandez <aldyh@redhat.com>
6249 Andrew MacLeod <amacleod@redhat.com>
6251 * gimple-range-path.cc (path_range_query::compute_phi_relations):
6252 Kill any global relations we may know before registering a new
6254 * value-relation.cc (path_oracle::killing_def): New.
6255 * value-relation.h (path_oracle::killing_def): New.
6257 2021-10-22 Richard Biener <rguenther@suse.de>
6260 * tree-ssa-sccvn.c (vn_phi_insert): For undefined SSA args
6262 (vn_phi_lookup): Likewise.
6264 2021-10-21 H.J. Lu <hjl.tools@gmail.com>
6267 * doc/invoke.texi: Document -fcf-protection requires i686 or
6270 2021-10-21 Eric Botcazou <ebotcazou@adacore.com>
6272 PR middle-end/102764
6273 * cfgexpand.c (expand_gimple_basic_block): Robustify latest change.
6275 2021-10-21 Jonathan Wright <jonathan.wright@arm.com>
6277 * config/aarch64/arm_neon.h (__STRUCTN): Delete function
6278 macro and all invocations.
6280 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
6282 * doc/invoke.texi (ranger-debug): Document.
6283 * flag-types.h (enum ranger_debug): New.
6284 (enum evrp_mode): Remove debug values.
6285 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Use new debug flag.
6286 * gimple-range-gori.cc (gori_compute::gori_compute): Ditto.
6287 * gimple-range.cc (gimple_ranger::gimple_ranger): Ditto.
6288 * gimple-ssa-evrp.c (hybrid_folder::choose_value): Ditto.
6289 (execute_early_vrp): Use evrp-mode directly.
6290 * params.opt (enum evrp_mode): Remove debug values.
6291 (ranger-debug): New.
6292 (ranger-logical-depth): Relocate to be in alphabetical order.
6294 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
6296 * doc/invoke.texi: (vrp1-mode, vrp2-mode): Document.
6297 * flag-types.h: (enum vrp_mode): New.
6298 * params.opt: (vrp1-mode, vrp2-mode): New.
6299 * tree-vrp.c (vrp_pass_num): New.
6300 (pass_vrp::pass_vrp): Set pass number.
6301 (pass_vrp::execute): Choose which VRP mode to execute.
6303 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
6305 * gimple-ssa-evrp.c (class rvrp_folder): Move to tree-vrp.c.
6306 (execute_early_vrp): For ranger only mode, invoke ranger_vrp.
6307 * tree-vrp.c (class rvrp_folder): Relocate here.
6308 (execute_ranger_vrp): New.
6309 * tree-vrp.h (execute_ranger_vrp): Export.
6311 2021-10-21 Martin Liska <mliska@suse.cz>
6315 * opts.c (finish_options): Process flag_var_tracking* options
6316 here as they can be adjusted by optimize attribute.
6317 Process also flag_syntax_only and flag_gtoggle.
6318 * toplev.c (process_options): Remove it here.
6319 * common.opt: Make debug_nonbind_markers_p as PerFunction
6320 attribute as it depends on optimization level.
6322 2021-10-21 Martin Jambor <mjambor@suse.cz>
6324 PR tree-optimization/102505
6325 * tree-sra.c (totally_scalarize_subtree): Check that the
6326 encountered field fits within the acces we would like to put it
6329 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
6331 * tree-ssa-threadbackward.c
6332 (back_threader::maybe_register_path): Remove circular paths check.
6334 2021-10-21 Richard Biener <rguenther@suse.de>
6336 * toplev.c (process_options): Move the initial debug_hooks
6338 (toplev::main): ... before the call of the post_options
6341 2021-10-21 Richard Biener <rguenther@suse.de>
6343 PR tree-optimization/102847
6344 * tree-vect-stmts.c (vect_model_load_cost): Add the scalar
6345 load cost in the prologue for VMAT_INVARIANT.
6347 2021-10-21 Richard Biener <rguenther@suse.de>
6349 PR tree-optimization/102847
6350 * tree-vect-stmts.c (vect_model_load_cost): Explicitely
6351 handle VMAT_INVARIANT as a splat in the prologue.
6353 2021-10-21 Hongyu Wang <hongyu.wang@intel.com>
6356 * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector
6357 move to use the same logic as HImode.
6359 2021-10-21 Richard Biener <rguenther@suse.de>
6361 * tree-vect-slp.c (vect_build_slp_tree_1): Remove
6362 superfluous gimple_call_nothrow_p check.
6364 2021-10-21 Andrew Pinski <apinski@marvell.com>
6366 * tree-cfg.c (maybe_remove_writeonly_store): Add dce_ssa_names argument.
6367 Mark the ssa-name of the rhs as one to be removed.
6368 (execute_fixup_cfg): Update call to maybe_remove_writeonly_store.
6369 Call simple_dce_from_worklist at the end to a simple dce.
6371 2021-10-21 Andrew Pinski <apinski@marvell.com>
6373 * tree-cfg.c (maybe_remove_writeonly_store): New function
6374 factored out from ...
6375 (execute_fixup_cfg): Here. Call maybe_remove_writeonly_store.
6377 2021-10-21 Andrew Pinski <apinski@marvell.com>
6379 * tree-cfg.c (execute_fixup_cfg): Remove comment
6380 about standalone pass.
6382 2021-10-21 Andrew Pinski <apinski@marvell.com>
6384 * tree-cfg.c (execute_fixup_cfg): Output when the statement
6385 is removed when it is a write only var.
6387 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
6389 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
6390 Avoid threading circular paths.
6392 2021-10-20 Alex Coplan <alex.coplan@arm.com>
6394 * calls.c (initialize_argument_information): Remove some dead
6395 code, remove handling for function_arg returning const_int.
6396 * doc/tm.texi: Delete documentation for unused target hooks.
6397 * doc/tm.texi.in: Likewise.
6398 * target.def (load_bounds_for_arg): Delete.
6399 (store_bounds_for_arg): Delete.
6400 (load_returned_bounds): Delete.
6401 (store_returned_bounds): Delete.
6402 * targhooks.c (default_load_bounds_for_arg): Delete.
6403 (default_store_bounds_for_arg): Delete.
6404 (default_load_returned_bounds): Delete.
6405 (default_store_returned_bounds): Delete.
6406 * targhooks.h (default_load_bounds_for_arg): Delete.
6407 (default_store_bounds_for_arg): Delete.
6408 (default_load_returned_bounds): Delete.
6409 (default_store_returned_bounds): Delete.
6411 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
6413 * doc/extend.texi (Basic Asm): Clarify that asm is not an
6415 * doc/invoke.texi (-fno-asm): Fix description for C++.
6417 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
6419 * doc/install.texi: Remove link to old.html
6421 2021-10-20 Tamar Christina <tamar.christina@arm.com>
6423 * config/aarch64/aarch64-simd.md (*aarch64_cmtst_same_<mode>): New.
6425 2021-10-20 Tamar Christina <tamar.christina@arm.com>
6427 * config/aarch64/aarch64-simd.md (*aarch64_narrow_trunc<mode>): New.
6429 2021-10-20 Tamar Christina <tamar.christina@arm.com>
6431 * config/aarch64/aarch64-simd.md (aarch64_simd_ashr<mode>): Add case cmp
6433 * config/aarch64/constraints.md (D1): New.
6435 2021-10-20 Tamar Christina <tamar.christina@arm.com>
6437 * config/aarch64/aarch64-simd.md
6438 (*aarch64_<srn_op>topbits_shuffle<mode>_le): New.
6439 (*aarch64_topbits_shuffle<mode>_le): New.
6440 (*aarch64_<srn_op>topbits_shuffle<mode>_be): New.
6441 (*aarch64_topbits_shuffle<mode>_be): New.
6442 * config/aarch64/predicates.md
6443 (aarch64_simd_shift_imm_vec_exact_top): New.
6445 2021-10-20 Tamar Christina <tamar.christina@arm.com>
6447 * config/aarch64/aarch64-simd.md (*aarch64_<srn_op>shrn<mode>_vect,
6448 *aarch64_<srn_op>shrn<mode>2_vect_le,
6449 *aarch64_<srn_op>shrn<mode>2_vect_be): New.
6450 * config/aarch64/iterators.md (srn_op): New.
6452 2021-10-20 Chung-Lin Tang <cltang@codesourcery.com>
6454 * omp-low.c (omp_copy_decl_2): For !ctx, use record_vars to add new copy
6456 (scan_sharing_clauses): Place copy of OMP_CLAUSE_IN_REDUCTION decl in
6457 ctx->outer instead of ctx.
6459 2021-10-20 Martin Liska <mliska@suse.cz>
6462 2021-10-19 Martin Liska <mliska@suse.cz>
6465 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
6466 * system.h (strip_whilespaces): New function.
6468 2021-10-20 Martin Liska <mliska@suse.cz>
6471 2021-10-19 Martin Liska <mliska@suse.cz>
6474 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
6477 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
6479 * config/aarch64/aarch64.c (aarch64_case_values_threshold):
6480 Change to 8 with -Os, 11 otherwise.
6482 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
6484 * config/aarch64/aarch64.c (neoversev1_tunings):
6485 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
6486 (neoversen2_tunings): Likewise.
6488 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
6491 * config/aarch64/aarch64.md (frint_pattern): Update comment.
6492 * config/aarch64/aarch64-simd-builtins.def: Change frintn to roundeven.
6493 * config/aarch64/arm_fp16.h: Change frintn to roundeven.
6494 * config/aarch64/arm_neon.h: Likewise.
6495 * config/aarch64/iterators.md (frint_pattern): Use roundeven for FRINTN.
6497 2021-10-20 Martin Liska <mliska@suse.cz>
6499 * config/arm/arm.c (arm_unwind_emit_sequence): Do not declare
6500 already declared global variable.
6501 (arm_unwind_emit_set): Use out_file as function argument.
6502 (arm_unwind_emit): Likewise.
6503 * config/darwin.c (machopic_output_data_section_indirection): Likewise.
6504 (machopic_output_stub_indirection): Likewise.
6505 (machopic_output_indirection): Likewise.
6506 (machopic_finish): Likewise.
6507 * config/i386/i386.c (ix86_asm_output_function_label): Likewise.
6508 * config/i386/winnt.c (i386_pe_seh_unwind_emit): Likewise.
6509 * config/ia64/ia64.c (process_epilogue): Likewise.
6510 (process_cfa_adjust_cfa): Likewise.
6511 (process_cfa_register): Likewise.
6512 (process_cfa_offset): Likewise.
6513 (ia64_asm_unwind_emit): Likewise.
6514 * config/s390/s390.c (s390_asm_output_function_label): Likewise.
6516 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
6518 * config/aarch64/aarch64-builtins.c
6519 (aarch64_general_gimple_fold_builtin):
6520 lower vld1 and vst1 variants of the neon builtins
6521 * config/aarch64/aarch64-protos.h:
6522 (aarch64_general_gimple_fold_builtin): Add gsi parameter.
6523 * config/aarch64/aarch64.c (aarch64_general_gimple_fold_builtin):
6526 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
6528 * match.pd: Generate IFN_TRUNC.
6530 2021-10-20 Richard Biener <rguenther@suse.de>
6532 PR tree-optimization/102853
6533 * tree-data-ref.c (split_constant_offset_1): Bail out
6534 immediately if the expression traps on overflow.
6536 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
6538 * tree-ssa-threadbackward.c (back_threader::~back_threader): Remove.
6540 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
6542 * tree-ssa-threadupdate.c (back_jt_path_registry::adjust_paths_after_duplication):
6543 Remove superflous debugging message.
6544 (back_jt_path_registry::duplicate_thread_path): Same.
6546 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
6548 * tree-ssa-threadbackward.c (back_threader_registry::back_threader_registry):
6550 (back_threader_registry::register_path): Remove m_threaded_paths.
6552 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
6554 PR tree-optimization/102814
6555 * doc/invoke.texi: Document --param=max-fsm-thread-length.
6556 * params.opt: Add --param=max-fsm-thread-length.
6557 * tree-ssa-threadbackward.c
6558 (back_threader_profitability::profitable_path_p): Fail on paths
6559 longer than max-fsm-thread-length.
6561 2021-10-20 Eric Botcazou <ebotcazou@adacore.com>
6563 PR middle-end/102764
6564 * cfgexpand.c (expand_gimple_basic_block): Disregard a final debug
6565 statement to reset the current location for the outgoing edges.
6567 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
6568 Richard Biener <rguenther@suse.de>
6570 * tree-ssa-threadupdate.c (cancel_thread): Dump threading reason
6571 on the same line as the threading cancellation.
6572 (jt_path_registry::cancel_invalid_paths): Avoid rotating loops.
6573 Avoid threading through loop headers where the path remains in the
6576 2021-10-20 Siddhesh Poyarekar <siddhesh@gotplt.org>
6578 * tree-object-size.c (unknown): Make into a function. Adjust
6580 (unknown_object_size): Simplify implementation.
6582 2021-10-20 Hongtao Liu <hongtao.liu@intel.com>
6583 Kewen Lin <linkw@linux.ibm.com>
6585 * doc/sourcebuild.texi (Effective-Target Keywords): Document
6586 vect_slp_v2qi_store, vect_slp_v4qi_store, vect_slp_v8qi_store,
6587 vect_slp_v16qi_store, vect_slp_v2hi_store,
6588 vect_slp_v4hi_store, vect_slp_v2si_store, vect_slp_v4si_store.
6590 2021-10-19 Jonathan Wakely <jwakely@redhat.com>
6592 * doc/extend.texi (Basic PowerPC Built-in Functions): Fix typo.
6594 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
6598 * config/rs6000/emmintrin.h: Guard POWER8 intrinsics.
6599 * config/rs6000/pmmintrin.h: Same.
6600 * config/rs6000/smmintrin.h: Same.
6601 * config/rs6000/tmmintrin.h: Same.
6603 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
6605 * config.gcc (extra_headers): Add nmmintrin.h.
6607 2021-10-19 Richard Biener <rguenther@suse.de>
6609 * tree-vectorizer.h (vect_supportable_dr_alignment): Add
6610 misalignment parameter.
6611 * tree-vect-data-refs.c (vect_get_peeling_costs_all_drs):
6612 Do not change DR_MISALIGNMENT in place, instead pass the
6613 adjusted misalignment to vect_supportable_dr_alignment.
6614 (vect_peeling_supportable): Likewise.
6615 (vect_peeling_hash_get_lowest_cost): Adjust.
6616 (vect_enhance_data_refs_alignment): Likewise.
6617 (vect_vfa_access_size): Likewise.
6618 (vect_supportable_dr_alignment): Add misalignment
6619 parameter and simplify.
6620 * tree-vect-stmts.c (get_negative_load_store_type): Adjust.
6621 (get_group_load_store_type): Likewise.
6622 (get_load_store_type): Likewise.
6624 2021-10-19 Clément Chigot <clement.chigot@atos.net>
6626 * config/rs6000/rs6000.c (rs6000_xcoff_file_end): Move
6627 __tls_get_addr reference to .text csect.
6629 2021-10-19 Martin Liska <mliska@suse.cz>
6632 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
6635 2021-10-19 Richard Biener <rguenther@suse.de>
6637 * tree-vectorizer.h (vect_get_store_cost): Adjust signature.
6638 (vect_get_load_cost): Likewise.
6639 * tree-vect-data-refs.c (vect_get_data_access_cost): Get
6640 alignment support scheme and misalignment as arguments
6642 (vect_get_peeling_costs_all_drs): Compute that info here
6643 and note that we shouldn't need to.
6644 * tree-vect-stmts.c (vect_model_store_cost): Get
6645 alignment support scheme and misalignment as arguments.
6646 (vect_get_store_cost): Likewise.
6647 (vect_model_load_cost): Likewise.
6648 (vect_get_load_cost): Likewise.
6649 (vectorizable_store): Pass down alignment support scheme
6650 and misalignment to costing.
6651 (vectorizable_load): Likewise.
6653 2021-10-19 Richard Biener <rguenther@suse.de>
6655 * tree-vect-stmts.c (get_negative_load_store_type): Add
6656 offset output parameter and initialize it.
6657 (get_group_load_store_type): Likewise.
6658 (get_load_store_type): Likewise.
6659 (vectorizable_store): Use offset as computed by
6660 get_load_store_type.
6661 (vectorizable_load): Likewise.
6663 2021-10-19 Richard Biener <rguenther@suse.de>
6665 PR tree-optimization/102827
6666 * tree-if-conv.c (predicate_statements): Add pe parameter
6667 and use that edge to insert invariant stmts on.
6668 (combine_blocks): Pass through pe.
6669 (tree_if_conversion): Compute the edge to insert invariant
6670 stmts on and pass it along.
6672 2021-10-19 Roger Sayle <roger@nextmovesoftware.com>
6675 * config/bfin/bfin.md (addsubv2hi3, subaddv2hi3, ssaddsubv2hi3,
6676 sssubaddv2hi3): Swap the order of operators in vec_concat.
6678 2021-10-19 Xionghu Luo <luoxhu@linux.ibm.com>
6680 * config/rs6000/altivec.md (*altivec_vmrghb_internal): Delete.
6681 (altivec_vmrghb_direct): New.
6682 (*altivec_vmrghh_internal): Delete.
6683 (altivec_vmrghh_direct): New.
6684 (*altivec_vmrghw_internal): Delete.
6685 (altivec_vmrghw_direct_<mode>): New.
6686 (altivec_vmrghw_direct): Delete.
6687 (*altivec_vmrglb_internal): Delete.
6688 (altivec_vmrglb_direct): New.
6689 (*altivec_vmrglh_internal): Delete.
6690 (altivec_vmrglh_direct): New.
6691 (*altivec_vmrglw_internal): Delete.
6692 (altivec_vmrglw_direct_<mode>): New.
6693 (altivec_vmrglw_direct): Delete.
6694 * config/rs6000/rs6000-p8swap.c (rtx_is_swappable_p): Adjust.
6695 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const):
6697 * config/rs6000/vsx.md (vsx_xxmrghw_<mode>): Adjust.
6698 (vsx_xxmrglw_<mode>): Adjust.
6700 2021-10-19 Aldy Hernandez <aldyh@redhat.com>
6702 * passes.def: Change threading comment before pass_ccp pass.
6704 2021-10-19 Haochen Gui <guihaoc@gcc.gnu.org>
6706 * config/rs6000/rs6000-call.c (altivec_expand_lxvr_builtin):
6707 Modify the expansion for sign extension. All extensions are done
6708 within VSX registers.
6710 2021-10-19 Richard Biener <rguenther@suse.de>
6712 * tree-vect-stmts.c (get_group_load_store_type): Add
6713 misalignment output parameter and initialize it.
6714 (get_group_load_store_type): Likewise.
6715 (vectorizable_store): Remove now redundant queries.
6716 (vectorizable_load): Likewise.
6718 2021-10-19 Richard Biener <rguenther@suse.de>
6720 * tree-vectorizer.h (vect_supportable_dr_alignment): Remove
6721 check_aligned argument.
6722 * tree-vect-data-refs.c (vect_supportable_dr_alignment):
6724 (vect_peeling_hash_insert): Add supportable_if_not_aligned
6725 argument and do not call vect_supportable_dr_alignment here.
6726 (vect_peeling_supportable): Adjust.
6727 (vect_enhance_data_refs_alignment): Compute whether the
6728 access is supported with different alignment here and
6729 pass that down to vect_peeling_hash_insert.
6730 (vect_vfa_access_size): Adjust.
6731 * tree-vect-stmts.c (vect_get_store_cost): Likewise.
6732 (vect_get_load_cost): Likewise.
6733 (get_negative_load_store_type): Likewise.
6734 (get_group_load_store_type): Likewise.
6735 (get_load_store_type): Likewise.
6737 2021-10-19 Martin Liska <mliska@suse.cz>
6740 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
6741 * system.h (strip_whilespaces): New function.
6743 2021-10-19 dianhong xu <dianhong.xu@intel.com>
6745 * config/i386/avx512fp16intrin.h:
6746 (_mm512_set1_pch): New intrinsic.
6747 * config/i386/avx512fp16vlintrin.h:
6748 (_mm256_set1_pch): New intrinsic.
6749 (_mm_set1_pch): Ditto.
6751 2021-10-18 Andrew MacLeod <amacleod@redhat.com>
6753 PR tree-optimization/102796
6754 * gimple-range.cc (gimple_ranger::range_on_edge): Process EH edges
6755 normally. Return get_tree_range for non gimple_range_ssa_p names.
6756 (gimple_ranger::range_of_stmt): Use get_tree_range for non
6757 gimple_range_ssa_p names.
6759 2021-10-18 Uroš Bizjak <ubizjak@gmail.com>
6762 * config/i386/i386.c (ix86_print_operand_address):
6763 Error out for non-address_operand asm operands.
6765 2021-10-18 Richard Biener <rguenther@suse.de>
6767 * tree-vect-data-refs.c (vect_peeling_hash_insert): Do
6768 not auto-convert dr_alignment_support to bool.
6769 (vect_peeling_supportable): Likewise.
6770 (vect_enhance_data_refs_alignment): Likewise.
6771 (vect_supportable_dr_alignment): Commonize read/write case.
6772 * tree-vect-stmts.c (vect_get_store_cost): Use
6773 dr_alignment_support, not int, for the vect_supportable_dr_alignment
6775 (vect_get_load_cost): Likewise.
6777 2021-10-18 Siddhesh Poyarekar <siddhesh@gotplt.org>
6779 * tree-object-size.c (object_sizes_execute): Consolidate LHS
6780 null check and do it early.
6782 2021-10-18 Richard Biener <rguenther@suse.de>
6784 * tree-vect-stmts.c (vectorizable_store): Use the
6785 computed alignment scheme instead of querying
6788 2021-10-18 Richard Biener <rguenther@suse.de>
6790 * tree-vect-stmts.c (vectorizable_store): Do not recompute
6791 alignment scheme already determined by get_load_store_type.
6793 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
6795 * tree-ssa-threadbackward.c (class pass_thread_jumps_full):
6796 Clone corresponding pass.
6798 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
6800 * combine.c (recog_for_combine): For an unrecognized move/set of
6801 a constant, try force_const_mem to place it in the constant pool.
6802 * cse.c (constant_pool_entries_cost, constant_pool_entries_regcost):
6803 Delete global variables (that are no longer assigned a cost value).
6804 (cse_insn): Simplify logic for deciding whether to place a folded
6805 constant in the constant pool using force_const_mem.
6806 (cse_main): Remove zero initialization of constant_pool_entries_cost
6807 and constant_pool_entries_regcost.
6808 * config/i386/i386.c (ix86_rtx_costs): Make memory accesses
6809 fractionally more expensive, when optimizing for speed.
6811 2021-10-18 Martin Liska <mliska@suse.cz>
6813 PR gcov-profile/102746
6814 PR gcov-profile/102747
6815 * gcov.c (main): Return return_code.
6816 (output_gcov_file): Mark return_code when error happens.
6817 (generate_results): Likewise.
6818 (read_graph_file): Likewise.
6819 (read_count_file): Likewise.
6821 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
6823 * config/bfin/bfin.md (define_constants): Remove UNSPEC_ONES.
6824 (define_insn "ones"): Replace UNSPEC_ONES with a truncate of
6825 a popcount, allowing compile-time evaluation/simplification.
6826 (popcountsi2, popcounthi2): New expanders using a "ones" insn.
6828 2021-10-18 Richard Biener <rguenther@suse.de>
6830 PR tree-optimization/102788
6831 * tree-vect-patterns.c (vect_init_pattern_stmt): Allow
6833 (vect_pattern_recog_1): Likewise.
6834 (vect_recog_bool_pattern): Continue matching the pattern
6835 even if we do not have a vector type for a conversion
6838 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
6840 * simplify-rtx.c (simplify_const_unary_operation) [SS_NEG, SS_ABS]:
6841 Evalute SS_NEG and SS_ABS of a constant argument.
6843 2021-10-18 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
6846 * gimple-match-head.c (try_conditional_simplification): Add case for single operand.
6847 * internal-fn.def: Add entry for COND_NEG internal function.
6848 * internal-fn.c (FOR_EACH_CODE_MAPPING): Add entry for
6849 NEGATE_EXPR, COND_NEG mapping.
6850 * optabs.def: Add entry for cond_neg_optab.
6851 * match.pd (UNCOND_UNARY, COND_UNARY): New operator lists.
6852 (vec_cond COND (foo A) B) -> (IFN_COND_FOO COND A B): New pattern.
6853 (vec_cond COND B (foo A)) -> (IFN_COND_FOO ~COND A B): Likewise.
6855 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
6857 * tree-ssa-strlen.c (class strlen_pass): Rename from
6859 (handle_builtin_stxncpy_strncat): Move to strlen_pass.
6860 (handle_assign): Same.
6861 (adjust_last_stmt): Same.
6862 (maybe_warn_overflow): Same.
6863 (handle_builtin_strlen): Same.
6864 (handle_builtin_strchr): Same.
6865 (handle_builtin_strcpy): Same.
6866 (handle_builtin_strncat): Same.
6867 (handle_builtin_stxncpy_strncat): Same.
6868 (handle_builtin_memcpy): Same.
6869 (handle_builtin_strcat): Same.
6870 (handle_alloc_call): Same.
6871 (handle_builtin_memset): Same.
6872 (handle_builtin_memcmp): Same.
6873 (get_len_or_size): Same.
6874 (strxcmp_eqz_result): Same.
6875 (handle_builtin_string_cmp): Same.
6876 (handle_pointer_plus): Same.
6877 (count_nonzero_bytes_addr): Same.
6878 (count_nonzero_bytes): Same.
6879 (handle_store): Same.
6880 (strlen_check_and_optimize_call): Same.
6881 (handle_integral_assign): Same.
6882 (check_and_optimize_stmt): Same.
6883 (printf_strlen_execute): Rename strlen_dom_walker to strlen_pass.
6885 2021-10-18 Richard Biener <rguenther@suse.de>
6887 PR tree-optimization/102798
6888 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
6889 Only copy points-to info to newly generated SSA names.
6891 2021-10-18 Martin Liska <mliska@suse.cz>
6893 * dbgcnt.c (dbg_cnt_process_opt): Remove unused but set variable.
6894 * gcov.c (get_cycles_count): Likewise.
6895 * lto-compress.c (lto_compression_zlib): Likewise.
6896 (lto_uncompression_zlib): Likewise.
6897 * targhooks.c (default_pch_valid_p): Likewise.
6899 2021-10-17 Aldy Hernandez <aldyh@redhat.com>
6901 * tree-pass.h (make_pass_thread_jumps_full): New.
6902 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Inline.
6903 (try_thread_blocks): Add resolve and speed arguments.
6904 (pass_thread_jumps::execute): Inline.
6905 (do_early_thread_jumps): New.
6906 (do_thread_jumps): New.
6907 (make_pass_thread_jumps): Move.
6908 (pass_early_thread_jumps::gate): Inline.
6909 (pass_early_thread_jumps::execute): Inline.
6910 (class pass_thread_jumps_full): New.
6912 2021-10-16 Piotr Kubaj <pkubaj@FreeBSD.org>
6914 * configure.ac: Treat powerpc64*-*-freebsd* the same as
6915 powerpc64-*-freebsd*.
6916 * configure: Regenerate.
6918 2021-10-16 H.J. Lu <hjl.tools@gmail.com>
6920 * value-query.cc (get_ssa_name_ptr_info_nonnull): Change
6921 set_ptr_nonull to set_ptr_nonnull in comments.
6923 2021-10-16 Jan Hubicka <hubicka@ucw.cz>
6925 PR tree-optimization/102720
6926 * tree-ssa-structalias.c (compute_points_to_sets): Fix producing
6927 of call used and clobbered sets.
6929 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
6931 * config/darwin.h (LINK_COMMAND_SPEC_A): Update 'r' handling to
6932 skip gomp and itm when r or nodefaultlibs is given.
6933 (DSYMUTIL_SPEC): Do not call dsymutil for '-r' link lines.
6934 Update ordering of exclusions, remove duplicate 'v' addition
6935 (collect2 will add this from the main command line).
6937 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
6939 * config/darwin-driver.c (darwin_driver_init): Revise comments, handle
6940 filelist and framework options in specs instead of code.
6941 * config/darwin.h (SUBTARGET_DRIVER_SELF_SPECS): Update to handle link
6942 specs that are really driver ones.
6943 (DARWIN_CC1_SPEC): Likewise.
6944 (CPP_SPEC): Likewise.
6945 (SYSROOT_SPEC): Append space.
6946 (LINK_SYSROOT_SPEC): Remove most driver link specs.
6947 (STANDARD_STARTFILE_PREFIX_2): Update link-related specs.
6948 (STARTFILE_SPEC): Likewise.
6949 (ASM_MMACOSX_VERSION_MIN_SPEC): Fix line wrap.
6950 (ASM_SPEC): Update driver-related specs.
6951 (ASM_FINAL_SPEC): Likewise.
6952 * config/darwin.opt: Remove now unused option aliases.
6953 * config/i386/darwin.h (EXTRA_ASM_OPTS): Ensure space after opt.
6954 (ASM_SPEC): Update driver-related specs.
6956 2021-10-15 Roger Sayle <roger@nextmovesoftware.com>
6958 * config/i386/i386.c (ix86_hardreg_mov_ok): For vector modes,
6959 allow standard_sse_constant_p immediate constants.
6961 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
6963 * config.gcc: Add tm-dwarf2.h to tm_d-file.
6965 2021-10-15 Andrew MacLeod <amacleod@redhat.com>
6967 * gimple-range-fold.h (gimple_range_ssa_p): Don't process names
6968 that occur in abnormal phis.
6969 * gimple-range.cc (gimple_ranger::range_on_edge): Return false for
6970 abnormal and EH edges.
6971 * gimple-ssa-evrp.c (rvrp_folder::value_of_expr): Ditto.
6972 (rvrp_folder::value_on_edge): Ditto.
6973 (rvrp_folder::value_of_stmt): Ditto.
6974 (hybrid_folder::value_of_expr): Ditto for ranger queries.
6975 (hybrid_folder::value_on_edge): Ditto.
6976 (hybrid_folder::value_of_stmt): Ditto.
6977 * value-query.cc (gimple_range_global): Always return a range if
6978 the type is supported.
6980 2021-10-15 John David Anglin <danglin@gcc.gnu.org>
6982 * config/pa/pa.md: Consistently use "rG" constraint for copy
6983 instruction in move patterns.
6985 2021-10-15 Andrew Stubbs <ams@codesourcery.com>
6987 * config/gcn/gcn-hsa.h (S_FIJI): Set unconditionally.
6990 * config/gcn/gcn.c: Hard code SRAM ECC settings for old architectures.
6991 * config/gcn/mkoffload.c (ELFABIVERSION_AMDGPU_HSA): Rename to ...
6992 (ELFABIVERSION_AMDGPU_HSA_V3): ... this.
6993 (ELFABIVERSION_AMDGPU_HSA_V4): New.
6994 (SET_SRAM_ECC_UNSUPPORTED): New.
6995 (copy_early_debug_info): Create elf flags to match the other objects.
6996 (main): Just let the attribute flags pass through.
6998 2021-10-15 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
7000 * tree-loop-distribution.c (reduction_var_overflows_first):
7001 Pass the type of reduction_var as first argument as it is also
7002 done for the load type.
7003 (loop_distribution::transform_reduction_loop): Add missing
7004 TREE_TYPE while determining precission of reduction_var.
7006 2021-10-15 Richard Biener <rguenther@suse.de>
7008 * defaults.h (PREFERRED_DEBUGGING_TYPE): Choose DWARF2_DEBUG
7010 * toplev.c (process_options): Warn when STABS debugging is
7011 enabled but not the preferred format.
7012 * config/pa/som.h (PREFERRED_DEBUGGING_TYPE): Define to
7014 * config/pdp11/pdp11.h (PREFERRED_DEBUGGING_TYPE): Likewise.
7016 2021-10-15 Richard Biener <rguenther@suse.de>
7019 * tree-inline.c (copy_bb): Avoid underflowing nargs.
7021 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
7023 * config/i386/i386-expand.c (ix86_expand_vec_perm): Convert
7024 HFmode input operand to HImode.
7025 (ix86_vectorize_vec_perm_const): Likewise.
7026 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1_hf):
7028 (*avx512f_permvar_truncv8siv8hi_1_hf):
7031 2021-10-15 Richard Biener <rguenther@suse.de>
7033 PR middle-end/102682
7034 * expmed.c (store_bit_field_1): Ensure a LHS subreg would
7035 not create a paradoxical subreg.
7037 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
7039 * config/i386/i386-expand.c (ix86_expand_vector_init):
7040 For half_vector concat for HFmode, handle them like HImode.
7042 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7044 * tree-ssa-threadbackward.c (class back_threader): Add m_resolve.
7045 (back_threader::back_threader): Same.
7046 (back_threader::resolve_phi): Try to solve without looking back if
7048 (back_threader::find_paths_to_names): Same.
7049 (try_thread_blocks): Pass resolve argument to back threader.
7050 (pass_early_thread_jumps::execute): Same.
7052 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7054 * doc/invoke.texi: Remove max-fsm-thread-length,
7055 max-fsm-thread-paths, and fsm-maximum-phi-arguments.
7057 * tree-ssa-threadbackward.c (back_threader::back_threader): Remove
7059 (back_threader_registry::back_threader_registry): Same.
7060 (back_threader_profitability::profitable_path_p): Remove
7061 param_max_fsm_thread-length.
7062 (back_threader_registry::register_path): Remove
7063 m_max_allowable_paths.
7065 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7067 * tree-ssa-threadbackward.c (class back_threader): Make m_imports
7069 (back_threader::~back_threader): Do not release m_path.
7071 2021-10-14 Andrew MacLeod <amacleod@redhat.com>
7073 PR tree-optimization/102738
7074 * vr-values.c (simplify_using_ranges::simplify): Handle RSHIFT_EXPR.
7076 2021-10-14 Kwok Cheung Yeung <kcy@codesourcery.com>
7078 * omp-general.c (omp_check_context_selector): Move from c-omp.c.
7079 (omp_mark_declare_variant): Move from c-omp.c.
7080 (omp_context_name_list_prop): Update for Fortran strings.
7081 * omp-general.h (omp_check_context_selector): New prototype.
7082 (omp_mark_declare_variant): New prototype.
7084 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7086 * config/rs6000/rs6000.c (rs6000_density_test): Move early
7087 exit test further up the function.
7089 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7091 * config/arm/arm.c (arm_add_stmt_cost): Delete.
7092 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
7094 2021-10-14 Martin Jambor <mjambor@suse.cz>
7096 * doc/invoke.texi (Optimize Options): Add entry for
7097 ipa-cp-recursive-freq-factor.
7099 2021-10-14 Tamar Christina <tamar.christina@arm.com>
7101 * match.pd: New rule.
7103 2021-10-14 Jan Hubicka <hubicka@ucw.cz>
7106 * ipa-modref-tree.h (modref_access_node::update2):
7107 Also check that parm_offset is unchanged.
7108 (modref_ref_node::insert_access): Fix updating of
7111 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7113 * tree-ssa-threadbackward.c (back_threader::resolve_phi): Add
7116 2021-10-14 Richard Biener <rguenther@suse.de>
7118 PR tree-optimization/102659
7119 * tree-if-conv.c (if_convertible_gimple_assign_stmt_p): Also
7120 rewrite pointer typed undefined overflow operations.
7121 (predicate_statements): Likewise. Make sure to emit invariant
7122 conversions in the preheader.
7123 * tree-vectorizer.c (vect_loop_vectorized_call): Look through
7124 non-empty preheaders.
7125 * tree-data-ref.c (dr_analyze_indices): Strip useless
7126 conversions to the MEM_REF base type.
7128 2021-10-14 Martin Liska <mliska@suse.cz>
7130 * common.opt: Stop using AUTODETECT_VALUE
7131 and use EnabledBy where possible.
7132 * opts.c: Enable OPT_fvar_tracking with optimize >= 1.
7133 * toplev.c (AUTODETECT_VALUE): Remove macro.
7134 (process_options): Simplify by using EnabledBy and
7135 OPT_fvar_tracking. Use OPTION_SET_P macro instead of
7138 2021-10-14 Jonathan Wright <jonathan.wright@arm.com>
7140 * config/aarch64/arm_neon.h (vld1_s8_x3): Use signed type for
7142 (vld1_s32_x3): Likewise.
7144 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7146 PR tree-optimization/102736
7147 PR tree-optimization/102736
7148 * gimple-range-path.cc (path_range_query::range_on_path_entry):
7149 Assert that the requested range is defined outside the path.
7150 (path_range_query::ssa_range_in_phi): Do not call
7151 range_on_path_entry for SSA names that are defined within the
7154 2021-10-14 Iain Sandoe <iain@sandoe.co.uk>
7156 * config/darwin-driver.c (darwin_find_version_from_kernel):
7157 Quote internal identifiers and avoid contractions in
7159 (darwin_default_min_version): Likewise.
7160 (darwin_driver_init): Likewise.
7162 2021-10-14 Martin Jambor <mjambor@suse.cz>
7164 * params.opt (ipa-cp-recursive-freq-factor): New.
7165 * ipa-cp.c (ipcp_value): Switch to inline initialization. New members
7166 scc_no, self_recursion_generated_level, same_scc and
7167 self_recursion_generated_p.
7168 (ipcp_lattice::add_value): Replaced parameter unlimited with
7169 same_lat_gen_level, usit it determine limit of values and store it to
7171 (ipcp_lattice<valtype>::print): Dump the new fileds.
7172 (allocate_and_init_ipcp_value): Take same_lat_gen_level as a new
7173 parameter and store it to the new value.
7174 (self_recursively_generated_p): Removed.
7175 (propagate_vals_across_arith_jfunc): Use self_recursion_generated_p
7176 instead of self_recursively_generated_p, store self generation level
7178 (value_topo_info<valtype>::add_val): Set scc_no.
7179 (value_topo_info<valtype>::propagate_effects): Multiply frequencies of
7180 recursively feeding values and self generated values by appropriate
7183 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7185 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Remove
7186 redundant test for flag_vect_cost_model.
7188 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7190 * bitmap.c (debug): New overloaded function for auto_bitmaps.
7191 * bitmap.h (debug): Same.
7193 2021-10-14 Hongyu Wang <hongyu.wang@intel.com>
7195 * config/i386/avx512fp16intrin.h (_mm512_mask_fcmadd_pch):
7196 Adjust builtin call.
7197 (_mm512_mask3_fcmadd_pch): Likewise.
7198 (_mm512_mask_fmadd_pch): Likewise
7199 (_mm512_mask3_fmadd_pch): Likewise
7200 (_mm512_mask_fcmadd_round_pch): Likewise
7201 (_mm512_mask3_fcmadd_round_pch): Likewise
7202 (_mm512_mask_fmadd_round_pch): Likewise
7203 (_mm512_mask3_fmadd_round_pch): Likewise
7204 (_mm_mask_fcmadd_sch): Likewise
7205 (_mm_mask3_fcmadd_sch): Likewise
7206 (_mm_mask_fmadd_sch): Likewise
7207 (_mm_mask3_fmadd_sch): Likewise
7208 (_mm_mask_fcmadd_round_sch): Likewise
7209 (_mm_mask3_fcmadd_round_sch): Likewise
7210 (_mm_mask_fmadd_round_sch): Likewise
7211 (_mm_mask3_fmadd_round_sch): Likewise
7212 (_mm_fcmadd_round_sch): Likewise
7213 * config/i386/avx512fp16vlintrin.h (_mm_mask_fmadd_pch):
7214 Adjust builtin call.
7215 (_mm_mask3_fmadd_pch): Likewise
7216 (_mm256_mask_fmadd_pch): Likewise
7217 (_mm256_mask3_fmadd_pch): Likewise
7218 (_mm_mask_fcmadd_pch): Likewise
7219 (_mm_mask3_fcmadd_pch): Likewise
7220 (_mm256_mask_fcmadd_pch): Likewise
7221 (_mm256_mask3_fcmadd_pch): Likewise
7222 * config/i386/i386-builtin.def: Add mask3 builtin for complex
7223 fma, and adjust mask_builtin to corresponding expander.
7224 * config/i386/i386-expand.c (ix86_expand_round_builtin):
7225 Skip eraseing embedded rounding for expanders that emits
7227 * config/i386/sse.md (complexmove): New mode_attr.
7228 (<avx512>_fmaddc_<mode>_mask1<round_expand_name>): New expander.
7229 (<avx512>_fcmaddc_<mode>_mask1<round_expand_name>): Likewise.
7230 (avx512fp16_fmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
7231 (avx512fp16_fcmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
7232 (avx512fp16_fcmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
7233 (avx512fp16_fmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
7234 * config/i386/subst.md (round_embedded_complex): New subst.
7236 2021-10-14 John David Anglin <danglin@gcc.gnu.org>
7238 * config/pa/pa.md (cbranchsf4): Disable if TARGET_SOFT_FLOAT.
7239 (cbranchdf4): Likewise.
7240 Add missing move patterns for TARGET_SOFT_FLOAT.
7242 2021-10-13 Roger Sayle <roger@nextmovesoftware.com>
7244 * config/i386/i386-expand.c (ix86_expand_vector_move): Use a
7245 pseudo intermediate when moving a SUBREG into a hard register,
7246 by checking ix86_hardreg_mov_ok.
7247 (ix86_expand_vector_extract): Store zero-extended SImode
7248 intermediate in a pseudo, then set target using a SUBREG_PROMOTED
7250 * config/i386/sse.md (mov<VMOVE>_internal): Prevent CSE creating
7251 complex (SUBREG) sets of (vector) hard registers before reload, by
7252 checking ix86_hardreg_mov_ok.
7254 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
7256 * ctfc.h (enum ctf_dtu_d_union_enum): Remove redundant comma.
7258 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
7260 * dwarf2ctf.c (gen_ctf_array_type): Fix typo in comment.
7262 2021-10-13 Martin Sebor <msebor@redhat.com>
7264 PR middle-end/102630
7265 * pointer-query.cc (compute_objsize_r): Handle named address spaces.
7267 2021-10-13 Iain Sandoe <iain@sandoe.co.uk>
7269 * collect2.c (is_lto_object_file): Release simple-object
7270 resources, close files.
7272 2021-10-13 Andre Vieira <andre.simoesdiasvieira@arm.com>
7274 * config/arm/arm.c (thumb2_legitimate_address_p): Use VALID_MVE_MODE
7275 when checking mve addressing modes.
7276 (mve_vector_mem_operand): Fix the way we handle pre, post and offset
7278 (arm_print_operand): Fix printing of POST_ and PRE_MODIFY.
7279 * config/arm/mve.md: Use mve_memory_operand predicate everywhere where
7280 there is a single Ux constraint.
7282 2021-10-13 John David Anglin <danglin@gcc.gnu.org>
7284 * config/pa/pa.md (muldi3): Add support for inlining 64-bit
7285 multiplication on 32-bit PA 1.1 and 2.0 targets.
7287 2021-10-13 Martin Liska <mliska@suse.cz>
7289 PR gcov-profile/90364
7290 * coverage.c (build_info): Emit checksum to the global variable.
7291 (build_info_type): Add new field for checksum.
7292 (coverage_obj_finish): Pass object_checksum.
7293 (coverage_init): Use 0 as checksum for .gcno files.
7294 * gcov-dump.c (dump_gcov_file): Dump also new checksum field.
7295 * gcov.c (read_graph_file): Read also checksum.
7296 * doc/invoke.texi: Document the behaviour change.
7298 2021-10-13 Richard Biener <rguenther@suse.de>
7300 * gimple-iterator.h (gsi_iterator_update): Add GSI_LAST_NEW_STMT,
7301 start at integer value 2.
7302 * gimple-iterator.c (gsi_insert_seq_nodes_before): Update
7303 the iterator for GSI_LAST_NEW_STMT.
7304 (gsi_insert_seq_nodes_after): Likewise.
7305 * tree-if-conv.c (predicate_statements): Use GSI_LAST_NEW_STMT.
7306 * tree-ssa.c (execute_update_addresses_taken): Correct bogus
7307 arguments to gsi_replace.
7309 2021-10-13 Martin Liska <mliska@suse.cz>
7312 * common.opt: Use EnabledBy instead of detection in
7313 finish_options and process_options.
7314 * opts.c (finish_options): Remove handling of
7315 x_flag_unroll_all_loops.
7316 * toplev.c (process_options): Likewise for flag_web and
7317 flag_rename_registers.
7319 2021-10-13 Richard Biener <rguenther@suse.de>
7321 PR tree-optimization/102659
7322 * tree-if-conv.c (need_to_rewrite_undefined): New flag.
7323 (if_convertible_gimple_assign_stmt_p): Mark the loop for
7324 rewrite when stmts with undefined behavior on integer
7326 (combine_blocks): Predicate also when we need to rewrite stmts.
7327 (predicate_statements): Rewrite affected stmts to something
7328 with well-defined behavior on overflow.
7329 (tree_if_conversion): Initialize need_to_rewrite_undefined.
7331 2021-10-13 Richard Biener <rguenther@suse.de>
7334 * ipa-sra.c (ptr_parm_has_nonarg_uses): Fix volatileness
7337 2021-10-13 Jose E. Marchesi <jose.marchesi@oracle.com>
7339 * dwarf2ctf.c: Fix typo in comment.
7341 2021-10-12 Bill Schmidt <wschmidt@linux.ibm.com>
7344 * config/rs6000/altivec.h (vec_cpsgn): Swap operand order.
7345 * config/rs6000/rs6000-overload.def (VEC_COPYSIGN): Use SKIP to
7346 avoid generating an automatic #define of vec_cpsgn. Use the
7347 correct built-in for V4SFmode that doesn't depend on VSX.
7349 2021-10-12 Uroš Bizjak <ubizjak@gmail.com>
7353 * config/i386/i386.md (*add<mode>_1_slp): Rewrite as
7354 define_insn_and_split pattern. Add alternative 1 and split it
7355 post reload to insert operand 1 into the low part of operand 0.
7356 (*sub<mode>_1_slp): Ditto.
7357 (*and<mode>_1_slp): Ditto.
7358 (*<any_or:code><mode>_1_slp): Ditto.
7359 (*ashl<mode>3_1_slp): Ditto.
7360 (*<any_shiftrt:insn><mode>3_1_slp): Ditto.
7361 (*<any_rotate:insn><mode>3_1_slp): Ditto.
7362 (*neg<mode>_1_slp): New insn_and_split pattern.
7363 (*one_cmpl<mode>_1_slp): Ditto.
7365 2021-10-12 David Edelsohn <dje.gcc@gmail.com>
7367 * doc/install.texi: Update MinGW and mingw-64 Binaries
7370 2021-10-12 Daniel Le Duc Khoi Nguyen <greenrecyclebin@gmail.com>
7372 * doc/extend.texi (Common Variable Attributes): Fix typos in
7373 alloc_size documentation.
7375 2021-10-12 Richard Biener <rguenther@suse.de>
7377 PR tree-optimization/102696
7378 * tree-vect-slp.c (vect_build_slp_tree_2): Properly mark
7379 the tree fatally failed when we reject a BIT_FIELD_REF.
7381 2021-10-12 Richard Biener <rguenther@suse.de>
7383 PR tree-optimization/102572
7384 * tree-vect-stmts.c (vect_build_gather_load_calls): When
7385 gathering the vectorized defs for the mask pass in the
7386 desired mask vector type so invariants will be handled
7389 2021-10-12 Tamar Christina <tamar.christina@arm.com>
7391 * config/aarch64/aarch64-sve.md (*fcm<cmp_op><mode>_bic_combine,
7392 *fcm<cmp_op><mode>_nor_combine, *fcmuo<mode>_bic_combine,
7393 *fcmuo<mode>_nor_combine): New.
7395 2021-10-12 Eric Botcazou <ebotcazou@adacore.com>
7398 * config/sparc/sparc-modes.def (OI): New integer mode.
7400 2021-10-12 Jakub Jelinek <jakub@redhat.com>
7402 * gimple-fold.h (clear_padding_type_may_have_padding_p): Declare.
7403 * gimple-fold.c (clear_padding_type_may_have_padding_p): No longer
7406 2021-10-12 Jakub Jelinek <jakub@redhat.com>
7408 * tree-vectorizer.h (loop_cost_model): New function.
7409 (unlimited_cost_model): Use it.
7410 * tree-vect-loop.c (vect_analyze_loop_costing): Use loop_cost_model
7411 call instead of flag_vect_cost_model.
7412 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): Likewise.
7413 (vect_prune_runtime_alias_test_list): Likewise. Also use it instead
7414 of flag_simd_cost_model.
7416 2021-10-12 liuhongt <hongtao.liu@intel.com>
7419 * config/i386/i386-expand.c (emit_reduc_half): Handle
7421 * config/i386/mmx.md (reduc_<code>_scal_v4qi): New expander.
7422 (reduc_plus_scal_v4qi): Ditto.
7424 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
7426 * config/rs6000/smmintrin.h (_mm_cmpeq_epi64, _mm_cmpgt_epi64,
7427 _mm_mullo_epi32, _mm_mul_epi32, _mm_packus_epi32): New.
7428 * config/rs6000/nmmintrin.h: Copy from i386, tweak to suit.
7430 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
7432 * config/rs6000/smmintrin.h (_mm_cvtepi8_epi16, _mm_cvtepi8_epi32,
7433 _mm_cvtepi8_epi64, _mm_cvtepi16_epi32, _mm_cvtepi16_epi64,
7434 _mm_cvtepi32_epi64, _mm_cvtepu8_epi16, _mm_cvtepu8_epi32,
7435 _mm_cvtepu8_epi64, _mm_cvtepu16_epi32, _mm_cvtepu16_epi64,
7436 _mm_cvtepu32_epi64): New.
7438 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
7440 * config/rs6000/smmintrin.h (_mm_test_all_zeros,
7441 _mm_test_all_ones, _mm_test_mix_ones_zeros): Rewrite as macro.
7443 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
7445 * config/rs6000/smmintrin.h (_mm_min_epi8, _mm_min_epu16,
7446 _mm_min_epi32, _mm_min_epu32, _mm_max_epi8, _mm_max_epu16,
7447 _mm_max_epi32, _mm_max_epu32): New.
7449 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
7451 * ipa-modref-tree.h (struct modref_access_node): Revert
7453 (struct modref_ref_node): Likewise.
7455 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
7457 * ipa-modref-tree.h (modref_tree::global_access_p): New member
7460 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
7461 ignore_stores_eaf_flags): Move to ipa-modref.h
7462 (remove_useless_eaf_flags): Remove early exit on NOCLOBBER.
7463 (modref_summary::global_memory_read_p): New member function.
7464 (modref_summary::global_memory_written_p): New member function.
7465 * ipa-modref.h (modref_summary::global_memory_read_p,
7466 modref_summary::global_memory_written_p): Declare.
7467 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
7468 ignore_stores_eaf_flags): move here.
7469 * tree-ssa-structalias.c: Include ipa-modref-tree.h, ipa-modref.h
7471 (handle_rhs_call): Rewrite.
7472 (handle_call_arg): New function.
7473 (determine_global_memory_access): New function.
7474 (handle_const_call): Remove
7475 (handle_pure_call): Remove
7476 (find_func_aliases_for_call): Update use of handle_rhs_call.
7477 (compute_points_to_sets): Handle global memory acccesses
7480 2021-10-11 Diane Meirowitz <diane.meirowitz@oracle.com>
7482 * doc/invoke.texi: Add link to UndefinedBehaviorSanitizer
7483 documentation, mention UBSAN_OPTIONS, similar to what is done
7484 for AddressSanitizer.
7486 2021-10-11 Richard Biener <rguenther@suse.de>
7488 PR middle-end/102683
7489 * internal-fn.c (expand_DEFERRED_INIT): Check for mode
7490 availability before building an integer type for storage
7493 2021-10-11 Richard Biener <rguenther@suse.de>
7495 PR middle-end/101480
7496 * gimple.c (gimple_call_fnspec): Do not mark operator new/delete
7499 2021-10-11 Martin Liska <mliska@suse.cz>
7501 * common.opt: Remove Init(2) for some options.
7502 * toplev.c (process_options): Do not use AUTODETECT_VALUE, but
7503 use rather OPTION_SET_P.
7505 2021-10-11 Martin Liska <mliska@suse.cz>
7507 * common.opt: Remove usage of IRA_REGION_AUTODETECT.
7508 * flag-types.h (enum ira_region): Likewise.
7509 * toplev.c (process_options): Use OPTION_SET_P instead of
7510 IRA_REGION_AUTODETECT.
7512 2021-10-11 Jakub Jelinek <jakub@redhat.com>
7514 * omp-low.c (omp_runtime_api_call): Handle omp_get_max_teams,
7515 omp_[sg]et_teams_thread_limit and omp_set_num_teams.
7517 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
7519 * config/s390/s390-protos.h (s390_rawmemchr): Add prototype.
7520 * config/s390/s390.c (s390_rawmemchr): New function.
7521 * config/s390/s390.md (rawmemchr<SINT:mode>): New expander.
7522 * config/s390/vector.md (@vec_vfees<mode>): Basically a copy of
7523 the pattern vfees<mode> from vx-builtins.md.
7524 * config/s390/vx-builtins.md (*vfees<mode>): Remove.
7526 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
7528 * builtins.c (get_memory_rtx): Change to external linkage.
7529 * builtins.h (get_memory_rtx): Add function prototype.
7530 * doc/md.texi (rawmemchr<mode>): Document.
7531 * internal-fn.c (expand_RAWMEMCHR): Define.
7532 * internal-fn.def (RAWMEMCHR): Add.
7533 * optabs.def (rawmemchr_optab): Add.
7534 * tree-loop-distribution.c (find_single_drs): Change return code
7535 behaviour by also returning true if no single store was found
7537 (loop_distribution::classify_partition): Respect the new return
7538 code behaviour of function find_single_drs.
7539 (loop_distribution::execute): Call new function
7540 transform_reduction_loop in order to replace rawmemchr or strlen
7541 like loops by calls into builtins.
7542 (generate_reduction_builtin_1): New function.
7543 (generate_rawmemchr_builtin): New function.
7544 (generate_strlen_builtin_1): New function.
7545 (generate_strlen_builtin): New function.
7546 (generate_strlen_builtin_using_rawmemchr): New function.
7547 (reduction_var_overflows_first): New function.
7548 (determine_reduction_stmt_1): New function.
7549 (determine_reduction_stmt): New function.
7550 (loop_distribution::transform_reduction_loop): New function.
7552 2021-10-11 Martin Liska <mliska@suse.cz>
7554 * tree.c (cl_option_hasher::hash): Use cl_optimization_hash
7555 and remove legacy hashing code.
7557 2021-10-11 Kito Cheng <kito.cheng@sifive.com>
7560 * builtins.c (maybe_emit_call_builtin___clear_cache): Allow
7561 CONST_INT for BEGIN and END, and use gcc_assert rather than
7564 2021-10-10 Jakub Jelinek <jakub@redhat.com>
7567 * var-tracking.c (add_stores): For cselib_sp_derived_value_p values
7568 use MO_VAL_SET if loc is not sp.
7570 2021-10-10 Andrew Pinski <apinski@marvell.com>
7572 PR tree-optimization/102622
7573 * match.pd: Swap the order of a?pow2cst:0 and a?-1:0 transformations.
7574 Swap the order of a?0:pow2cst and a?0:-1 transformations.
7576 2021-10-09 liuhongt <hongtao.liu@intel.com>
7579 * config/i386/i386-expand.c (ix86_valid_mask_cmp_mode): Handle
7581 (ix86_use_mask_cmp_p): Ditto.
7582 (ix86_expand_sse_movcc): Ditto.
7583 * config/i386/i386.md (setcc_hf_mask): New define_insn.
7584 (movhf_mask): Ditto.
7585 (UNSPEC_MOVCC_MASK): New unspec.
7586 * config/i386/sse.md (UNSPEC_PCMP): Move to i386.md.
7588 2021-10-08 Vladimir N. Makarov <vmakarov@redhat.com>
7590 PR rtl-optimization/102627
7591 * lra-constraints.c (split_reg): Use at least natural mode of hard reg.
7593 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
7595 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Grow
7598 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
7600 * value-range.cc (irange::debug): New.
7601 * value-range.h (irange::debug): New.
7603 2021-10-08 Richard Sandiford <richard.sandiford@arm.com>
7605 PR tree-optimization/102385
7606 * predict.h (change_edge_frequency): Declare.
7607 * predict.c (change_edge_frequency): New function.
7608 * tree-ssa-loop-manip.h (tree_transform_and_unroll_loop): Remove
7610 (tree_unroll_loop): Likewise.
7611 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Update accordingly.
7612 * tree-predcom.c (pcom_worker::tree_predictive_commoning_loop):
7614 * tree-ssa-loop-prefetch.c (loop_prefetch_arrays): Likewise.
7615 * tree-ssa-loop-manip.c (tree_unroll_loop): Likewise.
7616 (tree_transform_and_unroll_loop): Likewise. Use single_dom_exit
7617 to retrieve the exit edges. Make all the old profile update code
7618 conditional on !single_loop_p -- the case it was written for --
7619 and use a different approach for the single-loop case.
7621 2021-10-08 Martin Liska <mliska@suse.cz>
7623 * config/alpha/alpha.c (alpha_option_override): Use new macro
7625 * config/arc/arc.c (arc_override_options): Likewise.
7626 * config/arm/arm.c (arm_option_override): Likewise.
7627 * config/bfin/bfin.c (bfin_load_pic_reg): Likewise.
7628 * config/c6x/c6x.c (c6x_option_override): Likewise.
7629 * config/csky/csky.c: Likewise.
7630 * config/darwin.c (darwin_override_options): Likewise.
7631 * config/frv/frv.c (frv_option_override): Likewise.
7632 * config/i386/djgpp.h: Likewise.
7633 * config/i386/i386.c (ix86_stack_protect_guard): Likewise.
7634 (ix86_max_noce_ifcvt_seq_cost): Likewise.
7635 * config/ia64/ia64.c (ia64_option_override): Likewise.
7636 (ia64_override_options_after_change): Likewise.
7637 * config/m32c/m32c.c (m32c_option_override): Likewise.
7638 * config/m32r/m32r.c (m32r_init): Likewise.
7639 * config/m68k/m68k.c (m68k_option_override): Likewise.
7640 * config/microblaze/microblaze.c (microblaze_option_override): Likewise.
7641 * config/mips/mips.c (mips_option_override): Likewise.
7642 * config/nios2/nios2.c (nios2_option_override): Likewise.
7643 * config/nvptx/nvptx.c (nvptx_option_override): Likewise.
7644 * config/pa/pa.c (pa_option_override): Likewise.
7645 * config/riscv/riscv.c (riscv_option_override): Likewise.
7646 * config/rs6000/aix71.h: Likewise.
7647 * config/rs6000/aix72.h: Likewise.
7648 * config/rs6000/aix73.h: Likewise.
7649 * config/rs6000/rs6000.c (darwin_rs6000_override_options): Likewise.
7650 (rs6000_override_options_after_change): Likewise.
7651 (rs6000_linux64_override_options): Likewise.
7652 (glibc_supports_ieee_128bit): Likewise.
7653 (rs6000_option_override_internal): Likewise.
7654 (rs6000_file_start): Likewise.
7655 (rs6000_darwin_file_start): Likewise.
7656 * config/rs6000/rtems.h: Likewise.
7657 * config/rs6000/sysv4.h: Likewise.
7658 * config/rs6000/vxworks.h (SUB3TARGET_OVERRIDE_OPTIONS): Likewise.
7659 * config/s390/s390.c (s390_option_override): Likewise.
7660 * config/sh/linux.h: Likewise.
7661 * config/sh/netbsd-elf.h (while): Likewise.
7662 * config/sh/sh.c (sh_option_override): Likewise.
7663 * config/sol2.c (solaris_override_options): Likewise.
7664 * config/sparc/sparc.c (sparc_option_override): Likewise.
7665 * config/tilegx/tilegx.c (tilegx_option_override): Likewise.
7666 * config/visium/visium.c (visium_option_override): Likewise.
7667 * config/vxworks.c (vxworks_override_options): Likewise.
7668 * lto-opts.c (lto_write_options): Likewise.
7669 * omp-expand.c (expand_omp_simd): Likewise.
7670 * omp-general.c (omp_max_vf): Likewise.
7671 * omp-offload.c (oacc_xform_loop): Likewise.
7672 * opts.h (OPTION_SET_P): Likewise.
7673 * targhooks.c (default_max_noce_ifcvt_seq_cost): Likewise.
7674 * toplev.c (process_options): Likewise.
7675 * tree-predcom.c: Likewise.
7676 * tree-sra.c (analyze_all_variable_accesses): Likewise.
7678 2021-10-08 liuhongt <hongtao.liu@intel.com>
7681 * config/i386/i386.c (ix86_optab_supported_p):
7682 Return true for HFmode.
7683 * match.pd: Simplify (_Float16) ceil ((double) x) to
7684 __builtin_ceilf16 (a) when a is _Float16 type and
7685 direct_internal_fn_supported_p.
7687 2021-10-08 liuhongt <hongtao.liu@intel.com>
7690 * config/i386/i386-expand.c (emit_reduc_half): Hanlde V4HImode.
7691 * config/i386/mmx.md (reduc_plus_scal_v4hi): New.
7692 (reduc_<code>_scal_v4hi): New.
7694 2021-10-08 liuhongt <hongtao.liu@intel.com>
7696 * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize).
7697 * doc/invoke.texi (Options That Control Optimization): Update
7699 * opts.c (default_options_table): Enable auto-vectorization at
7700 O2 with very-cheap cost model.
7701 (finish_options): Use cheap cost model for
7702 explicit -ftree{,-loop}-vectorize.
7704 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
7706 * ctfc.c (ctfc_delete_container): Free hash table contents.
7708 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
7710 * toplev.c (process_options): Do not warn for GNU GIMPLE.
7712 2021-10-07 Siddhesh Poyarekar <siddhesh@gotplt.org>
7714 * tree-object-size.c (addr_object_size,
7715 compute_builtin_object_size): Drop PDECL and POFF arguments.
7716 (addr_object_size): Adjust calls.
7717 * tree-object-size.h (compute_builtin_object_size): Drop PDECL
7720 2021-10-07 Roger Sayle <roger@nextmovesoftware.com>
7722 * rtl.def (SMUL_HIGHPART, UMUL_HIGHPART): New RTX codes for
7723 representing signed and unsigned high-part multiplication resp.
7724 * simplify-rtx.c (simplify_binary_operation_1) [SMUL_HIGHPART,
7725 UMUL_HIGHPART]: Simplify high-part multiplications by zero.
7726 [SS_PLUS, US_PLUS, SS_MINUS, US_MINUS, SS_MULT, US_MULT,
7727 SS_DIV, US_DIV]: Similar simplifications for saturating
7729 (simplify_const_binary_operation) [SS_PLUS, US_PLUS, SS_MINUS,
7730 US_MINUS, SS_MULT, US_MULT, SMUL_HIGHPART, UMUL_HIGHPART]:
7731 Implement compile-time evaluation for constant operands.
7732 * dwarf2out.c (mem_loc_descriptor): Skip SMUL_HIGHPART and
7734 * doc/rtl.texi (smul_highpart, umul_highpart): Document RTX codes.
7735 * doc/md.texi (smul@var{m}3_highpart, umul@var{m3}_highpart):
7736 Mention the new smul_highpart and umul_highpart RTX codes.
7737 * doc/invoke.texi: Silence @xref "compilation" warnings.
7739 2021-10-07 Martin Jambor <mjambor@suse.cz>
7742 * ipa-prop.c (ipa_edge_args_sum_t::duplicate): Also handle the
7743 case when the source reference description corresponds to a
7744 referance taken in a function src->caller is inlined to.
7746 2021-10-07 Jan Hubicka <hubicka@ucw.cz>
7749 * ipa-modref-tree.h (modref_access_node::contains_p): Handle offsets
7751 (modref_access_node::try_merge_with): Add sanity check that there
7752 are no redundant entries in the list.
7754 2021-10-07 Richard Biener <rguenther@suse.de>
7756 PR tree-optimization/102608
7757 * tree-ssa-sccvn.c (visit_stmt): Drop .DEFERRED_INIT to
7760 2021-10-07 Martin Liska <mliska@suse.cz>
7762 * toplev.c (toplev::main): Make
7763 save_opt_decoded_options a pointer type
7764 * toplev.h: Likewise.
7766 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
7768 * config/gcn/gcn-valu.md (gather<mode>_insn_2offsets<exec>): Apply
7769 HAVE_GCN_ASM_GLOBAL_LOAD_FIXED.
7770 (scatter<mode>_insn_2offsets<exec_scatter>): Likewise.
7772 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
7774 * config/gcn/gcn-hsa.h (SRAMOPT): Include the whole option string.
7775 Adjust for new -msram-ecc=any behaviour.
7776 (ASM_SPEC): Adjust -mxnack and -msram-ecc usage.
7777 * config/gcn/gcn.c (output_file_start): Implement -msram-ecc=any.
7778 * config/gcn/mkoffload.c (EF_AMDGPU_XNACK): Rename to ...
7779 (EF_AMDGPU_XNACK_V3): ... this.
7780 (EF_AMDGPU_SRAM_ECC): Rename to ...
7781 (EF_AMDGPU_SRAM_ECC_V3): ... this.
7782 (EF_AMDGPU_FEATURE_XNACK_V4): New.
7783 (EF_AMDGPU_FEATURE_XNACK_UNSUPPORTED_V4): New.
7784 (EF_AMDGPU_FEATURE_XNACK_ANY_V4): New.
7785 (EF_AMDGPU_FEATURE_XNACK_OFF_V4): New.
7786 (EF_AMDGPU_FEATURE_XNACK_ON_V4): New.
7787 (EF_AMDGPU_FEATURE_SRAMECC_V4): New.
7788 (EF_AMDGPU_FEATURE_SRAMECC_UNSUPPORTED_V4): New.
7789 (EF_AMDGPU_FEATURE_SRAMECC_ANY_V4): New.
7790 (EF_AMDGPU_FEATURE_SRAMECC_OFF_V4): New.
7791 (EF_AMDGPU_FEATURE_SRAMECC_ON_V4): New.
7792 (SET_XNACK_ON): New.
7793 (SET_XNACK_OFF): New.
7795 (SET_SRAM_ECC_ON): New.
7796 (SET_SRAM_ECC_ANY): New.
7797 (SET_SRAM_ECC_OFF): New.
7798 (TEST_SRAM_ECC_ANY): New.
7799 (TEST_SRAM_ECC_ON): New.
7800 (main): Implement HSACOv4 and -msram-ecc=any.
7802 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
7804 * config.in: Regenerate.
7805 * config/gcn/gcn-hsa.h (X_FIJI): New macro.
7809 (A_FIJI): Rename to ...
7811 (A_900): Rename to ...
7813 (A_906): Rename to ...
7815 (A_908): Rename to ...
7817 (SRAMOPT): New macro.
7818 (ASM_SPEC): Adjust xnack option usage.
7819 * config/gcn/gcn.c (output_file_start): Adjust amdgcn_target usage.
7820 * configure: Regenerate.
7821 * configure.ac: Detect LLVM assembler dialect.
7823 2021-10-07 Richard Biener <rguenther@suse.de>
7825 * tree-pretty-print.c (dump_generic_node): Do not elide
7826 printing '&' when dumping with -gimple.
7828 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
7830 * gimple-range-cache.cc (non_null_ref::adjust_range): Call new
7832 * gimple-range-fold.cc (adjust_pointer_diff_expr): Ditto.
7833 (adjust_imagpart_expr): Ditto.
7834 * value-range.cc (irange::irange_intersect): Call new routine if
7835 RHS is a single pair.
7836 (irange::intersect): New wide_int version.
7837 * value-range.h (class irange): New prototype.
7839 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
7841 * gimple-range-edge.cc (gimple_outgoing_range::gimple_outgoing_range):
7842 Add parameter to limit size when recognizing switches.
7843 (gimple_outgoing_range::edge_range_p): Check size limit.
7844 * gimple-range-edge.h (gimple_outgoing_range): Add size field.
7845 * gimple-range-gori.cc (gori_map::calculate_gori): Ignore switches
7846 that exceed the size limit.
7847 (gori_compute::gori_compute): Add initializer.
7848 * params.opt (evrp-switch-limit): New.
7849 * doc/invoke.texi: Update docs.
7851 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
7853 * value-range.h (irange::set_varying): Use TYPE_MIN_VALUE and
7854 TYPE_MAX_VALUE instead of creating new trees when possible.
7856 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
7858 * gimple-range-cache.cc (non_null_ref::adjust_range): Check for
7859 zero and non-zero more efficently.
7861 2021-10-06 Richard Biener <rguenther@suse.de>
7864 * dumpfile.h (TDF_GIMPLE_VAL): New.
7865 (dump_flag): Re-order and adjust TDF_* flags. Make
7866 the enum uint32_t. Use std::underlying_type in the
7868 (optgroup_flag): Likewise for the operator overloads.
7869 * tree-pretty-print.c (dump_generic_node): Wrap ADDR_EXPR
7870 in _Literal if TDF_GIMPLE_VAL.
7871 * gimple-pretty-print.c (dump_gimple_assign): Add
7872 TDF_GIMPLE_VAL to flags when dumping operands where only
7873 is_gimple_val are allowed.
7874 (dump_gimple_cond): Likewise.
7876 2021-10-06 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
7878 * gimple-isel.cc (gimple_expand_vec_cond_expr): Remove redundant if
7881 2021-10-05 qing zhao <qing.zhao@oracle.com>
7883 PR middle-end/102359
7884 * gimplify.c (gimplify_decl_expr): Not add initialization for an
7885 auto variable when it has been initialized by frontend.
7887 2021-10-05 Aldy Hernandez <aldyh@redhat.com>
7889 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
7892 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
7894 * common/config/avr/avr-common.c (avr_handle_option): Mark
7895 argument as ATTRIBUTE_UNUSED.
7897 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
7899 * config/lm32/uclinux-elf.h (LINK_GCC_C_SEQUENCE_SPEC):
7900 Undefine before redefinition.
7902 2021-10-05 Richard Biener <rguenther@suse.de>
7904 * toplev.c (no_backend): Remove global var.
7905 (process_options): Pass in no_backend, move post_options
7906 langhook call to toplev::main.
7907 (do_compile): Pass in no_backend, move process_options call
7909 (toplev::run_self_tests): Check no_backend at the caller.
7910 (toplev::main): Call post_options and process_options
7911 split out from do_compile, do self-tests only if
7912 no_backend is initialized.
7914 2021-10-05 Richard Biener <rguenther@suse.de>
7916 * tree-cfg.c (dump_function_to_file): Dump the UID of the
7917 function as part of the name when requested.
7918 * tree-pretty-print.c (dump_function_name): Dump the UID when
7919 requested and the langhook produced the actual name.
7921 2021-10-05 Richard Biener <rguenther@suse.de>
7923 PR middle-end/102587
7924 PR middle-end/102285
7925 * internal-fn.c (expand_DEFERRED_INIT): Fall back to
7926 zero-initialization as last resort, use the constant
7927 size as given by the DEFERRED_INIT argument to build
7930 2021-10-04 Marek Polacek <polacek@redhat.com>
7933 * doc/invoke.texi: Document -Warray-compare.
7935 2021-10-04 Richard Biener <rguenther@suse.de>
7937 * gimplify.c (is_var_need_auto_init): DECL_HARD_REGISTER
7938 variables are not to be initialized.
7940 2021-10-04 Richard Biener <rguenther@suse.de>
7942 * expr.h (non_mem_decl_p): Declare.
7943 (mem_ref_refers_to_non_mem_p): Likewise.
7944 * expr.c (non_mem_decl_p): Export.
7945 (mem_ref_refers_to_non_mem_p): Likewise.
7946 * internal-fn.c (expand_DEFERRED_INIT): Do not expand the LHS
7947 but check the base with mem_ref_refers_to_non_mem_p
7950 2021-10-04 Richard Biener <rguenther@suse.de>
7952 PR tree-optimization/102570
7953 * tree-ssa-sccvn.h (vn_reference_op_struct): Document
7954 we are using clique for the internal function code.
7955 * tree-ssa-sccvn.c (vn_reference_op_eq): Compare the
7956 internal function code.
7957 (print_vn_reference_ops): Print the internal function code.
7958 (vn_reference_op_compute_hash): Hash it.
7959 (copy_reference_ops_from_call): Record it.
7960 (visit_stmt): Remove the restriction around internal function
7962 (fully_constant_vn_reference_p): Use fold_const_call and handle
7964 (vn_reference_eq): Compare call return types.
7965 * tree-ssa-pre.c (create_expression_by_pieces): Handle
7966 generating calls to internal functions.
7967 (compute_avail): Remove the restriction around internal function
7970 2021-10-04 Aldy Hernandez <aldyh@redhat.com>
7972 PR tree-optimization/102560
7973 * gimple-ssa-warn-alloca.c (alloca_call_type): Remove static
7974 marker for invalid_range.
7976 2021-10-04 Richard Biener <rguenther@suse.de>
7978 PR middle-end/102587
7979 * internal-fn.c (expand_DEFERRED_INIT): Guard register
7980 initialization path an avoid initializing VLA registers
7983 2021-10-04 Eric Botcazou <ebotcazou@adacore.com>
7985 * config/rs6000/vxworks.h (TARGET_INIT_LIBFUNCS): Delete.
7987 2021-10-03 Martin Liska <mliska@suse.cz>
7989 * toplev.c (toplev::main): Check opt_index if it is a part
7992 2021-10-02 Aldy Hernandez <aldyh@redhat.com>
7994 PR tree-optimization/102563
7995 * range-op.cc (operator_lshift::op1_range): Do not clobber
7998 2021-10-02 Martin Liska <mliska@suse.cz>
8000 * toplev.c (toplev::main): save_decoded_options[0] is program
8001 name and so it should be skipped.
8003 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
8005 PR tree-optimization/102546
8006 * range-op.cc (operator_lshift::op1_range): Teach range-ops that
8007 X << Y is non-zero implies X is also non-zero.
8009 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8011 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8013 * config/aarch64/aarch64-tune.md: Regenerate.
8014 * doc/invoke.texi: Update docs.
8016 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8018 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8020 * config/aarch64/aarch64-tune.md: Regenerate.
8021 * doc/invoke.texi: Update docs.
8023 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8025 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8027 * config/aarch64/aarch64-tune.md: Regenerate.
8028 * doc/invoke.texi: Update docs.
8030 2021-10-01 Martin Sebor <msebor@redhat.com>
8033 * doc/invoke.texi (-Waddress): Update.
8034 * gengtype.c (write_types): Avoid -Waddress.
8035 * poly-int.h (POLY_SET_COEFF): Avoid using null.
8037 2021-10-01 John David Anglin <danglin@gcc.gnu.org>
8040 * config/pa/pa.c (pa_option_override): Default to dwarf version 4
8043 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8045 * config/aarch64/aarch64.h (AARCH64_FL_V9): Update value.
8047 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
8049 * gimple-range-path.cc (path_range_query::compute_ranges): Use
8051 * gimple-range-path.h (class path_range_query): Remove shadowed
8053 (path_range_query::get_path_oracle): New.
8055 2021-10-01 Jakub Jelinek <jakub@redhat.com>
8056 Richard Biener <rguenther@suse.de>
8059 * doc/invoke.texi (-fsanitize=integer-divide-by-zero): Remove
8060 INT_MIN / -1 division detection from here ...
8061 (-fsanitize=signed-integer-overflow): ... and add it here.
8063 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8065 * config/aarch64/aarch64-arches.def (AARCH64_ARCH): Added
8067 * config/aarch64/aarch64.h (AARCH64_FL_V9): New.
8068 (AARCH64_FL_FOR_ARCH9): New flags for Armv9-A.
8069 (AARCH64_ISA_V9): New ISA flag.
8070 * doc/invoke.texi: Update docs.
8072 2021-10-01 Martin Liska <mliska@suse.cz>
8074 * toplev.c (toplev::main): Save decoded optimization options.
8075 * toplev.h (save_opt_decoded_options): New.
8076 * doc/extend.texi: Be more clear about optimize and target
8079 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
8081 * explow.c: Include langhooks.h.
8082 (set_stack_check_libfunc): Build a proper function type.
8084 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
8087 * config/i386/i386.c (legitimate_pic_address_disp_p): For PE-COFF do
8088 not return true for external weak function symbols in medium model.
8090 2021-10-01 Jakub Jelinek <jakub@redhat.com>
8092 * tree.h (OMP_CLAUSE_ORDER_REPRODUCIBLE): Define.
8093 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_ORDER>: Print
8094 reproducible: for OMP_CLAUSE_ORDER_REPRODUCIBLE.
8095 * omp-general.c (omp_extract_for_data): If OMP_CLAUSE_ORDER is seen
8096 without OMP_CLAUSE_ORDER_UNCONSTRAINED, overwrite sched_kind to
8097 OMP_CLAUSE_SCHEDULE_STATIC.
8099 2021-10-01 Richard Biener <rguenther@suse.de>
8101 PR middle-end/102518
8102 * tree-inline.c (setup_one_parameter): Avoid substituting
8103 an invariant into contexts where a GIMPLE register is not valid.
8105 2021-09-30 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8107 * config/arm/arm-cpus.in: Add Cortex-R52+ CPU.
8108 * config/arm/arm-tables.opt: Regenerate.
8109 * config/arm/arm-tune.md: Regenerate.
8110 * doc/invoke.texi: Update docs.
8112 2021-09-30 Uroš Bizjak <ubizjak@gmail.com>
8115 * config/i386/i386.md
8116 (sign_extend:WIDE (any_logic:NARROW (memory, immediate)) splitters):
8119 2021-09-30 Tobias Burnus <tobias@codesourcery.com>
8121 * omp-low.c (omp_runtime_api_call): Add omp_aligned_{,c}alloc and
8122 omp_{c,re}alloc, fix omp_alloc/omp_free.
8124 2021-09-30 Martin Liska <mliska@suse.cz>
8126 * defaults.h (ASM_OUTPUT_ASCII): Do not hide global variable
8127 asm_out_file and stream directly to MYFILE.
8129 2021-09-30 Richard Biener <rguenther@suse.de>
8131 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
8132 Restore and fix condition under which we apply npeel to
8133 the DRs misalignment value.
8135 2021-09-30 Richard Biener <rguenther@suse.de>
8137 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
8138 Fix npeel check for variable amount of peeling.
8140 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8142 * lto-wrapper.c (run_gcc): Plug snprintf overflow.
8144 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8146 * gimple-range.cc (gimple_ranger::debug): New.
8147 * gimple-range.h (class gimple_ranger): Add debug.
8149 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8151 PR middle-end/102519
8152 * tree-vrp.c (hybrid_threader::~hybrid_threader): Free m_query.
8154 2021-09-29 Indu Bhagat <indu.bhagat@oracle.com>
8157 * btfout.c (GTY): Add GTY (()) albeit for cosmetic only purpose.
8158 (btf_finalize): Empty the hash_map btf_var_ids.
8160 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
8162 * tree-vrp.c (thread_through_all_blocks): Return bool.
8163 (execute_vrp_threader): Return TODO_* flags.
8164 (pass_data_vrp_threader): Set todo_flags_finish to 0.
8166 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
8168 * timevar.def (TV_TREE_VRP_THREADER): New.
8169 * tree-vrp.c: Use TV_TREE_VRP_THREADER for VRP threader pass.
8171 2021-09-29 David Faust <david.faust@oracle.com>
8173 * config.gcc (bpf-*-*): Do not overwrite extra_headers.
8175 2021-09-29 Jonathan Wright <jonathan.wright@arm.com>
8177 * config/aarch64/aarch64-builtins.c (TYPES_BINOP_PPU): Define
8178 new type qualifier enum.
8179 (TYPES_TERNOP_SSSU): Likewise.
8180 (TYPES_TERNOP_PPPU): Likewise.
8181 * config/aarch64/aarch64-simd-builtins.def: Define PPU, SSU,
8182 PPPU and SSSU builtin generator macros for qtbl1 and qtbx1
8184 * config/aarch64/arm_neon.h (vqtbl1_p8): Use type-qualified
8185 builtin and remove casts.
8186 (vqtbl1_s8): Likewise.
8187 (vqtbl1q_p8): Likewise.
8188 (vqtbl1q_s8): Likewise.
8189 (vqtbx1_s8): Likewise.
8190 (vqtbx1_p8): Likewise.
8191 (vqtbx1q_s8): Likewise.
8192 (vqtbx1q_p8): Likewise.
8193 (vtbl1_p8): Likewise.
8194 (vtbl2_p8): Likewise.
8195 (vtbx2_p8): Likewise.
8197 2021-09-29 Richard Biener <rguenther@suse.de>
8199 * tree-vect-data-refs.c (vect_dr_misalign_for_aligned_access):
8201 (vect_update_misalignment_for_peel): Use it to update
8202 misaligned to the value necessary for an aligned access.
8203 (vect_get_peeling_costs_all_drs): Likewise.
8204 (vect_enhance_data_refs_alignment): Likewise.
8206 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
8208 * config/aarch64/aarch64.c (aarch64_expand_cpymem): Count number of
8209 emitted operations and adjust heuristic for code size.
8211 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
8213 * config/aarch64/aarch64.c (aarch64_expand_setmem): Count number of
8214 emitted operations and adjust heuristic for code size.
8216 2021-09-29 Jakub Jelinek <jakub@redhat.com>
8218 PR middle-end/102504
8219 * gimplify.c (gimplify_scan_omp_clauses): Use omp_check_private even
8220 in OMP_SCOPE clauses, not just on worksharing construct clauses.
8222 2021-09-28 Geng Qi <gengqi@linux.alibaba.com>
8224 * config/riscv/riscv.md (mulv<mode>4): Call gen_smul<mode>3_highpart.
8225 (<u>mulditi3): Call <su>muldi3_highpart.
8226 (<u>muldi3_highpart): Rename to <su>muldi3_highpart.
8227 (<u>mulsidi3): Call <su>mulsi3_highpart.
8228 (<u>mulsi3_highpart): Rename to <su>mulsi3_highpart.
8230 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
8232 * config/darwin.h (DSYMUTIL_SPEC): Recognize D sources.
8234 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
8236 * config/rs6000/darwin.h (FIXED_R13): Add for PPC64.
8237 (FIRST_SAVED_GP_REGNO): Save from R13 even when it is one
8240 2021-09-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
8242 * config/aarch64/aarch64.h (AARCH64_FL_LS64): Define
8243 (AARCH64_FL_V8_7): Likewise.
8244 (AARCH64_FL_FOR_ARCH8_7): Likewise.
8245 * config/aarch64/aarch64-arches.def (armv8.7-a): Define.
8246 * config/aarch64/aarch64-option-extensions.def (ls64): Define.
8247 * doc/invoke.texi: Document the above.
8249 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8251 * dbgcnt.c (dbg_cnt_counter): New.
8252 * dbgcnt.h (dbg_cnt_counter): New.
8253 * dumpfile.c (dump_options): Add entry for TDF_THREADING.
8254 * dumpfile.h (enum dump_flag): Add TDF_THREADING.
8255 * gimple-range-path.cc (DEBUG_SOLVER): Use TDF_THREADING.
8256 * tree-ssa-threadupdate.c (dump_jump_thread_path): Dump out
8259 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8261 * cfgcleanup.c (pass_jump::execute): Check
8262 flag_expensive_optimizations.
8263 (pass_jump_after_combine::gate): Same.
8264 * doc/invoke.texi (-fthread-jumps): Enable for -O1.
8265 * opts.c (default_options_table): Enable -fthread-jumps at -O1.
8266 * tree-ssa-threadupdate.c
8267 (fwd_jt_path_registry::remove_jump_threads_including): Bail unless
8270 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
8272 * tree-ssa-reassoc.c (biased_names): New global.
8273 (propagate_bias_p): New function.
8274 (loop_carried_phi): Remove.
8275 (propagate_rank): Propagate bias along single uses.
8276 (get_rank): Update biased_names when needed.
8278 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
8280 * passes.def (pass_reassoc): Rename parameter to early_p.
8281 * tree-ssa-reassoc.c (reassoc_bias_loop_carried_phi_ranks_p):
8283 (phi_rank): Don't bias loop-carried phi ranks
8284 before vectorization pass.
8285 (execute_reassoc): Add bias_loop_carried_phi_ranks_p parameter.
8286 (pass_reassoc::pass_reassoc): Add bias_loop_carried_phi_ranks_p
8288 (pass_reassoc::set_param): Set bias_loop_carried_phi_ranks_p
8290 (pass_reassoc::execute): Pass bias_loop_carried_phi_ranks_p to
8292 (pass_reassoc::bias_loop_carried_phi_ranks_p): New member.
8294 2021-09-28 Jakub Jelinek <jakub@redhat.com>
8297 * config/i386/i386.c (standard_80387_constant_p): Don't recognize
8298 special 80387 instruction XFmode constants if flag_rounding_math.
8300 2021-09-28 Richard Biener <rguenther@suse.de>
8302 PR tree-optimization/100112
8303 * tree-ssa-sccvn.c (visit_reference_op_load): Record the
8304 referece into the hashtable twice in case last_vuse is
8305 different from the original vuse on the stmt.
8307 2021-09-28 Jakub Jelinek <jakub@redhat.com>
8309 PR middle-end/102492
8310 * gimplify.c (gimplify_adjust_omp_clauses_1): Don't call the
8311 omp_finish_clause langhook on implicitly added OMP_CLAUSE_PRIVATE
8312 clauses on SIMD constructs.
8314 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8316 PR tree-optimization/102511
8317 * gimple-range-path.cc (path_range_query::range_on_path_entry):
8318 Return VARYING when nothing found.
8320 2021-09-28 Hongyu Wang <hongyu.wang@intel.com>
8323 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): Add
8325 (VALID_SSE2_REG_VHF_MODE): Add V4HFmode and V2HFmode.
8326 (VALID_MMX_REG_MODE): Add V4HFmode.
8327 (SSE_REG_MODE_P): Replace VALID_AVX512FP16_REG_MODE with
8328 vector mode condition.
8329 * config/i386/i386.c (classify_argument): Parse V4HF/V2HF
8331 (function_arg_32): Add V4HFmode.
8332 (function_arg_advance_32): Likewise.
8333 * config/i386/i386.md (mode): Add V4HF/V2HF.
8334 (MODE_SIZE): Likewise.
8335 * config/i386/mmx.md (MMXMODE): Add V4HF mode.
8336 (V_32): Add V2HF mode.
8337 (VHF_32_64): New mode iterator.
8338 (*mov<mode>_internal): Adjust sse alternatives to support
8340 (*mov<mode>_internal): Adjust sse alternatives to support
8342 (<insn><mode>3): New define_insn for add/sub/mul/div.
8344 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8346 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Check
8348 (pass_early_thread_jumps::gate): Same.
8349 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
8350 Return if !flag_thread_jumps.
8351 * tree-ssa-threadupdate.c
8352 (jt_path_registry::register_jump_thread): Assert that
8353 flag_thread_jumps is true.
8355 2021-09-28 liuhongt <hongtao.liu@intel.com>
8358 (simplify_context::simplify_binary_operation_1): Relax
8359 condition of simplifying (vec_concat:M (vec_select op0
8360 index0)(vec_select op1 index1)) to allow different modes
8361 between op0 and M, but have same inner mode.
8363 2021-09-28 liuhongt <hongtao.liu@intel.com>
8365 * config/i386/i386-expand.c (emit_reduc_half): Handle
8366 V8HF/V16HF/V32HFmode.
8367 * config/i386/sse.md (REDUC_SSE_PLUS_MODE): Add V8HF.
8368 (REDUC_SSE_SMINMAX_MODE): Ditto.
8369 (REDUC_PLUS_MODE): Add V16HF and V32HF.
8370 (REDUC_SMINMAX_MODE): Ditto.
8372 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
8374 * gimple-range-path.cc
8375 (path_range_query::precompute_ranges_in_block): Rename to...
8376 (path_range_query::compute_ranges_in_block): ...this.
8377 (path_range_query::precompute_ranges): Rename to...
8378 (path_range_query::compute_ranges): ...this.
8379 (path_range_query::precompute_relations): Rename to...
8380 (path_range_query::compute_relations): ...this.
8381 (path_range_query::precompute_phi_relations): Rename to...
8382 (path_range_query::compute_phi_relations): ...this.
8383 * gimple-range-path.h: Rename precompute* to compute*.
8384 * tree-ssa-threadbackward.c
8385 (back_threader::find_taken_edge_switch): Same.
8386 (back_threader::find_taken_edge_cond): Same.
8387 * tree-ssa-threadedge.c
8388 (hybrid_jt_simplifier::compute_ranges_from_state): Same.
8389 (hybrid_jt_state::register_equivs_stmt): Inline...
8390 * tree-ssa-threadedge.h: ...here.
8392 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
8394 * tree-vrp.c (lhs_of_dominating_assert): Remove.
8395 (class vrp_jt_state): Remove.
8396 (class vrp_jt_simplifier): Remove.
8397 (vrp_jt_simplifier::simplify): Remove.
8398 (class vrp_jump_threader): Remove.
8399 (vrp_jump_threader::vrp_jump_threader): Remove.
8400 (vrp_jump_threader::~vrp_jump_threader): Remove.
8401 (vrp_jump_threader::before_dom_children): Remove.
8402 (vrp_jump_threader::after_dom_children): Remove.
8404 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
8406 * passes.def (pass_vrp_threader): New.
8407 * tree-pass.h (make_pass_vrp_threader): Add make_pass_vrp_threader.
8408 * tree-ssa-threadedge.c (hybrid_jt_state::register_equivs_stmt): New.
8409 (hybrid_jt_simplifier::hybrid_jt_simplifier): New.
8410 (hybrid_jt_simplifier::simplify): New.
8411 (hybrid_jt_simplifier::compute_ranges_from_state): New.
8412 * tree-ssa-threadedge.h (class hybrid_jt_state): New.
8413 (class hybrid_jt_simplifier): New.
8414 * tree-vrp.c (execute_vrp): Remove ASSERT_EXPR based jump
8416 (class hybrid_threader): New.
8417 (hybrid_threader::hybrid_threader): New.
8418 (hybrid_threader::~hybrid_threader): New.
8419 (hybrid_threader::before_dom_children): New.
8420 (hybrid_threader::after_dom_children): New.
8421 (execute_vrp_threader): New.
8422 (class pass_vrp_threader): New.
8423 (make_pass_vrp_threader): New.
8425 2021-09-27 Martin Liska <mliska@suse.cz>
8427 * output.h (enum section_flag): New.
8428 (SECTION_FORGET): Remove.
8429 (SECTION_ENTSIZE): Make it (1UL << 8) - 1.
8430 (SECTION_STYLE_MASK): Define it based on other enum
8432 * varasm.c (switch_to_section): Remove unused handling of
8435 2021-09-27 Martin Liska <mliska@suse.cz>
8437 * common.opt: Add new variable flag_default_complex_method.
8438 * opts.c (finish_options): Handle flags related to
8439 x_flag_complex_method.
8440 * toplev.c (process_options): Remove option handling related
8441 to flag_complex_method.
8443 2021-09-27 Richard Biener <rguenther@suse.de>
8445 PR middle-end/102450
8446 * gimple-fold.c (gimple_fold_builtin_memory_op): Avoid using
8447 type_for_size, instead use int_mode_for_size.
8449 2021-09-27 Andrew Pinski <apinski@marvell.com>
8452 * gimplify.c (gimplify_save_expr): Return early
8453 if the type of val is error_mark_node.
8455 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
8457 * tree-ssanames.c (ssa_name_has_boolean_range): Use
8460 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
8462 * gimple-ssa-evrp-analyze.h (class evrp_range_analyzer): Remove
8463 vrp_visit_cond_stmt.
8464 * tree-ssa-dom.c (cprop_operand): Convert to range_query API.
8465 (cprop_into_stmt): Same.
8466 (dom_opt_dom_walker::optimize_stmt): Same.
8468 2021-09-27 Richard Biener <rguenther@suse.de>
8470 PR tree-optimization/97351
8471 PR tree-optimization/97352
8472 PR tree-optimization/82426
8473 * tree-vectorizer.h (dr_misalignment): Add vector type
8475 (aligned_access_p): Likewise.
8476 (known_alignment_for_access_p): Likewise.
8477 (vect_supportable_dr_alignment): Likewise.
8478 (vect_known_alignment_in_bytes): Likewise. Refactor.
8479 (DR_MISALIGNMENT): Remove.
8480 (vect_update_shared_vectype): Likewise.
8481 * tree-vect-data-refs.c (dr_misalignment): Refactor, handle
8482 a vector type with larger alignment requirement and apply
8483 the negative step adjustment here.
8484 (vect_calculate_target_alignment): Remove.
8485 (vect_compute_data_ref_alignment): Get explicit vector type
8486 argument, do not apply a negative step alignment adjustment
8488 (vect_slp_analyze_node_alignment): Re-analyze alignment
8489 when we re-visit the DR with a bigger desired alignment but
8490 keep more precise results from smaller alignments.
8491 * tree-vect-slp.c (vect_update_shared_vectype): Remove.
8492 (vect_slp_analyze_node_operations_1): Do not update the
8493 shared vector type on stmts.
8494 * tree-vect-stmts.c (vect_analyze_stmt): Push/pop the
8495 vector type of an SLP node to the representative stmt-info.
8496 (vect_transform_stmt): Likewise.
8498 2021-09-27 liuhongt <hongtao.liu@intel.com>
8501 2021-09-09 liuhongt <hongtao.liu@intel.com>
8504 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
8505 (reduc_plus_scal_v4sf): .. this, New define_expand.
8506 (reduc_plus_scal_v2df): .. and this, New define_expand.
8508 2021-09-26 liuhongt <hongtao.liu@intel.com>
8510 * doc/extend.texi (Half-Precision): Remove storage only
8511 description for _Float16 w/o avx512fp16.
8513 2021-09-25 Dimitar Dimitrov <dimitar@dinux.eu>
8515 * config/pru/constraints.md (Rrio): New constraint.
8516 * config/pru/predicates.md (regio_operand): New predicate.
8517 * config/pru/pru-pragma.c (pru_register_pragmas): Register
8518 the __regio_symbol address space.
8519 * config/pru/pru-protos.h (pru_symref2ioregno): Declaration.
8520 * config/pru/pru.c (pru_symref2ioregno): New helper function.
8521 (pru_legitimate_address_p): Remove.
8522 (pru_addr_space_legitimate_address_p): Use the address space
8524 (pru_nongeneric_pointer_addrspace): New helper function.
8525 (pru_insert_attributes): New function to validate __regio_symbol
8527 (TARGET_INSERT_ATTRIBUTES): New macro.
8528 (TARGET_LEGITIMATE_ADDRESS_P): Remove.
8529 (TARGET_ADDR_SPACE_LEGITIMATE_ADDRESS_P): New macro.
8530 * config/pru/pru.h (enum reg_class): Add REGIO_REGS class.
8531 * config/pru/pru.md (*regio_readsi): New pattern to read I/O
8533 (*regio_nozext_writesi): New pattern to write to I/O registers.
8534 (*regio_zext_write_r30<EQS0:mode>): Ditto.
8535 * doc/extend.texi: Document the new PRU Named Address Space.
8537 2021-09-24 Patrick Palka <ppalka@redhat.com>
8541 * real.c (encode_ieee_double): Avoid unwanted sign extension.
8542 (encode_ieee_quad): Likewise.
8544 2021-09-24 Vladimir Makarov <vmakarov@redhat.com>
8546 PR rtl-optimization/102147
8547 * ira-build.c (ira_conflict_vector_profitable_p): Make
8548 profitability calculation independent of host compiler pointer and
8551 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
8553 * gimple-range-path.cc (path_range_query::path_range_query):
8554 Move debugging header...
8555 (path_range_query::precompute_ranges): ...here.
8556 (path_range_query::internal_range_of_expr): Do not call
8557 range_on_path_entry if NAME is defined in the current block.
8559 2021-09-24 Richard Biener <rguenther@suse.de>
8561 * cfghooks.c (verify_flow_info): Verify unallocated BB and
8562 edge flags are not set.
8564 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
8566 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
8568 (jt_path_registry::register_jump_thread): Call
8569 cancel_invalid_paths.
8570 * tree-ssa-threadupdate.h (class jt_path_registry): Add
8571 cancel_invalid_paths.
8573 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
8575 PR tree-optimization/102400
8576 * tree-ssa-sccvn.c (vn_reference_insert_pieces): Initialize
8577 result_vdef to zero value.
8579 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
8581 PR tree-optimization/102451
8582 * tree-ssa-dse.c (delete_dead_or_redundant_call): Record bb of stmt
8585 2021-09-24 Hongyu Wang <hongyu.wang@intel.com>
8587 * config/i386/sse.md (cond_<insn><mode>): Extend to support
8589 (cond_mul<mode>): Likewise.
8590 (cond_div<mode>): Likewise.
8591 (cond_<code><mode>): Likewise.
8592 (cond_fma<mode>): Likewise.
8593 (cond_fms<mode>): Likewise.
8594 (cond_fnma<mode>): Likewise.
8595 (cond_fnms<mode>): Likewise.
8597 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
8599 PR tree-optimization/102463
8600 * gimple-range-fold.cc (fold_using_range::relation_fold_and_or): If
8601 there is no range-ops handler, don't look for a relation.
8603 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
8605 * gimple-range-cache.cc (ranger_cache::ranger_cache): Take
8606 non-executable_edge flag as parameter.
8607 * gimple-range-cache.h (ranger_cache): Adjust prototype.
8608 * gimple-range-gori.cc (gori_compute::gori_compute): Take
8609 non-executable_edge flag as parameter.
8610 (gori_compute::outgoing_edge_range_p): Check new flag.
8611 * gimple-range-gori.h (gori_compute): Adjust prototype.
8612 * gimple-range.cc (gimple_ranger::gimple_ranger): Create new flag.
8613 (gimple_ranger::range_on_edge): Check new flag.
8614 * gimple-range.h (gimple_ranger::non_executable_edge_flag): New.
8615 * gimple-ssa-evrp.c (rvrp_folder): Pass ranger flag to simplifer.
8616 (hybrid_folder::hybrid_folder): Set ranger non-executable flag value.
8617 (hybrid_folder::fold_stmt): Set flag value in the simplifer.
8618 * vr-values.c (simplify_using_ranges::set_and_propagate_unexecutable):
8619 Use not_executable flag if provided inmstead of EDGE_EXECUTABLE.
8620 (simplify_using_ranges::simplify_switch_using_ranges): Clear
8621 EDGE_EXECUTABLE like it originally did.
8622 (simplify_using_ranges::cleanup_edges_and_switches): Clear any
8623 NON_EXECUTABLE flags.
8624 (simplify_using_ranges::simplify_using_ranges): Adjust.
8625 * vr-values.h (class simplify_using_ranges): Adjust.
8626 (simplify_using_ranges::set_range_query): Add non-executable flag param.
8628 2021-09-23 Bill Schmidt <wschmidt@linux.ibm.com>
8631 * config/rs6000/rs6000-call.c (rs6000_aggregate_candidate): Detect
8632 zero-width bit fields and return indicator.
8633 (rs6000_discover_homogeneous_aggregate): Diagnose when the
8634 presence of a zero-width bit field changes parameter passing in
8637 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
8639 * gimple-range-fold.cc (fold_using_range::range_of_phi):
8640 Remove dominator check.
8642 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
8644 * gimple-range-path.cc (path_range_query::precompute_relations):
8645 Hoist edge calculations before using EDGE_SUCC.
8647 2021-09-23 Jonathan Wakely <jwakely@redhat.com>
8649 * configure.ac: Fix --with-multilib-list description.
8650 * configure: Regenerate.
8652 2021-09-23 Richard Biener <rguenther@suse.de>
8654 PR tree-optimization/102448
8655 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
8656 Clear alignment info copied from DR_PTR_INFO.
8658 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
8660 * config/i386/i386-expand.c (ix86_use_mask_cmp_p): Enable
8662 * config/i386/sse.md (sseintvecmodelower): Add HF vector modes.
8663 (<avx512>_store<mode>_mask): Extend to support HF vector modes.
8664 (vec_cmp<mode><avx512fmaskmodelower>): Likewise.
8665 (vcond_mask_<mode><avx512fmaskmodelower>): Likewise.
8666 (vcond<mode><mode>): New expander.
8667 (vcond<mode><sseintvecmodelower>): Likewise.
8668 (vcond<sseintvecmodelower><mode>): Likewise.
8669 (vcondu<mode><sseintvecmodelower>): Likewise.
8671 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
8673 * config/i386/sse.md (extend<ssePHmodelower><mode>2):
8675 (extendv4hf<mode>2): Likewise.
8676 (extendv2hfv2df2): Likewise.
8677 (trunc<mode><ssePHmodelower>2): Likewise.
8678 (avx512fp16_vcvt<castmode>2ph_<mode>): Rename to ...
8679 (trunc<mode>v4hf2): ... this, and drop constraints.
8680 (avx512fp16_vcvtpd2ph_v2df): Rename to ...
8681 (truncv2dfv2hf2): ... this, and likewise.
8683 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
8685 * config/i386/sse.md (float<floatunssuffix><mode><ssePHmodelower>2):
8687 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>):
8689 (float<floatunssuffix><mode>v4hf2): ... this, and drop constraints.
8690 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Rename to ...
8691 (float<floatunssuffix>v2div2hf2): ... this, and likewise.
8693 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
8695 * config/i386/i386.md (fix<fixunssuffix>_trunchf<mode>2): New expander.
8696 (fixuns_trunchfhi2): Likewise.
8697 (*fixuns_trunchfsi2zext): New define_insn.
8698 * config/i386/sse.md (ssePHmodelower): New mode_attr.
8699 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
8700 New expander for same element vector fix_truncate.
8701 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
8702 Likewise for V4HF to V4SI/V4DI fix_truncate.
8703 (fix<fixunssuffix>_truncv2hfv2di2):
8704 Likeise for V2HF to V2DI fix_truncate.
8706 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
8708 * config/i386/i386.md (<code>hf3): New expander.
8710 2021-09-23 liuhongt <hongtao.liu@intel.com>
8712 * config/i386/sse.md (FMAMODEM): extend to handle FP16.
8713 (VFH_SF_AVX512VL): Extend to handle HFmode.
8714 (VF_SF_AVX512VL): Deleted.
8716 2021-09-23 liuhongt <hongtao.liu@intel.com>
8718 * config/i386/i386.md (rinthf2): New expander.
8719 (nearbyinthf2): New expander.
8721 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
8723 * tree-ssa-dom.c (class dom_jump_threader_simplifier): Rename...
8724 (class dom_jt_state): ...this and provide virtual overrides.
8725 (dom_jt_state::register_equiv): New.
8726 (class dom_jt_simplifier): Rename from
8727 dom_jump_threader_simplifier.
8728 (dom_jump_threader_simplifier::simplify): Rename...
8729 (dom_jt_simplifier::simplify): ...to this.
8730 (pass_dominator::execute): Use dom_jt_simplifier and
8732 * tree-ssa-threadedge.c (jump_threader::jump_threader):
8734 (jt_state::register_equivs_stmt): Abstract out...
8735 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
8737 (jump_threader::thread_around_empty_blocks): Update state.
8738 (jump_threader::thread_through_normal_block): Same.
8739 (jt_state::jt_state): Remove.
8740 (jt_state::push): Remove pass specific bits. Keep block vector
8742 (jt_state::append_path): New.
8743 (jt_state::pop): Remove pass specific bits.
8744 (jt_state::register_equiv): Same.
8745 (jt_state::record_ranges_from_stmt): Same.
8746 (jt_state::register_equivs_on_edge): Same. Rename...
8747 (jt_state::register_equivs_edge): ...to this.
8748 (jt_state::dump): New.
8749 (jt_state::debug): New.
8750 (jump_threader_simplifier::simplify): Remove.
8751 (jt_state::get_path): New.
8752 * tree-ssa-threadedge.h (class jt_simplifier): Make into a base
8753 class. Expose common functionality as virtual methods.
8754 (class jump_threader_simplifier): Same. Rename...
8755 (class jt_simplifier): ...to this.
8756 * tree-vrp.c (class vrp_jump_threader_simplifier): Rename...
8757 (class vrp_jt_simplifier): ...to this. Provide pass specific
8759 (class vrp_jt_state): New.
8760 (vrp_jump_threader_simplifier::simplify): Rename...
8761 (vrp_jt_simplifier::simplify): ...to this. Inline code from
8762 what used to be the base class.
8763 (vrp_jump_threader::vrp_jump_threader): Use vrp_jt_state and
8766 2021-09-22 Tobias Burnus <tobias@codesourcery.com>
8769 * doc/invoke.texi (-Wno-missing-include-dirs.): Document Fortran
8772 2021-09-22 Roger Sayle <roger@nextmovesoftware.com>
8773 Richard Biener <rguenther@suse.de>
8775 * match.pd (negation simplifications): Implement some negation
8776 folding transformations from fold-const.c's fold_negate_expr.
8777 * tree-ssa-sccvn.c (vn_nary_build_or_lookup_1): Add a SIMPLIFY
8778 argument, to control whether the op should be simplified prior
8779 to looking up/assigning a value number.
8780 (vn_nary_build_or_lookup): Update call to vn_nary_build_or_lookup_1.
8781 (vn_nary_simplify): Likewise.
8782 (visit_nary_op): Likewise, but when constructing a NEGATE_EXPR
8783 now call vn_nary_build_or_lookup_1 disabling simplification.
8785 2021-09-22 Jiufu Guo <guojiufu@linux.ibm.com>
8787 PR tree-optimization/102087
8788 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
8789 Update bound/cmp/control for niter.
8791 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
8793 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
8794 Move check for non-empty BB here.
8795 (fur_source::register_outgoing_edges): ...from here.
8797 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
8799 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
8800 Remove call to improve_range_with_equivs.
8801 (path_range_query::improve_range_with_equivs): Remove
8802 * gimple-range-path.h: Remove improve_range_with_equivs.
8804 2021-09-22 dianhong xu <dianhong.xu@intel.com>
8806 * config/i386/avx512fp16intrin.h:
8807 (_mm512_mask_blend_ph): New intrinsic.
8808 (_mm512_permutex2var_ph): Ditto.
8809 (_mm512_permutexvar_ph): Ditto.
8810 * config/i386/avx512fp16vlintrin.h:
8811 (_mm256_mask_blend_ph): New intrinsic.
8812 (_mm256_permutex2var_ph): Ditto.
8813 (_mm256_permutexvar_ph): Ditto.
8814 (_mm_mask_blend_ph): Ditto.
8815 (_mm_permutex2var_ph): Ditto.
8816 (_mm_permutexvar_ph): Ditto.
8818 2021-09-22 dianhong xu <dianhong.xu@intel.com>
8820 * config/i386/avx512fp16intrin.h: Add new intrinsics.
8821 (_mm512_conj_pch): New intrinsic.
8822 (_mm512_mask_conj_pch): Ditto.
8823 (_mm512_maskz_conj_pch): Ditto.
8824 * config/i386/avx512fp16vlintrin.h: Add new intrinsics.
8825 (_mm256_conj_pch): New intrinsic.
8826 (_mm256_mask_conj_pch): Ditto.
8827 (_mm256_maskz_conj_pch): Ditto.
8828 (_mm_conj_pch): Ditto.
8829 (_mm_mask_conj_pch): Ditto.
8830 (_mm_maskz_conj_pch): Ditto.
8832 2021-09-22 dianhong xu <dianhong.xu@intel.com>
8834 * config/i386/avx512fp16intrin.h (_MM512_REDUCE_OP): New macro
8835 (_mm512_reduce_add_ph): New intrinsic.
8836 (_mm512_reduce_mul_ph): Ditto.
8837 (_mm512_reduce_min_ph): Ditto.
8838 (_mm512_reduce_max_ph): Ditto.
8839 * config/i386/avx512fp16vlintrin.h
8840 (_MM256_REDUCE_OP/_MM_REDUCE_OP): New macro.
8841 (_mm256_reduce_add_ph): New intrinsic.
8842 (_mm256_reduce_mul_ph): Ditto.
8843 (_mm256_reduce_min_ph): Ditto.
8844 (_mm256_reduce_max_ph): Ditto.
8845 (_mm_reduce_add_ph): Ditto.
8846 (_mm_reduce_mul_ph): Ditto.
8847 (_mm_reduce_min_ph): Ditto.
8848 (_mm_reduce_max_ph): Ditto.
8850 2021-09-22 dianhong xu <dianhong.xu@intel.com>
8852 * config/i386/avx512fp16intrin.h (__m512h_u, __m256h_u,
8853 __m128h_u): New typedef.
8854 (_mm512_load_ph): New intrinsic.
8855 (_mm256_load_ph): Ditto.
8856 (_mm_load_ph): Ditto.
8857 (_mm512_loadu_ph): Ditto.
8858 (_mm256_loadu_ph): Ditto.
8859 (_mm_loadu_ph): Ditto.
8860 (_mm512_store_ph): Ditto.
8861 (_mm256_store_ph): Ditto.
8862 (_mm_store_ph): Ditto.
8863 (_mm512_storeu_ph): Ditto.
8864 (_mm256_storeu_ph): Ditto.
8865 (_mm_storeu_ph): Ditto.
8866 (_mm512_abs_ph): Ditto.
8867 * config/i386/avx512fp16vlintrin.h
8868 (_mm_abs_ph): Ditto.
8869 (_mm256_abs_ph): Ditto.
8871 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
8873 * config/s390/tpf.md (prologue_tpf, epilogue_tpf): Add cc clobber.
8875 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
8878 * config/s390/s390.c (s390_expand_insv): Emit a normal move if it
8879 is actually a full copy of the source operand into the target.
8880 Don't emit a strict low part move if source and target mode match.
8882 2021-09-22 Jakub Jelinek <jakub@redhat.com>
8884 PR middle-end/102415
8885 * omp-expand.c (expand_omp_single): If region->exit is NULL,
8886 assert region->entry is GIMPLE_OMP_SCOPE region and return.
8888 2021-09-22 Jakub Jelinek <jakub@redhat.com>
8890 * tree.h (OMP_CLAUSE_ALLOCATE_ALIGN): Define.
8891 * tree.c (omp_clause_num_ops): Change number of OMP_CLAUSE_ALLOCATE
8892 arguments from 2 to 3.
8893 * tree-pretty-print.c (dump_omp_clause): Print allocator() around
8894 allocate clause allocator and print align if present.
8895 * omp-low.c (scan_sharing_clauses): Force allocate_map entry even
8896 for omp_default_mem_alloc if align modifier is present. If align
8897 modifier is present, use TREE_LIST to encode both allocator and
8899 (lower_private_allocate, lower_rec_input_clauses, create_task_copyfn):
8900 Handle align modifier on allocator clause if present.
8902 2021-09-22 liuhongt <hongtao.liu@intel.com>
8904 * config/i386/i386.md (define_attr "isa"): Add
8906 (define_attr "enabled"): Correspond fma_or_avx512vl to
8907 TARGET_FMA || TARGET_AVX512VL.
8908 * config/i386/mmx.md (fmav2sf4): Extend to AVX512 fma.
8913 2021-09-22 liuhongt <hongtao.liu@intel.com>
8915 * config/i386/i386.md (cstorehf3): New define_expand.
8917 2021-09-22 liuhongt <hongtao.liu@intel.com>
8919 * config/i386/i386.md (<rounding_insn>hf2): New expander.
8920 (sse4_1_round<mode>2): Extend from MODEF to MODEFH.
8921 * config/i386/sse.md (*sse4_1_round<ssescalarmodesuffix>):
8922 Extend from VF_128 to VFH_128.
8924 2021-09-22 liuhongt <hongtao.liu@intel.com>
8926 * config/i386/i386-features.c (i386-features.c): Handle
8928 * config/i386/i386.md (sqrthf2): New expander.
8929 (*sqrthf2): New define_insn.
8930 * config/i386/sse.md
8931 (*<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
8934 2021-09-22 liuhongt <hongtao.liu@intel.com>
8936 * config/i386/avx512fp16intrin.h (_mm_mask_fcmadd_sch):
8938 (_mm_mask3_fcmadd_sch): Likewise.
8939 (_mm_maskz_fcmadd_sch): Likewise.
8940 (_mm_fcmadd_sch): Likewise.
8941 (_mm_mask_fmadd_sch): Likewise.
8942 (_mm_mask3_fmadd_sch): Likewise.
8943 (_mm_maskz_fmadd_sch): Likewise.
8944 (_mm_fmadd_sch): Likewise.
8945 (_mm_mask_fcmadd_round_sch): Likewise.
8946 (_mm_mask3_fcmadd_round_sch): Likewise.
8947 (_mm_maskz_fcmadd_round_sch): Likewise.
8948 (_mm_fcmadd_round_sch): Likewise.
8949 (_mm_mask_fmadd_round_sch): Likewise.
8950 (_mm_mask3_fmadd_round_sch): Likewise.
8951 (_mm_maskz_fmadd_round_sch): Likewise.
8952 (_mm_fmadd_round_sch): Likewise.
8953 (_mm_fcmul_sch): Likewise.
8954 (_mm_mask_fcmul_sch): Likewise.
8955 (_mm_maskz_fcmul_sch): Likewise.
8956 (_mm_fmul_sch): Likewise.
8957 (_mm_mask_fmul_sch): Likewise.
8958 (_mm_maskz_fmul_sch): Likewise.
8959 (_mm_fcmul_round_sch): Likewise.
8960 (_mm_mask_fcmul_round_sch): Likewise.
8961 (_mm_maskz_fcmul_round_sch): Likewise.
8962 (_mm_fmul_round_sch): Likewise.
8963 (_mm_mask_fmul_round_sch): Likewise.
8964 (_mm_maskz_fmul_round_sch): Likewise.
8965 * config/i386/i386-builtin.def: Add corresponding new builtins.
8966 * config/i386/sse.md
8967 (avx512fp16_fmaddcsh_v8hf_maskz<round_expand_name>): New expander.
8968 (avx512fp16_fcmaddcsh_v8hf_maskz<round_expand_name>): Ditto.
8969 (avx512fp16_fma_<complexopname>sh_v8hf<mask_scalarcz_name><round_scalarcz_name>):
8971 (avx512fp16_<complexopname>sh_v8hf_mask<round_name>): Ditto.
8972 (avx512fp16_<complexopname>sh_v8hf<mask_scalarc_name><round_scalarcz_name>):
8974 * config/i386/subst.md (mask_scalarcz_name): New.
8975 (mask_scalarc_name): Ditto.
8976 (mask_scalarc_operand3): Ditto.
8977 (mask_scalarcz_operand4): Ditto.
8978 (round_scalarcz_name): Ditto.
8979 (round_scalarc_mask_operand3): Ditto.
8980 (round_scalarcz_mask_operand4): Ditto.
8981 (round_scalarc_mask_op3): Ditto.
8982 (round_scalarcz_mask_op4): Ditto.
8983 (round_scalarcz_constraint): Ditto.
8984 (round_scalarcz_nimm_predicate): Ditto.
8985 (mask_scalarcz): Ditto.
8986 (mask_scalarc): Ditto.
8987 (round_scalarcz): Ditto.
8989 2021-09-22 liuhongt <hongtao.liu@intel.com>
8991 * config/i386/avx512fp16intrin.h (_mm512_fcmadd_pch):
8993 (_mm512_mask_fcmadd_pch): Likewise.
8994 (_mm512_mask3_fcmadd_pch): Likewise.
8995 (_mm512_maskz_fcmadd_pch): Likewise.
8996 (_mm512_fmadd_pch): Likewise.
8997 (_mm512_mask_fmadd_pch): Likewise.
8998 (_mm512_mask3_fmadd_pch): Likewise.
8999 (_mm512_maskz_fmadd_pch): Likewise.
9000 (_mm512_fcmadd_round_pch): Likewise.
9001 (_mm512_mask_fcmadd_round_pch): Likewise.
9002 (_mm512_mask3_fcmadd_round_pch): Likewise.
9003 (_mm512_maskz_fcmadd_round_pch): Likewise.
9004 (_mm512_fmadd_round_pch): Likewise.
9005 (_mm512_mask_fmadd_round_pch): Likewise.
9006 (_mm512_mask3_fmadd_round_pch): Likewise.
9007 (_mm512_maskz_fmadd_round_pch): Likewise.
9008 (_mm512_fcmul_pch): Likewise.
9009 (_mm512_mask_fcmul_pch): Likewise.
9010 (_mm512_maskz_fcmul_pch): Likewise.
9011 (_mm512_fmul_pch): Likewise.
9012 (_mm512_mask_fmul_pch): Likewise.
9013 (_mm512_maskz_fmul_pch): Likewise.
9014 (_mm512_fcmul_round_pch): Likewise.
9015 (_mm512_mask_fcmul_round_pch): Likewise.
9016 (_mm512_maskz_fcmul_round_pch): Likewise.
9017 (_mm512_fmul_round_pch): Likewise.
9018 (_mm512_mask_fmul_round_pch): Likewise.
9019 (_mm512_maskz_fmul_round_pch): Likewise.
9020 * config/i386/avx512fp16vlintrin.h (_mm_fmadd_pch):
9022 (_mm_mask_fmadd_pch): Likewise.
9023 (_mm_mask3_fmadd_pch): Likewise.
9024 (_mm_maskz_fmadd_pch): Likewise.
9025 (_mm256_fmadd_pch): Likewise.
9026 (_mm256_mask_fmadd_pch): Likewise.
9027 (_mm256_mask3_fmadd_pch): Likewise.
9028 (_mm256_maskz_fmadd_pch): Likewise.
9029 (_mm_fcmadd_pch): Likewise.
9030 (_mm_mask_fcmadd_pch): Likewise.
9031 (_mm_mask3_fcmadd_pch): Likewise.
9032 (_mm_maskz_fcmadd_pch): Likewise.
9033 (_mm256_fcmadd_pch): Likewise.
9034 (_mm256_mask_fcmadd_pch): Likewise.
9035 (_mm256_mask3_fcmadd_pch): Likewise.
9036 (_mm256_maskz_fcmadd_pch): Likewise.
9037 (_mm_fmul_pch): Likewise.
9038 (_mm_mask_fmul_pch): Likewise.
9039 (_mm_maskz_fmul_pch): Likewise.
9040 (_mm256_fmul_pch): Likewise.
9041 (_mm256_mask_fmul_pch): Likewise.
9042 (_mm256_maskz_fmul_pch): Likewise.
9043 (_mm_fcmul_pch): Likewise.
9044 (_mm_mask_fcmul_pch): Likewise.
9045 (_mm_maskz_fcmul_pch): Likewise.
9046 (_mm256_fcmul_pch): Likewise.
9047 (_mm256_mask_fcmul_pch): Likewise.
9048 (_mm256_maskz_fcmul_pch): Likewise.
9049 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF_V8HF_V8HF,
9050 V8HF_FTYPE_V16HF_V16HF_V16HF, V16HF_FTYPE_V16HF_V16HF_V16HF_UQI,
9051 V32HF_FTYPE_V32HF_V32HF_V32HF_INT,
9052 V32HF_FTYPE_V32HF_V32HF_V32HF_UHI_INT): Add new builtin types.
9053 * config/i386/i386-builtin.def: Add new builtins.
9054 * config/i386/i386-expand.c: Handle new builtin types.
9055 * config/i386/subst.md (SUBST_CV): New.
9056 (maskc_name): Ditto.
9057 (maskc_operand3): Ditto.
9059 (sdc_maskz_name): Ditto.
9060 (sdc_mask_op4): Ditto.
9061 (sdc_mask_op5): Ditto.
9062 (sdc_mask_mode512bit_condition): Ditto.
9064 (round_maskc_operand3): Ditto.
9065 (round_sdc_mask_operand4): Ditto.
9066 (round_maskc_op3): Ditto.
9067 (round_sdc_mask_op4): Ditto.
9068 (round_saeonly_sdc_mask_operand5): Ditto.
9069 * config/i386/sse.md (unspec): Add complex fma unspecs.
9070 (avx512fmaskcmode): New.
9071 (UNSPEC_COMPLEX_F_C_MA): Ditto.
9072 (UNSPEC_COMPLEX_F_C_MUL): Ditto.
9073 (complexopname): Ditto.
9074 (<avx512>_fmaddc_<mode>_maskz<round_expand_name>): New expander.
9075 (<avx512>_fcmaddc_<mode>_maskz<round_expand_name>): Ditto.
9076 (fma_<complexopname>_<mode><sdc_maskz_name><round_name>): New
9078 (<avx512>_<complexopname>_<mode>_mask<round_name>): Ditto.
9079 (<avx512>_<complexopname>_<mode><maskc_name><round_name>): Ditto.
9081 2021-09-22 Kewen Lin <linkw@linux.ibm.com>
9083 * config/rs6000/rs6000.opt (rs6000-density-pct-threshold,
9084 rs6000-density-size-threshold, rs6000-density-penalty,
9085 rs6000-density-load-pct-threshold,
9086 rs6000-density-load-num-threshold): New parameter.
9087 * config/rs6000/rs6000.c (rs6000_density_test): Adjust with
9088 corresponding parameters.
9090 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9092 * gimple-range-path.cc (path_range_query::defined_outside_path):
9094 (path_range_query::range_on_path_entry): New.
9095 (path_range_query::internal_range_of_expr): Resolve unknowns
9097 (path_range_query::improve_range_with_equivs): New.
9098 (path_range_query::ssa_range_in_phi): Resolve unknowns with
9100 * gimple-range-path.h (class path_range_query): Add
9101 defined_outside_path, range_on_path_entry, and
9102 improve_range_with_equivs.
9104 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9106 * gimple-range-path.cc (path_range_query::add_to_imports): New.
9107 (path_range_query::add_copies_to_imports): New.
9108 (path_range_query::precompute_ranges): Call
9109 add_copies_to_imports.
9110 * gimple-range-path.h (class path_range_query): Add prototypes
9111 for add_copies_to_imports and add_to_imports.
9113 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9115 * gimple-range-path.cc (path_range_query::range_defined_in_block):
9116 Remove useless code.
9118 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9120 * gimple-range-fold.h (class fur_source): Make oracle protected.
9121 * gimple-range-path.cc (path_range_query::path_range_query): Add
9122 resolve argument. Initialize oracle.
9123 (path_range_query::~path_range_query): Delete oracle.
9124 (path_range_query::range_of_stmt): Adapt to use relations.
9125 (path_range_query::precompute_ranges): Pre-compute relations.
9126 (class jt_fur_source): New
9127 (jt_fur_source::jt_fur_source): New.
9128 (jt_fur_source::register_relation): New.
9129 (jt_fur_source::query_relation): New.
9130 (path_range_query::precompute_relations): New.
9131 (path_range_query::precompute_phi_relations): New.
9132 * gimple-range-path.h (path_range_query): Add resolve argument.
9133 Add oracle, precompute_relations, precompute_phi_relations.
9134 * tree-ssa-threadbackward.c (back_threader::back_threader): Pass
9135 resolve argument to solver.
9137 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9139 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
9140 Rename postfold_gcond_edges to register_outgoing_edges and
9142 (fold_using_range::postfold_gcond_edges): Rename...
9143 (fur_source::register_outgoing_edges): ...to this.
9144 * gimple-range-fold.h (postfold_gcond_edges): Rename to
9145 register_outgoing_edges and move to fur_source.
9147 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9149 * gimple-range-fold.cc (fold_using_range::range_of_phi): Check
9150 dom_info_available_p.
9152 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9154 * gimple-range-cache.cc (non_null_ref::non_null_ref): Use create
9155 and quick_grow_cleared instead of safe_grow_cleared.
9157 2021-09-21 Thomas Schwinge <thomas@codesourcery.com>
9160 * omp-oacc-neuter-broadcast.cc (oacc_do_neutering): Evaluate
9163 2021-09-21 Richard Earnshaw <rearnsha@arm.com>
9165 * configure.ac: Detect when the assembler supports new-style
9166 architecture extensions.
9167 * common/config/arm/arm-common.c (arm_rewrite_mcpu): Return
9168 the full CPU string if the assembler can grok it.
9169 (arm_rewrite_march): Likewise but for the architecture.
9170 * config.in: Regenerate.
9171 * configure: Regenerate.
9173 2021-09-21 Richard Biener <rguenther@suse.de>
9175 PR tree-optimization/102421
9176 * tree-vect-loop.c (vect_dissolve_slp_only_groups): Copy and
9177 adjust alignment info.
9179 2021-09-21 Kewen Lin <linkw@linux.ibm.com>
9181 * ipa-fnsummary.c (ipa_fn_summary_write): Remove inconsistent
9182 bitfield stream out.
9184 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
9186 * gimple-range-fold.cc (fold_using_range::range_of_phi): Ignore
9187 undefined edges, apply an equivalence if appropriate.
9188 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Return
9189 UNDEFINED if EDGE_EXECUTABLE is not set.
9190 * gimple-range.cc (gimple_ranger::gimple_ranger): Set all edges
9191 as EXECUTABLE upon startup.
9192 (gimple_ranger::range_on_edge): Return UNDEFINED for edges without
9193 EDGE_EXECUTABLE set.
9194 * vr-values.c (set_and_propagate_unexecutable): New.
9195 (simplify_using_ranges::fold_cond): Call set_and_propagate.
9196 (simplify_using_ranges::simplify_switch_using_ranges): Ditto.
9197 * vr-values.h: Add prototype.
9199 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
9201 * value-relation.cc (equiv_oracle::register_initial_def): New.
9202 (equiv_oracle::register_relation): Call register_initial_def.
9203 (equiv_oracle::add_equiv_to_block): New. Split register_relation.
9204 (relation_oracle::register_stmt): Check def block of PHI arguments.
9205 * value-relation.h (equiv_oracle): Add new prototypes.
9207 2021-09-20 Matthias Kretz <m.kretz@gsi.de>
9209 * cppbuiltin.c (define_builtin_macros_for_compilation_flags):
9210 Define __RECIPROCAL_MATH__, __NO_SIGNED_ZEROS__,
9211 __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, and
9212 __ROUNDING_MATH__ according to their corresponding flags.
9213 * doc/cpp.texi: Document __RECIPROCAL_MATH__,
9214 __NO_SIGNED_ZEROS__, __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__,
9215 and __ROUNDING_MATH__.
9217 2021-09-20 Richard Biener <rguenther@suse.de>
9219 * tree-vect-stmts.c (vectorizable_load): Use the vectype
9222 2021-09-20 Richard Biener <rguenther@suse.de>
9224 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
9225 Do not compute alignment of the vectorized access here.
9227 2021-09-20 Richard Biener <rguenther@suse.de>
9229 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
9230 Store -1 for runtime alias peeling iterations.
9232 2021-09-20 Richard Biener <rguenther@suse.de>
9234 * config.gcc: Obsolete hppa[12]*-*-hpux10* and hppa[12]*-*-hpux11*.
9236 2021-09-20 Thomas Schwinge <thomas@codesourcery.com>
9238 * input.c (string_concat_db::record_string_concatenation)
9239 (string_concat_db::get_string_concatenation): Skip for
9240 'RESERVED_LOCATION_P'.
9242 2021-09-20 Richard Biener <rguenther@suse.de>
9244 PR tree-optimization/65206
9245 * tree-data-ref.h (struct data_reference): Add alt_indices,
9247 * tree-data-ref.c (free_data_ref): Release alt_indices.
9248 (dr_analyze_indices): Work on struct indices and get DR_REF as tree.
9249 (create_data_ref): Adjust.
9250 (initialize_data_dependence_relation): Split into head
9251 and tail. When the base objects fail to match up try
9252 again with pointer-based analysis of indices.
9253 * tree-vectorizer.c (vec_info_shared::check_datarefs): Do
9254 not compare the lazily computed alternate set of indices.
9256 2021-09-20 Iain Sandoe <iain@sandoe.co.uk>
9258 * gcc.c: Test for execute OK when we find the
9259 programs for assembler linker and dsymutil and those
9260 were specified at configure-time.
9262 2021-09-19 Martin Sebor <msebor@redhat.com>
9264 PR middle-end/102403
9265 * gimple-predicate-analysis.cc (predicate::init_from_control_deps):
9266 Correct a function pre/postcondition.
9268 2021-09-19 Martin Sebor <msebor@redhat.com>
9270 PR middle-end/102243
9271 * tree-ssa-strlen.c (get_range): Handle null cfun.
9273 2021-09-19 Iain Sandoe <iain@sandoe.co.uk>
9275 * config/darwin.h (LINK_COMMAND_SPEC_A): Use Darwin10
9276 unwinder shim as a convenience library.
9278 2021-09-19 Andrew Pinski <apinski@marvell.com>
9280 * doc/install.texi: Add note about
9281 binutils 2.35 is required for LTO usage.
9283 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
9285 * tree-ssa-threadbackward.c
9286 (back_threader_registry::register_path): Use push_edge.
9287 * tree-ssa-threadedge.c
9288 (jump_threader::thread_around_empty_blocks): Same.
9289 (jump_threader::thread_through_normal_block): Same.
9290 (jump_threader::thread_across_edge): Same. Also, use auto_bitmap.
9292 * tree-ssa-threadupdate.c
9293 (jt_path_registry::allocate_thread_edge): Remove.
9294 (jt_path_registry::push_edge): New.
9295 (dump_jump_thread_path): Make static.
9296 * tree-ssa-threadupdate.h (allocate_thread_edge): Remove.
9299 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
9301 * gimple-range-path.cc (path_range_query::path_range_query): Add
9303 (path_range_query::dump): Remove extern declaration of dump_ranger.
9304 * gimple-range-trace.cc (dump_ranger): Add DEBUG_FUNCTION marker.
9305 * gimple-range-trace.h (dump_ranger): Add prototype.
9307 2021-09-19 John Ericson <git@JohnEricson.me>
9309 * gcc.c (find_a_program): New function, factored out of...
9310 (find_a_file): Here.
9311 (execute): Use find_a_program when looking for programs rather
9314 2021-09-19 Matwey V. Kornilov <matwey.kornilov@gmail.com>
9316 * config/avr/avr-mcus.def: Add atmega324pb.
9317 * doc/avr-mmcu.texi: Corresponding changes.
9319 2021-09-19 Roger Sayle <roger@nextmovesoftware.com>
9322 * match.pd (cmp @0 REAL_CST@1): When @0 is also REAL_CST, apply
9323 the same transformations as to @1. For comparisons against NaN,
9324 don't check HONOR_SNANS but confirm that neither operand is a
9327 2021-09-19 Benjamin Peterson <benjamin@locrian.net>
9329 * attribs.c (make_unique_name): Delete.
9330 * attribs.h (make_unique_name): Delete.
9332 2021-09-19 Andrew Pinski <apinski@marvell.com>
9334 * lra-constraints.c (check_and_process_move): Assert
9335 that dclass and sclass are greater than or equal to NO_REGS.
9337 2021-09-18 Jakub Jelinek <jakub@redhat.com>
9339 * tree.h (OMP_CLAUSE_ORDER_UNCONSTRAINED): Define.
9340 * tree-pretty-print.c (dump_omp_clause): Print unconstrained:
9341 for OMP_CLAUSE_ORDER_UNCONSTRAINED.
9343 2021-09-18 liuhongt <hongtao.liu@intel.com>
9345 * config/i386/i386-features.c (remove_partial_avx_dependency):
9346 Restrict TARGET_USE_VECTOR_FP_CONVERTS and
9347 TARGET_USE_VECTOR_CONVERTS to conversion instructions only.
9349 2021-09-18 Jakub Jelinek <jakub@redhat.com>
9351 * gimplify.c (omp_default_clause): For C/C++ default({,first}private),
9352 if file/namespace scope variable doesn't have predetermined sharing,
9353 treat it as if there was default(none).
9355 2021-09-18 liuhongt <hongtao.liu@intel.com>
9357 * config/i386/avx512fp16intrin.h (_mm_fmadd_sh):
9359 (_mm_mask_fmadd_sh): Likewise.
9360 (_mm_mask3_fmadd_sh): Likewise.
9361 (_mm_maskz_fmadd_sh): Likewise.
9362 (_mm_fmadd_round_sh): Likewise.
9363 (_mm_mask_fmadd_round_sh): Likewise.
9364 (_mm_mask3_fmadd_round_sh): Likewise.
9365 (_mm_maskz_fmadd_round_sh): Likewise.
9366 (_mm_fnmadd_sh): Likewise.
9367 (_mm_mask_fnmadd_sh): Likewise.
9368 (_mm_mask3_fnmadd_sh): Likewise.
9369 (_mm_maskz_fnmadd_sh): Likewise.
9370 (_mm_fnmadd_round_sh): Likewise.
9371 (_mm_mask_fnmadd_round_sh): Likewise.
9372 (_mm_mask3_fnmadd_round_sh): Likewise.
9373 (_mm_maskz_fnmadd_round_sh): Likewise.
9374 (_mm_fmsub_sh): Likewise.
9375 (_mm_mask_fmsub_sh): Likewise.
9376 (_mm_mask3_fmsub_sh): Likewise.
9377 (_mm_maskz_fmsub_sh): Likewise.
9378 (_mm_fmsub_round_sh): Likewise.
9379 (_mm_mask_fmsub_round_sh): Likewise.
9380 (_mm_mask3_fmsub_round_sh): Likewise.
9381 (_mm_maskz_fmsub_round_sh): Likewise.
9382 (_mm_fnmsub_sh): Likewise.
9383 (_mm_mask_fnmsub_sh): Likewise.
9384 (_mm_mask3_fnmsub_sh): Likewise.
9385 (_mm_maskz_fnmsub_sh): Likewise.
9386 (_mm_fnmsub_round_sh): Likewise.
9387 (_mm_mask_fnmsub_round_sh): Likewise.
9388 (_mm_mask3_fnmsub_round_sh): Likewise.
9389 (_mm_maskz_fnmsub_round_sh): Likewise.
9390 * config/i386/i386-builtin-types.def
9391 (V8HF_FTYPE_V8HF_V8HF_V8HF_UQI_INT): New builtin type.
9392 * config/i386/i386-builtin.def: Add new builtins.
9393 * config/i386/i386-expand.c: Handle new builtin type.
9394 * config/i386/sse.md (fmai_vmfmadd_<mode><round_name>):
9395 Ajdust to support FP16.
9396 (fmai_vmfmsub_<mode><round_name>): Ditto.
9397 (fmai_vmfnmadd_<mode><round_name>): Ditto.
9398 (fmai_vmfnmsub_<mode><round_name>): Ditto.
9399 (*fmai_fmadd_<mode>): Ditto.
9400 (*fmai_fmsub_<mode>): Ditto.
9401 (*fmai_fnmadd_<mode><round_name>): Ditto.
9402 (*fmai_fnmsub_<mode><round_name>): Ditto.
9403 (avx512f_vmfmadd_<mode>_mask<round_name>): Ditto.
9404 (avx512f_vmfmadd_<mode>_mask3<round_name>): Ditto.
9405 (avx512f_vmfmadd_<mode>_maskz<round_expand_name>): Ditto.
9406 (avx512f_vmfmadd_<mode>_maskz_1<round_name>): Ditto.
9407 (*avx512f_vmfmsub_<mode>_mask<round_name>): Ditto.
9408 (avx512f_vmfmsub_<mode>_mask3<round_name>): Ditto.
9409 (*avx512f_vmfmsub_<mode>_maskz_1<round_name>): Ditto.
9410 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
9411 (*avx512f_vmfnmsub_<mode>_mask3<round_name>): Ditto.
9412 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
9413 (*avx512f_vmfnmadd_<mode>_mask<round_name>): Renamed to ...
9414 (avx512f_vmfnmadd_<mode>_mask<round_name>) ... this, and
9415 adjust to support FP16.
9416 (avx512f_vmfnmadd_<mode>_mask3<round_name>): Ditto.
9417 (avx512f_vmfnmadd_<mode>_maskz_1<round_name>): Ditto.
9418 (avx512f_vmfnmadd_<mode>_maskz<round_expand_name>): New
9421 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
9423 * config/i386/sse.md (avx512fmaskmodelower): Extend to support
9425 (maskload<mode><avx512fmaskmodelower>): Ditto.
9426 (maskstore<mode><avx512fmaskmodelower>): Ditto.
9428 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
9430 * config/i386/i386-expand.c (ix86_expand_fp_absneg_operator):
9432 (ix86_expand_copysign): Ditto.
9433 (ix86_expand_xorsign): Ditto.
9434 * config/i386/i386.c (ix86_build_const_vector): Handle HF vector
9436 (ix86_build_signbit_mask): Ditto.
9437 (ix86_can_change_mode_class): Ditto.
9438 * config/i386/i386.md
9439 (SSEMODEF): Add HFmode.
9440 (ssevecmodef): Ditto.
9441 (<code>hf2): New define_expand.
9442 (*<code>hf2_1): New define_insn_and_split.
9443 (copysign<mode>): Extend to support HFmode under AVX512FP16.
9444 (xorsign<mode>): Ditto.
9445 * config/i386/sse.md (VFB): New mode iterator.
9446 (VFB_128_256): Ditto.
9448 (sseintvecmode2): Support HF vector mode.
9449 (<code><mode>2): Use new mode iterator.
9450 (*<code><mode>2): Ditto.
9451 (copysign<mode>3): Ditto.
9452 (xorsign<mode>3): Ditto.
9453 (<code><mode>3<mask_name>): Ditto.
9454 (<code><mode>3<mask_name>): Ditto.
9455 (<sse>_andnot<mode>3<mask_name>): Adjust for HF vector mode.
9456 (<sse>_andnot<mode>3<mask_name>): Ditto.
9457 (*<code><mode>3<mask_name>): Ditto.
9458 (*<code><mode>3<mask_name>): Ditto.
9460 2021-09-18 liuhongt <hongtao.liu@intel.com>
9462 * config/i386/avx512fp16intrin.h (_mm512_mask_fmadd_ph):
9464 (_mm512_mask3_fmadd_ph): Likewise.
9465 (_mm512_maskz_fmadd_ph): Likewise.
9466 (_mm512_fmadd_round_ph): Likewise.
9467 (_mm512_mask_fmadd_round_ph): Likewise.
9468 (_mm512_mask3_fmadd_round_ph): Likewise.
9469 (_mm512_maskz_fmadd_round_ph): Likewise.
9470 (_mm512_fnmadd_ph): Likewise.
9471 (_mm512_mask_fnmadd_ph): Likewise.
9472 (_mm512_mask3_fnmadd_ph): Likewise.
9473 (_mm512_maskz_fnmadd_ph): Likewise.
9474 (_mm512_fnmadd_round_ph): Likewise.
9475 (_mm512_mask_fnmadd_round_ph): Likewise.
9476 (_mm512_mask3_fnmadd_round_ph): Likewise.
9477 (_mm512_maskz_fnmadd_round_ph): Likewise.
9478 (_mm512_fmsub_ph): Likewise.
9479 (_mm512_mask_fmsub_ph): Likewise.
9480 (_mm512_mask3_fmsub_ph): Likewise.
9481 (_mm512_maskz_fmsub_ph): Likewise.
9482 (_mm512_fmsub_round_ph): Likewise.
9483 (_mm512_mask_fmsub_round_ph): Likewise.
9484 (_mm512_mask3_fmsub_round_ph): Likewise.
9485 (_mm512_maskz_fmsub_round_ph): Likewise.
9486 (_mm512_fnmsub_ph): Likewise.
9487 (_mm512_mask_fnmsub_ph): Likewise.
9488 (_mm512_mask3_fnmsub_ph): Likewise.
9489 (_mm512_maskz_fnmsub_ph): Likewise.
9490 (_mm512_fnmsub_round_ph): Likewise.
9491 (_mm512_mask_fnmsub_round_ph): Likewise.
9492 (_mm512_mask3_fnmsub_round_ph): Likewise.
9493 (_mm512_maskz_fnmsub_round_ph): Likewise.
9494 * config/i386/avx512fp16vlintrin.h (_mm256_fmadd_ph):
9496 (_mm256_mask_fmadd_ph): Likewise.
9497 (_mm256_mask3_fmadd_ph): Likewise.
9498 (_mm256_maskz_fmadd_ph): Likewise.
9499 (_mm_fmadd_ph): Likewise.
9500 (_mm_mask_fmadd_ph): Likewise.
9501 (_mm_mask3_fmadd_ph): Likewise.
9502 (_mm_maskz_fmadd_ph): Likewise.
9503 (_mm256_fnmadd_ph): Likewise.
9504 (_mm256_mask_fnmadd_ph): Likewise.
9505 (_mm256_mask3_fnmadd_ph): Likewise.
9506 (_mm256_maskz_fnmadd_ph): Likewise.
9507 (_mm_fnmadd_ph): Likewise.
9508 (_mm_mask_fnmadd_ph): Likewise.
9509 (_mm_mask3_fnmadd_ph): Likewise.
9510 (_mm_maskz_fnmadd_ph): Likewise.
9511 (_mm256_fmsub_ph): Likewise.
9512 (_mm256_mask_fmsub_ph): Likewise.
9513 (_mm256_mask3_fmsub_ph): Likewise.
9514 (_mm256_maskz_fmsub_ph): Likewise.
9515 (_mm_fmsub_ph): Likewise.
9516 (_mm_mask_fmsub_ph): Likewise.
9517 (_mm_mask3_fmsub_ph): Likewise.
9518 (_mm_maskz_fmsub_ph): Likewise.
9519 (_mm256_fnmsub_ph): Likewise.
9520 (_mm256_mask_fnmsub_ph): Likewise.
9521 (_mm256_mask3_fnmsub_ph): Likewise.
9522 (_mm256_maskz_fnmsub_ph): Likewise.
9523 (_mm_fnmsub_ph): Likewise.
9524 (_mm_mask_fnmsub_ph): Likewise.
9525 (_mm_mask3_fnmsub_ph): Likewise.
9526 (_mm_maskz_fnmsub_ph): Likewise.
9527 * config/i386/i386-builtin.def: Add corresponding new builtins.
9528 * config/i386/sse.md
9529 (<avx512>_fmadd_<mode>_maskz<round_expand_name>): Adjust to
9530 support HF vector modes.
9531 (<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name><round_name>):
9533 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
9534 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
9535 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
9536 (<avx512>_fmadd_<mode>_mask<round_name>): Ditto.
9537 (<avx512>_fmadd_<mode>_mask3<round_name>): Ditto.
9538 (<avx512>_fmsub_<mode>_maskz<round_expand_name>): Ditto.
9539 (<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name><round_name>):
9541 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
9542 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
9543 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
9544 (<avx512>_fmsub_<mode>_mask<round_name>): Ditto.
9545 (<avx512>_fmsub_<mode>_mask3<round_name>): Ditto.
9546 (<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name><round_name>):
9548 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
9549 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
9550 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
9551 (<avx512>_fnmadd_<mode>_mask<round_name>): Ditto.
9552 (<avx512>_fnmadd_<mode>_mask3<round_name>): Ditto.
9553 (<avx512>_fnmsub_<mode>_maskz<round_expand_name>): Ditto.
9554 (<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name><round_name>):
9556 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
9557 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
9558 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
9559 (<avx512>_fnmsub_<mode>_mask<round_name>): Ditto.
9560 (<avx512>_fnmsub_<mode>_mask3<round_name>): Ditto.
9562 2021-09-18 liuhongt <hongtao.liu@intel.com>
9564 * config/i386/avx512fp16intrin.h (_mm512_fmaddsub_ph):
9566 (_mm512_mask_fmaddsub_ph): Likewise.
9567 (_mm512_mask3_fmaddsub_ph): Likewise.
9568 (_mm512_maskz_fmaddsub_ph): Likewise.
9569 (_mm512_fmaddsub_round_ph): Likewise.
9570 (_mm512_mask_fmaddsub_round_ph): Likewise.
9571 (_mm512_mask3_fmaddsub_round_ph): Likewise.
9572 (_mm512_maskz_fmaddsub_round_ph): Likewise.
9573 (_mm512_mask_fmsubadd_ph): Likewise.
9574 (_mm512_mask3_fmsubadd_ph): Likewise.
9575 (_mm512_maskz_fmsubadd_ph): Likewise.
9576 (_mm512_fmsubadd_round_ph): Likewise.
9577 (_mm512_mask_fmsubadd_round_ph): Likewise.
9578 (_mm512_mask3_fmsubadd_round_ph): Likewise.
9579 (_mm512_maskz_fmsubadd_round_ph): Likewise.
9580 * config/i386/avx512fp16vlintrin.h (_mm256_fmaddsub_ph):
9582 (_mm256_mask_fmaddsub_ph): Likewise.
9583 (_mm256_mask3_fmaddsub_ph): Likewise.
9584 (_mm256_maskz_fmaddsub_ph): Likewise.
9585 (_mm_fmaddsub_ph): Likewise.
9586 (_mm_mask_fmaddsub_ph): Likewise.
9587 (_mm_mask3_fmaddsub_ph): Likewise.
9588 (_mm_maskz_fmaddsub_ph): Likewise.
9589 (_mm256_fmsubadd_ph): Likewise.
9590 (_mm256_mask_fmsubadd_ph): Likewise.
9591 (_mm256_mask3_fmsubadd_ph): Likewise.
9592 (_mm256_maskz_fmsubadd_ph): Likewise.
9593 (_mm_fmsubadd_ph): Likewise.
9594 (_mm_mask_fmsubadd_ph): Likewise.
9595 (_mm_mask3_fmsubadd_ph): Likewise.
9596 (_mm_maskz_fmsubadd_ph): Likewise.
9597 * config/i386/i386-builtin.def: Add corresponding new builtins.
9598 * config/i386/sse.md (VFH_SF_AVX512VL): New mode iterator.
9599 * (<avx512>_fmsubadd_<mode>_maskz<round_expand_name>): New expander.
9600 * (<avx512>_fmaddsub_<mode>_maskz<round_expand_name>): Use
9602 * (<sd_mask_codefor>fma_fmaddsub_<mode><sd_maskz_name><round_name>):
9604 * (<avx512>_fmaddsub_<mode>_mask<round_name>): Ditto.
9605 * (<avx512>_fmaddsub_<mode>_mask3<round_name>): Ditto.
9606 * (<sd_mask_codefor>fma_fmsubadd_<mode><sd_maskz_name><round_name>):
9608 * (<avx512>_fmsubadd_<mode>_mask<round_name>): Ditto.
9609 * (<avx512>_fmsubadd_<mode>_mask3<round_name>): Ditto.
9611 2021-09-18 liuhongt <hongtao.liu@intel.com>
9614 * config/i386/i386.c (ix86_print_operand): Handle
9615 V8HF/V16HF/V32HFmode.
9616 * config/i386/i386.h (VALID_BCST_MODE_P): Add HFmode.
9617 * config/i386/sse.md (avx512bcst): Remove.
9619 2021-09-17 Martin Sebor <msebor@redhat.com>
9621 * Makefile.in (OBJS): Add gimple-predicate-analysis.o.
9622 * tree-ssa-uninit.c (max_phi_args): Move to gimple-predicate-analysis.
9623 (MASK_SET_BIT, MASK_TEST_BIT, MASK_EMPTY): Same.
9624 (check_defs): Add comment.
9625 (can_skip_redundant_opnd): Update comment.
9626 (compute_uninit_opnds_pos): Adjust to namespace change.
9627 (find_pdom): Move to gimple-predicate-analysis.cc.
9629 (struct uninit_undef_val_t): New.
9630 (is_non_loop_exit_postdominating): Move to gimple-predicate-analysis.cc.
9631 (find_control_equiv_block): Same.
9632 (MAX_NUM_CHAINS, MAX_CHAIN_LEN, MAX_POSTDOM_CHECK): Same.
9633 (MAX_SWITCH_CASES): Same.
9634 (compute_control_dep_chain): Same.
9635 (find_uninit_use): Use predicate analyzer.
9636 (struct pred_info): Move to gimple-predicate-analysis.
9637 (convert_control_dep_chain_into_preds): Same.
9638 (find_predicates): Same.
9639 (collect_phi_def_edges): Same.
9640 (warn_uninitialized_phi): Use predicate analyzer.
9641 (find_def_preds): Move to gimple-predicate-analysis.
9642 (dump_pred_info): Same.
9643 (dump_pred_chain): Same.
9644 (dump_predicates): Same.
9645 (destroy_predicate_vecs): Remove.
9646 (execute_late_warn_uninitialized): New.
9647 (get_cmp_code): Move to gimple-predicate-analysis.
9648 (is_value_included_in): Same.
9649 (value_sat_pred_p): Same.
9650 (find_matching_predicate_in_rest_chains): Same.
9651 (is_use_properly_guarded): Same.
9652 (prune_uninit_phi_opnds): Same.
9653 (find_var_cmp_const): Same.
9654 (use_pred_not_overlap_with_undef_path_pred): Same.
9655 (pred_equal_p): Same.
9656 (is_neq_relop_p): Same.
9657 (is_neq_zero_form_p): Same.
9658 (pred_expr_equal_p): Same.
9659 (is_pred_expr_subset_of): Same.
9660 (is_pred_chain_subset_of): Same.
9661 (is_included_in): Same.
9662 (is_superset_of): Same.
9664 (simplify_pred): Same.
9665 (simplify_preds_2): Same.
9666 (simplify_preds_3): Same.
9667 (simplify_preds_4): Same.
9668 (simplify_preds): Same.
9670 (push_to_worklist): Same.
9671 (get_pred_info_from_cmp): Same.
9672 (is_degenerated_phi): Same.
9673 (normalize_one_pred_1): Same.
9674 (normalize_one_pred): Same.
9675 (normalize_one_pred_chain): Same.
9676 (normalize_preds): Same.
9677 (can_one_predicate_be_invalidated_p): Same.
9678 (can_chain_union_be_invalidated_p): Same.
9679 (uninit_uses_cannot_happen): Same.
9680 (pass_late_warn_uninitialized::execute): Define.
9681 * gimple-predicate-analysis.cc: New file.
9682 * gimple-predicate-analysis.h: New file.
9684 2021-09-17 Julian Brown <julian@codesourcery.com>
9686 * config/gcn/gcn.c (gimple.h): Include.
9687 (gcn_fork_join): Emit barrier for worker-level joins.
9688 * omp-oacc-neuter-broadcast.cc (find_local_vars_to_propagate): Add
9689 writes_gang_private bitmap parameter. Set bit for blocks
9690 containing gang-private variable writes.
9691 (worker_single_simple): Don't emit barrier after predicated block.
9692 (worker_single_copy): Don't emit barrier if we're not broadcasting
9693 anything and the block contains no gang-private writes.
9694 (neuter_worker_single): Don't predicate blocks that only contain
9695 NOPs or internal marker functions. Pass has_gang_private_write
9696 argument to worker_single_copy.
9697 (oacc_do_neutering): Add writes_gang_private bitmap handling.
9699 2021-09-17 Julian Brown <julian@codesourcery.com>
9701 * config/gcn/gcn-protos.h
9702 (gcn_goacc_create_worker_broadcast_record): Update prototype.
9703 * config/gcn/gcn-tree.c (gcn_goacc_get_worker_red_decl): Use
9704 preallocated block of LDS memory. Do not cache/share decls for
9705 reduction temporaries between invocations.
9706 (gcn_goacc_reduction_teardown): Unshare VAR on second use.
9707 (gcn_goacc_create_worker_broadcast_record): Add OFFSET parameter
9708 and return temporary LDS space at that offset. Return pointer in
9710 * config/gcn/gcn.c (acc_lds_size, gang_private_hwm, lds_allocs):
9712 (ACC_LDS_SIZE): Define as acc_lds_size.
9713 (gcn_init_machine_status): Don't initialise lds_allocated,
9714 lds_allocs, reduc_decls fields of machine function struct.
9715 (gcn_option_override): Handle default size for gang-private
9716 variables and -mgang-private-size option.
9717 (gcn_expand_prologue): Use LDS_SIZE instead of LDS_SIZE-1 when
9718 initialising M0_REG.
9719 (gcn_shared_mem_layout): New function.
9720 (gcn_print_lds_decl): Update comment. Use global lds_allocs map and
9721 gang_private_hwm variable.
9722 (TARGET_GOACC_SHARED_MEM_LAYOUT): Define target hook.
9723 * config/gcn/gcn.h (machine_function): Remove lds_allocated,
9724 lds_allocs, reduc_decls. Add reduction_base, reduction_limit.
9725 * config/gcn/gcn.opt (gang_private_size_opt): New global.
9726 (mgang-private-size=): New option.
9727 * doc/tm.texi.in (TARGET_GOACC_SHARED_MEM_LAYOUT): Place
9729 * doc/tm.texi: Regenerate.
9730 * omp-oacc-neuter-broadcast.cc (targhooks.h, diagnostic-core.h):
9732 (build_sender_ref): Handle sender_decl being pointer.
9733 (worker_single_copy): Add PLACEMENT and ISOLATE_BROADCASTS
9734 parameters. Pass placement argument to
9735 create_worker_broadcast_record hook invocations. Handle
9736 sender_decl being pointer and isolate_broadcasts inserting extra
9738 (blk_offset_map_t): Add typedef.
9739 (neuter_worker_single): Add BLK_OFFSET_MAP parameter. Pass
9740 preallocated range to worker_single_copy call.
9741 (dfs_broadcast_reachable_1): New function.
9742 (idx_decl_pair_t, used_range_vec_t): New typedefs.
9743 (sort_size_descending): New function.
9744 (addr_range): New class.
9745 (splay_tree_compare_addr_range, splay_tree_free_key)
9746 (first_fit_range, merge_ranges_1, merge_ranges): New functions.
9747 (execute_omp_oacc_neuter_broadcast): Rename to...
9748 (oacc_do_neutering): ... this. Add BOUNDS_LO, BOUNDS_HI
9749 parameters. Arrange layout of shared memory for broadcast
9751 (execute_omp_oacc_neuter_broadcast): New function.
9752 (pass_omp_oacc_neuter_broadcast::gate): Remove num_workers==1
9753 handling from here. Enable pass for all OpenACC routines in order
9754 to call shared memory-layout hook.
9755 * target.def (create_worker_broadcast_record): Add OFFSET
9757 (shared_mem_layout): New hook.
9759 2021-09-17 Julian Brown <julian@codesourcery.com>
9760 Thomas Schwinge <thomas@codesourcery.com>
9762 * omp-oacc-neuter-broadcast.cc
9763 (pass_omp_oacc_neuter_broadcast::gate): Disable if num_workers is
9765 (execute_omp_oacc_neuter_broadcast): Adjust.
9767 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
9769 * value-relation.cc (class equiv_chain): Move to header file.
9770 (path_oracle::path_oracle): New.
9771 (path_oracle::~path_oracle): New.
9772 (path_oracle::register_relation): New.
9773 (path_oracle::query_relation): New.
9774 (path_oracle::reset_path): New.
9775 (path_oracle::dump): New.
9776 * value-relation.h (class equiv_chain): Move to here.
9777 (class path_oracle): New.
9779 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
9781 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a DOM
9783 * gimple-range-fold.cc (fur_depend::register_relation): Use
9784 register_stmt/edge routines.
9785 * value-relation.cc (equiv_chain::find): Relocate from equiv_oracle.
9786 (equiv_oracle::equiv_oracle): Create self equivalence cache.
9787 (equiv_oracle::~equiv_oracle): Release same.
9788 (equiv_oracle::equiv_set): Return entry from self equiv cache if there
9789 are no equivalences.
9790 (equiv_oracle::find_equiv_block): Move list find to equiv_chain.
9791 (equiv_oracle::register_relation): Rename from register_equiv.
9792 (relation_chain_head::find_relation): Relocate from dom_oracle.
9793 (relation_oracle::register_stmt): New.
9794 (relation_oracle::register_edge): New.
9795 (dom_oracle::*): Rename from relation_oracle.
9796 (dom_oracle::register_relation): Adjust to call equiv_oracle.
9797 (dom_oracle::set_one_relation): Split from register_relation.
9798 (dom_oracle::register_transitives): Consolidate 2 methods.
9799 (dom_oracle::find_relation_block): Move core to relation_chain.
9800 (dom_oracle::query_relation): Rename from find_relation_dom and adjust.
9801 * value-relation.h (class relation_oracle): New pure virtual base.
9802 (class equiv_oracle): Inherit from relation_oracle and adjust.
9803 (class dom_oracle): Rename from old relation_oracle and adjust.
9805 2021-09-17 Martin Sebor <msebor@redhat.com>
9807 PR middle-end/102200
9808 * pointer-query.cc (access_ref::inform_access): Handle MIN/MAX_EXPR.
9809 (handle_min_max_size): Change argument. Store original SSA_NAME for
9810 operands to potentially distinct (sub)objects.
9811 (compute_objsize_r): Adjust call to the above.
9813 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
9815 * config/rs6000/rs6000.c (rs6000-builtins.h): New include.
9816 (rs6000_new_builtin_vectorized_function): New function.
9817 (rs6000_new_builtin_md_vectorized_function): Likewise.
9818 (rs6000_builtin_vectorized_function): Call
9819 rs6000_new_builtin_vectorized_function.
9820 (rs6000_builtin_md_vectorized_function): Call
9821 rs6000_new_builtin_md_vectorized_function.
9823 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
9825 * config/rs6000/rs6000-builtin-new.def (ASSEMBLE_ACC): Add mmaint flag.
9826 (ASSEMBLE_PAIR): Likewise.
9827 (BUILD_ACC): Likewise.
9828 (DISASSEMBLE_ACC): Likewise.
9829 (DISASSEMBLE_PAIR): Likewise.
9830 (PMXVBF16GER2): Likewise.
9831 (PMXVBF16GER2NN): Likewise.
9832 (PMXVBF16GER2NP): Likewise.
9833 (PMXVBF16GER2PN): Likewise.
9834 (PMXVBF16GER2PP): Likewise.
9835 (PMXVF16GER2): Likewise.
9836 (PMXVF16GER2NN): Likewise.
9837 (PMXVF16GER2NP): Likewise.
9838 (PMXVF16GER2PN): Likewise.
9839 (PMXVF16GER2PP): Likewise.
9840 (PMXVF32GER): Likewise.
9841 (PMXVF32GERNN): Likewise.
9842 (PMXVF32GERNP): Likewise.
9843 (PMXVF32GERPN): Likewise.
9844 (PMXVF32GERPP): Likewise.
9845 (PMXVF64GER): Likewise.
9846 (PMXVF64GERNN): Likewise.
9847 (PMXVF64GERNP): Likewise.
9848 (PMXVF64GERPN): Likewise.
9849 (PMXVF64GERPP): Likewise.
9850 (PMXVI16GER2): Likewise.
9851 (PMXVI16GER2PP): Likewise.
9852 (PMXVI16GER2S): Likewise.
9853 (PMXVI16GER2SPP): Likewise.
9854 (PMXVI4GER8): Likewise.
9855 (PMXVI4GER8PP): Likewise.
9856 (PMXVI8GER4): Likewise.
9857 (PMXVI8GER4PP): Likewise.
9858 (PMXVI8GER4SPP): Likewise.
9859 (XVBF16GER2): Likewise.
9860 (XVBF16GER2NN): Likewise.
9861 (XVBF16GER2NP): Likewise.
9862 (XVBF16GER2PN): Likewise.
9863 (XVBF16GER2PP): Likewise.
9864 (XVF16GER2): Likewise.
9865 (XVF16GER2NN): Likewise.
9866 (XVF16GER2NP): Likewise.
9867 (XVF16GER2PN): Likewise.
9868 (XVF16GER2PP): Likewise.
9869 (XVF32GER): Likewise.
9870 (XVF32GERNN): Likewise.
9871 (XVF32GERNP): Likewise.
9872 (XVF32GERPN): Likewise.
9873 (XVF32GERPP): Likewise.
9874 (XVF64GER): Likewise.
9875 (XVF64GERNN): Likewise.
9876 (XVF64GERNP): Likewise.
9877 (XVF64GERPN): Likewise.
9878 (XVF64GERPP): Likewise.
9879 (XVI16GER2): Likewise.
9880 (XVI16GER2PP): Likewise.
9881 (XVI16GER2S): Likewise.
9882 (XVI16GER2SPP): Likewise.
9883 (XVI4GER8): Likewise.
9884 (XVI4GER8PP): Likewise.
9885 (XVI8GER4): Likewise.
9886 (XVI8GER4PP): Likewise.
9887 (XVI8GER4SPP): Likewise.
9888 (XXMFACC): Likewise.
9889 (XXMTACC): Likewise.
9890 (XXSETACCZ): Likewise.
9891 (ASSEMBLE_PAIR_V): Likewise.
9892 (BUILD_PAIR): Likewise.
9893 (DISASSEMBLE_PAIR_V): Likewise.
9896 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_mma_builtin):
9897 Handle RS6000_BIF_LXVP and RS6000_BIF_STXVP.
9898 * config/rs6000/rs6000-gen-builtins.c (attrinfo): Add ismmaint.
9899 (parse_bif_attrs): Handle ismmaint.
9900 (write_decls): Add bif_mmaint_bit and bif_is_mmaint.
9901 (write_bif_static_init): Handle ismmaint.
9903 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
9905 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin): New
9907 (rs6000_gimple_fold_builtin): Call rs6000_gimple_fold_new_builtin.
9908 (rs6000_new_builtin_valid_without_lhs): New function.
9909 (rs6000_gimple_fold_new_mma_builtin): Likewise.
9910 (rs6000_gimple_fold_new_builtin): Likewise.
9912 2021-09-17 Thomas Schwinge <thomas@codesourcery.com>
9914 * hash-table.h (hash_table<Descriptor, Lazy, Allocator>::expand):
9915 Destruct stale Value objects.
9916 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
9919 2021-09-17 Roger Sayle <roger@nextmovesoftware.com>
9922 * match.pd (shift optimizations): Disable recent sign-changing
9923 optimization for shifts by zero, these will be folded later.
9925 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
9927 * config/rs6000/rs6000-builtin-new.def (__builtin_mffsl): Move from
9928 [power9] to [always].
9930 2021-09-17 Richard Biener <rguenther@suse.de>
9932 * tree-vect-stmts.c (vectorizable_load): Do not frob
9935 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
9937 * config/i386/i386-features.c (remove_partial_avx_dependency):
9938 Also check TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY and
9939 and TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY before generating
9941 * config/i386/i386.h (TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY):
9943 (TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
9944 * config/i386/i386.md (SSE FP to FP splitters): Replace
9945 TARGET_SSE_PARTIAL_REG_DEPENDENCY with
9946 TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY.
9947 (SSE INT to FP splitter): Replace TARGET_SSE_PARTIAL_REG_DEPENDENCY
9948 with TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY.
9949 * config/i386/x86-tune.def
9950 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): New.
9951 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
9953 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
9956 * config/i386/i386-features.c (remove_partial_avx_dependency):
9957 Check TARGET_USE_VECTOR_FP_CONVERTS and TARGET_USE_VECTOR_CONVERTS
9958 before generating vxorps.
9960 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
9962 * config/i386/i386-options.c (processor_cost_table): Use
9963 tremont_cost for Tremont.
9964 * config/i386/x86-tune-costs.h (tremont_memcpy): New.
9965 (tremont_memset): Likewise.
9966 (tremont_cost): Likewise.
9967 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
9970 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
9972 * common/config/i386/i386-common.c: Use Haswell scheduling model
9974 * config/i386/i386.c (ix86_sched_init_global): Prepare for Tremont
9976 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Tremont
9978 (ix86_adjust_cost): Handle Tremont.
9979 * config/i386/x86-tune.def (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY):
9981 (X86_TUNE_USE_LEAVE): Likewise.
9982 (X86_TUNE_PUSH_MEMORY): Likewise.
9983 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
9984 (X86_TUNE_USE_CLTD): Likewise.
9985 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
9986 (X86_TUNE_AVOID_MFENCE): Likewise.
9987 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
9988 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
9989 (X86_TUNE_ACCUMULATE_OUTGOING_ARGS): Disable for Tremont.
9990 (X86_TUNE_FOUR_JUMP_LIMIT): Likewise.
9991 (X86_TUNE_OPT_AGU): Likewise.
9992 (X86_TUNE_AVOID_LEA_FOR_ADDR): Likewise.
9993 (X86_TUNE_AVOID_MEM_OPND_FOR_CMOVE): Likewise.
9994 (X86_TUNE_EXPAND_ABS): Likewise.
9995 (X86_TUNE_SPLIT_MEM_OPND_FOR_FP_CONVERTS): Likewise.
9996 (X86_TUNE_SLOW_PSHUFB): Likewise.
9998 2021-09-17 Eric Botcazou <ebotcazou@adacore.com>
10000 PR rtl-optimization/102306
10001 * combine.c (try_combine): Abort the combination if we are about to
10002 duplicate volatile references.
10004 2021-09-17 liuhongt <hongtao.liu@intel.com>
10006 * config/i386/avx512fp16intrin.h (_mm_undefined_ph):
10008 (_mm256_undefined_ph): Likewise.
10009 (_mm512_undefined_ph): Likewise.
10010 (_mm_cvtsh_h): Likewise.
10011 (_mm256_cvtsh_h): Likewise.
10012 (_mm512_cvtsh_h): Likewise.
10013 (_mm512_castph_ps): Likewise.
10014 (_mm512_castph_pd): Likewise.
10015 (_mm512_castph_si512): Likewise.
10016 (_mm512_castph512_ph128): Likewise.
10017 (_mm512_castph512_ph256): Likewise.
10018 (_mm512_castph128_ph512): Likewise.
10019 (_mm512_castph256_ph512): Likewise.
10020 (_mm512_zextph128_ph512): Likewise.
10021 (_mm512_zextph256_ph512): Likewise.
10022 (_mm512_castps_ph): Likewise.
10023 (_mm512_castpd_ph): Likewise.
10024 (_mm512_castsi512_ph): Likewise.
10025 * config/i386/avx512fp16vlintrin.h (_mm_castph_ps):
10027 (_mm256_castph_ps): Likewise.
10028 (_mm_castph_pd): Likewise.
10029 (_mm256_castph_pd): Likewise.
10030 (_mm_castph_si128): Likewise.
10031 (_mm256_castph_si256): Likewise.
10032 (_mm_castps_ph): Likewise.
10033 (_mm256_castps_ph): Likewise.
10034 (_mm_castpd_ph): Likewise.
10035 (_mm256_castpd_ph): Likewise.
10036 (_mm_castsi128_ph): Likewise.
10037 (_mm256_castsi256_ph): Likewise.
10038 (_mm256_castph256_ph128): Likewise.
10039 (_mm256_castph128_ph256): Likewise.
10040 (_mm256_zextph128_ph256): Likewise.
10042 2021-09-17 liuhongt <hongtao.liu@intel.com>
10044 * config/i386/avx512fp16intrin.h (_mm_cvtsh_ss):
10046 (_mm_mask_cvtsh_ss): Likewise.
10047 (_mm_maskz_cvtsh_ss): Likewise.
10048 (_mm_cvtsh_sd): Likewise.
10049 (_mm_mask_cvtsh_sd): Likewise.
10050 (_mm_maskz_cvtsh_sd): Likewise.
10051 (_mm_cvt_roundsh_ss): Likewise.
10052 (_mm_mask_cvt_roundsh_ss): Likewise.
10053 (_mm_maskz_cvt_roundsh_ss): Likewise.
10054 (_mm_cvt_roundsh_sd): Likewise.
10055 (_mm_mask_cvt_roundsh_sd): Likewise.
10056 (_mm_maskz_cvt_roundsh_sd): Likewise.
10057 (_mm_cvtss_sh): Likewise.
10058 (_mm_mask_cvtss_sh): Likewise.
10059 (_mm_maskz_cvtss_sh): Likewise.
10060 (_mm_cvtsd_sh): Likewise.
10061 (_mm_mask_cvtsd_sh): Likewise.
10062 (_mm_maskz_cvtsd_sh): Likewise.
10063 (_mm_cvt_roundss_sh): Likewise.
10064 (_mm_mask_cvt_roundss_sh): Likewise.
10065 (_mm_maskz_cvt_roundss_sh): Likewise.
10066 (_mm_cvt_roundsd_sh): Likewise.
10067 (_mm_mask_cvt_roundsd_sh): Likewise.
10068 (_mm_maskz_cvt_roundsd_sh): Likewise.
10069 * config/i386/i386-builtin-types.def
10070 (V8HF_FTYPE_V2DF_V8HF_V8HF_UQI_INT,
10071 V8HF_FTYPE_V4SF_V8HF_V8HF_UQI_INT,
10072 V2DF_FTYPE_V8HF_V2DF_V2DF_UQI_INT,
10073 V4SF_FTYPE_V8HF_V4SF_V4SF_UQI_INT): Add new builtin types.
10074 * config/i386/i386-builtin.def: Add corrresponding new builtins.
10075 * config/i386/i386-expand.c: Handle new builtin types.
10076 * config/i386/sse.md (VF48_128): New mode iterator.
10077 (avx512fp16_vcvtsh2<ssescalarmodesuffix><mask_scalar_name><round_saeonly_scalar_name>):
10079 (avx512fp16_vcvt<ssescalarmodesuffix>2sh<mask_scalar_name><round_scalar_name>):
10082 2021-09-17 liuhongt <hongtao.liu@intel.com>
10084 * config/i386/avx512fp16intrin.h (_mm512_cvtph_pd):
10086 (_mm512_mask_cvtph_pd): Likewise.
10087 (_mm512_maskz_cvtph_pd): Likewise.
10088 (_mm512_cvt_roundph_pd): Likewise.
10089 (_mm512_mask_cvt_roundph_pd): Likewise.
10090 (_mm512_maskz_cvt_roundph_pd): Likewise.
10091 (_mm512_cvtxph_ps): Likewise.
10092 (_mm512_mask_cvtxph_ps): Likewise.
10093 (_mm512_maskz_cvtxph_ps): Likewise.
10094 (_mm512_cvtx_roundph_ps): Likewise.
10095 (_mm512_mask_cvtx_roundph_ps): Likewise.
10096 (_mm512_maskz_cvtx_roundph_ps): Likewise.
10097 (_mm512_cvtxps_ph): Likewise.
10098 (_mm512_mask_cvtxps_ph): Likewise.
10099 (_mm512_maskz_cvtxps_ph): Likewise.
10100 (_mm512_cvtx_roundps_ph): Likewise.
10101 (_mm512_mask_cvtx_roundps_ph): Likewise.
10102 (_mm512_maskz_cvtx_roundps_ph): Likewise.
10103 (_mm512_cvtpd_ph): Likewise.
10104 (_mm512_mask_cvtpd_ph): Likewise.
10105 (_mm512_maskz_cvtpd_ph): Likewise.
10106 (_mm512_cvt_roundpd_ph): Likewise.
10107 (_mm512_mask_cvt_roundpd_ph): Likewise.
10108 (_mm512_maskz_cvt_roundpd_ph): Likewise.
10109 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_pd):
10111 (_mm_mask_cvtph_pd): Likewise.
10112 (_mm_maskz_cvtph_pd): Likewise.
10113 (_mm256_cvtph_pd): Likewise.
10114 (_mm256_mask_cvtph_pd): Likewise.
10115 (_mm256_maskz_cvtph_pd): Likewise.
10116 (_mm_cvtxph_ps): Likewise.
10117 (_mm_mask_cvtxph_ps): Likewise.
10118 (_mm_maskz_cvtxph_ps): Likewise.
10119 (_mm256_cvtxph_ps): Likewise.
10120 (_mm256_mask_cvtxph_ps): Likewise.
10121 (_mm256_maskz_cvtxph_ps): Likewise.
10122 (_mm_cvtxps_ph): Likewise.
10123 (_mm_mask_cvtxps_ph): Likewise.
10124 (_mm_maskz_cvtxps_ph): Likewise.
10125 (_mm256_cvtxps_ph): Likewise.
10126 (_mm256_mask_cvtxps_ph): Likewise.
10127 (_mm256_maskz_cvtxps_ph): Likewise.
10128 (_mm_cvtpd_ph): Likewise.
10129 (_mm_mask_cvtpd_ph): Likewise.
10130 (_mm_maskz_cvtpd_ph): Likewise.
10131 (_mm256_cvtpd_ph): Likewise.
10132 (_mm256_mask_cvtpd_ph): Likewise.
10133 (_mm256_maskz_cvtpd_ph): Likewise.
10134 * config/i386/i386-builtin.def: Add corresponding new builtins.
10135 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10136 * config/i386/i386-expand.c: Handle new builtin types.
10137 * config/i386/sse.md
10138 (VF4_128_8_256): New.
10139 (VF48H_AVX512VL): Ditto.
10140 (ssePHmode): Add HF vector modes.
10141 (castmode): Add new convertable modes.
10142 (qq2phsuff): Ditto.
10143 (ph2pssuffix): New.
10144 (avx512fp16_vcvt<castmode>2ph_<mode><mask_name><round_name>): Ditto.
10145 (avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
10146 (*avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
10147 (avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
10148 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
10149 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask_1): Ditto.
10150 (avx512fp16_float_extend_ph<mode>2<mask_name><round_saeonly_name>):
10152 (avx512fp16_float_extend_ph<mode>2<mask_name>): Ditto.
10153 (*avx512fp16_float_extend_ph<mode>2_load<mask_name>): Ditto.
10154 (avx512fp16_float_extend_phv2df2<mask_name>): Ditto.
10155 (*avx512fp16_float_extend_phv2df2_load<mask_name>): Ditto.
10157 2021-09-17 liuhongt <hongtao.liu@intel.com>
10159 * config/i386/avx512fp16intrin.h (_mm_cvttsh_i32):
10161 (_mm_cvttsh_u32): Likewise.
10162 (_mm_cvtt_roundsh_i32): Likewise.
10163 (_mm_cvtt_roundsh_u32): Likewise.
10164 (_mm_cvttsh_i64): Likewise.
10165 (_mm_cvttsh_u64): Likewise.
10166 (_mm_cvtt_roundsh_i64): Likewise.
10167 (_mm_cvtt_roundsh_u64): Likewise.
10168 * config/i386/i386-builtin.def: Add corresponding new builtins.
10169 * config/i386/sse.md
10170 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<round_saeonly_name>):
10173 2021-09-17 liuhongt <hongtao.liu@intel.com>
10175 * config/i386/avx512fp16intrin.h (_mm512_cvttph_epi32):
10177 (_mm512_mask_cvttph_epi32): Likewise.
10178 (_mm512_maskz_cvttph_epi32): Likewise.
10179 (_mm512_cvtt_roundph_epi32): Likewise.
10180 (_mm512_mask_cvtt_roundph_epi32): Likewise.
10181 (_mm512_maskz_cvtt_roundph_epi32): Likewise.
10182 (_mm512_cvttph_epu32): Likewise.
10183 (_mm512_mask_cvttph_epu32): Likewise.
10184 (_mm512_maskz_cvttph_epu32): Likewise.
10185 (_mm512_cvtt_roundph_epu32): Likewise.
10186 (_mm512_mask_cvtt_roundph_epu32): Likewise.
10187 (_mm512_maskz_cvtt_roundph_epu32): Likewise.
10188 (_mm512_cvttph_epi64): Likewise.
10189 (_mm512_mask_cvttph_epi64): Likewise.
10190 (_mm512_maskz_cvttph_epi64): Likewise.
10191 (_mm512_cvtt_roundph_epi64): Likewise.
10192 (_mm512_mask_cvtt_roundph_epi64): Likewise.
10193 (_mm512_maskz_cvtt_roundph_epi64): Likewise.
10194 (_mm512_cvttph_epu64): Likewise.
10195 (_mm512_mask_cvttph_epu64): Likewise.
10196 (_mm512_maskz_cvttph_epu64): Likewise.
10197 (_mm512_cvtt_roundph_epu64): Likewise.
10198 (_mm512_mask_cvtt_roundph_epu64): Likewise.
10199 (_mm512_maskz_cvtt_roundph_epu64): Likewise.
10200 (_mm512_cvttph_epi16): Likewise.
10201 (_mm512_mask_cvttph_epi16): Likewise.
10202 (_mm512_maskz_cvttph_epi16): Likewise.
10203 (_mm512_cvtt_roundph_epi16): Likewise.
10204 (_mm512_mask_cvtt_roundph_epi16): Likewise.
10205 (_mm512_maskz_cvtt_roundph_epi16): Likewise.
10206 (_mm512_cvttph_epu16): Likewise.
10207 (_mm512_mask_cvttph_epu16): Likewise.
10208 (_mm512_maskz_cvttph_epu16): Likewise.
10209 (_mm512_cvtt_roundph_epu16): Likewise.
10210 (_mm512_mask_cvtt_roundph_epu16): Likewise.
10211 (_mm512_maskz_cvtt_roundph_epu16): Likewise.
10212 * config/i386/avx512fp16vlintrin.h (_mm_cvttph_epi32):
10214 (_mm_mask_cvttph_epi32): Likewise.
10215 (_mm_maskz_cvttph_epi32): Likewise.
10216 (_mm256_cvttph_epi32): Likewise.
10217 (_mm256_mask_cvttph_epi32): Likewise.
10218 (_mm256_maskz_cvttph_epi32): Likewise.
10219 (_mm_cvttph_epu32): Likewise.
10220 (_mm_mask_cvttph_epu32): Likewise.
10221 (_mm_maskz_cvttph_epu32): Likewise.
10222 (_mm256_cvttph_epu32): Likewise.
10223 (_mm256_mask_cvttph_epu32): Likewise.
10224 (_mm256_maskz_cvttph_epu32): Likewise.
10225 (_mm_cvttph_epi64): Likewise.
10226 (_mm_mask_cvttph_epi64): Likewise.
10227 (_mm_maskz_cvttph_epi64): Likewise.
10228 (_mm256_cvttph_epi64): Likewise.
10229 (_mm256_mask_cvttph_epi64): Likewise.
10230 (_mm256_maskz_cvttph_epi64): Likewise.
10231 (_mm_cvttph_epu64): Likewise.
10232 (_mm_mask_cvttph_epu64): Likewise.
10233 (_mm_maskz_cvttph_epu64): Likewise.
10234 (_mm256_cvttph_epu64): Likewise.
10235 (_mm256_mask_cvttph_epu64): Likewise.
10236 (_mm256_maskz_cvttph_epu64): Likewise.
10237 (_mm_cvttph_epi16): Likewise.
10238 (_mm_mask_cvttph_epi16): Likewise.
10239 (_mm_maskz_cvttph_epi16): Likewise.
10240 (_mm256_cvttph_epi16): Likewise.
10241 (_mm256_mask_cvttph_epi16): Likewise.
10242 (_mm256_maskz_cvttph_epi16): Likewise.
10243 (_mm_cvttph_epu16): Likewise.
10244 (_mm_mask_cvttph_epu16): Likewise.
10245 (_mm_maskz_cvttph_epu16): Likewise.
10246 (_mm256_cvttph_epu16): Likewise.
10247 (_mm256_mask_cvttph_epu16): Likewise.
10248 (_mm256_maskz_cvttph_epu16): Likewise.
10249 * config/i386/i386-builtin.def: Add new builtins.
10250 * config/i386/sse.md
10251 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name><round_saeonly_name>):
10253 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name>): Ditto.
10254 (*avx512fp16_fix<fixunssuffix>_trunc<mode>2_load<mask_name>): Ditto.
10255 (avx512fp16_fix<fixunssuffix>_truncv2di2<mask_name>): Ditto.
10256 (avx512fp16_fix<fixunssuffix>_truncv2di2_load<mask_name>): Ditto.
10258 2021-09-17 liuhongt <hongtao.liu@intel.com>
10260 * config/i386/avx512fp16intrin.h (_mm_cvtsh_i32): New intrinsic.
10261 (_mm_cvtsh_u32): Likewise.
10262 (_mm_cvt_roundsh_i32): Likewise.
10263 (_mm_cvt_roundsh_u32): Likewise.
10264 (_mm_cvtsh_i64): Likewise.
10265 (_mm_cvtsh_u64): Likewise.
10266 (_mm_cvt_roundsh_i64): Likewise.
10267 (_mm_cvt_roundsh_u64): Likewise.
10268 (_mm_cvti32_sh): Likewise.
10269 (_mm_cvtu32_sh): Likewise.
10270 (_mm_cvt_roundi32_sh): Likewise.
10271 (_mm_cvt_roundu32_sh): Likewise.
10272 (_mm_cvti64_sh): Likewise.
10273 (_mm_cvtu64_sh): Likewise.
10274 (_mm_cvt_roundi64_sh): Likewise.
10275 (_mm_cvt_roundu64_sh): Likewise.
10276 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10277 * config/i386/i386-builtin.def: Add corresponding new builtins.
10278 * config/i386/i386-expand.c (ix86_expand_round_builtin):
10279 Handle new builtin types.
10280 * config/i386/sse.md
10281 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix><round_name>):
10283 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix>_2): Likewise.
10284 (avx512fp16_vcvt<floatsuffix>si2sh<rex64namesuffix><round_name>): Likewise.
10286 2021-09-16 Bill Schmidt <wschmidt@linux.ibm.com>
10288 * config/rs6000/rs6000-c.c (rs6000-builtins.h): New include.
10289 (altivec_resolve_new_overloaded_builtin): New forward decl.
10290 (rs6000_new_builtin_type_compatible): New function.
10291 (altivec_resolve_overloaded_builtin): Call
10292 altivec_resolve_new_overloaded_builtin.
10293 (altivec_build_new_resolved_builtin): New function.
10294 (altivec_resolve_new_overloaded_builtin): Likewise.
10295 * config/rs6000/rs6000-call.c (rs6000_new_builtin_is_supported):
10297 * config/rs6000/rs6000-gen-builtins.c (write_decls): Remove _p from
10298 name of rs6000_new_builtin_is_supported.
10300 2021-09-16 Uroš Bizjak <ubizjak@gmail.com>
10302 * config/i386/i386-protos.h (ix86_decompose_address):
10303 Change return type to bool.
10304 * config/i386/i386.c (ix86_decompose_address): Ditto.
10306 2021-09-16 Tobias Burnus <tobias@codesourcery.com>
10309 * config/rs6000/t-rs6000 (build/rs6000-gen-builtins.o, build/rbtree.o):
10310 Added 'build/' to target, use build/%.o rule.
10311 (build/rs6000-gen-builtins$(build_exeext)): Add 'build/' and
10312 '$(build_exeext)' to target and 'build/' for the *.o files.
10313 (rs6000-builtins.c): Update for those changes; run rs6000-gen-builtins
10316 2021-09-16 Martin Jambor <mjambor@suse.cz>
10318 * cgraph.c (cgraph_node::dump): Do not check caller count sums if
10319 the body has been removed. Remove trailing whitespace.
10321 2021-09-16 Richard Biener <rguenther@suse.de>
10323 PR middle-end/102360
10324 * internal-fn.c (expand_DEFERRED_INIT): Make pattern-init
10325 of non-memory more robust.
10327 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10329 * config/sparc/sparc-opts.h (enum sparc_processor_type): Add LEON5
10330 * config/sparc/sparc.c (struct processor_costs): Add LEON5 costs
10331 (leon5_adjust_cost): Increase cost of store with data dependency
10332 on ALU instruction and FPU anti-dependencies.
10333 (sparc_option_override): Add LEON5 costs
10334 (sparc_adjust_cost): Add LEON5 cost adjustments
10335 * config/sparc/sparc.h: Add LEON5
10336 * config/sparc/sparc.md: Include LEON5 scheduling information
10337 * config/sparc/sparc.opt: Add LEON5
10338 * doc/invoke.texi: Add LEON5
10339 * config/sparc/leon5.md: New file.
10341 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10343 * config/sparc/sparc.md (stack_protect_set32): Add NOP to prevent
10344 sensitive sequence for B2BST errata workaround.
10346 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10348 * config/sparc/sparc.c (sparc_do_work_around_errata): Do not begin
10349 functions with atomic instruction in the UT700 errata workaround.
10351 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10353 * config/sparc/sparc.c (next_active_non_empty_insn): New function
10354 that returns next active non empty assembly instruction.
10355 (sparc_do_work_around_errata): Use new function.
10357 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10359 * config/sparc/sparc.c (store_insn_p): Add predicate for store
10361 (load_insn_p): Add predicate for load attributes.
10362 (sparc_do_work_around_errata): Use new predicates.
10364 2021-09-16 Andreas Larsson <andreas@gaisler.com>
10366 * config/sparc/sparc.c (dump_target_flag_bits): Print bit names for
10369 2021-09-16 Martin Liska <mliska@suse.cz>
10371 * config/mips/netbsd.h: Fix typo in name of a macro.
10373 2021-09-16 liuhongt <hongtao.liu@intel.com>
10375 PR middle-end/102080
10376 * match.pd: Check mask type when doing cond_op related gimple
10378 * tree.c (is_truth_type_for): New function.
10379 * tree.h (is_truth_type_for): New declaration.
10381 2021-09-16 liuhongt <hongtao.liu@intel.com>
10383 * config/i386/avx512fp16intrin.h (_mm512_cvtepi32_ph): New
10385 (_mm512_mask_cvtepi32_ph): Likewise.
10386 (_mm512_maskz_cvtepi32_ph): Likewise.
10387 (_mm512_cvt_roundepi32_ph): Likewise.
10388 (_mm512_mask_cvt_roundepi32_ph): Likewise.
10389 (_mm512_maskz_cvt_roundepi32_ph): Likewise.
10390 (_mm512_cvtepu32_ph): Likewise.
10391 (_mm512_mask_cvtepu32_ph): Likewise.
10392 (_mm512_maskz_cvtepu32_ph): Likewise.
10393 (_mm512_cvt_roundepu32_ph): Likewise.
10394 (_mm512_mask_cvt_roundepu32_ph): Likewise.
10395 (_mm512_maskz_cvt_roundepu32_ph): Likewise.
10396 (_mm512_cvtepi64_ph): Likewise.
10397 (_mm512_mask_cvtepi64_ph): Likewise.
10398 (_mm512_maskz_cvtepi64_ph): Likewise.
10399 (_mm512_cvt_roundepi64_ph): Likewise.
10400 (_mm512_mask_cvt_roundepi64_ph): Likewise.
10401 (_mm512_maskz_cvt_roundepi64_ph): Likewise.
10402 (_mm512_cvtepu64_ph): Likewise.
10403 (_mm512_mask_cvtepu64_ph): Likewise.
10404 (_mm512_maskz_cvtepu64_ph): Likewise.
10405 (_mm512_cvt_roundepu64_ph): Likewise.
10406 (_mm512_mask_cvt_roundepu64_ph): Likewise.
10407 (_mm512_maskz_cvt_roundepu64_ph): Likewise.
10408 (_mm512_cvtepi16_ph): Likewise.
10409 (_mm512_mask_cvtepi16_ph): Likewise.
10410 (_mm512_maskz_cvtepi16_ph): Likewise.
10411 (_mm512_cvt_roundepi16_ph): Likewise.
10412 (_mm512_mask_cvt_roundepi16_ph): Likewise.
10413 (_mm512_maskz_cvt_roundepi16_ph): Likewise.
10414 (_mm512_cvtepu16_ph): Likewise.
10415 (_mm512_mask_cvtepu16_ph): Likewise.
10416 (_mm512_maskz_cvtepu16_ph): Likewise.
10417 (_mm512_cvt_roundepu16_ph): Likewise.
10418 (_mm512_mask_cvt_roundepu16_ph): Likewise.
10419 (_mm512_maskz_cvt_roundepu16_ph): Likewise.
10420 * config/i386/avx512fp16vlintrin.h (_mm_cvtepi32_ph): New
10422 (_mm_mask_cvtepi32_ph): Likewise.
10423 (_mm_maskz_cvtepi32_ph): Likewise.
10424 (_mm256_cvtepi32_ph): Likewise.
10425 (_mm256_mask_cvtepi32_ph): Likewise.
10426 (_mm256_maskz_cvtepi32_ph): Likewise.
10427 (_mm_cvtepu32_ph): Likewise.
10428 (_mm_mask_cvtepu32_ph): Likewise.
10429 (_mm_maskz_cvtepu32_ph): Likewise.
10430 (_mm256_cvtepu32_ph): Likewise.
10431 (_mm256_mask_cvtepu32_ph): Likewise.
10432 (_mm256_maskz_cvtepu32_ph): Likewise.
10433 (_mm_cvtepi64_ph): Likewise.
10434 (_mm_mask_cvtepi64_ph): Likewise.
10435 (_mm_maskz_cvtepi64_ph): Likewise.
10436 (_mm256_cvtepi64_ph): Likewise.
10437 (_mm256_mask_cvtepi64_ph): Likewise.
10438 (_mm256_maskz_cvtepi64_ph): Likewise.
10439 (_mm_cvtepu64_ph): Likewise.
10440 (_mm_mask_cvtepu64_ph): Likewise.
10441 (_mm_maskz_cvtepu64_ph): Likewise.
10442 (_mm256_cvtepu64_ph): Likewise.
10443 (_mm256_mask_cvtepu64_ph): Likewise.
10444 (_mm256_maskz_cvtepu64_ph): Likewise.
10445 (_mm_cvtepi16_ph): Likewise.
10446 (_mm_mask_cvtepi16_ph): Likewise.
10447 (_mm_maskz_cvtepi16_ph): Likewise.
10448 (_mm256_cvtepi16_ph): Likewise.
10449 (_mm256_mask_cvtepi16_ph): Likewise.
10450 (_mm256_maskz_cvtepi16_ph): Likewise.
10451 (_mm_cvtepu16_ph): Likewise.
10452 (_mm_mask_cvtepu16_ph): Likewise.
10453 (_mm_maskz_cvtepu16_ph): Likewise.
10454 (_mm256_cvtepu16_ph): Likewise.
10455 (_mm256_mask_cvtepu16_ph): Likewise.
10456 (_mm256_maskz_cvtepu16_ph): Likewise.
10457 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10458 * config/i386/i386-builtin.def: Add corresponding new builtins.
10459 * config/i386/i386-expand.c
10460 (ix86_expand_args_builtin): Handle new builtin types.
10461 (ix86_expand_round_builtin): Ditto.
10462 * config/i386/i386-modes.def: Declare V2HF and V6HF.
10463 * config/i386/sse.md (VI2H_AVX512VL): New.
10464 (qq2phsuff): Ditto.
10465 (sseintvecmode): Add HF vector modes.
10466 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode><mask_name><round_name>):
10468 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
10469 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
10470 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
10471 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
10472 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask_1): Ditto.
10473 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
10474 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
10475 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
10476 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
10477 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask_1): Ditto.
10478 * config/i386/subst.md (round_qq2phsuff): New subst_attr.
10480 2021-09-16 liuhongt <hongtao.liu@intel.com>
10482 * config/i386/avx512fp16intrin.h (_mm512_cvtph_epi32):
10484 (_mm512_mask_cvtph_epi32): Likewise.
10485 (_mm512_maskz_cvtph_epi32): Likewise.
10486 (_mm512_cvt_roundph_epi32): Likewise.
10487 (_mm512_mask_cvt_roundph_epi32): Likewise.
10488 (_mm512_maskz_cvt_roundph_epi32): Likewise.
10489 (_mm512_cvtph_epu32): Likewise.
10490 (_mm512_mask_cvtph_epu32): Likewise.
10491 (_mm512_maskz_cvtph_epu32): Likewise.
10492 (_mm512_cvt_roundph_epu32): Likewise.
10493 (_mm512_mask_cvt_roundph_epu32): Likewise.
10494 (_mm512_maskz_cvt_roundph_epu32): Likewise.
10495 (_mm512_cvtph_epi64): Likewise.
10496 (_mm512_mask_cvtph_epi64): Likewise.
10497 (_mm512_maskz_cvtph_epi64): Likewise.
10498 (_mm512_cvt_roundph_epi64): Likewise.
10499 (_mm512_mask_cvt_roundph_epi64): Likewise.
10500 (_mm512_maskz_cvt_roundph_epi64): Likewise.
10501 (_mm512_cvtph_epu64): Likewise.
10502 (_mm512_mask_cvtph_epu64): Likewise.
10503 (_mm512_maskz_cvtph_epu64): Likewise.
10504 (_mm512_cvt_roundph_epu64): Likewise.
10505 (_mm512_mask_cvt_roundph_epu64): Likewise.
10506 (_mm512_maskz_cvt_roundph_epu64): Likewise.
10507 (_mm512_cvtph_epi16): Likewise.
10508 (_mm512_mask_cvtph_epi16): Likewise.
10509 (_mm512_maskz_cvtph_epi16): Likewise.
10510 (_mm512_cvt_roundph_epi16): Likewise.
10511 (_mm512_mask_cvt_roundph_epi16): Likewise.
10512 (_mm512_maskz_cvt_roundph_epi16): Likewise.
10513 (_mm512_cvtph_epu16): Likewise.
10514 (_mm512_mask_cvtph_epu16): Likewise.
10515 (_mm512_maskz_cvtph_epu16): Likewise.
10516 (_mm512_cvt_roundph_epu16): Likewise.
10517 (_mm512_mask_cvt_roundph_epu16): Likewise.
10518 (_mm512_maskz_cvt_roundph_epu16): Likewise.
10519 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_epi32):
10521 (_mm_mask_cvtph_epi32): Likewise.
10522 (_mm_maskz_cvtph_epi32): Likewise.
10523 (_mm256_cvtph_epi32): Likewise.
10524 (_mm256_mask_cvtph_epi32): Likewise.
10525 (_mm256_maskz_cvtph_epi32): Likewise.
10526 (_mm_cvtph_epu32): Likewise.
10527 (_mm_mask_cvtph_epu32): Likewise.
10528 (_mm_maskz_cvtph_epu32): Likewise.
10529 (_mm256_cvtph_epu32): Likewise.
10530 (_mm256_mask_cvtph_epu32): Likewise.
10531 (_mm256_maskz_cvtph_epu32): Likewise.
10532 (_mm_cvtph_epi64): Likewise.
10533 (_mm_mask_cvtph_epi64): Likewise.
10534 (_mm_maskz_cvtph_epi64): Likewise.
10535 (_mm256_cvtph_epi64): Likewise.
10536 (_mm256_mask_cvtph_epi64): Likewise.
10537 (_mm256_maskz_cvtph_epi64): Likewise.
10538 (_mm_cvtph_epu64): Likewise.
10539 (_mm_mask_cvtph_epu64): Likewise.
10540 (_mm_maskz_cvtph_epu64): Likewise.
10541 (_mm256_cvtph_epu64): Likewise.
10542 (_mm256_mask_cvtph_epu64): Likewise.
10543 (_mm256_maskz_cvtph_epu64): Likewise.
10544 (_mm_cvtph_epi16): Likewise.
10545 (_mm_mask_cvtph_epi16): Likewise.
10546 (_mm_maskz_cvtph_epi16): Likewise.
10547 (_mm256_cvtph_epi16): Likewise.
10548 (_mm256_mask_cvtph_epi16): Likewise.
10549 (_mm256_maskz_cvtph_epi16): Likewise.
10550 (_mm_cvtph_epu16): Likewise.
10551 (_mm_mask_cvtph_epu16): Likewise.
10552 (_mm_maskz_cvtph_epu16): Likewise.
10553 (_mm256_cvtph_epu16): Likewise.
10554 (_mm256_mask_cvtph_epu16): Likewise.
10555 (_mm256_maskz_cvtph_epu16): Likewise.
10556 * config/i386/i386-builtin-types.def: Add new builtin types.
10557 * config/i386/i386-builtin.def: Add new builtins.
10558 * config/i386/i386-expand.c
10559 (ix86_expand_args_builtin): Handle new builtin types.
10560 (ix86_expand_round_builtin): Ditto.
10561 * config/i386/sse.md (sseintconvert): New.
10562 (ssePHmode): Ditto.
10563 (UNSPEC_US_FIX_NOTRUNC): Ditto.
10564 (sseintconvertsignprefix): Ditto.
10565 (avx512fp16_vcvtph2<sseintconvertsignprefix><sseintconvert>_<mode><mask_name><round_name>):
10568 2021-09-16 liuhongt <hongtao.liu@intel.com>
10570 * config/i386/avx512fp16intrin.h: (_mm_cvtsi16_si128):
10572 (_mm_cvtsi128_si16): Likewise.
10573 (_mm_mask_load_sh): Likewise.
10574 (_mm_maskz_load_sh): Likewise.
10575 (_mm_mask_store_sh): Likewise.
10576 (_mm_move_sh): Likewise.
10577 (_mm_mask_move_sh): Likewise.
10578 (_mm_maskz_move_sh): Likewise.
10579 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10580 * config/i386/i386-builtin.def: Add corresponding new builtins.
10581 * config/i386/i386-expand.c
10582 (ix86_expand_special_args_builtin): Handle new builtin types.
10583 (ix86_expand_vector_init_one_nonzero): Adjust for FP16 target.
10584 * config/i386/sse.md (VI2F): New mode iterator.
10585 (vec_set<mode>_0): Use new mode iterator.
10586 (avx512f_mov<ssescalarmodelower>_mask): Adjust for HF vector mode.
10587 (avx512f_store<mode>_mask): Ditto.
10589 2021-09-16 Kewen Lin <linkw@linux.ibm.com>
10591 * config/rs6000/rs6000.opt (-mtoc-fusion): Remove.
10593 2021-09-15 David Edelsohn <dje.gcc@gmail.com>
10595 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
10596 Proceed if no symbol summary or the symbol alias flag is false.
10598 2021-09-15 Jakub Jelinek <jakub@redhat.com>
10602 * varasm.c (output_constructor_regular_field): Instead of assertion
10603 that array_size_for_constructor result is equal to size of
10604 TREE_TYPE (local->val) in bytes, assert that the type size is greater
10605 or equal to array_size_for_constructor result and use type size as
10608 2021-09-15 Martin Liska <mliska@suse.cz>
10611 * config/i386/vxworks.h: Use new macro TARGET_CPU_P.
10613 2021-09-15 Martin Liska <mliska@suse.cz>
10616 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
10617 Check that we have a symbol summary for a symbol.
10619 2021-09-15 Richard Biener <rguenther@suse.de>
10622 * config/rs6000/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
10623 to inherit from elfos.h
10625 2021-09-15 liuhongt <hongtao.liu@intel.com>
10628 * config/i386/i386-expand.c
10629 (ix86_expand_vector_init_interleave): Use puncklwd to pack 2
10631 (ix86_expand_vector_set): Use blendw instead of pinsrw.
10632 * config/i386/i386.c (ix86_can_change_mode_class): Adjust for
10633 AVX512FP16 which supports 16bit vector load.
10634 * config/i386/sse.md (avx512bw_interleave_highv32hi<mask_name>):
10636 (avx512bw_interleave_high<mode><mask_name>): .. this, and
10637 extend to V32HFmode.
10638 (avx2_interleave_highv16hi<mask_name>): Rename to ..
10639 (avx2_interleave_high<mode><mask_name>): .. this, and extend
10641 (vec_interleave_highv8hi<mask_name>): Rename to ..
10642 (vec_interleave_high<mode><mask_name>): .. this, and extend to V8HFmode.
10643 (<mask_codefor>avx512bw_interleave_lowv32hi<mask_name>):
10645 (<mask_codefor>avx512bw_interleave_low<mode><mask_name>):
10646 this, and extend to V32HFmode.
10647 (avx2_interleave_lowv16hi<mask_name>): Rename to ..
10648 (avx2_interleave_low<mode><mask_name>): .. this, and extend to V16HFmode.
10649 (vec_interleave_lowv8hi<mask_name>): Rename to ..
10650 (vec_interleave_low<mode><mask_name>): .. this, and extend to V8HFmode.
10651 (sse4_1_pblendw): Rename to ..
10652 (sse4_1_pblend<blendsuf>): .. this, and extend to V8HFmode.
10653 (avx2_pblendph): New define_expand.
10654 (<sse2p4_1>_pinsr<ssemodesuffix>): Refactor, use
10655 sseintmodesuffix instead of ssemodesuffix.
10656 (blendsuf): New mode attr.
10658 2021-09-15 Richard Biener <rguenther@suse.de>
10660 * tree-vectorizer.h (dr_misalignment): Move out of line.
10661 (dr_target_alignment): New.
10662 (DR_TARGET_ALIGNMENT): Wrap dr_target_alignment.
10663 (set_dr_target_alignment): New.
10664 (SET_DR_TARGET_ALIGNMENT): Wrap set_dr_target_alignment.
10665 * tree-vect-data-refs.c (dr_misalignment): Compute and
10666 return the group members misalignment.
10667 (vect_compute_data_ref_alignment): Use SET_DR_TARGET_ALIGNMENT.
10668 (vect_analyze_data_refs_alignment): Compute alignment only
10669 for the first element of a DR group.
10670 (vect_slp_analyze_node_alignment): Likewise.
10672 2021-09-15 Hongyu Wang <hongyu.wang@intel.com>
10674 * config/i386/avx512fp16intrin.h: Adjust all builtin calls.
10675 * config/i386/avx512fp16vlintrin.h: Likewise.
10676 * config/i386/i386-builtin.def: Adjust builtin name and
10677 enumeration to match AVX512F style.
10679 2021-09-15 Richard Biener <rguenther@suse.de>
10681 PR tree-optimization/102318
10682 * tree-vect-loop.c (vect_transform_cycle_phi): Revert
10683 previous change and do the mode conversion separately from
10684 the sign conversion.
10686 2021-09-15 Hongtao Liu <hongtao.liu@intel.com>
10687 Peter Cordes <peter@cordes.ca>
10690 * config/i386/sse.md (extract_suf): Add V8SF/V8SI/V4DF/V4DI.
10691 (*vec_extract<mode><ssescalarmodelower>_valign): Output
10692 vextract{i,f}{32x4,64x2} instruction when byte_offset % 16 ==
10695 2021-09-15 Richard Biener <rguenther@suse.de>
10697 * config.gcc: Remove vax-*-openbsd* configuration.
10699 2021-09-15 Richard Biener <rguenther@suse.de>
10701 * config.gcc: Remove m68k-openbsd.
10703 2021-09-15 Max Filippov <jcmvbkbc@gmail.com>
10706 * config/xtensa/t-xtensa (TM_H): Add include/xtensa-config.h.
10708 2021-09-14 Peter Bergner <bergner@linux.ibm.com>
10710 * config/rs6000/mma.md (unspec): Delete UNSPEC_MMA_XXSETACCZ.
10711 (unspecv): Add UNSPECV_MMA_XXSETACCZ.
10712 (*mma_xxsetaccz): Delete.
10713 (mma_xxsetaccz): Change to define_insn. Remove operand 1.
10714 Use UNSPECV_MMA_XXSETACCZ. Update comment.
10715 * config/rs6000/rs6000.c (rs6000_rtx_costs): Use UNSPECV_MMA_XXSETACCZ.
10717 2021-09-14 Iain Sandoe <iain@sandoe.co.uk>
10719 * Makefile.in: Remove variables related to applying no-PIE
10720 to the exes on $build.
10721 * configure: Regenerate.
10722 * configure.ac: Remove configuration related to applying
10723 no-PIE to the exes on $build.
10725 2021-09-14 Claudiu Zissulescu <claziss@synopsys.com>
10727 * config/arc/arc.md (doloop_end): Add missing mode.
10728 (loop_end): Likewise.
10730 2021-09-14 Jakub Jelinek <jakub@redhat.com>
10732 * gimplify.c (goa_stabilize_expr): Add depth argument, propagate
10733 it to recursive calls, for depth above 7 just gimplify or return.
10734 Perform a test even for MODIFY_EXPR, ADDR_EXPR, COMPOUND_EXPR with
10735 __builtin_clear_padding and TARGET_EXPR.
10736 (gimplify_omp_atomic): Adjust goa_stabilize_expr callers.
10738 2021-09-14 liuhongt <hongtao.liu@intel.com>
10740 * config/i386/avx512fp16intrin.h (_mm_fpclass_sh_mask):
10742 (_mm_mask_fpclass_sh_mask): Likewise.
10743 (_mm512_mask_fpclass_ph_mask): Likewise.
10744 (_mm512_fpclass_ph_mask): Likewise.
10745 (_mm_getexp_sh): Likewise.
10746 (_mm_mask_getexp_sh): Likewise.
10747 (_mm_maskz_getexp_sh): Likewise.
10748 (_mm512_getexp_ph): Likewise.
10749 (_mm512_mask_getexp_ph): Likewise.
10750 (_mm512_maskz_getexp_ph): Likewise.
10751 (_mm_getexp_round_sh): Likewise.
10752 (_mm_mask_getexp_round_sh): Likewise.
10753 (_mm_maskz_getexp_round_sh): Likewise.
10754 (_mm512_getexp_round_ph): Likewise.
10755 (_mm512_mask_getexp_round_ph): Likewise.
10756 (_mm512_maskz_getexp_round_ph): Likewise.
10757 (_mm_getmant_sh): Likewise.
10758 (_mm_mask_getmant_sh): Likewise.
10759 (_mm_maskz_getmant_sh): Likewise.
10760 (_mm512_getmant_ph): Likewise.
10761 (_mm512_mask_getmant_ph): Likewise.
10762 (_mm512_maskz_getmant_ph): Likewise.
10763 (_mm_getmant_round_sh): Likewise.
10764 (_mm_mask_getmant_round_sh): Likewise.
10765 (_mm_maskz_getmant_round_sh): Likewise.
10766 (_mm512_getmant_round_ph): Likewise.
10767 (_mm512_mask_getmant_round_ph): Likewise.
10768 (_mm512_maskz_getmant_round_ph): Likewise.
10769 * config/i386/avx512fp16vlintrin.h (_mm_mask_fpclass_ph_mask):
10771 (_mm_fpclass_ph_mask): Likewise.
10772 (_mm256_mask_fpclass_ph_mask): Likewise.
10773 (_mm256_fpclass_ph_mask): Likewise.
10774 (_mm256_getexp_ph): Likewise.
10775 (_mm256_mask_getexp_ph): Likewise.
10776 (_mm256_maskz_getexp_ph): Likewise.
10777 (_mm_getexp_ph): Likewise.
10778 (_mm_mask_getexp_ph): Likewise.
10779 (_mm_maskz_getexp_ph): Likewise.
10780 (_mm256_getmant_ph): Likewise.
10781 (_mm256_mask_getmant_ph): Likewise.
10782 (_mm256_maskz_getmant_ph): Likewise.
10783 (_mm_getmant_ph): Likewise.
10784 (_mm_mask_getmant_ph): Likewise.
10785 (_mm_maskz_getmant_ph): Likewise.
10786 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10787 * config/i386/i386-builtin.def: Add corresponding new builtins.
10788 * config/i386/i386-expand.c
10789 (ix86_expand_args_builtin): Handle new builtin types.
10790 (ix86_expand_round_builtin): Ditto.
10791 * config/i386/sse.md (vecmemsuffix): Add HF vector modes.
10792 (<avx512>_getexp<mode><mask_name><round_saeonly_name>): Adjust
10793 to support HF vector modes.
10794 (avx512f_sgetexp<mode><mask_scalar_name><round_saeonly_scalar_name):
10796 (avx512dq_fpclass<mode><mask_scalar_merge_name>): Ditto.
10797 (avx512dq_vmfpclass<mode><mask_scalar_merge_name>): Ditto.
10798 (<avx512>_getmant<mode><mask_name><round_saeonly_name>): Ditto.
10799 (avx512f_vgetmant<mode><mask_scalar_name><round_saeonly_scalar_name>):
10802 2021-09-14 liuhongt <hongtao.liu@intel.com>
10804 * config/i386/avx512fp16intrin.h (_mm512_reduce_ph):
10806 (_mm512_mask_reduce_ph): Likewise.
10807 (_mm512_maskz_reduce_ph): Likewise.
10808 (_mm512_reduce_round_ph): Likewise.
10809 (_mm512_mask_reduce_round_ph): Likewise.
10810 (_mm512_maskz_reduce_round_ph): Likewise.
10811 (_mm_reduce_sh): Likewise.
10812 (_mm_mask_reduce_sh): Likewise.
10813 (_mm_maskz_reduce_sh): Likewise.
10814 (_mm_reduce_round_sh): Likewise.
10815 (_mm_mask_reduce_round_sh): Likewise.
10816 (_mm_maskz_reduce_round_sh): Likewise.
10817 (_mm512_roundscale_ph): Likewise.
10818 (_mm512_mask_roundscale_ph): Likewise.
10819 (_mm512_maskz_roundscale_ph): Likewise.
10820 (_mm512_roundscale_round_ph): Likewise.
10821 (_mm512_mask_roundscale_round_ph): Likewise.
10822 (_mm512_maskz_roundscale_round_ph): Likewise.
10823 (_mm_roundscale_sh): Likewise.
10824 (_mm_mask_roundscale_sh): Likewise.
10825 (_mm_maskz_roundscale_sh): Likewise.
10826 (_mm_roundscale_round_sh): Likewise.
10827 (_mm_mask_roundscale_round_sh): Likewise.
10828 (_mm_maskz_roundscale_round_sh): Likewise.
10829 * config/i386/avx512fp16vlintrin.h: (_mm_reduce_ph):
10831 (_mm_mask_reduce_ph): Likewise.
10832 (_mm_maskz_reduce_ph): Likewise.
10833 (_mm256_reduce_ph): Likewise.
10834 (_mm256_mask_reduce_ph): Likewise.
10835 (_mm256_maskz_reduce_ph): Likewise.
10836 (_mm_roundscale_ph): Likewise.
10837 (_mm_mask_roundscale_ph): Likewise.
10838 (_mm_maskz_roundscale_ph): Likewise.
10839 (_mm256_roundscale_ph): Likewise.
10840 (_mm256_mask_roundscale_ph): Likewise.
10841 (_mm256_maskz_roundscale_ph): Likewise.
10842 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10843 * config/i386/i386-builtin.def: Add corresponding new builtins.
10844 * config/i386/i386-expand.c
10845 (ix86_expand_args_builtin): Handle new builtin types.
10846 (ix86_expand_round_builtin): Ditto.
10847 * config/i386/sse.md (<mask_codefor>reducep<mode><mask_name>):
10849 (<mask_codefor>reducep<mode><mask_name><round_saeonly_name>):
10850 ... this, and adjust for round operands.
10851 (reduces<mode><mask_scalar_name>): Likewise, with ...
10852 (reduces<mode><mask_scalar_name><round_saeonly_scalar_name):
10854 (<avx512>_rndscale<mode><mask_name><round_saeonly_name>):
10855 Adjust for HF vector modes.
10856 (avx512f_rndscale<mode><mask_scalar_name><round_saeonly_scalar_name>):
10858 (*avx512f_rndscale<mode><round_saeonly_name>): Ditto.
10860 2021-09-14 liuhongt <hongtao.liu@intel.com>
10862 * config/i386/avx512fp16intrin.h: (_mm512_rcp_ph):
10864 (_mm512_mask_rcp_ph): Likewise.
10865 (_mm512_maskz_rcp_ph): Likewise.
10866 (_mm_rcp_sh): Likewise.
10867 (_mm_mask_rcp_sh): Likewise.
10868 (_mm_maskz_rcp_sh): Likewise.
10869 (_mm512_scalef_ph): Likewise.
10870 (_mm512_mask_scalef_ph): Likewise.
10871 (_mm512_maskz_scalef_ph): Likewise.
10872 (_mm512_scalef_round_ph): Likewise.
10873 (_mm512_mask_scalef_round_ph): Likewise.
10874 (_mm512_maskz_scalef_round_ph): Likewise.
10875 (_mm_scalef_sh): Likewise.
10876 (_mm_mask_scalef_sh): Likewise.
10877 (_mm_maskz_scalef_sh): Likewise.
10878 (_mm_scalef_round_sh): Likewise.
10879 (_mm_mask_scalef_round_sh): Likewise.
10880 (_mm_maskz_scalef_round_sh): Likewise.
10881 * config/i386/avx512fp16vlintrin.h (_mm_rcp_ph):
10883 (_mm256_rcp_ph): Likewise.
10884 (_mm_mask_rcp_ph): Likewise.
10885 (_mm256_mask_rcp_ph): Likewise.
10886 (_mm_maskz_rcp_ph): Likewise.
10887 (_mm256_maskz_rcp_ph): Likewise.
10888 (_mm_scalef_ph): Likewise.
10889 (_mm256_scalef_ph): Likewise.
10890 (_mm_mask_scalef_ph): Likewise.
10891 (_mm256_mask_scalef_ph): Likewise.
10892 (_mm_maskz_scalef_ph): Likewise.
10893 (_mm256_maskz_scalef_ph): Likewise.
10894 * config/i386/i386-builtin.def: Add new builtins.
10895 * config/i386/sse.md (VFH_AVX512VL): New.
10896 (avx512fp16_rcp<mode>2<mask_name>): Ditto.
10897 (avx512fp16_vmrcpv8hf2<mask_scalar_name>): Ditto.
10898 (avx512f_vmscalef<mode><mask_scalar_name><round_scalar_name>):
10899 Adjust to support HF vector modes.
10900 (<avx512>_scalef<mode><mask_name><round_name>): Ditto.
10902 2021-09-14 liuhongt <hongtao.liu@intel.com>
10904 * config/i386/avx512fp16intrin.h: (_mm512_sqrt_ph):
10906 (_mm512_mask_sqrt_ph): Likewise.
10907 (_mm512_maskz_sqrt_ph): Likewise.
10908 (_mm512_sqrt_round_ph): Likewise.
10909 (_mm512_mask_sqrt_round_ph): Likewise.
10910 (_mm512_maskz_sqrt_round_ph): Likewise.
10911 (_mm512_rsqrt_ph): Likewise.
10912 (_mm512_mask_rsqrt_ph): Likewise.
10913 (_mm512_maskz_rsqrt_ph): Likewise.
10914 (_mm_rsqrt_sh): Likewise.
10915 (_mm_mask_rsqrt_sh): Likewise.
10916 (_mm_maskz_rsqrt_sh): Likewise.
10917 (_mm_sqrt_sh): Likewise.
10918 (_mm_mask_sqrt_sh): Likewise.
10919 (_mm_maskz_sqrt_sh): Likewise.
10920 (_mm_sqrt_round_sh): Likewise.
10921 (_mm_mask_sqrt_round_sh): Likewise.
10922 (_mm_maskz_sqrt_round_sh): Likewise.
10923 * config/i386/avx512fp16vlintrin.h (_mm_sqrt_ph): New intrinsic.
10924 (_mm256_sqrt_ph): Likewise.
10925 (_mm_mask_sqrt_ph): Likewise.
10926 (_mm256_mask_sqrt_ph): Likewise.
10927 (_mm_maskz_sqrt_ph): Likewise.
10928 (_mm256_maskz_sqrt_ph): Likewise.
10929 (_mm_rsqrt_ph): Likewise.
10930 (_mm256_rsqrt_ph): Likewise.
10931 (_mm_mask_rsqrt_ph): Likewise.
10932 (_mm256_mask_rsqrt_ph): Likewise.
10933 (_mm_maskz_rsqrt_ph): Likewise.
10934 (_mm256_maskz_rsqrt_ph): Likewise.
10935 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10936 * config/i386/i386-builtin.def: Add corresponding new builtins.
10937 * config/i386/i386-expand.c
10938 (ix86_expand_args_builtin): Handle new builtins.
10939 (ix86_expand_round_builtin): Ditto.
10940 * config/i386/sse.md (VF_AVX512FP16VL): New.
10941 (sqrt<mode>2): Adjust for HF vector modes.
10942 (<sse>_sqrt<mode>2<mask_name><round_name>): Likewise.
10943 (<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
10945 (<sse>_rsqrt<mode>2<mask_name>): New.
10946 (avx512fp16_vmrsqrtv8hf2<mask_scalar_name>): Likewise.
10948 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
10950 PR bootstrap/101574
10951 * diagnostic-spec.c (warning_suppressed_at, copy_warning): Handle
10952 'RESERVED_LOCATION_P' locations.
10953 * warning-control.cc (get_nowarn_spec, suppress_warning)
10954 (copy_warning): Likewise.
10956 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
10958 * diagnostic-spec.h (typedef xint_hash_t): Use 'location_t' instead of...
10959 (typedef key_type_t): ... this. Remove.
10960 (nowarn_map): Document.
10961 * diagnostic-spec.c (nowarn_map): Likewise.
10962 * warning-control.cc (convert_to_key): Evolve functions into...
10963 (get_location): ... these. Adjust all users.
10965 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
10967 * warning-control.cc (copy_warning): Remove 'nowarn_map' setup.
10969 2021-09-13 Jason Merrill <jason@redhat.com>
10971 * params.opt: Add destructive-interference-size and
10972 constructive-interference-size.
10973 * doc/invoke.texi: Document them.
10974 * config/aarch64/aarch64.c (aarch64_override_options_internal):
10976 * config/arm/arm.c (arm_option_override): Set them.
10977 * config/i386/i386-options.c (ix86_option_override_internal):
10980 2021-09-13 Martin Liska <mliska@suse.cz>
10981 H.J. Lu <hjl.tools@gmail.com>
10984 * common/config/i386/cpuinfo.h (cpu_indicator_init): Add support
10985 for x86-64 micro levels for __builtin_cpu_supports.
10986 * common/config/i386/i386-cpuinfo.h (enum feature_priority):
10987 Add priorities for the micro-arch levels.
10988 (enum processor_features): Add new features.
10989 * common/config/i386/i386-isas.h: Add micro-arch features.
10990 * config/i386/i386-builtins.c (get_builtin_code_for_version):
10991 Support the micro-arch levels by callsing
10992 __builtin_cpu_supports.
10993 * doc/extend.texi: Document that the levels are support by
10994 __builtin_cpu_supports.
10996 2021-09-13 Andrew Pinski <apinski@marvell.com>
10999 * config/aarch64/aarch64-builtins.c (aarch64_fold_builtin_lane_check):
11001 (aarch64_general_fold_builtin): Handle AARCH64_SIMD_BUILTIN_LANE_CHECK.
11002 (aarch64_general_gimple_fold_builtin): Likewise.
11004 2021-09-13 Andrew Pinski <apinski@marvell.com>
11006 * config.gcc: Add m32r-*-linux* and m32rle-*-linux*
11007 to the Unsupported targets list.
11008 Remove support for m32r-*-linux* and m32rle-*-linux*.
11009 * config/m32r/linux.h: Removed.
11010 * config/m32r/t-linux: Removed.
11012 2021-09-13 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
11015 * config/aarch64/aarch64.c (aarch64_classify_address): Don't allow
11016 register index for SVE predicate modes.
11018 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
11020 * tree-ssa-threadbackward.c
11021 (back_threader_profitability::profitable_path_p): Remove FSM
11023 (back_threader_registry::register_path): Same.
11024 * tree-ssa-threadedge.c
11025 (jump_threader::simplify_control_stmt_condition): Same.
11026 * tree-ssa-threadupdate.c (jt_path_registry::jt_path_registry):
11027 Add backedge_threads argument.
11028 (fwd_jt_path_registry::fwd_jt_path_registry): Pass
11029 backedge_threads argument.
11030 (back_jt_path_registry::back_jt_path_registry): Same.
11031 (dump_jump_thread_path): Adjust for FSM removal.
11032 (back_jt_path_registry::rewire_first_differing_edge): Same.
11033 (back_jt_path_registry::adjust_paths_after_duplication): Same.
11034 (back_jt_path_registry::update_cfg): Same.
11035 (jt_path_registry::register_jump_thread): Same.
11036 * tree-ssa-threadupdate.h (enum jump_thread_edge_type): Remove
11038 (class back_jt_path_registry): Add backedge_threads to
11041 2021-09-13 Martin Liska <mliska@suse.cz>
11044 * asan.h (sanitize_coverage_p): Handle when fn == NULL.
11046 2021-09-13 H.J. Lu <hjl.tools@gmail.com>
11049 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): New.
11050 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
11051 (MOVE_MAX): Check TARGET_AVX256_MOVE_BY_PIECES and
11052 TARGET_AVX256_STORE_BY_PIECES instead of
11053 TARGET_AVX256_SPLIT_UNALIGNED_LOAD and
11054 TARGET_AVX256_SPLIT_UNALIGNED_STORE.
11055 (STORE_MAX_PIECES): Check TARGET_AVX256_STORE_BY_PIECES instead
11056 of TARGET_AVX256_SPLIT_UNALIGNED_STORE.
11057 * config/i386/x86-tune.def (X86_TUNE_AVX256_MOVE_BY_PIECES): New.
11058 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
11060 2021-09-13 liuhongt <hongtao.liu@intel.com>
11062 PR bootstrap/102302
11063 * expmed.c (extract_bit_field_using_extv): Use
11064 gen_lowpart_if_possible instead of gen_lowpart to avoid ICE.
11066 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
11068 * Makefile.in (OBJS): Add value-pointer-equiv.o.
11069 * gimple-ssa-evrp.c (class ssa_equiv_stack): Move to
11070 value-pointer-equiv.*.
11071 (ssa_equiv_stack::ssa_equiv_stack): Same.
11072 (ssa_equiv_stack::enter): Same.
11073 (ssa_equiv_stack::leave): Same.
11074 (ssa_equiv_stack::push_replacement): Same.
11075 (ssa_equiv_stack::get_replacement): Same.
11076 (is_pointer_ssa): Same.
11077 (class pointer_equiv_analyzer): Same.
11078 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
11079 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Same.
11080 (pointer_equiv_analyzer::set_global_equiv): Same.
11081 (pointer_equiv_analyzer::set_cond_equiv): Same.
11082 (pointer_equiv_analyzer::get_equiv): Same.
11083 (pointer_equiv_analyzer::enter): Same.
11084 (pointer_equiv_analyzer::leave): Same.
11085 (pointer_equiv_analyzer::get_equiv_expr): Same.
11086 (pta_valueize): Same.
11087 (pointer_equiv_analyzer::visit_stmt): Same.
11088 (pointer_equiv_analyzer::visit_edge): Same.
11089 (hybrid_folder::value_of_expr): Same.
11090 (hybrid_folder::value_on_edge): Same.
11091 * value-pointer-equiv.cc: New file.
11092 * value-pointer-equiv.h: New file.
11094 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11097 * gimple-fold.c (gimple_fold_builtin_memory_op): Allow folding
11098 memcpy if the size is not more than MOVE_MAX * MOVE_RATIO.
11100 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11103 * config/arm/arm.md (movmisaligndi): New define_expand.
11104 * config/arm/vec-common.md (movmisalign<mode>): Iterate over VDQ mode.
11106 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11109 * emit-rtl.c (gen_highpart): Use adjust_address to handle
11110 MEM rather than calling simplify_gen_subreg.
11112 2021-09-13 Jan-Benedict Glaw <jbglaw@ług-owl.de>
11114 * config/alpha/vms.h (INIT_CUMULATIVE_ARGS): Wrap multi-statment
11115 define into a block.
11117 2021-09-13 Richard Biener <rguenther@suse.de>
11119 * config/darwin.h (DARWIN_PREFER_DWARF): Do not define.
11120 * config/i386/darwin.h (PREFERRED_DEBUGGING_TYPE): Do not
11121 change based on DARWIN_PREFER_DWARF not being defined.
11123 2021-09-13 Richard Biener <rguenther@suse.de>
11125 * config/i386/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
11126 to inherit from elfos.h
11128 2021-09-13 Richard Biener <rguenther@suse.de>
11130 * config.gcc: Add cr16-*-* to the list of obsoleted targets.
11132 2021-09-13 Richard Biener <rguenther@suse.de>
11134 * config/avr/elf.h (PREFERRED_DEBUGGING_TYPE): Remove
11135 override, pick up DWARF2_DEBUG define from elfos.h
11137 2021-09-13 Richard Biener <rguenther@suse.de>
11139 * config/rx/rx.h (PREFERRED_DEBUGGING_TYPE): Always define to
11142 2021-09-13 Richard Biener <rguenther@suse.de>
11144 * config/alpha/vms.h (PREFERRED_DEBUGGING_TYPE): Define to
11147 2021-09-13 Richard Biener <rguenther@suse.de>
11149 * config/i386/cygming.h: Always default to DWARF2 debugging.
11150 Do not define DBX_DEBUGGING_INFO, that's done via dbxcoff.h
11152 * doc/install.texi: Document binutils 2.16 as minimum
11153 requirement for mingw.
11155 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
11157 * config/rs6000/rs6000.c (struct rs6000_cost_data): New members
11158 nstmts, nloads and extra_ctor_cost.
11159 (rs6000_density_test): Add load density related heuristics. Do
11160 extra costing on vector construction statements if need.
11161 (rs6000_init_cost): Init new members.
11162 (rs6000_update_target_cost_per_stmt): New function.
11163 (rs6000_add_stmt_cost): Factor vect_nonmem hunk out to function
11164 rs6000_update_target_cost_per_stmt and call it.
11166 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
11168 * config/rs6000/rs6000.c (struct rs6000_cost_data): Remove typedef.
11169 (rs6000_init_cost): Adjust.
11171 2021-09-13 liuhongt <hongtao.liu@intel.com>
11173 * config/i386/i386.md: (UNSPEC_COPYSIGN): Remove.
11174 (UNSPEC_XORSIGN): Ditto.
11176 2021-09-12 Roger Sayle <roger@nextmovesoftware.com>
11178 * expr.c (convert_move): Preserve SUBREG_PROMOTED_VAR_P when
11179 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
11182 2021-09-11 Aldy Hernandez <aldyh@redhat.com>
11184 * tree-ssa-threadbackward.c (class back_threader_registry): Use
11185 back_jt_path_registry.
11186 * tree-ssa-threadedge.c (jump_threader::jump_threader): Use
11187 fwd_jt_path_registry.
11188 * tree-ssa-threadedge.h (class jump_threader): Same..
11189 * tree-ssa-threadupdate.c
11190 (jump_thread_path_registry::jump_thread_path_registry): Rename...
11191 (jt_path_registry::jt_path_registry): ...to this.
11192 (jump_thread_path_registry::~jump_thread_path_registry): Rename...
11193 (jt_path_registry::~jt_path_registry): ...this.
11194 (fwd_jt_path_registry::fwd_jt_path_registry): New.
11195 (fwd_jt_path_registry::~fwd_jt_path_registry): New.
11196 (jump_thread_path_registry::allocate_thread_edge): Rename...
11197 (jt_path_registry::allocate_thread_edge): ...to this.
11198 (jump_thread_path_registry::allocate_thread_path): Rename...
11199 (jt_path_registry::allocate_thread_path): ...to this.
11200 (jump_thread_path_registry::lookup_redirection_data): Rename...
11201 (fwd_jt_path_registry::lookup_redirection_data): ...to this.
11202 (jump_thread_path_registry::thread_block_1): Rename...
11203 (fwd_jt_path_registry::thread_block_1): ...to this.
11204 (jump_thread_path_registry::thread_block): Rename...
11205 (fwd_jt_path_registry::thread_block): ...to this.
11206 (jt_path_registry::thread_through_loop_header): Rename...
11207 (fwd_jt_path_registry::thread_through_loop_header): ...to this.
11208 (jump_thread_path_registry::mark_threaded_blocks): Rename...
11209 (fwd_jt_path_registry::mark_threaded_blocks): ...to this.
11210 (jump_thread_path_registry::debug_path): Rename...
11211 (jt_path_registry::debug_path): ...to this.
11212 (jump_thread_path_registry::dump): Rename...
11213 (jt_path_registry::debug): ...to this.
11214 (jump_thread_path_registry::rewire_first_differing_edge): Rename...
11215 (back_jt_path_registry::rewire_first_differing_edge): ...to this.
11216 (jump_thread_path_registry::adjust_paths_after_duplication): Rename...
11217 (back_jt_path_registry::adjust_paths_after_duplication): ...to this.
11218 (jump_thread_path_registry::duplicate_thread_path): Rename...
11219 (back_jt_path_registry::duplicate_thread_path): ...to this. Also,
11220 drop ill-formed candidates.
11221 (jump_thread_path_registry::remove_jump_threads_including): Rename...
11222 (fwd_jt_path_registry::remove_jump_threads_including): ...to this.
11223 (jt_path_registry::thread_through_all_blocks): New.
11224 (back_jt_path_registry::update_cfg): New.
11225 (fwd_jt_path_registry::update_cfg): New.
11226 (jump_thread_path_registry::register_jump_thread): Rename...
11227 (jt_path_registry::register_jump_thread): ...to this.
11228 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
11230 (class jt_path_registry): ...here.
11231 (class fwd_jt_path_registry): New.
11232 (class back_jt_path_registry): New.
11234 2021-09-10 liuhongt <hongtao.liu@intel.com>
11237 2021-09-01 liuhongt <hongtao.liu@intel.com>
11239 * emit-rtl.c (validate_subreg): Get rid of all float-int
11242 2021-09-10 Jakub Jelinek <jakub@redhat.com>
11244 * tree-core.h (enum omp_memory_order): Add OMP_MEMORY_ORDER_MASK,
11245 OMP_FAIL_MEMORY_ORDER_UNSPECIFIED, OMP_FAIL_MEMORY_ORDER_RELAXED,
11246 OMP_FAIL_MEMORY_ORDER_ACQUIRE, OMP_FAIL_MEMORY_ORDER_RELEASE,
11247 OMP_FAIL_MEMORY_ORDER_ACQ_REL, OMP_FAIL_MEMORY_ORDER_SEQ_CST and
11248 OMP_FAIL_MEMORY_ORDER_MASK enumerators.
11249 (OMP_FAIL_MEMORY_ORDER_SHIFT): Define.
11250 * gimple-pretty-print.c (dump_gimple_omp_atomic_load,
11251 dump_gimple_omp_atomic_store): Print [weak] for weak atomic
11253 * gimple.h (enum gf_mask): Change GF_OMP_ATOMIC_MEMORY_ORDER
11254 to 6-bit mask, adjust GF_OMP_ATOMIC_NEED_VALUE value and add
11255 GF_OMP_ATOMIC_WEAK.
11256 (gimple_omp_atomic_weak_p, gimple_omp_atomic_set_weak): New inline
11258 * tree.h (OMP_ATOMIC_WEAK): Define.
11259 * tree-pretty-print.c (dump_omp_atomic_memory_order): Adjust for
11260 fail memory order being encoded in the same enum and also print
11261 fail clause if present.
11262 (dump_generic_node): Print weak clause if OMP_ATOMIC_WEAK.
11263 * gimplify.c (goa_stabilize_expr): Add target_expr and rhs arguments,
11264 handle pre_p == NULL case as a test mode that only returns value
11265 but doesn't change gimplify nor change anything otherwise, adjust
11266 recursive calls, add MODIFY_EXPR, ADDR_EXPR, COND_EXPR, TARGET_EXPR
11267 and CALL_EXPR handling, adjust COMPOUND_EXPR handling for
11268 __builtin_clear_padding calls, for !rhs gimplify as lvalue rather
11270 (gimplify_omp_atomic): Adjust goa_stabilize_expr caller. Handle
11271 COND_EXPR rhs. Set weak flag on gimple load/store for
11273 * omp-expand.c (omp_memory_order_to_fail_memmodel): New function.
11274 (omp_memory_order_to_memmodel): Adjust for fail clause encoded
11276 (expand_omp_atomic_cas): New function.
11277 (expand_omp_atomic_pipeline): Use omp_memory_order_to_fail_memmodel
11279 (expand_omp_atomic): Attempt to optimize atomic compare and exchange
11280 using expand_omp_atomic_cas.
11282 2021-09-10 Aldy Hernandez <aldyh@redhat.com>
11283 Michael Matz <matz@suse.de>
11285 * tree-pass.h (PROP_loop_opts_done): New.
11286 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
11287 Intersect with global range.
11288 * tree-ssa-loop.c (tree_ssa_loop_done): Set PROP_loop_opts_done.
11289 * tree-ssa-threadbackward.c
11290 (back_threader_profitability::profitable_path_p): Disable
11291 threading through latches until after loop optimizations have run.
11293 2021-09-10 David Faust <david.faust@oracle.com>
11295 * doc/invoke.texi: Document BPF -mcpu, -mjmpext, -mjmp32 and -malu32
11298 2021-09-10 David Faust <david.faust@oracle.com>
11300 * config/bpf/bpf-opts.h (bpf_isa_version): New enum.
11301 * config/bpf/bpf-protos.h (bpf_expand_cbranch): New.
11302 * config/bpf/bpf.c (bpf_option_override): Handle -mcpu option.
11303 (bpf_expand_cbranch): New function.
11304 * config/bpf/bpf.md (AM mode iterator): Conditionalize support for SI
11306 (zero_extendsidi2): Only use mov32 instruction if it is available.
11307 (SIM mode iterator): Conditionalize support for SI mode.
11308 (JM mode iterator): New.
11309 (cbranchdi4): Update name, use new JM iterator. Use bpf_expand_cbranch.
11310 (*branch_on_di): Update name, use new JM iterator.
11311 * config/bpf/bpf.opt: (mjmpext): New option.
11312 (malu32): Likewise.
11313 (mjmp32): Likewise.
11315 (bpf_isa): New enum.
11317 2021-09-10 David Faust <david.faust@oracle.com>
11319 * config/bpf/bpf.md (zero_extendhidi2): Add new output template
11320 for register-to-register extensions.
11321 (zero_extendqidi2): Likewise.
11323 2021-09-10 Richard Biener <rguenther@suse.de>
11325 PR middle-end/102273
11326 * internal-fn.c (expand_DEFERRED_INIT): Always expand non-SSA vars.
11328 2021-09-10 Richard Biener <rguenther@suse.de>
11330 PR middle-end/102269
11331 * gimplify.c (is_var_need_auto_init): Empty types do not need
11334 2021-09-10 Richard Biener <rguenther@suse.de>
11336 * configure.ac (--with-stabs): Remove.
11337 * configure: Regenerate.
11338 * doc/install.texi: Remove --with-stabs documentation.
11340 2021-09-10 liuhongt <hongtao.liu@intel.com>
11342 * config/i386/avx512fp16intrin.h: (_mm512_cmp_ph_mask):
11344 (_mm512_mask_cmp_ph_mask): Likewise.
11345 (_mm512_cmp_round_ph_mask): Likewise.
11346 (_mm512_mask_cmp_round_ph_mask): Likewise.
11347 (_mm_cmp_sh_mask): Likewise.
11348 (_mm_mask_cmp_sh_mask): Likewise.
11349 (_mm_cmp_round_sh_mask): Likewise.
11350 (_mm_mask_cmp_round_sh_mask): Likewise.
11351 (_mm_comieq_sh): Likewise.
11352 (_mm_comilt_sh): Likewise.
11353 (_mm_comile_sh): Likewise.
11354 (_mm_comigt_sh): Likewise.
11355 (_mm_comige_sh): Likewise.
11356 (_mm_comineq_sh): Likewise.
11357 (_mm_ucomieq_sh): Likewise.
11358 (_mm_ucomilt_sh): Likewise.
11359 (_mm_ucomile_sh): Likewise.
11360 (_mm_ucomigt_sh): Likewise.
11361 (_mm_ucomige_sh): Likewise.
11362 (_mm_ucomineq_sh): Likewise.
11363 (_mm_comi_round_sh): Likewise.
11364 (_mm_comi_sh): Likewise.
11365 * config/i386/avx512fp16vlintrin.h (_mm_cmp_ph_mask): New intrinsic.
11366 (_mm_mask_cmp_ph_mask): Likewise.
11367 (_mm256_cmp_ph_mask): Likewise.
11368 (_mm256_mask_cmp_ph_mask): Likewise.
11369 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11370 * config/i386/i386-builtin.def: Add corresponding new builtins.
11371 * config/i386/i386-expand.c
11372 (ix86_expand_args_builtin): Handle new builtin types.
11373 (ix86_expand_round_builtin): Ditto.
11374 * config/i386/i386.md (ssevecmode): Add HF mode.
11375 (MODEFH): New mode iterator.
11376 * config/i386/sse.md
11377 (V48H_AVX512VL): New mode iterator to support HF vector modes.
11378 Ajdust corresponding description.
11379 (ssecmpintprefix): New.
11380 (VI12_AVX512VL): Adjust to support HF vector modes.
11381 (cmp_imm_predicate): Likewise.
11382 (<avx512>_cmp<mode>3<mask_scalar_merge_name><round_saeonly_name>):
11384 (avx512f_vmcmp<mode>3<round_saeonly_name>): Likewise.
11385 (avx512f_vmcmp<mode>3_mask<round_saeonly_name>): Likewise.
11386 (<sse>_<unord>comi<round_saeonly_name>): Likewise.
11388 2021-09-10 liuhongt <hongtao.liu@intel.com>
11390 * config/i386/avx512fp16intrin.h: (_mm512_max_ph): New intrinsic.
11391 (_mm512_mask_max_ph): Likewise.
11392 (_mm512_maskz_max_ph): Likewise.
11393 (_mm512_min_ph): Likewise.
11394 (_mm512_mask_min_ph): Likewise.
11395 (_mm512_maskz_min_ph): Likewise.
11396 (_mm512_max_round_ph): Likewise.
11397 (_mm512_mask_max_round_ph): Likewise.
11398 (_mm512_maskz_max_round_ph): Likewise.
11399 (_mm512_min_round_ph): Likewise.
11400 (_mm512_mask_min_round_ph): Likewise.
11401 (_mm512_maskz_min_round_ph): Likewise.
11402 (_mm_max_sh): Likewise.
11403 (_mm_mask_max_sh): Likewise.
11404 (_mm_maskz_max_sh): Likewise.
11405 (_mm_min_sh): Likewise.
11406 (_mm_mask_min_sh): Likewise.
11407 (_mm_maskz_min_sh): Likewise.
11408 (_mm_max_round_sh): Likewise.
11409 (_mm_mask_max_round_sh): Likewise.
11410 (_mm_maskz_max_round_sh): Likewise.
11411 (_mm_min_round_sh): Likewise.
11412 (_mm_mask_min_round_sh): Likewise.
11413 (_mm_maskz_min_round_sh): Likewise.
11414 * config/i386/avx512fp16vlintrin.h (_mm_max_ph): New intrinsic.
11415 (_mm256_max_ph): Likewise.
11416 (_mm_mask_max_ph): Likewise.
11417 (_mm256_mask_max_ph): Likewise.
11418 (_mm_maskz_max_ph): Likewise.
11419 (_mm256_maskz_max_ph): Likewise.
11420 (_mm_min_ph): Likewise.
11421 (_mm256_min_ph): Likewise.
11422 (_mm_mask_min_ph): Likewise.
11423 (_mm256_mask_min_ph): Likewise.
11424 (_mm_maskz_min_ph): Likewise.
11425 (_mm256_maskz_min_ph): Likewise.
11426 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11427 * config/i386/i386-builtin.def: Add corresponding new builtins.
11428 * config/i386/i386-expand.c
11429 (ix86_expand_args_builtin): Handle new builtin types.
11430 * config/i386/sse.md
11431 (<code><mode>3<mask_name><round_saeonly_name>): Adjust to
11432 support HF vector modes.
11433 (*<code><mode>3<mask_name><round_saeonly_name>): Likewise.
11434 (ieee_<ieee_maxmin><mode>3<mask_name><round_saeonly_name>):
11436 (<sse>_vm<code><mode>3<mask_scalar_name><round_saeonly_scalar_name>):
11438 * config/i386/subst.md (round_saeonly_mode512bit_condition):
11439 Adjust for HF vector modes.
11441 2021-09-10 Liu, Hongtao <hongtao.liu@intel.com>
11443 * config/i386/avx512fp16intrin.h (_mm_add_sh): New intrinsic.
11444 (_mm_mask_add_sh): Likewise.
11445 (_mm_maskz_add_sh): Likewise.
11446 (_mm_sub_sh): Likewise.
11447 (_mm_mask_sub_sh): Likewise.
11448 (_mm_maskz_sub_sh): Likewise.
11449 (_mm_mul_sh): Likewise.
11450 (_mm_mask_mul_sh): Likewise.
11451 (_mm_maskz_mul_sh): Likewise.
11452 (_mm_div_sh): Likewise.
11453 (_mm_mask_div_sh): Likewise.
11454 (_mm_maskz_div_sh): Likewise.
11455 (_mm_add_round_sh): Likewise.
11456 (_mm_mask_add_round_sh): Likewise.
11457 (_mm_maskz_add_round_sh): Likewise.
11458 (_mm_sub_round_sh): Likewise.
11459 (_mm_mask_sub_round_sh): Likewise.
11460 (_mm_maskz_sub_round_sh): Likewise.
11461 (_mm_mul_round_sh): Likewise.
11462 (_mm_mask_mul_round_sh): Likewise.
11463 (_mm_maskz_mul_round_sh): Likewise.
11464 (_mm_div_round_sh): Likewise.
11465 (_mm_mask_div_round_sh): Likewise.
11466 (_mm_maskz_div_round_sh): Likewise.
11467 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11468 * config/i386/i386-builtin.def: Add corresponding new builtins.
11469 * config/i386/i386-expand.c
11470 (ix86_expand_round_builtin): Handle new builtins.
11471 * config/i386/sse.md (VF_128): Change description.
11472 (<sse>_vm<plusminus_insn><mode>3<mask_scalar_name><round_scalar_name>):
11473 Adjust to support HF vector modes.
11474 (<sse>_vm<multdiv_mnemonic><mode>3<mask_scalar_name><round_scalar_name>):
11477 2021-09-10 H.J. Lu <hjl.tools@gmail.com>
11479 * config/i386/i386-expand.c
11480 (ix86_avx256_split_vector_move_misalign): Handle V16HF mode.
11481 * config/i386/i386.c
11482 (ix86_preferred_simd_mode): Handle HF mode.
11483 * config/i386/sse.md (V_256H): New mode iterator.
11484 (avx_vextractf128<mode>): Use it.
11485 (VEC_INIT_MODE): Align vector HFmode condition to vector
11486 HImodes since there're no real HF instruction used.
11487 (VEC_INIT_HALF_MODE): Ditto.
11489 (VIHF_AVX512BW): Ditto.
11490 (*vec_extracthf): Ditto.
11491 (VEC_EXTRACT_MODE): Ditto.
11493 2021-09-10 Richard Biener <rguenther@suse.de>
11496 * config/dbx.h: Remove.
11497 * config/dbxcoff.h: Do not define PREFERRED_DEBUGGING_TYPE.
11498 * config/lynx.h: Likewise.
11500 2021-09-10 liuhongt <hongtao.liu@intel.com>
11502 * config/i386/i386-expand.c (ix86_expand_copysign): Expand
11503 right into ANDNOT + AND + IOR, using paradoxical subregs.
11504 (ix86_split_copysign_const): Remove.
11505 (ix86_split_copysign_var): Ditto.
11506 * config/i386/i386-protos.h (ix86_split_copysign_const): Dotto.
11507 (ix86_split_copysign_var): Ditto.
11508 * config/i386/i386.md (@copysign<mode>3_const): Ditto.
11509 (@copysign<mode>3_var): Ditto.
11511 2021-09-09 qing zhao <qing.zhao@oracle.com>
11513 * builtins.c (expand_builtin_memset): Make external visible.
11514 * builtins.h (expand_builtin_memset): Declare extern.
11515 * common.opt (ftrivial-auto-var-init=): New option.
11516 * doc/extend.texi: Document the uninitialized attribute.
11517 * doc/invoke.texi: Document -ftrivial-auto-var-init.
11518 * flag-types.h (enum auto_init_type): New enumerated type
11520 * gimple-fold.c (clear_padding_type): Add one new parameter.
11521 (clear_padding_union): Likewise.
11522 (clear_padding_emit_loop): Likewise.
11523 (clear_type_padding_in_mask): Likewise.
11524 (gimple_fold_builtin_clear_padding): Handle this new parameter.
11525 * gimplify.c (gimple_add_init_for_auto_var): New function.
11526 (gimple_add_padding_init_for_auto_var): New function.
11527 (is_var_need_auto_init): New function.
11528 (gimplify_decl_expr): Add initialization to automatic variables per
11530 (gimplify_call_expr): Add one new parameter for call to
11531 __builtin_clear_padding.
11532 (gimplify_init_constructor): Add padding initialization in the end.
11533 * internal-fn.c (INIT_PATTERN_VALUE): New macro.
11534 (expand_DEFERRED_INIT): New function.
11535 * internal-fn.def (DEFERRED_INIT): New internal function.
11536 * tree-cfg.c (verify_gimple_call): Verify calls to .DEFERRED_INIT.
11537 * tree-sra.c (generate_subtree_deferred_init): New function.
11538 (scan_function): Avoid setting cannot_scalarize_away_bitmap for
11539 calls to .DEFERRED_INIT.
11540 (sra_modify_deferred_init): New function.
11541 (sra_modify_function_body): Handle calls to DEFERRED_INIT specially.
11542 * tree-ssa-structalias.c (find_func_aliases_for_call): Likewise.
11543 * tree-ssa-uninit.c (warn_uninit): Handle calls to DEFERRED_INIT
11545 (check_defs): Likewise.
11546 (warn_uninitialized_vars): Likewise.
11547 * tree-ssa.c (ssa_undefined_value_p): Likewise.
11548 * tree.c (build_common_builtin_nodes): Build tree node for
11549 BUILT_IN_CLEAR_PADDING when needed.
11551 2021-09-09 Richard Biener <rguenther@suse.de>
11553 * tree-ssa-loop-im.c (fill_always_executed_in_1): Walk
11556 2021-09-09 Richard Biener <rguenther@suse.de>
11558 * tree-ssa-loop-im.c (fill_always_executed_in_1): Integrate
11559 DOM walk from get_loop_body_in_dom_order using a worklist
11562 2021-09-09 liuhongt <hongtao.liu@intel.com>
11564 * config.gcc: Add avx512fp16vlintrin.h.
11565 * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic.
11566 (_mm512_mask_add_ph): Likewise.
11567 (_mm512_maskz_add_ph): Likewise.
11568 (_mm512_sub_ph): Likewise.
11569 (_mm512_mask_sub_ph): Likewise.
11570 (_mm512_maskz_sub_ph): Likewise.
11571 (_mm512_mul_ph): Likewise.
11572 (_mm512_mask_mul_ph): Likewise.
11573 (_mm512_maskz_mul_ph): Likewise.
11574 (_mm512_div_ph): Likewise.
11575 (_mm512_mask_div_ph): Likewise.
11576 (_mm512_maskz_div_ph): Likewise.
11577 (_mm512_add_round_ph): Likewise.
11578 (_mm512_mask_add_round_ph): Likewise.
11579 (_mm512_maskz_add_round_ph): Likewise.
11580 (_mm512_sub_round_ph): Likewise.
11581 (_mm512_mask_sub_round_ph): Likewise.
11582 (_mm512_maskz_sub_round_ph): Likewise.
11583 (_mm512_mul_round_ph): Likewise.
11584 (_mm512_mask_mul_round_ph): Likewise.
11585 (_mm512_maskz_mul_round_ph): Likewise.
11586 (_mm512_div_round_ph): Likewise.
11587 (_mm512_mask_div_round_ph): Likewise.
11588 (_mm512_maskz_div_round_ph): Likewise.
11589 * config/i386/avx512fp16vlintrin.h: New header.
11590 * config/i386/i386-builtin-types.def (V16HF, V8HF, V32HF):
11591 Add new builtin types.
11592 * config/i386/i386-builtin.def: Add corresponding builtins.
11593 * config/i386/i386-expand.c
11594 (ix86_expand_args_builtin): Handle new builtin types.
11595 (ix86_expand_round_builtin): Likewise.
11596 * config/i386/immintrin.h: Include avx512fp16vlintrin.h
11597 * config/i386/sse.md (VFH): New mode_iterator.
11599 (avx512fmaskmode): Add HF vector modes.
11600 (avx512fmaskhalfmode): Likewise.
11601 (<plusminus_insn><mode>3<mask_name><round_name>): Adjust to for
11603 (*<plusminus_insn><mode>3<mask_name><round_name>): Likewise.
11604 (mul<mode>3<mask_name><round_name>): Likewise.
11605 (*mul<mode>3<mask_name><round_name>): Likewise.
11606 (div<mode>3): Likewise.
11607 (<sse>_div<mode>3<mask_name><round_name>): Likewise.
11608 * config/i386/subst.md (SUBST_V): Add HF vector modes.
11609 (SUBST_A): Likewise.
11610 (round_mode512bit_condition): Adjust for V32HFmode.
11612 2021-09-09 liuhongt <hongtao.liu@intel.com>
11615 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
11616 (reduc_plus_scal_v4sf): .. this, New define_expand.
11617 (reduc_plus_scal_v2df): .. and this, New define_expand.
11619 2021-09-09 liuhongt <hongtao.liu@intel.com>
11622 * config/i386/sse.md (*vec_extract<mode><ssescalarmodelower>_valign):
11625 2021-09-08 Jonathan Wakely <jwakely@redhat.com>
11628 * doc/trouble.texi (Copy Assignment): Fix description of
11629 behaviour and fix code in example.
11631 2021-09-08 Segher Boessenkool <segher@kernel.crashing.org>
11634 * config/rs6000/rs6000-logue.c (rs6000_emit_epilogue): For ELFv2 use
11635 r11 instead of r12 for restoring CR.
11637 2021-09-08 Jakub Jelinek <jakub@redhat.com>
11638 liuhongt <hongtao.liu@intel.com>
11641 * config/i386/i386.md (@xorsign<mode>3_1): Remove.
11642 * config/i386/i386-expand.c (ix86_expand_xorsign): Expand right away
11643 into AND with mask and XOR, using paradoxical subregs.
11644 (ix86_split_xorsign): Remove.
11645 * config/i386/i386-protos.h (ix86_split_xorsign): Remove.
11647 2021-09-08 Di Zhao <dizhao@os.amperecomputing.com>
11649 * tree-ssa-sccvn.c (vn_nary_op_insert_into): fix result compare
11651 2021-09-08 Jakub Jelinek <jakub@redhat.com>
11654 * config/i386/i386.md (xorsign<mode>3): If operands[1] is equal to
11655 operands[2], emit abs<mode>2 instead.
11656 (@xorsign<mode>3_1): Add early-clobbers for output operand, enable
11657 first alternative even for avx, add another alternative with
11658 =&Yv <- 0, Yv, Yvm constraints.
11659 * config/i386/i386-expand.c (ix86_split_xorsign): If op0 is equal
11660 to op1, emit vpandn instead.
11662 2021-09-08 liuhongt <hongtao.liu@intel.com>
11664 * config/i386/avx512fp16intrin.h (_mm_set_ph): New intrinsic.
11665 (_mm256_set_ph): Likewise.
11666 (_mm512_set_ph): Likewise.
11667 (_mm_setr_ph): Likewise.
11668 (_mm256_setr_ph): Likewise.
11669 (_mm512_setr_ph): Likewise.
11670 (_mm_set1_ph): Likewise.
11671 (_mm256_set1_ph): Likewise.
11672 (_mm512_set1_ph): Likewise.
11673 (_mm_setzero_ph): Likewise.
11674 (_mm256_setzero_ph): Likewise.
11675 (_mm512_setzero_ph): Likewise.
11676 (_mm_set_sh): Likewise.
11677 (_mm_load_sh): Likewise.
11678 (_mm_store_sh): Likewise.
11679 * config/i386/i386-builtin-types.def (V8HF): New type.
11680 (DEF_FUNCTION_TYPE (V8HF, V8HI)): New builtin function type
11681 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
11682 Support vector HFmodes.
11683 (ix86_expand_vector_init_one_nonzero): Likewise.
11684 (ix86_expand_vector_init_one_var): Likewise.
11685 (ix86_expand_vector_init_interleave): Likewise.
11686 (ix86_expand_vector_init_general): Likewise.
11687 (ix86_expand_vector_set): Likewise.
11688 (ix86_expand_vector_extract): Likewise.
11689 (ix86_expand_vector_init_concat): Likewise.
11690 (ix86_expand_sse_movcc): Handle vector HFmodes.
11691 (ix86_expand_vector_set_var): Ditto.
11692 * config/i386/i386-modes.def: Add HF vector modes in comment.
11693 * config/i386/i386.c (classify_argument): Add HF vector modes.
11694 (ix86_hard_regno_mode_ok): Allow HF vector modes for AVX512FP16.
11695 (ix86_vector_mode_supported_p): Likewise.
11696 (ix86_set_reg_reg_cost): Handle vector HFmode.
11697 (ix86_get_ssemov): Handle vector HFmode.
11698 (function_arg_advance_64): Pass unamed V16HFmode and V32HFmode
11700 (function_arg_advance_32): Pass V8HF/V16HF/V32HF by sse reg for 32bit
11702 (function_arg_advance_32): Ditto.
11703 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): New.
11704 (VALID_AVX256_REG_OR_OI_MODE): Rename to ..
11705 (VALID_AVX256_REG_OR_OI_VHF_MODE): .. this, and add V16HF.
11706 (VALID_SSE2_REG_VHF_MODE): New.
11707 (VALID_AVX512VL_128_REG_MODE): Add V8HF and TImode.
11708 (SSE_REG_MODE_P): Add vector HFmode.
11709 * config/i386/i386.md (mode): Add HF vector modes.
11710 (MODE_SIZE): Likewise.
11711 (ssemodesuffix): Add ph suffix for HF vector modes.
11712 * config/i386/sse.md (VFH_128): New mode iterator.
11713 (VMOVE): Adjust for HF vector modes.
11715 (V_256_512): Likewise.
11716 (avx512): Likewise.
11717 (avx512fmaskmode): Likewise.
11718 (shuffletype): Likewise.
11719 (sseinsnmode): Likewise.
11720 (ssedoublevecmode): Likewise.
11721 (ssehalfvecmode): Likewise.
11722 (ssehalfvecmodelower): Likewise.
11723 (ssePScmode): Likewise.
11724 (ssescalarmode): Likewise.
11725 (ssescalarmodelower): Likewise.
11726 (sseintprefix): Likewise.
11728 (bcstscalarsuff): Likewise.
11729 (xtg_mode): Likewise.
11730 (VI12HF_AVX512VL): New mode_iterator.
11731 (VF_AVX512FP16): Likewise.
11733 (VIHF_256): Likewise.
11734 (VIHF_AVX512BW): Likewise.
11735 (V16_256): Likewise.
11736 (V32_512): Likewise.
11737 (sseintmodesuffix): New mode_attr.
11738 (sse): Add scalar and vector HFmodes.
11739 (ssescalarmode): Add vector HFmode mapping.
11740 (ssescalarmodesuffix): Add sh suffix for HFmode.
11741 (*<sse>_vm<insn><mode>3): Use VFH_128.
11742 (*<sse>_vm<multdiv_mnemonic><mode>3): Likewise.
11743 (*ieee_<ieee_maxmin><mode>3): Likewise.
11744 (<avx512>_blendm<mode>): New define_insn.
11745 (vec_setv8hf): New define_expand.
11746 (vec_set<mode>_0): New define_insn for HF vector set.
11747 (*avx512fp16_movsh): Likewise.
11748 (avx512fp16_movsh): Likewise.
11749 (vec_extract_lo_v32hi): Rename to ...
11750 (vec_extract_lo_<mode>): ... this, and adjust to allow HF
11752 (vec_extract_hi_v32hi): Likewise.
11753 (vec_extract_hi_<mode>): Likewise.
11754 (vec_extract_lo_v16hi): Likewise.
11755 (vec_extract_lo_<mode>): Likewise.
11756 (vec_extract_hi_v16hi): Likewise.
11757 (vec_extract_hi_<mode>): Likewise.
11758 (vec_set_hi_v16hi): Likewise.
11759 (vec_set_hi_<mode>): Likewise.
11760 (vec_set_lo_v16hi): Likewise.
11761 (vec_set_lo_<mode>): Likewise.
11762 (*vec_extract<mode>_0): New define_insn_and_split for HF
11764 (*vec_extracthf): New define_insn.
11765 (VEC_EXTRACT_MODE): Add HF vector modes.
11766 (PINSR_MODE): Add V8HF.
11767 (sse2p4_1): Likewise.
11768 (pinsr_evex_isa): Likewise.
11769 (<sse2p4_1>_pinsr<ssemodesuffix>): Adjust to support
11770 insert for V8HFmode.
11771 (pbroadcast_evex_isa): Add HF vector modes.
11772 (AVX2_VEC_DUP_MODE): Likewise.
11773 (VEC_INIT_MODE): Likewise.
11774 (VEC_INIT_HALF_MODE): Likewise.
11775 (avx2_pbroadcast<mode>): Adjust to support HF vector mode
11777 (avx2_pbroadcast<mode>_1): Likewise.
11778 (<avx512>_vec_dup<mode>_1): Likewise.
11779 (<avx512>_vec_dup<mode><mask_name>): Likewise.
11780 (<mask_codefor><avx512>_vec_dup_gpr<mode><mask_name>):
11783 2021-09-08 Guo, Xuepeng <xuepeng.guo@intel.com>
11784 H.J. Lu <hongjiu.lu@intel.com>
11785 Liu Hongtao <hongtao.liu@intel.com>
11786 Wang Hongyu <hongyu.wang@intel.com>
11787 Xu Dianhong <dianhong.xu@intel.com>
11789 * common/config/i386/cpuinfo.h (get_available_features):
11790 Detect FEATURE_AVX512FP16.
11791 * common/config/i386/i386-common.c
11792 (OPTION_MASK_ISA_AVX512FP16_SET,
11793 OPTION_MASK_ISA_AVX512FP16_UNSET,
11794 OPTION_MASK_ISA2_AVX512FP16_SET,
11795 OPTION_MASK_ISA2_AVX512FP16_UNSET): New.
11796 (OPTION_MASK_ISA2_AVX512BW_UNSET,
11797 OPTION_MASK_ISA2_AVX512BF16_UNSET): Add AVX512FP16.
11798 (ix86_handle_option): Handle -mavx512fp16.
11799 * common/config/i386/i386-cpuinfo.h (enum processor_features):
11800 Add FEATURE_AVX512FP16.
11801 * common/config/i386/i386-isas.h: Add entry for AVX512FP16.
11802 * config.gcc: Add avx512fp16intrin.h.
11803 * config/i386/avx512fp16intrin.h: New intrinsic header.
11804 * config/i386/cpuid.h: Add bit_AVX512FP16.
11805 * config/i386/i386-builtin-types.def: (FLOAT16): New primitive type.
11806 * config/i386/i386-builtins.c: Support _Float16 type for i386
11808 (ix86_register_float16_builtin_type): New function.
11809 (ix86_float16_type_node): New.
11810 * config/i386/i386-c.c (ix86_target_macros_internal): Define
11812 * config/i386/i386-expand.c (ix86_expand_branch): Support
11814 (ix86_prepare_fp_compare_args): Adjust TARGET_SSE_MATH &&
11815 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
11816 (ix86_expand_fp_movcc): Ditto.
11817 * config/i386/i386-isa.def: Add PTA define for AVX512FP16.
11818 * config/i386/i386-options.c (isa2_opts): Add -mavx512fp16.
11819 (ix86_valid_target_attribute_inner_p): Add avx512fp16 attribute.
11820 * config/i386/i386.c (ix86_get_ssemov): Use
11821 vmovdqu16/vmovw/vmovsh for HFmode/HImode scalar or vector.
11822 (ix86_get_excess_precision): Use
11823 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when TARGET_AVX512FP16
11825 (sse_store_index): Use SFmode cost for HFmode cost.
11826 (inline_memory_move_cost): Add HFmode, and perfer SSE cost over
11827 GPR cost for HFmode.
11828 (ix86_hard_regno_mode_ok): Allow HImode in sse register.
11829 (ix86_mangle_type): Add manlging for _Float16 type.
11830 (inline_secondary_memory_needed): No memory is needed for
11831 16bit movement between gpr and sse reg under
11833 (ix86_multiplication_cost): Adjust TARGET_SSE_MATH &&
11834 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
11835 (ix86_division_cost): Ditto.
11836 (ix86_rtx_costs): Ditto.
11837 (ix86_add_stmt_cost): Ditto.
11838 (ix86_optab_supported_p): Ditto.
11839 * config/i386/i386.h (VALID_AVX512F_SCALAR_MODE): Add HFmode.
11840 (SSE_FLOAT_MODE_SSEMATH_OR_HF_P): Add HFmode.
11841 (PTA_SAPPHIRERAPIDS): Add PTA_AVX512FP16.
11842 * config/i386/i386.md (mode): Add HFmode.
11843 (MODE_SIZE): Add HFmode.
11844 (isa): Add avx512fp16.
11845 (enabled): Handle avx512fp16.
11846 (ssemodesuffix): Add sh suffix for HFmode.
11847 (comm): Add mult, div.
11848 (plusminusmultdiv): New code iterator.
11849 (insn): Add mult, div.
11850 (*movhf_internal): Adjust for avx512fp16 instruction.
11851 (*movhi_internal): Ditto.
11852 (*cmpi<unord>hf): New define_insn for HFmode.
11853 (*ieee_s<ieee_maxmin>hf3): Likewise.
11854 (extendhf<mode>2): Likewise.
11855 (trunc<mode>hf2): Likewise.
11856 (float<floatunssuffix><mode>hf2): Likewise.
11857 (*<insn>hf): Likewise.
11858 (cbranchhf4): New expander.
11859 (movhfcc): Likewise.
11860 (<insn>hf3): Likewise.
11861 (mulhf3): Likewise.
11862 (divhf3): Likewise.
11863 * config/i386/i386.opt: Add mavx512fp16.
11864 * config/i386/immintrin.h: Include avx512fp16intrin.h.
11865 * doc/invoke.texi: Add mavx512fp16.
11866 * doc/extend.texi: Add avx512fp16 Usage Notes.
11868 2021-09-08 liuhongt <hongtao.liu@intel.com>
11870 * common.opt: Support -fexcess-precision=16.
11871 * config/aarch64/aarch64.c (aarch64_excess_precision): Return
11872 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when
11873 EXCESS_PRECISION_TYPE_FLOAT16.
11874 * config/arm/arm.c (arm_excess_precision): Ditto.
11875 * config/i386/i386.c (ix86_get_excess_precision): Ditto.
11876 * config/m68k/m68k.c (m68k_excess_precision): Issue an error
11877 when EXCESS_PRECISION_TYPE_FLOAT16.
11878 * config/s390/s390.c (s390_excess_precision): Ditto.
11879 * coretypes.h (enum excess_precision_type): Add
11880 EXCESS_PRECISION_TYPE_FLOAT16.
11881 * doc/tm.texi (TARGET_C_EXCESS_PRECISION): Update documents.
11882 * doc/tm.texi.in (TARGET_C_EXCESS_PRECISION): Ditto.
11883 * doc/extend.texi (Half-Precision): Document
11884 -fexcess-precision=16.
11885 * flag-types.h (enum excess_precision): Add
11886 EXCESS_PRECISION_FLOAT16.
11887 * target.def (excess_precision): Update document.
11888 * tree.c (excess_precision_type): Set excess_precision_type to
11889 EXCESS_PRECISION_FLOAT16 when -fexcess-precision=16.
11891 2021-09-08 liuhongt <hongtao.liu@intel.com>
11893 * doc/extend.texi: (@node Floating Types): Adjust the wording.
11894 (@node Half-Precision): Ditto.
11896 2021-09-07 Takayuki 'January June' Suwa <jjsuwa_sys3175@yahoo.co.jp>
11899 * config/xtensa/xtensa.c (xtensa_emit_move_sequence): Add
11900 'CONST_INT_P (src)' to the condition of the block that tries to
11901 eliminate literal when loading integer contant.
11903 2021-09-07 David Faust <david.faust@oracle.com>
11905 * doc/extend.texi (BPF Type Attributes) New node.
11906 Document new preserve_access_index attribute.
11907 Document new preserve_access_index builtin.
11908 * doc/invoke.texi: Document -mco-re and -mno-co-re options.
11910 2021-09-07 David Faust <david.faust@oracle.com>
11912 * config/bpf/bpf.c: Adjust includes.
11913 (bpf_handle_preserve_access_index_attribute): New function.
11914 (bpf_attribute_table): Use it here.
11915 (bpf_builtins): Add BPF_BUILTIN_PRESERVE_ACCESS_INDEX.
11916 (bpf_option_override): Handle "-mco-re" option.
11917 (bpf_asm_init_sections): New.
11918 (TARGET_ASM_INIT_SECTIONS): Redefine.
11919 (bpf_file_end): New.
11920 (TARGET_ASM_FILE_END): Redefine.
11921 (bpf_init_builtins): Add "__builtin_preserve_access_index".
11922 (bpf_core_compute, bpf_core_get_index): New.
11923 (is_attr_preserve_access): New.
11924 (bpf_expand_builtin): Handle new builtins.
11925 (bpf_core_newdecl, bpf_core_is_maybe_aggregate_access): New.
11926 (bpf_core_walk): New.
11927 (bpf_resolve_overloaded_builtin): New.
11928 (TARGET_RESOLVE_OVERLOADED_BUILTIN): Redefine.
11929 (handle_attr): New.
11930 (pass_bpf_core_attr): New RTL pass.
11931 * config/bpf/bpf-passes.def: New file.
11932 * config/bpf/bpf-protos.h (make_pass_bpf_core_attr): New.
11933 * config/bpf/coreout.c: New file.
11934 * config/bpf/coreout.h: Likewise.
11935 * config/bpf/t-bpf (TM_H): Add $(srcdir)/config/bpf/coreout.h.
11936 (coreout.o): New rule.
11937 (PASSES_EXTRA): Add $(srcdir)/config/bpf/bpf-passes.def.
11938 * config.gcc (bpf): Add coreout.h to extra_headers.
11939 Add coreout.o to extra_objs.
11940 Add $(srcdir)/config/bpf/coreout.c to target_gtfiles.
11942 2021-09-07 David Faust <david.faust@oracle.com>
11944 * btfout.c (get_btf_id): Function is no longer static.
11945 * ctfc.h: Expose it here.
11947 2021-09-07 David Faust <david.faust@oracle.com>
11949 * ctfc.c (ctf_lookup_tree_type): New function.
11950 * ctfc.h: Likewise.
11952 2021-09-07 David Faust <david.faust@oracle.com>
11954 * ctfc.c (ctf_dtd_lookup): Function is no longer static.
11955 * ctfc.h: Analogous change.
11957 2021-09-07 David Faust <david.faust@oracle.com>
11959 * dwarf2out.c (lookup_type_die): Function is no longer static.
11960 * dwarf2out.h: Expose it here.
11962 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
11964 * dwarf2ctf.c (ctf_debug_finalize): Make it static.
11965 (ctf_debug_early_finish): New definition.
11966 (ctf_debug_finish): Likewise.
11967 * dwarf2ctf.h (ctf_debug_finalize): Remove declaration.
11968 (ctf_debug_early_finish): New declaration.
11969 (ctf_debug_finish): Likewise.
11970 * dwarf2out.c (dwarf2out_finish): Invoke ctf_debug_finish.
11971 (dwarf2out_early_finish): Invoke ctf_debug_early_finish.
11973 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
11975 * config/bpf/bpf.c (bpf_option_override): For BPF backend, disable LTO
11976 support when compiling for CO-RE.
11977 * config/bpf/bpf.opt: Add new command line option -mco-re.
11979 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
11981 * flag-types.h (enum debug_info_type): Add new enum
11982 DINFO_TYPE_BTF_WITH_CORE.
11983 (BTF_WITH_CORE_DEBUG): New bitmask.
11984 * flags.h (btf_with_core_debuginfo_p): New declaration.
11985 * opts.c (btf_with_core_debuginfo_p): New definition.
11987 2021-09-07 Jason Merrill <jason@redhat.com>
11989 * tree.h (error_operand_p): Change to inline function.
11991 2021-09-07 Aldy Hernandez <aldyh@redhat.com>
11993 * tree-ssa-threadedge.c (forwarder_block_p): Rename to...
11994 (empty_block_with_phis_p): ...this.
11995 (potentially_threadable_block): Same.
11996 (jump_threader::thread_through_normal_block): Same.
11998 2021-09-07 Eric Botcazou <ebotcazou@adacore.com>
12001 * dwarf2out.c (mark_base_types): New overloaded function.
12002 (dwarf2out_early_finish): Invoke it on the COMDAT type list as well
12003 as the compilation unit, and call move_marked_base_types afterward.
12005 2021-09-07 H.J. Lu <hjl.tools@gmail.com>
12008 * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse):
12010 (ix86_expand_vector_convert_uns_vsivsf): Likewise.
12012 2021-09-07 Richard Biener <rguenther@suse.de>
12014 PR tree-optimization/102226
12015 * tree-vect-loop.c (vect_transform_cycle_phi): Record
12016 the converted value for the epilogue PHI use.
12018 2021-09-07 Martin Liska <mliska@suse.cz>
12020 PR gcov-profile/80223
12021 * ipa-inline.c (can_inline_edge_p): Similarly to sanitizer
12022 options, do not inline when no_profile_instrument_function
12023 attributes are different in early inliner. It's fine to inline
12024 it after PGO instrumentation.
12026 2021-09-07 Richard Biener <rguenther@suse.de>
12028 PR tree-optimization/101555
12029 * tree-ssa-pre.c (translate_vuse_through_block): Do not
12030 perform an alias walk to determine the validity of the
12031 mem at the start of the block which is already guaranteed
12032 by means of prune_clobbered_mems.
12033 (phi_translate_1): Pass edge to translate_vuse_through_block.
12035 2021-09-07 Xionghu Luo <luoxhu@linux.ibm.com>
12038 * config/rs6000/rs6000.md (fmod<mode>3): New define_expand.
12039 (remainder<mode>3): Likewise.
12041 2021-09-07 YunQiang Su <yunqiang.su@cipunited.com>
12043 * config/mips/mips.c (mips_file_start): add .module for
12046 2021-09-06 Roger Sayle <roger@nextmovesoftware.com>
12048 * wide-int.cc (wi::clz): Reorder tests to ensure the result
12049 is zero for all negative values.
12051 2021-09-06 Tobias Burnus <tobias@codesourcery.com>
12053 * doc/invoke.texi (-foffload-options): Fix @opindex.
12055 2021-09-06 H.J. Lu <hjl.tools@gmail.com>
12058 * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2].
12059 * config/i386/i386.md (@xorsign<mode>3_1): Add non-destructive
12060 source alternative for AVX.
12062 2021-09-06 liuhongt <hongtao.liu@intel.com>
12064 PR middle-end/102182
12065 * optabs.c (expand_fix): Add from1 to avoid from being
12068 2021-09-06 Eric Botcazou <ebotcazou@adacore.com>
12070 * dwarf2out.c (modified_type_die): Deal with all array types earlier
12071 and use local variable consistently throughout the function.
12073 2021-09-06 Jakub Jelinek <jakub@redhat.com>
12075 PR tree-optimization/102207
12076 * match.pd: Don't demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
12077 were promoted from signed to wider unsigned type.
12079 2021-09-06 Andrew Pinski <apinski@marvell.com>
12081 PR tree-optimization/63184
12082 * match.pd: Add simplification of pointer_diff of two pointer_plus
12083 with addr_expr in the first operand of each pointer_plus.
12084 Add simplificatoin of ne/eq of two pointer_plus with addr_expr
12085 in the first operand of each pointer_plus.
12087 2021-09-06 Richard Biener <rguenther@suse.de>
12089 PR tree-optimization/102176
12090 * tree-vect-slp.c (vect_slp_gather_vectorized_scalar_stmts):
12092 (vect_bb_slp_scalar_cost): Use the computed set of
12093 vectorized scalar stmts instead of relying on the out-of-date
12094 and not accurate PURE_SLP_STMT.
12095 (vect_bb_vectorization_profitable_p): Compute the set
12096 of vectorized scalar stmts.
12098 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12100 * gimple-range-path.cc (path_range_query::range_of_stmt): Remove
12101 GIMPLE_COND special casing.
12102 (path_range_query::range_defined_in_block): Use range_of_stmt
12103 instead of calling fold_range directly.
12105 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12107 * gimple-range-path.cc (path_range_query::range_of_expr): Set
12108 m_undefined_path when appropriate.
12109 (path_range_query::internal_range_of_expr): Copy from range_of_expr.
12110 (path_range_query::unreachable_path_p): New.
12111 (path_range_query::precompute_ranges): Set m_undefined_path.
12112 * gimple-range-path.h (path_range_query::unreachable_path_p): New.
12113 (path_range_query::internal_range_of_expr): New.
12114 * tree-ssa-threadbackward.c (back_threader::find_taken_edge_cond):
12115 Use unreachable_path_p.
12117 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12119 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
12120 Remove argument and call find_taken_edge.
12121 (back_threader::resolve_phi): Do not calculate taken edge before
12122 calling maybe_register_path.
12123 (back_threader::find_paths_to_names): Same.
12125 2021-09-05 Jeff Law <jlaw@localhost.localdomain>
12127 * config/h8300/h8300.md (QHSI2 mode iterator): New mode iterator.
12128 * config/h8300/testcompare.md (store_c): Update name, use new
12130 (store_neg_c, store_shifted_c): New patterns.
12132 2021-09-03 Segher Boessenkool <segher@kernel.crashing.org>
12135 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): On ELFv2 use r11
12136 instead of r12 for CR save, in all cases.
12138 2021-09-03 Andrew Pinski <apinski@marvell.com>
12140 * config/aarch64/aarch64-sve-builtins.cc (register_vector_type):
12141 Handle error_mark_node as the type of the type_decl.
12143 2021-09-03 Andrew Pinski <apinski@marvell.com>
12145 * config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
12147 (aarch64_simd_types): Likewise.
12148 (aarch64_simd_intOI_type_node): Likewise.
12149 (aarch64_simd_intCI_type_node): Likewise.
12150 (aarch64_simd_intXI_type_node): Likewise.
12151 * config/aarch64/aarch64.h (aarch64_fp16_type_node): Likewise.
12152 (aarch64_fp16_ptr_type_node): Likewise.
12153 (aarch64_bf16_type_node): Likewise.
12154 (aarch64_bf16_ptr_type_node): Likewise.
12156 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12158 * range-op.cc (operator_minus::op1_op2_relation_effect): Abstract
12160 (minus_op1_op2_relation_effect): ...here.
12161 (class operator_pointer_diff): New.
12162 (operator_pointer_diff::op1_op2_relation_effect): Call
12163 minus_op1_op2_relation_effect.
12164 (integral_table::integral_table): Add entry for POINTER_DIFF_EXPR.
12166 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12168 * tree-ssa-threadbackward.c (back_threader::thread_through_all_blocks):
12169 Add may_peel_loop_headers.
12170 (back_threader_registry::thread_through_all_blocks): Same.
12171 (try_thread_blocks): Pass may_peel_loop_headers argument.
12172 (pass_early_thread_jumps::execute): Same.
12174 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12176 * tree-ssa-threadedge.c (has_phis_p): New.
12177 (forwarder_block_p): New.
12178 (potentially_threadable_block): Call forwarder_block_p.
12179 (jump_threader::thread_around_empty_blocks): Call has_phis_p.
12180 (jump_threader::thread_through_normal_block): Call
12183 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12185 * tree-ssa-threadbackward.c (back_threader::dump): New.
12186 (back_threader::debug): New.
12187 (back_threader_profitability::profitable_path_p): Dump blocks
12188 even if we are bailing early.
12190 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12192 * tree-ssa-threadupdate.c (cancel_thread): New.
12193 (jump_thread_path_registry::thread_block_1): Use cancel_thread.
12194 (jump_thread_path_registry::mark_threaded_blocks): Same.
12195 (jump_thread_path_registry::register_jump_thread): Same.
12197 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12199 * tree-ssa-threadedge.c (jt_state::push): Only call methods for
12200 which objects are available.
12201 (jt_state::pop): Same.
12202 (jt_state::register_equiv): Same.
12203 (jt_state::register_equivs_on_edge): Same.
12205 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12207 * tree-ssa-threadedge.c (jump_threader::thread_across_edge):
12208 Move pop until after a thread is registered.
12210 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12212 * tree-ssa-threadupdate.c (debug): New.
12214 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12216 * gimple-range-trace.cc (push_dump_file::push_dump_file): New.
12217 (push_dump_file::~push_dump_file): New.
12218 (dump_ranger): Change dump_file temporarily while dumping
12220 * gimple-range-trace.h (class push_dump_file): New.
12222 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12224 * gimple-range-trace.cc (debug_seed_ranger): Remove static.
12225 (dump_ranger): Dump function name.
12227 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12229 * gimple-range-path.cc (path_range_query::range_defined_in_block):
12230 Adjust for non-null.
12231 (path_range_query::adjust_for_non_null_uses): New.
12232 (path_range_query::precompute_ranges): Call
12233 adjust_for_non_null_uses.
12234 * gimple-range-path.h: Add m_non_null and
12235 adjust_for_non_null_uses.
12237 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12239 * gimple-range-path.cc (path_range_query::dump): Dump path
12241 (path_range_query::precompute_ranges): Dump entire path.
12243 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12245 * value-relation.cc (relation_oracle::debug): New.
12246 * value-relation.h (relation_oracle::debug): New.
12248 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12250 * tree-ssa-loop-ch.c: Remove unnecessary include file.
12252 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12254 * gimple-range-fold.cc (fold_using_range::postfold_gcond_edges):
12255 Skip statements with no defining BB.
12256 * gimple-range-path.cc (path_range_query::range_defined_in_block):
12257 Do not get confused by statements with no defining BB.
12259 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12261 * gimple-range-fold.cc (adjust_imagpart_expr): Move from
12262 gimple_range_adjustment. Add support for constants.
12263 (adjust_realpart_expr): New.
12264 (gimple_range_adjustment): Move IMAGPART_EXPR code to
12265 adjust_imagpart_expr.
12266 * range-op.cc (integral_table::integral_table): Add entry for
12269 2021-09-03 Jakub Jelinek <jakub@redhat.com>
12271 * omp-expand.c (expand_omp_atomic_pipeline): Use
12272 IFN_ATOMIC_COMPARE_EXCHANGE instead of
12273 BUILT_IN_SYNC_VAL_COMPARE_AND_SWAP_? so that memory order
12276 2021-09-03 Jakub Jelinek <jakub@redhat.com>
12279 * tree.h (DECL_FIELD_ABI_IGNORED): Changed into rvalue only macro
12280 that is false if DECL_BIT_FIELD.
12281 (SET_DECL_FIELD_ABI_IGNORED, DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD,
12282 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD): Define.
12283 * tree-streamer-out.c (pack_ts_decl_common_value_fields): For
12284 DECL_BIT_FIELD stream DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead
12285 of DECL_FIELD_ABI_IGNORED.
12286 * tree-streamer-in.c (unpack_ts_decl_common_value_fields): Use
12287 SET_DECL_FIELD_ABI_IGNORED instead of writing to
12288 DECL_FIELD_ABI_IGNORED and for DECL_BIT_FIELD use
12289 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead.
12290 * lto-streamer-out.c (hash_tree): For DECL_BIT_FIELD hash
12291 DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead of DECL_FIELD_ABI_IGNORED.
12293 2021-09-03 liuhongt <hongtao.liu@intel.com>
12296 * config/i386/amxbf16intrin.h : Remove macro check for __AMX_BF16__.
12297 * config/i386/amxint8intrin.h : Remove macro check for __AMX_INT8__.
12298 * config/i386/amxtileintrin.h : Remove macro check for __AMX_TILE__.
12300 2021-09-02 Martin Sebor <msebor@redhat.com>
12302 PR tree-optimization/17506
12304 * tree-ssa-uninit.c (warn_uninit): Remove conditional guarding note.
12306 2021-09-02 Richard Biener <rguenther@suse.de>
12308 * tree-ssa-loop-im.c (fill_always_executed_in_1): Refine
12309 fix for PR78185 and continue processing when leaving
12310 finite inner loops.
12312 2021-09-02 Jakub Jelinek <jakub@redhat.com>
12314 PR tree-optimization/99591
12315 * match.pd: Demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
12318 2021-09-02 Richard Biener <rguenther@suse.de>
12321 2021-09-02 Richard Biener <rguenther@suse.de>
12323 PR tree-optimization/102155
12324 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
12325 over a part of the RPO array and do not recurse here.
12326 Dump blocks marked as always executed.
12327 (fill_always_executed_in): Walk over the RPO array and
12328 process loops whose header we run into.
12329 (loop_invariant_motion_in_fun): Compute the first RPO
12330 using rev_post_order_and_mark_dfs_back_seme in iteration
12331 order and pass that to fill_always_executed_in.
12333 2021-09-02 liuhongt <hongtao.liu@intel.com>
12335 * config/i386/i386-modes.def (FLOAT_MODE): Define ieee HFmode.
12336 * config/i386/i386.c (enum x86_64_reg_class): Add
12337 X86_64_SSEHF_CLASS.
12338 (merge_classes): Handle X86_64_SSEHF_CLASS.
12339 (examine_argument): Ditto.
12340 (construct_container): Ditto.
12341 (classify_argument): Ditto, and set HFmode/HCmode to
12342 X86_64_SSEHF_CLASS.
12343 (function_value_32): Return _FLoat16/Complex Float16 by
12345 (function_value_64): Return _Float16/Complex Float16 by SSE
12347 (ix86_print_operand): Handle CONST_DOUBLE HFmode.
12348 (ix86_secondary_reload): Require gpr as intermediate register
12349 to store _Float16 from sse register when sse4 is not
12351 (ix86_libgcc_floating_mode_supported_p): Enable _FLoat16 under
12353 (ix86_scalar_mode_supported_p): Ditto.
12354 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Defined.
12355 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add HFmode.
12356 (VALID_INT_MODE_P): Add HFmode and HCmode.
12357 * config/i386/i386.md (*pushhf_rex64): New define_insn.
12359 (*movhf_internal): Ditto.
12360 * doc/extend.texi (Half-Precision Floating Point): Documemt
12363 2021-09-02 Richard Biener <rguenther@suse.de>
12365 PR tree-optimization/102155
12366 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
12367 over a part of the RPO array and do not recurse here.
12368 Dump blocks marked as always executed.
12369 (fill_always_executed_in): Walk over the RPO array and
12370 process loops whose header we run into.
12371 (loop_invariant_motion_in_fun): Compute the first RPO
12372 using rev_post_order_and_mark_dfs_back_seme in iteration
12373 order and pass that to fill_always_executed_in.
12375 2021-09-02 YunQiang Su <syq@debian.org>
12378 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
12380 * config/mips/mips.c (mips_module_isa_name): New.
12381 mips_file_start: add .module mipsREV to all asm output
12383 2021-09-01 Jeff Law <jlaw@localhost.localdomain>
12385 PR tree-optimization/102152
12386 * tree-ssa-dom.c (dom_opt_dom_walker::optimize_stmt): Reduce a vector
12387 comparison to a scalar comparison before calling
12388 update_stmt_if_modified.
12390 2021-09-01 Andrew Pinski <apinski@marvell.com>
12393 * config/aarch64/aarch64.c (aarch64_expand_setmem):
12394 Check STRICT_ALIGNMENT before creating an overlapping
12397 2021-09-01 Martin Sebor <msebor@redhat.com>
12399 * gimple-ssa-warn-access.cc (get_size_range): Add argument.
12400 (check_access): Pass additional argument.
12401 (check_memop_access): Remove template and make a member function.
12402 (maybe_check_dealloc_call): Make a pass_waccess member function.
12403 (class pass_waccess): Add, rename, and remove members.
12404 (pass_waccess::pass_waccess): Adjust to name change.
12405 (pass_waccess::~pass_waccess): Same.
12406 (check_alloca): Make a member function.
12407 (check_alloc_size_call): Same.
12408 (check_strcat): Same.
12409 (check_strncat): Same.
12410 (check_stxcpy): Same.
12411 (check_stxncpy): Same.
12412 (check_strncmp): Same.
12413 (maybe_warn_rdwr_sizes): Rename...
12414 (pass_waccess::maybe_check_access_sizes): ...to this.
12415 (pass_waccess::check_call): Adjust to name changes.
12416 (pass_waccess::maybe_check_dealloc_call): Make a pass_waccess member
12418 (pass_waccess::execute): Adjust to name changes.
12419 * gimple-ssa-warn-access.h (check_memop_access): Remove.
12420 * pointer-query.cc (access_ref::phi): Handle null pointer.
12421 (access_ref::inform_access): Same.
12422 (pointer_query::put_ref): Modify a cached value, not a copy of it.
12423 (pointer_query::dump): New function.
12424 (compute_objsize_r): Avoid overwriting access_ref::bndrng. Cache
12426 * pointer-query.h (pointer_query::dump): Declare.
12427 * tree-ssa-strlen.c (get_range): Simplify. Use function query.
12428 (dump_strlen_info): Use function query.
12429 (printf_strlen_execute): Factor code out into pointer_query::put_ref.
12431 2021-09-01 Thomas Schwinge <thomas@codesourcery.com>
12433 * tree.c (walk_tree_1) <OMP_CLAUSE>: Simplify.
12435 2021-09-01 Iain Sandoe <iain@sandoe.co.uk>
12437 * doc/extend.texi: Document unavailable attribute.
12438 * print-tree.c (print_node): Handle unavailable attribute.
12439 * tree-core.h (struct tree_base): Add a bit to carry unavailability.
12440 * tree.c (error_unavailable_use): New.
12441 * tree.h (TREE_UNAVAILABLE): New.
12442 (error_unavailable_use): New.
12444 2021-09-01 Jakub Jelinek <jakub@redhat.com>
12446 PR tree-optimization/102124
12447 * tree-vect-patterns.c (vect_recog_widen_op_pattern): For ORIG_CODE
12448 MINUS_EXPR, if itype is unsigned with smaller precision than type,
12449 add an extra cast to signed variant of itype to ensure sign-extension.
12451 2021-09-01 Martin Liska <mliska@suse.cz>
12453 * graph.c (draw_cfg_node_succ_edges): Do not color fallthru
12454 edges and rather use colors for TRUE and FALSE edges.
12456 2021-09-01 Richard Biener <rguenther@suse.de>
12458 PR tree-optimization/93491
12459 * tree-ssa-pre.c (compute_avail): Set BB_MAY_NOTRETURN
12460 after processing the stmt itself. Do not consider
12461 pure functions possibly not returning. Properly avoid
12462 adding possibly trapping calls to EXP_GEN when there's
12463 a preceeding possibly not returning call.
12464 * tree-ssa-sccvn.c (vn_reference_may_trap): Conservatively
12467 2021-09-01 Richard Biener <rguenther@suse.de>
12469 PR tree-optimization/102139
12470 * tree-vectorizer.h (vec_base_alignments): Adjust hash-map
12471 type to record a std::pair of the stmt-info and the innermost
12473 (dr_vec_info::group): New member.
12474 * tree-vect-data-refs.c (vect_record_base_alignment): Adjust.
12475 (vect_compute_data_ref_alignment): Verify the recorded
12476 base alignment can be used.
12477 (data_ref_pair): Remove.
12478 (dr_group_sort_cmp): Adjust.
12479 (vect_analyze_data_ref_accesses): Store the group-ID in the
12480 dr_vec_info and operate on a vector of dr_vec_infos.
12482 2021-09-01 YunQiang Su <yunqiang.su@cipunited.com>
12484 * read-md.c (md_reader::handle_enum): support value assignation.
12485 * doc/md.texi: record define_c_enum value assignation support.
12487 2021-09-01 Jakub Jelinek <jakub@redhat.com>
12489 PR tree-optimization/102141
12490 * gimple-ssa-store-merging.c (bswap_view_convert): Add BEFORE
12491 argument. If false, emit stmts after gsi instead of before, and
12493 (bswap_replace): Adjust callers. When converting output of bswap,
12494 emit VIEW_CONVERT prepratation stmts after a copy of gsi instead
12497 2021-09-01 liuhongt <hongtao.liu@intel.com>
12499 * emit-rtl.c (validate_subreg): Get rid of all float-int
12502 2021-09-01 liuhongt <hongtao.liu@intel.com>
12505 2021-08-30 liuhongt <hongtao.liu@intel.com>
12507 * expmed.c (extract_bit_field_1): Make sure we're playing with
12508 integral modes before call extract_integral_bit_field.
12509 (extract_integral_bit_field): Add a parameter of type
12510 scalar_int_mode which corresponds to of tmode.
12511 And call extract_and_convert_fixed_bit_field instead of
12512 extract_fixed_bit_field and convert_extracted_bit_field.
12513 (extract_and_convert_fixed_bit_field): New function, it's a
12514 combination of extract_fixed_bit_field and
12515 convert_extracted_bit_field.
12517 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
12519 * tree.c (walk_tree_1) <OMP_CLAUSE_TILE>: Handle three operands.
12521 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
12523 * omp-general.h (omp_is_reference): Rename to...
12524 (omp_privatize_by_reference): ... this. Adjust all users...
12525 * omp-general.c: ... here, ...
12526 * gimplify.c: ... here, ...
12527 * omp-expand.c: ... here, ...
12528 * omp-low.c: ... here.
12530 2021-08-31 Martin Sebor <msebor@redhat.com>
12532 * gimple-ssa-warn-access.cc (maybe_warn_alloc_args_overflow): Test
12533 pointer element for equality to zero, not that of the cotaining
12536 2021-08-31 Martin Sebor <msebor@redhat.com>
12538 * gcc-rich-location.h (gcc_rich_location): Make ctor explicit.
12540 2021-08-31 Martin Sebor <msebor@redhat.com>
12542 * function.h (function): Add comments.
12543 (get_range_query): Same. Add attribute returns nonnull.
12545 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
12547 * expr.c (convert_modes): Don't use subreg_promoted_mode on a
12548 SUBREG if it can't be guaranteed to a SUBREG_PROMOTED_VAR_P set.
12549 Instead use the standard (safer) is_a <scalar_int_mode> idiom.
12551 2021-08-31 Jeff Law <jlaw@localhost.localdomain>
12553 * config.gcc (cris-*-elf, cris-*-none): Remove dbxelf.h from
12555 (m32r-*-elf, m32rle-*-elf, m32r-*-linux): Likewise.
12556 (mn10300-*-*, am33_2.0-*-linux*): Likewise.
12557 (xtensa*-*-elf, xtensa*-*-linux, xtensa*-*-uclinux): Likewise.
12558 (m32c-*-elf*, m32c-*-rtems*): Likewise.
12559 * config/cris/cris.h (DBX_NO_XREFS): Remove.
12560 (DBX_CONTIN_LENGTH, DBX_CONTIN_CHAR): Likewise.
12561 * config/m32r/m32r.h (DBXOUT_SOURCE_LINE): Likewise.
12562 (DBX_DEBUGGING_INFO, DBX_CONTIN_LENGTH): Likewise.
12563 * config/mn10300/mn10300.h (DEFAULT_GDB_EXTENSIONS): Likewise.
12564 * config/mn10300/linux.h (DBX_REGISTER_NAMES): Likewise.
12566 2021-08-31 Marcel Vollweiler <marcel@codesourcery.com>
12568 * gimplify.c (gimplify_scan_omp_clauses): Error handling. 'ancestor' only
12569 allowed on target constructs and only with particular other clauses.
12570 * omp-expand.c (expand_omp_target): Output of 'sorry, not supported' if
12571 'ancestor' is used.
12572 * omp-low.c (check_omp_nesting_restrictions): Error handling. No nested OpenMP
12573 structs when 'ancestor' is used.
12574 (scan_omp_1_stmt): No usage of OpenMP runtime routines in a target region when
12575 'ancestor' is used.
12576 * tree-pretty-print.c (dump_omp_clause): Append 'ancestor'.
12577 * tree.h (OMP_CLAUSE_DEVICE_ANCESTOR): Define macro.
12579 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
12581 * expr.c (convert_modes): Preserve SUBREG_PROMOTED_VAR_P when
12582 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
12584 * simplify-rtx.c (simplify_unary_operation_1) [SIGN_EXTEND]:
12585 Likewise, preserve SUBREG_PROMOTED_VAR_P when creating a (wider)
12586 partial subreg from a SUBREG_PROMOTED_VAR_P subreg. Generate
12587 SIGN_EXTEND of the SUBREG_REG when a subreg would be paradoxical.
12588 [ZERO_EXTEND]: Likewise, preserve SUBREG_PROMOTED_VAR_P when
12589 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
12590 subreg. Generate ZERO_EXTEND of the SUBREG_REG when a subreg
12591 would be paradoxical.
12593 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
12595 * combine.c (combine_simplify_rtx): Avoid converting an explicit
12596 TRUNCATE into a lowpart SUBREG on !TRULY_NOOP_TRUNCATION targets.
12597 * simplify-rtx.c (simplify_unary_operation_1): Likewise.
12599 2021-08-31 Richard Biener <rguenther@suse.de>
12601 PR tree-optimization/102142
12602 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Fix
12603 condition under which to unset the visited flag.
12605 2021-08-31 Richard Biener <rguenther@suse.de>
12607 PR middle-end/102129
12608 * tree-ssa-ter.c (find_replaceable_in_bb): Do not move
12609 possibly trapping expressions across calls.
12611 2021-08-31 Jakub Jelinek <jakub@redhat.com>
12613 PR tree-optimization/102134
12614 * tree-ssa-ccp.c (bit_value_binop) <case RSHIFT_EXPR>: If sgn is
12615 UNSIGNED and r1val | r1mask has MSB set, ensure lzcount doesn't
12618 2021-08-31 Andrew Pinski <apinski@marvell.com>
12621 * collect-utils.c (setup_signals): New declaration.
12622 * collect-utils.h (setup_signals): New function.
12623 * collect2.c (handler): Delete.
12624 (main): Instead of manually setting up the signals,
12625 just call setup_signals.
12626 * lto-wrapper.c (main): Likewise.
12628 2021-08-31 Andrew Pinski <apinski@marvell.com>
12631 * config/i386/i386-protos.h (x86_output_aligned_bss):
12632 Change align argument to unsigned type.
12633 (x86_elf_aligned_decl_common): Likewise.
12634 * config/i386/i386.c (x86_elf_aligned_decl_common): Likewise.
12635 (x86_output_aligned_bss): Likewise.
12637 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
12639 * config/mips/mips.c (mips_module_isa_name): New.
12640 mips_file_start: add .module mipsREV to all asm output
12642 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
12644 * config/mips/mips.h (struct mips_cpu_info): define enum mips_isa;
12645 use enum instead of int for 'isa' member.
12646 * config.gcc, config/mips/mips.c, config/mips/mips-cpus.def,
12647 config/mips/netbsd.h: replace hardcoded numbers with enum.
12649 2021-08-31 liuhongt <hongtao.liu@intel.com>
12651 * config/i386/sse.md (*<avx512>_ucmp<mode>3_1): Change from
12652 define_split to define_insn_and_split.
12653 (*avx2_eq<mode>3): Removed.
12654 (<avx512>_eq<mode>3<mask_scalar_merge_name>): Adjust pattern
12655 (<avx512>_eq<mode>3<mask_scalar_merge_name>_1): Rename to ..
12656 (*<avx512>_eq<mode>3<mask_scalar_merge_name>_1): .. this, and
12658 (*avx2_gt<mode>3): Removed.
12659 (<avx512>_gt<mode>3<mask_scalar_merge_name>): Change from
12660 define_insn to define_expand, and adjust pattern.
12661 (UNSPEC_MASKED_EQ, UNSPEC_MASKED_GT): Removed.
12663 2021-08-30 David Malcolm <dmalcolm@redhat.com>
12666 * Makefile.in (ANALYZER_OBJS): Add analyzer/call-info.o.
12668 2021-08-30 Jason Merrill <jason@redhat.com>
12670 * doc/invoke.texi: Document -Wmissing-requires.
12672 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
12674 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Remove
12675 TARGET_EXTRA_BUILTINS guard.
12677 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
12679 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Change
12680 initialization of V2DI_type_node and unsigned_V2DI_type_node.
12682 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
12684 * config/rs6000/darwin.h (SUBTARGET_INIT_BUILTINS): Use the new
12685 decl when new_builtins_are_live.
12686 * config/rs6000/rs6000-builtin-new.def (__builtin_cfstring): New
12689 2021-08-30 Pat Haugen <pthaugen@linux.ibm.com>
12691 * config/rs6000/rs6000-cpus.def (ISA_3_1_MASKS_SERVER): Add
12692 OPTION_MASK_P10_FUSION_2STORE.
12693 (POWERPC_MASKS): Likewise.
12694 * config/rs6000/rs6000.c (rs6000_option_override_internal): Enable
12695 store fusion for Power10.
12696 (is_fusable_store): New.
12697 (power10_sched_reorder): Likewise.
12698 (rs6000_sched_reorder): Do Power10 specific reordering.
12699 (rs6000_sched_reorder2): Likewise.
12700 * config/rs6000/rs6000.opt: Add new option.
12702 2021-08-30 Richard Biener <rguenther@suse.de>
12704 PR tree-optimization/102128
12705 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
12706 Move scanning for if-converted scalar code to the caller
12707 and instead delay clearing the visited flag for profitable
12709 (vect_slp_region): Cost all subgraphs before scheduling.
12710 For if-converted BB vectorization scan for scalar COND_EXPRs
12711 and do not vectorize if any found and the cost model is
12714 2021-08-30 Richard Biener <rguenther@suse.de>
12716 * common.opt (fexceptions): Mark
12717 EnabledBy(fnon-call-exceptions).
12718 * doc/invoke.texi (fnon-call-exceptions): Document this
12719 enables -fexceptions.
12721 2021-08-30 Sebastian Huber <sebastian.huber@embedded-brains.de>
12723 * tsystem.h (abort): Define abort() if inhibit_libc is defined and it
12724 is not already defined.
12726 2021-08-30 liuhongt <hongtao.liu@intel.com>
12728 * expmed.c (extract_bit_field_1): Make sure we're playing with
12729 integral modes before call extract_integral_bit_field.
12730 (extract_integral_bit_field): Add a parameter of type
12731 scalar_int_mode which corresponds to of tmode.
12732 And call extract_and_convert_fixed_bit_field instead of
12733 extract_fixed_bit_field and convert_extracted_bit_field.
12734 (extract_and_convert_fixed_bit_field): New function, it's a
12735 combination of extract_fixed_bit_field and
12736 convert_extracted_bit_field.
12738 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
12740 * config/darwin.c (darwin_libc_has_function): Do not run
12741 the checks for x86 or modern Darwin. Make sure that there
12742 is a value set for darwin_macosx_version_min before testing.
12744 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
12746 * config/i386/darwin.h (CLEAR_INSN_CACHE): New.
12748 2021-08-28 Jan Hubicka <hubicka@ucw.cz>
12750 * ipa-modref-tree.h (modref_access_node::merge): Break out
12751 logic combining offsets and logic merging ranges to ...
12752 (modref_access_node::combined_offsets): ... here
12753 (modref_access_node::update2): ... here
12754 (modref_access_node::closer_pair_p): New member function.
12755 (modref_access_node::forced_merge): New member function.
12756 (modre_ref_node::insert): Do merging when table is full.
12758 2021-08-28 YunQiang Su <yunqiang.su@cipunited.com>
12761 * config.gcc: MIPS: use N64 ABI by default if the triple end
12762 with -gnuabi64, which is used by Debian since 2013.
12764 2021-08-28 Alexandre Oliva <oliva@adacore.com>
12766 * ipa-modref.c (analyze_function): Skip debug stmts.
12767 * tree-inline.c (estimate_num_insn): Consider builtins even
12768 without a cgraph_node.
12770 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
12772 * config/h8300/bitfield.md (cstore<mode>4): Remove expander.
12773 * config/h8300/h8300.c (h8300_expand_branch): Remove function.
12774 * config/h8300/h8300-protos.h (h8300_expadn_branch): Remove prototype.
12775 * config/h8300/h8300.md (eqne): New code iterator.
12776 (geultu, geultu_to_c): Similarly.
12777 * config/h8300/testcompare.md (cstore<mode>4): Dummy expander.
12778 (store_c_<mode>, store_c_i_<mode>): New define_insn_and_splits
12779 (cmp<mode>_c): New pattern
12781 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
12783 * tree-ssa-dom.c (reduce_vector_comparison_to_scalar_comparison): New
12785 (dom_opt_dom_walker::optimize_stmt): Use it.
12787 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
12789 * config/darwin.c (finalize_ctors): Add a section-start linker-
12791 (finalize_dtors): Likewise.
12792 * config/darwin.h (MIN_LD64_INIT_TERM_START_LABELS): New.
12794 2021-08-27 Bill Schmidt <wschmidt@linux.ibm.com>
12796 * config/rs6000/rs6000-call.c (rs6000-builtins.h): New #include.
12797 (rs6000_init_builtins): Call rs6000_init_generated_builtins. Skip the
12798 old initialization logic when new builtins are enabled.
12799 * config/rs6000/rs6000-gen-builtins.c (write_decls): Rename
12800 rs6000_autoinit_builtins to rs6000_init_generated_builtins.
12801 (write_init_file): Likewise.
12803 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
12805 * configure.ac (darwin2[[0-9]]* | darwin19*): Alter use of
12806 gcc_GAS_CHECK_FEATURE to remove an extraneous parameter.
12807 (amdgcn-* | gcn-*) Likewise.
12809 2021-08-27 Anthony Sharp <anthonysharp15@gmail.com>
12811 * symbol-summary.h: Added missing template keyword.
12813 2021-08-27 Richard Biener <rguenther@suse.de>
12815 PR tree-optimization/45178
12816 * tree-ssa-dce.c (find_obviously_necessary_stmts): For
12817 infinite loops without exit do not mark control dependent
12818 edges of the latch necessary.
12820 2021-08-27 konglin1 <lingling.kong@intel.com>
12823 * config/i386/sse.md: (<avx512>scattersi<mode>): Add mask operand to
12825 (<avx512>scattersi<mode>): Likewise.
12826 (*avx512f_scattersi<VI48F:mode>): Merge mask operand to set_dest.
12827 (*avx512f_scatterdi<VI48F:mode>): Likewise
12829 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
12831 * config/rs6000/rs6000.c (rs6000_builtin_md_vectorized_function): Add
12832 support for built-in functions MISC_BUILTIN_DIVWE, MISC_BUILTIN_DIVWEU,
12833 MISC_BUILTIN_DIVDE, MISC_BUILTIN_DIVDEU, P10_BUILTIN_CFUGED,
12834 P10_BUILTIN_CNTLZDM, P10_BUILTIN_CNTTZDM, P10_BUILTIN_PDEPD and
12835 P10_BUILTIN_PEXTD on Power10.
12837 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
12839 * config/rs6000/rs6000-call.c (builtin_function_type): Add unsigned
12840 signedness for some Power10 bifs.
12842 2021-08-27 David Edelsohn <dje.gcc@gmail.com>
12845 * config/rs6000/rs6000.c (rs6000_adjust_field_align): Use
12846 computed alignment if the entire struct has attribute packed.
12848 2021-08-27 liuhongt <hongtao.liu@intel.com>
12852 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
12853 IX86_BUILTIN_SHUFPD512, IX86_BUILTIN_SHUFPS512,
12854 IX86_BUILTIN_SHUFPD256, IX86_BUILTIN_SHUFPS,
12855 IX86_BUILTIN_SHUFPS256.
12856 (ix86_masked_all_ones): New function.
12858 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
12860 * config/i386/i386.md (*btr<mode>_1): Call force_reg unconditionally.
12861 (conditional moves with memory inputs splitters): Ditto.
12862 * config/i386/sse.md (one_cmpl<mode>2): Simplify.
12864 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
12866 * ipa-modref-tree.h (modref_access_node::try_merge_with): Restart
12867 search after merging.
12869 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
12871 * config/rs6000/rs6000-overload.def: Add remaining overloads.
12873 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
12875 * config/rs6000/rs6000-builtin-new.def: Add cell stanza.
12877 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
12879 * config/rs6000/rs6000-builtin-new.def: Add ieee128-hw, dfp,
12880 crypto, and htm stanzas.
12882 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
12884 * config/rs6000/rs6000-builtin-new.def: Add mma stanza.
12886 2021-08-26 Martin Sebor <msebor@redhat.com>
12888 * tree-ssa-uninit.c (warn_uninit): Refactor and simplify.
12889 (warn_uninit_phi_uses): Remove argument from calls to warn_uninit.
12890 (warn_uninitialized_vars): Same. Reduce visibility of locals.
12891 (warn_uninitialized_phi): Same.
12893 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
12895 * tree-ssa-ccp.c (get_individual_bits): Helper function to
12896 extract the individual bits from a widest_int constant (mask).
12897 (gray_code_bit_flips): New read-only table for effiently
12898 enumerating permutations/combinations of bits.
12899 (bit_value_binop) [LROTATE_EXPR, RROTATE_EXPR]: Handle rotates
12900 by unknown counts that are guaranteed less than the target
12901 precision and four or fewer unknown bits by enumeration.
12902 [LSHIFT_EXPR, RSHIFT_EXPR]: Likewise, also handle shifts by
12903 enumeration under the same conditions. Handle remaining
12904 shifts as a mask based upon the minimum possible shift value.
12906 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
12907 Richard Biener <rguenther@suse.de>
12909 * match.pd (shift transformations): Remove a redundant
12910 !POINTER_TYPE_P check.
12912 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
12915 * config/i386/i386.md (cmove reg-reg move elimination peephole2s):
12916 Set all_regs to true in the call to replace_rtx.
12918 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
12920 * ipa-modref-tree.c (test_insert_search_collapse): Update test.
12921 * ipa-modref-tree.h (modref_base_node::insert): Be smarter when
12922 hiting --param modref-max-refs limit.
12923 (modref_tree:insert_base): Be smarter when hitting
12924 --param modref-max-bases limit. Add new parameter REF.
12925 (modref_tree:insert): Update.
12926 (modref_tree:merge): Update.
12927 * ipa-modref.c (read_modref_records): Update.
12929 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
12931 * params.opt: (modref-max-adjustments): Add full stop.
12933 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
12935 * ipa-modref-tree.h (modref_ref_node::verify): New member
12937 (modref_ref_node::insert): Use it.
12938 (modref_ref_node::try_mere_with): Fix off by one error.
12940 2021-08-26 Martin Liska <mliska@suse.cz>
12941 Stefan Kneifel <stefan.kneifel@bluewin.ch>
12943 * cgraph.h (create_version_clone_with_body): Add new parameter.
12944 * cgraphclones.c: Likewise.
12945 * multiple_target.c (create_dispatcher_calls): Do not use
12947 (create_target_clone): Likewise here.
12949 2021-08-26 Jonathan Yong <10walls@gmail.com>
12951 * doc/extend.texi: Add note about reserved priorities
12952 to the constructor attribute.
12954 2021-08-25 Martin Sebor <msebor@redhat.com>
12956 * gimple-range-cache.cc (ssa_global_cache::dump): Avoid printing
12957 range table header alone.
12958 * gimple-range.cc (gimple_ranger::export_global_ranges): Same.
12960 2021-08-25 Jan Hubicka <hubicka@ucw.cz>
12962 * doc/invoke.texi: Document --param modref-max-adjustments.
12963 * ipa-modref-tree.c (test_insert_search_collapse): Update.
12964 (test_merge): Update.
12965 * ipa-modref-tree.h (struct modref_access_node): Add adjustments;
12966 (modref_access_node::operator==): Fix handling of access ranges.
12967 (modref_access_node::contains): Constify parameter; handle also
12968 mismatched parm offsets.
12969 (modref_access_node::update): New function.
12970 (modref_access_node::merge): New function.
12971 (unspecified_modref_access_node): Update constructor.
12972 (modref_ref_node::insert_access): Add record_adjustments parameter;
12974 (modref_ref_node::try_merge_with): New private function.
12975 (modref_tree::insert): New record_adjustments parameter.
12976 (modref_tree::merge): New record_adjustments parameter.
12977 (modref_tree::copy_from): Update.
12978 * ipa-modref.c (dump_access): Dump adjustments field.
12979 (get_access): Update constructor.
12980 (record_access): Update call of insert.
12981 (record_access_lto): Update call of insert.
12982 (merge_call_side_effects): Add record_adjustments parameter.
12983 (get_access_for_fnspec): Update.
12984 (process_fnspec): Update.
12985 (analyze_call): Update.
12986 (analyze_function): Update.
12987 (read_modref_records): Update.
12988 (ipa_merge_modref_summary_after_inlining): Update.
12989 (propagate_unknown_call): Update.
12990 (modref_propagate_in_scc): Update.
12991 * params.opt (param-max-modref-adjustments=): New.
12993 2021-08-25 Michael Meissner <meissner@linux.ibm.com>
12995 * config/rs6000/vsx.md (UNSPEC_XXSPLTIDP): Rename from
12997 (xxspltiw_v4si): Use vecperm type attribute.
12998 (xxspltiw_v4si_inst): Use vecperm type attribute.
12999 (xxspltiw_v4sf_inst): Likewise.
13000 (xxspltidp_v2df): Use vecperm type attribute. Use
13001 UNSPEC_XXSPLTIDP instead of UNSPEC_XXSPLTID.
13002 (xxspltidp_v2df_inst): Likewise.
13003 (xxsplti32dx_v4si): Use vecperm type attribute.
13004 (xxsplti32dx_v4si_inst): Likewise.
13005 (xxsplti32dx_v4sf_inst): Likewise.
13006 (xxblend_<mode>): Likewise.
13007 (xxpermx): Likewise.
13008 (xxpermx_inst): Likewise.
13009 (xxeval): Likewise.
13011 2021-08-25 Lewis Hyatt <lhyatt@gmail.com>
13014 * coretypes.h (typedef diagnostic_input_charset_callback): Declare.
13015 * diagnostic.c (diagnostic_initialize_input_context): New function.
13016 * diagnostic.h (diagnostic_initialize_input_context): Declare.
13017 * input.c (default_charset_callback): New function.
13018 (file_cache::initialize_input_context): New function.
13019 (file_cache_slot::create): Added ability to convert the input
13020 according to the input context.
13021 (file_cache::file_cache): Initialize the new input context.
13022 (class file_cache_slot): Added new m_alloc_offset member.
13023 (file_cache_slot::file_cache_slot): Initialize the new member.
13024 (file_cache_slot::~file_cache_slot): Handle potentially offset buffer.
13025 (file_cache_slot::maybe_grow): Likewise.
13026 (file_cache_slot::needs_read_p): Handle NULL fp, which is now possible.
13027 (file_cache_slot::get_next_line): Likewise.
13028 * input.h (class file_cache): Added input context member.
13030 2021-08-25 Richard Biener <rguenther@suse.de>
13032 PR tree-optimization/102046
13033 * tree-vect-slp.c (vect_build_slp_tree_2): Conservatively
13034 update ->any_pattern when swapping operands.
13036 2021-08-25 Hongyu Wang <hongyu.wang@intel.com>
13039 * config/i386/i386.c (ix86_live_on_entry): Adjust comment.
13040 (ix86_decompose_address): Remove retval check for ASHIFT,
13041 allow non-canonical zero extend if AND mask covers ASHIFT
13043 (ix86_legitimate_address_p): Adjust condition for decompose.
13044 (ix86_rtx_costs): Adjust cost for lea with non-canonical
13046 Co-Authored by: Uros Bizjak <ubizjak@gmail.com>
13048 2021-08-25 Jiufu Guo <guojiufu@linux.ibm.com>
13050 PR tree-optimization/101145
13051 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
13053 (number_of_iterations_lt): Invoke above function.
13054 (adjust_cond_for_loop_until_wrap):
13055 Merge to number_of_iterations_until_wrap.
13056 (number_of_iterations_cond): Update invokes for
13057 adjust_cond_for_loop_until_wrap and number_of_iterations_lt.
13059 2021-08-25 konglin1 <lingling.kong@intel.com>
13062 * config/i386/avx512dqintrin.h (_mm512_fpclass_ps_mask): Fix
13063 macro define in O0.
13064 (_mm512_mask_fpclass_ps_mask): Ditto.
13066 2021-08-25 Kewen Lin <linkw@linux.ibm.com>
13068 * config/rs6000/altivec.md (vec_unpacku_hi_v16qi): Remove.
13069 (vec_unpacku_hi_v8hi): Likewise.
13070 (vec_unpacku_lo_v16qi): Likewise.
13071 (vec_unpacku_lo_v8hi): Likewise.
13072 (vec_unpacku_hi_<VP_small_lc>): New define_expand.
13073 (vec_unpacku_lo_<VP_small_lc>): Likewise.
13075 2021-08-24 David Edelsohn <dje.gcc@gmail.com>
13077 * config/rs6000/aix.h (SYSTEM_IMPLICIT_EXTERN_C): Delete.
13078 * config/rs6000/aix71.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
13079 * config/rs6000/aix72.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
13080 * config/rs6000/aix73.h (TARGET_AIX_VERSION): Increase to 73.
13082 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13084 PR middle-end/102031
13085 * simplify-rtx.c (simplify_truncation): When comparing precisions
13086 use "subreg_prec" variable, not "subreg_mode".
13088 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13090 * config/rs6000/rs6000-builtin-new.def: Add power10 and power10-64
13093 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13095 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Initialize
13096 various pointer type nodes.
13097 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add enum
13098 values for various pointer types.
13099 (ptr_V16QI_type_node): New macro.
13100 (ptr_V1TI_type_node): New macro.
13101 (ptr_V2DI_type_node): New macro.
13102 (ptr_V2DF_type_node): New macro.
13103 (ptr_V4SI_type_node): New macro.
13104 (ptr_V4SF_type_node): New macro.
13105 (ptr_V8HI_type_node): New macro.
13106 (ptr_unsigned_V16QI_type_node): New macro.
13107 (ptr_unsigned_V1TI_type_node): New macro.
13108 (ptr_unsigned_V8HI_type_node): New macro.
13109 (ptr_unsigned_V4SI_type_node): New macro.
13110 (ptr_unsigned_V2DI_type_node): New macro.
13111 (ptr_bool_V16QI_type_node): New macro.
13112 (ptr_bool_V8HI_type_node): New macro.
13113 (ptr_bool_V4SI_type_node): New macro.
13114 (ptr_bool_V2DI_type_node): New macro.
13115 (ptr_bool_V1TI_type_node): New macro.
13116 (ptr_pixel_type_node): New macro.
13117 (ptr_intQI_type_node): New macro.
13118 (ptr_uintQI_type_node): New macro.
13119 (ptr_intHI_type_node): New macro.
13120 (ptr_uintHI_type_node): New macro.
13121 (ptr_intSI_type_node): New macro.
13122 (ptr_uintSI_type_node): New macro.
13123 (ptr_intDI_type_node): New macro.
13124 (ptr_uintDI_type_node): New macro.
13125 (ptr_intTI_type_node): New macro.
13126 (ptr_uintTI_type_node): New macro.
13127 (ptr_long_integer_type_node): New macro.
13128 (ptr_long_unsigned_type_node): New macro.
13129 (ptr_float_type_node): New macro.
13130 (ptr_double_type_node): New macro.
13131 (ptr_long_double_type_node): New macro.
13132 (ptr_dfloat64_type_node): New macro.
13133 (ptr_dfloat128_type_node): New macro.
13134 (ptr_ieee128_type_node): New macro.
13135 (ptr_ibm128_type_node): New macro.
13136 (ptr_vector_pair_type_node): New macro.
13137 (ptr_vector_quad_type_node): New macro.
13138 (ptr_long_long_integer_type_node): New macro.
13139 (ptr_long_long_unsigned_type_node): New macro.
13141 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13143 * config/rs6000/rs6000-builtin-new.def: Add power9-vector, power9,
13144 and power9-64 stanzas.
13146 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13147 Tom de Vries <tdevries@suse.de>
13149 * config.gcc (nvptx-*-*): Define {c,c++}_target_objs.
13150 * config/nvptx/nvptx-protos.h (nvptx_cpu_cpp_builtins): Prototype.
13151 * config/nvptx/nvptx.h (TARGET_CPU_CPP_BUILTINS): Implement with
13152 a call to the new nvptx_cpu_cpp_builtins function in nvptx-c.c.
13153 * config/nvptx/t-nvptx (nvptx-c.o): New rule.
13154 * config/nvptx/nvptx-c.c: New source file.
13155 (nvptx_cpu_cpp_builtins): Move implementation here.
13157 2021-08-24 Martin Sebor <msebor@redhat.com>
13159 PR middle-end/101600
13160 PR middle-end/101977
13161 * gimple-ssa-warn-access.cc (maybe_warn_for_bound): Tighten up
13162 the phrasing of a warning.
13163 (check_access): Use the remaining size after subtracting any offset
13164 rather than the whole object size.
13165 * pointer-query.cc (access_ref::get_ref): Clear BASE0 flag if it's
13166 clear for any nonnull PHI argument.
13167 (compute_objsize): Clear argument.
13169 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13171 * config/rs6000/rs6000-builtin-new.def: Add power8-vector stanza.
13173 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13175 * config/rs6000/rs6000-builtin-new.def: Add power7 and power7-64
13178 2021-08-24 Andrew MacLeod <amacleod@redhat.com>
13180 * value-relation.cc (rr_transitive_table): New.
13181 (relation_transitive): New.
13182 (value_relation::swap): Remove.
13183 (value_relation::apply_transitive): New.
13184 (relation_oracle::relation_oracle): Allocate a new tmp bitmap.
13185 (relation_oracle::register_relation): Call register_transitives.
13186 (relation_oracle::register_transitives): New.
13187 * value-relation.h (relation_oracle): Add new temporary bitmap and
13190 2021-08-24 H.J. Lu <hjl.tools@gmail.com>
13193 * config/i386/i386-expand.c (ix86_expand_vector_move): Broadcast
13194 from integer to a pseudo vector register.
13196 2021-08-24 Richard Biener <rguenther@suse.de>
13198 PR tree-optimization/100089
13199 * tree-vectorizer.h (vect_slp_bb): Rename to ...
13200 (vect_slp_if_converted_bb): ... this and get the original
13201 loop as new argument.
13202 * tree-vectorizer.c (try_vectorize_loop_1): Revert previous fix,
13203 pass original loop to vect_slp_if_converted_bb.
13204 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
13205 If orig_loop was passed scan the not vectorized stmts
13206 for COND_EXPRs and force not profitable if found.
13207 (vect_slp_region): Pass down all SLP instances to costing
13208 if orig_loop was specified.
13209 (vect_slp_bbs): Pass through orig_loop.
13210 (vect_slp_bb): Rename to ...
13211 (vect_slp_if_converted_bb): ... this and get the original
13212 loop as new argument.
13213 (vect_slp_function): Adjust.
13215 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
13218 * config/arm/arm.md (attribute arch): Add fix_vlldm.
13219 (arch_enabled): Use it.
13220 * config/arm/vfp.md (lazy_store_multiple_insn): Add alternative to
13221 use when erratum mitigation is needed.
13223 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
13226 * config/arm/arm.opt (mfix-cmse-cve-2021-35465): New option.
13227 * doc/invoke.texi (Arm Options): Document it.
13228 * config/arm/arm-cpus.in (quirk_vlldm): New feature bit.
13229 (ALL_QUIRKS): Add quirk_vlldm.
13230 (cortex-m33): Add quirk_vlldm.
13231 (cortex-m35p, cortex-m55): Likewise.
13232 * config/arm/arm.c (arm_option_override): Enable fix_vlldm if
13233 targetting an affected CPU and not explicitly controlled on
13236 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
13238 * config/arm/vfp.md (lazy_store_multiple_insn): Rewrite as valid RTL.
13239 (lazy_load_multiple_insn): Likewise.
13241 2021-08-24 liuhongt <hongtao.liu@intel.com>
13244 * config/i386/sse.md (<avx512>_vternlog<mode><sd_maskz_name>):
13245 Enable avx512 embedded broadcast.
13246 (*<avx512>_vternlog<mode>_all): Ditto.
13247 (<avx512>_vternlog<mode>_mask): Ditto.
13249 2021-08-24 liuhongt <hongtao.liu@intel.com>
13252 * config/i386/i386.c (ix86_rtx_costs): Define cost for
13254 * config/i386/i386.h (STRIP_UNARY): New macro.
13255 * config/i386/predicates.md (reg_or_notreg_operand): New
13257 * config/i386/sse.md (*<avx512>_vternlog<mode>_all): New define_insn.
13258 (*<avx512>_vternlog<mode>_1): New pre_reload
13259 define_insn_and_split.
13260 (*<avx512>_vternlog<mode>_2): Ditto.
13261 (*<avx512>_vternlog<mode>_3): Ditto.
13262 (any_logic1,any_logic2): New code iterator.
13263 (logic_op): New code attribute.
13264 (ternlogsuffix): Extend to VNxDF and VNxSF.
13266 2021-08-24 Richard Biener <rguenther@suse.de>
13268 * doc/invoke.texi (vect-inner-loop-cost-factor): Adjust.
13269 * params.opt (--param vect-inner-loop-cost-factor): Adjust
13271 * tree-vect-loop.c (vect_analyze_loop_form): Initialize
13272 inner_loop_cost_factor to the minimum of the estimated number
13273 of iterations of the inner loop and vect-inner-loop-cost-factor.
13275 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13276 Richard Biener <rguenther@suse.de>
13278 * config/i386/i386-features.c (compute_convert_gain): Provide
13279 more accurate values for CONST_INT, when optimizing for size.
13280 * config/i386/i386.c (COSTS_N_BYTES): Move definition from here...
13281 * config/i386/i386.h (COSTS_N_BYTES): to here.
13283 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13284 Jakub Jelinek <jakub@redhat.com>
13286 PR middle-end/102029
13287 * match.pd (shift transformations): Add an additional check for
13288 !POINTER_TYPE_P in the recently added left shift transformation.
13290 2021-08-24 liuhongt <hongtao.liu@intel.com>
13292 PR tree-optimization/100089
13293 * tree-vectorizer.c (try_vectorize_loop_1): Disable slp in
13294 loop vectorizer when cost model is very-cheap.
13296 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13298 * config/rs6000/rs6000-gen-builtins.c (parse_bif_entry): Don't call
13299 asprintf, which is not available on AIX.
13301 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13303 * config.gcc (target_gtfiles): Add ./rs6000-builtins.h.
13304 * config/rs6000/t-rs6000 (EXTRA_GTYPE_DEPS): Set.
13306 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13308 * config.gcc (powerpc*-*-*): Add rs6000-builtins.o to extra_objs.
13309 * config/rs6000/rs6000-gen-builtins.c (main): Close init_file
13311 * config/rs6000/t-rs6000 (rs6000-gen-builtins.o): New target.
13312 (rbtree.o): Likewise.
13313 (rs6000-gen-builtins): Likewise.
13314 (rs6000-builtins.c): Likewise.
13315 (rs6000-builtins.h): Likewise.
13316 (rs6000.o): Add dependency.
13317 (EXTRA_HEADERS): Add rs6000-vecdefines.h.
13318 (rs6000-vecdefines.h): New target.
13319 (rs6000-builtins.o): Likewise.
13320 (rs6000-call.o): Add rs6000-builtins.h as a dependency.
13321 (rs6000-c.o): Likewise.
13323 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13326 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace):
13327 Diagnose buffer overrun.
13328 (safe_inc_pos): Fix overrun detection.
13329 (match_identifier): Diagnose buffer overrun.
13330 (match_integer): Likewise.
13331 (match_to_right_bracket): Likewise.
13333 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
13335 * ipa-modref-tree.h (modref_access_node::range_info_useful_p):
13336 Improve range compare.
13337 (modref_access_node::contains): New member function.
13338 (modref_access_node::search): Remove.
13339 (modref_access_node::insert): Be smarter about subaccesses.
13341 2021-08-23 Thomas Schwinge <thomas@codesourcery.com>
13343 * config/i386/i386-options.c (ix86_omp_device_kind_arch_isa)
13344 <omp_device_arch> [ACCEL_COMPILER]: Match "intel_mic".
13345 * config/i386/t-omp-device (omp-device-properties-i386) <arch>:
13348 2021-08-23 Jeff Law <jlaw@localhost.localdomain>
13350 * config/h8300/h8300-protos.h (h8300_expand_epilogue): Add new
13352 * config/h8300/jumpcall.md (call, call_value): Restrict to
13353 !SIBLING_CALL_P cases.
13354 (subcall, sibcall_value): New patterns & expanders.
13355 * config/h8300/proepi.md (epilogue): Pass new argument to
13356 h8300_expand_epilogue.
13357 (sibcall_epilogue): New expander.
13358 * config/h8300/h8300.c (h8300_expand_epilogue): Handle sibcall
13360 (h8300_ok_for_sibcall_p): New function.
13361 (TARGET_FUNCTION_OK_FOR_SIBCALL): define.
13363 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
13365 * simplify-rtx.c (simplify_unary_operation_1): [TRUNCATE]:
13366 Handle case where the operand is already the desired mode.
13368 2021-08-23 Richard Biener <rguenther@suse.de>
13371 * tree-ssa-structalias.c (ipa_pta_execute): Check in_other_partition
13372 in addition to has_gimple_body.
13374 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
13376 PR middle-end/101949
13377 * ipa-modref.c (analyze_ssa_name_flags): Fix merging of
13380 2021-08-23 Martin Liska <mliska@suse.cz>
13382 * doc/invoke.texi: Put the option out of -mxl-mode-app-model
13385 2021-08-23 Richard Biener <rguenther@suse.de>
13387 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
13388 Properly scale the inner loop cost only once.
13390 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
13392 * tree-ssa-ccp.c (bit_value_binop) [TRUNC_MOD_EXPR, TRUNC_DIV_EXPR]:
13393 Provide bounds for unsigned (and signed with non-negative operands)
13394 division and modulus.
13396 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
13398 * simplify-rtx.c (simplify_truncation): Generalize simplification
13399 of (truncate:A (subreg:B X)).
13400 (simplify_unary_operation_1) [FLOAT_TRUNCATE, FLOAT_EXTEND,
13401 SIGN_EXTEND, ZERO_EXTEND]: Handle cases where the operand
13402 already has the desired machine mode.
13403 (test_scalar_int_ops): Add tests that useless extensions and
13404 truncations are optimized away.
13405 (test_scalar_int_ext_ops): New self-test function to confirm
13406 that truncations of extensions are correctly simplified.
13407 (test_scalar_int_ext_ops2): New self-test function to check
13408 truncations of truncations, extensions of extensions, and
13409 truncations of extensions.
13410 (test_scalar_ops): Call the above two functions with a
13411 representative sampling of integer machine modes.
13413 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
13415 * match.pd (shift transformations): Change the sign of an
13416 LSHIFT_EXPR if it reduces the number of explicit conversions.
13418 2021-08-23 Jakub Jelinek <jakub@redhat.com>
13420 PR tree-optimization/86723
13421 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Add
13422 cast64_to_32 argument, set *cast64_to_32 to false, unless n is
13423 non-memory permutation of 64-bit src which only has bytes of
13424 0 or [5..8] and n->range is 4.
13425 (find_bswap_or_nop): Add cast64_to_32 and mask arguments, adjust
13426 find_bswap_or_nop_finalize caller, support bswap with some bytes
13427 zeroed, as long as at least two bytes are not zeroed.
13428 (bswap_replace): Add mask argument and handle masking of bswap
13430 (maybe_optimize_vector_constructor): Adjust find_bswap_or_nop
13431 caller, punt if cast64_to_32 or mask is not all ones.
13432 (pass_optimize_bswap::execute): Adjust find_bswap_or_nop_finalize
13433 caller, for now punt if cast64_to_32.
13435 2021-08-23 Richard Biener <rguenther@suse.de>
13437 PR tree-optimization/79334
13438 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Record
13439 a type also for COMPONENT_REFs.
13440 (vn_reference_may_trap): Check ARRAY_REF with constant index
13441 against the array domain.
13443 2021-08-23 liuhongt <hongtao.liu@intel.com>
13446 * config/i386/sse.md (*avx512f_pshufb_truncv8hiv8qi_1): Add
13447 TARGET_AVX512BW to condition.
13449 2021-08-23 Jakub Jelinek <jakub@redhat.com>
13452 * dwarf2out.c (gen_variable_die): Add DW_AT_location for global
13453 register variables already during early_dwarf if possible.
13455 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
13457 * config/arm/arm_mve.h: Fix __arm_vctp16q return type.
13459 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
13462 * config/arm/arm.opt: Fix typo.
13463 * config/arm/t-rmprofile: Fix typo.
13465 2021-08-23 Jakub Jelinek <jakub@redhat.com>
13467 * tree.h (OMP_CLAUSE_GRAINSIZE_STRICT): Define.
13468 (OMP_CLAUSE_NUM_TASKS_STRICT): Define.
13469 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_GRAINSIZE,
13470 case OMP_CLAUSE_NUM_TASKS>: Print strict: modifier.
13471 * omp-expand.c (expand_task_call): Use GOMP_TASK_FLAG_STRICT in iflags
13472 if either grainsize or num_tasks clause has the strict modifier.
13474 2021-08-23 Martin Liska <mliska@suse.cz>
13476 * dbgcnt.def (DEBUG_COUNTER): New counter.
13477 * gimple.c (gimple_call_arg_flags): Use it in IPA PTA.
13479 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
13481 * ipa-modref.c (analyze_ssa_name_flags): Improve handling of return slot.
13483 2021-08-23 Xi Ruoyao <xry111@mengyan1223.wang>
13486 * config/mips/mips-protos.h (mips_msa_output_shift_immediate):
13488 * config/mips/mips.c (mips_msa_output_shift_immediate): New
13490 * config/mips/mips-msa.md (vashl<mode>3, vashr<mode>3,
13491 vlshr<mode>3): Call it.
13493 2021-08-22 Jan Hubicka <hubicka@ucw.cz>
13494 Martin Liska <mliska@suse.cz>
13496 PR middle-end/101949
13497 * ipa-modref.c (analyze_ssa_name_flags): Indirect call implies
13500 2021-08-21 Dragan Mladjenovic <OT_Dragan.Mladjenovic@mediatek.com>
13502 * config/mips/mips.c (mips_function_rodata_section,
13503 TARGET_ASM_FUNCTION_RODATA_SECTION): Removed.
13505 2021-08-21 John David Anglin <danglin@gcc.gnu.org>
13507 * config/pa/pa.c (pa_asm_output_aligned_common): Remove warning.
13509 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
13511 * configure.ac (thread-local storage support): Remove tls_first_major
13512 and tls_first_minor. Use "$conftest_s" to check support.
13513 * configure: Regenerate.
13515 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
13517 * configure.ac: Fixup formatting.
13519 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
13521 * acinclude.m4 (gcc_GAS_CHECK_FEATURE): Remove third argument and ...
13522 * configure.ac: ... update all callers.
13524 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
13527 * acinclude.m4 (_gcc_COMPUTE_GAS_VERSION, _gcc_GAS_VERSION_GTE_IFELSE)
13528 (gcc_GAS_VERSION_GTE_IFELSE): Remove.
13529 (gcc_GAS_CHECK_FEATURE): Do not handle in-tree case specially.
13530 * configure.ac: Remove gcc_cv_gas_major_version, gcc_cv_gas_minor_version.
13531 Remove remaining checks for in-tree assembler.
13532 * configure: Regenerate.
13534 2021-08-20 Jeff Law <jlaw@localhost.localdomain>
13536 * config/h8300/h8300.c (shift_alg_hi): Improve arithmetic shift right
13537 by 15 bits for H8/300H and H8/S. Improve logical shifts by 12
13539 (shift_alg_si): Improve arithmetic right shift by 28-30 bits for
13540 H8/300H. Improve arithmetic shift right by 15 bits for H8/S.
13541 Improve logical shifts by 27 bits for H8/S.
13542 (get_shift_alg): Corresponding changes.
13543 (h8300_option_override): Revert to loops for -Os when profitable.
13545 2021-08-20 Richard Biener <rguenther@suse.de>
13547 * tree-vect-data-refs.c (dr_group_sort_cmp): Do not compare
13549 (vect_analyze_data_ref_accesses): Likewise. Assign the BB
13550 index as group_id when dataref_groups were not computed.
13551 * tree-vect-slp.c (vect_slp_bbs): Bump current_group when
13552 we advace to the next BB.
13554 2021-08-20 Jakub Jelinek <jakub@redhat.com>
13556 * omp-builtins.def (BUILT_IN_GOMP_WARNING, BUILT_IN_GOMP_ERROR): New
13559 2021-08-20 Martin Liska <mliska@suse.cz>
13561 PR gcov-profile/89961
13562 * gcov.c (make_gcov_file_name): Rewrite using std::string.
13563 (mangle_name): Simplify, do not used the second argument.
13564 (strip_extention): New function.
13565 (get_md5sum): Likewise.
13566 (get_gcov_intermediate_filename): Handle properly -p and -x
13568 (output_gcov_file): Use string type.
13569 (generate_results): Likewise.
13570 (md5sum_to_hex): Remove.
13572 2021-08-20 Michael Meissner <meissner@linux.ibm.com>
13574 * config/rs6000/altivec.md (UNSPEC_XXEVAL): Move to vsx.md.
13575 (UNSPEC_XXSPLTIW): Move to vsx.md.
13576 (UNSPEC_XXSPLTID): Move to vsx.md.
13577 (UNSPEC_XXSPLTI32DX): Move to vsx.md.
13578 (UNSPEC_XXBLEND): Move to vsx.md.
13579 (UNSPEC_XXPERMX): Move to vsx.md.
13580 (VM3): Move to vsx.md.
13581 (VM3_char): Move to vsx.md.
13582 (xxspltiw_v4si): Move to vsx.md.
13583 (xxspltiw_v4sf): Move to vsx.md.
13584 (xxspltiw_v4sf_inst): Move to vsx.md.
13585 (xxspltidp_v2df): Move to vsx.md.
13586 (xxspltidp_v2df_inst): Move to vsx.md.
13587 (xxsplti32dx_v4si_inst): Move to vsx.md.
13588 (xxsplti32dx_v4sf): Move to vsx.md.
13589 (xxsplti32dx_v4sf_inst): Move to vsx.md.
13590 (xxblend_<mode>): Move to vsx.md.
13591 (xxpermx): Move to vsx.md.
13592 (xxpermx_inst): Move to vsx.md.
13593 * config/rs6000/vsx.md (UNSPEC_XXEVAL): Move from altivec.md.
13594 (UNSPEC_XXSPLTIW): Move from altivec.md.
13595 (UNSPEC_XXSPLTID): Move from altivec.md.
13596 (UNSPEC_XXSPLTI32DX): Move from altivec.md.
13597 (UNSPEC_XXBLEND): Move from altivec.md.
13598 (UNSPEC_XXPERMX): Move from altivec.md.
13599 (VM3): Move from altivec.md.
13600 (VM3_char): Move from altivec.md.
13601 (xxspltiw_v4si): Move from altivec.md.
13602 (xxspltiw_v4sf): Move from altivec.md.
13603 (xxspltiw_v4sf_inst): Move from altivec.md.
13604 (xxspltidp_v2df): Move from altivec.md.
13605 (xxspltidp_v2df_inst): Move from altivec.md.
13606 (xxsplti32dx_v4si_inst): Move from altivec.md.
13607 (xxsplti32dx_v4sf): Move from altivec.md.
13608 (xxsplti32dx_v4sf_inst): Move from altivec.md.
13609 (xxblend_<mode>): Move from altivec.md.
13610 (xxpermx): Move from altivec.md.
13611 (xxpermx_inst): Move from altivec.md.
13613 2021-08-19 Roger Sayle <roger@nextmovesoftware.com>
13615 * tree-vect-generic.c (expand_vector_operations_1): Use either
13616 gimplify_build1 or gimplify_build2 instead of gimple_build_assign
13617 when constructing scalar splat expressions.
13619 2021-08-19 Peter Bergner <bergner@linux.ibm.com>
13622 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Cast
13623 pointer to __vector_pair *.
13625 2021-08-19 Martin Sebor <msebor@redhat.com>
13627 * gimple-range.cc: Add comments.
13628 * gimple-range.h: Same.
13630 2021-08-19 Martin Sebor <msebor@redhat.com>
13632 PR middle-end/101984
13633 * gimple-ssa-warn-access.cc (pass_waccess::execute): Also call
13636 2021-08-19 Jeff Law <jlaw@localhost.localdomain>
13638 * config.gcc (h8300-*-elf*): Do not include dbxelf.h.
13639 (h8300-*-linux*, v850-*-rtems*, v850*-elf*): Likewise.
13640 * config/v850/v850.h (DEFAULT_GDB_EXTENSIONS): Remove.
13642 2021-08-19 Jakub Jelinek <jakub@redhat.com>
13644 PR middle-end/101950
13645 * optabs.c (expand_clrsb_using_clz): New function.
13646 (expand_unop): Use it as another clrsb expansion fallback.
13648 2021-08-19 liuhongt <hongtao.liu@intel.com>
13651 2021-07-28 liuhongt <hongtao.liu@intel.com>
13654 * config/i386/i386.h (processor_costs): Add new member
13656 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
13657 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
13658 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
13659 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
13660 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
13661 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
13662 generic_cost, core_cost): Initialize integer_to_sse same value
13664 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
13665 * config/i386/i386.c (ix86_builtin_vectorization_cost):
13666 Use integer_to_sse instead of sse_op to calculate the cost of
13669 2021-08-18 Iain Sandoe <iain@sandoe.co.uk>
13671 * config.gcc: Include rpath.opt for Darwin.
13672 * config/darwin.h (DRIVER_SELF_SPECS): Handle -rpath.
13674 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
13676 PR bootstrap/101959
13677 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
13680 2021-08-18 Jonathan Wright <jonathan.wright@arm.com>
13682 * config/aarch64/arm_neon.h (vld3_lane_f64): Use float RTL
13683 pattern and type cast.
13684 (vld4_lane_f32): Use float RTL pattern.
13685 (vld4q_lane_f64): Use float type cast.
13687 2021-08-18 Jan Hubicka <hubicka@ucw.cz>
13689 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Check also
13692 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
13694 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor): Extend.
13695 (test_map_of_type_with_ctor_and_dtor_expand): Add function.
13696 (hash_map_tests_c_tests): Call it.
13698 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
13700 * ggc.h (enum ggc_collect): New.
13701 (ggc_collect): Use it.
13702 * ggc-page.c: Adjust.
13703 * ggc-common.c: Likewise.
13704 * ggc-tests.c: Likewise.
13705 * read-rtl-function.c: Likewise.
13706 * selftest-run-tests.c: Likewise.
13707 * doc/gty.texi (Invoking the garbage collector): Likewise.
13709 2021-08-18 liuhongt <hongtao.liu@intel.com>
13712 * config/i386/i386.h (TARGET_V2DF_REDUCTION_PREFER_HADDPD):
13714 * config/i386/sse.md (*sse3_haddv2df3_low): Add
13715 TARGET_V2DF_REDUCTION_PREFER_HADDPD.
13716 (*sse3_hsubv2df3_low): Ditto.
13717 * config/i386/x86-tune.def
13718 (X86_TUNE_V2DF_REDUCTION_PREFER_HADDPD): New tune.
13720 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
13722 * gimple-range-gori.cc (gori_compute::gori_compute): Enable tracing.
13723 (gori_compute::compute_operand_range): Add tracing.
13724 (gori_compute::logical_combine): Ditto.
13725 (gori_compute::compute_logical_operands): Ditto.
13726 (gori_compute::compute_operand1_range): Ditto.
13727 (gori_compute::compute_operand2_range): Ditto.
13728 (gori_compute::outgoing_edge_range_p): Ditto.
13729 * gimple-range-gori.h (class gori_compute): Add range_tracer.
13731 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
13733 * flag-types.h (enum evrp_mode): Adjust evrp-mode values.
13734 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Relocate from.
13735 * gimple-range-trace.h (DEBUG_RANGE_CACHE): Here.
13736 * params.opt (--param=evrp-mode): Adjust options.
13738 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
13740 * Makefile.in (OBJS): Add gimple-range-trace.o.
13741 * gimple-range-cache.h (enable_new_values): Remove unused prototype.
13742 * gimple-range-fold.cc: Adjust headers.
13743 * gimple-range-trace.cc: New.
13744 * gimple-range-trace.h: New.
13745 * gimple-range.cc (gimple_ranger::gimple_ranger): Enable tracer.
13746 (gimple_ranger::range_of_expr): Add tracing.
13747 (gimple_ranger::range_on_entry): Ditto.
13748 (gimple_ranger::range_on_exit): Ditto.
13749 (gimple_ranger::range_on_edge): Ditto.
13750 (gimple_ranger::fold_range_internal): Ditto.
13751 (gimple_ranger::dump_bb): Do not calculate edge range twice.
13752 (trace_ranger::*): Remove.
13753 (enable_ranger): Never create a trace_ranger.
13754 (debug_seed_ranger): Move to gimple-range-trace.cc.
13755 (dump_ranger): Ditto.
13756 (debug_ranger): Ditto.
13757 * gimple-range.h: Include gimple-range-trace.h.
13758 (range_on_entry, range_on_exit): No longer virtual.
13759 (class trace_ranger): Remove.
13760 (DEBUG_RANGE_CACHE): Move to gimple-range-trace.h.
13762 2021-08-17 Martin Sebor <msebor@redhat.com>
13764 PR middle-end/101854
13765 * builtins.c (expand_builtin_alloca): Move warning code to check_alloca
13766 in gimple-ssa-warn-access.cc.
13767 * calls.c (alloc_max_size): Move code to check_alloca.
13768 (get_size_range): Move to pointer-query.cc.
13769 (maybe_warn_alloc_args_overflow): Move to gimple-ssa-warn-access.cc.
13770 (get_attr_nonstring_decl): Move to tree.c.
13771 (fntype_argno_type): Move to gimple-ssa-warn-access.cc.
13772 (append_attrname): Same.
13773 (maybe_warn_rdwr_sizes): Same.
13774 (initialize_argument_information): Move code to
13775 gimple-ssa-warn-access.cc.
13776 * calls.h (maybe_warn_alloc_args_overflow): Move to
13777 gimple-ssa-warn-access.h.
13778 (get_attr_nonstring_decl): Move to tree.h.
13779 (maybe_warn_nonstring_arg): Move to gimple-ssa-warn-access.h.
13780 (enum size_range_flags): Move to pointer-query.h.
13781 (get_size_range): Same.
13782 * gimple-ssa-warn-access.cc (has_location): Remove unused overload
13783 to avoid Clang -Wunused-function.
13784 (get_size_range): Declare static.
13785 (maybe_emit_free_warning): Rename...
13786 (maybe_check_dealloc_call): ...to this for consistency.
13787 (class pass_waccess): Add members.
13788 (pass_waccess::~pass_waccess): Defined.
13789 (alloc_max_size): Move here from calls.c.
13790 (maybe_warn_alloc_args_overflow): Same.
13791 (check_alloca): New function.
13792 (check_alloc_size_call): New function.
13793 (check_strncat): Handle another warning flag.
13794 (pass_waccess::check_builtin): Handle alloca.
13795 (fntype_argno_type): Move here from calls.c.
13796 (append_attrname): Same.
13797 (maybe_warn_rdwr_sizes): Same.
13798 (pass_waccess::check_call): Define.
13799 (check_nonstring_args): New function.
13800 (pass_waccess::check): Call new member functions.
13801 (pass_waccess::execute): Enable ranger.
13802 * gimple-ssa-warn-access.h (get_size_range): Move here from calls.h.
13803 (maybe_warn_nonstring_arg): Same.
13804 * gimple-ssa-warn-restrict.c: Remove #include.
13805 * pointer-query.cc (get_size_range): Move here from calls.c.
13806 * pointer-query.h (enum size_range_flags): Same.
13807 (get_size_range): Same.
13808 * tree.c (get_attr_nonstring_decl): Move here from calls.c.
13809 * tree.h (get_attr_nonstring_decl): Move here from calls.h.
13811 2021-08-17 Thomas Schwinge <thomas@codesourcery.com>
13813 * ggc.h (ggc_collect): Add 'force_collect' parameter.
13814 * ggc-page.c (ggc_collect): Use that one instead of global
13815 'ggc_force_collect'. Adjust all users.
13816 * doc/gty.texi (Invoking the garbage collector): Update.
13817 * ggc-internal.h (ggc_force_collect): Remove.
13818 * ggc-common.c (ggc_force_collect): Likewise.
13819 * selftest.h (forcibly_ggc_collect): Remove.
13820 * ggc-tests.c (selftest::forcibly_ggc_collect): Likewise.
13821 * read-rtl-function.c (test_loading_labels): Adjust.
13822 * selftest-run-tests.c (run_tests): Likewise.
13824 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
13826 * config/darwin.c (darwin_file_end): Reset and reclaim the
13827 section names table at the end of compile.
13829 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
13832 * config.in: Regenerate.
13833 * config/i386/darwin.h (EXTRA_ASM_OPTS): New
13834 (ASM_SPEC): Pass options to disable branch shortening where
13836 * configure: Regenerate.
13837 * configure.ac: Detect versions of 'as' that support the
13838 optimisation which has the bug.
13840 2021-08-17 Richard Biener <rguenther@suse.de>
13842 * optabs-query.c (supports_vec_gather_load_p): Also check
13844 (supports_vec_scatter_store_p): Likewise.
13845 * tree-vect-data-refs.c (vect_gather_scatter_fn_p): Fall
13846 back to masked variants if non-masked are not supported.
13847 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
13848 When we need to use masked gather/scatter but do not have
13849 a mask set up a constant true one.
13850 * tree-vect-stmts.c (vect_check_scalar_mask): Also allow
13851 non-SSA_NAME masks.
13853 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
13855 * tree-ssa-ccp.c (bit_value_binop) [MINUS_EXPR]: Use same
13856 algorithm as PLUS_EXPR to improve subtraction bit bounds.
13857 [POINTER_DIFF_EXPR]: Treat as synonymous with MINUS_EXPR.
13859 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
13861 * tree-ssa-ccp.c (bit_value_mult_const): New helper function to
13862 calculate the mask-value pair result of a multiplication by an
13864 (bit_value_binop) [MULT_EXPR]: Call it from here for
13865 multiplications by (sparse) non-negative constants.
13867 2021-08-17 Christophe Lyon <christophe.lyon@foss.st.com>
13870 * config.gcc (gcc_cv_initfini_array): Leave undefined for
13871 uclinuxfdpiceabi targets.
13873 2021-08-17 Alexandre Oliva <oliva@adacore.com>
13875 * tree-inline.c (maybe_move_debug_stmts_to_successors): Don't
13876 reverse debug stmts.
13878 2021-08-17 Alexandre Oliva <oliva@adacore.com>
13880 * tree-cfg.c (dump_function_to_file): Use fun, not cfun.
13882 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
13884 * config/aarch64/arm_neon.h (__LD4_LANE_FUNC): Delete.
13885 (__LD4Q_LANE_FUNC): Likewise.
13886 (vld4_lane_u8): Define without macro.
13887 (vld4_lane_u16): Likewise.
13888 (vld4_lane_u32): Likewise.
13889 (vld4_lane_u64): Likewise.
13890 (vld4_lane_s8): Likewise.
13891 (vld4_lane_s16): Likewise.
13892 (vld4_lane_s32): Likewise.
13893 (vld4_lane_s64): Likewise.
13894 (vld4_lane_f16): Likewise.
13895 (vld4_lane_f32): Likewise.
13896 (vld4_lane_f64): Likewise.
13897 (vld4_lane_p8): Likewise.
13898 (vld4_lane_p16): Likewise.
13899 (vld4_lane_p64): Likewise.
13900 (vld4q_lane_u8): Likewise.
13901 (vld4q_lane_u16): Likewise.
13902 (vld4q_lane_u32): Likewise.
13903 (vld4q_lane_u64): Likewise.
13904 (vld4q_lane_s8): Likewise.
13905 (vld4q_lane_s16): Likewise.
13906 (vld4q_lane_s32): Likewise.
13907 (vld4q_lane_s64): Likewise.
13908 (vld4q_lane_f16): Likewise.
13909 (vld4q_lane_f32): Likewise.
13910 (vld4q_lane_f64): Likewise.
13911 (vld4q_lane_p8): Likewise.
13912 (vld4q_lane_p16): Likewise.
13913 (vld4q_lane_p64): Likewise.
13914 (vld4_lane_bf16): Likewise.
13915 (vld4q_lane_bf16): Likewise.
13917 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
13919 * config/aarch64/arm_neon.h (__LD3_LANE_FUNC): Delete.
13920 (__LD3Q_LANE_FUNC): Delete.
13921 (vld3_lane_u8): Define without macro.
13922 (vld3_lane_u16): Likewise.
13923 (vld3_lane_u32): Likewise.
13924 (vld3_lane_u64): Likewise.
13925 (vld3_lane_s8): Likewise.
13926 (vld3_lane_s16): Likewise.
13927 (vld3_lane_s32): Likewise.
13928 (vld3_lane_s64): Likewise.
13929 (vld3_lane_f16): Likewise.
13930 (vld3_lane_f32): Likewise.
13931 (vld3_lane_f64): Likewise.
13932 (vld3_lane_p8): Likewise.
13933 (vld3_lane_p16): Likewise.
13934 (vld3_lane_p64): Likewise.
13935 (vld3q_lane_u8): Likewise.
13936 (vld3q_lane_u16): Likewise.
13937 (vld3q_lane_u32): Likewise.
13938 (vld3q_lane_u64): Likewise.
13939 (vld3q_lane_s8): Likewise.
13940 (vld3q_lane_s16): Likewise.
13941 (vld3q_lane_s32): Likewise.
13942 (vld3q_lane_s64): Likewise.
13943 (vld3q_lane_f16): Likewise.
13944 (vld3q_lane_f32): Likewise.
13945 (vld3q_lane_f64): Likewise.
13946 (vld3q_lane_p8): Likewise.
13947 (vld3q_lane_p16): Likewise.
13948 (vld3q_lane_p64): Likewise.
13949 (vld3_lane_bf16): Likewise.
13950 (vld3q_lane_bf16): Likewise.
13952 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
13954 * config/aarch64/arm_neon.h (__LD2_LANE_FUNC): Delete.
13955 (__LD2Q_LANE_FUNC): Likewise.
13956 (vld2_lane_u8): Define without macro.
13957 (vld2_lane_u16): Likewise.
13958 (vld2_lane_u32): Likewise.
13959 (vld2_lane_u64): Likewise.
13960 (vld2_lane_s8): Likewise.
13961 (vld2_lane_s16): Likewise.
13962 (vld2_lane_s32): Likewise.
13963 (vld2_lane_s64): Likewise.
13964 (vld2_lane_f16): Likewise.
13965 (vld2_lane_f32): Likewise.
13966 (vld2_lane_f64): Likewise.
13967 (vld2_lane_p8): Likewise.
13968 (vld2_lane_p16): Likewise.
13969 (vld2_lane_p64): Likewise.
13970 (vld2q_lane_u8): Likewise.
13971 (vld2q_lane_u16): Likewise.
13972 (vld2q_lane_u32): Likewise.
13973 (vld2q_lane_u64): Likewise.
13974 (vld2q_lane_s8): Likewise.
13975 (vld2q_lane_s16): Likewise.
13976 (vld2q_lane_s32): Likewise.
13977 (vld2q_lane_s64): Likewise.
13978 (vld2q_lane_f16): Likewise.
13979 (vld2q_lane_f32): Likewise.
13980 (vld2q_lane_f64): Likewise.
13981 (vld2q_lane_p8): Likewise.
13982 (vld2q_lane_p16): Likewise.
13983 (vld2q_lane_p64): Likewise.
13984 (vld2_lane_bf16): Likewise.
13985 (vld2q_lane_bf16): Likewise.
13987 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
13989 * haifa-sched.c (advance_one_cycle): Output more context-synchronization
13992 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
13994 * haifa-sched.c (enum rfs_decision, rfs_str): Add RFS_AUTOPREF.
13995 (rank_for_schedule): Use it.
13997 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
13999 PR rtl-optimization/91598
14000 * haifa-sched.c (autopref_rank_for_schedule): Prioritize "irrelevant"
14001 insns after memory reads and before memory writes.
14003 2021-08-17 Alistair_Lee <alistair.lee@arm.com>
14005 * rtl.h (CONST_VECTOR_P): New macro.
14006 * config/aarch64/aarch64.c (aarch64_get_sve_pred_bits): Use RTL
14007 code testing macros.
14008 (aarch64_ptrue_all_mode): Likewise.
14009 (aarch64_expand_mov_immediate): Likewise.
14010 (aarch64_const_vec_all_in_range_p): Likewise.
14011 (aarch64_rtx_costs): Likewise.
14012 (aarch64_legitimate_constant_p): Likewise.
14013 (aarch64_simd_valid_immediate): Likewise.
14014 (aarch64_simd_make_constant): Likewise.
14015 (aarch64_convert_mult_to_shift): Likewise.
14016 (aarch64_expand_sve_vec_perm): Likewise.
14017 (aarch64_vec_fpconst_pow_of_2): Likewise.
14019 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14021 PR tree-optimization/101938
14022 * range-op.cc (operator_abs::op1_range): Special case
14023 -TYPE_MIN_VALUE for flag_wrapv.
14025 2021-08-17 Kewen Lin <linkw@linux.ibm.com>
14027 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Add the cost for
14030 2021-08-17 Jakub Jelinek <jakub@redhat.com>
14032 * tree.def (OMP_SCOPE): New tree code.
14033 * tree.h (OMP_SCOPE_BODY, OMP_SCOPE_CLAUSES): Define.
14034 * tree-nested.c (convert_nonlocal_reference_stmt,
14035 convert_local_reference_stmt, convert_gimple_call): Handle
14037 * tree-pretty-print.c (dump_generic_node): Handle OMP_SCOPE.
14038 * gimple.def (GIMPLE_OMP_SCOPE): New gimple code.
14039 * gimple.c (gimple_build_omp_scope): New function.
14040 (gimple_copy): Handle GIMPLE_OMP_SCOPE.
14041 * gimple.h (gimple_build_omp_scope): Declare.
14042 (gimple_has_substatements): Handle GIMPLE_OMP_SCOPE.
14043 (gimple_omp_scope_clauses, gimple_omp_scope_clauses_ptr,
14044 gimple_omp_scope_set_clauses): New inline functions.
14045 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_SCOPE.
14046 * gimple-pretty-print.c (dump_gimple_omp_scope): New function.
14047 (pp_gimple_stmt_1): Handle GIMPLE_OMP_SCOPE.
14048 * gimple-walk.c (walk_gimple_stmt): Likewise.
14049 * gimple-low.c (lower_stmt): Likewise.
14050 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
14051 (gimplify_scan_omp_clauses): For task reductions, handle OMP_SCOPE
14052 like ORT_WORKSHARE constructs. Adjust diagnostics for %<scope%>
14053 allowing task reductions. Reject inscan reductions on scope.
14054 (omp_find_stores_stmt): Handle GIMPLE_OMP_SCOPE.
14055 (gimplify_omp_workshare, gimplify_expr): Handle OMP_SCOPE.
14056 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_SCOPE.
14057 (estimate_num_insns): Likewise.
14058 * omp-low.c (build_outer_var_ref): Look through GIMPLE_OMP_SCOPE
14059 contexts if var isn't privatized there.
14060 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_SCOPE.
14061 (scan_omp_1_stmt): Likewise.
14062 (maybe_add_implicit_barrier_cancel): Look through outer
14064 (lower_omp_scope): New function.
14065 (lower_omp_task_reductions): Handle OMP_SCOPE.
14066 (lower_omp_1): Handle GIMPLE_OMP_SCOPE.
14067 (diagnose_sb_1, diagnose_sb_2): Likewise.
14068 * omp-expand.c (expand_omp_single): Support also GIMPLE_OMP_SCOPE.
14069 (expand_omp): Handle GIMPLE_OMP_SCOPE.
14070 (omp_make_gimple_edges): Likewise.
14071 * omp-builtins.def (BUILT_IN_GOMP_SCOPE_START): New built-in.
14073 2021-08-17 Richard Biener <rguenther@suse.de>
14075 PR tree-optimization/101925
14076 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Set
14077 reverse on COMPONENT_REF and ARRAY_REF according to
14078 what reverse_storage_order_for_component_p does.
14079 (vn_reference_eq): Compare reversed on reference ops.
14080 (reverse_storage_order_for_component_p): New overload.
14081 (vn_reference_lookup_3): Check reverse_storage_order_for_component_p
14082 on the reference looked up.
14084 2021-08-17 Jeff Law <jlaw@localhost.localdomain>
14086 * config/h8300/h8300.c (shift_alg_si): Avoid loops for most SImode
14087 shifts on the H8/S.
14088 (h8300_option_override): Use loops on H8/S more often when optimizing
14090 (get_shift_alg): Handle new "special" cases on H8/S. Simplify
14091 accordingly. Handle various arithmetic right shifts with special
14092 sequences that we couldn't handle before.
14094 2021-08-16 Jeff Law <jlaw@localhost.localdomain>
14096 * config.gcc (rl78-*-elf*): Do not include dbxelf.h.
14098 2021-08-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
14100 * config/sparc/rtemself.h (SPARC_GCOV_TYPE_SIZE): Define.
14101 * config/sparc/sparc.c (sparc_gcov_type_size): New.
14102 (TARGET_GCOV_TYPE_SIZE): Redefine if SPARC_GCOV_TYPE_SIZE is defined.
14103 * coverage.c (get_gcov_type): Use targetm.gcov_type_size().
14104 * doc/tm.texi (TARGET_GCOV_TYPE_SIZE): Add hook under "Misc".
14105 * doc/tm.texi.in: Regenerate.
14106 * target.def (gcov_type_size): New target hook.
14107 * targhooks.c (default_gcov_type_size): New.
14108 * targhooks.h (default_gcov_type_size): Declare.
14109 * tree-profile.c (gimple_gen_edge_profiler): Use precision of
14111 (gimple_gen_time_profiler): Likewise.
14113 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
14115 * dwarf2out.c (add_scalar_info): Deal with DW_AT_data_bit_offset.
14117 2021-08-16 Tobias Burnus <tobias@codesourcery.com>
14119 PR middle-end/101931
14120 * omp-low.c (omp_runtime_api_call): Update for routines
14121 added in the meanwhile.
14123 2021-08-16 Martin Liska <mliska@suse.cz>
14125 PR tree-optimization/100393
14126 * tree-switch-conversion.c (group_cluster::dump): Use
14127 get_comparison_count.
14128 (jump_table_cluster::find_jump_tables): Pre-compute number of
14129 comparisons and then decrement it. Cache also max_ratio.
14130 (jump_table_cluster::can_be_handled): Change signature.
14131 * tree-switch-conversion.h (get_comparison_count): New.
14133 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
14135 * dwarf2out.c (add_data_member_location_attribute): Use GNAT
14136 encodings only when -fgnat-encodings=all is specified.
14137 (add_bound_info): Likewise.
14138 (add_byte_size_attribute): Likewise.
14139 (gen_member_die): Likewise.
14141 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14143 * omp-oacc-neuter-broadcast.cc
14144 (execute_omp_oacc_neuter_broadcast): Plug 'par' memory leak.
14146 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14148 * omp-oacc-neuter-broadcast.cc
14149 (execute_omp_oacc_neuter_broadcast): Clarify memory management for
14152 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14154 * omp-oacc-neuter-broadcast.cc (field_map): Move variable into...
14155 (execute_omp_oacc_neuter_broadcast): ... here.
14156 (install_var_field, build_receiver_ref, build_sender_ref): Take
14157 'field_map_t *' parameter. Adjust all users.
14158 (worker_single_copy, neuter_worker_single): Take a
14159 'record_field_map_t *' parameter. Adjust all users.
14161 2021-08-16 liuhongt <hongtao.liu@intel.com>
14164 * config/i386/i386.md (ldexp<mode>3): Force operands[1] to
14167 2021-08-16 Martin Liska <mliska@suse.cz>
14170 * multiple_target.c (create_dispatcher_calls): Make default
14171 function local only if it is a definition.
14173 2021-08-16 Martin Liska <mliska@suse.cz>
14176 * ipa-icf-gimple.c (func_checker::compare_ssa_name): Do not
14177 consider equal SSA_NAMEs when one is a param.
14179 2021-08-16 liuhongt <hongtao.liu@intel.com>
14182 * config/i386/i386-expand.c (ix86_expand_vec_perm_vpermt2):
14183 Support vpermi2b for V32QI/V16QImode.
14184 (ix86_extract_perm_from_pool_constant): New function.
14185 (ix86_expand_vec_one_operand_perm_avx512): Support
14186 vpermw/vpermb under TARGET_AVX512BW/TARGET_AVX512VBMI.
14187 (expand_vec_perm_1): Adjust comments for upper.
14188 * config/i386/i386-protos.h (ix86_extract_perm_from_pool_constant):
14190 * config/i386/predicates.md (permvar_truncate_operand): New predicate.
14191 (pshufb_truncv4siv4hi_operand): Ditto.
14192 (pshufb_truncv8hiv8qi_operand): Ditto.
14193 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1):
14194 New pre_reload define_insn_and_split.
14195 (*avx512f_permvar_truncv8siv8hi_1): Ditto.
14196 (*avx512f_vpermvar_truncv8div8si_1): Ditto.
14197 (*avx512f_permvar_truncv32hiv32qi_1): Ditto.
14198 (*avx512f_permvar_truncv16hiv16qi_1): Ditto.
14199 (*avx512f_permvar_truncv4div4si_1): Ditto.
14200 (*avx512f_pshufb_truncv8hiv8qi_1): Ditto.
14201 (*avx512f_pshufb_truncv4siv4hi_1): Ditto.
14202 (*avx512f_pshufd_truncv2div2si_1): Ditto.
14204 2021-08-16 Kito Cheng <kito.cheng@sifive.com>
14206 * config/riscv/multilib-generator: Support code model option for
14208 * doc/install.texi: Add document of new option for
14209 --with-multilib-generator.
14211 2021-08-15 Clément Chigot <clement.chigot@atos.net>
14213 * config/rs6000/rs6000.c (xcoff_tls_exec_model_detected): New.
14214 (rs6000_legitimize_tls_address_aix): Use it.
14215 (rs6000_xcoff_file_end): Add ".ref __tls_get_addr" when
14216 xcoff_tls_exec_model_detected is true.
14218 2021-08-15 Jeff Law <jlaw@localhost.localdomain>
14220 * config/h8300/h8300.c (shift_alg_si): Retune H8/300H shifts
14221 to allow a bit more code growth, saving many dozens of cycles.
14222 (h8300_option_override): Adjus shift_alg_si if optimizing for
14224 (get_shift_alg): Use special + inline shifts for residuals
14227 2021-08-14 Stafford Horne <shorne@gmail.com>
14230 * config/or1k/or1k-opts.h: New file.
14231 * config/or1k/or1k.c (or1k_legitimize_address_1, print_reloc):
14232 Support generating gotha relocations if -mcmodel=large is
14234 * config/or1k/or1k.h (TARGET_CMODEL_SMALL, TARGET_CMODEL_LARGE):
14236 * config/or1k/or1k.opt (mcmodel=): New option.
14237 * doc/invoke.texi (OpenRISC Options): Document mcmodel.
14239 2021-08-14 Martin Sebor <msebor@redhat.com>
14241 PR middle-end/101791
14242 * gimple-ssa-warn-access.cc (new_delete_mismatch_p): Use new argument
14243 to valid_new_delete_pair_p.
14244 * tree.c (valid_new_delete_pair_p): Add argument.
14245 * tree.h (valid_new_delete_pair_p): Same.
14247 2021-08-14 Jakub Jelinek <jakub@redhat.com>
14250 * config/i386/i386-expand.c (expand_vec_perm_broadcast_1)
14251 <case E_V64QImode>: For this mode assert
14252 !TARGET_AVX512BW || d->perm[0] rather than !TARGET_AVX2 || d->perm[0].
14254 2021-08-13 Michael Meissner <meissner@linux.ibm.com>
14257 * config/rs6000/altivec.md (xxeval): Use register_predicate
14258 instead of altivec_register_predicate.
14260 2021-08-13 Martin Sebor <msebor@redhat.com>
14262 PR middle-end/101734
14263 * tree-ssa-uninit.c (maybe_warn_read_write_only): New function.
14264 (maybe_warn_operand): Call it.
14266 2021-08-13 Martin Liska <mliska@suse.cz>
14269 * attribs.c (decl_attributes): Make naked functions "noipa"
14272 2021-08-13 Martin Liska <mliska@suse.cz>
14275 * symtab.c (symtab_node::noninterposable_alias): Do not create
14276 local aliases for target_clone functions as the clonning pass
14279 2021-08-13 Martin Liska <mliska@suse.cz>
14281 * opts.c (LIVE_PATCHING_OPTION): Define.
14282 (control_options_for_live_patching): Use it in error messages.
14284 2021-08-13 Jan Hubicka <hubicka@ucw.cz>
14286 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOREAD.
14287 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
14288 ignore_stores_eaf_flags): New constants.
14289 (remove_useless_eaf_flags): New function.
14290 (eaf_flags_useful_p): Use it.
14291 (deref_flags): Add EAF_NOT_RETURNED if flag is unused;
14293 (modref_lattice::init): Add EAF_NOREAD.
14294 (modref_lattice::add_escape_point): Do not reacord escape point if
14296 (modref_lattice::merge): EAF_NOESCAPE implies EAF_NODIRECTESCAPE;
14297 use remove_useless_eaf_flags.
14298 (modref_lattice::merge_deref): Use ignore_stores_eaf_flags.
14299 (modref_lattice::merge_direct_load): Add EAF_NOREAD
14300 (analyze_ssa_name_flags): Fix handling EAF_NOT_RETURNED
14301 (analyze_parms): Use remove_useless_eaf_flags.
14302 (ipa_merge_modref_summary_after_inlining): Use ignore_stores_eaf_flags.
14303 (modref_merge_call_site_flags): Add caller and ecf_flags parameter;
14304 use remove_useless_eaf_flags.
14305 (modref_propagate_flags_in_scc): Update.
14306 * ipa-modref.h: Turn eaf_flags_t back to char.
14307 * tree-core.h (EAF_NOT_RETURNED): Fix.
14308 (EAF_NOREAD): New constant
14309 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1): Check for
14311 * tree-ssa-structalias.c (handle_rhs_call): Handle new flags.
14312 (handle_pure_call): Likewise.
14314 2021-08-12 Jakub Jelinek <jakub@redhat.com>
14316 * tree.def (OMP_MASKED): New tree code.
14317 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_FILTER.
14318 * tree.h (OMP_MASKED_BODY, OMP_MASKED_CLAUSES, OMP_MASKED_COMBINED,
14319 OMP_CLAUSE_FILTER_EXPR): Define.
14320 * tree.c (omp_clause_num_ops): Add OMP_CLAUSE_FILTER entry.
14321 (omp_clause_code_name): Likewise.
14322 (walk_tree_1): Handle OMP_CLAUSE_FILTER.
14323 * tree-nested.c (convert_nonlocal_omp_clauses,
14324 convert_local_omp_clauses): Handle OMP_CLAUSE_FILTER.
14325 (convert_nonlocal_reference_stmt, convert_local_reference_stmt,
14326 convert_gimple_call): Handle GIMPLE_OMP_MASTER.
14327 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_FILTER.
14328 (dump_generic_node): Handle OMP_MASTER.
14329 * gimple.def (GIMPLE_OMP_MASKED): New gimple code.
14330 * gimple.c (gimple_build_omp_masked): New function.
14331 (gimple_copy): Handle GIMPLE_OMP_MASKED.
14332 * gimple.h (gimple_build_omp_masked): Declare.
14333 (gimple_has_substatements): Handle GIMPLE_OMP_MASKED.
14334 (gimple_omp_masked_clauses, gimple_omp_masked_clauses_ptr,
14335 gimple_omp_masked_set_clauses): New inline functions.
14336 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_MASKED.
14337 * gimple-pretty-print.c (dump_gimple_omp_masked): New function.
14338 (pp_gimple_stmt_1): Handle GIMPLE_OMP_MASKED.
14339 * gimple-walk.c (walk_gimple_stmt): Likewise.
14340 * gimple-low.c (lower_stmt): Likewise.
14341 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
14342 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_FILTER. For clauses
14343 that take one expression rather than decl or constant, force
14344 gimplification of that into a SSA_NAME or temporary unless min
14346 (gimplify_adjust_omp_clauses): Handle OMP_CLAUSE_FILTER.
14347 (gimplify_expr): Handle OMP_MASKED.
14348 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_MASKED.
14349 (estimate_num_insns): Likewise.
14350 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_FILTER.
14351 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_MASKED. Adjust
14352 diagnostics for existence of masked construct.
14353 (scan_omp_1_stmt, lower_omp_master, lower_omp_1, diagnose_sb_1,
14354 diagnose_sb_2): Handle GIMPLE_OMP_MASKED.
14355 * omp-expand.c (expand_omp_synch, expand_omp, omp_make_gimple_edges):
14358 2021-08-12 Uroš Bizjak <ubizjak@gmail.com>
14361 * config/i386/i386.md (avx512f_scalef<mode>2): New insn pattern.
14362 (ldexp<mode>3): Use avx512f_scalef<mode>2.
14363 (UNSPEC_SCALEF): Move from ...
14364 * config/i386/sse.md (UNSPEC_SCALEF): ... here.
14366 2021-08-12 Jan Hubicka <hubicka@ucw.cz>
14368 * ipa-split.c (consider_split): Fix condition testing void functions.
14370 2021-08-12 Aldy Hernandez <aldyh@redhat.com>
14372 * doc/invoke.texi: Remove docs for threader-mode param.
14373 * flag-types.h (enum threader_mode): Remove.
14374 * params.opt: Remove threader-mode param.
14375 * tree-ssa-threadbackward.c (class back_threader): Remove
14376 path_is_unreachable_p.
14377 Make find_paths private.
14378 Add maybe_thread and thread_through_all_blocks.
14379 Remove reference marker for m_registry.
14380 Remove reference marker for m_profit.
14381 (back_threader::back_threader): Adjust for registry and profit not
14383 (dump_path): Move down.
14384 (debug): Move down.
14385 (class thread_jumps): Remove.
14386 (class back_threader_registry): Remove m_all_paths.
14388 (thread_jumps::thread_through_all_blocks): Move to back_threader
14390 (fsm_find_thread_path): Remove
14391 (back_threader::maybe_thread): New.
14392 (back_threader::thread_through_all_blocks): Move from
14394 (back_threader_registry::back_threader_registry): Remove
14396 (back_threader_registry::~back_threader_registry): Remove.
14397 (thread_jumps::find_taken_edge): Remove.
14398 (thread_jumps::check_subpath_and_update_thread_path): Remove.
14399 (thread_jumps::maybe_register_path): Remove.
14400 (thread_jumps::handle_phi): Remove.
14401 (handle_assignment_p): Remove.
14402 (thread_jumps::handle_assignment): Remove.
14403 (thread_jumps::fsm_find_control_statement_thread_paths): Remove.
14404 (thread_jumps::find_jump_threads_backwards): Remove.
14405 (thread_jumps::find_jump_threads_backwards_with_ranger): Remove.
14406 (try_thread_blocks): Rename find_jump_threads_backwards to
14408 (pass_early_thread_jumps::execute): Same.
14410 2021-08-12 Tobias Burnus <tobias@codesourcery.com>
14412 * tree-core.h (omp_clause_proc_bind_kind): Add
14413 OMP_CLAUSE_PROC_BIND_PRIMARY.
14414 * tree-pretty-print.c (dump_omp_clause): Add TODO comment to
14415 change 'master' to 'primary' in proc_bind for OpenMP 5.1.
14417 2021-08-12 Claudiu Zissulescu <claziss@synopsys.com>
14419 * common/config/arc/arc-common.c (arc_option_init_struct): Remove
14420 fno-common reference.
14421 * config/arc/arc.c (arc_override_options): Remove overriding of
14424 2021-08-12 Jakub Jelinek <jakub@redhat.com>
14427 * config/i386/i386-expand.c (ix86_expand_vec_one_operand_perm_avx512):
14428 If d->testing_p, return true after performing checks instead of
14429 actually expanding the insn.
14430 (expand_vec_perm_broadcast_1): Handle V32HImode - assert
14431 !TARGET_AVX512BW and return false.
14433 2021-08-12 Eric Botcazou <ebotcazou@gcc.gnu.org>
14435 * configure.ac (PE linker --disable-dynamicbase support): New check.
14436 * configure: Regenerate.
14437 * config.in: Likewise.
14438 * config/i386/mingw32.h (LINK_SPEC_DISABLE_DYNAMICBASE): New define.
14439 (LINK_SPEC): Use it.
14440 * config/i386/mingw-w64.h (LINK_SPEC_DISABLE_DYNAMICBASE): Likewise.
14441 (LINK_SPEC): Likewise.
14443 2021-08-12 liuhongt <hongtao.liu@intel.com>
14446 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_2): New
14447 post_reload define_insn_and_split.
14448 (*avx512bw_zero_extendv32qiv32hi2_2): Ditto.
14449 (*sse4_1_zero_extendv8qiv8hi2_4): Ditto.
14450 (*avx512f_zero_extendv16hiv16si2_2): Ditto.
14451 (*avx2_zero_extendv8hiv8si2_2): Ditto.
14452 (*sse4_1_zero_extendv4hiv4si2_4): Ditto.
14453 (*avx512f_zero_extendv8siv8di2_2): Ditto.
14454 (*avx2_zero_extendv4siv4di2_2): Ditto.
14455 (*sse4_1_zero_extendv2siv2di2_4): Ditto.
14456 (VI248_256, VI248_512, VI148_512, VI148_256, VI148_128): New
14459 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
14461 * config/rs6000/rs6000-builtin-new.def: Add always, power5, and
14464 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
14466 * config/rs6000/rs6000-builtin-new.def: Add vsx stanza.
14468 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
14470 * config/rs6000/rs6000-builtin-new.def: Finish altivec stanza.
14471 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
14472 initialization of pcvoid_type_node here...
14473 (altivec_init_builtins): ...from here.
14474 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add
14475 RS6000_BTI_const_ptr_void.
14476 (pcvoid_type_node): New macro.
14478 2021-08-11 Richard Biener <rguenther@suse.de>
14481 * tree-ssa-forwprop.c (pass_forwprop::execute): Do not decompose
14482 hard-register accesses.
14484 2021-08-11 Richard Biener <rguenther@suse.de>
14486 * tree-ssa-operands.c (operands_scanner::get_expr_operands):
14487 Do not look at COMPONENT_REF FIELD_DECLs TREE_THIS_VOLATILE
14488 to determine has_volatile_ops.
14490 2021-08-11 Eric Botcazou <ebotcazou@gcc.gnu.org>
14492 * cfgexpand.c (expand_used_vars): Reuse attribs local variable.
14494 2021-08-11 Jan Hubicka <hubicka@ucw.cz>
14495 Alexandre Oliva <oliva@adacore.com>
14497 * ipa-modref.c (modref_lattice::dump): Fix escape_point's min_flags
14499 (modref_lattice::merge_deref): Fix handling of indirect scape points.
14500 (update_escape_summary_1): Likewise.
14501 (update_escape_summary): Likewise.
14502 (ipa_merge_modref_summary_after_inlining): Likewise.
14504 2021-08-11 Richard Biener <rguenther@suse.de>
14506 PR middle-end/101858
14507 * fold-const.c (fold_binary_loc): Guard simplification
14508 of X < (cast) (1 << Y) to integer types.
14510 2021-08-11 Richard Biener <rguenther@suse.de>
14512 PR tree-optimization/101861
14513 * tree-vect-stmts.c (vectorizable_load): Fix error in
14514 previous change with regard to gather vectorization.
14516 2021-08-11 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
14519 * config/arm/arm_neon.h (vdup_n_s8): Replace call to builtin
14521 (vdup_n_s16): Likewise.
14522 (vdup_n_s32): Likewise.
14523 (vdup_n_s64): Likewise.
14524 (vdup_n_u8): Likewise.
14525 (vdup_n_u16): Likewise.
14526 (vdup_n_u32): Likewise.
14527 (vdup_n_u64): Likewise.
14528 (vdup_n_p8): Likewise.
14529 (vdup_n_p16): Likewise.
14530 (vdup_n_p64): Likewise.
14531 (vdup_n_f16): Likewise.
14532 (vdup_n_f32): Likewise.
14533 (vdupq_n_s8): Likewise.
14534 (vdupq_n_s16): Likewise.
14535 (vdupq_n_s32): Likewise.
14536 (vdupq_n_s64): Likewise.
14537 (vdupq_n_u8): Likewise.
14538 (vdupq_n_u16): Likewise.
14539 (vdupq_n_u32): Likewise.
14540 (vdupq_n_u64): Likewise.
14541 (vdupq_n_p8): Likewise.
14542 (vdupq_n_p16): Likewise.
14543 (vdupq_n_p64): Likewise.
14544 (vdupq_n_f16): Likewise.
14545 (vdupq_n_f32): Likewise.
14546 (vmov_n_s8): Replace call to builtin with call to corresponding
14548 (vmov_n_s16): Likewise.
14549 (vmov_n_s32): Likewise.
14550 (vmov_n_s64): Likewise.
14551 (vmov_n_u8): Likewise.
14552 (vmov_n_u16): Likewise.
14553 (vmov_n_u32): Likewise.
14554 (vmov_n_u64): Likewise.
14555 (vmov_n_p8): Likewise.
14556 (vmov_n_p16): Likewise.
14557 (vmov_n_f16): Likewise.
14558 (vmov_n_f32): Likewise.
14559 (vmovq_n_s8): Likewise.
14560 (vmovq_n_s16): Likewise.
14561 (vmovq_n_s32): Likewise.
14562 (vmovq_n_s64): Likewise.
14563 (vmovq_n_u8): Likewise.
14564 (vmovq_n_u16): Likewise.
14565 (vmovq_n_u32): Likewise.
14566 (vmovq_n_u64): Likewise.
14567 (vmovq_n_p8): Likewise.
14568 (vmovq_n_p16): Likewise.
14569 (vmovq_n_f16): Likewise.
14570 (vmovq_n_f32): Likewise.
14571 * config/arm/arm_neon_builtins.def: Remove entries for vdup_n.
14573 2021-08-11 liuhongt <hongtao.liu@intel.com>
14576 * config/i386/i386.md (ldexp<mode>3): Extend to vscalefs[sd]
14577 when TARGET_AVX512F and TARGET_SSE_MATH.
14579 2021-08-10 Jakub Jelinek <jakub@redhat.com>
14582 * config/i386/i386-expand.c (expand_vec_perm_even_odd): Return false
14583 for V32HImode if !TARGET_AVX512BW.
14584 (ix86_vectorize_vec_perm_const) <case E_V32HImode, case E_V64QImode>:
14585 If !TARGET_AVX512BW and TARGET_AVX512F and d.testing_p, don't fail
14586 early, but actually check the permutation.
14588 2021-08-10 Richard Biener <rguenther@suse.de>
14590 PR tree-optimization/101809
14591 * tree-vect-stmts.c (get_load_store_type): Allow emulated
14592 gathers with offset vector nunits being a constant multiple
14593 of the data vector nunits.
14594 (vect_get_gather_scatter_ops): Use the appropriate nunits
14595 for the offset vector defs.
14596 (vectorizable_store): Adjust call to
14597 vect_get_gather_scatter_ops.
14598 (vectorizable_load): Likewise. Handle the case of less
14599 offset vectors than data vectors.
14601 2021-08-10 Jakub Jelinek <jakub@redhat.com>
14604 * config/i386/sse.md (*avx512f_shuf_<shuffletype>64x2_1<mask_name>_1,
14605 *avx512f_shuf_<shuffletype>32x4_1<mask_name>_1): New define_insn
14608 2021-08-10 Richard Biener <rguenther@suse.de>
14610 PR tree-optimization/101801
14611 PR tree-optimization/101819
14612 * tree-vectorizer.h (vect_emulated_vector_p): Declare.
14613 * tree-vect-loop.c (vect_emulated_vector_p): New function.
14614 (vectorizable_reduction): Re-instantiate a check for emulated
14616 * tree-vect-stmts.c (vectorizable_shift): Likewise.
14617 (vectorizable_operation): Likewise. Cost emulated vector
14618 operations according to the scalar sequence synthesized by
14621 2021-08-10 Richard Biener <rguenther@suse.de>
14623 PR middle-end/101824
14624 * tree-nested.c (get_frame_field): Mark the COMPONENT_REF as
14625 volatile in case the variable was.
14627 2021-08-10 H.J. Lu <hjl.tools@gmail.com>
14630 * config/i386/constraints.md (BC): Document for integer SSE
14631 constant all bits set operand.
14632 (BF): New constraint for const floating-point all bits set
14634 * config/i386/i386.c (standard_sse_constant_p): Likewise.
14635 (standard_sse_constant_opcode): Likewise.
14636 * config/i386/sse.md (sseconstm1): New mode attribute.
14637 (mov<mode>_internal): Replace BC with <sseconstm1>.
14639 2021-08-10 liuhongt <hongtao.liu@intel.com>
14641 * config/i386/sse.md (cond_<insn><mode>): New expander.
14642 (VI248_AVX512VLBW): New mode iterator.
14643 * config/i386/predicates.md
14644 (nonimmediate_or_const_vec_dup_operand): New predicate.
14646 2021-08-09 Andrew MacLeod <amacleod@redhat.com>
14648 PR tree-optimization/101741
14649 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Check
14650 type of parameter for toupper/tolower.
14652 2021-08-09 Martin Jambor <mjambor@suse.cz>
14654 PR testsuite/101654
14655 * ipa-prop.c (propagate_controlled_uses): Removed a spurious space.
14657 2021-08-09 Pat Haugen <pthaugen@linux.ibm.com>
14659 * config/rs6000/rs6000.c (is_load_insn1): Verify destination is a
14661 (is_store_insn1): Verify source is a register.
14663 2021-08-09 Uroš Bizjak <ubizjak@gmail.com>
14666 * config/i386/mmx.md (<any_logic:code>v2sf3):
14667 Rename from *mmx_<any_logic:code>v2sf3
14669 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14671 * config/nvptx/nvptx.c: Cross-reference parts adapted in
14672 'gcc/omp-oacc-neuter-broadcast.cc'.
14673 * omp-low.c: Likewise.
14674 * omp-oacc-neuter-broadcast.cc: Cross-reference parts adapted from
14677 2021-08-09 Julian Brown <julian@codesourcery.com>
14678 Kwok Cheung Yeung <kcy@codesourcery.com>
14679 Thomas Schwinge <thomas@codesourcery.com>
14681 * config/gcn/gcn.c (gcn_init_builtins): Override decls for
14682 BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START,
14683 BUILT_IN_GOACC_SINGLE_COPY_END and BUILT_IN_GOACC_BARRIER.
14684 (gcn_goacc_validate_dims): Turn on worker partitioning unconditionally.
14685 (gcn_fork_join): Update comment.
14686 * config/gcn/gcn.opt (flag_worker_partitioning): Remove.
14687 (macc_experimental_workers): Remove unused option.
14689 2021-08-09 Julian Brown <julian@codesourcery.com>
14690 Nathan Sidwell <nathan@codesourcery.com> (via 'gcc/config/nvptx/nvptx.c' master)
14691 Kwok Cheung Yeung <kcy@codesourcery.com>
14692 Thomas Schwinge <thomas@codesourcery.com>
14694 * Makefile.in (OBJS): Add omp-oacc-neuter-broadcast.o.
14695 * doc/tm.texi.in (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD):
14696 Add documentation hook.
14697 * doc/tm.texi: Regenerate.
14698 * omp-oacc-neuter-broadcast.cc: New file.
14699 * omp-builtins.def (BUILT_IN_GOACC_BARRIER)
14700 (BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START)
14701 (BUILT_IN_GOACC_SINGLE_COPY_END): New builtins.
14702 * passes.def (pass_omp_oacc_neuter_broadcast): Add pass.
14703 * target.def (goacc.create_worker_broadcast_record): Add target
14705 * tree-pass.h (make_pass_omp_oacc_neuter_broadcast): Add
14707 * config/gcn/gcn-protos.h (gcn_goacc_adjust_propagation_record):
14708 Rename prototype to...
14709 (gcn_goacc_create_worker_broadcast_record): ... this.
14710 * config/gcn/gcn-tree.c (gcn_goacc_adjust_propagation_record): Rename
14712 (gcn_goacc_create_worker_broadcast_record): ... this.
14713 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_PROPAGATION_RECORD):
14715 (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD): ... this.
14717 2021-08-09 Tejas Belagod <tejas.belagod@arm.com>
14720 * config/aarch64/aarch64-simd.md (vlshr<mode>3, vashr<mode>3): Use
14721 the right iterator.
14723 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14725 * Makefile.in (GTFILES): Remove '$(srcdir)/omp-offload.c'.
14727 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14729 * builtins.def (DEF_GOACC_BUILTIN, DEF_GOMP_BUILTIN): Don't
14730 consider '-foffload-abi'.
14731 * common.opt (-foffload-abi): Remove 'Var', 'Init'.
14732 * opts.c (common_handle_option) <-foffload-abi> [ACCEL_COMPILER]:
14735 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14737 * optc-gen.awk: Sanity check that 'Init' doesn't appear without
14740 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14742 * omp-builtins.def (BUILT_IN_ACC_GET_DEVICE_TYPE): Remove.
14744 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14746 * doc/gty.texi (Files): Update.
14748 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14750 * doc/gty.texi (Files): Fix GTY header file example.
14752 2021-08-09 Roger Sayle <roger@nextmovesoftware.com>
14754 * tree-ssa-ccp.c (value_mask_to_min_max): Helper function to
14755 determine the upper and lower bounds from a mask-value pair.
14756 (bit_value_unop) [ABS_EXPR, ABSU_EXPR]: Add support for
14757 absolute value and unsigned absolute value expressions.
14758 (bit_value_binop): Initialize *VAL's precision.
14759 [LT_EXPR, LE_EXPR]: Use value_mask_to_min_max to determine
14760 upper and lower bounds of operands. Add LE_EXPR/GE_EXPR
14761 support when the operands are unknown but potentially equal.
14762 [MIN_EXPR, MAX_EXPR]: Support minimum/maximum expressions.
14764 2021-08-09 Bin Cheng <bin.cheng@linux.alibaba.com>
14766 * config/aarch64/aarch64.md
14767 (*extend<SHORT:mode><GPI:mode>2_aarch64): Use %<GPI:w>0.
14769 2021-08-08 Sergei Trofimovich <siarheit@google.com>
14771 * lra-constraints.c: Fix s/otput/output/ typo.
14773 2021-08-06 Martin Sebor <msebor@redhat.com>
14775 * builtins.c (expand_builtin_memchr): Move to gimple-ssa-warn-access.cc.
14776 (expand_builtin_strcat): Same.
14777 (expand_builtin_stpncpy): Same.
14778 (expand_builtin_strncat): Same.
14779 (check_read_access): Same.
14780 (check_memop_access): Same.
14781 (expand_builtin_strlen): Move checks to gimple-ssa-warn-access.cc.
14782 (expand_builtin_strnlen): Same.
14783 (expand_builtin_memcpy): Same.
14784 (expand_builtin_memmove): Same.
14785 (expand_builtin_mempcpy): Same.
14786 (expand_builtin_strcpy): Same.
14787 (expand_builtin_strcpy_args): Same.
14788 (expand_builtin_stpcpy_1): Same.
14789 (expand_builtin_strncpy): Same.
14790 (expand_builtin_memset): Same.
14791 (expand_builtin_bzero): Same.
14792 (expand_builtin_strcmp): Same.
14793 (expand_builtin_strncmp): Same.
14794 (expand_builtin): Remove handlers.
14795 (fold_builtin_strlen): Add a comment.
14796 * builtins.h (check_access): Move to gimple-ssa-warn-access.cc.
14797 * calls.c (maybe_warn_nonstring_arg): Same.
14798 * diagnostic-spec.c (nowarn_spec_t::nowarn_spec_t): Add warning option.
14799 * gimple-fold.c (gimple_fold_builtin_strcpy): Pass argument to callee.
14800 (gimple_fold_builtin_stpcpy): Same.
14801 * gimple-ssa-warn-access.cc (has_location): New function.
14802 (get_location): Same.
14803 (get_callee_fndecl): Same.
14804 (call_nargs): Same.
14806 (warn_string_no_nul): Define.
14807 (unterminated_array): Same.
14808 (check_nul_terminated_array): Same.
14809 (maybe_warn_nonstring_arg): Same.
14810 (maybe_warn_for_bound): Same.
14811 (warn_for_access): Same.
14812 (check_access): Same.
14813 (check_memop_access): Same.
14814 (check_read_access): Same.
14815 (warn_dealloc_offset): Use helper functions.
14816 (maybe_emit_free_warning): Same.
14817 (class pass_waccess): Add members.
14818 (check_strcat): New function.
14819 (check_strncat): New function.
14820 (check_stxcpy): New function.
14821 (check_stxncpy): New function.
14822 (check_strncmp): New function.
14823 (pass_waccess::check_builtin): New function.
14824 (pass_waccess::check): Call it.
14825 * gimple-ssa-warn-access.h (warn_string_no_nul): Move here from
14827 (maybe_warn_for_bound): Same.
14828 (check_access): Same.
14829 (check_memop_access): Same.
14830 (check_read_access): Same.
14831 * pointer-query.h (struct access_data): Define a ctor overload.
14833 2021-08-06 Richard Biener <rguenther@suse.de>
14835 PR tree-optimization/101801
14836 * tree-vectorizer.h (vect_worthwhile_without_simd_p): Rename...
14837 (vect_can_vectorize_without_simd_p): ... to this.
14838 * tree-vect-loop.c (vect_worthwhile_without_simd_p): Rename...
14839 (vect_can_vectorize_without_simd_p): ... to this and fold
14840 in vect_min_worthwhile_factor.
14841 (vect_min_worthwhile_factor): Remove.
14842 (vectorizable_reduction): Adjust and remove the cost part.
14843 * tree-vect-stmts.c (vectorizable_shift): Likewise.
14844 (vectorizable_operation): Likewise.
14846 2021-08-06 Uroš Bizjak <ubizjak@gmail.com>
14849 * config/i386/i386.md (cmove reg-to-reg move elimination peephole2s):
14850 Add general_gr_operand predicate to operand 3.
14852 2021-08-06 Roger Sayle <roger@nextmovesoftware.com>
14854 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern): Use
14855 CFN_BUILT_IN_CLRSB* instead of BUILT_IN_CLRSB* for consistency.
14857 2021-08-06 Tamar Christina <tamar.christina@arm.com>
14859 * config/aarch64/aarch64-sve-builtins.cc (register_svpattern,
14860 register_svprfop): Pass vec<> by pointer.
14861 * langhooks-def.h (lhd_simulate_enum_decl): Likewise.
14862 * langhooks.c (lhd_simulate_enum_decl): Likewise.
14863 * langhooks.h (struct lang_hooks_for_types): Likewise.
14865 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
14867 * config/aarch64/arm_neon.h (vst1_bf16_x2): Use
14868 __builtin_memcpy instead of constructing an additional
14869 __builtin_aarch64_simd_oi one vector at a time.
14870 (vst1q_bf16_x2): Likewise.
14871 (vst1_bf16_x3): Use __builtin_memcpy instead of constructing
14872 an additional __builtin_aarch64_simd_ci one vector at a time.
14873 (vst1q_bf16_x3): Likewise.
14874 (vst1_bf16_x4): Use __builtin_memcpy instead of a union.
14875 (vst1q_bf16_x4): Likewise.
14876 (vst2_bf16): Use __builtin_memcpy instead of constructing an
14877 additional __builtin_aarch64_simd_oi one vector at a time.
14878 (vst2q_bf16): Likewise.
14879 (vst3_bf16): Use __builtin_memcpy instead of constructing an
14880 additional __builtin_aarch64_simd_ci mode one vector at a
14882 (vst3q_bf16): Likewise.
14883 (vst4_bf16): Use __builtin_memcpy instead of constructing an
14884 additional __builtin_aarch64_simd_xi one vector at a time.
14885 (vst4q_bf16): Likewise.
14887 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
14889 * config/aarch64/arm_neon.h (__ST2_LANE_FUNC): Delete.
14890 (__ST2Q_LANE_FUNC): Delete.
14891 (vst2_lane_f16): Use __builtin_memcpy to copy vector
14892 structure instead of constructing __builtin_aarch64_simd_oi
14893 one vector at a time.
14894 (vst2_lane_f32): Likewise.
14895 (vst2_lane_f64): Likewise.
14896 (vst2_lane_p8): Likewise.
14897 (vst2_lane_p16): Likewise.
14898 (vst2_lane_p64): Likewise.
14899 (vst2_lane_s8): Likewise.
14900 (vst2_lane_s16): Likewise.
14901 (vst2_lane_s32): Likewise.
14902 (vst2_lane_s64): Likewise.
14903 (vst2_lane_u8): Likewise.
14904 (vst2_lane_u16): Likewise.
14905 (vst2_lane_u32): Likewise.
14906 (vst2_lane_u64): Likewise.
14907 (vst2_lane_bf16): Likewise.
14908 (vst2q_lane_f16): Use __builtin_memcpy to copy vector
14909 structure instead of using a union.
14910 (vst2q_lane_f32): Likewise.
14911 (vst2q_lane_f64): Likewise.
14912 (vst2q_lane_p8): Likewise.
14913 (vst2q_lane_p16): Likewise.
14914 (vst2q_lane_p64): Likewise.
14915 (vst2q_lane_s8): Likewise.
14916 (vst2q_lane_s16): Likewise.
14917 (vst2q_lane_s32): Likewise.
14918 (vst2q_lane_s64): Likewise.
14919 (vst2q_lane_u8): Likewise.
14920 (vst2q_lane_u16): Likewise.
14921 (vst2q_lane_u32): Likewise.
14922 (vst2q_lane_u64): Likewise.
14923 (vst2q_lane_bf16): Likewise.
14925 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
14927 * config/aarch64/arm_neon.h (__ST3_LANE_FUNC): Delete.
14928 (__ST3Q_LANE_FUNC): Delete.
14929 (vst3_lane_f16): Use __builtin_memcpy to copy vector
14930 structure instead of constructing __builtin_aarch64_simd_ci
14931 one vector at a time.
14932 (vst3_lane_f32): Likewise.
14933 (vst3_lane_f64): Likewise.
14934 (vst3_lane_p8): Likewise.
14935 (vst3_lane_p16): Likewise.
14936 (vst3_lane_p64): Likewise.
14937 (vst3_lane_s8): Likewise.
14938 (vst3_lane_s16): Likewise.
14939 (vst3_lane_s32): Likewise.
14940 (vst3_lane_s64): Likewise.
14941 (vst3_lane_u8): Likewise.
14942 (vst3_lane_u16): Likewise.
14943 (vst3_lane_u32): Likewise.
14944 (vst3_lane_u64): Likewise.
14945 (vst3_lane_bf16): Likewise.
14946 (vst3q_lane_f16): Use __builtin_memcpy to copy vector
14947 structure instead of using a union.
14948 (vst3q_lane_f32): Likewise.
14949 (vst3q_lane_f64): Likewise.
14950 (vst3q_lane_p8): Likewise.
14951 (vst3q_lane_p16): Likewise.
14952 (vst3q_lane_p64): Likewise.
14953 (vst3q_lane_s8): Likewise.
14954 (vst3q_lane_s16): Likewise.
14955 (vst3q_lane_s32): Likewise.
14956 (vst3q_lane_s64): Likewise.
14957 (vst3q_lane_u8): Likewise.
14958 (vst3q_lane_u16): Likewise.
14959 (vst3q_lane_u32): Likewise.
14960 (vst3q_lane_u64): Likewise.
14961 (vst3q_lane_bf16): Likewise.
14963 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
14965 * config/aarch64/arm_neon.h (__ST4_LANE_FUNC): Delete.
14966 (__ST4Q_LANE_FUNC): Delete.
14967 (vst4_lane_f16): Use __builtin_memcpy to copy vector
14968 structure instead of constructing __builtin_aarch64_simd_xi
14969 one vector at a time.
14970 (vst4_lane_f32): Likewise.
14971 (vst4_lane_f64): Likewise.
14972 (vst4_lane_p8): Likewise.
14973 (vst4_lane_p16): Likewise.
14974 (vst4_lane_p64): Likewise.
14975 (vst4_lane_s8): Likewise.
14976 (vst4_lane_s16): Likewise.
14977 (vst4_lane_s32): Likewise.
14978 (vst4_lane_s64): Likewise.
14979 (vst4_lane_u8): Likewise.
14980 (vst4_lane_u16): Likewise.
14981 (vst4_lane_u32): Likewise.
14982 (vst4_lane_u64): Likewise.
14983 (vst4_lane_bf16): Likewise.
14984 (vst4q_lane_f16): Use __builtin_memcpy to copy vector
14985 structure instead of using a union.
14986 (vst4q_lane_f32): Likewise.
14987 (vst4q_lane_f64): Likewise.
14988 (vst4q_lane_p8): Likewise.
14989 (vst4q_lane_p16): Likewise.
14990 (vst4q_lane_p64): Likewise.
14991 (vst4q_lane_s8): Likewise.
14992 (vst4q_lane_s16): Likewise.
14993 (vst4q_lane_s32): Likewise.
14994 (vst4q_lane_s64): Likewise.
14995 (vst4q_lane_u8): Likewise.
14996 (vst4q_lane_u16): Likewise.
14997 (vst4q_lane_u32): Likewise.
14998 (vst4q_lane_u64): Likewise.
14999 (vst4q_lane_bf16): Likewise.
15001 2021-08-06 Martin Liska <mliska@suse.cz>
15003 * config/rs6000/rs6000.c (rs6000_option_override_internal): When
15004 a target option is restored, it can have
15005 rs6000_long_double_type_size set to FLOAT_PRECISION_TFmode
15006 and error should not be emitted.
15008 2021-08-06 Sebastian Huber <sebastian.huber@embedded-brains.de>
15010 * gcov-io.h (gcov_write): Declare.
15011 * gcov-io.c (gcov_write): New.
15012 (gcov_write_counter): Remove.
15013 (gcov_write_tag_length): Likewise.
15014 (gcov_write_summary): Replace gcov_write_tag_length() with calls to
15015 gcov_write_unsigned().
15016 * doc/invoke.texi (fprofile-info-section): Mention
15017 __gcov_info_to_gdca().
15019 2021-08-06 Martin Sebor <msebor@redhat.com>
15021 * dominance.c (prune_bbs_to_update_dominators): Adjust by-value vec
15022 arguments to by-reference.
15023 (iterate_fix_dominators): Same.
15024 * dominance.h (iterate_fix_dominators): Same.
15025 * ipa-prop.h: Call auto_vec::to_vec_legacy.
15026 * tree-data-ref.c (dump_data_dependence_relation): Adjust by-value vec
15027 arguments to by-reference.
15028 (debug_data_dependence_relation): Same.
15029 (dump_data_dependence_relations): Same.
15030 * tree-data-ref.h (debug_data_dependence_relation): Same.
15031 (dump_data_dependence_relations): Same.
15032 * tree-predcom.c (dump_chains): Same.
15033 (initialize_root_vars_lm): Same.
15034 (determine_unroll_factor): Same.
15035 (replace_phis_by_defined_names): Same.
15036 (insert_init_seqs): Same.
15037 (pcom_worker::tree_predictive_commoning_loop): Call
15038 auto_vec::to_vec_legacy.
15039 * tree-ssa-pre.c (insert_into_preds_of_block): Adjust by-value vec
15040 arguments to by-reference.
15041 * tree-ssa-threadbackward.c (populate_worklist): Same.
15042 (back_threader::resolve_def): Same.
15043 * tree-vect-data-refs.c (vect_check_nonzero_value): Same.
15044 (vect_enhance_data_refs_alignment): Same.
15045 (vect_check_lower_bound): Same.
15046 (vect_prune_runtime_alias_test_list): Same.
15047 (vect_permute_store_chain): Same.
15048 * tree-vect-slp-patterns.c (vect_normalize_conj_loc): Same.
15049 * tree-vect-stmts.c (vect_create_vectorized_demotion_stmts): Same.
15050 * tree-vectorizer.h (vect_permute_store_chain): Same.
15051 * vec.c (test_init): New function.
15052 (vec_c_tests): Call new function.
15053 * vec.h (vec): Declare ctors, dtor, and assignment.
15054 (auto_vec::vec_to_legacy): New function.
15055 (vec::copy): Adjust initialization.
15057 2021-08-05 H.J. Lu <hjl.tools@gmail.com>
15060 * config/i386/i386.c (ix86_can_inline_p): Ignore MASK_80387 if
15061 callee only uses GPRs.
15062 * config/i386/ia32intrin.h: Revert commit 5463cee2770.
15063 * config/i386/serializeintrin.h: Revert commit 71958f740f1.
15064 * config/i386/x86gprintrin.h: Add
15065 #pragma GCC target("general-regs-only") and #pragma GCC pop_options
15066 to disable non-GPR ISAs.
15068 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
15070 PR middle-end/101787
15071 * doc/md.texi (cond_ashl, cond_ashr, cond_lshr): Document.
15073 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
15075 * tree-vectorizer.h (vect_is_store_elt_extraction, vect_is_reduction)
15076 (vect_reduc_type, vect_embedded_comparison_type, vect_comparison_type)
15077 (vect_is_extending_load, vect_is_integer_truncation): New functions,
15078 moved from aarch64.c but given different names.
15079 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction)
15080 (aarch64_is_reduction, aarch64_reduc_type)
15081 (aarch64_embedded_comparison_type, aarch64_comparison_type)
15082 (aarch64_extending_load_p, aarch64_integer_truncation_p): Delete
15083 in favor of the above. Update callers accordingly.
15085 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15088 * config/arm/arm-cpus.in (generic-armv7-a): Add quirk to suppress
15089 writing .cpu directive in asm output.
15090 * config/arm/arm.c (arm_identify_fpu_from_isa): New variable.
15091 (arm_last_printed_arch_string): Delete.
15092 (arm_last-printed_fpu_string): Delete.
15093 (arm_configure_build_target): If use of floating-point/SIMD is
15094 disabled, remove all fp/simd related features from the target ISA.
15095 (last_arm_targ_options): New variable.
15096 (arm_print_asm_arch_directives): Add new parameters. Change order
15097 of emitted directives and handle all cases here.
15098 (arm_file_start): Always call arm_print_asm_arch_directives, move
15099 all generation of .arch/.arch_extension here.
15100 (arm_file_end): Call arm_print_asm_arch.
15101 (arm_declare_function_name): Call arm_print_asm_arch_directives
15102 instead of printing .arch/.fpu directives directly.
15104 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15106 * config/arm/arm.c (arm_configure_build_target): Don't call
15107 arm_option_reconfigure_globals.
15108 (arm_option_restore): Call arm_option_reconfigure_globals after
15109 reconfiguring the target.
15110 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
15112 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15114 * config/arm/arm.c (arm_configure_build_target): Ensure the target's
15115 arch_name is always set.
15117 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
15119 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
15120 of vec_select high-half from being added into Neon subtract
15123 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
15125 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
15126 of vec_select high-half from being added into Neon add cost.
15128 2021-08-05 Kewen Lin <linkw@linux.ibm.com>
15130 * cfgloop.h (loops_list::loops_list): Add one optional argument
15131 root and adjust accordingly, update loop tree walking and factor
15133 * cfgloop.c (loops_list::walk_loop_tree): ... this. New function.
15135 2021-08-05 Eric Botcazou <ebotcazou@gcc.gnu.org>
15137 PR tree-optimization/101626
15138 * tree-sra.c (propagate_subaccesses_from_rhs): Do not set the
15139 reverse scalar storage order on a pointer or vector component.
15141 2021-08-05 liuhongt <hongtao.liu@intel.com>
15143 * config/i386/sse.md (cond_<code><mode>): New expander.
15145 2021-08-05 liuhongt <hongtao.liu@intel.com>
15147 * config/i386/sse.md (cond_<code><mode>): New expander.
15149 2021-08-05 liuhongt <hongtao.liu@intel.com>
15151 * config/i386/sse.md (cond_<code><mode>): New expander.
15153 2021-08-04 David Malcolm <dmalcolm@redhat.com>
15156 * Makefile.in (ANALYZER_OBJS): Add analyzer/region-model-asm.o.
15158 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
15161 * config/i386/i386.h (STORE_MAX_PIECES): Allow 16/32/64 bytes
15162 only if TARGET_INTER_UNIT_MOVES_TO_VEC is true.
15164 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
15167 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
15168 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
15169 data with SSE register from one memory location to another.
15171 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15173 * config/s390/s390.c (expand_perm_with_vpdi): New function.
15174 (vectorize_vec_perm_const_1): Call expand_perm_with_vpdi.
15175 * config/s390/vector.md (*vpdi1<mode>, @vpdi1<mode>): Enable a
15176 parameterized expander.
15177 (*vpdi4<mode>, @vpdi4<mode>): Likewise.
15179 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15181 * config/s390/s390.c (MAX_VECT_LEN): Define macro.
15182 (struct expand_vec_perm_d): Define struct.
15183 (expand_perm_with_merge): New function.
15184 (vectorize_vec_perm_const_1): New function.
15185 (s390_vectorize_vec_perm_const): New function.
15186 (TARGET_VECTORIZE_VEC_PERM_CONST): Define target macro.
15188 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15190 * config/s390/vector.md (V_HW_64): Remove mode iterator.
15191 (*vec_load_pair<mode>): Use V_HW_2 instead of V_HW_64.
15192 * config/s390/vx-builtins.md
15193 (vec_scatter_element<V_HW_2:mode>_SI): Use V_HW_2 instead of
15196 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15198 * config/s390/s390.md (UNSPEC_VEC_PERMI): Remove constant
15200 * config/s390/vector.md (*vpdi1<mode>, *vpdi4<mode>): New pattern
15202 * config/s390/vx-builtins.md (*vec_permi<mode>): Emit generic rtx
15203 instead of an unspec.
15205 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15207 * config/s390/s390-modes.def: Add more vector modes to support
15208 concatenation of two vectors.
15209 * config/s390/s390-protos.h (s390_expand_merge_perm_const): Add
15211 (s390_expand_merge): Likewise.
15212 * config/s390/s390.c (s390_expand_merge_perm_const): New function.
15213 (s390_expand_merge): New function.
15214 * config/s390/s390.md (UNSPEC_VEC_MERGEH, UNSPEC_VEC_MERGEL):
15215 Remove constant definitions.
15216 * config/s390/vector.md (V_HW_2): Add mode iterators.
15217 (VI_HW_4, V_HW_4): Rename VI_HW_4 to V_HW_4.
15218 (vec_2x_nelts, vec_2x_wide): New mode attributes.
15219 (*vmrhb, *vmrlb, *vmrhh, *vmrlh, *vmrhf, *vmrlf, *vmrhg, *vmrlg):
15220 New pattern definitions.
15221 (vec_widen_umult_lo_<mode>, vec_widen_umult_hi_<mode>)
15222 (vec_widen_smult_lo_<mode>, vec_widen_smult_hi_<mode>)
15223 (vec_unpacks_lo_v4sf, vec_unpacks_hi_v4sf, vec_unpacks_lo_v2df)
15224 (vec_unpacks_hi_v2df): Adjust expanders to emit non-unspec RTX for
15226 * config/s390/vx-builtins.md (V_HW_4): Remove mode iterator. Now
15228 (vec_mergeh<mode>, vec_mergel<mode>): Use s390_expand_merge to
15229 emit vec merge pattern.
15231 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
15233 * config/aarch64/aarch64.c (aarch64_strip_extend_vec_half):
15235 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent cost of
15236 vec_select high-half from being added into Neon multiply
15238 * rtlanal.c (vec_series_highpart_p): Define.
15239 * rtlanal.h (vec_series_highpart_p): Declare.
15241 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
15243 * config/aarch64/aarch64.c (aarch64_strip_duplicate_vec_elt):
15245 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent
15246 vec_select cost from being added into Neon multiply cost.
15248 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
15250 * tree-vect-loop.c (vect_better_loop_vinfo_p): Detect cases in
15251 which old_loop_vinfo is an epilogue loop that handles a constant
15252 number of iterations.
15254 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
15256 * tree-vect-loop.c (vect_analyze_loop): Print a dump message
15257 when a reanalyzed loop fails to be cheaper than the current
15260 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
15262 * config/aarch64/aarch64.c: Fix a typo.
15264 2021-08-04 Vincent Lefèvre <vincent-gcc@vinc17.net>
15266 PR gcov-profile/101773
15267 * gcov-io.c (gcov_close): Check return code of a fclose.
15269 2021-08-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
15272 * dwarf2out.c (dwarf2out_assembly_start): Emit a dummy
15273 .file statement when needed.
15275 2021-08-04 Richard Biener <rguenther@suse.de>
15277 * tree-vect-data-refs.c (vect_check_gather_scatter):
15278 Include widening conversions only when the result is
15279 still handed by native gather or the current offset
15280 size not already matches the data size.
15281 Also succeed analysis in case there's no native support,
15282 noted by a IFN_LAST ifn and a NULL decl.
15283 (vect_analyze_data_refs): Always consider gathers.
15284 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
15285 Test for no IFN gather rather than decl gather.
15286 * tree-vect-stmts.c (vect_model_load_cost): Pass in the
15287 gather-scatter info and cost emulated gathers accordingly.
15288 (vect_truncate_gather_scatter_offset): Properly test for
15290 (vect_use_strided_gather_scatters_p): Likewise.
15291 (get_load_store_type): Handle emulated gathers and its
15293 (vectorizable_load): Likewise. Emulate them by extracting
15294 scalar offsets, doing scalar loads and a vector construct.
15296 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
15299 * expr.c (op_by_pieces_d::op_by_pieces_d): Add a max_pieces
15300 argument to set m_max_size.
15301 (move_by_pieces_d): Pass MOVE_MAX_PIECES to op_by_pieces_d.
15302 (store_by_pieces_d): Pass STORE_MAX_PIECES to op_by_pieces_d.
15303 (compare_by_pieces_d): Pass COMPARE_MAX_PIECES to op_by_pieces_d.
15305 2021-08-04 Roger Sayle <roger@nextmovesoftware.com>
15306 Marc Glisse <marc.glisse@inria.fr>
15308 * match.pd (bit_ior, bit_xor): Canonicalize (X*C1)|(X*C2) and
15309 (X*C1)^(X*C2) as X*(C1+C2), and related variants, using
15310 tree_nonzero_bits to ensure that operands are bit-wise disjoint.
15312 2021-08-04 Richard Biener <rguenther@suse.de>
15314 * tree-ssa-forwprop.c (pass_forwprop::execute): Split
15315 out code to decompose vector loads ...
15316 (optimize_vector_load): ... here. Generalize it to
15317 handle intermediate widening and TARGET_MEM_REF loads
15318 and apply it to loads with a supported vector mode as well.
15320 2021-08-04 Richard Biener <rguenther@suse.de>
15322 PR tree-optimization/101756
15323 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Make sure
15324 the result of the reduction epilogue is compatible to the original
15327 2021-08-04 liuhongt <hongtao.liu@intel.com>
15330 * config/i386/i386.md (peephole2): Refine predicate from
15331 register_operand to general_reg_operand.
15333 2021-08-04 Aldy Hernandez <aldyh@redhat.com>
15335 * gimple-range-path.h (path_range_query::dump): Mark override.
15337 2021-08-04 Richard Biener <rguenther@suse.de>
15339 PR tree-optimization/101769
15340 * tree-tailcall.c (eliminate_tail_call): Add the created loop
15341 for the first recursion and return it via the new output parameter.
15342 (optimize_tail_call): Pass through new output param.
15343 (tree_optimize_tail_calls_1): After creating all latches,
15344 add the created loop to the loop tree. Do not mark loops for fixup.
15346 2021-08-04 Martin Liska <mliska@suse.cz>
15348 * doc/invoke.texi: Document threader-mode param.
15350 2021-08-04 liuhongt <hongtao.liu@intel.com>
15352 * config/i386/sse.md (cond_fma<mode>): New expander.
15353 (cond_fms<mode>): Ditto.
15354 (cond_fnma<mode>): Ditto.
15355 (cond_fnms<mode>): Ditto.
15357 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
15359 * config/rs6000/vsx.md (*vsx_le_perm_store_<mode>): Use && instead of &.
15361 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
15363 * config/rs6000/constraints.md: Remove "e" from the list of available
15364 constraint characters.
15366 2021-08-03 Eugene Rozenfeld <erozen@microsoft.com>
15368 PR gcov-profile/71672
15369 * auto-profile.c (afdo_indirect_call): Fix setup of the historgram value for indirect calls.
15371 2021-08-03 Paul A. Clarke <pc@us.ibm.com>
15373 * config/rs6000/smmintrin.h (_mm_minpos_epu16): New.
15375 2021-08-03 H.J. Lu <hjl.tools@gmail.com>
15377 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode,
15378 try XMM31 to avoid vzeroupper.
15380 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15382 * doc/invoke.texi: Document -mtune=neoverse-512tvb and
15383 -mcpu=neoverse-512tvb.
15384 * config/aarch64/aarch64-cores.def (neoverse-512tvb): New entry.
15385 * config/aarch64/aarch64-tune.md: Regenerate.
15386 * config/aarch64/aarch64.c (neoverse512tvb_sve_vector_cost)
15387 (neoverse512tvb_sve_issue_info, neoverse512tvb_vec_issue_info)
15388 (neoverse512tvb_vector_cost, neoverse512tvb_tunings): New structures.
15389 (aarch64_adjust_body_cost_sve): Handle -mtune=neoverse-512tvb.
15390 (aarch64_adjust_body_cost): Likewise.
15392 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15394 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Only
15395 record issue information for operations that occur in the
15398 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15400 * config/aarch64/aarch64.c (aarch64_multiply_add_p): Add a vec_flags
15401 parameter. Detect cases in which an Advanced SIMD MLA would almost
15402 certainly require a MOV.
15403 (aarch64_count_ops): Update accordingly.
15405 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15407 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction): New
15408 function, split out from...
15409 (aarch64_detect_vector_stmt_subtype): ...here.
15410 (aarch64_add_stmt_cost): Treat extracting element 0 as free.
15412 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15414 * config/aarch64/aarch64-protos.h (sve_vec_cost):
15415 Add gather_load_x32_cost and gather_load_x64_cost.
15416 * config/aarch64/aarch64.c (generic_sve_vector_cost)
15417 (a64fx_sve_vector_cost, neoversev1_sve_vector_cost): Update
15418 accordingly, using the values given by the scalar_load * number
15419 of elements calculation that we used previously.
15420 (aarch64_detect_vector_stmt_subtype): Use the new fields.
15422 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15424 * config/aarch64/aarch64.c (aarch64_adjust_body_cost_sve): New
15425 function, split out from...
15426 (aarch64_adjust_body_cost): ...here.
15428 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15430 * config/aarch64/fractional-cost.h: New file.
15431 * config/aarch64/aarch64.c: Include <algorithm> (indirectly)
15432 and cost_fraction.h.
15433 (vec_cost_fraction): New typedef.
15434 (aarch64_detect_scalar_stmt_subtype): Use it for statement costs.
15435 (aarch64_detect_vector_stmt_subtype): Likewise.
15436 (aarch64_sve_adjust_stmt_cost, aarch64_adjust_stmt_cost): Likewise.
15437 (aarch64_estimate_min_cycles_per_iter): Use vec_cost_fraction
15439 (aarch64_adjust_body_cost): Likewise.
15440 (aarch64_test_cost_fraction): New function.
15441 (aarch64_run_selftests): Call it.
15443 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15445 * config/aarch64/aarch64-protos.h (tune_params::sve_width): Turn
15447 * config/aarch64/aarch64.c (aarch64_cmp_autovec_modes): Update
15449 (aarch64_estimated_poly_value): Likewise. Use the least significant
15450 set bit for the minimum and likely values. Use the most significant
15451 set bit for the maximum value.
15453 2021-08-03 liuhongt <hongtao.liu@intel.com>
15455 * config/i386/sse.md (cond_<insn><mode>): New expander.
15456 (cond_mul<mode>): Ditto.
15458 2021-08-03 Kewen Lin <linkw@linux.ibm.com>
15460 * tree-cfg.c (move_sese_region_to_fn): Fix typos on dloop.
15462 2021-08-03 liuhongt <hongtao.liu@intel.com>
15464 * config/i386/sse.md (cond_<insn><mode>):New expander.
15465 (cond_mul<mode>): Ditto.
15466 (cond_div<mode>): Ditto.
15468 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
15470 * config/i386/i386.c (ix86_finalize_stack_frame_flags): Also
15471 check stack_realign_needed for stack realignment.
15472 (ix86_legitimate_constant_p): Always allow CONST_WIDE_INT smaller
15473 than the largest integer supported by vector register.
15474 * config/i386/i386.h (MAX_MOVE_MAX): New. Set to 64.
15475 (MOVE_MAX): Set to bytes of the largest integer supported by
15477 (STORE_MAX_PIECES): New.
15479 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
15481 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
15482 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
15483 data from one memory location to another.
15485 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
15487 PR middle-end/90773
15488 * config/i386/i386.c (TARGET_GEN_MEMSET_SCRATCH_RTX): New.
15490 2021-08-02 Aldy Hernandez <aldyh@redhat.com>
15492 PR tree-optimization/101724
15493 * params.opt: Remove --param=threader-iterative.
15494 * tree-ssa-threadbackward.c (pass_thread_jumps::execute): Remove
15497 2021-08-02 Tom de Vries <tdevries@suse.de>
15499 PR middle-end/101665
15500 * doc/extend.texi (nonnull attribute): Improve documentation.
15502 2021-08-02 Andrew Pinski <apinski@marvell.com>
15504 PR rtl-optimization/101683
15505 * rtlanal.c (may_trap_p_1): Handle UNSIGNED_FIX.
15507 2021-08-02 Roger Sayle <roger@nextmovesoftware.com>
15509 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern):
15510 Renamed from cond_removal_in_popcount_clz_ctz_pattern.
15511 Add support for BSWAP, FFS, PARITY and CLRSB builtins.
15512 (tree_ssa_phiop_worker): Update call to function above.
15514 2021-08-01 H.J. Lu <hjl.tools@gmail.com>
15517 * config/i386/i386.md (bsr_rex64_1_zext): New.
15518 (combine splitter for constant - clzll): Replace gen_bsr_rex64_1
15519 with gen_bsr_rex64_1_zext.
15521 2021-07-31 Jakub Jelinek <jakub@redhat.com>
15524 * config/i386/i386.md (bsr_rex64_1, bsr_1, bsr_zext_1): New
15525 define_insn patterns.
15526 (*bsr_rex64_2, *bsr_2): New define_insn_and_split patterns.
15527 Add combine splitters for constant - clz.
15528 (clz<mode>2): Use a temporary pseudo for bsr result.
15530 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
15532 * config/rs6000/smmintrin.h (_mm_floor_pd, _mm_floor_ps,
15533 _mm_floor_sd, _mm_floor_ss): New.
15535 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
15537 * config/rs6000/smmintrin.h (_mm_ceil_pd, _mm_ceil_ps,
15538 _mm_ceil_sd, _mm_ceil_ss): New.
15540 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
15542 * config/rs6000/smmintrin.h (_mm_blend_pd, _mm_blendv_pd,
15543 _mm_blend_ps, _mm_blendv_ps): New.
15545 2021-07-30 Roger Sayle <roger@nextmovesoftware.com>
15546 Uroš Bizjak <ubizjak@gmail.com>
15548 * config/i386/i386.md (*dec_cmov<mode>): New define_insn_and_split
15549 to generate a conditional move using the carry flag after sub $1.
15550 (peephole2): Eliminate a register-to-register move by inverting
15551 the condition of a conditional move.
15553 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
15555 * config/mmix/mmix.md ("call", "call_value", "*call_real")
15556 ("*call_value_real"): Don't generate rtx mentioning the generic
15557 operands 1 and 2 to "call", and similarly for "call_value".
15558 * config/mmix/mmix.c (mmix_print_operand_punct_valid_p)
15559 (mmix_print_operand): Use '!' instead of 'p'.
15561 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
15563 * doc/md.texi (call): Correct information about operand 2.
15564 * config/mmix/mmix.md ("call", "call_value"): Remove fixed FIXMEs.
15566 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
15568 * range-op.cc (operator_trunc_mod::wi_fold): Fold constants.
15570 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
15572 * range-op.cc (operator_div::wi_fold): Return UNDEFINED for [0, 0] divisor.
15574 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
15576 * gimple-range-cache.cc (*::set_bb_range): Change const basic_block to
15577 const_basic_block..
15578 (*::get_bb_range): Ditto.
15579 (*::bb_range_p): Ditto.
15580 * gimple-range-cache.h: Change prototypes.
15582 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
15584 PR middle-end/90773
15585 * builtins.c (builtin_memcpy_read_str): Change the mode argument
15586 from scalar_int_mode to fixed_size_mode.
15587 (builtin_strncpy_read_str): Likewise.
15588 (gen_memset_value_from_prev): New function.
15589 (builtin_memset_read_str): Change the mode argument from
15590 scalar_int_mode to fixed_size_mode. Use gen_memset_value_from_prev
15591 and support CONST_VECTOR.
15592 (builtin_memset_gen_str): Likewise.
15593 (try_store_by_multiple_pieces): Use by_pieces_constfn to declare
15595 * builtins.h (builtin_strncpy_read_str): Replace scalar_int_mode
15596 with fixed_size_mode.
15597 (builtin_memset_read_str): Likewise.
15598 * expr.c (widest_int_mode_for_size): Renamed to ...
15599 (widest_fixed_size_mode_for_size): Add a bool argument to
15600 indicate if QI vector mode can be used.
15601 (by_pieces_ninsns): Call widest_fixed_size_mode_for_size
15602 instead of widest_int_mode_for_size.
15603 (pieces_addr::adjust): Change the mode argument from
15604 scalar_int_mode to fixed_size_mode.
15605 (op_by_pieces_d): Make m_len read-only. Add a bool member,
15606 m_qi_vector_mode, to indicate that QI vector mode can be used.
15607 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
15608 initialize m_qi_vector_mode. Call widest_fixed_size_mode_for_size
15609 instead of widest_int_mode_for_size.
15610 (op_by_pieces_d::get_usable_mode): Change the mode argument from
15611 scalar_int_mode to fixed_size_mode. Call
15612 widest_fixed_size_mode_for_size instead of
15613 widest_int_mode_for_size.
15614 (op_by_pieces_d::smallest_fixed_size_mode_for_size): New member
15615 function to return the smallest integer or QI vector mode.
15616 (op_by_pieces_d::run): Call widest_fixed_size_mode_for_size
15617 instead of widest_int_mode_for_size. Call
15618 smallest_fixed_size_mode_for_size instead of
15619 smallest_int_mode_for_size.
15620 (store_by_pieces_d::store_by_pieces_d): Add a bool argument to
15621 indicate that QI vector mode can be used and pass it to
15622 op_by_pieces_d::op_by_pieces_d.
15623 (can_store_by_pieces): Call widest_fixed_size_mode_for_size
15624 instead of widest_int_mode_for_size. Pass memsetp to
15625 widest_fixed_size_mode_for_size to support QI vector mode.
15626 Allow all CONST_VECTORs for memset if vec_duplicate is supported.
15627 (store_by_pieces): Pass memsetp to
15628 store_by_pieces_d::store_by_pieces_d.
15629 (clear_by_pieces_1): Removed.
15630 (clear_by_pieces): Replace clear_by_pieces_1 with
15631 builtin_memset_read_str and pass true to store_by_pieces_d to
15632 support vector mode broadcast.
15633 (string_cst_read_str): Change the mode argument from
15634 scalar_int_mode to fixed_size_mode.
15635 * expr.h (by_pieces_constfn): Change scalar_int_mode to
15637 (by_pieces_prev): Likewise.
15638 * rtl.h (lowpart_subreg_regno): New.
15639 * rtlanal.c (lowpart_subreg_regno): New. A wrapper around
15640 simplify_subreg_regno.
15641 * target.def (gen_memset_scratch_rtx): New hook.
15642 * doc/tm.texi.in: Add TARGET_GEN_MEMSET_SCRATCH_RTX.
15643 * doc/tm.texi: Regenerated.
15645 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
15648 * config/mips/mips.c (mips_atomic_assign_expand_fenv): Use
15649 TARGET_EXPR instead of MODIFY_EXPR.
15651 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
15654 * config/mips/mips-protos.h (mips_expand_vec_cmp_expr): Declare.
15655 * config/mips/mips.c (mips_expand_vec_cmp_expr): New function.
15656 * config/mips/mips-msa.md (vec_cmp<MSA:mode><mode_i>): New
15658 (vec_cmpu<IMSA:mode><mode_i>): New expander.
15660 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
15663 * config/i386/i386-options.c (ix86_option_override_internal):
15664 Don't enable LZCNT/POPCNT if they have been disabled explicitly.
15666 2021-07-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
15669 * config/arm/arm_neon.h (vld1_p64): Replace call to builtin by
15670 explicitly dereferencing __a.
15671 (vld1_s64): Likewise.
15672 (vld1_u64): Likewise.
15673 * config/arm/arm_neon_builtins.def (vld1): Remove entry for di
15674 and change to VAR13.
15676 2021-07-30 Aldy Hernandez <aldyh@redhat.com>
15678 * gimple-loop-versioning.cc (lv_dom_walker::lv_dom_walker): Remove
15679 use of m_range_analyzer.
15680 (loop_versioning::lv_dom_walker::before_dom_children): Same.
15681 (loop_versioning::lv_dom_walker::after_dom_children): Remove.
15682 (loop_versioning::prune_loop_conditions): Replace vr_values use
15683 with range_query interface.
15684 (pass_loop_versioning::execute): Use ranger.
15686 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
15689 * ipa-devirt.c (ipa_odr_read_section): Compare the precision of
15690 enum values, and emit a warning if they mismatch.
15692 2021-07-30 Kewen Lin <linkw@linux.ibm.com>
15694 * cfgloop.h (as_const): New function.
15695 (class loop_iterator): Rename to ...
15696 (class loops_list): ... this.
15697 (loop_iterator::next): Rename to ...
15698 (loops_list::Iter::fill_curr_loop): ... this and adjust.
15699 (loop_iterator::loop_iterator): Rename to ...
15700 (loops_list::loops_list): ... this and adjust.
15701 (loops_list::Iter): New class.
15702 (loops_list::iterator): New type.
15703 (loops_list::const_iterator): New type.
15704 (loops_list::begin): New function.
15705 (loops_list::end): Likewise.
15706 (loops_list::begin const): Likewise.
15707 (loops_list::end const): Likewise.
15708 (FOR_EACH_LOOP): Remove.
15709 (FOR_EACH_LOOP_FN): Remove.
15710 * cfgloop.c (flow_loops_dump): Adjust FOR_EACH_LOOP* with range-based
15711 for loop with loops_list instance.
15712 (sort_sibling_loops): Likewise.
15713 (disambiguate_loops_with_multiple_latches): Likewise.
15714 (verify_loop_structure): Likewise.
15715 * cfgloopmanip.c (create_preheaders): Likewise.
15716 (force_single_succ_latches): Likewise.
15717 * config/aarch64/falkor-tag-collision-avoidance.c
15718 (execute_tag_collision_avoidance): Likewise.
15719 * config/mn10300/mn10300.c (mn10300_scan_for_setlb_lcc): Likewise.
15720 * config/s390/s390.c (s390_adjust_loops): Likewise.
15721 * doc/loop.texi: Likewise.
15722 * gimple-loop-interchange.cc (pass_linterchange::execute): Likewise.
15723 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
15724 * gimple-loop-versioning.cc (loop_versioning::analyze_blocks): Likewise.
15725 (loop_versioning::make_versioning_decisions): Likewise.
15726 * gimple-ssa-split-paths.c (split_paths): Likewise.
15727 * graphite-isl-ast-to-gimple.c (graphite_regenerate_ast_isl): Likewise.
15728 * graphite.c (canonicalize_loop_form): Likewise.
15729 (graphite_transform_loops): Likewise.
15730 * ipa-fnsummary.c (analyze_function_body): Likewise.
15731 * ipa-pure-const.c (analyze_function): Likewise.
15732 * loop-doloop.c (doloop_optimize_loops): Likewise.
15733 * loop-init.c (loop_optimizer_finalize): Likewise.
15734 (fix_loop_structure): Likewise.
15735 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
15736 (move_loop_invariants): Likewise.
15737 * loop-unroll.c (decide_unrolling): Likewise.
15738 (unroll_loops): Likewise.
15739 * modulo-sched.c (sms_schedule): Likewise.
15740 * predict.c (predict_loops): Likewise.
15741 (pass_profile::execute): Likewise.
15742 * profile.c (branch_prob): Likewise.
15743 * sel-sched-ir.c (sel_finish_pipelining): Likewise.
15744 (sel_find_rgns): Likewise.
15745 * tree-cfg.c (replace_loop_annotate): Likewise.
15746 (replace_uses_by): Likewise.
15747 (move_sese_region_to_fn): Likewise.
15748 * tree-if-conv.c (pass_if_conversion::execute): Likewise.
15749 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
15750 * tree-parloops.c (parallelize_loops): Likewise.
15751 * tree-predcom.c (tree_predictive_commoning): Likewise.
15752 * tree-scalar-evolution.c (scev_initialize): Likewise.
15753 (scev_reset): Likewise.
15754 * tree-ssa-dce.c (find_obviously_necessary_stmts): Likewise.
15755 * tree-ssa-live.c (remove_unused_locals): Likewise.
15756 * tree-ssa-loop-ch.c (ch_base::copy_headers): Likewise.
15757 * tree-ssa-loop-im.c (analyze_memory_references): Likewise.
15758 (tree_ssa_lim_initialize): Likewise.
15759 * tree-ssa-loop-ivcanon.c (canonicalize_induction_variables): Likewise.
15760 * tree-ssa-loop-ivopts.c (tree_ssa_iv_optimize): Likewise.
15761 * tree-ssa-loop-manip.c (get_loops_exits): Likewise.
15762 * tree-ssa-loop-niter.c (estimate_numbers_of_iterations): Likewise.
15763 (free_numbers_of_iterations_estimates): Likewise.
15764 * tree-ssa-loop-prefetch.c (tree_ssa_prefetch_arrays): Likewise.
15765 * tree-ssa-loop-split.c (tree_ssa_split_loops): Likewise.
15766 * tree-ssa-loop-unswitch.c (tree_ssa_unswitch_loops): Likewise.
15767 * tree-ssa-loop.c (gate_oacc_kernels): Likewise.
15768 (pass_scev_cprop::execute): Likewise.
15769 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
15770 * tree-ssa-sccvn.c (do_rpo_vn): Likewise.
15771 * tree-ssa-threadupdate.c
15772 (jump_thread_path_registry::thread_through_all_blocks): Likewise.
15773 * tree-vectorizer.c (vectorize_loops): Likewise.
15774 * tree-vrp.c (vrp_asserts::find_assert_locations): Likewise.
15776 2021-07-29 Hans-Peter Nilsson <hp@bitrange.com>
15778 * config/mmix/mmix.c (mmix_function_arg_1): Avoid
15779 generating a VOIDmode register for e.g the
15780 function_arg_info::end_marker.
15782 2021-07-29 Jeff Law <jeffreyalaw@gmail.com>
15784 * config/h8300/h8300-modes.def: Add CCZ, CCV and CCC, drop CCZNV.
15785 * config/h8300/h8300.md (H8cc mode iterator): Add CCZ.
15786 (cc mode_attr): Similarly.
15787 (ccz subst_attr): Similarly.
15788 * config/h8300/jumpcall.md: Add new patterns for branch-on-bit.
15789 * config/h8300/testcompare.md: Remove various cc0 based patterns
15790 that had been commented out. Add pattern to set CCZ from a bit
15793 2021-07-29 Thomas Schwinge <thomas@codesourcery.com>
15794 Julian Brown <julian@codesourcery.com>
15795 Kwok Cheung Yeung <kcy@codesourcery.com>
15797 * omp-offload.c (oacc_loop_xform_head_tail, oacc_loop_process):
15798 'update_stmt' after modification.
15799 (pass_oacc_loop_designation): New function, extracted out of...
15800 (pass_oacc_device_lower): ... this.
15801 (pass_data_oacc_loop_designation, pass_oacc_loop_designation)
15802 (make_pass_oacc_loop_designation): New
15803 * passes.def: Add it.
15804 * tree-parloops.c (create_parallel_loop): Adjust.
15805 * tree-pass.h (make_pass_oacc_loop_designation): New.
15807 2021-07-29 Aldy Hernandez <aldyh@redhat.com>
15809 * flag-types.h (enum threader_mode): New.
15810 * params.opt: Add entry for --param=threader-mode.
15811 * tree-ssa-threadbackward.c (THREADER_ITERATIVE_MODE): New.
15812 (class back_threader): New.
15813 (back_threader::back_threader): New.
15814 (back_threader::~back_threader): New.
15815 (back_threader::maybe_register_path): New.
15816 (back_threader::find_taken_edge): New.
15817 (back_threader::find_taken_edge_switch): New.
15818 (back_threader::find_taken_edge_cond): New.
15819 (back_threader::resolve_def): New.
15820 (back_threader::resolve_phi): New.
15821 (back_threader::find_paths_to_names): New.
15822 (back_threader::find_paths): New.
15825 (thread_jumps::find_jump_threads_backwards): Call ranger threader.
15826 (thread_jumps::find_jump_threads_backwards_with_ranger): New.
15827 (pass_thread_jumps::execute): Abstract out code...
15828 (try_thread_blocks): ...here.
15829 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
15830 Abstract out threading candidate code to...
15831 (single_succ_to_potentially_threadable_block): ...here.
15832 * tree-ssa-threadedge.h (single_succ_to_potentially_threadable_block):
15834 * tree-ssa-threadupdate.c (register_jump_thread): Return boolean.
15835 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
15836 Return bool from register_jump_thread.
15838 2021-07-29 Andreas Krebbel <krebbel@linux.ibm.com>
15840 * target.def: in0 and in1 do not need to be registers.
15841 * doc/tm.texi: Regenerate.
15843 2021-07-29 liuhongt <hongtao.liu@intel.com>
15846 * config/i386/i386.c (ix86_widen_mult_cost): New function.
15847 (ix86_add_stmt_cost): Use ix86_widen_mult_cost for
15850 2021-07-29 Jiufu Guo <guojiufu@linux.ibm.com>
15853 * config/rs6000/rs6000.c (TARGET_PREFERRED_DOLOOP_MODE): New hook.
15854 (rs6000_preferred_doloop_mode): New hook.
15855 * doc/tm.texi: Regenerate.
15856 * doc/tm.texi.in: Add hook preferred_doloop_mode.
15857 * target.def (preferred_doloop_mode): New hook.
15858 * targhooks.c (default_preferred_doloop_mode): New hook.
15859 * targhooks.h (default_preferred_doloop_mode): New hook.
15860 * tree-ssa-loop-ivopts.c (compute_doloop_base_on_mode): New function.
15861 (add_iv_candidate_for_doloop): Call targetm.preferred_doloop_mode
15862 and compute_doloop_base_on_mode.
15864 2021-07-28 Martin Sebor <msebor@redhat.com>
15866 PR middle-end/101494
15867 * tree-ssa-uninit.c (maybe_warn_operand): Correct object offset
15868 and size computation.
15870 2021-07-28 Martin Sebor <msebor@redhat.com>
15872 PR middle-end/101601
15873 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Remove
15875 Handle pointers to functions.
15877 2021-07-28 Martin Sebor <msebor@redhat.com>
15879 * Makefile.in (OBJS): Add gimple-ssa-warn-access.o and pointer-query.o.
15880 * attribs.h (fndecl_dealloc_argno): Move fndecl_dealloc_argno to tree.h.
15881 * builtins.c (compute_objsize_r): Move to pointer-query.cc.
15882 (access_ref::access_ref): Same.
15883 (access_ref::phi): Same.
15884 (access_ref::get_ref): Same.
15885 (access_ref::size_remaining): Same.
15886 (access_ref::offset_in_range): Same.
15887 (access_ref::add_offset): Same.
15888 (access_ref::inform_access): Same.
15889 (ssa_name_limit_t::visit_phi): Same.
15890 (ssa_name_limit_t::leave_phi): Same.
15891 (ssa_name_limit_t::next): Same.
15892 (ssa_name_limit_t::next_phi): Same.
15893 (ssa_name_limit_t::~ssa_name_limit_t): Same.
15894 (pointer_query::pointer_query): Same.
15895 (pointer_query::get_ref): Same.
15896 (pointer_query::put_ref): Same.
15897 (pointer_query::flush_cache): Same.
15898 (warn_string_no_nul): Move to gimple-ssa-warn-access.cc.
15899 (check_nul_terminated_array): Same.
15900 (unterminated_array): Same.
15901 (maybe_warn_for_bound): Same.
15902 (check_read_access): Same.
15903 (warn_for_access): Same.
15904 (get_size_range): Same.
15905 (check_access): Same.
15906 (gimple_call_alloc_size): Move to tree.c.
15907 (gimple_parm_array_size): Move to pointer-query.cc.
15908 (get_offset_range): Same.
15909 (gimple_call_return_array): Same.
15910 (handle_min_max_size): Same.
15911 (handle_array_ref): Same.
15912 (handle_mem_ref): Same.
15913 (compute_objsize): Same.
15914 (gimple_call_alloc_p): Move to gimple-ssa-warn-access.cc.
15915 (call_dealloc_argno): Same.
15916 (fndecl_dealloc_argno): Same.
15917 (new_delete_mismatch_p): Same.
15918 (matching_alloc_calls_p): Same.
15919 (warn_dealloc_offset): Same.
15920 (maybe_emit_free_warning): Same.
15921 * builtins.h (check_nul_terminated_array): Move to
15922 gimple-ssa-warn-access.h.
15923 (check_nul_terminated_array): Same.
15924 (warn_string_no_nul): Same.
15925 (unterminated_array): Same.
15926 (class ssa_name_limit_t): Same.
15927 (class pointer_query): Same.
15928 (struct access_ref): Same.
15929 (class range_query): Same.
15930 (struct access_data): Same.
15931 (gimple_call_alloc_size): Same.
15932 (gimple_parm_array_size): Same.
15933 (compute_objsize): Same.
15934 (class access_data): Same.
15935 (maybe_emit_free_warning): Same.
15936 * calls.c (initialize_argument_information): Remove call to
15937 maybe_emit_free_warning.
15938 * gimple-array-bounds.cc: Include new header..
15939 * gimple-fold.c: Same.
15940 * gimple-ssa-sprintf.c: Same.
15941 * gimple-ssa-warn-restrict.c: Same.
15942 * passes.def: Add pass_warn_access.
15943 * tree-pass.h (make_pass_warn_access): Declare.
15944 * tree-ssa-strlen.c: Include new headers.
15945 * tree.c (fndecl_dealloc_argno): Move here from builtins.c.
15946 * tree.h (fndecl_dealloc_argno): Move here from attribs.h.
15947 * gimple-ssa-warn-access.cc: New file.
15948 * gimple-ssa-warn-access.h: New file.
15949 * pointer-query.cc: New file.
15950 * pointer-query.h: New file.
15952 2021-07-28 Jakub Jelinek <jakub@redhat.com>
15954 PR middle-end/101624
15955 * ubsan.c (maybe_instrument_pointer_overflow,
15956 instrument_object_size): Only test DECL_REGISTER on VAR_DECLs,
15957 PARM_DECLs or RESULT_DECLs.
15958 * sanopt.c (maybe_optimize_ubsan_ptr_ifn): Likewise.
15960 2021-07-28 Jakub Jelinek <jakub@redhat.com>
15962 PR middle-end/101642
15963 * match.pd (bswap16 (x) == bswap16 (y)): Cast both operands
15964 to type of bswap16 for comparison.
15965 (bswap16 (x) == cst): Cast bswap16 operand to type of cst.
15967 2021-07-28 Richard Biener <rguenther@suse.de>
15969 PR tree-optimization/101615
15970 * tree-vect-slp.c (vect_optimize_slp): Materialize permutes
15971 at CTOR SLP graph entries.
15973 2021-07-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
15975 * config/aarch64/aarch64.md (*extend<SHORT:mode><GPI:mode>2_aarch64):
15976 Add "r,w" alternative.
15978 2021-07-28 H.J. Lu <hjl.tools@gmail.com>
15981 * config/i386/i386.c (ix86_avx_u128_mode_needed): Don't set
15982 AVX_U128_DIRTY when all bits are zero.
15984 2021-07-28 Richard Biener <rguenther@suse.de>
15986 PR tree-optimization/101615
15987 * tree-vect-slp.c (vect_optimize_slp): Pre-existing vector
15988 external nodes cannot be permuted so make them perm_out 0.
15990 2021-07-28 Andrew Stubbs <ams@codesourcery.com>
15993 * config.in: Regenerate.
15994 * config/gcn/gcn-hsa.h (A_FIJI): New define.
15995 (A_900): New define.
15996 (A_906): New define.
15997 (A_908): New define.
15998 (ASM_SPEC): Use A_FIJI, A_900, A_906 and A_908.
15999 * config/gcn/gcn.c (output_file_start): Adjust attributes according
16000 to the assembler capabilities.
16001 * config/gcn/mkoffload.c (main): Likewise.
16002 * configure: Regenerate.
16003 * configure.ac: Add tests for LLVM assembler attribute features.
16005 2021-07-28 Andrew MacLeod <amacleod@redhat.com>
16007 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Check for
16008 cond_false and cond_true on branches.
16010 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
16012 * config/aarch64/aarch64.c (aarch64_gen_adjusted_ldpstp): use
16015 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
16017 * alias.c (init_alias_analysis): Don't skip prologue/epilogue.
16019 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16022 * config/i386/sse.md (vashr<mode>3): Split into vashrv8di3 expander
16023 and vashrv4di3 expander, where the latter requires just TARGET_AVX2
16024 and has special !TARGET_AVX512VL expansion.
16025 (vashrv2di3<mask_name>): Rename to ...
16026 (vashrv2di3): ... this. Change condition to TARGET_XOP || TARGET_AVX2
16027 and add special !TARGET_XOP && !TARGET_AVX512VL expansion.
16029 2021-07-28 Martin Uecker <muecker@gwdg.de>
16031 * calls.c (maybe_warn_rdwr_sizes): Correct argument
16032 numbers in warning that were switched.
16034 2021-07-28 Kewen Lin <linkw@linux.ibm.com>
16036 PR tree-optimization/101596
16037 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Fix wrong check
16038 by using new_type's precision instead.
16040 2021-07-28 liuhongt <hongtao.liu@intel.com>
16043 * config/i386/i386.h (processor_costs): Add new member
16045 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
16046 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
16047 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
16048 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
16049 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
16050 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
16051 generic_cost, core_cost): Initialize integer_to_sse same value
16053 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
16054 * config/i386/i386.c (ix86_builtin_vectorization_cost):
16055 Use integer_to_sse instead of sse_op to calculate the cost of
16058 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16060 * config/rs6000/rs6000-gen-builtins.c (write_ovld_static_init): New
16062 (write_init_file): Call write_ovld_static_init.
16064 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16066 * config/rs6000/rs6000-gen-builtins.c (write_bif_static_init): New
16068 (write_init_file): Call write_bif_static_init.
16070 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16072 * config/rs6000/rs6000-gen-builtins.c (typemap): New struct.
16073 (TYPE_MAP_SIZE): New macro.
16074 (type_map): New initialized variable.
16075 (typemap_cmp): New function.
16076 (write_type_node): Likewise.
16077 (write_fntype_init): Implement.
16079 2021-07-27 Martin Sebor <msebor@redhat.com>
16081 PR tree-optimization/101584
16082 * tree-ssa-uninit.c (builtin_call_nomodifying_p): New function.
16083 (check_defs): Call it.
16085 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
16087 * tree-ssa-dom.c (dom_jump_threader_simplifier):
16088 Put avail_exprs_stack in the class, instead of passing it to
16089 jump_threader_simplifier.
16090 (dom_jump_threader_simplifier::simplify): Add state argument.
16091 (dom_opt_dom_walker): Add state.
16092 (pass_dominator::execute): Pass state to threader.
16093 (dom_opt_dom_walker::before_dom_children): Use state.
16094 * tree-ssa-threadedge.c (jump_threader::jump_threader): Replace
16095 arguments by state.
16096 (jump_threader::record_temporary_equivalences_from_phis):
16097 Register equivalences through the state variable.
16098 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
16099 Record ranges in a statement through the state variable.
16100 (jump_threader::simplify_control_stmt_condition): Pass state to
16102 (jump_threader::simplify_control_stmt_condition_1): Same.
16103 (jump_threader::thread_around_empty_blocks): Remove obsolete
16105 (jump_threader::thread_through_normal_block): Record equivalences
16106 on edge through the state variable.
16107 (jump_threader::thread_across_edge): Abstract state pushing.
16108 (jt_state::jt_state): New.
16109 (jt_state::push): New.
16110 (jt_state::pop): New.
16111 (jt_state::register_equiv): New.
16112 (jt_state::record_ranges_from_stmt): New.
16113 (jt_state::register_equivs_on_edge): New.
16114 (jump_threader_simplifier::jump_threader_simplifier): Move from
16116 (jump_threader_simplifier::simplify): Add state argument.
16117 * tree-ssa-threadedge.h (class jt_state): New.
16118 (class jump_threader): Add state to constructor.
16119 (class jump_threader_simplifier): Add state to simplify. Remove
16120 avail_exprs_stack from class.
16121 * tree-vrp.c (vrp_jump_threader_simplifier::simplify): Add state
16123 (vrp_jump_threader::vrp_jump_threader): Add state.
16124 (vrp_jump_threader::~vrp_jump_threader): Cleanup state.
16126 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
16128 * Makefile.in (OBJS): Add gimple-range-path.o.
16129 * gimple-range-path.cc: New file.
16130 * gimple-range-path.h: New file.
16132 2021-07-27 Jonathan Wright <jonathan.wright@arm.com>
16134 * config/aarch64/aarch64-simd.md: Push sign/zero-extension
16135 inside vec_duplicate for all patterns.
16136 * simplify-rtx.c (simplify_context::simplify_unary_operation_1):
16137 Push sign/zero-extension inside vec_duplicate.
16139 2021-07-27 Richard Biener <rguenther@suse.de>
16141 PR tree-optimization/101573
16142 * tree-ssa-uninit.c (warn_uninit_phi_uses): New function
16143 looking at uninitialized PHI arg defs in some constrained cases.
16144 (warn_uninitialized_vars): Call it.
16145 (execute_early_warn_uninitialized): Calculate dominators.
16147 2021-07-27 Richard Biener <rguenther@suse.de>
16149 PR tree-optimization/39821
16150 * tree-vect-stmts.c (vect_model_promotion_demotion_cost): Use
16151 vector_stmt for widening arithmetic.
16152 (vectorizable_conversion): Adjust.
16154 2021-07-27 Martin Jambor <mjambor@suse.cz>
16156 * cgraph.h (ipa_replace_map): New field force_load_ref.
16157 * ipa-prop.h (ipa_param_descriptor): Reduce precision of move_cost,
16158 aded new flag load_dereferenced, adjusted comments.
16159 (ipa_get_param_dereferenced): New function.
16160 (ipa_set_param_dereferenced): Likewise.
16161 * cgraphclones.c (cgraph_node::create_virtual_clone): Follow it.
16162 * ipa-cp.c: Include gimple.h.
16163 (ipcp_discover_new_direct_edges): Take into account dereferenced flag.
16164 (get_replacement_map): New parameter force_load_ref, set the
16165 appropriate flag in ipa_replace_map if set.
16166 (struct symbol_and_index_together): New type.
16167 (adjust_refs_in_act_callers): New function.
16168 (adjust_references_in_caller): Likewise.
16169 (create_specialized_node): When appropriate, call
16170 adjust_references_in_caller and force only load references.
16171 * ipa-prop.c (load_from_dereferenced_name): New function.
16172 (ipa_analyze_controlled_uses): Also detect loads from a
16173 dereference, harden testing of call statements.
16174 (ipa_write_node_info): Stream the dereferenced flag.
16175 (ipa_read_node_info): Likewise.
16176 (ipa_set_jf_constant): Also create refdesc when jump function
16177 references a variable.
16178 (cgraph_node_for_jfunc): Rename to symtab_node_for_jfunc, work
16179 also on references of variables and return a symtab_node. Adjust
16181 (propagate_controlled_uses): Also remove references to VAR_DECLs.
16183 2021-07-27 Jakub Jelinek <jakub@redhat.com>
16185 PR middle-end/101586
16186 * gimple-fold.c (clear_padding_type): Ignore FIELD_DECLs with byte
16187 positions above or equal to sz except for diagnostics of flexible
16190 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
16192 PR tree-optimization/78888
16193 * gimple-range-fold.cc (get_letter_range): New.
16194 (fold_using_range::range_of_builtin_call): Call get_letter_range.
16196 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
16198 PR tree-optimization/78888
16199 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Add cases
16200 for CFN_BUILT_IN_TOUPPER and CFN_BUILT_IN_TOLOWER.
16202 2021-07-26 Roger Sayle <roger@nextmovesoftware.com>
16203 Marc Glisse <marc.glisse@inria.fr>
16205 * match.pd (rotate): Simplify equality/inequality of rotations.
16206 (bswap): Simplify equality/inequality tests of byte swapping.
16208 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16210 * range-op.cc (operator_bitwise_xor::op1_op2_relation_effect):
16213 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16215 * range-op.cc (operator_lshift::fold_range): Pass rel to
16216 base class fold_range.
16217 (operator_rshift::fold_range): Same.
16219 2021-07-26 Ashimida <ashimida@linux.alibaba.com>
16222 * toplev.h (min_align_loops_log): Remove declaration.
16223 (min_align_jumps_log, min_align_labels_log): Likewise.
16224 (min_align_functions_log): Likewise.
16226 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16228 * tree-vrp.c (vrp_simplify_cond_using_ranges): Rename vr_values
16230 (execute_vrp): Abstract out simplification of conditionals...
16231 (simplify_casted_conds): ...here.
16233 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16235 * gimple-array-bounds.cc (array_bounds_checker::get_value_range):
16236 Add gimple argument.
16237 (array_bounds_checker::check_array_ref): Same.
16238 (array_bounds_checker::check_addr_expr): Same.
16239 (array_bounds_checker::check_array_bounds): Pass statement to
16240 check_array_bounds and check_addr_expr.
16241 * gimple-array-bounds.h (check_array_bounds): Add gimple argument.
16242 (check_addr_expr): Same.
16243 (get_value_range): Same.
16245 2021-07-26 Tamar Christina <tamar.christina@arm.com>
16247 * config/aarch64/aarch64-simd-builtins.def (sdot, udot): Rename to..
16248 (sdot_prod, udot_prod): ... This.
16249 * config/aarch64/aarch64-simd.md (aarch64_<sur>dot<vsi2qi>): Merged
16251 (<sur>dot_prod<vsi2qi>): ... this.
16252 (aarch64_<sur>dot_lane<vsi2qi>, aarch64_<sur>dot_laneq<vsi2qi>):
16253 Change operands order.
16254 (<sur>sadv16qi): Use new operands order.
16255 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32,
16256 vdotq_s32): Use new RTL ordering.
16258 2021-07-26 Tamar Christina <tamar.christina@arm.com>
16260 * config/aarch64/aarch64-builtins.c (TYPES_TERNOP_SUSS,
16261 aarch64_types_ternop_suss_qualifiers): New.
16262 * config/aarch64/aarch64-simd-builtins.def (usdot_prod): Use it.
16263 * config/aarch64/aarch64-simd.md (usdot_prod<vsi2qi>): Re-organize RTL.
16264 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Use it.
16266 2021-07-23 Jakub Jelinek <jakub@redhat.com>
16268 PR rtl-optimization/101562
16269 * expmed.c (store_integral_bit_field): Only use movstrict_optab
16270 if the operand isn't paradoxical.
16272 2021-07-23 Aldy Hernandez <aldyh@redhat.com>
16274 * gimple-array-bounds.h (class array_bounds_checker): Change
16275 ranges type to range_query.
16277 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16279 * config/aarch64/arm_neon.h (vst1_s64_x2): Use
16280 __builtin_memcpy instead of constructing
16281 __builtin_aarch64_simd_oi one vector at a time.
16282 (vst1_u64_x2): Likewise.
16283 (vst1_f64_x2): Likewise.
16284 (vst1_s8_x2): Likewise.
16285 (vst1_p8_x2): Likewise.
16286 (vst1_s16_x2): Likewise.
16287 (vst1_p16_x2): Likewise.
16288 (vst1_s32_x2): Likewise.
16289 (vst1_u8_x2): Likewise.
16290 (vst1_u16_x2): Likewise.
16291 (vst1_u32_x2): Likewise.
16292 (vst1_f16_x2): Likewise.
16293 (vst1_f32_x2): Likewise.
16294 (vst1_p64_x2): Likewise.
16295 (vst1q_s8_x2): Likewise.
16296 (vst1q_p8_x2): Likewise.
16297 (vst1q_s16_x2): Likewise.
16298 (vst1q_p16_x2): Likewise.
16299 (vst1q_s32_x2): Likewise.
16300 (vst1q_s64_x2): Likewise.
16301 (vst1q_u8_x2): Likewise.
16302 (vst1q_u16_x2): Likewise.
16303 (vst1q_u32_x2): Likewise.
16304 (vst1q_u64_x2): Likewise.
16305 (vst1q_f16_x2): Likewise.
16306 (vst1q_f32_x2): Likewise.
16307 (vst1q_f64_x2): Likewise.
16308 (vst1q_p64_x2): Likewise.
16310 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16312 * config/aarch64/arm_neon.h (vst1_s64_x3): Use
16313 __builtin_memcpy instead of constructing
16314 __builtin_aarch64_simd_ci one vector at a time.
16315 (vst1_u64_x3): Likewise.
16316 (vst1_f64_x3): Likewise.
16317 (vst1_s8_x3): Likewise.
16318 (vst1_p8_x3): Likewise.
16319 (vst1_s16_x3): Likewise.
16320 (vst1_p16_x3): Likewise.
16321 (vst1_s32_x3): Likewise.
16322 (vst1_u8_x3): Likewise.
16323 (vst1_u16_x3): Likewise.
16324 (vst1_u32_x3): Likewise.
16325 (vst1_f16_x3): Likewise.
16326 (vst1_f32_x3): Likewise.
16327 (vst1_p64_x3): Likewise.
16328 (vst1q_s8_x3): Likewise.
16329 (vst1q_p8_x3): Likewise.
16330 (vst1q_s16_x3): Likewise.
16331 (vst1q_p16_x3): Likewise.
16332 (vst1q_s32_x3): Likewise.
16333 (vst1q_s64_x3): Likewise.
16334 (vst1q_u8_x3): Likewise.
16335 (vst1q_u16_x3): Likewise.
16336 (vst1q_u32_x3): Likewise.
16337 (vst1q_u64_x3): Likewise.
16338 (vst1q_f16_x3): Likewise.
16339 (vst1q_f32_x3): Likewise.
16340 (vst1q_f64_x3): Likewise.
16341 (vst1q_p64_x3): Likewise.
16343 2021-07-23 H.J. Lu <hjl.tools@gmail.com>
16346 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): Don't return
16347 hard register when LRA is in progress.
16349 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16351 * config/aarch64/arm_neon.h (vst1_s8_x4): Use
16352 __builtin_memcpy instead of using a union.
16353 (vst1q_s8_x4): Likewise.
16354 (vst1_s16_x4): Likewise.
16355 (vst1q_s16_x4): Likewise.
16356 (vst1_s32_x4): Likewise.
16357 (vst1q_s32_x4): Likewise.
16358 (vst1_u8_x4): Likewise.
16359 (vst1q_u8_x4): Likewise.
16360 (vst1_u16_x4): Likewise.
16361 (vst1q_u16_x4): Likewise.
16362 (vst1_u32_x4): Likewise.
16363 (vst1q_u32_x4): Likewise.
16364 (vst1_f16_x4): Likewise.
16365 (vst1q_f16_x4): Likewise.
16366 (vst1_f32_x4): Likewise.
16367 (vst1q_f32_x4): Likewise.
16368 (vst1_p8_x4): Likewise.
16369 (vst1q_p8_x4): Likewise.
16370 (vst1_p16_x4): Likewise.
16371 (vst1q_p16_x4): Likewise.
16372 (vst1_s64_x4): Likewise.
16373 (vst1_u64_x4): Likewise.
16374 (vst1_p64_x4): Likewise.
16375 (vst1q_s64_x4): Likewise.
16376 (vst1q_u64_x4): Likewise.
16377 (vst1q_p64_x4): Likewise.
16378 (vst1_f64_x4): Likewise.
16379 (vst1q_f64_x4): Likewise.
16381 2021-07-23 Jonathan Wrightt <jonathan.wright@arm.com>
16383 * config/aarch64/arm_neon.h (vst2_s64): Use __builtin_memcpy
16384 instead of constructing __builtin_aarch64_simd_oi one vector
16386 (vst2_u64): Likewise.
16387 (vst2_f64): Likewise.
16388 (vst2_s8): Likewise.
16389 (vst2_p8): Likewise.
16390 (vst2_s16): Likewise.
16391 (vst2_p16): Likewise.
16392 (vst2_s32): Likewise.
16393 (vst2_u8): Likewise.
16394 (vst2_u16): Likewise.
16395 (vst2_u32): Likewise.
16396 (vst2_f16): Likewise.
16397 (vst2_f32): Likewise.
16398 (vst2_p64): Likewise.
16399 (vst2q_s8): Likewise.
16400 (vst2q_p8): Likewise.
16401 (vst2q_s16): Likewise.
16402 (vst2q_p16): Likewise.
16403 (vst2q_s32): Likewise.
16404 (vst2q_s64): Likewise.
16405 (vst2q_u8): Likewise.
16406 (vst2q_u16): Likewise.
16407 (vst2q_u32): Likewise.
16408 (vst2q_u64): Likewise.
16409 (vst2q_f16): Likewise.
16410 (vst2q_f32): Likewise.
16411 (vst2q_f64): Likewise.
16412 (vst2q_p64): Likewise.
16414 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16416 * config/aarch64/arm_neon.h (vst3_s64): Use __builtin_memcpy
16417 instead of constructing __builtin_aarch64_simd_ci one vector
16419 (vst3_u64): Likewise.
16420 (vst3_f64): Likewise.
16421 (vst3_s8): Likewise.
16422 (vst3_p8): Likewise.
16423 (vst3_s16): Likewise.
16424 (vst3_p16): Likewise.
16425 (vst3_s32): Likewise.
16426 (vst3_u8): Likewise.
16427 (vst3_u16): Likewise.
16428 (vst3_u32): Likewise.
16429 (vst3_f16): Likewise.
16430 (vst3_f32): Likewise.
16431 (vst3_p64): Likewise.
16432 (vst3q_s8): Likewise.
16433 (vst3q_p8): Likewise.
16434 (vst3q_s16): Likewise.
16435 (vst3q_p16): Likewise.
16436 (vst3q_s32): Likewise.
16437 (vst3q_s64): Likewise.
16438 (vst3q_u8): Likewise.
16439 (vst3q_u16): Likewise.
16440 (vst3q_u32): Likewise.
16441 (vst3q_u64): Likewise.
16442 (vst3q_f16): Likewise.
16443 (vst3q_f32): Likewise.
16444 (vst3q_f64): Likewise.
16445 (vst3q_p64): Likewise.
16447 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16449 * config/aarch64/arm_neon.h (vst4_s64): Use __builtin_memcpy
16450 instead of constructing __builtin_aarch64_simd_xi one vector
16452 (vst4_u64): Likewise.
16453 (vst4_f64): Likewise.
16454 (vst4_s8): Likewise.
16455 (vst4_p8): Likewise.
16456 (vst4_s16): Likewise.
16457 (vst4_p16): Likewise.
16458 (vst4_s32): Likewise.
16459 (vst4_u8): Likewise.
16460 (vst4_u16): Likewise.
16461 (vst4_u32): Likewise.
16462 (vst4_f16): Likewise.
16463 (vst4_f32): Likewise.
16464 (vst4_p64): Likewise.
16465 (vst4q_s8): Likewise.
16466 (vst4q_p8): Likewise.
16467 (vst4q_s16): Likewise.
16468 (vst4q_p16): Likewise.
16469 (vst4q_s32): Likewise.
16470 (vst4q_s64): Likewise.
16471 (vst4q_u8): Likewise.
16472 (vst4q_u16): Likewise.
16473 (vst4q_u32): Likewise.
16474 (vst4q_u64): Likewise.
16475 (vst4q_f16): Likewise.
16476 (vst4q_f32): Likewise.
16477 (vst4q_f64): Likewise.
16478 (vst4q_p64): Likewise.
16480 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16482 * config/aarch64/arm_neon.h (vtbx4_s8): Use __builtin_memcpy
16483 instead of constructing __builtin_aarch64_simd_oi one vector
16485 (vtbx4_u8): Likewise.
16486 (vtbx4_p8): Likewise.
16488 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16490 * config/aarch64/arm_neon.h (vtbl3_s8): Use __builtin_memcpy
16491 instead of constructing __builtin_aarch64_simd_oi one vector
16493 (vtbl3_u8): Likewise.
16494 (vtbl3_p8): Likewise.
16495 (vtbl4_s8): Likewise.
16496 (vtbl4_u8): Likewise.
16497 (vtbl4_p8): Likewise.
16499 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16501 * config/aarch64/arm_neon.h (vqtbx2_s8): Use __builtin_memcpy
16502 instead of constructing __builtin_aarch64_simd_oi one vector
16504 (vqtbx2_u8): Likewise.
16505 (vqtbx2_p8): Likewise.
16506 (vqtbx2q_s8): Likewise.
16507 (vqtbx2q_u8): Likewise.
16508 (vqtbx2q_p8): Likewise.
16509 (vqtbx3_s8): Use __builtin_memcpy instead of constructing
16510 __builtin_aarch64_simd_ci one vector at a time.
16511 (vqtbx3_u8): Likewise.
16512 (vqtbx3_p8): Likewise.
16513 (vqtbx3q_s8): Likewise.
16514 (vqtbx3q_u8): Likewise.
16515 (vqtbx3q_p8): Likewise.
16516 (vqtbx4_s8): Use __builtin_memcpy instead of constructing
16517 __builtin_aarch64_simd_xi one vector at a time.
16518 (vqtbx4_u8): Likewise.
16519 (vqtbx4_p8): Likewise.
16520 (vqtbx4q_s8): Likewise.
16521 (vqtbx4q_u8): Likewise.
16522 (vqtbx4q_p8): Likewise.
16524 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16526 * config/aarch64/arm_neon.h (vqtbl2_s8): Use __builtin_memcpy
16527 instead of constructing __builtin_aarch64_simd_oi one vector
16529 (vqtbl2_u8): Likewise.
16530 (vqtbl2_p8): Likewise.
16531 (vqtbl2q_s8): Likewise.
16532 (vqtbl2q_u8): Likewise.
16533 (vqtbl2q_p8): Likewise.
16534 (vqtbl3_s8): Use __builtin_memcpy instead of constructing
16535 __builtin_aarch64_simd_ci one vector at a time.
16536 (vqtbl3_u8): Likewise.
16537 (vqtbl3_p8): Likewise.
16538 (vqtbl3q_s8): Likewise.
16539 (vqtbl3q_u8): Likewise.
16540 (vqtbl3q_p8): Likewise.
16541 (vqtbl4_s8): Use __builtin_memcpy instead of constructing
16542 __builtin_aarch64_simd_xi one vector at a time.
16543 (vqtbl4_u8): Likewise.
16544 (vqtbl4_p8): Likewise.
16545 (vqtbl4q_s8): Likewise.
16546 (vqtbl4q_u8): Likewise.
16547 (vqtbl4q_p8): Likewise.
16549 2021-07-23 Haochen Gui <guihaoc@gcc.gnu.org>
16552 * config/rs6000/rs6000.md (cstore<mode>4): Fix wrong fall through.
16554 2021-07-22 Andrew Pinski <apinski@marvell.com>
16556 PR tree-optimization/10153
16557 * tree-tailcall.c (create_tailcall_accumulator):
16558 Don't call fold_convert as the type should be correct already.
16559 (tree_optimize_tail_calls_1): Use build_{one,zero}_cst instead
16560 of integer_{one,zero}_node for the call of create_tailcall_accumulator.
16562 2021-07-22 Aldy Hernandez <aldyh@redhat.com>
16564 * gimple-range-cache.cc (non_null_ref::adjust_range): Replace
16565 varying_p check for null/non-null check.
16567 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
16569 PR tree-optimization/101511
16570 * value-relation.cc (relation_oracle::query_relation): Check if ssa1
16571 is in ssa2's equiv set, and don't trap if so.
16573 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
16575 PR tree-optimization/101497
16576 * gimple-range-fold.cc (fold_using_range::range_of_cond_expr): Check
16579 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
16581 PR tree-optimization/101496
16582 * vr-values.c (simplify_using_ranges::fold_cond): Call range_of_stmt
16583 first, then vrp_visit_cond_Stmt.
16585 2021-07-22 liuhongt <hongtao.liu@intel.com>
16587 * config/i386/i386-expand.c
16588 (ix86_broadcast_from_integer_constant): Rename to ..
16589 (ix86_broadcast_from_constant): .. this, and extend it to
16591 (ix86_expand_vector_move): Extend to float mode.
16592 * config/i386/i386-features.c
16593 (replace_constant_pool_with_broadcast): Remove.
16594 (remove_partial_avx_dependency_gate): Ditto.
16595 (constant_pool_broadcast): Ditto.
16596 (class pass_constant_pool_broadcast): Ditto.
16597 (make_pass_constant_pool_broadcast): Ditto.
16598 (remove_partial_avx_dependency): Adjust gate.
16599 * config/i386/i386-passes.def: Remove pass_constant_pool_broadcast.
16600 * config/i386/i386-protos.h
16601 (make_pass_constant_pool_broadcast): Remove.
16603 2021-07-22 liuhongt <hongtao.liu@intel.com>
16605 * config/i386/constraints.md (Wb): New constraint.
16607 * config/i386/i386.md (*ashlhi3_1): Extend to avx512 mask
16609 (*ashlqi3_1): Ditto.
16610 (*<insn><mode>3_1): Split to ..
16611 (*ashr<mode>3_1): this, ...
16612 (*lshr<mode>3_1): and this, also extend this pattern to avx512
16614 (*<insn><mode>3_1): Split to ..
16615 (*ashr<mode>3_1): this, ...
16616 (*lshrqi3_1): and this, also extend this pattern to avx512
16618 (*lshrhi3_1): And this, also extend this pattern to avx512
16620 * config/i386/sse.md (k<code><mode>): New define_split after
16621 it to convert generic shift pattern to mask shift ones.
16623 2021-07-21 Thomas Schwinge <thomas@codesourcery.com>
16624 Joseph Myers <joseph@codesourcery.com>
16625 Cesar Philippidis <cesar@codesourcery.com>
16627 * tree-core.h (omp_clause_code): Add 'OMP_CLAUSE_NOHOST'.
16628 * tree.c (omp_clause_num_ops, omp_clause_code_name, walk_tree_1):
16630 * tree-pretty-print.c (dump_omp_clause): Likewise.
16631 * omp-general.c (oacc_verify_routine_clauses): Likewise.
16632 * gimplify.c (gimplify_scan_omp_clauses)
16633 (gimplify_adjust_omp_clauses): Likewise.
16634 * tree-nested.c (convert_nonlocal_omp_clauses)
16635 (convert_local_omp_clauses): Likewise.
16636 * omp-low.c (scan_sharing_clauses): Likewise.
16637 * omp-offload.c (execute_oacc_device_lower): Update.
16639 2021-07-21 Martin Sebor <msebor@redhat.com>
16641 * tree-ssa-alias.c (walk_aliased_vdefs_1): Fix typos in a comment.
16643 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16645 * config/rs6000/rs6000-gen-builtins.c (write_init_bif_table):
16648 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16650 * config/rs6000/rs6000-gen-builtins.c (write_fntype): New
16652 (write_fntype_init): New stub function.
16653 (write_init_bif_table): Likewise.
16654 (write_init_ovld_table): New function.
16655 (write_init_file): Implement.
16657 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16659 * config/rs6000/rs6000-gen-builtins.c
16660 (write_autogenerated_header): New function.
16661 (write_decls): Likewise.
16662 (write_extern_fntype): New callback function.
16663 (write_header_file): Implement.
16665 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16667 * config/rs6000/rs6000-gen-builtins.c (write_defines_file):
16670 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16672 * config/rs6000/rs6000-gen-builtins.c (complete_vector_type): New
16674 (complete_base_type): Likewise.
16675 (construct_fntype_id): Likewise.
16676 (parse_bif_entry): Call contruct_fntype_id.
16677 (parse_ovld_entry): Likewise.
16679 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16681 * config/rs6000/rs6000-gen-builtins.c (ovld_stanza): New struct.
16682 (MAXOVLDSTANZAS): New macro.
16683 (ovld_stanzas): New variable.
16684 (curr_ovld_stanza): Likewise.
16685 (MAXOVLDS): New macro.
16686 (ovlddata): New struct.
16687 (ovlds): New variable.
16688 (curr_ovld): Likewise.
16689 (max_ovld_args): Likewise.
16690 (parse_ovld_entry): New function.
16691 (parse_ovld_stanza): Likewise.
16692 (parse_ovld): Implement.
16694 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16696 * config/rs6000/rs6000-gen-builtins.c (parse_bif_attrs):
16699 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16701 * config/rs6000/rs6000-gen-builtins.c (parse_args): New function.
16702 (parse_prototype): Implement.
16704 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16706 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): New enum.
16707 (curr_bif_stanza): New variable.
16708 (stanza_entry): New struct.
16709 (stanza_map): New initialized variable.
16710 (enable_string): Likewise.
16711 (fnkinds): New enum.
16712 (typelist): New struct.
16713 (attrinfo): Likewise.
16714 (MAXRESTROPNDS): New macro.
16715 (prototype): New struct.
16716 (MAXBIFS): New macro.
16717 (bifdata): New struct.
16718 (bifs): New variable.
16719 (curr_bif): Likewise.
16720 (bif_order): Likewise.
16721 (bif_index): Likewise.
16722 (fatal): New function.
16723 (stanza_name_to_stanza): Likewise.
16724 (parse_bif_attrs): New stub function.
16725 (parse_prototype): Likewise.
16726 (parse_bif_entry): New function.
16727 (parse_bif_stanza): Likewise.
16728 (parse_bif): Implement.
16729 (set_bif_order): New function.
16730 (create_bif_order): Implement.
16732 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16734 * config/rs6000/rs6000-gen-builtins.c (rbtree.h): New #include.
16735 (num_bifs): New variable.
16736 (num_ovld_stanzas): Likewise.
16737 (num_ovlds): Likewise.
16738 (parse_codes): New enum.
16739 (bif_rbt): New variable.
16740 (ovld_rbt): Likewise.
16741 (fntype_rbt): Likewise.
16742 (bifo_rbt): Likewise.
16743 (parse_bif): New stub function.
16744 (create_bif_order): Likewise.
16745 (parse_ovld): Likewise.
16746 (write_header_file): Likewise.
16747 (write_init_file): Likewise.
16748 (write_defines_file): Likewise.
16749 (delete_output_files): New function.
16752 2021-07-21 H.J. Lu <hjl.tools@gmail.com>
16755 * config/i386/i386-builtin.def: Remove OPTION_MASK_ISA_SSE4_2
16756 from CRC32 _builtin functions.
16758 2021-07-21 Sebastian Huber <sebastian.huber@embedded-brains.de>
16760 * coverage.c (build_gcov_info_var_registration): Mark the object placed
16761 in the linker set as referenced so that it does not get optimized away.
16763 2021-07-21 Kito Cheng <kito.cheng@sifive.com>
16766 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
16768 * config.gcc (riscv*-*-*): Detect which python is available.
16770 2021-07-21 Jakub Jelinek <jakub@redhat.com>
16772 PR middle-end/101535
16773 * gimplify.c (omp_check_private): Properly skip ORT_TARGET_DATA
16774 contexts in which decl isn't privatized and for ORT_TARGET return
16775 false if decl is mapped.
16777 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
16779 * gimple-loop-jam.c: Include tree-ssa-sccvn.h.
16780 (tree_loop_unroll_and_jam): Run value-numbering on a loop that
16781 has been successfully unrolled.
16783 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
16785 * tree-ssa-loop-manip.c (determine_exit_conditions): Return a null
16786 exit condition if no tail loop is needed, and if the original exit
16787 condition should therefore be kept as-is.
16788 (tree_transform_and_unroll_loop): Handle that case here too.
16790 2021-07-21 Kewen Lin <linkw@linux.ibm.com>
16792 * tree-data-ref.c (free_dependence_relations): Adjust to pass vec
16794 (free_data_refs): Likewise.
16795 * tree-data-ref.h (free_dependence_relations): Likewise.
16796 (free_data_refs): Likewise.
16797 * tree-predcom.c (struct chain): Use auto_vec instead of vec for
16799 (struct component): Likewise.
16800 (pcom_worker::pcom_worker): Adjust for auto_vec and renaming changes.
16801 (pcom_worker::~pcom_worker): Likewise.
16802 (pcom_worker::release_chain): Adjust as auto_vec changes.
16803 (pcom_worker::loop): Rename to ...
16804 (pcom_worker::m_loop): ... this.
16805 (pcom_worker::datarefs): Rename to ...
16806 (pcom_worker::m_datarefs): ... this. Use auto_vec instead of vec.
16807 (pcom_worker::dependences): Rename to ...
16808 (pcom_worker::m_dependences): ... this. Use auto_vec instead of vec.
16809 (pcom_worker::chains): Rename to ...
16810 (pcom_worker::m_chains): ... this. Use auto_vec instead of vec.
16811 (pcom_worker::looparound_phis): Rename to ...
16812 (pcom_worker::m_looparound_phis): ... this. Use auto_vec instead of
16814 (pcom_worker::cache): Rename to ...
16815 (pcom_worker::m_cache): ... this. Use auto_vec instead of vec.
16816 (pcom_worker::release_chain): Adjust for auto_vec changes.
16817 (pcom_worker::release_chains): Adjust for auto_vec and renaming
16819 (release_component): Remove.
16820 (release_components): Adjust for release_component removal.
16821 (component_of): Adjust to use vec.
16822 (merge_comps): Likewise.
16823 (pcom_worker::aff_combination_dr_offset): Adjust for renaming changes.
16824 (pcom_worker::determine_offset): Likewise.
16825 (class comp_ptrs): Remove.
16826 (pcom_worker::split_data_refs_to_components): Adjust for renaming
16827 changes, for comp_ptrs removal with auto_vec.
16828 (pcom_worker::suitable_component_p): Adjust for renaming changes.
16829 (pcom_worker::filter_suitable_components): Adjust for release_component
16831 (pcom_worker::valid_initializer_p): Adjust for renaming changes.
16832 (pcom_worker::find_looparound_phi): Likewise.
16833 (pcom_worker::add_looparound_copies): Likewise.
16834 (pcom_worker::determine_roots_comp): Likewise.
16835 (pcom_worker::single_nonlooparound_use): Likewise.
16836 (pcom_worker::execute_pred_commoning_chain): Likewise.
16837 (pcom_worker::execute_pred_commoning): Likewise.
16838 (pcom_worker::try_combine_chains): Likewise.
16839 (pcom_worker::prepare_initializers_chain): Likewise.
16840 (pcom_worker::prepare_initializers): Likewise.
16841 (pcom_worker::prepare_finalizers_chain): Likewise.
16842 (pcom_worker::prepare_finalizers): Likewise.
16843 (pcom_worker::tree_predictive_commoning_loop): Likewise.
16845 2021-07-20 Martin Sebor <msebor@redhat.com>
16847 PR middle-end/101397
16848 * builtins.c (gimple_call_return_array): Add argument. Correct
16849 offsets for memchr, mempcpy, stpcpy, and stpncpy.
16850 (compute_objsize_r): Adjust offset computation for argument returning
16853 2021-07-20 Martin Sebor <msebor@redhat.com>
16855 PR middle-end/101300
16856 * tree-ssa-uninit.c (check_defs): Handle UBSAN built-ins.
16858 2021-07-20 Jeff Law <jlaw@localhost.localdomain>
16860 * function.c (assign_parm_setup_block): Use adjust_address instead
16861 of change_address to preserve MEM_EXPR and friends.
16863 2021-07-20 Martin Sebor <msebor@redhat.com>
16865 * cfgloop.h (single_likely_exit): Adjust by-value argument to
16866 by-const-reference.
16867 * cfgloopanal.c (single_likely_exit): Same.
16868 * cgraph.h (struct cgraph_node): Same.
16869 * cgraphclones.c (cgraph_node::create_virtual_clone): Same.
16870 * genautomata.c (merge_states): Same.
16871 * genextract.c (VEC_char_to_string): Same.
16872 * genmatch.c (dt_node::gen_kids_1): Same.
16873 (walk_captures): Adjust by-value argument to by-reference.
16874 * gimple-ssa-store-merging.c (check_no_overlap): Adjust by-value argument
16875 to by-const-reference.
16876 * gimple.c (gimple_build_call_vec): Same.
16877 (gimple_build_call_internal_vec): Same.
16878 (gimple_build_switch): Same.
16879 (sort_case_labels): Same.
16880 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
16882 * gimple.h (gimple_build_call_vec): Adjust by-value argument to
16883 by-const-reference.
16884 (gimple_build_call_internal_vec): Same.
16885 (gimple_build_switch): Same.
16886 (sort_case_labels): Same.
16887 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
16889 * haifa-sched.c (calc_priorities): Adjust by-value argument to
16890 by-const-reference.
16891 (sched_init_luids): Same.
16892 (haifa_init_h_i_d): Same.
16893 * ipa-cp.c (ipa_get_indirect_edge_target_1): Same.
16894 (adjust_callers_for_value_intersection): Adjust by-value argument to
16896 (find_more_scalar_values_for_callers_subset): Adjust by-value argument to
16897 by-const-reference.
16898 (find_more_contexts_for_caller_subset): Same.
16899 (find_aggregate_values_for_callers_subset): Same.
16900 (copy_useful_known_contexts): Same.
16901 * ipa-fnsummary.c (remap_edge_summaries): Same.
16902 (remap_freqcounting_predicate): Same.
16903 * ipa-inline.c (add_new_edges_to_heap): Adjust by-value argument to
16905 * ipa-predicate.c (predicate::remap_after_inlining): Adjust by-value argument
16906 to by-const-reference.
16907 * ipa-predicate.h (predicate::remap_after_inlining): Same.
16908 * ipa-prop.c (ipa_find_agg_cst_for_param): Same.
16909 * ipa-prop.h (ipa_find_agg_cst_for_param): Same.
16910 * ira-build.c (ira_loop_tree_body_rev_postorder): Same.
16911 * read-rtl.c (add_overload_instance): Same.
16912 * rtl.h (native_decode_rtx): Same.
16913 (native_decode_vector_rtx): Same.
16914 * sched-int.h (sched_init_luids): Same.
16915 (haifa_init_h_i_d): Same.
16916 * simplify-rtx.c (native_decode_vector_rtx): Same.
16917 (native_decode_rtx): Same.
16918 * tree-call-cdce.c (gen_shrink_wrap_conditions): Same.
16919 (shrink_wrap_one_built_in_call_with_conds): Same.
16920 (shrink_wrap_conditional_dead_built_in_calls): Same.
16921 * tree-data-ref.c (create_runtime_alias_checks): Same.
16922 (compute_all_dependences): Same.
16923 * tree-data-ref.h (compute_all_dependences): Same.
16924 (create_runtime_alias_checks): Same.
16925 (index_in_loop_nest): Same.
16926 * tree-if-conv.c (mask_exists): Same.
16927 * tree-loop-distribution.c (class loop_distribution): Same.
16928 (loop_distribution::create_rdg_vertices): Same.
16929 (dump_rdg_partitions): Same.
16930 (debug_rdg_partitions): Same.
16931 (partition_contains_all_rw): Same.
16932 (loop_distribution::distribute_loop): Same.
16933 * tree-parloops.c (oacc_entry_exit_ok_1): Same.
16934 (oacc_entry_exit_single_gang): Same.
16935 * tree-ssa-loop-im.c (hoist_memory_references): Same.
16936 (loop_suitable_for_sm): Same.
16937 * tree-ssa-loop-niter.c (bound_index): Same.
16938 * tree-ssa-reassoc.c (update_ops): Same.
16939 (swap_ops_for_binary_stmt): Same.
16940 (rewrite_expr_tree): Same.
16941 (rewrite_expr_tree_parallel): Same.
16942 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Same.
16943 * tree-ssa-sccvn.h (ao_ref_init_from_vn_reference): Same.
16944 * tree-ssa-structalias.c (process_all_all_constraints): Same.
16945 (make_constraints_to): Same.
16946 (handle_lhs_call): Same.
16947 (find_func_aliases_for_builtin_call): Same.
16948 (sort_fieldstack): Same.
16949 (check_for_overlaps): Same.
16950 * tree-vect-loop-manip.c (vect_create_cond_for_align_checks): Same.
16951 (vect_create_cond_for_unequal_addrs): Same.
16952 (vect_create_cond_for_lower_bounds): Same.
16953 (vect_create_cond_for_alias_checks): Same.
16954 * tree-vect-slp-patterns.c (vect_validate_multiplication): Same.
16955 * tree-vect-slp.c (vect_analyze_slp_instance): Same.
16956 (vect_make_slp_decision): Same.
16957 (vect_slp_bbs): Same.
16958 (duplicate_and_interleave): Same.
16959 (vect_transform_slp_perm_load): Same.
16960 (vect_schedule_slp): Same.
16961 * tree-vectorizer.h (vect_transform_slp_perm_load): Same.
16962 (vect_schedule_slp): Same.
16963 (duplicate_and_interleave): Same.
16964 * tree.c (build_vector_from_ctor): Same.
16965 (build_vector): Same.
16966 (check_vector_cst): Same.
16967 (check_vector_cst_duplicate): Same.
16968 (check_vector_cst_fill): Same.
16969 (check_vector_cst_stepped): Same.
16970 * tree.h (build_vector_from_ctor): Same.
16972 2021-07-20 Jakub Jelinek <jakub@redhat.com>
16975 * config/rs6000/rs6000-protos.h (easy_altivec_constant): Change return
16976 type from bool to int.
16977 * config/rs6000/rs6000.c (vspltis_constant): Fix up handling the
16978 EASY_VECTOR_MSB case if either step or copies is not 1.
16979 (vspltis_shifted): Fix comment typo.
16980 (easy_altivec_constant): Change return type from bool to int, instead
16981 of returning true return byte size of the element mode that should be
16982 used to synthetize the constant.
16983 * config/rs6000/predicates.md (easy_vector_constant_msb): Require
16984 that vspltis_shifted is 0, handle the case where easy_altivec_constant
16985 assumes using different vector mode from CONST_VECTOR's mode.
16986 * config/rs6000/altivec.md (easy_vector_constant_msb splitter): Use
16987 easy_altivec_constant to determine mode in which -1 >> -1 should be
16988 performed, use rs6000_expand_vector_init instead of gen_vec_initv4sisi.
16990 2021-07-20 Richard Biener <rguenther@suse.de>
16993 * dwarf2out.h (dwarf_file_data): Add key member.
16994 * dwarf2out.c (dwarf_file_hasher::equal): Compare key.
16995 (dwarf_file_hasher::hash): Hash key.
16996 (lookup_filename): Remap the filename and store it in the
16997 filename member of dwarf_file_data when creating a new
16999 (file_name_acquire): Do not remap the filename again.
17000 (maybe_emit_file): Likewise.
17002 2021-07-20 Jonathan Wright <jonathan.wright@arm.com>
17004 * config/aarch64/aarch64-simd-builtins.def: Use two variant
17005 generators for all TBL/TBX intrinsics and rename to
17006 consistent forms: qtbl[1234] or qtbx[1234].
17007 * config/aarch64/aarch64-simd.md (aarch64_tbl1<mode>):
17009 (aarch64_qtbl1<mode>): This.
17010 (aarch64_tbx1<mode>): Rename to...
17011 (aarch64_qtbx1<mode>): This.
17012 (aarch64_tbl2v16qi): Delete.
17013 (aarch64_tbl3<mode>): Rename to...
17014 (aarch64_qtbl2<mode>): This.
17015 (aarch64_tbx4<mode>): Rename to...
17016 (aarch64_qtbx2<mode>): This.
17017 * config/aarch64/aarch64.c (aarch64_expand_vec_perm_1): Use
17018 renamed qtbl1 and qtbl2 RTL patterns.
17019 * config/aarch64/arm_neon.h (vqtbl1_p8): Use renamed qtbl1
17021 (vqtbl1_s8): Likewise.
17022 (vqtbl1_u8): Likewise.
17023 (vqtbl1q_p8): Likewise.
17024 (vqtbl1q_s8): Likewise.
17025 (vqtbl1q_u8): Likewise.
17026 (vqtbx1_s8): Use renamed qtbx1 RTL pattern.
17027 (vqtbx1_u8): Likewise.
17028 (vqtbx1_p8): Likewise.
17029 (vqtbx1q_s8): Likewise.
17030 (vqtbx1q_u8): Likewise.
17031 (vqtbx1q_p8): Likewise.
17032 (vtbl1_s8): Use renamed qtbl1 RTL pattern.
17033 (vtbl1_u8): Likewise.
17034 (vtbl1_p8): Likewise.
17035 (vtbl2_s8): Likewise
17036 (vtbl2_u8): Likewise.
17037 (vtbl2_p8): Likewise.
17038 (vtbl3_s8): Use renamed qtbl2 RTL pattern.
17039 (vtbl3_u8): Likewise.
17040 (vtbl3_p8): Likewise.
17041 (vtbl4_s8): Likewise.
17042 (vtbl4_u8): Likewise.
17043 (vtbl4_p8): Likewise.
17044 (vtbx2_s8): Use renamed qtbx2 RTL pattern.
17045 (vtbx2_u8): Likewise.
17046 (vtbx2_p8): Likewise.
17047 (vqtbl2_s8): Use renamed qtbl2 RTL pattern.
17048 (vqtbl2_u8): Likewise.
17049 (vqtbl2_p8): Likewise.
17050 (vqtbl2q_s8): Likewise.
17051 (vqtbl2q_u8): Likewise.
17052 (vqtbl2q_p8): Likewise.
17053 (vqtbx2_s8): Use renamed qtbx2 RTL pattern.
17054 (vqtbx2_u8): Likewise.
17055 (vqtbx2_p8): Likewise.
17056 (vqtbx2q_s8): Likewise.
17057 (vqtbx2q_u8): Likewise.
17058 (vqtbx2q_p8): Likewise.
17059 (vtbx4_s8): Likewise.
17060 (vtbx4_u8): Likewise.
17061 (vtbx4_p8): Likewise.
17063 2021-07-20 Uroš Bizjak <ubizjak@gmail.com>
17066 * config/i386/sync.md (define_peephole2 atomic_storedi_fpu):
17068 (define_peephole2 atomic_loaddi_fpu): Ditto.
17070 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
17072 * config.gcc (riscv*-*-*): Detect which python is available.
17074 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
17076 * config/rs6000/vsx.md (mulhs_<mode>): Rename to...
17077 (smul<mode>3_highpart): ... this.
17078 (mulhu_<mode>): Rename to...
17079 (umul<mode>3_highpart): ... this.
17080 * config/rs6000/rs6000-builtin.def (MULHS_V2DI, MULHS_V4SI,
17081 MULHU_V2DI, MULHU_V4SI): Adjust.
17083 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
17085 PR tree-optimization/100696
17086 * internal-fn.c (first_commutative_argument): Add info for IFN_MULH.
17087 * internal-fn.def (IFN_MULH): New internal function.
17088 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Add support to
17089 recog normal multiply highpart as IFN_MULH.
17090 * config/i386/i386.c (ix86_add_stmt_cost): Adjust for combined
17093 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
17095 * config/elfos.h (CTF_DEBUGGING_INFO): New definition.
17096 (BTF_DEBUGGING_INFO): Likewise.
17097 * doc/tm.texi.in: Document the new macros.
17098 * doc/tm.texi: Regenerated.
17099 * toplev.c: Guard initialization of debug hooks.
17101 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
17103 * flags.h (ctf_debuginfo_p): New function declaration.
17104 * opts.c (ctf_debuginfo_p): New function definition.
17106 2021-07-19 Andrew Stubbs <ams@codesourcery.com>
17109 * config/gcn/gcn-hsa.h (DRIVER_SELF_SPECS): New.
17110 (ASM_SPEC): Set -mattr for xnack and sram-ecc.
17111 * config/gcn/gcn-opts.h (enum sram_ecc_type): New.
17112 * config/gcn/gcn-valu.md: Add a warning comment.
17113 * config/gcn/gcn.c (gcn_option_override): Add "sorry" for -mxnack.
17114 (output_file_start): Add xnack and sram-ecc state to ".amdgcn_target".
17115 * config/gcn/gcn.md: Add a warning comment.
17116 * config/gcn/gcn.opt: Add -mxnack and -msram-ecc.
17117 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): Remove
17119 (EF_AMDGPU_XNACK): New.
17120 (EF_AMDGPU_SRAM_ECC): New.
17122 (copy_early_debug_info): Use elf_flags.
17123 (main): Handle -mxnack and -msram-ecc options.
17124 * doc/invoke.texi: Document -mxnack and -msram-ecc.
17126 2021-07-19 Andrew Pinski <apinski@marvell.com>
17129 * config/aarch64/aarch64.md (csneg3_uxtw_insn): Rename to ...
17130 (*cs<neg_not_cs>3_uxtw_insn4): and extend to NEG_NOT.
17132 2021-07-19 Richard Biener <rguenther@suse.de>
17134 PR tree-optimization/101505
17135 * tree-vect-patterns.c (vect_determine_precisions): Walk
17136 PHIs also for loop vectorization.
17138 2021-07-19 Richard Biener <rguenther@suse.de>
17140 * gimple.h (gimple_expr_type): Remove.
17141 * doc/gimple.texi: Remove gimple_expr_type documentation.
17143 2021-07-19 Richard Biener <rguenther@suse.de>
17145 * tree-ssa-sccvn.c (vn_reference_eq): Handle NULL vr->type.
17146 (ao_ref_init_from_vn_reference): Likewise.
17147 (fully_constant_reference): Likewise.
17148 (vn_reference_lookup_call): Do not set vr->type to random
17150 * tree-ssa-pre.c (compute_avail): Do not try to PRE calls
17152 * tree-vect-generic.c (expand_vector_piecewise): Pass in
17153 whether we expanded parallel.
17154 (expand_vector_parallel): Adjust.
17155 (expand_vector_addition): Likewise.
17156 (expand_vector_comparison): Likewise.
17157 (expand_vector_operation): Likewise.
17158 (expand_vector_scalar_condition): Likewise.
17159 (expand_vector_conversion): Likewise.
17161 2021-07-19 Richard Biener <rguenther@suse.de>
17163 * tree-vrp.c (register_edge_assert_for_2): Use the
17165 (vrp_folder::fold_predicate_in): Likewise.
17166 * vr-values.c (gimple_assign_nonzero_p): Likewise.
17167 (vr_values::extract_range_from_comparison): Likewise.
17168 (vr_values::extract_range_from_ubsan_builtin): Use the
17169 type of the first operand.
17170 (vr_values::extract_range_basic): Push down type
17171 computation, use the appropriate LHS.
17172 (vr_values::extract_range_from_assignment): Use the
17175 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
17178 * common/config/i386/i386-common.c (ix86_handle_option): For
17179 -mgeneral-regs-only, enable the GPR only instructions which are
17180 enabled implicitly by SSE ISAs unless they have been disabled
17183 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
17186 * config/i386/i386.c (ix86_check_avx_upper_stores): Moved before
17187 ix86_avx_u128_mode_needed.
17188 (ix86_avx_u128_mode_needed): Return AVX_U128_DIRTY if callee
17189 returns AVX register.
17191 2021-07-17 Jan Hubicka <hubicka@ucw.cz>
17193 * tree-ssa-structalias.c (handle_rhs_call): Support EAF_NOT_RETURNED.
17194 (handle_const_call): Liekise
17195 (handle_pure_call): Liekise
17197 2021-07-17 Andrew MacLeod <amacleod@redhat.com>
17199 PR tree-optimization/96542
17200 * range-op.cc (range_operator::wi_fold_in_parts): New.
17201 (range_operator::fold_range): Call wi_fold_in_parts.
17202 (operator_lshift::wi_fold): Fix broken lshift by [0,0].
17203 * range-op.h (wi_fold_in_parts): Add prototype.
17205 2021-07-16 David Malcolm <dmalcolm@redhat.com>
17207 * doc/analyzer.texi: Add __analyzer_dump_state.
17209 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17211 * config/rs6000/rbtree.c: New file.
17212 * config/rs6000/rbtree.h: New file.
17214 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17216 * config/rs6000/rs6000-gen-builtins.c (restriction): New enum.
17217 (typeinfo): Add restr field.
17218 (match_bracketed_pair): New function.
17219 (match_const_restriction): Implement.
17221 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17223 * config/rs6000/rs6000-gen-builtins.c (match_basetype): Implement.
17225 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17227 * config/rs6000/rs6000-gen-builtins.c (void_status): New enum.
17228 (basetype): Likewise.
17229 (typeinfo): Likewise.
17230 (handle_pointer): New function.
17231 (match_basetype): New stub function.
17232 (match_const_restriction): Likewise.
17233 (match_type): New function.
17235 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17237 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace): New
17239 (advance_line): Likewise.
17240 (safe_inc_pos): Likewise.
17241 (match_identifier): Likewise.
17242 (match_integer): Likewise.
17243 (match_to_right_bracket): Likewise.
17245 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17247 * config/rs6000/rs6000-gen-builtins.c (bif_file): New variable.
17248 (ovld_file): Likewise.
17249 (header_file): Likewise.
17250 (init_file): Likewise.
17251 (defines_file): Likewise.
17252 (pgm_path): Likewise.
17253 (bif_path): Likewise.
17254 (ovld_path): Likewise.
17255 (header_path): Likewise.
17256 (init_path): Likewise.
17257 (defines_path): Likewise.
17258 (LINELEN): New macro.
17259 (linebuf): New variable.
17263 (bif_diag): New function.
17264 (ovld_diag): Likewise.
17266 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17268 * config/rs6000/rs6000-builtin-new.def: New.
17269 * config/rs6000/rs6000-overload.def: New.
17271 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17273 * config/rs6000/rs6000-gen-builtins.c: New.
17275 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17277 * Makefile.in (EXTRA_GTYPE_DEPS): New variable.
17278 (s-gtype): Depend on EXTRA_GTYPE_DEPS.
17279 * gengtype-state.c (state_writer::write_state_file_list): Add a
17280 parameter to the fileslist expression for the number of build
17282 (read_state_files_list): Detect build headers and strip the
17283 initial "./" or ".\" from their names.
17284 * gengtype.c (build_headers): New global variable.
17285 (num_build_headers): Likewise.
17286 (open_base_files): Emit #include for each build header.
17287 (main): Detect and count build headers.
17288 * gengtype.h (build_headers): New extern variable.
17289 (num_build_headers): Likewise.
17291 2021-07-16 Richard Biener <rguenther@suse.de>
17293 * gimple-ssa-store-merging.c (verify_symbolic_number_p): Use
17294 the type of the LHS.
17295 (find_bswap_or_nop_1): Likewise.
17296 (find_bswap_or_nop): Likewise.
17297 * tree-vectorizer.h (vect_get_smallest_scalar_type): Adjust
17299 * tree-vect-data-refs.c (vect_get_smallest_scalar_type):
17300 Remove unused parameters, pass in the scalar type. Fix
17301 internal store function handling.
17302 * tree-vect-stmts.c (vect_analyze_stmt): Remove assert.
17303 (vect_get_vector_types_for_stmt): Move down check for
17304 existing vector stmt after we've determined a scalar type.
17305 Pass down the used scalar type to vect_get_smallest_scalar_type.
17306 * tree-vect-generic.c (expand_vector_condition): Use
17307 the type of the LHS.
17308 (expand_vector_scalar_condition): Likewise.
17309 (expand_vector_operations_1): Likewise.
17310 * tree-vect-patterns.c (vect_widened_op_tree): Likewise.
17311 (vect_recog_dot_prod_pattern): Likewise.
17312 (vect_recog_sad_pattern): Likewise.
17313 (vect_recog_widen_op_pattern): Likewise.
17314 (vect_recog_widen_sum_pattern): Likewise.
17315 (vect_recog_mixed_size_cond_pattern): Likewise.
17317 2021-07-16 Jan Hubicka <hubicka@ucw.cz>
17319 * ipa-modref.c (struct escape_entry): Use eaf_fleags_t.
17320 (dump_eaf_flags): Dump EAF_NOT_RETURNED
17321 (eaf_flags_useful_p): Use eaf_fleags_t; handle const functions
17322 and EAF_NOT_RETURNED.
17323 (modref_summary::useful_p): Likewise.
17324 (modref_summary_lto::useful_p): Likewise.
17325 (struct) modref_summary_lto: Use eaf_fleags_t.
17326 (deref_flags): Handle EAF_NOT_RETURNED.
17327 (struct escape_point): Use min_flags.
17328 (modref_lattice::init): Add EAF_NOT_RETURNED.
17329 (merge_call_lhs_flags): Ignore EAF_NOT_RETURNED functions
17330 (analyze_ssa_name_flags): Clear EAF_NOT_RETURNED on return;
17332 (analyze_parms): Also analyze const functions; update conition on
17334 (modref_write): Update streaming.
17335 (read_section): Update streaming.
17336 (remap_arg_flags): Use eaf_flags_t.
17337 (modref_merge_call_site_flags): Hanlde EAF_NOT_RETURNED.
17338 * ipa-modref.h: (eaf_flags_t): New typedef.
17339 (struct modref_summary): Use eaf_flags_t.
17340 * tree-core.h (EAF_NOT_RETURNED): New constant.
17342 2021-07-16 Richard Biener <rguenther@suse.de>
17344 * gimple-fold.c (gimple_fold_stmt_to_constant_1): Use
17345 the type of the LHS.
17346 (gimple_assign_nonnegative_warnv_p): Likewise.
17347 (gimple_call_nonnegative_warnv_p): Likewise. Return false
17348 if the call has no LHS.
17349 * gimple.c (gimple_could_trap_p_1): Use the type of the LHS.
17350 * tree-eh.c (stmt_could_throw_1_p): Likewise.
17351 * tree-inline.c (insert_init_stmt): Likewise.
17352 * tree-ssa-loop-niter.c (get_val_for): Likewise.
17353 * tree-outof-ssa.c (ssa_is_replaceable_p): Use the type of
17355 * tree-ssa-sccvn.c (init_vn_nary_op_from_stmt): Take a
17356 gassign *. Use the type of the lhs.
17357 (vn_nary_op_lookup_stmt): Adjust.
17358 (vn_nary_op_insert_stmt): Likewise.
17360 2021-07-16 Ilya Leoshkevich <iii@linux.ibm.com>
17362 * config/s390/predicates.md (bras_sym_operand): Accept all
17363 functions in 64-bit mode, use UNSPEC_PLT31.
17364 (larl_operand): Use UNSPEC_PLT31.
17365 * config/s390/s390.c (s390_loadrelative_operand_p): Likewise.
17366 (legitimize_pic_address): Likewise.
17367 (s390_emit_tls_call_insn): Mark __tls_get_offset as function,
17369 (s390_delegitimize_address): Use UNSPEC_PLT31.
17370 (s390_output_addr_const_extra): Likewise.
17371 (print_operand): Add @PLT to TLS calls, handle %K.
17372 (s390_function_profiler): Mark __fentry__/_mcount as function,
17373 use %K, use UNSPEC_PLT31.
17374 (s390_output_mi_thunk): Use only UNSPEC_GOT, use %K.
17375 (s390_emit_call): Use UNSPEC_PLT31.
17376 (s390_emit_tpf_eh_return): Mark __tpf_eh_return as function.
17377 * config/s390/s390.md (UNSPEC_PLT31): Rename from UNSPEC_PLT.
17378 (*movdi_64): Use %K.
17379 (reload_base_64): Likewise.
17380 (*sibcall_brc): Likewise.
17381 (*sibcall_brcl): Likewise.
17382 (*sibcall_value_brc): Likewise.
17383 (*sibcall_value_brcl): Likewise.
17385 (*brasl): Likewise.
17386 (*bras_r): Likewise.
17387 (*brasl_r): Likewise.
17388 (*bras_tls): Likewise.
17389 (*brasl_tls): Likewise.
17390 (main_base_64): Likewise.
17391 (reload_base_64): Likewise.
17392 (@split_stack_call<mode>): Likewise.
17394 2021-07-16 Richard Biener <rguenther@suse.de>
17396 PR tree-optimization/101467
17397 * tree-vect-stmts.c (vect_gen_while): Properly guard
17398 make_temp_ssa_name usage.
17400 2021-07-16 Cooper Qu <cooper.qu@linux.alibaba.com>
17402 * config.gcc: Don't use forked print-sysroot-suffix.sh and
17403 t-sysroot-suffix for C-SKY.
17404 * config/csky/print-sysroot-suffix.sh: Delete.
17405 * config/csky/t-csky-linux: Delete.
17406 * config/csky/t-sysroot-suffix: Define MULTILIB_DIRNAMES
17407 instead of CSKY_MULTILIB_DIRNAMES.
17409 2021-07-16 Richard Biener <rguenther@suse.de>
17411 * tree-vect-loop.c (vect_transform_cycle_phi): Correct sign
17412 conversion issues with the partial reduction of the reused
17413 vector accumulator.
17415 2021-07-16 Richard Biener <rguenther@suse.de>
17417 * config/i386/i386-options.c (ix86_option_override_internal): Set
17418 param_vect_partial_vector_usage to zero if not set.
17420 2021-07-15 Uroš Bizjak <ubizjak@gmail.com>
17423 * config/i386/i386.h (VALID_SSE_REG_MODE): Add TDmode.
17424 (VALID_INT_MODE_P): Add SDmode and DDmode.
17425 Add TDmode for TARGET_64BIT.
17426 (VALID_DFP_MODE_P): Remove.
17427 * config/i386/i386.c (ix86_hard_regno_mode_ok):
17428 Do not use VALID_DFP_MODE_P.
17430 2021-07-15 Andrew MacLeod <amacleod@redhat.com>
17432 * gimple-range-fold.cc (adjust_pointer_diff_expr): Use
17434 (fold_using_range::fold_stmt): Ditto.
17435 (fold_using_range::range_of_range_op): Ditto.
17436 (fold_using_range::range_of_phi): Ditto.
17437 (fold_using_range::range_of_call): Ditto.
17438 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
17439 (fold_using_range::range_of_builtin_call): Ditto.
17440 (fold_using_range::range_of_cond_expr): Ditto.
17441 * gimple-range-fold.h (gimple_range_type): New.
17443 2021-07-15 Martin Sebor <msebor@redhat.com>
17445 PR middle-end/97027
17446 * tree-ssa-strlen.c (handle_assign): New function.
17447 (maybe_warn_overflow): Add argument.
17448 (nonzero_bytes_for_type): New function.
17449 (count_nonzero_bytes): Handle more tree types. Call
17450 nonzero_bytes_for_tye.
17451 (count_nonzero_bytes): Handle types.
17452 (handle_store): Handle stores from function calls.
17453 (strlen_check_and_optimize_call): Move code to handle_assign. Call
17454 it for assignments from function calls.
17456 2021-07-15 David Malcolm <dmalcolm@redhat.com>
17461 * doc/invoke.texi: Add -Wanalyzer-use-of-uninitialized-value.
17463 2021-07-15 David Malcolm <dmalcolm@redhat.com>
17465 * doc/invoke.texi (-fdump-analyzer-exploded-paths): New.
17467 2021-07-15 Martin Sebor <msebor@redhat.com>
17471 * fold-const.c (operand_compare::operand_equal_p): Handle OEP_DECL_NAME.
17472 (operand_compare::verify_hash_value): Same.
17473 * tree-core.h (OEP_DECL_NAME): New.
17475 2021-07-15 Martin Jambor <mjambor@suse.cz>
17477 * profile-count.h (profile_count::value): Change the return type to
17479 * gimple-pretty-print.c (dump_gimple_bb_header): Adjust print
17481 * tree-cfg.c (dump_function_to_file): Likewise.
17483 2021-07-15 Bill Schmidt <wschmidt@linux.ibm.com>
17486 * config/rs6000/rs6000-p8swap.c (has_part_mult): New.
17487 (rs6000_analyze_swaps): Insns containing a subreg of a mult are
17490 2021-07-15 Richard Biener <rguenther@suse.de>
17492 * tree-vectorizer.h (vect_gen_while): Match up with
17493 vect_gen_while_not.
17494 * tree-vect-stmts.c (vect_gen_while): Adjust API to that
17495 of vect_gen_while_not.
17496 (vect_gen_while_not): Adjust.
17497 * tree-vect-loop-manip.c (vect_set_loop_controls_directly): Likewise.
17499 2021-07-15 Aldy Hernandez <aldyh@redhat.com>
17501 * gimple-range-cache.cc (non_null_ref::adjust_range): New.
17502 (ranger_cache::range_of_def): Call adjust_range.
17503 (ranger_cache::entry_range): Same.
17504 * gimple-range-cache.h (non_null_ref::adjust_range): New.
17505 * gimple-range.cc (gimple_ranger::range_of_expr): Call
17507 (gimple_ranger::range_on_entry): Same.
17509 2021-07-15 Tamar Christina <tamar.christina@arm.com>
17512 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17514 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
17516 2021-07-15 Tamar Christina <tamar.christina@arm.com>
17519 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17521 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
17522 (sdot_prod, udot_prod): ...These.
17523 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
17524 (aarch64_<sur>dot<vsi2qi>): Rename to...
17525 (<sur>dot_prod<vsi2qi>): ...This.
17526 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
17529 2021-07-15 Jakub Jelinek <jakub@redhat.com>
17531 PR middle-end/101437
17532 * gimplify.c (gimplify_expr): Throw away volatile reads from empty
17533 types even if they have non-BLKmode TYPE_MODE.
17535 2021-07-15 Richard Biener <rguenther@suse.de>
17538 * gcc.c (process_command): Process -gtoggle like process_options
17539 would after parsing options.
17541 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
17543 * cfgexpand.c (expand_asm_loc): Adjust.
17544 (expand_asm_stmt): Likewise.
17545 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Likewise.
17546 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
17547 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
17548 * config/avr/avr.c (avr_md_asm_adjust): Likewise.
17549 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
17550 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
17551 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
17552 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
17553 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
17554 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
17555 * config/s390/s390.c (s390_md_asm_adjust): Likewise.
17556 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
17557 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
17558 * doc/tm.texi: Regenerate.
17559 * target.def: Add location argument to md_asm_adjust.
17561 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
17563 * tree-diagnostic.c (diagnostic_report_current_function): Use the
17564 diagnostic's location, not input_location.
17566 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
17568 * cfgexpand.c (tree_conflicts_with_clobbers_p): Pass location to
17570 (expand_asm_stmt): Likewise.
17572 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
17574 * config/rs6000/rs6000.c (adjacent_mem_locations): Return the lower
17575 addressed memory rtx, if any.
17576 (rs6000_split_multireg_move): Fix code formatting.
17577 Handle MMA build built-ins with operands in adjacent memory locations.
17579 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
17581 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Move to later
17584 2021-07-14 Jason Merrill <jason@redhat.com>
17586 * sel-sched-ir.h (get_all_loop_exits): Use auto_vec.
17588 2021-07-14 Jason Merrill <jason@redhat.com>
17590 * doc/invoke.texi: -fdelete-dead-exceptions is on by default for
17593 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17595 * tree-vect-patterns.c (vect_recog_dot_prod_pattern):
17596 Remove erroneous line.
17598 2021-07-14 Andrew MacLeod <amacleod@redhat.com>
17600 * params.opt (param_evrp_mode): Change default.
17602 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17604 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
17605 (sdot_prod, udot_prod): ...These.
17606 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
17607 (aarch64_<sur>dot<vsi2qi>): Rename to...
17608 (<sur>dot_prod<vsi2qi>): ...This.
17609 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
17612 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17614 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
17616 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17618 * doc/sourcebuild.texi (arm_v8_2a_i8mm_neon_hw): Document.
17620 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17622 * config/arm/neon.md (usdot_prod<vsi2qi>): New.
17624 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17626 * config/aarch64/aarch64-simd.md (aarch64_usdot<vsi2qi>): Rename to...
17627 (usdot_prod<vsi2qi>): ... This.
17628 * config/aarch64/aarch64-simd-builtins.def (usdot): Rename to...
17629 (usdot_prod): ...This.
17630 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Likewise.
17631 * config/aarch64/aarch64-sve.md (@aarch64_<sur>dot_prod<vsi2qi>):
17633 (@<sur>dot_prod<vsi2qi>): ...This.
17634 * config/aarch64/aarch64-sve-builtins-base.cc
17635 (svusdot_impl::expand): Use it.
17637 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17639 * optabs.def (usdot_prod_optab): New.
17640 * doc/md.texi: Document it and clarify other dot prod optabs.
17641 * optabs-tree.h (enum optab_subtype): Add optab_vector_mixed_sign.
17642 * optabs-tree.c (optab_for_tree_code): Support usdot_prod_optab.
17643 * optabs.c (expand_widen_pattern_expr): Likewise.
17644 * tree-cfg.c (verify_gimple_assign_ternary): Likewise.
17645 * tree-vect-loop.c (vectorizable_reduction): Query dot-product kind.
17646 * tree-vect-patterns.c (vect_supportable_direct_optab_p): Take optional
17648 (vect_widened_op_tree): Optionally ignore
17650 (vect_recog_dot_prod_pattern): Support usdot_prod_optab.
17652 2021-07-14 H.J. Lu <hjl.tools@gmail.com>
17655 * config/i386/driver-i386.c (host_detect_local_cpu): Check
17656 "arch [32|64]" and "tune [32|64]" for 32-bit and 64-bit codegen.
17657 Enable UINTR only for 64-bit codegen.
17658 * config/i386/i386-options.c
17659 (ix86_option_override_internal::DEF_PTA): Skip PTA_UINTR if not
17661 * config/i386/i386.h (ARCH_ARG): New.
17662 (CC1_CPU_SPEC): Pass "[arch|tune] 32" for 32-bit codegen and
17663 "[arch|tune] 64" for 64-bit codegen.
17665 2021-07-14 Richard Biener <rguenther@suse.de>
17667 PR tree-optimization/101445
17668 * tree-vect-stmts.c (vectorizable_load): Do the gap adjustment
17669 of the IV in the correct direction for negative stride
17672 2021-07-14 Jakub Jelinek <jakub@redhat.com>
17675 * godump.c (godump_str_hash): New type.
17676 (godump_container::pot_dummy_types): Use string_hash instead of
17677 ptr_hash in the hash_set.
17679 2021-07-14 Richard Biener <rguenther@suse.de>
17681 * tree-vect-loop.c (vect_find_reusable_accumulator): Handle
17682 vector types where the old vector type has a multiple of
17683 the new vector type elements.
17684 (vect_create_partial_epilog): New function, split out from...
17685 (vect_create_epilog_for_reduction): ... here.
17686 (vect_transform_cycle_phi): Reduce the re-used accumulator
17687 to the new vector type.
17689 2021-07-14 Alexandre Oliva <oliva@adacore.com>
17691 * tree-ssa-alias.c (attr_fnspec::verify): Fix index in
17692 non-'t'-sized arg check.
17694 2021-07-14 Alexandre Oliva <oliva@adacore.com>
17696 * tree-cfg.c (cleanup_dead_labels_eh): Update
17697 post_landing_pad label upon change of landing pad block's
17699 (cleanup_dead_labels): Check that a removed label is not that
17702 2021-07-13 Jonathan Wright <jonathan.wright@arm.com>
17704 * combine.c (combine_simplify_rtx): Add vec_select -> subreg
17706 * config/aarch64/aarch64.md (*zero_extend<SHORT:mode><GPI:mode>2_aarch64):
17707 Add Neon to general purpose register case for zero-extend
17709 * config/arm/vfp.md (*arm_movsi_vfp): Remove "*" from *t -> r
17710 case to prevent some cases opting to go through memory.
17711 * cse.c (fold_rtx): Add vec_select -> subreg simplification.
17712 * rtl.c (rtvec_series_p): Define predicate to determine
17713 whether a vector contains a linear series of integers.
17714 * rtl.h (rtvec_series_p): Define.
17715 * rtlanal.c (vec_series_lowpart_p): Define predicate to
17716 determine if a vector selection is equivalent to the low part
17718 * rtlanal.h (vec_series_lowpart_p): Define.
17719 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
17720 Add vec_select -> subreg simplification.
17722 2021-07-13 Paul A. Clarke <pc@us.ibm.com>
17724 * config/rs6000/smmintrin.h (_mm_testz_si128, _mm_testc_si128,
17725 _mm_testnzc_si128, _mm_test_all_ones, _mm_test_all_zeros,
17726 _mm_test_mix_ones_zeros): New.
17728 2021-07-13 Roger Sayle <roger@nextmovesoftware.com>
17729 Richard Biener <rguenther@suse.de>
17731 * gimple.c (gimple_could_trap_p_1): Make S argument a
17732 "const gimple*". Preserve constness in call to
17733 gimple_asm_volatile_p.
17734 (gimple_could_trap_p): Make S argument a "const gimple*".
17735 * gimple.h (gimple_could_trap_p_1, gimple_could_trap_p):
17736 Update function prototypes.
17738 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17740 * tree-vectorizer.h (vect_reusable_accumulator): New structure.
17741 (_loop_vec_info::main_loop_edge): New field.
17742 (_loop_vec_info::skip_main_loop_edge): Likewise.
17743 (_loop_vec_info::skip_this_loop_edge): Likewise.
17744 (_loop_vec_info::reusable_accumulators): Likewise.
17745 (_stmt_vec_info::reduc_scalar_results): Likewise.
17746 (_stmt_vec_info::reused_accumulator): Likewise.
17747 (vect_get_main_loop_result): Declare.
17748 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize
17749 reduc_scalar_inputs.
17750 (vec_info::free_stmt_vec_info): Free reduc_scalar_inputs.
17751 * tree-vect-loop-manip.c (vect_get_main_loop_result): New function.
17752 (vect_do_peeling): Fill an epilogue loop's main_loop_edge,
17753 skip_main_loop_edge and skip_this_loop_edge fields.
17754 * tree-vect-loop.c (INCLUDE_ALGORITHM): Define.
17755 (vect_emit_reduction_init_stmts): New function.
17756 (get_initial_def_for_reduction): Use it.
17757 (get_initial_defs_for_reduction): Likewise. Change the vinfo
17758 parameter to a loop_vec_info.
17759 (vect_create_epilog_for_reduction): Store the scalar results
17760 in the reduc_info. If an epilogue loop is reusing an accumulator
17761 from the main loop, and if the epilogue loop can also be skipped,
17762 try to place the reduction code in the join block. Record
17763 accumulators that could potentially be reused by epilogue loops.
17764 (vect_transform_cycle_phi): When vectorizing epilogue loops,
17765 try to reuse accumulators from the main loop. Record the initial
17766 value in reduc_info for non-SLP reductions too.
17768 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17770 * tree-vect-loop.c (get_initial_def_for_reduction): Remove
17771 adjustment handling. Take the neutral value as an argument,
17772 in place of the code argument.
17773 (vect_transform_cycle_phi): Update accordingly. Handle the
17774 initial values of cond reductions separately from code reductions.
17775 Choose the adjustment here rather than in
17776 get_initial_def_for_reduction. Sink the splat of vec_initial_def.
17778 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17780 * tree-vect-loop.c (neutral_op_for_slp_reduction): Replace with...
17781 (neutral_op_for_reduction): ...this, providing a more general
17783 (vect_create_epilog_for_reduction): Update accordingly.
17784 (vectorizable_reduction): Likewise.
17785 (vect_transform_cycle_phi): Likewise.
17787 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17789 * tree-vect-loop.c (get_initial_def_for_reduction): Take the
17790 reduc_info instead of the original stmt_vec_info.
17791 (vect_transform_cycle_phi): Update accordingly.
17793 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17795 * tree-vect-loop.c (get_initial_defs_for_reduction): Take the
17796 reduc_info as an additional parameter.
17797 (vect_transform_cycle_phi): Update accordingly.
17799 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17801 * tree-vectorizer.h: Include tree-ssa-operands.h.
17802 (vect_phi_initial_value): New function.
17803 * tree-vect-loop.c (neutral_op_for_slp_reduction): Use it.
17804 (get_initial_defs_for_reduction, info_for_reduction): Likewise.
17805 (vect_create_epilog_for_reduction, vectorizable_reduction): Likewise.
17806 (vect_transform_cycle_phi, vectorizable_induction): Likewise.
17808 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17810 * tree-vect-loop.c (vect_create_epilog_for_reduction): Convert
17811 the phi results to vectype after creating them. Remove later
17812 conversion code that thus becomes redundant.
17814 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17816 * tree-vect-loop.c (vect_create_epilog_for_reduction): Replace
17817 the new_phis vector with a reduc_inputs vector. Combine handling
17818 of reduction chains and ncopies > 1.
17820 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17822 * tree-vect-loop.c (vect_create_epilog_for_reduction): Truncate
17823 scalar_results to group_size elements after reducing down from
17824 N*group_size elements. Construct an array_slice of the live-out
17825 stmts and assert that there is one stmt per scalar result.
17827 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17829 * tree-vect-loop.c (vect_create_epilog_for_reduction): Remove
17830 nested_in_vect_loop and use double_reduc everywhere. Remove dead
17831 assignment to "loop".
17833 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17835 * internal-fn.c (vectorized_internal_fn_supported_p): Handle
17836 vector types first. For scalar types, consider both the preferred
17837 vector mode and the alternative vector modes.
17838 * optabs-query.c (can_vec_mask_load_store_p): Use the same
17839 structure as above, in particular using related_vector_mode
17840 for modes provided by autovectorize_vector_modes.
17842 2021-07-13 Jakub Jelinek <jakub@redhat.com>
17843 Richard Biener <rguenther@suse.de>
17845 PR tree-optimization/101419
17846 * tree-pass.h (PROP_objsz): Define.
17847 (make_pass_early_object_sizes): Declare.
17848 * passes.def (pass_all_early_optimizations): Rename pass_object_sizes
17849 there to pass_early_object_sizes, drop parameter.
17850 (pass_all_optimizations): Move pass_object_sizes right after pass_ccp,
17851 drop parameter, move pass_post_ipa_warn right after that.
17852 * tree-object-size.c (pass_object_sizes::execute): Rename to...
17853 (object_sizes_execute): ... this. Add insert_min_max_p argument.
17854 (pass_data_object_sizes): Move after object_sizes_execute.
17855 (pass_object_sizes): Likewise. In execute method call
17856 object_sizes_execute, drop set_pass_param method and insert_min_max_p
17857 non-static data member and its initializer in the ctor.
17858 (pass_data_early_object_sizes, pass_early_object_sizes,
17859 make_pass_early_object_sizes): New.
17860 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Use
17861 (cfun->curr_properties & PROP_objsz) instead of cfun->after_inlining.
17863 2021-07-13 Kito Cheng <kito.cheng@sifive.com>
17866 * config/riscv/constraints.md ("S"): Update description and remove
17868 * doc/md.texi (Machine Constraints): Document the 'S' constraints
17871 2021-07-13 Richard Biener <rguenther@suse.de>
17874 2021-07-12 Richard Biener <rguenther@suse.de>
17876 * tree-vect-slp.c (vect_slp_region): Show the number of
17877 SLP graph entries in the optimization message.
17879 2021-07-13 Michael Meissner <meissner@linux.ibm.com>
17881 * config/rs6000/altivec.md (xxspltiw_v4sf): Change local variable
17883 * config/rs6000/rs6000-protos.h (rs6000_const_f32_to_i32): Change
17884 return type to long.
17885 * config/rs6000/rs6000.c (rs6000_const_f32_to_i32): Change return
17888 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
17890 * gimple-range-fold.cc (fold_using_range::range_of_builtin_ubsan_call):
17891 Query relation between the 2 operands and use it.
17893 2021-07-12 Sergei Trofimovich <siarheit@google.com>
17895 * doc/cfg.texi: Fix s/ei_safe_safe/ei_safe_edge/ typo.
17897 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
17900 * config/i386/predicates.md (vec_setm_sse41_operand):
17901 Rename from vec_setm_operand.
17902 (vec_setm_avx2_operand): New predicate.
17903 * config/i386/sse.md (vec_set<V_128:mode>): Use V_128 mode iterator.
17904 Use vec_setm_sse41_operand as operand 2 predicate.
17905 (vec_set<V_256_512:mode): New expander.
17906 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_sse41_operand
17907 as operand 2 predicate.
17909 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
17911 PR tree-optimization/101335
17912 * range-op.cc (operator_cast::lhs_op1_relation): Delete.
17914 2021-07-12 Andrew Pinski <apinski@marvell.com>
17916 * tree-ssa-phiopt.c (match_simplify_replacement): Move
17917 insert of the sequence before the movement of the
17918 statement. Check if to see if the statement is used
17919 outside of the original phi to see if we should move it.
17921 2021-07-12 Richard Biener <rguenther@suse.de>
17923 * dump-context.h (debug_dump_context::debug_dump_context):
17924 Add FILE * parameter defaulted to stderr.
17925 * dumpfile.c (debug_dump_context::debug_dump_context): Adjust.
17926 * tree-vect-slp.c (dot_slp_tree): New functions.
17928 2021-07-12 Richard Biener <rguenther@suse.de>
17930 PR tree-optimization/101373
17931 * tree-ssa-pre.c (prune_clobbered_mems): Also prune trapping
17932 references when the BB may not return.
17933 (compute_avail): Pass in the function we're working on and
17934 replace cfun references with it. Externally throwing
17935 const calls also possibly terminate the function.
17936 (pass_pre::execute): Pass down the function we're working on.
17937 * gcse.c (compute_hash_table_work): Externally throwing
17938 const/pure calls also need record_last_mem_set_info.
17939 * postreload-gcse.c (record_opr_changes): Looping or externally
17940 throwing const/pure calls also need record_last_mem_set_info.
17942 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
17944 * recog.c (memory_address_addr_space_p): Change the type to bool.
17945 Return true/false instead of 1/0.
17946 (offsettable_memref_p): Ditto.
17947 (offsettable_nonstrict_memref_p): Ditto.
17948 (offsettable_address_addr_space_p): Ditto.
17949 Change the type of addressp indirect function to bool.
17950 * recog.h (memory_address_addr_space_p): Change the type to bool.
17951 (strict_memory_address_addr_space_p): Ditto.
17952 (offsettable_memref_p): Ditto.
17953 (offsettable_nonstrict_memref_p): Ditto.
17954 (offsettable_address_addr_space_p): Ditto.
17955 * reload.c (maybe_memory_address_addr_space_p): Ditto.
17956 (strict_memory_address_addr_space_p): Change the type to bool.
17957 Return true/false instead of 1/0.
17958 (maybe_memory_address_addr_space_p): Change the type to bool.
17960 2021-07-12 Richard Biener <rguenther@suse.de>
17962 * tree-vect-slp.c (vect_slp_region): Show the number of
17963 SLP graph entries in the optimization message.
17965 2021-07-12 Richard Biener <rguenther@suse.de>
17967 PR tree-optimization/101394
17968 * tree-ssa-pre.c (do_pre_regular_insertion): Avoid inserting
17969 copies from abnormals for a full redundancy.
17971 2021-07-12 Richard Biener <rguenther@suse.de>
17973 PR middle-end/101423
17974 * gimple.c (gimple_could_trap_p_1): Internal function calls
17976 * tree-eh.c (tree_could_trap_p): Likewise.
17978 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
17981 * config/arm/arm_neon.h (vmul_n_u32): Replace call to builtin with
17983 (vmulq_n_u32): Likewise.
17984 (vmul_n_f32): Gate __a * __b on __FAST_MATH__.
17985 (vmulq_n_f32): Likewise.
17986 (vmul_n_f16): Likewise.
17987 (vmulq_n_f16): Likewise.
17989 2021-07-12 Martin Liska <mliska@suse.cz>
17991 PR sanitizer/101425
17992 * gcc.c (check_offload_target_name): Call
17993 candidates_list_and_hint only if we have a candidate.
17995 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
17998 * config/arm/neon.md (vec_init): Move to ...
17999 * config/arm/vec-common.md (vec_init): ... here.
18000 Change the pattern's mode to VDQX and gate it on VALID_MVE_MODE.
18002 2021-07-12 Roger Sayle <roger@nextmovesoftware.com>
18004 PR tree-optimization/101403
18005 * match.pd ((T)bswap(X)>>C): Correctly handle cases where
18006 signedness of the shift is not the same as the signedness of
18007 the type extension.
18009 2021-07-09 Roger Sayle <roger@nextmovesoftware.com>
18010 Uroš Bizjak <ubizjak@gmail.com>
18012 * config/i386/i386.md (*divmodsi4_const): Optimize SImode
18013 divmod of a constant numerator with new define_insn_and_split.
18015 2021-07-09 Iain Sandoe <iain@sandoe.co.uk>
18018 * config/i386/i386-expand.c (ix86_expand_call): If a call is
18019 to a non-local-binding, or local but to a public symbol, then
18020 assume that it might be indirected via the lazy symbol binder.
18021 Mark R10 and R10 as clobbered in that case.
18023 2021-07-09 Eric Botcazou <ebotcazou@adacore.com>
18026 * gcc.c (ASM_DEBUG_DWARF_OPTION): Set again to --gdwarf2 in
18027 the case where HAVE_AS_WORKING_DWARF_N_FLAG is not defined
18028 and HAVE_LD_BROKEN_PE_DWARF5 is defined.
18030 2021-07-09 Uroš Bizjak <ubizjak@gmail.com>
18032 * config/i386/i386.md (*udivmodsi4_pow2_zext_1): Limit the
18033 log2 range of operands[3] to [1,31].
18034 (*udivmodsi4_pow2_zext_2): Ditto. Correct insn RTX pattern.
18036 2021-07-09 Sergei Trofimovich <siarheit@google.com>
18038 * doc/md.texi: Don't split @smallexample in multiple @groups.
18040 2021-07-09 Sergei Trofimovich <siarheit@google.com>
18042 * doc/md.texi: Add missing 'see' word.
18044 2021-07-09 Andrew Pinski <apinski@marvell.com>
18046 * tree-ssa-phiopt.c (phiopt_early_allow): Change arguments
18047 to take sequence and gimple_match_op. Accept the case where
18048 op is a SSA_NAME and one statement in the sequence.
18049 Also allow constants.
18050 (gimple_simplify_phiopt): Always pass a sequence to resimplify.
18051 Update call to phiopt_early_allow. Discard the sequence if not
18054 2021-07-09 Xi Ruoyao <xry111@mengyan1223.wang>
18059 * config/mips/mips.c (mips_const_insns): Use MSA_SUPPORTED_MODE_P
18060 instead of ISA_HAS_MSA.
18061 (mips_expand_vec_unpack): Likewise.
18062 (mips_expand_vector_init): Likewise.
18064 2021-07-09 Kewen Lin <linkw@linux.ibm.com>
18066 * config/rs6000/vsx.md (mods_<mode>): Rename to...
18067 (mod<mode>3): ... this.
18068 (modu_<mode>): Rename to...
18069 (umod<mode>3): ... this.
18070 * config/rs6000/rs6000-builtin.def (MODS_V2DI, MODS_V4SI, MODU_V2DI,
18071 MODU_V4SI): Adjust.
18073 2021-07-08 Jeff Law <jeffreyalaw@gmail.com>
18075 * config/h8300/shiftrotate.md (variable shifts): Expose condition
18076 code handling for the test before the loop.
18078 2021-07-08 Martin Jambor <mjambor@suse.cz>
18081 * ipa-sra.c (class isra_call_summary): New member
18082 m_before_any_store, initialize it in the constructor.
18083 (isra_call_summary::dump): Dump the new field.
18084 (ipa_sra_call_summaries::duplicate): Copy it.
18085 (process_scan_results): Set it.
18086 (isra_write_edge_summary): Stream it.
18087 (isra_read_edge_summary): Likewise.
18088 (param_splitting_across_edge): Only override
18089 safe_to_import_accesses if m_before_any_store is set.
18091 2021-07-08 Martin Sebor <msebor@redhat.com>
18093 PR bootstrap/101374
18094 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref):
18095 Use Object Size Type 0 instead of 1.
18097 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
18099 * tree-vect-loop.c (vectorizable_reduction): Remove always-true
18102 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
18104 * match.pd: Simplify an extend-operate-truncate sequence involving
18107 2021-07-08 Roger Sayle <roger@nextmovesoftware.com>
18108 Richard Biener <rguenther@suse.de>
18110 PR tree-optimization/40210
18111 * match.pd (bswap optimizations): Simplify (bswap(x)>>C1)&C2 as
18112 (x>>C3)&C2 when possible. Simplify bswap(x)>>C1 as ((T)x)>>C2
18113 when possible. Simplify bswap(x)&C1 as (x>>C2)&C1 when 0<=C1<=255.
18115 2021-07-08 Uroš Bizjak <ubizjak@gmail.com>
18118 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
18120 * config/i386/mmx.md (V_32): New mode iterator.
18121 (mov<V_32:mode>): Use V_32 mode iterator.
18122 (*mov<V_32:mode>_internal): Ditto.
18123 (*push<V_32:mode>2_rex64): Ditto.
18124 (*push<V_32:mode>2): Ditto.
18125 (movmisalign<V_32:mode>): Ditto.
18126 (mmx_<any_shiftrt:insn>v1si3): New insn pattern.
18127 (sse4_1_<any_extend:code>v2qiv2hi2): Ditto.
18128 (vec_unpacks_lo_v4qi): New expander.
18129 (vec_unpacks_hi_v4qi): Ditto.
18130 (vec_unpacku_lo_v4qi): Ditto.
18131 (vec_unpacku_hi_v4qi): Ditto.
18132 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add V1SImode.
18133 (VALID_INT_MODE_P): Ditto.
18135 2021-07-08 Michael Meissner <meissner@linux.ibm.com>
18138 * config/rs6000/rs6000.md (udivti3): New insn.
18139 (divti3): New insn.
18140 (umodti3): New insn.
18141 (modti3): New insn.
18143 2021-07-07 Martin Sebor <msebor@redhat.com>
18145 PR tree-optimization/100137
18146 PR tree-optimization/99121
18147 PR tree-optimization/97027
18148 * builtins.c (access_ref::access_ref): Also set offmax.
18149 (access_ref::offset_in_range): Define new function.
18150 (access_ref::add_offset): Set offmax.
18151 (access_ref::inform_access): Handle access_none.
18152 (handle_mem_ref): Clear ostype.
18153 (compute_objsize_r): Handle ASSERT_EXPR.
18154 * builtins.h (struct access_ref): Add offmax member.
18155 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Use
18156 compute_objsize() and simplify.
18158 2021-07-07 Peter Bergner <bergner@linux.ibm.com>
18160 * config/rs6000/rs6000-call.c (mma_init_builtins): Use VSX_BUILTIN_LXVP
18161 and VSX_BUILTIN_STXVP.
18163 2021-07-07 Martin Sebor <msebor@redhat.com>
18166 * config/aarch64/aarch64.c (aarch64_simd_lane_bounds): Remove
18167 a stray %K from error_at() missed in r12-2088.
18169 2021-07-07 Richard Biener <rguenther@suse.de>
18171 PR tree-optimization/99728
18172 * tree-ssa-loop-im.c (gather_mem_refs_stmt): Record
18174 (mem_refs_may_alias_p): Add assert we handled aggregate
18176 (sm_seq_valid_bb): Give up when running into aggregate copies.
18177 (ref_indep_loop_p): Handle aggregate copies as never
18178 being invariant themselves but allow other refs to be
18179 disambiguated against them.
18180 (can_sm_ref_p): Do not try to apply store-motion to aggregate
18183 2021-07-06 Indu Bhagat <indu.bhagat@oracle.com>
18186 * dwarf2ctf.c (ctf_get_AT_data_member_location): Multiply by 8 to get
18189 2021-07-06 Martin Sebor <msebor@redhat.com>
18191 * gimple-pretty-print.c (percent_G_format): Remove.
18192 * tree-diagnostic.c (default_tree_printer): Remove calls.
18193 * tree-pretty-print.c (percent_K_format): Remove.
18194 * tree-pretty-print.h (percent_K_format): Remove.
18196 2021-07-06 Martin Sebor <msebor@redhat.com>
18198 * config/aarch64/aarch64-builtins.c (aarch64_simd_expand_builtin):
18199 Remove %K and use error_at.
18200 (aarch64_expand_fcmla_builtin): Same.
18201 (aarch64_expand_builtin_tme): Same.
18202 (aarch64_expand_builtin_memtag): Same.
18203 * config/arm/arm-builtins.c (arm_expand_acle_builtin): Same.
18204 (arm_expand_builtin): Same.
18205 * config/arm/arm.c (bounds_check): Same.
18207 2021-07-06 Martin Sebor <msebor@redhat.com>
18209 * builtins.c (warn_string_no_nul): Remove %G.
18210 (maybe_warn_for_bound): Same.
18211 (warn_for_access): Same.
18212 (check_access): Same.
18213 (check_strncat_sizes): Same.
18214 (expand_builtin_strncat): Same.
18215 (expand_builtin_strncmp): Same.
18216 (expand_builtin): Same.
18217 (expand_builtin_object_size): Same.
18218 (warn_dealloc_offset): Same.
18219 (maybe_emit_free_warning): Same.
18220 * calls.c (maybe_warn_alloc_args_overflow): Same.
18221 (maybe_warn_nonstring_arg): Same.
18222 (maybe_warn_rdwr_sizes): Same.
18223 * expr.c (expand_expr_real_1): Remove %K.
18224 * gimple-fold.c (gimple_fold_builtin_strncpy): Remove %G.
18225 (gimple_fold_builtin_strncat): Same.
18226 * gimple-ssa-sprintf.c (format_directive): Same.
18227 (handle_printf_call): Same.
18228 * gimple-ssa-warn-alloca.c (pass_walloca::execute): Same.
18229 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
18230 (maybe_diag_access_bounds): Same. Call gimple_location.
18231 (check_bounds_or_overlap): Same.
18232 * trans-mem.c (ipa_tm_scan_irr_block): Remove %K. Simplify.
18233 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Remove %G.
18234 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
18235 (maybe_diag_stxncpy_trunc): Same.
18236 (handle_builtin_stxncpy_strncat): Same.
18237 (maybe_warn_pointless_strcmp): Same.
18238 * tree-ssa-uninit.c (maybe_warn_operand): Same.
18240 2021-07-06 Uroš Bizjak <ubizjak@gmail.com>
18243 * config/i386/predicates.md (vec_setm_operand): Enable
18244 register_operand for TARGET_SSE4_1.
18245 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_operand
18246 as operand 2 predicate. Call ix86_expand_vector_set_var
18247 for non-constant index operand.
18248 (vec_setv4qi): Use vec_setm_mmx_operand as operand 2 predicate.
18249 Call ix86_expand_vector_set_var for non-constant index operand.
18251 2021-07-06 Jeff Law <jeffreyalaw@gmail.com>
18253 * config/h8300/jumpcall.md (*branch): When possible, generate
18254 the comparison in CCZN mode.
18255 * config/h8300/predicates.md (simple_memory_operand): Reject all
18256 auto-increment addressing modes.
18258 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
18260 PR bootstrap/100246
18261 * config/i386/i386.h (struct stringop_algs): Define a CTOR for
18264 2021-07-06 Richard Biener <rguenther@suse.de>
18266 * doc/md.texi (vec_fmaddsub<mode>4): Document.
18267 (vec_fmsubadd<mode>4): Likewise.
18268 * optabs.def (vec_fmaddsub$a4): Add.
18269 (vec_fmsubadd$a4): Likewise.
18270 * internal-fn.def (IFN_VEC_FMADDSUB): Add.
18271 (IFN_VEC_FMSUBADD): Likewise.
18272 * tree-vect-slp-patterns.c (addsub_pattern::recognize):
18273 Refactor to handle IFN_VEC_FMADDSUB and IFN_VEC_FMSUBADD.
18274 (addsub_pattern::build): Likewise.
18275 * tree-vect-slp.c (vect_optimize_slp): CFN_VEC_FMADDSUB
18276 and CFN_VEC_FMSUBADD are not transparent for permutes.
18277 * config/i386/sse.md (vec_fmaddsub<mode>4): New expander.
18278 (vec_fmsubadd<mode>4): Likewise.
18280 2021-07-06 Richard Biener <rguenther@suse.de>
18282 * doc/invoke.texi (fmove-loop-stores): Document.
18283 * common.opt (fmove-loop-stores): New option.
18284 * opts.c (default_options_table): Enable -fmove-loop-stores
18285 at -O1 but not -Og.
18286 * tree-ssa-loop-im.c (pass_lim::execute): Pass
18287 flag_move_loop_stores instead of true to
18288 loop_invariant_motion_in_fun.
18290 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
18292 * doc/install.texi: Document --with-dsymutil.
18294 2021-07-06 Andrew Pinski <apinski@marvell.com>
18296 PR tree-optimization/101256
18297 * dbgcnt.def (phiopt_edge_range): New counter.
18298 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
18299 Check to make sure the new name is defined in the same
18300 bb as the conditional before duplicating range info.
18301 Also add debug counter.
18303 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
18305 PR rtl-optimization/100328
18306 * config/i386/i386-options.c (ix86_option_override_internal):
18307 Set param_ira_consider_dup_in_all_alts to 0.
18309 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
18311 PR rtl-optimization/100328
18312 * doc/invoke.texi (ira-consider-dup-in-all-alts): Document new
18314 * ira.c (ira_get_dup_out_num): Adjust as parameter
18315 param_ira_consider_dup_in_all_alts.
18316 * params.opt (ira-consider-dup-in-all-alts): New.
18317 * ira-conflicts.c (process_regs_for_copy): Add one parameter
18318 single_input_op_has_cstr_p.
18319 (get_freq_for_shuffle_copy): New function.
18320 (add_insn_allocno_copies): Adjust as single_input_op_has_cstr_p.
18321 * ira-int.h (ira_get_dup_out_num): Add one bool parameter.
18323 2021-07-05 Jeff Law <jeffreyalaw@gmail.com>
18325 * config/h8300/shiftrotate.md (shift-by-variable patterns): Update to
18326 generate condition code aware RTL directly.
18328 2021-07-05 Andrew Pinski <apinski@marvell.com>
18330 PR tree-optimization/101039
18331 * match.pd (A CMP 0 ? A : -A): New patterns.
18332 * tree-ssa-phiopt.c (abs_replacement): Delete function.
18333 (tree_ssa_phiopt_worker): Don't call abs_replacement.
18334 Update comment about abs_replacement.
18336 2021-07-05 Andrew Pinski <apinski@marvell.com>
18338 * tree-ssa-phiopt.c (gimple_simplify_phiopt):
18339 If "A ? B : C" fails to simplify, try "(!A) ? C : B".
18341 2021-07-05 Andrew Pinski <apinski@marvell.com>
18343 * tree-ssa-phiopt.c (match_simplify_replacement):
18344 Add early_p argument. Call gimple_simplify_phiopt
18345 instead of gimple_simplify.
18346 (tree_ssa_phiopt_worker): Update call to
18347 match_simplify_replacement and allow unconditionally.
18348 (phiopt_early_allow): New function.
18349 (gimple_simplify_phiopt): New function.
18351 2021-07-05 Andrew Pinski <apinski@marvell.com>
18353 PR middle-end/101237
18354 * fold-const.c (negate_expr_p): Remove call to element_mode
18355 and TREE_MODE/TREE_TYPE when calling HONOR_SIGNED_ZEROS,
18356 HONOR_SIGN_DEPENDENT_ROUNDING, and HONOR_SNANS.
18357 (fold_negate_expr_1): Likewise.
18358 (const_unop): Likewise.
18359 (fold_cond_expr_with_comparison): Likewise.
18360 (fold_binary_loc): Likewise.
18361 (fold_ternary_loc): Likewise.
18362 (tree_call_nonnegative_warnv_p): Likewise.
18363 * match.pd (-(A + B) -> (-B) - A): Likewise.
18365 2021-07-05 Iain Sandoe <iain@sandoe.co.uk>
18367 * configure.ac: Handle --with-dsymutil in the same way as we
18368 do for the assembler and linker. (DEFAULT_DSYMUTIL): New.
18369 Extract the type and version for the dsymutil configured or
18370 found by the default searches.
18371 * config.in: Regenerated.
18372 * configure: Regenerated.
18373 * collect2.c (do_dsymutil): Handle locating dsymutil in the
18374 same way as for the assembler and linker.
18375 * config/darwin.h (DSYMUTIL): Delete.
18376 * gcc.c: Report a configured dsymutil correctly.
18377 * exec-tool.in: Allow for dsymutil.
18379 2021-07-05 Uroš Bizjak <ubizjak@gmail.com>
18381 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
18382 Handle V4QI and V2HI modes.
18383 (expand_vec_perm_blend): Allow 4-byte vector modes with TARGET_SSE4_1.
18384 Handle V4QI mode. Emit mmx_pblendvb32 for 4-byte modes.
18385 (expand_vec_perm_pshufb): Rewrite to use switch statemets.
18386 Handle 4-byte dual operands with TARGET_XOP and single operands
18387 with TARGET_SSSE3. Emit mmx_ppermv32 for TARGET_XOP and
18388 mmx_pshufbv4qi3 for TARGET_SSSE3.
18389 (expand_vec_perm_pblendv): Allow 4-byte vector modes with TARGET_SSE4_1.
18390 (expand_vec_perm_interleave2): Allow 4-byte vector modes.
18391 (expand_vec_perm_pshufb2): Allow 4-byte vector modes with TARGET_SSSE3.
18392 (expand_vec_perm_even_odd_1): Handle V4QI mode.
18393 (expand_vec_perm_broadcast_1): Handle V4QI mode.
18394 (ix86_vectorize_vec_perm_const): Handle V4QI mode.
18395 * config/i386/mmx.md (mmx_ppermv32): New insn pattern.
18396 (mmx_pshufbv4qi3): Ditto.
18397 (*mmx_pblendw32): Ditto.
18398 (*mmx_pblendw64): Rename from *mmx_pblendw.
18399 (mmx_punpckhbw_low): New insn_and_split pattern.
18400 (mmx_punpcklbw_low): Ditto.
18402 2021-07-05 Richard Biener <rguenther@suse.de>
18404 * tree-vect-loop-manip.c (vect_loop_versioning): Do not
18405 set LOOP_C_INFINITE on the vectorized loop.
18407 2021-07-05 Richard Biener <rguenther@suse.de>
18409 PR middle-end/101291
18410 * cfgloopmanip.c (loop_version): Set the loop copy of the
18411 versioned loop to the new loop.
18413 2021-07-04 Iain Sandoe <iain@sandoe.co.uk>
18416 * config.gcc: Ensure that Darwin biarch definitions are
18417 added before i386.h.
18418 * config/i386/darwin.h (TARGET_64BIT): Remove.
18419 (PR80556_WORKAROUND): New.
18420 (REAL_LIBGCC_SPEC): Amend to use PR80556_WORKAROUND.
18421 (DARWIN_SUBARCH_SPEC): New.
18422 * config/i386/darwin32-biarch.h (TARGET_64BIT_DEFAULT,
18423 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
18424 (REAL_LIBGCC_SPEC): Remove.
18425 * config/i386/darwin64-biarch.h (TARGET_64BIT_DEFAULT,
18426 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
18427 (REAL_LIBGCC_SPEC): Remove.
18429 2021-07-03 H.J. Lu <hjl.tools@gmail.com>
18431 PR middle-end/101294
18432 * expr.c (store_constructor): Don't use vec_duplicate on vector.
18434 2021-07-02 Martin Sebor <msebor@redhat.com>
18436 PR middle-end/98871
18437 PR middle-end/98512
18438 * diagnostic.c (get_any_inlining_info): New.
18439 (update_effective_level_from_pragmas): Handle inlining context.
18440 (diagnostic_enabled): Same.
18441 (diagnostic_report_diagnostic): Same.
18442 * diagnostic.h (struct diagnostic_info): Add ctor.
18443 (struct diagnostic_context): Add new member.
18444 * tree-diagnostic.c (set_inlining_locations): New.
18445 (tree_diagnostics_defaults): Set new callback pointer.
18447 2021-07-02 Peter Bergner <bergner@linux.ibm.com>
18449 * config/rs6000/rs6000-builtin.def (BU_MMA_PAIR_LD, BU_MMA_PAIR_ST):
18451 (__builtin_vsx_lxvp, __builtin_vsx_stxvp): New built-ins.
18452 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Expand
18453 lxvp and stxvp built-ins.
18454 (mma_init_builtins): Handle lxvp and stxvp built-ins.
18455 (builtin_function_type): Likewise.
18456 * doc/extend.texi (__builtin_vsx_lxvp, __builtin_mma_stxvp): Document.
18458 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
18460 * config/h8300/h8300-protos.h (compute_a_shift_cc): Accept
18461 additional argument for the code.
18462 * config/h8300/h8300.c (compute_a_shift_cc): Accept additional
18463 argument for the code. Just return if the ZN bits are useful or
18464 not rather than the old style CC_* enums.
18465 * config/h8300/shiftrotate.md (shiftqi_noscratch): Move before
18466 more generic shiftqi patterns.
18467 (shifthi_noscratch, shiftsi_noscratch): Similarly.
18468 (shiftqi_noscratch_set_flags): New pattern.
18469 (shifthi_noscratch_set_flags, shiftsi_noscratch_set_flags): Likewise.
18471 2021-07-02 Andrew MacLeod <amacleod@redhat.com>
18473 PR tree-optimization/101223
18474 * range-op.cc (build_lt): Add -1 for signed values.
18475 (built_gt): Subtract -1 for signed values.
18477 2021-07-02 David Faust <david.faust@oracle.com>
18479 * btfout.c (get_btf_kind): Support BTF_KIND_FLOAT.
18480 (btf_asm_type): Likewise.
18482 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
18484 * config/h8300/h8300-protos.h (output_a_shift): Make first argument
18485 an array of rtx rather than a pointer to rtx. Add code argument.
18486 (compute_a_shift_length): Similarly.
18487 * config/h8300/h8300.c (h8300_shift_costs): Adjust now that the
18488 shift itself isn't an operand. Create dummy operand[0] to carry
18489 a mode and pass a suitable rtx code to compute_a_shift_length.
18490 (get_shift_alg): Adjust operand number of clobber in output templates.
18491 (output_a_shift): Make first argument an array of rtx rather than
18492 a pointer to rtx. Add code argument for the type of shift.
18493 Adjust now that the shift itself is no longer an operand.
18494 (compute_a_shift_length): Similarly.
18495 * config/h8300/shiftrotate.md (shiftqi, shifthi, shiftsi): Use an
18496 iterator rather than nshift_operator.
18497 (shiftqi_noscratch, shifthi_noscratch, shiftsi_noscratch): Likewise.
18498 (shiftqi_clobber_flags): Adjust to API changes in output_a_shift
18499 and compute_a_shift_length.
18500 (shiftqi_noscratch_clobber_flags): Likewise.
18501 (shifthi_noscratch_clobber_flags): Likewise.
18502 (shiftsi_noscratch_clobber_flags): Likewise.
18504 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
18507 * config/darwin.h (DSYMUTIL_SPEC): Do not try to run
18508 dsymutil for BTF/CTF.
18510 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
18513 * config/darwin.h (CTF_INFO_SECTION_NAME): Update the
18514 segment to include BTF.
18515 (BTF_INFO_SECTION_NAME): New.
18517 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
18519 * config/m32r/m32r-protos.h (call_operand): Adjust return type.
18520 (small_data_operand, memreg_operand, small_insn_p): Likewise.
18521 * config/m32r/m32r.c (call_operand): Adjust return type.
18522 (small_data_operand, memreg_operand): Likewise.
18524 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
18526 * config/frv/frv-protos.h (integer_register_operand): Adjust return
18528 (frv_load_operand, gpr_or_fpr_operand, gpr_no_subreg_operand): Likewise.
18529 (fpr_or_int6_operand, gpr_or_int_operand); Likewise.
18530 (gpr_or_int12_operand, gpr_or_int10_operand); Likewise.
18531 (move_source_operand, move_destination_operand): Likewise.
18532 (condexec_source_operand, condexec_dest_operand): Likewise.
18533 (lr_operand, gpr_or_memory_operand, fpr_or_memory_operand): Likewise.
18534 (reg_or_0_operand, fcc_operand, icc_operand, cc_operand): Likewise.
18535 (fcr_operand, icr_operand, cr_operand, call_operand): Likewise.
18536 (fpr_operand, even_reg_operand, odd_reg_operand): Likewise.
18537 (even_gpr_operand, odd_gpr_operand, quad_fpr_operand): Likewise.
18538 (even_fpr_operand, odd_fpr_operand): Likewise.
18539 (dbl_memory_one_insn_operand, dbl_memory_two_insn_operand): Likewise.
18540 (int12_operand, int6_operand, int5_operand, uint5_operand): Likewise.
18541 (uint4_operand, uint1_operand, int_2word_operand): Likewise
18542 (upper_int16_operand, uint16_operand, symbolic_operand): Likewise.
18543 (relational_operator, float_relational_operator): Likewise.
18544 (ccr_eqne_operator, minmax_operator): Likewise.
18545 (condexec_si_binary_operator, condexec_si_media_operator): Likewise.
18546 (condexec_si_divide_operator, condexec_si_unary_operator): Likewise.
18547 (condexec_sf_conv_operator, condexec_sf_add_operator): Likewise.
18548 (intop_compare_operator, acc_operand, even_acc_operand): Likewise.
18549 (quad_acc_operand, accg_operand): Likewise.
18551 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
18553 * config/stormy16/stormy16-protos.h (xstormy16_below_100_symbol): Change
18554 return type to a bool.
18555 (nonimmediate_nonstack_operand): Likewise.
18556 (xstormy16_splittable_below100_operand): Likewise.
18557 * config/stormy16/stormy16.c (xstormy16_below_100_symbol): Fix
18559 (xstormy16_splittable_below100_operand): Likewise.
18561 2021-07-02 Richard Biener <rguenther@suse.de>
18563 PR tree-optimization/101293
18564 * tree-ssa-loop-im.c (mem_ref_hasher::equal): Compare MEM_REF bases
18565 with combined offsets.
18566 (gather_mem_refs_stmt): Hash MEM_REFs as if their offset were
18567 combined with the rest of the offset.
18569 2021-07-02 Eric Botcazou <ebotcazou@adacore.com>
18571 * config/i386/i386.c (asm_preferred_eh_data_format): Always use the
18572 PIC encodings for PE-COFF targets.
18574 2021-07-02 Jakub Jelinek <jakub@redhat.com>
18577 * config/i386/i386-expand.c (ix86_broadcast_from_integer_constant):
18578 Return nullptr for TImode inner mode.
18580 2021-07-02 Richard Biener <rguenther@suse.de>
18582 PR tree-optimization/101280
18583 PR tree-optimization/101173
18584 * gimple-loop-interchange.cc
18585 (tree_loop_interchange::valid_data_dependences): Properly
18586 guard all dependence checks with DDR_REVERSED_P or its
18589 2021-07-02 Hongyu Wang <hongyu.wang@intel.com>
18591 * config/i386/i386-expand.c (ix86_expand_builtin):
18592 Add branch to clear odata when ZF is set for asedecenc_expand
18593 and wideaesdecenc_expand.
18595 2021-07-02 Eugene Rozenfeld <erozen@microsoft.com>
18597 * config/i386/gcc-auto-profile: regenerate
18599 2021-07-02 liuhongt <hongtao.liu@intel.com>
18601 * config/i386/sse.md (trunc<mode><pmov_dst_4>2): Refined to ..
18602 (trunc<mode><pmov_dst_4_lower>2): this.
18604 2021-07-01 David Malcolm <dmalcolm@redhat.com>
18606 * diagnostic.h (diagnostic_context::m_file_cache): New field.
18607 * input.c (class fcache): Rename to...
18608 (class file_cache_slot): ...this, making most members private and
18609 prefixing fields with "m_".
18610 (file_cache_slot::get_file_path): New accessor.
18611 (file_cache_slot::get_use_count): New accessor.
18612 (file_cache_slot::missing_trailing_newline_p): New accessor.
18613 (file_cache_slot::inc_use_count): New.
18614 (fcache_buffer_size): Move to...
18615 (file_cache_slot::buffer_size): ...here.
18616 (fcache_line_record_size): Move to...
18617 (file_cache_slot::line_record_size): ...here.
18618 (fcache_tab): Delete, in favor of global_dc->m_file_cache.
18619 (fcache_tab_size): Move to file_cache::num_file_slots.
18620 (diagnostic_file_cache_init): Update for move of fcache_tab
18621 to global_dc->m_file_cache.
18622 (diagnostic_file_cache_fini): Likewise.
18623 (lookup_file_in_cache_tab): Convert to...
18624 (file_cache::lookup_file): ...this.
18625 (diagnostics_file_cache_forcibly_evict_file): Update for move of
18626 fcache_tab to global_dc->m_file_cache, moving most of
18627 implementation to...
18628 (file_cache::forcibly_evict_file): ...this new function and...
18629 (file_cache_slot::evict): ...this new function.
18630 (evicted_cache_tab_entry): Convert to...
18631 (file_cache::evicted_cache_tab_entry): ...this.
18632 (add_file_to_cache_tab): Convert to...
18633 (file_cache::add_file): ...this, moving bulk of implementation
18635 (file_cache_slot::create): ..this new function.
18636 (file_cache::file_cache): New.
18637 (file_cache::~file_cache): New.
18638 (lookup_or_add_file_to_cache_tab): Convert to...
18639 (file_cache::lookup_or_add_file): ..this new function.
18640 (fcache::fcache): Rename to...
18641 (file_cache_slot::file_cache_slot): ...this, adding "m_" prefixes
18643 (fcache::~fcache): Rename to...
18644 (file_cache_slot::~file_cache_slot): ...this, adding "m_" prefixes
18646 (needs_read): Convert to...
18647 (file_cache_slot::needs_read_p): ...this.
18648 (needs_grow): Convert to...
18649 (file_cache_slot::needs_grow_p): ...this.
18650 (maybe_grow): Convert to...
18651 (file_cache_slot::maybe_grow): ...this.
18652 (read_data): Convert to...
18653 (file_cache_slot::read_data): ...this.
18654 (maybe_read_data): Convert to...
18655 (file_cache_slot::maybe_read_data): ...this.
18656 (get_next_line): Convert to...
18657 (file_cache_slot::get_next_line): ...this.
18658 (goto_next_line): Convert to...
18659 (file_cache_slot::goto_next_line): ...this.
18660 (read_line_num): Convert to...
18661 (file_cache_slot::read_line_num): ...this.
18662 (location_get_source_line): Update for moving of globals to
18663 global_dc->m_file_cache.
18664 (location_missing_trailing_newline): Likewise.
18665 * input.h (class file_cache_slot): New forward decl.
18666 (class file_cache): New.
18668 2021-07-01 Michael Meissner <meissner@linux.ibm.com>
18670 * config/rs6000/rs6000.c (rs6000_maybe_emit_fp_cmove): Add IEEE
18671 128-bit floating point conditional move support.
18672 (have_compare_and_set_mask): Add IEEE 128-bit floating point
18674 * config/rs6000/rs6000.md (mov<mode>cc, IEEE128 iterator): New insn.
18675 (mov<mode>cc_p10, IEEE128 iterator): New insn.
18676 (mov<mode>cc_invert_p10, IEEE128 iterator): New insn.
18677 (fpmask<mode>, IEEE128 iterator): New insn.
18678 (xxsel<mode>, IEEE128 iterator): New insn.
18680 2021-07-01 Iain Sandoe <iain@sandoe.co.uk>
18683 * config/darwin.h (CTF_INFO_SECTION_NAME): New.
18685 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
18687 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
18689 * config/i386/i386-protos.h (ix86_expand_vector_init_duplicate):
18691 * config/i386/sse.md (INT_BROADCAST_MODE): New mode iterator.
18692 (vec_duplicate<mode>): New expander.
18694 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
18697 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
18699 (ix86_byte_broadcast): New function.
18700 (ix86_convert_const_wide_int_to_broadcast): Likewise.
18701 (ix86_expand_move): Convert CONST_WIDE_INT to broadcast if mode
18702 size is 16 bytes or bigger.
18703 (ix86_broadcast_from_integer_constant): New function.
18704 (ix86_expand_vector_move): Convert CONST_WIDE_INT and CONST_VECTOR
18705 to broadcast if mode size is 16 bytes or bigger.
18706 * config/i386/i386-protos.h (ix86_gen_scratch_sse_rtx): New
18708 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): New function.
18710 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
18712 * config/i386/predicates.md (ix86_endbr_immediate_operand):
18713 Return true/false instead of 1/0.
18714 (movq_parallel): Ditto.
18716 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
18718 * recog.c (general_operand): Return true/false instead of 1/0.
18719 (register_operand): Ditto.
18720 (immediate_operand): Ditto.
18721 (const_int_operand): Ditto.
18722 (const_scalar_int_operand): Ditto.
18723 (const_double_operand): Ditto.
18724 (push_operand): Ditto.
18725 (pop_operand): Ditto.
18726 (memory_operand): Ditto.
18727 (indirect_operand): Ditto.
18729 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
18731 * genpreds.c (write_predicate_subfunction):
18732 Change the type of written subfunction to bool.
18733 (write_one_predicate_function):
18734 Change the type of written function to bool.
18735 (write_tm_preds_h): Ditto.
18736 * recog.h (*insn_operand_predicate_fn): Change the type to bool.
18737 * recog.c (general_operand): Change the type to bool.
18738 (address_operand): Ditto.
18739 (register_operand): Ditto.
18740 (pmode_register_operand): Ditto.
18741 (scratch_operand): Ditto.
18742 (immediate_operand): Ditto.
18743 (const_int_operand): Ditto.
18744 (const_scalar_int_operand): Ditto.
18745 (const_double_operand): Ditto.
18746 (nonimmediate_operand): Ditto.
18747 (nonmemory_operand): Ditto.
18748 (push_operand): Ditto.
18749 (pop_operand): Ditto.
18750 (memory_operand): Ditto.
18751 (indirect_operand): Ditto.
18752 (ordered_comparison_operator): Ditto.
18753 (comparison_operator): Ditto.
18754 * config/i386/i386-expand.c (ix86_expand_sse_cmp):
18755 Change the type of indirect predicate function to bool.
18756 * config/rs6000/rs6000.c (easy_vector_constant):
18757 Change the type to bool.
18758 * config/mips/mips-protos.h (m16_based_address_p):
18759 Change the type of operand 3 to bool.
18761 2021-07-01 Richard Biener <rguenther@suse.de>
18763 PR tree-optimization/101280
18764 PR tree-optimization/101173
18765 * gimple-loop-interchange.cc
18766 (tree_loop_interchange::valid_data_dependences): Revert
18767 previous change and instead correctly handle DDR_REVERSED_P
18770 2021-07-01 Richard Biener <rguenther@suse.de>
18772 PR tree-optimization/101278
18773 * tree-ssa-dse.c (dse_classify_store): First check for
18774 uses, then ignore stmt for chaining purposes.
18776 2021-07-01 Richard Biener <rguenther@suse.de>
18778 PR tree-optimization/100778
18779 * tree-vect-slp.c (vect_schedule_slp_node): Do not place trapping
18780 vectorized ops ahead of their scalar BB.
18782 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
18785 * config/i386/i386.md (*nabs<dwi>2_doubleword):
18786 New insn_and_split pattern.
18787 (*nabs<dwi>2_1): Ditto.
18788 * config/i386/i386-features.c
18789 (general_scalar_chain::compute_convert_gain):
18790 Handle (NEG (ABS (...))) RTX. Rewrite src code
18791 scanner as switch statement.
18792 (general_scalar_chain::convert_insn):
18793 Handle (NEG (ABS (...))) RTX.
18794 (general_scalar_to_vector_candidate_p):
18795 Detect (NEG (ABS (...))) RTX. Reorder case statements
18796 for (AND (NOT (...) ...)) fallthrough.
18798 2021-07-01 Richard Biener <rguenther@suse.de>
18800 PR tree-optimization/101178
18801 * tree-vect-slp.c (slpg_vertex::materialize): Remove.
18802 (slpg::perm_in): Add.
18803 (slpg::get_perm_in): Remove.
18804 (slpg::get_perm_materialized): Add.
18805 (vect_optimize_slp): Handle VEC_PERM nodes more optimally
18806 during permute propagation and materialization.
18808 2021-07-01 Jakub Jelinek <jakub@redhat.com>
18811 * dwarf2out.c (loc_list_from_tree_1): Handle COMPOUND_LITERAL_EXPR.
18813 2021-07-01 Jakub Jelinek <jakub@redhat.com>
18815 PR middle-end/94366
18816 * omp-low.c (lower_rec_input_clauses): Rename is_fp_and_or to
18817 is_truth_op, set it for TRUTH_*IF_EXPR regardless of new_var's type,
18818 use boolean_type_node instead of integer_type_node as NE_EXPR type.
18819 (lower_reduction_clauses): Likewise.
18821 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
18823 * config/gcn/gcn.c: Include dwarf2.h.
18824 (gcn_addr_space_debug): New function.
18825 (TARGET_ADDR_SPACE_DEBUG): New hook.
18827 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
18829 * common/config/gcn/gcn-common.c
18830 (gcn_option_optimization_table): Change OPT_fomit_frame_pointer to -O3.
18831 * config/gcn/gcn.c (gcn_expand_prologue): Prefer the frame pointer
18833 (gcn_expand_prologue): Prefer the frame pointer when emitting CFI.
18834 (gcn_frame_pointer_rqd): New function.
18835 (TARGET_FRAME_POINTER_REQUIRED): New hook.
18837 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
18839 * config/gcn/gcn.c (move_callee_saved_registers): Emit CFI notes for
18840 prologue register saves.
18841 (gcn_debug_unwind_info): Use UI_DWARF2.
18842 (gcn_dwarf_register_number): Map DWARF_LINK_REGISTER to DWARF PC.
18843 (gcn_dwarf_register_span): DWARF_LINK_REGISTER doesn't span.
18844 * config/gcn/gcn.h: (DWARF_FRAME_RETURN_COLUMN): New define.
18845 (DWARF_LINK_REGISTER): New define.
18846 (FIRST_PSEUDO_REGISTER): Increment.
18847 (FIXED_REGISTERS): Add entry for DWARF_LINK_REGISTER.
18848 (CALL_USED_REGISTERS): Likewise.
18849 (REGISTER_NAMES): Likewise.
18851 2021-06-30 Richard Biener <rguenther@suse.de>
18853 PR tree-optimization/101267
18854 * tree-vect-stmts.c (vect_check_scalar_mask): Adjust
18855 API and use SLP compatible interface of vect_is_simple_use.
18856 Reject not vectorized SLP defs for callers that do not support
18858 (vect_check_store_rhs): Handle masked stores and pass down
18859 the appropriate operator index.
18860 (vectorizable_call): Adjust.
18861 (vectorizable_store): Likewise.
18862 (vectorizable_load): Likewise. Handle SLP pecularity of
18864 (vect_is_simple_use): Remove special-casing of masked stores.
18866 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
18868 * common.opt (foffload): Remove help as Driver only.
18869 * gcc.c (display_help): Add -foffload.
18871 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
18873 * gcc.c (close_at_file, execute): Replace alloca by XALLOCAVEC.
18874 (check_offload_target_name): Fix splitting OFFLOAD_TARGETS into
18875 a candidate list; better inform no offload target is configured
18876 and fix hint extraction when passed target is not '\0' at [len].
18877 * common.opt (foffload): Add tailing '.'.
18878 (foffload-options): Likewise; fix flag name in the help string.
18880 2021-06-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
18883 * config/arm/arm_neon.h: Move vabs intrinsics before vcage_f32.
18884 (vcage_f32): Gate comparison on __FAST_MATH__.
18885 (vcageq_f32): Likewise.
18886 (vcale_f32): Likewise.
18887 (vcaleq_f32): Likewise.
18888 (vcagt_f32): Likewise.
18889 (vcagtq_f32): Likewise.
18890 (vcalt_f32): Likewise.
18891 (vcaltq_f32): Likewise.
18892 (vcage_f16): Likewise.
18893 (vcageq_f16): Likewise.
18894 (vcale_f16): Likewise.
18895 (vcaleq_f16): Likewise.
18896 (vcagt_f16): Likewise.
18897 (vcagtq_f16): Likewise.
18898 (vcalt_f16): Likewise.
18899 (vcaltq_f16): Likewise.
18901 2021-06-30 Richard Biener <rguenther@suse.de>
18903 PR tree-optimization/101264
18904 * tree-vect-slp.c (vect_optimize_slp): Propagate the
18905 computed perm_in to all "any" permute successors
18906 we cannot de-duplicate immediately.
18908 2021-06-30 liuhongt <hongtao.liu@intel.com>
18911 * config/i386/sse.md
18912 (avx512f_sfixupimm<mode><sd_maskz_name><round_saeonly_name>):
18914 (avx512f_sfixupimm<mode><maskz_scalar_name><round_saeonly_name>):
18916 (avx512f_sfixupimm<mode>_mask<round_saeonly_name>"): Refined.
18917 * config/i386/subst.md (maskz_scalar): New define_subst.
18918 (maskz_scalar_name): New subst_attr.
18919 (maskz_scalar_op5): Ditto.
18920 (round_saeonly_maskz_scalar_op5): Ditto.
18921 (round_saeonly_maskz_scalar_operand5): Ditto.
18923 2021-06-30 David Edelsohn <dje.gcc@gmail.com>
18925 * config/rs6000/rs6000.c (rs6000_xcoff_section_type_flags):
18926 Increase code CSECT alignment to at least 32 bytes.
18927 * config/rs6000/xcoff.h (TEXT_SECTION_ASM_OP): Add 32 byte
18928 alignment designation.
18930 2021-06-29 Sergei Trofimovich <siarheit@google.com>
18932 * doc/generic.texi: Fix s/net yet/not yet/ typo.
18934 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
18936 PR tree-optimization/101254
18937 * range-op.cc (operator_minus::op1_op2_relation_effect): Check for
18938 wrapping/non-wrapping when setting the result range.
18940 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
18942 * value-query.cc (gimple_range_global): Allow phis.
18944 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
18946 * vr-values.c (vr_values::vrp_stmt_computes_nonzero): Use stmt.
18947 (simplify_using_ranges::op_with_boolean_value_range_p): Add a
18948 statement for location context.
18949 (check_for_binary_op_overflow): Ditto.
18950 (simplify_using_ranges::get_vr_for_comparison): Ditto.
18951 (simplify_using_ranges::compare_name_with_value): Ditto.
18952 (simplify_using_ranges::compare_names): Ditto.
18953 (vrp_evaluate_conditional_warnv_with_ops_using_ranges): Ditto.
18954 (simplify_using_ranges::simplify_truth_ops_using_ranges): Ditto.
18955 (simplify_using_ranges::simplify_min_or_max_using_ranges): Ditto.
18956 (simplify_using_ranges::simplify_internal_call_using_ranges): Ditto.
18957 (simplify_using_ranges::two_valued_val_range_p): Ditto.
18958 (simplify_using_ranges::simplify): Ditto.
18959 * vr-values.h: Adjust prototypes.
18961 2021-06-29 Uroš Bizjak <ubizjak@gmail.com>
18964 * config/i386/mmx.md (vec_addsubv2sf3): New insn pattern.
18966 2021-06-29 Julian Brown <julian@codesourcery.com>
18968 * config/gcn/gcn.c (gcn_init_libfuncs): New function.
18969 (TARGET_INIT_LIBFUNCS): Define target hook using above function.
18970 * config/gcn/gcn.h (UNITS_PER_WORD): Define to 8 for IN_LIBGCC2, 4
18972 (LIBGCC2_UNITS_PER_WORD, BITS_PER_WORD): Remove definitions.
18973 (MAX_FIXED_MODE_SIZE): Change to 128.
18975 2021-06-29 Julian Brown <julian@codesourcery.com>
18977 * config/gcn/gcn.md (UNSPEC_FLBIT_INT): New unspec constant.
18978 (s_mnemonic): Add clrsb.
18979 (gcn_flbit<mode>_int): Add insn pattern for SImode/DImode.
18980 (clrsb<mode>2): Add expander for SImode/DImode.
18982 2021-06-29 Julian Brown <julian@codesourcery.com>
18984 * config/gcn/gcn.md (<su>mulsidi3, <su>mulsidi3_reg, <su>mulsidi3_imm,
18985 muldi3): Add patterns.
18987 2021-06-29 Julian Brown <julian@codesourcery.com>
18989 * config/gcn/gcn.md (<su>mulsi3_highpart): Change to expander.
18990 (<su>mulsi3_highpart_reg, <su>mulsi3_highpart_imm): New patterns.
18992 2021-06-29 Julian Brown <julian@codesourcery.com>
18994 * config/gcn/gcn.md (mulsi3): Make s_mulk_i32 variant clobber SCC.
18996 2021-06-29 Joseph Myers <joseph@codesourcery.com>
18998 * btfout.c, ctfout.c: Include "memmodel.h".
19000 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19002 * gcc.c (check_offload_target_name): Cast len argument to
19003 %q.*s to 'int'; avoid -Wstringop-truncation warning.
19005 2021-06-29 Richard Biener <rguenther@suse.de>
19007 * tree-vect-slp.c (vect_optimize_slp): Forward propagate
19008 to "any" permute nodes and relax "any" permute proapgation
19009 during iterative backward propagation.
19011 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19014 * common.opt (-foffload=): Update description.
19015 (-foffload-options=): New.
19016 * doc/invoke.texi (C Language Options): Document
19017 -foffload and -foffload-options.
19018 * gcc.c (check_offload_target_name): New, split off from
19019 handle_foffload_option.
19020 (check_foffload_target_names): New.
19021 (handle_foffload_option): Handle -foffload=default.
19022 (driver_handle_option): Update for -foffload-options.
19023 * lto-opts.c (lto_write_options): Use -foffload-options
19024 instead of -foffload.
19025 * lto-wrapper.c (merge_and_complain, append_offload_options):
19027 * opts.c (common_handle_option): Likewise.
19029 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19031 * doc/invoke.texi (C Language Options): Sort options
19032 alphabetically in optlist and also the description itself.
19033 Remove leftover -fallow-single-precision from and add missing
19034 -fgnu-tm to the optlist.
19036 2021-06-29 Richard Biener <rguenther@suse.de>
19038 * tree-vect-slp.c (slpg_vertex::visited): Remove.
19039 (vect_slp_perms_eq): Handle -1 permutes.
19040 (vect_optimize_slp): Rewrite permute propagation.
19042 2021-06-29 Jakub Jelinek <jakub@redhat.com>
19045 * match.pd ((intptr_t)x eq/ne CST to x eq/ne (typeof x) CST): Don't
19046 perform the optimization in GENERIC when sanitizing and x has a
19049 2021-06-29 Richard Biener <rguenther@suse.de>
19051 PR tree-optimization/101242
19052 * tree-vect-slp.c (vect_slp_build_vertices): Force-add
19053 PHIs with not represented initial values as leafs.
19055 2021-06-29 Jan-Benedict Glaw <jbglaw@getslash.de>
19057 * config/pdp11/pdp11.h (ASM_OUTPUT_SKIP): Fix signedness warning.
19058 * config/pdp11/pdp11.c (pdp11_asm_print_operand_punct_valid_p): Remove
19059 "register" keyword.
19060 (pdp11_initial_elimination_offset) Remove unused variable.
19061 (pdp11_cmp_length) Ditto.
19062 (pdp11_insn_cost): Ditto, and fix signedness warning.
19064 2021-06-29 David Edelsohn <dje.gcc@gmail.com>
19066 * btfout.c: Include tm_p.h.
19069 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19071 * config/bpf/bpf.c (bpf_expand_prologue): Do not mark insns as
19073 (bpf_expand_epilogue): Likewise.
19074 * config/bpf/bpf.h (DWARF2_FRAME_INFO): Define to 0.
19075 Do not define DBX_DEBUGGING_INFO.
19077 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19079 * doc/invoke.texi: Document the CTF and BTF debug info options.
19081 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19082 David Faust <david.faust@oracle.com>
19083 Jose E. Marchesi <jose.marchesi@oracle.com>
19084 Weimin Pan <weimin.pan@oracle.com>
19086 * Makefile.in: Add ctfc.*, ctfout.c and btfout.c files to
19087 GTFILES. Add new object files.
19088 * common.opt: Add CTF and BTF debug info options.
19089 * btfout.c: New file.
19090 * ctfc.c: Likewise.
19091 * ctfc.h: Likewise.
19092 * ctfout.c: Likewise.
19093 * dwarf2ctf.c: Likewise.
19094 * dwarf2ctf.h: Likewise.
19095 * dwarf2cfi.c (dwarf2out_do_frame): Acknowledge CTF_DEBUG and
19097 * dwarf2out.c (dwarf2out_source_line): Likewise.
19098 (dwarf2out_finish): Skip emitting DWARF if CTF or BTF are to
19100 (debug_format_do_cu): New function.
19101 (dwarf2out_early_finish): Traverse DIEs and emit CTF/BTF for
19103 Include dwarf2ctf.c.
19104 * final.c (dwarf2_debug_info_emitted_p): Acknowledge DWARF-based debug
19106 * flag-types.h (enum debug_info_type): Add CTF_DEBUG and BTF_DEBUG.
19107 (CTF_DEBUG): New bitmask.
19108 (BTF_DEBUG): Likewise.
19109 (enum ctf_debug_info_levels): New enum.
19110 * gengtype.c (open_base_files): Handle ctfc.h.
19111 (main): Handle uint32_t type.
19112 * flags.h (btf_debuginfo_p): New definition.
19113 (dwarf_based_debuginfo_p): Likewise.
19114 * opts.c (debug_type_names): Add entries for CTF and BTF.
19115 (btf_debuginfo_p): New function.
19116 (dwarf_based_debuginfo_p): Likewise.
19117 (common_handle_option): Handle -gctfN and -gbtf options.
19118 (set_debug_level): Set CTF_DEBUG, BTF_DEBUG whenever appropriate.
19119 * toplev.c (process_options): Inform the user and ignore -gctfLEVEL if
19122 2021-06-28 Jose E. Marchesi <jose.marchesi@oracle.com>
19124 * dwarf2out.c (AT_class): Function is no longer static.
19125 (AT_int): Likewise.
19126 (AT_unsigned): Likewise.
19127 (AT_loc): Likewise.
19128 (get_AT): Likewise.
19129 (get_AT_string): Likewise.
19130 (get_AT_flag): Likewise.
19131 (get_AT_unsigned): Likewise.
19132 (get_AT_ref): Likewise.
19133 (new_die_raw): Likewise.
19134 (lookup_decl_die): Likewise.
19135 (base_type_die): Likewise.
19136 (add_name_attribute): Likewise.
19137 (add_AT_int): Likewise.
19138 (add_AT_unsigned): Likewise.
19139 (add_AT_loc): Likewise.
19140 (dw_get_die_tag): New function.
19141 (dw_get_die_child): Likewise.
19142 (dw_get_die_sib): Likewise.
19143 (struct dwarf_file_data): Move from here to dwarf2out.h
19144 (struct dw_attr_struct): Likewise.
19145 * dwarf2out.h: Analogous changes.
19147 2021-06-28 Martin Jambor <mjambor@suse.cz>
19150 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
19151 members m_dead_stmts and m_dead_ssas.
19152 * ipa-param-manipulation.c
19153 (ipa_param_body_adjustments::mark_dead_statements): New function.
19154 (ipa_param_body_adjustments::common_initialization): Call it on
19155 all removed but not split parameters.
19156 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
19158 (ipa_param_body_adjustments::modify_call_stmt): Remove arguments that
19160 * tree-inline.c (remap_gimple_stmt): Do not copy dead statements, reset
19161 dead debug statements.
19162 (copy_phis_for_bb): Do not copy dead PHI nodes.
19164 2021-06-28 Martin Jambor <mjambor@suse.cz>
19167 * symtab-clones.h (clone_info): Removed member param_adjustments.
19168 * ipa-param-manipulation.h: Adjust initial comment to reflect how we
19169 deal with pass-through splits now.
19170 (ipa_param_performed_split): Removed.
19171 (ipa_param_adjustments::modify_call): Adjusted parameters.
19172 (class ipa_param_body_adjustments): Adjusted parameters of
19173 register_replacement, modify_gimple_stmt and modify_call_stmt.
19174 (ipa_verify_edge_has_no_modifications): Declare.
19175 (ipa_edge_modifications_finalize): Declare.
19176 * cgraph.c (cgraph_edge::redirect_call_stmt_to_callee): Remove
19177 performed_splits processing, pas only edge to padjs->modify_call,
19178 check that call arguments were not modified if they should not have
19180 * cgraphclones.c (cgraph_node::create_clone): Do not copy performed
19182 * ipa-param-manipulation.c (struct pass_through_split_map): New type.
19183 (ipa_edge_modification_info): Likewise.
19184 (ipa_edge_modification_sum): Likewise.
19185 (ipa_edge_modifications): New edge summary.
19186 (ipa_verify_edge_has_no_modifications): New function.
19187 (transitive_split_p): Removed.
19188 (transitive_split_map): Likewise.
19189 (init_transitive_splits): Likewise.
19190 (ipa_param_adjustments::modify_call): Adjusted to use the new edge
19191 summary instead of performed_splits.
19192 (ipa_param_body_adjustments::register_replacement): Drop dummy
19193 parameter, set base_index of the created ipa_param_body_replacement.
19194 (phi_arg_will_live_p): New function.
19195 (ipa_param_body_adjustments::common_initialization): Do not create
19196 IPA_SRA dummy decls.
19197 (simple_tree_swap_info): Removed.
19198 (remap_split_decl_to_dummy): Likewise.
19199 (record_argument_state_1): New function.
19200 (record_argument_state): Likewise.
19201 (ipa_param_body_adjustments::modify_call_stmt): New parameter
19202 orig_stmt. Do not work with dummy decls, save necessary info about
19203 changes to ipa_edge_modifications.
19204 (ipa_param_body_adjustments::modify_gimple_stmt): New parameter
19205 orig_stmt, pass it to modify_call_stmt.
19206 (ipa_param_body_adjustments::modify_cfun_body): Adjust call to
19207 modify_gimple_stmt.
19208 (ipa_edge_modifications_finalize): New function.
19209 * tree-inline.c (remap_gimple_stmt): Pass original statement to
19210 modify_gimple_stmt.
19211 (copy_phis_for_bb): Do not copy dead PHI nodes.
19212 (expand_call_inline): Do not remap performed_splits.
19213 (update_clone_info): Likewise.
19214 * toplev.c: Include ipa-param-manipulation.h.
19215 (toplev::finalize): Call ipa_edge_modifications_finalize.
19217 2021-06-28 Andrew Pinski <apinski@marvell.com>
19219 * tree-ssa-phiopt.c (replace_phi_edge_with_variable): Duplicate range
19220 info if we're the only things setting the target PHI.
19221 (value_replacement): Don't duplicate range here.
19222 (minmax_replacement): Likewise.
19224 2021-06-28 Richard Biener <rguenther@suse.de>
19226 PR tree-optimization/101229
19227 * gimple-walk.c (gimple_walk_op): Handle PHIs.
19229 2021-06-28 Martin Liska <mliska@suse.cz>
19231 * config/v850/v850.c (construct_dispose_instruction): Allocate
19233 (construct_prepare_instruction): Likewise.
19235 2021-06-28 Martin Liska <mliska@suse.cz>
19237 * config/v850/v850.c (v850_option_override): Build default
19239 (v850_can_inline_p): New. Allow MASK_PROLOG_FUNCTION to be
19240 ignored for inlining.
19241 (TARGET_CAN_INLINE_P): New.
19243 2021-06-28 Richard Biener <rguenther@suse.de>
19245 PR tree-optimization/101207
19246 * tree-vect-slp.c (vect_optimize_slp): Do BB reduction
19247 permute eliding for load permutations properly.
19249 2021-06-28 Richard Biener <rguenther@suse.de>
19251 PR tree-optimization/101173
19252 * gimple-loop-interchange.cc
19253 (tree_loop_interchange::valid_data_dependences): Disallow outer
19254 loop dependence distance of zero.
19256 2021-06-28 liuhongt <hongtao.liu@intel.com>
19259 * config/i386/sse.md (*avx_cmp<mode>3_lt): New
19260 define_insn_and_split.
19261 (*avx_cmp<mode>3_ltint): Ditto.
19262 (*avx2_pcmp<mode>3_3): Ditto.
19263 (*avx2_pcmp<mode>3_4): Ditto.
19264 (*avx2_pcmp<mode>3_5): Ditto.
19266 2021-06-28 liuhongt <hongtao.liu@intel.com>
19268 * config/i386/i386-builtin.def (IX86_BUILTIN_BLENDVPD256,
19269 IX86_BUILTIN_BLENDVPS256, IX86_BUILTIN_PBLENDVB256,
19270 IX86_BUILTIN_BLENDVPD, IX86_BUILTIN_BLENDVPS,
19271 IX86_BUILTIN_PBLENDVB128): Replace icode with
19273 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold blendv
19275 * config/i386/sse.md (*<sse4_1_avx2>_pblendvb_lt_subreg_not):
19276 New pre_reload splitter.
19278 2021-06-27 Andrew Pinski <apinski@marvell.com>
19280 PR middle-end/101230
19281 * fold-const.c (fold_ternary_loc): Check
19282 the return value of invert_tree_comparison.
19284 2021-06-27 David Edelsohn <dje.gcc@gmail.com>
19286 * config.gcc: Add SPDX License Identifier.
19287 (powerpc-ibm-aix789): Default to aix73.h.
19288 (powerpc-ibm-aix7.2.*.*): New stanza.
19289 * config/rs6000/aix72.h: Add SPDX License Identifier.
19290 * config/rs6000/aix73.h: New file.
19292 2021-06-26 Jason Merrill <jason@redhat.com>
19294 * except.c: #include "dwarf2.h" instead of "dwarf2out.h".
19296 2021-06-26 Andrew Pinski <apinski@marvell.com>
19298 * genmatch.c (lower_cond): Copy for_subst_vec
19299 for the simplify also.
19300 (lower): Swap the order for lower_for and lower_cond.
19302 2021-06-26 Andrew Pinski <apinski@marvell.com>
19304 * tree-ssa-phiopt.c (match_simplify_replacement): Reset
19305 flow senatitive info on the moved ssa set.
19307 2021-06-26 Andrew Pinski <apinski@marvell.com>
19309 * fold-const.c (fold_cond_expr_with_comparison):
19310 Exand arg0 into comp_code, arg00, and arg01.
19311 (fold_ternary_loc): Use invert_tree_comparison
19312 instead of fold_invert_truthvalue for the case
19313 where we have A CMP B ? C : A.
19315 2021-06-25 Martin Sebor <msebor@redhat.com>
19317 PR middle-end/101216
19318 * calls.c (maybe_warn_rdwr_sizes): Use the no_warning constant.
19320 2021-06-25 Jeff Law <jeffreyalaw@gmail.com>
19322 * config/h8300/h8300.c (select_cc_mode): Handle ASHIFTRT and LSHIFTRT.
19324 2021-06-25 Richard Biener <rguenther@suse.de>
19326 PR tree-optimization/101202
19327 * tree-vect-slp.c (vect_optimize_slp): Explicitely handle
19330 2021-06-25 Richard Biener <rguenther@suse.de>
19332 * tree-vect-slp-patterns.c (addsub_pattern::build): Copy
19333 STMT_VINFO_REDUC_DEF from the original representative.
19335 2021-06-25 Martin Sebor <msebor@redhat.com>
19337 * builtins.c (warn_string_no_nul): Replace uses of TREE_NO_WARNING,
19338 gimple_no_warning_p and gimple_set_no_warning with
19339 warning_suppressed_p, and suppress_warning.
19341 (maybe_warn_for_bound): Same.
19342 (warn_for_access): Same.
19343 (check_access): Same.
19344 (expand_builtin_strncmp): Same.
19345 (fold_builtin_varargs): Same.
19346 * calls.c (maybe_warn_nonstring_arg): Same.
19347 (maybe_warn_rdwr_sizes): Same.
19348 * cfgexpand.c (expand_call_stmt): Same.
19349 * cgraphunit.c (check_global_declaration): Same.
19350 * fold-const.c (fold_undefer_overflow_warnings): Same.
19351 (fold_truth_not_expr): Same.
19352 (fold_unary_loc): Same.
19353 (fold_checksum_tree): Same.
19354 * gimple-array-bounds.cc (array_bounds_checker::check_array_ref): Same.
19355 (array_bounds_checker::check_mem_ref): Same.
19356 (array_bounds_checker::check_addr_expr): Same.
19357 (array_bounds_checker::check_array_bounds): Same.
19358 * gimple-expr.c (copy_var_decl): Same.
19359 * gimple-fold.c (gimple_fold_builtin_strcpy): Same.
19360 (gimple_fold_builtin_strncat): Same.
19361 (gimple_fold_builtin_stxcpy_chk): Same.
19362 (gimple_fold_builtin_stpcpy): Same.
19363 (gimple_fold_builtin_sprintf): Same.
19364 (fold_stmt_1): Same.
19365 * gimple-ssa-isolate-paths.c (diag_returned_locals): Same.
19366 * gimple-ssa-nonnull-compare.c (do_warn_nonnull_compare): Same.
19367 * gimple-ssa-sprintf.c (handle_printf_call): Same.
19368 * gimple-ssa-store-merging.c (imm_store_chain_info::output_merged_store): Same.
19369 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
19370 * gimple-ssa-warn-restrict.h: Adjust declarations.
19371 (maybe_diag_access_bounds): Replace uses of TREE_NO_WARNING,
19372 gimple_no_warning_p and gimple_set_no_warning with
19373 warning_suppressed_p, and suppress_warning.
19374 (check_call): Same.
19375 (check_bounds_or_overlap): Same.
19376 * gimple.c (gimple_build_call_from_tree): Same.
19377 * gimplify.c (gimplify_return_expr): Same.
19378 (gimplify_cond_expr): Same.
19379 (gimplify_modify_expr_complex_part): Same.
19380 (gimplify_modify_expr): Same.
19381 (gimple_push_cleanup): Same.
19382 (gimplify_expr): Same.
19383 * omp-expand.c (expand_omp_for_generic): Same.
19384 (expand_omp_taskloop_for_outer): Same.
19385 * omp-low.c (lower_rec_input_clauses): Same.
19386 (lower_lastprivate_clauses): Same.
19387 (lower_send_clauses): Same.
19388 (lower_omp_target): Same.
19389 * tree-cfg.c (pass_warn_function_return::execute): Same.
19390 * tree-complex.c (create_one_component_var): Same.
19391 * tree-inline.c (remap_gimple_op_r): Same.
19392 (copy_tree_body_r): Same.
19393 (declare_return_variable): Same.
19394 (expand_call_inline): Same.
19395 * tree-nested.c (lookup_field_for_decl): Same.
19396 * tree-sra.c (create_access_replacement): Same.
19397 (generate_subtree_copies): Same.
19398 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Same.
19399 * tree-ssa-forwprop.c (combine_cond_expr_cond): Same.
19400 * tree-ssa-loop-ch.c (ch_base::copy_headers): Same.
19401 * tree-ssa-loop-im.c (execute_sm): Same.
19402 * tree-ssa-phiopt.c (cond_store_replacement): Same.
19403 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
19404 (handle_builtin_strcpy): Same.
19405 (maybe_diag_stxncpy_trunc): Same.
19406 (handle_builtin_stxncpy_strncat): Same.
19407 (handle_builtin_strcat): Same.
19408 * tree-ssa-uninit.c (get_no_uninit_warning): Same.
19409 (set_no_uninit_warning): Same.
19410 (uninit_undefined_value_p): Same.
19411 (warn_uninit): Same.
19412 (maybe_warn_operand): Same.
19413 * tree-vrp.c (compare_values_warnv): Same.
19414 * vr-values.c (vr_values::extract_range_for_var_from_comparison_expr): Same.
19415 (test_for_singularity): Same.
19416 * gimple.h (warning_suppressed_p): New function.
19417 (suppress_warning): Same.
19418 (copy_no_warning): Same.
19419 (gimple_set_block): Call gimple_set_location.
19420 (gimple_set_location): Call copy_warning.
19422 2021-06-25 Martin Sebor <msebor@redhat.com>
19424 * tree.h (warning_suppressed_at, copy_warning,
19425 warning_suppressed_p, suppress_warning): New functions.
19427 2021-06-25 Martin Sebor <msebor@redhat.com>
19429 * Makefile.in (OBJS-libcommon): Add diagnostic-spec.o.
19430 * gengtype.c (open_base_files): Add diagnostic-spec.h.
19431 * diagnostic-spec.c: New file.
19432 * diagnostic-spec.h: New file.
19433 * tree.h (no_warning, all_warnings, suppress_warning_at): New
19435 * warning-control.cc: New file.
19437 2021-06-25 liuhongt <hongtao.liu@intel.com>
19440 * config/i386/i386.c (x86_order_regs_for_local_alloc):
19443 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
19445 PR tree-optimization/101189
19446 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Pass
19447 LHS range of condition to postfold routine.
19448 (fold_using_range::postfold_gcond_edges): Only process the TRUE or
19449 FALSE edge if the LHS range supports it being taken.
19450 * gimple-range-fold.h (postfold_gcond_edges): Add range parameter.
19452 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
19454 * value-relation.cc (equiv_oracle::dump): Do not dump NULL blocks.
19455 (relation_oracle::find_relation_block): Check correct bitmap.
19456 (relation_oracle::dump): Do not dump NULL blocks.
19458 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
19460 * gimple-range-cache.cc (ranger_cache::propagate_cache): Call
19461 range_on_edge instead of manually calculating.
19463 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
19465 * range-op.cc: Fix comment.
19467 2021-06-24 Uroš Bizjak <ubizjak@gmail.com>
19470 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
19471 Handle V8QI and V4HI modes.
19472 * config/i386/mmx.md (sse4_1_<any_extend:code>v4qiv4hi2):
19474 (sse4_1_<any_extend:code>v4qiv4hi2): Ditto.
19475 (mmxpackmode): New mode attribute.
19476 (vec_pack_trunc_<mmxpackmode:mode>): New expander.
19477 (mmxunpackmode): New mode attribute.
19478 (vec_unpacks_lo_<mmxunpackmode:mode>): New expander.
19479 (vec_unpacks_hi_<mmxunpackmode:mode>): Ditto.
19480 (vec_unpacku_lo_<mmxunpackmode:mode>): Ditto.
19481 (vec_unpacku_hi_<mmxunpackmode:mode>): Ditto.
19482 * config/i386/i386.md (extsuffix): Move from ...
19483 * config/i386/sse.md: ... here.
19485 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
19487 * dwarf2out.c (dwarf2out_assembly_start): Emit .file 0 marker here..
19488 (dwarf2out_finish): ...instead of here.
19490 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
19492 * configure.ac (--gdwarf-5 option): Use objdump instead of readelf.
19493 (working --gdwarf-4/--gdwarf-5 for all sources): Likewise.
19494 (--gdwarf-4 not refusing generated .debug_line): Adjust for Windows.
19495 * configure: Regenerate.
19497 2021-06-24 Richard Biener <rguenther@suse.de>
19499 * config/i386/sse.md (vec_addsubv4df3, vec_addsubv2df3,
19500 vec_addsubv8sf3, vec_addsubv4sf3): Merge into ...
19501 (vec_addsub<mode>3): ... using a new addsub_cst mode attribute.
19503 2021-06-24 Richard Biener <rguenther@suse.de>
19505 * config/i386/sse.md (avx_addsubv4df3): Rename to
19507 (avx_addsubv8sf3): Rename to vec_addsubv8sf3.
19508 (sse3_addsubv2df3): Rename to vec_addsubv2df3.
19509 (sse3_addsubv4sf3): Rename to vec_addsubv4sf3.
19510 * config/i386/i386-builtin.def: Adjust.
19511 * internal-fn.def (VEC_ADDSUB): New internal optab fn.
19512 * optabs.def (vec_addsub_optab): New optab.
19513 * tree-vect-slp-patterns.c (class addsub_pattern): New.
19514 (slp_patterns): Add addsub_pattern.
19515 * tree-vect-slp.c (vect_optimize_slp): Disable propagation
19516 across CFN_VEC_ADDSUB.
19517 * tree-vectorizer.h (vect_pattern::vect_pattern): Make
19519 * doc/md.texi (vec_addsub<mode>3): Document.
19521 2021-06-24 Jakub Jelinek <jakub@redhat.com>
19523 PR middle-end/101170
19524 * df-scan.c (df_ref_record): For paradoxical big-endian SUBREGs
19525 where regno + subreg_regno_offset wraps around use 0 as starting
19528 2021-06-24 Jakub Jelinek <jakub@redhat.com>
19530 PR middle-end/101172
19531 * stor-layout.c (finish_bitfield_representative): If nextf has
19532 error_mark_node type, set repr type to error_mark_node too.
19534 2021-06-24 Ilya Leoshkevich <iii@linux.ibm.com>
19536 * config/s390/s390.c (s390_function_profiler): Ignore labelno
19538 * config/s390/s390.h (NO_PROFILE_COUNTERS): Define.
19540 2021-06-24 Richard Biener <rguenther@suse.de>
19542 * tree-vect-slp.c (vect_optimize_slp): Do not propagate
19543 across operations that have different semantics on different
19546 2021-06-24 Jakub Jelinek <jakub@redhat.com>
19548 * tree.h (OMP_CLAUSE_MAP_IN_REDUCTION): Document meaning for OpenMP.
19549 * gimplify.c (gimplify_scan_omp_clauses): For OpenMP map clauses
19550 with OMP_CLAUSE_MAP_IN_REDUCTION flag partially defer gimplification
19551 of non-decl OMP_CLAUSE_DECL. For OMP_CLAUSE_IN_REDUCTION on
19552 OMP_TARGET user outer_ctx instead of ctx for placeholders and
19553 initializer/combiner gimplification.
19554 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_MAP_IN_REDUCTION
19555 on target constructs.
19556 (lower_rec_input_clauses): Likewise.
19557 (lower_omp_target): Likewise.
19558 * omp-expand.c (expand_omp_target): Temporarily ignore nowait clause
19559 on target if in_reduction is present.
19561 2021-06-24 Kewen Lin <linkw@linux.ibm.com>
19563 * tree-predcom.c (class pcom_worker): New class.
19564 (release_chain): Renamed to...
19565 (pcom_worker::release_chain): ...this.
19566 (release_chains): Renamed to...
19567 (pcom_worker::release_chains): ...this.
19568 (aff_combination_dr_offset): Renamed to...
19569 (pcom_worker::aff_combination_dr_offset): ...this.
19570 (determine_offset): Renamed to...
19571 (pcom_worker::determine_offset): ...this.
19572 (class comp_ptrs): New class.
19573 (split_data_refs_to_components): Renamed to...
19574 (pcom_worker::split_data_refs_to_components): ...this,
19575 and update with class comp_ptrs.
19576 (suitable_component_p): Renamed to...
19577 (pcom_worker::suitable_component_p): ...this.
19578 (filter_suitable_components): Renamed to...
19579 (pcom_worker::filter_suitable_components): ...this.
19580 (valid_initializer_p): Renamed to...
19581 (pcom_worker::valid_initializer_p): ...this.
19582 (find_looparound_phi): Renamed to...
19583 (pcom_worker::find_looparound_phi): ...this.
19584 (add_looparound_copies): Renamed to...
19585 (pcom_worker::add_looparound_copies): ...this.
19586 (determine_roots_comp): Renamed to...
19587 (pcom_worker::determine_roots_comp): ...this.
19588 (determine_roots): Renamed to...
19589 (pcom_worker::determine_roots): ...this.
19590 (single_nonlooparound_use): Renamed to...
19591 (pcom_worker::single_nonlooparound_use): ...this.
19592 (remove_stmt): Renamed to...
19593 (pcom_worker::remove_stmt): ...this.
19594 (execute_pred_commoning_chain): Renamed to...
19595 (pcom_worker::execute_pred_commoning_chain): ...this.
19596 (execute_pred_commoning): Renamed to...
19597 (pcom_worker::execute_pred_commoning): ...this.
19598 (struct epcc_data): New member worker.
19599 (execute_pred_commoning_cbck): Call execute_pred_commoning
19600 with pcom_worker pointer.
19601 (find_use_stmt): Renamed to...
19602 (pcom_worker::find_use_stmt): ...this.
19603 (find_associative_operation_root): Renamed to...
19604 (pcom_worker::find_associative_operation_root): ...this.
19605 (find_common_use_stmt): Renamed to...
19606 (pcom_worker::find_common_use_stmt): ...this.
19607 (combinable_refs_p): Renamed to...
19608 (pcom_worker::combinable_refs_p): ...this.
19609 (reassociate_to_the_same_stmt): Renamed to...
19610 (pcom_worker::reassociate_to_the_same_stmt): ...this.
19611 (stmt_combining_refs): Renamed to...
19612 (pcom_worker::stmt_combining_refs): ...this.
19613 (combine_chains): Renamed to...
19614 (pcom_worker::combine_chains): ...this.
19615 (try_combine_chains): Renamed to...
19616 (pcom_worker::try_combine_chains): ...this.
19617 (prepare_initializers_chain): Renamed to...
19618 (pcom_worker::prepare_initializers_chain): ...this.
19619 (prepare_initializers): Renamed to...
19620 (pcom_worker::prepare_initializers): ...this.
19621 (prepare_finalizers_chain): Renamed to...
19622 (pcom_worker::prepare_finalizers_chain): ...this.
19623 (prepare_finalizers): Renamed to...
19624 (pcom_worker::prepare_finalizers): ...this.
19625 (tree_predictive_commoning_loop): Renamed to...
19626 (pcom_worker::tree_predictive_commoning_loop): ...this, adjust
19627 some calls and remove some cleanup code.
19628 (tree_predictive_commoning): Adjusted to use pcom_worker instance.
19629 (static variable looparound_phis): Remove.
19630 (static variable name_expansions): Remove.
19632 2021-06-24 Richard Biener <rguenther@suse.de>
19634 * tree-vect-slp.c (slpg_vertex): New struct.
19635 (vect_slp_build_vertices): Adjust.
19636 (vect_optimize_slp): Likewise. Maintain an outgoing permute
19637 and a materialized one.
19639 2021-06-24 Richard Biener <rguenther@suse.de>
19641 PR tree-optimization/101105
19642 * tree-vect-data-refs.c (vect_prune_runtime_alias_test_list):
19643 Only ignore steps when they are equal or scalar order is preserved.
19645 2021-06-24 liuhongt <hongtao.liu@intel.com>
19648 * config/i386/i386-expand.c (ix86_expand_vec_interleave):
19649 Adjust comments for ix86_expand_vecop_qihi2.
19650 (ix86_expand_vecmul_qihi): Renamed to ..
19651 (ix86_expand_vecop_qihi2): Adjust function prototype to
19652 support shift operation, add static to definition.
19653 (ix86_expand_vec_shift_qihi_constant): Add static to definition.
19654 (ix86_expand_vecop_qihi): Call ix86_expand_vecop_qihi2 and
19655 ix86_expand_vec_shift_qihi_constant.
19656 * config/i386/i386-protos.h (ix86_expand_vecmul_qihi): Deleted.
19657 (ix86_expand_vec_shift_qihi_constant): Deleted.
19658 * config/i386/sse.md (VI12_256_512_AVX512VL): New mode
19660 (mulv8qi3): Call ix86_expand_vecop_qihi directly, add
19661 condition TARGET_64BIT.
19662 (mul<mode>3): Ditto.
19663 (<insn><mode>3): Ditto.
19664 (vlshr<mode>3): Extend to support avx512 vlshr.
19665 (v<insn><mode>3): New expander for
19667 (v<insn>v8qi3): Ditto.
19668 (vashrv8hi3<mask_name>): Renamed to ..
19669 (vashr<mode>3): And extend to support V16QImode for avx512.
19670 (vashrv16qi3): Deleted.
19671 (vashrv2di3<mask_name>): Extend expander to support avx512
19674 2021-06-23 Dimitar Dimitrov <dimitar@dinux.eu>
19676 * doc/lto.texi (Design Overview): Update that slim objects are
19679 2021-06-23 Aaron Sawdey <acsawdey@linux.ibm.com>
19681 * config/rs6000/rs6000-cpus.def: Take OPTION_MASK_PCREL_OPT out
19682 of OTHER_POWER10_MASKS so it will not be enabled by default.
19684 2021-06-23 Richard Biener <rguenther@suse.de>
19685 Martin Jambor <mjambor@suse.cz>
19687 * tree-inline.c (setup_one_parameter): Set TREE_READONLY of the
19688 param replacement unconditionally. Adjust comment.
19690 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
19692 * Makefile.in (OBJS): Add gimple-range-fold.o
19693 * gimple-range-fold.cc: New.
19694 * gimple-range-fold.h: New.
19695 * gimple-range-gori.cc (gimple_range_calc_op1): Move to here.
19696 (gimple_range_calc_op2): Ditto.
19697 * gimple-range-gori.h: Move prototypes to here.
19698 * gimple-range.cc: Adjust include files.
19699 (fur_source:fur_source): Relocate to gimple-range-fold.cc.
19700 (fur_source::get_operand): Ditto.
19701 (fur_source::get_phi_operand): Ditto.
19702 (fur_source::query_relation): Ditto.
19703 (fur_source::register_relation): Ditto.
19704 (class fur_edge): Ditto.
19705 (fur_edge::fur_edge): Ditto.
19706 (fur_edge::get_operand): Ditto.
19707 (fur_edge::get_phi_operand): Ditto.
19708 (fur_stmt::fur_stmt): Ditto.
19709 (fur_stmt::get_operand): Ditto.
19710 (fur_stmt::get_phi_operand): Ditto.
19711 (fur_stmt::query_relation): Ditto.
19712 (class fur_depend): Relocate to gimple-range-fold.h.
19713 (fur_depend::fur_depend): Relocate to gimple-range-fold.cc.
19714 (fur_depend::register_relation): Ditto.
19715 (fur_depend::register_relation): Ditto.
19716 (class fur_list): Ditto.
19717 (fur_list::fur_list): Ditto.
19718 (fur_list::get_operand): Ditto.
19719 (fur_list::get_phi_operand): Ditto.
19720 (fold_range): Ditto.
19721 (adjust_pointer_diff_expr): Ditto.
19722 (gimple_range_adjustment): Ditto.
19723 (gimple_range_base_of_assignment): Ditto.
19724 (gimple_range_operand1): Ditto.
19725 (gimple_range_operand2): Ditto.
19726 (gimple_range_calc_op1): Relocate to gimple-range-gori.cc.
19727 (gimple_range_calc_op2): Ditto.
19728 (fold_using_range::fold_stmt): Relocate to gimple-range-fold.cc.
19729 (fold_using_range::range_of_range_op): Ditto.
19730 (fold_using_range::range_of_address): Ditto.
19731 (fold_using_range::range_of_phi): Ditto.
19732 (fold_using_range::range_of_call): Ditto.
19733 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
19734 (fold_using_range::range_of_builtin_call): Ditto.
19735 (fold_using_range::range_of_cond_expr): Ditto.
19736 (fold_using_range::range_of_ssa_name_with_loop_info): Ditto.
19737 (fold_using_range::relation_fold_and_or): Ditto.
19738 (fold_using_range::postfold_gcond_edges): Ditto.
19739 * gimple-range.h: Add gimple-range-fold.h to include files. Change
19740 GIMPLE_RANGE_STMT_H to GIMPLE_RANGE_H.
19741 (gimple_range_handler): Relocate to gimple-range-fold.h.
19742 (gimple_range_ssa_p): Ditto.
19743 (range_compatible_p): Ditto.
19744 (class fur_source): Ditto.
19745 (class fur_stmt): Ditto.
19746 (class fold_using_range): Ditto.
19747 (gimple_range_calc_op1): Relocate to gimple-range-gori.h
19748 (gimple_range_calc_op2): Ditto.
19750 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
19752 PR tree-optimization/101148
19753 PR tree-optimization/101014
19754 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
19755 (ranger_cache::~ranger_cache): Adjust.
19756 (ranger_cache::block_range): Check if propagation disallowed.
19757 (ranger_cache::propagate_cache): Disallow propagation if new value
19758 can't be stored properly.
19759 * gimple-range-cache.h (ranger_cache::m_propfail): New member.
19761 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
19763 * gimple-range-cache.cc (class ssa_block_ranges): Adjust prototype.
19764 (sbr_vector::set_bb_range): Return true.
19765 (class sbr_sparse_bitmap): Adjust.
19766 (sbr_sparse_bitmap::set_bb_range): Return value.
19767 (block_range_cache::set_bb_range): Return value.
19768 (ranger_cache::propagate_cache): Use return value to print msg.
19769 * gimple-range-cache.h (class block_range_cache): Adjust.
19771 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
19773 * gimple-range.cc (dump_bb): Use range_on_edge from the cache.
19775 2021-06-23 Jeff Law <jeffreyalaw@gmail.com>
19777 * config/h8300/logical.md (<code><mode>3<ccnz>): Use <cczn>
19778 so this pattern can be used for test/compare removal. Pass
19779 current insn to compute_logical_op_length and output_logical_op.
19780 * config/h8300/h8300.c (compute_logical_op_cc): Remove.
19781 (h8300_and_costs): Add argument to compute_logical_op_length.
19782 (output_logical_op): Add new argument. Use it to determine if the
19783 condition codes are used and adjust the output accordingly.
19784 (compute_logical_op_length): Add new argument and update length
19785 computations when condition codes are used.
19786 * config/h8300/h8300-protos.h (compute_logical_op_length): Update
19788 (output_logical_op): Likewise.
19790 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
19793 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
19794 Handle 64bit modes for TARGET_XOP. Use indirect gen_* functions.
19795 * config/i386/mmx.md (mmx_ppermv64): New insn pattern.
19796 * config/i386/i386.md (unspec): Move UNSPEC_XOP_PERMUTE from ...
19797 * config/i386/sse.md (unspec): ... here.
19799 2021-06-23 Martin Liska <mliska@suse.cz>
19802 * optc-save-gen.awk: Put back arm_fp16_format to
19805 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
19808 * config/i386/i386.md (bsr_rex64): Add zero-flag setting RTX.
19811 (clz<mode>2): Update RTX pattern for additions.
19813 2021-06-23 Jakub Jelinek <jakub@redhat.com>
19815 PR middle-end/101167
19816 * omp-low.c (lower_omp_regimplify_p): Regimplify also PARM_DECLs
19817 and RESULT_DECLs that have DECL_HAS_VALUE_EXPR_P set.
19819 2021-06-22 Sergei Trofimovich <siarheit@google.com>
19821 * doc/rtl.texi: drop unbalanced parenthesis.
19823 2021-06-22 Richard Biener <rguenther@suse.de>
19825 PR middle-end/101156
19826 * gimplify.c (gimplify_expr): Remove premature incorrect
19829 2021-06-22 Jakub Jelinek <jakub@redhat.com>
19831 PR tree-optimization/101159
19832 * tree-vect-patterns.c (vect_recog_popcount_pattern): Fix some
19835 2021-06-22 Jakub Jelinek <jakub@redhat.com>
19837 PR middle-end/101160
19838 * function.c (assign_parms): For decl_result with TYPE_EMPTY_P type
19839 clear crtl->return_rtx instead of keeping it referencing a pseudo.
19841 2021-06-22 Jakub Jelinek <jakub@redhat.com>
19842 Andrew Pinski <apinski@marvell.com>
19844 PR tree-optimization/101162
19845 * fold-const.c (range_check_type): Handle OFFSET_TYPE like pointer
19848 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19850 * range-op.cc (range_relational_tests): New.
19851 (range_op_tests): Call range_relational_tests.
19853 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19855 * range-op.cc (operator_cast::lhs_op1_relation): New.
19856 (operator_identity::lhs_op1_relation): Mew.
19858 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19860 * range-op.cc (operator_minus::op1_op2_relation_effect): New.
19862 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19864 * range-op.cc (operator_plus::lhs_op1_relation): New.
19865 (operator_plus::lhs_op2_relation): New.
19867 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19869 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a
19870 relation_oracle if dominators exist.
19871 (ranger_cache::~ranger_cache): Dispose of oracle.
19872 (ranger_cache::dump_bb): Dump oracle.
19873 * gimple-range.cc (fur_source::fur_source): New.
19874 (fur_source::get_operand): Use mmeber query.
19875 (fur_source::get_phi_operand): Use member_query.
19876 (fur_source::query_relation): New.
19877 (fur_source::register_dependency): Delete.
19878 (fur_source::register_relation): New.
19879 (fur_edge::fur_edge): Adjust.
19880 (fur_edge::get_phi_operand): Fix comment.
19881 (fur_edge::query): Delete.
19882 (fur_stmt::fur_stmt): Adjust.
19883 (fur_stmt::query): Delete.
19884 (fur_depend::fur_depend): Adjust.
19885 (fur_depend::register_relation): New.
19886 (fur_depend::register_relation): New.
19887 (fur_list::fur_list): Adjust.
19888 (fur_list::get_operand): Use member query.
19889 (fold_using_range::range_of_range_op): Process and query relations.
19890 (fold_using_range::range_of_address): Adjust dependency call.
19891 (fold_using_range::range_of_phi): Ditto.
19892 (gimple_ranger::gimple_ranger): New. Use ranger_ache oracle.
19893 (fold_using_range::relation_fold_and_or): New.
19894 (fold_using_range::postfold_gcond_edges): New.
19895 * gimple-range.h (class gimple_ranger): Adjust.
19896 (class fur_source): Adjust members.
19897 (class fur_stmt): Ditto.
19898 (class fold_using_range): Ditto.
19900 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19902 * range-op.cc (range_operator::wi_fold): Apply relation effect.
19903 (range_operator::fold_range): Adjust and apply relation effect.
19904 (*::fold_range): Add relation parameters.
19905 (*::op1_range): Ditto.
19906 (*::op2_range): Ditto.
19907 (range_operator::lhs_op1_relation): New.
19908 (range_operator::lhs_op2_relation): New.
19909 (range_operator::op1_op2_relation): New.
19910 (range_operator::op1_op2_relation_effect): New.
19911 (relop_early_resolve): New.
19912 (operator_equal::op1_op2_relation): New.
19913 (operator_equal::fold_range): Call relop_early_resolve.
19914 (operator_not_equal::op1_op2_relation): New.
19915 (operator_not_equal::fold_range): Call relop_early_resolve.
19916 (operator_lt::op1_op2_relation): New.
19917 (operator_lt::fold_range): Call relop_early_resolve.
19918 (operator_le::op1_op2_relation): New.
19919 (operator_le::fold_range): Call relop_early_resolve.
19920 (operator_gt::op1_op2_relation): New.
19921 (operator_gt::fold_range): Call relop_early_resolve.
19922 (operator_ge::op1_op2_relation): New.
19923 (operator_ge::fold_range): Call relop_early_resolve.
19924 * range-op.h (class range_operator): Adjust parameters and methods.
19926 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19928 * Makefile.in (OBJS): Add value-relation.o.
19929 * gimple-range.h: Adjust include files.
19930 * tree-data-ref.c: Adjust include file order.
19931 * value-query.cc (range_query::get_value_range): Default to no oracle.
19932 (range_query::query_relation): New.
19933 (range_query::query_relation): New.
19934 * value-query.h (class range_query): Adjust.
19935 * value-relation.cc: New.
19936 * value-relation.h: New.
19938 2021-06-22 Richard Biener <rguenther@suse.de>
19940 PR tree-optimization/101151
19941 * tree-ssa-sink.c (statement_sink_location): Expand irreducible
19944 2021-06-22 Jojo R <rjiejie@linux.alibaba.com>
19946 * config/riscv/riscv.c (thead_c906_tune_info): New.
19947 (riscv_tune_info_table): Use new tune.
19949 2021-06-22 Richard Biener <rguenther@suse.de>
19951 PR tree-optimization/101158
19952 * tree-vect-slp.c (vect_build_slp_tree_1): Move same operand
19953 checking after checking for matching operation.
19955 2021-06-22 Richard Biener <rguenther@suse.de>
19957 PR tree-optimization/101159
19958 * tree-vect-patterns.c (vect_recog_popcount_pattern): Add
19959 missing NULL vectype check.
19961 2021-06-22 Richard Biener <rguenther@suse.de>
19963 PR tree-optimization/101154
19964 * tree-vect-slp.c (vect_build_slp_tree_2): Fix out-of-bound access.
19966 2021-06-22 Jakub Jelinek <jakub@redhat.com>
19969 * config/i386/i386-protos.h (ix86_last_zero_store_uid): Declare.
19970 * config/i386/i386-expand.c (ix86_last_zero_store_uid): New variable.
19971 * config/i386/i386.c (ix86_expand_prologue): Clear it.
19972 * config/i386/i386.md (peephole2s for 1/2/4 stores of const0_rtx):
19973 Remove "" from match_operand. Emit new insns using emit_move_insn and
19974 set ix86_last_zero_store_uid to INSN_UID of the last store.
19975 Add peephole2s for 1/2/4 stores of const0_rtx following previous
19978 2021-06-22 Martin Liska <mliska@suse.cz>
19980 * auto-profile.c (AUTO_PROFILE_VERSION): Bump as string format
19983 2021-06-22 Martin Liska <mliska@suse.cz>
19985 * gcov-io.h: Remove padding entries.
19987 2021-06-22 liuhongt <hongtao.liu@intel.com>
19989 PR tree-optimization/97770
19990 * tree-vect-patterns.c (vect_recog_popcount_pattern):
19992 (vect_recog_func vect_vect_recog_func_ptrs): Add new pattern.
19994 2021-06-22 liuhongt <hongtao.liu@intel.com>
19997 * config/i386/i386-builtin.def (BDESC): Adjust builtin name.
19998 * config/i386/sse.md (<avx512>_expand<mode>_mask): Rename to ..
19999 (expand<mode>_mask): this ..
20000 (*expand<mode>_mask): New pre_reload splitter to transform
20001 v{,p}expand* to vmov* when mask is zero, all ones, or has all
20002 ones in it's lower part, otherwise still generate
20005 2021-06-22 liuhongt <hongtao.liu@intel.com>
20008 * config/i386/i386-expand.c
20009 (ix86_expand_special_args_builtin): Keep constm1_operand only
20010 if it satisfies insn's operand predicate.
20012 2021-06-21 Jason Merrill <jason@redhat.com>
20015 * df-scan.c (df_ref_record): Check that regno < endregno.
20016 * function.c (assign_parms, expand_function_end): Do nothing with a
20017 TYPE_EMPTY_P result.
20019 2021-06-21 Richard Biener <rguenther@suse.de>
20021 PR tree-optimization/101120
20022 * tree-vect-data-refs.c (bump_vector_ptr): Fold the
20024 * tree-vect-slp.c (vect_transform_slp_perm_load): Add
20025 DR chain DCE capability.
20026 * tree-vectorizer.h (vect_transform_slp_perm_load): Adjust.
20027 * tree-vect-stmts.c (vectorizable_load): Remove unused
20028 loads in the DR chain for SLP.
20030 2021-06-21 Jakub Jelinek <jakub@redhat.com>
20032 PR inline-asm/100785
20033 * gimplify.c (gimplify_asm_expr): Don't diagnose errors if
20034 output or input operands were already error_mark_node.
20035 * cfgexpand.c (expand_asm_stmt): If errors are emitted,
20036 remove all inputs, outputs and clobbers from the asm and
20037 set template to "".
20039 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20041 * config/arm/arm_neon.h (vceq_s8): Replace builtin with __a == __b.
20042 (vceq_s16): Likewise.
20043 (vceq_s32): Likewise.
20044 (vceq_u8): Likewise.
20045 (vceq_u16): Likewise.
20046 (vceq_u32): Likewise.
20047 (vceq_p8): Likewise.
20048 (vceqq_s8): Likewise.
20049 (vceqq_s16): Likewise.
20050 (vceqq_s32): Likewise.
20051 (vceqq_u8): Likewise.
20052 (vceqq_u16): Likewise.
20053 (vceqq_u32): Likewise.
20054 (vceqq_p8): Likewise.
20055 (vceq_f32): Gate __a == __b on __FAST_MATH__.
20056 (vceqq_f32): Likewise.
20057 (vceq_f16): Likewise.
20058 (vceqq_f16): Likewise.
20060 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20063 * config/arm/iterators.md (NEON_VACMP): Remove.
20064 * config/arm/neon.md (neon_vca<cmp_op><mode>): Use GLTE instead of GTGE
20066 (neon_vca<cmp_op><mode>_insn): Likewise.
20067 (neon_vca<cmp_op_unsp><mode>_insn_unspec): Use NEON_VAGLTE instead of
20070 2021-06-21 Richard Biener <rguenther@suse.de>
20072 PR tree-optimization/101121
20073 * tree-vect-slp.c (vect_build_slp_tree_2): To not fail fatally
20074 when we just lack a stmt with the desired op when doing permutation.
20075 (vect_build_slp_tree): When caching a failed SLP build attempt
20076 assert that at least one lane is marked as not matching.
20078 2021-06-21 liuhongt <hongtao.liu@intel.com>
20081 * config/i386/i386.md: (*anddi_1): Disparage slightly the mask
20082 register alternative.
20083 (*and<mode>_1): Ditto.
20085 (*andn<mode>_1): Ditto.
20086 (*<code><mode>_1): Ditto.
20087 (*<code>qi_1): Ditto.
20088 (*one_cmpl<mode>2_1): Ditto.
20089 (*one_cmplsi2_1_zext): Ditto.
20090 (*one_cmplqi2_1): Ditto.
20091 * config/i386/i386.c (x86_order_regs_for_local_alloc): Change
20092 the order of mask registers to be before general registers.
20094 2021-06-21 Roger Sayle <roger@nextmovesoftware.com>
20097 * config/i386/i386.md: New define_peephole2s to shrink writing
20098 1, 2 or 4 consecutive zeros to memory when optimizing for size.
20100 2021-06-18 Jeff Law <jeffreyalaw@gmail.com>
20102 * config/h8300/h8300.c (h8300_select_cc_mode): Handle SYMBOL_REF.
20103 * config/h8300/logical.md (<code><mode>3 logcial expander): Generate
20104 more efficient code when the source can be trivially simplified.
20106 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
20108 * gimple-range-cache.cc (ranger_cache::range_of_def): Calculate
20109 a range if global is not available.
20110 (ranger_cache::entry_range): Fallback to range_of_def.
20111 * gimple-range-cache.h (range_of_def): Adjust prototype.
20113 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
20115 PR tree-optimization/101014
20116 * gimple-range-cache.cc (ranger_cache::ranger_cache): Remove poor
20118 (ranger_cache::~ranger_cache): Ditto.
20119 (ranger_cache::enable_new_values): Delete.
20120 (ranger_cache::push_poor_value): Delete.
20121 (ranger_cache::range_of_def): Remove poor value processing.
20122 (ranger_cache::entry_range): Ditto.
20123 (ranger_cache::fill_block_cache): Ditto.
20124 * gimple-range-cache.h (class ranger_cache): Remove poor value members.
20125 * gimple-range.cc (gimple_ranger::range_of_expr): Remove call.
20126 * gimple-range.h (class gimple_ranger): Adjust.
20128 2021-06-18 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
20131 * common/config/arm/arm-common.c (arm_canon_arch_option_1): New function
20132 derived from arm_canon_arch.
20133 (arm_canon_arch_option): Call it.
20134 (arm_canon_arch_multilib_option): New function.
20135 * config/arm/arm-cpus.in (IGNORE_FOR_MULTILIB): New fgroup.
20136 * config/arm/arm.h (arm_canon_arch_multilib_option): New prototype.
20137 (CANON_ARCH_MULTILIB_SPEC_FUNCTION): New macro.
20138 (MULTILIB_ARCH_CANONICAL_SPECS): New macro.
20139 (DRIVER_SELF_SPECS): Add MULTILIB_ARCH_CANONICAL_SPECS.
20140 * config/arm/arm.opt (mlibarch): New option.
20141 * config/arm/t-rmprofile (MULTILIB_MATCHES): For armv8*-m, replace use
20142 of march on RHS with mlibarch.
20144 2021-06-18 Marcel Vollweiler <marcel@codesourcery.com>
20146 * config.in: Regenerate.
20147 * config/gcn/gcn.c (print_operand_address): Fix for global_load assembler
20149 * configure: Regenerate.
20150 * configure.ac: Fix for global_load assembler functions.
20152 2021-06-18 Richard Biener <rguenther@suse.de>
20154 PR tree-optimization/101112
20155 * tree-vect-slp.c (vect_slp_linearize_chain): Fix condition
20156 to lookup a pattern stmt def.
20158 2021-06-18 Jakub Jelinek <jakub@redhat.com>
20160 PR middle-end/101062
20161 * stor-layout.c (finish_bitfield_layout): Don't add bitfield
20162 representatives in QUAL_UNION_TYPE.
20164 2021-06-18 Andrew Pinski <apinski@marvell.com>
20166 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
20167 Add counting of how many times it is done.
20168 (factor_out_conditional_conversion): Likewise.
20169 (match_simplify_replacement): Likewise.
20170 (value_replacement): Likewise.
20171 (spaceship_replacement): Likewise.
20172 (cond_store_replacement): Likewise.
20173 (cond_if_else_store_replacement_1): Likewise.
20174 (hoist_adjacent_loads): Likewise.
20176 2021-06-18 Andrew Pinski <apinski@marvell.com>
20178 * tree-cfg.c (verify_gimple_assign_unary): Reject point and offset
20179 types on NEGATE_EXPR, ABS_EXPR, BIT_NOT_EXPR, PAREN_EXPR and CNONJ_EXPR.
20180 (verify_gimple_assign_binary): Reject point and offset types on
20181 MULT_EXPR, MULT_HIGHPART_EXPR, TRUNC_DIV_EXPR, CEIL_DIV_EXPR,
20182 FLOOR_DIV_EXPR, ROUND_DIV_EXPR, TRUNC_MOD_EXPR, CEIL_MOD_EXPR,
20183 FLOOR_MOD_EXPR, ROUND_MOD_EXPR, RDIV_EXPR, and EXACT_DIV_EXPR.
20185 2021-06-18 Michael Meissner <meissner@linux.ibm.com>
20187 * config/rs6000/rs6000.c (rs6000_emit_minmax): Add support for ISA
20188 3.1 IEEE 128-bit floating point xsmaxcqp/xsmincqp instructions.
20189 * config/rs6000/rs6000.md (s<minmax><mode>3, IEEE128 iterator):
20192 2021-06-17 Aaron Sawdey <acsawdey@linux.ibm.com>
20194 * config/rs6000/genfusion.pl (gen_logical_addsubf): Add
20195 earlyclobber to alts 0/1.
20196 (gen_addadd): Add earlyclobber to alts 0/1.
20197 * config/rs6000/fusion.md: Regenerate file.
20199 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20201 * cfgloopanal.c (get_loop_hot_path): Make path an auto_vec.
20203 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
20205 * gimple-range-cache.cc: Comment cleanups.
20206 * gimple-range-gori.cc: Comment cleanups.
20207 * gimple-range.cc: Comment/spacing cleanups
20208 * value-range.h: Comment cleanups.
20210 2021-06-17 H.J. Lu <hjl.tools@gmail.com>
20213 * calls.c (expand_call): Replace PUSH_ARGS with
20214 targetm.calls.push_argument (0).
20215 (emit_library_call_value_1): Likewise.
20216 * defaults.h (PUSH_ARGS): Removed.
20217 (PUSH_ARGS_REVERSED): Replace PUSH_ARGS with
20218 targetm.calls.push_argument (0).
20219 * expr.c (block_move_libcall_safe_for_call_parm): Likewise.
20220 (emit_push_insn): Pass the number bytes to push to
20221 targetm.calls.push_argument and pass 0 if ARGS_ADDR is 0.
20222 * hooks.c (hook_bool_uint_true): New.
20223 * hooks.h (hook_bool_uint_true): Likewise.
20224 * rtlanal.c (nonzero_bits1): Replace PUSH_ARGS with
20225 targetm.calls.push_argument (0).
20226 * target.def (push_argument): Add a targetm.calls hook.
20227 * targhooks.c (default_push_argument): New.
20228 * targhooks.h (default_push_argument): Likewise.
20229 * config/bpf/bpf.h (PUSH_ARGS): Removed.
20230 * config/cr16/cr16.c (TARGET_PUSH_ARGUMENT): New.
20231 * config/cr16/cr16.h (PUSH_ARGS): Removed.
20232 * config/i386/i386.c (ix86_push_argument): New.
20233 (TARGET_PUSH_ARGUMENT): Likewise.
20234 * config/i386/i386.h (PUSH_ARGS): Removed.
20235 * config/m32c/m32c.c (TARGET_PUSH_ARGUMENT): New.
20236 * config/m32c/m32c.h (PUSH_ARGS): Removed.
20237 * config/nios2/nios2.h (PUSH_ARGS): Likewise.
20238 * config/pru/pru.h (PUSH_ARGS): Likewise.
20239 * doc/tm.texi.in: Remove PUSH_ARGS documentation. Add
20240 TARGET_PUSH_ARGUMENT hook.
20241 * doc/tm.texi: Regenerated.
20243 2021-06-17 Uroš Bizjak <ubizjak@gmail.com>
20246 * config/i386/i386-expand.c (expand_vector_set_var):
20247 Handle V2FS mode remapping. Pass TARGET_MMX_WITH_SSE to
20248 ix86_expand_vector_init_duplicate.
20249 (ix86_expand_vector_init_duplicate): Emit insv_1 for
20250 QImode for !TARGET_PARTIAL_REG_STALL.
20251 * config/i386/predicates.md (vec_setm_mmx_operand): New predicate.
20252 * config/i386/mmx.md (vec_setv2sf): Use vec_setm_mmx_operand
20253 as operand 2 predicate. Call ix86_expand_vector_set_var
20254 for non-constant index operand.
20255 (vec_setv2si): Ditto.
20256 (vec_setv4hi): Ditto.
20257 (vec_setv8qi): ditto.
20259 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
20261 PR tree-optimization/100790
20262 * gimple-range.cc (range_of_builtin_call): Cleanup clz and ctz
20265 2021-06-17 Martin Liska <mliska@suse.cz>
20267 * doc/invoke.texi: Use consistently -O1 instead of -O.
20269 2021-06-17 Martin Liska <mliska@suse.cz>
20271 * gcov-io.h: Update documentation entry about string format.
20273 2021-06-17 Marius Hillenbrand <mhillen@linux.ibm.com>
20276 * config/s390/vecintrin.h (vec_doublee): Fix to use
20277 __builtin_s390_vflls.
20278 (vec_floate): Fix to use __builtin_s390_vflrd.
20280 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20282 * dominance.c (get_dominated_to_depth): Return auto_vec<basic_block>.
20283 * dominance.h (get_dominated_to_depth): Likewise.
20284 (get_all_dominated_blocks): Likewise.
20285 * cfgcleanup.c (delete_unreachable_blocks): Adjust.
20286 * gcse.c (hoist_code): Likewise.
20287 * tree-cfg.c (remove_edge_and_dominated_blocks): Likewise.
20288 * tree-parloops.c (oacc_entry_exit_ok): Likewise.
20289 * tree-ssa-dce.c (eliminate_unnecessary_stmts): Likewise.
20290 * tree-ssa-phiprop.c (pass_phiprop::execute): Likewise.
20292 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20294 * dominance.c (get_dominated_by_region): Return auto_vec<basic_block>.
20295 * dominance.h (get_dominated_by_region): Likewise.
20296 * tree-cfg.c (gimple_duplicate_sese_region): Adjust.
20297 (gimple_duplicate_sese_tail): Likewise.
20298 (move_sese_region_to_fn): Likewise.
20300 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20302 * dominance.c (get_dominated_by): Return auto_vec<basic_block>.
20303 * dominance.h (get_dominated_by): Likewise.
20304 * auto-profile.c (afdo_find_equiv_class): Adjust.
20305 * cfgloopmanip.c (duplicate_loop_to_header_edge): Likewise.
20306 * loop-unroll.c (unroll_loop_runtime_iterations): Likewise.
20307 * tree-cfg.c (test_linear_chain): Likewise.
20308 (test_diamond): Likewise.
20310 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20312 * cfgloop.h (get_loop_hot_path): Return auto_vec<basic_block>.
20313 * cfgloopanal.c (get_loop_hot_path): Likewise.
20314 * tree-ssa-loop-ivcanon.c (tree_estimate_loop_size): Likewise.
20316 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20318 * cgraph.c (cgraph_node::collect_callers): Return
20319 auto_vec<cgraph_edge *>.
20320 * cgraph.h (cgraph_node::collect_callers): Likewise.
20321 * ipa-cp.c (create_specialized_node): Adjust.
20322 (decide_about_value): Likewise.
20323 (decide_whether_version_node): Likewise.
20324 * ipa-sra.c (process_isra_node_results): Likewise.
20326 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20328 * vec.h (vl_ptr>::using_auto_storage): Handle null m_vec.
20329 (auto_vec<T, 0>::auto_vec): Define move constructor, and delete copy
20331 (auto_vec<T, 0>::operator=): Define move assignment and delete copy
20334 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
20336 * gimple-range.cc (debug_seed_ranger): New.
20337 (dump_ranger): New.
20338 (debug_ranger): New.
20340 2021-06-17 Richard Biener <rguenther@suse.de>
20342 PR tree-optimization/54400
20343 * tree-vectorizer.h (enum slp_instance_kind): Add
20344 slp_inst_kind_bb_reduc.
20345 (reduction_fn_for_scalar_code): Declare.
20346 * tree-vect-data-refs.c (vect_slp_analyze_instance_dependence):
20347 Check SLP_INSTANCE_KIND instead of looking at the
20349 (vect_slp_analyze_instance_alignment): Likewise.
20350 * tree-vect-loop.c (reduction_fn_for_scalar_code): Export.
20351 * tree-vect-slp.c (vect_slp_linearize_chain): Split out
20352 chain linearization from vect_build_slp_tree_2 and generalize
20353 for the use of BB reduction vectorization.
20354 (vect_build_slp_tree_2): Adjust accordingly.
20355 (vect_optimize_slp): Elide permutes at the root of BB reduction
20357 (vectorizable_bb_reduc_epilogue): New function.
20358 (vect_slp_prune_covered_roots): Likewise.
20359 (vect_slp_analyze_operations): Use them.
20360 (vect_slp_check_for_constructors): Recognize associatable
20361 chains for BB reduction vectorization.
20362 (vectorize_slp_instance_root_stmt): Generate code for the
20363 BB reduction epilogue.
20365 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
20367 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Check with
20369 (gori_compute::may_recompute_p): New.
20370 (gori_compute::outgoing_edge_range_p): Perform recomputations.
20371 * gimple-range-gori.h (class gori_compute): Add prototype.
20373 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
20375 * gimple-range-cache.cc (ranger_cache::range_on_edge): Always return
20376 true when a range can be calculated.
20377 * gimple-range.cc (gimple_ranger::dump_bb): Check has_edge_range_p.
20379 2021-06-16 Martin Sebor <msebor@redhat.com>
20381 * doc/invoke.texi (-Wmismatched-dealloc, -Wmismatched-new-delete):
20382 Correct documented defaults.
20384 2021-06-16 Andrew MacLeod <amacleod@redhat.com>
20386 * gimple-range-cache.cc (ranger_cache::ranger_cache): Initialize
20387 m_new_value_p directly.
20389 2021-06-16 Uroš Bizjak <ubizjak@gmail.com>
20392 * config/i386/i386-expand.c (expand_vec_perm_2perm_pblendv):
20393 Handle 64bit modes for TARGET_SSE4_1.
20394 (expand_vec_perm_pshufb2): Handle 64bit modes for TARGET_SSSE3.
20395 (expand_vec_perm_even_odd_pack): Handle V4HI mode.
20396 (expand_vec_perm_even_odd_1) <case E_V4HImode>: Expand via
20397 expand_vec_perm_pshufb2 for TARGET_SSSE3 and via
20398 expand_vec_perm_even_odd_pack for TARGET_SSE4_1.
20399 * config/i386/mmx.md (mmx_packusdw): New insn pattern.
20401 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
20403 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn<mode>):
20404 Change to an expander that emits the correct instruction
20405 depending on endianness.
20406 (aarch64_<sur><addsub>hn<mode>_insn_le): Define.
20407 (aarch64_<sur><addsub>hn<mode>_insn_be): Define.
20409 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
20411 * config/aarch64/aarch64-simd-builtins.def: Split generator
20412 for aarch64_<su>qmovn builtins into scalar and vector
20414 * config/aarch64/aarch64-simd.md (aarch64_<su>qmovn<mode>_insn_le):
20416 (aarch64_<su>qmovn<mode>_insn_be): Define.
20417 (aarch64_<su>qmovn<mode>): Split into scalar and vector
20418 variants. Change vector variant to an expander that emits the
20419 correct instruction depending on endianness.
20421 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
20423 * config/aarch64/aarch64-simd-builtins.def: Split generator
20424 for aarch64_sqmovun builtins into scalar and vector variants.
20425 * config/aarch64/aarch64-simd.md (aarch64_sqmovun<mode>):
20426 Split into scalar and vector variants. Change vector variant
20427 to an expander that emits the correct instruction depending
20429 (aarch64_sqmovun<mode>_insn_le): Define.
20430 (aarch64_sqmovun<mode>_insn_be): Define.
20432 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
20434 * config/aarch64/aarch64-simd.md (aarch64_xtn<mode>_insn_le):
20435 Define - modeling zero-high-half semantics.
20436 (aarch64_xtn<mode>): Change to an expander that emits the
20437 appropriate instruction depending on endianness.
20438 (aarch64_xtn<mode>_insn_be): Define - modeling zero-high-half
20440 (aarch64_xtn2<mode>_le): Rename to...
20441 (aarch64_xtn2<mode>_insn_le): This.
20442 (aarch64_xtn2<mode>_be): Rename to...
20443 (aarch64_xtn2<mode>_insn_be): This.
20444 (vec_pack_trunc_<mode>): Emit truncation instruction instead
20446 * config/aarch64/iterators.md (Vnarrowd): Add Vnarrowd mode
20447 attribute iterator.
20449 2021-06-16 Martin Jambor <mjambor@suse.cz>
20451 PR tree-optimization/100453
20452 * tree-sra.c (create_access): Disqualify any const candidates
20453 which are written to.
20454 (sra_modify_expr): Do not store sub-replacements back to a const base.
20455 (handle_unscalarized_data_in_subtree): Likewise.
20456 (sra_modify_assign): Likewise. Earlier, use TREE_READONLy test
20457 instead of constant_decl_p.
20459 2021-06-16 Jakub Jelinek <jakub@redhat.com>
20461 PR middle-end/101062
20462 * stor-layout.c (finish_bitfield_representative): For fields in unions
20463 assume nextf is always NULL.
20464 (finish_bitfield_layout): Compute bit field representatives also in
20465 unions, but handle it as if each bitfield was the only field in the
20468 2021-06-16 Richard Biener <rguenther@suse.de>
20470 PR tree-optimization/101088
20471 * tree-ssa-loop-im.c (sm_seq_valid_bb): Only look for
20472 supported refs on edges. Do not assert same ref but
20473 different kind stores are unsuported but mark them so.
20474 (hoist_memory_references): Only look for supported refs
20477 2021-06-16 Roger Sayle <roger@nextmovesoftware.com>
20479 PR rtl-optimization/46235
20480 * config/i386/i386.md: New define_split for bt followed by cmov.
20481 (*bt<mode>_setcqi): New define_insn_and_split for bt followed by setc.
20482 (*bt<mode>_setncqi): New define_insn_and_split for bt then setnc.
20483 (*bt<mode>_setnc<mode>): New define_insn_and_split for bt followed
20484 by setnc with zero extension.
20486 2021-06-16 Richard Biener <rguenther@suse.de>
20488 PR tree-optimization/101083
20489 * tree-vect-slp.c (vect_slp_build_two_operator_nodes): Get
20490 vectype as argument.
20491 (vect_build_slp_tree_2): Adjust.
20493 2021-06-15 Martin Sebor <msebor@redhat.com>
20495 PR middle-end/100876
20496 * builtins.c: (gimple_call_return_array): Account for size_t
20497 mangling as either unsigned int or unsigned long
20499 2021-06-15 Jeff Law <jeffreyalaw@gmail.com>
20501 * compare-elim.c (try_eliminate_compare): Run DCE to clean things
20502 up before eliminating comparisons.
20504 2021-06-15 Aldy Hernandez <aldyh@redhat.com>
20506 * range-op.cc (operator_bitwise_or::wi_fold): Make sure
20507 nonzero|X is nonzero.
20508 (range_op_bitwise_and_tests): Add tests for above.
20510 2021-06-15 Carl Love <cel@us.ibm.com>
20513 * config/rs6000/rs6000-builtin.def (VCMPEQUT): Fix the ICODE for the
20515 (VRLQ, VSLQ, VSRQ, VSRAQ): Remove unused BU_P10_OVERLOAD_2
20518 2021-06-15 Tobias Burnus <tobias@codesourcery.com>
20521 * gimplify.c (enum gimplify_defaultmap_kind): Add GDMK_SCALAR_TARGET.
20522 (struct gimplify_omp_ctx): Extend defaultmap array by one.
20523 (new_omp_context): Init defaultmap[GDMK_SCALAR_TARGET].
20524 (omp_notice_variable): Update type classification for Fortran.
20525 (gimplify_scan_omp_clauses): Update calls for new argument; handle
20526 GDMK_SCALAR_TARGET; for Fortran, GDMK_POINTER avoid GOVD_MAP_0LEN_ARRAY.
20527 * langhooks-def.h (lhd_omp_scalar_p): Add 'ptr_ok' argument.
20528 * langhooks.c (lhd_omp_scalar_p): Likewise.
20529 (LANG_HOOKS_OMP_ALLOCATABLE_P, LANG_HOOKS_OMP_SCALAR_TARGET_P): New.
20530 (LANG_HOOKS_DECLS): Add them.
20531 * langhooks.h (struct lang_hooks_for_decls): Add new hooks, update
20532 omp_scalar_p pointer type to include the new bool argument.
20534 2021-06-15 David Malcolm <dmalcolm@redhat.com>
20536 * doc/analyzer.texi
20537 (Special Functions for Debugging the Analyzer): Add
20538 __analyzer_dump_capacity.
20540 2021-06-15 Jakub Jelinek <jakub@redhat.com>
20543 * expr.c (expand_expr_real_2) <case VEC_PACK_FIX_TRUNC_EXPR,
20544 case VEC_PACK_TRUNC_EXPR>: Clear subtarget when changing mode.
20546 2021-06-15 Richard Biener <rguenther@suse.de>
20548 * cfgloopanal.c (mark_irreducible_loops): Use a dominance
20549 check to identify loop latches.
20550 * cfgloop.c (verify_loop_structure): Likewise.
20551 * loop-init.c (apply_loop_flags): Allow marked irreducible
20552 regions even with multiple latches.
20553 * predict.c (rebuild_frequencies): Simplify.
20555 2021-06-15 Richard Biener <rguenther@suse.de>
20557 * tree-ssa-threadupdate.c
20558 (jump_thread_path_registry::mark_threaded_blocks): Assert we
20559 have marked irreducible regions.
20561 2021-06-14 Martin Sebor <msebor@redhat.com>
20564 * builtins.c (gimple_call_return_array): Check for attribute fn spec.
20565 Handle calls to placement new.
20566 (ndecl_dealloc_argno): Avoid placement delete.
20568 2021-06-14 Peter Bergner <bergner@linux.ibm.com>
20571 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Use
20572 create_tmp_reg_or_ssa_name().
20574 2021-06-14 Andrew MacLeod <amacleod@redhat.com>
20576 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
20577 (ranger_cache::enable_new_values): Set to specified value and
20578 return the old value.
20579 (ranger_cache::disable_new_values): Delete.
20580 (ranger_cache::fill_block_cache): Disable non 1st order derived
20582 * gimple-range-cache.h (ranger_cache): Adjust prototypes.
20583 * gimple-range.cc (gimple_ranger::range_of_expr): Adjust.
20585 2021-06-14 Uroš Bizjak <ubizjak@gmail.com>
20588 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const):
20589 Return true early when testing with V2HImode.
20590 * config/i386/mmx.md (*punpckwd): Split to sse2_pshuflw_1.
20592 2021-06-14 Christophe Lyon <christophe.lyon@linaro.org>
20594 * config/arm/mve.md (mve_vec_unpack<US>_lo_<mode>): New pattern.
20595 (mve_vec_unpack<US>_hi_<mode>): New pattern.
20596 (@mve_vec_pack_trunc_lo_<mode>): New pattern.
20597 (mve_vmovntq_<supf><mode>): Prefix with '@'.
20598 * config/arm/neon.md (vec_unpack<US>_hi_<mode>): Move to
20600 (vec_unpack<US>_lo_<mode>): Likewise.
20601 (vec_pack_trunc_<mode>): Rename to
20602 neon_quad_vec_pack_trunc_<mode>.
20603 * config/arm/vec-common.md (vec_unpack<US>_hi_<mode>): New
20605 (vec_unpack<US>_lo_<mode>): New.
20606 (vec_pack_trunc_<mode>): New.
20608 2021-06-14 Richard Biener <rguenther@suse.de>
20610 PR tree-optimization/100934
20611 * tree-ssa-dom.c (pass_dominator::execute): Properly
20612 mark irreducible regions.
20614 2021-06-14 Martin Liska <mliska@suse.cz>
20616 * doc/invoke.texi: Put r{...} on the same line as @item.
20618 2021-06-14 Martin Liska <mliska@suse.cz>
20620 * doc/invoke.texi: Add missing newline.
20622 2021-06-14 Martin Liska <mliska@suse.cz>
20624 * doc/invoke.texi: Remove '+' charasters.
20626 2021-06-14 Claudiu Zissulescu <claziss@synopsys.com>
20628 * config.gcc (arc): Add support for with_cpu option.
20629 * config/arc/arc.h (OPTION_DEFAULT_SPECS): Add fpu.
20631 2021-06-14 Richard Biener <rguenther@suse.de>
20633 PR tree-optimization/101031
20634 * tree-ssa-strlen.c (maybe_invalidate): Increment max_size
20635 instead of size when accounting for a possibly string
20638 2021-06-14 Martin Liska <mliska@suse.cz>
20640 * gimple-ssa-evrp.c (pointer_equiv_analyzer::~pointer_equiv_analyzer): Use delete[].
20642 2021-06-14 Aldy Hernandez <aldyh@redhat.com>
20644 * value-query.cc (gimple_range_global): Call get_range_global
20645 if called after inlining.
20647 2021-06-13 Uroš Bizjak <ubizjak@gmail.com>
20650 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
20651 Emit constant permutation insn directly from here.
20653 2021-06-13 Trevor Saunders <tbsaunde@tbsaunde.org>
20655 * attribs.c (find_attribute_namespace): Iterate over vec<> with
20657 * auto-profile.c (afdo_find_equiv_class): Likewise.
20658 * gcc.c (do_specs_vec): Likewise.
20659 (do_spec_1): Likewise.
20660 (driver::set_up_specs): Likewise.
20661 * gimple-loop-jam.c (any_access_function_variant_p): Likewise.
20662 * gimple-ssa-store-merging.c (compatible_load_p): Likewise.
20663 (imm_store_chain_info::try_coalesce_bswap): Likewise.
20664 (imm_store_chain_info::coalesce_immediate_stores): Likewise.
20665 (get_location_for_stmts): Likewise.
20666 * graphite-poly.c (print_iteration_domains): Likewise.
20667 (free_poly_bb): Likewise.
20668 (remove_gbbs_in_scop): Likewise.
20669 (free_scop): Likewise.
20670 (dump_gbb_cases): Likewise.
20671 (dump_gbb_conditions): Likewise.
20672 (print_pdrs): Likewise.
20673 (print_scop): Likewise.
20674 * ifcvt.c (cond_move_process_if_block): Likewise.
20675 * lower-subreg.c (decompose_multiword_subregs): Likewise.
20676 * regcprop.c (pass_cprop_hardreg::execute): Likewise.
20677 * sanopt.c (sanitize_rewrite_addressable_params): Likewise.
20678 * sel-sched-dump.c (dump_insn_vector): Likewise.
20679 * store-motion.c (store_ops_ok): Likewise.
20680 (store_killed_in_insn): Likewise.
20681 * timevar.c (timer::named_items::print): Likewise.
20682 * tree-cfgcleanup.c (cleanup_control_flow_pre): Likewise.
20683 (cleanup_tree_cfg_noloop): Likewise.
20684 * tree-data-ref.c (dump_data_references): Likewise.
20685 (print_dir_vectors): Likewise.
20686 (print_dist_vectors): Likewise.
20687 (dump_data_dependence_relations): Likewise.
20688 (dump_dist_dir_vectors): Likewise.
20689 (dump_ddrs): Likewise.
20690 (create_runtime_alias_checks): Likewise.
20691 (free_subscripts): Likewise.
20692 (save_dist_v): Likewise.
20693 (save_dir_v): Likewise.
20694 (invariant_access_functions): Likewise.
20695 (same_access_functions): Likewise.
20696 (access_functions_are_affine_or_constant_p): Likewise.
20697 (find_data_references_in_stmt): Likewise.
20698 (graphite_find_data_references_in_stmt): Likewise.
20699 (free_dependence_relations): Likewise.
20700 (free_data_refs): Likewise.
20701 * tree-inline.c (copy_debug_stmts): Likewise.
20702 * tree-into-ssa.c (dump_currdefs): Likewise.
20703 (rewrite_update_phi_arguments): Likewise.
20704 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
20705 * tree-vect-data-refs.c (vect_analyze_possibly_independent_ddr):
20707 (vect_slp_analyze_node_dependences): Likewise.
20708 (vect_slp_analyze_instance_dependence): Likewise.
20709 (vect_record_base_alignments): Likewise.
20710 (vect_get_peeling_costs_all_drs): Likewise.
20711 (vect_peeling_supportable): Likewise.
20712 * tree-vectorizer.c (vec_info::~vec_info): Likewise.
20713 (vec_info::free_stmt_vec_infos): Likewise.
20715 2021-06-13 Jeff Law <jeffreyalaw@gmail.com>
20717 * config/h8300/logical.md (<code>qi3_1<cczn>): New pattern.
20718 (andqi3_1<cczn>): Removed.
20719 (<ors>qi3_1): Do not split for IOR/XOR a single bit.
20720 (H8/SX bit logicals): Split out from other patterns.
20721 * config/h8300/multiply.md (mulqihi3_const<cczn>): Renamed from
20722 mulqihi3_const_clobber_flags.
20723 (mulqihi3<cczn>, mulhisi3_const<cczn>, mulhisi3<cczn>): Similarly
20725 2021-06-13 H.J. Lu <hjl.tools@gmail.com>
20728 * config/i386/i386.c (ix86_expand_prologue): Set red_zone_used
20729 to true if red zone is used.
20730 (ix86_output_indirect_jmp): Replace ix86_red_zone_size with
20731 ix86_red_zone_used.
20732 * config/i386/i386.h (machine_function): Add red_zone_used.
20733 (ix86_red_zone_size): Removed.
20734 (ix86_red_zone_used): New.
20735 * config/i386/i386.md (peephole2 patterns): Replace
20736 ix86_red_zone_size with ix86_red_zone_used.
20738 2021-06-12 Jason Merrill <jason@redhat.com>
20740 * doc/extend.texi (unused variable attribute): Applies to
20741 structure fields as well.
20743 2021-06-12 Eugene Rozenfeld <erozen@microsoft.com>
20745 * auto-profile.c (read_profile): fix a typo in an error string
20747 2021-06-11 Thomas Schwinge <thomas@codesourcery.com>
20749 * tree-pretty-print.h (dump_omp_clauses): Add 'bool = true'
20751 * tree-pretty-print.c (dump_omp_clauses): Update.
20752 (dump_generic_node) <OMP_CLAUSE>: Use it.
20754 2021-06-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
20757 * config/arm/arm_mve.h (__arm_vld1q): Change __ARM_mve_coerce(p0,
20758 int8_t const *) to __ARM_mve_coerce1(p0, int8_t *) in the argument for
20759 the polymorphic variants matching code.
20760 (__arm_vld1q_z): Likewise.
20761 (__arm_vld2q): Likewise.
20762 (__arm_vld4q): Likewise.
20763 (__arm_vldrbq_gather_offset): Likewise.
20764 (__arm_vldrbq_gather_offset_z): Likewise.
20766 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
20768 PR tree-optimization/96392
20769 * fold-const.h (tree_expr_maybe_real_minus_zero_p): Fix prototype.
20771 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
20773 PR tree-optimization/96392
20774 * fold-const.c (fold_real_zero_addition_p): Take both arguments
20775 of the addition or subtraction, not just the zero. Use this
20776 other argument in tests for signaling NaNs and signed zeros.
20777 (tree_expr_maybe_real_minus_zero_p): New predicate.
20778 * fold-const.h (fold_real_zero_addition_p): Update prototype.
20779 (tree_expr_maybe_real_minus_zero_p): New function prototype.
20780 * match.pd: Update calls to fold_real_zero_addition_p.
20781 Replace HONOR_NANS with tree_expr_maybe_nan_p.
20782 Replace HONOR_SIGNED_ZEROS with tree_expr_maybe_real_minus_zero_p.
20783 Replace HONOR_SNANS with tree_expr_maybe_signaling_nan_p.
20784 * tree-ssa-reassoc.c (eliminate_using_constants): Update
20785 call to fold_real_zero_addition_p.
20787 2021-06-11 Richard Biener <rguenther@suse.de>
20789 PR tree-optimization/101025
20790 * tree-ssa-loop-im.c (sm_seq_valid_bb): Make sure to process
20791 all refs that require dependence checking.
20793 2021-06-11 Richard Biener <rguenther@suse.de>
20795 PR tree-optimization/101028
20796 * tree-vect-slp.c (vect_build_slp_tree_2): When SLP
20797 reassoc discovery fails fatally, mark appropriate lanes
20800 2021-06-11 Richard Biener <rguenther@suse.de>
20802 PR tree-optimization/101026
20803 * tree-vect-slp.c (vect_build_slp_tree_2): Make sure we
20804 have a representative for the associated chain nodes.
20806 2021-06-11 Jakub Jelinek <jakub@redhat.com>
20808 PR rtl-optimization/101008
20809 * simplify-rtx.c (relational_result): New function.
20810 (simplify_logical_relational_operation,
20811 simplify_relational_operation): Use it.
20813 2021-06-11 Jakub Jelinek <jakub@redhat.com>
20816 * config/i386/sse.md (*vec_concat<mode>_0_1): Require TARGET_SSE2.
20818 2021-06-11 Uroš Bizjak <ubizjak@gmail.com>
20821 * config/i386/i386-expand.c (expand_vec_perm_pshufb): Return
20822 false if the permutation can be implemented with constant
20823 permutation instruction in wider mode.
20824 (canonicalize_vector_int_perm): Move above expand_vec_perm_pshufb.
20825 Handle V8QImode and V4HImode.
20827 2021-06-11 Martin Liska <mliska@suse.cz>
20829 PR gcov-profile/100788
20830 * common.opt: Add new option.
20831 * coverage.c (coverage_begin_function): Emit warning instead on
20832 the internal compiler error.
20833 * doc/invoke.texi: Document the option.
20834 * toplev.c (process_options): Enable it by default.
20836 2021-06-11 Richard Biener <rguenther@suse.de>
20838 PR middle-end/101009
20839 * tree-data-ref.c (build_classic_dist_vector_1): Make sure
20840 to set *init_b to true when we encounter a constant equal
20842 (compute_affine_dependence): Also dump the actual DR_REF.
20844 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
20846 PR tree-optimization/100984
20847 * gimple-ssa-evrp.c (ssa_equiv_stack): Use auto_vec for
20848 replacements table.
20849 (ssa_equiv_stack::~ssa_equiv_stack): Remove.
20851 2021-06-11 Kewen Lin <linkw@linux.ibm.com>
20853 * config/rs6000/rs6000.md
20854 (floatsi<SFDF:mode>2_lfiwax_<QHI:mode>_mem_zext): New
20855 define_insn_and_split.
20857 2021-06-11 Richard Biener <rguenther@suse.de>
20859 * tree-vect-slp.c (vect_build_slp_tree_2): Use stablesort
20860 to sort operands of the associative chain.
20862 2021-06-11 Richard Biener <rguenther@suse.de>
20864 * system.h (gcc_stablesort_r): Declare.
20865 * sort.cc (gcc_sort_r): Support stable sort.
20866 (gcc_stablesort_r): Define.
20867 * vec.h (vec<>::stablesort): Add.
20869 2021-06-10 Uroš Bizjak <ubizjak@gmail.com>
20872 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
20873 Handle V2SF mode. Emit SHUFPS to fixup unpack-high for V2SF mode.
20874 (expand_vec_perm_blend): Handle 64bit modes for TARGET_SSE4_1.
20875 (expand_vec_perm_pshufb): Handle 64bit modes for TARGET_SSSE3.
20876 (expand_vec_perm_pblendv): Handle 64bit modes for TARGET_SSE4_1.
20877 (expand_vec_perm_interleave2): Handle 64bit modes.
20878 (expand_vec_perm_even_odd_pack): Handle V8QI mode.
20879 (expand_vec_perm_even_odd_1): Ditto.
20880 (ix86_vectorize_vec_perm_const): Ditto.
20881 * config/i386/i386.md (UNSPEC_PSHUFB): Move from ...
20882 * config/i386/sse.md: ... here.
20883 * config/i386/mmx.md (*vec_interleave_lowv2sf):
20884 New insn_and_split pattern.
20885 (*vec_interleave_highv2sf): Ditto.
20886 (mmx_pshufbv8qi3): New insn pattern.
20887 (*mmx_pblendw): Ditto.
20889 2021-06-10 Peter Bergner <bergner@linux.ibm.com>
20891 * config/rs6000/rs6000-builtin.def (build_pair): New built-in.
20892 (build_acc): Likewise.
20893 * config/rs6000/rs6000-call.c (mma_expand_builtin): Swap assemble
20894 source operands in little-endian mode.
20895 (rs6000_gimple_fold_mma_builtin): Handle VSX_BUILTIN_BUILD_PAIR.
20896 (mma_init_builtins): Likewise.
20897 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Handle endianness
20898 ordering for the MMA assemble and build source operands.
20899 * doc/extend.texi (__builtin_vsx_build_acc, __builtin_mma_build_pair):
20901 (__builtin_mma_assemble_acc, __builtin_mma_assemble_pair): Remove
20904 2021-06-10 Jeff Law <jeffreyalaw@gmail.com>
20906 * config/h8300/h8300.c (select_cc_mode): Handle MEM. Use
20908 * config/h8300/extensions.md: Replace _clobber_flags patterns
20911 2021-06-10 Robin Dapp <rdapp@linux.ibm.com>
20913 * config/s390/vector.md (vcond_mask_<mode><mode>): Change to
20914 (vcond_mask_<mode><tointvec>): this.
20916 2021-06-10 Andrew Stubbs <ams@codesourcery.com>
20917 Thomas Schwinge <thomas@codesourcery.com>
20919 * omp-builtins.def (BUILT_IN_GOACC_ENTER_EXIT_DATA): Split into...
20920 (BUILT_IN_GOACC_ENTER_DATA, BUILT_IN_GOACC_EXIT_DATA): ... these.
20921 * gimple.h (enum gf_mask): Split
20922 'GF_OMP_TARGET_KIND_OACC_ENTER_EXIT_DATA' into
20923 'GF_OMP_TARGET_KIND_OACC_ENTER_DATA' and
20924 'GF_OMP_TARGET_KIND_OACC_EXIT_DATA'.
20925 (is_gimple_omp_oacc): Update.
20926 * gimple-pretty-print.c (dump_gimple_omp_target): Likewise.
20927 * gimplify.c (gimplify_omp_target_update): Likewise.
20928 * omp-expand.c (expand_omp_target, build_omp_regions_1)
20929 (omp_make_gimple_edges): Likewise.
20930 * omp-low.c (check_omp_nesting_restrictions, lower_omp_target):
20933 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
20935 * value-query.cc (value_query::value_on_edge): Rename name to
20937 (range_query::range_on_edge): Same.
20938 (range_query::value_of_expr): Same.
20939 (range_query::value_on_edge): Same.
20940 * value-query.h (class value_query): Same.
20941 (class range_query): Same.
20943 2021-06-10 Richard Biener <rguenther@suse.de>
20945 PR tree-optimization/101003
20946 * tree-vect-slp.c (vect_build_slp_tree_2): Appropriately
20947 use the pattern stmt defs when linearizing a chain.
20949 2021-06-10 Jakub Jelinek <jakub@redhat.com>
20952 * ifcvt.c (noce_get_alt_condition, noce_try_abs): Use
20953 prev_nonnote_nondebug_insn instead of prev_nonnote_insn.
20955 2021-06-10 Clement Chigot <clement.chigot@atos.net>
20957 * config/rs6000/aix71.h (ASM_CPU_SPEC): Add Power10 directive.
20958 * config/rs6000/aix72.h (ASM_CPU_SPEC): Likewise.
20960 2021-06-09 Andrew Pinski <apinski@marvell.com>
20962 PR tree-optimization/100925
20963 * match.pd (a ? CST1 : CST2): Limit transformations
20964 that would produce a negative to integeral types only.
20965 Change !POINTER_TYPE_P to INTEGRAL_TYPE_P also.
20967 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
20970 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
20972 * doc/tm.texi: Correctly update.
20974 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
20976 * doc/tm.texi: Correctly update.
20978 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
20981 * doc/tm.texi.in (Trampolines): Add a missing blank line.
20983 2021-06-09 Paul Eggert <eggert@cs.ucla.edu>
20986 * doc/invoke.texi (Code Gen Options); Document that -fno-trampolines
20987 and -ftrampolines work only with Ada.
20988 * doc/tm.texi.in (Trampolines): Likewise.
20989 * doc/tm.texi: Regenerated.
20991 2021-06-09 Carl Love <cel@us.ibm.com>
20993 * config/rs6000/altivec.h (vec_signextll, vec_signexti, vec_signextq):
20994 Add define for new builtins.
20995 * config/rs6000/altivec.md(altivec_vreveti2): Add define_expand.
20996 * config/rs6000/rs6000-builtin.def (VSIGNEXTI, VSIGNEXTLL): Add
20997 overloaded builtin definitions.
20998 (VSIGNEXTSB2W, VSIGNEXTSH2W, VSIGNEXTSB2D, VSIGNEXTSH2D,VSIGNEXTSW2D,
20999 VSIGNEXTSD2Q): Add builtin expansions.
21000 (SIGNEXT): Add P10 overload definition.
21001 * config/rs6000/rs6000-call.c (P9V_BUILTIN_VEC_VSIGNEXTI, P9V_BUILTIN_VEC_VSIGNEXTLL,
21002 P10_BUILTIN_VEC_SIGNEXT): Add overloaded argument definitions.
21003 * config/rs6000/vsx.md (vsx_sign_extend_v2di_v1ti): Add define_insn.
21004 (vsignextend_v2di_v1ti, vsignextend_qi_<mode>, vsignextend_hi_<mode>,
21005 vsignextend_si_v2di)[VIlong]: Add define_expand.
21006 Make define_insn vsx_sign_extend_si_v2di visible.
21007 * doc/extend.texi: Add documentation for the vec_signexti,
21008 vec_signextll builtins and vec_signextq.
21010 2021-06-09 Carl Love <cel@us.ibm.com>
21012 * config/rs6000/rs6000.c (__fixkfti, __fixunskfti, __floattikf,
21013 __floatuntikf): Names changed to __fixkfti_sw, __fixunskfti_sw,
21014 __floattikf_sw, __floatuntikf_sw respectively.
21015 * config/rs6000/rs6000.md (floatti<mode>2, floatunsti<mode>2,
21016 fix_trunc<mode>ti2, fixuns_trunc<mode>ti2): Add
21017 define_insn for mode IEEE 128.
21019 2021-06-09 Carl Love <cel@us.ibm.com>
21021 * config/rs6000/altivec.md (altivec_vslq, altivec_vsrq):
21022 Rename to altivec_vslq_<mode>, altivec_vsrq_<mode>, mode VEC_TI.
21023 * config/rs6000/vector.md (VEC_TI): Was named VSX_TI in vsx.md.
21024 (vashlv1ti3): Change to vashl<mode>3, mode VEC_TI.
21025 (vlshrv1ti3): Change to vlshr<mode>3, mode VEC_TI.
21026 * config/rs6000/vsx.md (VSX_TI): Remove define_mode_iterator. Update
21027 uses of VSX_TI to VEC_TI.
21029 2021-06-09 Carl Love <cel@us.ibm.com>
21031 * config/rs6000/dfp.md (floattitd2, fixtdti2): New define_insns.
21033 2021-06-09 Carl Love <cel@us.ibm.com>
21035 * config/rs6000/altivec.h (vec_dive, vec_mod): Add define for new
21037 * config/rs6000/altivec.md (UNSPEC_VMULEUD, UNSPEC_VMULESD,
21038 UNSPEC_VMULOUD, UNSPEC_VMULOSD): New unspecs.
21039 (altivec_eqv1ti, altivec_gtv1ti, altivec_gtuv1ti, altivec_vmuleud,
21040 altivec_vmuloud, altivec_vmulesd, altivec_vmulosd, altivec_vrlq,
21041 altivec_vrlqmi, altivec_vrlqmi_inst, altivec_vrlqnm,
21042 altivec_vrlqnm_inst, altivec_vslq, altivec_vsrq, altivec_vsraq,
21043 altivec_vcmpequt_p, altivec_vcmpgtst_p, altivec_vcmpgtut_p): New
21045 (vec_widen_umult_even_v2di, vec_widen_smult_even_v2di,
21046 vec_widen_umult_odd_v2di, vec_widen_smult_odd_v2di, altivec_vrlqmi,
21047 altivec_vrlqnm): New define_expands.
21048 * config/rs6000/rs6000-builtin.def (VCMPEQUT_P, VCMPGTST_P,
21049 VCMPGTUT_P): Add macro expansions.
21050 (BU_P10V_AV_P): Add builtin predicate definition.
21051 (VCMPGTUT, VCMPGTST, VCMPEQUT, CMPNET, CMPGE_1TI,
21052 CMPGE_U1TI, CMPLE_1TI, CMPLE_U1TI, VNOR_V1TI_UNS, VNOR_V1TI, VCMPNET_P,
21053 VCMPAET_P, VMULEUD, VMULESD, VMULOUD, VMULOSD, VRLQ,
21054 VSLQ, VSRQ, VSRAQ, VRLQNM, DIV_V1TI, UDIV_V1TI, DIVES_V1TI, DIVEU_V1TI,
21055 MODS_V1TI, MODU_V1TI, VRLQMI): New macro expansions.
21056 (VRLQ, VSLQ, VSRQ, VSRAQ, DIVE, MOD): New overload expansions.
21057 * config/rs6000/rs6000-call.c (P10_BUILTIN_VCMPEQUT,
21058 P10V_BUILTIN_CMPGE_1TI, P10V_BUILTIN_CMPGE_U1TI,
21059 P10V_BUILTIN_VCMPGTUT, P10V_BUILTIN_VCMPGTST,
21060 P10V_BUILTIN_CMPLE_1TI, P10V_BUILTIN_VCMPLE_U1TI,
21061 P10V_BUILTIN_DIV_V1TI, P10V_BUILTIN_UDIV_V1TI,
21062 P10V_BUILTIN_VMULESD, P10V_BUILTIN_VMULEUD,
21063 P10V_BUILTIN_VMULOSD, P10V_BUILTIN_VMULOUD,
21064 P10V_BUILTIN_VNOR_V1TI, P10V_BUILTIN_VNOR_V1TI_UNS,
21065 P10V_BUILTIN_VRLQ, P10V_BUILTIN_VRLQMI,
21066 P10V_BUILTIN_VRLQNM, P10V_BUILTIN_VSLQ,
21067 P10V_BUILTIN_VSRQ, P10V_BUILTIN_VSRAQ,
21068 P10V_BUILTIN_VCMPGTUT_P, P10V_BUILTIN_VCMPGTST_P,
21069 P10V_BUILTIN_VCMPEQUT_P, P10V_BUILTIN_VCMPGTUT_P,
21070 P10V_BUILTIN_VCMPGTST_P, P10V_BUILTIN_CMPNET,
21071 P10V_BUILTIN_VCMPNET_P, P10V_BUILTIN_VCMPAET_P,
21072 P10V_BUILTIN_DIVES_V1TI, P10V_BUILTIN_MODS_V1TI,
21073 P10V_BUILTIN_MODU_V1TI):
21074 New overloaded definitions.
21075 (rs6000_gimple_fold_builtin) [P10V_BUILTIN_VCMPEQUT,
21076 P10V_BUILTIN_CMPNET, P10V_BUILTIN_CMPGE_1TI,
21077 P10V_BUILTIN_CMPGE_U1TI, P10V_BUILTIN_VCMPGTUT,
21078 P10V_BUILTIN_VCMPGTST, P10V_BUILTIN_CMPLE_1TI,
21079 P10V_BUILTIN_CMPLE_U1TI]: New case statements.
21080 (rs6000_init_builtins) [bool_V1TI_type_node, int_ftype_int_v1ti_v1ti]:
21082 (altivec_init_builtins): New E_V1TImode case statement.
21083 (builtin_function_type)[P10_BUILTIN_128BIT_VMULEUD,
21084 P10_BUILTIN_128BIT_VMULOUD, P10_BUILTIN_128BIT_DIVEU_V1TI,
21085 P10_BUILTIN_128BIT_MODU_V1TI, P10_BUILTIN_CMPGE_U1TI,
21086 P10_BUILTIN_VCMPGTUT, P10_BUILTIN_VCMPEQUT]: New case statements.
21087 * config/rs6000/rs6000.c (rs6000_handle_altivec_attribute) [E_TImode,
21088 E_V1TImode]: New case statements.
21089 * config/rs6000/rs6000.h (rs6000_builtin_type_index): New enum
21090 value RS6000_BTI_bool_V1TI.
21091 * config/rs6000/vector.md (vector_gtv1ti,vector_nltv1ti,
21092 vector_gtuv1ti, vector_nltuv1ti, vector_ngtv1ti, vector_ngtuv1ti,
21093 vector_eq_v1ti_p, vector_ne_v1ti_p, vector_ae_v1ti_p,
21094 vector_gt_v1ti_p, vector_gtu_v1ti_p, vrotlv1ti3, vashlv1ti3,
21095 vlshrv1ti3, vashrv1ti3): New define_expands.
21096 * config/rs6000/vsx.md (UNSPEC_VSX_DIVSQ, UNSPEC_VSX_DIVUQ,
21097 UNSPEC_VSX_DIVESQ, UNSPEC_VSX_DIVEUQ, UNSPEC_VSX_MODSQ,
21098 UNSPEC_VSX_MODUQ): New unspecs.
21099 (mulv2di3, vsx_div_v1ti, vsx_udiv_v1ti, vsx_dives_v1ti,
21100 vsx_diveu_v1ti, vsx_mods_v1ti, vsx_modu_v1ti, xxswapd_v1ti): New
21102 (vcmpnet): New define_expand.
21103 * doc/extend.texi: Add documentation for the new builtins vec_rl,
21104 vec_rlmi, vec_rlnm, vec_sl, vec_sr, vec_sra, vec_mule, vec_mulo,
21105 vec_div, vec_dive, vec_mod, vec_cmpeq, vec_cmpne, vec_cmpgt, vec_cmplt,
21106 vec_cmpge, vec_cmple, vec_all_eq, vec_all_ne, vec_all_gt, vec_all_lt,
21107 vec_all_ge, vec_all_le, vec_any_eq, vec_any_ne, vec_any_gt, vec_any_lt,
21108 vec_any_ge, vec_any_le.
21110 2021-06-09 Carl Love <cel@us.ibm.com>
21112 * config/rs6000/altivec.md (altivec_vrl<VI_char>mi): Fix
21113 bug in argument generation.
21115 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
21117 * config/arm/iterators.md (<supf>): Remove VCLZQ_U, VCLZQ_S.
21119 * config/arm/mve.md (mve_vclzq_<supf><mode>): Add '@' prefix,
21120 remove <supf> iterator.
21121 (mve_vclzq_u<mode>): New.
21122 * config/arm/neon.md (clz<mode>2): Rename to neon_vclz<mode>.
21123 (neon_vclz<mode): Move to ...
21124 * config/arm/unspecs.md (VCLZQ_U, VCLZQ_S): Remove.
21125 * config/arm/vec-common.md: ... here. Add support for MVE.
21127 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
21129 * config/arm/mve.md (mve_vhaddq_<supf><mode>): Prefix with '@'.
21130 (@mve_vrhaddq_<supf><mode): Likewise.
21131 * config/arm/neon.md (neon_v<r>hadd<sup><mode>): Likewise.
21132 * config/arm/vec-common.md (avg<mode>3_floor, uavg<mode>3_floor)
21133 (avg<mode>3_ceil", uavg<mode>3_ceil): New patterns.
21135 2021-06-09 imba-tjd <109224573@qq.com>
21137 * doc/invoke.texi: Fix typo.
21139 2021-06-09 Roger Sayle <roger@nextmovesoftware.com>
21141 PR middle-end/53267
21142 * fold-const-call.c (fold_const_call_sss) [CASE_CFN_FMOD]:
21143 Support evaluation of fmod/fmodf/fmodl at compile-time.
21145 2021-06-09 Richard Biener <rguenther@suse.de>
21147 PR tree-optimization/100981
21148 * tree-vect-loop.c (vect_create_epilog_for_reduction): Use
21149 gimple_get_lhs to also handle calls.
21150 * tree-vect-slp-patterns.c (complex_pattern::build): Transfer
21153 2021-06-09 Richard Biener <rguenther@suse.de>
21155 PR tree-optimization/97832
21156 * tree-vectorizer.h (_slp_tree::failed): New.
21157 * tree-vect-slp.c (_slp_tree::_slp_tree): Initialize
21159 (_slp_tree::~_slp_tree): Free failed.
21160 (vect_build_slp_tree): Retain failed nodes and record
21161 matches in them, copying that back out when running
21162 into a cached fail. Dump start and end of discovery.
21163 (dt_sort_cmp): New.
21164 (vect_build_slp_tree_2): Handle associatable chains
21165 together doing more aggressive operand swapping.
21167 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
21170 * config.gcc (gcc_cv_initfini_array): Set to yes for Linux and
21172 * doc/install.texi: Require glibc 2.1 and binutils 2.12 for
21173 Linux and GNU targets.
21175 2021-06-09 Richard Biener <rguenther@suse.de>
21177 * tree-vect-stmts.c (vect_is_simple_use): Always get dt
21180 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21182 * config/arc/arc.md (loop_end): Change it to
21183 define_insn_and_split.
21185 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21187 * config/arc/arc.md (maddhisi4): Use VMAC2H instruction.
21188 (machi): New pattern.
21189 (umaddhisi4): Use VMAC2HU instruction.
21190 (umachi): New pattern.
21192 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21194 * config/arc/arc-protos.h (arc_split_move_p): New prototype.
21195 * config/arc/arc.c (arc_split_move_p): New function.
21196 (arc_split_move): Clean up.
21197 * config/arc/arc.md (movdi_insn): Clean up, use arc_split_move_p.
21198 (movdf_insn): Likewise.
21199 * config/arc/simdext.md (mov<VWH>_insn): Likewise.
21201 2021-06-09 Uroš Bizjak <ubizjak@gmail.com>
21204 * config/i386/i386.c (print_operand_address_as): Rename "no_rip"
21205 argument to "raw". Do not emit segment overrides when "raw" is true.
21207 2021-06-09 Martin Liska <mliska@suse.cz>
21209 * doc/gcov.texi: Create a proper JSON files.
21210 * doc/invoke.texi: Remove dots in order to make it a valid
21213 2021-06-09 Xionghu Luo <luoxhu@linux.ibm.com>
21215 * config/rs6000/rs6000-p8swap.c (pattern_is_rotate64): New.
21216 (insn_is_load_p): Use pattern_is_rotate64.
21217 (insn_is_swap_p): Likewise.
21218 (quad_aligned_load_p): Likewise.
21219 (const_load_sequence_p): Likewise.
21220 (replace_swapped_aligned_load): Likewise.
21221 (recombine_lvx_pattern): Likewise.
21222 (recombine_stvx_pattern): Likewise.
21224 2021-06-09 Andrew MacLeod <amacleod@redhat.com>
21226 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Use a
21227 fur_stmt source record.
21228 * gimple-range.cc (fur_source::get_operand): Generic range query.
21229 (fur_source::get_phi_operand): New.
21230 (fur_source::register_dependency): New.
21231 (fur_source::query): New.
21232 (class fur_edge): New. Edge source for operands.
21233 (fur_edge::fur_edge): New.
21234 (fur_edge::get_operand): New.
21235 (fur_edge::get_phi_operand): New.
21236 (fur_edge::query): New.
21237 (fur_stmt::fur_stmt): New.
21238 (fur_stmt::get_operand): New.
21239 (fur_stmt::get_phi_operand): New.
21240 (fur_stmt::query): New.
21241 (class fur_depend): New. Statement source and process dependencies.
21242 (fur_depend::fur_depend): New.
21243 (fur_depend::register_dependency): New.
21244 (class fur_list): New. List source for operands.
21245 (fur_list::fur_list): New.
21246 (fur_list::get_operand): New.
21247 (fur_list::get_phi_operand): New.
21248 (fold_range): New. Instantiate appropriate fur_source class and fold.
21249 (fold_using_range::range_of_range_op): Use new API.
21250 (fold_using_range::range_of_address): Ditto.
21251 (fold_using_range::range_of_phi): Ditto.
21252 (imple_ranger::fold_range_internal): Use fur_depend class.
21253 (fold_using_range::range_of_ssa_name_with_loop_info): Use new API.
21254 * gimple-range.h (class fur_source): Now a base class.
21255 (class fur_stmt): New.
21256 (fold_range): New prototypes.
21257 (fur_source::fur_source): Delete.
21259 2021-06-08 Andrew Pinski <apinski@marvell.com>
21261 PR tree-optimization/25290
21262 * tree-ssa-phiopt.c (xor_replacement): Delete.
21263 (tree_ssa_phiopt_worker): Delete use of xor_replacement.
21264 (match_simplify_replacement): Allow one cheap preparation
21265 statement that can be moved to before the if.
21267 2021-06-08 Pat Haugen <pthaugen@linux.ibm.com>
21269 * config/rs6000/power10.md (power10-fused-load, power10-fused-store,
21270 power10-fused_alu, power10-fused-vec, power10-fused-branch): New.
21272 2021-06-08 Jeff Law <jeffreyalaw@gmail.com>
21274 * config/h8300/logical.md (andqi3_1): Move BCLR case into define_insn_and_split.
21275 Create length attribute on define_insn_and_split. Only split for cases which we
21277 (andqi3_1<cczn>): Renamed from andqi3_1_clobber_flags. Only handle AND here and
21278 fix length computation.
21279 (b<code><mode>msx): Combine QImode and HImode H8/SX patterns using iterator.
21281 2021-06-08 Richard Biener <rguenther@suse.de>
21283 PR tree-optimization/100923
21284 * tree-ssa-sccvn.c (valueize_refs_1): Take a pointer to
21285 the operand vector to be valueized.
21286 (valueize_refs): Likewise.
21287 (valueize_shared_reference_ops_from_ref): Adjust.
21288 (valueize_shared_reference_ops_from_call): Likewise.
21289 (vn_reference_lookup_3): Likewise.
21290 (vn_reference_lookup_pieces): Likewise. Re-valueize
21291 with honoring availability when we are about to create
21292 the ao_ref and valueized before.
21293 (vn_reference_lookup): Likewise.
21294 (vn_reference_insert_pieces): Adjust.
21296 2021-06-08 Richard Biener <rguenther@suse.de>
21298 * tree-vectorizer.h (_slp_instance::root_stmt): Change to...
21299 (_slp_instance::root_stmts): ... a vector.
21300 (SLP_INSTANCE_ROOT_STMT): Rename to ...
21301 (SLP_INSTANCE_ROOT_STMTS): ... this.
21302 (slp_root::root): Change to...
21303 (slp_root::roots): ... a vector.
21304 (slp_root::slp_root): Adjust.
21305 * tree-vect-slp.c (_slp_instance::location): Adjust.
21306 (vect_free_slp_instance): Release the root stmt vector.
21307 (vect_build_slp_instance): Adjust.
21308 (vect_analyze_slp): Likewise.
21309 (_bb_vec_info::~_bb_vec_info): Likewise.
21310 (vect_slp_analyze_operations): Likewise.
21311 (vect_bb_vectorization_profitable_p): Likewise. Adjust
21312 costs for the root stmt.
21313 (vect_slp_check_for_constructors): Gather all BIT_INSERT_EXPRs
21315 (vect_slp_analyze_bb_1): Simplify by marking all root stmts
21317 (vectorize_slp_instance_root_stmt): Adjust.
21318 (vect_schedule_slp): Likewise.
21320 2021-06-08 Aldy Hernandez <aldyh@redhat.com>
21322 * gimple-ssa-evrp.c (class ssa_equiv_stack): New.
21323 (ssa_equiv_stack::ssa_equiv_stack): New.
21324 (ssa_equiv_stack::~ssa_equiv_stack): New.
21325 (ssa_equiv_stack::enter): New.
21326 (ssa_equiv_stack::leave): New.
21327 (ssa_equiv_stack::push_replacement): New.
21328 (ssa_equiv_stack::get_replacement): New.
21329 (is_pointer_ssa): New.
21330 (class pointer_equiv_analyzer): New.
21331 (pointer_equiv_analyzer::pointer_equiv_analyzer): New.
21332 (pointer_equiv_analyzer::~pointer_equiv_analyzer): New.
21333 (pointer_equiv_analyzer::set_global_equiv): New.
21334 (pointer_equiv_analyzer::set_cond_equiv): New.
21335 (pointer_equiv_analyzer::get_equiv): New.
21336 (pointer_equiv_analyzer::enter): New.
21337 (pointer_equiv_analyzer::leave): New.
21338 (pointer_equiv_analyzer::get_equiv_expr): New.
21339 (pta_valueize): New.
21340 (pointer_equiv_analyzer::visit_stmt): New.
21341 (pointer_equiv_analyzer::visit_edge): New.
21342 (hybrid_folder::value_of_expr): Call PTA.
21343 (hybrid_folder::value_on_edge): Same.
21344 (hybrid_folder::pre_fold_bb): New.
21345 (hybrid_folder::post_fold_bb): New.
21346 (hybrid_folder::pre_fold_stmt): New.
21347 (rvrp_folder::pre_fold_bb): New.
21348 (rvrp_folder::post_fold_bb): New.
21349 (rvrp_folder::pre_fold_stmt): New.
21350 (rvrp_folder::value_of_expr): Call PTA.
21351 (rvrp_folder::value_on_edge): Same.
21353 2021-06-08 Jakub Jelinek <jakub@redhat.com>
21356 * tree-inline.c (copy_tree_body_r): For OMP_CLAUSE_DEPEND don't
21357 check TREE_CODE if OMP_CLAUSE_DECL is NULL.
21359 2021-06-08 Richard Biener <rguenther@suse.de>
21361 PR middle-end/100951
21362 * tree-vect-generic.c (expand_vector_piecewise): Build a
21363 VECTOR_CST if all elements are constant.
21364 (expand_vector_condition): Likewise.
21365 (lower_vec_perm): Likewise.
21366 (expand_vector_conversion): Likewise.
21368 2021-06-08 Martin Liska <mliska@suse.cz>
21370 * doc/invoke.texi: Document new param evrp-sparse-threshold.
21372 2021-06-08 Martin Liska <mliska@suse.cz>
21374 * genautomata.c (create_automata): Fix typo.
21376 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
21378 PR tree-optimization/100794
21379 * tree-predcom.c (tree_predictive_commoning_loop): Add parameter
21380 allow_unroll_p and only allow unrolling when it's true.
21381 (tree_predictive_commoning): Add parameter allow_unroll_p and
21383 (run_tree_predictive_commoning): Likewise.
21384 (pass_predcom::gate): Check flag_tree_loop_vectorize and
21385 global_options_set.x_flag_predictive_commoning.
21386 (pass_predcom::execute): Adjust for allow_unroll_p.
21388 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
21390 * tree-predcom.c (execute_pred_commoning): Remove update_ssa call.
21391 (tree_predictive_commoning_loop): Factor some cleanup stuffs into
21392 lambda function cleanup, remove scev_reset call, and adjust return
21394 (tree_predictive_commoning): Adjust for different changed values,
21395 only set flag TODO_update_ssa_only_virtuals if changed.
21396 (pass_data pass_data_predcom): Remove TODO_update_ssa_only_virtuals
21397 from todo_flags_finish.
21399 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
21401 * gimple-range-cache.cc (class sbr_sparse_bitmap): New.
21402 (sbr_sparse_bitmap::sbr_sparse_bitmap): New.
21403 (sbr_sparse_bitmap::bitmap_set_quad): New.
21404 (sbr_sparse_bitmap::bitmap_get_quad): New.
21405 (sbr_sparse_bitmap::set_bb_range): New.
21406 (sbr_sparse_bitmap::get_bb_range): New.
21407 (sbr_sparse_bitmap::bb_range_p): New.
21408 (block_range_cache::block_range_cache): initialize bitmap obstack.
21409 (block_range_cache::~block_range_cache): Destruct obstack.
21410 (block_range_cache::set_bb_range): Decide when to utilze the
21411 sparse on entry cache.
21412 * gimple-range-cache.h (block_range_cache): Add bitmap obstack.
21413 * params.opt (-param=evrp-sparse-threshold): New.
21415 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
21417 * bitmap.c (bitmap_set_aligned_chunk): New.
21418 (bitmap_get_aligned_chunk): New.
21419 (test_aligned_chunk): New.
21420 (bitmap_c_tests): Call test_aligned_chunk.
21421 * bitmap.h (bitmap_set_aligned_chunk, bitmap_get_aligned_chunk): New.
21423 2021-06-07 Uroš Bizjak <ubizjak@gmail.com>
21426 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
21428 (ix86_expand_vector_init_one_nonzero): Ditto.
21429 (ix86_expand_vector_init_one_var): Ditto.
21430 (ix86_expand_vector_init_general): Ditto.
21431 * config/i386/mmx.md (vec_initv4qiqi): New expander.
21433 2021-06-07 Jeff Law <jeffreyalaw@gmail.com>
21435 * config/h8300/movepush.md: Change most _clobber_flags
21436 patterns to instead use <cczn> subst.
21437 (movsi_cczn): New pattern with usable CC cases split out.
21438 (movsi_h8sx_cczn): Likewise.
21440 2021-06-07 Martin Liska <mliska@suse.cz>
21442 * common/common-target.def: Split long lines and replace them
21444 * target.def: Likewise.
21445 * doc/tm.texi: Re-generated.
21447 2021-06-07 Jakub Jelinek <jakub@redhat.com>
21450 * fold-const.c (fold_read_from_vector): Return NULL if trying to
21451 read from a CONSTRUCTOR with vector type elements.
21453 2021-06-07 Jakub Jelinek <jakub@redhat.com>
21455 PR middle-end/100898
21456 * tree-inline.c (copy_bb): Only use gimple_call_arg_ptr if memcpy
21457 should copy any arguments. Don't call gimple_call_num_args
21458 on id->call_stmt or call_stmt more than once.
21460 2021-06-07 liuhongt <hongtao.liu@intel.com>
21463 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): Refine
21465 (<insn>v4siv4di2): Delete constraints for define_expand.
21467 2021-06-07 liuhongt <hongtao.liu@intel.com>
21470 * config/i386/i386-expand.c (ix86_expand_builtin): Remove
21471 assignment of cfun->machine->has_explicit_vzeroupper.
21472 * config/i386/i386-features.c
21473 (ix86_add_reg_usage_to_vzerouppers): Delete.
21474 (ix86_add_reg_usage_to_vzeroupper): Ditto.
21475 (rest_of_handle_insert_vzeroupper): Remove
21476 ix86_add_reg_usage_to_vzerouppers, add df_analyze at the end
21478 (gate): Remove cfun->machine->has_explicit_vzeroupper.
21479 * config/i386/i386-protos.h (ix86_expand_avx_vzeroupper):
21481 * config/i386/i386.c (ix86_insn_callee_abi): New function.
21482 (ix86_initialize_callee_abi): Ditto.
21483 (ix86_expand_avx_vzeroupper): Ditto.
21484 (ix86_hard_regno_call_part_clobbered): Adjust for vzeroupper
21486 (TARGET_INSN_CALLEE_ABI): Define as ix86_insn_callee_abi.
21487 (ix86_emit_mode_set): Call ix86_expand_avx_vzeroupper
21489 * config/i386/i386.h (struct GTY(()) machine_function): Delete
21490 has_explicit_vzeroupper.
21491 * config/i386/i386.md (enum unspec): New member
21493 (ABI_DEFAULT,ABI_VZEROUPPER,ABI_UNKNOWN): New
21494 define_constants for insn callee abi index.
21495 * config/i386/predicates.md (vzeroupper_pattern): Adjust.
21496 * config/i386/sse.md (UNSPECV_VZEROUPPER): Deleted.
21497 (avx_vzeroupper): Call ix86_expand_avx_vzeroupper.
21498 (*avx_vzeroupper): Rename to ..
21499 (avx_vzeroupper_callee_abi): .. this, and adjust pattern as
21500 call_insn which has a special vzeroupper ABI.
21501 (*avx_vzeroupper_1): Deleted.
21503 2021-06-07 liuhongt <hongtao.liu@intel.com>
21506 * df-scan.c (df_get_call_refs): When call_insn is a fake call,
21507 it won't use stack pointer reg.
21508 * final.c (leaf_function_p): When call_insn is a fake call, it
21509 won't affect caller as a leaf function.
21510 * reg-stack.c (callee_clobbers_any_stack_reg): New.
21511 (subst_stack_regs): When call_insn doesn't clobber any stack
21512 reg, don't clear the arguments.
21513 * rtl.c (shallow_copy_rtx): Don't clear flag used when orig is
21515 * shrink-wrap.c (requires_stack_frame_p): No need for stack
21516 frame for a fake call.
21517 * rtl.h (FAKE_CALL_P): New macro.
21519 2021-06-06 Eric Botcazou <ebotcazou@adacore.com>
21521 * config/sparc/sparc-protos.h (order_regs_for_local_alloc): Rename
21523 (sparc_order_regs_for_local_alloc): ...this.
21524 (sparc_leaf_reg_remap): Declare.
21525 * config/sparc/sparc.h (ADJUST_REG_ALLOC_ORDER): Adjust.
21526 (LEAF_REG_REMAP): Reimplement as call to sparc_leaf_reg_remap.
21527 * config/sparc/sparc.c (leaf_reg_remap): Delete.
21528 (order_regs_for_local_alloc): Rename to...
21529 (sparc_order_regs_for_local_alloc): ...this.
21530 (sparc_leaf_reg_remap): New function.
21531 (sparc_conditional_register_usage): Do not modify leaf_reg_remap.
21533 2021-06-06 David Edelsohn <dje.gcc@gmail.com>
21535 * config/rs6000/rs6000.c (rs6000_xcoff_asm_output_aligned_decl_common):
21536 Use assemble_name to output BSS section name.
21538 2021-06-06 Uroš Bizjak <ubizjak@gmail.com>
21540 * config/i386/constraints.md (Bs):
21541 Remove boolean operators from match_test RTX.
21544 (M): Use "mode" variable instead of GET_MODE (op) in match_test RTX.
21547 2021-06-06 Martin Liska <mliska@suse.cz>
21549 * doc/extend.texi: Add missing @headitem.
21550 * doc/invoke.texi: Likewise.
21551 * doc/objc.texi: Likewise.
21553 2021-06-06 Martin Liska <mliska@suse.cz>
21555 * genhooks.c (emit_findices): Remove unused function.
21556 (emit_documentation): Do not call emit_findices
21557 and do not search for @Fcode directives.
21559 2021-06-06 Martin Liska <mliska@suse.cz>
21561 * doc/invoke.texi: Remove extra character.
21563 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
21565 * config/sh/sh.md (doloop_end_split): Fix empty split condition.
21567 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
21569 * config/sparc/sparc.md (*snedi<W:mode>_zero_vis3,
21570 *neg_snedi<W:mode>_zero_subxc, *plus_snedi<W:mode>_zero,
21571 *plus_plus_snedi<W:mode>_zero, *minus_snedi<W:mode>_zero,
21572 *minus_minus_snedi<W:mode>_zero): Fix empty split condition.
21574 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
21576 * config/or1k/or1k.md (*movdi): Fix empty split condition.
21578 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
21580 * config/mips/mips.md (<anonymous>, bswapsi2, bswapdi2): Fix empty
21583 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
21585 * config/m68k/m68k.md (*zero_extend_inc, *zero_extend_dec,
21586 *zero_extendsidi2): Fix empty split condition.
21588 2021-06-05 Jeff Law <jeffreyalaw@gmail.com>
21590 * config/h8300/addsub.md: Fix split condition in define_insn_and_split
21592 * config/h8300/bitfield.md: Likewise.
21593 * config/h8300/combiner.md: Likewise.
21594 * config/h8300/divmod.md: Likewise.
21595 * config/h8300/extensions.md: Likewise.
21596 * config/h8300/jumpcall.md: Likewise.
21597 * config/h8300/movepush.md: Likewise.
21598 * config/h8300/multiply.md: Likewise.
21599 * config/h8300/other.md: Likewise.
21600 * config/h8300/shiftrotate.md: Likewise.
21601 * config/h8300/logical.md: Likewise. Fix split pattern to use
21602 code iterator that somehow slipped through.
21604 2021-06-04 Tobias Burnus <tobias@codesourcery.com>
21606 PR middle-end/100905
21607 * tree-nested.c (convert_nonlocal_omp_clauses,
21608 convert_local_omp_clauses): Handle OMP_CLAUSE_BIND.
21610 2021-06-04 Martin Sebor <msebor@redhat.com>
21612 PR middle-end/100732
21613 * gimple-fold.c (gimple_fold_builtin_sprintf): Avoid folding calls
21614 with either source or destination argument of invalid type.
21615 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Avoid checking
21616 calls with arguments of invalid type.
21618 2021-06-04 Martin Sebor <msebor@redhat.com>
21620 * attribs.c (init_attr_rdwr_indices): Use VLA bounds in the expected
21622 (attr_access::vla_bounds): Also handle VLA bounds.
21624 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
21626 * config/i386/predicates.md (GOT_memory_operand):
21627 Implement using match_code RTXes.
21628 (GOT32_symbol_operand): Ditto.
21630 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
21633 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
21635 (ix86_expand_vector_init_general): Ditto.
21636 Use SImode instead of word_mode for logic operations
21637 when GET_MODE_SIZE (mode) < UNITS_PER_WORD.
21638 (expand_vec_perm_even_odd_1): Assert that V2HI mode should be
21639 implemented by expand_vec_perm_1.
21640 (expand_vec_perm_broadcast_1): Assert that V2HI and V4HI modes
21641 should be implemented using standard shuffle patterns.
21642 (ix86_vectorize_vec_perm_const): Handle V2HImode. Add V4HI and
21643 V2HI modes to modes, implementable with shuffle for one operand.
21644 * config/i386/mmx.md (*punpckwd): New insn_and_split pattern.
21645 (*pshufw_1): New insn pattern.
21646 (*vec_dupv2hi): Ditto.
21647 (vec_initv2hihi): New expander.
21649 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
21651 * config/arm/vfp.md (no_literal_pool_df_immediate,
21652 no_literal_pool_sf_immediate): Fix empty split condition.
21654 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
21656 * config/i386/i386.md (*load_tp_x32_zext, *add_tp_x32_zext,
21657 *tls_dynamic_gnu2_combine_32): Fix empty split condition.
21658 * config/i386/sse.md (*<sse2_avx2>_pmovmskb_lt,
21659 *<sse2_avx2>_pmovmskb_zext_lt, *sse2_pmovmskb_ext_lt,
21660 *<sse4_1_avx2>_pblendvb_lt): Likewise.
21662 2021-06-04 Jakub Jelinek <jakub@redhat.com>
21665 * config/i386/i386-expand.c (ix86_expand_vector_init): Handle
21666 concatenation from half-sized modes with TImode elements.
21668 2021-06-04 Claudiu Zissulescu <claziss@synopsys.com>
21670 * config/arc/arc.c (arc_override_options): Disable millicode
21671 thunks when RF16 is on.
21673 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
21675 * config/rs6000/rs6000.h (PROMOTE_MODE): Remove.
21677 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
21679 * config/rs6000/rs6000-call.c (rs6000_promote_function_mode):
21680 Replace PROMOTE_MODE marco with its content.
21682 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
21684 * config/cris/cris.md (*addi_reload): Fix empty split condition.
21686 2021-06-03 Jim Wilson <jimw@sifive.com>
21688 * config.gcc (riscv*-*-*): If --with-riscv-attribute not used,
21689 turn it on for all riscv targets.
21691 2021-06-03 Uroš Bizjak <ubizjak@gmail.com>
21694 * config/i386/i386-expand.c (ix86_expand_vector_set):
21695 Handle V2HI and V4QI modes.
21696 (ix86_expand_vector_extract): Ditto.
21697 * config/i386/mmx.md (*pinsrw): New insn pattern.
21700 (*pextrw_zext): Ditto.
21702 (*pextrb_zext): Ditto.
21703 (vec_setv2hi): New expander.
21704 (vec_extractv2hihi): Ditto.
21705 (vec_setv4qi): Ditto.
21706 (vec_extractv4qiqi): Ditto.
21707 (vec_setv8qi): Enable only for TARGET_SSE4_1.
21708 (vec_extractv8qiqi): Ditto.
21710 2021-06-03 Aaron Sawdey <acsawdey@linux.ibm.com>
21712 * config/rs6000/genfusion.pl (gen_logical_addsubf): Fix input
21713 order to subf instruction.
21714 * config/rs6000/fusion.md: Regenerate.
21716 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
21718 * calls.c (get_size_range): Use range_of_expr instead of
21719 determine_value_range.
21720 * tree-affine.c (expr_to_aff_combination): Same.
21721 * tree-data-ref.c (split_constant_offset): Same.
21722 * tree-vrp.c (determine_value_range_1): Remove.
21723 (determine_value_range): Remove.
21724 * tree-vrp.h (determine_value_range): Remove.
21726 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
21728 * function-tests.c (test_ranges): Call gimple_range_tests.
21729 * gimple-range-cache.cc (ranger_cache::range_of_expr): Pass stmt
21731 * gimple-range.cc (fur_source::get_operand): Do not call
21732 get_tree_range or gimple_range_global.
21734 (get_tree_range): Move to value-query.cc.
21735 Call get_arith_expr_range.
21736 (gimple_ranger::range_of_expr): Add argument to get_tree_range.
21737 Include gimple-range-tests.cc.
21738 * gimple-range.h (fold_range): Add argument.
21739 (get_tree_range): Remove.
21740 * selftest.h (gimple_range_tests): New.
21741 * value-query.cc (global_range_query::range_of_expr): Add
21743 (range_query::get_tree_range): Move from gimple-range.cc.
21744 * value-query.h (class range_query): Add get_tree_range and
21745 get_arith_expr_range. Make fur_source a friend.
21746 * vr-values.c (vr_values::range_of_expr): Pass stmt to
21748 * gimple-range-tests.cc: New file.
21750 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
21752 * gimple-range.cc (gimple_ranger::export_global_ranges): Call
21753 update_global_range.
21754 * value-query.cc (update_global_range): New.
21755 * value-query.h (update_global_range): New.
21757 2021-06-03 David Malcolm <dmalcolm@redhat.com>
21759 * diagnostic-show-locus.c (diagnostic_show_locus): Don't reject
21760 printing the same location twice if there are fix-it hints,
21761 multiple locations, or a label.
21763 2021-06-03 Andre Vieira <andre.simoesdiasvieira@arm.com>
21765 * tree-vect-loop.c (vect_transform_loop): Use main loop's various'
21766 thresholds to narrow the upper bound on epilogue iterations.
21768 2021-06-03 Christophe Lyon <christophe.lyon@linaro.org>
21770 * config/arm/mve.md (mve_vabsq_f<mode>): Use 'abs' instead of unspec.
21771 (mve_vabsq_s<mode>): Likewise.
21772 * config/arm/neon.md (abs<mode>2): Rename to neon_abs<mode>2.
21773 * config/arm/unspecs.md (VABSQ_F, VABSQ_S): Delete.
21774 * config/arm/vec-common.md (neg<mode>2): Rename to
21775 <absneg_str><mode>2.
21777 2021-06-03 Claudiu Zissulescu <claziss@synopsys.com>
21779 * common/config/arc/arc-common.c (arc_option_optimization_table):
21780 Remove malign-call.
21781 * config/arc/arc.c (arc_unalign_branch_p): Remove unused function.
21782 * config/arc/arc.h (TARGET_MIXED_CODE): Remove macro.
21783 (INDEX_REG_CLASS): Only refer to GENERAL_REGS.
21784 * config/arc/arc.md (abssi2_mixed): Remove pattern.
21785 * config/arc/arc.opt (munalign-prob-threshold): Mark it obsolete.
21786 (malign-call): Likewise.
21787 (mmixed-code): Likewise.
21788 * doc/invoke.texi (ARC): Update doc.
21790 2021-06-03 Martin Liska <mliska@suse.cz>
21792 * common.opt: Use proper Enum values.
21793 * opts.c (COVERAGE_SANITIZER_OPT): Remove.
21794 (parse_sanitizer_options): Handle only sanitizer_opts.
21795 (common_handle_option): Just assign value.
21797 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
21800 * tree-inline.c (inline_forbidden_p): Remove test on return type.
21802 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
21804 * dwarf2out.c (loc_list_from_tree_1) <FUNCTION_DECL>: Also generate
21805 DW_OP_GNU_variable_value referencing an existing DIE at file scope.
21806 (type_byte_size): Inline into...
21807 (add_byte_size_attribute): ...this and call add_scalar_info.
21809 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
21811 * dwarf2out.c (mem_loc_descriptor) <UDIV>: Fix typo.
21812 (typed_binop_from_tree): New function.
21813 (loc_list_from_tree_1) <EXACT_DIV_EXPR>: For an unsigned type,
21814 turn a divide by a power of 2 into a shift.
21815 <CEIL_DIV_EXPR>: For an unsigned type, use a signed divide if the
21816 size of the mode is lower than DWARF2_ADDR_SIZE; otherwise, do a
21817 typed divide by calling typed_binop_from_tree.
21819 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
21821 * dwarf2out.c (scompare_loc_descriptor): Fix head comment.
21822 (is_handled_procedure_type): Likewise.
21823 (struct loc_descr_context): Add strict_signedness field.
21824 (resolve_args_picking_1): Deal with DW_OP_[GNU_]deref_type,
21825 DW_OP_[GNU_]convert and DW_OP_[GNU_]reinterpret.
21826 (resolve_args_picking): Minor tweak.
21827 (function_to_dwarf_procedure): Initialize strict_signedness field.
21828 (type_byte_size): Likewise.
21829 (field_byte_offset): Likewise.
21830 (gen_descr_array_type_die): Likewise.
21831 (gen_variant_part): Likewise.
21832 (loc_list_from_tree_1) <CALL_EXPR>: Tidy up and set strict_signedness
21833 to true when a context is present before evaluating the arguments.
21834 <COND_EXPR>: Do not generate a useless comparison with zero.
21835 When dereferencing an address, if strict_signedness is true and the
21836 type is small and signed, use DW_OP_deref_type to do the dereference
21837 and then DW_OP_convert to convert back to the generic type.
21839 2021-06-03 Jakub Jelinek <jakub@redhat.com>
21842 * tree-inline.c (copy_tree_body_r): Handle iterators on
21843 OMP_CLAUSE_AFFINITY or OMP_CLAUSE_DEPEND.
21845 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
21847 * config/arc/arc.md (*bbit_di): Remove.
21849 2021-06-02 Christoph Muellner <cmuellner@gcc.gnu.org>
21851 PR rtl-optimization/100264
21852 * ree.c (get_sub_rtx): Ignore SET expressions without register
21853 destinations and remove assertion, as it is not valid anymore
21854 with this new behaviour.
21855 (merge_def_and_ext): Eliminate destination check for register
21856 as such SET expressions can't occur anymore.
21857 (combine_reaching_defs): Likewise.
21859 2021-06-02 Jakub Jelinek <jakub@redhat.com>
21862 * config/xtensa/xtensa.h (LEAF_REG_REMAP): Cast REGNO to int to avoid
21863 -Wtype-limits warnings.
21864 (DWARF_FRAME_REGISTER): Rewrite into ternary operator with addition
21865 in operands to avoid -Wsign-compare warnings.
21867 2021-06-02 Pat Haugen <pthaugen@linux.ibm.com>
21869 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): Use
21872 2021-06-02 Vineet Gupta <vgupta@synopsys.com>
21874 * config/arc/arc.h (TARGET_CPU_DEFAULT): Change to hs38_linux.
21876 2021-06-02 Ilya Leoshkevich <iii@linux.ibm.com>
21878 * config/s390/s390.md(*ashrdi3_31<setcc><cconly>): Use a single
21880 * config/s390/subst.md(cconly_subst): Use a single constraint
21881 in (match_scratch).
21883 2021-06-02 Martin Liska <mliska@suse.cz>
21885 * ipa-icf.h: Use auto_vec for memory_access_types.
21887 2021-06-02 Jeff Law <jeffreyalaw@gmail.com>
21889 * config/h8300/h8300-protos.h (compute_a_shift_length): Drop unused
21890 argument from prototype.
21891 (output_logical_op): Add rtx_code argument.
21892 (compute_logical_op_length): Likewise.
21893 * config/h8300/h8300.c (h8300_and_costs): Pass additional argument
21894 to compute_a_shift_length.
21895 (output_logical_op); New argument with the rtx code rather than
21896 extracting it from an operand. Handle QImode too.
21897 (compute_logical_op_length): Similary.
21898 (compute_a_shift_length): Drop unused argument.
21899 * config/h8300/h8300.md (logicals): New code iterator.
21900 * config/h8300/logical.md (<code><mode>3 expander): Combine
21901 the "and" expander with the "ior"/"xor" expander.
21902 (bclr<mode>msx): Combine the QI/HI mode patterns.
21903 (<logical><mode>3 insns): Use code iterator rather than match_operator.
21904 Handle QImode as well. Update call to output_logical_op and
21905 compute_logical_op_length to pass in rtx_code
21906 Fix split condition on all define_insn_and_split patterns.
21907 (one_cmpl<mode>2<cczn>): Use <cczn> to support both clobbering
21908 the flags and setting ZN via existing define_subst.
21909 * config/h8300/shiftrotate.md: Drop unused argument from
21910 calls to compute_a_shift_length.
21911 Signed-off-by: Jeff Law <jeffreyalaw@gmail.com>
21913 2021-06-01 Andrew Pinski <apinski@marvell.com>
21915 PR tree-optimization/25290
21916 * tree-ssa-phiopt.c (match_simplify_replacement):
21918 (tree_ssa_phiopt_worker): Use match_simplify_replacement.
21919 (two_value_replacement): Change the comment about
21920 conditional_replacement.
21921 (conditional_replacement): Delete.
21923 2021-06-01 Andrew Pinski <apinski@marvell.com>
21925 PR tree-optimization/95481
21926 * tree-tailcall.c (find_tail_calls): Handle empty typed
21929 2021-06-01 Andrew Pinski <apinski@marvell.com>
21931 * gimplify.c (zero_sized_field_decl): Delete
21932 (zero_sized_type): Delete
21933 (gimplify_init_ctor_eval): Use is_empty_type instead
21934 of zero_sized_field_decl.
21935 (gimplify_modify_expr): Use is_empty_type instead of
21938 2021-06-01 Jason Merrill <jason@redhat.com>
21941 * tree.h (CALL_FROM_NEW_OR_DELETE_P): Adjust comment.
21943 2021-06-01 Jason Merrill <jason@redhat.com>
21946 * diagnostic.h (warning_enabled_at): Declare.
21947 * diagnostic.c (diagnostic_enabled): Factor out from...
21948 (diagnostic_report_diagnostic): ...here.
21949 (warning_enabled_at): New.
21951 2021-06-01 Aldy Hernandez <aldyh@redhat.com>
21953 * gimple-ssa-evrp.c: Enable exporting of global ranges.
21955 2021-06-01 Martin Liska <mliska@suse.cz>
21958 * doc/invoke.texi: Mention that -fgcse-after-reload
21959 is enabled with -O3.
21961 2021-06-01 liuhongt <hongtao.liu@intel.com>
21963 PR tree-optimization/98365
21964 * tree-if-conv.c (strip_nop_cond_scalar_reduction): New function.
21965 (is_cond_scalar_reduction): Handle nop_expr in cond scalar reduction.
21966 (convert_scalar_cond_reduction): Ditto.
21967 (predicate_scalar_phi): Ditto.
21969 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
21971 PR tree-optimization/100781
21972 * gimple-range-cache.cc (ranger_cache::ranger_cache): Enable new
21973 value calculation by default.
21974 (ranger_cache::enable_new_values): New.
21975 (ranger_cache::disable_new_values): New.
21976 (ranger_cache::push_poor_value): Check if new values are allowed.
21977 * gimple-range-cache.h (class ranger_cache): New member/methods.
21978 * gimple-range.cc (gimple_ranger::range_of_expr): Check for debug
21979 statement, and disable/renable new value calculation.
21981 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
21983 * gimple-range-cache.cc (ranger_cache::ssa_range_in_bb): Delete.
21984 (ranger_cache::range_of_def): New.
21985 (ranger_cache::entry_range): New.
21986 (ranger_cache::exit_range): New.
21987 (ranger_cache::range_of_expr): Adjust.
21988 (ranger_cache::range_on_edge): Adjust.
21989 (ranger_cache::propagate_cache): Call exit_range directly.
21990 * gimple-range-cache.h (class ranger_cache): Adjust.
21992 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
21994 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust for
21995 gori_compute being a member rather than base class.
21996 dervied call to member call.
21997 (ranger_cache::dump): No longer dump gori_map.
21998 (ranger_cache::dump_bb): New.
21999 (ranger_cache::get_non_stale_global_range): Adjust for gori_compute
22000 being a member rather than base class.
22001 (ranger_cache::set_global_range): Ditto.
22002 (ranger_cache::ssa_range_in_bb): Ditto.
22003 (ranger_cache::range_of_expr): New.
22004 (ranger_cache::range_on_edge): New.
22005 (ranger_cache::block_range): Adjust for gori_computes. Debug changes.
22006 (ranger_cache::propagate_cache): Adjust debugging output.
22007 (ranger_cache::fill_block_cache): Adjust for gori_computes. Debug
22009 * gimple-range-cache.h (class ranger_cache): Make gori_compute a
22010 member, and inherit from range_query instead.
22011 (ranger_cache::dump_bb): New. split from dump.
22012 * gimple-range-gori.cc (gori_compute::ssa_range_in_bb): Delete.
22013 (gori_compute::expr_range_at_stmt): Delete.
22014 (gori_compute::compute_name_range_op): Delete.
22015 (gori_compute::compute_operand_range_switch): Add fur_source.
22016 (gori_compute::compute_operand_range): Add fur_source param, inline
22017 old compute_name_range_op and optimize_logical_operands.
22018 (struct tf_range): Delete.
22019 (gori_compute::logical_combine): Adjust
22020 (gori_compute::optimize_logical_operands): Delete.
22021 (gori_compute::compute_logical_operands_in_chain): Delete.
22022 (gori_compute::compute_logical_operands): Adjust.
22023 (gori_compute::compute_operand1_range): Adjust to fur_source.
22024 (gori_compute::compute_operand2_range): Ditto.
22025 (gori_compute::compute_operand1_and_operand2_range): Ditto.
22026 (gori_compute::outgoing_edge_range_p): Add range_query parameter,
22027 and adjust to fur_source.
22028 * gimple-range-gori.h (class gori_compute): Simplify and adjust to
22029 range_query and fur_source.
22030 * gimple-range.cc (gimple_ranger::range_on_edge): Query range_on_edge
22031 from the ranger_cache..
22032 (gimple_ranger::fold_range_internal): Adjust to base class change of
22034 (gimple_ranger::dump_bb): Adjust dump.
22035 * gimple-range.h (gimple_ranger):export gori computes object.
22037 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22039 PR tree-optimization/100774
22040 * gimple-range-cache.cc (ranger_cache::get_non_stale_global_range):
22041 Constant values are also not stale.
22042 (ranger_cache::set_global_range): Range invariant values should also
22043 have the correct timestamp.
22045 2021-05-31 Martin Liska <mliska@suse.cz>
22047 * tree-streamer-in.c (unpack_ts_function_decl_value_fields):
22048 Unpack FUNCTION_DECL_DECL_TYPE.
22049 * tree-streamer-out.c (pack_ts_function_decl_value_fields):
22050 Stream FUNCTION_DECL_DECL_TYPE instead of
22051 DECL_IS_OPERATOR_NEW_P.
22052 * tree.h (set_function_decl_type): Use FUNCTION_DECL_DECL_TYPE
22054 (DECL_IS_OPERATOR_NEW_P): Likewise.
22055 (DECL_IS_OPERATOR_DELETE_P): Likewise.
22056 (DECL_LAMBDA_FUNCTION_P): Likewise.
22058 2021-05-31 Richard Biener <rguenther@suse.de>
22061 * internal-fn.c (expand_SHUFFLEVECTOR): Define.
22062 * internal-fn.def (SHUFFLEVECTOR): New.
22063 * internal-fn.h (expand_SHUFFLEVECTOR): Declare.
22064 * doc/extend.texi: Document __builtin_shufflevector.
22066 2021-05-31 Peter Bergner <bergner@linux.ibm.com>
22069 * config/rs6000/predicates.md(mma_assemble_input_operand): Allow
22070 indexed form addresses.
22072 2021-05-29 Jeff Law <jlaw@tachyum.com>
22074 * config/h8300/h8300.c (h8300_emit_stack_adjustment): Drop unused
22075 parameter. Call callers fixed.
22077 (output_plussi): Add FALLTHRU markers.
22078 (h8300_shift_needs_scratch_p): Add gcc_unreachable marker.
22080 2021-05-29 Jakub Jelinek <jakub@redhat.com>
22082 PR middle-end/99928
22083 * gimplify.c (gimplify_scan_omp_clauses): For taskloop simd
22084 combined with parallel, make sure to add shared clause to
22085 parallel for explicit linear clause.
22087 2021-05-29 Aldy Hernandez <aldyh@redhat.com>
22089 PR tree-optimization/100787
22090 * gimple-ssa-evrp.c: Disable exporting of global ranges.
22092 2021-05-28 Jason Merrill <jason@redhat.com>
22094 * tree-iterator.h (struct tree_stmt_iterator): Add operator++,
22095 operator--, operator*, operator==, and operator!=.
22096 (class tsi_range): New.
22098 2021-05-28 Richard Biener <rguenther@suse.de>
22100 PR tree-optimization/100778
22101 * tree-vect-slp.c (vect_build_slp_tree_1): Prevent possibly
22102 trapping ops in different BBs.
22104 2021-05-28 Richard Biener <rguenther@suse.de>
22107 * tree-inline.c (copy_bb): When processing __builtin_va_arg_pack
22108 copy fntype from original call.
22110 2021-05-28 Martin Liska <mliska@suse.cz>
22112 PR gcov-profile/100751
22113 * doc/gcov.texi: Revert partially a hunk that was wrong.
22115 2021-05-28 Cooper Qu <cooper.qu@linux.alibaba.com>
22117 * config/csky/csky-linux-elf.h (HAVE_sync_compare_and_swapqi):
22119 (HAVE_sync_compare_and_swaphi): Likewise.
22120 (HAVE_sync_compare_and_swapsi): Likewise.
22122 2021-05-28 Jakub Jelinek <jakub@redhat.com>
22124 PR middle-end/99928
22125 * tree.h (OMP_CLAUSE_MAP_IMPLICIT): Define.
22127 2021-05-28 Tobias Burnus <tobias@codesourcery.com>
22129 * gimplify.c (gimplify_omp_affinity): New.
22130 (gimplify_scan_omp_clauses): Call it; remove affinity clause afterwards.
22131 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_AFFINITY.
22132 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_AFFINITY.
22133 * tree.c (omp_clause_num_ops, omp_clause_code_name): Add clause.
22134 (walk_tree_1): Handle OMP_CLAUSE_AFFINITY.
22136 2021-05-28 Joern Rennecke <joern.rennecke@riscy-ip.com>
22137 Richard Biener <rguenther@suse.de>
22139 * match.pd <popcount & / + pattern matching>:
22140 When generating popcount directly fails, try doing it in two halves.
22142 2021-05-28 Bernd Edlinger <bernd.edlinger@hotmail.de>
22144 * Makefile.in (generated_files): Add gimple-match.c and
22147 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
22149 * gensupport.c (alter_predicate_for_insn): Handle MATCH_DUP.
22151 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
22153 * gensupport.c (alter_constraints): Add MATCH_SCRATCH case.
22155 2021-05-28 Kewen Lin <linkw@linux.ibm.com>
22157 PR tree-optimization/99398
22158 * tree-ssa-forwprop.c (simplify_permutation): Optimize some cases
22159 where the fed operands are CTOR/CST and propagated through
22160 VIEW_CONVERT_EXPR. Call vec_perm_indices::new_shrunk_vector.
22161 * vec-perm-indices.c (vec_perm_indices::new_shrunk_vector): New
22163 * vec-perm-indices.h (vec_perm_indices::new_shrunk_vector): New
22166 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
22168 * config/i386/mmx.md (addv2sf3): Do not call
22169 ix86_fixup_binary_operands_no_copy.
22172 (<smaxmin:code>v2sf3): Ditto.
22173 (<plusminus:insn><MMXMODEI:mode>3): Ditto.
22174 (<plusminus:insn><VI_32:mode>3): Remove expander.
22175 (<plusminus:insn><VI_32:mode>3): Rename from
22176 "*<plusminus:insn><VI_32:mode>3".
22177 (mulv4hi): Do not call ix86_fixup_binary_operands_no_copy.
22178 (mulv2hi3): Remove expander.
22179 (mulv2hi3): Rename from *mulv2hi3.
22180 (<s>mulv2hi3_highpart): Remove expander.
22181 (<s>mulv2hi3_highpart): Rename from *<s>mulv2hi3_highpart.
22182 (<smaxmin:code><MMXMODE14:mode>3): Rename from
22183 "*mmx_<smaxmin:code><MMXMODE14:mode>3".
22184 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Remove expander.
22185 (SMAXMIN_MMXMODEI): Remove mode iterator.
22186 (<smaxmin:code>v4hi3): New expander.
22187 (<smaxmin:code>v4qi3): Rename from *<smaxmin:code>v4qi3.
22188 (<smaxmin:code>v2hi3): Rename from *<smaxmin:code>v2hi3.
22189 (<smaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
22190 (SMAXMIN_VI_32): Remove mode iterator.
22191 (<umaxmin:code><MMXMODE24:mode>3): Rename from
22192 "*mmx_<umaxmin:code><MMXMODE24:mode>3".
22193 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Remove expander.
22194 (UMAXMIN_MMXMODEI): Remove mode iterator.
22195 (<umaxmin:code>v8qi3): New expander.
22196 (<umaxmin:code>v4qi3): Rename from *<umaxmin:code>v4qi3.
22197 (<umaxmin:code>v2hi3): Rename from *<umaxmin:code>v2hi3.
22198 (<umaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
22199 (UMAXMIN_VI_32): Remove mode iterator.
22200 (<any_shift:insn>v2hi3): Remove expander.
22201 (<any_shift:insn>v2hi3): Rename from *<any_shift:insn>v2hi3.
22202 (<any_logic:code><MMXMODEI:mode>3): Do not call
22203 ix86_fixup_binary_operands_no_copy.
22204 (<any_logic:code><VI_32:mode>3): Remove expander.
22205 (<any_logic:code><VI_32:mode>3): Rename from
22206 "*<any_logic:code><VI_32:mode>3".
22207 (uavg<mode>3_ceil): Do not call ix86_fixup_binary_operands_no_copy.
22208 * config/i386/sse.md (div<VF2:mode>3): Do not call
22209 ix86_fixup_binary_operands_no_copy.
22210 (div<VF1:mode>3): Ditto.
22211 (<maxmin:code><VI8_AVX2_AVX512F:mode>3): Ditto.
22212 (smulhrsv4hi3): Ditto.
22213 (smulhrsv2hi3): Ditto.
22215 2021-05-27 Martin Sebor <msebor@redhat.com>
22217 * ggc.h (gt_ggc_mx): Add overloads for all integers.
22219 * hash-map.h (class hash_map): Add pch_nx_helper overloads for all
22221 (hash_map::operator==): New function.
22223 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
22226 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
22227 For TARGET_XOP bypass SSE comparisons for all supported vector modes.
22228 * config/i386/mmx.md (*xop_maskcmp<MMXMODEI:mode>3): New insn pattern.
22229 (*xop_maskcmp<VI_32:mode>3): Ditto.
22230 (*xop_maskcmp_uns<MMXMODEI:mode>3): Ditto.
22231 (*xop_maskcmp_uns<VI_32:mode>3): Ditto.
22233 2021-05-27 Richard Earnshaw <rearnsha@arm.com>
22236 * config/arm/arm.c (arm_configure_build_target): Remove parameter
22237 opts_set, directly check opts parameters for being non-null.
22238 (arm_option_restore): Update call to arm_configure_build_target.
22239 (arm_option_override): Likewise.
22240 (arm_can_inline_p): Likewise.
22241 (arm_valid_target_attribute_tree): Likewise.
22242 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
22243 * config/arm/arm-protos.h (arm_configure_build_target): Adjust
22246 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
22248 * vr-values.c (simplify_conversion_using_ranges): Use
22249 get_range_query instead of get_global_range_query.
22251 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
22253 * gimple-range.cc (get_range_global): Move to value-query.cc.
22254 (gimple_range_global): Same.
22255 (get_global_range_query): Same.
22256 (global_range_query::range_of_expr): Same.
22257 * gimple-range.h (class global_range_query): Move to
22259 (gimple_range_global): Same.
22260 * tree-ssanames.c (get_range_info): Move to value-query.cc.
22261 (get_ptr_nonnull): Same.
22262 * tree-ssanames.h (get_range_info): Remove.
22263 (get_ptr_nonnull): Remove.
22264 * value-query.cc (get_ssa_name_range_info): Move from
22266 (get_ssa_name_ptr_info_nonnull): Same.
22267 (get_range_global): Move from gimple-range.cc.
22268 (gimple_range_global): Same.
22269 (get_global_range_query): Same.
22270 (global_range_query::range_of_expr): Same.
22271 * value-query.h (class global_range_query): Move from
22273 (gimple_range_global): Same.
22275 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
22278 * config/i386/mmx.md (uavgv4qi3_ceil): New insn pattern.
22279 (uavgv2hi3_ceil): Ditto.
22281 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
22284 * doc/extend.texi (scalar_storage_order): Rephrase slightly.
22286 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22288 * tree-ssanames.c (get_range_info): Merge both copies of
22289 get_range_info into one that works with irange.
22290 * tree-ssanames.h (get_range_info): Remove version that works on
22293 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22295 * builtins.c (check_nul_terminated_array): Convert to get_range_query.
22296 (expand_builtin_strnlen): Same.
22297 (determine_block_size): Same.
22298 * fold-const.c (expr_not_equal_to): Same.
22299 * gimple-fold.c (size_must_be_zero_p): Same.
22300 * gimple-match-head.c: Include gimple-range.h.
22301 * gimple-pretty-print.c (dump_ssaname_info): Convert to get_range_query.
22302 * gimple-ssa-warn-restrict.c
22303 (builtin_memref::extend_offset_range): Same.
22304 * graphite-sese-to-poly.c (add_param_constraints): Same.
22305 * internal-fn.c (get_min_precision): Same.
22306 * ipa-fnsummary.c (set_switch_stmt_execution_predicate): Same.
22307 * ipa-prop.c (ipa_compute_jump_functions_for_edge): Same.
22309 * tree-data-ref.c (split_constant_offset): Same.
22310 (dr_step_indicator): Same.
22311 * tree-dfa.c (get_ref_base_and_extent): Same.
22312 * tree-scalar-evolution.c (iv_can_overflow_p): Same.
22313 * tree-ssa-loop-niter.c (refine_value_range_using_guard): Same.
22314 (determine_value_range): Same.
22315 (record_nonwrapping_iv): Same.
22316 (infer_loop_bounds_from_signedness): Same.
22317 (scev_var_range_cant_overflow): Same.
22318 * tree-ssa-phiopt.c (two_value_replacement): Same.
22319 * tree-ssa-pre.c (insert_into_preds_of_block): Same.
22320 * tree-ssa-reassoc.c (optimize_range_tests_to_bit_test): Same.
22321 * tree-ssa-strlen.c (handle_builtin_stxncpy_strncat): Same.
22323 (dump_strlen_info): Same.
22324 (set_strlen_range): Same.
22325 (maybe_diag_stxncpy_trunc): Same.
22326 (get_len_or_size): Same.
22327 (handle_integral_assign): Same.
22328 * tree-ssa-structalias.c (find_what_p_points_to): Same.
22329 * tree-ssa-uninit.c (find_var_cmp_const): Same.
22330 * tree-switch-conversion.c (bit_test_cluster::emit): Same.
22331 * tree-vect-patterns.c (vect_get_range_info): Same.
22332 (vect_recog_divmod_pattern): Same.
22333 * tree-vrp.c (intersect_range_with_nonzero_bits): Same.
22334 (register_edge_assert_for_2): Same.
22335 (determine_value_range_1): Same.
22336 * tree.c (get_range_pos_neg): Same.
22337 * vr-values.c (vr_values::get_lattice_entry): Same.
22338 (vr_values::update_value_range): Same.
22339 (simplify_conversion_using_ranges): Same.
22341 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22343 * gimple-ssa-warn-alloca.c (alloca_call_type): Use
22344 get_range_query instead of query argument.
22345 (pass_walloca::execute): Enable and disable global ranger.
22347 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22349 * gimple-ssa-evrp.c (rvrp_folder::rvrp_folder): Call
22351 (rvrp_folder::~rvrp_folder): Call disable_ranger.
22352 (hybrid_folder::hybrid_folder): Call enable_ranger.
22353 (hybrid_folder::~hybrid_folder): Call disable_ranger.
22355 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22357 * function.c (allocate_struct_function): Set cfun->x_range_query.
22358 * function.h (struct function): Declare x_range_query.
22359 (get_range_query): New.
22360 (get_global_range_query): New.
22361 * gimple-range-cache.cc (ssa_global_cache::ssa_global_cache):
22362 Remove call to safe_grow_cleared.
22363 * gimple-range.cc (get_range_global): New.
22364 (gimple_range_global): Move from gimple-range.h.
22365 (get_global_range_query): New.
22366 (global_range_query::range_of_expr): New.
22367 (enable_ranger): New.
22368 (disable_ranger): New.
22369 * gimple-range.h (gimple_range_global): Move to gimple-range.cc.
22370 (class global_range_query): New.
22371 (enable_ranger): New.
22372 (disable_ranger): New.
22373 * gimple-ssa-evrp.c (evrp_folder::~evrp_folder): Rename
22374 dump_all_value_ranges to dump.
22375 * tree-vrp.c (vrp_prop::finalize): Same.
22376 * value-query.cc (range_query::dump): New.
22377 * value-query.h (range_query::dump): New.
22378 * vr-values.c (vr_values::dump_all_value_ranges): Rename to...
22379 (vr_values::dump): ...this.
22380 * vr-values.h (class vr_values): Rename dump_all_value_ranges to
22381 dump and make virtual.
22383 2021-05-26 Uroš Bizjak <ubizjak@gmail.com>
22385 * config/i386/i386.c (ix86_autovectorize_vector_modes):
22386 Add V4QImode and V16QImode for TARGET_SSE2.
22387 * doc/sourcebuild.texi (Vector-specific attributes):
22388 Add vect64 and vect32 description.
22390 2021-05-26 Bernd Edlinger <bernd.edlinger@hotmail.de>
22392 * gimple-range-gori.cc (range_def_chain::register_dependency):
22393 Resize m_def_chain when needed.
22395 2021-05-26 Christophe Lyon <christophe.lyon@linaro.org>
22397 * config/arm/mve.md (mve_vaddvq_<supf><mode>): Prefix with '@'.
22398 * config/arm/neon.md (reduc_plus_scal_<mode>): Move to ..
22399 * config/arm/vec-common.md: .. here. Add support for MVE.
22401 2021-05-26 Jakub Jelinek <jakub@redhat.com>
22403 * config/epiphany/epiphany.c (epiphany_print_operand_address): Remove
22405 * config/microblaze/microblaze.c (microblaze_legitimize_address,
22407 microblaze_option_override, print_operand): Likewise.
22408 * config/microblaze/microblaze.md (call_internal_plt,
22409 call_value_intern_plt, call_value_intern): Likewise.
22410 * config/arm/aout.h (ASM_OUTPUT_ALIGN): Likewise.
22411 * config/iq2000/iq2000.md (call_internal1, call_value_internal1,
22412 call_value_multiple_internal1): Likewise.
22413 * config/bfin/bfin.c (symbolic_reference_mentioned_p): Likewise.
22415 2021-05-26 Jan-Benedict Glaw <jbglaw@lug-owl.de>
22417 * config/arc/arc.c (arc_address_cost, arc_print_operand_address,
22418 arc_ccfsm_advance, symbolic_reference_mentioned_p,
22419 arc_raw_symbolic_reference_mentioned_p): Remove register
22422 2021-05-26 Jakub Jelinek <jakub@redhat.com>
22425 * omp-low.c: Include omp-offload.h.
22426 (create_omp_child_function): If current_function_decl has
22427 "omp declare target" attribute and is_gimple_omp_offloaded,
22428 remove that attribute from the copy of attribute list and
22429 add "omp target entrypoint" attribute instead.
22430 (lower_omp_target): Mark .omp_data_sizes.* and .omp_data_kinds.*
22431 variables for offloading if in omp_maybe_offloaded_ctx.
22432 * omp-offload.c (pass_omp_target_link::execute): Nullify second
22433 argument to GOMP_target_data_ext in offloaded code.
22435 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
22437 * config/csky/csky.c (csky_can_change_mode_class): Delete.
22438 For csky, HF/SF mode use the low bits of VREGS.
22440 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
22442 * gimplify.c (gimplify_decl_expr): Do not clear TREE_READONLY on a
22443 DECL which is a reference for OMP.
22445 2021-05-26 Martin Liska <mliska@suse.cz>
22447 PR gcov-profile/100751
22448 * doc/gcov.texi: Document that __gcov_dump can be called just
22449 once and that __gcov_reset resets run-time counters.
22451 2021-05-26 Martin Liska <mliska@suse.cz>
22453 * doc/install.texi: Port relevant part from install-old.texi
22454 and re-generate list of CPUs and systems.
22456 2021-05-26 Martin Liska <mliska@suse.cz>
22458 * Makefile.in: Remove it.
22459 * doc/include/fdl.texi: Update next/previous chapters.
22460 * doc/install.texi: Likewise.
22461 * doc/install-old.texi: Removed.
22463 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
22465 * config/csky/csky.c (ck810_legitimate_index_p): Support
22466 "base + index" with DF mode.
22467 * config/csky/constraints.md ("Y"): New constraint for memory operands
22468 without index register.
22469 * config/csky/csky_insn_fpuv2.md (fpuv3_movdf): Use "Y" instead of "m"
22470 when mov between memory and general registers, and lower their priority.
22471 * config/csky/csky_insn_fpuv3.md (fpuv2_movdf): Likewise.
22473 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
22475 * config/csky/csky.c (TARGET_PROMOTE_PROTOTYPES): Delete.
22477 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
22479 * config/csky/csky.md (untyped_call): Emit clobber for return
22480 registers to mark them used.
22482 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
22484 * config/csky/csky.md (cskyv2_sextend_ldbs): New.
22486 2021-05-26 Andrew Pinski <apinski@marvell.com>
22488 * match.pd (x < 0 ? ~y : y): New patterns.
22490 2021-05-26 Andrew Pinski <apinski@marvell.com>
22492 * match.pd (A?CST1:CST2): Add simplifcations for A?0:+-1, A?+-1:0,
22493 A?POW2:0 and A?0:POW2.
22495 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22497 * gimple-range-gori.cc (class logical_stmt_cache): Delete
22498 (logical_stmt_cache::logical_stmt_cache ): Delete.
22499 (logical_stmt_cache::~logical_stmt_cache): Delete.
22500 (logical_stmt_cache::cache_entry::dump): Delete.
22501 (logical_stmt_cache::get_range): Delete.
22502 (logical_stmt_cache::cached_name ): Delete.
22503 (logical_stmt_cache::same_cached_name): Delete.
22504 (logical_stmt_cache::cacheable_p): Delete.
22505 (logical_stmt_cache::slot_diagnostics ): Delete.
22506 (logical_stmt_cache::dump): Delete.
22507 (gori_compute_cache::gori_compute_cache): Delete.
22508 (gori_compute_cache::~gori_compute_cache): Delete.
22509 (gori_compute_cache::compute_operand_range): Delete.
22510 (gori_compute_cache::cache_stmt): Delete.
22511 * gimple-range-gori.h (gori_compute::compute_operand_range): Remove
22513 (class gori_compute_cache): Delete.
22515 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22517 * gimple-range.cc (fold_using_range::range_of_range_op): Use m_gori
22519 (fold_using_range::range_of_address): Adjust.
22520 (fold_using_range::range_of_phi): Adjust.
22521 * gimple-range.h (class fur_source): Adjust.
22522 (fur_source::fur_source): Adjust.
22524 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22526 * gimple-range-gori.cc (gori_compute::expr_range_at_stmt): Rename
22527 from expr_range_in_bb and adjust.
22528 (gori_compute::compute_name_range_op): Adjust.
22529 (gori_compute::optimize_logical_operands): Adjust.
22530 (gori_compute::compute_logical_operands_in_chain): Adjust.
22531 (gori_compute::compute_operand1_range): Adjust.
22532 (gori_compute::compute_operand2_range): Adjust.
22533 (ori_compute_cache::cache_stmt): Adjust.
22534 * gimple-range-gori.h (gori_compute): Rename prototype.
22536 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22538 * gimple-range.cc (gimple_ranger::range_of_expr): Non-null should be
22539 checked only after range_of_stmt, not range_on_entry.
22540 (gimple_ranger::range_on_entry): Check for non-null in any
22541 predecessor block, if it is not already non-null.
22542 (gimple_ranger::range_on_exit): DOnt check for non-null after
22543 range on entry call.
22544 (gimple_ranger::dump_bb): New. Split from dump.
22545 (gimple_ranger::dump): Adjust.
22546 * gimple-range.h (class gimple_ranger): Adjust.
22548 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22550 * gimple-range-cache.cc (struct range_timestamp): Delete.
22551 (class temporal_cache): Adjust.
22552 (temporal_cache::get_timestamp): Delete.
22553 (temporal_cache::set_dependency): Delete.
22554 (temporal_cache::temporal_value): Adjust.
22555 (temporal_cache::current_p): Take dependencies as params.
22556 (temporal_cache::set_timestamp): Adjust.
22557 (temporal_cache::set_always_current): Adjust.
22558 (ranger_cache::get_non_stale_global_range): Adjust.
22559 (ranger_cache::register_dependency): Delete.
22560 * gimple-range-cache.h (class range_cache): Adjust.
22562 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22564 * gimple-range-gori.cc (range_def_chain::range_def_chain): init
22566 (range_def_chain::~range_def_chain): Dispose of obstack rather than
22567 each individual bitmap.
22568 (range_def_chain::set_import): New.
22569 (range_def_chain::get_imports): New.
22570 (range_def_chain::chain_import_p): New.
22571 (range_def_chain::register_dependency): Rename from build_def_chain
22573 (range_def_chain::def_chain_in_bitmap_p): New.
22574 (range_def_chain::add_def_chain_to_bitmap): New.
22575 (range_def_chain::has_def_chain): Just check first depenedence.
22576 (range_def_chain::get_def_chain): Process imports, use generic
22577 register_dependency routine.
22578 (range_def_chain::dump): New.
22579 (gori_map::gori_map): Allocate import list.
22580 (gori_map::~gori_map): Release imports.
22581 (gori_map::exports): Check for past allocated block size.
22582 (gori_map::imports): New.
22583 (gori_map::def_chain_in_export_p): Delete.
22584 (gori_map::is_import_p): New.
22585 (gori_map::maybe_add_gori): Handle imports.
22586 (gori_map::dump): Adjust output, add imports.
22587 (gori_compute::has_edge_range_p): Remove def_chain_in_export call.
22588 (gori_export_iterator::gori_export_iterator): New.
22589 (gori_export_iterator::next): New.
22590 (gori_export_iterator::get_name): New.
22591 * gimple-range-gori.h (range_def_chain): Add imports and direct
22592 dependecies via struct rdc.
22593 (range_def_chain::depend1): New.
22594 (range_def_chain::depend2): New.
22595 (class gori_map): Adjust.
22596 (FOR_EACH_GORI_IMPORT_NAME): New.
22597 (FOR_EACH_GORI_EXPORT_NAME): New.
22598 (class gori_export_iterator): New.
22600 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22602 * gimple-range-cache.cc (ranger_cache::ranger_cache): Move initial
22603 export cache filling to here.
22604 * gimple-range-gori.cc (gori_compute::gori_compute) : From Here.
22606 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22608 * gimple-range-gori.cc (range_def_chain): Move to gimple-range-gori.h.
22609 (gori_map): Move to gimple-range-gori.h.
22610 (gori_compute::gori_compute): Adjust.
22611 (gori_compute::~gori_compute): Delete.
22612 (gori_compute::compute_operand_range_switch): Adjust.
22613 (gori_compute::compute_operand_range): Adjust.
22614 (gori_compute::compute_logical_operands): Adjust.
22615 (gori_compute::has_edge_range_p ): Adjust.
22616 (gori_compute::set_range_invariant): Delete.
22617 (gori_compute::dump): Adjust.
22618 (gori_compute::outgoing_edge_range_p): Adjust.
22619 * gimple-range-gori.h (class range_def_chain): Relocate here.
22620 (class gori_map): Relocate here.
22621 (class gori_compute): Inherit from gori_map, and adjust.
22623 2021-05-25 Aldy Hernandez <aldyh@redhat.com>
22625 * value-range.cc (range_tests_legacy): Use
22626 build_nonstandard_integer_type instead of int and short.
22628 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
22630 * gimplify.c (gimplify_decl_expr): Clear TREE_READONLY on the DECL
22631 when really creating an initialization statement for it.
22633 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
22635 * tree-inline.c (setup_one_parameter): Fix thinko in new condition.
22637 2021-05-25 Kito Cheng <kito.cheng@sifive.com>
22639 * config/riscv/riscv.h (ASM_SPEC): Pass -mno-relax.
22641 2021-05-25 Martin Liska <mliska@suse.cz>
22643 PR tree-optimization/92860
22645 * optc-save-gen.awk: Remove exceptions.
22647 2021-05-25 Martin Liska <mliska@suse.cz>
22649 * asan.h (sanitize_coverage_p): New function.
22650 * doc/extend.texi: Document it.
22651 * fold-const.c (fold_range_test): Use sanitize_flags_p
22652 instead of flag_sanitize_coverage.
22653 (fold_truth_andor): Likewise.
22654 * sancov.c: Likewise.
22655 * tree-ssa-ifcombine.c (ifcombine_ifandif): Likewise.
22656 * ipa-inline.c (sanitize_attrs_match_for_inline_p): Handle
22657 -fsanitize-coverage when inlining.
22659 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
22661 * config/csky/csky-modes.def : Fix copyright.
22663 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
22665 * config/csky/csky-modes.def : Amend copyright.
22666 * config/csky/csky_insn_fpuv2.md : Likewise.
22667 * config/csky/csky_insn_fpuv3.md : Likewise.
22669 2021-05-25 Richard Biener <rguenther@suse.de>
22671 PR middle-end/100727
22672 * calls.c (initialize_argument_information): Explicitely test
22673 for WITH_SIZE_EXPR.
22674 * gimple-expr.c (mark_addressable): Skip outer WITH_SIZE_EXPR.
22676 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
22678 * config/csky/csky.h (FRAME_POINTER_REGNUM): Use
22679 HARD_FRAME_POINTER_REGNUM and FRAME_POINTER_REGNUM instead of
22680 the signle definition. The signle definition may not work well
22681 at simplify_subreg_regno().
22682 (HARD_FRAME_POINTER_REGNUM): New.
22683 (ELIMINABLE_REGS): Add for HARD_FRAME_POINTER_REGNUM.
22684 * config/csky/csky.c (get_csky_live_regs, csky_can_eliminate,
22685 csky_initial_elimination_offset, csky_expand_prologue,
22686 csky_expand_epilogue): Add for HARD_FRAME_POINTER_REGNUM.
22688 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
22690 * config/csky/csky.c (csky_option_override):
22691 Init csky_arch_isa_features[] in advance, so TARGET_DSP
22692 and TARGET_DIV can be set well.
22694 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
22696 * config/csky/constraints.md ("l", "h"): Delete.
22697 * config/csky/csky.h (reg_class, REG_CLASS_NAMES,
22698 REG_CLASS_CONTENTS): Delete LO_REGS and HI_REGS.
22699 * config/csky/csky.c (regno_reg_classm,
22700 csky_secondary_reload, csky_register_move_cost):
22701 Use HILO_REGS instead of LO_REGS and HI_REGS.
22703 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
22705 * config/csky/constraints.md ("W"): New constriant for mem operand
22706 with base reg, index register.
22707 ("Q"): Renamed and modified "csky_valid_fpuv2_mem_operand" to
22708 "csky_valid_mem_constraint_operand" to deal with both "Q" and "W"
22710 ("Dv"): New constraint for const double value that can be used at
22712 * config/csky/csky-modes.def (HFmode): New mode.
22713 * config/csky/csky-protos.h (csky_valid_fpuv2_mem_operand): Rename
22714 to "csky_valid_mem_constraint_operand" and support new constraint
22716 (csky_get_movedouble_length): New.
22717 (fpuv3_output_move): New.
22718 (fpuv3_const_double): New.
22719 * config/csky/csky.c (csky_option_override): New arch CK860 with fpv3.
22720 (decompose_csky_address): Refine.
22721 (csky_print_operand): New "CONST_DOUBLE" operand.
22722 (csky_output_move): Support fpv3 instructions.
22723 (csky_get_movedouble_length): New.
22724 (fpuv3_output_move): New.
22725 (fpuv3_const_double): New.
22726 (csky_emit_compare): Cover float comparsion.
22727 (csky_emit_compare_float): Refine.
22728 (csky_vaild_fpuv2_mem_operand): Rename to
22729 "csky_valid_mem_constraint_operand" and support new constraint "W".
22730 (ck860_rtx_costs): New.
22731 (csky_rtx_costs): Add the cost calculation of CK860.
22732 (regno_reg_class): New vregs for fpuv3.
22733 (csky_dbx_regno): Likewise.
22734 (csky_cpu_cpp_builtins): New builtin macro for fpuv3.
22735 (csky_conditional_register_usage): Suporrot fpuv3.
22736 (csky_dwarf_register_span): Suporrot fpuv3.
22737 (csky_init_builtins, csky_mangle_type): Support "__fp16" type.
22738 (ck810_legitimate_index_p): Support fp16.
22739 * config/csky/csky.h (TARGET_TLS): ADD CK860.
22740 (CSKY_VREG_P, CSKY_VREG_LO_P, CSKY_VREG_HI_P): Support fpuv3.
22741 (TARGET_SINGLE_FPU): Support fpuv3.
22742 (TARGET_SUPPORT_FPV3): New.
22743 (FIRST_PSEUDO_REGISTER): Change to 202 to hold the new fpuv3 registers.
22744 (FIXED_REGISTERS, CALL_REALLY_USED_REGISTERS, REGISTER_NAMES,
22745 REG_CLASS_CONTENTS): Support fpuv3.
22746 * config/csky/csky.md (movsf): Move to cksy_insn_fpu.md and refine.
22747 (csky_movsf_fpv2): Likewise.
22748 (ck801_movsf): Likewise.
22749 (csky_movsf): Likewise.
22751 (csky_movdf_fpv2): Likewise.
22752 (ck801_movdf): Likewise.
22753 (csky_movdf): Likewise.
22754 (movsicc): Refine. Use "comparison_operatior" instead of
22755 "ordered_comparison_operatior".
22756 (addsicc): Likewise.
22757 (CSKY_FIRST_VFP3_REGNUM, CSKY_LAST_VFP3_REGNUM): New constant.
22758 (call_value_internal_vh): New.
22759 * config/csky/csky_cores.def (CK860): New arch and cpu.
22764 * config/csky/csky_insn_fpu.md: Refactor. Separate all float patterns
22765 into emit-patterns and match-patterns, remain the emit-patterns here,
22766 and move the match-patterns to csky_insn_fpuv2.md or
22767 csky_insn_fpuv3.md.
22768 * config/csky/csky_insn_fpuv2.md: New file for fpuv2 instructions.
22769 * config/csky/csky_insn_fpuv3.md: New file and new patterns for fpuv3
22771 * config/csky/csky_isa.def (fcr): New.
22776 (CK860): New definition for ck860.
22777 * config/csky/csky_tables.opt (ck860): New processors ck860,
22778 ck860f. And new arch ck860.
22783 * config/csky/predicates.md (csky_float_comparsion_operator): Delete
22784 "geu", "gtu", "leu", "ltu", which will never appear at float comparison.
22785 * config/csky/t-csky-elf: Support 860.
22786 * config/csky/t-csky-linux: Likewise.
22787 * doc/md.texi: Add "Q" and "W" constraints for C-SKY.
22789 2021-05-24 Aaron Sawdey <acsawdey@linux.ibm.com>
22791 * config/rs6000/genfusion.pl (gen_logical_addsubf): Refactor to
22792 add generation of logical-add and add-logical fusion pairs.
22793 * config/rs6000/rs6000-cpus.def: Add new fusion to ISA 3.1 mask
22795 * config/rs6000/rs6000.c (rs6000_option_override_internal): Turn on
22796 logical-add and add-logical fusion by default.
22797 * config/rs6000/rs6000.opt: Add -mpower10-fusion-logical-add and
22798 -mpower10-fusion-add-logical options.
22799 * config/rs6000/fusion.md: Regenerate file.
22801 2021-05-24 Aldy Hernandez <aldyh@redhat.com>
22803 * value-range.cc (irange::legacy_equal_p): Check type when
22804 comparing VR_VARYING types.
22805 (range_tests_legacy): Test comparing VARYING ranges of different
22808 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
22810 * config/aarch64/aarch64.c (neoversen1_tunings):
22811 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
22813 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
22815 * config/aarch64/aarch64.c (aarch64_classify_symbol): Use GOT for
22816 extern weak symbols. Limit symbol offsets for non-GOT symbols with
22819 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
22821 * config/arm/neon.md (vec_load_lanesxi<mode>)
22822 (vec_store_lanexoi<mode>): Move ...
22823 * config/arm/vec-common.md: here.
22825 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
22827 * config/arm/neon.md (vec_load_lanesoi<mode>)
22828 (vec_store_lanesoi<mode>): Move ...
22829 * config/arm/vec-common.md: here.
22831 2021-05-24 liuhongt <hongtao.liu@intel.com>
22834 * config/i386/i386.c (ix86_gimple_fold_builtin): Replacing
22835 stmt with GIMPLE_NOP when lhs doesn't exist.
22837 2021-05-23 Uroš Bizjak <ubizjak@gmail.com>
22840 * config/i386/mmx.md (*push<VI_32:mode>2_rex64):
22841 New instruction pattern.
22842 (*push<VI_32:mode>2): Ditto.
22843 (push splitter for SSE registers): New splitter.
22845 2021-05-23 Andrew Pinski <apinski@marvell.com>
22847 * match.pd ((A & C) != 0 ? D : 0): Limit to non pointer types.
22849 2021-05-22 Aaron Sawdey <acsawdey@linux.ibm.com>
22851 * config/rs6000/genfusion.pl (gen_addadd): Fix incorrect attr types.
22852 * config/rs6000/fusion.md: Regenerate file.
22854 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
22856 * config/rs6000/genfusion.pl (gen_addadd): New function.
22857 * config/rs6000/fusion.md: Regenerate file.
22858 * config/rs6000/rs6000-cpus.def: Add
22859 OPTION_MASK_P10_FUSION_2ADD to masks.
22860 * config/rs6000/rs6000.c (rs6000_option_override_internal):
22861 Handle default value of OPTION_MASK_P10_FUSION_2ADD.
22862 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2add.
22864 2021-05-21 Jakub Jelinek <jakub@redhat.com>
22866 PR middle-end/99928
22867 * tree.h (OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET): Define.
22868 * gimplify.c (enum gimplify_omp_var_data): Fix up
22869 GOVD_MAP_HAS_ATTACHMENTS value, add GOVD_FIRSTPRIVATE_IMPLICIT.
22870 (omp_lastprivate_for_combined_outer_constructs): If combined target
22871 has GOVD_FIRSTPRIVATE_IMPLICIT set for the decl, change it to
22872 GOVD_MAP | GOVD_SEEN.
22873 (gimplify_scan_omp_clauses): Set GOVD_FIRSTPRIVATE_IMPLICIT for
22874 firstprivate clauses with OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT.
22875 (gimplify_adjust_omp_clauses): For firstprivate clauses with
22876 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT either clear that bit and
22877 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET too, or remove it and
22878 let it be replaced by implicit map clause.
22880 2021-05-21 Jakub Jelinek <jakub@redhat.com>
22882 PR middle-end/99928
22883 * gimplify.c (omp_lastprivate_for_combined_outer_constructs): New
22885 (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LASTPRIVATE>: Use it.
22886 (gimplify_omp_for): Likewise.
22888 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22890 PR middle-end/90115
22891 * omp-low.c (oacc_privatization_candidate_p): Reject 'static',
22892 'external' in blocks.
22894 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22896 PR middle-end/90115
22897 * flag-types.h (enum openacc_privatization): New.
22898 * params.opt (-param=openacc-privatization): New.
22899 * doc/invoke.texi (openacc-privatization): Document it.
22900 * omp-general.h (get_openacc_privatization_dump_flags): New
22902 * omp-low.c (oacc_privatization_candidate_p): Add diagnostics.
22903 * omp-offload.c (execute_oacc_device_lower)
22904 <IFN_UNIQUE_OACC_PRIVATE>: Re-work diagnostics.
22905 * target.def (goacc.adjust_private_decl): Add 'location_t'
22907 * doc/tm.texi: Regenerate.
22908 * config/gcn/gcn-protos.h (gcn_goacc_adjust_private_decl): Adjust.
22909 * config/gcn/gcn-tree.c (gcn_goacc_adjust_private_decl): Likewise.
22910 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl):
22911 Likewise. Preserve it for...
22912 (nvptx_goacc_expand_var_decl): ... use here.
22914 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22916 * doc/sourcebuild.texi (Other attributes): Document '__OPTIMIZE__'
22919 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22921 PR middle-end/90115
22922 * omp-low.c (oacc_privatization_candidate_p): New function.
22923 (oacc_privatization_scan_clause_chain)
22924 (oacc_privatization_scan_decl_chain): Use it. Also
22925 'gcc_checking_assert' that we're not seeing duplicates.
22927 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22929 PR middle-end/90115
22930 * omp-offload.c (execute_oacc_device_lower): Skip processing if no
22933 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22935 PR middle-end/90115
22936 * omp-offload.c (execute_oacc_device_lower): Explain.
22938 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22940 PR middle-end/90115
22941 * omp-offload.c (execute_oacc_device_lower)
22942 <IFN_UNIQUE_OACC_PRIVATE>: Diagnose and handle for 'level == -1'
22944 * internal-fn.c (expand_UNIQUE): Don't expect
22945 'IFN_UNIQUE_OACC_PRIVATE'.
22947 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22949 PR middle-end/90115
22950 * omp-low.c (lower_omp_for): Don't evaluate OpenMP 'for' clauses.
22952 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22954 PR middle-end/90115
22955 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl)
22956 (nvptx_goacc_expand_var_decl): Tighten.
22958 2021-05-21 Julian Brown <julian@codesourcery.com>
22959 Chung-Lin Tang <cltang@codesourcery.com>
22960 Thomas Schwinge <thomas@codesourcery.com>
22962 PR middle-end/90115
22963 * doc/tm.texi.in (TARGET_GOACC_EXPAND_VAR_DECL)
22964 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Add documentation hooks.
22965 * doc/tm.texi: Regenerate.
22966 * expr.c (expand_expr_real_1): Expand decls using the
22967 expand_var_decl OpenACC hook if defined.
22968 * internal-fn.c (expand_UNIQUE): Handle IFN_UNIQUE_OACC_PRIVATE.
22969 * internal-fn.h (IFN_UNIQUE_CODES): Add OACC_PRIVATE.
22970 * omp-low.c (omp_context): Add oacc_privatization_candidates
22972 (lower_oacc_reductions): Add PRIVATE_MARKER parameter. Insert
22974 (lower_oacc_head_tail): Add PRIVATE_MARKER parameter. Modify
22975 private marker's gimple call arguments, and pass it to
22976 lower_oacc_reductions.
22977 (oacc_privatization_scan_clause_chain)
22978 (oacc_privatization_scan_decl_chain, lower_oacc_private_marker):
22980 (lower_omp_for, lower_omp_target, lower_omp_1): Use these.
22981 * omp-offload.c (convert.h): Include.
22982 (oacc_loop_xform_head_tail): Treat private-variable markers like
22983 fork/join when transforming head/tail sequences.
22984 (struct var_decl_rewrite_info): Add struct.
22985 (oacc_rewrite_var_decl, is_sync_builtin_call): New functions.
22986 (execute_oacc_device_lower): Support rewriting gang-private
22987 variables using target hook, and fix up addr_expr and var_decl
22989 * target.def (adjust_private_decl, expand_var_decl): New hooks.
22990 * config/gcn/gcn-protos.h (gcn_goacc_adjust_gangprivate_decl):
22992 (gcn_goacc_adjust_private_decl): ...this.
22993 * config/gcn/gcn-tree.c (gcn_goacc_adjust_gangprivate_decl):
22995 (gcn_goacc_adjust_private_decl): ...this. Add LEVEL parameter.
22996 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_GANGPRIVATE_DECL): Rename
22997 definition using gcn_goacc_adjust_gangprivate_decl...
22998 (TARGET_GOACC_ADJUST_PRIVATE_DECL): ...to this, using
22999 gcn_goacc_adjust_private_decl.
23000 * config/nvptx/nvptx.c (tree-pretty-print.h): Include.
23001 (gang_private_shared_size): New global variable.
23002 (gang_private_shared_align): Likewise.
23003 (gang_private_shared_sym): Likewise.
23004 (gang_private_shared_hmap): Likewise.
23005 (nvptx_option_override): Initialize these.
23006 (nvptx_file_end): Output gang_private_shared_sym.
23007 (nvptx_goacc_adjust_private_decl, nvptx_goacc_expand_var_decl):
23009 (nvptx_set_current_function): Clear gang_private_shared_hmap.
23010 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Define hook.
23011 (TARGET_GOACC_EXPAND_VAR_DECL): Likewise.
23013 2021-05-21 H.J. Lu <hjl.tools@gmail.com>
23015 * config/i386/i386-modes.def (MAX_BITSIZE_MODE_ANY_INT): Removed.
23017 2021-05-21 Richard Biener <rguenther@suse.de>
23018 H.J. Lu <hjl.tools@gmail.com>
23020 PR middle-end/90773
23021 * expr.c (expand_constructor): Elide expand_constructor if
23022 move by pieces is preferred.
23024 2021-05-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23026 * config/aarch64/aarch64-builtins.c (aarch64_call_properties):
23027 Take a flag and mode value as arguments.
23028 (aarch64_modifies_global_state_p): Likewise.
23029 (aarch64_reads_global_state_p): Likewise.
23030 (aarch64_could_trap_p): Likewise.
23031 (aarch64_get_attributes): Likewise.
23032 (aarch64_init_simd_builtins): Adjust callsite of above.
23033 (aarch64_init_fcmla_laneq_builtins): Use aarch64_get_attributes to get
23034 function attributes to apply to builtins.
23035 (aarch64_init_crc32_builtins): Likewise.
23036 (aarch64_init_builtin_rsqrt): Likewise.
23038 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
23040 * config/rs6000/rs6000.md (define_attr "type"): Add types for fusion.
23041 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10): Use new fusion types.
23042 (gen_2logical): Use new fusion types.
23043 * config/rs6000/fusion.md: Regenerate.
23045 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
23048 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
23049 Handle V4QI and V2HI modes.
23050 (ix86_expand_sse_movcc): Ditto.
23051 * config/i386/mmx.md (*<sat_plusminus:insn><VI_32:mode>3):
23052 New instruction pattern.
23053 (*eq<VI_32:mode>3): Ditto.
23054 (*gt<VI_32:mode>3): Ditto.
23055 (*xop_pcmov_<VI_32:mode>): Ditto.
23056 (mmx_pblendvb32): Ditto.
23057 (mmx_pblendvb64): Rename from mmx_pblendvb.
23058 (vec_cmp<VI_32:mode><VI_32:mode>): New expander.
23059 (vec_cmpu<VI_32:mode><VI_32:mode>): Ditto.
23060 (vcond<VI_32:mode><VI_32:mode>): Ditto.
23061 (vcondu<VI_32:mode><VI_32:mode>): Ditto.
23062 (vcond_mask_<VI_32:mode><VI_32:mode>): Ditto.
23064 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23066 PR tree-optimization/94589
23067 * tree-ssa-phiopt.c (spaceship_replacement): For integral rhs1 and
23068 rhs2, treat x <= 4 equivalently to x < 5 etc. In cmp1 and cmp2 (if
23069 not the same as cmp3) treat <= the same as < and >= the same as >.
23070 Don't require that cond2_phi_edge is true edge, instead take
23071 false/true edges into account based on cmp1/cmp2 comparison kinds.
23073 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
23076 * config/i386/mmx.md (SMAXMIN_MMXMODEI): New mode iterator.
23077 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Macroize expander
23078 from <smaxmin:code>v4hi3> and <smaxmin:code><MMXMODE14:mode>3
23079 using SMAXMIN_MMXMODEI mode iterator.
23080 (*<smaxmin:code>v4qi3): New insn pattern.
23081 (*<smaxmin:code>v2hi3): Ditto.
23082 (SMAXMIN_VI_32): New mode iterator.
23083 (<smaxmin:code><SMAXMIN_VI_32>mode3): New expander.
23084 (UMAXMIN_MMXMODEI): New mode iterator.
23085 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Macroize expander
23086 from <umaxmin:code>v8qi3> and <umaxmin:code><MMXMODE24:mode>3
23087 using UMAXMIN_MMXMODEI mode iterator.
23088 (*<umaxmin:code>v4qi3): New insn pattern.
23089 (*<umaxmin:code>v2hi3): Ditto.
23090 (UMAXMIN_VI_32): New mode iterator.
23091 (<umaxmin:code><UMAXMIN_VI_32>mode3): New expander.
23092 (abs<VI_32:mode>2): New insn pattern.
23093 (ssse3_abs<MMXMODEI:mode>2, abs<MMXMODEI:mode>2): Move from ...
23094 * config/i386/sse.md: ... here.
23096 2021-05-20 Clement Chigot <clement.chigot@atos.net>
23097 David Edelsohn <dje.gcc@gmail.com>
23099 * collect2.c (scan_prog_file): Issue non-fatal warning for
23102 2021-05-20 Jonathan Wakely <jwakely@redhat.com>
23104 * doc/invoke.texi (-Wno-c++11-extensions)
23105 (-Wno-c++14-extensions, -Wno-c++17-extensions)
23106 (-Wno-c++20-extensions, -Wno-c++23-extensions): Document
23109 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
23111 * config/c6x/c6x.c (c6x_output_file_unwind): Use dwarf_debuginfo_p.
23112 * config/darwin.c (darwin_override_options): Likewise.
23113 * config/i386/cygming.h (DBX_REGISTER_NUMBER): Likewise.
23114 * config/i386/darwin.h (DBX_REGISTER_NUMBER): Likewise.
23115 (DWARF2_FRAME_REG_OUT): Likewise.
23116 * config/mips/mips.c (mips_output_filename): Likewise.
23117 * config/rs6000/rs6000.c (rs6000_xcoff_declare_function_name):
23119 (rs6000_dbx_register_number): Likewise.
23120 * dbxout.c: Include flags.h.
23121 * dwarf2cfi.c (cfi_label_required_p): Likewise.
23122 (dwarf2out_do_frame): Likewise.
23123 * except.c: Include flags.h.
23124 * final.c (dwarf2_debug_info_emitted_p): Likewise.
23125 (final_scan_insn_1): Likewise.
23126 * flags.h (dwarf_debuginfo_p): New function declaration.
23127 * opts.c (dwarf_debuginfo_p): New function definition.
23128 * targhooks.c (default_debug_unwind_info): Use dwarf_debuginfo_p.
23129 * toplev.c (process_options): Likewise.
23131 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
23133 * common.opt: Change type to support bitmasks.
23134 * flag-types.h (enum debug_info_type): Rename enumerator constants.
23135 (NO_DEBUG): New bitmask.
23136 (DBX_DEBUG): Likewise.
23137 (DWARF2_DEBUG): Likewise.
23138 (XCOFF_DEBUG): Likewise.
23139 (VMS_DEBUG): Likewise.
23140 (VMS_AND_DWARF2_DEBUG): Likewise.
23141 * flags.h (debug_set_to_format): New function declaration.
23142 (debug_set_count): Likewise.
23143 (debug_set_names): Likewise.
23144 * opts.c (debug_type_masks): Array of bitmasks for debug formats.
23145 (debug_set_to_format): New function definition.
23146 (debug_set_count): Likewise.
23147 (debug_set_names): Likewise.
23148 (set_debug_level): Update access to debug_type_names.
23149 * toplev.c: Likewise.
23151 2021-05-20 Martin Sebor <msebor@redhat.com>
23153 PR middle-end/100684
23154 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Handle C++ lambda.
23156 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
23159 * config/i386/i386.md (isa): Remove x64_bmi.
23160 (enabled): Remove x64_bmi.
23161 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
23162 Remove general register alternative.
23163 (*andnot<VI_32:mode>3): Ditto.
23164 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
23165 (*<any_logic:code><VI_32:mode>3): Ditto.
23167 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
23169 * config/arm/arm.c: Include head files tree-vectorizer.h and
23172 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
23175 * config/i386/mmx.md (Yv_Yw): Revert adding V4QI and V2HI modes.
23176 (*<plusminus:insn><VI_32:mode>3): Use Yw instad of <Yv_Yw> constrint.
23177 (<s>mulv4hi3_highpart): New expander.
23178 (*<s>mulv2hi3_highpart): New insn pattern.
23179 (<s>mulv2hi3_higpart): New expander.
23180 (*<any_shift:insn>v2hi3): New insn pattern.
23181 (<any_shift:insn>v2hi3): New expander.
23182 * config/i386/sse.md (smulhrsv2hi3): New expander.
23183 (*smulhrsv2hi3): New insn pattern.
23185 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
23187 * doc/invoke.texi (vect-inner-loop-cost-factor): Document new
23189 * params.opt (vect-inner-loop-cost-factor): New.
23190 * targhooks.c (default_add_stmt_cost): Replace hardcoded factor
23191 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR, include head file
23192 tree-vectorizer.h and its required ones.
23193 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Replace
23194 hardcoded factor 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR.
23195 * config/arm/arm.c (arm_add_stmt_cost): Likewise.
23196 * config/i386/i386.c (ix86_add_stmt_cost): Likewise.
23197 * config/rs6000/rs6000.c (rs6000_add_stmt_cost): Likewise.
23198 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
23200 (_loop_vec_info::_loop_vec_info): Init inner_loop_cost_factor.
23201 * tree-vectorizer.h (_loop_vec_info): Add inner_loop_cost_factor.
23202 (LOOP_VINFO_INNER_LOOP_COST_FACTOR): New macro.
23204 2021-05-20 Christophe Lyon <christophe.lyon@linaro.org>
23205 Torbjörn Svensson <torbjorn.svensson@st.com>
23208 * doc/cpp.texi (Common Predefined Macros): Document __FILE_NAME__.
23210 2021-05-20 Jakub Jelinek <jakub@redhat.com>
23212 PR middle-end/99928
23213 * gimplify.c (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LINEAR>: For
23214 explicit linear clause when combined with target, make it map(tofrom:)
23215 instead of no clause or firstprivate.
23217 2021-05-20 Jakub Jelinek <jakub@redhat.com>
23219 PR tree-optimization/94589
23220 * match.pd ((X & Y) == X -> (X & ~Y) == 0): Simplify even in presence
23221 of integral conversions.
23223 2021-05-19 Andrew MacLeod <amacleod@redhat.com>
23225 * gimple-range.cc (fur_source::get_operand): New.
23226 (gimple_range_fold): Delete.
23227 (fold_using_range::fold_stmt): Move from gimple_ranger::calc_stmt.
23228 (fold_using_range::range_of_range_op): Move from gimple_ranger.
23229 (fold_using_range::range_of_address): Ditto.
23230 (fold_using_range::range_of_phi): Ditto.
23231 (fold_using_range::range_of_call): Ditto.
23232 (fold_using_range::range_of_builtin_ubsan_call): Move from
23233 range_of_builtin_ubsan_call.
23234 (fold_using_range::range_of_builtin_call): Move from
23235 range_of_builtin_call.
23236 (gimple_ranger::range_of_builtin_call): Delete.
23237 (fold_using_range::range_of_cond_expr): Move from gimple_ranger.
23238 (gimple_ranger::fold_range_internal): New.
23239 (gimple_ranger::range_of_stmt): Use new fold_using_range API.
23240 (fold_using_range::range_of_ssa_name_with_loop_info): Move from
23241 gimple_ranger. Improve ranges of SSA_NAMES when possible.
23242 * gimple-range.h (gimple_ranger): Remove various range_of routines.
23243 (class fur_source): New.
23244 (class fold_using_range): New.
23245 (fur_source::fur_source): New.
23247 * vr-values.c (vr_values::extract_range_basic): Use fold_using_range
23248 instead of range_of_builtin_call.
23250 2021-05-19 Jonathan Wakely <jwakely@redhat.com>
23252 * doc/cpp.texi (Common Predefined Macros): Update documentation
23253 for the __GXX_EXPERIMENTAL_CXX0X__ macro.
23255 2021-05-19 Alex Coplan <alex.coplan@arm.com>
23258 * config/arm/arm.md (nonsecure_call_internal): Always ensure
23259 callee's address is in a register.
23261 2021-05-19 Geng Qi <gengqi@linux.alibaba.com>
23263 * common/config/riscv/riscv-common.c
23264 (riscv_subset_list::parsing_subset_version): Properly parse the letter
23266 (riscv_subset_list::parse_std_ext,
23267 riscv_subset_list::parse_multiletter_ext): To handle errors generated
23268 in riscv_subset_list::parsing_subset_version.
23270 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23272 * config/aarch64/aarch64-simd.md: Use "neon_move_narrow_q"
23273 type attribute in patterns generating XTN(2).
23275 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23277 * config/aarch64/aarch64-simd.md (aarch64_simd_vec_pack_trunc_<mode>):
23278 Remove as duplicate of...
23279 (aarch64_xtn<mode>): This.
23280 (aarch64_xtn2<mode>_le): Move position in file.
23281 (aarch64_xtn2<mode>_be): Move position in file.
23282 (aarch64_xtn2<mode>): Move position in file.
23283 (vec_pack_trunc_<mode>): Define as an expander.
23285 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23287 * config/aarch64/aarch64-simd-builtins.def: Split builtin
23288 generation for aarch64_<sur>q<r>shr<u>n_n<mode> pattern into
23289 separate scalar and vector generators.
23290 * config/aarch64/aarch64-simd.md
23291 (aarch64_<sur>q<r>shr<u>n_n<mode>): Define as an expander and
23293 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_le): This and...
23294 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_be): This.
23295 * config/aarch64/iterators.md: Define SD_HSDI iterator.
23297 2021-05-19 Jonathn Wright <jonathan.wright@arm.com>
23299 * config/aarch64/aarch64-simd.md: Use UNSPEC_SQXTUN instead
23301 * config/aarch64/iterators.md: Remove UNSPEC_SQXTUN2.
23303 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23305 * config/aarch64/aarch64-simd.md (aarch64_<sur>q<r>shr<u>n2_n<mode>):
23306 Implement as an expand emitting a big/little endian
23307 instruction pattern.
23308 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_le): Define.
23309 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_be): Define.
23311 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23313 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn2<mode>):
23314 Implement as an expand emitting a big/little endian
23315 instruction pattern.
23316 (aarch64_<sur><addsub>hn2<mode>_insn_le): Define.
23317 (aarch64_<sur><addsub>hn2<mode>_insn_be): Define.
23318 * config/aarch64/iterators.md: Remove UNSPEC_[R]ADDHN2 and
23319 UNSPEC_[R]SUBHN2 unspecs and ADDSUBHN2 iterator.
23321 2021-05-19 Richard Biener <rguenther@suse.de>
23323 PR middle-end/100672
23324 * fold-const.c (fold_negate_expr_1): Use element_precision.
23325 (negate_expr_p): Likewise.
23327 2021-05-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
23329 * config/aarch64/iterators.md (SVE_PRED_LOAD): New iterator.
23330 (pred_load): New int attribute.
23331 * config/aarch64/aarch64-sve.md
23332 (aarch64_load_<ANY_EXTEND:optab><SVE_HSDI:mode><SVE_PARTIAL_I:mode>): Use
23333 SVE_PRED_LOAD enum iterator and corresponding pred_load attribute.
23334 * config/aarch64/aarch64-sve-builtins-base.cc (expand): Update call to
23335 code_for_aarch64_load.
23337 2021-05-19 Richard Biener <rguenther@suse.de>
23339 * cfgexpand.c (discover_nonconstant_array_refs_r): Make
23340 sure TARGET_MEM_REF bases are expanded as memory.
23341 * tree-ssa-operands.c (operands_scanner::get_tmr_operands):
23342 Do not mark TARGET_MEM_REF bases addressable.
23343 * tree-ssa.c (non_rewritable_mem_ref_base): Handle
23344 TARGET_MEM_REF bases as never rewritable.
23345 * gimple-walk.c (walk_stmt_load_store_addr_ops): Do not
23346 walk TARGET_MEM_REF bases as address-takens.
23347 * tree-ssa-dce.c (ref_may_be_aliased): Handle TARGET_MEM_REF.
23349 2021-05-19 Richard Biener <rguenther@suse.de>
23351 * builtins.c (get_object_alignment_1): Strip outer
23353 * tree-dfa.c (get_ref_base_and_extent): Handle outer
23354 WITH_SIZE_EXPR for size processing and process the
23356 * tree-ssa-alias.c (ao_ref_base_alias_set): Strip
23357 outer WITH_SIZE_EXPR.
23358 (ao_ref_base_alias_ptr_type): Likewise.
23359 (refs_may_alias_p_2): Allow WITH_SIZE_EXPR in ref->ref
23360 and handle that accordingly, stripping it for the
23361 core alias workers.
23362 * tree.c (get_base_address): Handle WITH_SIZE_EXPR by
23363 looking through it instead of returning NULL.
23365 2021-05-19 Jakub Jelinek <jakub@redhat.com>
23367 PR middle-end/100576
23368 * builtins.c (check_read_access): Convert bound to size_type_node if
23371 2021-05-19 Richard Biener <rguenther@suse.de>
23373 * tree-cfg.c (verify_types_in_gimple_min_lval): Inline...
23374 (verify_types_in_gimple_reference): ... here. Sanitize.
23375 (verify_gimple_call): Verify references in LHS and arguments.
23376 (verify_gimple_assign_single): Reject WITH_SIZE_EXPR.
23378 2021-05-19 Uroš Bizjak <ubizjak@gmail.com>
23380 * config/i386/i386.h (VALID_INT_MODE_P):
23381 Add V8QI, V4HI and V2SI modes for TARGET_64BIT.
23382 * config/i386/i386.md (isa): Add x64_bmi.
23383 (enabled): Handle x64_bmi.
23384 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
23385 Add alternative using 64bit general registers.
23386 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
23388 2021-05-19 Jakub Jelinek <jakub@redhat.com>
23390 PR middle-end/99928
23391 * tree.h (OMP_MASTER_COMBINED): Define.
23392 * gimplify.c (gimplify_scan_omp_clauses): Rewrite lastprivate
23393 handling for outer combined/composite constructs to a loop.
23394 Handle lastprivate on combined target.
23395 (gimplify_expr): Formatting fix.
23397 2021-05-19 Xionghu Luo <luoxhu@linux.ibm.com>
23399 * passes.def: Add sink_code pass before store_merging.
23400 * tree-ssa-sink.c (pass_sink_code:clone): New.
23402 2021-05-18 Bill Schmidt <wschmidt@linux.ibm.com>
23404 * config/rs6000/freebsd64.h (ADJUST_FIELD_ALIGN): Remove call to
23405 rs6000_special_adjust_field_align_p.
23406 * config/rs6000/linux64.h (ADJUST_FIELD_ALIGN): Likewise.
23407 * config/rs6000/rs6000-call.c (rs6000_function_arg_boundary):
23408 Remove ABI warning.
23409 (rs6000_function_arg): Likewise.
23410 * config/rs6000/rs6000-protos.h
23411 (rs6000_special_adjust_field_align_p): Remove prototype.
23412 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align_p):
23414 * config/rs6000/sysv4.h (ADJUST_FIELD_ALIGN): Remove call to
23415 rs6000_special_adjust_field_align_p.
23417 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
23420 * config/i386/i386.h (VALID_SSE2_REG_MODE):
23421 Add V4QI and V2HI modes.
23422 (VALID_INT_MODE_P): Ditto.
23423 * config/i386/mmx.md (VI_32): New mode iterator.
23424 (mmxvecsize): Handle V4QI and V2HI.
23426 (mov<VI_32:mode>): New expander.
23427 (*mov<mode>_internal): New insn pattern.
23428 (movmisalign<VI_32:mode>): New expander.
23429 (neg<VI_32:mode>): New expander.
23430 (<plusminus:insn><VI_32:mode>3): New expander.
23431 (*<plusminus:insn><VI_32:mode>3): New insn pattern.
23432 (mulv2hi3): New expander.
23433 (*mulv2hi3): New insn pattern.
23434 (one_cmpl<VI_32:mode>2): New expander.
23435 (*andnot<VI_32:mode>3): New insn pattern.
23436 (<any_logic:code><VI_32:mode>3): New expander.
23437 (*<any_logic:code><VI_32:mode>3): New insn pattern.
23439 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
23441 * config/i386/sse.md (<any_extend:insn>v4qiv4di2):
23442 Fix a mode mismatch with operand 1.
23444 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
23447 * config/i386/i386-expand.c (split_double_mode): Return
23448 temporary register when simplify_gen_subreg fails with
23449 the high half od the paradoxical subreg.
23451 2021-05-18 Richard Biener <rguenther@suse.de>
23453 * cfgexpand.c (expand_one_var): Pass in forced_stack_var
23454 and honor it when expanding.
23455 (expand_used_vars_for_block): Pass through forced_stack_var.
23456 (expand_used_vars): Likewise.
23457 (discover_nonconstant_array_refs_r): Set bits in
23458 forced_stack_vars instead of marking vars TREE_ADDRESSABLE.
23459 (avoid_type_punning_on_regs): Likewise.
23460 (discover_nonconstant_array_refs): Likewise.
23461 (pass_expand::execute): Create and pass down forced_stack_var
23462 bitmap. For parameters and returns temporarily set
23463 TREE_ADDRESSABLE when expand_function_start.
23465 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
23467 * doc/sourcebuild.texi: Document 'dg-note'.
23469 2021-05-18 Tobias Burnus <tobias@codesourcery.com>
23472 * configure: Regenerate.
23473 * configure.ac (BUILD_CFLAG, BUILD_CXXFLAGS): Add $(CFLAGS-$@).
23475 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
23477 * gimple.h (is_gimple_omp_oacc): Tighten.
23478 * omp-low.c (check_omp_nesting_restrictions): Adjust.
23480 2021-05-18 Richard Biener <rguenther@suse.de>
23482 * tree-ssa-operands.c (mark_address_taken): Simplify.
23484 2021-05-18 Martin Liska <mliska@suse.cz>
23486 * config/gcn/mkoffload.c (STR): Redefine.
23487 * config/i386/intelmic-mkoffload.c (STR): Likewise.
23488 * config/nvptx/mkoffload.c (STR): Likewise.
23490 2021-05-18 Martin Liska <mliska@suse.cz>
23492 * common/config/aarch64/aarch64-common.c (aarch64_parse_extension):
23493 Use startswith function instead of strncmp.
23494 * common/config/bfin/bfin-common.c (bfin_handle_option): Likewise.
23495 * common/config/riscv/riscv-common.c (riscv_subset_list::parse): Likewise.
23496 * config/aarch64/aarch64-sve-builtins-shapes.cc (parse_type): Likewise.
23497 * config/aarch64/aarch64.c (aarch64_process_one_target_attr): Likewise.
23498 * config/alpha/alpha.c (alpha_elf_section_type_flags): Likewise.
23499 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
23500 * config/arm/arm.c (arm_file_start): Likewise.
23501 (arm_valid_target_attribute_rec): Likewise.
23502 (thumb1_md_asm_adjust): Likewise.
23503 * config/arm/driver-arm.c (host_detect_local_cpu): Likewise.
23504 * config/avr/avr.c (STR_PREFIX_P): Likewise.
23505 (avr_set_current_function): Likewise.
23506 (avr_handle_addr_attribute): Likewise.
23507 (avr_asm_output_aligned_decl_common): Likewise.
23508 (avr_asm_named_section): Likewise.
23509 (avr_section_type_flags): Likewise.
23510 (avr_asm_select_section): Likewise.
23511 * config/c6x/c6x.c (c6x_in_small_data_p): Likewise.
23512 (c6x_section_type_flags): Likewise.
23513 * config/darwin-c.c (darwin_cfstring_ref_p): Likewise.
23514 (darwin_objc_declare_unresolved_class_reference): Likewise.
23515 (darwin_objc_declare_class_definition): Likewise.
23516 * config/darwin.c (indirect_data): Likewise.
23517 (darwin_encode_section_info): Likewise.
23518 (darwin_objc2_section): Likewise.
23519 (darwin_objc1_section): Likewise.
23520 (machopic_select_section): Likewise.
23521 (darwin_globalize_label): Likewise.
23522 (darwin_label_is_anonymous_local_objc_name): Likewise.
23523 (darwin_asm_named_section): Likewise.
23524 (darwin_asm_output_dwarf_offset): Likewise.
23525 * config/frv/frv.c (frv_string_begins_with): Likewise.
23526 (frv_in_small_data_p): Likewise.
23527 * config/gcn/mkoffload.c (STR): Likewise.
23529 * config/i386/i386-builtins.c (get_builtin_code_for_version): Likewise.
23530 * config/i386/i386-options.c (ix86_option_override_internal): Likewise.
23531 * config/i386/i386.c (x86_64_elf_section_type_flags): Likewise.
23532 (ix86_md_asm_adjust): Likewise.
23533 * config/i386/intelmic-mkoffload.c (STR): Likewise.
23534 * config/i386/winnt.c (i386_pe_asm_named_section): Likewise.
23535 (i386_pe_file_end): Likewise.
23536 * config/ia64/ia64.c (ia64_in_small_data_p): Likewise.
23537 (ia64_section_type_flags): Likewise.
23538 * config/mips/driver-native.c (host_detect_local_cpu): Likewise.
23539 * config/mips/mips.c (mips_handle_interrupt_attr): Likewise.
23540 (mips16_stub_function_p): Likewise.
23541 (mips_function_rodata_section): Likewise.
23542 * config/msp430/msp430.c (msp430_mcu_name): Likewise.
23543 (msp430_function_section): Likewise.
23544 (msp430_section_type_flags): Likewise.
23545 (msp430_expand_helper): Likewise.
23546 * config/nios2/nios2.c (nios2_small_section_name_p): Likewise.
23547 (nios2_valid_target_attribute_rec): Likewise.
23548 * config/nvptx/mkoffload.c (process): Likewise.
23550 * config/pa/som.h: Likewise.
23551 * config/pdp11/pdp11.c (pdp11_output_ident): Likewise.
23552 * config/riscv/riscv.c (riscv_elf_select_rtx_section): Likewise.
23553 * config/rs6000/rs6000.c (VTABLE_NAME_P): Likewise.
23554 (rs6000_inner_target_options): Likewise.
23555 * config/s390/driver-native.c (s390_host_detect_local_cpu): Likewise.
23556 * config/sparc/driver-sparc.c (host_detect_local_cpu): Likewise.
23557 * config/vax/vax.c (vax_output_int_move): Likewise.
23558 * config/vms/vms-ld.c (startswith): Likewise.
23559 (process_args): Likewise.
23561 * config/vms/vms.c: Likewise.
23563 2021-05-18 Jakub Jelinek <jakub@redhat.com>
23565 PR rtl-optimization/100590
23566 * regcprop.c (copyprop_hardreg_forward_1): Only DCE dead sets if
23567 they are NONJUMP_INSN_P.
23569 2021-05-18 Jakub Jelinek <jakub@redhat.com>
23572 * function.c (push_dummy_function): Set DECL_ARTIFICIAL and
23573 DECL_ASSEMBLER_NAME on the fn_decl.
23575 2021-05-18 Jakub Jelinek <jakub@redhat.com>
23577 PR tree-optimization/94589
23578 * tree-ssa-phiopt.c (spaceship_replacement): Pattern match
23579 phi result used in (res & ~1) == 0 comparison as res >= 0 as
23580 res == 2 would be UB with -ffinite-math-only.
23582 2021-05-18 Martin Liska <mliska@suse.cz>
23584 * Makefile.in: genversion.o should depend on DATESTAMP.
23586 2021-05-18 Claudiu Zissulescu <claziss@synopsys.com>
23588 * config/arc/simdext.md (negv2si2): Remove round bracket.
23590 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
23592 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Define
23593 _Bool as macro expanding to _Bool.
23595 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
23598 * tree.c (build_reference_type_for_mode)
23599 (build_pointer_type_for_mode): Pick pointer mode if MODE argument
23601 (build_reference_type, build_pointer_type): Invoke
23602 build_*_type_for_mode with VOIDmode.
23604 2021-05-17 Andrew MacLeod <amacleod@redhat.com>
23606 PR tree-optimization/100512
23607 * gimple-range-cache.cc (ranger_cache::set_global_range): Mark const
23608 and non-zero pointer ranges as invariant.
23609 * gimple-range.cc (gimple_ranger::range_of_stmt): Remove pointer
23610 processing from here.
23612 2021-05-17 Tom de Vries <tdevries@suse.de>
23615 * config/nvptx/nvptx-protos.h (nvptx_output_atomic_insn): Declare
23616 * config/nvptx/nvptx.c (nvptx_output_barrier)
23617 (nvptx_output_atomic_insn): New function.
23618 (nvptx_print_operand): Add support for 'B'.
23619 * config/nvptx/nvptx.md: Use nvptx_output_atomic_insn for atomic
23622 2021-05-17 Aldy Hernandez <aldyh@redhat.com>
23624 PR tree-optimization/100349
23625 * vr-values.c (bounds_of_var_in_loop): Bail if scev returns
23628 2021-05-17 Tamar Christina <tamar.christina@arm.com>
23630 * config/aarch64/driver-aarch64.c (DEFAULT_ARCH): New.
23631 (host_detect_local_cpu): Use it.
23633 2021-05-17 Martin Liska <mliska@suse.cz>
23635 * doc/invoke.texi: Add 2 missing dots.
23637 2021-05-17 Marius Hillenbrand <mhillen@linux.ibm.com>
23639 PR bootstrap/100552
23640 * configure.ac: Replace pattern substitution with call to sed.
23641 * configure: Regenerate.
23643 2021-05-17 Richard Biener <rguenther@suse.de>
23645 PR middle-end/100582
23646 * tree.c (array_at_struct_end_p): Get to the base of the
23647 reference before looking for the underlying decl.
23649 2021-05-17 Joern Rennecke <joern.rennecke@embecosm.com>
23651 * genoutput.c (validate_insn_alternatives) Make "wrong number of
23652 alternatives" message more specific, and remove assumption on where
23655 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
23657 * config/arm/iterators.md (V16): New iterator.
23658 (VH_cvtto): New iterator.
23659 (v_cmp_result): Added V4HF and V8HF support.
23660 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result>): Use VDQWH.
23661 (vcond<mode><mode>): Likewise.
23662 (vcond_mask_<mode><v_cmp_result>): Likewise.
23663 (vcond<VH_cvtto><mode>): New expander.
23665 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
23667 * config/arm/arm-protos.h (arm_expand_vector_compare): Update
23669 * config/arm/arm.c (arm_expand_vector_compare): Add support for
23671 (arm_expand_vcond): Likewise.
23672 * config/arm/iterators.md (supf): Remove VCMPNEQ_S, VCMPEQQ_S,
23673 VCMPEQQ_N_S, VCMPNEQ_N_S.
23674 (VCMPNEQ, VCMPEQQ, VCMPEQQ_N, VCMPNEQ_N): Remove.
23675 * config/arm/mve.md (@mve_vcmp<mve_cmp_op>q_<mode>): Add '@' prefix.
23676 (@mve_vcmp<mve_cmp_op>q_f<mode>): Likewise.
23677 (@mve_vcmp<mve_cmp_op>q_n_f<mode>): Likewise.
23678 (@mve_vpselq_<supf><mode>): Likewise.
23679 (@mve_vpselq_f<mode>"): Likewise.
23680 * config/arm/neon.md (vec_cmp<mode><v_cmp_result): Enable for MVE
23681 and move to vec-common.md.
23682 (vec_cmpu<mode><mode>): Likewise.
23683 (vcond<mode><mode>): Likewise.
23684 (vcond<V_cvtto><mode>): Likewise.
23685 (vcondu<mode><v_cmp_result>): Likewise.
23686 (vcond_mask_<mode><v_cmp_result>): Likewise.
23687 * config/arm/unspecs.md (VCMPNEQ_U, VCMPNEQ_S, VCMPEQQ_S)
23688 (VCMPEQQ_N_S, VCMPNEQ_N_S, VCMPEQQ_U, CMPEQQ_N_U, VCMPNEQ_N_U)
23689 (VCMPGEQ_N_S, VCMPGEQ_S, VCMPGTQ_N_S, VCMPGTQ_S, VCMPLEQ_N_S)
23690 (VCMPLEQ_S, VCMPLTQ_N_S, VCMPLTQ_S, VCMPCSQ_N_U, VCMPCSQ_U)
23691 (VCMPHIQ_N_U, VCMPHIQ_U): Remove.
23692 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result): Moved
23694 (vec_cmpu<mode><mode>): Likewise.
23695 (vcond<mode><mode>): Likewise.
23696 (vcond<V_cvtto><mode>): Likewise.
23697 (vcondu<mode><v_cmp_result>): Likewise.
23698 (vcond_mask_<mode><v_cmp_result>): Likewise. Added unsafe math
23701 2021-05-17 liuhongt <hongtao.liu@intel.com>
23704 * config/i386/i386.c (ix86_gimple_fold_builtin): Use
23705 gsi_insert_seq_before instead.
23707 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
23709 * doc/sourcebuild.texi (arm_qbit_ok): Rename into...
23710 (arm_sat_ok): ...this.
23712 2021-05-17 Martin Liska <mliska@suse.cz>
23714 * lto-wrapper.c (merge_flto_options): Factor out a new function.
23715 (merge_and_complain): Use it.
23716 (run_gcc): Merge also linker command line -flto=foo argument
23719 2021-05-16 Christophe Lyon <christophe.lyon@linaro.org>
23721 * config/arm/arm.h (CPP_SPEC): Remove error message about
23722 -mlittle-endian/-mbig-endian conflict.
23724 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
23726 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
23727 __ROP_PROTECT__ if -mrop-protect is selected.
23729 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
23731 * config/rs6000/rs6000-internal.h (rs6000_stack): Add
23732 rop_hash_save_offset and rop_hash_size.
23733 * config/rs6000/rs6000-logue.c (rs6000_stack_info): Compute
23734 rop_hash_size and rop_hash_save_offset.
23735 (debug_stack_info): Dump rop_hash_save_offset and rop_hash_size.
23736 (rs6000_emit_prologue): Emit hashst[p] in prologue.
23737 (rs6000_emit_epilogue): Emit hashchk[p] in epilogue.
23738 * config/rs6000/rs6000.md (unspec): Add UNSPEC_HASHST and
23740 (hashst): New define_insn.
23741 (hashchk): Likewise.
23743 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
23745 * config/rs6000/rs6000.c (rs6000_option_override_internal):
23746 Disable shrink wrap when inserting ROP-protect instructions.
23747 * config/rs6000/rs6000.opt (mrop-protect): New option.
23748 (mprivileged): Likewise.
23749 * doc/invoke.texi: Document mrop-protect and mprivileged.
23751 2021-05-15 Hans-Peter Nilsson <hp@axis.com>
23753 * reorg.c (fill_slots_from_thread): Reinstate code typoed out in
23756 2021-05-15 Martin Jambor <mjambor@suse.cz>
23759 2021-05-13 Martin Jambor <mjambor@suse.cz>
23761 PR tree-optimization/100453
23762 * tree-sra.c (sra_modify_assign): All const base accesses do not
23763 need refreshing, not just those from decl_pool.
23764 (sra_modify_assign): Do not refresh into a const base decl.
23766 2021-05-15 Jakub Jelinek <jakub@redhat.com>
23768 PR rtl-optimization/100342
23769 * regcprop.c (copy_value): When copying a source reg in a wider
23770 mode than it has recorded for the value, adjust recorded destination
23771 mode too or punt if !REG_CAN_CHANGE_MODE_P.
23773 2021-05-14 Jason Merrill <jason@redhat.com>
23775 * intl.h: Add comments.
23777 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23779 * config/aarch64/aarch64-simd.md
23780 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>_internal): Split into...
23781 (aarch64_sqdmlsl2_lane<mode>_internal): ... This...
23782 (aarch64_sqdmlal2_lane<mode>_internal): ... And this.
23783 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>_internal): Split into ...
23784 (aarch64_sqdmlsl2_laneq<mode>_internal): ... This...
23785 (aarch64_sqdmlal2_laneq<mode>_internal): ... And this.
23786 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>_internal): Split into...
23787 (aarch64_sqdmlsl2_n<mode>_internal): ... This...
23788 (aarch64_sqdmlal2_n<mode>_internal): ... And this.
23790 2021-05-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
23793 * config/arm/arm_neon.h (vtst_s8): Replace call to vtst builtin with it's
23794 boolean logic equivalent.
23795 (vtst_s16): Likewise.
23796 (vtst_s32): Likewise.
23797 (vtst_u8): Likewise.
23798 (vtst_u16): Likewise.
23799 (vtst_u32): Likewise.
23800 (vtst_p8): Likewise.
23801 (vtst_p16): Likewise.
23802 (vtstq_s8): Likewise.
23803 (vtstq_s16): Likewise.
23804 (vtstq_s32): Likewise.
23805 (vtstq_u8): Likewise.
23806 (vtstq_u16): Likewise.
23807 (vtstq_u32): Likewise.
23808 (vtstq_p8): Likewise.
23809 (vtstq_p16): Likewise.
23810 * config/arm/arm_neon_builtins.def: Remove entry for vtst.
23811 * config/arm/neon.md (neon_vtst<mode>): Remove pattern.
23813 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23815 * config/aarch64/aarch64-simd.md (aarch64_sqdmlal2<mode>): Merge into...
23816 (aarch64_sqdml<SBINQOPS:as>l2<mode>): ... This.
23817 (aarch64_sqdmlsl2<mode>): Delete.
23818 (aarch64_sqdmlal2_lane<mode>): Merge this...
23819 (aarch64_sqdmlsl2_lane<mode>): ... And this...
23820 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>): ... Into this.
23821 (aarch64_sqdmlal2_laneq<mode>): Merge this...
23822 (aarch64_sqdmlsl2_laneq<mode>): ... And this...
23823 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>): ... Into this.
23824 (aarch64_sqdmlal2_n<mode>): Merge this...
23825 (aarch64_sqdmlsl2_n<mode>): ... And this...
23826 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>): ... Into this.
23828 2021-05-13 Martin Sebor <msebor@redhat.com>
23830 PR middle-end/100574
23831 * builtins.c (access_ref::get_ref): Improve detection of PHIs with
23832 all null arguments.
23834 2021-05-13 Martin Sebor <msebor@redhat.com>
23836 PR tree-optimization/93100
23837 PR middle-end/98583
23838 * tree-ssa-uninit.c (check_defs): Exclude intrinsic functions that
23839 don't modify referenced objects.
23841 2021-05-13 Martin Jambor <mjambor@suse.cz>
23843 PR tree-optimization/100453
23844 * tree-sra.c (sra_modify_assign): All const base accesses do not
23845 need refreshing, not just those from decl_pool.
23846 (sra_modify_assign): Do not refresh into a const base decl.
23848 2021-05-13 Martin Liska <mliska@suse.cz>
23850 * tree-ssa-dom.c: Remove m_simplifier.
23852 2021-05-13 Richard Earnshaw <rearnsha@arm.com>
23855 * config/arm/arm.c (arm_canonicalize_comparison): Correctly
23856 canonicalize DImode inequality comparisons against the
23857 maximum integral value.
23859 2021-05-13 Jakub Jelinek <jakub@redhat.com>
23861 PR tree-optimization/98856
23862 * config/i386/i386.c (ix86_shift_rotate_cost): Add CODE argument.
23863 Expect V2DI and V4DI arithmetic right shifts to be emulated.
23864 (ix86_rtx_costs, ix86_add_stmt_cost): Adjust ix86_shift_rotate_cost
23866 * config/i386/i386-expand.c (expand_vec_perm_2perm_interleave,
23867 expand_vec_perm_2perm_pblendv): New functions.
23868 (ix86_expand_vec_perm_const_1): Use them.
23869 * config/i386/sse.md (ashr<mode>3<mask_name>): Rename to ...
23870 (<mask_codefor>ashr<mode>3<mask_name>): ... this.
23871 (ashr<mode>3): New define_expand with VI248_AVX512BW iterator.
23872 (ashrv4di3): New define_expand.
23873 (ashrv2di3): Change condition to TARGET_SSE2, handle !TARGET_XOP
23874 and !TARGET_AVX512VL expansion.
23876 2021-05-13 Uroš Bizjak <ubizjak@gmail.com>
23879 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Force mode
23880 sizes < 16 to a register when constructing vpcmov pattern.
23881 * config/i386/mmx.md (*xop_pcmov_<mode>): Use MMXMODE124 mode.
23883 2021-05-13 Martin Liska <mliska@suse.cz>
23885 * gcov-io.c (gcov_write_block): Remove.
23886 (gcov_write_words): Likewise.
23887 (gcov_read_words): Re-implement using gcov_read_bytes.
23888 (gcov_allocate): Remove.
23889 (GCOV_BLOCK_SIZE): Likewise.
23890 (struct gcov_var): Remove most of the fields.
23891 (gcov_position): Implement with ftell.
23892 (gcov_rewrite): Remove setting of start and offset fields.
23893 (from_file): Re-format.
23894 (gcov_open): Remove setbuf call. It should not be needed.
23895 (gcov_close): Remove internal buffer handling.
23896 (gcov_magic): Use __builtin_bswap32.
23897 (gcov_write_counter): Use directly gcov_write_unsigned.
23898 (gcov_write_string): Use direct fwrite and do not round
23900 (gcov_seek): Use directly fseek.
23901 (gcov_write_tag): Use gcov_write_unsigned directly.
23902 (gcov_write_length): Likewise.
23903 (gcov_write_tag_length): Likewise.
23904 (gcov_read_bytes): Use directly fread.
23905 (gcov_read_unsigned): Use gcov_read_words.
23906 (gcov_read_counter): Likewise.
23907 (gcov_read_string): Use gcov_read_bytes.
23908 * gcov-io.h (GCOV_WORD_SIZE): Adjust to reflect
23909 that size is not in bytes, but words (4B).
23910 (GCOV_TAG_FUNCTION_LENGTH): Likewise.
23911 (GCOV_TAG_ARCS_LENGTH): Likewise.
23912 (GCOV_TAG_ARCS_NUM): Likewise.
23913 (GCOV_TAG_COUNTER_LENGTH): Likewise.
23914 (GCOV_TAG_COUNTER_NUM): Likewise.
23915 (GCOV_TAG_SUMMARY_LENGTH): Likewise.
23917 2021-05-13 liuhongt <hongtao.liu@intel.com>
23920 * config/i386/sse.md (ssedoublevecmode): Add attribute for
23921 V64QI/V32HI/V16SI/V4DI.
23922 (ssehalfvecmode): Add attribute for V2DI/V2DF.
23923 (*vec_concatv4si_0): Extend to VI124_128.
23924 (*vec_concat<mode>_0): New pre-reload splitter.
23925 * config/i386/predicates.md (movq_parallel): New predicate.
23927 2021-05-13 Alexandre Oliva <oliva@adacore.com>
23929 * targhooks.c (default_zero_call_used_regs): Retry using
23930 successfully-zeroed registers as sources.
23932 2021-05-12 Tobias Burnus <tobias@codesourcery.com>
23934 * omp-low.c (finish_taskreg_scan): Use the proper detach decl.
23936 2021-05-12 Aldy Hernandez <aldyh@redhat.com>
23939 * gimple-range.cc (range_of_builtin_call): Skip out on
23940 processing __builtin_clz when varying.
23942 2021-05-12 Tom de Vries <tdevries@suse.de>
23945 * config/nvptx/nvptx-opts.h (enum ptx_version): New enum.
23946 * config/nvptx/nvptx.c (nvptx_file_start): Print .version according
23947 to ptx_version_option.
23948 * config/nvptx/nvptx.h (TARGET_PTX_6_3): Define.
23949 * config/nvptx/nvptx.md (define_insn "nvptx_shuffle<mode>")
23950 (define_insn "nvptx_vote_ballot"): Use sync variant for
23952 * config/nvptx/nvptx.opt (ptx_version): Add enum.
23953 (mptx): Add option.
23954 * doc/invoke.texi (Nvidia PTX Options): Add mptx item.
23956 2021-05-12 Richard Biener <rguenther@suse.de>
23958 PR tree-optimization/100566
23959 * tree-ssa-sccvn.c (dominated_by_p_w_unex): Properly handle
23960 allow_back for all edge queries.
23962 2021-05-12 liuhongt <hongtao.liu@intel.com>
23965 * config/i386/sse.md (<sse4_1_avx2>_pblendvb): Add
23966 splitters for pblendvb of NOT mask register.
23968 2021-05-12 Richard Biener <rguenther@suse.de>
23970 PR tree-optimization/100519
23971 * tree-ssa-reassoc.c (can_associate_p): Split into...
23972 (can_associate_op_p): ... this
23973 (can_associate_type_p): ... and this.
23974 (is_reassociable_op): Call can_associate_op_p.
23975 (break_up_subtract_bb): Call the appropriate predicates.
23976 (reassociate_bb): Likewise.
23978 2021-05-12 Martin Liska <mliska@suse.cz>
23980 * lto-wrapper.c (merge_and_complain): Merge -flto=arg options.
23981 (run_gcc): Use -flto argument detection for merged
23984 2021-05-12 Martin Liska <mliska@suse.cz>
23986 * lto-wrapper.c (print_lto_docs_link): New function.
23987 (run_gcc): Print warning about missing job server detection
23988 after we know NR of partitions. Do the same for -flto{,=1}.
23989 * opts.c (get_option_html_page): Support -flto option.
23991 2021-05-12 Martin Liska <mliska@suse.cz>
23993 * lto-wrapper.c (get_options_from_collect_gcc_options): Change
23995 (append_option): Remove.
23996 (find_option): Rework to use the vector type.
23997 (remove_option): Remove.
23998 (merge_and_complain): Use vectors for cl_decoded_option data
24000 (append_compiler_options): Likewise.
24001 (append_diag_options): Likewise.
24002 (append_linker_options): Likewise.
24003 (append_offload_options): Likewise.
24004 (compile_offload_image): Likewise.
24005 (compile_images_for_offload_targets): Likewise.
24006 (find_and_merge_options): Likewise.
24007 (run_gcc): Likewise.
24009 2021-05-12 Bernd Edlinger <bernd.edlinger@hotmail.de>
24012 * dwarf2out.c (dwarf2out_finish): Set
24013 have_multiple_function_sections with multi-range text_section.
24015 2021-05-12 Martin Liska <mliska@suse.cz>
24017 PR bootstrap/100560
24018 * Makefile.in: Remove version.h from linker command line.
24020 2021-05-12 Richard Biener <rguenther@suse.de>
24022 PR middle-end/100547
24023 * rtl.h (rtvec_alloc): Make argument size_t.
24024 * rtl.c (rtvec_alloc): Verify the count is less than INT_MAX.
24026 2021-05-12 Jakub Jelinek <jakub@redhat.com>
24028 PR middle-end/100508
24029 * cfgexpand.c (expand_debug_expr): For DEBUG_EXPR_DECL with vector
24030 type, don't reuse DECL_RTL if it has different mode, instead force
24031 creation of a new DEBUG_EXPR.
24033 2021-05-12 Jakub Jelinek <jakub@redhat.com>
24034 Marc Glisse <marc.glisse@inria.fr>
24036 PR tree-optimization/94589
24037 * match.pd ((X & Y) == X -> (X & ~Y) == 0,
24038 (X | Y) == Y -> (X & ~Y) == 0): New GIMPLE simplifications.
24040 2021-05-12 Uroš Bizjak <ubizjak@gmail.com>
24043 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Handle V2SF mode.
24044 * config/i386/mmx.md (MMXMODE124): New mode iterator.
24046 (mmxintvecmode): New mode attribute.
24047 (mmxintvecmodelower): Ditto.
24048 (*mmx_maskcmpv2sf3_comm): New insn pattern.
24049 (*mmx_maskcmpv2sf3): Ditto.
24050 (vec_cmpv2sfv2si): New expander.
24051 (vcond<V2FI:mode>v2si): Ditto.
24052 (mmx_vlendvps): New insn pattern.
24053 (vcond<MMXMODE124:mode><MMXMODEI:mode>): Also handle V2SFmode.
24054 (vcondu<MMXMODE124:mode><MMXMODEI:mode>): Ditto.
24055 (vcond_mask_<mode><mmxintvecmodelower>): Ditto.
24057 2021-05-11 Martin Sebor <msebor@redhat.com>
24059 PR middle-end/21433
24060 * expr.c (expand_expr_real_1): Replace unreachable code with an assert.
24062 2021-05-11 Richard Biener <rguenther@suse.de>
24064 * gimple-fold.c (gimple_fold_call): Do not call
24065 maybe_fold_reference on call arguments or the static chain.
24066 (fold_stmt_1): Do not call maybe_fold_reference on GIMPLE_ASM
24069 2021-05-11 Martin Liska <mliska@suse.cz>
24071 * builtins.def (DEF_HSAIL_BUILTIN): Remove.
24072 (DEF_HSAIL_ATOMIC_BUILTIN): Likewise.
24073 (DEF_HSAIL_SAT_BUILTIN): Likewise.
24074 (DEF_HSAIL_INTR_BUILTIN): Likewise.
24075 (DEF_HSAIL_CVT_ZEROI_SAT_BUILTIN): Likewise.
24076 * doc/frontends.texi: Remove BRIG.
24077 * doc/install.texi: Likewise.
24078 * doc/invoke.texi: Likewise.
24079 * doc/standards.texi: Likewise.
24080 * brig-builtins.def: Removed.
24081 * brig/ChangeLog: Removed.
24082 * brig/Make-lang.in: Removed.
24083 * brig/brig-builtins.h: Removed.
24084 * brig/brig-c.h: Removed.
24085 * brig/brig-lang.c: Removed.
24086 * brig/brigfrontend/brig-arg-block-handler.cc: Removed.
24087 * brig/brigfrontend/brig-atomic-inst-handler.cc: Removed.
24088 * brig/brigfrontend/brig-basic-inst-handler.cc: Removed.
24089 * brig/brigfrontend/brig-branch-inst-handler.cc: Removed.
24090 * brig/brigfrontend/brig-cmp-inst-handler.cc: Removed.
24091 * brig/brigfrontend/brig-code-entry-handler.cc: Removed.
24092 * brig/brigfrontend/brig-code-entry-handler.h: Removed.
24093 * brig/brigfrontend/brig-comment-handler.cc: Removed.
24094 * brig/brigfrontend/brig-control-handler.cc: Removed.
24095 * brig/brigfrontend/brig-copy-move-inst-handler.cc: Removed.
24096 * brig/brigfrontend/brig-cvt-inst-handler.cc: Removed.
24097 * brig/brigfrontend/brig-fbarrier-handler.cc: Removed.
24098 * brig/brigfrontend/brig-function-handler.cc: Removed.
24099 * brig/brigfrontend/brig-function.cc: Removed.
24100 * brig/brigfrontend/brig-function.h: Removed.
24101 * brig/brigfrontend/brig-inst-mod-handler.cc: Removed.
24102 * brig/brigfrontend/brig-label-handler.cc: Removed.
24103 * brig/brigfrontend/brig-lane-inst-handler.cc: Removed.
24104 * brig/brigfrontend/brig-machine.c: Removed.
24105 * brig/brigfrontend/brig-machine.h: Removed.
24106 * brig/brigfrontend/brig-mem-inst-handler.cc: Removed.
24107 * brig/brigfrontend/brig-module-handler.cc: Removed.
24108 * brig/brigfrontend/brig-queue-inst-handler.cc: Removed.
24109 * brig/brigfrontend/brig-seg-inst-handler.cc: Removed.
24110 * brig/brigfrontend/brig-signal-inst-handler.cc: Removed.
24111 * brig/brigfrontend/brig-to-generic.cc: Removed.
24112 * brig/brigfrontend/brig-to-generic.h: Removed.
24113 * brig/brigfrontend/brig-util.cc: Removed.
24114 * brig/brigfrontend/brig-util.h: Removed.
24115 * brig/brigfrontend/brig-variable-handler.cc: Removed.
24116 * brig/brigfrontend/hsa-brig-format.h: Removed.
24117 * brig/brigfrontend/phsa.h: Removed.
24118 * brig/brigspec.c: Removed.
24119 * brig/config-lang.in: Removed.
24120 * brig/gccbrig.texi: Removed.
24121 * brig/lang-specs.h: Removed.
24122 * brig/lang.opt: Removed.
24124 2021-05-11 Richard Biener <rguenther@suse.de>
24127 * ipa-param-manipulation.c
24128 (ipa_param_body_adjustments::modify_call_stmt): Avoid
24129 altering SSA_NAME_DEF_STMT by adjusting the calls LHS
24130 via gimple_call_lhs_ptr.
24132 2021-05-11 Alex Coplan <alex.coplan@arm.com>
24135 * config/arm/arm.c (cmse_nonsecure_call_inline_register_clear):
24136 Avoid emitting CFA adjusts on the sp if we have the fp.
24138 2021-05-11 Richard Sandiford <richard.sandiford@arm.com>
24140 * config/aarch64/iterators.md (VMUL_CHANGE_NLANES): Delete.
24141 (VMULD): New iterator.
24142 (VCOND): Handle V4HF and V8HF.
24143 (VCONQ): Fix entry for V2SF.
24144 * config/aarch64/aarch64-simd.md (mul_lane<mode>3): Use VMULD
24145 instead of VMUL. Use a 64-bit vector mode for the indexed operand.
24146 (*aarch64_mul3_elt_<vswap_width_name><mode>): Merge with...
24147 (mul_laneq<mode>3): ...this define_insn. Use VMUL instead of VDQSF.
24148 Use a 128-bit vector mode for the indexed operand. Use stype for
24149 the scheduling type.
24151 2021-05-11 Richard Biener <rguenther@suse.de>
24153 * gimple-fold.c (maybe_fold_reference): Only return
24154 is_gimple_min_invariant values.
24156 2021-05-11 Richard Biener <rguenther@suse.de>
24158 PR middle-end/100509
24159 * gimple-fold.c (fold_gimple_assign): Only call
24160 get_symbol_constant_value on register type symbols.
24162 2021-05-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
24163 Joe Ramsay <joe.ramsay@arm.com>
24166 * config/arm/arm_mve.h (__arm_vstrwq_scatter_offset): Fix wrong arguments.
24167 (__arm_vcmpneq): Remove duplicate definition.
24168 (__arm_vstrwq_scatter_offset_p): Likewise.
24169 (__arm_vmaxq_x): Likewise.
24170 (__arm_vmlsdavaq): Likewise.
24171 (__arm_vmlsdavaxq): Likewise.
24172 (__arm_vmlsdavq_p): Likewise.
24173 (__arm_vmlsdavxq_p): Likewise.
24174 (__arm_vrmlaldavhaq): Likewise.
24175 (__arm_vstrbq_p): Likewise.
24176 (__arm_vstrbq_scatter_offset): Likewise.
24177 (__arm_vstrbq_scatter_offset_p): Likewise.
24178 (__arm_vstrdq_scatter_offset): Likewise.
24179 (__arm_vstrdq_scatter_offset_p): Likewise.
24180 (__arm_vstrdq_scatter_shifted_offset): Likewise.
24181 (__arm_vstrdq_scatter_shifted_offset_p): Likewise.
24183 2021-05-11 Jakub Jelinek <jakub@redhat.com>
24185 PR middle-end/100471
24186 * omp-low.c (lower_omp_task_reductions): For OMP_TASKLOOP, if data
24187 is 0, bypass the reduction loop including
24188 GOMP_taskgroup_reduction_unregister call.
24190 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
24192 * config/rs6000/rs6000.c (struct rs6000_cost_data): New member
24193 costing_for_scalar.
24194 (rs6000_density_test): Early return if costing_for_scalar is true.
24195 (rs6000_init_cost): Init costing_for_scalar of rs6000_cost_data.
24197 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
24199 * doc/tm.texi: Regenerated.
24200 * target.def (init_cost): Add new parameter costing_for_scalar.
24201 * targhooks.c (default_init_cost): Adjust for new parameter.
24202 * targhooks.h (default_init_cost): Likewise.
24203 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Likewise.
24204 (vect_compute_single_scalar_iteration_cost): Likewise.
24205 (vect_analyze_loop_2): Likewise.
24206 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Likewise.
24207 (vect_bb_vectorization_profitable_p): Likewise.
24208 * tree-vectorizer.h (init_cost): Likewise.
24209 * config/aarch64/aarch64.c (aarch64_init_cost): Likewise.
24210 * config/i386/i386.c (ix86_init_cost): Likewise.
24211 * config/rs6000/rs6000.c (rs6000_init_cost): Likewise.
24213 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
24215 * config/rs6000/rs6000.c (rs6000_vect_nonmem): Renamed to
24216 vect_nonmem and moved into...
24217 (struct rs6000_cost_data): ...here.
24218 (rs6000_init_cost): Use vect_nonmem of cost_data instead.
24219 (rs6000_add_stmt_cost): Likewise.
24220 (rs6000_finish_cost): Likewise.
24222 2021-05-10 Eric Botcazou <ebotcazou@adacore.com>
24224 * range-op.cc (get_bool_state): Adjust head comment.
24225 (operator_not_equal::op1_range): Fix comment.
24226 (operator_bitwise_xor::op1_range): Remove call to gcc_unreachable.
24228 2021-05-10 Martin Sebor <msebor@redhat.com>
24230 PR middle-end/100425
24231 PR middle-end/100510
24232 * gimple-ssa-warn-alloca.c (pass_walloca::firast_time_p): Rename...
24233 (pass_walloca::xlimit_certain_p): ...to this.
24234 (pass_walloca::gate): Execute for any kind of handled warning.
24235 (pass_walloca::execute): Avoid issuing "maybe" and "unbounded"
24236 warnings when xlimit_certain_p is set.
24238 2021-05-10 Pat Haugen <pthaugen@linux.ibm.com>
24240 * config/rs6000/rs6000.c (rs6000_ira_change_pseudo_allocno_class):
24241 Return ALTIVEC_REGS if that is best_class.
24242 (rs6000_compute_pressure_classes): Add ALTIVEC_REGS.
24244 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24246 * config/arm/arm.h (CPP_SPEC): Remove error message about
24249 2021-05-10 Martin Jambor <mjambor@suse.cz>
24251 * ipa-prop.h (IPA_NODE_REF): Removed.
24252 (IPA_NODE_REF_GET_CREATE): Likewise.
24253 (IPA_EDGE_REF): Likewise.
24254 (IPA_EDGE_REF_GET_CREATE): Likewise.
24255 (IS_VALID_JUMP_FUNC_INDEX): Likewise.
24256 * ipa-cp.c (print_all_lattices): Replaced IPA_NODE_REF with a direct
24257 use of ipa_node_params_sum.
24258 (ipcp_versionable_function_p): Likewise.
24259 (push_node_to_stack): Likewise.
24260 (pop_node_from_stack): Likewise.
24261 (set_single_call_flag): Replaced two IPA_NODE_REF with one single
24262 direct use of ipa_node_params_sum.
24263 (initialize_node_lattices): Replaced IPA_NODE_REF with a direct use of
24264 ipa_node_params_sum.
24265 (ipa_context_from_jfunc): Replaced IPA_EDGE_REF with a direct use of
24267 (ipcp_verify_propagated_values): Replaced IPA_NODE_REF with a direct
24268 use of ipa_node_params_sum.
24269 (self_recursively_generated_p): Likewise.
24270 (propagate_scalar_across_jump_function): Likewise.
24271 (propagate_context_across_jump_function): Replaced IPA_EDGE_REF with a
24272 direct use of ipa_edge_args_sum, moved the lookup after the early
24273 exit. Replaced IPA_NODE_REF with a direct use of ipa_node_params_sum.
24274 (propagate_bits_across_jump_function): Replaced IPA_NODE_REF with
24275 direct uses of ipa_node_params_sum.
24276 (propagate_vr_across_jump_function): Likewise.
24277 (propagate_aggregate_lattice): Likewise.
24278 (propagate_aggs_across_jump_function): Likewise.
24279 (propagate_constants_across_call): Likewise, also replaced
24280 IPA_EDGE_REF with a direct use of ipa_edge_args_sum.
24281 (good_cloning_opportunity_p): Replaced IPA_NODE_REF with a direct use
24282 of ipa_node_params_sum.
24283 (estimate_local_effects): Likewise.
24284 (add_all_node_vals_to_toposort): Likewise.
24285 (propagate_constants_topo): Likewise.
24286 (ipcp_propagate_stage): Likewise.
24287 (ipcp_discover_new_direct_edges): Likewise.
24288 (calls_same_node_or_its_all_contexts_clone_p): Likewise.
24289 (cgraph_edge_brings_value_p): Likewise (in both overloaded functions).
24290 (get_info_about_necessary_edges): Likewise.
24291 (want_remove_some_param_p): Likewise.
24292 (create_specialized_node): Likewise.
24293 (self_recursive_pass_through_p): Likewise.
24294 (self_recursive_agg_pass_through_p): Likewise.
24295 (find_more_scalar_values_for_callers_subset): Likewise and also
24296 replaced IPA_EDGE_REF with direct uses of ipa_edge_args_sum, in one
24297 case replacing two of those with a single query.
24298 (find_more_contexts_for_caller_subset): Likewise for the
24299 ipa_polymorphic_call_context overload.
24300 (intersect_aggregates_with_edge): Replaced IPA_EDGE_REF with a direct
24301 use of ipa_edge_args_sum. Replaced IPA_NODE_REF with direct uses of
24302 ipa_node_params_sum.
24303 (find_aggregate_values_for_callers_subset): Likewise, also reusing
24304 results of ipa_edge_args_sum->get.
24305 (cgraph_edge_brings_all_scalars_for_node): Replaced IPA_NODE_REF with
24306 direct uses of ipa_node_params_sum, replaced IPA_EDGE_REF with a
24307 direct use of ipa_edge_args_sum.
24308 (cgraph_edge_brings_all_agg_vals_for_node): Likewise, moved node
24309 summary query after the early exit and reused the result later.
24310 (decide_about_value): Replaced IPA_NODE_REF with a direct use of
24311 ipa_node_params_sum.
24312 (decide_whether_version_node): Likewise. Removed re-querying for
24313 summaries after cloning.
24314 (spread_undeadness): Replaced IPA_NODE_REF with a direct use of
24315 ipa_node_params_sum.
24316 (has_undead_caller_from_outside_scc_p): Likewise, reusing results of
24318 (identify_dead_nodes): Likewise.
24319 (ipcp_store_bits_results): Replaced IPA_NODE_REF with direct uses of
24320 ipa_node_params_sum.
24321 (ipcp_store_vr_results): Likewise.
24322 * ipa-fnsummary.c (evaluate_properties_for_edge): Likewise.
24323 (ipa_fn_summary_t::duplicate): Likewise.
24324 (analyze_function_body): Likewise.
24325 (estimate_calls_size_and_time): Likewise.
24326 (ipa_cached_call_context::duplicate_from): Likewise.
24327 (ipa_call_context::equal_to): Likewise.
24328 (remap_edge_params): Likewise.
24329 (ipa_merge_fn_summary_after_inlining): Likewise.
24330 (inline_read_section): Likewise.
24331 * ipa-icf.c (sem_function::param_used_p): Likewise.
24332 * ipa-modref.c (compute_parm_map): Likewise.
24333 (compute_parm_map): Replaced IPA_EDGE_REF with a direct use of
24335 (get_access_for_fnspec): Replaced IPA_NODE_REF with a direct use of
24336 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
24338 * ipa-profile.c (check_argument_count): Likewise.
24339 * ipa-prop.c (ipa_alloc_node_params): Replaced IPA_NODE_REF_GET_CREATE
24340 with a direct use of ipa_node_params_sum.
24341 (ipa_initialize_node_params): Likewise.
24342 (ipa_print_node_jump_functions_for_edge): Replaced IPA_EDGE_REF with a
24343 direct use of ipa_edge_args_sum and reused the query result.
24344 (ipa_compute_jump_functions_for_edge): Replaced IPA_NODE_REF with a
24345 direct use of ipa_node_params_sum and replaced IPA_EDGE_REF with a
24346 direct use of ipa_edge_args_sum.
24347 (ipa_note_param_call): Replaced IPA_NODE_REF with a direct use of
24348 ipa_node_params_sum and reused the result of the query.
24349 (ipa_analyze_node): Likewise.
24350 (ipa_analyze_controlled_uses): Replaced IPA_NODE_REF with a direct use
24351 of ipa_node_params_sum.
24352 (update_jump_functions_after_inlining): Replaced IPA_EDGE_REF with
24353 direct uses of ipa_edge_args_sum.
24354 (update_indirect_edges_after_inlining): Replaced IPA_NODE_REF with
24355 direct uses of ipa_node_params_sum and replaced IPA_EDGE_REF with a
24356 direct use of ipa_edge_args_sum. Removed superficial re-querying the
24358 (propagate_controlled_uses): Replaced IPA_NODE_REF with direct uses of
24359 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
24361 (ipa_propagate_indirect_call_infos): Replaced IPA_EDGE_REF with a
24362 direct use of ipa_edge_args_sum.
24363 (ipa_edge_args_sum_t::duplicate): Replaced IPA_NODE_REF with a direct
24364 use of ipa_node_params_sum.
24365 (ipa_print_node_params): Likewise.
24366 (ipa_write_node_info): Likewise and also replaced IPA_EDGE_REF with
24367 direct uses of ipa_edge_args_sum.
24368 (ipa_read_edge_info): Replaced IPA_EDGE_REF with a direct use of
24370 (ipa_read_node_info): Replaced IPA_NODE_REF with a direct use of
24371 ipa_node_params_sum.
24372 (ipa_prop_write_jump_functions): Likewise. Move variable node to the
24373 scopes where it is used.
24375 2021-05-10 Uroš Bizjak <ubizjak@gmail.com>
24377 * config/i386/i386-expand.c (ix86_expand_sse_movcc)
24378 <case E_V2SImode>: Force op_true to register.
24380 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24382 * config/arm/iterators.md (MVE_FP_COMPARISONS): New.
24383 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_f<mode>)
24384 (mve_vcmp<mve_cmp_op>q_n_f<mode>): New, merge all vcmp_*f*
24386 (mve_vcmpeqq_f<mode>, mve_vcmpeqq_n_f<mode>, mve_vcmpgeq_f<mode>)
24387 (mve_vcmpgeq_n_f<mode>, mve_vcmpgtq_f<mode>)
24388 (mve_vcmpgtq_n_f<mode>, mve_vcmpleq_f<mode>)
24389 (mve_vcmpleq_n_f<mode>, mve_vcmpltq_f<mode>)
24390 (mve_vcmpltq_n_f<mode>, mve_vcmpneq_f<mode>)
24391 (mve_vcmpneq_n_f<mode>): Remove.
24392 * config/arm/unspecs.md (VCMPEQQ_F, VCMPEQQ_N_F, VCMPGEQ_F)
24393 (VCMPGEQ_N_F, VCMPGTQ_F, VCMPGTQ_N_F, VCMPLEQ_F, VCMPLEQ_N_F)
24394 (VCMPLTQ_F, VCMPLTQ_N_F, VCMPNEQ_F, VCMPNEQ_N_F): Remove.
24396 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24398 * config/arm/iterators.md (MVE_COMPARISONS): New.
24400 (mve_cmp_type): New.
24401 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_<mode>): New, merge all
24403 (mve_vcmpneq_<mode>, mve_vcmpcsq_n_<mode>, mve_vcmpcsq_<mode>)
24404 (mve_vcmpeqq_n_<mode>, mve_vcmpeqq_<mode>, mve_vcmpgeq_n_<mode>)
24405 (mve_vcmpgeq_<mode>, mve_vcmpgtq_n_<mode>, mve_vcmpgtq_<mode>)
24406 (mve_vcmphiq_n_<mode>, mve_vcmphiq_<mode>, mve_vcmpleq_n_<mode>)
24407 (mve_vcmpleq_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
24408 (mve_vcmpneq_n_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
24409 (mve_vcmpneq_n_<mode>): Remove.
24411 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24413 * config/arm/arm_mve.h (__arm_vcmp*): Remove 's' suffix.
24414 * config/arm/arm_mve_builtins.def (vcmp*): Remove 's' suffix.
24415 * config/arm/mve.md (mve_vcmp*): Remove 's' suffix in pattern
24418 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24420 * config/arm/arm_mve_builtins.def (vcmpneq_u): Remove.
24421 (vcmpneq_n_u): Likewise.
24422 (vcmpeqq_u,): Likewise.
24423 (vcmpeqq_n_u): Likewise.
24424 * config/arm/iterators.md (supf): Remove VCMPNEQ_U, VCMPEQQ_U,
24425 VCMPEQQ_N_U and VCMPNEQ_N_U.
24426 * config/arm/mve.md (mve_vcmpneq): Remove <supf> iteration.
24427 (mve_vcmpeqq_n): Likewise.
24428 (mve_vcmpeqq): Likewise.
24429 (mve_vcmpneq_n): Likewise.
24431 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24433 * config/arm/arm_mve.h (__arm_vcmpeq*u*, __arm_vcmpne*u*): Call
24434 the 's' version of the builtin.
24436 2021-05-10 Richard Biener <rguenther@suse.de>
24438 PR tree-optimization/100492
24439 * tree-loop-distribution.c (find_seed_stmts_for_distribution):
24440 Find nothing when the loop contains an irreducible region.
24442 2021-05-10 Richard Biener <rguenther@suse.de>
24444 PR middle-end/100464
24446 * gimple-fold.c (canonicalize_constructor_val): Do not set
24449 2021-05-10 Richard Biener <rguenther@suse.de>
24451 PR tree-optimization/100434
24452 * tree-ssa-dse.c (initialize_ao_ref_for_dse): Handle
24454 (dse_optimize_stmt): Handle call LHS by dropping the
24455 LHS or the whole call if it doesn't have other
24457 (pass_dse::execute): Adjust.
24459 2021-05-10 Martin Liska <mliska@suse.cz>
24461 * Makefile.in: Add missing genversion rule.
24463 2021-05-10 Alex Coplan <alex.coplan@arm.com>
24466 * config/arm/mve.md (*mve_mov<mode>): Simplify output code. Use
24467 vldrw.u32 and vstrw.32 for V2D[IF]mode loads and stores.
24469 2021-05-10 Martin Liska <mliska@suse.cz>
24471 * builtins.c (is_builtin_name): Use startswith
24472 function instead of strncmp.
24473 * collect2.c (main): Likewise.
24474 (has_lto_section): Likewise.
24475 (scan_libraries): Likewise.
24476 * coverage.c (coverage_checksum_string): Likewise.
24477 (coverage_init): Likewise.
24478 * dwarf2out.c (is_cxx): Likewise.
24479 (gen_compile_unit_die): Likewise.
24480 * gcc-ar.c (main): Likewise.
24481 * gcc.c (init_spec): Likewise.
24482 (read_specs): Likewise.
24483 (execute): Likewise.
24484 (check_live_switch): Likewise.
24485 * genattrtab.c (write_attr_case): Likewise.
24486 (IS_ATTR_GROUP): Likewise.
24487 * gencfn-macros.c (main): Likewise.
24488 * gengtype.c (type_for_name): Likewise.
24489 (gen_rtx_next): Likewise.
24490 (get_file_langdir): Likewise.
24491 (write_local): Likewise.
24492 * genmatch.c (get_operator): Likewise.
24493 (get_operand_type): Likewise.
24494 (expr::gen_transform): Likewise.
24495 * genoutput.c (validate_optab_operands): Likewise.
24496 * incpath.c (add_sysroot_to_chain): Likewise.
24497 * langhooks.c (lang_GNU_C): Likewise.
24498 (lang_GNU_CXX): Likewise.
24499 (lang_GNU_Fortran): Likewise.
24500 (lang_GNU_OBJC): Likewise.
24501 * lto-wrapper.c (run_gcc): Likewise.
24502 * omp-general.c (omp_max_simt_vf): Likewise.
24503 * omp-low.c (omp_runtime_api_call): Likewise.
24504 * opts-common.c (parse_options_from_collect_gcc_options): Likewise.
24505 * read-rtl-function.c (function_reader::read_rtx_operand_r): Likewise.
24506 * real.c (real_from_string): Likewise.
24507 * selftest.c (assert_str_startswith): Likewise.
24508 * timevar.c (timer::validate_phases): Likewise.
24509 * tree.c (get_file_function_name): Likewise.
24510 * ubsan.c (ubsan_use_new_style_p): Likewise.
24511 * varasm.c (default_function_rodata_section): Likewise.
24512 (incorporeal_function_p): Likewise.
24513 (default_section_type_flags): Likewise.
24514 * system.h (startswith): Define startswith.
24516 2021-05-10 Martin Liska <mliska@suse.cz>
24518 * bitmap.h (class auto_bitmap): Remove
24519 __cplusplus >= 201103.
24520 * config/aarch64/aarch64.c: Likewise.
24521 * gimple-ssa-store-merging.c (store_immediate_info::store_immediate_info):
24523 * sbitmap.h: Likewise.
24525 2021-05-10 Martin Liska <mliska@suse.cz>
24527 * Makefile.in: Rename gcov-iov to genversion and depend
24528 on version.h (instead of gcov-iov.h).
24529 * gcov-io.h: Include version.h instread of gcov-iov.h.
24530 * gengtype-state.c (read_state_version): Likewise.
24531 * gcov-iov.c: Moved to...
24532 * genversion.c: ...here.
24533 * lto-streamer.h (LTO_major_version): Define it with
24535 * version.c: Removed.
24536 * version.h: Removed.
24538 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24540 * config/arc/arc.md (UNSPEC_ARC_DMPYWH): Define.
24541 * config/arc/simdext.md (VCT): Add predicates for iterator
24544 (voptab): Likewise.
24545 (vec_widen_<V_US>mult_hi_v4hi): Change pattern predicate.
24546 (<voptab>v2si3): New patterns.
24548 (reduc_plus_scal_v4hi): Likewise.
24549 (reduc_plus_scal_v2si): Likewise.
24550 (vec_duplicatev2si): Likewise.
24551 (vec_duplicatev4hi): Likewise.
24553 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24555 * config/arc/simdext.md: Format and cleanup file.
24557 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24559 * config/arc/simdext.md (movmisalignv2hi): Allow misaligned access
24560 only when munaligned-access option is on.
24561 (movmisalign<mode>): Likewise.
24563 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24565 * common/config/arc/arc-common.c (arc_handle_option): Remove dot
24567 * config/arc/arc.c (arc_reorg): Remove underscore from string.
24569 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24571 * config/arc/arc.h (CLZ_DEFINED_VALUE_AT_ZERO): Define.
24572 (CTZ_DEFINED_VALUE_AT_ZERO): Likewise.
24573 * config/arc/arc.md (clrsbsi2): Cleanup pattern.
24574 (norm_f): Likewise.
24577 (clzsi2): Use fls instruction when available.
24578 (arc_clzsi2): Likewise.
24580 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24582 * config/arc/arc.h (ADDITIONAL_REGISTER_NAMES): Add r26 and r27.
24584 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24586 * doc/extend.texi (__builtin_arc_sr): Swap arguments.
24588 2021-05-10 Bernd Edlinger <bernd.edlinger@hotmail.de>
24590 PR middle-end/100467
24591 * toplev.c (compile_file): Call insn_locations_init before
24592 targetm.asm_out.code_end.
24594 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
24597 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
24599 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
24601 2021-05-07 Jakub Jelinek <jakub@redhat.com>
24602 Andrew Stubbs <amd@codesourcery.com>
24605 * builtins.c (try_store_by_multiple_pieces): Use force_operand for
24606 emit_move_insn operands.
24608 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
24610 * cfgexpand.c (expand_gimple_basic_block): Do not inherit a current
24611 location for the outgoing edges of an empty block.
24612 * dwarf2out.c (add_subscript_info): Retrieve the bounds and index
24613 type by means of the get_array_descr_info langhook, if it is set and
24614 returns true. Remove obsolete code dealing with unnamed subtypes.
24616 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
24618 * gimple-range-cache.cc (ssa_block_ranges): Virtualize.
24619 (sbr_vector): Renamed from ssa_block_cache.
24620 (sbr_vector::sbr_vector): Allocate from obstack abd initialize.
24621 (ssa_block_ranges::~ssa_block_ranges): Remove.
24622 (sbr_vector::set_bb_range): Use varying and undefined cached values.
24623 (ssa_block_ranges::set_bb_varying): Remove.
24624 (sbr_vector::get_bb_range): Adjust assert.
24625 (sbr_vector::bb_range_p): Adjust assert.
24626 (~block_range_cache): No freeing loop required.
24627 (block_range_cache::get_block_ranges): Remove.
24628 (block_range_cache::set_bb_range): Inline get_block_ranges.
24629 (block_range_cache::set_bb_varying): Remove.
24630 * gimple-range-cache.h (set_bb_varying): Remove prototype.
24631 * value-range.h (irange_allocator::get_memory): New.
24633 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
24635 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Search
24636 dominator tree is available and requested.
24637 (ranger_cache::ssa_range_in_bb): Don't search dom tree here.
24638 (ranger_cache::fill_block_cache): Don't search dom tree here either.
24639 * gimple-range-cache.h (non_null_deref_p): Add dom_search param.
24641 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
24643 * gimple-range.cc (gimple_ranger::range_on_exit): Handle block with
24644 only PHI nodes better.
24646 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
24648 * gimple-range-edge.h (gimple_outgoing_range): Rename from
24650 (gcond_edge_range): Export prototype.
24651 * gimple-range-edge.cc (gcond_edge_range): New.
24652 (gimple_outgoing_range::edge_range_p): Use gcond_edge_range.
24653 * gimple-range-gori.h (gori_compute): Use gimple_outgoing_range.
24655 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
24657 * gimple-range-edge.cc (outgoing_range::calc_switch_ranges): Compute
24658 default range into a temp and allocate only what is needed.
24660 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
24662 * range-op.cc (operator_trunc_mod::wi_fold): x % 0 is UNDEFINED.
24664 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
24666 * gimple-range.h (gimple_range_global): Pick up parameter initial
24667 values, and use-before defined locals are UNDEFINED.
24669 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
24671 * doc/extend.texi (scalar_storage_order): Mention effect on pointer
24673 * tree.h (reverse_storage_order_for_component_p): Return false if
24674 the type is a pointer.
24676 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
24678 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
24680 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
24683 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
24684 Handle V8QI, V4HI and V2SI modes.
24685 * config/i386/mmx.md (mmx_pblendvb): New insn pattern.
24686 * config/i386/sse.md (unspec): Move UNSPEC_BLENDV ...
24687 * config/i386/i386.md (unspec): ... here.
24689 2021-05-07 Tobias Burnus <tobias@codesourcery.com>
24690 Tom de Vries <tdevries@suse.de>
24692 * omp-low.c (lower_rec_simd_input_clauses): Set max_vf = 1 if
24693 a truth_value_p reduction variable is nonintegral.
24695 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
24698 * config/i386/i386-expand.c (ix86_use_mask_cmp_p):
24699 Return false for mode sizes < 16.
24701 2021-05-07 Jakub Jelinek <jakub@redhat.com>
24704 * config/i386/mmx.md (*xop_pcmov_<mode>): New define_insn.
24706 2021-05-06 Martin Jambor <mjambor@suse.cz>
24708 * ipa-sra.c (ipa_sra_dump_all_summaries): Dump edge summaries even
24709 when there is no function summary.
24710 (ipa_sra_summarize_function): produce edge summaries even when
24713 2021-05-06 Tom Tromey <tom@tromey.com>
24715 * godump.c (string_hash_eq): Remove.
24716 (go_finish): Use htab_eq_string.
24718 2021-05-06 Tom Tromey <tom@tromey.com>
24720 * gengtype-state.c (read_state): Use htab_eq_string.
24721 (string_eq): Remove.
24723 2021-05-06 Tom Tromey <tom@tromey.com>
24725 * gensupport.c (htab_eq_string): Remove.
24727 2021-05-06 Bernd Edlinger <bernd.edlinger@hotmail.de>
24730 * debug.h (gcc_debug_hooks): Add set_ignored_loc function pointer.
24731 * dwarf2out.h (dw_fde_node::ignored_debug): New data item.
24732 * dbxout.c (dbx_debug_hooks, xcoff_debug_hooks): Add dummy
24733 set_ignored_loc callbacks.
24734 * debug.c (do_nothing_debug_hooks): Likewise.
24735 * vmsdbgout.c (vmsdbg_debug_hooks): Likewise.
24736 * dwarf2out.c (text_section_used, cold_text_section_used): Remove.
24737 (in_text_section_p, last_text_label, last_cold_label,
24738 switch_text_ranges, switch_cold_ranges): New data items.
24739 (dwarf2out_note_section_used): Remove.
24740 (dwarf2out_begin_prologue): Set fde->ignored_debug and
24742 (mark_ignored_debug_section): New helper function.
24743 (dwarf2out_end_epilogue, dwarf2out_switch_text_section): Call
24744 mark_ignored_debug_section.
24745 (dwarf2_debug_hooks): Use dwarf2out_set_ignored_loc.
24746 (dwarf2_lineno_debug_hooks): Use dummy for set_ignored_loc.
24747 (size_of_aranges): Adjust formula for multi-part text ranges size.
24748 (output_aranges): Output multi-part text ranges.
24749 (dwarf2out_set_ignored_loc): New callback function.
24750 (dwarf2out_finish): Output multi-part text ranges.
24751 (dwarf2out_c_finalize): Clear new data items.
24752 * final.c (final_start_function_1): Call set_ignored_loc callback.
24753 (final_scan_insn_1): Likewise.
24754 * ggc-page.c (gt_ggc_mx): New helper function.
24755 * stringpool.c (gt_pch_nx): Likewise.
24757 2021-05-06 Richard Biener <rguenther@suse.de>
24759 * timevar.def (TV_TREE_INSERT_PHI_NODES): Remove.
24760 (TV_TREE_SSA_REWRITE_BLOCKS): Likewise.
24761 (TV_TREE_INTO_SSA): New.
24762 * tree-into-ssa.c (insert_phi_nodes): Do not account separately.
24763 (rewrite_blocks): Likewise.
24764 (pass_data_build_ssa): Account to TV_TREE_INTO_SSA.
24766 2021-05-06 Jakub Jelinek <jakub@redhat.com>
24768 * tree-ssa-phiopt.c (value_replacement, minmax_replacement,
24769 abs_replacement, xor_replacement,
24770 cond_removal_in_popcount_clz_ctz_pattern,
24771 replace_phi_edge_with_variable): Change type of phi argument from
24772 gimple * to gphi *.
24774 2021-05-06 Richard Biener <rguenther@suse.de>
24776 * tree-ssa-loop-split.c (split_loop): Delay updating SSA form.
24777 Output an opt-info message.
24778 (do_split_loop_on_cond): Likewise.
24779 (tree_ssa_split_loops): Update SSA form here.
24781 2021-05-06 Richard Biener <rguenther@suse.de>
24783 * tree-inline.c (tree_function_versioning): Fix DECL_BY_REFERENCE
24784 return variable removal.
24786 2021-05-06 Marius Hillenbrand <mhillen@linux.ibm.com>
24788 * config/s390/s390-builtins.def (O_M5, O1_M5, ...): Remove unused macros.
24789 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
24790 (s390_vec_permi_dbl, s390_vpdi): Use the O3_U2 type for the immediate
24792 * config/s390/s390.c (s390_const_operand_ok): Remove unused
24795 2021-05-06 Jakub Jelinek <jakub@redhat.com>
24797 PR tree-optimization/94589
24798 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Call
24799 spaceship_replacement.
24800 (cond_only_block_p, spaceship_replacement): New functions.
24802 2021-05-06 Richard Biener <rguenther@suse.de>
24805 * tree-emutls.c (gen_emutls_addr): Pass in whether we're
24806 dealing with a debug use and only query existing addresses
24808 (lower_emutls_1): Avoid splitting out addresses for debug
24809 stmts, reset the debug stmt when we fail to find existing
24811 (lower_emutls_phi_arg): Set wi.stmt.
24813 2021-05-06 Christoph Muellner <cmuellner@gcc.gnu.org>
24816 * config/riscv/riscv.c (riscv_block_move_loop): Use cbranch helper.
24817 * config/riscv/riscv.md (cbranch<mode>4): Generate helpers.
24818 (stack_protect_test): Use cbranch helper.
24820 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
24823 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
24824 always return the establisher frame for __builtin_frame_address (0).
24826 2021-05-05 Ivan Sorokin <vanyacpp@gmail.com>
24829 * config/i386/i386-builtins.c (ix86_cpu_model_type_node): New.
24830 (ix86_cpu_model_var): Likewise.
24831 (ix86_cpu_features2_type_node): Likewise.
24832 (ix86_cpu_features2_var): Likewise.
24833 (fold_builtin_cpu): Cache __cpu_model and __cpu_features2 with
24836 2021-05-05 Martin Sebor <msebor@redhat.com>
24838 * passes.def (pass_warn_printf): Run after SSA.
24840 2021-05-05 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
24842 * config/arm/neon.md (neon_vtst_combine<mode>): New pattern.
24843 * config/arm/predicates.md (minus_one_operand): New predicate.
24845 2021-05-05 Jeff Law <jlaw@tachyum.com>
24847 * config/avr/avr.md: Remove references to CC_STATUS_INIT.
24849 2021-05-05 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
24851 PR rtl-optimization/100263
24852 * postreload.c (move2add_valid_value_p): Ensure register can
24855 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
24857 PR rtl-optimization/100411
24858 * cfgcleanup.c (try_crossjump_to_edge): Also skip end of prologue
24859 and beginning of function markers.
24861 2021-05-05 Jeff Law <jlaw@tachyum.com>
24863 * config/cr16/cr16.h (NOTICE_UPDATE_CC): Remove.
24864 * config/cr16/cr16.c (notice_update_cc): Remove.
24865 * config/cr16/cr16-protos.h (notice_update_cc): Remove.
24867 2021-05-05 Uroš Bizjak <ubizjak@gmail.com>
24870 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
24871 Handle V8QI, V4HI and V2SI modes.
24872 * config/i386/i386.c (ix86_build_const_vector): Handle V2SImode.
24873 (ix86_build_signbit_mask): Ditto.
24874 * config/i386/mmx.md (MMXMODE14): New mode iterator.
24875 (<smaxmin:code><MMXMODE14:mode>3): New expander.
24876 (*mmx_<smaxmin:code><MMXMODE14:mode>3): New insn pattern.
24877 (<umaxmin:code><MMXMODE24:mode>3): New expander.
24878 (*mmx_<umaxmin:code><MMXMODE24:mode>3): New insn pattern.
24879 (vec_cmp<MMXMODEI:mode><MMXMODEI:mode>): New expander.
24880 (vec_cmpu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
24881 (vcond<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
24882 (vcondu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
24883 (vcond_mask_<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
24885 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
24887 * dwarf2out.c (loc_list_from_tree_1) <DECL>: During early DWARF, do
24888 not expand the VALUE_EXPR of variables put in the non-local frame.
24889 * gimplify.c (gimplify_type_sizes) <RECORD_TYPE>: If the type is not
24890 to be ignored for debug info, ensure its variable offsets are not.
24892 2021-05-05 Richard Biener <rguenther@suse.de>
24894 PR tree-optimization/79333
24895 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
24896 Fold stmt following SSA edges.
24898 2021-05-05 Richard Biener <rguenther@suse.de>
24900 PR middle-end/100394
24901 * calls.c (expand_call): Preserve possibly throwing calls.
24902 * cfgexpand.c (expand_call_stmt): When a call can throw signal
24903 RTL expansion there are side-effects.
24904 * tree-ssa-dce.c (mark_stmt_if_obviously_necessary): Simplify,
24905 mark all possibly throwing stmts necessary unless we can elide
24907 * tree-ssa-dse.c (pass_dse::execute): Preserve exceptions unless
24908 -fdelete-dead-exceptions.
24909 * tree.h (DECL_PURE_P): Add note about exceptions.
24911 2021-05-05 Alexandre Oliva <oliva@adacore.com>
24913 * config/i386/vxworks.h (DBX_REGISTER_NUMBER): Make it
24916 2021-05-04 David Edelsohn <dje.gcc@gmail.com>
24918 * config/rs6000/rs6000-call.c (rs6000_output_mi_thunk): Use
24919 get_fnname_from_decl for name of thunk.
24920 * config/rs6000/rs6000.c (rs6000_declare_alias): Use assemble_name
24921 and ASM_OUTPUT_LABEL.
24922 (rs6000_xcoff_declare_function_name): Use assemble_name and
24924 (rs6000_xcoff_declare_object_name): Use ASM_OUTPUT_LABEL.
24925 (rs6000_xcoff_encode_section_info): Don't add mapping class
24926 for aliases. Always add [DS] mapping class to primary
24928 (rs6000_asm_weaken_decl): Don't explicitly add [DS].
24930 2021-05-04 Martin Sebor <msebor@redhat.com>
24932 PR middle-end/100307
24933 * builtins.c (compute_objsize_r): Clear base0 for pointers.
24935 2021-05-04 Jeff Law <jlaw@tachyum.com>
24937 * config/bfin/bfin.h (NOTICE_UPDATE_CC): Remove.
24939 2021-05-04 Segher Boessenkool <segher@kernel.crashing.org>
24941 * caller-save.c: Remove CC0.
24942 * cfgcleanup.c: Remove CC0.
24943 * cfgrtl.c: Remove CC0.
24944 * combine.c: Remove CC0.
24945 * compare-elim.c: Remove CC0.
24946 * conditions.h: Remove CC0.
24947 * config/h8300/h8300.h: Remove CC0.
24948 * config/h8300/h8300-protos.h: Remove CC0.
24949 * config/h8300/peepholes.md: Remove CC0.
24950 * config/i386/x86-tune-sched.c: Remove CC0.
24951 * config/m68k/m68k.c: Remove CC0.
24952 * config/rl78/rl78.c: Remove CC0.
24953 * config/sparc/sparc.c: Remove CC0.
24954 * config/xtensa/xtensa.c: Remove CC0.
24955 (gen_conditional_move): Use pc_rtx instead of cc0_rtx in a piece of
24956 RTL where that is used as a placeholder only.
24957 * cprop.c: Remove CC0.
24958 * cse.c: Remove CC0.
24959 * cselib.c: Remove CC0.
24960 * df-problems.c: Remove CC0.
24961 * df-scan.c: Remove CC0.
24962 * doc/md.texi: Remove CC0. Adjust an example.
24963 * doc/rtl.texi: Remove CC0. Adjust an example.
24964 * doc/tm.texi: Regenerate.
24965 * doc/tm.texi.in: Remove CC0.
24966 * emit-rtl.c: Remove CC0.
24967 * final.c: Remove CC0.
24968 * fwprop.c: Remove CC0.
24969 * gcse-common.c: Remove CC0.
24970 * gcse.c: Remove CC0.
24971 * genattrtab.c: Remove CC0.
24972 * genconfig.c: Remove CC0.
24973 * genemit.c: Remove CC0.
24974 * genextract.c: Remove CC0.
24975 * gengenrtl.c: Remove CC0.
24976 * genrecog.c: Remove CC0.
24977 * haifa-sched.c: Remove CC0.
24978 * ifcvt.c: Remove CC0.
24979 * ira-costs.c: Remove CC0.
24980 * ira.c: Remove CC0.
24981 * jump.c: Remove CC0.
24982 * loop-invariant.c: Remove CC0.
24983 * lra-constraints.c: Remove CC0.
24984 * lra-eliminations.c: Remove CC0.
24985 * optabs.c: Remove CC0.
24986 * postreload-gcse.c: Remove CC0.
24987 * postreload.c: Remove CC0.
24988 * print-rtl.c: Remove CC0.
24989 * read-rtl-function.c: Remove CC0.
24990 * reg-notes.def: Remove CC0.
24991 * reg-stack.c: Remove CC0.
24992 * reginfo.c: Remove CC0.
24993 * regrename.c: Remove CC0.
24994 * reload.c: Remove CC0.
24995 * reload1.c: Remove CC0.
24996 * reorg.c: Remove CC0.
24997 * resource.c: Remove CC0.
24998 * rtl.c: Remove CC0.
24999 * rtl.def: Remove CC0.
25000 * rtl.h: Remove CC0.
25001 * rtlanal.c: Remove CC0.
25002 * sched-deps.c: Remove CC0.
25003 * sched-rgn.c: Remove CC0.
25004 * shrink-wrap.c: Remove CC0.
25005 * simplify-rtx.c: Remove CC0.
25006 * system.h: Remove CC0. Poison NOTICE_UPDATE_CC, CC_STATUS_MDEP_INIT,
25007 CC_STATUS_MDEP, and CC_STATUS.
25008 * target.def: Remove CC0.
25009 * valtrack.c: Remove CC0.
25010 * var-tracking.c: Remove CC0.
25012 2021-05-04 Richard Biener <rguenther@suse.de>
25014 PR tree-optimization/100414
25015 * tree-ssa-phiopt.c (get_non_trapping): Do not compute dominance
25017 (tree_ssa_phiopt_worker): But unconditionally here.
25019 2021-05-04 Tobias Burnus <tobias@codesourcery.com>
25021 * omp-low.c (lower_rec_input_clauses, lower_reduction_clauses): Handle
25022 && and || with floating-point and complex arguments.
25024 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
25026 * tree-inline.c (insert_debug_decl_map): Delete.
25027 (copy_debug_stmt): Minor tweak.
25028 (setup_one_parameter): Do not use a variable if the value is either
25029 a read-only DECL or a non-addressable local variable in the caller.
25030 In this case, insert the debug-only variable in the map manually.
25031 (expand_call_inline): Do not generate a CLOBBER for these values.
25032 * tree-inline.h (debug_map): Minor tweak.
25034 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
25036 * builtins.c (builtin_with_linkage_p): Return true for stp[n]cpy.
25037 * symtab.c (symtab_node::output_to_lto_symbol_table_p): Tidy up.
25039 2021-05-04 Richard Biener <rguenther@suse.de>
25041 PR tree-optimization/100329
25042 * tree-ssa-reassoc.c (can_reassociate_p): Do not reassociate
25044 (insert_stmt_after): Assert we're not running into asm goto.
25046 2021-05-04 Richard Biener <rguenther@suse.de>
25048 PR tree-optimization/100398
25049 * tree-ssa-dse.c (pass_dse::execute): Preserve control
25052 2021-05-04 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25054 * builtins.c (try_store_by_multiple_pieces): Fix constfun's prototype.
25056 2021-05-04 Alexandre Oliva <oliva@adacore.com>
25058 * builtins.c (try_store_by_multiple_pieces): New.
25059 (expand_builtin_memset_args): Use it. If target_char_cast
25060 fails, proceed as for non-constant val. Pass len's ctz to...
25061 * expr.c (clear_storage_hints): ... this. Try store by
25062 multiple pieces after setmem.
25063 (clear_storage): Adjust.
25064 * expr.h (clear_storage_hints): Likewise.
25065 (try_store_by_multiple_pieces): Declare.
25066 * passes.def: Replace the last copy_prop with ccp.
25068 2021-05-03 Tom de Vries <tdevries@suse.de>
25071 * omp-low.c (lower_rec_input_clauses): Disable SIMT for user-defined
25074 2021-05-03 Richard Biener <rguenther@suse.de>
25076 * tree-ssa-dse.c (dse_classify_store): Track two PHI defs.
25078 2021-05-03 Richard Biener <rguenther@suse.de>
25080 * tree-ssa-dse.c: Do not include domwalk.h but cfganal.h.
25081 (dse_dom_walker): Remove.
25082 (dse_dom_walker::dse_optimize_stmt): Rename...
25083 (dse_optimize_stmt): ... to this, pass in live_bytes sbitmap.
25084 (dse_dom_walker::before_dom_children): Inline ...
25085 (pass_dse::execute): ... here. Perform a reverse program
25088 2021-05-03 H.J. Lu <hjl.tools@gmail.com>
25091 * configure: Regenerated.
25093 2021-05-03 Ilya Leoshkevich <iii@linux.ibm.com>
25096 * config/s390/s390.c (s390_hard_fp_reg_p): New function.
25097 (s390_md_asm_adjust): Handle hard registers.
25099 2021-05-03 Jakub Jelinek <jakub@redhat.com>
25101 PR tree-optimization/100382
25102 * tree-ssa-dse.c: Include tree-eh.h.
25103 (dse_dom_walker::before_dom_children): Don't remove stmts if
25104 stmt_unremovable_because_of_non_call_eh_p is true.
25106 2021-05-02 David Edelsohn <dje.gcc@gmail.com>
25108 * varasm.c (compute_reloc_for_var): Split out from...
25109 (get_variable_section): Use it.
25110 * output.h (compute_reloc_for_var): Declare.
25111 * config/rs6000/rs6000-protos.h
25112 (rs6000_xcoff_asm_output_aligned_decl_common): Change alignment to
25114 * config/rs6000/rs6000.c (rs6000_legitimize_tls_address_aix):
25115 Don't append storage mapping class to symbol.
25116 (rs6000_xcoff_asm_named_section): Add BS and UL mapping classes.
25117 Don't convert TLS BSS to common.
25118 (rs6000_xcoff_unique_section): Don't fall back to select_secton.
25119 (rs6000_xcoff_section_type_flags): Add SECTION_BSS if DECL is
25121 (rs6000_xcoff_asm_globalize_decl_name): Don't strip storage
25123 (rs6000_xcoff_asm_output_aligned_decl_common): Align is unsigned int.
25124 If align is 0 from TLS class, use the same rules as varasm.c
25125 If not common, switch to BSS section manually.
25126 If common, emit appropriate comm or lcomm directive.
25127 (rs6000_xcoff_encode_section_info): Add logic to append all
25128 storage mapping classes.
25129 (rs6000_asm_weaken_decl): Adjust for qualname symbols.
25130 * config/rs6000/xcoff.h (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
25131 rs6000_xcoff_asm_output_aligned_decl_common.
25132 (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
25133 rs6000_xcoff_asm_output_aligned_decl_common.
25134 (ASM_OUTPUT_TLS_COMMON): Use
25135 rs6000_xcoff_asm_output_aligned_decl_common.
25137 2021-05-02 Jakub Jelinek <jakub@redhat.com>
25140 * config/nvptx/nvptx.c (nvptx_sese_pseudo): Use nullptr instead of 0
25141 as first argument of pseudo_node_t constructors.
25143 2021-05-02 Jakub Jelinek <jakub@redhat.com>
25146 * config/i386/t-i386 (TM_H): Add $(srcdir)/config/i386/i386-isa.def.
25148 2021-05-01 Aldy Hernandez <aldyh@redhat.com>
25150 * value-range.cc (DEFINE_INT_RANGE_GC_STUBS): Remove.
25151 (gt_pch_nx (int_range<1> *&)): New.
25152 (gt_ggc_mx (int_range<1> *&)): New.
25153 * value-range.h (class irange): Add GTY support for
25156 2021-05-01 Geng Qi <gengqi@linux.alibaba.com>
25158 * doc/options.texi (Negative): Change either or to both and.
25160 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25162 * config/aarch64/aarch64-simd-builtins.def: Add
25163 float_ml[as][q]_laneq builtin generator macros.
25164 * config/aarch64/aarch64-simd.md (mul_laneq<mode>3): Define.
25165 (aarch64_float_mla_laneq<mode>): Define.
25166 (aarch64_float_mls_laneq<mode>): Define.
25167 * config/aarch64/arm_neon.h (vmla_laneq_f32): Use RTL builtin
25168 instead of GCC vector extensions.
25169 (vmlaq_laneq_f32): Likewise.
25170 (vmls_laneq_f32): Likewise.
25171 (vmlsq_laneq_f32): Likewise.
25173 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25175 * config/aarch64/aarch64-simd-builtins.def: Add
25176 float_ml[as]_lane builtin generator macros.
25177 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt<mode>):
25179 (mul_lane<mode>3): This, and re-order arguments.
25180 (aarch64_float_mla_lane<mode>): Define.
25181 (aarch64_float_mls_lane<mode>): Define.
25182 * config/aarch64/arm_neon.h (vmla_lane_f32): Use RTL builtin
25183 instead of GCC vector extensions.
25184 (vmlaq_lane_f32): Likewise.
25185 (vmls_lane_f32): Likewise.
25186 (vmlsq_lane_f32): Likewise.
25188 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25190 * config/aarch64/aarch64-simd-builtins.def: Add float_ml[as]
25191 builtin generator macros.
25192 * config/aarch64/aarch64-simd.md (aarch64_float_mla<mode>):
25194 (aarch64_float_mls<mode>): Define.
25195 * config/aarch64/arm_neon.h (vmla_f32): Use RTL builtin
25196 instead of relying on GCC vector extensions.
25197 (vmla_f64): Likewise.
25198 (vmlaq_f32): Likewise.
25199 (vmlaq_f64): Likewise.
25200 (vmls_f32): Likewise.
25201 (vmls_f64): Likewise.
25202 (vmlsq_f32): Likewise.
25203 (vmlsq_f64): Likewise.
25204 * config/aarch64/iterators.md: Define VDQF_DF mode iterator.
25206 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25208 * config/aarch64/aarch64-simd-builtins.def: Add
25209 float_ml[as]_n_builtin generator macros.
25210 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt_from_dup<mode>):
25212 (mul_n<mode>3): This, and re-order arguments.
25213 (aarch64_float_mla_n<mode>): Define.
25214 (aarch64_float_mls_n<mode>): Define.
25215 * config/aarch64/arm_neon.h (vmla_n_f32): Use RTL builtin
25216 instead of inline asm.
25217 (vmlaq_n_f32): Likewise.
25218 (vmls_n_f32): Likewise.
25219 (vmlsq_n_f32): Likewise.
25221 2021-04-30 Jonathan Wright <joanthan.wright@arm.com>
25223 * config/aarch64/aarch64-simd-builtins.def: Add pmull[2]
25224 builtin generator macros.
25225 * config/aarch64/aarch64-simd.md (aarch64_pmullv8qi): Define.
25226 (aarch64_pmull_hiv16qi_insn): Define.
25227 (aarch64_pmull_hiv16qi): Define.
25228 * config/aarch64/arm_neon.h (vmull_high_p8): Use RTL builtin
25229 instead of inline asm.
25230 (vmull_p8): Likewise.
25232 2021-04-30 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
25234 * config/avr/avr.md: Adjust peepholes to match and
25235 generate parallels with clobber of REG_CC.
25236 (mov<mode>_insn): Rename to mov<mode>_insn_split.
25237 (*mov<mode>_insn): Rename to mov<mode>_insn.
25239 2021-04-30 David Edelsohn <dje.gcc@gmail.com>
25241 * varasm.c (use_blocks_for_decl_p): Don't use section anchors
25242 for VAR_DECLs if -fdata-sections enabled.
25244 2021-04-30 Michael Meissner <meissner@linux.ibm.com>
25246 PR bootstrap/100327
25247 * config/rs6000/rs6000.c
25248 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Define.
25249 (rs6000_libgcc_floating_mode_supported_p): New target hook.
25251 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
25253 * tree-ssa-threadbackward.c (class thread_jumps): Split out code
25255 (class back_threader_registry): ...to here...
25256 (class back_threader_profitability): ...and here...
25257 (thread_jumps::thread_through_all_blocks): Remove argument.
25258 (back_threader_registry::back_threader_registry): New.
25259 (back_threader_registry::~back_threader_registry): New.
25260 (back_threader_registry::thread_through_all_blocks): New.
25261 (thread_jumps::profitable_jump_thread_path): Move from here...
25262 (back_threader_profitability::profitable_path_p): ...to here.
25263 (thread_jumps::find_taken_edge): New.
25264 (thread_jumps::convert_and_register_current_path): Move...
25265 (back_threader_registry::register_path): ...to here.
25266 (thread_jumps::register_jump_thread_path_if_profitable): Move...
25267 (thread_jumps::maybe_register_path): ...to here.
25268 (thread_jumps::handle_phi): Call find_taken_edge and
25269 maybe_register_path.
25270 (thread_jumps::handle_assignment): Same.
25271 (thread_jumps::fsm_find_control_statement_thread_paths): Remove
25272 tree argument to handle_phi and handle_assignment.
25273 (thread_jumps::find_jump_threads_backwards): Set m_name. Remove
25274 set of m_speed_p and m_max_threaded_paths.
25275 (pass_thread_jumps::execute): Remove second argument from
25276 find_jump_threads_backwards.
25277 (pass_early_thread_jumps::execute): Same.
25279 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
25281 * tree-ssa-dom.c (class dom_jump_threader_simplifier): New.
25282 (class dom_opt_dom_walker): Initialize some class variables.
25283 (pass_dominator::execute): Pass evrp_range_analyzer and
25284 dom_jump_threader_simplifier to dom_opt_dom_walker.
25285 Adjust for some functions moving into classes.
25286 (simplify_stmt_for_jump_threading): Adjust and move to...
25287 (jump_threader_simplifier::simplify): ...here.
25288 (dom_opt_dom_walker::before_dom_children): Adjust for
25289 m_evrp_range_analyzer.
25290 (dom_opt_dom_walker::after_dom_children): Remove x_vr_values hack.
25291 (test_for_singularity): Place in dom_opt_dom_walker class.
25292 (dom_opt_dom_walker::optimize_stmt): The argument
25293 evrp_range_analyzer is now a class field.
25294 * tree-ssa-threadbackward.c (class thread_jumps): Add m_registry.
25295 (thread_jumps::thread_through_all_blocks): New.
25296 (thread_jumps::convert_and_register_current_path): Use m_registry.
25297 (pass_thread_jumps::execute): Adjust for thread_through_all_blocks
25298 being in the threader class.
25299 (pass_early_thread_jumps::execute): Same.
25300 * tree-ssa-threadedge.c (threadedge_initialize_values): Move...
25301 (jump_threader::jump_threader): ...here.
25302 (threadedge_finalize_values): Move...
25303 (jump_threader::~jump_threader): ...here.
25304 (jump_threader::remove_jump_threads_including): New.
25305 (jump_threader::thread_through_all_blocks): New.
25306 (record_temporary_equivalences_from_phis): Move...
25307 (jump_threader::record_temporary_equivalences_from_phis): ...here.
25308 (record_temporary_equivalences_from_stmts_at_dest): Move...
25309 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
25311 (simplify_control_stmt_condition_1): Move to jump_threader class.
25312 (simplify_control_stmt_condition): Move...
25313 (jump_threader::simplify_control_stmt_condition): ...here.
25314 (thread_around_empty_blocks): Move...
25315 (jump_threader::thread_around_empty_blocks): ...here.
25316 (thread_through_normal_block): Move...
25317 (jump_threader::thread_through_normal_block): ...here.
25318 (thread_across_edge): Move...
25319 (jump_threader::thread_across_edge): ...here.
25320 (thread_outgoing_edges): Move...
25321 (jump_threader::thread_outgoing_edges): ...here.
25322 * tree-ssa-threadedge.h: Move externally facing functings...
25323 (class jump_threader): ...here...
25324 (class jump_threader_simplifier): ...and here.
25325 * tree-ssa-threadupdate.c (struct redirection_data): Remove comment.
25326 (jump_thread_path_allocator::jump_thread_path_allocator): New.
25327 (jump_thread_path_allocator::~jump_thread_path_allocator): New.
25328 (jump_thread_path_allocator::allocate_thread_edge): New.
25329 (jump_thread_path_allocator::allocate_thread_path): New.
25330 (jump_thread_path_registry::jump_thread_path_registry): New.
25331 (jump_thread_path_registry::~jump_thread_path_registry): New.
25332 (jump_thread_path_registry::allocate_thread_edge): New.
25333 (jump_thread_path_registry::allocate_thread_path): New.
25334 (dump_jump_thread_path): Make extern.
25335 (debug (const vec<jump_thread_edge *> &path)): New.
25336 (struct removed_edges): Move to tree-ssa-threadupdate.h.
25337 (struct thread_stats_d): Remove.
25338 (remove_ctrl_stmt_and_useless_edges): Make static.
25339 (lookup_redirection_data): Move...
25340 (jump_thread_path_registry::lookup_redirection_data): ...here.
25341 (ssa_redirect_edges): Make static.
25342 (thread_block_1): Move...
25343 (jump_thread_path_registry::thread_block_1): ...here.
25344 (thread_block): Move...
25345 (jump_thread_path_registry::thread_block): ...here.
25346 (thread_through_loop_header): Move...
25347 (jump_thread_path_registry::thread_through_loop_header): ...here.
25348 (mark_threaded_blocks): Move...
25349 (jump_thread_path_registry::mark_threaded_blocks): ...here.
25350 (debug_path): Move...
25351 (jump_thread_path_registry::debug_path): ...here.
25352 (debug_all_paths): Move...
25353 (jump_thread_path_registry::dump): ..here.
25354 (rewire_first_differing_edge): Move...
25355 (jump_thread_path_registry::rewire_first_differing_edge): ...here.
25356 (adjust_paths_after_duplication): Move...
25357 (jump_thread_path_registry::adjust_paths_after_duplication): ...here.
25358 (duplicate_thread_path): Move...
25359 (jump_thread_path_registry::duplicate_thread_path): ..here.
25360 (remove_jump_threads_including): Move...
25361 (jump_thread_path_registry::remove_jump_threads_including): ...here.
25362 (thread_through_all_blocks): Move to...
25363 (jump_thread_path_registry::thread_through_all_blocks): ...here.
25364 (delete_jump_thread_path): Remove.
25365 (register_jump_thread): Move...
25366 (jump_thread_path_registry::register_jump_thread): ...here.
25367 * tree-ssa-threadupdate.h: Move externally facing functions...
25368 (class jump_thread_path_allocator): ...here...
25369 (class jump_thread_path_registry): ...and here.
25370 (thread_through_all_blocks): Remove.
25371 (struct removed_edges): New.
25372 (register_jump_thread): Remove.
25373 (remove_jump_threads_including): Remove.
25374 (delete_jump_thread_path): Remove.
25375 (remove_ctrl_stmt_and_useless_edges): Remove.
25376 (free_dom_edge_info): New prototype.
25377 * tree-vrp.c: Remove x_vr_values hack.
25378 (class vrp_jump_threader_simplifier): New.
25379 (vrp_jump_threader_simplifier::simplify): New.
25380 (vrp_jump_threader::vrp_jump_threader): Adjust method signature.
25381 Remove m_dummy_cond.
25382 Instantiate m_simplifier and m_threader.
25383 (vrp_jump_threader::thread_through_all_blocks): New.
25384 (vrp_jump_threader::simplify_stmt): Remove.
25385 (vrp_jump_threader::after_dom_children): Do not set m_dummy_cond.
25386 Remove x_vr_values hack.
25387 (execute_vrp): Adjust for thread_through_all_blocks being in a
25390 2021-04-30 Christophe Lyon <christophe.lyon@linaro.org>
25392 * genflags.c (gen_insn): Print failed expansion string.
25394 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
25396 * expr.c (alignment_for_piecewise_move): Call mode_for_size
25397 without limit to MAX_FIXED_MODE_SIZE.
25399 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
25401 PR middle-end/90773
25402 * builtins.c (builtin_memset_gen_str): Don't use return from
25403 simplify_gen_subreg.
25405 2021-04-30 Uroš Bizjak <ubizjak@gmail.com>
25408 * config/i386/i386.md (*add<mode>3_carry_0r): New insn pattern.
25409 (*addsi3_carry_zext_0r): Ditto.
25410 (*sub<mode>3_carry_0): Ditto.
25411 (*subsi3_carry_zext_0r): Ditto.
25412 * config/i386/predicates.md (ix86_carry_flag_unset_operator):
25414 * config/i386/i386.c (ix86_rtx_costs) <case PLUS, case MINUS>:
25415 Also consider ix86_carry_flag_unset_operator to calculate
25416 the cost of adc/sbb insn.
25418 2021-04-30 Roman Zhuykov <zhroma@ispras.ru>
25420 PR rtl-optimization/100225
25421 PR rtl-optimization/84878
25422 * modulo-sched.c (sms_schedule): Use note_stores to skip loops
25423 where we have an instruction which touches (writes) any hard
25424 register from df->regular_block_artificial_uses set.
25425 Allow not-single-set instruction only right before basic block
25428 2021-04-30 Geng Qi <gengqi@linux.alibaba.com>
25430 * config/riscv/riscv.opt (march=,mabi=): Negative itself.
25432 2021-04-30 LevyHsu <admin@levyhsu.com>
25434 * config/riscv/riscv.c (riscv_min_arithmetic_precision): New.
25435 * config/riscv/riscv.h (TARGET_MIN_ARITHMETIC_PRECISION): New.
25436 * config/riscv/riscv.md (addv<mode>4, uaddv<mode>4): New.
25437 (subv<mode>4, usubv<mode>4, mulv<mode>4, umulv<mode>4): New.
25439 2021-04-29 Alexandre Oliva <oliva@adacore.com>
25441 * config.gcc: Merged x86 and x86_64 cpu_type-setting cases.
25443 2021-04-29 Alexandre Oliva <oliva@adacore.com>
25445 * config/i386/i386.h (ASM_OUTPUT_MAX_SKIP_PAD): Rename to...
25446 (ASM_OUTPUT_MAX_SKIP_ALIGN): ... this. Enclose in do/while(0).
25447 * config/i386/i386.c: Adjust.
25448 * config/i386/i386.md: Adjust.
25449 * config/i386/darwin.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Drop.
25450 * config/i386/dragonfly.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25451 * config/i386/freebsd.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25452 * config/i386/gas.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25453 * config/i386/gnu-user.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25454 * config/i386/iamcu.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25455 * config/i386/lynx.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25456 * config/i386/netbsd-elf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25457 * config/i386/openbsdelf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25458 * config/i386/x86-64.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25459 (ASM_OUTPUT_MAX_SKIP_PAD): Likewise.
25461 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
25463 * config/i386/i386-expand.c (ix86_expand_int_compare):
25464 Swap operands of GTU and LEU comparison to emit carry flag comparison.
25465 * config/i386/i386.md (*add<mode>3_carry_0): Change insn
25466 predicate to allow more combine opportunities with memory operands.
25467 (*sub<mode>3_carry_0): Ditto.
25469 2021-04-29 Richard Sandiford <richard.sandiford@arm.com>
25471 PR rtl-optimization/100303
25472 * rtl-ssa/accesses.cc (function_info::make_use_available): Take a
25473 boolean that indicates whether the use will only be used in
25474 debug instructions. Treat it in the same way that existing
25475 cross-EBB debug references would be handled if so.
25476 (function_info::make_uses_available): Likewise.
25477 * rtl-ssa/functions.h (function_info::make_uses_available): Update
25478 prototype accordingly.
25479 (function_info::make_uses_available): Likewise.
25480 * fwprop.c (try_fwprop_subst): Update call accordingly.
25482 2021-04-29 Jeff Law <jlaw@tachyum.com>
25484 * config/nios2/nios2-protos.h (nios2_fpu_insn_enabled): Move outside
25487 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
25488 Richard Biener <rguenther@suse.de>
25491 * config/i386/i386-builtin.def (IX86_BUILTIN_MASKLOADPD)
25492 (IX86_BUILTIN_MASKLOADPS, IX86_BUILTIN_MASKLOADPD256)
25493 (IX86_BUILTIN_MASKLOADPS256, IX86_BUILTIN_MASKLOADD)
25494 (IX86_BUILTIN_MASKLOADQ, IX86_BUILTIN_MASKLOADD256)
25495 (IX86_BUILTIN_MASKLOADQ256): Move from SPECIAL_ARGS
25496 to PURE_ARGS category.
25497 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
25498 Handle PURE_ARGS category.
25499 * config/i386/i386-expand.c (ix86_expand_builtin): Ditto.
25501 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
25503 * configure.ac: Check for the presence of sys/locking.h header and
25504 for whether _LK_LOCK is supported by _locking.
25505 * configure: Regenerate.
25506 * config.in: Likewise.
25507 * gcov-io.h: Define GCOV_LOCKED_WITH_LOCKING if HOST_HAS_LK_LOCK.
25508 * gcov-io.c (gcov_open): Add support for GCOV_LOCKED_WITH_LOCKING.
25509 * system.h: Include <sys/locking.h> if HAVE_SYS_LOCKING_H.
25511 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
25513 * config/i386/predicates.md (fcmov_comparison_operator):
25514 Do not check for trivial FP comparison operator.
25515 <case GEU, case LTU>: Allow CCGZmode.
25516 <case GTU, case LEU>: Do not allow CCCmode.
25517 (ix86_comparison_operator) <case GTU, case LEU>: Allow only CCmode.
25518 (ix86_carry_flag_operator): Match only LTU and UNLT code.
25519 Do not check for trivial FP comparison operator. Allow CCGZmode.
25521 2021-04-29 Tom de Vries <tdevries@suse.de>
25523 * omp-expand.c (expand_omp_simd): Add step_orig, and replace uses of
25524 fd->loop.step by either step or orig_step.
25526 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
25528 * config/sparc/sparc.c (gen_load_pcrel_sym): Delete.
25529 (load_got_register): Do the PIC dance here.
25530 (sparc_legitimize_tls_address): Simplify.
25531 (sparc_emit_probe_stack_range): Likewise.
25532 (sparc32_initialize_trampoline): Likewise.
25533 (sparc64_initialize_trampoline): Likewise.
25534 * config/sparc/sparc.md (load_pcrel_sym<P:mode>): Add @ marker.
25535 (probe_stack_range<P:mode>): Likewise.
25536 (flush<P:mode>): Likewise.
25537 (tgd_hi22<P:mode>): Likewise.
25538 (tgd_lo10<P:mode>): Likewise.
25539 (tgd_add<P:mode>): Likewise.
25540 (tgd_call<P:mode>): Likewise.
25541 (tldm_hi22<P:mode>): Likewise.
25542 (tldm_lo10<P:mode>): Likewise.
25543 (tldm_add<P:mode>): Likewise.
25544 (tldm_call<P:mode>): Likewise.
25545 (tldo_hix22<P:mode>): Likewise.
25546 (tldo_lox10<P:mode>): Likewise.
25547 (tldo_add<P:mode>): Likewise.
25548 (tie_hi22<P:mode>): Likewise.
25549 (tie_lo10<P:mode>): Likewise.
25550 (tie_add<P:mode>): Likewise.
25551 (tle_hix22<P:mode>): Likewise.
25552 (tle_lox10<P:mode>): Likewise.
25553 (stack_protect_setsi): Rename to...
25554 (stack_protect_set32): ...this.
25555 (stack_protect_setdi): Rename to...
25556 (stack_protect_set64): ...this.
25557 (stack_protect_set): Adjust calls to above.
25558 (stack_protect_testsi): Rename to...
25559 (stack_protect_test32): ...this.
25560 (stack_protect_testdi): Rename to...
25561 (stack_protect_test64): ...this.
25562 (stack_protect_test): Adjust calls to above.
25564 2021-04-29 H.J. Lu <hjl.tools@gmail.com>
25566 PR middle-end/90773
25567 * builtins.c (builtin_memcpy_read_str): Add a dummy argument.
25568 (builtin_strncpy_read_str): Likewise.
25569 (builtin_memset_read_str): Add an argument for the previous RTL
25570 information and generate the new RTL from the previous RTL info.
25571 (builtin_memset_gen_str): Likewise.
25572 * builtins.h (builtin_strncpy_read_str): Update the prototype.
25573 (builtin_memset_read_str): Likewise.
25574 * expr.c (by_pieces_ninsns): If targetm.overlap_op_by_pieces_p()
25575 returns true, round up size and alignment to the widest integer
25576 mode for maximum size.
25577 (pieces_addr::adjust): Add a pointer to by_pieces_prev argument
25578 and pass it to m_constfn.
25579 (op_by_pieces_d): Add m_push and m_overlap_op_by_pieces.
25580 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
25581 initialize m_push. Initialize m_overlap_op_by_pieces with
25582 targetm.overlap_op_by_pieces_p ().
25583 (op_by_pieces_d::run): Pass the previous RTL information to
25584 pieces_addr::adjust and generate overlapping operations if
25585 m_overlap_op_by_pieces is true.
25587 (move_by_pieces_d::move_by_pieces_d): Updated for op_by_pieces_d
25589 (store_by_pieces_d::store_by_pieces_d): Updated for op_by_pieces_d
25591 (can_store_by_pieces): Use by_pieces_constfn on constfun.
25592 (store_by_pieces): Use by_pieces_constfn on constfun. Updated
25593 for op_by_pieces_d change.
25594 (clear_by_pieces_1): Add a dummy argument.
25595 (clear_by_pieces): Updated for op_by_pieces_d change.
25596 (compare_by_pieces_d::compare_by_pieces_d): Likewise.
25597 (string_cst_read_str): Add a dummy argument.
25598 * expr.h (by_pieces_constfn): Add a dummy argument.
25599 (by_pieces_prev): New.
25600 * target.def (overlap_op_by_pieces_p): New target hook.
25601 * config/i386/i386.c (TARGET_OVERLAP_OP_BY_PIECES_P): New.
25602 * doc/tm.texi.in: Add TARGET_OVERLAP_OP_BY_PIECES_P.
25603 * doc/tm.texi: Regenerated.
25605 2021-04-29 Richard Biener <rguenther@suse.de>
25607 PR tree-optimization/100253
25608 * tree-vect-stmts.c (vectorizable_load): Do not assume
25609 element alignment when DR_MISALIGNMENT is -1.
25610 (vectorizable_store): Likewise.
25612 2021-04-29 Jakub Jelinek <jakub@redhat.com>
25615 * config/aarch64/aarch64.c (aarch64_add_offset_1_temporaries): Use
25616 absu_hwi instead of abs_hwi.
25618 2021-04-29 Richard Biener <rguenther@suse.de>
25620 PR middle-end/38474
25621 * tree-ssa-structalias.c (add_graph_edge): Avoid direct
25622 forwarding when indirect forwarding through ESCAPED
25625 2021-04-29 Tom de Vries <tdevries@suse.de>
25628 * internal-fn.c (expand_GOMP_SIMT_ENTER_ALLOC)
25629 (expand_GOMP_SIMT_LAST_LANE, expand_GOMP_SIMT_ORDERED_PRED)
25630 (expand_GOMP_SIMT_VOTE_ANY, expand_GOMP_SIMT_XCHG_BFLY)
25631 (expand_GOMP_SIMT_XCHG_IDX): Ensure target is assigned to.
25633 2021-04-29 Richard Biener <rguenther@suse.de>
25635 PR tree-optimization/99912
25636 * tree-ssa-dse.c (dse_dom_walker::m_need_cfg_cleanup): New.
25637 (dse_dom_walker::todo): Likewise.
25638 (dse_dom_walker::dse_optimize_stmt): Move VDEF check to the
25640 (dse_dom_walker::before_dom_children): Remove trivially
25641 dead SSA defs and schedule CFG cleanup if we removed all
25643 (pass_dse::execute): Get TODO as computed by the DOM walker
25644 and return it. Wipe dominator info earlier.
25646 2021-04-29 Richard Biener <rguenther@suse.de>
25649 * ipa-prop.c (ipcp_modif_dom_walker::before_dom_children):
25650 Track blocks to cleanup EH in new m_need_eh_cleanup.
25651 (ipcp_modif_dom_walker::cleanup_eh): New.
25652 (ipcp_transform_function): Release dominator info before
25655 2021-04-29 Martin Sebor <msebor@redhat.com>
25657 PR middle-end/100250
25658 * attribs.c (attr_access::array_as_string): Avoid dereferencing
25659 a pointer when it's null.
25661 2021-04-29 Martin Sebor <msebor@redhat.com>
25663 * Makefile.in (OBJS): Add ipa-free-lang-data.o.
25664 * ipa-free-lang-data.cc: New file.
25665 * tree.c: Move pass free_lang_data to file above.
25666 (build_array_type_1): Declare extern.
25667 * tree.h (build_array_type_1): Declare.
25669 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25671 * config/aarch64/aarch64-simd-builtins.def: Modify comment to
25672 make consistent with updated RTL pattern.
25673 * config/aarch64/aarch64-simd.md (aarch64_<sur>qmovn<mode>):
25674 Implement using ss_truncate and us_truncate rather than
25676 * config/aarch64/iterators.md: Remove redundant unspecs and
25677 iterator: UNSPEC_[SU]QXTN and SUQMOVN respectively.
25679 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25681 * config/aarch64/arm_acle.h (__attribute__): Make intrinsic
25682 attributes consistent with those defined in arm_neon.h.
25684 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25686 * config/aarch64/arm_fp16.h (__attribute__): Make intrinsic
25687 attributes consistent with those defined in arm_neon.h.
25689 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25691 * config/aarch64/aarch64-simd-builtins.def: Add
25692 float_trunc_rodd builtin generator macros.
25693 * config/aarch64/aarch64-simd.md (aarch64_float_trunc_rodd_df):
25695 (aarch64_float_trunc_rodd_lo_v2sf): Define.
25696 (aarch64_float_trunc_rodd_hi_v4sf_le): Define.
25697 (aarch64_float_trunc_rodd_hi_v4sf_be): Define.
25698 (aarch64_float_trunc_rodd_hi_v4sf): Define.
25699 * config/aarch64/arm_neon.h (vcvtx_f32_f64): Use RTL builtin
25700 instead of inline asm.
25701 (vcvtx_high_f32_f64): Likewise.
25702 (vcvtxd_f32_f64): Likewise.
25703 * config/aarch64/iterators.md: Add FCVTXN unspec.
25705 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25707 * config/aarch64/aarch64-simd-builtins.def: Add tbx1 builtin
25709 * config/aarch64/aarch64-simd.md (aarch64_tbx1<mode>):
25711 * config/aarch64/arm_neon.h (vqtbx1_s8): USE RTL builtin
25712 instead of inline asm.
25713 (vqtbx1_u8): Likewise.
25714 (vqtbx1_p8): Likewise.
25715 (vqtbx1q_s8): Likewise.
25716 (vqtbx1q_u8): Likewise.
25717 (vqtbx1q_p8): Likewise.
25718 (vtbx2_s8): Likewise.
25719 (vtbx2_u8): Likewise.
25720 (vtbx2_p8): Likewise.
25722 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25724 * config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
25726 * config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
25727 instead of inline asm.
25728 (vqtbl1_s8): Likewise.
25729 (vqtbl1_u8): Likewise.
25730 (vqtbl1q_p8): Likewise.
25731 (vqtbl1q_s8): Likewise.
25732 (vqtbl1q_u8): Likewise.
25733 (vtbl1_s8): Likewise.
25734 (vtbl1_u8): Likewise.
25735 (vtbl1_p8): Likewise.
25736 (vtbl2_s8): Likewise.
25737 (vtbl2_u8): Likewise.
25738 (vtbl2_p8): Likewise.
25740 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25742 * config/aarch64/aarch64-simd-builtins.def: Add polynomial
25743 ssri_n buitin generator macro.
25744 * config/aarch64/arm_neon.h (vsri_n_p8): Use RTL builtin
25745 instead of inline asm.
25746 (vsri_n_p16): Likewise.
25747 (vsri_n_p64): Likewise.
25748 (vsriq_n_p8): Likewise.
25749 (vsriq_n_p16): Likewise.
25750 (vsriq_n_p64): Likewise.
25752 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25754 * config/aarch64/aarch64-simd-builtins.def: Use VALLP mode
25755 iterator for polynomial ssli_n builtin generator macro.
25756 * config/aarch64/arm_neon.h (vsli_n_p8): Use RTL builtin
25757 instead of inline asm.
25758 (vsli_n_p16): Likewise.
25759 (vsliq_n_p8): Likewise.
25760 (vsliq_n_p16): Likewise.
25761 * config/aarch64/iterators.md: Define VALLP mode iterator.
25763 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25765 * config/aarch64/aarch64-simd-builtins.def: Use VDQV_L
25766 iterator to generate [su]adalp RTL builtins.
25767 * config/aarch64/aarch64-simd.md: Use VDQV_L iterator in
25768 [su]adalp RTL pattern.
25769 * config/aarch64/arm_neon.h (vpadal_s32): Use RTL builtin
25770 instead of inline asm.
25771 (vpadal_u32): Likewise.
25773 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25775 * config/aarch64/aarch64-simd-builtins.def: Add [su]addlp
25776 builtin generator macros.
25777 * config/aarch64/aarch64-simd.md (aarch64_<su>addlp<mode>):
25779 * config/aarch64/arm_neon.h (vpaddl_s8): Use RTL builtin
25780 instead of inline asm.
25781 (vpaddl_s16): Likewise.
25782 (vpaddl_s32): Likewise.
25783 (vpaddl_u8): Likewise.
25784 (vpaddl_u16): Likewise.
25785 (vpaddl_u32): Likewise.
25786 (vpaddlq_s8): Likewise.
25787 (vpaddlq_s16): Likewise.
25788 (vpaddlq_s32): Likewise.
25789 (vpaddlq_u8): Likewise.
25790 (vpaddlq_u16): Likewise.
25791 (vpaddlq_u32): Liwewise.
25792 * config/aarch64/iterators.md: Define [SU]ADDLP unspecs with
25793 appropriate attributes.
25795 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25797 * config/aarch64/aarch64-simd-builtins.def: Use VDQ_I iterator
25798 for aarch64_addp<mode> builtin macro generator.
25799 * config/aarch64/aarch64-simd.md: Use VDQ_I iterator in
25800 aarch64_addp<mode> RTL pattern.
25801 * config/aarch64/arm_neon.h (vpaddq_s8): Use RTL builtin
25802 instead of inline asm.
25803 (vpaddq_s16): Likewise.
25804 (vpaddq_s32): Likewise.
25805 (vpaddq_s64): Likewise.
25806 (vpaddq_u8): Likewise.
25807 (vpaddq_u16): Likewise.
25808 (vpaddq_u32): Likewise.
25809 (vpaddq_u64): Likewise.
25811 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25813 * config/aarch64/aarch64-simd-builtins.def: Add sq[r]dmulh_n
25814 builtin generator macros.
25815 * config/aarch64/aarch64-simd.md (aarch64_sq<r>dmulh_n<mode>):
25817 * config/aarch64/arm_neon.h (vqdmulh_n_s16): Use RTL builtin
25818 instead of inline asm.
25819 (vqdmulh_n_s32): Likewise.
25820 (vqdmulhq_n_s16): Likewise.
25821 (vqdmulhq_n_s32): Likewise.
25822 (vqrdmulh_n_s16): Likewise.
25823 (vqrdmulh_n_s32): Likewise.
25824 (vqrdmulhq_n_s16): Likewise.
25825 (vqrdmulhq_n_s32): Likewise.
25827 2021-04-28 Tobias Burnus <tobias@codesourcery.com>
25829 * doc/install.texi (--enable-offload-defaulted): Document.
25831 2021-04-28 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
25833 * config/avr/avr-dimode.md: Turn existing patterns into
25834 define_insn_and_split style patterns where the splitter
25835 adds a clobber of the condition code register. Drop "cc"
25836 attribute. Add new patterns to match output of
25838 * config/avr/avr-fixed.md: Likewise.
25839 * config/avr/avr.c (cc_reg_rtx): New.
25840 (avr_parallel_insn_from_insns): Adjust insn count
25841 for removal of set of cc0.
25842 (avr_is_casesi_sequence): Likewise.
25843 (avr_casei_sequence_check_operands): Likewise.
25844 (avr_optimize_casesi): Likewise. Also insert
25845 new insns after jump_insn.
25846 (avr_pass_casesi::avr_rest_of_handle_casesi): Adjust
25847 for removal of set of cc0.
25848 (avr_init_expanders): Initialize cc_reg_rtx.
25849 (avr_regno_reg_class): Handle REG_CC.
25850 (cond_string): Remove usage of CC_OVERFLOW_UNUSABLE.
25851 (avr_notice_update_cc): Remove function.
25852 (ret_cond_branch): Remove usage of CC_OVERFLOW_UNUSABLE.
25853 (compare_condition): Adjust for PARALLEL with
25855 (out_shift_with_cnt): Likewise.
25856 (ashlhi3_out): Likewise.
25857 (ashrhi3_out): Likewise.
25858 (lshrhi3_out): Likewise.
25859 (avr_class_max_nregs): Return single reg for REG_CC.
25860 (avr_compare_pattern): Check for REG_CC instead
25862 (avr_reorg_remove_redundant_compare): Likewise.
25863 (avr_reorg):Adjust for PARALLEL with REG_CC clobber.
25864 (avr_hard_regno_nregs): Return single reg for REG_CC.
25865 (avr_hard_regno_mode_ok): Allow only CCmode for REG_CC.
25866 (avr_md_asm_adjust): Clobber REG_CC.
25867 (TARGET_HARD_REGNO_NREGS): Define.
25868 (TARGET_CLASS_MAX_NREGS): Define.
25869 (TARGET_MD_ASM_ADJUST): Define.
25870 * config/avr/avr.h (FIRST_PSEUDO_REGISTER): Adjust
25872 (enum reg_class): Add CC_REG class.
25873 (NOTICE_UPDATE_CC): Remove.
25874 (CC_OVERFLOW_UNUSABLE): Remove.
25875 (CC_NO_CARRY): Remove.
25876 * config/avr/avr.md: Turn existing patterns into
25877 define_insn_and_split style patterns where the splitter
25878 adds a clobber of the condition code register. Drop "cc"
25879 attribute. Add new patterns to match output of
25881 (sez): Remove unused pattern.
25883 2021-04-28 Richard Earnshaw <rearnsha@arm.com>
25886 * config/arm/arm.c (arm_hard_regno_mode_ok): Only allow VPR to be
25889 2021-04-28 Richard Sandiford <richard.sandiford@arm.com>
25892 * config/aarch64/constraints.md (Utq): Require the address to
25893 be valid for both the element mode and for V2DImode.
25895 2021-04-28 Jakub Jelinek <jakub@redhat.com>
25896 Tobias Burnus <tobias@codesourcery.com>
25898 * configure.ac (OFFLOAD_DEFAULTED): AC_DEFINE if offload-defaulted.
25899 * gcc.c (process_command): New variable.
25900 (driver::maybe_putenv_OFFLOAD_TARGETS): If OFFLOAD_DEFAULTED,
25901 set it if -foffload is defaulted.
25902 * lto-wrapper.c (OFFLOAD_TARGET_DEFAULT_ENV): Define.
25903 (compile_offload_image): If OFFLOAD_DEFAULTED and
25904 OFFLOAD_TARGET_DEFAULT is in the environment, don't fail
25905 if corresponding mkoffload can't be found.
25906 (compile_images_for_offload_targets): Likewise. Free and clear
25907 offload_names if no valid offload is found.
25908 * config.in: Regenerate.
25909 * configure: Regenerate.
25911 2021-04-28 Richard Biener <rguenther@suse.de>
25913 PR tree-optimization/100292
25914 * tree-vect-generic.c (expand_vector_condition): Do not fold
25917 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
25919 * config/rs6000/aix.h (SUBTARGET_DRIVER_SELF_SPECS): New.
25920 * config/rs6000/aix64.opt (m64): New.
25923 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
25925 * config/vax/vax.c (print_operand_address, vax_address_cost_1)
25926 (index_term_p): Handle ASHIFT too.
25928 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
25930 * config/vax/builtins.md (jbb<ccss>i<mode>): Remove operand #3.
25931 (sync_lock_test_and_set<mode>): Adjust accordingly.
25932 (sync_lock_release<mode>): Likewise.
25934 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
25936 * config/vax/vax-protos.h (adjacent_operands_p): Remove
25938 * config/vax/vax.c (adjacent_operands_p): Remove.
25940 2021-04-27 Maciej W. Rozycki <macro@linux-mips.org>
25942 * ifcvt.c (dead_or_predicable) [!IFCVT_MODIFY_TESTS]: Fall
25943 through to the non-conditional execution case if getting the
25944 condition for conditional execution has failed.
25946 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
25948 PR middle-end/100284
25949 * gimple.c (gimple_could_trap_p_1): Remove VEC_COND_EXPR test.
25950 * tree-eh.c (operation_could_trap_p): Handle VEC_COND_EXPR rather
25951 than asserting on it.
25953 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
25955 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): Protect
25956 with TARGET_AIX_OS.
25958 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
25961 * calls.c (precompute_register_parameters): Additionally test
25962 targetm.precompute_tls_p to pre-compute argument.
25963 * config/rs6000/aix.h (TARGET_PRECOMPUTE_TLS_P): Define.
25964 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): New.
25965 * target.def (precompute_tls_p): New.
25966 * doc/tm.texi.in (TARGET_PRECOMPUTE_TLS_P): Add hook documentation.
25967 * doc/tm.texi: Regenerated.
25969 2021-04-27 Jakub Jelinek <jakub@redhat.com>
25972 * config/aarch64/aarch64.c (aarch64_print_operand): Cast -UINTVAL
25973 back to HOST_WIDE_INT.
25975 2021-04-27 Bernd Edlinger <bernd.edlinger@hotmail.de>
25978 * simplify-rtx.c (simplify_context::simplify_subreg): Check the
25979 memory alignment for the outer mode.
25981 2021-04-27 H.J. Lu <hjl.tools@gmail.com>
25983 PR middle-end/90773
25984 * expr.c (op_by_pieces_d::get_usable_mode): New member function.
25985 (op_by_pieces_d::run): Cange a while loop to a do-while loop.
25987 2021-04-27 Alex Coplan <alex.coplan@arm.com>
25990 * config/arm/arm.c (arm_split_compare_and_swap): Fix up codegen
25991 with negative immediates: ensure we expand cbranchsi4_scratch
25992 correctly and ensure we satisfy its constraints.
25993 * config/arm/sync.md
25994 (@atomic_compare_and_swap<CCSI:arch><NARROW:mode>_1): Don't
25995 attempt to tie two output operands together with constraints;
25996 collapse two alternatives.
25997 (@atomic_compare_and_swap<CCSI:arch><SIDI:mode>_1): Likewise.
25998 * config/arm/thumb1.md (cbranchsi4_neg_late): New.
26000 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26003 * config/aarch64/predicates.md (aarch64_sub_immediate,
26004 aarch64_plus_immediate): Use -UINTVAL instead of -INTVAL.
26005 * config/aarch64/aarch64.md (casesi, rotl<mode>3): Likewise.
26006 * config/aarch64/aarch64.c (aarch64_print_operand,
26007 aarch64_split_atomic_op, aarch64_expand_subvti): Likewise.
26009 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26011 PR tree-optimization/100239
26012 * tree-vect-generic.c (lower_vec_perm): Don't accept constant
26013 permutations with all indices from the first zero element as vec_shl.
26015 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26017 PR rtl-optimization/100254
26018 * cfgcleanup.c (outgoing_edges_match): Check REG_EH_REGION on
26019 last1 and last2 insns rather than BB_END (bb1) and BB_END (bb2) insns.
26021 2021-04-27 Richard Biener <rguenther@suse.de>
26023 PR tree-optimization/99912
26024 * passes.def: Add comment about new TODO_remove_unused_locals.
26025 * tree-stdarg.c (pass_data_stdarg): Run TODO_remove_unused_locals
26028 2021-04-27 Richard Biener <rguenther@suse.de>
26030 PR tree-optimization/99912
26031 * passes.def (pass_all_optimizations): Add pass_dse before
26032 the first pass_dce, move the first pass_dse before the
26033 pass_dce following pass_pre.
26035 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26037 PR tree-optimization/95527
26038 * generic-match-head.c: Include tm.h.
26039 * gimple-match-head.c: Include tm.h.
26040 * match.pd (CLZ == INTEGER_CST): Don't use
26041 #ifdef CLZ_DEFINED_VALUE_AT_ZERO, only test CLZ_DEFINED_VALUE_AT_ZERO
26042 if clz == CFN_CLZ. Add missing val declaration.
26043 (CTZ cmp CST): New simplifications.
26045 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26047 PR tree-optimization/96696
26048 * expr.c (expand_expr_divmod): New function.
26049 (expand_expr_real_2) <case TRUNC_DIV_EXPR>: Use it for truncations and
26050 divisions. Formatting fixes.
26051 <case MULT_EXPR>: Optimize x / y * y as x - x % y if the latter is
26054 2021-04-27 Martin Jambor <mjambor@suse.cz>
26057 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
26058 If removing a call statement LHS SSA name, release it.
26060 2021-04-27 Richard Earnshaw <rearnsha@arm.com>
26063 * config/arm/arm.c (THUMB2_WORK_REGS): Check PIC_OFFSET_TABLE_REGNUM
26064 is valid before including it in the mask.
26066 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
26069 * config/aarch64/aarch64.c (aarch64_comp_type_attributes): Handle
26072 2021-04-27 Richard Biener <rguenther@suse.de>
26074 PR tree-optimization/100051
26075 * tree-ssa-alias.c (indirect_ref_may_alias_decl_p): Add
26076 disambiguator based on access size vs. decl size.
26078 2021-04-27 Richard Biener <rguenther@suse.de>
26080 PR tree-optimization/100278
26081 * tree-ssa-pre.c (compute_avail): Give up when we cannot
26082 adjust TBAA beacuse of mismatching bases.
26084 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26087 * config/i386/i386.md (*<insn><mode>3_mask, *<insn><mode>3_mask_1):
26088 For any_rotate define_insn_split and following splitters, use
26089 SWI iterator instead of SWI48.
26091 2021-04-27 Richard Biener <rguenther@suse.de>
26093 PR tree-optimization/99776
26094 * match.pd (bit_field_ref (ctor)): Relax element extract
26095 type compatibility checks.
26097 2021-04-27 Cui,Lili <lili.cui@intel.com>
26099 * common/config/i386/i386-common.c (processor_names):
26100 Sync processor_names with processor_type.
26101 * config/i386/i386-options.c (processor_cost_table):
26102 Sync processor_cost_table with processor_type.
26104 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26106 * value-range.cc (irange::irange_set_1bit_anti_range): Add assert.
26107 (irange::set): Call irange_set_1bit_anti_range for handling all
26108 1-bit ranges. Fall through on ~[MIN,MAX].
26110 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26112 * value-range.cc (irange::legacy_num_pairs): Remove.
26113 (irange::invert): Change gcc_assert to gcc_checking_assert.
26114 * value-range.h (irange::num_pairs): Adjust for a cached
26115 num_pairs(). Also, rename all gcc_assert's to
26116 gcc_checking_assert's.
26118 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26120 * value-range.cc (irange::operator=): Set m_kind.
26121 (irange::copy_to_legacy): Handle varying and undefined sources
26122 as a legacy copy since they can be easily copied.
26123 (irange::irange_set): Set m_kind.
26124 (irange::irange_set_anti_range): Same.
26125 (irange::set): Rename normalize_min_max to normalize_kind.
26126 (irange::verify_range): Adjust for multi-ranges having the
26128 (irange::irange_union): Set m_kind.
26129 (irange::irange_intersect): Same.
26130 (irange::invert): Same.
26131 * value-range.h (irange::kind): Always return m_kind.
26132 (irange::varying_p): Rename to...
26133 (irange::varying_comptaible_p): ...this.
26134 (irange::undefined_p): Only look at m_kind.
26135 (irange::irange): Always set VR_UNDEFINED if applicable.
26136 (irange::set_undefined): Always set VR_UNDEFINED.
26137 (irange::set_varying): Always set m_kind to VR_VARYING.
26138 (irange::normalize_min_max): Rename to...
26139 (irange::normalize_kind): ...this.
26141 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26143 * gimple-ssa-evrp-analyze.c (evrp_range_analyzer::set_ssa_range_info):
26144 Adjust for constant_p including varying_p.
26145 * tree-vrp.c (vrp_prop::finalize): Same.
26146 (determine_value_range): Same.
26147 * vr-values.c (vr_values::range_of_expr): Same.
26148 * value-range.cc (irange::symbolic_p): Do not check varying_p.
26149 (irange::constant_p): Same.
26151 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26153 * value-range.cc (irange::legacy_lower_bound): Replace
26154 !undefined_p check with num_ranges > 0.
26155 (irange::legacy_upper_bound): Same.
26156 * value-range.h (irange::type): Same.
26157 (irange::lower_bound): Same.
26158 (irange::upper_bound): Same.
26160 2021-04-26 Richard Biener <rguenther@suse.de>
26162 PR tree-optimization/99956
26163 * gimple-loop-interchange.cc (compute_access_stride):
26164 Try instantiating the access in a shallower loop nest
26165 if instantiating failed.
26166 (compute_access_strides): Pass adjustable loop_nest
26167 to compute_access_stride.
26169 2021-04-26 Christophe Lyon <christophe.lyon@linaro.org>
26171 * doc/sourcebuild.texi (arm_cmse_hw): Document.
26173 2021-04-26 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26175 * config/aarch64/iterators.md (vwcore): Handle V4BF, V8BF.
26177 2021-04-26 Thomas Schwinge <thomas@codesourcery.com>
26178 Nathan Sidwell <nathan@codesourcery.com>
26179 Tom de Vries <vries@codesourcery.com>
26180 Julian Brown <julian@codesourcery.com>
26181 Kwok Cheung Yeung <kcy@codesourcery.com>
26183 * omp-offload.c (oacc_validate_dims): Implement
26184 '-Wopenacc-parallelism'.
26185 * doc/invoke.texi (-Wopenacc-parallelism): Document.
26187 2021-04-26 Richard Biener <rguenther@suse.de>
26189 * tree-cfg.h (gimplify_build1): Remove.
26190 (gimplify_build2): Likewise.
26191 (gimplify_build3): Likewise.
26192 * tree-cfg.c (gimplify_build1): Move to tree-vect-generic.c.
26193 (gimplify_build2): Likewise.
26194 (gimplify_build3): Likewise.
26195 * tree-vect-generic.c (gimplify_build1): Move from tree-cfg.c.
26197 (gimplify_build2): Likewise.
26198 (gimplify_build3): Likewise.
26199 (tree_vec_extract): Use resimplify with following SSA edges.
26200 (expand_vector_parallel): Avoid passing NULL size/bitpos
26201 to tree_vec_extract.
26202 * expr.c (store_constructor): Deal with zero-element CTORs.
26203 * match.pd (bit_field_ref <vector CTOR>): Make sure to
26204 produce vector constants when possible.
26206 2021-04-26 Richard Biener <rguenther@suse.de>
26208 * tree-complex.c: Include gimple-fold.h.
26209 (expand_complex_addition): Use gimple_build.
26210 (expand_complex_multiplication_components): Likewise.
26211 (expand_complex_multiplication): Likewise.
26212 (expand_complex_div_straight): Likewise.
26213 (expand_complex_div_wide): Likewise.
26214 (expand_complex_division): Likewise.
26215 (expand_complex_conjugate): Likewise.
26216 (expand_complex_comparison): Likewise.
26218 2021-04-26 Richard Biener <rguenther@suse.de>
26220 * tree-ssa-phiopt.c (two_value_replacement): Remove use
26221 of legacy gimplify_buildN API.
26223 2021-04-26 Richard Biener <rguenther@suse.de>
26225 PR tree-optimization/99473
26226 * tree-ssa-phiopt.c (cond_store_replacement): Handle all
26229 2021-04-26 Richard Biener <rguenther@suse.de>
26231 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin):
26232 Use replace_call_with_value.
26234 2021-04-26 Richard Biener <rguenther@suse.de>
26236 * tree-ssa-propagate.h (valid_gimple_rhs_p): Remove.
26237 (update_gimple_call): Likewise.
26238 (update_call_from_tree): Likewise.
26239 * tree-ssa-propagate.c (valid_gimple_rhs_p): Remove.
26240 (valid_gimple_call_p): Likewise.
26241 (move_ssa_defining_stmt_for_defs): Likewise.
26242 (finish_update_gimple_call): Likewise.
26243 (update_gimple_call): Likewise.
26244 (update_call_from_tree): Likewise.
26245 (propagate_tree_value_into_stmt): Use replace_call_with_value.
26246 * gimple-fold.h (update_gimple_call): Declare.
26247 * gimple-fold.c (valid_gimple_rhs_p): Move here from
26248 tree-ssa-propagate.c.
26249 (update_gimple_call): Likewise.
26250 (valid_gimple_call_p): Likewise.
26251 (finish_update_gimple_call): Likewise, and simplify.
26252 (gimplify_and_update_call_from_tree): Implement
26253 update_call_from_tree functionality, avoid excessive
26254 push/pop_gimplify_context.
26255 (gimple_fold_builtin): Use only gimplify_and_update_call_from_tree.
26256 (gimple_fold_call): Likewise.
26257 * gimple-ssa-sprintf.c (try_substitute_return_value): Likewise.
26258 * tree-ssa-ccp.c (ccp_folder::fold_stmt): Likewise.
26259 (pass_fold_builtins::execute): Likewise.
26260 (optimize_stack_restore): Use replace_call_with_value.
26261 * tree-cfg.c (fold_loop_internal_call): Likewise.
26262 * tree-ssa-dce.c (maybe_optimize_arith_overflow): Use
26263 only gimplify_and_update_call_from_tree.
26264 * tree-ssa-strlen.c (handle_builtin_strlen): Likewise.
26265 (handle_builtin_strchr): Likewise.
26266 * tsan.c: Include gimple-fold.h instead of tree-ssa-propagate.h.
26268 2021-04-26 Jakub Jelinek <jakub@redhat.com>
26271 * vmsdbgout.c (ASM_OUTPUT_DEBUG_STRING, vmsdbgout_begin_block,
26272 vmsdbgout_end_block, lookup_filename, vmsdbgout_source_line): Remove
26275 2021-04-25 liuhongt <hongtao.liu@intel.com>
26278 * config/i386/i386-builtin.def (BDESC): Change the icode of
26279 the following builtins to CODE_FOR_nothing.
26280 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
26281 IX86_BUILTIN_PCMPEQB128, IX86_BUILTIN_PCMPEQW128,
26282 IX86_BUILTIN_PCMPEQD128, IX86_BUILTIN_PCMPEQQ,
26283 IX86_BUILTIN_PCMPEQB256, IX86_BUILTIN_PCMPEQW256,
26284 IX86_BUILTIN_PCMPEQD256, IX86_BUILTIN_PCMPEQQ256,
26285 IX86_BUILTIN_PCMPGTB128, IX86_BUILTIN_PCMPGTW128,
26286 IX86_BUILTIN_PCMPGTD128, IX86_BUILTIN_PCMPGTQ,
26287 IX86_BUILTIN_PCMPGTB256, IX86_BUILTIN_PCMPGTW256,
26288 IX86_BUILTIN_PCMPGTD256, IX86_BUILTIN_PCMPGTQ256.
26289 * config/i386/sse.md (avx2_eq<mode>3): Deleted.
26290 (sse2_eq<mode>3): Ditto.
26291 (sse4_1_eqv2di3): Ditto.
26292 (sse2_gt<mode>3): Rename to ..
26293 (*sse2_gt<mode>3): .. this.
26295 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
26298 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
26301 * config/darwin.c (darwin_binds_local_p): Assume that any
26302 public symbol might be interposed for PIC code. Update function
26303 header comment to reflect current Darwin capability.
26305 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
26308 * config/darwin.c (darwin_binds_local_p): Assume that any
26309 public symbol might be interposed for PIC code. Update function
26310 header comment to reflect current Darwin capability.
26312 2021-04-24 Richard Sandiford <richard.sandiford@arm.com>
26314 * doc/sourcebuild.texi: Document no-opts and any-opts target
26317 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
26319 * config/bpf/bpf.h (ASM_OUTPUT_ALIGNED_BSS): Use .type and .lcomm.
26321 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
26323 * config/bpf/bpf.h (FUNCTION_BOUNDARY): Set to 64.
26325 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
26328 * config/i386/i386-options.c (ix86_option_override_internal):
26329 Error out when -m96bit-long-double is used with 64bit targets.
26330 * config/i386/i386.md (*pushxf_rounded): Remove pattern.
26332 2021-04-23 Martin Liska <mliska@suse.cz>
26334 * lto-wrapper.c: Remove FIXME about usage of
26335 hardware_concurrency. The function is not on par with
26338 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
26341 * config/i386/sync.md (FILD_ATOMIC/FIST_ATOMIC FP load peephole2):
26342 Copy operand 3 to operand 4. Use sse_reg_operand
26343 as operand 3 predicate.
26344 (FILD_ATOMIC/FIST_ATOMIC FP load peephole2 with mem blockage): Ditto.
26345 (LDX_ATOMIC/STX_ATOMIC FP load peephole2): Ditto.
26346 (LDX_ATOMIC/LDX_ATOMIC FP load peephole2 with mem blockage): Ditto.
26347 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2):
26348 Copy operand 1 to operand 0.
26349 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2 with mem blockage): Ditto.
26350 (LDX_ATOMIC/STX_ATOMIC FP store peephole2): Ditto.
26351 (LDX_ATOMIC/LDX_ATOMIC FP store peephole2 with mem blockage): Ditto.
26353 2021-04-23 Alex Coplan <alex.coplan@arm.com>
26355 PR rtl-optimization/100230
26356 * early-remat.c (early_remat::sort_candidates): Use delete[]
26357 instead of delete for array allocated with new[].
26359 2021-04-23 Richard Biener <rguenther@suse.de>
26361 * genmatch.c (lower_cond): Remove VEC_COND_EXPR special-casing.
26362 (capture_info::capture_info): Likewise.
26363 (capture_info::walk_match): Likewise.
26364 (expr::gen_transform): Likewise.
26365 (dt_simplify::gen_1): Likewise.
26366 * gimple-match-head.c (maybe_resimplify_conditional_op):
26367 Remove VEC_COND_EXPR special-casing.
26368 (gimple_simplify): Likewise.
26369 * gimple.c (gimple_could_trap_p_1): Adjust.
26370 * tree-ssa-pre.c (compute_avail): Allow VEC_COND_EXPR
26371 to participate in PRE.
26373 2021-04-23 Richard Biener <rguenther@suse.de>
26375 * cfganal.c (connect_infinite_loops_to_exit): First call
26376 add_noreturn_fake_exit_edges.
26377 * ipa-sra.c (process_scan_results): Do not call the now redundant
26378 add_noreturn_fake_exit_edges.
26379 * predict.c (tree_estimate_probability): Likewise.
26380 (rebuild_frequencies): Likewise.
26381 * store-motion.c (one_store_motion_pass): Likewise.
26383 2021-04-23 Richard Biener <rguenther@suse.de>
26385 PR tree-optimization/100222
26386 * predict.c (pass_profile::execute): Remove redundant call to
26387 mark_irreducible_loops.
26388 (report_predictor_hitrates): Likewise.
26390 2021-04-23 Richard Biener <rguenther@suse.de>
26392 * tree-ssa-loop-ivopts.c (rewrite_use_nonlinear_expr): Avoid
26393 valid_gimple_rhs_p by instead gimplifying to one.
26395 2021-04-23 Richard Biener <rguenther@suse.de>
26397 PR tree-optimization/99971
26398 * tree-vect-data-refs.c (vect_slp_analyze_node_dependences):
26399 Always use TBAA for loads.
26401 2021-04-23 liuhongt <hongtao.liu@intel.com>
26404 * config/i386/i386-options.c (ix86_option_override_internal):
26405 Clear MASK_AVX256_SPLIT_UNALIGNED_LOAD/STORE in x_target_flags
26406 when X86_TUNE_AVX256_UNALIGNED_LOAD/STORE_OPTIMAL is enabled
26407 by target attribute.
26409 2021-04-23 David Edelsohn <dje.gcc@gmail.com>
26411 * config/rs6000/aix71.h (PREFERRED_DEBUGGING_TYPE): Change to
26413 * config/rs6000/aix72.h (PREFERRED_DEBUGGING_TYPE): Same.
26415 2021-04-22 David Edelsohn <dje.gcc@gmail.com>
26417 * config.gcc (powerpc-ibm-aix6.*): Remove.
26418 * config/rs6000/aix61.h: Delete.
26420 2021-04-22 Martin Liska <mliska@suse.cz>
26422 PR testsuite/100159
26423 PR testsuite/100192
26424 * builtins.c (expand_builtin): Fix typos and missing comments.
26425 * dwarf2out.c (gen_subprogram_die): Likewise.
26426 (gen_struct_or_union_type_die): Likewise.
26428 2021-04-22 Uroš Bizjak <ubizjak@gmail.com>
26431 * config/i386/i386-expand.c (ix86_expand_convert_uns_sidf_sse):
26432 Remove the sign with FE_DOWNWARD, where x - x = -0.0.
26434 2021-04-21 Iain Sandoe <iain@sandoe.co.uk>
26436 * config/i386/darwin.h (TARGET_64BIT): Remove definition
26437 based on TARGET_ISA_64BIT.
26438 (TARGET_64BIT_P): Remove definition based on
26439 TARGET_ISA_64BIT_P().
26441 2021-04-21 Martin Liska <mliska@suse.cz>
26444 2021-04-21 Martin Liska <mliska@suse.cz>
26446 * lto-wrapper.c (cpuset_popcount): Remove.
26447 (init_num_threads): Remove and use hardware_concurrency.
26449 2021-04-21 Martin Liska <mliska@suse.cz>
26452 * main.c (main): Call toplev::finalize in CHECKING_P mode.
26453 * ipa-modref.c (ipa_modref_c_finalize): summaries are NULL
26454 when incremental LTO linking happens.
26456 2021-04-21 Martin Liska <mliska@suse.cz>
26458 * lto-wrapper.c (run_gcc): When -flto=jobserver is used, but the
26459 makeserver cannot be detected, then use -flto=N fallback.
26461 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
26463 * acinclude.m4 (gcc_AC_INITFINI_ARRAY): When cross-compiling,
26464 default to yes for aarch64-linux-gnu.
26465 * configure: Regenerate.
26467 2021-04-21 Martin Liska <mliska@suse.cz>
26469 * lto-wrapper.c (cpuset_popcount): Remove.
26470 (init_num_threads): Remove and use hardware_concurrency.
26472 2021-04-21 Martin Liska <mliska@suse.cz>
26474 * config/i386/i386.c: Remove superfluous || TARGET_MACHO
26475 which remains to be '(... || 0)' and clang complains about it.
26476 * dwarf2out.c (AT_vms_delta): Declare conditionally.
26477 (add_AT_vms_delta): Likewise.
26478 * tree.c (fld_simplified_type): Use rather more common pattern
26479 for disabling of something (#if 0).
26480 (get_tree_code_name): Likewise.
26481 (verify_type_variant): Likewise.
26483 2021-04-21 Martin Liska <mliska@suse.cz>
26485 * config/i386/i386-expand.c (decide_alignment): Use newly named
26486 macro TARGET_CPU_P.
26487 * config/i386/i386.c (ix86_decompose_address): Likewise.
26488 (ix86_address_cost): Likewise.
26489 (ix86_lea_outperforms): Likewise.
26490 (ix86_avoid_lea_for_addr): Likewise.
26491 (ix86_add_stmt_cost): Likewise.
26492 * config/i386/i386.h (TARGET_*): Remove.
26493 (TARGET_CPU_P): New macro.
26494 * config/i386/i386.md: Use newly named macro TARGET_CPU_P.
26495 * config/i386/x86-tune-sched-atom.c (do_reorder_for_imul): Likewise.
26496 (swap_top_of_ready_list): Likewise.
26497 (ix86_atom_sched_reorder): Likewise.
26498 * config/i386/x86-tune-sched-bd.c (ix86_bd_has_dispatch): Likewise.
26499 * config/i386/x86-tune-sched.c (ix86_adjust_cost): Likewise.
26501 2021-04-21 Martin Liska <mliska@suse.cz>
26503 * config/i386/i386-options.c (TARGET_EXPLICIT_NO_SAHF_P):
26505 (SET_TARGET_NO_SAHF): Likewise.
26506 (TARGET_EXPLICIT_PREFETCH_SSE_P): Likewise.
26507 (SET_TARGET_PREFETCH_SSE): Likewise.
26508 (TARGET_EXPLICIT_NO_TUNE_P): Likewise.
26509 (SET_TARGET_NO_TUNE): Likewise.
26510 (TARGET_EXPLICIT_NO_80387_P): Likewise.
26511 (SET_TARGET_NO_80387): Likewise.
26513 * config/i386/i386.h (TARGET_*): Remove.
26514 * opth-gen.awk: Generate new used macros.
26516 2021-04-21 Martin Liska <mliska@suse.cz>
26518 * config/i386/i386.h (PTA_*): Remove.
26519 (enum pta_flag): New.
26520 (DEF_PTA): Generate PTA_* values from i386-isa.def.
26521 * config/i386/i386-isa.def: New file.
26523 2021-04-21 Alex Coplan <alex.coplan@arm.com>
26526 * config/aarch64/aarch64-bti-insert.c (aarch64_bti_j_insn_p): New.
26527 (rest_of_insert_bti): Avoid inserting duplicate bti j insns for
26528 jump table targets.
26530 2021-04-21 H.J. Lu <hjl.tools@gmail.com>
26532 * config.gcc: Install mwaitintrin.h for i[34567]86-*-* and
26533 x86_64-*-* targets.
26534 * common/config/i386/i386-common.c (OPTION_MASK_ISA2_MWAIT_SET):
26536 (OPTION_MASK_ISA2_MWAIT_UNSET): Likewise.
26537 (ix86_handle_option): Handle -mmwait.
26538 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
26539 Replace OPTION_MASK_ISA_SSE3 with OPTION_MASK_ISA2_MWAIT on
26540 __builtin_ia32_monitor and __builtin_ia32_mwait.
26541 * config/i386/i386-options.c (isa2_opts): Add -mmwait.
26542 (ix86_valid_target_attribute_inner_p): Likewise.
26543 (ix86_option_override_internal): Enable mwait/monitor
26544 instructions for -msse3.
26545 * config/i386/i386.h (TARGET_MWAIT): New.
26546 (TARGET_MWAIT_P): Likewise.
26547 * config/i386/i386.opt: Add -mmwait.
26548 * config/i386/mwaitintrin.h: New file.
26549 * config/i386/pmmintrin.h: Include <mwaitintrin.h>.
26550 * config/i386/sse.md (sse3_mwait): Replace TARGET_SSE3 with
26552 (@sse3_monitor_<mode>): Likewise.
26553 * config/i386/x86gprintrin.h: Include <mwaitintrin.h>.
26554 * doc/extend.texi: Document mwait target attribute.
26555 * doc/invoke.texi: Document -mmwait.
26557 2021-04-21 Martin Liska <mliska@suse.cz>
26559 * config/i386/i386-options.c (DEF_ENUM): Remove it.
26560 * config/i386/i386-opts.h (DEF_ENUM): Likewise.
26561 * config/i386/stringop.def (DEF_ENUM): Likewise.
26563 2021-04-21 Martin Liska <mliska@suse.cz>
26565 * tree-cfg.c (gimple_verify_flow_info): Use qD instead
26566 of print_generic_expr.
26568 2021-04-21 Jakub Jelinek <jakub@redhat.com>
26570 PR rtl-optimization/100148
26571 * cprop.c (constprop_register): Use next_nondebug_insn instead of
26574 2021-04-21 Martin Liska <mliska@suse.cz>
26577 * cgraphunit.c (cgraph_node::analyze): Remove duplicate
26578 free_dominance_info calls.
26580 2021-04-21 Richard Biener <rguenther@suse.de>
26582 * gimple-fold.c (maybe_fold_reference): Remove is_lhs
26583 parameter (and assume it to be false).
26584 (fold_gimple_assign): Adjust, remove all callers of
26585 maybe_fold_reference calling it with is_lhs true.
26586 (gimple_fold_call): Likewise.
26587 (fold_stmt_1): Likewise.
26589 2021-04-21 Richard Biener <rguenther@suse.de>
26591 * fold-const.c (pedantic_non_lvalue_loc): Remove.
26592 (fold_binary_loc): Adjust.
26593 (fold_ternary_loc): Likewise.
26595 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
26597 PR middle-end/100130
26598 * varasm.c (get_block_for_decl): Make sure that any use of the
26599 retain attribute matches the section's retain flag.
26600 (switch_to_section): Check for retain mismatches even when
26601 changing sections, but do not warn if the given decl is the
26602 section's named.decl.
26603 (output_object_block): Pass the first decl in the block (if any)
26604 to switch_to_section.
26606 2021-04-20 H.J. Lu <hjl.tools@gmail.com>
26608 * config/i386/i386-c.c (ix86_target_macros_internal): Define
26609 __CRC32__ for -mcrc32.
26610 * config/i386/i386-options.c (ix86_option_override_internal):
26611 Enable crc32 instruction for -msse4.2.
26612 * config/i386/i386.md (sse4_2_crc32<mode>): Remove TARGET_SSE4_2
26614 (sse4_2_crc32di): Likewise.
26615 * config/i386/ia32intrin.h: Use crc32 target option for CRC32
26618 2021-04-20 Segher Boessenkool <segher@kernel.crashing.org>
26621 * config/rs6000/rs6000.c (rs6000_machine_from_flags): Do not consider
26624 2021-04-20 Martin Liska <mliska@suse.cz>
26626 * doc/invoke.texi: Fix typo.
26627 * params.opt: Likewise.
26629 2021-04-20 Martin Liska <mliska@suse.cz>
26631 * doc/invoke.texi: Document new param.
26633 2021-04-19 Andrew MacLeod <amacleod@redhat.com>
26635 PR tree-optimization/100081
26636 * gimple-range-cache.h (ranger_cache): Inherit from gori_compute
26637 rather than gori_compute_cache.
26638 * gimple-range-gori.cc (is_gimple_logical_p): Move to top of file.
26639 (range_def_chain::m_logical_depth): New member.
26640 (range_def_chain::range_def_chain): Initialize m_logical_depth.
26641 (range_def_chain::get_def_chain): Don't build defchains through more
26642 than LOGICAL_LIMIT logical expressions.
26643 * params.opt (param_ranger_logical_depth): New.
26645 2021-04-19 Richard Earnshaw <rearnsha@arm.com>
26648 * config/arm/arm.c (arm_configure_build_target): Do not strip
26649 extended FPU/SIMD feature bits from the target ISA when -mfpu
26650 is specified (partial revert of r11-8168).
26652 2021-04-19 Thomas Schwinge <thomas@codesourcery.com>
26654 * params.opt (-param=openacc-kernels=): Add.
26655 * omp-oacc-kernels-decompose.cc
26656 (pass_omp_oacc_kernels_decompose::gate): Use it.
26657 * doc/invoke.texi (-fopenacc-kernels=@var{mode}): Move...
26658 (--param): ... here, 'openacc-kernels'.
26660 2021-04-19 Martin Liska <mliska@suse.cz>
26663 * gengtype.c (finish_root_table): Align function arguments
26664 in between declaration and definition.
26666 2021-04-19 Eric Botcazou <ebotcazou@adacore.com>
26668 * config/i386/winnt.c (i386_pe_seh_cold_init): Properly deal with
26669 frames larger than the SEH maximum frame size.
26671 2021-04-18 Segher Boessenkool <segher@kernel.crashing.org>
26673 PR rtl-optimization/99927
26674 * combine.c (distribute_notes) [REG_UNUSED]: If the register already
26675 is dead, just drop it.
26677 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
26680 * config/i386/winnt-d.c (TARGET_D_TEMPLATES_ALWAYS_COMDAT): Define.
26681 * doc/tm.texi: Regenerate.
26682 * doc/tm.texi.in (D language and ABI): Add @hook for
26683 TARGET_D_TEMPLATES_ALWAYS_COMDAT.
26685 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
26687 * config/darwin-d.c (darwin_d_handle_target_object_format): New
26689 (darwin_d_register_target_info): New function.
26690 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26691 * config/dragonfly-d.c (dragonfly_d_handle_target_object_format): New
26693 (dragonfly_d_register_target_info): New function.
26694 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26695 * config/freebsd-d.c (freebsd_d_handle_target_object_format): New
26697 (freebsd_d_register_target_info): New function.
26698 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26699 * config/glibc-d.c (glibc_d_handle_target_object_format): New
26701 (glibc_d_register_target_info): New function.
26702 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26703 * config/i386/i386-d.c (ix86_d_handle_target_object_format): New
26705 (ix86_d_register_target_info): Add ix86_d_handle_target_object_format
26706 as handler for objectFormat key.
26707 * config/i386/winnt-d.c (winnt_d_handle_target_object_format): New
26709 (winnt_d_register_target_info): New function.
26710 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26711 * config/netbsd-d.c (netbsd_d_handle_target_object_format): New
26713 (netbsd_d_register_target_info): New function.
26714 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26715 * config/openbsd-d.c (openbsd_d_handle_target_object_format): New
26717 (openbsd_d_register_target_info): New function.
26718 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26719 * config/pa/pa-d.c (pa_d_handle_target_object_format): New function.
26720 (pa_d_register_target_info): Add pa_d_handle_target_object_format as
26721 handler for objectFormat key.
26722 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_object_format): New
26724 (rs6000_d_register_target_info): Add
26725 rs6000_d_handle_target_object_format as handler for objectFormat key.
26726 * config/sol2-d.c (solaris_d_handle_target_object_format): New
26728 (solaris_d_register_target_info): New function.
26729 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26731 2021-04-16 Jakub Jelinek <jakub@redhat.com>
26734 * config/aarch64/aarch64.c (aarch64_function_arg_alignment): Change
26735 abi_break argument from bool * to unsigned *, store there the pre-GCC 9
26737 (aarch64_layout_arg, aarch64_gimplify_va_arg_expr): Adjust callers.
26738 (aarch64_function_arg_regno_p): Likewise. Only emit -Wpsabi note if
26739 the old and new alignment after applying MIN/MAX to it is different.
26741 2021-04-16 Tamar Christina <tamar.christina@arm.com>
26744 * config/aarch64/aarch64-sve.md (@aarch64_sve_trn1_conv<mode>): New.
26745 * config/aarch64/aarch64.c (aarch64_expand_sve_const_pred_trn): Use new
26747 * config/aarch64/iterators.md (UNSPEC_TRN1_CONV): New.
26749 2021-04-16 Bill Schmidt <wschmidt@linux.ibm.com>
26751 * doc/extend.texi (PowerPC AltiVec/VSX Built-in Functions): Revise
26752 this section and its subsections.
26754 2021-04-16 Jakub Jelinek <jakub@redhat.com>
26757 * config/aarch64/aarch64.md (*neg_asr_si2_extr, *extrsi5_insn_di): New
26758 define_insn patterns.
26760 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
26762 PR rtl-optimization/98689
26763 * reg-notes.def (UNTYPED_CALL): New note.
26764 * combine.c (distribute_notes): Handle it.
26765 * emit-rtl.c (try_split): Likewise.
26766 * rtlanal.c (rtx_properties::try_to_add_insn): Likewise. Assume
26767 that calls with the note implicitly set all return value registers.
26768 * builtins.c (expand_builtin_apply): Add a REG_UNTYPED_CALL
26771 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
26773 PR rtl-optimization/99596
26774 * rtlanal.c (rtx_properties::try_to_add_insn): Don't add global
26775 register accesses for const calls. Assume that pure functions
26776 can only read from global registers. Ignore cases in which
26777 the stack pointer has been marked global.
26779 2021-04-16 Jakub Jelinek <jakub@redhat.com>
26782 * tree-vect-loop.c (vect_transform_loop): Don't remove just
26783 dead scalar .MASK_LOAD calls, but also dead .COND_* calls - replace
26784 them by their last argument.
26786 2021-04-15 Martin Liska <mliska@suse.cz>
26788 * doc/invoke.texi: Other params don't use it, remove it.
26790 2021-04-15 Richard Biener <rguenther@suse.de>
26792 * gimple-builder.h: Add deprecation note.
26794 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
26797 * attribs.h (restrict_type_identity_attributes_to): Declare.
26798 * attribs.c (restrict_type_identity_attributes_to): New function.
26800 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
26803 * attribs.h (affects_type_identity_attributes): Declare.
26804 * attribs.c (remove_attributes_matching): New function.
26805 (affects_type_identity_attributes): Likewise.
26807 2021-04-15 Jakub Jelinek <jakub@redhat.com>
26810 * config/aarch64/aarch64.md (*<LOGICAL:optab>_<SHIFT:optab><mode>3):
26811 Add combine splitters for *<LOGICAL:optab>_ashl<mode>3 with
26812 ZERO_EXTEND, SIGN_EXTEND or AND.
26814 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
26816 PR rtl-optimization/99929
26817 * rtl.h (same_vector_encodings_p): New function.
26818 * cse.c (exp_equiv_p): Check that CONST_VECTORs have the same encoding.
26819 * cselib.c (rtx_equal_for_cselib_1): Likewise.
26820 * jump.c (rtx_renumbered_equal_p): Likewise.
26821 * lra-constraints.c (operands_match_p): Likewise.
26822 * reload.c (operands_match_p): Likewise.
26823 * rtl.c (rtx_equal_p_cb, rtx_equal_p): Likewise.
26825 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
26827 * print-rtl.c (rtx_writer::print_rtx_operand_codes_E_and_V): Print
26828 more information about variable-length CONST_VECTORs.
26830 2021-04-14 Vladimir N. Makarov <vmakarov@redhat.com>
26832 PR rtl-optimization/100066
26833 * lra-constraints.c (split_reg): Check paradoxical_subreg_p for
26834 ordered modes when choosing splitting mode for hard reg.
26836 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
26839 * config/aarch64/aarch64.c (aarch64_expand_sve_const_vector_sel):
26841 (aarch64_expand_sve_const_vector): Use it for nelts_per_pattern==2.
26843 2021-04-14 Andreas Krebbel <krebbel@linux.ibm.com>
26845 * config/s390/s390-builtins.def (O_M5, O_M12, ...): Add new macros
26846 for mask operand types.
26847 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
26848 (s390_vec_permi_dbl, s390_vpdi): Use the M5 type for the immediate
26850 (s390_vec_msum_u128, s390_vmslg): Use the M12 type for the
26852 * config/s390/s390.c (s390_const_operand_ok): Check the new
26853 operand types and generate a list of valid values.
26855 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
26857 * doc/tm.texi: Regenerate.
26858 * doc/tm.texi.in (D language and ABI): Add @hook for
26859 TARGET_D_REGISTER_OS_TARGET_INFO.
26861 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
26863 * config/aarch64/aarch64-d.c (aarch64_d_handle_target_float_abi): New
26865 (aarch64_d_register_target_info): New function.
26866 * config/aarch64/aarch64-protos.h (aarch64_d_register_target_info):
26868 * config/aarch64/aarch64.h (TARGET_D_REGISTER_CPU_TARGET_INFO):
26870 * config/arm/arm-d.c (arm_d_handle_target_float_abi): New function.
26871 (arm_d_register_target_info): New function.
26872 * config/arm/arm-protos.h (arm_d_register_target_info): Declare.
26873 * config/arm/arm.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26874 * config/i386/i386-d.c (ix86_d_handle_target_float_abi): New function.
26875 (ix86_d_register_target_info): New function.
26876 * config/i386/i386-protos.h (ix86_d_register_target_info): Declare.
26877 * config/i386/i386.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26878 * config/mips/mips-d.c (mips_d_handle_target_float_abi): New function.
26879 (mips_d_register_target_info): New function.
26880 * config/mips/mips-protos.h (mips_d_register_target_info): Declare.
26881 * config/mips/mips.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26882 * config/pa/pa-d.c (pa_d_handle_target_float_abi): New function.
26883 (pa_d_register_target_info): New function.
26884 * config/pa/pa-protos.h (pa_d_register_target_info): Declare.
26885 * config/pa/pa.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26886 * config/riscv/riscv-d.c (riscv_d_handle_target_float_abi): New
26888 (riscv_d_register_target_info): New function.
26889 * config/riscv/riscv-protos.h (riscv_d_register_target_info): Declare.
26890 * config/riscv/riscv.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26891 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_float_abi): New
26893 (rs6000_d_register_target_info): New function.
26894 * config/rs6000/rs6000-protos.h (rs6000_d_register_target_info):
26896 * config/rs6000/rs6000.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26897 * config/s390/s390-d.c (s390_d_handle_target_float_abi): New function.
26898 (s390_d_register_target_info): New function.
26899 * config/s390/s390-protos.h (s390_d_register_target_info): Declare.
26900 * config/s390/s390.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26901 * config/sparc/sparc-d.c (sparc_d_handle_target_float_abi): New
26903 (sparc_d_register_target_info): New function.
26904 * config/sparc/sparc-protos.h (sparc_d_register_target_info): Declare.
26905 * config/sparc/sparc.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26906 * doc/tm.texi: Regenerate.
26907 * doc/tm.texi.in (D language and ABI): Add @hook for
26908 TARGET_D_REGISTER_CPU_TARGET_INFO.
26910 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
26912 * config/i386/i386-d.c (ix86_d_has_stdcall_convention): New function.
26913 * config/i386/i386-protos.h (ix86_d_has_stdcall_convention): Declare.
26914 * config/i386/i386.h (TARGET_D_HAS_STDCALL_CONVENTION): Define.
26915 * doc/tm.texi: Regenerate.
26916 * doc/tm.texi.in (D language and ABI): Add @hook for
26917 TARGET_D_HAS_STDCALL_CONVENTION.
26919 2021-04-14 Richard Biener <rguenther@suse.de>
26921 * tree-cfg.c (verify_gimple_assign_ternary): Verify that
26922 VEC_COND_EXPRs have a gimple_val condition.
26923 * tree-ssa-propagate.c (valid_gimple_rhs_p): VEC_COND_EXPR
26924 can no longer have a GENERIC condition.
26926 2021-04-14 Richard Earnshaw <rearnsha@arm.com>
26929 * config/arm/arm.c (arm_configure_build_target): Strip isa_all_fpbits
26930 from the isa_delta when -mfpu has been used.
26931 (arm_options_perform_arch_sanity_checks): It's the architecture that
26932 lacks an FPU not the processor.
26934 2021-04-13 Richard Biener <rguenther@suse.de>
26936 PR tree-optimization/100053
26937 * tree-ssa-sccvn.c (vn_nary_op_get_predicated_value): Do
26938 not use optimistic dominance queries for backedges to validate
26940 (dominated_by_p_w_unex): Add parameter to ignore executable
26941 state on backedges.
26942 (rpo_elim::eliminate_avail): Adjust.
26944 2021-04-13 Jakub Jelinek <jakub@redhat.com>
26947 * config/aarch64/aarch64.md (*aarch64_bfxil<mode>_extr,
26948 *aarch64_bfxilsi_extrdi): New define_insn patterns.
26950 2021-04-13 Jakub Jelinek <jakub@redhat.com>
26953 * simplify-rtx.c (simplify_immed_subreg): For MODE_COMPOSITE_P
26954 outermode, return NULL if the result doesn't encode back to the
26955 original byte sequence.
26956 (simplify_gen_subreg): Don't create SUBREGs from constants to
26957 MODE_COMPOSITE_P outermode.
26959 2021-04-12 Jakub Jelinek <jakub@redhat.com>
26961 PR rtl-optimization/99905
26962 * combine.c (expand_compound_operation): If pos + len > modewidth,
26963 perform the right shift by pos in inner_mode and then convert to mode,
26964 instead of trying to simplify a shift of rtx with inner_mode by pos
26965 as if it was a shift in mode.
26967 2021-04-12 Jakub Jelinek <jakub@redhat.com>
26970 * combine.c (simplify_and_const_int_1): Don't optimize varop
26971 away if it has side-effects.
26973 2021-04-12 Martin Liska <mliska@suse.cz>
26975 * doc/extend.texi: Escape @smallexample content.
26977 2021-04-12 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
26979 * config/s390/s390.md ("*movdi_31", "*movdi_64"): Add
26980 alternative in order to load a DFP zero.
26982 2021-04-12 Martin Liska <mliska@suse.cz>
26984 * doc/extend.texi: Be more precise in documentation
26985 of symver attribute.
26987 2021-04-12 Martin Liska <mliska@suse.cz>
26990 * gimplify.c (gimplify_expr): Right now, we unpoison all
26991 variables before a goto <dest>. We should not do it if we are
26994 2021-04-12 Cui,Lili <lili.cui@intel.com>
26996 * common/config/i386/cpuinfo.h (get_intel_cpu): Handle
26998 * common/config/i386/i386-common.c (processor_names): Add
27000 (processor_alias_table): Add rocketlake.
27001 * common/config/i386/i386-cpuinfo.h (processor_subtypes): Add
27002 INTEL_COREI7_ROCKETLAKE.
27003 * config.gcc: Add -march=rocketlake.
27004 * config/i386/i386-c.c (ix86_target_macros_internal): Handle
27006 * config/i386/i386-options.c (m_ROCKETLAKE) : Define.
27007 (processor_cost_table): Add rocketlake cost.
27008 * config/i386/i386.h (ix86_size_cost) : Define
27010 (processor_type) : Add PROCESSOR_ROCKETLAKE.
27011 (PTA_ROCKETLAKE): Ditto.
27012 * doc/extend.texi: Add rocketlake.
27013 * doc/invoke.texi: Add rocketlake.
27015 2021-04-12 Cui,Lili <lili.cui@intel.com>
27017 * config/i386/i386.h (PTA_ALDERLAKE): Change alderlake ISA list.
27018 * config/i386/i386-options.c (m_CORE_AVX2): Add m_ALDERLAKE.
27019 * common/config/i386/cpuinfo.h (get_intel_cpu): Add AlderLake model.
27020 * doc/invoke.texi: Change alderlake ISA list.
27022 2021-04-11 Hafiz Abid Qadeer <abidh@codesourcery.com>
27024 PR middle-end/98088
27025 * omp-expand.c (expand_oacc_collapse_init): Update condition in
27028 2021-04-10 H.J. Lu <hjl.tools@gmail.com>
27031 * config/i386/serializeintrin.h (_serialize): Defined as macro.
27033 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27036 * expr.c (expand_expr_addr_expr_1): Test is_global_var rather than
27037 just TREE_STATIC on COMPOUND_LITERAL_EXPR_DECLs.
27039 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27041 PR middle-end/99989
27042 * gimple-ssa-warn-alloca.c
27043 (alloca_type_and_limit::alloca_type_and_limit): Initialize limit to
27044 0 with integer precision unconditionally.
27046 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27048 PR rtl-optimization/98601
27049 * rtlanal.c (rtx_addr_can_trap_p_1): Allow in assert unknown size
27050 not just for BLKmode, but also for VOIDmode. For STRICT_ALIGNMENT
27051 unaligned_mems handle VOIDmode like BLKmode.
27053 2021-04-10 Jan Hubicka <hubicka@ucw.cz>
27056 * tree.c (free_lang_data_in_decl): Do not release body of
27057 declare_variant_alt.
27059 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
27061 * config/aarch64/aarch64.c (aarch64_option_restore): If the
27062 architecture was specified explicitly and the tuning wasn't,
27063 tune for the architecture rather than the configured default CPU.
27065 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
27067 * config/aarch64/aarch64.md (tlsdesc_small_sve_<mode>): Use X30
27068 as the temporary register.
27070 2021-04-09 Martin Liska <mliska@suse.cz>
27072 * doc/extend.texi: Move non-target attributes on the top level.
27074 2021-04-09 Martin Liska <mliska@suse.cz>
27076 * doc/invoke.texi: Document minimum and maximum value of the
27077 argument for both supported compression algorithms.
27079 2021-04-08 David Edelsohn <dje.gcc@gmail.com>
27081 * config/rs6000/rs6000.c (rs6000_xcoff_select_section): Select
27082 TLS BSS before TLS data.
27083 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .comm.
27085 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
27087 * doc/sourcebuild.texi (stdint_types_mbig_endian): Document.
27089 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
27091 * match.pd: Extend vec_cond folds to handle shifts.
27093 2021-04-08 Maciej W. Rozycki <macro@orcam.me.uk>
27095 * config/vax/vax.md: Fix comment for `*bit<mode>' pattern's
27098 2021-04-08 Alex Coplan <alex.coplan@arm.com>
27101 * config/arm/iterators.md (MVE_vecs): New.
27102 (V_elem): Also handle V2DF.
27103 * config/arm/mve.md (*mve_mov<mode>): Rename to ...
27104 (*mve_vdup<mode>): ... this. Remove second alternative since
27105 vec_duplicate of const_int is not canonical RTL, and we don't
27106 want to match symbol_refs.
27107 (*mve_vec_duplicate<mode>): Delete (pattern is redundant).
27109 2021-04-08 Xionghu Luo <luoxhu@linux.ibm.com>
27111 * fold-const.c (fold_single_bit_test): Fix typo.
27112 * print-rtl.c (print_rtx_insn_vec): Call print_rtl_single
27115 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
27117 PR tree-optimization/97513
27118 * tree-vect-slp.c (vect_add_slp_permutation): New function,
27120 (vectorizable_slp_permutation): ...here. Detect cases in which
27121 all VEC_PERM_EXPRs are guaranteed to have the same stepped
27122 permute vector and only generate one permute vector for that case.
27123 Extend that case to handle variable-length vectors.
27125 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
27127 PR tree-optimization/99873
27128 * tree-vect-slp.c (vect_slp_prefer_store_lanes_p): New function.
27129 (vect_build_slp_instance): Don't split store groups that could
27130 use IFN_STORE_LANES.
27132 2021-04-07 Jakub Jelinek <jakub@redhat.com>
27135 * varasm.c (output_constant_pool_contents): Don't strip name encoding
27136 from XSTR (desc->sym, 0) or from label before passing those to
27139 2021-04-07 Richard Biener <rguenther@suse.de>
27141 PR tree-optimization/99954
27142 * tree-loop-distribution.c: Include tree-affine.h.
27143 (generate_memcpy_builtin): Try using tree-affine to prove
27145 (loop_distribution::classify_builtin_ldst): Always classify
27148 2021-04-07 Richard Biener <rguenther@suse.de>
27150 PR tree-optimization/99947
27151 * tree-vect-loop.c (vectorizable_induction): Pre-allocate
27152 steps vector to avoid pushing elements from the reallocated
27155 2021-04-07 Richard Biener <rguenther@suse.de>
27157 * tree-ssa-sccvn.h (print_vn_reference_ops): Declare.
27158 * tree-ssa-pre.c (print_pre_expr): Factor out VN reference operand
27160 * tree-ssa-sccvn.c (print_vn_reference_ops): ... into this new
27162 (debug_vn_reference_ops): New.
27164 2021-04-07 Bin Cheng <bin.cheng@linux.alibaba.com>
27166 PR tree-optimization/98736
27167 * tree-loop-distribution.c
27168 * (loop_distribution::bb_top_order_init):
27169 Compute RPO with programing order preserved by calling function
27170 rev_post_order_and_mark_dfs_back_seme.
27172 2021-04-06 Vladimir N. Makarov <vmakarov@redhat.com>
27175 * lra-constraints.c (split_reg): Don't check paradoxical_subreg_p.
27176 * lra-lives.c (clear_sparseset_regnos, regnos_in_sparseset_p): New
27178 (process_bb_lives): Don't update biggest mode of hard reg for
27179 implicit in multi-register group. Use the new functions for
27180 updating dead_set and unused_set by register notes.
27182 2021-04-06 Xianmiao Qu <xianmiao_qu@c-sky.com>
27184 * config/csky/csky_pipeline_ck802.md : Use insn reservation name
27187 2021-04-06 H.J. Lu <hjl.tools@gmail.com>
27189 * config/i386/x86-tune-costs.h (skylake_memcpy): Updated.
27190 (skylake_memset): Likewise.
27191 (skylake_cost): Change CLEAR_RATIO to 17.
27192 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
27193 Replace m_CANNONLAKE, m_ICELAKE_CLIENT, m_ICELAKE_SERVER,
27194 m_TIGERLAKE and m_SAPPHIRERAPIDS with m_SKYLAKE and m_CORE_AVX512.
27196 2021-04-06 Richard Biener <rguenther@suse.de>
27198 PR tree-optimization/99880
27199 * tree-vect-loop.c (maybe_set_vectorized_backedge_value): Only
27200 set vectorized defs of relevant PHIs.
27202 2021-04-06 Richard Biener <rguenther@suse.de>
27204 PR tree-optimization/99924
27205 * tree-vect-slp.c (vect_bb_partition_graph_r): Do not mark
27206 nodes w/o scalar stmts as visited.
27208 2021-04-06 Alex Coplan <alex.coplan@arm.com>
27211 * config/arm/arm.c (arm_libcall_uses_aapcs_base): Also use base
27212 PCS for [su]fix_optab.
27214 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
27216 * config/darwin.c (machopic_legitimize_pic_address): Check
27217 that the current pic register is one of the hard reg set
27218 before setting liveness.
27220 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
27222 * config/darwin.c (machopic_legitimize_pic_address): Fix
27223 whitespace, remove unused code.
27225 2021-04-03 Jakub Jelinek <jakub@redhat.com>
27227 PR tree-optimization/99882
27228 * gimple-ssa-store-merging.c (bswap_view_convert): Handle val with
27231 2021-04-03 Jakub Jelinek <jakub@redhat.com>
27233 PR rtl-optimization/99863
27234 * dse.c (replace_read): Drop regs_live argument. Instead of
27235 regs_live, use store_insn->fixed_regs_live if non-NULL,
27236 otherwise punt if insns sequence clobbers or sets any hard
27239 2021-04-03 Jakub Jelinek <jakub@redhat.com>
27242 * targhooks.h (default_print_patchable_function_entry_1): Declare.
27243 * targhooks.c (default_print_patchable_function_entry_1): New function,
27244 copied from default_print_patchable_function_entry with an added flags
27246 (default_print_patchable_function_entry): Rewritten into a small
27247 wrapper around default_print_patchable_function_entry_1.
27248 * config/rs6000/rs6000.c (TARGET_ASM_PRINT_PATCHABLE_FUNCTION_ENTRY):
27250 (rs6000_print_patchable_function_entry): New function.
27252 2021-04-02 Eric Botcazou <ebotcazou@adacore.com>
27254 * doc/invoke.texi (fdelete-dead-exceptions): Minor tweak.
27256 2021-04-01 Jason Merrill <jason@redhat.com>
27259 * common.opt: Document v15 and v16.
27261 2021-04-01 Richard Biener <rguenther@suse.de>
27263 PR tree-optimization/99863
27264 * gimplify.c (gimplify_init_constructor): Recompute vector
27267 2021-04-01 Jakub Jelinek <jakub@redhat.com>
27269 * doc/extend.texi (symver attribute): Fix up syntax errors
27272 2021-04-01 Jakub Jelinek <jakub@redhat.com>
27274 PR tree-optimization/96573
27275 * gimple-ssa-store-merging.c (init_symbolic_number): Handle
27276 also pointer types.
27278 2021-04-01 Richard Biener <rguenther@suse.de>
27280 PR tree-optimization/99856
27281 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Promote
27282 precision to vector element precision.
27284 2021-04-01 Martin Jambor <mjambor@suse.cz>
27286 PR tree-optimization/97009
27287 * tree-sra.c (access_or_its_child_written): New function.
27288 (propagate_subaccesses_from_rhs): Use it instead of a simple grp_write
27291 2021-03-31 Jan Hubicka <hubicka@ucw.cz>
27294 * cif-code.def (USES_COMDAT_LOCAL): Make CIF_FINAL_NORMAL.
27296 2021-03-31 Pat Haugen <pthaugen@linux.ibm.com>
27299 * config/rs6000/altivec.md (xxspltiw_v4si, xxspltiw_v4sf_inst,
27300 xxspltidp_v2df_inst, xxsplti32dx_v4si_inst, xxsplti32dx_v4sf_inst,
27301 xxblend_<mode>, xxpermx_inst, xxeval): Mark prefixed.
27302 * config/rs6000/mma.md (mma_<vvi4i4i8>, mma_<avvi4i4i8>,
27303 mma_<vvi4i4i2>, mma_<avvi4i4i2>, mma_<vvi4i4>, mma_<avvi4i4>,
27304 mma_<pvi4i2>, mma_<apvi4i2>, mma_<vvi4i4i4>, mma_<avvi4i4i4>):
27306 * config/rs6000/rs6000.c (rs6000_final_prescan_insn): Adjust test.
27307 * config/rs6000/rs6000.md (define_attr "maybe_prefixed"): New.
27308 (define_attr "prefixed"): Update initializer.
27310 2021-03-31 Jakub Jelinek <jakub@redhat.com>
27313 * dwarf2out.c (debug_ranges_dwo_section): New variable.
27314 (DW_RANGES_IDX_SKELETON): Define.
27315 (struct dw_ranges): Add begin_entry and end_entry members.
27316 (DEBUG_DWO_RNGLISTS_SECTION): Define.
27317 (add_ranges_num): Adjust r initializer for addition of *_entry
27319 (add_ranges_by_labels): For -gsplit-dwarf and force_direct,
27320 set idx to DW_RANGES_IDX_SKELETON.
27321 (use_distinct_base_address_for_range): New function.
27322 (index_rnglists): Don't set r->idx if it is equal to
27323 DW_RANGES_IDX_SKELETON. Initialize r->begin_entry and
27324 r->end_entry for -gsplit-dwarf if those will be needed by
27326 (output_rnglists): Add DWO argument. If true, switch to
27327 debug_ranges_dwo_section rather than debug_ranges_section.
27328 Adjust l1/l2 label indexes. Only output the offset table when
27329 dwo is true and don't include in there the skeleton range
27330 entry if present. For -gsplit-dwarf, skip ranges that belong
27331 to the other rnglists section. Change return type from void
27332 to bool and return true if there are any range entries for
27333 the other section. For dwarf_split_debug_info use
27334 DW_RLE_startx_endx, DW_RLE_startx_length and DW_RLE_base_addressx
27335 entries instead of DW_RLE_start_end, DW_RLE_start_length and
27336 DW_RLE_base_address. Use use_distinct_base_address_for_range.
27337 (init_sections_and_labels): Initialize debug_ranges_dwo_section
27338 if -gsplit-dwarf and DWARF >= 5. Adjust ranges_section_label
27339 and range_base_label indexes.
27340 (dwarf2out_finish): Call index_rnglists earlier before finalizing
27341 .debug_addr. Never emit DW_AT_rnglists_base attribute. For
27342 -gsplit-dwarf and DWARF >= 5 call output_rnglists up to twice
27343 with different dwo arguments.
27344 (dwarf2out_c_finalize): Clear debug_ranges_dwo_section.
27346 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
27348 PR tree-optimization/98268
27349 * gimple-fold.c (maybe_canonicalize_mem_ref_addr): Call
27350 recompute_tree_invariant_for_addr_expr after successfully
27351 folding a TARGET_MEM_REF that occurs inside an ADDR_EXPR.
27353 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
27355 PR tree-optimization/99726
27356 * tree-data-ref.c (create_intersect_range_checks_index): Bail
27357 out if there is more than one access function SCEV for the loop
27360 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
27362 PR rtl-optimization/97141
27363 PR rtl-optimization/98726
27364 * emit-rtl.c (valid_for_const_vector_p): Return true for
27366 * rtx-vector-builder.h (rtx_vector_builder::step): Return a
27367 poly_wide_int instead of a wide_int.
27368 (rtx_vector_builder::apply_set): Take a poly_wide_int instead
27370 * rtx-vector-builder.c (rtx_vector_builder::apply_set): Likewise.
27371 * config/aarch64/aarch64.c (aarch64_legitimate_constant_p): Return
27372 false for CONST_VECTORs that cannot be forced to memory.
27373 * config/aarch64/aarch64-simd.md (mov<mode>): If a CONST_VECTOR
27374 is too complex to force to memory, build it up from individual
27377 2021-03-31 Jan Hubicka <jh@suse.cz>
27380 * cgraph.c (cgraph_node::release_body): Fix overactive check.
27382 2021-03-31 Christophe Lyon <christophe.lyon@linaro.org>
27385 * config/arm/vec-common.md (mul<mode>3): Disable on iwMMXT, expect
27388 2021-03-31 H.J. Lu <hjl.tools@gmail.com>
27390 * config/i386/i386-expand.c (expand_set_or_cpymem_via_rep):
27391 For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, don't convert QImode
27393 (decide_alg): For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, use
27394 "rep movsb/stosb" only for known sizes.
27395 * config/i386/i386-options.c (processor_cost_table): Use Ice
27396 Lake cost for Cannon Lake, Ice Lake, Tiger Lake, Sapphire
27397 Rapids and Alder Lake.
27398 * config/i386/i386.h (TARGET_PREFER_KNOWN_REP_MOVSB_STOSB): New.
27399 * config/i386/x86-tune-costs.h (icelake_memcpy): New.
27400 (icelake_memset): Likewise.
27401 (icelake_cost): Likewise.
27402 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
27405 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
27408 * config/aarch64/aarch64.c
27409 (aarch64_vectorize_preferred_vector_alignment): Query the size
27410 of the provided SVE vector; do not assume that all SVE vectors
27411 have the same size.
27413 2021-03-31 Jan Hubicka <jh@suse.cz>
27416 * cgraph.c (cgraph_node::release_body): Remove all callers and
27418 * cgraphclones.c (cgraph_node::materialize_clone): Do not do it here.
27419 * cgraphunit.c (cgraph_node::expand): And here.
27421 2021-03-31 Martin Liska <mliska@suse.cz>
27423 * ipa-modref.c (analyze_ssa_name_flags): Fix coding style
27424 and one negated condition.
27426 2021-03-31 Jakub Jelinek <jakub@redhat.com>
27427 Richard Sandiford <richard.sandiford@arm.com>
27430 * config/aarch64/aarch64.md (*add<mode>3_poly_1): Swap Uai and Uav
27431 constraints on operands[2] and similarly 0 and rk constraints
27432 on operands[1] corresponding to that.
27434 2021-03-31 Jakub Jelinek <jakub@redhat.com>
27437 * configure.ac (HAVE_LD_BROKEN_PE_DWARF5): New AC_DEFINE if PECOFF
27438 linker doesn't support DWARF sections new in DWARF5.
27439 * config/i386/i386-options.c (ix86_option_override_internal): Default
27440 to dwarf_version 4 if HAVE_LD_BROKEN_PE_DWARF5 for TARGET_PECOFF
27442 * config.in: Regenerated.
27443 * configure: Regenerated.
27445 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27448 * config/aarch64/aarch64.c (aarch64_analyze_loop_vinfo): Check for
27449 available issue_info before using it.
27451 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27454 * config/aarch64/aarch64.md (sub<mode>3_compare1_imm): Do not allow zero
27457 2021-03-30 Xionghu Luo <luoxhu@linux.ibm.com>
27460 * config/rs6000/altivec.md (altivec_lvsl_reg): Change to ...
27461 (altivec_lvsl_reg_<mode>): ... this.
27462 (altivec_lvsr_reg): Change to ...
27463 (altivec_lvsr_reg_<mode>): ... this.
27464 * config/rs6000/predicates.md (vec_set_index_operand): New.
27465 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
27466 Enable 32bit variable vec_insert for all TARGET_VSX.
27467 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
27468 Enable 32bit variable vec_insert for p9 and above.
27469 (rs6000_expand_vector_set_var_p8): Rename to ...
27470 (rs6000_expand_vector_set_var_p7): ... this.
27471 (rs6000_expand_vector_set): Use TARGET_VSX and adjust assert
27473 * config/rs6000/vector.md (vec_set<mode>): Use vec_set_index_operand.
27474 * config/rs6000/vsx.md (xl_len_r): Use gen_altivec_lvsl_reg_di and
27475 gen_altivec_lvsr_reg_di.
27477 2021-03-30 H.J. Lu <hjl.tools@gmail.com>
27480 * config/i386/ia32intrin.h (__rdtsc): Defined as macro.
27481 (__rdtscp): Likewise.
27483 2021-03-30 Tamar Christina <tamar.christina@arm.com>
27485 PR tree-optimization/99825
27486 * tree-vect-slp-patterns.c (vect_check_evenodd_blend):
27487 Reject non-mult 2 lanes.
27489 2021-03-30 Richard Earnshaw <rearnsha@arm.com>
27492 * config/arm/arm.c (arm_file_start): Fix emission of
27493 Tag_ABI_VFP_args attribute.
27495 2021-03-30 Richard Biener <rguenther@suse.de>
27497 PR tree-optimization/99824
27498 * stor-layout.c (set_min_and_max_values_for_integral_type):
27499 Assert the precision is within the bounds of
27500 WIDE_INT_MAX_PRECISION.
27501 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Use
27502 the outermost component ref only to lower the access size
27503 and initialize that from the access type.
27505 2021-03-30 Richard Sandiford <richard.sandiford@arm.com>
27508 * config/aarch64/aarch64.md (mov<mode>): Pass multi-instruction
27509 CONST_INTs to aarch64_expand_mov_immediate when called after RA.
27511 2021-03-30 Mihailo Stojanovic <mihailo.stojanovic@typhoon-hil.com>
27513 * config/aarch64/aarch64.md
27514 (<optab>_trunc<fcvt_target><GPI:mode>2): Set the "arch"
27515 attribute to disambiguate between SIMD and FP variants of the
27518 2021-03-29 Jan Hubicka <hubicka@ucw.cz>
27520 * ipa-modref.c (merge_call_lhs_flags): Correct handling of deref.
27521 (analyze_ssa_name_flags): Fix typo in comment.
27523 2021-03-29 Alex Coplan <alex.coplan@arm.com>
27526 * config/aarch64/aarch64-sve-builtins.cc
27527 (function_builder::add_function): Add placeholder_p argument, use
27528 placeholder decls if this is set.
27529 (function_builder::add_unique_function): Instead of conditionally adding
27530 direct overloads, unconditionally add either a direct overload or a
27532 (function_builder::add_overloaded_function): Set placeholder_p if we're
27533 using C++ overloads. Use the obstack for string storage instead
27534 of relying on the tree nodes.
27535 (function_builder::add_overloaded_functions): Don't return early for
27536 m_direct_overloads: we need to add placeholders.
27537 * config/aarch64/aarch64-sve-builtins.h
27538 (function_builder::add_function): Add placeholder_p argument.
27540 2021-03-29 Richard Biener <rguenther@suse.de>
27542 PR tree-optimization/99807
27543 * tree-vect-slp.c (vect_slp_analyze_node_operations_1): Move
27544 assert below VEC_PERM handling.
27546 2021-03-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27549 * config/aarch64/aarch64-simd.md (move_lo_quad_internal_<mode>): Use
27550 aarch64_simd_or_scalar_imm_zero to match zeroes. Remove pattern
27551 matching const_int 0.
27552 (move_lo_quad_internal_be_<mode>): Likewise.
27553 (move_lo_quad_<mode>): Update for the above.
27554 * config/aarch64/iterators.md (VQ_2E): Delete.
27556 2021-03-29 Jakub Jelinek <jakub@redhat.com>
27558 PR tree-optimization/99777
27559 * fold-const.c (extract_muldiv_1): For conversions, punt on casts from
27560 types other than scalar integral types.
27562 2021-03-28 David Edelsohn <dje.gcc@gmail.com>
27564 * config/rs6000/rs6000.c (rs6000_output_dwarf_dtprel): Do not add
27565 XCOFF TLS reloc decorations.
27567 2021-03-28 Gerald Pfeifer <gerald@pfeifer.com>
27569 * doc/analyzer.texi (Analyzer Internals): Update link to
27570 "A Memory Model for Static Analysis of C Programs".
27572 2021-03-26 David Edelsohn <dje.gcc@gmail.com>
27574 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Call function.
27575 * config/rs6000/rs6000-protos.h (rs6000_special_adjust_field_align):
27577 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align): New.
27578 (rs6000_special_round_type_align): Recursively check innermost first
27581 2021-03-26 Jakub Jelinek <jakub@redhat.com>
27584 * dwarf2out.h (struct dw_fde_node): Add rule18 member.
27585 * dwarf2cfi.c (dwarf2out_frame_debug_expr): When handling (set hfp sp)
27586 assignment with drap_reg active, queue reg save for hfp with offset 0
27587 and flush queued reg saves. When handling a push with rule18,
27588 defer queueing reg save for hfp and just assert the offset is 0.
27589 (scan_trace): Assert that fde->rule18 is false.
27591 2021-03-26 Vladimir Makarov <vmakarov@redhat.com>
27594 * ira-costs.c (record_reg_classes): Put case with
27595 CT_RELAXED_MEMORY adjacent to one with CT_MEMORY.
27596 * ira.c (ira_setup_alts): Ditto.
27597 * lra-constraints.c (process_alt_operands): Ditto.
27598 * recog.c (asm_operand_ok): Ditto.
27599 * reload.c (find_reloads): Ditto.
27601 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27603 * config/aarch64/aarch64-protos.h
27604 (cpu_addrcost_table::post_modify_ld3_st3): New member variable.
27605 (cpu_addrcost_table::post_modify_ld4_st4): Likewise.
27606 * config/aarch64/aarch64.c (generic_addrcost_table): Update
27607 accordingly, using the same costs as for post_modify.
27608 (exynosm1_addrcost_table, xgene1_addrcost_table): Likewise.
27609 (thunderx2t99_addrcost_table, thunderx3t110_addrcost_table):
27610 (tsv110_addrcost_table, qdf24xx_addrcost_table): Likewise.
27611 (a64fx_addrcost_table): Likewise.
27612 (neoversev1_addrcost_table): New.
27613 (neoversev1_tunings): Use neoversev1_addrcost_table.
27614 (aarch64_address_cost): Use the new post_modify costs for CImode
27617 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27619 * config/aarch64/aarch64.opt
27620 (-param=aarch64-loop-vect-issue-rate-niters=): New parameter.
27621 * doc/invoke.texi: Document it.
27622 * config/aarch64/aarch64-protos.h (aarch64_base_vec_issue_info)
27623 (aarch64_scalar_vec_issue_info, aarch64_simd_vec_issue_info)
27624 (aarch64_advsimd_vec_issue_info, aarch64_sve_vec_issue_info)
27625 (aarch64_vec_issue_info): New structures.
27626 (cpu_vector_cost): Write comments above the variables rather
27628 (cpu_vector_cost::issue_info): New member variable.
27629 * config/aarch64/aarch64.c: Include gimple-pretty-print.h
27630 and tree-ssa-loop-niter.h.
27631 (generic_vector_cost, a64fx_vector_cost, qdf24xx_vector_cost)
27632 (thunderx_vector_cost, tsv110_vector_cost, cortexa57_vector_cost)
27633 (exynosm1_vector_cost, xgene1_vector_cost, thunderx2t99_vector_cost)
27634 (thunderx3t110_vector_cost): Initialize issue_info to null.
27635 (neoversev1_scalar_issue_info, neoversev1_advsimd_issue_info)
27636 (neoversev1_sve_issue_info, neoversev1_vec_issue_info): New structures.
27637 (neoversev1_vector_cost): Use them.
27638 (aarch64_vec_op_count, aarch64_sve_op_count): New structures.
27639 (aarch64_vector_costs::saw_sve_only_op): New member variable.
27640 (aarch64_vector_costs::num_vector_iterations): Likewise.
27641 (aarch64_vector_costs::scalar_ops): Likewise.
27642 (aarch64_vector_costs::advsimd_ops): Likewise.
27643 (aarch64_vector_costs::sve_ops): Likewise.
27644 (aarch64_vector_costs::seen_loads): Likewise.
27645 (aarch64_simd_vec_costs_for_flags): New function.
27646 (aarch64_analyze_loop_vinfo): Initialize num_vector_iterations.
27647 Count the number of predicate operations required by SVE WHILE
27649 (aarch64_comparison_type, aarch64_multiply_add_p): New functions.
27650 (aarch64_sve_only_stmt_p, aarch64_in_loop_reduction_latency): Likewise.
27651 (aarch64_count_ops): Likewise.
27652 (aarch64_add_stmt_cost): Record whether see an SVE operation
27653 that cannot currently be implementing using Advanced SIMD.
27654 Record issue information about the scalar, Advanced SIMD
27655 and (where relevant) SVE versions of a loop.
27656 (aarch64_vec_op_count::dump): New function.
27657 (aarch64_sve_op_count::dump): Likewise.
27658 (aarch64_estimate_min_cycles_per_iter): Likewise.
27659 (aarch64_adjust_body_cost): If issue information is available,
27660 try to compare the issue rates of the various loop implementations
27661 and increase or decrease the vector body cost accordingly.
27663 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27665 * config/aarch64/aarch64.c (aarch64_detect_vector_stmt_subtype):
27666 Assume a zero cost for induction phis.
27668 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27670 * config/aarch64/aarch64.c (aarch64_embedded_comparison_type): New
27672 (aarch64_adjust_stmt_cost): Add the costs of embedded scalar and
27673 vector comparisons.
27675 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27677 * config/aarch64/aarch64.c (aarch64_detect_scalar_stmt_subtype):
27679 (aarch64_add_stmt_cost): Call it.
27681 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27683 * config/aarch64/aarch64-tuning-flags.def (matched_vector_throughput):
27684 New tuning parameter.
27685 * config/aarch64/aarch64.c (neoversev1_tunings): Use it.
27686 (aarch64_estimated_sve_vq): New function.
27687 (aarch64_vector_costs::analyzed_vinfo): New member variable.
27688 (aarch64_vector_costs::is_loop): Likewise.
27689 (aarch64_vector_costs::unrolled_advsimd_niters): Likewise.
27690 (aarch64_vector_costs::unrolled_advsimd_stmts): Likewise.
27691 (aarch64_record_potential_advsimd_unrolling): New function.
27692 (aarch64_analyze_loop_vinfo, aarch64_analyze_bb_vinfo): Likewise.
27693 (aarch64_add_stmt_cost): Call aarch64_analyze_loop_vinfo or
27694 aarch64_analyze_bb_vinfo on the first use of a costs structure.
27695 Detect whether we're vectorizing a loop for SVE that might be
27696 completely unrolled if it used Advanced SIMD instead.
27697 (aarch64_adjust_body_cost_for_latency): New function.
27698 (aarch64_finish_cost): Call it.
27700 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27702 * config/aarch64/aarch64.c (aarch64_vector_costs): New structure.
27703 (aarch64_init_cost): New function.
27704 (aarch64_add_stmt_cost): Use aarch64_vector_costs instead of
27705 the default unsigned[3].
27706 (aarch64_finish_cost, aarch64_destroy_cost_data): New functions.
27707 (TARGET_VECTORIZE_INIT_COST): Override.
27708 (TARGET_VECTORIZE_FINISH_COST): Likewise.
27709 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
27711 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27713 * config/aarch64/aarch64.c (neoversev1_advsimd_vector_cost)
27714 (neoversev1_sve_vector_cost): New cost structures.
27715 (neoversev1_vector_cost): Likewise.
27716 (neoversev1_tunings): Use them. Enable use_new_vector_costs.
27718 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27720 * config/aarch64/aarch64-protos.h
27721 (sve_vec_cost::scatter_store_elt_cost): New member variable.
27722 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
27723 accordingly, taking the cost from the cost of a scalar_store.
27724 (a64fx_sve_vector_cost): Likewise.
27725 (aarch64_detect_vector_stmt_subtype): Detect scatter stores.
27727 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27729 * config/aarch64/aarch64-protos.h
27730 (simd_vec_cost::store_elt_extra_cost): New member variable.
27731 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
27732 accordingly, using the vec_to_scalar cost for the new field.
27733 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
27734 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
27735 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
27736 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
27737 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
27738 (thunderx3t110_advsimd_vector_cost): Likewise.
27739 (aarch64_detect_vector_stmt_subtype): Detect single-element stores.
27741 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27743 * config/aarch64/aarch64-protos.h (simd_vec_cost::ld2_st2_permute_cost)
27744 (simd_vec_cost::ld3_st3_permute_cost): New member variables.
27745 (simd_vec_cost::ld4_st4_permute_cost): Likewise.
27746 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
27747 accordingly, using zero for the new costs.
27748 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
27749 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
27750 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
27751 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
27752 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
27753 (thunderx3t110_advsimd_vector_cost): Likewise.
27754 (aarch64_ld234_st234_vectors): New function.
27755 (aarch64_adjust_stmt_cost): Likewise.
27756 (aarch64_add_stmt_cost): Call aarch64_adjust_stmt_cost if using
27757 the new vector costs.
27759 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27761 * config/aarch64/aarch64-protos.h (sve_vec_cost): Turn into a
27762 derived class of simd_vec_cost. Add information about CLAST[AB]
27763 and FADDA instructions.
27764 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
27765 accordingly, using the vec_to_scalar costs for the new fields.
27766 (a64fx_sve_vector_cost): Likewise.
27767 (aarch64_reduc_type): New function.
27768 (aarch64_sve_in_loop_reduction_latency): Likewise.
27769 (aarch64_detect_vector_stmt_subtype): Take a vinfo parameter.
27770 Use aarch64_sve_in_loop_reduction_latency to handle SVE reductions
27771 that occur in the loop body.
27772 (aarch64_add_stmt_cost): Update call accordingly.
27774 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27776 * config/aarch64/aarch64-tuning-flags.def (use_new_vector_costs):
27778 * config/aarch64/aarch64-protos.h (simd_vec_cost): Put comments
27779 above the fields rather than to the right.
27780 (simd_vec_cost::reduc_i8_cost): New member variable.
27781 (simd_vec_cost::reduc_i16_cost): Likewise.
27782 (simd_vec_cost::reduc_i32_cost): Likewise.
27783 (simd_vec_cost::reduc_i64_cost): Likewise.
27784 (simd_vec_cost::reduc_f16_cost): Likewise.
27785 (simd_vec_cost::reduc_f32_cost): Likewise.
27786 (simd_vec_cost::reduc_f64_cost): Likewise.
27787 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
27788 accordingly, using the vec_to_scalar_cost for the new fields.
27789 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
27790 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
27791 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
27792 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
27793 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
27794 (thunderx3t110_advsimd_vector_cost): Likewise.
27795 (aarch64_use_new_vector_costs_p): New function.
27796 (aarch64_simd_vec_costs): New function, split out from...
27797 (aarch64_builtin_vectorization_cost): ...here.
27798 (aarch64_is_reduction): New function.
27799 (aarch64_detect_vector_stmt_subtype): Likewise.
27800 (aarch64_add_stmt_cost): Call aarch64_detect_vector_stmt_subtype if
27801 using the new vector costs.
27803 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
27806 * tree-emutls.c (get_emutls_init_templ_addr): Mark initializer of weak
27807 TLS declarations as public.
27809 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
27811 * config/aarch64/aarch64-d.c (IN_TARGET_CODE): Define.
27812 * config/arm/arm-d.c (IN_TARGET_CODE): Likewise.
27813 * config/i386/i386-d.c (IN_TARGET_CODE): Likewise.
27814 * config/mips/mips-d.c (IN_TARGET_CODE): Likewise.
27815 * config/pa/pa-d.c (IN_TARGET_CODE): Likewise.
27816 * config/riscv/riscv-d.c (IN_TARGET_CODE): Likewise.
27817 * config/rs6000/rs6000-d.c (IN_TARGET_CODE): Likewise.
27818 * config/s390/s390-d.c (IN_TARGET_CODE): Likewise.
27819 * config/sparc/sparc-d.c (IN_TARGET_CODE): Likewise.
27821 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
27824 * config.gcc (*-*-cygwin*): Add winnt-d.o
27825 (*-*-mingw*): Likewise.
27826 * config/i386/cygwin.h (EXTRA_TARGET_D_OS_VERSIONS): New macro.
27827 * config/i386/mingw32.h (EXTRA_TARGET_D_OS_VERSIONS): Likewise.
27828 * config/i386/t-cygming: Add winnt-d.o.
27829 * config/i386/winnt-d.c: New file.
27831 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
27833 * config/freebsd-d.c: Include memmodel.h.
27835 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
27838 * config.gcc (*-*-openbsd*): Add openbsd-d.o.
27839 * config/t-openbsd: Add openbsd-d.o.
27840 * config/openbsd-d.c: New file.
27842 2021-03-25 Stam Markianos-Wright <stam.markianos-wright@arm.com>
27844 PR tree-optimization/96974
27845 * tree-vect-stmts.c (vect_get_vector_types_for_stmt): Replace assert
27846 with graceful exit.
27848 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
27851 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
27855 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
27856 always_inline in system headers.
27858 2021-03-25 Kewen Lin <linkw@linux.ibm.com>
27860 * tree-vect-loop.c (vect_model_reduction_cost): Init inside_cost.
27862 2021-03-25 Jakub Jelinek <jakub@redhat.com>
27865 * tree-core.h (enum operand_equal_flag): Add OEP_ADDRESS_OF_SAME_FIELD.
27866 * fold-const.c (operand_compare::operand_equal_p): Don't compare
27867 field offsets if OEP_ADDRESS_OF_SAME_FIELD.
27869 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
27873 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
27874 always_inline in system headers.
27876 2021-03-25 Richard Biener <rguenther@suse.de>
27878 PR tree-optimization/99746
27879 * tree-vect-slp-patterns.c (complex_pattern::build): Do not mark
27880 the scalar stmt as patterned. Instead set up required things
27883 2021-03-25 Xionghu Luo <luoxhu@linux.ibm.com>
27885 * config/rs6000/rs6000.c (power8_costs): Change l2 cache
27888 2021-03-24 Martin Liska <mliska@suse.cz>
27891 * common/config/i386/i386-common.c (ARRAY_SIZE): Fix off-by-one
27893 * config/i386/i386-options.c (ix86_option_override_internal):
27894 Add run-time assert.
27896 2021-03-24 Martin Jambor <mjambor@suse.cz>
27899 * ipa-cp.c (initialize_node_lattices): Mark as bottom all
27900 parameters with unknown type.
27901 (ipacp_value_safe_for_type): New function.
27902 (propagate_vals_across_arith_jfunc): Verify that the constant type
27903 can be used for a type of the formal parameter.
27904 (propagate_vals_across_ancestor): Likewise.
27905 (propagate_scalar_across_jump_function): Likewise. Pass the type
27906 also to propagate_vals_across_ancestor.
27908 2021-03-24 Christophe Lyon <christophe.lyon@linaro.org>
27911 * config/arm/mve.md (movmisalign<mode>_mve_store): Use Ux
27913 (movmisalign<mode>_mve_load): Likewise.
27915 2021-03-24 Jakub Jelinek <jakub@redhat.com>
27918 * config/arm/vec-common.md (one_cmpl<mode>2, neg<mode>2,
27919 movmisalign<mode>): Disable expanders for TARGET_REALLY_IWMMXT.
27921 2021-03-24 Alexandre Oliva <oliva@adacore.com>
27923 * doc/sourcebuild.texi (sysconf): New effective target.
27925 2021-03-24 Alexandre Oliva <oliva@adacore.com>
27927 * config/i386/predicates.md (reg_or_const_vec_operand): New.
27928 * config/i386/sse.md (ssse3_pshufbv8qi3): Add an expander for
27929 the now *-prefixed insn_and_split, turn the splitter const vec
27930 into an input for the insn, making it an ignored immediate for
27931 non-split cases, and loaded into the scratch register
27934 2021-03-23 Vladimir N. Makarov <vmakarov@redhat.com>
27937 * config/aarch64/constraints.md (Utq, UOb, UOh, UOw, UOd, UOty):
27938 Use define_relaxed_memory_constraint for them.
27940 2021-03-23 Iain Sandoe <iain@sandoe.co.uk>
27943 * config/host-darwin.c (darwin_gt_pch_use_address): Add a
27944 colon to the diagnostic message.
27946 2021-03-23 Ilya Leoshkevich <iii@linux.ibm.com>
27948 * fwprop.c (fwprop_propagation::fwprop_propagation): Look at
27950 (try_fwprop_subst_note): Use set_info instead of insn_info.
27951 (try_fwprop_subst_pattern): Likewise.
27952 (try_fwprop_subst_notes): Likewise.
27953 (try_fwprop_subst): Likewise.
27954 (forward_propagate_subreg): Likewise.
27955 (forward_propagate_and_simplify): Likewise.
27956 (forward_propagate_into): Likewise.
27957 * rtl-ssa/accesses.h (set_info::single_nondebug_use) New
27959 (set_info::single_nondebug_insn_use): Likewise.
27960 (set_info::single_phi_use): Likewise.
27961 * rtl-ssa/member-fns.inl (set_info::single_nondebug_use) New
27963 (set_info::single_nondebug_insn_use): Likewise.
27964 (set_info::single_phi_use): Likewise.
27966 2021-03-23 Christophe Lyon <christophe.lyon@linaro.org>
27968 * doc/sourcebuild.texi (arm_dsp_ok, arm_dsp): Document.
27970 2021-03-23 Jakub Jelinek <jakub@redhat.com>
27973 * config/aarch64/aarch64.c (aarch64_add_offset): Tell
27974 expand_mult to perform an unsigned rather than a signed
27977 2021-03-23 H.J. Lu <hjl.tools@gmail.com>
27980 * config/i386/cpuid.h (__cpuid): Add __volatile__.
27981 (__cpuid_count): Likewise.
27983 2021-03-23 Richard Biener <rguenther@suse.de>
27985 PR tree-optimization/99721
27986 * tree-vect-slp.c (vect_slp_analyze_node_operations):
27987 Make sure we can schedule the node.
27989 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
27991 * config/riscv/riscv.c (riscv_subword): Take endianness into
27992 account when calculating the byte offset.
27994 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
27996 * config/riscv/predicates.md (subreg_lowpart_operator): New predicate
27997 * config/riscv/riscv.md (*addsi3_extended2, *subsi3_extended2)
27998 (*negsi2_extended2, *mulsi3_extended2, *<optab>si3_mask)
27999 (*<optab>si3_mask_1, *<optab>di3_mask, *<optab>di3_mask_1)
28000 (*<optab>si3_extend_mask, *<optab>si3_extend_mask_1): Use
28001 new predicate "subreg_lowpart_operator"
28003 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28005 * config/riscv/riscv.c (riscv_swap_instruction): New function
28006 to byteswap an SImode rtx containing an instruction.
28007 (riscv_trampoline_init): Byteswap the generated instructions
28010 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28012 * common/config/riscv/riscv-common.c
28013 (TARGET_DEFAULT_TARGET_FLAGS): Set default endianness.
28014 * config.gcc (riscv32be-*, riscv64be-*): Set
28015 TARGET_BIG_ENDIAN_DEFAULT to 1.
28016 * config/riscv/elf.h (LINK_SPEC): Change -melf* value
28017 depending on default endianness.
28018 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
28019 * config/riscv/linux.h (LINK_SPEC): Likewise.
28020 * config/riscv/riscv.c (TARGET_DEFAULT_TARGET_FLAGS): Set
28021 default endianness.
28022 * config/riscv/riscv.h (DEFAULT_ENDIAN_SPEC): New macro.
28024 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28026 * config/riscv/elf.h (LINK_SPEC): Pass linker endianness flag.
28027 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
28028 * config/riscv/linux.h (LINK_SPEC): Likewise.
28029 * config/riscv/riscv.h (ASM_SPEC): Pass -mbig-endian and
28031 (BYTES_BIG_ENDIAN): Handle big endian.
28032 (WORDS_BIG_ENDIAN): Define to BYTES_BIG_ENDIAN.
28033 * config/riscv/riscv.opt (-mbig-endian, -mlittle-endian): New
28035 * doc/invoke.texi (-mbig-endian, -mlittle-endian): Document.
28037 2021-03-23 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
28039 * regcprop.c (find_oldest_value_reg): Ask target whether
28040 different mode is fine for replacement register.
28042 2021-03-23 Aldy Hernandez <aldyh@redhat.com>
28044 PR tree-optimization/99296
28045 * value-range.cc (irange::irange_set_1bit_anti_range): New.
28046 (irange::irange_set_anti_range): Call irange_set_1bit_anti_range
28047 * value-range.h (irange::irange_set_1bit_anti_range): New.
28049 2021-03-22 Vladimir N. Makarov <vmakarov@redhat.com>
28052 * config/aarch64/constraints.md (UtQ): Use
28053 define_relaxed_memory_constraint for it.
28054 * doc/md.texi (define_relaxed_memory_constraint): Describe it.
28055 * genoutput.c (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28056 * genpreds.c (constraint_data): Add bitfield is_relaxed_memory.
28057 (have_relaxed_memory_constraints): New static var.
28058 (relaxed_memory_start, relaxed_memory_end): Ditto.
28059 (add_constraint): Add arg is_relaxed_memory. Check name for
28060 relaxed memory. Set up is_relaxed_memory in constraint_data and
28061 have_relaxed_memory_constraints. Adjust calls.
28062 (choose_enum_order): Process relaxed memory.
28063 (write_tm_preds_h): Ditto.
28064 (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28065 * gensupport.c (process_rtx): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28066 * ira-costs.c (record_reg_classes): Process CT_RELAXED_MEMORY.
28067 * ira-lives.c (single_reg_class): Use
28068 insn_extra_relaxed_memory_constraint.
28069 * ira.c (ira_setup_alts): CT_RELAXED_MEMORY.
28070 * lra-constraints.c (valid_address_p): Use
28071 insn_extra_relaxed_memory_constraint instead of other memory
28073 (process_alt_operands): Process CT_RELAXED_MEMORY.
28074 (curr_insn_transform): Use insn_extra_relaxed_memory_constraint.
28075 * recog.c (asm_operand_ok, preprocess_constraints): Process
28077 * reload.c (find_reloads): Ditto.
28078 * rtl.def (DEFINE_RELAXED_MEMORY_CONSTRAINT): New.
28079 * stmt.c (parse_input_constraint): Use
28080 insn_extra_relaxed_memory_constraint.
28082 2021-03-22 Segher Boessenkool <segher@kernel.crashing.org>
28085 * ubsan.c (ubsan_instrument_float_cast): Don't test for unordered if
28088 2021-03-22 Alex Coplan <alex.coplan@arm.com>
28091 * config/arm/arm-protos.h (neon_make_constant): Add generate
28092 argument to guard emitting insns, default to true.
28093 * config/arm/arm.c (arm_legitimate_constant_p_1): Reject
28094 CONST_VECTORs which neon_make_constant can't handle.
28095 (neon_vdup_constant): Add generate argument, avoid emitting
28096 insns if it's not set.
28097 (neon_make_constant): Plumb new generate argument through.
28098 * config/arm/constraints.md (Ui): New. Use it...
28099 * config/arm/mve.md (*mve_mov<mode>): ... here.
28100 * config/arm/vec-common.md (movv8hf): Use neon_make_constant to
28101 synthesize constants.
28103 2021-03-22 Richard Biener <rguenther@suse.de>
28105 * debug.h: Add deprecation warning.
28107 2021-03-22 Richard Biener <rguenther@suse.de>
28109 PR tree-optimization/99694
28110 * tree-ssa-sccvn.c (visit_phi): Ignore edges with the
28113 2021-03-22 Kito Cheng <kito.cheng@sifive.com>
28116 * config/riscv/riscv.c (riscv_expand_block_move): Get RTL value
28117 after type checking.
28119 2021-03-22 Jakub Jelinek <jakub@redhat.com>
28123 * dwarf2out.c (get_full_len): Use get_precision rather than
28125 (add_const_value_attribute): Make sure add_AT_wide argument has
28126 precision prec rather than some very wide one.
28128 2021-03-22 Kewen Lin <linkw@linux.ibm.com>
28130 * config/rs6000/rs6000.md (*rotldi3_insert_sf,
28131 *mov<SFDF:mode><SFDF2:mode>cc_p9, floatsi<mode>2_lfiwax,
28132 floatsi<mode>2_lfiwax_mem, floatunssi<mode>2_lfiwzx,
28133 floatunssi<mode>2_lfiwzx_mem, *floatsidf2_internal,
28134 *floatunssidf2_internal, fix_trunc<mode>si2_stfiwx,
28135 fix_trunc<mode>si2_internal, fixuns_trunc<mode>si2_stfiwx,
28136 *round32<mode>2_fprs, *roundu32<mode>2_fprs,
28137 *fix_trunc<mode>si2_internal): Fix empty split condition.
28138 * config/rs6000/vsx.md (*vsx_le_undo_permute_<mode>,
28139 vsx_reduc_<VEC_reduc_name>_v2df, vsx_reduc_<VEC_reduc_name>_v4sf,
28140 *vsx_reduc_<VEC_reduc_name>_v2df_scalar,
28141 *vsx_reduc_<VEC_reduc_name>_v4sf_scalar): Likewise.
28143 2021-03-22 Xionghu Luo <luoxhu@linux.ibm.com>
28146 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
28147 Convert idx to DImode.
28148 (rs6000_expand_vector_set_var_p8): Likewise.
28150 2021-03-21 Jakub Jelinek <jakub@redhat.com>
28153 * dwarf2out.c (insert_float): Change return type from void to
28154 unsigned, handle GET_MODE_SIZE (mode) == 2 and return element size.
28155 (mem_loc_descriptor, loc_descriptor, add_const_value_attribute):
28158 2021-03-20 H.J. Lu <hjl.tools@gmail.com>
28161 * config/i386/i386.c (construct_container): Check cfun != NULL
28162 before accessing silent_p.
28164 2021-03-20 Ahamed Husni <ahamedhusni73@gmail.com>
28166 * asan.c: Fix typos in comments.
28168 2021-03-20 Vladimir N. Makarov <vmakarov@redhat.com>
28170 PR rtl-optimization/99680
28171 * lra-constraints.c (skip_contraint_modifiers): Rename to skip_constraint_modifiers.
28172 (process_address_1): Check empty constraint before using
28175 2021-03-19 Pat Haugen <pthaugen@linux.ibm.com>
28177 * config/rs6000/rs6000.c (power10_cost): New.
28178 (rs6000_option_override_internal): Set Power10 costs.
28179 (rs6000_issue_rate): Set Power10 issue rate.
28180 * config/rs6000/power10.md: Rewrite for Power10.
28182 2021-03-19 Vladimir N. Makarov <vmakarov@redhat.com>
28185 * lra-constraints.c (process_address_1): Don't use unknown
28186 constraint for address constraint.
28188 2021-03-19 Iain Sandoe <iain@sandoe.co.uk>
28191 * config.gcc (powerpc-*-darwin8): Delete the reference to
28192 the now removed darwin8.h.
28194 2021-03-19 Olivier Hainque <hainque@adacore.com>
28197 * config/vxworksae.h (VX_CPU_PREFIX): Define.
28199 2021-03-19 John David Anglin <danglin@gcc.gnu.org>
28201 * config/pa/pa.c (import_milli): Use memcpy instead of strncpy.
28203 2021-03-19 Tamar Christina <tamar.christina@arm.com>
28205 PR tree-optimization/99656
28206 * tree-vect-slp-patterns.c (linear_loads_p,
28207 complex_add_pattern::matches, is_eq_or_top,
28208 vect_validate_multiplication, complex_mul_pattern::matches,
28209 complex_fms_pattern::matches): Remove complex_perm_kinds_t.
28210 * tree-vectorizer.h: (complex_load_perm_t): Removed.
28211 (slp_tree_to_load_perm_map_t): Use complex_perm_kinds_t instead of
28212 complex_load_perm_t.
28214 2021-03-19 H.J. Lu <hjl.tools@gmail.com>
28217 * config/i386/i386-options.c (ix86_init_machine_status): Set
28219 * config/i386/i386.c (init_cumulative_args): Set silent_p to
28221 (construct_container): Return early for return and argument
28222 errors if silent_p is true.
28223 * config/i386/i386.h (machine_function): Add silent_p.
28225 2021-03-19 Jakub Jelinek <jakub@redhat.com>
28228 * config/arm/constraints.md (Ds): New constraint.
28229 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): Use w,Ds
28230 constraint instead of w,Dm.
28232 2021-03-19 Andrew Stubbs <ams@codesourcery.com>
28234 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Fix quotes
28237 2021-03-19 Eric Botcazou <ebotcazou@adacore.com>
28239 PR middle-end/99641
28240 * fold-const.c (native_encode_initializer) <CONSTRUCTOR>: For an
28241 array type, do the computation of the current position in sizetype.
28243 2021-03-18 Vladimir N. Makarov <vmakarov@redhat.com>
28246 * lra-constraints.c (process_address_1): Use lookup_constraint
28247 only for a single constraint.
28249 2021-03-18 Martin Sebor <msebor@redhat.com>
28251 PR middle-end/99502
28252 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): Rename...
28253 (inbounds_memaccess_p): ...to this. Check the ending offset of
28254 the accessed member.
28256 2021-03-18 Andrew Stubbs <ams@codesourcery.com>
28258 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Add %< and
28259 %> quote markers to error messages.
28260 (gcn_goacc_validate_dims): Likewise.
28261 (gcn_conditional_register_usage): Remove exclaimation mark from error
28263 (gcn_vectorize_vec_perm_const): Ensure perm is fully uninitialized.
28265 2021-03-18 Jan Hubicka <hubicka@ucw.cz>
28267 * config/i386/x86-tune-costs.h (struct processor_costs): Fix costs of
28270 2021-03-18 Sinan Lin <sinan@isrc.iscas.ac.cn>
28271 Kito Cheng <kito.cheng@sifive.com>
28273 * config/riscv/riscv.c (riscv_block_move_straight): Change type
28274 to unsigned HOST_WIDE_INT for parameter and local variable with
28275 HOST_WIDE_INT type.
28276 (riscv_adjust_block_mem): Ditto.
28277 (riscv_block_move_loop): Ditto.
28278 (riscv_expand_block_move): Ditto.
28280 2021-03-18 Nick Clifton <nickc@redhat.com>
28282 * config/v850/v850.c (construct_restore_jr): Increase static
28284 (construct_save_jarl): Likewise.
28285 * config/v850/v850.h (DWARF2_DEBUGGING_INFO): Define.
28287 2021-03-18 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28289 * config/aarch64/aarch64.c (aarch64_adjust_generic_arch_tuning): Define.
28290 (aarch64_override_options_internal): Use it.
28291 (generic_tunings): Add AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS to
28294 2021-03-17 Sandra Loosemore <sandra@codesourcery.com>
28296 * config/nios2/nios2.c (nios2_custom_check_insns): Clean up
28297 error message format issues.
28298 (nios2_option_override): Likewise.
28299 (nios2_expand_fpu_builtin): Likewise.
28300 (nios2_init_custom_builtins): Adjust to avoid bogus strncpy
28301 truncation warning.
28302 (nios2_expand_custom_builtin): More error message format fixes.
28303 (nios2_expand_rdwrctl_builtin): Likewise.
28304 (nios2_expand_rdprs_builtin): Likewise.
28305 (nios2_expand_eni_builtin): Likewise.
28306 (nios2_expand_builtin): Likewise.
28307 (nios2_register_custom_code): Likewise.
28308 (nios2_valid_target_attribute_rec): Likewise.
28309 (nios2_add_insn_asm): Fix uninitialized variable warning.
28311 2021-03-17 Jan Hubicka <jh@suse.cz>
28313 * config/i386/x86-tune-costs.h (struct processor_costs): Update costs
28314 of gather to match reality.
28315 * config/i386/x86-tune.def (X86_TUNE_USE_GATHER): Enable for znver3.
28317 2021-03-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28319 * config/aarch64/aarch64-builtins.c (aarch64_expand_rng_builtin): Use EQ
28320 to compare against CC_REG rather than NE.
28322 2021-03-17 H.J. Lu <hjl.tools@gmail.com>
28325 * config/i386/i386.c (ix86_force_load_from_GOT_p): Support
28326 inline assembly statements.
28327 (ix86_print_operand): Update 'P' handling for -fno-plt.
28329 2021-03-17 Tamar Christina <tamar.christina@arm.com>
28332 * config/aarch64/aarch64.c
28333 (aarch64_simd_clone_compute_vecsize_and_simdlen): Remove unused var.
28335 2021-03-16 Segher Boessenkool <segher@kernel.crashing.org>
28338 * config/rs6000/predicates.md (branch_comparison_operator): Allow
28339 ordered and unordered for CCFPmode, if flag_finite_math_only.
28341 2021-03-16 Jakub Jelinek <jakub@redhat.com>
28344 * config/i386/i386-expand.c (ix86_split_lea_for_addr): Emit a MULT
28345 rather than ASHIFT.
28346 * config/i386/i386.md (mult by 1248 into ashift): New splitter.
28348 2021-03-16 Martin Liska <mliska@suse.cz>
28351 * optc-save-gen.awk: Add flag_ipa_ra to exceptions for
28352 cl_optimization_compare function.
28354 2021-03-16 Ilya Leoshkevich <iii@linux.ibm.com>
28356 * config/s390/s390.c (f_constraint_p): Treat "fv" constraints
28359 2021-03-16 Jakub Jelinek <jakub@redhat.com>
28362 * config/i386/i386.h (struct machine_function): Add
28363 has_explicit_vzeroupper bitfield.
28364 * config/i386/i386-expand.c (ix86_expand_builtin): Set
28365 cfun->machine->has_explicit_vzeroupper when expanding
28366 IX86_BUILTIN_VZEROUPPER.
28367 * config/i386/i386-features.c (rest_of_handle_insert_vzeroupper):
28368 Do the mode switching only when TARGET_VZEROUPPER, expensive
28369 optimizations turned on and not optimizing for size.
28370 (pass_insert_vzeroupper::gate): Enable even when
28371 cfun->machine->has_explicit_vzeroupper is set.
28373 2021-03-16 Jakub Jelinek <jakub@redhat.com>
28376 * config/aarch64/aarch64.c
28377 (aarch64_simd_clone_compute_vecsize_and_simdlen): If not a function
28378 definition, walk TYPE_ARG_TYPES list if non-NULL for argument types
28379 instead of DECL_ARGUMENTS. Ignore types for uniform arguments.
28381 2021-03-15 Richard Biener <rguenther@suse.de>
28383 PR tree-optimization/98834
28384 * tree-ssa-sccvn.c (vn_reference_lookup_3): Handle missing
28385 subsetting by truncating the access size.
28387 2021-03-15 Jan Hubicka <hubicka@ucw.cz>
28389 * config/i386/i386-options.c (processor_cost_table): Add znver3_cost.
28390 * config/i386/x86-tune-costs.h (znver3_cost): New gobal variable; copy
28393 2021-03-15 Martin Liska <mliska@suse.cz>
28395 * spellcheck.c: Add missing comma in initialization.
28397 2021-03-14 Uroš Bizjak <ubizjak@gmail.com>
28399 * config/i386/sse.md (*vec_extract<mode>): Merge alternative 0 with
28400 alternative 2 and alternative 1 with alternative 3 using
28401 YW register constraint.
28402 (*vec_extract<PEXTR_MODE12:mode>_zext): Merge alternatives
28403 using YW register constraint.
28404 (*vec_extractv16qi_zext): Ditto.
28405 (*vec_extractv4si): Merge alternatives 4 and 5
28406 using Yw register constraint.
28407 (*ssse3_palignr<mode>_perm): Use Yw instead of v for alternative 3.
28409 2021-03-13 Martin Sebor <msebor@redhat.com>
28411 PR tree-optimization/99489
28412 * builtins.c (gimple_call_alloc_size): Fail gracefully when argument
28413 is not a call statement.
28415 2021-03-13 Jakub Jelinek <jakub@redhat.com>
28417 PR tree-optimization/99544
28418 * match.pd (X + (X << C) -> X * (1 + (1 << C))): Don't simplify
28419 if for vector types multiplication can't be done in type's mode.
28421 2021-03-12 Eric Botcazou <ebotcazou@adacore.com>
28424 * config/sparc/constraints.md (w): Rename to...
28425 (W): ... this and ditch previous implementation.
28426 * config/sparc/sparc.md (*movdi_insn_sp64): Replace W with m.
28427 (*movdf_insn_sp64): Likewise.
28428 (*mov<VM64:mode>_insn_sp64): Likewise.
28429 * config/sparc/sync.md (*atomic_compare_and_swap<mode>_1): Replace
28431 (atomic_compare_and_swap_leon3_1): Likewise.
28432 (*atomic_compare_and_swapdi_v8plus): Likewise.
28433 * config/sparc/sparc.c (memory_ok_for_ldd): Remove useless test on
28434 architecture and add missing address validity check during LRA.
28436 2021-03-12 Tobias Burnus <tobias@codesourcery.com>
28439 * gimplify.c (omp_add_variable): Handle NULL_TREE as size
28440 occuring for assumed-size arrays in use_device_{ptr,addr}.
28442 2021-03-12 Jakub Jelinek <jakub@redhat.com>
28445 * config/i386/constraints.md (YW): New internal constraint.
28446 * config/i386/sse.md (v_Yw): Add V4TI, V2TI, V1TI and TI cases.
28447 (*<sse2_avx2>_<insn><mode>3<mask_name>,
28448 *<sse2_avx2>_uavg<mode>3<mask_name>, *abs<mode>2,
28449 *<s>mul<mode>3_highpart<mask_name>): Use <v_Yw> instead of v in
28451 (<sse2_avx2>_psadbw): Use YW instead of v in constraints.
28452 (*avx2_pmaddwd, *sse2_pmaddwd, *<code>v8hi3, *<code>v16qi3,
28453 avx2_pmaddubsw256, ssse3_pmaddubsw128): Merge last two alternatives
28454 into one, use Yw instead of former x,v.
28455 (ashr<mode>3, <insn><mode>3): Use <v_Yw> instead of x in constraints of
28456 the last alternative.
28457 (<sse2_avx2>_packsswb<mask_name>, <sse2_avx2>_packssdw<mask_name>,
28458 <sse2_avx2>_packuswb<mask_name>, <sse4_1_avx2>_packusdw<mask_name>,
28459 *<ssse3_avx2>_pmulhrsw<mode>3<mask_name>, <ssse3_avx2>_palignr<mode>,
28460 <ssse3_avx2>_pshufb<mode>3<mask_name>): Merge last two alternatives
28461 into one, use <v_Yw> instead of former x,v.
28462 (avx2_interleave_highv32qi<mask_name>,
28463 vec_interleave_highv16qi<mask_name>): Use Yw instead of v in
28464 constraints. Add && <mask_avx512bw_condition> to condition.
28465 (avx2_interleave_lowv32qi<mask_name>,
28466 vec_interleave_lowv16qi<mask_name>,
28467 avx2_interleave_highv16hi<mask_name>,
28468 vec_interleave_highv8hi<mask_name>,
28469 avx2_interleave_lowv16hi<mask_name>, vec_interleave_lowv8hi<mask_name>,
28470 avx2_pshuflw_1<mask_name>, sse2_pshuflw_1<mask_name>,
28471 avx2_pshufhw_1<mask_name>, sse2_pshufhw_1<mask_name>,
28472 avx2_<code>v16qiv16hi2<mask_name>, sse4_1_<code>v8qiv8hi2<mask_name>,
28473 *sse4_1_<code>v8qiv8hi2<mask_name>_1, <sse2_avx2>_<insn><mode>3): Use
28474 Yw instead of v in constraints.
28475 * config/i386/mmx.md (Yv_Yw): New define_mode_attr.
28476 (*mmx_<insn><mode>3, mmx_ashr<mode>3, mmx_<insn><mode>3): Use <Yv_Yw>
28477 instead of Yv in constraints.
28478 (*mmx_<insn><mode>3, *mmx_mulv4hi3, *mmx_smulv4hi3_highpart,
28479 *mmx_umulv4hi3_highpart, *mmx_pmaddwd, *mmx_<code>v4hi3,
28480 *mmx_<code>v8qi3, mmx_pack<s_trunsuffix>swb, mmx_packssdw,
28481 mmx_punpckhbw, mmx_punpcklbw, mmx_punpckhwd, mmx_punpcklwd,
28482 *mmx_uavgv8qi3, *mmx_uavgv4hi3, mmx_psadbw): Use Yw instead of Yv in
28484 (*mmx_pinsrw, *mmx_pinsrb, *mmx_pextrw, *mmx_pextrw_zext, *mmx_pextrb,
28485 *mmx_pextrb_zext): Use YW instead of Yv in constraints.
28486 (*mmx_eq<mode>3, mmx_gt<mode>3): Use x instead of Yv in constraints.
28487 (mmx_andnot<mode>3, *mmx_<code><mode>3): Split last alternative into
28488 two, one with just x, another isa avx512vl with v.
28490 2021-03-12 Martin Liska <mliska@suse.cz>
28492 * doc/invoke.texi: Add missing param documentation.
28494 2021-03-11 David Malcolm <dmalcolm@redhat.com>
28497 * Makefile.in (ANALYZER_OBJS): Add analyzer/feasible-graph.o and
28498 analyzer/trimmed-graph.o.
28499 * doc/analyzer.texi (Analyzer Paths): Rewrite description of
28500 feasibility checking to reflect new implementation.
28501 * doc/invoke.texi (-fdump-analyzer-feasibility): Document new
28503 * shortest-paths.h (shortest_paths::get_shortest_distance): New.
28505 2021-03-11 David Malcolm <dmalcolm@redhat.com>
28507 * digraph.cc (selftest::test_shortest_paths): Update
28508 shortest_paths init for new param. Add test of
28509 SPS_TO_GIVEN_TARGET.
28510 * shortest-paths.h (enum shortest_path_sense): New.
28511 (shortest_paths::shortest_paths): Add "sense" param.
28512 Update for renamings. Generalize to use "sense" param.
28513 (shortest_paths::get_shortest_path): Rename param.
28514 (shortest_paths::m_sense): New field.
28515 (shortest_paths::m_prev): Rename...
28516 (shortest_paths::m_best_edge): ...to this.
28517 (shortest_paths::get_shortest_path): Update for renamings.
28518 Conditionalize flipping of path on sense of traversal.
28520 2021-03-11 David Malcolm <dmalcolm@redhat.com>
28522 * digraph.cc (selftest::test_shortest_paths): Add test coverage
28523 for paths from B and C.
28524 * shortest-paths.h (shortest_paths::shortest_paths): Handle
28525 unreachable nodes, rather than asserting.
28527 2021-03-11 David Edelsohn <dje.gcc@gmail.com>
28530 * config/rs6000/rs6000.c (rs6000_xcoff_file_start): Don't create
28531 xcoff_tbss_section_name.
28532 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .lcomm.
28533 * xcoffout.c (xcoff_tbss_section_name): Delete.
28534 * xcoffout.h (xcoff_tbss_section_name): Delete.
28536 2021-03-11 Richard Biener <rguenther@suse.de>
28538 PR tree-optimization/99523
28539 * tree-cfg.c (dump_function_to_file): Dump SSA names
28540 w/o identifier to the decls section as well, not only those
28541 without a VAR_DECL.
28543 2021-03-11 Jakub Jelinek <jakub@redhat.com>
28546 * ipa-icf-gimple.c (func_checker::compare_gimple_call): For internal
28547 function calls with lhs fail if the lhs don't have compatible types.
28549 2021-03-11 Hans-Peter Nilsson <hp@axis.com>
28551 * config/cris/cris.h (HARD_FRAME_POINTER_REGNUM): Define.
28552 Change FRAME_POINTER_REGNUM to correspond to a new faked
28553 register faked_fp, part of GENNONACR_REGS like faked_ap.
28554 (CRIS_FAKED_REGS_CONTENTS): New helper macro.
28555 (FIRST_PSEUDO_REGISTER, FIXED_REGISTERS, CALL_USED_REGISTERS):
28556 (REG_ALLOC_ORDER, REG_CLASS_CONTENTS, REGNO_OK_FOR_BASE_P)
28557 (ELIMINABLE_REGS, REGISTER_NAMES): Adjust accordingly.
28558 * config/cris/cris.md (CRIS_FP_REGNUM): Renumber to new faked
28560 (CRIS_REAL_FP_REGNUM): New constant.
28561 * config/cris/cris.c (cris_reg_saved_in_regsave_area): Check
28562 for HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM.
28563 (cris_initial_elimination_offset): Handle elimination changes
28564 to HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM
28565 and add one from FRAME_POINTER_REGNUM to
28566 HARD_FRAME_POINTER_REGNUM.
28567 (cris_expand_prologue, cris_expand_epilogue): Emit code for
28568 hard_frame_pointer_rtx instead of frame_pointer_rtx.
28570 2021-03-10 David Edelsohn <dje.gcc@gmail.com>
28573 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Add check for DCmode.
28574 * config/rs6000/rs6000.c (rs6000_special_round_type_align): Same.
28576 2021-03-10 Vladimir N. Makarov <vmakarov@redhat.com>
28579 * lra-constraints.c (process_address_1): Don't check unknown
28580 constraint, use X for empty constraint.
28582 2021-03-10 Alex Coplan <alex.coplan@arm.com>
28584 * config/aarch64/aarch64.c (aarch64_vfp_is_call_or_return_candidate):
28585 Fix typo in comment describing "is_ha" argument.
28587 2021-03-10 John David Anglin <danglin@gcc.gnu.org>
28589 * doc/sourcebuild.texi: Document LRA target selector.
28591 2021-03-10 David Malcolm <dmalcolm@redhat.com>
28593 * doc/ux.texi: Add subsection contrasting interactive versus
28594 batch usage of GCC.
28596 2021-03-10 Joel Hutton <joel.hutton@arm.com>
28599 * tree-vect-stmts.c (vectorizable_store): Fix scatter store mask
28601 (vectorizable_load): Fix gather load mask check condition.
28603 2021-03-10 Richard Biener <rguenther@suse.de>
28605 PR tree-optimization/99510
28606 * tree.c (check_aligned_type): Check that the candidate
28607 has TYPE_USER_ALIGN set instead of matching with the
28610 2021-03-10 Eric Botcazou <ebotcazou@adacore.com>
28612 * config/sparc/sparc.c (sparc_regmode_natural_size): Return 4 for
28613 float and vector integer modes only if the mode is not larger.
28615 2021-03-10 Hans-Peter Nilsson <hp@axis.com>
28617 * config/cris/cris.h (DWARF_FRAME_REGISTERS): Define.
28619 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
28621 * ira.c (ira_setup_alts, ira_get_dup_out_num): Process digital
28623 * ira-lives.c (single_reg_class): Ditto.
28625 2021-03-09 Sebastian Huber <sebastian.huber@embedded-brains.de>
28627 * config.gcc (aarch64-*-rtems*): Include general rtems.h after
28628 the architecture-specific rtems.h.
28629 (aarch64-*-rtems*): Likewise.
28630 (arm*-*-rtems*): Likewise.
28631 (epiphany-*-rtems*): Likewise.
28632 (riscv*-*-rtems*): Likewise.
28634 2021-03-09 Jakub Jelinek <jakub@redhat.com>
28636 PR tree-optimization/99305
28637 * tree-ssa-phiopt.c (conditional_replacement): Test integer_pow2p
28638 before integer_all_onesp instead of vice versa.
28640 2021-03-09 Richard Earnshaw <rearnsha@arm.com>
28642 * common/config/arm/arm-common.c (arm_config_default): Change type
28643 of 'i' to unsigned.
28645 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
28648 * lra-constraints.c (process_address_1): Process constraint 'g'
28649 separately and digital constraints containing more one digit.
28651 2021-03-09 Nick Clifton <nickc@redhat.com>
28653 * config/rx/rx.h (DBX_DEBUGGING_INFO): Define.
28654 (DWARF"_DEBUGGING_INFO): Define.
28656 2021-03-09 Eric Botcazou <ebotcazou@adacore.com>
28659 * calls.c (initialize_argument_information): When the argument
28660 is passed by reference, do not make a copy in a thunk only if
28661 the argument is already in memory. Remove redundant test for
28662 the case of callee copy.
28664 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
28667 * lra-constraints.c (process_address_1): Process 0..9 constraints
28668 in process_address_1.
28670 2021-03-09 Andreas Krebbel <krebbel@linux.ibm.com>
28672 * config/s390/s390.c (struct s390_processor processor_table):
28673 Binutils name string must not be empty.
28675 2021-03-09 Claudiu Zissulescu <claziss@synopsys.com>
28677 * config/arc/arc.c (arc_attr_type): Remove function.
28679 2021-03-09 Martin Liska <mliska@suse.cz>
28682 * config/i386/i386-options.c (ix86_option_override_internal):
28683 Set isa_flags for OPTS argument and not for the global
28686 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
28688 * config/rs6000/predicates.md (ds_form_mem_operand): Check
28691 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
28694 * config/rs6000/predicates.md (ds_form_mem_operand) New
28696 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10) Use
28697 ds_form_mem_operand in ld/lwa patterns.
28698 * config/rs6000/fusion.md: Regenerate file.
28700 2021-03-08 Martin Sebor <msebor@redhat.com>
28702 PR middle-end/98266
28703 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): New function.
28704 (array_bounds_checker::check_array_bounds): Call it.
28706 2021-03-08 Martin Sebor <msebor@redhat.com>
28708 PR middle-end/97631
28709 * tree-ssa-strlen.c (maybe_warn_overflow): Test rawmem.
28710 (handle_builtin_stxncpy_strncat): Rename locals. Determine
28711 destination size from allocation calls. Issue a more appropriate
28713 (handle_builtin_memcpy): Pass true as rawmem to maybe_warn_overflow.
28714 (handle_builtin_memset): Same.
28716 2021-03-08 Peter Bergner <bergner@linux.ibm.com>
28719 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_permute): Add an assert
28720 to ensure we do not have an Altivec style address.
28721 * config/rs6000/vsx.md (*vsx_le_perm_load_<mode>): Disable if passed
28722 an Altivec style address.
28723 (*vsx_le_perm_store_<mode>): Likewise.
28724 (splitters after *vsx_le_perm_store_<mode>): Likewise.
28725 (vsx_load_<mode>): Disable special expander if passed an Altivec
28727 (vsx_store_<mode>): Likewise.
28729 2021-03-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28732 * config/aarch64/predicates.md (aarch64_simd_shift_imm_vec_qi): Define.
28733 (aarch64_simd_shift_imm_vec_hi): Likewise.
28734 (aarch64_simd_shift_imm_vec_si): Likewise.
28735 (aarch64_simd_shift_imm_vec_di): Likewise.
28736 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le): Use
28737 predicate from above.
28738 (aarch64_shrn<mode>_insn_be): Likewise.
28739 (aarch64_rshrn<mode>_insn_le): Likewise.
28740 (aarch64_rshrn<mode>_insn_be): Likewise.
28741 (aarch64_shrn2<mode>_insn_le): Likewise.
28742 (aarch64_shrn2<mode>_insn_be): Likewise.
28743 (aarch64_rshrn2<mode>_insn_le): Likewise.
28744 (aarch64_rshrn2<mode>_insn_be): Likewise.
28746 2021-03-08 Vladimir N. Makarov <vmakarov@redhat.com>
28749 * lra-constraints.c (skip_contraint_modifiers): New function.
28750 (process_address_1): Use it before lookup_constraint call.
28752 2021-03-08 Martin Liska <mliska@suse.cz>
28755 * config/i386/i386-options.c (ix86_option_override_internal):
28756 Enable UINTR and HRESET for -march that supports it.
28758 2021-03-08 Ilya Leoshkevich <iii@linux.ibm.com>
28760 * config/s390/s390.c (f_constraint_p): New function.
28761 (s390_md_asm_adjust): Implement TARGET_MD_ASM_ADJUST.
28762 (TARGET_MD_ASM_ADJUST): Likewise.
28764 2021-03-08 Tobias Burnus <tobias@codesourcery.com>
28767 * tree-nested.c (convert_local_reference_stmt): Avoid calling
28768 lookup_field_for_decl for Fortran module (= namespace context).
28770 2021-03-08 Andreas Krebbel <krebbel@linux.ibm.com>
28772 * config/s390/s390.c (s390_expand_vec_compare): Implement <0
28773 comparison with arithmetic right shift.
28774 (s390_expand_vcond): No need for a force_reg anymore.
28775 s390_vec_compare will do it.
28776 * config/s390/vector.md ("vec_cmp<mode><tointvec>"): Accept also
28777 immediate operands.
28779 2021-03-07 Jakub Jelinek <jakub@redhat.com>
28782 * config/i386/constraints.md (Yw): Use SSE_REGS if TARGET_SSE
28783 but TARGET_AVX512BW or TARGET_AVX512VL is not set. Adjust description
28785 * config/i386/sse.md (v_Yw): New define_mode_attr.
28786 (*<insn><mode>3, *mul<mode>3<mask_name>, *avx2_<code><mode>3,
28787 *sse4_1_<code><mode>3<mask_name>): Use <v_Yw> instead of v
28789 * config/i386/mmx.md (mmx_pshufw_1, *vec_dupv4hi): Use Yw instead of
28790 xYw in constraints.
28792 2021-03-06 Julian Brown <julian@codesourcery.com>
28794 * tree-pretty-print.c (dump_generic_node): Emit non-generic
28795 address space info for aggregates.
28797 2021-03-06 Hans-Peter Nilsson <hp@axis.com>
28799 * config/cris/cris.h (MAX_FIXED_MODE_SIZE): Don't define.
28801 2021-03-05 Jakub Jelinek <jakub@redhat.com>
28803 PR middle-end/99322
28804 * tree-cfg.c (bb_to_omp_idx): New variable.
28805 (execute_build_cfg): Release the bb_to_omp_idx vector after
28806 cleanup_tree_cfg returns.
28807 (handle_abnormal_edges): Remove bb_to_omp_idx argument, adjust
28808 for bb_to_omp_idx being a vec<int> instead of pointer to array
28810 (make_edges): Remove bb_to_omp_idx local variable, don't pass
28811 it to handle_abnormal_edges, adjust for bb_to_omp_idx being a
28812 vec<int> instead of pointer to array of ints and don't free/release
28814 (remove_bb): When removing a bb and placing forced label somewhere
28815 else, ensure it is put into the same OpenMP region during cfg
28816 pass if possible or to entry successor as fallback. Unregister
28817 bb from bb_to_omp_idx.
28819 2021-03-05 Vladimir N. Makarov <vmakarov@redhat.com>
28822 * lra-constraints.c (process_address_1): Skip decomposing address
28823 for asm insn operand with unknown constraint.
28825 2021-03-05 Martin Jambor <mjambor@suse.cz>
28828 * cgraph.c (cgraph_edge::set_call_stmt): Do not update all
28829 corresponding speculative edges if we are about to resolve
28830 sepculation. Make edge direct (and so resolve speculations) before
28831 removing it from call_site_hash.
28832 (cgraph_edge::make_direct): Relax the initial assert to allow calling
28833 the function on speculative direct edges.
28835 2021-03-05 Eric Botcazou <ebotcazou@adacore.com>
28837 PR rtl-optimization/99376
28838 * rtlanal.c (nonzero_bits1) <arithmetic operators>: If the number
28839 of low-order zero bits is too large, set the result to 0 directly.
28841 2021-03-04 Jakub Jelinek <jakub@redhat.com>
28843 PR middle-end/93235
28844 * expmed.c (store_bit_field_using_insv): Return false of xop0 is a
28845 SUBREG and a SUBREG to op_mode can't be created.
28847 2021-03-04 Alex Coplan <alex.coplan@arm.com>
28850 * config/aarch64/aarch64-sve-builtins.cc
28851 (function_resolver::require_vector_type): Handle error_mark_node.
28853 2021-03-04 Ilya Leoshkevich <iii@linux.ibm.com>
28855 * cfgexpand.c (expand_asm_loc): Pass new parameter.
28856 (expand_asm_stmt): Likewise.
28857 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Add new
28859 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
28860 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
28861 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
28862 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
28863 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
28864 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
28865 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
28866 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
28867 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
28868 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
28869 * doc/tm.texi (md_asm_adjust): Likewise.
28870 * target.def (md_asm_adjust): Likewise.
28872 2021-03-04 Richard Biener <rguenther@suse.de>
28874 PR middle-end/97855
28875 * tree-pretty-print.c: Poison pp_printf.
28876 (dump_decl_name): Avoid use of pp_printf.
28877 (dump_block_node): Likewise.
28878 (dump_generic_node): Likewise.
28880 2021-03-04 Martin Sebor <msebor@redhat.com>
28882 PR middle-end/96963
28883 PR middle-end/94655
28884 * builtins.c (handle_array_ref): New helper.
28885 (handle_mem_ref): New helper.
28886 (compute_objsize_r): Factor out ARRAY_REF and MEM_REF handling
28887 into new helper functions. Correct a workaround for vectorized
28890 2021-03-03 Pat Haugen <pthaugen@linux.ibm.com>
28892 * config/rs6000/dfp.md (extendddtd2, trunctddd2, *cmp<mode>_internal1,
28893 floatditd2, ftrunc<mode>2, fix<mode>di2, dfp_ddedpd_<mode>,
28894 dfp_denbcd_<mode>, dfp_dxex_<mode>, dfp_diex_<mode>,
28895 *dfp_sgnfcnc_<mode>, dfp_dscli_<mode>, dfp_dscri_<mode>): Update size
28896 attribute for Power10.
28897 * config/rs6000/mma.md (*movoo): Likewise.
28898 * config/rs6000/rs6000.md (define_attr "size"): Add 256.
28899 (define_mode_attr bits): Add DD/TD modes.
28900 * config/rs6000/sync.md (load_quadpti, store_quadpti, load_lockedpti,
28901 store_conditionalpti): Update size attribute for Power10.
28903 2021-03-03 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
28906 * config/sparc/t-sparc (tree-ssanames.o-warn): Don't error for
28907 -Wuninitialized, -Wmaybe-uninitialized.
28908 (wide-int.o-warn): Likewise.
28910 2021-03-03 Richard Earnshaw <rearnsha@arm.com>
28912 * common/config/arm/arm-common.c: Include configargs.h.
28913 (arm_config_default): New function.
28914 (arm_target_mode): Renamed from arm_target_thumb_only. Handle
28915 processors that do not support Thumb. Take into account the
28916 --with-mode configuration setting for selecting the default.
28917 * config/arm/arm.h (OPTION_DEFAULT_SPECS): Remove entry for 'mode'.
28918 (TARGET_MODE_SPEC_FUNCTIONS): Update for function name change.
28920 2021-03-03 Martin Liska <mliska@suse.cz>
28922 PR gcov-profile/97461
28923 * gcov-io.h (GCOV_PREALLOCATED_KVP): Remove.
28925 2021-03-03 Eric Botcazou <ebotcazou@adacore.com>
28928 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
28929 point back the hard frame pointer to its default location when the
28930 frame is larger than SEH_MAX_FRAME_SIZE.
28932 2021-03-03 Jakub Jelinek <jakub@redhat.com>
28935 * config/i386/predicates.md (logic_operator): New define_predicate.
28936 * config/i386/i386.md (mov + mem using comm arith peephole2):
28937 Punt if operands[1] is EXT_REX_SSE_REGNO_P, AVX512BW is not enabled
28938 and the inner mode is [QH]Imode.
28940 2021-03-03 Jakub Jelinek <jakub@redhat.com>
28943 * dwarf2out.c (dw_loc_list_struct): Add end_entry member.
28944 (new_loc_list): Clear end_entry.
28945 (output_loc_list): Only use DW_LLE_startx_length for -gsplit-dwarf
28946 if HAVE_AS_LEB128, otherwise use DW_LLE_startx_endx. Fix comment
28948 (index_location_lists): For dwarf_version >= 5 without HAVE_AS_LEB128,
28949 initialize also end_entry.
28951 2021-03-03 Jakub Jelinek <jakub@redhat.com>
28954 * cfgrtl.c (fixup_partitions): When changing some bbs from hot to cold
28955 partitions, if in non-layout mode after reorder_blocks also move
28956 affected blocks to ensure a single partition transition.
28958 2021-03-03 Jason Merrill <jason@redhat.com>
28961 * cgraphunit.c (process_function_and_variable_attributes): Don't
28962 warn about flatten on an alias if the target also has it.
28963 * cgraph.h (symtab_node::get_alias_target_tree): New.
28965 2021-03-02 David Edelsohn <dje.gcc@gmail.com>
28967 * config/rs6000/rs6000.md (tls_get_tpointer_internal): Prepend
28968 period to symbol name.
28969 (tls_get_addr_internal<mode>): Same.
28971 2021-03-02 David Malcolm <dmalcolm@redhat.com>
28974 * diagnostic-show-locus.c
28975 (selftest::test_one_liner_many_fixits_2): Fix accidental usage of
28978 2021-03-02 Martin Sebor <msebor@redhat.com>
28980 PR middle-end/99276
28981 * builtins.c (warn_for_access): Remove stray warning text.
28983 2021-03-02 Martin Sebor <msebor@redhat.com>
28985 PR middle-end/99295
28986 * doc/extend.texi (attribute malloc): Reword and clarify nonaliasing
28989 2021-03-02 Jakub Jelinek <jakub@redhat.com>
28992 * dwarf2out.c (output_macinfo_op): Use DW_MACRO_*_str* even with
28993 -gdwarf-5 -gstrict-dwarf. For -gsplit-dwarf -gdwarf-5 use
28994 DW_MACRO_*_strx instead of DW_MACRO_*_strp. Handle
28995 DW_MACRO_define_strx and DW_MACRO_undef_strx.
28996 (save_macinfo_strings): Use DW_MACRO_*_str* even with
28997 -gdwarf-5 -gstrict-dwarf. Handle DW_MACRO_define_strx and
28998 DW_MACRO_undef_strx.
29000 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
29002 * config/s390/s390-builtin-types.def (BT_FN_V4SF_V8HI_UINT): New
29004 (BT_FN_V8HI_V8HI_UINT): Likewise.
29005 (BT_FN_V8HI_V4SF_V4SF_UINT): Likewise.
29006 * config/s390/s390-builtins.def (B_NNPA): New macro definition.
29007 (s390_vclfnhs, s390_vclfnls, s390_vcrnfs, s390_vcfn, s390_vcnf):
29008 New builtin definitions.
29009 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Bump
29010 vector extension version.
29011 * config/s390/s390.c (s390_expand_builtin): Check if builtins are
29012 available with current -march level.
29013 * config/s390/s390.md (UNSPEC_NNPA_VCLFNHS_V8HI)
29014 (UNSPEC_NNPA_VCLFNLS_V8HI, UNSPEC_NNPA_VCRNFS_V8HI)
29015 (UNSPEC_NNPA_VCFN_V8HI, UNSPEC_NNPA_VCNF_V8HI): New constants.
29016 * config/s390/vecintrin.h (vec_extend_to_fp32_hi): New macro.
29017 (vec_extend_to_fp32_lo): Likewise.
29018 (vec_round_from_fp32): Likewise.
29019 (vec_convert_to_fp16): Likewise.
29020 (vec_convert_from_fp16): Likewise.
29021 * config/s390/vx-builtins.md (vclfnhs_v8hi): New insn pattern.
29022 (vclfnls_v8hi): Likewise.
29023 (vcrnfs_v8hi): Likewise.
29024 (vcfn_v8hi): Likewise.
29025 (vcnf_v8hi): Likewise.
29027 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
29029 * common/config/s390/s390-common.c (processor_flags_table): New entry.
29030 * config.gcc: Enable arch14 for --with-arch and --with-tune.
29031 * config/s390/driver-native.c (s390_host_detect_local_cpu): Pick
29032 arch14 for unknown CPU models.
29033 * config/s390/s390-opts.h (enum processor_type): Add PROCESSOR_ARCH14.
29034 * config/s390/s390.c (s390_issue_rate): Add case for PROCESSOR_ARCH14.
29035 (s390_get_sched_attrmask): Likewise.
29036 (s390_get_unit_mask): Likewise.
29037 * config/s390/s390.h (enum processor_flags): Add PF_NNPA and PF_ARCH14.
29038 (TARGET_CPU_ARCH14, TARGET_CPU_ARCH14_P, TARGET_CPU_NNPA)
29039 (TARGET_CPU_NNPA_P, TARGET_ARCH14, TARGET_ARCH14_P, TARGET_NNPA)
29040 (TARGET_NNPA_P): New macro definitions.
29041 * config/s390/s390.md ("cpu_facility", "enabled"): Add arch14 and nnpa.
29042 * config/s390/s390.opt: Add PROCESSOR_ARCH14.
29044 2021-03-02 Jakub Jelinek <jakub@redhat.com>
29046 PR middle-end/95757
29047 * tree-vrp.c (register_edge_assert_for): Remove superfluous ()s around
29048 condition. Call register_edge_assert_for_1 for == 0, != 0, == 1 and
29049 != 1 comparisons if name is lhs of a comparison.
29051 2021-03-01 Iain Sandoe <iain@sandoe.co.uk>
29055 * config/darwin-protos.h (darwin_should_restore_cfa_state): New.
29056 * config/darwin.c (darwin_should_restore_cfa_state): New.
29057 * config/darwin.h (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New.
29058 * doc/tm.texi: Regenerated.
29059 * doc/tm.texi.in: Document TARGET_ASM_SHOULD_RESTORE_CFA_STATE.
29060 * dwarf2cfi.c (connect_traces): If the target requests, restore
29061 the CFA expression after a DW_CFA_restore.
29062 * target.def (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New hook.
29064 2021-03-01 Martin Liska <mliska@suse.cz>
29067 * optc-save-gen.awk: Add 4 more exceptions.
29069 2021-03-01 Nathan Sidwell <nathan@acm.org>
29072 * tree.h (TYPE_ALIGN_RAW): New accessor.
29073 (TYPE_ALIGN): Use it.
29075 2021-03-01 Jan Hubicka <jh@suse.cz>
29078 * ipa-fnsummary.c (compute_fn_summary): Fix sanity check.
29080 2021-03-01 Eric Botcazou <ebotcazou@adacore.com>
29083 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
29084 point the hard frame pointer to the SSE register save area instead
29085 of the general register save area. Perform only minimal adjustment
29086 for small frames if it is initially not correctly aligned.
29087 (ix86_expand_prologue): Remove early saves for a SEH target.
29088 * config/i386/winnt.c (struct seh_frame_state): Document constraint.
29090 2021-02-28 Jakub Jelinek <jakub@redhat.com>
29093 * ipa.c (symbol_table::remove_unreachable_nodes): Fix a comment
29094 typo - referneced -> referenced.
29095 * tree.c (component_ref_size): Fix comment typo -
29096 refernce -> reference.
29097 * tree-ssa-alias.c (access_path_may_continue_p): Fix comment typo -
29098 traling -> trailing.
29099 (aliasing_component_refs_p): Fix comment typos -
29100 refernce -> reference and refernece -> reference and
29101 traling -> trailing.
29102 (nonoverlapping_refs_since_match_p): Fix comment typo -
29103 referneces -> references.
29104 * doc/invoke.texi (--param modref-max-bases): Fix a typo -
29105 referneces -> references.
29107 2021-02-27 Iain Sandoe <iain@sandoe.co.uk>
29109 * config/host-darwin.c (darwin_gt_pch_use_address): Modify
29110 diagnostic message to avoid use of a contraction and format
29113 2021-02-27 Jakub Jelinek <jakub@redhat.com>
29116 * gcse.c (gcse_or_cprop_is_too_expensive): Use %wu instead of
29117 HOST_WIDE_INT_PRINT_UNSIGNED in warning format string.
29118 * ipa-devirt.c (ipa_odr_read_section): Use %wd instead of
29119 HOST_WIDE_INT_PRINT_DEC in inform format string. Fix comment
29122 2021-02-26 Richard Biener <rguenther@suse.de>
29124 PR middle-end/99281
29125 * expr.c (store_field): For calls with return-slot optimization
29126 and addressable return type expand the store directly.
29128 2021-02-26 Richard Biener <rguenther@suse.de>
29131 * builtins.c (warn_string_no_nul): Fix diagnostic formatting.
29133 2021-02-26 Peter Bergner <bergner@linux.ibm.com>
29136 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Replace assert
29139 2021-02-26 Aaron Sawdey <acsawdey@linux.ibm.com>
29141 * config.gcc: Add rs6000-pcrel-opt.o.
29142 * config/rs6000/rs6000-pcrel-opt.c: New file.
29143 * config/rs6000/pcrel-opt.md: New file.
29144 * config/rs6000/predicates.md: Add d_form_memory predicate.
29145 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_PCREL_OPT.
29146 * config/rs6000/rs6000-passes.def: Add pass_pcrel_opt.
29147 * config/rs6000/rs6000-protos.h: Add reg_to_non_prefixed(),
29148 pcrel_opt_valid_mem_p(), output_pcrel_opt_reloc(),
29149 and make_pass_pcrel_opt().
29150 * config/rs6000/rs6000.c (reg_to_non_prefixed): Make global.
29151 (rs6000_option_override_internal): Add pcrel-opt.
29152 (rs6000_delegitimize_address): Support pcrel-opt.
29153 (rs6000_opt_masks): Add pcrel-opt.
29154 (pcrel_opt_valid_mem_p): New function.
29155 (reg_to_non_prefixed): Make global.
29156 (rs6000_asm_output_opcode): Reset prepend_p_to_next_insn.
29157 (output_pcrel_opt_reloc): New function.
29158 * config/rs6000/rs6000.md (loads_extern_addr): New attr.
29159 (pcrel_extern_addr): Set loads_extern_addr.
29160 Add include for pcrel-opt.md.
29161 * config/rs6000/rs6000.opt: Add -mpcrel-opt.
29162 * config/rs6000/t-rs6000: Add rules for pcrel-opt.c and
29165 2021-02-26 YunQiang Su <yunqiang.su@cipunited.com>
29168 * config/mips/mips.c (mips_expand_ext_as_unaligned_load):
29169 If TARGET_64BIT and dest is SUBREG, we check the width, if it
29170 equal to SImode, we use SImode operation, just like what we are
29173 2021-02-26 Marek Polacek <polacek@redhat.com>
29175 * builtins.c (warn_for_access): Fix typos.
29177 2021-02-25 Iain Sandoe <iain@sandoe.co.uk>
29179 * config/aarch64/aarch64.md (<optab>_rol<mode>3): Add a '#'
29180 mark in front of the immediate quantity.
29181 (<optab>_rolsi3_uxtw): Likewise.
29183 2021-02-25 Richard Earnshaw <rearnsha@arm.com>
29186 * config/arm/thumb2.md (nonsecure_call_reg_thumb2_fpcxt): New pattern.
29187 (nonsecure_call_value_reg_thumb2_fpcxt): Likewise.
29188 (nonsecure_call_reg_thumb2): Restrict to using r4 for the callee
29189 address and disable when the FPCXT is not available.
29190 (nonsecure_call_value_reg_thumb2): Likewise.
29192 2021-02-25 Nathan Sidwell <nathan@acm.org>
29195 * doc/invoke.texi (flang-info-module-cmi): Renamed option.
29197 2021-02-25 Tamar Christina <tamar.christina@arm.com>
29199 * tree-vect-slp.c (optimize_load_redistribution_1): Abort on NULL nodes.
29201 2021-02-25 Richard Biener <rguenther@suse.de>
29203 PR tree-optimization/99253
29204 * tree-vect-loop.c (check_reduction_path): First compute
29205 code, then verify out-of-loop uses.
29207 2021-02-25 Jakub Jelinek <jakub@redhat.com>
29210 * match.pd ((T)(A) + CST -> (T)(A + CST)): Add :s to convert.
29212 2021-02-25 Jakub Jelinek <jakub@redhat.com>
29214 PR tree-optimization/80635
29215 * tree-vrp.c (vrp_simplify_cond_using_ranges): Also handle
29216 VIEW_CONVERT_EXPR if modes are the same, innerop is integral and
29217 has mode precision.
29219 2021-02-25 Richard Biener <rguenther@suse.de>
29221 * tree-vect-slp.c (optimize_load_redistribution_1): Delay
29222 load_map population.
29223 (vect_match_slp_patterns_2): Revert part of last change.
29224 (vect_analyze_slp): Do not interleave optimize_load_redistribution
29225 with pattern detection but do it afterwards. Dump the
29226 whole SLP graph after pattern recognition and load
29227 redistribution optimization finished.
29229 2021-02-24 Jakub Jelinek <jakub@redhat.com>
29232 * omp-low.c (struct omp_context): Add teams_nested_p and
29233 nonteams_nested_p members.
29234 (scan_omp_target): Diagnose teams nested inside of target with other
29235 directives strictly nested inside of the same target.
29236 (check_omp_nesting_restrictions): Set ctx->teams_nested_p or
29237 ctx->nonteams_nested_p as needed.
29239 2021-02-24 Vladimir N. Makarov <vmakarov@redhat.com>
29241 PR inline-asm/99123
29242 * lra-constraints.c (uses_hard_regs_p): Don't use decompose_mem_address.
29244 2021-02-24 Hans-Peter Nilsson <hp@axis.com>
29246 * config/cris/cris.c (cris_expand_prologue): Set
29247 current_function_static_stack_size, if flag_stack_usage_info.
29249 2021-02-24 Pat Haugen <pthaugen@linux.ibm.com>
29251 * config/rs6000/rs6000.c (next_insn_prefixed_p): Rename.
29252 (rs6000_final_prescan_insn): Adjust.
29253 (rs6000_asm_output_opcode): Likewise.
29255 2021-02-24 Martin Sebor <msebor@redhat.com>
29257 PR middle-end/97172
29258 * attribs.c (attr_access::free_lang_data): Clear attribute arg spec
29259 from function arguments.
29261 2021-02-24 Tamar Christina <tamar.christina@arm.com>
29263 PR tree-optimization/99220
29264 * tree-vect-slp.c (optimize_load_redistribution_1): Remove
29265 node from cache when it's about to be deleted.
29267 2021-02-24 Jakub Jelinek <jakub@redhat.com>
29269 PR tree-optimization/99225
29270 * fold-const.c (fold_binary_loc) <case NE_EXPR>: In (x & (1 << y)) != 0
29271 to ((x >> y) & 1) != 0 simplifications use build_one_cst instead of
29272 build_int_cst (..., 1). Formatting fixes.
29274 2021-02-24 Tamar Christina <tamar.christina@arm.com>
29276 PR tree-optimization/99149
29277 * tree-vect-slp-patterns.c (vect_detect_pair_op): Don't recreate the
29279 (vect_slp_reset_pattern): Remove.
29280 (complex_fma_pattern::matches): Remove call to vect_slp_reset_pattern.
29281 (complex_mul_pattern::build, complex_fma_pattern::build,
29282 complex_fms_pattern::build): Fix ref counts.
29283 * tree-vect-slp.c (vect_free_slp_tree): Undo SLP only pattern relevancy
29284 when node is being deleted.
29285 (vect_match_slp_patterns_2): Correct result of cache hit on patterns.
29286 (vect_schedule_slp): Invalidate SLP_TREE_REPRESENTATIVE of removed
29288 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize value.
29290 2021-02-24 Matthias Klose <doko@ubuntu.com>
29293 2020-12-07 Matthias Klose <doko@ubuntu.com>
29295 * genextract.c (print_header): Undefine ENABLE_RTL_CHECKING
29296 and ENABLE_RTL_FLAG_CHECKING.
29298 2021-02-24 Richard Biener <rguenther@suse.de>
29301 * builtins.c (fold_builtin_next_arg): Avoid NULL arg.
29303 2021-02-23 Peter Bergner <bergner@linux.ibm.com>
29305 * config/rs6000/mma.md (mma_assemble_pair): Rename from this...
29306 (vsx_assemble_pair): ...to this.
29307 (*mma_assemble_pair): Rename from this...
29308 (*vsx_assemble_pair): ...to this.
29309 (mma_disassemble_pair): Rename from this...
29310 (vsx_disassemble_pair): ...to this.
29311 (*mma_disassemble_pair): Rename from this...
29312 (*vsx_disassemble_pair): ...to this.
29313 * config/rs6000/rs6000-builtin.def (BU_MMA_V2, BU_MMA_V3,
29314 BU_COMPAT): New macros.
29315 (mma_assemble_pair): Rename from this...
29316 (vsx_assemble_pair): ...to this.
29317 (mma_disassemble_pair): Rename from this...
29318 (vsx_disassemble_pair): ...to this.
29319 (mma_assemble_pair): New compatibility built-in.
29320 (mma_disassemble_pair): Likewise.
29321 * config/rs6000/rs6000-call.c (struct builtin_compatibility): New.
29322 (RS6000_BUILTIN_COMPAT): Define.
29323 (bdesc_compat): New.
29324 (mma_expand_builtin): Use VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
29325 (rs6000_gimple_fold_mma_builtin): Use MMA_BUILTIN_DISASSEMBLE_PAIR
29326 and VSX_BUILTIN_ASSEMBLE_PAIR.
29327 (rs6000_init_builtins): Register compatibility built-ins.
29328 (mma_init_builtins): Use VSX_BUILTIN_ASSEMBLE_PAIR,
29329 VSX_BUILTIN_ASSEMBLE_PAIR_INTERNAL, VSX_BUILTIN_DISASSEMBLE_PAIR and
29330 VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
29331 * doc/extend.texi (__builtin_mma_assemble_pair): Rename from this...
29332 (__builtin_vsx_assemble_pair): ...to this.
29333 (__builtin_mma_disassemble_pair): Rename from this...
29334 (__builtin_vsx_disassemble_pair): ...to this.
29336 2021-02-23 Martin Liska <mliska@suse.cz>
29339 * ipa-icf.c (sem_variable::merge): Do not merge 2 variables
29340 with different alignment. That leads to an invalid red zone
29341 size allocated in runtime.
29343 2021-02-23 Jakub Jelinek <jakub@redhat.com>
29345 PR tree-optimization/99204
29346 * fold-const.c (fold_read_from_constant_string): Check that
29347 tree_fits_uhwi_p (index) rather than just that index is INTEGER_CST.
29349 2021-02-23 Segher Boessenkool <segher@kernel.crashing.org>
29350 Kewen Lin <linkw@gcc.gnu.org>
29352 * config/rs6000/rs6000.md (*rotl<mode>3_insert_3): Renamed to...
29353 (rotl<mode>3_insert_3): ...this.
29354 (plus_ior_xor): New code_iterator.
29355 (define_split for GPR rl*imi): New splitter.
29356 * config/rs6000/vsx.md (vsx_init_v4si): Use gen_rotldi3_insert_3
29357 for integer merging.
29359 2021-02-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29361 * config/aarch64/aarch64-tuning-flags.def (cse_sve_vl_constants):
29363 * config/aarch64/aarch64.md (add<mode>3): Force CONST_POLY_INT immediates
29364 into a register when the above is enabled.
29365 * config/aarch64/aarch64.c (neoversev1_tunings):
29366 AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
29367 (aarch64_rtx_costs): Use AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
29369 2021-02-22 Hans-Peter Nilsson <hp@axis.com>
29371 * config/cris/cris.c (cris_print_operand) <'T'>: Change
29372 valid operand from is now an addi mult-value to shift-value.
29373 * config/cris/cris.md (*addi): Change expression of scaled
29374 operand from mult to ashift.
29375 * config/cris/cris.md (*addi_reload): New insn_and_split.
29377 2021-02-22 John David Anglin <danglin@gcc.gnu.org>
29380 * config/pa/pa.c (TARGET_ASM_CAN_OUTPUT_MI_THUNK): Define as
29381 hook_bool_const_tree_hwi_hwi_const_tree_true.
29382 (pa_asm_output_mi_thunk): Add support for nonzero vcall_offset.
29384 2021-02-22 Andre Vieira <andre.simoesdiasvieira@arm.com>
29386 PR rtl-optimization/98791
29387 * ira-conflicts.c (process_regs_for_copy): Don't create allocno copies
29388 for unordered modes.
29390 2021-02-22 Martin Liska <mliska@suse.cz>
29392 * tree-inline.c (inline_forbidden_p): Set
29393 inline_forbidden_reason.
29395 2021-02-22 Richard Biener <rguenther@suse.de>
29397 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Dump
29400 2021-02-22 Richard Biener <rguenther@suse.de>
29402 PR tree-optimization/99165
29403 * gimple-ssa-store-merging.c (pass_store_merging::process_store):
29404 Accumulate changed to ret.
29406 2021-02-21 Uros Bizjak <ubizjak@gmail.com>
29409 2020-12-09 Uroš Bizjak <ubizjak@gmail.com>
29411 * config/i386/i386.h (REG_ALLOC_ORDER): Remove
29413 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
29416 * config/s390/vector.md (trunctf<DFP_ALL:mode>2_vr): New
29418 (trunctf<DFP_ALL:mode>2): Likewise.
29419 (trunctdtf2_vr): Likewise.
29420 (trunctdtf2): Likewise.
29421 (extend<DFP_ALL:mode>tf2_vr): Likewise.
29422 (extend<DFP_ALL:mode>tf2): Likewise.
29423 (extendtftd2_vr): Likewise.
29424 (extendtftd2): Likewise.
29426 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
29428 * config/s390/vector.md (*fprx2_to_tf): Rename to fprx2_to_tf,
29429 add memory alternative.
29430 (tf_to_fprx2): New pattern.
29432 2021-02-19 Martin Sebor <msebor@redhat.com>
29435 * attribs.c (init_attr_rdwr_indices): Guard vblist use.
29436 (attr_access::free_lang_data): Remove a spurious test.
29438 2021-02-19 Nathan Sidwell <nathan@acm.org>
29440 * doc/invoke.texi (flang-info-module-read): Document.
29442 2021-02-19 Martin Liska <mliska@suse.cz>
29444 PR translation/99167
29445 * params.opt: Fix typo.
29447 2021-02-19 Richard Biener <rguenther@suse.de>
29449 PR middle-end/99122
29450 * tree-inline.c (inline_forbidden_p): Do not inline functions
29451 with VLA arguments or return value.
29453 2021-02-19 Jakub Jelinek <jakub@redhat.com>
29456 * config/arm/arm.md (*stack_protect_combined_set_insn,
29457 *stack_protect_combined_test_insn): If force_const_mem result
29458 is not valid general operand, force its address into the destination
29461 2021-02-19 Jakub Jelinek <jakub@redhat.com>
29464 * tree-cfg.c (gimple_merge_blocks): If bb a starts with eh landing
29465 pad or non-local label, put FORCED_LABELs from bb b after that label
29466 rather than before it.
29468 2021-02-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
29471 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3): Use
29472 expand_vector_broadcast' to emit the vec_duplicate operand.
29474 2021-02-18 Vladimir N. Makarov <vmakarov@redhat.com>
29476 PR rtl-optimization/96264
29477 * lra-remat.c (reg_overlap_for_remat_p): Check also output insn
29480 2021-02-18 H.J. Lu <hjl.tools@gmail.com>
29483 * varasm.c (get_section): Replace SUPPORTS_SHF_GNU_RETAIN with
29484 looking up the retain attribute.
29485 (resolve_unique_section): Likewise.
29486 (get_variable_section): Likewise.
29487 (switch_to_section): Likewise. Warn when a symbol without the
29488 retain attribute and a symbol with the retain attribute are
29489 placed in the section with the same name, instead of the used
29491 * doc/extend.texi: Document the "retain" attribute.
29493 2021-02-18 Nathan Sidwell <nathan@acm.org>
29496 * doc/invoke.texi (flang-info-include-translate): Document header
29499 2021-02-18 Richard Biener <rguenther@suse.de>
29501 PR middle-end/99122
29502 * ipa-fnsummary.c (analyze_function_body): Set
29503 CIF_FUNCTION_NOT_INLINABLE for VLA parameter calls.
29504 * tree-inline.c (insert_init_debug_bind): Pass NULL for
29505 error_mark_node values.
29506 (force_value_to_type): Do not build V_C_Es for WITH_SIZE_EXPR
29508 (setup_one_parameter): Delay force_value_to_type until when
29511 2021-02-18 Hans-Peter Nilsson <hp@axis.com>
29513 PR tree-optimization/99142
29514 * match.pd (clz cmp 0): Gate replacement on single_use of clz result.
29516 2021-02-18 Jakub Jelinek <jakub@redhat.com>
29518 * wide-int-bitmask.h (wide_int_bitmask::wide_int_bitmask (),
29519 wide_int_bitmask::wide_int_bitmask (uint64_t),
29520 wide_int_bitmask::wide_int_bitmask (uint64_t, uint64_t),
29521 wide_int_bitmask::operator ~ () const,
29522 wide_int_bitmask::operator | (wide_int_bitmask) const,
29523 wide_int_bitmask::operator & (wide_int_bitmask) const): Use constexpr
29525 * config/i386/i386.h (PTA_3DNOW, PTA_3DNOW_A, PTA_64BIT, PTA_ABM,
29526 PTA_AES, PTA_AVX, PTA_BMI, PTA_CX16, PTA_F16C, PTA_FMA, PTA_FMA4,
29527 PTA_FSGSBASE, PTA_LWP, PTA_LZCNT, PTA_MMX, PTA_MOVBE, PTA_NO_SAHF,
29528 PTA_PCLMUL, PTA_POPCNT, PTA_PREFETCH_SSE, PTA_RDRND, PTA_SSE, PTA_SSE2,
29529 PTA_SSE3, PTA_SSE4_1, PTA_SSE4_2, PTA_SSE4A, PTA_SSSE3, PTA_TBM,
29530 PTA_XOP, PTA_AVX2, PTA_BMI2, PTA_RTM, PTA_HLE, PTA_PRFCHW, PTA_RDSEED,
29531 PTA_ADX, PTA_FXSR, PTA_XSAVE, PTA_XSAVEOPT, PTA_AVX512F, PTA_AVX512ER,
29532 PTA_AVX512PF, PTA_AVX512CD, PTA_NO_TUNE, PTA_SHA, PTA_PREFETCHWT1,
29533 PTA_CLFLUSHOPT, PTA_XSAVEC, PTA_XSAVES, PTA_AVX512DQ, PTA_AVX512BW,
29534 PTA_AVX512VL, PTA_AVX512IFMA, PTA_AVX512VBMI, PTA_CLWB, PTA_MWAITX,
29535 PTA_CLZERO, PTA_NO_80387, PTA_PKU, PTA_AVX5124VNNIW, PTA_AVX5124FMAPS,
29536 PTA_AVX512VPOPCNTDQ, PTA_SGX, PTA_AVX512VNNI, PTA_GFNI, PTA_VAES,
29537 PTA_AVX512VBMI2, PTA_VPCLMULQDQ, PTA_AVX512BITALG, PTA_RDPID,
29538 PTA_PCONFIG, PTA_WBNOINVD, PTA_AVX512VP2INTERSECT, PTA_PTWRITE,
29539 PTA_AVX512BF16, PTA_WAITPKG, PTA_MOVDIRI, PTA_MOVDIR64B, PTA_ENQCMD,
29540 PTA_CLDEMOTE, PTA_SERIALIZE, PTA_TSXLDTRK, PTA_AMX_TILE, PTA_AMX_INT8,
29541 PTA_AMX_BF16, PTA_UINTR, PTA_HRESET, PTA_KL, PTA_WIDEKL, PTA_AVXVNNI,
29542 PTA_X86_64_BASELINE, PTA_X86_64_V2, PTA_X86_64_V3, PTA_X86_64_V4,
29543 PTA_CORE2, PTA_NEHALEM, PTA_WESTMERE, PTA_SANDYBRIDGE, PTA_IVYBRIDGE,
29544 PTA_HASWELL, PTA_BROADWELL, PTA_SKYLAKE, PTA_SKYLAKE_AVX512,
29545 PTA_CASCADELAKE, PTA_COOPERLAKE, PTA_CANNONLAKE, PTA_ICELAKE_CLIENT,
29546 PTA_ICELAKE_SERVER, PTA_TIGERLAKE, PTA_SAPPHIRERAPIDS, PTA_ALDERLAKE,
29547 PTA_KNL, PTA_BONNELL, PTA_SILVERMONT, PTA_GOLDMONT, PTA_GOLDMONT_PLUS,
29548 PTA_TREMONT, PTA_KNM): Use constexpr instead of const.
29550 2021-02-18 Jakub Jelinek <jakub@redhat.com>
29552 PR middle-end/99109
29553 * gimple-array-bounds.cc (build_zero_elt_array_type): Rename to ...
29554 (build_printable_array_type): ... this. Add nelts argument. For
29555 overaligned eltype, use TYPE_MAIN_VARIANT (eltype) instead. If
29556 nelts, call build_array_type_nelts.
29557 (array_bounds_checker::check_mem_ref): Use build_printable_array_type
29558 instead of build_zero_elt_array_type and build_array_type_nelts.
29560 2021-02-18 Jakub Jelinek <jakub@redhat.com>
29563 * config/i386/i386.c (distance_non_agu_define): Don't call
29564 extract_insn_cached here.
29565 (ix86_lea_outperforms): Save and restore recog_data around call
29566 to distance_non_agu_define and distance_agu_use.
29567 (ix86_ok_to_clobber_flags): Remove.
29568 (ix86_avoid_lea_for_add): Don't call ix86_ok_to_clobber_flags.
29569 (ix86_avoid_lea_for_addr): Likewise. Adjust function comment.
29570 * config/i386/i386.md (*lea<mode>): Change from define_insn_and_split
29571 into define_insn. Move the splitting to define_peephole2 and
29572 check there using peep2_regno_dead_p if FLAGS_REG is dead.
29574 2021-02-17 Julian Brown <julian@codesourcery.com>
29576 * gimplify.c (gimplify_scan_omp_clauses): Handle ATTACH_DETACH
29579 2021-02-17 Xi Ruoyao <xry111@mengyan1223.wang>
29582 * config/mips/mips.c (mips_symbol_insns): Do not use
29583 MSA_SUPPORTED_MODE_P if mode is MAX_MACHINE_MODE.
29585 2021-02-16 Vladimir N. Makarov <vmakarov@redhat.com>
29587 PR inline-asm/98096
29588 * stmt.c (resolve_operand_name_1): Take inout operands into account
29589 for access to labels by names.
29590 * doc/extend.texi: Describe counting operands for accessing labels.
29592 2021-02-16 Richard Biener <rguenther@suse.de>
29594 PR tree-optimization/38474
29595 * tree-ssa-structalias.c (variable_info::address_taken): New.
29596 (new_var_info): Initialize address_taken.
29597 (process_constraint): Set address_taken.
29598 (solve_constraints): Use the new address_taken flag rather
29599 than is_reg_var for sorting variables.
29600 (dump_constraint): Dump the variable number if the name
29603 2021-02-16 Jakub Jelinek <jakub@redhat.com>
29606 * tree-vect-stmts.c (vectorizable_simd_clone_call): For num_calls != 1
29607 multiply by 4096 and for inbranch by 8192.
29608 * config/i386/i386.c (ix86_simd_clone_usable): For TARGET_AVX512F,
29609 return 3, 2 or 1 for mangle letters 'b', 'c' or 'd'.
29611 2021-02-15 Maya Rashish <coypu@sdf.org>
29613 * config/aarch64/aarch64.c (aarch64_init_builtins):
29614 Call SUBTARGET_INIT_BUILTINS.
29616 2021-02-15 Peter Bergner <bergner@linux.ibm.com>
29618 PR rtl-optimization/98872
29619 * init-regs.c (initialize_uninitialized_regs): Skip initialization
29620 if CONST0_RTX is NULL.
29622 2021-02-15 Richard Sandiford <richard.sandiford@arm.com>
29624 PR rtl-optimization/98863
29625 * rtl-ssa/functions.h (function_info::bb_live_out_info): Delete.
29626 (function_info::build_info): Turn into a declaration, moving the
29627 definition to internals.h.
29628 (function_info::bb_walker): Declare.
29629 (function_info::create_reg_use): Likewise.
29630 (function_info::calculate_potential_phi_regs): Take a build_info
29632 (function_info::place_phis, function_info::create_ebbs): Declare.
29633 (function_info::calculate_ebb_live_in_for_debug): Likewise.
29634 (function_info::populate_backedge_phis): Delete.
29635 (function_info::start_block, function_info::end_block): Declare.
29636 (function_info::populate_phi_inputs): Delete.
29637 (function_info::m_potential_phi_regs): Move information to build_info.
29638 * rtl-ssa/internals.h: New file.
29639 (function_info::bb_phi_info): New class.
29640 (function_info::build_info): Moved from functions.h.
29641 Add a constructor and destructor.
29642 (function_info::build_info::ebb_use): Delete.
29643 (function_info::build_info::ebb_def): Likewise.
29644 (function_info::build_info::bb_live_out): Likewise.
29645 (function_info::build_info::tmp_ebb_live_in_for_debug): New variable.
29646 (function_info::build_info::potential_phi_regs): Likewise.
29647 (function_info::build_info::potential_phi_regs_for_debug): Likewise.
29648 (function_info::build_info::ebb_def_regs): Likewise.
29649 (function_info::build_info::bb_phis): Likewise.
29650 (function_info::build_info::bb_mem_live_out): Likewise.
29651 (function_info::build_info::bb_to_rpo): Likewise.
29652 (function_info::build_info::def_stack): Likewise.
29653 (function_info::build_info::old_def_stack_limit): Likewise.
29654 * rtl-ssa/internals.inl (function_info::build_info::record_reg_def):
29655 Remove the regno argument. Push the previous definition onto the
29656 definition stack where necessary.
29657 * rtl-ssa/accesses.cc: Include internals.h.
29658 * rtl-ssa/changes.cc: Likewise.
29659 * rtl-ssa/blocks.cc: Likewise.
29660 (function_info::build_info::build_info): Define.
29661 (function_info::build_info::~build_info): Likewise.
29662 (function_info::bb_walker): New class.
29663 (function_info::bb_walker::bb_walker): Define.
29664 (function_info::add_live_out_use): Convert a logarithmic-complexity
29665 test into a linear one. Allow the same definition to be passed
29667 (function_info::calculate_potential_phi_regs): Moved from
29668 functions.cc. Take a build_info parameter and store the
29669 information there instead.
29670 (function_info::place_phis): New function.
29671 (function_info::add_entry_block_defs): Update call to record_reg_def.
29672 (function_info::calculate_ebb_live_in_for_debug): New function.
29673 (function_info::add_phi_nodes): Use bb_phis to decide which
29674 registers need phi nodes and initialize ebb_def_regs accordingly.
29675 Do not add degenerate phis here.
29676 (function_info::add_artificial_accesses): Use create_reg_use.
29677 Assert that all definitions are listed in the DF LR sets.
29678 Update call to record_reg_def.
29679 (function_info::record_block_live_out): Record live-out register
29680 values in the phis of successor blocks. Use the live-out set
29681 when processing the last block in an EBB, instead of always
29682 using the live-in sets of successor blocks. AND the live sets
29683 with the set of registers that have been defined in the EBB,
29684 rather than with all potential phi registers. Cope correctly
29685 with branches back to the start of the current EBB.
29686 (function_info::start_block): New function.
29687 (function_info::end_block): Likewise.
29688 (function_info::populate_phi_inputs): Likewise.
29689 (function_info::create_ebbs): Likewise.
29690 (function_info::process_all_blocks): Rewrite into a multi-phase
29692 * rtl-ssa/functions.cc: Include internals.h.
29693 (function_info::calculate_potential_phi_regs): Move to blocks.cc.
29694 (function_info::init_function_data): Remove caller.
29695 * rtl-ssa/insns.cc: Include internals.h
29696 (function_info::create_reg_use): New function. Lazily any
29697 degenerate phis needed by the linear RPO view.
29698 (function_info::record_use): Use create_reg_use. When processing
29699 debug uses, use potential_phi_regs and test it before checking
29700 whether the register is live on entry to the current EBB. Lazily
29701 calculate ebb_live_in_for_debug.
29702 (function_info::record_call_clobbers): Update call to record_reg_def.
29703 (function_info::record_def): Likewise.
29705 2021-02-15 Martin Liska <mliska@suse.cz>
29707 * toplev.c (init_asm_output): Free output of
29708 gen_command_line_string function.
29709 (process_options): Likewise.
29711 2021-02-15 Martin Liska <mliska@suse.cz>
29713 * params.opt: Add 2 missing Param keywords.
29715 2021-02-15 Eric Botcazou <ebotcazou@adacore.com>
29717 * df-core.c (df_worklist_dataflow_doublequeue): Use proper cast.
29719 2021-02-15 Jakub Jelinek <jakub@redhat.com>
29721 PR tree-optimization/99079
29722 * match.pd (A % (pow2pcst << N) -> A & ((pow2pcst << N) - 1)): Remove
29723 useless tree_nop_conversion_p (type, TREE_TYPE (@3)) check. Instead
29724 require both type and TREE_TYPE (@1) to be integral types and either
29725 type having smaller or equal precision, or TREE_TYPE (@1) being
29726 unsigned type, or type being signed type. If TREE_TYPE (@1)
29727 doesn't have wrapping overflow, perform the subtraction of one in
29730 2021-02-14 Jan Hubicka <hubicka@ucw.cz>
29731 Richard Biener <rguether@suse.de>
29734 * ipa-reference.c (ipa_init): Only conditinally initialize
29735 reference_vars_to_consider.
29736 (propagate): Conditionally deninitialize reference_vars_to_consider.
29737 (ipa_reference_write_optimization_summary): Sanity check that
29738 reference_vars_to_consider is not allocated.
29740 2021-02-13 Levy Hsu <admin@levyhsu.com>
29743 * config/riscv/riscv-shorten-memrefs.c (pass_shorten_memrefs): Add
29744 extend parameter to get_si_mem_base_reg declaration.
29745 (get_si_mem_base_reg): Add extend parameter. Set it.
29746 (analyze): Pass extend arg to get_si_mem_base_reg.
29747 (transform): Likewise. Use it when rewriting mems.
29748 * config/riscv/riscv.c (riscv_legitimize_move): Check for subword
29749 loads and emit sign/zero extending load followed by subreg move.
29751 2021-02-13 Jim Wilson <jimw@sifive.com>
29754 * config/riscv/riscv.c (riscv_compressed_lw_address_p): Drop early
29755 exit when !reload_completed. Only perform check for compressed reg
29756 if reload_completed.
29757 (riscv_rtx_costs): In MEM case, when optimizing for size and
29758 shorten memrefs, if not compressible, then increase cost.
29760 2021-02-13 Jakub Jelinek <jakub@redhat.com>
29762 PR rtl-optimization/98439
29763 * recog.c (pass_split_before_regstack::gate): Enable even when
29764 pass_split_before_sched2 is enabled if -fselective-scheduling2 is
29767 2021-02-13 Jakub Jelinek <jakub@redhat.com>
29770 * config/i386/mmx.md (*mmx_pshufd_1): Add a combine splitter for
29771 swap of V2SImode elements in memory into DImode memory rotate by 32.
29773 2021-02-12 Martin Sebor <msebor@redhat.com>
29775 * tree-pretty-print.c (print_generic_expr_to_str): Update comment.
29777 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
29779 * rtl-ssa/accesses.cc (function_info::make_use_available): Use
29780 m_temp_obstack rather than m_obstack to allocate the temporary use.
29782 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
29784 * df-problems.c (df_lr_bb_local_compute): Treat partial definitions
29785 as read-modify operations.
29787 2021-02-12 Richard Biener <rguenther@suse.de>
29789 PR middle-end/38474
29790 * ipa-fnsummary.c (unmodified_parm_1): Only walk when
29791 fbi->aa_walk_budget is bigger than zero. Update
29792 fbi->aa_walk_budget.
29793 (param_change_prob): Likewise.
29794 * ipa-prop.c (detect_type_change_from_memory_writes):
29795 Properly account walk_aliased_vdefs.
29796 (parm_preserved_before_stmt_p): Canonicalize updates.
29797 (parm_ref_data_preserved_p): Likewise.
29798 (parm_ref_data_pass_through_p): Likewise.
29799 (determine_known_aggregate_parts): Account own alias queries.
29801 2021-02-12 Martin Liska <mliska@suse.cz>
29803 * opts-common.c (decode_cmdline_option): Release werror_arg.
29804 * opts.c (gen_producer_string): Release output of
29805 gen_command_line_string.
29807 2021-02-12 Richard Biener <rguenther@suse.de>
29809 PR tree-optimization/38474
29810 * params.opt (-param=max-store-chains-to-track=): New param.
29811 (-param=max-stores-to-track=): Likewise.
29812 * doc/invoke.texi (max-store-chains-to-track): Document.
29813 (max-stores-to-track): Likewise.
29814 * gimple-ssa-store-merging.c (pass_store_merging::m_n_chains):
29816 (pass_store_merging::m_n_stores): Likewise.
29817 (pass_store_merging::terminate_and_process_chain): Update
29818 m_n_stores and m_n_chains.
29819 (pass_store_merging::process_store): Likewise. Terminate
29820 oldest chains if the number of stores or chains get too large.
29821 (imm_store_chain_info::terminate_and_process_chain): Dump
29824 2021-02-11 Eric Botcazou <ebotcazou@adacore.com>
29826 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
29827 the cold section, emit a nop before the directive if the previous
29828 active instruction can throw.
29830 2021-02-11 Peter Bergner <bergner@linux.ibm.com>
29833 * config/rs6000/predicates.md (mma_assemble_input_operand): Restrict
29834 memory addresses that are legal for quad word accesses.
29836 2021-02-11 Andrea Corallo <andrea.corallo@arm.com>
29839 * config/arm/thumb2.md (*doloop_end_internal): Generate
29840 alternative sequence to handle long range branches.
29842 2021-02-11 Joel Hutton <joel.hutton@arm.com>
29844 PR tree-optimization/98772
29845 * optabs-tree.c (supportable_half_widening_operation): New function
29846 to check for supportable V8QI->V8HI widening patterns.
29847 * optabs-tree.h (supportable_half_widening_operation): New function.
29848 * tree-vect-stmts.c (vect_create_half_widening_stmts): New function
29849 to create promotion stmts for V8QI->V8HI widening patterns.
29850 (vectorizable_conversion): Add case for V8QI->V8HI.
29852 2021-02-11 Richard Biener <rguenther@suse.de>
29854 * sparseset.h (SPARSESET_ELT_BITS): Remove.
29855 (SPARSESET_ELT_TYPE): Use unsigned int.
29856 * fwprop.c: Do not include sparseset.h.
29858 2021-02-10 Jakub Jelinek <jakub@redhat.com>
29861 * varasm.c (declare_weak): For -fsyntax-only, allow even
29862 TREE_ASM_WRITTEN function decls.
29864 2021-02-10 Jakub Jelinek <jakub@redhat.com>
29867 * config/i386/sse.md (fix<fixunssuffix>_truncv2sfv2di2,
29868 <insn>v8qiv8hi2, <insn>v8qiv8si2, <insn>v4qiv4si2, <insn>v4hiv4si2,
29869 <insn>v8qiv8di2, <insn>v4qiv4di2, <insn>v2qiv2di2, <insn>v4hiv4di2,
29870 <insn>v2hiv2di2, <insn>v2siv2di2): Force operands[1] into REG before
29871 calling simplify_gen_subreg on it.
29873 2021-02-10 Martin Liska <mliska@suse.cz>
29875 * config/nvptx/nvptx.c (nvptx_option_override): Use
29876 flag_patchable_function_entry instead of the removed
29877 function_entry_patch_area_size.
29879 2021-02-10 Martin Liska <mliska@suse.cz>
29881 PR tree-optimization/99002
29882 PR tree-optimization/99026
29883 * gimple-if-to-switch.cc (if_chain::is_beneficial): Fix memory
29884 leak when adjacent cases are merged.
29885 * tree-switch-conversion.c (switch_decision_tree::analyze_switch_statement): Use
29887 (make_pass_lower_switch): Remove trailing whitespace.
29888 * tree-switch-conversion.h (release_clusters): New.
29890 2021-02-10 Richard Biener <rguenther@suse.de>
29892 PR rtl-optimization/99054
29893 * cfgrtl.c (rtl-optimization/99054): Return an auto_vec.
29894 (fixup_partitions): Adjust.
29895 (rtl_verify_edges): Likewise.
29897 2021-02-10 Jakub Jelinek <jakub@redhat.com>
29899 PR middle-end/99007
29900 * gimplify.c (gimplify_scan_omp_clauses): For MEM_REF on reductions,
29901 temporarily disable gimplify_ctxp->into_ssa around gimplify_expr
29904 2021-02-10 Richard Biener <rguenther@suse.de>
29907 * ipa-pure-const.c (propagate_malloc): Use an auto_vec<>
29910 2021-02-10 Richard Biener <rguenther@suse.de>
29912 PR tree-optimization/99024
29913 * tree-vect-loop.c (_loop_vec_info::~_loop_vec_info): Only
29914 clear loop->aux if it is associated with the destroyed loop_vinfo.
29916 2021-02-10 Martin Liska <mliska@suse.cz>
29918 PR tree-optimization/99002
29919 * gimple-if-to-switch.cc (find_conditions): Fix memory leak
29922 2021-02-10 Martin Liska <mliska@suse.cz>
29925 * ipa-icf.c (sem_item::add_reference): Fix memory leak when
29926 a reference exists.
29928 2021-02-10 Jakub Jelinek <jakub@redhat.com>
29931 * dwarf2out.c (prune_unused_types_walk): Mark DW_TAG_variable DIEs
29932 at class scope for DWARF5+.
29934 2021-02-09 Eric Botcazou <ebotcazou@adacore.com>
29936 PR rtl-optimization/96015
29937 * reorg.c (skip_consecutive_labels): Minor comment tweaks.
29938 (relax_delay_slots): When deleting a jump to the next active
29939 instruction over a barrier, first delete the barrier if the
29940 jump is the only way to reach the target label.
29942 2021-02-09 Andre Vieira <andre.simoesdiasvieira@arm.com>
29944 * config/aarch64/aarch64-cost-tables.h: Add entries for vect.mul.
29945 * config/aarch64/aarch64.c (aarch64_rtx_mult_cost): Use vect.mul for
29946 vector multiplies and vect.alu for SSRA.
29947 * config/arm/aarch-common-protos.h (struct vector_cost_table): Define
29948 vect.mul cost field.
29949 * config/arm/aarch-cost-tables.h: Add entries for vect.mul.
29950 * config/arm/arm.c: Likewise.
29952 2021-02-09 Richard Biener <rguenther@suse.de>
29954 PR tree-optimization/98863
29955 * tree-ssa-sccvn.h (vn_avail::next_undo): Add.
29956 * tree-ssa-sccvn.c (last_pushed_avail): New global.
29957 (rpo_elim::eliminate_push_avail): Chain pushed avails.
29958 (unwind_state::avail_top): Add.
29959 (do_unwind): Rewrite unwinding of avail entries.
29960 (do_rpo_vn): Initialize last_pushed_avail and
29961 avail_top of the undo state.
29963 2021-02-09 Jakub Jelinek <jakub@redhat.com>
29965 PR middle-end/99004
29966 * calls.c (maybe_warn_rdwr_sizes): Change s0 and s1 type from
29967 const char * to char * and free those pointers after use.
29969 2021-02-09 Richard Biener <rguenther@suse.de>
29971 PR tree-optimization/99017
29972 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Allow
29973 zero vector cost entries.
29975 2021-02-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
29977 PR middle-end/98974
29978 * tree-vect-stmts.c (vectorizable_condition): Remove shadow vec_num
29979 parameter in vectorizable_condition.
29981 2021-02-08 Richard Biener <rguenther@suse.de>
29984 * tree.c (walk_tree_1): Walk VECTOR_CST elements.
29986 2021-02-08 Martin Liska <mliska@suse.cz>
29989 * cfgexpand.c (pass_expand::execute): Parse per-function option
29990 flag_patchable_function_entry and use it.
29991 * common.opt: Remove function_entry_patch_area_size and
29992 function_entry_patch_area_start global variables.
29993 * opts.c (parse_and_check_patch_area): New function.
29994 (common_handle_option): Use it.
29995 * opts.h (parse_and_check_patch_area): New function.
29996 * toplev.c (process_options): Parse and use
29997 function_entry_patch_area_size.
29999 2021-02-08 Martin Sebor <msebor@redhat.com>
30001 * doc/extend.texi (attribute malloc): Correct typos.
30003 2021-02-05 Nathan Sidwell <nathan@acm.org>
30006 * gcc.c (driver::maybe_run_linker): Check for input file
30007 accessibility if not linking.
30009 2021-02-05 Richard Biener <rguenther@suse.de>
30011 PR tree-optimization/98855
30012 * tree-vectorizer.h (add_stmt_cost): New overload.
30013 * tree-vect-slp.c (li_cost_vec_cmp): New.
30014 (vect_bb_slp_scalar_cost): Cost individual loop regions
30015 separately. Account for the scalar instance root stmt.
30017 2021-02-05 Tom de Vries <tdevries@suse.de>
30020 * tree-switch-conversion.c (jump_table_cluster::emit): Add loc
30022 (bit_test_cluster::emit): Reuse location_t for newly created
30024 (switch_decision_tree::try_switch_expansion): Preserve
30026 * tree-switch-conversion.h: Change function signatures.
30028 2021-02-05 Jakub Jelinek <jakub@redhat.com>
30031 * config/i386/i386-options.c (m_NONE, m_ALL): Define.
30032 * config/i386/x86-tune.def (X86_TUNE_BRANCH_PREDICTION_HINTS,
30033 X86_TUNE_PROMOTE_QI_REGS): Use m_NONE instead of 0U.
30034 (X86_TUNE_QIMODE_MATH): Use m_ALL instead of ~0U.
30036 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30038 * config/aarch64/aarch64-simd-builtins.def (get_high): Define builtin.
30039 * config/aarch64/aarch64-simd.md (aarch64_get_high<mode>): Define.
30040 * config/aarch64/arm_neon.h (__GET_HIGH): Delete.
30041 (vget_high_f16): Reimplement using new builtin.
30042 (vget_high_f32): Likewise.
30043 (vget_high_f64): Likewise.
30044 (vget_high_p8): Likewise.
30045 (vget_high_p16): Likewise.
30046 (vget_high_p64): Likewise.
30047 (vget_high_s8): Likewise.
30048 (vget_high_s16): Likewise.
30049 (vget_high_s32): Likewise.
30050 (vget_high_s64): Likewise.
30051 (vget_high_u8): Likewise.
30052 (vget_high_u16): Likewise.
30053 (vget_high_u32): Likewise.
30054 (vget_high_u64): Likewise.
30056 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30058 * config/aarch64/aarch64-simd-builtins.def (get_low): Define builtin.
30059 * config/aarch64/aarch64-simd.md (aarch64_get_low<mode>): Define.
30060 * config/aarch64/arm_neon.h (__GET_LOW): Delete.
30061 (vget_low_f16): Reimplement using new builtin.
30062 (vget_low_f32): Likewise.
30063 (vget_low_f64): Likewise.
30064 (vget_low_p8): Likewise.
30065 (vget_low_p16): Likewise.
30066 (vget_low_p64): Likewise.
30067 (vget_low_s8): Likewise.
30068 (vget_low_s16): Likewise.
30069 (vget_low_s32): Likewise.
30070 (vget_low_s64): Likewise.
30071 (vget_low_u8): Likewise.
30072 (vget_low_u16): Likewise.
30073 (vget_low_u32): Likewise.
30074 (vget_low_u64): Likewise.
30076 2021-02-05 Kito Cheng <kito.cheng@sifive.com>
30078 * gcc.c (print_multilib_info): Check all required argument is provided
30081 2021-02-05 liuhongt <hongtao.liu@intel.com>
30084 * config/i386/i386-expand.c (ix86_expand_sse_cmp): Don't
30085 generate integer mask comparison for 128/256-bits vector when
30086 op_true/op_false is NULL_RTX or CONSTM1_RTX/CONST0_RTX. Also
30087 delete redundant !maskcmp condition.
30088 (ix86_expand_int_vec_cmp): Ditto but no redundant deletion
30090 (ix86_expand_sse_movcc): Delete definition of maskcmp, add the
30091 condition directly to if (maskcmp), add extra check for
30092 cmpmode, it should be MODE_INT.
30093 (ix86_expand_fp_vec_cmp): Pass NULL to ix86_expand_sse_cmp's
30094 parameters op_true/op_false.
30095 (ix86_use_mask_cmp_p): New.
30097 2021-02-05 liuhongt <hongtao.liu@intel.com>
30100 * config/i386/x86-tune.def (X86_TUNE_AVX256_UNALIGNED_LOAD_OPTIMAL):
30101 Remove m_GENERIC from ~list.
30102 (X86_TUNE_AVX256_UNALIGNED_STORE_OPTIMAL): Ditto.
30104 2021-02-04 David Malcolm <dmalcolm@redhat.com>
30107 * diagnostic-show-locus.c (compatible_locations_p): Require
30108 locations in the same macro map to be either both from the
30109 macro definition, or both from the macro arguments.
30111 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
30113 * config/aarch64/aarch64-simd-builtins.def: Add
30114 [su]mull_hi_lane[q] builtin generator macros.
30115 * config/aarch64/aarch64-simd.md
30116 (aarch64_<su>mull_hi_lane<mode>_insn): Define.
30117 (aarch64_<su>mull_hi_lane<mode>): Define.
30118 (aarch64_<su>mull_hi_laneq<mode>_insn): Define.
30119 (aarch64_<su>mull_hi_laneq<mode>): Define.
30120 * config/aarch64/arm_neon.h (vmull_high_lane_s16): Use RTL
30121 builtin instead of inline asm.
30122 (vmull_high_lane_s32): Likewise.
30123 (vmull_high_lane_u16): Likewise.
30124 (vmull_high_lane_u32): Likewise.
30125 (vmull_high_laneq_s16): Likewise.
30126 (vmull_high_laneq_s32): Likewise.
30127 (vmull_high_laneq_u16): Likewise.
30128 (vmull_high_laneq_u32): Liekwise.
30130 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
30132 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_hi_n
30133 builtin generator macros.
30134 * config/aarch64/aarch64-simd.md
30135 (aarch64_<su>mull_hi_n<mode>_insn): Define.
30136 (aarch64_<su>mull_hi_n<mode>): Define.
30137 * config/aarch64/arm_neon.h (vmull_high_n_s16): Use RTL builtin
30138 instead of inline asm.
30139 (vmull_high_n_s32): Likewise.
30140 (vmull_high_n_u16): Likewise.
30141 (vmull_high_n_u32): Likewise.
30143 2021-02-04 Richard Biener <rguenther@suse.de>
30145 PR tree-optimization/98855
30146 * tree-vect-loop.c (vectorizable_phi): Do not cost
30147 single-argument PHIs.
30148 * tree-vect-slp.c (vect_bb_slp_scalar_cost): Likewise.
30149 * tree-vect-stmts.c (vectorizable_bswap): Also perform
30150 costing for SLP operation.
30152 2021-02-04 Martin Liska <mliska@suse.cz>
30154 * doc/extend.texi: Mention -mprefer-vector-width in target
30157 2021-02-03 Martin Sebor <msebor@redhat.com>
30159 PR tree-optimization/98937
30160 * tree-ssa-strlen.c (strlen_dom_walker::~strlen_dom_walker): Define.
30161 Flush pointer_query cache.
30163 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
30165 * config/rs6000/genfusion.pl (gen_2logical): Add missing
30166 fixes based on patch review.
30167 * config/rs6000/fusion.md: Regenerate file.
30169 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
30171 * config/rs6000/t-rs6000: Comment out auto generation of
30174 2021-02-03 Andrew Stubbs <ams@codesourcery.com>
30176 * config/gcn/gcn-opts.h (enum processor_type): Add PROCESSOR_GFX908.
30177 * config/gcn/gcn.c (gcn_omp_device_kind_arch_isa): Add gfx908.
30178 (output_file_start): Add gfx908.
30179 * config/gcn/gcn.opt (gpu_type): Add gfx908.
30180 * config/gcn/t-gcn-hsa (MULTILIB_OPTIONS): Add march=gfx908.
30181 (MULTILIB_DIRNAMES): Add gfx908.
30182 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): New define.
30183 (main): Recognize gfx908.
30184 * config/gcn/t-omp-device: Add gfx908.
30186 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30188 * config/aarch64/aarch64-simd-builtins.def: Add
30189 [su]mlsl_hi_lane[q] builtin macro generators.
30190 * config/aarch64/aarch64-simd.md
30191 (aarch64_<su>mlsl_hi_lane<mode>_insn): Define.
30192 (aarch64_<su>mlsl_hi_lane<mode>): Define.
30193 (aarch64_<su>mlsl_hi_laneq<mode>_insn): Define.
30194 (aarch64_<su>mlsl_hi_laneq<mode>): Define.
30195 * config/aarch64/arm_neon.h (vmlsl_high_lane_s16): Use RTL
30196 builtin instead of inline asm.
30197 (vmlsl_high_lane_s32): Likewise.
30198 (vmlsl_high_lane_u16): Likewise.
30199 (vmlsl_high_lane_u32): Likewise.
30200 (vmlsl_high_laneq_s16): Likewise.
30201 (vmlsl_high_laneq_s32): Likewise.
30202 (vmlsl_high_laneq_u16): Likewise.
30203 (vmlsl_high_laneq_u32): Likewise.
30204 (vmlal_high_laneq_u32): Likewise.
30206 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30208 * config/aarch64/aarch64-simd-builtins.def: Add
30209 [su]mlal_hi_lane[q] builtin generator macros.
30210 * config/aarch64/aarch64-simd.md
30211 (aarch64_<su>mlal_hi_lane<mode>_insn): Define.
30212 (aarch64_<su>mlal_hi_lane<mode>): Define.
30213 (aarch64_<su>mlal_hi_laneq<mode>_insn): Define.
30214 (aarch64_<su>mlal_hi_laneq<mode>): Define.
30215 * config/aarch64/arm_neon.h (vmlal_high_lane_s16): Use RTL
30216 builtin instead of inline asm.
30217 (vmlal_high_lane_s32): Likewise.
30218 (vmlal_high_lane_u16): Likewise.
30219 (vmlal_high_lane_u32): Likewise.
30220 (vmlal_high_laneq_s16): Likewise.
30221 (vmlal_high_laneq_s32): Likewise.
30222 (vmlal_high_laneq_u16): Likewise.
30223 (vmlal_high_laneq_u32): Likewise.
30225 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30227 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_hi_n
30228 builtin generator macros.
30229 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_hi_n<mode>_insn):
30231 (aarch64_<su>mlsl_hi_n<mode>): Define.
30232 * config/aarch64/arm_neon.h (vmlsl_high_n_s16): Use RTL builtin
30233 instead of inline asm.
30234 (vmlsl_high_n_s32): Likewise.
30235 (vmlsl_high_n_u16): Likewise.
30236 (vmlsl_high_n_u32): Likewise.
30238 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30240 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_hi_n
30241 builtin generator macros.
30242 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_hi_n<mode>_insn):
30244 (aarch64_<su>mlal_hi_n<mode>): Define.
30245 * config/aarch64/arm_neon.h (vmlal_high_n_s16): Use RTL builtin
30246 instead of inline asm.
30247 (vmlal_high_n_s32): Likewise.
30248 (vmlal_high_n_u16): Likewise.
30249 (vmlal_high_n_u32): Likewise.
30251 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30253 * config/aarch64/aarch64-simd-builtins.def: Add RTL builtin
30255 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal_hi<mode>):
30257 (aarch64_<su>mlal_hi<mode>_insn): This.
30258 (aarch64_<su>mlal_hi<mode>): Define.
30259 * config/aarch64/arm_neon.h (vmlal_high_s8): Use RTL builtin
30260 instead of inline asm.
30261 (vmlal_high_s16): Likewise.
30262 (vmlal_high_s32): Likewise.
30263 (vmlal_high_u8): Likewise.
30264 (vmlal_high_u16): Likewise.
30265 (vmlal_high_u32): Likewise.
30267 2021-02-03 Ilya Leoshkevich <iii@linux.ibm.com>
30269 * lra-spills.c (remove_pseudos): Call lra_update_insn_recog_data()
30270 after calling alter_subreg() on a (mem).
30272 2021-02-03 Martin Liska <mliska@suse.cz>
30275 * lto-streamer-out.c (produce_lto_section): Fill up missing
30277 * lto-streamer.h (struct lto_section): Add _padding field.
30279 2021-02-03 Richard Biener <rguenther@suse.de>
30281 * lto-streamer.c (lto_get_section_name): Free temporary
30283 * tree-loop-distribution.c
30284 (loop_distribution::merge_dep_scc_partitions): Free edge data.
30286 2021-02-03 Jakub Jelinek <jakub@redhat.com>
30288 PR middle-end/97487
30289 * ifcvt.c (noce_can_force_operand): New function.
30290 (noce_emit_move_insn): Use it.
30291 (noce_try_sign_mask): Likewise. Formatting fix.
30293 2021-02-03 Jakub Jelinek <jakub@redhat.com>
30295 PR middle-end/97971
30296 * lra-constraints.c (process_alt_operands): For inline asm, don't call
30297 fatal_insn, but instead return false.
30299 2021-02-03 Jakub Jelinek <jakub@redhat.com>
30301 PR tree-optimization/98287
30302 * config/i386/mmx.md (<insn><mode>3): For shifts don't enable expander
30305 2021-02-03 Tamar Christina <tamar.christina@arm.com>
30307 PR tree-optimization/98928
30308 * tree-vect-loop.c (vect_analyze_loop_2): Change
30309 STMT_VINFO_SLP_VECT_ONLY to STMT_VINFO_SLP_VECT_ONLY_PATTERN.
30310 * tree-vect-slp-patterns.c (complex_pattern::build): Likewise.
30311 * tree-vectorizer.h (STMT_VINFO_SLP_VECT_ONLY_PATTERN): New.
30312 (class _stmt_vec_info): Add slp_vect_pattern_only_p.
30314 2021-02-02 Richard Biener <rguenther@suse.de>
30316 * gimple-loop-interchange.cc (prepare_data_references):
30318 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
30319 * tree-ssa-loop-im.c (hoist_memory_references): Likewise.
30320 * tree-vect-stmts.c (vectorizable_condition): Do not
30322 (vectorizable_comparison): Likewise.
30324 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30326 * config/aarch64/aarch64-simd-builtins.def (ursqrte): Define builtin.
30327 * config/aarch64/aarch64-simd.md (aarch64_ursqrte<mode>): New pattern.
30328 * config/aarch64/arm_neon.h (vrsqrte_u32): Reimplement using builtin.
30329 (vrsqrteq_u32): Likewise.
30331 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30333 * config/aarch64/aarch64-simd-builtins.def (sqxtun2): Define builtin.
30334 * config/aarch64/aarch64-simd.md (aarch64_sqxtun2<mode>_le): Define.
30335 (aarch64_sqxtun2<mode>_be): Likewise.
30336 (aarch64_sqxtun2<mode>): Likewise.
30337 * config/aarch64/arm_neon.h (vqmovun_high_s16): Reimplement using builtin.
30338 (vqmovun_high_s32): Likewise.
30339 (vqmovun_high_s64): Likewise.
30340 * config/aarch64/iterators.md (UNSPEC_SQXTUN2): Define.
30342 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30344 * config/aarch64/aarch64-simd-builtins.def (bfdot_lane, bfdot_laneq): Use
30346 (bfmlalb_lane, bfmlalt_lane, bfmlalb_lane_q, bfmlalt_lane_q): Use FP flags.
30348 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30350 * config/aarch64/aarch64-simd-builtins.def (fcmla_lane0, fcmla_lane90,
30351 fcmla_lane180, fcmla_lane270, fcmlaq_lane0, fcmlaq_lane90, fcmlaq_lane180,
30352 fcmlaq_lane270, scvtf, ucvtf, fcvtzs, fcvtzu, scvtfsi, scvtfdi, ucvtfsi,
30353 ucvtfdi, fcvtzshf, fcvtzuhf, fmlal_lane_low, fmlsl_lane_low,
30354 fmlal_laneq_low, fmlsl_laneq_low, fmlalq_lane_low, fmlslq_lane_low,
30355 fmlalq_laneq_low, fmlslq_laneq_low, fmlal_lane_high, fmlsl_lane_high,
30356 fmlal_laneq_high, fmlsl_laneq_high, fmlalq_lane_high, fmlslq_lane_high,
30357 fmlalq_laneq_high, fmlslq_laneq_high): Use FP flags.
30359 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30361 * config/aarch64/aarch64-builtins.c (FLAG_LOAD): Define.
30362 * config/aarch64/aarch64-simd-builtins.def (ld1x2, ld2, ld3, ld4, ld2r,
30363 ld3r, ld4r, ld1, ld1x3, ld1x4): Use LOAD flags.
30365 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30367 * config/aarch64/aarch64-simd-builtins.def (combine, zip1, zip2,
30368 uzp1, uzp2, trn1, trn2, simd_bsl): Use AUTO_FP flags.
30370 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30372 * config/aarch64/aarch64-simd-builtins.def (clrsb, clz, ctz, popcount,
30373 vec_smult_lane_, vec_smlal_lane_, vec_smult_laneq_, vec_smlal_laneq_,
30374 vec_umult_lane_, vec_umlal_lane_, vec_umult_laneq_, vec_umlal_laneq_,
30375 ashl, sshl, ushl, srshl, urshl, sdot_lane, udot_lane, sdot_laneq,
30376 udot_laneq, usdot_lane, usdot_laneq, sudot_lane, sudot_laneq, ashr,
30377 ashr_simd, lshr, lshr_simd, srshr_n, urshr_n, ssra_n, usra_n, srsra_n,
30378 ursra_n, sshll_n, ushll_n, sshll2_n, ushll2_n, ssri_n, usri_n, ssli_n,
30379 ssli_n, usli_n, bswap, rbit, simd_bsl, eor3q, rax1q, xarq, bcaxq): Use
30380 NONE builtin flags.
30382 2021-02-02 Jakub Jelinek <jakub@redhat.com>
30384 PR tree-optimization/98848
30385 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Punt if
30386 STMT_VINFO_DEF_TYPE (last_stmt_info) is vect_reduction_def.
30388 2021-02-02 Kito Cheng <kito.cheng@sifive.com>
30391 * expr.c: Check mode before calling store_expr.
30393 2021-02-02 Christophe Lyon <christophe.lyon@linaro.org>
30395 * config/arm/iterators.md (supf): Remove VORNQ_S and VORNQ_U.
30397 * config/arm/mve.md (mve_vornq_s<mode>): New entry for vorn
30398 instruction using expression ior.
30399 (mve_vornq_u<mode>): New expander.
30400 (mve_vornq_f<mode>): Use ior code instead of unspec.
30401 * config/arm/unspecs.md (VORNQ_S, VORNQ_U, VORNQ_F): Remove.
30403 2021-02-02 Alexandre Oliva <oliva@adacore.com>
30405 * tree-nested.c (convert_nonlocal_reference_op): Move
30406 current_function_decl restore after re-gimplification.
30407 (convert_local_reference_op): Likewise.
30409 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30411 * config/aarch64/aarch64-simd-builtins.def (rshrn, rshrn2):
30413 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>_insn_le):
30415 (aarch64_rshrn<mode>_insn_be): Likewise.
30416 (aarch64_rshrn<mode>): Likewise.
30417 (aarch64_rshrn2<mode>_insn_le): Likewise.
30418 (aarch64_rshrn2<mode>_insn_be): Likewise.
30419 (aarch64_rshrn2<mode>): Likewise.
30420 * config/aarch64/aarch64.md (unspec): Add UNSPEC_RSHRN.
30421 * config/aarch64/arm_neon.h (vrshrn_high_n_s16): Reimplement
30423 (vrshrn_high_n_s32): Likewise.
30424 (vrshrn_high_n_s64): Likewise.
30425 (vrshrn_high_n_u16): Likewise.
30426 (vrshrn_high_n_u32): Likewise.
30427 (vrshrn_high_n_u64): Likewise.
30428 (vrshrn_n_s16): Likewise.
30429 (vrshrn_n_s32): Likewise.
30430 (vrshrn_n_s64): Likewise.
30431 (vrshrn_n_u16): Likewise.
30432 (vrshrn_n_u32): Likewise.
30433 (vrshrn_n_u64): Likewise.
30435 2021-02-01 Sergei Trofimovich <siarheit@google.com>
30437 PR tree-optimization/98499
30438 * ipa-modref.c (analyze_ssa_name_flags): treat RVO
30439 conservatively and assume all possible side-effects.
30441 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30443 * config/aarch64/aarch64-simd-builtins.def (vec_unpacks_hi,
30444 vec_unpacku_hi_): Define builtins.
30445 * config/aarch64/arm_neon.h (vmovl_high_s8): Reimplement using
30447 (vmovl_high_s16): Likewise.
30448 (vmovl_high_s32): Likewise.
30449 (vmovl_high_u8): Likewise.
30450 (vmovl_high_u16): Likewise.
30451 (vmovl_high_u32): Likewise.
30453 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30455 * config/aarch64/aarch64-simd-builtins.def (sabdl, uabdl):
30457 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl<mode>): New
30459 * config/aarch64/aarch64.md (unspec): Define UNSPEC_SABDL,
30461 * config/aarch64/arm_neon.h (vabdl_s8): Reimplemet using
30463 (vabdl_s16): Likewise.
30464 (vabdl_s32): Likewise.
30465 (vabdl_u8): Likewise.
30466 (vabdl_u16): Likewise.
30467 (vabdl_u32): Likewise.
30468 * config/aarch64/iterators.md (ABDL): New int iterator.
30469 (sur): Handle UNSPEC_SABDL, UNSPEC_UABDL.
30471 2021-02-01 Martin Sebor <msebor@redhat.com>
30473 * tree.h (BLOCK_VARS): Add comment.
30474 (BLOCK_SUBBLOCKS): Same.
30475 (BLOCK_SUPERCONTEXT): Same.
30476 (BLOCK_ABSTRACT_ORIGIN): Same.
30477 (inlined_function_outer_scope_p): Same.
30479 2021-02-01 Martin Sebor <msebor@redhat.com>
30481 PR middle-end/97172
30482 * attribs.c (attr_access::free_lang_data): Define new function.
30483 * attribs.h (attr_access::free_lang_data): Declare new function.
30485 2021-02-01 Richard Biener <rguenther@suse.de>
30487 * vec.h (auto_vec::auto_vec): Add memory stat parameters
30489 * bitmap.h (auto_bitmap::auto_bitmap): Likewise.
30491 2021-02-01 Tamar Christina <tamar.christina@arm.com>
30493 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>,
30494 aarch64_<su>mlsl<mode>, aarch64_<su>mlsl_n<mode>): Flip mult operands.
30496 2021-02-01 Richard Biener <rguenther@suse.de>
30498 PR rtl-optimization/98863
30499 * config/i386/i386-features.c (convert_scalars_to_vector):
30500 Set DF_RD_PRUNE_DEAD_DEFS.
30502 2021-01-31 Eric Botcazou <ebotcazou@adacore.com>
30504 * system.h (SIZE_MAX): Define if not already defined.
30506 2021-01-30 Aaron Sawdey <acsawdey@linux.ibm.com>
30508 * config/rs6000/genfusion.pl (gen_2logical): New function to
30509 generate patterns for logical-logical fusion.
30510 * config/rs6000/fusion.md: Regenerated patterns.
30511 * config/rs6000/rs6000-cpus.def: Add
30512 OPTION_MASK_P10_FUSION_2LOGICAL.
30513 * config/rs6000/rs6000.c (rs6000_option_override_internal):
30514 Enable logical-logical fusion for p10.
30515 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2logical.
30517 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
30519 * config/rs6000/rs6000.opt: Add periods to new AIX options.
30521 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
30523 * config/rs6000/rs6000.opt (mabi=vec-extabi): New.
30524 (mabi=vec-default): New.
30525 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
30526 __EXTABI__ for AIX Vector extended ABI.
30527 * config/rs6000/rs6000.c (rs6000_debug_reg_global): Print AIX Vector
30529 (conditional_register_usage): If AIX vec_extabi enabled, vs20-vs31
30531 * doc/invoke.texi (PowerPC mabi): Add AIX vec-extabi and vec-default.
30533 2021-01-30 Jakub Jelinek <jakub@redhat.com>
30535 * config/i386/i386-features.c (remove_partial_avx_dependency): Clear
30536 DF_DEFER_INSN_RESCAN after calling df_process_deferred_rescans.
30538 2021-01-29 Vladimir N. Makarov <vmakarov@redhat.com>
30541 * lra-constraints.c (in_class_p): Don't narrow class only for REG
30544 2021-01-29 Will Schmidt <will_schmidt@vnet.ibm.com>
30546 * config/rs6000/rs6000-call.c (rs6000_expand_binup_builtin): Add
30547 clauses for CODE_FOR_vsx_xvcvuxddp_scale and
30548 CODE_FOR_vsx_xvcvsxddp_scale to the parameter checking code.
30550 2021-01-29 Andrew MacLeod <amacleod@redhat.com>
30552 PR tree-optimization/98866
30553 * gimple-range-gori.h (gori_compute:set_range_invariant): New.
30554 * gimple-range-gori.cc (gori_map::set_range_invariant): New.
30555 (gori_map::m_maybe_invariant): Rename from all_outgoing.
30556 (gori_map::gori_map): Rename all_outgoing to m_maybe_invariant.
30557 (gori_map::is_export_p): Ditto.
30558 (gori_map::calculate_gori): Ditto.
30559 (gori_compute::set_range_invariant): New.
30560 * gimple-range.cc (gimple_ranger::range_of_stmt): Set range
30561 invariant for pointers evaluating to [1, +INF].
30563 2021-01-29 Richard Biener <rguenther@suse.de>
30565 PR rtl-optimization/98863
30566 * config/i386/i386-features.c (remove_partial_avx_dependency):
30567 Do not perform DF analysis.
30568 (pass_data_remove_partial_avx_dependency): Remove
30571 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
30573 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_n
30574 builtin generator macros.
30575 * config/aarch64/aarch64-simd.md (aarch64_<su>mull_n<mode>):
30577 * config/aarch64/arm_neon.h (vmull_n_s16): Use RTL builtin
30578 instead of inline asm.
30579 (vmull_n_s32): Likewise.
30580 (vmull_n_u16): Likewise.
30581 (vmull_n_u32): Likewise.
30583 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30585 * config/aarch64/aarch64-simd-builtins.def (sabdl2, uabdl2):
30587 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl2<mode>_3):
30589 (aarch64_<sur>abdl2<mode>): ... This.
30590 (<sur>sadv16qi): Adjust use of above.
30591 * config/aarch64/arm_neon.h (vabdl_high_s8): Reimplement using
30593 (vabdl_high_s16): Likewise.
30594 (vabdl_high_s32): Likewise.
30595 (vabdl_high_u8): Likewise.
30596 (vabdl_high_u16): Likewise.
30597 (vabdl_high_u32): Likewise.
30599 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30601 * config/aarch64/aarch64-simd-builtins.def (sabal2): Define
30603 (uabal2): Likewise.
30604 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal2<mode>): New
30606 * config/aarch64/aarch64.md (unspec): Add UNSPEC_SABAL2 and
30608 * config/aarch64/arm_neon.h (vabal_high_s8): Reimplement using
30610 (vabal_high_s16): Likewise.
30611 (vabal_high_s32): Likewise.
30612 (vabal_high_u8): Likewise.
30613 (vabal_high_u16): Likewise.
30614 (vabal_high_u32): Likewise.
30615 * config/aarch64/iterators.md (ABAL2): New mode iterator.
30616 (sur): Handle UNSPEC_SABAL2, UNSPEC_UABAL2.
30618 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30620 * config/aarch64/aarch64-simd-builtins.def (sabal): Define
30623 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal<mode>_4):
30625 (aarch64_<sur>abal<mode>): ... This
30626 (<sur>sadv16qi): Adust use of the above.
30627 * config/aarch64/arm_neon.h (vabal_s8): Reimplement using
30629 (vabal_s16): Likewise.
30630 (vabal_s32): Likewise.
30631 (vabal_u8): Likewise.
30632 (vabal_u16): Likewise.
30633 (vabal_u32): Likewise.
30635 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30637 * config/aarch64/aarch64-simd-builtins.def (saddlv, uaddlv):
30639 * config/aarch64/aarch64-simd.md (aarch64_<su>addlv<mode>):
30641 * config/aarch64/arm_neon.h (vaddlv_s8): Reimplement using
30643 (vaddlv_s16): Likewise.
30644 (vaddlv_u8): Likewise.
30645 (vaddlv_u16): Likewise.
30646 (vaddlvq_s8): Likewise.
30647 (vaddlvq_s16): Likewise.
30648 (vaddlvq_s32): Likewise.
30649 (vaddlvq_u8): Likewise.
30650 (vaddlvq_u16): Likewise.
30651 (vaddlvq_u32): Likewise.
30652 (vaddlv_s32): Likewise.
30653 (vaddlv_u32): Likewise.
30654 * config/aarch64/iterators.md (VDQV_L): New mode iterator.
30655 (unspec): Add UNSPEC_SADDLV, UNSPEC_UADDLV.
30656 (Vwstype): New mode attribute.
30658 (VWIDE_S): Likewise.
30659 (USADDLV): New int iterator.
30660 (su): Handle UNSPEC_SADDLV, UNSPEC_UADDLV.
30662 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
30664 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_lane[q]
30665 builtin generator macros.
30666 * config/aarch64/aarch64-simd.md (aarch64_vec_<su>mlsl_lane<Qlane>):
30668 * config/aarch64/arm_neon.h (vmlsl_lane_s16): Use RTL builtin
30669 instead of inline asm.
30670 (vmlsl_lane_s32): Likewise.
30671 (vmlsl_lane_u16): Likewise.
30672 (vmlsl_lane_u32): Likewise.
30673 (vmlsl_laneq_s16): Likewise.
30674 (vmlsl_laneq_s32): Likewise.
30675 (vmlsl_laneq_u16): Likewise.
30676 (vmlsl_laneq_u32): Likewise.
30678 2021-01-29 Richard Biener <rguenther@suse.de>
30680 * doc/invoke.texi (--param max-gcse-memory): Document unit
30682 * gcse.c (gcse_or_cprop_is_too_expensive): Adjust.
30683 * params.opt (--param max-gcse-memory): Adjust default and
30684 document unit of size.
30686 2021-01-29 Richard Biener <rguenther@suse.de>
30688 PR rtl-optimization/98863
30689 * gcse.c (gcse_or_cprop_is_too_expensive): Use unsigned
30690 HOST_WIDE_INT for the memory estimate.
30692 2021-01-29 Bin Cheng <bin.cheng@linux.alibaba.com>
30693 Richard Biener <rguenther@suse.de>
30695 PR tree-optimization/97627
30696 * tree-ssa-loop-niter.c (number_of_iterations_exit_assumptions):
30697 Do not analyze fake edges.
30699 2021-01-29 Richard Biener <rguenther@suse.de>
30701 PR rtl-optimization/98144
30702 * df.h (df_mir_bb_info): Add con_visited member.
30703 * df-problems.c (df_mir_alloc): Initialize con_visited,
30704 do not fully populate IN and OUT.
30705 (df_mir_reset): Likewise.
30706 (df_mir_confluence_0): Set con_visited.
30707 (df_mir_confluence_n): Properly handle implicitely
30708 fully populated IN and OUT as designated by con_visited
30709 and update con_visited accordingly.
30711 2021-01-29 Jakub Jelinek <jakub@redhat.com>
30714 * config/arm/vec-common.md (mve_vshlq_<supf><mode>,
30715 vashl<mode>3, vashr<mode>3, vlshr<mode>3): Add
30716 && !TARGET_REALLY_IWMMXT to conditions.
30718 2021-01-29 Jakub Jelinek <jakub@redhat.com>
30721 * cfgbuild.c (find_bb_boundaries): Reset debug_insn when seeing
30724 2021-01-28 Marek Polacek <polacek@redhat.com>
30727 * stor-layout.c (finalize_type_size): If we reset TYPE_USER_ALIGN in
30728 the main variant, maybe reset it in its variants too.
30729 * tree.c (check_base_type): Return true only if TYPE_USER_ALIGN match.
30730 (check_aligned_type): Check if TYPE_USER_ALIGN match.
30732 2021-01-28 Christophe Lyon <christophe.lyon@linaro.org>
30735 * config/arm/arm.c (arm_rtx_costs_internal): Adjust cost of vector
30736 of constant zero for comparisons.
30738 2021-01-28 Michael Meissner <meissner@linux.ibm.com>
30740 * config/rs6000/rs6000.c (rs6000_mangle_decl_assembler_name): Add
30741 support for mapping built-in function names for long double
30742 built-in functions if long double is IEEE 128-bit.
30744 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
30746 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_n
30747 builtin generator macros.
30748 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_n<mode>):
30750 * config/aarch64/arm_neon.h (vmlsl_n_s16): Use RTL builtin
30751 instead of inline asm.
30752 (vmlsl_n_s32): Likewise.
30753 (vmlsl_n_u16): Likewise.
30754 (vmlsl_n_u32): Likewise.
30756 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
30758 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_n
30759 builtin generator macros.
30760 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>):
30762 * config/aarch64/arm_neon.h (vmlal_n_s16): Use RTL builtin
30763 instead of inline asm.
30764 (vmlal_n_s32): Likewise.
30765 (vmlal_n_u16): Likewise.
30766 (vmlal_n_u32): Likewise.
30768 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30770 * config/aarch64/aarch64-simd-builtins.def (shrn2): Define
30772 * config/aarch64/aarch64-simd.md (aarch64_shrn2<mode>_insn_le):
30774 (aarch64_shrn2<mode>_insn_be): Likewise.
30775 (aarch64_shrn2<mode>): Likewise.
30776 * config/aarch64/arm_neon.h (vshrn_high_n_s16): Reimlplement
30778 (vshrn_high_n_s32): Likewise.
30779 (vshrn_high_n_s64): Likewise.
30780 (vshrn_high_n_u16): Likewise.
30781 (vshrn_high_n_u32): Likewise.
30782 (vshrn_high_n_u64): Likewise.
30784 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30786 * config/aarch64/aarch64-simd-builtins.def (shrn): Define
30788 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le):
30790 (aarch64_shrn<mode>_insn_be): Likewise.
30791 (aarch64_shrn<mode>): Likewise.
30792 * config/aarch64/arm_neon.h (vshrn_n_s16): Reimplement using
30794 (vshrn_n_s32): Likewise.
30795 (vshrn_n_s64): Likewise.
30796 (vshrn_n_u16): Likewise.
30797 (vshrn_n_u32): Likewise.
30798 (vshrn_n_u64): Likewise.
30799 * config/aarch64/iterators.md (vn_mode): New mode attribute.
30801 2021-01-28 Richard Biener <rguenther@suse.de>
30803 PR rtl-optimization/80960
30804 * dse.c (check_mem_read_rtx): Call get_addr on the
30807 2021-01-28 Xionghu Luo <luoxhu@linux.ibm.com>
30808 David Edelsohn <dje.gcc@gmail.com>
30811 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
30812 Don't generate VIEW_CONVERT_EXPR for fcode ALTIVEC_BUILTIN_VEC_INSERT
30814 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
30816 * config/rs6000/rs6000.c (rs6000_expand_vector_set): Remove the
30817 wrapper call rs6000_expand_vector_set_var for cleanup. Call
30818 rs6000_expand_vector_set_var_p9 and rs6000_expand_vector_set_var_p8
30820 (rs6000_expand_vector_set_var): Delete.
30821 (rs6000_expand_vector_set_var_p9): Make static.
30822 (rs6000_expand_vector_set_var_p8): Make static.
30824 2021-01-28 Xing GUO <higuoxing@gmail.com>
30826 * common/config/riscv/riscv-common.c
30827 (riscv_subset_list::parsing_subset_version): Fix -march option parsing
30828 when `p` extension exists.
30830 2021-01-27 Vladimir N. Makarov <vmakarov@redhat.com>
30832 PR rtl-optimization/97684
30833 * ira.c (ira): Call ira_set_pseudo_classes before
30834 update_equiv_regs when it is necessary.
30836 2021-01-27 Jakub Jelinek <jakub@redhat.com>
30839 * config/aarch64/aarch64.md (*aarch64_bfxilsi_uxtw): Use
30840 %w0, %w1 and %2 instead of %0, %1 and %2.
30842 2021-01-27 Aaron Sawdey <acsawdey@linux.ibm.com>
30844 * config/rs6000/genfusion.pl: New script to generate
30845 define_insn_and_split patterns so combine can arrange fused
30846 instructions next to each other.
30847 * config/rs6000/fusion.md: New file, generated fused instruction
30848 patterns for combine.
30849 * config/rs6000/predicates.md (const_m1_to_1_operand): New predicate.
30850 (non_update_memory_operand): New predicate.
30851 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_P10_FUSION and
30852 OPTION_MASK_P10_FUSION_LD_CMPI to ISA_3_1_MASKS_SERVER and
30854 * config/rs6000/rs6000-protos.h (address_is_non_pfx_d_or_x): Add
30856 * config/rs6000/rs6000.c (rs6000_option_override_internal):
30857 Automatically set OPTION_MASK_P10_FUSION and
30858 OPTION_MASK_P10_FUSION_LD_CMPI if target is power10.
30859 (rs600_opt_masks): Allow -mpower10-fusion
30860 in function attributes.
30861 (address_is_non_pfx_d_or_x): New function.
30862 * config/rs6000/rs6000.h: Add MASK_P10_FUSION.
30863 * config/rs6000/rs6000.md: Include fusion.md.
30864 * config/rs6000/rs6000.opt: Add -mpower10-fusion
30865 and -mpower10-fusion-ld-cmpi.
30866 * config/rs6000/t-rs6000: Add dependencies involving fusion.md.
30868 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
30870 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal
30871 builtin generator macros.
30872 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal<mode>):
30874 (aarch64_<su>mlal<mode>): This.
30875 * config/aarch64/arm_neon.h (vmlal_s8): Use RTL builtin
30876 instead of inline asm.
30877 (vmlal_s16): Likewise.
30878 (vmlal_s32): Likewise.
30879 (vmlal_u8): Likewise.
30880 (vmlal_u16): Likewise.
30881 (vmlal_u32): Likewise.
30883 2021-01-27 Richard Biener <rguenther@suse.de>
30885 PR tree-optimization/98854
30886 * tree-vect-slp.c (vect_build_slp_tree_2): Also build
30887 PHIs from scalars when the number of CTORs matches the
30888 number of children.
30890 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
30892 * config/aarch64/aarch64-simd-builtins.def: Add mls_n builtin
30894 * config/aarch64/aarch64-simd.md (*aarch64_mls_elt_merge<mode>):
30896 (aarch64_mls_n<mode>): This.
30897 * config/aarch64/arm_neon.h (vmls_n_s16): Use RTL builtin
30899 (vmls_n_s32): Likewise.
30900 (vmls_n_u16): Likewise.
30901 (vmls_n_u32): Likewise.
30902 (vmlsq_n_s16): Likewise.
30903 (vmlsq_n_s32): Likewise.
30904 (vmlsq_n_u16): Likewise.
30905 (vmlsq_n_u32): Likewise.
30907 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
30909 * config/aarch64/aarch64-simd-builtins.def: Add mls builtin
30911 * config/aarch64/arm_neon.h (vmls_s8): Use RTL builtin rather
30913 (vmls_s16): Likewise.
30914 (vmls_s32): Likewise.
30915 (vmls_u8): Likewise.
30916 (vmls_u16): Likewise.
30917 (vmls_u32): Likewise.
30918 (vmlsq_s8): Likewise.
30919 (vmlsq_s16): Likewise.
30920 (vmlsq_s32): Likewise.
30921 (vmlsq_u8): Likewise.
30922 (vmlsq_u16): Likewise.
30923 (vmlsq_u32): Likewise.
30925 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
30927 * config/aarch64/aarch64-simd-builtins.def: Add mla_n builtin
30929 * config/aarch64/aarch64-simd.md (*aarch64_mla_elt_merge<mode>):
30931 (aarch64_mla_n<mode>): This.
30932 * config/aarch64/arm_neon.h (vmla_n_s16): Use RTL builtin
30934 (vmla_n_s32): Likewise.
30935 (vmla_n_u16): Likewise.
30936 (vmla_n_u32): Likewise.
30937 (vmlaq_n_s16): Likewise.
30938 (vmlaq_n_s32): Likewise.
30939 (vmlaq_n_u16): Likewise.
30940 (vmlaq_n_u32): Likewise.
30942 2021-01-27 liuhongt <hongtao.liu@intel.com>
30945 * config/i386/sse.md (sse2_gt<mode>3): Drop !TARGET_XOP in condition.
30946 (*sse2_eq<mode>3): Ditto.
30948 2021-01-27 Jakub Jelinek <jakub@redhat.com>
30950 * tree-pass.h (PROP_trees): Rename to ...
30951 (PROP_gimple): ... this.
30952 * cfgexpand.c (pass_data_expand): Replace PROP_trees with PROP_gimple.
30953 * passes.c (execute_function_dump, execute_function_todo,
30954 execute_one_ipa_transform_pass, execute_one_pass): Likewise.
30955 * varpool.c (ctor_for_folding): Likewise.
30957 2021-01-27 Jakub Jelinek <jakub@redhat.com>
30959 PR tree-optimization/97260
30960 * varpool.c: Include tree-pass.h.
30961 (ctor_for_folding): In GENERIC return DECL_INITIAL for TREE_READONLY
30962 non-TREE_SIDE_EFFECTS automatic variables.
30964 2021-01-26 Paul Fee <paul.f.fee@gmail.com>
30966 * doc/cpp.texi (__cplusplus): Document value for -std=c++23
30968 * doc/invoke.texi: Document -std=c++23 and -std=gnu++23.
30969 * dwarf2out.c (highest_c_language): Recognise C++20 and C++23.
30970 (gen_compile_unit_die): Recognise C++23.
30972 2021-01-26 Jakub Jelinek <jakub@redhat.com>
30975 * dwarf2asm.c (dw2_assemble_integer): Cast DWARF2_ADDR_SIZE to int
30978 2021-01-26 Jakub Jelinek <jakub@redhat.com>
30981 * config/aarch64/aarch64.c (aarch64_mask_and_shift_for_ubfiz_p):
30982 Use UINTVAL (shft_amnt) and UINTVAL (mask) instead of INTVAL (shft_amnt)
30983 and INTVAL (mask). Add && INTVAL (mask) > 0 condition.
30985 2021-01-26 Richard Biener <rguenther@suse.de>
30987 * gimple-pretty-print.c (dump_binary_rhs): Handle
30988 VEC_WIDEN_{PLUS,MINUS}_{LO,HI}_EXPR.
30990 2021-01-26 Richard Biener <rguenther@suse.de>
30992 PR middle-end/98726
30993 * tree.h (vector_cst_int_elt): Remove.
30994 * tree.c (vector_cst_int_elt): Use poly_wide_int for computations,
30997 2021-01-26 Andrew Stubbs <ams@codesourcery.com>
30999 * config/gcn/gcn.c (gcn_expand_reduc_scalar): Use move instructions
31000 for V64DFmode min/max reductions.
31002 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31004 * dwarf2asm.c (dw2_assemble_integer): Handle size twice as large
31005 as DWARF2_ADDR_SIZE if x is not a scalar int by emitting it as
31006 two halves, one with x and the other with const0_rtx, ordered
31007 depending on endianity.
31009 2021-01-26 Alexandre Oliva <oliva@adacore.com>
31011 * gimplify.c (gimplify_decl_expr): Skip asan marking calls for
31012 temporaries not seen in binding block, and not about to be
31013 added as gimple variables.
31015 2021-01-25 Martin Sebor <msebor@redhat.com>
31018 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Adjust warning text.
31020 2021-01-25 Martin Liska <mliska@suse.cz>
31022 * value-prof.c (get_nth_most_common_value): Use %s instead
31025 2021-01-25 Jakub Jelinek <jakub@redhat.com>
31028 * configure.ac (HAVE_AS_GDWARF_5_DEBUG_FLAG): Only define if
31029 readelf -wi is able to read the emitted .debug_info back.
31030 * configure: Regenerated.
31032 2021-01-25 Martin Liska <mliska@suse.cz>
31034 PR gcov-profile/98739
31035 * common.opt: Add missing sign symbol.
31036 * value-prof.c (get_nth_most_common_value): Restore handling
31037 of PROFILE_REPRODUCIBILITY_PARALLEL_RUNS and
31038 PROFILE_REPRODUCIBILITY_MULTITHREADED.
31040 2021-01-25 Richard Biener <rguenther@suse.de>
31042 PR middle-end/98807
31043 * tree.c (vector_element_bits): Always use precision of
31044 the element type for boolean vectors.
31046 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
31048 * config/rtems.h (STARTFILE_SPEC): Remove qnolinkcmds.
31049 (ENDFILE_SPEC): Evaluate qnolinkcmds.
31051 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
31053 * config/rtems.h (STARTFILE_SPEC): Remove nostdlib and
31054 nostartfiles handling since this is already done by
31055 LINK_COMMAND_SPEC. Evaluate qnolinkcmds.
31056 (ENDFILE_SPEC): Remove nostdlib and nostartfiles handling since this
31057 is already done by LINK_COMMAND_SPEC.
31058 (LIB_SPECS): Remove nostdlib and nodefaultlibs handling since
31059 this is already done by LINK_COMMAND_SPEC. Remove qnolinkcmds
31062 2021-01-25 Jakub Jelinek <jakub@redhat.com>
31065 * fold-const-call.c (host_size_t_cst_p): Renamed to ...
31066 (size_t_cst_p): ... this. Check and store unsigned HOST_WIDE_INT
31067 value rather than host size_t.
31068 (fold_const_call): Change type of s2 from size_t to
31069 unsigned HOST_WIDE_INT. Use size_t_cst_p instead of
31070 host_size_t_cst_p. For strncmp calls, pass MIN (s2, SIZE_MAX)
31071 instead of s2 as last argument.
31073 2021-01-25 Tamar Christina <tamar.christina@arm.com>
31075 * config/arm/iterators.md (rotsplit1, rotsplit2, conj_op, fcmac1,
31076 VCMLA_OP, VCMUL_OP): New.
31077 * config/arm/mve.md (mve_vcmlaq<mve_rot><mode>): Support vec_dup 0.
31078 * config/arm/neon.md (cmul<conj_op><mode>3): New.
31079 * config/arm/unspecs.md (UNSPEC_VCMLA_CONJ, UNSPEC_VCMLA180_CONJ,
31080 UNSPEC_VCMUL_CONJ): New.
31081 * config/arm/vec-common.md (cmul<conj_op><mode>3, arm_vcmla<rot><mode>,
31082 cml<fcmac1><conj_op><mode>4): New.
31084 2021-01-23 Jakub Jelinek <jakub@redhat.com>
31087 * config/rs6000/mmintrin.h (__m64): Add __may_alias__ attribute.
31089 2021-01-22 Jonathan Wright <jonathan.wright@arm.com>
31091 * config/aarch64/aarch64-simd-builtins.def: Add mla builtin
31093 * config/aarch64/arm_neon.h (vmla_s8): Use RTL builtin rather
31095 (vmla_s16): Likewise.
31096 (vmla_s32): Likewise.
31097 (vmla_u8): Likewise.
31098 (vmla_u16): Likewise.
31099 (vmla_u32): Likewise.
31100 (vmlaq_s8): Likewise.
31101 (vmlaq_s16): Likewise.
31102 (vmlaq_s32): Likewise.
31103 (vmlaq_u8): Likewise.
31104 (vmlaq_u16): Likewise.
31105 (vmlaq_u32): Likewise.
31107 2021-01-22 David Malcolm <dmalcolm@redhat.com>
31109 * doc/invoke.texi (GCC_EXTRA_DIAGNOSTIC_OUTPUT): Add @findex
31112 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31115 * dwarf2out.c (output_file_names): For -gdwarf-5, if there are no
31116 filenames to emit, still emit the required 0 index directory and
31117 filename entries that match DW_AT_comp_dir and DW_AT_name of the
31120 2021-01-22 Marek Polacek <polacek@redhat.com>
31123 * doc/invoke.texi: Update C++ ABI Version 15 description.
31125 2021-01-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31127 PR tree-optimization/98766
31128 * tree-ssa-math-opts.c (convert_mult_to_fma): Use maybe_le when
31129 comparing against type size with param_avoid_fma_max_bits.
31131 2021-01-22 Richard Biener <rguenther@suse.de>
31133 PR middle-end/98793
31134 * tree.c (vector_element_bits): Key single-bit bool vector on
31135 integer mode rather than not vector mode.
31137 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
31140 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31141 Generate ARRAY_REF(VIEW_CONVERT_EXPR) for P8 and later
31143 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): Update
31144 to call different path for P8 and P9.
31145 (rs6000_expand_vector_set_var_p9): New function.
31146 (rs6000_expand_vector_set_var_p8): New function.
31148 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
31152 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31153 Ajdust variable index vec_insert from address dereference to
31154 ARRAY_REF(VIEW_CONVERT_EXPR) tree expression.
31155 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
31157 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): New function.
31159 2021-01-22 Martin Liska <mliska@suse.cz>
31161 PR gcov-profile/98739
31162 * profile.c (compute_value_histograms): Drop time profile for
31163 -fprofile-reproducible=multithreaded.
31165 2021-01-22 Nathan Sidwell <nathan@acm.org>
31167 * gcc.c (process_command): Don't check OPT_SPECIAL_input_file
31170 2021-01-22 Richard Biener <rguenther@suse.de>
31172 PR middle-end/98773
31173 * tree-data-ref.c (initalize_matrix_A): Revert previous
31174 change, retaining failing on HOST_WIDE_INT_MIN CHREC_RIGHT.
31176 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31178 PR tree-optimization/90248
31179 * match.pd (X cmp 0.0 ? 1.0 : -1.0 -> copysign(1, +-X),
31180 X cmp 0.0 ? -1.0 : +1.0 -> copysign(1, -+X)): Remove
31182 (X * (X cmp 0.0 ? 1.0 : -1.0) -> +-abs(X),
31183 X * (X cmp 0.0 ? -1.0 : 1.0) -> +-abs(X)): New simplifications.
31185 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31187 PR tree-optimization/98255
31188 * tree-dfa.c (get_ref_base_and_extent): For ARRAY_REFs, sign
31189 extend index - low_bound from sizetype's precision rather than index
31191 (get_addr_base_and_unit_offset_1): Likewise.
31192 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Likewise.
31193 * gimple-fold.c (fold_const_aggregate_ref_1): Likewise.
31195 2021-01-22 Richard Biener <rguenther@suse.de>
31197 PR tree-optimization/98786
31198 * tree-ssa-phiopt.c (factor_out_conditional_conversion): Avoid
31199 adding new uses of abnormals. Verify we deal with a conditional
31202 2021-01-22 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
31205 * optc-save-gen.awk: Add arm_fp16_format to checked_options.
31207 2021-01-22 liuhongt <hongtao.liu@intel.com>
31211 * config/i386/sse.md (VI_128_256): New mode iterator.
31212 (*avx_cmp<mode>3_1, *avx_cmp<mode>3_2, *avx_cmp<mode>3_3,
31213 *avx_cmp<mode>3_4, *avx2_eq<mode>3, *avx2_pcmp<mode>3_1,
31214 *avx2_pcmp<mode>3_2, *avx2_gt<mode>3): New
31215 define_insn_and_split to lower avx512 vector comparison to avx
31216 version when dest is vector.
31217 (*<avx512>_cmp<mode>3,*<avx512>_cmp<mode>3,*<avx512>_ucmp<mode>3):
31218 define_insn_and_split for negating the comparison result.
31219 * config/i386/predicates.md (float_vector_all_ones_operand):
31221 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Use
31222 general NOT operator without UNSPEC_MASKOP.
31224 2021-01-21 Vladimir N. Makarov <vmakarov@redhat.com>
31226 PR rtl-optimization/98777
31227 * lra-int.h (lra_pmode_pseudo): New extern.
31228 * lra.c (lra_pmode_pseudo): New global.
31230 * lra-eliminations.c (eliminate_regs_in_insn): Use it.
31232 2021-01-21 Ilya Leoshkevich <iii@linux.ibm.com>
31234 * fwprop.c (fwprop_propagation::classify_result): Allow
31235 (subreg (mem)) simplifications.
31237 2021-01-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31239 * config/aarch64/aarch64-simd.md (aarch64_sqdml<SBINQOPS:as>l<mode>):
31241 (aarch64_sqdmlal<mode>): ... This...
31242 (aarch64_sqdmlsl<mode>): ... And this.
31243 (aarch64_sqdml<SBINQOPS:as>l_lane<mode>): Split into...
31244 (aarch64_sqdmlal_lane<mode>): ... This...
31245 (aarch64_sqdmlsl_lane<mode>): ... And this.
31246 (aarch64_sqdml<SBINQOPS:as>l_laneq<mode>): Split into...
31247 (aarch64_sqdmlsl_laneq<mode>): ... This...
31248 (aarch64_sqdmlal_laneq<mode>): ... And this.
31249 (aarch64_sqdml<SBINQOPS:as>l_n<mode>): Split into...
31250 (aarch64_sqdmlsl_n<mode>): ... This...
31251 (aarch64_sqdmlal_n<mode>): ... And this.
31252 (aarch64_sqdml<SBINQOPS:as>l2<mode>_internal): Split into...
31253 (aarch64_sqdmlal2<mode>_internal): ... This...
31254 (aarch64_sqdmlsl2<mode>_internal): ... And this.
31256 2021-01-21 Christophe Lyon <christophe.lyon@linaro.org>
31258 * config/arm/arm_mve.h (__arm_vcmpneq_s8): Fix return type.
31260 2021-01-21 Andrea Corallo <andrea.corallo@arm.com>
31263 * doc/sourcebuild.texi (arm_thumb2_no_arm_v8_1_lob): Document.
31265 2021-01-21 liuhongt <hongtao.liu@intel.com>
31267 PR rtl-optimization/98694
31268 * regcprop.c (copy_value): If SRC had been assigned a mode
31269 narrower than the copy, we can't link DEST into the chain even
31270 they have same hard_regno_nregs(i.e. HImode/SImode in i386
31273 2021-01-20 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31275 * config/aarch64/aarch64-simd.md (aarch64_get_lane<mode>):
31276 Convert to define_insn_and_split. Split into simple move when moving
31279 2021-01-20 Segher Boessenkool <segher@kernel.crashing.org>
31281 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_store): Change assert.
31282 Adjust comment. Simplify code.
31284 2021-01-20 Jakub Jelinek <jakub@redhat.com>
31287 * dwarf2out.c (reset_indirect_string): Also reset indirect strings
31288 with DW_FORM_line_strp form.
31289 (prune_unused_types_update_strings): Don't add into debug_str_hash
31290 indirect strings with DW_FORM_line_strp form.
31291 (adjust_name_comp_dir): New function.
31292 (dwarf2out_finish): Call it on CU DIEs after resetting
31293 debug_line_str_hash.
31295 2021-01-20 Vladimir N. Makarov <vmakarov@redhat.com>
31297 PR rtl-optimization/98722
31298 * lra-eliminations.c (eliminate_regs_in_insn): Check that target
31299 has no 3-op add insn to transform insns containing two pluses.
31301 2021-01-20 Richard Biener <rguenther@suse.de>
31303 * hwint.h (add_hwi): New function.
31304 (mul_hwi): Likewise.
31305 * tree-data-ref.c (initialize_matrix_A): Properly translate
31306 tree constants and avoid HOST_WIDE_INT_MIN.
31307 (lambda_matrix_row_add): Avoid undefined integer overflow
31308 and return true on such overflow.
31309 (lambda_matrix_right_hermite): Handle overflow from
31310 lambda_matrix_row_add gracefully. Simplify previous fix.
31311 (analyze_subscript_affine_affine): Likewise.
31313 2021-01-20 Eugene Rozenfeld <erozen@microsoft.com>
31315 PR tree-optimization/96674
31316 * match.pd: New patterns: x < y || y == XXX_MIN --> x <= y - 1
31317 x >= y && y != XXX_MIN --> x > y - 1
31319 2021-01-20 Richard Sandiford <richard.sandiford@arm.com>
31321 PR tree-optimization/98535
31322 * tree-vect-slp.c (duplicate_and_interleave): Use quick_grow_cleared.
31323 If the high and low permutes are the same, remove the high permutes
31324 from the working set and only continue with the low ones.
31326 2021-01-20 Jakub Jelinek <jakub@redhat.com>
31328 PR tree-optimization/98721
31329 * builtins.c (access_ref::inform_access): Don't assume
31330 SSA_NAME_IDENTIFIER must be non-NULL. Print messages about
31331 object whenever allocfn is NULL, rather than only when DECL_P
31332 is true. Use %qE instead of %qD for that. Formatting fixes.
31334 2021-01-20 Richard Biener <rguenther@suse.de>
31336 PR tree-optimization/98758
31337 * tree-data-ref.c (int_divides_p): Use lambda_int arguments.
31338 (lambda_matrix_right_hermite): Avoid undefinedness with
31339 signed integer abs and multiplication.
31340 (analyze_subscript_affine_affine): Use lambda_int.
31342 2021-01-20 David Malcolm <dmalcolm@redhat.com>
31345 * dwarf2out.c (output_line_info): Rename static variable
31346 "generation", moving it out of the function to...
31347 (output_line_info_generation): New.
31348 (init_sections_and_labels): Likewise, renaming the variable to...
31349 (init_sections_and_labels_generation): New.
31350 (dwarf2out_c_finalize): Reset the new variables.
31352 2021-01-19 Martin Sebor <msebor@redhat.com>
31354 PR middle-end/98664
31355 * tree-ssa-live.c (remove_unused_scope_block_p): Keep scopes for
31356 all functions, even if they're not declared artificial or inline.
31357 * tree.c (tree_inlined_location): Use macro expansion location
31358 only if scope traversal fails to expose one.
31360 2021-01-19 Richard Sandiford <richard.sandiford@arm.com>
31362 PR rtl-optimization/92294
31363 * alias.c (compare_base_symbol_refs): Take an extra parameter
31364 and add the distance between two symbols to it. Enshrine in
31365 comments that -1 means "either 0 or 1, but we can't tell
31366 which at compile time".
31367 (memrefs_conflict_p): Update call accordingly.
31368 (rtx_equal_for_memref_p): Likewise. Take the distance between symbols
31371 2021-01-19 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31373 * config/aarch64/aarch64-simd-builtins.def (sqshl, uqshl,
31374 sqrshl, uqrshl, sqadd, uqadd, sqsub, uqsub, suqadd, usqadd, sqmovn,
31375 uqmovn, sqxtn2, uqxtn2, sqabs, sqneg, sqdmlal, sqdmlsl, sqdmlal_lane,
31376 sqdmlsl_lane, sqdmlal_laneq, sqdmlsl_laneq, sqdmlal_n, sqdmlsl_n,
31377 sqdmlal2, sqdmlsl2, sqdmlal2_lane, sqdmlsl2_lane, sqdmlal2_laneq,
31378 sqdmlsl2_laneq, sqdmlal2_n, sqdmlsl2_n, sqdmull, sqdmull_lane,
31379 sqdmull_laneq, sqdmull_n, sqdmull2, sqdmull2_lane, sqdmull2_laneq,
31380 sqdmull2_n, sqdmulh, sqrdmulh, sqdmulh_lane, sqdmulh_laneq,
31381 sqrdmulh_lane, sqrdmulh_laneq, sqshrun_n, sqrshrun_n, sqshrn_n,
31382 uqshrn_n, sqrshrn_n, uqrshrn_n, sqshlu_n, sqshl_n, uqshl_n, sqrdmlah,
31383 sqrdmlsh, sqrdmlah_lane, sqrdmlsh_lane, sqrdmlah_laneq, sqrdmlsh_laneq,
31384 sqmovun): Use NONE flags.
31386 2021-01-19 Richard Biener <rguenther@suse.de>
31389 * ipa-modref.c (analyze_stmt): Only record a summary for a
31392 2021-01-19 Richard Biener <rguenther@suse.de>
31394 PR middle-end/98638
31395 * tree-ssanames.c (fini_ssanames): Zero SSA_NAME_DEF_STMT.
31397 2021-01-19 Daniel Hellstrom <daniel@gaisler.com>
31399 * config/sparc/rtemself.h (TARGET_OS_CPP_BUILTINS): Add
31400 built-in define __FIX_LEON3FT_TN0018.
31402 2021-01-19 Richard Biener <rguenther@suse.de>
31405 * tree-inline.c (tree_function_versioning): Set input_location
31406 to UNKNOWN_LOCATION throughout the function.
31408 2021-01-19 Tobias Burnus <tobias@codesourcery.com>
31411 * omp-low.c (lower_omp_target): Handle nonpointer is_device_ptr.
31413 2021-01-19 Martin Jambor <mjambor@suse.cz>
31416 * ipa-sra.c (ssa_name_only_returned_p): New parameter fun. Check
31417 whether non-call exceptions allow removal of a statement.
31418 (isra_analyze_call): Pass the appropriate function to
31419 ssa_name_only_returned_p.
31421 2021-01-19 Geng Qi <gengqi@linux.alibaba.com>
31423 * config/riscv/arch-canonicalize (longext_sort): New function for
31424 sorting 'multi-letter'.
31425 * config/riscv/multilib-generator: Adjusting the loop of 'alt' in
31426 'alts'. The 'arch' may not be the first of 'alts'.
31427 (_expand_combination): Add underline for the 'ext' without '*'.
31428 This is because, a single-letter extension can always be treated well
31429 with a '_' prefix, but it cannot be separated out if it is appended
31432 2021-01-18 Vladimir N. Makarov <vmakarov@redhat.com>
31435 * ira.c (ira): Skip abnormal critical edge splitting.
31437 2021-01-18 Jakub Jelinek <jakub@redhat.com>
31439 PR tree-optimization/98727
31440 * tree-ssa-math-opts.c (match_arith_overflow): Fix up computation of
31441 second .MUL_OVERFLOW operand for signed multiplication with overflow
31442 checking if the second operand of multiplication is not constant.
31444 2021-01-18 David Edelsohn <dje.gcc@gmail.com>
31446 * doc/invoke.texi (-gdwarf): TPF defaults to version 2 and AIX
31447 defaults to version 4.
31449 2021-01-18 David Malcolm <dmalcolm@redhat.com>
31451 * attribs.h (fndecl_dealloc_argno): New decl.
31452 * builtins.c (call_dealloc_argno): Split out second half of
31454 (fndecl_dealloc_argno): New.
31455 * doc/extend.texi (Common Function Attributes): Document the
31456 interaction between the analyzer and the malloc attribute.
31457 * doc/invoke.texi (Static Analyzer Options): Likewise.
31459 2021-01-17 David Edelsohn <dje.gcc@gmail.com>
31461 * config/rs6000/aix71.h (SUBTARGET_OVERRIDE_OPTIONS): Override
31462 dwarf_version to 4.
31463 * config/rs6000/aix72.h (SUBTARGET_OVERRIDE_OPTIONS): Same.
31465 2021-01-17 Martin Jambor <mjambor@suse.cz>
31468 * cgraph.c (clone_of_p): Check also former_clone_of as we climb
31471 2021-01-17 Mark Wielaard <mark@klomp.org>
31473 * common.opt (gdwarf-): Init(5).
31474 * doc/invoke.texi (-gdwarf): Document default to 5.
31476 2021-01-16 Kwok Cheung Yeung <kcy@codesourcery.com>
31478 * builtin-types.def
31479 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT): Rename
31481 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR):
31482 ...this. Add extra argument.
31483 * gimplify.c (omp_default_clause): Ensure that event handle is
31484 firstprivate in a task region.
31485 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_DETACH.
31486 (gimplify_adjust_omp_clauses): Likewise.
31487 * omp-builtins.def (BUILT_IN_GOMP_TASK): Change function type to
31488 BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR.
31489 * omp-expand.c (expand_task_call): Add GOMP_TASK_FLAG_DETACH to flags
31490 if detach clause specified. Add detach argument when generating
31492 * omp-low.c (scan_sharing_clauses): Setup data environment for detach
31494 (finish_taskreg_scan): Move field for variable containing the event
31495 handle to the front of the struct.
31496 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_DETACH. Fix
31498 * tree-nested.c (convert_nonlocal_omp_clauses): Handle
31499 OMP_CLAUSE_DETACH clause.
31500 (convert_local_omp_clauses): Handle OMP_CLAUSE_DETACH clause.
31501 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_DETACH.
31502 * tree.c (omp_clause_num_ops): Add entry for OMP_CLAUSE_DETACH.
31504 (omp_clause_code_name): Add entry for OMP_CLAUSE_DETACH. Fix
31506 (walk_tree_1): Handle OMP_CLAUSE_DETACH.
31508 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
31510 * config/nios2/t-rtems: Reset all MULTILIB_* variables. Shorten
31511 multilib directory names. Use MULTILIB_REQUIRED instead of
31512 MULTILIB_EXCEPTIONS. Add -mhw-mul -mhw-mulx -mhw-div
31513 -mcustom-fpu-cfg=fph2 multilib.
31515 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
31517 * config/nios2/nios2.c (NIOS2_FPU_CONFIG_NUM): Adjust value.
31518 (nios2_init_fpu_configs): Provide register values for new
31519 -mcustom-fpu-cfg=fph2 option variant.
31520 * doc/invoke.texi (-mcustom-fpu-cfg=fph2): Document new option
31523 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
31525 * config/nios2/nios2.c (nios2_custom_check_insns): Remove
31526 custom instruction warnings.
31528 2021-01-16 Jakub Jelinek <jakub@redhat.com>
31530 PR tree-optimization/96669
31531 * match.pd ((CST << x) & 1 -> x == 0): New simplification.
31533 2021-01-16 Jakub Jelinek <jakub@redhat.com>
31535 PR tree-optimization/96271
31536 * passes.def: Pass false argument to first two pass_cd_dce
31537 instances and true to last instance. Add comment that
31538 last instance rewrites no longer addressed locals.
31539 * tree-ssa-dce.c (pass_cd_dce): Add update_address_taken_p member and
31541 (pass_cd_dce::set_pass_param): New method.
31542 (pass_cd_dce::execute): Return TODO_update_address_taken from
31543 last cd_dce instance.
31545 2021-01-15 Carl Love <cel@us.ibm.com>
31547 * config/rs6000/altivec.h (vec_mulh, vec_div, vec_dive, vec_mod):
31549 * config/rs6000/altivec.md (VIlong): Move define to file vsx.md.
31550 * config/rs6000/rs6000-builtin.def (DIVES_V4SI, DIVES_V2DI,
31551 DIVEU_V4SI, DIVEU_V2DI, DIVS_V4SI, DIVS_V2DI, DIVU_V4SI,
31552 DIVU_V2DI, MODS_V2DI, MODS_V4SI, MODU_V2DI, MODU_V4SI,
31553 MULHS_V2DI, MULHS_V4SI, MULHU_V2DI, MULHU_V4SI, MULLD_V2DI):
31554 Add builtin define.
31555 (MULH, DIVE, MOD): Add new BU_P10_OVERLOAD_2 definitions.
31556 * config/rs6000/rs6000-call.c (VSX_BUILTIN_VEC_DIV,
31557 VSX_BUILTIN_VEC_DIVE, P10_BUILTIN_VEC_MOD, P10_BUILTIN_VEC_MULH):
31558 New overloaded definitions.
31559 (builtin_function_type) [P10V_BUILTIN_DIVEU_V4SI,
31560 P10V_BUILTIN_DIVEU_V2DI, P10V_BUILTIN_DIVU_V4SI,
31561 P10V_BUILTIN_DIVU_V2DI, P10V_BUILTIN_MODU_V2DI,
31562 P10V_BUILTIN_MODU_V4SI, P10V_BUILTIN_MULHU_V2DI,
31563 P10V_BUILTIN_MULHU_V4SI]: Add case
31564 statement for builtins.
31565 * config/rs6000/rs6000.md (bits): Add new attribute sizes V4SI, V2DI.
31566 * config/rs6000/vsx.md (VIlong): Moved from config/rs6000/altivec.md.
31567 (UNSPEC_VDIVES, UNSPEC_VDIVEU): New unspec definitions.
31568 (vsx_mul_v2di): Add if TARGET_POWER10 statement.
31569 (vsx_udiv_v2di): Add if TARGET_POWER10 statement.
31570 (dives_<mode>, diveu_<mode>, div<mode>3, uvdiv<mode>3,
31571 mods_<mode>, modu_<mode>, mulhs_<mode>, mulhu_<mode>, mulv2di3):
31572 Add define_insn, mode is VIlong.
31573 * doc/extend.texi (vec_mulh, vec_mul, vec_div, vec_dive, vec_mod):
31574 Add builtin descriptions.
31576 2021-01-15 Eric Botcazou <ebotcazou@adacore.com>
31578 * final.c (final_start_function_1): Reset force_source_line.
31580 2021-01-15 Jakub Jelinek <jakub@redhat.com>
31582 PR tree-optimization/96669
31583 * match.pd (((1 << A) & 1) != 0 -> A == 0,
31584 ((1 << A) & 1) == 0 -> A != 0): Generalize for 1s replaced by
31585 possibly different power of two constants and to right shift too.
31587 2021-01-15 Jakub Jelinek <jakub@redhat.com>
31589 PR tree-optimization/96681
31590 * match.pd ((x < 0) ^ (y < 0) to (x ^ y) < 0): New simplification.
31591 ((x >= 0) ^ (y >= 0) to (x ^ y) < 0): Likewise.
31592 ((x < 0) ^ (y >= 0) to (x ^ y) >= 0): Likewise.
31593 ((x >= 0) ^ (y < 0) to (x ^ y) >= 0): Likewise.
31595 2021-01-15 Alexandre Oliva <oliva@adacore.com>
31597 * opts.c (gen_command_line_string): Exclude -dumpbase-ext.
31599 2021-01-15 Tamar Christina <tamar.christina@arm.com>
31601 * config/aarch64/aarch64-simd.md (cml<fcmac1><conj_op><mode>4,
31602 cmul<conj_op><mode>3): New.
31603 * config/aarch64/iterators.md (UNSPEC_FCMUL,
31604 UNSPEC_FCMUL180, UNSPEC_FCMLA_CONJ, UNSPEC_FCMLA180_CONJ,
31605 UNSPEC_CMLA_CONJ, UNSPEC_CMLA180_CONJ, UNSPEC_CMUL, UNSPEC_CMUL180,
31606 FCMLA_OP, FCMUL_OP, conj_op, rotsplit1, rotsplit2, fcmac1, sve_rot1,
31607 sve_rot2, SVE2_INT_CMLA_OP, SVE2_INT_CMUL_OP, SVE2_INT_CADD_OP): New.
31608 (rot): Add UNSPEC_FCMUL, UNSPEC_FCMUL180.
31609 (rot_op): Renamed to conj_op.
31610 * config/aarch64/aarch64-sve.md (cml<fcmac1><conj_op><mode>4,
31611 cmul<conj_op><mode>3): New.
31612 * config/aarch64/aarch64-sve2.md (cml<fcmac1><conj_op><mode>4,
31613 cmul<conj_op><mode>3): New.
31615 2021-01-15 David Malcolm <dmalcolm@redhat.com>
31619 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
31620 Escape the tempfile name when constructing the expected output.
31622 2021-01-15 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31624 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlsl_hi<mode>):
31626 (aarch64_<su>mlsl_hi<mode>): ... This.
31627 (aarch64_<su>mlsl_hi<mode>): Define.
31628 (*aarch64_<su>mlsl<mode): Rename to...
31629 (aarch64_<su>mlsl<mode): ... This.
31630 * config/aarch64/aarch64-simd-builtins.def (smlsl, umlsl,
31631 smlsl_hi, umlsl_hi): Define builtins.
31632 * config/aarch64/arm_neon.h (vmlsl_high_s8, vmlsl_high_s16,
31633 vmlsl_high_s32, vmlsl_high_u8, vmlsl_high_u16, vmlsl_high_u32,
31634 vmlsl_s8, vmlsl_s16, vmlsl_s32, vmlsl_u8,
31635 vmlsl_u16, vmlsl_u32): Reimplement with builtins.
31637 2021-01-15 Uroš Bizjak <ubizjak@gmail.com>
31639 * config/i386/i386-c.c (ix86_target_macros):
31640 Use cpp_define_formatted for __SIZEOF_FLOAT80__ definition.
31642 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
31645 * config.gcc (aarch64*-*-*): Add aarch64-cc-fusion.o to extra_objs.
31646 * Makefile.in (RTL_SSA_H): New variable.
31647 * config/aarch64/t-aarch64 (aarch64-cc-fusion.o): New rule.
31648 * config/aarch64/aarch64-protos.h (make_pass_cc_fusion): Declare.
31649 * config/aarch64/aarch64-passes.def: Add pass_cc_fusion after
31651 * config/aarch64/aarch64-cc-fusion.cc: New file.
31653 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
31655 * recog.h (insn_change_watermark::~insn_change_watermark): Avoid
31656 calling cancel_changes for changes that no longer exist.
31658 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
31660 * rtl-ssa/functions.h (function_info::ref_defs): Rename to...
31661 (function_info::reg_defs): ...this.
31662 * rtl-ssa/member-fns.inl (function_info::ref_defs): Rename to...
31663 (function_info::reg_defs): ...this.
31665 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31668 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
31670 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31673 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31676 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
31678 2021-01-15 Richard Biener <rguenther@suse.de>
31680 PR tree-optimization/96376
31681 * tree-vect-stmts.c (get_load_store_type): Disregard alignment
31682 for VMAT_INVARIANT.
31684 2021-01-15 Martin Liska <mliska@suse.cz>
31686 * doc/install.texi: Document that some tests need pytest module.
31687 * doc/sourcebuild.texi: Likewise.
31689 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31692 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
31694 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31696 * config/arm/mve.md (mve_vshrq_n_s<mode>_imm): New entry.
31697 (mve_vshrq_n_u<mode>_imm): Likewise.
31698 * config/arm/neon.md (vashr<mode>3, vlshr<mode>3): Move to ...
31699 * config/arm/vec-common.md: ... here.
31701 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31703 * config/arm/mve.md (mve_vshlq_<supf><mode>): Move to
31705 * config/arm/neon.md (vashl<mode>3): Delete.
31706 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): New.
31707 (vasl<mode>3): New expander.
31709 2021-01-15 Richard Biener <rguenther@suse.de>
31711 PR tree-optimization/98685
31712 * tree-vect-slp.c (vect_schedule_slp_node): Refactor handling
31713 of vector extern defs.
31715 2021-01-14 David Malcolm <dmalcolm@redhat.com>
31718 * diagnostic.c (diagnostic_kind_text): Break out this array
31720 (diagnostic_build_prefix): ...here.
31721 (fancy_abort): Detect when diagnostic_initialize has not yet been
31722 called and fall back to a minimal implementation of printing the
31723 ICE, rather than segfaulting in internal_error.
31725 2021-01-14 David Malcolm <dmalcolm@redhat.com>
31727 * diagnostic.c (diagnostic_initialize): Eliminate
31728 parseable_fixits_p in favor of initializing extra_output_kind from
31729 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
31730 (convert_column_unit): New function, split out from...
31731 (diagnostic_converted_column): ...this.
31732 (print_parseable_fixits): Add "column_unit" and "tabstop" params.
31733 Use them to call convert_column_unit on the column values.
31734 (diagnostic_report_diagnostic): Eliminate conditional on
31735 parseable_fixits_p in favor of a switch statement on
31736 extra_output_kind, passing the appropriate values to the new
31737 params of print_parseable_fixits.
31738 (selftest::test_print_parseable_fixits_none): Update for new
31739 params of print_parseable_fixits.
31740 (selftest::test_print_parseable_fixits_insert): Likewise.
31741 (selftest::test_print_parseable_fixits_remove): Likewise.
31742 (selftest::test_print_parseable_fixits_replace): Likewise.
31743 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
31745 (selftest::diagnostic_c_tests): Call it.
31746 * diagnostic.h (enum diagnostics_extra_output_kind): New.
31747 (diagnostic_context::parseable_fixits_p): Delete field in favor
31749 (diagnostic_context::extra_output_kind): ...this new field.
31750 * doc/invoke.texi (Environment Variables): Add
31751 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
31752 * opts.c (common_handle_option): Update handling of
31753 OPT_fdiagnostics_parseable_fixits for change to diagnostic_context
31756 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31758 * tree-vect-slp-patterns.c (class complex_operations_pattern,
31759 complex_operations_pattern::matches,
31760 complex_operations_pattern::recognize,
31761 complex_operations_pattern::build): New.
31762 (slp_patterns): Use it.
31764 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31766 * internal-fn.def (COMPLEX_FMS, COMPLEX_FMS_CONJ): New.
31767 * optabs.def (cmls_optab, cmls_conj_optab): New.
31768 * doc/md.texi: Document them.
31769 * tree-vect-slp-patterns.c (class complex_fms_pattern,
31770 complex_fms_pattern::matches, complex_fms_pattern::recognize,
31771 complex_fms_pattern::build): New.
31773 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31775 * internal-fn.def (COMPLEX_FMA, COMPLEX_FMA_CONJ): New.
31776 * optabs.def (cmla_optab, cmla_conj_optab): New.
31777 * doc/md.texi: Document them.
31778 * tree-vect-slp-patterns.c (vect_match_call_p,
31779 class complex_fma_pattern, vect_slp_reset_pattern,
31780 complex_fma_pattern::matches, complex_fma_pattern::recognize,
31781 complex_fma_pattern::build): New.
31783 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31785 * internal-fn.def (COMPLEX_MUL, COMPLEX_MUL_CONJ): New.
31786 * optabs.def (cmul_optab, cmul_conj_optab): New.
31787 * doc/md.texi: Document them.
31788 * tree-vect-slp-patterns.c (vect_match_call_complex_mla,
31789 vect_normalize_conj_loc, is_eq_or_top, vect_validate_multiplication,
31790 vect_build_combine_node, class complex_mul_pattern,
31791 complex_mul_pattern::matches, complex_mul_pattern::recognize,
31792 complex_mul_pattern::build): New.
31794 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31796 * tree-vect-slp.c (optimize_load_redistribution_1): New.
31797 (optimize_load_redistribution, vect_is_slp_load_node): New.
31798 (vect_match_slp_patterns): Use it.
31800 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31802 * tree-vect-slp-patterns.c (complex_add_pattern::build):
31805 2021-01-14 Thomas Schwinge <thomas@codesourcery.com>
31807 * config/gcn/mkoffload.c (main): Create an offload image only in
31808 64-bit configurations.
31810 2021-01-14 H.J. Lu <hjl.tools@gmail.com>
31813 * config/i386/i386-options.c (ix86_option_override_internal):
31814 Issue an error for -fcf-protection with CF_BRANCH when compiling
31815 for 32-bit non-TARGET_CMOV targets.
31817 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
31820 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p):
31821 Remove declaration and initialization of shadow variable "ret".
31822 (ix86_option_override_internal): Remove delcaration of
31823 shadow variable "i". Redeclare shadowed variable to unsigned.
31824 * common/config/i386/i386-common.c (pta_size): Redeclare to unsigned.
31825 * config/i386/i386-builtins.c (get_builtin_code_for_version):
31826 Update for redeclaration.
31827 * config/i386/i386.h (pta_size): Ditto.
31829 2021-01-14 Richard Biener <rguenther@suse.de>
31831 PR tree-optimization/98674
31832 * tree-data-ref.c (base_supports_access_fn_components_p): New.
31833 (initialize_data_dependence_relation): For two bases without
31834 possible access fns resort to type size equality when determining
31835 shape compatibility.
31837 2021-01-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
31840 * config/arm/arm_neon.h: Replace calls to __builtin_vcge* by
31841 <=, >= operators in vcle and vcge intrinsics respectively.
31842 * config/arm/arm_neon_builtins.def: Remove entry for
31845 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
31848 * config/i386/i386-options.c (ix86_function_specific_save):
31849 Remove redundant assignment to opts->x_ix86_branch_cost.
31850 * config/i386/i386.c (ix86_prefetch_sse):
31851 Rename from x86_prefetch_sse. Update all uses.
31852 * config/i386/i386.h: Update for rename.
31853 * config/i386/i386-options.h: Ditto.
31855 2021-01-14 Jakub Jelinek <jakub@redhat.com>
31858 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3,
31859 *sse4_1_zero_extendv4hiv4si2_3, *sse4_1_zero_extendv2siv2di2_3):
31860 Use Bm instead of m for non-avx. Add isa attribute.
31862 2021-01-14 Jakub Jelinek <jakub@redhat.com>
31864 PR tree-optimization/96688
31865 * match.pd (~(X >> Y) -> ~X >> Y): New simplification if
31866 ~X can be simplified.
31868 2021-01-14 Richard Sandiford <richard.sandiford@arm.com>
31870 * tree-vect-stmts.c (vect_model_load_cost): Account for unused
31871 IFN_LOAD_LANES results.
31873 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31875 * config/aarch64/aarch64-simd.md (aarch64_<su>xtl<mode>):
31877 (aarch64_xtn<mode>): Likewise.
31878 * config/aarch64/aarch64-simd-builtins.def (sxtl, uxtl, xtn):
31881 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
31883 (vmovl_s16): Likewise.
31884 (vmovl_s32): Likewise.
31885 (vmovl_u8): Likewise.
31886 (vmovl_u16): Likewise.
31887 (vmovl_u32): Likewise.
31888 (vmovn_s16): Likewise.
31889 (vmovn_s32): Likewise.
31890 (vmovn_s64): Likewise.
31891 (vmovn_u16): Likewise.
31892 (vmovn_u32): Likewise.
31893 (vmovn_u64): Likewise.
31895 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31897 * config/aarch64/aarch64-simd.md (aarch64_<su>qxtn2<mode>_le):
31899 (aarch64_<su>qxtn2<mode>_be): Likewise.
31900 (aarch64_<su>qxtn2<mode>): Likewise.
31901 * config/aarch64/aarch64-simd-builtins.def (sqxtn2, uqxtn2):
31903 * config/aarch64/iterators.md (SAT_TRUNC): Define code_iterator.
31904 (su): Handle ss_truncate and us_truncate.
31905 * config/aarch64/arm_neon.h (vqmovn_high_s16): Reimplement using
31907 (vqmovn_high_s32): Likewise.
31908 (vqmovn_high_s64): Likewise.
31909 (vqmovn_high_u16): Likewise.
31910 (vqmovn_high_u32): Likewise.
31911 (vqmovn_high_u64): Likewise.
31913 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31915 * config/aarch64/aarch64-simd.md (aarch64_xtn2<mode>_le):
31917 (aarch64_xtn2<mode>_be): Likewise.
31918 (aarch64_xtn2<mode>): Likewise.
31919 * config/aarch64/aarch64-simd-builtins.def (xtn2): Define
31921 * config/aarch64/arm_neon.h (vmovn_high_s16): Reimplement using
31923 (vmovn_high_s32): Likewise.
31924 (vmovn_high_s64): Likewise.
31925 (vmovn_high_u16): Likewise.
31926 (vmovn_high_u32): Likewise.
31927 (vmovn_high_u64): Likewise.
31929 2021-01-13 Stafford Horne <shorne@gmail.com>
31931 * config/or1k/or1k.h (ASM_PREFERRED_EH_DATA_FORMAT): New macro.
31933 2021-01-13 Stafford Horne <shorne@gmail.com>
31935 * config/or1k/linux.h (TARGET_ASM_FILE_END): Define macro.
31937 2021-01-13 Stafford Horne <shorne@gmail.com>
31939 * config/or1k/or1k.h (TARGET_CPU_CPP_BUILTINS): Add builtin
31940 define for __or1k_hard_float__.
31942 2021-01-13 Stafford Horne <shorne@gmail.com>
31944 * config/or1k/or1k.h (NO_PROFILE_COUNTERS): Define as 1.
31945 (PROFILE_HOOK): Define to call _mcount.
31946 (FUNCTION_PROFILER): Change from abort to no-op.
31948 2021-01-13 Jakub Jelinek <jakub@redhat.com>
31950 PR tree-optimization/96691
31951 * match.pd ((~X | C) ^ D -> (X | C) ^ (~D ^ C),
31952 (~X & C) ^ D -> (X & C) ^ (D ^ C)): New simplifications if
31953 (~D ^ C) or (D ^ C) can be simplified.
31955 2021-01-13 Richard Biener <rguenther@suse.de>
31957 PR tree-optimization/92645
31958 * match.pd (BIT_FIELD_REF to conversion): Delay canonicalization
31959 until after vector lowering.
31961 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
31963 * config/aarch64/aarch64-sve.md (fnma<mode>4): Extend from SVE_FULL_I
31965 (@aarch64_pred_fnma<mode>, cond_fnma<mode>, *cond_fnma<mode>_2)
31966 (*cond_fnma<mode>_4, *cond_fnma<mode>_any): Likewise.
31968 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
31970 * config/aarch64/aarch64-sve.md (fma<mode>4): Extend from SVE_FULL_I
31972 (@aarch64_pred_fma<mode>, cond_fma<mode>, *cond_fma<mode>_2)
31973 (*cond_fma<mode>_4, *cond_fma<mode>_any): Likewise.
31975 2021-01-13 Richard Biener <rguenther@suse.de>
31977 PR tree-optimization/92645
31978 * tree-vect-slp.c (vect_build_slp_tree_1): Relax supported
31979 BIT_FIELD_REF argument.
31980 (vect_build_slp_tree_2): Record the desired vector type
31981 on the external vector def.
31982 (vectorizable_slp_permutation): Handle required punning
31983 of existing vector defs.
31985 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
31987 * rtl-ssa/accesses.h (def_lookup): Fix order of comparison results.
31989 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
31991 * config/sh/sh.md (movsf_ie): Remove operands[2] test.
31993 2021-01-13 Samuel Thibault <samuel.thibault@ens-lyon.org>
31995 * config.gcc [$target == *-*-gnu*]: Enable
31996 'default_gnu_indirect_function'.
31998 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32001 * optabs.c (expand_vec_perm_const): Don't force v0 and v1 into
32002 registers before calling targetm.vectorize.vec_perm_const, only after
32004 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const): Handle
32005 two argument permutation when one operand is zero vector and only
32006 after that force operands into registers.
32007 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_1): New
32008 define_insn_and_split pattern.
32009 (*avx512bw_zero_extendv32qiv32hi2_1): Likewise.
32010 (*avx512f_zero_extendv16hiv16si2_1): Likewise.
32011 (*avx2_zero_extendv8hiv8si2_1): Likewise.
32012 (*avx512f_zero_extendv8siv8di2_1): Likewise.
32013 (*avx2_zero_extendv4siv4di2_1): Likewise.
32014 * config/mips/mips.c (mips_vectorize_vec_perm_const): Force operands
32016 * config/arm/arm.c (arm_vectorize_vec_perm_const): Likewise.
32017 * config/sparc/sparc.c (sparc_vectorize_vec_perm_const): Likewise.
32018 * config/ia64/ia64.c (ia64_vectorize_vec_perm_const): Likewise.
32019 * config/aarch64/aarch64.c (aarch64_vectorize_vec_perm_const): Likewise.
32020 * config/rs6000/rs6000.c (rs6000_vectorize_vec_perm_const): Likewise.
32021 * config/gcn/gcn.c (gcn_vectorize_vec_perm_const): Likewise. Use std::swap.
32023 2021-01-13 Martin Liska <mliska@suse.cz>
32025 PR tree-optimization/98455
32026 * gimple-if-to-switch.cc (condition_info::record_phi_mapping):
32027 Record also virtual PHIs.
32028 (pass_if_to_switch::execute): Return TODO_cleanup_cfg only
32031 2021-01-13 Jonathan Wakely <jwakely@redhat.com>
32033 * doc/invoke.texi (C++ Modules): Fix typos.
32035 2021-01-13 Richard Biener <rguenther@suse.de>
32037 PR tree-optimization/98640
32038 * tree-ssa-sccvn.c (visit_nary_op): Do not try to
32039 handle plus or minus from a truncated operand to be
32042 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32045 * config/i386/i386.md (*btr<mode>_1, *btr<mode>_2): New
32046 define_insn_and_split patterns.
32047 (splitter after *btr<mode>_2): New splitter.
32049 2021-01-13 Martin Liska <mliska@suse.cz>
32052 * cgraphunit.c (analyze_functions): Remove dead code.
32054 2021-01-13 Qian Jianhua <qianjh@cn.fujitsu.com>
32056 * config/aarch64/aarch64-cost-tables.h (a64fx_extra_costs): New.
32057 * config/aarch64/aarch64.c (a64fx_addrcost_table): New.
32058 (a64fx_regmove_cost, a64fx_vector_cost): New.
32059 (a64fx_tunings): Use the new added cost tables.
32061 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32064 * config/i386/predicates.md (pmovzx_parallel): New predicate.
32065 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): New
32066 define_insn_and_split pattern.
32067 (*sse4_1_zero_extendv4hiv4si2_3): Likewise.
32068 (*sse4_1_zero_extendv2siv2di2_3): Likewise.
32070 2021-01-13 Julian Brown <julian@codesourcery.com>
32072 * config/gcn/gcn.c (gcn_conditional_register_usage): Remove dead code
32073 to fix v0 register.
32075 2021-01-13 Julian Brown <julian@codesourcery.com>
32077 * config/gcn/gcn.c (gcn_md_reorg): Fix case where EXEC reg is live
32080 2021-01-13 Julian Brown <julian@codesourcery.com>
32082 * config/gcn/gcn-valu.md (recip<mode>2<exec>, recip<mode>2): Use unspec
32083 for reciprocal-approximation instructions.
32084 (div<mode>3): Use fused multiply-accumulate operations for reciprocal
32085 refinement and division result.
32086 * config/gcn/gcn.md (UNSPEC_RCP): New unspec constant.
32088 2021-01-13 Julian Brown <julian@codesourcery.com>
32090 * config/gcn/gcn-valu.md (subdf): Rename to...
32093 2021-01-12 Martin Liska <mliska@suse.cz>
32095 * gcov.c (source_info::debug): Fix printf format for 32-bit hosts.
32097 2021-01-12 Andrea Corallo <andrea.corallo@arm.com>
32099 * function-abi.h: Fix typo.
32101 2021-01-12 Christophe Lyon <christophe.lyon@linaro.org>
32105 * config/arm/arm.h (ARM_HAVE_NEON_V8QI_LDST): New macro.
32106 (ARM_HAVE_NEON_V16QI_LDST, ARM_HAVE_NEON_V4HI_LDST): Likewise.
32107 (ARM_HAVE_NEON_V8HI_LDST, ARM_HAVE_NEON_V2SI_LDST): Likewise.
32108 (ARM_HAVE_NEON_V4SI_LDST, ARM_HAVE_NEON_V4HF_LDST): Likewise.
32109 (ARM_HAVE_NEON_V8HF_LDST, ARM_HAVE_NEON_V4BF_LDST): Likewise.
32110 (ARM_HAVE_NEON_V8BF_LDST, ARM_HAVE_NEON_V2SF_LDST): Likewise.
32111 (ARM_HAVE_NEON_V4SF_LDST, ARM_HAVE_NEON_DI_LDST): Likewise.
32112 (ARM_HAVE_NEON_V2DI_LDST): Likewise.
32113 (ARM_HAVE_V8QI_LDST, ARM_HAVE_V16QI_LDST): Likewise.
32114 (ARM_HAVE_V4HI_LDST, ARM_HAVE_V8HI_LDST): Likewise.
32115 (ARM_HAVE_V2SI_LDST, ARM_HAVE_V4SI_LDST, ARM_HAVE_V4HF_LDST): Likewise.
32116 (ARM_HAVE_V8HF_LDST, ARM_HAVE_V4BF_LDST, ARM_HAVE_V8BF_LDST): Likewise.
32117 (ARM_HAVE_V2SF_LDST, ARM_HAVE_V4SF_LDST, ARM_HAVE_DI_LDST): Likewise.
32118 (ARM_HAVE_V2DI_LDST): Likewise.
32119 * config/arm/mve.md (*movmisalign<mode>_mve_store): New pattern.
32120 (*movmisalign<mode>_mve_load): New pattern.
32121 * config/arm/neon.md (movmisalign<mode>): Move to ...
32122 * config/arm/vec-common.md: ... here.
32124 2021-01-12 Vladimir N. Makarov <vmakarov@redhat.com>
32127 * lra-eliminations.c (eliminate_regs_in_insn): Add transformation
32128 of pattern 'plus (plus (hard reg, const), pseudo)'.
32130 2021-01-12 Richard Biener <rguenther@suse.de>
32132 PR tree-optimization/98550
32133 * tree-vect-slp.c (vect_record_max_nunits): Check whether
32134 the group size is a multiple of the vector element count.
32135 (vect_build_slp_tree_1): When we need to fail because
32136 the vector type choosen causes unrolling do so lazily
32137 without affecting matches only at the end to guide group splitting.
32139 2021-01-12 Martin Liska <mliska@suse.cz>
32142 * optc-save-gen.awk: Compare also n_target_save vars with
32145 2021-01-12 Martin Liska <mliska@suse.cz>
32147 * gcov.c (source_info::debug): New.
32148 (print_usage): Add --debug (-D) option.
32149 (process_args): Likewise.
32150 (generate_results): Call src->debug after
32151 accumulate_line_counts.
32152 (read_graph_file): Properly assign id for EXIT_BLOCK.
32153 * profile.c (branch_prob): Dump function body before it is
32156 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32158 PR tree-optimization/98629
32159 * tree-ssa-math-opts.c (arith_overflow_check_p): Don't update use_stmt
32160 unless returning non-zero.
32162 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32164 PR tree-optimization/95731
32165 * tree-ssa-reassoc.c (optimize_range_tests_cmp_bitwise): Also optimize
32166 x < 0 && y < 0 && z < 0 into (x | y | z) < 0 for signed x, y, z.
32167 (optimize_range_tests): Call optimize_range_tests_cmp_bitwise
32168 only after optimize_range_tests_var_bound.
32170 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32172 * configure.ac: Ensure c/Make-lang.in comes first in @all_lang_makefrags@.
32173 * configure: Regenerated.
32175 2021-01-12 liuhongt <hongtao.liu@intel.com>
32178 * config/i386/i386-builtins.h (BUILTIN_DESC_SWAP_OPERANDS):
32180 * config/i386/i386-expand.c (ix86_expand_sse_comi): Delete
32183 2021-01-12 Alexandre Oliva <oliva@adacore.com>
32185 * ssa-iterators.h (end_imm_use_stmt_traverse): Forward
32187 (auto_end_imm_use_stmt_traverse): New struct.
32188 (FOR_EACH_IMM_USE_STMT): Use it.
32189 (BREAK_FROM_IMM_USE_STMT, RETURN_FROM_IMM_USE_STMT): Remove,
32191 * gimple-ssa-strength-reduction.c: ... here, ...
32192 * graphite-scop-detection.c: ... here, ...
32193 * ipa-modref.c, ipa-pure-const.c, ipa-sra.c: ... here, ...
32194 * tree-predcom.c, tree-ssa-ccp.c: ... here, ...
32195 * tree-ssa-dce.c, tree-ssa-dse.c: ... here, ...
32196 * tree-ssa-loop-ivopts.c, tree-ssa-math-opts.c: ... here, ...
32197 * tree-ssa-phiprop.c, tree-ssa.c: ... here, ...
32198 * tree-vect-slp.c: ... and here, ...
32199 * doc/tree-ssa.texi: ... and the example here.
32201 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32203 * config/aarch64/aarch64-sve.md (sdiv_pow2<mode>3): Extend from
32204 SVE_FULL_I to SVE_I. Generate an UNSPEC_PRED_X.
32205 (*sdiv_pow2<mode>3): New pattern.
32206 (@cond_<sve_int_op><mode>): Extend from SVE_FULL_I to SVE_I.
32207 Wrap the ASRD in an UNSPEC_PRED_X.
32208 (*cond_<sve_int_op><mode>_2): Likewise. Replace the UNSPEC_PRED_X
32209 predicate with a constant PTRUE, if it isn't already.
32210 (*cond_<sve_int_op><mode>_z): Replace with...
32211 (*cond_<sve_int_op><mode>_any): ...this new pattern.
32213 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32215 * config/aarch64/aarch64-sve.md (*cond_bic<mode>_2): Extend from
32216 SVE_FULL_I to SVE_I.
32217 (*cond_bic<mode>_any): Likewise.
32219 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32221 * config/aarch64/aarch64-sve.md (<su>mul<mode>3_highpart)
32222 (@aarch64_pred_<MUL_HIGHPART:optab><mode>): Extend from SVE_FULL_I
32225 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32227 * config/aarch64/aarch64-sve.md (<su>abd<mode>_3): Extend from
32228 SVE_FULL_I to SVE_I.
32229 (*aarch64_cond_<su>abd<mode>_2): Likewise.
32230 (*aarch64_cond_<su>abd<mode>_any): Likewise.
32231 (@aarch64_pred_<su>abd<mode>): Likewise. Use UNSPEC_PRED_X
32232 for the max and min but not for the minus.
32233 (*aarch64_cond_<su>abd<mode>_3): New pattern.
32235 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32237 * config/aarch64/iterators.md (SVE_24I): New iterator.
32238 * config/aarch64/aarch64-sve.md (*aarch64_adr<mode>_shift): Extend from
32239 SVE_FULL_SDI to SVE_24I. Use containers rather than elements.
32241 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32243 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_BINARY:optab><mode>)
32244 (*cond_<SVE_INT_BINARY:optab><mode>_2): Extend from SVE_FULL_I
32246 (*cond_<SVE_INT_BINARY:optab><mode>_3): Likewise.
32247 (*cond_<SVE_INT_BINARY:optab><mode>_any): Likewise.
32248 (*cond_<SVE_INT_BINARY:optab><mode>_2_const): Likewise.
32249 (*cond_<SVE_INT_BINARY:optab><mode>_any_const): Likewise.
32251 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32253 * config/aarch64/aarch64-sve.md (<SVE_INT_BINARY_IMM:optab><mode>3)
32254 (@aarch64_pred_<SVE_INT_BINARY_IMM:optab><mode>)
32255 (*post_ra_<SVE_INT_BINARY_IMM:optab><mode>3): Extend from SVE_FULL_I
32258 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32260 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3)
32261 (v<ASHIFT:optab><mode>3, @aarch64_pred_<optab><mode>)
32262 (*post_ra_v<ASHIFT:optab><mode>3): Extend from SVE_FULL_I to SVE_I.
32264 2021-01-11 Martin Liska <mliska@suse.cz>
32267 * symtab-clones.h (clone_info::release): Release
32268 symtab::m_clones with ggc_delete as it's a GGC memory.
32270 2021-01-11 Matthias Klose <doko@ubuntu.com>
32272 * Makefile.in (LINK_PROGRESS): Show the link target.
32274 2021-01-11 Richard Biener <rguenther@suse.de>
32276 PR tree-optimization/91403
32277 * tree-vect-data-refs.c (vect_analyze_group_access_1): Cap
32278 single-element interleaving group size at 4096 elements.
32280 2021-01-11 Richard Biener <rguenther@suse.de>
32282 PR tree-optimization/98526
32283 * tree-vect-loop.c (vect_model_reduction_cost): Remove costing
32284 of the actual reduction op for the regular case.
32285 (vectorizable_reduction): Cost the stmts
32286 vect_transform_reduction produces here.
32288 2021-01-11 Andreas Krebbel <krebbel@linux.ibm.com>
32290 * tree-ssa-forwprop.c (simplify_vector_constructor): For
32291 big-endian, use UNPACK[_FLOAT]_HI.
32293 2021-01-11 Tamar Christina <tamar.christina@arm.com>
32295 * tree-vect-slp-patterns.c (class complex_pattern,
32296 class complex_add_pattern): Add parameters to matches.
32297 (complex_add_pattern::build): Free memory.
32298 (complex_add_pattern::matches): Move validation end of match.
32299 (complex_add_pattern::recognize): Likewise.
32301 2021-01-11 Tamar Christina <tamar.christina@arm.com>
32303 * tree-vect-slp-patterns.c (linear_loads_p): Fix externals.
32305 2021-01-11 Tamar Christina <tamar.christina@arm.com>
32307 * tree-vect-slp-patterns.c (is_linear_load_p): Fix ambiguity.
32309 2021-01-11 Jakub Jelinek <jakub@redhat.com>
32311 PR tree-optimization/95867
32312 * tree-ssa-math-opts.h: New header.
32313 * tree-ssa-math-opts.c: Include tree-ssa-math-opts.h.
32314 (powi_as_mults): No longer static. Use build_one_cst instead of
32315 build_real. Formatting fix.
32316 * tree-ssa-reassoc.c: Include tree-ssa-math-opts.h.
32317 (attempt_builtin_powi): Handle multiplication reassociation without
32318 powi_fndecl using powi_as_mults.
32319 (reassociate_bb): For integral types don't require
32320 -funsafe-math-optimizations to call attempt_builtin_powi.
32322 2021-01-11 Jakub Jelinek <jakub@redhat.com>
32324 PR tree-optimization/95852
32325 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): Change
32326 mul_stmts parameter type to vec<gimple *> &. Before cond_stmt
32327 allow in the bb any of the stmts in that vector, div_stmt and
32328 up to 3 cast stmts.
32329 (arith_cast_equal_p): New function.
32330 (arith_overflow_check_p): Add cast_stmt argument, handle signed
32331 multiply overflow checks.
32332 (match_arith_overflow): Adjust caller. Handle signed multiply
32335 2021-01-11 Jakub Jelinek <jakub@redhat.com>
32337 PR tree-optimization/95852
32338 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): New function.
32339 (uaddsub_overflow_check_p): Renamed to ...
32340 (arith_overflow_check_p): ... this. Handle also multiplication
32341 with overflow check.
32342 (match_uaddsub_overflow): Renamed to ...
32343 (match_arith_overflow): ... this. Add cfg_changed argument. Handle
32344 also multiplication with overflow check. Adjust function comment.
32345 (math_opts_dom_walker::after_dom_children): Adjust callers. Call
32346 match_arith_overflow also for MULT_EXPR.
32348 2021-01-11 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32350 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
32351 __builtin_convertvector.
32352 (vmovl_s16): Likewise.
32353 (vmovl_s32): Likewise.
32354 (vmovl_u8): Likewise.
32355 (vmovl_u16): Likewise.
32356 (vmovl_u32): Likewise.
32357 (vmovn_s16): Likewise.
32358 (vmovn_s32): Likewise.
32359 (vmovn_s64): Likewise.
32360 (vmovn_u16): Likewise.
32361 (vmovn_u32): Likewise.
32362 (vmovn_u64): Likewise.
32364 2021-01-11 Martin Liska <mliska@suse.cz>
32366 * gimple-if-to-switch.cc (struct condition_info): Use auto_var.
32367 (if_chain::is_beneficial): Delete clusters
32368 (find_conditions): Make second argument of conditions_in_bbs a
32369 pointer so that we control over it's lifetime.
32370 (pass_if_to_switch::execute): Delete them.
32372 2021-01-11 Kewen Lin <linkw@linux.ibm.com>
32374 * ira.c (move_unallocated_pseudos): Check other_reg and skip if
32377 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
32379 * config/vax/vax.md (cc): Remove mode attribute.
32380 (subst_<cc>, subst_f<cc>): Rename to...
32381 (subst_<mode>, subst_f<VAXccnz:mode>): ... these respectively.
32382 (*cbranch<VAXint:mode>4_<VAXcc:mode>): Update for `cc' removal.
32383 (*cbranch<VAXfp:mode>4_<VAXccnz:mode>): Likewise.
32384 (*branch_<mode>, *branch_<mode>_reversed): Likewise.
32386 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
32388 * config/vax/vax.md (subst_f<cc>): Add mode to operands and
32389 `const_double_zero'.
32391 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
32393 * config/pdp11/pdp11.md (PDPfp): New mode iterator.
32394 (fcc_cc, fcc_ccnz): Use it. Add mode to `const_double_zero' and
32397 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
32399 * genemit.c (gen_exp) <CONST_DOUBLE>: Handle `const_double_zero'
32401 * read-rtl.c (rtx_reader::read_rtx_code): Handle machine mode
32402 with `const_double_zero'.
32403 * doc/rtl.texi (Constant Expression Types): Document it.
32405 2021-01-09 Jakub Jelinek <jakub@redhat.com>
32408 * tree-cfg.c (verify_gimple_assign_binary): Allow lhs of
32409 POINTER_DIFF_EXPR to be any integral type.
32411 2021-01-09 Jakub Jelinek <jakub@redhat.com>
32413 PR rtl-optimization/98603
32414 * function.c (instantiate_virtual_regs_in_insn): For asm goto
32415 with impossible constraints, drop all SETs, CLOBBERs, drop PARALLEL
32416 if any, set ASM_OPERANDS mode to VOIDmode and change
32417 ASM_OPERANDS_OUTPUT_CONSTRAINT and ASM_OPERANDS_OUTPUT_IDX.
32419 2021-01-09 Alexandre Oliva <oliva@gnu.org>
32422 * final.c (notice_source_line): Narrow down the condition to
32423 skip a line-0 marker.
32425 2021-01-08 Sergei Trofimovich <siarheit@google.com>
32427 * ipa-modref.c (merge_call_side_effects): Fix
32428 linebreak split by reordering two print calls.
32430 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
32432 * config/s390/vector.md (*tf_to_fprx2_0): Rename from
32433 "*mov_tf_to_fprx2_0" for consistency, fix constraint.
32434 (*tf_to_fprx2_1): Rename from "*mov_tf_to_fprx2_1" for
32435 consistency, fix constraint.
32437 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
32439 * config/s390/s390-c.c (s390_def_or_undef_macro): Accept
32440 callables instead of mask values.
32441 (struct target_flag_set_p): New predicate.
32442 (s390_cpu_cpp_builtins_internal): Define or undefine
32443 __LONG_DOUBLE_VX__ macro.
32445 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
32448 * config/i386/i386.c (x86_function_profiler): Use R10 and R11
32449 to call mcount in large model with PIC for NO_PROFILE_COUNTERS
32452 2021-01-08 Richard Biener <rguenther@suse.de>
32454 * tree-ssa-sccvn.c (pass_fre::execute): Reset the SCEV hash table.
32456 2021-01-08 Richard Biener <rguenther@suse.de>
32458 * tree-vect-slp.c (scalar_stmts_to_slp_tree_map_t): Fix.
32459 (vect_build_slp_tree): On cache hit release the matched
32460 scalar stmts vector.
32461 * tree-vect-stmts.c (vectorizable_store): Properly free
32462 vec_oprnds before possibly gathering them again.
32464 2021-01-08 Richard Biener <rguenther@suse.de>
32466 PR tree-optimization/98544
32467 * tree-vect-slp.c (vect_optimize_slp): Always materialize
32468 permutes at a permute node.
32470 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
32473 * config/i386/i386.c (x86_function_profiler): Use R10 to call
32474 mcount in large model. Sorry for large model with PIC.
32476 2021-01-08 Jakub Jelinek <jakub@redhat.com>
32479 * config/i386/i386.opt (ix86_cmodel, ix86_incoming_stack_boundary_arg,
32480 ix86_pmode, ix86_preferred_stack_boundary_arg, ix86_regparm,
32481 ix86_veclibabi_type): Remove x_ prefix, use TargetVariable instead of
32482 TargetSave and initialize for variables with enum types.
32483 (mfentry, mstack-protector-guard-reg=, mstack-protector-guard-offset=,
32484 mstack-protector-guard-symbol=): Add Save.
32485 * config/i386/i386-options.c (ix86_function_specific_save,
32486 ix86_function_specific_restore): Don't save or restore x_ix86_cmodel,
32487 x_ix86_incoming_stack_boundary_arg, x_ix86_pmode,
32488 x_ix86_preferred_stack_boundary_arg, x_ix86_regparm,
32489 x_ix86_veclibabi_type.
32491 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
32493 * config/aarch64/aarch64-sve.md (*cnot<mode>): Extend from
32494 SVE_FULL_I to SVE_I.
32495 (*cond_cnot<mode>_2, *cond_cnot<mode>_any): Likewise.
32497 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
32499 * config/aarch64/aarch64-sve.md (*cond_uxt<mode>_2): Extend from
32500 SVE_FULL_I to SVE_I.
32501 (*cond_uxt<mode>_any): Likewise.
32503 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32505 * config/aarch64/iterators.md (Vwhalf): New iterator.
32506 * config/aarch64/aarch64-simd.md (aarch64_<sur>adalp<mode>_3):
32508 (aarch64_<sur>adalp<mode>): ... This. Make more
32510 (<sur>sadv16qi): Adjust callsite of the above.
32511 * config/aarch64/aarch64-simd-builtins.def (sadalp, uadalp): New
32513 * config/aarch64/arm_neon.h (vpadal_s8): Reimplement using
32515 (vpadal_s16): Likewise.
32516 (vpadal_u8): Likewise.
32517 (vpadal_u16): Likewise.
32518 (vpadalq_s8): Likewise.
32519 (vpadalq_s16): Likewise.
32520 (vpadalq_s32): Likewise.
32521 (vpadalq_u8): Likewise.
32522 (vpadalq_u16): Likewise.
32523 (vpadalq_u32): Likewise.
32525 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32527 * config/aarch64/aarch64-simd.md (aarch64_<su>abd<mode>_3):
32529 (aarch64_<su>abd<mode>): ... This.
32530 (<sur>sadv16qi): Adjust callsite of the above.
32531 * config/aarch64/aarch64-simd-builtins.def (sabd, uabd): Define
32533 * config/aarch64/arm_neon.h (vabd_s8): Reimplement using
32535 (vabd_s16): Likewise.
32536 (vabd_s32): Likewise.
32537 (vabd_u8): Likewise.
32538 (vabd_u16): Likewise.
32539 (vabd_u32): Likewise.
32540 (vabdq_s8): Likewise.
32541 (vabdq_s16): Likewise.
32542 (vabdq_s32): Likewise.
32543 (vabdq_u8): Likewise.
32544 (vabdq_u16): Likewise.
32545 (vabdq_u32): Likewise.
32547 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32549 * config/aarch64/aarch64-simd-builtins.def (saba, uaba): Define
32551 * config/aarch64/arm_neon.h (vaba_s8): Implement using builtin.
32552 (vaba_s16): Likewise.
32553 (vaba_s32): Likewise.
32554 (vaba_u8): Likewise.
32555 (vaba_u16): Likewise.
32556 (vaba_u32): Likewise.
32557 (vabaq_s8): Likewise.
32558 (vabaq_s16): Likewise.
32559 (vabaq_s32): Likewise.
32560 (vabaq_u8): Likewise.
32561 (vabaq_u16): Likewise.
32562 (vabaq_u32): Likewise.
32564 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32566 * config/aarch64/aarch64-simd.md (aba<mode>_3): Rename to...
32567 (aarch64_<su>aba<mode>): ... This. Handle uaba as well.
32568 Change RTL pattern to match.
32570 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
32572 * common/config/riscv/riscv-common.c (riscv_current_subset_list): New.
32573 * config/riscv/riscv-c.c (riscv-subset.h): New.
32574 (INCLUDE_STRING): Define.
32575 (riscv_cpu_cpp_builtins): Add new style architecture extension
32577 * config/riscv/riscv-subset.h (riscv_subset_list::begin): New.
32578 (riscv_subset_list::end): New.
32579 (riscv_current_subset_list): New.
32581 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
32583 * common/config/riscv/riscv-common.c (RISCV_DONT_CARE_VERSION):
32584 Move to riscv-subset.h.
32585 (struct riscv_subset_t): Ditto.
32586 (class riscv_subset_list): Ditto.
32587 * config/riscv/riscv-subset.h (RISCV_DONT_CARE_VERSION): Move
32588 from riscv-common.c.
32589 (struct riscv_subset_t): Ditto.
32590 (class riscv_subset_list): Ditto.
32591 * config/riscv/t-riscv ($(common_out_file)): Add file
32594 2021-01-07 Jakub Jelinek <jakub@redhat.com>
32597 * config/i386/i386.md (*bmi_blsi_<mode>_cmp, *bmi_blsi_<mode>_ccno):
32598 New define_insn patterns.
32600 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
32602 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_UNARY:optab><mode>)
32603 (*cond_<SVE_INT_UNARY:optab><mode>_2): Extend from SVE_FULL_I to SVE_I.
32604 (*cond_<SVE_INT_UNARY:optab><mode>_any): Likewise.
32606 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
32608 PR tree-optimization/98560
32609 * internal-fn.def (IFN_VCONDU, IFN_VCONDEQ): Use type vec_cond.
32610 * internal-fn.c (vec_cond_mask_direct): Get the data mode from
32612 (vec_cond_direct): Likewise argument 2.
32613 (vec_condu_direct, vec_condeq_direct): Delete.
32614 (expand_vect_cond_optab_fn): Rename to...
32615 (expand_vec_cond_optab_fn): ...this, replacing old macro.
32616 (expand_vec_condu_optab_fn, expand_vec_condeq_optab_fn): Delete.
32617 (expand_vect_cond_mask_optab_fn): Rename to...
32618 (expand_vec_cond_mask_optab_fn): ...this, replacing old macro.
32619 (direct_vec_cond_mask_optab_supported_p): Treat the optab as a
32621 (direct_vec_cond_optab_supported_p): Likewise.
32622 (direct_vec_condu_optab_supported_p): Delete.
32623 (direct_vec_condeq_optab_supported_p): Delete.
32624 * gimple-isel.cc: Include internal-fn.h.
32625 (gimple_expand_vec_cond_expr): Check that IFN_VCONDEQ is supported
32628 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
32630 PR tree-optimization/98560
32631 * gimple-isel.cc (gimple_expand_vec_cond_expr): If we fail to use
32632 IFN_VCOND{,U,EQ}, fall back on IFN_VCOND_MASK.
32634 2021-01-07 Uroš Bizjak <ubizjak@gmail.com>
32636 * config/i386/i386.md (insn): Merge from plusminus_insn, shift_insn,
32637 rotate_insn and optab code attributes.
32638 Update all uses to merged code attribute.
32639 * config/i386/sse.md: Update all uses to merged code attribute.
32640 * config/i386/mmx.md: Update all uses to merged code attribute.
32642 2021-01-07 Jakub Jelinek <jakub@redhat.com>
32644 PR tree-optimization/98568
32645 * gimple-ssa-store-merging.c (bswap_view_convert): New function.
32646 (bswap_replace): Use it.
32648 2021-01-06 Vladimir N. Makarov <vmakarov@redhat.com>
32650 PR rtl-optimization/97978
32651 * lra-int.h (lra_hard_reg_split_p): New external.
32652 * lra.c (lra_hard_reg_split_p): New global.
32653 (lra): Set up lra_hard_reg_split_p after splitting a hard reg.
32654 * lra-assigns.c (lra_assign): Don't check allocation correctness
32655 after hard reg splitting.
32657 2021-01-06 Martin Sebor <msebor@redhat.com>
32660 * builtins.c (new_delete_mismatch_p): New overload.
32661 (new_delete_mismatch_p (tree, tree)): Call it.
32663 2021-01-06 Alexandre Oliva <oliva@adacore.com>
32665 * Makefile.in (T_GLIMITS_H): New.
32666 (stmp-int-hdrs): Depend on it, use it.
32667 * config/t-vxworks (T_GLIMITS_H): Override it.
32668 (vxw-glimits.h): New.
32670 2021-01-06 Richard Biener <rguenther@suse.de>
32672 PR tree-optimization/98513
32673 * value-range.cc (intersect_ranges): Compare the upper bounds
32674 for the expected relation.
32676 2021-01-06 Gerald Pfeifer <gerald@pfeifer.com>
32679 2020-12-28 Gerald Pfeifer <gerald@pfeifer.com>
32681 * doc/standards.texi (HSAIL): Remove section.
32683 2021-01-05 Samuel Thibault <samuel.thibault@ens-lyon.org>
32685 * configure: Re-generate.
32687 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32689 * doc/invoke.texi (-std=c++20): Adjust for the publication of
32690 ISO 14882:2020 standard.
32691 * doc/standards.texi: Likewise.
32693 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32695 PR tree-optimization/94802
32696 * expr.h (maybe_optimize_sub_cmp_0): Declare.
32697 * expr.c: Include tree-pretty-print.h and flags.h.
32698 (maybe_optimize_sub_cmp_0): New function.
32699 (do_store_flag): Use it.
32700 * cfgexpand.c (expand_gimple_cond): Likewise.
32702 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
32704 * mux-utils.h (pointer_mux::m_ptr): Tweak description of contents.
32705 * rtlanal.c (simple_regno_set): Tweak description to clarify the
32708 2021-01-05 Richard Biener <rguenther@suse.de>
32710 PR tree-optimization/98516
32711 * tree-vect-slp.c (vect_optimize_slp): Permute the incoming
32712 lanes when materializing on a VEC_PERM node.
32713 (vectorizable_slp_permutation): Dump the permute properly.
32715 2021-01-05 Richard Biener <rguenther@suse.de>
32717 * tree-vect-slp.c (vect_slp_region): Move debug counter
32718 to cover individual subgraphs.
32720 2021-01-05 Richard Biener <rguenther@suse.de>
32722 PR tree-optimization/98428
32723 * tree-vect-slp.c (vect_build_slp_tree_1): Properly reject
32724 vector lane extracts for loop vectorization.
32726 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32728 PR tree-optimization/98514
32729 * tree-ssa-reassoc.c (bb_rank): Change type from long * to
32731 (operand_rank): Change type from hash_map<tree, long> to
32732 hash_map<tree, int64_t>.
32733 (phi_rank): Change return type from long to int64_t.
32734 (loop_carried_phi): Change block_rank variable type from long to
32736 (propagate_rank): Change return type, rank parameter type and
32737 op_rank variable type from long to int64_t.
32738 (find_operand_rank): Change return type from long to int64_t
32739 and change slot variable type from long * to int64_t *.
32740 (insert_operand_rank): Change rank parameter type from long to
32742 (get_rank): Change return type and rank variable type from long to
32743 int64_t. Use PRId64 instead of ld to print the rank.
32744 (init_reassoc): Change rank variable type from long to int64_t
32745 and adjust correspondingly bb_rank and operand_rank initialization.
32747 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32749 PR tree-optimization/96928
32750 * tree-ssa-phiopt.c (xor_replacement): New function.
32751 (tree_ssa_phiopt_worker): Call it.
32753 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32755 PR tree-optimization/96930
32756 * match.pd ((A / (1 << B)) -> (A >> B)): If A is extended
32757 from narrower value which has the same type as 1 << B, perform
32758 the right shift on the narrower value followed by extension.
32760 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32762 PR tree-optimization/96239
32763 * gimple-ssa-store-merging.c (maybe_optimize_vector_constructor): New
32765 (get_status_for_store_merging): Don't return BB_INVALID for blocks
32766 with potential bswap optimizable CONSTRUCTORs.
32767 (pass_store_merging::execute): Optimize vector CONSTRUCTORs with bswap
32770 2021-01-05 Richard Biener <rguenther@suse.de>
32772 PR tree-optimization/98381
32773 * tree.c (vector_element_bits): Properly compute bool vector
32775 * tree-vect-loop.c (vectorizable_live_operation): Properly
32776 compute the last lane bit offset.
32778 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
32781 * config/i386/sse.md (sse_cvtps2pi): Redefine as define_insn_and_split.
32782 Clear the top 64 bytes of the input XMM register.
32783 (sse_cvttps2pi): Ditto.
32785 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
32788 * config/i386/xopintrin.h (_mm256_cmov_si256): New.
32790 2021-01-05 H.J. Lu <hjl.tools@gmail.com>
32793 * config/i386/xmmintrin.h (_mm_extract_pi16): Cast to unsigned
32796 2021-01-05 Claudiu Zissulescu <claziss@synopsys.com>
32798 * config/arc/arc.md (maddsidi4_split): Use ACC_REG_FIRST.
32799 (umaddsidi4_split): Likewise.
32801 2021-01-05 liuhongt <hongtao.liu@intel.com>
32804 * config/i386/sse.md (*sse2_pmovskb_zexthisi): New
32805 define_insn_and_split for zero_extend of subreg HI of pmovskb
32807 (*sse2_pmovskb_zexthisi): Add new combine splitters for
32808 zero_extend of not of subreg HI of pmovskb result.
32810 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
32813 * explow.c (convert_memory_address_addr_space_1): Handle UNSPECs
32815 * config/aarch64/aarch64.c (aarch64_expand_mov_immediate): Use
32816 convert_memory_address to convert symbolic immediates to ptr_mode
32817 before forcing them to memory.
32819 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
32821 PR rtl-optimization/97144
32822 * recog.c (constrain_operands): Initialize matching_operand
32823 for each alternative, rather than only doing it once.
32825 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
32827 PR rtl-optimization/98403
32828 * rtl-ssa/changes.cc (function_info::finalize_new_accesses): Explain
32829 why we don't remove call clobbers.
32830 (function_info::apply_changes_to_insn): Don't attempt to add
32831 call clobbers here.
32833 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
32835 PR tree-optimization/98371
32836 * tree-vect-loop.c (vect_reanalyze_as_main_loop): New function.
32837 (vect_analyze_loop): If an epilogue loop appears to be cheaper
32838 than the main loop, re-analyze it as a main loop before adopting
32841 2021-01-05 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
32844 * configure.ac (NETLIBS): Determine using AX_LIB_SOCKET_NSL.
32845 * aclocal.m4, configure: Regenerate.
32846 * Makefile.in (NETLIBS): Define.
32847 (BACKEND): Remove $(CODYLIB).
32849 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32851 PR rtl-optimization/98334
32852 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
32853 Optimize (X - 1) * Y + Y to X * Y or (X + 1) * Y - Y to X * Y.
32855 2021-01-05 Bernd Edlinger <bernd.edlinger@hotmail.de>
32857 * tree-inline.c (expand_call_inline): Restore input_location.
32858 Return result from recursive call.
32860 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
32862 PR tree-optimization/95401
32863 * config/aarch64/aarch64-sve-builtins.cc
32864 (gimple_folder::load_store_cookie): Use bits rather than bytes
32865 for the alignment argument to IFN_MASK_LOAD and IFN_MASK_STORE.
32866 * gimple-fold.c (gimple_fold_mask_load_store_mem_ref): Likewise.
32867 * tree-vect-stmts.c (vectorizable_store): Likewise.
32868 (vectorizable_load): Likewise.
32870 2021-01-04 Richard Biener <rguenther@suse.de>
32872 PR tree-optimization/98308
32873 * tree-vect-stmts.c (vectorizable_load): Set invariant mask
32876 2021-01-04 Jakub Jelinek <jakub@redhat.com>
32878 PR tree-optimization/95771
32879 * tree-ssa-loop-niter.c (number_of_iterations_popcount): Handle types
32880 with precision smaller than int's precision and types with precision
32881 twice as large as long long. Formatting fixes.
32883 2021-01-04 Richard Biener <rguenther@suse.de>
32885 PR tree-optimization/98464
32886 * tree-ssa-sccvn.c (vn_valueize_for_srt): Rename from ...
32887 (vn_valueize_wrapper): ... this. Temporarily adjust vn_context_bb.
32888 (process_bb): Adjust.
32890 2021-01-04 Matthew Malcomson <matthew.malcomson@arm.com>
32893 * doc/invoke.texi (-fsanitize=address): Fix wording describing
32894 clash with -fsanitize=hwaddress.
32896 2021-01-04 Richard Biener <rguenther@suse.de>
32898 PR tree-optimization/98282
32899 * tree-ssa-sccvn.c (vn_get_stmt_kind): Classify tcc_reference on
32900 invariants as VN_NARY.
32902 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
32905 * config/aarch64/aarch64-simd.md (aarch64_combine<mode>): Accept
32906 aarch64_simd_reg_or_zero for operand 2. Use the combinez patterns
32907 to handle zero operands.
32909 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
32911 * config/aarch64/aarch64.c (offset_6bit_signed_scaled_p): New function.
32912 (offset_6bit_unsigned_scaled_p): Fix typo in comment.
32913 (aarch64_sve_prefetch_operand_p): Accept MUL VLs in the range
32916 2021-01-04 Richard Biener <rguenther@suse.de>
32918 PR tree-optimization/98393
32919 * tree-vect-slp.c (vect_build_slp_tree): Properly zero matches
32920 when hitting the limit.
32922 2021-01-04 Richard Biener <rguenther@suse.de>
32924 PR tree-optimization/98291
32925 * tree-vect-loop.c (vectorizable_reduction): Bypass
32926 associativity check for SLP reductions with VF 1.
32928 2021-01-04 Jakub Jelinek <jakub@redhat.com>
32930 PR tree-optimization/96782
32931 * match.pd (x == ~x -> false, x != ~x -> true): New simplifications.
32933 2021-01-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
32935 * collect-utils.c (collect_execute): Check dumppfx.
32936 * collect2.c (maybe_run_lto_and_relink, do_link): Pass atsuffix
32937 to collect_execute.
32938 (do_link): Add new parameter atsuffix.
32939 (main): Handle -dumpdir option. Skip one argument for
32940 -o, -isystem and -B options.
32941 * gcc.c (make_at_file): New helper function.
32942 (close_at_file): Use it.
32944 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32946 * config/darwin.h (MIN_LD64_NO_COAL_SECTS): Adjust.
32947 Amend handling for LD64_VERSION fallback defaults.
32949 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32951 * config.gcc: Compute default version information
32952 from the configured target. Likewise defaults for
32954 * config/darwin10.h: Removed.
32955 * config/darwin12.h: Removed.
32956 * config/darwin9.h: Removed.
32957 * config/rs6000/darwin8.h: Removed.
32959 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32961 * config/darwin9.h (ASM_OUTPUT_ALIGNED_COMMON): Delete.
32963 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32965 * config/darwin9.h (STACK_CHECK_STATIC_BUILTIN): Move from here..
32966 * config/darwin.h (STACK_CHECK_STATIC_BUILTIN): .. to here.
32968 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32970 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move from
32972 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): ... to here.
32974 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32976 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move the spec
32977 for the Darwin10 unwinder stub from here ...
32978 * config/darwin.h (LINK_COMMAND_SPEC_A): ... to here.
32980 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32982 * config/darwin.h (DSYMUTIL_SPEC): Default to DWARF
32983 (ASM_DEBUG_SPEC):Only define if the assembler supports
32985 (PREFERRED_DEBUGGING_TYPE): Default to DWARF.
32986 (DARWIN_PREFER_DWARF): Define.
32987 * config/darwin9.h (PREFERRED_DEBUGGING_TYPE): Remove.
32988 (DARWIN_PREFER_DWARF): Likewise
32989 (DSYMUTIL_SPEC): Likewise.
32990 (COLLECT_RUN_DSYMUTIL): Likewise.
32991 (ASM_DEBUG_SPEC): Likewise.
32992 (ASM_DEBUG_OPTION_SPEC): Likewise.
32994 2021-01-02 Jan Hubicka <jh@suse.cz>
32996 * cfg.c (free_block): ggc_free bb.
32998 2021-01-01 Jakub Jelinek <jakub@redhat.com>
33000 * gcc.c (process_command): Update copyright notice dates.
33001 * gcov-dump.c (print_version): Ditto.
33002 * gcov.c (print_version): Ditto.
33003 * gcov-tool.c (print_version): Ditto.
33004 * gengtype.c (create_file): Ditto.
33005 * doc/cpp.texi: Bump @copying's copyright year.
33006 * doc/cppinternals.texi: Ditto.
33007 * doc/gcc.texi: Ditto.
33008 * doc/gccint.texi: Ditto.
33009 * doc/gcov.texi: Ditto.
33010 * doc/install.texi: Ditto.
33011 * doc/invoke.texi: Ditto.
33013 2021-01-01 Jakub Jelinek <jakub@redhat.com>
33015 * ChangeLog-2020: Rotate ChangeLog. New file.
33018 Copyright (C) 2021 Free Software Foundation, Inc.
33020 Copying and distribution of this file, with or without modification,
33021 are permitted in any medium without royalty provided the copyright
33022 notice and this notice are preserved.