1 2021-11-21 Jakub Jelinek <jakub@redhat.com>
4 * config/i386/i386-options.c (ix86_valid_target_attribute_p): If
5 fndecl already has DECL_FUNCTION_SPECIFIC_TARGET, use that as base
6 instead of target_option_default_node.
8 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
11 * ipa-modref.c (parm_map_for_arg): Rename to ...
12 (parm_map_for_ptr): .. this one; handle static chain and calls to
14 (modref_access_analysis::get_access): Use parm_map_for_ptr.
15 (modref_access_analysis::process_fnspec): Update.
16 (modref_access_analysis::analyze_load): Update.
17 (modref_access_analysis::analyze_store): Update.
19 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
21 * ipa-modref.c (ignore_nondeterminism_p): Move earlier in source
23 (ignore_retval_p): Likewise.
24 (ignore_stores_p): Likewise.
25 (parm_map_for_arg): Likewise.
26 (class modref_access_analysis): New class.
27 (modref_access_analysis::set_side_effects): New member function.
28 (modref_access_analysis::set_nondeterministic): New member function.
29 (get_access): Turn to ...
30 (modref_access_analysis::get_access): ... this one.
31 (record_access): Turn to ...
32 (modref_access_analysis::record_access): ... this one.
33 (record_access_lto): Turn to ...
34 (modref_access_analysis::record_access_lto): ... This one.
35 (record_access_p): Turn to ...
36 (modref_access_analysis::record_access_p): ... This one
37 (modref_access_analysis::record_unknown_load): New member function.
38 (modref_access_analysis::record_unknown_store): New member function.
39 (get_access_for_fnspec): Turn to ...
40 (modref_access_analysis::get_access_for_fnspec): ... this one.
41 (merge_call_side_effects): Turn to ...
42 (moderf_access_analysis::merge_call_side_effects): Turn to ...
43 (collapse_loads): Move later in source code.
44 (collapse_stores): Move later in source code.
45 (process_fnspec): Turn to ...
46 (modref_access_analysis::process_fnspec): ... this one.
47 (analyze_call): Turn to ...
48 (modref_access_analysis::analyze_call): ... this one.
49 (struct summary_ptrs): Remove.
50 (analyze_load): Turn to ...
51 (modref_access_analysis::analyze_load): ... this one.
52 (analyze_store): Turn to ...
53 (modref_access_analysis::analyze_store): ... this one.
54 (analyze_stmt): Turn to ...
55 (modref_access_analysis::analyze_stmt): ... This one.
56 (remove_summary): Remove.
57 (modref_access_analysis::propagate): Break out from ...
58 (modref_access_analysis::analyze): Break out from ...
59 (analyze_function): ... here.
61 2021-11-21 Roger Sayle <roger@nextmovesoftware.com>
62 Robin Dapp <rdapp@linux.ibm.com>
65 * tree-ssa-math-opts.c (convert_mult_to_widen): Recognize
66 signed WIDEN_MULT_EXPR if the target supports umul_widen_optab.
68 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
71 * ipa-modref.c (ignore_nondeterminism_p): Allow looping pure/cont.
72 (merge_call_side_effects): Improve debug output.
74 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
77 * ipa-pure-const.c (propagate_pure_const): Fix merging of loping flag.
79 2021-11-20 Jeff Law <jeffreyalaw@gmail.com>
81 PR tree-optimization/103226
82 * config/bfin/bfin.md (doloop pattern, splitter and expander): Clobber
85 2021-11-20 Andrew Pinski <apinski@marvell.com>
87 PR tree-optimization/103220
88 * match.pd ((type) X bitop CST): Don't check if CST
89 fits into the type if only the sign changes.
91 2021-11-20 Alexandre Oliva <oliva@adacore.com>
93 PR tree-optimization/102988
94 * gimple-harden-conditionals.cc (detach_value): Copy SSA_NAME
97 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
99 * doc/invoke.texi: Remove whitespace after an @option.
101 2021-11-19 Paul A. Clarke <pc@us.ibm.com>
103 * config/rs6000/emmintrin.h (_mm_sad_epu8): Use vec_absd when
104 _ARCH_PWR9, optimize vec_sum2s when LE.
106 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
109 * config/darwin-driver.c (darwin_driver_init): Handle exported
110 symbols and symbol lists (suppress automatic export of the TLS
112 * config/darwin.c (darwin_rename_builtins): Remove workaround.
113 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): Likewise.
114 (REAL_LIBGCC_SPEC): Handle revised library uses.
115 * config/darwin.opt (nodefaultexport): New.
116 * config/i386/darwin.h (PR80556_WORKAROUND): Remove.
117 * config/i386/darwin32-biarch.h (PR80556_WORKAROUND): Likewise.
118 * config/i386/darwin64-biarch.h (PR80556_WORKAROUND): Likewise.
120 2021-11-19 Martin Jambor <mjambor@suse.cz>
122 * opts.c (default_options_table): Switch off
123 flag_semantic_interposition at Ofast.
124 * doc/invoke.texi (Optimize Options): Document that Ofast switches off
125 -fsemantic-interposition.
127 2021-11-19 Jan Hubicka <hubicka@ucw.cz>
129 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not guard modref
130 by !gimple_call_chain.
132 2021-11-19 Martin Sebor <msebor@redhat.com>
136 * doc/invoke.texi (-Waddress): Update.
138 2021-11-19 Andrew MacLeod <amacleod@redhat.com>
140 PR tree-optimization/103254
141 * gimple-range-gori.cc (range_def_chain::get_def_chain): Limit the
142 depth for all statements with multple ssa names.
144 2021-11-19 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
146 * config/s390/s390.md (define_peephole2): Variable insn points
147 to the first matched insn. Use peep2_next_insn(1) to refer to
148 the second matched insn.
150 2021-11-19 Tamar Christina <tamar.christina@arm.com>
152 PR tree-optimization/103311
154 * tree-vect-slp-patterns.c (vect_validate_multiplication): Fix CONJ
156 (complex_mul_pattern::matches): Move check downwards.
158 2021-11-19 Martin Liska <mliska@suse.cz>
161 2021-11-19 Martin Liska <mliska@suse.cz>
163 * cfgexpand.c (pass_expand::execute): Use option directly.
164 * function.c (allocate_struct_function): Likewise.
165 * gimple-low.c (lower_function_body): Likewise.
166 (lower_stmt): Likewise.
167 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
168 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
169 * ipa-split.c (split_function): Likewise.
170 * lto-streamer-in.c (input_function): Likewise.
171 * sese.c (sese_insert_phis_for_liveouts): Likewise.
172 * ssa-iterators.h (num_imm_uses): Likewise.
173 * tree-cfg.c (make_blocks): Likewise.
174 (gimple_merge_blocks): Likewise.
175 * tree-inline.c (tree_function_versioning): Likewise.
176 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
177 * tree-sra.c (analyze_access_subtree): Likewise.
178 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
179 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
180 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
181 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
182 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
183 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
184 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
185 (target_for_debug_bind): Likewise.
186 (insert_debug_temp_for_var_def): Likewise.
187 (insert_debug_temps_for_defs): Likewise.
188 (reset_debug_uses): Likewise.
189 * tree-ssanames.c (release_ssa_name_fn): Likewise.
190 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
191 (adjust_debug_stmts): Likewise.
192 (adjust_phi_and_debug_stmts): Likewise.
193 (vect_do_peeling): Likewise.
194 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
195 (vect_transform_loop): Likewise.
196 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
197 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
198 (MAY_HAVE_DEBUG_STMTS): Use options directly.
200 2021-11-19 Giuliano Belinassi <gbelinassi@suse.de>
202 * gcc.c (process_command): Skip dumpdir override if file is a
204 * doc/invoke.texi: Update -dumpdir documentation.
206 2021-11-19 Andrew Pinski <apinski@marvell.com>
208 PR tree-optimization/103314
209 * match.pd ((type) X op CST): Restrict the equal
210 TYPE_PRECISION case to GIMPLE only.
212 2021-11-19 Martin Liska <mliska@suse.cz>
215 * ipa-modref-tree.h (struct modref_parm_map): Add default
217 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Use it.
219 2021-11-19 Richard Biener <rguenther@suse.de>
222 * tree-eh.c (operation_could_trap_helper_p): Properly handle
223 fixed-point RDIV_EXPR.
225 2021-11-19 Richard Biener <rguenther@suse.de>
227 PR tree-optimization/102436
228 * tree-ssa-loop-im.c (execute_sm_if_changed): Add mode
229 to just create the if structure and return the then block.
230 (execute_sm): Add flag to indicate the var will re-use
232 (hoist_memory_references): Support a single conditional
233 block with all stores as special case.
235 2021-11-19 Andrew Pinski <apinski@marvell.com>
237 PR tree-optimization/103317
238 * tree-ssa-phiopt.c (minmax_replacement): For the non empty
239 middle bb case, check to make sure it has a single predecessor.
241 2021-11-19 Andrew Pinski <apinski@marvell.com>
243 PR tree-optimization/103257
245 ((m1 >/</>=/<= m2) * d -> (m1 >/</>=/<= m2) ? d : 0):
246 Disable until !canonicalize_math_p ().
248 2021-11-19 Marek Polacek <polacek@redhat.com>
252 * doc/invoke.texi: Update documentation for -Wuninitialized.
253 * tree.c (stabilize_reference): Set location.
255 2021-11-19 liuhongt <hongtao.liu@intel.com>
258 * config/i386/x86-tune-costs.h (skylake_cost): Reduce cost of
259 storing 256/512-bit SSE register to be equal to cost of
260 unaligned store to avoid odd alignment peeling.
261 (icelake_cost): Ditto.
263 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
265 * config/rs6000/predicates.md (current_file_function_operand):
266 Add flag_semantic_interposition to call of decl_replaceable_p.
268 2021-11-18 Martin Liska <mliska@suse.cz>
270 * ipa-modref.c (analyze_function): Do not execute the code
271 only if dump_file != NULL.
273 2021-11-18 Martin Liska <mliska@suse.cz>
275 * ipa-modref.c (analyze_function): Use fnode instead of repeated
276 cgraph_node::get (current_function_decl).
278 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
280 * cgraph.c (cgraph_node::get_availability): Update call of
282 (cgraph_node::verify_node): Verify that semantic_interposition flag
284 * cgraph.h: (symtab_node): Add semantic_interposition flag.
285 * cgraphclones.c (set_new_clone_decl_and_node_flags): Clear
286 semantic_interposition flag.
287 * cgraphunit.c (cgraph_node::finalize_function): Set
288 semantic_interposition flag.
289 (cgraph_node::add_new_function): Likewise.
290 (varpool_node::finalize_decl): Likewise.
291 (cgraph_node::create_wrapper): Likewise.
292 * common.opt (fsemantic-interposition): Turn to optimization node.
293 * lto-cgraph.c (lto_output_node): Stream semantic_interposition.
294 (lto_output_varpool_node): Likewise.
295 (input_overwrite_node): Likewise.
296 (input_varpool_node): Likewise.
297 * symtab.c (symtab_node::dump_base): Dump new flag.
298 * varasm.c (decl_replaceable_p): Add semantic_interposition_p
300 * varasm.h (decl_replaceable_p): Update declaration.
301 * varpool.c (varpool_node::ctor_useable_for_folding_p):
302 Use semantic_interposition flag.
303 (varpool_node::get_availability): Likewise.
304 (varpool_node::create_alias): Copy semantic_interposition flag.
306 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
309 * ipa-modref.c (modref_eaf_analysis::merge_call_lhs_flags): Unused
310 parameter may still be returned.
311 (modref_eaf_analysis::analyze_ssa_name): Call merge_call_lhs_flags
312 even for unused function args.
314 2021-11-18 Tamar Christina <tamar.christina@arm.com>
316 PR tree-optimization/103311
317 * tree-vect-slp-patterns.c (complex_mul_pattern::matches,
318 complex_fms_pattern::matches): Check for multiplications.
320 2021-11-18 H.J. Lu <hjl.tools@gmail.com>
323 * config/i386/i386.c (ix86_output_jmp_thunk_or_indirect): Emit
324 CS prefix for -mindirect-branch-cs-prefix.
325 (ix86_output_indirect_branch_via_reg): Likewise.
326 * config/i386/i386.opt: Add -mindirect-branch-cs-prefix.
327 * doc/invoke.texi: Document -mindirect-branch-cs-prefix.
329 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
331 * config/rs6000/driver-rs6000.c (detect_processor_aix): Add
334 2021-11-18 Tamar Christina <tamar.christina@arm.com>
336 PR tree-optimization/103253
337 * tree-ssa-math-opts.c (convert_mult_to_fma): Check for LHS.
339 2021-11-18 Matthias Kretz <m.kretz@gsi.de>
341 * doc/extend.texi: Document __builtin_assoc_barrier.
343 2021-11-18 Martin Liska <mliska@suse.cz>
345 * cfgexpand.c (pass_expand::execute): Use option directly.
346 * function.c (allocate_struct_function): Likewise.
347 * gimple-low.c (lower_function_body): Likewise.
348 (lower_stmt): Likewise.
349 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
350 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
351 * ipa-split.c (split_function): Likewise.
352 * lto-streamer-in.c (input_function): Likewise.
353 * sese.c (sese_insert_phis_for_liveouts): Likewise.
354 * ssa-iterators.h (num_imm_uses): Likewise.
355 * tree-cfg.c (make_blocks): Likewise.
356 (gimple_merge_blocks): Likewise.
357 * tree-inline.c (tree_function_versioning): Likewise.
358 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
359 * tree-sra.c (analyze_access_subtree): Likewise.
360 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
361 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
362 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
363 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
364 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
365 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
366 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
367 (target_for_debug_bind): Likewise.
368 (insert_debug_temp_for_var_def): Likewise.
369 (insert_debug_temps_for_defs): Likewise.
370 (reset_debug_uses): Likewise.
371 * tree-ssanames.c (release_ssa_name_fn): Likewise.
372 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
373 (adjust_debug_stmts): Likewise.
374 (adjust_phi_and_debug_stmts): Likewise.
375 (vect_do_peeling): Likewise.
376 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
377 (vect_transform_loop): Likewise.
378 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
379 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
380 (MAY_HAVE_DEBUG_STMTS): Use options directly.
382 2021-11-18 Richard Biener <rguenther@suse.de>
384 PR tree-optimization/103277
385 * tree-ssa-dse.c (need_ab_cleanup): New.
386 (dse_optimize_redundant_stores): Adjust.
387 (delete_dead_or_redundant_assignment): Get extra
388 need_ab_cleanup argument and set when abnormal cleanup is
390 (dse_optimize_call): Adjust.
391 (dse_optimize_stmt): Likewise.
392 (pass_dse::execute): Allocate and deallocate need_ab_cleanup.
393 Perform abnormal cleanup.
394 * tree-ssa-dse.h (delete_dead_or_redundant_assignment): Adjust.
396 2021-11-18 Hongyu Wang <hongyu.wang@intel.com>
398 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
399 Adjust generated cfg to avoid infinite loop.
401 2021-11-18 konglin1 <lingling.kong@intel.com>
403 * config/i386/avx512fp16intrin.h (_mm512_mul_pch): Add alias for _mm512_fmul_pch.
404 (_mm512_mask_mul_pch): Likewise.
405 (_mm512_maskz_mul_pch): Likewise.
406 (_mm512_mul_round_pch): Likewise.
407 (_mm512_mask_mul_round_pch): Likewise.
408 (_mm512_maskz_mul_round_pch): Likewise.
409 (_mm512_cmul_pch): Likewise.
410 (_mm512_mask_cmul_pch): Likewise.
411 (_mm512_maskz_cmul_pch): Likewise.
412 (_mm512_cmul_round_pch): Likewise.
413 (_mm512_mask_cmul_round_pch): Likewise.
414 (_mm512_maskz_cmul_round_pch): Likewise.
415 (_mm_mul_sch): Likewise.
416 (_mm_mask_mul_sch): Likewise.
417 (_mm_maskz_mul_sch): Likewise.
418 (_mm_mul_round_sch): Likewise.
419 (_mm_mask_mul_round_sch): Likewise.
420 (_mm_maskz_mul_round_sch): Likewise.
421 (_mm_cmul_sch): Likewise.
422 (_mm_mask_cmul_sch): Likewise.
423 (_mm_maskz_cmul_sch): Likewise.
424 (_mm_cmul_round_sch): Likewise.
425 (_mm_mask_cmul_round_sch): Likewise.
426 (_mm_maskz_cmul_round_sch): Likewise.
427 * config/i386/avx512fp16vlintrin.h (_mm_mul_pch): Likewise.
428 (_mm_mask_mul_pch): Likewise.
429 (_mm_maskz_mul_pch): Likewise.
430 (_mm256_mul_pch): Likewise.
431 (_mm256_mask_mul_pch): Likewise.
432 (_mm256_maskz_mul_pch): Likewise.
433 (_mm_cmul_pch): Likewise.
434 (_mm_mask_cmul_pch): Likewise.
435 (_mm_maskz_cmul_pch): Likewise.
436 (_mm256_cmul_pch): Likewise.
437 (_mm256_mask_cmul_pch): Likewise.
438 (_mm256_maskz_cmul_pch): Likewise.
440 2021-11-17 Andrew Pinski <apinski@marvell.com>
442 PR tree-optimization/103228
443 PR tree-optimization/55177
444 * match.pd ((type) X bitop CST): Also do this
445 transformation for nop conversions.
447 2021-11-17 Martin Sebor <msebor@redhat.com>
449 PR tree-optimization/102759
450 * gimple-array-bounds.cc (build_printable_array_type): Move...
451 * gimple-ssa-warn-access.cc (build_printable_array_type): Avoid
452 pathological function redeclarations that remove a previously
454 Improve formatting of function arguments in informational notes.
455 * pointer-query.cc (build_printable_array_type): ...to here.
456 * pointer-query.h (build_printable_array_type): Declared.
458 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
461 * config/i386/i386-opts.h (harden_sls): New enum.
462 * config/i386/i386.c (output_indirect_thunk): Mitigate against
463 SLS for function return.
464 (ix86_output_function_return): Likewise.
465 (ix86_output_jmp_thunk_or_indirect): Mitigate against indirect
467 (ix86_output_indirect_jmp): Likewise.
468 (ix86_output_call_insn): Likewise.
469 * config/i386/i386.opt: Add -mharden-sls=.
470 * doc/invoke.texi: Document -mharden-sls=.
472 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
475 * config/i386/i386.c (ix86_code_end): Remove "%!" before ret.
476 (ix86_output_function_return): Likewise.
477 * config/i386/i386.md (simple_return_pop_internal): Likewise.
479 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
482 * ipa-modref.c (read_modref_records): Fix streaminig in of every_access
485 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
487 * config/i386/i386.c (indirect_thunks_used): Redefine as HARD_REG_SET.
488 (ix86_code_end): Use TEST_HARD_REG_BIT on indirect_thunks_used.
489 (ix86_output_indirect_branch_via_reg): Use SET_HARD_REG_BIT
490 on indirect_thunks_used.
491 (ix86_output_indirect_function_return): Ditto.
493 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
495 * ipa-modref-tree.c: Include cgraph.h and tree-streamer.h.
496 (modref_access_node::stream_out): New member function.
497 (modref_access_node::stream_in): New member function.
498 * ipa-modref-tree.h (modref_access_node::stream_out,
499 modref_access_node::stream_in): Declare.
500 * ipa-modref.c (modref_summary_lto::useful_p): Free useless kills.
501 (modref_summary_lto::dump): Dump kills.
502 (analyze_store): Record kills for LTO
503 (analyze_stmt): Likewise.
504 (modref_summaries_lto::duplicate): Duplicate kills.
505 (write_modref_records): Use new stream_out member function.
506 (read_modref_records): Likewise.
507 (modref_write): Stream out kills.
508 (read_section): Stream in kills
509 (remap_kills): New function.
510 (update_signature): Use it.
512 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
514 * config/i386/i386.h (LEGACY_SSE_REGNO_P): New predicate.
515 (SSE_REGNO_P): Use LEGACY_SSE_REGNO_P predicate.
516 * config/i386/i386.c (zero_all_vector_registers):
517 Use LEGACY_SSE_REGNO_P predicate.
518 (ix86_register_priority): Use REX_INT_REGNO_P, REX_SSE_REGNO_P
519 and EXT_REG_SSE_REGNO_P predicates.
520 (ix86_hard_regno_call_part_clobbered): Use REX_SSE_REGNO_P
521 and LEGACY_SSE_REGNO_P predicates.
523 2021-11-17 Jason Merrill <jason@redhat.com>
525 * doc/invoke.texi (C++ Dialect Options): Document
526 -fimplicit-constexpr.
528 2021-11-17 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
530 * config/aarch64/aarch64-modes.def (VECTOR_MODE): New V8DI mode.
531 * config/aarch64/aarch64.c (aarch64_hard_regno_mode_ok): Handle
533 * config/aarch64/iterators.md (define_mode_attr nunits): Add entry
536 2021-11-17 Martin Uecker <uecker@gcc.gnu.org>
540 * gimplify.c (gimplify_var_or_parm_decl): Update comment.
541 (gimplify_compound_lval): Gimplify base expression first.
542 (gimplify_target_expr): Add comment.
544 2021-11-17 Jakub Jelinek <jakub@redhat.com>
546 PR tree-optimization/103192
547 * tree-ssa-loop-im.c (move_computations_worker): Use
548 reset_flow_sensitive_info instead of manually clearing
549 SSA_NAME_RANGE_INFO and do it for all SSA_NAMEs, not just ones
552 2021-11-17 Jakub Jelinek <jakub@redhat.com>
554 PR tree-optimization/103255
555 * gimple-range-fold.cc (fold_using_range::range_of_address): Return
556 range_nonzero rather than unadjusted base's range. Formatting fixes.
558 2021-11-17 Richard Sandiford <richard.sandiford@arm.com>
560 * doc/md.texi (cond_fmin@var{mode}, cond_fmax@var{mode}): Document.
561 * optabs.def (cond_fmin_optab, cond_fmax_optab): New optabs.
562 * internal-fn.def (COND_FMIN, COND_FMAX): New functions.
563 * internal-fn.c (first_commutative_argument): Handle them.
564 (FOR_EACH_COND_FN_PAIR): Likewise.
565 * match.pd (UNCOND_BINARY, COND_BINARY): Likewise.
566 * config/aarch64/aarch64-sve.md (cond_<fmaxmin><mode>): New
569 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
571 * config/i386/i386.md (*add<dwi>3_doubleword, *addv<dwi>4_doubleword,
572 *addv<dwi>4_doubleword_1, *sub<dwi>3_doubleword,
573 *subv<dwi>4_doubleword, *subv<dwi>4_doubleword_1,
574 *add<dwi>3_doubleword_cc_overflow_1, *divmodsi4_const,
575 *neg<dwi>2_doubleword, *tls_dynamic_gnu2_combine_64_<mode>): Fix split
578 2021-11-17 Andrew Pinski <apinski@marvell.com>
580 PR tree-optimization/103288
581 * tree-ssa-phiopt.c (value_replacement): Return early if middle
582 block has more than one pred.
584 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
586 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
587 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
588 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
589 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
590 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
591 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
592 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
593 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
595 2021-11-17 Marek Polacek <polacek@redhat.com>
597 PR preprocessor/103026
598 * doc/invoke.texi: Document -Wbidi-chars.
600 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
603 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Fix clearing
606 2021-11-16 Jan Hubicka <jh@suse.cz>
608 * ipa-modref.c (get_modref_function_summary): Declare.
609 * ipa-modref.h (get_modref_function_summary): New function.
610 * tree-ssa-dse.c (clear_live_bytes_for_ref): Break out from ...
611 (clear_bytes_written_by): ... here; also clear memory killed by
614 2021-11-16 Iain Sandoe <iain@sandoe.co.uk>
616 * ggc-common.c (gt_pch_save): If we cannot find a suitable
617 memory segment for save, then error-out, do not try to
619 (gt_pch_restore): Save the existing line table, and when
620 the replacement is being read, use that when constructing
623 2021-11-16 Peter Bergner <bergner@linux.ibm.com>
626 * config/rs6000/mma.md (*vsx_assemble_pair): Add early-clobber for
628 (*mma_assemble_acc): Likewise.
630 2021-11-16 Martin Sebor <msebor@redhat.com>
632 PR tree-optimization/102960
633 * gimple-fold.c (get_range_strlen): Take bitmap as an argument rather
634 than a pointer to it.
635 (get_range_strlen_tree): Same. Remove bitmap allocation. Use
637 (get_maxval_strlen): Use an auto_bitmap.
638 * tree-ssa-strlen.c (get_range_strlen_dynamic): Factor out PHI
640 (get_range_strlen_phi): ...into this function.
641 Avoid assuming maximum string length is constant
642 (printf_strlen_execute): Dump pointer query cache contents when
643 details are requisted.
645 2021-11-16 Jason Merrill <jason@redhat.com>
647 * langhooks.h (struct lang_hooks): Adjust comment.
648 * print-tree.c (print_node): Also call print_xnode hook for
651 2021-11-16 Andrew Pinski <apinski@marvell.com>
653 PR tree-optimization/103218
654 * match.pd: New pattern for "((type)(a<0)) << SIGNBITOFA".
656 2021-11-16 Claudiu Zissulescu <claziss@synopsys.com>
658 * config/arc/arc.md (maddhisi4): Use a single move to accumulator.
659 (umaddhisi4): Likewise.
660 (machi): Update pattern.
663 2021-11-16 Richard Biener <rguenther@suse.de>
665 PR tree-optimization/102880
666 * tree-ssa-dce.c (sort_phi_args): New function.
667 (make_forwarders_with_degenerate_phis): Likewise.
668 (perform_tree_ssa_dce): Call
669 make_forwarders_with_degenerate_phis.
671 2021-11-16 Richard Biener <rguenther@suse.de>
673 PR tree-optimization/102880
674 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Push
675 single_pred (bb1) condition to places that really need it.
676 (match_simplify_replacement): Likewise.
677 (value_replacement): Likewise.
678 (replace_phi_edge_with_variable): Deal with extra edges
681 2021-11-16 Martin Jambor <mjambor@suse.cz>
683 * cfgexpand.c (expand_gimple_basic_block): Use build_debug_expr_decl,
684 add a fixme note about the mode assignment perhaps being unnecessary.
685 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
687 (ipa_param_body_adjustments::mark_dead_statements): Likewise.
688 (ipa_param_body_adjustments::reset_debug_stmts): Likewise.
689 * tree-inline.c (remap_ssa_name): Likewise.
690 (tree_function_versioning): Likewise.
691 * tree-into-ssa.c (rewrite_debug_stmt_uses): Likewise.
692 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
693 * tree-ssa.c (insert_debug_temp_for_var_def): Likewise.
695 2021-11-16 Jakub Jelinek <jakub@redhat.com>
697 PR tree-optimization/103208
698 * omp-expand.c (expand_omp_build_cond): New function.
699 (expand_omp_for_init_counts, expand_omp_for_init_vars,
700 expand_omp_for_static_nochunk, expand_omp_for_static_chunk): Use it.
702 2021-11-16 Jakub Jelinek <jakub@redhat.com>
704 PR tree-optimization/102009
705 * gimple-ssa-warn-access.cc (pass_waccess::check_alloc_size_call):
706 Punt if any of alloc_size arguments is out of bounds vs. number of
709 2021-11-16 Roger Sayle <roger@nextmovesoftware.com>
711 * config/i386/i386.md (*bmi2_rorx<mode3>_1): Make conditional
712 on !optimize_function_for_size_p.
713 (*<any_rotate><mode>3_1): Add preferred_for_size attribute.
714 (define_splits): Conditionalize on !optimize_function_for_size_p.
715 (*bmi2_rorxsi3_1_zext): Likewise.
716 (*<any_rotate>si2_1_zext): Add preferred_for_size attribute.
717 (define_splits): Conditionalize on !optimize_function_for_size_p.
719 2021-11-16 Jan Hubicka <jh@suse.cz>
722 * ipa-modref.c (merge_call_side_effects): Fix uninitialized
725 2021-11-16 Andrew Pinski <apinski@marvell.com>
727 PR tree-optimization/103245
728 * match.pd: Combine the abs pattern matching using multiplication.
729 Adding optional nop_convert too.
731 2021-11-16 H.J. Lu <hjl.tools@gmail.com>
734 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Add a missing
737 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
739 * gimple-fold.c (gimple_fold_builtin_strncat): Use ranges to
740 determine if it is safe to transform to strcat.
741 (gimple_fold_builtin_snprintf): Likewise.
743 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
745 * gimple-fold.c (known_lower): New function.
746 (gimple_fold_builtin_strncat_chk,
747 gimple_fold_builtin_memory_chk, gimple_fold_builtin_stxcpy_chk,
748 gimple_fold_builtin_stxncpy_chk,
749 gimple_fold_builtin_snprintf_chk,
750 gimple_fold_builtin_sprintf_chk): Use it.
752 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
754 * gimple-fold.c (dump_transformation): New function.
755 (gimple_fold_builtin_stxcpy_chk,
756 gimple_fold_builtin_stxncpy_chk): Use it. Simplify to
757 BUILT_IN_STRNCPY if return value is not used.
759 2021-11-15 H.J. Lu <hjl.tools@gmail.com>
762 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Check optab
763 before transforming equivalent, but slighly different cases to
764 their canonical forms.
766 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
769 * config/darwin.h (TARGET_DTORS_FROM_CXA_ATEXIT): New.
770 * doc/tm.texi: Regenerated.
771 * doc/tm.texi.in: Add TARGET_DTORS_FROM_CXA_ATEXIT hook.
772 * ipa.c (cgraph_build_static_cdtor_1): Return the built
774 (build_cxa_atexit_decl): New.
775 (build_dso_handle_decl): New.
776 (build_cxa_dtor_registrations): New.
777 (compare_cdtor_tu_order): New.
778 (build_cxa_atexit_fns): New.
779 (ipa_cdtor_merge): If dtors_from_cxa_atexit is set,
780 process the DTORs/CTORs accordingly.
781 (pass_ipa_cdtor_merge::gate): Also run if
782 dtors_from_cxa_atexit is set.
783 * target.def (dtors_from_cxa_atexit): New hook.
785 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
787 * config.in: Regenerate.
788 * configure: Regenerate.
789 * configure.ac: Test ld64 for -platform-version support.
791 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
793 * diagnostic-spec.h (typedef xint_hash_t)
794 (typedef xint_hash_map_t): Replace with...
795 (typedef nowarn_map_t): ... this.
796 (nowarn_map): Adjust.
797 * diagnostic-spec.c (nowarn_map, suppress_warning_at): Likewise.
799 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
801 * profile.c (branch_prob): Use 'location_hash' for
804 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
806 PR tree-optimization/103207
807 * value-range.cc (irange::set): Drop overflow.
809 2021-11-15 Jakub Jelinek <jakub@redhat.com>
811 * gimplify.c (optimize_target_teams): Only add OMP_CLAUSE_THREAD_LIMIT
812 to OMP_TARGET_CLAUSES if it isn't there already.
814 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
816 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
818 (path_range_query::ssa_defined_in_bb): New.
819 (path_range_query::ssa_range_in_phi): Avoid fold_range call that
820 could trigger additional lookups.
821 Do not use the cache for ARGs defined in this block.
822 (path_range_query::compute_ranges_in_block): Use ssa_defined_in_bb.
823 (path_range_query::maybe_register_phi_relation): Same.
824 (path_range_query::range_of_stmt): Adjust comment.
825 * gimple-range-path.h (ssa_defined_in_bb): New.
827 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
829 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
830 Default to global range if nothing found.
832 2021-11-15 Richard Biener <rguenther@suse.de>
834 PR tree-optimization/103237
835 * tree-vect-loop.c (vect_is_simple_reduction): Fail for
836 double reductions with multiple inner loop LC PHI nodes.
838 2021-11-15 Hongyu Wang <hongyu.wang@intel.com>
841 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
843 * config/i386/i386-options.c (ix86_target_string): Add
844 -mrelax-cmpxchg-loop flag.
845 (ix86_valid_target_attribute_inner_p): Likewise.
846 * config/i386/i386-protos.h (ix86_expand_atomic_fetch_op_loop):
847 New expand function prototype.
848 * config/i386/i386.opt: Add -mrelax-cmpxchg-loop.
849 * config/i386/sync.md (atomic_fetch_<logic><mode>): New expander
851 (atomic_<logic>_fetch<mode>): Likewise.
852 (atomic_fetch_nand<mode>): Likewise.
853 (atomic_nand_fetch<mode>): Likewise.
854 (atomic_fetch_<logic><mode>): New expander for DI,TI modes.
855 (atomic_<logic>_fetch<mode>): Likewise.
856 (atomic_fetch_nand<mode>): Likewise.
857 (atomic_nand_fetch<mode>): Likewise.
858 * doc/invoke.texi: Document -mrelax-cmpxchg-loop.
860 2021-11-15 Richard Biener <rguenther@suse.de>
862 PR tree-optimization/103219
863 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Use single_exit
864 to determine the exit for the VN region.
866 2021-11-15 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
868 * tree-ssa-loop.c (pass_vectorize): Move to tree-vectorizer.c.
869 (pass_data_vectorize): Likewise.
870 (make_pass_vectorize): Likewise.
871 * tree-vectorizer.c (vectorize_loops): Merge with
872 pass_vectorize::execute and replace cfun occurences with fun param.
873 (adjust_simduid_builtins): Add fun param, replace cfun occurences with
874 fun, and adjust callers approrpiately.
875 (note_simd_array_uses): Likewise.
876 (vect_loop_dist_alias_call): Likewise.
877 (set_uid_loop_bbs): Likewise.
878 (vect_transform_loops): Likewise.
879 (try_vectorize_loop_1): Likewise.
880 (try_vectorize_loop): Likewise.
882 2021-11-15 Jakub Jelinek <jakub@redhat.com>
885 * config/i386/sync.md (atomic_bit_test_and_set<mode>,
886 atomic_bit_test_and_complement<mode>,
887 atomic_bit_test_and_reset<mode>): Use OPTAB_WIDEN instead of
890 2021-11-15 Maciej W. Rozycki <macro@embecosm.com>
892 * config/vax/vax.c (vax_lra_p): New prototype and function.
893 (TARGET_LRA_P): Wire it.
894 * config/vax/vax.opt (mlra): New option.
895 * doc/invoke.texi (Option Summary, VAX Options): Document the
898 2021-11-14 Jan Hubicka <jh@suse.cz>
900 * ipa-modref.h (struct modref_summary): Add nondeterministic
901 and calls_interposable flags.
902 * ipa-modref.c (modref_summary::modref_summary): Initialize new flags.
903 (modref_summary::useful_p): Check new flags.
904 (struct modref_summary_lto): Add nondeterministic and
905 calls_interposable flags.
906 (modref_summary_lto::modref_summary_lto): Initialize new flags.
907 (modref_summary_lto::useful_p): Check new flags.
908 (modref_summary::dump): Dump new flags.
909 (modref_summary_lto::dump): Dump new flags.
910 (ignore_nondeterminism_p): New function.
911 (merge_call_side_effects): Merge new flags.
912 (process_fnspec): Likewise.
913 (analyze_load): Volatile access is nondeterministic.
914 (analyze_store): Liekwise.
915 (analyze_stmt): Volatile ASM is nondeterministic.
916 (analyze_function): Clear new flags.
917 (modref_summaries::duplicate): Duplicate new flags.
918 (modref_summaries_lto::duplicate): Duplicate new flags.
919 (modref_write): Stream new flags.
920 (read_section): Stream new flags.
921 (propagate_unknown_call): Update new flags.
922 (modref_propagate_in_scc): Propagate new flags.
923 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Check
925 * tree-ssa-structalias.c (determine_global_memory_access):
928 2021-11-14 Maciej W. Rozycki <macro@embecosm.com>
930 * config/vax/vax.h (SET_RATIO): New macro.
931 * config/vax/vax.md (UNSPEC_SETMEM_FILL): New constant.
932 (setmemhi): New expander.
933 (setmemhi1): New insn and splitter.
934 (*setmemhi1): New insn.
936 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
938 * ipa-modref-tree.c (modref_access_node::update_for_kills): New
940 (modref_access_node::merge_for_kills): Likewise.
941 (modref_access_node::insert_kill): Likewise.
942 * ipa-modref-tree.h (modref_access_node::update_for_kills,
943 modref_access_node::merge_for_kills, modref_access_node::insert_kill):
945 (modref_access_node::useful_for_kill): New member function.
946 * ipa-modref.c (modref_summary::useful_p): Release useless kills.
947 (lto_modref_summary): Add kills.
948 (modref_summary::dump): Dump kills.
949 (record_access): Add mdoref_access_node parameter.
950 (record_access_lto): Likewise.
951 (merge_call_side_effects): Merge kills.
952 (analyze_call): Add ALWAYS_EXECUTED param and pass it around.
953 (struct summary_ptrs): Add always_executed filed.
954 (analyze_load): Update.
955 (analyze_store): Update; record kills.
956 (analyze_stmt): Add always_executed; record kills in clobbers.
957 (analyze_function): Track always_executed.
958 (modref_summaries::duplicate): Duplicate kills.
959 (update_signature): Release kills.
960 * ipa-modref.h (struct modref_summary): Add kills.
961 * tree-ssa-alias.c (alias_stats): Add kill stats.
962 (dump_alias_stats): Dump kill stats.
963 (store_kills_ref_p): Break out from ...
964 (stmt_kills_ref_p): Use it; handle modref info based kills.
966 2021-11-14 Aldy Hernandez <aldyh@redhat.com>
968 PR tree-optimization/103229
969 * gimple-range-cache.cc (ssa_global_cache::clear): Do not pass
970 null value to memset.
972 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
974 * ipa-modref-tree.c (modref_access_node::get_call_arg): New member
976 (modref_access_node::get_ao_ref): Likewise.
977 * ipa-modref-tree.h (modref_access_node::get_call_arg): Declare.
978 (modref_access_node::get_ao_ref): Declare.
979 * tree-ssa-alias.c (modref_may_conflict): Use new accessors.
980 * tree-ssa-dse.c (dse_optimize_call): Use new accessors.
982 2021-11-13 Jan Hubicka <jh@suse.cz>
985 * dbgcnt.def (ipa_attr): New counters.
986 * ipa-pure-const.c: Include dbgcnt.c
987 (ipa_make_function_const): Use debug counter.
988 (ipa_make_function_pure): Likewise.
989 (propagate_pure_const): Fix bug in my previous change.
991 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
993 * ipa-modref-tree.c (modref_access_node::range_info_useful_p):
994 Offline from ipa-modref-tree.h.
995 (modref_access_node::dump): Move from ipa-modref.c; make member
997 * ipa-modref-tree.h (modref_access_node::range_info_useful_p.
998 modref_access_node::dump): Declare.
999 * ipa-modref.c (dump_access): Remove.
1000 (dump_records): Update.
1001 (dump_lto_records): Update.
1002 (record_access): Update.
1003 (record_access_lto): Update.
1005 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
1007 * ipa-modref.c (modref_summary::modref_summary): Clear new flags.
1008 (modref_summary::dump): Dump try_dse.
1009 (modref_summary::finalize): Add FUN attribute; compute try-dse.
1010 (analyze_function): Update.
1011 (read_section): Update.
1012 (update_signature): Update.
1013 (pass_ipa_modref::execute): Update.
1014 * ipa-modref.h (struct modref_summary):
1015 * tree-ssa-alias.c (ao_ref_init_from_ptr_and_range): Export.
1016 * tree-ssa-alias.h (ao_ref_init_from_ptr_and_range): Declare.
1017 * tree-ssa-dse.c (dse_optimize_call): New function.
1018 (dse_optimize_stmt): Use it.
1020 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
1022 * ipa-modref-tree.c: Move #if CHECKING_P to proper place.
1024 2021-11-13 Jan Hubicka <jh@suse.cz>
1027 (struct modref_access_node): Move longer member functions to
1029 (modref_ref_node::try_merge_with): Turn into modreef_acces_node member
1031 * ipa-modref-tree.c (modref_access_node::contains): Move here
1032 from ipa-modref-tree.h.
1033 (modref_access_node::update): Likewise.
1034 (modref_access_node::merge): Likewise.
1035 (modref_access_node::closer_pair_p): Likewise.
1036 (modref_access_node::forced_merge): Likewise.
1037 (modref_access_node::update2): Likewise.
1038 (modref_access_node::combined_offsets): Likewise.
1039 (modref_access_node::try_merge_with): Likewise.
1040 (modref_access_node::insert): Likewise.
1042 2021-11-13 Jan Hubicka <jh@suse.cz>
1044 * ipa-modref.c (modref_summary::global_memory_read_p): Remove.
1045 (modref_summary::global_memory_written_p): Remove.
1046 (modref_summary::dump): Dump new flags.
1047 (modref_summary::finalize): New member function.
1048 (analyze_function): Call it.
1049 (read_section): Call it.
1050 (update_signature): Call it.
1051 (pass_ipa_modref::execute): Call it.
1052 * ipa-modref.h (struct modref_summary): Remove
1053 global_memory_read_p and global_memory_written_p.
1054 Add global_memory_read, global_memory_written.
1055 * tree-ssa-structalias.c (determine_global_memory_access):
1058 2021-11-13 Jan Hubicka <jh@suse.cz>
1060 * ipa-fnsummary.c (compute_fn_summary): Use type_attribut_allowed_p
1061 * ipa-param-manipulation.c
1062 (ipa_param_adjustments::type_attribute_allowed_p):
1063 New member function.
1064 (drop_type_attribute_if_params_changed_p): New function.
1065 (build_adjusted_function_type): Use it.
1066 * ipa-param-manipulation.h: Add type_attribute_allowed_p.
1068 2021-11-13 David Malcolm <dmalcolm@redhat.com>
1070 * doc/invoke.texi (Static Analyzer Options): Add
1071 -Wno-analyzer-tainted-allocation-size,
1072 -Wno-analyzer-tainted-divisor, -Wno-analyzer-tainted-offset, and
1073 -Wno-analyzer-tainted-size to list. Add
1074 -Wanalyzer-tainted-allocation-size, -Wanalyzer-tainted-divisor,
1075 -Wanalyzer-tainted-offset, and -Wanalyzer-tainted-size to list
1076 of options effectively enabled by -fanalyzer.
1077 (-Wanalyzer-tainted-allocation-size): New.
1078 (-Wanalyzer-tainted-array-index): Tweak wording; add link to CWE.
1079 (-Wanalyzer-tainted-divisor): New.
1080 (-Wanalyzer-tainted-offset): New.
1081 (-Wanalyzer-tainted-size): New.
1083 2021-11-13 Jan Hubicka <jh@suse.cz>
1085 * attr-fnspec.h (attr_fnspec::arg_eaf_flags): Break out from ...
1086 * gimple.c (gimple_call_arg_flags): ... here.
1087 * ipa-modref.c (analyze_parms): Record flags known from fnspec.
1088 (modref_merge_call_site_flags): Use arg_eaf_flags.
1090 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
1092 PR tree-optimization/103222
1093 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
1095 (path_range_query::compute_ranges_in_block): Call
1096 compute_ranges_in_phis.
1097 * gimple-range-path.h (path_range_query::compute_ranges_in_phis):
1100 2021-11-13 Jan Hubicka <jh@suse.cz>
1102 * ipa-fnsummary.c (compute_fn_summary): Do not give up on signature
1103 changes on "fn spec" attribute; give up on varadic types.
1104 * ipa-param-manipulation.c: Include attribs.h.
1105 (build_adjusted_function_type): New parameter ARG_MODIFIED; if it is
1106 true remove "fn spec" attribute.
1107 (ipa_param_adjustments::build_new_function_type): Update.
1108 (ipa_param_body_adjustments::modify_formal_parameters): update.
1109 * ipa-sra.c: Include attribs.h.
1110 (ipa_sra_preliminary_function_checks): Do not check for TYPE_ATTRIBUTES.
1112 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
1114 * gimple-range-path.cc (path_range_query::path_range_query): Merge
1116 (path_range_query::import_p): Move from header file.
1117 (path_range_query::~path_range_query): Adjust for combined ctors.
1118 * gimple-range-path.h: Merge ctors.
1119 (path_range_query::import_p): Move to .cc file.
1121 2021-11-13 Jan Hubicka <jh@suse.cz>
1123 PR tree-optimization/103182
1124 * ipa-modref.c (callee_to_caller_flags): Fix merging of flags.
1125 (modref_eaf_analysis::analyze_ssa_name): Fix merging of flags.
1127 2021-11-12 Stafford Horne <shorne@gmail.com>
1129 * config/or1k/or1k-protos.h (or1k_profile_hook): New function.
1130 * config/or1k/or1k.h (PROFILE_HOOK): Change macro to reference
1131 new function or1k_profile_hook.
1132 * config/or1k/or1k.c (struct machine_function): Add new field
1133 set_mcount_arg_insn.
1134 (or1k_profile_hook): New function.
1135 (or1k_init_pic_reg): Update to inject pic rtx after _mcount arg
1137 (or1k_frame_pointer_required): Frame pointer no longer needed
1140 2021-11-12 Jan Hubicka <jh@suse.cz>
1142 PR tree-optimization/103209
1143 * tree-ssa-structalias.c (find_func_aliases_for_call): Fix
1144 use of handle_rhs_call
1146 2021-11-12 Aldy Hernandez <aldyh@redhat.com>
1148 PR tree-optimization/103202
1149 * gimple-range-path.cc
1150 (path_range_query::compute_ranges_in_block): Solve PHI imports first.
1152 2021-11-12 Jan Hubicka <jh@suse.cz>
1154 * ipa-pure-const.c (propagate_pure_const): Remove redundant check;
1155 fix call of ipa_make_function_const and ipa_make_function_pure.
1157 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1159 * config/aarch64/aarch64.c (aarch64_vector_op::n_advsimd_ops): Delete.
1160 (aarch64_vector_op::m_seen_loads): Likewise.
1161 (aarch64_vector_costs::aarch64_vector_costs): Don't push to
1163 (aarch64_vector_op::count_ops): Remove vectype and factor parameters.
1164 Remove code that tries to predict different vec_flags from the
1166 (aarch64_vector_costs::add_stmt_cost): Update accordingly.
1167 Remove m_advsimd_ops handling.
1169 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1171 * config/aarch64/aarch64.c (aarch64_vector_costs::m_saw_sve_only_op)
1172 (aarch64_sve_only_stmt_p): Delete.
1173 (aarch64_vector_costs::prefer_unrolled_loop): New function,
1174 extracted from adjust_body_cost.
1175 (aarch64_vector_costs::better_main_loop_than_p): New function,
1176 using heuristics extracted from adjust_body_cost and
1177 adjust_body_cost_sve.
1178 (aarch64_vector_costs::adjust_body_cost_sve): Remove
1179 advsimd_cycles_per_iter and could_use_advsimd parameters.
1180 Update after changes above.
1181 (aarch64_vector_costs::adjust_body_cost): Update after changes above.
1183 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1185 * config/aarch64/aarch64.c (aarch64_vec_op_count::m_vf_factor):
1186 New member variable.
1187 (aarch64_vec_op_count::aarch64_vec_op_count): Add a parameter for it.
1188 (aarch64_vec_op_count::vf_factor): New function.
1189 (aarch64_vector_costs::aarch64_vector_costs): When costing for
1190 neoverse-512tvb, pass a vf_factor of 2 for the Neoverse V1 version
1192 (aarch64_vector_costs::adjust_body_cost): Read the vf factor
1193 instead of hard-coding 2.
1195 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1197 * config/aarch64/aarch64.c
1198 (aarch64_vec_op_count::rename_cycles_per_iter): New function.
1199 (aarch64_vec_op_count::min_nonpred_cycles_per_iter): Likewise.
1200 (aarch64_vec_op_count::min_pred_cycles_per_iter): Likewise.
1201 (aarch64_vec_op_count::min_cycles_per_iter): Likewise.
1202 (aarch64_vec_op_count::dump): Move earlier in file. Dump the
1203 above properties too.
1204 (aarch64_estimate_min_cycles_per_iter): Delete.
1205 (adjust_body_cost): Use aarch64_vec_op_count::min_cycles_per_iter
1206 instead of aarch64_estimate_min_cycles_per_iter. Rely on the dump
1207 routine to print CPI estimates.
1208 (adjust_body_cost_sve): Likewise. Use the other functions above
1209 instead of doing the work inline.
1211 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1213 * config/aarch64/aarch64.c (aarch64_vec_op_count): Allow default
1215 (aarch64_vec_op_count::base_issue_info): Remove handling of null
1217 (aarch64_vec_op_count::simd_issue_info): Likewise.
1218 (aarch64_vec_op_count::sve_issue_info): Likewise.
1219 (aarch64_vector_costs::m_ops): Turn into a vector.
1220 (aarch64_vector_costs::m_advsimd_ops): Likewise.
1221 (aarch64_vector_costs::aarch64_vector_costs): Add entries to
1222 the vectors based on aarch64_tune_params.
1223 (aarch64_vector_costs::analyze_loop_vinfo): Update the pred_ops
1224 of all entries in m_ops.
1225 (aarch64_vector_costs::add_stmt_cost): Call count_ops for all
1227 (aarch64_estimate_min_cycles_per_iter): Remove issue_info
1228 parameter and get the information from the ops instead.
1229 (aarch64_vector_costs::adjust_body_cost_sve): Take a
1230 aarch64_vec_issue_info instead of a aarch64_vec_op_count.
1231 (aarch64_vector_costs::adjust_body_cost): Update call accordingly.
1232 Exit earlier if m_ops is empty for either cost structure.
1234 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1236 * config/aarch64/aarch64.c (aarch64_vector_costs::m_scalar_ops)
1237 (aarch64_vector_costs::m_sve_ops): Replace with...
1238 (aarch64_vector_costs::m_ops): ...this.
1239 (aarch64_vector_costs::analyze_loop_vinfo): Update accordingly.
1240 (aarch64_vector_costs::adjust_body_cost_sve): Likewise.
1241 (aarch64_vector_costs::aarch64_vector_costs): Likewise.
1242 Initialize m_vec_flags here rather than in add_stmt_cost.
1243 (aarch64_vector_costs::count_ops): Test for scalar reductions too.
1244 Allow vectype to be null.
1245 (aarch64_vector_costs::add_stmt_cost): Call count_ops for scalar
1246 code too. Don't require vectype to be nonnull.
1247 (aarch64_vector_costs::adjust_body_cost): Take the loop_vec_info
1248 and scalar costs as parameters. Use the scalar costs to determine
1249 the cycles per iteration of the scalar loop, then multiply it
1250 by the estimated VF.
1251 (aarch64_vector_costs::finish_cost): Update call accordingly.
1253 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1255 * config/aarch64/aarch64.c (aarch64_dr_type): New function.
1256 (aarch64_vector_costs::count_ops): Use it rather than the
1257 vectype to determine floatness.
1259 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1261 * config/aarch64/aarch64.c (aarch64_sve_in_loop_reduction_latency):
1262 Remove vectype parameter and get floatness from the type of the
1264 (arch64_in_loop_reduction_latency): Likewise.
1265 (aarch64_detect_vector_stmt_subtype): Update caller.
1266 (aarch64_vector_costs::count_ops): Likewise.
1268 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1270 * config/aarch64/aarch64.c (aarch64_sve_op_count): Fold into...
1271 (aarch64_vec_op_count): ...this. Add a constructor.
1272 (aarch64_vec_op_count::vec_flags): New function.
1273 (aarch64_vec_op_count::base_issue_info): Likewise.
1274 (aarch64_vec_op_count::simd_issue_info): Likewise.
1275 (aarch64_vec_op_count::sve_issue_info): Likewise.
1276 (aarch64_vec_op_count::m_issue_info): New member variable.
1277 (aarch64_vec_op_count::m_vec_flags): Likewise.
1278 (aarch64_vector_costs): Add a constructor.
1279 (aarch64_vector_costs::m_sve_ops): Change type to aarch64_vec_op_count.
1280 (aarch64_vector_costs::aarch64_vector_costs): New function.
1281 Initialize m_scalar_ops, m_advsimd_ops and m_sve_ops.
1282 (aarch64_vector_costs::count_ops): Remove vec_flags and
1283 issue_info parameters, using the new aarch64_vec_op_count
1285 (aarch64_vector_costs::add_stmt_cost): Update call accordingly.
1286 (aarch64_sve_op_count::dump): Fold into...
1287 (aarch64_vec_op_count::dump): ..here.
1289 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
1291 * config/aarch64/aarch64.c: Include tree-dfa.h.
1292 (aarch64_check_consecutive_mems): New function that takes MEM_EXPR
1293 and MEM_OFFSET into account.
1294 (aarch64_swap_ldrstr_operands): Use it.
1295 (aarch64_operands_ok_for_ldpstp): Likewise. Check that the
1296 address of the second memory doesn't depend on the result of
1299 2021-11-12 Jan Hubicka <hubicka@ucw.cz>
1301 * ipa-pure-const.c (ipa_make_function_pure): Fix exit condition.
1303 2021-11-12 Jan Hubicka <jh@suse.cz>
1305 PR tree-optimization/103175
1306 * ipa-modref.c (modref_lattice::merge): Add sanity check.
1307 (callee_to_caller_flags): Make flags adjustment sane.
1308 (modref_eaf_analysis::analyze_ssa_name): Likewise.
1310 2021-11-12 Jan Hubicka <jh@suse.cz>
1313 * ipa-modref.c (analyze_function, modref_propagate_in_scc): Do
1314 not mark pure/const function if there are side-effects.
1316 2021-11-12 Chung-Lin Tang <cltang@codesourcery.com>
1318 * tree.h (OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P): New access macro for
1319 'implicit' bit, using 'base.deprecated_flag' field of tree_node.
1320 * tree-pretty-print.c (dump_omp_clause): Add support for printing
1321 implicit attribute in tree dumping.
1322 * gimplify.c (gimplify_adjust_omp_clauses_1):
1323 Set OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P to 1 if map clause is implicitly
1325 (gimplify_adjust_omp_clauses): Adjust place of adding implicitly created
1326 clauses, from simple append, to starting of list, after non-map clauses.
1327 * omp-low.c (lower_omp_target): Add GOMP_MAP_IMPLICIT bits into kind
1328 values passed to libgomp for implicit maps.
1330 2021-11-12 Jakub Jelinek <jakub@redhat.com>
1332 * omp-builtins.def (BUILT_IN_GOMP_TEAMS): Remove.
1333 (BUILT_IN_GOMP_TEAMS4): New.
1334 * builtin-types.def (BT_FN_VOID_UINT_UINT): Remove.
1335 (BT_FN_BOOL_UINT_UINT_UINT_BOOL): New.
1336 * omp-low.c (lower_omp_teams): Use GOMP_teams4 instead of
1337 GOMP_teams, pass to it also num_teams lower-bound expression
1338 or a dup of upper-bound if it is missing and a flag whether
1339 it is the first call or not.
1341 2021-11-12 Martin Liska <mliska@suse.cz>
1343 PR tree-optimization/102497
1344 * gimple-predicate-analysis.cc (add_pred): Remove unused
1347 2021-11-12 Richard Biener <rguenther@suse.de>
1349 PR tree-optimization/103204
1350 * tree-ssa-sccvn.c (valueize_refs_1): Re-valueize the
1351 top operand after folding in an address.
1353 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
1355 * gimple-range-path.cc (path_range_query::path_range_query): New
1356 ctor without a ranger.
1357 (path_range_query::~path_range_query): Free ranger if necessary.
1358 (path_range_query::range_on_path_entry): Adjust m_ranger for pointer.
1359 (path_range_query::ssa_range_in_phi): Same.
1360 (path_range_query::compute_ranges_in_block): Same.
1361 (path_range_query::compute_imports): Same.
1362 (path_range_query::compute_ranges): Same.
1363 (path_range_query::range_of_stmt): Same.
1364 (path_range_query::compute_outgoing_relations): Same.
1365 * gimple-range-path.h (class path_range_query): New ctor.
1366 * tree-ssa-loop-ch.c (ch_base::copy_headers): Remove gimple_ranger
1367 as path_range_query allocates one.
1368 * tree-ssa-threadbackward.c (class back_threader): Remove m_ranger.
1369 (back_threader::~back_threader): Same.
1371 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
1373 * tree-ssa-threadbackward.c
1374 (back_threader_profitability::profitable_path_p): Remove loop
1375 crossing restriction.
1377 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
1379 * ipa-modref.c (analyze_function): Do pure/const discovery, return
1381 (pass_modref::execute): If pure/const is discovered fixup cfg.
1382 (ignore_edge): Do not ignore pure/const edges.
1383 (modref_propagate_in_scc): Do pure/const discovery, return true if
1384 cdtor was promoted pure/const.
1385 (pass_ipa_modref::execute): If needed remove unreachable functions.
1386 * ipa-pure-const.c (warn_function_noreturn): Fix whitespace.
1387 (warn_function_cold): Likewise.
1388 (skip_function_for_local_pure_const): Move earlier.
1389 (ipa_make_function_const): Break out from ...
1390 (ipa_make_function_pure): Break out from ...
1391 (propagate_pure_const): ... here.
1392 (pass_local_pure_const::execute): Use it.
1393 * ipa-utils.h (ipa_make_function_const): Declare.
1394 (ipa_make_function_pure): Declare.
1395 * passes.def: Move early modref after pure-const.
1397 2021-11-11 David Malcolm <dmalcolm@redhat.com>
1400 * diagnostic-show-locus.c (def_policy): Use def_tabstop.
1402 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1404 * config/aarch64/aarch64-builtins.c (TYPES_COMBINE): Delete.
1405 (TYPES_COMBINEP): Delete.
1406 * config/aarch64/aarch64-simd-builtins.def: Declare type-
1407 qualified builtins for vcombine_* intrinsics.
1408 * config/aarch64/arm_neon.h (vcombine_s8): Remove unnecessary
1410 (vcombine_s16): Likewise.
1411 (vcombine_s32): Likewise.
1412 (vcombine_f32): Likewise.
1413 (vcombine_u8): Use type-qualified builtin and remove casts.
1414 (vcombine_u16): Likewise.
1415 (vcombine_u32): Likewise.
1416 (vcombine_u64): Likewise.
1417 (vcombine_p8): Likewise.
1418 (vcombine_p16): Likewise.
1419 (vcombine_p64): Likewise.
1420 (vcombine_bf16): Remove unnecessary cast.
1421 * config/aarch64/iterators.md (VD_I): New mode iterator.
1422 (VDC_P): New mode iterator.
1424 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1426 * config/aarch64/aarch64-builtins.c (TYPES_LOAD1_U): Define.
1427 (TYPES_LOAD1_P): Define.
1428 (TYPES_STORE1_U): Define.
1429 (TYPES_STORE1P): Rename to...
1430 (TYPES_STORE1_P): This.
1431 (get_mem_type_for_load_store): Add unsigned and poly types.
1432 (aarch64_general_gimple_fold_builtin): Add unsigned and poly
1433 type-qualified builtin declarations.
1434 * config/aarch64/aarch64-simd-builtins.def: Declare type-
1435 qualified builtins for LD1/ST1.
1436 * config/aarch64/arm_neon.h (vld1_p8): Use type-qualified
1437 builtin and remove cast.
1438 (vld1_p16): Likewise.
1439 (vld1_u8): Likewise.
1440 (vld1_u16): Likewise.
1441 (vld1_u32): Likewise.
1442 (vld1q_p8): Likewise.
1443 (vld1q_p16): Likewise.
1444 (vld1q_p64): Likewise.
1445 (vld1q_u8): Likewise.
1446 (vld1q_u16): Likewise.
1447 (vld1q_u32): Likewise.
1448 (vld1q_u64): Likewise.
1449 (vst1_p8): Likewise.
1450 (vst1_p16): Likewise.
1451 (vst1_u8): Likewise.
1452 (vst1_u16): Likewise.
1453 (vst1_u32): Likewise.
1454 (vst1q_p8): Likewise.
1455 (vst1q_p16): Likewise.
1456 (vst1q_p64): Likewise.
1457 (vst1q_u8): Likewise.
1458 (vst1q_u16): Likewise.
1459 (vst1q_u32): Likewise.
1460 (vst1q_u64): Likewise.
1461 * config/aarch64/iterators.md (VALLP_NO_DI): New iterator.
1463 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1465 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
1466 builtins for vector reduction.
1467 * config/aarch64/arm_neon.h (vaddv_u8): Use type-qualified
1468 builtin and remove casts.
1469 (vaddv_u16): Likewise.
1470 (vaddv_u32): Likewise.
1471 (vaddvq_u8): Likewise.
1472 (vaddvq_u16): Likewise.
1473 (vaddvq_u32): Likewise.
1474 (vaddvq_u64): Likewise.
1476 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1478 * config/aarch64/aarch64-simd-builtins.def:
1479 * config/aarch64/arm_neon.h (vpaddq_u8): Use type-qualified
1480 builtin and remove casts.
1481 (vpaddq_u16): Likewise.
1482 (vpaddq_u32): Likewise.
1483 (vpaddq_u64): Likewise.
1484 (vpadd_u8): Likewise.
1485 (vpadd_u16): Likewise.
1486 (vpadd_u32): Likewise.
1487 (vpaddd_u64): Likewise.
1489 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1491 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
1492 builtins for [r]subhn[2].
1493 * config/aarch64/arm_neon.h (vsubhn_s16): Remove unnecessary
1495 (vsubhn_s32): Likewise.
1496 (vsubhn_s64): Likewise.
1497 (vsubhn_u16): Use type-qualified builtin and remove casts.
1498 (vsubhn_u32): Likewise.
1499 (vsubhn_u64): Likewise.
1500 (vrsubhn_s16): Remove unnecessary cast.
1501 (vrsubhn_s32): Likewise.
1502 (vrsubhn_s64): Likewise.
1503 (vrsubhn_u16): Use type-qualified builtin and remove casts.
1504 (vrsubhn_u32): Likewise.
1505 (vrsubhn_u64): Likewise.
1506 (vrsubhn_high_s16): Remove unnecessary cast.
1507 (vrsubhn_high_s32): Likewise.
1508 (vrsubhn_high_s64): Likewise.
1509 (vrsubhn_high_u16): Use type-qualified builtin and remove
1511 (vrsubhn_high_u32): Likewise.
1512 (vrsubhn_high_u64): Likewise.
1513 (vsubhn_high_s16): Remove unnecessary cast.
1514 (vsubhn_high_s32): Likewise.
1515 (vsubhn_high_s64): Likewise.
1516 (vsubhn_high_u16): Use type-qualified builtin and remove
1518 (vsubhn_high_u32): Likewise.
1519 (vsubhn_high_u64): Likewise.
1521 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1523 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
1524 builtins for [r]addhn[2].
1525 * config/aarch64/arm_neon.h (vaddhn_s16): Remove unnecessary
1527 (vaddhn_s32): Likewise.
1528 (vaddhn_s64): Likewise.
1529 (vaddhn_u16): Use type-qualified builtin and remove casts.
1530 (vaddhn_u32): Likewise.
1531 (vaddhn_u64): Likewise.
1532 (vraddhn_s16): Remove unnecessary cast.
1533 (vraddhn_s32): Likewise.
1534 (vraddhn_s64): Likewise.
1535 (vraddhn_u16): Use type-qualified builtin and remove casts.
1536 (vraddhn_u32): Likewise.
1537 (vraddhn_u64): Likewise.
1538 (vaddhn_high_s16): Remove unnecessary cast.
1539 (vaddhn_high_s32): Likewise.
1540 (vaddhn_high_s64): Likewise.
1541 (vaddhn_high_u16): Use type-qualified builtin and remove
1543 (vaddhn_high_u32): Likewise.
1544 (vaddhn_high_u64): Likewise.
1545 (vraddhn_high_s16): Remove unnecessary cast.
1546 (vraddhn_high_s32): Likewise.
1547 (vraddhn_high_s64): Likewise.
1548 (vraddhn_high_u16): Use type-qualified builtin and remove
1550 (vraddhn_high_u32): Likewise.
1551 (vraddhn_high_u64): Likewise.
1553 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1555 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
1556 qualifiers in generator macros for uhsub builtins.
1557 * config/aarch64/arm_neon.h (vhsub_s8): Remove unnecessary
1559 (vhsub_s16): Likewise.
1560 (vhsub_s32): Likewise.
1561 (vhsub_u8): Use type-qualified builtin and remove casts.
1562 (vhsub_u16): Likewise.
1563 (vhsub_u32): Likewise.
1564 (vhsubq_s8): Remove unnecessary cast.
1565 (vhsubq_s16): Likewise.
1566 (vhsubq_s32): Likewise.
1567 (vhsubq_u8): Use type-qualified builtin and remove casts.
1568 (vhsubq_u16): Likewise.
1569 (vhsubq_u32): Likewise.
1571 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1573 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
1574 qualifiers in generator macros for u[r]hadd builtins.
1575 * config/aarch64/arm_neon.h (vhadd_s8): Remove unnecessary
1577 (vhadd_s16): Likewise.
1578 (vhadd_s32): Likewise.
1579 (vhadd_u8): Use type-qualified builtin and remove casts.
1580 (vhadd_u16): Likewise.
1581 (vhadd_u32): Likewise.
1582 (vhaddq_s8): Remove unnecessary cast.
1583 (vhaddq_s16): Likewise.
1584 (vhaddq_s32): Likewise.
1585 (vhaddq_u8): Use type-qualified builtin and remove casts.
1586 (vhaddq_u16): Likewise.
1587 (vhaddq_u32): Likewise.
1588 (vrhadd_s8): Remove unnecessary cast.
1589 (vrhadd_s16): Likewise.
1590 (vrhadd_s32): Likewise.
1591 (vrhadd_u8): Use type-qualified builtin and remove casts.
1592 (vrhadd_u16): Likewise.
1593 (vrhadd_u32): Likewise.
1594 (vrhaddq_s8): Remove unnecessary cast.
1595 (vrhaddq_s16): Likewise.
1596 (vrhaddq_s32): Likewise.
1597 (vrhaddq_u8): Use type-wualified builtin and remove casts.
1598 (vrhaddq_u16): Likewise.
1599 (vrhaddq_u32): Likewise.
1601 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1603 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
1604 qualifiers in generator macros for usub[lw][2] builtins.
1605 * config/aarch64/arm_neon.h (vsubl_s8): Remove unnecessary
1607 (vsubl_s16): Likewise.
1608 (vsubl_s32): Likewise.
1609 (vsubl_u8): Use type-qualified builtin and remove casts.
1610 (vsubl_u16): Likewise.
1611 (vsubl_u32): Likewise.
1612 (vsubl_high_s8): Remove unnecessary cast.
1613 (vsubl_high_s16): Likewise.
1614 (vsubl_high_s32): Likewise.
1615 (vsubl_high_u8): Use type-qualified builtin and remove casts.
1616 (vsubl_high_u16): Likewise.
1617 (vsubl_high_u32): Likewise.
1618 (vsubw_s8): Remove unnecessary casts.
1619 (vsubw_s16): Likewise.
1620 (vsubw_s32): Likewise.
1621 (vsubw_u8): Use type-qualified builtin and remove casts.
1622 (vsubw_u16): Likewise.
1623 (vsubw_u32): Likewise.
1624 (vsubw_high_s8): Remove unnecessary cast.
1625 (vsubw_high_s16): Likewise.
1626 (vsubw_high_s32): Likewise.
1627 (vsubw_high_u8): Use type-qualified builtin and remove casts.
1628 (vsubw_high_u16): Likewise.
1629 (vsubw_high_u32): Likewise.
1631 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1633 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
1634 qualifiers in generator macros for uadd[lw][2] builtins.
1635 * config/aarch64/arm_neon.h (vaddl_s8): Remove unnecessary
1637 (vaddl_s16): Likewise.
1638 (vaddl_s32): Likewise.
1639 (vaddl_u8): Use type-qualified builtin and remove casts.
1640 (vaddl_u16): Likewise.
1641 (vaddl_u32): Likewise.
1642 (vaddl_high_s8): Remove unnecessary cast.
1643 (vaddl_high_s16): Likewise.
1644 (vaddl_high_s32): Likewise.
1645 (vaddl_high_u8): Use type-qualified builtin and remove casts.
1646 (vaddl_high_u16): Likewise.
1647 (vaddl_high_u32): Likewise.
1648 (vaddw_s8): Remove unnecessary cast.
1649 (vaddw_s16): Likewise.
1650 (vaddw_s32): Likewise.
1651 (vaddw_u8): Use type-qualified builtin and remove casts.
1652 (vaddw_u16): Likewise.
1653 (vaddw_u32): Likewise.
1654 (vaddw_high_s8): Remove unnecessary cast.
1655 (vaddw_high_s16): Likewise.
1656 (vaddw_high_s32): Likewise.
1657 (vaddw_high_u8): Use type-qualified builtin and remove casts.
1658 (vaddw_high_u16): Likewise.
1659 (vaddw_high_u32): Likewise.
1661 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1663 * config/aarch64/aarch64-simd-builtins.def: Declare type-
1664 qualified builtins for [R]SHRN[2].
1665 * config/aarch64/arm_neon.h (vshrn_n_u16): Use type-qualified
1666 builtin and remove casts.
1667 (vshrn_n_u32): Likewise.
1668 (vshrn_n_u64): Likewise.
1669 (vrshrn_high_n_u16): Likewise.
1670 (vrshrn_high_n_u32): Likewise.
1671 (vrshrn_high_n_u64): Likewise.
1672 (vrshrn_n_u16): Likewise.
1673 (vrshrn_n_u32): Likewise.
1674 (vrshrn_n_u64): Likewise.
1675 (vshrn_high_n_u16): Likewise.
1676 (vshrn_high_n_u32): Likewise.
1677 (vshrn_high_n_u64): Likewise.
1679 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1681 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
1682 type-qualified builtins for XTN[2].
1683 * config/aarch64/arm_neon.h (vmovn_high_u16): Use type-
1684 qualified builtin and remove casts.
1685 (vmovn_high_u32): Likewise.
1686 (vmovn_high_u64): Likewise.
1687 (vmovn_u16): Likewise.
1688 (vmovn_u32): Likewise.
1689 (vmovn_u64): Likewise.
1691 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1693 * config/aarch64/aarch64-simd-builtins.def: Use poly type
1694 qualifier in builtin generator macros.
1695 * config/aarch64/arm_neon.h (vmul_p8): Use type-qualified
1696 builtin and remove casts.
1697 (vmulq_p8): Likewise.
1698 (vmull_high_p8): Likewise.
1699 (vmull_p8): Likewise.
1701 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
1703 * config/aarch64/aarch64-simd-builtins.def: Declare type-
1704 qualified builtin generators for unsigned MLA/MLS intrinsics.
1705 * config/aarch64/arm_neon.h (vmla_n_u16): Use type-qualified
1707 (vmla_n_u32): Likewise.
1708 (vmla_u8): Likewise.
1709 (vmla_u16): Likewise.
1710 (vmla_u32): Likewise.
1711 (vmlaq_n_u16): Likewise.
1712 (vmlaq_n_u32): Likewise.
1713 (vmlaq_u8): Likewise.
1714 (vmlaq_u16): Likewise.
1715 (vmlaq_u32): Likewise.
1716 (vmls_n_u16): Likewise.
1717 (vmls_n_u32): Likewise.
1718 (vmls_u8): Likewise.
1719 (vmls_u16): Likewise.
1720 (vmls_u32): Likewise.
1721 (vmlsq_n_u16): Likewise.
1722 (vmlsq_n_u32): Likewise.
1723 (vmlsq_u8): Likewise.
1724 (vmlsq_u16): Likewise.
1725 (vmlsq_u32): Likewise.
1727 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
1729 * ipa-modref.c (modref_summary::useful_p): Check also for side-effects
1730 with looping const/pure.
1731 (modref_summary_lto::useful_p): Likewise.
1732 (merge_call_side_effects): Merge side effects before early exit
1734 (process_fnspec): Also handle pure functions.
1735 (analyze_call): Do not early exit on looping pure const.
1736 (propagate_unknown_call): Also handle nontrivial SCC as side-effect.
1737 (modref_propagate_in_scc): Update.
1739 2021-11-11 Richard Biener <rguenther@suse.de>
1741 PR tree-optimization/103190
1742 * tree-ssa-reassoc.c (insert_stmt_after): Only assert on asm goto.
1744 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
1746 * gimple-range-path.cc (path_range_query::add_copies_to_imports):
1748 (path_range_query::compute_imports): ...this. Adapt it so it can
1749 be passed the imports bitmap instead of working on m_imports.
1750 (path_range_query::compute_ranges): Call compute_imports in all
1751 cases unless an imports bitmap is passed.
1752 * gimple-range-path.h (path_range_query::compute_imports): New.
1753 (path_range_query::add_copies_to_imports): Remove.
1754 * tree-ssa-threadbackward.c (back_threader::resolve_def): Remove.
1755 (back_threader::find_paths_to_names): Inline resolve_def.
1756 (back_threader::find_paths): Call compute_imports.
1757 (back_threader::resolve_phi): Adjust comment.
1759 2021-11-11 Richard Biener <rguenther@suse.de>
1761 PR tree-optimization/103188
1762 * tree-ssa-loop-ch.c (should_duplicate_loop_header_p):
1763 Remove query parameter, split out check for size
1765 (ch_base::m_ranger, cb_base::m_query): Remove.
1766 (ch_base::copy_headers): Split processing loop into
1767 analysis around which we allocate and use ranger and
1768 transform where we do not.
1769 (pass_ch::execute): Do not allocate/free ranger here.
1770 (pass_ch_vect::execute): Likewise.
1772 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
1774 * ipa-pure-const.c (propagate_pure_const): Self recursion is
1777 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
1779 * cgraph.c (set_noreturn_flag_1): New function.
1780 (cgraph_node::set_noreturn_flag): New member function
1781 * cgraph.h (cgraph_node::set_noreturn_flags): Declare.
1782 * ipa-pure-const.c (pass_local_pure_const::execute): Use it.
1784 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
1785 Richard Biener <rguenther@suse.de>
1787 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): Resolve
1788 statically to the edge remaining in the loop.
1790 2021-11-11 Richard Biener <rguenther@suse.de>
1792 PR middle-end/103181
1793 * tree-eh.c (operation_could_trap_helper_p): Properly
1794 check vector constants for a zero element for integer
1795 division. Separate floating point and integer division code.
1797 2021-11-11 Jakub Jelinek <jakub@redhat.com>
1800 * dwarf2out.c (field_byte_offset): Do the PCC_BITFIELD_TYPE_MATTERS
1801 handling only for DECL_BIT_FIELD_TYPE decls.
1803 2021-11-11 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
1806 * config/aarch64/aarch64.c (aarch64_process_target_attr): Check if
1807 token is arch extension without leading '+' and emit appropriate
1808 diagnostic for the same.
1810 2021-11-11 Jakub Jelinek <jakub@redhat.com>
1812 * tree.h (OMP_CLAUSE_NUM_TEAMS_EXPR): Rename to ...
1813 (OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR): ... this.
1814 (OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR): Define.
1815 * tree.c (omp_clause_num_ops): Increase num ops for
1816 OMP_CLAUSE_NUM_TEAMS to 2.
1817 * tree-pretty-print.c (dump_omp_clause): Print optional lower bound
1818 for OMP_CLAUSE_NUM_TEAMS.
1819 * gimplify.c (gimplify_scan_omp_clauses): Gimplify
1820 OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR if non-NULL.
1821 (optimize_target_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR instead
1822 of OMP_CLAUSE_NUM_TEAMS_EXPR. Handle OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR.
1823 * omp-low.c (lower_omp_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR
1824 instead of OMP_CLAUSE_NUM_TEAMS_EXPR.
1825 * omp-expand.c (expand_teams_call, get_target_arguments): Likewise.
1827 2021-11-11 Richard Biener <rguenther@suse.de>
1829 * cfganal.c (find_pdom): Remove.
1830 (control_dependences::find_control_dependence): Remove
1831 special-casing of entry block, call get_immediate_dominator
1833 * gimple-predicate-analysis.cc (find_pdom): Remove.
1834 (find_dom): Likewise.
1835 (find_control_equiv_block): Call get_immediate_dominator
1837 (compute_control_dep_chain): Likewise.
1838 (predicate::init_from_phi_def): Likewise.
1840 2021-11-11 Richard Biener <rguenther@suse.de>
1842 * cfganal.h (control_dependences::control_dependence_map):
1844 (control_dependences::m_bitmaps): New.
1845 * cfganal.c (control_dependences::set_control_dependence_map_bit):
1847 (control_dependences::clear_control_dependence_bitmap):
1849 (control_dependences::find_control_dependence): Do not
1850 find_edge for the abnormal edge test.
1851 (control_dependences::control_dependences): Instead do not
1852 add abnormal edges to the edge list. Adjust.
1853 (control_dependences::~control_dependences): Likewise.
1854 (control_dependences::get_edges_dependent_on): Likewise.
1855 * function-tests.c: Include bitmap.h.
1857 2021-11-11 Kewen Lin <linkw@linux.ibm.com>
1859 * doc/invoke.texi: Change references to "future cpu" to "power10",
1860 "-mcpu=future" to "-mcpu=power10". Adjust words for float128.
1862 2021-11-11 Cui,Lili <lili.cui@intel.com>
1864 * config/i386/i386-options.c (m_CORE_AVX2): Remove Alderlake
1866 (processor_cost_table): Use alderlake_cost for Alderlake.
1867 * config/i386/i386.c (ix86_sched_init_global): Handle Alderlake.
1868 * config/i386/x86-tune-costs.h (struct processor_costs): Add alderlake
1870 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Alderlake
1872 (ix86_adjust_cost): Handle Alderlake.
1873 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE): Enable for Alderlake.
1874 (X86_TUNE_PARTIAL_REG_DEPENDENCY): Likewise.
1875 (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY): Likewise.
1876 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): Likewise.
1877 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
1878 (X86_TUNE_MEMORY_MISMATCH_STALL): Likewise.
1879 (X86_TUNE_USE_LEAVE): Likewise.
1880 (X86_TUNE_PUSH_MEMORY): Likewise.
1881 (X86_TUNE_USE_INCDEC): Likewise.
1882 (X86_TUNE_INTEGER_DFMODE_MOVES): Likewise.
1883 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
1884 (X86_TUNE_USE_SAHF): Likewise.
1885 (X86_TUNE_USE_BT): Likewise.
1886 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
1887 (X86_TUNE_ONE_IF_CONV_INSN): Likewise.
1888 (X86_TUNE_AVOID_MFENCE): Likewise.
1889 (X86_TUNE_USE_SIMODE_FIOP): Likewise.
1890 (X86_TUNE_EXT_80387_CONSTANTS): Likewise.
1891 (X86_TUNE_SSE_UNALIGNED_LOAD_OPTIMAL): Likewise.
1892 (X86_TUNE_SSE_UNALIGNED_STORE_OPTIMAL): Likewise.
1893 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
1894 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
1895 (X86_TUNE_AVOID_4BYTE_PREFIXES): Likewise.
1896 (X86_TUNE_USE_GATHER): Disable for Alderlake.
1897 (X86_TUNE_AVX256_MOVE_BY_PIECES): Likewise.
1898 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
1900 2021-11-11 liuhongt <hongtao.liu@intel.com>
1903 * config/i386/sse.md (V_128_256): Extend to V8HF/V16HF.
1904 (avxsizesuffix): Ditto.
1906 2021-11-11 Kito Cheng <kito.cheng@sifive.com>
1908 * common/config/riscv/riscv-common.c (riscv_subset_list::to_string): Fix
1909 wrong marco checking.
1911 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
1913 PR tree-optimization/102906
1914 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): New.
1915 (should_duplicate_loop_header_p): Call entry_loop_condition_is_static.
1916 (class ch_base): Add m_ranger and m_query.
1917 (ch_base::copy_headers): Pass m_query to
1918 entry_loop_condition_is_static.
1919 (pass_ch::execute): Allocate and deallocate m_ranger and
1921 (pass_ch_vect::execute): Same.
1923 2021-11-10 Andrew Pinski <apinski@marvell.com>
1926 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>):
1927 Use vwcore iterator for the r constraint output string.
1929 2021-11-10 qing zhao <qing.zhao@oracle.com>
1931 * internal-fn.c (expand_DEFERRED_INIT): Apply pattern initialization
1932 only when have_insn_for return true for the mode. Fix a memory leak.
1934 2021-11-10 Christophe Lyon <christophe.lyon@foss.st.com>
1936 * config/arm/arm.c (cortexa9_extra_costs, cortexa8_extra_costs,
1937 cortexa5_extra_costs, cortexa7_extra_costs,
1938 cortexa12_extra_costs, cortexa15_extra_costs, v7m_extra_costs):
1939 Initialize movi, dup and extract costing fields.
1941 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
1943 * gimple-range-path.cc (path_range_query::path_range_query): Do
1945 (path_range_query::dump): Change m_path uses to non-pointer.
1946 (path_range_query::defined_outside_path): Same.
1947 (path_range_query::set_path): Same.
1948 (path_range_query::add_copies_to_imports): Same.
1949 (path_range_query::range_of_stmt): Same.
1950 (path_range_query::compute_outgoing_relations): Same.
1951 (path_range_query::compute_ranges): Imports are now optional.
1952 Implement overload that takes an edge.
1953 * gimple-range-path.h (class path_range_query): Make imports
1954 optional for compute_ranges. Add compute_ranges(edge) overload.
1955 Make m_path an auto_vec instead of a pointer and adjust
1958 2021-11-10 Tamar Christina <tamar.christina@arm.com>
1960 * tree-vectorizer.h (struct scalar_cond_masked_key): Add inverted_p.
1961 (default_hash_traits<scalar_conf_masked_key>): Likewise.
1962 * tree-vect-stmts.c (vectorizable_condition): Check if inverse of mask
1964 * tree-vectorizer.c (scalar_cond_masked_key::get_cond_ops_from_tree):
1965 Register mask inverses.
1967 2021-11-10 Tamar Christina <tamar.christina@arm.com>
1969 * tree-vectorizer.c (vectorize_loops): Do local CSE through RPVN upon
1970 successful vectorization.
1972 2021-11-10 Andrew MacLeod <amacleod@redhat.com>
1974 * gimple-range-cache.cc (sbr_vector::grow): New.
1975 (sbr_vector::set_bb_range): Call grow.
1976 (sbr_vector::get_bb_range): Same.
1977 (sbr_vector::bb_range_p): Remove assert.
1979 2021-11-10 Tamar Christina <tamar.christina@arm.com>
1981 * config/aarch64/aarch64-simd.md (*aarch64_topbits_shuffle<mode>_le
1982 ,*aarch64_topbits_shuffle<mode>_be): Remove.
1984 2021-11-10 Jan Hubicka <jh@suse.cz>
1986 * ipa-modref.c: Include tree-eh.h
1987 (modref_summary::modref_summary): Initialize side_effects.
1988 (struct modref_summary_lto): New bool field side_effects.
1989 (modref_summary_lto::modref_summary_lto): Initialize side_effects.
1990 (modref_summary::dump): Dump side_effects.
1991 (modref_summary_lto::dump): Dump side_effects.
1992 (merge_call_side_effects): Merge side effects.
1993 (process_fnspec): Calls to non-const/pure or looping
1994 function is a side effect.
1995 (analyze_call): Self-recursion is a side-effect; handle
1997 (analyze_load): Watch for volatile and throwing memory.
1998 (analyze_store): Likewise.
1999 (analyze_stmt): Watch for volatitle asm.
2000 (analyze_function): Handle side_effects.
2001 (modref_summaries::duplicate): Duplicate side_effects.
2002 (modref_summaries_lto::duplicate): Likewise.
2003 (modref_write): Stream side_effects.
2004 (read_section): Likewise.
2005 (update_signature): Update.
2006 (propagate_unknown_call): Handle side_effects.
2007 (modref_propagate_in_scc): Likewise.
2008 * ipa-modref.h (struct modref_summary): Add side_effects.
2009 * ipa-pure-const.c (special_builtin_state): Rename to ...
2010 (builtin_safe_for_const_function_p): ... this one.
2011 (check_call): Update.
2012 (finite_function_p): Break out from ...
2013 (propagate_pure_const): ... here
2014 * ipa-utils.h (finite_function): Declare.
2016 2021-11-10 Lucas A. M. Magalhães <lamm@linux.ibm.com>
2018 * config.gcc (powerpc*-*-*): Remove -rpath from
2019 --with-advance-toolchain.
2021 2021-11-10 Marek Polacek <polacek@redhat.com>
2024 * attribs.c (struct scoped_attributes): Add a bool member.
2025 (lookup_scoped_attribute_spec): Forward declare.
2026 (register_scoped_attributes): New bool parameter, defaulted to
2028 (handle_ignored_attributes_option): New function.
2029 (free_attr_data): New function.
2030 (init_attributes): Call handle_ignored_attributes_option.
2031 (attr_namespace_ignored_p): New function.
2032 (decl_attributes): Check attr_namespace_ignored_p before
2034 * attribs.h (free_attr_data): Declare.
2035 (register_scoped_attributes): Adjust declaration.
2036 (handle_ignored_attributes_option): Declare.
2037 (canonicalize_attr_name): New function template.
2038 (canonicalize_attr_name): Use it.
2039 * common.opt (Wattributes=): New option with a variable.
2040 * doc/extend.texi: Document #pragma GCC diagnostic ignored_attributes.
2041 * doc/invoke.texi: Document -Wno-attributes=.
2042 * opts.c (common_handle_option) <case OPT_Wattributes_>: Handle.
2043 * plugin.h (register_scoped_attributes): Adjust declaration.
2044 * toplev.c (compile_file): Call free_attr_data.
2046 2021-11-10 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2048 * config/arm/arm-cpus.in (cortex-a710): New CPU.
2049 * config/arm/arm-tables.opt: Regenerate.
2050 * config/arm/arm-tune.md: Regenerate.
2051 * doc/invoke.texi: Update docs.
2053 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
2055 * config/aarch64/aarch64-builtins.c
2056 (aarch64_general_gimple_fold_builtin): Mark argument as unused.
2058 2021-11-10 Martin Liska <mliska@suse.cz>
2060 * lto-wrapper.c (merge_and_complain): Make the first argument
2063 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2065 * config/aarch64/iterators.md (optab): Use fmax_nan instead of
2066 smax_nan and fmin_nan instead of smin_nan.
2067 (maxmin_uns): Rename to...
2068 (fmaxmin): ...this and make the same changes. Remove entries
2069 unrelated to fmax* and fmin*.
2070 * config/aarch64/aarch64.md (<maxmin_uns><mode>3): Rename to...
2071 (<fmaxmin><mode>3): ...this.
2072 * config/aarch64/aarch64-simd.md (aarch64_<maxmin_uns>p<mode>):
2074 (aarch64_<optab>p<mode>): ...this.
2075 (<maxmin_uns><mode>3): Rename to...
2076 (<fmaxmin><mode>3): ...this.
2077 (reduc_<maxmin_uns>_scal_<mode>): Rename to...
2078 (reduc_<optab>_scal_<mode>): ...this and update gen* call.
2079 (aarch64_reduc_<maxmin_uns>_internal<mode>): Rename to...
2080 (aarch64_reduc_<optab>_internal<mode>): ...this.
2081 (aarch64_reduc_<maxmin_uns>_internalv2si): Rename to...
2082 (aarch64_reduc_<optab>_internalv2si): ...this.
2083 * config/aarch64/aarch64-sve.md (<maxmin_uns><mode>3): Rename to...
2084 (<fmaxmin><mode>3): ...this.
2085 * config/aarch64/aarch64-simd-builtins.def (smax_nan, smin_nan)
2087 (fmax_nan, fmin_nan): ...this.
2088 * config/aarch64/arm_neon.h (vmax_f32, vmax_f64, vmaxq_f32, vmaxq_f64)
2089 (vmin_f32, vmin_f64, vminq_f32, vminq_f64, vmax_f16, vmaxq_f16)
2090 (vmin_f16, vminq_f16): Update accordingly.
2092 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2094 * tree-vectorizer.h (vector_costs::finish_cost): Take the
2095 corresponding scalar costs as a parameter.
2096 (finish_cost): Likewise.
2097 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost)
2098 (vect_estimate_min_profitable_iters): Update accordingly.
2099 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Likewise.
2100 * tree-vectorizer.c (vector_costs::finish_cost): Likewise.
2101 * config/aarch64/aarch64.c (aarch64_vector_costs::finish_cost):
2103 * config/rs6000/rs6000.c (rs6000_cost_data::finish_cost): Likewise.
2105 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2107 * tree-vectorizer.h (_loop_vec_info::scalar_costs): New member
2109 (_loop_vec_info::single_scalar_iteration_cost): Delete.
2110 (LOOP_VINFO_SINGLE_SCALAR_ITERATION_COST): Delete.
2111 (vector_costs::total_cost): New function.
2112 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
2113 after above changes.
2114 (_loop_vec_info::~_loop_vec_info): Delete scalar_costs.
2115 (vect_compute_single_scalar_iteration_cost): Store the costs
2116 in loop_vinfo->scalar_costs.
2117 (vect_estimate_min_profitable_iters): Get the scalar cost from
2118 loop_vinfo->scalar_costs.
2120 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2122 * tree-vectorizer.h (vector_costs::better_main_loop_than_p)
2123 (vector_costs::better_epilogue_loop_than_p)
2124 (vector_costs::compare_inside_loop_cost)
2125 (vector_costs::compare_outside_loop_cost): Likewise.
2126 * tree-vectorizer.c (vector_costs::better_main_loop_than_p)
2127 (vector_costs::better_epilogue_loop_than_p)
2128 (vector_costs::compare_inside_loop_cost)
2129 (vector_costs::compare_outside_loop_cost): New functions,
2130 containing code moved from...
2131 * tree-vect-loop.c (vect_better_loop_vinfo_p): ...here.
2133 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2135 * tree-vectorizer.h (_loop_vec_info): Remove vec_outside_cost
2136 and vec_inside_cost.
2137 (vector_costs::outside_cost): New function.
2138 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
2140 (vect_estimate_min_profitable_iters): Likewise.
2141 (vect_better_loop_vinfo_p): Get the inside and outside costs
2142 from the loop_vec_infos' vector_costs.
2144 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
2146 * tree-vectorizer.h (vec_info::target_cost_data): Replace with...
2147 (_loop_vec_info::vector_costs): ...this.
2148 (LOOP_VINFO_TARGET_COST_DATA): Delete.
2149 * tree-vectorizer.c (vec_info::vec_info): Remove target_cost_data
2151 (vec_info::~vec_info): Remove corresponding delete.
2152 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Initialize
2153 vector_costs to null.
2154 (_loop_vec_info::~_loop_vec_info): Delete vector_costs.
2155 (vect_analyze_loop_operations): Update after above changes.
2156 (vect_analyze_loop_2): Likewise.
2157 (vect_estimate_min_profitable_iters): Likewise.
2158 * tree-vect-slp.c (vect_slp_analyze_operations): Likewise.
2160 2021-11-10 Jan Hubicka <hubicka@ucw.cz>
2162 * tree-core.h (EAF_DIRECT): Remove.
2163 (EAF_NOCLOBBER): Remove.
2164 (EAF_UNUSED): Remove.
2165 (EAF_NOESCAPE): Remove.
2166 (EAF_NO_DIRECT_CLOBBER): New.
2167 (EAF_NO_INDIRECT_CLOBBER): New.
2168 (EAF_NODIRECTESCAPE): Remove.
2169 (EAF_NO_DIRECT_ESCAPE): New.
2170 (EAF_NO_INDIRECT_ESCAPE): New.
2171 (EAF_NOT_RETURNED): Remove.
2172 (EAF_NOT_RETURNED_INDIRECTLY): New.
2173 (EAF_NOREAD): Remove.
2174 (EAF_NO_DIRECT_READ): New.
2175 (EAF_NO_INDIRECT_READ): New.
2176 * gimple.c (gimple_call_arg_flags): Update for new flags.
2177 (gimple_call_retslot_flags): Update for new flags.
2178 * ipa-modref.c (dump_eaf_flags): Likewise.
2179 (remove_useless_eaf_flags): Likewise.
2180 (deref_flags): Likewise.
2181 (modref_lattice::init): Likewise.
2182 (modref_lattice::merge): Likewise.
2183 (modref_lattice::merge_direct_load): Likewise.
2184 (modref_lattice::merge_direct_store): Likewise.
2185 (modref_eaf_analysis::merge_call_lhs_flags): Likewise.
2186 (callee_to_caller_flags): Likewise.
2187 (modref_eaf_analysis::analyze_ssa_name): Likewise.
2188 (modref_eaf_analysis::propagate): Likewise.
2189 (modref_merge_call_site_flags): Likewise.
2190 * ipa-modref.h (interposable_eaf_flags): Likewise.
2191 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1) Likewise.
2192 * tree-ssa-structalias.c (handle_call_arg): Likewise.
2193 (handle_rhs_call): Likewise.
2194 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Likewise.
2196 2021-11-10 Tamar Christina <tamar.christina@arm.com>
2198 * tree-vect-slp-patterns.c (complex_mul_pattern::matches): Remove l1node.
2200 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
2202 * config/aarch64/aarch64-builtins.c
2203 (aarch64_general_gimple_fold_builtin): Change pointer alignment and
2206 2021-11-10 Jan Hubicka <jh@suse.cz>
2208 * ipa-modref-tree.h (modref_tree::remap_params): Fix off-by-one error.
2210 2021-11-10 H.J. Lu <hongjiu.lu@intel.com>
2211 Hongtao Liu <hongtao.liu@intel.com>
2213 PR middle-end/102566
2214 * match.pd (nop_atomic_bit_test_and_p): New match.
2215 * tree-ssa-ccp.c (convert_atomic_bit_not): New function.
2216 (gimple_nop_atomic_bit_test_and_p): New prototype.
2217 (optimize_atomic_bit_test_and): Transform equivalent, but slighly
2218 different cases to their canonical forms.
2220 2021-11-10 liuhongt <hongtao.liu@intel.com>
2222 PR tree-optimization/103126
2223 * tree-vect-loop.c (neutral_op_for_reduction): Remove static.
2224 * tree-vectorizer.h (neutral_op_for_reduction): Declare.
2225 * tree-if-conv.c : Include tree-vectorizer.h.
2226 (is_cond_scalar_reduction): Handle
2227 BIT_XOR_EXPR/BIT_IOR_EXPR/BIT_AND_EXPR.
2228 (convert_scalar_cond_reduction): Ditto.
2230 2021-11-10 konglin1 <lingling.kong@intel.com>
2232 * config/i386/sse.md (cmul<conj_op><mode>3): add new define_expand.
2233 (cmla<conj_op><mode>4): Likewise
2235 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
2237 * tree-ssa-threadedge.c: Do not include
2238 gimple-ssa-evrp-analyze.h.
2239 * value-pointer-equiv.cc: Same.
2241 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
2243 * tree-ssa-threadbackward.c
2244 (back_threader::maybe_register_path_dump): Abstract path dumping...
2245 (dump_path): ...here.
2246 (back_threader::resolve_phi): Call dump_path.
2249 2021-11-10 konglin1 <lingling.kong@intel.com>
2251 * config/i386/sse.md (fma_<complexpairopname>_<mode>_pair):
2252 Add new define_insn.
2253 (fma_<mode>_fmaddc_bcst): Add new define_insn_and_split.
2254 (fma_<mode>_fcmaddc_bcst): Likewise
2256 2021-11-10 liuhongt <hongtao.liu@intel.com>
2259 * match.pd: Simplify (trunc)fmax/fmin((extend)a, (extend)b) to
2262 2021-11-10 Andrew Pinski <apinski@marvell.com>
2265 * config/aarch64/aarch64.c (aarch64_evpc_ins): Don't use target
2266 as an input, use original one.
2268 2021-11-10 Sandra Loosemore <sandra@codesourcery.com>
2270 * config/nios2/nios2.c (nios2_can_inline_p): New.
2271 (TARGET_CAN_INLINE_P): Define.
2273 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
2275 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin):
2276 Disable gimple fold for RS6000_BIF_{XVMINDP,XVMINSP,VMINFP} and
2277 RS6000_BIF_{XVMAXDP,XVMAXSP,VMAXFP} when fast-math is not set.
2278 (lxvrse_expand_builtin): Modify the expansion for sign extension.
2279 All extensions are done within VSX registers.
2281 2021-11-09 Andrew Pinski <apinski@marvell.com>
2283 PR tree-optimization/10352
2284 * match.pd: Remove check of TYPE_PRECISION for
2287 2021-11-09 Andrew MacLeod <amacleod@redhat.com>
2289 * function.c (allocate_struct_function): Don't set x_range_query.
2290 * function.h (get_range_query): Move to value-query.h.
2291 * gimple-range.cc (enable_ranger): Check that query is currently NULL.
2292 (disable_ranger): Clear function current query field.
2293 * value-query.cc (get_global_range_query): Relocate to:
2294 * value-query.h (get_global_range_query): Here and inline.
2295 (get_range_query): Relocate here from function.h.
2297 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
2299 * tree-ssa-threadbackward.c
2300 (back_threader::maybe_register_path_dump): New.
2301 (back_threader::maybe_register_path): Call maybe_register_path_dump.
2303 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
2305 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
2306 Return NULL when unprofitable.
2308 2021-11-09 Martin Jambor <mjambor@suse.cz>
2310 * tree.h (build_debug_expr_decl): Declare.
2311 * tree.c (build_debug_expr_decl): New function.
2312 * cfgexpand.c (avoid_deep_ter_for_debug): Use build_debug_expr_decl
2313 instead of building a DEBUG_EXPR_DECL.
2314 * ipa-param-manipulation.c
2315 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
2316 * omp-simd-clone.c (ipa_simd_modify_stmt_ops): Likewise.
2317 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Likewise.
2318 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
2319 * tree-ssa-reassoc.c (make_new_ssa_for_def): Likewise.
2321 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
2323 * config/rs6000/rs6000-call.c (rs6000_type_string): New function.
2324 (def_builtin): Change debug formatting for easier parsing and
2325 include more information.
2326 (rs6000_init_builtins): Add dump of autogenerated builtins.
2327 (altivec_init_builtins): Dump __builtin_altivec_mask_for_load for
2330 2021-11-09 Marek Polacek <polacek@redhat.com>
2332 * ipa-modref.c (analyze_parms): Fix a typo.
2334 2021-11-09 Kito Cheng <kito.cheng@sifive.com>
2337 * common/config/riscv/riscv-common.c (multi_letter_subset_rank): Remove
2338 assertion for Z*-ext.
2340 2021-11-09 Jan Hubicka <hubicka@ucw.cz>
2342 * ipa-modref.c (analyze_parms): Add past_flags, past_retslot_flags
2343 and past_static_chain; merge past summary with current one.
2344 (analyze_function): Update.
2346 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
2348 * doc/invoke.texi (Invoking GCC): Document --param=threader-debug.
2350 2021-11-09 Martin Liska <mliska@suse.cz>
2352 * print-rtl.c (rtx_writer::rtx_writer): Make the compilation
2353 conditional based on
2354 * print-rtl.h (class rtx_writer): Likewise.
2356 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
2358 * input.h (location_hash): Use 'BUILTINS_LOCATION' as spare value
2359 for 'Deleted'. Turn into a '#define'.
2361 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
2363 * gimple-range-path.cc (path_range_query::dump): Clean up.
2364 (path_range_query::compute_ranges): Same.
2365 * value-relation.cc (path_oracle::dump): Same.
2367 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
2369 * dumpfile.c (dump_options): Remove TDF_THREADING entry.
2370 * dumpfile.h (enum dump_flag): Remove TDF_THREADING and adjust
2372 * flag-types.h (enum threader_debug): New.
2373 * gimple-range-path.cc (DEBUG_SOLVER): Use param_threader_debug.
2374 * params.opt: Add entry for --param=threader-debug=.
2376 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
2378 PR middle-end/101204
2380 * diagnostic-spec.h (typedef xint_hash_t): Turn into...
2381 (struct xint_hash_t): ... this.
2382 * doc/gty.texi: Update.
2384 2021-11-09 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2386 * config/arm/arm-cpus.in (armv9): New define.
2387 (ARMv9a): New group.
2388 (armv9-a): New arch definition.
2389 * config/arm/arm-tables.opt: Regenerate.
2390 * config/arm/arm.h (BASE_ARCH_9A): New arch enum value.
2391 * config/arm/t-aprofile: Added armv9-a and armv9+simd.
2392 * config/arm/t-arm-elf: Added arm9-a, v9_fps and all_v9_archs
2393 to MULTILIB_MATCHES.
2394 * config/arm/t-multilib: Added v9_a_nosimd_variants and
2395 v9_a_simd_variants to MULTILIB_MATCHES.
2396 * doc/invoke.texi: Update docs.
2398 2021-11-09 Martin Jambor <mjambor@suse.cz>
2401 * ipa-param-manipulation.c (replace_with_mapped_expr): Early
2402 return with error_mark_mode when part of expression is mapped to
2404 (ipa_param_body_adjustments::remap_with_debug_expressions): Set
2405 mapped value to NULL if walk_tree returns error_mark_mode.
2407 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
2409 * tree-ssa-strlen.c (compare_nonzero_chars): Pass statement
2411 (get_addr_stridx): Same.
2413 (get_range_strlen_dynamic): Same.
2414 (handle_builtin_strlen): Same.
2415 (handle_builtin_strchr): Same.
2416 (handle_builtin_strcpy): Same.
2417 (maybe_diag_stxncpy_trunc): Same.
2418 (handle_builtin_stxncpy_strncat): Same.
2419 (handle_builtin_memcpy): Same.
2420 (handle_builtin_strcat): Same.
2421 (handle_alloc_call): Same.
2422 (handle_builtin_memset): Same.
2423 (handle_builtin_string_cmp): Same.
2424 (handle_pointer_plus): Same.
2425 (count_nonzero_bytes_addr): Same.
2426 (count_nonzero_bytes): Same.
2427 (handle_store): Same.
2428 (fold_strstr_to_strncmp): Same.
2429 (handle_integral_assign): Same.
2430 (check_and_optimize_stmt): Same.
2431 (class strlen_dom_walker): Replace evrp with ranger.
2432 (strlen_dom_walker::before_dom_children): Remove evrp.
2433 (strlen_dom_walker::after_dom_children): Remove evrp.
2434 * gimple-ssa-warn-access.cc (maybe_check_access_sizes):
2435 Restrict sprintf output.
2437 2021-11-09 Martin Liska <mliska@suse.cz>
2439 * genconditions.c (write_one_condition): Add const qualifier
2442 2021-11-09 Jeff Law <jeffreyalaw@gmail.com>
2444 * match.pd: New pattern to simplify (1 << n) & M ==/!= 0 for M
2447 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
2449 * config/rs6000/rs6000-builtin-new.def (VEC_INIT_V16QI): Use
2450 escape-newline support.
2451 (VEC_INIT_V4SI): Likewise.
2452 (VEC_INIT_V8HI): Likewise.
2453 (PACK_V1TI): Likewise.
2455 (VFIRSTMISMATCHOREOSINDEX_V16QI): Likewise.
2456 (VFIRSTMISMATCHOREOSINDEX_V8HI): Likewise.
2457 (VFIRSTMISMATCHOREOSINDEX_V4SI): Likewise.
2461 (FMAF128_ODD): Likewise.
2462 (VSCEQPUO): Likewise.
2464 (VSIEQPF): Likewise.
2465 (VSTDCQP): Likewise.
2466 (PACK_TD): Likewise.
2467 (TABORTDC): Likewise.
2468 (TABORTDCI): Likewise.
2469 (SE_LXVRBX): Likewise.
2470 (SE_LXVRHX): Likewise.
2471 (SE_LXVRWX): Likewise.
2472 (SE_LXVRDX): Likewise.
2473 (VREPLACE_UN_UV2DI): Likewise.
2474 (VREPLACE_UN_UV4SI): Likewise.
2475 (VREPLACE_UN_V2DI): Likewise.
2476 (VREPLACE_ELT_UV2DI): Likewise.
2477 (VREPLACE_ELT_V2DI): Likewise.
2478 (ZE_LXVRBX): Likewise.
2479 (ZE_LXVRHX): Likewise.
2480 (ZE_LXVRWX): Likewise.
2481 (ZE_LXVRDX): Likewise.
2483 (CNTLZDM): Likewise.
2484 (CNTTZDM): Likewise.
2487 (PMXVBF16GER2): Likewise.
2488 (PMXVBF16GER2_INTERNAL): Likewise.
2489 (PMXVBF16GER2NN): Likewise.
2490 (PMXVBF16GER2NN_INTERNAL): Likewise.
2491 (PMXVBF16GER2NP): Likewise.
2492 (PMXVBF16GER2NP_INTERNAL): Likewise.
2493 (PMXVBF16GER2PN): Likewise.
2494 (PMXVBF16GER2PN_INTERNAL): Likewise.
2495 (PMXVBF16GER2PP): Likewise.
2496 (PMXVBF16GER2PP_INTERNAL): Likewise.
2497 (PMXVF16GER2): Likewise.
2498 (PMXVF16GER2_INTERNAL): Likewise.
2499 (PMXVF16GER2NN): Likewise.
2500 (PMXVF16GER2NN_INTERNAL): Likewise.
2501 (PMXVF16GER2NP): Likewise.
2502 (PMXVF16GER2NP_INTERNAL): Likewise.
2503 (PMXVF16GER2PN): Likewise.
2504 (PMXVF16GER2PN_INTERNAL): Likewise.
2505 (PMXVF16GER2PP): Likewise.
2506 (PMXVF16GER2PP_INTERNAL): Likewise.
2507 (PMXVF32GER_INTERNAL): Likewise.
2508 (PMXVF32GERNN): Likewise.
2509 (PMXVF32GERNN_INTERNAL): Likewise.
2510 (PMXVF32GERNP): Likewise.
2511 (PMXVF32GERNP_INTERNAL): Likewise.
2512 (PMXVF32GERPN): Likewise.
2513 (PMXVF32GERPN_INTERNAL): Likewise.
2514 (PMXVF32GERPP): Likewise.
2515 (PMXVF32GERPP_INTERNAL): Likewise.
2516 (PMXVF64GER): Likewise.
2517 (PMXVF64GER_INTERNAL): Likewise.
2518 (PMXVF64GERNN): Likewise.
2519 (PMXVF64GERNN_INTERNAL): Likewise.
2520 (PMXVF64GERNP): Likewise.
2521 (PMXVF64GERNP_INTERNAL): Likewise.
2522 (PMXVF64GERPN): Likewise.
2523 (PMXVF64GERPN_INTERNAL): Likewise.
2524 (PMXVF64GERPP): Likewise.
2525 (PMXVF64GERPP_INTERNAL): Likewise.
2526 (PMXVI16GER2): Likewise.
2527 (PMXVI16GER2_INTERNAL): Likewise.
2528 (PMXVI16GER2PP): Likewise.
2529 (PMXVI16GER2PP_INTERNAL): Likewise.
2530 (PMXVI16GER2S): Likewise.
2531 (PMXVI16GER2S_INTERNAL): Likewise.
2532 (PMXVI16GER2SPP): Likewise.
2533 (PMXVI16GER2SPP_INTERNAL): Likewise.
2534 (PMXVI4GER8): Likewise.
2535 (PMXVI4GER8_INTERNAL): Likewise.
2536 (PMXVI4GER8PP): Likewise.
2537 (PMXVI4GER8PP_INTERNAL): Likewise.
2538 (PMXVI8GER4): Likewise.
2539 (PMXVI8GER4_INTERNAL): Likewise.
2540 (PMXVI8GER4PP): Likewise.
2541 (PMXVI8GER4PP_INTERNAL): Likewise.
2542 (PMXVI8GER4SPP): Likewise.
2543 (PMXVI8GER4SPP_INTERNAL): Likewise.
2544 * config/rs6000/rs6000-gen-builtins.c (MAXLINES): New macro.
2545 (linebuf): Increase size.
2546 (lines): New variable.
2547 (lastline): Likewise.
2548 (real_line_pos): New function.
2549 (diag): Change signature.
2550 (bif_diag): Change signature; support escape-newline handling.
2551 (ovld_diag): Likewise.
2552 (fatal): Move earlier.
2553 (consume_whitespace): Adjust diag call.
2554 (advance_line): Add escape-newline handling; call fatal.
2555 (safe_inc_pos): Adjust diag call.
2556 (match_identifier): Likewise.
2557 (match_integer): Likewise.
2558 (match_to_right_bracket): Call fatal instead of diag; adjust diag
2560 (match_basetype): Adjust diag calls.
2561 (match_bracketed_pair): Likewise.
2562 (match_const_restriction): Likewise.
2563 (match_type): Likewise.
2564 (parse_args): Likewise.
2565 (parse_bif_attrs): Likewise.
2566 (complete_vector_type): Likewise.
2567 (complete_base_type): Likewise.
2568 (parse_prototype): Likewise.
2569 (parse_bif_entry): Likewise.
2570 (parse_bif_stanza): Likewise.
2571 (parse_ovld_entry): Likewise.
2572 (parse_ovld_stanza): Likewise.
2573 (main): Allocate buffers for lines[].
2575 2021-11-08 Eric Botcazou <ebotcazou@adacore.com>
2577 * config/i386/i386.c (legitimize_pic_address): Adjust comment and
2578 use the REG argument on the CM_LARGE_PIC code path as well.
2579 * config/i386/predicates.md (gotoff_operand): Do not treat VxWorks
2580 specially with the large code models.
2582 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
2584 * gimple.c (gimple_call_static_chain_flags): Revert the workaround
2585 allowing interposition since issues with binds_to_local_def were
2588 2021-11-08 Andrew MacLeod <amacleod@redhat.com>
2590 PR tree-optimization/103122
2591 * gimple-range.cc (gimple_ranger::range_of_expr): Request the cache
2592 entry with "calulate new values" set to false.
2594 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
2596 * builtins.c (is_simple_builtin): Add builitin_dwarf_cfa
2597 and builtin_return_address.
2598 (builtin_fnspec): Annotate builtin_return,
2599 bulitin_eh_pointer, builtin_eh_filter, builtin_unwind_resume,
2600 builtin_cxa_end_cleanup, builtin_eh_copy_values,
2601 builtin_frame_address, builtin_apply_args,
2602 builtin_asan_before_dynamic_init, builtin_asan_after_dynamic_init,
2603 builtin_prefetch, builtin_dwarf_cfa, builtin_return_addrss
2605 * ipa-pure-const.c (special_builtin_state): Add builtin_dwarf_cfa
2606 and builtin_return_address.
2608 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
2610 PR tree-optimization/103177
2611 * passes.def: Move uncprop after pure/const and modref.
2613 2021-11-08 Martin Jambor <mjambor@suse.cz>
2617 * tree-inline.c (remap_gimple_stmt): Unshare the expression without
2618 location before invoking remap_with_debug_expressions on it.
2619 * ipa-param-manipulation.c
2620 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
2622 2021-11-08 David Edelsohn <dje.gcc@gmail.com>
2624 * config/rs6000/vsx.md (vsx_splat_v4si_di): Revert "wa"
2627 2021-11-08 Richard Biener <rguenther@suse.de>
2629 * cfgloop.c (verify_loop_structure): Use a temporary BB flag
2630 instead of an sbitmap to cache irreducible state.
2632 2021-11-08 Aldy Hernandez <aldyh@redhat.com>
2634 PR tree-optimization/103120
2635 * gimple-range-path.cc (path_range_query::range_defined_in_block):
2636 Bail if there's a cache entry.
2638 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
2640 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Use
2641 rs6000_builtin_decls_x when appropriate.
2642 (add_condition_to_bb): Likewise.
2643 (rs6000_atomic_assign_expand_fenv): Likewise.
2645 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
2647 * config/rs6000/rs6000-call.c (rs6000_new_builtin_decl): New function.
2648 (rs6000_builtin_decl): Call it.
2650 2021-11-08 Martin Liska <mliska@suse.cz>
2652 * cgraph.c (cgraph_node::dump): Dump it from decl.
2654 2021-11-08 Martin Liska <mliska@suse.cz>
2656 PR gcov-profile/100520
2657 * coverage.c (coverage_compute_profile_id): Strip .gk when
2658 compare debug is used.
2659 * system.h (endswith): New function.
2661 2021-11-08 Martin Liska <mliska@suse.cz>
2663 * cgraph.c (cgraph_node::dump): Dump static_chain_decl.
2665 2021-11-08 Thomas Schwinge <thomas@codesourcery.com>
2667 * config/rs6000/rbtree.c: Fix 'Copyright (C) 2020-21' into '2020-2021'
2668 * config/rs6000/rbtree.h: Likewise.
2669 * config/rs6000/rs6000-builtin-new.def: Likewise.
2670 * config/rs6000/rs6000-gen-builtins.c: Likewise.
2671 * config/rs6000/rs6000-overload.def: Likewise.
2673 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
2675 * config/aarch64/aarch64.c (aarch64_ldn_stn_vectors): New function.
2676 (aarch64_address_cost): Use it instead of testing for CImode and
2679 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
2681 * genmodes.c (emit_insn_modes_h): Define NUM_MODE_* macros.
2682 * expmed.h (NUM_MODE_INT): Delete in favor of genmodes definitions.
2683 (NUM_MODE_PARTIAL_INT, NUM_MODE_VECTOR_INT): Likewise.
2684 * real.h (real_format_for_mode): Use NUM_MODE_FLOAT and
2685 NUM_MODE_DECIMAL_FLOAT.
2686 (REAL_MODE_FORMAT): Likewise.
2688 2021-11-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
2690 * tree-vect-loop.c (vect_better_loop_vinfo_p): Change how epilogue loop
2693 2021-11-08 Richard Biener <rguenther@suse.de>
2695 * tree-vectorizer.h (vect_create_loop_vinfo): Add main_loop_info
2697 * tree-vect-loop.c (vect_create_loop_vinfo): Likewise. Set
2698 LOOP_VINFO_ORIG_LOOP_INFO and conditionalize set of
2699 LOOP_VINFO_NITERS_ASSUMPTIONS.
2700 (vect_analyze_loop_1): Adjust.
2701 (vect_analyze_loop): Move loop constraint setting and
2702 SCEV/niter reset here from vect_create_loop_vinfo to perform
2704 (vect_analyze_loop_form): Move dumping of symbolic niters
2705 here from vect_create_loop_vinfo.
2707 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
2709 * ipa-modref-tree.h (enum modref_special_parms): New enum.
2710 (struct modref_access_node): update for special parms.
2711 (struct modref_ref_node): Likewise.
2712 (struct modref_parm_map): Likewise.
2713 (struct modref_tree): Likewise.
2714 * ipa-modref.c (dump_access): Likewise.
2715 (get_access): Detect static chain.
2716 (parm_map_for_arg): Take tree as arg instead of
2718 (merge_call_side_effects): Compute map for static chain.
2719 (process_fnspec): Update.
2720 (struct escape_point): Remove retslot_arg and static_chain_arg.
2721 (analyze_parms): Update.
2722 (compute_parm_map): Update.
2723 (propagate_unknown_call): Update.
2724 (modref_propagate_in_scc): Update.
2725 (modref_merge_call_site_flags): Update.
2726 (ipa_merge_modref_summary_after_inlining): Update.
2727 * tree-ssa-alias.c (modref_may_conflict): Handle static chain.
2728 * ipa-modref-tree.c (test_merge): Update.
2730 2021-11-08 Haochen Gui <guihaoc@gcc.gnu.org>
2732 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin): Disable
2733 gimple fold for VSX_BUILTIN_XVMINDP, ALTIVEC_BUILTIN_VMINFP,
2734 VSX_BUILTIN_XVMAXDP, ALTIVEC_BUILTIN_VMAXFP when fast-math is not
2737 2021-11-08 liuhongt <hongtao.liu@intel.com>
2739 PR tree-optimization/103077
2740 * doc/invoke.texi (Options That Control Optimization):
2741 Update documentation for -ftree-loop-vectorize and
2742 -ftree-slp-vectorize which are enabled by default at -02.
2744 2021-11-08 liuhongt <hongtao.liu@intel.com>
2747 * match.pd (Simplifcation (trunc)copysign((extend)a, (extend)b)
2748 to .COPYSIGN (a, b)): Add !HONOR_SNANS.
2750 2021-11-08 liuhongt <hongtao.liu@intel.com>
2753 * match.pd: Simplify
2754 (trunc)fma ((extend)a, (extend)b, (extend)c) to IFN_FMA (a, b,
2755 c) under flag_unsafe_math_optimizations.
2757 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
2760 * params.opt (modref-max-depth): Add range.
2761 (modref-max-adjustments): Fix range.
2763 2021-11-07 Aldy Hernandez <aldyh@redhat.com>
2765 * tree-pass.h (make_pass_vrp_threader): Remove.
2766 * tree-ssa-threadbackward.c
2767 (back_threader_profitability::profitable_path_p): Remove
2768 ASSERT_EXPR references.
2769 * tree-ssa-threadedge.c (jt_state::register_equivs_stmt): Same.
2770 * tree-vrp.c (vrp_folder::simplify_casted_conds): Same.
2771 (execute_vrp): Same.
2772 (class hybrid_threader): Remove.
2773 (hybrid_threader::hybrid_threader): Remove.
2774 (hybrid_threader::~hybrid_threader): Remove.
2775 (hybrid_threader::before_dom_children): Remove.
2776 (hybrid_threader::after_dom_children): Remove.
2777 (execute_vrp_threader): Remove.
2778 (class pass_vrp_threader): Remove.
2779 (make_pass_vrp_threader): Remove.
2781 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
2783 * gimple.c (gimple_call_arg_flags): Use interposable_eaf_flags.
2784 (gimple_call_retslot_flags): Likewise.
2785 (gimple_call_static_chain_flags): Likewise.
2786 * ipa-modref.c (remove_useless_eaf_flags): Do not remove everything for
2788 (modref_summary::useful_p): Likewise.
2789 (modref_summary_lto::useful_p): Likewise.
2790 (analyze_parms): Do not give up on NOVOPS.
2791 (analyze_function): When dumping report chnages in EAF flags
2792 between IPA and local pass.
2793 (modref_merge_call_site_flags): Compute implicit eaf flags
2794 based on callee ecf_flags and fnspec; if the function does not
2795 bind to current defs use interposable_eaf_flags.
2796 (modref_propagate_flags_in_scc): Update.
2797 * ipa-modref.h (interposable_eaf_flags): New function.
2799 2021-11-07 Bill Schmidt <wschmidt@linux.ibm.com>
2801 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): New
2803 (rs6000_invalid_new_builtin): New function.
2804 (rs6000_expand_builtin): Call rs6000_expand_new_builtin.
2805 (rs6000_expand_ldst_mask): New function.
2806 (new_cpu_expand_builtin): Likewise.
2807 (elemrev_icode): Likewise.
2808 (ldv_expand_builtin): Likewise.
2809 (lxvrse_expand_builtin): Likewise.
2810 (lxvrze_expand_builtin): Likewise.
2811 (stv_expand_builtin): Likewise.
2812 (new_mma_expand_builtin): Likewise.
2813 (new_htm_spr_num): Likewise.
2814 (new_htm_expand_builtin): Likewise.
2815 (rs6000_expand_new_builtin): Likewise.
2816 (rs6000_init_builtins): Initialize altivec_builtin_mask_for_load.
2818 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
2820 * ipa-modref.c (modref_lattice): Add do_dataflow,
2821 changed and propagate_to fields.
2822 (modref_lattice::release): Free propagate_to
2823 (modref_lattice::merge): Do not give up early on unknown
2825 (modref_lattice::merge_deref): Likewise.
2826 (modref_eaf_analysis): Update toplevel comment.
2827 (modref_eaf_analysis::analyze_ssa_name): Record postponned ssa names;
2828 do optimistic dataflow initialization.
2829 (modref_eaf_analysis::merge_with_ssa_name): Build dataflow graph.
2830 (modref_eaf_analysis::propagate): New member function.
2831 (analyze_parms): Update to new API of modref_eaf_analysis.
2833 2021-11-06 Jan Hubicka <hubicka@ucw.cz>
2835 * cgraph.h (cgraph_node::can_be_discarded_p): Do not
2836 return true on functions from other partition.
2838 2021-11-06 David Edelsohn <dje.gcc@gmail.com>
2840 * config/rs6000/vsx.md (vsx_splat_v4si): Change constraints to "wa".
2841 (vsx_splat_v4si_di): Change constraint to "wa".
2843 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
2845 PR tree-optimization/103061
2846 * value-relation.cc (path_oracle::path_oracle): Initialize
2848 (path_oracle::killing_def): Set m_killed_defs.
2849 (path_oracle::query_relation): Do not look at the root oracle for
2851 * value-relation.h (class path_oracle): Add m_killed_defs.
2853 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
2855 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
2856 Remove gotos and other cleanups.
2858 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
2861 * ipa-modref-tree.h (modref_tree::insert): Do nothing for
2862 paradoxical and zero sized accesses.
2864 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
2867 * ipa-modref-tree.h (struct modref_access_node): Avoid left shift
2870 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
2872 * config/darwin.h (ASM_GENERATE_INTERNAL_LABEL): Add LTRAMP
2873 to the list of symbol prefixes that must be made linker-
2876 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
2878 * config.host: Add support for aarch64-*-darwin.
2879 * config/aarch64/host-aarch64-darwin.c: New file.
2880 * config/aarch64/x-darwin: New file.
2882 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
2884 PR tree-optimization/103093
2885 * gimple-range-gori.cc (range_def_chain::get_imports): Remove assert.
2887 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
2889 PR tree-optimization/102943
2890 * gimple-range-cache.cc (class update_list): New.
2891 (update_list::add): Replace add_to_update.
2892 (update_list::pop): New.
2893 (ranger_cache::ranger_cache): Adjust.
2894 (ranger_cache::~ranger_cache): Adjust.
2895 (ranger_cache::add_to_update): Delete.
2896 (ranger_cache::propagate_cache): Adjust to new class.
2897 (ranger_cache::propagate_updated_value): Ditto.
2898 (ranger_cache::fill_block_cache): Ditto.
2899 * gimple-range-cache.h (class ranger_cache): Adjust to update class.
2901 2021-11-05 Richard Biener <rguenther@suse.de>
2903 * tree-vect-loop.c (vect_analyze_loop): Remove obsolete
2904 comment and expand on another one. Combine nested if.
2906 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
2909 * config/pa/pa-modes.def: Add OImode integer type.
2910 * config/pa/pa.c (pa_scalar_mode_supported_p): Allow TImode
2912 * config/pa/pa.h (MIN_UNITS_PER_WORD) Define to MIN_UNITS_PER_WORD
2913 to UNITS_PER_WORD if IN_LIBGCC2.
2914 * config/pa/pa.md (addti3, addvti3, subti3, subvti3, negti2,
2915 negvti2, ashlti3, shrpd_internal): New patterns.
2916 Change some multi instruction types to multi.
2918 2021-11-05 Jakub Jelinek <jakub@redhat.com>
2921 * config/i386/i386.h
2922 (stringop_algs::stringop_strategy::stringop_strategy): Make the ctor
2925 2021-11-05 Wilco Dijkstra <wdijkstr@arm.com>
2928 * config/aarch64/aarch64.c (aarch64_mov_operand_p): Strip the salt
2930 * config/aarch64/constraints.md: Support const in Usw.
2932 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
2934 * config/pa/pa.h (PREFERRED_DEBUGGING_TYPE): Define to DWARF2_DEBUG.
2935 * config/pa/pa64-hpux.h (PREFERRED_DEBUGGING_TYPE): Remove define.
2937 2021-11-05 Richard Biener <rguenther@suse.de>
2939 * tree-vectorizer.h (vec_info_shared::n_stmts): Add.
2940 (LOOP_VINFO_N_STMTS): Likewise.
2941 (vec_info_for_bb): Remove unused function.
2942 * tree-vectorizer.c (vec_info_shared::vec_info_shared):
2943 Initialize n_stmts member.
2944 * tree-vect-loop.c: Remove INCLUDE_FUNCTIONAL.
2945 (vect_create_loop_vinfo): Do not set loop->aux.
2946 (vect_analyze_loop_2): Do not get n_stmts as argument,
2947 instead use LOOP_VINFO_N_STMTS. Set LOOP_VINFO_VECTORIZABLE_P
2949 (vect_analyze_loop_1): Remove callback, get the mode iterator
2950 and autodetected_vector_mode as argument, advancing the
2951 iterator and initializing autodetected_vector_mode here.
2952 (vect_analyze_loop): Split analysis loop into two, first
2953 processing main loops only and then epilogues.
2955 2021-11-05 Martin Jambor <mjambor@suse.cz>
2957 * ipa-prop.c (compute_complex_assign_jump_func): Remove
2958 unnecessary check for RECORD_TYPE.
2960 2021-11-05 Gerald Pfeifer <gerald@pfeifer.com>
2962 * doc/install.texi2html: Do not generate old.html any longer.
2964 2021-11-05 Martin Liska <mliska@suse.cz>
2967 * opts.c (finish_options): Reset flag_gtoggle when it is used.
2969 2021-11-05 Jakub Jelinek <jakub@redhat.com>
2972 * dwarf2out.c (add_const_value_attribute): Add MODE argument, use it
2973 in CONST_WIDE_INT handling. Adjust recursive calls.
2974 (add_location_or_const_value_attribute): Pass DECL_MODE (decl) to
2975 new add_const_value_attribute argument.
2976 (tree_add_const_value_attribute): Pass TYPE_MODE (type) to new
2977 add_const_value_attribute argument.
2979 2021-11-05 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
2981 * config/vx-common.h: Test value of TARGET_VXWORKS7 rather
2984 2021-11-05 Richard Biener <rguenther@suse.de>
2986 * tree-vectorizer.h (struct vect_loop_form_info): New.
2987 (vect_analyze_loop_form): Adjust.
2988 (vect_create_loop_vinfo): New.
2989 * tree-parloops.c (gather_scalar_reductions): Adjust for
2990 vect_analyze_loop_form API change.
2991 * tree-vect-loop.c: Include <functional>.
2992 (vect_analyze_loop_form_1): Rename to vect_analyze_loop_form,
2993 take struct vect_loop_form_info as output parameter and adjust.
2994 (vect_analyze_loop_form): Rename to vect_create_loop_vinfo and
2995 split out call to the original vect_analyze_loop_form_1.
2996 (vect_reanalyze_as_main_loop): Rename to...
2997 (vect_analyze_loop_1): ... this, factor out the call to
2998 vect_analyze_loop_form and generalize to be able to use it twice ...
2999 (vect_analyze_loop): ... here. Perform vect_analyze_loop_form
3002 2021-11-05 Xionghu Luo <luoxhu@linux.ibm.com>
3005 * config/rs6000/fusion.md: Regenerate.
3006 * config/rs6000/genfusion.pl: Fix incorrect clobber constraint.
3008 2021-11-04 Andreas Krebbel <krebbel@linux.ibm.com>
3010 * config/s390/s390.h (STACK_CHECK_MOVING_SP): New macro
3013 2021-11-04 Tamar Christina <tamar.christina@arm.com>
3015 * config/aarch64/aarch64-builtins.c
3016 (aarch64_general_gimple_fold_builtin): Add ashl, sshl, ushl, ashr,
3017 ashr_simd, lshr, lshr_simd.
3018 * config/aarch64/aarch64-simd-builtins.def (lshr): Use USHIFTIMM.
3019 * config/aarch64/arm_neon.h (vshr_n_u8, vshr_n_u16, vshr_n_u32,
3020 vshrq_n_u8, vshrq_n_u16, vshrq_n_u32, vshrq_n_u64): Fix type hack.
3022 2021-11-04 Tamar Christina <tamar.christina@arm.com>
3024 * match.pd: New negate+shift pattern.
3026 2021-11-04 Andrew MacLeod <amacleod@redhat.com>
3028 PR tree-optimization/103079
3029 * gimple-range-gori.cc (gimple_range_calc_op1): Treat undefined as
3031 (gimple_range_calc_op2): Ditto.
3033 2021-11-04 Martin Jambor <mjambor@suse.cz>
3036 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
3037 members remap_with_debug_expressions, m_dead_ssa_debug_equiv,
3038 m_dead_stmt_debug_equiv and prepare_debug_expressions. Added
3039 parameter to mark_dead_statements.
3040 * ipa-param-manipulation.c: Include tree-phinodes.h and cfgexpand.h.
3041 (ipa_param_body_adjustments::mark_dead_statements): New parameter
3042 debugstack, push into it all SSA names used in debug statements,
3043 produce m_dead_ssa_debug_equiv mapping for the removed param.
3044 (replace_with_mapped_expr): New function.
3045 (ipa_param_body_adjustments::remap_with_debug_expressions): Likewise.
3046 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
3047 (ipa_param_body_adjustments::common_initialization): Gather and
3048 procecc SSA which will be removed but are in debug statements. Simplify.
3049 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
3051 * tree-inline.c (remap_gimple_stmt): Create a debug bind when possible
3052 when avoiding a copy of an unnecessary statement. Remap removed SSA
3053 names in existing debug statements.
3054 (tree_function_versioning): Do not create DEBUG_EXPR_DECL for removed
3055 parameters if we have already done so.
3057 2021-11-04 Jan Hubicka <hubicka@ucw.cz>
3060 * gimple.c (gimple_call_static_chain_flags): Handle case when
3061 nested function does not bind locally.
3063 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
3065 * config/aarch64/aarch64.c (aarch64_function_value): Generate
3066 a register rtx for Neon vector-tuple modes.
3067 (aarch64_layout_arg): Likewise.
3069 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
3071 * lower-subreg.c (simple_move): Prevent decomposition if
3072 modes are not tieable.
3074 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
3075 Richard Sandiford <richard.sandiford@arm.com>
3077 * config/aarch64/aarch64-builtins.c (v2x8qi_UP): Define.
3078 (v2x4hi_UP): Likewise.
3079 (v2x4hf_UP): Likewise.
3080 (v2x4bf_UP): Likewise.
3081 (v2x2si_UP): Likewise.
3082 (v2x2sf_UP): Likewise.
3083 (v2x1di_UP): Likewise.
3084 (v2x1df_UP): Likewise.
3085 (v2x16qi_UP): Likewise.
3086 (v2x8hi_UP): Likewise.
3087 (v2x8hf_UP): Likewise.
3088 (v2x8bf_UP): Likewise.
3089 (v2x4si_UP): Likewise.
3090 (v2x4sf_UP): Likewise.
3091 (v2x2di_UP): Likewise.
3092 (v2x2df_UP): Likewise.
3093 (v3x8qi_UP): Likewise.
3094 (v3x4hi_UP): Likewise.
3095 (v3x4hf_UP): Likewise.
3096 (v3x4bf_UP): Likewise.
3097 (v3x2si_UP): Likewise.
3098 (v3x2sf_UP): Likewise.
3099 (v3x1di_UP): Likewise.
3100 (v3x1df_UP): Likewise.
3101 (v3x16qi_UP): Likewise.
3102 (v3x8hi_UP): Likewise.
3103 (v3x8hf_UP): Likewise.
3104 (v3x8bf_UP): Likewise.
3105 (v3x4si_UP): Likewise.
3106 (v3x4sf_UP): Likewise.
3107 (v3x2di_UP): Likewise.
3108 (v3x2df_UP): Likewise.
3109 (v4x8qi_UP): Likewise.
3110 (v4x4hi_UP): Likewise.
3111 (v4x4hf_UP): Likewise.
3112 (v4x4bf_UP): Likewise.
3113 (v4x2si_UP): Likewise.
3114 (v4x2sf_UP): Likewise.
3115 (v4x1di_UP): Likewise.
3116 (v4x1df_UP): Likewise.
3117 (v4x16qi_UP): Likewise.
3118 (v4x8hi_UP): Likewise.
3119 (v4x8hf_UP): Likewise.
3120 (v4x8bf_UP): Likewise.
3121 (v4x4si_UP): Likewise.
3122 (v4x4sf_UP): Likewise.
3123 (v4x2di_UP): Likewise.
3124 (v4x2df_UP): Likewise.
3125 (TYPES_GETREGP): Delete.
3126 (TYPES_SETREGP): Likewise.
3127 (TYPES_LOADSTRUCT_U): Define.
3128 (TYPES_LOADSTRUCT_P): Likewise.
3129 (TYPES_LOADSTRUCT_LANE_U): Likewise.
3130 (TYPES_LOADSTRUCT_LANE_P): Likewise.
3131 (TYPES_STORE1P): Move for consistency.
3132 (TYPES_STORESTRUCT_U): Define.
3133 (TYPES_STORESTRUCT_P): Likewise.
3134 (TYPES_STORESTRUCT_LANE_U): Likewise.
3135 (TYPES_STORESTRUCT_LANE_P): Likewise.
3136 (aarch64_simd_tuple_types): Define.
3137 (aarch64_lookup_simd_builtin_type): Handle tuple type lookup.
3138 (aarch64_init_simd_builtin_functions): Update frontend lookup
3139 for builtin functions after handling arm_neon.h pragma.
3140 (register_tuple_type): Manually set modes of single-integer
3141 tuple types. Record tuple types.
3142 * config/aarch64/aarch64-modes.def
3143 (ADV_SIMD_D_REG_STRUCT_MODES): Define D-register tuple modes.
3144 (ADV_SIMD_Q_REG_STRUCT_MODES): Define Q-register tuple modes.
3145 (SVE_MODES): Give single-vector modes priority over vector-
3147 (VECTOR_MODES_WITH_PREFIX): Set partial-vector mode order to
3148 be after all single-vector modes.
3149 * config/aarch64/aarch64-simd-builtins.def: Update builtin
3150 generator macros to reflect modifications to the backend
3152 * config/aarch64/aarch64-simd.md (aarch64_simd_ld2<mode>):
3153 Use vector-tuple mode iterator and rename to...
3154 (aarch64_simd_ld2<vstruct_elt>): This.
3155 (aarch64_simd_ld2r<mode>): Use vector-tuple mode iterator and
3157 (aarch64_simd_ld2r<vstruct_elt>): This.
3158 (aarch64_vec_load_lanesoi_lane<mode>): Use vector-tuple mode
3159 iterator and rename to...
3160 (aarch64_vec_load_lanes<mode>_lane<vstruct_elt>): This.
3161 (vec_load_lanesoi<mode>): Use vector-tuple mode iterator and
3163 (vec_load_lanes<mode><vstruct_elt>): This.
3164 (aarch64_simd_st2<mode>): Use vector-tuple mode iterator and
3166 (aarch64_simd_st2<vstruct_elt>): This.
3167 (aarch64_vec_store_lanesoi_lane<mode>): Use vector-tuple mode
3168 iterator and rename to...
3169 (aarch64_vec_store_lanes<mode>_lane<vstruct_elt>): This.
3170 (vec_store_lanesoi<mode>): Use vector-tuple mode iterator and
3172 (vec_store_lanes<mode><vstruct_elt>): This.
3173 (aarch64_simd_ld3<mode>): Use vector-tuple mode iterator and
3175 (aarch64_simd_ld3<vstruct_elt>): This.
3176 (aarch64_simd_ld3r<mode>): Use vector-tuple mode iterator and
3178 (aarch64_simd_ld3r<vstruct_elt>): This.
3179 (aarch64_vec_load_lanesci_lane<mode>): Use vector-tuple mode
3180 iterator and rename to...
3181 (vec_load_lanesci<mode>): This.
3182 (aarch64_simd_st3<mode>): Use vector-tuple mode iterator and
3184 (aarch64_simd_st3<vstruct_elt>): This.
3185 (aarch64_vec_store_lanesci_lane<mode>): Use vector-tuple mode
3186 iterator and rename to...
3187 (vec_store_lanesci<mode>): This.
3188 (aarch64_simd_ld4<mode>): Use vector-tuple mode iterator and
3190 (aarch64_simd_ld4<vstruct_elt>): This.
3191 (aarch64_simd_ld4r<mode>): Use vector-tuple mode iterator and
3193 (aarch64_simd_ld4r<vstruct_elt>): This.
3194 (aarch64_vec_load_lanesxi_lane<mode>): Use vector-tuple mode
3195 iterator and rename to...
3196 (vec_load_lanesxi<mode>): This.
3197 (aarch64_simd_st4<mode>): Use vector-tuple mode iterator and
3199 (aarch64_simd_st4<vstruct_elt>): This.
3200 (aarch64_vec_store_lanesxi_lane<mode>): Use vector-tuple mode
3201 iterator and rename to...
3202 (vec_store_lanesxi<mode>): This.
3203 (mov<mode>): Define for Neon vector-tuple modes.
3204 (aarch64_ld1x3<VALLDIF:mode>): Use vector-tuple mode iterator
3206 (aarch64_ld1x3<vstruct_elt>): This.
3207 (aarch64_ld1_x3_<mode>): Use vector-tuple mode iterator and
3209 (aarch64_ld1_x3_<vstruct_elt>): This.
3210 (aarch64_ld1x4<VALLDIF:mode>): Use vector-tuple mode iterator
3212 (aarch64_ld1x4<vstruct_elt>): This.
3213 (aarch64_ld1_x4_<mode>): Use vector-tuple mode iterator and
3215 (aarch64_ld1_x4_<vstruct_elt>): This.
3216 (aarch64_st1x2<VALLDIF:mode>): Use vector-tuple mode iterator
3218 (aarch64_st1x2<vstruct_elt>): This.
3219 (aarch64_st1_x2_<mode>): Use vector-tuple mode iterator and
3221 (aarch64_st1_x2_<vstruct_elt>): This.
3222 (aarch64_st1x3<VALLDIF:mode>): Use vector-tuple mode iterator
3224 (aarch64_st1x3<vstruct_elt>): This.
3225 (aarch64_st1_x3_<mode>): Use vector-tuple mode iterator and
3227 (aarch64_st1_x3_<vstruct_elt>): This.
3228 (aarch64_st1x4<VALLDIF:mode>): Use vector-tuple mode iterator
3230 (aarch64_st1x4<vstruct_elt>): This.
3231 (aarch64_st1_x4_<mode>): Use vector-tuple mode iterator and
3233 (aarch64_st1_x4_<vstruct_elt>): This.
3234 (*aarch64_mov<mode>): Define for vector-tuple modes.
3235 (*aarch64_be_mov<mode>): Likewise.
3236 (aarch64_ld<VSTRUCT:nregs>r<VALLDIF:mode>): Use vector-tuple
3237 mode iterator and rename to...
3238 (aarch64_ld<nregs>r<vstruct_elt>): This.
3239 (aarch64_ld2<mode>_dreg): Use vector-tuple mode iterator and
3241 (aarch64_ld2<vstruct_elt>_dreg): This.
3242 (aarch64_ld3<mode>_dreg): Use vector-tuple mode iterator and
3244 (aarch64_ld3<vstruct_elt>_dreg): This.
3245 (aarch64_ld4<mode>_dreg): Use vector-tuple mode iterator and
3247 (aarch64_ld4<vstruct_elt>_dreg): This.
3248 (aarch64_ld<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
3249 iterator and rename to...
3250 (aarch64_ld<nregs><vstruct_elt>): Use vector-tuple mode
3251 iterator and rename to...
3252 (aarch64_ld<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
3253 (aarch64_ld1x2<VQ:mode>): Delete.
3254 (aarch64_ld1x2<VDC:mode>): Use vector-tuple mode iterator and
3256 (aarch64_ld1x2<vstruct_elt>): This.
3257 (aarch64_ld<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
3258 tuple mode iterator and rename to...
3259 (aarch64_ld<nregs>_lane<vstruct_elt>): This.
3260 (aarch64_get_dreg<VSTRUCT:mode><VDC:mode>): Delete.
3261 (aarch64_get_qreg<VSTRUCT:mode><VQ:mode>): Likewise.
3262 (aarch64_st2<mode>_dreg): Use vector-tuple mode iterator and
3264 (aarch64_st2<vstruct_elt>_dreg): This.
3265 (aarch64_st3<mode>_dreg): Use vector-tuple mode iterator and
3267 (aarch64_st3<vstruct_elt>_dreg): This.
3268 (aarch64_st4<mode>_dreg): Use vector-tuple mode iterator and
3270 (aarch64_st4<vstruct_elt>_dreg): This.
3271 (aarch64_st<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
3272 iterator and rename to...
3273 (aarch64_st<nregs><vstruct_elt>): This.
3274 (aarch64_st<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
3275 iterator and rename to aarch64_st<nregs><vstruct_elt>.
3276 (aarch64_st<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
3277 tuple mode iterator and rename to...
3278 (aarch64_st<nregs>_lane<vstruct_elt>): This.
3279 (aarch64_set_qreg<VSTRUCT:mode><VQ:mode>): Delete.
3280 (aarch64_simd_ld1<mode>_x2): Use vector-tuple mode iterator
3282 (aarch64_simd_ld1<vstruct_elt>_x2): This.
3283 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
3284 Refactor to include new vector-tuple modes.
3285 (aarch64_classify_vector_mode): Add cases for new vector-
3287 (aarch64_advsimd_partial_struct_mode_p): Define.
3288 (aarch64_advsimd_full_struct_mode_p): Likewise.
3289 (aarch64_advsimd_vector_array_mode): Likewise.
3290 (aarch64_sve_data_mode): Change location in file.
3291 (aarch64_array_mode): Handle case of Neon vector-tuple modes.
3292 (aarch64_hard_regno_nregs): Handle case of partial Neon
3294 (aarch64_classify_address): Refactor to include handling of
3295 Neon vector-tuple modes.
3296 (aarch64_print_operand): Print "d" for "%R" for a partial
3297 Neon vector structure.
3298 (aarch64_expand_vec_perm_1): Use new vector-tuple mode.
3299 (aarch64_modes_tieable_p): Prevent tieing Neon partial struct
3300 modes with scalar machines modes larger than 8 bytes.
3301 (aarch64_can_change_mode_class): Don't allow changes between
3302 partial and full Neon vector-structure modes.
3303 * config/aarch64/arm_neon.h (vst2_lane_f16): Use updated
3304 builtin and remove boiler-plate code for opaque mode.
3305 (vst2_lane_f32): Likewise.
3306 (vst2_lane_f64): Likewise.
3307 (vst2_lane_p8): Likewise.
3308 (vst2_lane_p16): Likewise.
3309 (vst2_lane_p64): Likewise.
3310 (vst2_lane_s8): Likewise.
3311 (vst2_lane_s16): Likewise.
3312 (vst2_lane_s32): Likewise.
3313 (vst2_lane_s64): Likewise.
3314 (vst2_lane_u8): Likewise.
3315 (vst2_lane_u16): Likewise.
3316 (vst2_lane_u32): Likewise.
3317 (vst2_lane_u64): Likewise.
3318 (vst2q_lane_f16): Likewise.
3319 (vst2q_lane_f32): Likewise.
3320 (vst2q_lane_f64): Likewise.
3321 (vst2q_lane_p8): Likewise.
3322 (vst2q_lane_p16): Likewise.
3323 (vst2q_lane_p64): Likewise.
3324 (vst2q_lane_s8): Likewise.
3325 (vst2q_lane_s16): Likewise.
3326 (vst2q_lane_s32): Likewise.
3327 (vst2q_lane_s64): Likewise.
3328 (vst2q_lane_u8): Likewise.
3329 (vst2q_lane_u16): Likewise.
3330 (vst2q_lane_u32): Likewise.
3331 (vst2q_lane_u64): Likewise.
3332 (vst3_lane_f16): Likewise.
3333 (vst3_lane_f32): Likewise.
3334 (vst3_lane_f64): Likewise.
3335 (vst3_lane_p8): Likewise.
3336 (vst3_lane_p16): Likewise.
3337 (vst3_lane_p64): Likewise.
3338 (vst3_lane_s8): Likewise.
3339 (vst3_lane_s16): Likewise.
3340 (vst3_lane_s32): Likewise.
3341 (vst3_lane_s64): Likewise.
3342 (vst3_lane_u8): Likewise.
3343 (vst3_lane_u16): Likewise.
3344 (vst3_lane_u32): Likewise.
3345 (vst3_lane_u64): Likewise.
3346 (vst3q_lane_f16): Likewise.
3347 (vst3q_lane_f32): Likewise.
3348 (vst3q_lane_f64): Likewise.
3349 (vst3q_lane_p8): Likewise.
3350 (vst3q_lane_p16): Likewise.
3351 (vst3q_lane_p64): Likewise.
3352 (vst3q_lane_s8): Likewise.
3353 (vst3q_lane_s16): Likewise.
3354 (vst3q_lane_s32): Likewise.
3355 (vst3q_lane_s64): Likewise.
3356 (vst3q_lane_u8): Likewise.
3357 (vst3q_lane_u16): Likewise.
3358 (vst3q_lane_u32): Likewise.
3359 (vst3q_lane_u64): Likewise.
3360 (vst4_lane_f16): Likewise.
3361 (vst4_lane_f32): Likewise.
3362 (vst4_lane_f64): Likewise.
3363 (vst4_lane_p8): Likewise.
3364 (vst4_lane_p16): Likewise.
3365 (vst4_lane_p64): Likewise.
3366 (vst4_lane_s8): Likewise.
3367 (vst4_lane_s16): Likewise.
3368 (vst4_lane_s32): Likewise.
3369 (vst4_lane_s64): Likewise.
3370 (vst4_lane_u8): Likewise.
3371 (vst4_lane_u16): Likewise.
3372 (vst4_lane_u32): Likewise.
3373 (vst4_lane_u64): Likewise.
3374 (vst4q_lane_f16): Likewise.
3375 (vst4q_lane_f32): Likewise.
3376 (vst4q_lane_f64): Likewise.
3377 (vst4q_lane_p8): Likewise.
3378 (vst4q_lane_p16): Likewise.
3379 (vst4q_lane_p64): Likewise.
3380 (vst4q_lane_s8): Likewise.
3381 (vst4q_lane_s16): Likewise.
3382 (vst4q_lane_s32): Likewise.
3383 (vst4q_lane_s64): Likewise.
3384 (vst4q_lane_u8): Likewise.
3385 (vst4q_lane_u16): Likewise.
3386 (vst4q_lane_u32): Likewise.
3387 (vst4q_lane_u64): Likewise.
3388 (vtbl3_s8): Likewise.
3389 (vtbl3_u8): Likewise.
3390 (vtbl3_p8): Likewise.
3391 (vtbl4_s8): Likewise.
3392 (vtbl4_u8): Likewise.
3393 (vtbl4_p8): Likewise.
3394 (vld1_u8_x3): Likewise.
3395 (vld1_s8_x3): Likewise.
3396 (vld1_u16_x3): Likewise.
3397 (vld1_s16_x3): Likewise.
3398 (vld1_u32_x3): Likewise.
3399 (vld1_s32_x3): Likewise.
3400 (vld1_u64_x3): Likewise.
3401 (vld1_s64_x3): Likewise.
3402 (vld1_f16_x3): Likewise.
3403 (vld1_f32_x3): Likewise.
3404 (vld1_f64_x3): Likewise.
3405 (vld1_p8_x3): Likewise.
3406 (vld1_p16_x3): Likewise.
3407 (vld1_p64_x3): Likewise.
3408 (vld1q_u8_x3): Likewise.
3409 (vld1q_s8_x3): Likewise.
3410 (vld1q_u16_x3): Likewise.
3411 (vld1q_s16_x3): Likewise.
3412 (vld1q_u32_x3): Likewise.
3413 (vld1q_s32_x3): Likewise.
3414 (vld1q_u64_x3): Likewise.
3415 (vld1q_s64_x3): Likewise.
3416 (vld1q_f16_x3): Likewise.
3417 (vld1q_f32_x3): Likewise.
3418 (vld1q_f64_x3): Likewise.
3419 (vld1q_p8_x3): Likewise.
3420 (vld1q_p16_x3): Likewise.
3421 (vld1q_p64_x3): Likewise.
3422 (vld1_u8_x2): Likewise.
3423 (vld1_s8_x2): Likewise.
3424 (vld1_u16_x2): Likewise.
3425 (vld1_s16_x2): Likewise.
3426 (vld1_u32_x2): Likewise.
3427 (vld1_s32_x2): Likewise.
3428 (vld1_u64_x2): Likewise.
3429 (vld1_s64_x2): Likewise.
3430 (vld1_f16_x2): Likewise.
3431 (vld1_f32_x2): Likewise.
3432 (vld1_f64_x2): Likewise.
3433 (vld1_p8_x2): Likewise.
3434 (vld1_p16_x2): Likewise.
3435 (vld1_p64_x2): Likewise.
3436 (vld1q_u8_x2): Likewise.
3437 (vld1q_s8_x2): Likewise.
3438 (vld1q_u16_x2): Likewise.
3439 (vld1q_s16_x2): Likewise.
3440 (vld1q_u32_x2): Likewise.
3441 (vld1q_s32_x2): Likewise.
3442 (vld1q_u64_x2): Likewise.
3443 (vld1q_s64_x2): Likewise.
3444 (vld1q_f16_x2): Likewise.
3445 (vld1q_f32_x2): Likewise.
3446 (vld1q_f64_x2): Likewise.
3447 (vld1q_p8_x2): Likewise.
3448 (vld1q_p16_x2): Likewise.
3449 (vld1q_p64_x2): Likewise.
3450 (vld1_s8_x4): Likewise.
3451 (vld1q_s8_x4): Likewise.
3452 (vld1_s16_x4): Likewise.
3453 (vld1q_s16_x4): Likewise.
3454 (vld1_s32_x4): Likewise.
3455 (vld1q_s32_x4): Likewise.
3456 (vld1_u8_x4): Likewise.
3457 (vld1q_u8_x4): Likewise.
3458 (vld1_u16_x4): Likewise.
3459 (vld1q_u16_x4): Likewise.
3460 (vld1_u32_x4): Likewise.
3461 (vld1q_u32_x4): Likewise.
3462 (vld1_f16_x4): Likewise.
3463 (vld1q_f16_x4): Likewise.
3464 (vld1_f32_x4): Likewise.
3465 (vld1q_f32_x4): Likewise.
3466 (vld1_p8_x4): Likewise.
3467 (vld1q_p8_x4): Likewise.
3468 (vld1_p16_x4): Likewise.
3469 (vld1q_p16_x4): Likewise.
3470 (vld1_s64_x4): Likewise.
3471 (vld1_u64_x4): Likewise.
3472 (vld1_p64_x4): Likewise.
3473 (vld1q_s64_x4): Likewise.
3474 (vld1q_u64_x4): Likewise.
3475 (vld1q_p64_x4): Likewise.
3476 (vld1_f64_x4): Likewise.
3477 (vld1q_f64_x4): Likewise.
3478 (vld2_s64): Likewise.
3479 (vld2_u64): Likewise.
3480 (vld2_f64): Likewise.
3481 (vld2_s8): Likewise.
3482 (vld2_p8): Likewise.
3483 (vld2_p64): Likewise.
3484 (vld2_s16): Likewise.
3485 (vld2_p16): Likewise.
3486 (vld2_s32): Likewise.
3487 (vld2_u8): Likewise.
3488 (vld2_u16): Likewise.
3489 (vld2_u32): Likewise.
3490 (vld2_f16): Likewise.
3491 (vld2_f32): Likewise.
3492 (vld2q_s8): Likewise.
3493 (vld2q_p8): Likewise.
3494 (vld2q_s16): Likewise.
3495 (vld2q_p16): Likewise.
3496 (vld2q_p64): Likewise.
3497 (vld2q_s32): Likewise.
3498 (vld2q_s64): Likewise.
3499 (vld2q_u8): Likewise.
3500 (vld2q_u16): Likewise.
3501 (vld2q_u32): Likewise.
3502 (vld2q_u64): Likewise.
3503 (vld2q_f16): Likewise.
3504 (vld2q_f32): Likewise.
3505 (vld2q_f64): Likewise.
3506 (vld3_s64): Likewise.
3507 (vld3_u64): Likewise.
3508 (vld3_f64): Likewise.
3509 (vld3_s8): Likewise.
3510 (vld3_p8): Likewise.
3511 (vld3_s16): Likewise.
3512 (vld3_p16): Likewise.
3513 (vld3_s32): Likewise.
3514 (vld3_u8): Likewise.
3515 (vld3_u16): Likewise.
3516 (vld3_u32): Likewise.
3517 (vld3_f16): Likewise.
3518 (vld3_f32): Likewise.
3519 (vld3_p64): Likewise.
3520 (vld3q_s8): Likewise.
3521 (vld3q_p8): Likewise.
3522 (vld3q_s16): Likewise.
3523 (vld3q_p16): Likewise.
3524 (vld3q_s32): Likewise.
3525 (vld3q_s64): Likewise.
3526 (vld3q_u8): Likewise.
3527 (vld3q_u16): Likewise.
3528 (vld3q_u32): Likewise.
3529 (vld3q_u64): Likewise.
3530 (vld3q_f16): Likewise.
3531 (vld3q_f32): Likewise.
3532 (vld3q_f64): Likewise.
3533 (vld3q_p64): Likewise.
3534 (vld4_s64): Likewise.
3535 (vld4_u64): Likewise.
3536 (vld4_f64): Likewise.
3537 (vld4_s8): Likewise.
3538 (vld4_p8): Likewise.
3539 (vld4_s16): Likewise.
3540 (vld4_p16): Likewise.
3541 (vld4_s32): Likewise.
3542 (vld4_u8): Likewise.
3543 (vld4_u16): Likewise.
3544 (vld4_u32): Likewise.
3545 (vld4_f16): Likewise.
3546 (vld4_f32): Likewise.
3547 (vld4_p64): Likewise.
3548 (vld4q_s8): Likewise.
3549 (vld4q_p8): Likewise.
3550 (vld4q_s16): Likewise.
3551 (vld4q_p16): Likewise.
3552 (vld4q_s32): Likewise.
3553 (vld4q_s64): Likewise.
3554 (vld4q_u8): Likewise.
3555 (vld4q_u16): Likewise.
3556 (vld4q_u32): Likewise.
3557 (vld4q_u64): Likewise.
3558 (vld4q_f16): Likewise.
3559 (vld4q_f32): Likewise.
3560 (vld4q_f64): Likewise.
3561 (vld4q_p64): Likewise.
3562 (vld2_dup_s8): Likewise.
3563 (vld2_dup_s16): Likewise.
3564 (vld2_dup_s32): Likewise.
3565 (vld2_dup_f16): Likewise.
3566 (vld2_dup_f32): Likewise.
3567 (vld2_dup_f64): Likewise.
3568 (vld2_dup_u8): Likewise.
3569 (vld2_dup_u16): Likewise.
3570 (vld2_dup_u32): Likewise.
3571 (vld2_dup_p8): Likewise.
3572 (vld2_dup_p16): Likewise.
3573 (vld2_dup_p64): Likewise.
3574 (vld2_dup_s64): Likewise.
3575 (vld2_dup_u64): Likewise.
3576 (vld2q_dup_s8): Likewise.
3577 (vld2q_dup_p8): Likewise.
3578 (vld2q_dup_s16): Likewise.
3579 (vld2q_dup_p16): Likewise.
3580 (vld2q_dup_s32): Likewise.
3581 (vld2q_dup_s64): Likewise.
3582 (vld2q_dup_u8): Likewise.
3583 (vld2q_dup_u16): Likewise.
3584 (vld2q_dup_u32): Likewise.
3585 (vld2q_dup_u64): Likewise.
3586 (vld2q_dup_f16): Likewise.
3587 (vld2q_dup_f32): Likewise.
3588 (vld2q_dup_f64): Likewise.
3589 (vld2q_dup_p64): Likewise.
3590 (vld3_dup_s64): Likewise.
3591 (vld3_dup_u64): Likewise.
3592 (vld3_dup_f64): Likewise.
3593 (vld3_dup_s8): Likewise.
3594 (vld3_dup_p8): Likewise.
3595 (vld3_dup_s16): Likewise.
3596 (vld3_dup_p16): Likewise.
3597 (vld3_dup_s32): Likewise.
3598 (vld3_dup_u8): Likewise.
3599 (vld3_dup_u16): Likewise.
3600 (vld3_dup_u32): Likewise.
3601 (vld3_dup_f16): Likewise.
3602 (vld3_dup_f32): Likewise.
3603 (vld3_dup_p64): Likewise.
3604 (vld3q_dup_s8): Likewise.
3605 (vld3q_dup_p8): Likewise.
3606 (vld3q_dup_s16): Likewise.
3607 (vld3q_dup_p16): Likewise.
3608 (vld3q_dup_s32): Likewise.
3609 (vld3q_dup_s64): Likewise.
3610 (vld3q_dup_u8): Likewise.
3611 (vld3q_dup_u16): Likewise.
3612 (vld3q_dup_u32): Likewise.
3613 (vld3q_dup_u64): Likewise.
3614 (vld3q_dup_f16): Likewise.
3615 (vld3q_dup_f32): Likewise.
3616 (vld3q_dup_f64): Likewise.
3617 (vld3q_dup_p64): Likewise.
3618 (vld4_dup_s64): Likewise.
3619 (vld4_dup_u64): Likewise.
3620 (vld4_dup_f64): Likewise.
3621 (vld4_dup_s8): Likewise.
3622 (vld4_dup_p8): Likewise.
3623 (vld4_dup_s16): Likewise.
3624 (vld4_dup_p16): Likewise.
3625 (vld4_dup_s32): Likewise.
3626 (vld4_dup_u8): Likewise.
3627 (vld4_dup_u16): Likewise.
3628 (vld4_dup_u32): Likewise.
3629 (vld4_dup_f16): Likewise.
3630 (vld4_dup_f32): Likewise.
3631 (vld4_dup_p64): Likewise.
3632 (vld4q_dup_s8): Likewise.
3633 (vld4q_dup_p8): Likewise.
3634 (vld4q_dup_s16): Likewise.
3635 (vld4q_dup_p16): Likewise.
3636 (vld4q_dup_s32): Likewise.
3637 (vld4q_dup_s64): Likewise.
3638 (vld4q_dup_u8): Likewise.
3639 (vld4q_dup_u16): Likewise.
3640 (vld4q_dup_u32): Likewise.
3641 (vld4q_dup_u64): Likewise.
3642 (vld4q_dup_f16): Likewise.
3643 (vld4q_dup_f32): Likewise.
3644 (vld4q_dup_f64): Likewise.
3645 (vld4q_dup_p64): Likewise.
3646 (vld2_lane_u8): Likewise.
3647 (vld2_lane_u16): Likewise.
3648 (vld2_lane_u32): Likewise.
3649 (vld2_lane_u64): Likewise.
3650 (vld2_lane_s8): Likewise.
3651 (vld2_lane_s16): Likewise.
3652 (vld2_lane_s32): Likewise.
3653 (vld2_lane_s64): Likewise.
3654 (vld2_lane_f16): Likewise.
3655 (vld2_lane_f32): Likewise.
3656 (vld2_lane_f64): Likewise.
3657 (vld2_lane_p8): Likewise.
3658 (vld2_lane_p16): Likewise.
3659 (vld2_lane_p64): Likewise.
3660 (vld2q_lane_u8): Likewise.
3661 (vld2q_lane_u16): Likewise.
3662 (vld2q_lane_u32): Likewise.
3663 (vld2q_lane_u64): Likewise.
3664 (vld2q_lane_s8): Likewise.
3665 (vld2q_lane_s16): Likewise.
3666 (vld2q_lane_s32): Likewise.
3667 (vld2q_lane_s64): Likewise.
3668 (vld2q_lane_f16): Likewise.
3669 (vld2q_lane_f32): Likewise.
3670 (vld2q_lane_f64): Likewise.
3671 (vld2q_lane_p8): Likewise.
3672 (vld2q_lane_p16): Likewise.
3673 (vld2q_lane_p64): Likewise.
3674 (vld3_lane_u8): Likewise.
3675 (vld3_lane_u16): Likewise.
3676 (vld3_lane_u32): Likewise.
3677 (vld3_lane_u64): Likewise.
3678 (vld3_lane_s8): Likewise.
3679 (vld3_lane_s16): Likewise.
3680 (vld3_lane_s32): Likewise.
3681 (vld3_lane_s64): Likewise.
3682 (vld3_lane_f16): Likewise.
3683 (vld3_lane_f32): Likewise.
3684 (vld3_lane_f64): Likewise.
3685 (vld3_lane_p8): Likewise.
3686 (vld3_lane_p16): Likewise.
3687 (vld3_lane_p64): Likewise.
3688 (vld3q_lane_u8): Likewise.
3689 (vld3q_lane_u16): Likewise.
3690 (vld3q_lane_u32): Likewise.
3691 (vld3q_lane_u64): Likewise.
3692 (vld3q_lane_s8): Likewise.
3693 (vld3q_lane_s16): Likewise.
3694 (vld3q_lane_s32): Likewise.
3695 (vld3q_lane_s64): Likewise.
3696 (vld3q_lane_f16): Likewise.
3697 (vld3q_lane_f32): Likewise.
3698 (vld3q_lane_f64): Likewise.
3699 (vld3q_lane_p8): Likewise.
3700 (vld3q_lane_p16): Likewise.
3701 (vld3q_lane_p64): Likewise.
3702 (vld4_lane_u8): Likewise.
3703 (vld4_lane_u16): Likewise.
3704 (vld4_lane_u32): Likewise.
3705 (vld4_lane_u64): Likewise.
3706 (vld4_lane_s8): Likewise.
3707 (vld4_lane_s16): Likewise.
3708 (vld4_lane_s32): Likewise.
3709 (vld4_lane_s64): Likewise.
3710 (vld4_lane_f16): Likewise.
3711 (vld4_lane_f32): Likewise.
3712 (vld4_lane_f64): Likewise.
3713 (vld4_lane_p8): Likewise.
3714 (vld4_lane_p16): Likewise.
3715 (vld4_lane_p64): Likewise.
3716 (vld4q_lane_u8): Likewise.
3717 (vld4q_lane_u16): Likewise.
3718 (vld4q_lane_u32): Likewise.
3719 (vld4q_lane_u64): Likewise.
3720 (vld4q_lane_s8): Likewise.
3721 (vld4q_lane_s16): Likewise.
3722 (vld4q_lane_s32): Likewise.
3723 (vld4q_lane_s64): Likewise.
3724 (vld4q_lane_f16): Likewise.
3725 (vld4q_lane_f32): Likewise.
3726 (vld4q_lane_f64): Likewise.
3727 (vld4q_lane_p8): Likewise.
3728 (vld4q_lane_p16): Likewise.
3729 (vld4q_lane_p64): Likewise.
3730 (vqtbl2_s8): Likewise.
3731 (vqtbl2_u8): Likewise.
3732 (vqtbl2_p8): Likewise.
3733 (vqtbl2q_s8): Likewise.
3734 (vqtbl2q_u8): Likewise.
3735 (vqtbl2q_p8): Likewise.
3736 (vqtbl3_s8): Likewise.
3737 (vqtbl3_u8): Likewise.
3738 (vqtbl3_p8): Likewise.
3739 (vqtbl3q_s8): Likewise.
3740 (vqtbl3q_u8): Likewise.
3741 (vqtbl3q_p8): Likewise.
3742 (vqtbl4_s8): Likewise.
3743 (vqtbl4_u8): Likewise.
3744 (vqtbl4_p8): Likewise.
3745 (vqtbl4q_s8): Likewise.
3746 (vqtbl4q_u8): Likewise.
3747 (vqtbl4q_p8): Likewise.
3748 (vqtbx2_s8): Likewise.
3749 (vqtbx2_u8): Likewise.
3750 (vqtbx2_p8): Likewise.
3751 (vqtbx2q_s8): Likewise.
3752 (vqtbx2q_u8): Likewise.
3753 (vqtbx2q_p8): Likewise.
3754 (vqtbx3_s8): Likewise.
3755 (vqtbx3_u8): Likewise.
3756 (vqtbx3_p8): Likewise.
3757 (vqtbx3q_s8): Likewise.
3758 (vqtbx3q_u8): Likewise.
3759 (vqtbx3q_p8): Likewise.
3760 (vqtbx4_s8): Likewise.
3761 (vqtbx4_u8): Likewise.
3762 (vqtbx4_p8): Likewise.
3763 (vqtbx4q_s8): Likewise.
3764 (vqtbx4q_u8): Likewise.
3765 (vqtbx4q_p8): Likewise.
3766 (vst1_s64_x2): Likewise.
3767 (vst1_u64_x2): Likewise.
3768 (vst1_f64_x2): Likewise.
3769 (vst1_s8_x2): Likewise.
3770 (vst1_p8_x2): Likewise.
3771 (vst1_s16_x2): Likewise.
3772 (vst1_p16_x2): Likewise.
3773 (vst1_s32_x2): Likewise.
3774 (vst1_u8_x2): Likewise.
3775 (vst1_u16_x2): Likewise.
3776 (vst1_u32_x2): Likewise.
3777 (vst1_f16_x2): Likewise.
3778 (vst1_f32_x2): Likewise.
3779 (vst1_p64_x2): Likewise.
3780 (vst1q_s8_x2): Likewise.
3781 (vst1q_p8_x2): Likewise.
3782 (vst1q_s16_x2): Likewise.
3783 (vst1q_p16_x2): Likewise.
3784 (vst1q_s32_x2): Likewise.
3785 (vst1q_s64_x2): Likewise.
3786 (vst1q_u8_x2): Likewise.
3787 (vst1q_u16_x2): Likewise.
3788 (vst1q_u32_x2): Likewise.
3789 (vst1q_u64_x2): Likewise.
3790 (vst1q_f16_x2): Likewise.
3791 (vst1q_f32_x2): Likewise.
3792 (vst1q_f64_x2): Likewise.
3793 (vst1q_p64_x2): Likewise.
3794 (vst1_s64_x3): Likewise.
3795 (vst1_u64_x3): Likewise.
3796 (vst1_f64_x3): Likewise.
3797 (vst1_s8_x3): Likewise.
3798 (vst1_p8_x3): Likewise.
3799 (vst1_s16_x3): Likewise.
3800 (vst1_p16_x3): Likewise.
3801 (vst1_s32_x3): Likewise.
3802 (vst1_u8_x3): Likewise.
3803 (vst1_u16_x3): Likewise.
3804 (vst1_u32_x3): Likewise.
3805 (vst1_f16_x3): Likewise.
3806 (vst1_f32_x3): Likewise.
3807 (vst1_p64_x3): Likewise.
3808 (vst1q_s8_x3): Likewise.
3809 (vst1q_p8_x3): Likewise.
3810 (vst1q_s16_x3): Likewise.
3811 (vst1q_p16_x3): Likewise.
3812 (vst1q_s32_x3): Likewise.
3813 (vst1q_s64_x3): Likewise.
3814 (vst1q_u8_x3): Likewise.
3815 (vst1q_u16_x3): Likewise.
3816 (vst1q_u32_x3): Likewise.
3817 (vst1q_u64_x3): Likewise.
3818 (vst1q_f16_x3): Likewise.
3819 (vst1q_f32_x3): Likewise.
3820 (vst1q_f64_x3): Likewise.
3821 (vst1q_p64_x3): Likewise.
3822 (vst1_s8_x4): Likewise.
3823 (vst1q_s8_x4): Likewise.
3824 (vst1_s16_x4): Likewise.
3825 (vst1q_s16_x4): Likewise.
3826 (vst1_s32_x4): Likewise.
3827 (vst1q_s32_x4): Likewise.
3828 (vst1_u8_x4): Likewise.
3829 (vst1q_u8_x4): Likewise.
3830 (vst1_u16_x4): Likewise.
3831 (vst1q_u16_x4): Likewise.
3832 (vst1_u32_x4): Likewise.
3833 (vst1q_u32_x4): Likewise.
3834 (vst1_f16_x4): Likewise.
3835 (vst1q_f16_x4): Likewise.
3836 (vst1_f32_x4): Likewise.
3837 (vst1q_f32_x4): Likewise.
3838 (vst1_p8_x4): Likewise.
3839 (vst1q_p8_x4): Likewise.
3840 (vst1_p16_x4): Likewise.
3841 (vst1q_p16_x4): Likewise.
3842 (vst1_s64_x4): Likewise.
3843 (vst1_u64_x4): Likewise.
3844 (vst1_p64_x4): Likewise.
3845 (vst1q_s64_x4): Likewise.
3846 (vst1q_u64_x4): Likewise.
3847 (vst1q_p64_x4): Likewise.
3848 (vst1_f64_x4): Likewise.
3849 (vst1q_f64_x4): Likewise.
3850 (vst2_s64): Likewise.
3851 (vst2_u64): Likewise.
3852 (vst2_f64): Likewise.
3853 (vst2_s8): Likewise.
3854 (vst2_p8): Likewise.
3855 (vst2_s16): Likewise.
3856 (vst2_p16): Likewise.
3857 (vst2_s32): Likewise.
3858 (vst2_u8): Likewise.
3859 (vst2_u16): Likewise.
3860 (vst2_u32): Likewise.
3861 (vst2_f16): Likewise.
3862 (vst2_f32): Likewise.
3863 (vst2_p64): Likewise.
3864 (vst2q_s8): Likewise.
3865 (vst2q_p8): Likewise.
3866 (vst2q_s16): Likewise.
3867 (vst2q_p16): Likewise.
3868 (vst2q_s32): Likewise.
3869 (vst2q_s64): Likewise.
3870 (vst2q_u8): Likewise.
3871 (vst2q_u16): Likewise.
3872 (vst2q_u32): Likewise.
3873 (vst2q_u64): Likewise.
3874 (vst2q_f16): Likewise.
3875 (vst2q_f32): Likewise.
3876 (vst2q_f64): Likewise.
3877 (vst2q_p64): Likewise.
3878 (vst3_s64): Likewise.
3879 (vst3_u64): Likewise.
3880 (vst3_f64): Likewise.
3881 (vst3_s8): Likewise.
3882 (vst3_p8): Likewise.
3883 (vst3_s16): Likewise.
3884 (vst3_p16): Likewise.
3885 (vst3_s32): Likewise.
3886 (vst3_u8): Likewise.
3887 (vst3_u16): Likewise.
3888 (vst3_u32): Likewise.
3889 (vst3_f16): Likewise.
3890 (vst3_f32): Likewise.
3891 (vst3_p64): Likewise.
3892 (vst3q_s8): Likewise.
3893 (vst3q_p8): Likewise.
3894 (vst3q_s16): Likewise.
3895 (vst3q_p16): Likewise.
3896 (vst3q_s32): Likewise.
3897 (vst3q_s64): Likewise.
3898 (vst3q_u8): Likewise.
3899 (vst3q_u16): Likewise.
3900 (vst3q_u32): Likewise.
3901 (vst3q_u64): Likewise.
3902 (vst3q_f16): Likewise.
3903 (vst3q_f32): Likewise.
3904 (vst3q_f64): Likewise.
3905 (vst3q_p64): Likewise.
3906 (vst4_s64): Likewise.
3907 (vst4_u64): Likewise.
3908 (vst4_f64): Likewise.
3909 (vst4_s8): Likewise.
3910 (vst4_p8): Likewise.
3911 (vst4_s16): Likewise.
3912 (vst4_p16): Likewise.
3913 (vst4_s32): Likewise.
3914 (vst4_u8): Likewise.
3915 (vst4_u16): Likewise.
3916 (vst4_u32): Likewise.
3917 (vst4_f16): Likewise.
3918 (vst4_f32): Likewise.
3919 (vst4_p64): Likewise.
3920 (vst4q_s8): Likewise.
3921 (vst4q_p8): Likewise.
3922 (vst4q_s16): Likewise.
3923 (vst4q_p16): Likewise.
3924 (vst4q_s32): Likewise.
3925 (vst4q_s64): Likewise.
3926 (vst4q_u8): Likewise.
3927 (vst4q_u16): Likewise.
3928 (vst4q_u32): Likewise.
3929 (vst4q_u64): Likewise.
3930 (vst4q_f16): Likewise.
3931 (vst4q_f32): Likewise.
3932 (vst4q_f64): Likewise.
3933 (vst4q_p64): Likewise.
3934 (vtbx4_s8): Likewise.
3935 (vtbx4_u8): Likewise.
3936 (vtbx4_p8): Likewise.
3937 (vld1_bf16_x2): Likewise.
3938 (vld1q_bf16_x2): Likewise.
3939 (vld1_bf16_x3): Likewise.
3940 (vld1q_bf16_x3): Likewise.
3941 (vld1_bf16_x4): Likewise.
3942 (vld1q_bf16_x4): Likewise.
3943 (vld2_bf16): Likewise.
3944 (vld2q_bf16): Likewise.
3945 (vld2_dup_bf16): Likewise.
3946 (vld2q_dup_bf16): Likewise.
3947 (vld3_bf16): Likewise.
3948 (vld3q_bf16): Likewise.
3949 (vld3_dup_bf16): Likewise.
3950 (vld3q_dup_bf16): Likewise.
3951 (vld4_bf16): Likewise.
3952 (vld4q_bf16): Likewise.
3953 (vld4_dup_bf16): Likewise.
3954 (vld4q_dup_bf16): Likewise.
3955 (vst1_bf16_x2): Likewise.
3956 (vst1q_bf16_x2): Likewise.
3957 (vst1_bf16_x3): Likewise.
3958 (vst1q_bf16_x3): Likewise.
3959 (vst1_bf16_x4): Likewise.
3960 (vst1q_bf16_x4): Likewise.
3961 (vst2_bf16): Likewise.
3962 (vst2q_bf16): Likewise.
3963 (vst3_bf16): Likewise.
3964 (vst3q_bf16): Likewise.
3965 (vst4_bf16): Likewise.
3966 (vst4q_bf16): Likewise.
3967 (vld2_lane_bf16): Likewise.
3968 (vld2q_lane_bf16): Likewise.
3969 (vld3_lane_bf16): Likewise.
3970 (vld3q_lane_bf16): Likewise.
3971 (vld4_lane_bf16): Likewise.
3972 (vld4q_lane_bf16): Likewise.
3973 (vst2_lane_bf16): Likewise.
3974 (vst2q_lane_bf16): Likewise.
3975 (vst3_lane_bf16): Likewise.
3976 (vst3q_lane_bf16): Likewise.
3977 (vst4_lane_bf16): Likewise.
3978 (vst4q_lane_bf16): Likewise.
3979 * config/aarch64/geniterators.sh: Modify iterator regex to
3980 match new vector-tuple modes.
3981 * config/aarch64/iterators.md (insn_count): Extend mode
3982 attribute with vector-tuple type information.
3984 (Vendreg): Likewise.
3987 (VSTRUCT_2D): New mode iterator.
3988 (VSTRUCT_2DNX): Likewise.
3989 (VSTRUCT_2DX): Likewise.
3990 (VSTRUCT_2Q): Likewise.
3991 (VSTRUCT_2QD): Likewise.
3992 (VSTRUCT_3D): Likewise.
3993 (VSTRUCT_3DNX): Likewise.
3994 (VSTRUCT_3DX): Likewise.
3995 (VSTRUCT_3Q): Likewise.
3996 (VSTRUCT_3QD): Likewise.
3997 (VSTRUCT_4D): Likewise.
3998 (VSTRUCT_4DNX): Likewise.
3999 (VSTRUCT_4DX): Likewise.
4000 (VSTRUCT_4Q): Likewise.
4001 (VSTRUCT_4QD): Likewise.
4002 (VSTRUCT_D): Likewise.
4003 (VSTRUCT_Q): Likewise.
4004 (VSTRUCT_QD): Likewise.
4005 (VSTRUCT_ELT): New mode attribute.
4006 (vstruct_elt): Likewise.
4007 * genmodes.c (VECTOR_MODE): Add default prefix and order
4009 (VECTOR_MODE_WITH_PREFIX): Define.
4010 (make_vector_mode): Add mode prefix and order parameters.
4012 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4014 * expmed.c (extract_bit_field_1): Ensure modes are tieable.
4016 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4018 * expr.c (emit_group_load_1): Remove historic workaround.
4020 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4022 * config/aarch64/aarch64-builtins.c (aarch64_init_simd_builtins):
4023 Factor out main loop to...
4024 (aarch64_init_simd_builtin_functions): This new function.
4025 (register_tuple_type): Define.
4026 (aarch64_scalar_builtin_type_p): Define.
4027 (handle_arm_neon_h): Define.
4028 * config/aarch64/aarch64-c.c (aarch64_pragma_aarch64): Handle
4029 pragma for arm_neon.h.
4030 * config/aarch64/aarch64-protos.h (aarch64_advsimd_struct_mode_p):
4032 (handle_arm_neon_h): Likewise.
4033 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
4034 Remove static modifier.
4035 * config/aarch64/arm_neon.h (target): Remove Neon vector
4036 structure type definitions.
4038 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
4040 PR tree-optimization/102943
4041 * gimple-range-path.cc (path_range_query::range_on_path_entry):
4042 Prefer range_of_expr unless there are no statements in the BB.
4044 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
4046 PR tree-optimization/102943
4047 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
4048 Avoid duplicate calculation of paths.
4050 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
4052 PR tree-optimization/102943
4053 * gimple-range-path.cc (path_range_query::compute_phi_relations):
4054 Only compute relations for SSA names in the import list.
4055 (path_range_query::compute_outgoing_relations): Same.
4056 * gimple-range-path.h (path_range_query::import_p): New.
4058 2021-11-04 Richard Biener <rguenther@suse.de>
4060 PR rtl-optimization/103075
4061 * simplify-rtx.c (exact_int_to_float_conversion_p): Return
4062 false for a VOIDmode operand.
4064 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
4066 * config/aarch64/aarch64.c (aarch64_vector_costs): Make member
4067 variables private and add "m_" to their names. Remove is_loop.
4068 (aarch64_record_potential_advsimd_unrolling): Replace with...
4069 (aarch64_vector_costs::record_potential_advsimd_unrolling): ...this.
4070 (aarch64_analyze_loop_vinfo): Replace with...
4071 (aarch64_vector_costs::analyze_loop_vinfo): ...this.
4072 Move initialization of (m_)vec_flags to add_stmt_cost.
4073 (aarch64_analyze_bb_vinfo): Delete.
4074 (aarch64_count_ops): Replace with...
4075 (aarch64_vector_costs::count_ops): ...this.
4076 (aarch64_vector_costs::add_stmt_cost): Set m_vec_flags,
4077 using m_costing_for_scalar to test whether we're costing
4078 scalar or vector code.
4079 (aarch64_adjust_body_cost_sve): Replace with...
4080 (aarch64_vector_costs::adjust_body_cost_sve): ...this.
4081 (aarch64_adjust_body_cost): Replace with...
4082 (aarch64_vector_costs::adjust_body_cost): ...this.
4083 (aarch64_vector_costs::finish_cost): Use m_vinfo instead of is_loop.
4085 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
4087 * target.def (targetm.vectorize.init_cost): Replace with...
4088 (targetm.vectorize.create_costs): ...this.
4089 (targetm.vectorize.add_stmt_cost): Delete.
4090 (targetm.vectorize.finish_cost): Likewise.
4091 (targetm.vectorize.destroy_cost_data): Likewise.
4092 * doc/tm.texi.in (TARGET_VECTORIZE_INIT_COST): Replace with...
4093 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
4094 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
4095 (TARGET_VECTORIZE_FINISH_COST): Likewise.
4096 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
4097 * doc/tm.texi: Regenerate.
4098 * tree-vectorizer.h (vec_info::vec_info): Remove target_cost_data
4100 (vec_info::target_cost_data): Change from a void * to a vector_costs *.
4101 (vector_costs): New class.
4102 (init_cost): Take a vec_info and return a vector_costs.
4103 (dump_stmt_cost): Remove data parameter.
4104 (add_stmt_cost): Replace vinfo and data parameters with a vector_costs.
4105 (add_stmt_costs): Likewise.
4106 (finish_cost): Replace data parameter with a vector_costs.
4107 (destroy_cost_data): Delete.
4108 * tree-vectorizer.c (dump_stmt_cost): Remove data argument and
4110 (vec_info::vec_info): Remove the target_cost_data parameter and
4111 initialize the member variable to null instead.
4112 (vec_info::~vec_info): Delete target_cost_data instead of calling
4114 (vector_costs::add_stmt_cost): New function.
4115 (vector_costs::finish_cost): Likewise.
4116 (vector_costs::record_stmt_cost): Likewise.
4117 (vector_costs::adjust_cost_for_freq): Likewise.
4118 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
4119 call to vec_info::vec_info.
4120 (vect_compute_single_scalar_iteration_cost): Update after above
4121 changes to costing interface.
4122 (vect_analyze_loop_operations): Likewise.
4123 (vect_estimate_min_profitable_iters): Likewise.
4124 (vect_analyze_loop_2): Initialize LOOP_VINFO_TARGET_COST_DATA
4125 at the start_over point, where it needs to be recreated after
4126 trying without slp. Update retry code accordingly.
4127 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Update call
4128 to vec_info::vec_info.
4129 (vect_slp_analyze_operation): Update after above changes to costing
4131 (vect_bb_vectorization_profitable_p): Likewise.
4132 * targhooks.h (default_init_cost): Replace with...
4133 (default_vectorize_create_costs): ...this.
4134 (default_add_stmt_cost): Delete.
4135 (default_finish_cost, default_destroy_cost_data): Likewise.
4136 * targhooks.c (default_init_cost): Replace with...
4137 (default_vectorize_create_costs): ...this.
4138 (default_add_stmt_cost): Delete, moving logic to vector_costs instead.
4139 (default_finish_cost, default_destroy_cost_data): Delete.
4140 * config/aarch64/aarch64.c (aarch64_vector_costs): Inherit from
4141 vector_costs. Add a constructor.
4142 (aarch64_init_cost): Replace with...
4143 (aarch64_vectorize_create_costs): ...this.
4144 (aarch64_add_stmt_cost): Replace with...
4145 (aarch64_vector_costs::add_stmt_cost): ...this. Use record_stmt_cost
4146 to adjust the cost for inner loops.
4147 (aarch64_finish_cost): Replace with...
4148 (aarch64_vector_costs::finish_cost): ...this.
4149 (aarch64_destroy_cost_data): Delete.
4150 (TARGET_VECTORIZE_INIT_COST): Replace with...
4151 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
4152 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
4153 (TARGET_VECTORIZE_FINISH_COST): Likewise.
4154 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
4155 * config/i386/i386.c (ix86_vector_costs): New structure.
4156 (ix86_init_cost): Replace with...
4157 (ix86_vectorize_create_costs): ...this.
4158 (ix86_add_stmt_cost): Replace with...
4159 (ix86_vector_costs::add_stmt_cost): ...this. Use adjust_cost_for_freq
4160 to adjust the cost for inner loops.
4161 (ix86_finish_cost, ix86_destroy_cost_data): Delete.
4162 (TARGET_VECTORIZE_INIT_COST): Replace with...
4163 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
4164 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
4165 (TARGET_VECTORIZE_FINISH_COST): Likewise.
4166 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
4167 * config/rs6000/rs6000.c (TARGET_VECTORIZE_INIT_COST): Replace with...
4168 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
4169 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
4170 (TARGET_VECTORIZE_FINISH_COST): Likewise.
4171 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
4172 (rs6000_cost_data): Inherit from vector_costs.
4173 Add a constructor. Drop loop_info, cost and costing_for_scalar
4174 in favor of the corresponding vector_costs member variables.
4175 Add "m_" to the names of the remaining member variables and
4177 (rs6000_density_test): Replace with...
4178 (rs6000_cost_data::density_test): ...this.
4179 (rs6000_init_cost): Replace with...
4180 (rs6000_vectorize_create_costs): ...this.
4181 (rs6000_update_target_cost_per_stmt): Replace with...
4182 (rs6000_cost_data::update_target_cost_per_stmt): ...this.
4183 (rs6000_add_stmt_cost): Replace with...
4184 (rs6000_cost_data::add_stmt_cost): ...this. Use adjust_cost_for_freq
4185 to adjust the cost for inner loops.
4186 (rs6000_adjust_vect_cost_per_loop): Replace with...
4187 (rs6000_cost_data::adjust_vect_cost_per_loop): ...this.
4188 (rs6000_finish_cost): Replace with...
4189 (rs6000_cost_data::finish_cost): ...this. Group loop code
4190 into a single if statement and pass the loop_vinfo down to
4192 (rs6000_destroy_cost_data): Delete.
4194 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
4196 PR tree-optimization/103062
4197 PR tree-optimization/103062
4198 * value-pointer-equiv.cc (ssa_equiv_stack::ssa_equiv_stack):
4199 Increase size of allocation by 1.
4200 (ssa_equiv_stack::push_replacement): Grow as needed.
4201 (ssa_equiv_stack::get_replacement): Same.
4202 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
4203 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Remove delete.
4204 (pointer_equiv_analyzer::set_global_equiv): Grow as needed.
4205 (pointer_equiv_analyzer::get_equiv): Same.
4206 (pointer_equiv_analyzer::get_equiv_expr): Remove const.
4207 * value-pointer-equiv.h (class pointer_equiv_analyzer): Remove
4208 const markers. Use auto_vec instead of tree *.
4210 2021-11-04 Richard Biener <rguenther@suse.de>
4212 * tree-ssa-sccvn.c (vn_nary_op_insert_into): Remove always
4213 true parameter and inline valueization.
4214 (vn_nary_op_lookup_1): Inline valueization from ...
4215 (vn_nary_op_compute_hash): ... here and remove it here.
4216 * tree-ssa-pre.c (phi_translate_1): Do not valueize
4217 before vn_nary_lookup_pieces.
4218 (get_representative_for): Mark created SSA representatives
4221 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
4223 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): Assert
4224 that the operand has a vector mode. Use subreg_lowpart_offset
4225 to test whether an index corresponds to the low part.
4227 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
4229 * read-rtl.c: Remove dead !GENERATOR_FILE block.
4230 * read-rtl-function.c (function_reader::consolidate_singletons):
4231 Generate canonical CONST_VECTORs.
4233 2021-11-04 liuhongt <hongtao.liu@intel.com>
4236 * config/i386/predicates.md (reg_or_notreg_operand): Rename to ..
4237 (regmem_or_bitnot_regmem_operand): .. and extend to handle
4239 * config/i386/sse.md (*<avx512>_vpternlog<mode>_1): Force_reg
4240 the operands which are required to be register_operand.
4241 (*<avx512>_vpternlog<mode>_2): Ditto.
4242 (*<avx512>_vpternlog<mode>_3): Ditto.
4243 (*<avx512>_vternlog<mode>_all): Disallow embeded broadcast for
4244 vector HFmodes since it's not a real AVX512FP16 instruction.
4246 2021-11-04 liuhongt <hongtao.liu@intel.com>
4249 * match.pd: simplify (trunc)copysign((extend)a, (extend)b) to
4250 .COPYSIGN (a,b) when a and b are same type as the truncation
4251 type and has less precision than extend type.
4253 2021-11-04 Richard Biener <rguenther@suse.de>
4255 * doc/generic.texi: Update TARGET_MEM_REF and MEM_REF
4258 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
4260 * config/i386/sse.md (VI2_AVX512VNNIBW): New mode iterator.
4261 (VI1_AVX512VNNI): Likewise.
4262 (SDOT_VPDP_SUF): New mode_attr.
4265 (sdot_prod<mode>): Use VI2_AVX512F iterator, expand to
4266 vpdpwssd when VNNI targets available.
4267 (usdot_prod<mode>): New expander for vector QImode.
4269 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
4271 * config/i386/amxtileintrin.h (_tile_loadd_internal): Add
4272 parentheses to base and stride.
4273 (_tile_stream_loadd_internal): Likewise.
4274 (_tile_stored_internal): Likewise.
4276 2021-11-03 Maciej W. Rozycki <macro@embecosm.com>
4278 * config/riscv/riscv.c (riscv_class_max_nregs): Swap the
4279 arguments to `reg_class_subset_p'.
4281 2021-11-03 Joseph Myers <joseph@codesourcery.com>
4284 * fold-const.c (fold_init): New function.
4285 * fold-const.h (fold_init): New prototype.
4287 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
4289 * params.opt (param_vrp2_mode): Make ranger the default for VRP2.
4291 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
4293 * gimple-range-fold.cc (fold_using_range::range_of_phi): Don't import
4294 a range from edge if arg == phidef.
4296 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
4298 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Test
4299 for constant before any other processing.
4301 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
4303 * flag-types.h (RANGER_DEBUG_ALL): Fix values.
4305 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
4307 * gimple-range.cc (gimple_ranger::gimple_ranger): Initialize current_bb.
4308 (gimple_ranger::range_of_expr): Pick up range_on_entry when there is
4309 no explcit context and current_bb is set.
4310 (gimple_ranger::fold_stmt): New.
4311 * gimple-range.h (current_bb, fold_stmt): New.
4312 * tree-vrp.c (rvrp_folder::fold_stmt): Call ranger's fold_stmt.
4314 2021-11-03 Richard Biener <rguenther@suse.de>
4316 PR tree-optimization/102970
4317 * tree-ssa-pre.c (phi_translate_1): Drop clique and base
4318 when translating a MEM_REF over a backedge.
4320 2021-11-03 Philipp Tomsich <philipp.tomsich@vrull.eu>
4322 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New Ampere-1 core.
4323 * config/aarch64/aarch64-tune.md: Regenerate.
4324 * config/aarch64/aarch64-cost-tables.h: Add extra costs for Ampere-1.
4325 * config/aarch64/aarch64.c: Add tuning structures for Ampere-1.
4326 * doc/invoke.texi: Add documentation for Ampere-1 core.
4328 2021-11-03 Wilco Dijkstra <wdijkstr@arm.com>
4330 * config/aarch64/aarch64.md (movsi): Add alternative for GOT accesses.
4332 (ldr_got_small_<mode>): Remove pattern.
4333 (ldr_got_small_sidi): Likewise.
4334 * config/aarch64/aarch64.c (aarch64_load_symref_appropriately): Keep
4335 GOT accesses as moves.
4336 (aarch64_print_operand): Correctly print got_lo12 in L specifier.
4337 (aarch64_mov_operand_p): Make GOT accesses valid move operands.
4338 * config/aarch64/constraints.md: Add new constraint Usw for GOT access.
4340 2021-11-03 Martin Liska <mliska@suse.cz>
4342 * gcov.c (read_line): Remove dead variable.
4344 2021-11-03 Martin Liska <mliska@suse.cz>
4347 * ipa-fnsummary.c (edge_predicate_pool): Rename predicate class to ipa_predicate.
4348 (ipa_fn_summary::account_size_time): Likewise.
4349 (edge_set_predicate): Likewise.
4350 (set_hint_predicate): Likewise.
4351 (add_freqcounting_predicate): Likewise.
4352 (evaluate_conditions_for_known_args): Likewise.
4353 (evaluate_properties_for_edge): Likewise.
4354 (remap_freqcounting_preds_after_dup): Likewise.
4355 (ipa_fn_summary_t::duplicate): Likewise.
4356 (set_cond_stmt_execution_predicate): Likewise.
4357 (set_switch_stmt_execution_predicate): Likewise.
4358 (compute_bb_predicates): Likewise.
4359 (will_be_nonconstant_expr_predicate): Likewise.
4360 (will_be_nonconstant_predicate): Likewise.
4361 (phi_result_unknown_predicate): Likewise.
4362 (predicate_for_phi_result): Likewise.
4363 (analyze_function_body): Likewise.
4364 (compute_fn_summary): Likewise.
4365 (summarize_calls_size_and_time): Likewise.
4366 (estimate_calls_size_and_time): Likewise.
4367 (ipa_call_context::estimate_size_and_time): Likewise.
4368 (remap_edge_summaries): Likewise.
4369 (remap_freqcounting_predicate): Likewise.
4370 (ipa_merge_fn_summary_after_inlining): Likewise.
4371 (ipa_update_overall_fn_summary): Likewise.
4372 (read_ipa_call_summary): Likewise.
4373 (inline_read_section): Likewise.
4374 * ipa-fnsummary.h (struct ipa_freqcounting_predicate): Likewise.
4375 * ipa-predicate.c (predicate::add_clause): Likewise.
4376 (ipa_predicate::add_clause): Likewise.
4377 (predicate::or_with): Likewise.
4378 (ipa_predicate::or_with): Likewise.
4379 (predicate::evaluate): Likewise.
4380 (ipa_predicate::evaluate): Likewise.
4381 (predicate::probability): Likewise.
4382 (ipa_predicate::probability): Likewise.
4383 (dump_condition): Likewise.
4384 (dump_clause): Likewise.
4385 (predicate::dump): Likewise.
4386 (ipa_predicate::dump): Likewise.
4387 (predicate::debug): Likewise.
4388 (ipa_predicate::debug): Likewise.
4389 (predicate::remap_after_duplication): Likewise.
4390 (ipa_predicate::remap_after_duplication): Likewise.
4391 (predicate::remap_after_inlining): Likewise.
4392 (ipa_predicate::remap_after_inlining): Likewise.
4393 (predicate::stream_in): Likewise.
4394 (ipa_predicate::stream_in): Likewise.
4395 (predicate::stream_out): Likewise.
4396 (ipa_predicate::stream_out): Likewise.
4397 (add_condition): Likewise.
4398 * ipa-predicate.h (class predicate): Likewise.
4399 (class ipa_predicate): Likewise.
4400 (add_condition): Likewise.
4402 2021-11-03 Richard Biener <rguenther@suse.de>
4404 * bitmap.h (bitmap_bit_p): Change the return type to bool.
4405 * bitmap.c (bitmap_bit_p): Likewise.
4406 * sbitmap.h (bitmap_bit_p): Likewise.
4407 (bitmap_set_bit): Return whether the bit changed.
4408 (bitmap_clear_bit): Likewise.
4409 * tree-ssa.c (verify_vssa): Make use of the changed state
4410 from bitmap_set_bit.
4412 2021-11-03 Richard Biener <rguenther@suse.de>
4414 PR middle-end/103033
4415 * internal-fn.c (expand_DEFERRED_INIT): Elide the
4416 native_interpret_expr path in favor of folding the
4417 VIEW_CONVERT_EXPR generated when punning the RHS.
4419 2021-11-03 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
4421 * config/s390/s390.c (s390_loop_unroll_adjust): In case of early
4424 2021-11-03 Jan Hubicka <hubicka@ucw.cz>
4427 * ipa-modref.c (callee_to_caller_flags): New function.
4428 (modref_eaf_analysis::analyze_ssa_name): Use it.
4429 (ipa_merge_modref_summary_after_inlining): Fix whitespace.
4431 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
4433 * ipa-modref.c (modref_eaf_analysis::analyze_ssa_name): Revert
4436 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
4437 Uroš Bizjak <ubizjak@gmail.com>
4439 * config/i386/i386.md (<any_rotate>ti3): Provide expansion for
4440 rotations by non-constant amounts.
4442 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
4444 * ipa-modref.c: Fix anonymous namespace placement.
4445 (class modref_eaf_analysis): New class.
4446 (analyze_ssa_name_flags): Turn to ...
4447 (modref_eaf_analysis::analyze_ssa_name): ... this one.
4448 (merge_call_lhs_flags): Turn to ...
4449 (modref_eaf_analysis::merge_call_lhs_flags): .. this one
4450 (modref_eaf_analysis::merge_with_ssa_name): New member function.
4451 (record_escape_points): Turn to ...
4452 (modref_eaf_analysis::record_escape_points): ... this one.
4453 (analyze_parms): Updat
4454 (ipa_merge_modref_summary_after_inlining): Move to the end of file.
4456 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
4458 * gimple.c (gimple_call_static_chain_flags): New function.
4459 * gimple.h (gimple_call_static_chain_flags): Declare
4460 * ipa-modref.c (modref_summary::modref_summary): Initialize
4462 (modref_summary_lto::modref_summary_lto): Likewise.
4463 (modref_summary::useful_p): Test static_chain_flags.
4464 (modref_summary_lto::useful_p): Likewise.
4465 (struct modref_summary_lto): Add static_chain_flags.
4466 (modref_summary::dump): Dump static_chain_flags.
4467 (modref_summary_lto::dump): Likewise.
4468 (struct escape_point): Add static_cahin_arg.
4469 (analyze_ssa_name_flags): Use gimple_call_static_chain_flags.
4470 (analyze_parms): Handle static chains.
4471 (modref_summaries::duplicate): Duplicate static_chain_flags.
4472 (modref_summaries_lto::duplicate): Likewise.
4473 (modref_write): Stream static_chain_flags.
4474 (read_section): Likewise.
4475 (modref_merge_call_site_flags): Handle static_chain_flags.
4476 * ipa-modref.h (struct modref_summary): Add static_chain_flags.
4477 * tree-ssa-structalias.c (handle_rhs_call): Use
4478 gimple_static_chain_flags.
4480 2021-11-02 Richard Biener <rguenther@suse.de>
4482 PR tree-optimization/103029
4483 * tree-vect-loop-manip.c (vect_loop_versioning): Ensure
4484 the PHI nodes in the loop maintain their original operand
4487 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
4489 * tree-core.h (EAF_NOT_RETURNED_DIRECTLY): New flag.
4490 (EAF_NOREAD): Renumber.
4491 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOT_RETURNED_DIRECTLY.
4492 (remove_useless_eaf_flags): Handle EAF_NOT_RETURNED_DIRECTLY
4493 (deref_flags): Likewise.
4494 (modref_lattice::init): Likewise.
4495 (modref_lattice::merge): Likewise.
4496 (merge_call_lhs_flags): Likewise.
4497 (analyze_ssa_name_flags): Likewise.
4498 (modref_merge_call_site_flags): Likewise.
4499 * tree-ssa-structalias.c (handle_call_arg): Likewise.
4501 2021-11-02 Maciej W. Rozycki <macro@embecosm.com>
4503 * config/riscv/riscv.c (riscv_rtx_costs): Correct a CONST_INT_P
4504 check and remove an unused local variable with shNadd/shNadd.uw
4507 2021-11-02 Tamar Christina <tamar.christina@arm.com>
4509 PR tree-optimization/103007
4510 * tree-vect-slp-patterns.c (complex_fms_pattern::matches): Add elem
4513 2021-11-02 Richard Biener <rguenther@suse.de>
4515 PR middle-end/103038
4516 * fold-const.c (native_interpret_expr): Handle OFFSET_TYPE.
4517 (can_native_interpret_type_p): Likewise.
4518 * internal-fn.c (expand_DEFERRED_INIT): View-convert the
4519 RHS if the LHS is an SSA name.
4521 2021-11-02 Richard Sandiford <richard.sandiford@arm.com>
4523 * langhooks.h (lang_hooks_for_types::simulate_record_decl): New hook.
4524 * langhooks-def.h (lhd_simulate_record_decl): Declare.
4525 (LANG_HOOKS_SIMULATE_RECORD_DECL): Define.
4526 (LANG_HOOKS_FOR_TYPES_INITIALIZER): Include it.
4527 * langhooks.c (lhd_simulate_record_decl): New function.
4529 2021-11-02 Jakub Jelinek <jakub@redhat.com>
4532 * config/i386/i386.c (ix86_vector_mode_supported_p): Reject vector
4533 modes with TImode inner mode if 32-bit.
4535 2021-11-02 liuhongt <hongtao.liu@intel.com>
4537 * doc/sourcebuild.texi (vect_slp_v4qi_store_unalign,
4538 vect_slp_v2hi_store_unalign, vect_slp_v4hi_store_unalign,
4539 vect_slp_v4si_store_unalign): Document efficient target.
4540 (vect_slp_v4qi_store_unalign_1, vect_slp_v8qi_store_unalign_1,
4541 vect_slp_v16qi_store_unalign_1): Ditto.
4542 (vect_slp_v2hi_store_align,vect_slp_v2qi_store_align,
4543 vect_slp_v2si_store_align, vect_slp_v4qi_store_align): Ditto.
4544 (struct_4char_block_move, struct_8char_block_move,
4545 struct_16char_block_move): Ditto.
4547 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
4548 Jakub Jelinek <jakub@redhat.com>
4551 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti,
4552 ix86_expand_ti_to_v1ti): New helper functions.
4553 (ix86_expand_v1ti_shift): Check if the amount operand is an
4554 integer constant, and expand as a TImode shift if it isn't.
4555 (ix86_expand_v1ti_rotate): Check if the amount operand is an
4556 integer constant, and expand as a TImode rotate if it isn't.
4557 (ix86_expand_v1ti_ashiftrt): New function to expand arithmetic
4558 right shifts of V1TImode quantities.
4559 * config/i386/i386-protos.h (ix86_expand_v1ti_ashift): Prototype.
4560 * config/i386/sse.md (ashlv1ti3, lshrv1ti3): Change constraints
4561 to QImode general_operand, and let the helper functions lower
4562 shifts by non-constant operands, as TImode shifts. Make
4563 conditional on TARGET_64BIT.
4564 (ashrv1ti3): New expander calling ix86_expand_v1ti_ashiftrt.
4565 (rotlv1ti3, rotrv1ti3): Change shift operand to QImode.
4566 Make conditional on TARGET_64BIT.
4568 2021-11-02 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
4570 * config/s390/s390.md ("*cc_to_int", "tabort", "*tabort_1",
4571 "*tabort_1_plus"): Remove operands non-null check.
4573 2021-11-01 Martin Liska <mliska@suse.cz>
4575 * opt-functions.awk: Add new sanity checking.
4576 * optc-gen.awk: Add new argument to integer_range_info.
4577 * params.opt: Update 2 params which have negative IntegerRange.
4579 2021-11-01 qing zhao <qing.zhao@oracle.com>
4581 * gimplify.c (gimplify_decl_expr): Do not add call to
4582 __builtin_clear_padding when a variable is a gimple register
4583 or it might not have padding.
4584 (gimplify_init_constructor): Likewise.
4586 2021-11-01 Tamar Christina <tamar.christina@arm.com>
4588 * config/arm/aarch-common-protos.h (struct vector_cost_table): Add
4589 movi, dup and extract costing fields.
4590 * config/aarch64/aarch64-cost-tables.h (qdf24xx_extra_costs,
4591 thunderx_extra_costs, thunderx2t99_extra_costs,
4592 thunderx3t110_extra_costs, tsv110_extra_costs, a64fx_extra_costs): Use
4594 * config/arm/aarch-cost-tables.h (generic_extra_costs,
4595 cortexa53_extra_costs, cortexa57_extra_costs, cortexa76_extra_costs,
4596 exynosm1_extra_costs, xgene1_extra_costs): Likewise
4597 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>): Add r->w dup.
4598 * config/aarch64/aarch64.c (aarch64_rtx_costs): Add extra costs.
4600 2021-11-01 Tamar Christina <tamar.christina@arm.com>
4602 * cse.c (add_to_set): New.
4603 (find_sets_in_insn): Register constants in sets.
4604 (canonicalize_insn): Use auto_vec instead.
4605 (cse_insn): Try materializing using vec_dup.
4606 * rtl.h (simplify_context::simplify_gen_vec_select,
4607 simplify_gen_vec_select): New.
4608 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): New.
4610 2021-11-01 David Malcolm <dmalcolm@redhat.com>
4612 * common.opt (fdiagnostics-escape-format=): New.
4613 (diagnostics_escape_format): New enum.
4614 (DIAGNOSTICS_ESCAPE_FORMAT_UNICODE): New enum value.
4615 (DIAGNOSTICS_ESCAPE_FORMAT_BYTES): Likewise.
4616 * diagnostic-format-json.cc (json_end_diagnostic): Add
4617 "escape-source" attribute.
4618 * diagnostic-show-locus.c
4619 (exploc_with_display_col::exploc_with_display_col): Replace
4620 "tabstop" param with a cpp_char_column_policy and add an "aspect"
4621 param. Use these to compute m_display_col accordingly.
4622 (struct char_display_policy): New struct.
4623 (layout::m_policy): New field.
4624 (layout::m_escape_on_output): New field.
4625 (def_policy): New function.
4626 (make_range): Update for changes to exploc_with_display_col ctor.
4627 (default_print_decoded_ch): New.
4628 (width_per_escaped_byte): New.
4629 (escape_as_bytes_width): New.
4630 (escape_as_bytes_print): New.
4631 (escape_as_unicode_width): New.
4632 (escape_as_unicode_print): New.
4634 (layout::layout): Initialize new fields. Update m_exploc ctor
4635 call for above change to ctor.
4636 (layout::maybe_add_location_range): Update for changes to
4637 exploc_with_display_col ctor.
4638 (layout::calculate_x_offset_display): Update for change to
4640 (layout::print_source_line): Pass policy
4641 to cpp_display_width_computation. Capture cpp_decoded_char when
4642 calling process_next_codepoint. Move printing of source code to
4643 m_policy.m_print_cb.
4644 (line_label::line_label): Pass in policy rather than context.
4645 (layout::print_any_labels): Update for change to line_label ctor.
4646 (get_affected_range): Pass in policy rather than context, updating
4647 calls to location_compute_display_column accordingly.
4648 (get_printed_columns): Likewise, also for cpp_display_width.
4649 (correction::correction): Pass in policy rather than tabstop.
4650 (correction::compute_display_cols): Pass m_policy rather than
4651 m_tabstop to cpp_display_width.
4652 (correction::m_tabstop): Replace with...
4653 (correction::m_policy): ...this.
4654 (line_corrections::line_corrections): Pass in policy rather than
4656 (line_corrections::m_context): Replace with...
4657 (line_corrections::m_policy): ...this.
4658 (line_corrections::add_hint): Update to use m_policy rather than
4660 (line_corrections::add_hint): Likewise.
4661 (layout::print_trailing_fixits): Likewise.
4662 (selftest::test_display_widths): New.
4663 (selftest::test_layout_x_offset_display_utf8): Update to use
4664 policy rather than tabstop.
4665 (selftest::test_one_liner_labels_utf8): Add test of escaping
4667 (selftest::test_diagnostic_show_locus_one_liner_utf8): Update to
4668 use policy rather than tabstop.
4669 (selftest::test_overlapped_fixit_printing): Likewise.
4670 (selftest::test_overlapped_fixit_printing_utf8): Likewise.
4671 (selftest::test_overlapped_fixit_printing_2): Likewise.
4672 (selftest::test_tab_expansion): Likewise.
4673 (selftest::test_escaping_bytes_1): New.
4674 (selftest::test_escaping_bytes_2): New.
4675 (selftest::diagnostic_show_locus_c_tests): Call the new tests.
4676 * diagnostic.c (diagnostic_initialize): Initialize
4677 context->escape_format.
4678 (convert_column_unit): Update to use default character width policy.
4679 (selftest::test_diagnostic_get_location_text): Likewise.
4680 * diagnostic.h (enum diagnostics_escape_format): New enum.
4681 (diagnostic_context::escape_format): New field.
4682 * doc/invoke.texi (-fdiagnostics-escape-format=): New option.
4683 (-fdiagnostics-format=): Add "escape-source" attribute to examples
4684 of JSON output, and document it.
4685 * input.c (location_compute_display_column): Pass in "policy"
4686 rather than "tabstop", passing to
4687 cpp_byte_column_to_display_column.
4688 (selftest::test_cpp_utf8): Update to use cpp_char_column_policy.
4689 * input.h (class cpp_char_column_policy): New forward decl.
4690 (location_compute_display_column): Pass in "policy" rather than
4692 * opts.c (common_handle_option): Handle
4693 OPT_fdiagnostics_escape_format_.
4694 * selftest.c (temp_source_file::temp_source_file): New ctor
4695 overload taking a size_t.
4696 * selftest.h (temp_source_file::temp_source_file): Likewise.
4698 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
4700 * dbgcnt.def: Add debug counter for back_thread[12] and
4701 back_threadfull[12].
4702 * passes.def: Pass "first" argument to each back threading pass.
4703 * tree-ssa-threadbackward.c (back_threader::back_threader): Add
4705 (back_threader::debug_counter): New.
4706 (back_threader::maybe_register_path): Call debug_counter.
4708 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
4710 * tree-ssa-threadbackward.c (BT_NONE): New.
4713 (back_threader::back_threader): Add flags.
4714 Move loop initialization here.
4715 (back_threader::~back_threader): New.
4716 (back_threader::find_taken_edge_switch): Change solver and ranger
4718 (back_threader::find_taken_edge_cond): Same.
4719 (back_threader::find_paths_to_names): Same.
4720 (back_threader::find_paths): Same.
4721 (back_threader::dump): Same.
4722 (try_thread_blocks): Merge into thread_blocks.
4723 (back_threader::thread_blocks): New.
4724 (do_early_thread_jumps): Merge into thread_blocks.
4725 (do_thread_jumps): Merge into thread_blocks.
4726 (back_threader::thread_through_all_blocks): Remove.
4728 2021-11-01 Andrew MacLeod <amacleod@redhat.com>
4730 PR tree-optimization/103003
4731 * value-relation.cc (dom_oracle::register_relation): If the 2
4732 ssa names are the same, don't register any relation.
4734 2021-11-01 Dan Li <ashimida@linux.alibaba.com>
4736 * config/aarch64/aarch64.c (aarch64_expand_epilogue): Remove
4737 redundant check for calls_eh_return.
4738 * config/aarch64/aarch64.md (*do_return): Likewise.
4740 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
4742 * cfghooks.c (cfg_hook_duplicate_loop_to_header_edge): Rename
4743 duplicate_loop_to_header_edge to
4744 duplicate_loop_body_to_header_edge.
4745 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
4746 * cfghooks.h (struct cfg_hooks): Likewise.
4747 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
4748 * cfgloopmanip.c (duplicate_loop_body_to_header_edge): Likewise.
4749 (clone_loop_to_header_edge): Likewise.
4750 * cfgloopmanip.h (duplicate_loop_body_to_header_edge): Likewise.
4751 * cfgrtl.c (struct cfg_hooks): Likewise.
4752 * doc/loop.texi: Likewise.
4753 * loop-unroll.c (unroll_loop_constant_iterations): Likewise.
4754 (unroll_loop_runtime_iterations): Likewise.
4755 (unroll_loop_stupid): Likewise.
4756 (apply_opt_in_copies): Likewise.
4757 * tree-cfg.c (struct cfg_hooks): Likewise.
4758 * tree-ssa-loop-ivcanon.c (try_unroll_loop_completely): Likewise.
4759 (try_peel_loop): Likewise.
4760 * tree-ssa-loop-manip.c (copy_phi_node_args): Likewise.
4761 (gimple_duplicate_loop_body_to_header_edge): Likewise.
4762 (tree_transform_and_unroll_loop): Likewise.
4763 * tree-ssa-loop-manip.h (gimple_duplicate_loop_body_to_header_edge):
4766 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
4768 * cfgloopmanip.c (loop_version): Refactor loopify to
4769 loop_version. Move condition generation after loopify.
4771 * cfgloopmanip.h (loopify): Delete.
4773 2021-10-31 Jan Hubicka <hubicka@ucw.cz>
4775 * ipa-fnsummary.c: Include tree-dfa.h.
4776 (points_to_local_or_readonly_memory_p): Return true on return
4778 * ipa-modref.c (analyze_ssa_name_flags): Fix handling of copy
4781 2021-10-30 Tobias Burnus <tobias@codesourcery.com>
4783 PR middle-end/102972
4784 * omp-low.c (omp_runtime_api_call): Use DECL_ASSEMBLER_NAME to get
4785 internal Fortran name; new permit_num_teams arg to permit
4786 omp_get_num_teams and omp_get_team_num.
4787 (scan_omp_1_stmt): Update call to it, add missing call for
4788 reverse offload, and check for strictly nested API calls in teams.
4790 2021-10-30 Jakub Jelinek <jakub@redhat.com>
4792 * gimplify.c (gimplify_omp_for): Diagnose threadprivate iterators.
4794 2021-10-29 David Malcolm <dmalcolm@redhat.com>
4796 * selftest.c (assert_streq): Add newlines when emitting non-equal
4799 2021-10-29 David Malcolm <dmalcolm@redhat.com>
4801 * Makefile.in: Fix syntax for reference to LIBDEPS in
4804 2021-10-29 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
4806 * doc/install.texi: Bump required minimum DejaGnu version.
4808 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
4809 Andrew MacLeod <amacleod@redhat.com>
4811 * value-relation.cc (path_oracle::killing_def): Add a
4812 self-equivalence so we don't look to the root oracle.
4814 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
4816 * passes.def: Replace the pass_thread_jumps before VRP* with
4817 pass_thread_jumps_full. Remove all pass_vrp_threader instances.
4818 * tree-ssa-threadbackward.c (pass_data_thread_jumps_full):
4819 Remove hyphen from "thread-full" name.
4821 2021-10-29 Andrew MacLeod <amacleod@redhat.com>
4823 PR tree-optimization/102983
4824 * gimple-range-cache.h (propagate_updated_value): Make public.
4825 * gimple-range.cc (gimple_ranger::range_of_stmt): Propagate exports
4826 when processing gcond stmts.
4828 2021-10-29 Jan Hubicka <hubicka@ucw.cz>
4830 * gimple.c (gimple_call_retslot_flags): New function.
4831 * gimple.h (gimple_call_retslot_flags): Declare.
4832 * ipa-modref.c: Include tree-cfg.h.
4833 (struct escape_entry): Turn parm_index to signed.
4834 (modref_summary_lto::modref_summary_lto): Add retslot_flags.
4835 (modref_summary::modref_summary): Initialize retslot_flags.
4836 (struct modref_summary_lto): Likewise.
4837 (modref_summary::useful_p): Check retslot_flags.
4838 (modref_summary_lto::useful_p): Likewise.
4839 (modref_summary::dump): Dump retslot_flags.
4840 (modref_summary_lto::dump): Likewise.
4841 (struct escape_point): Add hidden_args enum.
4842 (analyze_ssa_name_flags): Ignore return slot return;
4843 use gimple_call_retslot_flags.
4844 (record_escape_points): Break out from ...
4845 (analyze_parms): ... here; handle retslot_flags.
4846 (modref_summaries::duplicate): Duplicate retslot_flags.
4847 (modref_summaries_lto::duplicate): Likewise.
4848 (modref_write_escape_summary): Stream parm_index as signed.
4849 (modref_read_escape_summary): Likewise.
4850 (modref_write): Stream retslot_flags.
4851 (read_section): Likewise.
4852 (struct escape_map): Fix typo in comment.
4853 (update_escape_summary_1): Fix whitespace.
4854 (ipa_merge_modref_summary_after_inlining): Drop retslot_flags.
4855 (modref_merge_call_site_flags): Merge retslot_flags.
4856 * ipa-modref.h (struct modref_summary): Add retslot_flags.
4857 * tree-ssa-structalias.c (handle_rhs_call): Handle retslot_flags.
4859 2021-10-29 Tamar Christina <tamar.christina@arm.com>
4861 PR tree-optimization/102977
4862 * tree-vect-slp-patterns.c (vect_match_call_p): Remove.
4863 (vect_detect_pair_op): Add crosslane check.
4864 (vect_match_call_complex_mla): Remove.
4865 (class complex_mul_pattern): Update comment.
4866 (complex_mul_pattern::matches): Update detection.
4867 (class complex_fma_pattern): Remove.
4868 (complex_fma_pattern::matches): Remove.
4869 (complex_fma_pattern::recognize): Remove.
4870 (complex_fma_pattern::build): Remove.
4871 (class complex_fms_pattern): Update comment.
4872 (complex_fms_pattern::matches): Remove.
4873 (complex_operations_pattern::recognize): Remove complex_fma_pattern
4875 2021-10-29 Jakub Jelinek <jakub@redhat.com>
4877 * gimple-fold.c (gimple_fold_builtin_memset): Copy over location from
4880 2021-10-29 Xionghu Luo <luoxhu@linux.ibm.com>
4883 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const): Add
4884 patterns match and emit for VSX xxpermdi.
4886 2021-10-29 liuhongt <hongtao.liu@intel.com>
4889 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF): New
4891 (V16HF_FTYPE_V16HF): Ditto.
4892 (V32HF_FTYPE_V32HF): Ditto.
4893 (V8HF_FTYPE_V8HF_ROUND): Ditto.
4894 (V16HF_FTYPE_V16HF_ROUND): Ditto.
4895 (V32HF_FTYPE_V32HF_ROUND): Ditto.
4896 * config/i386/i386-builtin.def ( IX86_BUILTIN_FLOORPH,
4897 IX86_BUILTIN_CEILPH, IX86_BUILTIN_TRUNCPH,
4898 IX86_BUILTIN_FLOORPH256, IX86_BUILTIN_CEILPH256,
4899 IX86_BUILTIN_TRUNCPH256, IX86_BUILTIN_FLOORPH512,
4900 IX86_BUILTIN_CEILPH512, IX86_BUILTIN_TRUNCPH512): New builtin.
4901 * config/i386/i386-builtins.c
4902 (ix86_builtin_vectorized_function): Enable vectorization for
4903 HFmode FLOOR/CEIL/TRUNC operation.
4904 * config/i386/i386-expand.c (ix86_expand_args_builtin): Handle
4906 * config/i386/sse.md (rint<mode>2, nearbyint<mode>2): Extend
4909 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
4910 Andrew MacLeod <amacleod@redhat.com>
4912 * value-relation.cc (path_oracle::killing_def): Walk the
4913 equivalency list and remove SSA from any equivalencies.
4915 2021-10-28 Stafford Horne <shorne@gmail.com>
4917 * config/or1k/or1k.h (PROFILE_HOOK): Add return address argument
4920 2021-10-28 Jakub Jelinek <jakub@redhat.com>
4922 PR tree-optimization/102951
4923 * fold-const.h (address_compare): Declare.
4924 * fold-const.c (address_compare): New function.
4925 * match.pd (cmp (convert1?@2 addr@0) (convert2? addr@1)): Use
4926 address_compare helper.
4927 (minmax cmp (convert1?@2 addr@0) (convert2?@3 addr@1)): New
4930 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
4932 * vr-values.c (simplify_using_ranges::fold_cond): Change fold message.
4934 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
4936 PR tree-optimization/102940
4937 * tree-vrp.c (execute_ranger_vrp): Reset scev.
4939 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
4941 * config/nios2/linux.h (MUSL_DYNAMIC_LINKER): Add musl linker
4943 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
4945 * configure: Regenerate.
4946 * configure.ac: Use CPPFLAGS_FOR_BUILD for GMPINC
4948 2021-10-28 Eric Botcazou <ebotcazou@adacore.com>
4950 * doc/invoke.texi (%X): Remove obsolete reference to -Wl.
4952 2021-10-28 Richard Biener <rguenther@suse.de>
4955 * fold-const.c (fold_convert_const): Avoid int to float
4956 constant folding with -frounding-math and inexact result.
4957 * simplify-rtx.c (simplify_const_unary_operation): Likewise
4958 for both float and unsigned_float.
4960 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
4962 * tree-ssa-threadbackward.c
4963 (back_threader::find_taken_edge_switch): Use find_case_label_range
4964 instead of find_taken_edge.
4966 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
4968 * tree-ssa-threadbackward.c (class back_threader_registry):
4969 Inherit from back_jt_path_registry.
4970 (back_threader_registry::thread_through_all_blocks): Remove.
4971 (back_threader_registry::register_path): Remove
4972 m_lowlevel_registry prefix.
4974 2021-10-28 Richard Biener <rguenther@suse.de>
4977 * fold-const.c (fold_convert_const_real_from_real): Honor
4978 -frounding-math if the conversion is not exact.
4979 * simplify-rtx.c (simplify_const_unary_operation): Do not
4980 simplify FLOAT_TRUNCATE with sign dependent rounding.
4982 2021-10-28 Richard Biener <rguenther@suse.de>
4984 PR tree-optimization/102949
4985 * tree-vect-stmts.c (ensure_base_align): Look at the
4986 dr_info of a group leader and assert we are looking at
4987 one with analyzed alignment.
4989 2021-10-28 Kewen Lin <linkw@linux.ibm.com>
4992 * config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Consider
4993 V1T1 mode for unaligned load and store.
4995 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
4997 * config/riscv/riscv.md (zero_extendsidi2_internal): Allow ZBB
5000 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
5002 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add `i` to
5005 2021-10-28 Alexandre Oliva <oliva@adacore.com>
5007 * common.opt (fharden-compares): New.
5008 (fharden-conditional-branches): New.
5009 * doc/invoke.texi: Document new options.
5010 * gimple-harden-conditionals.cc: New.
5011 * Makefile.in (OBJS): Build it.
5012 * passes.def: Add new passes.
5013 * tree-pass.h (make_pass_harden_compares): Declare.
5014 (make_pass_harden_conditional_branches): Declare.
5016 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
5019 * config/rs6000/altivec.md: Add vsx register constraints.
5020 * config/rs6000/vsx.md (vsx_xxsel<mode>): Delete.
5021 (vsx_xxsel<mode>2): Likewise.
5022 (vsx_xxsel<mode>3): Likewise.
5023 (vsx_xxsel<mode>4): Likewise.
5025 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
5028 * config/rs6000/altivec.md (*altivec_vsel<mode>): Change to ...
5029 (altivec_vsel<mode>): ... this and update define.
5030 (*altivec_vsel<mode>_uns): Delete.
5031 (altivec_vsel<mode>2): New define_insn.
5032 (altivec_vsel<mode>3): Likewise.
5033 (altivec_vsel<mode>4): Likewise.
5034 * config/rs6000/rs6000-call.c (altivec_expand_vec_sel_builtin): New.
5035 (altivec_expand_builtin): Call altivec_expand_vec_sel_builtin to expand
5037 * config/rs6000/rs6000.c (rs6000_emit_vector_cond_expr): Use bit-wise
5038 selection instead of per element.
5039 * config/rs6000/vector.md:
5040 * config/rs6000/vsx.md (*vsx_xxsel<mode>): Change to ...
5041 (vsx_xxsel<mode>): ... this and update define.
5042 (*vsx_xxsel<mode>_uns): Delete.
5043 (vsx_xxsel<mode>2): New define_insn.
5044 (vsx_xxsel<mode>3): Likewise.
5045 (vsx_xxsel<mode>4): Likewise.
5047 2021-10-28 Hongyu Wang <hongyu.wang@intel.com>
5049 * config/i386/i386.c (use_rsqrt_p): Add mode parameter, enable
5050 HFmode rsqrt without TARGET_SSE_MATH.
5051 (ix86_optab_supported_p): Refactor rint, adjust floor, ceil,
5052 btrunc condition to be restricted by -ftrapping-math, adjust
5053 use_rsqrt_p function call.
5054 * config/i386/i386.md (rcphf2): New define_insn.
5055 (rsqrthf2): Likewise.
5056 * config/i386/sse.md (div<mode>3): Change VF2H to VF2.
5057 (div<mode>3): New expander for HF mode.
5058 (rsqrt<mode>2): Likewise.
5059 (*avx512fp16_vmrcpv8hf2): New define_insn for rpad pass.
5060 (*avx512fp16_vmrsqrtv8hf2): Likewise.
5062 2021-10-27 Saagar Jha <saagar@saagarjha.com>
5064 * config.gcc: Adjust for Darwin21.
5065 * config/darwin-c.c (macosx_version_as_macro): Likewise.
5066 * config/darwin-driver.c (validate_macosx_version_min):
5068 (darwin_find_version_from_kernel): Likewise.
5070 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
5072 * gimple-range-path.cc
5073 (path_range_query::range_defined_in_block): Call killing_def.
5075 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
5077 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Dump
5078 operands as well as relation.
5079 * gimple-range-path.cc
5080 (path_range_query::compute_ranges_in_block): Compute PHI relations
5081 first. Compute outgoing relations at the end.
5082 (path_range_query::compute_ranges): Remove call to compute_relations.
5083 (path_range_query::compute_relations): Remove.
5084 (path_range_query::maybe_register_phi_relation): New.
5085 (path_range_query::compute_phi_relations): Abstract out
5086 registering one PHI relation to...
5087 (path_range_query::compute_outgoing_relations): ...here.
5088 * gimple-range-path.h (class path_range_query): Remove
5090 Add maybe_register_phi_relation.
5092 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
5093 Andrew MacLeod <amacleod@redhat.com>
5095 * value-relation.cc (path_oracle::killing_def): Kill second
5098 2021-10-27 Martin Jambor <mjambor@suse.cz>
5100 * ipa-cp.c (good_cloning_opportunity_p): Decide whether to use
5101 profile feedback depending on their local availability.
5103 2021-10-27 Martin Jambor <mjambor@suse.cz>
5105 * params.opt (param_ipa_cp_profile_count_base): New parameter.
5106 * doc/invoke.texi (Optimize Options): Add entry for
5107 ipa-cp-profile-count-base.
5108 * ipa-cp.c (max_count): Replace with base_count, replace all
5109 occurrences too, unless otherwise stated.
5110 (ipcp_cloning_candidate_p): identify mostly-directly called
5111 functions based on their counts, not max_count.
5112 (compare_edge_profile_counts): New function.
5113 (ipcp_propagate_stage): Instead of setting max_count, find the
5114 appropriate edge count in a sorted vector of counts of eligible
5115 edges and make it the base_count.
5117 2021-10-27 Martin Jambor <mjambor@suse.cz>
5119 * ipa-cp.c (struct caller_statistics): New fields rec_count_sum,
5120 n_nonrec_calls and itself, document all fields.
5121 (init_caller_stats): Initialize the above new fields.
5122 (gather_caller_stats): Gather self-recursive counts and calls number.
5123 (get_info_about_necessary_edges): Gather counts of self-recursive and
5124 other edges bringing in the requested value separately.
5125 (dump_profile_updates): Rework to dump info about a single node only.
5126 (lenient_count_portion_handling): New function.
5127 (struct gather_other_count_struct): New type.
5128 (gather_count_of_non_rec_edges): New function.
5129 (struct desc_incoming_count_struct): New type.
5130 (analyze_clone_icoming_counts): New function.
5131 (adjust_clone_incoming_counts): Likewise.
5132 (update_counts_for_self_gen_clones): Likewise.
5133 (update_profiling_info): Rewritten.
5134 (update_specialized_profile): Adjust call to dump_profile_updates.
5135 (create_specialized_node): Do not update profiling info.
5136 (decide_about_value): New parameter self_gen_clones, either push new
5137 clones into it or updat their profile counts. For self-recursively
5138 generated values, use a portion of the node count instead of count
5139 from self-recursive edges to estimate goodness.
5140 (decide_whether_version_node): Gather clones for self-generated values
5141 in a new vector, update their profiles at once at the end.
5143 2021-10-27 Richard Biener <rguenther@suse.de>
5145 * tree-vectorizer.c (vect_transform_loops): New function,
5147 (try_vectorize_loop_1): ... here. Simplify as epilogues
5148 are now fully handled in the split part.
5150 2021-10-27 Jakub Jelinek <jakub@redhat.com>
5152 * omp-expand.c (expand_omp_for_init_counts): Handle non-rectangular
5153 iterators with pointer types.
5154 (expand_omp_for_init_vars, extract_omp_for_update_vars): Likewise.
5156 2021-10-26 Martin Sebor <msebor@redhat.com>
5158 PR tree-optimization/102238
5159 PR tree-optimization/102919
5160 * gimple-ssa-sprintf.c (get_string_length): Add an argument.
5161 (array_elt_at_offset): Move to pointer-query.
5162 (set_aggregate_size_and_offset): New function.
5163 (field_at_offset): Move to pointer-query.
5164 (get_origin_and_offset): Rename...
5165 (get_origin_and_offset_r): this. Add an argument. Make aggregate
5166 handling more robust.
5167 (get_origin_and_offset): New.
5168 (alias_offset): Add an argument.
5169 (format_string): Use subobject size determined by get_origin_and_offset.
5170 * pointer-query.cc (field_at_offset): Move from gimple-ssa-sprintf.c.
5171 Improve/correct handling of aggregates.
5172 (array_elt_at_offset): Same.
5173 * pointer-query.h (field_at_offset): Declare.
5174 (array_elt_at_offset): Declare.
5176 2021-10-26 Martin Sebor <msebor@redhat.com>
5178 * builtins.c (check_strncat_sizes): Pass access_data ctor additional
5180 (expand_builtin_memcmp): Move code to gimple-ssa-warn-access.cc.
5181 (expand_builtin_fork_or_exec): Same.
5182 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Pass
5183 compute_objsize additional arguments.
5184 (inbounds_memaccess_p): Same.
5185 (array_bounds_checker::check_array_bounds): Add an assert. Stash
5186 statement in a member.
5187 (check_array_bounds_dom_walker::before_dom_children): Same.
5188 * gimple-array-bounds.h (array_bounds_checker::m_stmt): New member.
5189 * gimple-ssa-sprintf.c (get_destination_size): Add an argument.
5190 (handle_printf_call): Pass a new argument.
5191 * gimple-ssa-warn-access.cc (get_size_range): Add an argument.
5192 (check_access): Add an argument and pass it along to callees.
5193 (check_read_access): Make a member function.
5194 (pass_waccess::check_strcat): Pass access_data ctor additional
5196 (pass_waccess::check_strncat): Same.
5197 (pass_waccess::check_stxcpy): Same.
5198 (pass_waccess::check_stxncpy): Same.
5199 (pass_waccess::check_strncmp): Same.
5200 (pass_waccess::check_read_access): Same.
5201 (pass_waccess::check_builtin): Same.
5202 (pass_waccess::maybe_check_access_sizes): Same.
5203 (pass_waccess::maybe_check_dealloc_call): Same.
5204 * gimple-ssa-warn-access.h (check_read_access): Declare a new
5206 * pointer-query.cc (compute_objsize_r): Add an argument.
5207 (gimple_call_return_array): Same.
5208 (gimple_call_alloc_size): Same.
5209 (access_ref::access_ref): Same.
5210 (access_ref::get_ref): Same.
5211 (pointer_query::get_ref): Same.
5212 (handle_min_max_size): Pass an arguments to callees.
5213 (handle_array_ref): Add an argument.
5214 (handle_mem_ref): Same.
5215 (compute_objsize): Same.
5216 * pointer-query.h (struct access_ref): Adjust signatures.
5217 (struct access_data): Same.
5218 (gimple_call_alloc_size): Add an argument.
5219 (gimple_parm_array_size): Same.
5220 (compute_objsize): Same.
5221 * tree-ssa-strlen.c (strlen_pass::adjust_last_stmt): Pass an additional
5222 argument to compute_objsize.
5223 (strlen_pass::maybe_warn_overflow): Same.
5224 (maybe_diag_stxncpy_trunc): Same.
5226 2021-10-26 Martin Sebor <msebor@redhat.com>
5228 PR middle-end/102453
5229 * gimple-ssa-warn-access.cc (pass_waccess::check_atomic_builtin): New.
5230 (pass_waccess::check_atomic_builtin): Call it.
5232 2021-10-26 Vladimir N. Makarov <vmakarov@redhat.com>
5234 PR rtl-optimization/102842
5235 * lra-constraints.c (match_reload): Ignore out in checking values
5237 (curr_insn_transform): Collect outputs before doing reloads of operands.
5239 2021-10-26 Paul A. Clarke <pc@us.ibm.com>
5242 * config/rs6000/x86intrin.h: Move some included headers to new
5243 headers. Include new immintrin.h instead of those headers.
5244 * config/rs6000/immintrin.h: New.
5245 * config/rs6000/x86gprintrin.h: New.
5246 * config.gcc (powerpc*-*-*): Add new headers to extra_headers.
5248 2021-10-26 Richard Biener <rguenther@suse.de>
5250 * tree-vectorizer.h (vect_create_addr_base_for_vector_ref):
5251 Remove byte_offset parameter.
5252 (vect_create_data_ref_ptr): Likewise.
5253 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
5255 (vect_create_data_ref_ptr): Likewise.
5256 * tree-vect-stmts.c (vectorizable_store): Adjust.
5257 (vectorizable_load): Likewise.
5259 2021-10-26 Richard Biener <rguenther@suse.de>
5261 PR tree-optimization/96109
5262 * tree-vectorizer.h (dr_misalignment): Add optional offset
5264 * tree-vect-data-refs.c (dr_misalignment): Likewise. Remove
5265 offset applied for negative stride accesses.
5266 (vect_enhance_data_refs_alignment): Compute negative stride
5267 access offset and pass it to dr_misalignment.
5268 * tree-vect-stmts.c (get_negative_load_store_type): Pass
5269 negative offset to dr_misalignment.
5270 (get_group_load_store_type): Likewise.
5271 (get_load_store_type): Likewise.
5272 (vectorizable_store): Remove asserts about alignment.
5273 (vectorizable_load): Likewise.
5275 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
5277 PR tree-optimization/102897
5278 * tree-ssa-forwprop.c (simplify_permutation): Remove a wrong assertion.
5280 2021-10-26 Richard Biener <rguenther@suse.de>
5282 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
5283 Take offset in bytes.
5284 (vect_create_data_ref_ptr): Likewise.
5285 * tree-vect-loop-manip.c (get_misalign_in_elems): Multiply
5286 offset by element size.
5287 (vect_create_cond_for_align_checks): Likewise.
5288 * tree-vect-stmts.c (get_negative_load_store_type): Likewise.
5289 (vectorizable_load): Remove duplicate leftover from merge
5292 2021-10-26 Roger Sayle <roger@nextmovesoftware.com>
5294 * config/i386/i386-expand.c (ix86_expand_v1ti_shift): New helper
5295 function to expand V1TI mode logical shifts by integer constants.
5296 (ix86_expand_v1ti_rotate): New helper function to expand V1TI
5297 mode rotations by integer constants.
5298 * config/i386/i386-protos.h (ix86_expand_v1ti_shift,
5299 ix86_expand_v1ti_rotate): Prototype new functions here.
5300 * config/i386/sse.md (ashlv1ti3, lshrv1ti3, rotlv1ti3, rotrv1ti3):
5301 New TARGET_SSE2 expanders to implement V1TI shifts and rotations.
5303 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
5305 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
5306 Avoid threading circular paths.
5308 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
5310 * tree-ssa-threadbackward.c (back_threader::resolve_phi):
5311 Attempt to resolve all incoming paths to a PHI.
5312 (back_threader::resolve_def): Always return true for PHIs.
5314 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
5316 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
5317 Always try to resolve path without looking back.
5318 * tree-ssa-threadupdate.c (dump_jump_thread): Indidicate whether
5319 edge is a back edge.
5321 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
5323 PR tree-optimization/102789
5324 * tree-vect-loop-manip.c (vect_update_inits_of_drs): Do not
5325 update inits of simd_lane_access.
5327 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
5329 * tree-vrp.c (vrp_simplify_cond_using_ranges): Add return type and
5330 move to vr-values.c.
5331 (simplify_casted_conds): Move to vrp_folder class.
5332 (execute_vrp): Call via vrp_folder now.
5333 * vr-values.c (simplify_cond_using_ranges_1): Call simplify_casted_cond.
5334 (simplify_using_ranges::simplify_casted_cond): Relocate from tree-vrp.c.
5335 * vr-values.h (simplify_casted_cond): Add prototype.
5337 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
5339 * tree-vrp.c (rvrp_folder::fold_stmt): If simplification fails, try
5342 2021-10-25 Paul A. Clarke <pc@us.ibm.com>
5344 * config/rs6000/smmintrin.h (_mm_testz_si128): Add "extern" to
5346 (_mm_testc_si128): Likewise.
5347 (_mm_testnzc_si128): Likewise.
5348 (_mm_blend_ps): Likewise.
5349 (_mm_blendv_ps): Likewise.
5350 (_mm_blend_pd): Likewise.
5351 (_mm_blendv_pd): Likewise.
5352 (_mm_ceil_pd): Likewise.
5353 (_mm_ceil_sd): Likewise.
5354 (_mm_ceil_ps): Likewise.
5355 (_mm_ceil_ss): Likewise.
5356 (_mm_floor_pd): Likewise.
5357 (_mm_floor_sd): Likewise.
5358 (_mm_floor_ps): Likewise.
5359 (_mm_floor_ss): Likewise.
5360 (_mm_minpos_epu16): Likewise.
5361 (_mm_mul_epi32): Likewise.
5362 (_mm_cvtepi8_epi16): Likewise.
5363 (_mm_packus_epi32): Likewise.
5364 (_mm_cmpgt_epi64): Likewise.
5366 2021-10-25 Roger Sayle <roger@nextmovesoftware.com>
5368 * simplify-rtx.c (simplify_binary_operation_1) [SS_ASHIFT]: Simplify
5369 shifts of the mode's smin_value and smax_value when the bit count
5370 operand doesn't have side-effects.
5371 [US_ASHIFT]: Likewise, simplify shifts of the mode's umax_value
5372 when the bit count operand doesn't have side-effects.
5373 (simplify_const_binary_operation) [SS_ASHIFT, US_ASHIFT]: Perform
5374 compile-time evaluation of saturating left shifts with constant
5377 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
5379 * gimple-range.cc (gimple_ranger::export_global_ranges): Remove check
5382 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
5384 * flag-types.h (enum ranger_debug): Adjust values.
5385 * params.opt (ranger_debug): Ditto.
5387 2021-10-25 Martin Jambor <mjambor@suse.cz>
5389 PR tree-optimization/102886
5390 * tree-sra.c (totally_scalarize_subtree): Fix the out of
5393 2021-10-25 Andrew Pinski <apinski@marvell.com>
5395 * tree-ssa-dce.c (simple_dce_from_worklist):
5396 Check stmt_unremovable_because_of_non_call_eh_p also
5397 before removing the statement.
5399 2021-10-25 Richard Biener <rguenther@suse.de>
5401 PR tree-optimization/102905
5402 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
5403 Use vect_supportable_dr_alignment again to determine whether
5404 an access is supported when not aligned.
5406 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
5408 * config/riscv/riscv.c (riscv_rtx_costs): Handle cost model
5411 2021-10-25 Jim Wilson <jimw@sifive.com>
5412 Kito Cheng <kito.cheng@sifive.com>
5413 Jia-Wei Chen <jiawei@iscas.ac.cn>
5414 Shi-Hua Liao <shihua@iscas.ac.cn>
5416 * config/riscv/bitmanip.md (shiftm1): New.
5417 (*bset<mode>): Ditto.
5418 (*bset<mode>_mask): Ditto.
5419 (*bset<mode>_1): Ditto.
5420 (*bset<mode>_1_mask): Ditto.
5421 (*bseti<mode>): Ditto.
5422 (*bclr<mode>): Ditto.
5423 (*bclri<mode>): Ditto.
5424 (*binv<mode>): Ditto.
5425 (*binvi<mode>): Ditto.
5426 (*bext<mode>): Ditto.
5428 * config/riscv/predicates.md (splittable_const_int_operand):
5430 (single_bit_mask_operand): New.
5431 (not_single_bit_mask_operand): Ditto.
5432 (const31_operand): Ditto.
5433 (const63_operand): Ditto.
5434 * config/riscv/riscv.c (riscv_build_integer_1): Handle bseti.
5435 (riscv_output_move): Ditto.
5436 (riscv_print_operand): Handle new operand type: T and S.
5437 * config/riscv/riscv.h (SINGLE_BIT_MASK_OPERAND): New.
5439 2021-10-25 Jim Wilson <jimw@sifive.com>
5441 * config/riscv/riscv.c (riscv_build_integer_1): Build integer
5444 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
5446 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
5448 (riscv_rtx_costs): Ditto.
5450 2021-10-25 Jim Wilson <jimw@sifive.com>
5451 Kito Cheng <kito.cheng@sifive.com>
5452 Jia-Wei Chen <jiawei@iscas.ac.cn>
5454 * config/riscv/bitmanip.md (bitmanip_bitwise): New.
5455 (bitmanip_minmax): New.
5456 (clz_ctz_pcnt): New.
5457 (bitmanip_optab): New.
5458 (bitmanip_insn): New.
5459 (*<optab>_not<mode>): New.
5460 (*xor_not<mode>): New.
5461 (<bitmanip_optab>si2): New.
5462 (*<bitmanip_optab>disi2): New.
5463 (<bitmanip_optab>di2): New.
5464 (*zero_extendhi<GPR:mode>2_bitmanip): New.
5465 (*extend<SHORT:mode><SUPERQI:mode>2_zbb): New.
5466 (*zero_extendhi<GPR:mode>2_zbb): New.
5469 (rotrsi3_sext): New.
5472 (rotlsi3_sext): New.
5473 (bswap<mode>2): New.
5474 (<bitmanip_optab><mode>3): New.
5475 * config/riscv/riscv.md (type): Add rotate.
5476 (zero_extendhi<GPR:mode>2): Change to define_expand pattern.
5477 (*zero_extendhi<GPR:mode>2): New.
5478 (extend<SHORT:mode><SUPERQI:mode>2): Change to define_expand pattern.
5479 (*extend<SHORT:mode><SUPERQI:mode>2): New.
5481 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
5483 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
5485 (riscv_rtx_costs): Ditto.
5487 2021-10-25 Jim Wilson <jimw@sifive.com>
5488 Kito Cheng <kito.cheng@sifive.com>
5489 Jia-Wei Chen <jiawei@iscas.ac.cn>
5491 * config/riscv/bitmanip.md (*zero_extendsidi2_bitmanip): New.
5496 (riscv_rtx_costs): Ditto.
5497 * config/riscv/riscv.md: Include bitmanip.md
5498 (type): Add bitmanip bype.
5499 (zero_extendsidi2): Change to define_expand pattern.
5500 (*zero_extendsidi2_internal): New.
5501 (zero_extendsidi2_shifted): Disable for ZBA.
5503 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
5505 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
5506 Add zba, zbb, zbc and zbs.
5507 (riscv_ext_flag_table): Ditto.
5508 * config/riscv/riscv-opts.h (MASK_ZBA): New.
5512 (TARGET_ZBA): Ditto.
5513 (TARGET_ZBB): Ditto.
5514 (TARGET_ZBC): Ditto.
5515 (TARGET_ZBS): Ditto.
5516 * config/riscv/riscv.opt (riscv_zb_subext): New.
5518 2021-10-25 liuhongt <hongtao.liu@intel.com>
5521 * match.pd: Simplify (_Float16) sqrtf((float) a) to .SQRT(a)
5522 when direct_internal_fn_supported_p, similar for sqrt/sqrtl.
5524 2021-10-25 Richard Biener <rguenther@suse.de>
5526 PR tree-optimization/102920
5527 * tree-ssa-sccvn.h (expressions_equal_p): Add argument
5528 controlling VN_TOP matching behavior.
5529 * tree-ssa-sccvn.c (expressions_equal_p): Likewise.
5530 (vn_phi_eq): Do not optimistically match VN_TOP.
5532 2021-10-25 konglin1 <lingling.kong@intel.com>
5534 * config/i386/sse.md (fma_<mode>_fadd_fmul): Add new
5535 define_insn_and_split.
5536 (fma_<mode>_fadd_fcmul):Likewise
5537 (fma_<complexopname>_<mode>_fma_zero):Likewise
5539 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
5541 * config/pa/pa-d.c (pa_d_handle_target_float_abi): Don't check
5542 TARGET_DISABLE_FPREGS.
5543 * config/pa/pa.c (fix_range): Use MASK_SOFT_FLOAT instead of
5544 MASK_DISABLE_FPREGS.
5545 (hppa_rtx_costs): Don't check TARGET_DISABLE_FPREGS. Adjust
5546 cost of hardware integer multiplication.
5547 (pa_conditional_register_usage): Don't check TARGET_DISABLE_FPREGS.
5548 * config/pa/pa.h (INT14_OK_STRICT): Likewise.
5549 * config/pa/pa.md: Don't check TARGET_DISABLE_FPREGS. Check
5550 TARGET_SOFT_FLOAT in patterns that use xmpyu instruction.
5551 * config/pa/pa.opt (mdisable-fpregs): Change target mask to
5552 SOFT_FLOAT. Revise comment.
5553 (msoft-float): New option.
5555 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
5557 * config/pa/pa.md: Don't use 'G' constraint in integer move patterns.
5559 2021-10-24 Gerald Pfeifer <gerald@pfeifer.com>
5561 * doc/install.texi (Specific): Remove obsolete details
5562 around GNU/Linux on Itanium.
5563 (Specific): Remove reference to Windows for Itanium.
5565 2021-10-23 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
5567 * config/i386/x86-tune-sched-bd.c (dispatch_group): Commentary
5570 2021-10-23 Jan Hubicka <hubicka@ucw.cz>
5572 * tree-ssa-structalias.c (compute_points_to_sets): Cleanup.
5574 2021-10-23 Roger Sayle <roger@nextmovesoftware.com>
5576 * config/i386/sse.md (<any_logic>v1ti3): New define_insn to
5577 implement V1TImode AND, IOR and XOR on TARGET_SSE2 (and above).
5578 (one_cmplv1ti2): New define expand.
5580 2021-10-22 Eric Gallager <egallager@gcc.gnu.org>
5583 * Makefile.in: Handle dvidir and install-dvi target.
5584 * configure: Regenerate.
5585 * configure.ac: Add install-dvi to target_list.
5587 2021-10-22 Gerald Pfeifer <gerald@pfeifer.com>
5589 * doc/install.texi (Binaries): Convert mingw-w64.org to https.
5592 2021-10-22 Richard Biener <rguenther@suse.de>
5594 PR tree-optimization/102893
5595 * tree-ssa-dce.c (find_obviously_necessary_stmts): Fix the
5596 test for an exit edge.
5598 2021-10-22 Aldy Hernandez <aldyh@redhat.com>
5599 Andrew MacLeod <amacleod@redhat.com>
5601 * gimple-range-path.cc (path_range_query::compute_phi_relations):
5602 Kill any global relations we may know before registering a new
5604 * value-relation.cc (path_oracle::killing_def): New.
5605 * value-relation.h (path_oracle::killing_def): New.
5607 2021-10-22 Richard Biener <rguenther@suse.de>
5610 * tree-ssa-sccvn.c (vn_phi_insert): For undefined SSA args
5612 (vn_phi_lookup): Likewise.
5614 2021-10-21 H.J. Lu <hjl.tools@gmail.com>
5617 * doc/invoke.texi: Document -fcf-protection requires i686 or
5620 2021-10-21 Eric Botcazou <ebotcazou@adacore.com>
5622 PR middle-end/102764
5623 * cfgexpand.c (expand_gimple_basic_block): Robustify latest change.
5625 2021-10-21 Jonathan Wright <jonathan.wright@arm.com>
5627 * config/aarch64/arm_neon.h (__STRUCTN): Delete function
5628 macro and all invocations.
5630 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
5632 * doc/invoke.texi (ranger-debug): Document.
5633 * flag-types.h (enum ranger_debug): New.
5634 (enum evrp_mode): Remove debug values.
5635 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Use new debug flag.
5636 * gimple-range-gori.cc (gori_compute::gori_compute): Ditto.
5637 * gimple-range.cc (gimple_ranger::gimple_ranger): Ditto.
5638 * gimple-ssa-evrp.c (hybrid_folder::choose_value): Ditto.
5639 (execute_early_vrp): Use evrp-mode directly.
5640 * params.opt (enum evrp_mode): Remove debug values.
5641 (ranger-debug): New.
5642 (ranger-logical-depth): Relocate to be in alphabetical order.
5644 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
5646 * doc/invoke.texi: (vrp1-mode, vrp2-mode): Document.
5647 * flag-types.h: (enum vrp_mode): New.
5648 * params.opt: (vrp1-mode, vrp2-mode): New.
5649 * tree-vrp.c (vrp_pass_num): New.
5650 (pass_vrp::pass_vrp): Set pass number.
5651 (pass_vrp::execute): Choose which VRP mode to execute.
5653 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
5655 * gimple-ssa-evrp.c (class rvrp_folder): Move to tree-vrp.c.
5656 (execute_early_vrp): For ranger only mode, invoke ranger_vrp.
5657 * tree-vrp.c (class rvrp_folder): Relocate here.
5658 (execute_ranger_vrp): New.
5659 * tree-vrp.h (execute_ranger_vrp): Export.
5661 2021-10-21 Martin Liska <mliska@suse.cz>
5665 * opts.c (finish_options): Process flag_var_tracking* options
5666 here as they can be adjusted by optimize attribute.
5667 Process also flag_syntax_only and flag_gtoggle.
5668 * toplev.c (process_options): Remove it here.
5669 * common.opt: Make debug_nonbind_markers_p as PerFunction
5670 attribute as it depends on optimization level.
5672 2021-10-21 Martin Jambor <mjambor@suse.cz>
5674 PR tree-optimization/102505
5675 * tree-sra.c (totally_scalarize_subtree): Check that the
5676 encountered field fits within the acces we would like to put it
5679 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
5681 * tree-ssa-threadbackward.c
5682 (back_threader::maybe_register_path): Remove circular paths check.
5684 2021-10-21 Richard Biener <rguenther@suse.de>
5686 * toplev.c (process_options): Move the initial debug_hooks
5688 (toplev::main): ... before the call of the post_options
5691 2021-10-21 Richard Biener <rguenther@suse.de>
5693 PR tree-optimization/102847
5694 * tree-vect-stmts.c (vect_model_load_cost): Add the scalar
5695 load cost in the prologue for VMAT_INVARIANT.
5697 2021-10-21 Richard Biener <rguenther@suse.de>
5699 PR tree-optimization/102847
5700 * tree-vect-stmts.c (vect_model_load_cost): Explicitely
5701 handle VMAT_INVARIANT as a splat in the prologue.
5703 2021-10-21 Hongyu Wang <hongyu.wang@intel.com>
5706 * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector
5707 move to use the same logic as HImode.
5709 2021-10-21 Richard Biener <rguenther@suse.de>
5711 * tree-vect-slp.c (vect_build_slp_tree_1): Remove
5712 superfluous gimple_call_nothrow_p check.
5714 2021-10-21 Andrew Pinski <apinski@marvell.com>
5716 * tree-cfg.c (maybe_remove_writeonly_store): Add dce_ssa_names argument.
5717 Mark the ssa-name of the rhs as one to be removed.
5718 (execute_fixup_cfg): Update call to maybe_remove_writeonly_store.
5719 Call simple_dce_from_worklist at the end to a simple dce.
5721 2021-10-21 Andrew Pinski <apinski@marvell.com>
5723 * tree-cfg.c (maybe_remove_writeonly_store): New function
5724 factored out from ...
5725 (execute_fixup_cfg): Here. Call maybe_remove_writeonly_store.
5727 2021-10-21 Andrew Pinski <apinski@marvell.com>
5729 * tree-cfg.c (execute_fixup_cfg): Remove comment
5730 about standalone pass.
5732 2021-10-21 Andrew Pinski <apinski@marvell.com>
5734 * tree-cfg.c (execute_fixup_cfg): Output when the statement
5735 is removed when it is a write only var.
5737 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
5739 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
5740 Avoid threading circular paths.
5742 2021-10-20 Alex Coplan <alex.coplan@arm.com>
5744 * calls.c (initialize_argument_information): Remove some dead
5745 code, remove handling for function_arg returning const_int.
5746 * doc/tm.texi: Delete documentation for unused target hooks.
5747 * doc/tm.texi.in: Likewise.
5748 * target.def (load_bounds_for_arg): Delete.
5749 (store_bounds_for_arg): Delete.
5750 (load_returned_bounds): Delete.
5751 (store_returned_bounds): Delete.
5752 * targhooks.c (default_load_bounds_for_arg): Delete.
5753 (default_store_bounds_for_arg): Delete.
5754 (default_load_returned_bounds): Delete.
5755 (default_store_returned_bounds): Delete.
5756 * targhooks.h (default_load_bounds_for_arg): Delete.
5757 (default_store_bounds_for_arg): Delete.
5758 (default_load_returned_bounds): Delete.
5759 (default_store_returned_bounds): Delete.
5761 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
5763 * doc/extend.texi (Basic Asm): Clarify that asm is not an
5765 * doc/invoke.texi (-fno-asm): Fix description for C++.
5767 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
5769 * doc/install.texi: Remove link to old.html
5771 2021-10-20 Tamar Christina <tamar.christina@arm.com>
5773 * config/aarch64/aarch64-simd.md (*aarch64_cmtst_same_<mode>): New.
5775 2021-10-20 Tamar Christina <tamar.christina@arm.com>
5777 * config/aarch64/aarch64-simd.md (*aarch64_narrow_trunc<mode>): New.
5779 2021-10-20 Tamar Christina <tamar.christina@arm.com>
5781 * config/aarch64/aarch64-simd.md (aarch64_simd_ashr<mode>): Add case cmp
5783 * config/aarch64/constraints.md (D1): New.
5785 2021-10-20 Tamar Christina <tamar.christina@arm.com>
5787 * config/aarch64/aarch64-simd.md
5788 (*aarch64_<srn_op>topbits_shuffle<mode>_le): New.
5789 (*aarch64_topbits_shuffle<mode>_le): New.
5790 (*aarch64_<srn_op>topbits_shuffle<mode>_be): New.
5791 (*aarch64_topbits_shuffle<mode>_be): New.
5792 * config/aarch64/predicates.md
5793 (aarch64_simd_shift_imm_vec_exact_top): New.
5795 2021-10-20 Tamar Christina <tamar.christina@arm.com>
5797 * config/aarch64/aarch64-simd.md (*aarch64_<srn_op>shrn<mode>_vect,
5798 *aarch64_<srn_op>shrn<mode>2_vect_le,
5799 *aarch64_<srn_op>shrn<mode>2_vect_be): New.
5800 * config/aarch64/iterators.md (srn_op): New.
5802 2021-10-20 Chung-Lin Tang <cltang@codesourcery.com>
5804 * omp-low.c (omp_copy_decl_2): For !ctx, use record_vars to add new copy
5806 (scan_sharing_clauses): Place copy of OMP_CLAUSE_IN_REDUCTION decl in
5807 ctx->outer instead of ctx.
5809 2021-10-20 Martin Liska <mliska@suse.cz>
5812 2021-10-19 Martin Liska <mliska@suse.cz>
5815 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
5816 * system.h (strip_whilespaces): New function.
5818 2021-10-20 Martin Liska <mliska@suse.cz>
5821 2021-10-19 Martin Liska <mliska@suse.cz>
5824 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
5827 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
5829 * config/aarch64/aarch64.c (aarch64_case_values_threshold):
5830 Change to 8 with -Os, 11 otherwise.
5832 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
5834 * config/aarch64/aarch64.c (neoversev1_tunings):
5835 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
5836 (neoversen2_tunings): Likewise.
5838 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
5841 * config/aarch64/aarch64.md (frint_pattern): Update comment.
5842 * config/aarch64/aarch64-simd-builtins.def: Change frintn to roundeven.
5843 * config/aarch64/arm_fp16.h: Change frintn to roundeven.
5844 * config/aarch64/arm_neon.h: Likewise.
5845 * config/aarch64/iterators.md (frint_pattern): Use roundeven for FRINTN.
5847 2021-10-20 Martin Liska <mliska@suse.cz>
5849 * config/arm/arm.c (arm_unwind_emit_sequence): Do not declare
5850 already declared global variable.
5851 (arm_unwind_emit_set): Use out_file as function argument.
5852 (arm_unwind_emit): Likewise.
5853 * config/darwin.c (machopic_output_data_section_indirection): Likewise.
5854 (machopic_output_stub_indirection): Likewise.
5855 (machopic_output_indirection): Likewise.
5856 (machopic_finish): Likewise.
5857 * config/i386/i386.c (ix86_asm_output_function_label): Likewise.
5858 * config/i386/winnt.c (i386_pe_seh_unwind_emit): Likewise.
5859 * config/ia64/ia64.c (process_epilogue): Likewise.
5860 (process_cfa_adjust_cfa): Likewise.
5861 (process_cfa_register): Likewise.
5862 (process_cfa_offset): Likewise.
5863 (ia64_asm_unwind_emit): Likewise.
5864 * config/s390/s390.c (s390_asm_output_function_label): Likewise.
5866 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
5868 * config/aarch64/aarch64-builtins.c
5869 (aarch64_general_gimple_fold_builtin):
5870 lower vld1 and vst1 variants of the neon builtins
5871 * config/aarch64/aarch64-protos.h:
5872 (aarch64_general_gimple_fold_builtin): Add gsi parameter.
5873 * config/aarch64/aarch64.c (aarch64_general_gimple_fold_builtin):
5876 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
5878 * match.pd: Generate IFN_TRUNC.
5880 2021-10-20 Richard Biener <rguenther@suse.de>
5882 PR tree-optimization/102853
5883 * tree-data-ref.c (split_constant_offset_1): Bail out
5884 immediately if the expression traps on overflow.
5886 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
5888 * tree-ssa-threadbackward.c (back_threader::~back_threader): Remove.
5890 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
5892 * tree-ssa-threadupdate.c (back_jt_path_registry::adjust_paths_after_duplication):
5893 Remove superflous debugging message.
5894 (back_jt_path_registry::duplicate_thread_path): Same.
5896 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
5898 * tree-ssa-threadbackward.c (back_threader_registry::back_threader_registry):
5900 (back_threader_registry::register_path): Remove m_threaded_paths.
5902 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
5904 PR tree-optimization/102814
5905 * doc/invoke.texi: Document --param=max-fsm-thread-length.
5906 * params.opt: Add --param=max-fsm-thread-length.
5907 * tree-ssa-threadbackward.c
5908 (back_threader_profitability::profitable_path_p): Fail on paths
5909 longer than max-fsm-thread-length.
5911 2021-10-20 Eric Botcazou <ebotcazou@adacore.com>
5913 PR middle-end/102764
5914 * cfgexpand.c (expand_gimple_basic_block): Disregard a final debug
5915 statement to reset the current location for the outgoing edges.
5917 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
5918 Richard Biener <rguenther@suse.de>
5920 * tree-ssa-threadupdate.c (cancel_thread): Dump threading reason
5921 on the same line as the threading cancellation.
5922 (jt_path_registry::cancel_invalid_paths): Avoid rotating loops.
5923 Avoid threading through loop headers where the path remains in the
5926 2021-10-20 Siddhesh Poyarekar <siddhesh@gotplt.org>
5928 * tree-object-size.c (unknown): Make into a function. Adjust
5930 (unknown_object_size): Simplify implementation.
5932 2021-10-20 Hongtao Liu <hongtao.liu@intel.com>
5933 Kewen Lin <linkw@linux.ibm.com>
5935 * doc/sourcebuild.texi (Effective-Target Keywords): Document
5936 vect_slp_v2qi_store, vect_slp_v4qi_store, vect_slp_v8qi_store,
5937 vect_slp_v16qi_store, vect_slp_v2hi_store,
5938 vect_slp_v4hi_store, vect_slp_v2si_store, vect_slp_v4si_store.
5940 2021-10-19 Jonathan Wakely <jwakely@redhat.com>
5942 * doc/extend.texi (Basic PowerPC Built-in Functions): Fix typo.
5944 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
5948 * config/rs6000/emmintrin.h: Guard POWER8 intrinsics.
5949 * config/rs6000/pmmintrin.h: Same.
5950 * config/rs6000/smmintrin.h: Same.
5951 * config/rs6000/tmmintrin.h: Same.
5953 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
5955 * config.gcc (extra_headers): Add nmmintrin.h.
5957 2021-10-19 Richard Biener <rguenther@suse.de>
5959 * tree-vectorizer.h (vect_supportable_dr_alignment): Add
5960 misalignment parameter.
5961 * tree-vect-data-refs.c (vect_get_peeling_costs_all_drs):
5962 Do not change DR_MISALIGNMENT in place, instead pass the
5963 adjusted misalignment to vect_supportable_dr_alignment.
5964 (vect_peeling_supportable): Likewise.
5965 (vect_peeling_hash_get_lowest_cost): Adjust.
5966 (vect_enhance_data_refs_alignment): Likewise.
5967 (vect_vfa_access_size): Likewise.
5968 (vect_supportable_dr_alignment): Add misalignment
5969 parameter and simplify.
5970 * tree-vect-stmts.c (get_negative_load_store_type): Adjust.
5971 (get_group_load_store_type): Likewise.
5972 (get_load_store_type): Likewise.
5974 2021-10-19 Clément Chigot <clement.chigot@atos.net>
5976 * config/rs6000/rs6000.c (rs6000_xcoff_file_end): Move
5977 __tls_get_addr reference to .text csect.
5979 2021-10-19 Martin Liska <mliska@suse.cz>
5982 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
5985 2021-10-19 Richard Biener <rguenther@suse.de>
5987 * tree-vectorizer.h (vect_get_store_cost): Adjust signature.
5988 (vect_get_load_cost): Likewise.
5989 * tree-vect-data-refs.c (vect_get_data_access_cost): Get
5990 alignment support scheme and misalignment as arguments
5992 (vect_get_peeling_costs_all_drs): Compute that info here
5993 and note that we shouldn't need to.
5994 * tree-vect-stmts.c (vect_model_store_cost): Get
5995 alignment support scheme and misalignment as arguments.
5996 (vect_get_store_cost): Likewise.
5997 (vect_model_load_cost): Likewise.
5998 (vect_get_load_cost): Likewise.
5999 (vectorizable_store): Pass down alignment support scheme
6000 and misalignment to costing.
6001 (vectorizable_load): Likewise.
6003 2021-10-19 Richard Biener <rguenther@suse.de>
6005 * tree-vect-stmts.c (get_negative_load_store_type): Add
6006 offset output parameter and initialize it.
6007 (get_group_load_store_type): Likewise.
6008 (get_load_store_type): Likewise.
6009 (vectorizable_store): Use offset as computed by
6010 get_load_store_type.
6011 (vectorizable_load): Likewise.
6013 2021-10-19 Richard Biener <rguenther@suse.de>
6015 PR tree-optimization/102827
6016 * tree-if-conv.c (predicate_statements): Add pe parameter
6017 and use that edge to insert invariant stmts on.
6018 (combine_blocks): Pass through pe.
6019 (tree_if_conversion): Compute the edge to insert invariant
6020 stmts on and pass it along.
6022 2021-10-19 Roger Sayle <roger@nextmovesoftware.com>
6025 * config/bfin/bfin.md (addsubv2hi3, subaddv2hi3, ssaddsubv2hi3,
6026 sssubaddv2hi3): Swap the order of operators in vec_concat.
6028 2021-10-19 Xionghu Luo <luoxhu@linux.ibm.com>
6030 * config/rs6000/altivec.md (*altivec_vmrghb_internal): Delete.
6031 (altivec_vmrghb_direct): New.
6032 (*altivec_vmrghh_internal): Delete.
6033 (altivec_vmrghh_direct): New.
6034 (*altivec_vmrghw_internal): Delete.
6035 (altivec_vmrghw_direct_<mode>): New.
6036 (altivec_vmrghw_direct): Delete.
6037 (*altivec_vmrglb_internal): Delete.
6038 (altivec_vmrglb_direct): New.
6039 (*altivec_vmrglh_internal): Delete.
6040 (altivec_vmrglh_direct): New.
6041 (*altivec_vmrglw_internal): Delete.
6042 (altivec_vmrglw_direct_<mode>): New.
6043 (altivec_vmrglw_direct): Delete.
6044 * config/rs6000/rs6000-p8swap.c (rtx_is_swappable_p): Adjust.
6045 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const):
6047 * config/rs6000/vsx.md (vsx_xxmrghw_<mode>): Adjust.
6048 (vsx_xxmrglw_<mode>): Adjust.
6050 2021-10-19 Aldy Hernandez <aldyh@redhat.com>
6052 * passes.def: Change threading comment before pass_ccp pass.
6054 2021-10-19 Haochen Gui <guihaoc@gcc.gnu.org>
6056 * config/rs6000/rs6000-call.c (altivec_expand_lxvr_builtin):
6057 Modify the expansion for sign extension. All extensions are done
6058 within VSX registers.
6060 2021-10-19 Richard Biener <rguenther@suse.de>
6062 * tree-vect-stmts.c (get_group_load_store_type): Add
6063 misalignment output parameter and initialize it.
6064 (get_group_load_store_type): Likewise.
6065 (vectorizable_store): Remove now redundant queries.
6066 (vectorizable_load): Likewise.
6068 2021-10-19 Richard Biener <rguenther@suse.de>
6070 * tree-vectorizer.h (vect_supportable_dr_alignment): Remove
6071 check_aligned argument.
6072 * tree-vect-data-refs.c (vect_supportable_dr_alignment):
6074 (vect_peeling_hash_insert): Add supportable_if_not_aligned
6075 argument and do not call vect_supportable_dr_alignment here.
6076 (vect_peeling_supportable): Adjust.
6077 (vect_enhance_data_refs_alignment): Compute whether the
6078 access is supported with different alignment here and
6079 pass that down to vect_peeling_hash_insert.
6080 (vect_vfa_access_size): Adjust.
6081 * tree-vect-stmts.c (vect_get_store_cost): Likewise.
6082 (vect_get_load_cost): Likewise.
6083 (get_negative_load_store_type): Likewise.
6084 (get_group_load_store_type): Likewise.
6085 (get_load_store_type): Likewise.
6087 2021-10-19 Martin Liska <mliska@suse.cz>
6090 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
6091 * system.h (strip_whilespaces): New function.
6093 2021-10-19 dianhong xu <dianhong.xu@intel.com>
6095 * config/i386/avx512fp16intrin.h:
6096 (_mm512_set1_pch): New intrinsic.
6097 * config/i386/avx512fp16vlintrin.h:
6098 (_mm256_set1_pch): New intrinsic.
6099 (_mm_set1_pch): Ditto.
6101 2021-10-18 Andrew MacLeod <amacleod@redhat.com>
6103 PR tree-optimization/102796
6104 * gimple-range.cc (gimple_ranger::range_on_edge): Process EH edges
6105 normally. Return get_tree_range for non gimple_range_ssa_p names.
6106 (gimple_ranger::range_of_stmt): Use get_tree_range for non
6107 gimple_range_ssa_p names.
6109 2021-10-18 Uroš Bizjak <ubizjak@gmail.com>
6112 * config/i386/i386.c (ix86_print_operand_address):
6113 Error out for non-address_operand asm operands.
6115 2021-10-18 Richard Biener <rguenther@suse.de>
6117 * tree-vect-data-refs.c (vect_peeling_hash_insert): Do
6118 not auto-convert dr_alignment_support to bool.
6119 (vect_peeling_supportable): Likewise.
6120 (vect_enhance_data_refs_alignment): Likewise.
6121 (vect_supportable_dr_alignment): Commonize read/write case.
6122 * tree-vect-stmts.c (vect_get_store_cost): Use
6123 dr_alignment_support, not int, for the vect_supportable_dr_alignment
6125 (vect_get_load_cost): Likewise.
6127 2021-10-18 Siddhesh Poyarekar <siddhesh@gotplt.org>
6129 * tree-object-size.c (object_sizes_execute): Consolidate LHS
6130 null check and do it early.
6132 2021-10-18 Richard Biener <rguenther@suse.de>
6134 * tree-vect-stmts.c (vectorizable_store): Use the
6135 computed alignment scheme instead of querying
6138 2021-10-18 Richard Biener <rguenther@suse.de>
6140 * tree-vect-stmts.c (vectorizable_store): Do not recompute
6141 alignment scheme already determined by get_load_store_type.
6143 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
6145 * tree-ssa-threadbackward.c (class pass_thread_jumps_full):
6146 Clone corresponding pass.
6148 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
6150 * combine.c (recog_for_combine): For an unrecognized move/set of
6151 a constant, try force_const_mem to place it in the constant pool.
6152 * cse.c (constant_pool_entries_cost, constant_pool_entries_regcost):
6153 Delete global variables (that are no longer assigned a cost value).
6154 (cse_insn): Simplify logic for deciding whether to place a folded
6155 constant in the constant pool using force_const_mem.
6156 (cse_main): Remove zero initialization of constant_pool_entries_cost
6157 and constant_pool_entries_regcost.
6158 * config/i386/i386.c (ix86_rtx_costs): Make memory accesses
6159 fractionally more expensive, when optimizing for speed.
6161 2021-10-18 Martin Liska <mliska@suse.cz>
6163 PR gcov-profile/102746
6164 PR gcov-profile/102747
6165 * gcov.c (main): Return return_code.
6166 (output_gcov_file): Mark return_code when error happens.
6167 (generate_results): Likewise.
6168 (read_graph_file): Likewise.
6169 (read_count_file): Likewise.
6171 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
6173 * config/bfin/bfin.md (define_constants): Remove UNSPEC_ONES.
6174 (define_insn "ones"): Replace UNSPEC_ONES with a truncate of
6175 a popcount, allowing compile-time evaluation/simplification.
6176 (popcountsi2, popcounthi2): New expanders using a "ones" insn.
6178 2021-10-18 Richard Biener <rguenther@suse.de>
6180 PR tree-optimization/102788
6181 * tree-vect-patterns.c (vect_init_pattern_stmt): Allow
6183 (vect_pattern_recog_1): Likewise.
6184 (vect_recog_bool_pattern): Continue matching the pattern
6185 even if we do not have a vector type for a conversion
6188 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
6190 * simplify-rtx.c (simplify_const_unary_operation) [SS_NEG, SS_ABS]:
6191 Evalute SS_NEG and SS_ABS of a constant argument.
6193 2021-10-18 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
6196 * gimple-match-head.c (try_conditional_simplification): Add case for single operand.
6197 * internal-fn.def: Add entry for COND_NEG internal function.
6198 * internal-fn.c (FOR_EACH_CODE_MAPPING): Add entry for
6199 NEGATE_EXPR, COND_NEG mapping.
6200 * optabs.def: Add entry for cond_neg_optab.
6201 * match.pd (UNCOND_UNARY, COND_UNARY): New operator lists.
6202 (vec_cond COND (foo A) B) -> (IFN_COND_FOO COND A B): New pattern.
6203 (vec_cond COND B (foo A)) -> (IFN_COND_FOO ~COND A B): Likewise.
6205 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
6207 * tree-ssa-strlen.c (class strlen_pass): Rename from
6209 (handle_builtin_stxncpy_strncat): Move to strlen_pass.
6210 (handle_assign): Same.
6211 (adjust_last_stmt): Same.
6212 (maybe_warn_overflow): Same.
6213 (handle_builtin_strlen): Same.
6214 (handle_builtin_strchr): Same.
6215 (handle_builtin_strcpy): Same.
6216 (handle_builtin_strncat): Same.
6217 (handle_builtin_stxncpy_strncat): Same.
6218 (handle_builtin_memcpy): Same.
6219 (handle_builtin_strcat): Same.
6220 (handle_alloc_call): Same.
6221 (handle_builtin_memset): Same.
6222 (handle_builtin_memcmp): Same.
6223 (get_len_or_size): Same.
6224 (strxcmp_eqz_result): Same.
6225 (handle_builtin_string_cmp): Same.
6226 (handle_pointer_plus): Same.
6227 (count_nonzero_bytes_addr): Same.
6228 (count_nonzero_bytes): Same.
6229 (handle_store): Same.
6230 (strlen_check_and_optimize_call): Same.
6231 (handle_integral_assign): Same.
6232 (check_and_optimize_stmt): Same.
6233 (printf_strlen_execute): Rename strlen_dom_walker to strlen_pass.
6235 2021-10-18 Richard Biener <rguenther@suse.de>
6237 PR tree-optimization/102798
6238 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
6239 Only copy points-to info to newly generated SSA names.
6241 2021-10-18 Martin Liska <mliska@suse.cz>
6243 * dbgcnt.c (dbg_cnt_process_opt): Remove unused but set variable.
6244 * gcov.c (get_cycles_count): Likewise.
6245 * lto-compress.c (lto_compression_zlib): Likewise.
6246 (lto_uncompression_zlib): Likewise.
6247 * targhooks.c (default_pch_valid_p): Likewise.
6249 2021-10-17 Aldy Hernandez <aldyh@redhat.com>
6251 * tree-pass.h (make_pass_thread_jumps_full): New.
6252 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Inline.
6253 (try_thread_blocks): Add resolve and speed arguments.
6254 (pass_thread_jumps::execute): Inline.
6255 (do_early_thread_jumps): New.
6256 (do_thread_jumps): New.
6257 (make_pass_thread_jumps): Move.
6258 (pass_early_thread_jumps::gate): Inline.
6259 (pass_early_thread_jumps::execute): Inline.
6260 (class pass_thread_jumps_full): New.
6262 2021-10-16 Piotr Kubaj <pkubaj@FreeBSD.org>
6264 * configure.ac: Treat powerpc64*-*-freebsd* the same as
6265 powerpc64-*-freebsd*.
6266 * configure: Regenerate.
6268 2021-10-16 H.J. Lu <hjl.tools@gmail.com>
6270 * value-query.cc (get_ssa_name_ptr_info_nonnull): Change
6271 set_ptr_nonull to set_ptr_nonnull in comments.
6273 2021-10-16 Jan Hubicka <hubicka@ucw.cz>
6275 PR tree-optimization/102720
6276 * tree-ssa-structalias.c (compute_points_to_sets): Fix producing
6277 of call used and clobbered sets.
6279 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
6281 * config/darwin.h (LINK_COMMAND_SPEC_A): Update 'r' handling to
6282 skip gomp and itm when r or nodefaultlibs is given.
6283 (DSYMUTIL_SPEC): Do not call dsymutil for '-r' link lines.
6284 Update ordering of exclusions, remove duplicate 'v' addition
6285 (collect2 will add this from the main command line).
6287 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
6289 * config/darwin-driver.c (darwin_driver_init): Revise comments, handle
6290 filelist and framework options in specs instead of code.
6291 * config/darwin.h (SUBTARGET_DRIVER_SELF_SPECS): Update to handle link
6292 specs that are really driver ones.
6293 (DARWIN_CC1_SPEC): Likewise.
6294 (CPP_SPEC): Likewise.
6295 (SYSROOT_SPEC): Append space.
6296 (LINK_SYSROOT_SPEC): Remove most driver link specs.
6297 (STANDARD_STARTFILE_PREFIX_2): Update link-related specs.
6298 (STARTFILE_SPEC): Likewise.
6299 (ASM_MMACOSX_VERSION_MIN_SPEC): Fix line wrap.
6300 (ASM_SPEC): Update driver-related specs.
6301 (ASM_FINAL_SPEC): Likewise.
6302 * config/darwin.opt: Remove now unused option aliases.
6303 * config/i386/darwin.h (EXTRA_ASM_OPTS): Ensure space after opt.
6304 (ASM_SPEC): Update driver-related specs.
6306 2021-10-15 Roger Sayle <roger@nextmovesoftware.com>
6308 * config/i386/i386.c (ix86_hardreg_mov_ok): For vector modes,
6309 allow standard_sse_constant_p immediate constants.
6311 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
6313 * config.gcc: Add tm-dwarf2.h to tm_d-file.
6315 2021-10-15 Andrew MacLeod <amacleod@redhat.com>
6317 * gimple-range-fold.h (gimple_range_ssa_p): Don't process names
6318 that occur in abnormal phis.
6319 * gimple-range.cc (gimple_ranger::range_on_edge): Return false for
6320 abnormal and EH edges.
6321 * gimple-ssa-evrp.c (rvrp_folder::value_of_expr): Ditto.
6322 (rvrp_folder::value_on_edge): Ditto.
6323 (rvrp_folder::value_of_stmt): Ditto.
6324 (hybrid_folder::value_of_expr): Ditto for ranger queries.
6325 (hybrid_folder::value_on_edge): Ditto.
6326 (hybrid_folder::value_of_stmt): Ditto.
6327 * value-query.cc (gimple_range_global): Always return a range if
6328 the type is supported.
6330 2021-10-15 John David Anglin <danglin@gcc.gnu.org>
6332 * config/pa/pa.md: Consistently use "rG" constraint for copy
6333 instruction in move patterns.
6335 2021-10-15 Andrew Stubbs <ams@codesourcery.com>
6337 * config/gcn/gcn-hsa.h (S_FIJI): Set unconditionally.
6340 * config/gcn/gcn.c: Hard code SRAM ECC settings for old architectures.
6341 * config/gcn/mkoffload.c (ELFABIVERSION_AMDGPU_HSA): Rename to ...
6342 (ELFABIVERSION_AMDGPU_HSA_V3): ... this.
6343 (ELFABIVERSION_AMDGPU_HSA_V4): New.
6344 (SET_SRAM_ECC_UNSUPPORTED): New.
6345 (copy_early_debug_info): Create elf flags to match the other objects.
6346 (main): Just let the attribute flags pass through.
6348 2021-10-15 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
6350 * tree-loop-distribution.c (reduction_var_overflows_first):
6351 Pass the type of reduction_var as first argument as it is also
6352 done for the load type.
6353 (loop_distribution::transform_reduction_loop): Add missing
6354 TREE_TYPE while determining precission of reduction_var.
6356 2021-10-15 Richard Biener <rguenther@suse.de>
6358 * defaults.h (PREFERRED_DEBUGGING_TYPE): Choose DWARF2_DEBUG
6360 * toplev.c (process_options): Warn when STABS debugging is
6361 enabled but not the preferred format.
6362 * config/pa/som.h (PREFERRED_DEBUGGING_TYPE): Define to
6364 * config/pdp11/pdp11.h (PREFERRED_DEBUGGING_TYPE): Likewise.
6366 2021-10-15 Richard Biener <rguenther@suse.de>
6369 * tree-inline.c (copy_bb): Avoid underflowing nargs.
6371 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
6373 * config/i386/i386-expand.c (ix86_expand_vec_perm): Convert
6374 HFmode input operand to HImode.
6375 (ix86_vectorize_vec_perm_const): Likewise.
6376 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1_hf):
6378 (*avx512f_permvar_truncv8siv8hi_1_hf):
6381 2021-10-15 Richard Biener <rguenther@suse.de>
6383 PR middle-end/102682
6384 * expmed.c (store_bit_field_1): Ensure a LHS subreg would
6385 not create a paradoxical subreg.
6387 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
6389 * config/i386/i386-expand.c (ix86_expand_vector_init):
6390 For half_vector concat for HFmode, handle them like HImode.
6392 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
6394 * tree-ssa-threadbackward.c (class back_threader): Add m_resolve.
6395 (back_threader::back_threader): Same.
6396 (back_threader::resolve_phi): Try to solve without looking back if
6398 (back_threader::find_paths_to_names): Same.
6399 (try_thread_blocks): Pass resolve argument to back threader.
6400 (pass_early_thread_jumps::execute): Same.
6402 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
6404 * doc/invoke.texi: Remove max-fsm-thread-length,
6405 max-fsm-thread-paths, and fsm-maximum-phi-arguments.
6407 * tree-ssa-threadbackward.c (back_threader::back_threader): Remove
6409 (back_threader_registry::back_threader_registry): Same.
6410 (back_threader_profitability::profitable_path_p): Remove
6411 param_max_fsm_thread-length.
6412 (back_threader_registry::register_path): Remove
6413 m_max_allowable_paths.
6415 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
6417 * tree-ssa-threadbackward.c (class back_threader): Make m_imports
6419 (back_threader::~back_threader): Do not release m_path.
6421 2021-10-14 Andrew MacLeod <amacleod@redhat.com>
6423 PR tree-optimization/102738
6424 * vr-values.c (simplify_using_ranges::simplify): Handle RSHIFT_EXPR.
6426 2021-10-14 Kwok Cheung Yeung <kcy@codesourcery.com>
6428 * omp-general.c (omp_check_context_selector): Move from c-omp.c.
6429 (omp_mark_declare_variant): Move from c-omp.c.
6430 (omp_context_name_list_prop): Update for Fortran strings.
6431 * omp-general.h (omp_check_context_selector): New prototype.
6432 (omp_mark_declare_variant): New prototype.
6434 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
6436 * config/rs6000/rs6000.c (rs6000_density_test): Move early
6437 exit test further up the function.
6439 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
6441 * config/arm/arm.c (arm_add_stmt_cost): Delete.
6442 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
6444 2021-10-14 Martin Jambor <mjambor@suse.cz>
6446 * doc/invoke.texi (Optimize Options): Add entry for
6447 ipa-cp-recursive-freq-factor.
6449 2021-10-14 Tamar Christina <tamar.christina@arm.com>
6451 * match.pd: New rule.
6453 2021-10-14 Jan Hubicka <hubicka@ucw.cz>
6456 * ipa-modref-tree.h (modref_access_node::update2):
6457 Also check that parm_offset is unchanged.
6458 (modref_ref_node::insert_access): Fix updating of
6461 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
6463 * tree-ssa-threadbackward.c (back_threader::resolve_phi): Add
6466 2021-10-14 Richard Biener <rguenther@suse.de>
6468 PR tree-optimization/102659
6469 * tree-if-conv.c (if_convertible_gimple_assign_stmt_p): Also
6470 rewrite pointer typed undefined overflow operations.
6471 (predicate_statements): Likewise. Make sure to emit invariant
6472 conversions in the preheader.
6473 * tree-vectorizer.c (vect_loop_vectorized_call): Look through
6474 non-empty preheaders.
6475 * tree-data-ref.c (dr_analyze_indices): Strip useless
6476 conversions to the MEM_REF base type.
6478 2021-10-14 Martin Liska <mliska@suse.cz>
6480 * common.opt: Stop using AUTODETECT_VALUE
6481 and use EnabledBy where possible.
6482 * opts.c: Enable OPT_fvar_tracking with optimize >= 1.
6483 * toplev.c (AUTODETECT_VALUE): Remove macro.
6484 (process_options): Simplify by using EnabledBy and
6485 OPT_fvar_tracking. Use OPTION_SET_P macro instead of
6488 2021-10-14 Jonathan Wright <jonathan.wright@arm.com>
6490 * config/aarch64/arm_neon.h (vld1_s8_x3): Use signed type for
6492 (vld1_s32_x3): Likewise.
6494 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
6496 PR tree-optimization/102736
6497 PR tree-optimization/102736
6498 * gimple-range-path.cc (path_range_query::range_on_path_entry):
6499 Assert that the requested range is defined outside the path.
6500 (path_range_query::ssa_range_in_phi): Do not call
6501 range_on_path_entry for SSA names that are defined within the
6504 2021-10-14 Iain Sandoe <iain@sandoe.co.uk>
6506 * config/darwin-driver.c (darwin_find_version_from_kernel):
6507 Quote internal identifiers and avoid contractions in
6509 (darwin_default_min_version): Likewise.
6510 (darwin_driver_init): Likewise.
6512 2021-10-14 Martin Jambor <mjambor@suse.cz>
6514 * params.opt (ipa-cp-recursive-freq-factor): New.
6515 * ipa-cp.c (ipcp_value): Switch to inline initialization. New members
6516 scc_no, self_recursion_generated_level, same_scc and
6517 self_recursion_generated_p.
6518 (ipcp_lattice::add_value): Replaced parameter unlimited with
6519 same_lat_gen_level, usit it determine limit of values and store it to
6521 (ipcp_lattice<valtype>::print): Dump the new fileds.
6522 (allocate_and_init_ipcp_value): Take same_lat_gen_level as a new
6523 parameter and store it to the new value.
6524 (self_recursively_generated_p): Removed.
6525 (propagate_vals_across_arith_jfunc): Use self_recursion_generated_p
6526 instead of self_recursively_generated_p, store self generation level
6528 (value_topo_info<valtype>::add_val): Set scc_no.
6529 (value_topo_info<valtype>::propagate_effects): Multiply frequencies of
6530 recursively feeding values and self generated values by appropriate
6533 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
6535 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Remove
6536 redundant test for flag_vect_cost_model.
6538 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
6540 * bitmap.c (debug): New overloaded function for auto_bitmaps.
6541 * bitmap.h (debug): Same.
6543 2021-10-14 Hongyu Wang <hongyu.wang@intel.com>
6545 * config/i386/avx512fp16intrin.h (_mm512_mask_fcmadd_pch):
6546 Adjust builtin call.
6547 (_mm512_mask3_fcmadd_pch): Likewise.
6548 (_mm512_mask_fmadd_pch): Likewise
6549 (_mm512_mask3_fmadd_pch): Likewise
6550 (_mm512_mask_fcmadd_round_pch): Likewise
6551 (_mm512_mask3_fcmadd_round_pch): Likewise
6552 (_mm512_mask_fmadd_round_pch): Likewise
6553 (_mm512_mask3_fmadd_round_pch): Likewise
6554 (_mm_mask_fcmadd_sch): Likewise
6555 (_mm_mask3_fcmadd_sch): Likewise
6556 (_mm_mask_fmadd_sch): Likewise
6557 (_mm_mask3_fmadd_sch): Likewise
6558 (_mm_mask_fcmadd_round_sch): Likewise
6559 (_mm_mask3_fcmadd_round_sch): Likewise
6560 (_mm_mask_fmadd_round_sch): Likewise
6561 (_mm_mask3_fmadd_round_sch): Likewise
6562 (_mm_fcmadd_round_sch): Likewise
6563 * config/i386/avx512fp16vlintrin.h (_mm_mask_fmadd_pch):
6564 Adjust builtin call.
6565 (_mm_mask3_fmadd_pch): Likewise
6566 (_mm256_mask_fmadd_pch): Likewise
6567 (_mm256_mask3_fmadd_pch): Likewise
6568 (_mm_mask_fcmadd_pch): Likewise
6569 (_mm_mask3_fcmadd_pch): Likewise
6570 (_mm256_mask_fcmadd_pch): Likewise
6571 (_mm256_mask3_fcmadd_pch): Likewise
6572 * config/i386/i386-builtin.def: Add mask3 builtin for complex
6573 fma, and adjust mask_builtin to corresponding expander.
6574 * config/i386/i386-expand.c (ix86_expand_round_builtin):
6575 Skip eraseing embedded rounding for expanders that emits
6577 * config/i386/sse.md (complexmove): New mode_attr.
6578 (<avx512>_fmaddc_<mode>_mask1<round_expand_name>): New expander.
6579 (<avx512>_fcmaddc_<mode>_mask1<round_expand_name>): Likewise.
6580 (avx512fp16_fmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
6581 (avx512fp16_fcmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
6582 (avx512fp16_fcmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
6583 (avx512fp16_fmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
6584 * config/i386/subst.md (round_embedded_complex): New subst.
6586 2021-10-14 John David Anglin <danglin@gcc.gnu.org>
6588 * config/pa/pa.md (cbranchsf4): Disable if TARGET_SOFT_FLOAT.
6589 (cbranchdf4): Likewise.
6590 Add missing move patterns for TARGET_SOFT_FLOAT.
6592 2021-10-13 Roger Sayle <roger@nextmovesoftware.com>
6594 * config/i386/i386-expand.c (ix86_expand_vector_move): Use a
6595 pseudo intermediate when moving a SUBREG into a hard register,
6596 by checking ix86_hardreg_mov_ok.
6597 (ix86_expand_vector_extract): Store zero-extended SImode
6598 intermediate in a pseudo, then set target using a SUBREG_PROMOTED
6600 * config/i386/sse.md (mov<VMOVE>_internal): Prevent CSE creating
6601 complex (SUBREG) sets of (vector) hard registers before reload, by
6602 checking ix86_hardreg_mov_ok.
6604 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
6606 * ctfc.h (enum ctf_dtu_d_union_enum): Remove redundant comma.
6608 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
6610 * dwarf2ctf.c (gen_ctf_array_type): Fix typo in comment.
6612 2021-10-13 Martin Sebor <msebor@redhat.com>
6614 PR middle-end/102630
6615 * pointer-query.cc (compute_objsize_r): Handle named address spaces.
6617 2021-10-13 Iain Sandoe <iain@sandoe.co.uk>
6619 * collect2.c (is_lto_object_file): Release simple-object
6620 resources, close files.
6622 2021-10-13 Andre Vieira <andre.simoesdiasvieira@arm.com>
6624 * config/arm/arm.c (thumb2_legitimate_address_p): Use VALID_MVE_MODE
6625 when checking mve addressing modes.
6626 (mve_vector_mem_operand): Fix the way we handle pre, post and offset
6628 (arm_print_operand): Fix printing of POST_ and PRE_MODIFY.
6629 * config/arm/mve.md: Use mve_memory_operand predicate everywhere where
6630 there is a single Ux constraint.
6632 2021-10-13 John David Anglin <danglin@gcc.gnu.org>
6634 * config/pa/pa.md (muldi3): Add support for inlining 64-bit
6635 multiplication on 32-bit PA 1.1 and 2.0 targets.
6637 2021-10-13 Martin Liska <mliska@suse.cz>
6639 PR gcov-profile/90364
6640 * coverage.c (build_info): Emit checksum to the global variable.
6641 (build_info_type): Add new field for checksum.
6642 (coverage_obj_finish): Pass object_checksum.
6643 (coverage_init): Use 0 as checksum for .gcno files.
6644 * gcov-dump.c (dump_gcov_file): Dump also new checksum field.
6645 * gcov.c (read_graph_file): Read also checksum.
6646 * doc/invoke.texi: Document the behaviour change.
6648 2021-10-13 Richard Biener <rguenther@suse.de>
6650 * gimple-iterator.h (gsi_iterator_update): Add GSI_LAST_NEW_STMT,
6651 start at integer value 2.
6652 * gimple-iterator.c (gsi_insert_seq_nodes_before): Update
6653 the iterator for GSI_LAST_NEW_STMT.
6654 (gsi_insert_seq_nodes_after): Likewise.
6655 * tree-if-conv.c (predicate_statements): Use GSI_LAST_NEW_STMT.
6656 * tree-ssa.c (execute_update_addresses_taken): Correct bogus
6657 arguments to gsi_replace.
6659 2021-10-13 Martin Liska <mliska@suse.cz>
6662 * common.opt: Use EnabledBy instead of detection in
6663 finish_options and process_options.
6664 * opts.c (finish_options): Remove handling of
6665 x_flag_unroll_all_loops.
6666 * toplev.c (process_options): Likewise for flag_web and
6667 flag_rename_registers.
6669 2021-10-13 Richard Biener <rguenther@suse.de>
6671 PR tree-optimization/102659
6672 * tree-if-conv.c (need_to_rewrite_undefined): New flag.
6673 (if_convertible_gimple_assign_stmt_p): Mark the loop for
6674 rewrite when stmts with undefined behavior on integer
6676 (combine_blocks): Predicate also when we need to rewrite stmts.
6677 (predicate_statements): Rewrite affected stmts to something
6678 with well-defined behavior on overflow.
6679 (tree_if_conversion): Initialize need_to_rewrite_undefined.
6681 2021-10-13 Richard Biener <rguenther@suse.de>
6684 * ipa-sra.c (ptr_parm_has_nonarg_uses): Fix volatileness
6687 2021-10-13 Jose E. Marchesi <jose.marchesi@oracle.com>
6689 * dwarf2ctf.c: Fix typo in comment.
6691 2021-10-12 Bill Schmidt <wschmidt@linux.ibm.com>
6694 * config/rs6000/altivec.h (vec_cpsgn): Swap operand order.
6695 * config/rs6000/rs6000-overload.def (VEC_COPYSIGN): Use SKIP to
6696 avoid generating an automatic #define of vec_cpsgn. Use the
6697 correct built-in for V4SFmode that doesn't depend on VSX.
6699 2021-10-12 Uroš Bizjak <ubizjak@gmail.com>
6703 * config/i386/i386.md (*add<mode>_1_slp): Rewrite as
6704 define_insn_and_split pattern. Add alternative 1 and split it
6705 post reload to insert operand 1 into the low part of operand 0.
6706 (*sub<mode>_1_slp): Ditto.
6707 (*and<mode>_1_slp): Ditto.
6708 (*<any_or:code><mode>_1_slp): Ditto.
6709 (*ashl<mode>3_1_slp): Ditto.
6710 (*<any_shiftrt:insn><mode>3_1_slp): Ditto.
6711 (*<any_rotate:insn><mode>3_1_slp): Ditto.
6712 (*neg<mode>_1_slp): New insn_and_split pattern.
6713 (*one_cmpl<mode>_1_slp): Ditto.
6715 2021-10-12 David Edelsohn <dje.gcc@gmail.com>
6717 * doc/install.texi: Update MinGW and mingw-64 Binaries
6720 2021-10-12 Daniel Le Duc Khoi Nguyen <greenrecyclebin@gmail.com>
6722 * doc/extend.texi (Common Variable Attributes): Fix typos in
6723 alloc_size documentation.
6725 2021-10-12 Richard Biener <rguenther@suse.de>
6727 PR tree-optimization/102696
6728 * tree-vect-slp.c (vect_build_slp_tree_2): Properly mark
6729 the tree fatally failed when we reject a BIT_FIELD_REF.
6731 2021-10-12 Richard Biener <rguenther@suse.de>
6733 PR tree-optimization/102572
6734 * tree-vect-stmts.c (vect_build_gather_load_calls): When
6735 gathering the vectorized defs for the mask pass in the
6736 desired mask vector type so invariants will be handled
6739 2021-10-12 Tamar Christina <tamar.christina@arm.com>
6741 * config/aarch64/aarch64-sve.md (*fcm<cmp_op><mode>_bic_combine,
6742 *fcm<cmp_op><mode>_nor_combine, *fcmuo<mode>_bic_combine,
6743 *fcmuo<mode>_nor_combine): New.
6745 2021-10-12 Eric Botcazou <ebotcazou@adacore.com>
6748 * config/sparc/sparc-modes.def (OI): New integer mode.
6750 2021-10-12 Jakub Jelinek <jakub@redhat.com>
6752 * gimple-fold.h (clear_padding_type_may_have_padding_p): Declare.
6753 * gimple-fold.c (clear_padding_type_may_have_padding_p): No longer
6756 2021-10-12 Jakub Jelinek <jakub@redhat.com>
6758 * tree-vectorizer.h (loop_cost_model): New function.
6759 (unlimited_cost_model): Use it.
6760 * tree-vect-loop.c (vect_analyze_loop_costing): Use loop_cost_model
6761 call instead of flag_vect_cost_model.
6762 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): Likewise.
6763 (vect_prune_runtime_alias_test_list): Likewise. Also use it instead
6764 of flag_simd_cost_model.
6766 2021-10-12 liuhongt <hongtao.liu@intel.com>
6769 * config/i386/i386-expand.c (emit_reduc_half): Handle
6771 * config/i386/mmx.md (reduc_<code>_scal_v4qi): New expander.
6772 (reduc_plus_scal_v4qi): Ditto.
6774 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
6776 * config/rs6000/smmintrin.h (_mm_cmpeq_epi64, _mm_cmpgt_epi64,
6777 _mm_mullo_epi32, _mm_mul_epi32, _mm_packus_epi32): New.
6778 * config/rs6000/nmmintrin.h: Copy from i386, tweak to suit.
6780 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
6782 * config/rs6000/smmintrin.h (_mm_cvtepi8_epi16, _mm_cvtepi8_epi32,
6783 _mm_cvtepi8_epi64, _mm_cvtepi16_epi32, _mm_cvtepi16_epi64,
6784 _mm_cvtepi32_epi64, _mm_cvtepu8_epi16, _mm_cvtepu8_epi32,
6785 _mm_cvtepu8_epi64, _mm_cvtepu16_epi32, _mm_cvtepu16_epi64,
6786 _mm_cvtepu32_epi64): New.
6788 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
6790 * config/rs6000/smmintrin.h (_mm_test_all_zeros,
6791 _mm_test_all_ones, _mm_test_mix_ones_zeros): Rewrite as macro.
6793 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
6795 * config/rs6000/smmintrin.h (_mm_min_epi8, _mm_min_epu16,
6796 _mm_min_epi32, _mm_min_epu32, _mm_max_epi8, _mm_max_epu16,
6797 _mm_max_epi32, _mm_max_epu32): New.
6799 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
6801 * ipa-modref-tree.h (struct modref_access_node): Revert
6803 (struct modref_ref_node): Likewise.
6805 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
6807 * ipa-modref-tree.h (modref_tree::global_access_p): New member
6810 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
6811 ignore_stores_eaf_flags): Move to ipa-modref.h
6812 (remove_useless_eaf_flags): Remove early exit on NOCLOBBER.
6813 (modref_summary::global_memory_read_p): New member function.
6814 (modref_summary::global_memory_written_p): New member function.
6815 * ipa-modref.h (modref_summary::global_memory_read_p,
6816 modref_summary::global_memory_written_p): Declare.
6817 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
6818 ignore_stores_eaf_flags): move here.
6819 * tree-ssa-structalias.c: Include ipa-modref-tree.h, ipa-modref.h
6821 (handle_rhs_call): Rewrite.
6822 (handle_call_arg): New function.
6823 (determine_global_memory_access): New function.
6824 (handle_const_call): Remove
6825 (handle_pure_call): Remove
6826 (find_func_aliases_for_call): Update use of handle_rhs_call.
6827 (compute_points_to_sets): Handle global memory acccesses
6830 2021-10-11 Diane Meirowitz <diane.meirowitz@oracle.com>
6832 * doc/invoke.texi: Add link to UndefinedBehaviorSanitizer
6833 documentation, mention UBSAN_OPTIONS, similar to what is done
6834 for AddressSanitizer.
6836 2021-10-11 Richard Biener <rguenther@suse.de>
6838 PR middle-end/102683
6839 * internal-fn.c (expand_DEFERRED_INIT): Check for mode
6840 availability before building an integer type for storage
6843 2021-10-11 Richard Biener <rguenther@suse.de>
6845 PR middle-end/101480
6846 * gimple.c (gimple_call_fnspec): Do not mark operator new/delete
6849 2021-10-11 Martin Liska <mliska@suse.cz>
6851 * common.opt: Remove Init(2) for some options.
6852 * toplev.c (process_options): Do not use AUTODETECT_VALUE, but
6853 use rather OPTION_SET_P.
6855 2021-10-11 Martin Liska <mliska@suse.cz>
6857 * common.opt: Remove usage of IRA_REGION_AUTODETECT.
6858 * flag-types.h (enum ira_region): Likewise.
6859 * toplev.c (process_options): Use OPTION_SET_P instead of
6860 IRA_REGION_AUTODETECT.
6862 2021-10-11 Jakub Jelinek <jakub@redhat.com>
6864 * omp-low.c (omp_runtime_api_call): Handle omp_get_max_teams,
6865 omp_[sg]et_teams_thread_limit and omp_set_num_teams.
6867 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
6869 * config/s390/s390-protos.h (s390_rawmemchr): Add prototype.
6870 * config/s390/s390.c (s390_rawmemchr): New function.
6871 * config/s390/s390.md (rawmemchr<SINT:mode>): New expander.
6872 * config/s390/vector.md (@vec_vfees<mode>): Basically a copy of
6873 the pattern vfees<mode> from vx-builtins.md.
6874 * config/s390/vx-builtins.md (*vfees<mode>): Remove.
6876 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
6878 * builtins.c (get_memory_rtx): Change to external linkage.
6879 * builtins.h (get_memory_rtx): Add function prototype.
6880 * doc/md.texi (rawmemchr<mode>): Document.
6881 * internal-fn.c (expand_RAWMEMCHR): Define.
6882 * internal-fn.def (RAWMEMCHR): Add.
6883 * optabs.def (rawmemchr_optab): Add.
6884 * tree-loop-distribution.c (find_single_drs): Change return code
6885 behaviour by also returning true if no single store was found
6887 (loop_distribution::classify_partition): Respect the new return
6888 code behaviour of function find_single_drs.
6889 (loop_distribution::execute): Call new function
6890 transform_reduction_loop in order to replace rawmemchr or strlen
6891 like loops by calls into builtins.
6892 (generate_reduction_builtin_1): New function.
6893 (generate_rawmemchr_builtin): New function.
6894 (generate_strlen_builtin_1): New function.
6895 (generate_strlen_builtin): New function.
6896 (generate_strlen_builtin_using_rawmemchr): New function.
6897 (reduction_var_overflows_first): New function.
6898 (determine_reduction_stmt_1): New function.
6899 (determine_reduction_stmt): New function.
6900 (loop_distribution::transform_reduction_loop): New function.
6902 2021-10-11 Martin Liska <mliska@suse.cz>
6904 * tree.c (cl_option_hasher::hash): Use cl_optimization_hash
6905 and remove legacy hashing code.
6907 2021-10-11 Kito Cheng <kito.cheng@sifive.com>
6910 * builtins.c (maybe_emit_call_builtin___clear_cache): Allow
6911 CONST_INT for BEGIN and END, and use gcc_assert rather than
6914 2021-10-10 Jakub Jelinek <jakub@redhat.com>
6917 * var-tracking.c (add_stores): For cselib_sp_derived_value_p values
6918 use MO_VAL_SET if loc is not sp.
6920 2021-10-10 Andrew Pinski <apinski@marvell.com>
6922 PR tree-optimization/102622
6923 * match.pd: Swap the order of a?pow2cst:0 and a?-1:0 transformations.
6924 Swap the order of a?0:pow2cst and a?0:-1 transformations.
6926 2021-10-09 liuhongt <hongtao.liu@intel.com>
6929 * config/i386/i386-expand.c (ix86_valid_mask_cmp_mode): Handle
6931 (ix86_use_mask_cmp_p): Ditto.
6932 (ix86_expand_sse_movcc): Ditto.
6933 * config/i386/i386.md (setcc_hf_mask): New define_insn.
6934 (movhf_mask): Ditto.
6935 (UNSPEC_MOVCC_MASK): New unspec.
6936 * config/i386/sse.md (UNSPEC_PCMP): Move to i386.md.
6938 2021-10-08 Vladimir N. Makarov <vmakarov@redhat.com>
6940 PR rtl-optimization/102627
6941 * lra-constraints.c (split_reg): Use at least natural mode of hard reg.
6943 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
6945 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Grow
6948 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
6950 * value-range.cc (irange::debug): New.
6951 * value-range.h (irange::debug): New.
6953 2021-10-08 Richard Sandiford <richard.sandiford@arm.com>
6955 PR tree-optimization/102385
6956 * predict.h (change_edge_frequency): Declare.
6957 * predict.c (change_edge_frequency): New function.
6958 * tree-ssa-loop-manip.h (tree_transform_and_unroll_loop): Remove
6960 (tree_unroll_loop): Likewise.
6961 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Update accordingly.
6962 * tree-predcom.c (pcom_worker::tree_predictive_commoning_loop):
6964 * tree-ssa-loop-prefetch.c (loop_prefetch_arrays): Likewise.
6965 * tree-ssa-loop-manip.c (tree_unroll_loop): Likewise.
6966 (tree_transform_and_unroll_loop): Likewise. Use single_dom_exit
6967 to retrieve the exit edges. Make all the old profile update code
6968 conditional on !single_loop_p -- the case it was written for --
6969 and use a different approach for the single-loop case.
6971 2021-10-08 Martin Liska <mliska@suse.cz>
6973 * config/alpha/alpha.c (alpha_option_override): Use new macro
6975 * config/arc/arc.c (arc_override_options): Likewise.
6976 * config/arm/arm.c (arm_option_override): Likewise.
6977 * config/bfin/bfin.c (bfin_load_pic_reg): Likewise.
6978 * config/c6x/c6x.c (c6x_option_override): Likewise.
6979 * config/csky/csky.c: Likewise.
6980 * config/darwin.c (darwin_override_options): Likewise.
6981 * config/frv/frv.c (frv_option_override): Likewise.
6982 * config/i386/djgpp.h: Likewise.
6983 * config/i386/i386.c (ix86_stack_protect_guard): Likewise.
6984 (ix86_max_noce_ifcvt_seq_cost): Likewise.
6985 * config/ia64/ia64.c (ia64_option_override): Likewise.
6986 (ia64_override_options_after_change): Likewise.
6987 * config/m32c/m32c.c (m32c_option_override): Likewise.
6988 * config/m32r/m32r.c (m32r_init): Likewise.
6989 * config/m68k/m68k.c (m68k_option_override): Likewise.
6990 * config/microblaze/microblaze.c (microblaze_option_override): Likewise.
6991 * config/mips/mips.c (mips_option_override): Likewise.
6992 * config/nios2/nios2.c (nios2_option_override): Likewise.
6993 * config/nvptx/nvptx.c (nvptx_option_override): Likewise.
6994 * config/pa/pa.c (pa_option_override): Likewise.
6995 * config/riscv/riscv.c (riscv_option_override): Likewise.
6996 * config/rs6000/aix71.h: Likewise.
6997 * config/rs6000/aix72.h: Likewise.
6998 * config/rs6000/aix73.h: Likewise.
6999 * config/rs6000/rs6000.c (darwin_rs6000_override_options): Likewise.
7000 (rs6000_override_options_after_change): Likewise.
7001 (rs6000_linux64_override_options): Likewise.
7002 (glibc_supports_ieee_128bit): Likewise.
7003 (rs6000_option_override_internal): Likewise.
7004 (rs6000_file_start): Likewise.
7005 (rs6000_darwin_file_start): Likewise.
7006 * config/rs6000/rtems.h: Likewise.
7007 * config/rs6000/sysv4.h: Likewise.
7008 * config/rs6000/vxworks.h (SUB3TARGET_OVERRIDE_OPTIONS): Likewise.
7009 * config/s390/s390.c (s390_option_override): Likewise.
7010 * config/sh/linux.h: Likewise.
7011 * config/sh/netbsd-elf.h (while): Likewise.
7012 * config/sh/sh.c (sh_option_override): Likewise.
7013 * config/sol2.c (solaris_override_options): Likewise.
7014 * config/sparc/sparc.c (sparc_option_override): Likewise.
7015 * config/tilegx/tilegx.c (tilegx_option_override): Likewise.
7016 * config/visium/visium.c (visium_option_override): Likewise.
7017 * config/vxworks.c (vxworks_override_options): Likewise.
7018 * lto-opts.c (lto_write_options): Likewise.
7019 * omp-expand.c (expand_omp_simd): Likewise.
7020 * omp-general.c (omp_max_vf): Likewise.
7021 * omp-offload.c (oacc_xform_loop): Likewise.
7022 * opts.h (OPTION_SET_P): Likewise.
7023 * targhooks.c (default_max_noce_ifcvt_seq_cost): Likewise.
7024 * toplev.c (process_options): Likewise.
7025 * tree-predcom.c: Likewise.
7026 * tree-sra.c (analyze_all_variable_accesses): Likewise.
7028 2021-10-08 liuhongt <hongtao.liu@intel.com>
7031 * config/i386/i386.c (ix86_optab_supported_p):
7032 Return true for HFmode.
7033 * match.pd: Simplify (_Float16) ceil ((double) x) to
7034 __builtin_ceilf16 (a) when a is _Float16 type and
7035 direct_internal_fn_supported_p.
7037 2021-10-08 liuhongt <hongtao.liu@intel.com>
7040 * config/i386/i386-expand.c (emit_reduc_half): Hanlde V4HImode.
7041 * config/i386/mmx.md (reduc_plus_scal_v4hi): New.
7042 (reduc_<code>_scal_v4hi): New.
7044 2021-10-08 liuhongt <hongtao.liu@intel.com>
7046 * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize).
7047 * doc/invoke.texi (Options That Control Optimization): Update
7049 * opts.c (default_options_table): Enable auto-vectorization at
7050 O2 with very-cheap cost model.
7051 (finish_options): Use cheap cost model for
7052 explicit -ftree{,-loop}-vectorize.
7054 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
7056 * ctfc.c (ctfc_delete_container): Free hash table contents.
7058 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
7060 * toplev.c (process_options): Do not warn for GNU GIMPLE.
7062 2021-10-07 Siddhesh Poyarekar <siddhesh@gotplt.org>
7064 * tree-object-size.c (addr_object_size,
7065 compute_builtin_object_size): Drop PDECL and POFF arguments.
7066 (addr_object_size): Adjust calls.
7067 * tree-object-size.h (compute_builtin_object_size): Drop PDECL
7070 2021-10-07 Roger Sayle <roger@nextmovesoftware.com>
7072 * rtl.def (SMUL_HIGHPART, UMUL_HIGHPART): New RTX codes for
7073 representing signed and unsigned high-part multiplication resp.
7074 * simplify-rtx.c (simplify_binary_operation_1) [SMUL_HIGHPART,
7075 UMUL_HIGHPART]: Simplify high-part multiplications by zero.
7076 [SS_PLUS, US_PLUS, SS_MINUS, US_MINUS, SS_MULT, US_MULT,
7077 SS_DIV, US_DIV]: Similar simplifications for saturating
7079 (simplify_const_binary_operation) [SS_PLUS, US_PLUS, SS_MINUS,
7080 US_MINUS, SS_MULT, US_MULT, SMUL_HIGHPART, UMUL_HIGHPART]:
7081 Implement compile-time evaluation for constant operands.
7082 * dwarf2out.c (mem_loc_descriptor): Skip SMUL_HIGHPART and
7084 * doc/rtl.texi (smul_highpart, umul_highpart): Document RTX codes.
7085 * doc/md.texi (smul@var{m}3_highpart, umul@var{m3}_highpart):
7086 Mention the new smul_highpart and umul_highpart RTX codes.
7087 * doc/invoke.texi: Silence @xref "compilation" warnings.
7089 2021-10-07 Martin Jambor <mjambor@suse.cz>
7092 * ipa-prop.c (ipa_edge_args_sum_t::duplicate): Also handle the
7093 case when the source reference description corresponds to a
7094 referance taken in a function src->caller is inlined to.
7096 2021-10-07 Jan Hubicka <hubicka@ucw.cz>
7099 * ipa-modref-tree.h (modref_access_node::contains_p): Handle offsets
7101 (modref_access_node::try_merge_with): Add sanity check that there
7102 are no redundant entries in the list.
7104 2021-10-07 Richard Biener <rguenther@suse.de>
7106 PR tree-optimization/102608
7107 * tree-ssa-sccvn.c (visit_stmt): Drop .DEFERRED_INIT to
7110 2021-10-07 Martin Liska <mliska@suse.cz>
7112 * toplev.c (toplev::main): Make
7113 save_opt_decoded_options a pointer type
7114 * toplev.h: Likewise.
7116 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
7118 * config/gcn/gcn-valu.md (gather<mode>_insn_2offsets<exec>): Apply
7119 HAVE_GCN_ASM_GLOBAL_LOAD_FIXED.
7120 (scatter<mode>_insn_2offsets<exec_scatter>): Likewise.
7122 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
7124 * config/gcn/gcn-hsa.h (SRAMOPT): Include the whole option string.
7125 Adjust for new -msram-ecc=any behaviour.
7126 (ASM_SPEC): Adjust -mxnack and -msram-ecc usage.
7127 * config/gcn/gcn.c (output_file_start): Implement -msram-ecc=any.
7128 * config/gcn/mkoffload.c (EF_AMDGPU_XNACK): Rename to ...
7129 (EF_AMDGPU_XNACK_V3): ... this.
7130 (EF_AMDGPU_SRAM_ECC): Rename to ...
7131 (EF_AMDGPU_SRAM_ECC_V3): ... this.
7132 (EF_AMDGPU_FEATURE_XNACK_V4): New.
7133 (EF_AMDGPU_FEATURE_XNACK_UNSUPPORTED_V4): New.
7134 (EF_AMDGPU_FEATURE_XNACK_ANY_V4): New.
7135 (EF_AMDGPU_FEATURE_XNACK_OFF_V4): New.
7136 (EF_AMDGPU_FEATURE_XNACK_ON_V4): New.
7137 (EF_AMDGPU_FEATURE_SRAMECC_V4): New.
7138 (EF_AMDGPU_FEATURE_SRAMECC_UNSUPPORTED_V4): New.
7139 (EF_AMDGPU_FEATURE_SRAMECC_ANY_V4): New.
7140 (EF_AMDGPU_FEATURE_SRAMECC_OFF_V4): New.
7141 (EF_AMDGPU_FEATURE_SRAMECC_ON_V4): New.
7142 (SET_XNACK_ON): New.
7143 (SET_XNACK_OFF): New.
7145 (SET_SRAM_ECC_ON): New.
7146 (SET_SRAM_ECC_ANY): New.
7147 (SET_SRAM_ECC_OFF): New.
7148 (TEST_SRAM_ECC_ANY): New.
7149 (TEST_SRAM_ECC_ON): New.
7150 (main): Implement HSACOv4 and -msram-ecc=any.
7152 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
7154 * config.in: Regenerate.
7155 * config/gcn/gcn-hsa.h (X_FIJI): New macro.
7159 (A_FIJI): Rename to ...
7161 (A_900): Rename to ...
7163 (A_906): Rename to ...
7165 (A_908): Rename to ...
7167 (SRAMOPT): New macro.
7168 (ASM_SPEC): Adjust xnack option usage.
7169 * config/gcn/gcn.c (output_file_start): Adjust amdgcn_target usage.
7170 * configure: Regenerate.
7171 * configure.ac: Detect LLVM assembler dialect.
7173 2021-10-07 Richard Biener <rguenther@suse.de>
7175 * tree-pretty-print.c (dump_generic_node): Do not elide
7176 printing '&' when dumping with -gimple.
7178 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
7180 * gimple-range-cache.cc (non_null_ref::adjust_range): Call new
7182 * gimple-range-fold.cc (adjust_pointer_diff_expr): Ditto.
7183 (adjust_imagpart_expr): Ditto.
7184 * value-range.cc (irange::irange_intersect): Call new routine if
7185 RHS is a single pair.
7186 (irange::intersect): New wide_int version.
7187 * value-range.h (class irange): New prototype.
7189 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
7191 * gimple-range-edge.cc (gimple_outgoing_range::gimple_outgoing_range):
7192 Add parameter to limit size when recognizing switches.
7193 (gimple_outgoing_range::edge_range_p): Check size limit.
7194 * gimple-range-edge.h (gimple_outgoing_range): Add size field.
7195 * gimple-range-gori.cc (gori_map::calculate_gori): Ignore switches
7196 that exceed the size limit.
7197 (gori_compute::gori_compute): Add initializer.
7198 * params.opt (evrp-switch-limit): New.
7199 * doc/invoke.texi: Update docs.
7201 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
7203 * value-range.h (irange::set_varying): Use TYPE_MIN_VALUE and
7204 TYPE_MAX_VALUE instead of creating new trees when possible.
7206 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
7208 * gimple-range-cache.cc (non_null_ref::adjust_range): Check for
7209 zero and non-zero more efficently.
7211 2021-10-06 Richard Biener <rguenther@suse.de>
7214 * dumpfile.h (TDF_GIMPLE_VAL): New.
7215 (dump_flag): Re-order and adjust TDF_* flags. Make
7216 the enum uint32_t. Use std::underlying_type in the
7218 (optgroup_flag): Likewise for the operator overloads.
7219 * tree-pretty-print.c (dump_generic_node): Wrap ADDR_EXPR
7220 in _Literal if TDF_GIMPLE_VAL.
7221 * gimple-pretty-print.c (dump_gimple_assign): Add
7222 TDF_GIMPLE_VAL to flags when dumping operands where only
7223 is_gimple_val are allowed.
7224 (dump_gimple_cond): Likewise.
7226 2021-10-06 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
7228 * gimple-isel.cc (gimple_expand_vec_cond_expr): Remove redundant if
7231 2021-10-05 qing zhao <qing.zhao@oracle.com>
7233 PR middle-end/102359
7234 * gimplify.c (gimplify_decl_expr): Not add initialization for an
7235 auto variable when it has been initialized by frontend.
7237 2021-10-05 Aldy Hernandez <aldyh@redhat.com>
7239 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
7242 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
7244 * common/config/avr/avr-common.c (avr_handle_option): Mark
7245 argument as ATTRIBUTE_UNUSED.
7247 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
7249 * config/lm32/uclinux-elf.h (LINK_GCC_C_SEQUENCE_SPEC):
7250 Undefine before redefinition.
7252 2021-10-05 Richard Biener <rguenther@suse.de>
7254 * toplev.c (no_backend): Remove global var.
7255 (process_options): Pass in no_backend, move post_options
7256 langhook call to toplev::main.
7257 (do_compile): Pass in no_backend, move process_options call
7259 (toplev::run_self_tests): Check no_backend at the caller.
7260 (toplev::main): Call post_options and process_options
7261 split out from do_compile, do self-tests only if
7262 no_backend is initialized.
7264 2021-10-05 Richard Biener <rguenther@suse.de>
7266 * tree-cfg.c (dump_function_to_file): Dump the UID of the
7267 function as part of the name when requested.
7268 * tree-pretty-print.c (dump_function_name): Dump the UID when
7269 requested and the langhook produced the actual name.
7271 2021-10-05 Richard Biener <rguenther@suse.de>
7273 PR middle-end/102587
7274 PR middle-end/102285
7275 * internal-fn.c (expand_DEFERRED_INIT): Fall back to
7276 zero-initialization as last resort, use the constant
7277 size as given by the DEFERRED_INIT argument to build
7280 2021-10-04 Marek Polacek <polacek@redhat.com>
7283 * doc/invoke.texi: Document -Warray-compare.
7285 2021-10-04 Richard Biener <rguenther@suse.de>
7287 * gimplify.c (is_var_need_auto_init): DECL_HARD_REGISTER
7288 variables are not to be initialized.
7290 2021-10-04 Richard Biener <rguenther@suse.de>
7292 * expr.h (non_mem_decl_p): Declare.
7293 (mem_ref_refers_to_non_mem_p): Likewise.
7294 * expr.c (non_mem_decl_p): Export.
7295 (mem_ref_refers_to_non_mem_p): Likewise.
7296 * internal-fn.c (expand_DEFERRED_INIT): Do not expand the LHS
7297 but check the base with mem_ref_refers_to_non_mem_p
7300 2021-10-04 Richard Biener <rguenther@suse.de>
7302 PR tree-optimization/102570
7303 * tree-ssa-sccvn.h (vn_reference_op_struct): Document
7304 we are using clique for the internal function code.
7305 * tree-ssa-sccvn.c (vn_reference_op_eq): Compare the
7306 internal function code.
7307 (print_vn_reference_ops): Print the internal function code.
7308 (vn_reference_op_compute_hash): Hash it.
7309 (copy_reference_ops_from_call): Record it.
7310 (visit_stmt): Remove the restriction around internal function
7312 (fully_constant_vn_reference_p): Use fold_const_call and handle
7314 (vn_reference_eq): Compare call return types.
7315 * tree-ssa-pre.c (create_expression_by_pieces): Handle
7316 generating calls to internal functions.
7317 (compute_avail): Remove the restriction around internal function
7320 2021-10-04 Aldy Hernandez <aldyh@redhat.com>
7322 PR tree-optimization/102560
7323 * gimple-ssa-warn-alloca.c (alloca_call_type): Remove static
7324 marker for invalid_range.
7326 2021-10-04 Richard Biener <rguenther@suse.de>
7328 PR middle-end/102587
7329 * internal-fn.c (expand_DEFERRED_INIT): Guard register
7330 initialization path an avoid initializing VLA registers
7333 2021-10-04 Eric Botcazou <ebotcazou@adacore.com>
7335 * config/rs6000/vxworks.h (TARGET_INIT_LIBFUNCS): Delete.
7337 2021-10-03 Martin Liska <mliska@suse.cz>
7339 * toplev.c (toplev::main): Check opt_index if it is a part
7342 2021-10-02 Aldy Hernandez <aldyh@redhat.com>
7344 PR tree-optimization/102563
7345 * range-op.cc (operator_lshift::op1_range): Do not clobber
7348 2021-10-02 Martin Liska <mliska@suse.cz>
7350 * toplev.c (toplev::main): save_decoded_options[0] is program
7351 name and so it should be skipped.
7353 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
7355 PR tree-optimization/102546
7356 * range-op.cc (operator_lshift::op1_range): Teach range-ops that
7357 X << Y is non-zero implies X is also non-zero.
7359 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
7361 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
7363 * config/aarch64/aarch64-tune.md: Regenerate.
7364 * doc/invoke.texi: Update docs.
7366 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
7368 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
7370 * config/aarch64/aarch64-tune.md: Regenerate.
7371 * doc/invoke.texi: Update docs.
7373 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
7375 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
7377 * config/aarch64/aarch64-tune.md: Regenerate.
7378 * doc/invoke.texi: Update docs.
7380 2021-10-01 Martin Sebor <msebor@redhat.com>
7383 * doc/invoke.texi (-Waddress): Update.
7384 * gengtype.c (write_types): Avoid -Waddress.
7385 * poly-int.h (POLY_SET_COEFF): Avoid using null.
7387 2021-10-01 John David Anglin <danglin@gcc.gnu.org>
7390 * config/pa/pa.c (pa_option_override): Default to dwarf version 4
7393 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
7395 * config/aarch64/aarch64.h (AARCH64_FL_V9): Update value.
7397 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
7399 * gimple-range-path.cc (path_range_query::compute_ranges): Use
7401 * gimple-range-path.h (class path_range_query): Remove shadowed
7403 (path_range_query::get_path_oracle): New.
7405 2021-10-01 Jakub Jelinek <jakub@redhat.com>
7406 Richard Biener <rguenther@suse.de>
7409 * doc/invoke.texi (-fsanitize=integer-divide-by-zero): Remove
7410 INT_MIN / -1 division detection from here ...
7411 (-fsanitize=signed-integer-overflow): ... and add it here.
7413 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
7415 * config/aarch64/aarch64-arches.def (AARCH64_ARCH): Added
7417 * config/aarch64/aarch64.h (AARCH64_FL_V9): New.
7418 (AARCH64_FL_FOR_ARCH9): New flags for Armv9-A.
7419 (AARCH64_ISA_V9): New ISA flag.
7420 * doc/invoke.texi: Update docs.
7422 2021-10-01 Martin Liska <mliska@suse.cz>
7424 * toplev.c (toplev::main): Save decoded optimization options.
7425 * toplev.h (save_opt_decoded_options): New.
7426 * doc/extend.texi: Be more clear about optimize and target
7429 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
7431 * explow.c: Include langhooks.h.
7432 (set_stack_check_libfunc): Build a proper function type.
7434 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
7437 * config/i386/i386.c (legitimate_pic_address_disp_p): For PE-COFF do
7438 not return true for external weak function symbols in medium model.
7440 2021-10-01 Jakub Jelinek <jakub@redhat.com>
7442 * tree.h (OMP_CLAUSE_ORDER_REPRODUCIBLE): Define.
7443 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_ORDER>: Print
7444 reproducible: for OMP_CLAUSE_ORDER_REPRODUCIBLE.
7445 * omp-general.c (omp_extract_for_data): If OMP_CLAUSE_ORDER is seen
7446 without OMP_CLAUSE_ORDER_UNCONSTRAINED, overwrite sched_kind to
7447 OMP_CLAUSE_SCHEDULE_STATIC.
7449 2021-10-01 Richard Biener <rguenther@suse.de>
7451 PR middle-end/102518
7452 * tree-inline.c (setup_one_parameter): Avoid substituting
7453 an invariant into contexts where a GIMPLE register is not valid.
7455 2021-09-30 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
7457 * config/arm/arm-cpus.in: Add Cortex-R52+ CPU.
7458 * config/arm/arm-tables.opt: Regenerate.
7459 * config/arm/arm-tune.md: Regenerate.
7460 * doc/invoke.texi: Update docs.
7462 2021-09-30 Uroš Bizjak <ubizjak@gmail.com>
7465 * config/i386/i386.md
7466 (sign_extend:WIDE (any_logic:NARROW (memory, immediate)) splitters):
7469 2021-09-30 Tobias Burnus <tobias@codesourcery.com>
7471 * omp-low.c (omp_runtime_api_call): Add omp_aligned_{,c}alloc and
7472 omp_{c,re}alloc, fix omp_alloc/omp_free.
7474 2021-09-30 Martin Liska <mliska@suse.cz>
7476 * defaults.h (ASM_OUTPUT_ASCII): Do not hide global variable
7477 asm_out_file and stream directly to MYFILE.
7479 2021-09-30 Richard Biener <rguenther@suse.de>
7481 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
7482 Restore and fix condition under which we apply npeel to
7483 the DRs misalignment value.
7485 2021-09-30 Richard Biener <rguenther@suse.de>
7487 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
7488 Fix npeel check for variable amount of peeling.
7490 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
7492 * lto-wrapper.c (run_gcc): Plug snprintf overflow.
7494 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
7496 * gimple-range.cc (gimple_ranger::debug): New.
7497 * gimple-range.h (class gimple_ranger): Add debug.
7499 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
7501 PR middle-end/102519
7502 * tree-vrp.c (hybrid_threader::~hybrid_threader): Free m_query.
7504 2021-09-29 Indu Bhagat <indu.bhagat@oracle.com>
7507 * btfout.c (GTY): Add GTY (()) albeit for cosmetic only purpose.
7508 (btf_finalize): Empty the hash_map btf_var_ids.
7510 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
7512 * tree-vrp.c (thread_through_all_blocks): Return bool.
7513 (execute_vrp_threader): Return TODO_* flags.
7514 (pass_data_vrp_threader): Set todo_flags_finish to 0.
7516 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
7518 * timevar.def (TV_TREE_VRP_THREADER): New.
7519 * tree-vrp.c: Use TV_TREE_VRP_THREADER for VRP threader pass.
7521 2021-09-29 David Faust <david.faust@oracle.com>
7523 * config.gcc (bpf-*-*): Do not overwrite extra_headers.
7525 2021-09-29 Jonathan Wright <jonathan.wright@arm.com>
7527 * config/aarch64/aarch64-builtins.c (TYPES_BINOP_PPU): Define
7528 new type qualifier enum.
7529 (TYPES_TERNOP_SSSU): Likewise.
7530 (TYPES_TERNOP_PPPU): Likewise.
7531 * config/aarch64/aarch64-simd-builtins.def: Define PPU, SSU,
7532 PPPU and SSSU builtin generator macros for qtbl1 and qtbx1
7534 * config/aarch64/arm_neon.h (vqtbl1_p8): Use type-qualified
7535 builtin and remove casts.
7536 (vqtbl1_s8): Likewise.
7537 (vqtbl1q_p8): Likewise.
7538 (vqtbl1q_s8): Likewise.
7539 (vqtbx1_s8): Likewise.
7540 (vqtbx1_p8): Likewise.
7541 (vqtbx1q_s8): Likewise.
7542 (vqtbx1q_p8): Likewise.
7543 (vtbl1_p8): Likewise.
7544 (vtbl2_p8): Likewise.
7545 (vtbx2_p8): Likewise.
7547 2021-09-29 Richard Biener <rguenther@suse.de>
7549 * tree-vect-data-refs.c (vect_dr_misalign_for_aligned_access):
7551 (vect_update_misalignment_for_peel): Use it to update
7552 misaligned to the value necessary for an aligned access.
7553 (vect_get_peeling_costs_all_drs): Likewise.
7554 (vect_enhance_data_refs_alignment): Likewise.
7556 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
7558 * config/aarch64/aarch64.c (aarch64_expand_cpymem): Count number of
7559 emitted operations and adjust heuristic for code size.
7561 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
7563 * config/aarch64/aarch64.c (aarch64_expand_setmem): Count number of
7564 emitted operations and adjust heuristic for code size.
7566 2021-09-29 Jakub Jelinek <jakub@redhat.com>
7568 PR middle-end/102504
7569 * gimplify.c (gimplify_scan_omp_clauses): Use omp_check_private even
7570 in OMP_SCOPE clauses, not just on worksharing construct clauses.
7572 2021-09-28 Geng Qi <gengqi@linux.alibaba.com>
7574 * config/riscv/riscv.md (mulv<mode>4): Call gen_smul<mode>3_highpart.
7575 (<u>mulditi3): Call <su>muldi3_highpart.
7576 (<u>muldi3_highpart): Rename to <su>muldi3_highpart.
7577 (<u>mulsidi3): Call <su>mulsi3_highpart.
7578 (<u>mulsi3_highpart): Rename to <su>mulsi3_highpart.
7580 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
7582 * config/darwin.h (DSYMUTIL_SPEC): Recognize D sources.
7584 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
7586 * config/rs6000/darwin.h (FIXED_R13): Add for PPC64.
7587 (FIRST_SAVED_GP_REGNO): Save from R13 even when it is one
7590 2021-09-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
7592 * config/aarch64/aarch64.h (AARCH64_FL_LS64): Define
7593 (AARCH64_FL_V8_7): Likewise.
7594 (AARCH64_FL_FOR_ARCH8_7): Likewise.
7595 * config/aarch64/aarch64-arches.def (armv8.7-a): Define.
7596 * config/aarch64/aarch64-option-extensions.def (ls64): Define.
7597 * doc/invoke.texi: Document the above.
7599 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
7601 * dbgcnt.c (dbg_cnt_counter): New.
7602 * dbgcnt.h (dbg_cnt_counter): New.
7603 * dumpfile.c (dump_options): Add entry for TDF_THREADING.
7604 * dumpfile.h (enum dump_flag): Add TDF_THREADING.
7605 * gimple-range-path.cc (DEBUG_SOLVER): Use TDF_THREADING.
7606 * tree-ssa-threadupdate.c (dump_jump_thread_path): Dump out
7609 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
7611 * cfgcleanup.c (pass_jump::execute): Check
7612 flag_expensive_optimizations.
7613 (pass_jump_after_combine::gate): Same.
7614 * doc/invoke.texi (-fthread-jumps): Enable for -O1.
7615 * opts.c (default_options_table): Enable -fthread-jumps at -O1.
7616 * tree-ssa-threadupdate.c
7617 (fwd_jt_path_registry::remove_jump_threads_including): Bail unless
7620 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
7622 * tree-ssa-reassoc.c (biased_names): New global.
7623 (propagate_bias_p): New function.
7624 (loop_carried_phi): Remove.
7625 (propagate_rank): Propagate bias along single uses.
7626 (get_rank): Update biased_names when needed.
7628 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
7630 * passes.def (pass_reassoc): Rename parameter to early_p.
7631 * tree-ssa-reassoc.c (reassoc_bias_loop_carried_phi_ranks_p):
7633 (phi_rank): Don't bias loop-carried phi ranks
7634 before vectorization pass.
7635 (execute_reassoc): Add bias_loop_carried_phi_ranks_p parameter.
7636 (pass_reassoc::pass_reassoc): Add bias_loop_carried_phi_ranks_p
7638 (pass_reassoc::set_param): Set bias_loop_carried_phi_ranks_p
7640 (pass_reassoc::execute): Pass bias_loop_carried_phi_ranks_p to
7642 (pass_reassoc::bias_loop_carried_phi_ranks_p): New member.
7644 2021-09-28 Jakub Jelinek <jakub@redhat.com>
7647 * config/i386/i386.c (standard_80387_constant_p): Don't recognize
7648 special 80387 instruction XFmode constants if flag_rounding_math.
7650 2021-09-28 Richard Biener <rguenther@suse.de>
7652 PR tree-optimization/100112
7653 * tree-ssa-sccvn.c (visit_reference_op_load): Record the
7654 referece into the hashtable twice in case last_vuse is
7655 different from the original vuse on the stmt.
7657 2021-09-28 Jakub Jelinek <jakub@redhat.com>
7659 PR middle-end/102492
7660 * gimplify.c (gimplify_adjust_omp_clauses_1): Don't call the
7661 omp_finish_clause langhook on implicitly added OMP_CLAUSE_PRIVATE
7662 clauses on SIMD constructs.
7664 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
7666 PR tree-optimization/102511
7667 * gimple-range-path.cc (path_range_query::range_on_path_entry):
7668 Return VARYING when nothing found.
7670 2021-09-28 Hongyu Wang <hongyu.wang@intel.com>
7673 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): Add
7675 (VALID_SSE2_REG_VHF_MODE): Add V4HFmode and V2HFmode.
7676 (VALID_MMX_REG_MODE): Add V4HFmode.
7677 (SSE_REG_MODE_P): Replace VALID_AVX512FP16_REG_MODE with
7678 vector mode condition.
7679 * config/i386/i386.c (classify_argument): Parse V4HF/V2HF
7681 (function_arg_32): Add V4HFmode.
7682 (function_arg_advance_32): Likewise.
7683 * config/i386/i386.md (mode): Add V4HF/V2HF.
7684 (MODE_SIZE): Likewise.
7685 * config/i386/mmx.md (MMXMODE): Add V4HF mode.
7686 (V_32): Add V2HF mode.
7687 (VHF_32_64): New mode iterator.
7688 (*mov<mode>_internal): Adjust sse alternatives to support
7690 (*mov<mode>_internal): Adjust sse alternatives to support
7692 (<insn><mode>3): New define_insn for add/sub/mul/div.
7694 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
7696 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Check
7698 (pass_early_thread_jumps::gate): Same.
7699 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
7700 Return if !flag_thread_jumps.
7701 * tree-ssa-threadupdate.c
7702 (jt_path_registry::register_jump_thread): Assert that
7703 flag_thread_jumps is true.
7705 2021-09-28 liuhongt <hongtao.liu@intel.com>
7708 (simplify_context::simplify_binary_operation_1): Relax
7709 condition of simplifying (vec_concat:M (vec_select op0
7710 index0)(vec_select op1 index1)) to allow different modes
7711 between op0 and M, but have same inner mode.
7713 2021-09-28 liuhongt <hongtao.liu@intel.com>
7715 * config/i386/i386-expand.c (emit_reduc_half): Handle
7716 V8HF/V16HF/V32HFmode.
7717 * config/i386/sse.md (REDUC_SSE_PLUS_MODE): Add V8HF.
7718 (REDUC_SSE_SMINMAX_MODE): Ditto.
7719 (REDUC_PLUS_MODE): Add V16HF and V32HF.
7720 (REDUC_SMINMAX_MODE): Ditto.
7722 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
7724 * gimple-range-path.cc
7725 (path_range_query::precompute_ranges_in_block): Rename to...
7726 (path_range_query::compute_ranges_in_block): ...this.
7727 (path_range_query::precompute_ranges): Rename to...
7728 (path_range_query::compute_ranges): ...this.
7729 (path_range_query::precompute_relations): Rename to...
7730 (path_range_query::compute_relations): ...this.
7731 (path_range_query::precompute_phi_relations): Rename to...
7732 (path_range_query::compute_phi_relations): ...this.
7733 * gimple-range-path.h: Rename precompute* to compute*.
7734 * tree-ssa-threadbackward.c
7735 (back_threader::find_taken_edge_switch): Same.
7736 (back_threader::find_taken_edge_cond): Same.
7737 * tree-ssa-threadedge.c
7738 (hybrid_jt_simplifier::compute_ranges_from_state): Same.
7739 (hybrid_jt_state::register_equivs_stmt): Inline...
7740 * tree-ssa-threadedge.h: ...here.
7742 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
7744 * tree-vrp.c (lhs_of_dominating_assert): Remove.
7745 (class vrp_jt_state): Remove.
7746 (class vrp_jt_simplifier): Remove.
7747 (vrp_jt_simplifier::simplify): Remove.
7748 (class vrp_jump_threader): Remove.
7749 (vrp_jump_threader::vrp_jump_threader): Remove.
7750 (vrp_jump_threader::~vrp_jump_threader): Remove.
7751 (vrp_jump_threader::before_dom_children): Remove.
7752 (vrp_jump_threader::after_dom_children): Remove.
7754 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
7756 * passes.def (pass_vrp_threader): New.
7757 * tree-pass.h (make_pass_vrp_threader): Add make_pass_vrp_threader.
7758 * tree-ssa-threadedge.c (hybrid_jt_state::register_equivs_stmt): New.
7759 (hybrid_jt_simplifier::hybrid_jt_simplifier): New.
7760 (hybrid_jt_simplifier::simplify): New.
7761 (hybrid_jt_simplifier::compute_ranges_from_state): New.
7762 * tree-ssa-threadedge.h (class hybrid_jt_state): New.
7763 (class hybrid_jt_simplifier): New.
7764 * tree-vrp.c (execute_vrp): Remove ASSERT_EXPR based jump
7766 (class hybrid_threader): New.
7767 (hybrid_threader::hybrid_threader): New.
7768 (hybrid_threader::~hybrid_threader): New.
7769 (hybrid_threader::before_dom_children): New.
7770 (hybrid_threader::after_dom_children): New.
7771 (execute_vrp_threader): New.
7772 (class pass_vrp_threader): New.
7773 (make_pass_vrp_threader): New.
7775 2021-09-27 Martin Liska <mliska@suse.cz>
7777 * output.h (enum section_flag): New.
7778 (SECTION_FORGET): Remove.
7779 (SECTION_ENTSIZE): Make it (1UL << 8) - 1.
7780 (SECTION_STYLE_MASK): Define it based on other enum
7782 * varasm.c (switch_to_section): Remove unused handling of
7785 2021-09-27 Martin Liska <mliska@suse.cz>
7787 * common.opt: Add new variable flag_default_complex_method.
7788 * opts.c (finish_options): Handle flags related to
7789 x_flag_complex_method.
7790 * toplev.c (process_options): Remove option handling related
7791 to flag_complex_method.
7793 2021-09-27 Richard Biener <rguenther@suse.de>
7795 PR middle-end/102450
7796 * gimple-fold.c (gimple_fold_builtin_memory_op): Avoid using
7797 type_for_size, instead use int_mode_for_size.
7799 2021-09-27 Andrew Pinski <apinski@marvell.com>
7802 * gimplify.c (gimplify_save_expr): Return early
7803 if the type of val is error_mark_node.
7805 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
7807 * tree-ssanames.c (ssa_name_has_boolean_range): Use
7810 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
7812 * gimple-ssa-evrp-analyze.h (class evrp_range_analyzer): Remove
7813 vrp_visit_cond_stmt.
7814 * tree-ssa-dom.c (cprop_operand): Convert to range_query API.
7815 (cprop_into_stmt): Same.
7816 (dom_opt_dom_walker::optimize_stmt): Same.
7818 2021-09-27 Richard Biener <rguenther@suse.de>
7820 PR tree-optimization/97351
7821 PR tree-optimization/97352
7822 PR tree-optimization/82426
7823 * tree-vectorizer.h (dr_misalignment): Add vector type
7825 (aligned_access_p): Likewise.
7826 (known_alignment_for_access_p): Likewise.
7827 (vect_supportable_dr_alignment): Likewise.
7828 (vect_known_alignment_in_bytes): Likewise. Refactor.
7829 (DR_MISALIGNMENT): Remove.
7830 (vect_update_shared_vectype): Likewise.
7831 * tree-vect-data-refs.c (dr_misalignment): Refactor, handle
7832 a vector type with larger alignment requirement and apply
7833 the negative step adjustment here.
7834 (vect_calculate_target_alignment): Remove.
7835 (vect_compute_data_ref_alignment): Get explicit vector type
7836 argument, do not apply a negative step alignment adjustment
7838 (vect_slp_analyze_node_alignment): Re-analyze alignment
7839 when we re-visit the DR with a bigger desired alignment but
7840 keep more precise results from smaller alignments.
7841 * tree-vect-slp.c (vect_update_shared_vectype): Remove.
7842 (vect_slp_analyze_node_operations_1): Do not update the
7843 shared vector type on stmts.
7844 * tree-vect-stmts.c (vect_analyze_stmt): Push/pop the
7845 vector type of an SLP node to the representative stmt-info.
7846 (vect_transform_stmt): Likewise.
7848 2021-09-27 liuhongt <hongtao.liu@intel.com>
7851 2021-09-09 liuhongt <hongtao.liu@intel.com>
7854 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
7855 (reduc_plus_scal_v4sf): .. this, New define_expand.
7856 (reduc_plus_scal_v2df): .. and this, New define_expand.
7858 2021-09-26 liuhongt <hongtao.liu@intel.com>
7860 * doc/extend.texi (Half-Precision): Remove storage only
7861 description for _Float16 w/o avx512fp16.
7863 2021-09-25 Dimitar Dimitrov <dimitar@dinux.eu>
7865 * config/pru/constraints.md (Rrio): New constraint.
7866 * config/pru/predicates.md (regio_operand): New predicate.
7867 * config/pru/pru-pragma.c (pru_register_pragmas): Register
7868 the __regio_symbol address space.
7869 * config/pru/pru-protos.h (pru_symref2ioregno): Declaration.
7870 * config/pru/pru.c (pru_symref2ioregno): New helper function.
7871 (pru_legitimate_address_p): Remove.
7872 (pru_addr_space_legitimate_address_p): Use the address space
7874 (pru_nongeneric_pointer_addrspace): New helper function.
7875 (pru_insert_attributes): New function to validate __regio_symbol
7877 (TARGET_INSERT_ATTRIBUTES): New macro.
7878 (TARGET_LEGITIMATE_ADDRESS_P): Remove.
7879 (TARGET_ADDR_SPACE_LEGITIMATE_ADDRESS_P): New macro.
7880 * config/pru/pru.h (enum reg_class): Add REGIO_REGS class.
7881 * config/pru/pru.md (*regio_readsi): New pattern to read I/O
7883 (*regio_nozext_writesi): New pattern to write to I/O registers.
7884 (*regio_zext_write_r30<EQS0:mode>): Ditto.
7885 * doc/extend.texi: Document the new PRU Named Address Space.
7887 2021-09-24 Patrick Palka <ppalka@redhat.com>
7891 * real.c (encode_ieee_double): Avoid unwanted sign extension.
7892 (encode_ieee_quad): Likewise.
7894 2021-09-24 Vladimir Makarov <vmakarov@redhat.com>
7896 PR rtl-optimization/102147
7897 * ira-build.c (ira_conflict_vector_profitable_p): Make
7898 profitability calculation independent of host compiler pointer and
7901 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
7903 * gimple-range-path.cc (path_range_query::path_range_query):
7904 Move debugging header...
7905 (path_range_query::precompute_ranges): ...here.
7906 (path_range_query::internal_range_of_expr): Do not call
7907 range_on_path_entry if NAME is defined in the current block.
7909 2021-09-24 Richard Biener <rguenther@suse.de>
7911 * cfghooks.c (verify_flow_info): Verify unallocated BB and
7912 edge flags are not set.
7914 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
7916 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
7918 (jt_path_registry::register_jump_thread): Call
7919 cancel_invalid_paths.
7920 * tree-ssa-threadupdate.h (class jt_path_registry): Add
7921 cancel_invalid_paths.
7923 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
7925 PR tree-optimization/102400
7926 * tree-ssa-sccvn.c (vn_reference_insert_pieces): Initialize
7927 result_vdef to zero value.
7929 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
7931 PR tree-optimization/102451
7932 * tree-ssa-dse.c (delete_dead_or_redundant_call): Record bb of stmt
7935 2021-09-24 Hongyu Wang <hongyu.wang@intel.com>
7937 * config/i386/sse.md (cond_<insn><mode>): Extend to support
7939 (cond_mul<mode>): Likewise.
7940 (cond_div<mode>): Likewise.
7941 (cond_<code><mode>): Likewise.
7942 (cond_fma<mode>): Likewise.
7943 (cond_fms<mode>): Likewise.
7944 (cond_fnma<mode>): Likewise.
7945 (cond_fnms<mode>): Likewise.
7947 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
7949 PR tree-optimization/102463
7950 * gimple-range-fold.cc (fold_using_range::relation_fold_and_or): If
7951 there is no range-ops handler, don't look for a relation.
7953 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
7955 * gimple-range-cache.cc (ranger_cache::ranger_cache): Take
7956 non-executable_edge flag as parameter.
7957 * gimple-range-cache.h (ranger_cache): Adjust prototype.
7958 * gimple-range-gori.cc (gori_compute::gori_compute): Take
7959 non-executable_edge flag as parameter.
7960 (gori_compute::outgoing_edge_range_p): Check new flag.
7961 * gimple-range-gori.h (gori_compute): Adjust prototype.
7962 * gimple-range.cc (gimple_ranger::gimple_ranger): Create new flag.
7963 (gimple_ranger::range_on_edge): Check new flag.
7964 * gimple-range.h (gimple_ranger::non_executable_edge_flag): New.
7965 * gimple-ssa-evrp.c (rvrp_folder): Pass ranger flag to simplifer.
7966 (hybrid_folder::hybrid_folder): Set ranger non-executable flag value.
7967 (hybrid_folder::fold_stmt): Set flag value in the simplifer.
7968 * vr-values.c (simplify_using_ranges::set_and_propagate_unexecutable):
7969 Use not_executable flag if provided inmstead of EDGE_EXECUTABLE.
7970 (simplify_using_ranges::simplify_switch_using_ranges): Clear
7971 EDGE_EXECUTABLE like it originally did.
7972 (simplify_using_ranges::cleanup_edges_and_switches): Clear any
7973 NON_EXECUTABLE flags.
7974 (simplify_using_ranges::simplify_using_ranges): Adjust.
7975 * vr-values.h (class simplify_using_ranges): Adjust.
7976 (simplify_using_ranges::set_range_query): Add non-executable flag param.
7978 2021-09-23 Bill Schmidt <wschmidt@linux.ibm.com>
7981 * config/rs6000/rs6000-call.c (rs6000_aggregate_candidate): Detect
7982 zero-width bit fields and return indicator.
7983 (rs6000_discover_homogeneous_aggregate): Diagnose when the
7984 presence of a zero-width bit field changes parameter passing in
7987 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
7989 * gimple-range-fold.cc (fold_using_range::range_of_phi):
7990 Remove dominator check.
7992 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
7994 * gimple-range-path.cc (path_range_query::precompute_relations):
7995 Hoist edge calculations before using EDGE_SUCC.
7997 2021-09-23 Jonathan Wakely <jwakely@redhat.com>
7999 * configure.ac: Fix --with-multilib-list description.
8000 * configure: Regenerate.
8002 2021-09-23 Richard Biener <rguenther@suse.de>
8004 PR tree-optimization/102448
8005 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
8006 Clear alignment info copied from DR_PTR_INFO.
8008 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
8010 * config/i386/i386-expand.c (ix86_use_mask_cmp_p): Enable
8012 * config/i386/sse.md (sseintvecmodelower): Add HF vector modes.
8013 (<avx512>_store<mode>_mask): Extend to support HF vector modes.
8014 (vec_cmp<mode><avx512fmaskmodelower>): Likewise.
8015 (vcond_mask_<mode><avx512fmaskmodelower>): Likewise.
8016 (vcond<mode><mode>): New expander.
8017 (vcond<mode><sseintvecmodelower>): Likewise.
8018 (vcond<sseintvecmodelower><mode>): Likewise.
8019 (vcondu<mode><sseintvecmodelower>): Likewise.
8021 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
8023 * config/i386/sse.md (extend<ssePHmodelower><mode>2):
8025 (extendv4hf<mode>2): Likewise.
8026 (extendv2hfv2df2): Likewise.
8027 (trunc<mode><ssePHmodelower>2): Likewise.
8028 (avx512fp16_vcvt<castmode>2ph_<mode>): Rename to ...
8029 (trunc<mode>v4hf2): ... this, and drop constraints.
8030 (avx512fp16_vcvtpd2ph_v2df): Rename to ...
8031 (truncv2dfv2hf2): ... this, and likewise.
8033 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
8035 * config/i386/sse.md (float<floatunssuffix><mode><ssePHmodelower>2):
8037 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>):
8039 (float<floatunssuffix><mode>v4hf2): ... this, and drop constraints.
8040 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Rename to ...
8041 (float<floatunssuffix>v2div2hf2): ... this, and likewise.
8043 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
8045 * config/i386/i386.md (fix<fixunssuffix>_trunchf<mode>2): New expander.
8046 (fixuns_trunchfhi2): Likewise.
8047 (*fixuns_trunchfsi2zext): New define_insn.
8048 * config/i386/sse.md (ssePHmodelower): New mode_attr.
8049 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
8050 New expander for same element vector fix_truncate.
8051 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
8052 Likewise for V4HF to V4SI/V4DI fix_truncate.
8053 (fix<fixunssuffix>_truncv2hfv2di2):
8054 Likeise for V2HF to V2DI fix_truncate.
8056 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
8058 * config/i386/i386.md (<code>hf3): New expander.
8060 2021-09-23 liuhongt <hongtao.liu@intel.com>
8062 * config/i386/sse.md (FMAMODEM): extend to handle FP16.
8063 (VFH_SF_AVX512VL): Extend to handle HFmode.
8064 (VF_SF_AVX512VL): Deleted.
8066 2021-09-23 liuhongt <hongtao.liu@intel.com>
8068 * config/i386/i386.md (rinthf2): New expander.
8069 (nearbyinthf2): New expander.
8071 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
8073 * tree-ssa-dom.c (class dom_jump_threader_simplifier): Rename...
8074 (class dom_jt_state): ...this and provide virtual overrides.
8075 (dom_jt_state::register_equiv): New.
8076 (class dom_jt_simplifier): Rename from
8077 dom_jump_threader_simplifier.
8078 (dom_jump_threader_simplifier::simplify): Rename...
8079 (dom_jt_simplifier::simplify): ...to this.
8080 (pass_dominator::execute): Use dom_jt_simplifier and
8082 * tree-ssa-threadedge.c (jump_threader::jump_threader):
8084 (jt_state::register_equivs_stmt): Abstract out...
8085 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
8087 (jump_threader::thread_around_empty_blocks): Update state.
8088 (jump_threader::thread_through_normal_block): Same.
8089 (jt_state::jt_state): Remove.
8090 (jt_state::push): Remove pass specific bits. Keep block vector
8092 (jt_state::append_path): New.
8093 (jt_state::pop): Remove pass specific bits.
8094 (jt_state::register_equiv): Same.
8095 (jt_state::record_ranges_from_stmt): Same.
8096 (jt_state::register_equivs_on_edge): Same. Rename...
8097 (jt_state::register_equivs_edge): ...to this.
8098 (jt_state::dump): New.
8099 (jt_state::debug): New.
8100 (jump_threader_simplifier::simplify): Remove.
8101 (jt_state::get_path): New.
8102 * tree-ssa-threadedge.h (class jt_simplifier): Make into a base
8103 class. Expose common functionality as virtual methods.
8104 (class jump_threader_simplifier): Same. Rename...
8105 (class jt_simplifier): ...to this.
8106 * tree-vrp.c (class vrp_jump_threader_simplifier): Rename...
8107 (class vrp_jt_simplifier): ...to this. Provide pass specific
8109 (class vrp_jt_state): New.
8110 (vrp_jump_threader_simplifier::simplify): Rename...
8111 (vrp_jt_simplifier::simplify): ...to this. Inline code from
8112 what used to be the base class.
8113 (vrp_jump_threader::vrp_jump_threader): Use vrp_jt_state and
8116 2021-09-22 Tobias Burnus <tobias@codesourcery.com>
8119 * doc/invoke.texi (-Wno-missing-include-dirs.): Document Fortran
8122 2021-09-22 Roger Sayle <roger@nextmovesoftware.com>
8123 Richard Biener <rguenther@suse.de>
8125 * match.pd (negation simplifications): Implement some negation
8126 folding transformations from fold-const.c's fold_negate_expr.
8127 * tree-ssa-sccvn.c (vn_nary_build_or_lookup_1): Add a SIMPLIFY
8128 argument, to control whether the op should be simplified prior
8129 to looking up/assigning a value number.
8130 (vn_nary_build_or_lookup): Update call to vn_nary_build_or_lookup_1.
8131 (vn_nary_simplify): Likewise.
8132 (visit_nary_op): Likewise, but when constructing a NEGATE_EXPR
8133 now call vn_nary_build_or_lookup_1 disabling simplification.
8135 2021-09-22 Jiufu Guo <guojiufu@linux.ibm.com>
8137 PR tree-optimization/102087
8138 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
8139 Update bound/cmp/control for niter.
8141 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
8143 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
8144 Move check for non-empty BB here.
8145 (fur_source::register_outgoing_edges): ...from here.
8147 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
8149 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
8150 Remove call to improve_range_with_equivs.
8151 (path_range_query::improve_range_with_equivs): Remove
8152 * gimple-range-path.h: Remove improve_range_with_equivs.
8154 2021-09-22 dianhong xu <dianhong.xu@intel.com>
8156 * config/i386/avx512fp16intrin.h:
8157 (_mm512_mask_blend_ph): New intrinsic.
8158 (_mm512_permutex2var_ph): Ditto.
8159 (_mm512_permutexvar_ph): Ditto.
8160 * config/i386/avx512fp16vlintrin.h:
8161 (_mm256_mask_blend_ph): New intrinsic.
8162 (_mm256_permutex2var_ph): Ditto.
8163 (_mm256_permutexvar_ph): Ditto.
8164 (_mm_mask_blend_ph): Ditto.
8165 (_mm_permutex2var_ph): Ditto.
8166 (_mm_permutexvar_ph): Ditto.
8168 2021-09-22 dianhong xu <dianhong.xu@intel.com>
8170 * config/i386/avx512fp16intrin.h: Add new intrinsics.
8171 (_mm512_conj_pch): New intrinsic.
8172 (_mm512_mask_conj_pch): Ditto.
8173 (_mm512_maskz_conj_pch): Ditto.
8174 * config/i386/avx512fp16vlintrin.h: Add new intrinsics.
8175 (_mm256_conj_pch): New intrinsic.
8176 (_mm256_mask_conj_pch): Ditto.
8177 (_mm256_maskz_conj_pch): Ditto.
8178 (_mm_conj_pch): Ditto.
8179 (_mm_mask_conj_pch): Ditto.
8180 (_mm_maskz_conj_pch): Ditto.
8182 2021-09-22 dianhong xu <dianhong.xu@intel.com>
8184 * config/i386/avx512fp16intrin.h (_MM512_REDUCE_OP): New macro
8185 (_mm512_reduce_add_ph): New intrinsic.
8186 (_mm512_reduce_mul_ph): Ditto.
8187 (_mm512_reduce_min_ph): Ditto.
8188 (_mm512_reduce_max_ph): Ditto.
8189 * config/i386/avx512fp16vlintrin.h
8190 (_MM256_REDUCE_OP/_MM_REDUCE_OP): New macro.
8191 (_mm256_reduce_add_ph): New intrinsic.
8192 (_mm256_reduce_mul_ph): Ditto.
8193 (_mm256_reduce_min_ph): Ditto.
8194 (_mm256_reduce_max_ph): Ditto.
8195 (_mm_reduce_add_ph): Ditto.
8196 (_mm_reduce_mul_ph): Ditto.
8197 (_mm_reduce_min_ph): Ditto.
8198 (_mm_reduce_max_ph): Ditto.
8200 2021-09-22 dianhong xu <dianhong.xu@intel.com>
8202 * config/i386/avx512fp16intrin.h (__m512h_u, __m256h_u,
8203 __m128h_u): New typedef.
8204 (_mm512_load_ph): New intrinsic.
8205 (_mm256_load_ph): Ditto.
8206 (_mm_load_ph): Ditto.
8207 (_mm512_loadu_ph): Ditto.
8208 (_mm256_loadu_ph): Ditto.
8209 (_mm_loadu_ph): Ditto.
8210 (_mm512_store_ph): Ditto.
8211 (_mm256_store_ph): Ditto.
8212 (_mm_store_ph): Ditto.
8213 (_mm512_storeu_ph): Ditto.
8214 (_mm256_storeu_ph): Ditto.
8215 (_mm_storeu_ph): Ditto.
8216 (_mm512_abs_ph): Ditto.
8217 * config/i386/avx512fp16vlintrin.h
8218 (_mm_abs_ph): Ditto.
8219 (_mm256_abs_ph): Ditto.
8221 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
8223 * config/s390/tpf.md (prologue_tpf, epilogue_tpf): Add cc clobber.
8225 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
8228 * config/s390/s390.c (s390_expand_insv): Emit a normal move if it
8229 is actually a full copy of the source operand into the target.
8230 Don't emit a strict low part move if source and target mode match.
8232 2021-09-22 Jakub Jelinek <jakub@redhat.com>
8234 PR middle-end/102415
8235 * omp-expand.c (expand_omp_single): If region->exit is NULL,
8236 assert region->entry is GIMPLE_OMP_SCOPE region and return.
8238 2021-09-22 Jakub Jelinek <jakub@redhat.com>
8240 * tree.h (OMP_CLAUSE_ALLOCATE_ALIGN): Define.
8241 * tree.c (omp_clause_num_ops): Change number of OMP_CLAUSE_ALLOCATE
8242 arguments from 2 to 3.
8243 * tree-pretty-print.c (dump_omp_clause): Print allocator() around
8244 allocate clause allocator and print align if present.
8245 * omp-low.c (scan_sharing_clauses): Force allocate_map entry even
8246 for omp_default_mem_alloc if align modifier is present. If align
8247 modifier is present, use TREE_LIST to encode both allocator and
8249 (lower_private_allocate, lower_rec_input_clauses, create_task_copyfn):
8250 Handle align modifier on allocator clause if present.
8252 2021-09-22 liuhongt <hongtao.liu@intel.com>
8254 * config/i386/i386.md (define_attr "isa"): Add
8256 (define_attr "enabled"): Correspond fma_or_avx512vl to
8257 TARGET_FMA || TARGET_AVX512VL.
8258 * config/i386/mmx.md (fmav2sf4): Extend to AVX512 fma.
8263 2021-09-22 liuhongt <hongtao.liu@intel.com>
8265 * config/i386/i386.md (cstorehf3): New define_expand.
8267 2021-09-22 liuhongt <hongtao.liu@intel.com>
8269 * config/i386/i386.md (<rounding_insn>hf2): New expander.
8270 (sse4_1_round<mode>2): Extend from MODEF to MODEFH.
8271 * config/i386/sse.md (*sse4_1_round<ssescalarmodesuffix>):
8272 Extend from VF_128 to VFH_128.
8274 2021-09-22 liuhongt <hongtao.liu@intel.com>
8276 * config/i386/i386-features.c (i386-features.c): Handle
8278 * config/i386/i386.md (sqrthf2): New expander.
8279 (*sqrthf2): New define_insn.
8280 * config/i386/sse.md
8281 (*<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
8284 2021-09-22 liuhongt <hongtao.liu@intel.com>
8286 * config/i386/avx512fp16intrin.h (_mm_mask_fcmadd_sch):
8288 (_mm_mask3_fcmadd_sch): Likewise.
8289 (_mm_maskz_fcmadd_sch): Likewise.
8290 (_mm_fcmadd_sch): Likewise.
8291 (_mm_mask_fmadd_sch): Likewise.
8292 (_mm_mask3_fmadd_sch): Likewise.
8293 (_mm_maskz_fmadd_sch): Likewise.
8294 (_mm_fmadd_sch): Likewise.
8295 (_mm_mask_fcmadd_round_sch): Likewise.
8296 (_mm_mask3_fcmadd_round_sch): Likewise.
8297 (_mm_maskz_fcmadd_round_sch): Likewise.
8298 (_mm_fcmadd_round_sch): Likewise.
8299 (_mm_mask_fmadd_round_sch): Likewise.
8300 (_mm_mask3_fmadd_round_sch): Likewise.
8301 (_mm_maskz_fmadd_round_sch): Likewise.
8302 (_mm_fmadd_round_sch): Likewise.
8303 (_mm_fcmul_sch): Likewise.
8304 (_mm_mask_fcmul_sch): Likewise.
8305 (_mm_maskz_fcmul_sch): Likewise.
8306 (_mm_fmul_sch): Likewise.
8307 (_mm_mask_fmul_sch): Likewise.
8308 (_mm_maskz_fmul_sch): Likewise.
8309 (_mm_fcmul_round_sch): Likewise.
8310 (_mm_mask_fcmul_round_sch): Likewise.
8311 (_mm_maskz_fcmul_round_sch): Likewise.
8312 (_mm_fmul_round_sch): Likewise.
8313 (_mm_mask_fmul_round_sch): Likewise.
8314 (_mm_maskz_fmul_round_sch): Likewise.
8315 * config/i386/i386-builtin.def: Add corresponding new builtins.
8316 * config/i386/sse.md
8317 (avx512fp16_fmaddcsh_v8hf_maskz<round_expand_name>): New expander.
8318 (avx512fp16_fcmaddcsh_v8hf_maskz<round_expand_name>): Ditto.
8319 (avx512fp16_fma_<complexopname>sh_v8hf<mask_scalarcz_name><round_scalarcz_name>):
8321 (avx512fp16_<complexopname>sh_v8hf_mask<round_name>): Ditto.
8322 (avx512fp16_<complexopname>sh_v8hf<mask_scalarc_name><round_scalarcz_name>):
8324 * config/i386/subst.md (mask_scalarcz_name): New.
8325 (mask_scalarc_name): Ditto.
8326 (mask_scalarc_operand3): Ditto.
8327 (mask_scalarcz_operand4): Ditto.
8328 (round_scalarcz_name): Ditto.
8329 (round_scalarc_mask_operand3): Ditto.
8330 (round_scalarcz_mask_operand4): Ditto.
8331 (round_scalarc_mask_op3): Ditto.
8332 (round_scalarcz_mask_op4): Ditto.
8333 (round_scalarcz_constraint): Ditto.
8334 (round_scalarcz_nimm_predicate): Ditto.
8335 (mask_scalarcz): Ditto.
8336 (mask_scalarc): Ditto.
8337 (round_scalarcz): Ditto.
8339 2021-09-22 liuhongt <hongtao.liu@intel.com>
8341 * config/i386/avx512fp16intrin.h (_mm512_fcmadd_pch):
8343 (_mm512_mask_fcmadd_pch): Likewise.
8344 (_mm512_mask3_fcmadd_pch): Likewise.
8345 (_mm512_maskz_fcmadd_pch): Likewise.
8346 (_mm512_fmadd_pch): Likewise.
8347 (_mm512_mask_fmadd_pch): Likewise.
8348 (_mm512_mask3_fmadd_pch): Likewise.
8349 (_mm512_maskz_fmadd_pch): Likewise.
8350 (_mm512_fcmadd_round_pch): Likewise.
8351 (_mm512_mask_fcmadd_round_pch): Likewise.
8352 (_mm512_mask3_fcmadd_round_pch): Likewise.
8353 (_mm512_maskz_fcmadd_round_pch): Likewise.
8354 (_mm512_fmadd_round_pch): Likewise.
8355 (_mm512_mask_fmadd_round_pch): Likewise.
8356 (_mm512_mask3_fmadd_round_pch): Likewise.
8357 (_mm512_maskz_fmadd_round_pch): Likewise.
8358 (_mm512_fcmul_pch): Likewise.
8359 (_mm512_mask_fcmul_pch): Likewise.
8360 (_mm512_maskz_fcmul_pch): Likewise.
8361 (_mm512_fmul_pch): Likewise.
8362 (_mm512_mask_fmul_pch): Likewise.
8363 (_mm512_maskz_fmul_pch): Likewise.
8364 (_mm512_fcmul_round_pch): Likewise.
8365 (_mm512_mask_fcmul_round_pch): Likewise.
8366 (_mm512_maskz_fcmul_round_pch): Likewise.
8367 (_mm512_fmul_round_pch): Likewise.
8368 (_mm512_mask_fmul_round_pch): Likewise.
8369 (_mm512_maskz_fmul_round_pch): Likewise.
8370 * config/i386/avx512fp16vlintrin.h (_mm_fmadd_pch):
8372 (_mm_mask_fmadd_pch): Likewise.
8373 (_mm_mask3_fmadd_pch): Likewise.
8374 (_mm_maskz_fmadd_pch): Likewise.
8375 (_mm256_fmadd_pch): Likewise.
8376 (_mm256_mask_fmadd_pch): Likewise.
8377 (_mm256_mask3_fmadd_pch): Likewise.
8378 (_mm256_maskz_fmadd_pch): Likewise.
8379 (_mm_fcmadd_pch): Likewise.
8380 (_mm_mask_fcmadd_pch): Likewise.
8381 (_mm_mask3_fcmadd_pch): Likewise.
8382 (_mm_maskz_fcmadd_pch): Likewise.
8383 (_mm256_fcmadd_pch): Likewise.
8384 (_mm256_mask_fcmadd_pch): Likewise.
8385 (_mm256_mask3_fcmadd_pch): Likewise.
8386 (_mm256_maskz_fcmadd_pch): Likewise.
8387 (_mm_fmul_pch): Likewise.
8388 (_mm_mask_fmul_pch): Likewise.
8389 (_mm_maskz_fmul_pch): Likewise.
8390 (_mm256_fmul_pch): Likewise.
8391 (_mm256_mask_fmul_pch): Likewise.
8392 (_mm256_maskz_fmul_pch): Likewise.
8393 (_mm_fcmul_pch): Likewise.
8394 (_mm_mask_fcmul_pch): Likewise.
8395 (_mm_maskz_fcmul_pch): Likewise.
8396 (_mm256_fcmul_pch): Likewise.
8397 (_mm256_mask_fcmul_pch): Likewise.
8398 (_mm256_maskz_fcmul_pch): Likewise.
8399 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF_V8HF_V8HF,
8400 V8HF_FTYPE_V16HF_V16HF_V16HF, V16HF_FTYPE_V16HF_V16HF_V16HF_UQI,
8401 V32HF_FTYPE_V32HF_V32HF_V32HF_INT,
8402 V32HF_FTYPE_V32HF_V32HF_V32HF_UHI_INT): Add new builtin types.
8403 * config/i386/i386-builtin.def: Add new builtins.
8404 * config/i386/i386-expand.c: Handle new builtin types.
8405 * config/i386/subst.md (SUBST_CV): New.
8406 (maskc_name): Ditto.
8407 (maskc_operand3): Ditto.
8409 (sdc_maskz_name): Ditto.
8410 (sdc_mask_op4): Ditto.
8411 (sdc_mask_op5): Ditto.
8412 (sdc_mask_mode512bit_condition): Ditto.
8414 (round_maskc_operand3): Ditto.
8415 (round_sdc_mask_operand4): Ditto.
8416 (round_maskc_op3): Ditto.
8417 (round_sdc_mask_op4): Ditto.
8418 (round_saeonly_sdc_mask_operand5): Ditto.
8419 * config/i386/sse.md (unspec): Add complex fma unspecs.
8420 (avx512fmaskcmode): New.
8421 (UNSPEC_COMPLEX_F_C_MA): Ditto.
8422 (UNSPEC_COMPLEX_F_C_MUL): Ditto.
8423 (complexopname): Ditto.
8424 (<avx512>_fmaddc_<mode>_maskz<round_expand_name>): New expander.
8425 (<avx512>_fcmaddc_<mode>_maskz<round_expand_name>): Ditto.
8426 (fma_<complexopname>_<mode><sdc_maskz_name><round_name>): New
8428 (<avx512>_<complexopname>_<mode>_mask<round_name>): Ditto.
8429 (<avx512>_<complexopname>_<mode><maskc_name><round_name>): Ditto.
8431 2021-09-22 Kewen Lin <linkw@linux.ibm.com>
8433 * config/rs6000/rs6000.opt (rs6000-density-pct-threshold,
8434 rs6000-density-size-threshold, rs6000-density-penalty,
8435 rs6000-density-load-pct-threshold,
8436 rs6000-density-load-num-threshold): New parameter.
8437 * config/rs6000/rs6000.c (rs6000_density_test): Adjust with
8438 corresponding parameters.
8440 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
8442 * gimple-range-path.cc (path_range_query::defined_outside_path):
8444 (path_range_query::range_on_path_entry): New.
8445 (path_range_query::internal_range_of_expr): Resolve unknowns
8447 (path_range_query::improve_range_with_equivs): New.
8448 (path_range_query::ssa_range_in_phi): Resolve unknowns with
8450 * gimple-range-path.h (class path_range_query): Add
8451 defined_outside_path, range_on_path_entry, and
8452 improve_range_with_equivs.
8454 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
8456 * gimple-range-path.cc (path_range_query::add_to_imports): New.
8457 (path_range_query::add_copies_to_imports): New.
8458 (path_range_query::precompute_ranges): Call
8459 add_copies_to_imports.
8460 * gimple-range-path.h (class path_range_query): Add prototypes
8461 for add_copies_to_imports and add_to_imports.
8463 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
8465 * gimple-range-path.cc (path_range_query::range_defined_in_block):
8466 Remove useless code.
8468 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
8470 * gimple-range-fold.h (class fur_source): Make oracle protected.
8471 * gimple-range-path.cc (path_range_query::path_range_query): Add
8472 resolve argument. Initialize oracle.
8473 (path_range_query::~path_range_query): Delete oracle.
8474 (path_range_query::range_of_stmt): Adapt to use relations.
8475 (path_range_query::precompute_ranges): Pre-compute relations.
8476 (class jt_fur_source): New
8477 (jt_fur_source::jt_fur_source): New.
8478 (jt_fur_source::register_relation): New.
8479 (jt_fur_source::query_relation): New.
8480 (path_range_query::precompute_relations): New.
8481 (path_range_query::precompute_phi_relations): New.
8482 * gimple-range-path.h (path_range_query): Add resolve argument.
8483 Add oracle, precompute_relations, precompute_phi_relations.
8484 * tree-ssa-threadbackward.c (back_threader::back_threader): Pass
8485 resolve argument to solver.
8487 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
8489 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
8490 Rename postfold_gcond_edges to register_outgoing_edges and
8492 (fold_using_range::postfold_gcond_edges): Rename...
8493 (fur_source::register_outgoing_edges): ...to this.
8494 * gimple-range-fold.h (postfold_gcond_edges): Rename to
8495 register_outgoing_edges and move to fur_source.
8497 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
8499 * gimple-range-fold.cc (fold_using_range::range_of_phi): Check
8500 dom_info_available_p.
8502 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
8504 * gimple-range-cache.cc (non_null_ref::non_null_ref): Use create
8505 and quick_grow_cleared instead of safe_grow_cleared.
8507 2021-09-21 Thomas Schwinge <thomas@codesourcery.com>
8510 * omp-oacc-neuter-broadcast.cc (oacc_do_neutering): Evaluate
8513 2021-09-21 Richard Earnshaw <rearnsha@arm.com>
8515 * configure.ac: Detect when the assembler supports new-style
8516 architecture extensions.
8517 * common/config/arm/arm-common.c (arm_rewrite_mcpu): Return
8518 the full CPU string if the assembler can grok it.
8519 (arm_rewrite_march): Likewise but for the architecture.
8520 * config.in: Regenerate.
8521 * configure: Regenerate.
8523 2021-09-21 Richard Biener <rguenther@suse.de>
8525 PR tree-optimization/102421
8526 * tree-vect-loop.c (vect_dissolve_slp_only_groups): Copy and
8527 adjust alignment info.
8529 2021-09-21 Kewen Lin <linkw@linux.ibm.com>
8531 * ipa-fnsummary.c (ipa_fn_summary_write): Remove inconsistent
8532 bitfield stream out.
8534 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
8536 * gimple-range-fold.cc (fold_using_range::range_of_phi): Ignore
8537 undefined edges, apply an equivalence if appropriate.
8538 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Return
8539 UNDEFINED if EDGE_EXECUTABLE is not set.
8540 * gimple-range.cc (gimple_ranger::gimple_ranger): Set all edges
8541 as EXECUTABLE upon startup.
8542 (gimple_ranger::range_on_edge): Return UNDEFINED for edges without
8543 EDGE_EXECUTABLE set.
8544 * vr-values.c (set_and_propagate_unexecutable): New.
8545 (simplify_using_ranges::fold_cond): Call set_and_propagate.
8546 (simplify_using_ranges::simplify_switch_using_ranges): Ditto.
8547 * vr-values.h: Add prototype.
8549 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
8551 * value-relation.cc (equiv_oracle::register_initial_def): New.
8552 (equiv_oracle::register_relation): Call register_initial_def.
8553 (equiv_oracle::add_equiv_to_block): New. Split register_relation.
8554 (relation_oracle::register_stmt): Check def block of PHI arguments.
8555 * value-relation.h (equiv_oracle): Add new prototypes.
8557 2021-09-20 Matthias Kretz <m.kretz@gsi.de>
8559 * cppbuiltin.c (define_builtin_macros_for_compilation_flags):
8560 Define __RECIPROCAL_MATH__, __NO_SIGNED_ZEROS__,
8561 __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, and
8562 __ROUNDING_MATH__ according to their corresponding flags.
8563 * doc/cpp.texi: Document __RECIPROCAL_MATH__,
8564 __NO_SIGNED_ZEROS__, __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__,
8565 and __ROUNDING_MATH__.
8567 2021-09-20 Richard Biener <rguenther@suse.de>
8569 * tree-vect-stmts.c (vectorizable_load): Use the vectype
8572 2021-09-20 Richard Biener <rguenther@suse.de>
8574 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
8575 Do not compute alignment of the vectorized access here.
8577 2021-09-20 Richard Biener <rguenther@suse.de>
8579 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
8580 Store -1 for runtime alias peeling iterations.
8582 2021-09-20 Richard Biener <rguenther@suse.de>
8584 * config.gcc: Obsolete hppa[12]*-*-hpux10* and hppa[12]*-*-hpux11*.
8586 2021-09-20 Thomas Schwinge <thomas@codesourcery.com>
8588 * input.c (string_concat_db::record_string_concatenation)
8589 (string_concat_db::get_string_concatenation): Skip for
8590 'RESERVED_LOCATION_P'.
8592 2021-09-20 Richard Biener <rguenther@suse.de>
8594 PR tree-optimization/65206
8595 * tree-data-ref.h (struct data_reference): Add alt_indices,
8597 * tree-data-ref.c (free_data_ref): Release alt_indices.
8598 (dr_analyze_indices): Work on struct indices and get DR_REF as tree.
8599 (create_data_ref): Adjust.
8600 (initialize_data_dependence_relation): Split into head
8601 and tail. When the base objects fail to match up try
8602 again with pointer-based analysis of indices.
8603 * tree-vectorizer.c (vec_info_shared::check_datarefs): Do
8604 not compare the lazily computed alternate set of indices.
8606 2021-09-20 Iain Sandoe <iain@sandoe.co.uk>
8608 * gcc.c: Test for execute OK when we find the
8609 programs for assembler linker and dsymutil and those
8610 were specified at configure-time.
8612 2021-09-19 Martin Sebor <msebor@redhat.com>
8614 PR middle-end/102403
8615 * gimple-predicate-analysis.cc (predicate::init_from_control_deps):
8616 Correct a function pre/postcondition.
8618 2021-09-19 Martin Sebor <msebor@redhat.com>
8620 PR middle-end/102243
8621 * tree-ssa-strlen.c (get_range): Handle null cfun.
8623 2021-09-19 Iain Sandoe <iain@sandoe.co.uk>
8625 * config/darwin.h (LINK_COMMAND_SPEC_A): Use Darwin10
8626 unwinder shim as a convenience library.
8628 2021-09-19 Andrew Pinski <apinski@marvell.com>
8630 * doc/install.texi: Add note about
8631 binutils 2.35 is required for LTO usage.
8633 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
8635 * tree-ssa-threadbackward.c
8636 (back_threader_registry::register_path): Use push_edge.
8637 * tree-ssa-threadedge.c
8638 (jump_threader::thread_around_empty_blocks): Same.
8639 (jump_threader::thread_through_normal_block): Same.
8640 (jump_threader::thread_across_edge): Same. Also, use auto_bitmap.
8642 * tree-ssa-threadupdate.c
8643 (jt_path_registry::allocate_thread_edge): Remove.
8644 (jt_path_registry::push_edge): New.
8645 (dump_jump_thread_path): Make static.
8646 * tree-ssa-threadupdate.h (allocate_thread_edge): Remove.
8649 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
8651 * gimple-range-path.cc (path_range_query::path_range_query): Add
8653 (path_range_query::dump): Remove extern declaration of dump_ranger.
8654 * gimple-range-trace.cc (dump_ranger): Add DEBUG_FUNCTION marker.
8655 * gimple-range-trace.h (dump_ranger): Add prototype.
8657 2021-09-19 John Ericson <git@JohnEricson.me>
8659 * gcc.c (find_a_program): New function, factored out of...
8660 (find_a_file): Here.
8661 (execute): Use find_a_program when looking for programs rather
8664 2021-09-19 Matwey V. Kornilov <matwey.kornilov@gmail.com>
8666 * config/avr/avr-mcus.def: Add atmega324pb.
8667 * doc/avr-mmcu.texi: Corresponding changes.
8669 2021-09-19 Roger Sayle <roger@nextmovesoftware.com>
8672 * match.pd (cmp @0 REAL_CST@1): When @0 is also REAL_CST, apply
8673 the same transformations as to @1. For comparisons against NaN,
8674 don't check HONOR_SNANS but confirm that neither operand is a
8677 2021-09-19 Benjamin Peterson <benjamin@locrian.net>
8679 * attribs.c (make_unique_name): Delete.
8680 * attribs.h (make_unique_name): Delete.
8682 2021-09-19 Andrew Pinski <apinski@marvell.com>
8684 * lra-constraints.c (check_and_process_move): Assert
8685 that dclass and sclass are greater than or equal to NO_REGS.
8687 2021-09-18 Jakub Jelinek <jakub@redhat.com>
8689 * tree.h (OMP_CLAUSE_ORDER_UNCONSTRAINED): Define.
8690 * tree-pretty-print.c (dump_omp_clause): Print unconstrained:
8691 for OMP_CLAUSE_ORDER_UNCONSTRAINED.
8693 2021-09-18 liuhongt <hongtao.liu@intel.com>
8695 * config/i386/i386-features.c (remove_partial_avx_dependency):
8696 Restrict TARGET_USE_VECTOR_FP_CONVERTS and
8697 TARGET_USE_VECTOR_CONVERTS to conversion instructions only.
8699 2021-09-18 Jakub Jelinek <jakub@redhat.com>
8701 * gimplify.c (omp_default_clause): For C/C++ default({,first}private),
8702 if file/namespace scope variable doesn't have predetermined sharing,
8703 treat it as if there was default(none).
8705 2021-09-18 liuhongt <hongtao.liu@intel.com>
8707 * config/i386/avx512fp16intrin.h (_mm_fmadd_sh):
8709 (_mm_mask_fmadd_sh): Likewise.
8710 (_mm_mask3_fmadd_sh): Likewise.
8711 (_mm_maskz_fmadd_sh): Likewise.
8712 (_mm_fmadd_round_sh): Likewise.
8713 (_mm_mask_fmadd_round_sh): Likewise.
8714 (_mm_mask3_fmadd_round_sh): Likewise.
8715 (_mm_maskz_fmadd_round_sh): Likewise.
8716 (_mm_fnmadd_sh): Likewise.
8717 (_mm_mask_fnmadd_sh): Likewise.
8718 (_mm_mask3_fnmadd_sh): Likewise.
8719 (_mm_maskz_fnmadd_sh): Likewise.
8720 (_mm_fnmadd_round_sh): Likewise.
8721 (_mm_mask_fnmadd_round_sh): Likewise.
8722 (_mm_mask3_fnmadd_round_sh): Likewise.
8723 (_mm_maskz_fnmadd_round_sh): Likewise.
8724 (_mm_fmsub_sh): Likewise.
8725 (_mm_mask_fmsub_sh): Likewise.
8726 (_mm_mask3_fmsub_sh): Likewise.
8727 (_mm_maskz_fmsub_sh): Likewise.
8728 (_mm_fmsub_round_sh): Likewise.
8729 (_mm_mask_fmsub_round_sh): Likewise.
8730 (_mm_mask3_fmsub_round_sh): Likewise.
8731 (_mm_maskz_fmsub_round_sh): Likewise.
8732 (_mm_fnmsub_sh): Likewise.
8733 (_mm_mask_fnmsub_sh): Likewise.
8734 (_mm_mask3_fnmsub_sh): Likewise.
8735 (_mm_maskz_fnmsub_sh): Likewise.
8736 (_mm_fnmsub_round_sh): Likewise.
8737 (_mm_mask_fnmsub_round_sh): Likewise.
8738 (_mm_mask3_fnmsub_round_sh): Likewise.
8739 (_mm_maskz_fnmsub_round_sh): Likewise.
8740 * config/i386/i386-builtin-types.def
8741 (V8HF_FTYPE_V8HF_V8HF_V8HF_UQI_INT): New builtin type.
8742 * config/i386/i386-builtin.def: Add new builtins.
8743 * config/i386/i386-expand.c: Handle new builtin type.
8744 * config/i386/sse.md (fmai_vmfmadd_<mode><round_name>):
8745 Ajdust to support FP16.
8746 (fmai_vmfmsub_<mode><round_name>): Ditto.
8747 (fmai_vmfnmadd_<mode><round_name>): Ditto.
8748 (fmai_vmfnmsub_<mode><round_name>): Ditto.
8749 (*fmai_fmadd_<mode>): Ditto.
8750 (*fmai_fmsub_<mode>): Ditto.
8751 (*fmai_fnmadd_<mode><round_name>): Ditto.
8752 (*fmai_fnmsub_<mode><round_name>): Ditto.
8753 (avx512f_vmfmadd_<mode>_mask<round_name>): Ditto.
8754 (avx512f_vmfmadd_<mode>_mask3<round_name>): Ditto.
8755 (avx512f_vmfmadd_<mode>_maskz<round_expand_name>): Ditto.
8756 (avx512f_vmfmadd_<mode>_maskz_1<round_name>): Ditto.
8757 (*avx512f_vmfmsub_<mode>_mask<round_name>): Ditto.
8758 (avx512f_vmfmsub_<mode>_mask3<round_name>): Ditto.
8759 (*avx512f_vmfmsub_<mode>_maskz_1<round_name>): Ditto.
8760 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
8761 (*avx512f_vmfnmsub_<mode>_mask3<round_name>): Ditto.
8762 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
8763 (*avx512f_vmfnmadd_<mode>_mask<round_name>): Renamed to ...
8764 (avx512f_vmfnmadd_<mode>_mask<round_name>) ... this, and
8765 adjust to support FP16.
8766 (avx512f_vmfnmadd_<mode>_mask3<round_name>): Ditto.
8767 (avx512f_vmfnmadd_<mode>_maskz_1<round_name>): Ditto.
8768 (avx512f_vmfnmadd_<mode>_maskz<round_expand_name>): New
8771 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
8773 * config/i386/sse.md (avx512fmaskmodelower): Extend to support
8775 (maskload<mode><avx512fmaskmodelower>): Ditto.
8776 (maskstore<mode><avx512fmaskmodelower>): Ditto.
8778 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
8780 * config/i386/i386-expand.c (ix86_expand_fp_absneg_operator):
8782 (ix86_expand_copysign): Ditto.
8783 (ix86_expand_xorsign): Ditto.
8784 * config/i386/i386.c (ix86_build_const_vector): Handle HF vector
8786 (ix86_build_signbit_mask): Ditto.
8787 (ix86_can_change_mode_class): Ditto.
8788 * config/i386/i386.md
8789 (SSEMODEF): Add HFmode.
8790 (ssevecmodef): Ditto.
8791 (<code>hf2): New define_expand.
8792 (*<code>hf2_1): New define_insn_and_split.
8793 (copysign<mode>): Extend to support HFmode under AVX512FP16.
8794 (xorsign<mode>): Ditto.
8795 * config/i386/sse.md (VFB): New mode iterator.
8796 (VFB_128_256): Ditto.
8798 (sseintvecmode2): Support HF vector mode.
8799 (<code><mode>2): Use new mode iterator.
8800 (*<code><mode>2): Ditto.
8801 (copysign<mode>3): Ditto.
8802 (xorsign<mode>3): Ditto.
8803 (<code><mode>3<mask_name>): Ditto.
8804 (<code><mode>3<mask_name>): Ditto.
8805 (<sse>_andnot<mode>3<mask_name>): Adjust for HF vector mode.
8806 (<sse>_andnot<mode>3<mask_name>): Ditto.
8807 (*<code><mode>3<mask_name>): Ditto.
8808 (*<code><mode>3<mask_name>): Ditto.
8810 2021-09-18 liuhongt <hongtao.liu@intel.com>
8812 * config/i386/avx512fp16intrin.h (_mm512_mask_fmadd_ph):
8814 (_mm512_mask3_fmadd_ph): Likewise.
8815 (_mm512_maskz_fmadd_ph): Likewise.
8816 (_mm512_fmadd_round_ph): Likewise.
8817 (_mm512_mask_fmadd_round_ph): Likewise.
8818 (_mm512_mask3_fmadd_round_ph): Likewise.
8819 (_mm512_maskz_fmadd_round_ph): Likewise.
8820 (_mm512_fnmadd_ph): Likewise.
8821 (_mm512_mask_fnmadd_ph): Likewise.
8822 (_mm512_mask3_fnmadd_ph): Likewise.
8823 (_mm512_maskz_fnmadd_ph): Likewise.
8824 (_mm512_fnmadd_round_ph): Likewise.
8825 (_mm512_mask_fnmadd_round_ph): Likewise.
8826 (_mm512_mask3_fnmadd_round_ph): Likewise.
8827 (_mm512_maskz_fnmadd_round_ph): Likewise.
8828 (_mm512_fmsub_ph): Likewise.
8829 (_mm512_mask_fmsub_ph): Likewise.
8830 (_mm512_mask3_fmsub_ph): Likewise.
8831 (_mm512_maskz_fmsub_ph): Likewise.
8832 (_mm512_fmsub_round_ph): Likewise.
8833 (_mm512_mask_fmsub_round_ph): Likewise.
8834 (_mm512_mask3_fmsub_round_ph): Likewise.
8835 (_mm512_maskz_fmsub_round_ph): Likewise.
8836 (_mm512_fnmsub_ph): Likewise.
8837 (_mm512_mask_fnmsub_ph): Likewise.
8838 (_mm512_mask3_fnmsub_ph): Likewise.
8839 (_mm512_maskz_fnmsub_ph): Likewise.
8840 (_mm512_fnmsub_round_ph): Likewise.
8841 (_mm512_mask_fnmsub_round_ph): Likewise.
8842 (_mm512_mask3_fnmsub_round_ph): Likewise.
8843 (_mm512_maskz_fnmsub_round_ph): Likewise.
8844 * config/i386/avx512fp16vlintrin.h (_mm256_fmadd_ph):
8846 (_mm256_mask_fmadd_ph): Likewise.
8847 (_mm256_mask3_fmadd_ph): Likewise.
8848 (_mm256_maskz_fmadd_ph): Likewise.
8849 (_mm_fmadd_ph): Likewise.
8850 (_mm_mask_fmadd_ph): Likewise.
8851 (_mm_mask3_fmadd_ph): Likewise.
8852 (_mm_maskz_fmadd_ph): Likewise.
8853 (_mm256_fnmadd_ph): Likewise.
8854 (_mm256_mask_fnmadd_ph): Likewise.
8855 (_mm256_mask3_fnmadd_ph): Likewise.
8856 (_mm256_maskz_fnmadd_ph): Likewise.
8857 (_mm_fnmadd_ph): Likewise.
8858 (_mm_mask_fnmadd_ph): Likewise.
8859 (_mm_mask3_fnmadd_ph): Likewise.
8860 (_mm_maskz_fnmadd_ph): Likewise.
8861 (_mm256_fmsub_ph): Likewise.
8862 (_mm256_mask_fmsub_ph): Likewise.
8863 (_mm256_mask3_fmsub_ph): Likewise.
8864 (_mm256_maskz_fmsub_ph): Likewise.
8865 (_mm_fmsub_ph): Likewise.
8866 (_mm_mask_fmsub_ph): Likewise.
8867 (_mm_mask3_fmsub_ph): Likewise.
8868 (_mm_maskz_fmsub_ph): Likewise.
8869 (_mm256_fnmsub_ph): Likewise.
8870 (_mm256_mask_fnmsub_ph): Likewise.
8871 (_mm256_mask3_fnmsub_ph): Likewise.
8872 (_mm256_maskz_fnmsub_ph): Likewise.
8873 (_mm_fnmsub_ph): Likewise.
8874 (_mm_mask_fnmsub_ph): Likewise.
8875 (_mm_mask3_fnmsub_ph): Likewise.
8876 (_mm_maskz_fnmsub_ph): Likewise.
8877 * config/i386/i386-builtin.def: Add corresponding new builtins.
8878 * config/i386/sse.md
8879 (<avx512>_fmadd_<mode>_maskz<round_expand_name>): Adjust to
8880 support HF vector modes.
8881 (<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name><round_name>):
8883 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
8884 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
8885 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
8886 (<avx512>_fmadd_<mode>_mask<round_name>): Ditto.
8887 (<avx512>_fmadd_<mode>_mask3<round_name>): Ditto.
8888 (<avx512>_fmsub_<mode>_maskz<round_expand_name>): Ditto.
8889 (<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name><round_name>):
8891 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
8892 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
8893 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
8894 (<avx512>_fmsub_<mode>_mask<round_name>): Ditto.
8895 (<avx512>_fmsub_<mode>_mask3<round_name>): Ditto.
8896 (<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name><round_name>):
8898 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
8899 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
8900 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
8901 (<avx512>_fnmadd_<mode>_mask<round_name>): Ditto.
8902 (<avx512>_fnmadd_<mode>_mask3<round_name>): Ditto.
8903 (<avx512>_fnmsub_<mode>_maskz<round_expand_name>): Ditto.
8904 (<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name><round_name>):
8906 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
8907 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
8908 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
8909 (<avx512>_fnmsub_<mode>_mask<round_name>): Ditto.
8910 (<avx512>_fnmsub_<mode>_mask3<round_name>): Ditto.
8912 2021-09-18 liuhongt <hongtao.liu@intel.com>
8914 * config/i386/avx512fp16intrin.h (_mm512_fmaddsub_ph):
8916 (_mm512_mask_fmaddsub_ph): Likewise.
8917 (_mm512_mask3_fmaddsub_ph): Likewise.
8918 (_mm512_maskz_fmaddsub_ph): Likewise.
8919 (_mm512_fmaddsub_round_ph): Likewise.
8920 (_mm512_mask_fmaddsub_round_ph): Likewise.
8921 (_mm512_mask3_fmaddsub_round_ph): Likewise.
8922 (_mm512_maskz_fmaddsub_round_ph): Likewise.
8923 (_mm512_mask_fmsubadd_ph): Likewise.
8924 (_mm512_mask3_fmsubadd_ph): Likewise.
8925 (_mm512_maskz_fmsubadd_ph): Likewise.
8926 (_mm512_fmsubadd_round_ph): Likewise.
8927 (_mm512_mask_fmsubadd_round_ph): Likewise.
8928 (_mm512_mask3_fmsubadd_round_ph): Likewise.
8929 (_mm512_maskz_fmsubadd_round_ph): Likewise.
8930 * config/i386/avx512fp16vlintrin.h (_mm256_fmaddsub_ph):
8932 (_mm256_mask_fmaddsub_ph): Likewise.
8933 (_mm256_mask3_fmaddsub_ph): Likewise.
8934 (_mm256_maskz_fmaddsub_ph): Likewise.
8935 (_mm_fmaddsub_ph): Likewise.
8936 (_mm_mask_fmaddsub_ph): Likewise.
8937 (_mm_mask3_fmaddsub_ph): Likewise.
8938 (_mm_maskz_fmaddsub_ph): Likewise.
8939 (_mm256_fmsubadd_ph): Likewise.
8940 (_mm256_mask_fmsubadd_ph): Likewise.
8941 (_mm256_mask3_fmsubadd_ph): Likewise.
8942 (_mm256_maskz_fmsubadd_ph): Likewise.
8943 (_mm_fmsubadd_ph): Likewise.
8944 (_mm_mask_fmsubadd_ph): Likewise.
8945 (_mm_mask3_fmsubadd_ph): Likewise.
8946 (_mm_maskz_fmsubadd_ph): Likewise.
8947 * config/i386/i386-builtin.def: Add corresponding new builtins.
8948 * config/i386/sse.md (VFH_SF_AVX512VL): New mode iterator.
8949 * (<avx512>_fmsubadd_<mode>_maskz<round_expand_name>): New expander.
8950 * (<avx512>_fmaddsub_<mode>_maskz<round_expand_name>): Use
8952 * (<sd_mask_codefor>fma_fmaddsub_<mode><sd_maskz_name><round_name>):
8954 * (<avx512>_fmaddsub_<mode>_mask<round_name>): Ditto.
8955 * (<avx512>_fmaddsub_<mode>_mask3<round_name>): Ditto.
8956 * (<sd_mask_codefor>fma_fmsubadd_<mode><sd_maskz_name><round_name>):
8958 * (<avx512>_fmsubadd_<mode>_mask<round_name>): Ditto.
8959 * (<avx512>_fmsubadd_<mode>_mask3<round_name>): Ditto.
8961 2021-09-18 liuhongt <hongtao.liu@intel.com>
8964 * config/i386/i386.c (ix86_print_operand): Handle
8965 V8HF/V16HF/V32HFmode.
8966 * config/i386/i386.h (VALID_BCST_MODE_P): Add HFmode.
8967 * config/i386/sse.md (avx512bcst): Remove.
8969 2021-09-17 Martin Sebor <msebor@redhat.com>
8971 * Makefile.in (OBJS): Add gimple-predicate-analysis.o.
8972 * tree-ssa-uninit.c (max_phi_args): Move to gimple-predicate-analysis.
8973 (MASK_SET_BIT, MASK_TEST_BIT, MASK_EMPTY): Same.
8974 (check_defs): Add comment.
8975 (can_skip_redundant_opnd): Update comment.
8976 (compute_uninit_opnds_pos): Adjust to namespace change.
8977 (find_pdom): Move to gimple-predicate-analysis.cc.
8979 (struct uninit_undef_val_t): New.
8980 (is_non_loop_exit_postdominating): Move to gimple-predicate-analysis.cc.
8981 (find_control_equiv_block): Same.
8982 (MAX_NUM_CHAINS, MAX_CHAIN_LEN, MAX_POSTDOM_CHECK): Same.
8983 (MAX_SWITCH_CASES): Same.
8984 (compute_control_dep_chain): Same.
8985 (find_uninit_use): Use predicate analyzer.
8986 (struct pred_info): Move to gimple-predicate-analysis.
8987 (convert_control_dep_chain_into_preds): Same.
8988 (find_predicates): Same.
8989 (collect_phi_def_edges): Same.
8990 (warn_uninitialized_phi): Use predicate analyzer.
8991 (find_def_preds): Move to gimple-predicate-analysis.
8992 (dump_pred_info): Same.
8993 (dump_pred_chain): Same.
8994 (dump_predicates): Same.
8995 (destroy_predicate_vecs): Remove.
8996 (execute_late_warn_uninitialized): New.
8997 (get_cmp_code): Move to gimple-predicate-analysis.
8998 (is_value_included_in): Same.
8999 (value_sat_pred_p): Same.
9000 (find_matching_predicate_in_rest_chains): Same.
9001 (is_use_properly_guarded): Same.
9002 (prune_uninit_phi_opnds): Same.
9003 (find_var_cmp_const): Same.
9004 (use_pred_not_overlap_with_undef_path_pred): Same.
9005 (pred_equal_p): Same.
9006 (is_neq_relop_p): Same.
9007 (is_neq_zero_form_p): Same.
9008 (pred_expr_equal_p): Same.
9009 (is_pred_expr_subset_of): Same.
9010 (is_pred_chain_subset_of): Same.
9011 (is_included_in): Same.
9012 (is_superset_of): Same.
9014 (simplify_pred): Same.
9015 (simplify_preds_2): Same.
9016 (simplify_preds_3): Same.
9017 (simplify_preds_4): Same.
9018 (simplify_preds): Same.
9020 (push_to_worklist): Same.
9021 (get_pred_info_from_cmp): Same.
9022 (is_degenerated_phi): Same.
9023 (normalize_one_pred_1): Same.
9024 (normalize_one_pred): Same.
9025 (normalize_one_pred_chain): Same.
9026 (normalize_preds): Same.
9027 (can_one_predicate_be_invalidated_p): Same.
9028 (can_chain_union_be_invalidated_p): Same.
9029 (uninit_uses_cannot_happen): Same.
9030 (pass_late_warn_uninitialized::execute): Define.
9031 * gimple-predicate-analysis.cc: New file.
9032 * gimple-predicate-analysis.h: New file.
9034 2021-09-17 Julian Brown <julian@codesourcery.com>
9036 * config/gcn/gcn.c (gimple.h): Include.
9037 (gcn_fork_join): Emit barrier for worker-level joins.
9038 * omp-oacc-neuter-broadcast.cc (find_local_vars_to_propagate): Add
9039 writes_gang_private bitmap parameter. Set bit for blocks
9040 containing gang-private variable writes.
9041 (worker_single_simple): Don't emit barrier after predicated block.
9042 (worker_single_copy): Don't emit barrier if we're not broadcasting
9043 anything and the block contains no gang-private writes.
9044 (neuter_worker_single): Don't predicate blocks that only contain
9045 NOPs or internal marker functions. Pass has_gang_private_write
9046 argument to worker_single_copy.
9047 (oacc_do_neutering): Add writes_gang_private bitmap handling.
9049 2021-09-17 Julian Brown <julian@codesourcery.com>
9051 * config/gcn/gcn-protos.h
9052 (gcn_goacc_create_worker_broadcast_record): Update prototype.
9053 * config/gcn/gcn-tree.c (gcn_goacc_get_worker_red_decl): Use
9054 preallocated block of LDS memory. Do not cache/share decls for
9055 reduction temporaries between invocations.
9056 (gcn_goacc_reduction_teardown): Unshare VAR on second use.
9057 (gcn_goacc_create_worker_broadcast_record): Add OFFSET parameter
9058 and return temporary LDS space at that offset. Return pointer in
9060 * config/gcn/gcn.c (acc_lds_size, gang_private_hwm, lds_allocs):
9062 (ACC_LDS_SIZE): Define as acc_lds_size.
9063 (gcn_init_machine_status): Don't initialise lds_allocated,
9064 lds_allocs, reduc_decls fields of machine function struct.
9065 (gcn_option_override): Handle default size for gang-private
9066 variables and -mgang-private-size option.
9067 (gcn_expand_prologue): Use LDS_SIZE instead of LDS_SIZE-1 when
9068 initialising M0_REG.
9069 (gcn_shared_mem_layout): New function.
9070 (gcn_print_lds_decl): Update comment. Use global lds_allocs map and
9071 gang_private_hwm variable.
9072 (TARGET_GOACC_SHARED_MEM_LAYOUT): Define target hook.
9073 * config/gcn/gcn.h (machine_function): Remove lds_allocated,
9074 lds_allocs, reduc_decls. Add reduction_base, reduction_limit.
9075 * config/gcn/gcn.opt (gang_private_size_opt): New global.
9076 (mgang-private-size=): New option.
9077 * doc/tm.texi.in (TARGET_GOACC_SHARED_MEM_LAYOUT): Place
9079 * doc/tm.texi: Regenerate.
9080 * omp-oacc-neuter-broadcast.cc (targhooks.h, diagnostic-core.h):
9082 (build_sender_ref): Handle sender_decl being pointer.
9083 (worker_single_copy): Add PLACEMENT and ISOLATE_BROADCASTS
9084 parameters. Pass placement argument to
9085 create_worker_broadcast_record hook invocations. Handle
9086 sender_decl being pointer and isolate_broadcasts inserting extra
9088 (blk_offset_map_t): Add typedef.
9089 (neuter_worker_single): Add BLK_OFFSET_MAP parameter. Pass
9090 preallocated range to worker_single_copy call.
9091 (dfs_broadcast_reachable_1): New function.
9092 (idx_decl_pair_t, used_range_vec_t): New typedefs.
9093 (sort_size_descending): New function.
9094 (addr_range): New class.
9095 (splay_tree_compare_addr_range, splay_tree_free_key)
9096 (first_fit_range, merge_ranges_1, merge_ranges): New functions.
9097 (execute_omp_oacc_neuter_broadcast): Rename to...
9098 (oacc_do_neutering): ... this. Add BOUNDS_LO, BOUNDS_HI
9099 parameters. Arrange layout of shared memory for broadcast
9101 (execute_omp_oacc_neuter_broadcast): New function.
9102 (pass_omp_oacc_neuter_broadcast::gate): Remove num_workers==1
9103 handling from here. Enable pass for all OpenACC routines in order
9104 to call shared memory-layout hook.
9105 * target.def (create_worker_broadcast_record): Add OFFSET
9107 (shared_mem_layout): New hook.
9109 2021-09-17 Julian Brown <julian@codesourcery.com>
9110 Thomas Schwinge <thomas@codesourcery.com>
9112 * omp-oacc-neuter-broadcast.cc
9113 (pass_omp_oacc_neuter_broadcast::gate): Disable if num_workers is
9115 (execute_omp_oacc_neuter_broadcast): Adjust.
9117 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
9119 * value-relation.cc (class equiv_chain): Move to header file.
9120 (path_oracle::path_oracle): New.
9121 (path_oracle::~path_oracle): New.
9122 (path_oracle::register_relation): New.
9123 (path_oracle::query_relation): New.
9124 (path_oracle::reset_path): New.
9125 (path_oracle::dump): New.
9126 * value-relation.h (class equiv_chain): Move to here.
9127 (class path_oracle): New.
9129 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
9131 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a DOM
9133 * gimple-range-fold.cc (fur_depend::register_relation): Use
9134 register_stmt/edge routines.
9135 * value-relation.cc (equiv_chain::find): Relocate from equiv_oracle.
9136 (equiv_oracle::equiv_oracle): Create self equivalence cache.
9137 (equiv_oracle::~equiv_oracle): Release same.
9138 (equiv_oracle::equiv_set): Return entry from self equiv cache if there
9139 are no equivalences.
9140 (equiv_oracle::find_equiv_block): Move list find to equiv_chain.
9141 (equiv_oracle::register_relation): Rename from register_equiv.
9142 (relation_chain_head::find_relation): Relocate from dom_oracle.
9143 (relation_oracle::register_stmt): New.
9144 (relation_oracle::register_edge): New.
9145 (dom_oracle::*): Rename from relation_oracle.
9146 (dom_oracle::register_relation): Adjust to call equiv_oracle.
9147 (dom_oracle::set_one_relation): Split from register_relation.
9148 (dom_oracle::register_transitives): Consolidate 2 methods.
9149 (dom_oracle::find_relation_block): Move core to relation_chain.
9150 (dom_oracle::query_relation): Rename from find_relation_dom and adjust.
9151 * value-relation.h (class relation_oracle): New pure virtual base.
9152 (class equiv_oracle): Inherit from relation_oracle and adjust.
9153 (class dom_oracle): Rename from old relation_oracle and adjust.
9155 2021-09-17 Martin Sebor <msebor@redhat.com>
9157 PR middle-end/102200
9158 * pointer-query.cc (access_ref::inform_access): Handle MIN/MAX_EXPR.
9159 (handle_min_max_size): Change argument. Store original SSA_NAME for
9160 operands to potentially distinct (sub)objects.
9161 (compute_objsize_r): Adjust call to the above.
9163 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
9165 * config/rs6000/rs6000.c (rs6000-builtins.h): New include.
9166 (rs6000_new_builtin_vectorized_function): New function.
9167 (rs6000_new_builtin_md_vectorized_function): Likewise.
9168 (rs6000_builtin_vectorized_function): Call
9169 rs6000_new_builtin_vectorized_function.
9170 (rs6000_builtin_md_vectorized_function): Call
9171 rs6000_new_builtin_md_vectorized_function.
9173 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
9175 * config/rs6000/rs6000-builtin-new.def (ASSEMBLE_ACC): Add mmaint flag.
9176 (ASSEMBLE_PAIR): Likewise.
9177 (BUILD_ACC): Likewise.
9178 (DISASSEMBLE_ACC): Likewise.
9179 (DISASSEMBLE_PAIR): Likewise.
9180 (PMXVBF16GER2): Likewise.
9181 (PMXVBF16GER2NN): Likewise.
9182 (PMXVBF16GER2NP): Likewise.
9183 (PMXVBF16GER2PN): Likewise.
9184 (PMXVBF16GER2PP): Likewise.
9185 (PMXVF16GER2): Likewise.
9186 (PMXVF16GER2NN): Likewise.
9187 (PMXVF16GER2NP): Likewise.
9188 (PMXVF16GER2PN): Likewise.
9189 (PMXVF16GER2PP): Likewise.
9190 (PMXVF32GER): Likewise.
9191 (PMXVF32GERNN): Likewise.
9192 (PMXVF32GERNP): Likewise.
9193 (PMXVF32GERPN): Likewise.
9194 (PMXVF32GERPP): Likewise.
9195 (PMXVF64GER): Likewise.
9196 (PMXVF64GERNN): Likewise.
9197 (PMXVF64GERNP): Likewise.
9198 (PMXVF64GERPN): Likewise.
9199 (PMXVF64GERPP): Likewise.
9200 (PMXVI16GER2): Likewise.
9201 (PMXVI16GER2PP): Likewise.
9202 (PMXVI16GER2S): Likewise.
9203 (PMXVI16GER2SPP): Likewise.
9204 (PMXVI4GER8): Likewise.
9205 (PMXVI4GER8PP): Likewise.
9206 (PMXVI8GER4): Likewise.
9207 (PMXVI8GER4PP): Likewise.
9208 (PMXVI8GER4SPP): Likewise.
9209 (XVBF16GER2): Likewise.
9210 (XVBF16GER2NN): Likewise.
9211 (XVBF16GER2NP): Likewise.
9212 (XVBF16GER2PN): Likewise.
9213 (XVBF16GER2PP): Likewise.
9214 (XVF16GER2): Likewise.
9215 (XVF16GER2NN): Likewise.
9216 (XVF16GER2NP): Likewise.
9217 (XVF16GER2PN): Likewise.
9218 (XVF16GER2PP): Likewise.
9219 (XVF32GER): Likewise.
9220 (XVF32GERNN): Likewise.
9221 (XVF32GERNP): Likewise.
9222 (XVF32GERPN): Likewise.
9223 (XVF32GERPP): Likewise.
9224 (XVF64GER): Likewise.
9225 (XVF64GERNN): Likewise.
9226 (XVF64GERNP): Likewise.
9227 (XVF64GERPN): Likewise.
9228 (XVF64GERPP): Likewise.
9229 (XVI16GER2): Likewise.
9230 (XVI16GER2PP): Likewise.
9231 (XVI16GER2S): Likewise.
9232 (XVI16GER2SPP): Likewise.
9233 (XVI4GER8): Likewise.
9234 (XVI4GER8PP): Likewise.
9235 (XVI8GER4): Likewise.
9236 (XVI8GER4PP): Likewise.
9237 (XVI8GER4SPP): Likewise.
9238 (XXMFACC): Likewise.
9239 (XXMTACC): Likewise.
9240 (XXSETACCZ): Likewise.
9241 (ASSEMBLE_PAIR_V): Likewise.
9242 (BUILD_PAIR): Likewise.
9243 (DISASSEMBLE_PAIR_V): Likewise.
9246 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_mma_builtin):
9247 Handle RS6000_BIF_LXVP and RS6000_BIF_STXVP.
9248 * config/rs6000/rs6000-gen-builtins.c (attrinfo): Add ismmaint.
9249 (parse_bif_attrs): Handle ismmaint.
9250 (write_decls): Add bif_mmaint_bit and bif_is_mmaint.
9251 (write_bif_static_init): Handle ismmaint.
9253 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
9255 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin): New
9257 (rs6000_gimple_fold_builtin): Call rs6000_gimple_fold_new_builtin.
9258 (rs6000_new_builtin_valid_without_lhs): New function.
9259 (rs6000_gimple_fold_new_mma_builtin): Likewise.
9260 (rs6000_gimple_fold_new_builtin): Likewise.
9262 2021-09-17 Thomas Schwinge <thomas@codesourcery.com>
9264 * hash-table.h (hash_table<Descriptor, Lazy, Allocator>::expand):
9265 Destruct stale Value objects.
9266 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
9269 2021-09-17 Roger Sayle <roger@nextmovesoftware.com>
9272 * match.pd (shift optimizations): Disable recent sign-changing
9273 optimization for shifts by zero, these will be folded later.
9275 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
9277 * config/rs6000/rs6000-builtin-new.def (__builtin_mffsl): Move from
9278 [power9] to [always].
9280 2021-09-17 Richard Biener <rguenther@suse.de>
9282 * tree-vect-stmts.c (vectorizable_load): Do not frob
9285 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
9287 * config/i386/i386-features.c (remove_partial_avx_dependency):
9288 Also check TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY and
9289 and TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY before generating
9291 * config/i386/i386.h (TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY):
9293 (TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
9294 * config/i386/i386.md (SSE FP to FP splitters): Replace
9295 TARGET_SSE_PARTIAL_REG_DEPENDENCY with
9296 TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY.
9297 (SSE INT to FP splitter): Replace TARGET_SSE_PARTIAL_REG_DEPENDENCY
9298 with TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY.
9299 * config/i386/x86-tune.def
9300 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): New.
9301 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
9303 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
9306 * config/i386/i386-features.c (remove_partial_avx_dependency):
9307 Check TARGET_USE_VECTOR_FP_CONVERTS and TARGET_USE_VECTOR_CONVERTS
9308 before generating vxorps.
9310 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
9312 * config/i386/i386-options.c (processor_cost_table): Use
9313 tremont_cost for Tremont.
9314 * config/i386/x86-tune-costs.h (tremont_memcpy): New.
9315 (tremont_memset): Likewise.
9316 (tremont_cost): Likewise.
9317 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
9320 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
9322 * common/config/i386/i386-common.c: Use Haswell scheduling model
9324 * config/i386/i386.c (ix86_sched_init_global): Prepare for Tremont
9326 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Tremont
9328 (ix86_adjust_cost): Handle Tremont.
9329 * config/i386/x86-tune.def (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY):
9331 (X86_TUNE_USE_LEAVE): Likewise.
9332 (X86_TUNE_PUSH_MEMORY): Likewise.
9333 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
9334 (X86_TUNE_USE_CLTD): Likewise.
9335 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
9336 (X86_TUNE_AVOID_MFENCE): Likewise.
9337 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
9338 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
9339 (X86_TUNE_ACCUMULATE_OUTGOING_ARGS): Disable for Tremont.
9340 (X86_TUNE_FOUR_JUMP_LIMIT): Likewise.
9341 (X86_TUNE_OPT_AGU): Likewise.
9342 (X86_TUNE_AVOID_LEA_FOR_ADDR): Likewise.
9343 (X86_TUNE_AVOID_MEM_OPND_FOR_CMOVE): Likewise.
9344 (X86_TUNE_EXPAND_ABS): Likewise.
9345 (X86_TUNE_SPLIT_MEM_OPND_FOR_FP_CONVERTS): Likewise.
9346 (X86_TUNE_SLOW_PSHUFB): Likewise.
9348 2021-09-17 Eric Botcazou <ebotcazou@adacore.com>
9350 PR rtl-optimization/102306
9351 * combine.c (try_combine): Abort the combination if we are about to
9352 duplicate volatile references.
9354 2021-09-17 liuhongt <hongtao.liu@intel.com>
9356 * config/i386/avx512fp16intrin.h (_mm_undefined_ph):
9358 (_mm256_undefined_ph): Likewise.
9359 (_mm512_undefined_ph): Likewise.
9360 (_mm_cvtsh_h): Likewise.
9361 (_mm256_cvtsh_h): Likewise.
9362 (_mm512_cvtsh_h): Likewise.
9363 (_mm512_castph_ps): Likewise.
9364 (_mm512_castph_pd): Likewise.
9365 (_mm512_castph_si512): Likewise.
9366 (_mm512_castph512_ph128): Likewise.
9367 (_mm512_castph512_ph256): Likewise.
9368 (_mm512_castph128_ph512): Likewise.
9369 (_mm512_castph256_ph512): Likewise.
9370 (_mm512_zextph128_ph512): Likewise.
9371 (_mm512_zextph256_ph512): Likewise.
9372 (_mm512_castps_ph): Likewise.
9373 (_mm512_castpd_ph): Likewise.
9374 (_mm512_castsi512_ph): Likewise.
9375 * config/i386/avx512fp16vlintrin.h (_mm_castph_ps):
9377 (_mm256_castph_ps): Likewise.
9378 (_mm_castph_pd): Likewise.
9379 (_mm256_castph_pd): Likewise.
9380 (_mm_castph_si128): Likewise.
9381 (_mm256_castph_si256): Likewise.
9382 (_mm_castps_ph): Likewise.
9383 (_mm256_castps_ph): Likewise.
9384 (_mm_castpd_ph): Likewise.
9385 (_mm256_castpd_ph): Likewise.
9386 (_mm_castsi128_ph): Likewise.
9387 (_mm256_castsi256_ph): Likewise.
9388 (_mm256_castph256_ph128): Likewise.
9389 (_mm256_castph128_ph256): Likewise.
9390 (_mm256_zextph128_ph256): Likewise.
9392 2021-09-17 liuhongt <hongtao.liu@intel.com>
9394 * config/i386/avx512fp16intrin.h (_mm_cvtsh_ss):
9396 (_mm_mask_cvtsh_ss): Likewise.
9397 (_mm_maskz_cvtsh_ss): Likewise.
9398 (_mm_cvtsh_sd): Likewise.
9399 (_mm_mask_cvtsh_sd): Likewise.
9400 (_mm_maskz_cvtsh_sd): Likewise.
9401 (_mm_cvt_roundsh_ss): Likewise.
9402 (_mm_mask_cvt_roundsh_ss): Likewise.
9403 (_mm_maskz_cvt_roundsh_ss): Likewise.
9404 (_mm_cvt_roundsh_sd): Likewise.
9405 (_mm_mask_cvt_roundsh_sd): Likewise.
9406 (_mm_maskz_cvt_roundsh_sd): Likewise.
9407 (_mm_cvtss_sh): Likewise.
9408 (_mm_mask_cvtss_sh): Likewise.
9409 (_mm_maskz_cvtss_sh): Likewise.
9410 (_mm_cvtsd_sh): Likewise.
9411 (_mm_mask_cvtsd_sh): Likewise.
9412 (_mm_maskz_cvtsd_sh): Likewise.
9413 (_mm_cvt_roundss_sh): Likewise.
9414 (_mm_mask_cvt_roundss_sh): Likewise.
9415 (_mm_maskz_cvt_roundss_sh): Likewise.
9416 (_mm_cvt_roundsd_sh): Likewise.
9417 (_mm_mask_cvt_roundsd_sh): Likewise.
9418 (_mm_maskz_cvt_roundsd_sh): Likewise.
9419 * config/i386/i386-builtin-types.def
9420 (V8HF_FTYPE_V2DF_V8HF_V8HF_UQI_INT,
9421 V8HF_FTYPE_V4SF_V8HF_V8HF_UQI_INT,
9422 V2DF_FTYPE_V8HF_V2DF_V2DF_UQI_INT,
9423 V4SF_FTYPE_V8HF_V4SF_V4SF_UQI_INT): Add new builtin types.
9424 * config/i386/i386-builtin.def: Add corrresponding new builtins.
9425 * config/i386/i386-expand.c: Handle new builtin types.
9426 * config/i386/sse.md (VF48_128): New mode iterator.
9427 (avx512fp16_vcvtsh2<ssescalarmodesuffix><mask_scalar_name><round_saeonly_scalar_name>):
9429 (avx512fp16_vcvt<ssescalarmodesuffix>2sh<mask_scalar_name><round_scalar_name>):
9432 2021-09-17 liuhongt <hongtao.liu@intel.com>
9434 * config/i386/avx512fp16intrin.h (_mm512_cvtph_pd):
9436 (_mm512_mask_cvtph_pd): Likewise.
9437 (_mm512_maskz_cvtph_pd): Likewise.
9438 (_mm512_cvt_roundph_pd): Likewise.
9439 (_mm512_mask_cvt_roundph_pd): Likewise.
9440 (_mm512_maskz_cvt_roundph_pd): Likewise.
9441 (_mm512_cvtxph_ps): Likewise.
9442 (_mm512_mask_cvtxph_ps): Likewise.
9443 (_mm512_maskz_cvtxph_ps): Likewise.
9444 (_mm512_cvtx_roundph_ps): Likewise.
9445 (_mm512_mask_cvtx_roundph_ps): Likewise.
9446 (_mm512_maskz_cvtx_roundph_ps): Likewise.
9447 (_mm512_cvtxps_ph): Likewise.
9448 (_mm512_mask_cvtxps_ph): Likewise.
9449 (_mm512_maskz_cvtxps_ph): Likewise.
9450 (_mm512_cvtx_roundps_ph): Likewise.
9451 (_mm512_mask_cvtx_roundps_ph): Likewise.
9452 (_mm512_maskz_cvtx_roundps_ph): Likewise.
9453 (_mm512_cvtpd_ph): Likewise.
9454 (_mm512_mask_cvtpd_ph): Likewise.
9455 (_mm512_maskz_cvtpd_ph): Likewise.
9456 (_mm512_cvt_roundpd_ph): Likewise.
9457 (_mm512_mask_cvt_roundpd_ph): Likewise.
9458 (_mm512_maskz_cvt_roundpd_ph): Likewise.
9459 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_pd):
9461 (_mm_mask_cvtph_pd): Likewise.
9462 (_mm_maskz_cvtph_pd): Likewise.
9463 (_mm256_cvtph_pd): Likewise.
9464 (_mm256_mask_cvtph_pd): Likewise.
9465 (_mm256_maskz_cvtph_pd): Likewise.
9466 (_mm_cvtxph_ps): Likewise.
9467 (_mm_mask_cvtxph_ps): Likewise.
9468 (_mm_maskz_cvtxph_ps): Likewise.
9469 (_mm256_cvtxph_ps): Likewise.
9470 (_mm256_mask_cvtxph_ps): Likewise.
9471 (_mm256_maskz_cvtxph_ps): Likewise.
9472 (_mm_cvtxps_ph): Likewise.
9473 (_mm_mask_cvtxps_ph): Likewise.
9474 (_mm_maskz_cvtxps_ph): Likewise.
9475 (_mm256_cvtxps_ph): Likewise.
9476 (_mm256_mask_cvtxps_ph): Likewise.
9477 (_mm256_maskz_cvtxps_ph): Likewise.
9478 (_mm_cvtpd_ph): Likewise.
9479 (_mm_mask_cvtpd_ph): Likewise.
9480 (_mm_maskz_cvtpd_ph): Likewise.
9481 (_mm256_cvtpd_ph): Likewise.
9482 (_mm256_mask_cvtpd_ph): Likewise.
9483 (_mm256_maskz_cvtpd_ph): Likewise.
9484 * config/i386/i386-builtin.def: Add corresponding new builtins.
9485 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
9486 * config/i386/i386-expand.c: Handle new builtin types.
9487 * config/i386/sse.md
9488 (VF4_128_8_256): New.
9489 (VF48H_AVX512VL): Ditto.
9490 (ssePHmode): Add HF vector modes.
9491 (castmode): Add new convertable modes.
9494 (avx512fp16_vcvt<castmode>2ph_<mode><mask_name><round_name>): Ditto.
9495 (avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
9496 (*avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
9497 (avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
9498 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
9499 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask_1): Ditto.
9500 (avx512fp16_float_extend_ph<mode>2<mask_name><round_saeonly_name>):
9502 (avx512fp16_float_extend_ph<mode>2<mask_name>): Ditto.
9503 (*avx512fp16_float_extend_ph<mode>2_load<mask_name>): Ditto.
9504 (avx512fp16_float_extend_phv2df2<mask_name>): Ditto.
9505 (*avx512fp16_float_extend_phv2df2_load<mask_name>): Ditto.
9507 2021-09-17 liuhongt <hongtao.liu@intel.com>
9509 * config/i386/avx512fp16intrin.h (_mm_cvttsh_i32):
9511 (_mm_cvttsh_u32): Likewise.
9512 (_mm_cvtt_roundsh_i32): Likewise.
9513 (_mm_cvtt_roundsh_u32): Likewise.
9514 (_mm_cvttsh_i64): Likewise.
9515 (_mm_cvttsh_u64): Likewise.
9516 (_mm_cvtt_roundsh_i64): Likewise.
9517 (_mm_cvtt_roundsh_u64): Likewise.
9518 * config/i386/i386-builtin.def: Add corresponding new builtins.
9519 * config/i386/sse.md
9520 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<round_saeonly_name>):
9523 2021-09-17 liuhongt <hongtao.liu@intel.com>
9525 * config/i386/avx512fp16intrin.h (_mm512_cvttph_epi32):
9527 (_mm512_mask_cvttph_epi32): Likewise.
9528 (_mm512_maskz_cvttph_epi32): Likewise.
9529 (_mm512_cvtt_roundph_epi32): Likewise.
9530 (_mm512_mask_cvtt_roundph_epi32): Likewise.
9531 (_mm512_maskz_cvtt_roundph_epi32): Likewise.
9532 (_mm512_cvttph_epu32): Likewise.
9533 (_mm512_mask_cvttph_epu32): Likewise.
9534 (_mm512_maskz_cvttph_epu32): Likewise.
9535 (_mm512_cvtt_roundph_epu32): Likewise.
9536 (_mm512_mask_cvtt_roundph_epu32): Likewise.
9537 (_mm512_maskz_cvtt_roundph_epu32): Likewise.
9538 (_mm512_cvttph_epi64): Likewise.
9539 (_mm512_mask_cvttph_epi64): Likewise.
9540 (_mm512_maskz_cvttph_epi64): Likewise.
9541 (_mm512_cvtt_roundph_epi64): Likewise.
9542 (_mm512_mask_cvtt_roundph_epi64): Likewise.
9543 (_mm512_maskz_cvtt_roundph_epi64): Likewise.
9544 (_mm512_cvttph_epu64): Likewise.
9545 (_mm512_mask_cvttph_epu64): Likewise.
9546 (_mm512_maskz_cvttph_epu64): Likewise.
9547 (_mm512_cvtt_roundph_epu64): Likewise.
9548 (_mm512_mask_cvtt_roundph_epu64): Likewise.
9549 (_mm512_maskz_cvtt_roundph_epu64): Likewise.
9550 (_mm512_cvttph_epi16): Likewise.
9551 (_mm512_mask_cvttph_epi16): Likewise.
9552 (_mm512_maskz_cvttph_epi16): Likewise.
9553 (_mm512_cvtt_roundph_epi16): Likewise.
9554 (_mm512_mask_cvtt_roundph_epi16): Likewise.
9555 (_mm512_maskz_cvtt_roundph_epi16): Likewise.
9556 (_mm512_cvttph_epu16): Likewise.
9557 (_mm512_mask_cvttph_epu16): Likewise.
9558 (_mm512_maskz_cvttph_epu16): Likewise.
9559 (_mm512_cvtt_roundph_epu16): Likewise.
9560 (_mm512_mask_cvtt_roundph_epu16): Likewise.
9561 (_mm512_maskz_cvtt_roundph_epu16): Likewise.
9562 * config/i386/avx512fp16vlintrin.h (_mm_cvttph_epi32):
9564 (_mm_mask_cvttph_epi32): Likewise.
9565 (_mm_maskz_cvttph_epi32): Likewise.
9566 (_mm256_cvttph_epi32): Likewise.
9567 (_mm256_mask_cvttph_epi32): Likewise.
9568 (_mm256_maskz_cvttph_epi32): Likewise.
9569 (_mm_cvttph_epu32): Likewise.
9570 (_mm_mask_cvttph_epu32): Likewise.
9571 (_mm_maskz_cvttph_epu32): Likewise.
9572 (_mm256_cvttph_epu32): Likewise.
9573 (_mm256_mask_cvttph_epu32): Likewise.
9574 (_mm256_maskz_cvttph_epu32): Likewise.
9575 (_mm_cvttph_epi64): Likewise.
9576 (_mm_mask_cvttph_epi64): Likewise.
9577 (_mm_maskz_cvttph_epi64): Likewise.
9578 (_mm256_cvttph_epi64): Likewise.
9579 (_mm256_mask_cvttph_epi64): Likewise.
9580 (_mm256_maskz_cvttph_epi64): Likewise.
9581 (_mm_cvttph_epu64): Likewise.
9582 (_mm_mask_cvttph_epu64): Likewise.
9583 (_mm_maskz_cvttph_epu64): Likewise.
9584 (_mm256_cvttph_epu64): Likewise.
9585 (_mm256_mask_cvttph_epu64): Likewise.
9586 (_mm256_maskz_cvttph_epu64): Likewise.
9587 (_mm_cvttph_epi16): Likewise.
9588 (_mm_mask_cvttph_epi16): Likewise.
9589 (_mm_maskz_cvttph_epi16): Likewise.
9590 (_mm256_cvttph_epi16): Likewise.
9591 (_mm256_mask_cvttph_epi16): Likewise.
9592 (_mm256_maskz_cvttph_epi16): Likewise.
9593 (_mm_cvttph_epu16): Likewise.
9594 (_mm_mask_cvttph_epu16): Likewise.
9595 (_mm_maskz_cvttph_epu16): Likewise.
9596 (_mm256_cvttph_epu16): Likewise.
9597 (_mm256_mask_cvttph_epu16): Likewise.
9598 (_mm256_maskz_cvttph_epu16): Likewise.
9599 * config/i386/i386-builtin.def: Add new builtins.
9600 * config/i386/sse.md
9601 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name><round_saeonly_name>):
9603 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name>): Ditto.
9604 (*avx512fp16_fix<fixunssuffix>_trunc<mode>2_load<mask_name>): Ditto.
9605 (avx512fp16_fix<fixunssuffix>_truncv2di2<mask_name>): Ditto.
9606 (avx512fp16_fix<fixunssuffix>_truncv2di2_load<mask_name>): Ditto.
9608 2021-09-17 liuhongt <hongtao.liu@intel.com>
9610 * config/i386/avx512fp16intrin.h (_mm_cvtsh_i32): New intrinsic.
9611 (_mm_cvtsh_u32): Likewise.
9612 (_mm_cvt_roundsh_i32): Likewise.
9613 (_mm_cvt_roundsh_u32): Likewise.
9614 (_mm_cvtsh_i64): Likewise.
9615 (_mm_cvtsh_u64): Likewise.
9616 (_mm_cvt_roundsh_i64): Likewise.
9617 (_mm_cvt_roundsh_u64): Likewise.
9618 (_mm_cvti32_sh): Likewise.
9619 (_mm_cvtu32_sh): Likewise.
9620 (_mm_cvt_roundi32_sh): Likewise.
9621 (_mm_cvt_roundu32_sh): Likewise.
9622 (_mm_cvti64_sh): Likewise.
9623 (_mm_cvtu64_sh): Likewise.
9624 (_mm_cvt_roundi64_sh): Likewise.
9625 (_mm_cvt_roundu64_sh): Likewise.
9626 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
9627 * config/i386/i386-builtin.def: Add corresponding new builtins.
9628 * config/i386/i386-expand.c (ix86_expand_round_builtin):
9629 Handle new builtin types.
9630 * config/i386/sse.md
9631 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix><round_name>):
9633 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix>_2): Likewise.
9634 (avx512fp16_vcvt<floatsuffix>si2sh<rex64namesuffix><round_name>): Likewise.
9636 2021-09-16 Bill Schmidt <wschmidt@linux.ibm.com>
9638 * config/rs6000/rs6000-c.c (rs6000-builtins.h): New include.
9639 (altivec_resolve_new_overloaded_builtin): New forward decl.
9640 (rs6000_new_builtin_type_compatible): New function.
9641 (altivec_resolve_overloaded_builtin): Call
9642 altivec_resolve_new_overloaded_builtin.
9643 (altivec_build_new_resolved_builtin): New function.
9644 (altivec_resolve_new_overloaded_builtin): Likewise.
9645 * config/rs6000/rs6000-call.c (rs6000_new_builtin_is_supported):
9647 * config/rs6000/rs6000-gen-builtins.c (write_decls): Remove _p from
9648 name of rs6000_new_builtin_is_supported.
9650 2021-09-16 Uroš Bizjak <ubizjak@gmail.com>
9652 * config/i386/i386-protos.h (ix86_decompose_address):
9653 Change return type to bool.
9654 * config/i386/i386.c (ix86_decompose_address): Ditto.
9656 2021-09-16 Tobias Burnus <tobias@codesourcery.com>
9659 * config/rs6000/t-rs6000 (build/rs6000-gen-builtins.o, build/rbtree.o):
9660 Added 'build/' to target, use build/%.o rule.
9661 (build/rs6000-gen-builtins$(build_exeext)): Add 'build/' and
9662 '$(build_exeext)' to target and 'build/' for the *.o files.
9663 (rs6000-builtins.c): Update for those changes; run rs6000-gen-builtins
9666 2021-09-16 Martin Jambor <mjambor@suse.cz>
9668 * cgraph.c (cgraph_node::dump): Do not check caller count sums if
9669 the body has been removed. Remove trailing whitespace.
9671 2021-09-16 Richard Biener <rguenther@suse.de>
9673 PR middle-end/102360
9674 * internal-fn.c (expand_DEFERRED_INIT): Make pattern-init
9675 of non-memory more robust.
9677 2021-09-16 Daniel Cederman <cederman@gaisler.com>
9679 * config/sparc/sparc-opts.h (enum sparc_processor_type): Add LEON5
9680 * config/sparc/sparc.c (struct processor_costs): Add LEON5 costs
9681 (leon5_adjust_cost): Increase cost of store with data dependency
9682 on ALU instruction and FPU anti-dependencies.
9683 (sparc_option_override): Add LEON5 costs
9684 (sparc_adjust_cost): Add LEON5 cost adjustments
9685 * config/sparc/sparc.h: Add LEON5
9686 * config/sparc/sparc.md: Include LEON5 scheduling information
9687 * config/sparc/sparc.opt: Add LEON5
9688 * doc/invoke.texi: Add LEON5
9689 * config/sparc/leon5.md: New file.
9691 2021-09-16 Daniel Cederman <cederman@gaisler.com>
9693 * config/sparc/sparc.md (stack_protect_set32): Add NOP to prevent
9694 sensitive sequence for B2BST errata workaround.
9696 2021-09-16 Daniel Cederman <cederman@gaisler.com>
9698 * config/sparc/sparc.c (sparc_do_work_around_errata): Do not begin
9699 functions with atomic instruction in the UT700 errata workaround.
9701 2021-09-16 Daniel Cederman <cederman@gaisler.com>
9703 * config/sparc/sparc.c (next_active_non_empty_insn): New function
9704 that returns next active non empty assembly instruction.
9705 (sparc_do_work_around_errata): Use new function.
9707 2021-09-16 Daniel Cederman <cederman@gaisler.com>
9709 * config/sparc/sparc.c (store_insn_p): Add predicate for store
9711 (load_insn_p): Add predicate for load attributes.
9712 (sparc_do_work_around_errata): Use new predicates.
9714 2021-09-16 Andreas Larsson <andreas@gaisler.com>
9716 * config/sparc/sparc.c (dump_target_flag_bits): Print bit names for
9719 2021-09-16 Martin Liska <mliska@suse.cz>
9721 * config/mips/netbsd.h: Fix typo in name of a macro.
9723 2021-09-16 liuhongt <hongtao.liu@intel.com>
9725 PR middle-end/102080
9726 * match.pd: Check mask type when doing cond_op related gimple
9728 * tree.c (is_truth_type_for): New function.
9729 * tree.h (is_truth_type_for): New declaration.
9731 2021-09-16 liuhongt <hongtao.liu@intel.com>
9733 * config/i386/avx512fp16intrin.h (_mm512_cvtepi32_ph): New
9735 (_mm512_mask_cvtepi32_ph): Likewise.
9736 (_mm512_maskz_cvtepi32_ph): Likewise.
9737 (_mm512_cvt_roundepi32_ph): Likewise.
9738 (_mm512_mask_cvt_roundepi32_ph): Likewise.
9739 (_mm512_maskz_cvt_roundepi32_ph): Likewise.
9740 (_mm512_cvtepu32_ph): Likewise.
9741 (_mm512_mask_cvtepu32_ph): Likewise.
9742 (_mm512_maskz_cvtepu32_ph): Likewise.
9743 (_mm512_cvt_roundepu32_ph): Likewise.
9744 (_mm512_mask_cvt_roundepu32_ph): Likewise.
9745 (_mm512_maskz_cvt_roundepu32_ph): Likewise.
9746 (_mm512_cvtepi64_ph): Likewise.
9747 (_mm512_mask_cvtepi64_ph): Likewise.
9748 (_mm512_maskz_cvtepi64_ph): Likewise.
9749 (_mm512_cvt_roundepi64_ph): Likewise.
9750 (_mm512_mask_cvt_roundepi64_ph): Likewise.
9751 (_mm512_maskz_cvt_roundepi64_ph): Likewise.
9752 (_mm512_cvtepu64_ph): Likewise.
9753 (_mm512_mask_cvtepu64_ph): Likewise.
9754 (_mm512_maskz_cvtepu64_ph): Likewise.
9755 (_mm512_cvt_roundepu64_ph): Likewise.
9756 (_mm512_mask_cvt_roundepu64_ph): Likewise.
9757 (_mm512_maskz_cvt_roundepu64_ph): Likewise.
9758 (_mm512_cvtepi16_ph): Likewise.
9759 (_mm512_mask_cvtepi16_ph): Likewise.
9760 (_mm512_maskz_cvtepi16_ph): Likewise.
9761 (_mm512_cvt_roundepi16_ph): Likewise.
9762 (_mm512_mask_cvt_roundepi16_ph): Likewise.
9763 (_mm512_maskz_cvt_roundepi16_ph): Likewise.
9764 (_mm512_cvtepu16_ph): Likewise.
9765 (_mm512_mask_cvtepu16_ph): Likewise.
9766 (_mm512_maskz_cvtepu16_ph): Likewise.
9767 (_mm512_cvt_roundepu16_ph): Likewise.
9768 (_mm512_mask_cvt_roundepu16_ph): Likewise.
9769 (_mm512_maskz_cvt_roundepu16_ph): Likewise.
9770 * config/i386/avx512fp16vlintrin.h (_mm_cvtepi32_ph): New
9772 (_mm_mask_cvtepi32_ph): Likewise.
9773 (_mm_maskz_cvtepi32_ph): Likewise.
9774 (_mm256_cvtepi32_ph): Likewise.
9775 (_mm256_mask_cvtepi32_ph): Likewise.
9776 (_mm256_maskz_cvtepi32_ph): Likewise.
9777 (_mm_cvtepu32_ph): Likewise.
9778 (_mm_mask_cvtepu32_ph): Likewise.
9779 (_mm_maskz_cvtepu32_ph): Likewise.
9780 (_mm256_cvtepu32_ph): Likewise.
9781 (_mm256_mask_cvtepu32_ph): Likewise.
9782 (_mm256_maskz_cvtepu32_ph): Likewise.
9783 (_mm_cvtepi64_ph): Likewise.
9784 (_mm_mask_cvtepi64_ph): Likewise.
9785 (_mm_maskz_cvtepi64_ph): Likewise.
9786 (_mm256_cvtepi64_ph): Likewise.
9787 (_mm256_mask_cvtepi64_ph): Likewise.
9788 (_mm256_maskz_cvtepi64_ph): Likewise.
9789 (_mm_cvtepu64_ph): Likewise.
9790 (_mm_mask_cvtepu64_ph): Likewise.
9791 (_mm_maskz_cvtepu64_ph): Likewise.
9792 (_mm256_cvtepu64_ph): Likewise.
9793 (_mm256_mask_cvtepu64_ph): Likewise.
9794 (_mm256_maskz_cvtepu64_ph): Likewise.
9795 (_mm_cvtepi16_ph): Likewise.
9796 (_mm_mask_cvtepi16_ph): Likewise.
9797 (_mm_maskz_cvtepi16_ph): Likewise.
9798 (_mm256_cvtepi16_ph): Likewise.
9799 (_mm256_mask_cvtepi16_ph): Likewise.
9800 (_mm256_maskz_cvtepi16_ph): Likewise.
9801 (_mm_cvtepu16_ph): Likewise.
9802 (_mm_mask_cvtepu16_ph): Likewise.
9803 (_mm_maskz_cvtepu16_ph): Likewise.
9804 (_mm256_cvtepu16_ph): Likewise.
9805 (_mm256_mask_cvtepu16_ph): Likewise.
9806 (_mm256_maskz_cvtepu16_ph): Likewise.
9807 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
9808 * config/i386/i386-builtin.def: Add corresponding new builtins.
9809 * config/i386/i386-expand.c
9810 (ix86_expand_args_builtin): Handle new builtin types.
9811 (ix86_expand_round_builtin): Ditto.
9812 * config/i386/i386-modes.def: Declare V2HF and V6HF.
9813 * config/i386/sse.md (VI2H_AVX512VL): New.
9815 (sseintvecmode): Add HF vector modes.
9816 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode><mask_name><round_name>):
9818 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
9819 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
9820 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
9821 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
9822 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask_1): Ditto.
9823 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
9824 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
9825 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
9826 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
9827 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask_1): Ditto.
9828 * config/i386/subst.md (round_qq2phsuff): New subst_attr.
9830 2021-09-16 liuhongt <hongtao.liu@intel.com>
9832 * config/i386/avx512fp16intrin.h (_mm512_cvtph_epi32):
9834 (_mm512_mask_cvtph_epi32): Likewise.
9835 (_mm512_maskz_cvtph_epi32): Likewise.
9836 (_mm512_cvt_roundph_epi32): Likewise.
9837 (_mm512_mask_cvt_roundph_epi32): Likewise.
9838 (_mm512_maskz_cvt_roundph_epi32): Likewise.
9839 (_mm512_cvtph_epu32): Likewise.
9840 (_mm512_mask_cvtph_epu32): Likewise.
9841 (_mm512_maskz_cvtph_epu32): Likewise.
9842 (_mm512_cvt_roundph_epu32): Likewise.
9843 (_mm512_mask_cvt_roundph_epu32): Likewise.
9844 (_mm512_maskz_cvt_roundph_epu32): Likewise.
9845 (_mm512_cvtph_epi64): Likewise.
9846 (_mm512_mask_cvtph_epi64): Likewise.
9847 (_mm512_maskz_cvtph_epi64): Likewise.
9848 (_mm512_cvt_roundph_epi64): Likewise.
9849 (_mm512_mask_cvt_roundph_epi64): Likewise.
9850 (_mm512_maskz_cvt_roundph_epi64): Likewise.
9851 (_mm512_cvtph_epu64): Likewise.
9852 (_mm512_mask_cvtph_epu64): Likewise.
9853 (_mm512_maskz_cvtph_epu64): Likewise.
9854 (_mm512_cvt_roundph_epu64): Likewise.
9855 (_mm512_mask_cvt_roundph_epu64): Likewise.
9856 (_mm512_maskz_cvt_roundph_epu64): Likewise.
9857 (_mm512_cvtph_epi16): Likewise.
9858 (_mm512_mask_cvtph_epi16): Likewise.
9859 (_mm512_maskz_cvtph_epi16): Likewise.
9860 (_mm512_cvt_roundph_epi16): Likewise.
9861 (_mm512_mask_cvt_roundph_epi16): Likewise.
9862 (_mm512_maskz_cvt_roundph_epi16): Likewise.
9863 (_mm512_cvtph_epu16): Likewise.
9864 (_mm512_mask_cvtph_epu16): Likewise.
9865 (_mm512_maskz_cvtph_epu16): Likewise.
9866 (_mm512_cvt_roundph_epu16): Likewise.
9867 (_mm512_mask_cvt_roundph_epu16): Likewise.
9868 (_mm512_maskz_cvt_roundph_epu16): Likewise.
9869 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_epi32):
9871 (_mm_mask_cvtph_epi32): Likewise.
9872 (_mm_maskz_cvtph_epi32): Likewise.
9873 (_mm256_cvtph_epi32): Likewise.
9874 (_mm256_mask_cvtph_epi32): Likewise.
9875 (_mm256_maskz_cvtph_epi32): Likewise.
9876 (_mm_cvtph_epu32): Likewise.
9877 (_mm_mask_cvtph_epu32): Likewise.
9878 (_mm_maskz_cvtph_epu32): Likewise.
9879 (_mm256_cvtph_epu32): Likewise.
9880 (_mm256_mask_cvtph_epu32): Likewise.
9881 (_mm256_maskz_cvtph_epu32): Likewise.
9882 (_mm_cvtph_epi64): Likewise.
9883 (_mm_mask_cvtph_epi64): Likewise.
9884 (_mm_maskz_cvtph_epi64): Likewise.
9885 (_mm256_cvtph_epi64): Likewise.
9886 (_mm256_mask_cvtph_epi64): Likewise.
9887 (_mm256_maskz_cvtph_epi64): Likewise.
9888 (_mm_cvtph_epu64): Likewise.
9889 (_mm_mask_cvtph_epu64): Likewise.
9890 (_mm_maskz_cvtph_epu64): Likewise.
9891 (_mm256_cvtph_epu64): Likewise.
9892 (_mm256_mask_cvtph_epu64): Likewise.
9893 (_mm256_maskz_cvtph_epu64): Likewise.
9894 (_mm_cvtph_epi16): Likewise.
9895 (_mm_mask_cvtph_epi16): Likewise.
9896 (_mm_maskz_cvtph_epi16): Likewise.
9897 (_mm256_cvtph_epi16): Likewise.
9898 (_mm256_mask_cvtph_epi16): Likewise.
9899 (_mm256_maskz_cvtph_epi16): Likewise.
9900 (_mm_cvtph_epu16): Likewise.
9901 (_mm_mask_cvtph_epu16): Likewise.
9902 (_mm_maskz_cvtph_epu16): Likewise.
9903 (_mm256_cvtph_epu16): Likewise.
9904 (_mm256_mask_cvtph_epu16): Likewise.
9905 (_mm256_maskz_cvtph_epu16): Likewise.
9906 * config/i386/i386-builtin-types.def: Add new builtin types.
9907 * config/i386/i386-builtin.def: Add new builtins.
9908 * config/i386/i386-expand.c
9909 (ix86_expand_args_builtin): Handle new builtin types.
9910 (ix86_expand_round_builtin): Ditto.
9911 * config/i386/sse.md (sseintconvert): New.
9913 (UNSPEC_US_FIX_NOTRUNC): Ditto.
9914 (sseintconvertsignprefix): Ditto.
9915 (avx512fp16_vcvtph2<sseintconvertsignprefix><sseintconvert>_<mode><mask_name><round_name>):
9918 2021-09-16 liuhongt <hongtao.liu@intel.com>
9920 * config/i386/avx512fp16intrin.h: (_mm_cvtsi16_si128):
9922 (_mm_cvtsi128_si16): Likewise.
9923 (_mm_mask_load_sh): Likewise.
9924 (_mm_maskz_load_sh): Likewise.
9925 (_mm_mask_store_sh): Likewise.
9926 (_mm_move_sh): Likewise.
9927 (_mm_mask_move_sh): Likewise.
9928 (_mm_maskz_move_sh): Likewise.
9929 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
9930 * config/i386/i386-builtin.def: Add corresponding new builtins.
9931 * config/i386/i386-expand.c
9932 (ix86_expand_special_args_builtin): Handle new builtin types.
9933 (ix86_expand_vector_init_one_nonzero): Adjust for FP16 target.
9934 * config/i386/sse.md (VI2F): New mode iterator.
9935 (vec_set<mode>_0): Use new mode iterator.
9936 (avx512f_mov<ssescalarmodelower>_mask): Adjust for HF vector mode.
9937 (avx512f_store<mode>_mask): Ditto.
9939 2021-09-16 Kewen Lin <linkw@linux.ibm.com>
9941 * config/rs6000/rs6000.opt (-mtoc-fusion): Remove.
9943 2021-09-15 David Edelsohn <dje.gcc@gmail.com>
9945 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
9946 Proceed if no symbol summary or the symbol alias flag is false.
9948 2021-09-15 Jakub Jelinek <jakub@redhat.com>
9952 * varasm.c (output_constructor_regular_field): Instead of assertion
9953 that array_size_for_constructor result is equal to size of
9954 TREE_TYPE (local->val) in bytes, assert that the type size is greater
9955 or equal to array_size_for_constructor result and use type size as
9958 2021-09-15 Martin Liska <mliska@suse.cz>
9961 * config/i386/vxworks.h: Use new macro TARGET_CPU_P.
9963 2021-09-15 Martin Liska <mliska@suse.cz>
9966 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
9967 Check that we have a symbol summary for a symbol.
9969 2021-09-15 Richard Biener <rguenther@suse.de>
9972 * config/rs6000/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
9973 to inherit from elfos.h
9975 2021-09-15 liuhongt <hongtao.liu@intel.com>
9978 * config/i386/i386-expand.c
9979 (ix86_expand_vector_init_interleave): Use puncklwd to pack 2
9981 (ix86_expand_vector_set): Use blendw instead of pinsrw.
9982 * config/i386/i386.c (ix86_can_change_mode_class): Adjust for
9983 AVX512FP16 which supports 16bit vector load.
9984 * config/i386/sse.md (avx512bw_interleave_highv32hi<mask_name>):
9986 (avx512bw_interleave_high<mode><mask_name>): .. this, and
9987 extend to V32HFmode.
9988 (avx2_interleave_highv16hi<mask_name>): Rename to ..
9989 (avx2_interleave_high<mode><mask_name>): .. this, and extend
9991 (vec_interleave_highv8hi<mask_name>): Rename to ..
9992 (vec_interleave_high<mode><mask_name>): .. this, and extend to V8HFmode.
9993 (<mask_codefor>avx512bw_interleave_lowv32hi<mask_name>):
9995 (<mask_codefor>avx512bw_interleave_low<mode><mask_name>):
9996 this, and extend to V32HFmode.
9997 (avx2_interleave_lowv16hi<mask_name>): Rename to ..
9998 (avx2_interleave_low<mode><mask_name>): .. this, and extend to V16HFmode.
9999 (vec_interleave_lowv8hi<mask_name>): Rename to ..
10000 (vec_interleave_low<mode><mask_name>): .. this, and extend to V8HFmode.
10001 (sse4_1_pblendw): Rename to ..
10002 (sse4_1_pblend<blendsuf>): .. this, and extend to V8HFmode.
10003 (avx2_pblendph): New define_expand.
10004 (<sse2p4_1>_pinsr<ssemodesuffix>): Refactor, use
10005 sseintmodesuffix instead of ssemodesuffix.
10006 (blendsuf): New mode attr.
10008 2021-09-15 Richard Biener <rguenther@suse.de>
10010 * tree-vectorizer.h (dr_misalignment): Move out of line.
10011 (dr_target_alignment): New.
10012 (DR_TARGET_ALIGNMENT): Wrap dr_target_alignment.
10013 (set_dr_target_alignment): New.
10014 (SET_DR_TARGET_ALIGNMENT): Wrap set_dr_target_alignment.
10015 * tree-vect-data-refs.c (dr_misalignment): Compute and
10016 return the group members misalignment.
10017 (vect_compute_data_ref_alignment): Use SET_DR_TARGET_ALIGNMENT.
10018 (vect_analyze_data_refs_alignment): Compute alignment only
10019 for the first element of a DR group.
10020 (vect_slp_analyze_node_alignment): Likewise.
10022 2021-09-15 Hongyu Wang <hongyu.wang@intel.com>
10024 * config/i386/avx512fp16intrin.h: Adjust all builtin calls.
10025 * config/i386/avx512fp16vlintrin.h: Likewise.
10026 * config/i386/i386-builtin.def: Adjust builtin name and
10027 enumeration to match AVX512F style.
10029 2021-09-15 Richard Biener <rguenther@suse.de>
10031 PR tree-optimization/102318
10032 * tree-vect-loop.c (vect_transform_cycle_phi): Revert
10033 previous change and do the mode conversion separately from
10034 the sign conversion.
10036 2021-09-15 Hongtao Liu <hongtao.liu@intel.com>
10037 Peter Cordes <peter@cordes.ca>
10040 * config/i386/sse.md (extract_suf): Add V8SF/V8SI/V4DF/V4DI.
10041 (*vec_extract<mode><ssescalarmodelower>_valign): Output
10042 vextract{i,f}{32x4,64x2} instruction when byte_offset % 16 ==
10045 2021-09-15 Richard Biener <rguenther@suse.de>
10047 * config.gcc: Remove vax-*-openbsd* configuration.
10049 2021-09-15 Richard Biener <rguenther@suse.de>
10051 * config.gcc: Remove m68k-openbsd.
10053 2021-09-15 Max Filippov <jcmvbkbc@gmail.com>
10056 * config/xtensa/t-xtensa (TM_H): Add include/xtensa-config.h.
10058 2021-09-14 Peter Bergner <bergner@linux.ibm.com>
10060 * config/rs6000/mma.md (unspec): Delete UNSPEC_MMA_XXSETACCZ.
10061 (unspecv): Add UNSPECV_MMA_XXSETACCZ.
10062 (*mma_xxsetaccz): Delete.
10063 (mma_xxsetaccz): Change to define_insn. Remove operand 1.
10064 Use UNSPECV_MMA_XXSETACCZ. Update comment.
10065 * config/rs6000/rs6000.c (rs6000_rtx_costs): Use UNSPECV_MMA_XXSETACCZ.
10067 2021-09-14 Iain Sandoe <iain@sandoe.co.uk>
10069 * Makefile.in: Remove variables related to applying no-PIE
10070 to the exes on $build.
10071 * configure: Regenerate.
10072 * configure.ac: Remove configuration related to applying
10073 no-PIE to the exes on $build.
10075 2021-09-14 Claudiu Zissulescu <claziss@synopsys.com>
10077 * config/arc/arc.md (doloop_end): Add missing mode.
10078 (loop_end): Likewise.
10080 2021-09-14 Jakub Jelinek <jakub@redhat.com>
10082 * gimplify.c (goa_stabilize_expr): Add depth argument, propagate
10083 it to recursive calls, for depth above 7 just gimplify or return.
10084 Perform a test even for MODIFY_EXPR, ADDR_EXPR, COMPOUND_EXPR with
10085 __builtin_clear_padding and TARGET_EXPR.
10086 (gimplify_omp_atomic): Adjust goa_stabilize_expr callers.
10088 2021-09-14 liuhongt <hongtao.liu@intel.com>
10090 * config/i386/avx512fp16intrin.h (_mm_fpclass_sh_mask):
10092 (_mm_mask_fpclass_sh_mask): Likewise.
10093 (_mm512_mask_fpclass_ph_mask): Likewise.
10094 (_mm512_fpclass_ph_mask): Likewise.
10095 (_mm_getexp_sh): Likewise.
10096 (_mm_mask_getexp_sh): Likewise.
10097 (_mm_maskz_getexp_sh): Likewise.
10098 (_mm512_getexp_ph): Likewise.
10099 (_mm512_mask_getexp_ph): Likewise.
10100 (_mm512_maskz_getexp_ph): Likewise.
10101 (_mm_getexp_round_sh): Likewise.
10102 (_mm_mask_getexp_round_sh): Likewise.
10103 (_mm_maskz_getexp_round_sh): Likewise.
10104 (_mm512_getexp_round_ph): Likewise.
10105 (_mm512_mask_getexp_round_ph): Likewise.
10106 (_mm512_maskz_getexp_round_ph): Likewise.
10107 (_mm_getmant_sh): Likewise.
10108 (_mm_mask_getmant_sh): Likewise.
10109 (_mm_maskz_getmant_sh): Likewise.
10110 (_mm512_getmant_ph): Likewise.
10111 (_mm512_mask_getmant_ph): Likewise.
10112 (_mm512_maskz_getmant_ph): Likewise.
10113 (_mm_getmant_round_sh): Likewise.
10114 (_mm_mask_getmant_round_sh): Likewise.
10115 (_mm_maskz_getmant_round_sh): Likewise.
10116 (_mm512_getmant_round_ph): Likewise.
10117 (_mm512_mask_getmant_round_ph): Likewise.
10118 (_mm512_maskz_getmant_round_ph): Likewise.
10119 * config/i386/avx512fp16vlintrin.h (_mm_mask_fpclass_ph_mask):
10121 (_mm_fpclass_ph_mask): Likewise.
10122 (_mm256_mask_fpclass_ph_mask): Likewise.
10123 (_mm256_fpclass_ph_mask): Likewise.
10124 (_mm256_getexp_ph): Likewise.
10125 (_mm256_mask_getexp_ph): Likewise.
10126 (_mm256_maskz_getexp_ph): Likewise.
10127 (_mm_getexp_ph): Likewise.
10128 (_mm_mask_getexp_ph): Likewise.
10129 (_mm_maskz_getexp_ph): Likewise.
10130 (_mm256_getmant_ph): Likewise.
10131 (_mm256_mask_getmant_ph): Likewise.
10132 (_mm256_maskz_getmant_ph): Likewise.
10133 (_mm_getmant_ph): Likewise.
10134 (_mm_mask_getmant_ph): Likewise.
10135 (_mm_maskz_getmant_ph): Likewise.
10136 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10137 * config/i386/i386-builtin.def: Add corresponding new builtins.
10138 * config/i386/i386-expand.c
10139 (ix86_expand_args_builtin): Handle new builtin types.
10140 (ix86_expand_round_builtin): Ditto.
10141 * config/i386/sse.md (vecmemsuffix): Add HF vector modes.
10142 (<avx512>_getexp<mode><mask_name><round_saeonly_name>): Adjust
10143 to support HF vector modes.
10144 (avx512f_sgetexp<mode><mask_scalar_name><round_saeonly_scalar_name):
10146 (avx512dq_fpclass<mode><mask_scalar_merge_name>): Ditto.
10147 (avx512dq_vmfpclass<mode><mask_scalar_merge_name>): Ditto.
10148 (<avx512>_getmant<mode><mask_name><round_saeonly_name>): Ditto.
10149 (avx512f_vgetmant<mode><mask_scalar_name><round_saeonly_scalar_name>):
10152 2021-09-14 liuhongt <hongtao.liu@intel.com>
10154 * config/i386/avx512fp16intrin.h (_mm512_reduce_ph):
10156 (_mm512_mask_reduce_ph): Likewise.
10157 (_mm512_maskz_reduce_ph): Likewise.
10158 (_mm512_reduce_round_ph): Likewise.
10159 (_mm512_mask_reduce_round_ph): Likewise.
10160 (_mm512_maskz_reduce_round_ph): Likewise.
10161 (_mm_reduce_sh): Likewise.
10162 (_mm_mask_reduce_sh): Likewise.
10163 (_mm_maskz_reduce_sh): Likewise.
10164 (_mm_reduce_round_sh): Likewise.
10165 (_mm_mask_reduce_round_sh): Likewise.
10166 (_mm_maskz_reduce_round_sh): Likewise.
10167 (_mm512_roundscale_ph): Likewise.
10168 (_mm512_mask_roundscale_ph): Likewise.
10169 (_mm512_maskz_roundscale_ph): Likewise.
10170 (_mm512_roundscale_round_ph): Likewise.
10171 (_mm512_mask_roundscale_round_ph): Likewise.
10172 (_mm512_maskz_roundscale_round_ph): Likewise.
10173 (_mm_roundscale_sh): Likewise.
10174 (_mm_mask_roundscale_sh): Likewise.
10175 (_mm_maskz_roundscale_sh): Likewise.
10176 (_mm_roundscale_round_sh): Likewise.
10177 (_mm_mask_roundscale_round_sh): Likewise.
10178 (_mm_maskz_roundscale_round_sh): Likewise.
10179 * config/i386/avx512fp16vlintrin.h: (_mm_reduce_ph):
10181 (_mm_mask_reduce_ph): Likewise.
10182 (_mm_maskz_reduce_ph): Likewise.
10183 (_mm256_reduce_ph): Likewise.
10184 (_mm256_mask_reduce_ph): Likewise.
10185 (_mm256_maskz_reduce_ph): Likewise.
10186 (_mm_roundscale_ph): Likewise.
10187 (_mm_mask_roundscale_ph): Likewise.
10188 (_mm_maskz_roundscale_ph): Likewise.
10189 (_mm256_roundscale_ph): Likewise.
10190 (_mm256_mask_roundscale_ph): Likewise.
10191 (_mm256_maskz_roundscale_ph): Likewise.
10192 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10193 * config/i386/i386-builtin.def: Add corresponding new builtins.
10194 * config/i386/i386-expand.c
10195 (ix86_expand_args_builtin): Handle new builtin types.
10196 (ix86_expand_round_builtin): Ditto.
10197 * config/i386/sse.md (<mask_codefor>reducep<mode><mask_name>):
10199 (<mask_codefor>reducep<mode><mask_name><round_saeonly_name>):
10200 ... this, and adjust for round operands.
10201 (reduces<mode><mask_scalar_name>): Likewise, with ...
10202 (reduces<mode><mask_scalar_name><round_saeonly_scalar_name):
10204 (<avx512>_rndscale<mode><mask_name><round_saeonly_name>):
10205 Adjust for HF vector modes.
10206 (avx512f_rndscale<mode><mask_scalar_name><round_saeonly_scalar_name>):
10208 (*avx512f_rndscale<mode><round_saeonly_name>): Ditto.
10210 2021-09-14 liuhongt <hongtao.liu@intel.com>
10212 * config/i386/avx512fp16intrin.h: (_mm512_rcp_ph):
10214 (_mm512_mask_rcp_ph): Likewise.
10215 (_mm512_maskz_rcp_ph): Likewise.
10216 (_mm_rcp_sh): Likewise.
10217 (_mm_mask_rcp_sh): Likewise.
10218 (_mm_maskz_rcp_sh): Likewise.
10219 (_mm512_scalef_ph): Likewise.
10220 (_mm512_mask_scalef_ph): Likewise.
10221 (_mm512_maskz_scalef_ph): Likewise.
10222 (_mm512_scalef_round_ph): Likewise.
10223 (_mm512_mask_scalef_round_ph): Likewise.
10224 (_mm512_maskz_scalef_round_ph): Likewise.
10225 (_mm_scalef_sh): Likewise.
10226 (_mm_mask_scalef_sh): Likewise.
10227 (_mm_maskz_scalef_sh): Likewise.
10228 (_mm_scalef_round_sh): Likewise.
10229 (_mm_mask_scalef_round_sh): Likewise.
10230 (_mm_maskz_scalef_round_sh): Likewise.
10231 * config/i386/avx512fp16vlintrin.h (_mm_rcp_ph):
10233 (_mm256_rcp_ph): Likewise.
10234 (_mm_mask_rcp_ph): Likewise.
10235 (_mm256_mask_rcp_ph): Likewise.
10236 (_mm_maskz_rcp_ph): Likewise.
10237 (_mm256_maskz_rcp_ph): Likewise.
10238 (_mm_scalef_ph): Likewise.
10239 (_mm256_scalef_ph): Likewise.
10240 (_mm_mask_scalef_ph): Likewise.
10241 (_mm256_mask_scalef_ph): Likewise.
10242 (_mm_maskz_scalef_ph): Likewise.
10243 (_mm256_maskz_scalef_ph): Likewise.
10244 * config/i386/i386-builtin.def: Add new builtins.
10245 * config/i386/sse.md (VFH_AVX512VL): New.
10246 (avx512fp16_rcp<mode>2<mask_name>): Ditto.
10247 (avx512fp16_vmrcpv8hf2<mask_scalar_name>): Ditto.
10248 (avx512f_vmscalef<mode><mask_scalar_name><round_scalar_name>):
10249 Adjust to support HF vector modes.
10250 (<avx512>_scalef<mode><mask_name><round_name>): Ditto.
10252 2021-09-14 liuhongt <hongtao.liu@intel.com>
10254 * config/i386/avx512fp16intrin.h: (_mm512_sqrt_ph):
10256 (_mm512_mask_sqrt_ph): Likewise.
10257 (_mm512_maskz_sqrt_ph): Likewise.
10258 (_mm512_sqrt_round_ph): Likewise.
10259 (_mm512_mask_sqrt_round_ph): Likewise.
10260 (_mm512_maskz_sqrt_round_ph): Likewise.
10261 (_mm512_rsqrt_ph): Likewise.
10262 (_mm512_mask_rsqrt_ph): Likewise.
10263 (_mm512_maskz_rsqrt_ph): Likewise.
10264 (_mm_rsqrt_sh): Likewise.
10265 (_mm_mask_rsqrt_sh): Likewise.
10266 (_mm_maskz_rsqrt_sh): Likewise.
10267 (_mm_sqrt_sh): Likewise.
10268 (_mm_mask_sqrt_sh): Likewise.
10269 (_mm_maskz_sqrt_sh): Likewise.
10270 (_mm_sqrt_round_sh): Likewise.
10271 (_mm_mask_sqrt_round_sh): Likewise.
10272 (_mm_maskz_sqrt_round_sh): Likewise.
10273 * config/i386/avx512fp16vlintrin.h (_mm_sqrt_ph): New intrinsic.
10274 (_mm256_sqrt_ph): Likewise.
10275 (_mm_mask_sqrt_ph): Likewise.
10276 (_mm256_mask_sqrt_ph): Likewise.
10277 (_mm_maskz_sqrt_ph): Likewise.
10278 (_mm256_maskz_sqrt_ph): Likewise.
10279 (_mm_rsqrt_ph): Likewise.
10280 (_mm256_rsqrt_ph): Likewise.
10281 (_mm_mask_rsqrt_ph): Likewise.
10282 (_mm256_mask_rsqrt_ph): Likewise.
10283 (_mm_maskz_rsqrt_ph): Likewise.
10284 (_mm256_maskz_rsqrt_ph): Likewise.
10285 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10286 * config/i386/i386-builtin.def: Add corresponding new builtins.
10287 * config/i386/i386-expand.c
10288 (ix86_expand_args_builtin): Handle new builtins.
10289 (ix86_expand_round_builtin): Ditto.
10290 * config/i386/sse.md (VF_AVX512FP16VL): New.
10291 (sqrt<mode>2): Adjust for HF vector modes.
10292 (<sse>_sqrt<mode>2<mask_name><round_name>): Likewise.
10293 (<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
10295 (<sse>_rsqrt<mode>2<mask_name>): New.
10296 (avx512fp16_vmrsqrtv8hf2<mask_scalar_name>): Likewise.
10298 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
10300 PR bootstrap/101574
10301 * diagnostic-spec.c (warning_suppressed_at, copy_warning): Handle
10302 'RESERVED_LOCATION_P' locations.
10303 * warning-control.cc (get_nowarn_spec, suppress_warning)
10304 (copy_warning): Likewise.
10306 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
10308 * diagnostic-spec.h (typedef xint_hash_t): Use 'location_t' instead of...
10309 (typedef key_type_t): ... this. Remove.
10310 (nowarn_map): Document.
10311 * diagnostic-spec.c (nowarn_map): Likewise.
10312 * warning-control.cc (convert_to_key): Evolve functions into...
10313 (get_location): ... these. Adjust all users.
10315 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
10317 * warning-control.cc (copy_warning): Remove 'nowarn_map' setup.
10319 2021-09-13 Jason Merrill <jason@redhat.com>
10321 * params.opt: Add destructive-interference-size and
10322 constructive-interference-size.
10323 * doc/invoke.texi: Document them.
10324 * config/aarch64/aarch64.c (aarch64_override_options_internal):
10326 * config/arm/arm.c (arm_option_override): Set them.
10327 * config/i386/i386-options.c (ix86_option_override_internal):
10330 2021-09-13 Martin Liska <mliska@suse.cz>
10331 H.J. Lu <hjl.tools@gmail.com>
10334 * common/config/i386/cpuinfo.h (cpu_indicator_init): Add support
10335 for x86-64 micro levels for __builtin_cpu_supports.
10336 * common/config/i386/i386-cpuinfo.h (enum feature_priority):
10337 Add priorities for the micro-arch levels.
10338 (enum processor_features): Add new features.
10339 * common/config/i386/i386-isas.h: Add micro-arch features.
10340 * config/i386/i386-builtins.c (get_builtin_code_for_version):
10341 Support the micro-arch levels by callsing
10342 __builtin_cpu_supports.
10343 * doc/extend.texi: Document that the levels are support by
10344 __builtin_cpu_supports.
10346 2021-09-13 Andrew Pinski <apinski@marvell.com>
10349 * config/aarch64/aarch64-builtins.c (aarch64_fold_builtin_lane_check):
10351 (aarch64_general_fold_builtin): Handle AARCH64_SIMD_BUILTIN_LANE_CHECK.
10352 (aarch64_general_gimple_fold_builtin): Likewise.
10354 2021-09-13 Andrew Pinski <apinski@marvell.com>
10356 * config.gcc: Add m32r-*-linux* and m32rle-*-linux*
10357 to the Unsupported targets list.
10358 Remove support for m32r-*-linux* and m32rle-*-linux*.
10359 * config/m32r/linux.h: Removed.
10360 * config/m32r/t-linux: Removed.
10362 2021-09-13 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
10365 * config/aarch64/aarch64.c (aarch64_classify_address): Don't allow
10366 register index for SVE predicate modes.
10368 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
10370 * tree-ssa-threadbackward.c
10371 (back_threader_profitability::profitable_path_p): Remove FSM
10373 (back_threader_registry::register_path): Same.
10374 * tree-ssa-threadedge.c
10375 (jump_threader::simplify_control_stmt_condition): Same.
10376 * tree-ssa-threadupdate.c (jt_path_registry::jt_path_registry):
10377 Add backedge_threads argument.
10378 (fwd_jt_path_registry::fwd_jt_path_registry): Pass
10379 backedge_threads argument.
10380 (back_jt_path_registry::back_jt_path_registry): Same.
10381 (dump_jump_thread_path): Adjust for FSM removal.
10382 (back_jt_path_registry::rewire_first_differing_edge): Same.
10383 (back_jt_path_registry::adjust_paths_after_duplication): Same.
10384 (back_jt_path_registry::update_cfg): Same.
10385 (jt_path_registry::register_jump_thread): Same.
10386 * tree-ssa-threadupdate.h (enum jump_thread_edge_type): Remove
10388 (class back_jt_path_registry): Add backedge_threads to
10391 2021-09-13 Martin Liska <mliska@suse.cz>
10394 * asan.h (sanitize_coverage_p): Handle when fn == NULL.
10396 2021-09-13 H.J. Lu <hjl.tools@gmail.com>
10399 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): New.
10400 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
10401 (MOVE_MAX): Check TARGET_AVX256_MOVE_BY_PIECES and
10402 TARGET_AVX256_STORE_BY_PIECES instead of
10403 TARGET_AVX256_SPLIT_UNALIGNED_LOAD and
10404 TARGET_AVX256_SPLIT_UNALIGNED_STORE.
10405 (STORE_MAX_PIECES): Check TARGET_AVX256_STORE_BY_PIECES instead
10406 of TARGET_AVX256_SPLIT_UNALIGNED_STORE.
10407 * config/i386/x86-tune.def (X86_TUNE_AVX256_MOVE_BY_PIECES): New.
10408 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
10410 2021-09-13 liuhongt <hongtao.liu@intel.com>
10412 PR bootstrap/102302
10413 * expmed.c (extract_bit_field_using_extv): Use
10414 gen_lowpart_if_possible instead of gen_lowpart to avoid ICE.
10416 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
10418 * Makefile.in (OBJS): Add value-pointer-equiv.o.
10419 * gimple-ssa-evrp.c (class ssa_equiv_stack): Move to
10420 value-pointer-equiv.*.
10421 (ssa_equiv_stack::ssa_equiv_stack): Same.
10422 (ssa_equiv_stack::enter): Same.
10423 (ssa_equiv_stack::leave): Same.
10424 (ssa_equiv_stack::push_replacement): Same.
10425 (ssa_equiv_stack::get_replacement): Same.
10426 (is_pointer_ssa): Same.
10427 (class pointer_equiv_analyzer): Same.
10428 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
10429 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Same.
10430 (pointer_equiv_analyzer::set_global_equiv): Same.
10431 (pointer_equiv_analyzer::set_cond_equiv): Same.
10432 (pointer_equiv_analyzer::get_equiv): Same.
10433 (pointer_equiv_analyzer::enter): Same.
10434 (pointer_equiv_analyzer::leave): Same.
10435 (pointer_equiv_analyzer::get_equiv_expr): Same.
10436 (pta_valueize): Same.
10437 (pointer_equiv_analyzer::visit_stmt): Same.
10438 (pointer_equiv_analyzer::visit_edge): Same.
10439 (hybrid_folder::value_of_expr): Same.
10440 (hybrid_folder::value_on_edge): Same.
10441 * value-pointer-equiv.cc: New file.
10442 * value-pointer-equiv.h: New file.
10444 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
10447 * gimple-fold.c (gimple_fold_builtin_memory_op): Allow folding
10448 memcpy if the size is not more than MOVE_MAX * MOVE_RATIO.
10450 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
10453 * config/arm/arm.md (movmisaligndi): New define_expand.
10454 * config/arm/vec-common.md (movmisalign<mode>): Iterate over VDQ mode.
10456 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
10459 * emit-rtl.c (gen_highpart): Use adjust_address to handle
10460 MEM rather than calling simplify_gen_subreg.
10462 2021-09-13 Jan-Benedict Glaw <jbglaw@ług-owl.de>
10464 * config/alpha/vms.h (INIT_CUMULATIVE_ARGS): Wrap multi-statment
10465 define into a block.
10467 2021-09-13 Richard Biener <rguenther@suse.de>
10469 * config/darwin.h (DARWIN_PREFER_DWARF): Do not define.
10470 * config/i386/darwin.h (PREFERRED_DEBUGGING_TYPE): Do not
10471 change based on DARWIN_PREFER_DWARF not being defined.
10473 2021-09-13 Richard Biener <rguenther@suse.de>
10475 * config/i386/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
10476 to inherit from elfos.h
10478 2021-09-13 Richard Biener <rguenther@suse.de>
10480 * config.gcc: Add cr16-*-* to the list of obsoleted targets.
10482 2021-09-13 Richard Biener <rguenther@suse.de>
10484 * config/avr/elf.h (PREFERRED_DEBUGGING_TYPE): Remove
10485 override, pick up DWARF2_DEBUG define from elfos.h
10487 2021-09-13 Richard Biener <rguenther@suse.de>
10489 * config/rx/rx.h (PREFERRED_DEBUGGING_TYPE): Always define to
10492 2021-09-13 Richard Biener <rguenther@suse.de>
10494 * config/alpha/vms.h (PREFERRED_DEBUGGING_TYPE): Define to
10497 2021-09-13 Richard Biener <rguenther@suse.de>
10499 * config/i386/cygming.h: Always default to DWARF2 debugging.
10500 Do not define DBX_DEBUGGING_INFO, that's done via dbxcoff.h
10502 * doc/install.texi: Document binutils 2.16 as minimum
10503 requirement for mingw.
10505 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
10507 * config/rs6000/rs6000.c (struct rs6000_cost_data): New members
10508 nstmts, nloads and extra_ctor_cost.
10509 (rs6000_density_test): Add load density related heuristics. Do
10510 extra costing on vector construction statements if need.
10511 (rs6000_init_cost): Init new members.
10512 (rs6000_update_target_cost_per_stmt): New function.
10513 (rs6000_add_stmt_cost): Factor vect_nonmem hunk out to function
10514 rs6000_update_target_cost_per_stmt and call it.
10516 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
10518 * config/rs6000/rs6000.c (struct rs6000_cost_data): Remove typedef.
10519 (rs6000_init_cost): Adjust.
10521 2021-09-13 liuhongt <hongtao.liu@intel.com>
10523 * config/i386/i386.md: (UNSPEC_COPYSIGN): Remove.
10524 (UNSPEC_XORSIGN): Ditto.
10526 2021-09-12 Roger Sayle <roger@nextmovesoftware.com>
10528 * expr.c (convert_move): Preserve SUBREG_PROMOTED_VAR_P when
10529 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
10532 2021-09-11 Aldy Hernandez <aldyh@redhat.com>
10534 * tree-ssa-threadbackward.c (class back_threader_registry): Use
10535 back_jt_path_registry.
10536 * tree-ssa-threadedge.c (jump_threader::jump_threader): Use
10537 fwd_jt_path_registry.
10538 * tree-ssa-threadedge.h (class jump_threader): Same..
10539 * tree-ssa-threadupdate.c
10540 (jump_thread_path_registry::jump_thread_path_registry): Rename...
10541 (jt_path_registry::jt_path_registry): ...to this.
10542 (jump_thread_path_registry::~jump_thread_path_registry): Rename...
10543 (jt_path_registry::~jt_path_registry): ...this.
10544 (fwd_jt_path_registry::fwd_jt_path_registry): New.
10545 (fwd_jt_path_registry::~fwd_jt_path_registry): New.
10546 (jump_thread_path_registry::allocate_thread_edge): Rename...
10547 (jt_path_registry::allocate_thread_edge): ...to this.
10548 (jump_thread_path_registry::allocate_thread_path): Rename...
10549 (jt_path_registry::allocate_thread_path): ...to this.
10550 (jump_thread_path_registry::lookup_redirection_data): Rename...
10551 (fwd_jt_path_registry::lookup_redirection_data): ...to this.
10552 (jump_thread_path_registry::thread_block_1): Rename...
10553 (fwd_jt_path_registry::thread_block_1): ...to this.
10554 (jump_thread_path_registry::thread_block): Rename...
10555 (fwd_jt_path_registry::thread_block): ...to this.
10556 (jt_path_registry::thread_through_loop_header): Rename...
10557 (fwd_jt_path_registry::thread_through_loop_header): ...to this.
10558 (jump_thread_path_registry::mark_threaded_blocks): Rename...
10559 (fwd_jt_path_registry::mark_threaded_blocks): ...to this.
10560 (jump_thread_path_registry::debug_path): Rename...
10561 (jt_path_registry::debug_path): ...to this.
10562 (jump_thread_path_registry::dump): Rename...
10563 (jt_path_registry::debug): ...to this.
10564 (jump_thread_path_registry::rewire_first_differing_edge): Rename...
10565 (back_jt_path_registry::rewire_first_differing_edge): ...to this.
10566 (jump_thread_path_registry::adjust_paths_after_duplication): Rename...
10567 (back_jt_path_registry::adjust_paths_after_duplication): ...to this.
10568 (jump_thread_path_registry::duplicate_thread_path): Rename...
10569 (back_jt_path_registry::duplicate_thread_path): ...to this. Also,
10570 drop ill-formed candidates.
10571 (jump_thread_path_registry::remove_jump_threads_including): Rename...
10572 (fwd_jt_path_registry::remove_jump_threads_including): ...to this.
10573 (jt_path_registry::thread_through_all_blocks): New.
10574 (back_jt_path_registry::update_cfg): New.
10575 (fwd_jt_path_registry::update_cfg): New.
10576 (jump_thread_path_registry::register_jump_thread): Rename...
10577 (jt_path_registry::register_jump_thread): ...to this.
10578 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
10580 (class jt_path_registry): ...here.
10581 (class fwd_jt_path_registry): New.
10582 (class back_jt_path_registry): New.
10584 2021-09-10 liuhongt <hongtao.liu@intel.com>
10587 2021-09-01 liuhongt <hongtao.liu@intel.com>
10589 * emit-rtl.c (validate_subreg): Get rid of all float-int
10592 2021-09-10 Jakub Jelinek <jakub@redhat.com>
10594 * tree-core.h (enum omp_memory_order): Add OMP_MEMORY_ORDER_MASK,
10595 OMP_FAIL_MEMORY_ORDER_UNSPECIFIED, OMP_FAIL_MEMORY_ORDER_RELAXED,
10596 OMP_FAIL_MEMORY_ORDER_ACQUIRE, OMP_FAIL_MEMORY_ORDER_RELEASE,
10597 OMP_FAIL_MEMORY_ORDER_ACQ_REL, OMP_FAIL_MEMORY_ORDER_SEQ_CST and
10598 OMP_FAIL_MEMORY_ORDER_MASK enumerators.
10599 (OMP_FAIL_MEMORY_ORDER_SHIFT): Define.
10600 * gimple-pretty-print.c (dump_gimple_omp_atomic_load,
10601 dump_gimple_omp_atomic_store): Print [weak] for weak atomic
10603 * gimple.h (enum gf_mask): Change GF_OMP_ATOMIC_MEMORY_ORDER
10604 to 6-bit mask, adjust GF_OMP_ATOMIC_NEED_VALUE value and add
10605 GF_OMP_ATOMIC_WEAK.
10606 (gimple_omp_atomic_weak_p, gimple_omp_atomic_set_weak): New inline
10608 * tree.h (OMP_ATOMIC_WEAK): Define.
10609 * tree-pretty-print.c (dump_omp_atomic_memory_order): Adjust for
10610 fail memory order being encoded in the same enum and also print
10611 fail clause if present.
10612 (dump_generic_node): Print weak clause if OMP_ATOMIC_WEAK.
10613 * gimplify.c (goa_stabilize_expr): Add target_expr and rhs arguments,
10614 handle pre_p == NULL case as a test mode that only returns value
10615 but doesn't change gimplify nor change anything otherwise, adjust
10616 recursive calls, add MODIFY_EXPR, ADDR_EXPR, COND_EXPR, TARGET_EXPR
10617 and CALL_EXPR handling, adjust COMPOUND_EXPR handling for
10618 __builtin_clear_padding calls, for !rhs gimplify as lvalue rather
10620 (gimplify_omp_atomic): Adjust goa_stabilize_expr caller. Handle
10621 COND_EXPR rhs. Set weak flag on gimple load/store for
10623 * omp-expand.c (omp_memory_order_to_fail_memmodel): New function.
10624 (omp_memory_order_to_memmodel): Adjust for fail clause encoded
10626 (expand_omp_atomic_cas): New function.
10627 (expand_omp_atomic_pipeline): Use omp_memory_order_to_fail_memmodel
10629 (expand_omp_atomic): Attempt to optimize atomic compare and exchange
10630 using expand_omp_atomic_cas.
10632 2021-09-10 Aldy Hernandez <aldyh@redhat.com>
10633 Michael Matz <matz@suse.de>
10635 * tree-pass.h (PROP_loop_opts_done): New.
10636 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
10637 Intersect with global range.
10638 * tree-ssa-loop.c (tree_ssa_loop_done): Set PROP_loop_opts_done.
10639 * tree-ssa-threadbackward.c
10640 (back_threader_profitability::profitable_path_p): Disable
10641 threading through latches until after loop optimizations have run.
10643 2021-09-10 David Faust <david.faust@oracle.com>
10645 * doc/invoke.texi: Document BPF -mcpu, -mjmpext, -mjmp32 and -malu32
10648 2021-09-10 David Faust <david.faust@oracle.com>
10650 * config/bpf/bpf-opts.h (bpf_isa_version): New enum.
10651 * config/bpf/bpf-protos.h (bpf_expand_cbranch): New.
10652 * config/bpf/bpf.c (bpf_option_override): Handle -mcpu option.
10653 (bpf_expand_cbranch): New function.
10654 * config/bpf/bpf.md (AM mode iterator): Conditionalize support for SI
10656 (zero_extendsidi2): Only use mov32 instruction if it is available.
10657 (SIM mode iterator): Conditionalize support for SI mode.
10658 (JM mode iterator): New.
10659 (cbranchdi4): Update name, use new JM iterator. Use bpf_expand_cbranch.
10660 (*branch_on_di): Update name, use new JM iterator.
10661 * config/bpf/bpf.opt: (mjmpext): New option.
10662 (malu32): Likewise.
10663 (mjmp32): Likewise.
10665 (bpf_isa): New enum.
10667 2021-09-10 David Faust <david.faust@oracle.com>
10669 * config/bpf/bpf.md (zero_extendhidi2): Add new output template
10670 for register-to-register extensions.
10671 (zero_extendqidi2): Likewise.
10673 2021-09-10 Richard Biener <rguenther@suse.de>
10675 PR middle-end/102273
10676 * internal-fn.c (expand_DEFERRED_INIT): Always expand non-SSA vars.
10678 2021-09-10 Richard Biener <rguenther@suse.de>
10680 PR middle-end/102269
10681 * gimplify.c (is_var_need_auto_init): Empty types do not need
10684 2021-09-10 Richard Biener <rguenther@suse.de>
10686 * configure.ac (--with-stabs): Remove.
10687 * configure: Regenerate.
10688 * doc/install.texi: Remove --with-stabs documentation.
10690 2021-09-10 liuhongt <hongtao.liu@intel.com>
10692 * config/i386/avx512fp16intrin.h: (_mm512_cmp_ph_mask):
10694 (_mm512_mask_cmp_ph_mask): Likewise.
10695 (_mm512_cmp_round_ph_mask): Likewise.
10696 (_mm512_mask_cmp_round_ph_mask): Likewise.
10697 (_mm_cmp_sh_mask): Likewise.
10698 (_mm_mask_cmp_sh_mask): Likewise.
10699 (_mm_cmp_round_sh_mask): Likewise.
10700 (_mm_mask_cmp_round_sh_mask): Likewise.
10701 (_mm_comieq_sh): Likewise.
10702 (_mm_comilt_sh): Likewise.
10703 (_mm_comile_sh): Likewise.
10704 (_mm_comigt_sh): Likewise.
10705 (_mm_comige_sh): Likewise.
10706 (_mm_comineq_sh): Likewise.
10707 (_mm_ucomieq_sh): Likewise.
10708 (_mm_ucomilt_sh): Likewise.
10709 (_mm_ucomile_sh): Likewise.
10710 (_mm_ucomigt_sh): Likewise.
10711 (_mm_ucomige_sh): Likewise.
10712 (_mm_ucomineq_sh): Likewise.
10713 (_mm_comi_round_sh): Likewise.
10714 (_mm_comi_sh): Likewise.
10715 * config/i386/avx512fp16vlintrin.h (_mm_cmp_ph_mask): New intrinsic.
10716 (_mm_mask_cmp_ph_mask): Likewise.
10717 (_mm256_cmp_ph_mask): Likewise.
10718 (_mm256_mask_cmp_ph_mask): Likewise.
10719 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10720 * config/i386/i386-builtin.def: Add corresponding new builtins.
10721 * config/i386/i386-expand.c
10722 (ix86_expand_args_builtin): Handle new builtin types.
10723 (ix86_expand_round_builtin): Ditto.
10724 * config/i386/i386.md (ssevecmode): Add HF mode.
10725 (MODEFH): New mode iterator.
10726 * config/i386/sse.md
10727 (V48H_AVX512VL): New mode iterator to support HF vector modes.
10728 Ajdust corresponding description.
10729 (ssecmpintprefix): New.
10730 (VI12_AVX512VL): Adjust to support HF vector modes.
10731 (cmp_imm_predicate): Likewise.
10732 (<avx512>_cmp<mode>3<mask_scalar_merge_name><round_saeonly_name>):
10734 (avx512f_vmcmp<mode>3<round_saeonly_name>): Likewise.
10735 (avx512f_vmcmp<mode>3_mask<round_saeonly_name>): Likewise.
10736 (<sse>_<unord>comi<round_saeonly_name>): Likewise.
10738 2021-09-10 liuhongt <hongtao.liu@intel.com>
10740 * config/i386/avx512fp16intrin.h: (_mm512_max_ph): New intrinsic.
10741 (_mm512_mask_max_ph): Likewise.
10742 (_mm512_maskz_max_ph): Likewise.
10743 (_mm512_min_ph): Likewise.
10744 (_mm512_mask_min_ph): Likewise.
10745 (_mm512_maskz_min_ph): Likewise.
10746 (_mm512_max_round_ph): Likewise.
10747 (_mm512_mask_max_round_ph): Likewise.
10748 (_mm512_maskz_max_round_ph): Likewise.
10749 (_mm512_min_round_ph): Likewise.
10750 (_mm512_mask_min_round_ph): Likewise.
10751 (_mm512_maskz_min_round_ph): Likewise.
10752 (_mm_max_sh): Likewise.
10753 (_mm_mask_max_sh): Likewise.
10754 (_mm_maskz_max_sh): Likewise.
10755 (_mm_min_sh): Likewise.
10756 (_mm_mask_min_sh): Likewise.
10757 (_mm_maskz_min_sh): Likewise.
10758 (_mm_max_round_sh): Likewise.
10759 (_mm_mask_max_round_sh): Likewise.
10760 (_mm_maskz_max_round_sh): Likewise.
10761 (_mm_min_round_sh): Likewise.
10762 (_mm_mask_min_round_sh): Likewise.
10763 (_mm_maskz_min_round_sh): Likewise.
10764 * config/i386/avx512fp16vlintrin.h (_mm_max_ph): New intrinsic.
10765 (_mm256_max_ph): Likewise.
10766 (_mm_mask_max_ph): Likewise.
10767 (_mm256_mask_max_ph): Likewise.
10768 (_mm_maskz_max_ph): Likewise.
10769 (_mm256_maskz_max_ph): Likewise.
10770 (_mm_min_ph): Likewise.
10771 (_mm256_min_ph): Likewise.
10772 (_mm_mask_min_ph): Likewise.
10773 (_mm256_mask_min_ph): Likewise.
10774 (_mm_maskz_min_ph): Likewise.
10775 (_mm256_maskz_min_ph): Likewise.
10776 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10777 * config/i386/i386-builtin.def: Add corresponding new builtins.
10778 * config/i386/i386-expand.c
10779 (ix86_expand_args_builtin): Handle new builtin types.
10780 * config/i386/sse.md
10781 (<code><mode>3<mask_name><round_saeonly_name>): Adjust to
10782 support HF vector modes.
10783 (*<code><mode>3<mask_name><round_saeonly_name>): Likewise.
10784 (ieee_<ieee_maxmin><mode>3<mask_name><round_saeonly_name>):
10786 (<sse>_vm<code><mode>3<mask_scalar_name><round_saeonly_scalar_name>):
10788 * config/i386/subst.md (round_saeonly_mode512bit_condition):
10789 Adjust for HF vector modes.
10791 2021-09-10 Liu, Hongtao <hongtao.liu@intel.com>
10793 * config/i386/avx512fp16intrin.h (_mm_add_sh): New intrinsic.
10794 (_mm_mask_add_sh): Likewise.
10795 (_mm_maskz_add_sh): Likewise.
10796 (_mm_sub_sh): Likewise.
10797 (_mm_mask_sub_sh): Likewise.
10798 (_mm_maskz_sub_sh): Likewise.
10799 (_mm_mul_sh): Likewise.
10800 (_mm_mask_mul_sh): Likewise.
10801 (_mm_maskz_mul_sh): Likewise.
10802 (_mm_div_sh): Likewise.
10803 (_mm_mask_div_sh): Likewise.
10804 (_mm_maskz_div_sh): Likewise.
10805 (_mm_add_round_sh): Likewise.
10806 (_mm_mask_add_round_sh): Likewise.
10807 (_mm_maskz_add_round_sh): Likewise.
10808 (_mm_sub_round_sh): Likewise.
10809 (_mm_mask_sub_round_sh): Likewise.
10810 (_mm_maskz_sub_round_sh): Likewise.
10811 (_mm_mul_round_sh): Likewise.
10812 (_mm_mask_mul_round_sh): Likewise.
10813 (_mm_maskz_mul_round_sh): Likewise.
10814 (_mm_div_round_sh): Likewise.
10815 (_mm_mask_div_round_sh): Likewise.
10816 (_mm_maskz_div_round_sh): Likewise.
10817 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10818 * config/i386/i386-builtin.def: Add corresponding new builtins.
10819 * config/i386/i386-expand.c
10820 (ix86_expand_round_builtin): Handle new builtins.
10821 * config/i386/sse.md (VF_128): Change description.
10822 (<sse>_vm<plusminus_insn><mode>3<mask_scalar_name><round_scalar_name>):
10823 Adjust to support HF vector modes.
10824 (<sse>_vm<multdiv_mnemonic><mode>3<mask_scalar_name><round_scalar_name>):
10827 2021-09-10 H.J. Lu <hjl.tools@gmail.com>
10829 * config/i386/i386-expand.c
10830 (ix86_avx256_split_vector_move_misalign): Handle V16HF mode.
10831 * config/i386/i386.c
10832 (ix86_preferred_simd_mode): Handle HF mode.
10833 * config/i386/sse.md (V_256H): New mode iterator.
10834 (avx_vextractf128<mode>): Use it.
10835 (VEC_INIT_MODE): Align vector HFmode condition to vector
10836 HImodes since there're no real HF instruction used.
10837 (VEC_INIT_HALF_MODE): Ditto.
10839 (VIHF_AVX512BW): Ditto.
10840 (*vec_extracthf): Ditto.
10841 (VEC_EXTRACT_MODE): Ditto.
10843 2021-09-10 Richard Biener <rguenther@suse.de>
10846 * config/dbx.h: Remove.
10847 * config/dbxcoff.h: Do not define PREFERRED_DEBUGGING_TYPE.
10848 * config/lynx.h: Likewise.
10850 2021-09-10 liuhongt <hongtao.liu@intel.com>
10852 * config/i386/i386-expand.c (ix86_expand_copysign): Expand
10853 right into ANDNOT + AND + IOR, using paradoxical subregs.
10854 (ix86_split_copysign_const): Remove.
10855 (ix86_split_copysign_var): Ditto.
10856 * config/i386/i386-protos.h (ix86_split_copysign_const): Dotto.
10857 (ix86_split_copysign_var): Ditto.
10858 * config/i386/i386.md (@copysign<mode>3_const): Ditto.
10859 (@copysign<mode>3_var): Ditto.
10861 2021-09-09 qing zhao <qing.zhao@oracle.com>
10863 * builtins.c (expand_builtin_memset): Make external visible.
10864 * builtins.h (expand_builtin_memset): Declare extern.
10865 * common.opt (ftrivial-auto-var-init=): New option.
10866 * doc/extend.texi: Document the uninitialized attribute.
10867 * doc/invoke.texi: Document -ftrivial-auto-var-init.
10868 * flag-types.h (enum auto_init_type): New enumerated type
10870 * gimple-fold.c (clear_padding_type): Add one new parameter.
10871 (clear_padding_union): Likewise.
10872 (clear_padding_emit_loop): Likewise.
10873 (clear_type_padding_in_mask): Likewise.
10874 (gimple_fold_builtin_clear_padding): Handle this new parameter.
10875 * gimplify.c (gimple_add_init_for_auto_var): New function.
10876 (gimple_add_padding_init_for_auto_var): New function.
10877 (is_var_need_auto_init): New function.
10878 (gimplify_decl_expr): Add initialization to automatic variables per
10880 (gimplify_call_expr): Add one new parameter for call to
10881 __builtin_clear_padding.
10882 (gimplify_init_constructor): Add padding initialization in the end.
10883 * internal-fn.c (INIT_PATTERN_VALUE): New macro.
10884 (expand_DEFERRED_INIT): New function.
10885 * internal-fn.def (DEFERRED_INIT): New internal function.
10886 * tree-cfg.c (verify_gimple_call): Verify calls to .DEFERRED_INIT.
10887 * tree-sra.c (generate_subtree_deferred_init): New function.
10888 (scan_function): Avoid setting cannot_scalarize_away_bitmap for
10889 calls to .DEFERRED_INIT.
10890 (sra_modify_deferred_init): New function.
10891 (sra_modify_function_body): Handle calls to DEFERRED_INIT specially.
10892 * tree-ssa-structalias.c (find_func_aliases_for_call): Likewise.
10893 * tree-ssa-uninit.c (warn_uninit): Handle calls to DEFERRED_INIT
10895 (check_defs): Likewise.
10896 (warn_uninitialized_vars): Likewise.
10897 * tree-ssa.c (ssa_undefined_value_p): Likewise.
10898 * tree.c (build_common_builtin_nodes): Build tree node for
10899 BUILT_IN_CLEAR_PADDING when needed.
10901 2021-09-09 Richard Biener <rguenther@suse.de>
10903 * tree-ssa-loop-im.c (fill_always_executed_in_1): Walk
10906 2021-09-09 Richard Biener <rguenther@suse.de>
10908 * tree-ssa-loop-im.c (fill_always_executed_in_1): Integrate
10909 DOM walk from get_loop_body_in_dom_order using a worklist
10912 2021-09-09 liuhongt <hongtao.liu@intel.com>
10914 * config.gcc: Add avx512fp16vlintrin.h.
10915 * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic.
10916 (_mm512_mask_add_ph): Likewise.
10917 (_mm512_maskz_add_ph): Likewise.
10918 (_mm512_sub_ph): Likewise.
10919 (_mm512_mask_sub_ph): Likewise.
10920 (_mm512_maskz_sub_ph): Likewise.
10921 (_mm512_mul_ph): Likewise.
10922 (_mm512_mask_mul_ph): Likewise.
10923 (_mm512_maskz_mul_ph): Likewise.
10924 (_mm512_div_ph): Likewise.
10925 (_mm512_mask_div_ph): Likewise.
10926 (_mm512_maskz_div_ph): Likewise.
10927 (_mm512_add_round_ph): Likewise.
10928 (_mm512_mask_add_round_ph): Likewise.
10929 (_mm512_maskz_add_round_ph): Likewise.
10930 (_mm512_sub_round_ph): Likewise.
10931 (_mm512_mask_sub_round_ph): Likewise.
10932 (_mm512_maskz_sub_round_ph): Likewise.
10933 (_mm512_mul_round_ph): Likewise.
10934 (_mm512_mask_mul_round_ph): Likewise.
10935 (_mm512_maskz_mul_round_ph): Likewise.
10936 (_mm512_div_round_ph): Likewise.
10937 (_mm512_mask_div_round_ph): Likewise.
10938 (_mm512_maskz_div_round_ph): Likewise.
10939 * config/i386/avx512fp16vlintrin.h: New header.
10940 * config/i386/i386-builtin-types.def (V16HF, V8HF, V32HF):
10941 Add new builtin types.
10942 * config/i386/i386-builtin.def: Add corresponding builtins.
10943 * config/i386/i386-expand.c
10944 (ix86_expand_args_builtin): Handle new builtin types.
10945 (ix86_expand_round_builtin): Likewise.
10946 * config/i386/immintrin.h: Include avx512fp16vlintrin.h
10947 * config/i386/sse.md (VFH): New mode_iterator.
10949 (avx512fmaskmode): Add HF vector modes.
10950 (avx512fmaskhalfmode): Likewise.
10951 (<plusminus_insn><mode>3<mask_name><round_name>): Adjust to for
10953 (*<plusminus_insn><mode>3<mask_name><round_name>): Likewise.
10954 (mul<mode>3<mask_name><round_name>): Likewise.
10955 (*mul<mode>3<mask_name><round_name>): Likewise.
10956 (div<mode>3): Likewise.
10957 (<sse>_div<mode>3<mask_name><round_name>): Likewise.
10958 * config/i386/subst.md (SUBST_V): Add HF vector modes.
10959 (SUBST_A): Likewise.
10960 (round_mode512bit_condition): Adjust for V32HFmode.
10962 2021-09-09 liuhongt <hongtao.liu@intel.com>
10965 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
10966 (reduc_plus_scal_v4sf): .. this, New define_expand.
10967 (reduc_plus_scal_v2df): .. and this, New define_expand.
10969 2021-09-09 liuhongt <hongtao.liu@intel.com>
10972 * config/i386/sse.md (*vec_extract<mode><ssescalarmodelower>_valign):
10975 2021-09-08 Jonathan Wakely <jwakely@redhat.com>
10978 * doc/trouble.texi (Copy Assignment): Fix description of
10979 behaviour and fix code in example.
10981 2021-09-08 Segher Boessenkool <segher@kernel.crashing.org>
10984 * config/rs6000/rs6000-logue.c (rs6000_emit_epilogue): For ELFv2 use
10985 r11 instead of r12 for restoring CR.
10987 2021-09-08 Jakub Jelinek <jakub@redhat.com>
10988 liuhongt <hongtao.liu@intel.com>
10991 * config/i386/i386.md (@xorsign<mode>3_1): Remove.
10992 * config/i386/i386-expand.c (ix86_expand_xorsign): Expand right away
10993 into AND with mask and XOR, using paradoxical subregs.
10994 (ix86_split_xorsign): Remove.
10995 * config/i386/i386-protos.h (ix86_split_xorsign): Remove.
10997 2021-09-08 Di Zhao <dizhao@os.amperecomputing.com>
10999 * tree-ssa-sccvn.c (vn_nary_op_insert_into): fix result compare
11001 2021-09-08 Jakub Jelinek <jakub@redhat.com>
11004 * config/i386/i386.md (xorsign<mode>3): If operands[1] is equal to
11005 operands[2], emit abs<mode>2 instead.
11006 (@xorsign<mode>3_1): Add early-clobbers for output operand, enable
11007 first alternative even for avx, add another alternative with
11008 =&Yv <- 0, Yv, Yvm constraints.
11009 * config/i386/i386-expand.c (ix86_split_xorsign): If op0 is equal
11010 to op1, emit vpandn instead.
11012 2021-09-08 liuhongt <hongtao.liu@intel.com>
11014 * config/i386/avx512fp16intrin.h (_mm_set_ph): New intrinsic.
11015 (_mm256_set_ph): Likewise.
11016 (_mm512_set_ph): Likewise.
11017 (_mm_setr_ph): Likewise.
11018 (_mm256_setr_ph): Likewise.
11019 (_mm512_setr_ph): Likewise.
11020 (_mm_set1_ph): Likewise.
11021 (_mm256_set1_ph): Likewise.
11022 (_mm512_set1_ph): Likewise.
11023 (_mm_setzero_ph): Likewise.
11024 (_mm256_setzero_ph): Likewise.
11025 (_mm512_setzero_ph): Likewise.
11026 (_mm_set_sh): Likewise.
11027 (_mm_load_sh): Likewise.
11028 (_mm_store_sh): Likewise.
11029 * config/i386/i386-builtin-types.def (V8HF): New type.
11030 (DEF_FUNCTION_TYPE (V8HF, V8HI)): New builtin function type
11031 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
11032 Support vector HFmodes.
11033 (ix86_expand_vector_init_one_nonzero): Likewise.
11034 (ix86_expand_vector_init_one_var): Likewise.
11035 (ix86_expand_vector_init_interleave): Likewise.
11036 (ix86_expand_vector_init_general): Likewise.
11037 (ix86_expand_vector_set): Likewise.
11038 (ix86_expand_vector_extract): Likewise.
11039 (ix86_expand_vector_init_concat): Likewise.
11040 (ix86_expand_sse_movcc): Handle vector HFmodes.
11041 (ix86_expand_vector_set_var): Ditto.
11042 * config/i386/i386-modes.def: Add HF vector modes in comment.
11043 * config/i386/i386.c (classify_argument): Add HF vector modes.
11044 (ix86_hard_regno_mode_ok): Allow HF vector modes for AVX512FP16.
11045 (ix86_vector_mode_supported_p): Likewise.
11046 (ix86_set_reg_reg_cost): Handle vector HFmode.
11047 (ix86_get_ssemov): Handle vector HFmode.
11048 (function_arg_advance_64): Pass unamed V16HFmode and V32HFmode
11050 (function_arg_advance_32): Pass V8HF/V16HF/V32HF by sse reg for 32bit
11052 (function_arg_advance_32): Ditto.
11053 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): New.
11054 (VALID_AVX256_REG_OR_OI_MODE): Rename to ..
11055 (VALID_AVX256_REG_OR_OI_VHF_MODE): .. this, and add V16HF.
11056 (VALID_SSE2_REG_VHF_MODE): New.
11057 (VALID_AVX512VL_128_REG_MODE): Add V8HF and TImode.
11058 (SSE_REG_MODE_P): Add vector HFmode.
11059 * config/i386/i386.md (mode): Add HF vector modes.
11060 (MODE_SIZE): Likewise.
11061 (ssemodesuffix): Add ph suffix for HF vector modes.
11062 * config/i386/sse.md (VFH_128): New mode iterator.
11063 (VMOVE): Adjust for HF vector modes.
11065 (V_256_512): Likewise.
11066 (avx512): Likewise.
11067 (avx512fmaskmode): Likewise.
11068 (shuffletype): Likewise.
11069 (sseinsnmode): Likewise.
11070 (ssedoublevecmode): Likewise.
11071 (ssehalfvecmode): Likewise.
11072 (ssehalfvecmodelower): Likewise.
11073 (ssePScmode): Likewise.
11074 (ssescalarmode): Likewise.
11075 (ssescalarmodelower): Likewise.
11076 (sseintprefix): Likewise.
11078 (bcstscalarsuff): Likewise.
11079 (xtg_mode): Likewise.
11080 (VI12HF_AVX512VL): New mode_iterator.
11081 (VF_AVX512FP16): Likewise.
11083 (VIHF_256): Likewise.
11084 (VIHF_AVX512BW): Likewise.
11085 (V16_256): Likewise.
11086 (V32_512): Likewise.
11087 (sseintmodesuffix): New mode_attr.
11088 (sse): Add scalar and vector HFmodes.
11089 (ssescalarmode): Add vector HFmode mapping.
11090 (ssescalarmodesuffix): Add sh suffix for HFmode.
11091 (*<sse>_vm<insn><mode>3): Use VFH_128.
11092 (*<sse>_vm<multdiv_mnemonic><mode>3): Likewise.
11093 (*ieee_<ieee_maxmin><mode>3): Likewise.
11094 (<avx512>_blendm<mode>): New define_insn.
11095 (vec_setv8hf): New define_expand.
11096 (vec_set<mode>_0): New define_insn for HF vector set.
11097 (*avx512fp16_movsh): Likewise.
11098 (avx512fp16_movsh): Likewise.
11099 (vec_extract_lo_v32hi): Rename to ...
11100 (vec_extract_lo_<mode>): ... this, and adjust to allow HF
11102 (vec_extract_hi_v32hi): Likewise.
11103 (vec_extract_hi_<mode>): Likewise.
11104 (vec_extract_lo_v16hi): Likewise.
11105 (vec_extract_lo_<mode>): Likewise.
11106 (vec_extract_hi_v16hi): Likewise.
11107 (vec_extract_hi_<mode>): Likewise.
11108 (vec_set_hi_v16hi): Likewise.
11109 (vec_set_hi_<mode>): Likewise.
11110 (vec_set_lo_v16hi): Likewise.
11111 (vec_set_lo_<mode>): Likewise.
11112 (*vec_extract<mode>_0): New define_insn_and_split for HF
11114 (*vec_extracthf): New define_insn.
11115 (VEC_EXTRACT_MODE): Add HF vector modes.
11116 (PINSR_MODE): Add V8HF.
11117 (sse2p4_1): Likewise.
11118 (pinsr_evex_isa): Likewise.
11119 (<sse2p4_1>_pinsr<ssemodesuffix>): Adjust to support
11120 insert for V8HFmode.
11121 (pbroadcast_evex_isa): Add HF vector modes.
11122 (AVX2_VEC_DUP_MODE): Likewise.
11123 (VEC_INIT_MODE): Likewise.
11124 (VEC_INIT_HALF_MODE): Likewise.
11125 (avx2_pbroadcast<mode>): Adjust to support HF vector mode
11127 (avx2_pbroadcast<mode>_1): Likewise.
11128 (<avx512>_vec_dup<mode>_1): Likewise.
11129 (<avx512>_vec_dup<mode><mask_name>): Likewise.
11130 (<mask_codefor><avx512>_vec_dup_gpr<mode><mask_name>):
11133 2021-09-08 Guo, Xuepeng <xuepeng.guo@intel.com>
11134 H.J. Lu <hongjiu.lu@intel.com>
11135 Liu Hongtao <hongtao.liu@intel.com>
11136 Wang Hongyu <hongyu.wang@intel.com>
11137 Xu Dianhong <dianhong.xu@intel.com>
11139 * common/config/i386/cpuinfo.h (get_available_features):
11140 Detect FEATURE_AVX512FP16.
11141 * common/config/i386/i386-common.c
11142 (OPTION_MASK_ISA_AVX512FP16_SET,
11143 OPTION_MASK_ISA_AVX512FP16_UNSET,
11144 OPTION_MASK_ISA2_AVX512FP16_SET,
11145 OPTION_MASK_ISA2_AVX512FP16_UNSET): New.
11146 (OPTION_MASK_ISA2_AVX512BW_UNSET,
11147 OPTION_MASK_ISA2_AVX512BF16_UNSET): Add AVX512FP16.
11148 (ix86_handle_option): Handle -mavx512fp16.
11149 * common/config/i386/i386-cpuinfo.h (enum processor_features):
11150 Add FEATURE_AVX512FP16.
11151 * common/config/i386/i386-isas.h: Add entry for AVX512FP16.
11152 * config.gcc: Add avx512fp16intrin.h.
11153 * config/i386/avx512fp16intrin.h: New intrinsic header.
11154 * config/i386/cpuid.h: Add bit_AVX512FP16.
11155 * config/i386/i386-builtin-types.def: (FLOAT16): New primitive type.
11156 * config/i386/i386-builtins.c: Support _Float16 type for i386
11158 (ix86_register_float16_builtin_type): New function.
11159 (ix86_float16_type_node): New.
11160 * config/i386/i386-c.c (ix86_target_macros_internal): Define
11162 * config/i386/i386-expand.c (ix86_expand_branch): Support
11164 (ix86_prepare_fp_compare_args): Adjust TARGET_SSE_MATH &&
11165 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
11166 (ix86_expand_fp_movcc): Ditto.
11167 * config/i386/i386-isa.def: Add PTA define for AVX512FP16.
11168 * config/i386/i386-options.c (isa2_opts): Add -mavx512fp16.
11169 (ix86_valid_target_attribute_inner_p): Add avx512fp16 attribute.
11170 * config/i386/i386.c (ix86_get_ssemov): Use
11171 vmovdqu16/vmovw/vmovsh for HFmode/HImode scalar or vector.
11172 (ix86_get_excess_precision): Use
11173 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when TARGET_AVX512FP16
11175 (sse_store_index): Use SFmode cost for HFmode cost.
11176 (inline_memory_move_cost): Add HFmode, and perfer SSE cost over
11177 GPR cost for HFmode.
11178 (ix86_hard_regno_mode_ok): Allow HImode in sse register.
11179 (ix86_mangle_type): Add manlging for _Float16 type.
11180 (inline_secondary_memory_needed): No memory is needed for
11181 16bit movement between gpr and sse reg under
11183 (ix86_multiplication_cost): Adjust TARGET_SSE_MATH &&
11184 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
11185 (ix86_division_cost): Ditto.
11186 (ix86_rtx_costs): Ditto.
11187 (ix86_add_stmt_cost): Ditto.
11188 (ix86_optab_supported_p): Ditto.
11189 * config/i386/i386.h (VALID_AVX512F_SCALAR_MODE): Add HFmode.
11190 (SSE_FLOAT_MODE_SSEMATH_OR_HF_P): Add HFmode.
11191 (PTA_SAPPHIRERAPIDS): Add PTA_AVX512FP16.
11192 * config/i386/i386.md (mode): Add HFmode.
11193 (MODE_SIZE): Add HFmode.
11194 (isa): Add avx512fp16.
11195 (enabled): Handle avx512fp16.
11196 (ssemodesuffix): Add sh suffix for HFmode.
11197 (comm): Add mult, div.
11198 (plusminusmultdiv): New code iterator.
11199 (insn): Add mult, div.
11200 (*movhf_internal): Adjust for avx512fp16 instruction.
11201 (*movhi_internal): Ditto.
11202 (*cmpi<unord>hf): New define_insn for HFmode.
11203 (*ieee_s<ieee_maxmin>hf3): Likewise.
11204 (extendhf<mode>2): Likewise.
11205 (trunc<mode>hf2): Likewise.
11206 (float<floatunssuffix><mode>hf2): Likewise.
11207 (*<insn>hf): Likewise.
11208 (cbranchhf4): New expander.
11209 (movhfcc): Likewise.
11210 (<insn>hf3): Likewise.
11211 (mulhf3): Likewise.
11212 (divhf3): Likewise.
11213 * config/i386/i386.opt: Add mavx512fp16.
11214 * config/i386/immintrin.h: Include avx512fp16intrin.h.
11215 * doc/invoke.texi: Add mavx512fp16.
11216 * doc/extend.texi: Add avx512fp16 Usage Notes.
11218 2021-09-08 liuhongt <hongtao.liu@intel.com>
11220 * common.opt: Support -fexcess-precision=16.
11221 * config/aarch64/aarch64.c (aarch64_excess_precision): Return
11222 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when
11223 EXCESS_PRECISION_TYPE_FLOAT16.
11224 * config/arm/arm.c (arm_excess_precision): Ditto.
11225 * config/i386/i386.c (ix86_get_excess_precision): Ditto.
11226 * config/m68k/m68k.c (m68k_excess_precision): Issue an error
11227 when EXCESS_PRECISION_TYPE_FLOAT16.
11228 * config/s390/s390.c (s390_excess_precision): Ditto.
11229 * coretypes.h (enum excess_precision_type): Add
11230 EXCESS_PRECISION_TYPE_FLOAT16.
11231 * doc/tm.texi (TARGET_C_EXCESS_PRECISION): Update documents.
11232 * doc/tm.texi.in (TARGET_C_EXCESS_PRECISION): Ditto.
11233 * doc/extend.texi (Half-Precision): Document
11234 -fexcess-precision=16.
11235 * flag-types.h (enum excess_precision): Add
11236 EXCESS_PRECISION_FLOAT16.
11237 * target.def (excess_precision): Update document.
11238 * tree.c (excess_precision_type): Set excess_precision_type to
11239 EXCESS_PRECISION_FLOAT16 when -fexcess-precision=16.
11241 2021-09-08 liuhongt <hongtao.liu@intel.com>
11243 * doc/extend.texi: (@node Floating Types): Adjust the wording.
11244 (@node Half-Precision): Ditto.
11246 2021-09-07 Takayuki 'January June' Suwa <jjsuwa_sys3175@yahoo.co.jp>
11249 * config/xtensa/xtensa.c (xtensa_emit_move_sequence): Add
11250 'CONST_INT_P (src)' to the condition of the block that tries to
11251 eliminate literal when loading integer contant.
11253 2021-09-07 David Faust <david.faust@oracle.com>
11255 * doc/extend.texi (BPF Type Attributes) New node.
11256 Document new preserve_access_index attribute.
11257 Document new preserve_access_index builtin.
11258 * doc/invoke.texi: Document -mco-re and -mno-co-re options.
11260 2021-09-07 David Faust <david.faust@oracle.com>
11262 * config/bpf/bpf.c: Adjust includes.
11263 (bpf_handle_preserve_access_index_attribute): New function.
11264 (bpf_attribute_table): Use it here.
11265 (bpf_builtins): Add BPF_BUILTIN_PRESERVE_ACCESS_INDEX.
11266 (bpf_option_override): Handle "-mco-re" option.
11267 (bpf_asm_init_sections): New.
11268 (TARGET_ASM_INIT_SECTIONS): Redefine.
11269 (bpf_file_end): New.
11270 (TARGET_ASM_FILE_END): Redefine.
11271 (bpf_init_builtins): Add "__builtin_preserve_access_index".
11272 (bpf_core_compute, bpf_core_get_index): New.
11273 (is_attr_preserve_access): New.
11274 (bpf_expand_builtin): Handle new builtins.
11275 (bpf_core_newdecl, bpf_core_is_maybe_aggregate_access): New.
11276 (bpf_core_walk): New.
11277 (bpf_resolve_overloaded_builtin): New.
11278 (TARGET_RESOLVE_OVERLOADED_BUILTIN): Redefine.
11279 (handle_attr): New.
11280 (pass_bpf_core_attr): New RTL pass.
11281 * config/bpf/bpf-passes.def: New file.
11282 * config/bpf/bpf-protos.h (make_pass_bpf_core_attr): New.
11283 * config/bpf/coreout.c: New file.
11284 * config/bpf/coreout.h: Likewise.
11285 * config/bpf/t-bpf (TM_H): Add $(srcdir)/config/bpf/coreout.h.
11286 (coreout.o): New rule.
11287 (PASSES_EXTRA): Add $(srcdir)/config/bpf/bpf-passes.def.
11288 * config.gcc (bpf): Add coreout.h to extra_headers.
11289 Add coreout.o to extra_objs.
11290 Add $(srcdir)/config/bpf/coreout.c to target_gtfiles.
11292 2021-09-07 David Faust <david.faust@oracle.com>
11294 * btfout.c (get_btf_id): Function is no longer static.
11295 * ctfc.h: Expose it here.
11297 2021-09-07 David Faust <david.faust@oracle.com>
11299 * ctfc.c (ctf_lookup_tree_type): New function.
11300 * ctfc.h: Likewise.
11302 2021-09-07 David Faust <david.faust@oracle.com>
11304 * ctfc.c (ctf_dtd_lookup): Function is no longer static.
11305 * ctfc.h: Analogous change.
11307 2021-09-07 David Faust <david.faust@oracle.com>
11309 * dwarf2out.c (lookup_type_die): Function is no longer static.
11310 * dwarf2out.h: Expose it here.
11312 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
11314 * dwarf2ctf.c (ctf_debug_finalize): Make it static.
11315 (ctf_debug_early_finish): New definition.
11316 (ctf_debug_finish): Likewise.
11317 * dwarf2ctf.h (ctf_debug_finalize): Remove declaration.
11318 (ctf_debug_early_finish): New declaration.
11319 (ctf_debug_finish): Likewise.
11320 * dwarf2out.c (dwarf2out_finish): Invoke ctf_debug_finish.
11321 (dwarf2out_early_finish): Invoke ctf_debug_early_finish.
11323 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
11325 * config/bpf/bpf.c (bpf_option_override): For BPF backend, disable LTO
11326 support when compiling for CO-RE.
11327 * config/bpf/bpf.opt: Add new command line option -mco-re.
11329 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
11331 * flag-types.h (enum debug_info_type): Add new enum
11332 DINFO_TYPE_BTF_WITH_CORE.
11333 (BTF_WITH_CORE_DEBUG): New bitmask.
11334 * flags.h (btf_with_core_debuginfo_p): New declaration.
11335 * opts.c (btf_with_core_debuginfo_p): New definition.
11337 2021-09-07 Jason Merrill <jason@redhat.com>
11339 * tree.h (error_operand_p): Change to inline function.
11341 2021-09-07 Aldy Hernandez <aldyh@redhat.com>
11343 * tree-ssa-threadedge.c (forwarder_block_p): Rename to...
11344 (empty_block_with_phis_p): ...this.
11345 (potentially_threadable_block): Same.
11346 (jump_threader::thread_through_normal_block): Same.
11348 2021-09-07 Eric Botcazou <ebotcazou@adacore.com>
11351 * dwarf2out.c (mark_base_types): New overloaded function.
11352 (dwarf2out_early_finish): Invoke it on the COMDAT type list as well
11353 as the compilation unit, and call move_marked_base_types afterward.
11355 2021-09-07 H.J. Lu <hjl.tools@gmail.com>
11358 * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse):
11360 (ix86_expand_vector_convert_uns_vsivsf): Likewise.
11362 2021-09-07 Richard Biener <rguenther@suse.de>
11364 PR tree-optimization/102226
11365 * tree-vect-loop.c (vect_transform_cycle_phi): Record
11366 the converted value for the epilogue PHI use.
11368 2021-09-07 Martin Liska <mliska@suse.cz>
11370 PR gcov-profile/80223
11371 * ipa-inline.c (can_inline_edge_p): Similarly to sanitizer
11372 options, do not inline when no_profile_instrument_function
11373 attributes are different in early inliner. It's fine to inline
11374 it after PGO instrumentation.
11376 2021-09-07 Richard Biener <rguenther@suse.de>
11378 PR tree-optimization/101555
11379 * tree-ssa-pre.c (translate_vuse_through_block): Do not
11380 perform an alias walk to determine the validity of the
11381 mem at the start of the block which is already guaranteed
11382 by means of prune_clobbered_mems.
11383 (phi_translate_1): Pass edge to translate_vuse_through_block.
11385 2021-09-07 Xionghu Luo <luoxhu@linux.ibm.com>
11388 * config/rs6000/rs6000.md (fmod<mode>3): New define_expand.
11389 (remainder<mode>3): Likewise.
11391 2021-09-07 YunQiang Su <yunqiang.su@cipunited.com>
11393 * config/mips/mips.c (mips_file_start): add .module for
11396 2021-09-06 Roger Sayle <roger@nextmovesoftware.com>
11398 * wide-int.cc (wi::clz): Reorder tests to ensure the result
11399 is zero for all negative values.
11401 2021-09-06 Tobias Burnus <tobias@codesourcery.com>
11403 * doc/invoke.texi (-foffload-options): Fix @opindex.
11405 2021-09-06 H.J. Lu <hjl.tools@gmail.com>
11408 * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2].
11409 * config/i386/i386.md (@xorsign<mode>3_1): Add non-destructive
11410 source alternative for AVX.
11412 2021-09-06 liuhongt <hongtao.liu@intel.com>
11414 PR middle-end/102182
11415 * optabs.c (expand_fix): Add from1 to avoid from being
11418 2021-09-06 Eric Botcazou <ebotcazou@adacore.com>
11420 * dwarf2out.c (modified_type_die): Deal with all array types earlier
11421 and use local variable consistently throughout the function.
11423 2021-09-06 Jakub Jelinek <jakub@redhat.com>
11425 PR tree-optimization/102207
11426 * match.pd: Don't demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
11427 were promoted from signed to wider unsigned type.
11429 2021-09-06 Andrew Pinski <apinski@marvell.com>
11431 PR tree-optimization/63184
11432 * match.pd: Add simplification of pointer_diff of two pointer_plus
11433 with addr_expr in the first operand of each pointer_plus.
11434 Add simplificatoin of ne/eq of two pointer_plus with addr_expr
11435 in the first operand of each pointer_plus.
11437 2021-09-06 Richard Biener <rguenther@suse.de>
11439 PR tree-optimization/102176
11440 * tree-vect-slp.c (vect_slp_gather_vectorized_scalar_stmts):
11442 (vect_bb_slp_scalar_cost): Use the computed set of
11443 vectorized scalar stmts instead of relying on the out-of-date
11444 and not accurate PURE_SLP_STMT.
11445 (vect_bb_vectorization_profitable_p): Compute the set
11446 of vectorized scalar stmts.
11448 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
11450 * gimple-range-path.cc (path_range_query::range_of_stmt): Remove
11451 GIMPLE_COND special casing.
11452 (path_range_query::range_defined_in_block): Use range_of_stmt
11453 instead of calling fold_range directly.
11455 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
11457 * gimple-range-path.cc (path_range_query::range_of_expr): Set
11458 m_undefined_path when appropriate.
11459 (path_range_query::internal_range_of_expr): Copy from range_of_expr.
11460 (path_range_query::unreachable_path_p): New.
11461 (path_range_query::precompute_ranges): Set m_undefined_path.
11462 * gimple-range-path.h (path_range_query::unreachable_path_p): New.
11463 (path_range_query::internal_range_of_expr): New.
11464 * tree-ssa-threadbackward.c (back_threader::find_taken_edge_cond):
11465 Use unreachable_path_p.
11467 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
11469 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
11470 Remove argument and call find_taken_edge.
11471 (back_threader::resolve_phi): Do not calculate taken edge before
11472 calling maybe_register_path.
11473 (back_threader::find_paths_to_names): Same.
11475 2021-09-05 Jeff Law <jlaw@localhost.localdomain>
11477 * config/h8300/h8300.md (QHSI2 mode iterator): New mode iterator.
11478 * config/h8300/testcompare.md (store_c): Update name, use new
11480 (store_neg_c, store_shifted_c): New patterns.
11482 2021-09-03 Segher Boessenkool <segher@kernel.crashing.org>
11485 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): On ELFv2 use r11
11486 instead of r12 for CR save, in all cases.
11488 2021-09-03 Andrew Pinski <apinski@marvell.com>
11490 * config/aarch64/aarch64-sve-builtins.cc (register_vector_type):
11491 Handle error_mark_node as the type of the type_decl.
11493 2021-09-03 Andrew Pinski <apinski@marvell.com>
11495 * config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
11497 (aarch64_simd_types): Likewise.
11498 (aarch64_simd_intOI_type_node): Likewise.
11499 (aarch64_simd_intCI_type_node): Likewise.
11500 (aarch64_simd_intXI_type_node): Likewise.
11501 * config/aarch64/aarch64.h (aarch64_fp16_type_node): Likewise.
11502 (aarch64_fp16_ptr_type_node): Likewise.
11503 (aarch64_bf16_type_node): Likewise.
11504 (aarch64_bf16_ptr_type_node): Likewise.
11506 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11508 * range-op.cc (operator_minus::op1_op2_relation_effect): Abstract
11510 (minus_op1_op2_relation_effect): ...here.
11511 (class operator_pointer_diff): New.
11512 (operator_pointer_diff::op1_op2_relation_effect): Call
11513 minus_op1_op2_relation_effect.
11514 (integral_table::integral_table): Add entry for POINTER_DIFF_EXPR.
11516 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11518 * tree-ssa-threadbackward.c (back_threader::thread_through_all_blocks):
11519 Add may_peel_loop_headers.
11520 (back_threader_registry::thread_through_all_blocks): Same.
11521 (try_thread_blocks): Pass may_peel_loop_headers argument.
11522 (pass_early_thread_jumps::execute): Same.
11524 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11526 * tree-ssa-threadedge.c (has_phis_p): New.
11527 (forwarder_block_p): New.
11528 (potentially_threadable_block): Call forwarder_block_p.
11529 (jump_threader::thread_around_empty_blocks): Call has_phis_p.
11530 (jump_threader::thread_through_normal_block): Call
11533 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11535 * tree-ssa-threadbackward.c (back_threader::dump): New.
11536 (back_threader::debug): New.
11537 (back_threader_profitability::profitable_path_p): Dump blocks
11538 even if we are bailing early.
11540 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11542 * tree-ssa-threadupdate.c (cancel_thread): New.
11543 (jump_thread_path_registry::thread_block_1): Use cancel_thread.
11544 (jump_thread_path_registry::mark_threaded_blocks): Same.
11545 (jump_thread_path_registry::register_jump_thread): Same.
11547 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11549 * tree-ssa-threadedge.c (jt_state::push): Only call methods for
11550 which objects are available.
11551 (jt_state::pop): Same.
11552 (jt_state::register_equiv): Same.
11553 (jt_state::register_equivs_on_edge): Same.
11555 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11557 * tree-ssa-threadedge.c (jump_threader::thread_across_edge):
11558 Move pop until after a thread is registered.
11560 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11562 * tree-ssa-threadupdate.c (debug): New.
11564 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11566 * gimple-range-trace.cc (push_dump_file::push_dump_file): New.
11567 (push_dump_file::~push_dump_file): New.
11568 (dump_ranger): Change dump_file temporarily while dumping
11570 * gimple-range-trace.h (class push_dump_file): New.
11572 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11574 * gimple-range-trace.cc (debug_seed_ranger): Remove static.
11575 (dump_ranger): Dump function name.
11577 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11579 * gimple-range-path.cc (path_range_query::range_defined_in_block):
11580 Adjust for non-null.
11581 (path_range_query::adjust_for_non_null_uses): New.
11582 (path_range_query::precompute_ranges): Call
11583 adjust_for_non_null_uses.
11584 * gimple-range-path.h: Add m_non_null and
11585 adjust_for_non_null_uses.
11587 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11589 * gimple-range-path.cc (path_range_query::dump): Dump path
11591 (path_range_query::precompute_ranges): Dump entire path.
11593 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11595 * value-relation.cc (relation_oracle::debug): New.
11596 * value-relation.h (relation_oracle::debug): New.
11598 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11600 * tree-ssa-loop-ch.c: Remove unnecessary include file.
11602 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11604 * gimple-range-fold.cc (fold_using_range::postfold_gcond_edges):
11605 Skip statements with no defining BB.
11606 * gimple-range-path.cc (path_range_query::range_defined_in_block):
11607 Do not get confused by statements with no defining BB.
11609 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
11611 * gimple-range-fold.cc (adjust_imagpart_expr): Move from
11612 gimple_range_adjustment. Add support for constants.
11613 (adjust_realpart_expr): New.
11614 (gimple_range_adjustment): Move IMAGPART_EXPR code to
11615 adjust_imagpart_expr.
11616 * range-op.cc (integral_table::integral_table): Add entry for
11619 2021-09-03 Jakub Jelinek <jakub@redhat.com>
11621 * omp-expand.c (expand_omp_atomic_pipeline): Use
11622 IFN_ATOMIC_COMPARE_EXCHANGE instead of
11623 BUILT_IN_SYNC_VAL_COMPARE_AND_SWAP_? so that memory order
11626 2021-09-03 Jakub Jelinek <jakub@redhat.com>
11629 * tree.h (DECL_FIELD_ABI_IGNORED): Changed into rvalue only macro
11630 that is false if DECL_BIT_FIELD.
11631 (SET_DECL_FIELD_ABI_IGNORED, DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD,
11632 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD): Define.
11633 * tree-streamer-out.c (pack_ts_decl_common_value_fields): For
11634 DECL_BIT_FIELD stream DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead
11635 of DECL_FIELD_ABI_IGNORED.
11636 * tree-streamer-in.c (unpack_ts_decl_common_value_fields): Use
11637 SET_DECL_FIELD_ABI_IGNORED instead of writing to
11638 DECL_FIELD_ABI_IGNORED and for DECL_BIT_FIELD use
11639 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead.
11640 * lto-streamer-out.c (hash_tree): For DECL_BIT_FIELD hash
11641 DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead of DECL_FIELD_ABI_IGNORED.
11643 2021-09-03 liuhongt <hongtao.liu@intel.com>
11646 * config/i386/amxbf16intrin.h : Remove macro check for __AMX_BF16__.
11647 * config/i386/amxint8intrin.h : Remove macro check for __AMX_INT8__.
11648 * config/i386/amxtileintrin.h : Remove macro check for __AMX_TILE__.
11650 2021-09-02 Martin Sebor <msebor@redhat.com>
11652 PR tree-optimization/17506
11654 * tree-ssa-uninit.c (warn_uninit): Remove conditional guarding note.
11656 2021-09-02 Richard Biener <rguenther@suse.de>
11658 * tree-ssa-loop-im.c (fill_always_executed_in_1): Refine
11659 fix for PR78185 and continue processing when leaving
11660 finite inner loops.
11662 2021-09-02 Jakub Jelinek <jakub@redhat.com>
11664 PR tree-optimization/99591
11665 * match.pd: Demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
11668 2021-09-02 Richard Biener <rguenther@suse.de>
11671 2021-09-02 Richard Biener <rguenther@suse.de>
11673 PR tree-optimization/102155
11674 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
11675 over a part of the RPO array and do not recurse here.
11676 Dump blocks marked as always executed.
11677 (fill_always_executed_in): Walk over the RPO array and
11678 process loops whose header we run into.
11679 (loop_invariant_motion_in_fun): Compute the first RPO
11680 using rev_post_order_and_mark_dfs_back_seme in iteration
11681 order and pass that to fill_always_executed_in.
11683 2021-09-02 liuhongt <hongtao.liu@intel.com>
11685 * config/i386/i386-modes.def (FLOAT_MODE): Define ieee HFmode.
11686 * config/i386/i386.c (enum x86_64_reg_class): Add
11687 X86_64_SSEHF_CLASS.
11688 (merge_classes): Handle X86_64_SSEHF_CLASS.
11689 (examine_argument): Ditto.
11690 (construct_container): Ditto.
11691 (classify_argument): Ditto, and set HFmode/HCmode to
11692 X86_64_SSEHF_CLASS.
11693 (function_value_32): Return _FLoat16/Complex Float16 by
11695 (function_value_64): Return _Float16/Complex Float16 by SSE
11697 (ix86_print_operand): Handle CONST_DOUBLE HFmode.
11698 (ix86_secondary_reload): Require gpr as intermediate register
11699 to store _Float16 from sse register when sse4 is not
11701 (ix86_libgcc_floating_mode_supported_p): Enable _FLoat16 under
11703 (ix86_scalar_mode_supported_p): Ditto.
11704 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Defined.
11705 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add HFmode.
11706 (VALID_INT_MODE_P): Add HFmode and HCmode.
11707 * config/i386/i386.md (*pushhf_rex64): New define_insn.
11709 (*movhf_internal): Ditto.
11710 * doc/extend.texi (Half-Precision Floating Point): Documemt
11713 2021-09-02 Richard Biener <rguenther@suse.de>
11715 PR tree-optimization/102155
11716 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
11717 over a part of the RPO array and do not recurse here.
11718 Dump blocks marked as always executed.
11719 (fill_always_executed_in): Walk over the RPO array and
11720 process loops whose header we run into.
11721 (loop_invariant_motion_in_fun): Compute the first RPO
11722 using rev_post_order_and_mark_dfs_back_seme in iteration
11723 order and pass that to fill_always_executed_in.
11725 2021-09-02 YunQiang Su <syq@debian.org>
11728 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
11730 * config/mips/mips.c (mips_module_isa_name): New.
11731 mips_file_start: add .module mipsREV to all asm output
11733 2021-09-01 Jeff Law <jlaw@localhost.localdomain>
11735 PR tree-optimization/102152
11736 * tree-ssa-dom.c (dom_opt_dom_walker::optimize_stmt): Reduce a vector
11737 comparison to a scalar comparison before calling
11738 update_stmt_if_modified.
11740 2021-09-01 Andrew Pinski <apinski@marvell.com>
11743 * config/aarch64/aarch64.c (aarch64_expand_setmem):
11744 Check STRICT_ALIGNMENT before creating an overlapping
11747 2021-09-01 Martin Sebor <msebor@redhat.com>
11749 * gimple-ssa-warn-access.cc (get_size_range): Add argument.
11750 (check_access): Pass additional argument.
11751 (check_memop_access): Remove template and make a member function.
11752 (maybe_check_dealloc_call): Make a pass_waccess member function.
11753 (class pass_waccess): Add, rename, and remove members.
11754 (pass_waccess::pass_waccess): Adjust to name change.
11755 (pass_waccess::~pass_waccess): Same.
11756 (check_alloca): Make a member function.
11757 (check_alloc_size_call): Same.
11758 (check_strcat): Same.
11759 (check_strncat): Same.
11760 (check_stxcpy): Same.
11761 (check_stxncpy): Same.
11762 (check_strncmp): Same.
11763 (maybe_warn_rdwr_sizes): Rename...
11764 (pass_waccess::maybe_check_access_sizes): ...to this.
11765 (pass_waccess::check_call): Adjust to name changes.
11766 (pass_waccess::maybe_check_dealloc_call): Make a pass_waccess member
11768 (pass_waccess::execute): Adjust to name changes.
11769 * gimple-ssa-warn-access.h (check_memop_access): Remove.
11770 * pointer-query.cc (access_ref::phi): Handle null pointer.
11771 (access_ref::inform_access): Same.
11772 (pointer_query::put_ref): Modify a cached value, not a copy of it.
11773 (pointer_query::dump): New function.
11774 (compute_objsize_r): Avoid overwriting access_ref::bndrng. Cache
11776 * pointer-query.h (pointer_query::dump): Declare.
11777 * tree-ssa-strlen.c (get_range): Simplify. Use function query.
11778 (dump_strlen_info): Use function query.
11779 (printf_strlen_execute): Factor code out into pointer_query::put_ref.
11781 2021-09-01 Thomas Schwinge <thomas@codesourcery.com>
11783 * tree.c (walk_tree_1) <OMP_CLAUSE>: Simplify.
11785 2021-09-01 Iain Sandoe <iain@sandoe.co.uk>
11787 * doc/extend.texi: Document unavailable attribute.
11788 * print-tree.c (print_node): Handle unavailable attribute.
11789 * tree-core.h (struct tree_base): Add a bit to carry unavailability.
11790 * tree.c (error_unavailable_use): New.
11791 * tree.h (TREE_UNAVAILABLE): New.
11792 (error_unavailable_use): New.
11794 2021-09-01 Jakub Jelinek <jakub@redhat.com>
11796 PR tree-optimization/102124
11797 * tree-vect-patterns.c (vect_recog_widen_op_pattern): For ORIG_CODE
11798 MINUS_EXPR, if itype is unsigned with smaller precision than type,
11799 add an extra cast to signed variant of itype to ensure sign-extension.
11801 2021-09-01 Martin Liska <mliska@suse.cz>
11803 * graph.c (draw_cfg_node_succ_edges): Do not color fallthru
11804 edges and rather use colors for TRUE and FALSE edges.
11806 2021-09-01 Richard Biener <rguenther@suse.de>
11808 PR tree-optimization/93491
11809 * tree-ssa-pre.c (compute_avail): Set BB_MAY_NOTRETURN
11810 after processing the stmt itself. Do not consider
11811 pure functions possibly not returning. Properly avoid
11812 adding possibly trapping calls to EXP_GEN when there's
11813 a preceeding possibly not returning call.
11814 * tree-ssa-sccvn.c (vn_reference_may_trap): Conservatively
11817 2021-09-01 Richard Biener <rguenther@suse.de>
11819 PR tree-optimization/102139
11820 * tree-vectorizer.h (vec_base_alignments): Adjust hash-map
11821 type to record a std::pair of the stmt-info and the innermost
11823 (dr_vec_info::group): New member.
11824 * tree-vect-data-refs.c (vect_record_base_alignment): Adjust.
11825 (vect_compute_data_ref_alignment): Verify the recorded
11826 base alignment can be used.
11827 (data_ref_pair): Remove.
11828 (dr_group_sort_cmp): Adjust.
11829 (vect_analyze_data_ref_accesses): Store the group-ID in the
11830 dr_vec_info and operate on a vector of dr_vec_infos.
11832 2021-09-01 YunQiang Su <yunqiang.su@cipunited.com>
11834 * read-md.c (md_reader::handle_enum): support value assignation.
11835 * doc/md.texi: record define_c_enum value assignation support.
11837 2021-09-01 Jakub Jelinek <jakub@redhat.com>
11839 PR tree-optimization/102141
11840 * gimple-ssa-store-merging.c (bswap_view_convert): Add BEFORE
11841 argument. If false, emit stmts after gsi instead of before, and
11843 (bswap_replace): Adjust callers. When converting output of bswap,
11844 emit VIEW_CONVERT prepratation stmts after a copy of gsi instead
11847 2021-09-01 liuhongt <hongtao.liu@intel.com>
11849 * emit-rtl.c (validate_subreg): Get rid of all float-int
11852 2021-09-01 liuhongt <hongtao.liu@intel.com>
11855 2021-08-30 liuhongt <hongtao.liu@intel.com>
11857 * expmed.c (extract_bit_field_1): Make sure we're playing with
11858 integral modes before call extract_integral_bit_field.
11859 (extract_integral_bit_field): Add a parameter of type
11860 scalar_int_mode which corresponds to of tmode.
11861 And call extract_and_convert_fixed_bit_field instead of
11862 extract_fixed_bit_field and convert_extracted_bit_field.
11863 (extract_and_convert_fixed_bit_field): New function, it's a
11864 combination of extract_fixed_bit_field and
11865 convert_extracted_bit_field.
11867 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
11869 * tree.c (walk_tree_1) <OMP_CLAUSE_TILE>: Handle three operands.
11871 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
11873 * omp-general.h (omp_is_reference): Rename to...
11874 (omp_privatize_by_reference): ... this. Adjust all users...
11875 * omp-general.c: ... here, ...
11876 * gimplify.c: ... here, ...
11877 * omp-expand.c: ... here, ...
11878 * omp-low.c: ... here.
11880 2021-08-31 Martin Sebor <msebor@redhat.com>
11882 * gimple-ssa-warn-access.cc (maybe_warn_alloc_args_overflow): Test
11883 pointer element for equality to zero, not that of the cotaining
11886 2021-08-31 Martin Sebor <msebor@redhat.com>
11888 * gcc-rich-location.h (gcc_rich_location): Make ctor explicit.
11890 2021-08-31 Martin Sebor <msebor@redhat.com>
11892 * function.h (function): Add comments.
11893 (get_range_query): Same. Add attribute returns nonnull.
11895 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
11897 * expr.c (convert_modes): Don't use subreg_promoted_mode on a
11898 SUBREG if it can't be guaranteed to a SUBREG_PROMOTED_VAR_P set.
11899 Instead use the standard (safer) is_a <scalar_int_mode> idiom.
11901 2021-08-31 Jeff Law <jlaw@localhost.localdomain>
11903 * config.gcc (cris-*-elf, cris-*-none): Remove dbxelf.h from
11905 (m32r-*-elf, m32rle-*-elf, m32r-*-linux): Likewise.
11906 (mn10300-*-*, am33_2.0-*-linux*): Likewise.
11907 (xtensa*-*-elf, xtensa*-*-linux, xtensa*-*-uclinux): Likewise.
11908 (m32c-*-elf*, m32c-*-rtems*): Likewise.
11909 * config/cris/cris.h (DBX_NO_XREFS): Remove.
11910 (DBX_CONTIN_LENGTH, DBX_CONTIN_CHAR): Likewise.
11911 * config/m32r/m32r.h (DBXOUT_SOURCE_LINE): Likewise.
11912 (DBX_DEBUGGING_INFO, DBX_CONTIN_LENGTH): Likewise.
11913 * config/mn10300/mn10300.h (DEFAULT_GDB_EXTENSIONS): Likewise.
11914 * config/mn10300/linux.h (DBX_REGISTER_NAMES): Likewise.
11916 2021-08-31 Marcel Vollweiler <marcel@codesourcery.com>
11918 * gimplify.c (gimplify_scan_omp_clauses): Error handling. 'ancestor' only
11919 allowed on target constructs and only with particular other clauses.
11920 * omp-expand.c (expand_omp_target): Output of 'sorry, not supported' if
11921 'ancestor' is used.
11922 * omp-low.c (check_omp_nesting_restrictions): Error handling. No nested OpenMP
11923 structs when 'ancestor' is used.
11924 (scan_omp_1_stmt): No usage of OpenMP runtime routines in a target region when
11925 'ancestor' is used.
11926 * tree-pretty-print.c (dump_omp_clause): Append 'ancestor'.
11927 * tree.h (OMP_CLAUSE_DEVICE_ANCESTOR): Define macro.
11929 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
11931 * expr.c (convert_modes): Preserve SUBREG_PROMOTED_VAR_P when
11932 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
11934 * simplify-rtx.c (simplify_unary_operation_1) [SIGN_EXTEND]:
11935 Likewise, preserve SUBREG_PROMOTED_VAR_P when creating a (wider)
11936 partial subreg from a SUBREG_PROMOTED_VAR_P subreg. Generate
11937 SIGN_EXTEND of the SUBREG_REG when a subreg would be paradoxical.
11938 [ZERO_EXTEND]: Likewise, preserve SUBREG_PROMOTED_VAR_P when
11939 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
11940 subreg. Generate ZERO_EXTEND of the SUBREG_REG when a subreg
11941 would be paradoxical.
11943 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
11945 * combine.c (combine_simplify_rtx): Avoid converting an explicit
11946 TRUNCATE into a lowpart SUBREG on !TRULY_NOOP_TRUNCATION targets.
11947 * simplify-rtx.c (simplify_unary_operation_1): Likewise.
11949 2021-08-31 Richard Biener <rguenther@suse.de>
11951 PR tree-optimization/102142
11952 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Fix
11953 condition under which to unset the visited flag.
11955 2021-08-31 Richard Biener <rguenther@suse.de>
11957 PR middle-end/102129
11958 * tree-ssa-ter.c (find_replaceable_in_bb): Do not move
11959 possibly trapping expressions across calls.
11961 2021-08-31 Jakub Jelinek <jakub@redhat.com>
11963 PR tree-optimization/102134
11964 * tree-ssa-ccp.c (bit_value_binop) <case RSHIFT_EXPR>: If sgn is
11965 UNSIGNED and r1val | r1mask has MSB set, ensure lzcount doesn't
11968 2021-08-31 Andrew Pinski <apinski@marvell.com>
11971 * collect-utils.c (setup_signals): New declaration.
11972 * collect-utils.h (setup_signals): New function.
11973 * collect2.c (handler): Delete.
11974 (main): Instead of manually setting up the signals,
11975 just call setup_signals.
11976 * lto-wrapper.c (main): Likewise.
11978 2021-08-31 Andrew Pinski <apinski@marvell.com>
11981 * config/i386/i386-protos.h (x86_output_aligned_bss):
11982 Change align argument to unsigned type.
11983 (x86_elf_aligned_decl_common): Likewise.
11984 * config/i386/i386.c (x86_elf_aligned_decl_common): Likewise.
11985 (x86_output_aligned_bss): Likewise.
11987 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
11989 * config/mips/mips.c (mips_module_isa_name): New.
11990 mips_file_start: add .module mipsREV to all asm output
11992 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
11994 * config/mips/mips.h (struct mips_cpu_info): define enum mips_isa;
11995 use enum instead of int for 'isa' member.
11996 * config.gcc, config/mips/mips.c, config/mips/mips-cpus.def,
11997 config/mips/netbsd.h: replace hardcoded numbers with enum.
11999 2021-08-31 liuhongt <hongtao.liu@intel.com>
12001 * config/i386/sse.md (*<avx512>_ucmp<mode>3_1): Change from
12002 define_split to define_insn_and_split.
12003 (*avx2_eq<mode>3): Removed.
12004 (<avx512>_eq<mode>3<mask_scalar_merge_name>): Adjust pattern
12005 (<avx512>_eq<mode>3<mask_scalar_merge_name>_1): Rename to ..
12006 (*<avx512>_eq<mode>3<mask_scalar_merge_name>_1): .. this, and
12008 (*avx2_gt<mode>3): Removed.
12009 (<avx512>_gt<mode>3<mask_scalar_merge_name>): Change from
12010 define_insn to define_expand, and adjust pattern.
12011 (UNSPEC_MASKED_EQ, UNSPEC_MASKED_GT): Removed.
12013 2021-08-30 David Malcolm <dmalcolm@redhat.com>
12016 * Makefile.in (ANALYZER_OBJS): Add analyzer/call-info.o.
12018 2021-08-30 Jason Merrill <jason@redhat.com>
12020 * doc/invoke.texi: Document -Wmissing-requires.
12022 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
12024 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Remove
12025 TARGET_EXTRA_BUILTINS guard.
12027 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
12029 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Change
12030 initialization of V2DI_type_node and unsigned_V2DI_type_node.
12032 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
12034 * config/rs6000/darwin.h (SUBTARGET_INIT_BUILTINS): Use the new
12035 decl when new_builtins_are_live.
12036 * config/rs6000/rs6000-builtin-new.def (__builtin_cfstring): New
12039 2021-08-30 Pat Haugen <pthaugen@linux.ibm.com>
12041 * config/rs6000/rs6000-cpus.def (ISA_3_1_MASKS_SERVER): Add
12042 OPTION_MASK_P10_FUSION_2STORE.
12043 (POWERPC_MASKS): Likewise.
12044 * config/rs6000/rs6000.c (rs6000_option_override_internal): Enable
12045 store fusion for Power10.
12046 (is_fusable_store): New.
12047 (power10_sched_reorder): Likewise.
12048 (rs6000_sched_reorder): Do Power10 specific reordering.
12049 (rs6000_sched_reorder2): Likewise.
12050 * config/rs6000/rs6000.opt: Add new option.
12052 2021-08-30 Richard Biener <rguenther@suse.de>
12054 PR tree-optimization/102128
12055 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
12056 Move scanning for if-converted scalar code to the caller
12057 and instead delay clearing the visited flag for profitable
12059 (vect_slp_region): Cost all subgraphs before scheduling.
12060 For if-converted BB vectorization scan for scalar COND_EXPRs
12061 and do not vectorize if any found and the cost model is
12064 2021-08-30 Richard Biener <rguenther@suse.de>
12066 * common.opt (fexceptions): Mark
12067 EnabledBy(fnon-call-exceptions).
12068 * doc/invoke.texi (fnon-call-exceptions): Document this
12069 enables -fexceptions.
12071 2021-08-30 Sebastian Huber <sebastian.huber@embedded-brains.de>
12073 * tsystem.h (abort): Define abort() if inhibit_libc is defined and it
12074 is not already defined.
12076 2021-08-30 liuhongt <hongtao.liu@intel.com>
12078 * expmed.c (extract_bit_field_1): Make sure we're playing with
12079 integral modes before call extract_integral_bit_field.
12080 (extract_integral_bit_field): Add a parameter of type
12081 scalar_int_mode which corresponds to of tmode.
12082 And call extract_and_convert_fixed_bit_field instead of
12083 extract_fixed_bit_field and convert_extracted_bit_field.
12084 (extract_and_convert_fixed_bit_field): New function, it's a
12085 combination of extract_fixed_bit_field and
12086 convert_extracted_bit_field.
12088 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
12090 * config/darwin.c (darwin_libc_has_function): Do not run
12091 the checks for x86 or modern Darwin. Make sure that there
12092 is a value set for darwin_macosx_version_min before testing.
12094 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
12096 * config/i386/darwin.h (CLEAR_INSN_CACHE): New.
12098 2021-08-28 Jan Hubicka <hubicka@ucw.cz>
12100 * ipa-modref-tree.h (modref_access_node::merge): Break out
12101 logic combining offsets and logic merging ranges to ...
12102 (modref_access_node::combined_offsets): ... here
12103 (modref_access_node::update2): ... here
12104 (modref_access_node::closer_pair_p): New member function.
12105 (modref_access_node::forced_merge): New member function.
12106 (modre_ref_node::insert): Do merging when table is full.
12108 2021-08-28 YunQiang Su <yunqiang.su@cipunited.com>
12111 * config.gcc: MIPS: use N64 ABI by default if the triple end
12112 with -gnuabi64, which is used by Debian since 2013.
12114 2021-08-28 Alexandre Oliva <oliva@adacore.com>
12116 * ipa-modref.c (analyze_function): Skip debug stmts.
12117 * tree-inline.c (estimate_num_insn): Consider builtins even
12118 without a cgraph_node.
12120 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
12122 * config/h8300/bitfield.md (cstore<mode>4): Remove expander.
12123 * config/h8300/h8300.c (h8300_expand_branch): Remove function.
12124 * config/h8300/h8300-protos.h (h8300_expadn_branch): Remove prototype.
12125 * config/h8300/h8300.md (eqne): New code iterator.
12126 (geultu, geultu_to_c): Similarly.
12127 * config/h8300/testcompare.md (cstore<mode>4): Dummy expander.
12128 (store_c_<mode>, store_c_i_<mode>): New define_insn_and_splits
12129 (cmp<mode>_c): New pattern
12131 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
12133 * tree-ssa-dom.c (reduce_vector_comparison_to_scalar_comparison): New
12135 (dom_opt_dom_walker::optimize_stmt): Use it.
12137 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
12139 * config/darwin.c (finalize_ctors): Add a section-start linker-
12141 (finalize_dtors): Likewise.
12142 * config/darwin.h (MIN_LD64_INIT_TERM_START_LABELS): New.
12144 2021-08-27 Bill Schmidt <wschmidt@linux.ibm.com>
12146 * config/rs6000/rs6000-call.c (rs6000-builtins.h): New #include.
12147 (rs6000_init_builtins): Call rs6000_init_generated_builtins. Skip the
12148 old initialization logic when new builtins are enabled.
12149 * config/rs6000/rs6000-gen-builtins.c (write_decls): Rename
12150 rs6000_autoinit_builtins to rs6000_init_generated_builtins.
12151 (write_init_file): Likewise.
12153 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
12155 * configure.ac (darwin2[[0-9]]* | darwin19*): Alter use of
12156 gcc_GAS_CHECK_FEATURE to remove an extraneous parameter.
12157 (amdgcn-* | gcn-*) Likewise.
12159 2021-08-27 Anthony Sharp <anthonysharp15@gmail.com>
12161 * symbol-summary.h: Added missing template keyword.
12163 2021-08-27 Richard Biener <rguenther@suse.de>
12165 PR tree-optimization/45178
12166 * tree-ssa-dce.c (find_obviously_necessary_stmts): For
12167 infinite loops without exit do not mark control dependent
12168 edges of the latch necessary.
12170 2021-08-27 konglin1 <lingling.kong@intel.com>
12173 * config/i386/sse.md: (<avx512>scattersi<mode>): Add mask operand to
12175 (<avx512>scattersi<mode>): Likewise.
12176 (*avx512f_scattersi<VI48F:mode>): Merge mask operand to set_dest.
12177 (*avx512f_scatterdi<VI48F:mode>): Likewise
12179 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
12181 * config/rs6000/rs6000.c (rs6000_builtin_md_vectorized_function): Add
12182 support for built-in functions MISC_BUILTIN_DIVWE, MISC_BUILTIN_DIVWEU,
12183 MISC_BUILTIN_DIVDE, MISC_BUILTIN_DIVDEU, P10_BUILTIN_CFUGED,
12184 P10_BUILTIN_CNTLZDM, P10_BUILTIN_CNTTZDM, P10_BUILTIN_PDEPD and
12185 P10_BUILTIN_PEXTD on Power10.
12187 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
12189 * config/rs6000/rs6000-call.c (builtin_function_type): Add unsigned
12190 signedness for some Power10 bifs.
12192 2021-08-27 David Edelsohn <dje.gcc@gmail.com>
12195 * config/rs6000/rs6000.c (rs6000_adjust_field_align): Use
12196 computed alignment if the entire struct has attribute packed.
12198 2021-08-27 liuhongt <hongtao.liu@intel.com>
12202 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
12203 IX86_BUILTIN_SHUFPD512, IX86_BUILTIN_SHUFPS512,
12204 IX86_BUILTIN_SHUFPD256, IX86_BUILTIN_SHUFPS,
12205 IX86_BUILTIN_SHUFPS256.
12206 (ix86_masked_all_ones): New function.
12208 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
12210 * config/i386/i386.md (*btr<mode>_1): Call force_reg unconditionally.
12211 (conditional moves with memory inputs splitters): Ditto.
12212 * config/i386/sse.md (one_cmpl<mode>2): Simplify.
12214 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
12216 * ipa-modref-tree.h (modref_access_node::try_merge_with): Restart
12217 search after merging.
12219 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
12221 * config/rs6000/rs6000-overload.def: Add remaining overloads.
12223 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
12225 * config/rs6000/rs6000-builtin-new.def: Add cell stanza.
12227 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
12229 * config/rs6000/rs6000-builtin-new.def: Add ieee128-hw, dfp,
12230 crypto, and htm stanzas.
12232 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
12234 * config/rs6000/rs6000-builtin-new.def: Add mma stanza.
12236 2021-08-26 Martin Sebor <msebor@redhat.com>
12238 * tree-ssa-uninit.c (warn_uninit): Refactor and simplify.
12239 (warn_uninit_phi_uses): Remove argument from calls to warn_uninit.
12240 (warn_uninitialized_vars): Same. Reduce visibility of locals.
12241 (warn_uninitialized_phi): Same.
12243 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
12245 * tree-ssa-ccp.c (get_individual_bits): Helper function to
12246 extract the individual bits from a widest_int constant (mask).
12247 (gray_code_bit_flips): New read-only table for effiently
12248 enumerating permutations/combinations of bits.
12249 (bit_value_binop) [LROTATE_EXPR, RROTATE_EXPR]: Handle rotates
12250 by unknown counts that are guaranteed less than the target
12251 precision and four or fewer unknown bits by enumeration.
12252 [LSHIFT_EXPR, RSHIFT_EXPR]: Likewise, also handle shifts by
12253 enumeration under the same conditions. Handle remaining
12254 shifts as a mask based upon the minimum possible shift value.
12256 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
12257 Richard Biener <rguenther@suse.de>
12259 * match.pd (shift transformations): Remove a redundant
12260 !POINTER_TYPE_P check.
12262 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
12265 * config/i386/i386.md (cmove reg-reg move elimination peephole2s):
12266 Set all_regs to true in the call to replace_rtx.
12268 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
12270 * ipa-modref-tree.c (test_insert_search_collapse): Update test.
12271 * ipa-modref-tree.h (modref_base_node::insert): Be smarter when
12272 hiting --param modref-max-refs limit.
12273 (modref_tree:insert_base): Be smarter when hitting
12274 --param modref-max-bases limit. Add new parameter REF.
12275 (modref_tree:insert): Update.
12276 (modref_tree:merge): Update.
12277 * ipa-modref.c (read_modref_records): Update.
12279 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
12281 * params.opt: (modref-max-adjustments): Add full stop.
12283 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
12285 * ipa-modref-tree.h (modref_ref_node::verify): New member
12287 (modref_ref_node::insert): Use it.
12288 (modref_ref_node::try_mere_with): Fix off by one error.
12290 2021-08-26 Martin Liska <mliska@suse.cz>
12291 Stefan Kneifel <stefan.kneifel@bluewin.ch>
12293 * cgraph.h (create_version_clone_with_body): Add new parameter.
12294 * cgraphclones.c: Likewise.
12295 * multiple_target.c (create_dispatcher_calls): Do not use
12297 (create_target_clone): Likewise here.
12299 2021-08-26 Jonathan Yong <10walls@gmail.com>
12301 * doc/extend.texi: Add note about reserved priorities
12302 to the constructor attribute.
12304 2021-08-25 Martin Sebor <msebor@redhat.com>
12306 * gimple-range-cache.cc (ssa_global_cache::dump): Avoid printing
12307 range table header alone.
12308 * gimple-range.cc (gimple_ranger::export_global_ranges): Same.
12310 2021-08-25 Jan Hubicka <hubicka@ucw.cz>
12312 * doc/invoke.texi: Document --param modref-max-adjustments.
12313 * ipa-modref-tree.c (test_insert_search_collapse): Update.
12314 (test_merge): Update.
12315 * ipa-modref-tree.h (struct modref_access_node): Add adjustments;
12316 (modref_access_node::operator==): Fix handling of access ranges.
12317 (modref_access_node::contains): Constify parameter; handle also
12318 mismatched parm offsets.
12319 (modref_access_node::update): New function.
12320 (modref_access_node::merge): New function.
12321 (unspecified_modref_access_node): Update constructor.
12322 (modref_ref_node::insert_access): Add record_adjustments parameter;
12324 (modref_ref_node::try_merge_with): New private function.
12325 (modref_tree::insert): New record_adjustments parameter.
12326 (modref_tree::merge): New record_adjustments parameter.
12327 (modref_tree::copy_from): Update.
12328 * ipa-modref.c (dump_access): Dump adjustments field.
12329 (get_access): Update constructor.
12330 (record_access): Update call of insert.
12331 (record_access_lto): Update call of insert.
12332 (merge_call_side_effects): Add record_adjustments parameter.
12333 (get_access_for_fnspec): Update.
12334 (process_fnspec): Update.
12335 (analyze_call): Update.
12336 (analyze_function): Update.
12337 (read_modref_records): Update.
12338 (ipa_merge_modref_summary_after_inlining): Update.
12339 (propagate_unknown_call): Update.
12340 (modref_propagate_in_scc): Update.
12341 * params.opt (param-max-modref-adjustments=): New.
12343 2021-08-25 Michael Meissner <meissner@linux.ibm.com>
12345 * config/rs6000/vsx.md (UNSPEC_XXSPLTIDP): Rename from
12347 (xxspltiw_v4si): Use vecperm type attribute.
12348 (xxspltiw_v4si_inst): Use vecperm type attribute.
12349 (xxspltiw_v4sf_inst): Likewise.
12350 (xxspltidp_v2df): Use vecperm type attribute. Use
12351 UNSPEC_XXSPLTIDP instead of UNSPEC_XXSPLTID.
12352 (xxspltidp_v2df_inst): Likewise.
12353 (xxsplti32dx_v4si): Use vecperm type attribute.
12354 (xxsplti32dx_v4si_inst): Likewise.
12355 (xxsplti32dx_v4sf_inst): Likewise.
12356 (xxblend_<mode>): Likewise.
12357 (xxpermx): Likewise.
12358 (xxpermx_inst): Likewise.
12359 (xxeval): Likewise.
12361 2021-08-25 Lewis Hyatt <lhyatt@gmail.com>
12364 * coretypes.h (typedef diagnostic_input_charset_callback): Declare.
12365 * diagnostic.c (diagnostic_initialize_input_context): New function.
12366 * diagnostic.h (diagnostic_initialize_input_context): Declare.
12367 * input.c (default_charset_callback): New function.
12368 (file_cache::initialize_input_context): New function.
12369 (file_cache_slot::create): Added ability to convert the input
12370 according to the input context.
12371 (file_cache::file_cache): Initialize the new input context.
12372 (class file_cache_slot): Added new m_alloc_offset member.
12373 (file_cache_slot::file_cache_slot): Initialize the new member.
12374 (file_cache_slot::~file_cache_slot): Handle potentially offset buffer.
12375 (file_cache_slot::maybe_grow): Likewise.
12376 (file_cache_slot::needs_read_p): Handle NULL fp, which is now possible.
12377 (file_cache_slot::get_next_line): Likewise.
12378 * input.h (class file_cache): Added input context member.
12380 2021-08-25 Richard Biener <rguenther@suse.de>
12382 PR tree-optimization/102046
12383 * tree-vect-slp.c (vect_build_slp_tree_2): Conservatively
12384 update ->any_pattern when swapping operands.
12386 2021-08-25 Hongyu Wang <hongyu.wang@intel.com>
12389 * config/i386/i386.c (ix86_live_on_entry): Adjust comment.
12390 (ix86_decompose_address): Remove retval check for ASHIFT,
12391 allow non-canonical zero extend if AND mask covers ASHIFT
12393 (ix86_legitimate_address_p): Adjust condition for decompose.
12394 (ix86_rtx_costs): Adjust cost for lea with non-canonical
12396 Co-Authored by: Uros Bizjak <ubizjak@gmail.com>
12398 2021-08-25 Jiufu Guo <guojiufu@linux.ibm.com>
12400 PR tree-optimization/101145
12401 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
12403 (number_of_iterations_lt): Invoke above function.
12404 (adjust_cond_for_loop_until_wrap):
12405 Merge to number_of_iterations_until_wrap.
12406 (number_of_iterations_cond): Update invokes for
12407 adjust_cond_for_loop_until_wrap and number_of_iterations_lt.
12409 2021-08-25 konglin1 <lingling.kong@intel.com>
12412 * config/i386/avx512dqintrin.h (_mm512_fpclass_ps_mask): Fix
12413 macro define in O0.
12414 (_mm512_mask_fpclass_ps_mask): Ditto.
12416 2021-08-25 Kewen Lin <linkw@linux.ibm.com>
12418 * config/rs6000/altivec.md (vec_unpacku_hi_v16qi): Remove.
12419 (vec_unpacku_hi_v8hi): Likewise.
12420 (vec_unpacku_lo_v16qi): Likewise.
12421 (vec_unpacku_lo_v8hi): Likewise.
12422 (vec_unpacku_hi_<VP_small_lc>): New define_expand.
12423 (vec_unpacku_lo_<VP_small_lc>): Likewise.
12425 2021-08-24 David Edelsohn <dje.gcc@gmail.com>
12427 * config/rs6000/aix.h (SYSTEM_IMPLICIT_EXTERN_C): Delete.
12428 * config/rs6000/aix71.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
12429 * config/rs6000/aix72.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
12430 * config/rs6000/aix73.h (TARGET_AIX_VERSION): Increase to 73.
12432 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
12434 PR middle-end/102031
12435 * simplify-rtx.c (simplify_truncation): When comparing precisions
12436 use "subreg_prec" variable, not "subreg_mode".
12438 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
12440 * config/rs6000/rs6000-builtin-new.def: Add power10 and power10-64
12443 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
12445 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Initialize
12446 various pointer type nodes.
12447 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add enum
12448 values for various pointer types.
12449 (ptr_V16QI_type_node): New macro.
12450 (ptr_V1TI_type_node): New macro.
12451 (ptr_V2DI_type_node): New macro.
12452 (ptr_V2DF_type_node): New macro.
12453 (ptr_V4SI_type_node): New macro.
12454 (ptr_V4SF_type_node): New macro.
12455 (ptr_V8HI_type_node): New macro.
12456 (ptr_unsigned_V16QI_type_node): New macro.
12457 (ptr_unsigned_V1TI_type_node): New macro.
12458 (ptr_unsigned_V8HI_type_node): New macro.
12459 (ptr_unsigned_V4SI_type_node): New macro.
12460 (ptr_unsigned_V2DI_type_node): New macro.
12461 (ptr_bool_V16QI_type_node): New macro.
12462 (ptr_bool_V8HI_type_node): New macro.
12463 (ptr_bool_V4SI_type_node): New macro.
12464 (ptr_bool_V2DI_type_node): New macro.
12465 (ptr_bool_V1TI_type_node): New macro.
12466 (ptr_pixel_type_node): New macro.
12467 (ptr_intQI_type_node): New macro.
12468 (ptr_uintQI_type_node): New macro.
12469 (ptr_intHI_type_node): New macro.
12470 (ptr_uintHI_type_node): New macro.
12471 (ptr_intSI_type_node): New macro.
12472 (ptr_uintSI_type_node): New macro.
12473 (ptr_intDI_type_node): New macro.
12474 (ptr_uintDI_type_node): New macro.
12475 (ptr_intTI_type_node): New macro.
12476 (ptr_uintTI_type_node): New macro.
12477 (ptr_long_integer_type_node): New macro.
12478 (ptr_long_unsigned_type_node): New macro.
12479 (ptr_float_type_node): New macro.
12480 (ptr_double_type_node): New macro.
12481 (ptr_long_double_type_node): New macro.
12482 (ptr_dfloat64_type_node): New macro.
12483 (ptr_dfloat128_type_node): New macro.
12484 (ptr_ieee128_type_node): New macro.
12485 (ptr_ibm128_type_node): New macro.
12486 (ptr_vector_pair_type_node): New macro.
12487 (ptr_vector_quad_type_node): New macro.
12488 (ptr_long_long_integer_type_node): New macro.
12489 (ptr_long_long_unsigned_type_node): New macro.
12491 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
12493 * config/rs6000/rs6000-builtin-new.def: Add power9-vector, power9,
12494 and power9-64 stanzas.
12496 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
12497 Tom de Vries <tdevries@suse.de>
12499 * config.gcc (nvptx-*-*): Define {c,c++}_target_objs.
12500 * config/nvptx/nvptx-protos.h (nvptx_cpu_cpp_builtins): Prototype.
12501 * config/nvptx/nvptx.h (TARGET_CPU_CPP_BUILTINS): Implement with
12502 a call to the new nvptx_cpu_cpp_builtins function in nvptx-c.c.
12503 * config/nvptx/t-nvptx (nvptx-c.o): New rule.
12504 * config/nvptx/nvptx-c.c: New source file.
12505 (nvptx_cpu_cpp_builtins): Move implementation here.
12507 2021-08-24 Martin Sebor <msebor@redhat.com>
12509 PR middle-end/101600
12510 PR middle-end/101977
12511 * gimple-ssa-warn-access.cc (maybe_warn_for_bound): Tighten up
12512 the phrasing of a warning.
12513 (check_access): Use the remaining size after subtracting any offset
12514 rather than the whole object size.
12515 * pointer-query.cc (access_ref::get_ref): Clear BASE0 flag if it's
12516 clear for any nonnull PHI argument.
12517 (compute_objsize): Clear argument.
12519 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
12521 * config/rs6000/rs6000-builtin-new.def: Add power8-vector stanza.
12523 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
12525 * config/rs6000/rs6000-builtin-new.def: Add power7 and power7-64
12528 2021-08-24 Andrew MacLeod <amacleod@redhat.com>
12530 * value-relation.cc (rr_transitive_table): New.
12531 (relation_transitive): New.
12532 (value_relation::swap): Remove.
12533 (value_relation::apply_transitive): New.
12534 (relation_oracle::relation_oracle): Allocate a new tmp bitmap.
12535 (relation_oracle::register_relation): Call register_transitives.
12536 (relation_oracle::register_transitives): New.
12537 * value-relation.h (relation_oracle): Add new temporary bitmap and
12540 2021-08-24 H.J. Lu <hjl.tools@gmail.com>
12543 * config/i386/i386-expand.c (ix86_expand_vector_move): Broadcast
12544 from integer to a pseudo vector register.
12546 2021-08-24 Richard Biener <rguenther@suse.de>
12548 PR tree-optimization/100089
12549 * tree-vectorizer.h (vect_slp_bb): Rename to ...
12550 (vect_slp_if_converted_bb): ... this and get the original
12551 loop as new argument.
12552 * tree-vectorizer.c (try_vectorize_loop_1): Revert previous fix,
12553 pass original loop to vect_slp_if_converted_bb.
12554 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
12555 If orig_loop was passed scan the not vectorized stmts
12556 for COND_EXPRs and force not profitable if found.
12557 (vect_slp_region): Pass down all SLP instances to costing
12558 if orig_loop was specified.
12559 (vect_slp_bbs): Pass through orig_loop.
12560 (vect_slp_bb): Rename to ...
12561 (vect_slp_if_converted_bb): ... this and get the original
12562 loop as new argument.
12563 (vect_slp_function): Adjust.
12565 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
12568 * config/arm/arm.md (attribute arch): Add fix_vlldm.
12569 (arch_enabled): Use it.
12570 * config/arm/vfp.md (lazy_store_multiple_insn): Add alternative to
12571 use when erratum mitigation is needed.
12573 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
12576 * config/arm/arm.opt (mfix-cmse-cve-2021-35465): New option.
12577 * doc/invoke.texi (Arm Options): Document it.
12578 * config/arm/arm-cpus.in (quirk_vlldm): New feature bit.
12579 (ALL_QUIRKS): Add quirk_vlldm.
12580 (cortex-m33): Add quirk_vlldm.
12581 (cortex-m35p, cortex-m55): Likewise.
12582 * config/arm/arm.c (arm_option_override): Enable fix_vlldm if
12583 targetting an affected CPU and not explicitly controlled on
12586 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
12588 * config/arm/vfp.md (lazy_store_multiple_insn): Rewrite as valid RTL.
12589 (lazy_load_multiple_insn): Likewise.
12591 2021-08-24 liuhongt <hongtao.liu@intel.com>
12594 * config/i386/sse.md (<avx512>_vternlog<mode><sd_maskz_name>):
12595 Enable avx512 embedded broadcast.
12596 (*<avx512>_vternlog<mode>_all): Ditto.
12597 (<avx512>_vternlog<mode>_mask): Ditto.
12599 2021-08-24 liuhongt <hongtao.liu@intel.com>
12602 * config/i386/i386.c (ix86_rtx_costs): Define cost for
12604 * config/i386/i386.h (STRIP_UNARY): New macro.
12605 * config/i386/predicates.md (reg_or_notreg_operand): New
12607 * config/i386/sse.md (*<avx512>_vternlog<mode>_all): New define_insn.
12608 (*<avx512>_vternlog<mode>_1): New pre_reload
12609 define_insn_and_split.
12610 (*<avx512>_vternlog<mode>_2): Ditto.
12611 (*<avx512>_vternlog<mode>_3): Ditto.
12612 (any_logic1,any_logic2): New code iterator.
12613 (logic_op): New code attribute.
12614 (ternlogsuffix): Extend to VNxDF and VNxSF.
12616 2021-08-24 Richard Biener <rguenther@suse.de>
12618 * doc/invoke.texi (vect-inner-loop-cost-factor): Adjust.
12619 * params.opt (--param vect-inner-loop-cost-factor): Adjust
12621 * tree-vect-loop.c (vect_analyze_loop_form): Initialize
12622 inner_loop_cost_factor to the minimum of the estimated number
12623 of iterations of the inner loop and vect-inner-loop-cost-factor.
12625 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
12626 Richard Biener <rguenther@suse.de>
12628 * config/i386/i386-features.c (compute_convert_gain): Provide
12629 more accurate values for CONST_INT, when optimizing for size.
12630 * config/i386/i386.c (COSTS_N_BYTES): Move definition from here...
12631 * config/i386/i386.h (COSTS_N_BYTES): to here.
12633 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
12634 Jakub Jelinek <jakub@redhat.com>
12636 PR middle-end/102029
12637 * match.pd (shift transformations): Add an additional check for
12638 !POINTER_TYPE_P in the recently added left shift transformation.
12640 2021-08-24 liuhongt <hongtao.liu@intel.com>
12642 PR tree-optimization/100089
12643 * tree-vectorizer.c (try_vectorize_loop_1): Disable slp in
12644 loop vectorizer when cost model is very-cheap.
12646 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
12648 * config/rs6000/rs6000-gen-builtins.c (parse_bif_entry): Don't call
12649 asprintf, which is not available on AIX.
12651 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
12653 * config.gcc (target_gtfiles): Add ./rs6000-builtins.h.
12654 * config/rs6000/t-rs6000 (EXTRA_GTYPE_DEPS): Set.
12656 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
12658 * config.gcc (powerpc*-*-*): Add rs6000-builtins.o to extra_objs.
12659 * config/rs6000/rs6000-gen-builtins.c (main): Close init_file
12661 * config/rs6000/t-rs6000 (rs6000-gen-builtins.o): New target.
12662 (rbtree.o): Likewise.
12663 (rs6000-gen-builtins): Likewise.
12664 (rs6000-builtins.c): Likewise.
12665 (rs6000-builtins.h): Likewise.
12666 (rs6000.o): Add dependency.
12667 (EXTRA_HEADERS): Add rs6000-vecdefines.h.
12668 (rs6000-vecdefines.h): New target.
12669 (rs6000-builtins.o): Likewise.
12670 (rs6000-call.o): Add rs6000-builtins.h as a dependency.
12671 (rs6000-c.o): Likewise.
12673 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
12676 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace):
12677 Diagnose buffer overrun.
12678 (safe_inc_pos): Fix overrun detection.
12679 (match_identifier): Diagnose buffer overrun.
12680 (match_integer): Likewise.
12681 (match_to_right_bracket): Likewise.
12683 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
12685 * ipa-modref-tree.h (modref_access_node::range_info_useful_p):
12686 Improve range compare.
12687 (modref_access_node::contains): New member function.
12688 (modref_access_node::search): Remove.
12689 (modref_access_node::insert): Be smarter about subaccesses.
12691 2021-08-23 Thomas Schwinge <thomas@codesourcery.com>
12693 * config/i386/i386-options.c (ix86_omp_device_kind_arch_isa)
12694 <omp_device_arch> [ACCEL_COMPILER]: Match "intel_mic".
12695 * config/i386/t-omp-device (omp-device-properties-i386) <arch>:
12698 2021-08-23 Jeff Law <jlaw@localhost.localdomain>
12700 * config/h8300/h8300-protos.h (h8300_expand_epilogue): Add new
12702 * config/h8300/jumpcall.md (call, call_value): Restrict to
12703 !SIBLING_CALL_P cases.
12704 (subcall, sibcall_value): New patterns & expanders.
12705 * config/h8300/proepi.md (epilogue): Pass new argument to
12706 h8300_expand_epilogue.
12707 (sibcall_epilogue): New expander.
12708 * config/h8300/h8300.c (h8300_expand_epilogue): Handle sibcall
12710 (h8300_ok_for_sibcall_p): New function.
12711 (TARGET_FUNCTION_OK_FOR_SIBCALL): define.
12713 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
12715 * simplify-rtx.c (simplify_unary_operation_1): [TRUNCATE]:
12716 Handle case where the operand is already the desired mode.
12718 2021-08-23 Richard Biener <rguenther@suse.de>
12721 * tree-ssa-structalias.c (ipa_pta_execute): Check in_other_partition
12722 in addition to has_gimple_body.
12724 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
12726 PR middle-end/101949
12727 * ipa-modref.c (analyze_ssa_name_flags): Fix merging of
12730 2021-08-23 Martin Liska <mliska@suse.cz>
12732 * doc/invoke.texi: Put the option out of -mxl-mode-app-model
12735 2021-08-23 Richard Biener <rguenther@suse.de>
12737 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
12738 Properly scale the inner loop cost only once.
12740 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
12742 * tree-ssa-ccp.c (bit_value_binop) [TRUNC_MOD_EXPR, TRUNC_DIV_EXPR]:
12743 Provide bounds for unsigned (and signed with non-negative operands)
12744 division and modulus.
12746 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
12748 * simplify-rtx.c (simplify_truncation): Generalize simplification
12749 of (truncate:A (subreg:B X)).
12750 (simplify_unary_operation_1) [FLOAT_TRUNCATE, FLOAT_EXTEND,
12751 SIGN_EXTEND, ZERO_EXTEND]: Handle cases where the operand
12752 already has the desired machine mode.
12753 (test_scalar_int_ops): Add tests that useless extensions and
12754 truncations are optimized away.
12755 (test_scalar_int_ext_ops): New self-test function to confirm
12756 that truncations of extensions are correctly simplified.
12757 (test_scalar_int_ext_ops2): New self-test function to check
12758 truncations of truncations, extensions of extensions, and
12759 truncations of extensions.
12760 (test_scalar_ops): Call the above two functions with a
12761 representative sampling of integer machine modes.
12763 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
12765 * match.pd (shift transformations): Change the sign of an
12766 LSHIFT_EXPR if it reduces the number of explicit conversions.
12768 2021-08-23 Jakub Jelinek <jakub@redhat.com>
12770 PR tree-optimization/86723
12771 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Add
12772 cast64_to_32 argument, set *cast64_to_32 to false, unless n is
12773 non-memory permutation of 64-bit src which only has bytes of
12774 0 or [5..8] and n->range is 4.
12775 (find_bswap_or_nop): Add cast64_to_32 and mask arguments, adjust
12776 find_bswap_or_nop_finalize caller, support bswap with some bytes
12777 zeroed, as long as at least two bytes are not zeroed.
12778 (bswap_replace): Add mask argument and handle masking of bswap
12780 (maybe_optimize_vector_constructor): Adjust find_bswap_or_nop
12781 caller, punt if cast64_to_32 or mask is not all ones.
12782 (pass_optimize_bswap::execute): Adjust find_bswap_or_nop_finalize
12783 caller, for now punt if cast64_to_32.
12785 2021-08-23 Richard Biener <rguenther@suse.de>
12787 PR tree-optimization/79334
12788 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Record
12789 a type also for COMPONENT_REFs.
12790 (vn_reference_may_trap): Check ARRAY_REF with constant index
12791 against the array domain.
12793 2021-08-23 liuhongt <hongtao.liu@intel.com>
12796 * config/i386/sse.md (*avx512f_pshufb_truncv8hiv8qi_1): Add
12797 TARGET_AVX512BW to condition.
12799 2021-08-23 Jakub Jelinek <jakub@redhat.com>
12802 * dwarf2out.c (gen_variable_die): Add DW_AT_location for global
12803 register variables already during early_dwarf if possible.
12805 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
12807 * config/arm/arm_mve.h: Fix __arm_vctp16q return type.
12809 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
12812 * config/arm/arm.opt: Fix typo.
12813 * config/arm/t-rmprofile: Fix typo.
12815 2021-08-23 Jakub Jelinek <jakub@redhat.com>
12817 * tree.h (OMP_CLAUSE_GRAINSIZE_STRICT): Define.
12818 (OMP_CLAUSE_NUM_TASKS_STRICT): Define.
12819 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_GRAINSIZE,
12820 case OMP_CLAUSE_NUM_TASKS>: Print strict: modifier.
12821 * omp-expand.c (expand_task_call): Use GOMP_TASK_FLAG_STRICT in iflags
12822 if either grainsize or num_tasks clause has the strict modifier.
12824 2021-08-23 Martin Liska <mliska@suse.cz>
12826 * dbgcnt.def (DEBUG_COUNTER): New counter.
12827 * gimple.c (gimple_call_arg_flags): Use it in IPA PTA.
12829 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
12831 * ipa-modref.c (analyze_ssa_name_flags): Improve handling of return slot.
12833 2021-08-23 Xi Ruoyao <xry111@mengyan1223.wang>
12836 * config/mips/mips-protos.h (mips_msa_output_shift_immediate):
12838 * config/mips/mips.c (mips_msa_output_shift_immediate): New
12840 * config/mips/mips-msa.md (vashl<mode>3, vashr<mode>3,
12841 vlshr<mode>3): Call it.
12843 2021-08-22 Jan Hubicka <hubicka@ucw.cz>
12844 Martin Liska <mliska@suse.cz>
12846 PR middle-end/101949
12847 * ipa-modref.c (analyze_ssa_name_flags): Indirect call implies
12850 2021-08-21 Dragan Mladjenovic <OT_Dragan.Mladjenovic@mediatek.com>
12852 * config/mips/mips.c (mips_function_rodata_section,
12853 TARGET_ASM_FUNCTION_RODATA_SECTION): Removed.
12855 2021-08-21 John David Anglin <danglin@gcc.gnu.org>
12857 * config/pa/pa.c (pa_asm_output_aligned_common): Remove warning.
12859 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
12861 * configure.ac (thread-local storage support): Remove tls_first_major
12862 and tls_first_minor. Use "$conftest_s" to check support.
12863 * configure: Regenerate.
12865 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
12867 * configure.ac: Fixup formatting.
12869 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
12871 * acinclude.m4 (gcc_GAS_CHECK_FEATURE): Remove third argument and ...
12872 * configure.ac: ... update all callers.
12874 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
12877 * acinclude.m4 (_gcc_COMPUTE_GAS_VERSION, _gcc_GAS_VERSION_GTE_IFELSE)
12878 (gcc_GAS_VERSION_GTE_IFELSE): Remove.
12879 (gcc_GAS_CHECK_FEATURE): Do not handle in-tree case specially.
12880 * configure.ac: Remove gcc_cv_gas_major_version, gcc_cv_gas_minor_version.
12881 Remove remaining checks for in-tree assembler.
12882 * configure: Regenerate.
12884 2021-08-20 Jeff Law <jlaw@localhost.localdomain>
12886 * config/h8300/h8300.c (shift_alg_hi): Improve arithmetic shift right
12887 by 15 bits for H8/300H and H8/S. Improve logical shifts by 12
12889 (shift_alg_si): Improve arithmetic right shift by 28-30 bits for
12890 H8/300H. Improve arithmetic shift right by 15 bits for H8/S.
12891 Improve logical shifts by 27 bits for H8/S.
12892 (get_shift_alg): Corresponding changes.
12893 (h8300_option_override): Revert to loops for -Os when profitable.
12895 2021-08-20 Richard Biener <rguenther@suse.de>
12897 * tree-vect-data-refs.c (dr_group_sort_cmp): Do not compare
12899 (vect_analyze_data_ref_accesses): Likewise. Assign the BB
12900 index as group_id when dataref_groups were not computed.
12901 * tree-vect-slp.c (vect_slp_bbs): Bump current_group when
12902 we advace to the next BB.
12904 2021-08-20 Jakub Jelinek <jakub@redhat.com>
12906 * omp-builtins.def (BUILT_IN_GOMP_WARNING, BUILT_IN_GOMP_ERROR): New
12909 2021-08-20 Martin Liska <mliska@suse.cz>
12911 PR gcov-profile/89961
12912 * gcov.c (make_gcov_file_name): Rewrite using std::string.
12913 (mangle_name): Simplify, do not used the second argument.
12914 (strip_extention): New function.
12915 (get_md5sum): Likewise.
12916 (get_gcov_intermediate_filename): Handle properly -p and -x
12918 (output_gcov_file): Use string type.
12919 (generate_results): Likewise.
12920 (md5sum_to_hex): Remove.
12922 2021-08-20 Michael Meissner <meissner@linux.ibm.com>
12924 * config/rs6000/altivec.md (UNSPEC_XXEVAL): Move to vsx.md.
12925 (UNSPEC_XXSPLTIW): Move to vsx.md.
12926 (UNSPEC_XXSPLTID): Move to vsx.md.
12927 (UNSPEC_XXSPLTI32DX): Move to vsx.md.
12928 (UNSPEC_XXBLEND): Move to vsx.md.
12929 (UNSPEC_XXPERMX): Move to vsx.md.
12930 (VM3): Move to vsx.md.
12931 (VM3_char): Move to vsx.md.
12932 (xxspltiw_v4si): Move to vsx.md.
12933 (xxspltiw_v4sf): Move to vsx.md.
12934 (xxspltiw_v4sf_inst): Move to vsx.md.
12935 (xxspltidp_v2df): Move to vsx.md.
12936 (xxspltidp_v2df_inst): Move to vsx.md.
12937 (xxsplti32dx_v4si_inst): Move to vsx.md.
12938 (xxsplti32dx_v4sf): Move to vsx.md.
12939 (xxsplti32dx_v4sf_inst): Move to vsx.md.
12940 (xxblend_<mode>): Move to vsx.md.
12941 (xxpermx): Move to vsx.md.
12942 (xxpermx_inst): Move to vsx.md.
12943 * config/rs6000/vsx.md (UNSPEC_XXEVAL): Move from altivec.md.
12944 (UNSPEC_XXSPLTIW): Move from altivec.md.
12945 (UNSPEC_XXSPLTID): Move from altivec.md.
12946 (UNSPEC_XXSPLTI32DX): Move from altivec.md.
12947 (UNSPEC_XXBLEND): Move from altivec.md.
12948 (UNSPEC_XXPERMX): Move from altivec.md.
12949 (VM3): Move from altivec.md.
12950 (VM3_char): Move from altivec.md.
12951 (xxspltiw_v4si): Move from altivec.md.
12952 (xxspltiw_v4sf): Move from altivec.md.
12953 (xxspltiw_v4sf_inst): Move from altivec.md.
12954 (xxspltidp_v2df): Move from altivec.md.
12955 (xxspltidp_v2df_inst): Move from altivec.md.
12956 (xxsplti32dx_v4si_inst): Move from altivec.md.
12957 (xxsplti32dx_v4sf): Move from altivec.md.
12958 (xxsplti32dx_v4sf_inst): Move from altivec.md.
12959 (xxblend_<mode>): Move from altivec.md.
12960 (xxpermx): Move from altivec.md.
12961 (xxpermx_inst): Move from altivec.md.
12963 2021-08-19 Roger Sayle <roger@nextmovesoftware.com>
12965 * tree-vect-generic.c (expand_vector_operations_1): Use either
12966 gimplify_build1 or gimplify_build2 instead of gimple_build_assign
12967 when constructing scalar splat expressions.
12969 2021-08-19 Peter Bergner <bergner@linux.ibm.com>
12972 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Cast
12973 pointer to __vector_pair *.
12975 2021-08-19 Martin Sebor <msebor@redhat.com>
12977 * gimple-range.cc: Add comments.
12978 * gimple-range.h: Same.
12980 2021-08-19 Martin Sebor <msebor@redhat.com>
12982 PR middle-end/101984
12983 * gimple-ssa-warn-access.cc (pass_waccess::execute): Also call
12986 2021-08-19 Jeff Law <jlaw@localhost.localdomain>
12988 * config.gcc (h8300-*-elf*): Do not include dbxelf.h.
12989 (h8300-*-linux*, v850-*-rtems*, v850*-elf*): Likewise.
12990 * config/v850/v850.h (DEFAULT_GDB_EXTENSIONS): Remove.
12992 2021-08-19 Jakub Jelinek <jakub@redhat.com>
12994 PR middle-end/101950
12995 * optabs.c (expand_clrsb_using_clz): New function.
12996 (expand_unop): Use it as another clrsb expansion fallback.
12998 2021-08-19 liuhongt <hongtao.liu@intel.com>
13001 2021-07-28 liuhongt <hongtao.liu@intel.com>
13004 * config/i386/i386.h (processor_costs): Add new member
13006 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
13007 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
13008 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
13009 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
13010 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
13011 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
13012 generic_cost, core_cost): Initialize integer_to_sse same value
13014 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
13015 * config/i386/i386.c (ix86_builtin_vectorization_cost):
13016 Use integer_to_sse instead of sse_op to calculate the cost of
13019 2021-08-18 Iain Sandoe <iain@sandoe.co.uk>
13021 * config.gcc: Include rpath.opt for Darwin.
13022 * config/darwin.h (DRIVER_SELF_SPECS): Handle -rpath.
13024 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
13026 PR bootstrap/101959
13027 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
13030 2021-08-18 Jonathan Wright <jonathan.wright@arm.com>
13032 * config/aarch64/arm_neon.h (vld3_lane_f64): Use float RTL
13033 pattern and type cast.
13034 (vld4_lane_f32): Use float RTL pattern.
13035 (vld4q_lane_f64): Use float type cast.
13037 2021-08-18 Jan Hubicka <hubicka@ucw.cz>
13039 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Check also
13042 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
13044 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor): Extend.
13045 (test_map_of_type_with_ctor_and_dtor_expand): Add function.
13046 (hash_map_tests_c_tests): Call it.
13048 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
13050 * ggc.h (enum ggc_collect): New.
13051 (ggc_collect): Use it.
13052 * ggc-page.c: Adjust.
13053 * ggc-common.c: Likewise.
13054 * ggc-tests.c: Likewise.
13055 * read-rtl-function.c: Likewise.
13056 * selftest-run-tests.c: Likewise.
13057 * doc/gty.texi (Invoking the garbage collector): Likewise.
13059 2021-08-18 liuhongt <hongtao.liu@intel.com>
13062 * config/i386/i386.h (TARGET_V2DF_REDUCTION_PREFER_HADDPD):
13064 * config/i386/sse.md (*sse3_haddv2df3_low): Add
13065 TARGET_V2DF_REDUCTION_PREFER_HADDPD.
13066 (*sse3_hsubv2df3_low): Ditto.
13067 * config/i386/x86-tune.def
13068 (X86_TUNE_V2DF_REDUCTION_PREFER_HADDPD): New tune.
13070 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
13072 * gimple-range-gori.cc (gori_compute::gori_compute): Enable tracing.
13073 (gori_compute::compute_operand_range): Add tracing.
13074 (gori_compute::logical_combine): Ditto.
13075 (gori_compute::compute_logical_operands): Ditto.
13076 (gori_compute::compute_operand1_range): Ditto.
13077 (gori_compute::compute_operand2_range): Ditto.
13078 (gori_compute::outgoing_edge_range_p): Ditto.
13079 * gimple-range-gori.h (class gori_compute): Add range_tracer.
13081 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
13083 * flag-types.h (enum evrp_mode): Adjust evrp-mode values.
13084 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Relocate from.
13085 * gimple-range-trace.h (DEBUG_RANGE_CACHE): Here.
13086 * params.opt (--param=evrp-mode): Adjust options.
13088 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
13090 * Makefile.in (OBJS): Add gimple-range-trace.o.
13091 * gimple-range-cache.h (enable_new_values): Remove unused prototype.
13092 * gimple-range-fold.cc: Adjust headers.
13093 * gimple-range-trace.cc: New.
13094 * gimple-range-trace.h: New.
13095 * gimple-range.cc (gimple_ranger::gimple_ranger): Enable tracer.
13096 (gimple_ranger::range_of_expr): Add tracing.
13097 (gimple_ranger::range_on_entry): Ditto.
13098 (gimple_ranger::range_on_exit): Ditto.
13099 (gimple_ranger::range_on_edge): Ditto.
13100 (gimple_ranger::fold_range_internal): Ditto.
13101 (gimple_ranger::dump_bb): Do not calculate edge range twice.
13102 (trace_ranger::*): Remove.
13103 (enable_ranger): Never create a trace_ranger.
13104 (debug_seed_ranger): Move to gimple-range-trace.cc.
13105 (dump_ranger): Ditto.
13106 (debug_ranger): Ditto.
13107 * gimple-range.h: Include gimple-range-trace.h.
13108 (range_on_entry, range_on_exit): No longer virtual.
13109 (class trace_ranger): Remove.
13110 (DEBUG_RANGE_CACHE): Move to gimple-range-trace.h.
13112 2021-08-17 Martin Sebor <msebor@redhat.com>
13114 PR middle-end/101854
13115 * builtins.c (expand_builtin_alloca): Move warning code to check_alloca
13116 in gimple-ssa-warn-access.cc.
13117 * calls.c (alloc_max_size): Move code to check_alloca.
13118 (get_size_range): Move to pointer-query.cc.
13119 (maybe_warn_alloc_args_overflow): Move to gimple-ssa-warn-access.cc.
13120 (get_attr_nonstring_decl): Move to tree.c.
13121 (fntype_argno_type): Move to gimple-ssa-warn-access.cc.
13122 (append_attrname): Same.
13123 (maybe_warn_rdwr_sizes): Same.
13124 (initialize_argument_information): Move code to
13125 gimple-ssa-warn-access.cc.
13126 * calls.h (maybe_warn_alloc_args_overflow): Move to
13127 gimple-ssa-warn-access.h.
13128 (get_attr_nonstring_decl): Move to tree.h.
13129 (maybe_warn_nonstring_arg): Move to gimple-ssa-warn-access.h.
13130 (enum size_range_flags): Move to pointer-query.h.
13131 (get_size_range): Same.
13132 * gimple-ssa-warn-access.cc (has_location): Remove unused overload
13133 to avoid Clang -Wunused-function.
13134 (get_size_range): Declare static.
13135 (maybe_emit_free_warning): Rename...
13136 (maybe_check_dealloc_call): ...to this for consistency.
13137 (class pass_waccess): Add members.
13138 (pass_waccess::~pass_waccess): Defined.
13139 (alloc_max_size): Move here from calls.c.
13140 (maybe_warn_alloc_args_overflow): Same.
13141 (check_alloca): New function.
13142 (check_alloc_size_call): New function.
13143 (check_strncat): Handle another warning flag.
13144 (pass_waccess::check_builtin): Handle alloca.
13145 (fntype_argno_type): Move here from calls.c.
13146 (append_attrname): Same.
13147 (maybe_warn_rdwr_sizes): Same.
13148 (pass_waccess::check_call): Define.
13149 (check_nonstring_args): New function.
13150 (pass_waccess::check): Call new member functions.
13151 (pass_waccess::execute): Enable ranger.
13152 * gimple-ssa-warn-access.h (get_size_range): Move here from calls.h.
13153 (maybe_warn_nonstring_arg): Same.
13154 * gimple-ssa-warn-restrict.c: Remove #include.
13155 * pointer-query.cc (get_size_range): Move here from calls.c.
13156 * pointer-query.h (enum size_range_flags): Same.
13157 (get_size_range): Same.
13158 * tree.c (get_attr_nonstring_decl): Move here from calls.c.
13159 * tree.h (get_attr_nonstring_decl): Move here from calls.h.
13161 2021-08-17 Thomas Schwinge <thomas@codesourcery.com>
13163 * ggc.h (ggc_collect): Add 'force_collect' parameter.
13164 * ggc-page.c (ggc_collect): Use that one instead of global
13165 'ggc_force_collect'. Adjust all users.
13166 * doc/gty.texi (Invoking the garbage collector): Update.
13167 * ggc-internal.h (ggc_force_collect): Remove.
13168 * ggc-common.c (ggc_force_collect): Likewise.
13169 * selftest.h (forcibly_ggc_collect): Remove.
13170 * ggc-tests.c (selftest::forcibly_ggc_collect): Likewise.
13171 * read-rtl-function.c (test_loading_labels): Adjust.
13172 * selftest-run-tests.c (run_tests): Likewise.
13174 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
13176 * config/darwin.c (darwin_file_end): Reset and reclaim the
13177 section names table at the end of compile.
13179 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
13182 * config.in: Regenerate.
13183 * config/i386/darwin.h (EXTRA_ASM_OPTS): New
13184 (ASM_SPEC): Pass options to disable branch shortening where
13186 * configure: Regenerate.
13187 * configure.ac: Detect versions of 'as' that support the
13188 optimisation which has the bug.
13190 2021-08-17 Richard Biener <rguenther@suse.de>
13192 * optabs-query.c (supports_vec_gather_load_p): Also check
13194 (supports_vec_scatter_store_p): Likewise.
13195 * tree-vect-data-refs.c (vect_gather_scatter_fn_p): Fall
13196 back to masked variants if non-masked are not supported.
13197 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
13198 When we need to use masked gather/scatter but do not have
13199 a mask set up a constant true one.
13200 * tree-vect-stmts.c (vect_check_scalar_mask): Also allow
13201 non-SSA_NAME masks.
13203 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
13205 * tree-ssa-ccp.c (bit_value_binop) [MINUS_EXPR]: Use same
13206 algorithm as PLUS_EXPR to improve subtraction bit bounds.
13207 [POINTER_DIFF_EXPR]: Treat as synonymous with MINUS_EXPR.
13209 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
13211 * tree-ssa-ccp.c (bit_value_mult_const): New helper function to
13212 calculate the mask-value pair result of a multiplication by an
13214 (bit_value_binop) [MULT_EXPR]: Call it from here for
13215 multiplications by (sparse) non-negative constants.
13217 2021-08-17 Christophe Lyon <christophe.lyon@foss.st.com>
13220 * config.gcc (gcc_cv_initfini_array): Leave undefined for
13221 uclinuxfdpiceabi targets.
13223 2021-08-17 Alexandre Oliva <oliva@adacore.com>
13225 * tree-inline.c (maybe_move_debug_stmts_to_successors): Don't
13226 reverse debug stmts.
13228 2021-08-17 Alexandre Oliva <oliva@adacore.com>
13230 * tree-cfg.c (dump_function_to_file): Use fun, not cfun.
13232 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
13234 * config/aarch64/arm_neon.h (__LD4_LANE_FUNC): Delete.
13235 (__LD4Q_LANE_FUNC): Likewise.
13236 (vld4_lane_u8): Define without macro.
13237 (vld4_lane_u16): Likewise.
13238 (vld4_lane_u32): Likewise.
13239 (vld4_lane_u64): Likewise.
13240 (vld4_lane_s8): Likewise.
13241 (vld4_lane_s16): Likewise.
13242 (vld4_lane_s32): Likewise.
13243 (vld4_lane_s64): Likewise.
13244 (vld4_lane_f16): Likewise.
13245 (vld4_lane_f32): Likewise.
13246 (vld4_lane_f64): Likewise.
13247 (vld4_lane_p8): Likewise.
13248 (vld4_lane_p16): Likewise.
13249 (vld4_lane_p64): Likewise.
13250 (vld4q_lane_u8): Likewise.
13251 (vld4q_lane_u16): Likewise.
13252 (vld4q_lane_u32): Likewise.
13253 (vld4q_lane_u64): Likewise.
13254 (vld4q_lane_s8): Likewise.
13255 (vld4q_lane_s16): Likewise.
13256 (vld4q_lane_s32): Likewise.
13257 (vld4q_lane_s64): Likewise.
13258 (vld4q_lane_f16): Likewise.
13259 (vld4q_lane_f32): Likewise.
13260 (vld4q_lane_f64): Likewise.
13261 (vld4q_lane_p8): Likewise.
13262 (vld4q_lane_p16): Likewise.
13263 (vld4q_lane_p64): Likewise.
13264 (vld4_lane_bf16): Likewise.
13265 (vld4q_lane_bf16): Likewise.
13267 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
13269 * config/aarch64/arm_neon.h (__LD3_LANE_FUNC): Delete.
13270 (__LD3Q_LANE_FUNC): Delete.
13271 (vld3_lane_u8): Define without macro.
13272 (vld3_lane_u16): Likewise.
13273 (vld3_lane_u32): Likewise.
13274 (vld3_lane_u64): Likewise.
13275 (vld3_lane_s8): Likewise.
13276 (vld3_lane_s16): Likewise.
13277 (vld3_lane_s32): Likewise.
13278 (vld3_lane_s64): Likewise.
13279 (vld3_lane_f16): Likewise.
13280 (vld3_lane_f32): Likewise.
13281 (vld3_lane_f64): Likewise.
13282 (vld3_lane_p8): Likewise.
13283 (vld3_lane_p16): Likewise.
13284 (vld3_lane_p64): Likewise.
13285 (vld3q_lane_u8): Likewise.
13286 (vld3q_lane_u16): Likewise.
13287 (vld3q_lane_u32): Likewise.
13288 (vld3q_lane_u64): Likewise.
13289 (vld3q_lane_s8): Likewise.
13290 (vld3q_lane_s16): Likewise.
13291 (vld3q_lane_s32): Likewise.
13292 (vld3q_lane_s64): Likewise.
13293 (vld3q_lane_f16): Likewise.
13294 (vld3q_lane_f32): Likewise.
13295 (vld3q_lane_f64): Likewise.
13296 (vld3q_lane_p8): Likewise.
13297 (vld3q_lane_p16): Likewise.
13298 (vld3q_lane_p64): Likewise.
13299 (vld3_lane_bf16): Likewise.
13300 (vld3q_lane_bf16): Likewise.
13302 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
13304 * config/aarch64/arm_neon.h (__LD2_LANE_FUNC): Delete.
13305 (__LD2Q_LANE_FUNC): Likewise.
13306 (vld2_lane_u8): Define without macro.
13307 (vld2_lane_u16): Likewise.
13308 (vld2_lane_u32): Likewise.
13309 (vld2_lane_u64): Likewise.
13310 (vld2_lane_s8): Likewise.
13311 (vld2_lane_s16): Likewise.
13312 (vld2_lane_s32): Likewise.
13313 (vld2_lane_s64): Likewise.
13314 (vld2_lane_f16): Likewise.
13315 (vld2_lane_f32): Likewise.
13316 (vld2_lane_f64): Likewise.
13317 (vld2_lane_p8): Likewise.
13318 (vld2_lane_p16): Likewise.
13319 (vld2_lane_p64): Likewise.
13320 (vld2q_lane_u8): Likewise.
13321 (vld2q_lane_u16): Likewise.
13322 (vld2q_lane_u32): Likewise.
13323 (vld2q_lane_u64): Likewise.
13324 (vld2q_lane_s8): Likewise.
13325 (vld2q_lane_s16): Likewise.
13326 (vld2q_lane_s32): Likewise.
13327 (vld2q_lane_s64): Likewise.
13328 (vld2q_lane_f16): Likewise.
13329 (vld2q_lane_f32): Likewise.
13330 (vld2q_lane_f64): Likewise.
13331 (vld2q_lane_p8): Likewise.
13332 (vld2q_lane_p16): Likewise.
13333 (vld2q_lane_p64): Likewise.
13334 (vld2_lane_bf16): Likewise.
13335 (vld2q_lane_bf16): Likewise.
13337 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
13339 * haifa-sched.c (advance_one_cycle): Output more context-synchronization
13342 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
13344 * haifa-sched.c (enum rfs_decision, rfs_str): Add RFS_AUTOPREF.
13345 (rank_for_schedule): Use it.
13347 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
13349 PR rtl-optimization/91598
13350 * haifa-sched.c (autopref_rank_for_schedule): Prioritize "irrelevant"
13351 insns after memory reads and before memory writes.
13353 2021-08-17 Alistair_Lee <alistair.lee@arm.com>
13355 * rtl.h (CONST_VECTOR_P): New macro.
13356 * config/aarch64/aarch64.c (aarch64_get_sve_pred_bits): Use RTL
13357 code testing macros.
13358 (aarch64_ptrue_all_mode): Likewise.
13359 (aarch64_expand_mov_immediate): Likewise.
13360 (aarch64_const_vec_all_in_range_p): Likewise.
13361 (aarch64_rtx_costs): Likewise.
13362 (aarch64_legitimate_constant_p): Likewise.
13363 (aarch64_simd_valid_immediate): Likewise.
13364 (aarch64_simd_make_constant): Likewise.
13365 (aarch64_convert_mult_to_shift): Likewise.
13366 (aarch64_expand_sve_vec_perm): Likewise.
13367 (aarch64_vec_fpconst_pow_of_2): Likewise.
13369 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
13371 PR tree-optimization/101938
13372 * range-op.cc (operator_abs::op1_range): Special case
13373 -TYPE_MIN_VALUE for flag_wrapv.
13375 2021-08-17 Kewen Lin <linkw@linux.ibm.com>
13377 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Add the cost for
13380 2021-08-17 Jakub Jelinek <jakub@redhat.com>
13382 * tree.def (OMP_SCOPE): New tree code.
13383 * tree.h (OMP_SCOPE_BODY, OMP_SCOPE_CLAUSES): Define.
13384 * tree-nested.c (convert_nonlocal_reference_stmt,
13385 convert_local_reference_stmt, convert_gimple_call): Handle
13387 * tree-pretty-print.c (dump_generic_node): Handle OMP_SCOPE.
13388 * gimple.def (GIMPLE_OMP_SCOPE): New gimple code.
13389 * gimple.c (gimple_build_omp_scope): New function.
13390 (gimple_copy): Handle GIMPLE_OMP_SCOPE.
13391 * gimple.h (gimple_build_omp_scope): Declare.
13392 (gimple_has_substatements): Handle GIMPLE_OMP_SCOPE.
13393 (gimple_omp_scope_clauses, gimple_omp_scope_clauses_ptr,
13394 gimple_omp_scope_set_clauses): New inline functions.
13395 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_SCOPE.
13396 * gimple-pretty-print.c (dump_gimple_omp_scope): New function.
13397 (pp_gimple_stmt_1): Handle GIMPLE_OMP_SCOPE.
13398 * gimple-walk.c (walk_gimple_stmt): Likewise.
13399 * gimple-low.c (lower_stmt): Likewise.
13400 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
13401 (gimplify_scan_omp_clauses): For task reductions, handle OMP_SCOPE
13402 like ORT_WORKSHARE constructs. Adjust diagnostics for %<scope%>
13403 allowing task reductions. Reject inscan reductions on scope.
13404 (omp_find_stores_stmt): Handle GIMPLE_OMP_SCOPE.
13405 (gimplify_omp_workshare, gimplify_expr): Handle OMP_SCOPE.
13406 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_SCOPE.
13407 (estimate_num_insns): Likewise.
13408 * omp-low.c (build_outer_var_ref): Look through GIMPLE_OMP_SCOPE
13409 contexts if var isn't privatized there.
13410 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_SCOPE.
13411 (scan_omp_1_stmt): Likewise.
13412 (maybe_add_implicit_barrier_cancel): Look through outer
13414 (lower_omp_scope): New function.
13415 (lower_omp_task_reductions): Handle OMP_SCOPE.
13416 (lower_omp_1): Handle GIMPLE_OMP_SCOPE.
13417 (diagnose_sb_1, diagnose_sb_2): Likewise.
13418 * omp-expand.c (expand_omp_single): Support also GIMPLE_OMP_SCOPE.
13419 (expand_omp): Handle GIMPLE_OMP_SCOPE.
13420 (omp_make_gimple_edges): Likewise.
13421 * omp-builtins.def (BUILT_IN_GOMP_SCOPE_START): New built-in.
13423 2021-08-17 Richard Biener <rguenther@suse.de>
13425 PR tree-optimization/101925
13426 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Set
13427 reverse on COMPONENT_REF and ARRAY_REF according to
13428 what reverse_storage_order_for_component_p does.
13429 (vn_reference_eq): Compare reversed on reference ops.
13430 (reverse_storage_order_for_component_p): New overload.
13431 (vn_reference_lookup_3): Check reverse_storage_order_for_component_p
13432 on the reference looked up.
13434 2021-08-17 Jeff Law <jlaw@localhost.localdomain>
13436 * config/h8300/h8300.c (shift_alg_si): Avoid loops for most SImode
13437 shifts on the H8/S.
13438 (h8300_option_override): Use loops on H8/S more often when optimizing
13440 (get_shift_alg): Handle new "special" cases on H8/S. Simplify
13441 accordingly. Handle various arithmetic right shifts with special
13442 sequences that we couldn't handle before.
13444 2021-08-16 Jeff Law <jlaw@localhost.localdomain>
13446 * config.gcc (rl78-*-elf*): Do not include dbxelf.h.
13448 2021-08-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
13450 * config/sparc/rtemself.h (SPARC_GCOV_TYPE_SIZE): Define.
13451 * config/sparc/sparc.c (sparc_gcov_type_size): New.
13452 (TARGET_GCOV_TYPE_SIZE): Redefine if SPARC_GCOV_TYPE_SIZE is defined.
13453 * coverage.c (get_gcov_type): Use targetm.gcov_type_size().
13454 * doc/tm.texi (TARGET_GCOV_TYPE_SIZE): Add hook under "Misc".
13455 * doc/tm.texi.in: Regenerate.
13456 * target.def (gcov_type_size): New target hook.
13457 * targhooks.c (default_gcov_type_size): New.
13458 * targhooks.h (default_gcov_type_size): Declare.
13459 * tree-profile.c (gimple_gen_edge_profiler): Use precision of
13461 (gimple_gen_time_profiler): Likewise.
13463 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
13465 * dwarf2out.c (add_scalar_info): Deal with DW_AT_data_bit_offset.
13467 2021-08-16 Tobias Burnus <tobias@codesourcery.com>
13469 PR middle-end/101931
13470 * omp-low.c (omp_runtime_api_call): Update for routines
13471 added in the meanwhile.
13473 2021-08-16 Martin Liska <mliska@suse.cz>
13475 PR tree-optimization/100393
13476 * tree-switch-conversion.c (group_cluster::dump): Use
13477 get_comparison_count.
13478 (jump_table_cluster::find_jump_tables): Pre-compute number of
13479 comparisons and then decrement it. Cache also max_ratio.
13480 (jump_table_cluster::can_be_handled): Change signature.
13481 * tree-switch-conversion.h (get_comparison_count): New.
13483 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
13485 * dwarf2out.c (add_data_member_location_attribute): Use GNAT
13486 encodings only when -fgnat-encodings=all is specified.
13487 (add_bound_info): Likewise.
13488 (add_byte_size_attribute): Likewise.
13489 (gen_member_die): Likewise.
13491 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
13493 * omp-oacc-neuter-broadcast.cc
13494 (execute_omp_oacc_neuter_broadcast): Plug 'par' memory leak.
13496 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
13498 * omp-oacc-neuter-broadcast.cc
13499 (execute_omp_oacc_neuter_broadcast): Clarify memory management for
13502 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
13504 * omp-oacc-neuter-broadcast.cc (field_map): Move variable into...
13505 (execute_omp_oacc_neuter_broadcast): ... here.
13506 (install_var_field, build_receiver_ref, build_sender_ref): Take
13507 'field_map_t *' parameter. Adjust all users.
13508 (worker_single_copy, neuter_worker_single): Take a
13509 'record_field_map_t *' parameter. Adjust all users.
13511 2021-08-16 liuhongt <hongtao.liu@intel.com>
13514 * config/i386/i386.md (ldexp<mode>3): Force operands[1] to
13517 2021-08-16 Martin Liska <mliska@suse.cz>
13520 * multiple_target.c (create_dispatcher_calls): Make default
13521 function local only if it is a definition.
13523 2021-08-16 Martin Liska <mliska@suse.cz>
13526 * ipa-icf-gimple.c (func_checker::compare_ssa_name): Do not
13527 consider equal SSA_NAMEs when one is a param.
13529 2021-08-16 liuhongt <hongtao.liu@intel.com>
13532 * config/i386/i386-expand.c (ix86_expand_vec_perm_vpermt2):
13533 Support vpermi2b for V32QI/V16QImode.
13534 (ix86_extract_perm_from_pool_constant): New function.
13535 (ix86_expand_vec_one_operand_perm_avx512): Support
13536 vpermw/vpermb under TARGET_AVX512BW/TARGET_AVX512VBMI.
13537 (expand_vec_perm_1): Adjust comments for upper.
13538 * config/i386/i386-protos.h (ix86_extract_perm_from_pool_constant):
13540 * config/i386/predicates.md (permvar_truncate_operand): New predicate.
13541 (pshufb_truncv4siv4hi_operand): Ditto.
13542 (pshufb_truncv8hiv8qi_operand): Ditto.
13543 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1):
13544 New pre_reload define_insn_and_split.
13545 (*avx512f_permvar_truncv8siv8hi_1): Ditto.
13546 (*avx512f_vpermvar_truncv8div8si_1): Ditto.
13547 (*avx512f_permvar_truncv32hiv32qi_1): Ditto.
13548 (*avx512f_permvar_truncv16hiv16qi_1): Ditto.
13549 (*avx512f_permvar_truncv4div4si_1): Ditto.
13550 (*avx512f_pshufb_truncv8hiv8qi_1): Ditto.
13551 (*avx512f_pshufb_truncv4siv4hi_1): Ditto.
13552 (*avx512f_pshufd_truncv2div2si_1): Ditto.
13554 2021-08-16 Kito Cheng <kito.cheng@sifive.com>
13556 * config/riscv/multilib-generator: Support code model option for
13558 * doc/install.texi: Add document of new option for
13559 --with-multilib-generator.
13561 2021-08-15 Clément Chigot <clement.chigot@atos.net>
13563 * config/rs6000/rs6000.c (xcoff_tls_exec_model_detected): New.
13564 (rs6000_legitimize_tls_address_aix): Use it.
13565 (rs6000_xcoff_file_end): Add ".ref __tls_get_addr" when
13566 xcoff_tls_exec_model_detected is true.
13568 2021-08-15 Jeff Law <jlaw@localhost.localdomain>
13570 * config/h8300/h8300.c (shift_alg_si): Retune H8/300H shifts
13571 to allow a bit more code growth, saving many dozens of cycles.
13572 (h8300_option_override): Adjus shift_alg_si if optimizing for
13574 (get_shift_alg): Use special + inline shifts for residuals
13577 2021-08-14 Stafford Horne <shorne@gmail.com>
13580 * config/or1k/or1k-opts.h: New file.
13581 * config/or1k/or1k.c (or1k_legitimize_address_1, print_reloc):
13582 Support generating gotha relocations if -mcmodel=large is
13584 * config/or1k/or1k.h (TARGET_CMODEL_SMALL, TARGET_CMODEL_LARGE):
13586 * config/or1k/or1k.opt (mcmodel=): New option.
13587 * doc/invoke.texi (OpenRISC Options): Document mcmodel.
13589 2021-08-14 Martin Sebor <msebor@redhat.com>
13591 PR middle-end/101791
13592 * gimple-ssa-warn-access.cc (new_delete_mismatch_p): Use new argument
13593 to valid_new_delete_pair_p.
13594 * tree.c (valid_new_delete_pair_p): Add argument.
13595 * tree.h (valid_new_delete_pair_p): Same.
13597 2021-08-14 Jakub Jelinek <jakub@redhat.com>
13600 * config/i386/i386-expand.c (expand_vec_perm_broadcast_1)
13601 <case E_V64QImode>: For this mode assert
13602 !TARGET_AVX512BW || d->perm[0] rather than !TARGET_AVX2 || d->perm[0].
13604 2021-08-13 Michael Meissner <meissner@linux.ibm.com>
13607 * config/rs6000/altivec.md (xxeval): Use register_predicate
13608 instead of altivec_register_predicate.
13610 2021-08-13 Martin Sebor <msebor@redhat.com>
13612 PR middle-end/101734
13613 * tree-ssa-uninit.c (maybe_warn_read_write_only): New function.
13614 (maybe_warn_operand): Call it.
13616 2021-08-13 Martin Liska <mliska@suse.cz>
13619 * attribs.c (decl_attributes): Make naked functions "noipa"
13622 2021-08-13 Martin Liska <mliska@suse.cz>
13625 * symtab.c (symtab_node::noninterposable_alias): Do not create
13626 local aliases for target_clone functions as the clonning pass
13629 2021-08-13 Martin Liska <mliska@suse.cz>
13631 * opts.c (LIVE_PATCHING_OPTION): Define.
13632 (control_options_for_live_patching): Use it in error messages.
13634 2021-08-13 Jan Hubicka <hubicka@ucw.cz>
13636 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOREAD.
13637 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
13638 ignore_stores_eaf_flags): New constants.
13639 (remove_useless_eaf_flags): New function.
13640 (eaf_flags_useful_p): Use it.
13641 (deref_flags): Add EAF_NOT_RETURNED if flag is unused;
13643 (modref_lattice::init): Add EAF_NOREAD.
13644 (modref_lattice::add_escape_point): Do not reacord escape point if
13646 (modref_lattice::merge): EAF_NOESCAPE implies EAF_NODIRECTESCAPE;
13647 use remove_useless_eaf_flags.
13648 (modref_lattice::merge_deref): Use ignore_stores_eaf_flags.
13649 (modref_lattice::merge_direct_load): Add EAF_NOREAD
13650 (analyze_ssa_name_flags): Fix handling EAF_NOT_RETURNED
13651 (analyze_parms): Use remove_useless_eaf_flags.
13652 (ipa_merge_modref_summary_after_inlining): Use ignore_stores_eaf_flags.
13653 (modref_merge_call_site_flags): Add caller and ecf_flags parameter;
13654 use remove_useless_eaf_flags.
13655 (modref_propagate_flags_in_scc): Update.
13656 * ipa-modref.h: Turn eaf_flags_t back to char.
13657 * tree-core.h (EAF_NOT_RETURNED): Fix.
13658 (EAF_NOREAD): New constant
13659 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1): Check for
13661 * tree-ssa-structalias.c (handle_rhs_call): Handle new flags.
13662 (handle_pure_call): Likewise.
13664 2021-08-12 Jakub Jelinek <jakub@redhat.com>
13666 * tree.def (OMP_MASKED): New tree code.
13667 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_FILTER.
13668 * tree.h (OMP_MASKED_BODY, OMP_MASKED_CLAUSES, OMP_MASKED_COMBINED,
13669 OMP_CLAUSE_FILTER_EXPR): Define.
13670 * tree.c (omp_clause_num_ops): Add OMP_CLAUSE_FILTER entry.
13671 (omp_clause_code_name): Likewise.
13672 (walk_tree_1): Handle OMP_CLAUSE_FILTER.
13673 * tree-nested.c (convert_nonlocal_omp_clauses,
13674 convert_local_omp_clauses): Handle OMP_CLAUSE_FILTER.
13675 (convert_nonlocal_reference_stmt, convert_local_reference_stmt,
13676 convert_gimple_call): Handle GIMPLE_OMP_MASTER.
13677 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_FILTER.
13678 (dump_generic_node): Handle OMP_MASTER.
13679 * gimple.def (GIMPLE_OMP_MASKED): New gimple code.
13680 * gimple.c (gimple_build_omp_masked): New function.
13681 (gimple_copy): Handle GIMPLE_OMP_MASKED.
13682 * gimple.h (gimple_build_omp_masked): Declare.
13683 (gimple_has_substatements): Handle GIMPLE_OMP_MASKED.
13684 (gimple_omp_masked_clauses, gimple_omp_masked_clauses_ptr,
13685 gimple_omp_masked_set_clauses): New inline functions.
13686 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_MASKED.
13687 * gimple-pretty-print.c (dump_gimple_omp_masked): New function.
13688 (pp_gimple_stmt_1): Handle GIMPLE_OMP_MASKED.
13689 * gimple-walk.c (walk_gimple_stmt): Likewise.
13690 * gimple-low.c (lower_stmt): Likewise.
13691 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
13692 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_FILTER. For clauses
13693 that take one expression rather than decl or constant, force
13694 gimplification of that into a SSA_NAME or temporary unless min
13696 (gimplify_adjust_omp_clauses): Handle OMP_CLAUSE_FILTER.
13697 (gimplify_expr): Handle OMP_MASKED.
13698 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_MASKED.
13699 (estimate_num_insns): Likewise.
13700 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_FILTER.
13701 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_MASKED. Adjust
13702 diagnostics for existence of masked construct.
13703 (scan_omp_1_stmt, lower_omp_master, lower_omp_1, diagnose_sb_1,
13704 diagnose_sb_2): Handle GIMPLE_OMP_MASKED.
13705 * omp-expand.c (expand_omp_synch, expand_omp, omp_make_gimple_edges):
13708 2021-08-12 Uroš Bizjak <ubizjak@gmail.com>
13711 * config/i386/i386.md (avx512f_scalef<mode>2): New insn pattern.
13712 (ldexp<mode>3): Use avx512f_scalef<mode>2.
13713 (UNSPEC_SCALEF): Move from ...
13714 * config/i386/sse.md (UNSPEC_SCALEF): ... here.
13716 2021-08-12 Jan Hubicka <hubicka@ucw.cz>
13718 * ipa-split.c (consider_split): Fix condition testing void functions.
13720 2021-08-12 Aldy Hernandez <aldyh@redhat.com>
13722 * doc/invoke.texi: Remove docs for threader-mode param.
13723 * flag-types.h (enum threader_mode): Remove.
13724 * params.opt: Remove threader-mode param.
13725 * tree-ssa-threadbackward.c (class back_threader): Remove
13726 path_is_unreachable_p.
13727 Make find_paths private.
13728 Add maybe_thread and thread_through_all_blocks.
13729 Remove reference marker for m_registry.
13730 Remove reference marker for m_profit.
13731 (back_threader::back_threader): Adjust for registry and profit not
13733 (dump_path): Move down.
13734 (debug): Move down.
13735 (class thread_jumps): Remove.
13736 (class back_threader_registry): Remove m_all_paths.
13738 (thread_jumps::thread_through_all_blocks): Move to back_threader
13740 (fsm_find_thread_path): Remove
13741 (back_threader::maybe_thread): New.
13742 (back_threader::thread_through_all_blocks): Move from
13744 (back_threader_registry::back_threader_registry): Remove
13746 (back_threader_registry::~back_threader_registry): Remove.
13747 (thread_jumps::find_taken_edge): Remove.
13748 (thread_jumps::check_subpath_and_update_thread_path): Remove.
13749 (thread_jumps::maybe_register_path): Remove.
13750 (thread_jumps::handle_phi): Remove.
13751 (handle_assignment_p): Remove.
13752 (thread_jumps::handle_assignment): Remove.
13753 (thread_jumps::fsm_find_control_statement_thread_paths): Remove.
13754 (thread_jumps::find_jump_threads_backwards): Remove.
13755 (thread_jumps::find_jump_threads_backwards_with_ranger): Remove.
13756 (try_thread_blocks): Rename find_jump_threads_backwards to
13758 (pass_early_thread_jumps::execute): Same.
13760 2021-08-12 Tobias Burnus <tobias@codesourcery.com>
13762 * tree-core.h (omp_clause_proc_bind_kind): Add
13763 OMP_CLAUSE_PROC_BIND_PRIMARY.
13764 * tree-pretty-print.c (dump_omp_clause): Add TODO comment to
13765 change 'master' to 'primary' in proc_bind for OpenMP 5.1.
13767 2021-08-12 Claudiu Zissulescu <claziss@synopsys.com>
13769 * common/config/arc/arc-common.c (arc_option_init_struct): Remove
13770 fno-common reference.
13771 * config/arc/arc.c (arc_override_options): Remove overriding of
13774 2021-08-12 Jakub Jelinek <jakub@redhat.com>
13777 * config/i386/i386-expand.c (ix86_expand_vec_one_operand_perm_avx512):
13778 If d->testing_p, return true after performing checks instead of
13779 actually expanding the insn.
13780 (expand_vec_perm_broadcast_1): Handle V32HImode - assert
13781 !TARGET_AVX512BW and return false.
13783 2021-08-12 Eric Botcazou <ebotcazou@gcc.gnu.org>
13785 * configure.ac (PE linker --disable-dynamicbase support): New check.
13786 * configure: Regenerate.
13787 * config.in: Likewise.
13788 * config/i386/mingw32.h (LINK_SPEC_DISABLE_DYNAMICBASE): New define.
13789 (LINK_SPEC): Use it.
13790 * config/i386/mingw-w64.h (LINK_SPEC_DISABLE_DYNAMICBASE): Likewise.
13791 (LINK_SPEC): Likewise.
13793 2021-08-12 liuhongt <hongtao.liu@intel.com>
13796 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_2): New
13797 post_reload define_insn_and_split.
13798 (*avx512bw_zero_extendv32qiv32hi2_2): Ditto.
13799 (*sse4_1_zero_extendv8qiv8hi2_4): Ditto.
13800 (*avx512f_zero_extendv16hiv16si2_2): Ditto.
13801 (*avx2_zero_extendv8hiv8si2_2): Ditto.
13802 (*sse4_1_zero_extendv4hiv4si2_4): Ditto.
13803 (*avx512f_zero_extendv8siv8di2_2): Ditto.
13804 (*avx2_zero_extendv4siv4di2_2): Ditto.
13805 (*sse4_1_zero_extendv2siv2di2_4): Ditto.
13806 (VI248_256, VI248_512, VI148_512, VI148_256, VI148_128): New
13809 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
13811 * config/rs6000/rs6000-builtin-new.def: Add always, power5, and
13814 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
13816 * config/rs6000/rs6000-builtin-new.def: Add vsx stanza.
13818 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
13820 * config/rs6000/rs6000-builtin-new.def: Finish altivec stanza.
13821 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
13822 initialization of pcvoid_type_node here...
13823 (altivec_init_builtins): ...from here.
13824 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add
13825 RS6000_BTI_const_ptr_void.
13826 (pcvoid_type_node): New macro.
13828 2021-08-11 Richard Biener <rguenther@suse.de>
13831 * tree-ssa-forwprop.c (pass_forwprop::execute): Do not decompose
13832 hard-register accesses.
13834 2021-08-11 Richard Biener <rguenther@suse.de>
13836 * tree-ssa-operands.c (operands_scanner::get_expr_operands):
13837 Do not look at COMPONENT_REF FIELD_DECLs TREE_THIS_VOLATILE
13838 to determine has_volatile_ops.
13840 2021-08-11 Eric Botcazou <ebotcazou@gcc.gnu.org>
13842 * cfgexpand.c (expand_used_vars): Reuse attribs local variable.
13844 2021-08-11 Jan Hubicka <hubicka@ucw.cz>
13845 Alexandre Oliva <oliva@adacore.com>
13847 * ipa-modref.c (modref_lattice::dump): Fix escape_point's min_flags
13849 (modref_lattice::merge_deref): Fix handling of indirect scape points.
13850 (update_escape_summary_1): Likewise.
13851 (update_escape_summary): Likewise.
13852 (ipa_merge_modref_summary_after_inlining): Likewise.
13854 2021-08-11 Richard Biener <rguenther@suse.de>
13856 PR middle-end/101858
13857 * fold-const.c (fold_binary_loc): Guard simplification
13858 of X < (cast) (1 << Y) to integer types.
13860 2021-08-11 Richard Biener <rguenther@suse.de>
13862 PR tree-optimization/101861
13863 * tree-vect-stmts.c (vectorizable_load): Fix error in
13864 previous change with regard to gather vectorization.
13866 2021-08-11 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
13869 * config/arm/arm_neon.h (vdup_n_s8): Replace call to builtin
13871 (vdup_n_s16): Likewise.
13872 (vdup_n_s32): Likewise.
13873 (vdup_n_s64): Likewise.
13874 (vdup_n_u8): Likewise.
13875 (vdup_n_u16): Likewise.
13876 (vdup_n_u32): Likewise.
13877 (vdup_n_u64): Likewise.
13878 (vdup_n_p8): Likewise.
13879 (vdup_n_p16): Likewise.
13880 (vdup_n_p64): Likewise.
13881 (vdup_n_f16): Likewise.
13882 (vdup_n_f32): Likewise.
13883 (vdupq_n_s8): Likewise.
13884 (vdupq_n_s16): Likewise.
13885 (vdupq_n_s32): Likewise.
13886 (vdupq_n_s64): Likewise.
13887 (vdupq_n_u8): Likewise.
13888 (vdupq_n_u16): Likewise.
13889 (vdupq_n_u32): Likewise.
13890 (vdupq_n_u64): Likewise.
13891 (vdupq_n_p8): Likewise.
13892 (vdupq_n_p16): Likewise.
13893 (vdupq_n_p64): Likewise.
13894 (vdupq_n_f16): Likewise.
13895 (vdupq_n_f32): Likewise.
13896 (vmov_n_s8): Replace call to builtin with call to corresponding
13898 (vmov_n_s16): Likewise.
13899 (vmov_n_s32): Likewise.
13900 (vmov_n_s64): Likewise.
13901 (vmov_n_u8): Likewise.
13902 (vmov_n_u16): Likewise.
13903 (vmov_n_u32): Likewise.
13904 (vmov_n_u64): Likewise.
13905 (vmov_n_p8): Likewise.
13906 (vmov_n_p16): Likewise.
13907 (vmov_n_f16): Likewise.
13908 (vmov_n_f32): Likewise.
13909 (vmovq_n_s8): Likewise.
13910 (vmovq_n_s16): Likewise.
13911 (vmovq_n_s32): Likewise.
13912 (vmovq_n_s64): Likewise.
13913 (vmovq_n_u8): Likewise.
13914 (vmovq_n_u16): Likewise.
13915 (vmovq_n_u32): Likewise.
13916 (vmovq_n_u64): Likewise.
13917 (vmovq_n_p8): Likewise.
13918 (vmovq_n_p16): Likewise.
13919 (vmovq_n_f16): Likewise.
13920 (vmovq_n_f32): Likewise.
13921 * config/arm/arm_neon_builtins.def: Remove entries for vdup_n.
13923 2021-08-11 liuhongt <hongtao.liu@intel.com>
13926 * config/i386/i386.md (ldexp<mode>3): Extend to vscalefs[sd]
13927 when TARGET_AVX512F and TARGET_SSE_MATH.
13929 2021-08-10 Jakub Jelinek <jakub@redhat.com>
13932 * config/i386/i386-expand.c (expand_vec_perm_even_odd): Return false
13933 for V32HImode if !TARGET_AVX512BW.
13934 (ix86_vectorize_vec_perm_const) <case E_V32HImode, case E_V64QImode>:
13935 If !TARGET_AVX512BW and TARGET_AVX512F and d.testing_p, don't fail
13936 early, but actually check the permutation.
13938 2021-08-10 Richard Biener <rguenther@suse.de>
13940 PR tree-optimization/101809
13941 * tree-vect-stmts.c (get_load_store_type): Allow emulated
13942 gathers with offset vector nunits being a constant multiple
13943 of the data vector nunits.
13944 (vect_get_gather_scatter_ops): Use the appropriate nunits
13945 for the offset vector defs.
13946 (vectorizable_store): Adjust call to
13947 vect_get_gather_scatter_ops.
13948 (vectorizable_load): Likewise. Handle the case of less
13949 offset vectors than data vectors.
13951 2021-08-10 Jakub Jelinek <jakub@redhat.com>
13954 * config/i386/sse.md (*avx512f_shuf_<shuffletype>64x2_1<mask_name>_1,
13955 *avx512f_shuf_<shuffletype>32x4_1<mask_name>_1): New define_insn
13958 2021-08-10 Richard Biener <rguenther@suse.de>
13960 PR tree-optimization/101801
13961 PR tree-optimization/101819
13962 * tree-vectorizer.h (vect_emulated_vector_p): Declare.
13963 * tree-vect-loop.c (vect_emulated_vector_p): New function.
13964 (vectorizable_reduction): Re-instantiate a check for emulated
13966 * tree-vect-stmts.c (vectorizable_shift): Likewise.
13967 (vectorizable_operation): Likewise. Cost emulated vector
13968 operations according to the scalar sequence synthesized by
13971 2021-08-10 Richard Biener <rguenther@suse.de>
13973 PR middle-end/101824
13974 * tree-nested.c (get_frame_field): Mark the COMPONENT_REF as
13975 volatile in case the variable was.
13977 2021-08-10 H.J. Lu <hjl.tools@gmail.com>
13980 * config/i386/constraints.md (BC): Document for integer SSE
13981 constant all bits set operand.
13982 (BF): New constraint for const floating-point all bits set
13984 * config/i386/i386.c (standard_sse_constant_p): Likewise.
13985 (standard_sse_constant_opcode): Likewise.
13986 * config/i386/sse.md (sseconstm1): New mode attribute.
13987 (mov<mode>_internal): Replace BC with <sseconstm1>.
13989 2021-08-10 liuhongt <hongtao.liu@intel.com>
13991 * config/i386/sse.md (cond_<insn><mode>): New expander.
13992 (VI248_AVX512VLBW): New mode iterator.
13993 * config/i386/predicates.md
13994 (nonimmediate_or_const_vec_dup_operand): New predicate.
13996 2021-08-09 Andrew MacLeod <amacleod@redhat.com>
13998 PR tree-optimization/101741
13999 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Check
14000 type of parameter for toupper/tolower.
14002 2021-08-09 Martin Jambor <mjambor@suse.cz>
14004 PR testsuite/101654
14005 * ipa-prop.c (propagate_controlled_uses): Removed a spurious space.
14007 2021-08-09 Pat Haugen <pthaugen@linux.ibm.com>
14009 * config/rs6000/rs6000.c (is_load_insn1): Verify destination is a
14011 (is_store_insn1): Verify source is a register.
14013 2021-08-09 Uroš Bizjak <ubizjak@gmail.com>
14016 * config/i386/mmx.md (<any_logic:code>v2sf3):
14017 Rename from *mmx_<any_logic:code>v2sf3
14019 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14021 * config/nvptx/nvptx.c: Cross-reference parts adapted in
14022 'gcc/omp-oacc-neuter-broadcast.cc'.
14023 * omp-low.c: Likewise.
14024 * omp-oacc-neuter-broadcast.cc: Cross-reference parts adapted from
14027 2021-08-09 Julian Brown <julian@codesourcery.com>
14028 Kwok Cheung Yeung <kcy@codesourcery.com>
14029 Thomas Schwinge <thomas@codesourcery.com>
14031 * config/gcn/gcn.c (gcn_init_builtins): Override decls for
14032 BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START,
14033 BUILT_IN_GOACC_SINGLE_COPY_END and BUILT_IN_GOACC_BARRIER.
14034 (gcn_goacc_validate_dims): Turn on worker partitioning unconditionally.
14035 (gcn_fork_join): Update comment.
14036 * config/gcn/gcn.opt (flag_worker_partitioning): Remove.
14037 (macc_experimental_workers): Remove unused option.
14039 2021-08-09 Julian Brown <julian@codesourcery.com>
14040 Nathan Sidwell <nathan@codesourcery.com> (via 'gcc/config/nvptx/nvptx.c' master)
14041 Kwok Cheung Yeung <kcy@codesourcery.com>
14042 Thomas Schwinge <thomas@codesourcery.com>
14044 * Makefile.in (OBJS): Add omp-oacc-neuter-broadcast.o.
14045 * doc/tm.texi.in (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD):
14046 Add documentation hook.
14047 * doc/tm.texi: Regenerate.
14048 * omp-oacc-neuter-broadcast.cc: New file.
14049 * omp-builtins.def (BUILT_IN_GOACC_BARRIER)
14050 (BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START)
14051 (BUILT_IN_GOACC_SINGLE_COPY_END): New builtins.
14052 * passes.def (pass_omp_oacc_neuter_broadcast): Add pass.
14053 * target.def (goacc.create_worker_broadcast_record): Add target
14055 * tree-pass.h (make_pass_omp_oacc_neuter_broadcast): Add
14057 * config/gcn/gcn-protos.h (gcn_goacc_adjust_propagation_record):
14058 Rename prototype to...
14059 (gcn_goacc_create_worker_broadcast_record): ... this.
14060 * config/gcn/gcn-tree.c (gcn_goacc_adjust_propagation_record): Rename
14062 (gcn_goacc_create_worker_broadcast_record): ... this.
14063 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_PROPAGATION_RECORD):
14065 (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD): ... this.
14067 2021-08-09 Tejas Belagod <tejas.belagod@arm.com>
14070 * config/aarch64/aarch64-simd.md (vlshr<mode>3, vashr<mode>3): Use
14071 the right iterator.
14073 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14075 * Makefile.in (GTFILES): Remove '$(srcdir)/omp-offload.c'.
14077 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14079 * builtins.def (DEF_GOACC_BUILTIN, DEF_GOMP_BUILTIN): Don't
14080 consider '-foffload-abi'.
14081 * common.opt (-foffload-abi): Remove 'Var', 'Init'.
14082 * opts.c (common_handle_option) <-foffload-abi> [ACCEL_COMPILER]:
14085 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14087 * optc-gen.awk: Sanity check that 'Init' doesn't appear without
14090 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14092 * omp-builtins.def (BUILT_IN_ACC_GET_DEVICE_TYPE): Remove.
14094 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14096 * doc/gty.texi (Files): Update.
14098 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
14100 * doc/gty.texi (Files): Fix GTY header file example.
14102 2021-08-09 Roger Sayle <roger@nextmovesoftware.com>
14104 * tree-ssa-ccp.c (value_mask_to_min_max): Helper function to
14105 determine the upper and lower bounds from a mask-value pair.
14106 (bit_value_unop) [ABS_EXPR, ABSU_EXPR]: Add support for
14107 absolute value and unsigned absolute value expressions.
14108 (bit_value_binop): Initialize *VAL's precision.
14109 [LT_EXPR, LE_EXPR]: Use value_mask_to_min_max to determine
14110 upper and lower bounds of operands. Add LE_EXPR/GE_EXPR
14111 support when the operands are unknown but potentially equal.
14112 [MIN_EXPR, MAX_EXPR]: Support minimum/maximum expressions.
14114 2021-08-09 Bin Cheng <bin.cheng@linux.alibaba.com>
14116 * config/aarch64/aarch64.md
14117 (*extend<SHORT:mode><GPI:mode>2_aarch64): Use %<GPI:w>0.
14119 2021-08-08 Sergei Trofimovich <siarheit@google.com>
14121 * lra-constraints.c: Fix s/otput/output/ typo.
14123 2021-08-06 Martin Sebor <msebor@redhat.com>
14125 * builtins.c (expand_builtin_memchr): Move to gimple-ssa-warn-access.cc.
14126 (expand_builtin_strcat): Same.
14127 (expand_builtin_stpncpy): Same.
14128 (expand_builtin_strncat): Same.
14129 (check_read_access): Same.
14130 (check_memop_access): Same.
14131 (expand_builtin_strlen): Move checks to gimple-ssa-warn-access.cc.
14132 (expand_builtin_strnlen): Same.
14133 (expand_builtin_memcpy): Same.
14134 (expand_builtin_memmove): Same.
14135 (expand_builtin_mempcpy): Same.
14136 (expand_builtin_strcpy): Same.
14137 (expand_builtin_strcpy_args): Same.
14138 (expand_builtin_stpcpy_1): Same.
14139 (expand_builtin_strncpy): Same.
14140 (expand_builtin_memset): Same.
14141 (expand_builtin_bzero): Same.
14142 (expand_builtin_strcmp): Same.
14143 (expand_builtin_strncmp): Same.
14144 (expand_builtin): Remove handlers.
14145 (fold_builtin_strlen): Add a comment.
14146 * builtins.h (check_access): Move to gimple-ssa-warn-access.cc.
14147 * calls.c (maybe_warn_nonstring_arg): Same.
14148 * diagnostic-spec.c (nowarn_spec_t::nowarn_spec_t): Add warning option.
14149 * gimple-fold.c (gimple_fold_builtin_strcpy): Pass argument to callee.
14150 (gimple_fold_builtin_stpcpy): Same.
14151 * gimple-ssa-warn-access.cc (has_location): New function.
14152 (get_location): Same.
14153 (get_callee_fndecl): Same.
14154 (call_nargs): Same.
14156 (warn_string_no_nul): Define.
14157 (unterminated_array): Same.
14158 (check_nul_terminated_array): Same.
14159 (maybe_warn_nonstring_arg): Same.
14160 (maybe_warn_for_bound): Same.
14161 (warn_for_access): Same.
14162 (check_access): Same.
14163 (check_memop_access): Same.
14164 (check_read_access): Same.
14165 (warn_dealloc_offset): Use helper functions.
14166 (maybe_emit_free_warning): Same.
14167 (class pass_waccess): Add members.
14168 (check_strcat): New function.
14169 (check_strncat): New function.
14170 (check_stxcpy): New function.
14171 (check_stxncpy): New function.
14172 (check_strncmp): New function.
14173 (pass_waccess::check_builtin): New function.
14174 (pass_waccess::check): Call it.
14175 * gimple-ssa-warn-access.h (warn_string_no_nul): Move here from
14177 (maybe_warn_for_bound): Same.
14178 (check_access): Same.
14179 (check_memop_access): Same.
14180 (check_read_access): Same.
14181 * pointer-query.h (struct access_data): Define a ctor overload.
14183 2021-08-06 Richard Biener <rguenther@suse.de>
14185 PR tree-optimization/101801
14186 * tree-vectorizer.h (vect_worthwhile_without_simd_p): Rename...
14187 (vect_can_vectorize_without_simd_p): ... to this.
14188 * tree-vect-loop.c (vect_worthwhile_without_simd_p): Rename...
14189 (vect_can_vectorize_without_simd_p): ... to this and fold
14190 in vect_min_worthwhile_factor.
14191 (vect_min_worthwhile_factor): Remove.
14192 (vectorizable_reduction): Adjust and remove the cost part.
14193 * tree-vect-stmts.c (vectorizable_shift): Likewise.
14194 (vectorizable_operation): Likewise.
14196 2021-08-06 Uroš Bizjak <ubizjak@gmail.com>
14199 * config/i386/i386.md (cmove reg-to-reg move elimination peephole2s):
14200 Add general_gr_operand predicate to operand 3.
14202 2021-08-06 Roger Sayle <roger@nextmovesoftware.com>
14204 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern): Use
14205 CFN_BUILT_IN_CLRSB* instead of BUILT_IN_CLRSB* for consistency.
14207 2021-08-06 Tamar Christina <tamar.christina@arm.com>
14209 * config/aarch64/aarch64-sve-builtins.cc (register_svpattern,
14210 register_svprfop): Pass vec<> by pointer.
14211 * langhooks-def.h (lhd_simulate_enum_decl): Likewise.
14212 * langhooks.c (lhd_simulate_enum_decl): Likewise.
14213 * langhooks.h (struct lang_hooks_for_types): Likewise.
14215 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
14217 * config/aarch64/arm_neon.h (vst1_bf16_x2): Use
14218 __builtin_memcpy instead of constructing an additional
14219 __builtin_aarch64_simd_oi one vector at a time.
14220 (vst1q_bf16_x2): Likewise.
14221 (vst1_bf16_x3): Use __builtin_memcpy instead of constructing
14222 an additional __builtin_aarch64_simd_ci one vector at a time.
14223 (vst1q_bf16_x3): Likewise.
14224 (vst1_bf16_x4): Use __builtin_memcpy instead of a union.
14225 (vst1q_bf16_x4): Likewise.
14226 (vst2_bf16): Use __builtin_memcpy instead of constructing an
14227 additional __builtin_aarch64_simd_oi one vector at a time.
14228 (vst2q_bf16): Likewise.
14229 (vst3_bf16): Use __builtin_memcpy instead of constructing an
14230 additional __builtin_aarch64_simd_ci mode one vector at a
14232 (vst3q_bf16): Likewise.
14233 (vst4_bf16): Use __builtin_memcpy instead of constructing an
14234 additional __builtin_aarch64_simd_xi one vector at a time.
14235 (vst4q_bf16): Likewise.
14237 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
14239 * config/aarch64/arm_neon.h (__ST2_LANE_FUNC): Delete.
14240 (__ST2Q_LANE_FUNC): Delete.
14241 (vst2_lane_f16): Use __builtin_memcpy to copy vector
14242 structure instead of constructing __builtin_aarch64_simd_oi
14243 one vector at a time.
14244 (vst2_lane_f32): Likewise.
14245 (vst2_lane_f64): Likewise.
14246 (vst2_lane_p8): Likewise.
14247 (vst2_lane_p16): Likewise.
14248 (vst2_lane_p64): Likewise.
14249 (vst2_lane_s8): Likewise.
14250 (vst2_lane_s16): Likewise.
14251 (vst2_lane_s32): Likewise.
14252 (vst2_lane_s64): Likewise.
14253 (vst2_lane_u8): Likewise.
14254 (vst2_lane_u16): Likewise.
14255 (vst2_lane_u32): Likewise.
14256 (vst2_lane_u64): Likewise.
14257 (vst2_lane_bf16): Likewise.
14258 (vst2q_lane_f16): Use __builtin_memcpy to copy vector
14259 structure instead of using a union.
14260 (vst2q_lane_f32): Likewise.
14261 (vst2q_lane_f64): Likewise.
14262 (vst2q_lane_p8): Likewise.
14263 (vst2q_lane_p16): Likewise.
14264 (vst2q_lane_p64): Likewise.
14265 (vst2q_lane_s8): Likewise.
14266 (vst2q_lane_s16): Likewise.
14267 (vst2q_lane_s32): Likewise.
14268 (vst2q_lane_s64): Likewise.
14269 (vst2q_lane_u8): Likewise.
14270 (vst2q_lane_u16): Likewise.
14271 (vst2q_lane_u32): Likewise.
14272 (vst2q_lane_u64): Likewise.
14273 (vst2q_lane_bf16): Likewise.
14275 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
14277 * config/aarch64/arm_neon.h (__ST3_LANE_FUNC): Delete.
14278 (__ST3Q_LANE_FUNC): Delete.
14279 (vst3_lane_f16): Use __builtin_memcpy to copy vector
14280 structure instead of constructing __builtin_aarch64_simd_ci
14281 one vector at a time.
14282 (vst3_lane_f32): Likewise.
14283 (vst3_lane_f64): Likewise.
14284 (vst3_lane_p8): Likewise.
14285 (vst3_lane_p16): Likewise.
14286 (vst3_lane_p64): Likewise.
14287 (vst3_lane_s8): Likewise.
14288 (vst3_lane_s16): Likewise.
14289 (vst3_lane_s32): Likewise.
14290 (vst3_lane_s64): Likewise.
14291 (vst3_lane_u8): Likewise.
14292 (vst3_lane_u16): Likewise.
14293 (vst3_lane_u32): Likewise.
14294 (vst3_lane_u64): Likewise.
14295 (vst3_lane_bf16): Likewise.
14296 (vst3q_lane_f16): Use __builtin_memcpy to copy vector
14297 structure instead of using a union.
14298 (vst3q_lane_f32): Likewise.
14299 (vst3q_lane_f64): Likewise.
14300 (vst3q_lane_p8): Likewise.
14301 (vst3q_lane_p16): Likewise.
14302 (vst3q_lane_p64): Likewise.
14303 (vst3q_lane_s8): Likewise.
14304 (vst3q_lane_s16): Likewise.
14305 (vst3q_lane_s32): Likewise.
14306 (vst3q_lane_s64): Likewise.
14307 (vst3q_lane_u8): Likewise.
14308 (vst3q_lane_u16): Likewise.
14309 (vst3q_lane_u32): Likewise.
14310 (vst3q_lane_u64): Likewise.
14311 (vst3q_lane_bf16): Likewise.
14313 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
14315 * config/aarch64/arm_neon.h (__ST4_LANE_FUNC): Delete.
14316 (__ST4Q_LANE_FUNC): Delete.
14317 (vst4_lane_f16): Use __builtin_memcpy to copy vector
14318 structure instead of constructing __builtin_aarch64_simd_xi
14319 one vector at a time.
14320 (vst4_lane_f32): Likewise.
14321 (vst4_lane_f64): Likewise.
14322 (vst4_lane_p8): Likewise.
14323 (vst4_lane_p16): Likewise.
14324 (vst4_lane_p64): Likewise.
14325 (vst4_lane_s8): Likewise.
14326 (vst4_lane_s16): Likewise.
14327 (vst4_lane_s32): Likewise.
14328 (vst4_lane_s64): Likewise.
14329 (vst4_lane_u8): Likewise.
14330 (vst4_lane_u16): Likewise.
14331 (vst4_lane_u32): Likewise.
14332 (vst4_lane_u64): Likewise.
14333 (vst4_lane_bf16): Likewise.
14334 (vst4q_lane_f16): Use __builtin_memcpy to copy vector
14335 structure instead of using a union.
14336 (vst4q_lane_f32): Likewise.
14337 (vst4q_lane_f64): Likewise.
14338 (vst4q_lane_p8): Likewise.
14339 (vst4q_lane_p16): Likewise.
14340 (vst4q_lane_p64): Likewise.
14341 (vst4q_lane_s8): Likewise.
14342 (vst4q_lane_s16): Likewise.
14343 (vst4q_lane_s32): Likewise.
14344 (vst4q_lane_s64): Likewise.
14345 (vst4q_lane_u8): Likewise.
14346 (vst4q_lane_u16): Likewise.
14347 (vst4q_lane_u32): Likewise.
14348 (vst4q_lane_u64): Likewise.
14349 (vst4q_lane_bf16): Likewise.
14351 2021-08-06 Martin Liska <mliska@suse.cz>
14353 * config/rs6000/rs6000.c (rs6000_option_override_internal): When
14354 a target option is restored, it can have
14355 rs6000_long_double_type_size set to FLOAT_PRECISION_TFmode
14356 and error should not be emitted.
14358 2021-08-06 Sebastian Huber <sebastian.huber@embedded-brains.de>
14360 * gcov-io.h (gcov_write): Declare.
14361 * gcov-io.c (gcov_write): New.
14362 (gcov_write_counter): Remove.
14363 (gcov_write_tag_length): Likewise.
14364 (gcov_write_summary): Replace gcov_write_tag_length() with calls to
14365 gcov_write_unsigned().
14366 * doc/invoke.texi (fprofile-info-section): Mention
14367 __gcov_info_to_gdca().
14369 2021-08-06 Martin Sebor <msebor@redhat.com>
14371 * dominance.c (prune_bbs_to_update_dominators): Adjust by-value vec
14372 arguments to by-reference.
14373 (iterate_fix_dominators): Same.
14374 * dominance.h (iterate_fix_dominators): Same.
14375 * ipa-prop.h: Call auto_vec::to_vec_legacy.
14376 * tree-data-ref.c (dump_data_dependence_relation): Adjust by-value vec
14377 arguments to by-reference.
14378 (debug_data_dependence_relation): Same.
14379 (dump_data_dependence_relations): Same.
14380 * tree-data-ref.h (debug_data_dependence_relation): Same.
14381 (dump_data_dependence_relations): Same.
14382 * tree-predcom.c (dump_chains): Same.
14383 (initialize_root_vars_lm): Same.
14384 (determine_unroll_factor): Same.
14385 (replace_phis_by_defined_names): Same.
14386 (insert_init_seqs): Same.
14387 (pcom_worker::tree_predictive_commoning_loop): Call
14388 auto_vec::to_vec_legacy.
14389 * tree-ssa-pre.c (insert_into_preds_of_block): Adjust by-value vec
14390 arguments to by-reference.
14391 * tree-ssa-threadbackward.c (populate_worklist): Same.
14392 (back_threader::resolve_def): Same.
14393 * tree-vect-data-refs.c (vect_check_nonzero_value): Same.
14394 (vect_enhance_data_refs_alignment): Same.
14395 (vect_check_lower_bound): Same.
14396 (vect_prune_runtime_alias_test_list): Same.
14397 (vect_permute_store_chain): Same.
14398 * tree-vect-slp-patterns.c (vect_normalize_conj_loc): Same.
14399 * tree-vect-stmts.c (vect_create_vectorized_demotion_stmts): Same.
14400 * tree-vectorizer.h (vect_permute_store_chain): Same.
14401 * vec.c (test_init): New function.
14402 (vec_c_tests): Call new function.
14403 * vec.h (vec): Declare ctors, dtor, and assignment.
14404 (auto_vec::vec_to_legacy): New function.
14405 (vec::copy): Adjust initialization.
14407 2021-08-05 H.J. Lu <hjl.tools@gmail.com>
14410 * config/i386/i386.c (ix86_can_inline_p): Ignore MASK_80387 if
14411 callee only uses GPRs.
14412 * config/i386/ia32intrin.h: Revert commit 5463cee2770.
14413 * config/i386/serializeintrin.h: Revert commit 71958f740f1.
14414 * config/i386/x86gprintrin.h: Add
14415 #pragma GCC target("general-regs-only") and #pragma GCC pop_options
14416 to disable non-GPR ISAs.
14418 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
14420 PR middle-end/101787
14421 * doc/md.texi (cond_ashl, cond_ashr, cond_lshr): Document.
14423 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
14425 * tree-vectorizer.h (vect_is_store_elt_extraction, vect_is_reduction)
14426 (vect_reduc_type, vect_embedded_comparison_type, vect_comparison_type)
14427 (vect_is_extending_load, vect_is_integer_truncation): New functions,
14428 moved from aarch64.c but given different names.
14429 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction)
14430 (aarch64_is_reduction, aarch64_reduc_type)
14431 (aarch64_embedded_comparison_type, aarch64_comparison_type)
14432 (aarch64_extending_load_p, aarch64_integer_truncation_p): Delete
14433 in favor of the above. Update callers accordingly.
14435 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
14438 * config/arm/arm-cpus.in (generic-armv7-a): Add quirk to suppress
14439 writing .cpu directive in asm output.
14440 * config/arm/arm.c (arm_identify_fpu_from_isa): New variable.
14441 (arm_last_printed_arch_string): Delete.
14442 (arm_last-printed_fpu_string): Delete.
14443 (arm_configure_build_target): If use of floating-point/SIMD is
14444 disabled, remove all fp/simd related features from the target ISA.
14445 (last_arm_targ_options): New variable.
14446 (arm_print_asm_arch_directives): Add new parameters. Change order
14447 of emitted directives and handle all cases here.
14448 (arm_file_start): Always call arm_print_asm_arch_directives, move
14449 all generation of .arch/.arch_extension here.
14450 (arm_file_end): Call arm_print_asm_arch.
14451 (arm_declare_function_name): Call arm_print_asm_arch_directives
14452 instead of printing .arch/.fpu directives directly.
14454 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
14456 * config/arm/arm.c (arm_configure_build_target): Don't call
14457 arm_option_reconfigure_globals.
14458 (arm_option_restore): Call arm_option_reconfigure_globals after
14459 reconfiguring the target.
14460 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
14462 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
14464 * config/arm/arm.c (arm_configure_build_target): Ensure the target's
14465 arch_name is always set.
14467 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
14469 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
14470 of vec_select high-half from being added into Neon subtract
14473 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
14475 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
14476 of vec_select high-half from being added into Neon add cost.
14478 2021-08-05 Kewen Lin <linkw@linux.ibm.com>
14480 * cfgloop.h (loops_list::loops_list): Add one optional argument
14481 root and adjust accordingly, update loop tree walking and factor
14483 * cfgloop.c (loops_list::walk_loop_tree): ... this. New function.
14485 2021-08-05 Eric Botcazou <ebotcazou@gcc.gnu.org>
14487 PR tree-optimization/101626
14488 * tree-sra.c (propagate_subaccesses_from_rhs): Do not set the
14489 reverse scalar storage order on a pointer or vector component.
14491 2021-08-05 liuhongt <hongtao.liu@intel.com>
14493 * config/i386/sse.md (cond_<code><mode>): New expander.
14495 2021-08-05 liuhongt <hongtao.liu@intel.com>
14497 * config/i386/sse.md (cond_<code><mode>): New expander.
14499 2021-08-05 liuhongt <hongtao.liu@intel.com>
14501 * config/i386/sse.md (cond_<code><mode>): New expander.
14503 2021-08-04 David Malcolm <dmalcolm@redhat.com>
14506 * Makefile.in (ANALYZER_OBJS): Add analyzer/region-model-asm.o.
14508 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
14511 * config/i386/i386.h (STORE_MAX_PIECES): Allow 16/32/64 bytes
14512 only if TARGET_INTER_UNIT_MOVES_TO_VEC is true.
14514 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
14517 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
14518 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
14519 data with SSE register from one memory location to another.
14521 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
14523 * config/s390/s390.c (expand_perm_with_vpdi): New function.
14524 (vectorize_vec_perm_const_1): Call expand_perm_with_vpdi.
14525 * config/s390/vector.md (*vpdi1<mode>, @vpdi1<mode>): Enable a
14526 parameterized expander.
14527 (*vpdi4<mode>, @vpdi4<mode>): Likewise.
14529 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
14531 * config/s390/s390.c (MAX_VECT_LEN): Define macro.
14532 (struct expand_vec_perm_d): Define struct.
14533 (expand_perm_with_merge): New function.
14534 (vectorize_vec_perm_const_1): New function.
14535 (s390_vectorize_vec_perm_const): New function.
14536 (TARGET_VECTORIZE_VEC_PERM_CONST): Define target macro.
14538 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
14540 * config/s390/vector.md (V_HW_64): Remove mode iterator.
14541 (*vec_load_pair<mode>): Use V_HW_2 instead of V_HW_64.
14542 * config/s390/vx-builtins.md
14543 (vec_scatter_element<V_HW_2:mode>_SI): Use V_HW_2 instead of
14546 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
14548 * config/s390/s390.md (UNSPEC_VEC_PERMI): Remove constant
14550 * config/s390/vector.md (*vpdi1<mode>, *vpdi4<mode>): New pattern
14552 * config/s390/vx-builtins.md (*vec_permi<mode>): Emit generic rtx
14553 instead of an unspec.
14555 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
14557 * config/s390/s390-modes.def: Add more vector modes to support
14558 concatenation of two vectors.
14559 * config/s390/s390-protos.h (s390_expand_merge_perm_const): Add
14561 (s390_expand_merge): Likewise.
14562 * config/s390/s390.c (s390_expand_merge_perm_const): New function.
14563 (s390_expand_merge): New function.
14564 * config/s390/s390.md (UNSPEC_VEC_MERGEH, UNSPEC_VEC_MERGEL):
14565 Remove constant definitions.
14566 * config/s390/vector.md (V_HW_2): Add mode iterators.
14567 (VI_HW_4, V_HW_4): Rename VI_HW_4 to V_HW_4.
14568 (vec_2x_nelts, vec_2x_wide): New mode attributes.
14569 (*vmrhb, *vmrlb, *vmrhh, *vmrlh, *vmrhf, *vmrlf, *vmrhg, *vmrlg):
14570 New pattern definitions.
14571 (vec_widen_umult_lo_<mode>, vec_widen_umult_hi_<mode>)
14572 (vec_widen_smult_lo_<mode>, vec_widen_smult_hi_<mode>)
14573 (vec_unpacks_lo_v4sf, vec_unpacks_hi_v4sf, vec_unpacks_lo_v2df)
14574 (vec_unpacks_hi_v2df): Adjust expanders to emit non-unspec RTX for
14576 * config/s390/vx-builtins.md (V_HW_4): Remove mode iterator. Now
14578 (vec_mergeh<mode>, vec_mergel<mode>): Use s390_expand_merge to
14579 emit vec merge pattern.
14581 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
14583 * config/aarch64/aarch64.c (aarch64_strip_extend_vec_half):
14585 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent cost of
14586 vec_select high-half from being added into Neon multiply
14588 * rtlanal.c (vec_series_highpart_p): Define.
14589 * rtlanal.h (vec_series_highpart_p): Declare.
14591 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
14593 * config/aarch64/aarch64.c (aarch64_strip_duplicate_vec_elt):
14595 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent
14596 vec_select cost from being added into Neon multiply cost.
14598 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
14600 * tree-vect-loop.c (vect_better_loop_vinfo_p): Detect cases in
14601 which old_loop_vinfo is an epilogue loop that handles a constant
14602 number of iterations.
14604 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
14606 * tree-vect-loop.c (vect_analyze_loop): Print a dump message
14607 when a reanalyzed loop fails to be cheaper than the current
14610 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
14612 * config/aarch64/aarch64.c: Fix a typo.
14614 2021-08-04 Vincent Lefèvre <vincent-gcc@vinc17.net>
14616 PR gcov-profile/101773
14617 * gcov-io.c (gcov_close): Check return code of a fclose.
14619 2021-08-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
14622 * dwarf2out.c (dwarf2out_assembly_start): Emit a dummy
14623 .file statement when needed.
14625 2021-08-04 Richard Biener <rguenther@suse.de>
14627 * tree-vect-data-refs.c (vect_check_gather_scatter):
14628 Include widening conversions only when the result is
14629 still handed by native gather or the current offset
14630 size not already matches the data size.
14631 Also succeed analysis in case there's no native support,
14632 noted by a IFN_LAST ifn and a NULL decl.
14633 (vect_analyze_data_refs): Always consider gathers.
14634 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
14635 Test for no IFN gather rather than decl gather.
14636 * tree-vect-stmts.c (vect_model_load_cost): Pass in the
14637 gather-scatter info and cost emulated gathers accordingly.
14638 (vect_truncate_gather_scatter_offset): Properly test for
14640 (vect_use_strided_gather_scatters_p): Likewise.
14641 (get_load_store_type): Handle emulated gathers and its
14643 (vectorizable_load): Likewise. Emulate them by extracting
14644 scalar offsets, doing scalar loads and a vector construct.
14646 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
14649 * expr.c (op_by_pieces_d::op_by_pieces_d): Add a max_pieces
14650 argument to set m_max_size.
14651 (move_by_pieces_d): Pass MOVE_MAX_PIECES to op_by_pieces_d.
14652 (store_by_pieces_d): Pass STORE_MAX_PIECES to op_by_pieces_d.
14653 (compare_by_pieces_d): Pass COMPARE_MAX_PIECES to op_by_pieces_d.
14655 2021-08-04 Roger Sayle <roger@nextmovesoftware.com>
14656 Marc Glisse <marc.glisse@inria.fr>
14658 * match.pd (bit_ior, bit_xor): Canonicalize (X*C1)|(X*C2) and
14659 (X*C1)^(X*C2) as X*(C1+C2), and related variants, using
14660 tree_nonzero_bits to ensure that operands are bit-wise disjoint.
14662 2021-08-04 Richard Biener <rguenther@suse.de>
14664 * tree-ssa-forwprop.c (pass_forwprop::execute): Split
14665 out code to decompose vector loads ...
14666 (optimize_vector_load): ... here. Generalize it to
14667 handle intermediate widening and TARGET_MEM_REF loads
14668 and apply it to loads with a supported vector mode as well.
14670 2021-08-04 Richard Biener <rguenther@suse.de>
14672 PR tree-optimization/101756
14673 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Make sure
14674 the result of the reduction epilogue is compatible to the original
14677 2021-08-04 liuhongt <hongtao.liu@intel.com>
14680 * config/i386/i386.md (peephole2): Refine predicate from
14681 register_operand to general_reg_operand.
14683 2021-08-04 Aldy Hernandez <aldyh@redhat.com>
14685 * gimple-range-path.h (path_range_query::dump): Mark override.
14687 2021-08-04 Richard Biener <rguenther@suse.de>
14689 PR tree-optimization/101769
14690 * tree-tailcall.c (eliminate_tail_call): Add the created loop
14691 for the first recursion and return it via the new output parameter.
14692 (optimize_tail_call): Pass through new output param.
14693 (tree_optimize_tail_calls_1): After creating all latches,
14694 add the created loop to the loop tree. Do not mark loops for fixup.
14696 2021-08-04 Martin Liska <mliska@suse.cz>
14698 * doc/invoke.texi: Document threader-mode param.
14700 2021-08-04 liuhongt <hongtao.liu@intel.com>
14702 * config/i386/sse.md (cond_fma<mode>): New expander.
14703 (cond_fms<mode>): Ditto.
14704 (cond_fnma<mode>): Ditto.
14705 (cond_fnms<mode>): Ditto.
14707 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
14709 * config/rs6000/vsx.md (*vsx_le_perm_store_<mode>): Use && instead of &.
14711 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
14713 * config/rs6000/constraints.md: Remove "e" from the list of available
14714 constraint characters.
14716 2021-08-03 Eugene Rozenfeld <erozen@microsoft.com>
14718 PR gcov-profile/71672
14719 * auto-profile.c (afdo_indirect_call): Fix setup of the historgram value for indirect calls.
14721 2021-08-03 Paul A. Clarke <pc@us.ibm.com>
14723 * config/rs6000/smmintrin.h (_mm_minpos_epu16): New.
14725 2021-08-03 H.J. Lu <hjl.tools@gmail.com>
14727 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode,
14728 try XMM31 to avoid vzeroupper.
14730 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
14732 * doc/invoke.texi: Document -mtune=neoverse-512tvb and
14733 -mcpu=neoverse-512tvb.
14734 * config/aarch64/aarch64-cores.def (neoverse-512tvb): New entry.
14735 * config/aarch64/aarch64-tune.md: Regenerate.
14736 * config/aarch64/aarch64.c (neoverse512tvb_sve_vector_cost)
14737 (neoverse512tvb_sve_issue_info, neoverse512tvb_vec_issue_info)
14738 (neoverse512tvb_vector_cost, neoverse512tvb_tunings): New structures.
14739 (aarch64_adjust_body_cost_sve): Handle -mtune=neoverse-512tvb.
14740 (aarch64_adjust_body_cost): Likewise.
14742 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
14744 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Only
14745 record issue information for operations that occur in the
14748 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
14750 * config/aarch64/aarch64.c (aarch64_multiply_add_p): Add a vec_flags
14751 parameter. Detect cases in which an Advanced SIMD MLA would almost
14752 certainly require a MOV.
14753 (aarch64_count_ops): Update accordingly.
14755 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
14757 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction): New
14758 function, split out from...
14759 (aarch64_detect_vector_stmt_subtype): ...here.
14760 (aarch64_add_stmt_cost): Treat extracting element 0 as free.
14762 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
14764 * config/aarch64/aarch64-protos.h (sve_vec_cost):
14765 Add gather_load_x32_cost and gather_load_x64_cost.
14766 * config/aarch64/aarch64.c (generic_sve_vector_cost)
14767 (a64fx_sve_vector_cost, neoversev1_sve_vector_cost): Update
14768 accordingly, using the values given by the scalar_load * number
14769 of elements calculation that we used previously.
14770 (aarch64_detect_vector_stmt_subtype): Use the new fields.
14772 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
14774 * config/aarch64/aarch64.c (aarch64_adjust_body_cost_sve): New
14775 function, split out from...
14776 (aarch64_adjust_body_cost): ...here.
14778 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
14780 * config/aarch64/fractional-cost.h: New file.
14781 * config/aarch64/aarch64.c: Include <algorithm> (indirectly)
14782 and cost_fraction.h.
14783 (vec_cost_fraction): New typedef.
14784 (aarch64_detect_scalar_stmt_subtype): Use it for statement costs.
14785 (aarch64_detect_vector_stmt_subtype): Likewise.
14786 (aarch64_sve_adjust_stmt_cost, aarch64_adjust_stmt_cost): Likewise.
14787 (aarch64_estimate_min_cycles_per_iter): Use vec_cost_fraction
14789 (aarch64_adjust_body_cost): Likewise.
14790 (aarch64_test_cost_fraction): New function.
14791 (aarch64_run_selftests): Call it.
14793 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
14795 * config/aarch64/aarch64-protos.h (tune_params::sve_width): Turn
14797 * config/aarch64/aarch64.c (aarch64_cmp_autovec_modes): Update
14799 (aarch64_estimated_poly_value): Likewise. Use the least significant
14800 set bit for the minimum and likely values. Use the most significant
14801 set bit for the maximum value.
14803 2021-08-03 liuhongt <hongtao.liu@intel.com>
14805 * config/i386/sse.md (cond_<insn><mode>): New expander.
14806 (cond_mul<mode>): Ditto.
14808 2021-08-03 Kewen Lin <linkw@linux.ibm.com>
14810 * tree-cfg.c (move_sese_region_to_fn): Fix typos on dloop.
14812 2021-08-03 liuhongt <hongtao.liu@intel.com>
14814 * config/i386/sse.md (cond_<insn><mode>):New expander.
14815 (cond_mul<mode>): Ditto.
14816 (cond_div<mode>): Ditto.
14818 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
14820 * config/i386/i386.c (ix86_finalize_stack_frame_flags): Also
14821 check stack_realign_needed for stack realignment.
14822 (ix86_legitimate_constant_p): Always allow CONST_WIDE_INT smaller
14823 than the largest integer supported by vector register.
14824 * config/i386/i386.h (MAX_MOVE_MAX): New. Set to 64.
14825 (MOVE_MAX): Set to bytes of the largest integer supported by
14827 (STORE_MAX_PIECES): New.
14829 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
14831 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
14832 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
14833 data from one memory location to another.
14835 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
14837 PR middle-end/90773
14838 * config/i386/i386.c (TARGET_GEN_MEMSET_SCRATCH_RTX): New.
14840 2021-08-02 Aldy Hernandez <aldyh@redhat.com>
14842 PR tree-optimization/101724
14843 * params.opt: Remove --param=threader-iterative.
14844 * tree-ssa-threadbackward.c (pass_thread_jumps::execute): Remove
14847 2021-08-02 Tom de Vries <tdevries@suse.de>
14849 PR middle-end/101665
14850 * doc/extend.texi (nonnull attribute): Improve documentation.
14852 2021-08-02 Andrew Pinski <apinski@marvell.com>
14854 PR rtl-optimization/101683
14855 * rtlanal.c (may_trap_p_1): Handle UNSIGNED_FIX.
14857 2021-08-02 Roger Sayle <roger@nextmovesoftware.com>
14859 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern):
14860 Renamed from cond_removal_in_popcount_clz_ctz_pattern.
14861 Add support for BSWAP, FFS, PARITY and CLRSB builtins.
14862 (tree_ssa_phiop_worker): Update call to function above.
14864 2021-08-01 H.J. Lu <hjl.tools@gmail.com>
14867 * config/i386/i386.md (bsr_rex64_1_zext): New.
14868 (combine splitter for constant - clzll): Replace gen_bsr_rex64_1
14869 with gen_bsr_rex64_1_zext.
14871 2021-07-31 Jakub Jelinek <jakub@redhat.com>
14874 * config/i386/i386.md (bsr_rex64_1, bsr_1, bsr_zext_1): New
14875 define_insn patterns.
14876 (*bsr_rex64_2, *bsr_2): New define_insn_and_split patterns.
14877 Add combine splitters for constant - clz.
14878 (clz<mode>2): Use a temporary pseudo for bsr result.
14880 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
14882 * config/rs6000/smmintrin.h (_mm_floor_pd, _mm_floor_ps,
14883 _mm_floor_sd, _mm_floor_ss): New.
14885 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
14887 * config/rs6000/smmintrin.h (_mm_ceil_pd, _mm_ceil_ps,
14888 _mm_ceil_sd, _mm_ceil_ss): New.
14890 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
14892 * config/rs6000/smmintrin.h (_mm_blend_pd, _mm_blendv_pd,
14893 _mm_blend_ps, _mm_blendv_ps): New.
14895 2021-07-30 Roger Sayle <roger@nextmovesoftware.com>
14896 Uroš Bizjak <ubizjak@gmail.com>
14898 * config/i386/i386.md (*dec_cmov<mode>): New define_insn_and_split
14899 to generate a conditional move using the carry flag after sub $1.
14900 (peephole2): Eliminate a register-to-register move by inverting
14901 the condition of a conditional move.
14903 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
14905 * config/mmix/mmix.md ("call", "call_value", "*call_real")
14906 ("*call_value_real"): Don't generate rtx mentioning the generic
14907 operands 1 and 2 to "call", and similarly for "call_value".
14908 * config/mmix/mmix.c (mmix_print_operand_punct_valid_p)
14909 (mmix_print_operand): Use '!' instead of 'p'.
14911 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
14913 * doc/md.texi (call): Correct information about operand 2.
14914 * config/mmix/mmix.md ("call", "call_value"): Remove fixed FIXMEs.
14916 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
14918 * range-op.cc (operator_trunc_mod::wi_fold): Fold constants.
14920 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
14922 * range-op.cc (operator_div::wi_fold): Return UNDEFINED for [0, 0] divisor.
14924 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
14926 * gimple-range-cache.cc (*::set_bb_range): Change const basic_block to
14927 const_basic_block..
14928 (*::get_bb_range): Ditto.
14929 (*::bb_range_p): Ditto.
14930 * gimple-range-cache.h: Change prototypes.
14932 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
14934 PR middle-end/90773
14935 * builtins.c (builtin_memcpy_read_str): Change the mode argument
14936 from scalar_int_mode to fixed_size_mode.
14937 (builtin_strncpy_read_str): Likewise.
14938 (gen_memset_value_from_prev): New function.
14939 (builtin_memset_read_str): Change the mode argument from
14940 scalar_int_mode to fixed_size_mode. Use gen_memset_value_from_prev
14941 and support CONST_VECTOR.
14942 (builtin_memset_gen_str): Likewise.
14943 (try_store_by_multiple_pieces): Use by_pieces_constfn to declare
14945 * builtins.h (builtin_strncpy_read_str): Replace scalar_int_mode
14946 with fixed_size_mode.
14947 (builtin_memset_read_str): Likewise.
14948 * expr.c (widest_int_mode_for_size): Renamed to ...
14949 (widest_fixed_size_mode_for_size): Add a bool argument to
14950 indicate if QI vector mode can be used.
14951 (by_pieces_ninsns): Call widest_fixed_size_mode_for_size
14952 instead of widest_int_mode_for_size.
14953 (pieces_addr::adjust): Change the mode argument from
14954 scalar_int_mode to fixed_size_mode.
14955 (op_by_pieces_d): Make m_len read-only. Add a bool member,
14956 m_qi_vector_mode, to indicate that QI vector mode can be used.
14957 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
14958 initialize m_qi_vector_mode. Call widest_fixed_size_mode_for_size
14959 instead of widest_int_mode_for_size.
14960 (op_by_pieces_d::get_usable_mode): Change the mode argument from
14961 scalar_int_mode to fixed_size_mode. Call
14962 widest_fixed_size_mode_for_size instead of
14963 widest_int_mode_for_size.
14964 (op_by_pieces_d::smallest_fixed_size_mode_for_size): New member
14965 function to return the smallest integer or QI vector mode.
14966 (op_by_pieces_d::run): Call widest_fixed_size_mode_for_size
14967 instead of widest_int_mode_for_size. Call
14968 smallest_fixed_size_mode_for_size instead of
14969 smallest_int_mode_for_size.
14970 (store_by_pieces_d::store_by_pieces_d): Add a bool argument to
14971 indicate that QI vector mode can be used and pass it to
14972 op_by_pieces_d::op_by_pieces_d.
14973 (can_store_by_pieces): Call widest_fixed_size_mode_for_size
14974 instead of widest_int_mode_for_size. Pass memsetp to
14975 widest_fixed_size_mode_for_size to support QI vector mode.
14976 Allow all CONST_VECTORs for memset if vec_duplicate is supported.
14977 (store_by_pieces): Pass memsetp to
14978 store_by_pieces_d::store_by_pieces_d.
14979 (clear_by_pieces_1): Removed.
14980 (clear_by_pieces): Replace clear_by_pieces_1 with
14981 builtin_memset_read_str and pass true to store_by_pieces_d to
14982 support vector mode broadcast.
14983 (string_cst_read_str): Change the mode argument from
14984 scalar_int_mode to fixed_size_mode.
14985 * expr.h (by_pieces_constfn): Change scalar_int_mode to
14987 (by_pieces_prev): Likewise.
14988 * rtl.h (lowpart_subreg_regno): New.
14989 * rtlanal.c (lowpart_subreg_regno): New. A wrapper around
14990 simplify_subreg_regno.
14991 * target.def (gen_memset_scratch_rtx): New hook.
14992 * doc/tm.texi.in: Add TARGET_GEN_MEMSET_SCRATCH_RTX.
14993 * doc/tm.texi: Regenerated.
14995 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
14998 * config/mips/mips.c (mips_atomic_assign_expand_fenv): Use
14999 TARGET_EXPR instead of MODIFY_EXPR.
15001 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
15004 * config/mips/mips-protos.h (mips_expand_vec_cmp_expr): Declare.
15005 * config/mips/mips.c (mips_expand_vec_cmp_expr): New function.
15006 * config/mips/mips-msa.md (vec_cmp<MSA:mode><mode_i>): New
15008 (vec_cmpu<IMSA:mode><mode_i>): New expander.
15010 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
15013 * config/i386/i386-options.c (ix86_option_override_internal):
15014 Don't enable LZCNT/POPCNT if they have been disabled explicitly.
15016 2021-07-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
15019 * config/arm/arm_neon.h (vld1_p64): Replace call to builtin by
15020 explicitly dereferencing __a.
15021 (vld1_s64): Likewise.
15022 (vld1_u64): Likewise.
15023 * config/arm/arm_neon_builtins.def (vld1): Remove entry for di
15024 and change to VAR13.
15026 2021-07-30 Aldy Hernandez <aldyh@redhat.com>
15028 * gimple-loop-versioning.cc (lv_dom_walker::lv_dom_walker): Remove
15029 use of m_range_analyzer.
15030 (loop_versioning::lv_dom_walker::before_dom_children): Same.
15031 (loop_versioning::lv_dom_walker::after_dom_children): Remove.
15032 (loop_versioning::prune_loop_conditions): Replace vr_values use
15033 with range_query interface.
15034 (pass_loop_versioning::execute): Use ranger.
15036 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
15039 * ipa-devirt.c (ipa_odr_read_section): Compare the precision of
15040 enum values, and emit a warning if they mismatch.
15042 2021-07-30 Kewen Lin <linkw@linux.ibm.com>
15044 * cfgloop.h (as_const): New function.
15045 (class loop_iterator): Rename to ...
15046 (class loops_list): ... this.
15047 (loop_iterator::next): Rename to ...
15048 (loops_list::Iter::fill_curr_loop): ... this and adjust.
15049 (loop_iterator::loop_iterator): Rename to ...
15050 (loops_list::loops_list): ... this and adjust.
15051 (loops_list::Iter): New class.
15052 (loops_list::iterator): New type.
15053 (loops_list::const_iterator): New type.
15054 (loops_list::begin): New function.
15055 (loops_list::end): Likewise.
15056 (loops_list::begin const): Likewise.
15057 (loops_list::end const): Likewise.
15058 (FOR_EACH_LOOP): Remove.
15059 (FOR_EACH_LOOP_FN): Remove.
15060 * cfgloop.c (flow_loops_dump): Adjust FOR_EACH_LOOP* with range-based
15061 for loop with loops_list instance.
15062 (sort_sibling_loops): Likewise.
15063 (disambiguate_loops_with_multiple_latches): Likewise.
15064 (verify_loop_structure): Likewise.
15065 * cfgloopmanip.c (create_preheaders): Likewise.
15066 (force_single_succ_latches): Likewise.
15067 * config/aarch64/falkor-tag-collision-avoidance.c
15068 (execute_tag_collision_avoidance): Likewise.
15069 * config/mn10300/mn10300.c (mn10300_scan_for_setlb_lcc): Likewise.
15070 * config/s390/s390.c (s390_adjust_loops): Likewise.
15071 * doc/loop.texi: Likewise.
15072 * gimple-loop-interchange.cc (pass_linterchange::execute): Likewise.
15073 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
15074 * gimple-loop-versioning.cc (loop_versioning::analyze_blocks): Likewise.
15075 (loop_versioning::make_versioning_decisions): Likewise.
15076 * gimple-ssa-split-paths.c (split_paths): Likewise.
15077 * graphite-isl-ast-to-gimple.c (graphite_regenerate_ast_isl): Likewise.
15078 * graphite.c (canonicalize_loop_form): Likewise.
15079 (graphite_transform_loops): Likewise.
15080 * ipa-fnsummary.c (analyze_function_body): Likewise.
15081 * ipa-pure-const.c (analyze_function): Likewise.
15082 * loop-doloop.c (doloop_optimize_loops): Likewise.
15083 * loop-init.c (loop_optimizer_finalize): Likewise.
15084 (fix_loop_structure): Likewise.
15085 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
15086 (move_loop_invariants): Likewise.
15087 * loop-unroll.c (decide_unrolling): Likewise.
15088 (unroll_loops): Likewise.
15089 * modulo-sched.c (sms_schedule): Likewise.
15090 * predict.c (predict_loops): Likewise.
15091 (pass_profile::execute): Likewise.
15092 * profile.c (branch_prob): Likewise.
15093 * sel-sched-ir.c (sel_finish_pipelining): Likewise.
15094 (sel_find_rgns): Likewise.
15095 * tree-cfg.c (replace_loop_annotate): Likewise.
15096 (replace_uses_by): Likewise.
15097 (move_sese_region_to_fn): Likewise.
15098 * tree-if-conv.c (pass_if_conversion::execute): Likewise.
15099 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
15100 * tree-parloops.c (parallelize_loops): Likewise.
15101 * tree-predcom.c (tree_predictive_commoning): Likewise.
15102 * tree-scalar-evolution.c (scev_initialize): Likewise.
15103 (scev_reset): Likewise.
15104 * tree-ssa-dce.c (find_obviously_necessary_stmts): Likewise.
15105 * tree-ssa-live.c (remove_unused_locals): Likewise.
15106 * tree-ssa-loop-ch.c (ch_base::copy_headers): Likewise.
15107 * tree-ssa-loop-im.c (analyze_memory_references): Likewise.
15108 (tree_ssa_lim_initialize): Likewise.
15109 * tree-ssa-loop-ivcanon.c (canonicalize_induction_variables): Likewise.
15110 * tree-ssa-loop-ivopts.c (tree_ssa_iv_optimize): Likewise.
15111 * tree-ssa-loop-manip.c (get_loops_exits): Likewise.
15112 * tree-ssa-loop-niter.c (estimate_numbers_of_iterations): Likewise.
15113 (free_numbers_of_iterations_estimates): Likewise.
15114 * tree-ssa-loop-prefetch.c (tree_ssa_prefetch_arrays): Likewise.
15115 * tree-ssa-loop-split.c (tree_ssa_split_loops): Likewise.
15116 * tree-ssa-loop-unswitch.c (tree_ssa_unswitch_loops): Likewise.
15117 * tree-ssa-loop.c (gate_oacc_kernels): Likewise.
15118 (pass_scev_cprop::execute): Likewise.
15119 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
15120 * tree-ssa-sccvn.c (do_rpo_vn): Likewise.
15121 * tree-ssa-threadupdate.c
15122 (jump_thread_path_registry::thread_through_all_blocks): Likewise.
15123 * tree-vectorizer.c (vectorize_loops): Likewise.
15124 * tree-vrp.c (vrp_asserts::find_assert_locations): Likewise.
15126 2021-07-29 Hans-Peter Nilsson <hp@bitrange.com>
15128 * config/mmix/mmix.c (mmix_function_arg_1): Avoid
15129 generating a VOIDmode register for e.g the
15130 function_arg_info::end_marker.
15132 2021-07-29 Jeff Law <jeffreyalaw@gmail.com>
15134 * config/h8300/h8300-modes.def: Add CCZ, CCV and CCC, drop CCZNV.
15135 * config/h8300/h8300.md (H8cc mode iterator): Add CCZ.
15136 (cc mode_attr): Similarly.
15137 (ccz subst_attr): Similarly.
15138 * config/h8300/jumpcall.md: Add new patterns for branch-on-bit.
15139 * config/h8300/testcompare.md: Remove various cc0 based patterns
15140 that had been commented out. Add pattern to set CCZ from a bit
15143 2021-07-29 Thomas Schwinge <thomas@codesourcery.com>
15144 Julian Brown <julian@codesourcery.com>
15145 Kwok Cheung Yeung <kcy@codesourcery.com>
15147 * omp-offload.c (oacc_loop_xform_head_tail, oacc_loop_process):
15148 'update_stmt' after modification.
15149 (pass_oacc_loop_designation): New function, extracted out of...
15150 (pass_oacc_device_lower): ... this.
15151 (pass_data_oacc_loop_designation, pass_oacc_loop_designation)
15152 (make_pass_oacc_loop_designation): New
15153 * passes.def: Add it.
15154 * tree-parloops.c (create_parallel_loop): Adjust.
15155 * tree-pass.h (make_pass_oacc_loop_designation): New.
15157 2021-07-29 Aldy Hernandez <aldyh@redhat.com>
15159 * flag-types.h (enum threader_mode): New.
15160 * params.opt: Add entry for --param=threader-mode.
15161 * tree-ssa-threadbackward.c (THREADER_ITERATIVE_MODE): New.
15162 (class back_threader): New.
15163 (back_threader::back_threader): New.
15164 (back_threader::~back_threader): New.
15165 (back_threader::maybe_register_path): New.
15166 (back_threader::find_taken_edge): New.
15167 (back_threader::find_taken_edge_switch): New.
15168 (back_threader::find_taken_edge_cond): New.
15169 (back_threader::resolve_def): New.
15170 (back_threader::resolve_phi): New.
15171 (back_threader::find_paths_to_names): New.
15172 (back_threader::find_paths): New.
15175 (thread_jumps::find_jump_threads_backwards): Call ranger threader.
15176 (thread_jumps::find_jump_threads_backwards_with_ranger): New.
15177 (pass_thread_jumps::execute): Abstract out code...
15178 (try_thread_blocks): ...here.
15179 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
15180 Abstract out threading candidate code to...
15181 (single_succ_to_potentially_threadable_block): ...here.
15182 * tree-ssa-threadedge.h (single_succ_to_potentially_threadable_block):
15184 * tree-ssa-threadupdate.c (register_jump_thread): Return boolean.
15185 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
15186 Return bool from register_jump_thread.
15188 2021-07-29 Andreas Krebbel <krebbel@linux.ibm.com>
15190 * target.def: in0 and in1 do not need to be registers.
15191 * doc/tm.texi: Regenerate.
15193 2021-07-29 liuhongt <hongtao.liu@intel.com>
15196 * config/i386/i386.c (ix86_widen_mult_cost): New function.
15197 (ix86_add_stmt_cost): Use ix86_widen_mult_cost for
15200 2021-07-29 Jiufu Guo <guojiufu@linux.ibm.com>
15203 * config/rs6000/rs6000.c (TARGET_PREFERRED_DOLOOP_MODE): New hook.
15204 (rs6000_preferred_doloop_mode): New hook.
15205 * doc/tm.texi: Regenerate.
15206 * doc/tm.texi.in: Add hook preferred_doloop_mode.
15207 * target.def (preferred_doloop_mode): New hook.
15208 * targhooks.c (default_preferred_doloop_mode): New hook.
15209 * targhooks.h (default_preferred_doloop_mode): New hook.
15210 * tree-ssa-loop-ivopts.c (compute_doloop_base_on_mode): New function.
15211 (add_iv_candidate_for_doloop): Call targetm.preferred_doloop_mode
15212 and compute_doloop_base_on_mode.
15214 2021-07-28 Martin Sebor <msebor@redhat.com>
15216 PR middle-end/101494
15217 * tree-ssa-uninit.c (maybe_warn_operand): Correct object offset
15218 and size computation.
15220 2021-07-28 Martin Sebor <msebor@redhat.com>
15222 PR middle-end/101601
15223 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Remove
15225 Handle pointers to functions.
15227 2021-07-28 Martin Sebor <msebor@redhat.com>
15229 * Makefile.in (OBJS): Add gimple-ssa-warn-access.o and pointer-query.o.
15230 * attribs.h (fndecl_dealloc_argno): Move fndecl_dealloc_argno to tree.h.
15231 * builtins.c (compute_objsize_r): Move to pointer-query.cc.
15232 (access_ref::access_ref): Same.
15233 (access_ref::phi): Same.
15234 (access_ref::get_ref): Same.
15235 (access_ref::size_remaining): Same.
15236 (access_ref::offset_in_range): Same.
15237 (access_ref::add_offset): Same.
15238 (access_ref::inform_access): Same.
15239 (ssa_name_limit_t::visit_phi): Same.
15240 (ssa_name_limit_t::leave_phi): Same.
15241 (ssa_name_limit_t::next): Same.
15242 (ssa_name_limit_t::next_phi): Same.
15243 (ssa_name_limit_t::~ssa_name_limit_t): Same.
15244 (pointer_query::pointer_query): Same.
15245 (pointer_query::get_ref): Same.
15246 (pointer_query::put_ref): Same.
15247 (pointer_query::flush_cache): Same.
15248 (warn_string_no_nul): Move to gimple-ssa-warn-access.cc.
15249 (check_nul_terminated_array): Same.
15250 (unterminated_array): Same.
15251 (maybe_warn_for_bound): Same.
15252 (check_read_access): Same.
15253 (warn_for_access): Same.
15254 (get_size_range): Same.
15255 (check_access): Same.
15256 (gimple_call_alloc_size): Move to tree.c.
15257 (gimple_parm_array_size): Move to pointer-query.cc.
15258 (get_offset_range): Same.
15259 (gimple_call_return_array): Same.
15260 (handle_min_max_size): Same.
15261 (handle_array_ref): Same.
15262 (handle_mem_ref): Same.
15263 (compute_objsize): Same.
15264 (gimple_call_alloc_p): Move to gimple-ssa-warn-access.cc.
15265 (call_dealloc_argno): Same.
15266 (fndecl_dealloc_argno): Same.
15267 (new_delete_mismatch_p): Same.
15268 (matching_alloc_calls_p): Same.
15269 (warn_dealloc_offset): Same.
15270 (maybe_emit_free_warning): Same.
15271 * builtins.h (check_nul_terminated_array): Move to
15272 gimple-ssa-warn-access.h.
15273 (check_nul_terminated_array): Same.
15274 (warn_string_no_nul): Same.
15275 (unterminated_array): Same.
15276 (class ssa_name_limit_t): Same.
15277 (class pointer_query): Same.
15278 (struct access_ref): Same.
15279 (class range_query): Same.
15280 (struct access_data): Same.
15281 (gimple_call_alloc_size): Same.
15282 (gimple_parm_array_size): Same.
15283 (compute_objsize): Same.
15284 (class access_data): Same.
15285 (maybe_emit_free_warning): Same.
15286 * calls.c (initialize_argument_information): Remove call to
15287 maybe_emit_free_warning.
15288 * gimple-array-bounds.cc: Include new header..
15289 * gimple-fold.c: Same.
15290 * gimple-ssa-sprintf.c: Same.
15291 * gimple-ssa-warn-restrict.c: Same.
15292 * passes.def: Add pass_warn_access.
15293 * tree-pass.h (make_pass_warn_access): Declare.
15294 * tree-ssa-strlen.c: Include new headers.
15295 * tree.c (fndecl_dealloc_argno): Move here from builtins.c.
15296 * tree.h (fndecl_dealloc_argno): Move here from attribs.h.
15297 * gimple-ssa-warn-access.cc: New file.
15298 * gimple-ssa-warn-access.h: New file.
15299 * pointer-query.cc: New file.
15300 * pointer-query.h: New file.
15302 2021-07-28 Jakub Jelinek <jakub@redhat.com>
15304 PR middle-end/101624
15305 * ubsan.c (maybe_instrument_pointer_overflow,
15306 instrument_object_size): Only test DECL_REGISTER on VAR_DECLs,
15307 PARM_DECLs or RESULT_DECLs.
15308 * sanopt.c (maybe_optimize_ubsan_ptr_ifn): Likewise.
15310 2021-07-28 Jakub Jelinek <jakub@redhat.com>
15312 PR middle-end/101642
15313 * match.pd (bswap16 (x) == bswap16 (y)): Cast both operands
15314 to type of bswap16 for comparison.
15315 (bswap16 (x) == cst): Cast bswap16 operand to type of cst.
15317 2021-07-28 Richard Biener <rguenther@suse.de>
15319 PR tree-optimization/101615
15320 * tree-vect-slp.c (vect_optimize_slp): Materialize permutes
15321 at CTOR SLP graph entries.
15323 2021-07-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
15325 * config/aarch64/aarch64.md (*extend<SHORT:mode><GPI:mode>2_aarch64):
15326 Add "r,w" alternative.
15328 2021-07-28 H.J. Lu <hjl.tools@gmail.com>
15331 * config/i386/i386.c (ix86_avx_u128_mode_needed): Don't set
15332 AVX_U128_DIRTY when all bits are zero.
15334 2021-07-28 Richard Biener <rguenther@suse.de>
15336 PR tree-optimization/101615
15337 * tree-vect-slp.c (vect_optimize_slp): Pre-existing vector
15338 external nodes cannot be permuted so make them perm_out 0.
15340 2021-07-28 Andrew Stubbs <ams@codesourcery.com>
15343 * config.in: Regenerate.
15344 * config/gcn/gcn-hsa.h (A_FIJI): New define.
15345 (A_900): New define.
15346 (A_906): New define.
15347 (A_908): New define.
15348 (ASM_SPEC): Use A_FIJI, A_900, A_906 and A_908.
15349 * config/gcn/gcn.c (output_file_start): Adjust attributes according
15350 to the assembler capabilities.
15351 * config/gcn/mkoffload.c (main): Likewise.
15352 * configure: Regenerate.
15353 * configure.ac: Add tests for LLVM assembler attribute features.
15355 2021-07-28 Andrew MacLeod <amacleod@redhat.com>
15357 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Check for
15358 cond_false and cond_true on branches.
15360 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
15362 * config/aarch64/aarch64.c (aarch64_gen_adjusted_ldpstp): use
15365 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
15367 * alias.c (init_alias_analysis): Don't skip prologue/epilogue.
15369 2021-07-28 Jakub Jelinek <jakub@redhat.com>
15372 * config/i386/sse.md (vashr<mode>3): Split into vashrv8di3 expander
15373 and vashrv4di3 expander, where the latter requires just TARGET_AVX2
15374 and has special !TARGET_AVX512VL expansion.
15375 (vashrv2di3<mask_name>): Rename to ...
15376 (vashrv2di3): ... this. Change condition to TARGET_XOP || TARGET_AVX2
15377 and add special !TARGET_XOP && !TARGET_AVX512VL expansion.
15379 2021-07-28 Martin Uecker <muecker@gwdg.de>
15381 * calls.c (maybe_warn_rdwr_sizes): Correct argument
15382 numbers in warning that were switched.
15384 2021-07-28 Kewen Lin <linkw@linux.ibm.com>
15386 PR tree-optimization/101596
15387 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Fix wrong check
15388 by using new_type's precision instead.
15390 2021-07-28 liuhongt <hongtao.liu@intel.com>
15393 * config/i386/i386.h (processor_costs): Add new member
15395 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
15396 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
15397 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
15398 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
15399 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
15400 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
15401 generic_cost, core_cost): Initialize integer_to_sse same value
15403 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
15404 * config/i386/i386.c (ix86_builtin_vectorization_cost):
15405 Use integer_to_sse instead of sse_op to calculate the cost of
15408 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
15410 * config/rs6000/rs6000-gen-builtins.c (write_ovld_static_init): New
15412 (write_init_file): Call write_ovld_static_init.
15414 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
15416 * config/rs6000/rs6000-gen-builtins.c (write_bif_static_init): New
15418 (write_init_file): Call write_bif_static_init.
15420 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
15422 * config/rs6000/rs6000-gen-builtins.c (typemap): New struct.
15423 (TYPE_MAP_SIZE): New macro.
15424 (type_map): New initialized variable.
15425 (typemap_cmp): New function.
15426 (write_type_node): Likewise.
15427 (write_fntype_init): Implement.
15429 2021-07-27 Martin Sebor <msebor@redhat.com>
15431 PR tree-optimization/101584
15432 * tree-ssa-uninit.c (builtin_call_nomodifying_p): New function.
15433 (check_defs): Call it.
15435 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
15437 * tree-ssa-dom.c (dom_jump_threader_simplifier):
15438 Put avail_exprs_stack in the class, instead of passing it to
15439 jump_threader_simplifier.
15440 (dom_jump_threader_simplifier::simplify): Add state argument.
15441 (dom_opt_dom_walker): Add state.
15442 (pass_dominator::execute): Pass state to threader.
15443 (dom_opt_dom_walker::before_dom_children): Use state.
15444 * tree-ssa-threadedge.c (jump_threader::jump_threader): Replace
15445 arguments by state.
15446 (jump_threader::record_temporary_equivalences_from_phis):
15447 Register equivalences through the state variable.
15448 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
15449 Record ranges in a statement through the state variable.
15450 (jump_threader::simplify_control_stmt_condition): Pass state to
15452 (jump_threader::simplify_control_stmt_condition_1): Same.
15453 (jump_threader::thread_around_empty_blocks): Remove obsolete
15455 (jump_threader::thread_through_normal_block): Record equivalences
15456 on edge through the state variable.
15457 (jump_threader::thread_across_edge): Abstract state pushing.
15458 (jt_state::jt_state): New.
15459 (jt_state::push): New.
15460 (jt_state::pop): New.
15461 (jt_state::register_equiv): New.
15462 (jt_state::record_ranges_from_stmt): New.
15463 (jt_state::register_equivs_on_edge): New.
15464 (jump_threader_simplifier::jump_threader_simplifier): Move from
15466 (jump_threader_simplifier::simplify): Add state argument.
15467 * tree-ssa-threadedge.h (class jt_state): New.
15468 (class jump_threader): Add state to constructor.
15469 (class jump_threader_simplifier): Add state to simplify. Remove
15470 avail_exprs_stack from class.
15471 * tree-vrp.c (vrp_jump_threader_simplifier::simplify): Add state
15473 (vrp_jump_threader::vrp_jump_threader): Add state.
15474 (vrp_jump_threader::~vrp_jump_threader): Cleanup state.
15476 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
15478 * Makefile.in (OBJS): Add gimple-range-path.o.
15479 * gimple-range-path.cc: New file.
15480 * gimple-range-path.h: New file.
15482 2021-07-27 Jonathan Wright <jonathan.wright@arm.com>
15484 * config/aarch64/aarch64-simd.md: Push sign/zero-extension
15485 inside vec_duplicate for all patterns.
15486 * simplify-rtx.c (simplify_context::simplify_unary_operation_1):
15487 Push sign/zero-extension inside vec_duplicate.
15489 2021-07-27 Richard Biener <rguenther@suse.de>
15491 PR tree-optimization/101573
15492 * tree-ssa-uninit.c (warn_uninit_phi_uses): New function
15493 looking at uninitialized PHI arg defs in some constrained cases.
15494 (warn_uninitialized_vars): Call it.
15495 (execute_early_warn_uninitialized): Calculate dominators.
15497 2021-07-27 Richard Biener <rguenther@suse.de>
15499 PR tree-optimization/39821
15500 * tree-vect-stmts.c (vect_model_promotion_demotion_cost): Use
15501 vector_stmt for widening arithmetic.
15502 (vectorizable_conversion): Adjust.
15504 2021-07-27 Martin Jambor <mjambor@suse.cz>
15506 * cgraph.h (ipa_replace_map): New field force_load_ref.
15507 * ipa-prop.h (ipa_param_descriptor): Reduce precision of move_cost,
15508 aded new flag load_dereferenced, adjusted comments.
15509 (ipa_get_param_dereferenced): New function.
15510 (ipa_set_param_dereferenced): Likewise.
15511 * cgraphclones.c (cgraph_node::create_virtual_clone): Follow it.
15512 * ipa-cp.c: Include gimple.h.
15513 (ipcp_discover_new_direct_edges): Take into account dereferenced flag.
15514 (get_replacement_map): New parameter force_load_ref, set the
15515 appropriate flag in ipa_replace_map if set.
15516 (struct symbol_and_index_together): New type.
15517 (adjust_refs_in_act_callers): New function.
15518 (adjust_references_in_caller): Likewise.
15519 (create_specialized_node): When appropriate, call
15520 adjust_references_in_caller and force only load references.
15521 * ipa-prop.c (load_from_dereferenced_name): New function.
15522 (ipa_analyze_controlled_uses): Also detect loads from a
15523 dereference, harden testing of call statements.
15524 (ipa_write_node_info): Stream the dereferenced flag.
15525 (ipa_read_node_info): Likewise.
15526 (ipa_set_jf_constant): Also create refdesc when jump function
15527 references a variable.
15528 (cgraph_node_for_jfunc): Rename to symtab_node_for_jfunc, work
15529 also on references of variables and return a symtab_node. Adjust
15531 (propagate_controlled_uses): Also remove references to VAR_DECLs.
15533 2021-07-27 Jakub Jelinek <jakub@redhat.com>
15535 PR middle-end/101586
15536 * gimple-fold.c (clear_padding_type): Ignore FIELD_DECLs with byte
15537 positions above or equal to sz except for diagnostics of flexible
15540 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
15542 PR tree-optimization/78888
15543 * gimple-range-fold.cc (get_letter_range): New.
15544 (fold_using_range::range_of_builtin_call): Call get_letter_range.
15546 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
15548 PR tree-optimization/78888
15549 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Add cases
15550 for CFN_BUILT_IN_TOUPPER and CFN_BUILT_IN_TOLOWER.
15552 2021-07-26 Roger Sayle <roger@nextmovesoftware.com>
15553 Marc Glisse <marc.glisse@inria.fr>
15555 * match.pd (rotate): Simplify equality/inequality of rotations.
15556 (bswap): Simplify equality/inequality tests of byte swapping.
15558 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
15560 * range-op.cc (operator_bitwise_xor::op1_op2_relation_effect):
15563 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
15565 * range-op.cc (operator_lshift::fold_range): Pass rel to
15566 base class fold_range.
15567 (operator_rshift::fold_range): Same.
15569 2021-07-26 Ashimida <ashimida@linux.alibaba.com>
15572 * toplev.h (min_align_loops_log): Remove declaration.
15573 (min_align_jumps_log, min_align_labels_log): Likewise.
15574 (min_align_functions_log): Likewise.
15576 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
15578 * tree-vrp.c (vrp_simplify_cond_using_ranges): Rename vr_values
15580 (execute_vrp): Abstract out simplification of conditionals...
15581 (simplify_casted_conds): ...here.
15583 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
15585 * gimple-array-bounds.cc (array_bounds_checker::get_value_range):
15586 Add gimple argument.
15587 (array_bounds_checker::check_array_ref): Same.
15588 (array_bounds_checker::check_addr_expr): Same.
15589 (array_bounds_checker::check_array_bounds): Pass statement to
15590 check_array_bounds and check_addr_expr.
15591 * gimple-array-bounds.h (check_array_bounds): Add gimple argument.
15592 (check_addr_expr): Same.
15593 (get_value_range): Same.
15595 2021-07-26 Tamar Christina <tamar.christina@arm.com>
15597 * config/aarch64/aarch64-simd-builtins.def (sdot, udot): Rename to..
15598 (sdot_prod, udot_prod): ... This.
15599 * config/aarch64/aarch64-simd.md (aarch64_<sur>dot<vsi2qi>): Merged
15601 (<sur>dot_prod<vsi2qi>): ... this.
15602 (aarch64_<sur>dot_lane<vsi2qi>, aarch64_<sur>dot_laneq<vsi2qi>):
15603 Change operands order.
15604 (<sur>sadv16qi): Use new operands order.
15605 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32,
15606 vdotq_s32): Use new RTL ordering.
15608 2021-07-26 Tamar Christina <tamar.christina@arm.com>
15610 * config/aarch64/aarch64-builtins.c (TYPES_TERNOP_SUSS,
15611 aarch64_types_ternop_suss_qualifiers): New.
15612 * config/aarch64/aarch64-simd-builtins.def (usdot_prod): Use it.
15613 * config/aarch64/aarch64-simd.md (usdot_prod<vsi2qi>): Re-organize RTL.
15614 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Use it.
15616 2021-07-23 Jakub Jelinek <jakub@redhat.com>
15618 PR rtl-optimization/101562
15619 * expmed.c (store_integral_bit_field): Only use movstrict_optab
15620 if the operand isn't paradoxical.
15622 2021-07-23 Aldy Hernandez <aldyh@redhat.com>
15624 * gimple-array-bounds.h (class array_bounds_checker): Change
15625 ranges type to range_query.
15627 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
15629 * config/aarch64/arm_neon.h (vst1_s64_x2): Use
15630 __builtin_memcpy instead of constructing
15631 __builtin_aarch64_simd_oi one vector at a time.
15632 (vst1_u64_x2): Likewise.
15633 (vst1_f64_x2): Likewise.
15634 (vst1_s8_x2): Likewise.
15635 (vst1_p8_x2): Likewise.
15636 (vst1_s16_x2): Likewise.
15637 (vst1_p16_x2): Likewise.
15638 (vst1_s32_x2): Likewise.
15639 (vst1_u8_x2): Likewise.
15640 (vst1_u16_x2): Likewise.
15641 (vst1_u32_x2): Likewise.
15642 (vst1_f16_x2): Likewise.
15643 (vst1_f32_x2): Likewise.
15644 (vst1_p64_x2): Likewise.
15645 (vst1q_s8_x2): Likewise.
15646 (vst1q_p8_x2): Likewise.
15647 (vst1q_s16_x2): Likewise.
15648 (vst1q_p16_x2): Likewise.
15649 (vst1q_s32_x2): Likewise.
15650 (vst1q_s64_x2): Likewise.
15651 (vst1q_u8_x2): Likewise.
15652 (vst1q_u16_x2): Likewise.
15653 (vst1q_u32_x2): Likewise.
15654 (vst1q_u64_x2): Likewise.
15655 (vst1q_f16_x2): Likewise.
15656 (vst1q_f32_x2): Likewise.
15657 (vst1q_f64_x2): Likewise.
15658 (vst1q_p64_x2): Likewise.
15660 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
15662 * config/aarch64/arm_neon.h (vst1_s64_x3): Use
15663 __builtin_memcpy instead of constructing
15664 __builtin_aarch64_simd_ci one vector at a time.
15665 (vst1_u64_x3): Likewise.
15666 (vst1_f64_x3): Likewise.
15667 (vst1_s8_x3): Likewise.
15668 (vst1_p8_x3): Likewise.
15669 (vst1_s16_x3): Likewise.
15670 (vst1_p16_x3): Likewise.
15671 (vst1_s32_x3): Likewise.
15672 (vst1_u8_x3): Likewise.
15673 (vst1_u16_x3): Likewise.
15674 (vst1_u32_x3): Likewise.
15675 (vst1_f16_x3): Likewise.
15676 (vst1_f32_x3): Likewise.
15677 (vst1_p64_x3): Likewise.
15678 (vst1q_s8_x3): Likewise.
15679 (vst1q_p8_x3): Likewise.
15680 (vst1q_s16_x3): Likewise.
15681 (vst1q_p16_x3): Likewise.
15682 (vst1q_s32_x3): Likewise.
15683 (vst1q_s64_x3): Likewise.
15684 (vst1q_u8_x3): Likewise.
15685 (vst1q_u16_x3): Likewise.
15686 (vst1q_u32_x3): Likewise.
15687 (vst1q_u64_x3): Likewise.
15688 (vst1q_f16_x3): Likewise.
15689 (vst1q_f32_x3): Likewise.
15690 (vst1q_f64_x3): Likewise.
15691 (vst1q_p64_x3): Likewise.
15693 2021-07-23 H.J. Lu <hjl.tools@gmail.com>
15696 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): Don't return
15697 hard register when LRA is in progress.
15699 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
15701 * config/aarch64/arm_neon.h (vst1_s8_x4): Use
15702 __builtin_memcpy instead of using a union.
15703 (vst1q_s8_x4): Likewise.
15704 (vst1_s16_x4): Likewise.
15705 (vst1q_s16_x4): Likewise.
15706 (vst1_s32_x4): Likewise.
15707 (vst1q_s32_x4): Likewise.
15708 (vst1_u8_x4): Likewise.
15709 (vst1q_u8_x4): Likewise.
15710 (vst1_u16_x4): Likewise.
15711 (vst1q_u16_x4): Likewise.
15712 (vst1_u32_x4): Likewise.
15713 (vst1q_u32_x4): Likewise.
15714 (vst1_f16_x4): Likewise.
15715 (vst1q_f16_x4): Likewise.
15716 (vst1_f32_x4): Likewise.
15717 (vst1q_f32_x4): Likewise.
15718 (vst1_p8_x4): Likewise.
15719 (vst1q_p8_x4): Likewise.
15720 (vst1_p16_x4): Likewise.
15721 (vst1q_p16_x4): Likewise.
15722 (vst1_s64_x4): Likewise.
15723 (vst1_u64_x4): Likewise.
15724 (vst1_p64_x4): Likewise.
15725 (vst1q_s64_x4): Likewise.
15726 (vst1q_u64_x4): Likewise.
15727 (vst1q_p64_x4): Likewise.
15728 (vst1_f64_x4): Likewise.
15729 (vst1q_f64_x4): Likewise.
15731 2021-07-23 Jonathan Wrightt <jonathan.wright@arm.com>
15733 * config/aarch64/arm_neon.h (vst2_s64): Use __builtin_memcpy
15734 instead of constructing __builtin_aarch64_simd_oi one vector
15736 (vst2_u64): Likewise.
15737 (vst2_f64): Likewise.
15738 (vst2_s8): Likewise.
15739 (vst2_p8): Likewise.
15740 (vst2_s16): Likewise.
15741 (vst2_p16): Likewise.
15742 (vst2_s32): Likewise.
15743 (vst2_u8): Likewise.
15744 (vst2_u16): Likewise.
15745 (vst2_u32): Likewise.
15746 (vst2_f16): Likewise.
15747 (vst2_f32): Likewise.
15748 (vst2_p64): Likewise.
15749 (vst2q_s8): Likewise.
15750 (vst2q_p8): Likewise.
15751 (vst2q_s16): Likewise.
15752 (vst2q_p16): Likewise.
15753 (vst2q_s32): Likewise.
15754 (vst2q_s64): Likewise.
15755 (vst2q_u8): Likewise.
15756 (vst2q_u16): Likewise.
15757 (vst2q_u32): Likewise.
15758 (vst2q_u64): Likewise.
15759 (vst2q_f16): Likewise.
15760 (vst2q_f32): Likewise.
15761 (vst2q_f64): Likewise.
15762 (vst2q_p64): Likewise.
15764 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
15766 * config/aarch64/arm_neon.h (vst3_s64): Use __builtin_memcpy
15767 instead of constructing __builtin_aarch64_simd_ci one vector
15769 (vst3_u64): Likewise.
15770 (vst3_f64): Likewise.
15771 (vst3_s8): Likewise.
15772 (vst3_p8): Likewise.
15773 (vst3_s16): Likewise.
15774 (vst3_p16): Likewise.
15775 (vst3_s32): Likewise.
15776 (vst3_u8): Likewise.
15777 (vst3_u16): Likewise.
15778 (vst3_u32): Likewise.
15779 (vst3_f16): Likewise.
15780 (vst3_f32): Likewise.
15781 (vst3_p64): Likewise.
15782 (vst3q_s8): Likewise.
15783 (vst3q_p8): Likewise.
15784 (vst3q_s16): Likewise.
15785 (vst3q_p16): Likewise.
15786 (vst3q_s32): Likewise.
15787 (vst3q_s64): Likewise.
15788 (vst3q_u8): Likewise.
15789 (vst3q_u16): Likewise.
15790 (vst3q_u32): Likewise.
15791 (vst3q_u64): Likewise.
15792 (vst3q_f16): Likewise.
15793 (vst3q_f32): Likewise.
15794 (vst3q_f64): Likewise.
15795 (vst3q_p64): Likewise.
15797 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
15799 * config/aarch64/arm_neon.h (vst4_s64): Use __builtin_memcpy
15800 instead of constructing __builtin_aarch64_simd_xi one vector
15802 (vst4_u64): Likewise.
15803 (vst4_f64): Likewise.
15804 (vst4_s8): Likewise.
15805 (vst4_p8): Likewise.
15806 (vst4_s16): Likewise.
15807 (vst4_p16): Likewise.
15808 (vst4_s32): Likewise.
15809 (vst4_u8): Likewise.
15810 (vst4_u16): Likewise.
15811 (vst4_u32): Likewise.
15812 (vst4_f16): Likewise.
15813 (vst4_f32): Likewise.
15814 (vst4_p64): Likewise.
15815 (vst4q_s8): Likewise.
15816 (vst4q_p8): Likewise.
15817 (vst4q_s16): Likewise.
15818 (vst4q_p16): Likewise.
15819 (vst4q_s32): Likewise.
15820 (vst4q_s64): Likewise.
15821 (vst4q_u8): Likewise.
15822 (vst4q_u16): Likewise.
15823 (vst4q_u32): Likewise.
15824 (vst4q_u64): Likewise.
15825 (vst4q_f16): Likewise.
15826 (vst4q_f32): Likewise.
15827 (vst4q_f64): Likewise.
15828 (vst4q_p64): Likewise.
15830 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
15832 * config/aarch64/arm_neon.h (vtbx4_s8): Use __builtin_memcpy
15833 instead of constructing __builtin_aarch64_simd_oi one vector
15835 (vtbx4_u8): Likewise.
15836 (vtbx4_p8): Likewise.
15838 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
15840 * config/aarch64/arm_neon.h (vtbl3_s8): Use __builtin_memcpy
15841 instead of constructing __builtin_aarch64_simd_oi one vector
15843 (vtbl3_u8): Likewise.
15844 (vtbl3_p8): Likewise.
15845 (vtbl4_s8): Likewise.
15846 (vtbl4_u8): Likewise.
15847 (vtbl4_p8): Likewise.
15849 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
15851 * config/aarch64/arm_neon.h (vqtbx2_s8): Use __builtin_memcpy
15852 instead of constructing __builtin_aarch64_simd_oi one vector
15854 (vqtbx2_u8): Likewise.
15855 (vqtbx2_p8): Likewise.
15856 (vqtbx2q_s8): Likewise.
15857 (vqtbx2q_u8): Likewise.
15858 (vqtbx2q_p8): Likewise.
15859 (vqtbx3_s8): Use __builtin_memcpy instead of constructing
15860 __builtin_aarch64_simd_ci one vector at a time.
15861 (vqtbx3_u8): Likewise.
15862 (vqtbx3_p8): Likewise.
15863 (vqtbx3q_s8): Likewise.
15864 (vqtbx3q_u8): Likewise.
15865 (vqtbx3q_p8): Likewise.
15866 (vqtbx4_s8): Use __builtin_memcpy instead of constructing
15867 __builtin_aarch64_simd_xi one vector at a time.
15868 (vqtbx4_u8): Likewise.
15869 (vqtbx4_p8): Likewise.
15870 (vqtbx4q_s8): Likewise.
15871 (vqtbx4q_u8): Likewise.
15872 (vqtbx4q_p8): Likewise.
15874 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
15876 * config/aarch64/arm_neon.h (vqtbl2_s8): Use __builtin_memcpy
15877 instead of constructing __builtin_aarch64_simd_oi one vector
15879 (vqtbl2_u8): Likewise.
15880 (vqtbl2_p8): Likewise.
15881 (vqtbl2q_s8): Likewise.
15882 (vqtbl2q_u8): Likewise.
15883 (vqtbl2q_p8): Likewise.
15884 (vqtbl3_s8): Use __builtin_memcpy instead of constructing
15885 __builtin_aarch64_simd_ci one vector at a time.
15886 (vqtbl3_u8): Likewise.
15887 (vqtbl3_p8): Likewise.
15888 (vqtbl3q_s8): Likewise.
15889 (vqtbl3q_u8): Likewise.
15890 (vqtbl3q_p8): Likewise.
15891 (vqtbl4_s8): Use __builtin_memcpy instead of constructing
15892 __builtin_aarch64_simd_xi one vector at a time.
15893 (vqtbl4_u8): Likewise.
15894 (vqtbl4_p8): Likewise.
15895 (vqtbl4q_s8): Likewise.
15896 (vqtbl4q_u8): Likewise.
15897 (vqtbl4q_p8): Likewise.
15899 2021-07-23 Haochen Gui <guihaoc@gcc.gnu.org>
15902 * config/rs6000/rs6000.md (cstore<mode>4): Fix wrong fall through.
15904 2021-07-22 Andrew Pinski <apinski@marvell.com>
15906 PR tree-optimization/10153
15907 * tree-tailcall.c (create_tailcall_accumulator):
15908 Don't call fold_convert as the type should be correct already.
15909 (tree_optimize_tail_calls_1): Use build_{one,zero}_cst instead
15910 of integer_{one,zero}_node for the call of create_tailcall_accumulator.
15912 2021-07-22 Aldy Hernandez <aldyh@redhat.com>
15914 * gimple-range-cache.cc (non_null_ref::adjust_range): Replace
15915 varying_p check for null/non-null check.
15917 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
15919 PR tree-optimization/101511
15920 * value-relation.cc (relation_oracle::query_relation): Check if ssa1
15921 is in ssa2's equiv set, and don't trap if so.
15923 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
15925 PR tree-optimization/101497
15926 * gimple-range-fold.cc (fold_using_range::range_of_cond_expr): Check
15929 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
15931 PR tree-optimization/101496
15932 * vr-values.c (simplify_using_ranges::fold_cond): Call range_of_stmt
15933 first, then vrp_visit_cond_Stmt.
15935 2021-07-22 liuhongt <hongtao.liu@intel.com>
15937 * config/i386/i386-expand.c
15938 (ix86_broadcast_from_integer_constant): Rename to ..
15939 (ix86_broadcast_from_constant): .. this, and extend it to
15941 (ix86_expand_vector_move): Extend to float mode.
15942 * config/i386/i386-features.c
15943 (replace_constant_pool_with_broadcast): Remove.
15944 (remove_partial_avx_dependency_gate): Ditto.
15945 (constant_pool_broadcast): Ditto.
15946 (class pass_constant_pool_broadcast): Ditto.
15947 (make_pass_constant_pool_broadcast): Ditto.
15948 (remove_partial_avx_dependency): Adjust gate.
15949 * config/i386/i386-passes.def: Remove pass_constant_pool_broadcast.
15950 * config/i386/i386-protos.h
15951 (make_pass_constant_pool_broadcast): Remove.
15953 2021-07-22 liuhongt <hongtao.liu@intel.com>
15955 * config/i386/constraints.md (Wb): New constraint.
15957 * config/i386/i386.md (*ashlhi3_1): Extend to avx512 mask
15959 (*ashlqi3_1): Ditto.
15960 (*<insn><mode>3_1): Split to ..
15961 (*ashr<mode>3_1): this, ...
15962 (*lshr<mode>3_1): and this, also extend this pattern to avx512
15964 (*<insn><mode>3_1): Split to ..
15965 (*ashr<mode>3_1): this, ...
15966 (*lshrqi3_1): and this, also extend this pattern to avx512
15968 (*lshrhi3_1): And this, also extend this pattern to avx512
15970 * config/i386/sse.md (k<code><mode>): New define_split after
15971 it to convert generic shift pattern to mask shift ones.
15973 2021-07-21 Thomas Schwinge <thomas@codesourcery.com>
15974 Joseph Myers <joseph@codesourcery.com>
15975 Cesar Philippidis <cesar@codesourcery.com>
15977 * tree-core.h (omp_clause_code): Add 'OMP_CLAUSE_NOHOST'.
15978 * tree.c (omp_clause_num_ops, omp_clause_code_name, walk_tree_1):
15980 * tree-pretty-print.c (dump_omp_clause): Likewise.
15981 * omp-general.c (oacc_verify_routine_clauses): Likewise.
15982 * gimplify.c (gimplify_scan_omp_clauses)
15983 (gimplify_adjust_omp_clauses): Likewise.
15984 * tree-nested.c (convert_nonlocal_omp_clauses)
15985 (convert_local_omp_clauses): Likewise.
15986 * omp-low.c (scan_sharing_clauses): Likewise.
15987 * omp-offload.c (execute_oacc_device_lower): Update.
15989 2021-07-21 Martin Sebor <msebor@redhat.com>
15991 * tree-ssa-alias.c (walk_aliased_vdefs_1): Fix typos in a comment.
15993 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
15995 * config/rs6000/rs6000-gen-builtins.c (write_init_bif_table):
15998 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16000 * config/rs6000/rs6000-gen-builtins.c (write_fntype): New
16002 (write_fntype_init): New stub function.
16003 (write_init_bif_table): Likewise.
16004 (write_init_ovld_table): New function.
16005 (write_init_file): Implement.
16007 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16009 * config/rs6000/rs6000-gen-builtins.c
16010 (write_autogenerated_header): New function.
16011 (write_decls): Likewise.
16012 (write_extern_fntype): New callback function.
16013 (write_header_file): Implement.
16015 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16017 * config/rs6000/rs6000-gen-builtins.c (write_defines_file):
16020 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16022 * config/rs6000/rs6000-gen-builtins.c (complete_vector_type): New
16024 (complete_base_type): Likewise.
16025 (construct_fntype_id): Likewise.
16026 (parse_bif_entry): Call contruct_fntype_id.
16027 (parse_ovld_entry): Likewise.
16029 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16031 * config/rs6000/rs6000-gen-builtins.c (ovld_stanza): New struct.
16032 (MAXOVLDSTANZAS): New macro.
16033 (ovld_stanzas): New variable.
16034 (curr_ovld_stanza): Likewise.
16035 (MAXOVLDS): New macro.
16036 (ovlddata): New struct.
16037 (ovlds): New variable.
16038 (curr_ovld): Likewise.
16039 (max_ovld_args): Likewise.
16040 (parse_ovld_entry): New function.
16041 (parse_ovld_stanza): Likewise.
16042 (parse_ovld): Implement.
16044 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16046 * config/rs6000/rs6000-gen-builtins.c (parse_bif_attrs):
16049 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16051 * config/rs6000/rs6000-gen-builtins.c (parse_args): New function.
16052 (parse_prototype): Implement.
16054 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16056 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): New enum.
16057 (curr_bif_stanza): New variable.
16058 (stanza_entry): New struct.
16059 (stanza_map): New initialized variable.
16060 (enable_string): Likewise.
16061 (fnkinds): New enum.
16062 (typelist): New struct.
16063 (attrinfo): Likewise.
16064 (MAXRESTROPNDS): New macro.
16065 (prototype): New struct.
16066 (MAXBIFS): New macro.
16067 (bifdata): New struct.
16068 (bifs): New variable.
16069 (curr_bif): Likewise.
16070 (bif_order): Likewise.
16071 (bif_index): Likewise.
16072 (fatal): New function.
16073 (stanza_name_to_stanza): Likewise.
16074 (parse_bif_attrs): New stub function.
16075 (parse_prototype): Likewise.
16076 (parse_bif_entry): New function.
16077 (parse_bif_stanza): Likewise.
16078 (parse_bif): Implement.
16079 (set_bif_order): New function.
16080 (create_bif_order): Implement.
16082 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
16084 * config/rs6000/rs6000-gen-builtins.c (rbtree.h): New #include.
16085 (num_bifs): New variable.
16086 (num_ovld_stanzas): Likewise.
16087 (num_ovlds): Likewise.
16088 (parse_codes): New enum.
16089 (bif_rbt): New variable.
16090 (ovld_rbt): Likewise.
16091 (fntype_rbt): Likewise.
16092 (bifo_rbt): Likewise.
16093 (parse_bif): New stub function.
16094 (create_bif_order): Likewise.
16095 (parse_ovld): Likewise.
16096 (write_header_file): Likewise.
16097 (write_init_file): Likewise.
16098 (write_defines_file): Likewise.
16099 (delete_output_files): New function.
16102 2021-07-21 H.J. Lu <hjl.tools@gmail.com>
16105 * config/i386/i386-builtin.def: Remove OPTION_MASK_ISA_SSE4_2
16106 from CRC32 _builtin functions.
16108 2021-07-21 Sebastian Huber <sebastian.huber@embedded-brains.de>
16110 * coverage.c (build_gcov_info_var_registration): Mark the object placed
16111 in the linker set as referenced so that it does not get optimized away.
16113 2021-07-21 Kito Cheng <kito.cheng@sifive.com>
16116 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
16118 * config.gcc (riscv*-*-*): Detect which python is available.
16120 2021-07-21 Jakub Jelinek <jakub@redhat.com>
16122 PR middle-end/101535
16123 * gimplify.c (omp_check_private): Properly skip ORT_TARGET_DATA
16124 contexts in which decl isn't privatized and for ORT_TARGET return
16125 false if decl is mapped.
16127 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
16129 * gimple-loop-jam.c: Include tree-ssa-sccvn.h.
16130 (tree_loop_unroll_and_jam): Run value-numbering on a loop that
16131 has been successfully unrolled.
16133 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
16135 * tree-ssa-loop-manip.c (determine_exit_conditions): Return a null
16136 exit condition if no tail loop is needed, and if the original exit
16137 condition should therefore be kept as-is.
16138 (tree_transform_and_unroll_loop): Handle that case here too.
16140 2021-07-21 Kewen Lin <linkw@linux.ibm.com>
16142 * tree-data-ref.c (free_dependence_relations): Adjust to pass vec
16144 (free_data_refs): Likewise.
16145 * tree-data-ref.h (free_dependence_relations): Likewise.
16146 (free_data_refs): Likewise.
16147 * tree-predcom.c (struct chain): Use auto_vec instead of vec for
16149 (struct component): Likewise.
16150 (pcom_worker::pcom_worker): Adjust for auto_vec and renaming changes.
16151 (pcom_worker::~pcom_worker): Likewise.
16152 (pcom_worker::release_chain): Adjust as auto_vec changes.
16153 (pcom_worker::loop): Rename to ...
16154 (pcom_worker::m_loop): ... this.
16155 (pcom_worker::datarefs): Rename to ...
16156 (pcom_worker::m_datarefs): ... this. Use auto_vec instead of vec.
16157 (pcom_worker::dependences): Rename to ...
16158 (pcom_worker::m_dependences): ... this. Use auto_vec instead of vec.
16159 (pcom_worker::chains): Rename to ...
16160 (pcom_worker::m_chains): ... this. Use auto_vec instead of vec.
16161 (pcom_worker::looparound_phis): Rename to ...
16162 (pcom_worker::m_looparound_phis): ... this. Use auto_vec instead of
16164 (pcom_worker::cache): Rename to ...
16165 (pcom_worker::m_cache): ... this. Use auto_vec instead of vec.
16166 (pcom_worker::release_chain): Adjust for auto_vec changes.
16167 (pcom_worker::release_chains): Adjust for auto_vec and renaming
16169 (release_component): Remove.
16170 (release_components): Adjust for release_component removal.
16171 (component_of): Adjust to use vec.
16172 (merge_comps): Likewise.
16173 (pcom_worker::aff_combination_dr_offset): Adjust for renaming changes.
16174 (pcom_worker::determine_offset): Likewise.
16175 (class comp_ptrs): Remove.
16176 (pcom_worker::split_data_refs_to_components): Adjust for renaming
16177 changes, for comp_ptrs removal with auto_vec.
16178 (pcom_worker::suitable_component_p): Adjust for renaming changes.
16179 (pcom_worker::filter_suitable_components): Adjust for release_component
16181 (pcom_worker::valid_initializer_p): Adjust for renaming changes.
16182 (pcom_worker::find_looparound_phi): Likewise.
16183 (pcom_worker::add_looparound_copies): Likewise.
16184 (pcom_worker::determine_roots_comp): Likewise.
16185 (pcom_worker::single_nonlooparound_use): Likewise.
16186 (pcom_worker::execute_pred_commoning_chain): Likewise.
16187 (pcom_worker::execute_pred_commoning): Likewise.
16188 (pcom_worker::try_combine_chains): Likewise.
16189 (pcom_worker::prepare_initializers_chain): Likewise.
16190 (pcom_worker::prepare_initializers): Likewise.
16191 (pcom_worker::prepare_finalizers_chain): Likewise.
16192 (pcom_worker::prepare_finalizers): Likewise.
16193 (pcom_worker::tree_predictive_commoning_loop): Likewise.
16195 2021-07-20 Martin Sebor <msebor@redhat.com>
16197 PR middle-end/101397
16198 * builtins.c (gimple_call_return_array): Add argument. Correct
16199 offsets for memchr, mempcpy, stpcpy, and stpncpy.
16200 (compute_objsize_r): Adjust offset computation for argument returning
16203 2021-07-20 Martin Sebor <msebor@redhat.com>
16205 PR middle-end/101300
16206 * tree-ssa-uninit.c (check_defs): Handle UBSAN built-ins.
16208 2021-07-20 Jeff Law <jlaw@localhost.localdomain>
16210 * function.c (assign_parm_setup_block): Use adjust_address instead
16211 of change_address to preserve MEM_EXPR and friends.
16213 2021-07-20 Martin Sebor <msebor@redhat.com>
16215 * cfgloop.h (single_likely_exit): Adjust by-value argument to
16216 by-const-reference.
16217 * cfgloopanal.c (single_likely_exit): Same.
16218 * cgraph.h (struct cgraph_node): Same.
16219 * cgraphclones.c (cgraph_node::create_virtual_clone): Same.
16220 * genautomata.c (merge_states): Same.
16221 * genextract.c (VEC_char_to_string): Same.
16222 * genmatch.c (dt_node::gen_kids_1): Same.
16223 (walk_captures): Adjust by-value argument to by-reference.
16224 * gimple-ssa-store-merging.c (check_no_overlap): Adjust by-value argument
16225 to by-const-reference.
16226 * gimple.c (gimple_build_call_vec): Same.
16227 (gimple_build_call_internal_vec): Same.
16228 (gimple_build_switch): Same.
16229 (sort_case_labels): Same.
16230 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
16232 * gimple.h (gimple_build_call_vec): Adjust by-value argument to
16233 by-const-reference.
16234 (gimple_build_call_internal_vec): Same.
16235 (gimple_build_switch): Same.
16236 (sort_case_labels): Same.
16237 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
16239 * haifa-sched.c (calc_priorities): Adjust by-value argument to
16240 by-const-reference.
16241 (sched_init_luids): Same.
16242 (haifa_init_h_i_d): Same.
16243 * ipa-cp.c (ipa_get_indirect_edge_target_1): Same.
16244 (adjust_callers_for_value_intersection): Adjust by-value argument to
16246 (find_more_scalar_values_for_callers_subset): Adjust by-value argument to
16247 by-const-reference.
16248 (find_more_contexts_for_caller_subset): Same.
16249 (find_aggregate_values_for_callers_subset): Same.
16250 (copy_useful_known_contexts): Same.
16251 * ipa-fnsummary.c (remap_edge_summaries): Same.
16252 (remap_freqcounting_predicate): Same.
16253 * ipa-inline.c (add_new_edges_to_heap): Adjust by-value argument to
16255 * ipa-predicate.c (predicate::remap_after_inlining): Adjust by-value argument
16256 to by-const-reference.
16257 * ipa-predicate.h (predicate::remap_after_inlining): Same.
16258 * ipa-prop.c (ipa_find_agg_cst_for_param): Same.
16259 * ipa-prop.h (ipa_find_agg_cst_for_param): Same.
16260 * ira-build.c (ira_loop_tree_body_rev_postorder): Same.
16261 * read-rtl.c (add_overload_instance): Same.
16262 * rtl.h (native_decode_rtx): Same.
16263 (native_decode_vector_rtx): Same.
16264 * sched-int.h (sched_init_luids): Same.
16265 (haifa_init_h_i_d): Same.
16266 * simplify-rtx.c (native_decode_vector_rtx): Same.
16267 (native_decode_rtx): Same.
16268 * tree-call-cdce.c (gen_shrink_wrap_conditions): Same.
16269 (shrink_wrap_one_built_in_call_with_conds): Same.
16270 (shrink_wrap_conditional_dead_built_in_calls): Same.
16271 * tree-data-ref.c (create_runtime_alias_checks): Same.
16272 (compute_all_dependences): Same.
16273 * tree-data-ref.h (compute_all_dependences): Same.
16274 (create_runtime_alias_checks): Same.
16275 (index_in_loop_nest): Same.
16276 * tree-if-conv.c (mask_exists): Same.
16277 * tree-loop-distribution.c (class loop_distribution): Same.
16278 (loop_distribution::create_rdg_vertices): Same.
16279 (dump_rdg_partitions): Same.
16280 (debug_rdg_partitions): Same.
16281 (partition_contains_all_rw): Same.
16282 (loop_distribution::distribute_loop): Same.
16283 * tree-parloops.c (oacc_entry_exit_ok_1): Same.
16284 (oacc_entry_exit_single_gang): Same.
16285 * tree-ssa-loop-im.c (hoist_memory_references): Same.
16286 (loop_suitable_for_sm): Same.
16287 * tree-ssa-loop-niter.c (bound_index): Same.
16288 * tree-ssa-reassoc.c (update_ops): Same.
16289 (swap_ops_for_binary_stmt): Same.
16290 (rewrite_expr_tree): Same.
16291 (rewrite_expr_tree_parallel): Same.
16292 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Same.
16293 * tree-ssa-sccvn.h (ao_ref_init_from_vn_reference): Same.
16294 * tree-ssa-structalias.c (process_all_all_constraints): Same.
16295 (make_constraints_to): Same.
16296 (handle_lhs_call): Same.
16297 (find_func_aliases_for_builtin_call): Same.
16298 (sort_fieldstack): Same.
16299 (check_for_overlaps): Same.
16300 * tree-vect-loop-manip.c (vect_create_cond_for_align_checks): Same.
16301 (vect_create_cond_for_unequal_addrs): Same.
16302 (vect_create_cond_for_lower_bounds): Same.
16303 (vect_create_cond_for_alias_checks): Same.
16304 * tree-vect-slp-patterns.c (vect_validate_multiplication): Same.
16305 * tree-vect-slp.c (vect_analyze_slp_instance): Same.
16306 (vect_make_slp_decision): Same.
16307 (vect_slp_bbs): Same.
16308 (duplicate_and_interleave): Same.
16309 (vect_transform_slp_perm_load): Same.
16310 (vect_schedule_slp): Same.
16311 * tree-vectorizer.h (vect_transform_slp_perm_load): Same.
16312 (vect_schedule_slp): Same.
16313 (duplicate_and_interleave): Same.
16314 * tree.c (build_vector_from_ctor): Same.
16315 (build_vector): Same.
16316 (check_vector_cst): Same.
16317 (check_vector_cst_duplicate): Same.
16318 (check_vector_cst_fill): Same.
16319 (check_vector_cst_stepped): Same.
16320 * tree.h (build_vector_from_ctor): Same.
16322 2021-07-20 Jakub Jelinek <jakub@redhat.com>
16325 * config/rs6000/rs6000-protos.h (easy_altivec_constant): Change return
16326 type from bool to int.
16327 * config/rs6000/rs6000.c (vspltis_constant): Fix up handling the
16328 EASY_VECTOR_MSB case if either step or copies is not 1.
16329 (vspltis_shifted): Fix comment typo.
16330 (easy_altivec_constant): Change return type from bool to int, instead
16331 of returning true return byte size of the element mode that should be
16332 used to synthetize the constant.
16333 * config/rs6000/predicates.md (easy_vector_constant_msb): Require
16334 that vspltis_shifted is 0, handle the case where easy_altivec_constant
16335 assumes using different vector mode from CONST_VECTOR's mode.
16336 * config/rs6000/altivec.md (easy_vector_constant_msb splitter): Use
16337 easy_altivec_constant to determine mode in which -1 >> -1 should be
16338 performed, use rs6000_expand_vector_init instead of gen_vec_initv4sisi.
16340 2021-07-20 Richard Biener <rguenther@suse.de>
16343 * dwarf2out.h (dwarf_file_data): Add key member.
16344 * dwarf2out.c (dwarf_file_hasher::equal): Compare key.
16345 (dwarf_file_hasher::hash): Hash key.
16346 (lookup_filename): Remap the filename and store it in the
16347 filename member of dwarf_file_data when creating a new
16349 (file_name_acquire): Do not remap the filename again.
16350 (maybe_emit_file): Likewise.
16352 2021-07-20 Jonathan Wright <jonathan.wright@arm.com>
16354 * config/aarch64/aarch64-simd-builtins.def: Use two variant
16355 generators for all TBL/TBX intrinsics and rename to
16356 consistent forms: qtbl[1234] or qtbx[1234].
16357 * config/aarch64/aarch64-simd.md (aarch64_tbl1<mode>):
16359 (aarch64_qtbl1<mode>): This.
16360 (aarch64_tbx1<mode>): Rename to...
16361 (aarch64_qtbx1<mode>): This.
16362 (aarch64_tbl2v16qi): Delete.
16363 (aarch64_tbl3<mode>): Rename to...
16364 (aarch64_qtbl2<mode>): This.
16365 (aarch64_tbx4<mode>): Rename to...
16366 (aarch64_qtbx2<mode>): This.
16367 * config/aarch64/aarch64.c (aarch64_expand_vec_perm_1): Use
16368 renamed qtbl1 and qtbl2 RTL patterns.
16369 * config/aarch64/arm_neon.h (vqtbl1_p8): Use renamed qtbl1
16371 (vqtbl1_s8): Likewise.
16372 (vqtbl1_u8): Likewise.
16373 (vqtbl1q_p8): Likewise.
16374 (vqtbl1q_s8): Likewise.
16375 (vqtbl1q_u8): Likewise.
16376 (vqtbx1_s8): Use renamed qtbx1 RTL pattern.
16377 (vqtbx1_u8): Likewise.
16378 (vqtbx1_p8): Likewise.
16379 (vqtbx1q_s8): Likewise.
16380 (vqtbx1q_u8): Likewise.
16381 (vqtbx1q_p8): Likewise.
16382 (vtbl1_s8): Use renamed qtbl1 RTL pattern.
16383 (vtbl1_u8): Likewise.
16384 (vtbl1_p8): Likewise.
16385 (vtbl2_s8): Likewise
16386 (vtbl2_u8): Likewise.
16387 (vtbl2_p8): Likewise.
16388 (vtbl3_s8): Use renamed qtbl2 RTL pattern.
16389 (vtbl3_u8): Likewise.
16390 (vtbl3_p8): Likewise.
16391 (vtbl4_s8): Likewise.
16392 (vtbl4_u8): Likewise.
16393 (vtbl4_p8): Likewise.
16394 (vtbx2_s8): Use renamed qtbx2 RTL pattern.
16395 (vtbx2_u8): Likewise.
16396 (vtbx2_p8): Likewise.
16397 (vqtbl2_s8): Use renamed qtbl2 RTL pattern.
16398 (vqtbl2_u8): Likewise.
16399 (vqtbl2_p8): Likewise.
16400 (vqtbl2q_s8): Likewise.
16401 (vqtbl2q_u8): Likewise.
16402 (vqtbl2q_p8): Likewise.
16403 (vqtbx2_s8): Use renamed qtbx2 RTL pattern.
16404 (vqtbx2_u8): Likewise.
16405 (vqtbx2_p8): Likewise.
16406 (vqtbx2q_s8): Likewise.
16407 (vqtbx2q_u8): Likewise.
16408 (vqtbx2q_p8): Likewise.
16409 (vtbx4_s8): Likewise.
16410 (vtbx4_u8): Likewise.
16411 (vtbx4_p8): Likewise.
16413 2021-07-20 Uroš Bizjak <ubizjak@gmail.com>
16416 * config/i386/sync.md (define_peephole2 atomic_storedi_fpu):
16418 (define_peephole2 atomic_loaddi_fpu): Ditto.
16420 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
16422 * config.gcc (riscv*-*-*): Detect which python is available.
16424 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
16426 * config/rs6000/vsx.md (mulhs_<mode>): Rename to...
16427 (smul<mode>3_highpart): ... this.
16428 (mulhu_<mode>): Rename to...
16429 (umul<mode>3_highpart): ... this.
16430 * config/rs6000/rs6000-builtin.def (MULHS_V2DI, MULHS_V4SI,
16431 MULHU_V2DI, MULHU_V4SI): Adjust.
16433 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
16435 PR tree-optimization/100696
16436 * internal-fn.c (first_commutative_argument): Add info for IFN_MULH.
16437 * internal-fn.def (IFN_MULH): New internal function.
16438 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Add support to
16439 recog normal multiply highpart as IFN_MULH.
16440 * config/i386/i386.c (ix86_add_stmt_cost): Adjust for combined
16443 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
16445 * config/elfos.h (CTF_DEBUGGING_INFO): New definition.
16446 (BTF_DEBUGGING_INFO): Likewise.
16447 * doc/tm.texi.in: Document the new macros.
16448 * doc/tm.texi: Regenerated.
16449 * toplev.c: Guard initialization of debug hooks.
16451 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
16453 * flags.h (ctf_debuginfo_p): New function declaration.
16454 * opts.c (ctf_debuginfo_p): New function definition.
16456 2021-07-19 Andrew Stubbs <ams@codesourcery.com>
16459 * config/gcn/gcn-hsa.h (DRIVER_SELF_SPECS): New.
16460 (ASM_SPEC): Set -mattr for xnack and sram-ecc.
16461 * config/gcn/gcn-opts.h (enum sram_ecc_type): New.
16462 * config/gcn/gcn-valu.md: Add a warning comment.
16463 * config/gcn/gcn.c (gcn_option_override): Add "sorry" for -mxnack.
16464 (output_file_start): Add xnack and sram-ecc state to ".amdgcn_target".
16465 * config/gcn/gcn.md: Add a warning comment.
16466 * config/gcn/gcn.opt: Add -mxnack and -msram-ecc.
16467 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): Remove
16469 (EF_AMDGPU_XNACK): New.
16470 (EF_AMDGPU_SRAM_ECC): New.
16472 (copy_early_debug_info): Use elf_flags.
16473 (main): Handle -mxnack and -msram-ecc options.
16474 * doc/invoke.texi: Document -mxnack and -msram-ecc.
16476 2021-07-19 Andrew Pinski <apinski@marvell.com>
16479 * config/aarch64/aarch64.md (csneg3_uxtw_insn): Rename to ...
16480 (*cs<neg_not_cs>3_uxtw_insn4): and extend to NEG_NOT.
16482 2021-07-19 Richard Biener <rguenther@suse.de>
16484 PR tree-optimization/101505
16485 * tree-vect-patterns.c (vect_determine_precisions): Walk
16486 PHIs also for loop vectorization.
16488 2021-07-19 Richard Biener <rguenther@suse.de>
16490 * gimple.h (gimple_expr_type): Remove.
16491 * doc/gimple.texi: Remove gimple_expr_type documentation.
16493 2021-07-19 Richard Biener <rguenther@suse.de>
16495 * tree-ssa-sccvn.c (vn_reference_eq): Handle NULL vr->type.
16496 (ao_ref_init_from_vn_reference): Likewise.
16497 (fully_constant_reference): Likewise.
16498 (vn_reference_lookup_call): Do not set vr->type to random
16500 * tree-ssa-pre.c (compute_avail): Do not try to PRE calls
16502 * tree-vect-generic.c (expand_vector_piecewise): Pass in
16503 whether we expanded parallel.
16504 (expand_vector_parallel): Adjust.
16505 (expand_vector_addition): Likewise.
16506 (expand_vector_comparison): Likewise.
16507 (expand_vector_operation): Likewise.
16508 (expand_vector_scalar_condition): Likewise.
16509 (expand_vector_conversion): Likewise.
16511 2021-07-19 Richard Biener <rguenther@suse.de>
16513 * tree-vrp.c (register_edge_assert_for_2): Use the
16515 (vrp_folder::fold_predicate_in): Likewise.
16516 * vr-values.c (gimple_assign_nonzero_p): Likewise.
16517 (vr_values::extract_range_from_comparison): Likewise.
16518 (vr_values::extract_range_from_ubsan_builtin): Use the
16519 type of the first operand.
16520 (vr_values::extract_range_basic): Push down type
16521 computation, use the appropriate LHS.
16522 (vr_values::extract_range_from_assignment): Use the
16525 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
16528 * common/config/i386/i386-common.c (ix86_handle_option): For
16529 -mgeneral-regs-only, enable the GPR only instructions which are
16530 enabled implicitly by SSE ISAs unless they have been disabled
16533 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
16536 * config/i386/i386.c (ix86_check_avx_upper_stores): Moved before
16537 ix86_avx_u128_mode_needed.
16538 (ix86_avx_u128_mode_needed): Return AVX_U128_DIRTY if callee
16539 returns AVX register.
16541 2021-07-17 Jan Hubicka <hubicka@ucw.cz>
16543 * tree-ssa-structalias.c (handle_rhs_call): Support EAF_NOT_RETURNED.
16544 (handle_const_call): Liekise
16545 (handle_pure_call): Liekise
16547 2021-07-17 Andrew MacLeod <amacleod@redhat.com>
16549 PR tree-optimization/96542
16550 * range-op.cc (range_operator::wi_fold_in_parts): New.
16551 (range_operator::fold_range): Call wi_fold_in_parts.
16552 (operator_lshift::wi_fold): Fix broken lshift by [0,0].
16553 * range-op.h (wi_fold_in_parts): Add prototype.
16555 2021-07-16 David Malcolm <dmalcolm@redhat.com>
16557 * doc/analyzer.texi: Add __analyzer_dump_state.
16559 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
16561 * config/rs6000/rbtree.c: New file.
16562 * config/rs6000/rbtree.h: New file.
16564 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
16566 * config/rs6000/rs6000-gen-builtins.c (restriction): New enum.
16567 (typeinfo): Add restr field.
16568 (match_bracketed_pair): New function.
16569 (match_const_restriction): Implement.
16571 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
16573 * config/rs6000/rs6000-gen-builtins.c (match_basetype): Implement.
16575 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
16577 * config/rs6000/rs6000-gen-builtins.c (void_status): New enum.
16578 (basetype): Likewise.
16579 (typeinfo): Likewise.
16580 (handle_pointer): New function.
16581 (match_basetype): New stub function.
16582 (match_const_restriction): Likewise.
16583 (match_type): New function.
16585 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
16587 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace): New
16589 (advance_line): Likewise.
16590 (safe_inc_pos): Likewise.
16591 (match_identifier): Likewise.
16592 (match_integer): Likewise.
16593 (match_to_right_bracket): Likewise.
16595 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
16597 * config/rs6000/rs6000-gen-builtins.c (bif_file): New variable.
16598 (ovld_file): Likewise.
16599 (header_file): Likewise.
16600 (init_file): Likewise.
16601 (defines_file): Likewise.
16602 (pgm_path): Likewise.
16603 (bif_path): Likewise.
16604 (ovld_path): Likewise.
16605 (header_path): Likewise.
16606 (init_path): Likewise.
16607 (defines_path): Likewise.
16608 (LINELEN): New macro.
16609 (linebuf): New variable.
16613 (bif_diag): New function.
16614 (ovld_diag): Likewise.
16616 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
16618 * config/rs6000/rs6000-builtin-new.def: New.
16619 * config/rs6000/rs6000-overload.def: New.
16621 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
16623 * config/rs6000/rs6000-gen-builtins.c: New.
16625 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
16627 * Makefile.in (EXTRA_GTYPE_DEPS): New variable.
16628 (s-gtype): Depend on EXTRA_GTYPE_DEPS.
16629 * gengtype-state.c (state_writer::write_state_file_list): Add a
16630 parameter to the fileslist expression for the number of build
16632 (read_state_files_list): Detect build headers and strip the
16633 initial "./" or ".\" from their names.
16634 * gengtype.c (build_headers): New global variable.
16635 (num_build_headers): Likewise.
16636 (open_base_files): Emit #include for each build header.
16637 (main): Detect and count build headers.
16638 * gengtype.h (build_headers): New extern variable.
16639 (num_build_headers): Likewise.
16641 2021-07-16 Richard Biener <rguenther@suse.de>
16643 * gimple-ssa-store-merging.c (verify_symbolic_number_p): Use
16644 the type of the LHS.
16645 (find_bswap_or_nop_1): Likewise.
16646 (find_bswap_or_nop): Likewise.
16647 * tree-vectorizer.h (vect_get_smallest_scalar_type): Adjust
16649 * tree-vect-data-refs.c (vect_get_smallest_scalar_type):
16650 Remove unused parameters, pass in the scalar type. Fix
16651 internal store function handling.
16652 * tree-vect-stmts.c (vect_analyze_stmt): Remove assert.
16653 (vect_get_vector_types_for_stmt): Move down check for
16654 existing vector stmt after we've determined a scalar type.
16655 Pass down the used scalar type to vect_get_smallest_scalar_type.
16656 * tree-vect-generic.c (expand_vector_condition): Use
16657 the type of the LHS.
16658 (expand_vector_scalar_condition): Likewise.
16659 (expand_vector_operations_1): Likewise.
16660 * tree-vect-patterns.c (vect_widened_op_tree): Likewise.
16661 (vect_recog_dot_prod_pattern): Likewise.
16662 (vect_recog_sad_pattern): Likewise.
16663 (vect_recog_widen_op_pattern): Likewise.
16664 (vect_recog_widen_sum_pattern): Likewise.
16665 (vect_recog_mixed_size_cond_pattern): Likewise.
16667 2021-07-16 Jan Hubicka <hubicka@ucw.cz>
16669 * ipa-modref.c (struct escape_entry): Use eaf_fleags_t.
16670 (dump_eaf_flags): Dump EAF_NOT_RETURNED
16671 (eaf_flags_useful_p): Use eaf_fleags_t; handle const functions
16672 and EAF_NOT_RETURNED.
16673 (modref_summary::useful_p): Likewise.
16674 (modref_summary_lto::useful_p): Likewise.
16675 (struct) modref_summary_lto: Use eaf_fleags_t.
16676 (deref_flags): Handle EAF_NOT_RETURNED.
16677 (struct escape_point): Use min_flags.
16678 (modref_lattice::init): Add EAF_NOT_RETURNED.
16679 (merge_call_lhs_flags): Ignore EAF_NOT_RETURNED functions
16680 (analyze_ssa_name_flags): Clear EAF_NOT_RETURNED on return;
16682 (analyze_parms): Also analyze const functions; update conition on
16684 (modref_write): Update streaming.
16685 (read_section): Update streaming.
16686 (remap_arg_flags): Use eaf_flags_t.
16687 (modref_merge_call_site_flags): Hanlde EAF_NOT_RETURNED.
16688 * ipa-modref.h: (eaf_flags_t): New typedef.
16689 (struct modref_summary): Use eaf_flags_t.
16690 * tree-core.h (EAF_NOT_RETURNED): New constant.
16692 2021-07-16 Richard Biener <rguenther@suse.de>
16694 * gimple-fold.c (gimple_fold_stmt_to_constant_1): Use
16695 the type of the LHS.
16696 (gimple_assign_nonnegative_warnv_p): Likewise.
16697 (gimple_call_nonnegative_warnv_p): Likewise. Return false
16698 if the call has no LHS.
16699 * gimple.c (gimple_could_trap_p_1): Use the type of the LHS.
16700 * tree-eh.c (stmt_could_throw_1_p): Likewise.
16701 * tree-inline.c (insert_init_stmt): Likewise.
16702 * tree-ssa-loop-niter.c (get_val_for): Likewise.
16703 * tree-outof-ssa.c (ssa_is_replaceable_p): Use the type of
16705 * tree-ssa-sccvn.c (init_vn_nary_op_from_stmt): Take a
16706 gassign *. Use the type of the lhs.
16707 (vn_nary_op_lookup_stmt): Adjust.
16708 (vn_nary_op_insert_stmt): Likewise.
16710 2021-07-16 Ilya Leoshkevich <iii@linux.ibm.com>
16712 * config/s390/predicates.md (bras_sym_operand): Accept all
16713 functions in 64-bit mode, use UNSPEC_PLT31.
16714 (larl_operand): Use UNSPEC_PLT31.
16715 * config/s390/s390.c (s390_loadrelative_operand_p): Likewise.
16716 (legitimize_pic_address): Likewise.
16717 (s390_emit_tls_call_insn): Mark __tls_get_offset as function,
16719 (s390_delegitimize_address): Use UNSPEC_PLT31.
16720 (s390_output_addr_const_extra): Likewise.
16721 (print_operand): Add @PLT to TLS calls, handle %K.
16722 (s390_function_profiler): Mark __fentry__/_mcount as function,
16723 use %K, use UNSPEC_PLT31.
16724 (s390_output_mi_thunk): Use only UNSPEC_GOT, use %K.
16725 (s390_emit_call): Use UNSPEC_PLT31.
16726 (s390_emit_tpf_eh_return): Mark __tpf_eh_return as function.
16727 * config/s390/s390.md (UNSPEC_PLT31): Rename from UNSPEC_PLT.
16728 (*movdi_64): Use %K.
16729 (reload_base_64): Likewise.
16730 (*sibcall_brc): Likewise.
16731 (*sibcall_brcl): Likewise.
16732 (*sibcall_value_brc): Likewise.
16733 (*sibcall_value_brcl): Likewise.
16735 (*brasl): Likewise.
16736 (*bras_r): Likewise.
16737 (*brasl_r): Likewise.
16738 (*bras_tls): Likewise.
16739 (*brasl_tls): Likewise.
16740 (main_base_64): Likewise.
16741 (reload_base_64): Likewise.
16742 (@split_stack_call<mode>): Likewise.
16744 2021-07-16 Richard Biener <rguenther@suse.de>
16746 PR tree-optimization/101467
16747 * tree-vect-stmts.c (vect_gen_while): Properly guard
16748 make_temp_ssa_name usage.
16750 2021-07-16 Cooper Qu <cooper.qu@linux.alibaba.com>
16752 * config.gcc: Don't use forked print-sysroot-suffix.sh and
16753 t-sysroot-suffix for C-SKY.
16754 * config/csky/print-sysroot-suffix.sh: Delete.
16755 * config/csky/t-csky-linux: Delete.
16756 * config/csky/t-sysroot-suffix: Define MULTILIB_DIRNAMES
16757 instead of CSKY_MULTILIB_DIRNAMES.
16759 2021-07-16 Richard Biener <rguenther@suse.de>
16761 * tree-vect-loop.c (vect_transform_cycle_phi): Correct sign
16762 conversion issues with the partial reduction of the reused
16763 vector accumulator.
16765 2021-07-16 Richard Biener <rguenther@suse.de>
16767 * config/i386/i386-options.c (ix86_option_override_internal): Set
16768 param_vect_partial_vector_usage to zero if not set.
16770 2021-07-15 Uroš Bizjak <ubizjak@gmail.com>
16773 * config/i386/i386.h (VALID_SSE_REG_MODE): Add TDmode.
16774 (VALID_INT_MODE_P): Add SDmode and DDmode.
16775 Add TDmode for TARGET_64BIT.
16776 (VALID_DFP_MODE_P): Remove.
16777 * config/i386/i386.c (ix86_hard_regno_mode_ok):
16778 Do not use VALID_DFP_MODE_P.
16780 2021-07-15 Andrew MacLeod <amacleod@redhat.com>
16782 * gimple-range-fold.cc (adjust_pointer_diff_expr): Use
16784 (fold_using_range::fold_stmt): Ditto.
16785 (fold_using_range::range_of_range_op): Ditto.
16786 (fold_using_range::range_of_phi): Ditto.
16787 (fold_using_range::range_of_call): Ditto.
16788 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
16789 (fold_using_range::range_of_builtin_call): Ditto.
16790 (fold_using_range::range_of_cond_expr): Ditto.
16791 * gimple-range-fold.h (gimple_range_type): New.
16793 2021-07-15 Martin Sebor <msebor@redhat.com>
16795 PR middle-end/97027
16796 * tree-ssa-strlen.c (handle_assign): New function.
16797 (maybe_warn_overflow): Add argument.
16798 (nonzero_bytes_for_type): New function.
16799 (count_nonzero_bytes): Handle more tree types. Call
16800 nonzero_bytes_for_tye.
16801 (count_nonzero_bytes): Handle types.
16802 (handle_store): Handle stores from function calls.
16803 (strlen_check_and_optimize_call): Move code to handle_assign. Call
16804 it for assignments from function calls.
16806 2021-07-15 David Malcolm <dmalcolm@redhat.com>
16811 * doc/invoke.texi: Add -Wanalyzer-use-of-uninitialized-value.
16813 2021-07-15 David Malcolm <dmalcolm@redhat.com>
16815 * doc/invoke.texi (-fdump-analyzer-exploded-paths): New.
16817 2021-07-15 Martin Sebor <msebor@redhat.com>
16821 * fold-const.c (operand_compare::operand_equal_p): Handle OEP_DECL_NAME.
16822 (operand_compare::verify_hash_value): Same.
16823 * tree-core.h (OEP_DECL_NAME): New.
16825 2021-07-15 Martin Jambor <mjambor@suse.cz>
16827 * profile-count.h (profile_count::value): Change the return type to
16829 * gimple-pretty-print.c (dump_gimple_bb_header): Adjust print
16831 * tree-cfg.c (dump_function_to_file): Likewise.
16833 2021-07-15 Bill Schmidt <wschmidt@linux.ibm.com>
16836 * config/rs6000/rs6000-p8swap.c (has_part_mult): New.
16837 (rs6000_analyze_swaps): Insns containing a subreg of a mult are
16840 2021-07-15 Richard Biener <rguenther@suse.de>
16842 * tree-vectorizer.h (vect_gen_while): Match up with
16843 vect_gen_while_not.
16844 * tree-vect-stmts.c (vect_gen_while): Adjust API to that
16845 of vect_gen_while_not.
16846 (vect_gen_while_not): Adjust.
16847 * tree-vect-loop-manip.c (vect_set_loop_controls_directly): Likewise.
16849 2021-07-15 Aldy Hernandez <aldyh@redhat.com>
16851 * gimple-range-cache.cc (non_null_ref::adjust_range): New.
16852 (ranger_cache::range_of_def): Call adjust_range.
16853 (ranger_cache::entry_range): Same.
16854 * gimple-range-cache.h (non_null_ref::adjust_range): New.
16855 * gimple-range.cc (gimple_ranger::range_of_expr): Call
16857 (gimple_ranger::range_on_entry): Same.
16859 2021-07-15 Tamar Christina <tamar.christina@arm.com>
16862 2021-07-14 Tamar Christina <tamar.christina@arm.com>
16864 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
16866 2021-07-15 Tamar Christina <tamar.christina@arm.com>
16869 2021-07-14 Tamar Christina <tamar.christina@arm.com>
16871 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
16872 (sdot_prod, udot_prod): ...These.
16873 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
16874 (aarch64_<sur>dot<vsi2qi>): Rename to...
16875 (<sur>dot_prod<vsi2qi>): ...This.
16876 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
16879 2021-07-15 Jakub Jelinek <jakub@redhat.com>
16881 PR middle-end/101437
16882 * gimplify.c (gimplify_expr): Throw away volatile reads from empty
16883 types even if they have non-BLKmode TYPE_MODE.
16885 2021-07-15 Richard Biener <rguenther@suse.de>
16888 * gcc.c (process_command): Process -gtoggle like process_options
16889 would after parsing options.
16891 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
16893 * cfgexpand.c (expand_asm_loc): Adjust.
16894 (expand_asm_stmt): Likewise.
16895 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Likewise.
16896 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
16897 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
16898 * config/avr/avr.c (avr_md_asm_adjust): Likewise.
16899 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
16900 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
16901 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
16902 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
16903 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
16904 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
16905 * config/s390/s390.c (s390_md_asm_adjust): Likewise.
16906 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
16907 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
16908 * doc/tm.texi: Regenerate.
16909 * target.def: Add location argument to md_asm_adjust.
16911 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
16913 * tree-diagnostic.c (diagnostic_report_current_function): Use the
16914 diagnostic's location, not input_location.
16916 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
16918 * cfgexpand.c (tree_conflicts_with_clobbers_p): Pass location to
16920 (expand_asm_stmt): Likewise.
16922 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
16924 * config/rs6000/rs6000.c (adjacent_mem_locations): Return the lower
16925 addressed memory rtx, if any.
16926 (rs6000_split_multireg_move): Fix code formatting.
16927 Handle MMA build built-ins with operands in adjacent memory locations.
16929 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
16931 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Move to later
16934 2021-07-14 Jason Merrill <jason@redhat.com>
16936 * sel-sched-ir.h (get_all_loop_exits): Use auto_vec.
16938 2021-07-14 Jason Merrill <jason@redhat.com>
16940 * doc/invoke.texi: -fdelete-dead-exceptions is on by default for
16943 2021-07-14 Tamar Christina <tamar.christina@arm.com>
16945 * tree-vect-patterns.c (vect_recog_dot_prod_pattern):
16946 Remove erroneous line.
16948 2021-07-14 Andrew MacLeod <amacleod@redhat.com>
16950 * params.opt (param_evrp_mode): Change default.
16952 2021-07-14 Tamar Christina <tamar.christina@arm.com>
16954 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
16955 (sdot_prod, udot_prod): ...These.
16956 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
16957 (aarch64_<sur>dot<vsi2qi>): Rename to...
16958 (<sur>dot_prod<vsi2qi>): ...This.
16959 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
16962 2021-07-14 Tamar Christina <tamar.christina@arm.com>
16964 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
16966 2021-07-14 Tamar Christina <tamar.christina@arm.com>
16968 * doc/sourcebuild.texi (arm_v8_2a_i8mm_neon_hw): Document.
16970 2021-07-14 Tamar Christina <tamar.christina@arm.com>
16972 * config/arm/neon.md (usdot_prod<vsi2qi>): New.
16974 2021-07-14 Tamar Christina <tamar.christina@arm.com>
16976 * config/aarch64/aarch64-simd.md (aarch64_usdot<vsi2qi>): Rename to...
16977 (usdot_prod<vsi2qi>): ... This.
16978 * config/aarch64/aarch64-simd-builtins.def (usdot): Rename to...
16979 (usdot_prod): ...This.
16980 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Likewise.
16981 * config/aarch64/aarch64-sve.md (@aarch64_<sur>dot_prod<vsi2qi>):
16983 (@<sur>dot_prod<vsi2qi>): ...This.
16984 * config/aarch64/aarch64-sve-builtins-base.cc
16985 (svusdot_impl::expand): Use it.
16987 2021-07-14 Tamar Christina <tamar.christina@arm.com>
16989 * optabs.def (usdot_prod_optab): New.
16990 * doc/md.texi: Document it and clarify other dot prod optabs.
16991 * optabs-tree.h (enum optab_subtype): Add optab_vector_mixed_sign.
16992 * optabs-tree.c (optab_for_tree_code): Support usdot_prod_optab.
16993 * optabs.c (expand_widen_pattern_expr): Likewise.
16994 * tree-cfg.c (verify_gimple_assign_ternary): Likewise.
16995 * tree-vect-loop.c (vectorizable_reduction): Query dot-product kind.
16996 * tree-vect-patterns.c (vect_supportable_direct_optab_p): Take optional
16998 (vect_widened_op_tree): Optionally ignore
17000 (vect_recog_dot_prod_pattern): Support usdot_prod_optab.
17002 2021-07-14 H.J. Lu <hjl.tools@gmail.com>
17005 * config/i386/driver-i386.c (host_detect_local_cpu): Check
17006 "arch [32|64]" and "tune [32|64]" for 32-bit and 64-bit codegen.
17007 Enable UINTR only for 64-bit codegen.
17008 * config/i386/i386-options.c
17009 (ix86_option_override_internal::DEF_PTA): Skip PTA_UINTR if not
17011 * config/i386/i386.h (ARCH_ARG): New.
17012 (CC1_CPU_SPEC): Pass "[arch|tune] 32" for 32-bit codegen and
17013 "[arch|tune] 64" for 64-bit codegen.
17015 2021-07-14 Richard Biener <rguenther@suse.de>
17017 PR tree-optimization/101445
17018 * tree-vect-stmts.c (vectorizable_load): Do the gap adjustment
17019 of the IV in the correct direction for negative stride
17022 2021-07-14 Jakub Jelinek <jakub@redhat.com>
17025 * godump.c (godump_str_hash): New type.
17026 (godump_container::pot_dummy_types): Use string_hash instead of
17027 ptr_hash in the hash_set.
17029 2021-07-14 Richard Biener <rguenther@suse.de>
17031 * tree-vect-loop.c (vect_find_reusable_accumulator): Handle
17032 vector types where the old vector type has a multiple of
17033 the new vector type elements.
17034 (vect_create_partial_epilog): New function, split out from...
17035 (vect_create_epilog_for_reduction): ... here.
17036 (vect_transform_cycle_phi): Reduce the re-used accumulator
17037 to the new vector type.
17039 2021-07-14 Alexandre Oliva <oliva@adacore.com>
17041 * tree-ssa-alias.c (attr_fnspec::verify): Fix index in
17042 non-'t'-sized arg check.
17044 2021-07-14 Alexandre Oliva <oliva@adacore.com>
17046 * tree-cfg.c (cleanup_dead_labels_eh): Update
17047 post_landing_pad label upon change of landing pad block's
17049 (cleanup_dead_labels): Check that a removed label is not that
17052 2021-07-13 Jonathan Wright <jonathan.wright@arm.com>
17054 * combine.c (combine_simplify_rtx): Add vec_select -> subreg
17056 * config/aarch64/aarch64.md (*zero_extend<SHORT:mode><GPI:mode>2_aarch64):
17057 Add Neon to general purpose register case for zero-extend
17059 * config/arm/vfp.md (*arm_movsi_vfp): Remove "*" from *t -> r
17060 case to prevent some cases opting to go through memory.
17061 * cse.c (fold_rtx): Add vec_select -> subreg simplification.
17062 * rtl.c (rtvec_series_p): Define predicate to determine
17063 whether a vector contains a linear series of integers.
17064 * rtl.h (rtvec_series_p): Define.
17065 * rtlanal.c (vec_series_lowpart_p): Define predicate to
17066 determine if a vector selection is equivalent to the low part
17068 * rtlanal.h (vec_series_lowpart_p): Define.
17069 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
17070 Add vec_select -> subreg simplification.
17072 2021-07-13 Paul A. Clarke <pc@us.ibm.com>
17074 * config/rs6000/smmintrin.h (_mm_testz_si128, _mm_testc_si128,
17075 _mm_testnzc_si128, _mm_test_all_ones, _mm_test_all_zeros,
17076 _mm_test_mix_ones_zeros): New.
17078 2021-07-13 Roger Sayle <roger@nextmovesoftware.com>
17079 Richard Biener <rguenther@suse.de>
17081 * gimple.c (gimple_could_trap_p_1): Make S argument a
17082 "const gimple*". Preserve constness in call to
17083 gimple_asm_volatile_p.
17084 (gimple_could_trap_p): Make S argument a "const gimple*".
17085 * gimple.h (gimple_could_trap_p_1, gimple_could_trap_p):
17086 Update function prototypes.
17088 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17090 * tree-vectorizer.h (vect_reusable_accumulator): New structure.
17091 (_loop_vec_info::main_loop_edge): New field.
17092 (_loop_vec_info::skip_main_loop_edge): Likewise.
17093 (_loop_vec_info::skip_this_loop_edge): Likewise.
17094 (_loop_vec_info::reusable_accumulators): Likewise.
17095 (_stmt_vec_info::reduc_scalar_results): Likewise.
17096 (_stmt_vec_info::reused_accumulator): Likewise.
17097 (vect_get_main_loop_result): Declare.
17098 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize
17099 reduc_scalar_inputs.
17100 (vec_info::free_stmt_vec_info): Free reduc_scalar_inputs.
17101 * tree-vect-loop-manip.c (vect_get_main_loop_result): New function.
17102 (vect_do_peeling): Fill an epilogue loop's main_loop_edge,
17103 skip_main_loop_edge and skip_this_loop_edge fields.
17104 * tree-vect-loop.c (INCLUDE_ALGORITHM): Define.
17105 (vect_emit_reduction_init_stmts): New function.
17106 (get_initial_def_for_reduction): Use it.
17107 (get_initial_defs_for_reduction): Likewise. Change the vinfo
17108 parameter to a loop_vec_info.
17109 (vect_create_epilog_for_reduction): Store the scalar results
17110 in the reduc_info. If an epilogue loop is reusing an accumulator
17111 from the main loop, and if the epilogue loop can also be skipped,
17112 try to place the reduction code in the join block. Record
17113 accumulators that could potentially be reused by epilogue loops.
17114 (vect_transform_cycle_phi): When vectorizing epilogue loops,
17115 try to reuse accumulators from the main loop. Record the initial
17116 value in reduc_info for non-SLP reductions too.
17118 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17120 * tree-vect-loop.c (get_initial_def_for_reduction): Remove
17121 adjustment handling. Take the neutral value as an argument,
17122 in place of the code argument.
17123 (vect_transform_cycle_phi): Update accordingly. Handle the
17124 initial values of cond reductions separately from code reductions.
17125 Choose the adjustment here rather than in
17126 get_initial_def_for_reduction. Sink the splat of vec_initial_def.
17128 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17130 * tree-vect-loop.c (neutral_op_for_slp_reduction): Replace with...
17131 (neutral_op_for_reduction): ...this, providing a more general
17133 (vect_create_epilog_for_reduction): Update accordingly.
17134 (vectorizable_reduction): Likewise.
17135 (vect_transform_cycle_phi): Likewise.
17137 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17139 * tree-vect-loop.c (get_initial_def_for_reduction): Take the
17140 reduc_info instead of the original stmt_vec_info.
17141 (vect_transform_cycle_phi): Update accordingly.
17143 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17145 * tree-vect-loop.c (get_initial_defs_for_reduction): Take the
17146 reduc_info as an additional parameter.
17147 (vect_transform_cycle_phi): Update accordingly.
17149 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17151 * tree-vectorizer.h: Include tree-ssa-operands.h.
17152 (vect_phi_initial_value): New function.
17153 * tree-vect-loop.c (neutral_op_for_slp_reduction): Use it.
17154 (get_initial_defs_for_reduction, info_for_reduction): Likewise.
17155 (vect_create_epilog_for_reduction, vectorizable_reduction): Likewise.
17156 (vect_transform_cycle_phi, vectorizable_induction): Likewise.
17158 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17160 * tree-vect-loop.c (vect_create_epilog_for_reduction): Convert
17161 the phi results to vectype after creating them. Remove later
17162 conversion code that thus becomes redundant.
17164 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17166 * tree-vect-loop.c (vect_create_epilog_for_reduction): Replace
17167 the new_phis vector with a reduc_inputs vector. Combine handling
17168 of reduction chains and ncopies > 1.
17170 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17172 * tree-vect-loop.c (vect_create_epilog_for_reduction): Truncate
17173 scalar_results to group_size elements after reducing down from
17174 N*group_size elements. Construct an array_slice of the live-out
17175 stmts and assert that there is one stmt per scalar result.
17177 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17179 * tree-vect-loop.c (vect_create_epilog_for_reduction): Remove
17180 nested_in_vect_loop and use double_reduc everywhere. Remove dead
17181 assignment to "loop".
17183 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
17185 * internal-fn.c (vectorized_internal_fn_supported_p): Handle
17186 vector types first. For scalar types, consider both the preferred
17187 vector mode and the alternative vector modes.
17188 * optabs-query.c (can_vec_mask_load_store_p): Use the same
17189 structure as above, in particular using related_vector_mode
17190 for modes provided by autovectorize_vector_modes.
17192 2021-07-13 Jakub Jelinek <jakub@redhat.com>
17193 Richard Biener <rguenther@suse.de>
17195 PR tree-optimization/101419
17196 * tree-pass.h (PROP_objsz): Define.
17197 (make_pass_early_object_sizes): Declare.
17198 * passes.def (pass_all_early_optimizations): Rename pass_object_sizes
17199 there to pass_early_object_sizes, drop parameter.
17200 (pass_all_optimizations): Move pass_object_sizes right after pass_ccp,
17201 drop parameter, move pass_post_ipa_warn right after that.
17202 * tree-object-size.c (pass_object_sizes::execute): Rename to...
17203 (object_sizes_execute): ... this. Add insert_min_max_p argument.
17204 (pass_data_object_sizes): Move after object_sizes_execute.
17205 (pass_object_sizes): Likewise. In execute method call
17206 object_sizes_execute, drop set_pass_param method and insert_min_max_p
17207 non-static data member and its initializer in the ctor.
17208 (pass_data_early_object_sizes, pass_early_object_sizes,
17209 make_pass_early_object_sizes): New.
17210 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Use
17211 (cfun->curr_properties & PROP_objsz) instead of cfun->after_inlining.
17213 2021-07-13 Kito Cheng <kito.cheng@sifive.com>
17216 * config/riscv/constraints.md ("S"): Update description and remove
17218 * doc/md.texi (Machine Constraints): Document the 'S' constraints
17221 2021-07-13 Richard Biener <rguenther@suse.de>
17224 2021-07-12 Richard Biener <rguenther@suse.de>
17226 * tree-vect-slp.c (vect_slp_region): Show the number of
17227 SLP graph entries in the optimization message.
17229 2021-07-13 Michael Meissner <meissner@linux.ibm.com>
17231 * config/rs6000/altivec.md (xxspltiw_v4sf): Change local variable
17233 * config/rs6000/rs6000-protos.h (rs6000_const_f32_to_i32): Change
17234 return type to long.
17235 * config/rs6000/rs6000.c (rs6000_const_f32_to_i32): Change return
17238 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
17240 * gimple-range-fold.cc (fold_using_range::range_of_builtin_ubsan_call):
17241 Query relation between the 2 operands and use it.
17243 2021-07-12 Sergei Trofimovich <siarheit@google.com>
17245 * doc/cfg.texi: Fix s/ei_safe_safe/ei_safe_edge/ typo.
17247 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
17250 * config/i386/predicates.md (vec_setm_sse41_operand):
17251 Rename from vec_setm_operand.
17252 (vec_setm_avx2_operand): New predicate.
17253 * config/i386/sse.md (vec_set<V_128:mode>): Use V_128 mode iterator.
17254 Use vec_setm_sse41_operand as operand 2 predicate.
17255 (vec_set<V_256_512:mode): New expander.
17256 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_sse41_operand
17257 as operand 2 predicate.
17259 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
17261 PR tree-optimization/101335
17262 * range-op.cc (operator_cast::lhs_op1_relation): Delete.
17264 2021-07-12 Andrew Pinski <apinski@marvell.com>
17266 * tree-ssa-phiopt.c (match_simplify_replacement): Move
17267 insert of the sequence before the movement of the
17268 statement. Check if to see if the statement is used
17269 outside of the original phi to see if we should move it.
17271 2021-07-12 Richard Biener <rguenther@suse.de>
17273 * dump-context.h (debug_dump_context::debug_dump_context):
17274 Add FILE * parameter defaulted to stderr.
17275 * dumpfile.c (debug_dump_context::debug_dump_context): Adjust.
17276 * tree-vect-slp.c (dot_slp_tree): New functions.
17278 2021-07-12 Richard Biener <rguenther@suse.de>
17280 PR tree-optimization/101373
17281 * tree-ssa-pre.c (prune_clobbered_mems): Also prune trapping
17282 references when the BB may not return.
17283 (compute_avail): Pass in the function we're working on and
17284 replace cfun references with it. Externally throwing
17285 const calls also possibly terminate the function.
17286 (pass_pre::execute): Pass down the function we're working on.
17287 * gcse.c (compute_hash_table_work): Externally throwing
17288 const/pure calls also need record_last_mem_set_info.
17289 * postreload-gcse.c (record_opr_changes): Looping or externally
17290 throwing const/pure calls also need record_last_mem_set_info.
17292 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
17294 * recog.c (memory_address_addr_space_p): Change the type to bool.
17295 Return true/false instead of 1/0.
17296 (offsettable_memref_p): Ditto.
17297 (offsettable_nonstrict_memref_p): Ditto.
17298 (offsettable_address_addr_space_p): Ditto.
17299 Change the type of addressp indirect function to bool.
17300 * recog.h (memory_address_addr_space_p): Change the type to bool.
17301 (strict_memory_address_addr_space_p): Ditto.
17302 (offsettable_memref_p): Ditto.
17303 (offsettable_nonstrict_memref_p): Ditto.
17304 (offsettable_address_addr_space_p): Ditto.
17305 * reload.c (maybe_memory_address_addr_space_p): Ditto.
17306 (strict_memory_address_addr_space_p): Change the type to bool.
17307 Return true/false instead of 1/0.
17308 (maybe_memory_address_addr_space_p): Change the type to bool.
17310 2021-07-12 Richard Biener <rguenther@suse.de>
17312 * tree-vect-slp.c (vect_slp_region): Show the number of
17313 SLP graph entries in the optimization message.
17315 2021-07-12 Richard Biener <rguenther@suse.de>
17317 PR tree-optimization/101394
17318 * tree-ssa-pre.c (do_pre_regular_insertion): Avoid inserting
17319 copies from abnormals for a full redundancy.
17321 2021-07-12 Richard Biener <rguenther@suse.de>
17323 PR middle-end/101423
17324 * gimple.c (gimple_could_trap_p_1): Internal function calls
17326 * tree-eh.c (tree_could_trap_p): Likewise.
17328 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
17331 * config/arm/arm_neon.h (vmul_n_u32): Replace call to builtin with
17333 (vmulq_n_u32): Likewise.
17334 (vmul_n_f32): Gate __a * __b on __FAST_MATH__.
17335 (vmulq_n_f32): Likewise.
17336 (vmul_n_f16): Likewise.
17337 (vmulq_n_f16): Likewise.
17339 2021-07-12 Martin Liska <mliska@suse.cz>
17341 PR sanitizer/101425
17342 * gcc.c (check_offload_target_name): Call
17343 candidates_list_and_hint only if we have a candidate.
17345 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
17348 * config/arm/neon.md (vec_init): Move to ...
17349 * config/arm/vec-common.md (vec_init): ... here.
17350 Change the pattern's mode to VDQX and gate it on VALID_MVE_MODE.
17352 2021-07-12 Roger Sayle <roger@nextmovesoftware.com>
17354 PR tree-optimization/101403
17355 * match.pd ((T)bswap(X)>>C): Correctly handle cases where
17356 signedness of the shift is not the same as the signedness of
17357 the type extension.
17359 2021-07-09 Roger Sayle <roger@nextmovesoftware.com>
17360 Uroš Bizjak <ubizjak@gmail.com>
17362 * config/i386/i386.md (*divmodsi4_const): Optimize SImode
17363 divmod of a constant numerator with new define_insn_and_split.
17365 2021-07-09 Iain Sandoe <iain@sandoe.co.uk>
17368 * config/i386/i386-expand.c (ix86_expand_call): If a call is
17369 to a non-local-binding, or local but to a public symbol, then
17370 assume that it might be indirected via the lazy symbol binder.
17371 Mark R10 and R10 as clobbered in that case.
17373 2021-07-09 Eric Botcazou <ebotcazou@adacore.com>
17376 * gcc.c (ASM_DEBUG_DWARF_OPTION): Set again to --gdwarf2 in
17377 the case where HAVE_AS_WORKING_DWARF_N_FLAG is not defined
17378 and HAVE_LD_BROKEN_PE_DWARF5 is defined.
17380 2021-07-09 Uroš Bizjak <ubizjak@gmail.com>
17382 * config/i386/i386.md (*udivmodsi4_pow2_zext_1): Limit the
17383 log2 range of operands[3] to [1,31].
17384 (*udivmodsi4_pow2_zext_2): Ditto. Correct insn RTX pattern.
17386 2021-07-09 Sergei Trofimovich <siarheit@google.com>
17388 * doc/md.texi: Don't split @smallexample in multiple @groups.
17390 2021-07-09 Sergei Trofimovich <siarheit@google.com>
17392 * doc/md.texi: Add missing 'see' word.
17394 2021-07-09 Andrew Pinski <apinski@marvell.com>
17396 * tree-ssa-phiopt.c (phiopt_early_allow): Change arguments
17397 to take sequence and gimple_match_op. Accept the case where
17398 op is a SSA_NAME and one statement in the sequence.
17399 Also allow constants.
17400 (gimple_simplify_phiopt): Always pass a sequence to resimplify.
17401 Update call to phiopt_early_allow. Discard the sequence if not
17404 2021-07-09 Xi Ruoyao <xry111@mengyan1223.wang>
17409 * config/mips/mips.c (mips_const_insns): Use MSA_SUPPORTED_MODE_P
17410 instead of ISA_HAS_MSA.
17411 (mips_expand_vec_unpack): Likewise.
17412 (mips_expand_vector_init): Likewise.
17414 2021-07-09 Kewen Lin <linkw@linux.ibm.com>
17416 * config/rs6000/vsx.md (mods_<mode>): Rename to...
17417 (mod<mode>3): ... this.
17418 (modu_<mode>): Rename to...
17419 (umod<mode>3): ... this.
17420 * config/rs6000/rs6000-builtin.def (MODS_V2DI, MODS_V4SI, MODU_V2DI,
17421 MODU_V4SI): Adjust.
17423 2021-07-08 Jeff Law <jeffreyalaw@gmail.com>
17425 * config/h8300/shiftrotate.md (variable shifts): Expose condition
17426 code handling for the test before the loop.
17428 2021-07-08 Martin Jambor <mjambor@suse.cz>
17431 * ipa-sra.c (class isra_call_summary): New member
17432 m_before_any_store, initialize it in the constructor.
17433 (isra_call_summary::dump): Dump the new field.
17434 (ipa_sra_call_summaries::duplicate): Copy it.
17435 (process_scan_results): Set it.
17436 (isra_write_edge_summary): Stream it.
17437 (isra_read_edge_summary): Likewise.
17438 (param_splitting_across_edge): Only override
17439 safe_to_import_accesses if m_before_any_store is set.
17441 2021-07-08 Martin Sebor <msebor@redhat.com>
17443 PR bootstrap/101374
17444 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref):
17445 Use Object Size Type 0 instead of 1.
17447 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
17449 * tree-vect-loop.c (vectorizable_reduction): Remove always-true
17452 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
17454 * match.pd: Simplify an extend-operate-truncate sequence involving
17457 2021-07-08 Roger Sayle <roger@nextmovesoftware.com>
17458 Richard Biener <rguenther@suse.de>
17460 PR tree-optimization/40210
17461 * match.pd (bswap optimizations): Simplify (bswap(x)>>C1)&C2 as
17462 (x>>C3)&C2 when possible. Simplify bswap(x)>>C1 as ((T)x)>>C2
17463 when possible. Simplify bswap(x)&C1 as (x>>C2)&C1 when 0<=C1<=255.
17465 2021-07-08 Uroš Bizjak <ubizjak@gmail.com>
17468 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
17470 * config/i386/mmx.md (V_32): New mode iterator.
17471 (mov<V_32:mode>): Use V_32 mode iterator.
17472 (*mov<V_32:mode>_internal): Ditto.
17473 (*push<V_32:mode>2_rex64): Ditto.
17474 (*push<V_32:mode>2): Ditto.
17475 (movmisalign<V_32:mode>): Ditto.
17476 (mmx_<any_shiftrt:insn>v1si3): New insn pattern.
17477 (sse4_1_<any_extend:code>v2qiv2hi2): Ditto.
17478 (vec_unpacks_lo_v4qi): New expander.
17479 (vec_unpacks_hi_v4qi): Ditto.
17480 (vec_unpacku_lo_v4qi): Ditto.
17481 (vec_unpacku_hi_v4qi): Ditto.
17482 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add V1SImode.
17483 (VALID_INT_MODE_P): Ditto.
17485 2021-07-08 Michael Meissner <meissner@linux.ibm.com>
17488 * config/rs6000/rs6000.md (udivti3): New insn.
17489 (divti3): New insn.
17490 (umodti3): New insn.
17491 (modti3): New insn.
17493 2021-07-07 Martin Sebor <msebor@redhat.com>
17495 PR tree-optimization/100137
17496 PR tree-optimization/99121
17497 PR tree-optimization/97027
17498 * builtins.c (access_ref::access_ref): Also set offmax.
17499 (access_ref::offset_in_range): Define new function.
17500 (access_ref::add_offset): Set offmax.
17501 (access_ref::inform_access): Handle access_none.
17502 (handle_mem_ref): Clear ostype.
17503 (compute_objsize_r): Handle ASSERT_EXPR.
17504 * builtins.h (struct access_ref): Add offmax member.
17505 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Use
17506 compute_objsize() and simplify.
17508 2021-07-07 Peter Bergner <bergner@linux.ibm.com>
17510 * config/rs6000/rs6000-call.c (mma_init_builtins): Use VSX_BUILTIN_LXVP
17511 and VSX_BUILTIN_STXVP.
17513 2021-07-07 Martin Sebor <msebor@redhat.com>
17516 * config/aarch64/aarch64.c (aarch64_simd_lane_bounds): Remove
17517 a stray %K from error_at() missed in r12-2088.
17519 2021-07-07 Richard Biener <rguenther@suse.de>
17521 PR tree-optimization/99728
17522 * tree-ssa-loop-im.c (gather_mem_refs_stmt): Record
17524 (mem_refs_may_alias_p): Add assert we handled aggregate
17526 (sm_seq_valid_bb): Give up when running into aggregate copies.
17527 (ref_indep_loop_p): Handle aggregate copies as never
17528 being invariant themselves but allow other refs to be
17529 disambiguated against them.
17530 (can_sm_ref_p): Do not try to apply store-motion to aggregate
17533 2021-07-06 Indu Bhagat <indu.bhagat@oracle.com>
17536 * dwarf2ctf.c (ctf_get_AT_data_member_location): Multiply by 8 to get
17539 2021-07-06 Martin Sebor <msebor@redhat.com>
17541 * gimple-pretty-print.c (percent_G_format): Remove.
17542 * tree-diagnostic.c (default_tree_printer): Remove calls.
17543 * tree-pretty-print.c (percent_K_format): Remove.
17544 * tree-pretty-print.h (percent_K_format): Remove.
17546 2021-07-06 Martin Sebor <msebor@redhat.com>
17548 * config/aarch64/aarch64-builtins.c (aarch64_simd_expand_builtin):
17549 Remove %K and use error_at.
17550 (aarch64_expand_fcmla_builtin): Same.
17551 (aarch64_expand_builtin_tme): Same.
17552 (aarch64_expand_builtin_memtag): Same.
17553 * config/arm/arm-builtins.c (arm_expand_acle_builtin): Same.
17554 (arm_expand_builtin): Same.
17555 * config/arm/arm.c (bounds_check): Same.
17557 2021-07-06 Martin Sebor <msebor@redhat.com>
17559 * builtins.c (warn_string_no_nul): Remove %G.
17560 (maybe_warn_for_bound): Same.
17561 (warn_for_access): Same.
17562 (check_access): Same.
17563 (check_strncat_sizes): Same.
17564 (expand_builtin_strncat): Same.
17565 (expand_builtin_strncmp): Same.
17566 (expand_builtin): Same.
17567 (expand_builtin_object_size): Same.
17568 (warn_dealloc_offset): Same.
17569 (maybe_emit_free_warning): Same.
17570 * calls.c (maybe_warn_alloc_args_overflow): Same.
17571 (maybe_warn_nonstring_arg): Same.
17572 (maybe_warn_rdwr_sizes): Same.
17573 * expr.c (expand_expr_real_1): Remove %K.
17574 * gimple-fold.c (gimple_fold_builtin_strncpy): Remove %G.
17575 (gimple_fold_builtin_strncat): Same.
17576 * gimple-ssa-sprintf.c (format_directive): Same.
17577 (handle_printf_call): Same.
17578 * gimple-ssa-warn-alloca.c (pass_walloca::execute): Same.
17579 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
17580 (maybe_diag_access_bounds): Same. Call gimple_location.
17581 (check_bounds_or_overlap): Same.
17582 * trans-mem.c (ipa_tm_scan_irr_block): Remove %K. Simplify.
17583 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Remove %G.
17584 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
17585 (maybe_diag_stxncpy_trunc): Same.
17586 (handle_builtin_stxncpy_strncat): Same.
17587 (maybe_warn_pointless_strcmp): Same.
17588 * tree-ssa-uninit.c (maybe_warn_operand): Same.
17590 2021-07-06 Uroš Bizjak <ubizjak@gmail.com>
17593 * config/i386/predicates.md (vec_setm_operand): Enable
17594 register_operand for TARGET_SSE4_1.
17595 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_operand
17596 as operand 2 predicate. Call ix86_expand_vector_set_var
17597 for non-constant index operand.
17598 (vec_setv4qi): Use vec_setm_mmx_operand as operand 2 predicate.
17599 Call ix86_expand_vector_set_var for non-constant index operand.
17601 2021-07-06 Jeff Law <jeffreyalaw@gmail.com>
17603 * config/h8300/jumpcall.md (*branch): When possible, generate
17604 the comparison in CCZN mode.
17605 * config/h8300/predicates.md (simple_memory_operand): Reject all
17606 auto-increment addressing modes.
17608 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
17610 PR bootstrap/100246
17611 * config/i386/i386.h (struct stringop_algs): Define a CTOR for
17614 2021-07-06 Richard Biener <rguenther@suse.de>
17616 * doc/md.texi (vec_fmaddsub<mode>4): Document.
17617 (vec_fmsubadd<mode>4): Likewise.
17618 * optabs.def (vec_fmaddsub$a4): Add.
17619 (vec_fmsubadd$a4): Likewise.
17620 * internal-fn.def (IFN_VEC_FMADDSUB): Add.
17621 (IFN_VEC_FMSUBADD): Likewise.
17622 * tree-vect-slp-patterns.c (addsub_pattern::recognize):
17623 Refactor to handle IFN_VEC_FMADDSUB and IFN_VEC_FMSUBADD.
17624 (addsub_pattern::build): Likewise.
17625 * tree-vect-slp.c (vect_optimize_slp): CFN_VEC_FMADDSUB
17626 and CFN_VEC_FMSUBADD are not transparent for permutes.
17627 * config/i386/sse.md (vec_fmaddsub<mode>4): New expander.
17628 (vec_fmsubadd<mode>4): Likewise.
17630 2021-07-06 Richard Biener <rguenther@suse.de>
17632 * doc/invoke.texi (fmove-loop-stores): Document.
17633 * common.opt (fmove-loop-stores): New option.
17634 * opts.c (default_options_table): Enable -fmove-loop-stores
17635 at -O1 but not -Og.
17636 * tree-ssa-loop-im.c (pass_lim::execute): Pass
17637 flag_move_loop_stores instead of true to
17638 loop_invariant_motion_in_fun.
17640 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
17642 * doc/install.texi: Document --with-dsymutil.
17644 2021-07-06 Andrew Pinski <apinski@marvell.com>
17646 PR tree-optimization/101256
17647 * dbgcnt.def (phiopt_edge_range): New counter.
17648 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
17649 Check to make sure the new name is defined in the same
17650 bb as the conditional before duplicating range info.
17651 Also add debug counter.
17653 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
17655 PR rtl-optimization/100328
17656 * config/i386/i386-options.c (ix86_option_override_internal):
17657 Set param_ira_consider_dup_in_all_alts to 0.
17659 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
17661 PR rtl-optimization/100328
17662 * doc/invoke.texi (ira-consider-dup-in-all-alts): Document new
17664 * ira.c (ira_get_dup_out_num): Adjust as parameter
17665 param_ira_consider_dup_in_all_alts.
17666 * params.opt (ira-consider-dup-in-all-alts): New.
17667 * ira-conflicts.c (process_regs_for_copy): Add one parameter
17668 single_input_op_has_cstr_p.
17669 (get_freq_for_shuffle_copy): New function.
17670 (add_insn_allocno_copies): Adjust as single_input_op_has_cstr_p.
17671 * ira-int.h (ira_get_dup_out_num): Add one bool parameter.
17673 2021-07-05 Jeff Law <jeffreyalaw@gmail.com>
17675 * config/h8300/shiftrotate.md (shift-by-variable patterns): Update to
17676 generate condition code aware RTL directly.
17678 2021-07-05 Andrew Pinski <apinski@marvell.com>
17680 PR tree-optimization/101039
17681 * match.pd (A CMP 0 ? A : -A): New patterns.
17682 * tree-ssa-phiopt.c (abs_replacement): Delete function.
17683 (tree_ssa_phiopt_worker): Don't call abs_replacement.
17684 Update comment about abs_replacement.
17686 2021-07-05 Andrew Pinski <apinski@marvell.com>
17688 * tree-ssa-phiopt.c (gimple_simplify_phiopt):
17689 If "A ? B : C" fails to simplify, try "(!A) ? C : B".
17691 2021-07-05 Andrew Pinski <apinski@marvell.com>
17693 * tree-ssa-phiopt.c (match_simplify_replacement):
17694 Add early_p argument. Call gimple_simplify_phiopt
17695 instead of gimple_simplify.
17696 (tree_ssa_phiopt_worker): Update call to
17697 match_simplify_replacement and allow unconditionally.
17698 (phiopt_early_allow): New function.
17699 (gimple_simplify_phiopt): New function.
17701 2021-07-05 Andrew Pinski <apinski@marvell.com>
17703 PR middle-end/101237
17704 * fold-const.c (negate_expr_p): Remove call to element_mode
17705 and TREE_MODE/TREE_TYPE when calling HONOR_SIGNED_ZEROS,
17706 HONOR_SIGN_DEPENDENT_ROUNDING, and HONOR_SNANS.
17707 (fold_negate_expr_1): Likewise.
17708 (const_unop): Likewise.
17709 (fold_cond_expr_with_comparison): Likewise.
17710 (fold_binary_loc): Likewise.
17711 (fold_ternary_loc): Likewise.
17712 (tree_call_nonnegative_warnv_p): Likewise.
17713 * match.pd (-(A + B) -> (-B) - A): Likewise.
17715 2021-07-05 Iain Sandoe <iain@sandoe.co.uk>
17717 * configure.ac: Handle --with-dsymutil in the same way as we
17718 do for the assembler and linker. (DEFAULT_DSYMUTIL): New.
17719 Extract the type and version for the dsymutil configured or
17720 found by the default searches.
17721 * config.in: Regenerated.
17722 * configure: Regenerated.
17723 * collect2.c (do_dsymutil): Handle locating dsymutil in the
17724 same way as for the assembler and linker.
17725 * config/darwin.h (DSYMUTIL): Delete.
17726 * gcc.c: Report a configured dsymutil correctly.
17727 * exec-tool.in: Allow for dsymutil.
17729 2021-07-05 Uroš Bizjak <ubizjak@gmail.com>
17731 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
17732 Handle V4QI and V2HI modes.
17733 (expand_vec_perm_blend): Allow 4-byte vector modes with TARGET_SSE4_1.
17734 Handle V4QI mode. Emit mmx_pblendvb32 for 4-byte modes.
17735 (expand_vec_perm_pshufb): Rewrite to use switch statemets.
17736 Handle 4-byte dual operands with TARGET_XOP and single operands
17737 with TARGET_SSSE3. Emit mmx_ppermv32 for TARGET_XOP and
17738 mmx_pshufbv4qi3 for TARGET_SSSE3.
17739 (expand_vec_perm_pblendv): Allow 4-byte vector modes with TARGET_SSE4_1.
17740 (expand_vec_perm_interleave2): Allow 4-byte vector modes.
17741 (expand_vec_perm_pshufb2): Allow 4-byte vector modes with TARGET_SSSE3.
17742 (expand_vec_perm_even_odd_1): Handle V4QI mode.
17743 (expand_vec_perm_broadcast_1): Handle V4QI mode.
17744 (ix86_vectorize_vec_perm_const): Handle V4QI mode.
17745 * config/i386/mmx.md (mmx_ppermv32): New insn pattern.
17746 (mmx_pshufbv4qi3): Ditto.
17747 (*mmx_pblendw32): Ditto.
17748 (*mmx_pblendw64): Rename from *mmx_pblendw.
17749 (mmx_punpckhbw_low): New insn_and_split pattern.
17750 (mmx_punpcklbw_low): Ditto.
17752 2021-07-05 Richard Biener <rguenther@suse.de>
17754 * tree-vect-loop-manip.c (vect_loop_versioning): Do not
17755 set LOOP_C_INFINITE on the vectorized loop.
17757 2021-07-05 Richard Biener <rguenther@suse.de>
17759 PR middle-end/101291
17760 * cfgloopmanip.c (loop_version): Set the loop copy of the
17761 versioned loop to the new loop.
17763 2021-07-04 Iain Sandoe <iain@sandoe.co.uk>
17766 * config.gcc: Ensure that Darwin biarch definitions are
17767 added before i386.h.
17768 * config/i386/darwin.h (TARGET_64BIT): Remove.
17769 (PR80556_WORKAROUND): New.
17770 (REAL_LIBGCC_SPEC): Amend to use PR80556_WORKAROUND.
17771 (DARWIN_SUBARCH_SPEC): New.
17772 * config/i386/darwin32-biarch.h (TARGET_64BIT_DEFAULT,
17773 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
17774 (REAL_LIBGCC_SPEC): Remove.
17775 * config/i386/darwin64-biarch.h (TARGET_64BIT_DEFAULT,
17776 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
17777 (REAL_LIBGCC_SPEC): Remove.
17779 2021-07-03 H.J. Lu <hjl.tools@gmail.com>
17781 PR middle-end/101294
17782 * expr.c (store_constructor): Don't use vec_duplicate on vector.
17784 2021-07-02 Martin Sebor <msebor@redhat.com>
17786 PR middle-end/98871
17787 PR middle-end/98512
17788 * diagnostic.c (get_any_inlining_info): New.
17789 (update_effective_level_from_pragmas): Handle inlining context.
17790 (diagnostic_enabled): Same.
17791 (diagnostic_report_diagnostic): Same.
17792 * diagnostic.h (struct diagnostic_info): Add ctor.
17793 (struct diagnostic_context): Add new member.
17794 * tree-diagnostic.c (set_inlining_locations): New.
17795 (tree_diagnostics_defaults): Set new callback pointer.
17797 2021-07-02 Peter Bergner <bergner@linux.ibm.com>
17799 * config/rs6000/rs6000-builtin.def (BU_MMA_PAIR_LD, BU_MMA_PAIR_ST):
17801 (__builtin_vsx_lxvp, __builtin_vsx_stxvp): New built-ins.
17802 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Expand
17803 lxvp and stxvp built-ins.
17804 (mma_init_builtins): Handle lxvp and stxvp built-ins.
17805 (builtin_function_type): Likewise.
17806 * doc/extend.texi (__builtin_vsx_lxvp, __builtin_mma_stxvp): Document.
17808 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
17810 * config/h8300/h8300-protos.h (compute_a_shift_cc): Accept
17811 additional argument for the code.
17812 * config/h8300/h8300.c (compute_a_shift_cc): Accept additional
17813 argument for the code. Just return if the ZN bits are useful or
17814 not rather than the old style CC_* enums.
17815 * config/h8300/shiftrotate.md (shiftqi_noscratch): Move before
17816 more generic shiftqi patterns.
17817 (shifthi_noscratch, shiftsi_noscratch): Similarly.
17818 (shiftqi_noscratch_set_flags): New pattern.
17819 (shifthi_noscratch_set_flags, shiftsi_noscratch_set_flags): Likewise.
17821 2021-07-02 Andrew MacLeod <amacleod@redhat.com>
17823 PR tree-optimization/101223
17824 * range-op.cc (build_lt): Add -1 for signed values.
17825 (built_gt): Subtract -1 for signed values.
17827 2021-07-02 David Faust <david.faust@oracle.com>
17829 * btfout.c (get_btf_kind): Support BTF_KIND_FLOAT.
17830 (btf_asm_type): Likewise.
17832 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
17834 * config/h8300/h8300-protos.h (output_a_shift): Make first argument
17835 an array of rtx rather than a pointer to rtx. Add code argument.
17836 (compute_a_shift_length): Similarly.
17837 * config/h8300/h8300.c (h8300_shift_costs): Adjust now that the
17838 shift itself isn't an operand. Create dummy operand[0] to carry
17839 a mode and pass a suitable rtx code to compute_a_shift_length.
17840 (get_shift_alg): Adjust operand number of clobber in output templates.
17841 (output_a_shift): Make first argument an array of rtx rather than
17842 a pointer to rtx. Add code argument for the type of shift.
17843 Adjust now that the shift itself is no longer an operand.
17844 (compute_a_shift_length): Similarly.
17845 * config/h8300/shiftrotate.md (shiftqi, shifthi, shiftsi): Use an
17846 iterator rather than nshift_operator.
17847 (shiftqi_noscratch, shifthi_noscratch, shiftsi_noscratch): Likewise.
17848 (shiftqi_clobber_flags): Adjust to API changes in output_a_shift
17849 and compute_a_shift_length.
17850 (shiftqi_noscratch_clobber_flags): Likewise.
17851 (shifthi_noscratch_clobber_flags): Likewise.
17852 (shiftsi_noscratch_clobber_flags): Likewise.
17854 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
17857 * config/darwin.h (DSYMUTIL_SPEC): Do not try to run
17858 dsymutil for BTF/CTF.
17860 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
17863 * config/darwin.h (CTF_INFO_SECTION_NAME): Update the
17864 segment to include BTF.
17865 (BTF_INFO_SECTION_NAME): New.
17867 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
17869 * config/m32r/m32r-protos.h (call_operand): Adjust return type.
17870 (small_data_operand, memreg_operand, small_insn_p): Likewise.
17871 * config/m32r/m32r.c (call_operand): Adjust return type.
17872 (small_data_operand, memreg_operand): Likewise.
17874 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
17876 * config/frv/frv-protos.h (integer_register_operand): Adjust return
17878 (frv_load_operand, gpr_or_fpr_operand, gpr_no_subreg_operand): Likewise.
17879 (fpr_or_int6_operand, gpr_or_int_operand); Likewise.
17880 (gpr_or_int12_operand, gpr_or_int10_operand); Likewise.
17881 (move_source_operand, move_destination_operand): Likewise.
17882 (condexec_source_operand, condexec_dest_operand): Likewise.
17883 (lr_operand, gpr_or_memory_operand, fpr_or_memory_operand): Likewise.
17884 (reg_or_0_operand, fcc_operand, icc_operand, cc_operand): Likewise.
17885 (fcr_operand, icr_operand, cr_operand, call_operand): Likewise.
17886 (fpr_operand, even_reg_operand, odd_reg_operand): Likewise.
17887 (even_gpr_operand, odd_gpr_operand, quad_fpr_operand): Likewise.
17888 (even_fpr_operand, odd_fpr_operand): Likewise.
17889 (dbl_memory_one_insn_operand, dbl_memory_two_insn_operand): Likewise.
17890 (int12_operand, int6_operand, int5_operand, uint5_operand): Likewise.
17891 (uint4_operand, uint1_operand, int_2word_operand): Likewise
17892 (upper_int16_operand, uint16_operand, symbolic_operand): Likewise.
17893 (relational_operator, float_relational_operator): Likewise.
17894 (ccr_eqne_operator, minmax_operator): Likewise.
17895 (condexec_si_binary_operator, condexec_si_media_operator): Likewise.
17896 (condexec_si_divide_operator, condexec_si_unary_operator): Likewise.
17897 (condexec_sf_conv_operator, condexec_sf_add_operator): Likewise.
17898 (intop_compare_operator, acc_operand, even_acc_operand): Likewise.
17899 (quad_acc_operand, accg_operand): Likewise.
17901 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
17903 * config/stormy16/stormy16-protos.h (xstormy16_below_100_symbol): Change
17904 return type to a bool.
17905 (nonimmediate_nonstack_operand): Likewise.
17906 (xstormy16_splittable_below100_operand): Likewise.
17907 * config/stormy16/stormy16.c (xstormy16_below_100_symbol): Fix
17909 (xstormy16_splittable_below100_operand): Likewise.
17911 2021-07-02 Richard Biener <rguenther@suse.de>
17913 PR tree-optimization/101293
17914 * tree-ssa-loop-im.c (mem_ref_hasher::equal): Compare MEM_REF bases
17915 with combined offsets.
17916 (gather_mem_refs_stmt): Hash MEM_REFs as if their offset were
17917 combined with the rest of the offset.
17919 2021-07-02 Eric Botcazou <ebotcazou@adacore.com>
17921 * config/i386/i386.c (asm_preferred_eh_data_format): Always use the
17922 PIC encodings for PE-COFF targets.
17924 2021-07-02 Jakub Jelinek <jakub@redhat.com>
17927 * config/i386/i386-expand.c (ix86_broadcast_from_integer_constant):
17928 Return nullptr for TImode inner mode.
17930 2021-07-02 Richard Biener <rguenther@suse.de>
17932 PR tree-optimization/101280
17933 PR tree-optimization/101173
17934 * gimple-loop-interchange.cc
17935 (tree_loop_interchange::valid_data_dependences): Properly
17936 guard all dependence checks with DDR_REVERSED_P or its
17939 2021-07-02 Hongyu Wang <hongyu.wang@intel.com>
17941 * config/i386/i386-expand.c (ix86_expand_builtin):
17942 Add branch to clear odata when ZF is set for asedecenc_expand
17943 and wideaesdecenc_expand.
17945 2021-07-02 Eugene Rozenfeld <erozen@microsoft.com>
17947 * config/i386/gcc-auto-profile: regenerate
17949 2021-07-02 liuhongt <hongtao.liu@intel.com>
17951 * config/i386/sse.md (trunc<mode><pmov_dst_4>2): Refined to ..
17952 (trunc<mode><pmov_dst_4_lower>2): this.
17954 2021-07-01 David Malcolm <dmalcolm@redhat.com>
17956 * diagnostic.h (diagnostic_context::m_file_cache): New field.
17957 * input.c (class fcache): Rename to...
17958 (class file_cache_slot): ...this, making most members private and
17959 prefixing fields with "m_".
17960 (file_cache_slot::get_file_path): New accessor.
17961 (file_cache_slot::get_use_count): New accessor.
17962 (file_cache_slot::missing_trailing_newline_p): New accessor.
17963 (file_cache_slot::inc_use_count): New.
17964 (fcache_buffer_size): Move to...
17965 (file_cache_slot::buffer_size): ...here.
17966 (fcache_line_record_size): Move to...
17967 (file_cache_slot::line_record_size): ...here.
17968 (fcache_tab): Delete, in favor of global_dc->m_file_cache.
17969 (fcache_tab_size): Move to file_cache::num_file_slots.
17970 (diagnostic_file_cache_init): Update for move of fcache_tab
17971 to global_dc->m_file_cache.
17972 (diagnostic_file_cache_fini): Likewise.
17973 (lookup_file_in_cache_tab): Convert to...
17974 (file_cache::lookup_file): ...this.
17975 (diagnostics_file_cache_forcibly_evict_file): Update for move of
17976 fcache_tab to global_dc->m_file_cache, moving most of
17977 implementation to...
17978 (file_cache::forcibly_evict_file): ...this new function and...
17979 (file_cache_slot::evict): ...this new function.
17980 (evicted_cache_tab_entry): Convert to...
17981 (file_cache::evicted_cache_tab_entry): ...this.
17982 (add_file_to_cache_tab): Convert to...
17983 (file_cache::add_file): ...this, moving bulk of implementation
17985 (file_cache_slot::create): ..this new function.
17986 (file_cache::file_cache): New.
17987 (file_cache::~file_cache): New.
17988 (lookup_or_add_file_to_cache_tab): Convert to...
17989 (file_cache::lookup_or_add_file): ..this new function.
17990 (fcache::fcache): Rename to...
17991 (file_cache_slot::file_cache_slot): ...this, adding "m_" prefixes
17993 (fcache::~fcache): Rename to...
17994 (file_cache_slot::~file_cache_slot): ...this, adding "m_" prefixes
17996 (needs_read): Convert to...
17997 (file_cache_slot::needs_read_p): ...this.
17998 (needs_grow): Convert to...
17999 (file_cache_slot::needs_grow_p): ...this.
18000 (maybe_grow): Convert to...
18001 (file_cache_slot::maybe_grow): ...this.
18002 (read_data): Convert to...
18003 (file_cache_slot::read_data): ...this.
18004 (maybe_read_data): Convert to...
18005 (file_cache_slot::maybe_read_data): ...this.
18006 (get_next_line): Convert to...
18007 (file_cache_slot::get_next_line): ...this.
18008 (goto_next_line): Convert to...
18009 (file_cache_slot::goto_next_line): ...this.
18010 (read_line_num): Convert to...
18011 (file_cache_slot::read_line_num): ...this.
18012 (location_get_source_line): Update for moving of globals to
18013 global_dc->m_file_cache.
18014 (location_missing_trailing_newline): Likewise.
18015 * input.h (class file_cache_slot): New forward decl.
18016 (class file_cache): New.
18018 2021-07-01 Michael Meissner <meissner@linux.ibm.com>
18020 * config/rs6000/rs6000.c (rs6000_maybe_emit_fp_cmove): Add IEEE
18021 128-bit floating point conditional move support.
18022 (have_compare_and_set_mask): Add IEEE 128-bit floating point
18024 * config/rs6000/rs6000.md (mov<mode>cc, IEEE128 iterator): New insn.
18025 (mov<mode>cc_p10, IEEE128 iterator): New insn.
18026 (mov<mode>cc_invert_p10, IEEE128 iterator): New insn.
18027 (fpmask<mode>, IEEE128 iterator): New insn.
18028 (xxsel<mode>, IEEE128 iterator): New insn.
18030 2021-07-01 Iain Sandoe <iain@sandoe.co.uk>
18033 * config/darwin.h (CTF_INFO_SECTION_NAME): New.
18035 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
18037 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
18039 * config/i386/i386-protos.h (ix86_expand_vector_init_duplicate):
18041 * config/i386/sse.md (INT_BROADCAST_MODE): New mode iterator.
18042 (vec_duplicate<mode>): New expander.
18044 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
18047 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
18049 (ix86_byte_broadcast): New function.
18050 (ix86_convert_const_wide_int_to_broadcast): Likewise.
18051 (ix86_expand_move): Convert CONST_WIDE_INT to broadcast if mode
18052 size is 16 bytes or bigger.
18053 (ix86_broadcast_from_integer_constant): New function.
18054 (ix86_expand_vector_move): Convert CONST_WIDE_INT and CONST_VECTOR
18055 to broadcast if mode size is 16 bytes or bigger.
18056 * config/i386/i386-protos.h (ix86_gen_scratch_sse_rtx): New
18058 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): New function.
18060 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
18062 * config/i386/predicates.md (ix86_endbr_immediate_operand):
18063 Return true/false instead of 1/0.
18064 (movq_parallel): Ditto.
18066 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
18068 * recog.c (general_operand): Return true/false instead of 1/0.
18069 (register_operand): Ditto.
18070 (immediate_operand): Ditto.
18071 (const_int_operand): Ditto.
18072 (const_scalar_int_operand): Ditto.
18073 (const_double_operand): Ditto.
18074 (push_operand): Ditto.
18075 (pop_operand): Ditto.
18076 (memory_operand): Ditto.
18077 (indirect_operand): Ditto.
18079 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
18081 * genpreds.c (write_predicate_subfunction):
18082 Change the type of written subfunction to bool.
18083 (write_one_predicate_function):
18084 Change the type of written function to bool.
18085 (write_tm_preds_h): Ditto.
18086 * recog.h (*insn_operand_predicate_fn): Change the type to bool.
18087 * recog.c (general_operand): Change the type to bool.
18088 (address_operand): Ditto.
18089 (register_operand): Ditto.
18090 (pmode_register_operand): Ditto.
18091 (scratch_operand): Ditto.
18092 (immediate_operand): Ditto.
18093 (const_int_operand): Ditto.
18094 (const_scalar_int_operand): Ditto.
18095 (const_double_operand): Ditto.
18096 (nonimmediate_operand): Ditto.
18097 (nonmemory_operand): Ditto.
18098 (push_operand): Ditto.
18099 (pop_operand): Ditto.
18100 (memory_operand): Ditto.
18101 (indirect_operand): Ditto.
18102 (ordered_comparison_operator): Ditto.
18103 (comparison_operator): Ditto.
18104 * config/i386/i386-expand.c (ix86_expand_sse_cmp):
18105 Change the type of indirect predicate function to bool.
18106 * config/rs6000/rs6000.c (easy_vector_constant):
18107 Change the type to bool.
18108 * config/mips/mips-protos.h (m16_based_address_p):
18109 Change the type of operand 3 to bool.
18111 2021-07-01 Richard Biener <rguenther@suse.de>
18113 PR tree-optimization/101280
18114 PR tree-optimization/101173
18115 * gimple-loop-interchange.cc
18116 (tree_loop_interchange::valid_data_dependences): Revert
18117 previous change and instead correctly handle DDR_REVERSED_P
18120 2021-07-01 Richard Biener <rguenther@suse.de>
18122 PR tree-optimization/101278
18123 * tree-ssa-dse.c (dse_classify_store): First check for
18124 uses, then ignore stmt for chaining purposes.
18126 2021-07-01 Richard Biener <rguenther@suse.de>
18128 PR tree-optimization/100778
18129 * tree-vect-slp.c (vect_schedule_slp_node): Do not place trapping
18130 vectorized ops ahead of their scalar BB.
18132 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
18135 * config/i386/i386.md (*nabs<dwi>2_doubleword):
18136 New insn_and_split pattern.
18137 (*nabs<dwi>2_1): Ditto.
18138 * config/i386/i386-features.c
18139 (general_scalar_chain::compute_convert_gain):
18140 Handle (NEG (ABS (...))) RTX. Rewrite src code
18141 scanner as switch statement.
18142 (general_scalar_chain::convert_insn):
18143 Handle (NEG (ABS (...))) RTX.
18144 (general_scalar_to_vector_candidate_p):
18145 Detect (NEG (ABS (...))) RTX. Reorder case statements
18146 for (AND (NOT (...) ...)) fallthrough.
18148 2021-07-01 Richard Biener <rguenther@suse.de>
18150 PR tree-optimization/101178
18151 * tree-vect-slp.c (slpg_vertex::materialize): Remove.
18152 (slpg::perm_in): Add.
18153 (slpg::get_perm_in): Remove.
18154 (slpg::get_perm_materialized): Add.
18155 (vect_optimize_slp): Handle VEC_PERM nodes more optimally
18156 during permute propagation and materialization.
18158 2021-07-01 Jakub Jelinek <jakub@redhat.com>
18161 * dwarf2out.c (loc_list_from_tree_1): Handle COMPOUND_LITERAL_EXPR.
18163 2021-07-01 Jakub Jelinek <jakub@redhat.com>
18165 PR middle-end/94366
18166 * omp-low.c (lower_rec_input_clauses): Rename is_fp_and_or to
18167 is_truth_op, set it for TRUTH_*IF_EXPR regardless of new_var's type,
18168 use boolean_type_node instead of integer_type_node as NE_EXPR type.
18169 (lower_reduction_clauses): Likewise.
18171 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
18173 * config/gcn/gcn.c: Include dwarf2.h.
18174 (gcn_addr_space_debug): New function.
18175 (TARGET_ADDR_SPACE_DEBUG): New hook.
18177 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
18179 * common/config/gcn/gcn-common.c
18180 (gcn_option_optimization_table): Change OPT_fomit_frame_pointer to -O3.
18181 * config/gcn/gcn.c (gcn_expand_prologue): Prefer the frame pointer
18183 (gcn_expand_prologue): Prefer the frame pointer when emitting CFI.
18184 (gcn_frame_pointer_rqd): New function.
18185 (TARGET_FRAME_POINTER_REQUIRED): New hook.
18187 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
18189 * config/gcn/gcn.c (move_callee_saved_registers): Emit CFI notes for
18190 prologue register saves.
18191 (gcn_debug_unwind_info): Use UI_DWARF2.
18192 (gcn_dwarf_register_number): Map DWARF_LINK_REGISTER to DWARF PC.
18193 (gcn_dwarf_register_span): DWARF_LINK_REGISTER doesn't span.
18194 * config/gcn/gcn.h: (DWARF_FRAME_RETURN_COLUMN): New define.
18195 (DWARF_LINK_REGISTER): New define.
18196 (FIRST_PSEUDO_REGISTER): Increment.
18197 (FIXED_REGISTERS): Add entry for DWARF_LINK_REGISTER.
18198 (CALL_USED_REGISTERS): Likewise.
18199 (REGISTER_NAMES): Likewise.
18201 2021-06-30 Richard Biener <rguenther@suse.de>
18203 PR tree-optimization/101267
18204 * tree-vect-stmts.c (vect_check_scalar_mask): Adjust
18205 API and use SLP compatible interface of vect_is_simple_use.
18206 Reject not vectorized SLP defs for callers that do not support
18208 (vect_check_store_rhs): Handle masked stores and pass down
18209 the appropriate operator index.
18210 (vectorizable_call): Adjust.
18211 (vectorizable_store): Likewise.
18212 (vectorizable_load): Likewise. Handle SLP pecularity of
18214 (vect_is_simple_use): Remove special-casing of masked stores.
18216 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
18218 * common.opt (foffload): Remove help as Driver only.
18219 * gcc.c (display_help): Add -foffload.
18221 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
18223 * gcc.c (close_at_file, execute): Replace alloca by XALLOCAVEC.
18224 (check_offload_target_name): Fix splitting OFFLOAD_TARGETS into
18225 a candidate list; better inform no offload target is configured
18226 and fix hint extraction when passed target is not '\0' at [len].
18227 * common.opt (foffload): Add tailing '.'.
18228 (foffload-options): Likewise; fix flag name in the help string.
18230 2021-06-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
18233 * config/arm/arm_neon.h: Move vabs intrinsics before vcage_f32.
18234 (vcage_f32): Gate comparison on __FAST_MATH__.
18235 (vcageq_f32): Likewise.
18236 (vcale_f32): Likewise.
18237 (vcaleq_f32): Likewise.
18238 (vcagt_f32): Likewise.
18239 (vcagtq_f32): Likewise.
18240 (vcalt_f32): Likewise.
18241 (vcaltq_f32): Likewise.
18242 (vcage_f16): Likewise.
18243 (vcageq_f16): Likewise.
18244 (vcale_f16): Likewise.
18245 (vcaleq_f16): Likewise.
18246 (vcagt_f16): Likewise.
18247 (vcagtq_f16): Likewise.
18248 (vcalt_f16): Likewise.
18249 (vcaltq_f16): Likewise.
18251 2021-06-30 Richard Biener <rguenther@suse.de>
18253 PR tree-optimization/101264
18254 * tree-vect-slp.c (vect_optimize_slp): Propagate the
18255 computed perm_in to all "any" permute successors
18256 we cannot de-duplicate immediately.
18258 2021-06-30 liuhongt <hongtao.liu@intel.com>
18261 * config/i386/sse.md
18262 (avx512f_sfixupimm<mode><sd_maskz_name><round_saeonly_name>):
18264 (avx512f_sfixupimm<mode><maskz_scalar_name><round_saeonly_name>):
18266 (avx512f_sfixupimm<mode>_mask<round_saeonly_name>"): Refined.
18267 * config/i386/subst.md (maskz_scalar): New define_subst.
18268 (maskz_scalar_name): New subst_attr.
18269 (maskz_scalar_op5): Ditto.
18270 (round_saeonly_maskz_scalar_op5): Ditto.
18271 (round_saeonly_maskz_scalar_operand5): Ditto.
18273 2021-06-30 David Edelsohn <dje.gcc@gmail.com>
18275 * config/rs6000/rs6000.c (rs6000_xcoff_section_type_flags):
18276 Increase code CSECT alignment to at least 32 bytes.
18277 * config/rs6000/xcoff.h (TEXT_SECTION_ASM_OP): Add 32 byte
18278 alignment designation.
18280 2021-06-29 Sergei Trofimovich <siarheit@google.com>
18282 * doc/generic.texi: Fix s/net yet/not yet/ typo.
18284 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
18286 PR tree-optimization/101254
18287 * range-op.cc (operator_minus::op1_op2_relation_effect): Check for
18288 wrapping/non-wrapping when setting the result range.
18290 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
18292 * value-query.cc (gimple_range_global): Allow phis.
18294 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
18296 * vr-values.c (vr_values::vrp_stmt_computes_nonzero): Use stmt.
18297 (simplify_using_ranges::op_with_boolean_value_range_p): Add a
18298 statement for location context.
18299 (check_for_binary_op_overflow): Ditto.
18300 (simplify_using_ranges::get_vr_for_comparison): Ditto.
18301 (simplify_using_ranges::compare_name_with_value): Ditto.
18302 (simplify_using_ranges::compare_names): Ditto.
18303 (vrp_evaluate_conditional_warnv_with_ops_using_ranges): Ditto.
18304 (simplify_using_ranges::simplify_truth_ops_using_ranges): Ditto.
18305 (simplify_using_ranges::simplify_min_or_max_using_ranges): Ditto.
18306 (simplify_using_ranges::simplify_internal_call_using_ranges): Ditto.
18307 (simplify_using_ranges::two_valued_val_range_p): Ditto.
18308 (simplify_using_ranges::simplify): Ditto.
18309 * vr-values.h: Adjust prototypes.
18311 2021-06-29 Uroš Bizjak <ubizjak@gmail.com>
18314 * config/i386/mmx.md (vec_addsubv2sf3): New insn pattern.
18316 2021-06-29 Julian Brown <julian@codesourcery.com>
18318 * config/gcn/gcn.c (gcn_init_libfuncs): New function.
18319 (TARGET_INIT_LIBFUNCS): Define target hook using above function.
18320 * config/gcn/gcn.h (UNITS_PER_WORD): Define to 8 for IN_LIBGCC2, 4
18322 (LIBGCC2_UNITS_PER_WORD, BITS_PER_WORD): Remove definitions.
18323 (MAX_FIXED_MODE_SIZE): Change to 128.
18325 2021-06-29 Julian Brown <julian@codesourcery.com>
18327 * config/gcn/gcn.md (UNSPEC_FLBIT_INT): New unspec constant.
18328 (s_mnemonic): Add clrsb.
18329 (gcn_flbit<mode>_int): Add insn pattern for SImode/DImode.
18330 (clrsb<mode>2): Add expander for SImode/DImode.
18332 2021-06-29 Julian Brown <julian@codesourcery.com>
18334 * config/gcn/gcn.md (<su>mulsidi3, <su>mulsidi3_reg, <su>mulsidi3_imm,
18335 muldi3): Add patterns.
18337 2021-06-29 Julian Brown <julian@codesourcery.com>
18339 * config/gcn/gcn.md (<su>mulsi3_highpart): Change to expander.
18340 (<su>mulsi3_highpart_reg, <su>mulsi3_highpart_imm): New patterns.
18342 2021-06-29 Julian Brown <julian@codesourcery.com>
18344 * config/gcn/gcn.md (mulsi3): Make s_mulk_i32 variant clobber SCC.
18346 2021-06-29 Joseph Myers <joseph@codesourcery.com>
18348 * btfout.c, ctfout.c: Include "memmodel.h".
18350 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
18352 * gcc.c (check_offload_target_name): Cast len argument to
18353 %q.*s to 'int'; avoid -Wstringop-truncation warning.
18355 2021-06-29 Richard Biener <rguenther@suse.de>
18357 * tree-vect-slp.c (vect_optimize_slp): Forward propagate
18358 to "any" permute nodes and relax "any" permute proapgation
18359 during iterative backward propagation.
18361 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
18364 * common.opt (-foffload=): Update description.
18365 (-foffload-options=): New.
18366 * doc/invoke.texi (C Language Options): Document
18367 -foffload and -foffload-options.
18368 * gcc.c (check_offload_target_name): New, split off from
18369 handle_foffload_option.
18370 (check_foffload_target_names): New.
18371 (handle_foffload_option): Handle -foffload=default.
18372 (driver_handle_option): Update for -foffload-options.
18373 * lto-opts.c (lto_write_options): Use -foffload-options
18374 instead of -foffload.
18375 * lto-wrapper.c (merge_and_complain, append_offload_options):
18377 * opts.c (common_handle_option): Likewise.
18379 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
18381 * doc/invoke.texi (C Language Options): Sort options
18382 alphabetically in optlist and also the description itself.
18383 Remove leftover -fallow-single-precision from and add missing
18384 -fgnu-tm to the optlist.
18386 2021-06-29 Richard Biener <rguenther@suse.de>
18388 * tree-vect-slp.c (slpg_vertex::visited): Remove.
18389 (vect_slp_perms_eq): Handle -1 permutes.
18390 (vect_optimize_slp): Rewrite permute propagation.
18392 2021-06-29 Jakub Jelinek <jakub@redhat.com>
18395 * match.pd ((intptr_t)x eq/ne CST to x eq/ne (typeof x) CST): Don't
18396 perform the optimization in GENERIC when sanitizing and x has a
18399 2021-06-29 Richard Biener <rguenther@suse.de>
18401 PR tree-optimization/101242
18402 * tree-vect-slp.c (vect_slp_build_vertices): Force-add
18403 PHIs with not represented initial values as leafs.
18405 2021-06-29 Jan-Benedict Glaw <jbglaw@getslash.de>
18407 * config/pdp11/pdp11.h (ASM_OUTPUT_SKIP): Fix signedness warning.
18408 * config/pdp11/pdp11.c (pdp11_asm_print_operand_punct_valid_p): Remove
18409 "register" keyword.
18410 (pdp11_initial_elimination_offset) Remove unused variable.
18411 (pdp11_cmp_length) Ditto.
18412 (pdp11_insn_cost): Ditto, and fix signedness warning.
18414 2021-06-29 David Edelsohn <dje.gcc@gmail.com>
18416 * btfout.c: Include tm_p.h.
18419 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
18421 * config/bpf/bpf.c (bpf_expand_prologue): Do not mark insns as
18423 (bpf_expand_epilogue): Likewise.
18424 * config/bpf/bpf.h (DWARF2_FRAME_INFO): Define to 0.
18425 Do not define DBX_DEBUGGING_INFO.
18427 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
18429 * doc/invoke.texi: Document the CTF and BTF debug info options.
18431 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
18432 David Faust <david.faust@oracle.com>
18433 Jose E. Marchesi <jose.marchesi@oracle.com>
18434 Weimin Pan <weimin.pan@oracle.com>
18436 * Makefile.in: Add ctfc.*, ctfout.c and btfout.c files to
18437 GTFILES. Add new object files.
18438 * common.opt: Add CTF and BTF debug info options.
18439 * btfout.c: New file.
18440 * ctfc.c: Likewise.
18441 * ctfc.h: Likewise.
18442 * ctfout.c: Likewise.
18443 * dwarf2ctf.c: Likewise.
18444 * dwarf2ctf.h: Likewise.
18445 * dwarf2cfi.c (dwarf2out_do_frame): Acknowledge CTF_DEBUG and
18447 * dwarf2out.c (dwarf2out_source_line): Likewise.
18448 (dwarf2out_finish): Skip emitting DWARF if CTF or BTF are to
18450 (debug_format_do_cu): New function.
18451 (dwarf2out_early_finish): Traverse DIEs and emit CTF/BTF for
18453 Include dwarf2ctf.c.
18454 * final.c (dwarf2_debug_info_emitted_p): Acknowledge DWARF-based debug
18456 * flag-types.h (enum debug_info_type): Add CTF_DEBUG and BTF_DEBUG.
18457 (CTF_DEBUG): New bitmask.
18458 (BTF_DEBUG): Likewise.
18459 (enum ctf_debug_info_levels): New enum.
18460 * gengtype.c (open_base_files): Handle ctfc.h.
18461 (main): Handle uint32_t type.
18462 * flags.h (btf_debuginfo_p): New definition.
18463 (dwarf_based_debuginfo_p): Likewise.
18464 * opts.c (debug_type_names): Add entries for CTF and BTF.
18465 (btf_debuginfo_p): New function.
18466 (dwarf_based_debuginfo_p): Likewise.
18467 (common_handle_option): Handle -gctfN and -gbtf options.
18468 (set_debug_level): Set CTF_DEBUG, BTF_DEBUG whenever appropriate.
18469 * toplev.c (process_options): Inform the user and ignore -gctfLEVEL if
18472 2021-06-28 Jose E. Marchesi <jose.marchesi@oracle.com>
18474 * dwarf2out.c (AT_class): Function is no longer static.
18475 (AT_int): Likewise.
18476 (AT_unsigned): Likewise.
18477 (AT_loc): Likewise.
18478 (get_AT): Likewise.
18479 (get_AT_string): Likewise.
18480 (get_AT_flag): Likewise.
18481 (get_AT_unsigned): Likewise.
18482 (get_AT_ref): Likewise.
18483 (new_die_raw): Likewise.
18484 (lookup_decl_die): Likewise.
18485 (base_type_die): Likewise.
18486 (add_name_attribute): Likewise.
18487 (add_AT_int): Likewise.
18488 (add_AT_unsigned): Likewise.
18489 (add_AT_loc): Likewise.
18490 (dw_get_die_tag): New function.
18491 (dw_get_die_child): Likewise.
18492 (dw_get_die_sib): Likewise.
18493 (struct dwarf_file_data): Move from here to dwarf2out.h
18494 (struct dw_attr_struct): Likewise.
18495 * dwarf2out.h: Analogous changes.
18497 2021-06-28 Martin Jambor <mjambor@suse.cz>
18500 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
18501 members m_dead_stmts and m_dead_ssas.
18502 * ipa-param-manipulation.c
18503 (ipa_param_body_adjustments::mark_dead_statements): New function.
18504 (ipa_param_body_adjustments::common_initialization): Call it on
18505 all removed but not split parameters.
18506 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
18508 (ipa_param_body_adjustments::modify_call_stmt): Remove arguments that
18510 * tree-inline.c (remap_gimple_stmt): Do not copy dead statements, reset
18511 dead debug statements.
18512 (copy_phis_for_bb): Do not copy dead PHI nodes.
18514 2021-06-28 Martin Jambor <mjambor@suse.cz>
18517 * symtab-clones.h (clone_info): Removed member param_adjustments.
18518 * ipa-param-manipulation.h: Adjust initial comment to reflect how we
18519 deal with pass-through splits now.
18520 (ipa_param_performed_split): Removed.
18521 (ipa_param_adjustments::modify_call): Adjusted parameters.
18522 (class ipa_param_body_adjustments): Adjusted parameters of
18523 register_replacement, modify_gimple_stmt and modify_call_stmt.
18524 (ipa_verify_edge_has_no_modifications): Declare.
18525 (ipa_edge_modifications_finalize): Declare.
18526 * cgraph.c (cgraph_edge::redirect_call_stmt_to_callee): Remove
18527 performed_splits processing, pas only edge to padjs->modify_call,
18528 check that call arguments were not modified if they should not have
18530 * cgraphclones.c (cgraph_node::create_clone): Do not copy performed
18532 * ipa-param-manipulation.c (struct pass_through_split_map): New type.
18533 (ipa_edge_modification_info): Likewise.
18534 (ipa_edge_modification_sum): Likewise.
18535 (ipa_edge_modifications): New edge summary.
18536 (ipa_verify_edge_has_no_modifications): New function.
18537 (transitive_split_p): Removed.
18538 (transitive_split_map): Likewise.
18539 (init_transitive_splits): Likewise.
18540 (ipa_param_adjustments::modify_call): Adjusted to use the new edge
18541 summary instead of performed_splits.
18542 (ipa_param_body_adjustments::register_replacement): Drop dummy
18543 parameter, set base_index of the created ipa_param_body_replacement.
18544 (phi_arg_will_live_p): New function.
18545 (ipa_param_body_adjustments::common_initialization): Do not create
18546 IPA_SRA dummy decls.
18547 (simple_tree_swap_info): Removed.
18548 (remap_split_decl_to_dummy): Likewise.
18549 (record_argument_state_1): New function.
18550 (record_argument_state): Likewise.
18551 (ipa_param_body_adjustments::modify_call_stmt): New parameter
18552 orig_stmt. Do not work with dummy decls, save necessary info about
18553 changes to ipa_edge_modifications.
18554 (ipa_param_body_adjustments::modify_gimple_stmt): New parameter
18555 orig_stmt, pass it to modify_call_stmt.
18556 (ipa_param_body_adjustments::modify_cfun_body): Adjust call to
18557 modify_gimple_stmt.
18558 (ipa_edge_modifications_finalize): New function.
18559 * tree-inline.c (remap_gimple_stmt): Pass original statement to
18560 modify_gimple_stmt.
18561 (copy_phis_for_bb): Do not copy dead PHI nodes.
18562 (expand_call_inline): Do not remap performed_splits.
18563 (update_clone_info): Likewise.
18564 * toplev.c: Include ipa-param-manipulation.h.
18565 (toplev::finalize): Call ipa_edge_modifications_finalize.
18567 2021-06-28 Andrew Pinski <apinski@marvell.com>
18569 * tree-ssa-phiopt.c (replace_phi_edge_with_variable): Duplicate range
18570 info if we're the only things setting the target PHI.
18571 (value_replacement): Don't duplicate range here.
18572 (minmax_replacement): Likewise.
18574 2021-06-28 Richard Biener <rguenther@suse.de>
18576 PR tree-optimization/101229
18577 * gimple-walk.c (gimple_walk_op): Handle PHIs.
18579 2021-06-28 Martin Liska <mliska@suse.cz>
18581 * config/v850/v850.c (construct_dispose_instruction): Allocate
18583 (construct_prepare_instruction): Likewise.
18585 2021-06-28 Martin Liska <mliska@suse.cz>
18587 * config/v850/v850.c (v850_option_override): Build default
18589 (v850_can_inline_p): New. Allow MASK_PROLOG_FUNCTION to be
18590 ignored for inlining.
18591 (TARGET_CAN_INLINE_P): New.
18593 2021-06-28 Richard Biener <rguenther@suse.de>
18595 PR tree-optimization/101207
18596 * tree-vect-slp.c (vect_optimize_slp): Do BB reduction
18597 permute eliding for load permutations properly.
18599 2021-06-28 Richard Biener <rguenther@suse.de>
18601 PR tree-optimization/101173
18602 * gimple-loop-interchange.cc
18603 (tree_loop_interchange::valid_data_dependences): Disallow outer
18604 loop dependence distance of zero.
18606 2021-06-28 liuhongt <hongtao.liu@intel.com>
18609 * config/i386/sse.md (*avx_cmp<mode>3_lt): New
18610 define_insn_and_split.
18611 (*avx_cmp<mode>3_ltint): Ditto.
18612 (*avx2_pcmp<mode>3_3): Ditto.
18613 (*avx2_pcmp<mode>3_4): Ditto.
18614 (*avx2_pcmp<mode>3_5): Ditto.
18616 2021-06-28 liuhongt <hongtao.liu@intel.com>
18618 * config/i386/i386-builtin.def (IX86_BUILTIN_BLENDVPD256,
18619 IX86_BUILTIN_BLENDVPS256, IX86_BUILTIN_PBLENDVB256,
18620 IX86_BUILTIN_BLENDVPD, IX86_BUILTIN_BLENDVPS,
18621 IX86_BUILTIN_PBLENDVB128): Replace icode with
18623 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold blendv
18625 * config/i386/sse.md (*<sse4_1_avx2>_pblendvb_lt_subreg_not):
18626 New pre_reload splitter.
18628 2021-06-27 Andrew Pinski <apinski@marvell.com>
18630 PR middle-end/101230
18631 * fold-const.c (fold_ternary_loc): Check
18632 the return value of invert_tree_comparison.
18634 2021-06-27 David Edelsohn <dje.gcc@gmail.com>
18636 * config.gcc: Add SPDX License Identifier.
18637 (powerpc-ibm-aix789): Default to aix73.h.
18638 (powerpc-ibm-aix7.2.*.*): New stanza.
18639 * config/rs6000/aix72.h: Add SPDX License Identifier.
18640 * config/rs6000/aix73.h: New file.
18642 2021-06-26 Jason Merrill <jason@redhat.com>
18644 * except.c: #include "dwarf2.h" instead of "dwarf2out.h".
18646 2021-06-26 Andrew Pinski <apinski@marvell.com>
18648 * genmatch.c (lower_cond): Copy for_subst_vec
18649 for the simplify also.
18650 (lower): Swap the order for lower_for and lower_cond.
18652 2021-06-26 Andrew Pinski <apinski@marvell.com>
18654 * tree-ssa-phiopt.c (match_simplify_replacement): Reset
18655 flow senatitive info on the moved ssa set.
18657 2021-06-26 Andrew Pinski <apinski@marvell.com>
18659 * fold-const.c (fold_cond_expr_with_comparison):
18660 Exand arg0 into comp_code, arg00, and arg01.
18661 (fold_ternary_loc): Use invert_tree_comparison
18662 instead of fold_invert_truthvalue for the case
18663 where we have A CMP B ? C : A.
18665 2021-06-25 Martin Sebor <msebor@redhat.com>
18667 PR middle-end/101216
18668 * calls.c (maybe_warn_rdwr_sizes): Use the no_warning constant.
18670 2021-06-25 Jeff Law <jeffreyalaw@gmail.com>
18672 * config/h8300/h8300.c (select_cc_mode): Handle ASHIFTRT and LSHIFTRT.
18674 2021-06-25 Richard Biener <rguenther@suse.de>
18676 PR tree-optimization/101202
18677 * tree-vect-slp.c (vect_optimize_slp): Explicitely handle
18680 2021-06-25 Richard Biener <rguenther@suse.de>
18682 * tree-vect-slp-patterns.c (addsub_pattern::build): Copy
18683 STMT_VINFO_REDUC_DEF from the original representative.
18685 2021-06-25 Martin Sebor <msebor@redhat.com>
18687 * builtins.c (warn_string_no_nul): Replace uses of TREE_NO_WARNING,
18688 gimple_no_warning_p and gimple_set_no_warning with
18689 warning_suppressed_p, and suppress_warning.
18691 (maybe_warn_for_bound): Same.
18692 (warn_for_access): Same.
18693 (check_access): Same.
18694 (expand_builtin_strncmp): Same.
18695 (fold_builtin_varargs): Same.
18696 * calls.c (maybe_warn_nonstring_arg): Same.
18697 (maybe_warn_rdwr_sizes): Same.
18698 * cfgexpand.c (expand_call_stmt): Same.
18699 * cgraphunit.c (check_global_declaration): Same.
18700 * fold-const.c (fold_undefer_overflow_warnings): Same.
18701 (fold_truth_not_expr): Same.
18702 (fold_unary_loc): Same.
18703 (fold_checksum_tree): Same.
18704 * gimple-array-bounds.cc (array_bounds_checker::check_array_ref): Same.
18705 (array_bounds_checker::check_mem_ref): Same.
18706 (array_bounds_checker::check_addr_expr): Same.
18707 (array_bounds_checker::check_array_bounds): Same.
18708 * gimple-expr.c (copy_var_decl): Same.
18709 * gimple-fold.c (gimple_fold_builtin_strcpy): Same.
18710 (gimple_fold_builtin_strncat): Same.
18711 (gimple_fold_builtin_stxcpy_chk): Same.
18712 (gimple_fold_builtin_stpcpy): Same.
18713 (gimple_fold_builtin_sprintf): Same.
18714 (fold_stmt_1): Same.
18715 * gimple-ssa-isolate-paths.c (diag_returned_locals): Same.
18716 * gimple-ssa-nonnull-compare.c (do_warn_nonnull_compare): Same.
18717 * gimple-ssa-sprintf.c (handle_printf_call): Same.
18718 * gimple-ssa-store-merging.c (imm_store_chain_info::output_merged_store): Same.
18719 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
18720 * gimple-ssa-warn-restrict.h: Adjust declarations.
18721 (maybe_diag_access_bounds): Replace uses of TREE_NO_WARNING,
18722 gimple_no_warning_p and gimple_set_no_warning with
18723 warning_suppressed_p, and suppress_warning.
18724 (check_call): Same.
18725 (check_bounds_or_overlap): Same.
18726 * gimple.c (gimple_build_call_from_tree): Same.
18727 * gimplify.c (gimplify_return_expr): Same.
18728 (gimplify_cond_expr): Same.
18729 (gimplify_modify_expr_complex_part): Same.
18730 (gimplify_modify_expr): Same.
18731 (gimple_push_cleanup): Same.
18732 (gimplify_expr): Same.
18733 * omp-expand.c (expand_omp_for_generic): Same.
18734 (expand_omp_taskloop_for_outer): Same.
18735 * omp-low.c (lower_rec_input_clauses): Same.
18736 (lower_lastprivate_clauses): Same.
18737 (lower_send_clauses): Same.
18738 (lower_omp_target): Same.
18739 * tree-cfg.c (pass_warn_function_return::execute): Same.
18740 * tree-complex.c (create_one_component_var): Same.
18741 * tree-inline.c (remap_gimple_op_r): Same.
18742 (copy_tree_body_r): Same.
18743 (declare_return_variable): Same.
18744 (expand_call_inline): Same.
18745 * tree-nested.c (lookup_field_for_decl): Same.
18746 * tree-sra.c (create_access_replacement): Same.
18747 (generate_subtree_copies): Same.
18748 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Same.
18749 * tree-ssa-forwprop.c (combine_cond_expr_cond): Same.
18750 * tree-ssa-loop-ch.c (ch_base::copy_headers): Same.
18751 * tree-ssa-loop-im.c (execute_sm): Same.
18752 * tree-ssa-phiopt.c (cond_store_replacement): Same.
18753 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
18754 (handle_builtin_strcpy): Same.
18755 (maybe_diag_stxncpy_trunc): Same.
18756 (handle_builtin_stxncpy_strncat): Same.
18757 (handle_builtin_strcat): Same.
18758 * tree-ssa-uninit.c (get_no_uninit_warning): Same.
18759 (set_no_uninit_warning): Same.
18760 (uninit_undefined_value_p): Same.
18761 (warn_uninit): Same.
18762 (maybe_warn_operand): Same.
18763 * tree-vrp.c (compare_values_warnv): Same.
18764 * vr-values.c (vr_values::extract_range_for_var_from_comparison_expr): Same.
18765 (test_for_singularity): Same.
18766 * gimple.h (warning_suppressed_p): New function.
18767 (suppress_warning): Same.
18768 (copy_no_warning): Same.
18769 (gimple_set_block): Call gimple_set_location.
18770 (gimple_set_location): Call copy_warning.
18772 2021-06-25 Martin Sebor <msebor@redhat.com>
18774 * tree.h (warning_suppressed_at, copy_warning,
18775 warning_suppressed_p, suppress_warning): New functions.
18777 2021-06-25 Martin Sebor <msebor@redhat.com>
18779 * Makefile.in (OBJS-libcommon): Add diagnostic-spec.o.
18780 * gengtype.c (open_base_files): Add diagnostic-spec.h.
18781 * diagnostic-spec.c: New file.
18782 * diagnostic-spec.h: New file.
18783 * tree.h (no_warning, all_warnings, suppress_warning_at): New
18785 * warning-control.cc: New file.
18787 2021-06-25 liuhongt <hongtao.liu@intel.com>
18790 * config/i386/i386.c (x86_order_regs_for_local_alloc):
18793 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
18795 PR tree-optimization/101189
18796 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Pass
18797 LHS range of condition to postfold routine.
18798 (fold_using_range::postfold_gcond_edges): Only process the TRUE or
18799 FALSE edge if the LHS range supports it being taken.
18800 * gimple-range-fold.h (postfold_gcond_edges): Add range parameter.
18802 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
18804 * value-relation.cc (equiv_oracle::dump): Do not dump NULL blocks.
18805 (relation_oracle::find_relation_block): Check correct bitmap.
18806 (relation_oracle::dump): Do not dump NULL blocks.
18808 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
18810 * gimple-range-cache.cc (ranger_cache::propagate_cache): Call
18811 range_on_edge instead of manually calculating.
18813 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
18815 * range-op.cc: Fix comment.
18817 2021-06-24 Uroš Bizjak <ubizjak@gmail.com>
18820 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
18821 Handle V8QI and V4HI modes.
18822 * config/i386/mmx.md (sse4_1_<any_extend:code>v4qiv4hi2):
18824 (sse4_1_<any_extend:code>v4qiv4hi2): Ditto.
18825 (mmxpackmode): New mode attribute.
18826 (vec_pack_trunc_<mmxpackmode:mode>): New expander.
18827 (mmxunpackmode): New mode attribute.
18828 (vec_unpacks_lo_<mmxunpackmode:mode>): New expander.
18829 (vec_unpacks_hi_<mmxunpackmode:mode>): Ditto.
18830 (vec_unpacku_lo_<mmxunpackmode:mode>): Ditto.
18831 (vec_unpacku_hi_<mmxunpackmode:mode>): Ditto.
18832 * config/i386/i386.md (extsuffix): Move from ...
18833 * config/i386/sse.md: ... here.
18835 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
18837 * dwarf2out.c (dwarf2out_assembly_start): Emit .file 0 marker here..
18838 (dwarf2out_finish): ...instead of here.
18840 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
18842 * configure.ac (--gdwarf-5 option): Use objdump instead of readelf.
18843 (working --gdwarf-4/--gdwarf-5 for all sources): Likewise.
18844 (--gdwarf-4 not refusing generated .debug_line): Adjust for Windows.
18845 * configure: Regenerate.
18847 2021-06-24 Richard Biener <rguenther@suse.de>
18849 * config/i386/sse.md (vec_addsubv4df3, vec_addsubv2df3,
18850 vec_addsubv8sf3, vec_addsubv4sf3): Merge into ...
18851 (vec_addsub<mode>3): ... using a new addsub_cst mode attribute.
18853 2021-06-24 Richard Biener <rguenther@suse.de>
18855 * config/i386/sse.md (avx_addsubv4df3): Rename to
18857 (avx_addsubv8sf3): Rename to vec_addsubv8sf3.
18858 (sse3_addsubv2df3): Rename to vec_addsubv2df3.
18859 (sse3_addsubv4sf3): Rename to vec_addsubv4sf3.
18860 * config/i386/i386-builtin.def: Adjust.
18861 * internal-fn.def (VEC_ADDSUB): New internal optab fn.
18862 * optabs.def (vec_addsub_optab): New optab.
18863 * tree-vect-slp-patterns.c (class addsub_pattern): New.
18864 (slp_patterns): Add addsub_pattern.
18865 * tree-vect-slp.c (vect_optimize_slp): Disable propagation
18866 across CFN_VEC_ADDSUB.
18867 * tree-vectorizer.h (vect_pattern::vect_pattern): Make
18869 * doc/md.texi (vec_addsub<mode>3): Document.
18871 2021-06-24 Jakub Jelinek <jakub@redhat.com>
18873 PR middle-end/101170
18874 * df-scan.c (df_ref_record): For paradoxical big-endian SUBREGs
18875 where regno + subreg_regno_offset wraps around use 0 as starting
18878 2021-06-24 Jakub Jelinek <jakub@redhat.com>
18880 PR middle-end/101172
18881 * stor-layout.c (finish_bitfield_representative): If nextf has
18882 error_mark_node type, set repr type to error_mark_node too.
18884 2021-06-24 Ilya Leoshkevich <iii@linux.ibm.com>
18886 * config/s390/s390.c (s390_function_profiler): Ignore labelno
18888 * config/s390/s390.h (NO_PROFILE_COUNTERS): Define.
18890 2021-06-24 Richard Biener <rguenther@suse.de>
18892 * tree-vect-slp.c (vect_optimize_slp): Do not propagate
18893 across operations that have different semantics on different
18896 2021-06-24 Jakub Jelinek <jakub@redhat.com>
18898 * tree.h (OMP_CLAUSE_MAP_IN_REDUCTION): Document meaning for OpenMP.
18899 * gimplify.c (gimplify_scan_omp_clauses): For OpenMP map clauses
18900 with OMP_CLAUSE_MAP_IN_REDUCTION flag partially defer gimplification
18901 of non-decl OMP_CLAUSE_DECL. For OMP_CLAUSE_IN_REDUCTION on
18902 OMP_TARGET user outer_ctx instead of ctx for placeholders and
18903 initializer/combiner gimplification.
18904 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_MAP_IN_REDUCTION
18905 on target constructs.
18906 (lower_rec_input_clauses): Likewise.
18907 (lower_omp_target): Likewise.
18908 * omp-expand.c (expand_omp_target): Temporarily ignore nowait clause
18909 on target if in_reduction is present.
18911 2021-06-24 Kewen Lin <linkw@linux.ibm.com>
18913 * tree-predcom.c (class pcom_worker): New class.
18914 (release_chain): Renamed to...
18915 (pcom_worker::release_chain): ...this.
18916 (release_chains): Renamed to...
18917 (pcom_worker::release_chains): ...this.
18918 (aff_combination_dr_offset): Renamed to...
18919 (pcom_worker::aff_combination_dr_offset): ...this.
18920 (determine_offset): Renamed to...
18921 (pcom_worker::determine_offset): ...this.
18922 (class comp_ptrs): New class.
18923 (split_data_refs_to_components): Renamed to...
18924 (pcom_worker::split_data_refs_to_components): ...this,
18925 and update with class comp_ptrs.
18926 (suitable_component_p): Renamed to...
18927 (pcom_worker::suitable_component_p): ...this.
18928 (filter_suitable_components): Renamed to...
18929 (pcom_worker::filter_suitable_components): ...this.
18930 (valid_initializer_p): Renamed to...
18931 (pcom_worker::valid_initializer_p): ...this.
18932 (find_looparound_phi): Renamed to...
18933 (pcom_worker::find_looparound_phi): ...this.
18934 (add_looparound_copies): Renamed to...
18935 (pcom_worker::add_looparound_copies): ...this.
18936 (determine_roots_comp): Renamed to...
18937 (pcom_worker::determine_roots_comp): ...this.
18938 (determine_roots): Renamed to...
18939 (pcom_worker::determine_roots): ...this.
18940 (single_nonlooparound_use): Renamed to...
18941 (pcom_worker::single_nonlooparound_use): ...this.
18942 (remove_stmt): Renamed to...
18943 (pcom_worker::remove_stmt): ...this.
18944 (execute_pred_commoning_chain): Renamed to...
18945 (pcom_worker::execute_pred_commoning_chain): ...this.
18946 (execute_pred_commoning): Renamed to...
18947 (pcom_worker::execute_pred_commoning): ...this.
18948 (struct epcc_data): New member worker.
18949 (execute_pred_commoning_cbck): Call execute_pred_commoning
18950 with pcom_worker pointer.
18951 (find_use_stmt): Renamed to...
18952 (pcom_worker::find_use_stmt): ...this.
18953 (find_associative_operation_root): Renamed to...
18954 (pcom_worker::find_associative_operation_root): ...this.
18955 (find_common_use_stmt): Renamed to...
18956 (pcom_worker::find_common_use_stmt): ...this.
18957 (combinable_refs_p): Renamed to...
18958 (pcom_worker::combinable_refs_p): ...this.
18959 (reassociate_to_the_same_stmt): Renamed to...
18960 (pcom_worker::reassociate_to_the_same_stmt): ...this.
18961 (stmt_combining_refs): Renamed to...
18962 (pcom_worker::stmt_combining_refs): ...this.
18963 (combine_chains): Renamed to...
18964 (pcom_worker::combine_chains): ...this.
18965 (try_combine_chains): Renamed to...
18966 (pcom_worker::try_combine_chains): ...this.
18967 (prepare_initializers_chain): Renamed to...
18968 (pcom_worker::prepare_initializers_chain): ...this.
18969 (prepare_initializers): Renamed to...
18970 (pcom_worker::prepare_initializers): ...this.
18971 (prepare_finalizers_chain): Renamed to...
18972 (pcom_worker::prepare_finalizers_chain): ...this.
18973 (prepare_finalizers): Renamed to...
18974 (pcom_worker::prepare_finalizers): ...this.
18975 (tree_predictive_commoning_loop): Renamed to...
18976 (pcom_worker::tree_predictive_commoning_loop): ...this, adjust
18977 some calls and remove some cleanup code.
18978 (tree_predictive_commoning): Adjusted to use pcom_worker instance.
18979 (static variable looparound_phis): Remove.
18980 (static variable name_expansions): Remove.
18982 2021-06-24 Richard Biener <rguenther@suse.de>
18984 * tree-vect-slp.c (slpg_vertex): New struct.
18985 (vect_slp_build_vertices): Adjust.
18986 (vect_optimize_slp): Likewise. Maintain an outgoing permute
18987 and a materialized one.
18989 2021-06-24 Richard Biener <rguenther@suse.de>
18991 PR tree-optimization/101105
18992 * tree-vect-data-refs.c (vect_prune_runtime_alias_test_list):
18993 Only ignore steps when they are equal or scalar order is preserved.
18995 2021-06-24 liuhongt <hongtao.liu@intel.com>
18998 * config/i386/i386-expand.c (ix86_expand_vec_interleave):
18999 Adjust comments for ix86_expand_vecop_qihi2.
19000 (ix86_expand_vecmul_qihi): Renamed to ..
19001 (ix86_expand_vecop_qihi2): Adjust function prototype to
19002 support shift operation, add static to definition.
19003 (ix86_expand_vec_shift_qihi_constant): Add static to definition.
19004 (ix86_expand_vecop_qihi): Call ix86_expand_vecop_qihi2 and
19005 ix86_expand_vec_shift_qihi_constant.
19006 * config/i386/i386-protos.h (ix86_expand_vecmul_qihi): Deleted.
19007 (ix86_expand_vec_shift_qihi_constant): Deleted.
19008 * config/i386/sse.md (VI12_256_512_AVX512VL): New mode
19010 (mulv8qi3): Call ix86_expand_vecop_qihi directly, add
19011 condition TARGET_64BIT.
19012 (mul<mode>3): Ditto.
19013 (<insn><mode>3): Ditto.
19014 (vlshr<mode>3): Extend to support avx512 vlshr.
19015 (v<insn><mode>3): New expander for
19017 (v<insn>v8qi3): Ditto.
19018 (vashrv8hi3<mask_name>): Renamed to ..
19019 (vashr<mode>3): And extend to support V16QImode for avx512.
19020 (vashrv16qi3): Deleted.
19021 (vashrv2di3<mask_name>): Extend expander to support avx512
19024 2021-06-23 Dimitar Dimitrov <dimitar@dinux.eu>
19026 * doc/lto.texi (Design Overview): Update that slim objects are
19029 2021-06-23 Aaron Sawdey <acsawdey@linux.ibm.com>
19031 * config/rs6000/rs6000-cpus.def: Take OPTION_MASK_PCREL_OPT out
19032 of OTHER_POWER10_MASKS so it will not be enabled by default.
19034 2021-06-23 Richard Biener <rguenther@suse.de>
19035 Martin Jambor <mjambor@suse.cz>
19037 * tree-inline.c (setup_one_parameter): Set TREE_READONLY of the
19038 param replacement unconditionally. Adjust comment.
19040 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
19042 * Makefile.in (OBJS): Add gimple-range-fold.o
19043 * gimple-range-fold.cc: New.
19044 * gimple-range-fold.h: New.
19045 * gimple-range-gori.cc (gimple_range_calc_op1): Move to here.
19046 (gimple_range_calc_op2): Ditto.
19047 * gimple-range-gori.h: Move prototypes to here.
19048 * gimple-range.cc: Adjust include files.
19049 (fur_source:fur_source): Relocate to gimple-range-fold.cc.
19050 (fur_source::get_operand): Ditto.
19051 (fur_source::get_phi_operand): Ditto.
19052 (fur_source::query_relation): Ditto.
19053 (fur_source::register_relation): Ditto.
19054 (class fur_edge): Ditto.
19055 (fur_edge::fur_edge): Ditto.
19056 (fur_edge::get_operand): Ditto.
19057 (fur_edge::get_phi_operand): Ditto.
19058 (fur_stmt::fur_stmt): Ditto.
19059 (fur_stmt::get_operand): Ditto.
19060 (fur_stmt::get_phi_operand): Ditto.
19061 (fur_stmt::query_relation): Ditto.
19062 (class fur_depend): Relocate to gimple-range-fold.h.
19063 (fur_depend::fur_depend): Relocate to gimple-range-fold.cc.
19064 (fur_depend::register_relation): Ditto.
19065 (fur_depend::register_relation): Ditto.
19066 (class fur_list): Ditto.
19067 (fur_list::fur_list): Ditto.
19068 (fur_list::get_operand): Ditto.
19069 (fur_list::get_phi_operand): Ditto.
19070 (fold_range): Ditto.
19071 (adjust_pointer_diff_expr): Ditto.
19072 (gimple_range_adjustment): Ditto.
19073 (gimple_range_base_of_assignment): Ditto.
19074 (gimple_range_operand1): Ditto.
19075 (gimple_range_operand2): Ditto.
19076 (gimple_range_calc_op1): Relocate to gimple-range-gori.cc.
19077 (gimple_range_calc_op2): Ditto.
19078 (fold_using_range::fold_stmt): Relocate to gimple-range-fold.cc.
19079 (fold_using_range::range_of_range_op): Ditto.
19080 (fold_using_range::range_of_address): Ditto.
19081 (fold_using_range::range_of_phi): Ditto.
19082 (fold_using_range::range_of_call): Ditto.
19083 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
19084 (fold_using_range::range_of_builtin_call): Ditto.
19085 (fold_using_range::range_of_cond_expr): Ditto.
19086 (fold_using_range::range_of_ssa_name_with_loop_info): Ditto.
19087 (fold_using_range::relation_fold_and_or): Ditto.
19088 (fold_using_range::postfold_gcond_edges): Ditto.
19089 * gimple-range.h: Add gimple-range-fold.h to include files. Change
19090 GIMPLE_RANGE_STMT_H to GIMPLE_RANGE_H.
19091 (gimple_range_handler): Relocate to gimple-range-fold.h.
19092 (gimple_range_ssa_p): Ditto.
19093 (range_compatible_p): Ditto.
19094 (class fur_source): Ditto.
19095 (class fur_stmt): Ditto.
19096 (class fold_using_range): Ditto.
19097 (gimple_range_calc_op1): Relocate to gimple-range-gori.h
19098 (gimple_range_calc_op2): Ditto.
19100 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
19102 PR tree-optimization/101148
19103 PR tree-optimization/101014
19104 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
19105 (ranger_cache::~ranger_cache): Adjust.
19106 (ranger_cache::block_range): Check if propagation disallowed.
19107 (ranger_cache::propagate_cache): Disallow propagation if new value
19108 can't be stored properly.
19109 * gimple-range-cache.h (ranger_cache::m_propfail): New member.
19111 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
19113 * gimple-range-cache.cc (class ssa_block_ranges): Adjust prototype.
19114 (sbr_vector::set_bb_range): Return true.
19115 (class sbr_sparse_bitmap): Adjust.
19116 (sbr_sparse_bitmap::set_bb_range): Return value.
19117 (block_range_cache::set_bb_range): Return value.
19118 (ranger_cache::propagate_cache): Use return value to print msg.
19119 * gimple-range-cache.h (class block_range_cache): Adjust.
19121 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
19123 * gimple-range.cc (dump_bb): Use range_on_edge from the cache.
19125 2021-06-23 Jeff Law <jeffreyalaw@gmail.com>
19127 * config/h8300/logical.md (<code><mode>3<ccnz>): Use <cczn>
19128 so this pattern can be used for test/compare removal. Pass
19129 current insn to compute_logical_op_length and output_logical_op.
19130 * config/h8300/h8300.c (compute_logical_op_cc): Remove.
19131 (h8300_and_costs): Add argument to compute_logical_op_length.
19132 (output_logical_op): Add new argument. Use it to determine if the
19133 condition codes are used and adjust the output accordingly.
19134 (compute_logical_op_length): Add new argument and update length
19135 computations when condition codes are used.
19136 * config/h8300/h8300-protos.h (compute_logical_op_length): Update
19138 (output_logical_op): Likewise.
19140 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
19143 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
19144 Handle 64bit modes for TARGET_XOP. Use indirect gen_* functions.
19145 * config/i386/mmx.md (mmx_ppermv64): New insn pattern.
19146 * config/i386/i386.md (unspec): Move UNSPEC_XOP_PERMUTE from ...
19147 * config/i386/sse.md (unspec): ... here.
19149 2021-06-23 Martin Liska <mliska@suse.cz>
19152 * optc-save-gen.awk: Put back arm_fp16_format to
19155 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
19158 * config/i386/i386.md (bsr_rex64): Add zero-flag setting RTX.
19161 (clz<mode>2): Update RTX pattern for additions.
19163 2021-06-23 Jakub Jelinek <jakub@redhat.com>
19165 PR middle-end/101167
19166 * omp-low.c (lower_omp_regimplify_p): Regimplify also PARM_DECLs
19167 and RESULT_DECLs that have DECL_HAS_VALUE_EXPR_P set.
19169 2021-06-22 Sergei Trofimovich <siarheit@google.com>
19171 * doc/rtl.texi: drop unbalanced parenthesis.
19173 2021-06-22 Richard Biener <rguenther@suse.de>
19175 PR middle-end/101156
19176 * gimplify.c (gimplify_expr): Remove premature incorrect
19179 2021-06-22 Jakub Jelinek <jakub@redhat.com>
19181 PR tree-optimization/101159
19182 * tree-vect-patterns.c (vect_recog_popcount_pattern): Fix some
19185 2021-06-22 Jakub Jelinek <jakub@redhat.com>
19187 PR middle-end/101160
19188 * function.c (assign_parms): For decl_result with TYPE_EMPTY_P type
19189 clear crtl->return_rtx instead of keeping it referencing a pseudo.
19191 2021-06-22 Jakub Jelinek <jakub@redhat.com>
19192 Andrew Pinski <apinski@marvell.com>
19194 PR tree-optimization/101162
19195 * fold-const.c (range_check_type): Handle OFFSET_TYPE like pointer
19198 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19200 * range-op.cc (range_relational_tests): New.
19201 (range_op_tests): Call range_relational_tests.
19203 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19205 * range-op.cc (operator_cast::lhs_op1_relation): New.
19206 (operator_identity::lhs_op1_relation): Mew.
19208 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19210 * range-op.cc (operator_minus::op1_op2_relation_effect): New.
19212 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19214 * range-op.cc (operator_plus::lhs_op1_relation): New.
19215 (operator_plus::lhs_op2_relation): New.
19217 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19219 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a
19220 relation_oracle if dominators exist.
19221 (ranger_cache::~ranger_cache): Dispose of oracle.
19222 (ranger_cache::dump_bb): Dump oracle.
19223 * gimple-range.cc (fur_source::fur_source): New.
19224 (fur_source::get_operand): Use mmeber query.
19225 (fur_source::get_phi_operand): Use member_query.
19226 (fur_source::query_relation): New.
19227 (fur_source::register_dependency): Delete.
19228 (fur_source::register_relation): New.
19229 (fur_edge::fur_edge): Adjust.
19230 (fur_edge::get_phi_operand): Fix comment.
19231 (fur_edge::query): Delete.
19232 (fur_stmt::fur_stmt): Adjust.
19233 (fur_stmt::query): Delete.
19234 (fur_depend::fur_depend): Adjust.
19235 (fur_depend::register_relation): New.
19236 (fur_depend::register_relation): New.
19237 (fur_list::fur_list): Adjust.
19238 (fur_list::get_operand): Use member query.
19239 (fold_using_range::range_of_range_op): Process and query relations.
19240 (fold_using_range::range_of_address): Adjust dependency call.
19241 (fold_using_range::range_of_phi): Ditto.
19242 (gimple_ranger::gimple_ranger): New. Use ranger_ache oracle.
19243 (fold_using_range::relation_fold_and_or): New.
19244 (fold_using_range::postfold_gcond_edges): New.
19245 * gimple-range.h (class gimple_ranger): Adjust.
19246 (class fur_source): Adjust members.
19247 (class fur_stmt): Ditto.
19248 (class fold_using_range): Ditto.
19250 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19252 * range-op.cc (range_operator::wi_fold): Apply relation effect.
19253 (range_operator::fold_range): Adjust and apply relation effect.
19254 (*::fold_range): Add relation parameters.
19255 (*::op1_range): Ditto.
19256 (*::op2_range): Ditto.
19257 (range_operator::lhs_op1_relation): New.
19258 (range_operator::lhs_op2_relation): New.
19259 (range_operator::op1_op2_relation): New.
19260 (range_operator::op1_op2_relation_effect): New.
19261 (relop_early_resolve): New.
19262 (operator_equal::op1_op2_relation): New.
19263 (operator_equal::fold_range): Call relop_early_resolve.
19264 (operator_not_equal::op1_op2_relation): New.
19265 (operator_not_equal::fold_range): Call relop_early_resolve.
19266 (operator_lt::op1_op2_relation): New.
19267 (operator_lt::fold_range): Call relop_early_resolve.
19268 (operator_le::op1_op2_relation): New.
19269 (operator_le::fold_range): Call relop_early_resolve.
19270 (operator_gt::op1_op2_relation): New.
19271 (operator_gt::fold_range): Call relop_early_resolve.
19272 (operator_ge::op1_op2_relation): New.
19273 (operator_ge::fold_range): Call relop_early_resolve.
19274 * range-op.h (class range_operator): Adjust parameters and methods.
19276 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
19278 * Makefile.in (OBJS): Add value-relation.o.
19279 * gimple-range.h: Adjust include files.
19280 * tree-data-ref.c: Adjust include file order.
19281 * value-query.cc (range_query::get_value_range): Default to no oracle.
19282 (range_query::query_relation): New.
19283 (range_query::query_relation): New.
19284 * value-query.h (class range_query): Adjust.
19285 * value-relation.cc: New.
19286 * value-relation.h: New.
19288 2021-06-22 Richard Biener <rguenther@suse.de>
19290 PR tree-optimization/101151
19291 * tree-ssa-sink.c (statement_sink_location): Expand irreducible
19294 2021-06-22 Jojo R <rjiejie@linux.alibaba.com>
19296 * config/riscv/riscv.c (thead_c906_tune_info): New.
19297 (riscv_tune_info_table): Use new tune.
19299 2021-06-22 Richard Biener <rguenther@suse.de>
19301 PR tree-optimization/101158
19302 * tree-vect-slp.c (vect_build_slp_tree_1): Move same operand
19303 checking after checking for matching operation.
19305 2021-06-22 Richard Biener <rguenther@suse.de>
19307 PR tree-optimization/101159
19308 * tree-vect-patterns.c (vect_recog_popcount_pattern): Add
19309 missing NULL vectype check.
19311 2021-06-22 Richard Biener <rguenther@suse.de>
19313 PR tree-optimization/101154
19314 * tree-vect-slp.c (vect_build_slp_tree_2): Fix out-of-bound access.
19316 2021-06-22 Jakub Jelinek <jakub@redhat.com>
19319 * config/i386/i386-protos.h (ix86_last_zero_store_uid): Declare.
19320 * config/i386/i386-expand.c (ix86_last_zero_store_uid): New variable.
19321 * config/i386/i386.c (ix86_expand_prologue): Clear it.
19322 * config/i386/i386.md (peephole2s for 1/2/4 stores of const0_rtx):
19323 Remove "" from match_operand. Emit new insns using emit_move_insn and
19324 set ix86_last_zero_store_uid to INSN_UID of the last store.
19325 Add peephole2s for 1/2/4 stores of const0_rtx following previous
19328 2021-06-22 Martin Liska <mliska@suse.cz>
19330 * auto-profile.c (AUTO_PROFILE_VERSION): Bump as string format
19333 2021-06-22 Martin Liska <mliska@suse.cz>
19335 * gcov-io.h: Remove padding entries.
19337 2021-06-22 liuhongt <hongtao.liu@intel.com>
19339 PR tree-optimization/97770
19340 * tree-vect-patterns.c (vect_recog_popcount_pattern):
19342 (vect_recog_func vect_vect_recog_func_ptrs): Add new pattern.
19344 2021-06-22 liuhongt <hongtao.liu@intel.com>
19347 * config/i386/i386-builtin.def (BDESC): Adjust builtin name.
19348 * config/i386/sse.md (<avx512>_expand<mode>_mask): Rename to ..
19349 (expand<mode>_mask): this ..
19350 (*expand<mode>_mask): New pre_reload splitter to transform
19351 v{,p}expand* to vmov* when mask is zero, all ones, or has all
19352 ones in it's lower part, otherwise still generate
19355 2021-06-22 liuhongt <hongtao.liu@intel.com>
19358 * config/i386/i386-expand.c
19359 (ix86_expand_special_args_builtin): Keep constm1_operand only
19360 if it satisfies insn's operand predicate.
19362 2021-06-21 Jason Merrill <jason@redhat.com>
19365 * df-scan.c (df_ref_record): Check that regno < endregno.
19366 * function.c (assign_parms, expand_function_end): Do nothing with a
19367 TYPE_EMPTY_P result.
19369 2021-06-21 Richard Biener <rguenther@suse.de>
19371 PR tree-optimization/101120
19372 * tree-vect-data-refs.c (bump_vector_ptr): Fold the
19374 * tree-vect-slp.c (vect_transform_slp_perm_load): Add
19375 DR chain DCE capability.
19376 * tree-vectorizer.h (vect_transform_slp_perm_load): Adjust.
19377 * tree-vect-stmts.c (vectorizable_load): Remove unused
19378 loads in the DR chain for SLP.
19380 2021-06-21 Jakub Jelinek <jakub@redhat.com>
19382 PR inline-asm/100785
19383 * gimplify.c (gimplify_asm_expr): Don't diagnose errors if
19384 output or input operands were already error_mark_node.
19385 * cfgexpand.c (expand_asm_stmt): If errors are emitted,
19386 remove all inputs, outputs and clobbers from the asm and
19387 set template to "".
19389 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
19391 * config/arm/arm_neon.h (vceq_s8): Replace builtin with __a == __b.
19392 (vceq_s16): Likewise.
19393 (vceq_s32): Likewise.
19394 (vceq_u8): Likewise.
19395 (vceq_u16): Likewise.
19396 (vceq_u32): Likewise.
19397 (vceq_p8): Likewise.
19398 (vceqq_s8): Likewise.
19399 (vceqq_s16): Likewise.
19400 (vceqq_s32): Likewise.
19401 (vceqq_u8): Likewise.
19402 (vceqq_u16): Likewise.
19403 (vceqq_u32): Likewise.
19404 (vceqq_p8): Likewise.
19405 (vceq_f32): Gate __a == __b on __FAST_MATH__.
19406 (vceqq_f32): Likewise.
19407 (vceq_f16): Likewise.
19408 (vceqq_f16): Likewise.
19410 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
19413 * config/arm/iterators.md (NEON_VACMP): Remove.
19414 * config/arm/neon.md (neon_vca<cmp_op><mode>): Use GLTE instead of GTGE
19416 (neon_vca<cmp_op><mode>_insn): Likewise.
19417 (neon_vca<cmp_op_unsp><mode>_insn_unspec): Use NEON_VAGLTE instead of
19420 2021-06-21 Richard Biener <rguenther@suse.de>
19422 PR tree-optimization/101121
19423 * tree-vect-slp.c (vect_build_slp_tree_2): To not fail fatally
19424 when we just lack a stmt with the desired op when doing permutation.
19425 (vect_build_slp_tree): When caching a failed SLP build attempt
19426 assert that at least one lane is marked as not matching.
19428 2021-06-21 liuhongt <hongtao.liu@intel.com>
19431 * config/i386/i386.md: (*anddi_1): Disparage slightly the mask
19432 register alternative.
19433 (*and<mode>_1): Ditto.
19435 (*andn<mode>_1): Ditto.
19436 (*<code><mode>_1): Ditto.
19437 (*<code>qi_1): Ditto.
19438 (*one_cmpl<mode>2_1): Ditto.
19439 (*one_cmplsi2_1_zext): Ditto.
19440 (*one_cmplqi2_1): Ditto.
19441 * config/i386/i386.c (x86_order_regs_for_local_alloc): Change
19442 the order of mask registers to be before general registers.
19444 2021-06-21 Roger Sayle <roger@nextmovesoftware.com>
19447 * config/i386/i386.md: New define_peephole2s to shrink writing
19448 1, 2 or 4 consecutive zeros to memory when optimizing for size.
19450 2021-06-18 Jeff Law <jeffreyalaw@gmail.com>
19452 * config/h8300/h8300.c (h8300_select_cc_mode): Handle SYMBOL_REF.
19453 * config/h8300/logical.md (<code><mode>3 logcial expander): Generate
19454 more efficient code when the source can be trivially simplified.
19456 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
19458 * gimple-range-cache.cc (ranger_cache::range_of_def): Calculate
19459 a range if global is not available.
19460 (ranger_cache::entry_range): Fallback to range_of_def.
19461 * gimple-range-cache.h (range_of_def): Adjust prototype.
19463 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
19465 PR tree-optimization/101014
19466 * gimple-range-cache.cc (ranger_cache::ranger_cache): Remove poor
19468 (ranger_cache::~ranger_cache): Ditto.
19469 (ranger_cache::enable_new_values): Delete.
19470 (ranger_cache::push_poor_value): Delete.
19471 (ranger_cache::range_of_def): Remove poor value processing.
19472 (ranger_cache::entry_range): Ditto.
19473 (ranger_cache::fill_block_cache): Ditto.
19474 * gimple-range-cache.h (class ranger_cache): Remove poor value members.
19475 * gimple-range.cc (gimple_ranger::range_of_expr): Remove call.
19476 * gimple-range.h (class gimple_ranger): Adjust.
19478 2021-06-18 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
19481 * common/config/arm/arm-common.c (arm_canon_arch_option_1): New function
19482 derived from arm_canon_arch.
19483 (arm_canon_arch_option): Call it.
19484 (arm_canon_arch_multilib_option): New function.
19485 * config/arm/arm-cpus.in (IGNORE_FOR_MULTILIB): New fgroup.
19486 * config/arm/arm.h (arm_canon_arch_multilib_option): New prototype.
19487 (CANON_ARCH_MULTILIB_SPEC_FUNCTION): New macro.
19488 (MULTILIB_ARCH_CANONICAL_SPECS): New macro.
19489 (DRIVER_SELF_SPECS): Add MULTILIB_ARCH_CANONICAL_SPECS.
19490 * config/arm/arm.opt (mlibarch): New option.
19491 * config/arm/t-rmprofile (MULTILIB_MATCHES): For armv8*-m, replace use
19492 of march on RHS with mlibarch.
19494 2021-06-18 Marcel Vollweiler <marcel@codesourcery.com>
19496 * config.in: Regenerate.
19497 * config/gcn/gcn.c (print_operand_address): Fix for global_load assembler
19499 * configure: Regenerate.
19500 * configure.ac: Fix for global_load assembler functions.
19502 2021-06-18 Richard Biener <rguenther@suse.de>
19504 PR tree-optimization/101112
19505 * tree-vect-slp.c (vect_slp_linearize_chain): Fix condition
19506 to lookup a pattern stmt def.
19508 2021-06-18 Jakub Jelinek <jakub@redhat.com>
19510 PR middle-end/101062
19511 * stor-layout.c (finish_bitfield_layout): Don't add bitfield
19512 representatives in QUAL_UNION_TYPE.
19514 2021-06-18 Andrew Pinski <apinski@marvell.com>
19516 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
19517 Add counting of how many times it is done.
19518 (factor_out_conditional_conversion): Likewise.
19519 (match_simplify_replacement): Likewise.
19520 (value_replacement): Likewise.
19521 (spaceship_replacement): Likewise.
19522 (cond_store_replacement): Likewise.
19523 (cond_if_else_store_replacement_1): Likewise.
19524 (hoist_adjacent_loads): Likewise.
19526 2021-06-18 Andrew Pinski <apinski@marvell.com>
19528 * tree-cfg.c (verify_gimple_assign_unary): Reject point and offset
19529 types on NEGATE_EXPR, ABS_EXPR, BIT_NOT_EXPR, PAREN_EXPR and CNONJ_EXPR.
19530 (verify_gimple_assign_binary): Reject point and offset types on
19531 MULT_EXPR, MULT_HIGHPART_EXPR, TRUNC_DIV_EXPR, CEIL_DIV_EXPR,
19532 FLOOR_DIV_EXPR, ROUND_DIV_EXPR, TRUNC_MOD_EXPR, CEIL_MOD_EXPR,
19533 FLOOR_MOD_EXPR, ROUND_MOD_EXPR, RDIV_EXPR, and EXACT_DIV_EXPR.
19535 2021-06-18 Michael Meissner <meissner@linux.ibm.com>
19537 * config/rs6000/rs6000.c (rs6000_emit_minmax): Add support for ISA
19538 3.1 IEEE 128-bit floating point xsmaxcqp/xsmincqp instructions.
19539 * config/rs6000/rs6000.md (s<minmax><mode>3, IEEE128 iterator):
19542 2021-06-17 Aaron Sawdey <acsawdey@linux.ibm.com>
19544 * config/rs6000/genfusion.pl (gen_logical_addsubf): Add
19545 earlyclobber to alts 0/1.
19546 (gen_addadd): Add earlyclobber to alts 0/1.
19547 * config/rs6000/fusion.md: Regenerate file.
19549 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
19551 * cfgloopanal.c (get_loop_hot_path): Make path an auto_vec.
19553 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
19555 * gimple-range-cache.cc: Comment cleanups.
19556 * gimple-range-gori.cc: Comment cleanups.
19557 * gimple-range.cc: Comment/spacing cleanups
19558 * value-range.h: Comment cleanups.
19560 2021-06-17 H.J. Lu <hjl.tools@gmail.com>
19563 * calls.c (expand_call): Replace PUSH_ARGS with
19564 targetm.calls.push_argument (0).
19565 (emit_library_call_value_1): Likewise.
19566 * defaults.h (PUSH_ARGS): Removed.
19567 (PUSH_ARGS_REVERSED): Replace PUSH_ARGS with
19568 targetm.calls.push_argument (0).
19569 * expr.c (block_move_libcall_safe_for_call_parm): Likewise.
19570 (emit_push_insn): Pass the number bytes to push to
19571 targetm.calls.push_argument and pass 0 if ARGS_ADDR is 0.
19572 * hooks.c (hook_bool_uint_true): New.
19573 * hooks.h (hook_bool_uint_true): Likewise.
19574 * rtlanal.c (nonzero_bits1): Replace PUSH_ARGS with
19575 targetm.calls.push_argument (0).
19576 * target.def (push_argument): Add a targetm.calls hook.
19577 * targhooks.c (default_push_argument): New.
19578 * targhooks.h (default_push_argument): Likewise.
19579 * config/bpf/bpf.h (PUSH_ARGS): Removed.
19580 * config/cr16/cr16.c (TARGET_PUSH_ARGUMENT): New.
19581 * config/cr16/cr16.h (PUSH_ARGS): Removed.
19582 * config/i386/i386.c (ix86_push_argument): New.
19583 (TARGET_PUSH_ARGUMENT): Likewise.
19584 * config/i386/i386.h (PUSH_ARGS): Removed.
19585 * config/m32c/m32c.c (TARGET_PUSH_ARGUMENT): New.
19586 * config/m32c/m32c.h (PUSH_ARGS): Removed.
19587 * config/nios2/nios2.h (PUSH_ARGS): Likewise.
19588 * config/pru/pru.h (PUSH_ARGS): Likewise.
19589 * doc/tm.texi.in: Remove PUSH_ARGS documentation. Add
19590 TARGET_PUSH_ARGUMENT hook.
19591 * doc/tm.texi: Regenerated.
19593 2021-06-17 Uroš Bizjak <ubizjak@gmail.com>
19596 * config/i386/i386-expand.c (expand_vector_set_var):
19597 Handle V2FS mode remapping. Pass TARGET_MMX_WITH_SSE to
19598 ix86_expand_vector_init_duplicate.
19599 (ix86_expand_vector_init_duplicate): Emit insv_1 for
19600 QImode for !TARGET_PARTIAL_REG_STALL.
19601 * config/i386/predicates.md (vec_setm_mmx_operand): New predicate.
19602 * config/i386/mmx.md (vec_setv2sf): Use vec_setm_mmx_operand
19603 as operand 2 predicate. Call ix86_expand_vector_set_var
19604 for non-constant index operand.
19605 (vec_setv2si): Ditto.
19606 (vec_setv4hi): Ditto.
19607 (vec_setv8qi): ditto.
19609 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
19611 PR tree-optimization/100790
19612 * gimple-range.cc (range_of_builtin_call): Cleanup clz and ctz
19615 2021-06-17 Martin Liska <mliska@suse.cz>
19617 * doc/invoke.texi: Use consistently -O1 instead of -O.
19619 2021-06-17 Martin Liska <mliska@suse.cz>
19621 * gcov-io.h: Update documentation entry about string format.
19623 2021-06-17 Marius Hillenbrand <mhillen@linux.ibm.com>
19626 * config/s390/vecintrin.h (vec_doublee): Fix to use
19627 __builtin_s390_vflls.
19628 (vec_floate): Fix to use __builtin_s390_vflrd.
19630 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
19632 * dominance.c (get_dominated_to_depth): Return auto_vec<basic_block>.
19633 * dominance.h (get_dominated_to_depth): Likewise.
19634 (get_all_dominated_blocks): Likewise.
19635 * cfgcleanup.c (delete_unreachable_blocks): Adjust.
19636 * gcse.c (hoist_code): Likewise.
19637 * tree-cfg.c (remove_edge_and_dominated_blocks): Likewise.
19638 * tree-parloops.c (oacc_entry_exit_ok): Likewise.
19639 * tree-ssa-dce.c (eliminate_unnecessary_stmts): Likewise.
19640 * tree-ssa-phiprop.c (pass_phiprop::execute): Likewise.
19642 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
19644 * dominance.c (get_dominated_by_region): Return auto_vec<basic_block>.
19645 * dominance.h (get_dominated_by_region): Likewise.
19646 * tree-cfg.c (gimple_duplicate_sese_region): Adjust.
19647 (gimple_duplicate_sese_tail): Likewise.
19648 (move_sese_region_to_fn): Likewise.
19650 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
19652 * dominance.c (get_dominated_by): Return auto_vec<basic_block>.
19653 * dominance.h (get_dominated_by): Likewise.
19654 * auto-profile.c (afdo_find_equiv_class): Adjust.
19655 * cfgloopmanip.c (duplicate_loop_to_header_edge): Likewise.
19656 * loop-unroll.c (unroll_loop_runtime_iterations): Likewise.
19657 * tree-cfg.c (test_linear_chain): Likewise.
19658 (test_diamond): Likewise.
19660 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
19662 * cfgloop.h (get_loop_hot_path): Return auto_vec<basic_block>.
19663 * cfgloopanal.c (get_loop_hot_path): Likewise.
19664 * tree-ssa-loop-ivcanon.c (tree_estimate_loop_size): Likewise.
19666 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
19668 * cgraph.c (cgraph_node::collect_callers): Return
19669 auto_vec<cgraph_edge *>.
19670 * cgraph.h (cgraph_node::collect_callers): Likewise.
19671 * ipa-cp.c (create_specialized_node): Adjust.
19672 (decide_about_value): Likewise.
19673 (decide_whether_version_node): Likewise.
19674 * ipa-sra.c (process_isra_node_results): Likewise.
19676 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
19678 * vec.h (vl_ptr>::using_auto_storage): Handle null m_vec.
19679 (auto_vec<T, 0>::auto_vec): Define move constructor, and delete copy
19681 (auto_vec<T, 0>::operator=): Define move assignment and delete copy
19684 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
19686 * gimple-range.cc (debug_seed_ranger): New.
19687 (dump_ranger): New.
19688 (debug_ranger): New.
19690 2021-06-17 Richard Biener <rguenther@suse.de>
19692 PR tree-optimization/54400
19693 * tree-vectorizer.h (enum slp_instance_kind): Add
19694 slp_inst_kind_bb_reduc.
19695 (reduction_fn_for_scalar_code): Declare.
19696 * tree-vect-data-refs.c (vect_slp_analyze_instance_dependence):
19697 Check SLP_INSTANCE_KIND instead of looking at the
19699 (vect_slp_analyze_instance_alignment): Likewise.
19700 * tree-vect-loop.c (reduction_fn_for_scalar_code): Export.
19701 * tree-vect-slp.c (vect_slp_linearize_chain): Split out
19702 chain linearization from vect_build_slp_tree_2 and generalize
19703 for the use of BB reduction vectorization.
19704 (vect_build_slp_tree_2): Adjust accordingly.
19705 (vect_optimize_slp): Elide permutes at the root of BB reduction
19707 (vectorizable_bb_reduc_epilogue): New function.
19708 (vect_slp_prune_covered_roots): Likewise.
19709 (vect_slp_analyze_operations): Use them.
19710 (vect_slp_check_for_constructors): Recognize associatable
19711 chains for BB reduction vectorization.
19712 (vectorize_slp_instance_root_stmt): Generate code for the
19713 BB reduction epilogue.
19715 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
19717 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Check with
19719 (gori_compute::may_recompute_p): New.
19720 (gori_compute::outgoing_edge_range_p): Perform recomputations.
19721 * gimple-range-gori.h (class gori_compute): Add prototype.
19723 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
19725 * gimple-range-cache.cc (ranger_cache::range_on_edge): Always return
19726 true when a range can be calculated.
19727 * gimple-range.cc (gimple_ranger::dump_bb): Check has_edge_range_p.
19729 2021-06-16 Martin Sebor <msebor@redhat.com>
19731 * doc/invoke.texi (-Wmismatched-dealloc, -Wmismatched-new-delete):
19732 Correct documented defaults.
19734 2021-06-16 Andrew MacLeod <amacleod@redhat.com>
19736 * gimple-range-cache.cc (ranger_cache::ranger_cache): Initialize
19737 m_new_value_p directly.
19739 2021-06-16 Uroš Bizjak <ubizjak@gmail.com>
19742 * config/i386/i386-expand.c (expand_vec_perm_2perm_pblendv):
19743 Handle 64bit modes for TARGET_SSE4_1.
19744 (expand_vec_perm_pshufb2): Handle 64bit modes for TARGET_SSSE3.
19745 (expand_vec_perm_even_odd_pack): Handle V4HI mode.
19746 (expand_vec_perm_even_odd_1) <case E_V4HImode>: Expand via
19747 expand_vec_perm_pshufb2 for TARGET_SSSE3 and via
19748 expand_vec_perm_even_odd_pack for TARGET_SSE4_1.
19749 * config/i386/mmx.md (mmx_packusdw): New insn pattern.
19751 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
19753 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn<mode>):
19754 Change to an expander that emits the correct instruction
19755 depending on endianness.
19756 (aarch64_<sur><addsub>hn<mode>_insn_le): Define.
19757 (aarch64_<sur><addsub>hn<mode>_insn_be): Define.
19759 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
19761 * config/aarch64/aarch64-simd-builtins.def: Split generator
19762 for aarch64_<su>qmovn builtins into scalar and vector
19764 * config/aarch64/aarch64-simd.md (aarch64_<su>qmovn<mode>_insn_le):
19766 (aarch64_<su>qmovn<mode>_insn_be): Define.
19767 (aarch64_<su>qmovn<mode>): Split into scalar and vector
19768 variants. Change vector variant to an expander that emits the
19769 correct instruction depending on endianness.
19771 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
19773 * config/aarch64/aarch64-simd-builtins.def: Split generator
19774 for aarch64_sqmovun builtins into scalar and vector variants.
19775 * config/aarch64/aarch64-simd.md (aarch64_sqmovun<mode>):
19776 Split into scalar and vector variants. Change vector variant
19777 to an expander that emits the correct instruction depending
19779 (aarch64_sqmovun<mode>_insn_le): Define.
19780 (aarch64_sqmovun<mode>_insn_be): Define.
19782 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
19784 * config/aarch64/aarch64-simd.md (aarch64_xtn<mode>_insn_le):
19785 Define - modeling zero-high-half semantics.
19786 (aarch64_xtn<mode>): Change to an expander that emits the
19787 appropriate instruction depending on endianness.
19788 (aarch64_xtn<mode>_insn_be): Define - modeling zero-high-half
19790 (aarch64_xtn2<mode>_le): Rename to...
19791 (aarch64_xtn2<mode>_insn_le): This.
19792 (aarch64_xtn2<mode>_be): Rename to...
19793 (aarch64_xtn2<mode>_insn_be): This.
19794 (vec_pack_trunc_<mode>): Emit truncation instruction instead
19796 * config/aarch64/iterators.md (Vnarrowd): Add Vnarrowd mode
19797 attribute iterator.
19799 2021-06-16 Martin Jambor <mjambor@suse.cz>
19801 PR tree-optimization/100453
19802 * tree-sra.c (create_access): Disqualify any const candidates
19803 which are written to.
19804 (sra_modify_expr): Do not store sub-replacements back to a const base.
19805 (handle_unscalarized_data_in_subtree): Likewise.
19806 (sra_modify_assign): Likewise. Earlier, use TREE_READONLy test
19807 instead of constant_decl_p.
19809 2021-06-16 Jakub Jelinek <jakub@redhat.com>
19811 PR middle-end/101062
19812 * stor-layout.c (finish_bitfield_representative): For fields in unions
19813 assume nextf is always NULL.
19814 (finish_bitfield_layout): Compute bit field representatives also in
19815 unions, but handle it as if each bitfield was the only field in the
19818 2021-06-16 Richard Biener <rguenther@suse.de>
19820 PR tree-optimization/101088
19821 * tree-ssa-loop-im.c (sm_seq_valid_bb): Only look for
19822 supported refs on edges. Do not assert same ref but
19823 different kind stores are unsuported but mark them so.
19824 (hoist_memory_references): Only look for supported refs
19827 2021-06-16 Roger Sayle <roger@nextmovesoftware.com>
19829 PR rtl-optimization/46235
19830 * config/i386/i386.md: New define_split for bt followed by cmov.
19831 (*bt<mode>_setcqi): New define_insn_and_split for bt followed by setc.
19832 (*bt<mode>_setncqi): New define_insn_and_split for bt then setnc.
19833 (*bt<mode>_setnc<mode>): New define_insn_and_split for bt followed
19834 by setnc with zero extension.
19836 2021-06-16 Richard Biener <rguenther@suse.de>
19838 PR tree-optimization/101083
19839 * tree-vect-slp.c (vect_slp_build_two_operator_nodes): Get
19840 vectype as argument.
19841 (vect_build_slp_tree_2): Adjust.
19843 2021-06-15 Martin Sebor <msebor@redhat.com>
19845 PR middle-end/100876
19846 * builtins.c: (gimple_call_return_array): Account for size_t
19847 mangling as either unsigned int or unsigned long
19849 2021-06-15 Jeff Law <jeffreyalaw@gmail.com>
19851 * compare-elim.c (try_eliminate_compare): Run DCE to clean things
19852 up before eliminating comparisons.
19854 2021-06-15 Aldy Hernandez <aldyh@redhat.com>
19856 * range-op.cc (operator_bitwise_or::wi_fold): Make sure
19857 nonzero|X is nonzero.
19858 (range_op_bitwise_and_tests): Add tests for above.
19860 2021-06-15 Carl Love <cel@us.ibm.com>
19863 * config/rs6000/rs6000-builtin.def (VCMPEQUT): Fix the ICODE for the
19865 (VRLQ, VSLQ, VSRQ, VSRAQ): Remove unused BU_P10_OVERLOAD_2
19868 2021-06-15 Tobias Burnus <tobias@codesourcery.com>
19871 * gimplify.c (enum gimplify_defaultmap_kind): Add GDMK_SCALAR_TARGET.
19872 (struct gimplify_omp_ctx): Extend defaultmap array by one.
19873 (new_omp_context): Init defaultmap[GDMK_SCALAR_TARGET].
19874 (omp_notice_variable): Update type classification for Fortran.
19875 (gimplify_scan_omp_clauses): Update calls for new argument; handle
19876 GDMK_SCALAR_TARGET; for Fortran, GDMK_POINTER avoid GOVD_MAP_0LEN_ARRAY.
19877 * langhooks-def.h (lhd_omp_scalar_p): Add 'ptr_ok' argument.
19878 * langhooks.c (lhd_omp_scalar_p): Likewise.
19879 (LANG_HOOKS_OMP_ALLOCATABLE_P, LANG_HOOKS_OMP_SCALAR_TARGET_P): New.
19880 (LANG_HOOKS_DECLS): Add them.
19881 * langhooks.h (struct lang_hooks_for_decls): Add new hooks, update
19882 omp_scalar_p pointer type to include the new bool argument.
19884 2021-06-15 David Malcolm <dmalcolm@redhat.com>
19886 * doc/analyzer.texi
19887 (Special Functions for Debugging the Analyzer): Add
19888 __analyzer_dump_capacity.
19890 2021-06-15 Jakub Jelinek <jakub@redhat.com>
19893 * expr.c (expand_expr_real_2) <case VEC_PACK_FIX_TRUNC_EXPR,
19894 case VEC_PACK_TRUNC_EXPR>: Clear subtarget when changing mode.
19896 2021-06-15 Richard Biener <rguenther@suse.de>
19898 * cfgloopanal.c (mark_irreducible_loops): Use a dominance
19899 check to identify loop latches.
19900 * cfgloop.c (verify_loop_structure): Likewise.
19901 * loop-init.c (apply_loop_flags): Allow marked irreducible
19902 regions even with multiple latches.
19903 * predict.c (rebuild_frequencies): Simplify.
19905 2021-06-15 Richard Biener <rguenther@suse.de>
19907 * tree-ssa-threadupdate.c
19908 (jump_thread_path_registry::mark_threaded_blocks): Assert we
19909 have marked irreducible regions.
19911 2021-06-14 Martin Sebor <msebor@redhat.com>
19914 * builtins.c (gimple_call_return_array): Check for attribute fn spec.
19915 Handle calls to placement new.
19916 (ndecl_dealloc_argno): Avoid placement delete.
19918 2021-06-14 Peter Bergner <bergner@linux.ibm.com>
19921 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Use
19922 create_tmp_reg_or_ssa_name().
19924 2021-06-14 Andrew MacLeod <amacleod@redhat.com>
19926 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
19927 (ranger_cache::enable_new_values): Set to specified value and
19928 return the old value.
19929 (ranger_cache::disable_new_values): Delete.
19930 (ranger_cache::fill_block_cache): Disable non 1st order derived
19932 * gimple-range-cache.h (ranger_cache): Adjust prototypes.
19933 * gimple-range.cc (gimple_ranger::range_of_expr): Adjust.
19935 2021-06-14 Uroš Bizjak <ubizjak@gmail.com>
19938 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const):
19939 Return true early when testing with V2HImode.
19940 * config/i386/mmx.md (*punpckwd): Split to sse2_pshuflw_1.
19942 2021-06-14 Christophe Lyon <christophe.lyon@linaro.org>
19944 * config/arm/mve.md (mve_vec_unpack<US>_lo_<mode>): New pattern.
19945 (mve_vec_unpack<US>_hi_<mode>): New pattern.
19946 (@mve_vec_pack_trunc_lo_<mode>): New pattern.
19947 (mve_vmovntq_<supf><mode>): Prefix with '@'.
19948 * config/arm/neon.md (vec_unpack<US>_hi_<mode>): Move to
19950 (vec_unpack<US>_lo_<mode>): Likewise.
19951 (vec_pack_trunc_<mode>): Rename to
19952 neon_quad_vec_pack_trunc_<mode>.
19953 * config/arm/vec-common.md (vec_unpack<US>_hi_<mode>): New
19955 (vec_unpack<US>_lo_<mode>): New.
19956 (vec_pack_trunc_<mode>): New.
19958 2021-06-14 Richard Biener <rguenther@suse.de>
19960 PR tree-optimization/100934
19961 * tree-ssa-dom.c (pass_dominator::execute): Properly
19962 mark irreducible regions.
19964 2021-06-14 Martin Liska <mliska@suse.cz>
19966 * doc/invoke.texi: Put r{...} on the same line as @item.
19968 2021-06-14 Martin Liska <mliska@suse.cz>
19970 * doc/invoke.texi: Add missing newline.
19972 2021-06-14 Martin Liska <mliska@suse.cz>
19974 * doc/invoke.texi: Remove '+' charasters.
19976 2021-06-14 Claudiu Zissulescu <claziss@synopsys.com>
19978 * config.gcc (arc): Add support for with_cpu option.
19979 * config/arc/arc.h (OPTION_DEFAULT_SPECS): Add fpu.
19981 2021-06-14 Richard Biener <rguenther@suse.de>
19983 PR tree-optimization/101031
19984 * tree-ssa-strlen.c (maybe_invalidate): Increment max_size
19985 instead of size when accounting for a possibly string
19988 2021-06-14 Martin Liska <mliska@suse.cz>
19990 * gimple-ssa-evrp.c (pointer_equiv_analyzer::~pointer_equiv_analyzer): Use delete[].
19992 2021-06-14 Aldy Hernandez <aldyh@redhat.com>
19994 * value-query.cc (gimple_range_global): Call get_range_global
19995 if called after inlining.
19997 2021-06-13 Uroš Bizjak <ubizjak@gmail.com>
20000 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
20001 Emit constant permutation insn directly from here.
20003 2021-06-13 Trevor Saunders <tbsaunde@tbsaunde.org>
20005 * attribs.c (find_attribute_namespace): Iterate over vec<> with
20007 * auto-profile.c (afdo_find_equiv_class): Likewise.
20008 * gcc.c (do_specs_vec): Likewise.
20009 (do_spec_1): Likewise.
20010 (driver::set_up_specs): Likewise.
20011 * gimple-loop-jam.c (any_access_function_variant_p): Likewise.
20012 * gimple-ssa-store-merging.c (compatible_load_p): Likewise.
20013 (imm_store_chain_info::try_coalesce_bswap): Likewise.
20014 (imm_store_chain_info::coalesce_immediate_stores): Likewise.
20015 (get_location_for_stmts): Likewise.
20016 * graphite-poly.c (print_iteration_domains): Likewise.
20017 (free_poly_bb): Likewise.
20018 (remove_gbbs_in_scop): Likewise.
20019 (free_scop): Likewise.
20020 (dump_gbb_cases): Likewise.
20021 (dump_gbb_conditions): Likewise.
20022 (print_pdrs): Likewise.
20023 (print_scop): Likewise.
20024 * ifcvt.c (cond_move_process_if_block): Likewise.
20025 * lower-subreg.c (decompose_multiword_subregs): Likewise.
20026 * regcprop.c (pass_cprop_hardreg::execute): Likewise.
20027 * sanopt.c (sanitize_rewrite_addressable_params): Likewise.
20028 * sel-sched-dump.c (dump_insn_vector): Likewise.
20029 * store-motion.c (store_ops_ok): Likewise.
20030 (store_killed_in_insn): Likewise.
20031 * timevar.c (timer::named_items::print): Likewise.
20032 * tree-cfgcleanup.c (cleanup_control_flow_pre): Likewise.
20033 (cleanup_tree_cfg_noloop): Likewise.
20034 * tree-data-ref.c (dump_data_references): Likewise.
20035 (print_dir_vectors): Likewise.
20036 (print_dist_vectors): Likewise.
20037 (dump_data_dependence_relations): Likewise.
20038 (dump_dist_dir_vectors): Likewise.
20039 (dump_ddrs): Likewise.
20040 (create_runtime_alias_checks): Likewise.
20041 (free_subscripts): Likewise.
20042 (save_dist_v): Likewise.
20043 (save_dir_v): Likewise.
20044 (invariant_access_functions): Likewise.
20045 (same_access_functions): Likewise.
20046 (access_functions_are_affine_or_constant_p): Likewise.
20047 (find_data_references_in_stmt): Likewise.
20048 (graphite_find_data_references_in_stmt): Likewise.
20049 (free_dependence_relations): Likewise.
20050 (free_data_refs): Likewise.
20051 * tree-inline.c (copy_debug_stmts): Likewise.
20052 * tree-into-ssa.c (dump_currdefs): Likewise.
20053 (rewrite_update_phi_arguments): Likewise.
20054 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
20055 * tree-vect-data-refs.c (vect_analyze_possibly_independent_ddr):
20057 (vect_slp_analyze_node_dependences): Likewise.
20058 (vect_slp_analyze_instance_dependence): Likewise.
20059 (vect_record_base_alignments): Likewise.
20060 (vect_get_peeling_costs_all_drs): Likewise.
20061 (vect_peeling_supportable): Likewise.
20062 * tree-vectorizer.c (vec_info::~vec_info): Likewise.
20063 (vec_info::free_stmt_vec_infos): Likewise.
20065 2021-06-13 Jeff Law <jeffreyalaw@gmail.com>
20067 * config/h8300/logical.md (<code>qi3_1<cczn>): New pattern.
20068 (andqi3_1<cczn>): Removed.
20069 (<ors>qi3_1): Do not split for IOR/XOR a single bit.
20070 (H8/SX bit logicals): Split out from other patterns.
20071 * config/h8300/multiply.md (mulqihi3_const<cczn>): Renamed from
20072 mulqihi3_const_clobber_flags.
20073 (mulqihi3<cczn>, mulhisi3_const<cczn>, mulhisi3<cczn>): Similarly
20075 2021-06-13 H.J. Lu <hjl.tools@gmail.com>
20078 * config/i386/i386.c (ix86_expand_prologue): Set red_zone_used
20079 to true if red zone is used.
20080 (ix86_output_indirect_jmp): Replace ix86_red_zone_size with
20081 ix86_red_zone_used.
20082 * config/i386/i386.h (machine_function): Add red_zone_used.
20083 (ix86_red_zone_size): Removed.
20084 (ix86_red_zone_used): New.
20085 * config/i386/i386.md (peephole2 patterns): Replace
20086 ix86_red_zone_size with ix86_red_zone_used.
20088 2021-06-12 Jason Merrill <jason@redhat.com>
20090 * doc/extend.texi (unused variable attribute): Applies to
20091 structure fields as well.
20093 2021-06-12 Eugene Rozenfeld <erozen@microsoft.com>
20095 * auto-profile.c (read_profile): fix a typo in an error string
20097 2021-06-11 Thomas Schwinge <thomas@codesourcery.com>
20099 * tree-pretty-print.h (dump_omp_clauses): Add 'bool = true'
20101 * tree-pretty-print.c (dump_omp_clauses): Update.
20102 (dump_generic_node) <OMP_CLAUSE>: Use it.
20104 2021-06-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
20107 * config/arm/arm_mve.h (__arm_vld1q): Change __ARM_mve_coerce(p0,
20108 int8_t const *) to __ARM_mve_coerce1(p0, int8_t *) in the argument for
20109 the polymorphic variants matching code.
20110 (__arm_vld1q_z): Likewise.
20111 (__arm_vld2q): Likewise.
20112 (__arm_vld4q): Likewise.
20113 (__arm_vldrbq_gather_offset): Likewise.
20114 (__arm_vldrbq_gather_offset_z): Likewise.
20116 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
20118 PR tree-optimization/96392
20119 * fold-const.h (tree_expr_maybe_real_minus_zero_p): Fix prototype.
20121 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
20123 PR tree-optimization/96392
20124 * fold-const.c (fold_real_zero_addition_p): Take both arguments
20125 of the addition or subtraction, not just the zero. Use this
20126 other argument in tests for signaling NaNs and signed zeros.
20127 (tree_expr_maybe_real_minus_zero_p): New predicate.
20128 * fold-const.h (fold_real_zero_addition_p): Update prototype.
20129 (tree_expr_maybe_real_minus_zero_p): New function prototype.
20130 * match.pd: Update calls to fold_real_zero_addition_p.
20131 Replace HONOR_NANS with tree_expr_maybe_nan_p.
20132 Replace HONOR_SIGNED_ZEROS with tree_expr_maybe_real_minus_zero_p.
20133 Replace HONOR_SNANS with tree_expr_maybe_signaling_nan_p.
20134 * tree-ssa-reassoc.c (eliminate_using_constants): Update
20135 call to fold_real_zero_addition_p.
20137 2021-06-11 Richard Biener <rguenther@suse.de>
20139 PR tree-optimization/101025
20140 * tree-ssa-loop-im.c (sm_seq_valid_bb): Make sure to process
20141 all refs that require dependence checking.
20143 2021-06-11 Richard Biener <rguenther@suse.de>
20145 PR tree-optimization/101028
20146 * tree-vect-slp.c (vect_build_slp_tree_2): When SLP
20147 reassoc discovery fails fatally, mark appropriate lanes
20150 2021-06-11 Richard Biener <rguenther@suse.de>
20152 PR tree-optimization/101026
20153 * tree-vect-slp.c (vect_build_slp_tree_2): Make sure we
20154 have a representative for the associated chain nodes.
20156 2021-06-11 Jakub Jelinek <jakub@redhat.com>
20158 PR rtl-optimization/101008
20159 * simplify-rtx.c (relational_result): New function.
20160 (simplify_logical_relational_operation,
20161 simplify_relational_operation): Use it.
20163 2021-06-11 Jakub Jelinek <jakub@redhat.com>
20166 * config/i386/sse.md (*vec_concat<mode>_0_1): Require TARGET_SSE2.
20168 2021-06-11 Uroš Bizjak <ubizjak@gmail.com>
20171 * config/i386/i386-expand.c (expand_vec_perm_pshufb): Return
20172 false if the permutation can be implemented with constant
20173 permutation instruction in wider mode.
20174 (canonicalize_vector_int_perm): Move above expand_vec_perm_pshufb.
20175 Handle V8QImode and V4HImode.
20177 2021-06-11 Martin Liska <mliska@suse.cz>
20179 PR gcov-profile/100788
20180 * common.opt: Add new option.
20181 * coverage.c (coverage_begin_function): Emit warning instead on
20182 the internal compiler error.
20183 * doc/invoke.texi: Document the option.
20184 * toplev.c (process_options): Enable it by default.
20186 2021-06-11 Richard Biener <rguenther@suse.de>
20188 PR middle-end/101009
20189 * tree-data-ref.c (build_classic_dist_vector_1): Make sure
20190 to set *init_b to true when we encounter a constant equal
20192 (compute_affine_dependence): Also dump the actual DR_REF.
20194 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
20196 PR tree-optimization/100984
20197 * gimple-ssa-evrp.c (ssa_equiv_stack): Use auto_vec for
20198 replacements table.
20199 (ssa_equiv_stack::~ssa_equiv_stack): Remove.
20201 2021-06-11 Kewen Lin <linkw@linux.ibm.com>
20203 * config/rs6000/rs6000.md
20204 (floatsi<SFDF:mode>2_lfiwax_<QHI:mode>_mem_zext): New
20205 define_insn_and_split.
20207 2021-06-11 Richard Biener <rguenther@suse.de>
20209 * tree-vect-slp.c (vect_build_slp_tree_2): Use stablesort
20210 to sort operands of the associative chain.
20212 2021-06-11 Richard Biener <rguenther@suse.de>
20214 * system.h (gcc_stablesort_r): Declare.
20215 * sort.cc (gcc_sort_r): Support stable sort.
20216 (gcc_stablesort_r): Define.
20217 * vec.h (vec<>::stablesort): Add.
20219 2021-06-10 Uroš Bizjak <ubizjak@gmail.com>
20222 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
20223 Handle V2SF mode. Emit SHUFPS to fixup unpack-high for V2SF mode.
20224 (expand_vec_perm_blend): Handle 64bit modes for TARGET_SSE4_1.
20225 (expand_vec_perm_pshufb): Handle 64bit modes for TARGET_SSSE3.
20226 (expand_vec_perm_pblendv): Handle 64bit modes for TARGET_SSE4_1.
20227 (expand_vec_perm_interleave2): Handle 64bit modes.
20228 (expand_vec_perm_even_odd_pack): Handle V8QI mode.
20229 (expand_vec_perm_even_odd_1): Ditto.
20230 (ix86_vectorize_vec_perm_const): Ditto.
20231 * config/i386/i386.md (UNSPEC_PSHUFB): Move from ...
20232 * config/i386/sse.md: ... here.
20233 * config/i386/mmx.md (*vec_interleave_lowv2sf):
20234 New insn_and_split pattern.
20235 (*vec_interleave_highv2sf): Ditto.
20236 (mmx_pshufbv8qi3): New insn pattern.
20237 (*mmx_pblendw): Ditto.
20239 2021-06-10 Peter Bergner <bergner@linux.ibm.com>
20241 * config/rs6000/rs6000-builtin.def (build_pair): New built-in.
20242 (build_acc): Likewise.
20243 * config/rs6000/rs6000-call.c (mma_expand_builtin): Swap assemble
20244 source operands in little-endian mode.
20245 (rs6000_gimple_fold_mma_builtin): Handle VSX_BUILTIN_BUILD_PAIR.
20246 (mma_init_builtins): Likewise.
20247 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Handle endianness
20248 ordering for the MMA assemble and build source operands.
20249 * doc/extend.texi (__builtin_vsx_build_acc, __builtin_mma_build_pair):
20251 (__builtin_mma_assemble_acc, __builtin_mma_assemble_pair): Remove
20254 2021-06-10 Jeff Law <jeffreyalaw@gmail.com>
20256 * config/h8300/h8300.c (select_cc_mode): Handle MEM. Use
20258 * config/h8300/extensions.md: Replace _clobber_flags patterns
20261 2021-06-10 Robin Dapp <rdapp@linux.ibm.com>
20263 * config/s390/vector.md (vcond_mask_<mode><mode>): Change to
20264 (vcond_mask_<mode><tointvec>): this.
20266 2021-06-10 Andrew Stubbs <ams@codesourcery.com>
20267 Thomas Schwinge <thomas@codesourcery.com>
20269 * omp-builtins.def (BUILT_IN_GOACC_ENTER_EXIT_DATA): Split into...
20270 (BUILT_IN_GOACC_ENTER_DATA, BUILT_IN_GOACC_EXIT_DATA): ... these.
20271 * gimple.h (enum gf_mask): Split
20272 'GF_OMP_TARGET_KIND_OACC_ENTER_EXIT_DATA' into
20273 'GF_OMP_TARGET_KIND_OACC_ENTER_DATA' and
20274 'GF_OMP_TARGET_KIND_OACC_EXIT_DATA'.
20275 (is_gimple_omp_oacc): Update.
20276 * gimple-pretty-print.c (dump_gimple_omp_target): Likewise.
20277 * gimplify.c (gimplify_omp_target_update): Likewise.
20278 * omp-expand.c (expand_omp_target, build_omp_regions_1)
20279 (omp_make_gimple_edges): Likewise.
20280 * omp-low.c (check_omp_nesting_restrictions, lower_omp_target):
20283 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
20285 * value-query.cc (value_query::value_on_edge): Rename name to
20287 (range_query::range_on_edge): Same.
20288 (range_query::value_of_expr): Same.
20289 (range_query::value_on_edge): Same.
20290 * value-query.h (class value_query): Same.
20291 (class range_query): Same.
20293 2021-06-10 Richard Biener <rguenther@suse.de>
20295 PR tree-optimization/101003
20296 * tree-vect-slp.c (vect_build_slp_tree_2): Appropriately
20297 use the pattern stmt defs when linearizing a chain.
20299 2021-06-10 Jakub Jelinek <jakub@redhat.com>
20302 * ifcvt.c (noce_get_alt_condition, noce_try_abs): Use
20303 prev_nonnote_nondebug_insn instead of prev_nonnote_insn.
20305 2021-06-10 Clement Chigot <clement.chigot@atos.net>
20307 * config/rs6000/aix71.h (ASM_CPU_SPEC): Add Power10 directive.
20308 * config/rs6000/aix72.h (ASM_CPU_SPEC): Likewise.
20310 2021-06-09 Andrew Pinski <apinski@marvell.com>
20312 PR tree-optimization/100925
20313 * match.pd (a ? CST1 : CST2): Limit transformations
20314 that would produce a negative to integeral types only.
20315 Change !POINTER_TYPE_P to INTEGRAL_TYPE_P also.
20317 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
20320 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
20322 * doc/tm.texi: Correctly update.
20324 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
20326 * doc/tm.texi: Correctly update.
20328 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
20331 * doc/tm.texi.in (Trampolines): Add a missing blank line.
20333 2021-06-09 Paul Eggert <eggert@cs.ucla.edu>
20336 * doc/invoke.texi (Code Gen Options); Document that -fno-trampolines
20337 and -ftrampolines work only with Ada.
20338 * doc/tm.texi.in (Trampolines): Likewise.
20339 * doc/tm.texi: Regenerated.
20341 2021-06-09 Carl Love <cel@us.ibm.com>
20343 * config/rs6000/altivec.h (vec_signextll, vec_signexti, vec_signextq):
20344 Add define for new builtins.
20345 * config/rs6000/altivec.md(altivec_vreveti2): Add define_expand.
20346 * config/rs6000/rs6000-builtin.def (VSIGNEXTI, VSIGNEXTLL): Add
20347 overloaded builtin definitions.
20348 (VSIGNEXTSB2W, VSIGNEXTSH2W, VSIGNEXTSB2D, VSIGNEXTSH2D,VSIGNEXTSW2D,
20349 VSIGNEXTSD2Q): Add builtin expansions.
20350 (SIGNEXT): Add P10 overload definition.
20351 * config/rs6000/rs6000-call.c (P9V_BUILTIN_VEC_VSIGNEXTI, P9V_BUILTIN_VEC_VSIGNEXTLL,
20352 P10_BUILTIN_VEC_SIGNEXT): Add overloaded argument definitions.
20353 * config/rs6000/vsx.md (vsx_sign_extend_v2di_v1ti): Add define_insn.
20354 (vsignextend_v2di_v1ti, vsignextend_qi_<mode>, vsignextend_hi_<mode>,
20355 vsignextend_si_v2di)[VIlong]: Add define_expand.
20356 Make define_insn vsx_sign_extend_si_v2di visible.
20357 * doc/extend.texi: Add documentation for the vec_signexti,
20358 vec_signextll builtins and vec_signextq.
20360 2021-06-09 Carl Love <cel@us.ibm.com>
20362 * config/rs6000/rs6000.c (__fixkfti, __fixunskfti, __floattikf,
20363 __floatuntikf): Names changed to __fixkfti_sw, __fixunskfti_sw,
20364 __floattikf_sw, __floatuntikf_sw respectively.
20365 * config/rs6000/rs6000.md (floatti<mode>2, floatunsti<mode>2,
20366 fix_trunc<mode>ti2, fixuns_trunc<mode>ti2): Add
20367 define_insn for mode IEEE 128.
20369 2021-06-09 Carl Love <cel@us.ibm.com>
20371 * config/rs6000/altivec.md (altivec_vslq, altivec_vsrq):
20372 Rename to altivec_vslq_<mode>, altivec_vsrq_<mode>, mode VEC_TI.
20373 * config/rs6000/vector.md (VEC_TI): Was named VSX_TI in vsx.md.
20374 (vashlv1ti3): Change to vashl<mode>3, mode VEC_TI.
20375 (vlshrv1ti3): Change to vlshr<mode>3, mode VEC_TI.
20376 * config/rs6000/vsx.md (VSX_TI): Remove define_mode_iterator. Update
20377 uses of VSX_TI to VEC_TI.
20379 2021-06-09 Carl Love <cel@us.ibm.com>
20381 * config/rs6000/dfp.md (floattitd2, fixtdti2): New define_insns.
20383 2021-06-09 Carl Love <cel@us.ibm.com>
20385 * config/rs6000/altivec.h (vec_dive, vec_mod): Add define for new
20387 * config/rs6000/altivec.md (UNSPEC_VMULEUD, UNSPEC_VMULESD,
20388 UNSPEC_VMULOUD, UNSPEC_VMULOSD): New unspecs.
20389 (altivec_eqv1ti, altivec_gtv1ti, altivec_gtuv1ti, altivec_vmuleud,
20390 altivec_vmuloud, altivec_vmulesd, altivec_vmulosd, altivec_vrlq,
20391 altivec_vrlqmi, altivec_vrlqmi_inst, altivec_vrlqnm,
20392 altivec_vrlqnm_inst, altivec_vslq, altivec_vsrq, altivec_vsraq,
20393 altivec_vcmpequt_p, altivec_vcmpgtst_p, altivec_vcmpgtut_p): New
20395 (vec_widen_umult_even_v2di, vec_widen_smult_even_v2di,
20396 vec_widen_umult_odd_v2di, vec_widen_smult_odd_v2di, altivec_vrlqmi,
20397 altivec_vrlqnm): New define_expands.
20398 * config/rs6000/rs6000-builtin.def (VCMPEQUT_P, VCMPGTST_P,
20399 VCMPGTUT_P): Add macro expansions.
20400 (BU_P10V_AV_P): Add builtin predicate definition.
20401 (VCMPGTUT, VCMPGTST, VCMPEQUT, CMPNET, CMPGE_1TI,
20402 CMPGE_U1TI, CMPLE_1TI, CMPLE_U1TI, VNOR_V1TI_UNS, VNOR_V1TI, VCMPNET_P,
20403 VCMPAET_P, VMULEUD, VMULESD, VMULOUD, VMULOSD, VRLQ,
20404 VSLQ, VSRQ, VSRAQ, VRLQNM, DIV_V1TI, UDIV_V1TI, DIVES_V1TI, DIVEU_V1TI,
20405 MODS_V1TI, MODU_V1TI, VRLQMI): New macro expansions.
20406 (VRLQ, VSLQ, VSRQ, VSRAQ, DIVE, MOD): New overload expansions.
20407 * config/rs6000/rs6000-call.c (P10_BUILTIN_VCMPEQUT,
20408 P10V_BUILTIN_CMPGE_1TI, P10V_BUILTIN_CMPGE_U1TI,
20409 P10V_BUILTIN_VCMPGTUT, P10V_BUILTIN_VCMPGTST,
20410 P10V_BUILTIN_CMPLE_1TI, P10V_BUILTIN_VCMPLE_U1TI,
20411 P10V_BUILTIN_DIV_V1TI, P10V_BUILTIN_UDIV_V1TI,
20412 P10V_BUILTIN_VMULESD, P10V_BUILTIN_VMULEUD,
20413 P10V_BUILTIN_VMULOSD, P10V_BUILTIN_VMULOUD,
20414 P10V_BUILTIN_VNOR_V1TI, P10V_BUILTIN_VNOR_V1TI_UNS,
20415 P10V_BUILTIN_VRLQ, P10V_BUILTIN_VRLQMI,
20416 P10V_BUILTIN_VRLQNM, P10V_BUILTIN_VSLQ,
20417 P10V_BUILTIN_VSRQ, P10V_BUILTIN_VSRAQ,
20418 P10V_BUILTIN_VCMPGTUT_P, P10V_BUILTIN_VCMPGTST_P,
20419 P10V_BUILTIN_VCMPEQUT_P, P10V_BUILTIN_VCMPGTUT_P,
20420 P10V_BUILTIN_VCMPGTST_P, P10V_BUILTIN_CMPNET,
20421 P10V_BUILTIN_VCMPNET_P, P10V_BUILTIN_VCMPAET_P,
20422 P10V_BUILTIN_DIVES_V1TI, P10V_BUILTIN_MODS_V1TI,
20423 P10V_BUILTIN_MODU_V1TI):
20424 New overloaded definitions.
20425 (rs6000_gimple_fold_builtin) [P10V_BUILTIN_VCMPEQUT,
20426 P10V_BUILTIN_CMPNET, P10V_BUILTIN_CMPGE_1TI,
20427 P10V_BUILTIN_CMPGE_U1TI, P10V_BUILTIN_VCMPGTUT,
20428 P10V_BUILTIN_VCMPGTST, P10V_BUILTIN_CMPLE_1TI,
20429 P10V_BUILTIN_CMPLE_U1TI]: New case statements.
20430 (rs6000_init_builtins) [bool_V1TI_type_node, int_ftype_int_v1ti_v1ti]:
20432 (altivec_init_builtins): New E_V1TImode case statement.
20433 (builtin_function_type)[P10_BUILTIN_128BIT_VMULEUD,
20434 P10_BUILTIN_128BIT_VMULOUD, P10_BUILTIN_128BIT_DIVEU_V1TI,
20435 P10_BUILTIN_128BIT_MODU_V1TI, P10_BUILTIN_CMPGE_U1TI,
20436 P10_BUILTIN_VCMPGTUT, P10_BUILTIN_VCMPEQUT]: New case statements.
20437 * config/rs6000/rs6000.c (rs6000_handle_altivec_attribute) [E_TImode,
20438 E_V1TImode]: New case statements.
20439 * config/rs6000/rs6000.h (rs6000_builtin_type_index): New enum
20440 value RS6000_BTI_bool_V1TI.
20441 * config/rs6000/vector.md (vector_gtv1ti,vector_nltv1ti,
20442 vector_gtuv1ti, vector_nltuv1ti, vector_ngtv1ti, vector_ngtuv1ti,
20443 vector_eq_v1ti_p, vector_ne_v1ti_p, vector_ae_v1ti_p,
20444 vector_gt_v1ti_p, vector_gtu_v1ti_p, vrotlv1ti3, vashlv1ti3,
20445 vlshrv1ti3, vashrv1ti3): New define_expands.
20446 * config/rs6000/vsx.md (UNSPEC_VSX_DIVSQ, UNSPEC_VSX_DIVUQ,
20447 UNSPEC_VSX_DIVESQ, UNSPEC_VSX_DIVEUQ, UNSPEC_VSX_MODSQ,
20448 UNSPEC_VSX_MODUQ): New unspecs.
20449 (mulv2di3, vsx_div_v1ti, vsx_udiv_v1ti, vsx_dives_v1ti,
20450 vsx_diveu_v1ti, vsx_mods_v1ti, vsx_modu_v1ti, xxswapd_v1ti): New
20452 (vcmpnet): New define_expand.
20453 * doc/extend.texi: Add documentation for the new builtins vec_rl,
20454 vec_rlmi, vec_rlnm, vec_sl, vec_sr, vec_sra, vec_mule, vec_mulo,
20455 vec_div, vec_dive, vec_mod, vec_cmpeq, vec_cmpne, vec_cmpgt, vec_cmplt,
20456 vec_cmpge, vec_cmple, vec_all_eq, vec_all_ne, vec_all_gt, vec_all_lt,
20457 vec_all_ge, vec_all_le, vec_any_eq, vec_any_ne, vec_any_gt, vec_any_lt,
20458 vec_any_ge, vec_any_le.
20460 2021-06-09 Carl Love <cel@us.ibm.com>
20462 * config/rs6000/altivec.md (altivec_vrl<VI_char>mi): Fix
20463 bug in argument generation.
20465 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
20467 * config/arm/iterators.md (<supf>): Remove VCLZQ_U, VCLZQ_S.
20469 * config/arm/mve.md (mve_vclzq_<supf><mode>): Add '@' prefix,
20470 remove <supf> iterator.
20471 (mve_vclzq_u<mode>): New.
20472 * config/arm/neon.md (clz<mode>2): Rename to neon_vclz<mode>.
20473 (neon_vclz<mode): Move to ...
20474 * config/arm/unspecs.md (VCLZQ_U, VCLZQ_S): Remove.
20475 * config/arm/vec-common.md: ... here. Add support for MVE.
20477 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
20479 * config/arm/mve.md (mve_vhaddq_<supf><mode>): Prefix with '@'.
20480 (@mve_vrhaddq_<supf><mode): Likewise.
20481 * config/arm/neon.md (neon_v<r>hadd<sup><mode>): Likewise.
20482 * config/arm/vec-common.md (avg<mode>3_floor, uavg<mode>3_floor)
20483 (avg<mode>3_ceil", uavg<mode>3_ceil): New patterns.
20485 2021-06-09 imba-tjd <109224573@qq.com>
20487 * doc/invoke.texi: Fix typo.
20489 2021-06-09 Roger Sayle <roger@nextmovesoftware.com>
20491 PR middle-end/53267
20492 * fold-const-call.c (fold_const_call_sss) [CASE_CFN_FMOD]:
20493 Support evaluation of fmod/fmodf/fmodl at compile-time.
20495 2021-06-09 Richard Biener <rguenther@suse.de>
20497 PR tree-optimization/100981
20498 * tree-vect-loop.c (vect_create_epilog_for_reduction): Use
20499 gimple_get_lhs to also handle calls.
20500 * tree-vect-slp-patterns.c (complex_pattern::build): Transfer
20503 2021-06-09 Richard Biener <rguenther@suse.de>
20505 PR tree-optimization/97832
20506 * tree-vectorizer.h (_slp_tree::failed): New.
20507 * tree-vect-slp.c (_slp_tree::_slp_tree): Initialize
20509 (_slp_tree::~_slp_tree): Free failed.
20510 (vect_build_slp_tree): Retain failed nodes and record
20511 matches in them, copying that back out when running
20512 into a cached fail. Dump start and end of discovery.
20513 (dt_sort_cmp): New.
20514 (vect_build_slp_tree_2): Handle associatable chains
20515 together doing more aggressive operand swapping.
20517 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
20520 * config.gcc (gcc_cv_initfini_array): Set to yes for Linux and
20522 * doc/install.texi: Require glibc 2.1 and binutils 2.12 for
20523 Linux and GNU targets.
20525 2021-06-09 Richard Biener <rguenther@suse.de>
20527 * tree-vect-stmts.c (vect_is_simple_use): Always get dt
20530 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
20532 * config/arc/arc.md (loop_end): Change it to
20533 define_insn_and_split.
20535 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
20537 * config/arc/arc.md (maddhisi4): Use VMAC2H instruction.
20538 (machi): New pattern.
20539 (umaddhisi4): Use VMAC2HU instruction.
20540 (umachi): New pattern.
20542 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
20544 * config/arc/arc-protos.h (arc_split_move_p): New prototype.
20545 * config/arc/arc.c (arc_split_move_p): New function.
20546 (arc_split_move): Clean up.
20547 * config/arc/arc.md (movdi_insn): Clean up, use arc_split_move_p.
20548 (movdf_insn): Likewise.
20549 * config/arc/simdext.md (mov<VWH>_insn): Likewise.
20551 2021-06-09 Uroš Bizjak <ubizjak@gmail.com>
20554 * config/i386/i386.c (print_operand_address_as): Rename "no_rip"
20555 argument to "raw". Do not emit segment overrides when "raw" is true.
20557 2021-06-09 Martin Liska <mliska@suse.cz>
20559 * doc/gcov.texi: Create a proper JSON files.
20560 * doc/invoke.texi: Remove dots in order to make it a valid
20563 2021-06-09 Xionghu Luo <luoxhu@linux.ibm.com>
20565 * config/rs6000/rs6000-p8swap.c (pattern_is_rotate64): New.
20566 (insn_is_load_p): Use pattern_is_rotate64.
20567 (insn_is_swap_p): Likewise.
20568 (quad_aligned_load_p): Likewise.
20569 (const_load_sequence_p): Likewise.
20570 (replace_swapped_aligned_load): Likewise.
20571 (recombine_lvx_pattern): Likewise.
20572 (recombine_stvx_pattern): Likewise.
20574 2021-06-09 Andrew MacLeod <amacleod@redhat.com>
20576 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Use a
20577 fur_stmt source record.
20578 * gimple-range.cc (fur_source::get_operand): Generic range query.
20579 (fur_source::get_phi_operand): New.
20580 (fur_source::register_dependency): New.
20581 (fur_source::query): New.
20582 (class fur_edge): New. Edge source for operands.
20583 (fur_edge::fur_edge): New.
20584 (fur_edge::get_operand): New.
20585 (fur_edge::get_phi_operand): New.
20586 (fur_edge::query): New.
20587 (fur_stmt::fur_stmt): New.
20588 (fur_stmt::get_operand): New.
20589 (fur_stmt::get_phi_operand): New.
20590 (fur_stmt::query): New.
20591 (class fur_depend): New. Statement source and process dependencies.
20592 (fur_depend::fur_depend): New.
20593 (fur_depend::register_dependency): New.
20594 (class fur_list): New. List source for operands.
20595 (fur_list::fur_list): New.
20596 (fur_list::get_operand): New.
20597 (fur_list::get_phi_operand): New.
20598 (fold_range): New. Instantiate appropriate fur_source class and fold.
20599 (fold_using_range::range_of_range_op): Use new API.
20600 (fold_using_range::range_of_address): Ditto.
20601 (fold_using_range::range_of_phi): Ditto.
20602 (imple_ranger::fold_range_internal): Use fur_depend class.
20603 (fold_using_range::range_of_ssa_name_with_loop_info): Use new API.
20604 * gimple-range.h (class fur_source): Now a base class.
20605 (class fur_stmt): New.
20606 (fold_range): New prototypes.
20607 (fur_source::fur_source): Delete.
20609 2021-06-08 Andrew Pinski <apinski@marvell.com>
20611 PR tree-optimization/25290
20612 * tree-ssa-phiopt.c (xor_replacement): Delete.
20613 (tree_ssa_phiopt_worker): Delete use of xor_replacement.
20614 (match_simplify_replacement): Allow one cheap preparation
20615 statement that can be moved to before the if.
20617 2021-06-08 Pat Haugen <pthaugen@linux.ibm.com>
20619 * config/rs6000/power10.md (power10-fused-load, power10-fused-store,
20620 power10-fused_alu, power10-fused-vec, power10-fused-branch): New.
20622 2021-06-08 Jeff Law <jeffreyalaw@gmail.com>
20624 * config/h8300/logical.md (andqi3_1): Move BCLR case into define_insn_and_split.
20625 Create length attribute on define_insn_and_split. Only split for cases which we
20627 (andqi3_1<cczn>): Renamed from andqi3_1_clobber_flags. Only handle AND here and
20628 fix length computation.
20629 (b<code><mode>msx): Combine QImode and HImode H8/SX patterns using iterator.
20631 2021-06-08 Richard Biener <rguenther@suse.de>
20633 PR tree-optimization/100923
20634 * tree-ssa-sccvn.c (valueize_refs_1): Take a pointer to
20635 the operand vector to be valueized.
20636 (valueize_refs): Likewise.
20637 (valueize_shared_reference_ops_from_ref): Adjust.
20638 (valueize_shared_reference_ops_from_call): Likewise.
20639 (vn_reference_lookup_3): Likewise.
20640 (vn_reference_lookup_pieces): Likewise. Re-valueize
20641 with honoring availability when we are about to create
20642 the ao_ref and valueized before.
20643 (vn_reference_lookup): Likewise.
20644 (vn_reference_insert_pieces): Adjust.
20646 2021-06-08 Richard Biener <rguenther@suse.de>
20648 * tree-vectorizer.h (_slp_instance::root_stmt): Change to...
20649 (_slp_instance::root_stmts): ... a vector.
20650 (SLP_INSTANCE_ROOT_STMT): Rename to ...
20651 (SLP_INSTANCE_ROOT_STMTS): ... this.
20652 (slp_root::root): Change to...
20653 (slp_root::roots): ... a vector.
20654 (slp_root::slp_root): Adjust.
20655 * tree-vect-slp.c (_slp_instance::location): Adjust.
20656 (vect_free_slp_instance): Release the root stmt vector.
20657 (vect_build_slp_instance): Adjust.
20658 (vect_analyze_slp): Likewise.
20659 (_bb_vec_info::~_bb_vec_info): Likewise.
20660 (vect_slp_analyze_operations): Likewise.
20661 (vect_bb_vectorization_profitable_p): Likewise. Adjust
20662 costs for the root stmt.
20663 (vect_slp_check_for_constructors): Gather all BIT_INSERT_EXPRs
20665 (vect_slp_analyze_bb_1): Simplify by marking all root stmts
20667 (vectorize_slp_instance_root_stmt): Adjust.
20668 (vect_schedule_slp): Likewise.
20670 2021-06-08 Aldy Hernandez <aldyh@redhat.com>
20672 * gimple-ssa-evrp.c (class ssa_equiv_stack): New.
20673 (ssa_equiv_stack::ssa_equiv_stack): New.
20674 (ssa_equiv_stack::~ssa_equiv_stack): New.
20675 (ssa_equiv_stack::enter): New.
20676 (ssa_equiv_stack::leave): New.
20677 (ssa_equiv_stack::push_replacement): New.
20678 (ssa_equiv_stack::get_replacement): New.
20679 (is_pointer_ssa): New.
20680 (class pointer_equiv_analyzer): New.
20681 (pointer_equiv_analyzer::pointer_equiv_analyzer): New.
20682 (pointer_equiv_analyzer::~pointer_equiv_analyzer): New.
20683 (pointer_equiv_analyzer::set_global_equiv): New.
20684 (pointer_equiv_analyzer::set_cond_equiv): New.
20685 (pointer_equiv_analyzer::get_equiv): New.
20686 (pointer_equiv_analyzer::enter): New.
20687 (pointer_equiv_analyzer::leave): New.
20688 (pointer_equiv_analyzer::get_equiv_expr): New.
20689 (pta_valueize): New.
20690 (pointer_equiv_analyzer::visit_stmt): New.
20691 (pointer_equiv_analyzer::visit_edge): New.
20692 (hybrid_folder::value_of_expr): Call PTA.
20693 (hybrid_folder::value_on_edge): Same.
20694 (hybrid_folder::pre_fold_bb): New.
20695 (hybrid_folder::post_fold_bb): New.
20696 (hybrid_folder::pre_fold_stmt): New.
20697 (rvrp_folder::pre_fold_bb): New.
20698 (rvrp_folder::post_fold_bb): New.
20699 (rvrp_folder::pre_fold_stmt): New.
20700 (rvrp_folder::value_of_expr): Call PTA.
20701 (rvrp_folder::value_on_edge): Same.
20703 2021-06-08 Jakub Jelinek <jakub@redhat.com>
20706 * tree-inline.c (copy_tree_body_r): For OMP_CLAUSE_DEPEND don't
20707 check TREE_CODE if OMP_CLAUSE_DECL is NULL.
20709 2021-06-08 Richard Biener <rguenther@suse.de>
20711 PR middle-end/100951
20712 * tree-vect-generic.c (expand_vector_piecewise): Build a
20713 VECTOR_CST if all elements are constant.
20714 (expand_vector_condition): Likewise.
20715 (lower_vec_perm): Likewise.
20716 (expand_vector_conversion): Likewise.
20718 2021-06-08 Martin Liska <mliska@suse.cz>
20720 * doc/invoke.texi: Document new param evrp-sparse-threshold.
20722 2021-06-08 Martin Liska <mliska@suse.cz>
20724 * genautomata.c (create_automata): Fix typo.
20726 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
20728 PR tree-optimization/100794
20729 * tree-predcom.c (tree_predictive_commoning_loop): Add parameter
20730 allow_unroll_p and only allow unrolling when it's true.
20731 (tree_predictive_commoning): Add parameter allow_unroll_p and
20733 (run_tree_predictive_commoning): Likewise.
20734 (pass_predcom::gate): Check flag_tree_loop_vectorize and
20735 global_options_set.x_flag_predictive_commoning.
20736 (pass_predcom::execute): Adjust for allow_unroll_p.
20738 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
20740 * tree-predcom.c (execute_pred_commoning): Remove update_ssa call.
20741 (tree_predictive_commoning_loop): Factor some cleanup stuffs into
20742 lambda function cleanup, remove scev_reset call, and adjust return
20744 (tree_predictive_commoning): Adjust for different changed values,
20745 only set flag TODO_update_ssa_only_virtuals if changed.
20746 (pass_data pass_data_predcom): Remove TODO_update_ssa_only_virtuals
20747 from todo_flags_finish.
20749 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
20751 * gimple-range-cache.cc (class sbr_sparse_bitmap): New.
20752 (sbr_sparse_bitmap::sbr_sparse_bitmap): New.
20753 (sbr_sparse_bitmap::bitmap_set_quad): New.
20754 (sbr_sparse_bitmap::bitmap_get_quad): New.
20755 (sbr_sparse_bitmap::set_bb_range): New.
20756 (sbr_sparse_bitmap::get_bb_range): New.
20757 (sbr_sparse_bitmap::bb_range_p): New.
20758 (block_range_cache::block_range_cache): initialize bitmap obstack.
20759 (block_range_cache::~block_range_cache): Destruct obstack.
20760 (block_range_cache::set_bb_range): Decide when to utilze the
20761 sparse on entry cache.
20762 * gimple-range-cache.h (block_range_cache): Add bitmap obstack.
20763 * params.opt (-param=evrp-sparse-threshold): New.
20765 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
20767 * bitmap.c (bitmap_set_aligned_chunk): New.
20768 (bitmap_get_aligned_chunk): New.
20769 (test_aligned_chunk): New.
20770 (bitmap_c_tests): Call test_aligned_chunk.
20771 * bitmap.h (bitmap_set_aligned_chunk, bitmap_get_aligned_chunk): New.
20773 2021-06-07 Uroš Bizjak <ubizjak@gmail.com>
20776 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
20778 (ix86_expand_vector_init_one_nonzero): Ditto.
20779 (ix86_expand_vector_init_one_var): Ditto.
20780 (ix86_expand_vector_init_general): Ditto.
20781 * config/i386/mmx.md (vec_initv4qiqi): New expander.
20783 2021-06-07 Jeff Law <jeffreyalaw@gmail.com>
20785 * config/h8300/movepush.md: Change most _clobber_flags
20786 patterns to instead use <cczn> subst.
20787 (movsi_cczn): New pattern with usable CC cases split out.
20788 (movsi_h8sx_cczn): Likewise.
20790 2021-06-07 Martin Liska <mliska@suse.cz>
20792 * common/common-target.def: Split long lines and replace them
20794 * target.def: Likewise.
20795 * doc/tm.texi: Re-generated.
20797 2021-06-07 Jakub Jelinek <jakub@redhat.com>
20800 * fold-const.c (fold_read_from_vector): Return NULL if trying to
20801 read from a CONSTRUCTOR with vector type elements.
20803 2021-06-07 Jakub Jelinek <jakub@redhat.com>
20805 PR middle-end/100898
20806 * tree-inline.c (copy_bb): Only use gimple_call_arg_ptr if memcpy
20807 should copy any arguments. Don't call gimple_call_num_args
20808 on id->call_stmt or call_stmt more than once.
20810 2021-06-07 liuhongt <hongtao.liu@intel.com>
20813 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): Refine
20815 (<insn>v4siv4di2): Delete constraints for define_expand.
20817 2021-06-07 liuhongt <hongtao.liu@intel.com>
20820 * config/i386/i386-expand.c (ix86_expand_builtin): Remove
20821 assignment of cfun->machine->has_explicit_vzeroupper.
20822 * config/i386/i386-features.c
20823 (ix86_add_reg_usage_to_vzerouppers): Delete.
20824 (ix86_add_reg_usage_to_vzeroupper): Ditto.
20825 (rest_of_handle_insert_vzeroupper): Remove
20826 ix86_add_reg_usage_to_vzerouppers, add df_analyze at the end
20828 (gate): Remove cfun->machine->has_explicit_vzeroupper.
20829 * config/i386/i386-protos.h (ix86_expand_avx_vzeroupper):
20831 * config/i386/i386.c (ix86_insn_callee_abi): New function.
20832 (ix86_initialize_callee_abi): Ditto.
20833 (ix86_expand_avx_vzeroupper): Ditto.
20834 (ix86_hard_regno_call_part_clobbered): Adjust for vzeroupper
20836 (TARGET_INSN_CALLEE_ABI): Define as ix86_insn_callee_abi.
20837 (ix86_emit_mode_set): Call ix86_expand_avx_vzeroupper
20839 * config/i386/i386.h (struct GTY(()) machine_function): Delete
20840 has_explicit_vzeroupper.
20841 * config/i386/i386.md (enum unspec): New member
20843 (ABI_DEFAULT,ABI_VZEROUPPER,ABI_UNKNOWN): New
20844 define_constants for insn callee abi index.
20845 * config/i386/predicates.md (vzeroupper_pattern): Adjust.
20846 * config/i386/sse.md (UNSPECV_VZEROUPPER): Deleted.
20847 (avx_vzeroupper): Call ix86_expand_avx_vzeroupper.
20848 (*avx_vzeroupper): Rename to ..
20849 (avx_vzeroupper_callee_abi): .. this, and adjust pattern as
20850 call_insn which has a special vzeroupper ABI.
20851 (*avx_vzeroupper_1): Deleted.
20853 2021-06-07 liuhongt <hongtao.liu@intel.com>
20856 * df-scan.c (df_get_call_refs): When call_insn is a fake call,
20857 it won't use stack pointer reg.
20858 * final.c (leaf_function_p): When call_insn is a fake call, it
20859 won't affect caller as a leaf function.
20860 * reg-stack.c (callee_clobbers_any_stack_reg): New.
20861 (subst_stack_regs): When call_insn doesn't clobber any stack
20862 reg, don't clear the arguments.
20863 * rtl.c (shallow_copy_rtx): Don't clear flag used when orig is
20865 * shrink-wrap.c (requires_stack_frame_p): No need for stack
20866 frame for a fake call.
20867 * rtl.h (FAKE_CALL_P): New macro.
20869 2021-06-06 Eric Botcazou <ebotcazou@adacore.com>
20871 * config/sparc/sparc-protos.h (order_regs_for_local_alloc): Rename
20873 (sparc_order_regs_for_local_alloc): ...this.
20874 (sparc_leaf_reg_remap): Declare.
20875 * config/sparc/sparc.h (ADJUST_REG_ALLOC_ORDER): Adjust.
20876 (LEAF_REG_REMAP): Reimplement as call to sparc_leaf_reg_remap.
20877 * config/sparc/sparc.c (leaf_reg_remap): Delete.
20878 (order_regs_for_local_alloc): Rename to...
20879 (sparc_order_regs_for_local_alloc): ...this.
20880 (sparc_leaf_reg_remap): New function.
20881 (sparc_conditional_register_usage): Do not modify leaf_reg_remap.
20883 2021-06-06 David Edelsohn <dje.gcc@gmail.com>
20885 * config/rs6000/rs6000.c (rs6000_xcoff_asm_output_aligned_decl_common):
20886 Use assemble_name to output BSS section name.
20888 2021-06-06 Uroš Bizjak <ubizjak@gmail.com>
20890 * config/i386/constraints.md (Bs):
20891 Remove boolean operators from match_test RTX.
20894 (M): Use "mode" variable instead of GET_MODE (op) in match_test RTX.
20897 2021-06-06 Martin Liska <mliska@suse.cz>
20899 * doc/extend.texi: Add missing @headitem.
20900 * doc/invoke.texi: Likewise.
20901 * doc/objc.texi: Likewise.
20903 2021-06-06 Martin Liska <mliska@suse.cz>
20905 * genhooks.c (emit_findices): Remove unused function.
20906 (emit_documentation): Do not call emit_findices
20907 and do not search for @Fcode directives.
20909 2021-06-06 Martin Liska <mliska@suse.cz>
20911 * doc/invoke.texi: Remove extra character.
20913 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
20915 * config/sh/sh.md (doloop_end_split): Fix empty split condition.
20917 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
20919 * config/sparc/sparc.md (*snedi<W:mode>_zero_vis3,
20920 *neg_snedi<W:mode>_zero_subxc, *plus_snedi<W:mode>_zero,
20921 *plus_plus_snedi<W:mode>_zero, *minus_snedi<W:mode>_zero,
20922 *minus_minus_snedi<W:mode>_zero): Fix empty split condition.
20924 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
20926 * config/or1k/or1k.md (*movdi): Fix empty split condition.
20928 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
20930 * config/mips/mips.md (<anonymous>, bswapsi2, bswapdi2): Fix empty
20933 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
20935 * config/m68k/m68k.md (*zero_extend_inc, *zero_extend_dec,
20936 *zero_extendsidi2): Fix empty split condition.
20938 2021-06-05 Jeff Law <jeffreyalaw@gmail.com>
20940 * config/h8300/addsub.md: Fix split condition in define_insn_and_split
20942 * config/h8300/bitfield.md: Likewise.
20943 * config/h8300/combiner.md: Likewise.
20944 * config/h8300/divmod.md: Likewise.
20945 * config/h8300/extensions.md: Likewise.
20946 * config/h8300/jumpcall.md: Likewise.
20947 * config/h8300/movepush.md: Likewise.
20948 * config/h8300/multiply.md: Likewise.
20949 * config/h8300/other.md: Likewise.
20950 * config/h8300/shiftrotate.md: Likewise.
20951 * config/h8300/logical.md: Likewise. Fix split pattern to use
20952 code iterator that somehow slipped through.
20954 2021-06-04 Tobias Burnus <tobias@codesourcery.com>
20956 PR middle-end/100905
20957 * tree-nested.c (convert_nonlocal_omp_clauses,
20958 convert_local_omp_clauses): Handle OMP_CLAUSE_BIND.
20960 2021-06-04 Martin Sebor <msebor@redhat.com>
20962 PR middle-end/100732
20963 * gimple-fold.c (gimple_fold_builtin_sprintf): Avoid folding calls
20964 with either source or destination argument of invalid type.
20965 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Avoid checking
20966 calls with arguments of invalid type.
20968 2021-06-04 Martin Sebor <msebor@redhat.com>
20970 * attribs.c (init_attr_rdwr_indices): Use VLA bounds in the expected
20972 (attr_access::vla_bounds): Also handle VLA bounds.
20974 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
20976 * config/i386/predicates.md (GOT_memory_operand):
20977 Implement using match_code RTXes.
20978 (GOT32_symbol_operand): Ditto.
20980 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
20983 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
20985 (ix86_expand_vector_init_general): Ditto.
20986 Use SImode instead of word_mode for logic operations
20987 when GET_MODE_SIZE (mode) < UNITS_PER_WORD.
20988 (expand_vec_perm_even_odd_1): Assert that V2HI mode should be
20989 implemented by expand_vec_perm_1.
20990 (expand_vec_perm_broadcast_1): Assert that V2HI and V4HI modes
20991 should be implemented using standard shuffle patterns.
20992 (ix86_vectorize_vec_perm_const): Handle V2HImode. Add V4HI and
20993 V2HI modes to modes, implementable with shuffle for one operand.
20994 * config/i386/mmx.md (*punpckwd): New insn_and_split pattern.
20995 (*pshufw_1): New insn pattern.
20996 (*vec_dupv2hi): Ditto.
20997 (vec_initv2hihi): New expander.
20999 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
21001 * config/arm/vfp.md (no_literal_pool_df_immediate,
21002 no_literal_pool_sf_immediate): Fix empty split condition.
21004 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
21006 * config/i386/i386.md (*load_tp_x32_zext, *add_tp_x32_zext,
21007 *tls_dynamic_gnu2_combine_32): Fix empty split condition.
21008 * config/i386/sse.md (*<sse2_avx2>_pmovmskb_lt,
21009 *<sse2_avx2>_pmovmskb_zext_lt, *sse2_pmovmskb_ext_lt,
21010 *<sse4_1_avx2>_pblendvb_lt): Likewise.
21012 2021-06-04 Jakub Jelinek <jakub@redhat.com>
21015 * config/i386/i386-expand.c (ix86_expand_vector_init): Handle
21016 concatenation from half-sized modes with TImode elements.
21018 2021-06-04 Claudiu Zissulescu <claziss@synopsys.com>
21020 * config/arc/arc.c (arc_override_options): Disable millicode
21021 thunks when RF16 is on.
21023 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
21025 * config/rs6000/rs6000.h (PROMOTE_MODE): Remove.
21027 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
21029 * config/rs6000/rs6000-call.c (rs6000_promote_function_mode):
21030 Replace PROMOTE_MODE marco with its content.
21032 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
21034 * config/cris/cris.md (*addi_reload): Fix empty split condition.
21036 2021-06-03 Jim Wilson <jimw@sifive.com>
21038 * config.gcc (riscv*-*-*): If --with-riscv-attribute not used,
21039 turn it on for all riscv targets.
21041 2021-06-03 Uroš Bizjak <ubizjak@gmail.com>
21044 * config/i386/i386-expand.c (ix86_expand_vector_set):
21045 Handle V2HI and V4QI modes.
21046 (ix86_expand_vector_extract): Ditto.
21047 * config/i386/mmx.md (*pinsrw): New insn pattern.
21050 (*pextrw_zext): Ditto.
21052 (*pextrb_zext): Ditto.
21053 (vec_setv2hi): New expander.
21054 (vec_extractv2hihi): Ditto.
21055 (vec_setv4qi): Ditto.
21056 (vec_extractv4qiqi): Ditto.
21057 (vec_setv8qi): Enable only for TARGET_SSE4_1.
21058 (vec_extractv8qiqi): Ditto.
21060 2021-06-03 Aaron Sawdey <acsawdey@linux.ibm.com>
21062 * config/rs6000/genfusion.pl (gen_logical_addsubf): Fix input
21063 order to subf instruction.
21064 * config/rs6000/fusion.md: Regenerate.
21066 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
21068 * calls.c (get_size_range): Use range_of_expr instead of
21069 determine_value_range.
21070 * tree-affine.c (expr_to_aff_combination): Same.
21071 * tree-data-ref.c (split_constant_offset): Same.
21072 * tree-vrp.c (determine_value_range_1): Remove.
21073 (determine_value_range): Remove.
21074 * tree-vrp.h (determine_value_range): Remove.
21076 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
21078 * function-tests.c (test_ranges): Call gimple_range_tests.
21079 * gimple-range-cache.cc (ranger_cache::range_of_expr): Pass stmt
21081 * gimple-range.cc (fur_source::get_operand): Do not call
21082 get_tree_range or gimple_range_global.
21084 (get_tree_range): Move to value-query.cc.
21085 Call get_arith_expr_range.
21086 (gimple_ranger::range_of_expr): Add argument to get_tree_range.
21087 Include gimple-range-tests.cc.
21088 * gimple-range.h (fold_range): Add argument.
21089 (get_tree_range): Remove.
21090 * selftest.h (gimple_range_tests): New.
21091 * value-query.cc (global_range_query::range_of_expr): Add
21093 (range_query::get_tree_range): Move from gimple-range.cc.
21094 * value-query.h (class range_query): Add get_tree_range and
21095 get_arith_expr_range. Make fur_source a friend.
21096 * vr-values.c (vr_values::range_of_expr): Pass stmt to
21098 * gimple-range-tests.cc: New file.
21100 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
21102 * gimple-range.cc (gimple_ranger::export_global_ranges): Call
21103 update_global_range.
21104 * value-query.cc (update_global_range): New.
21105 * value-query.h (update_global_range): New.
21107 2021-06-03 David Malcolm <dmalcolm@redhat.com>
21109 * diagnostic-show-locus.c (diagnostic_show_locus): Don't reject
21110 printing the same location twice if there are fix-it hints,
21111 multiple locations, or a label.
21113 2021-06-03 Andre Vieira <andre.simoesdiasvieira@arm.com>
21115 * tree-vect-loop.c (vect_transform_loop): Use main loop's various'
21116 thresholds to narrow the upper bound on epilogue iterations.
21118 2021-06-03 Christophe Lyon <christophe.lyon@linaro.org>
21120 * config/arm/mve.md (mve_vabsq_f<mode>): Use 'abs' instead of unspec.
21121 (mve_vabsq_s<mode>): Likewise.
21122 * config/arm/neon.md (abs<mode>2): Rename to neon_abs<mode>2.
21123 * config/arm/unspecs.md (VABSQ_F, VABSQ_S): Delete.
21124 * config/arm/vec-common.md (neg<mode>2): Rename to
21125 <absneg_str><mode>2.
21127 2021-06-03 Claudiu Zissulescu <claziss@synopsys.com>
21129 * common/config/arc/arc-common.c (arc_option_optimization_table):
21130 Remove malign-call.
21131 * config/arc/arc.c (arc_unalign_branch_p): Remove unused function.
21132 * config/arc/arc.h (TARGET_MIXED_CODE): Remove macro.
21133 (INDEX_REG_CLASS): Only refer to GENERAL_REGS.
21134 * config/arc/arc.md (abssi2_mixed): Remove pattern.
21135 * config/arc/arc.opt (munalign-prob-threshold): Mark it obsolete.
21136 (malign-call): Likewise.
21137 (mmixed-code): Likewise.
21138 * doc/invoke.texi (ARC): Update doc.
21140 2021-06-03 Martin Liska <mliska@suse.cz>
21142 * common.opt: Use proper Enum values.
21143 * opts.c (COVERAGE_SANITIZER_OPT): Remove.
21144 (parse_sanitizer_options): Handle only sanitizer_opts.
21145 (common_handle_option): Just assign value.
21147 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
21150 * tree-inline.c (inline_forbidden_p): Remove test on return type.
21152 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
21154 * dwarf2out.c (loc_list_from_tree_1) <FUNCTION_DECL>: Also generate
21155 DW_OP_GNU_variable_value referencing an existing DIE at file scope.
21156 (type_byte_size): Inline into...
21157 (add_byte_size_attribute): ...this and call add_scalar_info.
21159 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
21161 * dwarf2out.c (mem_loc_descriptor) <UDIV>: Fix typo.
21162 (typed_binop_from_tree): New function.
21163 (loc_list_from_tree_1) <EXACT_DIV_EXPR>: For an unsigned type,
21164 turn a divide by a power of 2 into a shift.
21165 <CEIL_DIV_EXPR>: For an unsigned type, use a signed divide if the
21166 size of the mode is lower than DWARF2_ADDR_SIZE; otherwise, do a
21167 typed divide by calling typed_binop_from_tree.
21169 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
21171 * dwarf2out.c (scompare_loc_descriptor): Fix head comment.
21172 (is_handled_procedure_type): Likewise.
21173 (struct loc_descr_context): Add strict_signedness field.
21174 (resolve_args_picking_1): Deal with DW_OP_[GNU_]deref_type,
21175 DW_OP_[GNU_]convert and DW_OP_[GNU_]reinterpret.
21176 (resolve_args_picking): Minor tweak.
21177 (function_to_dwarf_procedure): Initialize strict_signedness field.
21178 (type_byte_size): Likewise.
21179 (field_byte_offset): Likewise.
21180 (gen_descr_array_type_die): Likewise.
21181 (gen_variant_part): Likewise.
21182 (loc_list_from_tree_1) <CALL_EXPR>: Tidy up and set strict_signedness
21183 to true when a context is present before evaluating the arguments.
21184 <COND_EXPR>: Do not generate a useless comparison with zero.
21185 When dereferencing an address, if strict_signedness is true and the
21186 type is small and signed, use DW_OP_deref_type to do the dereference
21187 and then DW_OP_convert to convert back to the generic type.
21189 2021-06-03 Jakub Jelinek <jakub@redhat.com>
21192 * tree-inline.c (copy_tree_body_r): Handle iterators on
21193 OMP_CLAUSE_AFFINITY or OMP_CLAUSE_DEPEND.
21195 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
21197 * config/arc/arc.md (*bbit_di): Remove.
21199 2021-06-02 Christoph Muellner <cmuellner@gcc.gnu.org>
21201 PR rtl-optimization/100264
21202 * ree.c (get_sub_rtx): Ignore SET expressions without register
21203 destinations and remove assertion, as it is not valid anymore
21204 with this new behaviour.
21205 (merge_def_and_ext): Eliminate destination check for register
21206 as such SET expressions can't occur anymore.
21207 (combine_reaching_defs): Likewise.
21209 2021-06-02 Jakub Jelinek <jakub@redhat.com>
21212 * config/xtensa/xtensa.h (LEAF_REG_REMAP): Cast REGNO to int to avoid
21213 -Wtype-limits warnings.
21214 (DWARF_FRAME_REGISTER): Rewrite into ternary operator with addition
21215 in operands to avoid -Wsign-compare warnings.
21217 2021-06-02 Pat Haugen <pthaugen@linux.ibm.com>
21219 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): Use
21222 2021-06-02 Vineet Gupta <vgupta@synopsys.com>
21224 * config/arc/arc.h (TARGET_CPU_DEFAULT): Change to hs38_linux.
21226 2021-06-02 Ilya Leoshkevich <iii@linux.ibm.com>
21228 * config/s390/s390.md(*ashrdi3_31<setcc><cconly>): Use a single
21230 * config/s390/subst.md(cconly_subst): Use a single constraint
21231 in (match_scratch).
21233 2021-06-02 Martin Liska <mliska@suse.cz>
21235 * ipa-icf.h: Use auto_vec for memory_access_types.
21237 2021-06-02 Jeff Law <jeffreyalaw@gmail.com>
21239 * config/h8300/h8300-protos.h (compute_a_shift_length): Drop unused
21240 argument from prototype.
21241 (output_logical_op): Add rtx_code argument.
21242 (compute_logical_op_length): Likewise.
21243 * config/h8300/h8300.c (h8300_and_costs): Pass additional argument
21244 to compute_a_shift_length.
21245 (output_logical_op); New argument with the rtx code rather than
21246 extracting it from an operand. Handle QImode too.
21247 (compute_logical_op_length): Similary.
21248 (compute_a_shift_length): Drop unused argument.
21249 * config/h8300/h8300.md (logicals): New code iterator.
21250 * config/h8300/logical.md (<code><mode>3 expander): Combine
21251 the "and" expander with the "ior"/"xor" expander.
21252 (bclr<mode>msx): Combine the QI/HI mode patterns.
21253 (<logical><mode>3 insns): Use code iterator rather than match_operator.
21254 Handle QImode as well. Update call to output_logical_op and
21255 compute_logical_op_length to pass in rtx_code
21256 Fix split condition on all define_insn_and_split patterns.
21257 (one_cmpl<mode>2<cczn>): Use <cczn> to support both clobbering
21258 the flags and setting ZN via existing define_subst.
21259 * config/h8300/shiftrotate.md: Drop unused argument from
21260 calls to compute_a_shift_length.
21261 Signed-off-by: Jeff Law <jeffreyalaw@gmail.com>
21263 2021-06-01 Andrew Pinski <apinski@marvell.com>
21265 PR tree-optimization/25290
21266 * tree-ssa-phiopt.c (match_simplify_replacement):
21268 (tree_ssa_phiopt_worker): Use match_simplify_replacement.
21269 (two_value_replacement): Change the comment about
21270 conditional_replacement.
21271 (conditional_replacement): Delete.
21273 2021-06-01 Andrew Pinski <apinski@marvell.com>
21275 PR tree-optimization/95481
21276 * tree-tailcall.c (find_tail_calls): Handle empty typed
21279 2021-06-01 Andrew Pinski <apinski@marvell.com>
21281 * gimplify.c (zero_sized_field_decl): Delete
21282 (zero_sized_type): Delete
21283 (gimplify_init_ctor_eval): Use is_empty_type instead
21284 of zero_sized_field_decl.
21285 (gimplify_modify_expr): Use is_empty_type instead of
21288 2021-06-01 Jason Merrill <jason@redhat.com>
21291 * tree.h (CALL_FROM_NEW_OR_DELETE_P): Adjust comment.
21293 2021-06-01 Jason Merrill <jason@redhat.com>
21296 * diagnostic.h (warning_enabled_at): Declare.
21297 * diagnostic.c (diagnostic_enabled): Factor out from...
21298 (diagnostic_report_diagnostic): ...here.
21299 (warning_enabled_at): New.
21301 2021-06-01 Aldy Hernandez <aldyh@redhat.com>
21303 * gimple-ssa-evrp.c: Enable exporting of global ranges.
21305 2021-06-01 Martin Liska <mliska@suse.cz>
21308 * doc/invoke.texi: Mention that -fgcse-after-reload
21309 is enabled with -O3.
21311 2021-06-01 liuhongt <hongtao.liu@intel.com>
21313 PR tree-optimization/98365
21314 * tree-if-conv.c (strip_nop_cond_scalar_reduction): New function.
21315 (is_cond_scalar_reduction): Handle nop_expr in cond scalar reduction.
21316 (convert_scalar_cond_reduction): Ditto.
21317 (predicate_scalar_phi): Ditto.
21319 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
21321 PR tree-optimization/100781
21322 * gimple-range-cache.cc (ranger_cache::ranger_cache): Enable new
21323 value calculation by default.
21324 (ranger_cache::enable_new_values): New.
21325 (ranger_cache::disable_new_values): New.
21326 (ranger_cache::push_poor_value): Check if new values are allowed.
21327 * gimple-range-cache.h (class ranger_cache): New member/methods.
21328 * gimple-range.cc (gimple_ranger::range_of_expr): Check for debug
21329 statement, and disable/renable new value calculation.
21331 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
21333 * gimple-range-cache.cc (ranger_cache::ssa_range_in_bb): Delete.
21334 (ranger_cache::range_of_def): New.
21335 (ranger_cache::entry_range): New.
21336 (ranger_cache::exit_range): New.
21337 (ranger_cache::range_of_expr): Adjust.
21338 (ranger_cache::range_on_edge): Adjust.
21339 (ranger_cache::propagate_cache): Call exit_range directly.
21340 * gimple-range-cache.h (class ranger_cache): Adjust.
21342 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
21344 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust for
21345 gori_compute being a member rather than base class.
21346 dervied call to member call.
21347 (ranger_cache::dump): No longer dump gori_map.
21348 (ranger_cache::dump_bb): New.
21349 (ranger_cache::get_non_stale_global_range): Adjust for gori_compute
21350 being a member rather than base class.
21351 (ranger_cache::set_global_range): Ditto.
21352 (ranger_cache::ssa_range_in_bb): Ditto.
21353 (ranger_cache::range_of_expr): New.
21354 (ranger_cache::range_on_edge): New.
21355 (ranger_cache::block_range): Adjust for gori_computes. Debug changes.
21356 (ranger_cache::propagate_cache): Adjust debugging output.
21357 (ranger_cache::fill_block_cache): Adjust for gori_computes. Debug
21359 * gimple-range-cache.h (class ranger_cache): Make gori_compute a
21360 member, and inherit from range_query instead.
21361 (ranger_cache::dump_bb): New. split from dump.
21362 * gimple-range-gori.cc (gori_compute::ssa_range_in_bb): Delete.
21363 (gori_compute::expr_range_at_stmt): Delete.
21364 (gori_compute::compute_name_range_op): Delete.
21365 (gori_compute::compute_operand_range_switch): Add fur_source.
21366 (gori_compute::compute_operand_range): Add fur_source param, inline
21367 old compute_name_range_op and optimize_logical_operands.
21368 (struct tf_range): Delete.
21369 (gori_compute::logical_combine): Adjust
21370 (gori_compute::optimize_logical_operands): Delete.
21371 (gori_compute::compute_logical_operands_in_chain): Delete.
21372 (gori_compute::compute_logical_operands): Adjust.
21373 (gori_compute::compute_operand1_range): Adjust to fur_source.
21374 (gori_compute::compute_operand2_range): Ditto.
21375 (gori_compute::compute_operand1_and_operand2_range): Ditto.
21376 (gori_compute::outgoing_edge_range_p): Add range_query parameter,
21377 and adjust to fur_source.
21378 * gimple-range-gori.h (class gori_compute): Simplify and adjust to
21379 range_query and fur_source.
21380 * gimple-range.cc (gimple_ranger::range_on_edge): Query range_on_edge
21381 from the ranger_cache..
21382 (gimple_ranger::fold_range_internal): Adjust to base class change of
21384 (gimple_ranger::dump_bb): Adjust dump.
21385 * gimple-range.h (gimple_ranger):export gori computes object.
21387 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
21389 PR tree-optimization/100774
21390 * gimple-range-cache.cc (ranger_cache::get_non_stale_global_range):
21391 Constant values are also not stale.
21392 (ranger_cache::set_global_range): Range invariant values should also
21393 have the correct timestamp.
21395 2021-05-31 Martin Liska <mliska@suse.cz>
21397 * tree-streamer-in.c (unpack_ts_function_decl_value_fields):
21398 Unpack FUNCTION_DECL_DECL_TYPE.
21399 * tree-streamer-out.c (pack_ts_function_decl_value_fields):
21400 Stream FUNCTION_DECL_DECL_TYPE instead of
21401 DECL_IS_OPERATOR_NEW_P.
21402 * tree.h (set_function_decl_type): Use FUNCTION_DECL_DECL_TYPE
21404 (DECL_IS_OPERATOR_NEW_P): Likewise.
21405 (DECL_IS_OPERATOR_DELETE_P): Likewise.
21406 (DECL_LAMBDA_FUNCTION_P): Likewise.
21408 2021-05-31 Richard Biener <rguenther@suse.de>
21411 * internal-fn.c (expand_SHUFFLEVECTOR): Define.
21412 * internal-fn.def (SHUFFLEVECTOR): New.
21413 * internal-fn.h (expand_SHUFFLEVECTOR): Declare.
21414 * doc/extend.texi: Document __builtin_shufflevector.
21416 2021-05-31 Peter Bergner <bergner@linux.ibm.com>
21419 * config/rs6000/predicates.md(mma_assemble_input_operand): Allow
21420 indexed form addresses.
21422 2021-05-29 Jeff Law <jlaw@tachyum.com>
21424 * config/h8300/h8300.c (h8300_emit_stack_adjustment): Drop unused
21425 parameter. Call callers fixed.
21427 (output_plussi): Add FALLTHRU markers.
21428 (h8300_shift_needs_scratch_p): Add gcc_unreachable marker.
21430 2021-05-29 Jakub Jelinek <jakub@redhat.com>
21432 PR middle-end/99928
21433 * gimplify.c (gimplify_scan_omp_clauses): For taskloop simd
21434 combined with parallel, make sure to add shared clause to
21435 parallel for explicit linear clause.
21437 2021-05-29 Aldy Hernandez <aldyh@redhat.com>
21439 PR tree-optimization/100787
21440 * gimple-ssa-evrp.c: Disable exporting of global ranges.
21442 2021-05-28 Jason Merrill <jason@redhat.com>
21444 * tree-iterator.h (struct tree_stmt_iterator): Add operator++,
21445 operator--, operator*, operator==, and operator!=.
21446 (class tsi_range): New.
21448 2021-05-28 Richard Biener <rguenther@suse.de>
21450 PR tree-optimization/100778
21451 * tree-vect-slp.c (vect_build_slp_tree_1): Prevent possibly
21452 trapping ops in different BBs.
21454 2021-05-28 Richard Biener <rguenther@suse.de>
21457 * tree-inline.c (copy_bb): When processing __builtin_va_arg_pack
21458 copy fntype from original call.
21460 2021-05-28 Martin Liska <mliska@suse.cz>
21462 PR gcov-profile/100751
21463 * doc/gcov.texi: Revert partially a hunk that was wrong.
21465 2021-05-28 Cooper Qu <cooper.qu@linux.alibaba.com>
21467 * config/csky/csky-linux-elf.h (HAVE_sync_compare_and_swapqi):
21469 (HAVE_sync_compare_and_swaphi): Likewise.
21470 (HAVE_sync_compare_and_swapsi): Likewise.
21472 2021-05-28 Jakub Jelinek <jakub@redhat.com>
21474 PR middle-end/99928
21475 * tree.h (OMP_CLAUSE_MAP_IMPLICIT): Define.
21477 2021-05-28 Tobias Burnus <tobias@codesourcery.com>
21479 * gimplify.c (gimplify_omp_affinity): New.
21480 (gimplify_scan_omp_clauses): Call it; remove affinity clause afterwards.
21481 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_AFFINITY.
21482 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_AFFINITY.
21483 * tree.c (omp_clause_num_ops, omp_clause_code_name): Add clause.
21484 (walk_tree_1): Handle OMP_CLAUSE_AFFINITY.
21486 2021-05-28 Joern Rennecke <joern.rennecke@riscy-ip.com>
21487 Richard Biener <rguenther@suse.de>
21489 * match.pd <popcount & / + pattern matching>:
21490 When generating popcount directly fails, try doing it in two halves.
21492 2021-05-28 Bernd Edlinger <bernd.edlinger@hotmail.de>
21494 * Makefile.in (generated_files): Add gimple-match.c and
21497 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
21499 * gensupport.c (alter_predicate_for_insn): Handle MATCH_DUP.
21501 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
21503 * gensupport.c (alter_constraints): Add MATCH_SCRATCH case.
21505 2021-05-28 Kewen Lin <linkw@linux.ibm.com>
21507 PR tree-optimization/99398
21508 * tree-ssa-forwprop.c (simplify_permutation): Optimize some cases
21509 where the fed operands are CTOR/CST and propagated through
21510 VIEW_CONVERT_EXPR. Call vec_perm_indices::new_shrunk_vector.
21511 * vec-perm-indices.c (vec_perm_indices::new_shrunk_vector): New
21513 * vec-perm-indices.h (vec_perm_indices::new_shrunk_vector): New
21516 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
21518 * config/i386/mmx.md (addv2sf3): Do not call
21519 ix86_fixup_binary_operands_no_copy.
21522 (<smaxmin:code>v2sf3): Ditto.
21523 (<plusminus:insn><MMXMODEI:mode>3): Ditto.
21524 (<plusminus:insn><VI_32:mode>3): Remove expander.
21525 (<plusminus:insn><VI_32:mode>3): Rename from
21526 "*<plusminus:insn><VI_32:mode>3".
21527 (mulv4hi): Do not call ix86_fixup_binary_operands_no_copy.
21528 (mulv2hi3): Remove expander.
21529 (mulv2hi3): Rename from *mulv2hi3.
21530 (<s>mulv2hi3_highpart): Remove expander.
21531 (<s>mulv2hi3_highpart): Rename from *<s>mulv2hi3_highpart.
21532 (<smaxmin:code><MMXMODE14:mode>3): Rename from
21533 "*mmx_<smaxmin:code><MMXMODE14:mode>3".
21534 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Remove expander.
21535 (SMAXMIN_MMXMODEI): Remove mode iterator.
21536 (<smaxmin:code>v4hi3): New expander.
21537 (<smaxmin:code>v4qi3): Rename from *<smaxmin:code>v4qi3.
21538 (<smaxmin:code>v2hi3): Rename from *<smaxmin:code>v2hi3.
21539 (<smaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
21540 (SMAXMIN_VI_32): Remove mode iterator.
21541 (<umaxmin:code><MMXMODE24:mode>3): Rename from
21542 "*mmx_<umaxmin:code><MMXMODE24:mode>3".
21543 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Remove expander.
21544 (UMAXMIN_MMXMODEI): Remove mode iterator.
21545 (<umaxmin:code>v8qi3): New expander.
21546 (<umaxmin:code>v4qi3): Rename from *<umaxmin:code>v4qi3.
21547 (<umaxmin:code>v2hi3): Rename from *<umaxmin:code>v2hi3.
21548 (<umaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
21549 (UMAXMIN_VI_32): Remove mode iterator.
21550 (<any_shift:insn>v2hi3): Remove expander.
21551 (<any_shift:insn>v2hi3): Rename from *<any_shift:insn>v2hi3.
21552 (<any_logic:code><MMXMODEI:mode>3): Do not call
21553 ix86_fixup_binary_operands_no_copy.
21554 (<any_logic:code><VI_32:mode>3): Remove expander.
21555 (<any_logic:code><VI_32:mode>3): Rename from
21556 "*<any_logic:code><VI_32:mode>3".
21557 (uavg<mode>3_ceil): Do not call ix86_fixup_binary_operands_no_copy.
21558 * config/i386/sse.md (div<VF2:mode>3): Do not call
21559 ix86_fixup_binary_operands_no_copy.
21560 (div<VF1:mode>3): Ditto.
21561 (<maxmin:code><VI8_AVX2_AVX512F:mode>3): Ditto.
21562 (smulhrsv4hi3): Ditto.
21563 (smulhrsv2hi3): Ditto.
21565 2021-05-27 Martin Sebor <msebor@redhat.com>
21567 * ggc.h (gt_ggc_mx): Add overloads for all integers.
21569 * hash-map.h (class hash_map): Add pch_nx_helper overloads for all
21571 (hash_map::operator==): New function.
21573 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
21576 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
21577 For TARGET_XOP bypass SSE comparisons for all supported vector modes.
21578 * config/i386/mmx.md (*xop_maskcmp<MMXMODEI:mode>3): New insn pattern.
21579 (*xop_maskcmp<VI_32:mode>3): Ditto.
21580 (*xop_maskcmp_uns<MMXMODEI:mode>3): Ditto.
21581 (*xop_maskcmp_uns<VI_32:mode>3): Ditto.
21583 2021-05-27 Richard Earnshaw <rearnsha@arm.com>
21586 * config/arm/arm.c (arm_configure_build_target): Remove parameter
21587 opts_set, directly check opts parameters for being non-null.
21588 (arm_option_restore): Update call to arm_configure_build_target.
21589 (arm_option_override): Likewise.
21590 (arm_can_inline_p): Likewise.
21591 (arm_valid_target_attribute_tree): Likewise.
21592 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
21593 * config/arm/arm-protos.h (arm_configure_build_target): Adjust
21596 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
21598 * vr-values.c (simplify_conversion_using_ranges): Use
21599 get_range_query instead of get_global_range_query.
21601 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
21603 * gimple-range.cc (get_range_global): Move to value-query.cc.
21604 (gimple_range_global): Same.
21605 (get_global_range_query): Same.
21606 (global_range_query::range_of_expr): Same.
21607 * gimple-range.h (class global_range_query): Move to
21609 (gimple_range_global): Same.
21610 * tree-ssanames.c (get_range_info): Move to value-query.cc.
21611 (get_ptr_nonnull): Same.
21612 * tree-ssanames.h (get_range_info): Remove.
21613 (get_ptr_nonnull): Remove.
21614 * value-query.cc (get_ssa_name_range_info): Move from
21616 (get_ssa_name_ptr_info_nonnull): Same.
21617 (get_range_global): Move from gimple-range.cc.
21618 (gimple_range_global): Same.
21619 (get_global_range_query): Same.
21620 (global_range_query::range_of_expr): Same.
21621 * value-query.h (class global_range_query): Move from
21623 (gimple_range_global): Same.
21625 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
21628 * config/i386/mmx.md (uavgv4qi3_ceil): New insn pattern.
21629 (uavgv2hi3_ceil): Ditto.
21631 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
21634 * doc/extend.texi (scalar_storage_order): Rephrase slightly.
21636 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
21638 * tree-ssanames.c (get_range_info): Merge both copies of
21639 get_range_info into one that works with irange.
21640 * tree-ssanames.h (get_range_info): Remove version that works on
21643 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
21645 * builtins.c (check_nul_terminated_array): Convert to get_range_query.
21646 (expand_builtin_strnlen): Same.
21647 (determine_block_size): Same.
21648 * fold-const.c (expr_not_equal_to): Same.
21649 * gimple-fold.c (size_must_be_zero_p): Same.
21650 * gimple-match-head.c: Include gimple-range.h.
21651 * gimple-pretty-print.c (dump_ssaname_info): Convert to get_range_query.
21652 * gimple-ssa-warn-restrict.c
21653 (builtin_memref::extend_offset_range): Same.
21654 * graphite-sese-to-poly.c (add_param_constraints): Same.
21655 * internal-fn.c (get_min_precision): Same.
21656 * ipa-fnsummary.c (set_switch_stmt_execution_predicate): Same.
21657 * ipa-prop.c (ipa_compute_jump_functions_for_edge): Same.
21659 * tree-data-ref.c (split_constant_offset): Same.
21660 (dr_step_indicator): Same.
21661 * tree-dfa.c (get_ref_base_and_extent): Same.
21662 * tree-scalar-evolution.c (iv_can_overflow_p): Same.
21663 * tree-ssa-loop-niter.c (refine_value_range_using_guard): Same.
21664 (determine_value_range): Same.
21665 (record_nonwrapping_iv): Same.
21666 (infer_loop_bounds_from_signedness): Same.
21667 (scev_var_range_cant_overflow): Same.
21668 * tree-ssa-phiopt.c (two_value_replacement): Same.
21669 * tree-ssa-pre.c (insert_into_preds_of_block): Same.
21670 * tree-ssa-reassoc.c (optimize_range_tests_to_bit_test): Same.
21671 * tree-ssa-strlen.c (handle_builtin_stxncpy_strncat): Same.
21673 (dump_strlen_info): Same.
21674 (set_strlen_range): Same.
21675 (maybe_diag_stxncpy_trunc): Same.
21676 (get_len_or_size): Same.
21677 (handle_integral_assign): Same.
21678 * tree-ssa-structalias.c (find_what_p_points_to): Same.
21679 * tree-ssa-uninit.c (find_var_cmp_const): Same.
21680 * tree-switch-conversion.c (bit_test_cluster::emit): Same.
21681 * tree-vect-patterns.c (vect_get_range_info): Same.
21682 (vect_recog_divmod_pattern): Same.
21683 * tree-vrp.c (intersect_range_with_nonzero_bits): Same.
21684 (register_edge_assert_for_2): Same.
21685 (determine_value_range_1): Same.
21686 * tree.c (get_range_pos_neg): Same.
21687 * vr-values.c (vr_values::get_lattice_entry): Same.
21688 (vr_values::update_value_range): Same.
21689 (simplify_conversion_using_ranges): Same.
21691 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
21693 * gimple-ssa-warn-alloca.c (alloca_call_type): Use
21694 get_range_query instead of query argument.
21695 (pass_walloca::execute): Enable and disable global ranger.
21697 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
21699 * gimple-ssa-evrp.c (rvrp_folder::rvrp_folder): Call
21701 (rvrp_folder::~rvrp_folder): Call disable_ranger.
21702 (hybrid_folder::hybrid_folder): Call enable_ranger.
21703 (hybrid_folder::~hybrid_folder): Call disable_ranger.
21705 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
21707 * function.c (allocate_struct_function): Set cfun->x_range_query.
21708 * function.h (struct function): Declare x_range_query.
21709 (get_range_query): New.
21710 (get_global_range_query): New.
21711 * gimple-range-cache.cc (ssa_global_cache::ssa_global_cache):
21712 Remove call to safe_grow_cleared.
21713 * gimple-range.cc (get_range_global): New.
21714 (gimple_range_global): Move from gimple-range.h.
21715 (get_global_range_query): New.
21716 (global_range_query::range_of_expr): New.
21717 (enable_ranger): New.
21718 (disable_ranger): New.
21719 * gimple-range.h (gimple_range_global): Move to gimple-range.cc.
21720 (class global_range_query): New.
21721 (enable_ranger): New.
21722 (disable_ranger): New.
21723 * gimple-ssa-evrp.c (evrp_folder::~evrp_folder): Rename
21724 dump_all_value_ranges to dump.
21725 * tree-vrp.c (vrp_prop::finalize): Same.
21726 * value-query.cc (range_query::dump): New.
21727 * value-query.h (range_query::dump): New.
21728 * vr-values.c (vr_values::dump_all_value_ranges): Rename to...
21729 (vr_values::dump): ...this.
21730 * vr-values.h (class vr_values): Rename dump_all_value_ranges to
21731 dump and make virtual.
21733 2021-05-26 Uroš Bizjak <ubizjak@gmail.com>
21735 * config/i386/i386.c (ix86_autovectorize_vector_modes):
21736 Add V4QImode and V16QImode for TARGET_SSE2.
21737 * doc/sourcebuild.texi (Vector-specific attributes):
21738 Add vect64 and vect32 description.
21740 2021-05-26 Bernd Edlinger <bernd.edlinger@hotmail.de>
21742 * gimple-range-gori.cc (range_def_chain::register_dependency):
21743 Resize m_def_chain when needed.
21745 2021-05-26 Christophe Lyon <christophe.lyon@linaro.org>
21747 * config/arm/mve.md (mve_vaddvq_<supf><mode>): Prefix with '@'.
21748 * config/arm/neon.md (reduc_plus_scal_<mode>): Move to ..
21749 * config/arm/vec-common.md: .. here. Add support for MVE.
21751 2021-05-26 Jakub Jelinek <jakub@redhat.com>
21753 * config/epiphany/epiphany.c (epiphany_print_operand_address): Remove
21755 * config/microblaze/microblaze.c (microblaze_legitimize_address,
21757 microblaze_option_override, print_operand): Likewise.
21758 * config/microblaze/microblaze.md (call_internal_plt,
21759 call_value_intern_plt, call_value_intern): Likewise.
21760 * config/arm/aout.h (ASM_OUTPUT_ALIGN): Likewise.
21761 * config/iq2000/iq2000.md (call_internal1, call_value_internal1,
21762 call_value_multiple_internal1): Likewise.
21763 * config/bfin/bfin.c (symbolic_reference_mentioned_p): Likewise.
21765 2021-05-26 Jan-Benedict Glaw <jbglaw@lug-owl.de>
21767 * config/arc/arc.c (arc_address_cost, arc_print_operand_address,
21768 arc_ccfsm_advance, symbolic_reference_mentioned_p,
21769 arc_raw_symbolic_reference_mentioned_p): Remove register
21772 2021-05-26 Jakub Jelinek <jakub@redhat.com>
21775 * omp-low.c: Include omp-offload.h.
21776 (create_omp_child_function): If current_function_decl has
21777 "omp declare target" attribute and is_gimple_omp_offloaded,
21778 remove that attribute from the copy of attribute list and
21779 add "omp target entrypoint" attribute instead.
21780 (lower_omp_target): Mark .omp_data_sizes.* and .omp_data_kinds.*
21781 variables for offloading if in omp_maybe_offloaded_ctx.
21782 * omp-offload.c (pass_omp_target_link::execute): Nullify second
21783 argument to GOMP_target_data_ext in offloaded code.
21785 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
21787 * config/csky/csky.c (csky_can_change_mode_class): Delete.
21788 For csky, HF/SF mode use the low bits of VREGS.
21790 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
21792 * gimplify.c (gimplify_decl_expr): Do not clear TREE_READONLY on a
21793 DECL which is a reference for OMP.
21795 2021-05-26 Martin Liska <mliska@suse.cz>
21797 PR gcov-profile/100751
21798 * doc/gcov.texi: Document that __gcov_dump can be called just
21799 once and that __gcov_reset resets run-time counters.
21801 2021-05-26 Martin Liska <mliska@suse.cz>
21803 * doc/install.texi: Port relevant part from install-old.texi
21804 and re-generate list of CPUs and systems.
21806 2021-05-26 Martin Liska <mliska@suse.cz>
21808 * Makefile.in: Remove it.
21809 * doc/include/fdl.texi: Update next/previous chapters.
21810 * doc/install.texi: Likewise.
21811 * doc/install-old.texi: Removed.
21813 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
21815 * config/csky/csky.c (ck810_legitimate_index_p): Support
21816 "base + index" with DF mode.
21817 * config/csky/constraints.md ("Y"): New constraint for memory operands
21818 without index register.
21819 * config/csky/csky_insn_fpuv2.md (fpuv3_movdf): Use "Y" instead of "m"
21820 when mov between memory and general registers, and lower their priority.
21821 * config/csky/csky_insn_fpuv3.md (fpuv2_movdf): Likewise.
21823 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
21825 * config/csky/csky.c (TARGET_PROMOTE_PROTOTYPES): Delete.
21827 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
21829 * config/csky/csky.md (untyped_call): Emit clobber for return
21830 registers to mark them used.
21832 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
21834 * config/csky/csky.md (cskyv2_sextend_ldbs): New.
21836 2021-05-26 Andrew Pinski <apinski@marvell.com>
21838 * match.pd (x < 0 ? ~y : y): New patterns.
21840 2021-05-26 Andrew Pinski <apinski@marvell.com>
21842 * match.pd (A?CST1:CST2): Add simplifcations for A?0:+-1, A?+-1:0,
21843 A?POW2:0 and A?0:POW2.
21845 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
21847 * gimple-range-gori.cc (class logical_stmt_cache): Delete
21848 (logical_stmt_cache::logical_stmt_cache ): Delete.
21849 (logical_stmt_cache::~logical_stmt_cache): Delete.
21850 (logical_stmt_cache::cache_entry::dump): Delete.
21851 (logical_stmt_cache::get_range): Delete.
21852 (logical_stmt_cache::cached_name ): Delete.
21853 (logical_stmt_cache::same_cached_name): Delete.
21854 (logical_stmt_cache::cacheable_p): Delete.
21855 (logical_stmt_cache::slot_diagnostics ): Delete.
21856 (logical_stmt_cache::dump): Delete.
21857 (gori_compute_cache::gori_compute_cache): Delete.
21858 (gori_compute_cache::~gori_compute_cache): Delete.
21859 (gori_compute_cache::compute_operand_range): Delete.
21860 (gori_compute_cache::cache_stmt): Delete.
21861 * gimple-range-gori.h (gori_compute::compute_operand_range): Remove
21863 (class gori_compute_cache): Delete.
21865 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
21867 * gimple-range.cc (fold_using_range::range_of_range_op): Use m_gori
21869 (fold_using_range::range_of_address): Adjust.
21870 (fold_using_range::range_of_phi): Adjust.
21871 * gimple-range.h (class fur_source): Adjust.
21872 (fur_source::fur_source): Adjust.
21874 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
21876 * gimple-range-gori.cc (gori_compute::expr_range_at_stmt): Rename
21877 from expr_range_in_bb and adjust.
21878 (gori_compute::compute_name_range_op): Adjust.
21879 (gori_compute::optimize_logical_operands): Adjust.
21880 (gori_compute::compute_logical_operands_in_chain): Adjust.
21881 (gori_compute::compute_operand1_range): Adjust.
21882 (gori_compute::compute_operand2_range): Adjust.
21883 (ori_compute_cache::cache_stmt): Adjust.
21884 * gimple-range-gori.h (gori_compute): Rename prototype.
21886 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
21888 * gimple-range.cc (gimple_ranger::range_of_expr): Non-null should be
21889 checked only after range_of_stmt, not range_on_entry.
21890 (gimple_ranger::range_on_entry): Check for non-null in any
21891 predecessor block, if it is not already non-null.
21892 (gimple_ranger::range_on_exit): DOnt check for non-null after
21893 range on entry call.
21894 (gimple_ranger::dump_bb): New. Split from dump.
21895 (gimple_ranger::dump): Adjust.
21896 * gimple-range.h (class gimple_ranger): Adjust.
21898 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
21900 * gimple-range-cache.cc (struct range_timestamp): Delete.
21901 (class temporal_cache): Adjust.
21902 (temporal_cache::get_timestamp): Delete.
21903 (temporal_cache::set_dependency): Delete.
21904 (temporal_cache::temporal_value): Adjust.
21905 (temporal_cache::current_p): Take dependencies as params.
21906 (temporal_cache::set_timestamp): Adjust.
21907 (temporal_cache::set_always_current): Adjust.
21908 (ranger_cache::get_non_stale_global_range): Adjust.
21909 (ranger_cache::register_dependency): Delete.
21910 * gimple-range-cache.h (class range_cache): Adjust.
21912 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
21914 * gimple-range-gori.cc (range_def_chain::range_def_chain): init
21916 (range_def_chain::~range_def_chain): Dispose of obstack rather than
21917 each individual bitmap.
21918 (range_def_chain::set_import): New.
21919 (range_def_chain::get_imports): New.
21920 (range_def_chain::chain_import_p): New.
21921 (range_def_chain::register_dependency): Rename from build_def_chain
21923 (range_def_chain::def_chain_in_bitmap_p): New.
21924 (range_def_chain::add_def_chain_to_bitmap): New.
21925 (range_def_chain::has_def_chain): Just check first depenedence.
21926 (range_def_chain::get_def_chain): Process imports, use generic
21927 register_dependency routine.
21928 (range_def_chain::dump): New.
21929 (gori_map::gori_map): Allocate import list.
21930 (gori_map::~gori_map): Release imports.
21931 (gori_map::exports): Check for past allocated block size.
21932 (gori_map::imports): New.
21933 (gori_map::def_chain_in_export_p): Delete.
21934 (gori_map::is_import_p): New.
21935 (gori_map::maybe_add_gori): Handle imports.
21936 (gori_map::dump): Adjust output, add imports.
21937 (gori_compute::has_edge_range_p): Remove def_chain_in_export call.
21938 (gori_export_iterator::gori_export_iterator): New.
21939 (gori_export_iterator::next): New.
21940 (gori_export_iterator::get_name): New.
21941 * gimple-range-gori.h (range_def_chain): Add imports and direct
21942 dependecies via struct rdc.
21943 (range_def_chain::depend1): New.
21944 (range_def_chain::depend2): New.
21945 (class gori_map): Adjust.
21946 (FOR_EACH_GORI_IMPORT_NAME): New.
21947 (FOR_EACH_GORI_EXPORT_NAME): New.
21948 (class gori_export_iterator): New.
21950 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
21952 * gimple-range-cache.cc (ranger_cache::ranger_cache): Move initial
21953 export cache filling to here.
21954 * gimple-range-gori.cc (gori_compute::gori_compute) : From Here.
21956 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
21958 * gimple-range-gori.cc (range_def_chain): Move to gimple-range-gori.h.
21959 (gori_map): Move to gimple-range-gori.h.
21960 (gori_compute::gori_compute): Adjust.
21961 (gori_compute::~gori_compute): Delete.
21962 (gori_compute::compute_operand_range_switch): Adjust.
21963 (gori_compute::compute_operand_range): Adjust.
21964 (gori_compute::compute_logical_operands): Adjust.
21965 (gori_compute::has_edge_range_p ): Adjust.
21966 (gori_compute::set_range_invariant): Delete.
21967 (gori_compute::dump): Adjust.
21968 (gori_compute::outgoing_edge_range_p): Adjust.
21969 * gimple-range-gori.h (class range_def_chain): Relocate here.
21970 (class gori_map): Relocate here.
21971 (class gori_compute): Inherit from gori_map, and adjust.
21973 2021-05-25 Aldy Hernandez <aldyh@redhat.com>
21975 * value-range.cc (range_tests_legacy): Use
21976 build_nonstandard_integer_type instead of int and short.
21978 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
21980 * gimplify.c (gimplify_decl_expr): Clear TREE_READONLY on the DECL
21981 when really creating an initialization statement for it.
21983 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
21985 * tree-inline.c (setup_one_parameter): Fix thinko in new condition.
21987 2021-05-25 Kito Cheng <kito.cheng@sifive.com>
21989 * config/riscv/riscv.h (ASM_SPEC): Pass -mno-relax.
21991 2021-05-25 Martin Liska <mliska@suse.cz>
21993 PR tree-optimization/92860
21995 * optc-save-gen.awk: Remove exceptions.
21997 2021-05-25 Martin Liska <mliska@suse.cz>
21999 * asan.h (sanitize_coverage_p): New function.
22000 * doc/extend.texi: Document it.
22001 * fold-const.c (fold_range_test): Use sanitize_flags_p
22002 instead of flag_sanitize_coverage.
22003 (fold_truth_andor): Likewise.
22004 * sancov.c: Likewise.
22005 * tree-ssa-ifcombine.c (ifcombine_ifandif): Likewise.
22006 * ipa-inline.c (sanitize_attrs_match_for_inline_p): Handle
22007 -fsanitize-coverage when inlining.
22009 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
22011 * config/csky/csky-modes.def : Fix copyright.
22013 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
22015 * config/csky/csky-modes.def : Amend copyright.
22016 * config/csky/csky_insn_fpuv2.md : Likewise.
22017 * config/csky/csky_insn_fpuv3.md : Likewise.
22019 2021-05-25 Richard Biener <rguenther@suse.de>
22021 PR middle-end/100727
22022 * calls.c (initialize_argument_information): Explicitely test
22023 for WITH_SIZE_EXPR.
22024 * gimple-expr.c (mark_addressable): Skip outer WITH_SIZE_EXPR.
22026 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
22028 * config/csky/csky.h (FRAME_POINTER_REGNUM): Use
22029 HARD_FRAME_POINTER_REGNUM and FRAME_POINTER_REGNUM instead of
22030 the signle definition. The signle definition may not work well
22031 at simplify_subreg_regno().
22032 (HARD_FRAME_POINTER_REGNUM): New.
22033 (ELIMINABLE_REGS): Add for HARD_FRAME_POINTER_REGNUM.
22034 * config/csky/csky.c (get_csky_live_regs, csky_can_eliminate,
22035 csky_initial_elimination_offset, csky_expand_prologue,
22036 csky_expand_epilogue): Add for HARD_FRAME_POINTER_REGNUM.
22038 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
22040 * config/csky/csky.c (csky_option_override):
22041 Init csky_arch_isa_features[] in advance, so TARGET_DSP
22042 and TARGET_DIV can be set well.
22044 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
22046 * config/csky/constraints.md ("l", "h"): Delete.
22047 * config/csky/csky.h (reg_class, REG_CLASS_NAMES,
22048 REG_CLASS_CONTENTS): Delete LO_REGS and HI_REGS.
22049 * config/csky/csky.c (regno_reg_classm,
22050 csky_secondary_reload, csky_register_move_cost):
22051 Use HILO_REGS instead of LO_REGS and HI_REGS.
22053 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
22055 * config/csky/constraints.md ("W"): New constriant for mem operand
22056 with base reg, index register.
22057 ("Q"): Renamed and modified "csky_valid_fpuv2_mem_operand" to
22058 "csky_valid_mem_constraint_operand" to deal with both "Q" and "W"
22060 ("Dv"): New constraint for const double value that can be used at
22062 * config/csky/csky-modes.def (HFmode): New mode.
22063 * config/csky/csky-protos.h (csky_valid_fpuv2_mem_operand): Rename
22064 to "csky_valid_mem_constraint_operand" and support new constraint
22066 (csky_get_movedouble_length): New.
22067 (fpuv3_output_move): New.
22068 (fpuv3_const_double): New.
22069 * config/csky/csky.c (csky_option_override): New arch CK860 with fpv3.
22070 (decompose_csky_address): Refine.
22071 (csky_print_operand): New "CONST_DOUBLE" operand.
22072 (csky_output_move): Support fpv3 instructions.
22073 (csky_get_movedouble_length): New.
22074 (fpuv3_output_move): New.
22075 (fpuv3_const_double): New.
22076 (csky_emit_compare): Cover float comparsion.
22077 (csky_emit_compare_float): Refine.
22078 (csky_vaild_fpuv2_mem_operand): Rename to
22079 "csky_valid_mem_constraint_operand" and support new constraint "W".
22080 (ck860_rtx_costs): New.
22081 (csky_rtx_costs): Add the cost calculation of CK860.
22082 (regno_reg_class): New vregs for fpuv3.
22083 (csky_dbx_regno): Likewise.
22084 (csky_cpu_cpp_builtins): New builtin macro for fpuv3.
22085 (csky_conditional_register_usage): Suporrot fpuv3.
22086 (csky_dwarf_register_span): Suporrot fpuv3.
22087 (csky_init_builtins, csky_mangle_type): Support "__fp16" type.
22088 (ck810_legitimate_index_p): Support fp16.
22089 * config/csky/csky.h (TARGET_TLS): ADD CK860.
22090 (CSKY_VREG_P, CSKY_VREG_LO_P, CSKY_VREG_HI_P): Support fpuv3.
22091 (TARGET_SINGLE_FPU): Support fpuv3.
22092 (TARGET_SUPPORT_FPV3): New.
22093 (FIRST_PSEUDO_REGISTER): Change to 202 to hold the new fpuv3 registers.
22094 (FIXED_REGISTERS, CALL_REALLY_USED_REGISTERS, REGISTER_NAMES,
22095 REG_CLASS_CONTENTS): Support fpuv3.
22096 * config/csky/csky.md (movsf): Move to cksy_insn_fpu.md and refine.
22097 (csky_movsf_fpv2): Likewise.
22098 (ck801_movsf): Likewise.
22099 (csky_movsf): Likewise.
22101 (csky_movdf_fpv2): Likewise.
22102 (ck801_movdf): Likewise.
22103 (csky_movdf): Likewise.
22104 (movsicc): Refine. Use "comparison_operatior" instead of
22105 "ordered_comparison_operatior".
22106 (addsicc): Likewise.
22107 (CSKY_FIRST_VFP3_REGNUM, CSKY_LAST_VFP3_REGNUM): New constant.
22108 (call_value_internal_vh): New.
22109 * config/csky/csky_cores.def (CK860): New arch and cpu.
22114 * config/csky/csky_insn_fpu.md: Refactor. Separate all float patterns
22115 into emit-patterns and match-patterns, remain the emit-patterns here,
22116 and move the match-patterns to csky_insn_fpuv2.md or
22117 csky_insn_fpuv3.md.
22118 * config/csky/csky_insn_fpuv2.md: New file for fpuv2 instructions.
22119 * config/csky/csky_insn_fpuv3.md: New file and new patterns for fpuv3
22121 * config/csky/csky_isa.def (fcr): New.
22126 (CK860): New definition for ck860.
22127 * config/csky/csky_tables.opt (ck860): New processors ck860,
22128 ck860f. And new arch ck860.
22133 * config/csky/predicates.md (csky_float_comparsion_operator): Delete
22134 "geu", "gtu", "leu", "ltu", which will never appear at float comparison.
22135 * config/csky/t-csky-elf: Support 860.
22136 * config/csky/t-csky-linux: Likewise.
22137 * doc/md.texi: Add "Q" and "W" constraints for C-SKY.
22139 2021-05-24 Aaron Sawdey <acsawdey@linux.ibm.com>
22141 * config/rs6000/genfusion.pl (gen_logical_addsubf): Refactor to
22142 add generation of logical-add and add-logical fusion pairs.
22143 * config/rs6000/rs6000-cpus.def: Add new fusion to ISA 3.1 mask
22145 * config/rs6000/rs6000.c (rs6000_option_override_internal): Turn on
22146 logical-add and add-logical fusion by default.
22147 * config/rs6000/rs6000.opt: Add -mpower10-fusion-logical-add and
22148 -mpower10-fusion-add-logical options.
22149 * config/rs6000/fusion.md: Regenerate file.
22151 2021-05-24 Aldy Hernandez <aldyh@redhat.com>
22153 * value-range.cc (irange::legacy_equal_p): Check type when
22154 comparing VR_VARYING types.
22155 (range_tests_legacy): Test comparing VARYING ranges of different
22158 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
22160 * config/aarch64/aarch64.c (neoversen1_tunings):
22161 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
22163 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
22165 * config/aarch64/aarch64.c (aarch64_classify_symbol): Use GOT for
22166 extern weak symbols. Limit symbol offsets for non-GOT symbols with
22169 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
22171 * config/arm/neon.md (vec_load_lanesxi<mode>)
22172 (vec_store_lanexoi<mode>): Move ...
22173 * config/arm/vec-common.md: here.
22175 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
22177 * config/arm/neon.md (vec_load_lanesoi<mode>)
22178 (vec_store_lanesoi<mode>): Move ...
22179 * config/arm/vec-common.md: here.
22181 2021-05-24 liuhongt <hongtao.liu@intel.com>
22184 * config/i386/i386.c (ix86_gimple_fold_builtin): Replacing
22185 stmt with GIMPLE_NOP when lhs doesn't exist.
22187 2021-05-23 Uroš Bizjak <ubizjak@gmail.com>
22190 * config/i386/mmx.md (*push<VI_32:mode>2_rex64):
22191 New instruction pattern.
22192 (*push<VI_32:mode>2): Ditto.
22193 (push splitter for SSE registers): New splitter.
22195 2021-05-23 Andrew Pinski <apinski@marvell.com>
22197 * match.pd ((A & C) != 0 ? D : 0): Limit to non pointer types.
22199 2021-05-22 Aaron Sawdey <acsawdey@linux.ibm.com>
22201 * config/rs6000/genfusion.pl (gen_addadd): Fix incorrect attr types.
22202 * config/rs6000/fusion.md: Regenerate file.
22204 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
22206 * config/rs6000/genfusion.pl (gen_addadd): New function.
22207 * config/rs6000/fusion.md: Regenerate file.
22208 * config/rs6000/rs6000-cpus.def: Add
22209 OPTION_MASK_P10_FUSION_2ADD to masks.
22210 * config/rs6000/rs6000.c (rs6000_option_override_internal):
22211 Handle default value of OPTION_MASK_P10_FUSION_2ADD.
22212 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2add.
22214 2021-05-21 Jakub Jelinek <jakub@redhat.com>
22216 PR middle-end/99928
22217 * tree.h (OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET): Define.
22218 * gimplify.c (enum gimplify_omp_var_data): Fix up
22219 GOVD_MAP_HAS_ATTACHMENTS value, add GOVD_FIRSTPRIVATE_IMPLICIT.
22220 (omp_lastprivate_for_combined_outer_constructs): If combined target
22221 has GOVD_FIRSTPRIVATE_IMPLICIT set for the decl, change it to
22222 GOVD_MAP | GOVD_SEEN.
22223 (gimplify_scan_omp_clauses): Set GOVD_FIRSTPRIVATE_IMPLICIT for
22224 firstprivate clauses with OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT.
22225 (gimplify_adjust_omp_clauses): For firstprivate clauses with
22226 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT either clear that bit and
22227 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET too, or remove it and
22228 let it be replaced by implicit map clause.
22230 2021-05-21 Jakub Jelinek <jakub@redhat.com>
22232 PR middle-end/99928
22233 * gimplify.c (omp_lastprivate_for_combined_outer_constructs): New
22235 (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LASTPRIVATE>: Use it.
22236 (gimplify_omp_for): Likewise.
22238 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22240 PR middle-end/90115
22241 * omp-low.c (oacc_privatization_candidate_p): Reject 'static',
22242 'external' in blocks.
22244 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22246 PR middle-end/90115
22247 * flag-types.h (enum openacc_privatization): New.
22248 * params.opt (-param=openacc-privatization): New.
22249 * doc/invoke.texi (openacc-privatization): Document it.
22250 * omp-general.h (get_openacc_privatization_dump_flags): New
22252 * omp-low.c (oacc_privatization_candidate_p): Add diagnostics.
22253 * omp-offload.c (execute_oacc_device_lower)
22254 <IFN_UNIQUE_OACC_PRIVATE>: Re-work diagnostics.
22255 * target.def (goacc.adjust_private_decl): Add 'location_t'
22257 * doc/tm.texi: Regenerate.
22258 * config/gcn/gcn-protos.h (gcn_goacc_adjust_private_decl): Adjust.
22259 * config/gcn/gcn-tree.c (gcn_goacc_adjust_private_decl): Likewise.
22260 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl):
22261 Likewise. Preserve it for...
22262 (nvptx_goacc_expand_var_decl): ... use here.
22264 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22266 * doc/sourcebuild.texi (Other attributes): Document '__OPTIMIZE__'
22269 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22271 PR middle-end/90115
22272 * omp-low.c (oacc_privatization_candidate_p): New function.
22273 (oacc_privatization_scan_clause_chain)
22274 (oacc_privatization_scan_decl_chain): Use it. Also
22275 'gcc_checking_assert' that we're not seeing duplicates.
22277 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22279 PR middle-end/90115
22280 * omp-offload.c (execute_oacc_device_lower): Skip processing if no
22283 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22285 PR middle-end/90115
22286 * omp-offload.c (execute_oacc_device_lower): Explain.
22288 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22290 PR middle-end/90115
22291 * omp-offload.c (execute_oacc_device_lower)
22292 <IFN_UNIQUE_OACC_PRIVATE>: Diagnose and handle for 'level == -1'
22294 * internal-fn.c (expand_UNIQUE): Don't expect
22295 'IFN_UNIQUE_OACC_PRIVATE'.
22297 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22299 PR middle-end/90115
22300 * omp-low.c (lower_omp_for): Don't evaluate OpenMP 'for' clauses.
22302 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
22304 PR middle-end/90115
22305 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl)
22306 (nvptx_goacc_expand_var_decl): Tighten.
22308 2021-05-21 Julian Brown <julian@codesourcery.com>
22309 Chung-Lin Tang <cltang@codesourcery.com>
22310 Thomas Schwinge <thomas@codesourcery.com>
22312 PR middle-end/90115
22313 * doc/tm.texi.in (TARGET_GOACC_EXPAND_VAR_DECL)
22314 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Add documentation hooks.
22315 * doc/tm.texi: Regenerate.
22316 * expr.c (expand_expr_real_1): Expand decls using the
22317 expand_var_decl OpenACC hook if defined.
22318 * internal-fn.c (expand_UNIQUE): Handle IFN_UNIQUE_OACC_PRIVATE.
22319 * internal-fn.h (IFN_UNIQUE_CODES): Add OACC_PRIVATE.
22320 * omp-low.c (omp_context): Add oacc_privatization_candidates
22322 (lower_oacc_reductions): Add PRIVATE_MARKER parameter. Insert
22324 (lower_oacc_head_tail): Add PRIVATE_MARKER parameter. Modify
22325 private marker's gimple call arguments, and pass it to
22326 lower_oacc_reductions.
22327 (oacc_privatization_scan_clause_chain)
22328 (oacc_privatization_scan_decl_chain, lower_oacc_private_marker):
22330 (lower_omp_for, lower_omp_target, lower_omp_1): Use these.
22331 * omp-offload.c (convert.h): Include.
22332 (oacc_loop_xform_head_tail): Treat private-variable markers like
22333 fork/join when transforming head/tail sequences.
22334 (struct var_decl_rewrite_info): Add struct.
22335 (oacc_rewrite_var_decl, is_sync_builtin_call): New functions.
22336 (execute_oacc_device_lower): Support rewriting gang-private
22337 variables using target hook, and fix up addr_expr and var_decl
22339 * target.def (adjust_private_decl, expand_var_decl): New hooks.
22340 * config/gcn/gcn-protos.h (gcn_goacc_adjust_gangprivate_decl):
22342 (gcn_goacc_adjust_private_decl): ...this.
22343 * config/gcn/gcn-tree.c (gcn_goacc_adjust_gangprivate_decl):
22345 (gcn_goacc_adjust_private_decl): ...this. Add LEVEL parameter.
22346 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_GANGPRIVATE_DECL): Rename
22347 definition using gcn_goacc_adjust_gangprivate_decl...
22348 (TARGET_GOACC_ADJUST_PRIVATE_DECL): ...to this, using
22349 gcn_goacc_adjust_private_decl.
22350 * config/nvptx/nvptx.c (tree-pretty-print.h): Include.
22351 (gang_private_shared_size): New global variable.
22352 (gang_private_shared_align): Likewise.
22353 (gang_private_shared_sym): Likewise.
22354 (gang_private_shared_hmap): Likewise.
22355 (nvptx_option_override): Initialize these.
22356 (nvptx_file_end): Output gang_private_shared_sym.
22357 (nvptx_goacc_adjust_private_decl, nvptx_goacc_expand_var_decl):
22359 (nvptx_set_current_function): Clear gang_private_shared_hmap.
22360 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Define hook.
22361 (TARGET_GOACC_EXPAND_VAR_DECL): Likewise.
22363 2021-05-21 H.J. Lu <hjl.tools@gmail.com>
22365 * config/i386/i386-modes.def (MAX_BITSIZE_MODE_ANY_INT): Removed.
22367 2021-05-21 Richard Biener <rguenther@suse.de>
22368 H.J. Lu <hjl.tools@gmail.com>
22370 PR middle-end/90773
22371 * expr.c (expand_constructor): Elide expand_constructor if
22372 move by pieces is preferred.
22374 2021-05-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
22376 * config/aarch64/aarch64-builtins.c (aarch64_call_properties):
22377 Take a flag and mode value as arguments.
22378 (aarch64_modifies_global_state_p): Likewise.
22379 (aarch64_reads_global_state_p): Likewise.
22380 (aarch64_could_trap_p): Likewise.
22381 (aarch64_get_attributes): Likewise.
22382 (aarch64_init_simd_builtins): Adjust callsite of above.
22383 (aarch64_init_fcmla_laneq_builtins): Use aarch64_get_attributes to get
22384 function attributes to apply to builtins.
22385 (aarch64_init_crc32_builtins): Likewise.
22386 (aarch64_init_builtin_rsqrt): Likewise.
22388 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
22390 * config/rs6000/rs6000.md (define_attr "type"): Add types for fusion.
22391 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10): Use new fusion types.
22392 (gen_2logical): Use new fusion types.
22393 * config/rs6000/fusion.md: Regenerate.
22395 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
22398 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
22399 Handle V4QI and V2HI modes.
22400 (ix86_expand_sse_movcc): Ditto.
22401 * config/i386/mmx.md (*<sat_plusminus:insn><VI_32:mode>3):
22402 New instruction pattern.
22403 (*eq<VI_32:mode>3): Ditto.
22404 (*gt<VI_32:mode>3): Ditto.
22405 (*xop_pcmov_<VI_32:mode>): Ditto.
22406 (mmx_pblendvb32): Ditto.
22407 (mmx_pblendvb64): Rename from mmx_pblendvb.
22408 (vec_cmp<VI_32:mode><VI_32:mode>): New expander.
22409 (vec_cmpu<VI_32:mode><VI_32:mode>): Ditto.
22410 (vcond<VI_32:mode><VI_32:mode>): Ditto.
22411 (vcondu<VI_32:mode><VI_32:mode>): Ditto.
22412 (vcond_mask_<VI_32:mode><VI_32:mode>): Ditto.
22414 2021-05-21 Jakub Jelinek <jakub@redhat.com>
22416 PR tree-optimization/94589
22417 * tree-ssa-phiopt.c (spaceship_replacement): For integral rhs1 and
22418 rhs2, treat x <= 4 equivalently to x < 5 etc. In cmp1 and cmp2 (if
22419 not the same as cmp3) treat <= the same as < and >= the same as >.
22420 Don't require that cond2_phi_edge is true edge, instead take
22421 false/true edges into account based on cmp1/cmp2 comparison kinds.
22423 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
22426 * config/i386/mmx.md (SMAXMIN_MMXMODEI): New mode iterator.
22427 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Macroize expander
22428 from <smaxmin:code>v4hi3> and <smaxmin:code><MMXMODE14:mode>3
22429 using SMAXMIN_MMXMODEI mode iterator.
22430 (*<smaxmin:code>v4qi3): New insn pattern.
22431 (*<smaxmin:code>v2hi3): Ditto.
22432 (SMAXMIN_VI_32): New mode iterator.
22433 (<smaxmin:code><SMAXMIN_VI_32>mode3): New expander.
22434 (UMAXMIN_MMXMODEI): New mode iterator.
22435 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Macroize expander
22436 from <umaxmin:code>v8qi3> and <umaxmin:code><MMXMODE24:mode>3
22437 using UMAXMIN_MMXMODEI mode iterator.
22438 (*<umaxmin:code>v4qi3): New insn pattern.
22439 (*<umaxmin:code>v2hi3): Ditto.
22440 (UMAXMIN_VI_32): New mode iterator.
22441 (<umaxmin:code><UMAXMIN_VI_32>mode3): New expander.
22442 (abs<VI_32:mode>2): New insn pattern.
22443 (ssse3_abs<MMXMODEI:mode>2, abs<MMXMODEI:mode>2): Move from ...
22444 * config/i386/sse.md: ... here.
22446 2021-05-20 Clement Chigot <clement.chigot@atos.net>
22447 David Edelsohn <dje.gcc@gmail.com>
22449 * collect2.c (scan_prog_file): Issue non-fatal warning for
22452 2021-05-20 Jonathan Wakely <jwakely@redhat.com>
22454 * doc/invoke.texi (-Wno-c++11-extensions)
22455 (-Wno-c++14-extensions, -Wno-c++17-extensions)
22456 (-Wno-c++20-extensions, -Wno-c++23-extensions): Document
22459 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
22461 * config/c6x/c6x.c (c6x_output_file_unwind): Use dwarf_debuginfo_p.
22462 * config/darwin.c (darwin_override_options): Likewise.
22463 * config/i386/cygming.h (DBX_REGISTER_NUMBER): Likewise.
22464 * config/i386/darwin.h (DBX_REGISTER_NUMBER): Likewise.
22465 (DWARF2_FRAME_REG_OUT): Likewise.
22466 * config/mips/mips.c (mips_output_filename): Likewise.
22467 * config/rs6000/rs6000.c (rs6000_xcoff_declare_function_name):
22469 (rs6000_dbx_register_number): Likewise.
22470 * dbxout.c: Include flags.h.
22471 * dwarf2cfi.c (cfi_label_required_p): Likewise.
22472 (dwarf2out_do_frame): Likewise.
22473 * except.c: Include flags.h.
22474 * final.c (dwarf2_debug_info_emitted_p): Likewise.
22475 (final_scan_insn_1): Likewise.
22476 * flags.h (dwarf_debuginfo_p): New function declaration.
22477 * opts.c (dwarf_debuginfo_p): New function definition.
22478 * targhooks.c (default_debug_unwind_info): Use dwarf_debuginfo_p.
22479 * toplev.c (process_options): Likewise.
22481 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
22483 * common.opt: Change type to support bitmasks.
22484 * flag-types.h (enum debug_info_type): Rename enumerator constants.
22485 (NO_DEBUG): New bitmask.
22486 (DBX_DEBUG): Likewise.
22487 (DWARF2_DEBUG): Likewise.
22488 (XCOFF_DEBUG): Likewise.
22489 (VMS_DEBUG): Likewise.
22490 (VMS_AND_DWARF2_DEBUG): Likewise.
22491 * flags.h (debug_set_to_format): New function declaration.
22492 (debug_set_count): Likewise.
22493 (debug_set_names): Likewise.
22494 * opts.c (debug_type_masks): Array of bitmasks for debug formats.
22495 (debug_set_to_format): New function definition.
22496 (debug_set_count): Likewise.
22497 (debug_set_names): Likewise.
22498 (set_debug_level): Update access to debug_type_names.
22499 * toplev.c: Likewise.
22501 2021-05-20 Martin Sebor <msebor@redhat.com>
22503 PR middle-end/100684
22504 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Handle C++ lambda.
22506 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
22509 * config/i386/i386.md (isa): Remove x64_bmi.
22510 (enabled): Remove x64_bmi.
22511 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
22512 Remove general register alternative.
22513 (*andnot<VI_32:mode>3): Ditto.
22514 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
22515 (*<any_logic:code><VI_32:mode>3): Ditto.
22517 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
22519 * config/arm/arm.c: Include head files tree-vectorizer.h and
22522 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
22525 * config/i386/mmx.md (Yv_Yw): Revert adding V4QI and V2HI modes.
22526 (*<plusminus:insn><VI_32:mode>3): Use Yw instad of <Yv_Yw> constrint.
22527 (<s>mulv4hi3_highpart): New expander.
22528 (*<s>mulv2hi3_highpart): New insn pattern.
22529 (<s>mulv2hi3_higpart): New expander.
22530 (*<any_shift:insn>v2hi3): New insn pattern.
22531 (<any_shift:insn>v2hi3): New expander.
22532 * config/i386/sse.md (smulhrsv2hi3): New expander.
22533 (*smulhrsv2hi3): New insn pattern.
22535 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
22537 * doc/invoke.texi (vect-inner-loop-cost-factor): Document new
22539 * params.opt (vect-inner-loop-cost-factor): New.
22540 * targhooks.c (default_add_stmt_cost): Replace hardcoded factor
22541 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR, include head file
22542 tree-vectorizer.h and its required ones.
22543 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Replace
22544 hardcoded factor 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR.
22545 * config/arm/arm.c (arm_add_stmt_cost): Likewise.
22546 * config/i386/i386.c (ix86_add_stmt_cost): Likewise.
22547 * config/rs6000/rs6000.c (rs6000_add_stmt_cost): Likewise.
22548 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
22550 (_loop_vec_info::_loop_vec_info): Init inner_loop_cost_factor.
22551 * tree-vectorizer.h (_loop_vec_info): Add inner_loop_cost_factor.
22552 (LOOP_VINFO_INNER_LOOP_COST_FACTOR): New macro.
22554 2021-05-20 Christophe Lyon <christophe.lyon@linaro.org>
22555 Torbjörn Svensson <torbjorn.svensson@st.com>
22558 * doc/cpp.texi (Common Predefined Macros): Document __FILE_NAME__.
22560 2021-05-20 Jakub Jelinek <jakub@redhat.com>
22562 PR middle-end/99928
22563 * gimplify.c (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LINEAR>: For
22564 explicit linear clause when combined with target, make it map(tofrom:)
22565 instead of no clause or firstprivate.
22567 2021-05-20 Jakub Jelinek <jakub@redhat.com>
22569 PR tree-optimization/94589
22570 * match.pd ((X & Y) == X -> (X & ~Y) == 0): Simplify even in presence
22571 of integral conversions.
22573 2021-05-19 Andrew MacLeod <amacleod@redhat.com>
22575 * gimple-range.cc (fur_source::get_operand): New.
22576 (gimple_range_fold): Delete.
22577 (fold_using_range::fold_stmt): Move from gimple_ranger::calc_stmt.
22578 (fold_using_range::range_of_range_op): Move from gimple_ranger.
22579 (fold_using_range::range_of_address): Ditto.
22580 (fold_using_range::range_of_phi): Ditto.
22581 (fold_using_range::range_of_call): Ditto.
22582 (fold_using_range::range_of_builtin_ubsan_call): Move from
22583 range_of_builtin_ubsan_call.
22584 (fold_using_range::range_of_builtin_call): Move from
22585 range_of_builtin_call.
22586 (gimple_ranger::range_of_builtin_call): Delete.
22587 (fold_using_range::range_of_cond_expr): Move from gimple_ranger.
22588 (gimple_ranger::fold_range_internal): New.
22589 (gimple_ranger::range_of_stmt): Use new fold_using_range API.
22590 (fold_using_range::range_of_ssa_name_with_loop_info): Move from
22591 gimple_ranger. Improve ranges of SSA_NAMES when possible.
22592 * gimple-range.h (gimple_ranger): Remove various range_of routines.
22593 (class fur_source): New.
22594 (class fold_using_range): New.
22595 (fur_source::fur_source): New.
22597 * vr-values.c (vr_values::extract_range_basic): Use fold_using_range
22598 instead of range_of_builtin_call.
22600 2021-05-19 Jonathan Wakely <jwakely@redhat.com>
22602 * doc/cpp.texi (Common Predefined Macros): Update documentation
22603 for the __GXX_EXPERIMENTAL_CXX0X__ macro.
22605 2021-05-19 Alex Coplan <alex.coplan@arm.com>
22608 * config/arm/arm.md (nonsecure_call_internal): Always ensure
22609 callee's address is in a register.
22611 2021-05-19 Geng Qi <gengqi@linux.alibaba.com>
22613 * common/config/riscv/riscv-common.c
22614 (riscv_subset_list::parsing_subset_version): Properly parse the letter
22616 (riscv_subset_list::parse_std_ext,
22617 riscv_subset_list::parse_multiletter_ext): To handle errors generated
22618 in riscv_subset_list::parsing_subset_version.
22620 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
22622 * config/aarch64/aarch64-simd.md: Use "neon_move_narrow_q"
22623 type attribute in patterns generating XTN(2).
22625 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
22627 * config/aarch64/aarch64-simd.md (aarch64_simd_vec_pack_trunc_<mode>):
22628 Remove as duplicate of...
22629 (aarch64_xtn<mode>): This.
22630 (aarch64_xtn2<mode>_le): Move position in file.
22631 (aarch64_xtn2<mode>_be): Move position in file.
22632 (aarch64_xtn2<mode>): Move position in file.
22633 (vec_pack_trunc_<mode>): Define as an expander.
22635 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
22637 * config/aarch64/aarch64-simd-builtins.def: Split builtin
22638 generation for aarch64_<sur>q<r>shr<u>n_n<mode> pattern into
22639 separate scalar and vector generators.
22640 * config/aarch64/aarch64-simd.md
22641 (aarch64_<sur>q<r>shr<u>n_n<mode>): Define as an expander and
22643 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_le): This and...
22644 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_be): This.
22645 * config/aarch64/iterators.md: Define SD_HSDI iterator.
22647 2021-05-19 Jonathn Wright <jonathan.wright@arm.com>
22649 * config/aarch64/aarch64-simd.md: Use UNSPEC_SQXTUN instead
22651 * config/aarch64/iterators.md: Remove UNSPEC_SQXTUN2.
22653 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
22655 * config/aarch64/aarch64-simd.md (aarch64_<sur>q<r>shr<u>n2_n<mode>):
22656 Implement as an expand emitting a big/little endian
22657 instruction pattern.
22658 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_le): Define.
22659 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_be): Define.
22661 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
22663 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn2<mode>):
22664 Implement as an expand emitting a big/little endian
22665 instruction pattern.
22666 (aarch64_<sur><addsub>hn2<mode>_insn_le): Define.
22667 (aarch64_<sur><addsub>hn2<mode>_insn_be): Define.
22668 * config/aarch64/iterators.md: Remove UNSPEC_[R]ADDHN2 and
22669 UNSPEC_[R]SUBHN2 unspecs and ADDSUBHN2 iterator.
22671 2021-05-19 Richard Biener <rguenther@suse.de>
22673 PR middle-end/100672
22674 * fold-const.c (fold_negate_expr_1): Use element_precision.
22675 (negate_expr_p): Likewise.
22677 2021-05-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
22679 * config/aarch64/iterators.md (SVE_PRED_LOAD): New iterator.
22680 (pred_load): New int attribute.
22681 * config/aarch64/aarch64-sve.md
22682 (aarch64_load_<ANY_EXTEND:optab><SVE_HSDI:mode><SVE_PARTIAL_I:mode>): Use
22683 SVE_PRED_LOAD enum iterator and corresponding pred_load attribute.
22684 * config/aarch64/aarch64-sve-builtins-base.cc (expand): Update call to
22685 code_for_aarch64_load.
22687 2021-05-19 Richard Biener <rguenther@suse.de>
22689 * cfgexpand.c (discover_nonconstant_array_refs_r): Make
22690 sure TARGET_MEM_REF bases are expanded as memory.
22691 * tree-ssa-operands.c (operands_scanner::get_tmr_operands):
22692 Do not mark TARGET_MEM_REF bases addressable.
22693 * tree-ssa.c (non_rewritable_mem_ref_base): Handle
22694 TARGET_MEM_REF bases as never rewritable.
22695 * gimple-walk.c (walk_stmt_load_store_addr_ops): Do not
22696 walk TARGET_MEM_REF bases as address-takens.
22697 * tree-ssa-dce.c (ref_may_be_aliased): Handle TARGET_MEM_REF.
22699 2021-05-19 Richard Biener <rguenther@suse.de>
22701 * builtins.c (get_object_alignment_1): Strip outer
22703 * tree-dfa.c (get_ref_base_and_extent): Handle outer
22704 WITH_SIZE_EXPR for size processing and process the
22706 * tree-ssa-alias.c (ao_ref_base_alias_set): Strip
22707 outer WITH_SIZE_EXPR.
22708 (ao_ref_base_alias_ptr_type): Likewise.
22709 (refs_may_alias_p_2): Allow WITH_SIZE_EXPR in ref->ref
22710 and handle that accordingly, stripping it for the
22711 core alias workers.
22712 * tree.c (get_base_address): Handle WITH_SIZE_EXPR by
22713 looking through it instead of returning NULL.
22715 2021-05-19 Jakub Jelinek <jakub@redhat.com>
22717 PR middle-end/100576
22718 * builtins.c (check_read_access): Convert bound to size_type_node if
22721 2021-05-19 Richard Biener <rguenther@suse.de>
22723 * tree-cfg.c (verify_types_in_gimple_min_lval): Inline...
22724 (verify_types_in_gimple_reference): ... here. Sanitize.
22725 (verify_gimple_call): Verify references in LHS and arguments.
22726 (verify_gimple_assign_single): Reject WITH_SIZE_EXPR.
22728 2021-05-19 Uroš Bizjak <ubizjak@gmail.com>
22730 * config/i386/i386.h (VALID_INT_MODE_P):
22731 Add V8QI, V4HI and V2SI modes for TARGET_64BIT.
22732 * config/i386/i386.md (isa): Add x64_bmi.
22733 (enabled): Handle x64_bmi.
22734 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
22735 Add alternative using 64bit general registers.
22736 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
22738 2021-05-19 Jakub Jelinek <jakub@redhat.com>
22740 PR middle-end/99928
22741 * tree.h (OMP_MASTER_COMBINED): Define.
22742 * gimplify.c (gimplify_scan_omp_clauses): Rewrite lastprivate
22743 handling for outer combined/composite constructs to a loop.
22744 Handle lastprivate on combined target.
22745 (gimplify_expr): Formatting fix.
22747 2021-05-19 Xionghu Luo <luoxhu@linux.ibm.com>
22749 * passes.def: Add sink_code pass before store_merging.
22750 * tree-ssa-sink.c (pass_sink_code:clone): New.
22752 2021-05-18 Bill Schmidt <wschmidt@linux.ibm.com>
22754 * config/rs6000/freebsd64.h (ADJUST_FIELD_ALIGN): Remove call to
22755 rs6000_special_adjust_field_align_p.
22756 * config/rs6000/linux64.h (ADJUST_FIELD_ALIGN): Likewise.
22757 * config/rs6000/rs6000-call.c (rs6000_function_arg_boundary):
22758 Remove ABI warning.
22759 (rs6000_function_arg): Likewise.
22760 * config/rs6000/rs6000-protos.h
22761 (rs6000_special_adjust_field_align_p): Remove prototype.
22762 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align_p):
22764 * config/rs6000/sysv4.h (ADJUST_FIELD_ALIGN): Remove call to
22765 rs6000_special_adjust_field_align_p.
22767 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
22770 * config/i386/i386.h (VALID_SSE2_REG_MODE):
22771 Add V4QI and V2HI modes.
22772 (VALID_INT_MODE_P): Ditto.
22773 * config/i386/mmx.md (VI_32): New mode iterator.
22774 (mmxvecsize): Handle V4QI and V2HI.
22776 (mov<VI_32:mode>): New expander.
22777 (*mov<mode>_internal): New insn pattern.
22778 (movmisalign<VI_32:mode>): New expander.
22779 (neg<VI_32:mode>): New expander.
22780 (<plusminus:insn><VI_32:mode>3): New expander.
22781 (*<plusminus:insn><VI_32:mode>3): New insn pattern.
22782 (mulv2hi3): New expander.
22783 (*mulv2hi3): New insn pattern.
22784 (one_cmpl<VI_32:mode>2): New expander.
22785 (*andnot<VI_32:mode>3): New insn pattern.
22786 (<any_logic:code><VI_32:mode>3): New expander.
22787 (*<any_logic:code><VI_32:mode>3): New insn pattern.
22789 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
22791 * config/i386/sse.md (<any_extend:insn>v4qiv4di2):
22792 Fix a mode mismatch with operand 1.
22794 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
22797 * config/i386/i386-expand.c (split_double_mode): Return
22798 temporary register when simplify_gen_subreg fails with
22799 the high half od the paradoxical subreg.
22801 2021-05-18 Richard Biener <rguenther@suse.de>
22803 * cfgexpand.c (expand_one_var): Pass in forced_stack_var
22804 and honor it when expanding.
22805 (expand_used_vars_for_block): Pass through forced_stack_var.
22806 (expand_used_vars): Likewise.
22807 (discover_nonconstant_array_refs_r): Set bits in
22808 forced_stack_vars instead of marking vars TREE_ADDRESSABLE.
22809 (avoid_type_punning_on_regs): Likewise.
22810 (discover_nonconstant_array_refs): Likewise.
22811 (pass_expand::execute): Create and pass down forced_stack_var
22812 bitmap. For parameters and returns temporarily set
22813 TREE_ADDRESSABLE when expand_function_start.
22815 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
22817 * doc/sourcebuild.texi: Document 'dg-note'.
22819 2021-05-18 Tobias Burnus <tobias@codesourcery.com>
22822 * configure: Regenerate.
22823 * configure.ac (BUILD_CFLAG, BUILD_CXXFLAGS): Add $(CFLAGS-$@).
22825 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
22827 * gimple.h (is_gimple_omp_oacc): Tighten.
22828 * omp-low.c (check_omp_nesting_restrictions): Adjust.
22830 2021-05-18 Richard Biener <rguenther@suse.de>
22832 * tree-ssa-operands.c (mark_address_taken): Simplify.
22834 2021-05-18 Martin Liska <mliska@suse.cz>
22836 * config/gcn/mkoffload.c (STR): Redefine.
22837 * config/i386/intelmic-mkoffload.c (STR): Likewise.
22838 * config/nvptx/mkoffload.c (STR): Likewise.
22840 2021-05-18 Martin Liska <mliska@suse.cz>
22842 * common/config/aarch64/aarch64-common.c (aarch64_parse_extension):
22843 Use startswith function instead of strncmp.
22844 * common/config/bfin/bfin-common.c (bfin_handle_option): Likewise.
22845 * common/config/riscv/riscv-common.c (riscv_subset_list::parse): Likewise.
22846 * config/aarch64/aarch64-sve-builtins-shapes.cc (parse_type): Likewise.
22847 * config/aarch64/aarch64.c (aarch64_process_one_target_attr): Likewise.
22848 * config/alpha/alpha.c (alpha_elf_section_type_flags): Likewise.
22849 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
22850 * config/arm/arm.c (arm_file_start): Likewise.
22851 (arm_valid_target_attribute_rec): Likewise.
22852 (thumb1_md_asm_adjust): Likewise.
22853 * config/arm/driver-arm.c (host_detect_local_cpu): Likewise.
22854 * config/avr/avr.c (STR_PREFIX_P): Likewise.
22855 (avr_set_current_function): Likewise.
22856 (avr_handle_addr_attribute): Likewise.
22857 (avr_asm_output_aligned_decl_common): Likewise.
22858 (avr_asm_named_section): Likewise.
22859 (avr_section_type_flags): Likewise.
22860 (avr_asm_select_section): Likewise.
22861 * config/c6x/c6x.c (c6x_in_small_data_p): Likewise.
22862 (c6x_section_type_flags): Likewise.
22863 * config/darwin-c.c (darwin_cfstring_ref_p): Likewise.
22864 (darwin_objc_declare_unresolved_class_reference): Likewise.
22865 (darwin_objc_declare_class_definition): Likewise.
22866 * config/darwin.c (indirect_data): Likewise.
22867 (darwin_encode_section_info): Likewise.
22868 (darwin_objc2_section): Likewise.
22869 (darwin_objc1_section): Likewise.
22870 (machopic_select_section): Likewise.
22871 (darwin_globalize_label): Likewise.
22872 (darwin_label_is_anonymous_local_objc_name): Likewise.
22873 (darwin_asm_named_section): Likewise.
22874 (darwin_asm_output_dwarf_offset): Likewise.
22875 * config/frv/frv.c (frv_string_begins_with): Likewise.
22876 (frv_in_small_data_p): Likewise.
22877 * config/gcn/mkoffload.c (STR): Likewise.
22879 * config/i386/i386-builtins.c (get_builtin_code_for_version): Likewise.
22880 * config/i386/i386-options.c (ix86_option_override_internal): Likewise.
22881 * config/i386/i386.c (x86_64_elf_section_type_flags): Likewise.
22882 (ix86_md_asm_adjust): Likewise.
22883 * config/i386/intelmic-mkoffload.c (STR): Likewise.
22884 * config/i386/winnt.c (i386_pe_asm_named_section): Likewise.
22885 (i386_pe_file_end): Likewise.
22886 * config/ia64/ia64.c (ia64_in_small_data_p): Likewise.
22887 (ia64_section_type_flags): Likewise.
22888 * config/mips/driver-native.c (host_detect_local_cpu): Likewise.
22889 * config/mips/mips.c (mips_handle_interrupt_attr): Likewise.
22890 (mips16_stub_function_p): Likewise.
22891 (mips_function_rodata_section): Likewise.
22892 * config/msp430/msp430.c (msp430_mcu_name): Likewise.
22893 (msp430_function_section): Likewise.
22894 (msp430_section_type_flags): Likewise.
22895 (msp430_expand_helper): Likewise.
22896 * config/nios2/nios2.c (nios2_small_section_name_p): Likewise.
22897 (nios2_valid_target_attribute_rec): Likewise.
22898 * config/nvptx/mkoffload.c (process): Likewise.
22900 * config/pa/som.h: Likewise.
22901 * config/pdp11/pdp11.c (pdp11_output_ident): Likewise.
22902 * config/riscv/riscv.c (riscv_elf_select_rtx_section): Likewise.
22903 * config/rs6000/rs6000.c (VTABLE_NAME_P): Likewise.
22904 (rs6000_inner_target_options): Likewise.
22905 * config/s390/driver-native.c (s390_host_detect_local_cpu): Likewise.
22906 * config/sparc/driver-sparc.c (host_detect_local_cpu): Likewise.
22907 * config/vax/vax.c (vax_output_int_move): Likewise.
22908 * config/vms/vms-ld.c (startswith): Likewise.
22909 (process_args): Likewise.
22911 * config/vms/vms.c: Likewise.
22913 2021-05-18 Jakub Jelinek <jakub@redhat.com>
22915 PR rtl-optimization/100590
22916 * regcprop.c (copyprop_hardreg_forward_1): Only DCE dead sets if
22917 they are NONJUMP_INSN_P.
22919 2021-05-18 Jakub Jelinek <jakub@redhat.com>
22922 * function.c (push_dummy_function): Set DECL_ARTIFICIAL and
22923 DECL_ASSEMBLER_NAME on the fn_decl.
22925 2021-05-18 Jakub Jelinek <jakub@redhat.com>
22927 PR tree-optimization/94589
22928 * tree-ssa-phiopt.c (spaceship_replacement): Pattern match
22929 phi result used in (res & ~1) == 0 comparison as res >= 0 as
22930 res == 2 would be UB with -ffinite-math-only.
22932 2021-05-18 Martin Liska <mliska@suse.cz>
22934 * Makefile.in: genversion.o should depend on DATESTAMP.
22936 2021-05-18 Claudiu Zissulescu <claziss@synopsys.com>
22938 * config/arc/simdext.md (negv2si2): Remove round bracket.
22940 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
22942 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Define
22943 _Bool as macro expanding to _Bool.
22945 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
22948 * tree.c (build_reference_type_for_mode)
22949 (build_pointer_type_for_mode): Pick pointer mode if MODE argument
22951 (build_reference_type, build_pointer_type): Invoke
22952 build_*_type_for_mode with VOIDmode.
22954 2021-05-17 Andrew MacLeod <amacleod@redhat.com>
22956 PR tree-optimization/100512
22957 * gimple-range-cache.cc (ranger_cache::set_global_range): Mark const
22958 and non-zero pointer ranges as invariant.
22959 * gimple-range.cc (gimple_ranger::range_of_stmt): Remove pointer
22960 processing from here.
22962 2021-05-17 Tom de Vries <tdevries@suse.de>
22965 * config/nvptx/nvptx-protos.h (nvptx_output_atomic_insn): Declare
22966 * config/nvptx/nvptx.c (nvptx_output_barrier)
22967 (nvptx_output_atomic_insn): New function.
22968 (nvptx_print_operand): Add support for 'B'.
22969 * config/nvptx/nvptx.md: Use nvptx_output_atomic_insn for atomic
22972 2021-05-17 Aldy Hernandez <aldyh@redhat.com>
22974 PR tree-optimization/100349
22975 * vr-values.c (bounds_of_var_in_loop): Bail if scev returns
22978 2021-05-17 Tamar Christina <tamar.christina@arm.com>
22980 * config/aarch64/driver-aarch64.c (DEFAULT_ARCH): New.
22981 (host_detect_local_cpu): Use it.
22983 2021-05-17 Martin Liska <mliska@suse.cz>
22985 * doc/invoke.texi: Add 2 missing dots.
22987 2021-05-17 Marius Hillenbrand <mhillen@linux.ibm.com>
22989 PR bootstrap/100552
22990 * configure.ac: Replace pattern substitution with call to sed.
22991 * configure: Regenerate.
22993 2021-05-17 Richard Biener <rguenther@suse.de>
22995 PR middle-end/100582
22996 * tree.c (array_at_struct_end_p): Get to the base of the
22997 reference before looking for the underlying decl.
22999 2021-05-17 Joern Rennecke <joern.rennecke@embecosm.com>
23001 * genoutput.c (validate_insn_alternatives) Make "wrong number of
23002 alternatives" message more specific, and remove assumption on where
23005 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
23007 * config/arm/iterators.md (V16): New iterator.
23008 (VH_cvtto): New iterator.
23009 (v_cmp_result): Added V4HF and V8HF support.
23010 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result>): Use VDQWH.
23011 (vcond<mode><mode>): Likewise.
23012 (vcond_mask_<mode><v_cmp_result>): Likewise.
23013 (vcond<VH_cvtto><mode>): New expander.
23015 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
23017 * config/arm/arm-protos.h (arm_expand_vector_compare): Update
23019 * config/arm/arm.c (arm_expand_vector_compare): Add support for
23021 (arm_expand_vcond): Likewise.
23022 * config/arm/iterators.md (supf): Remove VCMPNEQ_S, VCMPEQQ_S,
23023 VCMPEQQ_N_S, VCMPNEQ_N_S.
23024 (VCMPNEQ, VCMPEQQ, VCMPEQQ_N, VCMPNEQ_N): Remove.
23025 * config/arm/mve.md (@mve_vcmp<mve_cmp_op>q_<mode>): Add '@' prefix.
23026 (@mve_vcmp<mve_cmp_op>q_f<mode>): Likewise.
23027 (@mve_vcmp<mve_cmp_op>q_n_f<mode>): Likewise.
23028 (@mve_vpselq_<supf><mode>): Likewise.
23029 (@mve_vpselq_f<mode>"): Likewise.
23030 * config/arm/neon.md (vec_cmp<mode><v_cmp_result): Enable for MVE
23031 and move to vec-common.md.
23032 (vec_cmpu<mode><mode>): Likewise.
23033 (vcond<mode><mode>): Likewise.
23034 (vcond<V_cvtto><mode>): Likewise.
23035 (vcondu<mode><v_cmp_result>): Likewise.
23036 (vcond_mask_<mode><v_cmp_result>): Likewise.
23037 * config/arm/unspecs.md (VCMPNEQ_U, VCMPNEQ_S, VCMPEQQ_S)
23038 (VCMPEQQ_N_S, VCMPNEQ_N_S, VCMPEQQ_U, CMPEQQ_N_U, VCMPNEQ_N_U)
23039 (VCMPGEQ_N_S, VCMPGEQ_S, VCMPGTQ_N_S, VCMPGTQ_S, VCMPLEQ_N_S)
23040 (VCMPLEQ_S, VCMPLTQ_N_S, VCMPLTQ_S, VCMPCSQ_N_U, VCMPCSQ_U)
23041 (VCMPHIQ_N_U, VCMPHIQ_U): Remove.
23042 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result): Moved
23044 (vec_cmpu<mode><mode>): Likewise.
23045 (vcond<mode><mode>): Likewise.
23046 (vcond<V_cvtto><mode>): Likewise.
23047 (vcondu<mode><v_cmp_result>): Likewise.
23048 (vcond_mask_<mode><v_cmp_result>): Likewise. Added unsafe math
23051 2021-05-17 liuhongt <hongtao.liu@intel.com>
23054 * config/i386/i386.c (ix86_gimple_fold_builtin): Use
23055 gsi_insert_seq_before instead.
23057 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
23059 * doc/sourcebuild.texi (arm_qbit_ok): Rename into...
23060 (arm_sat_ok): ...this.
23062 2021-05-17 Martin Liska <mliska@suse.cz>
23064 * lto-wrapper.c (merge_flto_options): Factor out a new function.
23065 (merge_and_complain): Use it.
23066 (run_gcc): Merge also linker command line -flto=foo argument
23069 2021-05-16 Christophe Lyon <christophe.lyon@linaro.org>
23071 * config/arm/arm.h (CPP_SPEC): Remove error message about
23072 -mlittle-endian/-mbig-endian conflict.
23074 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
23076 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
23077 __ROP_PROTECT__ if -mrop-protect is selected.
23079 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
23081 * config/rs6000/rs6000-internal.h (rs6000_stack): Add
23082 rop_hash_save_offset and rop_hash_size.
23083 * config/rs6000/rs6000-logue.c (rs6000_stack_info): Compute
23084 rop_hash_size and rop_hash_save_offset.
23085 (debug_stack_info): Dump rop_hash_save_offset and rop_hash_size.
23086 (rs6000_emit_prologue): Emit hashst[p] in prologue.
23087 (rs6000_emit_epilogue): Emit hashchk[p] in epilogue.
23088 * config/rs6000/rs6000.md (unspec): Add UNSPEC_HASHST and
23090 (hashst): New define_insn.
23091 (hashchk): Likewise.
23093 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
23095 * config/rs6000/rs6000.c (rs6000_option_override_internal):
23096 Disable shrink wrap when inserting ROP-protect instructions.
23097 * config/rs6000/rs6000.opt (mrop-protect): New option.
23098 (mprivileged): Likewise.
23099 * doc/invoke.texi: Document mrop-protect and mprivileged.
23101 2021-05-15 Hans-Peter Nilsson <hp@axis.com>
23103 * reorg.c (fill_slots_from_thread): Reinstate code typoed out in
23106 2021-05-15 Martin Jambor <mjambor@suse.cz>
23109 2021-05-13 Martin Jambor <mjambor@suse.cz>
23111 PR tree-optimization/100453
23112 * tree-sra.c (sra_modify_assign): All const base accesses do not
23113 need refreshing, not just those from decl_pool.
23114 (sra_modify_assign): Do not refresh into a const base decl.
23116 2021-05-15 Jakub Jelinek <jakub@redhat.com>
23118 PR rtl-optimization/100342
23119 * regcprop.c (copy_value): When copying a source reg in a wider
23120 mode than it has recorded for the value, adjust recorded destination
23121 mode too or punt if !REG_CAN_CHANGE_MODE_P.
23123 2021-05-14 Jason Merrill <jason@redhat.com>
23125 * intl.h: Add comments.
23127 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23129 * config/aarch64/aarch64-simd.md
23130 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>_internal): Split into...
23131 (aarch64_sqdmlsl2_lane<mode>_internal): ... This...
23132 (aarch64_sqdmlal2_lane<mode>_internal): ... And this.
23133 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>_internal): Split into ...
23134 (aarch64_sqdmlsl2_laneq<mode>_internal): ... This...
23135 (aarch64_sqdmlal2_laneq<mode>_internal): ... And this.
23136 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>_internal): Split into...
23137 (aarch64_sqdmlsl2_n<mode>_internal): ... This...
23138 (aarch64_sqdmlal2_n<mode>_internal): ... And this.
23140 2021-05-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
23143 * config/arm/arm_neon.h (vtst_s8): Replace call to vtst builtin with it's
23144 boolean logic equivalent.
23145 (vtst_s16): Likewise.
23146 (vtst_s32): Likewise.
23147 (vtst_u8): Likewise.
23148 (vtst_u16): Likewise.
23149 (vtst_u32): Likewise.
23150 (vtst_p8): Likewise.
23151 (vtst_p16): Likewise.
23152 (vtstq_s8): Likewise.
23153 (vtstq_s16): Likewise.
23154 (vtstq_s32): Likewise.
23155 (vtstq_u8): Likewise.
23156 (vtstq_u16): Likewise.
23157 (vtstq_u32): Likewise.
23158 (vtstq_p8): Likewise.
23159 (vtstq_p16): Likewise.
23160 * config/arm/arm_neon_builtins.def: Remove entry for vtst.
23161 * config/arm/neon.md (neon_vtst<mode>): Remove pattern.
23163 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23165 * config/aarch64/aarch64-simd.md (aarch64_sqdmlal2<mode>): Merge into...
23166 (aarch64_sqdml<SBINQOPS:as>l2<mode>): ... This.
23167 (aarch64_sqdmlsl2<mode>): Delete.
23168 (aarch64_sqdmlal2_lane<mode>): Merge this...
23169 (aarch64_sqdmlsl2_lane<mode>): ... And this...
23170 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>): ... Into this.
23171 (aarch64_sqdmlal2_laneq<mode>): Merge this...
23172 (aarch64_sqdmlsl2_laneq<mode>): ... And this...
23173 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>): ... Into this.
23174 (aarch64_sqdmlal2_n<mode>): Merge this...
23175 (aarch64_sqdmlsl2_n<mode>): ... And this...
23176 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>): ... Into this.
23178 2021-05-13 Martin Sebor <msebor@redhat.com>
23180 PR middle-end/100574
23181 * builtins.c (access_ref::get_ref): Improve detection of PHIs with
23182 all null arguments.
23184 2021-05-13 Martin Sebor <msebor@redhat.com>
23186 PR tree-optimization/93100
23187 PR middle-end/98583
23188 * tree-ssa-uninit.c (check_defs): Exclude intrinsic functions that
23189 don't modify referenced objects.
23191 2021-05-13 Martin Jambor <mjambor@suse.cz>
23193 PR tree-optimization/100453
23194 * tree-sra.c (sra_modify_assign): All const base accesses do not
23195 need refreshing, not just those from decl_pool.
23196 (sra_modify_assign): Do not refresh into a const base decl.
23198 2021-05-13 Martin Liska <mliska@suse.cz>
23200 * tree-ssa-dom.c: Remove m_simplifier.
23202 2021-05-13 Richard Earnshaw <rearnsha@arm.com>
23205 * config/arm/arm.c (arm_canonicalize_comparison): Correctly
23206 canonicalize DImode inequality comparisons against the
23207 maximum integral value.
23209 2021-05-13 Jakub Jelinek <jakub@redhat.com>
23211 PR tree-optimization/98856
23212 * config/i386/i386.c (ix86_shift_rotate_cost): Add CODE argument.
23213 Expect V2DI and V4DI arithmetic right shifts to be emulated.
23214 (ix86_rtx_costs, ix86_add_stmt_cost): Adjust ix86_shift_rotate_cost
23216 * config/i386/i386-expand.c (expand_vec_perm_2perm_interleave,
23217 expand_vec_perm_2perm_pblendv): New functions.
23218 (ix86_expand_vec_perm_const_1): Use them.
23219 * config/i386/sse.md (ashr<mode>3<mask_name>): Rename to ...
23220 (<mask_codefor>ashr<mode>3<mask_name>): ... this.
23221 (ashr<mode>3): New define_expand with VI248_AVX512BW iterator.
23222 (ashrv4di3): New define_expand.
23223 (ashrv2di3): Change condition to TARGET_SSE2, handle !TARGET_XOP
23224 and !TARGET_AVX512VL expansion.
23226 2021-05-13 Uroš Bizjak <ubizjak@gmail.com>
23229 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Force mode
23230 sizes < 16 to a register when constructing vpcmov pattern.
23231 * config/i386/mmx.md (*xop_pcmov_<mode>): Use MMXMODE124 mode.
23233 2021-05-13 Martin Liska <mliska@suse.cz>
23235 * gcov-io.c (gcov_write_block): Remove.
23236 (gcov_write_words): Likewise.
23237 (gcov_read_words): Re-implement using gcov_read_bytes.
23238 (gcov_allocate): Remove.
23239 (GCOV_BLOCK_SIZE): Likewise.
23240 (struct gcov_var): Remove most of the fields.
23241 (gcov_position): Implement with ftell.
23242 (gcov_rewrite): Remove setting of start and offset fields.
23243 (from_file): Re-format.
23244 (gcov_open): Remove setbuf call. It should not be needed.
23245 (gcov_close): Remove internal buffer handling.
23246 (gcov_magic): Use __builtin_bswap32.
23247 (gcov_write_counter): Use directly gcov_write_unsigned.
23248 (gcov_write_string): Use direct fwrite and do not round
23250 (gcov_seek): Use directly fseek.
23251 (gcov_write_tag): Use gcov_write_unsigned directly.
23252 (gcov_write_length): Likewise.
23253 (gcov_write_tag_length): Likewise.
23254 (gcov_read_bytes): Use directly fread.
23255 (gcov_read_unsigned): Use gcov_read_words.
23256 (gcov_read_counter): Likewise.
23257 (gcov_read_string): Use gcov_read_bytes.
23258 * gcov-io.h (GCOV_WORD_SIZE): Adjust to reflect
23259 that size is not in bytes, but words (4B).
23260 (GCOV_TAG_FUNCTION_LENGTH): Likewise.
23261 (GCOV_TAG_ARCS_LENGTH): Likewise.
23262 (GCOV_TAG_ARCS_NUM): Likewise.
23263 (GCOV_TAG_COUNTER_LENGTH): Likewise.
23264 (GCOV_TAG_COUNTER_NUM): Likewise.
23265 (GCOV_TAG_SUMMARY_LENGTH): Likewise.
23267 2021-05-13 liuhongt <hongtao.liu@intel.com>
23270 * config/i386/sse.md (ssedoublevecmode): Add attribute for
23271 V64QI/V32HI/V16SI/V4DI.
23272 (ssehalfvecmode): Add attribute for V2DI/V2DF.
23273 (*vec_concatv4si_0): Extend to VI124_128.
23274 (*vec_concat<mode>_0): New pre-reload splitter.
23275 * config/i386/predicates.md (movq_parallel): New predicate.
23277 2021-05-13 Alexandre Oliva <oliva@adacore.com>
23279 * targhooks.c (default_zero_call_used_regs): Retry using
23280 successfully-zeroed registers as sources.
23282 2021-05-12 Tobias Burnus <tobias@codesourcery.com>
23284 * omp-low.c (finish_taskreg_scan): Use the proper detach decl.
23286 2021-05-12 Aldy Hernandez <aldyh@redhat.com>
23289 * gimple-range.cc (range_of_builtin_call): Skip out on
23290 processing __builtin_clz when varying.
23292 2021-05-12 Tom de Vries <tdevries@suse.de>
23295 * config/nvptx/nvptx-opts.h (enum ptx_version): New enum.
23296 * config/nvptx/nvptx.c (nvptx_file_start): Print .version according
23297 to ptx_version_option.
23298 * config/nvptx/nvptx.h (TARGET_PTX_6_3): Define.
23299 * config/nvptx/nvptx.md (define_insn "nvptx_shuffle<mode>")
23300 (define_insn "nvptx_vote_ballot"): Use sync variant for
23302 * config/nvptx/nvptx.opt (ptx_version): Add enum.
23303 (mptx): Add option.
23304 * doc/invoke.texi (Nvidia PTX Options): Add mptx item.
23306 2021-05-12 Richard Biener <rguenther@suse.de>
23308 PR tree-optimization/100566
23309 * tree-ssa-sccvn.c (dominated_by_p_w_unex): Properly handle
23310 allow_back for all edge queries.
23312 2021-05-12 liuhongt <hongtao.liu@intel.com>
23315 * config/i386/sse.md (<sse4_1_avx2>_pblendvb): Add
23316 splitters for pblendvb of NOT mask register.
23318 2021-05-12 Richard Biener <rguenther@suse.de>
23320 PR tree-optimization/100519
23321 * tree-ssa-reassoc.c (can_associate_p): Split into...
23322 (can_associate_op_p): ... this
23323 (can_associate_type_p): ... and this.
23324 (is_reassociable_op): Call can_associate_op_p.
23325 (break_up_subtract_bb): Call the appropriate predicates.
23326 (reassociate_bb): Likewise.
23328 2021-05-12 Martin Liska <mliska@suse.cz>
23330 * lto-wrapper.c (merge_and_complain): Merge -flto=arg options.
23331 (run_gcc): Use -flto argument detection for merged
23334 2021-05-12 Martin Liska <mliska@suse.cz>
23336 * lto-wrapper.c (print_lto_docs_link): New function.
23337 (run_gcc): Print warning about missing job server detection
23338 after we know NR of partitions. Do the same for -flto{,=1}.
23339 * opts.c (get_option_html_page): Support -flto option.
23341 2021-05-12 Martin Liska <mliska@suse.cz>
23343 * lto-wrapper.c (get_options_from_collect_gcc_options): Change
23345 (append_option): Remove.
23346 (find_option): Rework to use the vector type.
23347 (remove_option): Remove.
23348 (merge_and_complain): Use vectors for cl_decoded_option data
23350 (append_compiler_options): Likewise.
23351 (append_diag_options): Likewise.
23352 (append_linker_options): Likewise.
23353 (append_offload_options): Likewise.
23354 (compile_offload_image): Likewise.
23355 (compile_images_for_offload_targets): Likewise.
23356 (find_and_merge_options): Likewise.
23357 (run_gcc): Likewise.
23359 2021-05-12 Bernd Edlinger <bernd.edlinger@hotmail.de>
23362 * dwarf2out.c (dwarf2out_finish): Set
23363 have_multiple_function_sections with multi-range text_section.
23365 2021-05-12 Martin Liska <mliska@suse.cz>
23367 PR bootstrap/100560
23368 * Makefile.in: Remove version.h from linker command line.
23370 2021-05-12 Richard Biener <rguenther@suse.de>
23372 PR middle-end/100547
23373 * rtl.h (rtvec_alloc): Make argument size_t.
23374 * rtl.c (rtvec_alloc): Verify the count is less than INT_MAX.
23376 2021-05-12 Jakub Jelinek <jakub@redhat.com>
23378 PR middle-end/100508
23379 * cfgexpand.c (expand_debug_expr): For DEBUG_EXPR_DECL with vector
23380 type, don't reuse DECL_RTL if it has different mode, instead force
23381 creation of a new DEBUG_EXPR.
23383 2021-05-12 Jakub Jelinek <jakub@redhat.com>
23384 Marc Glisse <marc.glisse@inria.fr>
23386 PR tree-optimization/94589
23387 * match.pd ((X & Y) == X -> (X & ~Y) == 0,
23388 (X | Y) == Y -> (X & ~Y) == 0): New GIMPLE simplifications.
23390 2021-05-12 Uroš Bizjak <ubizjak@gmail.com>
23393 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Handle V2SF mode.
23394 * config/i386/mmx.md (MMXMODE124): New mode iterator.
23396 (mmxintvecmode): New mode attribute.
23397 (mmxintvecmodelower): Ditto.
23398 (*mmx_maskcmpv2sf3_comm): New insn pattern.
23399 (*mmx_maskcmpv2sf3): Ditto.
23400 (vec_cmpv2sfv2si): New expander.
23401 (vcond<V2FI:mode>v2si): Ditto.
23402 (mmx_vlendvps): New insn pattern.
23403 (vcond<MMXMODE124:mode><MMXMODEI:mode>): Also handle V2SFmode.
23404 (vcondu<MMXMODE124:mode><MMXMODEI:mode>): Ditto.
23405 (vcond_mask_<mode><mmxintvecmodelower>): Ditto.
23407 2021-05-11 Martin Sebor <msebor@redhat.com>
23409 PR middle-end/21433
23410 * expr.c (expand_expr_real_1): Replace unreachable code with an assert.
23412 2021-05-11 Richard Biener <rguenther@suse.de>
23414 * gimple-fold.c (gimple_fold_call): Do not call
23415 maybe_fold_reference on call arguments or the static chain.
23416 (fold_stmt_1): Do not call maybe_fold_reference on GIMPLE_ASM
23419 2021-05-11 Martin Liska <mliska@suse.cz>
23421 * builtins.def (DEF_HSAIL_BUILTIN): Remove.
23422 (DEF_HSAIL_ATOMIC_BUILTIN): Likewise.
23423 (DEF_HSAIL_SAT_BUILTIN): Likewise.
23424 (DEF_HSAIL_INTR_BUILTIN): Likewise.
23425 (DEF_HSAIL_CVT_ZEROI_SAT_BUILTIN): Likewise.
23426 * doc/frontends.texi: Remove BRIG.
23427 * doc/install.texi: Likewise.
23428 * doc/invoke.texi: Likewise.
23429 * doc/standards.texi: Likewise.
23430 * brig-builtins.def: Removed.
23431 * brig/ChangeLog: Removed.
23432 * brig/Make-lang.in: Removed.
23433 * brig/brig-builtins.h: Removed.
23434 * brig/brig-c.h: Removed.
23435 * brig/brig-lang.c: Removed.
23436 * brig/brigfrontend/brig-arg-block-handler.cc: Removed.
23437 * brig/brigfrontend/brig-atomic-inst-handler.cc: Removed.
23438 * brig/brigfrontend/brig-basic-inst-handler.cc: Removed.
23439 * brig/brigfrontend/brig-branch-inst-handler.cc: Removed.
23440 * brig/brigfrontend/brig-cmp-inst-handler.cc: Removed.
23441 * brig/brigfrontend/brig-code-entry-handler.cc: Removed.
23442 * brig/brigfrontend/brig-code-entry-handler.h: Removed.
23443 * brig/brigfrontend/brig-comment-handler.cc: Removed.
23444 * brig/brigfrontend/brig-control-handler.cc: Removed.
23445 * brig/brigfrontend/brig-copy-move-inst-handler.cc: Removed.
23446 * brig/brigfrontend/brig-cvt-inst-handler.cc: Removed.
23447 * brig/brigfrontend/brig-fbarrier-handler.cc: Removed.
23448 * brig/brigfrontend/brig-function-handler.cc: Removed.
23449 * brig/brigfrontend/brig-function.cc: Removed.
23450 * brig/brigfrontend/brig-function.h: Removed.
23451 * brig/brigfrontend/brig-inst-mod-handler.cc: Removed.
23452 * brig/brigfrontend/brig-label-handler.cc: Removed.
23453 * brig/brigfrontend/brig-lane-inst-handler.cc: Removed.
23454 * brig/brigfrontend/brig-machine.c: Removed.
23455 * brig/brigfrontend/brig-machine.h: Removed.
23456 * brig/brigfrontend/brig-mem-inst-handler.cc: Removed.
23457 * brig/brigfrontend/brig-module-handler.cc: Removed.
23458 * brig/brigfrontend/brig-queue-inst-handler.cc: Removed.
23459 * brig/brigfrontend/brig-seg-inst-handler.cc: Removed.
23460 * brig/brigfrontend/brig-signal-inst-handler.cc: Removed.
23461 * brig/brigfrontend/brig-to-generic.cc: Removed.
23462 * brig/brigfrontend/brig-to-generic.h: Removed.
23463 * brig/brigfrontend/brig-util.cc: Removed.
23464 * brig/brigfrontend/brig-util.h: Removed.
23465 * brig/brigfrontend/brig-variable-handler.cc: Removed.
23466 * brig/brigfrontend/hsa-brig-format.h: Removed.
23467 * brig/brigfrontend/phsa.h: Removed.
23468 * brig/brigspec.c: Removed.
23469 * brig/config-lang.in: Removed.
23470 * brig/gccbrig.texi: Removed.
23471 * brig/lang-specs.h: Removed.
23472 * brig/lang.opt: Removed.
23474 2021-05-11 Richard Biener <rguenther@suse.de>
23477 * ipa-param-manipulation.c
23478 (ipa_param_body_adjustments::modify_call_stmt): Avoid
23479 altering SSA_NAME_DEF_STMT by adjusting the calls LHS
23480 via gimple_call_lhs_ptr.
23482 2021-05-11 Alex Coplan <alex.coplan@arm.com>
23485 * config/arm/arm.c (cmse_nonsecure_call_inline_register_clear):
23486 Avoid emitting CFA adjusts on the sp if we have the fp.
23488 2021-05-11 Richard Sandiford <richard.sandiford@arm.com>
23490 * config/aarch64/iterators.md (VMUL_CHANGE_NLANES): Delete.
23491 (VMULD): New iterator.
23492 (VCOND): Handle V4HF and V8HF.
23493 (VCONQ): Fix entry for V2SF.
23494 * config/aarch64/aarch64-simd.md (mul_lane<mode>3): Use VMULD
23495 instead of VMUL. Use a 64-bit vector mode for the indexed operand.
23496 (*aarch64_mul3_elt_<vswap_width_name><mode>): Merge with...
23497 (mul_laneq<mode>3): ...this define_insn. Use VMUL instead of VDQSF.
23498 Use a 128-bit vector mode for the indexed operand. Use stype for
23499 the scheduling type.
23501 2021-05-11 Richard Biener <rguenther@suse.de>
23503 * gimple-fold.c (maybe_fold_reference): Only return
23504 is_gimple_min_invariant values.
23506 2021-05-11 Richard Biener <rguenther@suse.de>
23508 PR middle-end/100509
23509 * gimple-fold.c (fold_gimple_assign): Only call
23510 get_symbol_constant_value on register type symbols.
23512 2021-05-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
23513 Joe Ramsay <joe.ramsay@arm.com>
23516 * config/arm/arm_mve.h (__arm_vstrwq_scatter_offset): Fix wrong arguments.
23517 (__arm_vcmpneq): Remove duplicate definition.
23518 (__arm_vstrwq_scatter_offset_p): Likewise.
23519 (__arm_vmaxq_x): Likewise.
23520 (__arm_vmlsdavaq): Likewise.
23521 (__arm_vmlsdavaxq): Likewise.
23522 (__arm_vmlsdavq_p): Likewise.
23523 (__arm_vmlsdavxq_p): Likewise.
23524 (__arm_vrmlaldavhaq): Likewise.
23525 (__arm_vstrbq_p): Likewise.
23526 (__arm_vstrbq_scatter_offset): Likewise.
23527 (__arm_vstrbq_scatter_offset_p): Likewise.
23528 (__arm_vstrdq_scatter_offset): Likewise.
23529 (__arm_vstrdq_scatter_offset_p): Likewise.
23530 (__arm_vstrdq_scatter_shifted_offset): Likewise.
23531 (__arm_vstrdq_scatter_shifted_offset_p): Likewise.
23533 2021-05-11 Jakub Jelinek <jakub@redhat.com>
23535 PR middle-end/100471
23536 * omp-low.c (lower_omp_task_reductions): For OMP_TASKLOOP, if data
23537 is 0, bypass the reduction loop including
23538 GOMP_taskgroup_reduction_unregister call.
23540 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
23542 * config/rs6000/rs6000.c (struct rs6000_cost_data): New member
23543 costing_for_scalar.
23544 (rs6000_density_test): Early return if costing_for_scalar is true.
23545 (rs6000_init_cost): Init costing_for_scalar of rs6000_cost_data.
23547 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
23549 * doc/tm.texi: Regenerated.
23550 * target.def (init_cost): Add new parameter costing_for_scalar.
23551 * targhooks.c (default_init_cost): Adjust for new parameter.
23552 * targhooks.h (default_init_cost): Likewise.
23553 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Likewise.
23554 (vect_compute_single_scalar_iteration_cost): Likewise.
23555 (vect_analyze_loop_2): Likewise.
23556 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Likewise.
23557 (vect_bb_vectorization_profitable_p): Likewise.
23558 * tree-vectorizer.h (init_cost): Likewise.
23559 * config/aarch64/aarch64.c (aarch64_init_cost): Likewise.
23560 * config/i386/i386.c (ix86_init_cost): Likewise.
23561 * config/rs6000/rs6000.c (rs6000_init_cost): Likewise.
23563 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
23565 * config/rs6000/rs6000.c (rs6000_vect_nonmem): Renamed to
23566 vect_nonmem and moved into...
23567 (struct rs6000_cost_data): ...here.
23568 (rs6000_init_cost): Use vect_nonmem of cost_data instead.
23569 (rs6000_add_stmt_cost): Likewise.
23570 (rs6000_finish_cost): Likewise.
23572 2021-05-10 Eric Botcazou <ebotcazou@adacore.com>
23574 * range-op.cc (get_bool_state): Adjust head comment.
23575 (operator_not_equal::op1_range): Fix comment.
23576 (operator_bitwise_xor::op1_range): Remove call to gcc_unreachable.
23578 2021-05-10 Martin Sebor <msebor@redhat.com>
23580 PR middle-end/100425
23581 PR middle-end/100510
23582 * gimple-ssa-warn-alloca.c (pass_walloca::firast_time_p): Rename...
23583 (pass_walloca::xlimit_certain_p): ...to this.
23584 (pass_walloca::gate): Execute for any kind of handled warning.
23585 (pass_walloca::execute): Avoid issuing "maybe" and "unbounded"
23586 warnings when xlimit_certain_p is set.
23588 2021-05-10 Pat Haugen <pthaugen@linux.ibm.com>
23590 * config/rs6000/rs6000.c (rs6000_ira_change_pseudo_allocno_class):
23591 Return ALTIVEC_REGS if that is best_class.
23592 (rs6000_compute_pressure_classes): Add ALTIVEC_REGS.
23594 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
23596 * config/arm/arm.h (CPP_SPEC): Remove error message about
23599 2021-05-10 Martin Jambor <mjambor@suse.cz>
23601 * ipa-prop.h (IPA_NODE_REF): Removed.
23602 (IPA_NODE_REF_GET_CREATE): Likewise.
23603 (IPA_EDGE_REF): Likewise.
23604 (IPA_EDGE_REF_GET_CREATE): Likewise.
23605 (IS_VALID_JUMP_FUNC_INDEX): Likewise.
23606 * ipa-cp.c (print_all_lattices): Replaced IPA_NODE_REF with a direct
23607 use of ipa_node_params_sum.
23608 (ipcp_versionable_function_p): Likewise.
23609 (push_node_to_stack): Likewise.
23610 (pop_node_from_stack): Likewise.
23611 (set_single_call_flag): Replaced two IPA_NODE_REF with one single
23612 direct use of ipa_node_params_sum.
23613 (initialize_node_lattices): Replaced IPA_NODE_REF with a direct use of
23614 ipa_node_params_sum.
23615 (ipa_context_from_jfunc): Replaced IPA_EDGE_REF with a direct use of
23617 (ipcp_verify_propagated_values): Replaced IPA_NODE_REF with a direct
23618 use of ipa_node_params_sum.
23619 (self_recursively_generated_p): Likewise.
23620 (propagate_scalar_across_jump_function): Likewise.
23621 (propagate_context_across_jump_function): Replaced IPA_EDGE_REF with a
23622 direct use of ipa_edge_args_sum, moved the lookup after the early
23623 exit. Replaced IPA_NODE_REF with a direct use of ipa_node_params_sum.
23624 (propagate_bits_across_jump_function): Replaced IPA_NODE_REF with
23625 direct uses of ipa_node_params_sum.
23626 (propagate_vr_across_jump_function): Likewise.
23627 (propagate_aggregate_lattice): Likewise.
23628 (propagate_aggs_across_jump_function): Likewise.
23629 (propagate_constants_across_call): Likewise, also replaced
23630 IPA_EDGE_REF with a direct use of ipa_edge_args_sum.
23631 (good_cloning_opportunity_p): Replaced IPA_NODE_REF with a direct use
23632 of ipa_node_params_sum.
23633 (estimate_local_effects): Likewise.
23634 (add_all_node_vals_to_toposort): Likewise.
23635 (propagate_constants_topo): Likewise.
23636 (ipcp_propagate_stage): Likewise.
23637 (ipcp_discover_new_direct_edges): Likewise.
23638 (calls_same_node_or_its_all_contexts_clone_p): Likewise.
23639 (cgraph_edge_brings_value_p): Likewise (in both overloaded functions).
23640 (get_info_about_necessary_edges): Likewise.
23641 (want_remove_some_param_p): Likewise.
23642 (create_specialized_node): Likewise.
23643 (self_recursive_pass_through_p): Likewise.
23644 (self_recursive_agg_pass_through_p): Likewise.
23645 (find_more_scalar_values_for_callers_subset): Likewise and also
23646 replaced IPA_EDGE_REF with direct uses of ipa_edge_args_sum, in one
23647 case replacing two of those with a single query.
23648 (find_more_contexts_for_caller_subset): Likewise for the
23649 ipa_polymorphic_call_context overload.
23650 (intersect_aggregates_with_edge): Replaced IPA_EDGE_REF with a direct
23651 use of ipa_edge_args_sum. Replaced IPA_NODE_REF with direct uses of
23652 ipa_node_params_sum.
23653 (find_aggregate_values_for_callers_subset): Likewise, also reusing
23654 results of ipa_edge_args_sum->get.
23655 (cgraph_edge_brings_all_scalars_for_node): Replaced IPA_NODE_REF with
23656 direct uses of ipa_node_params_sum, replaced IPA_EDGE_REF with a
23657 direct use of ipa_edge_args_sum.
23658 (cgraph_edge_brings_all_agg_vals_for_node): Likewise, moved node
23659 summary query after the early exit and reused the result later.
23660 (decide_about_value): Replaced IPA_NODE_REF with a direct use of
23661 ipa_node_params_sum.
23662 (decide_whether_version_node): Likewise. Removed re-querying for
23663 summaries after cloning.
23664 (spread_undeadness): Replaced IPA_NODE_REF with a direct use of
23665 ipa_node_params_sum.
23666 (has_undead_caller_from_outside_scc_p): Likewise, reusing results of
23668 (identify_dead_nodes): Likewise.
23669 (ipcp_store_bits_results): Replaced IPA_NODE_REF with direct uses of
23670 ipa_node_params_sum.
23671 (ipcp_store_vr_results): Likewise.
23672 * ipa-fnsummary.c (evaluate_properties_for_edge): Likewise.
23673 (ipa_fn_summary_t::duplicate): Likewise.
23674 (analyze_function_body): Likewise.
23675 (estimate_calls_size_and_time): Likewise.
23676 (ipa_cached_call_context::duplicate_from): Likewise.
23677 (ipa_call_context::equal_to): Likewise.
23678 (remap_edge_params): Likewise.
23679 (ipa_merge_fn_summary_after_inlining): Likewise.
23680 (inline_read_section): Likewise.
23681 * ipa-icf.c (sem_function::param_used_p): Likewise.
23682 * ipa-modref.c (compute_parm_map): Likewise.
23683 (compute_parm_map): Replaced IPA_EDGE_REF with a direct use of
23685 (get_access_for_fnspec): Replaced IPA_NODE_REF with a direct use of
23686 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
23688 * ipa-profile.c (check_argument_count): Likewise.
23689 * ipa-prop.c (ipa_alloc_node_params): Replaced IPA_NODE_REF_GET_CREATE
23690 with a direct use of ipa_node_params_sum.
23691 (ipa_initialize_node_params): Likewise.
23692 (ipa_print_node_jump_functions_for_edge): Replaced IPA_EDGE_REF with a
23693 direct use of ipa_edge_args_sum and reused the query result.
23694 (ipa_compute_jump_functions_for_edge): Replaced IPA_NODE_REF with a
23695 direct use of ipa_node_params_sum and replaced IPA_EDGE_REF with a
23696 direct use of ipa_edge_args_sum.
23697 (ipa_note_param_call): Replaced IPA_NODE_REF with a direct use of
23698 ipa_node_params_sum and reused the result of the query.
23699 (ipa_analyze_node): Likewise.
23700 (ipa_analyze_controlled_uses): Replaced IPA_NODE_REF with a direct use
23701 of ipa_node_params_sum.
23702 (update_jump_functions_after_inlining): Replaced IPA_EDGE_REF with
23703 direct uses of ipa_edge_args_sum.
23704 (update_indirect_edges_after_inlining): Replaced IPA_NODE_REF with
23705 direct uses of ipa_node_params_sum and replaced IPA_EDGE_REF with a
23706 direct use of ipa_edge_args_sum. Removed superficial re-querying the
23708 (propagate_controlled_uses): Replaced IPA_NODE_REF with direct uses of
23709 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
23711 (ipa_propagate_indirect_call_infos): Replaced IPA_EDGE_REF with a
23712 direct use of ipa_edge_args_sum.
23713 (ipa_edge_args_sum_t::duplicate): Replaced IPA_NODE_REF with a direct
23714 use of ipa_node_params_sum.
23715 (ipa_print_node_params): Likewise.
23716 (ipa_write_node_info): Likewise and also replaced IPA_EDGE_REF with
23717 direct uses of ipa_edge_args_sum.
23718 (ipa_read_edge_info): Replaced IPA_EDGE_REF with a direct use of
23720 (ipa_read_node_info): Replaced IPA_NODE_REF with a direct use of
23721 ipa_node_params_sum.
23722 (ipa_prop_write_jump_functions): Likewise. Move variable node to the
23723 scopes where it is used.
23725 2021-05-10 Uroš Bizjak <ubizjak@gmail.com>
23727 * config/i386/i386-expand.c (ix86_expand_sse_movcc)
23728 <case E_V2SImode>: Force op_true to register.
23730 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
23732 * config/arm/iterators.md (MVE_FP_COMPARISONS): New.
23733 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_f<mode>)
23734 (mve_vcmp<mve_cmp_op>q_n_f<mode>): New, merge all vcmp_*f*
23736 (mve_vcmpeqq_f<mode>, mve_vcmpeqq_n_f<mode>, mve_vcmpgeq_f<mode>)
23737 (mve_vcmpgeq_n_f<mode>, mve_vcmpgtq_f<mode>)
23738 (mve_vcmpgtq_n_f<mode>, mve_vcmpleq_f<mode>)
23739 (mve_vcmpleq_n_f<mode>, mve_vcmpltq_f<mode>)
23740 (mve_vcmpltq_n_f<mode>, mve_vcmpneq_f<mode>)
23741 (mve_vcmpneq_n_f<mode>): Remove.
23742 * config/arm/unspecs.md (VCMPEQQ_F, VCMPEQQ_N_F, VCMPGEQ_F)
23743 (VCMPGEQ_N_F, VCMPGTQ_F, VCMPGTQ_N_F, VCMPLEQ_F, VCMPLEQ_N_F)
23744 (VCMPLTQ_F, VCMPLTQ_N_F, VCMPNEQ_F, VCMPNEQ_N_F): Remove.
23746 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
23748 * config/arm/iterators.md (MVE_COMPARISONS): New.
23750 (mve_cmp_type): New.
23751 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_<mode>): New, merge all
23753 (mve_vcmpneq_<mode>, mve_vcmpcsq_n_<mode>, mve_vcmpcsq_<mode>)
23754 (mve_vcmpeqq_n_<mode>, mve_vcmpeqq_<mode>, mve_vcmpgeq_n_<mode>)
23755 (mve_vcmpgeq_<mode>, mve_vcmpgtq_n_<mode>, mve_vcmpgtq_<mode>)
23756 (mve_vcmphiq_n_<mode>, mve_vcmphiq_<mode>, mve_vcmpleq_n_<mode>)
23757 (mve_vcmpleq_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
23758 (mve_vcmpneq_n_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
23759 (mve_vcmpneq_n_<mode>): Remove.
23761 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
23763 * config/arm/arm_mve.h (__arm_vcmp*): Remove 's' suffix.
23764 * config/arm/arm_mve_builtins.def (vcmp*): Remove 's' suffix.
23765 * config/arm/mve.md (mve_vcmp*): Remove 's' suffix in pattern
23768 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
23770 * config/arm/arm_mve_builtins.def (vcmpneq_u): Remove.
23771 (vcmpneq_n_u): Likewise.
23772 (vcmpeqq_u,): Likewise.
23773 (vcmpeqq_n_u): Likewise.
23774 * config/arm/iterators.md (supf): Remove VCMPNEQ_U, VCMPEQQ_U,
23775 VCMPEQQ_N_U and VCMPNEQ_N_U.
23776 * config/arm/mve.md (mve_vcmpneq): Remove <supf> iteration.
23777 (mve_vcmpeqq_n): Likewise.
23778 (mve_vcmpeqq): Likewise.
23779 (mve_vcmpneq_n): Likewise.
23781 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
23783 * config/arm/arm_mve.h (__arm_vcmpeq*u*, __arm_vcmpne*u*): Call
23784 the 's' version of the builtin.
23786 2021-05-10 Richard Biener <rguenther@suse.de>
23788 PR tree-optimization/100492
23789 * tree-loop-distribution.c (find_seed_stmts_for_distribution):
23790 Find nothing when the loop contains an irreducible region.
23792 2021-05-10 Richard Biener <rguenther@suse.de>
23794 PR middle-end/100464
23796 * gimple-fold.c (canonicalize_constructor_val): Do not set
23799 2021-05-10 Richard Biener <rguenther@suse.de>
23801 PR tree-optimization/100434
23802 * tree-ssa-dse.c (initialize_ao_ref_for_dse): Handle
23804 (dse_optimize_stmt): Handle call LHS by dropping the
23805 LHS or the whole call if it doesn't have other
23807 (pass_dse::execute): Adjust.
23809 2021-05-10 Martin Liska <mliska@suse.cz>
23811 * Makefile.in: Add missing genversion rule.
23813 2021-05-10 Alex Coplan <alex.coplan@arm.com>
23816 * config/arm/mve.md (*mve_mov<mode>): Simplify output code. Use
23817 vldrw.u32 and vstrw.32 for V2D[IF]mode loads and stores.
23819 2021-05-10 Martin Liska <mliska@suse.cz>
23821 * builtins.c (is_builtin_name): Use startswith
23822 function instead of strncmp.
23823 * collect2.c (main): Likewise.
23824 (has_lto_section): Likewise.
23825 (scan_libraries): Likewise.
23826 * coverage.c (coverage_checksum_string): Likewise.
23827 (coverage_init): Likewise.
23828 * dwarf2out.c (is_cxx): Likewise.
23829 (gen_compile_unit_die): Likewise.
23830 * gcc-ar.c (main): Likewise.
23831 * gcc.c (init_spec): Likewise.
23832 (read_specs): Likewise.
23833 (execute): Likewise.
23834 (check_live_switch): Likewise.
23835 * genattrtab.c (write_attr_case): Likewise.
23836 (IS_ATTR_GROUP): Likewise.
23837 * gencfn-macros.c (main): Likewise.
23838 * gengtype.c (type_for_name): Likewise.
23839 (gen_rtx_next): Likewise.
23840 (get_file_langdir): Likewise.
23841 (write_local): Likewise.
23842 * genmatch.c (get_operator): Likewise.
23843 (get_operand_type): Likewise.
23844 (expr::gen_transform): Likewise.
23845 * genoutput.c (validate_optab_operands): Likewise.
23846 * incpath.c (add_sysroot_to_chain): Likewise.
23847 * langhooks.c (lang_GNU_C): Likewise.
23848 (lang_GNU_CXX): Likewise.
23849 (lang_GNU_Fortran): Likewise.
23850 (lang_GNU_OBJC): Likewise.
23851 * lto-wrapper.c (run_gcc): Likewise.
23852 * omp-general.c (omp_max_simt_vf): Likewise.
23853 * omp-low.c (omp_runtime_api_call): Likewise.
23854 * opts-common.c (parse_options_from_collect_gcc_options): Likewise.
23855 * read-rtl-function.c (function_reader::read_rtx_operand_r): Likewise.
23856 * real.c (real_from_string): Likewise.
23857 * selftest.c (assert_str_startswith): Likewise.
23858 * timevar.c (timer::validate_phases): Likewise.
23859 * tree.c (get_file_function_name): Likewise.
23860 * ubsan.c (ubsan_use_new_style_p): Likewise.
23861 * varasm.c (default_function_rodata_section): Likewise.
23862 (incorporeal_function_p): Likewise.
23863 (default_section_type_flags): Likewise.
23864 * system.h (startswith): Define startswith.
23866 2021-05-10 Martin Liska <mliska@suse.cz>
23868 * bitmap.h (class auto_bitmap): Remove
23869 __cplusplus >= 201103.
23870 * config/aarch64/aarch64.c: Likewise.
23871 * gimple-ssa-store-merging.c (store_immediate_info::store_immediate_info):
23873 * sbitmap.h: Likewise.
23875 2021-05-10 Martin Liska <mliska@suse.cz>
23877 * Makefile.in: Rename gcov-iov to genversion and depend
23878 on version.h (instead of gcov-iov.h).
23879 * gcov-io.h: Include version.h instread of gcov-iov.h.
23880 * gengtype-state.c (read_state_version): Likewise.
23881 * gcov-iov.c: Moved to...
23882 * genversion.c: ...here.
23883 * lto-streamer.h (LTO_major_version): Define it with
23885 * version.c: Removed.
23886 * version.h: Removed.
23888 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
23890 * config/arc/arc.md (UNSPEC_ARC_DMPYWH): Define.
23891 * config/arc/simdext.md (VCT): Add predicates for iterator
23894 (voptab): Likewise.
23895 (vec_widen_<V_US>mult_hi_v4hi): Change pattern predicate.
23896 (<voptab>v2si3): New patterns.
23898 (reduc_plus_scal_v4hi): Likewise.
23899 (reduc_plus_scal_v2si): Likewise.
23900 (vec_duplicatev2si): Likewise.
23901 (vec_duplicatev4hi): Likewise.
23903 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
23905 * config/arc/simdext.md: Format and cleanup file.
23907 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
23909 * config/arc/simdext.md (movmisalignv2hi): Allow misaligned access
23910 only when munaligned-access option is on.
23911 (movmisalign<mode>): Likewise.
23913 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
23915 * common/config/arc/arc-common.c (arc_handle_option): Remove dot
23917 * config/arc/arc.c (arc_reorg): Remove underscore from string.
23919 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
23921 * config/arc/arc.h (CLZ_DEFINED_VALUE_AT_ZERO): Define.
23922 (CTZ_DEFINED_VALUE_AT_ZERO): Likewise.
23923 * config/arc/arc.md (clrsbsi2): Cleanup pattern.
23924 (norm_f): Likewise.
23927 (clzsi2): Use fls instruction when available.
23928 (arc_clzsi2): Likewise.
23930 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
23932 * config/arc/arc.h (ADDITIONAL_REGISTER_NAMES): Add r26 and r27.
23934 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
23936 * doc/extend.texi (__builtin_arc_sr): Swap arguments.
23938 2021-05-10 Bernd Edlinger <bernd.edlinger@hotmail.de>
23940 PR middle-end/100467
23941 * toplev.c (compile_file): Call insn_locations_init before
23942 targetm.asm_out.code_end.
23944 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
23947 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
23949 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
23951 2021-05-07 Jakub Jelinek <jakub@redhat.com>
23952 Andrew Stubbs <amd@codesourcery.com>
23955 * builtins.c (try_store_by_multiple_pieces): Use force_operand for
23956 emit_move_insn operands.
23958 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
23960 * cfgexpand.c (expand_gimple_basic_block): Do not inherit a current
23961 location for the outgoing edges of an empty block.
23962 * dwarf2out.c (add_subscript_info): Retrieve the bounds and index
23963 type by means of the get_array_descr_info langhook, if it is set and
23964 returns true. Remove obsolete code dealing with unnamed subtypes.
23966 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
23968 * gimple-range-cache.cc (ssa_block_ranges): Virtualize.
23969 (sbr_vector): Renamed from ssa_block_cache.
23970 (sbr_vector::sbr_vector): Allocate from obstack abd initialize.
23971 (ssa_block_ranges::~ssa_block_ranges): Remove.
23972 (sbr_vector::set_bb_range): Use varying and undefined cached values.
23973 (ssa_block_ranges::set_bb_varying): Remove.
23974 (sbr_vector::get_bb_range): Adjust assert.
23975 (sbr_vector::bb_range_p): Adjust assert.
23976 (~block_range_cache): No freeing loop required.
23977 (block_range_cache::get_block_ranges): Remove.
23978 (block_range_cache::set_bb_range): Inline get_block_ranges.
23979 (block_range_cache::set_bb_varying): Remove.
23980 * gimple-range-cache.h (set_bb_varying): Remove prototype.
23981 * value-range.h (irange_allocator::get_memory): New.
23983 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
23985 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Search
23986 dominator tree is available and requested.
23987 (ranger_cache::ssa_range_in_bb): Don't search dom tree here.
23988 (ranger_cache::fill_block_cache): Don't search dom tree here either.
23989 * gimple-range-cache.h (non_null_deref_p): Add dom_search param.
23991 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
23993 * gimple-range.cc (gimple_ranger::range_on_exit): Handle block with
23994 only PHI nodes better.
23996 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
23998 * gimple-range-edge.h (gimple_outgoing_range): Rename from
24000 (gcond_edge_range): Export prototype.
24001 * gimple-range-edge.cc (gcond_edge_range): New.
24002 (gimple_outgoing_range::edge_range_p): Use gcond_edge_range.
24003 * gimple-range-gori.h (gori_compute): Use gimple_outgoing_range.
24005 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
24007 * gimple-range-edge.cc (outgoing_range::calc_switch_ranges): Compute
24008 default range into a temp and allocate only what is needed.
24010 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
24012 * range-op.cc (operator_trunc_mod::wi_fold): x % 0 is UNDEFINED.
24014 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
24016 * gimple-range.h (gimple_range_global): Pick up parameter initial
24017 values, and use-before defined locals are UNDEFINED.
24019 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
24021 * doc/extend.texi (scalar_storage_order): Mention effect on pointer
24023 * tree.h (reverse_storage_order_for_component_p): Return false if
24024 the type is a pointer.
24026 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
24028 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
24030 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
24033 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
24034 Handle V8QI, V4HI and V2SI modes.
24035 * config/i386/mmx.md (mmx_pblendvb): New insn pattern.
24036 * config/i386/sse.md (unspec): Move UNSPEC_BLENDV ...
24037 * config/i386/i386.md (unspec): ... here.
24039 2021-05-07 Tobias Burnus <tobias@codesourcery.com>
24040 Tom de Vries <tdevries@suse.de>
24042 * omp-low.c (lower_rec_simd_input_clauses): Set max_vf = 1 if
24043 a truth_value_p reduction variable is nonintegral.
24045 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
24048 * config/i386/i386-expand.c (ix86_use_mask_cmp_p):
24049 Return false for mode sizes < 16.
24051 2021-05-07 Jakub Jelinek <jakub@redhat.com>
24054 * config/i386/mmx.md (*xop_pcmov_<mode>): New define_insn.
24056 2021-05-06 Martin Jambor <mjambor@suse.cz>
24058 * ipa-sra.c (ipa_sra_dump_all_summaries): Dump edge summaries even
24059 when there is no function summary.
24060 (ipa_sra_summarize_function): produce edge summaries even when
24063 2021-05-06 Tom Tromey <tom@tromey.com>
24065 * godump.c (string_hash_eq): Remove.
24066 (go_finish): Use htab_eq_string.
24068 2021-05-06 Tom Tromey <tom@tromey.com>
24070 * gengtype-state.c (read_state): Use htab_eq_string.
24071 (string_eq): Remove.
24073 2021-05-06 Tom Tromey <tom@tromey.com>
24075 * gensupport.c (htab_eq_string): Remove.
24077 2021-05-06 Bernd Edlinger <bernd.edlinger@hotmail.de>
24080 * debug.h (gcc_debug_hooks): Add set_ignored_loc function pointer.
24081 * dwarf2out.h (dw_fde_node::ignored_debug): New data item.
24082 * dbxout.c (dbx_debug_hooks, xcoff_debug_hooks): Add dummy
24083 set_ignored_loc callbacks.
24084 * debug.c (do_nothing_debug_hooks): Likewise.
24085 * vmsdbgout.c (vmsdbg_debug_hooks): Likewise.
24086 * dwarf2out.c (text_section_used, cold_text_section_used): Remove.
24087 (in_text_section_p, last_text_label, last_cold_label,
24088 switch_text_ranges, switch_cold_ranges): New data items.
24089 (dwarf2out_note_section_used): Remove.
24090 (dwarf2out_begin_prologue): Set fde->ignored_debug and
24092 (mark_ignored_debug_section): New helper function.
24093 (dwarf2out_end_epilogue, dwarf2out_switch_text_section): Call
24094 mark_ignored_debug_section.
24095 (dwarf2_debug_hooks): Use dwarf2out_set_ignored_loc.
24096 (dwarf2_lineno_debug_hooks): Use dummy for set_ignored_loc.
24097 (size_of_aranges): Adjust formula for multi-part text ranges size.
24098 (output_aranges): Output multi-part text ranges.
24099 (dwarf2out_set_ignored_loc): New callback function.
24100 (dwarf2out_finish): Output multi-part text ranges.
24101 (dwarf2out_c_finalize): Clear new data items.
24102 * final.c (final_start_function_1): Call set_ignored_loc callback.
24103 (final_scan_insn_1): Likewise.
24104 * ggc-page.c (gt_ggc_mx): New helper function.
24105 * stringpool.c (gt_pch_nx): Likewise.
24107 2021-05-06 Richard Biener <rguenther@suse.de>
24109 * timevar.def (TV_TREE_INSERT_PHI_NODES): Remove.
24110 (TV_TREE_SSA_REWRITE_BLOCKS): Likewise.
24111 (TV_TREE_INTO_SSA): New.
24112 * tree-into-ssa.c (insert_phi_nodes): Do not account separately.
24113 (rewrite_blocks): Likewise.
24114 (pass_data_build_ssa): Account to TV_TREE_INTO_SSA.
24116 2021-05-06 Jakub Jelinek <jakub@redhat.com>
24118 * tree-ssa-phiopt.c (value_replacement, minmax_replacement,
24119 abs_replacement, xor_replacement,
24120 cond_removal_in_popcount_clz_ctz_pattern,
24121 replace_phi_edge_with_variable): Change type of phi argument from
24122 gimple * to gphi *.
24124 2021-05-06 Richard Biener <rguenther@suse.de>
24126 * tree-ssa-loop-split.c (split_loop): Delay updating SSA form.
24127 Output an opt-info message.
24128 (do_split_loop_on_cond): Likewise.
24129 (tree_ssa_split_loops): Update SSA form here.
24131 2021-05-06 Richard Biener <rguenther@suse.de>
24133 * tree-inline.c (tree_function_versioning): Fix DECL_BY_REFERENCE
24134 return variable removal.
24136 2021-05-06 Marius Hillenbrand <mhillen@linux.ibm.com>
24138 * config/s390/s390-builtins.def (O_M5, O1_M5, ...): Remove unused macros.
24139 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
24140 (s390_vec_permi_dbl, s390_vpdi): Use the O3_U2 type for the immediate
24142 * config/s390/s390.c (s390_const_operand_ok): Remove unused
24145 2021-05-06 Jakub Jelinek <jakub@redhat.com>
24147 PR tree-optimization/94589
24148 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Call
24149 spaceship_replacement.
24150 (cond_only_block_p, spaceship_replacement): New functions.
24152 2021-05-06 Richard Biener <rguenther@suse.de>
24155 * tree-emutls.c (gen_emutls_addr): Pass in whether we're
24156 dealing with a debug use and only query existing addresses
24158 (lower_emutls_1): Avoid splitting out addresses for debug
24159 stmts, reset the debug stmt when we fail to find existing
24161 (lower_emutls_phi_arg): Set wi.stmt.
24163 2021-05-06 Christoph Muellner <cmuellner@gcc.gnu.org>
24166 * config/riscv/riscv.c (riscv_block_move_loop): Use cbranch helper.
24167 * config/riscv/riscv.md (cbranch<mode>4): Generate helpers.
24168 (stack_protect_test): Use cbranch helper.
24170 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
24173 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
24174 always return the establisher frame for __builtin_frame_address (0).
24176 2021-05-05 Ivan Sorokin <vanyacpp@gmail.com>
24179 * config/i386/i386-builtins.c (ix86_cpu_model_type_node): New.
24180 (ix86_cpu_model_var): Likewise.
24181 (ix86_cpu_features2_type_node): Likewise.
24182 (ix86_cpu_features2_var): Likewise.
24183 (fold_builtin_cpu): Cache __cpu_model and __cpu_features2 with
24186 2021-05-05 Martin Sebor <msebor@redhat.com>
24188 * passes.def (pass_warn_printf): Run after SSA.
24190 2021-05-05 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
24192 * config/arm/neon.md (neon_vtst_combine<mode>): New pattern.
24193 * config/arm/predicates.md (minus_one_operand): New predicate.
24195 2021-05-05 Jeff Law <jlaw@tachyum.com>
24197 * config/avr/avr.md: Remove references to CC_STATUS_INIT.
24199 2021-05-05 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
24201 PR rtl-optimization/100263
24202 * postreload.c (move2add_valid_value_p): Ensure register can
24205 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
24207 PR rtl-optimization/100411
24208 * cfgcleanup.c (try_crossjump_to_edge): Also skip end of prologue
24209 and beginning of function markers.
24211 2021-05-05 Jeff Law <jlaw@tachyum.com>
24213 * config/cr16/cr16.h (NOTICE_UPDATE_CC): Remove.
24214 * config/cr16/cr16.c (notice_update_cc): Remove.
24215 * config/cr16/cr16-protos.h (notice_update_cc): Remove.
24217 2021-05-05 Uroš Bizjak <ubizjak@gmail.com>
24220 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
24221 Handle V8QI, V4HI and V2SI modes.
24222 * config/i386/i386.c (ix86_build_const_vector): Handle V2SImode.
24223 (ix86_build_signbit_mask): Ditto.
24224 * config/i386/mmx.md (MMXMODE14): New mode iterator.
24225 (<smaxmin:code><MMXMODE14:mode>3): New expander.
24226 (*mmx_<smaxmin:code><MMXMODE14:mode>3): New insn pattern.
24227 (<umaxmin:code><MMXMODE24:mode>3): New expander.
24228 (*mmx_<umaxmin:code><MMXMODE24:mode>3): New insn pattern.
24229 (vec_cmp<MMXMODEI:mode><MMXMODEI:mode>): New expander.
24230 (vec_cmpu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
24231 (vcond<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
24232 (vcondu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
24233 (vcond_mask_<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
24235 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
24237 * dwarf2out.c (loc_list_from_tree_1) <DECL>: During early DWARF, do
24238 not expand the VALUE_EXPR of variables put in the non-local frame.
24239 * gimplify.c (gimplify_type_sizes) <RECORD_TYPE>: If the type is not
24240 to be ignored for debug info, ensure its variable offsets are not.
24242 2021-05-05 Richard Biener <rguenther@suse.de>
24244 PR tree-optimization/79333
24245 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
24246 Fold stmt following SSA edges.
24248 2021-05-05 Richard Biener <rguenther@suse.de>
24250 PR middle-end/100394
24251 * calls.c (expand_call): Preserve possibly throwing calls.
24252 * cfgexpand.c (expand_call_stmt): When a call can throw signal
24253 RTL expansion there are side-effects.
24254 * tree-ssa-dce.c (mark_stmt_if_obviously_necessary): Simplify,
24255 mark all possibly throwing stmts necessary unless we can elide
24257 * tree-ssa-dse.c (pass_dse::execute): Preserve exceptions unless
24258 -fdelete-dead-exceptions.
24259 * tree.h (DECL_PURE_P): Add note about exceptions.
24261 2021-05-05 Alexandre Oliva <oliva@adacore.com>
24263 * config/i386/vxworks.h (DBX_REGISTER_NUMBER): Make it
24266 2021-05-04 David Edelsohn <dje.gcc@gmail.com>
24268 * config/rs6000/rs6000-call.c (rs6000_output_mi_thunk): Use
24269 get_fnname_from_decl for name of thunk.
24270 * config/rs6000/rs6000.c (rs6000_declare_alias): Use assemble_name
24271 and ASM_OUTPUT_LABEL.
24272 (rs6000_xcoff_declare_function_name): Use assemble_name and
24274 (rs6000_xcoff_declare_object_name): Use ASM_OUTPUT_LABEL.
24275 (rs6000_xcoff_encode_section_info): Don't add mapping class
24276 for aliases. Always add [DS] mapping class to primary
24278 (rs6000_asm_weaken_decl): Don't explicitly add [DS].
24280 2021-05-04 Martin Sebor <msebor@redhat.com>
24282 PR middle-end/100307
24283 * builtins.c (compute_objsize_r): Clear base0 for pointers.
24285 2021-05-04 Jeff Law <jlaw@tachyum.com>
24287 * config/bfin/bfin.h (NOTICE_UPDATE_CC): Remove.
24289 2021-05-04 Segher Boessenkool <segher@kernel.crashing.org>
24291 * caller-save.c: Remove CC0.
24292 * cfgcleanup.c: Remove CC0.
24293 * cfgrtl.c: Remove CC0.
24294 * combine.c: Remove CC0.
24295 * compare-elim.c: Remove CC0.
24296 * conditions.h: Remove CC0.
24297 * config/h8300/h8300.h: Remove CC0.
24298 * config/h8300/h8300-protos.h: Remove CC0.
24299 * config/h8300/peepholes.md: Remove CC0.
24300 * config/i386/x86-tune-sched.c: Remove CC0.
24301 * config/m68k/m68k.c: Remove CC0.
24302 * config/rl78/rl78.c: Remove CC0.
24303 * config/sparc/sparc.c: Remove CC0.
24304 * config/xtensa/xtensa.c: Remove CC0.
24305 (gen_conditional_move): Use pc_rtx instead of cc0_rtx in a piece of
24306 RTL where that is used as a placeholder only.
24307 * cprop.c: Remove CC0.
24308 * cse.c: Remove CC0.
24309 * cselib.c: Remove CC0.
24310 * df-problems.c: Remove CC0.
24311 * df-scan.c: Remove CC0.
24312 * doc/md.texi: Remove CC0. Adjust an example.
24313 * doc/rtl.texi: Remove CC0. Adjust an example.
24314 * doc/tm.texi: Regenerate.
24315 * doc/tm.texi.in: Remove CC0.
24316 * emit-rtl.c: Remove CC0.
24317 * final.c: Remove CC0.
24318 * fwprop.c: Remove CC0.
24319 * gcse-common.c: Remove CC0.
24320 * gcse.c: Remove CC0.
24321 * genattrtab.c: Remove CC0.
24322 * genconfig.c: Remove CC0.
24323 * genemit.c: Remove CC0.
24324 * genextract.c: Remove CC0.
24325 * gengenrtl.c: Remove CC0.
24326 * genrecog.c: Remove CC0.
24327 * haifa-sched.c: Remove CC0.
24328 * ifcvt.c: Remove CC0.
24329 * ira-costs.c: Remove CC0.
24330 * ira.c: Remove CC0.
24331 * jump.c: Remove CC0.
24332 * loop-invariant.c: Remove CC0.
24333 * lra-constraints.c: Remove CC0.
24334 * lra-eliminations.c: Remove CC0.
24335 * optabs.c: Remove CC0.
24336 * postreload-gcse.c: Remove CC0.
24337 * postreload.c: Remove CC0.
24338 * print-rtl.c: Remove CC0.
24339 * read-rtl-function.c: Remove CC0.
24340 * reg-notes.def: Remove CC0.
24341 * reg-stack.c: Remove CC0.
24342 * reginfo.c: Remove CC0.
24343 * regrename.c: Remove CC0.
24344 * reload.c: Remove CC0.
24345 * reload1.c: Remove CC0.
24346 * reorg.c: Remove CC0.
24347 * resource.c: Remove CC0.
24348 * rtl.c: Remove CC0.
24349 * rtl.def: Remove CC0.
24350 * rtl.h: Remove CC0.
24351 * rtlanal.c: Remove CC0.
24352 * sched-deps.c: Remove CC0.
24353 * sched-rgn.c: Remove CC0.
24354 * shrink-wrap.c: Remove CC0.
24355 * simplify-rtx.c: Remove CC0.
24356 * system.h: Remove CC0. Poison NOTICE_UPDATE_CC, CC_STATUS_MDEP_INIT,
24357 CC_STATUS_MDEP, and CC_STATUS.
24358 * target.def: Remove CC0.
24359 * valtrack.c: Remove CC0.
24360 * var-tracking.c: Remove CC0.
24362 2021-05-04 Richard Biener <rguenther@suse.de>
24364 PR tree-optimization/100414
24365 * tree-ssa-phiopt.c (get_non_trapping): Do not compute dominance
24367 (tree_ssa_phiopt_worker): But unconditionally here.
24369 2021-05-04 Tobias Burnus <tobias@codesourcery.com>
24371 * omp-low.c (lower_rec_input_clauses, lower_reduction_clauses): Handle
24372 && and || with floating-point and complex arguments.
24374 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
24376 * tree-inline.c (insert_debug_decl_map): Delete.
24377 (copy_debug_stmt): Minor tweak.
24378 (setup_one_parameter): Do not use a variable if the value is either
24379 a read-only DECL or a non-addressable local variable in the caller.
24380 In this case, insert the debug-only variable in the map manually.
24381 (expand_call_inline): Do not generate a CLOBBER for these values.
24382 * tree-inline.h (debug_map): Minor tweak.
24384 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
24386 * builtins.c (builtin_with_linkage_p): Return true for stp[n]cpy.
24387 * symtab.c (symtab_node::output_to_lto_symbol_table_p): Tidy up.
24389 2021-05-04 Richard Biener <rguenther@suse.de>
24391 PR tree-optimization/100329
24392 * tree-ssa-reassoc.c (can_reassociate_p): Do not reassociate
24394 (insert_stmt_after): Assert we're not running into asm goto.
24396 2021-05-04 Richard Biener <rguenther@suse.de>
24398 PR tree-optimization/100398
24399 * tree-ssa-dse.c (pass_dse::execute): Preserve control
24402 2021-05-04 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
24404 * builtins.c (try_store_by_multiple_pieces): Fix constfun's prototype.
24406 2021-05-04 Alexandre Oliva <oliva@adacore.com>
24408 * builtins.c (try_store_by_multiple_pieces): New.
24409 (expand_builtin_memset_args): Use it. If target_char_cast
24410 fails, proceed as for non-constant val. Pass len's ctz to...
24411 * expr.c (clear_storage_hints): ... this. Try store by
24412 multiple pieces after setmem.
24413 (clear_storage): Adjust.
24414 * expr.h (clear_storage_hints): Likewise.
24415 (try_store_by_multiple_pieces): Declare.
24416 * passes.def: Replace the last copy_prop with ccp.
24418 2021-05-03 Tom de Vries <tdevries@suse.de>
24421 * omp-low.c (lower_rec_input_clauses): Disable SIMT for user-defined
24424 2021-05-03 Richard Biener <rguenther@suse.de>
24426 * tree-ssa-dse.c (dse_classify_store): Track two PHI defs.
24428 2021-05-03 Richard Biener <rguenther@suse.de>
24430 * tree-ssa-dse.c: Do not include domwalk.h but cfganal.h.
24431 (dse_dom_walker): Remove.
24432 (dse_dom_walker::dse_optimize_stmt): Rename...
24433 (dse_optimize_stmt): ... to this, pass in live_bytes sbitmap.
24434 (dse_dom_walker::before_dom_children): Inline ...
24435 (pass_dse::execute): ... here. Perform a reverse program
24438 2021-05-03 H.J. Lu <hjl.tools@gmail.com>
24441 * configure: Regenerated.
24443 2021-05-03 Ilya Leoshkevich <iii@linux.ibm.com>
24446 * config/s390/s390.c (s390_hard_fp_reg_p): New function.
24447 (s390_md_asm_adjust): Handle hard registers.
24449 2021-05-03 Jakub Jelinek <jakub@redhat.com>
24451 PR tree-optimization/100382
24452 * tree-ssa-dse.c: Include tree-eh.h.
24453 (dse_dom_walker::before_dom_children): Don't remove stmts if
24454 stmt_unremovable_because_of_non_call_eh_p is true.
24456 2021-05-02 David Edelsohn <dje.gcc@gmail.com>
24458 * varasm.c (compute_reloc_for_var): Split out from...
24459 (get_variable_section): Use it.
24460 * output.h (compute_reloc_for_var): Declare.
24461 * config/rs6000/rs6000-protos.h
24462 (rs6000_xcoff_asm_output_aligned_decl_common): Change alignment to
24464 * config/rs6000/rs6000.c (rs6000_legitimize_tls_address_aix):
24465 Don't append storage mapping class to symbol.
24466 (rs6000_xcoff_asm_named_section): Add BS and UL mapping classes.
24467 Don't convert TLS BSS to common.
24468 (rs6000_xcoff_unique_section): Don't fall back to select_secton.
24469 (rs6000_xcoff_section_type_flags): Add SECTION_BSS if DECL is
24471 (rs6000_xcoff_asm_globalize_decl_name): Don't strip storage
24473 (rs6000_xcoff_asm_output_aligned_decl_common): Align is unsigned int.
24474 If align is 0 from TLS class, use the same rules as varasm.c
24475 If not common, switch to BSS section manually.
24476 If common, emit appropriate comm or lcomm directive.
24477 (rs6000_xcoff_encode_section_info): Add logic to append all
24478 storage mapping classes.
24479 (rs6000_asm_weaken_decl): Adjust for qualname symbols.
24480 * config/rs6000/xcoff.h (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
24481 rs6000_xcoff_asm_output_aligned_decl_common.
24482 (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
24483 rs6000_xcoff_asm_output_aligned_decl_common.
24484 (ASM_OUTPUT_TLS_COMMON): Use
24485 rs6000_xcoff_asm_output_aligned_decl_common.
24487 2021-05-02 Jakub Jelinek <jakub@redhat.com>
24490 * config/nvptx/nvptx.c (nvptx_sese_pseudo): Use nullptr instead of 0
24491 as first argument of pseudo_node_t constructors.
24493 2021-05-02 Jakub Jelinek <jakub@redhat.com>
24496 * config/i386/t-i386 (TM_H): Add $(srcdir)/config/i386/i386-isa.def.
24498 2021-05-01 Aldy Hernandez <aldyh@redhat.com>
24500 * value-range.cc (DEFINE_INT_RANGE_GC_STUBS): Remove.
24501 (gt_pch_nx (int_range<1> *&)): New.
24502 (gt_ggc_mx (int_range<1> *&)): New.
24503 * value-range.h (class irange): Add GTY support for
24506 2021-05-01 Geng Qi <gengqi@linux.alibaba.com>
24508 * doc/options.texi (Negative): Change either or to both and.
24510 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
24512 * config/aarch64/aarch64-simd-builtins.def: Add
24513 float_ml[as][q]_laneq builtin generator macros.
24514 * config/aarch64/aarch64-simd.md (mul_laneq<mode>3): Define.
24515 (aarch64_float_mla_laneq<mode>): Define.
24516 (aarch64_float_mls_laneq<mode>): Define.
24517 * config/aarch64/arm_neon.h (vmla_laneq_f32): Use RTL builtin
24518 instead of GCC vector extensions.
24519 (vmlaq_laneq_f32): Likewise.
24520 (vmls_laneq_f32): Likewise.
24521 (vmlsq_laneq_f32): Likewise.
24523 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
24525 * config/aarch64/aarch64-simd-builtins.def: Add
24526 float_ml[as]_lane builtin generator macros.
24527 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt<mode>):
24529 (mul_lane<mode>3): This, and re-order arguments.
24530 (aarch64_float_mla_lane<mode>): Define.
24531 (aarch64_float_mls_lane<mode>): Define.
24532 * config/aarch64/arm_neon.h (vmla_lane_f32): Use RTL builtin
24533 instead of GCC vector extensions.
24534 (vmlaq_lane_f32): Likewise.
24535 (vmls_lane_f32): Likewise.
24536 (vmlsq_lane_f32): Likewise.
24538 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
24540 * config/aarch64/aarch64-simd-builtins.def: Add float_ml[as]
24541 builtin generator macros.
24542 * config/aarch64/aarch64-simd.md (aarch64_float_mla<mode>):
24544 (aarch64_float_mls<mode>): Define.
24545 * config/aarch64/arm_neon.h (vmla_f32): Use RTL builtin
24546 instead of relying on GCC vector extensions.
24547 (vmla_f64): Likewise.
24548 (vmlaq_f32): Likewise.
24549 (vmlaq_f64): Likewise.
24550 (vmls_f32): Likewise.
24551 (vmls_f64): Likewise.
24552 (vmlsq_f32): Likewise.
24553 (vmlsq_f64): Likewise.
24554 * config/aarch64/iterators.md: Define VDQF_DF mode iterator.
24556 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
24558 * config/aarch64/aarch64-simd-builtins.def: Add
24559 float_ml[as]_n_builtin generator macros.
24560 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt_from_dup<mode>):
24562 (mul_n<mode>3): This, and re-order arguments.
24563 (aarch64_float_mla_n<mode>): Define.
24564 (aarch64_float_mls_n<mode>): Define.
24565 * config/aarch64/arm_neon.h (vmla_n_f32): Use RTL builtin
24566 instead of inline asm.
24567 (vmlaq_n_f32): Likewise.
24568 (vmls_n_f32): Likewise.
24569 (vmlsq_n_f32): Likewise.
24571 2021-04-30 Jonathan Wright <joanthan.wright@arm.com>
24573 * config/aarch64/aarch64-simd-builtins.def: Add pmull[2]
24574 builtin generator macros.
24575 * config/aarch64/aarch64-simd.md (aarch64_pmullv8qi): Define.
24576 (aarch64_pmull_hiv16qi_insn): Define.
24577 (aarch64_pmull_hiv16qi): Define.
24578 * config/aarch64/arm_neon.h (vmull_high_p8): Use RTL builtin
24579 instead of inline asm.
24580 (vmull_p8): Likewise.
24582 2021-04-30 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
24584 * config/avr/avr.md: Adjust peepholes to match and
24585 generate parallels with clobber of REG_CC.
24586 (mov<mode>_insn): Rename to mov<mode>_insn_split.
24587 (*mov<mode>_insn): Rename to mov<mode>_insn.
24589 2021-04-30 David Edelsohn <dje.gcc@gmail.com>
24591 * varasm.c (use_blocks_for_decl_p): Don't use section anchors
24592 for VAR_DECLs if -fdata-sections enabled.
24594 2021-04-30 Michael Meissner <meissner@linux.ibm.com>
24596 PR bootstrap/100327
24597 * config/rs6000/rs6000.c
24598 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Define.
24599 (rs6000_libgcc_floating_mode_supported_p): New target hook.
24601 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
24603 * tree-ssa-threadbackward.c (class thread_jumps): Split out code
24605 (class back_threader_registry): ...to here...
24606 (class back_threader_profitability): ...and here...
24607 (thread_jumps::thread_through_all_blocks): Remove argument.
24608 (back_threader_registry::back_threader_registry): New.
24609 (back_threader_registry::~back_threader_registry): New.
24610 (back_threader_registry::thread_through_all_blocks): New.
24611 (thread_jumps::profitable_jump_thread_path): Move from here...
24612 (back_threader_profitability::profitable_path_p): ...to here.
24613 (thread_jumps::find_taken_edge): New.
24614 (thread_jumps::convert_and_register_current_path): Move...
24615 (back_threader_registry::register_path): ...to here.
24616 (thread_jumps::register_jump_thread_path_if_profitable): Move...
24617 (thread_jumps::maybe_register_path): ...to here.
24618 (thread_jumps::handle_phi): Call find_taken_edge and
24619 maybe_register_path.
24620 (thread_jumps::handle_assignment): Same.
24621 (thread_jumps::fsm_find_control_statement_thread_paths): Remove
24622 tree argument to handle_phi and handle_assignment.
24623 (thread_jumps::find_jump_threads_backwards): Set m_name. Remove
24624 set of m_speed_p and m_max_threaded_paths.
24625 (pass_thread_jumps::execute): Remove second argument from
24626 find_jump_threads_backwards.
24627 (pass_early_thread_jumps::execute): Same.
24629 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
24631 * tree-ssa-dom.c (class dom_jump_threader_simplifier): New.
24632 (class dom_opt_dom_walker): Initialize some class variables.
24633 (pass_dominator::execute): Pass evrp_range_analyzer and
24634 dom_jump_threader_simplifier to dom_opt_dom_walker.
24635 Adjust for some functions moving into classes.
24636 (simplify_stmt_for_jump_threading): Adjust and move to...
24637 (jump_threader_simplifier::simplify): ...here.
24638 (dom_opt_dom_walker::before_dom_children): Adjust for
24639 m_evrp_range_analyzer.
24640 (dom_opt_dom_walker::after_dom_children): Remove x_vr_values hack.
24641 (test_for_singularity): Place in dom_opt_dom_walker class.
24642 (dom_opt_dom_walker::optimize_stmt): The argument
24643 evrp_range_analyzer is now a class field.
24644 * tree-ssa-threadbackward.c (class thread_jumps): Add m_registry.
24645 (thread_jumps::thread_through_all_blocks): New.
24646 (thread_jumps::convert_and_register_current_path): Use m_registry.
24647 (pass_thread_jumps::execute): Adjust for thread_through_all_blocks
24648 being in the threader class.
24649 (pass_early_thread_jumps::execute): Same.
24650 * tree-ssa-threadedge.c (threadedge_initialize_values): Move...
24651 (jump_threader::jump_threader): ...here.
24652 (threadedge_finalize_values): Move...
24653 (jump_threader::~jump_threader): ...here.
24654 (jump_threader::remove_jump_threads_including): New.
24655 (jump_threader::thread_through_all_blocks): New.
24656 (record_temporary_equivalences_from_phis): Move...
24657 (jump_threader::record_temporary_equivalences_from_phis): ...here.
24658 (record_temporary_equivalences_from_stmts_at_dest): Move...
24659 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
24661 (simplify_control_stmt_condition_1): Move to jump_threader class.
24662 (simplify_control_stmt_condition): Move...
24663 (jump_threader::simplify_control_stmt_condition): ...here.
24664 (thread_around_empty_blocks): Move...
24665 (jump_threader::thread_around_empty_blocks): ...here.
24666 (thread_through_normal_block): Move...
24667 (jump_threader::thread_through_normal_block): ...here.
24668 (thread_across_edge): Move...
24669 (jump_threader::thread_across_edge): ...here.
24670 (thread_outgoing_edges): Move...
24671 (jump_threader::thread_outgoing_edges): ...here.
24672 * tree-ssa-threadedge.h: Move externally facing functings...
24673 (class jump_threader): ...here...
24674 (class jump_threader_simplifier): ...and here.
24675 * tree-ssa-threadupdate.c (struct redirection_data): Remove comment.
24676 (jump_thread_path_allocator::jump_thread_path_allocator): New.
24677 (jump_thread_path_allocator::~jump_thread_path_allocator): New.
24678 (jump_thread_path_allocator::allocate_thread_edge): New.
24679 (jump_thread_path_allocator::allocate_thread_path): New.
24680 (jump_thread_path_registry::jump_thread_path_registry): New.
24681 (jump_thread_path_registry::~jump_thread_path_registry): New.
24682 (jump_thread_path_registry::allocate_thread_edge): New.
24683 (jump_thread_path_registry::allocate_thread_path): New.
24684 (dump_jump_thread_path): Make extern.
24685 (debug (const vec<jump_thread_edge *> &path)): New.
24686 (struct removed_edges): Move to tree-ssa-threadupdate.h.
24687 (struct thread_stats_d): Remove.
24688 (remove_ctrl_stmt_and_useless_edges): Make static.
24689 (lookup_redirection_data): Move...
24690 (jump_thread_path_registry::lookup_redirection_data): ...here.
24691 (ssa_redirect_edges): Make static.
24692 (thread_block_1): Move...
24693 (jump_thread_path_registry::thread_block_1): ...here.
24694 (thread_block): Move...
24695 (jump_thread_path_registry::thread_block): ...here.
24696 (thread_through_loop_header): Move...
24697 (jump_thread_path_registry::thread_through_loop_header): ...here.
24698 (mark_threaded_blocks): Move...
24699 (jump_thread_path_registry::mark_threaded_blocks): ...here.
24700 (debug_path): Move...
24701 (jump_thread_path_registry::debug_path): ...here.
24702 (debug_all_paths): Move...
24703 (jump_thread_path_registry::dump): ..here.
24704 (rewire_first_differing_edge): Move...
24705 (jump_thread_path_registry::rewire_first_differing_edge): ...here.
24706 (adjust_paths_after_duplication): Move...
24707 (jump_thread_path_registry::adjust_paths_after_duplication): ...here.
24708 (duplicate_thread_path): Move...
24709 (jump_thread_path_registry::duplicate_thread_path): ..here.
24710 (remove_jump_threads_including): Move...
24711 (jump_thread_path_registry::remove_jump_threads_including): ...here.
24712 (thread_through_all_blocks): Move to...
24713 (jump_thread_path_registry::thread_through_all_blocks): ...here.
24714 (delete_jump_thread_path): Remove.
24715 (register_jump_thread): Move...
24716 (jump_thread_path_registry::register_jump_thread): ...here.
24717 * tree-ssa-threadupdate.h: Move externally facing functions...
24718 (class jump_thread_path_allocator): ...here...
24719 (class jump_thread_path_registry): ...and here.
24720 (thread_through_all_blocks): Remove.
24721 (struct removed_edges): New.
24722 (register_jump_thread): Remove.
24723 (remove_jump_threads_including): Remove.
24724 (delete_jump_thread_path): Remove.
24725 (remove_ctrl_stmt_and_useless_edges): Remove.
24726 (free_dom_edge_info): New prototype.
24727 * tree-vrp.c: Remove x_vr_values hack.
24728 (class vrp_jump_threader_simplifier): New.
24729 (vrp_jump_threader_simplifier::simplify): New.
24730 (vrp_jump_threader::vrp_jump_threader): Adjust method signature.
24731 Remove m_dummy_cond.
24732 Instantiate m_simplifier and m_threader.
24733 (vrp_jump_threader::thread_through_all_blocks): New.
24734 (vrp_jump_threader::simplify_stmt): Remove.
24735 (vrp_jump_threader::after_dom_children): Do not set m_dummy_cond.
24736 Remove x_vr_values hack.
24737 (execute_vrp): Adjust for thread_through_all_blocks being in a
24740 2021-04-30 Christophe Lyon <christophe.lyon@linaro.org>
24742 * genflags.c (gen_insn): Print failed expansion string.
24744 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
24746 * expr.c (alignment_for_piecewise_move): Call mode_for_size
24747 without limit to MAX_FIXED_MODE_SIZE.
24749 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
24751 PR middle-end/90773
24752 * builtins.c (builtin_memset_gen_str): Don't use return from
24753 simplify_gen_subreg.
24755 2021-04-30 Uroš Bizjak <ubizjak@gmail.com>
24758 * config/i386/i386.md (*add<mode>3_carry_0r): New insn pattern.
24759 (*addsi3_carry_zext_0r): Ditto.
24760 (*sub<mode>3_carry_0): Ditto.
24761 (*subsi3_carry_zext_0r): Ditto.
24762 * config/i386/predicates.md (ix86_carry_flag_unset_operator):
24764 * config/i386/i386.c (ix86_rtx_costs) <case PLUS, case MINUS>:
24765 Also consider ix86_carry_flag_unset_operator to calculate
24766 the cost of adc/sbb insn.
24768 2021-04-30 Roman Zhuykov <zhroma@ispras.ru>
24770 PR rtl-optimization/100225
24771 PR rtl-optimization/84878
24772 * modulo-sched.c (sms_schedule): Use note_stores to skip loops
24773 where we have an instruction which touches (writes) any hard
24774 register from df->regular_block_artificial_uses set.
24775 Allow not-single-set instruction only right before basic block
24778 2021-04-30 Geng Qi <gengqi@linux.alibaba.com>
24780 * config/riscv/riscv.opt (march=,mabi=): Negative itself.
24782 2021-04-30 LevyHsu <admin@levyhsu.com>
24784 * config/riscv/riscv.c (riscv_min_arithmetic_precision): New.
24785 * config/riscv/riscv.h (TARGET_MIN_ARITHMETIC_PRECISION): New.
24786 * config/riscv/riscv.md (addv<mode>4, uaddv<mode>4): New.
24787 (subv<mode>4, usubv<mode>4, mulv<mode>4, umulv<mode>4): New.
24789 2021-04-29 Alexandre Oliva <oliva@adacore.com>
24791 * config.gcc: Merged x86 and x86_64 cpu_type-setting cases.
24793 2021-04-29 Alexandre Oliva <oliva@adacore.com>
24795 * config/i386/i386.h (ASM_OUTPUT_MAX_SKIP_PAD): Rename to...
24796 (ASM_OUTPUT_MAX_SKIP_ALIGN): ... this. Enclose in do/while(0).
24797 * config/i386/i386.c: Adjust.
24798 * config/i386/i386.md: Adjust.
24799 * config/i386/darwin.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Drop.
24800 * config/i386/dragonfly.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
24801 * config/i386/freebsd.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
24802 * config/i386/gas.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
24803 * config/i386/gnu-user.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
24804 * config/i386/iamcu.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
24805 * config/i386/lynx.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
24806 * config/i386/netbsd-elf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
24807 * config/i386/openbsdelf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
24808 * config/i386/x86-64.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
24809 (ASM_OUTPUT_MAX_SKIP_PAD): Likewise.
24811 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
24813 * config/i386/i386-expand.c (ix86_expand_int_compare):
24814 Swap operands of GTU and LEU comparison to emit carry flag comparison.
24815 * config/i386/i386.md (*add<mode>3_carry_0): Change insn
24816 predicate to allow more combine opportunities with memory operands.
24817 (*sub<mode>3_carry_0): Ditto.
24819 2021-04-29 Richard Sandiford <richard.sandiford@arm.com>
24821 PR rtl-optimization/100303
24822 * rtl-ssa/accesses.cc (function_info::make_use_available): Take a
24823 boolean that indicates whether the use will only be used in
24824 debug instructions. Treat it in the same way that existing
24825 cross-EBB debug references would be handled if so.
24826 (function_info::make_uses_available): Likewise.
24827 * rtl-ssa/functions.h (function_info::make_uses_available): Update
24828 prototype accordingly.
24829 (function_info::make_uses_available): Likewise.
24830 * fwprop.c (try_fwprop_subst): Update call accordingly.
24832 2021-04-29 Jeff Law <jlaw@tachyum.com>
24834 * config/nios2/nios2-protos.h (nios2_fpu_insn_enabled): Move outside
24837 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
24838 Richard Biener <rguenther@suse.de>
24841 * config/i386/i386-builtin.def (IX86_BUILTIN_MASKLOADPD)
24842 (IX86_BUILTIN_MASKLOADPS, IX86_BUILTIN_MASKLOADPD256)
24843 (IX86_BUILTIN_MASKLOADPS256, IX86_BUILTIN_MASKLOADD)
24844 (IX86_BUILTIN_MASKLOADQ, IX86_BUILTIN_MASKLOADD256)
24845 (IX86_BUILTIN_MASKLOADQ256): Move from SPECIAL_ARGS
24846 to PURE_ARGS category.
24847 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
24848 Handle PURE_ARGS category.
24849 * config/i386/i386-expand.c (ix86_expand_builtin): Ditto.
24851 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
24853 * configure.ac: Check for the presence of sys/locking.h header and
24854 for whether _LK_LOCK is supported by _locking.
24855 * configure: Regenerate.
24856 * config.in: Likewise.
24857 * gcov-io.h: Define GCOV_LOCKED_WITH_LOCKING if HOST_HAS_LK_LOCK.
24858 * gcov-io.c (gcov_open): Add support for GCOV_LOCKED_WITH_LOCKING.
24859 * system.h: Include <sys/locking.h> if HAVE_SYS_LOCKING_H.
24861 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
24863 * config/i386/predicates.md (fcmov_comparison_operator):
24864 Do not check for trivial FP comparison operator.
24865 <case GEU, case LTU>: Allow CCGZmode.
24866 <case GTU, case LEU>: Do not allow CCCmode.
24867 (ix86_comparison_operator) <case GTU, case LEU>: Allow only CCmode.
24868 (ix86_carry_flag_operator): Match only LTU and UNLT code.
24869 Do not check for trivial FP comparison operator. Allow CCGZmode.
24871 2021-04-29 Tom de Vries <tdevries@suse.de>
24873 * omp-expand.c (expand_omp_simd): Add step_orig, and replace uses of
24874 fd->loop.step by either step or orig_step.
24876 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
24878 * config/sparc/sparc.c (gen_load_pcrel_sym): Delete.
24879 (load_got_register): Do the PIC dance here.
24880 (sparc_legitimize_tls_address): Simplify.
24881 (sparc_emit_probe_stack_range): Likewise.
24882 (sparc32_initialize_trampoline): Likewise.
24883 (sparc64_initialize_trampoline): Likewise.
24884 * config/sparc/sparc.md (load_pcrel_sym<P:mode>): Add @ marker.
24885 (probe_stack_range<P:mode>): Likewise.
24886 (flush<P:mode>): Likewise.
24887 (tgd_hi22<P:mode>): Likewise.
24888 (tgd_lo10<P:mode>): Likewise.
24889 (tgd_add<P:mode>): Likewise.
24890 (tgd_call<P:mode>): Likewise.
24891 (tldm_hi22<P:mode>): Likewise.
24892 (tldm_lo10<P:mode>): Likewise.
24893 (tldm_add<P:mode>): Likewise.
24894 (tldm_call<P:mode>): Likewise.
24895 (tldo_hix22<P:mode>): Likewise.
24896 (tldo_lox10<P:mode>): Likewise.
24897 (tldo_add<P:mode>): Likewise.
24898 (tie_hi22<P:mode>): Likewise.
24899 (tie_lo10<P:mode>): Likewise.
24900 (tie_add<P:mode>): Likewise.
24901 (tle_hix22<P:mode>): Likewise.
24902 (tle_lox10<P:mode>): Likewise.
24903 (stack_protect_setsi): Rename to...
24904 (stack_protect_set32): ...this.
24905 (stack_protect_setdi): Rename to...
24906 (stack_protect_set64): ...this.
24907 (stack_protect_set): Adjust calls to above.
24908 (stack_protect_testsi): Rename to...
24909 (stack_protect_test32): ...this.
24910 (stack_protect_testdi): Rename to...
24911 (stack_protect_test64): ...this.
24912 (stack_protect_test): Adjust calls to above.
24914 2021-04-29 H.J. Lu <hjl.tools@gmail.com>
24916 PR middle-end/90773
24917 * builtins.c (builtin_memcpy_read_str): Add a dummy argument.
24918 (builtin_strncpy_read_str): Likewise.
24919 (builtin_memset_read_str): Add an argument for the previous RTL
24920 information and generate the new RTL from the previous RTL info.
24921 (builtin_memset_gen_str): Likewise.
24922 * builtins.h (builtin_strncpy_read_str): Update the prototype.
24923 (builtin_memset_read_str): Likewise.
24924 * expr.c (by_pieces_ninsns): If targetm.overlap_op_by_pieces_p()
24925 returns true, round up size and alignment to the widest integer
24926 mode for maximum size.
24927 (pieces_addr::adjust): Add a pointer to by_pieces_prev argument
24928 and pass it to m_constfn.
24929 (op_by_pieces_d): Add m_push and m_overlap_op_by_pieces.
24930 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
24931 initialize m_push. Initialize m_overlap_op_by_pieces with
24932 targetm.overlap_op_by_pieces_p ().
24933 (op_by_pieces_d::run): Pass the previous RTL information to
24934 pieces_addr::adjust and generate overlapping operations if
24935 m_overlap_op_by_pieces is true.
24937 (move_by_pieces_d::move_by_pieces_d): Updated for op_by_pieces_d
24939 (store_by_pieces_d::store_by_pieces_d): Updated for op_by_pieces_d
24941 (can_store_by_pieces): Use by_pieces_constfn on constfun.
24942 (store_by_pieces): Use by_pieces_constfn on constfun. Updated
24943 for op_by_pieces_d change.
24944 (clear_by_pieces_1): Add a dummy argument.
24945 (clear_by_pieces): Updated for op_by_pieces_d change.
24946 (compare_by_pieces_d::compare_by_pieces_d): Likewise.
24947 (string_cst_read_str): Add a dummy argument.
24948 * expr.h (by_pieces_constfn): Add a dummy argument.
24949 (by_pieces_prev): New.
24950 * target.def (overlap_op_by_pieces_p): New target hook.
24951 * config/i386/i386.c (TARGET_OVERLAP_OP_BY_PIECES_P): New.
24952 * doc/tm.texi.in: Add TARGET_OVERLAP_OP_BY_PIECES_P.
24953 * doc/tm.texi: Regenerated.
24955 2021-04-29 Richard Biener <rguenther@suse.de>
24957 PR tree-optimization/100253
24958 * tree-vect-stmts.c (vectorizable_load): Do not assume
24959 element alignment when DR_MISALIGNMENT is -1.
24960 (vectorizable_store): Likewise.
24962 2021-04-29 Jakub Jelinek <jakub@redhat.com>
24965 * config/aarch64/aarch64.c (aarch64_add_offset_1_temporaries): Use
24966 absu_hwi instead of abs_hwi.
24968 2021-04-29 Richard Biener <rguenther@suse.de>
24970 PR middle-end/38474
24971 * tree-ssa-structalias.c (add_graph_edge): Avoid direct
24972 forwarding when indirect forwarding through ESCAPED
24975 2021-04-29 Tom de Vries <tdevries@suse.de>
24978 * internal-fn.c (expand_GOMP_SIMT_ENTER_ALLOC)
24979 (expand_GOMP_SIMT_LAST_LANE, expand_GOMP_SIMT_ORDERED_PRED)
24980 (expand_GOMP_SIMT_VOTE_ANY, expand_GOMP_SIMT_XCHG_BFLY)
24981 (expand_GOMP_SIMT_XCHG_IDX): Ensure target is assigned to.
24983 2021-04-29 Richard Biener <rguenther@suse.de>
24985 PR tree-optimization/99912
24986 * tree-ssa-dse.c (dse_dom_walker::m_need_cfg_cleanup): New.
24987 (dse_dom_walker::todo): Likewise.
24988 (dse_dom_walker::dse_optimize_stmt): Move VDEF check to the
24990 (dse_dom_walker::before_dom_children): Remove trivially
24991 dead SSA defs and schedule CFG cleanup if we removed all
24993 (pass_dse::execute): Get TODO as computed by the DOM walker
24994 and return it. Wipe dominator info earlier.
24996 2021-04-29 Richard Biener <rguenther@suse.de>
24999 * ipa-prop.c (ipcp_modif_dom_walker::before_dom_children):
25000 Track blocks to cleanup EH in new m_need_eh_cleanup.
25001 (ipcp_modif_dom_walker::cleanup_eh): New.
25002 (ipcp_transform_function): Release dominator info before
25005 2021-04-29 Martin Sebor <msebor@redhat.com>
25007 PR middle-end/100250
25008 * attribs.c (attr_access::array_as_string): Avoid dereferencing
25009 a pointer when it's null.
25011 2021-04-29 Martin Sebor <msebor@redhat.com>
25013 * Makefile.in (OBJS): Add ipa-free-lang-data.o.
25014 * ipa-free-lang-data.cc: New file.
25015 * tree.c: Move pass free_lang_data to file above.
25016 (build_array_type_1): Declare extern.
25017 * tree.h (build_array_type_1): Declare.
25019 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25021 * config/aarch64/aarch64-simd-builtins.def: Modify comment to
25022 make consistent with updated RTL pattern.
25023 * config/aarch64/aarch64-simd.md (aarch64_<sur>qmovn<mode>):
25024 Implement using ss_truncate and us_truncate rather than
25026 * config/aarch64/iterators.md: Remove redundant unspecs and
25027 iterator: UNSPEC_[SU]QXTN and SUQMOVN respectively.
25029 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25031 * config/aarch64/arm_acle.h (__attribute__): Make intrinsic
25032 attributes consistent with those defined in arm_neon.h.
25034 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25036 * config/aarch64/arm_fp16.h (__attribute__): Make intrinsic
25037 attributes consistent with those defined in arm_neon.h.
25039 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25041 * config/aarch64/aarch64-simd-builtins.def: Add
25042 float_trunc_rodd builtin generator macros.
25043 * config/aarch64/aarch64-simd.md (aarch64_float_trunc_rodd_df):
25045 (aarch64_float_trunc_rodd_lo_v2sf): Define.
25046 (aarch64_float_trunc_rodd_hi_v4sf_le): Define.
25047 (aarch64_float_trunc_rodd_hi_v4sf_be): Define.
25048 (aarch64_float_trunc_rodd_hi_v4sf): Define.
25049 * config/aarch64/arm_neon.h (vcvtx_f32_f64): Use RTL builtin
25050 instead of inline asm.
25051 (vcvtx_high_f32_f64): Likewise.
25052 (vcvtxd_f32_f64): Likewise.
25053 * config/aarch64/iterators.md: Add FCVTXN unspec.
25055 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25057 * config/aarch64/aarch64-simd-builtins.def: Add tbx1 builtin
25059 * config/aarch64/aarch64-simd.md (aarch64_tbx1<mode>):
25061 * config/aarch64/arm_neon.h (vqtbx1_s8): USE RTL builtin
25062 instead of inline asm.
25063 (vqtbx1_u8): Likewise.
25064 (vqtbx1_p8): Likewise.
25065 (vqtbx1q_s8): Likewise.
25066 (vqtbx1q_u8): Likewise.
25067 (vqtbx1q_p8): Likewise.
25068 (vtbx2_s8): Likewise.
25069 (vtbx2_u8): Likewise.
25070 (vtbx2_p8): Likewise.
25072 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25074 * config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
25076 * config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
25077 instead of inline asm.
25078 (vqtbl1_s8): Likewise.
25079 (vqtbl1_u8): Likewise.
25080 (vqtbl1q_p8): Likewise.
25081 (vqtbl1q_s8): Likewise.
25082 (vqtbl1q_u8): Likewise.
25083 (vtbl1_s8): Likewise.
25084 (vtbl1_u8): Likewise.
25085 (vtbl1_p8): Likewise.
25086 (vtbl2_s8): Likewise.
25087 (vtbl2_u8): Likewise.
25088 (vtbl2_p8): Likewise.
25090 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25092 * config/aarch64/aarch64-simd-builtins.def: Add polynomial
25093 ssri_n buitin generator macro.
25094 * config/aarch64/arm_neon.h (vsri_n_p8): Use RTL builtin
25095 instead of inline asm.
25096 (vsri_n_p16): Likewise.
25097 (vsri_n_p64): Likewise.
25098 (vsriq_n_p8): Likewise.
25099 (vsriq_n_p16): Likewise.
25100 (vsriq_n_p64): Likewise.
25102 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25104 * config/aarch64/aarch64-simd-builtins.def: Use VALLP mode
25105 iterator for polynomial ssli_n builtin generator macro.
25106 * config/aarch64/arm_neon.h (vsli_n_p8): Use RTL builtin
25107 instead of inline asm.
25108 (vsli_n_p16): Likewise.
25109 (vsliq_n_p8): Likewise.
25110 (vsliq_n_p16): Likewise.
25111 * config/aarch64/iterators.md: Define VALLP mode iterator.
25113 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25115 * config/aarch64/aarch64-simd-builtins.def: Use VDQV_L
25116 iterator to generate [su]adalp RTL builtins.
25117 * config/aarch64/aarch64-simd.md: Use VDQV_L iterator in
25118 [su]adalp RTL pattern.
25119 * config/aarch64/arm_neon.h (vpadal_s32): Use RTL builtin
25120 instead of inline asm.
25121 (vpadal_u32): Likewise.
25123 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25125 * config/aarch64/aarch64-simd-builtins.def: Add [su]addlp
25126 builtin generator macros.
25127 * config/aarch64/aarch64-simd.md (aarch64_<su>addlp<mode>):
25129 * config/aarch64/arm_neon.h (vpaddl_s8): Use RTL builtin
25130 instead of inline asm.
25131 (vpaddl_s16): Likewise.
25132 (vpaddl_s32): Likewise.
25133 (vpaddl_u8): Likewise.
25134 (vpaddl_u16): Likewise.
25135 (vpaddl_u32): Likewise.
25136 (vpaddlq_s8): Likewise.
25137 (vpaddlq_s16): Likewise.
25138 (vpaddlq_s32): Likewise.
25139 (vpaddlq_u8): Likewise.
25140 (vpaddlq_u16): Likewise.
25141 (vpaddlq_u32): Liwewise.
25142 * config/aarch64/iterators.md: Define [SU]ADDLP unspecs with
25143 appropriate attributes.
25145 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25147 * config/aarch64/aarch64-simd-builtins.def: Use VDQ_I iterator
25148 for aarch64_addp<mode> builtin macro generator.
25149 * config/aarch64/aarch64-simd.md: Use VDQ_I iterator in
25150 aarch64_addp<mode> RTL pattern.
25151 * config/aarch64/arm_neon.h (vpaddq_s8): Use RTL builtin
25152 instead of inline asm.
25153 (vpaddq_s16): Likewise.
25154 (vpaddq_s32): Likewise.
25155 (vpaddq_s64): Likewise.
25156 (vpaddq_u8): Likewise.
25157 (vpaddq_u16): Likewise.
25158 (vpaddq_u32): Likewise.
25159 (vpaddq_u64): Likewise.
25161 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
25163 * config/aarch64/aarch64-simd-builtins.def: Add sq[r]dmulh_n
25164 builtin generator macros.
25165 * config/aarch64/aarch64-simd.md (aarch64_sq<r>dmulh_n<mode>):
25167 * config/aarch64/arm_neon.h (vqdmulh_n_s16): Use RTL builtin
25168 instead of inline asm.
25169 (vqdmulh_n_s32): Likewise.
25170 (vqdmulhq_n_s16): Likewise.
25171 (vqdmulhq_n_s32): Likewise.
25172 (vqrdmulh_n_s16): Likewise.
25173 (vqrdmulh_n_s32): Likewise.
25174 (vqrdmulhq_n_s16): Likewise.
25175 (vqrdmulhq_n_s32): Likewise.
25177 2021-04-28 Tobias Burnus <tobias@codesourcery.com>
25179 * doc/install.texi (--enable-offload-defaulted): Document.
25181 2021-04-28 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
25183 * config/avr/avr-dimode.md: Turn existing patterns into
25184 define_insn_and_split style patterns where the splitter
25185 adds a clobber of the condition code register. Drop "cc"
25186 attribute. Add new patterns to match output of
25188 * config/avr/avr-fixed.md: Likewise.
25189 * config/avr/avr.c (cc_reg_rtx): New.
25190 (avr_parallel_insn_from_insns): Adjust insn count
25191 for removal of set of cc0.
25192 (avr_is_casesi_sequence): Likewise.
25193 (avr_casei_sequence_check_operands): Likewise.
25194 (avr_optimize_casesi): Likewise. Also insert
25195 new insns after jump_insn.
25196 (avr_pass_casesi::avr_rest_of_handle_casesi): Adjust
25197 for removal of set of cc0.
25198 (avr_init_expanders): Initialize cc_reg_rtx.
25199 (avr_regno_reg_class): Handle REG_CC.
25200 (cond_string): Remove usage of CC_OVERFLOW_UNUSABLE.
25201 (avr_notice_update_cc): Remove function.
25202 (ret_cond_branch): Remove usage of CC_OVERFLOW_UNUSABLE.
25203 (compare_condition): Adjust for PARALLEL with
25205 (out_shift_with_cnt): Likewise.
25206 (ashlhi3_out): Likewise.
25207 (ashrhi3_out): Likewise.
25208 (lshrhi3_out): Likewise.
25209 (avr_class_max_nregs): Return single reg for REG_CC.
25210 (avr_compare_pattern): Check for REG_CC instead
25212 (avr_reorg_remove_redundant_compare): Likewise.
25213 (avr_reorg):Adjust for PARALLEL with REG_CC clobber.
25214 (avr_hard_regno_nregs): Return single reg for REG_CC.
25215 (avr_hard_regno_mode_ok): Allow only CCmode for REG_CC.
25216 (avr_md_asm_adjust): Clobber REG_CC.
25217 (TARGET_HARD_REGNO_NREGS): Define.
25218 (TARGET_CLASS_MAX_NREGS): Define.
25219 (TARGET_MD_ASM_ADJUST): Define.
25220 * config/avr/avr.h (FIRST_PSEUDO_REGISTER): Adjust
25222 (enum reg_class): Add CC_REG class.
25223 (NOTICE_UPDATE_CC): Remove.
25224 (CC_OVERFLOW_UNUSABLE): Remove.
25225 (CC_NO_CARRY): Remove.
25226 * config/avr/avr.md: Turn existing patterns into
25227 define_insn_and_split style patterns where the splitter
25228 adds a clobber of the condition code register. Drop "cc"
25229 attribute. Add new patterns to match output of
25231 (sez): Remove unused pattern.
25233 2021-04-28 Richard Earnshaw <rearnsha@arm.com>
25236 * config/arm/arm.c (arm_hard_regno_mode_ok): Only allow VPR to be
25239 2021-04-28 Richard Sandiford <richard.sandiford@arm.com>
25242 * config/aarch64/constraints.md (Utq): Require the address to
25243 be valid for both the element mode and for V2DImode.
25245 2021-04-28 Jakub Jelinek <jakub@redhat.com>
25246 Tobias Burnus <tobias@codesourcery.com>
25248 * configure.ac (OFFLOAD_DEFAULTED): AC_DEFINE if offload-defaulted.
25249 * gcc.c (process_command): New variable.
25250 (driver::maybe_putenv_OFFLOAD_TARGETS): If OFFLOAD_DEFAULTED,
25251 set it if -foffload is defaulted.
25252 * lto-wrapper.c (OFFLOAD_TARGET_DEFAULT_ENV): Define.
25253 (compile_offload_image): If OFFLOAD_DEFAULTED and
25254 OFFLOAD_TARGET_DEFAULT is in the environment, don't fail
25255 if corresponding mkoffload can't be found.
25256 (compile_images_for_offload_targets): Likewise. Free and clear
25257 offload_names if no valid offload is found.
25258 * config.in: Regenerate.
25259 * configure: Regenerate.
25261 2021-04-28 Richard Biener <rguenther@suse.de>
25263 PR tree-optimization/100292
25264 * tree-vect-generic.c (expand_vector_condition): Do not fold
25267 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
25269 * config/rs6000/aix.h (SUBTARGET_DRIVER_SELF_SPECS): New.
25270 * config/rs6000/aix64.opt (m64): New.
25273 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
25275 * config/vax/vax.c (print_operand_address, vax_address_cost_1)
25276 (index_term_p): Handle ASHIFT too.
25278 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
25280 * config/vax/builtins.md (jbb<ccss>i<mode>): Remove operand #3.
25281 (sync_lock_test_and_set<mode>): Adjust accordingly.
25282 (sync_lock_release<mode>): Likewise.
25284 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
25286 * config/vax/vax-protos.h (adjacent_operands_p): Remove
25288 * config/vax/vax.c (adjacent_operands_p): Remove.
25290 2021-04-27 Maciej W. Rozycki <macro@linux-mips.org>
25292 * ifcvt.c (dead_or_predicable) [!IFCVT_MODIFY_TESTS]: Fall
25293 through to the non-conditional execution case if getting the
25294 condition for conditional execution has failed.
25296 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
25298 PR middle-end/100284
25299 * gimple.c (gimple_could_trap_p_1): Remove VEC_COND_EXPR test.
25300 * tree-eh.c (operation_could_trap_p): Handle VEC_COND_EXPR rather
25301 than asserting on it.
25303 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
25305 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): Protect
25306 with TARGET_AIX_OS.
25308 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
25311 * calls.c (precompute_register_parameters): Additionally test
25312 targetm.precompute_tls_p to pre-compute argument.
25313 * config/rs6000/aix.h (TARGET_PRECOMPUTE_TLS_P): Define.
25314 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): New.
25315 * target.def (precompute_tls_p): New.
25316 * doc/tm.texi.in (TARGET_PRECOMPUTE_TLS_P): Add hook documentation.
25317 * doc/tm.texi: Regenerated.
25319 2021-04-27 Jakub Jelinek <jakub@redhat.com>
25322 * config/aarch64/aarch64.c (aarch64_print_operand): Cast -UINTVAL
25323 back to HOST_WIDE_INT.
25325 2021-04-27 Bernd Edlinger <bernd.edlinger@hotmail.de>
25328 * simplify-rtx.c (simplify_context::simplify_subreg): Check the
25329 memory alignment for the outer mode.
25331 2021-04-27 H.J. Lu <hjl.tools@gmail.com>
25333 PR middle-end/90773
25334 * expr.c (op_by_pieces_d::get_usable_mode): New member function.
25335 (op_by_pieces_d::run): Cange a while loop to a do-while loop.
25337 2021-04-27 Alex Coplan <alex.coplan@arm.com>
25340 * config/arm/arm.c (arm_split_compare_and_swap): Fix up codegen
25341 with negative immediates: ensure we expand cbranchsi4_scratch
25342 correctly and ensure we satisfy its constraints.
25343 * config/arm/sync.md
25344 (@atomic_compare_and_swap<CCSI:arch><NARROW:mode>_1): Don't
25345 attempt to tie two output operands together with constraints;
25346 collapse two alternatives.
25347 (@atomic_compare_and_swap<CCSI:arch><SIDI:mode>_1): Likewise.
25348 * config/arm/thumb1.md (cbranchsi4_neg_late): New.
25350 2021-04-27 Jakub Jelinek <jakub@redhat.com>
25353 * config/aarch64/predicates.md (aarch64_sub_immediate,
25354 aarch64_plus_immediate): Use -UINTVAL instead of -INTVAL.
25355 * config/aarch64/aarch64.md (casesi, rotl<mode>3): Likewise.
25356 * config/aarch64/aarch64.c (aarch64_print_operand,
25357 aarch64_split_atomic_op, aarch64_expand_subvti): Likewise.
25359 2021-04-27 Jakub Jelinek <jakub@redhat.com>
25361 PR tree-optimization/100239
25362 * tree-vect-generic.c (lower_vec_perm): Don't accept constant
25363 permutations with all indices from the first zero element as vec_shl.
25365 2021-04-27 Jakub Jelinek <jakub@redhat.com>
25367 PR rtl-optimization/100254
25368 * cfgcleanup.c (outgoing_edges_match): Check REG_EH_REGION on
25369 last1 and last2 insns rather than BB_END (bb1) and BB_END (bb2) insns.
25371 2021-04-27 Richard Biener <rguenther@suse.de>
25373 PR tree-optimization/99912
25374 * passes.def: Add comment about new TODO_remove_unused_locals.
25375 * tree-stdarg.c (pass_data_stdarg): Run TODO_remove_unused_locals
25378 2021-04-27 Richard Biener <rguenther@suse.de>
25380 PR tree-optimization/99912
25381 * passes.def (pass_all_optimizations): Add pass_dse before
25382 the first pass_dce, move the first pass_dse before the
25383 pass_dce following pass_pre.
25385 2021-04-27 Jakub Jelinek <jakub@redhat.com>
25387 PR tree-optimization/95527
25388 * generic-match-head.c: Include tm.h.
25389 * gimple-match-head.c: Include tm.h.
25390 * match.pd (CLZ == INTEGER_CST): Don't use
25391 #ifdef CLZ_DEFINED_VALUE_AT_ZERO, only test CLZ_DEFINED_VALUE_AT_ZERO
25392 if clz == CFN_CLZ. Add missing val declaration.
25393 (CTZ cmp CST): New simplifications.
25395 2021-04-27 Jakub Jelinek <jakub@redhat.com>
25397 PR tree-optimization/96696
25398 * expr.c (expand_expr_divmod): New function.
25399 (expand_expr_real_2) <case TRUNC_DIV_EXPR>: Use it for truncations and
25400 divisions. Formatting fixes.
25401 <case MULT_EXPR>: Optimize x / y * y as x - x % y if the latter is
25404 2021-04-27 Martin Jambor <mjambor@suse.cz>
25407 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
25408 If removing a call statement LHS SSA name, release it.
25410 2021-04-27 Richard Earnshaw <rearnsha@arm.com>
25413 * config/arm/arm.c (THUMB2_WORK_REGS): Check PIC_OFFSET_TABLE_REGNUM
25414 is valid before including it in the mask.
25416 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
25419 * config/aarch64/aarch64.c (aarch64_comp_type_attributes): Handle
25422 2021-04-27 Richard Biener <rguenther@suse.de>
25424 PR tree-optimization/100051
25425 * tree-ssa-alias.c (indirect_ref_may_alias_decl_p): Add
25426 disambiguator based on access size vs. decl size.
25428 2021-04-27 Richard Biener <rguenther@suse.de>
25430 PR tree-optimization/100278
25431 * tree-ssa-pre.c (compute_avail): Give up when we cannot
25432 adjust TBAA beacuse of mismatching bases.
25434 2021-04-27 Jakub Jelinek <jakub@redhat.com>
25437 * config/i386/i386.md (*<insn><mode>3_mask, *<insn><mode>3_mask_1):
25438 For any_rotate define_insn_split and following splitters, use
25439 SWI iterator instead of SWI48.
25441 2021-04-27 Richard Biener <rguenther@suse.de>
25443 PR tree-optimization/99776
25444 * match.pd (bit_field_ref (ctor)): Relax element extract
25445 type compatibility checks.
25447 2021-04-27 Cui,Lili <lili.cui@intel.com>
25449 * common/config/i386/i386-common.c (processor_names):
25450 Sync processor_names with processor_type.
25451 * config/i386/i386-options.c (processor_cost_table):
25452 Sync processor_cost_table with processor_type.
25454 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
25456 * value-range.cc (irange::irange_set_1bit_anti_range): Add assert.
25457 (irange::set): Call irange_set_1bit_anti_range for handling all
25458 1-bit ranges. Fall through on ~[MIN,MAX].
25460 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
25462 * value-range.cc (irange::legacy_num_pairs): Remove.
25463 (irange::invert): Change gcc_assert to gcc_checking_assert.
25464 * value-range.h (irange::num_pairs): Adjust for a cached
25465 num_pairs(). Also, rename all gcc_assert's to
25466 gcc_checking_assert's.
25468 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
25470 * value-range.cc (irange::operator=): Set m_kind.
25471 (irange::copy_to_legacy): Handle varying and undefined sources
25472 as a legacy copy since they can be easily copied.
25473 (irange::irange_set): Set m_kind.
25474 (irange::irange_set_anti_range): Same.
25475 (irange::set): Rename normalize_min_max to normalize_kind.
25476 (irange::verify_range): Adjust for multi-ranges having the
25478 (irange::irange_union): Set m_kind.
25479 (irange::irange_intersect): Same.
25480 (irange::invert): Same.
25481 * value-range.h (irange::kind): Always return m_kind.
25482 (irange::varying_p): Rename to...
25483 (irange::varying_comptaible_p): ...this.
25484 (irange::undefined_p): Only look at m_kind.
25485 (irange::irange): Always set VR_UNDEFINED if applicable.
25486 (irange::set_undefined): Always set VR_UNDEFINED.
25487 (irange::set_varying): Always set m_kind to VR_VARYING.
25488 (irange::normalize_min_max): Rename to...
25489 (irange::normalize_kind): ...this.
25491 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
25493 * gimple-ssa-evrp-analyze.c (evrp_range_analyzer::set_ssa_range_info):
25494 Adjust for constant_p including varying_p.
25495 * tree-vrp.c (vrp_prop::finalize): Same.
25496 (determine_value_range): Same.
25497 * vr-values.c (vr_values::range_of_expr): Same.
25498 * value-range.cc (irange::symbolic_p): Do not check varying_p.
25499 (irange::constant_p): Same.
25501 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
25503 * value-range.cc (irange::legacy_lower_bound): Replace
25504 !undefined_p check with num_ranges > 0.
25505 (irange::legacy_upper_bound): Same.
25506 * value-range.h (irange::type): Same.
25507 (irange::lower_bound): Same.
25508 (irange::upper_bound): Same.
25510 2021-04-26 Richard Biener <rguenther@suse.de>
25512 PR tree-optimization/99956
25513 * gimple-loop-interchange.cc (compute_access_stride):
25514 Try instantiating the access in a shallower loop nest
25515 if instantiating failed.
25516 (compute_access_strides): Pass adjustable loop_nest
25517 to compute_access_stride.
25519 2021-04-26 Christophe Lyon <christophe.lyon@linaro.org>
25521 * doc/sourcebuild.texi (arm_cmse_hw): Document.
25523 2021-04-26 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25525 * config/aarch64/iterators.md (vwcore): Handle V4BF, V8BF.
25527 2021-04-26 Thomas Schwinge <thomas@codesourcery.com>
25528 Nathan Sidwell <nathan@codesourcery.com>
25529 Tom de Vries <vries@codesourcery.com>
25530 Julian Brown <julian@codesourcery.com>
25531 Kwok Cheung Yeung <kcy@codesourcery.com>
25533 * omp-offload.c (oacc_validate_dims): Implement
25534 '-Wopenacc-parallelism'.
25535 * doc/invoke.texi (-Wopenacc-parallelism): Document.
25537 2021-04-26 Richard Biener <rguenther@suse.de>
25539 * tree-cfg.h (gimplify_build1): Remove.
25540 (gimplify_build2): Likewise.
25541 (gimplify_build3): Likewise.
25542 * tree-cfg.c (gimplify_build1): Move to tree-vect-generic.c.
25543 (gimplify_build2): Likewise.
25544 (gimplify_build3): Likewise.
25545 * tree-vect-generic.c (gimplify_build1): Move from tree-cfg.c.
25547 (gimplify_build2): Likewise.
25548 (gimplify_build3): Likewise.
25549 (tree_vec_extract): Use resimplify with following SSA edges.
25550 (expand_vector_parallel): Avoid passing NULL size/bitpos
25551 to tree_vec_extract.
25552 * expr.c (store_constructor): Deal with zero-element CTORs.
25553 * match.pd (bit_field_ref <vector CTOR>): Make sure to
25554 produce vector constants when possible.
25556 2021-04-26 Richard Biener <rguenther@suse.de>
25558 * tree-complex.c: Include gimple-fold.h.
25559 (expand_complex_addition): Use gimple_build.
25560 (expand_complex_multiplication_components): Likewise.
25561 (expand_complex_multiplication): Likewise.
25562 (expand_complex_div_straight): Likewise.
25563 (expand_complex_div_wide): Likewise.
25564 (expand_complex_division): Likewise.
25565 (expand_complex_conjugate): Likewise.
25566 (expand_complex_comparison): Likewise.
25568 2021-04-26 Richard Biener <rguenther@suse.de>
25570 * tree-ssa-phiopt.c (two_value_replacement): Remove use
25571 of legacy gimplify_buildN API.
25573 2021-04-26 Richard Biener <rguenther@suse.de>
25575 PR tree-optimization/99473
25576 * tree-ssa-phiopt.c (cond_store_replacement): Handle all
25579 2021-04-26 Richard Biener <rguenther@suse.de>
25581 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin):
25582 Use replace_call_with_value.
25584 2021-04-26 Richard Biener <rguenther@suse.de>
25586 * tree-ssa-propagate.h (valid_gimple_rhs_p): Remove.
25587 (update_gimple_call): Likewise.
25588 (update_call_from_tree): Likewise.
25589 * tree-ssa-propagate.c (valid_gimple_rhs_p): Remove.
25590 (valid_gimple_call_p): Likewise.
25591 (move_ssa_defining_stmt_for_defs): Likewise.
25592 (finish_update_gimple_call): Likewise.
25593 (update_gimple_call): Likewise.
25594 (update_call_from_tree): Likewise.
25595 (propagate_tree_value_into_stmt): Use replace_call_with_value.
25596 * gimple-fold.h (update_gimple_call): Declare.
25597 * gimple-fold.c (valid_gimple_rhs_p): Move here from
25598 tree-ssa-propagate.c.
25599 (update_gimple_call): Likewise.
25600 (valid_gimple_call_p): Likewise.
25601 (finish_update_gimple_call): Likewise, and simplify.
25602 (gimplify_and_update_call_from_tree): Implement
25603 update_call_from_tree functionality, avoid excessive
25604 push/pop_gimplify_context.
25605 (gimple_fold_builtin): Use only gimplify_and_update_call_from_tree.
25606 (gimple_fold_call): Likewise.
25607 * gimple-ssa-sprintf.c (try_substitute_return_value): Likewise.
25608 * tree-ssa-ccp.c (ccp_folder::fold_stmt): Likewise.
25609 (pass_fold_builtins::execute): Likewise.
25610 (optimize_stack_restore): Use replace_call_with_value.
25611 * tree-cfg.c (fold_loop_internal_call): Likewise.
25612 * tree-ssa-dce.c (maybe_optimize_arith_overflow): Use
25613 only gimplify_and_update_call_from_tree.
25614 * tree-ssa-strlen.c (handle_builtin_strlen): Likewise.
25615 (handle_builtin_strchr): Likewise.
25616 * tsan.c: Include gimple-fold.h instead of tree-ssa-propagate.h.
25618 2021-04-26 Jakub Jelinek <jakub@redhat.com>
25621 * vmsdbgout.c (ASM_OUTPUT_DEBUG_STRING, vmsdbgout_begin_block,
25622 vmsdbgout_end_block, lookup_filename, vmsdbgout_source_line): Remove
25625 2021-04-25 liuhongt <hongtao.liu@intel.com>
25628 * config/i386/i386-builtin.def (BDESC): Change the icode of
25629 the following builtins to CODE_FOR_nothing.
25630 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
25631 IX86_BUILTIN_PCMPEQB128, IX86_BUILTIN_PCMPEQW128,
25632 IX86_BUILTIN_PCMPEQD128, IX86_BUILTIN_PCMPEQQ,
25633 IX86_BUILTIN_PCMPEQB256, IX86_BUILTIN_PCMPEQW256,
25634 IX86_BUILTIN_PCMPEQD256, IX86_BUILTIN_PCMPEQQ256,
25635 IX86_BUILTIN_PCMPGTB128, IX86_BUILTIN_PCMPGTW128,
25636 IX86_BUILTIN_PCMPGTD128, IX86_BUILTIN_PCMPGTQ,
25637 IX86_BUILTIN_PCMPGTB256, IX86_BUILTIN_PCMPGTW256,
25638 IX86_BUILTIN_PCMPGTD256, IX86_BUILTIN_PCMPGTQ256.
25639 * config/i386/sse.md (avx2_eq<mode>3): Deleted.
25640 (sse2_eq<mode>3): Ditto.
25641 (sse4_1_eqv2di3): Ditto.
25642 (sse2_gt<mode>3): Rename to ..
25643 (*sse2_gt<mode>3): .. this.
25645 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
25648 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
25651 * config/darwin.c (darwin_binds_local_p): Assume that any
25652 public symbol might be interposed for PIC code. Update function
25653 header comment to reflect current Darwin capability.
25655 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
25658 * config/darwin.c (darwin_binds_local_p): Assume that any
25659 public symbol might be interposed for PIC code. Update function
25660 header comment to reflect current Darwin capability.
25662 2021-04-24 Richard Sandiford <richard.sandiford@arm.com>
25664 * doc/sourcebuild.texi: Document no-opts and any-opts target
25667 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
25669 * config/bpf/bpf.h (ASM_OUTPUT_ALIGNED_BSS): Use .type and .lcomm.
25671 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
25673 * config/bpf/bpf.h (FUNCTION_BOUNDARY): Set to 64.
25675 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
25678 * config/i386/i386-options.c (ix86_option_override_internal):
25679 Error out when -m96bit-long-double is used with 64bit targets.
25680 * config/i386/i386.md (*pushxf_rounded): Remove pattern.
25682 2021-04-23 Martin Liska <mliska@suse.cz>
25684 * lto-wrapper.c: Remove FIXME about usage of
25685 hardware_concurrency. The function is not on par with
25688 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
25691 * config/i386/sync.md (FILD_ATOMIC/FIST_ATOMIC FP load peephole2):
25692 Copy operand 3 to operand 4. Use sse_reg_operand
25693 as operand 3 predicate.
25694 (FILD_ATOMIC/FIST_ATOMIC FP load peephole2 with mem blockage): Ditto.
25695 (LDX_ATOMIC/STX_ATOMIC FP load peephole2): Ditto.
25696 (LDX_ATOMIC/LDX_ATOMIC FP load peephole2 with mem blockage): Ditto.
25697 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2):
25698 Copy operand 1 to operand 0.
25699 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2 with mem blockage): Ditto.
25700 (LDX_ATOMIC/STX_ATOMIC FP store peephole2): Ditto.
25701 (LDX_ATOMIC/LDX_ATOMIC FP store peephole2 with mem blockage): Ditto.
25703 2021-04-23 Alex Coplan <alex.coplan@arm.com>
25705 PR rtl-optimization/100230
25706 * early-remat.c (early_remat::sort_candidates): Use delete[]
25707 instead of delete for array allocated with new[].
25709 2021-04-23 Richard Biener <rguenther@suse.de>
25711 * genmatch.c (lower_cond): Remove VEC_COND_EXPR special-casing.
25712 (capture_info::capture_info): Likewise.
25713 (capture_info::walk_match): Likewise.
25714 (expr::gen_transform): Likewise.
25715 (dt_simplify::gen_1): Likewise.
25716 * gimple-match-head.c (maybe_resimplify_conditional_op):
25717 Remove VEC_COND_EXPR special-casing.
25718 (gimple_simplify): Likewise.
25719 * gimple.c (gimple_could_trap_p_1): Adjust.
25720 * tree-ssa-pre.c (compute_avail): Allow VEC_COND_EXPR
25721 to participate in PRE.
25723 2021-04-23 Richard Biener <rguenther@suse.de>
25725 * cfganal.c (connect_infinite_loops_to_exit): First call
25726 add_noreturn_fake_exit_edges.
25727 * ipa-sra.c (process_scan_results): Do not call the now redundant
25728 add_noreturn_fake_exit_edges.
25729 * predict.c (tree_estimate_probability): Likewise.
25730 (rebuild_frequencies): Likewise.
25731 * store-motion.c (one_store_motion_pass): Likewise.
25733 2021-04-23 Richard Biener <rguenther@suse.de>
25735 PR tree-optimization/100222
25736 * predict.c (pass_profile::execute): Remove redundant call to
25737 mark_irreducible_loops.
25738 (report_predictor_hitrates): Likewise.
25740 2021-04-23 Richard Biener <rguenther@suse.de>
25742 * tree-ssa-loop-ivopts.c (rewrite_use_nonlinear_expr): Avoid
25743 valid_gimple_rhs_p by instead gimplifying to one.
25745 2021-04-23 Richard Biener <rguenther@suse.de>
25747 PR tree-optimization/99971
25748 * tree-vect-data-refs.c (vect_slp_analyze_node_dependences):
25749 Always use TBAA for loads.
25751 2021-04-23 liuhongt <hongtao.liu@intel.com>
25754 * config/i386/i386-options.c (ix86_option_override_internal):
25755 Clear MASK_AVX256_SPLIT_UNALIGNED_LOAD/STORE in x_target_flags
25756 when X86_TUNE_AVX256_UNALIGNED_LOAD/STORE_OPTIMAL is enabled
25757 by target attribute.
25759 2021-04-23 David Edelsohn <dje.gcc@gmail.com>
25761 * config/rs6000/aix71.h (PREFERRED_DEBUGGING_TYPE): Change to
25763 * config/rs6000/aix72.h (PREFERRED_DEBUGGING_TYPE): Same.
25765 2021-04-22 David Edelsohn <dje.gcc@gmail.com>
25767 * config.gcc (powerpc-ibm-aix6.*): Remove.
25768 * config/rs6000/aix61.h: Delete.
25770 2021-04-22 Martin Liska <mliska@suse.cz>
25772 PR testsuite/100159
25773 PR testsuite/100192
25774 * builtins.c (expand_builtin): Fix typos and missing comments.
25775 * dwarf2out.c (gen_subprogram_die): Likewise.
25776 (gen_struct_or_union_type_die): Likewise.
25778 2021-04-22 Uroš Bizjak <ubizjak@gmail.com>
25781 * config/i386/i386-expand.c (ix86_expand_convert_uns_sidf_sse):
25782 Remove the sign with FE_DOWNWARD, where x - x = -0.0.
25784 2021-04-21 Iain Sandoe <iain@sandoe.co.uk>
25786 * config/i386/darwin.h (TARGET_64BIT): Remove definition
25787 based on TARGET_ISA_64BIT.
25788 (TARGET_64BIT_P): Remove definition based on
25789 TARGET_ISA_64BIT_P().
25791 2021-04-21 Martin Liska <mliska@suse.cz>
25794 2021-04-21 Martin Liska <mliska@suse.cz>
25796 * lto-wrapper.c (cpuset_popcount): Remove.
25797 (init_num_threads): Remove and use hardware_concurrency.
25799 2021-04-21 Martin Liska <mliska@suse.cz>
25802 * main.c (main): Call toplev::finalize in CHECKING_P mode.
25803 * ipa-modref.c (ipa_modref_c_finalize): summaries are NULL
25804 when incremental LTO linking happens.
25806 2021-04-21 Martin Liska <mliska@suse.cz>
25808 * lto-wrapper.c (run_gcc): When -flto=jobserver is used, but the
25809 makeserver cannot be detected, then use -flto=N fallback.
25811 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
25813 * acinclude.m4 (gcc_AC_INITFINI_ARRAY): When cross-compiling,
25814 default to yes for aarch64-linux-gnu.
25815 * configure: Regenerate.
25817 2021-04-21 Martin Liska <mliska@suse.cz>
25819 * lto-wrapper.c (cpuset_popcount): Remove.
25820 (init_num_threads): Remove and use hardware_concurrency.
25822 2021-04-21 Martin Liska <mliska@suse.cz>
25824 * config/i386/i386.c: Remove superfluous || TARGET_MACHO
25825 which remains to be '(... || 0)' and clang complains about it.
25826 * dwarf2out.c (AT_vms_delta): Declare conditionally.
25827 (add_AT_vms_delta): Likewise.
25828 * tree.c (fld_simplified_type): Use rather more common pattern
25829 for disabling of something (#if 0).
25830 (get_tree_code_name): Likewise.
25831 (verify_type_variant): Likewise.
25833 2021-04-21 Martin Liska <mliska@suse.cz>
25835 * config/i386/i386-expand.c (decide_alignment): Use newly named
25836 macro TARGET_CPU_P.
25837 * config/i386/i386.c (ix86_decompose_address): Likewise.
25838 (ix86_address_cost): Likewise.
25839 (ix86_lea_outperforms): Likewise.
25840 (ix86_avoid_lea_for_addr): Likewise.
25841 (ix86_add_stmt_cost): Likewise.
25842 * config/i386/i386.h (TARGET_*): Remove.
25843 (TARGET_CPU_P): New macro.
25844 * config/i386/i386.md: Use newly named macro TARGET_CPU_P.
25845 * config/i386/x86-tune-sched-atom.c (do_reorder_for_imul): Likewise.
25846 (swap_top_of_ready_list): Likewise.
25847 (ix86_atom_sched_reorder): Likewise.
25848 * config/i386/x86-tune-sched-bd.c (ix86_bd_has_dispatch): Likewise.
25849 * config/i386/x86-tune-sched.c (ix86_adjust_cost): Likewise.
25851 2021-04-21 Martin Liska <mliska@suse.cz>
25853 * config/i386/i386-options.c (TARGET_EXPLICIT_NO_SAHF_P):
25855 (SET_TARGET_NO_SAHF): Likewise.
25856 (TARGET_EXPLICIT_PREFETCH_SSE_P): Likewise.
25857 (SET_TARGET_PREFETCH_SSE): Likewise.
25858 (TARGET_EXPLICIT_NO_TUNE_P): Likewise.
25859 (SET_TARGET_NO_TUNE): Likewise.
25860 (TARGET_EXPLICIT_NO_80387_P): Likewise.
25861 (SET_TARGET_NO_80387): Likewise.
25863 * config/i386/i386.h (TARGET_*): Remove.
25864 * opth-gen.awk: Generate new used macros.
25866 2021-04-21 Martin Liska <mliska@suse.cz>
25868 * config/i386/i386.h (PTA_*): Remove.
25869 (enum pta_flag): New.
25870 (DEF_PTA): Generate PTA_* values from i386-isa.def.
25871 * config/i386/i386-isa.def: New file.
25873 2021-04-21 Alex Coplan <alex.coplan@arm.com>
25876 * config/aarch64/aarch64-bti-insert.c (aarch64_bti_j_insn_p): New.
25877 (rest_of_insert_bti): Avoid inserting duplicate bti j insns for
25878 jump table targets.
25880 2021-04-21 H.J. Lu <hjl.tools@gmail.com>
25882 * config.gcc: Install mwaitintrin.h for i[34567]86-*-* and
25883 x86_64-*-* targets.
25884 * common/config/i386/i386-common.c (OPTION_MASK_ISA2_MWAIT_SET):
25886 (OPTION_MASK_ISA2_MWAIT_UNSET): Likewise.
25887 (ix86_handle_option): Handle -mmwait.
25888 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
25889 Replace OPTION_MASK_ISA_SSE3 with OPTION_MASK_ISA2_MWAIT on
25890 __builtin_ia32_monitor and __builtin_ia32_mwait.
25891 * config/i386/i386-options.c (isa2_opts): Add -mmwait.
25892 (ix86_valid_target_attribute_inner_p): Likewise.
25893 (ix86_option_override_internal): Enable mwait/monitor
25894 instructions for -msse3.
25895 * config/i386/i386.h (TARGET_MWAIT): New.
25896 (TARGET_MWAIT_P): Likewise.
25897 * config/i386/i386.opt: Add -mmwait.
25898 * config/i386/mwaitintrin.h: New file.
25899 * config/i386/pmmintrin.h: Include <mwaitintrin.h>.
25900 * config/i386/sse.md (sse3_mwait): Replace TARGET_SSE3 with
25902 (@sse3_monitor_<mode>): Likewise.
25903 * config/i386/x86gprintrin.h: Include <mwaitintrin.h>.
25904 * doc/extend.texi: Document mwait target attribute.
25905 * doc/invoke.texi: Document -mmwait.
25907 2021-04-21 Martin Liska <mliska@suse.cz>
25909 * config/i386/i386-options.c (DEF_ENUM): Remove it.
25910 * config/i386/i386-opts.h (DEF_ENUM): Likewise.
25911 * config/i386/stringop.def (DEF_ENUM): Likewise.
25913 2021-04-21 Martin Liska <mliska@suse.cz>
25915 * tree-cfg.c (gimple_verify_flow_info): Use qD instead
25916 of print_generic_expr.
25918 2021-04-21 Jakub Jelinek <jakub@redhat.com>
25920 PR rtl-optimization/100148
25921 * cprop.c (constprop_register): Use next_nondebug_insn instead of
25924 2021-04-21 Martin Liska <mliska@suse.cz>
25927 * cgraphunit.c (cgraph_node::analyze): Remove duplicate
25928 free_dominance_info calls.
25930 2021-04-21 Richard Biener <rguenther@suse.de>
25932 * gimple-fold.c (maybe_fold_reference): Remove is_lhs
25933 parameter (and assume it to be false).
25934 (fold_gimple_assign): Adjust, remove all callers of
25935 maybe_fold_reference calling it with is_lhs true.
25936 (gimple_fold_call): Likewise.
25937 (fold_stmt_1): Likewise.
25939 2021-04-21 Richard Biener <rguenther@suse.de>
25941 * fold-const.c (pedantic_non_lvalue_loc): Remove.
25942 (fold_binary_loc): Adjust.
25943 (fold_ternary_loc): Likewise.
25945 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
25947 PR middle-end/100130
25948 * varasm.c (get_block_for_decl): Make sure that any use of the
25949 retain attribute matches the section's retain flag.
25950 (switch_to_section): Check for retain mismatches even when
25951 changing sections, but do not warn if the given decl is the
25952 section's named.decl.
25953 (output_object_block): Pass the first decl in the block (if any)
25954 to switch_to_section.
25956 2021-04-20 H.J. Lu <hjl.tools@gmail.com>
25958 * config/i386/i386-c.c (ix86_target_macros_internal): Define
25959 __CRC32__ for -mcrc32.
25960 * config/i386/i386-options.c (ix86_option_override_internal):
25961 Enable crc32 instruction for -msse4.2.
25962 * config/i386/i386.md (sse4_2_crc32<mode>): Remove TARGET_SSE4_2
25964 (sse4_2_crc32di): Likewise.
25965 * config/i386/ia32intrin.h: Use crc32 target option for CRC32
25968 2021-04-20 Segher Boessenkool <segher@kernel.crashing.org>
25971 * config/rs6000/rs6000.c (rs6000_machine_from_flags): Do not consider
25974 2021-04-20 Martin Liska <mliska@suse.cz>
25976 * doc/invoke.texi: Fix typo.
25977 * params.opt: Likewise.
25979 2021-04-20 Martin Liska <mliska@suse.cz>
25981 * doc/invoke.texi: Document new param.
25983 2021-04-19 Andrew MacLeod <amacleod@redhat.com>
25985 PR tree-optimization/100081
25986 * gimple-range-cache.h (ranger_cache): Inherit from gori_compute
25987 rather than gori_compute_cache.
25988 * gimple-range-gori.cc (is_gimple_logical_p): Move to top of file.
25989 (range_def_chain::m_logical_depth): New member.
25990 (range_def_chain::range_def_chain): Initialize m_logical_depth.
25991 (range_def_chain::get_def_chain): Don't build defchains through more
25992 than LOGICAL_LIMIT logical expressions.
25993 * params.opt (param_ranger_logical_depth): New.
25995 2021-04-19 Richard Earnshaw <rearnsha@arm.com>
25998 * config/arm/arm.c (arm_configure_build_target): Do not strip
25999 extended FPU/SIMD feature bits from the target ISA when -mfpu
26000 is specified (partial revert of r11-8168).
26002 2021-04-19 Thomas Schwinge <thomas@codesourcery.com>
26004 * params.opt (-param=openacc-kernels=): Add.
26005 * omp-oacc-kernels-decompose.cc
26006 (pass_omp_oacc_kernels_decompose::gate): Use it.
26007 * doc/invoke.texi (-fopenacc-kernels=@var{mode}): Move...
26008 (--param): ... here, 'openacc-kernels'.
26010 2021-04-19 Martin Liska <mliska@suse.cz>
26013 * gengtype.c (finish_root_table): Align function arguments
26014 in between declaration and definition.
26016 2021-04-19 Eric Botcazou <ebotcazou@adacore.com>
26018 * config/i386/winnt.c (i386_pe_seh_cold_init): Properly deal with
26019 frames larger than the SEH maximum frame size.
26021 2021-04-18 Segher Boessenkool <segher@kernel.crashing.org>
26023 PR rtl-optimization/99927
26024 * combine.c (distribute_notes) [REG_UNUSED]: If the register already
26025 is dead, just drop it.
26027 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
26030 * config/i386/winnt-d.c (TARGET_D_TEMPLATES_ALWAYS_COMDAT): Define.
26031 * doc/tm.texi: Regenerate.
26032 * doc/tm.texi.in (D language and ABI): Add @hook for
26033 TARGET_D_TEMPLATES_ALWAYS_COMDAT.
26035 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
26037 * config/darwin-d.c (darwin_d_handle_target_object_format): New
26039 (darwin_d_register_target_info): New function.
26040 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26041 * config/dragonfly-d.c (dragonfly_d_handle_target_object_format): New
26043 (dragonfly_d_register_target_info): New function.
26044 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26045 * config/freebsd-d.c (freebsd_d_handle_target_object_format): New
26047 (freebsd_d_register_target_info): New function.
26048 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26049 * config/glibc-d.c (glibc_d_handle_target_object_format): New
26051 (glibc_d_register_target_info): New function.
26052 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26053 * config/i386/i386-d.c (ix86_d_handle_target_object_format): New
26055 (ix86_d_register_target_info): Add ix86_d_handle_target_object_format
26056 as handler for objectFormat key.
26057 * config/i386/winnt-d.c (winnt_d_handle_target_object_format): New
26059 (winnt_d_register_target_info): New function.
26060 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26061 * config/netbsd-d.c (netbsd_d_handle_target_object_format): New
26063 (netbsd_d_register_target_info): New function.
26064 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26065 * config/openbsd-d.c (openbsd_d_handle_target_object_format): New
26067 (openbsd_d_register_target_info): New function.
26068 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26069 * config/pa/pa-d.c (pa_d_handle_target_object_format): New function.
26070 (pa_d_register_target_info): Add pa_d_handle_target_object_format as
26071 handler for objectFormat key.
26072 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_object_format): New
26074 (rs6000_d_register_target_info): Add
26075 rs6000_d_handle_target_object_format as handler for objectFormat key.
26076 * config/sol2-d.c (solaris_d_handle_target_object_format): New
26078 (solaris_d_register_target_info): New function.
26079 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
26081 2021-04-16 Jakub Jelinek <jakub@redhat.com>
26084 * config/aarch64/aarch64.c (aarch64_function_arg_alignment): Change
26085 abi_break argument from bool * to unsigned *, store there the pre-GCC 9
26087 (aarch64_layout_arg, aarch64_gimplify_va_arg_expr): Adjust callers.
26088 (aarch64_function_arg_regno_p): Likewise. Only emit -Wpsabi note if
26089 the old and new alignment after applying MIN/MAX to it is different.
26091 2021-04-16 Tamar Christina <tamar.christina@arm.com>
26094 * config/aarch64/aarch64-sve.md (@aarch64_sve_trn1_conv<mode>): New.
26095 * config/aarch64/aarch64.c (aarch64_expand_sve_const_pred_trn): Use new
26097 * config/aarch64/iterators.md (UNSPEC_TRN1_CONV): New.
26099 2021-04-16 Bill Schmidt <wschmidt@linux.ibm.com>
26101 * doc/extend.texi (PowerPC AltiVec/VSX Built-in Functions): Revise
26102 this section and its subsections.
26104 2021-04-16 Jakub Jelinek <jakub@redhat.com>
26107 * config/aarch64/aarch64.md (*neg_asr_si2_extr, *extrsi5_insn_di): New
26108 define_insn patterns.
26110 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
26112 PR rtl-optimization/98689
26113 * reg-notes.def (UNTYPED_CALL): New note.
26114 * combine.c (distribute_notes): Handle it.
26115 * emit-rtl.c (try_split): Likewise.
26116 * rtlanal.c (rtx_properties::try_to_add_insn): Likewise. Assume
26117 that calls with the note implicitly set all return value registers.
26118 * builtins.c (expand_builtin_apply): Add a REG_UNTYPED_CALL
26121 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
26123 PR rtl-optimization/99596
26124 * rtlanal.c (rtx_properties::try_to_add_insn): Don't add global
26125 register accesses for const calls. Assume that pure functions
26126 can only read from global registers. Ignore cases in which
26127 the stack pointer has been marked global.
26129 2021-04-16 Jakub Jelinek <jakub@redhat.com>
26132 * tree-vect-loop.c (vect_transform_loop): Don't remove just
26133 dead scalar .MASK_LOAD calls, but also dead .COND_* calls - replace
26134 them by their last argument.
26136 2021-04-15 Martin Liska <mliska@suse.cz>
26138 * doc/invoke.texi: Other params don't use it, remove it.
26140 2021-04-15 Richard Biener <rguenther@suse.de>
26142 * gimple-builder.h: Add deprecation note.
26144 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
26147 * attribs.h (restrict_type_identity_attributes_to): Declare.
26148 * attribs.c (restrict_type_identity_attributes_to): New function.
26150 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
26153 * attribs.h (affects_type_identity_attributes): Declare.
26154 * attribs.c (remove_attributes_matching): New function.
26155 (affects_type_identity_attributes): Likewise.
26157 2021-04-15 Jakub Jelinek <jakub@redhat.com>
26160 * config/aarch64/aarch64.md (*<LOGICAL:optab>_<SHIFT:optab><mode>3):
26161 Add combine splitters for *<LOGICAL:optab>_ashl<mode>3 with
26162 ZERO_EXTEND, SIGN_EXTEND or AND.
26164 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
26166 PR rtl-optimization/99929
26167 * rtl.h (same_vector_encodings_p): New function.
26168 * cse.c (exp_equiv_p): Check that CONST_VECTORs have the same encoding.
26169 * cselib.c (rtx_equal_for_cselib_1): Likewise.
26170 * jump.c (rtx_renumbered_equal_p): Likewise.
26171 * lra-constraints.c (operands_match_p): Likewise.
26172 * reload.c (operands_match_p): Likewise.
26173 * rtl.c (rtx_equal_p_cb, rtx_equal_p): Likewise.
26175 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
26177 * print-rtl.c (rtx_writer::print_rtx_operand_codes_E_and_V): Print
26178 more information about variable-length CONST_VECTORs.
26180 2021-04-14 Vladimir N. Makarov <vmakarov@redhat.com>
26182 PR rtl-optimization/100066
26183 * lra-constraints.c (split_reg): Check paradoxical_subreg_p for
26184 ordered modes when choosing splitting mode for hard reg.
26186 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
26189 * config/aarch64/aarch64.c (aarch64_expand_sve_const_vector_sel):
26191 (aarch64_expand_sve_const_vector): Use it for nelts_per_pattern==2.
26193 2021-04-14 Andreas Krebbel <krebbel@linux.ibm.com>
26195 * config/s390/s390-builtins.def (O_M5, O_M12, ...): Add new macros
26196 for mask operand types.
26197 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
26198 (s390_vec_permi_dbl, s390_vpdi): Use the M5 type for the immediate
26200 (s390_vec_msum_u128, s390_vmslg): Use the M12 type for the
26202 * config/s390/s390.c (s390_const_operand_ok): Check the new
26203 operand types and generate a list of valid values.
26205 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
26207 * doc/tm.texi: Regenerate.
26208 * doc/tm.texi.in (D language and ABI): Add @hook for
26209 TARGET_D_REGISTER_OS_TARGET_INFO.
26211 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
26213 * config/aarch64/aarch64-d.c (aarch64_d_handle_target_float_abi): New
26215 (aarch64_d_register_target_info): New function.
26216 * config/aarch64/aarch64-protos.h (aarch64_d_register_target_info):
26218 * config/aarch64/aarch64.h (TARGET_D_REGISTER_CPU_TARGET_INFO):
26220 * config/arm/arm-d.c (arm_d_handle_target_float_abi): New function.
26221 (arm_d_register_target_info): New function.
26222 * config/arm/arm-protos.h (arm_d_register_target_info): Declare.
26223 * config/arm/arm.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26224 * config/i386/i386-d.c (ix86_d_handle_target_float_abi): New function.
26225 (ix86_d_register_target_info): New function.
26226 * config/i386/i386-protos.h (ix86_d_register_target_info): Declare.
26227 * config/i386/i386.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26228 * config/mips/mips-d.c (mips_d_handle_target_float_abi): New function.
26229 (mips_d_register_target_info): New function.
26230 * config/mips/mips-protos.h (mips_d_register_target_info): Declare.
26231 * config/mips/mips.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26232 * config/pa/pa-d.c (pa_d_handle_target_float_abi): New function.
26233 (pa_d_register_target_info): New function.
26234 * config/pa/pa-protos.h (pa_d_register_target_info): Declare.
26235 * config/pa/pa.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26236 * config/riscv/riscv-d.c (riscv_d_handle_target_float_abi): New
26238 (riscv_d_register_target_info): New function.
26239 * config/riscv/riscv-protos.h (riscv_d_register_target_info): Declare.
26240 * config/riscv/riscv.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26241 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_float_abi): New
26243 (rs6000_d_register_target_info): New function.
26244 * config/rs6000/rs6000-protos.h (rs6000_d_register_target_info):
26246 * config/rs6000/rs6000.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26247 * config/s390/s390-d.c (s390_d_handle_target_float_abi): New function.
26248 (s390_d_register_target_info): New function.
26249 * config/s390/s390-protos.h (s390_d_register_target_info): Declare.
26250 * config/s390/s390.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26251 * config/sparc/sparc-d.c (sparc_d_handle_target_float_abi): New
26253 (sparc_d_register_target_info): New function.
26254 * config/sparc/sparc-protos.h (sparc_d_register_target_info): Declare.
26255 * config/sparc/sparc.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
26256 * doc/tm.texi: Regenerate.
26257 * doc/tm.texi.in (D language and ABI): Add @hook for
26258 TARGET_D_REGISTER_CPU_TARGET_INFO.
26260 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
26262 * config/i386/i386-d.c (ix86_d_has_stdcall_convention): New function.
26263 * config/i386/i386-protos.h (ix86_d_has_stdcall_convention): Declare.
26264 * config/i386/i386.h (TARGET_D_HAS_STDCALL_CONVENTION): Define.
26265 * doc/tm.texi: Regenerate.
26266 * doc/tm.texi.in (D language and ABI): Add @hook for
26267 TARGET_D_HAS_STDCALL_CONVENTION.
26269 2021-04-14 Richard Biener <rguenther@suse.de>
26271 * tree-cfg.c (verify_gimple_assign_ternary): Verify that
26272 VEC_COND_EXPRs have a gimple_val condition.
26273 * tree-ssa-propagate.c (valid_gimple_rhs_p): VEC_COND_EXPR
26274 can no longer have a GENERIC condition.
26276 2021-04-14 Richard Earnshaw <rearnsha@arm.com>
26279 * config/arm/arm.c (arm_configure_build_target): Strip isa_all_fpbits
26280 from the isa_delta when -mfpu has been used.
26281 (arm_options_perform_arch_sanity_checks): It's the architecture that
26282 lacks an FPU not the processor.
26284 2021-04-13 Richard Biener <rguenther@suse.de>
26286 PR tree-optimization/100053
26287 * tree-ssa-sccvn.c (vn_nary_op_get_predicated_value): Do
26288 not use optimistic dominance queries for backedges to validate
26290 (dominated_by_p_w_unex): Add parameter to ignore executable
26291 state on backedges.
26292 (rpo_elim::eliminate_avail): Adjust.
26294 2021-04-13 Jakub Jelinek <jakub@redhat.com>
26297 * config/aarch64/aarch64.md (*aarch64_bfxil<mode>_extr,
26298 *aarch64_bfxilsi_extrdi): New define_insn patterns.
26300 2021-04-13 Jakub Jelinek <jakub@redhat.com>
26303 * simplify-rtx.c (simplify_immed_subreg): For MODE_COMPOSITE_P
26304 outermode, return NULL if the result doesn't encode back to the
26305 original byte sequence.
26306 (simplify_gen_subreg): Don't create SUBREGs from constants to
26307 MODE_COMPOSITE_P outermode.
26309 2021-04-12 Jakub Jelinek <jakub@redhat.com>
26311 PR rtl-optimization/99905
26312 * combine.c (expand_compound_operation): If pos + len > modewidth,
26313 perform the right shift by pos in inner_mode and then convert to mode,
26314 instead of trying to simplify a shift of rtx with inner_mode by pos
26315 as if it was a shift in mode.
26317 2021-04-12 Jakub Jelinek <jakub@redhat.com>
26320 * combine.c (simplify_and_const_int_1): Don't optimize varop
26321 away if it has side-effects.
26323 2021-04-12 Martin Liska <mliska@suse.cz>
26325 * doc/extend.texi: Escape @smallexample content.
26327 2021-04-12 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
26329 * config/s390/s390.md ("*movdi_31", "*movdi_64"): Add
26330 alternative in order to load a DFP zero.
26332 2021-04-12 Martin Liska <mliska@suse.cz>
26334 * doc/extend.texi: Be more precise in documentation
26335 of symver attribute.
26337 2021-04-12 Martin Liska <mliska@suse.cz>
26340 * gimplify.c (gimplify_expr): Right now, we unpoison all
26341 variables before a goto <dest>. We should not do it if we are
26344 2021-04-12 Cui,Lili <lili.cui@intel.com>
26346 * common/config/i386/cpuinfo.h (get_intel_cpu): Handle
26348 * common/config/i386/i386-common.c (processor_names): Add
26350 (processor_alias_table): Add rocketlake.
26351 * common/config/i386/i386-cpuinfo.h (processor_subtypes): Add
26352 INTEL_COREI7_ROCKETLAKE.
26353 * config.gcc: Add -march=rocketlake.
26354 * config/i386/i386-c.c (ix86_target_macros_internal): Handle
26356 * config/i386/i386-options.c (m_ROCKETLAKE) : Define.
26357 (processor_cost_table): Add rocketlake cost.
26358 * config/i386/i386.h (ix86_size_cost) : Define
26360 (processor_type) : Add PROCESSOR_ROCKETLAKE.
26361 (PTA_ROCKETLAKE): Ditto.
26362 * doc/extend.texi: Add rocketlake.
26363 * doc/invoke.texi: Add rocketlake.
26365 2021-04-12 Cui,Lili <lili.cui@intel.com>
26367 * config/i386/i386.h (PTA_ALDERLAKE): Change alderlake ISA list.
26368 * config/i386/i386-options.c (m_CORE_AVX2): Add m_ALDERLAKE.
26369 * common/config/i386/cpuinfo.h (get_intel_cpu): Add AlderLake model.
26370 * doc/invoke.texi: Change alderlake ISA list.
26372 2021-04-11 Hafiz Abid Qadeer <abidh@codesourcery.com>
26374 PR middle-end/98088
26375 * omp-expand.c (expand_oacc_collapse_init): Update condition in
26378 2021-04-10 H.J. Lu <hjl.tools@gmail.com>
26381 * config/i386/serializeintrin.h (_serialize): Defined as macro.
26383 2021-04-10 Jakub Jelinek <jakub@redhat.com>
26386 * expr.c (expand_expr_addr_expr_1): Test is_global_var rather than
26387 just TREE_STATIC on COMPOUND_LITERAL_EXPR_DECLs.
26389 2021-04-10 Jakub Jelinek <jakub@redhat.com>
26391 PR middle-end/99989
26392 * gimple-ssa-warn-alloca.c
26393 (alloca_type_and_limit::alloca_type_and_limit): Initialize limit to
26394 0 with integer precision unconditionally.
26396 2021-04-10 Jakub Jelinek <jakub@redhat.com>
26398 PR rtl-optimization/98601
26399 * rtlanal.c (rtx_addr_can_trap_p_1): Allow in assert unknown size
26400 not just for BLKmode, but also for VOIDmode. For STRICT_ALIGNMENT
26401 unaligned_mems handle VOIDmode like BLKmode.
26403 2021-04-10 Jan Hubicka <hubicka@ucw.cz>
26406 * tree.c (free_lang_data_in_decl): Do not release body of
26407 declare_variant_alt.
26409 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
26411 * config/aarch64/aarch64.c (aarch64_option_restore): If the
26412 architecture was specified explicitly and the tuning wasn't,
26413 tune for the architecture rather than the configured default CPU.
26415 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
26417 * config/aarch64/aarch64.md (tlsdesc_small_sve_<mode>): Use X30
26418 as the temporary register.
26420 2021-04-09 Martin Liska <mliska@suse.cz>
26422 * doc/extend.texi: Move non-target attributes on the top level.
26424 2021-04-09 Martin Liska <mliska@suse.cz>
26426 * doc/invoke.texi: Document minimum and maximum value of the
26427 argument for both supported compression algorithms.
26429 2021-04-08 David Edelsohn <dje.gcc@gmail.com>
26431 * config/rs6000/rs6000.c (rs6000_xcoff_select_section): Select
26432 TLS BSS before TLS data.
26433 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .comm.
26435 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
26437 * doc/sourcebuild.texi (stdint_types_mbig_endian): Document.
26439 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
26441 * match.pd: Extend vec_cond folds to handle shifts.
26443 2021-04-08 Maciej W. Rozycki <macro@orcam.me.uk>
26445 * config/vax/vax.md: Fix comment for `*bit<mode>' pattern's
26448 2021-04-08 Alex Coplan <alex.coplan@arm.com>
26451 * config/arm/iterators.md (MVE_vecs): New.
26452 (V_elem): Also handle V2DF.
26453 * config/arm/mve.md (*mve_mov<mode>): Rename to ...
26454 (*mve_vdup<mode>): ... this. Remove second alternative since
26455 vec_duplicate of const_int is not canonical RTL, and we don't
26456 want to match symbol_refs.
26457 (*mve_vec_duplicate<mode>): Delete (pattern is redundant).
26459 2021-04-08 Xionghu Luo <luoxhu@linux.ibm.com>
26461 * fold-const.c (fold_single_bit_test): Fix typo.
26462 * print-rtl.c (print_rtx_insn_vec): Call print_rtl_single
26465 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
26467 PR tree-optimization/97513
26468 * tree-vect-slp.c (vect_add_slp_permutation): New function,
26470 (vectorizable_slp_permutation): ...here. Detect cases in which
26471 all VEC_PERM_EXPRs are guaranteed to have the same stepped
26472 permute vector and only generate one permute vector for that case.
26473 Extend that case to handle variable-length vectors.
26475 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
26477 PR tree-optimization/99873
26478 * tree-vect-slp.c (vect_slp_prefer_store_lanes_p): New function.
26479 (vect_build_slp_instance): Don't split store groups that could
26480 use IFN_STORE_LANES.
26482 2021-04-07 Jakub Jelinek <jakub@redhat.com>
26485 * varasm.c (output_constant_pool_contents): Don't strip name encoding
26486 from XSTR (desc->sym, 0) or from label before passing those to
26489 2021-04-07 Richard Biener <rguenther@suse.de>
26491 PR tree-optimization/99954
26492 * tree-loop-distribution.c: Include tree-affine.h.
26493 (generate_memcpy_builtin): Try using tree-affine to prove
26495 (loop_distribution::classify_builtin_ldst): Always classify
26498 2021-04-07 Richard Biener <rguenther@suse.de>
26500 PR tree-optimization/99947
26501 * tree-vect-loop.c (vectorizable_induction): Pre-allocate
26502 steps vector to avoid pushing elements from the reallocated
26505 2021-04-07 Richard Biener <rguenther@suse.de>
26507 * tree-ssa-sccvn.h (print_vn_reference_ops): Declare.
26508 * tree-ssa-pre.c (print_pre_expr): Factor out VN reference operand
26510 * tree-ssa-sccvn.c (print_vn_reference_ops): ... into this new
26512 (debug_vn_reference_ops): New.
26514 2021-04-07 Bin Cheng <bin.cheng@linux.alibaba.com>
26516 PR tree-optimization/98736
26517 * tree-loop-distribution.c
26518 * (loop_distribution::bb_top_order_init):
26519 Compute RPO with programing order preserved by calling function
26520 rev_post_order_and_mark_dfs_back_seme.
26522 2021-04-06 Vladimir N. Makarov <vmakarov@redhat.com>
26525 * lra-constraints.c (split_reg): Don't check paradoxical_subreg_p.
26526 * lra-lives.c (clear_sparseset_regnos, regnos_in_sparseset_p): New
26528 (process_bb_lives): Don't update biggest mode of hard reg for
26529 implicit in multi-register group. Use the new functions for
26530 updating dead_set and unused_set by register notes.
26532 2021-04-06 Xianmiao Qu <xianmiao_qu@c-sky.com>
26534 * config/csky/csky_pipeline_ck802.md : Use insn reservation name
26537 2021-04-06 H.J. Lu <hjl.tools@gmail.com>
26539 * config/i386/x86-tune-costs.h (skylake_memcpy): Updated.
26540 (skylake_memset): Likewise.
26541 (skylake_cost): Change CLEAR_RATIO to 17.
26542 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
26543 Replace m_CANNONLAKE, m_ICELAKE_CLIENT, m_ICELAKE_SERVER,
26544 m_TIGERLAKE and m_SAPPHIRERAPIDS with m_SKYLAKE and m_CORE_AVX512.
26546 2021-04-06 Richard Biener <rguenther@suse.de>
26548 PR tree-optimization/99880
26549 * tree-vect-loop.c (maybe_set_vectorized_backedge_value): Only
26550 set vectorized defs of relevant PHIs.
26552 2021-04-06 Richard Biener <rguenther@suse.de>
26554 PR tree-optimization/99924
26555 * tree-vect-slp.c (vect_bb_partition_graph_r): Do not mark
26556 nodes w/o scalar stmts as visited.
26558 2021-04-06 Alex Coplan <alex.coplan@arm.com>
26561 * config/arm/arm.c (arm_libcall_uses_aapcs_base): Also use base
26562 PCS for [su]fix_optab.
26564 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
26566 * config/darwin.c (machopic_legitimize_pic_address): Check
26567 that the current pic register is one of the hard reg set
26568 before setting liveness.
26570 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
26572 * config/darwin.c (machopic_legitimize_pic_address): Fix
26573 whitespace, remove unused code.
26575 2021-04-03 Jakub Jelinek <jakub@redhat.com>
26577 PR tree-optimization/99882
26578 * gimple-ssa-store-merging.c (bswap_view_convert): Handle val with
26581 2021-04-03 Jakub Jelinek <jakub@redhat.com>
26583 PR rtl-optimization/99863
26584 * dse.c (replace_read): Drop regs_live argument. Instead of
26585 regs_live, use store_insn->fixed_regs_live if non-NULL,
26586 otherwise punt if insns sequence clobbers or sets any hard
26589 2021-04-03 Jakub Jelinek <jakub@redhat.com>
26592 * targhooks.h (default_print_patchable_function_entry_1): Declare.
26593 * targhooks.c (default_print_patchable_function_entry_1): New function,
26594 copied from default_print_patchable_function_entry with an added flags
26596 (default_print_patchable_function_entry): Rewritten into a small
26597 wrapper around default_print_patchable_function_entry_1.
26598 * config/rs6000/rs6000.c (TARGET_ASM_PRINT_PATCHABLE_FUNCTION_ENTRY):
26600 (rs6000_print_patchable_function_entry): New function.
26602 2021-04-02 Eric Botcazou <ebotcazou@adacore.com>
26604 * doc/invoke.texi (fdelete-dead-exceptions): Minor tweak.
26606 2021-04-01 Jason Merrill <jason@redhat.com>
26609 * common.opt: Document v15 and v16.
26611 2021-04-01 Richard Biener <rguenther@suse.de>
26613 PR tree-optimization/99863
26614 * gimplify.c (gimplify_init_constructor): Recompute vector
26617 2021-04-01 Jakub Jelinek <jakub@redhat.com>
26619 * doc/extend.texi (symver attribute): Fix up syntax errors
26622 2021-04-01 Jakub Jelinek <jakub@redhat.com>
26624 PR tree-optimization/96573
26625 * gimple-ssa-store-merging.c (init_symbolic_number): Handle
26626 also pointer types.
26628 2021-04-01 Richard Biener <rguenther@suse.de>
26630 PR tree-optimization/99856
26631 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Promote
26632 precision to vector element precision.
26634 2021-04-01 Martin Jambor <mjambor@suse.cz>
26636 PR tree-optimization/97009
26637 * tree-sra.c (access_or_its_child_written): New function.
26638 (propagate_subaccesses_from_rhs): Use it instead of a simple grp_write
26641 2021-03-31 Jan Hubicka <hubicka@ucw.cz>
26644 * cif-code.def (USES_COMDAT_LOCAL): Make CIF_FINAL_NORMAL.
26646 2021-03-31 Pat Haugen <pthaugen@linux.ibm.com>
26649 * config/rs6000/altivec.md (xxspltiw_v4si, xxspltiw_v4sf_inst,
26650 xxspltidp_v2df_inst, xxsplti32dx_v4si_inst, xxsplti32dx_v4sf_inst,
26651 xxblend_<mode>, xxpermx_inst, xxeval): Mark prefixed.
26652 * config/rs6000/mma.md (mma_<vvi4i4i8>, mma_<avvi4i4i8>,
26653 mma_<vvi4i4i2>, mma_<avvi4i4i2>, mma_<vvi4i4>, mma_<avvi4i4>,
26654 mma_<pvi4i2>, mma_<apvi4i2>, mma_<vvi4i4i4>, mma_<avvi4i4i4>):
26656 * config/rs6000/rs6000.c (rs6000_final_prescan_insn): Adjust test.
26657 * config/rs6000/rs6000.md (define_attr "maybe_prefixed"): New.
26658 (define_attr "prefixed"): Update initializer.
26660 2021-03-31 Jakub Jelinek <jakub@redhat.com>
26663 * dwarf2out.c (debug_ranges_dwo_section): New variable.
26664 (DW_RANGES_IDX_SKELETON): Define.
26665 (struct dw_ranges): Add begin_entry and end_entry members.
26666 (DEBUG_DWO_RNGLISTS_SECTION): Define.
26667 (add_ranges_num): Adjust r initializer for addition of *_entry
26669 (add_ranges_by_labels): For -gsplit-dwarf and force_direct,
26670 set idx to DW_RANGES_IDX_SKELETON.
26671 (use_distinct_base_address_for_range): New function.
26672 (index_rnglists): Don't set r->idx if it is equal to
26673 DW_RANGES_IDX_SKELETON. Initialize r->begin_entry and
26674 r->end_entry for -gsplit-dwarf if those will be needed by
26676 (output_rnglists): Add DWO argument. If true, switch to
26677 debug_ranges_dwo_section rather than debug_ranges_section.
26678 Adjust l1/l2 label indexes. Only output the offset table when
26679 dwo is true and don't include in there the skeleton range
26680 entry if present. For -gsplit-dwarf, skip ranges that belong
26681 to the other rnglists section. Change return type from void
26682 to bool and return true if there are any range entries for
26683 the other section. For dwarf_split_debug_info use
26684 DW_RLE_startx_endx, DW_RLE_startx_length and DW_RLE_base_addressx
26685 entries instead of DW_RLE_start_end, DW_RLE_start_length and
26686 DW_RLE_base_address. Use use_distinct_base_address_for_range.
26687 (init_sections_and_labels): Initialize debug_ranges_dwo_section
26688 if -gsplit-dwarf and DWARF >= 5. Adjust ranges_section_label
26689 and range_base_label indexes.
26690 (dwarf2out_finish): Call index_rnglists earlier before finalizing
26691 .debug_addr. Never emit DW_AT_rnglists_base attribute. For
26692 -gsplit-dwarf and DWARF >= 5 call output_rnglists up to twice
26693 with different dwo arguments.
26694 (dwarf2out_c_finalize): Clear debug_ranges_dwo_section.
26696 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
26698 PR tree-optimization/98268
26699 * gimple-fold.c (maybe_canonicalize_mem_ref_addr): Call
26700 recompute_tree_invariant_for_addr_expr after successfully
26701 folding a TARGET_MEM_REF that occurs inside an ADDR_EXPR.
26703 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
26705 PR tree-optimization/99726
26706 * tree-data-ref.c (create_intersect_range_checks_index): Bail
26707 out if there is more than one access function SCEV for the loop
26710 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
26712 PR rtl-optimization/97141
26713 PR rtl-optimization/98726
26714 * emit-rtl.c (valid_for_const_vector_p): Return true for
26716 * rtx-vector-builder.h (rtx_vector_builder::step): Return a
26717 poly_wide_int instead of a wide_int.
26718 (rtx_vector_builder::apply_set): Take a poly_wide_int instead
26720 * rtx-vector-builder.c (rtx_vector_builder::apply_set): Likewise.
26721 * config/aarch64/aarch64.c (aarch64_legitimate_constant_p): Return
26722 false for CONST_VECTORs that cannot be forced to memory.
26723 * config/aarch64/aarch64-simd.md (mov<mode>): If a CONST_VECTOR
26724 is too complex to force to memory, build it up from individual
26727 2021-03-31 Jan Hubicka <jh@suse.cz>
26730 * cgraph.c (cgraph_node::release_body): Fix overactive check.
26732 2021-03-31 Christophe Lyon <christophe.lyon@linaro.org>
26735 * config/arm/vec-common.md (mul<mode>3): Disable on iwMMXT, expect
26738 2021-03-31 H.J. Lu <hjl.tools@gmail.com>
26740 * config/i386/i386-expand.c (expand_set_or_cpymem_via_rep):
26741 For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, don't convert QImode
26743 (decide_alg): For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, use
26744 "rep movsb/stosb" only for known sizes.
26745 * config/i386/i386-options.c (processor_cost_table): Use Ice
26746 Lake cost for Cannon Lake, Ice Lake, Tiger Lake, Sapphire
26747 Rapids and Alder Lake.
26748 * config/i386/i386.h (TARGET_PREFER_KNOWN_REP_MOVSB_STOSB): New.
26749 * config/i386/x86-tune-costs.h (icelake_memcpy): New.
26750 (icelake_memset): Likewise.
26751 (icelake_cost): Likewise.
26752 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
26755 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
26758 * config/aarch64/aarch64.c
26759 (aarch64_vectorize_preferred_vector_alignment): Query the size
26760 of the provided SVE vector; do not assume that all SVE vectors
26761 have the same size.
26763 2021-03-31 Jan Hubicka <jh@suse.cz>
26766 * cgraph.c (cgraph_node::release_body): Remove all callers and
26768 * cgraphclones.c (cgraph_node::materialize_clone): Do not do it here.
26769 * cgraphunit.c (cgraph_node::expand): And here.
26771 2021-03-31 Martin Liska <mliska@suse.cz>
26773 * ipa-modref.c (analyze_ssa_name_flags): Fix coding style
26774 and one negated condition.
26776 2021-03-31 Jakub Jelinek <jakub@redhat.com>
26777 Richard Sandiford <richard.sandiford@arm.com>
26780 * config/aarch64/aarch64.md (*add<mode>3_poly_1): Swap Uai and Uav
26781 constraints on operands[2] and similarly 0 and rk constraints
26782 on operands[1] corresponding to that.
26784 2021-03-31 Jakub Jelinek <jakub@redhat.com>
26787 * configure.ac (HAVE_LD_BROKEN_PE_DWARF5): New AC_DEFINE if PECOFF
26788 linker doesn't support DWARF sections new in DWARF5.
26789 * config/i386/i386-options.c (ix86_option_override_internal): Default
26790 to dwarf_version 4 if HAVE_LD_BROKEN_PE_DWARF5 for TARGET_PECOFF
26792 * config.in: Regenerated.
26793 * configure: Regenerated.
26795 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26798 * config/aarch64/aarch64.c (aarch64_analyze_loop_vinfo): Check for
26799 available issue_info before using it.
26801 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26804 * config/aarch64/aarch64.md (sub<mode>3_compare1_imm): Do not allow zero
26807 2021-03-30 Xionghu Luo <luoxhu@linux.ibm.com>
26810 * config/rs6000/altivec.md (altivec_lvsl_reg): Change to ...
26811 (altivec_lvsl_reg_<mode>): ... this.
26812 (altivec_lvsr_reg): Change to ...
26813 (altivec_lvsr_reg_<mode>): ... this.
26814 * config/rs6000/predicates.md (vec_set_index_operand): New.
26815 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
26816 Enable 32bit variable vec_insert for all TARGET_VSX.
26817 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
26818 Enable 32bit variable vec_insert for p9 and above.
26819 (rs6000_expand_vector_set_var_p8): Rename to ...
26820 (rs6000_expand_vector_set_var_p7): ... this.
26821 (rs6000_expand_vector_set): Use TARGET_VSX and adjust assert
26823 * config/rs6000/vector.md (vec_set<mode>): Use vec_set_index_operand.
26824 * config/rs6000/vsx.md (xl_len_r): Use gen_altivec_lvsl_reg_di and
26825 gen_altivec_lvsr_reg_di.
26827 2021-03-30 H.J. Lu <hjl.tools@gmail.com>
26830 * config/i386/ia32intrin.h (__rdtsc): Defined as macro.
26831 (__rdtscp): Likewise.
26833 2021-03-30 Tamar Christina <tamar.christina@arm.com>
26835 PR tree-optimization/99825
26836 * tree-vect-slp-patterns.c (vect_check_evenodd_blend):
26837 Reject non-mult 2 lanes.
26839 2021-03-30 Richard Earnshaw <rearnsha@arm.com>
26842 * config/arm/arm.c (arm_file_start): Fix emission of
26843 Tag_ABI_VFP_args attribute.
26845 2021-03-30 Richard Biener <rguenther@suse.de>
26847 PR tree-optimization/99824
26848 * stor-layout.c (set_min_and_max_values_for_integral_type):
26849 Assert the precision is within the bounds of
26850 WIDE_INT_MAX_PRECISION.
26851 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Use
26852 the outermost component ref only to lower the access size
26853 and initialize that from the access type.
26855 2021-03-30 Richard Sandiford <richard.sandiford@arm.com>
26858 * config/aarch64/aarch64.md (mov<mode>): Pass multi-instruction
26859 CONST_INTs to aarch64_expand_mov_immediate when called after RA.
26861 2021-03-30 Mihailo Stojanovic <mihailo.stojanovic@typhoon-hil.com>
26863 * config/aarch64/aarch64.md
26864 (<optab>_trunc<fcvt_target><GPI:mode>2): Set the "arch"
26865 attribute to disambiguate between SIMD and FP variants of the
26868 2021-03-29 Jan Hubicka <hubicka@ucw.cz>
26870 * ipa-modref.c (merge_call_lhs_flags): Correct handling of deref.
26871 (analyze_ssa_name_flags): Fix typo in comment.
26873 2021-03-29 Alex Coplan <alex.coplan@arm.com>
26876 * config/aarch64/aarch64-sve-builtins.cc
26877 (function_builder::add_function): Add placeholder_p argument, use
26878 placeholder decls if this is set.
26879 (function_builder::add_unique_function): Instead of conditionally adding
26880 direct overloads, unconditionally add either a direct overload or a
26882 (function_builder::add_overloaded_function): Set placeholder_p if we're
26883 using C++ overloads. Use the obstack for string storage instead
26884 of relying on the tree nodes.
26885 (function_builder::add_overloaded_functions): Don't return early for
26886 m_direct_overloads: we need to add placeholders.
26887 * config/aarch64/aarch64-sve-builtins.h
26888 (function_builder::add_function): Add placeholder_p argument.
26890 2021-03-29 Richard Biener <rguenther@suse.de>
26892 PR tree-optimization/99807
26893 * tree-vect-slp.c (vect_slp_analyze_node_operations_1): Move
26894 assert below VEC_PERM handling.
26896 2021-03-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26899 * config/aarch64/aarch64-simd.md (move_lo_quad_internal_<mode>): Use
26900 aarch64_simd_or_scalar_imm_zero to match zeroes. Remove pattern
26901 matching const_int 0.
26902 (move_lo_quad_internal_be_<mode>): Likewise.
26903 (move_lo_quad_<mode>): Update for the above.
26904 * config/aarch64/iterators.md (VQ_2E): Delete.
26906 2021-03-29 Jakub Jelinek <jakub@redhat.com>
26908 PR tree-optimization/99777
26909 * fold-const.c (extract_muldiv_1): For conversions, punt on casts from
26910 types other than scalar integral types.
26912 2021-03-28 David Edelsohn <dje.gcc@gmail.com>
26914 * config/rs6000/rs6000.c (rs6000_output_dwarf_dtprel): Do not add
26915 XCOFF TLS reloc decorations.
26917 2021-03-28 Gerald Pfeifer <gerald@pfeifer.com>
26919 * doc/analyzer.texi (Analyzer Internals): Update link to
26920 "A Memory Model for Static Analysis of C Programs".
26922 2021-03-26 David Edelsohn <dje.gcc@gmail.com>
26924 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Call function.
26925 * config/rs6000/rs6000-protos.h (rs6000_special_adjust_field_align):
26927 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align): New.
26928 (rs6000_special_round_type_align): Recursively check innermost first
26931 2021-03-26 Jakub Jelinek <jakub@redhat.com>
26934 * dwarf2out.h (struct dw_fde_node): Add rule18 member.
26935 * dwarf2cfi.c (dwarf2out_frame_debug_expr): When handling (set hfp sp)
26936 assignment with drap_reg active, queue reg save for hfp with offset 0
26937 and flush queued reg saves. When handling a push with rule18,
26938 defer queueing reg save for hfp and just assert the offset is 0.
26939 (scan_trace): Assert that fde->rule18 is false.
26941 2021-03-26 Vladimir Makarov <vmakarov@redhat.com>
26944 * ira-costs.c (record_reg_classes): Put case with
26945 CT_RELAXED_MEMORY adjacent to one with CT_MEMORY.
26946 * ira.c (ira_setup_alts): Ditto.
26947 * lra-constraints.c (process_alt_operands): Ditto.
26948 * recog.c (asm_operand_ok): Ditto.
26949 * reload.c (find_reloads): Ditto.
26951 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
26953 * config/aarch64/aarch64-protos.h
26954 (cpu_addrcost_table::post_modify_ld3_st3): New member variable.
26955 (cpu_addrcost_table::post_modify_ld4_st4): Likewise.
26956 * config/aarch64/aarch64.c (generic_addrcost_table): Update
26957 accordingly, using the same costs as for post_modify.
26958 (exynosm1_addrcost_table, xgene1_addrcost_table): Likewise.
26959 (thunderx2t99_addrcost_table, thunderx3t110_addrcost_table):
26960 (tsv110_addrcost_table, qdf24xx_addrcost_table): Likewise.
26961 (a64fx_addrcost_table): Likewise.
26962 (neoversev1_addrcost_table): New.
26963 (neoversev1_tunings): Use neoversev1_addrcost_table.
26964 (aarch64_address_cost): Use the new post_modify costs for CImode
26967 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
26969 * config/aarch64/aarch64.opt
26970 (-param=aarch64-loop-vect-issue-rate-niters=): New parameter.
26971 * doc/invoke.texi: Document it.
26972 * config/aarch64/aarch64-protos.h (aarch64_base_vec_issue_info)
26973 (aarch64_scalar_vec_issue_info, aarch64_simd_vec_issue_info)
26974 (aarch64_advsimd_vec_issue_info, aarch64_sve_vec_issue_info)
26975 (aarch64_vec_issue_info): New structures.
26976 (cpu_vector_cost): Write comments above the variables rather
26978 (cpu_vector_cost::issue_info): New member variable.
26979 * config/aarch64/aarch64.c: Include gimple-pretty-print.h
26980 and tree-ssa-loop-niter.h.
26981 (generic_vector_cost, a64fx_vector_cost, qdf24xx_vector_cost)
26982 (thunderx_vector_cost, tsv110_vector_cost, cortexa57_vector_cost)
26983 (exynosm1_vector_cost, xgene1_vector_cost, thunderx2t99_vector_cost)
26984 (thunderx3t110_vector_cost): Initialize issue_info to null.
26985 (neoversev1_scalar_issue_info, neoversev1_advsimd_issue_info)
26986 (neoversev1_sve_issue_info, neoversev1_vec_issue_info): New structures.
26987 (neoversev1_vector_cost): Use them.
26988 (aarch64_vec_op_count, aarch64_sve_op_count): New structures.
26989 (aarch64_vector_costs::saw_sve_only_op): New member variable.
26990 (aarch64_vector_costs::num_vector_iterations): Likewise.
26991 (aarch64_vector_costs::scalar_ops): Likewise.
26992 (aarch64_vector_costs::advsimd_ops): Likewise.
26993 (aarch64_vector_costs::sve_ops): Likewise.
26994 (aarch64_vector_costs::seen_loads): Likewise.
26995 (aarch64_simd_vec_costs_for_flags): New function.
26996 (aarch64_analyze_loop_vinfo): Initialize num_vector_iterations.
26997 Count the number of predicate operations required by SVE WHILE
26999 (aarch64_comparison_type, aarch64_multiply_add_p): New functions.
27000 (aarch64_sve_only_stmt_p, aarch64_in_loop_reduction_latency): Likewise.
27001 (aarch64_count_ops): Likewise.
27002 (aarch64_add_stmt_cost): Record whether see an SVE operation
27003 that cannot currently be implementing using Advanced SIMD.
27004 Record issue information about the scalar, Advanced SIMD
27005 and (where relevant) SVE versions of a loop.
27006 (aarch64_vec_op_count::dump): New function.
27007 (aarch64_sve_op_count::dump): Likewise.
27008 (aarch64_estimate_min_cycles_per_iter): Likewise.
27009 (aarch64_adjust_body_cost): If issue information is available,
27010 try to compare the issue rates of the various loop implementations
27011 and increase or decrease the vector body cost accordingly.
27013 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27015 * config/aarch64/aarch64.c (aarch64_detect_vector_stmt_subtype):
27016 Assume a zero cost for induction phis.
27018 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27020 * config/aarch64/aarch64.c (aarch64_embedded_comparison_type): New
27022 (aarch64_adjust_stmt_cost): Add the costs of embedded scalar and
27023 vector comparisons.
27025 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27027 * config/aarch64/aarch64.c (aarch64_detect_scalar_stmt_subtype):
27029 (aarch64_add_stmt_cost): Call it.
27031 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27033 * config/aarch64/aarch64-tuning-flags.def (matched_vector_throughput):
27034 New tuning parameter.
27035 * config/aarch64/aarch64.c (neoversev1_tunings): Use it.
27036 (aarch64_estimated_sve_vq): New function.
27037 (aarch64_vector_costs::analyzed_vinfo): New member variable.
27038 (aarch64_vector_costs::is_loop): Likewise.
27039 (aarch64_vector_costs::unrolled_advsimd_niters): Likewise.
27040 (aarch64_vector_costs::unrolled_advsimd_stmts): Likewise.
27041 (aarch64_record_potential_advsimd_unrolling): New function.
27042 (aarch64_analyze_loop_vinfo, aarch64_analyze_bb_vinfo): Likewise.
27043 (aarch64_add_stmt_cost): Call aarch64_analyze_loop_vinfo or
27044 aarch64_analyze_bb_vinfo on the first use of a costs structure.
27045 Detect whether we're vectorizing a loop for SVE that might be
27046 completely unrolled if it used Advanced SIMD instead.
27047 (aarch64_adjust_body_cost_for_latency): New function.
27048 (aarch64_finish_cost): Call it.
27050 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27052 * config/aarch64/aarch64.c (aarch64_vector_costs): New structure.
27053 (aarch64_init_cost): New function.
27054 (aarch64_add_stmt_cost): Use aarch64_vector_costs instead of
27055 the default unsigned[3].
27056 (aarch64_finish_cost, aarch64_destroy_cost_data): New functions.
27057 (TARGET_VECTORIZE_INIT_COST): Override.
27058 (TARGET_VECTORIZE_FINISH_COST): Likewise.
27059 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
27061 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27063 * config/aarch64/aarch64.c (neoversev1_advsimd_vector_cost)
27064 (neoversev1_sve_vector_cost): New cost structures.
27065 (neoversev1_vector_cost): Likewise.
27066 (neoversev1_tunings): Use them. Enable use_new_vector_costs.
27068 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27070 * config/aarch64/aarch64-protos.h
27071 (sve_vec_cost::scatter_store_elt_cost): New member variable.
27072 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
27073 accordingly, taking the cost from the cost of a scalar_store.
27074 (a64fx_sve_vector_cost): Likewise.
27075 (aarch64_detect_vector_stmt_subtype): Detect scatter stores.
27077 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27079 * config/aarch64/aarch64-protos.h
27080 (simd_vec_cost::store_elt_extra_cost): New member variable.
27081 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
27082 accordingly, using the vec_to_scalar cost for the new field.
27083 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
27084 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
27085 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
27086 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
27087 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
27088 (thunderx3t110_advsimd_vector_cost): Likewise.
27089 (aarch64_detect_vector_stmt_subtype): Detect single-element stores.
27091 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27093 * config/aarch64/aarch64-protos.h (simd_vec_cost::ld2_st2_permute_cost)
27094 (simd_vec_cost::ld3_st3_permute_cost): New member variables.
27095 (simd_vec_cost::ld4_st4_permute_cost): Likewise.
27096 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
27097 accordingly, using zero for the new costs.
27098 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
27099 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
27100 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
27101 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
27102 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
27103 (thunderx3t110_advsimd_vector_cost): Likewise.
27104 (aarch64_ld234_st234_vectors): New function.
27105 (aarch64_adjust_stmt_cost): Likewise.
27106 (aarch64_add_stmt_cost): Call aarch64_adjust_stmt_cost if using
27107 the new vector costs.
27109 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27111 * config/aarch64/aarch64-protos.h (sve_vec_cost): Turn into a
27112 derived class of simd_vec_cost. Add information about CLAST[AB]
27113 and FADDA instructions.
27114 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
27115 accordingly, using the vec_to_scalar costs for the new fields.
27116 (a64fx_sve_vector_cost): Likewise.
27117 (aarch64_reduc_type): New function.
27118 (aarch64_sve_in_loop_reduction_latency): Likewise.
27119 (aarch64_detect_vector_stmt_subtype): Take a vinfo parameter.
27120 Use aarch64_sve_in_loop_reduction_latency to handle SVE reductions
27121 that occur in the loop body.
27122 (aarch64_add_stmt_cost): Update call accordingly.
27124 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27126 * config/aarch64/aarch64-tuning-flags.def (use_new_vector_costs):
27128 * config/aarch64/aarch64-protos.h (simd_vec_cost): Put comments
27129 above the fields rather than to the right.
27130 (simd_vec_cost::reduc_i8_cost): New member variable.
27131 (simd_vec_cost::reduc_i16_cost): Likewise.
27132 (simd_vec_cost::reduc_i32_cost): Likewise.
27133 (simd_vec_cost::reduc_i64_cost): Likewise.
27134 (simd_vec_cost::reduc_f16_cost): Likewise.
27135 (simd_vec_cost::reduc_f32_cost): Likewise.
27136 (simd_vec_cost::reduc_f64_cost): Likewise.
27137 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
27138 accordingly, using the vec_to_scalar_cost for the new fields.
27139 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
27140 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
27141 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
27142 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
27143 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
27144 (thunderx3t110_advsimd_vector_cost): Likewise.
27145 (aarch64_use_new_vector_costs_p): New function.
27146 (aarch64_simd_vec_costs): New function, split out from...
27147 (aarch64_builtin_vectorization_cost): ...here.
27148 (aarch64_is_reduction): New function.
27149 (aarch64_detect_vector_stmt_subtype): Likewise.
27150 (aarch64_add_stmt_cost): Call aarch64_detect_vector_stmt_subtype if
27151 using the new vector costs.
27153 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
27156 * tree-emutls.c (get_emutls_init_templ_addr): Mark initializer of weak
27157 TLS declarations as public.
27159 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
27161 * config/aarch64/aarch64-d.c (IN_TARGET_CODE): Define.
27162 * config/arm/arm-d.c (IN_TARGET_CODE): Likewise.
27163 * config/i386/i386-d.c (IN_TARGET_CODE): Likewise.
27164 * config/mips/mips-d.c (IN_TARGET_CODE): Likewise.
27165 * config/pa/pa-d.c (IN_TARGET_CODE): Likewise.
27166 * config/riscv/riscv-d.c (IN_TARGET_CODE): Likewise.
27167 * config/rs6000/rs6000-d.c (IN_TARGET_CODE): Likewise.
27168 * config/s390/s390-d.c (IN_TARGET_CODE): Likewise.
27169 * config/sparc/sparc-d.c (IN_TARGET_CODE): Likewise.
27171 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
27174 * config.gcc (*-*-cygwin*): Add winnt-d.o
27175 (*-*-mingw*): Likewise.
27176 * config/i386/cygwin.h (EXTRA_TARGET_D_OS_VERSIONS): New macro.
27177 * config/i386/mingw32.h (EXTRA_TARGET_D_OS_VERSIONS): Likewise.
27178 * config/i386/t-cygming: Add winnt-d.o.
27179 * config/i386/winnt-d.c: New file.
27181 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
27183 * config/freebsd-d.c: Include memmodel.h.
27185 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
27188 * config.gcc (*-*-openbsd*): Add openbsd-d.o.
27189 * config/t-openbsd: Add openbsd-d.o.
27190 * config/openbsd-d.c: New file.
27192 2021-03-25 Stam Markianos-Wright <stam.markianos-wright@arm.com>
27194 PR tree-optimization/96974
27195 * tree-vect-stmts.c (vect_get_vector_types_for_stmt): Replace assert
27196 with graceful exit.
27198 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
27201 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
27205 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
27206 always_inline in system headers.
27208 2021-03-25 Kewen Lin <linkw@linux.ibm.com>
27210 * tree-vect-loop.c (vect_model_reduction_cost): Init inside_cost.
27212 2021-03-25 Jakub Jelinek <jakub@redhat.com>
27215 * tree-core.h (enum operand_equal_flag): Add OEP_ADDRESS_OF_SAME_FIELD.
27216 * fold-const.c (operand_compare::operand_equal_p): Don't compare
27217 field offsets if OEP_ADDRESS_OF_SAME_FIELD.
27219 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
27223 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
27224 always_inline in system headers.
27226 2021-03-25 Richard Biener <rguenther@suse.de>
27228 PR tree-optimization/99746
27229 * tree-vect-slp-patterns.c (complex_pattern::build): Do not mark
27230 the scalar stmt as patterned. Instead set up required things
27233 2021-03-25 Xionghu Luo <luoxhu@linux.ibm.com>
27235 * config/rs6000/rs6000.c (power8_costs): Change l2 cache
27238 2021-03-24 Martin Liska <mliska@suse.cz>
27241 * common/config/i386/i386-common.c (ARRAY_SIZE): Fix off-by-one
27243 * config/i386/i386-options.c (ix86_option_override_internal):
27244 Add run-time assert.
27246 2021-03-24 Martin Jambor <mjambor@suse.cz>
27249 * ipa-cp.c (initialize_node_lattices): Mark as bottom all
27250 parameters with unknown type.
27251 (ipacp_value_safe_for_type): New function.
27252 (propagate_vals_across_arith_jfunc): Verify that the constant type
27253 can be used for a type of the formal parameter.
27254 (propagate_vals_across_ancestor): Likewise.
27255 (propagate_scalar_across_jump_function): Likewise. Pass the type
27256 also to propagate_vals_across_ancestor.
27258 2021-03-24 Christophe Lyon <christophe.lyon@linaro.org>
27261 * config/arm/mve.md (movmisalign<mode>_mve_store): Use Ux
27263 (movmisalign<mode>_mve_load): Likewise.
27265 2021-03-24 Jakub Jelinek <jakub@redhat.com>
27268 * config/arm/vec-common.md (one_cmpl<mode>2, neg<mode>2,
27269 movmisalign<mode>): Disable expanders for TARGET_REALLY_IWMMXT.
27271 2021-03-24 Alexandre Oliva <oliva@adacore.com>
27273 * doc/sourcebuild.texi (sysconf): New effective target.
27275 2021-03-24 Alexandre Oliva <oliva@adacore.com>
27277 * config/i386/predicates.md (reg_or_const_vec_operand): New.
27278 * config/i386/sse.md (ssse3_pshufbv8qi3): Add an expander for
27279 the now *-prefixed insn_and_split, turn the splitter const vec
27280 into an input for the insn, making it an ignored immediate for
27281 non-split cases, and loaded into the scratch register
27284 2021-03-23 Vladimir N. Makarov <vmakarov@redhat.com>
27287 * config/aarch64/constraints.md (Utq, UOb, UOh, UOw, UOd, UOty):
27288 Use define_relaxed_memory_constraint for them.
27290 2021-03-23 Iain Sandoe <iain@sandoe.co.uk>
27293 * config/host-darwin.c (darwin_gt_pch_use_address): Add a
27294 colon to the diagnostic message.
27296 2021-03-23 Ilya Leoshkevich <iii@linux.ibm.com>
27298 * fwprop.c (fwprop_propagation::fwprop_propagation): Look at
27300 (try_fwprop_subst_note): Use set_info instead of insn_info.
27301 (try_fwprop_subst_pattern): Likewise.
27302 (try_fwprop_subst_notes): Likewise.
27303 (try_fwprop_subst): Likewise.
27304 (forward_propagate_subreg): Likewise.
27305 (forward_propagate_and_simplify): Likewise.
27306 (forward_propagate_into): Likewise.
27307 * rtl-ssa/accesses.h (set_info::single_nondebug_use) New
27309 (set_info::single_nondebug_insn_use): Likewise.
27310 (set_info::single_phi_use): Likewise.
27311 * rtl-ssa/member-fns.inl (set_info::single_nondebug_use) New
27313 (set_info::single_nondebug_insn_use): Likewise.
27314 (set_info::single_phi_use): Likewise.
27316 2021-03-23 Christophe Lyon <christophe.lyon@linaro.org>
27318 * doc/sourcebuild.texi (arm_dsp_ok, arm_dsp): Document.
27320 2021-03-23 Jakub Jelinek <jakub@redhat.com>
27323 * config/aarch64/aarch64.c (aarch64_add_offset): Tell
27324 expand_mult to perform an unsigned rather than a signed
27327 2021-03-23 H.J. Lu <hjl.tools@gmail.com>
27330 * config/i386/cpuid.h (__cpuid): Add __volatile__.
27331 (__cpuid_count): Likewise.
27333 2021-03-23 Richard Biener <rguenther@suse.de>
27335 PR tree-optimization/99721
27336 * tree-vect-slp.c (vect_slp_analyze_node_operations):
27337 Make sure we can schedule the node.
27339 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
27341 * config/riscv/riscv.c (riscv_subword): Take endianness into
27342 account when calculating the byte offset.
27344 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
27346 * config/riscv/predicates.md (subreg_lowpart_operator): New predicate
27347 * config/riscv/riscv.md (*addsi3_extended2, *subsi3_extended2)
27348 (*negsi2_extended2, *mulsi3_extended2, *<optab>si3_mask)
27349 (*<optab>si3_mask_1, *<optab>di3_mask, *<optab>di3_mask_1)
27350 (*<optab>si3_extend_mask, *<optab>si3_extend_mask_1): Use
27351 new predicate "subreg_lowpart_operator"
27353 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
27355 * config/riscv/riscv.c (riscv_swap_instruction): New function
27356 to byteswap an SImode rtx containing an instruction.
27357 (riscv_trampoline_init): Byteswap the generated instructions
27360 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
27362 * common/config/riscv/riscv-common.c
27363 (TARGET_DEFAULT_TARGET_FLAGS): Set default endianness.
27364 * config.gcc (riscv32be-*, riscv64be-*): Set
27365 TARGET_BIG_ENDIAN_DEFAULT to 1.
27366 * config/riscv/elf.h (LINK_SPEC): Change -melf* value
27367 depending on default endianness.
27368 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
27369 * config/riscv/linux.h (LINK_SPEC): Likewise.
27370 * config/riscv/riscv.c (TARGET_DEFAULT_TARGET_FLAGS): Set
27371 default endianness.
27372 * config/riscv/riscv.h (DEFAULT_ENDIAN_SPEC): New macro.
27374 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
27376 * config/riscv/elf.h (LINK_SPEC): Pass linker endianness flag.
27377 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
27378 * config/riscv/linux.h (LINK_SPEC): Likewise.
27379 * config/riscv/riscv.h (ASM_SPEC): Pass -mbig-endian and
27381 (BYTES_BIG_ENDIAN): Handle big endian.
27382 (WORDS_BIG_ENDIAN): Define to BYTES_BIG_ENDIAN.
27383 * config/riscv/riscv.opt (-mbig-endian, -mlittle-endian): New
27385 * doc/invoke.texi (-mbig-endian, -mlittle-endian): Document.
27387 2021-03-23 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
27389 * regcprop.c (find_oldest_value_reg): Ask target whether
27390 different mode is fine for replacement register.
27392 2021-03-23 Aldy Hernandez <aldyh@redhat.com>
27394 PR tree-optimization/99296
27395 * value-range.cc (irange::irange_set_1bit_anti_range): New.
27396 (irange::irange_set_anti_range): Call irange_set_1bit_anti_range
27397 * value-range.h (irange::irange_set_1bit_anti_range): New.
27399 2021-03-22 Vladimir N. Makarov <vmakarov@redhat.com>
27402 * config/aarch64/constraints.md (UtQ): Use
27403 define_relaxed_memory_constraint for it.
27404 * doc/md.texi (define_relaxed_memory_constraint): Describe it.
27405 * genoutput.c (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
27406 * genpreds.c (constraint_data): Add bitfield is_relaxed_memory.
27407 (have_relaxed_memory_constraints): New static var.
27408 (relaxed_memory_start, relaxed_memory_end): Ditto.
27409 (add_constraint): Add arg is_relaxed_memory. Check name for
27410 relaxed memory. Set up is_relaxed_memory in constraint_data and
27411 have_relaxed_memory_constraints. Adjust calls.
27412 (choose_enum_order): Process relaxed memory.
27413 (write_tm_preds_h): Ditto.
27414 (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
27415 * gensupport.c (process_rtx): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
27416 * ira-costs.c (record_reg_classes): Process CT_RELAXED_MEMORY.
27417 * ira-lives.c (single_reg_class): Use
27418 insn_extra_relaxed_memory_constraint.
27419 * ira.c (ira_setup_alts): CT_RELAXED_MEMORY.
27420 * lra-constraints.c (valid_address_p): Use
27421 insn_extra_relaxed_memory_constraint instead of other memory
27423 (process_alt_operands): Process CT_RELAXED_MEMORY.
27424 (curr_insn_transform): Use insn_extra_relaxed_memory_constraint.
27425 * recog.c (asm_operand_ok, preprocess_constraints): Process
27427 * reload.c (find_reloads): Ditto.
27428 * rtl.def (DEFINE_RELAXED_MEMORY_CONSTRAINT): New.
27429 * stmt.c (parse_input_constraint): Use
27430 insn_extra_relaxed_memory_constraint.
27432 2021-03-22 Segher Boessenkool <segher@kernel.crashing.org>
27435 * ubsan.c (ubsan_instrument_float_cast): Don't test for unordered if
27438 2021-03-22 Alex Coplan <alex.coplan@arm.com>
27441 * config/arm/arm-protos.h (neon_make_constant): Add generate
27442 argument to guard emitting insns, default to true.
27443 * config/arm/arm.c (arm_legitimate_constant_p_1): Reject
27444 CONST_VECTORs which neon_make_constant can't handle.
27445 (neon_vdup_constant): Add generate argument, avoid emitting
27446 insns if it's not set.
27447 (neon_make_constant): Plumb new generate argument through.
27448 * config/arm/constraints.md (Ui): New. Use it...
27449 * config/arm/mve.md (*mve_mov<mode>): ... here.
27450 * config/arm/vec-common.md (movv8hf): Use neon_make_constant to
27451 synthesize constants.
27453 2021-03-22 Richard Biener <rguenther@suse.de>
27455 * debug.h: Add deprecation warning.
27457 2021-03-22 Richard Biener <rguenther@suse.de>
27459 PR tree-optimization/99694
27460 * tree-ssa-sccvn.c (visit_phi): Ignore edges with the
27463 2021-03-22 Kito Cheng <kito.cheng@sifive.com>
27466 * config/riscv/riscv.c (riscv_expand_block_move): Get RTL value
27467 after type checking.
27469 2021-03-22 Jakub Jelinek <jakub@redhat.com>
27473 * dwarf2out.c (get_full_len): Use get_precision rather than
27475 (add_const_value_attribute): Make sure add_AT_wide argument has
27476 precision prec rather than some very wide one.
27478 2021-03-22 Kewen Lin <linkw@linux.ibm.com>
27480 * config/rs6000/rs6000.md (*rotldi3_insert_sf,
27481 *mov<SFDF:mode><SFDF2:mode>cc_p9, floatsi<mode>2_lfiwax,
27482 floatsi<mode>2_lfiwax_mem, floatunssi<mode>2_lfiwzx,
27483 floatunssi<mode>2_lfiwzx_mem, *floatsidf2_internal,
27484 *floatunssidf2_internal, fix_trunc<mode>si2_stfiwx,
27485 fix_trunc<mode>si2_internal, fixuns_trunc<mode>si2_stfiwx,
27486 *round32<mode>2_fprs, *roundu32<mode>2_fprs,
27487 *fix_trunc<mode>si2_internal): Fix empty split condition.
27488 * config/rs6000/vsx.md (*vsx_le_undo_permute_<mode>,
27489 vsx_reduc_<VEC_reduc_name>_v2df, vsx_reduc_<VEC_reduc_name>_v4sf,
27490 *vsx_reduc_<VEC_reduc_name>_v2df_scalar,
27491 *vsx_reduc_<VEC_reduc_name>_v4sf_scalar): Likewise.
27493 2021-03-22 Xionghu Luo <luoxhu@linux.ibm.com>
27496 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
27497 Convert idx to DImode.
27498 (rs6000_expand_vector_set_var_p8): Likewise.
27500 2021-03-21 Jakub Jelinek <jakub@redhat.com>
27503 * dwarf2out.c (insert_float): Change return type from void to
27504 unsigned, handle GET_MODE_SIZE (mode) == 2 and return element size.
27505 (mem_loc_descriptor, loc_descriptor, add_const_value_attribute):
27508 2021-03-20 H.J. Lu <hjl.tools@gmail.com>
27511 * config/i386/i386.c (construct_container): Check cfun != NULL
27512 before accessing silent_p.
27514 2021-03-20 Ahamed Husni <ahamedhusni73@gmail.com>
27516 * asan.c: Fix typos in comments.
27518 2021-03-20 Vladimir N. Makarov <vmakarov@redhat.com>
27520 PR rtl-optimization/99680
27521 * lra-constraints.c (skip_contraint_modifiers): Rename to skip_constraint_modifiers.
27522 (process_address_1): Check empty constraint before using
27525 2021-03-19 Pat Haugen <pthaugen@linux.ibm.com>
27527 * config/rs6000/rs6000.c (power10_cost): New.
27528 (rs6000_option_override_internal): Set Power10 costs.
27529 (rs6000_issue_rate): Set Power10 issue rate.
27530 * config/rs6000/power10.md: Rewrite for Power10.
27532 2021-03-19 Vladimir N. Makarov <vmakarov@redhat.com>
27535 * lra-constraints.c (process_address_1): Don't use unknown
27536 constraint for address constraint.
27538 2021-03-19 Iain Sandoe <iain@sandoe.co.uk>
27541 * config.gcc (powerpc-*-darwin8): Delete the reference to
27542 the now removed darwin8.h.
27544 2021-03-19 Olivier Hainque <hainque@adacore.com>
27547 * config/vxworksae.h (VX_CPU_PREFIX): Define.
27549 2021-03-19 John David Anglin <danglin@gcc.gnu.org>
27551 * config/pa/pa.c (import_milli): Use memcpy instead of strncpy.
27553 2021-03-19 Tamar Christina <tamar.christina@arm.com>
27555 PR tree-optimization/99656
27556 * tree-vect-slp-patterns.c (linear_loads_p,
27557 complex_add_pattern::matches, is_eq_or_top,
27558 vect_validate_multiplication, complex_mul_pattern::matches,
27559 complex_fms_pattern::matches): Remove complex_perm_kinds_t.
27560 * tree-vectorizer.h: (complex_load_perm_t): Removed.
27561 (slp_tree_to_load_perm_map_t): Use complex_perm_kinds_t instead of
27562 complex_load_perm_t.
27564 2021-03-19 H.J. Lu <hjl.tools@gmail.com>
27567 * config/i386/i386-options.c (ix86_init_machine_status): Set
27569 * config/i386/i386.c (init_cumulative_args): Set silent_p to
27571 (construct_container): Return early for return and argument
27572 errors if silent_p is true.
27573 * config/i386/i386.h (machine_function): Add silent_p.
27575 2021-03-19 Jakub Jelinek <jakub@redhat.com>
27578 * config/arm/constraints.md (Ds): New constraint.
27579 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): Use w,Ds
27580 constraint instead of w,Dm.
27582 2021-03-19 Andrew Stubbs <ams@codesourcery.com>
27584 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Fix quotes
27587 2021-03-19 Eric Botcazou <ebotcazou@adacore.com>
27589 PR middle-end/99641
27590 * fold-const.c (native_encode_initializer) <CONSTRUCTOR>: For an
27591 array type, do the computation of the current position in sizetype.
27593 2021-03-18 Vladimir N. Makarov <vmakarov@redhat.com>
27596 * lra-constraints.c (process_address_1): Use lookup_constraint
27597 only for a single constraint.
27599 2021-03-18 Martin Sebor <msebor@redhat.com>
27601 PR middle-end/99502
27602 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): Rename...
27603 (inbounds_memaccess_p): ...to this. Check the ending offset of
27604 the accessed member.
27606 2021-03-18 Andrew Stubbs <ams@codesourcery.com>
27608 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Add %< and
27609 %> quote markers to error messages.
27610 (gcn_goacc_validate_dims): Likewise.
27611 (gcn_conditional_register_usage): Remove exclaimation mark from error
27613 (gcn_vectorize_vec_perm_const): Ensure perm is fully uninitialized.
27615 2021-03-18 Jan Hubicka <hubicka@ucw.cz>
27617 * config/i386/x86-tune-costs.h (struct processor_costs): Fix costs of
27620 2021-03-18 Sinan Lin <sinan@isrc.iscas.ac.cn>
27621 Kito Cheng <kito.cheng@sifive.com>
27623 * config/riscv/riscv.c (riscv_block_move_straight): Change type
27624 to unsigned HOST_WIDE_INT for parameter and local variable with
27625 HOST_WIDE_INT type.
27626 (riscv_adjust_block_mem): Ditto.
27627 (riscv_block_move_loop): Ditto.
27628 (riscv_expand_block_move): Ditto.
27630 2021-03-18 Nick Clifton <nickc@redhat.com>
27632 * config/v850/v850.c (construct_restore_jr): Increase static
27634 (construct_save_jarl): Likewise.
27635 * config/v850/v850.h (DWARF2_DEBUGGING_INFO): Define.
27637 2021-03-18 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27639 * config/aarch64/aarch64.c (aarch64_adjust_generic_arch_tuning): Define.
27640 (aarch64_override_options_internal): Use it.
27641 (generic_tunings): Add AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS to
27644 2021-03-17 Sandra Loosemore <sandra@codesourcery.com>
27646 * config/nios2/nios2.c (nios2_custom_check_insns): Clean up
27647 error message format issues.
27648 (nios2_option_override): Likewise.
27649 (nios2_expand_fpu_builtin): Likewise.
27650 (nios2_init_custom_builtins): Adjust to avoid bogus strncpy
27651 truncation warning.
27652 (nios2_expand_custom_builtin): More error message format fixes.
27653 (nios2_expand_rdwrctl_builtin): Likewise.
27654 (nios2_expand_rdprs_builtin): Likewise.
27655 (nios2_expand_eni_builtin): Likewise.
27656 (nios2_expand_builtin): Likewise.
27657 (nios2_register_custom_code): Likewise.
27658 (nios2_valid_target_attribute_rec): Likewise.
27659 (nios2_add_insn_asm): Fix uninitialized variable warning.
27661 2021-03-17 Jan Hubicka <jh@suse.cz>
27663 * config/i386/x86-tune-costs.h (struct processor_costs): Update costs
27664 of gather to match reality.
27665 * config/i386/x86-tune.def (X86_TUNE_USE_GATHER): Enable for znver3.
27667 2021-03-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27669 * config/aarch64/aarch64-builtins.c (aarch64_expand_rng_builtin): Use EQ
27670 to compare against CC_REG rather than NE.
27672 2021-03-17 H.J. Lu <hjl.tools@gmail.com>
27675 * config/i386/i386.c (ix86_force_load_from_GOT_p): Support
27676 inline assembly statements.
27677 (ix86_print_operand): Update 'P' handling for -fno-plt.
27679 2021-03-17 Tamar Christina <tamar.christina@arm.com>
27682 * config/aarch64/aarch64.c
27683 (aarch64_simd_clone_compute_vecsize_and_simdlen): Remove unused var.
27685 2021-03-16 Segher Boessenkool <segher@kernel.crashing.org>
27688 * config/rs6000/predicates.md (branch_comparison_operator): Allow
27689 ordered and unordered for CCFPmode, if flag_finite_math_only.
27691 2021-03-16 Jakub Jelinek <jakub@redhat.com>
27694 * config/i386/i386-expand.c (ix86_split_lea_for_addr): Emit a MULT
27695 rather than ASHIFT.
27696 * config/i386/i386.md (mult by 1248 into ashift): New splitter.
27698 2021-03-16 Martin Liska <mliska@suse.cz>
27701 * optc-save-gen.awk: Add flag_ipa_ra to exceptions for
27702 cl_optimization_compare function.
27704 2021-03-16 Ilya Leoshkevich <iii@linux.ibm.com>
27706 * config/s390/s390.c (f_constraint_p): Treat "fv" constraints
27709 2021-03-16 Jakub Jelinek <jakub@redhat.com>
27712 * config/i386/i386.h (struct machine_function): Add
27713 has_explicit_vzeroupper bitfield.
27714 * config/i386/i386-expand.c (ix86_expand_builtin): Set
27715 cfun->machine->has_explicit_vzeroupper when expanding
27716 IX86_BUILTIN_VZEROUPPER.
27717 * config/i386/i386-features.c (rest_of_handle_insert_vzeroupper):
27718 Do the mode switching only when TARGET_VZEROUPPER, expensive
27719 optimizations turned on and not optimizing for size.
27720 (pass_insert_vzeroupper::gate): Enable even when
27721 cfun->machine->has_explicit_vzeroupper is set.
27723 2021-03-16 Jakub Jelinek <jakub@redhat.com>
27726 * config/aarch64/aarch64.c
27727 (aarch64_simd_clone_compute_vecsize_and_simdlen): If not a function
27728 definition, walk TYPE_ARG_TYPES list if non-NULL for argument types
27729 instead of DECL_ARGUMENTS. Ignore types for uniform arguments.
27731 2021-03-15 Richard Biener <rguenther@suse.de>
27733 PR tree-optimization/98834
27734 * tree-ssa-sccvn.c (vn_reference_lookup_3): Handle missing
27735 subsetting by truncating the access size.
27737 2021-03-15 Jan Hubicka <hubicka@ucw.cz>
27739 * config/i386/i386-options.c (processor_cost_table): Add znver3_cost.
27740 * config/i386/x86-tune-costs.h (znver3_cost): New gobal variable; copy
27743 2021-03-15 Martin Liska <mliska@suse.cz>
27745 * spellcheck.c: Add missing comma in initialization.
27747 2021-03-14 Uroš Bizjak <ubizjak@gmail.com>
27749 * config/i386/sse.md (*vec_extract<mode>): Merge alternative 0 with
27750 alternative 2 and alternative 1 with alternative 3 using
27751 YW register constraint.
27752 (*vec_extract<PEXTR_MODE12:mode>_zext): Merge alternatives
27753 using YW register constraint.
27754 (*vec_extractv16qi_zext): Ditto.
27755 (*vec_extractv4si): Merge alternatives 4 and 5
27756 using Yw register constraint.
27757 (*ssse3_palignr<mode>_perm): Use Yw instead of v for alternative 3.
27759 2021-03-13 Martin Sebor <msebor@redhat.com>
27761 PR tree-optimization/99489
27762 * builtins.c (gimple_call_alloc_size): Fail gracefully when argument
27763 is not a call statement.
27765 2021-03-13 Jakub Jelinek <jakub@redhat.com>
27767 PR tree-optimization/99544
27768 * match.pd (X + (X << C) -> X * (1 + (1 << C))): Don't simplify
27769 if for vector types multiplication can't be done in type's mode.
27771 2021-03-12 Eric Botcazou <ebotcazou@adacore.com>
27774 * config/sparc/constraints.md (w): Rename to...
27775 (W): ... this and ditch previous implementation.
27776 * config/sparc/sparc.md (*movdi_insn_sp64): Replace W with m.
27777 (*movdf_insn_sp64): Likewise.
27778 (*mov<VM64:mode>_insn_sp64): Likewise.
27779 * config/sparc/sync.md (*atomic_compare_and_swap<mode>_1): Replace
27781 (atomic_compare_and_swap_leon3_1): Likewise.
27782 (*atomic_compare_and_swapdi_v8plus): Likewise.
27783 * config/sparc/sparc.c (memory_ok_for_ldd): Remove useless test on
27784 architecture and add missing address validity check during LRA.
27786 2021-03-12 Tobias Burnus <tobias@codesourcery.com>
27789 * gimplify.c (omp_add_variable): Handle NULL_TREE as size
27790 occuring for assumed-size arrays in use_device_{ptr,addr}.
27792 2021-03-12 Jakub Jelinek <jakub@redhat.com>
27795 * config/i386/constraints.md (YW): New internal constraint.
27796 * config/i386/sse.md (v_Yw): Add V4TI, V2TI, V1TI and TI cases.
27797 (*<sse2_avx2>_<insn><mode>3<mask_name>,
27798 *<sse2_avx2>_uavg<mode>3<mask_name>, *abs<mode>2,
27799 *<s>mul<mode>3_highpart<mask_name>): Use <v_Yw> instead of v in
27801 (<sse2_avx2>_psadbw): Use YW instead of v in constraints.
27802 (*avx2_pmaddwd, *sse2_pmaddwd, *<code>v8hi3, *<code>v16qi3,
27803 avx2_pmaddubsw256, ssse3_pmaddubsw128): Merge last two alternatives
27804 into one, use Yw instead of former x,v.
27805 (ashr<mode>3, <insn><mode>3): Use <v_Yw> instead of x in constraints of
27806 the last alternative.
27807 (<sse2_avx2>_packsswb<mask_name>, <sse2_avx2>_packssdw<mask_name>,
27808 <sse2_avx2>_packuswb<mask_name>, <sse4_1_avx2>_packusdw<mask_name>,
27809 *<ssse3_avx2>_pmulhrsw<mode>3<mask_name>, <ssse3_avx2>_palignr<mode>,
27810 <ssse3_avx2>_pshufb<mode>3<mask_name>): Merge last two alternatives
27811 into one, use <v_Yw> instead of former x,v.
27812 (avx2_interleave_highv32qi<mask_name>,
27813 vec_interleave_highv16qi<mask_name>): Use Yw instead of v in
27814 constraints. Add && <mask_avx512bw_condition> to condition.
27815 (avx2_interleave_lowv32qi<mask_name>,
27816 vec_interleave_lowv16qi<mask_name>,
27817 avx2_interleave_highv16hi<mask_name>,
27818 vec_interleave_highv8hi<mask_name>,
27819 avx2_interleave_lowv16hi<mask_name>, vec_interleave_lowv8hi<mask_name>,
27820 avx2_pshuflw_1<mask_name>, sse2_pshuflw_1<mask_name>,
27821 avx2_pshufhw_1<mask_name>, sse2_pshufhw_1<mask_name>,
27822 avx2_<code>v16qiv16hi2<mask_name>, sse4_1_<code>v8qiv8hi2<mask_name>,
27823 *sse4_1_<code>v8qiv8hi2<mask_name>_1, <sse2_avx2>_<insn><mode>3): Use
27824 Yw instead of v in constraints.
27825 * config/i386/mmx.md (Yv_Yw): New define_mode_attr.
27826 (*mmx_<insn><mode>3, mmx_ashr<mode>3, mmx_<insn><mode>3): Use <Yv_Yw>
27827 instead of Yv in constraints.
27828 (*mmx_<insn><mode>3, *mmx_mulv4hi3, *mmx_smulv4hi3_highpart,
27829 *mmx_umulv4hi3_highpart, *mmx_pmaddwd, *mmx_<code>v4hi3,
27830 *mmx_<code>v8qi3, mmx_pack<s_trunsuffix>swb, mmx_packssdw,
27831 mmx_punpckhbw, mmx_punpcklbw, mmx_punpckhwd, mmx_punpcklwd,
27832 *mmx_uavgv8qi3, *mmx_uavgv4hi3, mmx_psadbw): Use Yw instead of Yv in
27834 (*mmx_pinsrw, *mmx_pinsrb, *mmx_pextrw, *mmx_pextrw_zext, *mmx_pextrb,
27835 *mmx_pextrb_zext): Use YW instead of Yv in constraints.
27836 (*mmx_eq<mode>3, mmx_gt<mode>3): Use x instead of Yv in constraints.
27837 (mmx_andnot<mode>3, *mmx_<code><mode>3): Split last alternative into
27838 two, one with just x, another isa avx512vl with v.
27840 2021-03-12 Martin Liska <mliska@suse.cz>
27842 * doc/invoke.texi: Add missing param documentation.
27844 2021-03-11 David Malcolm <dmalcolm@redhat.com>
27847 * Makefile.in (ANALYZER_OBJS): Add analyzer/feasible-graph.o and
27848 analyzer/trimmed-graph.o.
27849 * doc/analyzer.texi (Analyzer Paths): Rewrite description of
27850 feasibility checking to reflect new implementation.
27851 * doc/invoke.texi (-fdump-analyzer-feasibility): Document new
27853 * shortest-paths.h (shortest_paths::get_shortest_distance): New.
27855 2021-03-11 David Malcolm <dmalcolm@redhat.com>
27857 * digraph.cc (selftest::test_shortest_paths): Update
27858 shortest_paths init for new param. Add test of
27859 SPS_TO_GIVEN_TARGET.
27860 * shortest-paths.h (enum shortest_path_sense): New.
27861 (shortest_paths::shortest_paths): Add "sense" param.
27862 Update for renamings. Generalize to use "sense" param.
27863 (shortest_paths::get_shortest_path): Rename param.
27864 (shortest_paths::m_sense): New field.
27865 (shortest_paths::m_prev): Rename...
27866 (shortest_paths::m_best_edge): ...to this.
27867 (shortest_paths::get_shortest_path): Update for renamings.
27868 Conditionalize flipping of path on sense of traversal.
27870 2021-03-11 David Malcolm <dmalcolm@redhat.com>
27872 * digraph.cc (selftest::test_shortest_paths): Add test coverage
27873 for paths from B and C.
27874 * shortest-paths.h (shortest_paths::shortest_paths): Handle
27875 unreachable nodes, rather than asserting.
27877 2021-03-11 David Edelsohn <dje.gcc@gmail.com>
27880 * config/rs6000/rs6000.c (rs6000_xcoff_file_start): Don't create
27881 xcoff_tbss_section_name.
27882 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .lcomm.
27883 * xcoffout.c (xcoff_tbss_section_name): Delete.
27884 * xcoffout.h (xcoff_tbss_section_name): Delete.
27886 2021-03-11 Richard Biener <rguenther@suse.de>
27888 PR tree-optimization/99523
27889 * tree-cfg.c (dump_function_to_file): Dump SSA names
27890 w/o identifier to the decls section as well, not only those
27891 without a VAR_DECL.
27893 2021-03-11 Jakub Jelinek <jakub@redhat.com>
27896 * ipa-icf-gimple.c (func_checker::compare_gimple_call): For internal
27897 function calls with lhs fail if the lhs don't have compatible types.
27899 2021-03-11 Hans-Peter Nilsson <hp@axis.com>
27901 * config/cris/cris.h (HARD_FRAME_POINTER_REGNUM): Define.
27902 Change FRAME_POINTER_REGNUM to correspond to a new faked
27903 register faked_fp, part of GENNONACR_REGS like faked_ap.
27904 (CRIS_FAKED_REGS_CONTENTS): New helper macro.
27905 (FIRST_PSEUDO_REGISTER, FIXED_REGISTERS, CALL_USED_REGISTERS):
27906 (REG_ALLOC_ORDER, REG_CLASS_CONTENTS, REGNO_OK_FOR_BASE_P)
27907 (ELIMINABLE_REGS, REGISTER_NAMES): Adjust accordingly.
27908 * config/cris/cris.md (CRIS_FP_REGNUM): Renumber to new faked
27910 (CRIS_REAL_FP_REGNUM): New constant.
27911 * config/cris/cris.c (cris_reg_saved_in_regsave_area): Check
27912 for HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM.
27913 (cris_initial_elimination_offset): Handle elimination changes
27914 to HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM
27915 and add one from FRAME_POINTER_REGNUM to
27916 HARD_FRAME_POINTER_REGNUM.
27917 (cris_expand_prologue, cris_expand_epilogue): Emit code for
27918 hard_frame_pointer_rtx instead of frame_pointer_rtx.
27920 2021-03-10 David Edelsohn <dje.gcc@gmail.com>
27923 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Add check for DCmode.
27924 * config/rs6000/rs6000.c (rs6000_special_round_type_align): Same.
27926 2021-03-10 Vladimir N. Makarov <vmakarov@redhat.com>
27929 * lra-constraints.c (process_address_1): Don't check unknown
27930 constraint, use X for empty constraint.
27932 2021-03-10 Alex Coplan <alex.coplan@arm.com>
27934 * config/aarch64/aarch64.c (aarch64_vfp_is_call_or_return_candidate):
27935 Fix typo in comment describing "is_ha" argument.
27937 2021-03-10 John David Anglin <danglin@gcc.gnu.org>
27939 * doc/sourcebuild.texi: Document LRA target selector.
27941 2021-03-10 David Malcolm <dmalcolm@redhat.com>
27943 * doc/ux.texi: Add subsection contrasting interactive versus
27944 batch usage of GCC.
27946 2021-03-10 Joel Hutton <joel.hutton@arm.com>
27949 * tree-vect-stmts.c (vectorizable_store): Fix scatter store mask
27951 (vectorizable_load): Fix gather load mask check condition.
27953 2021-03-10 Richard Biener <rguenther@suse.de>
27955 PR tree-optimization/99510
27956 * tree.c (check_aligned_type): Check that the candidate
27957 has TYPE_USER_ALIGN set instead of matching with the
27960 2021-03-10 Eric Botcazou <ebotcazou@adacore.com>
27962 * config/sparc/sparc.c (sparc_regmode_natural_size): Return 4 for
27963 float and vector integer modes only if the mode is not larger.
27965 2021-03-10 Hans-Peter Nilsson <hp@axis.com>
27967 * config/cris/cris.h (DWARF_FRAME_REGISTERS): Define.
27969 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
27971 * ira.c (ira_setup_alts, ira_get_dup_out_num): Process digital
27973 * ira-lives.c (single_reg_class): Ditto.
27975 2021-03-09 Sebastian Huber <sebastian.huber@embedded-brains.de>
27977 * config.gcc (aarch64-*-rtems*): Include general rtems.h after
27978 the architecture-specific rtems.h.
27979 (aarch64-*-rtems*): Likewise.
27980 (arm*-*-rtems*): Likewise.
27981 (epiphany-*-rtems*): Likewise.
27982 (riscv*-*-rtems*): Likewise.
27984 2021-03-09 Jakub Jelinek <jakub@redhat.com>
27986 PR tree-optimization/99305
27987 * tree-ssa-phiopt.c (conditional_replacement): Test integer_pow2p
27988 before integer_all_onesp instead of vice versa.
27990 2021-03-09 Richard Earnshaw <rearnsha@arm.com>
27992 * common/config/arm/arm-common.c (arm_config_default): Change type
27993 of 'i' to unsigned.
27995 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
27998 * lra-constraints.c (process_address_1): Process constraint 'g'
27999 separately and digital constraints containing more one digit.
28001 2021-03-09 Nick Clifton <nickc@redhat.com>
28003 * config/rx/rx.h (DBX_DEBUGGING_INFO): Define.
28004 (DWARF"_DEBUGGING_INFO): Define.
28006 2021-03-09 Eric Botcazou <ebotcazou@adacore.com>
28009 * calls.c (initialize_argument_information): When the argument
28010 is passed by reference, do not make a copy in a thunk only if
28011 the argument is already in memory. Remove redundant test for
28012 the case of callee copy.
28014 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
28017 * lra-constraints.c (process_address_1): Process 0..9 constraints
28018 in process_address_1.
28020 2021-03-09 Andreas Krebbel <krebbel@linux.ibm.com>
28022 * config/s390/s390.c (struct s390_processor processor_table):
28023 Binutils name string must not be empty.
28025 2021-03-09 Claudiu Zissulescu <claziss@synopsys.com>
28027 * config/arc/arc.c (arc_attr_type): Remove function.
28029 2021-03-09 Martin Liska <mliska@suse.cz>
28032 * config/i386/i386-options.c (ix86_option_override_internal):
28033 Set isa_flags for OPTS argument and not for the global
28036 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
28038 * config/rs6000/predicates.md (ds_form_mem_operand): Check
28041 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
28044 * config/rs6000/predicates.md (ds_form_mem_operand) New
28046 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10) Use
28047 ds_form_mem_operand in ld/lwa patterns.
28048 * config/rs6000/fusion.md: Regenerate file.
28050 2021-03-08 Martin Sebor <msebor@redhat.com>
28052 PR middle-end/98266
28053 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): New function.
28054 (array_bounds_checker::check_array_bounds): Call it.
28056 2021-03-08 Martin Sebor <msebor@redhat.com>
28058 PR middle-end/97631
28059 * tree-ssa-strlen.c (maybe_warn_overflow): Test rawmem.
28060 (handle_builtin_stxncpy_strncat): Rename locals. Determine
28061 destination size from allocation calls. Issue a more appropriate
28063 (handle_builtin_memcpy): Pass true as rawmem to maybe_warn_overflow.
28064 (handle_builtin_memset): Same.
28066 2021-03-08 Peter Bergner <bergner@linux.ibm.com>
28069 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_permute): Add an assert
28070 to ensure we do not have an Altivec style address.
28071 * config/rs6000/vsx.md (*vsx_le_perm_load_<mode>): Disable if passed
28072 an Altivec style address.
28073 (*vsx_le_perm_store_<mode>): Likewise.
28074 (splitters after *vsx_le_perm_store_<mode>): Likewise.
28075 (vsx_load_<mode>): Disable special expander if passed an Altivec
28077 (vsx_store_<mode>): Likewise.
28079 2021-03-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28082 * config/aarch64/predicates.md (aarch64_simd_shift_imm_vec_qi): Define.
28083 (aarch64_simd_shift_imm_vec_hi): Likewise.
28084 (aarch64_simd_shift_imm_vec_si): Likewise.
28085 (aarch64_simd_shift_imm_vec_di): Likewise.
28086 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le): Use
28087 predicate from above.
28088 (aarch64_shrn<mode>_insn_be): Likewise.
28089 (aarch64_rshrn<mode>_insn_le): Likewise.
28090 (aarch64_rshrn<mode>_insn_be): Likewise.
28091 (aarch64_shrn2<mode>_insn_le): Likewise.
28092 (aarch64_shrn2<mode>_insn_be): Likewise.
28093 (aarch64_rshrn2<mode>_insn_le): Likewise.
28094 (aarch64_rshrn2<mode>_insn_be): Likewise.
28096 2021-03-08 Vladimir N. Makarov <vmakarov@redhat.com>
28099 * lra-constraints.c (skip_contraint_modifiers): New function.
28100 (process_address_1): Use it before lookup_constraint call.
28102 2021-03-08 Martin Liska <mliska@suse.cz>
28105 * config/i386/i386-options.c (ix86_option_override_internal):
28106 Enable UINTR and HRESET for -march that supports it.
28108 2021-03-08 Ilya Leoshkevich <iii@linux.ibm.com>
28110 * config/s390/s390.c (f_constraint_p): New function.
28111 (s390_md_asm_adjust): Implement TARGET_MD_ASM_ADJUST.
28112 (TARGET_MD_ASM_ADJUST): Likewise.
28114 2021-03-08 Tobias Burnus <tobias@codesourcery.com>
28117 * tree-nested.c (convert_local_reference_stmt): Avoid calling
28118 lookup_field_for_decl for Fortran module (= namespace context).
28120 2021-03-08 Andreas Krebbel <krebbel@linux.ibm.com>
28122 * config/s390/s390.c (s390_expand_vec_compare): Implement <0
28123 comparison with arithmetic right shift.
28124 (s390_expand_vcond): No need for a force_reg anymore.
28125 s390_vec_compare will do it.
28126 * config/s390/vector.md ("vec_cmp<mode><tointvec>"): Accept also
28127 immediate operands.
28129 2021-03-07 Jakub Jelinek <jakub@redhat.com>
28132 * config/i386/constraints.md (Yw): Use SSE_REGS if TARGET_SSE
28133 but TARGET_AVX512BW or TARGET_AVX512VL is not set. Adjust description
28135 * config/i386/sse.md (v_Yw): New define_mode_attr.
28136 (*<insn><mode>3, *mul<mode>3<mask_name>, *avx2_<code><mode>3,
28137 *sse4_1_<code><mode>3<mask_name>): Use <v_Yw> instead of v
28139 * config/i386/mmx.md (mmx_pshufw_1, *vec_dupv4hi): Use Yw instead of
28140 xYw in constraints.
28142 2021-03-06 Julian Brown <julian@codesourcery.com>
28144 * tree-pretty-print.c (dump_generic_node): Emit non-generic
28145 address space info for aggregates.
28147 2021-03-06 Hans-Peter Nilsson <hp@axis.com>
28149 * config/cris/cris.h (MAX_FIXED_MODE_SIZE): Don't define.
28151 2021-03-05 Jakub Jelinek <jakub@redhat.com>
28153 PR middle-end/99322
28154 * tree-cfg.c (bb_to_omp_idx): New variable.
28155 (execute_build_cfg): Release the bb_to_omp_idx vector after
28156 cleanup_tree_cfg returns.
28157 (handle_abnormal_edges): Remove bb_to_omp_idx argument, adjust
28158 for bb_to_omp_idx being a vec<int> instead of pointer to array
28160 (make_edges): Remove bb_to_omp_idx local variable, don't pass
28161 it to handle_abnormal_edges, adjust for bb_to_omp_idx being a
28162 vec<int> instead of pointer to array of ints and don't free/release
28164 (remove_bb): When removing a bb and placing forced label somewhere
28165 else, ensure it is put into the same OpenMP region during cfg
28166 pass if possible or to entry successor as fallback. Unregister
28167 bb from bb_to_omp_idx.
28169 2021-03-05 Vladimir N. Makarov <vmakarov@redhat.com>
28172 * lra-constraints.c (process_address_1): Skip decomposing address
28173 for asm insn operand with unknown constraint.
28175 2021-03-05 Martin Jambor <mjambor@suse.cz>
28178 * cgraph.c (cgraph_edge::set_call_stmt): Do not update all
28179 corresponding speculative edges if we are about to resolve
28180 sepculation. Make edge direct (and so resolve speculations) before
28181 removing it from call_site_hash.
28182 (cgraph_edge::make_direct): Relax the initial assert to allow calling
28183 the function on speculative direct edges.
28185 2021-03-05 Eric Botcazou <ebotcazou@adacore.com>
28187 PR rtl-optimization/99376
28188 * rtlanal.c (nonzero_bits1) <arithmetic operators>: If the number
28189 of low-order zero bits is too large, set the result to 0 directly.
28191 2021-03-04 Jakub Jelinek <jakub@redhat.com>
28193 PR middle-end/93235
28194 * expmed.c (store_bit_field_using_insv): Return false of xop0 is a
28195 SUBREG and a SUBREG to op_mode can't be created.
28197 2021-03-04 Alex Coplan <alex.coplan@arm.com>
28200 * config/aarch64/aarch64-sve-builtins.cc
28201 (function_resolver::require_vector_type): Handle error_mark_node.
28203 2021-03-04 Ilya Leoshkevich <iii@linux.ibm.com>
28205 * cfgexpand.c (expand_asm_loc): Pass new parameter.
28206 (expand_asm_stmt): Likewise.
28207 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Add new
28209 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
28210 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
28211 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
28212 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
28213 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
28214 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
28215 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
28216 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
28217 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
28218 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
28219 * doc/tm.texi (md_asm_adjust): Likewise.
28220 * target.def (md_asm_adjust): Likewise.
28222 2021-03-04 Richard Biener <rguenther@suse.de>
28224 PR middle-end/97855
28225 * tree-pretty-print.c: Poison pp_printf.
28226 (dump_decl_name): Avoid use of pp_printf.
28227 (dump_block_node): Likewise.
28228 (dump_generic_node): Likewise.
28230 2021-03-04 Martin Sebor <msebor@redhat.com>
28232 PR middle-end/96963
28233 PR middle-end/94655
28234 * builtins.c (handle_array_ref): New helper.
28235 (handle_mem_ref): New helper.
28236 (compute_objsize_r): Factor out ARRAY_REF and MEM_REF handling
28237 into new helper functions. Correct a workaround for vectorized
28240 2021-03-03 Pat Haugen <pthaugen@linux.ibm.com>
28242 * config/rs6000/dfp.md (extendddtd2, trunctddd2, *cmp<mode>_internal1,
28243 floatditd2, ftrunc<mode>2, fix<mode>di2, dfp_ddedpd_<mode>,
28244 dfp_denbcd_<mode>, dfp_dxex_<mode>, dfp_diex_<mode>,
28245 *dfp_sgnfcnc_<mode>, dfp_dscli_<mode>, dfp_dscri_<mode>): Update size
28246 attribute for Power10.
28247 * config/rs6000/mma.md (*movoo): Likewise.
28248 * config/rs6000/rs6000.md (define_attr "size"): Add 256.
28249 (define_mode_attr bits): Add DD/TD modes.
28250 * config/rs6000/sync.md (load_quadpti, store_quadpti, load_lockedpti,
28251 store_conditionalpti): Update size attribute for Power10.
28253 2021-03-03 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
28256 * config/sparc/t-sparc (tree-ssanames.o-warn): Don't error for
28257 -Wuninitialized, -Wmaybe-uninitialized.
28258 (wide-int.o-warn): Likewise.
28260 2021-03-03 Richard Earnshaw <rearnsha@arm.com>
28262 * common/config/arm/arm-common.c: Include configargs.h.
28263 (arm_config_default): New function.
28264 (arm_target_mode): Renamed from arm_target_thumb_only. Handle
28265 processors that do not support Thumb. Take into account the
28266 --with-mode configuration setting for selecting the default.
28267 * config/arm/arm.h (OPTION_DEFAULT_SPECS): Remove entry for 'mode'.
28268 (TARGET_MODE_SPEC_FUNCTIONS): Update for function name change.
28270 2021-03-03 Martin Liska <mliska@suse.cz>
28272 PR gcov-profile/97461
28273 * gcov-io.h (GCOV_PREALLOCATED_KVP): Remove.
28275 2021-03-03 Eric Botcazou <ebotcazou@adacore.com>
28278 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
28279 point back the hard frame pointer to its default location when the
28280 frame is larger than SEH_MAX_FRAME_SIZE.
28282 2021-03-03 Jakub Jelinek <jakub@redhat.com>
28285 * config/i386/predicates.md (logic_operator): New define_predicate.
28286 * config/i386/i386.md (mov + mem using comm arith peephole2):
28287 Punt if operands[1] is EXT_REX_SSE_REGNO_P, AVX512BW is not enabled
28288 and the inner mode is [QH]Imode.
28290 2021-03-03 Jakub Jelinek <jakub@redhat.com>
28293 * dwarf2out.c (dw_loc_list_struct): Add end_entry member.
28294 (new_loc_list): Clear end_entry.
28295 (output_loc_list): Only use DW_LLE_startx_length for -gsplit-dwarf
28296 if HAVE_AS_LEB128, otherwise use DW_LLE_startx_endx. Fix comment
28298 (index_location_lists): For dwarf_version >= 5 without HAVE_AS_LEB128,
28299 initialize also end_entry.
28301 2021-03-03 Jakub Jelinek <jakub@redhat.com>
28304 * cfgrtl.c (fixup_partitions): When changing some bbs from hot to cold
28305 partitions, if in non-layout mode after reorder_blocks also move
28306 affected blocks to ensure a single partition transition.
28308 2021-03-03 Jason Merrill <jason@redhat.com>
28311 * cgraphunit.c (process_function_and_variable_attributes): Don't
28312 warn about flatten on an alias if the target also has it.
28313 * cgraph.h (symtab_node::get_alias_target_tree): New.
28315 2021-03-02 David Edelsohn <dje.gcc@gmail.com>
28317 * config/rs6000/rs6000.md (tls_get_tpointer_internal): Prepend
28318 period to symbol name.
28319 (tls_get_addr_internal<mode>): Same.
28321 2021-03-02 David Malcolm <dmalcolm@redhat.com>
28324 * diagnostic-show-locus.c
28325 (selftest::test_one_liner_many_fixits_2): Fix accidental usage of
28328 2021-03-02 Martin Sebor <msebor@redhat.com>
28330 PR middle-end/99276
28331 * builtins.c (warn_for_access): Remove stray warning text.
28333 2021-03-02 Martin Sebor <msebor@redhat.com>
28335 PR middle-end/99295
28336 * doc/extend.texi (attribute malloc): Reword and clarify nonaliasing
28339 2021-03-02 Jakub Jelinek <jakub@redhat.com>
28342 * dwarf2out.c (output_macinfo_op): Use DW_MACRO_*_str* even with
28343 -gdwarf-5 -gstrict-dwarf. For -gsplit-dwarf -gdwarf-5 use
28344 DW_MACRO_*_strx instead of DW_MACRO_*_strp. Handle
28345 DW_MACRO_define_strx and DW_MACRO_undef_strx.
28346 (save_macinfo_strings): Use DW_MACRO_*_str* even with
28347 -gdwarf-5 -gstrict-dwarf. Handle DW_MACRO_define_strx and
28348 DW_MACRO_undef_strx.
28350 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
28352 * config/s390/s390-builtin-types.def (BT_FN_V4SF_V8HI_UINT): New
28354 (BT_FN_V8HI_V8HI_UINT): Likewise.
28355 (BT_FN_V8HI_V4SF_V4SF_UINT): Likewise.
28356 * config/s390/s390-builtins.def (B_NNPA): New macro definition.
28357 (s390_vclfnhs, s390_vclfnls, s390_vcrnfs, s390_vcfn, s390_vcnf):
28358 New builtin definitions.
28359 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Bump
28360 vector extension version.
28361 * config/s390/s390.c (s390_expand_builtin): Check if builtins are
28362 available with current -march level.
28363 * config/s390/s390.md (UNSPEC_NNPA_VCLFNHS_V8HI)
28364 (UNSPEC_NNPA_VCLFNLS_V8HI, UNSPEC_NNPA_VCRNFS_V8HI)
28365 (UNSPEC_NNPA_VCFN_V8HI, UNSPEC_NNPA_VCNF_V8HI): New constants.
28366 * config/s390/vecintrin.h (vec_extend_to_fp32_hi): New macro.
28367 (vec_extend_to_fp32_lo): Likewise.
28368 (vec_round_from_fp32): Likewise.
28369 (vec_convert_to_fp16): Likewise.
28370 (vec_convert_from_fp16): Likewise.
28371 * config/s390/vx-builtins.md (vclfnhs_v8hi): New insn pattern.
28372 (vclfnls_v8hi): Likewise.
28373 (vcrnfs_v8hi): Likewise.
28374 (vcfn_v8hi): Likewise.
28375 (vcnf_v8hi): Likewise.
28377 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
28379 * common/config/s390/s390-common.c (processor_flags_table): New entry.
28380 * config.gcc: Enable arch14 for --with-arch and --with-tune.
28381 * config/s390/driver-native.c (s390_host_detect_local_cpu): Pick
28382 arch14 for unknown CPU models.
28383 * config/s390/s390-opts.h (enum processor_type): Add PROCESSOR_ARCH14.
28384 * config/s390/s390.c (s390_issue_rate): Add case for PROCESSOR_ARCH14.
28385 (s390_get_sched_attrmask): Likewise.
28386 (s390_get_unit_mask): Likewise.
28387 * config/s390/s390.h (enum processor_flags): Add PF_NNPA and PF_ARCH14.
28388 (TARGET_CPU_ARCH14, TARGET_CPU_ARCH14_P, TARGET_CPU_NNPA)
28389 (TARGET_CPU_NNPA_P, TARGET_ARCH14, TARGET_ARCH14_P, TARGET_NNPA)
28390 (TARGET_NNPA_P): New macro definitions.
28391 * config/s390/s390.md ("cpu_facility", "enabled"): Add arch14 and nnpa.
28392 * config/s390/s390.opt: Add PROCESSOR_ARCH14.
28394 2021-03-02 Jakub Jelinek <jakub@redhat.com>
28396 PR middle-end/95757
28397 * tree-vrp.c (register_edge_assert_for): Remove superfluous ()s around
28398 condition. Call register_edge_assert_for_1 for == 0, != 0, == 1 and
28399 != 1 comparisons if name is lhs of a comparison.
28401 2021-03-01 Iain Sandoe <iain@sandoe.co.uk>
28405 * config/darwin-protos.h (darwin_should_restore_cfa_state): New.
28406 * config/darwin.c (darwin_should_restore_cfa_state): New.
28407 * config/darwin.h (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New.
28408 * doc/tm.texi: Regenerated.
28409 * doc/tm.texi.in: Document TARGET_ASM_SHOULD_RESTORE_CFA_STATE.
28410 * dwarf2cfi.c (connect_traces): If the target requests, restore
28411 the CFA expression after a DW_CFA_restore.
28412 * target.def (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New hook.
28414 2021-03-01 Martin Liska <mliska@suse.cz>
28417 * optc-save-gen.awk: Add 4 more exceptions.
28419 2021-03-01 Nathan Sidwell <nathan@acm.org>
28422 * tree.h (TYPE_ALIGN_RAW): New accessor.
28423 (TYPE_ALIGN): Use it.
28425 2021-03-01 Jan Hubicka <jh@suse.cz>
28428 * ipa-fnsummary.c (compute_fn_summary): Fix sanity check.
28430 2021-03-01 Eric Botcazou <ebotcazou@adacore.com>
28433 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
28434 point the hard frame pointer to the SSE register save area instead
28435 of the general register save area. Perform only minimal adjustment
28436 for small frames if it is initially not correctly aligned.
28437 (ix86_expand_prologue): Remove early saves for a SEH target.
28438 * config/i386/winnt.c (struct seh_frame_state): Document constraint.
28440 2021-02-28 Jakub Jelinek <jakub@redhat.com>
28443 * ipa.c (symbol_table::remove_unreachable_nodes): Fix a comment
28444 typo - referneced -> referenced.
28445 * tree.c (component_ref_size): Fix comment typo -
28446 refernce -> reference.
28447 * tree-ssa-alias.c (access_path_may_continue_p): Fix comment typo -
28448 traling -> trailing.
28449 (aliasing_component_refs_p): Fix comment typos -
28450 refernce -> reference and refernece -> reference and
28451 traling -> trailing.
28452 (nonoverlapping_refs_since_match_p): Fix comment typo -
28453 referneces -> references.
28454 * doc/invoke.texi (--param modref-max-bases): Fix a typo -
28455 referneces -> references.
28457 2021-02-27 Iain Sandoe <iain@sandoe.co.uk>
28459 * config/host-darwin.c (darwin_gt_pch_use_address): Modify
28460 diagnostic message to avoid use of a contraction and format
28463 2021-02-27 Jakub Jelinek <jakub@redhat.com>
28466 * gcse.c (gcse_or_cprop_is_too_expensive): Use %wu instead of
28467 HOST_WIDE_INT_PRINT_UNSIGNED in warning format string.
28468 * ipa-devirt.c (ipa_odr_read_section): Use %wd instead of
28469 HOST_WIDE_INT_PRINT_DEC in inform format string. Fix comment
28472 2021-02-26 Richard Biener <rguenther@suse.de>
28474 PR middle-end/99281
28475 * expr.c (store_field): For calls with return-slot optimization
28476 and addressable return type expand the store directly.
28478 2021-02-26 Richard Biener <rguenther@suse.de>
28481 * builtins.c (warn_string_no_nul): Fix diagnostic formatting.
28483 2021-02-26 Peter Bergner <bergner@linux.ibm.com>
28486 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Replace assert
28489 2021-02-26 Aaron Sawdey <acsawdey@linux.ibm.com>
28491 * config.gcc: Add rs6000-pcrel-opt.o.
28492 * config/rs6000/rs6000-pcrel-opt.c: New file.
28493 * config/rs6000/pcrel-opt.md: New file.
28494 * config/rs6000/predicates.md: Add d_form_memory predicate.
28495 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_PCREL_OPT.
28496 * config/rs6000/rs6000-passes.def: Add pass_pcrel_opt.
28497 * config/rs6000/rs6000-protos.h: Add reg_to_non_prefixed(),
28498 pcrel_opt_valid_mem_p(), output_pcrel_opt_reloc(),
28499 and make_pass_pcrel_opt().
28500 * config/rs6000/rs6000.c (reg_to_non_prefixed): Make global.
28501 (rs6000_option_override_internal): Add pcrel-opt.
28502 (rs6000_delegitimize_address): Support pcrel-opt.
28503 (rs6000_opt_masks): Add pcrel-opt.
28504 (pcrel_opt_valid_mem_p): New function.
28505 (reg_to_non_prefixed): Make global.
28506 (rs6000_asm_output_opcode): Reset prepend_p_to_next_insn.
28507 (output_pcrel_opt_reloc): New function.
28508 * config/rs6000/rs6000.md (loads_extern_addr): New attr.
28509 (pcrel_extern_addr): Set loads_extern_addr.
28510 Add include for pcrel-opt.md.
28511 * config/rs6000/rs6000.opt: Add -mpcrel-opt.
28512 * config/rs6000/t-rs6000: Add rules for pcrel-opt.c and
28515 2021-02-26 YunQiang Su <yunqiang.su@cipunited.com>
28518 * config/mips/mips.c (mips_expand_ext_as_unaligned_load):
28519 If TARGET_64BIT and dest is SUBREG, we check the width, if it
28520 equal to SImode, we use SImode operation, just like what we are
28523 2021-02-26 Marek Polacek <polacek@redhat.com>
28525 * builtins.c (warn_for_access): Fix typos.
28527 2021-02-25 Iain Sandoe <iain@sandoe.co.uk>
28529 * config/aarch64/aarch64.md (<optab>_rol<mode>3): Add a '#'
28530 mark in front of the immediate quantity.
28531 (<optab>_rolsi3_uxtw): Likewise.
28533 2021-02-25 Richard Earnshaw <rearnsha@arm.com>
28536 * config/arm/thumb2.md (nonsecure_call_reg_thumb2_fpcxt): New pattern.
28537 (nonsecure_call_value_reg_thumb2_fpcxt): Likewise.
28538 (nonsecure_call_reg_thumb2): Restrict to using r4 for the callee
28539 address and disable when the FPCXT is not available.
28540 (nonsecure_call_value_reg_thumb2): Likewise.
28542 2021-02-25 Nathan Sidwell <nathan@acm.org>
28545 * doc/invoke.texi (flang-info-module-cmi): Renamed option.
28547 2021-02-25 Tamar Christina <tamar.christina@arm.com>
28549 * tree-vect-slp.c (optimize_load_redistribution_1): Abort on NULL nodes.
28551 2021-02-25 Richard Biener <rguenther@suse.de>
28553 PR tree-optimization/99253
28554 * tree-vect-loop.c (check_reduction_path): First compute
28555 code, then verify out-of-loop uses.
28557 2021-02-25 Jakub Jelinek <jakub@redhat.com>
28560 * match.pd ((T)(A) + CST -> (T)(A + CST)): Add :s to convert.
28562 2021-02-25 Jakub Jelinek <jakub@redhat.com>
28564 PR tree-optimization/80635
28565 * tree-vrp.c (vrp_simplify_cond_using_ranges): Also handle
28566 VIEW_CONVERT_EXPR if modes are the same, innerop is integral and
28567 has mode precision.
28569 2021-02-25 Richard Biener <rguenther@suse.de>
28571 * tree-vect-slp.c (optimize_load_redistribution_1): Delay
28572 load_map population.
28573 (vect_match_slp_patterns_2): Revert part of last change.
28574 (vect_analyze_slp): Do not interleave optimize_load_redistribution
28575 with pattern detection but do it afterwards. Dump the
28576 whole SLP graph after pattern recognition and load
28577 redistribution optimization finished.
28579 2021-02-24 Jakub Jelinek <jakub@redhat.com>
28582 * omp-low.c (struct omp_context): Add teams_nested_p and
28583 nonteams_nested_p members.
28584 (scan_omp_target): Diagnose teams nested inside of target with other
28585 directives strictly nested inside of the same target.
28586 (check_omp_nesting_restrictions): Set ctx->teams_nested_p or
28587 ctx->nonteams_nested_p as needed.
28589 2021-02-24 Vladimir N. Makarov <vmakarov@redhat.com>
28591 PR inline-asm/99123
28592 * lra-constraints.c (uses_hard_regs_p): Don't use decompose_mem_address.
28594 2021-02-24 Hans-Peter Nilsson <hp@axis.com>
28596 * config/cris/cris.c (cris_expand_prologue): Set
28597 current_function_static_stack_size, if flag_stack_usage_info.
28599 2021-02-24 Pat Haugen <pthaugen@linux.ibm.com>
28601 * config/rs6000/rs6000.c (next_insn_prefixed_p): Rename.
28602 (rs6000_final_prescan_insn): Adjust.
28603 (rs6000_asm_output_opcode): Likewise.
28605 2021-02-24 Martin Sebor <msebor@redhat.com>
28607 PR middle-end/97172
28608 * attribs.c (attr_access::free_lang_data): Clear attribute arg spec
28609 from function arguments.
28611 2021-02-24 Tamar Christina <tamar.christina@arm.com>
28613 PR tree-optimization/99220
28614 * tree-vect-slp.c (optimize_load_redistribution_1): Remove
28615 node from cache when it's about to be deleted.
28617 2021-02-24 Jakub Jelinek <jakub@redhat.com>
28619 PR tree-optimization/99225
28620 * fold-const.c (fold_binary_loc) <case NE_EXPR>: In (x & (1 << y)) != 0
28621 to ((x >> y) & 1) != 0 simplifications use build_one_cst instead of
28622 build_int_cst (..., 1). Formatting fixes.
28624 2021-02-24 Tamar Christina <tamar.christina@arm.com>
28626 PR tree-optimization/99149
28627 * tree-vect-slp-patterns.c (vect_detect_pair_op): Don't recreate the
28629 (vect_slp_reset_pattern): Remove.
28630 (complex_fma_pattern::matches): Remove call to vect_slp_reset_pattern.
28631 (complex_mul_pattern::build, complex_fma_pattern::build,
28632 complex_fms_pattern::build): Fix ref counts.
28633 * tree-vect-slp.c (vect_free_slp_tree): Undo SLP only pattern relevancy
28634 when node is being deleted.
28635 (vect_match_slp_patterns_2): Correct result of cache hit on patterns.
28636 (vect_schedule_slp): Invalidate SLP_TREE_REPRESENTATIVE of removed
28638 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize value.
28640 2021-02-24 Matthias Klose <doko@ubuntu.com>
28643 2020-12-07 Matthias Klose <doko@ubuntu.com>
28645 * genextract.c (print_header): Undefine ENABLE_RTL_CHECKING
28646 and ENABLE_RTL_FLAG_CHECKING.
28648 2021-02-24 Richard Biener <rguenther@suse.de>
28651 * builtins.c (fold_builtin_next_arg): Avoid NULL arg.
28653 2021-02-23 Peter Bergner <bergner@linux.ibm.com>
28655 * config/rs6000/mma.md (mma_assemble_pair): Rename from this...
28656 (vsx_assemble_pair): ...to this.
28657 (*mma_assemble_pair): Rename from this...
28658 (*vsx_assemble_pair): ...to this.
28659 (mma_disassemble_pair): Rename from this...
28660 (vsx_disassemble_pair): ...to this.
28661 (*mma_disassemble_pair): Rename from this...
28662 (*vsx_disassemble_pair): ...to this.
28663 * config/rs6000/rs6000-builtin.def (BU_MMA_V2, BU_MMA_V3,
28664 BU_COMPAT): New macros.
28665 (mma_assemble_pair): Rename from this...
28666 (vsx_assemble_pair): ...to this.
28667 (mma_disassemble_pair): Rename from this...
28668 (vsx_disassemble_pair): ...to this.
28669 (mma_assemble_pair): New compatibility built-in.
28670 (mma_disassemble_pair): Likewise.
28671 * config/rs6000/rs6000-call.c (struct builtin_compatibility): New.
28672 (RS6000_BUILTIN_COMPAT): Define.
28673 (bdesc_compat): New.
28674 (mma_expand_builtin): Use VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
28675 (rs6000_gimple_fold_mma_builtin): Use MMA_BUILTIN_DISASSEMBLE_PAIR
28676 and VSX_BUILTIN_ASSEMBLE_PAIR.
28677 (rs6000_init_builtins): Register compatibility built-ins.
28678 (mma_init_builtins): Use VSX_BUILTIN_ASSEMBLE_PAIR,
28679 VSX_BUILTIN_ASSEMBLE_PAIR_INTERNAL, VSX_BUILTIN_DISASSEMBLE_PAIR and
28680 VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
28681 * doc/extend.texi (__builtin_mma_assemble_pair): Rename from this...
28682 (__builtin_vsx_assemble_pair): ...to this.
28683 (__builtin_mma_disassemble_pair): Rename from this...
28684 (__builtin_vsx_disassemble_pair): ...to this.
28686 2021-02-23 Martin Liska <mliska@suse.cz>
28689 * ipa-icf.c (sem_variable::merge): Do not merge 2 variables
28690 with different alignment. That leads to an invalid red zone
28691 size allocated in runtime.
28693 2021-02-23 Jakub Jelinek <jakub@redhat.com>
28695 PR tree-optimization/99204
28696 * fold-const.c (fold_read_from_constant_string): Check that
28697 tree_fits_uhwi_p (index) rather than just that index is INTEGER_CST.
28699 2021-02-23 Segher Boessenkool <segher@kernel.crashing.org>
28700 Kewen Lin <linkw@gcc.gnu.org>
28702 * config/rs6000/rs6000.md (*rotl<mode>3_insert_3): Renamed to...
28703 (rotl<mode>3_insert_3): ...this.
28704 (plus_ior_xor): New code_iterator.
28705 (define_split for GPR rl*imi): New splitter.
28706 * config/rs6000/vsx.md (vsx_init_v4si): Use gen_rotldi3_insert_3
28707 for integer merging.
28709 2021-02-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28711 * config/aarch64/aarch64-tuning-flags.def (cse_sve_vl_constants):
28713 * config/aarch64/aarch64.md (add<mode>3): Force CONST_POLY_INT immediates
28714 into a register when the above is enabled.
28715 * config/aarch64/aarch64.c (neoversev1_tunings):
28716 AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
28717 (aarch64_rtx_costs): Use AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
28719 2021-02-22 Hans-Peter Nilsson <hp@axis.com>
28721 * config/cris/cris.c (cris_print_operand) <'T'>: Change
28722 valid operand from is now an addi mult-value to shift-value.
28723 * config/cris/cris.md (*addi): Change expression of scaled
28724 operand from mult to ashift.
28725 * config/cris/cris.md (*addi_reload): New insn_and_split.
28727 2021-02-22 John David Anglin <danglin@gcc.gnu.org>
28730 * config/pa/pa.c (TARGET_ASM_CAN_OUTPUT_MI_THUNK): Define as
28731 hook_bool_const_tree_hwi_hwi_const_tree_true.
28732 (pa_asm_output_mi_thunk): Add support for nonzero vcall_offset.
28734 2021-02-22 Andre Vieira <andre.simoesdiasvieira@arm.com>
28736 PR rtl-optimization/98791
28737 * ira-conflicts.c (process_regs_for_copy): Don't create allocno copies
28738 for unordered modes.
28740 2021-02-22 Martin Liska <mliska@suse.cz>
28742 * tree-inline.c (inline_forbidden_p): Set
28743 inline_forbidden_reason.
28745 2021-02-22 Richard Biener <rguenther@suse.de>
28747 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Dump
28750 2021-02-22 Richard Biener <rguenther@suse.de>
28752 PR tree-optimization/99165
28753 * gimple-ssa-store-merging.c (pass_store_merging::process_store):
28754 Accumulate changed to ret.
28756 2021-02-21 Uros Bizjak <ubizjak@gmail.com>
28759 2020-12-09 Uroš Bizjak <ubizjak@gmail.com>
28761 * config/i386/i386.h (REG_ALLOC_ORDER): Remove
28763 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
28766 * config/s390/vector.md (trunctf<DFP_ALL:mode>2_vr): New
28768 (trunctf<DFP_ALL:mode>2): Likewise.
28769 (trunctdtf2_vr): Likewise.
28770 (trunctdtf2): Likewise.
28771 (extend<DFP_ALL:mode>tf2_vr): Likewise.
28772 (extend<DFP_ALL:mode>tf2): Likewise.
28773 (extendtftd2_vr): Likewise.
28774 (extendtftd2): Likewise.
28776 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
28778 * config/s390/vector.md (*fprx2_to_tf): Rename to fprx2_to_tf,
28779 add memory alternative.
28780 (tf_to_fprx2): New pattern.
28782 2021-02-19 Martin Sebor <msebor@redhat.com>
28785 * attribs.c (init_attr_rdwr_indices): Guard vblist use.
28786 (attr_access::free_lang_data): Remove a spurious test.
28788 2021-02-19 Nathan Sidwell <nathan@acm.org>
28790 * doc/invoke.texi (flang-info-module-read): Document.
28792 2021-02-19 Martin Liska <mliska@suse.cz>
28794 PR translation/99167
28795 * params.opt: Fix typo.
28797 2021-02-19 Richard Biener <rguenther@suse.de>
28799 PR middle-end/99122
28800 * tree-inline.c (inline_forbidden_p): Do not inline functions
28801 with VLA arguments or return value.
28803 2021-02-19 Jakub Jelinek <jakub@redhat.com>
28806 * config/arm/arm.md (*stack_protect_combined_set_insn,
28807 *stack_protect_combined_test_insn): If force_const_mem result
28808 is not valid general operand, force its address into the destination
28811 2021-02-19 Jakub Jelinek <jakub@redhat.com>
28814 * tree-cfg.c (gimple_merge_blocks): If bb a starts with eh landing
28815 pad or non-local label, put FORCED_LABELs from bb b after that label
28816 rather than before it.
28818 2021-02-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
28821 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3): Use
28822 expand_vector_broadcast' to emit the vec_duplicate operand.
28824 2021-02-18 Vladimir N. Makarov <vmakarov@redhat.com>
28826 PR rtl-optimization/96264
28827 * lra-remat.c (reg_overlap_for_remat_p): Check also output insn
28830 2021-02-18 H.J. Lu <hjl.tools@gmail.com>
28833 * varasm.c (get_section): Replace SUPPORTS_SHF_GNU_RETAIN with
28834 looking up the retain attribute.
28835 (resolve_unique_section): Likewise.
28836 (get_variable_section): Likewise.
28837 (switch_to_section): Likewise. Warn when a symbol without the
28838 retain attribute and a symbol with the retain attribute are
28839 placed in the section with the same name, instead of the used
28841 * doc/extend.texi: Document the "retain" attribute.
28843 2021-02-18 Nathan Sidwell <nathan@acm.org>
28846 * doc/invoke.texi (flang-info-include-translate): Document header
28849 2021-02-18 Richard Biener <rguenther@suse.de>
28851 PR middle-end/99122
28852 * ipa-fnsummary.c (analyze_function_body): Set
28853 CIF_FUNCTION_NOT_INLINABLE for VLA parameter calls.
28854 * tree-inline.c (insert_init_debug_bind): Pass NULL for
28855 error_mark_node values.
28856 (force_value_to_type): Do not build V_C_Es for WITH_SIZE_EXPR
28858 (setup_one_parameter): Delay force_value_to_type until when
28861 2021-02-18 Hans-Peter Nilsson <hp@axis.com>
28863 PR tree-optimization/99142
28864 * match.pd (clz cmp 0): Gate replacement on single_use of clz result.
28866 2021-02-18 Jakub Jelinek <jakub@redhat.com>
28868 * wide-int-bitmask.h (wide_int_bitmask::wide_int_bitmask (),
28869 wide_int_bitmask::wide_int_bitmask (uint64_t),
28870 wide_int_bitmask::wide_int_bitmask (uint64_t, uint64_t),
28871 wide_int_bitmask::operator ~ () const,
28872 wide_int_bitmask::operator | (wide_int_bitmask) const,
28873 wide_int_bitmask::operator & (wide_int_bitmask) const): Use constexpr
28875 * config/i386/i386.h (PTA_3DNOW, PTA_3DNOW_A, PTA_64BIT, PTA_ABM,
28876 PTA_AES, PTA_AVX, PTA_BMI, PTA_CX16, PTA_F16C, PTA_FMA, PTA_FMA4,
28877 PTA_FSGSBASE, PTA_LWP, PTA_LZCNT, PTA_MMX, PTA_MOVBE, PTA_NO_SAHF,
28878 PTA_PCLMUL, PTA_POPCNT, PTA_PREFETCH_SSE, PTA_RDRND, PTA_SSE, PTA_SSE2,
28879 PTA_SSE3, PTA_SSE4_1, PTA_SSE4_2, PTA_SSE4A, PTA_SSSE3, PTA_TBM,
28880 PTA_XOP, PTA_AVX2, PTA_BMI2, PTA_RTM, PTA_HLE, PTA_PRFCHW, PTA_RDSEED,
28881 PTA_ADX, PTA_FXSR, PTA_XSAVE, PTA_XSAVEOPT, PTA_AVX512F, PTA_AVX512ER,
28882 PTA_AVX512PF, PTA_AVX512CD, PTA_NO_TUNE, PTA_SHA, PTA_PREFETCHWT1,
28883 PTA_CLFLUSHOPT, PTA_XSAVEC, PTA_XSAVES, PTA_AVX512DQ, PTA_AVX512BW,
28884 PTA_AVX512VL, PTA_AVX512IFMA, PTA_AVX512VBMI, PTA_CLWB, PTA_MWAITX,
28885 PTA_CLZERO, PTA_NO_80387, PTA_PKU, PTA_AVX5124VNNIW, PTA_AVX5124FMAPS,
28886 PTA_AVX512VPOPCNTDQ, PTA_SGX, PTA_AVX512VNNI, PTA_GFNI, PTA_VAES,
28887 PTA_AVX512VBMI2, PTA_VPCLMULQDQ, PTA_AVX512BITALG, PTA_RDPID,
28888 PTA_PCONFIG, PTA_WBNOINVD, PTA_AVX512VP2INTERSECT, PTA_PTWRITE,
28889 PTA_AVX512BF16, PTA_WAITPKG, PTA_MOVDIRI, PTA_MOVDIR64B, PTA_ENQCMD,
28890 PTA_CLDEMOTE, PTA_SERIALIZE, PTA_TSXLDTRK, PTA_AMX_TILE, PTA_AMX_INT8,
28891 PTA_AMX_BF16, PTA_UINTR, PTA_HRESET, PTA_KL, PTA_WIDEKL, PTA_AVXVNNI,
28892 PTA_X86_64_BASELINE, PTA_X86_64_V2, PTA_X86_64_V3, PTA_X86_64_V4,
28893 PTA_CORE2, PTA_NEHALEM, PTA_WESTMERE, PTA_SANDYBRIDGE, PTA_IVYBRIDGE,
28894 PTA_HASWELL, PTA_BROADWELL, PTA_SKYLAKE, PTA_SKYLAKE_AVX512,
28895 PTA_CASCADELAKE, PTA_COOPERLAKE, PTA_CANNONLAKE, PTA_ICELAKE_CLIENT,
28896 PTA_ICELAKE_SERVER, PTA_TIGERLAKE, PTA_SAPPHIRERAPIDS, PTA_ALDERLAKE,
28897 PTA_KNL, PTA_BONNELL, PTA_SILVERMONT, PTA_GOLDMONT, PTA_GOLDMONT_PLUS,
28898 PTA_TREMONT, PTA_KNM): Use constexpr instead of const.
28900 2021-02-18 Jakub Jelinek <jakub@redhat.com>
28902 PR middle-end/99109
28903 * gimple-array-bounds.cc (build_zero_elt_array_type): Rename to ...
28904 (build_printable_array_type): ... this. Add nelts argument. For
28905 overaligned eltype, use TYPE_MAIN_VARIANT (eltype) instead. If
28906 nelts, call build_array_type_nelts.
28907 (array_bounds_checker::check_mem_ref): Use build_printable_array_type
28908 instead of build_zero_elt_array_type and build_array_type_nelts.
28910 2021-02-18 Jakub Jelinek <jakub@redhat.com>
28913 * config/i386/i386.c (distance_non_agu_define): Don't call
28914 extract_insn_cached here.
28915 (ix86_lea_outperforms): Save and restore recog_data around call
28916 to distance_non_agu_define and distance_agu_use.
28917 (ix86_ok_to_clobber_flags): Remove.
28918 (ix86_avoid_lea_for_add): Don't call ix86_ok_to_clobber_flags.
28919 (ix86_avoid_lea_for_addr): Likewise. Adjust function comment.
28920 * config/i386/i386.md (*lea<mode>): Change from define_insn_and_split
28921 into define_insn. Move the splitting to define_peephole2 and
28922 check there using peep2_regno_dead_p if FLAGS_REG is dead.
28924 2021-02-17 Julian Brown <julian@codesourcery.com>
28926 * gimplify.c (gimplify_scan_omp_clauses): Handle ATTACH_DETACH
28929 2021-02-17 Xi Ruoyao <xry111@mengyan1223.wang>
28932 * config/mips/mips.c (mips_symbol_insns): Do not use
28933 MSA_SUPPORTED_MODE_P if mode is MAX_MACHINE_MODE.
28935 2021-02-16 Vladimir N. Makarov <vmakarov@redhat.com>
28937 PR inline-asm/98096
28938 * stmt.c (resolve_operand_name_1): Take inout operands into account
28939 for access to labels by names.
28940 * doc/extend.texi: Describe counting operands for accessing labels.
28942 2021-02-16 Richard Biener <rguenther@suse.de>
28944 PR tree-optimization/38474
28945 * tree-ssa-structalias.c (variable_info::address_taken): New.
28946 (new_var_info): Initialize address_taken.
28947 (process_constraint): Set address_taken.
28948 (solve_constraints): Use the new address_taken flag rather
28949 than is_reg_var for sorting variables.
28950 (dump_constraint): Dump the variable number if the name
28953 2021-02-16 Jakub Jelinek <jakub@redhat.com>
28956 * tree-vect-stmts.c (vectorizable_simd_clone_call): For num_calls != 1
28957 multiply by 4096 and for inbranch by 8192.
28958 * config/i386/i386.c (ix86_simd_clone_usable): For TARGET_AVX512F,
28959 return 3, 2 or 1 for mangle letters 'b', 'c' or 'd'.
28961 2021-02-15 Maya Rashish <coypu@sdf.org>
28963 * config/aarch64/aarch64.c (aarch64_init_builtins):
28964 Call SUBTARGET_INIT_BUILTINS.
28966 2021-02-15 Peter Bergner <bergner@linux.ibm.com>
28968 PR rtl-optimization/98872
28969 * init-regs.c (initialize_uninitialized_regs): Skip initialization
28970 if CONST0_RTX is NULL.
28972 2021-02-15 Richard Sandiford <richard.sandiford@arm.com>
28974 PR rtl-optimization/98863
28975 * rtl-ssa/functions.h (function_info::bb_live_out_info): Delete.
28976 (function_info::build_info): Turn into a declaration, moving the
28977 definition to internals.h.
28978 (function_info::bb_walker): Declare.
28979 (function_info::create_reg_use): Likewise.
28980 (function_info::calculate_potential_phi_regs): Take a build_info
28982 (function_info::place_phis, function_info::create_ebbs): Declare.
28983 (function_info::calculate_ebb_live_in_for_debug): Likewise.
28984 (function_info::populate_backedge_phis): Delete.
28985 (function_info::start_block, function_info::end_block): Declare.
28986 (function_info::populate_phi_inputs): Delete.
28987 (function_info::m_potential_phi_regs): Move information to build_info.
28988 * rtl-ssa/internals.h: New file.
28989 (function_info::bb_phi_info): New class.
28990 (function_info::build_info): Moved from functions.h.
28991 Add a constructor and destructor.
28992 (function_info::build_info::ebb_use): Delete.
28993 (function_info::build_info::ebb_def): Likewise.
28994 (function_info::build_info::bb_live_out): Likewise.
28995 (function_info::build_info::tmp_ebb_live_in_for_debug): New variable.
28996 (function_info::build_info::potential_phi_regs): Likewise.
28997 (function_info::build_info::potential_phi_regs_for_debug): Likewise.
28998 (function_info::build_info::ebb_def_regs): Likewise.
28999 (function_info::build_info::bb_phis): Likewise.
29000 (function_info::build_info::bb_mem_live_out): Likewise.
29001 (function_info::build_info::bb_to_rpo): Likewise.
29002 (function_info::build_info::def_stack): Likewise.
29003 (function_info::build_info::old_def_stack_limit): Likewise.
29004 * rtl-ssa/internals.inl (function_info::build_info::record_reg_def):
29005 Remove the regno argument. Push the previous definition onto the
29006 definition stack where necessary.
29007 * rtl-ssa/accesses.cc: Include internals.h.
29008 * rtl-ssa/changes.cc: Likewise.
29009 * rtl-ssa/blocks.cc: Likewise.
29010 (function_info::build_info::build_info): Define.
29011 (function_info::build_info::~build_info): Likewise.
29012 (function_info::bb_walker): New class.
29013 (function_info::bb_walker::bb_walker): Define.
29014 (function_info::add_live_out_use): Convert a logarithmic-complexity
29015 test into a linear one. Allow the same definition to be passed
29017 (function_info::calculate_potential_phi_regs): Moved from
29018 functions.cc. Take a build_info parameter and store the
29019 information there instead.
29020 (function_info::place_phis): New function.
29021 (function_info::add_entry_block_defs): Update call to record_reg_def.
29022 (function_info::calculate_ebb_live_in_for_debug): New function.
29023 (function_info::add_phi_nodes): Use bb_phis to decide which
29024 registers need phi nodes and initialize ebb_def_regs accordingly.
29025 Do not add degenerate phis here.
29026 (function_info::add_artificial_accesses): Use create_reg_use.
29027 Assert that all definitions are listed in the DF LR sets.
29028 Update call to record_reg_def.
29029 (function_info::record_block_live_out): Record live-out register
29030 values in the phis of successor blocks. Use the live-out set
29031 when processing the last block in an EBB, instead of always
29032 using the live-in sets of successor blocks. AND the live sets
29033 with the set of registers that have been defined in the EBB,
29034 rather than with all potential phi registers. Cope correctly
29035 with branches back to the start of the current EBB.
29036 (function_info::start_block): New function.
29037 (function_info::end_block): Likewise.
29038 (function_info::populate_phi_inputs): Likewise.
29039 (function_info::create_ebbs): Likewise.
29040 (function_info::process_all_blocks): Rewrite into a multi-phase
29042 * rtl-ssa/functions.cc: Include internals.h.
29043 (function_info::calculate_potential_phi_regs): Move to blocks.cc.
29044 (function_info::init_function_data): Remove caller.
29045 * rtl-ssa/insns.cc: Include internals.h
29046 (function_info::create_reg_use): New function. Lazily any
29047 degenerate phis needed by the linear RPO view.
29048 (function_info::record_use): Use create_reg_use. When processing
29049 debug uses, use potential_phi_regs and test it before checking
29050 whether the register is live on entry to the current EBB. Lazily
29051 calculate ebb_live_in_for_debug.
29052 (function_info::record_call_clobbers): Update call to record_reg_def.
29053 (function_info::record_def): Likewise.
29055 2021-02-15 Martin Liska <mliska@suse.cz>
29057 * toplev.c (init_asm_output): Free output of
29058 gen_command_line_string function.
29059 (process_options): Likewise.
29061 2021-02-15 Martin Liska <mliska@suse.cz>
29063 * params.opt: Add 2 missing Param keywords.
29065 2021-02-15 Eric Botcazou <ebotcazou@adacore.com>
29067 * df-core.c (df_worklist_dataflow_doublequeue): Use proper cast.
29069 2021-02-15 Jakub Jelinek <jakub@redhat.com>
29071 PR tree-optimization/99079
29072 * match.pd (A % (pow2pcst << N) -> A & ((pow2pcst << N) - 1)): Remove
29073 useless tree_nop_conversion_p (type, TREE_TYPE (@3)) check. Instead
29074 require both type and TREE_TYPE (@1) to be integral types and either
29075 type having smaller or equal precision, or TREE_TYPE (@1) being
29076 unsigned type, or type being signed type. If TREE_TYPE (@1)
29077 doesn't have wrapping overflow, perform the subtraction of one in
29080 2021-02-14 Jan Hubicka <hubicka@ucw.cz>
29081 Richard Biener <rguether@suse.de>
29084 * ipa-reference.c (ipa_init): Only conditinally initialize
29085 reference_vars_to_consider.
29086 (propagate): Conditionally deninitialize reference_vars_to_consider.
29087 (ipa_reference_write_optimization_summary): Sanity check that
29088 reference_vars_to_consider is not allocated.
29090 2021-02-13 Levy Hsu <admin@levyhsu.com>
29093 * config/riscv/riscv-shorten-memrefs.c (pass_shorten_memrefs): Add
29094 extend parameter to get_si_mem_base_reg declaration.
29095 (get_si_mem_base_reg): Add extend parameter. Set it.
29096 (analyze): Pass extend arg to get_si_mem_base_reg.
29097 (transform): Likewise. Use it when rewriting mems.
29098 * config/riscv/riscv.c (riscv_legitimize_move): Check for subword
29099 loads and emit sign/zero extending load followed by subreg move.
29101 2021-02-13 Jim Wilson <jimw@sifive.com>
29104 * config/riscv/riscv.c (riscv_compressed_lw_address_p): Drop early
29105 exit when !reload_completed. Only perform check for compressed reg
29106 if reload_completed.
29107 (riscv_rtx_costs): In MEM case, when optimizing for size and
29108 shorten memrefs, if not compressible, then increase cost.
29110 2021-02-13 Jakub Jelinek <jakub@redhat.com>
29112 PR rtl-optimization/98439
29113 * recog.c (pass_split_before_regstack::gate): Enable even when
29114 pass_split_before_sched2 is enabled if -fselective-scheduling2 is
29117 2021-02-13 Jakub Jelinek <jakub@redhat.com>
29120 * config/i386/mmx.md (*mmx_pshufd_1): Add a combine splitter for
29121 swap of V2SImode elements in memory into DImode memory rotate by 32.
29123 2021-02-12 Martin Sebor <msebor@redhat.com>
29125 * tree-pretty-print.c (print_generic_expr_to_str): Update comment.
29127 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
29129 * rtl-ssa/accesses.cc (function_info::make_use_available): Use
29130 m_temp_obstack rather than m_obstack to allocate the temporary use.
29132 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
29134 * df-problems.c (df_lr_bb_local_compute): Treat partial definitions
29135 as read-modify operations.
29137 2021-02-12 Richard Biener <rguenther@suse.de>
29139 PR middle-end/38474
29140 * ipa-fnsummary.c (unmodified_parm_1): Only walk when
29141 fbi->aa_walk_budget is bigger than zero. Update
29142 fbi->aa_walk_budget.
29143 (param_change_prob): Likewise.
29144 * ipa-prop.c (detect_type_change_from_memory_writes):
29145 Properly account walk_aliased_vdefs.
29146 (parm_preserved_before_stmt_p): Canonicalize updates.
29147 (parm_ref_data_preserved_p): Likewise.
29148 (parm_ref_data_pass_through_p): Likewise.
29149 (determine_known_aggregate_parts): Account own alias queries.
29151 2021-02-12 Martin Liska <mliska@suse.cz>
29153 * opts-common.c (decode_cmdline_option): Release werror_arg.
29154 * opts.c (gen_producer_string): Release output of
29155 gen_command_line_string.
29157 2021-02-12 Richard Biener <rguenther@suse.de>
29159 PR tree-optimization/38474
29160 * params.opt (-param=max-store-chains-to-track=): New param.
29161 (-param=max-stores-to-track=): Likewise.
29162 * doc/invoke.texi (max-store-chains-to-track): Document.
29163 (max-stores-to-track): Likewise.
29164 * gimple-ssa-store-merging.c (pass_store_merging::m_n_chains):
29166 (pass_store_merging::m_n_stores): Likewise.
29167 (pass_store_merging::terminate_and_process_chain): Update
29168 m_n_stores and m_n_chains.
29169 (pass_store_merging::process_store): Likewise. Terminate
29170 oldest chains if the number of stores or chains get too large.
29171 (imm_store_chain_info::terminate_and_process_chain): Dump
29174 2021-02-11 Eric Botcazou <ebotcazou@adacore.com>
29176 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
29177 the cold section, emit a nop before the directive if the previous
29178 active instruction can throw.
29180 2021-02-11 Peter Bergner <bergner@linux.ibm.com>
29183 * config/rs6000/predicates.md (mma_assemble_input_operand): Restrict
29184 memory addresses that are legal for quad word accesses.
29186 2021-02-11 Andrea Corallo <andrea.corallo@arm.com>
29189 * config/arm/thumb2.md (*doloop_end_internal): Generate
29190 alternative sequence to handle long range branches.
29192 2021-02-11 Joel Hutton <joel.hutton@arm.com>
29194 PR tree-optimization/98772
29195 * optabs-tree.c (supportable_half_widening_operation): New function
29196 to check for supportable V8QI->V8HI widening patterns.
29197 * optabs-tree.h (supportable_half_widening_operation): New function.
29198 * tree-vect-stmts.c (vect_create_half_widening_stmts): New function
29199 to create promotion stmts for V8QI->V8HI widening patterns.
29200 (vectorizable_conversion): Add case for V8QI->V8HI.
29202 2021-02-11 Richard Biener <rguenther@suse.de>
29204 * sparseset.h (SPARSESET_ELT_BITS): Remove.
29205 (SPARSESET_ELT_TYPE): Use unsigned int.
29206 * fwprop.c: Do not include sparseset.h.
29208 2021-02-10 Jakub Jelinek <jakub@redhat.com>
29211 * varasm.c (declare_weak): For -fsyntax-only, allow even
29212 TREE_ASM_WRITTEN function decls.
29214 2021-02-10 Jakub Jelinek <jakub@redhat.com>
29217 * config/i386/sse.md (fix<fixunssuffix>_truncv2sfv2di2,
29218 <insn>v8qiv8hi2, <insn>v8qiv8si2, <insn>v4qiv4si2, <insn>v4hiv4si2,
29219 <insn>v8qiv8di2, <insn>v4qiv4di2, <insn>v2qiv2di2, <insn>v4hiv4di2,
29220 <insn>v2hiv2di2, <insn>v2siv2di2): Force operands[1] into REG before
29221 calling simplify_gen_subreg on it.
29223 2021-02-10 Martin Liska <mliska@suse.cz>
29225 * config/nvptx/nvptx.c (nvptx_option_override): Use
29226 flag_patchable_function_entry instead of the removed
29227 function_entry_patch_area_size.
29229 2021-02-10 Martin Liska <mliska@suse.cz>
29231 PR tree-optimization/99002
29232 PR tree-optimization/99026
29233 * gimple-if-to-switch.cc (if_chain::is_beneficial): Fix memory
29234 leak when adjacent cases are merged.
29235 * tree-switch-conversion.c (switch_decision_tree::analyze_switch_statement): Use
29237 (make_pass_lower_switch): Remove trailing whitespace.
29238 * tree-switch-conversion.h (release_clusters): New.
29240 2021-02-10 Richard Biener <rguenther@suse.de>
29242 PR rtl-optimization/99054
29243 * cfgrtl.c (rtl-optimization/99054): Return an auto_vec.
29244 (fixup_partitions): Adjust.
29245 (rtl_verify_edges): Likewise.
29247 2021-02-10 Jakub Jelinek <jakub@redhat.com>
29249 PR middle-end/99007
29250 * gimplify.c (gimplify_scan_omp_clauses): For MEM_REF on reductions,
29251 temporarily disable gimplify_ctxp->into_ssa around gimplify_expr
29254 2021-02-10 Richard Biener <rguenther@suse.de>
29257 * ipa-pure-const.c (propagate_malloc): Use an auto_vec<>
29260 2021-02-10 Richard Biener <rguenther@suse.de>
29262 PR tree-optimization/99024
29263 * tree-vect-loop.c (_loop_vec_info::~_loop_vec_info): Only
29264 clear loop->aux if it is associated with the destroyed loop_vinfo.
29266 2021-02-10 Martin Liska <mliska@suse.cz>
29268 PR tree-optimization/99002
29269 * gimple-if-to-switch.cc (find_conditions): Fix memory leak
29272 2021-02-10 Martin Liska <mliska@suse.cz>
29275 * ipa-icf.c (sem_item::add_reference): Fix memory leak when
29276 a reference exists.
29278 2021-02-10 Jakub Jelinek <jakub@redhat.com>
29281 * dwarf2out.c (prune_unused_types_walk): Mark DW_TAG_variable DIEs
29282 at class scope for DWARF5+.
29284 2021-02-09 Eric Botcazou <ebotcazou@adacore.com>
29286 PR rtl-optimization/96015
29287 * reorg.c (skip_consecutive_labels): Minor comment tweaks.
29288 (relax_delay_slots): When deleting a jump to the next active
29289 instruction over a barrier, first delete the barrier if the
29290 jump is the only way to reach the target label.
29292 2021-02-09 Andre Vieira <andre.simoesdiasvieira@arm.com>
29294 * config/aarch64/aarch64-cost-tables.h: Add entries for vect.mul.
29295 * config/aarch64/aarch64.c (aarch64_rtx_mult_cost): Use vect.mul for
29296 vector multiplies and vect.alu for SSRA.
29297 * config/arm/aarch-common-protos.h (struct vector_cost_table): Define
29298 vect.mul cost field.
29299 * config/arm/aarch-cost-tables.h: Add entries for vect.mul.
29300 * config/arm/arm.c: Likewise.
29302 2021-02-09 Richard Biener <rguenther@suse.de>
29304 PR tree-optimization/98863
29305 * tree-ssa-sccvn.h (vn_avail::next_undo): Add.
29306 * tree-ssa-sccvn.c (last_pushed_avail): New global.
29307 (rpo_elim::eliminate_push_avail): Chain pushed avails.
29308 (unwind_state::avail_top): Add.
29309 (do_unwind): Rewrite unwinding of avail entries.
29310 (do_rpo_vn): Initialize last_pushed_avail and
29311 avail_top of the undo state.
29313 2021-02-09 Jakub Jelinek <jakub@redhat.com>
29315 PR middle-end/99004
29316 * calls.c (maybe_warn_rdwr_sizes): Change s0 and s1 type from
29317 const char * to char * and free those pointers after use.
29319 2021-02-09 Richard Biener <rguenther@suse.de>
29321 PR tree-optimization/99017
29322 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Allow
29323 zero vector cost entries.
29325 2021-02-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
29327 PR middle-end/98974
29328 * tree-vect-stmts.c (vectorizable_condition): Remove shadow vec_num
29329 parameter in vectorizable_condition.
29331 2021-02-08 Richard Biener <rguenther@suse.de>
29334 * tree.c (walk_tree_1): Walk VECTOR_CST elements.
29336 2021-02-08 Martin Liska <mliska@suse.cz>
29339 * cfgexpand.c (pass_expand::execute): Parse per-function option
29340 flag_patchable_function_entry and use it.
29341 * common.opt: Remove function_entry_patch_area_size and
29342 function_entry_patch_area_start global variables.
29343 * opts.c (parse_and_check_patch_area): New function.
29344 (common_handle_option): Use it.
29345 * opts.h (parse_and_check_patch_area): New function.
29346 * toplev.c (process_options): Parse and use
29347 function_entry_patch_area_size.
29349 2021-02-08 Martin Sebor <msebor@redhat.com>
29351 * doc/extend.texi (attribute malloc): Correct typos.
29353 2021-02-05 Nathan Sidwell <nathan@acm.org>
29356 * gcc.c (driver::maybe_run_linker): Check for input file
29357 accessibility if not linking.
29359 2021-02-05 Richard Biener <rguenther@suse.de>
29361 PR tree-optimization/98855
29362 * tree-vectorizer.h (add_stmt_cost): New overload.
29363 * tree-vect-slp.c (li_cost_vec_cmp): New.
29364 (vect_bb_slp_scalar_cost): Cost individual loop regions
29365 separately. Account for the scalar instance root stmt.
29367 2021-02-05 Tom de Vries <tdevries@suse.de>
29370 * tree-switch-conversion.c (jump_table_cluster::emit): Add loc
29372 (bit_test_cluster::emit): Reuse location_t for newly created
29374 (switch_decision_tree::try_switch_expansion): Preserve
29376 * tree-switch-conversion.h: Change function signatures.
29378 2021-02-05 Jakub Jelinek <jakub@redhat.com>
29381 * config/i386/i386-options.c (m_NONE, m_ALL): Define.
29382 * config/i386/x86-tune.def (X86_TUNE_BRANCH_PREDICTION_HINTS,
29383 X86_TUNE_PROMOTE_QI_REGS): Use m_NONE instead of 0U.
29384 (X86_TUNE_QIMODE_MATH): Use m_ALL instead of ~0U.
29386 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29388 * config/aarch64/aarch64-simd-builtins.def (get_high): Define builtin.
29389 * config/aarch64/aarch64-simd.md (aarch64_get_high<mode>): Define.
29390 * config/aarch64/arm_neon.h (__GET_HIGH): Delete.
29391 (vget_high_f16): Reimplement using new builtin.
29392 (vget_high_f32): Likewise.
29393 (vget_high_f64): Likewise.
29394 (vget_high_p8): Likewise.
29395 (vget_high_p16): Likewise.
29396 (vget_high_p64): Likewise.
29397 (vget_high_s8): Likewise.
29398 (vget_high_s16): Likewise.
29399 (vget_high_s32): Likewise.
29400 (vget_high_s64): Likewise.
29401 (vget_high_u8): Likewise.
29402 (vget_high_u16): Likewise.
29403 (vget_high_u32): Likewise.
29404 (vget_high_u64): Likewise.
29406 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29408 * config/aarch64/aarch64-simd-builtins.def (get_low): Define builtin.
29409 * config/aarch64/aarch64-simd.md (aarch64_get_low<mode>): Define.
29410 * config/aarch64/arm_neon.h (__GET_LOW): Delete.
29411 (vget_low_f16): Reimplement using new builtin.
29412 (vget_low_f32): Likewise.
29413 (vget_low_f64): Likewise.
29414 (vget_low_p8): Likewise.
29415 (vget_low_p16): Likewise.
29416 (vget_low_p64): Likewise.
29417 (vget_low_s8): Likewise.
29418 (vget_low_s16): Likewise.
29419 (vget_low_s32): Likewise.
29420 (vget_low_s64): Likewise.
29421 (vget_low_u8): Likewise.
29422 (vget_low_u16): Likewise.
29423 (vget_low_u32): Likewise.
29424 (vget_low_u64): Likewise.
29426 2021-02-05 Kito Cheng <kito.cheng@sifive.com>
29428 * gcc.c (print_multilib_info): Check all required argument is provided
29431 2021-02-05 liuhongt <hongtao.liu@intel.com>
29434 * config/i386/i386-expand.c (ix86_expand_sse_cmp): Don't
29435 generate integer mask comparison for 128/256-bits vector when
29436 op_true/op_false is NULL_RTX or CONSTM1_RTX/CONST0_RTX. Also
29437 delete redundant !maskcmp condition.
29438 (ix86_expand_int_vec_cmp): Ditto but no redundant deletion
29440 (ix86_expand_sse_movcc): Delete definition of maskcmp, add the
29441 condition directly to if (maskcmp), add extra check for
29442 cmpmode, it should be MODE_INT.
29443 (ix86_expand_fp_vec_cmp): Pass NULL to ix86_expand_sse_cmp's
29444 parameters op_true/op_false.
29445 (ix86_use_mask_cmp_p): New.
29447 2021-02-05 liuhongt <hongtao.liu@intel.com>
29450 * config/i386/x86-tune.def (X86_TUNE_AVX256_UNALIGNED_LOAD_OPTIMAL):
29451 Remove m_GENERIC from ~list.
29452 (X86_TUNE_AVX256_UNALIGNED_STORE_OPTIMAL): Ditto.
29454 2021-02-04 David Malcolm <dmalcolm@redhat.com>
29457 * diagnostic-show-locus.c (compatible_locations_p): Require
29458 locations in the same macro map to be either both from the
29459 macro definition, or both from the macro arguments.
29461 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
29463 * config/aarch64/aarch64-simd-builtins.def: Add
29464 [su]mull_hi_lane[q] builtin generator macros.
29465 * config/aarch64/aarch64-simd.md
29466 (aarch64_<su>mull_hi_lane<mode>_insn): Define.
29467 (aarch64_<su>mull_hi_lane<mode>): Define.
29468 (aarch64_<su>mull_hi_laneq<mode>_insn): Define.
29469 (aarch64_<su>mull_hi_laneq<mode>): Define.
29470 * config/aarch64/arm_neon.h (vmull_high_lane_s16): Use RTL
29471 builtin instead of inline asm.
29472 (vmull_high_lane_s32): Likewise.
29473 (vmull_high_lane_u16): Likewise.
29474 (vmull_high_lane_u32): Likewise.
29475 (vmull_high_laneq_s16): Likewise.
29476 (vmull_high_laneq_s32): Likewise.
29477 (vmull_high_laneq_u16): Likewise.
29478 (vmull_high_laneq_u32): Liekwise.
29480 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
29482 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_hi_n
29483 builtin generator macros.
29484 * config/aarch64/aarch64-simd.md
29485 (aarch64_<su>mull_hi_n<mode>_insn): Define.
29486 (aarch64_<su>mull_hi_n<mode>): Define.
29487 * config/aarch64/arm_neon.h (vmull_high_n_s16): Use RTL builtin
29488 instead of inline asm.
29489 (vmull_high_n_s32): Likewise.
29490 (vmull_high_n_u16): Likewise.
29491 (vmull_high_n_u32): Likewise.
29493 2021-02-04 Richard Biener <rguenther@suse.de>
29495 PR tree-optimization/98855
29496 * tree-vect-loop.c (vectorizable_phi): Do not cost
29497 single-argument PHIs.
29498 * tree-vect-slp.c (vect_bb_slp_scalar_cost): Likewise.
29499 * tree-vect-stmts.c (vectorizable_bswap): Also perform
29500 costing for SLP operation.
29502 2021-02-04 Martin Liska <mliska@suse.cz>
29504 * doc/extend.texi: Mention -mprefer-vector-width in target
29507 2021-02-03 Martin Sebor <msebor@redhat.com>
29509 PR tree-optimization/98937
29510 * tree-ssa-strlen.c (strlen_dom_walker::~strlen_dom_walker): Define.
29511 Flush pointer_query cache.
29513 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
29515 * config/rs6000/genfusion.pl (gen_2logical): Add missing
29516 fixes based on patch review.
29517 * config/rs6000/fusion.md: Regenerate file.
29519 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
29521 * config/rs6000/t-rs6000: Comment out auto generation of
29524 2021-02-03 Andrew Stubbs <ams@codesourcery.com>
29526 * config/gcn/gcn-opts.h (enum processor_type): Add PROCESSOR_GFX908.
29527 * config/gcn/gcn.c (gcn_omp_device_kind_arch_isa): Add gfx908.
29528 (output_file_start): Add gfx908.
29529 * config/gcn/gcn.opt (gpu_type): Add gfx908.
29530 * config/gcn/t-gcn-hsa (MULTILIB_OPTIONS): Add march=gfx908.
29531 (MULTILIB_DIRNAMES): Add gfx908.
29532 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): New define.
29533 (main): Recognize gfx908.
29534 * config/gcn/t-omp-device: Add gfx908.
29536 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
29538 * config/aarch64/aarch64-simd-builtins.def: Add
29539 [su]mlsl_hi_lane[q] builtin macro generators.
29540 * config/aarch64/aarch64-simd.md
29541 (aarch64_<su>mlsl_hi_lane<mode>_insn): Define.
29542 (aarch64_<su>mlsl_hi_lane<mode>): Define.
29543 (aarch64_<su>mlsl_hi_laneq<mode>_insn): Define.
29544 (aarch64_<su>mlsl_hi_laneq<mode>): Define.
29545 * config/aarch64/arm_neon.h (vmlsl_high_lane_s16): Use RTL
29546 builtin instead of inline asm.
29547 (vmlsl_high_lane_s32): Likewise.
29548 (vmlsl_high_lane_u16): Likewise.
29549 (vmlsl_high_lane_u32): Likewise.
29550 (vmlsl_high_laneq_s16): Likewise.
29551 (vmlsl_high_laneq_s32): Likewise.
29552 (vmlsl_high_laneq_u16): Likewise.
29553 (vmlsl_high_laneq_u32): Likewise.
29554 (vmlal_high_laneq_u32): Likewise.
29556 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
29558 * config/aarch64/aarch64-simd-builtins.def: Add
29559 [su]mlal_hi_lane[q] builtin generator macros.
29560 * config/aarch64/aarch64-simd.md
29561 (aarch64_<su>mlal_hi_lane<mode>_insn): Define.
29562 (aarch64_<su>mlal_hi_lane<mode>): Define.
29563 (aarch64_<su>mlal_hi_laneq<mode>_insn): Define.
29564 (aarch64_<su>mlal_hi_laneq<mode>): Define.
29565 * config/aarch64/arm_neon.h (vmlal_high_lane_s16): Use RTL
29566 builtin instead of inline asm.
29567 (vmlal_high_lane_s32): Likewise.
29568 (vmlal_high_lane_u16): Likewise.
29569 (vmlal_high_lane_u32): Likewise.
29570 (vmlal_high_laneq_s16): Likewise.
29571 (vmlal_high_laneq_s32): Likewise.
29572 (vmlal_high_laneq_u16): Likewise.
29573 (vmlal_high_laneq_u32): Likewise.
29575 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
29577 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_hi_n
29578 builtin generator macros.
29579 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_hi_n<mode>_insn):
29581 (aarch64_<su>mlsl_hi_n<mode>): Define.
29582 * config/aarch64/arm_neon.h (vmlsl_high_n_s16): Use RTL builtin
29583 instead of inline asm.
29584 (vmlsl_high_n_s32): Likewise.
29585 (vmlsl_high_n_u16): Likewise.
29586 (vmlsl_high_n_u32): Likewise.
29588 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
29590 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_hi_n
29591 builtin generator macros.
29592 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_hi_n<mode>_insn):
29594 (aarch64_<su>mlal_hi_n<mode>): Define.
29595 * config/aarch64/arm_neon.h (vmlal_high_n_s16): Use RTL builtin
29596 instead of inline asm.
29597 (vmlal_high_n_s32): Likewise.
29598 (vmlal_high_n_u16): Likewise.
29599 (vmlal_high_n_u32): Likewise.
29601 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
29603 * config/aarch64/aarch64-simd-builtins.def: Add RTL builtin
29605 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal_hi<mode>):
29607 (aarch64_<su>mlal_hi<mode>_insn): This.
29608 (aarch64_<su>mlal_hi<mode>): Define.
29609 * config/aarch64/arm_neon.h (vmlal_high_s8): Use RTL builtin
29610 instead of inline asm.
29611 (vmlal_high_s16): Likewise.
29612 (vmlal_high_s32): Likewise.
29613 (vmlal_high_u8): Likewise.
29614 (vmlal_high_u16): Likewise.
29615 (vmlal_high_u32): Likewise.
29617 2021-02-03 Ilya Leoshkevich <iii@linux.ibm.com>
29619 * lra-spills.c (remove_pseudos): Call lra_update_insn_recog_data()
29620 after calling alter_subreg() on a (mem).
29622 2021-02-03 Martin Liska <mliska@suse.cz>
29625 * lto-streamer-out.c (produce_lto_section): Fill up missing
29627 * lto-streamer.h (struct lto_section): Add _padding field.
29629 2021-02-03 Richard Biener <rguenther@suse.de>
29631 * lto-streamer.c (lto_get_section_name): Free temporary
29633 * tree-loop-distribution.c
29634 (loop_distribution::merge_dep_scc_partitions): Free edge data.
29636 2021-02-03 Jakub Jelinek <jakub@redhat.com>
29638 PR middle-end/97487
29639 * ifcvt.c (noce_can_force_operand): New function.
29640 (noce_emit_move_insn): Use it.
29641 (noce_try_sign_mask): Likewise. Formatting fix.
29643 2021-02-03 Jakub Jelinek <jakub@redhat.com>
29645 PR middle-end/97971
29646 * lra-constraints.c (process_alt_operands): For inline asm, don't call
29647 fatal_insn, but instead return false.
29649 2021-02-03 Jakub Jelinek <jakub@redhat.com>
29651 PR tree-optimization/98287
29652 * config/i386/mmx.md (<insn><mode>3): For shifts don't enable expander
29655 2021-02-03 Tamar Christina <tamar.christina@arm.com>
29657 PR tree-optimization/98928
29658 * tree-vect-loop.c (vect_analyze_loop_2): Change
29659 STMT_VINFO_SLP_VECT_ONLY to STMT_VINFO_SLP_VECT_ONLY_PATTERN.
29660 * tree-vect-slp-patterns.c (complex_pattern::build): Likewise.
29661 * tree-vectorizer.h (STMT_VINFO_SLP_VECT_ONLY_PATTERN): New.
29662 (class _stmt_vec_info): Add slp_vect_pattern_only_p.
29664 2021-02-02 Richard Biener <rguenther@suse.de>
29666 * gimple-loop-interchange.cc (prepare_data_references):
29668 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
29669 * tree-ssa-loop-im.c (hoist_memory_references): Likewise.
29670 * tree-vect-stmts.c (vectorizable_condition): Do not
29672 (vectorizable_comparison): Likewise.
29674 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29676 * config/aarch64/aarch64-simd-builtins.def (ursqrte): Define builtin.
29677 * config/aarch64/aarch64-simd.md (aarch64_ursqrte<mode>): New pattern.
29678 * config/aarch64/arm_neon.h (vrsqrte_u32): Reimplement using builtin.
29679 (vrsqrteq_u32): Likewise.
29681 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29683 * config/aarch64/aarch64-simd-builtins.def (sqxtun2): Define builtin.
29684 * config/aarch64/aarch64-simd.md (aarch64_sqxtun2<mode>_le): Define.
29685 (aarch64_sqxtun2<mode>_be): Likewise.
29686 (aarch64_sqxtun2<mode>): Likewise.
29687 * config/aarch64/arm_neon.h (vqmovun_high_s16): Reimplement using builtin.
29688 (vqmovun_high_s32): Likewise.
29689 (vqmovun_high_s64): Likewise.
29690 * config/aarch64/iterators.md (UNSPEC_SQXTUN2): Define.
29692 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29694 * config/aarch64/aarch64-simd-builtins.def (bfdot_lane, bfdot_laneq): Use
29696 (bfmlalb_lane, bfmlalt_lane, bfmlalb_lane_q, bfmlalt_lane_q): Use FP flags.
29698 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29700 * config/aarch64/aarch64-simd-builtins.def (fcmla_lane0, fcmla_lane90,
29701 fcmla_lane180, fcmla_lane270, fcmlaq_lane0, fcmlaq_lane90, fcmlaq_lane180,
29702 fcmlaq_lane270, scvtf, ucvtf, fcvtzs, fcvtzu, scvtfsi, scvtfdi, ucvtfsi,
29703 ucvtfdi, fcvtzshf, fcvtzuhf, fmlal_lane_low, fmlsl_lane_low,
29704 fmlal_laneq_low, fmlsl_laneq_low, fmlalq_lane_low, fmlslq_lane_low,
29705 fmlalq_laneq_low, fmlslq_laneq_low, fmlal_lane_high, fmlsl_lane_high,
29706 fmlal_laneq_high, fmlsl_laneq_high, fmlalq_lane_high, fmlslq_lane_high,
29707 fmlalq_laneq_high, fmlslq_laneq_high): Use FP flags.
29709 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29711 * config/aarch64/aarch64-builtins.c (FLAG_LOAD): Define.
29712 * config/aarch64/aarch64-simd-builtins.def (ld1x2, ld2, ld3, ld4, ld2r,
29713 ld3r, ld4r, ld1, ld1x3, ld1x4): Use LOAD flags.
29715 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29717 * config/aarch64/aarch64-simd-builtins.def (combine, zip1, zip2,
29718 uzp1, uzp2, trn1, trn2, simd_bsl): Use AUTO_FP flags.
29720 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29722 * config/aarch64/aarch64-simd-builtins.def (clrsb, clz, ctz, popcount,
29723 vec_smult_lane_, vec_smlal_lane_, vec_smult_laneq_, vec_smlal_laneq_,
29724 vec_umult_lane_, vec_umlal_lane_, vec_umult_laneq_, vec_umlal_laneq_,
29725 ashl, sshl, ushl, srshl, urshl, sdot_lane, udot_lane, sdot_laneq,
29726 udot_laneq, usdot_lane, usdot_laneq, sudot_lane, sudot_laneq, ashr,
29727 ashr_simd, lshr, lshr_simd, srshr_n, urshr_n, ssra_n, usra_n, srsra_n,
29728 ursra_n, sshll_n, ushll_n, sshll2_n, ushll2_n, ssri_n, usri_n, ssli_n,
29729 ssli_n, usli_n, bswap, rbit, simd_bsl, eor3q, rax1q, xarq, bcaxq): Use
29730 NONE builtin flags.
29732 2021-02-02 Jakub Jelinek <jakub@redhat.com>
29734 PR tree-optimization/98848
29735 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Punt if
29736 STMT_VINFO_DEF_TYPE (last_stmt_info) is vect_reduction_def.
29738 2021-02-02 Kito Cheng <kito.cheng@sifive.com>
29741 * expr.c: Check mode before calling store_expr.
29743 2021-02-02 Christophe Lyon <christophe.lyon@linaro.org>
29745 * config/arm/iterators.md (supf): Remove VORNQ_S and VORNQ_U.
29747 * config/arm/mve.md (mve_vornq_s<mode>): New entry for vorn
29748 instruction using expression ior.
29749 (mve_vornq_u<mode>): New expander.
29750 (mve_vornq_f<mode>): Use ior code instead of unspec.
29751 * config/arm/unspecs.md (VORNQ_S, VORNQ_U, VORNQ_F): Remove.
29753 2021-02-02 Alexandre Oliva <oliva@adacore.com>
29755 * tree-nested.c (convert_nonlocal_reference_op): Move
29756 current_function_decl restore after re-gimplification.
29757 (convert_local_reference_op): Likewise.
29759 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29761 * config/aarch64/aarch64-simd-builtins.def (rshrn, rshrn2):
29763 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>_insn_le):
29765 (aarch64_rshrn<mode>_insn_be): Likewise.
29766 (aarch64_rshrn<mode>): Likewise.
29767 (aarch64_rshrn2<mode>_insn_le): Likewise.
29768 (aarch64_rshrn2<mode>_insn_be): Likewise.
29769 (aarch64_rshrn2<mode>): Likewise.
29770 * config/aarch64/aarch64.md (unspec): Add UNSPEC_RSHRN.
29771 * config/aarch64/arm_neon.h (vrshrn_high_n_s16): Reimplement
29773 (vrshrn_high_n_s32): Likewise.
29774 (vrshrn_high_n_s64): Likewise.
29775 (vrshrn_high_n_u16): Likewise.
29776 (vrshrn_high_n_u32): Likewise.
29777 (vrshrn_high_n_u64): Likewise.
29778 (vrshrn_n_s16): Likewise.
29779 (vrshrn_n_s32): Likewise.
29780 (vrshrn_n_s64): Likewise.
29781 (vrshrn_n_u16): Likewise.
29782 (vrshrn_n_u32): Likewise.
29783 (vrshrn_n_u64): Likewise.
29785 2021-02-01 Sergei Trofimovich <siarheit@google.com>
29787 PR tree-optimization/98499
29788 * ipa-modref.c (analyze_ssa_name_flags): treat RVO
29789 conservatively and assume all possible side-effects.
29791 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29793 * config/aarch64/aarch64-simd-builtins.def (vec_unpacks_hi,
29794 vec_unpacku_hi_): Define builtins.
29795 * config/aarch64/arm_neon.h (vmovl_high_s8): Reimplement using
29797 (vmovl_high_s16): Likewise.
29798 (vmovl_high_s32): Likewise.
29799 (vmovl_high_u8): Likewise.
29800 (vmovl_high_u16): Likewise.
29801 (vmovl_high_u32): Likewise.
29803 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29805 * config/aarch64/aarch64-simd-builtins.def (sabdl, uabdl):
29807 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl<mode>): New
29809 * config/aarch64/aarch64.md (unspec): Define UNSPEC_SABDL,
29811 * config/aarch64/arm_neon.h (vabdl_s8): Reimplemet using
29813 (vabdl_s16): Likewise.
29814 (vabdl_s32): Likewise.
29815 (vabdl_u8): Likewise.
29816 (vabdl_u16): Likewise.
29817 (vabdl_u32): Likewise.
29818 * config/aarch64/iterators.md (ABDL): New int iterator.
29819 (sur): Handle UNSPEC_SABDL, UNSPEC_UABDL.
29821 2021-02-01 Martin Sebor <msebor@redhat.com>
29823 * tree.h (BLOCK_VARS): Add comment.
29824 (BLOCK_SUBBLOCKS): Same.
29825 (BLOCK_SUPERCONTEXT): Same.
29826 (BLOCK_ABSTRACT_ORIGIN): Same.
29827 (inlined_function_outer_scope_p): Same.
29829 2021-02-01 Martin Sebor <msebor@redhat.com>
29831 PR middle-end/97172
29832 * attribs.c (attr_access::free_lang_data): Define new function.
29833 * attribs.h (attr_access::free_lang_data): Declare new function.
29835 2021-02-01 Richard Biener <rguenther@suse.de>
29837 * vec.h (auto_vec::auto_vec): Add memory stat parameters
29839 * bitmap.h (auto_bitmap::auto_bitmap): Likewise.
29841 2021-02-01 Tamar Christina <tamar.christina@arm.com>
29843 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>,
29844 aarch64_<su>mlsl<mode>, aarch64_<su>mlsl_n<mode>): Flip mult operands.
29846 2021-02-01 Richard Biener <rguenther@suse.de>
29848 PR rtl-optimization/98863
29849 * config/i386/i386-features.c (convert_scalars_to_vector):
29850 Set DF_RD_PRUNE_DEAD_DEFS.
29852 2021-01-31 Eric Botcazou <ebotcazou@adacore.com>
29854 * system.h (SIZE_MAX): Define if not already defined.
29856 2021-01-30 Aaron Sawdey <acsawdey@linux.ibm.com>
29858 * config/rs6000/genfusion.pl (gen_2logical): New function to
29859 generate patterns for logical-logical fusion.
29860 * config/rs6000/fusion.md: Regenerated patterns.
29861 * config/rs6000/rs6000-cpus.def: Add
29862 OPTION_MASK_P10_FUSION_2LOGICAL.
29863 * config/rs6000/rs6000.c (rs6000_option_override_internal):
29864 Enable logical-logical fusion for p10.
29865 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2logical.
29867 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
29869 * config/rs6000/rs6000.opt: Add periods to new AIX options.
29871 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
29873 * config/rs6000/rs6000.opt (mabi=vec-extabi): New.
29874 (mabi=vec-default): New.
29875 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
29876 __EXTABI__ for AIX Vector extended ABI.
29877 * config/rs6000/rs6000.c (rs6000_debug_reg_global): Print AIX Vector
29879 (conditional_register_usage): If AIX vec_extabi enabled, vs20-vs31
29881 * doc/invoke.texi (PowerPC mabi): Add AIX vec-extabi and vec-default.
29883 2021-01-30 Jakub Jelinek <jakub@redhat.com>
29885 * config/i386/i386-features.c (remove_partial_avx_dependency): Clear
29886 DF_DEFER_INSN_RESCAN after calling df_process_deferred_rescans.
29888 2021-01-29 Vladimir N. Makarov <vmakarov@redhat.com>
29891 * lra-constraints.c (in_class_p): Don't narrow class only for REG
29894 2021-01-29 Will Schmidt <will_schmidt@vnet.ibm.com>
29896 * config/rs6000/rs6000-call.c (rs6000_expand_binup_builtin): Add
29897 clauses for CODE_FOR_vsx_xvcvuxddp_scale and
29898 CODE_FOR_vsx_xvcvsxddp_scale to the parameter checking code.
29900 2021-01-29 Andrew MacLeod <amacleod@redhat.com>
29902 PR tree-optimization/98866
29903 * gimple-range-gori.h (gori_compute:set_range_invariant): New.
29904 * gimple-range-gori.cc (gori_map::set_range_invariant): New.
29905 (gori_map::m_maybe_invariant): Rename from all_outgoing.
29906 (gori_map::gori_map): Rename all_outgoing to m_maybe_invariant.
29907 (gori_map::is_export_p): Ditto.
29908 (gori_map::calculate_gori): Ditto.
29909 (gori_compute::set_range_invariant): New.
29910 * gimple-range.cc (gimple_ranger::range_of_stmt): Set range
29911 invariant for pointers evaluating to [1, +INF].
29913 2021-01-29 Richard Biener <rguenther@suse.de>
29915 PR rtl-optimization/98863
29916 * config/i386/i386-features.c (remove_partial_avx_dependency):
29917 Do not perform DF analysis.
29918 (pass_data_remove_partial_avx_dependency): Remove
29921 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
29923 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_n
29924 builtin generator macros.
29925 * config/aarch64/aarch64-simd.md (aarch64_<su>mull_n<mode>):
29927 * config/aarch64/arm_neon.h (vmull_n_s16): Use RTL builtin
29928 instead of inline asm.
29929 (vmull_n_s32): Likewise.
29930 (vmull_n_u16): Likewise.
29931 (vmull_n_u32): Likewise.
29933 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29935 * config/aarch64/aarch64-simd-builtins.def (sabdl2, uabdl2):
29937 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl2<mode>_3):
29939 (aarch64_<sur>abdl2<mode>): ... This.
29940 (<sur>sadv16qi): Adjust use of above.
29941 * config/aarch64/arm_neon.h (vabdl_high_s8): Reimplement using
29943 (vabdl_high_s16): Likewise.
29944 (vabdl_high_s32): Likewise.
29945 (vabdl_high_u8): Likewise.
29946 (vabdl_high_u16): Likewise.
29947 (vabdl_high_u32): Likewise.
29949 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29951 * config/aarch64/aarch64-simd-builtins.def (sabal2): Define
29953 (uabal2): Likewise.
29954 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal2<mode>): New
29956 * config/aarch64/aarch64.md (unspec): Add UNSPEC_SABAL2 and
29958 * config/aarch64/arm_neon.h (vabal_high_s8): Reimplement using
29960 (vabal_high_s16): Likewise.
29961 (vabal_high_s32): Likewise.
29962 (vabal_high_u8): Likewise.
29963 (vabal_high_u16): Likewise.
29964 (vabal_high_u32): Likewise.
29965 * config/aarch64/iterators.md (ABAL2): New mode iterator.
29966 (sur): Handle UNSPEC_SABAL2, UNSPEC_UABAL2.
29968 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29970 * config/aarch64/aarch64-simd-builtins.def (sabal): Define
29973 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal<mode>_4):
29975 (aarch64_<sur>abal<mode>): ... This
29976 (<sur>sadv16qi): Adust use of the above.
29977 * config/aarch64/arm_neon.h (vabal_s8): Reimplement using
29979 (vabal_s16): Likewise.
29980 (vabal_s32): Likewise.
29981 (vabal_u8): Likewise.
29982 (vabal_u16): Likewise.
29983 (vabal_u32): Likewise.
29985 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29987 * config/aarch64/aarch64-simd-builtins.def (saddlv, uaddlv):
29989 * config/aarch64/aarch64-simd.md (aarch64_<su>addlv<mode>):
29991 * config/aarch64/arm_neon.h (vaddlv_s8): Reimplement using
29993 (vaddlv_s16): Likewise.
29994 (vaddlv_u8): Likewise.
29995 (vaddlv_u16): Likewise.
29996 (vaddlvq_s8): Likewise.
29997 (vaddlvq_s16): Likewise.
29998 (vaddlvq_s32): Likewise.
29999 (vaddlvq_u8): Likewise.
30000 (vaddlvq_u16): Likewise.
30001 (vaddlvq_u32): Likewise.
30002 (vaddlv_s32): Likewise.
30003 (vaddlv_u32): Likewise.
30004 * config/aarch64/iterators.md (VDQV_L): New mode iterator.
30005 (unspec): Add UNSPEC_SADDLV, UNSPEC_UADDLV.
30006 (Vwstype): New mode attribute.
30008 (VWIDE_S): Likewise.
30009 (USADDLV): New int iterator.
30010 (su): Handle UNSPEC_SADDLV, UNSPEC_UADDLV.
30012 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
30014 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_lane[q]
30015 builtin generator macros.
30016 * config/aarch64/aarch64-simd.md (aarch64_vec_<su>mlsl_lane<Qlane>):
30018 * config/aarch64/arm_neon.h (vmlsl_lane_s16): Use RTL builtin
30019 instead of inline asm.
30020 (vmlsl_lane_s32): Likewise.
30021 (vmlsl_lane_u16): Likewise.
30022 (vmlsl_lane_u32): Likewise.
30023 (vmlsl_laneq_s16): Likewise.
30024 (vmlsl_laneq_s32): Likewise.
30025 (vmlsl_laneq_u16): Likewise.
30026 (vmlsl_laneq_u32): Likewise.
30028 2021-01-29 Richard Biener <rguenther@suse.de>
30030 * doc/invoke.texi (--param max-gcse-memory): Document unit
30032 * gcse.c (gcse_or_cprop_is_too_expensive): Adjust.
30033 * params.opt (--param max-gcse-memory): Adjust default and
30034 document unit of size.
30036 2021-01-29 Richard Biener <rguenther@suse.de>
30038 PR rtl-optimization/98863
30039 * gcse.c (gcse_or_cprop_is_too_expensive): Use unsigned
30040 HOST_WIDE_INT for the memory estimate.
30042 2021-01-29 Bin Cheng <bin.cheng@linux.alibaba.com>
30043 Richard Biener <rguenther@suse.de>
30045 PR tree-optimization/97627
30046 * tree-ssa-loop-niter.c (number_of_iterations_exit_assumptions):
30047 Do not analyze fake edges.
30049 2021-01-29 Richard Biener <rguenther@suse.de>
30051 PR rtl-optimization/98144
30052 * df.h (df_mir_bb_info): Add con_visited member.
30053 * df-problems.c (df_mir_alloc): Initialize con_visited,
30054 do not fully populate IN and OUT.
30055 (df_mir_reset): Likewise.
30056 (df_mir_confluence_0): Set con_visited.
30057 (df_mir_confluence_n): Properly handle implicitely
30058 fully populated IN and OUT as designated by con_visited
30059 and update con_visited accordingly.
30061 2021-01-29 Jakub Jelinek <jakub@redhat.com>
30064 * config/arm/vec-common.md (mve_vshlq_<supf><mode>,
30065 vashl<mode>3, vashr<mode>3, vlshr<mode>3): Add
30066 && !TARGET_REALLY_IWMMXT to conditions.
30068 2021-01-29 Jakub Jelinek <jakub@redhat.com>
30071 * cfgbuild.c (find_bb_boundaries): Reset debug_insn when seeing
30074 2021-01-28 Marek Polacek <polacek@redhat.com>
30077 * stor-layout.c (finalize_type_size): If we reset TYPE_USER_ALIGN in
30078 the main variant, maybe reset it in its variants too.
30079 * tree.c (check_base_type): Return true only if TYPE_USER_ALIGN match.
30080 (check_aligned_type): Check if TYPE_USER_ALIGN match.
30082 2021-01-28 Christophe Lyon <christophe.lyon@linaro.org>
30085 * config/arm/arm.c (arm_rtx_costs_internal): Adjust cost of vector
30086 of constant zero for comparisons.
30088 2021-01-28 Michael Meissner <meissner@linux.ibm.com>
30090 * config/rs6000/rs6000.c (rs6000_mangle_decl_assembler_name): Add
30091 support for mapping built-in function names for long double
30092 built-in functions if long double is IEEE 128-bit.
30094 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
30096 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_n
30097 builtin generator macros.
30098 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_n<mode>):
30100 * config/aarch64/arm_neon.h (vmlsl_n_s16): Use RTL builtin
30101 instead of inline asm.
30102 (vmlsl_n_s32): Likewise.
30103 (vmlsl_n_u16): Likewise.
30104 (vmlsl_n_u32): Likewise.
30106 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
30108 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_n
30109 builtin generator macros.
30110 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>):
30112 * config/aarch64/arm_neon.h (vmlal_n_s16): Use RTL builtin
30113 instead of inline asm.
30114 (vmlal_n_s32): Likewise.
30115 (vmlal_n_u16): Likewise.
30116 (vmlal_n_u32): Likewise.
30118 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30120 * config/aarch64/aarch64-simd-builtins.def (shrn2): Define
30122 * config/aarch64/aarch64-simd.md (aarch64_shrn2<mode>_insn_le):
30124 (aarch64_shrn2<mode>_insn_be): Likewise.
30125 (aarch64_shrn2<mode>): Likewise.
30126 * config/aarch64/arm_neon.h (vshrn_high_n_s16): Reimlplement
30128 (vshrn_high_n_s32): Likewise.
30129 (vshrn_high_n_s64): Likewise.
30130 (vshrn_high_n_u16): Likewise.
30131 (vshrn_high_n_u32): Likewise.
30132 (vshrn_high_n_u64): Likewise.
30134 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30136 * config/aarch64/aarch64-simd-builtins.def (shrn): Define
30138 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le):
30140 (aarch64_shrn<mode>_insn_be): Likewise.
30141 (aarch64_shrn<mode>): Likewise.
30142 * config/aarch64/arm_neon.h (vshrn_n_s16): Reimplement using
30144 (vshrn_n_s32): Likewise.
30145 (vshrn_n_s64): Likewise.
30146 (vshrn_n_u16): Likewise.
30147 (vshrn_n_u32): Likewise.
30148 (vshrn_n_u64): Likewise.
30149 * config/aarch64/iterators.md (vn_mode): New mode attribute.
30151 2021-01-28 Richard Biener <rguenther@suse.de>
30153 PR rtl-optimization/80960
30154 * dse.c (check_mem_read_rtx): Call get_addr on the
30157 2021-01-28 Xionghu Luo <luoxhu@linux.ibm.com>
30158 David Edelsohn <dje.gcc@gmail.com>
30161 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
30162 Don't generate VIEW_CONVERT_EXPR for fcode ALTIVEC_BUILTIN_VEC_INSERT
30164 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
30166 * config/rs6000/rs6000.c (rs6000_expand_vector_set): Remove the
30167 wrapper call rs6000_expand_vector_set_var for cleanup. Call
30168 rs6000_expand_vector_set_var_p9 and rs6000_expand_vector_set_var_p8
30170 (rs6000_expand_vector_set_var): Delete.
30171 (rs6000_expand_vector_set_var_p9): Make static.
30172 (rs6000_expand_vector_set_var_p8): Make static.
30174 2021-01-28 Xing GUO <higuoxing@gmail.com>
30176 * common/config/riscv/riscv-common.c
30177 (riscv_subset_list::parsing_subset_version): Fix -march option parsing
30178 when `p` extension exists.
30180 2021-01-27 Vladimir N. Makarov <vmakarov@redhat.com>
30182 PR rtl-optimization/97684
30183 * ira.c (ira): Call ira_set_pseudo_classes before
30184 update_equiv_regs when it is necessary.
30186 2021-01-27 Jakub Jelinek <jakub@redhat.com>
30189 * config/aarch64/aarch64.md (*aarch64_bfxilsi_uxtw): Use
30190 %w0, %w1 and %2 instead of %0, %1 and %2.
30192 2021-01-27 Aaron Sawdey <acsawdey@linux.ibm.com>
30194 * config/rs6000/genfusion.pl: New script to generate
30195 define_insn_and_split patterns so combine can arrange fused
30196 instructions next to each other.
30197 * config/rs6000/fusion.md: New file, generated fused instruction
30198 patterns for combine.
30199 * config/rs6000/predicates.md (const_m1_to_1_operand): New predicate.
30200 (non_update_memory_operand): New predicate.
30201 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_P10_FUSION and
30202 OPTION_MASK_P10_FUSION_LD_CMPI to ISA_3_1_MASKS_SERVER and
30204 * config/rs6000/rs6000-protos.h (address_is_non_pfx_d_or_x): Add
30206 * config/rs6000/rs6000.c (rs6000_option_override_internal):
30207 Automatically set OPTION_MASK_P10_FUSION and
30208 OPTION_MASK_P10_FUSION_LD_CMPI if target is power10.
30209 (rs600_opt_masks): Allow -mpower10-fusion
30210 in function attributes.
30211 (address_is_non_pfx_d_or_x): New function.
30212 * config/rs6000/rs6000.h: Add MASK_P10_FUSION.
30213 * config/rs6000/rs6000.md: Include fusion.md.
30214 * config/rs6000/rs6000.opt: Add -mpower10-fusion
30215 and -mpower10-fusion-ld-cmpi.
30216 * config/rs6000/t-rs6000: Add dependencies involving fusion.md.
30218 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
30220 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal
30221 builtin generator macros.
30222 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal<mode>):
30224 (aarch64_<su>mlal<mode>): This.
30225 * config/aarch64/arm_neon.h (vmlal_s8): Use RTL builtin
30226 instead of inline asm.
30227 (vmlal_s16): Likewise.
30228 (vmlal_s32): Likewise.
30229 (vmlal_u8): Likewise.
30230 (vmlal_u16): Likewise.
30231 (vmlal_u32): Likewise.
30233 2021-01-27 Richard Biener <rguenther@suse.de>
30235 PR tree-optimization/98854
30236 * tree-vect-slp.c (vect_build_slp_tree_2): Also build
30237 PHIs from scalars when the number of CTORs matches the
30238 number of children.
30240 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
30242 * config/aarch64/aarch64-simd-builtins.def: Add mls_n builtin
30244 * config/aarch64/aarch64-simd.md (*aarch64_mls_elt_merge<mode>):
30246 (aarch64_mls_n<mode>): This.
30247 * config/aarch64/arm_neon.h (vmls_n_s16): Use RTL builtin
30249 (vmls_n_s32): Likewise.
30250 (vmls_n_u16): Likewise.
30251 (vmls_n_u32): Likewise.
30252 (vmlsq_n_s16): Likewise.
30253 (vmlsq_n_s32): Likewise.
30254 (vmlsq_n_u16): Likewise.
30255 (vmlsq_n_u32): Likewise.
30257 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
30259 * config/aarch64/aarch64-simd-builtins.def: Add mls builtin
30261 * config/aarch64/arm_neon.h (vmls_s8): Use RTL builtin rather
30263 (vmls_s16): Likewise.
30264 (vmls_s32): Likewise.
30265 (vmls_u8): Likewise.
30266 (vmls_u16): Likewise.
30267 (vmls_u32): Likewise.
30268 (vmlsq_s8): Likewise.
30269 (vmlsq_s16): Likewise.
30270 (vmlsq_s32): Likewise.
30271 (vmlsq_u8): Likewise.
30272 (vmlsq_u16): Likewise.
30273 (vmlsq_u32): Likewise.
30275 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
30277 * config/aarch64/aarch64-simd-builtins.def: Add mla_n builtin
30279 * config/aarch64/aarch64-simd.md (*aarch64_mla_elt_merge<mode>):
30281 (aarch64_mla_n<mode>): This.
30282 * config/aarch64/arm_neon.h (vmla_n_s16): Use RTL builtin
30284 (vmla_n_s32): Likewise.
30285 (vmla_n_u16): Likewise.
30286 (vmla_n_u32): Likewise.
30287 (vmlaq_n_s16): Likewise.
30288 (vmlaq_n_s32): Likewise.
30289 (vmlaq_n_u16): Likewise.
30290 (vmlaq_n_u32): Likewise.
30292 2021-01-27 liuhongt <hongtao.liu@intel.com>
30295 * config/i386/sse.md (sse2_gt<mode>3): Drop !TARGET_XOP in condition.
30296 (*sse2_eq<mode>3): Ditto.
30298 2021-01-27 Jakub Jelinek <jakub@redhat.com>
30300 * tree-pass.h (PROP_trees): Rename to ...
30301 (PROP_gimple): ... this.
30302 * cfgexpand.c (pass_data_expand): Replace PROP_trees with PROP_gimple.
30303 * passes.c (execute_function_dump, execute_function_todo,
30304 execute_one_ipa_transform_pass, execute_one_pass): Likewise.
30305 * varpool.c (ctor_for_folding): Likewise.
30307 2021-01-27 Jakub Jelinek <jakub@redhat.com>
30309 PR tree-optimization/97260
30310 * varpool.c: Include tree-pass.h.
30311 (ctor_for_folding): In GENERIC return DECL_INITIAL for TREE_READONLY
30312 non-TREE_SIDE_EFFECTS automatic variables.
30314 2021-01-26 Paul Fee <paul.f.fee@gmail.com>
30316 * doc/cpp.texi (__cplusplus): Document value for -std=c++23
30318 * doc/invoke.texi: Document -std=c++23 and -std=gnu++23.
30319 * dwarf2out.c (highest_c_language): Recognise C++20 and C++23.
30320 (gen_compile_unit_die): Recognise C++23.
30322 2021-01-26 Jakub Jelinek <jakub@redhat.com>
30325 * dwarf2asm.c (dw2_assemble_integer): Cast DWARF2_ADDR_SIZE to int
30328 2021-01-26 Jakub Jelinek <jakub@redhat.com>
30331 * config/aarch64/aarch64.c (aarch64_mask_and_shift_for_ubfiz_p):
30332 Use UINTVAL (shft_amnt) and UINTVAL (mask) instead of INTVAL (shft_amnt)
30333 and INTVAL (mask). Add && INTVAL (mask) > 0 condition.
30335 2021-01-26 Richard Biener <rguenther@suse.de>
30337 * gimple-pretty-print.c (dump_binary_rhs): Handle
30338 VEC_WIDEN_{PLUS,MINUS}_{LO,HI}_EXPR.
30340 2021-01-26 Richard Biener <rguenther@suse.de>
30342 PR middle-end/98726
30343 * tree.h (vector_cst_int_elt): Remove.
30344 * tree.c (vector_cst_int_elt): Use poly_wide_int for computations,
30347 2021-01-26 Andrew Stubbs <ams@codesourcery.com>
30349 * config/gcn/gcn.c (gcn_expand_reduc_scalar): Use move instructions
30350 for V64DFmode min/max reductions.
30352 2021-01-26 Jakub Jelinek <jakub@redhat.com>
30354 * dwarf2asm.c (dw2_assemble_integer): Handle size twice as large
30355 as DWARF2_ADDR_SIZE if x is not a scalar int by emitting it as
30356 two halves, one with x and the other with const0_rtx, ordered
30357 depending on endianity.
30359 2021-01-26 Alexandre Oliva <oliva@adacore.com>
30361 * gimplify.c (gimplify_decl_expr): Skip asan marking calls for
30362 temporaries not seen in binding block, and not about to be
30363 added as gimple variables.
30365 2021-01-25 Martin Sebor <msebor@redhat.com>
30368 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Adjust warning text.
30370 2021-01-25 Martin Liska <mliska@suse.cz>
30372 * value-prof.c (get_nth_most_common_value): Use %s instead
30375 2021-01-25 Jakub Jelinek <jakub@redhat.com>
30378 * configure.ac (HAVE_AS_GDWARF_5_DEBUG_FLAG): Only define if
30379 readelf -wi is able to read the emitted .debug_info back.
30380 * configure: Regenerated.
30382 2021-01-25 Martin Liska <mliska@suse.cz>
30384 PR gcov-profile/98739
30385 * common.opt: Add missing sign symbol.
30386 * value-prof.c (get_nth_most_common_value): Restore handling
30387 of PROFILE_REPRODUCIBILITY_PARALLEL_RUNS and
30388 PROFILE_REPRODUCIBILITY_MULTITHREADED.
30390 2021-01-25 Richard Biener <rguenther@suse.de>
30392 PR middle-end/98807
30393 * tree.c (vector_element_bits): Always use precision of
30394 the element type for boolean vectors.
30396 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
30398 * config/rtems.h (STARTFILE_SPEC): Remove qnolinkcmds.
30399 (ENDFILE_SPEC): Evaluate qnolinkcmds.
30401 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
30403 * config/rtems.h (STARTFILE_SPEC): Remove nostdlib and
30404 nostartfiles handling since this is already done by
30405 LINK_COMMAND_SPEC. Evaluate qnolinkcmds.
30406 (ENDFILE_SPEC): Remove nostdlib and nostartfiles handling since this
30407 is already done by LINK_COMMAND_SPEC.
30408 (LIB_SPECS): Remove nostdlib and nodefaultlibs handling since
30409 this is already done by LINK_COMMAND_SPEC. Remove qnolinkcmds
30412 2021-01-25 Jakub Jelinek <jakub@redhat.com>
30415 * fold-const-call.c (host_size_t_cst_p): Renamed to ...
30416 (size_t_cst_p): ... this. Check and store unsigned HOST_WIDE_INT
30417 value rather than host size_t.
30418 (fold_const_call): Change type of s2 from size_t to
30419 unsigned HOST_WIDE_INT. Use size_t_cst_p instead of
30420 host_size_t_cst_p. For strncmp calls, pass MIN (s2, SIZE_MAX)
30421 instead of s2 as last argument.
30423 2021-01-25 Tamar Christina <tamar.christina@arm.com>
30425 * config/arm/iterators.md (rotsplit1, rotsplit2, conj_op, fcmac1,
30426 VCMLA_OP, VCMUL_OP): New.
30427 * config/arm/mve.md (mve_vcmlaq<mve_rot><mode>): Support vec_dup 0.
30428 * config/arm/neon.md (cmul<conj_op><mode>3): New.
30429 * config/arm/unspecs.md (UNSPEC_VCMLA_CONJ, UNSPEC_VCMLA180_CONJ,
30430 UNSPEC_VCMUL_CONJ): New.
30431 * config/arm/vec-common.md (cmul<conj_op><mode>3, arm_vcmla<rot><mode>,
30432 cml<fcmac1><conj_op><mode>4): New.
30434 2021-01-23 Jakub Jelinek <jakub@redhat.com>
30437 * config/rs6000/mmintrin.h (__m64): Add __may_alias__ attribute.
30439 2021-01-22 Jonathan Wright <jonathan.wright@arm.com>
30441 * config/aarch64/aarch64-simd-builtins.def: Add mla builtin
30443 * config/aarch64/arm_neon.h (vmla_s8): Use RTL builtin rather
30445 (vmla_s16): Likewise.
30446 (vmla_s32): Likewise.
30447 (vmla_u8): Likewise.
30448 (vmla_u16): Likewise.
30449 (vmla_u32): Likewise.
30450 (vmlaq_s8): Likewise.
30451 (vmlaq_s16): Likewise.
30452 (vmlaq_s32): Likewise.
30453 (vmlaq_u8): Likewise.
30454 (vmlaq_u16): Likewise.
30455 (vmlaq_u32): Likewise.
30457 2021-01-22 David Malcolm <dmalcolm@redhat.com>
30459 * doc/invoke.texi (GCC_EXTRA_DIAGNOSTIC_OUTPUT): Add @findex
30462 2021-01-22 Jakub Jelinek <jakub@redhat.com>
30465 * dwarf2out.c (output_file_names): For -gdwarf-5, if there are no
30466 filenames to emit, still emit the required 0 index directory and
30467 filename entries that match DW_AT_comp_dir and DW_AT_name of the
30470 2021-01-22 Marek Polacek <polacek@redhat.com>
30473 * doc/invoke.texi: Update C++ ABI Version 15 description.
30475 2021-01-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30477 PR tree-optimization/98766
30478 * tree-ssa-math-opts.c (convert_mult_to_fma): Use maybe_le when
30479 comparing against type size with param_avoid_fma_max_bits.
30481 2021-01-22 Richard Biener <rguenther@suse.de>
30483 PR middle-end/98793
30484 * tree.c (vector_element_bits): Key single-bit bool vector on
30485 integer mode rather than not vector mode.
30487 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
30490 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
30491 Generate ARRAY_REF(VIEW_CONVERT_EXPR) for P8 and later
30493 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): Update
30494 to call different path for P8 and P9.
30495 (rs6000_expand_vector_set_var_p9): New function.
30496 (rs6000_expand_vector_set_var_p8): New function.
30498 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
30502 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
30503 Ajdust variable index vec_insert from address dereference to
30504 ARRAY_REF(VIEW_CONVERT_EXPR) tree expression.
30505 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
30507 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): New function.
30509 2021-01-22 Martin Liska <mliska@suse.cz>
30511 PR gcov-profile/98739
30512 * profile.c (compute_value_histograms): Drop time profile for
30513 -fprofile-reproducible=multithreaded.
30515 2021-01-22 Nathan Sidwell <nathan@acm.org>
30517 * gcc.c (process_command): Don't check OPT_SPECIAL_input_file
30520 2021-01-22 Richard Biener <rguenther@suse.de>
30522 PR middle-end/98773
30523 * tree-data-ref.c (initalize_matrix_A): Revert previous
30524 change, retaining failing on HOST_WIDE_INT_MIN CHREC_RIGHT.
30526 2021-01-22 Jakub Jelinek <jakub@redhat.com>
30528 PR tree-optimization/90248
30529 * match.pd (X cmp 0.0 ? 1.0 : -1.0 -> copysign(1, +-X),
30530 X cmp 0.0 ? -1.0 : +1.0 -> copysign(1, -+X)): Remove
30532 (X * (X cmp 0.0 ? 1.0 : -1.0) -> +-abs(X),
30533 X * (X cmp 0.0 ? -1.0 : 1.0) -> +-abs(X)): New simplifications.
30535 2021-01-22 Jakub Jelinek <jakub@redhat.com>
30537 PR tree-optimization/98255
30538 * tree-dfa.c (get_ref_base_and_extent): For ARRAY_REFs, sign
30539 extend index - low_bound from sizetype's precision rather than index
30541 (get_addr_base_and_unit_offset_1): Likewise.
30542 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Likewise.
30543 * gimple-fold.c (fold_const_aggregate_ref_1): Likewise.
30545 2021-01-22 Richard Biener <rguenther@suse.de>
30547 PR tree-optimization/98786
30548 * tree-ssa-phiopt.c (factor_out_conditional_conversion): Avoid
30549 adding new uses of abnormals. Verify we deal with a conditional
30552 2021-01-22 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
30555 * optc-save-gen.awk: Add arm_fp16_format to checked_options.
30557 2021-01-22 liuhongt <hongtao.liu@intel.com>
30561 * config/i386/sse.md (VI_128_256): New mode iterator.
30562 (*avx_cmp<mode>3_1, *avx_cmp<mode>3_2, *avx_cmp<mode>3_3,
30563 *avx_cmp<mode>3_4, *avx2_eq<mode>3, *avx2_pcmp<mode>3_1,
30564 *avx2_pcmp<mode>3_2, *avx2_gt<mode>3): New
30565 define_insn_and_split to lower avx512 vector comparison to avx
30566 version when dest is vector.
30567 (*<avx512>_cmp<mode>3,*<avx512>_cmp<mode>3,*<avx512>_ucmp<mode>3):
30568 define_insn_and_split for negating the comparison result.
30569 * config/i386/predicates.md (float_vector_all_ones_operand):
30571 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Use
30572 general NOT operator without UNSPEC_MASKOP.
30574 2021-01-21 Vladimir N. Makarov <vmakarov@redhat.com>
30576 PR rtl-optimization/98777
30577 * lra-int.h (lra_pmode_pseudo): New extern.
30578 * lra.c (lra_pmode_pseudo): New global.
30580 * lra-eliminations.c (eliminate_regs_in_insn): Use it.
30582 2021-01-21 Ilya Leoshkevich <iii@linux.ibm.com>
30584 * fwprop.c (fwprop_propagation::classify_result): Allow
30585 (subreg (mem)) simplifications.
30587 2021-01-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30589 * config/aarch64/aarch64-simd.md (aarch64_sqdml<SBINQOPS:as>l<mode>):
30591 (aarch64_sqdmlal<mode>): ... This...
30592 (aarch64_sqdmlsl<mode>): ... And this.
30593 (aarch64_sqdml<SBINQOPS:as>l_lane<mode>): Split into...
30594 (aarch64_sqdmlal_lane<mode>): ... This...
30595 (aarch64_sqdmlsl_lane<mode>): ... And this.
30596 (aarch64_sqdml<SBINQOPS:as>l_laneq<mode>): Split into...
30597 (aarch64_sqdmlsl_laneq<mode>): ... This...
30598 (aarch64_sqdmlal_laneq<mode>): ... And this.
30599 (aarch64_sqdml<SBINQOPS:as>l_n<mode>): Split into...
30600 (aarch64_sqdmlsl_n<mode>): ... This...
30601 (aarch64_sqdmlal_n<mode>): ... And this.
30602 (aarch64_sqdml<SBINQOPS:as>l2<mode>_internal): Split into...
30603 (aarch64_sqdmlal2<mode>_internal): ... This...
30604 (aarch64_sqdmlsl2<mode>_internal): ... And this.
30606 2021-01-21 Christophe Lyon <christophe.lyon@linaro.org>
30608 * config/arm/arm_mve.h (__arm_vcmpneq_s8): Fix return type.
30610 2021-01-21 Andrea Corallo <andrea.corallo@arm.com>
30613 * doc/sourcebuild.texi (arm_thumb2_no_arm_v8_1_lob): Document.
30615 2021-01-21 liuhongt <hongtao.liu@intel.com>
30617 PR rtl-optimization/98694
30618 * regcprop.c (copy_value): If SRC had been assigned a mode
30619 narrower than the copy, we can't link DEST into the chain even
30620 they have same hard_regno_nregs(i.e. HImode/SImode in i386
30623 2021-01-20 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30625 * config/aarch64/aarch64-simd.md (aarch64_get_lane<mode>):
30626 Convert to define_insn_and_split. Split into simple move when moving
30629 2021-01-20 Segher Boessenkool <segher@kernel.crashing.org>
30631 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_store): Change assert.
30632 Adjust comment. Simplify code.
30634 2021-01-20 Jakub Jelinek <jakub@redhat.com>
30637 * dwarf2out.c (reset_indirect_string): Also reset indirect strings
30638 with DW_FORM_line_strp form.
30639 (prune_unused_types_update_strings): Don't add into debug_str_hash
30640 indirect strings with DW_FORM_line_strp form.
30641 (adjust_name_comp_dir): New function.
30642 (dwarf2out_finish): Call it on CU DIEs after resetting
30643 debug_line_str_hash.
30645 2021-01-20 Vladimir N. Makarov <vmakarov@redhat.com>
30647 PR rtl-optimization/98722
30648 * lra-eliminations.c (eliminate_regs_in_insn): Check that target
30649 has no 3-op add insn to transform insns containing two pluses.
30651 2021-01-20 Richard Biener <rguenther@suse.de>
30653 * hwint.h (add_hwi): New function.
30654 (mul_hwi): Likewise.
30655 * tree-data-ref.c (initialize_matrix_A): Properly translate
30656 tree constants and avoid HOST_WIDE_INT_MIN.
30657 (lambda_matrix_row_add): Avoid undefined integer overflow
30658 and return true on such overflow.
30659 (lambda_matrix_right_hermite): Handle overflow from
30660 lambda_matrix_row_add gracefully. Simplify previous fix.
30661 (analyze_subscript_affine_affine): Likewise.
30663 2021-01-20 Eugene Rozenfeld <erozen@microsoft.com>
30665 PR tree-optimization/96674
30666 * match.pd: New patterns: x < y || y == XXX_MIN --> x <= y - 1
30667 x >= y && y != XXX_MIN --> x > y - 1
30669 2021-01-20 Richard Sandiford <richard.sandiford@arm.com>
30671 PR tree-optimization/98535
30672 * tree-vect-slp.c (duplicate_and_interleave): Use quick_grow_cleared.
30673 If the high and low permutes are the same, remove the high permutes
30674 from the working set and only continue with the low ones.
30676 2021-01-20 Jakub Jelinek <jakub@redhat.com>
30678 PR tree-optimization/98721
30679 * builtins.c (access_ref::inform_access): Don't assume
30680 SSA_NAME_IDENTIFIER must be non-NULL. Print messages about
30681 object whenever allocfn is NULL, rather than only when DECL_P
30682 is true. Use %qE instead of %qD for that. Formatting fixes.
30684 2021-01-20 Richard Biener <rguenther@suse.de>
30686 PR tree-optimization/98758
30687 * tree-data-ref.c (int_divides_p): Use lambda_int arguments.
30688 (lambda_matrix_right_hermite): Avoid undefinedness with
30689 signed integer abs and multiplication.
30690 (analyze_subscript_affine_affine): Use lambda_int.
30692 2021-01-20 David Malcolm <dmalcolm@redhat.com>
30695 * dwarf2out.c (output_line_info): Rename static variable
30696 "generation", moving it out of the function to...
30697 (output_line_info_generation): New.
30698 (init_sections_and_labels): Likewise, renaming the variable to...
30699 (init_sections_and_labels_generation): New.
30700 (dwarf2out_c_finalize): Reset the new variables.
30702 2021-01-19 Martin Sebor <msebor@redhat.com>
30704 PR middle-end/98664
30705 * tree-ssa-live.c (remove_unused_scope_block_p): Keep scopes for
30706 all functions, even if they're not declared artificial or inline.
30707 * tree.c (tree_inlined_location): Use macro expansion location
30708 only if scope traversal fails to expose one.
30710 2021-01-19 Richard Sandiford <richard.sandiford@arm.com>
30712 PR rtl-optimization/92294
30713 * alias.c (compare_base_symbol_refs): Take an extra parameter
30714 and add the distance between two symbols to it. Enshrine in
30715 comments that -1 means "either 0 or 1, but we can't tell
30716 which at compile time".
30717 (memrefs_conflict_p): Update call accordingly.
30718 (rtx_equal_for_memref_p): Likewise. Take the distance between symbols
30721 2021-01-19 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30723 * config/aarch64/aarch64-simd-builtins.def (sqshl, uqshl,
30724 sqrshl, uqrshl, sqadd, uqadd, sqsub, uqsub, suqadd, usqadd, sqmovn,
30725 uqmovn, sqxtn2, uqxtn2, sqabs, sqneg, sqdmlal, sqdmlsl, sqdmlal_lane,
30726 sqdmlsl_lane, sqdmlal_laneq, sqdmlsl_laneq, sqdmlal_n, sqdmlsl_n,
30727 sqdmlal2, sqdmlsl2, sqdmlal2_lane, sqdmlsl2_lane, sqdmlal2_laneq,
30728 sqdmlsl2_laneq, sqdmlal2_n, sqdmlsl2_n, sqdmull, sqdmull_lane,
30729 sqdmull_laneq, sqdmull_n, sqdmull2, sqdmull2_lane, sqdmull2_laneq,
30730 sqdmull2_n, sqdmulh, sqrdmulh, sqdmulh_lane, sqdmulh_laneq,
30731 sqrdmulh_lane, sqrdmulh_laneq, sqshrun_n, sqrshrun_n, sqshrn_n,
30732 uqshrn_n, sqrshrn_n, uqrshrn_n, sqshlu_n, sqshl_n, uqshl_n, sqrdmlah,
30733 sqrdmlsh, sqrdmlah_lane, sqrdmlsh_lane, sqrdmlah_laneq, sqrdmlsh_laneq,
30734 sqmovun): Use NONE flags.
30736 2021-01-19 Richard Biener <rguenther@suse.de>
30739 * ipa-modref.c (analyze_stmt): Only record a summary for a
30742 2021-01-19 Richard Biener <rguenther@suse.de>
30744 PR middle-end/98638
30745 * tree-ssanames.c (fini_ssanames): Zero SSA_NAME_DEF_STMT.
30747 2021-01-19 Daniel Hellstrom <daniel@gaisler.com>
30749 * config/sparc/rtemself.h (TARGET_OS_CPP_BUILTINS): Add
30750 built-in define __FIX_LEON3FT_TN0018.
30752 2021-01-19 Richard Biener <rguenther@suse.de>
30755 * tree-inline.c (tree_function_versioning): Set input_location
30756 to UNKNOWN_LOCATION throughout the function.
30758 2021-01-19 Tobias Burnus <tobias@codesourcery.com>
30761 * omp-low.c (lower_omp_target): Handle nonpointer is_device_ptr.
30763 2021-01-19 Martin Jambor <mjambor@suse.cz>
30766 * ipa-sra.c (ssa_name_only_returned_p): New parameter fun. Check
30767 whether non-call exceptions allow removal of a statement.
30768 (isra_analyze_call): Pass the appropriate function to
30769 ssa_name_only_returned_p.
30771 2021-01-19 Geng Qi <gengqi@linux.alibaba.com>
30773 * config/riscv/arch-canonicalize (longext_sort): New function for
30774 sorting 'multi-letter'.
30775 * config/riscv/multilib-generator: Adjusting the loop of 'alt' in
30776 'alts'. The 'arch' may not be the first of 'alts'.
30777 (_expand_combination): Add underline for the 'ext' without '*'.
30778 This is because, a single-letter extension can always be treated well
30779 with a '_' prefix, but it cannot be separated out if it is appended
30782 2021-01-18 Vladimir N. Makarov <vmakarov@redhat.com>
30785 * ira.c (ira): Skip abnormal critical edge splitting.
30787 2021-01-18 Jakub Jelinek <jakub@redhat.com>
30789 PR tree-optimization/98727
30790 * tree-ssa-math-opts.c (match_arith_overflow): Fix up computation of
30791 second .MUL_OVERFLOW operand for signed multiplication with overflow
30792 checking if the second operand of multiplication is not constant.
30794 2021-01-18 David Edelsohn <dje.gcc@gmail.com>
30796 * doc/invoke.texi (-gdwarf): TPF defaults to version 2 and AIX
30797 defaults to version 4.
30799 2021-01-18 David Malcolm <dmalcolm@redhat.com>
30801 * attribs.h (fndecl_dealloc_argno): New decl.
30802 * builtins.c (call_dealloc_argno): Split out second half of
30804 (fndecl_dealloc_argno): New.
30805 * doc/extend.texi (Common Function Attributes): Document the
30806 interaction between the analyzer and the malloc attribute.
30807 * doc/invoke.texi (Static Analyzer Options): Likewise.
30809 2021-01-17 David Edelsohn <dje.gcc@gmail.com>
30811 * config/rs6000/aix71.h (SUBTARGET_OVERRIDE_OPTIONS): Override
30812 dwarf_version to 4.
30813 * config/rs6000/aix72.h (SUBTARGET_OVERRIDE_OPTIONS): Same.
30815 2021-01-17 Martin Jambor <mjambor@suse.cz>
30818 * cgraph.c (clone_of_p): Check also former_clone_of as we climb
30821 2021-01-17 Mark Wielaard <mark@klomp.org>
30823 * common.opt (gdwarf-): Init(5).
30824 * doc/invoke.texi (-gdwarf): Document default to 5.
30826 2021-01-16 Kwok Cheung Yeung <kcy@codesourcery.com>
30828 * builtin-types.def
30829 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT): Rename
30831 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR):
30832 ...this. Add extra argument.
30833 * gimplify.c (omp_default_clause): Ensure that event handle is
30834 firstprivate in a task region.
30835 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_DETACH.
30836 (gimplify_adjust_omp_clauses): Likewise.
30837 * omp-builtins.def (BUILT_IN_GOMP_TASK): Change function type to
30838 BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR.
30839 * omp-expand.c (expand_task_call): Add GOMP_TASK_FLAG_DETACH to flags
30840 if detach clause specified. Add detach argument when generating
30842 * omp-low.c (scan_sharing_clauses): Setup data environment for detach
30844 (finish_taskreg_scan): Move field for variable containing the event
30845 handle to the front of the struct.
30846 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_DETACH. Fix
30848 * tree-nested.c (convert_nonlocal_omp_clauses): Handle
30849 OMP_CLAUSE_DETACH clause.
30850 (convert_local_omp_clauses): Handle OMP_CLAUSE_DETACH clause.
30851 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_DETACH.
30852 * tree.c (omp_clause_num_ops): Add entry for OMP_CLAUSE_DETACH.
30854 (omp_clause_code_name): Add entry for OMP_CLAUSE_DETACH. Fix
30856 (walk_tree_1): Handle OMP_CLAUSE_DETACH.
30858 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
30860 * config/nios2/t-rtems: Reset all MULTILIB_* variables. Shorten
30861 multilib directory names. Use MULTILIB_REQUIRED instead of
30862 MULTILIB_EXCEPTIONS. Add -mhw-mul -mhw-mulx -mhw-div
30863 -mcustom-fpu-cfg=fph2 multilib.
30865 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
30867 * config/nios2/nios2.c (NIOS2_FPU_CONFIG_NUM): Adjust value.
30868 (nios2_init_fpu_configs): Provide register values for new
30869 -mcustom-fpu-cfg=fph2 option variant.
30870 * doc/invoke.texi (-mcustom-fpu-cfg=fph2): Document new option
30873 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
30875 * config/nios2/nios2.c (nios2_custom_check_insns): Remove
30876 custom instruction warnings.
30878 2021-01-16 Jakub Jelinek <jakub@redhat.com>
30880 PR tree-optimization/96669
30881 * match.pd ((CST << x) & 1 -> x == 0): New simplification.
30883 2021-01-16 Jakub Jelinek <jakub@redhat.com>
30885 PR tree-optimization/96271
30886 * passes.def: Pass false argument to first two pass_cd_dce
30887 instances and true to last instance. Add comment that
30888 last instance rewrites no longer addressed locals.
30889 * tree-ssa-dce.c (pass_cd_dce): Add update_address_taken_p member and
30891 (pass_cd_dce::set_pass_param): New method.
30892 (pass_cd_dce::execute): Return TODO_update_address_taken from
30893 last cd_dce instance.
30895 2021-01-15 Carl Love <cel@us.ibm.com>
30897 * config/rs6000/altivec.h (vec_mulh, vec_div, vec_dive, vec_mod):
30899 * config/rs6000/altivec.md (VIlong): Move define to file vsx.md.
30900 * config/rs6000/rs6000-builtin.def (DIVES_V4SI, DIVES_V2DI,
30901 DIVEU_V4SI, DIVEU_V2DI, DIVS_V4SI, DIVS_V2DI, DIVU_V4SI,
30902 DIVU_V2DI, MODS_V2DI, MODS_V4SI, MODU_V2DI, MODU_V4SI,
30903 MULHS_V2DI, MULHS_V4SI, MULHU_V2DI, MULHU_V4SI, MULLD_V2DI):
30904 Add builtin define.
30905 (MULH, DIVE, MOD): Add new BU_P10_OVERLOAD_2 definitions.
30906 * config/rs6000/rs6000-call.c (VSX_BUILTIN_VEC_DIV,
30907 VSX_BUILTIN_VEC_DIVE, P10_BUILTIN_VEC_MOD, P10_BUILTIN_VEC_MULH):
30908 New overloaded definitions.
30909 (builtin_function_type) [P10V_BUILTIN_DIVEU_V4SI,
30910 P10V_BUILTIN_DIVEU_V2DI, P10V_BUILTIN_DIVU_V4SI,
30911 P10V_BUILTIN_DIVU_V2DI, P10V_BUILTIN_MODU_V2DI,
30912 P10V_BUILTIN_MODU_V4SI, P10V_BUILTIN_MULHU_V2DI,
30913 P10V_BUILTIN_MULHU_V4SI]: Add case
30914 statement for builtins.
30915 * config/rs6000/rs6000.md (bits): Add new attribute sizes V4SI, V2DI.
30916 * config/rs6000/vsx.md (VIlong): Moved from config/rs6000/altivec.md.
30917 (UNSPEC_VDIVES, UNSPEC_VDIVEU): New unspec definitions.
30918 (vsx_mul_v2di): Add if TARGET_POWER10 statement.
30919 (vsx_udiv_v2di): Add if TARGET_POWER10 statement.
30920 (dives_<mode>, diveu_<mode>, div<mode>3, uvdiv<mode>3,
30921 mods_<mode>, modu_<mode>, mulhs_<mode>, mulhu_<mode>, mulv2di3):
30922 Add define_insn, mode is VIlong.
30923 * doc/extend.texi (vec_mulh, vec_mul, vec_div, vec_dive, vec_mod):
30924 Add builtin descriptions.
30926 2021-01-15 Eric Botcazou <ebotcazou@adacore.com>
30928 * final.c (final_start_function_1): Reset force_source_line.
30930 2021-01-15 Jakub Jelinek <jakub@redhat.com>
30932 PR tree-optimization/96669
30933 * match.pd (((1 << A) & 1) != 0 -> A == 0,
30934 ((1 << A) & 1) == 0 -> A != 0): Generalize for 1s replaced by
30935 possibly different power of two constants and to right shift too.
30937 2021-01-15 Jakub Jelinek <jakub@redhat.com>
30939 PR tree-optimization/96681
30940 * match.pd ((x < 0) ^ (y < 0) to (x ^ y) < 0): New simplification.
30941 ((x >= 0) ^ (y >= 0) to (x ^ y) < 0): Likewise.
30942 ((x < 0) ^ (y >= 0) to (x ^ y) >= 0): Likewise.
30943 ((x >= 0) ^ (y < 0) to (x ^ y) >= 0): Likewise.
30945 2021-01-15 Alexandre Oliva <oliva@adacore.com>
30947 * opts.c (gen_command_line_string): Exclude -dumpbase-ext.
30949 2021-01-15 Tamar Christina <tamar.christina@arm.com>
30951 * config/aarch64/aarch64-simd.md (cml<fcmac1><conj_op><mode>4,
30952 cmul<conj_op><mode>3): New.
30953 * config/aarch64/iterators.md (UNSPEC_FCMUL,
30954 UNSPEC_FCMUL180, UNSPEC_FCMLA_CONJ, UNSPEC_FCMLA180_CONJ,
30955 UNSPEC_CMLA_CONJ, UNSPEC_CMLA180_CONJ, UNSPEC_CMUL, UNSPEC_CMUL180,
30956 FCMLA_OP, FCMUL_OP, conj_op, rotsplit1, rotsplit2, fcmac1, sve_rot1,
30957 sve_rot2, SVE2_INT_CMLA_OP, SVE2_INT_CMUL_OP, SVE2_INT_CADD_OP): New.
30958 (rot): Add UNSPEC_FCMUL, UNSPEC_FCMUL180.
30959 (rot_op): Renamed to conj_op.
30960 * config/aarch64/aarch64-sve.md (cml<fcmac1><conj_op><mode>4,
30961 cmul<conj_op><mode>3): New.
30962 * config/aarch64/aarch64-sve2.md (cml<fcmac1><conj_op><mode>4,
30963 cmul<conj_op><mode>3): New.
30965 2021-01-15 David Malcolm <dmalcolm@redhat.com>
30969 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
30970 Escape the tempfile name when constructing the expected output.
30972 2021-01-15 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30974 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlsl_hi<mode>):
30976 (aarch64_<su>mlsl_hi<mode>): ... This.
30977 (aarch64_<su>mlsl_hi<mode>): Define.
30978 (*aarch64_<su>mlsl<mode): Rename to...
30979 (aarch64_<su>mlsl<mode): ... This.
30980 * config/aarch64/aarch64-simd-builtins.def (smlsl, umlsl,
30981 smlsl_hi, umlsl_hi): Define builtins.
30982 * config/aarch64/arm_neon.h (vmlsl_high_s8, vmlsl_high_s16,
30983 vmlsl_high_s32, vmlsl_high_u8, vmlsl_high_u16, vmlsl_high_u32,
30984 vmlsl_s8, vmlsl_s16, vmlsl_s32, vmlsl_u8,
30985 vmlsl_u16, vmlsl_u32): Reimplement with builtins.
30987 2021-01-15 Uroš Bizjak <ubizjak@gmail.com>
30989 * config/i386/i386-c.c (ix86_target_macros):
30990 Use cpp_define_formatted for __SIZEOF_FLOAT80__ definition.
30992 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
30995 * config.gcc (aarch64*-*-*): Add aarch64-cc-fusion.o to extra_objs.
30996 * Makefile.in (RTL_SSA_H): New variable.
30997 * config/aarch64/t-aarch64 (aarch64-cc-fusion.o): New rule.
30998 * config/aarch64/aarch64-protos.h (make_pass_cc_fusion): Declare.
30999 * config/aarch64/aarch64-passes.def: Add pass_cc_fusion after
31001 * config/aarch64/aarch64-cc-fusion.cc: New file.
31003 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
31005 * recog.h (insn_change_watermark::~insn_change_watermark): Avoid
31006 calling cancel_changes for changes that no longer exist.
31008 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
31010 * rtl-ssa/functions.h (function_info::ref_defs): Rename to...
31011 (function_info::reg_defs): ...this.
31012 * rtl-ssa/member-fns.inl (function_info::ref_defs): Rename to...
31013 (function_info::reg_defs): ...this.
31015 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31018 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
31020 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31023 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31026 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
31028 2021-01-15 Richard Biener <rguenther@suse.de>
31030 PR tree-optimization/96376
31031 * tree-vect-stmts.c (get_load_store_type): Disregard alignment
31032 for VMAT_INVARIANT.
31034 2021-01-15 Martin Liska <mliska@suse.cz>
31036 * doc/install.texi: Document that some tests need pytest module.
31037 * doc/sourcebuild.texi: Likewise.
31039 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31042 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
31044 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31046 * config/arm/mve.md (mve_vshrq_n_s<mode>_imm): New entry.
31047 (mve_vshrq_n_u<mode>_imm): Likewise.
31048 * config/arm/neon.md (vashr<mode>3, vlshr<mode>3): Move to ...
31049 * config/arm/vec-common.md: ... here.
31051 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
31053 * config/arm/mve.md (mve_vshlq_<supf><mode>): Move to
31055 * config/arm/neon.md (vashl<mode>3): Delete.
31056 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): New.
31057 (vasl<mode>3): New expander.
31059 2021-01-15 Richard Biener <rguenther@suse.de>
31061 PR tree-optimization/98685
31062 * tree-vect-slp.c (vect_schedule_slp_node): Refactor handling
31063 of vector extern defs.
31065 2021-01-14 David Malcolm <dmalcolm@redhat.com>
31068 * diagnostic.c (diagnostic_kind_text): Break out this array
31070 (diagnostic_build_prefix): ...here.
31071 (fancy_abort): Detect when diagnostic_initialize has not yet been
31072 called and fall back to a minimal implementation of printing the
31073 ICE, rather than segfaulting in internal_error.
31075 2021-01-14 David Malcolm <dmalcolm@redhat.com>
31077 * diagnostic.c (diagnostic_initialize): Eliminate
31078 parseable_fixits_p in favor of initializing extra_output_kind from
31079 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
31080 (convert_column_unit): New function, split out from...
31081 (diagnostic_converted_column): ...this.
31082 (print_parseable_fixits): Add "column_unit" and "tabstop" params.
31083 Use them to call convert_column_unit on the column values.
31084 (diagnostic_report_diagnostic): Eliminate conditional on
31085 parseable_fixits_p in favor of a switch statement on
31086 extra_output_kind, passing the appropriate values to the new
31087 params of print_parseable_fixits.
31088 (selftest::test_print_parseable_fixits_none): Update for new
31089 params of print_parseable_fixits.
31090 (selftest::test_print_parseable_fixits_insert): Likewise.
31091 (selftest::test_print_parseable_fixits_remove): Likewise.
31092 (selftest::test_print_parseable_fixits_replace): Likewise.
31093 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
31095 (selftest::diagnostic_c_tests): Call it.
31096 * diagnostic.h (enum diagnostics_extra_output_kind): New.
31097 (diagnostic_context::parseable_fixits_p): Delete field in favor
31099 (diagnostic_context::extra_output_kind): ...this new field.
31100 * doc/invoke.texi (Environment Variables): Add
31101 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
31102 * opts.c (common_handle_option): Update handling of
31103 OPT_fdiagnostics_parseable_fixits for change to diagnostic_context
31106 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31108 * tree-vect-slp-patterns.c (class complex_operations_pattern,
31109 complex_operations_pattern::matches,
31110 complex_operations_pattern::recognize,
31111 complex_operations_pattern::build): New.
31112 (slp_patterns): Use it.
31114 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31116 * internal-fn.def (COMPLEX_FMS, COMPLEX_FMS_CONJ): New.
31117 * optabs.def (cmls_optab, cmls_conj_optab): New.
31118 * doc/md.texi: Document them.
31119 * tree-vect-slp-patterns.c (class complex_fms_pattern,
31120 complex_fms_pattern::matches, complex_fms_pattern::recognize,
31121 complex_fms_pattern::build): New.
31123 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31125 * internal-fn.def (COMPLEX_FMA, COMPLEX_FMA_CONJ): New.
31126 * optabs.def (cmla_optab, cmla_conj_optab): New.
31127 * doc/md.texi: Document them.
31128 * tree-vect-slp-patterns.c (vect_match_call_p,
31129 class complex_fma_pattern, vect_slp_reset_pattern,
31130 complex_fma_pattern::matches, complex_fma_pattern::recognize,
31131 complex_fma_pattern::build): New.
31133 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31135 * internal-fn.def (COMPLEX_MUL, COMPLEX_MUL_CONJ): New.
31136 * optabs.def (cmul_optab, cmul_conj_optab): New.
31137 * doc/md.texi: Document them.
31138 * tree-vect-slp-patterns.c (vect_match_call_complex_mla,
31139 vect_normalize_conj_loc, is_eq_or_top, vect_validate_multiplication,
31140 vect_build_combine_node, class complex_mul_pattern,
31141 complex_mul_pattern::matches, complex_mul_pattern::recognize,
31142 complex_mul_pattern::build): New.
31144 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31146 * tree-vect-slp.c (optimize_load_redistribution_1): New.
31147 (optimize_load_redistribution, vect_is_slp_load_node): New.
31148 (vect_match_slp_patterns): Use it.
31150 2021-01-14 Tamar Christina <tamar.christina@arm.com>
31152 * tree-vect-slp-patterns.c (complex_add_pattern::build):
31155 2021-01-14 Thomas Schwinge <thomas@codesourcery.com>
31157 * config/gcn/mkoffload.c (main): Create an offload image only in
31158 64-bit configurations.
31160 2021-01-14 H.J. Lu <hjl.tools@gmail.com>
31163 * config/i386/i386-options.c (ix86_option_override_internal):
31164 Issue an error for -fcf-protection with CF_BRANCH when compiling
31165 for 32-bit non-TARGET_CMOV targets.
31167 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
31170 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p):
31171 Remove declaration and initialization of shadow variable "ret".
31172 (ix86_option_override_internal): Remove delcaration of
31173 shadow variable "i". Redeclare shadowed variable to unsigned.
31174 * common/config/i386/i386-common.c (pta_size): Redeclare to unsigned.
31175 * config/i386/i386-builtins.c (get_builtin_code_for_version):
31176 Update for redeclaration.
31177 * config/i386/i386.h (pta_size): Ditto.
31179 2021-01-14 Richard Biener <rguenther@suse.de>
31181 PR tree-optimization/98674
31182 * tree-data-ref.c (base_supports_access_fn_components_p): New.
31183 (initialize_data_dependence_relation): For two bases without
31184 possible access fns resort to type size equality when determining
31185 shape compatibility.
31187 2021-01-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
31190 * config/arm/arm_neon.h: Replace calls to __builtin_vcge* by
31191 <=, >= operators in vcle and vcge intrinsics respectively.
31192 * config/arm/arm_neon_builtins.def: Remove entry for
31195 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
31198 * config/i386/i386-options.c (ix86_function_specific_save):
31199 Remove redundant assignment to opts->x_ix86_branch_cost.
31200 * config/i386/i386.c (ix86_prefetch_sse):
31201 Rename from x86_prefetch_sse. Update all uses.
31202 * config/i386/i386.h: Update for rename.
31203 * config/i386/i386-options.h: Ditto.
31205 2021-01-14 Jakub Jelinek <jakub@redhat.com>
31208 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3,
31209 *sse4_1_zero_extendv4hiv4si2_3, *sse4_1_zero_extendv2siv2di2_3):
31210 Use Bm instead of m for non-avx. Add isa attribute.
31212 2021-01-14 Jakub Jelinek <jakub@redhat.com>
31214 PR tree-optimization/96688
31215 * match.pd (~(X >> Y) -> ~X >> Y): New simplification if
31216 ~X can be simplified.
31218 2021-01-14 Richard Sandiford <richard.sandiford@arm.com>
31220 * tree-vect-stmts.c (vect_model_load_cost): Account for unused
31221 IFN_LOAD_LANES results.
31223 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31225 * config/aarch64/aarch64-simd.md (aarch64_<su>xtl<mode>):
31227 (aarch64_xtn<mode>): Likewise.
31228 * config/aarch64/aarch64-simd-builtins.def (sxtl, uxtl, xtn):
31231 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
31233 (vmovl_s16): Likewise.
31234 (vmovl_s32): Likewise.
31235 (vmovl_u8): Likewise.
31236 (vmovl_u16): Likewise.
31237 (vmovl_u32): Likewise.
31238 (vmovn_s16): Likewise.
31239 (vmovn_s32): Likewise.
31240 (vmovn_s64): Likewise.
31241 (vmovn_u16): Likewise.
31242 (vmovn_u32): Likewise.
31243 (vmovn_u64): Likewise.
31245 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31247 * config/aarch64/aarch64-simd.md (aarch64_<su>qxtn2<mode>_le):
31249 (aarch64_<su>qxtn2<mode>_be): Likewise.
31250 (aarch64_<su>qxtn2<mode>): Likewise.
31251 * config/aarch64/aarch64-simd-builtins.def (sqxtn2, uqxtn2):
31253 * config/aarch64/iterators.md (SAT_TRUNC): Define code_iterator.
31254 (su): Handle ss_truncate and us_truncate.
31255 * config/aarch64/arm_neon.h (vqmovn_high_s16): Reimplement using
31257 (vqmovn_high_s32): Likewise.
31258 (vqmovn_high_s64): Likewise.
31259 (vqmovn_high_u16): Likewise.
31260 (vqmovn_high_u32): Likewise.
31261 (vqmovn_high_u64): Likewise.
31263 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31265 * config/aarch64/aarch64-simd.md (aarch64_xtn2<mode>_le):
31267 (aarch64_xtn2<mode>_be): Likewise.
31268 (aarch64_xtn2<mode>): Likewise.
31269 * config/aarch64/aarch64-simd-builtins.def (xtn2): Define
31271 * config/aarch64/arm_neon.h (vmovn_high_s16): Reimplement using
31273 (vmovn_high_s32): Likewise.
31274 (vmovn_high_s64): Likewise.
31275 (vmovn_high_u16): Likewise.
31276 (vmovn_high_u32): Likewise.
31277 (vmovn_high_u64): Likewise.
31279 2021-01-13 Stafford Horne <shorne@gmail.com>
31281 * config/or1k/or1k.h (ASM_PREFERRED_EH_DATA_FORMAT): New macro.
31283 2021-01-13 Stafford Horne <shorne@gmail.com>
31285 * config/or1k/linux.h (TARGET_ASM_FILE_END): Define macro.
31287 2021-01-13 Stafford Horne <shorne@gmail.com>
31289 * config/or1k/or1k.h (TARGET_CPU_CPP_BUILTINS): Add builtin
31290 define for __or1k_hard_float__.
31292 2021-01-13 Stafford Horne <shorne@gmail.com>
31294 * config/or1k/or1k.h (NO_PROFILE_COUNTERS): Define as 1.
31295 (PROFILE_HOOK): Define to call _mcount.
31296 (FUNCTION_PROFILER): Change from abort to no-op.
31298 2021-01-13 Jakub Jelinek <jakub@redhat.com>
31300 PR tree-optimization/96691
31301 * match.pd ((~X | C) ^ D -> (X | C) ^ (~D ^ C),
31302 (~X & C) ^ D -> (X & C) ^ (D ^ C)): New simplifications if
31303 (~D ^ C) or (D ^ C) can be simplified.
31305 2021-01-13 Richard Biener <rguenther@suse.de>
31307 PR tree-optimization/92645
31308 * match.pd (BIT_FIELD_REF to conversion): Delay canonicalization
31309 until after vector lowering.
31311 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
31313 * config/aarch64/aarch64-sve.md (fnma<mode>4): Extend from SVE_FULL_I
31315 (@aarch64_pred_fnma<mode>, cond_fnma<mode>, *cond_fnma<mode>_2)
31316 (*cond_fnma<mode>_4, *cond_fnma<mode>_any): Likewise.
31318 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
31320 * config/aarch64/aarch64-sve.md (fma<mode>4): Extend from SVE_FULL_I
31322 (@aarch64_pred_fma<mode>, cond_fma<mode>, *cond_fma<mode>_2)
31323 (*cond_fma<mode>_4, *cond_fma<mode>_any): Likewise.
31325 2021-01-13 Richard Biener <rguenther@suse.de>
31327 PR tree-optimization/92645
31328 * tree-vect-slp.c (vect_build_slp_tree_1): Relax supported
31329 BIT_FIELD_REF argument.
31330 (vect_build_slp_tree_2): Record the desired vector type
31331 on the external vector def.
31332 (vectorizable_slp_permutation): Handle required punning
31333 of existing vector defs.
31335 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
31337 * rtl-ssa/accesses.h (def_lookup): Fix order of comparison results.
31339 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
31341 * config/sh/sh.md (movsf_ie): Remove operands[2] test.
31343 2021-01-13 Samuel Thibault <samuel.thibault@ens-lyon.org>
31345 * config.gcc [$target == *-*-gnu*]: Enable
31346 'default_gnu_indirect_function'.
31348 2021-01-13 Jakub Jelinek <jakub@redhat.com>
31351 * optabs.c (expand_vec_perm_const): Don't force v0 and v1 into
31352 registers before calling targetm.vectorize.vec_perm_const, only after
31354 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const): Handle
31355 two argument permutation when one operand is zero vector and only
31356 after that force operands into registers.
31357 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_1): New
31358 define_insn_and_split pattern.
31359 (*avx512bw_zero_extendv32qiv32hi2_1): Likewise.
31360 (*avx512f_zero_extendv16hiv16si2_1): Likewise.
31361 (*avx2_zero_extendv8hiv8si2_1): Likewise.
31362 (*avx512f_zero_extendv8siv8di2_1): Likewise.
31363 (*avx2_zero_extendv4siv4di2_1): Likewise.
31364 * config/mips/mips.c (mips_vectorize_vec_perm_const): Force operands
31366 * config/arm/arm.c (arm_vectorize_vec_perm_const): Likewise.
31367 * config/sparc/sparc.c (sparc_vectorize_vec_perm_const): Likewise.
31368 * config/ia64/ia64.c (ia64_vectorize_vec_perm_const): Likewise.
31369 * config/aarch64/aarch64.c (aarch64_vectorize_vec_perm_const): Likewise.
31370 * config/rs6000/rs6000.c (rs6000_vectorize_vec_perm_const): Likewise.
31371 * config/gcn/gcn.c (gcn_vectorize_vec_perm_const): Likewise. Use std::swap.
31373 2021-01-13 Martin Liska <mliska@suse.cz>
31375 PR tree-optimization/98455
31376 * gimple-if-to-switch.cc (condition_info::record_phi_mapping):
31377 Record also virtual PHIs.
31378 (pass_if_to_switch::execute): Return TODO_cleanup_cfg only
31381 2021-01-13 Jonathan Wakely <jwakely@redhat.com>
31383 * doc/invoke.texi (C++ Modules): Fix typos.
31385 2021-01-13 Richard Biener <rguenther@suse.de>
31387 PR tree-optimization/98640
31388 * tree-ssa-sccvn.c (visit_nary_op): Do not try to
31389 handle plus or minus from a truncated operand to be
31392 2021-01-13 Jakub Jelinek <jakub@redhat.com>
31395 * config/i386/i386.md (*btr<mode>_1, *btr<mode>_2): New
31396 define_insn_and_split patterns.
31397 (splitter after *btr<mode>_2): New splitter.
31399 2021-01-13 Martin Liska <mliska@suse.cz>
31402 * cgraphunit.c (analyze_functions): Remove dead code.
31404 2021-01-13 Qian Jianhua <qianjh@cn.fujitsu.com>
31406 * config/aarch64/aarch64-cost-tables.h (a64fx_extra_costs): New.
31407 * config/aarch64/aarch64.c (a64fx_addrcost_table): New.
31408 (a64fx_regmove_cost, a64fx_vector_cost): New.
31409 (a64fx_tunings): Use the new added cost tables.
31411 2021-01-13 Jakub Jelinek <jakub@redhat.com>
31414 * config/i386/predicates.md (pmovzx_parallel): New predicate.
31415 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): New
31416 define_insn_and_split pattern.
31417 (*sse4_1_zero_extendv4hiv4si2_3): Likewise.
31418 (*sse4_1_zero_extendv2siv2di2_3): Likewise.
31420 2021-01-13 Julian Brown <julian@codesourcery.com>
31422 * config/gcn/gcn.c (gcn_conditional_register_usage): Remove dead code
31423 to fix v0 register.
31425 2021-01-13 Julian Brown <julian@codesourcery.com>
31427 * config/gcn/gcn.c (gcn_md_reorg): Fix case where EXEC reg is live
31430 2021-01-13 Julian Brown <julian@codesourcery.com>
31432 * config/gcn/gcn-valu.md (recip<mode>2<exec>, recip<mode>2): Use unspec
31433 for reciprocal-approximation instructions.
31434 (div<mode>3): Use fused multiply-accumulate operations for reciprocal
31435 refinement and division result.
31436 * config/gcn/gcn.md (UNSPEC_RCP): New unspec constant.
31438 2021-01-13 Julian Brown <julian@codesourcery.com>
31440 * config/gcn/gcn-valu.md (subdf): Rename to...
31443 2021-01-12 Martin Liska <mliska@suse.cz>
31445 * gcov.c (source_info::debug): Fix printf format for 32-bit hosts.
31447 2021-01-12 Andrea Corallo <andrea.corallo@arm.com>
31449 * function-abi.h: Fix typo.
31451 2021-01-12 Christophe Lyon <christophe.lyon@linaro.org>
31455 * config/arm/arm.h (ARM_HAVE_NEON_V8QI_LDST): New macro.
31456 (ARM_HAVE_NEON_V16QI_LDST, ARM_HAVE_NEON_V4HI_LDST): Likewise.
31457 (ARM_HAVE_NEON_V8HI_LDST, ARM_HAVE_NEON_V2SI_LDST): Likewise.
31458 (ARM_HAVE_NEON_V4SI_LDST, ARM_HAVE_NEON_V4HF_LDST): Likewise.
31459 (ARM_HAVE_NEON_V8HF_LDST, ARM_HAVE_NEON_V4BF_LDST): Likewise.
31460 (ARM_HAVE_NEON_V8BF_LDST, ARM_HAVE_NEON_V2SF_LDST): Likewise.
31461 (ARM_HAVE_NEON_V4SF_LDST, ARM_HAVE_NEON_DI_LDST): Likewise.
31462 (ARM_HAVE_NEON_V2DI_LDST): Likewise.
31463 (ARM_HAVE_V8QI_LDST, ARM_HAVE_V16QI_LDST): Likewise.
31464 (ARM_HAVE_V4HI_LDST, ARM_HAVE_V8HI_LDST): Likewise.
31465 (ARM_HAVE_V2SI_LDST, ARM_HAVE_V4SI_LDST, ARM_HAVE_V4HF_LDST): Likewise.
31466 (ARM_HAVE_V8HF_LDST, ARM_HAVE_V4BF_LDST, ARM_HAVE_V8BF_LDST): Likewise.
31467 (ARM_HAVE_V2SF_LDST, ARM_HAVE_V4SF_LDST, ARM_HAVE_DI_LDST): Likewise.
31468 (ARM_HAVE_V2DI_LDST): Likewise.
31469 * config/arm/mve.md (*movmisalign<mode>_mve_store): New pattern.
31470 (*movmisalign<mode>_mve_load): New pattern.
31471 * config/arm/neon.md (movmisalign<mode>): Move to ...
31472 * config/arm/vec-common.md: ... here.
31474 2021-01-12 Vladimir N. Makarov <vmakarov@redhat.com>
31477 * lra-eliminations.c (eliminate_regs_in_insn): Add transformation
31478 of pattern 'plus (plus (hard reg, const), pseudo)'.
31480 2021-01-12 Richard Biener <rguenther@suse.de>
31482 PR tree-optimization/98550
31483 * tree-vect-slp.c (vect_record_max_nunits): Check whether
31484 the group size is a multiple of the vector element count.
31485 (vect_build_slp_tree_1): When we need to fail because
31486 the vector type choosen causes unrolling do so lazily
31487 without affecting matches only at the end to guide group splitting.
31489 2021-01-12 Martin Liska <mliska@suse.cz>
31492 * optc-save-gen.awk: Compare also n_target_save vars with
31495 2021-01-12 Martin Liska <mliska@suse.cz>
31497 * gcov.c (source_info::debug): New.
31498 (print_usage): Add --debug (-D) option.
31499 (process_args): Likewise.
31500 (generate_results): Call src->debug after
31501 accumulate_line_counts.
31502 (read_graph_file): Properly assign id for EXIT_BLOCK.
31503 * profile.c (branch_prob): Dump function body before it is
31506 2021-01-12 Jakub Jelinek <jakub@redhat.com>
31508 PR tree-optimization/98629
31509 * tree-ssa-math-opts.c (arith_overflow_check_p): Don't update use_stmt
31510 unless returning non-zero.
31512 2021-01-12 Jakub Jelinek <jakub@redhat.com>
31514 PR tree-optimization/95731
31515 * tree-ssa-reassoc.c (optimize_range_tests_cmp_bitwise): Also optimize
31516 x < 0 && y < 0 && z < 0 into (x | y | z) < 0 for signed x, y, z.
31517 (optimize_range_tests): Call optimize_range_tests_cmp_bitwise
31518 only after optimize_range_tests_var_bound.
31520 2021-01-12 Jakub Jelinek <jakub@redhat.com>
31522 * configure.ac: Ensure c/Make-lang.in comes first in @all_lang_makefrags@.
31523 * configure: Regenerated.
31525 2021-01-12 liuhongt <hongtao.liu@intel.com>
31528 * config/i386/i386-builtins.h (BUILTIN_DESC_SWAP_OPERANDS):
31530 * config/i386/i386-expand.c (ix86_expand_sse_comi): Delete
31533 2021-01-12 Alexandre Oliva <oliva@adacore.com>
31535 * ssa-iterators.h (end_imm_use_stmt_traverse): Forward
31537 (auto_end_imm_use_stmt_traverse): New struct.
31538 (FOR_EACH_IMM_USE_STMT): Use it.
31539 (BREAK_FROM_IMM_USE_STMT, RETURN_FROM_IMM_USE_STMT): Remove,
31541 * gimple-ssa-strength-reduction.c: ... here, ...
31542 * graphite-scop-detection.c: ... here, ...
31543 * ipa-modref.c, ipa-pure-const.c, ipa-sra.c: ... here, ...
31544 * tree-predcom.c, tree-ssa-ccp.c: ... here, ...
31545 * tree-ssa-dce.c, tree-ssa-dse.c: ... here, ...
31546 * tree-ssa-loop-ivopts.c, tree-ssa-math-opts.c: ... here, ...
31547 * tree-ssa-phiprop.c, tree-ssa.c: ... here, ...
31548 * tree-vect-slp.c: ... and here, ...
31549 * doc/tree-ssa.texi: ... and the example here.
31551 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
31553 * config/aarch64/aarch64-sve.md (sdiv_pow2<mode>3): Extend from
31554 SVE_FULL_I to SVE_I. Generate an UNSPEC_PRED_X.
31555 (*sdiv_pow2<mode>3): New pattern.
31556 (@cond_<sve_int_op><mode>): Extend from SVE_FULL_I to SVE_I.
31557 Wrap the ASRD in an UNSPEC_PRED_X.
31558 (*cond_<sve_int_op><mode>_2): Likewise. Replace the UNSPEC_PRED_X
31559 predicate with a constant PTRUE, if it isn't already.
31560 (*cond_<sve_int_op><mode>_z): Replace with...
31561 (*cond_<sve_int_op><mode>_any): ...this new pattern.
31563 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
31565 * config/aarch64/aarch64-sve.md (*cond_bic<mode>_2): Extend from
31566 SVE_FULL_I to SVE_I.
31567 (*cond_bic<mode>_any): Likewise.
31569 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
31571 * config/aarch64/aarch64-sve.md (<su>mul<mode>3_highpart)
31572 (@aarch64_pred_<MUL_HIGHPART:optab><mode>): Extend from SVE_FULL_I
31575 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
31577 * config/aarch64/aarch64-sve.md (<su>abd<mode>_3): Extend from
31578 SVE_FULL_I to SVE_I.
31579 (*aarch64_cond_<su>abd<mode>_2): Likewise.
31580 (*aarch64_cond_<su>abd<mode>_any): Likewise.
31581 (@aarch64_pred_<su>abd<mode>): Likewise. Use UNSPEC_PRED_X
31582 for the max and min but not for the minus.
31583 (*aarch64_cond_<su>abd<mode>_3): New pattern.
31585 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
31587 * config/aarch64/iterators.md (SVE_24I): New iterator.
31588 * config/aarch64/aarch64-sve.md (*aarch64_adr<mode>_shift): Extend from
31589 SVE_FULL_SDI to SVE_24I. Use containers rather than elements.
31591 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
31593 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_BINARY:optab><mode>)
31594 (*cond_<SVE_INT_BINARY:optab><mode>_2): Extend from SVE_FULL_I
31596 (*cond_<SVE_INT_BINARY:optab><mode>_3): Likewise.
31597 (*cond_<SVE_INT_BINARY:optab><mode>_any): Likewise.
31598 (*cond_<SVE_INT_BINARY:optab><mode>_2_const): Likewise.
31599 (*cond_<SVE_INT_BINARY:optab><mode>_any_const): Likewise.
31601 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
31603 * config/aarch64/aarch64-sve.md (<SVE_INT_BINARY_IMM:optab><mode>3)
31604 (@aarch64_pred_<SVE_INT_BINARY_IMM:optab><mode>)
31605 (*post_ra_<SVE_INT_BINARY_IMM:optab><mode>3): Extend from SVE_FULL_I
31608 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
31610 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3)
31611 (v<ASHIFT:optab><mode>3, @aarch64_pred_<optab><mode>)
31612 (*post_ra_v<ASHIFT:optab><mode>3): Extend from SVE_FULL_I to SVE_I.
31614 2021-01-11 Martin Liska <mliska@suse.cz>
31617 * symtab-clones.h (clone_info::release): Release
31618 symtab::m_clones with ggc_delete as it's a GGC memory.
31620 2021-01-11 Matthias Klose <doko@ubuntu.com>
31622 * Makefile.in (LINK_PROGRESS): Show the link target.
31624 2021-01-11 Richard Biener <rguenther@suse.de>
31626 PR tree-optimization/91403
31627 * tree-vect-data-refs.c (vect_analyze_group_access_1): Cap
31628 single-element interleaving group size at 4096 elements.
31630 2021-01-11 Richard Biener <rguenther@suse.de>
31632 PR tree-optimization/98526
31633 * tree-vect-loop.c (vect_model_reduction_cost): Remove costing
31634 of the actual reduction op for the regular case.
31635 (vectorizable_reduction): Cost the stmts
31636 vect_transform_reduction produces here.
31638 2021-01-11 Andreas Krebbel <krebbel@linux.ibm.com>
31640 * tree-ssa-forwprop.c (simplify_vector_constructor): For
31641 big-endian, use UNPACK[_FLOAT]_HI.
31643 2021-01-11 Tamar Christina <tamar.christina@arm.com>
31645 * tree-vect-slp-patterns.c (class complex_pattern,
31646 class complex_add_pattern): Add parameters to matches.
31647 (complex_add_pattern::build): Free memory.
31648 (complex_add_pattern::matches): Move validation end of match.
31649 (complex_add_pattern::recognize): Likewise.
31651 2021-01-11 Tamar Christina <tamar.christina@arm.com>
31653 * tree-vect-slp-patterns.c (linear_loads_p): Fix externals.
31655 2021-01-11 Tamar Christina <tamar.christina@arm.com>
31657 * tree-vect-slp-patterns.c (is_linear_load_p): Fix ambiguity.
31659 2021-01-11 Jakub Jelinek <jakub@redhat.com>
31661 PR tree-optimization/95867
31662 * tree-ssa-math-opts.h: New header.
31663 * tree-ssa-math-opts.c: Include tree-ssa-math-opts.h.
31664 (powi_as_mults): No longer static. Use build_one_cst instead of
31665 build_real. Formatting fix.
31666 * tree-ssa-reassoc.c: Include tree-ssa-math-opts.h.
31667 (attempt_builtin_powi): Handle multiplication reassociation without
31668 powi_fndecl using powi_as_mults.
31669 (reassociate_bb): For integral types don't require
31670 -funsafe-math-optimizations to call attempt_builtin_powi.
31672 2021-01-11 Jakub Jelinek <jakub@redhat.com>
31674 PR tree-optimization/95852
31675 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): Change
31676 mul_stmts parameter type to vec<gimple *> &. Before cond_stmt
31677 allow in the bb any of the stmts in that vector, div_stmt and
31678 up to 3 cast stmts.
31679 (arith_cast_equal_p): New function.
31680 (arith_overflow_check_p): Add cast_stmt argument, handle signed
31681 multiply overflow checks.
31682 (match_arith_overflow): Adjust caller. Handle signed multiply
31685 2021-01-11 Jakub Jelinek <jakub@redhat.com>
31687 PR tree-optimization/95852
31688 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): New function.
31689 (uaddsub_overflow_check_p): Renamed to ...
31690 (arith_overflow_check_p): ... this. Handle also multiplication
31691 with overflow check.
31692 (match_uaddsub_overflow): Renamed to ...
31693 (match_arith_overflow): ... this. Add cfg_changed argument. Handle
31694 also multiplication with overflow check. Adjust function comment.
31695 (math_opts_dom_walker::after_dom_children): Adjust callers. Call
31696 match_arith_overflow also for MULT_EXPR.
31698 2021-01-11 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31700 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
31701 __builtin_convertvector.
31702 (vmovl_s16): Likewise.
31703 (vmovl_s32): Likewise.
31704 (vmovl_u8): Likewise.
31705 (vmovl_u16): Likewise.
31706 (vmovl_u32): Likewise.
31707 (vmovn_s16): Likewise.
31708 (vmovn_s32): Likewise.
31709 (vmovn_s64): Likewise.
31710 (vmovn_u16): Likewise.
31711 (vmovn_u32): Likewise.
31712 (vmovn_u64): Likewise.
31714 2021-01-11 Martin Liska <mliska@suse.cz>
31716 * gimple-if-to-switch.cc (struct condition_info): Use auto_var.
31717 (if_chain::is_beneficial): Delete clusters
31718 (find_conditions): Make second argument of conditions_in_bbs a
31719 pointer so that we control over it's lifetime.
31720 (pass_if_to_switch::execute): Delete them.
31722 2021-01-11 Kewen Lin <linkw@linux.ibm.com>
31724 * ira.c (move_unallocated_pseudos): Check other_reg and skip if
31727 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
31729 * config/vax/vax.md (cc): Remove mode attribute.
31730 (subst_<cc>, subst_f<cc>): Rename to...
31731 (subst_<mode>, subst_f<VAXccnz:mode>): ... these respectively.
31732 (*cbranch<VAXint:mode>4_<VAXcc:mode>): Update for `cc' removal.
31733 (*cbranch<VAXfp:mode>4_<VAXccnz:mode>): Likewise.
31734 (*branch_<mode>, *branch_<mode>_reversed): Likewise.
31736 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
31738 * config/vax/vax.md (subst_f<cc>): Add mode to operands and
31739 `const_double_zero'.
31741 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
31743 * config/pdp11/pdp11.md (PDPfp): New mode iterator.
31744 (fcc_cc, fcc_ccnz): Use it. Add mode to `const_double_zero' and
31747 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
31749 * genemit.c (gen_exp) <CONST_DOUBLE>: Handle `const_double_zero'
31751 * read-rtl.c (rtx_reader::read_rtx_code): Handle machine mode
31752 with `const_double_zero'.
31753 * doc/rtl.texi (Constant Expression Types): Document it.
31755 2021-01-09 Jakub Jelinek <jakub@redhat.com>
31758 * tree-cfg.c (verify_gimple_assign_binary): Allow lhs of
31759 POINTER_DIFF_EXPR to be any integral type.
31761 2021-01-09 Jakub Jelinek <jakub@redhat.com>
31763 PR rtl-optimization/98603
31764 * function.c (instantiate_virtual_regs_in_insn): For asm goto
31765 with impossible constraints, drop all SETs, CLOBBERs, drop PARALLEL
31766 if any, set ASM_OPERANDS mode to VOIDmode and change
31767 ASM_OPERANDS_OUTPUT_CONSTRAINT and ASM_OPERANDS_OUTPUT_IDX.
31769 2021-01-09 Alexandre Oliva <oliva@gnu.org>
31772 * final.c (notice_source_line): Narrow down the condition to
31773 skip a line-0 marker.
31775 2021-01-08 Sergei Trofimovich <siarheit@google.com>
31777 * ipa-modref.c (merge_call_side_effects): Fix
31778 linebreak split by reordering two print calls.
31780 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
31782 * config/s390/vector.md (*tf_to_fprx2_0): Rename from
31783 "*mov_tf_to_fprx2_0" for consistency, fix constraint.
31784 (*tf_to_fprx2_1): Rename from "*mov_tf_to_fprx2_1" for
31785 consistency, fix constraint.
31787 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
31789 * config/s390/s390-c.c (s390_def_or_undef_macro): Accept
31790 callables instead of mask values.
31791 (struct target_flag_set_p): New predicate.
31792 (s390_cpu_cpp_builtins_internal): Define or undefine
31793 __LONG_DOUBLE_VX__ macro.
31795 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
31798 * config/i386/i386.c (x86_function_profiler): Use R10 and R11
31799 to call mcount in large model with PIC for NO_PROFILE_COUNTERS
31802 2021-01-08 Richard Biener <rguenther@suse.de>
31804 * tree-ssa-sccvn.c (pass_fre::execute): Reset the SCEV hash table.
31806 2021-01-08 Richard Biener <rguenther@suse.de>
31808 * tree-vect-slp.c (scalar_stmts_to_slp_tree_map_t): Fix.
31809 (vect_build_slp_tree): On cache hit release the matched
31810 scalar stmts vector.
31811 * tree-vect-stmts.c (vectorizable_store): Properly free
31812 vec_oprnds before possibly gathering them again.
31814 2021-01-08 Richard Biener <rguenther@suse.de>
31816 PR tree-optimization/98544
31817 * tree-vect-slp.c (vect_optimize_slp): Always materialize
31818 permutes at a permute node.
31820 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
31823 * config/i386/i386.c (x86_function_profiler): Use R10 to call
31824 mcount in large model. Sorry for large model with PIC.
31826 2021-01-08 Jakub Jelinek <jakub@redhat.com>
31829 * config/i386/i386.opt (ix86_cmodel, ix86_incoming_stack_boundary_arg,
31830 ix86_pmode, ix86_preferred_stack_boundary_arg, ix86_regparm,
31831 ix86_veclibabi_type): Remove x_ prefix, use TargetVariable instead of
31832 TargetSave and initialize for variables with enum types.
31833 (mfentry, mstack-protector-guard-reg=, mstack-protector-guard-offset=,
31834 mstack-protector-guard-symbol=): Add Save.
31835 * config/i386/i386-options.c (ix86_function_specific_save,
31836 ix86_function_specific_restore): Don't save or restore x_ix86_cmodel,
31837 x_ix86_incoming_stack_boundary_arg, x_ix86_pmode,
31838 x_ix86_preferred_stack_boundary_arg, x_ix86_regparm,
31839 x_ix86_veclibabi_type.
31841 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
31843 * config/aarch64/aarch64-sve.md (*cnot<mode>): Extend from
31844 SVE_FULL_I to SVE_I.
31845 (*cond_cnot<mode>_2, *cond_cnot<mode>_any): Likewise.
31847 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
31849 * config/aarch64/aarch64-sve.md (*cond_uxt<mode>_2): Extend from
31850 SVE_FULL_I to SVE_I.
31851 (*cond_uxt<mode>_any): Likewise.
31853 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31855 * config/aarch64/iterators.md (Vwhalf): New iterator.
31856 * config/aarch64/aarch64-simd.md (aarch64_<sur>adalp<mode>_3):
31858 (aarch64_<sur>adalp<mode>): ... This. Make more
31860 (<sur>sadv16qi): Adjust callsite of the above.
31861 * config/aarch64/aarch64-simd-builtins.def (sadalp, uadalp): New
31863 * config/aarch64/arm_neon.h (vpadal_s8): Reimplement using
31865 (vpadal_s16): Likewise.
31866 (vpadal_u8): Likewise.
31867 (vpadal_u16): Likewise.
31868 (vpadalq_s8): Likewise.
31869 (vpadalq_s16): Likewise.
31870 (vpadalq_s32): Likewise.
31871 (vpadalq_u8): Likewise.
31872 (vpadalq_u16): Likewise.
31873 (vpadalq_u32): Likewise.
31875 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31877 * config/aarch64/aarch64-simd.md (aarch64_<su>abd<mode>_3):
31879 (aarch64_<su>abd<mode>): ... This.
31880 (<sur>sadv16qi): Adjust callsite of the above.
31881 * config/aarch64/aarch64-simd-builtins.def (sabd, uabd): Define
31883 * config/aarch64/arm_neon.h (vabd_s8): Reimplement using
31885 (vabd_s16): Likewise.
31886 (vabd_s32): Likewise.
31887 (vabd_u8): Likewise.
31888 (vabd_u16): Likewise.
31889 (vabd_u32): Likewise.
31890 (vabdq_s8): Likewise.
31891 (vabdq_s16): Likewise.
31892 (vabdq_s32): Likewise.
31893 (vabdq_u8): Likewise.
31894 (vabdq_u16): Likewise.
31895 (vabdq_u32): Likewise.
31897 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31899 * config/aarch64/aarch64-simd-builtins.def (saba, uaba): Define
31901 * config/aarch64/arm_neon.h (vaba_s8): Implement using builtin.
31902 (vaba_s16): Likewise.
31903 (vaba_s32): Likewise.
31904 (vaba_u8): Likewise.
31905 (vaba_u16): Likewise.
31906 (vaba_u32): Likewise.
31907 (vabaq_s8): Likewise.
31908 (vabaq_s16): Likewise.
31909 (vabaq_s32): Likewise.
31910 (vabaq_u8): Likewise.
31911 (vabaq_u16): Likewise.
31912 (vabaq_u32): Likewise.
31914 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31916 * config/aarch64/aarch64-simd.md (aba<mode>_3): Rename to...
31917 (aarch64_<su>aba<mode>): ... This. Handle uaba as well.
31918 Change RTL pattern to match.
31920 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
31922 * common/config/riscv/riscv-common.c (riscv_current_subset_list): New.
31923 * config/riscv/riscv-c.c (riscv-subset.h): New.
31924 (INCLUDE_STRING): Define.
31925 (riscv_cpu_cpp_builtins): Add new style architecture extension
31927 * config/riscv/riscv-subset.h (riscv_subset_list::begin): New.
31928 (riscv_subset_list::end): New.
31929 (riscv_current_subset_list): New.
31931 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
31933 * common/config/riscv/riscv-common.c (RISCV_DONT_CARE_VERSION):
31934 Move to riscv-subset.h.
31935 (struct riscv_subset_t): Ditto.
31936 (class riscv_subset_list): Ditto.
31937 * config/riscv/riscv-subset.h (RISCV_DONT_CARE_VERSION): Move
31938 from riscv-common.c.
31939 (struct riscv_subset_t): Ditto.
31940 (class riscv_subset_list): Ditto.
31941 * config/riscv/t-riscv ($(common_out_file)): Add file
31944 2021-01-07 Jakub Jelinek <jakub@redhat.com>
31947 * config/i386/i386.md (*bmi_blsi_<mode>_cmp, *bmi_blsi_<mode>_ccno):
31948 New define_insn patterns.
31950 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
31952 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_UNARY:optab><mode>)
31953 (*cond_<SVE_INT_UNARY:optab><mode>_2): Extend from SVE_FULL_I to SVE_I.
31954 (*cond_<SVE_INT_UNARY:optab><mode>_any): Likewise.
31956 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
31958 PR tree-optimization/98560
31959 * internal-fn.def (IFN_VCONDU, IFN_VCONDEQ): Use type vec_cond.
31960 * internal-fn.c (vec_cond_mask_direct): Get the data mode from
31962 (vec_cond_direct): Likewise argument 2.
31963 (vec_condu_direct, vec_condeq_direct): Delete.
31964 (expand_vect_cond_optab_fn): Rename to...
31965 (expand_vec_cond_optab_fn): ...this, replacing old macro.
31966 (expand_vec_condu_optab_fn, expand_vec_condeq_optab_fn): Delete.
31967 (expand_vect_cond_mask_optab_fn): Rename to...
31968 (expand_vec_cond_mask_optab_fn): ...this, replacing old macro.
31969 (direct_vec_cond_mask_optab_supported_p): Treat the optab as a
31971 (direct_vec_cond_optab_supported_p): Likewise.
31972 (direct_vec_condu_optab_supported_p): Delete.
31973 (direct_vec_condeq_optab_supported_p): Delete.
31974 * gimple-isel.cc: Include internal-fn.h.
31975 (gimple_expand_vec_cond_expr): Check that IFN_VCONDEQ is supported
31978 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
31980 PR tree-optimization/98560
31981 * gimple-isel.cc (gimple_expand_vec_cond_expr): If we fail to use
31982 IFN_VCOND{,U,EQ}, fall back on IFN_VCOND_MASK.
31984 2021-01-07 Uroš Bizjak <ubizjak@gmail.com>
31986 * config/i386/i386.md (insn): Merge from plusminus_insn, shift_insn,
31987 rotate_insn and optab code attributes.
31988 Update all uses to merged code attribute.
31989 * config/i386/sse.md: Update all uses to merged code attribute.
31990 * config/i386/mmx.md: Update all uses to merged code attribute.
31992 2021-01-07 Jakub Jelinek <jakub@redhat.com>
31994 PR tree-optimization/98568
31995 * gimple-ssa-store-merging.c (bswap_view_convert): New function.
31996 (bswap_replace): Use it.
31998 2021-01-06 Vladimir N. Makarov <vmakarov@redhat.com>
32000 PR rtl-optimization/97978
32001 * lra-int.h (lra_hard_reg_split_p): New external.
32002 * lra.c (lra_hard_reg_split_p): New global.
32003 (lra): Set up lra_hard_reg_split_p after splitting a hard reg.
32004 * lra-assigns.c (lra_assign): Don't check allocation correctness
32005 after hard reg splitting.
32007 2021-01-06 Martin Sebor <msebor@redhat.com>
32010 * builtins.c (new_delete_mismatch_p): New overload.
32011 (new_delete_mismatch_p (tree, tree)): Call it.
32013 2021-01-06 Alexandre Oliva <oliva@adacore.com>
32015 * Makefile.in (T_GLIMITS_H): New.
32016 (stmp-int-hdrs): Depend on it, use it.
32017 * config/t-vxworks (T_GLIMITS_H): Override it.
32018 (vxw-glimits.h): New.
32020 2021-01-06 Richard Biener <rguenther@suse.de>
32022 PR tree-optimization/98513
32023 * value-range.cc (intersect_ranges): Compare the upper bounds
32024 for the expected relation.
32026 2021-01-06 Gerald Pfeifer <gerald@pfeifer.com>
32029 2020-12-28 Gerald Pfeifer <gerald@pfeifer.com>
32031 * doc/standards.texi (HSAIL): Remove section.
32033 2021-01-05 Samuel Thibault <samuel.thibault@ens-lyon.org>
32035 * configure: Re-generate.
32037 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32039 * doc/invoke.texi (-std=c++20): Adjust for the publication of
32040 ISO 14882:2020 standard.
32041 * doc/standards.texi: Likewise.
32043 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32045 PR tree-optimization/94802
32046 * expr.h (maybe_optimize_sub_cmp_0): Declare.
32047 * expr.c: Include tree-pretty-print.h and flags.h.
32048 (maybe_optimize_sub_cmp_0): New function.
32049 (do_store_flag): Use it.
32050 * cfgexpand.c (expand_gimple_cond): Likewise.
32052 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
32054 * mux-utils.h (pointer_mux::m_ptr): Tweak description of contents.
32055 * rtlanal.c (simple_regno_set): Tweak description to clarify the
32058 2021-01-05 Richard Biener <rguenther@suse.de>
32060 PR tree-optimization/98516
32061 * tree-vect-slp.c (vect_optimize_slp): Permute the incoming
32062 lanes when materializing on a VEC_PERM node.
32063 (vectorizable_slp_permutation): Dump the permute properly.
32065 2021-01-05 Richard Biener <rguenther@suse.de>
32067 * tree-vect-slp.c (vect_slp_region): Move debug counter
32068 to cover individual subgraphs.
32070 2021-01-05 Richard Biener <rguenther@suse.de>
32072 PR tree-optimization/98428
32073 * tree-vect-slp.c (vect_build_slp_tree_1): Properly reject
32074 vector lane extracts for loop vectorization.
32076 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32078 PR tree-optimization/98514
32079 * tree-ssa-reassoc.c (bb_rank): Change type from long * to
32081 (operand_rank): Change type from hash_map<tree, long> to
32082 hash_map<tree, int64_t>.
32083 (phi_rank): Change return type from long to int64_t.
32084 (loop_carried_phi): Change block_rank variable type from long to
32086 (propagate_rank): Change return type, rank parameter type and
32087 op_rank variable type from long to int64_t.
32088 (find_operand_rank): Change return type from long to int64_t
32089 and change slot variable type from long * to int64_t *.
32090 (insert_operand_rank): Change rank parameter type from long to
32092 (get_rank): Change return type and rank variable type from long to
32093 int64_t. Use PRId64 instead of ld to print the rank.
32094 (init_reassoc): Change rank variable type from long to int64_t
32095 and adjust correspondingly bb_rank and operand_rank initialization.
32097 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32099 PR tree-optimization/96928
32100 * tree-ssa-phiopt.c (xor_replacement): New function.
32101 (tree_ssa_phiopt_worker): Call it.
32103 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32105 PR tree-optimization/96930
32106 * match.pd ((A / (1 << B)) -> (A >> B)): If A is extended
32107 from narrower value which has the same type as 1 << B, perform
32108 the right shift on the narrower value followed by extension.
32110 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32112 PR tree-optimization/96239
32113 * gimple-ssa-store-merging.c (maybe_optimize_vector_constructor): New
32115 (get_status_for_store_merging): Don't return BB_INVALID for blocks
32116 with potential bswap optimizable CONSTRUCTORs.
32117 (pass_store_merging::execute): Optimize vector CONSTRUCTORs with bswap
32120 2021-01-05 Richard Biener <rguenther@suse.de>
32122 PR tree-optimization/98381
32123 * tree.c (vector_element_bits): Properly compute bool vector
32125 * tree-vect-loop.c (vectorizable_live_operation): Properly
32126 compute the last lane bit offset.
32128 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
32131 * config/i386/sse.md (sse_cvtps2pi): Redefine as define_insn_and_split.
32132 Clear the top 64 bytes of the input XMM register.
32133 (sse_cvttps2pi): Ditto.
32135 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
32138 * config/i386/xopintrin.h (_mm256_cmov_si256): New.
32140 2021-01-05 H.J. Lu <hjl.tools@gmail.com>
32143 * config/i386/xmmintrin.h (_mm_extract_pi16): Cast to unsigned
32146 2021-01-05 Claudiu Zissulescu <claziss@synopsys.com>
32148 * config/arc/arc.md (maddsidi4_split): Use ACC_REG_FIRST.
32149 (umaddsidi4_split): Likewise.
32151 2021-01-05 liuhongt <hongtao.liu@intel.com>
32154 * config/i386/sse.md (*sse2_pmovskb_zexthisi): New
32155 define_insn_and_split for zero_extend of subreg HI of pmovskb
32157 (*sse2_pmovskb_zexthisi): Add new combine splitters for
32158 zero_extend of not of subreg HI of pmovskb result.
32160 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
32163 * explow.c (convert_memory_address_addr_space_1): Handle UNSPECs
32165 * config/aarch64/aarch64.c (aarch64_expand_mov_immediate): Use
32166 convert_memory_address to convert symbolic immediates to ptr_mode
32167 before forcing them to memory.
32169 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
32171 PR rtl-optimization/97144
32172 * recog.c (constrain_operands): Initialize matching_operand
32173 for each alternative, rather than only doing it once.
32175 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
32177 PR rtl-optimization/98403
32178 * rtl-ssa/changes.cc (function_info::finalize_new_accesses): Explain
32179 why we don't remove call clobbers.
32180 (function_info::apply_changes_to_insn): Don't attempt to add
32181 call clobbers here.
32183 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
32185 PR tree-optimization/98371
32186 * tree-vect-loop.c (vect_reanalyze_as_main_loop): New function.
32187 (vect_analyze_loop): If an epilogue loop appears to be cheaper
32188 than the main loop, re-analyze it as a main loop before adopting
32191 2021-01-05 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
32194 * configure.ac (NETLIBS): Determine using AX_LIB_SOCKET_NSL.
32195 * aclocal.m4, configure: Regenerate.
32196 * Makefile.in (NETLIBS): Define.
32197 (BACKEND): Remove $(CODYLIB).
32199 2021-01-05 Jakub Jelinek <jakub@redhat.com>
32201 PR rtl-optimization/98334
32202 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
32203 Optimize (X - 1) * Y + Y to X * Y or (X + 1) * Y - Y to X * Y.
32205 2021-01-05 Bernd Edlinger <bernd.edlinger@hotmail.de>
32207 * tree-inline.c (expand_call_inline): Restore input_location.
32208 Return result from recursive call.
32210 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
32212 PR tree-optimization/95401
32213 * config/aarch64/aarch64-sve-builtins.cc
32214 (gimple_folder::load_store_cookie): Use bits rather than bytes
32215 for the alignment argument to IFN_MASK_LOAD and IFN_MASK_STORE.
32216 * gimple-fold.c (gimple_fold_mask_load_store_mem_ref): Likewise.
32217 * tree-vect-stmts.c (vectorizable_store): Likewise.
32218 (vectorizable_load): Likewise.
32220 2021-01-04 Richard Biener <rguenther@suse.de>
32222 PR tree-optimization/98308
32223 * tree-vect-stmts.c (vectorizable_load): Set invariant mask
32226 2021-01-04 Jakub Jelinek <jakub@redhat.com>
32228 PR tree-optimization/95771
32229 * tree-ssa-loop-niter.c (number_of_iterations_popcount): Handle types
32230 with precision smaller than int's precision and types with precision
32231 twice as large as long long. Formatting fixes.
32233 2021-01-04 Richard Biener <rguenther@suse.de>
32235 PR tree-optimization/98464
32236 * tree-ssa-sccvn.c (vn_valueize_for_srt): Rename from ...
32237 (vn_valueize_wrapper): ... this. Temporarily adjust vn_context_bb.
32238 (process_bb): Adjust.
32240 2021-01-04 Matthew Malcomson <matthew.malcomson@arm.com>
32243 * doc/invoke.texi (-fsanitize=address): Fix wording describing
32244 clash with -fsanitize=hwaddress.
32246 2021-01-04 Richard Biener <rguenther@suse.de>
32248 PR tree-optimization/98282
32249 * tree-ssa-sccvn.c (vn_get_stmt_kind): Classify tcc_reference on
32250 invariants as VN_NARY.
32252 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
32255 * config/aarch64/aarch64-simd.md (aarch64_combine<mode>): Accept
32256 aarch64_simd_reg_or_zero for operand 2. Use the combinez patterns
32257 to handle zero operands.
32259 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
32261 * config/aarch64/aarch64.c (offset_6bit_signed_scaled_p): New function.
32262 (offset_6bit_unsigned_scaled_p): Fix typo in comment.
32263 (aarch64_sve_prefetch_operand_p): Accept MUL VLs in the range
32266 2021-01-04 Richard Biener <rguenther@suse.de>
32268 PR tree-optimization/98393
32269 * tree-vect-slp.c (vect_build_slp_tree): Properly zero matches
32270 when hitting the limit.
32272 2021-01-04 Richard Biener <rguenther@suse.de>
32274 PR tree-optimization/98291
32275 * tree-vect-loop.c (vectorizable_reduction): Bypass
32276 associativity check for SLP reductions with VF 1.
32278 2021-01-04 Jakub Jelinek <jakub@redhat.com>
32280 PR tree-optimization/96782
32281 * match.pd (x == ~x -> false, x != ~x -> true): New simplifications.
32283 2021-01-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
32285 * collect-utils.c (collect_execute): Check dumppfx.
32286 * collect2.c (maybe_run_lto_and_relink, do_link): Pass atsuffix
32287 to collect_execute.
32288 (do_link): Add new parameter atsuffix.
32289 (main): Handle -dumpdir option. Skip one argument for
32290 -o, -isystem and -B options.
32291 * gcc.c (make_at_file): New helper function.
32292 (close_at_file): Use it.
32294 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32296 * config/darwin.h (MIN_LD64_NO_COAL_SECTS): Adjust.
32297 Amend handling for LD64_VERSION fallback defaults.
32299 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32301 * config.gcc: Compute default version information
32302 from the configured target. Likewise defaults for
32304 * config/darwin10.h: Removed.
32305 * config/darwin12.h: Removed.
32306 * config/darwin9.h: Removed.
32307 * config/rs6000/darwin8.h: Removed.
32309 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32311 * config/darwin9.h (ASM_OUTPUT_ALIGNED_COMMON): Delete.
32313 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32315 * config/darwin9.h (STACK_CHECK_STATIC_BUILTIN): Move from here..
32316 * config/darwin.h (STACK_CHECK_STATIC_BUILTIN): .. to here.
32318 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32320 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move from
32322 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): ... to here.
32324 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32326 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move the spec
32327 for the Darwin10 unwinder stub from here ...
32328 * config/darwin.h (LINK_COMMAND_SPEC_A): ... to here.
32330 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
32332 * config/darwin.h (DSYMUTIL_SPEC): Default to DWARF
32333 (ASM_DEBUG_SPEC):Only define if the assembler supports
32335 (PREFERRED_DEBUGGING_TYPE): Default to DWARF.
32336 (DARWIN_PREFER_DWARF): Define.
32337 * config/darwin9.h (PREFERRED_DEBUGGING_TYPE): Remove.
32338 (DARWIN_PREFER_DWARF): Likewise
32339 (DSYMUTIL_SPEC): Likewise.
32340 (COLLECT_RUN_DSYMUTIL): Likewise.
32341 (ASM_DEBUG_SPEC): Likewise.
32342 (ASM_DEBUG_OPTION_SPEC): Likewise.
32344 2021-01-02 Jan Hubicka <jh@suse.cz>
32346 * cfg.c (free_block): ggc_free bb.
32348 2021-01-01 Jakub Jelinek <jakub@redhat.com>
32350 * gcc.c (process_command): Update copyright notice dates.
32351 * gcov-dump.c (print_version): Ditto.
32352 * gcov.c (print_version): Ditto.
32353 * gcov-tool.c (print_version): Ditto.
32354 * gengtype.c (create_file): Ditto.
32355 * doc/cpp.texi: Bump @copying's copyright year.
32356 * doc/cppinternals.texi: Ditto.
32357 * doc/gcc.texi: Ditto.
32358 * doc/gccint.texi: Ditto.
32359 * doc/gcov.texi: Ditto.
32360 * doc/install.texi: Ditto.
32361 * doc/invoke.texi: Ditto.
32363 2021-01-01 Jakub Jelinek <jakub@redhat.com>
32365 * ChangeLog-2020: Rotate ChangeLog. New file.
32368 Copyright (C) 2021 Free Software Foundation, Inc.
32370 Copying and distribution of this file, with or without modification,
32371 are permitted in any medium without royalty provided the copyright
32372 notice and this notice are preserved.