1 2021-12-11 Jan Hubicka <hubicka@ucw.cz>
3 * ipa-profile.c (ipa_profile): Do not update hot bb threshold.
5 2021-12-11 Jan Hubicka <hubicka@ucw.cz>
7 * ipa-modref.c (get_modref_function_summary): Use ultimate_alias_target.
8 (ignore_edge): Likewise.
9 (compute_parm_map): Likewise.
10 (modref_propagate_in_scc): Likewise.
11 (modref_propagate_flags_in_scc): Likewise.
13 2021-12-10 Jason Merrill <jason@redhat.com>
15 * symtab.c (symtab_node::equal_address_to): Fix comment typo.
17 2021-12-10 Doug Rupp <rupp@adacore.com>
19 * config/vxworks.h (LINK_SPEC): Remove %(link_target).
20 Change %{v:-v} to %{v:-V}.
22 2021-12-10 Olivier Hainque <hainque@adacore.com>
24 * config/t-vxworks: Remove assignment to STMP_FIXINC.
26 2021-12-10 Martin Liska <mliska@suse.cz>
28 * params.opt: Add missing dot.
30 2021-12-10 Roger Sayle <roger@nextmovesoftware.com>
33 * ipa-modref-tree.h (useful_for_kill_p): Zero width accesses aren't
34 useful for kill tracking.
36 2021-12-10 Andrew Stubbs <ams@codesourcery.com>
38 * config/gcn/mkoffload.c (process_asm): Process the variable table
39 completely differently.
40 (process_obj): Encode the variable data differently.
42 2021-12-10 Joel Hutton <joel.hutton@arm.com>
44 PR tree-optimization/103523
45 * tree-vect-loop.c (vectorizable_induction): Check for
46 PLUS_EXPR/MINUS_EXPR support.
48 2021-12-10 Cui,Lili <lili.cui@intel.com>
50 * config/i386/i386.c (ix86_vector_costs::add_stmt_cost): Remove Tremont.
52 2021-12-09 Jan Hubicka <hubicka@ucw.cz>
54 * doc/invoke.texi (max-inline-functions-called-once-loop-depth,
55 max-inline-functions-called-once-insns): New parameters.
56 * ipa-inline.c (check_callers): Handle
57 param_inline_functions_called_once_loop_depth and
58 param_inline_functions_called_once_insns.
59 (edge_badness): Fix linebreaks.
60 * params.opt (param=max-inline-functions-called-once-loop-depth,
61 param=max-inline-functions-called-once-insns): New params.
63 2021-12-09 Martin Sebor <msebor@redhat.com>
65 PR tree-optimization/103215
66 * pointer-query.cc (access_ref::merge_ref): Extend the offset and
67 size of the merged object instead of using the larger.
69 2021-12-09 Martin Sebor <msebor@redhat.com>
72 * doc/extend.texi (attribute access): Adjust.
73 * gimple-ssa-warn-access.cc (pass_waccess::maybe_check_access_sizes):
74 Treat access mode none on a void* argument as expecting as few as
77 2021-12-09 Jakub Jelinek <jakub@redhat.com>
80 * config/aarch64/aarch64-sve-builtins.cc (gt_pch_nx): Change type of
81 second argument from function with 2 pointer arguments to function
82 with 3 pointer arguments.
84 2021-12-09 Olivier Hainque <hainque@adacore.com>
86 * config/aarch64/aarch64-vxworks.h (TARGET_OS_CPP_BUILTINS):
87 Use VX_CPU_PREFIX in CPU definitions.
89 2021-12-09 Martin Sebor <msebor@redhat.com>
91 * pointer-query.cc (access_ref::dump): Define new function.
92 (pointer_query::dump): Call it.
93 * pointer-query.h (access_ref::dump): Declare new function.
95 2021-12-09 Martin Sebor <msebor@redhat.com>
97 * pointer-query.cc (compute_objsize_r): Add an argument.
98 (gimple_call_return_array): Pass a new argument to compute_objsize_r.
99 (access_ref::merge_ref): Same.
100 (access_ref::inform_access): Add an argument and use it.
101 (access_data::access_data): Initialize new member.
102 (handle_min_max_size): Pass a new argument to compute_objsize_r.
103 (handle_decl): New function.
104 (handle_array_ref): Pass a new argument to compute_objsize_r.
105 Avoid incrementing deref.
106 (set_component_ref_size): New function.
107 (handle_component_ref): New function.
108 (handle_mem_ref): Pass a new argument to compute_objsize_r.
109 Only increment deref after successfully computing object size.
110 (handle_ssa_name): New function.
111 (compute_objsize_r): Move code into helpers and call them.
112 (compute_objsize): Pass a new argument to compute_objsize_r.
113 * pointer-query.h (access_ref::inform_access): Add an argument.
114 (access_data::ostype): New member.
116 2021-12-09 Martin Sebor <msebor@redhat.com>
118 * pointer-query.cc (access_ref::merge_ref): Define new function.
119 (access_ref::get_ref): Move code into merge_ref and call it.
120 * pointer-query.h (access_ref::merge_ref): Declare new function.
122 2021-12-09 Martin Sebor <msebor@redhat.com>
124 * gimple-ssa-warn-restrict.c (builtin_access::builtin_access): Pass
125 GIMPLE statement to compute_objsize.
126 * pointer-query.cc (compute_objsize): Add a statement argument.
127 * pointer-query.h (compute_objsize): Define a new overload.
129 2021-12-09 Martin Sebor <msebor@redhat.com>
131 * gimple-ssa-warn-access.cc (check_access): Adjust to member name
133 (pass_waccess::check_strncmp): Same.
134 * pointer-query.cc (access_ref::access_ref): Remove arguments.
136 (access_data::access_data): Define new ctors.
137 (access_data::set_bound): Define new member function.
138 (compute_objsize_r): Remove unnecessary code.
139 * pointer-query.h (struct access_ref): Remove ctor arguments.
140 (struct access_data): Declare ctor overloads.
141 (access_data::dst_bndrng): New member.
142 (access_data::src_bndrng): New member.
144 2021-12-09 Martin Sebor <msebor@redhat.com>
147 * pointer-query.cc (gimple_call_return_array): Call compute_objsize_r.
149 2021-12-09 Olivier Hainque <hainque@adacore.com>
150 Rasmus Villemoes <rv@rasmusvillemoes.dk>
152 * Makefile.in (T_STDINT_GCC_H): New variable, path to
153 stdint-gcc.h that a target configuration may override when
154 use_gcc_stdint is "provide".
155 (stmp-int-hdrs): Depend on it and copy that for
157 * config.gcc (vxworks): Revert to use_gcc_stdint=provide.
158 * config/t-vxworks (T_STDINT_GCC_H): Define, as vxw-stdint-gcc.h.
159 (vxw-stdint-gcc.h): New target, produced from the original
161 (vxw-glimits.h): Use an automatic variable to designate the
162 first and only prerequisite.
163 * config/vxworks/stdint.h: Remove.
165 2021-12-09 Iain Sandoe <iain@sandoe.co.uk>
168 * config/host-darwin.c (SAFE_ALLOC_SIZE): Remove.
169 (darwin_gt_pch_get_address): Rework for relocatable PCH.
170 (darwin_gt_pch_use_address): Likewise.
172 2021-12-09 Jakub Jelinek <jakub@redhat.com>
175 * config/host-darwin.c (darwin_gt_pch_use_address): When reading
176 manually the file into mapped area, update mapped_addr as
177 an automatic variable rather than addr which is a reference parameter.
178 * config/host-hpux.c (hpux_gt_pch_use_address): When reading
179 manually the file into mapped area, update addr as
180 an automatic variable rather than base which is a reference parameter.
182 2021-12-09 Jakub Jelinek <jakub@redhat.com>
185 * coretypes.h (gt_pointer_operator): Use 3 pointer arguments instead
187 * gengtype.c (struct walk_type_data): Add in_nested_ptr argument.
188 (walk_type): Temporarily set d->in_nested_ptr around nested_ptr
190 (write_types_local_user_process_field): Pass a new middle pointer
191 to gt_pointer_operator op calls, if d->in_nested_ptr pass there
192 address of d->prev_val[2], otherwise NULL.
193 (write_types_local_process_field): Likewise.
194 * ggc-common.c (relocate_ptrs): Add real_ptr_p argument. If equal
195 to ptr_p, do nothing, otherwise if NULL remember ptr_p's
196 or if non-NULL real_ptr_p's corresponding new address in
198 (reloc_addrs_vec): New variable.
199 (compare_ptr, read_uleb128, write_uleb128): New functions.
200 (gt_pch_save): When iterating over objects through relocate_ptrs,
201 save current i into state.ptrs_i. Sort reloc_addrs_vec and emit
202 it as uleb128 of differences between pointer addresses into the
204 (gt_pch_restore): Allow restoring of PCH to a different address
205 than the preferred one, in that case adjust global pointers by bias
206 and also adjust by bias addresses read from the relocation table
207 as uleb128 differences. Otherwise fseek over it. Perform
208 gt_pch_restore_stringpool only after adjusting callbacks and for
209 callback adjustments also take into account the bias.
210 (default_gt_pch_use_address): Change type of first argument from
212 (mmap_gt_pch_use_address): Likewise.
213 * ggc-tests.c (gt_pch_nx): Pass NULL as new middle argument to op.
214 * hash-map.h (hash_map::pch_nx_helper): Likewise.
215 (gt_pch_nx): Likewise.
216 * hash-set.h (gt_pch_nx): Likewise.
217 * hash-table.h (gt_pch_nx): Likewise.
218 * hash-traits.h (ggc_remove::pch_nx): Likewise.
219 * hosthooks-def.h (default_gt_pch_use_address): Change type of first
220 argument from void * to void *&.
221 (mmap_gt_pch_use_address): Likewise.
222 * hosthooks.h (struct host_hooks): Change type of first argument of
223 gt_pch_use_address hook from void * to void *&.
224 * machmode.h (gt_pch_nx): Expect a callback with 3 pointers instead of
225 two in the middle argument.
226 * poly-int.h (gt_pch_nx): Likewise.
227 * stringpool.c (gt_pch_nx): Pass NULL as new middle argument to op.
228 * tree-cfg.c (gt_pch_nx): Likewise, except for LOCATION_BLOCK pass
229 the same &(block) twice.
230 * value-range.h (gt_pch_nx): Pass NULL as new middle argument to op.
231 * vec.h (gt_pch_nx): Likewise.
232 * wide-int.h (gt_pch_nx): Likewise.
233 * config/host-darwin.c (darwin_gt_pch_use_address): Change type of
234 first argument from void * to void *&.
235 * config/host-darwin.h (darwin_gt_pch_use_address): Likewise.
236 * config/host-hpux.c (hpux_gt_pch_use_address): Likewise.
237 * config/host-linux.c (linux_gt_pch_use_address): Likewise. If
238 mmap at the preferred location did not succeed, set base
239 to the actual one. Update addr in the manual reading loop instead of
241 * config/host-netbsd.c (netbsd_gt_pch_use_address): Change type of
242 first argument from void * to void *&.
243 * config/host-openbsd.c (openbsd_gt_pch_use_address): Likewise.
244 * config/host-solaris.c (sol_gt_pch_use_address): Likewise.
245 * config/i386/host-mingw32.c (mingw32_gt_pch_use_address): Likewise.
246 * config/rs6000/rs6000-gen-builtins.c (write_init_file): Pass NULL
247 as new middle argument to op in the generated code.
248 * doc/gty.texi: Adjust samples for the addition of middle pointer
249 to gt_pointer_operator callback.
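
The gt_pch_save entry above describes sorting the recorded pointer addresses and emitting them as uleb128-encoded differences. The following is only a minimal sketch of that general encoding idea, not the code in ggc-common.c: the helper names (encode_uleb128, decode_uleb128, encode_sorted_addrs) are hypothetical, addresses are modelled as uint64_t values, and the decoder assumes well-formed input of at most ten bytes per value.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: write VALUE to BUF in unsigned LEB128 form and
   return the number of bytes written (at most 10 for a 64-bit value).  */
static size_t
encode_uleb128 (uint64_t value, unsigned char *buf)
{
  size_t n = 0;
  do
    {
      unsigned char byte = value & 0x7f;
      value >>= 7;
      if (value != 0)
        byte |= 0x80;   /* Continuation bit: more bytes follow.  */
      buf[n++] = byte;
    }
  while (value != 0);
  return n;
}

/* Hypothetical helper: read one unsigned LEB128 value from BUF into
   *VALUE and return the number of bytes consumed.  Assumes the input
   is well formed.  */
static size_t
decode_uleb128 (const unsigned char *buf, uint64_t *value)
{
  uint64_t result = 0;
  unsigned shift = 0;
  size_t n = 0;
  unsigned char byte;
  do
    {
      byte = buf[n++];
      result |= (uint64_t) (byte & 0x7f) << shift;
      shift += 7;
    }
  while (byte & 0x80);
  *value = result;
  return n;
}

/* Encode the ascending array ADDRS[0..COUNT-1] as differences from the
   previous element (the first one relative to 0).  Returns the number
   of bytes written to BUF.  */
static size_t
encode_sorted_addrs (const uint64_t *addrs, size_t count,
                     unsigned char *buf)
{
  uint64_t prev = 0;
  size_t n = 0;
  for (size_t i = 0; i < count; i++)
    {
      n += encode_uleb128 (addrs[i] - prev, buf + n);
      prev = addrs[i];
    }
  return n;
}
```

Because the addresses are sorted, the differences are small non-negative numbers, so most of them fit in one or two bytes.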
251 2021-12-09 Alexandre Oliva <oliva@adacore.com>
254 * reg-stack.c (convert_regs_1): Move any_malformed_asm
256 (reg_to_stack): ... here.
258 2021-12-09 Alexandre Oliva <oliva@adacore.com>
261 * expr.c (emit_move_multi_word): Skip clobber during lra.
263 2021-12-09 Alexandre Oliva <oliva@adacore.com>
265 PR tree-optimization/103024
267 * gimple-harden-conditionals.cc (non_eh_succ_edge): New.
268 (pass_harden_compares::execute): Accept 1-bit integral types,
269 and cope with throwing compares.
271 2021-12-08 Iain Sandoe <iain@sandoe.co.uk>
273 * config/darwin.h (DARWIN_PIE_SPEC): Add -no_pie when
274 linking -mdynamic-no-pic code on macOS > 10.7.
276 2021-12-08 Dimitar Dimitrov <dimitar@dinux.eu>
278 * config/pru/pru.c (pru_section_type_flags): New function.
279 (TARGET_SECTION_TYPE_FLAGS): Wire it.
281 2021-12-08 David Faust <david.faust@oracle.com>
283 * config/bpf/bpf.c (handle_attr_preserve): Avoid calling
284 is_gimple_assign with a NULL pointer.
286 2021-12-08 Martin Liska <mliska@suse.cz>
288 * profile-count.c (profile_count::dump): Add function
289 that can dump to a provided buffer.
290 (profile_probability::dump): Likewise.
291 * profile-count.h: Likewise.
292 * tree-ssa-loop-unswitch.c (tree_unswitch_single_loop):
293 Use dump_printf_loc infrastructure.
294 (tree_unswitch_outer_loop): Likewise.
295 (find_loop_guard): Likewise.
296 (hoist_guard): Likewise.
298 2021-12-08 Chung-Lin Tang <cltang@codesourcery.com>
300 * gimplify.c (extract_base_bit_offset): Add 'tree *offsetp' parameter,
301 accommodate case where 'offset' return of get_inner_reference is
303 (is_or_contains_p): Further robustify conditions.
304 (omp_target_reorder_clauses): In alloc/to/from sorting phase, also
305 move following GOMP_MAP_ALWAYS_POINTER maps along. Add new sorting
306 phase where we make sure pointers with an attach/detach map are ordered
308 (gimplify_scan_omp_clauses): Add modifications to avoid creating
309 GOMP_MAP_STRUCT and associated alloc map for attach/detach maps.
311 2021-12-08 Roger Sayle <roger@nextmovesoftware.com>
313 * config/nvptx/nvptx.md (*extend_trunc_<mode>2_qi,
314 *extend_trunc_<mode>2_hi, *extend_trunc_di2_si): New insns.
315 Use cvt to perform sign-extension of truncation in one step.
317 2021-12-08 Chung-Lin Tang <cltang@codesourcery.com>
320 * gimplify.c ("tree-hash-traits.h"): Add include.
321 (gimplify_scan_omp_clauses): Change struct_map_to_clause to type
322 hash_map<tree_operand, tree> *. Adjust struct map handling to handle
323 cases of *A and A->B expressions. Under !DECL_P case of
324 GOMP_CLAUSE_MAP handling, add STRIP_NOPS for indir_p case, add to
325 struct_deref_set for map(*ptr_to_struct) cases. Add MEM_REF case when
326 handling component_ref_p case. Add unshare_expr and gimplification
327 when created GOMP_MAP_STRUCT is not a DECL. Add code to add
328 firstprivate pointer for *pointer-to-struct case.
329 (gimplify_adjust_omp_clauses): Move GOMP_MAP_STRUCT removal code for
330 exit data directives code to earlier position.
331 * omp-low.c (lower_omp_target):
332 Handle GOMP_MAP_ATTACH_ZERO_LENGTH_ARRAY_SECTION, and
333 GOMP_MAP_POINTER_TO_ZERO_LENGTH_ARRAY_SECTION map kinds.
334 * tree-pretty-print.c (dump_omp_clause): Likewise.
336 2021-12-08 Andrew Stubbs <ams@codesourcery.com>
337 Hafiz Abid Qadeer <abidh@codesourcery.com>
339 * dwarf2cfi.c (dw_stack_pointer_regnum): Change type to struct cfa_reg.
340 (dw_frame_pointer_regnum): Likewise.
341 (new_cfi_row): Use set_by_dwreg.
342 (get_cfa_from_loc_descr): Use set_by_dwreg. Support register spans.
343 Handle DW_OP_bregx with DW_OP_breg{0-31}. Support DW_OP_lit*,
344 DW_OP_const*, DW_OP_minus, DW_OP_shl and DW_OP_plus.
345 (lookup_cfa_1): Use set_by_dwreg.
346 (def_cfa_0): Update for cfa_reg and support register spans.
347 (reg_save): Change sreg parameter to struct cfa_reg. Support register
349 (dwf_cfa_reg): New function.
350 (dwarf2out_flush_queued_reg_saves): Use dwf_cfa_reg instead of
352 (dwarf2out_frame_debug_def_cfa): Likewise.
353 (dwarf2out_frame_debug_adjust_cfa): Likewise.
354 (dwarf2out_frame_debug_cfa_offset): Likewise. Update reg_save usage.
355 (dwarf2out_frame_debug_cfa_register): Likewise.
356 (dwarf2out_frame_debug_expr): Likewise.
357 (create_pseudo_cfg): Use set_by_dwreg.
358 (initial_return_save): Use set_by_dwreg and dwf_cfa_reg.
359 (create_cie_data): Use dwf_cfa_reg.
360 (execute_dwarf2_frame): Use dwf_cfa_reg.
361 (dump_cfi_row): Use set_by_dwreg.
362 * dwarf2out.c (build_span_loc, build_breg_loc): New functions.
363 (build_cfa_loc): Support register spans.
364 (build_cfa_aligned_loc): Update cfa_reg usage.
365 (convert_cfa_to_fb_loc_list): Use set_by_dwreg.
366 * dwarf2out.h (struct cfa_reg): New type.
367 (struct dw_cfa_location): Use struct cfa_reg.
368 (build_span_loc): New prototype.
370 2021-12-08 Haochen Jiang <haochen.jiang@intel.com>
374 (*<sse4_1>_blendv<ssefltmodesuffix><avxsizesuffix>_not_ltint):
375 Add new define_insn_and_split.
377 2021-12-08 Alexandre Oliva <oliva@adacore.com>
380 * gimple-harden-conditionals.cc (detach_value): Use memory if
381 general regs won't do.
383 2021-12-07 Martin Liska <mliska@suse.cz>
386 * config/s390/s390.c (s390_valid_target_attribute_inner_p):
387 Use new enum CLVC_INTEGER.
388 * opt-functions.awk: Use new CLVC_INTEGER.
389 * opts-common.c (set_option): Likewise.
390 (option_enabled): Return -1,0,1 for CLVC_INTEGER.
391 (get_option_state): Use new CLVC_INTEGER.
392 (control_warning_option): Likewise.
393 * opts.h (enum cl_var_type): Likewise.
395 2021-12-07 H.J. Lu <hjl.tools@gmail.com>
398 * config/i386/i386.c (ix86_call_use_plt_p): Check FUNCTION_DECL
399 before calling cgraph_node::get.
401 2021-12-07 Richard Biener <rguenther@suse.de>
403 PR tree-optimization/103596
404 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
405 Note we are not propagating into a PHI argument to may_propagate_copy.
406 * tree-ssa-propagate.h (may_propagate_copy): Add
407 argument specifying whether we propagate into a PHI arg.
408 * tree-ssa-propagate.c (may_propagate_copy): Likewise.
409 When not doing so we can replace an abnormal with
411 (may_propagate_into_stmt): Update may_propagate_copy calls.
412 (replace_exp_1): Move propagation checking code to
413 propagate_value and rename to ...
414 (replace_exp): ... this and elide previous wrapper.
415 (propagate_value): Perform checking with adjusted
416 may_propagate_copy call and dispatch to replace_exp.
418 2021-12-07 Matthias Kretz <m.kretz@gsi.de>
420 * hash-map.h (hash_map::traverse): Let both overloads behave the
422 * predict.c (assert_is_empty): Return true, thus not changing
425 2021-12-07 YunQiang Su <yunqiang.su@cipunited.com>
427 * config/mips/mips.h (ISA_HAS_UNALIGNED_ACCESS, STRICT_ALIGNMENT):
428 R6 supports unaligned access.
429 * config/mips/mips.md (movmisalign<mode>): Likewise.
430 * config/mips/mips.opt: Add -m(no-)unaligned-access.
431 * doc/invoke.texi: Likewise.
433 2021-12-07 Eugene Rozenfeld <erozen@microsoft.com>
435 * auto-profile.c (afdo_propagate_edge): Improve count propagation algorithm.
437 2021-12-06 Paul A. Clarke <pc@us.ibm.com>
440 * config/rs6000/xmmintrin.h (_mm_movemask_ps): Replace "vector" with
443 2021-12-06 Jose E. Marchesi <jose.marchesi@oracle.com>
445 * config/bpf/bpf.c (bpf_handle_preserve_access_index_attribute):
446 Mark arguments `args' and `flags' as unused.
447 (bpf_core_newdecl): Remove unused local `newdecl'.
448 (bpf_core_newdecl): Remove unused argument `loc'.
449 (ctfc_debuginfo_early_finish_p): Remove unused function.
450 (TARGET_CTFC_DEBUGINFO_EARLY_FINISH_P): Remove definition.
451 (bpf_core_walk): Do not pass a location to bpf_core_newdecl.
453 2021-12-06 Richard Sandiford <richard.sandiford@arm.com>
455 * gimple-range-edge.cc (gimple_outgoing_range::edge_range_p): Add
456 a shortcut for blocks with single successors.
457 * gimple-range-gori.cc (gori_map::calculate_gori): Likewise.
459 2021-12-06 Richard Sandiford <richard.sandiford@arm.com>
461 * value-range.cc (irange::irange_union): Use quick_push rather
462 than safe_push. Use widest_int rather than wide_int. Avoid
463 assigning wi::to_* results to wide*_int temporaries.
465 2021-12-06 Andrew MacLeod <amacleod@redhat.com>
467 * gimple-range-cache.cc (ranger_cache::fill_block_cache): Check for
468 a range from dominators before filling the cache.
469 (ranger_cache::range_from_dom): New.
470 * gimple-range-cache.h (ranger_cache::range_from_dom): Add prototype.
472 2021-12-06 Andrew MacLeod <amacleod@redhat.com>
474 * gimple-range-gori.h (class gori_compute): Add prototypes.
475 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Add alternate
476 API for basic block. Call for edge alternative.
477 (gori_compute::may_recompute_p): Ditto.
479 2021-12-06 Richard Biener <rguenther@suse.de>
481 PR tree-optimization/103581
482 * tree-vect-stmts.c (vect_build_gather_load_calls): Properly
483 guard all the AVX512 mask cases.
485 2021-12-06 Richard Biener <rguenther@suse.de>
487 PR tree-optimization/103544
488 * tree-vect-slp.c (vect_analyze_slp): Only add a SLP reduction
489 opportunity if the stmt in question is the reduction root.
490 (dot_slp_tree): Add missing check for NULL child.
492 2021-12-06 Jakub Jelinek <jakub@redhat.com>
495 * config/avr/avr.c (avr_output_data_section_asm_op,
496 avr_output_bss_section_asm_op): Change argument type from const void *
499 2021-12-06 Tamar Christina <tamar.christina@arm.com>
501 PR rtl-optimization/103404
502 * cse.c (find_sets_in_insn): Don't select elements out of a V1 mode
505 2021-12-06 Hongtao Liu <Hongtao.liu@intel.com>
506 Uroš Bizjak <ubizjak@gmail.com>
509 * config/i386/i386.c (ix86_preferred_reload_class): Allow
510 integer regs when moves between register units are cheap.
511 * config/i386/i386.h (INT_SSE_CLASS_P): New.
513 2021-12-05 David Edelsohn <dje.gcc@gmail.com>
515 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
516 AIX math builtin initialization before new_builtins_are_live.
518 2021-12-04 Jakub Jelinek <jakub@redhat.com>
520 * config/i386/x86-tune.def (X86_TUNE_PARTIAL_REG_DEPENDENCY): Fix
521 comment typo, Preffer -> prefer.
522 * ipa-modref-tree.c (modref_access_node::closer_pair_p): Likewise.
524 2021-12-04 Alexandre Oliva <oliva@adacore.com>
526 PR rtl-optimization/103028
527 * ifcvt.c (find_cond_trap): Validate new insns more strictly
530 2021-12-03 Martin Liska <mliska@suse.cz>
531 Peter Bergner <bergner@linux.ibm.com>
534 * config/rs6000/rs6000.c (rs6000_option_override_internal): Move the
535 disabling of shrink-wrapping when using -mrop-protect from here...
536 (rs6000_override_options_after_change): ...to here.
538 2021-12-03 H.J. Lu <hjl.tools@gmail.com>
541 * config/i386/i386-expand.c (ix86_expand_builtin): Pass PVW_NONE
542 and PVW_NONE to ix86_target_string.
543 * config/i386/i386-options.c (ix86_target_string): Add arguments
544 for move_max and store_max.
545 (ix86_target_string::add_vector_width): New lambda.
546 (ix86_debug_options): Pass ix86_move_max and ix86_store_max to
548 (ix86_function_specific_print): Pass ptr->x_ix86_move_max and
549 ptr->x_ix86_store_max to ix86_target_string.
550 (ix86_valid_target_attribute_tree): Handle x_ix86_move_max and
552 (ix86_option_override_internal): Set the default x_ix86_move_max
553 and x_ix86_store_max.
554 * config/i386/i386-options.h (ix86_target_string): Add
555 prefer_vector_width and prefer_vector_width.
556 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): Removed.
557 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
558 (MOVE_MAX): Use 64 if ix86_move_max or ix86_store_max ==
559 PVW_AVX512. Use 32 if ix86_move_max or ix86_store_max >=
561 (STORE_MAX_PIECES): Use 64 if ix86_store_max == PVW_AVX512.
562 Use 32 if ix86_store_max >= PVW_AVX256.
563 * config/i386/i386.opt: Add -mmove-max=bits and -mstore-max=bits.
564 * config/i386/x86-tune.def (X86_TUNE_AVX512_MOVE_BY_PIECES): New.
565 (X86_TUNE_AVX512_STORE_BY_PIECES): Likewise.
566 * doc/invoke.texi: Document -mmove-max=bits and -mstore-max=bits.
568 2021-12-03 Bill Schmidt <wschmidt@linux.ibm.com>
570 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Fix builtin
573 2021-12-03 SiYu Wu <siyu@isrc.iscas.ac.cn>
575 * common/config/riscv/riscv-common.c (riscv_implied_info):
576 Add K-ext related entry.
577 (riscv_supported_std_ext): Add 'k'.
578 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add 'k'.
579 (IMPLIED_EXT): Add K-ext related entry.
581 2021-12-03 SiYu Wu <siyu@isrc.iscas.ac.cn>
583 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
585 * config/riscv/riscv-opts.h (MASK_ZBKB): New.
595 (TARGET_ZBKB): Ditto.
596 (TARGET_ZBKC): Ditto.
597 (TARGET_ZBKX): Ditto.
598 (TARGET_ZKNE): Ditto.
599 (TARGET_ZKND): Ditto.
600 (TARGET_ZKNH): Ditto.
602 (TARGET_ZKSED): Ditto.
603 (TARGET_ZKSH): Ditto.
605 * config/riscv/riscv.opt (riscv_zk_subext): New.
607 2021-12-03 Tamar Christina <tamar.christina@arm.com>
609 * tree-vect-stmts.c (prepare_load_store_mask): Rename to...
610 (prepare_vec_mask): ...this and record operations that have already been
612 (vectorizable_call): Use it.
613 (vectorizable_operation): Likewise.
614 (vectorizable_store): Likewise.
615 (vectorizable_load): Likewise.
616 * tree-vectorizer.h (class _loop_vec_info): Add vec_cond_masked_set.
617 (vec_cond_masked_set_type, tree_cond_mask_hash): New.
619 2021-12-03 H.J. Lu <hjl.tools@gmail.com>
623 * target.def (ifunc_ref_local_ok): Add a target hook.
624 * varasm.c (default_binds_local_p_3): Force indirect function
625 resolver non-local only if targetm.ifunc_ref_local_ok returns
627 * config/i386/i386-expand.c (ix86_expand_call): Call
628 ix86_call_use_plt_p to check if PLT should be used.
629 * config/i386/i386-protos.h (ix86_call_use_plt_p): New.
630 * config/i386/i386.c (output_pic_addr_const): Call
631 ix86_call_use_plt_p to check if "@PLT" is needed.
632 (ix86_call_use_plt_p): New.
633 (TARGET_IFUNC_REF_LOCAL_OK): New.
634 * doc/tm.texi.in: Add TARGET_IFUNC_REF_LOCAL_OK.
635 * doc/tm.texi: Regenerated.
637 2021-12-03 Jakub Jelinek <jakub@redhat.com>
639 * attribs.h (simple_cst_list_equal): Declare.
640 * attribs.c (simple_cst_list_equal): No longer static.
641 * config/i386/i386-options.c (target_attribute_cache): New variable.
642 (ix86_valid_target_attribute_p): Cache DECL_FUNCTION_SPECIFIC_TARGET
643 and DECL_FUNCTION_SPECIFIC_OPTIMIZATION based on args.
645 2021-12-03 Jakub Jelinek <jakub@redhat.com>
648 * ggc.h (gt_pch_note_callback): Declare.
649 * gengtype.h (enum typekind): Add TYPE_CALLBACK.
650 (callback_type): Declare.
651 * gengtype.c (dbgprint_count_type_at): Handle TYPE_CALLBACK.
652 (callback_type): New variable.
653 (process_gc_options): Add CALLBACK argument, handle callback
655 (set_gc_used_type): Adjust process_gc_options caller, if callback,
656 set type to &callback_type.
657 (output_mangled_typename): Handle TYPE_CALLBACK.
658 (walk_type): Likewise. Handle callback option.
659 (write_types_process_field): Handle TYPE_CALLBACK.
660 (write_types_local_user_process_field): Likewise.
661 (write_types_local_process_field): Likewise.
662 (write_root): Likewise.
663 (dump_typekind): Likewise.
664 (dump_type): Likewise.
665 * gengtype-state.c (type_lineloc): Handle TYPE_CALLBACK.
666 (state_writer::write_state_callback_type): New method.
667 (state_writer::write_state_type): Handle TYPE_CALLBACK.
668 (read_state_callback_type): New function.
669 (read_state_type): Handle TYPE_CALLBACK.
670 * ggc-common.c (callback_vec): New variable.
671 (gt_pch_note_callback): New function.
672 (gt_pch_save): Stream out gt_pch_save function address and relocation
674 (gt_pch_restore): Stream in saved gt_pch_save function address and
675 relocation table and apply relocations if needed.
676 * doc/gty.texi (callback): Document new GTY option.
677 * varasm.c (get_unnamed_section): Change callback argument's type and
678 last argument's type from const void * to const char *.
679 (output_section_asm_op): Change argument's type from const void *
680 to const char *, remove unnecessary cast.
681 * tree-core.h (struct tree_translation_unit_decl): Drop GTY((skip))
682 from language member.
683 * output.h (unnamed_section_callback): Change argument type from
684 const void * to const char *.
685 (struct unnamed_section): Use GTY((callback)) instead of GTY((skip))
686 for callback member. Change data member type from const void *
688 (struct noswitch_section): Use GTY((callback)) instead of GTY((skip))
690 (get_unnamed_section): Change callback argument's type and
691 last argument's type from const void * to const char *.
692 (output_section_asm_op): Change argument's type from const void *
694 * config/avr/avr.c (avr_output_progmem_section_asm_op): Likewise.
695 Remove unneeded cast.
696 * config/darwin.c (output_objc_section_asm_op): Change argument's type
697 from const void * to const char *.
698 * config/pa/pa.c (som_output_text_section_asm_op): Likewise.
699 (som_output_comdat_data_section_asm_op): Likewise.
700 * config/rs6000/rs6000.c (rs6000_elf_output_toc_section_asm_op):
702 (rs6000_xcoff_output_readonly_section_asm_op): Likewise. Instead
703 of dereferencing directive hardcode variable names and decide based on
704 whether directive is NULL or not.
705 (rs6000_xcoff_output_readwrite_section_asm_op): Change argument's type
706 from const void * to const char *.
707 (rs6000_xcoff_output_tls_section_asm_op): Likewise. Instead
708 of dereferencing directive hardcode variable names and decide based on
709 whether directive is NULL or not.
710 (rs6000_xcoff_output_toc_section_asm_op): Change argument's type
711 from const void * to const char *.
712 (rs6000_xcoff_asm_init_sections): Adjust get_unnamed_section callers.
714 2021-12-03 Richard Purdie <richard.purdie@linuxfoundation.org>
716 * Makefile.in: Fix "argument list too long" from install-plugins.
718 2021-12-03 Matt Jacobson <mhjacobson@me.com>
720 * config.gcc: For the AVR target, populate TM_MULTILIB_CONFIG.
721 * config/avr/genmultilib.awk: Add ability to filter generated multilib
723 * config/avr/t-avr: Pass TM_MULTILIB_CONFIG to genmultilib.awk.
724 * configure.ac: Update help string for --with-multilib-list.
725 * configure: Regenerate.
727 2021-12-02 Ilya Lipnitskiy <ilya.lipnitskiy@gmail.com>
729 * configure: Regenerate.
730 * configure.ac: Define TARGET_LIBC_GNUSTACK on musl.
732 2021-12-02 Iain Sandoe <iain@sandoe.co.uk>
735 * config/host-darwin.c (TRY_EMPTY_VM_SPACE,
736 SAFE_ALLOC_SIZE): New.
737 (darwin_gt_pch_get_address): Rewrite to use nominated
738 memory segments rather than part of the compiler __DATA
740 (darwin_gt_pch_use_address): Likewise.
742 2021-12-02 Gerald Pfeifer <gerald@pfeifer.com>
744 * doc/install.texi (*-*-freebsd*): Remove references to
745 FreeBSD 1 and FreeBSD 2.
747 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
749 PR rtl-optimization/103437
750 * ira-color.c (setup_allocno_priorities): Switch off backup code
751 for overflow if compiler has __builtin_smul_overflow. Use <
752 for comparison with -INT_MAX.
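
The entry above mentions using __builtin_smul_overflow when the compiler has it, with a separate backup path otherwise. The following is only a sketch of that general pattern, not the code in ira-color.c: the function name saturating_mul, the HAVE_SMUL_OVERFLOW macro and the choice to clamp results to [-INT_MAX, INT_MAX] are assumptions made for illustration, and the fallback assumes long long is wider than int.

```c
#include <limits.h>

/* Detect the builtin where the compiler lets us ask for it.  */
#if defined __has_builtin
# if __has_builtin (__builtin_smul_overflow)
#  define HAVE_SMUL_OVERFLOW 1
# endif
#endif

/* Hypothetical helper: multiply A by B, clamping the result to the
   range [-INT_MAX, INT_MAX] instead of overflowing.  */
static int
saturating_mul (int a, int b)
{
#ifdef HAVE_SMUL_OVERFLOW
  int result;
  if (__builtin_smul_overflow (a, b, &result))
    /* Overflow: clamp according to the sign of the true product.  */
    return ((a < 0) != (b < 0)) ? -INT_MAX : INT_MAX;
  /* INT_MIN is representable but lies below -INT_MAX; clamp it too.  */
  return result < -INT_MAX ? -INT_MAX : result;
#else
  /* Backup path: do the multiplication in a wider type.  */
  long long wide = (long long) a * b;
  if (wide > INT_MAX)
    return INT_MAX;
  if (wide < -INT_MAX)
    return -INT_MAX;
  return (int) wide;
#endif
}
```

Both paths give the same answers, so a caller does not need to know which one was compiled in.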
754 2021-12-02 Bill Schmidt <wschmidt@linux.ibm.com>
756 * config/rs6000/altivec.h: Delete a number of #defines that are now
757 superfluous. Alphabetize. Include rs6000-vecdefines.h. Include some
759 * config/rs6000/rs6000-builtin-new.def (CMPB): Flag as no32bit.
760 (BPERMD): Flag as 32bit (needing special handling for 32-bit).
761 (UNPACK_TD): Return unsigned long long instead of unsigned long.
762 (GET_TEXASR): Return unsigned long instead of unsigned long long.
763 (GET_TEXASRU): Likewise.
764 (GET_TFHAR): Likewise.
765 (GET_TFIAR): Likewise.
766 (SET_TEXASR): Pass unsigned long instead of unsigned long long.
767 (SET_TEXASRU): Likewise.
768 (SET_TFHAR): Likewise.
769 (SET_TFIAR): Likewise.
770 (TABORTDC): Likewise.
771 (TABORTDCI): Likewise.
772 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): Fix error
773 handling for no32bit. Add 32bit handling for RS6000_BIF_BPERMD.
774 * config/rs6000/rs6000-gen-builtins.c (write_init_file): Initialize
775 new_builtins_are_live to 1.
777 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
779 PR rtl-optimization/103437
780 * ira-color.c (setup_allocno_priorities): Use long long
781 multiplication as backup for overflow processing.
783 2021-12-02 Tamar Christina <tamar.christina@arm.com>
785 * config/aarch64/aarch64-protos.h (aarch64_gen_shareable_zero): New.
786 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>,
787 aarch64_rshrn2<mode>): Generate rounding half-ing add when appropriate.
788 * config/aarch64/aarch64.c (aarch64_gen_shareable_zero): New.
790 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
792 PR rtl-optimization/103437
793 * ira-color.c (setup_allocno_priorities): Process multiplication
796 2021-12-02 Martin Liska <mliska@suse.cz>
798 PR gcov-profile/96092
799 * common.opt: New option.
800 * coverage.c (coverage_begin_function): Emit filename with
801 remap_profile_filename.
802 * doc/invoke.texi: Document the new option.
803 * file-prefix-map.c (add_profile_prefix_map): New.
804 (remap_profile_filename): Likewise.
805 * file-prefix-map.h (add_profile_prefix_map): Likewise.
806 (remap_profile_filename): Likewise.
807 * lto-opts.c (lto_write_options): Handle
808 OPT_fprofile_prefix_map_.
809 * opts-global.c (handle_common_deferred_options): Likewise.
810 * opts.c (common_handle_option): Likewise.
811 (gen_command_line_string): Likewise.
812 * profile.c (output_location): Emit filename with
813 remap_profile_filename.
815 2021-12-02 Richard Biener <rguenther@suse.de>
818 * internal-fn.c (expand_DEFERRED_INIT): When the base
819 of the LHS is a decl with matching constant size use
820 that as the initialization target instead of an
821 eventual VLA typed one.
823 2021-12-02 Richard Biener <rguenther@suse.de>
825 PR tree-optimization/103527
826 * tree-vect-stmts.c (vect_build_gather_load_calls): Always
827 use a truth type for building the vector mask.
829 2021-12-02 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
831 * config/vxworks.h (LTO_PLUGIN_SPEC): Adapt to corrected
832 spelling of -linker-output-auto-nolto-rel.
834 2021-12-01 Uroš Bizjak <ubizjak@gmail.com>
837 * config/i386/sse.md (VI2F): Remove mode iterator.
838 (VI2F_256_512): New mode iterator.
839 (vec_set<V8_128:mode>_0): New insn pattern.
840 (vec_set<VI2F_256_512:mode>_0): Rename from vec_set<VI2F:mode>mode.
841 Use VI2F_256_512 mode iterator instead of VI2F.
842 (*avx512fp16_movsh): Remove.
843 (<sse2p4_1>_pinsr<ssemodesuffix>): Add (x,x,x) AVX2 alternative.
844 Do not disable V8HF mode insn on AVX2 targets.
845 (pinsrw -> pbroadcast + pblendw peephole2): New peephole.
846 (pinsrw -> pbroadcast + pblendw splitter): New post-reload splitter.
847 * config/i386/i386.md (extendhfsf): Call gen_vec_setv8hf_0.
848 * config/i386/i386-expand.c (ix86_expand_vector_set)
849 <case E_V8HFmode>: Use vec_merge path for TARGET_AVX2.
851 2021-12-01 Roger Sayle <roger@nextmovesoftware.com>
852 Richard Biener <rguenther@suse.de>
854 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
855 Check if simplify_using_initial_conditions allows us to
856 simplify the expression for may_be_zero.
857 * match.pd (X != C ? -X : -C -> -X): New transform.
858 (X != C ? ~X : ~C -> ~X): Likewise.
859 ((X+1) > Y ? -X : 1 -> X >= Y ? -X : 1): Likewise.
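As a rough illustration of the three match.pd transforms listed above, the following C sketch writes each original expression next to its simplified form so the two can be compared. It is not taken from the GCC sources: all function names are hypothetical, and the comparison is restricted to small values so that no signed overflow occurs (the third transform relies on X+1 not overflowing).

```c
#include <assert.h>

/* X != C ? -X : -C  ->  -X.  When X == C, -C equals -X anyway.  */
static int neg_sel_before (int x, int c) { return x != c ? -x : -c; }
static int neg_sel_after (int x, int c) { (void) c; return -x; }

/* X != C ? ~X : ~C  ->  ~X.  Same reasoning as above.  */
static int not_sel_before (int x, int c) { return x != c ? ~x : ~c; }
static int not_sel_after (int x, int c) { (void) c; return ~x; }

/* (X+1) > Y ? -X : 1  ->  X >= Y ? -X : 1.
   For integers, X+1 > Y is equivalent to X >= Y if X+1 does not wrap.  */
static int cmp_sel_before (int x, int y) { return (x + 1) > y ? -x : 1; }
static int cmp_sel_after (int x, int y) { return x >= y ? -x : 1; }

/* Return 1 if every before/after pair agrees for all operand
   combinations in [-lim, lim], 0 otherwise.  */
static int
transforms_agree (int lim)
{
  assert (lim >= 0 && lim < 1000);
  for (int a = -lim; a <= lim; a++)
    for (int b = -lim; b <= lim; b++)
      if (neg_sel_before (a, b) != neg_sel_after (a, b)
          || not_sel_before (a, b) != not_sel_after (a, b)
          || cmp_sel_before (a, b) != cmp_sel_after (a, b))
        return 0;
  return 1;
}
```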
861 2021-12-01 Peter Bergner <bergner@linux.ibm.com>
864 * gimplify.c (is_var_need_auto_init): Handle opaque types.
866 2021-12-01 Iain Buclaw <ibuclaw@gdcproject.org>
868 * common.opt (static-libphobos): Add option.
869 * config/darwin.h (LINK_SPEC): Substitute -lgphobos with libgphobos.a
870 when linking statically.
871 * gcc.c (driver_handle_option): Set -static-libphobos as always valid.
873 2021-12-01 Jason Merrill <jason@redhat.com>
876 * fold-const.c (maybe_nonzero_address): Use get_create or get
877 depending on folding_initializer.
878 (fold_binary_initializer_loc): New.
879 * fold-const.h (fold_binary_initializer_loc): Declare.
880 * varasm.c (mark_weak): Don't use the decl location.
881 * doc/invoke.texi: Document -fconstexpr-fp-except.
883 2021-12-01 Bill Schmidt <wschmidt@linux.ibm.com>
886 * config/rs6000/rs6000-c.c (rs6000-builtins.h): Stop including.
887 (rs6000-internal.h): Include.
888 (altivec_resolve_new_overloaded_builtin): Move call to
889 rs6000_invalid_new_builtin here from rs6000_new_builtin_decl.
890 * config/rs6000/rs6000-call.c (rs6000-builtins.h): Stop including.
891 (rs6000_invalid_new_builtin): Remove static qualifier.
892 (rs6000_new_builtin_decl): Remove test for supported builtin.
893 * config/rs6000/rs6000-internal.h (rs6000-builtins.h): Include.
894 (rs6000_invalid_new_builtin): Declare.
895 * config/rs6000/rs6000.c (rs6000-builtins.h): Don't include.
897 2021-12-01 Aldy Hernandez <aldyh@redhat.com>
899 PR tree-optimization/103409
900 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
901 Do all the work with just one ssa_global_cache.
902 * gimple-range-path.h: Remove m_tmp_phi_cache.
904 2021-12-01 Richard Sandiford <richard.sandiford@arm.com>
906 PR tree-optimization/103517
907 * tree-vect-slp.c (vect_build_slp_tree_1): When allowing two
908 different component references, check the codes of both of them,
909 rather than just the first.
911 2021-12-01 Andrew MacLeod <amacleod@redhat.com>
913 PR tree-optimization/103464
914 * gimple-range.cc (gimple_ranger::prefill_name): Process phis also.
915 (gimple_ranger::prefill_stmt_dependencies): Ditto.
917 2021-12-01 Iain Buclaw <ibuclaw@gdcproject.org>
919 * doc/install.texi (Prerequisites): Add note that D front end now
920 requires GDC installed in order to bootstrap.
921 (Building): Add D compiler section, referencing prerequisites.
923 2021-12-01 Jakub Jelinek <jakub@redhat.com>
925 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE,
926 X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY,
927 X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Formatting fixes.
928 (X86_TUNE_USE_GATHER): Put m_GENERIC last for consistency.
930 2021-12-01 Jakub Jelinek <jakub@redhat.com>
932 PR rtl-optimization/102356
933 * rtl.h (simplify_context): Add assoc_count member and
934 max_assoc_count static member.
935 * simplify-rtx.c (simplify_associative_operation): Don't reassociate
936 more than max_assoc_count times within one outermost simplify_* call.
937 * dwarf2out.c (mem_loc_descriptor): Optimize binary operation
938 with both operands the same using DW_OP_dup.
940 2021-12-01 Tamar Christina <tamar.christina@arm.com>
942 * match.pd: Move below pattern that rewrites to EQ, NE.
943 * tree.c (bitmask_inv_cst_vector_p): Correct do .. while indentation.
945 2021-12-01 Siddhesh Poyarekar <siddhesh@gotplt.org>
946 Martin Liška <mliska@suse.cz>
948 PR tree-optimization/103456
949 * tree-object-size.c (merge_object_sizes): Update osi->changed
950 only if object_sizes_set succeeded.
952 2021-11-30 liuhongt <hongtao.liu@intel.com>
956 * config/i386/i386.md (*x86_64_shld_1): Set_attr
957 length_immediate to 1.
958 (*x86_shld_1): Ditto.
959 (*x86_64_shrd_1): Ditto.
960 (*x86_shrd_1): Ditto.
962 2021-11-30 Bill Schmidt <wschmidt@linux.ibm.com>
964 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
966 (altivec_resolve_new_overloaded_builtin): Likewise.
968 2021-11-30 Martin Jambor <mjambor@suse.cz>
971 * ipa-sra.c (scan_function): Also check ECF_LOOPING_CONST_OR_PURE flag.
973 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
975 PR tree-optimization/103494
976 * tree-vect-stmts.c (vect_get_gather_scatter_ops): Remove ncopies
977 argument and calculate ncopies from gs_info->offset_vectype
979 (vectorizable_store, vectorizable_load): Update accordingly.
981 2021-11-30 Iain Buclaw <ibuclaw@gdcproject.org>
983 * Makefile.in (GDC): New variable.
984 (GDCFLAGS): New variable.
985 * configure: Regenerate.
986 * configure.ac: Add call to ACX_PROG_GDC. Substitute GDCFLAGS.
988 2021-11-30 Martin Liska <mliska@suse.cz>
989 Martin Jambor <mjambor@suse.cz>
992 * ipa-param-manipulation.c
993 (ipa_param_body_adjustments::prepare_debug_expressions): Be
994 careful about hash_map reallocating itself. Simplify a return
995 which always returns true.
997 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
999 PR tree-optimization/103440
1000 * gimple-range-fold.cc (fold_using_range::range_of_phi): Continue
1001 normal param processing for equiv params.
1003 2021-11-30 Richard Biener <rguenther@suse.de>
1005 * config/i386/i386.c (ix86_shift_rotate_cost): Remove
1007 * tree-chrec.c (evolution_function_is_invariant_rec_p):
1009 * tree-if-conv.c (if_convertible_stmt_p): Likewise.
1010 * tree-ssa-pre.c (fully_constant_expression): Likewise.
1011 * tree-vrp.c (operand_less_p): Likewise.
1012 * reload.c (reg_overlap_mentioned_for_reload_p): Remove
1013 unreachable gcc_unreachable ().
1014 * sel-sched-ir.h (bb_next_bb): Likewise.
1015 * varasm.c (compare_constant): Likewise.
1017 2021-11-30 Richard Biener <rguenther@suse.de>
1019 PR tree-optimization/103489
1020 * tree-vect-loop.c (vectorizable_phi): Verify argument
1021 vector type compatibility to mitigate bool pattern recog
1024 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
1026 * omp-offload.c (oacc_loop_auto_partitions): Remove erroneous
1027 "Orphan reductions cannot have gang partitioning" handling.
1029 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
1031 * omp-offload.c (oacc_loop_process): Implement "gang reduction on
1032 an orphan loop" checking.
1034 2021-11-30 Cesar Philippidis <cesar@codesourcery.com>
1035 Thomas Schwinge <thomas@codesourcery.com>
1037 * omp-general.h (enum oacc_loop_flags): Add OLF_REDUCTION enum.
1038 * omp-low.c (lower_oacc_head_mark): Use it to mark OpenACC
1040 * omp-offload.c (oacc_loop_auto_partitions): Don't assign gang
1041 level parallelism to orphan reductions.
1043 2021-11-30 Aldy Hernandez <aldyh@redhat.com>
1045 PR tree-optimization/103451
1046 * range-op.cc (operator_div::wi_fold): Remove
1047 can_throw_non_call_exceptions special case.
1048 * tree-ssa-sink.c (sink_code_in_bb): Same.
1050 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1052 * tree-vect-slp.c (arg1_arg4_map): New variable.
1053 (vect_get_operand_map): Handle IFN_MASK_GATHER_LOAD.
1054 (vect_build_slp_tree_1): Likewise.
1055 (vect_build_slp_tree_2): Likewise.
1056 * tree-vect-stmts.c (vectorizable_load): Expect the mask to be
1057 the last SLP child node rather than the first.
1059 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1061 * tree-if-conv.c: Include tree-eh.h.
1062 (predicate_statements): Remove pe argument. Don't hoist
1064 (combine_blocks): Remove pe argument.
1065 (ifcvt_available_on_edge_p, ifcvt_can_hoist): New functions.
1066 (ifcvt_hoist_invariants): Likewise.
1067 (tree_if_conversion): Update call to combine_blocks. Call
1068 ifcvt_hoist_invariants after VN.
1070 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1072 PR tree-optimization/102467
1073 * doc/sourcebuild.texi (vect_gather_load_ifn): Document.
1074 * tree-vect-data-refs.c (vect_analyze_data_ref_dependence):
1075 Commonize safelen handling. Punt for anything involving
1076 gathers and scatters unless safelen says otherwise.
1077 * tree-vect-slp.c (arg1_map): New variable.
1078 (vect_get_operand_map): Handle IFN_GATHER_LOAD.
1079 (vect_build_slp_tree_1): Likewise.
1080 (vect_build_slp_tree_2): Likewise.
1081 (compatible_calls_p): If vect_get_operand_map returns nonnull,
1082 check that any skipped arguments are equal.
1083 (vect_slp_analyze_node_operations_1): Tighten reduction check.
1084 * tree-vect-stmts.c (check_load_store_for_partial_vectors): Take
1085 an ncopies argument.
1086 (vect_get_gather_scatter_ops): Take slp_node and ncopies arguments.
1088 (vectorizable_store, vectorizable_load): Adjust accordingly.
1090 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1092 * gimple.h (gimple_num_args, gimple_arg, gimple_arg_ptr): New
1094 * tree-vect-slp.c (cond_expr_maps, arg2_map): New variables.
1095 (vect_get_operand_map): New function.
1096 (vect_get_and_check_slp_defs): Fix outdated comment.
1097 Use vect_get_operand_map and new gimple argument accessors.
1098 (vect_build_slp_tree_2): Likewise.
1100 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1102 * tree-vect-slp.c (vect_build_slp_tree_1): Use code_helper
1103 to record the operations performed by statements, only using
1104 CALL_EXPR for things that don't map to built-in or internal
1105 functions. For shifts, require all shift amounts to be equal
1106 if optab_vector is not supported but optab_scalar is.
1108 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1110 * tree-vect-data-refs.c (vect_check_gather_scatter): Continue
1111 processing conversions if the current offset is a pointer.
1113 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1115 * tree-vectorizer.h (vect_is_reduction): Use STMT_VINFO_REDUC_IDX.
1117 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1119 * genopinit.c (main): Turn supports_vec_gather_load and
1120 supports_vec_scatter_store into signed char arrays and remove
1121 supports_vec_gather_load_cached and supports_vec_scatter_store_cached.
1122 * optabs-query.c (supports_vec_convert_optab_p): Add a mode parameter.
1123 If the mode is not VOIDmode, test only for that mode.
1124 (supports_vec_gather_load_p): Likewise.
1125 (supports_vec_scatter_store_p): Likewise.
1126 * optabs-query.h (supports_vec_gather_load_p): Likewise.
1127 (supports_vec_scatter_store_p): Likewise.
1128 * tree-vect-data-refs.c (vect_check_gather_scatter): Pass the
1129 vector mode to supports_vec_gather_load_p and
1130 supports_vec_scatter_store_p.
1132 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1134 * internal-fn.c (commutative_binary_fn_p): Handle IFN_ADD_OVERFLOW
1135 and IFN_MUL_OVERFLOW.
1137 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1139 * internal-fn.c (commutative_binary_fn_p): Handle IFN_UBSAN_CHECK_ADD
1140 and IFN_UBSAN_CHECK_MUL.
1142 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1144 * internal-fn.c (commutative_binary_fn_p): Handle IFN_COMPLEX_MUL.
1146 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1148 * gimple-fold.c: Include internal-fn.h.
1149 (fold_stmt_1): If a function maps to an internal one, use
1150 first_commutative_argument to canonicalize the order of
1151 commutative arguments.
1152 * gimple-match-head.c (gimple_resimplify2, gimple_resimplify3)
1153 (gimple_resimplify4, gimple_resimplify5): Extend commutativity
1154 checks to functions.
1156 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1158 * doc/md.texi (reduc_fmin_scal_@var{m}): Document.
1159 (reduc_fmax_scal_@var{m}): Likewise.
1160 * optabs.def (reduc_fmax_scal_optab): New optab.
1161 (reduc_fmin_scal_optab): Likewise.
1162 * internal-fn.def (REDUC_FMAX, REDUC_FMIN): New functions.
1163 * tree-vect-loop.c (reduction_fn_for_scalar_code): Handle
1164 CASE_CFN_FMAX and CASE_CFN_FMIN.
1165 (neutral_op_for_reduction): Likewise.
1166 (needs_fold_left_reduction_p): Likewise.
1167 * config/aarch64/iterators.md (FMAXMINV): New iterator.
1168 (fmaxmin): Handle UNSPEC_FMAXNMV and UNSPEC_FMINNMV.
1169 * config/aarch64/aarch64-simd.md (reduc_<optab>_scal_<mode>): Fix
1171 (reduc_<fmaxmin>_scal_<mode>): New pattern.
1172 * config/aarch64/aarch64-sve.md (reduc_<fmaxmin>_scal_<mode>):
1175 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1177 * builtins.h (associated_internal_fn): Declare overload that
1178 takes a (combined_cfn, return type) pair.
1179 * builtins.c (associated_internal_fn): Split new overload out
1180 of original fndecl version. Also provide an overload that takes
1181 a (combined_cfn, return type) pair.
1182 * internal-fn.h (commutative_binary_fn_p): Declare.
1183 (commutative_ternary_fn_p): Likewise.
1184 (associative_binary_fn_p): Likewise.
1185 * internal-fn.c (commutative_binary_fn_p, commutative_ternary_fn_p):
1186 New functions, split out from...
1187 (first_commutative_argument): ...here.
1188 (associative_binary_fn_p): New function.
1189 * gimple-match.h (code_helper): Add a constructor that takes
1191 (commutative_binary_op_p): Declare.
1192 (commutative_ternary_op_p): Likewise.
1193 (first_commutative_argument): Likewise.
1194 (associative_binary_op_p): Likewise.
1195 (canonicalize_code): Likewise.
1196 (directly_supported_p): Likewise.
1197 (get_conditional_internal_fn): Likewise.
1198 (gimple_build): New overloads that take a code_helper.
1199 * gimple-fold.c (gimple_build): Likewise.
1200 * gimple-match-head.c (commutative_binary_op_p): New function.
1201 (commutative_ternary_op_p): Likewise.
1202 (first_commutative_argument): Likewise.
1203 (associative_binary_op_p): Likewise.
1204 (canonicalize_code): Likewise.
1205 (directly_supported_p): Likewise.
1206 (get_conditional_internal_fn): Likewise.
1207 * tree-vectorizer.h: Include gimple-match.h.
1208 (neutral_op_for_reduction): Take a code_helper instead of a tree_code.
1209 (needs_fold_left_reduction_p): Likewise.
1210 (reduction_fn_for_scalar_code): Likewise.
1211 (vect_can_vectorize_without_simd_p): Declare a new overload that takes
1213 * tree-vect-loop.c: Include case-cfn-macros.h.
1214 (fold_left_reduction_fn): Take a code_helper instead of a tree_code.
1215 (reduction_fn_for_scalar_code): Likewise.
1216 (neutral_op_for_reduction): Likewise.
1217 (needs_fold_left_reduction_p): Likewise.
1218 (use_mask_by_cond_expr_p): Likewise.
1219 (build_vect_cond_expr): Likewise.
1220 (vect_create_partial_epilog): Likewise. Use gimple_build rather
1221 than gimple_build_assign.
1222 (check_reduction_path): Handle calls and operate on code_helpers
1223 rather than tree_codes.
1224 (vect_is_simple_reduction): Likewise.
1225 (vect_model_reduction_cost): Likewise.
1226 (vect_find_reusable_accumulator): Likewise.
1227 (vect_create_epilog_for_reduction): Likewise.
1228 (vect_transform_cycle_phi): Likewise.
1229 (vectorizable_reduction): Likewise. Make more use of
1231 (vect_transform_reduction): Use gimple_extract_op but expect
1232 a tree_code for now.
1233 (vect_can_vectorize_without_simd_p): New overload that takes
1235 * tree-vect-stmts.c (vectorizable_call): Handle reductions in
1237 * tree-vect-patterns.c (vect_mark_pattern_stmts): Use
1238 gimple_extract_op when updating STMT_VINFO_REDUC_IDX.
1240 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1242 * gimple-match.h (code_helper): Provide == and != overloads.
1243 (code_helper::operator tree_code): Make explicit.
1244 (code_helper::operator combined_fn): Likewise.
1245 * gimple-match-head.c (convert_conditional_op): Use explicit
1246 conversions where necessary.
1247 (gimple_resimplify1, gimple_resimplify2, gimple_resimplify3): Likewise.
1248 (maybe_push_res_to_seq, gimple_simplify): Likewise.
1249 * gimple-fold.c (replace_stmt_with_simplification): Likewise.
1251 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1253 * gimple-match.h (code_helper): Add functions for querying whether
1254 the code represents an internal_fn or a built_in_function.
1255 Provide explicit conversion operators for both cases.
1256 (gimple_extract_op): Declare.
1257 * gimple-match-head.c (gimple_extract): New function, extracted from...
1258 (gimple_simplify): ...here.
1259 (gimple_extract_op): New function.
1261 2021-11-30 Eric Botcazou <ebotcazou@adacore.com>
1264 * config/i386/i386.c (ix86_output_call_insn): Beef up comment about
1265 nops emitted with SEH.
1266 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
1267 the cold section, emit a nop before the directive if the previous
1268 active instruction is a call.
1270 2021-11-30 Roger Sayle <roger@nextmovesoftware.com>
1271 Uroš Bizjak <ubizjak@gmail.com>
1274 * config/i386/sse.md (define_split): New splitters to simplify
1275 not;vec_duplicate;and as vec_duplicate;andn.
1277 2021-11-30 Richard Biener <rguenther@suse.de>
1279 PR middle-end/103485
1280 * match.pd (-((int)x >> (prec - 1)) to (unsigned)x >> (prec - 1)):
1281 Use VIEW_CONVERT_EXPR for vectors.
1283 2021-11-30 Richard Biener <rguenther@suse.de>
1285 * cfgrtl.c (skip_insns_after_block): Refactor code to
1286 be more easily readable.
1287 * expr.c (op_by_pieces_d::run): Remove unreachable
1289 * sched-deps.c (sched_analyze): Remove unreachable
1291 * sel-sched-ir.c (in_same_ebb_p): Likewise.
1292 * tree-ssa-alias.c (nonoverlapping_refs_since_match_p):
1293 Remove unreachable code.
1294 * tree-vect-slp.c (vectorize_slp_instance_root_stmt):
1295 Refactor to avoid unreachable loop iteration.
1296 * tree.c (walk_tree_1): Remove unreachable break.
1297 * vec-perm-indices.c (vec_perm_indices::series_p): Remove
1300 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
1303 * config/rs6000/rs6000-call.c (rs6000_builtin_decl): Remove builtin mask
1306 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
1308 * config/rs6000/rs6000.c
1309 (rs6000_cost_data::update_target_cost_per_stmt): Adjust the way to
1310 compute extra penalized cost. Remove useless parameter.
1311 (rs6000_cost_data::rs6000_add_stmt_cost): Adjust the call to function
1312 update_target_cost_per_stmt.
1314 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
1317 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
1319 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
1320 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
1321 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
1322 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
1323 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
1324 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
1325 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
1326 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
1328 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
1330 PR tree-optimization/103467
1331 * gimple-range-gori.cc (range_def_chain::register_dependency): Don't
1332 use an object reference after a potential resize.
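The fix above concerns holding a reference into a container across an operation that may resize it. A minimal C sketch of the same hazard and one way to avoid it is shown below; it is not the code in gimple-range-gori.cc, and `struct int_vec`, `vec_push` and `vec_append_successor` are hypothetical names used only for illustration.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical growable array, standing in for any container whose
   storage may move when it grows.  */
struct int_vec
{
  int *data;
  size_t len, cap;
};

/* Append VALUE, growing the storage if needed.  After this call any
   pointer previously taken into V->data may be stale.
   Returns 0 on success, -1 on allocation failure.  */
static int
vec_push (struct int_vec *v, int value)
{
  if (v->len == v->cap)
    {
      size_t ncap = v->cap ? v->cap * 2 : 4;
      int *p = realloc (v->data, ncap * sizeof *p);
      if (!p)
        return -1;
      v->data = p;
      v->cap = ncap;
    }
  v->data[v->len++] = value;
  return 0;
}

/* Append the element at IDX plus one.  The element is copied out
   before the push, so nothing points into the old storage while
   vec_push possibly reallocates it.  */
static int
vec_append_successor (struct int_vec *v, size_t idx)
{
  assert (idx < v->len);
  int value = v->data[idx];
  return vec_push (v, value + 1);
}
```

Keeping an index (or a copied value) rather than a pointer is the design choice here: an index stays valid across reallocation, whereas `&v->data[idx]` would not.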
1334 2021-11-29 Uroš Bizjak <ubizjak@gmail.com>
1337 * config/i386/i386.md (*movhi_internal): Introduce (*v,C) alternative.
1338 Do not allocate non-GPR registers. Optimize xmm->xmm moves when
1339 optimizing for size. Fix vpinsrw insn template.
1340 (*movhf_internal): Fix pinsrw and pextrw insn templates for
1341 AVX targets. Use sselog1 type instead of sselog. Optimize GPR moves.
1342 Optimize xmm->xmm moves for TARGET_SSE_PARTIAL_REG_DEPENDENCY
1343 and TARGET_SSE_SPLIT_REGS targets.
1345 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
1347 * configure: Re-regenerate.
1349 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
1352 * Makefile.in: Substitute CTAGS, ETAGS, and CSCOPE
1353 variables. Use ETAGS variable in TAGS target.
1354 * configure: Regenerate.
1355 * configure.ac: Allow CTAGS, ETAGS, and CSCOPE
1356 variables to be overridden.
1358 2021-11-29 Paul A. Clarke <pc@us.ibm.com>
1360 * config/rs6000/xmmintrin.h (_mm_movemask_ps): Use vec_extractm
1362 * config/rs6000/emmintrin.h (_mm_movemask_pd): Likewise.
1363 (_mm_movemask_epi8): Likewise.
1365 2021-11-29 Richard Biener <rguenther@suse.de>
1367 * read-rtl-function.c (function_reader::read_rtx_operand):
1368 Return only after resetting m_in_call_function_usage.
1370 2021-11-29 Roger Sayle <roger@nextmovesoftware.com>
1372 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti): Perform the
1373 conversion via V2DImode using vec_extractv2didi on TARGET_SSE2.
1374 * config/i386/sse.md (rotlv1ti3, rotrv1ti3): Change constraint
1375 on QImode shift amounts from const_int_operand to general_operand.
1377 2021-11-29 Richard Biener <rguenther@suse.de>
1379 * tree.h (reverse_storage_order_for_component_p): Remove
1380 spurious gcc_unreachable.
1381 * cfganal.c (dfs_find_deadend): Likewise.
1382 * fold-const-call.c (fold_const_logb): Likewise.
1383 (fold_const_significand): Likewise.
1384 * gimple-ssa-store-merging.c (lhs_valid_for_store_merging_p):
1387 2021-11-29 Richard Biener <rguenther@suse.de>
1389 * vec.c (qsort_chk): Do not return the void return value
1390 from the noreturn qsort_chk_error.
1391 * ccmp.c (expand_ccmp_expr_1): Remove unreachable return.
1392 * df-scan.c (df_ref_equal_p): Likewise.
1393 * dwarf2out.c (is_base_type): Likewise.
1394 (add_const_value_attribute): Likewise.
1395 * fixed-value.c (fixed_arithmetic): Likewise.
1396 * gimple-fold.c (gimple_fold_builtin_fputs): Likewise.
1397 * gimple-ssa-strength-reduction.c (stmt_cost): Likewise.
1398 * graphite-isl-ast-to-gimple.c
1399 (gcc_expression_from_isl_expr_op): Likewise.
1400 (gcc_expression_from_isl_expression): Likewise.
1401 * ipa-fnsummary.c (will_be_nonconstant_expr_predicate):
1403 * lto-streamer-in.c (lto_input_mode_table): Likewise.
1405 2021-11-29 liuhongt <hongtao.liu@intel.com>
1408 * config/i386/i386.c (inline_secondary_memory_needed): HImode
1409 move between GPR and SSE registers is supported under
1410 TARGET_SSE2 and above.
1411 * config/i386/i386.md (extendhfsf2): Optimize expander.
1412 (truncsfhf2): Ditto.
1413 * config/i386/sse.md (sse2p4_1): Adjust attr for V8HFmode to
1414 align with V8HImode.
1416 2021-11-29 liuhongt <hongtao.liu@intel.com>
1420 * config/i386/i386.c (ix86_secondary_reload): Without
1421 TARGET_SSE4_1, general register is needed to move HImode from
1422 sse register to memory.
1423 * config/i386/sse.md (*vec_extracthf): Use %vpextrw instead of
1424 pextrw in output templates.
1425 * config/i386/i386.md (movhi_internal): Ditto, also fix typo of
1426 MEM_P (operands[1]) and adjust mode/prefix/type attribute for
1427 alternatives related to sse register.
1429 2021-11-29 Richard Biener <rguenther@suse.de>
1431 PR tree-optimization/103458
1432 * tree-ssa-dce.c (make_forwarders_with_degenerate_phis): Do not
1433 create forwarders for blocks with abnormal predecessors.
1435 2021-11-29 Richard Biener <rguenther@suse.de>
1437 * gimple-predicate-analysis.cc (can_be_invalidated_p):
1438 Restore semantics to the one before the split from
1441 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
1443 * profile.c: Include sreal.h.
1444 (struct bb_stats): New.
1445 (cmp_stats): New function.
1446 (compute_branch_probabilities): Output bb stats.
1448 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
1450 * cfghooks.c: Include sreal.h, profile.h.
1451 (profile_record_check_consistency): Fix checking of count consistency;
1452 record also dynamic mismatches.
1453 * cfgrtl.c (rtl_account_profile_record): Similarly.
1454 * tree-cfg.c (gimple_account_profile_record): Likewise.
1455 * cfghooks.h (struct profile_record): Remove num_mismatched_freq_in,
1456 num_mismatched_freq_out, turn time to double, add
1457 dyn_mismatched_prob_out, dyn_mismatched_count_in,
1458 num_mismatched_prob_out; remove num_mismatched_count_out.
1459 * passes.c (account_profile_1): New function.
1460 (account_profile_in_list): New function.
1461 (pass_manager::dump_profile_report): Rewrite.
1462 (execute_one_ipa_transform_pass): Check profile consistency after
1464 (execute_all_ipa_transforms): Remove cfun test; record all transform
1466 (execute_one_pass): Fix collecting of profile stats.
1468 2021-11-27 Jakub Jelinek <jakub@redhat.com>
1470 PR middle-end/103431
1471 * config/i386/i386.md (x86_64_shld, x86_shld, x86_64_shrd, x86_shrd):
1472 Change insn pattern to accurately describe the instructions.
1473 (*x86_64_shld_1, *x86_shld_1, *x86_64_shrd_1, *x86_shrd_1): New
1474 define_insn patterns.
1475 (*x86_64_shld_2, *x86_shld_2, *x86_64_shrd_2, *x86_shrd_2): New
1476 define_insn_and_split patterns.
1477 (*ashl<dwi>3_doubleword_mask, *ashl<dwi>3_doubleword_mask_1,
1478 *<insn><dwi>3_doubleword_mask, *<insn><dwi>3_doubleword_mask_1,
1479 ix86_rotl<dwi>3_doubleword, ix86_rotr<dwi>3_doubleword): Adjust
1480 splitters for x86_{,64_}sh{l,r}d pattern changes.
1482 2021-11-27 Jakub Jelinek <jakub@redhat.com>
1484 PR tree-optimization/103435
1485 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Avoid UB if
1486 n->range - rsize == 8, just clear both *cmpnop and *cmpxchg in that
1489 2021-11-27 Martin Jambor <mjambor@suse.cz>
1492 * ipa-prop.c (ipcp_transform_function): Call
1493 delete_unreachable_blocks_update_callgraph instead of returning
1496 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
1498 * tree-object-size.c (object_sizes_grow, object_sizes_release,
1499 object_sizes_unknown_p, object_sizes_get, object_size_set_force,
1500 object_sizes_set): New functions.
1501 (addr_object_size, compute_builtin_object_size,
1502 expr_object_size, call_object_size, unknown_object_size,
1503 merge_object_sizes, plus_stmt_object_size,
1504 cond_expr_object_size, collect_object_sizes_for,
1505 check_for_plus_in_loops_1, init_object_sizes,
1506 fini_object_sizes): Adjust.
1508 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
1510 * tree-object-size.c: New enum.
1511 (object_sizes, computed, addr_object_size,
1512 compute_builtin_object_size, expr_object_size, call_object_size,
1513 merge_object_sizes, plus_stmt_object_size,
1514 collect_object_sizes_for, init_object_sizes, fini_object_sizes,
1515 object_sizes_execute): Replace magic numbers with enums.
1517 2021-11-26 Roger Sayle <roger@nextmovesoftware.com>
1519 * tree-ssa-loop-ivopts.c (cand_value_at): Take a class
1520 tree_niter_desc* argument instead of just a tree for NITER.
1521 If we require the iv candidate value at the end of the final
1522 loop iteration, try using the original loop bound as the
1523 NITER for sufficiently simple loops.
1524 (may_eliminate_iv): Update (only) call to cand_value_at.
1526 2021-11-26 Jan Hubicka <hubicka@ucw.cz>
1528 * ipa-modref.c (analyze_function): Drop parameter F and dump
1529 cgraph node name rather than cfun name.
1530 (modref_generate): Update.
1531 (modref_summaries::insert): Update.
1532 (modref_summaries_lto::insert): Update.
1533 (pass_modref::execute): Update.
1534 (ipa_merge_modref_summary_after_inlining): Improve combining of
1537 2021-11-26 Jan Hubicka <jh@suse.cz>
1540 * ipa-modref.c (update_escape_summary_1): Fix handling of min_flags.
1542 2021-11-26 konglin1 <lingling.kong@intel.com>
1545 * config/i386/i386.c (ix86_can_change_mode_class): Allow 16 bit data in XMM register
1547 * config/i386/i386.md (extendhfsf2): Add extendhfsf2 for TARGET_F16C.
1548 (extendhfdf2): Restrict extendhfdf for TARGET_AVX512FP16 only.
1549 (*extendhf<mode>2): Rename from extendhf<mode>2.
1550 (truncsfhf2): Likewise.
1551 (truncdfhf2): Likewise.
1552 (*trunc<mode>2): Likewise.
1554 2021-11-26 liuhongt <hongtao.liu@intel.com>
1556 PR middle-end/103419
1557 * match.pd: Fix typo, use the type of second parameter, not
1560 2021-11-25 Jan Hubicka <jh@suse.cz>
1562 * ipa-cp.c (ipa_value_range_from_jfunc): Remove forgotten early return.
1564 2021-11-25 Roger Sayle <roger@nextmovesoftware.com>
1566 PR middle-end/103406
1567 * match.pd (minus @0 @0): Check tree_expr_maybe_infinite_p.
1569 2021-11-25 Martin Jambor <mjambor@suse.cz>
1572 * ipa-prop.h (ipa_get_param): New overload. Move bits of the existing
1574 * ipa-param-manipulation.h (ipa_param_adjustments): New member
1575 function get_updated_index_or_split.
1576 * ipa-param-manipulation.c
1577 (ipa_param_adjustments::get_updated_index_or_split): New function.
1578 * ipa-prop.c (adjust_agg_replacement_values): Reimplement, add
1579 capability to identify scalarized parameters and perform substitution
1581 (ipcp_transform_function): Create descriptors earlier, handle new
1582 return values of adjust_agg_replacement_values.
1584 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1586 * gimple-range-path.cc (path_range_query::compute_ranges_defined): Remove.
1587 (path_range_query::compute_ranges_in_block): Revert to bitmap order.
1588 * gimple-range-path.h: Remove compute_ranges_defined.
1590 2021-11-25 Andrew Stubbs <ams@codesourcery.com>
1593 * config/gcn/gcn.c (move_callee_saved_registers): Ensure that the
1594 number of spilled registers is counted correctly.
1596 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
1598 * ipa-prop.h (ipa_node_params::ipa_node_params): Initialize
1601 2021-11-25 Andrew MacLeod <amacleod@redhat.com>
1603 PR tree-optimization/103359
1604 * gimple-range-fold.cc (fold_using_range::range_of_phi): If arg is
1605 equivalent to def, don't initially include its range.
1607 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
1609 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not check
1610 gimple_static_chain.
1612 2021-11-25 Richard Biener <rguenther@suse.de>
1614 * cgraphunit.c (symbol_table::output_weakrefs): Remove
1616 (get_alias_symbol): Remove now unused function.
1618 2021-11-25 Richard Biener <rguenther@suse.de>
1620 * cfgrtl.c (rtl_verify_fallthru): Do not stop verifying
1622 (skip_insns_after_block): Remove unreachable break and continue.
1624 2021-11-25 Richard Biener <rguenther@suse.de>
1626 * cfgexpand.c (label_rtx_for_bb): Remove dead loop construct.
1628 2021-11-25 Richard Biener <rguenther@suse.de>
1630 * regset.h (REG_SET_EMPTY_P): New macro.
1631 * cfgcleanup.c (thread_jump): Use REG_SET_EMPTY_P.
1633 2021-11-25 Martin Liska <mliska@suse.cz>
1635 * doc/invoke.texi: Use @option for -Wuninitialized.
1637 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1639 PR tree-optimization/103254
1640 * gimple-range-path.cc (path_range_query::compute_ranges): Move
1641 exported boolean code...
1642 (path_range_query::compute_imports): ...here.
1644 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1646 PR tree-optimization/103254
1647 * gimple-range-path.cc (path_range_query::compute_ranges_defined): New
1648 (path_range_query::compute_ranges_in_block): Move to
1649 compute_ranges_defined.
1650 * gimple-range-path.h (compute_ranges_defined): New.
1652 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1654 PR tree-optimization/103417
1655 * match.pd ((X & Y) CMP 0): Only handle eq and ne. Commonalize
1658 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1660 PR tree-optimization/103376
1661 * gimple-ssa-store-merging.c (perform_symbolic_merge): For
1662 BIT_IOR_EXPR, if masked1 && masked2 && masked1 != masked2, don't
1663 punt, but set the corresponding result byte to MARKER_BYTE_UNKNOWN.
1664 For BIT_XOR_EXPR similarly and if masked1 == masked2 and the
1665 byte isn't MARKER_BYTE_UNKNOWN, set the corresponding result byte to
1668 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1671 * doc/invoke.texi (-Wcomma-subscript): Document that for
1672 -std=c++20 the option isn't enabled by default with -Wno-deprecated
1673 but for -std=c++23 it is.
1675 2021-11-24 Jan Hubicka <hubicka@ucw.cz>
1677 * ipa-modref.c (implicit_eaf_flags_for_edge_and_arg): Break out from...
1678 (modref_merge_call_site_flags): ... here.
1679 (ipa_merge_modref_summary_after_inlining): Use it.
1681 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
1683 * cfgloop.c (verify_loop_structure): Reduce scope of
1684 'class loop *loop' variable.
1685 * ipa-fnsummary.c (analyze_function_body): Likewise.
1686 * loop-init.c (fix_loop_structure): Likewise.
1687 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
1688 * predict.c (predict_loops): Likewise.
1689 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
1690 * tree-vectorizer.c (pass_vectorize::execute): Likewise.
1692 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1694 PR tree-optimization/103231
1695 * gimple-range.cc (gimple_ranger::gimple_ranger): Create stmt stack.
1696 (gimple_ranger::~gimple_ranger): Delete stmt stack.
1697 (gimple_ranger::range_of_stmt): Process dependencies if they have no
1699 (gimple_ranger::prefill_name): New.
1700 (gimple_ranger::prefill_stmt_dependencies): New.
1701 * gimple-range.h (class gimple_ranger): Add prototypes.
1703 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1705 * gimple-range-cache.cc (ranger_cache::get_global_range): Always
1706 return a range, return if it came from the cache or not.
1707 (get_non_stale_global_range): Rename to get_global_range, and return
1708 the temporal state in a flag.
1709 * gimple-range-cache.h (get_non_stale_global_range): Rename and adjust.
1710 * gimple-range.cc (gimple_ranger::range_of_expr): No need to query
1712 (gimple_ranger::range_of_stmt): Adjust for global cache temporal state
1715 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1717 * gimple-range.cc (gimple_ranger::range_on_edge): Call trailer when
1718 a constant is encountered to terminate the trace.
1720 2021-11-24 Maciej W. Rozycki <macro@embecosm.com>
1722 PR middle-end/103059
1723 * reload.c (find_reloads_address_1): Also accept the ASHIFT form
1724 of indexed addressing.
1725 (find_reloads): Adjust accordingly.
1727 2021-11-24 Richard Biener <rguenther@suse.de>
1728 Jan Hubicka <jh@suse.cz>
1730 PR tree-optimization/103168
1731 * ipa-modref.h (struct modref_summary): Add load_accesses.
1732 * ipa-modref.c (modref_summary::finalize): Initialize load_accesses.
1733 * tree-ssa-sccvn.c (visit_reference_op_call): Use modref
1734 info to walk the virtual use->def chain to CSE const/pure
1735 function calls possibly reading from memory.
1737 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
1739 PR middle-end/103244
1740 * gimplify.c (gimplify_adjust_omp_clauses): Restore previous
1743 2021-11-24 Richard Biener <rguenther@suse.de>
1745 PR middle-end/103193
1746 * match.pd: Avoid canonicalizing (le/ge @0 @0) to (eq @0 @0)
1747 with NaNs and -ftrapping-math.
1749 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1751 PR middle-end/103384
1752 * omp-general.c (omp_context_selector_matches): For ACCEL_COMPILER,
1753 return 0 for kind(host) and continue for kind(nohost).
1755 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1757 PR middle-end/103365
1758 * attribs.h (lookup_attribute): Allow attr_name to start with
1759 underscore, as long as canonicalize_attr_name returns false.
1760 (lookup_attribute_by_prefix): Don't call get_attribute_name twice.
1761 * attribs.c (extract_attribute_substring): Reimplement using
1762 canonicalize_attr_name.
1763 (register_scoped_attribute): Change gcc_assert into
1764 gcc_checking_assert, verify !canonicalize_attr_name rather than
1765 that str.str doesn't start with '_'.
1767 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1769 PR tree-optimization/103376
1770 * gimple-ssa-store-merging.c (perform_symbolic_merge): Add CODE
1771 argument. If CODE is not BIT_IOR_EXPR, ensure that one of masked1
1773 (find_bswap_or_nop_1, find_bswap_or_nop,
1774 imm_store_chain_info::try_coalesce_bswap): Adjust
1775 perform_symbolic_merge callers.
1777 2021-11-24 Richard Biener <rguenther@suse.de>
1779 * tree-ssa-loop-ivopts.c (find_givs): Take loop body as
1780 argument instead of re-computing it.
1781 (find_interesting_uses): Likewise.
1782 (find_induction_variables): Pass through loop body.
1783 (tree_ssa_iv_optimize_loop): Pass down loop body.
1785 2021-11-24 Tamar Christina <tamar.christina@arm.com>
1787 * tree-ssa-phiopt.c (spaceship_replacement): Handle new canonical
1790 2021-11-24 Tamar Christina <tamar.christina@arm.com>
1792 * tree.c (bitmask_inv_cst_vector_p): New.
1793 * tree.h (bitmask_inv_cst_vector_p): New.
1794 * match.pd: Use it in new bitmask compare pattern.
1796 2021-11-24 Jason Merrill <jason@redhat.com>
1798 * timevar.h (class auto_cond_timevar): New.
1800 2021-11-24 Hongtao Liu <hongtao.liu@intel.com>
1801 H.J. Lu <hongjiu.lu@intel.com>
1803 PR tree-optimization/103194
1804 * match.pd (gimple_nop_atomic_bit_test_and_p): Extended to
1806 * tree-ssa-ccp.c (gimple_nop_convert): Declare.
1807 (optimize_atomic_bit_test_and): Enhance
1808 optimize_atomic_bit_test_and to handle truncation.
1810 2021-11-23 Martin Sebor <msebor@redhat.com>
1813 * Makefile.in (OBJS): Add gimple-warn-recursion.o.
1814 * common.opt: Add -Winfinite-recursion.
1815 * doc/invoke.texi (-Winfinite-recursion): Document.
1816 * passes.def (pass_warn_recursion): Schedule a new pass.
1817 * tree-pass.h (make_pass_warn_recursion): Declare.
1818 * gimple-warn-recursion.c: New file.
1820 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
1822 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Change
1823 error messages for ENB_P8V and ENB_P9V.
1825 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
1827 * config/rs6000/rs6000-builtin-new.def: Add power6-64 stanza. Move
1828 CMPB to power6-64 stanza.
1829 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Handle
1831 (rs6000_new_builtin_is_supported): Likewise.
1832 (rs6000_expand_new_builtin): Likewise. Clean up formatting.
1833 (rs6000_init_builtins): Handle ENB_P6_64 case.
1834 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): Add BSTZ_P6_64.
1835 (stanza_map): Add entry mapping power6-64 to BSTZ_P6_64.
1836 (enable_string): Add "ENB_P6_64".
1837 (write_decls): Add ENB_P6_64 to bif_enable enum.
1839 2021-11-23 Jan Hubicka <jh@suse.cz>
1841 * ipa-modref-tree.h (struct modref_tree): Remove max_bases, max_refs
1843 (modref_tree::modref_tree): Remove parameter.
1844 (modref_tree::insert_base): Add max_bases parameter.
1845 (modref_tree::insert): Add max_bases, max_refs, max_accesses
1847 (modref_tree::insert): New member function.
1848 (modref_tree::merge): Add max_bases, max_refs, max_accesses
1850 (modref_tree::insert): New member function.
1851 * ipa-modref-tree.c (test_insert_search_collapse): Update.
1852 (test_merge): Update.
1853 * ipa-modref.c (dump_records): Don't dump max_refs and max_bases.
1854 (dump_lto_records): Likewise.
1855 (modref_summary::finalize): Fix whitespace.
1856 (get_modref_function_summary): Likewise.
1857 (modref_access_analysis::record_access): Update.
1858 (modref_access_analysis::record_access_lto): Update.
1859 (modref_access_analysis::process_fnspec): Update.
1860 (analyze_function): Update.
1861 (modref_summaries::duplicate): Update.
1862 (modref_summaries_lto::duplicate): Update.
1863 (write_modref_records): Update.
1864 (read_modref_records): Update.
1865 (read_section): Update.
1866 (propagate_unknown_call): Update.
1867 (modref_propagate_in_scc): Update.
1868 (ipa_merge_modref_summary_after_inlining): Update.
1870 2021-11-23 Martin Liska <mliska@suse.cz>
1872 * doc/invoke.texi: Remove 2 more duplicate param descriptions.
1874 2021-11-23 Richard Biener <rguenther@suse.de>
1876 PR tree-optimization/103361
1877 * gimple-loop-jam.c (adjust_unroll_factor): Use lambda_int
1878 for the dependence distance.
1879 * tree-data-ref.c (print_lambda_vector): Properly print a lambda_int.
1881 2021-11-23 Jakub Jelinek <jakub@redhat.com>
1883 * tree-inline.h (struct copy_body_data): Remove
1884 transform_lang_insert_block member.
1885 * tree-inline.c (remap_block): Don't call
1886 id->transform_lang_insert_block.
1887 (optimize_inline_calls, copy_gimple_seq_and_replace_locals,
1888 tree_function_versioning, maybe_inline_call_in_expr,
1889 copy_fn): Don't initialize id.transform_lang_insert_block.
1890 * gimplify.c (gimplify_omp_loop): Likewise.
1892 2021-11-23 Jan Hubicka <hubicka@ucw.cz>
1894 PR tree-optimization/103335
1895 * tree-ssa-dse.c (valid_ao_ref_for_dse): Rename to ...
1896 (valid_ao_ref_kill_for_dse): ... this; do not check that boundaries
1897 are divisible by BITS_PER_UNIT.
1898 (get_byte_aligned_range_containing_ref): New function.
1899 (get_byte_aligned_range_contained_in_ref): New function.
1900 (normalize_ref): Rename to ...
1901 (get_byte_range): ... this one; handle accesses not aligned to byte
1902 boundary; return range in bytes rather than updating ao_ref.
1903 (clear_live_bytes_for_ref): Take write ref by reference; simplify using
1905 (setup_live_bytes_from_ref): Likewise.
1906 (clear_bytes_written_by): Update.
1907 (live_bytes_read): Update.
1908 (dse_classify_store): Simplify tech before live_bytes_read checks.
1910 2021-11-23 Andrew Pinski <apinski@marvell.com>
1912 PR tree-optimization/102216
1913 * gimple-fold.c (fold_stmt_1): Add canonicalization
1914 of "&MEM[ssa_n, CST]" to "ssa_n p+ CST", note this
1915 can only be done if !in_place.
1917 2021-11-23 Jakub Jelinek <jakub@redhat.com>
1919 PR middle-end/102431
1920 * gimplify.c (replace_reduction_placeholders): Remove.
1921 (note_no_context_vars): New function.
1922 (gimplify_omp_loop): For OMP_PARALLEL's BIND_EXPR create a new
1923 BLOCK. Use copy_tree_body_r with walk_tree instead of unshare_expr
1924 and replace_reduction_placeholders for duplication of
1925 OMP_CLAUSE_REDUCTION_{INIT,MERGE} expressions. Ensure all mentioned
1926 automatic vars have DECL_CONTEXT set to non-NULL before doing so
1927 and reset it afterwards for those vars and their corresponding
1930 2021-11-23 Haochen Gui <guihaoc@gcc.gnu.org>
1933 * config/rs6000/altivec.md (altivec_vreve<mode>2 for VEC_K): Use
1934 xxbrq for v16qi, xxbrq + xxbrh for v8hi and xxbrq + xxbrw for v4si
1935 or v4sf when p9_vector is set.
1936 (altivec_vreve<mode>2 for VEC_64): Defined. Implemented by xxswapd.
1938 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
1940 PR tree-optimization/102232
1941 * match.pd (x * (1 + y / x) - y) -> (x - y % x): New optimization.
1943 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
1945 PR tree-optimization/96779
1946 * match.pd (-x == x) -> (x == 0): New optimization.
1948 2021-11-22 Roger Sayle <roger@nextmovesoftware.com>
1950 PR tree-optimization/98953
1951 PR tree-optimization/103345
1952 * gimple-ssa-store-merging.c (find_bswap_or_nop_1): Handle
1953 BIT_XOR_EXPR and PLUS_EXPR the same as BIT_IOR_EXPR.
1954 (pass_optimize_bswap::execute): Likewise.
1956 2021-11-22 Martin Liska <mliska@suse.cz>
1958 * doc/invoke.texi: Remove duplicate documentation for 3 params.
1960 2021-11-22 Richard Biener <rguenther@suse.de>
1962 PR tree-optimization/103351
1963 * tree-ssa-dce.c (sort_phi_args): Sort after e->dest_idx as
1966 2021-11-22 liuhongt <hongtao.liu@intel.com>
1969 * config/i386/constraints.md (Bk): New
1970 define_memory_constraint.
1971 * config/i386/i386-protos.h (ix86_gpr_tls_address_pattern_p):
1973 * config/i386/i386.c (ix86_gpr_tls_address_pattern_p): New
1975 * config/i386/i386.md (*movsi_internal): Don't allow
1976 mask/sse/mmx move in TLS code sequences.
1977 (*movdi_internal): Ditto.
1979 2021-11-22 Kewen Lin <linkw@linux.ibm.com>
1981 * config/xtensa/xtensa.md (movdi_internal, movdf_internal): Fix split
1984 2021-11-21 Jakub Jelinek <jakub@redhat.com>
1987 * config/i386/i386-options.c (ix86_valid_target_attribute_p): If
1988 fndecl already has DECL_FUNCTION_SPECIFIC_TARGET, use that as base
1989 instead of target_option_default_node.
1991 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
1994 * ipa-modref.c (parm_map_for_arg): Rename to ...
1995 (parm_map_for_ptr): ... this one; handle static chain and calls to
1997 (modref_access_analysis::get_access): Use parm_map_for_ptr.
1998 (modref_access_analysis::process_fnspec): Update.
1999 (modref_access_analysis::analyze_load): Update.
2000 (modref_access_analysis::analyze_store): Update.
2002 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
2004 * ipa-modref.c (ignore_nondeterminism_p): Move earlier in source
2006 (ignore_retval_p): Likewise.
2007 (ignore_stores_p): Likewise.
2008 (parm_map_for_arg): Likewise.
2009 (class modref_access_analysis): New class.
2010 (modref_access_analysis::set_side_effects): New member function.
2011 (modref_access_analysis::set_nondeterministic): New member function.
2012 (get_access): Turn to ...
2013 (modref_access_analysis::get_access): ... this one.
2014 (record_access): Turn to ...
2015 (modref_access_analysis::record_access): ... this one.
2016 (record_access_lto): Turn to ...
2017 (modref_access_analysis::record_access_lto): ... this one.
2018 (record_access_p): Turn to ...
2019 (modref_access_analysis::record_access_p): ... this one.
2020 (modref_access_analysis::record_unknown_load): New member function.
2021 (modref_access_analysis::record_unknown_store): New member function.
2022 (get_access_for_fnspec): Turn to ...
2023 (modref_access_analysis::get_access_for_fnspec): ... this one.
2024 (merge_call_side_effects): Turn to ...
2025 (modref_access_analysis::merge_call_side_effects): ... this one.
2026 (collapse_loads): Move later in source code.
2027 (collapse_stores): Move later in source code.
2028 (process_fnspec): Turn to ...
2029 (modref_access_analysis::process_fnspec): ... this one.
2030 (analyze_call): Turn to ...
2031 (modref_access_analysis::analyze_call): ... this one.
2032 (struct summary_ptrs): Remove.
2033 (analyze_load): Turn to ...
2034 (modref_access_analysis::analyze_load): ... this one.
2035 (analyze_store): Turn to ...
2036 (modref_access_analysis::analyze_store): ... this one.
2037 (analyze_stmt): Turn to ...
2038 (modref_access_analysis::analyze_stmt): ... this one.
2039 (remove_summary): Remove.
2040 (modref_access_analysis::propagate): Break out from ...
2041 (modref_access_analysis::analyze): Break out from ...
2042 (analyze_function): ... here.
2044 2021-11-21 Roger Sayle <roger@nextmovesoftware.com>
2045 Robin Dapp <rdapp@linux.ibm.com>
2048 * tree-ssa-math-opts.c (convert_mult_to_widen): Recognize
2049 signed WIDEN_MULT_EXPR if the target supports umul_widen_optab.
2051 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
2054 * ipa-modref.c (ignore_nondeterminism_p): Allow looping pure/const.
2055 (merge_call_side_effects): Improve debug output.
2057 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
2060 * ipa-pure-const.c (propagate_pure_const): Fix merging of looping flag.
2062 2021-11-20 Jeff Law <jeffreyalaw@gmail.com>
2064 PR tree-optimization/103226
2065 * config/bfin/bfin.md (doloop pattern, splitter and expander): Clobber
2068 2021-11-20 Andrew Pinski <apinski@marvell.com>
2070 PR tree-optimization/103220
2071 * match.pd ((type) X bitop CST): Don't check if CST
2072 fits into the type if only the sign changes.
2074 2021-11-20 Alexandre Oliva <oliva@adacore.com>
2076 PR tree-optimization/102988
2077 * gimple-harden-conditionals.cc (detach_value): Copy SSA_NAME
2078 without decl sharing.
2080 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
2082 * doc/invoke.texi: Remove whitespace after an @option.
2084 2021-11-19 Paul A. Clarke <pc@us.ibm.com>
2086 * config/rs6000/emmintrin.h (_mm_sad_epu8): Use vec_absd when
2087 _ARCH_PWR9, optimize vec_sum2s when LE.
2089 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
2092 * config/darwin-driver.c (darwin_driver_init): Handle exported
2093 symbols and symbol lists (suppress automatic export of the TLS
2095 * config/darwin.c (darwin_rename_builtins): Remove workaround.
2096 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): Likewise.
2097 (REAL_LIBGCC_SPEC): Handle revised library uses.
2098 * config/darwin.opt (nodefaultexport): New.
2099 * config/i386/darwin.h (PR80556_WORKAROUND): Remove.
2100 * config/i386/darwin32-biarch.h (PR80556_WORKAROUND): Likewise.
2101 * config/i386/darwin64-biarch.h (PR80556_WORKAROUND): Likewise.
2103 2021-11-19 Martin Jambor <mjambor@suse.cz>
2105 * opts.c (default_options_table): Switch off
2106 flag_semantic_interposition at Ofast.
2107 * doc/invoke.texi (Optimize Options): Document that Ofast switches off
2108 -fsemantic-interposition.
2110 2021-11-19 Jan Hubicka <hubicka@ucw.cz>
2112 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not guard modref
2113 by !gimple_call_chain.
2115 2021-11-19 Martin Sebor <msebor@redhat.com>
2119 * doc/invoke.texi (-Waddress): Update.
2121 2021-11-19 Andrew MacLeod <amacleod@redhat.com>
2123 PR tree-optimization/103254
2124 * gimple-range-gori.cc (range_def_chain::get_def_chain): Limit the
2125 depth for all statements with multiple ssa names.
2127 2021-11-19 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
2129 * config/s390/s390.md (define_peephole2): Variable insn points
2130 to the first matched insn. Use peep2_next_insn(1) to refer to
2131 the second matched insn.
2133 2021-11-19 Tamar Christina <tamar.christina@arm.com>
2135 PR tree-optimization/103311
2137 * tree-vect-slp-patterns.c (vect_validate_multiplication): Fix CONJ
2138 test to new codegen.
2139 (complex_mul_pattern::matches): Move check downwards.
2141 2021-11-19 Martin Liska <mliska@suse.cz>
2144 2021-11-19 Martin Liska <mliska@suse.cz>
2146 * cfgexpand.c (pass_expand::execute): Use option directly.
2147 * function.c (allocate_struct_function): Likewise.
2148 * gimple-low.c (lower_function_body): Likewise.
2149 (lower_stmt): Likewise.
2150 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
2151 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
2152 * ipa-split.c (split_function): Likewise.
2153 * lto-streamer-in.c (input_function): Likewise.
2154 * sese.c (sese_insert_phis_for_liveouts): Likewise.
2155 * ssa-iterators.h (num_imm_uses): Likewise.
2156 * tree-cfg.c (make_blocks): Likewise.
2157 (gimple_merge_blocks): Likewise.
2158 * tree-inline.c (tree_function_versioning): Likewise.
2159 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
2160 * tree-sra.c (analyze_access_subtree): Likewise.
2161 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
2162 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
2163 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
2164 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
2165 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
2166 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
2167 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
2168 (target_for_debug_bind): Likewise.
2169 (insert_debug_temp_for_var_def): Likewise.
2170 (insert_debug_temps_for_defs): Likewise.
2171 (reset_debug_uses): Likewise.
2172 * tree-ssanames.c (release_ssa_name_fn): Likewise.
2173 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
2174 (adjust_debug_stmts): Likewise.
2175 (adjust_phi_and_debug_stmts): Likewise.
2176 (vect_do_peeling): Likewise.
2177 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
2178 (vect_transform_loop): Likewise.
2179 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove.
2180 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
2181 (MAY_HAVE_DEBUG_STMTS): Use options directly.
2183 2021-11-19 Giuliano Belinassi <gbelinassi@suse.de>
2185 * gcc.c (process_command): Skip dumpdir override if file is a
2187 * doc/invoke.texi: Update -dumpdir documentation.
2189 2021-11-19 Andrew Pinski <apinski@marvell.com>
2191 PR tree-optimization/103314
2192 * match.pd ((type) X op CST): Restrict the equal
2193 TYPE_PRECISION case to GIMPLE only.
2195 2021-11-19 Martin Liska <mliska@suse.cz>
2198 * ipa-modref-tree.h (struct modref_parm_map): Add default
2200 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Use it.
2202 2021-11-19 Richard Biener <rguenther@suse.de>
2204 PR middle-end/103248
2205 * tree-eh.c (operation_could_trap_helper_p): Properly handle
2206 fixed-point RDIV_EXPR.
2208 2021-11-19 Richard Biener <rguenther@suse.de>
2210 PR tree-optimization/102436
2211 * tree-ssa-loop-im.c (execute_sm_if_changed): Add mode
2212 to just create the if structure and return the then block.
2213 (execute_sm): Add flag to indicate the var will re-use
2215 (hoist_memory_references): Support a single conditional
2216 block with all stores as special case.
2218 2021-11-19 Andrew Pinski <apinski@marvell.com>
2220 PR tree-optimization/103317
2221 * tree-ssa-phiopt.c (minmax_replacement): For the non empty
2222 middle bb case, check to make sure it has a single predecessor.
2224 2021-11-19 Andrew Pinski <apinski@marvell.com>
2226 PR tree-optimization/103257
2228 ((m1 >/</>=/<= m2) * d -> (m1 >/</>=/<= m2) ? d : 0):
2229 Disable until !canonicalize_math_p ().
2231 2021-11-19 Marek Polacek <polacek@redhat.com>
2235 * doc/invoke.texi: Update documentation for -Wuninitialized.
2236 * tree.c (stabilize_reference): Set location.
2238 2021-11-19 liuhongt <hongtao.liu@intel.com>
2241 * config/i386/x86-tune-costs.h (skylake_cost): Reduce cost of
2242 storing 256/512-bit SSE register to be equal to cost of
2243 unaligned store to avoid odd alignment peeling.
2244 (icelake_cost): Ditto.
2246 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
2248 * config/rs6000/predicates.md (current_file_function_operand):
2249 Add flag_semantic_interposition to call of decl_replaceable_p.
2251 2021-11-18 Martin Liska <mliska@suse.cz>
2253 * ipa-modref.c (analyze_function): Do not execute the code
2254 only if dump_file != NULL.
2256 2021-11-18 Martin Liska <mliska@suse.cz>
2258 * ipa-modref.c (analyze_function): Use fnode instead of repeated
2259 cgraph_node::get (current_function_decl).
2261 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
2263 * cgraph.c (cgraph_node::get_availability): Update call of
2265 (cgraph_node::verify_node): Verify that semantic_interposition flag
2267 * cgraph.h (symtab_node): Add semantic_interposition flag.
2268 * cgraphclones.c (set_new_clone_decl_and_node_flags): Clear
2269 semantic_interposition flag.
2270 * cgraphunit.c (cgraph_node::finalize_function): Set
2271 semantic_interposition flag.
2272 (cgraph_node::add_new_function): Likewise.
2273 (varpool_node::finalize_decl): Likewise.
2274 (cgraph_node::create_wrapper): Likewise.
2275 * common.opt (fsemantic-interposition): Turn to optimization node.
2276 * lto-cgraph.c (lto_output_node): Stream semantic_interposition.
2277 (lto_output_varpool_node): Likewise.
2278 (input_overwrite_node): Likewise.
2279 (input_varpool_node): Likewise.
2280 * symtab.c (symtab_node::dump_base): Dump new flag.
2281 * varasm.c (decl_replaceable_p): Add semantic_interposition_p
2283 * varasm.h (decl_replaceable_p): Update declaration.
2284 * varpool.c (varpool_node::ctor_useable_for_folding_p):
2285 Use semantic_interposition flag.
2286 (varpool_node::get_availability): Likewise.
2287 (varpool_node::create_alias): Copy semantic_interposition flag.
2289 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
2292 * ipa-modref.c (modref_eaf_analysis::merge_call_lhs_flags): Unused
2293 parameter may still be returned.
2294 (modref_eaf_analysis::analyze_ssa_name): Call merge_call_lhs_flags
2295 even for unused function args.
2297 2021-11-18 Tamar Christina <tamar.christina@arm.com>
2299 PR tree-optimization/103311
2300 * tree-vect-slp-patterns.c (complex_mul_pattern::matches,
2301 complex_fms_pattern::matches): Check for multiplications.
2303 2021-11-18 H.J. Lu <hjl.tools@gmail.com>
2306 * config/i386/i386.c (ix86_output_jmp_thunk_or_indirect): Emit
2307 CS prefix for -mindirect-branch-cs-prefix.
2308 (ix86_output_indirect_branch_via_reg): Likewise.
2309 * config/i386/i386.opt: Add -mindirect-branch-cs-prefix.
2310 * doc/invoke.texi: Document -mindirect-branch-cs-prefix.
2312 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
2314 * config/rs6000/driver-rs6000.c (detect_processor_aix): Add
2317 2021-11-18 Tamar Christina <tamar.christina@arm.com>
2319 PR tree-optimization/103253
2320 * tree-ssa-math-opts.c (convert_mult_to_fma): Check for LHS.
2322 2021-11-18 Matthias Kretz <m.kretz@gsi.de>
2324 * doc/extend.texi: Document __builtin_assoc_barrier.
2326 2021-11-18 Martin Liska <mliska@suse.cz>
2328 * cfgexpand.c (pass_expand::execute): Use option directly.
2329 * function.c (allocate_struct_function): Likewise.
2330 * gimple-low.c (lower_function_body): Likewise.
2331 (lower_stmt): Likewise.
2332 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
2333 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
2334 * ipa-split.c (split_function): Likewise.
2335 * lto-streamer-in.c (input_function): Likewise.
2336 * sese.c (sese_insert_phis_for_liveouts): Likewise.
2337 * ssa-iterators.h (num_imm_uses): Likewise.
2338 * tree-cfg.c (make_blocks): Likewise.
2339 (gimple_merge_blocks): Likewise.
2340 * tree-inline.c (tree_function_versioning): Likewise.
2341 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
2342 * tree-sra.c (analyze_access_subtree): Likewise.
2343 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
2344 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
2345 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
2346 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
2347 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
2348 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
2349 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
2350 (target_for_debug_bind): Likewise.
2351 (insert_debug_temp_for_var_def): Likewise.
2352 (insert_debug_temps_for_defs): Likewise.
2353 (reset_debug_uses): Likewise.
2354 * tree-ssanames.c (release_ssa_name_fn): Likewise.
2355 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
2356 (adjust_debug_stmts): Likewise.
2357 (adjust_phi_and_debug_stmts): Likewise.
2358 (vect_do_peeling): Likewise.
2359 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
2360 (vect_transform_loop): Likewise.
2361 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove.
2362 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
2363 (MAY_HAVE_DEBUG_STMTS): Use options directly.
2365 2021-11-18 Richard Biener <rguenther@suse.de>
2367 PR tree-optimization/103277
2368 * tree-ssa-dse.c (need_ab_cleanup): New.
2369 (dse_optimize_redundant_stores): Adjust.
2370 (delete_dead_or_redundant_assignment): Get extra
2371 need_ab_cleanup argument and set when abnormal cleanup is
2373 (dse_optimize_call): Adjust.
2374 (dse_optimize_stmt): Likewise.
2375 (pass_dse::execute): Allocate and deallocate need_ab_cleanup.
2376 Perform abnormal cleanup.
2377 * tree-ssa-dse.h (delete_dead_or_redundant_assignment): Adjust.
2379 2021-11-18 Hongyu Wang <hongyu.wang@intel.com>
2381 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
2382 Adjust generated cfg to avoid infinite loop.
2384 2021-11-18 konglin1 <lingling.kong@intel.com>
2386 * config/i386/avx512fp16intrin.h (_mm512_mul_pch): Add alias for _mm512_fmul_pch.
2387 (_mm512_mask_mul_pch): Likewise.
2388 (_mm512_maskz_mul_pch): Likewise.
2389 (_mm512_mul_round_pch): Likewise.
2390 (_mm512_mask_mul_round_pch): Likewise.
2391 (_mm512_maskz_mul_round_pch): Likewise.
2392 (_mm512_cmul_pch): Likewise.
2393 (_mm512_mask_cmul_pch): Likewise.
2394 (_mm512_maskz_cmul_pch): Likewise.
2395 (_mm512_cmul_round_pch): Likewise.
2396 (_mm512_mask_cmul_round_pch): Likewise.
2397 (_mm512_maskz_cmul_round_pch): Likewise.
2398 (_mm_mul_sch): Likewise.
2399 (_mm_mask_mul_sch): Likewise.
2400 (_mm_maskz_mul_sch): Likewise.
2401 (_mm_mul_round_sch): Likewise.
2402 (_mm_mask_mul_round_sch): Likewise.
2403 (_mm_maskz_mul_round_sch): Likewise.
2404 (_mm_cmul_sch): Likewise.
2405 (_mm_mask_cmul_sch): Likewise.
2406 (_mm_maskz_cmul_sch): Likewise.
2407 (_mm_cmul_round_sch): Likewise.
2408 (_mm_mask_cmul_round_sch): Likewise.
2409 (_mm_maskz_cmul_round_sch): Likewise.
2410 * config/i386/avx512fp16vlintrin.h (_mm_mul_pch): Likewise.
2411 (_mm_mask_mul_pch): Likewise.
2412 (_mm_maskz_mul_pch): Likewise.
2413 (_mm256_mul_pch): Likewise.
2414 (_mm256_mask_mul_pch): Likewise.
2415 (_mm256_maskz_mul_pch): Likewise.
2416 (_mm_cmul_pch): Likewise.
2417 (_mm_mask_cmul_pch): Likewise.
2418 (_mm_maskz_cmul_pch): Likewise.
2419 (_mm256_cmul_pch): Likewise.
2420 (_mm256_mask_cmul_pch): Likewise.
2421 (_mm256_maskz_cmul_pch): Likewise.
2423 2021-11-17 Andrew Pinski <apinski@marvell.com>
2425 PR tree-optimization/103228
2426 PR tree-optimization/55177
2427 * match.pd ((type) X bitop CST): Also do this
2428 transformation for nop conversions.
2430 2021-11-17 Martin Sebor <msebor@redhat.com>
2432 PR tree-optimization/102759
2433 * gimple-array-bounds.cc (build_printable_array_type): Move...
2434 * gimple-ssa-warn-access.cc (build_printable_array_type): Avoid
2435 pathological function redeclarations that remove a previously
2437 Improve formatting of function arguments in informational notes.
2438 * pointer-query.cc (build_printable_array_type): ...to here.
2439 * pointer-query.h (build_printable_array_type): Declared.
2441 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
2444 * config/i386/i386-opts.h (harden_sls): New enum.
2445 * config/i386/i386.c (output_indirect_thunk): Mitigate against
2446 SLS for function return.
2447 (ix86_output_function_return): Likewise.
2448 (ix86_output_jmp_thunk_or_indirect): Mitigate against indirect
2450 (ix86_output_indirect_jmp): Likewise.
2451 (ix86_output_call_insn): Likewise.
2452 * config/i386/i386.opt: Add -mharden-sls=.
2453 * doc/invoke.texi: Document -mharden-sls=.
2455 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
2458 * config/i386/i386.c (ix86_code_end): Remove "%!" before ret.
2459 (ix86_output_function_return): Likewise.
2460 * config/i386/i386.md (simple_return_pop_internal): Likewise.
2462 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
2465 * ipa-modref.c (read_modref_records): Fix streaming in of every_access
2468 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
2470 * config/i386/i386.c (indirect_thunks_used): Redefine as HARD_REG_SET.
2471 (ix86_code_end): Use TEST_HARD_REG_BIT on indirect_thunks_used.
2472 (ix86_output_indirect_branch_via_reg): Use SET_HARD_REG_BIT
2473 on indirect_thunks_used.
2474 (ix86_output_indirect_function_return): Ditto.
2476 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
2478 * ipa-modref-tree.c: Include cgraph.h and tree-streamer.h.
2479 (modref_access_node::stream_out): New member function.
2480 (modref_access_node::stream_in): New member function.
2481 * ipa-modref-tree.h (modref_access_node::stream_out,
2482 modref_access_node::stream_in): Declare.
2483 * ipa-modref.c (modref_summary_lto::useful_p): Free useless kills.
2484 (modref_summary_lto::dump): Dump kills.
2485 (analyze_store): Record kills for LTO.
2486 (analyze_stmt): Likewise.
2487 (modref_summaries_lto::duplicate): Duplicate kills.
2488 (write_modref_records): Use new stream_out member function.
2489 (read_modref_records): Likewise.
2490 (modref_write): Stream out kills.
2491 (read_section): Stream in kills.
2492 (remap_kills): New function.
2493 (update_signature): Use it.
2495 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
2497 * config/i386/i386.h (LEGACY_SSE_REGNO_P): New predicate.
2498 (SSE_REGNO_P): Use LEGACY_SSE_REGNO_P predicate.
2499 * config/i386/i386.c (zero_all_vector_registers):
2500 Use LEGACY_SSE_REGNO_P predicate.
2501 (ix86_register_priority): Use REX_INT_REGNO_P, REX_SSE_REGNO_P
2502 and EXT_REG_SSE_REGNO_P predicates.
2503 (ix86_hard_regno_call_part_clobbered): Use REX_SSE_REGNO_P
2504 and LEGACY_SSE_REGNO_P predicates.
2506 2021-11-17 Jason Merrill <jason@redhat.com>
2508 * doc/invoke.texi (C++ Dialect Options): Document
2509 -fimplicit-constexpr.
2511 2021-11-17 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2513 * config/aarch64/aarch64-modes.def (VECTOR_MODE): New V8DI mode.
2514 * config/aarch64/aarch64.c (aarch64_hard_regno_mode_ok): Handle
2516 * config/aarch64/iterators.md (define_mode_attr nunits): Add entry
2519 2021-11-17 Martin Uecker <uecker@gcc.gnu.org>
2523 * gimplify.c (gimplify_var_or_parm_decl): Update comment.
2524 (gimplify_compound_lval): Gimplify base expression first.
2525 (gimplify_target_expr): Add comment.
2527 2021-11-17 Jakub Jelinek <jakub@redhat.com>
2529 PR tree-optimization/103192
2530 * tree-ssa-loop-im.c (move_computations_worker): Use
2531 reset_flow_sensitive_info instead of manually clearing
2532 SSA_NAME_RANGE_INFO and do it for all SSA_NAMEs, not just ones
2533 with integral types.
2535 2021-11-17 Jakub Jelinek <jakub@redhat.com>
2537 PR tree-optimization/103255
2538 * gimple-range-fold.cc (fold_using_range::range_of_address): Return
2539 range_nonzero rather than unadjusted base's range. Formatting fixes.
2541 2021-11-17 Richard Sandiford <richard.sandiford@arm.com>
2543 * doc/md.texi (cond_fmin@var{mode}, cond_fmax@var{mode}): Document.
2544 * optabs.def (cond_fmin_optab, cond_fmax_optab): New optabs.
2545 * internal-fn.def (COND_FMIN, COND_FMAX): New functions.
2546 * internal-fn.c (first_commutative_argument): Handle them.
2547 (FOR_EACH_COND_FN_PAIR): Likewise.
2548 * match.pd (UNCOND_BINARY, COND_BINARY): Likewise.
2549 * config/aarch64/aarch64-sve.md (cond_<fmaxmin><mode>): New
2552 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
2554 * config/i386/i386.md (*add<dwi>3_doubleword, *addv<dwi>4_doubleword,
2555 *addv<dwi>4_doubleword_1, *sub<dwi>3_doubleword,
2556 *subv<dwi>4_doubleword, *subv<dwi>4_doubleword_1,
2557 *add<dwi>3_doubleword_cc_overflow_1, *divmodsi4_const,
2558 *neg<dwi>2_doubleword, *tls_dynamic_gnu2_combine_64_<mode>): Fix split
2561 2021-11-17 Andrew Pinski <apinski@marvell.com>
2563 PR tree-optimization/103288
2564 * tree-ssa-phiopt.c (value_replacement): Return early if middle
2565 block has more than one pred.
2567 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
2569 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *adddi3_insn,
2570 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
2571 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
2572 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
2573 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
2574 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
2575 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
2576 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
2578 2021-11-17 Marek Polacek <polacek@redhat.com>
2580 PR preprocessor/103026
2581 * doc/invoke.texi: Document -Wbidi-chars.
2583 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
2586 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Fix clearing
2589 2021-11-16 Jan Hubicka <jh@suse.cz>
2591 * ipa-modref.c (get_modref_function_summary): Declare.
2592 * ipa-modref.h (get_modref_function_summary): New function.
2593 * tree-ssa-dse.c (clear_live_bytes_for_ref): Break out from ...
2594 (clear_bytes_written_by): ... here; also clear memory killed by
2597 2021-11-16 Iain Sandoe <iain@sandoe.co.uk>
2599 * ggc-common.c (gt_pch_save): If we cannot find a suitable
2600 memory segment for save, then error-out, do not try to
2602 (gt_pch_restore): Save the existing line table, and when
2603 the replacement is being read, use that when constructing
2606 2021-11-16 Peter Bergner <bergner@linux.ibm.com>
2609 * config/rs6000/mma.md (*vsx_assemble_pair): Add early-clobber for
2611 (*mma_assemble_acc): Likewise.
2613 2021-11-16 Martin Sebor <msebor@redhat.com>
2615 PR tree-optimization/102960
2616 * gimple-fold.c (get_range_strlen): Take bitmap as an argument rather
2617 than a pointer to it.
2618 (get_range_strlen_tree): Same. Remove bitmap allocation. Use
2620 (get_maxval_strlen): Use an auto_bitmap.
2621 * tree-ssa-strlen.c (get_range_strlen_dynamic): Factor out PHI
2623 (get_range_strlen_phi): ...into this function.
2624 Avoid assuming maximum string length is constant.
2625 (printf_strlen_execute): Dump pointer query cache contents when
2626 details are requested.
2628 2021-11-16 Jason Merrill <jason@redhat.com>
2630 * langhooks.h (struct lang_hooks): Adjust comment.
2631 * print-tree.c (print_node): Also call print_xnode hook for
2634 2021-11-16 Andrew Pinski <apinski@marvell.com>
2636 PR tree-optimization/103218
2637 * match.pd: New pattern for "((type)(a<0)) << SIGNBITOFA".
2639 2021-11-16 Claudiu Zissulescu <claziss@synopsys.com>
2641 * config/arc/arc.md (maddhisi4): Use a single move to accumulator.
2642 (umaddhisi4): Likewise.
2643 (machi): Update pattern.
2646 2021-11-16 Richard Biener <rguenther@suse.de>
2648 PR tree-optimization/102880
2649 * tree-ssa-dce.c (sort_phi_args): New function.
2650 (make_forwarders_with_degenerate_phis): Likewise.
2651 (perform_tree_ssa_dce): Call
2652 make_forwarders_with_degenerate_phis.
2654 2021-11-16 Richard Biener <rguenther@suse.de>
2656 PR tree-optimization/102880
2657 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Push
2658 single_pred (bb1) condition to places that really need it.
2659 (match_simplify_replacement): Likewise.
2660 (value_replacement): Likewise.
2661 (replace_phi_edge_with_variable): Deal with extra edges
2664 2021-11-16 Martin Jambor <mjambor@suse.cz>
2666 * cfgexpand.c (expand_gimple_basic_block): Use build_debug_expr_decl,
2667 add a fixme note about the mode assignment perhaps being unnecessary.
2668 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
2670 (ipa_param_body_adjustments::mark_dead_statements): Likewise.
2671 (ipa_param_body_adjustments::reset_debug_stmts): Likewise.
2672 * tree-inline.c (remap_ssa_name): Likewise.
2673 (tree_function_versioning): Likewise.
2674 * tree-into-ssa.c (rewrite_debug_stmt_uses): Likewise.
2675 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
2676 * tree-ssa.c (insert_debug_temp_for_var_def): Likewise.
2678 2021-11-16 Jakub Jelinek <jakub@redhat.com>
2680 PR tree-optimization/103208
2681 * omp-expand.c (expand_omp_build_cond): New function.
2682 (expand_omp_for_init_counts, expand_omp_for_init_vars,
2683 expand_omp_for_static_nochunk, expand_omp_for_static_chunk): Use it.
2685 2021-11-16 Jakub Jelinek <jakub@redhat.com>
2687 PR tree-optimization/102009
2688 * gimple-ssa-warn-access.cc (pass_waccess::check_alloc_size_call):
2689 Punt if any of alloc_size arguments is out of bounds vs. number of
2692 2021-11-16 Roger Sayle <roger@nextmovesoftware.com>
2694 * config/i386/i386.md (*bmi2_rorx<mode>3_1): Make conditional
2695 on !optimize_function_for_size_p.
2696 (*<any_rotate><mode>3_1): Add preferred_for_size attribute.
2697 (define_splits): Conditionalize on !optimize_function_for_size_p.
2698 (*bmi2_rorxsi3_1_zext): Likewise.
2699 (*<any_rotate>si2_1_zext): Add preferred_for_size attribute.
2700 (define_splits): Conditionalize on !optimize_function_for_size_p.
2702 2021-11-16 Jan Hubicka <jh@suse.cz>
2705 * ipa-modref.c (merge_call_side_effects): Fix uninitialized
2708 2021-11-16 Andrew Pinski <apinski@marvell.com>
2710 PR tree-optimization/103245
2711 * match.pd: Combine the abs pattern matching using multiplication.
2712 Adding optional nop_convert too.
2714 2021-11-16 H.J. Lu <hjl.tools@gmail.com>
2716 PR middle-end/103268
2717 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Add a missing
2720 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2722 * gimple-fold.c (gimple_fold_builtin_strncat): Use ranges to
2723 determine if it is safe to transform to strcat.
2724 (gimple_fold_builtin_snprintf): Likewise.
2726 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2728 * gimple-fold.c (known_lower): New function.
2729 (gimple_fold_builtin_strncat_chk,
2730 gimple_fold_builtin_memory_chk, gimple_fold_builtin_stxcpy_chk,
2731 gimple_fold_builtin_stxncpy_chk,
2732 gimple_fold_builtin_snprintf_chk,
2733 gimple_fold_builtin_sprintf_chk): Use it.
2735 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2737 * gimple-fold.c (dump_transformation): New function.
2738 (gimple_fold_builtin_stxcpy_chk,
2739 gimple_fold_builtin_stxncpy_chk): Use it. Simplify to
2740 BUILT_IN_STRNCPY if return value is not used.
2742 2021-11-15 H.J. Lu <hjl.tools@gmail.com>
2744 PR middle-end/103184
2745 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Check optab
2746 before transforming equivalent, but slightly different cases to
2747 their canonical forms.
2749 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
2752 * config/darwin.h (TARGET_DTORS_FROM_CXA_ATEXIT): New.
2753 * doc/tm.texi: Regenerated.
2754 * doc/tm.texi.in: Add TARGET_DTORS_FROM_CXA_ATEXIT hook.
2755 * ipa.c (cgraph_build_static_cdtor_1): Return the built
2757 (build_cxa_atexit_decl): New.
2758 (build_dso_handle_decl): New.
2759 (build_cxa_dtor_registrations): New.
2760 (compare_cdtor_tu_order): New.
2761 (build_cxa_atexit_fns): New.
2762 (ipa_cdtor_merge): If dtors_from_cxa_atexit is set,
2763 process the DTORs/CTORs accordingly.
2764 (pass_ipa_cdtor_merge::gate): Also run if
2765 dtors_from_cxa_atexit is set.
2766 * target.def (dtors_from_cxa_atexit): New hook.
2768 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
2770 * config.in: Regenerate.
2771 * configure: Regenerate.
2772 * configure.ac: Test ld64 for -platform-version support.
2774 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
2776 * diagnostic-spec.h (typedef xint_hash_t)
2777 (typedef xint_hash_map_t): Replace with...
2778 (typedef nowarn_map_t): ... this.
2779 (nowarn_map): Adjust.
2780 * diagnostic-spec.c (nowarn_map, suppress_warning_at): Likewise.
2782 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
2784 * profile.c (branch_prob): Use 'location_hash' for
2787 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2789 PR tree-optimization/103207
2790 * value-range.cc (irange::set): Drop overflow.
2792 2021-11-15 Jakub Jelinek <jakub@redhat.com>
2794 * gimplify.c (optimize_target_teams): Only add OMP_CLAUSE_THREAD_LIMIT
2795 to OMP_TARGET_CLAUSES if it isn't there already.
2797 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2799 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
2800 Remove useless code.
2801 (path_range_query::ssa_defined_in_bb): New.
2802 (path_range_query::ssa_range_in_phi): Avoid fold_range call that
2803 could trigger additional lookups.
2804 Do not use the cache for ARGs defined in this block.
2805 (path_range_query::compute_ranges_in_block): Use ssa_defined_in_bb.
2806 (path_range_query::maybe_register_phi_relation): Same.
2807 (path_range_query::range_of_stmt): Adjust comment.
2808 * gimple-range-path.h (ssa_defined_in_bb): New.
2810 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2812 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
2813 Default to global range if nothing found.
2815 2021-11-15 Richard Biener <rguenther@suse.de>
2817 PR tree-optimization/103237
2818 * tree-vect-loop.c (vect_is_simple_reduction): Fail for
2819 double reductions with multiple inner loop LC PHI nodes.
2821 2021-11-15 Hongyu Wang <hongyu.wang@intel.com>
2824 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
2825 New expand function.
2826 * config/i386/i386-options.c (ix86_target_string): Add
2827 -mrelax-cmpxchg-loop flag.
2828 (ix86_valid_target_attribute_inner_p): Likewise.
2829 * config/i386/i386-protos.h (ix86_expand_atomic_fetch_op_loop):
2830 New expand function prototype.
2831 * config/i386/i386.opt: Add -mrelax-cmpxchg-loop.
2832 * config/i386/sync.md (atomic_fetch_<logic><mode>): New expander
2834 (atomic_<logic>_fetch<mode>): Likewise.
2835 (atomic_fetch_nand<mode>): Likewise.
2836 (atomic_nand_fetch<mode>): Likewise.
2837 (atomic_fetch_<logic><mode>): New expander for DI,TI modes.
2838 (atomic_<logic>_fetch<mode>): Likewise.
2839 (atomic_fetch_nand<mode>): Likewise.
2840 (atomic_nand_fetch<mode>): Likewise.
2841 * doc/invoke.texi: Document -mrelax-cmpxchg-loop.
2843 2021-11-15 Richard Biener <rguenther@suse.de>
2845 PR tree-optimization/103219
2846 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Use single_exit
2847 to determine the exit for the VN region.
2849 2021-11-15 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
2851 * tree-ssa-loop.c (pass_vectorize): Move to tree-vectorizer.c.
2852 (pass_data_vectorize): Likewise.
2853 (make_pass_vectorize): Likewise.
2854 * tree-vectorizer.c (vectorize_loops): Merge with
2855 pass_vectorize::execute and replace cfun occurrences with fun param.
2856 (adjust_simduid_builtins): Add fun param, replace cfun occurrences with
2857 fun, and adjust callers appropriately.
2858 (note_simd_array_uses): Likewise.
2859 (vect_loop_dist_alias_call): Likewise.
2860 (set_uid_loop_bbs): Likewise.
2861 (vect_transform_loops): Likewise.
2862 (try_vectorize_loop_1): Likewise.
2863 (try_vectorize_loop): Likewise.
2865 2021-11-15 Jakub Jelinek <jakub@redhat.com>
2868 * config/i386/sync.md (atomic_bit_test_and_set<mode>,
2869 atomic_bit_test_and_complement<mode>,
2870 atomic_bit_test_and_reset<mode>): Use OPTAB_WIDEN instead of
2873 2021-11-15 Maciej W. Rozycki <macro@embecosm.com>
2875 * config/vax/vax.c (vax_lra_p): New prototype and function.
2876 (TARGET_LRA_P): Wire it.
2877 * config/vax/vax.opt (mlra): New option.
2878 * doc/invoke.texi (Option Summary, VAX Options): Document the
2881 2021-11-14 Jan Hubicka <jh@suse.cz>
2883 * ipa-modref.h (struct modref_summary): Add nondeterministic
2884 and calls_interposable flags.
2885 * ipa-modref.c (modref_summary::modref_summary): Initialize new flags.
2886 (modref_summary::useful_p): Check new flags.
2887 (struct modref_summary_lto): Add nondeterministic and
2888 calls_interposable flags.
2889 (modref_summary_lto::modref_summary_lto): Initialize new flags.
2890 (modref_summary_lto::useful_p): Check new flags.
2891 (modref_summary::dump): Dump new flags.
2892 (modref_summary_lto::dump): Dump new flags.
2893 (ignore_nondeterminism_p): New function.
2894 (merge_call_side_effects): Merge new flags.
2895 (process_fnspec): Likewise.
2896 (analyze_load): Volatile access is nondeterministic.
2897 (analyze_store): Likewise.
2898 (analyze_stmt): Volatile ASM is nondeterministic.
2899 (analyze_function): Clear new flags.
2900 (modref_summaries::duplicate): Duplicate new flags.
2901 (modref_summaries_lto::duplicate): Duplicate new flags.
2902 (modref_write): Stream new flags.
2903 (read_section): Stream new flags.
2904 (propagate_unknown_call): Update new flags.
2905 (modref_propagate_in_scc): Propagate new flags.
2906 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Check
2908 * tree-ssa-structalias.c (determine_global_memory_access):
2911 2021-11-14 Maciej W. Rozycki <macro@embecosm.com>
2913 * config/vax/vax.h (SET_RATIO): New macro.
2914 * config/vax/vax.md (UNSPEC_SETMEM_FILL): New constant.
2915 (setmemhi): New expander.
2916 (setmemhi1): New insn and splitter.
2917 (*setmemhi1): New insn.
2919 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
2921 * ipa-modref-tree.c (modref_access_node::update_for_kills): New
2923 (modref_access_node::merge_for_kills): Likewise.
2924 (modref_access_node::insert_kill): Likewise.
2925 * ipa-modref-tree.h (modref_access_node::update_for_kills,
2926 modref_access_node::merge_for_kills, modref_access_node::insert_kill):
2928 (modref_access_node::useful_for_kill): New member function.
2929 * ipa-modref.c (modref_summary::useful_p): Release useless kills.
2930 (lto_modref_summary): Add kills.
2931 (modref_summary::dump): Dump kills.
2932 (record_access): Add modref_access_node parameter.
2933 (record_access_lto): Likewise.
2934 (merge_call_side_effects): Merge kills.
2935 (analyze_call): Add ALWAYS_EXECUTED param and pass it around.
2936 (struct summary_ptrs): Add always_executed field.
2937 (analyze_load): Update.
2938 (analyze_store): Update; record kills.
2939 (analyze_stmt): Add always_executed; record kills in clobbers.
2940 (analyze_function): Track always_executed.
2941 (modref_summaries::duplicate): Duplicate kills.
2942 (update_signature): Release kills.
2943 * ipa-modref.h (struct modref_summary): Add kills.
2944 * tree-ssa-alias.c (alias_stats): Add kill stats.
2945 (dump_alias_stats): Dump kill stats.
2946 (store_kills_ref_p): Break out from ...
2947 (stmt_kills_ref_p): Use it; handle modref info based kills.
2949 2021-11-14 Aldy Hernandez <aldyh@redhat.com>
2951 PR tree-optimization/103229
2952 * gimple-range-cache.cc (ssa_global_cache::clear): Do not pass
2953 null value to memset.
2955 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
2957 * ipa-modref-tree.c (modref_access_node::get_call_arg): New member
2959 (modref_access_node::get_ao_ref): Likewise.
2960 * ipa-modref-tree.h (modref_access_node::get_call_arg): Declare.
2961 (modref_access_node::get_ao_ref): Declare.
2962 * tree-ssa-alias.c (modref_may_conflict): Use new accessors.
2963 * tree-ssa-dse.c (dse_optimize_call): Use new accessors.
2965 2021-11-13 Jan Hubicka <jh@suse.cz>
2968 * dbgcnt.def (ipa_attr): New counters.
2969 * ipa-pure-const.c: Include dbgcnt.h.
2970 (ipa_make_function_const): Use debug counter.
2971 (ipa_make_function_pure): Likewise.
2972 (propagate_pure_const): Fix bug in my previous change.
2974 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2976 * ipa-modref-tree.c (modref_access_node::range_info_useful_p):
2977 Offline from ipa-modref-tree.h.
2978 (modref_access_node::dump): Move from ipa-modref.c; make member
2980 * ipa-modref-tree.h (modref_access_node::range_info_useful_p,
2981 modref_access_node::dump): Declare.
2982 * ipa-modref.c (dump_access): Remove.
2983 (dump_records): Update.
2984 (dump_lto_records): Update.
2985 (record_access): Update.
2986 (record_access_lto): Update.
2988 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2990 * ipa-modref.c (modref_summary::modref_summary): Clear new flags.
2991 (modref_summary::dump): Dump try_dse.
2992 (modref_summary::finalize): Add FUN attribute; compute try-dse.
2993 (analyze_function): Update.
2994 (read_section): Update.
2995 (update_signature): Update.
2996 (pass_ipa_modref::execute): Update.
2997 * ipa-modref.h (struct modref_summary):
2998 * tree-ssa-alias.c (ao_ref_init_from_ptr_and_range): Export.
2999 * tree-ssa-alias.h (ao_ref_init_from_ptr_and_range): Declare.
3000 * tree-ssa-dse.c (dse_optimize_call): New function.
3001 (dse_optimize_stmt): Use it.
3003 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
3005 * ipa-modref-tree.c: Move #if CHECKING_P to proper place.
3007 2021-11-13 Jan Hubicka <jh@suse.cz>
3010 (struct modref_access_node): Move longer member functions to
3012 (modref_ref_node::try_merge_with): Turn into modref_access_node member
3014 * ipa-modref-tree.c (modref_access_node::contains): Move here
3015 from ipa-modref-tree.h.
3016 (modref_access_node::update): Likewise.
3017 (modref_access_node::merge): Likewise.
3018 (modref_access_node::closer_pair_p): Likewise.
3019 (modref_access_node::forced_merge): Likewise.
3020 (modref_access_node::update2): Likewise.
3021 (modref_access_node::combined_offsets): Likewise.
3022 (modref_access_node::try_merge_with): Likewise.
3023 (modref_access_node::insert): Likewise.
3025 2021-11-13 Jan Hubicka <jh@suse.cz>
3027 * ipa-modref.c (modref_summary::global_memory_read_p): Remove.
3028 (modref_summary::global_memory_written_p): Remove.
3029 (modref_summary::dump): Dump new flags.
3030 (modref_summary::finalize): New member function.
3031 (analyze_function): Call it.
3032 (read_section): Call it.
3033 (update_signature): Call it.
3034 (pass_ipa_modref::execute): Call it.
3035 * ipa-modref.h (struct modref_summary): Remove
3036 global_memory_read_p and global_memory_written_p.
3037 Add global_memory_read, global_memory_written.
3038 * tree-ssa-structalias.c (determine_global_memory_access):
3041 2021-11-13 Jan Hubicka <jh@suse.cz>
3043 * ipa-fnsummary.c (compute_fn_summary): Use type_attribute_allowed_p.
3044 * ipa-param-manipulation.c
3045 (ipa_param_adjustments::type_attribute_allowed_p):
3046 New member function.
3047 (drop_type_attribute_if_params_changed_p): New function.
3048 (build_adjusted_function_type): Use it.
3049 * ipa-param-manipulation.h: Add type_attribute_allowed_p.
3051 2021-11-13 David Malcolm <dmalcolm@redhat.com>
3053 * doc/invoke.texi (Static Analyzer Options): Add
3054 -Wno-analyzer-tainted-allocation-size,
3055 -Wno-analyzer-tainted-divisor, -Wno-analyzer-tainted-offset, and
3056 -Wno-analyzer-tainted-size to list. Add
3057 -Wanalyzer-tainted-allocation-size, -Wanalyzer-tainted-divisor,
3058 -Wanalyzer-tainted-offset, and -Wanalyzer-tainted-size to list
3059 of options effectively enabled by -fanalyzer.
3060 (-Wanalyzer-tainted-allocation-size): New.
3061 (-Wanalyzer-tainted-array-index): Tweak wording; add link to CWE.
3062 (-Wanalyzer-tainted-divisor): New.
3063 (-Wanalyzer-tainted-offset): New.
3064 (-Wanalyzer-tainted-size): New.
3066 2021-11-13 Jan Hubicka <jh@suse.cz>
3068 * attr-fnspec.h (attr_fnspec::arg_eaf_flags): Break out from ...
3069 * gimple.c (gimple_call_arg_flags): ... here.
3070 * ipa-modref.c (analyze_parms): Record flags known from fnspec.
3071 (modref_merge_call_site_flags): Use arg_eaf_flags.
3073 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
3075 PR tree-optimization/103222
3076 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
3078 (path_range_query::compute_ranges_in_block): Call
3079 compute_ranges_in_phis.
3080 * gimple-range-path.h (path_range_query::compute_ranges_in_phis):
3083 2021-11-13 Jan Hubicka <jh@suse.cz>
3085 * ipa-fnsummary.c (compute_fn_summary): Do not give up on signature
3086 changes on "fn spec" attribute; give up on variadic types.
3087 * ipa-param-manipulation.c: Include attribs.h.
3088 (build_adjusted_function_type): New parameter ARG_MODIFIED; if it is
3089 true remove "fn spec" attribute.
3090 (ipa_param_adjustments::build_new_function_type): Update.
3091 (ipa_param_body_adjustments::modify_formal_parameters): Update.
3092 * ipa-sra.c: Include attribs.h.
3093 (ipa_sra_preliminary_function_checks): Do not check for TYPE_ATTRIBUTES.
3095 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
3097 * gimple-range-path.cc (path_range_query::path_range_query): Merge
3099 (path_range_query::import_p): Move from header file.
3100 (path_range_query::~path_range_query): Adjust for combined ctors.
3101 * gimple-range-path.h: Merge ctors.
3102 (path_range_query::import_p): Move to .cc file.
3104 2021-11-13 Jan Hubicka <jh@suse.cz>
3106 PR tree-optimization/103182
3107 * ipa-modref.c (callee_to_caller_flags): Fix merging of flags.
3108 (modref_eaf_analysis::analyze_ssa_name): Fix merging of flags.
3110 2021-11-12 Stafford Horne <shorne@gmail.com>
3112 * config/or1k/or1k-protos.h (or1k_profile_hook): New function.
3113 * config/or1k/or1k.h (PROFILE_HOOK): Change macro to reference
3114 new function or1k_profile_hook.
3115 * config/or1k/or1k.c (struct machine_function): Add new field
3116 set_mcount_arg_insn.
3117 (or1k_profile_hook): New function.
3118 (or1k_init_pic_reg): Update to inject pic rtx after _mcount arg
3120 (or1k_frame_pointer_required): Frame pointer no longer needed
3123 2021-11-12 Jan Hubicka <jh@suse.cz>
3125 PR tree-optimization/103209
3126 * tree-ssa-structalias.c (find_func_aliases_for_call): Fix
3127 use of handle_rhs_call.
3129 2021-11-12 Aldy Hernandez <aldyh@redhat.com>
3131 PR tree-optimization/103202
3132 * gimple-range-path.cc
3133 (path_range_query::compute_ranges_in_block): Solve PHI imports first.
3135 2021-11-12 Jan Hubicka <jh@suse.cz>
3137 * ipa-pure-const.c (propagate_pure_const): Remove redundant check;
3138 fix call of ipa_make_function_const and ipa_make_function_pure.
3140 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3142 * config/aarch64/aarch64.c (aarch64_vector_op::n_advsimd_ops): Delete.
3143 (aarch64_vector_op::m_seen_loads): Likewise.
3144 (aarch64_vector_costs::aarch64_vector_costs): Don't push to
3146 (aarch64_vector_op::count_ops): Remove vectype and factor parameters.
3147 Remove code that tries to predict different vec_flags from the
3149 (aarch64_vector_costs::add_stmt_cost): Update accordingly.
3150 Remove m_advsimd_ops handling.
3152 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3154 * config/aarch64/aarch64.c (aarch64_vector_costs::m_saw_sve_only_op)
3155 (aarch64_sve_only_stmt_p): Delete.
3156 (aarch64_vector_costs::prefer_unrolled_loop): New function,
3157 extracted from adjust_body_cost.
3158 (aarch64_vector_costs::better_main_loop_than_p): New function,
3159 using heuristics extracted from adjust_body_cost and
3160 adjust_body_cost_sve.
3161 (aarch64_vector_costs::adjust_body_cost_sve): Remove
3162 advsimd_cycles_per_iter and could_use_advsimd parameters.
3163 Update after changes above.
3164 (aarch64_vector_costs::adjust_body_cost): Update after changes above.
3166 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3168 * config/aarch64/aarch64.c (aarch64_vec_op_count::m_vf_factor):
3169 New member variable.
3170 (aarch64_vec_op_count::aarch64_vec_op_count): Add a parameter for it.
3171 (aarch64_vec_op_count::vf_factor): New function.
3172 (aarch64_vector_costs::aarch64_vector_costs): When costing for
3173 neoverse-512tvb, pass a vf_factor of 2 for the Neoverse V1 version
3175 (aarch64_vector_costs::adjust_body_cost): Read the vf factor
3176 instead of hard-coding 2.
3178 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3180 * config/aarch64/aarch64.c
3181 (aarch64_vec_op_count::rename_cycles_per_iter): New function.
3182 (aarch64_vec_op_count::min_nonpred_cycles_per_iter): Likewise.
3183 (aarch64_vec_op_count::min_pred_cycles_per_iter): Likewise.
3184 (aarch64_vec_op_count::min_cycles_per_iter): Likewise.
3185 (aarch64_vec_op_count::dump): Move earlier in file. Dump the
3186 above properties too.
3187 (aarch64_estimate_min_cycles_per_iter): Delete.
3188 (adjust_body_cost): Use aarch64_vec_op_count::min_cycles_per_iter
3189 instead of aarch64_estimate_min_cycles_per_iter. Rely on the dump
3190 routine to print CPI estimates.
3191 (adjust_body_cost_sve): Likewise. Use the other functions above
3192 instead of doing the work inline.
3194 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3196 * config/aarch64/aarch64.c (aarch64_vec_op_count): Allow default
3198 (aarch64_vec_op_count::base_issue_info): Remove handling of null
3200 (aarch64_vec_op_count::simd_issue_info): Likewise.
3201 (aarch64_vec_op_count::sve_issue_info): Likewise.
3202 (aarch64_vector_costs::m_ops): Turn into a vector.
3203 (aarch64_vector_costs::m_advsimd_ops): Likewise.
3204 (aarch64_vector_costs::aarch64_vector_costs): Add entries to
3205 the vectors based on aarch64_tune_params.
3206 (aarch64_vector_costs::analyze_loop_vinfo): Update the pred_ops
3207 of all entries in m_ops.
3208 (aarch64_vector_costs::add_stmt_cost): Call count_ops for all
3210 (aarch64_estimate_min_cycles_per_iter): Remove issue_info
3211 parameter and get the information from the ops instead.
3212 (aarch64_vector_costs::adjust_body_cost_sve): Take a
3213 aarch64_vec_issue_info instead of an aarch64_vec_op_count.
3214 (aarch64_vector_costs::adjust_body_cost): Update call accordingly.
3215 Exit earlier if m_ops is empty for either cost structure.
3217 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3219 * config/aarch64/aarch64.c (aarch64_vector_costs::m_scalar_ops)
3220 (aarch64_vector_costs::m_sve_ops): Replace with...
3221 (aarch64_vector_costs::m_ops): ...this.
3222 (aarch64_vector_costs::analyze_loop_vinfo): Update accordingly.
3223 (aarch64_vector_costs::adjust_body_cost_sve): Likewise.
3224 (aarch64_vector_costs::aarch64_vector_costs): Likewise.
3225 Initialize m_vec_flags here rather than in add_stmt_cost.
3226 (aarch64_vector_costs::count_ops): Test for scalar reductions too.
3227 Allow vectype to be null.
3228 (aarch64_vector_costs::add_stmt_cost): Call count_ops for scalar
3229 code too. Don't require vectype to be nonnull.
3230 (aarch64_vector_costs::adjust_body_cost): Take the loop_vec_info
3231 and scalar costs as parameters. Use the scalar costs to determine
3232 the cycles per iteration of the scalar loop, then multiply it
3233 by the estimated VF.
3234 (aarch64_vector_costs::finish_cost): Update call accordingly.
3236 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3238 * config/aarch64/aarch64.c (aarch64_dr_type): New function.
3239 (aarch64_vector_costs::count_ops): Use it rather than the
3240 vectype to determine floatness.
3242 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3244 * config/aarch64/aarch64.c (aarch64_sve_in_loop_reduction_latency):
3245 Remove vectype parameter and get floatness from the type of the
3247 (aarch64_in_loop_reduction_latency): Likewise.
3248 (aarch64_detect_vector_stmt_subtype): Update caller.
3249 (aarch64_vector_costs::count_ops): Likewise.
3251 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3253 * config/aarch64/aarch64.c (aarch64_sve_op_count): Fold into...
3254 (aarch64_vec_op_count): ...this. Add a constructor.
3255 (aarch64_vec_op_count::vec_flags): New function.
3256 (aarch64_vec_op_count::base_issue_info): Likewise.
3257 (aarch64_vec_op_count::simd_issue_info): Likewise.
3258 (aarch64_vec_op_count::sve_issue_info): Likewise.
3259 (aarch64_vec_op_count::m_issue_info): New member variable.
3260 (aarch64_vec_op_count::m_vec_flags): Likewise.
3261 (aarch64_vector_costs): Add a constructor.
3262 (aarch64_vector_costs::m_sve_ops): Change type to aarch64_vec_op_count.
3263 (aarch64_vector_costs::aarch64_vector_costs): New function.
3264 Initialize m_scalar_ops, m_advsimd_ops and m_sve_ops.
3265 (aarch64_vector_costs::count_ops): Remove vec_flags and
3266 issue_info parameters, using the new aarch64_vec_op_count
3268 (aarch64_vector_costs::add_stmt_cost): Update call accordingly.
3269 (aarch64_sve_op_count::dump): Fold into...
3270 (aarch64_vec_op_count::dump): ...here.
3272 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3274 * config/aarch64/aarch64.c: Include tree-dfa.h.
3275 (aarch64_check_consecutive_mems): New function that takes MEM_EXPR
3276 and MEM_OFFSET into account.
3277 (aarch64_swap_ldrstr_operands): Use it.
3278 (aarch64_operands_ok_for_ldpstp): Likewise. Check that the
3279 address of the second memory doesn't depend on the result of
3282 2021-11-12 Jan Hubicka <hubicka@ucw.cz>
3284 * ipa-pure-const.c (ipa_make_function_pure): Fix exit condition.
3286 2021-11-12 Jan Hubicka <jh@suse.cz>
3288 PR tree-optimization/103175
3289 * ipa-modref.c (modref_lattice::merge): Add sanity check.
3290 (callee_to_caller_flags): Make flags adjustment sane.
3291 (modref_eaf_analysis::analyze_ssa_name): Likewise.
3293 2021-11-12 Jan Hubicka <jh@suse.cz>
3296 * ipa-modref.c (analyze_function, modref_propagate_in_scc): Do
3297 not mark pure/const function if there are side-effects.
3299 2021-11-12 Chung-Lin Tang <cltang@codesourcery.com>
3301 * tree.h (OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P): New access macro for
3302 'implicit' bit, using 'base.deprecated_flag' field of tree_node.
3303 * tree-pretty-print.c (dump_omp_clause): Add support for printing
3304 implicit attribute in tree dumping.
3305 * gimplify.c (gimplify_adjust_omp_clauses_1):
3306 Set OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P to 1 if map clause is implicitly
3308 (gimplify_adjust_omp_clauses): Adjust place of adding implicitly created
3309 clauses, from simple append, to starting of list, after non-map clauses.
3310 * omp-low.c (lower_omp_target): Add GOMP_MAP_IMPLICIT bits into kind
3311 values passed to libgomp for implicit maps.
3313 2021-11-12 Jakub Jelinek <jakub@redhat.com>
3315 * omp-builtins.def (BUILT_IN_GOMP_TEAMS): Remove.
3316 (BUILT_IN_GOMP_TEAMS4): New.
3317 * builtin-types.def (BT_FN_VOID_UINT_UINT): Remove.
3318 (BT_FN_BOOL_UINT_UINT_UINT_BOOL): New.
3319 * omp-low.c (lower_omp_teams): Use GOMP_teams4 instead of
3320 GOMP_teams, pass to it also num_teams lower-bound expression
3321 or a dup of upper-bound if it is missing and a flag whether
3322 it is the first call or not.
3324 2021-11-12 Martin Liska <mliska@suse.cz>
3326 PR tree-optimization/102497
3327 * gimple-predicate-analysis.cc (add_pred): Remove unused
3330 2021-11-12 Richard Biener <rguenther@suse.de>
3332 PR tree-optimization/103204
3333 * tree-ssa-sccvn.c (valueize_refs_1): Re-valueize the
3334 top operand after folding in an address.
3336 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3338 * gimple-range-path.cc (path_range_query::path_range_query): New
3339 ctor without a ranger.
3340 (path_range_query::~path_range_query): Free ranger if necessary.
3341 (path_range_query::range_on_path_entry): Adjust m_ranger for pointer.
3342 (path_range_query::ssa_range_in_phi): Same.
3343 (path_range_query::compute_ranges_in_block): Same.
3344 (path_range_query::compute_imports): Same.
3345 (path_range_query::compute_ranges): Same.
3346 (path_range_query::range_of_stmt): Same.
3347 (path_range_query::compute_outgoing_relations): Same.
3348 * gimple-range-path.h (class path_range_query): New ctor.
3349 * tree-ssa-loop-ch.c (ch_base::copy_headers): Remove gimple_ranger
3350 as path_range_query allocates one.
3351 * tree-ssa-threadbackward.c (class back_threader): Remove m_ranger.
3352 (back_threader::~back_threader): Same.
3354 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3356 * tree-ssa-threadbackward.c
3357 (back_threader_profitability::profitable_path_p): Remove loop
3358 crossing restriction.
3360 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3362 * ipa-modref.c (analyze_function): Do pure/const discovery, return
3364 (pass_modref::execute): If pure/const is discovered fixup cfg.
3365 (ignore_edge): Do not ignore pure/const edges.
3366 (modref_propagate_in_scc): Do pure/const discovery, return true if
3367 cdtor was promoted pure/const.
3368 (pass_ipa_modref::execute): If needed remove unreachable functions.
3369 * ipa-pure-const.c (warn_function_noreturn): Fix whitespace.
3370 (warn_function_cold): Likewise.
3371 (skip_function_for_local_pure_const): Move earlier.
3372 (ipa_make_function_const): Break out from ...
3373 (ipa_make_function_pure): Break out from ...
3374 (propagate_pure_const): ... here.
3375 (pass_local_pure_const::execute): Use it.
3376 * ipa-utils.h (ipa_make_function_const): Declare.
3377 (ipa_make_function_pure): Declare.
3378 * passes.def: Move early modref after pure-const.
3380 2021-11-11 David Malcolm <dmalcolm@redhat.com>
3383 * diagnostic-show-locus.c (def_policy): Use def_tabstop.
3385 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3387 * config/aarch64/aarch64-builtins.c (TYPES_COMBINE): Delete.
3388 (TYPES_COMBINEP): Delete.
3389 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3390 qualified builtins for vcombine_* intrinsics.
3391 * config/aarch64/arm_neon.h (vcombine_s8): Remove unnecessary cast.
3393 (vcombine_s16): Likewise.
3394 (vcombine_s32): Likewise.
3395 (vcombine_f32): Likewise.
3396 (vcombine_u8): Use type-qualified builtin and remove casts.
3397 (vcombine_u16): Likewise.
3398 (vcombine_u32): Likewise.
3399 (vcombine_u64): Likewise.
3400 (vcombine_p8): Likewise.
3401 (vcombine_p16): Likewise.
3402 (vcombine_p64): Likewise.
3403 (vcombine_bf16): Remove unnecessary cast.
3404 * config/aarch64/iterators.md (VD_I): New mode iterator.
3405 (VDC_P): New mode iterator.
3407 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3409 * config/aarch64/aarch64-builtins.c (TYPES_LOAD1_U): Define.
3410 (TYPES_LOAD1_P): Define.
3411 (TYPES_STORE1_U): Define.
3412 (TYPES_STORE1P): Rename to...
3413 (TYPES_STORE1_P): This.
3414 (get_mem_type_for_load_store): Add unsigned and poly types.
3415 (aarch64_general_gimple_fold_builtin): Add unsigned and poly
3416 type-qualified builtin declarations.
3417 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3418 qualified builtins for LD1/ST1.
3419 * config/aarch64/arm_neon.h (vld1_p8): Use type-qualified
3420 builtin and remove cast.
3421 (vld1_p16): Likewise.
3422 (vld1_u8): Likewise.
3423 (vld1_u16): Likewise.
3424 (vld1_u32): Likewise.
3425 (vld1q_p8): Likewise.
3426 (vld1q_p16): Likewise.
3427 (vld1q_p64): Likewise.
3428 (vld1q_u8): Likewise.
3429 (vld1q_u16): Likewise.
3430 (vld1q_u32): Likewise.
3431 (vld1q_u64): Likewise.
3432 (vst1_p8): Likewise.
3433 (vst1_p16): Likewise.
3434 (vst1_u8): Likewise.
3435 (vst1_u16): Likewise.
3436 (vst1_u32): Likewise.
3437 (vst1q_p8): Likewise.
3438 (vst1q_p16): Likewise.
3439 (vst1q_p64): Likewise.
3440 (vst1q_u8): Likewise.
3441 (vst1q_u16): Likewise.
3442 (vst1q_u32): Likewise.
3443 (vst1q_u64): Likewise.
3444 * config/aarch64/iterators.md (VALLP_NO_DI): New iterator.
3446 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3448 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3449 builtins for vector reduction.
3450 * config/aarch64/arm_neon.h (vaddv_u8): Use type-qualified
3451 builtin and remove casts.
3452 (vaddv_u16): Likewise.
3453 (vaddv_u32): Likewise.
3454 (vaddvq_u8): Likewise.
3455 (vaddvq_u16): Likewise.
3456 (vaddvq_u32): Likewise.
3457 (vaddvq_u64): Likewise.
3459 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3461 * config/aarch64/aarch64-simd-builtins.def:
3462 * config/aarch64/arm_neon.h (vpaddq_u8): Use type-qualified
3463 builtin and remove casts.
3464 (vpaddq_u16): Likewise.
3465 (vpaddq_u32): Likewise.
3466 (vpaddq_u64): Likewise.
3467 (vpadd_u8): Likewise.
3468 (vpadd_u16): Likewise.
3469 (vpadd_u32): Likewise.
3470 (vpaddd_u64): Likewise.
3472 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3474 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3475 builtins for [r]subhn[2].
3476 * config/aarch64/arm_neon.h (vsubhn_s16): Remove unnecessary cast.
3478 (vsubhn_s32): Likewise.
3479 (vsubhn_s64): Likewise.
3480 (vsubhn_u16): Use type-qualified builtin and remove casts.
3481 (vsubhn_u32): Likewise.
3482 (vsubhn_u64): Likewise.
3483 (vrsubhn_s16): Remove unnecessary cast.
3484 (vrsubhn_s32): Likewise.
3485 (vrsubhn_s64): Likewise.
3486 (vrsubhn_u16): Use type-qualified builtin and remove casts.
3487 (vrsubhn_u32): Likewise.
3488 (vrsubhn_u64): Likewise.
3489 (vrsubhn_high_s16): Remove unnecessary cast.
3490 (vrsubhn_high_s32): Likewise.
3491 (vrsubhn_high_s64): Likewise.
3492 (vrsubhn_high_u16): Use type-qualified builtin and remove casts.
3494 (vrsubhn_high_u32): Likewise.
3495 (vrsubhn_high_u64): Likewise.
3496 (vsubhn_high_s16): Remove unnecessary cast.
3497 (vsubhn_high_s32): Likewise.
3498 (vsubhn_high_s64): Likewise.
3499 (vsubhn_high_u16): Use type-qualified builtin and remove casts.
3501 (vsubhn_high_u32): Likewise.
3502 (vsubhn_high_u64): Likewise.
3504 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3506 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3507 builtins for [r]addhn[2].
3508 * config/aarch64/arm_neon.h (vaddhn_s16): Remove unnecessary cast.
3510 (vaddhn_s32): Likewise.
3511 (vaddhn_s64): Likewise.
3512 (vaddhn_u16): Use type-qualified builtin and remove casts.
3513 (vaddhn_u32): Likewise.
3514 (vaddhn_u64): Likewise.
3515 (vraddhn_s16): Remove unnecessary cast.
3516 (vraddhn_s32): Likewise.
3517 (vraddhn_s64): Likewise.
3518 (vraddhn_u16): Use type-qualified builtin and remove casts.
3519 (vraddhn_u32): Likewise.
3520 (vraddhn_u64): Likewise.
3521 (vaddhn_high_s16): Remove unnecessary cast.
3522 (vaddhn_high_s32): Likewise.
3523 (vaddhn_high_s64): Likewise.
3524 (vaddhn_high_u16): Use type-qualified builtin and remove casts.
3526 (vaddhn_high_u32): Likewise.
3527 (vaddhn_high_u64): Likewise.
3528 (vraddhn_high_s16): Remove unnecessary cast.
3529 (vraddhn_high_s32): Likewise.
3530 (vraddhn_high_s64): Likewise.
3531 (vraddhn_high_u16): Use type-qualified builtin and remove casts.
3533 (vraddhn_high_u32): Likewise.
3534 (vraddhn_high_u64): Likewise.
3536 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3538 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3539 qualifiers in generator macros for uhsub builtins.
3540 * config/aarch64/arm_neon.h (vhsub_s8): Remove unnecessary cast.
3542 (vhsub_s16): Likewise.
3543 (vhsub_s32): Likewise.
3544 (vhsub_u8): Use type-qualified builtin and remove casts.
3545 (vhsub_u16): Likewise.
3546 (vhsub_u32): Likewise.
3547 (vhsubq_s8): Remove unnecessary cast.
3548 (vhsubq_s16): Likewise.
3549 (vhsubq_s32): Likewise.
3550 (vhsubq_u8): Use type-qualified builtin and remove casts.
3551 (vhsubq_u16): Likewise.
3552 (vhsubq_u32): Likewise.
3554 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3556 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3557 qualifiers in generator macros for u[r]hadd builtins.
3558 * config/aarch64/arm_neon.h (vhadd_s8): Remove unnecessary cast.
3560 (vhadd_s16): Likewise.
3561 (vhadd_s32): Likewise.
3562 (vhadd_u8): Use type-qualified builtin and remove casts.
3563 (vhadd_u16): Likewise.
3564 (vhadd_u32): Likewise.
3565 (vhaddq_s8): Remove unnecessary cast.
3566 (vhaddq_s16): Likewise.
3567 (vhaddq_s32): Likewise.
3568 (vhaddq_u8): Use type-qualified builtin and remove casts.
3569 (vhaddq_u16): Likewise.
3570 (vhaddq_u32): Likewise.
3571 (vrhadd_s8): Remove unnecessary cast.
3572 (vrhadd_s16): Likewise.
3573 (vrhadd_s32): Likewise.
3574 (vrhadd_u8): Use type-qualified builtin and remove casts.
3575 (vrhadd_u16): Likewise.
3576 (vrhadd_u32): Likewise.
3577 (vrhaddq_s8): Remove unnecessary cast.
3578 (vrhaddq_s16): Likewise.
3579 (vrhaddq_s32): Likewise.
3580 (vrhaddq_u8): Use type-qualified builtin and remove casts.
3581 (vrhaddq_u16): Likewise.
3582 (vrhaddq_u32): Likewise.
3584 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3586 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3587 qualifiers in generator macros for usub[lw][2] builtins.
3588 * config/aarch64/arm_neon.h (vsubl_s8): Remove unnecessary cast.
3590 (vsubl_s16): Likewise.
3591 (vsubl_s32): Likewise.
3592 (vsubl_u8): Use type-qualified builtin and remove casts.
3593 (vsubl_u16): Likewise.
3594 (vsubl_u32): Likewise.
3595 (vsubl_high_s8): Remove unnecessary cast.
3596 (vsubl_high_s16): Likewise.
3597 (vsubl_high_s32): Likewise.
3598 (vsubl_high_u8): Use type-qualified builtin and remove casts.
3599 (vsubl_high_u16): Likewise.
3600 (vsubl_high_u32): Likewise.
3601 (vsubw_s8): Remove unnecessary casts.
3602 (vsubw_s16): Likewise.
3603 (vsubw_s32): Likewise.
3604 (vsubw_u8): Use type-qualified builtin and remove casts.
3605 (vsubw_u16): Likewise.
3606 (vsubw_u32): Likewise.
3607 (vsubw_high_s8): Remove unnecessary cast.
3608 (vsubw_high_s16): Likewise.
3609 (vsubw_high_s32): Likewise.
3610 (vsubw_high_u8): Use type-qualified builtin and remove casts.
3611 (vsubw_high_u16): Likewise.
3612 (vsubw_high_u32): Likewise.
3614 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3616 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3617 qualifiers in generator macros for uadd[lw][2] builtins.
3618 * config/aarch64/arm_neon.h (vaddl_s8): Remove unnecessary cast.
3620 (vaddl_s16): Likewise.
3621 (vaddl_s32): Likewise.
3622 (vaddl_u8): Use type-qualified builtin and remove casts.
3623 (vaddl_u16): Likewise.
3624 (vaddl_u32): Likewise.
3625 (vaddl_high_s8): Remove unnecessary cast.
3626 (vaddl_high_s16): Likewise.
3627 (vaddl_high_s32): Likewise.
3628 (vaddl_high_u8): Use type-qualified builtin and remove casts.
3629 (vaddl_high_u16): Likewise.
3630 (vaddl_high_u32): Likewise.
3631 (vaddw_s8): Remove unnecessary cast.
3632 (vaddw_s16): Likewise.
3633 (vaddw_s32): Likewise.
3634 (vaddw_u8): Use type-qualified builtin and remove casts.
3635 (vaddw_u16): Likewise.
3636 (vaddw_u32): Likewise.
3637 (vaddw_high_s8): Remove unnecessary cast.
3638 (vaddw_high_s16): Likewise.
3639 (vaddw_high_s32): Likewise.
3640 (vaddw_high_u8): Use type-qualified builtin and remove casts.
3641 (vaddw_high_u16): Likewise.
3642 (vaddw_high_u32): Likewise.
3644 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3646 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3647 qualified builtins for [R]SHRN[2].
3648 * config/aarch64/arm_neon.h (vshrn_n_u16): Use type-qualified
3649 builtin and remove casts.
3650 (vshrn_n_u32): Likewise.
3651 (vshrn_n_u64): Likewise.
3652 (vrshrn_high_n_u16): Likewise.
3653 (vrshrn_high_n_u32): Likewise.
3654 (vrshrn_high_n_u64): Likewise.
3655 (vrshrn_n_u16): Likewise.
3656 (vrshrn_n_u32): Likewise.
3657 (vrshrn_n_u64): Likewise.
3658 (vshrn_high_n_u16): Likewise.
3659 (vshrn_high_n_u32): Likewise.
3660 (vshrn_high_n_u64): Likewise.
3662 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3664 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3665 type-qualified builtins for XTN[2].
3666 * config/aarch64/arm_neon.h (vmovn_high_u16): Use type-
3667 qualified builtin and remove casts.
3668 (vmovn_high_u32): Likewise.
3669 (vmovn_high_u64): Likewise.
3670 (vmovn_u16): Likewise.
3671 (vmovn_u32): Likewise.
3672 (vmovn_u64): Likewise.
3674 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3676 * config/aarch64/aarch64-simd-builtins.def: Use poly type
3677 qualifier in builtin generator macros.
3678 * config/aarch64/arm_neon.h (vmul_p8): Use type-qualified
3679 builtin and remove casts.
3680 (vmulq_p8): Likewise.
3681 (vmull_high_p8): Likewise.
3682 (vmull_p8): Likewise.
3684 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3686 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3687 qualified builtin generators for unsigned MLA/MLS intrinsics.
3688 * config/aarch64/arm_neon.h (vmla_n_u16): Use type-qualified
3690 (vmla_n_u32): Likewise.
3691 (vmla_u8): Likewise.
3692 (vmla_u16): Likewise.
3693 (vmla_u32): Likewise.
3694 (vmlaq_n_u16): Likewise.
3695 (vmlaq_n_u32): Likewise.
3696 (vmlaq_u8): Likewise.
3697 (vmlaq_u16): Likewise.
3698 (vmlaq_u32): Likewise.
3699 (vmls_n_u16): Likewise.
3700 (vmls_n_u32): Likewise.
3701 (vmls_u8): Likewise.
3702 (vmls_u16): Likewise.
3703 (vmls_u32): Likewise.
3704 (vmlsq_n_u16): Likewise.
3705 (vmlsq_n_u32): Likewise.
3706 (vmlsq_u8): Likewise.
3707 (vmlsq_u16): Likewise.
3708 (vmlsq_u32): Likewise.
3710 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3712 * ipa-modref.c (modref_summary::useful_p): Check also for side-effects
3713 with looping const/pure.
3714 (modref_summary_lto::useful_p): Likewise.
3715 (merge_call_side_effects): Merge side effects before early exit
3717 (process_fnspec): Also handle pure functions.
3718 (analyze_call): Do not early exit on looping pure const.
3719 (propagate_unknown_call): Also handle nontrivial SCC as side-effect.
3720 (modref_propagate_in_scc): Update.
3722 2021-11-11 Richard Biener <rguenther@suse.de>
3724 PR tree-optimization/103190
3725 * tree-ssa-reassoc.c (insert_stmt_after): Only assert on asm goto.
3727 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3729 * gimple-range-path.cc (path_range_query::add_copies_to_imports): Rename to...
3731 (path_range_query::compute_imports): ...this. Adapt it so it can
3732 be passed the imports bitmap instead of working on m_imports.
3733 (path_range_query::compute_ranges): Call compute_imports in all
3734 cases unless an imports bitmap is passed.
3735 * gimple-range-path.h (path_range_query::compute_imports): New.
3736 (path_range_query::add_copies_to_imports): Remove.
3737 * tree-ssa-threadbackward.c (back_threader::resolve_def): Remove.
3738 (back_threader::find_paths_to_names): Inline resolve_def.
3739 (back_threader::find_paths): Call compute_imports.
3740 (back_threader::resolve_phi): Adjust comment.
3742 2021-11-11 Richard Biener <rguenther@suse.de>
3744 PR tree-optimization/103188
3745 * tree-ssa-loop-ch.c (should_duplicate_loop_header_p):
3746 Remove query parameter, split out check for size
3748 (ch_base::m_ranger, ch_base::m_query): Remove.
3749 (ch_base::copy_headers): Split processing loop into
3750 analysis around which we allocate and use ranger and
3751 transform where we do not.
3752 (pass_ch::execute): Do not allocate/free ranger here.
3753 (pass_ch_vect::execute): Likewise.
3755 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3757 * ipa-pure-const.c (propagate_pure_const): Self recursion is
3760 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3762 * cgraph.c (set_noreturn_flag_1): New function.
3763 (cgraph_node::set_noreturn_flag): New member function.
3764 * cgraph.h (cgraph_node::set_noreturn_flags): Declare.
3765 * ipa-pure-const.c (pass_local_pure_const::execute): Use it.
3767 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3768 Richard Biener <rguenther@suse.de>
3770 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): Resolve
3771 statically to the edge remaining in the loop.
3773 2021-11-11 Richard Biener <rguenther@suse.de>
3775 PR middle-end/103181
3776 * tree-eh.c (operation_could_trap_helper_p): Properly
3777 check vector constants for a zero element for integer
3778 division. Separate floating point and integer division code.
3780 2021-11-11 Jakub Jelinek <jakub@redhat.com>
3783 * dwarf2out.c (field_byte_offset): Do the PCC_BITFIELD_TYPE_MATTERS
3784 handling only for DECL_BIT_FIELD_TYPE decls.
3786 2021-11-11 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
3789 * config/aarch64/aarch64.c (aarch64_process_target_attr): Check if
3790 token is arch extension without leading '+' and emit appropriate
3791 diagnostic for the same.
3793 2021-11-11 Jakub Jelinek <jakub@redhat.com>
3795 * tree.h (OMP_CLAUSE_NUM_TEAMS_EXPR): Rename to ...
3796 (OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR): ... this.
3797 (OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR): Define.
3798 * tree.c (omp_clause_num_ops): Increase num ops for
3799 OMP_CLAUSE_NUM_TEAMS to 2.
3800 * tree-pretty-print.c (dump_omp_clause): Print optional lower bound
3801 for OMP_CLAUSE_NUM_TEAMS.
3802 * gimplify.c (gimplify_scan_omp_clauses): Gimplify
3803 OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR if non-NULL.
3804 (optimize_target_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR instead
3805 of OMP_CLAUSE_NUM_TEAMS_EXPR. Handle OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR.
3806 * omp-low.c (lower_omp_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR
3807 instead of OMP_CLAUSE_NUM_TEAMS_EXPR.
3808 * omp-expand.c (expand_teams_call, get_target_arguments): Likewise.
3810 2021-11-11 Richard Biener <rguenther@suse.de>
3812 * cfganal.c (find_pdom): Remove.
3813 (control_dependences::find_control_dependence): Remove
3814 special-casing of entry block, call get_immediate_dominator
3816 * gimple-predicate-analysis.cc (find_pdom): Remove.
3817 (find_dom): Likewise.
3818 (find_control_equiv_block): Call get_immediate_dominator
3820 (compute_control_dep_chain): Likewise.
3821 (predicate::init_from_phi_def): Likewise.
3823 2021-11-11 Richard Biener <rguenther@suse.de>
3825 * cfganal.h (control_dependences::control_dependence_map):
3827 (control_dependences::m_bitmaps): New.
3828 * cfganal.c (control_dependences::set_control_dependence_map_bit):
3830 (control_dependences::clear_control_dependence_bitmap):
3832 (control_dependences::find_control_dependence): Do not
3833 find_edge for the abnormal edge test.
3834 (control_dependences::control_dependences): Instead do not
3835 add abnormal edges to the edge list. Adjust.
3836 (control_dependences::~control_dependences): Likewise.
3837 (control_dependences::get_edges_dependent_on): Likewise.
3838 * function-tests.c: Include bitmap.h.
3840 2021-11-11 Kewen Lin <linkw@linux.ibm.com>
3842 * doc/invoke.texi: Change references to "future cpu" to "power10",
3843 "-mcpu=future" to "-mcpu=power10". Adjust words for float128.
3845 2021-11-11 Cui,Lili <lili.cui@intel.com>
3847 * config/i386/i386-options.c (m_CORE_AVX2): Remove Alderlake
3849 (processor_cost_table): Use alderlake_cost for Alderlake.
3850 * config/i386/i386.c (ix86_sched_init_global): Handle Alderlake.
3851 * config/i386/x86-tune-costs.h (struct processor_costs): Add alderlake
3853 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Alderlake
3855 (ix86_adjust_cost): Handle Alderlake.
3856 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE): Enable for Alderlake.
3857 (X86_TUNE_PARTIAL_REG_DEPENDENCY): Likewise.
3858 (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY): Likewise.
3859 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): Likewise.
3860 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
3861 (X86_TUNE_MEMORY_MISMATCH_STALL): Likewise.
3862 (X86_TUNE_USE_LEAVE): Likewise.
3863 (X86_TUNE_PUSH_MEMORY): Likewise.
3864 (X86_TUNE_USE_INCDEC): Likewise.
3865 (X86_TUNE_INTEGER_DFMODE_MOVES): Likewise.
3866 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
3867 (X86_TUNE_USE_SAHF): Likewise.
3868 (X86_TUNE_USE_BT): Likewise.
3869 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
3870 (X86_TUNE_ONE_IF_CONV_INSN): Likewise.
3871 (X86_TUNE_AVOID_MFENCE): Likewise.
3872 (X86_TUNE_USE_SIMODE_FIOP): Likewise.
3873 (X86_TUNE_EXT_80387_CONSTANTS): Likewise.
3874 (X86_TUNE_SSE_UNALIGNED_LOAD_OPTIMAL): Likewise.
3875 (X86_TUNE_SSE_UNALIGNED_STORE_OPTIMAL): Likewise.
3876 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
3877 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
3878 (X86_TUNE_AVOID_4BYTE_PREFIXES): Likewise.
3879 (X86_TUNE_USE_GATHER): Disable for Alderlake.
3880 (X86_TUNE_AVX256_MOVE_BY_PIECES): Likewise.
3881 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
3883 2021-11-11 liuhongt <hongtao.liu@intel.com>
3886 * config/i386/sse.md (V_128_256): Extend to V8HF/V16HF.
3887 (avxsizesuffix): Ditto.
3889 2021-11-11 Kito Cheng <kito.cheng@sifive.com>
3891 * common/config/riscv/riscv-common.c (riscv_subset_list::to_string): Fix
3892 wrong macro checking.
3894 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3896 PR tree-optimization/102906
3897 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): New.
3898 (should_duplicate_loop_header_p): Call entry_loop_condition_is_static.
3899 (class ch_base): Add m_ranger and m_query.
3900 (ch_base::copy_headers): Pass m_query to
3901 entry_loop_condition_is_static.
3902 (pass_ch::execute): Allocate and deallocate m_ranger and m_query.
3904 (pass_ch_vect::execute): Same.
3906 2021-11-10 Andrew Pinski <apinski@marvell.com>
3909 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>):
3910 Use vwcore iterator for the r constraint output string.
3912 2021-11-10 qing zhao <qing.zhao@oracle.com>
3914 * internal-fn.c (expand_DEFERRED_INIT): Apply pattern initialization
3915 only when have_insn_for return true for the mode. Fix a memory leak.
3917 2021-11-10 Christophe Lyon <christophe.lyon@foss.st.com>
3919 * config/arm/arm.c (cortexa9_extra_costs, cortexa8_extra_costs,
3920 cortexa5_extra_costs, cortexa7_extra_costs,
3921 cortexa12_extra_costs, cortexa15_extra_costs, v7m_extra_costs):
3922 Initialize movi, dup and extract costing fields.
3924 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3926 * gimple-range-path.cc (path_range_query::path_range_query): Do
3928 (path_range_query::dump): Change m_path uses to non-pointer.
3929 (path_range_query::defined_outside_path): Same.
3930 (path_range_query::set_path): Same.
3931 (path_range_query::add_copies_to_imports): Same.
3932 (path_range_query::range_of_stmt): Same.
3933 (path_range_query::compute_outgoing_relations): Same.
3934 (path_range_query::compute_ranges): Imports are now optional.
3935 Implement overload that takes an edge.
3936 * gimple-range-path.h (class path_range_query): Make imports
3937 optional for compute_ranges. Add compute_ranges(edge) overload.
3938 Make m_path an auto_vec instead of a pointer and adjust
3941 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3943 * tree-vectorizer.h (struct scalar_cond_masked_key): Add inverted_p.
3944 (default_hash_traits<scalar_conf_masked_key>): Likewise.
3945 * tree-vect-stmts.c (vectorizable_condition): Check if inverse of mask
3947 * tree-vectorizer.c (scalar_cond_masked_key::get_cond_ops_from_tree):
3948 Register mask inverses.
3950 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3952 * tree-vectorizer.c (vectorize_loops): Do local CSE through RPVN upon
3953 successful vectorization.
3955 2021-11-10 Andrew MacLeod <amacleod@redhat.com>
3957 * gimple-range-cache.cc (sbr_vector::grow): New.
3958 (sbr_vector::set_bb_range): Call grow.
3959 (sbr_vector::get_bb_range): Same.
3960 (sbr_vector::bb_range_p): Remove assert.
3962 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3964 * config/aarch64/aarch64-simd.md (*aarch64_topbits_shuffle<mode>_le,
3965 *aarch64_topbits_shuffle<mode>_be): Remove.
3967 2021-11-10 Jan Hubicka <jh@suse.cz>
3969 * ipa-modref.c: Include tree-eh.h.
3970 (modref_summary::modref_summary): Initialize side_effects.
3971 (struct modref_summary_lto): New bool field side_effects.
3972 (modref_summary_lto::modref_summary_lto): Initialize side_effects.
3973 (modref_summary::dump): Dump side_effects.
3974 (modref_summary_lto::dump): Dump side_effects.
3975 (merge_call_side_effects): Merge side effects.
3976 (process_fnspec): A call to a non-const/pure or looping
3977 function is a side effect.
3978 (analyze_call): Self-recursion is a side-effect; handle
3980 (analyze_load): Watch for volatile and throwing memory.
3981 (analyze_store): Likewise.
3982 (analyze_stmt): Watch for volatile asm.
3983 (analyze_function): Handle side_effects.
3984 (modref_summaries::duplicate): Duplicate side_effects.
3985 (modref_summaries_lto::duplicate): Likewise.
3986 (modref_write): Stream side_effects.
3987 (read_section): Likewise.
3988 (update_signature): Update.
3989 (propagate_unknown_call): Handle side_effects.
3990 (modref_propagate_in_scc): Likewise.
3991 * ipa-modref.h (struct modref_summary): Add side_effects.
3992 * ipa-pure-const.c (special_builtin_state): Rename to ...
3993 (builtin_safe_for_const_function_p): ... this one.
3994 (check_call): Update.
3995 (finite_function_p): Break out from ...
3996 (propagate_pure_const): ... here.
3997 * ipa-utils.h (finite_function): Declare.
3999 2021-11-10 Lucas A. M. Magalhães <lamm@linux.ibm.com>
4001 * config.gcc (powerpc*-*-*): Remove -rpath from
4002 --with-advance-toolchain.
4004 2021-11-10 Marek Polacek <polacek@redhat.com>
4007 * attribs.c (struct scoped_attributes): Add a bool member.
4008 (lookup_scoped_attribute_spec): Forward declare.
4009 (register_scoped_attributes): New bool parameter, defaulted to
4011 (handle_ignored_attributes_option): New function.
4012 (free_attr_data): New function.
4013 (init_attributes): Call handle_ignored_attributes_option.
4014 (attr_namespace_ignored_p): New function.
4015 (decl_attributes): Check attr_namespace_ignored_p before
4017 * attribs.h (free_attr_data): Declare.
4018 (register_scoped_attributes): Adjust declaration.
4019 (handle_ignored_attributes_option): Declare.
4020 (canonicalize_attr_name): New function template.
4021 (canonicalize_attr_name): Use it.
4022 * common.opt (Wattributes=): New option with a variable.
4023 * doc/extend.texi: Document #pragma GCC diagnostic ignored_attributes.
4024 * doc/invoke.texi: Document -Wno-attributes=.
4025 * opts.c (common_handle_option) <case OPT_Wattributes_>: Handle.
4026 * plugin.h (register_scoped_attributes): Adjust declaration.
4027 * toplev.c (compile_file): Call free_attr_data.
4029 2021-11-10 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
4031 * config/arm/arm-cpus.in (cortex-a710): New CPU.
4032 * config/arm/arm-tables.opt: Regenerate.
4033 * config/arm/arm-tune.md: Regenerate.
4034 * doc/invoke.texi: Update docs.
4036 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
4038 * config/aarch64/aarch64-builtins.c
4039 (aarch64_general_gimple_fold_builtin): Mark argument as unused.
4041 2021-11-10 Martin Liska <mliska@suse.cz>
4043 * lto-wrapper.c (merge_and_complain): Make the first argument
4046 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4048 * config/aarch64/iterators.md (optab): Use fmax_nan instead of
4049 smax_nan and fmin_nan instead of smin_nan.
4050 (maxmin_uns): Rename to...
4051 (fmaxmin): ...this and make the same changes. Remove entries
4052 unrelated to fmax* and fmin*.
4053 * config/aarch64/aarch64.md (<maxmin_uns><mode>3): Rename to...
4054 (<fmaxmin><mode>3): ...this.
4055 * config/aarch64/aarch64-simd.md (aarch64_<maxmin_uns>p<mode>): Rename to...
4057 (aarch64_<optab>p<mode>): ...this.
4058 (<maxmin_uns><mode>3): Rename to...
4059 (<fmaxmin><mode>3): ...this.
4060 (reduc_<maxmin_uns>_scal_<mode>): Rename to...
4061 (reduc_<optab>_scal_<mode>): ...this and update gen* call.
4062 (aarch64_reduc_<maxmin_uns>_internal<mode>): Rename to...
4063 (aarch64_reduc_<optab>_internal<mode>): ...this.
4064 (aarch64_reduc_<maxmin_uns>_internalv2si): Rename to...
4065 (aarch64_reduc_<optab>_internalv2si): ...this.
4066 * config/aarch64/aarch64-sve.md (<maxmin_uns><mode>3): Rename to...
4067 (<fmaxmin><mode>3): ...this.
4068 * config/aarch64/aarch64-simd-builtins.def (smax_nan, smin_nan): Rename to...
4070 (fmax_nan, fmin_nan): ...this.
4071 * config/aarch64/arm_neon.h (vmax_f32, vmax_f64, vmaxq_f32, vmaxq_f64)
4072 (vmin_f32, vmin_f64, vminq_f32, vminq_f64, vmax_f16, vmaxq_f16)
4073 (vmin_f16, vminq_f16): Update accordingly.
4075 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4077 * tree-vectorizer.h (vector_costs::finish_cost): Take the
4078 corresponding scalar costs as a parameter.
4079 (finish_cost): Likewise.
4080 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost)
4081 (vect_estimate_min_profitable_iters): Update accordingly.
4082 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Likewise.
4083 * tree-vectorizer.c (vector_costs::finish_cost): Likewise.
4084 * config/aarch64/aarch64.c (aarch64_vector_costs::finish_cost): Likewise.
4086 * config/rs6000/rs6000.c (rs6000_cost_data::finish_cost): Likewise.
4088 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4090 * tree-vectorizer.h (_loop_vec_info::scalar_costs): New member
4092 (_loop_vec_info::single_scalar_iteration_cost): Delete.
4093 (LOOP_VINFO_SINGLE_SCALAR_ITERATION_COST): Delete.
4094 (vector_costs::total_cost): New function.
4095 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
4096 after above changes.
4097 (_loop_vec_info::~_loop_vec_info): Delete scalar_costs.
4098 (vect_compute_single_scalar_iteration_cost): Store the costs
4099 in loop_vinfo->scalar_costs.
4100 (vect_estimate_min_profitable_iters): Get the scalar cost from
4101 loop_vinfo->scalar_costs.
4103 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4105 * tree-vectorizer.h (vector_costs::better_main_loop_than_p)
4106 (vector_costs::better_epilogue_loop_than_p)
4107 (vector_costs::compare_inside_loop_cost)
4108 (vector_costs::compare_outside_loop_cost): Likewise.
4109 * tree-vectorizer.c (vector_costs::better_main_loop_than_p)
4110 (vector_costs::better_epilogue_loop_than_p)
4111 (vector_costs::compare_inside_loop_cost)
4112 (vector_costs::compare_outside_loop_cost): New functions,
4113 containing code moved from...
4114 * tree-vect-loop.c (vect_better_loop_vinfo_p): ...here.
4116 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4118 * tree-vectorizer.h (_loop_vec_info): Remove vec_outside_cost
4119 and vec_inside_cost.
4120 (vector_costs::outside_cost): New function.
4121 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
4123 (vect_estimate_min_profitable_iters): Likewise.
4124 (vect_better_loop_vinfo_p): Get the inside and outside costs
4125 from the loop_vec_infos' vector_costs.
4127 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4129 * tree-vectorizer.h (vec_info::target_cost_data): Replace with...
4130 (_loop_vec_info::vector_costs): ...this.
4131 (LOOP_VINFO_TARGET_COST_DATA): Delete.
4132 * tree-vectorizer.c (vec_info::vec_info): Remove target_cost_data
4134 (vec_info::~vec_info): Remove corresponding delete.
4135 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Initialize
4136 vector_costs to null.
4137 (_loop_vec_info::~_loop_vec_info): Delete vector_costs.
4138 (vect_analyze_loop_operations): Update after above changes.
4139 (vect_analyze_loop_2): Likewise.
4140 (vect_estimate_min_profitable_iters): Likewise.
4141 * tree-vect-slp.c (vect_slp_analyze_operations): Likewise.
4143 2021-11-10 Jan Hubicka <hubicka@ucw.cz>
4145 * tree-core.h (EAF_DIRECT): Remove.
4146 (EAF_NOCLOBBER): Remove.
4147 (EAF_UNUSED): Remove.
4148 (EAF_NOESCAPE): Remove.
4149 (EAF_NO_DIRECT_CLOBBER): New.
4150 (EAF_NO_INDIRECT_CLOBBER): New.
4151 (EAF_NODIRECTESCAPE): Remove.
4152 (EAF_NO_DIRECT_ESCAPE): New.
4153 (EAF_NO_INDIRECT_ESCAPE): New.
4154 (EAF_NOT_RETURNED): Remove.
4155 (EAF_NOT_RETURNED_INDIRECTLY): New.
4156 (EAF_NOREAD): Remove.
4157 (EAF_NO_DIRECT_READ): New.
4158 (EAF_NO_INDIRECT_READ): New.
4159 * gimple.c (gimple_call_arg_flags): Update for new flags.
4160 (gimple_call_retslot_flags): Update for new flags.
4161 * ipa-modref.c (dump_eaf_flags): Likewise.
4162 (remove_useless_eaf_flags): Likewise.
4163 (deref_flags): Likewise.
4164 (modref_lattice::init): Likewise.
4165 (modref_lattice::merge): Likewise.
4166 (modref_lattice::merge_direct_load): Likewise.
4167 (modref_lattice::merge_direct_store): Likewise.
4168 (modref_eaf_analysis::merge_call_lhs_flags): Likewise.
4169 (callee_to_caller_flags): Likewise.
4170 (modref_eaf_analysis::analyze_ssa_name): Likewise.
4171 (modref_eaf_analysis::propagate): Likewise.
4172 (modref_merge_call_site_flags): Likewise.
4173 * ipa-modref.h (interposable_eaf_flags): Likewise.
4174 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Likewise.
4175 * tree-ssa-structalias.c (handle_call_arg): Likewise.
4176 (handle_rhs_call): Likewise.
4177 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Likewise.
4179 2021-11-10 Tamar Christina <tamar.christina@arm.com>
4181 * tree-vect-slp-patterns.c (complex_mul_pattern::matches): Remove l1node.
4183 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
4185 * config/aarch64/aarch64-builtins.c
4186 (aarch64_general_gimple_fold_builtin): Change pointer alignment and
4189 2021-11-10 Jan Hubicka <jh@suse.cz>
4191 * ipa-modref-tree.h (modref_tree::remap_params): Fix off-by-one error.
4193 2021-11-10 H.J. Lu <hongjiu.lu@intel.com>
4194 Hongtao Liu <hongtao.liu@intel.com>
4196 PR middle-end/102566
4197 * match.pd (nop_atomic_bit_test_and_p): New match.
4198 * tree-ssa-ccp.c (convert_atomic_bit_not): New function.
4199 (gimple_nop_atomic_bit_test_and_p): New prototype.
4200 (optimize_atomic_bit_test_and): Transform equivalent, but slightly
4201 different cases to their canonical forms.
4203 2021-11-10 liuhongt <hongtao.liu@intel.com>
4205 PR tree-optimization/103126
4206 * tree-vect-loop.c (neutral_op_for_reduction): Remove static.
4207 * tree-vectorizer.h (neutral_op_for_reduction): Declare.
4208 * tree-if-conv.c: Include tree-vectorizer.h.
4209 (is_cond_scalar_reduction): Handle
4210 BIT_XOR_EXPR/BIT_IOR_EXPR/BIT_AND_EXPR.
4211 (convert_scalar_cond_reduction): Ditto.
4213 2021-11-10 konglin1 <lingling.kong@intel.com>
4215 * config/i386/sse.md (cmul<conj_op><mode>3): Add new define_expand.
4216 (cmla<conj_op><mode>4): Likewise.
4218 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
4220 * tree-ssa-threadedge.c: Do not include
4221 gimple-ssa-evrp-analyze.h.
4222 * value-pointer-equiv.cc: Same.
4224 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
4226 * tree-ssa-threadbackward.c
4227 (back_threader::maybe_register_path_dump): Abstract path dumping...
4228 (dump_path): ...here.
4229 (back_threader::resolve_phi): Call dump_path.
4232 2021-11-10 konglin1 <lingling.kong@intel.com>
4234 * config/i386/sse.md (fma_<complexpairopname>_<mode>_pair):
4235 Add new define_insn.
4236 (fma_<mode>_fmaddc_bcst): Add new define_insn_and_split.
4237 (fma_<mode>_fcmaddc_bcst): Likewise.
4239 2021-11-10 liuhongt <hongtao.liu@intel.com>
4242 * match.pd: Simplify (trunc)fmax/fmin((extend)a, (extend)b) to
4245 2021-11-10 Andrew Pinski <apinski@marvell.com>
4248 * config/aarch64/aarch64.c (aarch64_evpc_ins): Don't use target
4249 as an input, use original one.
4251 2021-11-10 Sandra Loosemore <sandra@codesourcery.com>
4253 * config/nios2/nios2.c (nios2_can_inline_p): New.
4254 (TARGET_CAN_INLINE_P): Define.
4256 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
4258 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin):
4259 Disable gimple fold for RS6000_BIF_{XVMINDP,XVMINSP,VMINFP} and
4260 RS6000_BIF_{XVMAXDP,XVMAXSP,VMAXFP} when fast-math is not set.
4261 (lxvrse_expand_builtin): Modify the expansion for sign extension.
4262 All extensions are done within VSX registers.
4264 2021-11-09 Andrew Pinski <apinski@marvell.com>
4266 PR tree-optimization/10352
4267 * match.pd: Remove check of TYPE_PRECISION for
4270 2021-11-09 Andrew MacLeod <amacleod@redhat.com>
4272 * function.c (allocate_struct_function): Don't set x_range_query.
4273 * function.h (get_range_query): Move to value-query.h.
4274 * gimple-range.cc (enable_ranger): Check that query is currently NULL.
4275 (disable_ranger): Clear function current query field.
4276 * value-query.cc (get_global_range_query): Relocate to:
4277 * value-query.h (get_global_range_query): Here and inline.
4278 (get_range_query): Relocate here from function.h.
4280 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4282 * tree-ssa-threadbackward.c
4283 (back_threader::maybe_register_path_dump): New.
4284 (back_threader::maybe_register_path): Call maybe_register_path_dump.
4286 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4288 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
4289 Return NULL when unprofitable.
4291 2021-11-09 Martin Jambor <mjambor@suse.cz>
4293 * tree.h (build_debug_expr_decl): Declare.
4294 * tree.c (build_debug_expr_decl): New function.
4295 * cfgexpand.c (avoid_deep_ter_for_debug): Use build_debug_expr_decl
4296 instead of building a DEBUG_EXPR_DECL.
4297 * ipa-param-manipulation.c
4298 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
4299 * omp-simd-clone.c (ipa_simd_modify_stmt_ops): Likewise.
4300 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Likewise.
4301 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
4302 * tree-ssa-reassoc.c (make_new_ssa_for_def): Likewise.
4304 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
4306 * config/rs6000/rs6000-call.c (rs6000_type_string): New function.
4307 (def_builtin): Change debug formatting for easier parsing and
4308 include more information.
4309 (rs6000_init_builtins): Add dump of autogenerated builtins.
4310 (altivec_init_builtins): Dump __builtin_altivec_mask_for_load for
4313 2021-11-09 Marek Polacek <polacek@redhat.com>
4315 * ipa-modref.c (analyze_parms): Fix a typo.
4317 2021-11-09 Kito Cheng <kito.cheng@sifive.com>
4320 * common/config/riscv/riscv-common.c (multi_letter_subset_rank): Remove
4321 assertion for Z*-ext.
4323 2021-11-09 Jan Hubicka <hubicka@ucw.cz>
4325 * ipa-modref.c (analyze_parms): Add past_flags, past_retslot_flags
4326 and past_static_chain; merge past summary with current one.
4327 (analyze_function): Update.
4329 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4331 * doc/invoke.texi (Invoking GCC): Document --param=threader-debug.
4333 2021-11-09 Martin Liska <mliska@suse.cz>
4335 * print-rtl.c (rtx_writer::rtx_writer): Make the compilation
4336 conditional based on
4337 * print-rtl.h (class rtx_writer): Likewise.
4339 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
4341 * input.h (location_hash): Use 'BUILTINS_LOCATION' as spare value
4342 for 'Deleted'. Turn into a '#define'.
4344 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4346 * gimple-range-path.cc (path_range_query::dump): Clean up.
4347 (path_range_query::compute_ranges): Same.
4348 * value-relation.cc (path_oracle::dump): Same.
4350 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4352 * dumpfile.c (dump_options): Remove TDF_THREADING entry.
4353 * dumpfile.h (enum dump_flag): Remove TDF_THREADING and adjust
4355 * flag-types.h (enum threader_debug): New.
4356 * gimple-range-path.cc (DEBUG_SOLVER): Use param_threader_debug.
4357 * params.opt: Add entry for --param=threader-debug=.
4359 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
4361 PR middle-end/101204
4363 * diagnostic-spec.h (typedef xint_hash_t): Turn into...
4364 (struct xint_hash_t): ... this.
4365 * doc/gty.texi: Update.
4367 2021-11-09 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
4369 * config/arm/arm-cpus.in (armv9): New define.
4370 (ARMv9a): New group.
4371 (armv9-a): New arch definition.
4372 * config/arm/arm-tables.opt: Regenerate.
4373 * config/arm/arm.h (BASE_ARCH_9A): New arch enum value.
4374 * config/arm/t-aprofile: Added armv9-a and armv9+simd.
4375 * config/arm/t-arm-elf: Added armv9-a, v9_fps and all_v9_archs
4376 to MULTILIB_MATCHES.
4377 * config/arm/t-multilib: Added v9_a_nosimd_variants and
4378 v9_a_simd_variants to MULTILIB_MATCHES.
4379 * doc/invoke.texi: Update docs.
4381 2021-11-09 Martin Jambor <mjambor@suse.cz>
4384 * ipa-param-manipulation.c (replace_with_mapped_expr): Early
4385 return with error_mark_node when part of expression is mapped to
4387 (ipa_param_body_adjustments::remap_with_debug_expressions): Set
4388 mapped value to NULL if walk_tree returns error_mark_node.
4390 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4392 * tree-ssa-strlen.c (compare_nonzero_chars): Pass statement
4394 (get_addr_stridx): Same.
4396 (get_range_strlen_dynamic): Same.
4397 (handle_builtin_strlen): Same.
4398 (handle_builtin_strchr): Same.
4399 (handle_builtin_strcpy): Same.
4400 (maybe_diag_stxncpy_trunc): Same.
4401 (handle_builtin_stxncpy_strncat): Same.
4402 (handle_builtin_memcpy): Same.
4403 (handle_builtin_strcat): Same.
4404 (handle_alloc_call): Same.
4405 (handle_builtin_memset): Same.
4406 (handle_builtin_string_cmp): Same.
4407 (handle_pointer_plus): Same.
4408 (count_nonzero_bytes_addr): Same.
4409 (count_nonzero_bytes): Same.
4410 (handle_store): Same.
4411 (fold_strstr_to_strncmp): Same.
4412 (handle_integral_assign): Same.
4413 (check_and_optimize_stmt): Same.
4414 (class strlen_dom_walker): Replace evrp with ranger.
4415 (strlen_dom_walker::before_dom_children): Remove evrp.
4416 (strlen_dom_walker::after_dom_children): Remove evrp.
4417 * gimple-ssa-warn-access.cc (maybe_check_access_sizes):
4418 Restrict sprintf output.
4420 2021-11-09 Martin Liska <mliska@suse.cz>
4422 * genconditions.c (write_one_condition): Add const qualifier
4425 2021-11-09 Jeff Law <jeffreyalaw@gmail.com>
4427 * match.pd: New pattern to simplify (1 << n) & M ==/!= 0 for M
4430 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
4432 * config/rs6000/rs6000-builtin-new.def (VEC_INIT_V16QI): Use
4433 escape-newline support.
4434 (VEC_INIT_V4SI): Likewise.
4435 (VEC_INIT_V8HI): Likewise.
4436 (PACK_V1TI): Likewise.
4438 (VFIRSTMISMATCHOREOSINDEX_V16QI): Likewise.
4439 (VFIRSTMISMATCHOREOSINDEX_V8HI): Likewise.
4440 (VFIRSTMISMATCHOREOSINDEX_V4SI): Likewise.
4444 (FMAF128_ODD): Likewise.
4445 (VSCEQPUO): Likewise.
4447 (VSIEQPF): Likewise.
4448 (VSTDCQP): Likewise.
4449 (PACK_TD): Likewise.
4450 (TABORTDC): Likewise.
4451 (TABORTDCI): Likewise.
4452 (SE_LXVRBX): Likewise.
4453 (SE_LXVRHX): Likewise.
4454 (SE_LXVRWX): Likewise.
4455 (SE_LXVRDX): Likewise.
4456 (VREPLACE_UN_UV2DI): Likewise.
4457 (VREPLACE_UN_UV4SI): Likewise.
4458 (VREPLACE_UN_V2DI): Likewise.
4459 (VREPLACE_ELT_UV2DI): Likewise.
4460 (VREPLACE_ELT_V2DI): Likewise.
4461 (ZE_LXVRBX): Likewise.
4462 (ZE_LXVRHX): Likewise.
4463 (ZE_LXVRWX): Likewise.
4464 (ZE_LXVRDX): Likewise.
4466 (CNTLZDM): Likewise.
4467 (CNTTZDM): Likewise.
4470 (PMXVBF16GER2): Likewise.
4471 (PMXVBF16GER2_INTERNAL): Likewise.
4472 (PMXVBF16GER2NN): Likewise.
4473 (PMXVBF16GER2NN_INTERNAL): Likewise.
4474 (PMXVBF16GER2NP): Likewise.
4475 (PMXVBF16GER2NP_INTERNAL): Likewise.
4476 (PMXVBF16GER2PN): Likewise.
4477 (PMXVBF16GER2PN_INTERNAL): Likewise.
4478 (PMXVBF16GER2PP): Likewise.
4479 (PMXVBF16GER2PP_INTERNAL): Likewise.
4480 (PMXVF16GER2): Likewise.
4481 (PMXVF16GER2_INTERNAL): Likewise.
4482 (PMXVF16GER2NN): Likewise.
4483 (PMXVF16GER2NN_INTERNAL): Likewise.
4484 (PMXVF16GER2NP): Likewise.
4485 (PMXVF16GER2NP_INTERNAL): Likewise.
4486 (PMXVF16GER2PN): Likewise.
4487 (PMXVF16GER2PN_INTERNAL): Likewise.
4488 (PMXVF16GER2PP): Likewise.
4489 (PMXVF16GER2PP_INTERNAL): Likewise.
4490 (PMXVF32GER_INTERNAL): Likewise.
4491 (PMXVF32GERNN): Likewise.
4492 (PMXVF32GERNN_INTERNAL): Likewise.
4493 (PMXVF32GERNP): Likewise.
4494 (PMXVF32GERNP_INTERNAL): Likewise.
4495 (PMXVF32GERPN): Likewise.
4496 (PMXVF32GERPN_INTERNAL): Likewise.
4497 (PMXVF32GERPP): Likewise.
4498 (PMXVF32GERPP_INTERNAL): Likewise.
4499 (PMXVF64GER): Likewise.
4500 (PMXVF64GER_INTERNAL): Likewise.
4501 (PMXVF64GERNN): Likewise.
4502 (PMXVF64GERNN_INTERNAL): Likewise.
4503 (PMXVF64GERNP): Likewise.
4504 (PMXVF64GERNP_INTERNAL): Likewise.
4505 (PMXVF64GERPN): Likewise.
4506 (PMXVF64GERPN_INTERNAL): Likewise.
4507 (PMXVF64GERPP): Likewise.
4508 (PMXVF64GERPP_INTERNAL): Likewise.
4509 (PMXVI16GER2): Likewise.
4510 (PMXVI16GER2_INTERNAL): Likewise.
4511 (PMXVI16GER2PP): Likewise.
4512 (PMXVI16GER2PP_INTERNAL): Likewise.
4513 (PMXVI16GER2S): Likewise.
4514 (PMXVI16GER2S_INTERNAL): Likewise.
4515 (PMXVI16GER2SPP): Likewise.
4516 (PMXVI16GER2SPP_INTERNAL): Likewise.
4517 (PMXVI4GER8): Likewise.
4518 (PMXVI4GER8_INTERNAL): Likewise.
4519 (PMXVI4GER8PP): Likewise.
4520 (PMXVI4GER8PP_INTERNAL): Likewise.
4521 (PMXVI8GER4): Likewise.
4522 (PMXVI8GER4_INTERNAL): Likewise.
4523 (PMXVI8GER4PP): Likewise.
4524 (PMXVI8GER4PP_INTERNAL): Likewise.
4525 (PMXVI8GER4SPP): Likewise.
4526 (PMXVI8GER4SPP_INTERNAL): Likewise.
4527 * config/rs6000/rs6000-gen-builtins.c (MAXLINES): New macro.
4528 (linebuf): Increase size.
4529 (lines): New variable.
4530 (lastline): Likewise.
4531 (real_line_pos): New function.
4532 (diag): Change signature.
4533 (bif_diag): Change signature; support escape-newline handling.
4534 (ovld_diag): Likewise.
4535 (fatal): Move earlier.
4536 (consume_whitespace): Adjust diag call.
4537 (advance_line): Add escape-newline handling; call fatal.
4538 (safe_inc_pos): Adjust diag call.
4539 (match_identifier): Likewise.
4540 (match_integer): Likewise.
4541 (match_to_right_bracket): Call fatal instead of diag; adjust diag
4543 (match_basetype): Adjust diag calls.
4544 (match_bracketed_pair): Likewise.
4545 (match_const_restriction): Likewise.
4546 (match_type): Likewise.
4547 (parse_args): Likewise.
4548 (parse_bif_attrs): Likewise.
4549 (complete_vector_type): Likewise.
4550 (complete_base_type): Likewise.
4551 (parse_prototype): Likewise.
4552 (parse_bif_entry): Likewise.
4553 (parse_bif_stanza): Likewise.
4554 (parse_ovld_entry): Likewise.
4555 (parse_ovld_stanza): Likewise.
4556 (main): Allocate buffers for lines[].
4558 2021-11-08 Eric Botcazou <ebotcazou@adacore.com>
4560 * config/i386/i386.c (legitimize_pic_address): Adjust comment and
4561 use the REG argument on the CM_LARGE_PIC code path as well.
4562 * config/i386/predicates.md (gotoff_operand): Do not treat VxWorks
4563 specially with the large code models.
4565 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4567 * gimple.c (gimple_call_static_chain_flags): Revert the workaround
4568 allowing interposition since issues with binds_to_local_def were
4571 2021-11-08 Andrew MacLeod <amacleod@redhat.com>
4573 PR tree-optimization/103122
4574 * gimple-range.cc (gimple_ranger::range_of_expr): Request the cache
4575 entry with "calculate new values" set to false.
4577 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4579 * builtins.c (is_simple_builtin): Add builtin_dwarf_cfa
4580 and builtin_return_address.
4581 (builtin_fnspec): Annotate builtin_return,
4582 builtin_eh_pointer, builtin_eh_filter, builtin_unwind_resume,
4583 builtin_cxa_end_cleanup, builtin_eh_copy_values,
4584 builtin_frame_address, builtin_apply_args,
4585 builtin_asan_before_dynamic_init, builtin_asan_after_dynamic_init,
4586 builtin_prefetch, builtin_dwarf_cfa, builtin_return_address
4588 * ipa-pure-const.c (special_builtin_state): Add builtin_dwarf_cfa
4589 and builtin_return_address.
4591 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4593 PR tree-optimization/103177
4594 * passes.def: Move uncprop after pure/const and modref.
4596 2021-11-08 Martin Jambor <mjambor@suse.cz>
4600 * tree-inline.c (remap_gimple_stmt): Unshare the expression without
4601 location before invoking remap_with_debug_expressions on it.
4602 * ipa-param-manipulation.c
4603 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
4605 2021-11-08 David Edelsohn <dje.gcc@gmail.com>
4607 * config/rs6000/vsx.md (vsx_splat_v4si_di): Revert "wa"
4610 2021-11-08 Richard Biener <rguenther@suse.de>
4612 * cfgloop.c (verify_loop_structure): Use a temporary BB flag
4613 instead of an sbitmap to cache irreducible state.
4615 2021-11-08 Aldy Hernandez <aldyh@redhat.com>
4617 PR tree-optimization/103120
4618 * gimple-range-path.cc (path_range_query::range_defined_in_block):
4619 Bail if there's a cache entry.
4621 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
4623 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Use
4624 rs6000_builtin_decls_x when appropriate.
4625 (add_condition_to_bb): Likewise.
4626 (rs6000_atomic_assign_expand_fenv): Likewise.
4628 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
4630 * config/rs6000/rs6000-call.c (rs6000_new_builtin_decl): New function.
4631 (rs6000_builtin_decl): Call it.
4633 2021-11-08 Martin Liska <mliska@suse.cz>
4635 * cgraph.c (cgraph_node::dump): Dump it from decl.
4637 2021-11-08 Martin Liska <mliska@suse.cz>
4639 PR gcov-profile/100520
4640 * coverage.c (coverage_compute_profile_id): Strip .gk when
4641 compare debug is used.
4642 * system.h (endswith): New function.
4644 2021-11-08 Martin Liska <mliska@suse.cz>
4646 * cgraph.c (cgraph_node::dump): Dump static_chain_decl.
4648 2021-11-08 Thomas Schwinge <thomas@codesourcery.com>
4650 * config/rs6000/rbtree.c: Fix 'Copyright (C) 2020-21' into '2020-2021'.
4651 * config/rs6000/rbtree.h: Likewise.
4652 * config/rs6000/rs6000-builtin-new.def: Likewise.
4653 * config/rs6000/rs6000-gen-builtins.c: Likewise.
4654 * config/rs6000/rs6000-overload.def: Likewise.
4656 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
4658 * config/aarch64/aarch64.c (aarch64_ldn_stn_vectors): New function.
4659 (aarch64_address_cost): Use it instead of testing for CImode and
4662 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
4664 * genmodes.c (emit_insn_modes_h): Define NUM_MODE_* macros.
4665 * expmed.h (NUM_MODE_INT): Delete in favor of genmodes definitions.
4666 (NUM_MODE_PARTIAL_INT, NUM_MODE_VECTOR_INT): Likewise.
4667 * real.h (real_format_for_mode): Use NUM_MODE_FLOAT and
4668 NUM_MODE_DECIMAL_FLOAT.
4669 (REAL_MODE_FORMAT): Likewise.
4671 2021-11-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
4673 * tree-vect-loop.c (vect_better_loop_vinfo_p): Change how epilogue loop
4676 2021-11-08 Richard Biener <rguenther@suse.de>
4678 * tree-vectorizer.h (vect_create_loop_vinfo): Add main_loop_info
4680 * tree-vect-loop.c (vect_create_loop_vinfo): Likewise. Set
4681 LOOP_VINFO_ORIG_LOOP_INFO and conditionalize set of
4682 LOOP_VINFO_NITERS_ASSUMPTIONS.
4683 (vect_analyze_loop_1): Adjust.
4684 (vect_analyze_loop): Move loop constraint setting and
4685 SCEV/niter reset here from vect_create_loop_vinfo to perform
4687 (vect_analyze_loop_form): Move dumping of symbolic niters
4688 here from vect_create_loop_vinfo.
4690 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4692 * ipa-modref-tree.h (enum modref_special_parms): New enum.
4693 (struct modref_access_node): Update for special parms.
4694 (struct modref_ref_node): Likewise.
4695 (struct modref_parm_map): Likewise.
4696 (struct modref_tree): Likewise.
4697 * ipa-modref.c (dump_access): Likewise.
4698 (get_access): Detect static chain.
4699 (parm_map_for_arg): Take tree as arg instead of
4701 (merge_call_side_effects): Compute map for static chain.
4702 (process_fnspec): Update.
4703 (struct escape_point): Remove retslot_arg and static_chain_arg.
4704 (analyze_parms): Update.
4705 (compute_parm_map): Update.
4706 (propagate_unknown_call): Update.
4707 (modref_propagate_in_scc): Update.
4708 (modref_merge_call_site_flags): Update.
4709 (ipa_merge_modref_summary_after_inlining): Update.
4710 * tree-ssa-alias.c (modref_may_conflict): Handle static chain.
4711 * ipa-modref-tree.c (test_merge): Update.
4713 2021-11-08 Haochen Gui <guihaoc@gcc.gnu.org>
4715 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin): Disable
4716 gimple fold for VSX_BUILTIN_XVMINDP, ALTIVEC_BUILTIN_VMINFP,
4717 VSX_BUILTIN_XVMAXDP, ALTIVEC_BUILTIN_VMAXFP when fast-math is not
4720 2021-11-08 liuhongt <hongtao.liu@intel.com>
4722 PR tree-optimization/103077
4723 * doc/invoke.texi (Options That Control Optimization):
4724 Update documentation for -ftree-loop-vectorize and
4725 -ftree-slp-vectorize which are enabled by default at -O2.
4727 2021-11-08 liuhongt <hongtao.liu@intel.com>
4730 * match.pd (Simplification (trunc)copysign((extend)a, (extend)b)
4731 to .COPYSIGN (a, b)): Add !HONOR_SNANS.
4733 2021-11-08 liuhongt <hongtao.liu@intel.com>
4736 * match.pd: Simplify
4737 (trunc)fma ((extend)a, (extend)b, (extend)c) to IFN_FMA (a, b,
4738 c) under flag_unsafe_math_optimizations.
4740 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4743 * params.opt (modref-max-depth): Add range.
4744 (modref-max-adjustments): Fix range.
4746 2021-11-07 Aldy Hernandez <aldyh@redhat.com>
4748 * tree-pass.h (make_pass_vrp_threader): Remove.
4749 * tree-ssa-threadbackward.c
4750 (back_threader_profitability::profitable_path_p): Remove
4751 ASSERT_EXPR references.
4752 * tree-ssa-threadedge.c (jt_state::register_equivs_stmt): Same.
4753 * tree-vrp.c (vrp_folder::simplify_casted_conds): Same.
4754 (execute_vrp): Same.
4755 (class hybrid_threader): Remove.
4756 (hybrid_threader::hybrid_threader): Remove.
4757 (hybrid_threader::~hybrid_threader): Remove.
4758 (hybrid_threader::before_dom_children): Remove.
4759 (hybrid_threader::after_dom_children): Remove.
4760 (execute_vrp_threader): Remove.
4761 (class pass_vrp_threader): Remove.
4762 (make_pass_vrp_threader): Remove.
4764 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4766 * gimple.c (gimple_call_arg_flags): Use interposable_eaf_flags.
4767 (gimple_call_retslot_flags): Likewise.
4768 (gimple_call_static_chain_flags): Likewise.
4769 * ipa-modref.c (remove_useless_eaf_flags): Do not remove everything for
4771 (modref_summary::useful_p): Likewise.
4772 (modref_summary_lto::useful_p): Likewise.
4773 (analyze_parms): Do not give up on NOVOPS.
4774 (analyze_function): When dumping, report changes in EAF flags
4775 between IPA and local pass.
4776 (modref_merge_call_site_flags): Compute implicit eaf flags
4777 based on callee ecf_flags and fnspec; if the function does not
4778 bind to current defs use interposable_eaf_flags.
4779 (modref_propagate_flags_in_scc): Update.
4780 * ipa-modref.h (interposable_eaf_flags): New function.
4782 2021-11-07 Bill Schmidt <wschmidt@linux.ibm.com>
4784 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): New
4786 (rs6000_invalid_new_builtin): New function.
4787 (rs6000_expand_builtin): Call rs6000_expand_new_builtin.
4788 (rs6000_expand_ldst_mask): New function.
4789 (new_cpu_expand_builtin): Likewise.
4790 (elemrev_icode): Likewise.
4791 (ldv_expand_builtin): Likewise.
4792 (lxvrse_expand_builtin): Likewise.
4793 (lxvrze_expand_builtin): Likewise.
4794 (stv_expand_builtin): Likewise.
4795 (new_mma_expand_builtin): Likewise.
4796 (new_htm_spr_num): Likewise.
4797 (new_htm_expand_builtin): Likewise.
4798 (rs6000_expand_new_builtin): Likewise.
4799 (rs6000_init_builtins): Initialize altivec_builtin_mask_for_load.
4801 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4803 * ipa-modref.c (modref_lattice): Add do_dataflow,
4804 changed and propagate_to fields.
4805 (modref_lattice::release): Free propagate_to.
4806 (modref_lattice::merge): Do not give up early on unknown
4808 (modref_lattice::merge_deref): Likewise.
4809 (modref_eaf_analysis): Update toplevel comment.
4810 (modref_eaf_analysis::analyze_ssa_name): Record postponed ssa names;
4811 do optimistic dataflow initialization.
4812 (modref_eaf_analysis::merge_with_ssa_name): Build dataflow graph.
4813 (modref_eaf_analysis::propagate): New member function.
4814 (analyze_parms): Update to new API of modref_eaf_analysis.
4816 2021-11-06 Jan Hubicka <hubicka@ucw.cz>
4818 * cgraph.h (cgraph_node::can_be_discarded_p): Do not
4819 return true on functions from other partition.
4821 2021-11-06 David Edelsohn <dje.gcc@gmail.com>
4823 * config/rs6000/vsx.md (vsx_splat_v4si): Change constraints to "wa".
4824 (vsx_splat_v4si_di): Change constraint to "wa".
4826 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
4828 PR tree-optimization/103061
4829 * value-relation.cc (path_oracle::path_oracle): Initialize
4831 (path_oracle::killing_def): Set m_killed_defs.
4832 (path_oracle::query_relation): Do not look at the root oracle for
4834 * value-relation.h (class path_oracle): Add m_killed_defs.
4836 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
4838 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
4839 Remove gotos and other cleanups.
4841 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
4844 * ipa-modref-tree.h (modref_tree::insert): Do nothing for
4845 paradoxical and zero sized accesses.
4847 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
4850 * ipa-modref-tree.h (struct modref_access_node): Avoid left shift
4853 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
4855 * config/darwin.h (ASM_GENERATE_INTERNAL_LABEL): Add LTRAMP
4856 to the list of symbol prefixes that must be made linker-
4859 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
4861 * config.host: Add support for aarch64-*-darwin.
4862 * config/aarch64/host-aarch64-darwin.c: New file.
4863 * config/aarch64/x-darwin: New file.
4865 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
4867 PR tree-optimization/103093
4868 * gimple-range-gori.cc (range_def_chain::get_imports): Remove assert.
4870 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
4872 PR tree-optimization/102943
4873 * gimple-range-cache.cc (class update_list): New.
4874 (update_list::add): Replace add_to_update.
4875 (update_list::pop): New.
4876 (ranger_cache::ranger_cache): Adjust.
4877 (ranger_cache::~ranger_cache): Adjust.
4878 (ranger_cache::add_to_update): Delete.
4879 (ranger_cache::propagate_cache): Adjust to new class.
4880 (ranger_cache::propagate_updated_value): Ditto.
4881 (ranger_cache::fill_block_cache): Ditto.
4882 * gimple-range-cache.h (class ranger_cache): Adjust to update class.
4884 2021-11-05 Richard Biener <rguenther@suse.de>
4886 * tree-vect-loop.c (vect_analyze_loop): Remove obsolete
4887 comment and expand on another one. Combine nested if.
4889 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
4892 * config/pa/pa-modes.def: Add OImode integer type.
4893 * config/pa/pa.c (pa_scalar_mode_supported_p): Allow TImode
4895 * config/pa/pa.h (MIN_UNITS_PER_WORD): Define to
4896 UNITS_PER_WORD if IN_LIBGCC2.
4897 * config/pa/pa.md (addti3, addvti3, subti3, subvti3, negti2,
4898 negvti2, ashlti3, shrpd_internal): New patterns.
4899 Change some multi instruction types to multi.
4901 2021-11-05 Jakub Jelinek <jakub@redhat.com>
4904 * config/i386/i386.h
4905 (stringop_algs::stringop_strategy::stringop_strategy): Make the ctor
4908 2021-11-05 Wilco Dijkstra <wdijkstr@arm.com>
4911 * config/aarch64/aarch64.c (aarch64_mov_operand_p): Strip the salt
4913 * config/aarch64/constraints.md: Support const in Usw.
4915 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
4917 * config/pa/pa.h (PREFERRED_DEBUGGING_TYPE): Define to DWARF2_DEBUG.
4918 * config/pa/pa64-hpux.h (PREFERRED_DEBUGGING_TYPE): Remove define.
4920 2021-11-05 Richard Biener <rguenther@suse.de>
4922 * tree-vectorizer.h (vec_info_shared::n_stmts): Add.
4923 (LOOP_VINFO_N_STMTS): Likewise.
4924 (vec_info_for_bb): Remove unused function.
4925 * tree-vectorizer.c (vec_info_shared::vec_info_shared):
4926 Initialize n_stmts member.
4927 * tree-vect-loop.c: Remove INCLUDE_FUNCTIONAL.
4928 (vect_create_loop_vinfo): Do not set loop->aux.
4929 (vect_analyze_loop_2): Do not get n_stmts as argument,
4930 instead use LOOP_VINFO_N_STMTS. Set LOOP_VINFO_VECTORIZABLE_P
4932 (vect_analyze_loop_1): Remove callback, get the mode iterator
4933 and autodetected_vector_mode as argument, advancing the
4934 iterator and initializing autodetected_vector_mode here.
4935 (vect_analyze_loop): Split analysis loop into two, first
4936 processing main loops only and then epilogues.
4938 2021-11-05 Martin Jambor <mjambor@suse.cz>
4940 * ipa-prop.c (compute_complex_assign_jump_func): Remove
4941 unnecessary check for RECORD_TYPE.
4943 2021-11-05 Gerald Pfeifer <gerald@pfeifer.com>
4945 * doc/install.texi2html: Do not generate old.html any longer.
4947 2021-11-05 Martin Liska <mliska@suse.cz>
4950 * opts.c (finish_options): Reset flag_gtoggle when it is used.
4952 2021-11-05 Jakub Jelinek <jakub@redhat.com>
4955 * dwarf2out.c (add_const_value_attribute): Add MODE argument, use it
4956 in CONST_WIDE_INT handling. Adjust recursive calls.
4957 (add_location_or_const_value_attribute): Pass DECL_MODE (decl) to
4958 new add_const_value_attribute argument.
4959 (tree_add_const_value_attribute): Pass TYPE_MODE (type) to new
4960 add_const_value_attribute argument.
4962 2021-11-05 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
4964 * config/vx-common.h: Test value of TARGET_VXWORKS7 rather
4967 2021-11-05 Richard Biener <rguenther@suse.de>
4969 * tree-vectorizer.h (struct vect_loop_form_info): New.
4970 (vect_analyze_loop_form): Adjust.
4971 (vect_create_loop_vinfo): New.
4972 * tree-parloops.c (gather_scalar_reductions): Adjust for
4973 vect_analyze_loop_form API change.
4974 * tree-vect-loop.c: Include <functional>.
4975 (vect_analyze_loop_form_1): Rename to vect_analyze_loop_form,
4976 take struct vect_loop_form_info as output parameter and adjust.
4977 (vect_analyze_loop_form): Rename to vect_create_loop_vinfo and
4978 split out call to the original vect_analyze_loop_form_1.
4979 (vect_reanalyze_as_main_loop): Rename to...
4980 (vect_analyze_loop_1): ... this, factor out the call to
4981 vect_analyze_loop_form and generalize to be able to use it twice ...
4982 (vect_analyze_loop): ... here. Perform vect_analyze_loop_form
4985 2021-11-05 Xionghu Luo <luoxhu@linux.ibm.com>
4988 * config/rs6000/fusion.md: Regenerate.
4989 * config/rs6000/genfusion.pl: Fix incorrect clobber constraint.
4991 2021-11-04 Andreas Krebbel <krebbel@linux.ibm.com>
4993 * config/s390/s390.h (STACK_CHECK_MOVING_SP): New macro
4996 2021-11-04 Tamar Christina <tamar.christina@arm.com>
4998 * config/aarch64/aarch64-builtins.c
4999 (aarch64_general_gimple_fold_builtin): Add ashl, sshl, ushl, ashr,
5000 ashr_simd, lshr, lshr_simd.
5001 * config/aarch64/aarch64-simd-builtins.def (lshr): Use USHIFTIMM.
5002 * config/aarch64/arm_neon.h (vshr_n_u8, vshr_n_u16, vshr_n_u32,
5003 vshrq_n_u8, vshrq_n_u16, vshrq_n_u32, vshrq_n_u64): Fix type hack.
5005 2021-11-04 Tamar Christina <tamar.christina@arm.com>
5007 * match.pd: New negate+shift pattern.
5009 2021-11-04 Andrew MacLeod <amacleod@redhat.com>
5011 PR tree-optimization/103079
5012 * gimple-range-gori.cc (gimple_range_calc_op1): Treat undefined as
5014 (gimple_range_calc_op2): Ditto.
5016 2021-11-04 Martin Jambor <mjambor@suse.cz>
5019 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
5020 members remap_with_debug_expressions, m_dead_ssa_debug_equiv,
5021 m_dead_stmt_debug_equiv and prepare_debug_expressions. Added
5022 parameter to mark_dead_statements.
5023 * ipa-param-manipulation.c: Include tree-phinodes.h and cfgexpand.h.
5024 (ipa_param_body_adjustments::mark_dead_statements): New parameter
5025 debugstack, push into it all SSA names used in debug statements,
5026 produce m_dead_ssa_debug_equiv mapping for the removed param.
5027 (replace_with_mapped_expr): New function.
5028 (ipa_param_body_adjustments::remap_with_debug_expressions): Likewise.
5029 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
5030 (ipa_param_body_adjustments::common_initialization): Gather and
5031 process SSA which will be removed but are in debug statements. Simplify.
5032 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
5034 * tree-inline.c (remap_gimple_stmt): Create a debug bind when possible
5035 when avoiding a copy of an unnecessary statement. Remap removed SSA
5036 names in existing debug statements.
5037 (tree_function_versioning): Do not create DEBUG_EXPR_DECL for removed
5038 parameters if we have already done so.
5040 2021-11-04 Jan Hubicka <hubicka@ucw.cz>
5043 * gimple.c (gimple_call_static_chain_flags): Handle case when
5044 nested function does not bind locally.
5046 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5048 * config/aarch64/aarch64.c (aarch64_function_value): Generate
5049 a register rtx for Neon vector-tuple modes.
5050 (aarch64_layout_arg): Likewise.
5052 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5054 * lower-subreg.c (simple_move): Prevent decomposition if
5055 modes are not tieable.
5057 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5058 Richard Sandiford <richard.sandiford@arm.com>
5060 * config/aarch64/aarch64-builtins.c (v2x8qi_UP): Define.
5061 (v2x4hi_UP): Likewise.
5062 (v2x4hf_UP): Likewise.
5063 (v2x4bf_UP): Likewise.
5064 (v2x2si_UP): Likewise.
5065 (v2x2sf_UP): Likewise.
5066 (v2x1di_UP): Likewise.
5067 (v2x1df_UP): Likewise.
5068 (v2x16qi_UP): Likewise.
5069 (v2x8hi_UP): Likewise.
5070 (v2x8hf_UP): Likewise.
5071 (v2x8bf_UP): Likewise.
5072 (v2x4si_UP): Likewise.
5073 (v2x4sf_UP): Likewise.
5074 (v2x2di_UP): Likewise.
5075 (v2x2df_UP): Likewise.
5076 (v3x8qi_UP): Likewise.
5077 (v3x4hi_UP): Likewise.
5078 (v3x4hf_UP): Likewise.
5079 (v3x4bf_UP): Likewise.
5080 (v3x2si_UP): Likewise.
5081 (v3x2sf_UP): Likewise.
5082 (v3x1di_UP): Likewise.
5083 (v3x1df_UP): Likewise.
5084 (v3x16qi_UP): Likewise.
5085 (v3x8hi_UP): Likewise.
5086 (v3x8hf_UP): Likewise.
5087 (v3x8bf_UP): Likewise.
5088 (v3x4si_UP): Likewise.
5089 (v3x4sf_UP): Likewise.
5090 (v3x2di_UP): Likewise.
5091 (v3x2df_UP): Likewise.
5092 (v4x8qi_UP): Likewise.
5093 (v4x4hi_UP): Likewise.
5094 (v4x4hf_UP): Likewise.
5095 (v4x4bf_UP): Likewise.
5096 (v4x2si_UP): Likewise.
5097 (v4x2sf_UP): Likewise.
5098 (v4x1di_UP): Likewise.
5099 (v4x1df_UP): Likewise.
5100 (v4x16qi_UP): Likewise.
5101 (v4x8hi_UP): Likewise.
5102 (v4x8hf_UP): Likewise.
5103 (v4x8bf_UP): Likewise.
5104 (v4x4si_UP): Likewise.
5105 (v4x4sf_UP): Likewise.
5106 (v4x2di_UP): Likewise.
5107 (v4x2df_UP): Likewise.
5108 (TYPES_GETREGP): Delete.
5109 (TYPES_SETREGP): Likewise.
5110 (TYPES_LOADSTRUCT_U): Define.
5111 (TYPES_LOADSTRUCT_P): Likewise.
5112 (TYPES_LOADSTRUCT_LANE_U): Likewise.
5113 (TYPES_LOADSTRUCT_LANE_P): Likewise.
5114 (TYPES_STORE1P): Move for consistency.
5115 (TYPES_STORESTRUCT_U): Define.
5116 (TYPES_STORESTRUCT_P): Likewise.
5117 (TYPES_STORESTRUCT_LANE_U): Likewise.
5118 (TYPES_STORESTRUCT_LANE_P): Likewise.
5119 (aarch64_simd_tuple_types): Define.
5120 (aarch64_lookup_simd_builtin_type): Handle tuple type lookup.
5121 (aarch64_init_simd_builtin_functions): Update frontend lookup
5122 for builtin functions after handling arm_neon.h pragma.
5123 (register_tuple_type): Manually set modes of single-integer
5124 tuple types. Record tuple types.
5125 * config/aarch64/aarch64-modes.def
5126 (ADV_SIMD_D_REG_STRUCT_MODES): Define D-register tuple modes.
5127 (ADV_SIMD_Q_REG_STRUCT_MODES): Define Q-register tuple modes.
5128 (SVE_MODES): Give single-vector modes priority over vector-
5130 (VECTOR_MODES_WITH_PREFIX): Set partial-vector mode order to
5131 be after all single-vector modes.
5132 * config/aarch64/aarch64-simd-builtins.def: Update builtin
5133 generator macros to reflect modifications to the backend
5135 * config/aarch64/aarch64-simd.md (aarch64_simd_ld2<mode>):
5136 Use vector-tuple mode iterator and rename to...
5137 (aarch64_simd_ld2<vstruct_elt>): This.
5138 (aarch64_simd_ld2r<mode>): Use vector-tuple mode iterator and rename to...
5140 (aarch64_simd_ld2r<vstruct_elt>): This.
5141 (aarch64_vec_load_lanesoi_lane<mode>): Use vector-tuple mode
5142 iterator and rename to...
5143 (aarch64_vec_load_lanes<mode>_lane<vstruct_elt>): This.
5144 (vec_load_lanesoi<mode>): Use vector-tuple mode iterator and rename to...
5146 (vec_load_lanes<mode><vstruct_elt>): This.
5147 (aarch64_simd_st2<mode>): Use vector-tuple mode iterator and rename to...
5149 (aarch64_simd_st2<vstruct_elt>): This.
5150 (aarch64_vec_store_lanesoi_lane<mode>): Use vector-tuple mode
5151 iterator and rename to...
5152 (aarch64_vec_store_lanes<mode>_lane<vstruct_elt>): This.
5153 (vec_store_lanesoi<mode>): Use vector-tuple mode iterator and rename to...
5155 (vec_store_lanes<mode><vstruct_elt>): This.
5156 (aarch64_simd_ld3<mode>): Use vector-tuple mode iterator and rename to...
5158 (aarch64_simd_ld3<vstruct_elt>): This.
5159 (aarch64_simd_ld3r<mode>): Use vector-tuple mode iterator and rename to...
5161 (aarch64_simd_ld3r<vstruct_elt>): This.
5162 (aarch64_vec_load_lanesci_lane<mode>): Use vector-tuple mode
5163 iterator and rename to...
5164 (vec_load_lanesci<mode>): This.
5165 (aarch64_simd_st3<mode>): Use vector-tuple mode iterator and rename to...
5167 (aarch64_simd_st3<vstruct_elt>): This.
5168 (aarch64_vec_store_lanesci_lane<mode>): Use vector-tuple mode
5169 iterator and rename to...
5170 (vec_store_lanesci<mode>): This.
5171 (aarch64_simd_ld4<mode>): Use vector-tuple mode iterator and rename to...
5173 (aarch64_simd_ld4<vstruct_elt>): This.
5174 (aarch64_simd_ld4r<mode>): Use vector-tuple mode iterator and rename to...
5176 (aarch64_simd_ld4r<vstruct_elt>): This.
5177 (aarch64_vec_load_lanesxi_lane<mode>): Use vector-tuple mode
5178 iterator and rename to...
5179 (vec_load_lanesxi<mode>): This.
5180 (aarch64_simd_st4<mode>): Use vector-tuple mode iterator and rename to...
5182 (aarch64_simd_st4<vstruct_elt>): This.
5183 (aarch64_vec_store_lanesxi_lane<mode>): Use vector-tuple mode
5184 iterator and rename to...
5185 (vec_store_lanesxi<mode>): This.
5186 (mov<mode>): Define for Neon vector-tuple modes.
5187 (aarch64_ld1x3<VALLDIF:mode>): Use vector-tuple mode iterator and rename to...
5189 (aarch64_ld1x3<vstruct_elt>): This.
5190 (aarch64_ld1_x3_<mode>): Use vector-tuple mode iterator and rename to...
5192 (aarch64_ld1_x3_<vstruct_elt>): This.
5193 (aarch64_ld1x4<VALLDIF:mode>): Use vector-tuple mode iterator and rename to...
5195 (aarch64_ld1x4<vstruct_elt>): This.
5196 (aarch64_ld1_x4_<mode>): Use vector-tuple mode iterator and rename to...
5198 (aarch64_ld1_x4_<vstruct_elt>): This.
5199 (aarch64_st1x2<VALLDIF:mode>): Use vector-tuple mode iterator and rename to...
5201 (aarch64_st1x2<vstruct_elt>): This.
5202 (aarch64_st1_x2_<mode>): Use vector-tuple mode iterator and rename to...
5204 (aarch64_st1_x2_<vstruct_elt>): This.
5205 (aarch64_st1x3<VALLDIF:mode>): Use vector-tuple mode iterator and rename to...
5207 (aarch64_st1x3<vstruct_elt>): This.
5208 (aarch64_st1_x3_<mode>): Use vector-tuple mode iterator and rename to...
5210 (aarch64_st1_x3_<vstruct_elt>): This.
5211 (aarch64_st1x4<VALLDIF:mode>): Use vector-tuple mode iterator and rename to...
5213 (aarch64_st1x4<vstruct_elt>): This.
5214 (aarch64_st1_x4_<mode>): Use vector-tuple mode iterator and rename to...
5216 (aarch64_st1_x4_<vstruct_elt>): This.
5217 (*aarch64_mov<mode>): Define for vector-tuple modes.
5218 (*aarch64_be_mov<mode>): Likewise.
5219 (aarch64_ld<VSTRUCT:nregs>r<VALLDIF:mode>): Use vector-tuple
5220 mode iterator and rename to...
5221 (aarch64_ld<nregs>r<vstruct_elt>): This.
5222 (aarch64_ld2<mode>_dreg): Use vector-tuple mode iterator and rename to...
5224 (aarch64_ld2<vstruct_elt>_dreg): This.
5225 (aarch64_ld3<mode>_dreg): Use vector-tuple mode iterator and rename to...
5227 (aarch64_ld3<vstruct_elt>_dreg): This.
5228 (aarch64_ld4<mode>_dreg): Use vector-tuple mode iterator and rename to...
5230 (aarch64_ld4<vstruct_elt>_dreg): This.
5231 (aarch64_ld<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
5232 iterator and rename to...
5233 (aarch64_ld<nregs><vstruct_elt>): Use vector-tuple mode
5234 iterator and rename to...
5235 (aarch64_ld<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
5236 (aarch64_ld1x2<VQ:mode>): Delete.
5237 (aarch64_ld1x2<VDC:mode>): Use vector-tuple mode iterator and rename to...
5239 (aarch64_ld1x2<vstruct_elt>): This.
5240 (aarch64_ld<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
5241 tuple mode iterator and rename to...
5242 (aarch64_ld<nregs>_lane<vstruct_elt>): This.
5243 (aarch64_get_dreg<VSTRUCT:mode><VDC:mode>): Delete.
5244 (aarch64_get_qreg<VSTRUCT:mode><VQ:mode>): Likewise.
5245 (aarch64_st2<mode>_dreg): Use vector-tuple mode iterator and rename to...
5247 (aarch64_st2<vstruct_elt>_dreg): This.
5248 (aarch64_st3<mode>_dreg): Use vector-tuple mode iterator and rename to...
5250 (aarch64_st3<vstruct_elt>_dreg): This.
5251 (aarch64_st4<mode>_dreg): Use vector-tuple mode iterator and rename to...
5253 (aarch64_st4<vstruct_elt>_dreg): This.
5254 (aarch64_st<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
5255 iterator and rename to...
5256 (aarch64_st<nregs><vstruct_elt>): This.
5257 (aarch64_st<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
5258 iterator and rename to aarch64_st<nregs><vstruct_elt>.
5259 (aarch64_st<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
5260 tuple mode iterator and rename to...
5261 (aarch64_st<nregs>_lane<vstruct_elt>): This.
5262 (aarch64_set_qreg<VSTRUCT:mode><VQ:mode>): Delete.
5263 (aarch64_simd_ld1<mode>_x2): Use vector-tuple mode iterator and rename to...
5265 (aarch64_simd_ld1<vstruct_elt>_x2): This.
5266 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
5267 Refactor to include new vector-tuple modes.
5268 (aarch64_classify_vector_mode): Add cases for new vector-
5270 (aarch64_advsimd_partial_struct_mode_p): Define.
5271 (aarch64_advsimd_full_struct_mode_p): Likewise.
5272 (aarch64_advsimd_vector_array_mode): Likewise.
5273 (aarch64_sve_data_mode): Change location in file.
5274 (aarch64_array_mode): Handle case of Neon vector-tuple modes.
5275 (aarch64_hard_regno_nregs): Handle case of partial Neon
5277 (aarch64_classify_address): Refactor to include handling of
5278 Neon vector-tuple modes.
5279 (aarch64_print_operand): Print "d" for "%R" for a partial
5280 Neon vector structure.
5281 (aarch64_expand_vec_perm_1): Use new vector-tuple mode.
5282 (aarch64_modes_tieable_p): Prevent tying Neon partial struct
5283 modes with scalar machine modes larger than 8 bytes.
5284 (aarch64_can_change_mode_class): Don't allow changes between
5285 partial and full Neon vector-structure modes.
5286 * config/aarch64/arm_neon.h (vst2_lane_f16): Use updated
5287 builtin and remove boiler-plate code for opaque mode.
5288 (vst2_lane_f32): Likewise.
5289 (vst2_lane_f64): Likewise.
5290 (vst2_lane_p8): Likewise.
5291 (vst2_lane_p16): Likewise.
5292 (vst2_lane_p64): Likewise.
5293 (vst2_lane_s8): Likewise.
5294 (vst2_lane_s16): Likewise.
5295 (vst2_lane_s32): Likewise.
5296 (vst2_lane_s64): Likewise.
5297 (vst2_lane_u8): Likewise.
5298 (vst2_lane_u16): Likewise.
5299 (vst2_lane_u32): Likewise.
5300 (vst2_lane_u64): Likewise.
5301 (vst2q_lane_f16): Likewise.
5302 (vst2q_lane_f32): Likewise.
5303 (vst2q_lane_f64): Likewise.
5304 (vst2q_lane_p8): Likewise.
5305 (vst2q_lane_p16): Likewise.
5306 (vst2q_lane_p64): Likewise.
5307 (vst2q_lane_s8): Likewise.
5308 (vst2q_lane_s16): Likewise.
5309 (vst2q_lane_s32): Likewise.
5310 (vst2q_lane_s64): Likewise.
5311 (vst2q_lane_u8): Likewise.
5312 (vst2q_lane_u16): Likewise.
5313 (vst2q_lane_u32): Likewise.
5314 (vst2q_lane_u64): Likewise.
5315 (vst3_lane_f16): Likewise.
5316 (vst3_lane_f32): Likewise.
5317 (vst3_lane_f64): Likewise.
5318 (vst3_lane_p8): Likewise.
5319 (vst3_lane_p16): Likewise.
5320 (vst3_lane_p64): Likewise.
5321 (vst3_lane_s8): Likewise.
5322 (vst3_lane_s16): Likewise.
5323 (vst3_lane_s32): Likewise.
5324 (vst3_lane_s64): Likewise.
5325 (vst3_lane_u8): Likewise.
5326 (vst3_lane_u16): Likewise.
5327 (vst3_lane_u32): Likewise.
5328 (vst3_lane_u64): Likewise.
5329 (vst3q_lane_f16): Likewise.
5330 (vst3q_lane_f32): Likewise.
5331 (vst3q_lane_f64): Likewise.
5332 (vst3q_lane_p8): Likewise.
5333 (vst3q_lane_p16): Likewise.
5334 (vst3q_lane_p64): Likewise.
5335 (vst3q_lane_s8): Likewise.
5336 (vst3q_lane_s16): Likewise.
5337 (vst3q_lane_s32): Likewise.
5338 (vst3q_lane_s64): Likewise.
5339 (vst3q_lane_u8): Likewise.
5340 (vst3q_lane_u16): Likewise.
5341 (vst3q_lane_u32): Likewise.
5342 (vst3q_lane_u64): Likewise.
5343 (vst4_lane_f16): Likewise.
5344 (vst4_lane_f32): Likewise.
5345 (vst4_lane_f64): Likewise.
5346 (vst4_lane_p8): Likewise.
5347 (vst4_lane_p16): Likewise.
5348 (vst4_lane_p64): Likewise.
5349 (vst4_lane_s8): Likewise.
5350 (vst4_lane_s16): Likewise.
5351 (vst4_lane_s32): Likewise.
5352 (vst4_lane_s64): Likewise.
5353 (vst4_lane_u8): Likewise.
5354 (vst4_lane_u16): Likewise.
5355 (vst4_lane_u32): Likewise.
5356 (vst4_lane_u64): Likewise.
5357 (vst4q_lane_f16): Likewise.
5358 (vst4q_lane_f32): Likewise.
5359 (vst4q_lane_f64): Likewise.
5360 (vst4q_lane_p8): Likewise.
5361 (vst4q_lane_p16): Likewise.
5362 (vst4q_lane_p64): Likewise.
5363 (vst4q_lane_s8): Likewise.
5364 (vst4q_lane_s16): Likewise.
5365 (vst4q_lane_s32): Likewise.
5366 (vst4q_lane_s64): Likewise.
5367 (vst4q_lane_u8): Likewise.
5368 (vst4q_lane_u16): Likewise.
5369 (vst4q_lane_u32): Likewise.
5370 (vst4q_lane_u64): Likewise.
5371 (vtbl3_s8): Likewise.
5372 (vtbl3_u8): Likewise.
5373 (vtbl3_p8): Likewise.
5374 (vtbl4_s8): Likewise.
5375 (vtbl4_u8): Likewise.
5376 (vtbl4_p8): Likewise.
5377 (vld1_u8_x3): Likewise.
5378 (vld1_s8_x3): Likewise.
5379 (vld1_u16_x3): Likewise.
5380 (vld1_s16_x3): Likewise.
5381 (vld1_u32_x3): Likewise.
5382 (vld1_s32_x3): Likewise.
5383 (vld1_u64_x3): Likewise.
5384 (vld1_s64_x3): Likewise.
5385 (vld1_f16_x3): Likewise.
5386 (vld1_f32_x3): Likewise.
5387 (vld1_f64_x3): Likewise.
5388 (vld1_p8_x3): Likewise.
5389 (vld1_p16_x3): Likewise.
5390 (vld1_p64_x3): Likewise.
5391 (vld1q_u8_x3): Likewise.
5392 (vld1q_s8_x3): Likewise.
5393 (vld1q_u16_x3): Likewise.
5394 (vld1q_s16_x3): Likewise.
5395 (vld1q_u32_x3): Likewise.
5396 (vld1q_s32_x3): Likewise.
5397 (vld1q_u64_x3): Likewise.
5398 (vld1q_s64_x3): Likewise.
5399 (vld1q_f16_x3): Likewise.
5400 (vld1q_f32_x3): Likewise.
5401 (vld1q_f64_x3): Likewise.
5402 (vld1q_p8_x3): Likewise.
5403 (vld1q_p16_x3): Likewise.
5404 (vld1q_p64_x3): Likewise.
5405 (vld1_u8_x2): Likewise.
5406 (vld1_s8_x2): Likewise.
5407 (vld1_u16_x2): Likewise.
5408 (vld1_s16_x2): Likewise.
5409 (vld1_u32_x2): Likewise.
5410 (vld1_s32_x2): Likewise.
5411 (vld1_u64_x2): Likewise.
5412 (vld1_s64_x2): Likewise.
5413 (vld1_f16_x2): Likewise.
5414 (vld1_f32_x2): Likewise.
5415 (vld1_f64_x2): Likewise.
5416 (vld1_p8_x2): Likewise.
5417 (vld1_p16_x2): Likewise.
5418 (vld1_p64_x2): Likewise.
5419 (vld1q_u8_x2): Likewise.
5420 (vld1q_s8_x2): Likewise.
5421 (vld1q_u16_x2): Likewise.
5422 (vld1q_s16_x2): Likewise.
5423 (vld1q_u32_x2): Likewise.
5424 (vld1q_s32_x2): Likewise.
5425 (vld1q_u64_x2): Likewise.
5426 (vld1q_s64_x2): Likewise.
5427 (vld1q_f16_x2): Likewise.
5428 (vld1q_f32_x2): Likewise.
5429 (vld1q_f64_x2): Likewise.
5430 (vld1q_p8_x2): Likewise.
5431 (vld1q_p16_x2): Likewise.
5432 (vld1q_p64_x2): Likewise.
5433 (vld1_s8_x4): Likewise.
5434 (vld1q_s8_x4): Likewise.
5435 (vld1_s16_x4): Likewise.
5436 (vld1q_s16_x4): Likewise.
5437 (vld1_s32_x4): Likewise.
5438 (vld1q_s32_x4): Likewise.
5439 (vld1_u8_x4): Likewise.
5440 (vld1q_u8_x4): Likewise.
5441 (vld1_u16_x4): Likewise.
5442 (vld1q_u16_x4): Likewise.
5443 (vld1_u32_x4): Likewise.
5444 (vld1q_u32_x4): Likewise.
5445 (vld1_f16_x4): Likewise.
5446 (vld1q_f16_x4): Likewise.
5447 (vld1_f32_x4): Likewise.
5448 (vld1q_f32_x4): Likewise.
5449 (vld1_p8_x4): Likewise.
5450 (vld1q_p8_x4): Likewise.
5451 (vld1_p16_x4): Likewise.
5452 (vld1q_p16_x4): Likewise.
5453 (vld1_s64_x4): Likewise.
5454 (vld1_u64_x4): Likewise.
5455 (vld1_p64_x4): Likewise.
5456 (vld1q_s64_x4): Likewise.
5457 (vld1q_u64_x4): Likewise.
5458 (vld1q_p64_x4): Likewise.
5459 (vld1_f64_x4): Likewise.
5460 (vld1q_f64_x4): Likewise.
5461 (vld2_s64): Likewise.
5462 (vld2_u64): Likewise.
5463 (vld2_f64): Likewise.
5464 (vld2_s8): Likewise.
5465 (vld2_p8): Likewise.
5466 (vld2_p64): Likewise.
5467 (vld2_s16): Likewise.
5468 (vld2_p16): Likewise.
5469 (vld2_s32): Likewise.
5470 (vld2_u8): Likewise.
5471 (vld2_u16): Likewise.
5472 (vld2_u32): Likewise.
5473 (vld2_f16): Likewise.
5474 (vld2_f32): Likewise.
5475 (vld2q_s8): Likewise.
5476 (vld2q_p8): Likewise.
5477 (vld2q_s16): Likewise.
5478 (vld2q_p16): Likewise.
5479 (vld2q_p64): Likewise.
5480 (vld2q_s32): Likewise.
5481 (vld2q_s64): Likewise.
5482 (vld2q_u8): Likewise.
5483 (vld2q_u16): Likewise.
5484 (vld2q_u32): Likewise.
5485 (vld2q_u64): Likewise.
5486 (vld2q_f16): Likewise.
5487 (vld2q_f32): Likewise.
5488 (vld2q_f64): Likewise.
5489 (vld3_s64): Likewise.
5490 (vld3_u64): Likewise.
5491 (vld3_f64): Likewise.
5492 (vld3_s8): Likewise.
5493 (vld3_p8): Likewise.
5494 (vld3_s16): Likewise.
5495 (vld3_p16): Likewise.
5496 (vld3_s32): Likewise.
5497 (vld3_u8): Likewise.
5498 (vld3_u16): Likewise.
5499 (vld3_u32): Likewise.
5500 (vld3_f16): Likewise.
5501 (vld3_f32): Likewise.
5502 (vld3_p64): Likewise.
5503 (vld3q_s8): Likewise.
5504 (vld3q_p8): Likewise.
5505 (vld3q_s16): Likewise.
5506 (vld3q_p16): Likewise.
5507 (vld3q_s32): Likewise.
5508 (vld3q_s64): Likewise.
5509 (vld3q_u8): Likewise.
5510 (vld3q_u16): Likewise.
5511 (vld3q_u32): Likewise.
5512 (vld3q_u64): Likewise.
5513 (vld3q_f16): Likewise.
5514 (vld3q_f32): Likewise.
5515 (vld3q_f64): Likewise.
5516 (vld3q_p64): Likewise.
5517 (vld4_s64): Likewise.
5518 (vld4_u64): Likewise.
5519 (vld4_f64): Likewise.
5520 (vld4_s8): Likewise.
5521 (vld4_p8): Likewise.
5522 (vld4_s16): Likewise.
5523 (vld4_p16): Likewise.
5524 (vld4_s32): Likewise.
5525 (vld4_u8): Likewise.
5526 (vld4_u16): Likewise.
5527 (vld4_u32): Likewise.
5528 (vld4_f16): Likewise.
5529 (vld4_f32): Likewise.
5530 (vld4_p64): Likewise.
5531 (vld4q_s8): Likewise.
5532 (vld4q_p8): Likewise.
5533 (vld4q_s16): Likewise.
5534 (vld4q_p16): Likewise.
5535 (vld4q_s32): Likewise.
5536 (vld4q_s64): Likewise.
5537 (vld4q_u8): Likewise.
5538 (vld4q_u16): Likewise.
5539 (vld4q_u32): Likewise.
5540 (vld4q_u64): Likewise.
5541 (vld4q_f16): Likewise.
5542 (vld4q_f32): Likewise.
5543 (vld4q_f64): Likewise.
5544 (vld4q_p64): Likewise.
5545 (vld2_dup_s8): Likewise.
5546 (vld2_dup_s16): Likewise.
5547 (vld2_dup_s32): Likewise.
5548 (vld2_dup_f16): Likewise.
5549 (vld2_dup_f32): Likewise.
5550 (vld2_dup_f64): Likewise.
5551 (vld2_dup_u8): Likewise.
5552 (vld2_dup_u16): Likewise.
5553 (vld2_dup_u32): Likewise.
5554 (vld2_dup_p8): Likewise.
5555 (vld2_dup_p16): Likewise.
5556 (vld2_dup_p64): Likewise.
5557 (vld2_dup_s64): Likewise.
5558 (vld2_dup_u64): Likewise.
5559 (vld2q_dup_s8): Likewise.
5560 (vld2q_dup_p8): Likewise.
5561 (vld2q_dup_s16): Likewise.
5562 (vld2q_dup_p16): Likewise.
5563 (vld2q_dup_s32): Likewise.
5564 (vld2q_dup_s64): Likewise.
5565 (vld2q_dup_u8): Likewise.
5566 (vld2q_dup_u16): Likewise.
5567 (vld2q_dup_u32): Likewise.
5568 (vld2q_dup_u64): Likewise.
5569 (vld2q_dup_f16): Likewise.
5570 (vld2q_dup_f32): Likewise.
5571 (vld2q_dup_f64): Likewise.
5572 (vld2q_dup_p64): Likewise.
5573 (vld3_dup_s64): Likewise.
5574 (vld3_dup_u64): Likewise.
5575 (vld3_dup_f64): Likewise.
5576 (vld3_dup_s8): Likewise.
5577 (vld3_dup_p8): Likewise.
5578 (vld3_dup_s16): Likewise.
5579 (vld3_dup_p16): Likewise.
5580 (vld3_dup_s32): Likewise.
5581 (vld3_dup_u8): Likewise.
5582 (vld3_dup_u16): Likewise.
5583 (vld3_dup_u32): Likewise.
5584 (vld3_dup_f16): Likewise.
5585 (vld3_dup_f32): Likewise.
5586 (vld3_dup_p64): Likewise.
5587 (vld3q_dup_s8): Likewise.
5588 (vld3q_dup_p8): Likewise.
5589 (vld3q_dup_s16): Likewise.
5590 (vld3q_dup_p16): Likewise.
5591 (vld3q_dup_s32): Likewise.
5592 (vld3q_dup_s64): Likewise.
5593 (vld3q_dup_u8): Likewise.
5594 (vld3q_dup_u16): Likewise.
5595 (vld3q_dup_u32): Likewise.
5596 (vld3q_dup_u64): Likewise.
5597 (vld3q_dup_f16): Likewise.
5598 (vld3q_dup_f32): Likewise.
5599 (vld3q_dup_f64): Likewise.
5600 (vld3q_dup_p64): Likewise.
5601 (vld4_dup_s64): Likewise.
5602 (vld4_dup_u64): Likewise.
5603 (vld4_dup_f64): Likewise.
5604 (vld4_dup_s8): Likewise.
5605 (vld4_dup_p8): Likewise.
5606 (vld4_dup_s16): Likewise.
5607 (vld4_dup_p16): Likewise.
5608 (vld4_dup_s32): Likewise.
5609 (vld4_dup_u8): Likewise.
5610 (vld4_dup_u16): Likewise.
5611 (vld4_dup_u32): Likewise.
5612 (vld4_dup_f16): Likewise.
5613 (vld4_dup_f32): Likewise.
5614 (vld4_dup_p64): Likewise.
5615 (vld4q_dup_s8): Likewise.
5616 (vld4q_dup_p8): Likewise.
5617 (vld4q_dup_s16): Likewise.
5618 (vld4q_dup_p16): Likewise.
5619 (vld4q_dup_s32): Likewise.
5620 (vld4q_dup_s64): Likewise.
5621 (vld4q_dup_u8): Likewise.
5622 (vld4q_dup_u16): Likewise.
5623 (vld4q_dup_u32): Likewise.
5624 (vld4q_dup_u64): Likewise.
5625 (vld4q_dup_f16): Likewise.
5626 (vld4q_dup_f32): Likewise.
5627 (vld4q_dup_f64): Likewise.
5628 (vld4q_dup_p64): Likewise.
5629 (vld2_lane_u8): Likewise.
5630 (vld2_lane_u16): Likewise.
5631 (vld2_lane_u32): Likewise.
5632 (vld2_lane_u64): Likewise.
5633 (vld2_lane_s8): Likewise.
5634 (vld2_lane_s16): Likewise.
5635 (vld2_lane_s32): Likewise.
5636 (vld2_lane_s64): Likewise.
5637 (vld2_lane_f16): Likewise.
5638 (vld2_lane_f32): Likewise.
5639 (vld2_lane_f64): Likewise.
5640 (vld2_lane_p8): Likewise.
5641 (vld2_lane_p16): Likewise.
5642 (vld2_lane_p64): Likewise.
5643 (vld2q_lane_u8): Likewise.
5644 (vld2q_lane_u16): Likewise.
5645 (vld2q_lane_u32): Likewise.
5646 (vld2q_lane_u64): Likewise.
5647 (vld2q_lane_s8): Likewise.
5648 (vld2q_lane_s16): Likewise.
5649 (vld2q_lane_s32): Likewise.
5650 (vld2q_lane_s64): Likewise.
5651 (vld2q_lane_f16): Likewise.
5652 (vld2q_lane_f32): Likewise.
5653 (vld2q_lane_f64): Likewise.
5654 (vld2q_lane_p8): Likewise.
5655 (vld2q_lane_p16): Likewise.
5656 (vld2q_lane_p64): Likewise.
5657 (vld3_lane_u8): Likewise.
5658 (vld3_lane_u16): Likewise.
5659 (vld3_lane_u32): Likewise.
5660 (vld3_lane_u64): Likewise.
5661 (vld3_lane_s8): Likewise.
5662 (vld3_lane_s16): Likewise.
5663 (vld3_lane_s32): Likewise.
5664 (vld3_lane_s64): Likewise.
5665 (vld3_lane_f16): Likewise.
5666 (vld3_lane_f32): Likewise.
5667 (vld3_lane_f64): Likewise.
5668 (vld3_lane_p8): Likewise.
5669 (vld3_lane_p16): Likewise.
5670 (vld3_lane_p64): Likewise.
5671 (vld3q_lane_u8): Likewise.
5672 (vld3q_lane_u16): Likewise.
5673 (vld3q_lane_u32): Likewise.
5674 (vld3q_lane_u64): Likewise.
5675 (vld3q_lane_s8): Likewise.
5676 (vld3q_lane_s16): Likewise.
5677 (vld3q_lane_s32): Likewise.
5678 (vld3q_lane_s64): Likewise.
5679 (vld3q_lane_f16): Likewise.
5680 (vld3q_lane_f32): Likewise.
5681 (vld3q_lane_f64): Likewise.
5682 (vld3q_lane_p8): Likewise.
5683 (vld3q_lane_p16): Likewise.
5684 (vld3q_lane_p64): Likewise.
5685 (vld4_lane_u8): Likewise.
5686 (vld4_lane_u16): Likewise.
5687 (vld4_lane_u32): Likewise.
5688 (vld4_lane_u64): Likewise.
5689 (vld4_lane_s8): Likewise.
5690 (vld4_lane_s16): Likewise.
5691 (vld4_lane_s32): Likewise.
5692 (vld4_lane_s64): Likewise.
5693 (vld4_lane_f16): Likewise.
5694 (vld4_lane_f32): Likewise.
5695 (vld4_lane_f64): Likewise.
5696 (vld4_lane_p8): Likewise.
5697 (vld4_lane_p16): Likewise.
5698 (vld4_lane_p64): Likewise.
5699 (vld4q_lane_u8): Likewise.
5700 (vld4q_lane_u16): Likewise.
5701 (vld4q_lane_u32): Likewise.
5702 (vld4q_lane_u64): Likewise.
5703 (vld4q_lane_s8): Likewise.
5704 (vld4q_lane_s16): Likewise.
5705 (vld4q_lane_s32): Likewise.
5706 (vld4q_lane_s64): Likewise.
5707 (vld4q_lane_f16): Likewise.
5708 (vld4q_lane_f32): Likewise.
5709 (vld4q_lane_f64): Likewise.
5710 (vld4q_lane_p8): Likewise.
5711 (vld4q_lane_p16): Likewise.
5712 (vld4q_lane_p64): Likewise.
5713 (vqtbl2_s8): Likewise.
5714 (vqtbl2_u8): Likewise.
5715 (vqtbl2_p8): Likewise.
5716 (vqtbl2q_s8): Likewise.
5717 (vqtbl2q_u8): Likewise.
5718 (vqtbl2q_p8): Likewise.
5719 (vqtbl3_s8): Likewise.
5720 (vqtbl3_u8): Likewise.
5721 (vqtbl3_p8): Likewise.
5722 (vqtbl3q_s8): Likewise.
5723 (vqtbl3q_u8): Likewise.
5724 (vqtbl3q_p8): Likewise.
5725 (vqtbl4_s8): Likewise.
5726 (vqtbl4_u8): Likewise.
5727 (vqtbl4_p8): Likewise.
5728 (vqtbl4q_s8): Likewise.
5729 (vqtbl4q_u8): Likewise.
5730 (vqtbl4q_p8): Likewise.
5731 (vqtbx2_s8): Likewise.
5732 (vqtbx2_u8): Likewise.
5733 (vqtbx2_p8): Likewise.
5734 (vqtbx2q_s8): Likewise.
5735 (vqtbx2q_u8): Likewise.
5736 (vqtbx2q_p8): Likewise.
5737 (vqtbx3_s8): Likewise.
5738 (vqtbx3_u8): Likewise.
5739 (vqtbx3_p8): Likewise.
5740 (vqtbx3q_s8): Likewise.
5741 (vqtbx3q_u8): Likewise.
5742 (vqtbx3q_p8): Likewise.
5743 (vqtbx4_s8): Likewise.
5744 (vqtbx4_u8): Likewise.
5745 (vqtbx4_p8): Likewise.
5746 (vqtbx4q_s8): Likewise.
5747 (vqtbx4q_u8): Likewise.
5748 (vqtbx4q_p8): Likewise.
5749 (vst1_s64_x2): Likewise.
5750 (vst1_u64_x2): Likewise.
5751 (vst1_f64_x2): Likewise.
5752 (vst1_s8_x2): Likewise.
5753 (vst1_p8_x2): Likewise.
5754 (vst1_s16_x2): Likewise.
5755 (vst1_p16_x2): Likewise.
5756 (vst1_s32_x2): Likewise.
5757 (vst1_u8_x2): Likewise.
5758 (vst1_u16_x2): Likewise.
5759 (vst1_u32_x2): Likewise.
5760 (vst1_f16_x2): Likewise.
5761 (vst1_f32_x2): Likewise.
5762 (vst1_p64_x2): Likewise.
5763 (vst1q_s8_x2): Likewise.
5764 (vst1q_p8_x2): Likewise.
5765 (vst1q_s16_x2): Likewise.
5766 (vst1q_p16_x2): Likewise.
5767 (vst1q_s32_x2): Likewise.
5768 (vst1q_s64_x2): Likewise.
5769 (vst1q_u8_x2): Likewise.
5770 (vst1q_u16_x2): Likewise.
5771 (vst1q_u32_x2): Likewise.
5772 (vst1q_u64_x2): Likewise.
5773 (vst1q_f16_x2): Likewise.
5774 (vst1q_f32_x2): Likewise.
5775 (vst1q_f64_x2): Likewise.
5776 (vst1q_p64_x2): Likewise.
5777 (vst1_s64_x3): Likewise.
5778 (vst1_u64_x3): Likewise.
5779 (vst1_f64_x3): Likewise.
5780 (vst1_s8_x3): Likewise.
5781 (vst1_p8_x3): Likewise.
5782 (vst1_s16_x3): Likewise.
5783 (vst1_p16_x3): Likewise.
5784 (vst1_s32_x3): Likewise.
5785 (vst1_u8_x3): Likewise.
5786 (vst1_u16_x3): Likewise.
5787 (vst1_u32_x3): Likewise.
5788 (vst1_f16_x3): Likewise.
5789 (vst1_f32_x3): Likewise.
5790 (vst1_p64_x3): Likewise.
5791 (vst1q_s8_x3): Likewise.
5792 (vst1q_p8_x3): Likewise.
5793 (vst1q_s16_x3): Likewise.
5794 (vst1q_p16_x3): Likewise.
5795 (vst1q_s32_x3): Likewise.
5796 (vst1q_s64_x3): Likewise.
5797 (vst1q_u8_x3): Likewise.
5798 (vst1q_u16_x3): Likewise.
5799 (vst1q_u32_x3): Likewise.
5800 (vst1q_u64_x3): Likewise.
5801 (vst1q_f16_x3): Likewise.
5802 (vst1q_f32_x3): Likewise.
5803 (vst1q_f64_x3): Likewise.
5804 (vst1q_p64_x3): Likewise.
5805 (vst1_s8_x4): Likewise.
5806 (vst1q_s8_x4): Likewise.
5807 (vst1_s16_x4): Likewise.
5808 (vst1q_s16_x4): Likewise.
5809 (vst1_s32_x4): Likewise.
5810 (vst1q_s32_x4): Likewise.
5811 (vst1_u8_x4): Likewise.
5812 (vst1q_u8_x4): Likewise.
5813 (vst1_u16_x4): Likewise.
5814 (vst1q_u16_x4): Likewise.
5815 (vst1_u32_x4): Likewise.
5816 (vst1q_u32_x4): Likewise.
5817 (vst1_f16_x4): Likewise.
5818 (vst1q_f16_x4): Likewise.
5819 (vst1_f32_x4): Likewise.
5820 (vst1q_f32_x4): Likewise.
5821 (vst1_p8_x4): Likewise.
5822 (vst1q_p8_x4): Likewise.
5823 (vst1_p16_x4): Likewise.
5824 (vst1q_p16_x4): Likewise.
5825 (vst1_s64_x4): Likewise.
5826 (vst1_u64_x4): Likewise.
5827 (vst1_p64_x4): Likewise.
5828 (vst1q_s64_x4): Likewise.
5829 (vst1q_u64_x4): Likewise.
5830 (vst1q_p64_x4): Likewise.
5831 (vst1_f64_x4): Likewise.
5832 (vst1q_f64_x4): Likewise.
5833 (vst2_s64): Likewise.
5834 (vst2_u64): Likewise.
5835 (vst2_f64): Likewise.
5836 (vst2_s8): Likewise.
5837 (vst2_p8): Likewise.
5838 (vst2_s16): Likewise.
5839 (vst2_p16): Likewise.
5840 (vst2_s32): Likewise.
5841 (vst2_u8): Likewise.
5842 (vst2_u16): Likewise.
5843 (vst2_u32): Likewise.
5844 (vst2_f16): Likewise.
5845 (vst2_f32): Likewise.
5846 (vst2_p64): Likewise.
5847 (vst2q_s8): Likewise.
5848 (vst2q_p8): Likewise.
5849 (vst2q_s16): Likewise.
5850 (vst2q_p16): Likewise.
5851 (vst2q_s32): Likewise.
5852 (vst2q_s64): Likewise.
5853 (vst2q_u8): Likewise.
5854 (vst2q_u16): Likewise.
5855 (vst2q_u32): Likewise.
5856 (vst2q_u64): Likewise.
5857 (vst2q_f16): Likewise.
5858 (vst2q_f32): Likewise.
5859 (vst2q_f64): Likewise.
5860 (vst2q_p64): Likewise.
5861 (vst3_s64): Likewise.
5862 (vst3_u64): Likewise.
5863 (vst3_f64): Likewise.
5864 (vst3_s8): Likewise.
5865 (vst3_p8): Likewise.
5866 (vst3_s16): Likewise.
5867 (vst3_p16): Likewise.
5868 (vst3_s32): Likewise.
5869 (vst3_u8): Likewise.
5870 (vst3_u16): Likewise.
5871 (vst3_u32): Likewise.
5872 (vst3_f16): Likewise.
5873 (vst3_f32): Likewise.
5874 (vst3_p64): Likewise.
5875 (vst3q_s8): Likewise.
5876 (vst3q_p8): Likewise.
5877 (vst3q_s16): Likewise.
5878 (vst3q_p16): Likewise.
5879 (vst3q_s32): Likewise.
5880 (vst3q_s64): Likewise.
5881 (vst3q_u8): Likewise.
5882 (vst3q_u16): Likewise.
5883 (vst3q_u32): Likewise.
5884 (vst3q_u64): Likewise.
5885 (vst3q_f16): Likewise.
5886 (vst3q_f32): Likewise.
5887 (vst3q_f64): Likewise.
5888 (vst3q_p64): Likewise.
5889 (vst4_s64): Likewise.
5890 (vst4_u64): Likewise.
5891 (vst4_f64): Likewise.
5892 (vst4_s8): Likewise.
5893 (vst4_p8): Likewise.
5894 (vst4_s16): Likewise.
5895 (vst4_p16): Likewise.
5896 (vst4_s32): Likewise.
5897 (vst4_u8): Likewise.
5898 (vst4_u16): Likewise.
5899 (vst4_u32): Likewise.
5900 (vst4_f16): Likewise.
5901 (vst4_f32): Likewise.
5902 (vst4_p64): Likewise.
5903 (vst4q_s8): Likewise.
5904 (vst4q_p8): Likewise.
5905 (vst4q_s16): Likewise.
5906 (vst4q_p16): Likewise.
5907 (vst4q_s32): Likewise.
5908 (vst4q_s64): Likewise.
5909 (vst4q_u8): Likewise.
5910 (vst4q_u16): Likewise.
5911 (vst4q_u32): Likewise.
5912 (vst4q_u64): Likewise.
5913 (vst4q_f16): Likewise.
5914 (vst4q_f32): Likewise.
5915 (vst4q_f64): Likewise.
5916 (vst4q_p64): Likewise.
5917 (vtbx4_s8): Likewise.
5918 (vtbx4_u8): Likewise.
5919 (vtbx4_p8): Likewise.
5920 (vld1_bf16_x2): Likewise.
5921 (vld1q_bf16_x2): Likewise.
5922 (vld1_bf16_x3): Likewise.
5923 (vld1q_bf16_x3): Likewise.
5924 (vld1_bf16_x4): Likewise.
5925 (vld1q_bf16_x4): Likewise.
5926 (vld2_bf16): Likewise.
5927 (vld2q_bf16): Likewise.
5928 (vld2_dup_bf16): Likewise.
5929 (vld2q_dup_bf16): Likewise.
5930 (vld3_bf16): Likewise.
5931 (vld3q_bf16): Likewise.
5932 (vld3_dup_bf16): Likewise.
5933 (vld3q_dup_bf16): Likewise.
5934 (vld4_bf16): Likewise.
5935 (vld4q_bf16): Likewise.
5936 (vld4_dup_bf16): Likewise.
5937 (vld4q_dup_bf16): Likewise.
5938 (vst1_bf16_x2): Likewise.
5939 (vst1q_bf16_x2): Likewise.
5940 (vst1_bf16_x3): Likewise.
5941 (vst1q_bf16_x3): Likewise.
5942 (vst1_bf16_x4): Likewise.
5943 (vst1q_bf16_x4): Likewise.
5944 (vst2_bf16): Likewise.
5945 (vst2q_bf16): Likewise.
5946 (vst3_bf16): Likewise.
5947 (vst3q_bf16): Likewise.
5948 (vst4_bf16): Likewise.
5949 (vst4q_bf16): Likewise.
5950 (vld2_lane_bf16): Likewise.
5951 (vld2q_lane_bf16): Likewise.
5952 (vld3_lane_bf16): Likewise.
5953 (vld3q_lane_bf16): Likewise.
5954 (vld4_lane_bf16): Likewise.
5955 (vld4q_lane_bf16): Likewise.
5956 (vst2_lane_bf16): Likewise.
5957 (vst2q_lane_bf16): Likewise.
5958 (vst3_lane_bf16): Likewise.
5959 (vst3q_lane_bf16): Likewise.
5960 (vst4_lane_bf16): Likewise.
5961 (vst4q_lane_bf16): Likewise.
5962 * config/aarch64/geniterators.sh: Modify iterator regex to
5963 match new vector-tuple modes.
5964 * config/aarch64/iterators.md (insn_count): Extend mode
5965 attribute with vector-tuple type information.
5967 (Vendreg): Likewise.
5970 (VSTRUCT_2D): New mode iterator.
5971 (VSTRUCT_2DNX): Likewise.
5972 (VSTRUCT_2DX): Likewise.
5973 (VSTRUCT_2Q): Likewise.
5974 (VSTRUCT_2QD): Likewise.
5975 (VSTRUCT_3D): Likewise.
5976 (VSTRUCT_3DNX): Likewise.
5977 (VSTRUCT_3DX): Likewise.
5978 (VSTRUCT_3Q): Likewise.
5979 (VSTRUCT_3QD): Likewise.
5980 (VSTRUCT_4D): Likewise.
5981 (VSTRUCT_4DNX): Likewise.
5982 (VSTRUCT_4DX): Likewise.
5983 (VSTRUCT_4Q): Likewise.
5984 (VSTRUCT_4QD): Likewise.
5985 (VSTRUCT_D): Likewise.
5986 (VSTRUCT_Q): Likewise.
5987 (VSTRUCT_QD): Likewise.
5988 (VSTRUCT_ELT): New mode attribute.
5989 (vstruct_elt): Likewise.
5990 * genmodes.c (VECTOR_MODE): Add default prefix and order
5992 (VECTOR_MODE_WITH_PREFIX): Define.
5993 (make_vector_mode): Add mode prefix and order parameters.
5995 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5997 * expmed.c (extract_bit_field_1): Ensure modes are tieable.
5999 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
6001 * expr.c (emit_group_load_1): Remove historic workaround.
6003 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
6005 * config/aarch64/aarch64-builtins.c (aarch64_init_simd_builtins):
6006 Factor out main loop to...
6007 (aarch64_init_simd_builtin_functions): This new function.
6008 (register_tuple_type): Define.
6009 (aarch64_scalar_builtin_type_p): Define.
6010 (handle_arm_neon_h): Define.
6011 * config/aarch64/aarch64-c.c (aarch64_pragma_aarch64): Handle
6012 pragma for arm_neon.h.
6013 * config/aarch64/aarch64-protos.h (aarch64_advsimd_struct_mode_p):
6015 (handle_arm_neon_h): Likewise.
6016 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
6017 Remove static modifier.
6018 * config/aarch64/arm_neon.h (target): Remove Neon vector
6019 structure type definitions.
6021 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
6023 PR tree-optimization/102943
6024 * gimple-range-path.cc (path_range_query::range_on_path_entry):
6025 Prefer range_of_expr unless there are no statements in the BB.
6027 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
6029 PR tree-optimization/102943
6030 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
6031 Avoid duplicate calculation of paths.
6033 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
6035 PR tree-optimization/102943
6036 * gimple-range-path.cc (path_range_query::compute_phi_relations):
6037 Only compute relations for SSA names in the import list.
6038 (path_range_query::compute_outgoing_relations): Same.
6039 * gimple-range-path.h (path_range_query::import_p): New.
6041 2021-11-04 Richard Biener <rguenther@suse.de>
6043 PR rtl-optimization/103075
6044 * simplify-rtx.c (exact_int_to_float_conversion_p): Return
6045 false for a VOIDmode operand.
6047 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
6049 * config/aarch64/aarch64.c (aarch64_vector_costs): Make member
6050 variables private and add "m_" to their names. Remove is_loop.
6051 (aarch64_record_potential_advsimd_unrolling): Replace with...
6052 (aarch64_vector_costs::record_potential_advsimd_unrolling): ...this.
6053 (aarch64_analyze_loop_vinfo): Replace with...
6054 (aarch64_vector_costs::analyze_loop_vinfo): ...this.
6055 Move initialization of (m_)vec_flags to add_stmt_cost.
6056 (aarch64_analyze_bb_vinfo): Delete.
6057 (aarch64_count_ops): Replace with...
6058 (aarch64_vector_costs::count_ops): ...this.
6059 (aarch64_vector_costs::add_stmt_cost): Set m_vec_flags,
6060 using m_costing_for_scalar to test whether we're costing
6061 scalar or vector code.
6062 (aarch64_adjust_body_cost_sve): Replace with...
6063 (aarch64_vector_costs::adjust_body_cost_sve): ...this.
6064 (aarch64_adjust_body_cost): Replace with...
6065 (aarch64_vector_costs::adjust_body_cost): ...this.
6066 (aarch64_vector_costs::finish_cost): Use m_vinfo instead of is_loop.
6068 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
6070 * target.def (targetm.vectorize.init_cost): Replace with...
6071 (targetm.vectorize.create_costs): ...this.
6072 (targetm.vectorize.add_stmt_cost): Delete.
6073 (targetm.vectorize.finish_cost): Likewise.
6074 (targetm.vectorize.destroy_cost_data): Likewise.
6075 * doc/tm.texi.in (TARGET_VECTORIZE_INIT_COST): Replace with...
6076 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
6077 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
6078 (TARGET_VECTORIZE_FINISH_COST): Likewise.
6079 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
6080 * doc/tm.texi: Regenerate.
6081 * tree-vectorizer.h (vec_info::vec_info): Remove target_cost_data
6083 (vec_info::target_cost_data): Change from a void * to a vector_costs *.
6084 (vector_costs): New class.
6085 (init_cost): Take a vec_info and return a vector_costs.
6086 (dump_stmt_cost): Remove data parameter.
6087 (add_stmt_cost): Replace vinfo and data parameters with a vector_costs.
6088 (add_stmt_costs): Likewise.
6089 (finish_cost): Replace data parameter with a vector_costs.
6090 (destroy_cost_data): Delete.
6091 * tree-vectorizer.c (dump_stmt_cost): Remove data argument and
6093 (vec_info::vec_info): Remove the target_cost_data parameter and
6094 initialize the member variable to null instead.
6095 (vec_info::~vec_info): Delete target_cost_data instead of calling
6097 (vector_costs::add_stmt_cost): New function.
6098 (vector_costs::finish_cost): Likewise.
6099 (vector_costs::record_stmt_cost): Likewise.
6100 (vector_costs::adjust_cost_for_freq): Likewise.
6101 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
6102 call to vec_info::vec_info.
6103 (vect_compute_single_scalar_iteration_cost): Update after above
6104 changes to costing interface.
6105 (vect_analyze_loop_operations): Likewise.
6106 (vect_estimate_min_profitable_iters): Likewise.
6107 (vect_analyze_loop_2): Initialize LOOP_VINFO_TARGET_COST_DATA
6108 at the start_over point, where it needs to be recreated after
6109 trying without slp. Update retry code accordingly.
6110 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Update call
6111 to vec_info::vec_info.
6112 (vect_slp_analyze_operation): Update after above changes to costing
6114 (vect_bb_vectorization_profitable_p): Likewise.
6115 * targhooks.h (default_init_cost): Replace with...
6116 (default_vectorize_create_costs): ...this.
6117 (default_add_stmt_cost): Delete.
6118 (default_finish_cost, default_destroy_cost_data): Likewise.
6119 * targhooks.c (default_init_cost): Replace with...
6120 (default_vectorize_create_costs): ...this.
6121 (default_add_stmt_cost): Delete, moving logic to vector_costs instead.
6122 (default_finish_cost, default_destroy_cost_data): Delete.
6123 * config/aarch64/aarch64.c (aarch64_vector_costs): Inherit from
6124 vector_costs. Add a constructor.
6125 (aarch64_init_cost): Replace with...
6126 (aarch64_vectorize_create_costs): ...this.
6127 (aarch64_add_stmt_cost): Replace with...
6128 (aarch64_vector_costs::add_stmt_cost): ...this. Use record_stmt_cost
6129 to adjust the cost for inner loops.
6130 (aarch64_finish_cost): Replace with...
6131 (aarch64_vector_costs::finish_cost): ...this.
6132 (aarch64_destroy_cost_data): Delete.
6133 (TARGET_VECTORIZE_INIT_COST): Replace with...
6134 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
6135 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
6136 (TARGET_VECTORIZE_FINISH_COST): Likewise.
6137 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
6138 * config/i386/i386.c (ix86_vector_costs): New structure.
6139 (ix86_init_cost): Replace with...
6140 (ix86_vectorize_create_costs): ...this.
6141 (ix86_add_stmt_cost): Replace with...
6142 (ix86_vector_costs::add_stmt_cost): ...this. Use adjust_cost_for_freq
6143 to adjust the cost for inner loops.
6144 (ix86_finish_cost, ix86_destroy_cost_data): Delete.
6145 (TARGET_VECTORIZE_INIT_COST): Replace with...
6146 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
6147 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
6148 (TARGET_VECTORIZE_FINISH_COST): Likewise.
6149 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
6150 * config/rs6000/rs6000.c (TARGET_VECTORIZE_INIT_COST): Replace with...
6151 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
6152 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
6153 (TARGET_VECTORIZE_FINISH_COST): Likewise.
6154 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
6155 (rs6000_cost_data): Inherit from vector_costs.
6156 Add a constructor. Drop loop_info, cost and costing_for_scalar
6157 in favor of the corresponding vector_costs member variables.
6158 Add "m_" to the names of the remaining member variables and
6160 (rs6000_density_test): Replace with...
6161 (rs6000_cost_data::density_test): ...this.
6162 (rs6000_init_cost): Replace with...
6163 (rs6000_vectorize_create_costs): ...this.
6164 (rs6000_update_target_cost_per_stmt): Replace with...
6165 (rs6000_cost_data::update_target_cost_per_stmt): ...this.
6166 (rs6000_add_stmt_cost): Replace with...
6167 (rs6000_cost_data::add_stmt_cost): ...this. Use adjust_cost_for_freq
6168 to adjust the cost for inner loops.
6169 (rs6000_adjust_vect_cost_per_loop): Replace with...
6170 (rs6000_cost_data::adjust_vect_cost_per_loop): ...this.
6171 (rs6000_finish_cost): Replace with...
6172 (rs6000_cost_data::finish_cost): ...this. Group loop code
6173 into a single if statement and pass the loop_vinfo down to
6175 (rs6000_destroy_cost_data): Delete.
6177 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
6179 PR tree-optimization/103062
6181 * value-pointer-equiv.cc (ssa_equiv_stack::ssa_equiv_stack):
6182 Increase size of allocation by 1.
6183 (ssa_equiv_stack::push_replacement): Grow as needed.
6184 (ssa_equiv_stack::get_replacement): Same.
6185 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
6186 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Remove delete.
6187 (pointer_equiv_analyzer::set_global_equiv): Grow as needed.
6188 (pointer_equiv_analyzer::get_equiv): Same.
6189 (pointer_equiv_analyzer::get_equiv_expr): Remove const.
6190 * value-pointer-equiv.h (class pointer_equiv_analyzer): Remove
6191 const markers. Use auto_vec instead of tree *.
6193 2021-11-04 Richard Biener <rguenther@suse.de>
6195 * tree-ssa-sccvn.c (vn_nary_op_insert_into): Remove always
6196 true parameter and inline valueization.
6197 (vn_nary_op_lookup_1): Inline valueization from ...
6198 (vn_nary_op_compute_hash): ... here and remove it here.
6199 * tree-ssa-pre.c (phi_translate_1): Do not valueize
6200 before vn_nary_lookup_pieces.
6201 (get_representative_for): Mark created SSA representatives
6204 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
6206 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): Assert
6207 that the operand has a vector mode. Use subreg_lowpart_offset
6208 to test whether an index corresponds to the low part.
6210 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
6212 * read-rtl.c: Remove dead !GENERATOR_FILE block.
6213 * read-rtl-function.c (function_reader::consolidate_singletons):
6214 Generate canonical CONST_VECTORs.
6216 2021-11-04 liuhongt <hongtao.liu@intel.com>
6219 * config/i386/predicates.md (reg_or_notreg_operand): Rename to ..
6220 (regmem_or_bitnot_regmem_operand): .. and extend to handle
6222 * config/i386/sse.md (*<avx512>_vpternlog<mode>_1): Force_reg
6223 the operands which are required to be register_operand.
6224 (*<avx512>_vpternlog<mode>_2): Ditto.
6225 (*<avx512>_vpternlog<mode>_3): Ditto.
6226 (*<avx512>_vternlog<mode>_all): Disallow embedded broadcast for
6227 vector HFmodes since it's not a real AVX512FP16 instruction.
6229 2021-11-04 liuhongt <hongtao.liu@intel.com>
6232 * match.pd: Simplify (trunc)copysign ((extend)a, (extend)b) to
6233 .COPYSIGN (a,b) when a and b have the same type as the truncation
6234 type and have less precision than the extend type.
6236 2021-11-04 Richard Biener <rguenther@suse.de>
6238 * doc/generic.texi: Update TARGET_MEM_REF and MEM_REF
6241 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
6243 * config/i386/sse.md (VI2_AVX512VNNIBW): New mode iterator.
6244 (VI1_AVX512VNNI): Likewise.
6245 (SDOT_VPDP_SUF): New mode_attr.
6248 (sdot_prod<mode>): Use VI2_AVX512F iterator, expand to
6249 vpdpwssd when VNNI targets available.
6250 (usdot_prod<mode>): New expander for vector QImode.
6252 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
6254 * config/i386/amxtileintrin.h (_tile_loadd_internal): Add
6255 parentheses to base and stride.
6256 (_tile_stream_loadd_internal): Likewise.
6257 (_tile_stored_internal): Likewise.
6259 2021-11-03 Maciej W. Rozycki <macro@embecosm.com>
6261 * config/riscv/riscv.c (riscv_class_max_nregs): Swap the
6262 arguments to `reg_class_subset_p'.
6264 2021-11-03 Joseph Myers <joseph@codesourcery.com>
6267 * fold-const.c (fold_init): New function.
6268 * fold-const.h (fold_init): New prototype.
6270 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
6272 * params.opt (param_vrp2_mode): Make ranger the default for VRP2.
6274 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
6276 * gimple-range-fold.cc (fold_using_range::range_of_phi): Don't import
6277 a range from edge if arg == phidef.
6279 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
6281 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Test
6282 for constant before any other processing.
6284 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
6286 * flag-types.h (RANGER_DEBUG_ALL): Fix values.
6288 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
6290 * gimple-range.cc (gimple_ranger::gimple_ranger): Initialize current_bb.
6291 (gimple_ranger::range_of_expr): Pick up range_on_entry when there is
6292 no explicit context and current_bb is set.
6293 (gimple_ranger::fold_stmt): New.
6294 * gimple-range.h (current_bb, fold_stmt): New.
6295 * tree-vrp.c (rvrp_folder::fold_stmt): Call ranger's fold_stmt.
6297 2021-11-03 Richard Biener <rguenther@suse.de>
6299 PR tree-optimization/102970
6300 * tree-ssa-pre.c (phi_translate_1): Drop clique and base
6301 when translating a MEM_REF over a backedge.
6303 2021-11-03 Philipp Tomsich <philipp.tomsich@vrull.eu>
6305 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New Ampere-1 core.
6306 * config/aarch64/aarch64-tune.md: Regenerate.
6307 * config/aarch64/aarch64-cost-tables.h: Add extra costs for Ampere-1.
6308 * config/aarch64/aarch64.c: Add tuning structures for Ampere-1.
6309 * doc/invoke.texi: Add documentation for Ampere-1 core.
6311 2021-11-03 Wilco Dijkstra <wdijkstr@arm.com>
6313 * config/aarch64/aarch64.md (movsi): Add alternative for GOT accesses.
6315 (ldr_got_small_<mode>): Remove pattern.
6316 (ldr_got_small_sidi): Likewise.
6317 * config/aarch64/aarch64.c (aarch64_load_symref_appropriately): Keep
6318 GOT accesses as moves.
6319 (aarch64_print_operand): Correctly print got_lo12 in L specifier.
6320 (aarch64_mov_operand_p): Make GOT accesses valid move operands.
6321 * config/aarch64/constraints.md: Add new constraint Usw for GOT access.
6323 2021-11-03 Martin Liska <mliska@suse.cz>
6325 * gcov.c (read_line): Remove dead variable.
6327 2021-11-03 Martin Liska <mliska@suse.cz>
6330 * ipa-fnsummary.c (edge_predicate_pool): Rename predicate class to ipa_predicate.
6331 (ipa_fn_summary::account_size_time): Likewise.
6332 (edge_set_predicate): Likewise.
6333 (set_hint_predicate): Likewise.
6334 (add_freqcounting_predicate): Likewise.
6335 (evaluate_conditions_for_known_args): Likewise.
6336 (evaluate_properties_for_edge): Likewise.
6337 (remap_freqcounting_preds_after_dup): Likewise.
6338 (ipa_fn_summary_t::duplicate): Likewise.
6339 (set_cond_stmt_execution_predicate): Likewise.
6340 (set_switch_stmt_execution_predicate): Likewise.
6341 (compute_bb_predicates): Likewise.
6342 (will_be_nonconstant_expr_predicate): Likewise.
6343 (will_be_nonconstant_predicate): Likewise.
6344 (phi_result_unknown_predicate): Likewise.
6345 (predicate_for_phi_result): Likewise.
6346 (analyze_function_body): Likewise.
6347 (compute_fn_summary): Likewise.
6348 (summarize_calls_size_and_time): Likewise.
6349 (estimate_calls_size_and_time): Likewise.
6350 (ipa_call_context::estimate_size_and_time): Likewise.
6351 (remap_edge_summaries): Likewise.
6352 (remap_freqcounting_predicate): Likewise.
6353 (ipa_merge_fn_summary_after_inlining): Likewise.
6354 (ipa_update_overall_fn_summary): Likewise.
6355 (read_ipa_call_summary): Likewise.
6356 (inline_read_section): Likewise.
6357 * ipa-fnsummary.h (struct ipa_freqcounting_predicate): Likewise.
6358 * ipa-predicate.c (predicate::add_clause): Likewise.
6359 (ipa_predicate::add_clause): Likewise.
6360 (predicate::or_with): Likewise.
6361 (ipa_predicate::or_with): Likewise.
6362 (predicate::evaluate): Likewise.
6363 (ipa_predicate::evaluate): Likewise.
6364 (predicate::probability): Likewise.
6365 (ipa_predicate::probability): Likewise.
6366 (dump_condition): Likewise.
6367 (dump_clause): Likewise.
6368 (predicate::dump): Likewise.
6369 (ipa_predicate::dump): Likewise.
6370 (predicate::debug): Likewise.
6371 (ipa_predicate::debug): Likewise.
6372 (predicate::remap_after_duplication): Likewise.
6373 (ipa_predicate::remap_after_duplication): Likewise.
6374 (predicate::remap_after_inlining): Likewise.
6375 (ipa_predicate::remap_after_inlining): Likewise.
6376 (predicate::stream_in): Likewise.
6377 (ipa_predicate::stream_in): Likewise.
6378 (predicate::stream_out): Likewise.
6379 (ipa_predicate::stream_out): Likewise.
6380 (add_condition): Likewise.
6381 * ipa-predicate.h (class predicate): Likewise.
6382 (class ipa_predicate): Likewise.
6383 (add_condition): Likewise.
6385 2021-11-03 Richard Biener <rguenther@suse.de>
6387 * bitmap.h (bitmap_bit_p): Change the return type to bool.
6388 * bitmap.c (bitmap_bit_p): Likewise.
6389 * sbitmap.h (bitmap_bit_p): Likewise.
6390 (bitmap_set_bit): Return whether the bit changed.
6391 (bitmap_clear_bit): Likewise.
6392 * tree-ssa.c (verify_vssa): Make use of the changed state
6393 from bitmap_set_bit.
6395 2021-11-03 Richard Biener <rguenther@suse.de>
6397 PR middle-end/103033
6398 * internal-fn.c (expand_DEFERRED_INIT): Elide the
6399 native_interpret_expr path in favor of folding the
6400 VIEW_CONVERT_EXPR generated when punning the RHS.
6402 2021-11-03 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
6404 * config/s390/s390.c (s390_loop_unroll_adjust): In case of early
6407 2021-11-03 Jan Hubicka <hubicka@ucw.cz>
6410 * ipa-modref.c (callee_to_caller_flags): New function.
6411 (modref_eaf_analysis::analyze_ssa_name): Use it.
6412 (ipa_merge_modref_summary_after_inlining): Fix whitespace.
6414 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
6416 * ipa-modref.c (modref_eaf_analysis::analyze_ssa_name): Revert
6419 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
6420 Uroš Bizjak <ubizjak@gmail.com>
6422 * config/i386/i386.md (<any_rotate>ti3): Provide expansion for
6423 rotations by non-constant amounts.
6425 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
6427 * ipa-modref.c: Fix anonymous namespace placement.
6428 (class modref_eaf_analysis): New class.
6429 (analyze_ssa_name_flags): Turn to ...
6430 (modref_eaf_analysis::analyze_ssa_name): ... this one.
6431 (merge_call_lhs_flags): Turn to ...
6432 (modref_eaf_analysis::merge_call_lhs_flags): ... this one.
6433 (modref_eaf_analysis::merge_with_ssa_name): New member function.
6434 (record_escape_points): Turn to ...
6435 (modref_eaf_analysis::record_escape_points): ... this one.
6436 (analyze_parms): Update.
6437 (ipa_merge_modref_summary_after_inlining): Move to the end of file.
6439 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
6441 * gimple.c (gimple_call_static_chain_flags): New function.
6442 * gimple.h (gimple_call_static_chain_flags): Declare.
6443 * ipa-modref.c (modref_summary::modref_summary): Initialize
6445 (modref_summary_lto::modref_summary_lto): Likewise.
6446 (modref_summary::useful_p): Test static_chain_flags.
6447 (modref_summary_lto::useful_p): Likewise.
6448 (struct modref_summary_lto): Add static_chain_flags.
6449 (modref_summary::dump): Dump static_chain_flags.
6450 (modref_summary_lto::dump): Likewise.
6451 (struct escape_point): Add static_cahin_arg.
6452 (analyze_ssa_name_flags): Use gimple_call_static_chain_flags.
6453 (analyze_parms): Handle static chains.
6454 (modref_summaries::duplicate): Duplicate static_chain_flags.
6455 (modref_summaries_lto::duplicate): Likewise.
6456 (modref_write): Stream static_chain_flags.
6457 (read_section): Likewise.
6458 (modref_merge_call_site_flags): Handle static_chain_flags.
6459 * ipa-modref.h (struct modref_summary): Add static_chain_flags.
6460 * tree-ssa-structalias.c (handle_rhs_call): Use
6461 gimple_call_static_chain_flags.
6463 2021-11-02 Richard Biener <rguenther@suse.de>
6465 PR tree-optimization/103029
6466 * tree-vect-loop-manip.c (vect_loop_versioning): Ensure
6467 the PHI nodes in the loop maintain their original operand
6470 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
6472 * tree-core.h (EAF_NOT_RETURNED_DIRECTLY): New flag.
6473 (EAF_NOREAD): Renumber.
6474 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOT_RETURNED_DIRECTLY.
6475 (remove_useless_eaf_flags): Handle EAF_NOT_RETURNED_DIRECTLY.
6476 (deref_flags): Likewise.
6477 (modref_lattice::init): Likewise.
6478 (modref_lattice::merge): Likewise.
6479 (merge_call_lhs_flags): Likewise.
6480 (analyze_ssa_name_flags): Likewise.
6481 (modref_merge_call_site_flags): Likewise.
6482 * tree-ssa-structalias.c (handle_call_arg): Likewise.
6484 2021-11-02 Maciej W. Rozycki <macro@embecosm.com>
6486 * config/riscv/riscv.c (riscv_rtx_costs): Correct a CONST_INT_P
6487 check and remove an unused local variable with shNadd/shNadd.uw
6490 2021-11-02 Tamar Christina <tamar.christina@arm.com>
6492 PR tree-optimization/103007
6493 * tree-vect-slp-patterns.c (complex_fms_pattern::matches): Add elem
6496 2021-11-02 Richard Biener <rguenther@suse.de>
6498 PR middle-end/103038
6499 * fold-const.c (native_interpret_expr): Handle OFFSET_TYPE.
6500 (can_native_interpret_type_p): Likewise.
6501 * internal-fn.c (expand_DEFERRED_INIT): View-convert the
6502 RHS if the LHS is an SSA name.
6504 2021-11-02 Richard Sandiford <richard.sandiford@arm.com>
6506 * langhooks.h (lang_hooks_for_types::simulate_record_decl): New hook.
6507 * langhooks-def.h (lhd_simulate_record_decl): Declare.
6508 (LANG_HOOKS_SIMULATE_RECORD_DECL): Define.
6509 (LANG_HOOKS_FOR_TYPES_INITIALIZER): Include it.
6510 * langhooks.c (lhd_simulate_record_decl): New function.
6512 2021-11-02 Jakub Jelinek <jakub@redhat.com>
6515 * config/i386/i386.c (ix86_vector_mode_supported_p): Reject vector
6516 modes with TImode inner mode if 32-bit.
6518 2021-11-02 liuhongt <hongtao.liu@intel.com>
6520 * doc/sourcebuild.texi (vect_slp_v4qi_store_unalign,
6521 vect_slp_v2hi_store_unalign, vect_slp_v4hi_store_unalign,
6522 vect_slp_v4si_store_unalign): Document efficient target.
6523 (vect_slp_v4qi_store_unalign_1, vect_slp_v8qi_store_unalign_1,
6524 vect_slp_v16qi_store_unalign_1): Ditto.
6525 (vect_slp_v2hi_store_align, vect_slp_v2qi_store_align,
6526 vect_slp_v2si_store_align, vect_slp_v4qi_store_align): Ditto.
6527 (struct_4char_block_move, struct_8char_block_move,
6528 struct_16char_block_move): Ditto.
6530 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
6531 Jakub Jelinek <jakub@redhat.com>
6534 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti,
6535 ix86_expand_ti_to_v1ti): New helper functions.
6536 (ix86_expand_v1ti_shift): Check if the amount operand is an
6537 integer constant, and expand as a TImode shift if it isn't.
6538 (ix86_expand_v1ti_rotate): Check if the amount operand is an
6539 integer constant, and expand as a TImode rotate if it isn't.
6540 (ix86_expand_v1ti_ashiftrt): New function to expand arithmetic
6541 right shifts of V1TImode quantities.
6542 * config/i386/i386-protos.h (ix86_expand_v1ti_ashiftrt): Prototype.
6543 * config/i386/sse.md (ashlv1ti3, lshrv1ti3): Change constraints
6544 to QImode general_operand, and let the helper functions lower
6545 shifts by non-constant operands, as TImode shifts. Make
6546 conditional on TARGET_64BIT.
6547 (ashrv1ti3): New expander calling ix86_expand_v1ti_ashiftrt.
6548 (rotlv1ti3, rotrv1ti3): Change shift operand to QImode.
6549 Make conditional on TARGET_64BIT.
6551 2021-11-02 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
6553 * config/s390/s390.md ("*cc_to_int", "tabort", "*tabort_1",
6554 "*tabort_1_plus"): Remove operands non-null check.
6556 2021-11-01 Martin Liska <mliska@suse.cz>
6558 * opt-functions.awk: Add new sanity checking.
6559 * optc-gen.awk: Add new argument to integer_range_info.
6560 * params.opt: Update 2 params which have negative IntegerRange.
6562 2021-11-01 qing zhao <qing.zhao@oracle.com>
6564 * gimplify.c (gimplify_decl_expr): Do not add call to
6565 __builtin_clear_padding when a variable is a gimple register
6566 or it might not have padding.
6567 (gimplify_init_constructor): Likewise.
6569 2021-11-01 Tamar Christina <tamar.christina@arm.com>
6571 * config/arm/aarch-common-protos.h (struct vector_cost_table): Add
6572 movi, dup and extract costing fields.
6573 * config/aarch64/aarch64-cost-tables.h (qdf24xx_extra_costs,
6574 thunderx_extra_costs, thunderx2t99_extra_costs,
6575 thunderx3t110_extra_costs, tsv110_extra_costs, a64fx_extra_costs): Use
6577 * config/arm/aarch-cost-tables.h (generic_extra_costs,
6578 cortexa53_extra_costs, cortexa57_extra_costs, cortexa76_extra_costs,
6579 exynosm1_extra_costs, xgene1_extra_costs): Likewise.
6580 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>): Add r->w dup.
6581 * config/aarch64/aarch64.c (aarch64_rtx_costs): Add extra costs.
6583 2021-11-01 Tamar Christina <tamar.christina@arm.com>
6585 * cse.c (add_to_set): New.
6586 (find_sets_in_insn): Register constants in sets.
6587 (canonicalize_insn): Use auto_vec instead.
6588 (cse_insn): Try materializing using vec_dup.
6589 * rtl.h (simplify_context::simplify_gen_vec_select,
6590 simplify_gen_vec_select): New.
6591 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): New.
6593 2021-11-01 David Malcolm <dmalcolm@redhat.com>
6595 * common.opt (fdiagnostics-escape-format=): New.
6596 (diagnostics_escape_format): New enum.
6597 (DIAGNOSTICS_ESCAPE_FORMAT_UNICODE): New enum value.
6598 (DIAGNOSTICS_ESCAPE_FORMAT_BYTES): Likewise.
6599 * diagnostic-format-json.cc (json_end_diagnostic): Add
6600 "escape-source" attribute.
6601 * diagnostic-show-locus.c
6602 (exploc_with_display_col::exploc_with_display_col): Replace
6603 "tabstop" param with a cpp_char_column_policy and add an "aspect"
6604 param. Use these to compute m_display_col accordingly.
6605 (struct char_display_policy): New struct.
6606 (layout::m_policy): New field.
6607 (layout::m_escape_on_output): New field.
6608 (def_policy): New function.
6609 (make_range): Update for changes to exploc_with_display_col ctor.
6610 (default_print_decoded_ch): New.
6611 (width_per_escaped_byte): New.
6612 (escape_as_bytes_width): New.
6613 (escape_as_bytes_print): New.
6614 (escape_as_unicode_width): New.
6615 (escape_as_unicode_print): New.
6617 (layout::layout): Initialize new fields. Update m_exploc ctor
6618 call for above change to ctor.
6619 (layout::maybe_add_location_range): Update for changes to
6620 exploc_with_display_col ctor.
6621 (layout::calculate_x_offset_display): Update for change to
6623 (layout::print_source_line): Pass policy
6624 to cpp_display_width_computation. Capture cpp_decoded_char when
6625 calling process_next_codepoint. Move printing of source code to
6626 m_policy.m_print_cb.
6627 (line_label::line_label): Pass in policy rather than context.
6628 (layout::print_any_labels): Update for change to line_label ctor.
6629 (get_affected_range): Pass in policy rather than context, updating
6630 calls to location_compute_display_column accordingly.
6631 (get_printed_columns): Likewise, also for cpp_display_width.
6632 (correction::correction): Pass in policy rather than tabstop.
6633 (correction::compute_display_cols): Pass m_policy rather than
6634 m_tabstop to cpp_display_width.
6635 (correction::m_tabstop): Replace with...
6636 (correction::m_policy): ...this.
6637 (line_corrections::line_corrections): Pass in policy rather than
6639 (line_corrections::m_context): Replace with...
6640 (line_corrections::m_policy): ...this.
6641 (line_corrections::add_hint): Update to use m_policy rather than
6643 (line_corrections::add_hint): Likewise.
6644 (layout::print_trailing_fixits): Likewise.
6645 (selftest::test_display_widths): New.
6646 (selftest::test_layout_x_offset_display_utf8): Update to use
6647 policy rather than tabstop.
6648 (selftest::test_one_liner_labels_utf8): Add test of escaping
6650 (selftest::test_diagnostic_show_locus_one_liner_utf8): Update to
6651 use policy rather than tabstop.
6652 (selftest::test_overlapped_fixit_printing): Likewise.
6653 (selftest::test_overlapped_fixit_printing_utf8): Likewise.
6654 (selftest::test_overlapped_fixit_printing_2): Likewise.
6655 (selftest::test_tab_expansion): Likewise.
6656 (selftest::test_escaping_bytes_1): New.
6657 (selftest::test_escaping_bytes_2): New.
6658 (selftest::diagnostic_show_locus_c_tests): Call the new tests.
6659 * diagnostic.c (diagnostic_initialize): Initialize
6660 context->escape_format.
6661 (convert_column_unit): Update to use default character width policy.
6662 (selftest::test_diagnostic_get_location_text): Likewise.
6663 * diagnostic.h (enum diagnostics_escape_format): New enum.
6664 (diagnostic_context::escape_format): New field.
6665 * doc/invoke.texi (-fdiagnostics-escape-format=): New option.
6666 (-fdiagnostics-format=): Add "escape-source" attribute to examples
6667 of JSON output, and document it.
6668 * input.c (location_compute_display_column): Pass in "policy"
6669 rather than "tabstop", passing to
6670 cpp_byte_column_to_display_column.
6671 (selftest::test_cpp_utf8): Update to use cpp_char_column_policy.
6672 * input.h (class cpp_char_column_policy): New forward decl.
6673 (location_compute_display_column): Pass in "policy" rather than
6675 * opts.c (common_handle_option): Handle
6676 OPT_fdiagnostics_escape_format_.
6677 * selftest.c (temp_source_file::temp_source_file): New ctor
6678 overload taking a size_t.
6679 * selftest.h (temp_source_file::temp_source_file): Likewise.
6681 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
6683 * dbgcnt.def: Add debug counter for back_thread[12] and
6684 back_threadfull[12].
6685 * passes.def: Pass "first" argument to each back threading pass.
6686 * tree-ssa-threadbackward.c (back_threader::back_threader): Add
6688 (back_threader::debug_counter): New.
6689 (back_threader::maybe_register_path): Call debug_counter.
6691 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
6693 * tree-ssa-threadbackward.c (BT_NONE): New.
6696 (back_threader::back_threader): Add flags.
6697 Move loop initialization here.
6698 (back_threader::~back_threader): New.
6699 (back_threader::find_taken_edge_switch): Change solver and ranger
6701 (back_threader::find_taken_edge_cond): Same.
6702 (back_threader::find_paths_to_names): Same.
6703 (back_threader::find_paths): Same.
6704 (back_threader::dump): Same.
6705 (try_thread_blocks): Merge into thread_blocks.
6706 (back_threader::thread_blocks): New.
6707 (do_early_thread_jumps): Merge into thread_blocks.
6708 (do_thread_jumps): Merge into thread_blocks.
6709 (back_threader::thread_through_all_blocks): Remove.
6711 2021-11-01 Andrew MacLeod <amacleod@redhat.com>
6713 PR tree-optimization/103003
6714 * value-relation.cc (dom_oracle::register_relation): If the 2
6715 ssa names are the same, don't register any relation.
6717 2021-11-01 Dan Li <ashimida@linux.alibaba.com>
6719 * config/aarch64/aarch64.c (aarch64_expand_epilogue): Remove
6720 redundant check for calls_eh_return.
6721 * config/aarch64/aarch64.md (*do_return): Likewise.
6723 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
6725 * cfghooks.c (cfg_hook_duplicate_loop_to_header_edge): Rename
6726 duplicate_loop_to_header_edge to
6727 duplicate_loop_body_to_header_edge.
6728 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
6729 * cfghooks.h (struct cfg_hooks): Likewise.
6730 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
6731 * cfgloopmanip.c (duplicate_loop_body_to_header_edge): Likewise.
6732 (clone_loop_to_header_edge): Likewise.
6733 * cfgloopmanip.h (duplicate_loop_body_to_header_edge): Likewise.
6734 * cfgrtl.c (struct cfg_hooks): Likewise.
6735 * doc/loop.texi: Likewise.
6736 * loop-unroll.c (unroll_loop_constant_iterations): Likewise.
6737 (unroll_loop_runtime_iterations): Likewise.
6738 (unroll_loop_stupid): Likewise.
6739 (apply_opt_in_copies): Likewise.
6740 * tree-cfg.c (struct cfg_hooks): Likewise.
6741 * tree-ssa-loop-ivcanon.c (try_unroll_loop_completely): Likewise.
6742 (try_peel_loop): Likewise.
6743 * tree-ssa-loop-manip.c (copy_phi_node_args): Likewise.
6744 (gimple_duplicate_loop_body_to_header_edge): Likewise.
6745 (tree_transform_and_unroll_loop): Likewise.
6746 * tree-ssa-loop-manip.h (gimple_duplicate_loop_body_to_header_edge):
6749 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
6751 * cfgloopmanip.c (loop_version): Refactor loopify to
6752 loop_version. Move condition generation after loopify.
6754 * cfgloopmanip.h (loopify): Delete.
6756 2021-10-31 Jan Hubicka <hubicka@ucw.cz>
6758 * ipa-fnsummary.c: Include tree-dfa.h.
6759 (points_to_local_or_readonly_memory_p): Return true on return
6761 * ipa-modref.c (analyze_ssa_name_flags): Fix handling of copy
6764 2021-10-30 Tobias Burnus <tobias@codesourcery.com>
6766 PR middle-end/102972
6767 * omp-low.c (omp_runtime_api_call): Use DECL_ASSEMBLER_NAME to get
6768 internal Fortran name; new permit_num_teams arg to permit
6769 omp_get_num_teams and omp_get_team_num.
6770 (scan_omp_1_stmt): Update call to it, add missing call for
6771 reverse offload, and check for strictly nested API calls in teams.
6773 2021-10-30 Jakub Jelinek <jakub@redhat.com>
6775 * gimplify.c (gimplify_omp_for): Diagnose threadprivate iterators.
6777 2021-10-29 David Malcolm <dmalcolm@redhat.com>
6779 * selftest.c (assert_streq): Add newlines when emitting non-equal
6782 2021-10-29 David Malcolm <dmalcolm@redhat.com>
6784 * Makefile.in: Fix syntax for reference to LIBDEPS in
6787 2021-10-29 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
6789 * doc/install.texi: Bump required minimum DejaGnu version.
6791 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
6792 Andrew MacLeod <amacleod@redhat.com>
6794 * value-relation.cc (path_oracle::killing_def): Add a
6795 self-equivalence so we don't look to the root oracle.
6797 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
6799 * passes.def: Replace the pass_thread_jumps before VRP* with
6800 pass_thread_jumps_full. Remove all pass_vrp_threader instances.
6801 * tree-ssa-threadbackward.c (pass_data_thread_jumps_full):
6802 Remove hyphen from "thread-full" name.
6804 2021-10-29 Andrew MacLeod <amacleod@redhat.com>
6806 PR tree-optimization/102983
6807 * gimple-range-cache.h (propagate_updated_value): Make public.
6808 * gimple-range.cc (gimple_ranger::range_of_stmt): Propagate exports
6809 when processing gcond stmts.
6811 2021-10-29 Jan Hubicka <hubicka@ucw.cz>
6813 * gimple.c (gimple_call_retslot_flags): New function.
6814 * gimple.h (gimple_call_retslot_flags): Declare.
6815 * ipa-modref.c: Include tree-cfg.h.
6816 (struct escape_entry): Turn parm_index to signed.
6817 (modref_summary_lto::modref_summary_lto): Add retslot_flags.
6818 (modref_summary::modref_summary): Initialize retslot_flags.
6819 (struct modref_summary_lto): Likewise.
6820 (modref_summary::useful_p): Check retslot_flags.
6821 (modref_summary_lto::useful_p): Likewise.
6822 (modref_summary::dump): Dump retslot_flags.
6823 (modref_summary_lto::dump): Likewise.
6824 (struct escape_point): Add hidden_args enum.
6825 (analyze_ssa_name_flags): Ignore return slot return;
6826 use gimple_call_retslot_flags.
6827 (record_escape_points): Break out from ...
6828 (analyze_parms): ... here; handle retslot_flags.
6829 (modref_summaries::duplicate): Duplicate retslot_flags.
6830 (modref_summaries_lto::duplicate): Likewise.
6831 (modref_write_escape_summary): Stream parm_index as signed.
6832 (modref_read_escape_summary): Likewise.
6833 (modref_write): Stream retslot_flags.
6834 (read_section): Likewise.
6835 (struct escape_map): Fix typo in comment.
6836 (update_escape_summary_1): Fix whitespace.
6837 (ipa_merge_modref_summary_after_inlining): Drop retslot_flags.
6838 (modref_merge_call_site_flags): Merge retslot_flags.
6839 * ipa-modref.h (struct modref_summary): Add retslot_flags.
6840 * tree-ssa-structalias.c (handle_rhs_call): Handle retslot_flags.
6842 2021-10-29 Tamar Christina <tamar.christina@arm.com>
6844 PR tree-optimization/102977
6845 * tree-vect-slp-patterns.c (vect_match_call_p): Remove.
6846 (vect_detect_pair_op): Add crosslane check.
6847 (vect_match_call_complex_mla): Remove.
6848 (class complex_mul_pattern): Update comment.
6849 (complex_mul_pattern::matches): Update detection.
6850 (class complex_fma_pattern): Remove.
6851 (complex_fma_pattern::matches): Remove.
6852 (complex_fma_pattern::recognize): Remove.
6853 (complex_fma_pattern::build): Remove.
6854 (class complex_fms_pattern): Update comment.
6855 (complex_fms_pattern::matches): Remove.
6856 (complex_operations_pattern::recognize): Remove complex_fma_pattern
6858 2021-10-29 Jakub Jelinek <jakub@redhat.com>
6860 * gimple-fold.c (gimple_fold_builtin_memset): Copy over location from
6863 2021-10-29 Xionghu Luo <luoxhu@linux.ibm.com>
6866 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const): Add
6867 patterns match and emit for VSX xxpermdi.
6869 2021-10-29 liuhongt <hongtao.liu@intel.com>
6872 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF): New
6874 (V16HF_FTYPE_V16HF): Ditto.
6875 (V32HF_FTYPE_V32HF): Ditto.
6876 (V8HF_FTYPE_V8HF_ROUND): Ditto.
6877 (V16HF_FTYPE_V16HF_ROUND): Ditto.
6878 (V32HF_FTYPE_V32HF_ROUND): Ditto.
6879 * config/i386/i386-builtin.def (IX86_BUILTIN_FLOORPH,
6880 IX86_BUILTIN_CEILPH, IX86_BUILTIN_TRUNCPH,
6881 IX86_BUILTIN_FLOORPH256, IX86_BUILTIN_CEILPH256,
6882 IX86_BUILTIN_TRUNCPH256, IX86_BUILTIN_FLOORPH512,
6883 IX86_BUILTIN_CEILPH512, IX86_BUILTIN_TRUNCPH512): New builtin.
6884 * config/i386/i386-builtins.c
6885 (ix86_builtin_vectorized_function): Enable vectorization for
6886 HFmode FLOOR/CEIL/TRUNC operation.
6887 * config/i386/i386-expand.c (ix86_expand_args_builtin): Handle
6889 * config/i386/sse.md (rint<mode>2, nearbyint<mode>2): Extend
6892 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6893 Andrew MacLeod <amacleod@redhat.com>
6895 * value-relation.cc (path_oracle::killing_def): Walk the
6896 equivalency list and remove SSA from any equivalencies.
6898 2021-10-28 Stafford Horne <shorne@gmail.com>
6900 * config/or1k/or1k.h (PROFILE_HOOK): Add return address argument
6903 2021-10-28 Jakub Jelinek <jakub@redhat.com>
6905 PR tree-optimization/102951
6906 * fold-const.h (address_compare): Declare.
6907 * fold-const.c (address_compare): New function.
6908 * match.pd (cmp (convert1?@2 addr@0) (convert2? addr@1)): Use
6909 address_compare helper.
6910 (minmax cmp (convert1?@2 addr@0) (convert2?@3 addr@1)): New
6913 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
6915 * vr-values.c (simplify_using_ranges::fold_cond): Change fold message.
6917 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
6919 PR tree-optimization/102940
6920 * tree-vrp.c (execute_ranger_vrp): Reset scev.
6922 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
6924 * config/nios2/linux.h (MUSL_DYNAMIC_LINKER): Add musl linker
6926 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
6928 * configure: Regenerate.
6929 * configure.ac: Use CPPFLAGS_FOR_BUILD for GMPINC
6931 2021-10-28 Eric Botcazou <ebotcazou@adacore.com>
6933 * doc/invoke.texi (%X): Remove obsolete reference to -Wl.
6935 2021-10-28 Richard Biener <rguenther@suse.de>
6938 * fold-const.c (fold_convert_const): Avoid int to float
6939 constant folding with -frounding-math and inexact result.
6940 * simplify-rtx.c (simplify_const_unary_operation): Likewise
6941 for both float and unsigned_float.
6943 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6945 * tree-ssa-threadbackward.c
6946 (back_threader::find_taken_edge_switch): Use find_case_label_range
6947 instead of find_taken_edge.
6949 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6951 * tree-ssa-threadbackward.c (class back_threader_registry):
6952 Inherit from back_jt_path_registry.
6953 (back_threader_registry::thread_through_all_blocks): Remove.
6954 (back_threader_registry::register_path): Remove
6955 m_lowlevel_registry prefix.
6957 2021-10-28 Richard Biener <rguenther@suse.de>
6960 * fold-const.c (fold_convert_const_real_from_real): Honor
6961 -frounding-math if the conversion is not exact.
6962 * simplify-rtx.c (simplify_const_unary_operation): Do not
6963 simplify FLOAT_TRUNCATE with sign dependent rounding.
6965 2021-10-28 Richard Biener <rguenther@suse.de>
6967 PR tree-optimization/102949
6968 * tree-vect-stmts.c (ensure_base_align): Look at the
6969 dr_info of a group leader and assert we are looking at
6970 one with analyzed alignment.
6972 2021-10-28 Kewen Lin <linkw@linux.ibm.com>
6975 * config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Consider
6976 V1T1 mode for unaligned load and store.
6978 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
6980 * config/riscv/riscv.md (zero_extendsidi2_internal): Allow ZBB
6983 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
6985 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add `i` to
6988 2021-10-28 Alexandre Oliva <oliva@adacore.com>
6990 * common.opt (fharden-compares): New.
6991 (fharden-conditional-branches): New.
6992 * doc/invoke.texi: Document new options.
6993 * gimple-harden-conditionals.cc: New.
6994 * Makefile.in (OBJS): Build it.
6995 * passes.def: Add new passes.
6996 * tree-pass.h (make_pass_harden_compares): Declare.
6997 (make_pass_harden_conditional_branches): Declare.
6999 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
7002 * config/rs6000/altivec.md: Add vsx register constraints.
7003 * config/rs6000/vsx.md (vsx_xxsel<mode>): Delete.
7004 (vsx_xxsel<mode>2): Likewise.
7005 (vsx_xxsel<mode>3): Likewise.
7006 (vsx_xxsel<mode>4): Likewise.
7008 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
7011 * config/rs6000/altivec.md (*altivec_vsel<mode>): Change to ...
7012 (altivec_vsel<mode>): ... this and update define.
7013 (*altivec_vsel<mode>_uns): Delete.
7014 (altivec_vsel<mode>2): New define_insn.
7015 (altivec_vsel<mode>3): Likewise.
7016 (altivec_vsel<mode>4): Likewise.
7017 * config/rs6000/rs6000-call.c (altivec_expand_vec_sel_builtin): New.
7018 (altivec_expand_builtin): Call altivec_expand_vec_sel_builtin to expand
7020 * config/rs6000/rs6000.c (rs6000_emit_vector_cond_expr): Use bit-wise
7021 selection instead of per element.
7022 * config/rs6000/vector.md:
7023 * config/rs6000/vsx.md (*vsx_xxsel<mode>): Change to ...
7024 (vsx_xxsel<mode>): ... this and update define.
7025 (*vsx_xxsel<mode>_uns): Delete.
7026 (vsx_xxsel<mode>2): New define_insn.
7027 (vsx_xxsel<mode>3): Likewise.
7028 (vsx_xxsel<mode>4): Likewise.
7030 2021-10-28 Hongyu Wang <hongyu.wang@intel.com>
7032 * config/i386/i386.c (use_rsqrt_p): Add mode parameter, enable
7033 HFmode rsqrt without TARGET_SSE_MATH.
7034 (ix86_optab_supported_p): Refactor rint, adjust floor, ceil,
7035 btrunc condition to be restricted by -ftrapping-math, adjust
7036 use_rsqrt_p function call.
7037 * config/i386/i386.md (rcphf2): New define_insn.
7038 (rsqrthf2): Likewise.
7039 * config/i386/sse.md (div<mode>3): Change VF2H to VF2.
7040 (div<mode>3): New expander for HF mode.
7041 (rsqrt<mode>2): Likewise.
7042 (*avx512fp16_vmrcpv8hf2): New define_insn for rpad pass.
7043 (*avx512fp16_vmrsqrtv8hf2): Likewise.
7045 2021-10-27 Saagar Jha <saagar@saagarjha.com>
7047 * config.gcc: Adjust for Darwin21.
7048 * config/darwin-c.c (macosx_version_as_macro): Likewise.
7049 * config/darwin-driver.c (validate_macosx_version_min):
7051 (darwin_find_version_from_kernel): Likewise.
7053 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
7055 * gimple-range-path.cc
7056 (path_range_query::range_defined_in_block): Call killing_def.
7058 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
7060 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Dump
7061 operands as well as relation.
7062 * gimple-range-path.cc
7063 (path_range_query::compute_ranges_in_block): Compute PHI relations
7064 first. Compute outgoing relations at the end.
7065 (path_range_query::compute_ranges): Remove call to compute_relations.
7066 (path_range_query::compute_relations): Remove.
7067 (path_range_query::maybe_register_phi_relation): New.
7068 (path_range_query::compute_phi_relations): Abstract out
7069 registering one PHI relation to...
7070 (path_range_query::compute_outgoing_relations): ...here.
7071 * gimple-range-path.h (class path_range_query): Remove
7073 Add maybe_register_phi_relation.
7075 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
7076 Andrew MacLeod <amacleod@redhat.com>
7078 * value-relation.cc (path_oracle::killing_def): Kill second
7081 2021-10-27 Martin Jambor <mjambor@suse.cz>
7083 * ipa-cp.c (good_cloning_opportunity_p): Decide whether to use
7084 profile feedback depending on their local availability.
7086 2021-10-27 Martin Jambor <mjambor@suse.cz>
7088 * params.opt (param_ipa_cp_profile_count_base): New parameter.
7089 * doc/invoke.texi (Optimize Options): Add entry for
7090 ipa-cp-profile-count-base.
7091 * ipa-cp.c (max_count): Replace with base_count, replace all
7092 occurrences too, unless otherwise stated.
7093 (ipcp_cloning_candidate_p): Identify mostly-directly called
7094 functions based on their counts, not max_count.
7095 (compare_edge_profile_counts): New function.
7096 (ipcp_propagate_stage): Instead of setting max_count, find the
7097 appropriate edge count in a sorted vector of counts of eligible
7098 edges and make it the base_count.
7100 2021-10-27 Martin Jambor <mjambor@suse.cz>
7102 * ipa-cp.c (struct caller_statistics): New fields rec_count_sum,
7103 n_nonrec_calls and itself, document all fields.
7104 (init_caller_stats): Initialize the above new fields.
7105 (gather_caller_stats): Gather self-recursive counts and calls number.
7106 (get_info_about_necessary_edges): Gather counts of self-recursive and
7107 other edges bringing in the requested value separately.
7108 (dump_profile_updates): Rework to dump info about a single node only.
7109 (lenient_count_portion_handling): New function.
7110 (struct gather_other_count_struct): New type.
7111 (gather_count_of_non_rec_edges): New function.
7112 (struct desc_incoming_count_struct): New type.
7113 (analyze_clone_icoming_counts): New function.
7114 (adjust_clone_incoming_counts): Likewise.
7115 (update_counts_for_self_gen_clones): Likewise.
7116 (update_profiling_info): Rewritten.
7117 (update_specialized_profile): Adjust call to dump_profile_updates.
7118 (create_specialized_node): Do not update profiling info.
7119 (decide_about_value): New parameter self_gen_clones, either push new
7120 clones into it or update their profile counts. For self-recursively
7121 generated values, use a portion of the node count instead of count
7122 from self-recursive edges to estimate goodness.
7123 (decide_whether_version_node): Gather clones for self-generated values
7124 in a new vector, update their profiles at once at the end.
7126 2021-10-27 Richard Biener <rguenther@suse.de>
7128 * tree-vectorizer.c (vect_transform_loops): New function,
7130 (try_vectorize_loop_1): ... here. Simplify as epilogues
7131 are now fully handled in the split part.
7133 2021-10-27 Jakub Jelinek <jakub@redhat.com>
7135 * omp-expand.c (expand_omp_for_init_counts): Handle non-rectangular
7136 iterators with pointer types.
7137 (expand_omp_for_init_vars, extract_omp_for_update_vars): Likewise.
7139 2021-10-26 Martin Sebor <msebor@redhat.com>
7141 PR tree-optimization/102238
7142 PR tree-optimization/102919
7143 * gimple-ssa-sprintf.c (get_string_length): Add an argument.
7144 (array_elt_at_offset): Move to pointer-query.
7145 (set_aggregate_size_and_offset): New function.
7146 (field_at_offset): Move to pointer-query.
7147 (get_origin_and_offset): Rename...
7148 (get_origin_and_offset_r): this. Add an argument. Make aggregate
7149 handling more robust.
7150 (get_origin_and_offset): New.
7151 (alias_offset): Add an argument.
7152 (format_string): Use subobject size determined by get_origin_and_offset.
7153 * pointer-query.cc (field_at_offset): Move from gimple-ssa-sprintf.c.
7154 Improve/correct handling of aggregates.
7155 (array_elt_at_offset): Same.
7156 * pointer-query.h (field_at_offset): Declare.
7157 (array_elt_at_offset): Declare.
7159 2021-10-26 Martin Sebor <msebor@redhat.com>
7161 * builtins.c (check_strncat_sizes): Pass access_data ctor additional
7163 (expand_builtin_memcmp): Move code to gimple-ssa-warn-access.cc.
7164 (expand_builtin_fork_or_exec): Same.
7165 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Pass
7166 compute_objsize additional arguments.
7167 (inbounds_memaccess_p): Same.
7168 (array_bounds_checker::check_array_bounds): Add an assert. Stash
7169 statement in a member.
7170 (check_array_bounds_dom_walker::before_dom_children): Same.
7171 * gimple-array-bounds.h (array_bounds_checker::m_stmt): New member.
7172 * gimple-ssa-sprintf.c (get_destination_size): Add an argument.
7173 (handle_printf_call): Pass a new argument.
7174 * gimple-ssa-warn-access.cc (get_size_range): Add an argument.
7175 (check_access): Add an argument and pass it along to callees.
7176 (check_read_access): Make a member function.
7177 (pass_waccess::check_strcat): Pass access_data ctor additional
7179 (pass_waccess::check_strncat): Same.
7180 (pass_waccess::check_stxcpy): Same.
7181 (pass_waccess::check_stxncpy): Same.
7182 (pass_waccess::check_strncmp): Same.
7183 (pass_waccess::check_read_access): Same.
7184 (pass_waccess::check_builtin): Same.
7185 (pass_waccess::maybe_check_access_sizes): Same.
7186 (pass_waccess::maybe_check_dealloc_call): Same.
7187 * gimple-ssa-warn-access.h (check_read_access): Declare a new
7189 * pointer-query.cc (compute_objsize_r): Add an argument.
7190 (gimple_call_return_array): Same.
7191 (gimple_call_alloc_size): Same.
7192 (access_ref::access_ref): Same.
7193 (access_ref::get_ref): Same.
7194 (pointer_query::get_ref): Same.
7195 (handle_min_max_size): Pass an argument to callees.
7196 (handle_array_ref): Add an argument.
7197 (handle_mem_ref): Same.
7198 (compute_objsize): Same.
7199 * pointer-query.h (struct access_ref): Adjust signatures.
7200 (struct access_data): Same.
7201 (gimple_call_alloc_size): Add an argument.
7202 (gimple_parm_array_size): Same.
7203 (compute_objsize): Same.
7204 * tree-ssa-strlen.c (strlen_pass::adjust_last_stmt): Pass an additional
7205 argument to compute_objsize.
7206 (strlen_pass::maybe_warn_overflow): Same.
7207 (maybe_diag_stxncpy_trunc): Same.
7209 2021-10-26 Martin Sebor <msebor@redhat.com>
7211 PR middle-end/102453
7212 * gimple-ssa-warn-access.cc (pass_waccess::check_atomic_builtin): New.
7213 (pass_waccess::check_atomic_builtin): Call it.
7215 2021-10-26 Vladimir N. Makarov <vmakarov@redhat.com>
7217 PR rtl-optimization/102842
7218 * lra-constraints.c (match_reload): Ignore out in checking values
7220 (curr_insn_transform): Collect outputs before doing reloads of operands.
7222 2021-10-26 Paul A. Clarke <pc@us.ibm.com>
7225 * config/rs6000/x86intrin.h: Move some included headers to new
7226 headers. Include new immintrin.h instead of those headers.
7227 * config/rs6000/immintrin.h: New.
7228 * config/rs6000/x86gprintrin.h: New.
7229 * config.gcc (powerpc*-*-*): Add new headers to extra_headers.
7231 2021-10-26 Richard Biener <rguenther@suse.de>
7233 * tree-vectorizer.h (vect_create_addr_base_for_vector_ref):
7234 Remove byte_offset parameter.
7235 (vect_create_data_ref_ptr): Likewise.
7236 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
7238 (vect_create_data_ref_ptr): Likewise.
7239 * tree-vect-stmts.c (vectorizable_store): Adjust.
7240 (vectorizable_load): Likewise.
7242 2021-10-26 Richard Biener <rguenther@suse.de>
7244 PR tree-optimization/96109
7245 * tree-vectorizer.h (dr_misalignment): Add optional offset
7247 * tree-vect-data-refs.c (dr_misalignment): Likewise. Remove
7248 offset applied for negative stride accesses.
7249 (vect_enhance_data_refs_alignment): Compute negative stride
7250 access offset and pass it to dr_misalignment.
7251 * tree-vect-stmts.c (get_negative_load_store_type): Pass
7252 negative offset to dr_misalignment.
7253 (get_group_load_store_type): Likewise.
7254 (get_load_store_type): Likewise.
7255 (vectorizable_store): Remove asserts about alignment.
7256 (vectorizable_load): Likewise.
7258 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
7260 PR tree-optimization/102897
7261 * tree-ssa-forwprop.c (simplify_permutation): Remove a wrong assertion.
7263 2021-10-26 Richard Biener <rguenther@suse.de>
7265 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
7266 Take offset in bytes.
7267 (vect_create_data_ref_ptr): Likewise.
7268 * tree-vect-loop-manip.c (get_misalign_in_elems): Multiply
7269 offset by element size.
7270 (vect_create_cond_for_align_checks): Likewise.
7271 * tree-vect-stmts.c (get_negative_load_store_type): Likewise.
7272 (vectorizable_load): Remove duplicate leftover from merge
7275 2021-10-26 Roger Sayle <roger@nextmovesoftware.com>
7277 * config/i386/i386-expand.c (ix86_expand_v1ti_shift): New helper
7278 function to expand V1TI mode logical shifts by integer constants.
7279 (ix86_expand_v1ti_rotate): New helper function to expand V1TI
7280 mode rotations by integer constants.
7281 * config/i386/i386-protos.h (ix86_expand_v1ti_shift,
7282 ix86_expand_v1ti_rotate): Prototype new functions here.
7283 * config/i386/sse.md (ashlv1ti3, lshrv1ti3, rotlv1ti3, rotrv1ti3):
7284 New TARGET_SSE2 expanders to implement V1TI shifts and rotations.
7286 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
7288 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
7289 Avoid threading circular paths.
7291 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
7293 * tree-ssa-threadbackward.c (back_threader::resolve_phi):
7294 Attempt to resolve all incoming paths to a PHI.
7295 (back_threader::resolve_def): Always return true for PHIs.
7297 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
7299 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
7300 Always try to resolve path without looking back.
7301 * tree-ssa-threadupdate.c (dump_jump_thread): Indicate whether
7302 edge is a back edge.
7304 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
7306 PR tree-optimization/102789
7307 * tree-vect-loop-manip.c (vect_update_inits_of_drs): Do not
7308 update inits of simd_lane_access.
7310 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
7312 * tree-vrp.c (vrp_simplify_cond_using_ranges): Add return type and
7313 move to vr-values.c.
7314 (simplify_casted_conds): Move to vrp_folder class.
7315 (execute_vrp): Call via vrp_folder now.
7316 * vr-values.c (simplify_cond_using_ranges_1): Call simplify_casted_cond.
7317 (simplify_using_ranges::simplify_casted_cond): Relocate from tree-vrp.c.
7318 * vr-values.h (simplify_casted_cond): Add prototype.
7320 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
7322 * tree-vrp.c (rvrp_folder::fold_stmt): If simplification fails, try
7325 2021-10-25 Paul A. Clarke <pc@us.ibm.com>
7327 * config/rs6000/smmintrin.h (_mm_testz_si128): Add "extern" to
7329 (_mm_testc_si128): Likewise.
7330 (_mm_testnzc_si128): Likewise.
7331 (_mm_blend_ps): Likewise.
7332 (_mm_blendv_ps): Likewise.
7333 (_mm_blend_pd): Likewise.
7334 (_mm_blendv_pd): Likewise.
7335 (_mm_ceil_pd): Likewise.
7336 (_mm_ceil_sd): Likewise.
7337 (_mm_ceil_ps): Likewise.
7338 (_mm_ceil_ss): Likewise.
7339 (_mm_floor_pd): Likewise.
7340 (_mm_floor_sd): Likewise.
7341 (_mm_floor_ps): Likewise.
7342 (_mm_floor_ss): Likewise.
7343 (_mm_minpos_epu16): Likewise.
7344 (_mm_mul_epi32): Likewise.
7345 (_mm_cvtepi8_epi16): Likewise.
7346 (_mm_packus_epi32): Likewise.
7347 (_mm_cmpgt_epi64): Likewise.
7349 2021-10-25 Roger Sayle <roger@nextmovesoftware.com>
7351 * simplify-rtx.c (simplify_binary_operation_1) [SS_ASHIFT]: Simplify
7352 shifts of the mode's smin_value and smax_value when the bit count
7353 operand doesn't have side-effects.
7354 [US_ASHIFT]: Likewise, simplify shifts of the mode's umax_value
7355 when the bit count operand doesn't have side-effects.
7356 (simplify_const_binary_operation) [SS_ASHIFT, US_ASHIFT]: Perform
7357 compile-time evaluation of saturating left shifts with constant
7360 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
7362 * gimple-range.cc (gimple_ranger::export_global_ranges): Remove check
7365 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
7367 * flag-types.h (enum ranger_debug): Adjust values.
7368 * params.opt (ranger_debug): Ditto.
7370 2021-10-25 Martin Jambor <mjambor@suse.cz>
7372 PR tree-optimization/102886
7373 * tree-sra.c (totally_scalarize_subtree): Fix the out of
7376 2021-10-25 Andrew Pinski <apinski@marvell.com>
7378 * tree-ssa-dce.c (simple_dce_from_worklist):
7379 Check stmt_unremovable_because_of_non_call_eh_p also
7380 before removing the statement.
7382 2021-10-25 Richard Biener <rguenther@suse.de>
7384 PR tree-optimization/102905
7385 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
7386 Use vect_supportable_dr_alignment again to determine whether
7387 an access is supported when not aligned.
7389 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
7391 * config/riscv/riscv.c (riscv_rtx_costs): Handle cost model
7394 2021-10-25 Jim Wilson <jimw@sifive.com>
7395 Kito Cheng <kito.cheng@sifive.com>
7396 Jia-Wei Chen <jiawei@iscas.ac.cn>
7397 Shi-Hua Liao <shihua@iscas.ac.cn>
7399 * config/riscv/bitmanip.md (shiftm1): New.
7400 (*bset<mode>): Ditto.
7401 (*bset<mode>_mask): Ditto.
7402 (*bset<mode>_1): Ditto.
7403 (*bset<mode>_1_mask): Ditto.
7404 (*bseti<mode>): Ditto.
7405 (*bclr<mode>): Ditto.
7406 (*bclri<mode>): Ditto.
7407 (*binv<mode>): Ditto.
7408 (*binvi<mode>): Ditto.
7409 (*bext<mode>): Ditto.
7411 * config/riscv/predicates.md (splittable_const_int_operand):
7413 (single_bit_mask_operand): New.
7414 (not_single_bit_mask_operand): Ditto.
7415 (const31_operand): Ditto.
7416 (const63_operand): Ditto.
7417 * config/riscv/riscv.c (riscv_build_integer_1): Handle bseti.
7418 (riscv_output_move): Ditto.
7419 (riscv_print_operand): Handle new operand type: T and S.
7420 * config/riscv/riscv.h (SINGLE_BIT_MASK_OPERAND): New.
7422 2021-10-25 Jim Wilson <jimw@sifive.com>
7424 * config/riscv/riscv.c (riscv_build_integer_1): Build integer
7427 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
7429 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
7431 (riscv_rtx_costs): Ditto.
7433 2021-10-25 Jim Wilson <jimw@sifive.com>
7434 Kito Cheng <kito.cheng@sifive.com>
7435 Jia-Wei Chen <jiawei@iscas.ac.cn>
7437 * config/riscv/bitmanip.md (bitmanip_bitwise): New.
7438 (bitmanip_minmax): New.
7439 (clz_ctz_pcnt): New.
7440 (bitmanip_optab): New.
7441 (bitmanip_insn): New.
7442 (*<optab>_not<mode>): New.
7443 (*xor_not<mode>): New.
7444 (<bitmanip_optab>si2): New.
7445 (*<bitmanip_optab>disi2): New.
7446 (<bitmanip_optab>di2): New.
7447 (*zero_extendhi<GPR:mode>2_bitmanip): New.
7448 (*extend<SHORT:mode><SUPERQI:mode>2_zbb): New.
7449 (*zero_extendhi<GPR:mode>2_zbb): New.
7452 (rotrsi3_sext): New.
7455 (rotlsi3_sext): New.
7456 (bswap<mode>2): New.
7457 (<bitmanip_optab><mode>3): New.
7458 * config/riscv/riscv.md (type): Add rotate.
7459 (zero_extendhi<GPR:mode>2): Change to define_expand pattern.
7460 (*zero_extendhi<GPR:mode>2): New.
7461 (extend<SHORT:mode><SUPERQI:mode>2): Change to define_expand pattern.
7462 (*extend<SHORT:mode><SUPERQI:mode>2): New.
7464 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
7466 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
7468 (riscv_rtx_costs): Ditto.
7470 2021-10-25 Jim Wilson <jimw@sifive.com>
7471 Kito Cheng <kito.cheng@sifive.com>
7472 Jia-Wei Chen <jiawei@iscas.ac.cn>
7474 * config/riscv/bitmanip.md (*zero_extendsidi2_bitmanip): New.
7479 (riscv_rtx_costs): Ditto.
7480 * config/riscv/riscv.md: Include bitmanip.md.
7481 (type): Add bitmanip type.
7482 (zero_extendsidi2): Change to define_expand pattern.
7483 (*zero_extendsidi2_internal): New.
7484 (zero_extendsidi2_shifted): Disable for ZBA.
7486 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
7488 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
7489 Add zba, zbb, zbc and zbs.
7490 (riscv_ext_flag_table): Ditto.
7491 * config/riscv/riscv-opts.h (MASK_ZBA): New.
7495 (TARGET_ZBA): Ditto.
7496 (TARGET_ZBB): Ditto.
7497 (TARGET_ZBC): Ditto.
7498 (TARGET_ZBS): Ditto.
7499 * config/riscv/riscv.opt (riscv_zb_subext): New.
7501 2021-10-25 liuhongt <hongtao.liu@intel.com>
7504 * match.pd: Simplify (_Float16) sqrtf((float) a) to .SQRT(a)
7505 when direct_internal_fn_supported_p, similar for sqrt/sqrtl.
7507 2021-10-25 Richard Biener <rguenther@suse.de>
7509 PR tree-optimization/102920
7510 * tree-ssa-sccvn.h (expressions_equal_p): Add argument
7511 controlling VN_TOP matching behavior.
7512 * tree-ssa-sccvn.c (expressions_equal_p): Likewise.
7513 (vn_phi_eq): Do not optimistically match VN_TOP.
7515 2021-10-25 konglin1 <lingling.kong@intel.com>
7517 * config/i386/sse.md (fma_<mode>_fadd_fmul): Add new
7518 define_insn_and_split.
7519 (fma_<mode>_fadd_fcmul): Likewise.
7520 (fma_<complexopname>_<mode>_fma_zero): Likewise.
7522 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
7524 * config/pa/pa-d.c (pa_d_handle_target_float_abi): Don't check
7525 TARGET_DISABLE_FPREGS.
7526 * config/pa/pa.c (fix_range): Use MASK_SOFT_FLOAT instead of
7527 MASK_DISABLE_FPREGS.
7528 (hppa_rtx_costs): Don't check TARGET_DISABLE_FPREGS. Adjust
7529 cost of hardware integer multiplication.
7530 (pa_conditional_register_usage): Don't check TARGET_DISABLE_FPREGS.
7531 * config/pa/pa.h (INT14_OK_STRICT): Likewise.
7532 * config/pa/pa.md: Don't check TARGET_DISABLE_FPREGS. Check
7533 TARGET_SOFT_FLOAT in patterns that use xmpyu instruction.
7534 * config/pa/pa.opt (mdisable-fpregs): Change target mask to
7535 SOFT_FLOAT. Revise comment.
7536 (msoft-float): New option.
7538 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
7540 * config/pa/pa.md: Don't use 'G' constraint in integer move patterns.
7542 2021-10-24 Gerald Pfeifer <gerald@pfeifer.com>
7544 * doc/install.texi (Specific): Remove obsolete details
7545 around GNU/Linux on Itanium.
7546 (Specific): Remove reference to Windows for Itanium.
7548 2021-10-23 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
7550 * config/i386/x86-tune-sched-bd.c (dispatch_group): Commentary
7553 2021-10-23 Jan Hubicka <hubicka@ucw.cz>
7555 * tree-ssa-structalias.c (compute_points_to_sets): Cleanup.
7557 2021-10-23 Roger Sayle <roger@nextmovesoftware.com>
7559 * config/i386/sse.md (<any_logic>v1ti3): New define_insn to
7560 implement V1TImode AND, IOR and XOR on TARGET_SSE2 (and above).
7561 (one_cmplv1ti2): New define expand.
7563 2021-10-22 Eric Gallager <egallager@gcc.gnu.org>
7566 * Makefile.in: Handle dvidir and install-dvi target.
7567 * configure: Regenerate.
7568 * configure.ac: Add install-dvi to target_list.
7570 2021-10-22 Gerald Pfeifer <gerald@pfeifer.com>
7572 * doc/install.texi (Binaries): Convert mingw-w64.org to https.
7575 2021-10-22 Richard Biener <rguenther@suse.de>
7577 PR tree-optimization/102893
7578 * tree-ssa-dce.c (find_obviously_necessary_stmts): Fix the
7579 test for an exit edge.
7581 2021-10-22 Aldy Hernandez <aldyh@redhat.com>
7582 Andrew MacLeod <amacleod@redhat.com>
7584 * gimple-range-path.cc (path_range_query::compute_phi_relations):
7585 Kill any global relations we may know before registering a new
7587 * value-relation.cc (path_oracle::killing_def): New.
7588 * value-relation.h (path_oracle::killing_def): New.
7590 2021-10-22 Richard Biener <rguenther@suse.de>
7593 * tree-ssa-sccvn.c (vn_phi_insert): For undefined SSA args
7595 (vn_phi_lookup): Likewise.
7597 2021-10-21 H.J. Lu <hjl.tools@gmail.com>
7600 * doc/invoke.texi: Document -fcf-protection requires i686 or
7603 2021-10-21 Eric Botcazou <ebotcazou@adacore.com>
7605 PR middle-end/102764
7606 * cfgexpand.c (expand_gimple_basic_block): Robustify latest change.
7608 2021-10-21 Jonathan Wright <jonathan.wright@arm.com>
7610 * config/aarch64/arm_neon.h (__STRUCTN): Delete function
7611 macro and all invocations.
7613 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7615 * doc/invoke.texi (ranger-debug): Document.
7616 * flag-types.h (enum ranger_debug): New.
7617 (enum evrp_mode): Remove debug values.
7618 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Use new debug flag.
7619 * gimple-range-gori.cc (gori_compute::gori_compute): Ditto.
7620 * gimple-range.cc (gimple_ranger::gimple_ranger): Ditto.
7621 * gimple-ssa-evrp.c (hybrid_folder::choose_value): Ditto.
7622 (execute_early_vrp): Use evrp-mode directly.
7623 * params.opt (enum evrp_mode): Remove debug values.
7624 (ranger-debug): New.
7625 (ranger-logical-depth): Relocate to be in alphabetical order.
7627 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7629 * doc/invoke.texi (vrp1-mode, vrp2-mode): Document.
7630 * flag-types.h (enum vrp_mode): New.
7631 * params.opt (vrp1-mode, vrp2-mode): New.
7632 * tree-vrp.c (vrp_pass_num): New.
7633 (pass_vrp::pass_vrp): Set pass number.
7634 (pass_vrp::execute): Choose which VRP mode to execute.
7636 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7638 * gimple-ssa-evrp.c (class rvrp_folder): Move to tree-vrp.c.
7639 (execute_early_vrp): For ranger only mode, invoke ranger_vrp.
7640 * tree-vrp.c (class rvrp_folder): Relocate here.
7641 (execute_ranger_vrp): New.
7642 * tree-vrp.h (execute_ranger_vrp): Export.
7644 2021-10-21 Martin Liska <mliska@suse.cz>
7648 * opts.c (finish_options): Process flag_var_tracking* options
7649 here as they can be adjusted by optimize attribute.
7650 Process also flag_syntax_only and flag_gtoggle.
7651 * toplev.c (process_options): Remove it here.
7652 * common.opt: Make debug_nonbind_markers_p as PerFunction
7653 attribute as it depends on optimization level.
7655 2021-10-21 Martin Jambor <mjambor@suse.cz>
7657 PR tree-optimization/102505
7658 * tree-sra.c (totally_scalarize_subtree): Check that the
7659 encountered field fits within the access we would like to put it
7662 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
7664 * tree-ssa-threadbackward.c
7665 (back_threader::maybe_register_path): Remove circular paths check.
7667 2021-10-21 Richard Biener <rguenther@suse.de>
7669 * toplev.c (process_options): Move the initial debug_hooks
7671 (toplev::main): ... before the call of the post_options
7674 2021-10-21 Richard Biener <rguenther@suse.de>
7676 PR tree-optimization/102847
7677 * tree-vect-stmts.c (vect_model_load_cost): Add the scalar
7678 load cost in the prologue for VMAT_INVARIANT.
7680 2021-10-21 Richard Biener <rguenther@suse.de>
7682 PR tree-optimization/102847
7683 * tree-vect-stmts.c (vect_model_load_cost): Explicitly
7684 handle VMAT_INVARIANT as a splat in the prologue.
7686 2021-10-21 Hongyu Wang <hongyu.wang@intel.com>
7689 * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector
7690 move to use the same logic as HImode.
7692 2021-10-21 Richard Biener <rguenther@suse.de>
7694 * tree-vect-slp.c (vect_build_slp_tree_1): Remove
7695 superfluous gimple_call_nothrow_p check.
7697 2021-10-21 Andrew Pinski <apinski@marvell.com>
7699 * tree-cfg.c (maybe_remove_writeonly_store): Add dce_ssa_names argument.
7700 Mark the ssa-name of the rhs as one to be removed.
7701 (execute_fixup_cfg): Update call to maybe_remove_writeonly_store.
7702 Call simple_dce_from_worklist at the end to do a simple dce.
7704 2021-10-21 Andrew Pinski <apinski@marvell.com>
7706 * tree-cfg.c (maybe_remove_writeonly_store): New function
7707 factored out from ...
7708 (execute_fixup_cfg): Here. Call maybe_remove_writeonly_store.
7710 2021-10-21 Andrew Pinski <apinski@marvell.com>
7712 * tree-cfg.c (execute_fixup_cfg): Remove comment
7713 about standalone pass.
7715 2021-10-21 Andrew Pinski <apinski@marvell.com>
7717 * tree-cfg.c (execute_fixup_cfg): Output when the statement
7718 is removed when it is a write only var.
7720 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
7722 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
7723 Avoid threading circular paths.
7725 2021-10-20 Alex Coplan <alex.coplan@arm.com>
7727 * calls.c (initialize_argument_information): Remove some dead
7728 code, remove handling for function_arg returning const_int.
7729 * doc/tm.texi: Delete documentation for unused target hooks.
7730 * doc/tm.texi.in: Likewise.
7731 * target.def (load_bounds_for_arg): Delete.
7732 (store_bounds_for_arg): Delete.
7733 (load_returned_bounds): Delete.
7734 (store_returned_bounds): Delete.
7735 * targhooks.c (default_load_bounds_for_arg): Delete.
7736 (default_store_bounds_for_arg): Delete.
7737 (default_load_returned_bounds): Delete.
7738 (default_store_returned_bounds): Delete.
7739 * targhooks.h (default_load_bounds_for_arg): Delete.
7740 (default_store_bounds_for_arg): Delete.
7741 (default_load_returned_bounds): Delete.
7742 (default_store_returned_bounds): Delete.
7744 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
7746 * doc/extend.texi (Basic Asm): Clarify that asm is not an
7748 * doc/invoke.texi (-fno-asm): Fix description for C++.
7750 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
7752 * doc/install.texi: Remove link to old.html.
7754 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7756 * config/aarch64/aarch64-simd.md (*aarch64_cmtst_same_<mode>): New.
7758 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7760 * config/aarch64/aarch64-simd.md (*aarch64_narrow_trunc<mode>): New.
7762 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7764 * config/aarch64/aarch64-simd.md (aarch64_simd_ashr<mode>): Add case cmp
7766 * config/aarch64/constraints.md (D1): New.
7768 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7770 * config/aarch64/aarch64-simd.md
7771 (*aarch64_<srn_op>topbits_shuffle<mode>_le): New.
7772 (*aarch64_topbits_shuffle<mode>_le): New.
7773 (*aarch64_<srn_op>topbits_shuffle<mode>_be): New.
7774 (*aarch64_topbits_shuffle<mode>_be): New.
7775 * config/aarch64/predicates.md
7776 (aarch64_simd_shift_imm_vec_exact_top): New.
7778 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7780 * config/aarch64/aarch64-simd.md (*aarch64_<srn_op>shrn<mode>_vect,
7781 *aarch64_<srn_op>shrn<mode>2_vect_le,
7782 *aarch64_<srn_op>shrn<mode>2_vect_be): New.
7783 * config/aarch64/iterators.md (srn_op): New.
7785 2021-10-20 Chung-Lin Tang <cltang@codesourcery.com>
7787 * omp-low.c (omp_copy_decl_2): For !ctx, use record_vars to add new copy
7789 (scan_sharing_clauses): Place copy of OMP_CLAUSE_IN_REDUCTION decl in
7790 ctx->outer instead of ctx.
7792 2021-10-20 Martin Liska <mliska@suse.cz>
7795 2021-10-19 Martin Liska <mliska@suse.cz>
7798 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
7799 * system.h (strip_whilespaces): New function.
7801 2021-10-20 Martin Liska <mliska@suse.cz>
7804 2021-10-19 Martin Liska <mliska@suse.cz>
7807 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
7810 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7812 * config/aarch64/aarch64.c (aarch64_case_values_threshold):
7813 Change to 8 with -Os, 11 otherwise.
7815 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7817 * config/aarch64/aarch64.c (neoversev1_tunings):
7818 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
7819 (neoversen2_tunings): Likewise.
7821 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7824 * config/aarch64/aarch64.md (frint_pattern): Update comment.
7825 * config/aarch64/aarch64-simd-builtins.def: Change frintn to roundeven.
7826 * config/aarch64/arm_fp16.h: Change frintn to roundeven.
7827 * config/aarch64/arm_neon.h: Likewise.
7828 * config/aarch64/iterators.md (frint_pattern): Use roundeven for FRINTN.
7830 2021-10-20 Martin Liska <mliska@suse.cz>
7832 * config/arm/arm.c (arm_unwind_emit_sequence): Do not declare
7833 already declared global variable.
7834 (arm_unwind_emit_set): Use out_file as function argument.
7835 (arm_unwind_emit): Likewise.
7836 * config/darwin.c (machopic_output_data_section_indirection): Likewise.
7837 (machopic_output_stub_indirection): Likewise.
7838 (machopic_output_indirection): Likewise.
7839 (machopic_finish): Likewise.
7840 * config/i386/i386.c (ix86_asm_output_function_label): Likewise.
7841 * config/i386/winnt.c (i386_pe_seh_unwind_emit): Likewise.
7842 * config/ia64/ia64.c (process_epilogue): Likewise.
7843 (process_cfa_adjust_cfa): Likewise.
7844 (process_cfa_register): Likewise.
7845 (process_cfa_offset): Likewise.
7846 (ia64_asm_unwind_emit): Likewise.
7847 * config/s390/s390.c (s390_asm_output_function_label): Likewise.
7849 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
7851 * config/aarch64/aarch64-builtins.c
7852 (aarch64_general_gimple_fold_builtin):
7853 Lower vld1 and vst1 variants of the neon builtins.
7854 * config/aarch64/aarch64-protos.h
7855 (aarch64_general_gimple_fold_builtin): Add gsi parameter.
7856 * config/aarch64/aarch64.c (aarch64_general_gimple_fold_builtin):
7859 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
7861 * match.pd: Generate IFN_TRUNC.
7863 2021-10-20 Richard Biener <rguenther@suse.de>
7865 PR tree-optimization/102853
7866 * tree-data-ref.c (split_constant_offset_1): Bail out
7867 immediately if the expression traps on overflow.
7869 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7871 * tree-ssa-threadbackward.c (back_threader::~back_threader): Remove.
7873 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7875 * tree-ssa-threadupdate.c (back_jt_path_registry::adjust_paths_after_duplication):
7876 Remove superfluous debugging message.
7877 (back_jt_path_registry::duplicate_thread_path): Same.
7879 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7881 * tree-ssa-threadbackward.c (back_threader_registry::back_threader_registry):
7883 (back_threader_registry::register_path): Remove m_threaded_paths.
7885 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7887 PR tree-optimization/102814
7888 * doc/invoke.texi: Document --param=max-fsm-thread-length.
7889 * params.opt: Add --param=max-fsm-thread-length.
7890 * tree-ssa-threadbackward.c
7891 (back_threader_profitability::profitable_path_p): Fail on paths
7892 longer than max-fsm-thread-length.
7894 2021-10-20 Eric Botcazou <ebotcazou@adacore.com>
7896 PR middle-end/102764
7897 * cfgexpand.c (expand_gimple_basic_block): Disregard a final debug
7898 statement to reset the current location for the outgoing edges.
7900 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7901 Richard Biener <rguenther@suse.de>
7903 * tree-ssa-threadupdate.c (cancel_thread): Dump threading reason
7904 on the same line as the threading cancellation.
7905 (jt_path_registry::cancel_invalid_paths): Avoid rotating loops.
7906 Avoid threading through loop headers where the path remains in the
7909 2021-10-20 Siddhesh Poyarekar <siddhesh@gotplt.org>
7911 * tree-object-size.c (unknown): Make into a function. Adjust
7913 (unknown_object_size): Simplify implementation.
7915 2021-10-20 Hongtao Liu <hongtao.liu@intel.com>
7916 Kewen Lin <linkw@linux.ibm.com>
7918 * doc/sourcebuild.texi (Effective-Target Keywords): Document
7919 vect_slp_v2qi_store, vect_slp_v4qi_store, vect_slp_v8qi_store,
7920 vect_slp_v16qi_store, vect_slp_v2hi_store,
7921 vect_slp_v4hi_store, vect_slp_v2si_store, vect_slp_v4si_store.
7923 2021-10-19 Jonathan Wakely <jwakely@redhat.com>
7925 * doc/extend.texi (Basic PowerPC Built-in Functions): Fix typo.
7927 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
7931 * config/rs6000/emmintrin.h: Guard POWER8 intrinsics.
7932 * config/rs6000/pmmintrin.h: Same.
7933 * config/rs6000/smmintrin.h: Same.
7934 * config/rs6000/tmmintrin.h: Same.
7936 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
7938 * config.gcc (extra_headers): Add nmmintrin.h.
7940 2021-10-19 Richard Biener <rguenther@suse.de>
7942 * tree-vectorizer.h (vect_supportable_dr_alignment): Add
7943 misalignment parameter.
7944 * tree-vect-data-refs.c (vect_get_peeling_costs_all_drs):
7945 Do not change DR_MISALIGNMENT in place, instead pass the
7946 adjusted misalignment to vect_supportable_dr_alignment.
7947 (vect_peeling_supportable): Likewise.
7948 (vect_peeling_hash_get_lowest_cost): Adjust.
7949 (vect_enhance_data_refs_alignment): Likewise.
7950 (vect_vfa_access_size): Likewise.
7951 (vect_supportable_dr_alignment): Add misalignment
7952 parameter and simplify.
7953 * tree-vect-stmts.c (get_negative_load_store_type): Adjust.
7954 (get_group_load_store_type): Likewise.
7955 (get_load_store_type): Likewise.
7957 2021-10-19 Clément Chigot <clement.chigot@atos.net>
7959 * config/rs6000/rs6000.c (rs6000_xcoff_file_end): Move
7960 __tls_get_addr reference to .text csect.
7962 2021-10-19 Martin Liska <mliska@suse.cz>
7965 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
7968 2021-10-19 Richard Biener <rguenther@suse.de>
7970 * tree-vectorizer.h (vect_get_store_cost): Adjust signature.
7971 (vect_get_load_cost): Likewise.
7972 * tree-vect-data-refs.c (vect_get_data_access_cost): Get
7973 alignment support scheme and misalignment as arguments
7975 (vect_get_peeling_costs_all_drs): Compute that info here
7976 and note that we shouldn't need to.
7977 * tree-vect-stmts.c (vect_model_store_cost): Get
7978 alignment support scheme and misalignment as arguments.
7979 (vect_get_store_cost): Likewise.
7980 (vect_model_load_cost): Likewise.
7981 (vect_get_load_cost): Likewise.
7982 (vectorizable_store): Pass down alignment support scheme
7983 and misalignment to costing.
7984 (vectorizable_load): Likewise.
7986 2021-10-19 Richard Biener <rguenther@suse.de>
7988 * tree-vect-stmts.c (get_negative_load_store_type): Add
7989 offset output parameter and initialize it.
7990 (get_group_load_store_type): Likewise.
7991 (get_load_store_type): Likewise.
7992 (vectorizable_store): Use offset as computed by
7993 get_load_store_type.
7994 (vectorizable_load): Likewise.
7996 2021-10-19 Richard Biener <rguenther@suse.de>
7998 PR tree-optimization/102827
7999 * tree-if-conv.c (predicate_statements): Add pe parameter
8000 and use that edge to insert invariant stmts on.
8001 (combine_blocks): Pass through pe.
8002 (tree_if_conversion): Compute the edge to insert invariant
8003 stmts on and pass it along.
8005 2021-10-19 Roger Sayle <roger@nextmovesoftware.com>
8008 * config/bfin/bfin.md (addsubv2hi3, subaddv2hi3, ssaddsubv2hi3,
8009 sssubaddv2hi3): Swap the order of operators in vec_concat.
8011 2021-10-19 Xionghu Luo <luoxhu@linux.ibm.com>
8013 * config/rs6000/altivec.md (*altivec_vmrghb_internal): Delete.
8014 (altivec_vmrghb_direct): New.
8015 (*altivec_vmrghh_internal): Delete.
8016 (altivec_vmrghh_direct): New.
8017 (*altivec_vmrghw_internal): Delete.
8018 (altivec_vmrghw_direct_<mode>): New.
8019 (altivec_vmrghw_direct): Delete.
8020 (*altivec_vmrglb_internal): Delete.
8021 (altivec_vmrglb_direct): New.
8022 (*altivec_vmrglh_internal): Delete.
8023 (altivec_vmrglh_direct): New.
8024 (*altivec_vmrglw_internal): Delete.
8025 (altivec_vmrglw_direct_<mode>): New.
8026 (altivec_vmrglw_direct): Delete.
8027 * config/rs6000/rs6000-p8swap.c (rtx_is_swappable_p): Adjust.
8028 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const):
8030 * config/rs6000/vsx.md (vsx_xxmrghw_<mode>): Adjust.
8031 (vsx_xxmrglw_<mode>): Adjust.
8033 2021-10-19 Aldy Hernandez <aldyh@redhat.com>
8035 * passes.def: Change threading comment before pass_ccp pass.
8037 2021-10-19 Haochen Gui <guihaoc@gcc.gnu.org>
8039 * config/rs6000/rs6000-call.c (altivec_expand_lxvr_builtin):
8040 Modify the expansion for sign extension. All extensions are done
8041 within VSX registers.
8043 2021-10-19 Richard Biener <rguenther@suse.de>
8045 * tree-vect-stmts.c (get_group_load_store_type): Add
8046 misalignment output parameter and initialize it.
8047 (get_group_load_store_type): Likewise.
8048 (vectorizable_store): Remove now redundant queries.
8049 (vectorizable_load): Likewise.
8051 2021-10-19 Richard Biener <rguenther@suse.de>
8053 * tree-vectorizer.h (vect_supportable_dr_alignment): Remove
8054 check_aligned argument.
8055 * tree-vect-data-refs.c (vect_supportable_dr_alignment):
8057 (vect_peeling_hash_insert): Add supportable_if_not_aligned
8058 argument and do not call vect_supportable_dr_alignment here.
8059 (vect_peeling_supportable): Adjust.
8060 (vect_enhance_data_refs_alignment): Compute whether the
8061 access is supported with different alignment here and
8062 pass that down to vect_peeling_hash_insert.
8063 (vect_vfa_access_size): Adjust.
8064 * tree-vect-stmts.c (vect_get_store_cost): Likewise.
8065 (vect_get_load_cost): Likewise.
8066 (get_negative_load_store_type): Likewise.
8067 (get_group_load_store_type): Likewise.
8068 (get_load_store_type): Likewise.
8070 2021-10-19 Martin Liska <mliska@suse.cz>
8073 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
8074 * system.h (strip_whilespaces): New function.
8076 2021-10-19 dianhong xu <dianhong.xu@intel.com>
8078 * config/i386/avx512fp16intrin.h:
8079 (_mm512_set1_pch): New intrinsic.
8080 * config/i386/avx512fp16vlintrin.h:
8081 (_mm256_set1_pch): New intrinsic.
8082 (_mm_set1_pch): Ditto.
8084 2021-10-18 Andrew MacLeod <amacleod@redhat.com>
8086 PR tree-optimization/102796
8087 * gimple-range.cc (gimple_ranger::range_on_edge): Process EH edges
8088 normally. Return get_tree_range for non gimple_range_ssa_p names.
8089 (gimple_ranger::range_of_stmt): Use get_tree_range for non
8090 gimple_range_ssa_p names.
8092 2021-10-18 Uroš Bizjak <ubizjak@gmail.com>
8095 * config/i386/i386.c (ix86_print_operand_address):
8096 Error out for non-address_operand asm operands.
8098 2021-10-18 Richard Biener <rguenther@suse.de>
8100 * tree-vect-data-refs.c (vect_peeling_hash_insert): Do
8101 not auto-convert dr_alignment_support to bool.
8102 (vect_peeling_supportable): Likewise.
8103 (vect_enhance_data_refs_alignment): Likewise.
8104 (vect_supportable_dr_alignment): Commonize read/write case.
8105 * tree-vect-stmts.c (vect_get_store_cost): Use
8106 dr_alignment_support, not int, for the vect_supportable_dr_alignment
8108 (vect_get_load_cost): Likewise.
8110 2021-10-18 Siddhesh Poyarekar <siddhesh@gotplt.org>
8112 * tree-object-size.c (object_sizes_execute): Consolidate LHS
8113 null check and do it early.
8115 2021-10-18 Richard Biener <rguenther@suse.de>
8117 * tree-vect-stmts.c (vectorizable_store): Use the
8118 computed alignment scheme instead of querying
8121 2021-10-18 Richard Biener <rguenther@suse.de>
8123 * tree-vect-stmts.c (vectorizable_store): Do not recompute
8124 alignment scheme already determined by get_load_store_type.
8126 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
8128 * tree-ssa-threadbackward.c (class pass_thread_jumps_full):
8129 Clone corresponding pass.
8131 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
8133 * combine.c (recog_for_combine): For an unrecognized move/set of
8134 a constant, try force_const_mem to place it in the constant pool.
8135 * cse.c (constant_pool_entries_cost, constant_pool_entries_regcost):
8136 Delete global variables (that are no longer assigned a cost value).
8137 (cse_insn): Simplify logic for deciding whether to place a folded
8138 constant in the constant pool using force_const_mem.
8139 (cse_main): Remove zero initialization of constant_pool_entries_cost
8140 and constant_pool_entries_regcost.
8141 * config/i386/i386.c (ix86_rtx_costs): Make memory accesses
8142 fractionally more expensive, when optimizing for speed.
8144 2021-10-18 Martin Liska <mliska@suse.cz>
8146 PR gcov-profile/102746
8147 PR gcov-profile/102747
8148 * gcov.c (main): Return return_code.
8149 (output_gcov_file): Mark return_code when error happens.
8150 (generate_results): Likewise.
8151 (read_graph_file): Likewise.
8152 (read_count_file): Likewise.
8154 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
8156 * config/bfin/bfin.md (define_constants): Remove UNSPEC_ONES.
8157 (define_insn "ones"): Replace UNSPEC_ONES with a truncate of
8158 a popcount, allowing compile-time evaluation/simplification.
8159 (popcountsi2, popcounthi2): New expanders using a "ones" insn.
8161 2021-10-18 Richard Biener <rguenther@suse.de>
8163 PR tree-optimization/102788
8164 * tree-vect-patterns.c (vect_init_pattern_stmt): Allow
8166 (vect_pattern_recog_1): Likewise.
8167 (vect_recog_bool_pattern): Continue matching the pattern
8168 even if we do not have a vector type for a conversion
8171 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
8173 * simplify-rtx.c (simplify_const_unary_operation) [SS_NEG, SS_ABS]:
8174 Evaluate SS_NEG and SS_ABS of a constant argument.
8176 2021-10-18 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
8179 * gimple-match-head.c (try_conditional_simplification): Add case for single operand.
8180 * internal-fn.def: Add entry for COND_NEG internal function.
8181 * internal-fn.c (FOR_EACH_CODE_MAPPING): Add entry for
8182 NEGATE_EXPR, COND_NEG mapping.
8183 * optabs.def: Add entry for cond_neg_optab.
8184 * match.pd (UNCOND_UNARY, COND_UNARY): New operator lists.
8185 (vec_cond COND (foo A) B) -> (IFN_COND_FOO COND A B): New pattern.
8186 (vec_cond COND B (foo A)) -> (IFN_COND_FOO ~COND A B): Likewise.
8188 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
8190 * tree-ssa-strlen.c (class strlen_pass): Rename from
8192 (handle_builtin_stxncpy_strncat): Move to strlen_pass.
8193 (handle_assign): Same.
8194 (adjust_last_stmt): Same.
8195 (maybe_warn_overflow): Same.
8196 (handle_builtin_strlen): Same.
8197 (handle_builtin_strchr): Same.
8198 (handle_builtin_strcpy): Same.
8199 (handle_builtin_strncat): Same.
8200 (handle_builtin_stxncpy_strncat): Same.
8201 (handle_builtin_memcpy): Same.
8202 (handle_builtin_strcat): Same.
8203 (handle_alloc_call): Same.
8204 (handle_builtin_memset): Same.
8205 (handle_builtin_memcmp): Same.
8206 (get_len_or_size): Same.
8207 (strxcmp_eqz_result): Same.
8208 (handle_builtin_string_cmp): Same.
8209 (handle_pointer_plus): Same.
8210 (count_nonzero_bytes_addr): Same.
8211 (count_nonzero_bytes): Same.
8212 (handle_store): Same.
8213 (strlen_check_and_optimize_call): Same.
8214 (handle_integral_assign): Same.
8215 (check_and_optimize_stmt): Same.
8216 (printf_strlen_execute): Rename strlen_dom_walker to strlen_pass.
8218 2021-10-18 Richard Biener <rguenther@suse.de>
8220 PR tree-optimization/102798
8221 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
8222 Only copy points-to info to newly generated SSA names.
8224 2021-10-18 Martin Liska <mliska@suse.cz>
8226 * dbgcnt.c (dbg_cnt_process_opt): Remove unused but set variable.
8227 * gcov.c (get_cycles_count): Likewise.
8228 * lto-compress.c (lto_compression_zlib): Likewise.
8229 (lto_uncompression_zlib): Likewise.
8230 * targhooks.c (default_pch_valid_p): Likewise.
8232 2021-10-17 Aldy Hernandez <aldyh@redhat.com>
8234 * tree-pass.h (make_pass_thread_jumps_full): New.
8235 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Inline.
8236 (try_thread_blocks): Add resolve and speed arguments.
8237 (pass_thread_jumps::execute): Inline.
8238 (do_early_thread_jumps): New.
8239 (do_thread_jumps): New.
8240 (make_pass_thread_jumps): Move.
8241 (pass_early_thread_jumps::gate): Inline.
8242 (pass_early_thread_jumps::execute): Inline.
8243 (class pass_thread_jumps_full): New.
8245 2021-10-16 Piotr Kubaj <pkubaj@FreeBSD.org>
8247 * configure.ac: Treat powerpc64*-*-freebsd* the same as
8248 powerpc64-*-freebsd*.
8249 * configure: Regenerate.
8251 2021-10-16 H.J. Lu <hjl.tools@gmail.com>
8253 * value-query.cc (get_ssa_name_ptr_info_nonnull): Change
8254 set_ptr_nonull to set_ptr_nonnull in comments.
8256 2021-10-16 Jan Hubicka <hubicka@ucw.cz>
8258 PR tree-optimization/102720
8259 * tree-ssa-structalias.c (compute_points_to_sets): Fix producing
8260 of call used and clobbered sets.
8262 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
8264 * config/darwin.h (LINK_COMMAND_SPEC_A): Update 'r' handling to
8265 skip gomp and itm when r or nodefaultlibs is given.
8266 (DSYMUTIL_SPEC): Do not call dsymutil for '-r' link lines.
8267 Update ordering of exclusions, remove duplicate 'v' addition
8268 (collect2 will add this from the main command line).
8270 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
8272 * config/darwin-driver.c (darwin_driver_init): Revise comments, handle
8273 filelist and framework options in specs instead of code.
8274 * config/darwin.h (SUBTARGET_DRIVER_SELF_SPECS): Update to handle link
8275 specs that are really driver ones.
8276 (DARWIN_CC1_SPEC): Likewise.
8277 (CPP_SPEC): Likewise.
8278 (SYSROOT_SPEC): Append space.
8279 (LINK_SYSROOT_SPEC): Remove most driver link specs.
8280 (STANDARD_STARTFILE_PREFIX_2): Update link-related specs.
8281 (STARTFILE_SPEC): Likewise.
8282 (ASM_MMACOSX_VERSION_MIN_SPEC): Fix line wrap.
8283 (ASM_SPEC): Update driver-related specs.
8284 (ASM_FINAL_SPEC): Likewise.
8285 * config/darwin.opt: Remove now unused option aliases.
8286 * config/i386/darwin.h (EXTRA_ASM_OPTS): Ensure space after opt.
8287 (ASM_SPEC): Update driver-related specs.
8289 2021-10-15 Roger Sayle <roger@nextmovesoftware.com>
8291 * config/i386/i386.c (ix86_hardreg_mov_ok): For vector modes,
8292 allow standard_sse_constant_p immediate constants.
8294 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
8296 * config.gcc: Add tm-dwarf2.h to tm_d-file.
8298 2021-10-15 Andrew MacLeod <amacleod@redhat.com>
8300 * gimple-range-fold.h (gimple_range_ssa_p): Don't process names
8301 that occur in abnormal phis.
8302 * gimple-range.cc (gimple_ranger::range_on_edge): Return false for
8303 abnormal and EH edges.
8304 * gimple-ssa-evrp.c (rvrp_folder::value_of_expr): Ditto.
8305 (rvrp_folder::value_on_edge): Ditto.
8306 (rvrp_folder::value_of_stmt): Ditto.
8307 (hybrid_folder::value_of_expr): Ditto for ranger queries.
8308 (hybrid_folder::value_on_edge): Ditto.
8309 (hybrid_folder::value_of_stmt): Ditto.
8310 * value-query.cc (gimple_range_global): Always return a range if
8311 the type is supported.
8313 2021-10-15 John David Anglin <danglin@gcc.gnu.org>
8315 * config/pa/pa.md: Consistently use "rG" constraint for copy
8316 instruction in move patterns.
8318 2021-10-15 Andrew Stubbs <ams@codesourcery.com>
8320 * config/gcn/gcn-hsa.h (S_FIJI): Set unconditionally.
8323 * config/gcn/gcn.c: Hard code SRAM ECC settings for old architectures.
8324 * config/gcn/mkoffload.c (ELFABIVERSION_AMDGPU_HSA): Rename to ...
8325 (ELFABIVERSION_AMDGPU_HSA_V3): ... this.
8326 (ELFABIVERSION_AMDGPU_HSA_V4): New.
8327 (SET_SRAM_ECC_UNSUPPORTED): New.
8328 (copy_early_debug_info): Create elf flags to match the other objects.
8329 (main): Just let the attribute flags pass through.
8331 2021-10-15 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8333 * tree-loop-distribution.c (reduction_var_overflows_first):
8334 Pass the type of reduction_var as first argument as it is also
8335 done for the load type.
8336 (loop_distribution::transform_reduction_loop): Add missing
8337 TREE_TYPE while determining precision of reduction_var.
8339 2021-10-15 Richard Biener <rguenther@suse.de>
8341 * defaults.h (PREFERRED_DEBUGGING_TYPE): Choose DWARF2_DEBUG
8343 * toplev.c (process_options): Warn when STABS debugging is
8344 enabled but not the preferred format.
8345 * config/pa/som.h (PREFERRED_DEBUGGING_TYPE): Define to
8347 * config/pdp11/pdp11.h (PREFERRED_DEBUGGING_TYPE): Likewise.
8349 2021-10-15 Richard Biener <rguenther@suse.de>
8352 * tree-inline.c (copy_bb): Avoid underflowing nargs.
8354 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
8356 * config/i386/i386-expand.c (ix86_expand_vec_perm): Convert
8357 HFmode input operand to HImode.
8358 (ix86_vectorize_vec_perm_const): Likewise.
8359 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1_hf):
8361 (*avx512f_permvar_truncv8siv8hi_1_hf):
8364 2021-10-15 Richard Biener <rguenther@suse.de>
8366 PR middle-end/102682
8367 * expmed.c (store_bit_field_1): Ensure a LHS subreg would
8368 not create a paradoxical subreg.
8370 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
8372 * config/i386/i386-expand.c (ix86_expand_vector_init):
8373 For half_vector concat for HFmode, handle them like HImode.
8375 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8377 * tree-ssa-threadbackward.c (class back_threader): Add m_resolve.
8378 (back_threader::back_threader): Same.
8379 (back_threader::resolve_phi): Try to solve without looking back if
8381 (back_threader::find_paths_to_names): Same.
8382 (try_thread_blocks): Pass resolve argument to back threader.
8383 (pass_early_thread_jumps::execute): Same.
8385 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8387 * doc/invoke.texi: Remove max-fsm-thread-length,
8388 max-fsm-thread-paths, and fsm-maximum-phi-arguments.
8390 * tree-ssa-threadbackward.c (back_threader::back_threader): Remove
8392 (back_threader_registry::back_threader_registry): Same.
8393 (back_threader_profitability::profitable_path_p): Remove
8394 param_max_fsm_thread-length.
8395 (back_threader_registry::register_path): Remove
8396 m_max_allowable_paths.
8398 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8400 * tree-ssa-threadbackward.c (class back_threader): Make m_imports
8402 (back_threader::~back_threader): Do not release m_path.
8404 2021-10-14 Andrew MacLeod <amacleod@redhat.com>
8406 PR tree-optimization/102738
8407 * vr-values.c (simplify_using_ranges::simplify): Handle RSHIFT_EXPR.
8409 2021-10-14 Kwok Cheung Yeung <kcy@codesourcery.com>
8411 * omp-general.c (omp_check_context_selector): Move from c-omp.c.
8412 (omp_mark_declare_variant): Move from c-omp.c.
8413 (omp_context_name_list_prop): Update for Fortran strings.
8414 * omp-general.h (omp_check_context_selector): New prototype.
8415 (omp_mark_declare_variant): New prototype.
8417 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
8419 * config/rs6000/rs6000.c (rs6000_density_test): Move early
8420 exit test further up the function.
8422 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
8424 * config/arm/arm.c (arm_add_stmt_cost): Delete.
8425 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
8427 2021-10-14 Martin Jambor <mjambor@suse.cz>
8429 * doc/invoke.texi (Optimize Options): Add entry for
8430 ipa-cp-recursive-freq-factor.
8432 2021-10-14 Tamar Christina <tamar.christina@arm.com>
8434 * match.pd: New rule.
8436 2021-10-14 Jan Hubicka <hubicka@ucw.cz>
8439 * ipa-modref-tree.h (modref_access_node::update2):
8440 Also check that parm_offset is unchanged.
8441 (modref_ref_node::insert_access): Fix updating of
8444 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8446 * tree-ssa-threadbackward.c (back_threader::resolve_phi): Add
8449 2021-10-14 Richard Biener <rguenther@suse.de>
8451 PR tree-optimization/102659
8452 * tree-if-conv.c (if_convertible_gimple_assign_stmt_p): Also
8453 rewrite pointer typed undefined overflow operations.
8454 (predicate_statements): Likewise. Make sure to emit invariant
8455 conversions in the preheader.
8456 * tree-vectorizer.c (vect_loop_vectorized_call): Look through
8457 non-empty preheaders.
8458 * tree-data-ref.c (dr_analyze_indices): Strip useless
8459 conversions to the MEM_REF base type.
8461 2021-10-14 Martin Liska <mliska@suse.cz>
8463 * common.opt: Stop using AUTODETECT_VALUE
8464 and use EnabledBy where possible.
8465 * opts.c: Enable OPT_fvar_tracking with optimize >= 1.
8466 * toplev.c (AUTODETECT_VALUE): Remove macro.
8467 (process_options): Simplify by using EnabledBy and
8468 OPT_fvar_tracking. Use OPTION_SET_P macro instead of
8471 2021-10-14 Jonathan Wright <jonathan.wright@arm.com>
8473 * config/aarch64/arm_neon.h (vld1_s8_x3): Use signed type for
8475 (vld1_s32_x3): Likewise.
8477 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8479 PR tree-optimization/102736
8481 * gimple-range-path.cc (path_range_query::range_on_path_entry):
8482 Assert that the requested range is defined outside the path.
8483 (path_range_query::ssa_range_in_phi): Do not call
8484 range_on_path_entry for SSA names that are defined within the
8487 2021-10-14 Iain Sandoe <iain@sandoe.co.uk>
8489 * config/darwin-driver.c (darwin_find_version_from_kernel):
8490 Quote internal identifiers and avoid contractions in
8492 (darwin_default_min_version): Likewise.
8493 (darwin_driver_init): Likewise.
8495 2021-10-14 Martin Jambor <mjambor@suse.cz>
8497 * params.opt (ipa-cp-recursive-freq-factor): New.
8498 * ipa-cp.c (ipcp_value): Switch to inline initialization. New members
8499 scc_no, self_recursion_generated_level, same_scc and
8500 self_recursion_generated_p.
8501 (ipcp_lattice::add_value): Replaced parameter unlimited with
8502 same_lat_gen_level, use it to determine the limit of values and store it to
8504 (ipcp_lattice<valtype>::print): Dump the new fields.
8505 (allocate_and_init_ipcp_value): Take same_lat_gen_level as a new
8506 parameter and store it to the new value.
8507 (self_recursively_generated_p): Removed.
8508 (propagate_vals_across_arith_jfunc): Use self_recursion_generated_p
8509 instead of self_recursively_generated_p, store self generation level
8511 (value_topo_info<valtype>::add_val): Set scc_no.
8512 (value_topo_info<valtype>::propagate_effects): Multiply frequencies of
8513 recursively feeding values and self generated values by appropriate
8516 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
8518 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Remove
8519 redundant test for flag_vect_cost_model.
8521 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8523 * bitmap.c (debug): New overloaded function for auto_bitmaps.
8524 * bitmap.h (debug): Same.
8526 2021-10-14 Hongyu Wang <hongyu.wang@intel.com>
8528 * config/i386/avx512fp16intrin.h (_mm512_mask_fcmadd_pch):
8529 Adjust builtin call.
8530 (_mm512_mask3_fcmadd_pch): Likewise.
8531 (_mm512_mask_fmadd_pch): Likewise.
8532 (_mm512_mask3_fmadd_pch): Likewise.
8533 (_mm512_mask_fcmadd_round_pch): Likewise.
8534 (_mm512_mask3_fcmadd_round_pch): Likewise.
8535 (_mm512_mask_fmadd_round_pch): Likewise.
8536 (_mm512_mask3_fmadd_round_pch): Likewise.
8537 (_mm_mask_fcmadd_sch): Likewise.
8538 (_mm_mask3_fcmadd_sch): Likewise.
8539 (_mm_mask_fmadd_sch): Likewise.
8540 (_mm_mask3_fmadd_sch): Likewise.
8541 (_mm_mask_fcmadd_round_sch): Likewise.
8542 (_mm_mask3_fcmadd_round_sch): Likewise.
8543 (_mm_mask_fmadd_round_sch): Likewise.
8544 (_mm_mask3_fmadd_round_sch): Likewise.
8545 (_mm_fcmadd_round_sch): Likewise.
8546 * config/i386/avx512fp16vlintrin.h (_mm_mask_fmadd_pch):
8547 Adjust builtin call.
8548 (_mm_mask3_fmadd_pch): Likewise.
8549 (_mm256_mask_fmadd_pch): Likewise.
8550 (_mm256_mask3_fmadd_pch): Likewise.
8551 (_mm_mask_fcmadd_pch): Likewise.
8552 (_mm_mask3_fcmadd_pch): Likewise.
8553 (_mm256_mask_fcmadd_pch): Likewise.
8554 (_mm256_mask3_fcmadd_pch): Likewise.
8555 * config/i386/i386-builtin.def: Add mask3 builtin for complex
8556 fma, and adjust mask_builtin to corresponding expander.
8557 * config/i386/i386-expand.c (ix86_expand_round_builtin):
8558 Skip erasing embedded rounding for expanders that emit
8560 * config/i386/sse.md (complexmove): New mode_attr.
8561 (<avx512>_fmaddc_<mode>_mask1<round_expand_name>): New expander.
8562 (<avx512>_fcmaddc_<mode>_mask1<round_expand_name>): Likewise.
8563 (avx512fp16_fmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
8564 (avx512fp16_fcmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
8565 (avx512fp16_fcmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
8566 (avx512fp16_fmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
8567 * config/i386/subst.md (round_embedded_complex): New subst.
8569 2021-10-14 John David Anglin <danglin@gcc.gnu.org>
8571 * config/pa/pa.md (cbranchsf4): Disable if TARGET_SOFT_FLOAT.
8572 (cbranchdf4): Likewise.
8573 Add missing move patterns for TARGET_SOFT_FLOAT.
8575 2021-10-13 Roger Sayle <roger@nextmovesoftware.com>
8577 * config/i386/i386-expand.c (ix86_expand_vector_move): Use a
8578 pseudo intermediate when moving a SUBREG into a hard register,
8579 by checking ix86_hardreg_mov_ok.
8580 (ix86_expand_vector_extract): Store zero-extended SImode
8581 intermediate in a pseudo, then set target using a SUBREG_PROMOTED
8583 * config/i386/sse.md (mov<VMOVE>_internal): Prevent CSE creating
8584 complex (SUBREG) sets of (vector) hard registers before reload, by
8585 checking ix86_hardreg_mov_ok.
8587 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
8589 * ctfc.h (enum ctf_dtu_d_union_enum): Remove redundant comma.
8591 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
8593 * dwarf2ctf.c (gen_ctf_array_type): Fix typo in comment.
8595 2021-10-13 Martin Sebor <msebor@redhat.com>
8597 PR middle-end/102630
8598 * pointer-query.cc (compute_objsize_r): Handle named address spaces.
8600 2021-10-13 Iain Sandoe <iain@sandoe.co.uk>
8602 * collect2.c (is_lto_object_file): Release simple-object
8603 resources, close files.
8605 2021-10-13 Andre Vieira <andre.simoesdiasvieira@arm.com>
8607 * config/arm/arm.c (thumb2_legitimate_address_p): Use VALID_MVE_MODE
8608 when checking mve addressing modes.
8609 (mve_vector_mem_operand): Fix the way we handle pre, post and offset
8611 (arm_print_operand): Fix printing of POST_ and PRE_MODIFY.
8612 * config/arm/mve.md: Use mve_memory_operand predicate everywhere where
8613 there is a single Ux constraint.
8615 2021-10-13 John David Anglin <danglin@gcc.gnu.org>
8617 * config/pa/pa.md (muldi3): Add support for inlining 64-bit
8618 multiplication on 32-bit PA 1.1 and 2.0 targets.
8620 2021-10-13 Martin Liska <mliska@suse.cz>
8622 PR gcov-profile/90364
8623 * coverage.c (build_info): Emit checksum to the global variable.
8624 (build_info_type): Add new field for checksum.
8625 (coverage_obj_finish): Pass object_checksum.
8626 (coverage_init): Use 0 as checksum for .gcno files.
8627 * gcov-dump.c (dump_gcov_file): Dump also new checksum field.
8628 * gcov.c (read_graph_file): Read also checksum.
8629 * doc/invoke.texi: Document the behaviour change.
8631 2021-10-13 Richard Biener <rguenther@suse.de>
8633 * gimple-iterator.h (gsi_iterator_update): Add GSI_LAST_NEW_STMT,
8634 start at integer value 2.
8635 * gimple-iterator.c (gsi_insert_seq_nodes_before): Update
8636 the iterator for GSI_LAST_NEW_STMT.
8637 (gsi_insert_seq_nodes_after): Likewise.
8638 * tree-if-conv.c (predicate_statements): Use GSI_LAST_NEW_STMT.
8639 * tree-ssa.c (execute_update_addresses_taken): Correct bogus
8640 arguments to gsi_replace.
8642 2021-10-13 Martin Liska <mliska@suse.cz>
8645 * common.opt: Use EnabledBy instead of detection in
8646 finish_options and process_options.
8647 * opts.c (finish_options): Remove handling of
8648 x_flag_unroll_all_loops.
8649 * toplev.c (process_options): Likewise for flag_web and
8650 flag_rename_registers.
8652 2021-10-13 Richard Biener <rguenther@suse.de>
8654 PR tree-optimization/102659
8655 * tree-if-conv.c (need_to_rewrite_undefined): New flag.
8656 (if_convertible_gimple_assign_stmt_p): Mark the loop for
8657 rewrite when stmts with undefined behavior on integer
8659 (combine_blocks): Predicate also when we need to rewrite stmts.
8660 (predicate_statements): Rewrite affected stmts to something
8661 with well-defined behavior on overflow.
8662 (tree_if_conversion): Initialize need_to_rewrite_undefined.
8664 2021-10-13 Richard Biener <rguenther@suse.de>
8667 * ipa-sra.c (ptr_parm_has_nonarg_uses): Fix volatileness
8670 2021-10-13 Jose E. Marchesi <jose.marchesi@oracle.com>
8672 * dwarf2ctf.c: Fix typo in comment.
8674 2021-10-12 Bill Schmidt <wschmidt@linux.ibm.com>
8677 * config/rs6000/altivec.h (vec_cpsgn): Swap operand order.
8678 * config/rs6000/rs6000-overload.def (VEC_COPYSIGN): Use SKIP to
8679 avoid generating an automatic #define of vec_cpsgn. Use the
8680 correct built-in for V4SFmode that doesn't depend on VSX.
8682 2021-10-12 Uroš Bizjak <ubizjak@gmail.com>
8686 * config/i386/i386.md (*add<mode>_1_slp): Rewrite as
8687 define_insn_and_split pattern. Add alternative 1 and split it
8688 post reload to insert operand 1 into the low part of operand 0.
8689 (*sub<mode>_1_slp): Ditto.
8690 (*and<mode>_1_slp): Ditto.
8691 (*<any_or:code><mode>_1_slp): Ditto.
8692 (*ashl<mode>3_1_slp): Ditto.
8693 (*<any_shiftrt:insn><mode>3_1_slp): Ditto.
8694 (*<any_rotate:insn><mode>3_1_slp): Ditto.
8695 (*neg<mode>_1_slp): New insn_and_split pattern.
8696 (*one_cmpl<mode>_1_slp): Ditto.
8698 2021-10-12 David Edelsohn <dje.gcc@gmail.com>
8700 * doc/install.texi: Update MinGW and mingw-64 Binaries
8703 2021-10-12 Daniel Le Duc Khoi Nguyen <greenrecyclebin@gmail.com>
8705 * doc/extend.texi (Common Variable Attributes): Fix typos in
8706 alloc_size documentation.
8708 2021-10-12 Richard Biener <rguenther@suse.de>
8710 PR tree-optimization/102696
8711 * tree-vect-slp.c (vect_build_slp_tree_2): Properly mark
8712 the tree fatally failed when we reject a BIT_FIELD_REF.
8714 2021-10-12 Richard Biener <rguenther@suse.de>
8716 PR tree-optimization/102572
8717 * tree-vect-stmts.c (vect_build_gather_load_calls): When
8718 gathering the vectorized defs for the mask pass in the
8719 desired mask vector type so invariants will be handled
8722 2021-10-12 Tamar Christina <tamar.christina@arm.com>
8724 * config/aarch64/aarch64-sve.md (*fcm<cmp_op><mode>_bic_combine,
8725 *fcm<cmp_op><mode>_nor_combine, *fcmuo<mode>_bic_combine,
8726 *fcmuo<mode>_nor_combine): New.
8728 2021-10-12 Eric Botcazou <ebotcazou@adacore.com>
8731 * config/sparc/sparc-modes.def (OI): New integer mode.
8733 2021-10-12 Jakub Jelinek <jakub@redhat.com>
8735 * gimple-fold.h (clear_padding_type_may_have_padding_p): Declare.
8736 * gimple-fold.c (clear_padding_type_may_have_padding_p): No longer
8739 2021-10-12 Jakub Jelinek <jakub@redhat.com>
8741 * tree-vectorizer.h (loop_cost_model): New function.
8742 (unlimited_cost_model): Use it.
8743 * tree-vect-loop.c (vect_analyze_loop_costing): Use loop_cost_model
8744 call instead of flag_vect_cost_model.
8745 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): Likewise.
8746 (vect_prune_runtime_alias_test_list): Likewise. Also use it instead
8747 of flag_simd_cost_model.
8749 2021-10-12 liuhongt <hongtao.liu@intel.com>
8752 * config/i386/i386-expand.c (emit_reduc_half): Handle
8754 * config/i386/mmx.md (reduc_<code>_scal_v4qi): New expander.
8755 (reduc_plus_scal_v4qi): Ditto.
8757 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8759 * config/rs6000/smmintrin.h (_mm_cmpeq_epi64, _mm_cmpgt_epi64,
8760 _mm_mullo_epi32, _mm_mul_epi32, _mm_packus_epi32): New.
8761 * config/rs6000/nmmintrin.h: Copy from i386, tweak to suit.
8763 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8765 * config/rs6000/smmintrin.h (_mm_cvtepi8_epi16, _mm_cvtepi8_epi32,
8766 _mm_cvtepi8_epi64, _mm_cvtepi16_epi32, _mm_cvtepi16_epi64,
8767 _mm_cvtepi32_epi64, _mm_cvtepu8_epi16, _mm_cvtepu8_epi32,
8768 _mm_cvtepu8_epi64, _mm_cvtepu16_epi32, _mm_cvtepu16_epi64,
8769 _mm_cvtepu32_epi64): New.
8771 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8773 * config/rs6000/smmintrin.h (_mm_test_all_zeros,
8774 _mm_test_all_ones, _mm_test_mix_ones_zeros): Rewrite as macro.
8776 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8778 * config/rs6000/smmintrin.h (_mm_min_epi8, _mm_min_epu16,
8779 _mm_min_epi32, _mm_min_epu32, _mm_max_epi8, _mm_max_epu16,
8780 _mm_max_epi32, _mm_max_epu32): New.
8782 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
8784 * ipa-modref-tree.h (struct modref_access_node): Revert
8786 (struct modref_ref_node): Likewise.
8788 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
8790 * ipa-modref-tree.h (modref_tree::global_access_p): New member
8793 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
8794 ignore_stores_eaf_flags): Move to ipa-modref.h.
8795 (remove_useless_eaf_flags): Remove early exit on NOCLOBBER.
8796 (modref_summary::global_memory_read_p): New member function.
8797 (modref_summary::global_memory_written_p): New member function.
8798 * ipa-modref.h (modref_summary::global_memory_read_p,
8799 modref_summary::global_memory_written_p): Declare.
8800 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
8801 ignore_stores_eaf_flags): Move here.
8802 * tree-ssa-structalias.c: Include ipa-modref-tree.h, ipa-modref.h
8804 (handle_rhs_call): Rewrite.
8805 (handle_call_arg): New function.
8806 (determine_global_memory_access): New function.
8807 (handle_const_call): Remove.
8808 (handle_pure_call): Remove.
8809 (find_func_aliases_for_call): Update use of handle_rhs_call.
8810 (compute_points_to_sets): Handle global memory accesses
8813 2021-10-11 Diane Meirowitz <diane.meirowitz@oracle.com>
8815 * doc/invoke.texi: Add link to UndefinedBehaviorSanitizer
8816 documentation, mention UBSAN_OPTIONS, similar to what is done
8817 for AddressSanitizer.
8819 2021-10-11 Richard Biener <rguenther@suse.de>
8821 PR middle-end/102683
8822 * internal-fn.c (expand_DEFERRED_INIT): Check for mode
8823 availability before building an integer type for storage
8826 2021-10-11 Richard Biener <rguenther@suse.de>
8828 PR middle-end/101480
8829 * gimple.c (gimple_call_fnspec): Do not mark operator new/delete
8832 2021-10-11 Martin Liska <mliska@suse.cz>
8834 * common.opt: Remove Init(2) for some options.
8835 * toplev.c (process_options): Do not use AUTODETECT_VALUE;
8836 use OPTION_SET_P instead.
8838 2021-10-11 Martin Liska <mliska@suse.cz>
8840 * common.opt: Remove usage of IRA_REGION_AUTODETECT.
8841 * flag-types.h (enum ira_region): Likewise.
8842 * toplev.c (process_options): Use OPTION_SET_P instead of
8843 IRA_REGION_AUTODETECT.
8845 2021-10-11 Jakub Jelinek <jakub@redhat.com>
8847 * omp-low.c (omp_runtime_api_call): Handle omp_get_max_teams,
8848 omp_[sg]et_teams_thread_limit and omp_set_num_teams.
8850 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8852 * config/s390/s390-protos.h (s390_rawmemchr): Add prototype.
8853 * config/s390/s390.c (s390_rawmemchr): New function.
8854 * config/s390/s390.md (rawmemchr<SINT:mode>): New expander.
8855 * config/s390/vector.md (@vec_vfees<mode>): Basically a copy of
8856 the pattern vfees<mode> from vx-builtins.md.
8857 * config/s390/vx-builtins.md (*vfees<mode>): Remove.
8859 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8861 * builtins.c (get_memory_rtx): Change to external linkage.
8862 * builtins.h (get_memory_rtx): Add function prototype.
8863 * doc/md.texi (rawmemchr<mode>): Document.
8864 * internal-fn.c (expand_RAWMEMCHR): Define.
8865 * internal-fn.def (RAWMEMCHR): Add.
8866 * optabs.def (rawmemchr_optab): Add.
8867 * tree-loop-distribution.c (find_single_drs): Change return code
8868 behaviour by also returning true if no single store was found
8870 (loop_distribution::classify_partition): Respect the new return
8871 code behaviour of function find_single_drs.
8872 (loop_distribution::execute): Call new function
8873 transform_reduction_loop in order to replace rawmemchr or strlen
8874 like loops by calls into builtins.
8875 (generate_reduction_builtin_1): New function.
8876 (generate_rawmemchr_builtin): New function.
8877 (generate_strlen_builtin_1): New function.
8878 (generate_strlen_builtin): New function.
8879 (generate_strlen_builtin_using_rawmemchr): New function.
8880 (reduction_var_overflows_first): New function.
8881 (determine_reduction_stmt_1): New function.
8882 (determine_reduction_stmt): New function.
8883 (loop_distribution::transform_reduction_loop): New function.
8885 2021-10-11 Martin Liska <mliska@suse.cz>
8887 * tree.c (cl_option_hasher::hash): Use cl_optimization_hash
8888 and remove legacy hashing code.
8890 2021-10-11 Kito Cheng <kito.cheng@sifive.com>
8893 * builtins.c (maybe_emit_call_builtin___clear_cache): Allow
8894 CONST_INT for BEGIN and END, and use gcc_assert rather than
8897 2021-10-10 Jakub Jelinek <jakub@redhat.com>
8900 * var-tracking.c (add_stores): For cselib_sp_derived_value_p values
8901 use MO_VAL_SET if loc is not sp.
8903 2021-10-10 Andrew Pinski <apinski@marvell.com>
8905 PR tree-optimization/102622
8906 * match.pd: Swap the order of a?pow2cst:0 and a?-1:0 transformations.
8907 Swap the order of a?0:pow2cst and a?0:-1 transformations.
8909 2021-10-09 liuhongt <hongtao.liu@intel.com>
8912 * config/i386/i386-expand.c (ix86_valid_mask_cmp_mode): Handle
8914 (ix86_use_mask_cmp_p): Ditto.
8915 (ix86_expand_sse_movcc): Ditto.
8916 * config/i386/i386.md (setcc_hf_mask): New define_insn.
8917 (movhf_mask): Ditto.
8918 (UNSPEC_MOVCC_MASK): New unspec.
8919 * config/i386/sse.md (UNSPEC_PCMP): Move to i386.md.
8921 2021-10-08 Vladimir N. Makarov <vmakarov@redhat.com>
8923 PR rtl-optimization/102627
8924 * lra-constraints.c (split_reg): Use at least natural mode of hard reg.
8926 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
8928 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Grow
8931 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
8933 * value-range.cc (irange::debug): New.
8934 * value-range.h (irange::debug): New.
8936 2021-10-08 Richard Sandiford <richard.sandiford@arm.com>
8938 PR tree-optimization/102385
8939 * predict.h (change_edge_frequency): Declare.
8940 * predict.c (change_edge_frequency): New function.
8941 * tree-ssa-loop-manip.h (tree_transform_and_unroll_loop): Remove
8943 (tree_unroll_loop): Likewise.
8944 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Update accordingly.
8945 * tree-predcom.c (pcom_worker::tree_predictive_commoning_loop):
8947 * tree-ssa-loop-prefetch.c (loop_prefetch_arrays): Likewise.
8948 * tree-ssa-loop-manip.c (tree_unroll_loop): Likewise.
8949 (tree_transform_and_unroll_loop): Likewise. Use single_dom_exit
8950 to retrieve the exit edges. Make all the old profile update code
8951 conditional on !single_loop_p -- the case it was written for --
8952 and use a different approach for the single-loop case.
8954 2021-10-08 Martin Liska <mliska@suse.cz>
8956 * config/alpha/alpha.c (alpha_option_override): Use new macro
8958 * config/arc/arc.c (arc_override_options): Likewise.
8959 * config/arm/arm.c (arm_option_override): Likewise.
8960 * config/bfin/bfin.c (bfin_load_pic_reg): Likewise.
8961 * config/c6x/c6x.c (c6x_option_override): Likewise.
8962 * config/csky/csky.c: Likewise.
8963 * config/darwin.c (darwin_override_options): Likewise.
8964 * config/frv/frv.c (frv_option_override): Likewise.
8965 * config/i386/djgpp.h: Likewise.
8966 * config/i386/i386.c (ix86_stack_protect_guard): Likewise.
8967 (ix86_max_noce_ifcvt_seq_cost): Likewise.
8968 * config/ia64/ia64.c (ia64_option_override): Likewise.
8969 (ia64_override_options_after_change): Likewise.
8970 * config/m32c/m32c.c (m32c_option_override): Likewise.
8971 * config/m32r/m32r.c (m32r_init): Likewise.
8972 * config/m68k/m68k.c (m68k_option_override): Likewise.
8973 * config/microblaze/microblaze.c (microblaze_option_override): Likewise.
8974 * config/mips/mips.c (mips_option_override): Likewise.
8975 * config/nios2/nios2.c (nios2_option_override): Likewise.
8976 * config/nvptx/nvptx.c (nvptx_option_override): Likewise.
8977 * config/pa/pa.c (pa_option_override): Likewise.
8978 * config/riscv/riscv.c (riscv_option_override): Likewise.
8979 * config/rs6000/aix71.h: Likewise.
8980 * config/rs6000/aix72.h: Likewise.
8981 * config/rs6000/aix73.h: Likewise.
8982 * config/rs6000/rs6000.c (darwin_rs6000_override_options): Likewise.
8983 (rs6000_override_options_after_change): Likewise.
8984 (rs6000_linux64_override_options): Likewise.
8985 (glibc_supports_ieee_128bit): Likewise.
8986 (rs6000_option_override_internal): Likewise.
8987 (rs6000_file_start): Likewise.
8988 (rs6000_darwin_file_start): Likewise.
8989 * config/rs6000/rtems.h: Likewise.
8990 * config/rs6000/sysv4.h: Likewise.
8991 * config/rs6000/vxworks.h (SUB3TARGET_OVERRIDE_OPTIONS): Likewise.
8992 * config/s390/s390.c (s390_option_override): Likewise.
8993 * config/sh/linux.h: Likewise.
8994 * config/sh/netbsd-elf.h (while): Likewise.
8995 * config/sh/sh.c (sh_option_override): Likewise.
8996 * config/sol2.c (solaris_override_options): Likewise.
8997 * config/sparc/sparc.c (sparc_option_override): Likewise.
8998 * config/tilegx/tilegx.c (tilegx_option_override): Likewise.
8999 * config/visium/visium.c (visium_option_override): Likewise.
9000 * config/vxworks.c (vxworks_override_options): Likewise.
9001 * lto-opts.c (lto_write_options): Likewise.
9002 * omp-expand.c (expand_omp_simd): Likewise.
9003 * omp-general.c (omp_max_vf): Likewise.
9004 * omp-offload.c (oacc_xform_loop): Likewise.
9005 * opts.h (OPTION_SET_P): Likewise.
9006 * targhooks.c (default_max_noce_ifcvt_seq_cost): Likewise.
9007 * toplev.c (process_options): Likewise.
9008 * tree-predcom.c: Likewise.
9009 * tree-sra.c (analyze_all_variable_accesses): Likewise.
9011 2021-10-08 liuhongt <hongtao.liu@intel.com>
9014 * config/i386/i386.c (ix86_optab_supported_p):
9015 Return true for HFmode.
9016 * match.pd: Simplify (_Float16) ceil ((double) x) to
9017 __builtin_ceilf16 (a) when a is _Float16 type and
9018 direct_internal_fn_supported_p.
9020 2021-10-08 liuhongt <hongtao.liu@intel.com>
9023 * config/i386/i386-expand.c (emit_reduc_half): Handle V4HImode.
9024 * config/i386/mmx.md (reduc_plus_scal_v4hi): New.
9025 (reduc_<code>_scal_v4hi): New.
9027 2021-10-08 liuhongt <hongtao.liu@intel.com>
9029 * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize).
9030 * doc/invoke.texi (Options That Control Optimization): Update
9032 * opts.c (default_options_table): Enable auto-vectorization at
9033 O2 with very-cheap cost model.
9034 (finish_options): Use cheap cost model for
9035 explicit -ftree{,-loop}-vectorize.
9037 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
9039 * ctfc.c (ctfc_delete_container): Free hash table contents.
9041 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
9043 * toplev.c (process_options): Do not warn for GNU GIMPLE.
9045 2021-10-07 Siddhesh Poyarekar <siddhesh@gotplt.org>
9047 * tree-object-size.c (addr_object_size,
9048 compute_builtin_object_size): Drop PDECL and POFF arguments.
9049 (addr_object_size): Adjust calls.
9050 * tree-object-size.h (compute_builtin_object_size): Drop PDECL
9053 2021-10-07 Roger Sayle <roger@nextmovesoftware.com>
9055 * rtl.def (SMUL_HIGHPART, UMUL_HIGHPART): New RTX codes for
9056 representing signed and unsigned high-part multiplication resp.
9057 * simplify-rtx.c (simplify_binary_operation_1) [SMUL_HIGHPART,
9058 UMUL_HIGHPART]: Simplify high-part multiplications by zero.
9059 [SS_PLUS, US_PLUS, SS_MINUS, US_MINUS, SS_MULT, US_MULT,
9060 SS_DIV, US_DIV]: Similar simplifications for saturating
9062 (simplify_const_binary_operation) [SS_PLUS, US_PLUS, SS_MINUS,
9063 US_MINUS, SS_MULT, US_MULT, SMUL_HIGHPART, UMUL_HIGHPART]:
9064 Implement compile-time evaluation for constant operands.
9065 * dwarf2out.c (mem_loc_descriptor): Skip SMUL_HIGHPART and
9067 * doc/rtl.texi (smul_highpart, umul_highpart): Document RTX codes.
9068 * doc/md.texi (smul@var{m}3_highpart, umul@var{m}3_highpart):
9069 Mention the new smul_highpart and umul_highpart RTX codes.
9070 * doc/invoke.texi: Silence @xref "compilation" warnings.
9072 2021-10-07 Martin Jambor <mjambor@suse.cz>
9075 * ipa-prop.c (ipa_edge_args_sum_t::duplicate): Also handle the
9076 case when the source reference description corresponds to a
9077 reference taken in a function src->caller is inlined to.
9079 2021-10-07 Jan Hubicka <hubicka@ucw.cz>
9082 * ipa-modref-tree.h (modref_access_node::contains_p): Handle offsets
9084 (modref_access_node::try_merge_with): Add sanity check that there
9085 are no redundant entries in the list.
9087 2021-10-07 Richard Biener <rguenther@suse.de>
9089 PR tree-optimization/102608
9090 * tree-ssa-sccvn.c (visit_stmt): Drop .DEFERRED_INIT to
9093 2021-10-07 Martin Liska <mliska@suse.cz>
9095 * toplev.c (toplev::main): Make
9096 save_opt_decoded_options a pointer type.
9097 * toplev.h: Likewise.
9099 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
9101 * config/gcn/gcn-valu.md (gather<mode>_insn_2offsets<exec>): Apply
9102 HAVE_GCN_ASM_GLOBAL_LOAD_FIXED.
9103 (scatter<mode>_insn_2offsets<exec_scatter>): Likewise.
9105 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
9107 * config/gcn/gcn-hsa.h (SRAMOPT): Include the whole option string.
9108 Adjust for new -msram-ecc=any behaviour.
9109 (ASM_SPEC): Adjust -mxnack and -msram-ecc usage.
9110 * config/gcn/gcn.c (output_file_start): Implement -msram-ecc=any.
9111 * config/gcn/mkoffload.c (EF_AMDGPU_XNACK): Rename to ...
9112 (EF_AMDGPU_XNACK_V3): ... this.
9113 (EF_AMDGPU_SRAM_ECC): Rename to ...
9114 (EF_AMDGPU_SRAM_ECC_V3): ... this.
9115 (EF_AMDGPU_FEATURE_XNACK_V4): New.
9116 (EF_AMDGPU_FEATURE_XNACK_UNSUPPORTED_V4): New.
9117 (EF_AMDGPU_FEATURE_XNACK_ANY_V4): New.
9118 (EF_AMDGPU_FEATURE_XNACK_OFF_V4): New.
9119 (EF_AMDGPU_FEATURE_XNACK_ON_V4): New.
9120 (EF_AMDGPU_FEATURE_SRAMECC_V4): New.
9121 (EF_AMDGPU_FEATURE_SRAMECC_UNSUPPORTED_V4): New.
9122 (EF_AMDGPU_FEATURE_SRAMECC_ANY_V4): New.
9123 (EF_AMDGPU_FEATURE_SRAMECC_OFF_V4): New.
9124 (EF_AMDGPU_FEATURE_SRAMECC_ON_V4): New.
9125 (SET_XNACK_ON): New.
9126 (SET_XNACK_OFF): New.
9128 (SET_SRAM_ECC_ON): New.
9129 (SET_SRAM_ECC_ANY): New.
9130 (SET_SRAM_ECC_OFF): New.
9131 (TEST_SRAM_ECC_ANY): New.
9132 (TEST_SRAM_ECC_ON): New.
9133 (main): Implement HSACOv4 and -msram-ecc=any.
9135 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
9137 * config.in: Regenerate.
9138 * config/gcn/gcn-hsa.h (X_FIJI): New macro.
9142 (A_FIJI): Rename to ...
9144 (A_900): Rename to ...
9146 (A_906): Rename to ...
9148 (A_908): Rename to ...
9150 (SRAMOPT): New macro.
9151 (ASM_SPEC): Adjust xnack option usage.
9152 * config/gcn/gcn.c (output_file_start): Adjust amdgcn_target usage.
9153 * configure: Regenerate.
9154 * configure.ac: Detect LLVM assembler dialect.
9156 2021-10-07 Richard Biener <rguenther@suse.de>
9158 * tree-pretty-print.c (dump_generic_node): Do not elide
9159 printing '&' when dumping with -gimple.
9161 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
9163 * gimple-range-cache.cc (non_null_ref::adjust_range): Call new
9165 * gimple-range-fold.cc (adjust_pointer_diff_expr): Ditto.
9166 (adjust_imagpart_expr): Ditto.
9167 * value-range.cc (irange::irange_intersect): Call new routine if
9168 RHS is a single pair.
9169 (irange::intersect): New wide_int version.
9170 * value-range.h (class irange): New prototype.
9172 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
9174 * gimple-range-edge.cc (gimple_outgoing_range::gimple_outgoing_range):
9175 Add parameter to limit size when recognizing switches.
9176 (gimple_outgoing_range::edge_range_p): Check size limit.
9177 * gimple-range-edge.h (gimple_outgoing_range): Add size field.
9178 * gimple-range-gori.cc (gori_map::calculate_gori): Ignore switches
9179 that exceed the size limit.
9180 (gori_compute::gori_compute): Add initializer.
9181 * params.opt (evrp-switch-limit): New.
9182 * doc/invoke.texi: Update docs.
9184 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
9186 * value-range.h (irange::set_varying): Use TYPE_MIN_VALUE and
9187 TYPE_MAX_VALUE instead of creating new trees when possible.
9189 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
9191 * gimple-range-cache.cc (non_null_ref::adjust_range): Check for
9192 zero and non-zero more efficiently.
9194 2021-10-06 Richard Biener <rguenther@suse.de>
9197 * dumpfile.h (TDF_GIMPLE_VAL): New.
9198 (dump_flag): Re-order and adjust TDF_* flags. Make
9199 the enum uint32_t. Use std::underlying_type in the
9201 (optgroup_flag): Likewise for the operator overloads.
9202 * tree-pretty-print.c (dump_generic_node): Wrap ADDR_EXPR
9203 in _Literal if TDF_GIMPLE_VAL.
9204 * gimple-pretty-print.c (dump_gimple_assign): Add
9205 TDF_GIMPLE_VAL to flags when dumping operands where only
9206 is_gimple_val are allowed.
9207 (dump_gimple_cond): Likewise.
9209 2021-10-06 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
9211 * gimple-isel.cc (gimple_expand_vec_cond_expr): Remove redundant if
9214 2021-10-05 qing zhao <qing.zhao@oracle.com>
9216 PR middle-end/102359
9217 * gimplify.c (gimplify_decl_expr): Do not add initialization for an
9218 auto variable when it has been initialized by the frontend.
9220 2021-10-05 Aldy Hernandez <aldyh@redhat.com>
9222 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
9225 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
9227 * common/config/avr/avr-common.c (avr_handle_option): Mark
9228 argument as ATTRIBUTE_UNUSED.
9230 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
9232 * config/lm32/uclinux-elf.h (LINK_GCC_C_SEQUENCE_SPEC):
9233 Undefine before redefinition.
9235 2021-10-05 Richard Biener <rguenther@suse.de>
9237 * toplev.c (no_backend): Remove global var.
9238 (process_options): Pass in no_backend, move post_options
9239 langhook call to toplev::main.
9240 (do_compile): Pass in no_backend, move process_options call
9242 (toplev::run_self_tests): Check no_backend at the caller.
9243 (toplev::main): Call post_options and process_options
9244 split out from do_compile, do self-tests only if
9245 no_backend is initialized.
9247 2021-10-05 Richard Biener <rguenther@suse.de>
9249 * tree-cfg.c (dump_function_to_file): Dump the UID of the
9250 function as part of the name when requested.
9251 * tree-pretty-print.c (dump_function_name): Dump the UID when
9252 requested and the langhook produced the actual name.
9254 2021-10-05 Richard Biener <rguenther@suse.de>
9256 PR middle-end/102587
9257 PR middle-end/102285
9258 * internal-fn.c (expand_DEFERRED_INIT): Fall back to
9259 zero-initialization as last resort, use the constant
9260 size as given by the DEFERRED_INIT argument to build
9263 2021-10-04 Marek Polacek <polacek@redhat.com>
9266 * doc/invoke.texi: Document -Warray-compare.
9268 2021-10-04 Richard Biener <rguenther@suse.de>
9270 * gimplify.c (is_var_need_auto_init): DECL_HARD_REGISTER
9271 variables are not to be initialized.
9273 2021-10-04 Richard Biener <rguenther@suse.de>
9275 * expr.h (non_mem_decl_p): Declare.
9276 (mem_ref_refers_to_non_mem_p): Likewise.
9277 * expr.c (non_mem_decl_p): Export.
9278 (mem_ref_refers_to_non_mem_p): Likewise.
9279 * internal-fn.c (expand_DEFERRED_INIT): Do not expand the LHS
9280 but check the base with mem_ref_refers_to_non_mem_p
9283 2021-10-04 Richard Biener <rguenther@suse.de>
9285 PR tree-optimization/102570
9286 * tree-ssa-sccvn.h (vn_reference_op_struct): Document
9287 we are using clique for the internal function code.
9288 * tree-ssa-sccvn.c (vn_reference_op_eq): Compare the
9289 internal function code.
9290 (print_vn_reference_ops): Print the internal function code.
9291 (vn_reference_op_compute_hash): Hash it.
9292 (copy_reference_ops_from_call): Record it.
9293 (visit_stmt): Remove the restriction around internal function
9295 (fully_constant_vn_reference_p): Use fold_const_call and handle
9297 (vn_reference_eq): Compare call return types.
9298 * tree-ssa-pre.c (create_expression_by_pieces): Handle
9299 generating calls to internal functions.
9300 (compute_avail): Remove the restriction around internal function
9303 2021-10-04 Aldy Hernandez <aldyh@redhat.com>
9305 PR tree-optimization/102560
9306 * gimple-ssa-warn-alloca.c (alloca_call_type): Remove static
9307 marker for invalid_range.
9309 2021-10-04 Richard Biener <rguenther@suse.de>
9311 PR middle-end/102587
9312 * internal-fn.c (expand_DEFERRED_INIT): Guard register
9313 initialization path and avoid initializing VLA registers
9316 2021-10-04 Eric Botcazou <ebotcazou@adacore.com>
9318 * config/rs6000/vxworks.h (TARGET_INIT_LIBFUNCS): Delete.
9320 2021-10-03 Martin Liska <mliska@suse.cz>
9322 * toplev.c (toplev::main): Check opt_index if it is a part
9325 2021-10-02 Aldy Hernandez <aldyh@redhat.com>
9327 PR tree-optimization/102563
9328 * range-op.cc (operator_lshift::op1_range): Do not clobber
9331 2021-10-02 Martin Liska <mliska@suse.cz>
9333 * toplev.c (toplev::main): save_decoded_options[0] is the program
9334 name and so it should be skipped.
9336 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
9338 PR tree-optimization/102546
9339 * range-op.cc (operator_lshift::op1_range): Teach range-ops that
9340 X << Y is non-zero implies X is also non-zero.
9342 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9344 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
9346 * config/aarch64/aarch64-tune.md: Regenerate.
9347 * doc/invoke.texi: Update docs.
9349 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9351 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
9353 * config/aarch64/aarch64-tune.md: Regenerate.
9354 * doc/invoke.texi: Update docs.
9356 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9358 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
9360 * config/aarch64/aarch64-tune.md: Regenerate.
9361 * doc/invoke.texi: Update docs.
9363 2021-10-01 Martin Sebor <msebor@redhat.com>
9366 * doc/invoke.texi (-Waddress): Update.
9367 * gengtype.c (write_types): Avoid -Waddress.
9368 * poly-int.h (POLY_SET_COEFF): Avoid using null.
9370 2021-10-01 John David Anglin <danglin@gcc.gnu.org>
9373 * config/pa/pa.c (pa_option_override): Default to dwarf version 4
9376 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9378 * config/aarch64/aarch64.h (AARCH64_FL_V9): Update value.
9380 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
9382 * gimple-range-path.cc (path_range_query::compute_ranges): Use
9384 * gimple-range-path.h (class path_range_query): Remove shadowed
9386 (path_range_query::get_path_oracle): New.
9388 2021-10-01 Jakub Jelinek <jakub@redhat.com>
9389 Richard Biener <rguenther@suse.de>
9392 * doc/invoke.texi (-fsanitize=integer-divide-by-zero): Remove
9393 INT_MIN / -1 division detection from here ...
9394 (-fsanitize=signed-integer-overflow): ... and add it here.
9396 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9398 * config/aarch64/aarch64-arches.def (AARCH64_ARCH): Added
9400 * config/aarch64/aarch64.h (AARCH64_FL_V9): New.
9401 (AARCH64_FL_FOR_ARCH9): New flags for Armv9-A.
9402 (AARCH64_ISA_V9): New ISA flag.
9403 * doc/invoke.texi: Update docs.
9405 2021-10-01 Martin Liska <mliska@suse.cz>
9407 * toplev.c (toplev::main): Save decoded optimization options.
9408 * toplev.h (save_opt_decoded_options): New.
9409 * doc/extend.texi: Be more clear about optimize and target
9412 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
9414 * explow.c: Include langhooks.h.
9415 (set_stack_check_libfunc): Build a proper function type.
9417 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
9420 * config/i386/i386.c (legitimate_pic_address_disp_p): For PE-COFF do
9421 not return true for external weak function symbols in medium model.
9423 2021-10-01 Jakub Jelinek <jakub@redhat.com>
9425 * tree.h (OMP_CLAUSE_ORDER_REPRODUCIBLE): Define.
9426 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_ORDER>: Print
9427 reproducible: for OMP_CLAUSE_ORDER_REPRODUCIBLE.
9428 * omp-general.c (omp_extract_for_data): If OMP_CLAUSE_ORDER is seen
9429 without OMP_CLAUSE_ORDER_UNCONSTRAINED, overwrite sched_kind to
9430 OMP_CLAUSE_SCHEDULE_STATIC.
9432 2021-10-01 Richard Biener <rguenther@suse.de>
9434 PR middle-end/102518
9435 * tree-inline.c (setup_one_parameter): Avoid substituting
9436 an invariant into contexts where a GIMPLE register is not valid.
9438 2021-09-30 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9440 * config/arm/arm-cpus.in: Add Cortex-R52+ CPU.
9441 * config/arm/arm-tables.opt: Regenerate.
9442 * config/arm/arm-tune.md: Regenerate.
9443 * doc/invoke.texi: Update docs.
9445 2021-09-30 Uroš Bizjak <ubizjak@gmail.com>
9448 * config/i386/i386.md
9449 (sign_extend:WIDE (any_logic:NARROW (memory, immediate)) splitters):
9452 2021-09-30 Tobias Burnus <tobias@codesourcery.com>
9454 * omp-low.c (omp_runtime_api_call): Add omp_aligned_{,c}alloc and
9455 omp_{c,re}alloc, fix omp_alloc/omp_free.
9457 2021-09-30 Martin Liska <mliska@suse.cz>
9459 * defaults.h (ASM_OUTPUT_ASCII): Do not hide global variable
9460 asm_out_file and stream directly to MYFILE.
9462 2021-09-30 Richard Biener <rguenther@suse.de>
9464 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
9465 Restore and fix condition under which we apply npeel to
9466 the DRs misalignment value.
9468 2021-09-30 Richard Biener <rguenther@suse.de>
9470 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
9471 Fix npeel check for variable amount of peeling.
9473 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
9475 * lto-wrapper.c (run_gcc): Plug snprintf overflow.
9477 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
9479 * gimple-range.cc (gimple_ranger::debug): New.
9480 * gimple-range.h (class gimple_ranger): Add debug.
9482 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
9484 PR middle-end/102519
9485 * tree-vrp.c (hybrid_threader::~hybrid_threader): Free m_query.
9487 2021-09-29 Indu Bhagat <indu.bhagat@oracle.com>
9490 * btfout.c (GTY): Add GTY (()), albeit for cosmetic purposes only.
9491 (btf_finalize): Empty the hash_map btf_var_ids.
9493 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
9495 * tree-vrp.c (thread_through_all_blocks): Return bool.
9496 (execute_vrp_threader): Return TODO_* flags.
9497 (pass_data_vrp_threader): Set todo_flags_finish to 0.
9499 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
9501 * timevar.def (TV_TREE_VRP_THREADER): New.
9502 * tree-vrp.c: Use TV_TREE_VRP_THREADER for VRP threader pass.
9504 2021-09-29 David Faust <david.faust@oracle.com>
9506 * config.gcc (bpf-*-*): Do not overwrite extra_headers.
9508 2021-09-29 Jonathan Wright <jonathan.wright@arm.com>
9510 * config/aarch64/aarch64-builtins.c (TYPES_BINOP_PPU): Define
9511 new type qualifier enum.
9512 (TYPES_TERNOP_SSSU): Likewise.
9513 (TYPES_TERNOP_PPPU): Likewise.
9514 * config/aarch64/aarch64-simd-builtins.def: Define PPU, SSU,
9515 PPPU and SSSU builtin generator macros for qtbl1 and qtbx1
9517 * config/aarch64/arm_neon.h (vqtbl1_p8): Use type-qualified
9518 builtin and remove casts.
9519 (vqtbl1_s8): Likewise.
9520 (vqtbl1q_p8): Likewise.
9521 (vqtbl1q_s8): Likewise.
9522 (vqtbx1_s8): Likewise.
9523 (vqtbx1_p8): Likewise.
9524 (vqtbx1q_s8): Likewise.
9525 (vqtbx1q_p8): Likewise.
9526 (vtbl1_p8): Likewise.
9527 (vtbl2_p8): Likewise.
9528 (vtbx2_p8): Likewise.
9530 2021-09-29 Richard Biener <rguenther@suse.de>
9532 * tree-vect-data-refs.c (vect_dr_misalign_for_aligned_access):
9534 (vect_update_misalignment_for_peel): Use it to update
9535 misaligned to the value necessary for an aligned access.
9536 (vect_get_peeling_costs_all_drs): Likewise.
9537 (vect_enhance_data_refs_alignment): Likewise.
9539 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9541 * config/aarch64/aarch64.c (aarch64_expand_cpymem): Count number of
9542 emitted operations and adjust heuristic for code size.
9544 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9546 * config/aarch64/aarch64.c (aarch64_expand_setmem): Count number of
9547 emitted operations and adjust heuristic for code size.
9549 2021-09-29 Jakub Jelinek <jakub@redhat.com>
9551 PR middle-end/102504
9552 * gimplify.c (gimplify_scan_omp_clauses): Use omp_check_private even
9553 in OMP_SCOPE clauses, not just on worksharing construct clauses.
9555 2021-09-28 Geng Qi <gengqi@linux.alibaba.com>
9557 * config/riscv/riscv.md (mulv<mode>4): Call gen_smul<mode>3_highpart.
9558 (<u>mulditi3): Call <su>muldi3_highpart.
9559 (<u>muldi3_highpart): Rename to <su>muldi3_highpart.
9560 (<u>mulsidi3): Call <su>mulsi3_highpart.
9561 (<u>mulsi3_highpart): Rename to <su>mulsi3_highpart.
9563 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
9565 * config/darwin.h (DSYMUTIL_SPEC): Recognize D sources.
9567 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
9569 * config/rs6000/darwin.h (FIXED_R13): Add for PPC64.
9570 (FIRST_SAVED_GP_REGNO): Save from R13 even when it is one
9573 2021-09-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9575 * config/aarch64/aarch64.h (AARCH64_FL_LS64): Define.
9576 (AARCH64_FL_V8_7): Likewise.
9577 (AARCH64_FL_FOR_ARCH8_7): Likewise.
9578 * config/aarch64/aarch64-arches.def (armv8.7-a): Define.
9579 * config/aarch64/aarch64-option-extensions.def (ls64): Define.
9580 * doc/invoke.texi: Document the above.
9582 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9584 * dbgcnt.c (dbg_cnt_counter): New.
9585 * dbgcnt.h (dbg_cnt_counter): New.
9586 * dumpfile.c (dump_options): Add entry for TDF_THREADING.
9587 * dumpfile.h (enum dump_flag): Add TDF_THREADING.
9588 * gimple-range-path.cc (DEBUG_SOLVER): Use TDF_THREADING.
9589 * tree-ssa-threadupdate.c (dump_jump_thread_path): Dump out
9592 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9594 * cfgcleanup.c (pass_jump::execute): Check
9595 flag_expensive_optimizations.
9596 (pass_jump_after_combine::gate): Same.
9597 * doc/invoke.texi (-fthread-jumps): Enable for -O1.
9598 * opts.c (default_options_table): Enable -fthread-jumps at -O1.
9599 * tree-ssa-threadupdate.c
9600 (fwd_jt_path_registry::remove_jump_threads_including): Bail unless
9603 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
9605 * tree-ssa-reassoc.c (biased_names): New global.
9606 (propagate_bias_p): New function.
9607 (loop_carried_phi): Remove.
9608 (propagate_rank): Propagate bias along single uses.
9609 (get_rank): Update biased_names when needed.
9611 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
9613 * passes.def (pass_reassoc): Rename parameter to early_p.
9614 * tree-ssa-reassoc.c (reassoc_bias_loop_carried_phi_ranks_p):
9616 (phi_rank): Don't bias loop-carried phi ranks
9617 before vectorization pass.
9618 (execute_reassoc): Add bias_loop_carried_phi_ranks_p parameter.
9619 (pass_reassoc::pass_reassoc): Add bias_loop_carried_phi_ranks_p
9621 (pass_reassoc::set_param): Set bias_loop_carried_phi_ranks_p
9623 (pass_reassoc::execute): Pass bias_loop_carried_phi_ranks_p to
9625 (pass_reassoc::bias_loop_carried_phi_ranks_p): New member.
9627 2021-09-28 Jakub Jelinek <jakub@redhat.com>
9630 * config/i386/i386.c (standard_80387_constant_p): Don't recognize
9631 special 80387 instruction XFmode constants if flag_rounding_math.
9633 2021-09-28 Richard Biener <rguenther@suse.de>
9635 PR tree-optimization/100112
9636 * tree-ssa-sccvn.c (visit_reference_op_load): Record the
9637 reference into the hashtable twice in case last_vuse is
9638 different from the original vuse on the stmt.
9640 2021-09-28 Jakub Jelinek <jakub@redhat.com>
9642 PR middle-end/102492
9643 * gimplify.c (gimplify_adjust_omp_clauses_1): Don't call the
9644 omp_finish_clause langhook on implicitly added OMP_CLAUSE_PRIVATE
9645 clauses on SIMD constructs.
9647 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9649 PR tree-optimization/102511
9650 * gimple-range-path.cc (path_range_query::range_on_path_entry):
9651 Return VARYING when nothing found.
9653 2021-09-28 Hongyu Wang <hongyu.wang@intel.com>
9656 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): Add
9658 (VALID_SSE2_REG_VHF_MODE): Add V4HFmode and V2HFmode.
9659 (VALID_MMX_REG_MODE): Add V4HFmode.
9660 (SSE_REG_MODE_P): Replace VALID_AVX512FP16_REG_MODE with
9661 vector mode condition.
9662 * config/i386/i386.c (classify_argument): Parse V4HF/V2HF
9664 (function_arg_32): Add V4HFmode.
9665 (function_arg_advance_32): Likewise.
9666 * config/i386/i386.md (mode): Add V4HF/V2HF.
9667 (MODE_SIZE): Likewise.
9668 * config/i386/mmx.md (MMXMODE): Add V4HF mode.
9669 (V_32): Add V2HF mode.
9670 (VHF_32_64): New mode iterator.
9671 (*mov<mode>_internal): Adjust sse alternatives to support
9673 (*mov<mode>_internal): Adjust sse alternatives to support
9675 (<insn><mode>3): New define_insn for add/sub/mul/div.
9677 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9679 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Check
9681 (pass_early_thread_jumps::gate): Same.
9682 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
9683 Return if !flag_thread_jumps.
9684 * tree-ssa-threadupdate.c
9685 (jt_path_registry::register_jump_thread): Assert that
9686 flag_thread_jumps is true.
9688 2021-09-28 liuhongt <hongtao.liu@intel.com>
9691 (simplify_context::simplify_binary_operation_1): Relax
9692 condition of simplifying (vec_concat:M (vec_select op0
9693 index0)(vec_select op1 index1)) to allow different modes
9694 between op0 and M, but have same inner mode.
9696 2021-09-28 liuhongt <hongtao.liu@intel.com>
9698 * config/i386/i386-expand.c (emit_reduc_half): Handle
9699 V8HF/V16HF/V32HFmode.
9700 * config/i386/sse.md (REDUC_SSE_PLUS_MODE): Add V8HF.
9701 (REDUC_SSE_SMINMAX_MODE): Ditto.
9702 (REDUC_PLUS_MODE): Add V16HF and V32HF.
9703 (REDUC_SMINMAX_MODE): Ditto.
9705 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9707 * gimple-range-path.cc
9708 (path_range_query::precompute_ranges_in_block): Rename to...
9709 (path_range_query::compute_ranges_in_block): ...this.
9710 (path_range_query::precompute_ranges): Rename to...
9711 (path_range_query::compute_ranges): ...this.
9712 (path_range_query::precompute_relations): Rename to...
9713 (path_range_query::compute_relations): ...this.
9714 (path_range_query::precompute_phi_relations): Rename to...
9715 (path_range_query::compute_phi_relations): ...this.
9716 * gimple-range-path.h: Rename precompute* to compute*.
9717 * tree-ssa-threadbackward.c
9718 (back_threader::find_taken_edge_switch): Same.
9719 (back_threader::find_taken_edge_cond): Same.
9720 * tree-ssa-threadedge.c
9721 (hybrid_jt_simplifier::compute_ranges_from_state): Same.
9722 (hybrid_jt_state::register_equivs_stmt): Inline...
9723 * tree-ssa-threadedge.h: ...here.
9725 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9727 * tree-vrp.c (lhs_of_dominating_assert): Remove.
9728 (class vrp_jt_state): Remove.
9729 (class vrp_jt_simplifier): Remove.
9730 (vrp_jt_simplifier::simplify): Remove.
9731 (class vrp_jump_threader): Remove.
9732 (vrp_jump_threader::vrp_jump_threader): Remove.
9733 (vrp_jump_threader::~vrp_jump_threader): Remove.
9734 (vrp_jump_threader::before_dom_children): Remove.
9735 (vrp_jump_threader::after_dom_children): Remove.
9737 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9739 * passes.def (pass_vrp_threader): New.
9740 * tree-pass.h (make_pass_vrp_threader): Add make_pass_vrp_threader.
9741 * tree-ssa-threadedge.c (hybrid_jt_state::register_equivs_stmt): New.
9742 (hybrid_jt_simplifier::hybrid_jt_simplifier): New.
9743 (hybrid_jt_simplifier::simplify): New.
9744 (hybrid_jt_simplifier::compute_ranges_from_state): New.
9745 * tree-ssa-threadedge.h (class hybrid_jt_state): New.
9746 (class hybrid_jt_simplifier): New.
9747 * tree-vrp.c (execute_vrp): Remove ASSERT_EXPR based jump
9749 (class hybrid_threader): New.
9750 (hybrid_threader::hybrid_threader): New.
9751 (hybrid_threader::~hybrid_threader): New.
9752 (hybrid_threader::before_dom_children): New.
9753 (hybrid_threader::after_dom_children): New.
9754 (execute_vrp_threader): New.
9755 (class pass_vrp_threader): New.
9756 (make_pass_vrp_threader): New.
9758 2021-09-27 Martin Liska <mliska@suse.cz>
9760 * output.h (enum section_flag): New.
9761 (SECTION_FORGET): Remove.
9762 (SECTION_ENTSIZE): Make it (1UL << 8) - 1.
9763 (SECTION_STYLE_MASK): Define it based on other enum
9765 * varasm.c (switch_to_section): Remove unused handling of
9768 2021-09-27 Martin Liska <mliska@suse.cz>
9770 * common.opt: Add new variable flag_default_complex_method.
9771 * opts.c (finish_options): Handle flags related to
9772 x_flag_complex_method.
9773 * toplev.c (process_options): Remove option handling related
9774 to flag_complex_method.
9776 2021-09-27 Richard Biener <rguenther@suse.de>
9778 PR middle-end/102450
9779 * gimple-fold.c (gimple_fold_builtin_memory_op): Avoid using
9780 type_for_size, instead use int_mode_for_size.
9782 2021-09-27 Andrew Pinski <apinski@marvell.com>
9785 * gimplify.c (gimplify_save_expr): Return early
9786 if the type of val is error_mark_node.
9788 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9790 * tree-ssanames.c (ssa_name_has_boolean_range): Use
9793 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9795 * gimple-ssa-evrp-analyze.h (class evrp_range_analyzer): Remove
9796 vrp_visit_cond_stmt.
9797 * tree-ssa-dom.c (cprop_operand): Convert to range_query API.
9798 (cprop_into_stmt): Same.
9799 (dom_opt_dom_walker::optimize_stmt): Same.
9801 2021-09-27 Richard Biener <rguenther@suse.de>
9803 PR tree-optimization/97351
9804 PR tree-optimization/97352
9805 PR tree-optimization/82426
9806 * tree-vectorizer.h (dr_misalignment): Add vector type
9808 (aligned_access_p): Likewise.
9809 (known_alignment_for_access_p): Likewise.
9810 (vect_supportable_dr_alignment): Likewise.
9811 (vect_known_alignment_in_bytes): Likewise. Refactor.
9812 (DR_MISALIGNMENT): Remove.
9813 (vect_update_shared_vectype): Likewise.
9814 * tree-vect-data-refs.c (dr_misalignment): Refactor, handle
9815 a vector type with larger alignment requirement and apply
9816 the negative step adjustment here.
9817 (vect_calculate_target_alignment): Remove.
9818 (vect_compute_data_ref_alignment): Get explicit vector type
9819 argument, do not apply a negative step alignment adjustment
9821 (vect_slp_analyze_node_alignment): Re-analyze alignment
9822 when we re-visit the DR with a bigger desired alignment but
9823 keep more precise results from smaller alignments.
9824 * tree-vect-slp.c (vect_update_shared_vectype): Remove.
9825 (vect_slp_analyze_node_operations_1): Do not update the
9826 shared vector type on stmts.
9827 * tree-vect-stmts.c (vect_analyze_stmt): Push/pop the
9828 vector type of an SLP node to the representative stmt-info.
9829 (vect_transform_stmt): Likewise.
9831 2021-09-27 liuhongt <hongtao.liu@intel.com>
9834 2021-09-09 liuhongt <hongtao.liu@intel.com>
9837 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
9838 (reduc_plus_scal_v4sf): .. this, New define_expand.
9839 (reduc_plus_scal_v2df): .. and this, New define_expand.
9841 2021-09-26 liuhongt <hongtao.liu@intel.com>
9843 * doc/extend.texi (Half-Precision): Remove storage only
9844 description for _Float16 w/o avx512fp16.
9846 2021-09-25 Dimitar Dimitrov <dimitar@dinux.eu>
9848 * config/pru/constraints.md (Rrio): New constraint.
9849 * config/pru/predicates.md (regio_operand): New predicate.
9850 * config/pru/pru-pragma.c (pru_register_pragmas): Register
9851 the __regio_symbol address space.
9852 * config/pru/pru-protos.h (pru_symref2ioregno): Declaration.
9853 * config/pru/pru.c (pru_symref2ioregno): New helper function.
9854 (pru_legitimate_address_p): Remove.
9855 (pru_addr_space_legitimate_address_p): Use the address space
9857 (pru_nongeneric_pointer_addrspace): New helper function.
9858 (pru_insert_attributes): New function to validate __regio_symbol
9860 (TARGET_INSERT_ATTRIBUTES): New macro.
9861 (TARGET_LEGITIMATE_ADDRESS_P): Remove.
9862 (TARGET_ADDR_SPACE_LEGITIMATE_ADDRESS_P): New macro.
9863 * config/pru/pru.h (enum reg_class): Add REGIO_REGS class.
9864 * config/pru/pru.md (*regio_readsi): New pattern to read I/O
9866 (*regio_nozext_writesi): New pattern to write to I/O registers.
9867 (*regio_zext_write_r30<EQS0:mode>): Ditto.
9868 * doc/extend.texi: Document the new PRU Named Address Space.
9870 2021-09-24 Patrick Palka <ppalka@redhat.com>
9874 * real.c (encode_ieee_double): Avoid unwanted sign extension.
9875 (encode_ieee_quad): Likewise.
9877 2021-09-24 Vladimir Makarov <vmakarov@redhat.com>
9879 PR rtl-optimization/102147
9880 * ira-build.c (ira_conflict_vector_profitable_p): Make
9881 profitability calculation independent of host compiler pointer and
9884 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
9886 * gimple-range-path.cc (path_range_query::path_range_query):
9887 Move debugging header...
9888 (path_range_query::precompute_ranges): ...here.
9889 (path_range_query::internal_range_of_expr): Do not call
9890 range_on_path_entry if NAME is defined in the current block.
9892 2021-09-24 Richard Biener <rguenther@suse.de>
9894 * cfghooks.c (verify_flow_info): Verify unallocated BB and
9895 edge flags are not set.
9897 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
9899 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
9901 (jt_path_registry::register_jump_thread): Call
9902 cancel_invalid_paths.
9903 * tree-ssa-threadupdate.h (class jt_path_registry): Add
9904 cancel_invalid_paths.
9906 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
9908 PR tree-optimization/102400
9909 * tree-ssa-sccvn.c (vn_reference_insert_pieces): Initialize
9910 result_vdef to zero value.
9912 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
9914 PR tree-optimization/102451
9915 * tree-ssa-dse.c (delete_dead_or_redundant_call): Record bb of stmt
9918 2021-09-24 Hongyu Wang <hongyu.wang@intel.com>
9920 * config/i386/sse.md (cond_<insn><mode>): Extend to support
9922 (cond_mul<mode>): Likewise.
9923 (cond_div<mode>): Likewise.
9924 (cond_<code><mode>): Likewise.
9925 (cond_fma<mode>): Likewise.
9926 (cond_fms<mode>): Likewise.
9927 (cond_fnma<mode>): Likewise.
9928 (cond_fnms<mode>): Likewise.
9930 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
9932 PR tree-optimization/102463
9933 * gimple-range-fold.cc (fold_using_range::relation_fold_and_or): If
9934 there is no range-ops handler, don't look for a relation.
9936 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
9938 * gimple-range-cache.cc (ranger_cache::ranger_cache): Take
9939 non-executable_edge flag as parameter.
9940 * gimple-range-cache.h (ranger_cache): Adjust prototype.
9941 * gimple-range-gori.cc (gori_compute::gori_compute): Take
9942 non-executable_edge flag as parameter.
9943 (gori_compute::outgoing_edge_range_p): Check new flag.
9944 * gimple-range-gori.h (gori_compute): Adjust prototype.
9945 * gimple-range.cc (gimple_ranger::gimple_ranger): Create new flag.
9946 (gimple_ranger::range_on_edge): Check new flag.
9947 * gimple-range.h (gimple_ranger::non_executable_edge_flag): New.
9948 * gimple-ssa-evrp.c (rvrp_folder): Pass ranger flag to simplifier.
9949 (hybrid_folder::hybrid_folder): Set ranger non-executable flag value.
9950 (hybrid_folder::fold_stmt): Set flag value in the simplifier.
9951 * vr-values.c (simplify_using_ranges::set_and_propagate_unexecutable):
9952 Use not_executable flag if provided instead of EDGE_EXECUTABLE.
9953 (simplify_using_ranges::simplify_switch_using_ranges): Clear
9954 EDGE_EXECUTABLE like it originally did.
9955 (simplify_using_ranges::cleanup_edges_and_switches): Clear any
9956 NON_EXECUTABLE flags.
9957 (simplify_using_ranges::simplify_using_ranges): Adjust.
9958 * vr-values.h (class simplify_using_ranges): Adjust.
9959 (simplify_using_ranges::set_range_query): Add non-executable flag param.
9961 2021-09-23 Bill Schmidt <wschmidt@linux.ibm.com>
9964 * config/rs6000/rs6000-call.c (rs6000_aggregate_candidate): Detect
9965 zero-width bit fields and return indicator.
9966 (rs6000_discover_homogeneous_aggregate): Diagnose when the
9967 presence of a zero-width bit field changes parameter passing in
9970 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9972 * gimple-range-fold.cc (fold_using_range::range_of_phi):
9973 Remove dominator check.
9975 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9977 * gimple-range-path.cc (path_range_query::precompute_relations):
9978 Hoist edge calculations before using EDGE_SUCC.
9980 2021-09-23 Jonathan Wakely <jwakely@redhat.com>
9982 * configure.ac: Fix --with-multilib-list description.
9983 * configure: Regenerate.
9985 2021-09-23 Richard Biener <rguenther@suse.de>
9987 PR tree-optimization/102448
9988 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
9989 Clear alignment info copied from DR_PTR_INFO.
9991 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9993 * config/i386/i386-expand.c (ix86_use_mask_cmp_p): Enable
9995 * config/i386/sse.md (sseintvecmodelower): Add HF vector modes.
9996 (<avx512>_store<mode>_mask): Extend to support HF vector modes.
9997 (vec_cmp<mode><avx512fmaskmodelower>): Likewise.
9998 (vcond_mask_<mode><avx512fmaskmodelower>): Likewise.
9999 (vcond<mode><mode>): New expander.
10000 (vcond<mode><sseintvecmodelower>): Likewise.
10001 (vcond<sseintvecmodelower><mode>): Likewise.
10002 (vcondu<mode><sseintvecmodelower>): Likewise.
10004 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
10006 * config/i386/sse.md (extend<ssePHmodelower><mode>2):
10008 (extendv4hf<mode>2): Likewise.
10009 (extendv2hfv2df2): Likewise.
10010 (trunc<mode><ssePHmodelower>2): Likewise.
10011 (avx512fp16_vcvt<castmode>2ph_<mode>): Rename to ...
10012 (trunc<mode>v4hf2): ... this, and drop constraints.
10013 (avx512fp16_vcvtpd2ph_v2df): Rename to ...
10014 (truncv2dfv2hf2): ... this, and likewise.
10016 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
10018 * config/i386/sse.md (float<floatunssuffix><mode><ssePHmodelower>2):
10020 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>):
10022 (float<floatunssuffix><mode>v4hf2): ... this, and drop constraints.
10023 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Rename to ...
10024 (float<floatunssuffix>v2div2hf2): ... this, and likewise.
10026 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
10028 * config/i386/i386.md (fix<fixunssuffix>_trunchf<mode>2): New expander.
10029 (fixuns_trunchfhi2): Likewise.
10030 (*fixuns_trunchfsi2zext): New define_insn.
10031 * config/i386/sse.md (ssePHmodelower): New mode_attr.
10032 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
10033 New expander for same element vector fix_truncate.
10034 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
10035 Likewise for V4HF to V4SI/V4DI fix_truncate.
10036 (fix<fixunssuffix>_truncv2hfv2di2):
10037 Likewise for V2HF to V2DI fix_truncate.
10039 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
10041 * config/i386/i386.md (<code>hf3): New expander.
10043 2021-09-23 liuhongt <hongtao.liu@intel.com>
10045 * config/i386/sse.md (FMAMODEM): Extend to handle FP16.
10046 (VFH_SF_AVX512VL): Extend to handle HFmode.
10047 (VF_SF_AVX512VL): Deleted.
10049 2021-09-23 liuhongt <hongtao.liu@intel.com>
10051 * config/i386/i386.md (rinthf2): New expander.
10052 (nearbyinthf2): New expander.
10054 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
10056 * tree-ssa-dom.c (class dom_jump_threader_simplifier): Rename...
10057 (class dom_jt_state): ...this and provide virtual overrides.
10058 (dom_jt_state::register_equiv): New.
10059 (class dom_jt_simplifier): Rename from
10060 dom_jump_threader_simplifier.
10061 (dom_jump_threader_simplifier::simplify): Rename...
10062 (dom_jt_simplifier::simplify): ...to this.
10063 (pass_dominator::execute): Use dom_jt_simplifier and
10065 * tree-ssa-threadedge.c (jump_threader::jump_threader):
10067 (jt_state::register_equivs_stmt): Abstract out...
10068 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
10070 (jump_threader::thread_around_empty_blocks): Update state.
10071 (jump_threader::thread_through_normal_block): Same.
10072 (jt_state::jt_state): Remove.
10073 (jt_state::push): Remove pass specific bits. Keep block vector
10075 (jt_state::append_path): New.
10076 (jt_state::pop): Remove pass specific bits.
10077 (jt_state::register_equiv): Same.
10078 (jt_state::record_ranges_from_stmt): Same.
10079 (jt_state::register_equivs_on_edge): Same. Rename...
10080 (jt_state::register_equivs_edge): ...to this.
10081 (jt_state::dump): New.
10082 (jt_state::debug): New.
10083 (jump_threader_simplifier::simplify): Remove.
10084 (jt_state::get_path): New.
10085 * tree-ssa-threadedge.h (class jt_simplifier): Make into a base
10086 class. Expose common functionality as virtual methods.
10087 (class jump_threader_simplifier): Same. Rename...
10088 (class jt_simplifier): ...to this.
10089 * tree-vrp.c (class vrp_jump_threader_simplifier): Rename...
10090 (class vrp_jt_simplifier): ...to this. Provide pass specific
10092 (class vrp_jt_state): New.
10093 (vrp_jump_threader_simplifier::simplify): Rename...
10094 (vrp_jt_simplifier::simplify): ...to this. Inline code from
10095 what used to be the base class.
10096 (vrp_jump_threader::vrp_jump_threader): Use vrp_jt_state and
10099 2021-09-22 Tobias Burnus <tobias@codesourcery.com>
10102 * doc/invoke.texi (-Wno-missing-include-dirs.): Document Fortran
10105 2021-09-22 Roger Sayle <roger@nextmovesoftware.com>
10106 Richard Biener <rguenther@suse.de>
10108 * match.pd (negation simplifications): Implement some negation
10109 folding transformations from fold-const.c's fold_negate_expr.
10110 * tree-ssa-sccvn.c (vn_nary_build_or_lookup_1): Add a SIMPLIFY
10111 argument, to control whether the op should be simplified prior
10112 to looking up/assigning a value number.
10113 (vn_nary_build_or_lookup): Update call to vn_nary_build_or_lookup_1.
10114 (vn_nary_simplify): Likewise.
10115 (visit_nary_op): Likewise, but when constructing a NEGATE_EXPR
10116 now call vn_nary_build_or_lookup_1 disabling simplification.
10118 2021-09-22 Jiufu Guo <guojiufu@linux.ibm.com>
10120 PR tree-optimization/102087
10121 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
10122 Update bound/cmp/control for niter.
10124 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
10126 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
10127 Move check for non-empty BB here.
10128 (fur_source::register_outgoing_edges): ...from here.
10130 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
10132 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
10133 Remove call to improve_range_with_equivs.
10134 (path_range_query::improve_range_with_equivs): Remove.
10135 * gimple-range-path.h: Remove improve_range_with_equivs.
10137 2021-09-22 dianhong xu <dianhong.xu@intel.com>
10139 * config/i386/avx512fp16intrin.h:
10140 (_mm512_mask_blend_ph): New intrinsic.
10141 (_mm512_permutex2var_ph): Ditto.
10142 (_mm512_permutexvar_ph): Ditto.
10143 * config/i386/avx512fp16vlintrin.h:
10144 (_mm256_mask_blend_ph): New intrinsic.
10145 (_mm256_permutex2var_ph): Ditto.
10146 (_mm256_permutexvar_ph): Ditto.
10147 (_mm_mask_blend_ph): Ditto.
10148 (_mm_permutex2var_ph): Ditto.
10149 (_mm_permutexvar_ph): Ditto.
10151 2021-09-22 dianhong xu <dianhong.xu@intel.com>
10153 * config/i386/avx512fp16intrin.h: Add new intrinsics.
10154 (_mm512_conj_pch): New intrinsic.
10155 (_mm512_mask_conj_pch): Ditto.
10156 (_mm512_maskz_conj_pch): Ditto.
10157 * config/i386/avx512fp16vlintrin.h: Add new intrinsics.
10158 (_mm256_conj_pch): New intrinsic.
10159 (_mm256_mask_conj_pch): Ditto.
10160 (_mm256_maskz_conj_pch): Ditto.
10161 (_mm_conj_pch): Ditto.
10162 (_mm_mask_conj_pch): Ditto.
10163 (_mm_maskz_conj_pch): Ditto.
10165 2021-09-22 dianhong xu <dianhong.xu@intel.com>
10167 * config/i386/avx512fp16intrin.h (_MM512_REDUCE_OP): New macro.
10168 (_mm512_reduce_add_ph): New intrinsic.
10169 (_mm512_reduce_mul_ph): Ditto.
10170 (_mm512_reduce_min_ph): Ditto.
10171 (_mm512_reduce_max_ph): Ditto.
10172 * config/i386/avx512fp16vlintrin.h
10173 (_MM256_REDUCE_OP/_MM_REDUCE_OP): New macro.
10174 (_mm256_reduce_add_ph): New intrinsic.
10175 (_mm256_reduce_mul_ph): Ditto.
10176 (_mm256_reduce_min_ph): Ditto.
10177 (_mm256_reduce_max_ph): Ditto.
10178 (_mm_reduce_add_ph): Ditto.
10179 (_mm_reduce_mul_ph): Ditto.
10180 (_mm_reduce_min_ph): Ditto.
10181 (_mm_reduce_max_ph): Ditto.
10183 2021-09-22 dianhong xu <dianhong.xu@intel.com>
10185 * config/i386/avx512fp16intrin.h (__m512h_u, __m256h_u,
10186 __m128h_u): New typedef.
10187 (_mm512_load_ph): New intrinsic.
10188 (_mm256_load_ph): Ditto.
10189 (_mm_load_ph): Ditto.
10190 (_mm512_loadu_ph): Ditto.
10191 (_mm256_loadu_ph): Ditto.
10192 (_mm_loadu_ph): Ditto.
10193 (_mm512_store_ph): Ditto.
10194 (_mm256_store_ph): Ditto.
10195 (_mm_store_ph): Ditto.
10196 (_mm512_storeu_ph): Ditto.
10197 (_mm256_storeu_ph): Ditto.
10198 (_mm_storeu_ph): Ditto.
10199 (_mm512_abs_ph): Ditto.
10200 * config/i386/avx512fp16vlintrin.h
10201 (_mm_abs_ph): Ditto.
10202 (_mm256_abs_ph): Ditto.
10204 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
10206 * config/s390/tpf.md (prologue_tpf, epilogue_tpf): Add cc clobber.
10208 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
10211 * config/s390/s390.c (s390_expand_insv): Emit a normal move if it
10212 is actually a full copy of the source operand into the target.
10213 Don't emit a strict low part move if source and target mode match.
10215 2021-09-22 Jakub Jelinek <jakub@redhat.com>
10217 PR middle-end/102415
10218 * omp-expand.c (expand_omp_single): If region->exit is NULL,
10219 assert region->entry is GIMPLE_OMP_SCOPE region and return.
10221 2021-09-22 Jakub Jelinek <jakub@redhat.com>
10223 * tree.h (OMP_CLAUSE_ALLOCATE_ALIGN): Define.
10224 * tree.c (omp_clause_num_ops): Change number of OMP_CLAUSE_ALLOCATE
10225 arguments from 2 to 3.
10226 * tree-pretty-print.c (dump_omp_clause): Print allocator() around
10227 allocate clause allocator and print align if present.
10228 * omp-low.c (scan_sharing_clauses): Force allocate_map entry even
10229 for omp_default_mem_alloc if align modifier is present. If align
10230 modifier is present, use TREE_LIST to encode both allocator and
10232 (lower_private_allocate, lower_rec_input_clauses, create_task_copyfn):
10233 Handle align modifier on allocator clause if present.
10235 2021-09-22 liuhongt <hongtao.liu@intel.com>
10237 * config/i386/i386.md (define_attr "isa"): Add
10239 (define_attr "enabled"): Correspond fma_or_avx512vl to
10240 TARGET_FMA || TARGET_AVX512VL.
10241 * config/i386/mmx.md (fmav2sf4): Extend to AVX512 fma.
10243 (fnmav2sf4): Ditto.
10244 (fnmsv2sf4): Ditto.
10246 2021-09-22 liuhongt <hongtao.liu@intel.com>
10248 * config/i386/i386.md (cstorehf3): New define_expand.
10250 2021-09-22 liuhongt <hongtao.liu@intel.com>
10252 * config/i386/i386.md (<rounding_insn>hf2): New expander.
10253 (sse4_1_round<mode>2): Extend from MODEF to MODEFH.
10254 * config/i386/sse.md (*sse4_1_round<ssescalarmodesuffix>):
10255 Extend from VF_128 to VFH_128.
10257 2021-09-22 liuhongt <hongtao.liu@intel.com>
10259 * config/i386/i386-features.c (i386-features.c): Handle
10261 * config/i386/i386.md (sqrthf2): New expander.
10262 (*sqrthf2): New define_insn.
10263 * config/i386/sse.md
10264 (*<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
10267 2021-09-22 liuhongt <hongtao.liu@intel.com>
10269 * config/i386/avx512fp16intrin.h (_mm_mask_fcmadd_sch):
10271 (_mm_mask3_fcmadd_sch): Likewise.
10272 (_mm_maskz_fcmadd_sch): Likewise.
10273 (_mm_fcmadd_sch): Likewise.
10274 (_mm_mask_fmadd_sch): Likewise.
10275 (_mm_mask3_fmadd_sch): Likewise.
10276 (_mm_maskz_fmadd_sch): Likewise.
10277 (_mm_fmadd_sch): Likewise.
10278 (_mm_mask_fcmadd_round_sch): Likewise.
10279 (_mm_mask3_fcmadd_round_sch): Likewise.
10280 (_mm_maskz_fcmadd_round_sch): Likewise.
10281 (_mm_fcmadd_round_sch): Likewise.
10282 (_mm_mask_fmadd_round_sch): Likewise.
10283 (_mm_mask3_fmadd_round_sch): Likewise.
10284 (_mm_maskz_fmadd_round_sch): Likewise.
10285 (_mm_fmadd_round_sch): Likewise.
10286 (_mm_fcmul_sch): Likewise.
10287 (_mm_mask_fcmul_sch): Likewise.
10288 (_mm_maskz_fcmul_sch): Likewise.
10289 (_mm_fmul_sch): Likewise.
10290 (_mm_mask_fmul_sch): Likewise.
10291 (_mm_maskz_fmul_sch): Likewise.
10292 (_mm_fcmul_round_sch): Likewise.
10293 (_mm_mask_fcmul_round_sch): Likewise.
10294 (_mm_maskz_fcmul_round_sch): Likewise.
10295 (_mm_fmul_round_sch): Likewise.
10296 (_mm_mask_fmul_round_sch): Likewise.
10297 (_mm_maskz_fmul_round_sch): Likewise.
10298 * config/i386/i386-builtin.def: Add corresponding new builtins.
10299 * config/i386/sse.md
10300 (avx512fp16_fmaddcsh_v8hf_maskz<round_expand_name>): New expander.
10301 (avx512fp16_fcmaddcsh_v8hf_maskz<round_expand_name>): Ditto.
10302 (avx512fp16_fma_<complexopname>sh_v8hf<mask_scalarcz_name><round_scalarcz_name>):
10304 (avx512fp16_<complexopname>sh_v8hf_mask<round_name>): Ditto.
10305 (avx512fp16_<complexopname>sh_v8hf<mask_scalarc_name><round_scalarcz_name>):
10307 * config/i386/subst.md (mask_scalarcz_name): New.
10308 (mask_scalarc_name): Ditto.
10309 (mask_scalarc_operand3): Ditto.
10310 (mask_scalarcz_operand4): Ditto.
10311 (round_scalarcz_name): Ditto.
10312 (round_scalarc_mask_operand3): Ditto.
10313 (round_scalarcz_mask_operand4): Ditto.
10314 (round_scalarc_mask_op3): Ditto.
10315 (round_scalarcz_mask_op4): Ditto.
10316 (round_scalarcz_constraint): Ditto.
10317 (round_scalarcz_nimm_predicate): Ditto.
10318 (mask_scalarcz): Ditto.
10319 (mask_scalarc): Ditto.
10320 (round_scalarcz): Ditto.
10322 2021-09-22 liuhongt <hongtao.liu@intel.com>
10324 * config/i386/avx512fp16intrin.h (_mm512_fcmadd_pch):
10326 (_mm512_mask_fcmadd_pch): Likewise.
10327 (_mm512_mask3_fcmadd_pch): Likewise.
10328 (_mm512_maskz_fcmadd_pch): Likewise.
10329 (_mm512_fmadd_pch): Likewise.
10330 (_mm512_mask_fmadd_pch): Likewise.
10331 (_mm512_mask3_fmadd_pch): Likewise.
10332 (_mm512_maskz_fmadd_pch): Likewise.
10333 (_mm512_fcmadd_round_pch): Likewise.
10334 (_mm512_mask_fcmadd_round_pch): Likewise.
10335 (_mm512_mask3_fcmadd_round_pch): Likewise.
10336 (_mm512_maskz_fcmadd_round_pch): Likewise.
10337 (_mm512_fmadd_round_pch): Likewise.
10338 (_mm512_mask_fmadd_round_pch): Likewise.
10339 (_mm512_mask3_fmadd_round_pch): Likewise.
10340 (_mm512_maskz_fmadd_round_pch): Likewise.
10341 (_mm512_fcmul_pch): Likewise.
10342 (_mm512_mask_fcmul_pch): Likewise.
10343 (_mm512_maskz_fcmul_pch): Likewise.
10344 (_mm512_fmul_pch): Likewise.
10345 (_mm512_mask_fmul_pch): Likewise.
10346 (_mm512_maskz_fmul_pch): Likewise.
10347 (_mm512_fcmul_round_pch): Likewise.
10348 (_mm512_mask_fcmul_round_pch): Likewise.
10349 (_mm512_maskz_fcmul_round_pch): Likewise.
10350 (_mm512_fmul_round_pch): Likewise.
10351 (_mm512_mask_fmul_round_pch): Likewise.
10352 (_mm512_maskz_fmul_round_pch): Likewise.
10353 * config/i386/avx512fp16vlintrin.h (_mm_fmadd_pch):
10355 (_mm_mask_fmadd_pch): Likewise.
10356 (_mm_mask3_fmadd_pch): Likewise.
10357 (_mm_maskz_fmadd_pch): Likewise.
10358 (_mm256_fmadd_pch): Likewise.
10359 (_mm256_mask_fmadd_pch): Likewise.
10360 (_mm256_mask3_fmadd_pch): Likewise.
10361 (_mm256_maskz_fmadd_pch): Likewise.
10362 (_mm_fcmadd_pch): Likewise.
10363 (_mm_mask_fcmadd_pch): Likewise.
10364 (_mm_mask3_fcmadd_pch): Likewise.
10365 (_mm_maskz_fcmadd_pch): Likewise.
10366 (_mm256_fcmadd_pch): Likewise.
10367 (_mm256_mask_fcmadd_pch): Likewise.
10368 (_mm256_mask3_fcmadd_pch): Likewise.
10369 (_mm256_maskz_fcmadd_pch): Likewise.
10370 (_mm_fmul_pch): Likewise.
10371 (_mm_mask_fmul_pch): Likewise.
10372 (_mm_maskz_fmul_pch): Likewise.
10373 (_mm256_fmul_pch): Likewise.
10374 (_mm256_mask_fmul_pch): Likewise.
10375 (_mm256_maskz_fmul_pch): Likewise.
10376 (_mm_fcmul_pch): Likewise.
10377 (_mm_mask_fcmul_pch): Likewise.
10378 (_mm_maskz_fcmul_pch): Likewise.
10379 (_mm256_fcmul_pch): Likewise.
10380 (_mm256_mask_fcmul_pch): Likewise.
10381 (_mm256_maskz_fcmul_pch): Likewise.
10382 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF_V8HF_V8HF,
10383 V8HF_FTYPE_V16HF_V16HF_V16HF, V16HF_FTYPE_V16HF_V16HF_V16HF_UQI,
10384 V32HF_FTYPE_V32HF_V32HF_V32HF_INT,
10385 V32HF_FTYPE_V32HF_V32HF_V32HF_UHI_INT): Add new builtin types.
10386 * config/i386/i386-builtin.def: Add new builtins.
10387 * config/i386/i386-expand.c: Handle new builtin types.
10388 * config/i386/subst.md (SUBST_CV): New.
10389 (maskc_name): Ditto.
10390 (maskc_operand3): Ditto.
10392 (sdc_maskz_name): Ditto.
10393 (sdc_mask_op4): Ditto.
10394 (sdc_mask_op5): Ditto.
10395 (sdc_mask_mode512bit_condition): Ditto.
10397 (round_maskc_operand3): Ditto.
10398 (round_sdc_mask_operand4): Ditto.
10399 (round_maskc_op3): Ditto.
10400 (round_sdc_mask_op4): Ditto.
10401 (round_saeonly_sdc_mask_operand5): Ditto.
10402 * config/i386/sse.md (unspec): Add complex fma unspecs.
10403 (avx512fmaskcmode): New.
10404 (UNSPEC_COMPLEX_F_C_MA): Ditto.
10405 (UNSPEC_COMPLEX_F_C_MUL): Ditto.
10406 (complexopname): Ditto.
10407 (<avx512>_fmaddc_<mode>_maskz<round_expand_name>): New expander.
10408 (<avx512>_fcmaddc_<mode>_maskz<round_expand_name>): Ditto.
10409 (fma_<complexopname>_<mode><sdc_maskz_name><round_name>): New
10411 (<avx512>_<complexopname>_<mode>_mask<round_name>): Ditto.
10412 (<avx512>_<complexopname>_<mode><maskc_name><round_name>): Ditto.
10414 2021-09-22 Kewen Lin <linkw@linux.ibm.com>
10416 * config/rs6000/rs6000.opt (rs6000-density-pct-threshold,
10417 rs6000-density-size-threshold, rs6000-density-penalty,
10418 rs6000-density-load-pct-threshold,
10419 rs6000-density-load-num-threshold): New parameters.
10420 * config/rs6000/rs6000.c (rs6000_density_test): Adjust with
10421 corresponding parameters.
10423 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10425 * gimple-range-path.cc (path_range_query::defined_outside_path):
10427 (path_range_query::range_on_path_entry): New.
10428 (path_range_query::internal_range_of_expr): Resolve unknowns
10430 (path_range_query::improve_range_with_equivs): New.
10431 (path_range_query::ssa_range_in_phi): Resolve unknowns with
10433 * gimple-range-path.h (class path_range_query): Add
10434 defined_outside_path, range_on_path_entry, and
10435 improve_range_with_equivs.
10437 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10439 * gimple-range-path.cc (path_range_query::add_to_imports): New.
10440 (path_range_query::add_copies_to_imports): New.
10441 (path_range_query::precompute_ranges): Call
10442 add_copies_to_imports.
10443 * gimple-range-path.h (class path_range_query): Add prototypes
10444 for add_copies_to_imports and add_to_imports.
10446 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10448 * gimple-range-path.cc (path_range_query::range_defined_in_block):
10449 Remove useless code.
10451 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10453 * gimple-range-fold.h (class fur_source): Make oracle protected.
10454 * gimple-range-path.cc (path_range_query::path_range_query): Add
10455 resolve argument. Initialize oracle.
10456 (path_range_query::~path_range_query): Delete oracle.
10457 (path_range_query::range_of_stmt): Adapt to use relations.
10458 (path_range_query::precompute_ranges): Pre-compute relations.
10459 (class jt_fur_source): New.
10460 (jt_fur_source::jt_fur_source): New.
10461 (jt_fur_source::register_relation): New.
10462 (jt_fur_source::query_relation): New.
10463 (path_range_query::precompute_relations): New.
10464 (path_range_query::precompute_phi_relations): New.
10465 * gimple-range-path.h (path_range_query): Add resolve argument.
10466 Add oracle, precompute_relations, precompute_phi_relations.
10467 * tree-ssa-threadbackward.c (back_threader::back_threader): Pass
10468 resolve argument to solver.
10470 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10472 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
10473 Rename postfold_gcond_edges to register_outgoing_edges and
10475 (fold_using_range::postfold_gcond_edges): Rename...
10476 (fur_source::register_outgoing_edges): ...to this.
10477 * gimple-range-fold.h (postfold_gcond_edges): Rename to
10478 register_outgoing_edges and move to fur_source.
10480 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10482 * gimple-range-fold.cc (fold_using_range::range_of_phi): Check
10483 dom_info_available_p.
10485 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10487 * gimple-range-cache.cc (non_null_ref::non_null_ref): Use create
10488 and quick_grow_cleared instead of safe_grow_cleared.
10490 2021-09-21 Thomas Schwinge <thomas@codesourcery.com>
10493 * omp-oacc-neuter-broadcast.cc (oacc_do_neutering): Evaluate
10494 'random ()' to '0'.
10496 2021-09-21 Richard Earnshaw <rearnsha@arm.com>
10498 * configure.ac: Detect when the assembler supports new-style
10499 architecture extensions.
10500 * common/config/arm/arm-common.c (arm_rewrite_mcpu): Return
10501 the full CPU string if the assembler can grok it.
10502 (arm_rewrite_march): Likewise but for the architecture.
10503 * config.in: Regenerate.
10504 * configure: Regenerate.
10506 2021-09-21 Richard Biener <rguenther@suse.de>
10508 PR tree-optimization/102421
10509 * tree-vect-loop.c (vect_dissolve_slp_only_groups): Copy and
10510 adjust alignment info.
10512 2021-09-21 Kewen Lin <linkw@linux.ibm.com>
10514 * ipa-fnsummary.c (ipa_fn_summary_write): Remove inconsistent
10515 bitfield stream out.
10517 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
10519 * gimple-range-fold.cc (fold_using_range::range_of_phi): Ignore
10520 undefined edges, apply an equivalence if appropriate.
10521 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Return
10522 UNDEFINED if EDGE_EXECUTABLE is not set.
10523 * gimple-range.cc (gimple_ranger::gimple_ranger): Set all edges
10524 as EXECUTABLE upon startup.
10525 (gimple_ranger::range_on_edge): Return UNDEFINED for edges without
10526 EDGE_EXECUTABLE set.
10527 * vr-values.c (set_and_propagate_unexecutable): New.
10528 (simplify_using_ranges::fold_cond): Call set_and_propagate_unexecutable.
10529 (simplify_using_ranges::simplify_switch_using_ranges): Ditto.
10530 * vr-values.h: Add prototype.
10532 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
10534 * value-relation.cc (equiv_oracle::register_initial_def): New.
10535 (equiv_oracle::register_relation): Call register_initial_def.
10536 (equiv_oracle::add_equiv_to_block): New. Split register_relation.
10537 (relation_oracle::register_stmt): Check def block of PHI arguments.
10538 * value-relation.h (equiv_oracle): Add new prototypes.
10540 2021-09-20 Matthias Kretz <m.kretz@gsi.de>
10542 * cppbuiltin.c (define_builtin_macros_for_compilation_flags):
10543 Define __RECIPROCAL_MATH__, __NO_SIGNED_ZEROS__,
10544 __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, and
10545 __ROUNDING_MATH__ according to their corresponding flags.
10546 * doc/cpp.texi: Document __RECIPROCAL_MATH__,
10547 __NO_SIGNED_ZEROS__, __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__,
10548 and __ROUNDING_MATH__.
10550 2021-09-20 Richard Biener <rguenther@suse.de>
10552 * tree-vect-stmts.c (vectorizable_load): Use the vectype
10555 2021-09-20 Richard Biener <rguenther@suse.de>
10557 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
10558 Do not compute alignment of the vectorized access here.
10560 2021-09-20 Richard Biener <rguenther@suse.de>
10562 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
10563 Store -1 for runtime alias peeling iterations.
10565 2021-09-20 Richard Biener <rguenther@suse.de>
10567 * config.gcc: Obsolete hppa[12]*-*-hpux10* and hppa[12]*-*-hpux11*.
10569 2021-09-20 Thomas Schwinge <thomas@codesourcery.com>
10571 * input.c (string_concat_db::record_string_concatenation)
10572 (string_concat_db::get_string_concatenation): Skip for
10573 'RESERVED_LOCATION_P'.
10575 2021-09-20 Richard Biener <rguenther@suse.de>
10577 PR tree-optimization/65206
10578 * tree-data-ref.h (struct data_reference): Add alt_indices,
10580 * tree-data-ref.c (free_data_ref): Release alt_indices.
10581 (dr_analyze_indices): Work on struct indices and get DR_REF as tree.
10582 (create_data_ref): Adjust.
10583 (initialize_data_dependence_relation): Split into head
10584 and tail. When the base objects fail to match up try
10585 again with pointer-based analysis of indices.
10586 * tree-vectorizer.c (vec_info_shared::check_datarefs): Do
10587 not compare the lazily computed alternate set of indices.
10589 2021-09-20 Iain Sandoe <iain@sandoe.co.uk>
10591 * gcc.c: Test for execute OK when we find the
10592 programs for assembler, linker and dsymutil and those
10593 were specified at configure-time.
10595 2021-09-19 Martin Sebor <msebor@redhat.com>
10597 PR middle-end/102403
10598 * gimple-predicate-analysis.cc (predicate::init_from_control_deps):
10599 Correct a function pre/postcondition.
10601 2021-09-19 Martin Sebor <msebor@redhat.com>
10603 PR middle-end/102243
10604 * tree-ssa-strlen.c (get_range): Handle null cfun.
10606 2021-09-19 Iain Sandoe <iain@sandoe.co.uk>
10608 * config/darwin.h (LINK_COMMAND_SPEC_A): Use Darwin10
10609 unwinder shim as a convenience library.
10611 2021-09-19 Andrew Pinski <apinski@marvell.com>
10613 * doc/install.texi: Add note that
10614 binutils 2.35 is required for LTO usage.
10616 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
10618 * tree-ssa-threadbackward.c
10619 (back_threader_registry::register_path): Use push_edge.
10620 * tree-ssa-threadedge.c
10621 (jump_threader::thread_around_empty_blocks): Same.
10622 (jump_threader::thread_through_normal_block): Same.
10623 (jump_threader::thread_across_edge): Same. Also, use auto_bitmap.
10625 * tree-ssa-threadupdate.c
10626 (jt_path_registry::allocate_thread_edge): Remove.
10627 (jt_path_registry::push_edge): New.
10628 (dump_jump_thread_path): Make static.
10629 * tree-ssa-threadupdate.h (allocate_thread_edge): Remove.
10632 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
10634 * gimple-range-path.cc (path_range_query::path_range_query): Add
10636 (path_range_query::dump): Remove extern declaration of dump_ranger.
10637 * gimple-range-trace.cc (dump_ranger): Add DEBUG_FUNCTION marker.
10638 * gimple-range-trace.h (dump_ranger): Add prototype.
10640 2021-09-19 John Ericson <git@JohnEricson.me>
10642 * gcc.c (find_a_program): New function, factored out of...
10643 (find_a_file): Here.
10644 (execute): Use find_a_program when looking for programs rather
10647 2021-09-19 Matwey V. Kornilov <matwey.kornilov@gmail.com>
10649 * config/avr/avr-mcus.def: Add atmega324pb.
10650 * doc/avr-mmcu.texi: Corresponding changes.
10652 2021-09-19 Roger Sayle <roger@nextmovesoftware.com>
10654 PR middle-end/88173
10655 * match.pd (cmp @0 REAL_CST@1): When @0 is also REAL_CST, apply
10656 the same transformations as to @1. For comparisons against NaN,
10657 don't check HONOR_SNANS but confirm that neither operand is a
10660 2021-09-19 Benjamin Peterson <benjamin@locrian.net>
10662 * attribs.c (make_unique_name): Delete.
10663 * attribs.h (make_unique_name): Delete.
10665 2021-09-19 Andrew Pinski <apinski@marvell.com>
10667 * lra-constraints.c (check_and_process_move): Assert
10668 that dclass and sclass are greater than or equal to NO_REGS.
10670 2021-09-18 Jakub Jelinek <jakub@redhat.com>
10672 * tree.h (OMP_CLAUSE_ORDER_UNCONSTRAINED): Define.
10673 * tree-pretty-print.c (dump_omp_clause): Print unconstrained:
10674 for OMP_CLAUSE_ORDER_UNCONSTRAINED.
10676 2021-09-18 liuhongt <hongtao.liu@intel.com>
10678 * config/i386/i386-features.c (remove_partial_avx_dependency):
10679 Restrict TARGET_USE_VECTOR_FP_CONVERTS and
10680 TARGET_USE_VECTOR_CONVERTS to conversion instructions only.
10682 2021-09-18 Jakub Jelinek <jakub@redhat.com>
10684 * gimplify.c (omp_default_clause): For C/C++ default({,first}private),
10685 if file/namespace scope variable doesn't have predetermined sharing,
10686 treat it as if there was default(none).
10688 2021-09-18 liuhongt <hongtao.liu@intel.com>
10690 * config/i386/avx512fp16intrin.h (_mm_fmadd_sh):
10692 (_mm_mask_fmadd_sh): Likewise.
10693 (_mm_mask3_fmadd_sh): Likewise.
10694 (_mm_maskz_fmadd_sh): Likewise.
10695 (_mm_fmadd_round_sh): Likewise.
10696 (_mm_mask_fmadd_round_sh): Likewise.
10697 (_mm_mask3_fmadd_round_sh): Likewise.
10698 (_mm_maskz_fmadd_round_sh): Likewise.
10699 (_mm_fnmadd_sh): Likewise.
10700 (_mm_mask_fnmadd_sh): Likewise.
10701 (_mm_mask3_fnmadd_sh): Likewise.
10702 (_mm_maskz_fnmadd_sh): Likewise.
10703 (_mm_fnmadd_round_sh): Likewise.
10704 (_mm_mask_fnmadd_round_sh): Likewise.
10705 (_mm_mask3_fnmadd_round_sh): Likewise.
10706 (_mm_maskz_fnmadd_round_sh): Likewise.
10707 (_mm_fmsub_sh): Likewise.
10708 (_mm_mask_fmsub_sh): Likewise.
10709 (_mm_mask3_fmsub_sh): Likewise.
10710 (_mm_maskz_fmsub_sh): Likewise.
10711 (_mm_fmsub_round_sh): Likewise.
10712 (_mm_mask_fmsub_round_sh): Likewise.
10713 (_mm_mask3_fmsub_round_sh): Likewise.
10714 (_mm_maskz_fmsub_round_sh): Likewise.
10715 (_mm_fnmsub_sh): Likewise.
10716 (_mm_mask_fnmsub_sh): Likewise.
10717 (_mm_mask3_fnmsub_sh): Likewise.
10718 (_mm_maskz_fnmsub_sh): Likewise.
10719 (_mm_fnmsub_round_sh): Likewise.
10720 (_mm_mask_fnmsub_round_sh): Likewise.
10721 (_mm_mask3_fnmsub_round_sh): Likewise.
10722 (_mm_maskz_fnmsub_round_sh): Likewise.
10723 * config/i386/i386-builtin-types.def
10724 (V8HF_FTYPE_V8HF_V8HF_V8HF_UQI_INT): New builtin type.
10725 * config/i386/i386-builtin.def: Add new builtins.
10726 * config/i386/i386-expand.c: Handle new builtin type.
10727 * config/i386/sse.md (fmai_vmfmadd_<mode><round_name>):
10728 Adjust to support FP16.
10729 (fmai_vmfmsub_<mode><round_name>): Ditto.
10730 (fmai_vmfnmadd_<mode><round_name>): Ditto.
10731 (fmai_vmfnmsub_<mode><round_name>): Ditto.
10732 (*fmai_fmadd_<mode>): Ditto.
10733 (*fmai_fmsub_<mode>): Ditto.
10734 (*fmai_fnmadd_<mode><round_name>): Ditto.
10735 (*fmai_fnmsub_<mode><round_name>): Ditto.
10736 (avx512f_vmfmadd_<mode>_mask<round_name>): Ditto.
10737 (avx512f_vmfmadd_<mode>_mask3<round_name>): Ditto.
10738 (avx512f_vmfmadd_<mode>_maskz<round_expand_name>): Ditto.
10739 (avx512f_vmfmadd_<mode>_maskz_1<round_name>): Ditto.
10740 (*avx512f_vmfmsub_<mode>_mask<round_name>): Ditto.
10741 (avx512f_vmfmsub_<mode>_mask3<round_name>): Ditto.
10742 (*avx512f_vmfmsub_<mode>_maskz_1<round_name>): Ditto.
10743 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
10744 (*avx512f_vmfnmsub_<mode>_mask3<round_name>): Ditto.
10745 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
10746 (*avx512f_vmfnmadd_<mode>_mask<round_name>): Renamed to ...
10747 (avx512f_vmfnmadd_<mode>_mask<round_name>) ... this, and
10748 adjust to support FP16.
10749 (avx512f_vmfnmadd_<mode>_mask3<round_name>): Ditto.
10750 (avx512f_vmfnmadd_<mode>_maskz_1<round_name>): Ditto.
10751 (avx512f_vmfnmadd_<mode>_maskz<round_expand_name>): New
10754 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
10756 * config/i386/sse.md (avx512fmaskmodelower): Extend to support
10758 (maskload<mode><avx512fmaskmodelower>): Ditto.
10759 (maskstore<mode><avx512fmaskmodelower>): Ditto.
10761 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
10763 * config/i386/i386-expand.c (ix86_expand_fp_absneg_operator):
10765 (ix86_expand_copysign): Ditto.
10766 (ix86_expand_xorsign): Ditto.
10767 * config/i386/i386.c (ix86_build_const_vector): Handle HF vector
10769 (ix86_build_signbit_mask): Ditto.
10770 (ix86_can_change_mode_class): Ditto.
10771 * config/i386/i386.md
10772 (SSEMODEF): Add HFmode.
10773 (ssevecmodef): Ditto.
10774 (<code>hf2): New define_expand.
10775 (*<code>hf2_1): New define_insn_and_split.
10776 (copysign<mode>): Extend to support HFmode under AVX512FP16.
10777 (xorsign<mode>): Ditto.
10778 * config/i386/sse.md (VFB): New mode iterator.
10779 (VFB_128_256): Ditto.
10781 (sseintvecmode2): Support HF vector mode.
10782 (<code><mode>2): Use new mode iterator.
10783 (*<code><mode>2): Ditto.
10784 (copysign<mode>3): Ditto.
10785 (xorsign<mode>3): Ditto.
10786 (<code><mode>3<mask_name>): Ditto.
10787 (<code><mode>3<mask_name>): Ditto.
10788 (<sse>_andnot<mode>3<mask_name>): Adjust for HF vector mode.
10789 (<sse>_andnot<mode>3<mask_name>): Ditto.
10790 (*<code><mode>3<mask_name>): Ditto.
10791 (*<code><mode>3<mask_name>): Ditto.
10793 2021-09-18 liuhongt <hongtao.liu@intel.com>
10795 * config/i386/avx512fp16intrin.h (_mm512_mask_fmadd_ph):
10797 (_mm512_mask3_fmadd_ph): Likewise.
10798 (_mm512_maskz_fmadd_ph): Likewise.
10799 (_mm512_fmadd_round_ph): Likewise.
10800 (_mm512_mask_fmadd_round_ph): Likewise.
10801 (_mm512_mask3_fmadd_round_ph): Likewise.
10802 (_mm512_maskz_fmadd_round_ph): Likewise.
10803 (_mm512_fnmadd_ph): Likewise.
10804 (_mm512_mask_fnmadd_ph): Likewise.
10805 (_mm512_mask3_fnmadd_ph): Likewise.
10806 (_mm512_maskz_fnmadd_ph): Likewise.
10807 (_mm512_fnmadd_round_ph): Likewise.
10808 (_mm512_mask_fnmadd_round_ph): Likewise.
10809 (_mm512_mask3_fnmadd_round_ph): Likewise.
10810 (_mm512_maskz_fnmadd_round_ph): Likewise.
10811 (_mm512_fmsub_ph): Likewise.
10812 (_mm512_mask_fmsub_ph): Likewise.
10813 (_mm512_mask3_fmsub_ph): Likewise.
10814 (_mm512_maskz_fmsub_ph): Likewise.
10815 (_mm512_fmsub_round_ph): Likewise.
10816 (_mm512_mask_fmsub_round_ph): Likewise.
10817 (_mm512_mask3_fmsub_round_ph): Likewise.
10818 (_mm512_maskz_fmsub_round_ph): Likewise.
10819 (_mm512_fnmsub_ph): Likewise.
10820 (_mm512_mask_fnmsub_ph): Likewise.
10821 (_mm512_mask3_fnmsub_ph): Likewise.
10822 (_mm512_maskz_fnmsub_ph): Likewise.
10823 (_mm512_fnmsub_round_ph): Likewise.
10824 (_mm512_mask_fnmsub_round_ph): Likewise.
10825 (_mm512_mask3_fnmsub_round_ph): Likewise.
10826 (_mm512_maskz_fnmsub_round_ph): Likewise.
10827 * config/i386/avx512fp16vlintrin.h (_mm256_fmadd_ph):
10829 (_mm256_mask_fmadd_ph): Likewise.
10830 (_mm256_mask3_fmadd_ph): Likewise.
10831 (_mm256_maskz_fmadd_ph): Likewise.
10832 (_mm_fmadd_ph): Likewise.
10833 (_mm_mask_fmadd_ph): Likewise.
10834 (_mm_mask3_fmadd_ph): Likewise.
10835 (_mm_maskz_fmadd_ph): Likewise.
10836 (_mm256_fnmadd_ph): Likewise.
10837 (_mm256_mask_fnmadd_ph): Likewise.
10838 (_mm256_mask3_fnmadd_ph): Likewise.
10839 (_mm256_maskz_fnmadd_ph): Likewise.
10840 (_mm_fnmadd_ph): Likewise.
10841 (_mm_mask_fnmadd_ph): Likewise.
10842 (_mm_mask3_fnmadd_ph): Likewise.
10843 (_mm_maskz_fnmadd_ph): Likewise.
10844 (_mm256_fmsub_ph): Likewise.
10845 (_mm256_mask_fmsub_ph): Likewise.
10846 (_mm256_mask3_fmsub_ph): Likewise.
10847 (_mm256_maskz_fmsub_ph): Likewise.
10848 (_mm_fmsub_ph): Likewise.
10849 (_mm_mask_fmsub_ph): Likewise.
10850 (_mm_mask3_fmsub_ph): Likewise.
10851 (_mm_maskz_fmsub_ph): Likewise.
10852 (_mm256_fnmsub_ph): Likewise.
10853 (_mm256_mask_fnmsub_ph): Likewise.
10854 (_mm256_mask3_fnmsub_ph): Likewise.
10855 (_mm256_maskz_fnmsub_ph): Likewise.
10856 (_mm_fnmsub_ph): Likewise.
10857 (_mm_mask_fnmsub_ph): Likewise.
10858 (_mm_mask3_fnmsub_ph): Likewise.
10859 (_mm_maskz_fnmsub_ph): Likewise.
10860 * config/i386/i386-builtin.def: Add corresponding new builtins.
10861 * config/i386/sse.md
10862 (<avx512>_fmadd_<mode>_maskz<round_expand_name>): Adjust to
10863 support HF vector modes.
10864 (<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name><round_name>):
10866 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
10867 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
10868 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
10869 (<avx512>_fmadd_<mode>_mask<round_name>): Ditto.
10870 (<avx512>_fmadd_<mode>_mask3<round_name>): Ditto.
10871 (<avx512>_fmsub_<mode>_maskz<round_expand_name>): Ditto.
10872 (<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name><round_name>):
10874 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
10875 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
10876 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
10877 (<avx512>_fmsub_<mode>_mask<round_name>): Ditto.
10878 (<avx512>_fmsub_<mode>_mask3<round_name>): Ditto.
10879 (<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name><round_name>):
10881 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
10882 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
10883 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
10884 (<avx512>_fnmadd_<mode>_mask<round_name>): Ditto.
10885 (<avx512>_fnmadd_<mode>_mask3<round_name>): Ditto.
10886 (<avx512>_fnmsub_<mode>_maskz<round_expand_name>): Ditto.
10887 (<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name><round_name>):
10889 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
10890 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
10891 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
10892 (<avx512>_fnmsub_<mode>_mask<round_name>): Ditto.
10893 (<avx512>_fnmsub_<mode>_mask3<round_name>): Ditto.
10895 2021-09-18 liuhongt <hongtao.liu@intel.com>
10897 * config/i386/avx512fp16intrin.h (_mm512_fmaddsub_ph):
10899 (_mm512_mask_fmaddsub_ph): Likewise.
10900 (_mm512_mask3_fmaddsub_ph): Likewise.
10901 (_mm512_maskz_fmaddsub_ph): Likewise.
10902 (_mm512_fmaddsub_round_ph): Likewise.
10903 (_mm512_mask_fmaddsub_round_ph): Likewise.
10904 (_mm512_mask3_fmaddsub_round_ph): Likewise.
10905 (_mm512_maskz_fmaddsub_round_ph): Likewise.
10906 (_mm512_mask_fmsubadd_ph): Likewise.
10907 (_mm512_mask3_fmsubadd_ph): Likewise.
10908 (_mm512_maskz_fmsubadd_ph): Likewise.
10909 (_mm512_fmsubadd_round_ph): Likewise.
10910 (_mm512_mask_fmsubadd_round_ph): Likewise.
10911 (_mm512_mask3_fmsubadd_round_ph): Likewise.
10912 (_mm512_maskz_fmsubadd_round_ph): Likewise.
10913 * config/i386/avx512fp16vlintrin.h (_mm256_fmaddsub_ph):
10915 (_mm256_mask_fmaddsub_ph): Likewise.
10916 (_mm256_mask3_fmaddsub_ph): Likewise.
10917 (_mm256_maskz_fmaddsub_ph): Likewise.
10918 (_mm_fmaddsub_ph): Likewise.
10919 (_mm_mask_fmaddsub_ph): Likewise.
10920 (_mm_mask3_fmaddsub_ph): Likewise.
10921 (_mm_maskz_fmaddsub_ph): Likewise.
10922 (_mm256_fmsubadd_ph): Likewise.
10923 (_mm256_mask_fmsubadd_ph): Likewise.
10924 (_mm256_mask3_fmsubadd_ph): Likewise.
10925 (_mm256_maskz_fmsubadd_ph): Likewise.
10926 (_mm_fmsubadd_ph): Likewise.
10927 (_mm_mask_fmsubadd_ph): Likewise.
10928 (_mm_mask3_fmsubadd_ph): Likewise.
10929 (_mm_maskz_fmsubadd_ph): Likewise.
10930 * config/i386/i386-builtin.def: Add corresponding new builtins.
10931 * config/i386/sse.md (VFH_SF_AVX512VL): New mode iterator.
10932 (<avx512>_fmsubadd_<mode>_maskz<round_expand_name>): New expander.
10933 (<avx512>_fmaddsub_<mode>_maskz<round_expand_name>): Use
10935 (<sd_mask_codefor>fma_fmaddsub_<mode><sd_maskz_name><round_name>):
10937 (<avx512>_fmaddsub_<mode>_mask<round_name>): Ditto.
10938 (<avx512>_fmaddsub_<mode>_mask3<round_name>): Ditto.
10939 (<sd_mask_codefor>fma_fmsubadd_<mode><sd_maskz_name><round_name>):
10941 (<avx512>_fmsubadd_<mode>_mask<round_name>): Ditto.
10942 (<avx512>_fmsubadd_<mode>_mask3<round_name>): Ditto.
10944 2021-09-18 liuhongt <hongtao.liu@intel.com>
10947 * config/i386/i386.c (ix86_print_operand): Handle
10948 V8HF/V16HF/V32HFmode.
10949 * config/i386/i386.h (VALID_BCST_MODE_P): Add HFmode.
10950 * config/i386/sse.md (avx512bcst): Remove.
10952 2021-09-17 Martin Sebor <msebor@redhat.com>
10954 * Makefile.in (OBJS): Add gimple-predicate-analysis.o.
10955 * tree-ssa-uninit.c (max_phi_args): Move to gimple-predicate-analysis.
10956 (MASK_SET_BIT, MASK_TEST_BIT, MASK_EMPTY): Same.
10957 (check_defs): Add comment.
10958 (can_skip_redundant_opnd): Update comment.
10959 (compute_uninit_opnds_pos): Adjust to namespace change.
10960 (find_pdom): Move to gimple-predicate-analysis.cc.
10962 (struct uninit_undef_val_t): New.
10963 (is_non_loop_exit_postdominating): Move to gimple-predicate-analysis.cc.
10964 (find_control_equiv_block): Same.
10965 (MAX_NUM_CHAINS, MAX_CHAIN_LEN, MAX_POSTDOM_CHECK): Same.
10966 (MAX_SWITCH_CASES): Same.
10967 (compute_control_dep_chain): Same.
10968 (find_uninit_use): Use predicate analyzer.
10969 (struct pred_info): Move to gimple-predicate-analysis.
10970 (convert_control_dep_chain_into_preds): Same.
10971 (find_predicates): Same.
10972 (collect_phi_def_edges): Same.
10973 (warn_uninitialized_phi): Use predicate analyzer.
10974 (find_def_preds): Move to gimple-predicate-analysis.
10975 (dump_pred_info): Same.
10976 (dump_pred_chain): Same.
10977 (dump_predicates): Same.
10978 (destroy_predicate_vecs): Remove.
10979 (execute_late_warn_uninitialized): New.
10980 (get_cmp_code): Move to gimple-predicate-analysis.
10981 (is_value_included_in): Same.
10982 (value_sat_pred_p): Same.
10983 (find_matching_predicate_in_rest_chains): Same.
10984 (is_use_properly_guarded): Same.
10985 (prune_uninit_phi_opnds): Same.
10986 (find_var_cmp_const): Same.
10987 (use_pred_not_overlap_with_undef_path_pred): Same.
10988 (pred_equal_p): Same.
10989 (is_neq_relop_p): Same.
10990 (is_neq_zero_form_p): Same.
10991 (pred_expr_equal_p): Same.
10992 (is_pred_expr_subset_of): Same.
10993 (is_pred_chain_subset_of): Same.
10994 (is_included_in): Same.
10995 (is_superset_of): Same.
10996 (pred_neg_p): Same.
10997 (simplify_pred): Same.
10998 (simplify_preds_2): Same.
10999 (simplify_preds_3): Same.
11000 (simplify_preds_4): Same.
11001 (simplify_preds): Same.
11003 (push_to_worklist): Same.
11004 (get_pred_info_from_cmp): Same.
11005 (is_degenerated_phi): Same.
11006 (normalize_one_pred_1): Same.
11007 (normalize_one_pred): Same.
11008 (normalize_one_pred_chain): Same.
11009 (normalize_preds): Same.
11010 (can_one_predicate_be_invalidated_p): Same.
11011 (can_chain_union_be_invalidated_p): Same.
11012 (uninit_uses_cannot_happen): Same.
11013 (pass_late_warn_uninitialized::execute): Define.
11014 * gimple-predicate-analysis.cc: New file.
11015 * gimple-predicate-analysis.h: New file.
11017 2021-09-17 Julian Brown <julian@codesourcery.com>
11019 * config/gcn/gcn.c (gimple.h): Include.
11020 (gcn_fork_join): Emit barrier for worker-level joins.
11021 * omp-oacc-neuter-broadcast.cc (find_local_vars_to_propagate): Add
11022 writes_gang_private bitmap parameter. Set bit for blocks
11023 containing gang-private variable writes.
11024 (worker_single_simple): Don't emit barrier after predicated block.
11025 (worker_single_copy): Don't emit barrier if we're not broadcasting
11026 anything and the block contains no gang-private writes.
11027 (neuter_worker_single): Don't predicate blocks that only contain
11028 NOPs or internal marker functions. Pass has_gang_private_write
11029 argument to worker_single_copy.
11030 (oacc_do_neutering): Add writes_gang_private bitmap handling.
11032 2021-09-17 Julian Brown <julian@codesourcery.com>
11034 * config/gcn/gcn-protos.h
11035 (gcn_goacc_create_worker_broadcast_record): Update prototype.
11036 * config/gcn/gcn-tree.c (gcn_goacc_get_worker_red_decl): Use
11037 preallocated block of LDS memory. Do not cache/share decls for
11038 reduction temporaries between invocations.
11039 (gcn_goacc_reduction_teardown): Unshare VAR on second use.
11040 (gcn_goacc_create_worker_broadcast_record): Add OFFSET parameter
11041 and return temporary LDS space at that offset. Return pointer in
11043 * config/gcn/gcn.c (acc_lds_size, gang_private_hwm, lds_allocs):
11045 (ACC_LDS_SIZE): Define as acc_lds_size.
11046 (gcn_init_machine_status): Don't initialise lds_allocated,
11047 lds_allocs, reduc_decls fields of machine function struct.
11048 (gcn_option_override): Handle default size for gang-private
11049 variables and -mgang-private-size option.
11050 (gcn_expand_prologue): Use LDS_SIZE instead of LDS_SIZE-1 when
11051 initialising M0_REG.
11052 (gcn_shared_mem_layout): New function.
11053 (gcn_print_lds_decl): Update comment. Use global lds_allocs map and
11054 gang_private_hwm variable.
11055 (TARGET_GOACC_SHARED_MEM_LAYOUT): Define target hook.
11056 * config/gcn/gcn.h (machine_function): Remove lds_allocated,
11057 lds_allocs, reduc_decls. Add reduction_base, reduction_limit.
11058 * config/gcn/gcn.opt (gang_private_size_opt): New global.
11059 (mgang-private-size=): New option.
11060 * doc/tm.texi.in (TARGET_GOACC_SHARED_MEM_LAYOUT): Place
11061 documentation hook.
11062 * doc/tm.texi: Regenerate.
11063 * omp-oacc-neuter-broadcast.cc (targhooks.h, diagnostic-core.h):
11065 (build_sender_ref): Handle sender_decl being pointer.
11066 (worker_single_copy): Add PLACEMENT and ISOLATE_BROADCASTS
11067 parameters. Pass placement argument to
11068 create_worker_broadcast_record hook invocations. Handle
11069 sender_decl being pointer and isolate_broadcasts inserting extra
11071 (blk_offset_map_t): Add typedef.
11072 (neuter_worker_single): Add BLK_OFFSET_MAP parameter. Pass
11073 preallocated range to worker_single_copy call.
11074 (dfs_broadcast_reachable_1): New function.
11075 (idx_decl_pair_t, used_range_vec_t): New typedefs.
11076 (sort_size_descending): New function.
11077 (addr_range): New class.
11078 (splay_tree_compare_addr_range, splay_tree_free_key)
11079 (first_fit_range, merge_ranges_1, merge_ranges): New functions.
11080 (execute_omp_oacc_neuter_broadcast): Rename to...
11081 (oacc_do_neutering): ... this. Add BOUNDS_LO, BOUNDS_HI
11082 parameters. Arrange layout of shared memory for broadcast
11084 (execute_omp_oacc_neuter_broadcast): New function.
11085 (pass_omp_oacc_neuter_broadcast::gate): Remove num_workers==1
11086 handling from here. Enable pass for all OpenACC routines in order
11087 to call shared-memory layout hook.
11088 * target.def (create_worker_broadcast_record): Add OFFSET
11090 (shared_mem_layout): New hook.
11092 2021-09-17 Julian Brown <julian@codesourcery.com>
11093 Thomas Schwinge <thomas@codesourcery.com>
11095 * omp-oacc-neuter-broadcast.cc
11096 (pass_omp_oacc_neuter_broadcast::gate): Disable if num_workers is
11098 (execute_omp_oacc_neuter_broadcast): Adjust.
11100 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
11102 * value-relation.cc (class equiv_chain): Move to header file.
11103 (path_oracle::path_oracle): New.
11104 (path_oracle::~path_oracle): New.
11105 (path_oracle::register_relation): New.
11106 (path_oracle::query_relation): New.
11107 (path_oracle::reset_path): New.
11108 (path_oracle::dump): New.
11109 * value-relation.h (class equiv_chain): Move to here.
11110 (class path_oracle): New.
11112 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
11114 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a DOM
11116 * gimple-range-fold.cc (fur_depend::register_relation): Use
11117 register_stmt/edge routines.
11118 * value-relation.cc (equiv_chain::find): Relocate from equiv_oracle.
11119 (equiv_oracle::equiv_oracle): Create self equivalence cache.
11120 (equiv_oracle::~equiv_oracle): Release same.
11121 (equiv_oracle::equiv_set): Return entry from self equiv cache if there
11122 are no equivalences.
11123 (equiv_oracle::find_equiv_block): Move list find to equiv_chain.
11124 (equiv_oracle::register_relation): Rename from register_equiv.
11125 (relation_chain_head::find_relation): Relocate from dom_oracle.
11126 (relation_oracle::register_stmt): New.
11127 (relation_oracle::register_edge): New.
11128 (dom_oracle::*): Rename from relation_oracle.
11129 (dom_oracle::register_relation): Adjust to call equiv_oracle.
11130 (dom_oracle::set_one_relation): Split from register_relation.
11131 (dom_oracle::register_transitives): Consolidate 2 methods.
11132 (dom_oracle::find_relation_block): Move core to relation_chain.
11133 (dom_oracle::query_relation): Rename from find_relation_dom and adjust.
11134 * value-relation.h (class relation_oracle): New pure virtual base.
11135 (class equiv_oracle): Inherit from relation_oracle and adjust.
11136 (class dom_oracle): Rename from old relation_oracle and adjust.
11138 2021-09-17 Martin Sebor <msebor@redhat.com>
11140 PR middle-end/102200
11141 * pointer-query.cc (access_ref::inform_access): Handle MIN/MAX_EXPR.
11142 (handle_min_max_size): Change argument. Store original SSA_NAME for
11143 operands to potentially distinct (sub)objects.
11144 (compute_objsize_r): Adjust call to the above.
11146 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
11148 * config/rs6000/rs6000.c (rs6000-builtins.h): New include.
11149 (rs6000_new_builtin_vectorized_function): New function.
11150 (rs6000_new_builtin_md_vectorized_function): Likewise.
11151 (rs6000_builtin_vectorized_function): Call
11152 rs6000_new_builtin_vectorized_function.
11153 (rs6000_builtin_md_vectorized_function): Call
11154 rs6000_new_builtin_md_vectorized_function.
11156 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
11158 * config/rs6000/rs6000-builtin-new.def (ASSEMBLE_ACC): Add mmaint flag.
11159 (ASSEMBLE_PAIR): Likewise.
11160 (BUILD_ACC): Likewise.
11161 (DISASSEMBLE_ACC): Likewise.
11162 (DISASSEMBLE_PAIR): Likewise.
11163 (PMXVBF16GER2): Likewise.
11164 (PMXVBF16GER2NN): Likewise.
11165 (PMXVBF16GER2NP): Likewise.
11166 (PMXVBF16GER2PN): Likewise.
11167 (PMXVBF16GER2PP): Likewise.
11168 (PMXVF16GER2): Likewise.
11169 (PMXVF16GER2NN): Likewise.
11170 (PMXVF16GER2NP): Likewise.
11171 (PMXVF16GER2PN): Likewise.
11172 (PMXVF16GER2PP): Likewise.
11173 (PMXVF32GER): Likewise.
11174 (PMXVF32GERNN): Likewise.
11175 (PMXVF32GERNP): Likewise.
11176 (PMXVF32GERPN): Likewise.
11177 (PMXVF32GERPP): Likewise.
11178 (PMXVF64GER): Likewise.
11179 (PMXVF64GERNN): Likewise.
11180 (PMXVF64GERNP): Likewise.
11181 (PMXVF64GERPN): Likewise.
11182 (PMXVF64GERPP): Likewise.
11183 (PMXVI16GER2): Likewise.
11184 (PMXVI16GER2PP): Likewise.
11185 (PMXVI16GER2S): Likewise.
11186 (PMXVI16GER2SPP): Likewise.
11187 (PMXVI4GER8): Likewise.
11188 (PMXVI4GER8PP): Likewise.
11189 (PMXVI8GER4): Likewise.
11190 (PMXVI8GER4PP): Likewise.
11191 (PMXVI8GER4SPP): Likewise.
11192 (XVBF16GER2): Likewise.
11193 (XVBF16GER2NN): Likewise.
11194 (XVBF16GER2NP): Likewise.
11195 (XVBF16GER2PN): Likewise.
11196 (XVBF16GER2PP): Likewise.
11197 (XVF16GER2): Likewise.
11198 (XVF16GER2NN): Likewise.
11199 (XVF16GER2NP): Likewise.
11200 (XVF16GER2PN): Likewise.
11201 (XVF16GER2PP): Likewise.
11202 (XVF32GER): Likewise.
11203 (XVF32GERNN): Likewise.
11204 (XVF32GERNP): Likewise.
11205 (XVF32GERPN): Likewise.
11206 (XVF32GERPP): Likewise.
11207 (XVF64GER): Likewise.
11208 (XVF64GERNN): Likewise.
11209 (XVF64GERNP): Likewise.
11210 (XVF64GERPN): Likewise.
11211 (XVF64GERPP): Likewise.
11212 (XVI16GER2): Likewise.
11213 (XVI16GER2PP): Likewise.
11214 (XVI16GER2S): Likewise.
11215 (XVI16GER2SPP): Likewise.
11216 (XVI4GER8): Likewise.
11217 (XVI4GER8PP): Likewise.
11218 (XVI8GER4): Likewise.
11219 (XVI8GER4PP): Likewise.
11220 (XVI8GER4SPP): Likewise.
11221 (XXMFACC): Likewise.
11222 (XXMTACC): Likewise.
11223 (XXSETACCZ): Likewise.
11224 (ASSEMBLE_PAIR_V): Likewise.
11225 (BUILD_PAIR): Likewise.
11226 (DISASSEMBLE_PAIR_V): Likewise.
11229 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_mma_builtin):
11230 Handle RS6000_BIF_LXVP and RS6000_BIF_STXVP.
11231 * config/rs6000/rs6000-gen-builtins.c (attrinfo): Add ismmaint.
11232 (parse_bif_attrs): Handle ismmaint.
11233 (write_decls): Add bif_mmaint_bit and bif_is_mmaint.
11234 (write_bif_static_init): Handle ismmaint.
11236 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
11238 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin): New
11240 (rs6000_gimple_fold_builtin): Call rs6000_gimple_fold_new_builtin.
11241 (rs6000_new_builtin_valid_without_lhs): New function.
11242 (rs6000_gimple_fold_new_mma_builtin): Likewise.
11243 (rs6000_gimple_fold_new_builtin): Likewise.
11245 2021-09-17 Thomas Schwinge <thomas@codesourcery.com>
11247 * hash-table.h (hash_table<Descriptor, Lazy, Allocator>::expand):
11248 Destruct stale Value objects.
11249 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
11252 2021-09-17 Roger Sayle <roger@nextmovesoftware.com>
11255 * match.pd (shift optimizations): Disable recent sign-changing
11256 optimization for shifts by zero; these will be folded later.
11258 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
11260 * config/rs6000/rs6000-builtin-new.def (__builtin_mffsl): Move from
11261 [power9] to [always].
11263 2021-09-17 Richard Biener <rguenther@suse.de>
11265 * tree-vect-stmts.c (vectorizable_load): Do not frob
11268 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
11270 * config/i386/i386-features.c (remove_partial_avx_dependency):
11271 Also check TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY and
11272 TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY before generating
11274 * config/i386/i386.h (TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY):
11276 (TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
11277 * config/i386/i386.md (SSE FP to FP splitters): Replace
11278 TARGET_SSE_PARTIAL_REG_DEPENDENCY with
11279 TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY.
11280 (SSE INT to FP splitter): Replace TARGET_SSE_PARTIAL_REG_DEPENDENCY
11281 with TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY.
11282 * config/i386/x86-tune.def
11283 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): New.
11284 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
11286 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
11289 * config/i386/i386-features.c (remove_partial_avx_dependency):
11290 Check TARGET_USE_VECTOR_FP_CONVERTS and TARGET_USE_VECTOR_CONVERTS
11291 before generating vxorps.
11293 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
11295 * config/i386/i386-options.c (processor_cost_table): Use
11296 tremont_cost for Tremont.
11297 * config/i386/x86-tune-costs.h (tremont_memcpy): New.
11298 (tremont_memset): Likewise.
11299 (tremont_cost): Likewise.
11300 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
11301 Enable for Tremont.
11303 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
11305 * common/config/i386/i386-common.c: Use Haswell scheduling model
11307 * config/i386/i386.c (ix86_sched_init_global): Prepare for Tremont
11309 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Tremont
11311 (ix86_adjust_cost): Handle Tremont.
11312 * config/i386/x86-tune.def (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY):
11313 Enable for Tremont.
11314 (X86_TUNE_USE_LEAVE): Likewise.
11315 (X86_TUNE_PUSH_MEMORY): Likewise.
11316 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
11317 (X86_TUNE_USE_CLTD): Likewise.
11318 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
11319 (X86_TUNE_AVOID_MFENCE): Likewise.
11320 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
11321 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
11322 (X86_TUNE_ACCUMULATE_OUTGOING_ARGS): Disable for Tremont.
11323 (X86_TUNE_FOUR_JUMP_LIMIT): Likewise.
11324 (X86_TUNE_OPT_AGU): Likewise.
11325 (X86_TUNE_AVOID_LEA_FOR_ADDR): Likewise.
11326 (X86_TUNE_AVOID_MEM_OPND_FOR_CMOVE): Likewise.
11327 (X86_TUNE_EXPAND_ABS): Likewise.
11328 (X86_TUNE_SPLIT_MEM_OPND_FOR_FP_CONVERTS): Likewise.
11329 (X86_TUNE_SLOW_PSHUFB): Likewise.
11331 2021-09-17 Eric Botcazou <ebotcazou@adacore.com>
11333 PR rtl-optimization/102306
11334 * combine.c (try_combine): Abort the combination if we are about to
11335 duplicate volatile references.
11337 2021-09-17 liuhongt <hongtao.liu@intel.com>
11339 * config/i386/avx512fp16intrin.h (_mm_undefined_ph):
11341 (_mm256_undefined_ph): Likewise.
11342 (_mm512_undefined_ph): Likewise.
11343 (_mm_cvtsh_h): Likewise.
11344 (_mm256_cvtsh_h): Likewise.
11345 (_mm512_cvtsh_h): Likewise.
11346 (_mm512_castph_ps): Likewise.
11347 (_mm512_castph_pd): Likewise.
11348 (_mm512_castph_si512): Likewise.
11349 (_mm512_castph512_ph128): Likewise.
11350 (_mm512_castph512_ph256): Likewise.
11351 (_mm512_castph128_ph512): Likewise.
11352 (_mm512_castph256_ph512): Likewise.
11353 (_mm512_zextph128_ph512): Likewise.
11354 (_mm512_zextph256_ph512): Likewise.
11355 (_mm512_castps_ph): Likewise.
11356 (_mm512_castpd_ph): Likewise.
11357 (_mm512_castsi512_ph): Likewise.
11358 * config/i386/avx512fp16vlintrin.h (_mm_castph_ps):
11360 (_mm256_castph_ps): Likewise.
11361 (_mm_castph_pd): Likewise.
11362 (_mm256_castph_pd): Likewise.
11363 (_mm_castph_si128): Likewise.
11364 (_mm256_castph_si256): Likewise.
11365 (_mm_castps_ph): Likewise.
11366 (_mm256_castps_ph): Likewise.
11367 (_mm_castpd_ph): Likewise.
11368 (_mm256_castpd_ph): Likewise.
11369 (_mm_castsi128_ph): Likewise.
11370 (_mm256_castsi256_ph): Likewise.
11371 (_mm256_castph256_ph128): Likewise.
11372 (_mm256_castph128_ph256): Likewise.
11373 (_mm256_zextph128_ph256): Likewise.
11375 2021-09-17 liuhongt <hongtao.liu@intel.com>
11377 * config/i386/avx512fp16intrin.h (_mm_cvtsh_ss):
11379 (_mm_mask_cvtsh_ss): Likewise.
11380 (_mm_maskz_cvtsh_ss): Likewise.
11381 (_mm_cvtsh_sd): Likewise.
11382 (_mm_mask_cvtsh_sd): Likewise.
11383 (_mm_maskz_cvtsh_sd): Likewise.
11384 (_mm_cvt_roundsh_ss): Likewise.
11385 (_mm_mask_cvt_roundsh_ss): Likewise.
11386 (_mm_maskz_cvt_roundsh_ss): Likewise.
11387 (_mm_cvt_roundsh_sd): Likewise.
11388 (_mm_mask_cvt_roundsh_sd): Likewise.
11389 (_mm_maskz_cvt_roundsh_sd): Likewise.
11390 (_mm_cvtss_sh): Likewise.
11391 (_mm_mask_cvtss_sh): Likewise.
11392 (_mm_maskz_cvtss_sh): Likewise.
11393 (_mm_cvtsd_sh): Likewise.
11394 (_mm_mask_cvtsd_sh): Likewise.
11395 (_mm_maskz_cvtsd_sh): Likewise.
11396 (_mm_cvt_roundss_sh): Likewise.
11397 (_mm_mask_cvt_roundss_sh): Likewise.
11398 (_mm_maskz_cvt_roundss_sh): Likewise.
11399 (_mm_cvt_roundsd_sh): Likewise.
11400 (_mm_mask_cvt_roundsd_sh): Likewise.
11401 (_mm_maskz_cvt_roundsd_sh): Likewise.
11402 * config/i386/i386-builtin-types.def
11403 (V8HF_FTYPE_V2DF_V8HF_V8HF_UQI_INT,
11404 V8HF_FTYPE_V4SF_V8HF_V8HF_UQI_INT,
11405 V2DF_FTYPE_V8HF_V2DF_V2DF_UQI_INT,
11406 V4SF_FTYPE_V8HF_V4SF_V4SF_UQI_INT): Add new builtin types.
11407 * config/i386/i386-builtin.def: Add corresponding new builtins.
11408 * config/i386/i386-expand.c: Handle new builtin types.
11409 * config/i386/sse.md (VF48_128): New mode iterator.
11410 (avx512fp16_vcvtsh2<ssescalarmodesuffix><mask_scalar_name><round_saeonly_scalar_name>):
11412 (avx512fp16_vcvt<ssescalarmodesuffix>2sh<mask_scalar_name><round_scalar_name>):
11415 2021-09-17 liuhongt <hongtao.liu@intel.com>
11417 * config/i386/avx512fp16intrin.h (_mm512_cvtph_pd):
11419 (_mm512_mask_cvtph_pd): Likewise.
11420 (_mm512_maskz_cvtph_pd): Likewise.
11421 (_mm512_cvt_roundph_pd): Likewise.
11422 (_mm512_mask_cvt_roundph_pd): Likewise.
11423 (_mm512_maskz_cvt_roundph_pd): Likewise.
11424 (_mm512_cvtxph_ps): Likewise.
11425 (_mm512_mask_cvtxph_ps): Likewise.
11426 (_mm512_maskz_cvtxph_ps): Likewise.
11427 (_mm512_cvtx_roundph_ps): Likewise.
11428 (_mm512_mask_cvtx_roundph_ps): Likewise.
11429 (_mm512_maskz_cvtx_roundph_ps): Likewise.
11430 (_mm512_cvtxps_ph): Likewise.
11431 (_mm512_mask_cvtxps_ph): Likewise.
11432 (_mm512_maskz_cvtxps_ph): Likewise.
11433 (_mm512_cvtx_roundps_ph): Likewise.
11434 (_mm512_mask_cvtx_roundps_ph): Likewise.
11435 (_mm512_maskz_cvtx_roundps_ph): Likewise.
11436 (_mm512_cvtpd_ph): Likewise.
11437 (_mm512_mask_cvtpd_ph): Likewise.
11438 (_mm512_maskz_cvtpd_ph): Likewise.
11439 (_mm512_cvt_roundpd_ph): Likewise.
11440 (_mm512_mask_cvt_roundpd_ph): Likewise.
11441 (_mm512_maskz_cvt_roundpd_ph): Likewise.
11442 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_pd):
11444 (_mm_mask_cvtph_pd): Likewise.
11445 (_mm_maskz_cvtph_pd): Likewise.
11446 (_mm256_cvtph_pd): Likewise.
11447 (_mm256_mask_cvtph_pd): Likewise.
11448 (_mm256_maskz_cvtph_pd): Likewise.
11449 (_mm_cvtxph_ps): Likewise.
11450 (_mm_mask_cvtxph_ps): Likewise.
11451 (_mm_maskz_cvtxph_ps): Likewise.
11452 (_mm256_cvtxph_ps): Likewise.
11453 (_mm256_mask_cvtxph_ps): Likewise.
11454 (_mm256_maskz_cvtxph_ps): Likewise.
11455 (_mm_cvtxps_ph): Likewise.
11456 (_mm_mask_cvtxps_ph): Likewise.
11457 (_mm_maskz_cvtxps_ph): Likewise.
11458 (_mm256_cvtxps_ph): Likewise.
11459 (_mm256_mask_cvtxps_ph): Likewise.
11460 (_mm256_maskz_cvtxps_ph): Likewise.
11461 (_mm_cvtpd_ph): Likewise.
11462 (_mm_mask_cvtpd_ph): Likewise.
11463 (_mm_maskz_cvtpd_ph): Likewise.
11464 (_mm256_cvtpd_ph): Likewise.
11465 (_mm256_mask_cvtpd_ph): Likewise.
11466 (_mm256_maskz_cvtpd_ph): Likewise.
11467 * config/i386/i386-builtin.def: Add corresponding new builtins.
11468 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11469 * config/i386/i386-expand.c: Handle new builtin types.
11470 * config/i386/sse.md
11471 (VF4_128_8_256): New.
11472 (VF48H_AVX512VL): Ditto.
11473 (ssePHmode): Add HF vector modes.
11474 (castmode): Add new convertible modes.
11475 (qq2phsuff): Ditto.
11476 (ph2pssuffix): New.
11477 (avx512fp16_vcvt<castmode>2ph_<mode><mask_name><round_name>): Ditto.
11478 (avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
11479 (*avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
11480 (avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
11481 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
11482 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask_1): Ditto.
11483 (avx512fp16_float_extend_ph<mode>2<mask_name><round_saeonly_name>):
11485 (avx512fp16_float_extend_ph<mode>2<mask_name>): Ditto.
11486 (*avx512fp16_float_extend_ph<mode>2_load<mask_name>): Ditto.
11487 (avx512fp16_float_extend_phv2df2<mask_name>): Ditto.
11488 (*avx512fp16_float_extend_phv2df2_load<mask_name>): Ditto.
11490 2021-09-17 liuhongt <hongtao.liu@intel.com>
11492 * config/i386/avx512fp16intrin.h (_mm_cvttsh_i32):
11494 (_mm_cvttsh_u32): Likewise.
11495 (_mm_cvtt_roundsh_i32): Likewise.
11496 (_mm_cvtt_roundsh_u32): Likewise.
11497 (_mm_cvttsh_i64): Likewise.
11498 (_mm_cvttsh_u64): Likewise.
11499 (_mm_cvtt_roundsh_i64): Likewise.
11500 (_mm_cvtt_roundsh_u64): Likewise.
11501 * config/i386/i386-builtin.def: Add corresponding new builtins.
11502 * config/i386/sse.md
11503 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<round_saeonly_name>):
11506 2021-09-17 liuhongt <hongtao.liu@intel.com>
11508 * config/i386/avx512fp16intrin.h (_mm512_cvttph_epi32):
11510 (_mm512_mask_cvttph_epi32): Likewise.
11511 (_mm512_maskz_cvttph_epi32): Likewise.
11512 (_mm512_cvtt_roundph_epi32): Likewise.
11513 (_mm512_mask_cvtt_roundph_epi32): Likewise.
11514 (_mm512_maskz_cvtt_roundph_epi32): Likewise.
11515 (_mm512_cvttph_epu32): Likewise.
11516 (_mm512_mask_cvttph_epu32): Likewise.
11517 (_mm512_maskz_cvttph_epu32): Likewise.
11518 (_mm512_cvtt_roundph_epu32): Likewise.
11519 (_mm512_mask_cvtt_roundph_epu32): Likewise.
11520 (_mm512_maskz_cvtt_roundph_epu32): Likewise.
11521 (_mm512_cvttph_epi64): Likewise.
11522 (_mm512_mask_cvttph_epi64): Likewise.
11523 (_mm512_maskz_cvttph_epi64): Likewise.
11524 (_mm512_cvtt_roundph_epi64): Likewise.
11525 (_mm512_mask_cvtt_roundph_epi64): Likewise.
11526 (_mm512_maskz_cvtt_roundph_epi64): Likewise.
11527 (_mm512_cvttph_epu64): Likewise.
11528 (_mm512_mask_cvttph_epu64): Likewise.
11529 (_mm512_maskz_cvttph_epu64): Likewise.
11530 (_mm512_cvtt_roundph_epu64): Likewise.
11531 (_mm512_mask_cvtt_roundph_epu64): Likewise.
11532 (_mm512_maskz_cvtt_roundph_epu64): Likewise.
11533 (_mm512_cvttph_epi16): Likewise.
11534 (_mm512_mask_cvttph_epi16): Likewise.
11535 (_mm512_maskz_cvttph_epi16): Likewise.
11536 (_mm512_cvtt_roundph_epi16): Likewise.
11537 (_mm512_mask_cvtt_roundph_epi16): Likewise.
11538 (_mm512_maskz_cvtt_roundph_epi16): Likewise.
11539 (_mm512_cvttph_epu16): Likewise.
11540 (_mm512_mask_cvttph_epu16): Likewise.
11541 (_mm512_maskz_cvttph_epu16): Likewise.
11542 (_mm512_cvtt_roundph_epu16): Likewise.
11543 (_mm512_mask_cvtt_roundph_epu16): Likewise.
11544 (_mm512_maskz_cvtt_roundph_epu16): Likewise.
11545 * config/i386/avx512fp16vlintrin.h (_mm_cvttph_epi32):
11547 (_mm_mask_cvttph_epi32): Likewise.
11548 (_mm_maskz_cvttph_epi32): Likewise.
11549 (_mm256_cvttph_epi32): Likewise.
11550 (_mm256_mask_cvttph_epi32): Likewise.
11551 (_mm256_maskz_cvttph_epi32): Likewise.
11552 (_mm_cvttph_epu32): Likewise.
11553 (_mm_mask_cvttph_epu32): Likewise.
11554 (_mm_maskz_cvttph_epu32): Likewise.
11555 (_mm256_cvttph_epu32): Likewise.
11556 (_mm256_mask_cvttph_epu32): Likewise.
11557 (_mm256_maskz_cvttph_epu32): Likewise.
11558 (_mm_cvttph_epi64): Likewise.
11559 (_mm_mask_cvttph_epi64): Likewise.
11560 (_mm_maskz_cvttph_epi64): Likewise.
11561 (_mm256_cvttph_epi64): Likewise.
11562 (_mm256_mask_cvttph_epi64): Likewise.
11563 (_mm256_maskz_cvttph_epi64): Likewise.
11564 (_mm_cvttph_epu64): Likewise.
11565 (_mm_mask_cvttph_epu64): Likewise.
11566 (_mm_maskz_cvttph_epu64): Likewise.
11567 (_mm256_cvttph_epu64): Likewise.
11568 (_mm256_mask_cvttph_epu64): Likewise.
11569 (_mm256_maskz_cvttph_epu64): Likewise.
11570 (_mm_cvttph_epi16): Likewise.
11571 (_mm_mask_cvttph_epi16): Likewise.
11572 (_mm_maskz_cvttph_epi16): Likewise.
11573 (_mm256_cvttph_epi16): Likewise.
11574 (_mm256_mask_cvttph_epi16): Likewise.
11575 (_mm256_maskz_cvttph_epi16): Likewise.
11576 (_mm_cvttph_epu16): Likewise.
11577 (_mm_mask_cvttph_epu16): Likewise.
11578 (_mm_maskz_cvttph_epu16): Likewise.
11579 (_mm256_cvttph_epu16): Likewise.
11580 (_mm256_mask_cvttph_epu16): Likewise.
11581 (_mm256_maskz_cvttph_epu16): Likewise.
11582 * config/i386/i386-builtin.def: Add new builtins.
11583 * config/i386/sse.md
11584 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name><round_saeonly_name>):
11586 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name>): Ditto.
11587 (*avx512fp16_fix<fixunssuffix>_trunc<mode>2_load<mask_name>): Ditto.
11588 (avx512fp16_fix<fixunssuffix>_truncv2di2<mask_name>): Ditto.
11589 (avx512fp16_fix<fixunssuffix>_truncv2di2_load<mask_name>): Ditto.
11591 2021-09-17 liuhongt <hongtao.liu@intel.com>
11593 * config/i386/avx512fp16intrin.h (_mm_cvtsh_i32): New intrinsic.
11594 (_mm_cvtsh_u32): Likewise.
11595 (_mm_cvt_roundsh_i32): Likewise.
11596 (_mm_cvt_roundsh_u32): Likewise.
11597 (_mm_cvtsh_i64): Likewise.
11598 (_mm_cvtsh_u64): Likewise.
11599 (_mm_cvt_roundsh_i64): Likewise.
11600 (_mm_cvt_roundsh_u64): Likewise.
11601 (_mm_cvti32_sh): Likewise.
11602 (_mm_cvtu32_sh): Likewise.
11603 (_mm_cvt_roundi32_sh): Likewise.
11604 (_mm_cvt_roundu32_sh): Likewise.
11605 (_mm_cvti64_sh): Likewise.
11606 (_mm_cvtu64_sh): Likewise.
11607 (_mm_cvt_roundi64_sh): Likewise.
11608 (_mm_cvt_roundu64_sh): Likewise.
11609 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11610 * config/i386/i386-builtin.def: Add corresponding new builtins.
11611 * config/i386/i386-expand.c (ix86_expand_round_builtin):
11612 Handle new builtin types.
11613 * config/i386/sse.md
11614 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix><round_name>):
11616 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix>_2): Likewise.
11617 (avx512fp16_vcvt<floatsuffix>si2sh<rex64namesuffix><round_name>): Likewise.
11619 2021-09-16 Bill Schmidt <wschmidt@linux.ibm.com>
11621 * config/rs6000/rs6000-c.c (rs6000-builtins.h): New include.
11622 (altivec_resolve_new_overloaded_builtin): New forward decl.
11623 (rs6000_new_builtin_type_compatible): New function.
11624 (altivec_resolve_overloaded_builtin): Call
11625 altivec_resolve_new_overloaded_builtin.
11626 (altivec_build_new_resolved_builtin): New function.
11627 (altivec_resolve_new_overloaded_builtin): Likewise.
11628 * config/rs6000/rs6000-call.c (rs6000_new_builtin_is_supported):
11630 * config/rs6000/rs6000-gen-builtins.c (write_decls): Remove _p from
11631 name of rs6000_new_builtin_is_supported.
11633 2021-09-16 Uroš Bizjak <ubizjak@gmail.com>
11635 * config/i386/i386-protos.h (ix86_decompose_address):
11636 Change return type to bool.
11637 * config/i386/i386.c (ix86_decompose_address): Ditto.
11639 2021-09-16 Tobias Burnus <tobias@codesourcery.com>
11642 * config/rs6000/t-rs6000 (build/rs6000-gen-builtins.o, build/rbtree.o):
11643 Add 'build/' to target; use build/%.o rule.
11644 (build/rs6000-gen-builtins$(build_exeext)): Add 'build/' and
11645 '$(build_exeext)' to target and 'build/' for the *.o files.
11646 (rs6000-builtins.c): Update for those changes; run rs6000-gen-builtins
11649 2021-09-16 Martin Jambor <mjambor@suse.cz>
11651 * cgraph.c (cgraph_node::dump): Do not check caller count sums if
11652 the body has been removed. Remove trailing whitespace.
11654 2021-09-16 Richard Biener <rguenther@suse.de>
11656 PR middle-end/102360
11657 * internal-fn.c (expand_DEFERRED_INIT): Make pattern-init
11658 of non-memory more robust.
11660 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11662 * config/sparc/sparc-opts.h (enum sparc_processor_type): Add LEON5.
11663 * config/sparc/sparc.c (struct processor_costs): Add LEON5 costs.
11664 (leon5_adjust_cost): Increase cost of store with data dependency
11665 on ALU instruction and FPU anti-dependencies.
11666 (sparc_option_override): Add LEON5 costs.
11667 (sparc_adjust_cost): Add LEON5 cost adjustments.
11668 * config/sparc/sparc.h: Add LEON5.
11669 * config/sparc/sparc.md: Include LEON5 scheduling information.
11670 * config/sparc/sparc.opt: Add LEON5.
11671 * doc/invoke.texi: Add LEON5.
11672 * config/sparc/leon5.md: New file.
11674 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11676 * config/sparc/sparc.md (stack_protect_set32): Add NOP to prevent
11677 sensitive sequence for B2BST errata workaround.
11679 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11681 * config/sparc/sparc.c (sparc_do_work_around_errata): Do not begin
11682 functions with atomic instruction in the UT700 errata workaround.
11684 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11686 * config/sparc/sparc.c (next_active_non_empty_insn): New function
11687 that returns the next active non-empty assembly instruction.
11688 (sparc_do_work_around_errata): Use new function.
11690 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11692 * config/sparc/sparc.c (store_insn_p): Add predicate for store
11694 (load_insn_p): Add predicate for load attributes.
11695 (sparc_do_work_around_errata): Use new predicates.
11697 2021-09-16 Andreas Larsson <andreas@gaisler.com>
11699 * config/sparc/sparc.c (dump_target_flag_bits): Print bit names for
11702 2021-09-16 Martin Liska <mliska@suse.cz>
11704 * config/mips/netbsd.h: Fix typo in name of a macro.
11706 2021-09-16 liuhongt <hongtao.liu@intel.com>
11708 PR middle-end/102080
11709 * match.pd: Check mask type when doing cond_op related gimple
11711 * tree.c (is_truth_type_for): New function.
11712 * tree.h (is_truth_type_for): New declaration.
11714 2021-09-16 liuhongt <hongtao.liu@intel.com>
11716 * config/i386/avx512fp16intrin.h (_mm512_cvtepi32_ph): New
11718 (_mm512_mask_cvtepi32_ph): Likewise.
11719 (_mm512_maskz_cvtepi32_ph): Likewise.
11720 (_mm512_cvt_roundepi32_ph): Likewise.
11721 (_mm512_mask_cvt_roundepi32_ph): Likewise.
11722 (_mm512_maskz_cvt_roundepi32_ph): Likewise.
11723 (_mm512_cvtepu32_ph): Likewise.
11724 (_mm512_mask_cvtepu32_ph): Likewise.
11725 (_mm512_maskz_cvtepu32_ph): Likewise.
11726 (_mm512_cvt_roundepu32_ph): Likewise.
11727 (_mm512_mask_cvt_roundepu32_ph): Likewise.
11728 (_mm512_maskz_cvt_roundepu32_ph): Likewise.
11729 (_mm512_cvtepi64_ph): Likewise.
11730 (_mm512_mask_cvtepi64_ph): Likewise.
11731 (_mm512_maskz_cvtepi64_ph): Likewise.
11732 (_mm512_cvt_roundepi64_ph): Likewise.
11733 (_mm512_mask_cvt_roundepi64_ph): Likewise.
11734 (_mm512_maskz_cvt_roundepi64_ph): Likewise.
11735 (_mm512_cvtepu64_ph): Likewise.
11736 (_mm512_mask_cvtepu64_ph): Likewise.
11737 (_mm512_maskz_cvtepu64_ph): Likewise.
11738 (_mm512_cvt_roundepu64_ph): Likewise.
11739 (_mm512_mask_cvt_roundepu64_ph): Likewise.
11740 (_mm512_maskz_cvt_roundepu64_ph): Likewise.
11741 (_mm512_cvtepi16_ph): Likewise.
11742 (_mm512_mask_cvtepi16_ph): Likewise.
11743 (_mm512_maskz_cvtepi16_ph): Likewise.
11744 (_mm512_cvt_roundepi16_ph): Likewise.
11745 (_mm512_mask_cvt_roundepi16_ph): Likewise.
11746 (_mm512_maskz_cvt_roundepi16_ph): Likewise.
11747 (_mm512_cvtepu16_ph): Likewise.
11748 (_mm512_mask_cvtepu16_ph): Likewise.
11749 (_mm512_maskz_cvtepu16_ph): Likewise.
11750 (_mm512_cvt_roundepu16_ph): Likewise.
11751 (_mm512_mask_cvt_roundepu16_ph): Likewise.
11752 (_mm512_maskz_cvt_roundepu16_ph): Likewise.
11753 * config/i386/avx512fp16vlintrin.h (_mm_cvtepi32_ph): New
11755 (_mm_mask_cvtepi32_ph): Likewise.
11756 (_mm_maskz_cvtepi32_ph): Likewise.
11757 (_mm256_cvtepi32_ph): Likewise.
11758 (_mm256_mask_cvtepi32_ph): Likewise.
11759 (_mm256_maskz_cvtepi32_ph): Likewise.
11760 (_mm_cvtepu32_ph): Likewise.
11761 (_mm_mask_cvtepu32_ph): Likewise.
11762 (_mm_maskz_cvtepu32_ph): Likewise.
11763 (_mm256_cvtepu32_ph): Likewise.
11764 (_mm256_mask_cvtepu32_ph): Likewise.
11765 (_mm256_maskz_cvtepu32_ph): Likewise.
11766 (_mm_cvtepi64_ph): Likewise.
11767 (_mm_mask_cvtepi64_ph): Likewise.
11768 (_mm_maskz_cvtepi64_ph): Likewise.
11769 (_mm256_cvtepi64_ph): Likewise.
11770 (_mm256_mask_cvtepi64_ph): Likewise.
11771 (_mm256_maskz_cvtepi64_ph): Likewise.
11772 (_mm_cvtepu64_ph): Likewise.
11773 (_mm_mask_cvtepu64_ph): Likewise.
11774 (_mm_maskz_cvtepu64_ph): Likewise.
11775 (_mm256_cvtepu64_ph): Likewise.
11776 (_mm256_mask_cvtepu64_ph): Likewise.
11777 (_mm256_maskz_cvtepu64_ph): Likewise.
11778 (_mm_cvtepi16_ph): Likewise.
11779 (_mm_mask_cvtepi16_ph): Likewise.
11780 (_mm_maskz_cvtepi16_ph): Likewise.
11781 (_mm256_cvtepi16_ph): Likewise.
11782 (_mm256_mask_cvtepi16_ph): Likewise.
11783 (_mm256_maskz_cvtepi16_ph): Likewise.
11784 (_mm_cvtepu16_ph): Likewise.
11785 (_mm_mask_cvtepu16_ph): Likewise.
11786 (_mm_maskz_cvtepu16_ph): Likewise.
11787 (_mm256_cvtepu16_ph): Likewise.
11788 (_mm256_mask_cvtepu16_ph): Likewise.
11789 (_mm256_maskz_cvtepu16_ph): Likewise.
11790 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11791 * config/i386/i386-builtin.def: Add corresponding new builtins.
11792 * config/i386/i386-expand.c
11793 (ix86_expand_args_builtin): Handle new builtin types.
11794 (ix86_expand_round_builtin): Ditto.
11795 * config/i386/i386-modes.def: Declare V2HF and V6HF.
11796 * config/i386/sse.md (VI2H_AVX512VL): New.
11797 (qq2phsuff): Ditto.
11798 (sseintvecmode): Add HF vector modes.
11799 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode><mask_name><round_name>):
11801 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
11802 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
11803 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
11804 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
11805 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask_1): Ditto.
11806 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
11807 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
11808 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
11809 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
11810 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask_1): Ditto.
11811 * config/i386/subst.md (round_qq2phsuff): New subst_attr.
11813 2021-09-16 liuhongt <hongtao.liu@intel.com>
11815 * config/i386/avx512fp16intrin.h (_mm512_cvtph_epi32):
11817 (_mm512_mask_cvtph_epi32): Likewise.
11818 (_mm512_maskz_cvtph_epi32): Likewise.
11819 (_mm512_cvt_roundph_epi32): Likewise.
11820 (_mm512_mask_cvt_roundph_epi32): Likewise.
11821 (_mm512_maskz_cvt_roundph_epi32): Likewise.
11822 (_mm512_cvtph_epu32): Likewise.
11823 (_mm512_mask_cvtph_epu32): Likewise.
11824 (_mm512_maskz_cvtph_epu32): Likewise.
11825 (_mm512_cvt_roundph_epu32): Likewise.
11826 (_mm512_mask_cvt_roundph_epu32): Likewise.
11827 (_mm512_maskz_cvt_roundph_epu32): Likewise.
11828 (_mm512_cvtph_epi64): Likewise.
11829 (_mm512_mask_cvtph_epi64): Likewise.
11830 (_mm512_maskz_cvtph_epi64): Likewise.
11831 (_mm512_cvt_roundph_epi64): Likewise.
11832 (_mm512_mask_cvt_roundph_epi64): Likewise.
11833 (_mm512_maskz_cvt_roundph_epi64): Likewise.
11834 (_mm512_cvtph_epu64): Likewise.
11835 (_mm512_mask_cvtph_epu64): Likewise.
11836 (_mm512_maskz_cvtph_epu64): Likewise.
11837 (_mm512_cvt_roundph_epu64): Likewise.
11838 (_mm512_mask_cvt_roundph_epu64): Likewise.
11839 (_mm512_maskz_cvt_roundph_epu64): Likewise.
11840 (_mm512_cvtph_epi16): Likewise.
11841 (_mm512_mask_cvtph_epi16): Likewise.
11842 (_mm512_maskz_cvtph_epi16): Likewise.
11843 (_mm512_cvt_roundph_epi16): Likewise.
11844 (_mm512_mask_cvt_roundph_epi16): Likewise.
11845 (_mm512_maskz_cvt_roundph_epi16): Likewise.
11846 (_mm512_cvtph_epu16): Likewise.
11847 (_mm512_mask_cvtph_epu16): Likewise.
11848 (_mm512_maskz_cvtph_epu16): Likewise.
11849 (_mm512_cvt_roundph_epu16): Likewise.
11850 (_mm512_mask_cvt_roundph_epu16): Likewise.
11851 (_mm512_maskz_cvt_roundph_epu16): Likewise.
11852 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_epi32):
11854 (_mm_mask_cvtph_epi32): Likewise.
11855 (_mm_maskz_cvtph_epi32): Likewise.
11856 (_mm256_cvtph_epi32): Likewise.
11857 (_mm256_mask_cvtph_epi32): Likewise.
11858 (_mm256_maskz_cvtph_epi32): Likewise.
11859 (_mm_cvtph_epu32): Likewise.
11860 (_mm_mask_cvtph_epu32): Likewise.
11861 (_mm_maskz_cvtph_epu32): Likewise.
11862 (_mm256_cvtph_epu32): Likewise.
11863 (_mm256_mask_cvtph_epu32): Likewise.
11864 (_mm256_maskz_cvtph_epu32): Likewise.
11865 (_mm_cvtph_epi64): Likewise.
11866 (_mm_mask_cvtph_epi64): Likewise.
11867 (_mm_maskz_cvtph_epi64): Likewise.
11868 (_mm256_cvtph_epi64): Likewise.
11869 (_mm256_mask_cvtph_epi64): Likewise.
11870 (_mm256_maskz_cvtph_epi64): Likewise.
11871 (_mm_cvtph_epu64): Likewise.
11872 (_mm_mask_cvtph_epu64): Likewise.
11873 (_mm_maskz_cvtph_epu64): Likewise.
11874 (_mm256_cvtph_epu64): Likewise.
11875 (_mm256_mask_cvtph_epu64): Likewise.
11876 (_mm256_maskz_cvtph_epu64): Likewise.
11877 (_mm_cvtph_epi16): Likewise.
11878 (_mm_mask_cvtph_epi16): Likewise.
11879 (_mm_maskz_cvtph_epi16): Likewise.
11880 (_mm256_cvtph_epi16): Likewise.
11881 (_mm256_mask_cvtph_epi16): Likewise.
11882 (_mm256_maskz_cvtph_epi16): Likewise.
11883 (_mm_cvtph_epu16): Likewise.
11884 (_mm_mask_cvtph_epu16): Likewise.
11885 (_mm_maskz_cvtph_epu16): Likewise.
11886 (_mm256_cvtph_epu16): Likewise.
11887 (_mm256_mask_cvtph_epu16): Likewise.
11888 (_mm256_maskz_cvtph_epu16): Likewise.
11889 * config/i386/i386-builtin-types.def: Add new builtin types.
11890 * config/i386/i386-builtin.def: Add new builtins.
11891 * config/i386/i386-expand.c
11892 (ix86_expand_args_builtin): Handle new builtin types.
11893 (ix86_expand_round_builtin): Ditto.
11894 * config/i386/sse.md (sseintconvert): New.
11895 (ssePHmode): Ditto.
11896 (UNSPEC_US_FIX_NOTRUNC): Ditto.
11897 (sseintconvertsignprefix): Ditto.
11898 (avx512fp16_vcvtph2<sseintconvertsignprefix><sseintconvert>_<mode><mask_name><round_name>):
11901 2021-09-16 liuhongt <hongtao.liu@intel.com>
11903 * config/i386/avx512fp16intrin.h (_mm_cvtsi16_si128): New intrinsic.
11905 (_mm_cvtsi128_si16): Likewise.
11906 (_mm_mask_load_sh): Likewise.
11907 (_mm_maskz_load_sh): Likewise.
11908 (_mm_mask_store_sh): Likewise.
11909 (_mm_move_sh): Likewise.
11910 (_mm_mask_move_sh): Likewise.
11911 (_mm_maskz_move_sh): Likewise.
11912 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11913 * config/i386/i386-builtin.def: Add corresponding new builtins.
11914 * config/i386/i386-expand.c
11915 (ix86_expand_special_args_builtin): Handle new builtin types.
11916 (ix86_expand_vector_init_one_nonzero): Adjust for FP16 target.
11917 * config/i386/sse.md (VI2F): New mode iterator.
11918 (vec_set<mode>_0): Use new mode iterator.
11919 (avx512f_mov<ssescalarmodelower>_mask): Adjust for HF vector mode.
11920 (avx512f_store<mode>_mask): Ditto.
11922 2021-09-16 Kewen Lin <linkw@linux.ibm.com>
11924 * config/rs6000/rs6000.opt (-mtoc-fusion): Remove.
11926 2021-09-15 David Edelsohn <dje.gcc@gmail.com>
11928 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
11929 Proceed if no symbol summary or the symbol alias flag is false.
11931 2021-09-15 Jakub Jelinek <jakub@redhat.com>
11935 * varasm.c (output_constructor_regular_field): Instead of assertion
11936 that array_size_for_constructor result is equal to size of
11937 TREE_TYPE (local->val) in bytes, assert that the type size is greater
11938 or equal to array_size_for_constructor result and use type size as
11941 2021-09-15 Martin Liska <mliska@suse.cz>
11944 * config/i386/vxworks.h: Use new macro TARGET_CPU_P.
11946 2021-09-15 Martin Liska <mliska@suse.cz>
11949 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
11950 Check that we have a symbol summary for a symbol.
11952 2021-09-15 Richard Biener <rguenther@suse.de>
11955 * config/rs6000/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
11956 to inherit from elfos.h.
11958 2021-09-15 liuhongt <hongtao.liu@intel.com>
11961 * config/i386/i386-expand.c
11962 (ix86_expand_vector_init_interleave): Use punpcklwd to pack 2
11964 (ix86_expand_vector_set): Use blendw instead of pinsrw.
11965 * config/i386/i386.c (ix86_can_change_mode_class): Adjust for
11966 AVX512FP16 which supports 16bit vector load.
11967 * config/i386/sse.md (avx512bw_interleave_highv32hi<mask_name>): Rename to ..
11969 (avx512bw_interleave_high<mode><mask_name>): .. this, and
11970 extend to V32HFmode.
11971 (avx2_interleave_highv16hi<mask_name>): Rename to ..
11972 (avx2_interleave_high<mode><mask_name>): .. this, and extend to V16HFmode.
11974 (vec_interleave_highv8hi<mask_name>): Rename to ..
11975 (vec_interleave_high<mode><mask_name>): .. this, and extend to V8HFmode.
11976 (<mask_codefor>avx512bw_interleave_lowv32hi<mask_name>): Rename to ..
11978 (<mask_codefor>avx512bw_interleave_low<mode><mask_name>):
11979 .. this, and extend to V32HFmode.
11980 (avx2_interleave_lowv16hi<mask_name>): Rename to ..
11981 (avx2_interleave_low<mode><mask_name>): .. this, and extend to V16HFmode.
11982 (vec_interleave_lowv8hi<mask_name>): Rename to ..
11983 (vec_interleave_low<mode><mask_name>): .. this, and extend to V8HFmode.
11984 (sse4_1_pblendw): Rename to ..
11985 (sse4_1_pblend<blendsuf>): .. this, and extend to V8HFmode.
11986 (avx2_pblendph): New define_expand.
11987 (<sse2p4_1>_pinsr<ssemodesuffix>): Refactor, use
11988 sseintmodesuffix instead of ssemodesuffix.
11989 (blendsuf): New mode attr.
11991 2021-09-15 Richard Biener <rguenther@suse.de>
11993 * tree-vectorizer.h (dr_misalignment): Move out of line.
11994 (dr_target_alignment): New.
11995 (DR_TARGET_ALIGNMENT): Wrap dr_target_alignment.
11996 (set_dr_target_alignment): New.
11997 (SET_DR_TARGET_ALIGNMENT): Wrap set_dr_target_alignment.
11998 * tree-vect-data-refs.c (dr_misalignment): Compute and
11999 return the group members misalignment.
12000 (vect_compute_data_ref_alignment): Use SET_DR_TARGET_ALIGNMENT.
12001 (vect_analyze_data_refs_alignment): Compute alignment only
12002 for the first element of a DR group.
12003 (vect_slp_analyze_node_alignment): Likewise.
12005 2021-09-15 Hongyu Wang <hongyu.wang@intel.com>
12007 * config/i386/avx512fp16intrin.h: Adjust all builtin calls.
12008 * config/i386/avx512fp16vlintrin.h: Likewise.
12009 * config/i386/i386-builtin.def: Adjust builtin name and
12010 enumeration to match AVX512F style.
12012 2021-09-15 Richard Biener <rguenther@suse.de>
12014 PR tree-optimization/102318
12015 * tree-vect-loop.c (vect_transform_cycle_phi): Revert
12016 previous change and do the mode conversion separately from
12017 the sign conversion.
12019 2021-09-15 Hongtao Liu <hongtao.liu@intel.com>
12020 Peter Cordes <peter@cordes.ca>
12023 * config/i386/sse.md (extract_suf): Add V8SF/V8SI/V4DF/V4DI.
12024 (*vec_extract<mode><ssescalarmodelower>_valign): Output
12025 vextract{i,f}{32x4,64x2} instruction when byte_offset % 16 ==
12028 2021-09-15 Richard Biener <rguenther@suse.de>
12030 * config.gcc: Remove vax-*-openbsd* configuration.
12032 2021-09-15 Richard Biener <rguenther@suse.de>
12034 * config.gcc: Remove m68k-openbsd.
12036 2021-09-15 Max Filippov <jcmvbkbc@gmail.com>
12039 * config/xtensa/t-xtensa (TM_H): Add include/xtensa-config.h.
12041 2021-09-14 Peter Bergner <bergner@linux.ibm.com>
12043 * config/rs6000/mma.md (unspec): Delete UNSPEC_MMA_XXSETACCZ.
12044 (unspecv): Add UNSPECV_MMA_XXSETACCZ.
12045 (*mma_xxsetaccz): Delete.
12046 (mma_xxsetaccz): Change to define_insn. Remove operand 1.
12047 Use UNSPECV_MMA_XXSETACCZ. Update comment.
12048 * config/rs6000/rs6000.c (rs6000_rtx_costs): Use UNSPECV_MMA_XXSETACCZ.
12050 2021-09-14 Iain Sandoe <iain@sandoe.co.uk>
12052 * Makefile.in: Remove variables related to applying no-PIE
12053 to the exes on $build.
12054 * configure: Regenerate.
12055 * configure.ac: Remove configuration related to applying
12056 no-PIE to the exes on $build.
12058 2021-09-14 Claudiu Zissulescu <claziss@synopsys.com>
12060 * config/arc/arc.md (doloop_end): Add missing mode.
12061 (loop_end): Likewise.
12063 2021-09-14 Jakub Jelinek <jakub@redhat.com>
12065 * gimplify.c (goa_stabilize_expr): Add depth argument, propagate
12066 it to recursive calls, for depth above 7 just gimplify or return.
12067 Perform a test even for MODIFY_EXPR, ADDR_EXPR, COMPOUND_EXPR with
12068 __builtin_clear_padding and TARGET_EXPR.
12069 (gimplify_omp_atomic): Adjust goa_stabilize_expr callers.
12071 2021-09-14 liuhongt <hongtao.liu@intel.com>
12073 * config/i386/avx512fp16intrin.h (_mm_fpclass_sh_mask): New intrinsic.
12075 (_mm_mask_fpclass_sh_mask): Likewise.
12076 (_mm512_mask_fpclass_ph_mask): Likewise.
12077 (_mm512_fpclass_ph_mask): Likewise.
12078 (_mm_getexp_sh): Likewise.
12079 (_mm_mask_getexp_sh): Likewise.
12080 (_mm_maskz_getexp_sh): Likewise.
12081 (_mm512_getexp_ph): Likewise.
12082 (_mm512_mask_getexp_ph): Likewise.
12083 (_mm512_maskz_getexp_ph): Likewise.
12084 (_mm_getexp_round_sh): Likewise.
12085 (_mm_mask_getexp_round_sh): Likewise.
12086 (_mm_maskz_getexp_round_sh): Likewise.
12087 (_mm512_getexp_round_ph): Likewise.
12088 (_mm512_mask_getexp_round_ph): Likewise.
12089 (_mm512_maskz_getexp_round_ph): Likewise.
12090 (_mm_getmant_sh): Likewise.
12091 (_mm_mask_getmant_sh): Likewise.
12092 (_mm_maskz_getmant_sh): Likewise.
12093 (_mm512_getmant_ph): Likewise.
12094 (_mm512_mask_getmant_ph): Likewise.
12095 (_mm512_maskz_getmant_ph): Likewise.
12096 (_mm_getmant_round_sh): Likewise.
12097 (_mm_mask_getmant_round_sh): Likewise.
12098 (_mm_maskz_getmant_round_sh): Likewise.
12099 (_mm512_getmant_round_ph): Likewise.
12100 (_mm512_mask_getmant_round_ph): Likewise.
12101 (_mm512_maskz_getmant_round_ph): Likewise.
12102 * config/i386/avx512fp16vlintrin.h (_mm_mask_fpclass_ph_mask): New intrinsic.
12104 (_mm_fpclass_ph_mask): Likewise.
12105 (_mm256_mask_fpclass_ph_mask): Likewise.
12106 (_mm256_fpclass_ph_mask): Likewise.
12107 (_mm256_getexp_ph): Likewise.
12108 (_mm256_mask_getexp_ph): Likewise.
12109 (_mm256_maskz_getexp_ph): Likewise.
12110 (_mm_getexp_ph): Likewise.
12111 (_mm_mask_getexp_ph): Likewise.
12112 (_mm_maskz_getexp_ph): Likewise.
12113 (_mm256_getmant_ph): Likewise.
12114 (_mm256_mask_getmant_ph): Likewise.
12115 (_mm256_maskz_getmant_ph): Likewise.
12116 (_mm_getmant_ph): Likewise.
12117 (_mm_mask_getmant_ph): Likewise.
12118 (_mm_maskz_getmant_ph): Likewise.
12119 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12120 * config/i386/i386-builtin.def: Add corresponding new builtins.
12121 * config/i386/i386-expand.c
12122 (ix86_expand_args_builtin): Handle new builtin types.
12123 (ix86_expand_round_builtin): Ditto.
12124 * config/i386/sse.md (vecmemsuffix): Add HF vector modes.
12125 (<avx512>_getexp<mode><mask_name><round_saeonly_name>): Adjust
12126 to support HF vector modes.
12127 (avx512f_sgetexp<mode><mask_scalar_name><round_saeonly_scalar_name>): Ditto.
12129 (avx512dq_fpclass<mode><mask_scalar_merge_name>): Ditto.
12130 (avx512dq_vmfpclass<mode><mask_scalar_merge_name>): Ditto.
12131 (<avx512>_getmant<mode><mask_name><round_saeonly_name>): Ditto.
12132 (avx512f_vgetmant<mode><mask_scalar_name><round_saeonly_scalar_name>): Ditto.
12135 2021-09-14 liuhongt <hongtao.liu@intel.com>
12137 * config/i386/avx512fp16intrin.h (_mm512_reduce_ph): New intrinsic.
12139 (_mm512_mask_reduce_ph): Likewise.
12140 (_mm512_maskz_reduce_ph): Likewise.
12141 (_mm512_reduce_round_ph): Likewise.
12142 (_mm512_mask_reduce_round_ph): Likewise.
12143 (_mm512_maskz_reduce_round_ph): Likewise.
12144 (_mm_reduce_sh): Likewise.
12145 (_mm_mask_reduce_sh): Likewise.
12146 (_mm_maskz_reduce_sh): Likewise.
12147 (_mm_reduce_round_sh): Likewise.
12148 (_mm_mask_reduce_round_sh): Likewise.
12149 (_mm_maskz_reduce_round_sh): Likewise.
12150 (_mm512_roundscale_ph): Likewise.
12151 (_mm512_mask_roundscale_ph): Likewise.
12152 (_mm512_maskz_roundscale_ph): Likewise.
12153 (_mm512_roundscale_round_ph): Likewise.
12154 (_mm512_mask_roundscale_round_ph): Likewise.
12155 (_mm512_maskz_roundscale_round_ph): Likewise.
12156 (_mm_roundscale_sh): Likewise.
12157 (_mm_mask_roundscale_sh): Likewise.
12158 (_mm_maskz_roundscale_sh): Likewise.
12159 (_mm_roundscale_round_sh): Likewise.
12160 (_mm_mask_roundscale_round_sh): Likewise.
12161 (_mm_maskz_roundscale_round_sh): Likewise.
12162 * config/i386/avx512fp16vlintrin.h (_mm_reduce_ph): New intrinsic.
12164 (_mm_mask_reduce_ph): Likewise.
12165 (_mm_maskz_reduce_ph): Likewise.
12166 (_mm256_reduce_ph): Likewise.
12167 (_mm256_mask_reduce_ph): Likewise.
12168 (_mm256_maskz_reduce_ph): Likewise.
12169 (_mm_roundscale_ph): Likewise.
12170 (_mm_mask_roundscale_ph): Likewise.
12171 (_mm_maskz_roundscale_ph): Likewise.
12172 (_mm256_roundscale_ph): Likewise.
12173 (_mm256_mask_roundscale_ph): Likewise.
12174 (_mm256_maskz_roundscale_ph): Likewise.
12175 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12176 * config/i386/i386-builtin.def: Add corresponding new builtins.
12177 * config/i386/i386-expand.c
12178 (ix86_expand_args_builtin): Handle new builtin types.
12179 (ix86_expand_round_builtin): Ditto.
12180 * config/i386/sse.md (<mask_codefor>reducep<mode><mask_name>):
12182 (<mask_codefor>reducep<mode><mask_name><round_saeonly_name>):
12183 ... this, and adjust for round operands.
12184 (reduces<mode><mask_scalar_name>): Likewise, with ...
12185 (reduces<mode><mask_scalar_name><round_saeonly_scalar_name>):
12187 (<avx512>_rndscale<mode><mask_name><round_saeonly_name>):
12188 Adjust for HF vector modes.
12189 (avx512f_rndscale<mode><mask_scalar_name><round_saeonly_scalar_name>): Ditto.
12191 (*avx512f_rndscale<mode><round_saeonly_name>): Ditto.
12193 2021-09-14 liuhongt <hongtao.liu@intel.com>
12195 * config/i386/avx512fp16intrin.h (_mm512_rcp_ph): New intrinsic.
12197 (_mm512_mask_rcp_ph): Likewise.
12198 (_mm512_maskz_rcp_ph): Likewise.
12199 (_mm_rcp_sh): Likewise.
12200 (_mm_mask_rcp_sh): Likewise.
12201 (_mm_maskz_rcp_sh): Likewise.
12202 (_mm512_scalef_ph): Likewise.
12203 (_mm512_mask_scalef_ph): Likewise.
12204 (_mm512_maskz_scalef_ph): Likewise.
12205 (_mm512_scalef_round_ph): Likewise.
12206 (_mm512_mask_scalef_round_ph): Likewise.
12207 (_mm512_maskz_scalef_round_ph): Likewise.
12208 (_mm_scalef_sh): Likewise.
12209 (_mm_mask_scalef_sh): Likewise.
12210 (_mm_maskz_scalef_sh): Likewise.
12211 (_mm_scalef_round_sh): Likewise.
12212 (_mm_mask_scalef_round_sh): Likewise.
12213 (_mm_maskz_scalef_round_sh): Likewise.
12214 * config/i386/avx512fp16vlintrin.h (_mm_rcp_ph): New intrinsic.
12216 (_mm256_rcp_ph): Likewise.
12217 (_mm_mask_rcp_ph): Likewise.
12218 (_mm256_mask_rcp_ph): Likewise.
12219 (_mm_maskz_rcp_ph): Likewise.
12220 (_mm256_maskz_rcp_ph): Likewise.
12221 (_mm_scalef_ph): Likewise.
12222 (_mm256_scalef_ph): Likewise.
12223 (_mm_mask_scalef_ph): Likewise.
12224 (_mm256_mask_scalef_ph): Likewise.
12225 (_mm_maskz_scalef_ph): Likewise.
12226 (_mm256_maskz_scalef_ph): Likewise.
12227 * config/i386/i386-builtin.def: Add new builtins.
12228 * config/i386/sse.md (VFH_AVX512VL): New.
12229 (avx512fp16_rcp<mode>2<mask_name>): Ditto.
12230 (avx512fp16_vmrcpv8hf2<mask_scalar_name>): Ditto.
12231 (avx512f_vmscalef<mode><mask_scalar_name><round_scalar_name>):
12232 Adjust to support HF vector modes.
12233 (<avx512>_scalef<mode><mask_name><round_name>): Ditto.
12235 2021-09-14 liuhongt <hongtao.liu@intel.com>
12237 * config/i386/avx512fp16intrin.h (_mm512_sqrt_ph): New intrinsic.
12239 (_mm512_mask_sqrt_ph): Likewise.
12240 (_mm512_maskz_sqrt_ph): Likewise.
12241 (_mm512_sqrt_round_ph): Likewise.
12242 (_mm512_mask_sqrt_round_ph): Likewise.
12243 (_mm512_maskz_sqrt_round_ph): Likewise.
12244 (_mm512_rsqrt_ph): Likewise.
12245 (_mm512_mask_rsqrt_ph): Likewise.
12246 (_mm512_maskz_rsqrt_ph): Likewise.
12247 (_mm_rsqrt_sh): Likewise.
12248 (_mm_mask_rsqrt_sh): Likewise.
12249 (_mm_maskz_rsqrt_sh): Likewise.
12250 (_mm_sqrt_sh): Likewise.
12251 (_mm_mask_sqrt_sh): Likewise.
12252 (_mm_maskz_sqrt_sh): Likewise.
12253 (_mm_sqrt_round_sh): Likewise.
12254 (_mm_mask_sqrt_round_sh): Likewise.
12255 (_mm_maskz_sqrt_round_sh): Likewise.
12256 * config/i386/avx512fp16vlintrin.h (_mm_sqrt_ph): New intrinsic.
12257 (_mm256_sqrt_ph): Likewise.
12258 (_mm_mask_sqrt_ph): Likewise.
12259 (_mm256_mask_sqrt_ph): Likewise.
12260 (_mm_maskz_sqrt_ph): Likewise.
12261 (_mm256_maskz_sqrt_ph): Likewise.
12262 (_mm_rsqrt_ph): Likewise.
12263 (_mm256_rsqrt_ph): Likewise.
12264 (_mm_mask_rsqrt_ph): Likewise.
12265 (_mm256_mask_rsqrt_ph): Likewise.
12266 (_mm_maskz_rsqrt_ph): Likewise.
12267 (_mm256_maskz_rsqrt_ph): Likewise.
12268 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12269 * config/i386/i386-builtin.def: Add corresponding new builtins.
12270 * config/i386/i386-expand.c
12271 (ix86_expand_args_builtin): Handle new builtins.
12272 (ix86_expand_round_builtin): Ditto.
12273 * config/i386/sse.md (VF_AVX512FP16VL): New.
12274 (sqrt<mode>2): Adjust for HF vector modes.
12275 (<sse>_sqrt<mode>2<mask_name><round_name>): Likewise.
12276 (<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>): Likewise.
12278 (<sse>_rsqrt<mode>2<mask_name>): New.
12279 (avx512fp16_vmrsqrtv8hf2<mask_scalar_name>): Likewise.
12281 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
12283 PR bootstrap/101574
12284 * diagnostic-spec.c (warning_suppressed_at, copy_warning): Handle
12285 'RESERVED_LOCATION_P' locations.
12286 * warning-control.cc (get_nowarn_spec, suppress_warning)
12287 (copy_warning): Likewise.
12289 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
12291 * diagnostic-spec.h (typedef xint_hash_t): Use 'location_t' instead of...
12292 (typedef key_type_t): ... this. Remove.
12293 (nowarn_map): Document.
12294 * diagnostic-spec.c (nowarn_map): Likewise.
12295 * warning-control.cc (convert_to_key): Evolve functions into...
12296 (get_location): ... these. Adjust all users.
12298 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
12300 * warning-control.cc (copy_warning): Remove 'nowarn_map' setup.
12302 2021-09-13 Jason Merrill <jason@redhat.com>
12304 * params.opt: Add destructive-interference-size and
12305 constructive-interference-size.
12306 * doc/invoke.texi: Document them.
12307 * config/aarch64/aarch64.c (aarch64_override_options_internal):
12309 * config/arm/arm.c (arm_option_override): Set them.
12310 * config/i386/i386-options.c (ix86_option_override_internal):
12313 2021-09-13 Martin Liska <mliska@suse.cz>
12314 H.J. Lu <hjl.tools@gmail.com>
12317 * common/config/i386/cpuinfo.h (cpu_indicator_init): Add support
12318 for x86-64 micro levels for __builtin_cpu_supports.
12319 * common/config/i386/i386-cpuinfo.h (enum feature_priority):
12320 Add priorities for the micro-arch levels.
12321 (enum processor_features): Add new features.
12322 * common/config/i386/i386-isas.h: Add micro-arch features.
12323 * config/i386/i386-builtins.c (get_builtin_code_for_version):
12324 Support the micro-arch levels by calling
12325 __builtin_cpu_supports.
12326 * doc/extend.texi: Document that the levels are supported by
12327 __builtin_cpu_supports.
12329 2021-09-13 Andrew Pinski <apinski@marvell.com>
12332 * config/aarch64/aarch64-builtins.c (aarch64_fold_builtin_lane_check):
12334 (aarch64_general_fold_builtin): Handle AARCH64_SIMD_BUILTIN_LANE_CHECK.
12335 (aarch64_general_gimple_fold_builtin): Likewise.
12337 2021-09-13 Andrew Pinski <apinski@marvell.com>
12339 * config.gcc: Add m32r-*-linux* and m32rle-*-linux*
12340 to the Unsupported targets list.
12341 Remove support for m32r-*-linux* and m32rle-*-linux*.
12342 * config/m32r/linux.h: Removed.
12343 * config/m32r/t-linux: Removed.
12345 2021-09-13 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
12348 * config/aarch64/aarch64.c (aarch64_classify_address): Don't allow
12349 register index for SVE predicate modes.
12351 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
12353 * tree-ssa-threadbackward.c
12354 (back_threader_profitability::profitable_path_p): Remove FSM
12356 (back_threader_registry::register_path): Same.
12357 * tree-ssa-threadedge.c
12358 (jump_threader::simplify_control_stmt_condition): Same.
12359 * tree-ssa-threadupdate.c (jt_path_registry::jt_path_registry):
12360 Add backedge_threads argument.
12361 (fwd_jt_path_registry::fwd_jt_path_registry): Pass
12362 backedge_threads argument.
12363 (back_jt_path_registry::back_jt_path_registry): Same.
12364 (dump_jump_thread_path): Adjust for FSM removal.
12365 (back_jt_path_registry::rewire_first_differing_edge): Same.
12366 (back_jt_path_registry::adjust_paths_after_duplication): Same.
12367 (back_jt_path_registry::update_cfg): Same.
12368 (jt_path_registry::register_jump_thread): Same.
12369 * tree-ssa-threadupdate.h (enum jump_thread_edge_type): Remove
12371 (class back_jt_path_registry): Add backedge_threads to
12374 2021-09-13 Martin Liska <mliska@suse.cz>
12377 * asan.h (sanitize_coverage_p): Handle when fn == NULL.
12379 2021-09-13 H.J. Lu <hjl.tools@gmail.com>
12382 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): New.
12383 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
12384 (MOVE_MAX): Check TARGET_AVX256_MOVE_BY_PIECES and
12385 TARGET_AVX256_STORE_BY_PIECES instead of
12386 TARGET_AVX256_SPLIT_UNALIGNED_LOAD and
12387 TARGET_AVX256_SPLIT_UNALIGNED_STORE.
12388 (STORE_MAX_PIECES): Check TARGET_AVX256_STORE_BY_PIECES instead
12389 of TARGET_AVX256_SPLIT_UNALIGNED_STORE.
12390 * config/i386/x86-tune.def (X86_TUNE_AVX256_MOVE_BY_PIECES): New.
12391 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
12393 2021-09-13 liuhongt <hongtao.liu@intel.com>
12395 PR bootstrap/102302
12396 * expmed.c (extract_bit_field_using_extv): Use
12397 gen_lowpart_if_possible instead of gen_lowpart to avoid ICE.
12399 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
12401 * Makefile.in (OBJS): Add value-pointer-equiv.o.
12402 * gimple-ssa-evrp.c (class ssa_equiv_stack): Move to
12403 value-pointer-equiv.*.
12404 (ssa_equiv_stack::ssa_equiv_stack): Same.
12405 (ssa_equiv_stack::enter): Same.
12406 (ssa_equiv_stack::leave): Same.
12407 (ssa_equiv_stack::push_replacement): Same.
12408 (ssa_equiv_stack::get_replacement): Same.
12409 (is_pointer_ssa): Same.
12410 (class pointer_equiv_analyzer): Same.
12411 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
12412 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Same.
12413 (pointer_equiv_analyzer::set_global_equiv): Same.
12414 (pointer_equiv_analyzer::set_cond_equiv): Same.
12415 (pointer_equiv_analyzer::get_equiv): Same.
12416 (pointer_equiv_analyzer::enter): Same.
12417 (pointer_equiv_analyzer::leave): Same.
12418 (pointer_equiv_analyzer::get_equiv_expr): Same.
12419 (pta_valueize): Same.
12420 (pointer_equiv_analyzer::visit_stmt): Same.
12421 (pointer_equiv_analyzer::visit_edge): Same.
12422 (hybrid_folder::value_of_expr): Same.
12423 (hybrid_folder::value_on_edge): Same.
12424 * value-pointer-equiv.cc: New file.
12425 * value-pointer-equiv.h: New file.
12427 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
12430 * gimple-fold.c (gimple_fold_builtin_memory_op): Allow folding
12431 memcpy if the size is not more than MOVE_MAX * MOVE_RATIO.
12433 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
12436 * config/arm/arm.md (movmisaligndi): New define_expand.
12437 * config/arm/vec-common.md (movmisalign<mode>): Iterate over VDQ mode.
12439 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
12442 * emit-rtl.c (gen_highpart): Use adjust_address to handle
12443 MEM rather than calling simplify_gen_subreg.
12445 2021-09-13 Jan-Benedict Glaw <jbglaw@ług-owl.de>
12447 * config/alpha/vms.h (INIT_CUMULATIVE_ARGS): Wrap multi-statement
12448 define into a block.
12450 2021-09-13 Richard Biener <rguenther@suse.de>
12452 * config/darwin.h (DARWIN_PREFER_DWARF): Do not define.
12453 * config/i386/darwin.h (PREFERRED_DEBUGGING_TYPE): Do not
12454 change based on DARWIN_PREFER_DWARF not being defined.
12456 2021-09-13 Richard Biener <rguenther@suse.de>
12458 * config/i386/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
12459 to inherit from elfos.h.
12461 2021-09-13 Richard Biener <rguenther@suse.de>
12463 * config.gcc: Add cr16-*-* to the list of obsoleted targets.
12465 2021-09-13 Richard Biener <rguenther@suse.de>
12467 * config/avr/elf.h (PREFERRED_DEBUGGING_TYPE): Remove
12468 override, pick up DWARF2_DEBUG define from elfos.h.
12470 2021-09-13 Richard Biener <rguenther@suse.de>
12472 * config/rx/rx.h (PREFERRED_DEBUGGING_TYPE): Always define to
12475 2021-09-13 Richard Biener <rguenther@suse.de>
12477 * config/alpha/vms.h (PREFERRED_DEBUGGING_TYPE): Define to
12480 2021-09-13 Richard Biener <rguenther@suse.de>
12482 * config/i386/cygming.h: Always default to DWARF2 debugging.
12483 Do not define DBX_DEBUGGING_INFO, that's done via dbxcoff.h
12485 * doc/install.texi: Document binutils 2.16 as minimum
12486 requirement for mingw.
12488 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
12490 * config/rs6000/rs6000.c (struct rs6000_cost_data): New members
12491 nstmts, nloads and extra_ctor_cost.
12492 (rs6000_density_test): Add load density related heuristics. Do
12493 extra costing on vector construction statements if needed.
12494 (rs6000_init_cost): Init new members.
12495 (rs6000_update_target_cost_per_stmt): New function.
12496 (rs6000_add_stmt_cost): Factor vect_nonmem hunk out to function
12497 rs6000_update_target_cost_per_stmt and call it.
12499 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
12501 * config/rs6000/rs6000.c (struct rs6000_cost_data): Remove typedef.
12502 (rs6000_init_cost): Adjust.
12504 2021-09-13 liuhongt <hongtao.liu@intel.com>
12506 * config/i386/i386.md (UNSPEC_COPYSIGN): Remove.
12507 (UNSPEC_XORSIGN): Ditto.
12509 2021-09-12 Roger Sayle <roger@nextmovesoftware.com>
12511 * expr.c (convert_move): Preserve SUBREG_PROMOTED_VAR_P when
12512 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
12515 2021-09-11 Aldy Hernandez <aldyh@redhat.com>
12517 * tree-ssa-threadbackward.c (class back_threader_registry): Use
12518 back_jt_path_registry.
12519 * tree-ssa-threadedge.c (jump_threader::jump_threader): Use
12520 fwd_jt_path_registry.
12521 * tree-ssa-threadedge.h (class jump_threader): Same.
12522 * tree-ssa-threadupdate.c
12523 (jump_thread_path_registry::jump_thread_path_registry): Rename...
12524 (jt_path_registry::jt_path_registry): ...to this.
12525 (jump_thread_path_registry::~jump_thread_path_registry): Rename...
12526 (jt_path_registry::~jt_path_registry): ...to this.
12527 (fwd_jt_path_registry::fwd_jt_path_registry): New.
12528 (fwd_jt_path_registry::~fwd_jt_path_registry): New.
12529 (jump_thread_path_registry::allocate_thread_edge): Rename...
12530 (jt_path_registry::allocate_thread_edge): ...to this.
12531 (jump_thread_path_registry::allocate_thread_path): Rename...
12532 (jt_path_registry::allocate_thread_path): ...to this.
12533 (jump_thread_path_registry::lookup_redirection_data): Rename...
12534 (fwd_jt_path_registry::lookup_redirection_data): ...to this.
12535 (jump_thread_path_registry::thread_block_1): Rename...
12536 (fwd_jt_path_registry::thread_block_1): ...to this.
12537 (jump_thread_path_registry::thread_block): Rename...
12538 (fwd_jt_path_registry::thread_block): ...to this.
12539 (jt_path_registry::thread_through_loop_header): Rename...
12540 (fwd_jt_path_registry::thread_through_loop_header): ...to this.
12541 (jump_thread_path_registry::mark_threaded_blocks): Rename...
12542 (fwd_jt_path_registry::mark_threaded_blocks): ...to this.
12543 (jump_thread_path_registry::debug_path): Rename...
12544 (jt_path_registry::debug_path): ...to this.
12545 (jump_thread_path_registry::dump): Rename...
12546 (jt_path_registry::debug): ...to this.
12547 (jump_thread_path_registry::rewire_first_differing_edge): Rename...
12548 (back_jt_path_registry::rewire_first_differing_edge): ...to this.
12549 (jump_thread_path_registry::adjust_paths_after_duplication): Rename...
12550 (back_jt_path_registry::adjust_paths_after_duplication): ...to this.
12551 (jump_thread_path_registry::duplicate_thread_path): Rename...
12552 (back_jt_path_registry::duplicate_thread_path): ...to this. Also,
12553 drop ill-formed candidates.
12554 (jump_thread_path_registry::remove_jump_threads_including): Rename...
12555 (fwd_jt_path_registry::remove_jump_threads_including): ...to this.
12556 (jt_path_registry::thread_through_all_blocks): New.
12557 (back_jt_path_registry::update_cfg): New.
12558 (fwd_jt_path_registry::update_cfg): New.
12559 (jump_thread_path_registry::register_jump_thread): Rename...
12560 (jt_path_registry::register_jump_thread): ...to this.
12561 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
12563 (class jt_path_registry): ...here.
12564 (class fwd_jt_path_registry): New.
12565 (class back_jt_path_registry): New.
12567 2021-09-10 liuhongt <hongtao.liu@intel.com>
12570 2021-09-01 liuhongt <hongtao.liu@intel.com>
12572 * emit-rtl.c (validate_subreg): Get rid of all float-int
12575 2021-09-10 Jakub Jelinek <jakub@redhat.com>
12577 * tree-core.h (enum omp_memory_order): Add OMP_MEMORY_ORDER_MASK,
12578 OMP_FAIL_MEMORY_ORDER_UNSPECIFIED, OMP_FAIL_MEMORY_ORDER_RELAXED,
12579 OMP_FAIL_MEMORY_ORDER_ACQUIRE, OMP_FAIL_MEMORY_ORDER_RELEASE,
12580 OMP_FAIL_MEMORY_ORDER_ACQ_REL, OMP_FAIL_MEMORY_ORDER_SEQ_CST and
12581 OMP_FAIL_MEMORY_ORDER_MASK enumerators.
12582 (OMP_FAIL_MEMORY_ORDER_SHIFT): Define.
12583 * gimple-pretty-print.c (dump_gimple_omp_atomic_load,
12584 dump_gimple_omp_atomic_store): Print [weak] for weak atomic
12586 * gimple.h (enum gf_mask): Change GF_OMP_ATOMIC_MEMORY_ORDER
12587 to 6-bit mask, adjust GF_OMP_ATOMIC_NEED_VALUE value and add
12588 GF_OMP_ATOMIC_WEAK.
12589 (gimple_omp_atomic_weak_p, gimple_omp_atomic_set_weak): New inline
12591 * tree.h (OMP_ATOMIC_WEAK): Define.
12592 * tree-pretty-print.c (dump_omp_atomic_memory_order): Adjust for
12593 fail memory order being encoded in the same enum and also print
12594 fail clause if present.
12595 (dump_generic_node): Print weak clause if OMP_ATOMIC_WEAK.
12596 * gimplify.c (goa_stabilize_expr): Add target_expr and rhs arguments,
12597 handle pre_p == NULL case as a test mode that only returns value
12598 but doesn't change gimplify nor change anything otherwise, adjust
12599 recursive calls, add MODIFY_EXPR, ADDR_EXPR, COND_EXPR, TARGET_EXPR
12600 and CALL_EXPR handling, adjust COMPOUND_EXPR handling for
12601 __builtin_clear_padding calls, for !rhs gimplify as lvalue rather
12603 (gimplify_omp_atomic): Adjust goa_stabilize_expr caller. Handle
12604 COND_EXPR rhs. Set weak flag on gimple load/store for
12606 * omp-expand.c (omp_memory_order_to_fail_memmodel): New function.
12607 (omp_memory_order_to_memmodel): Adjust for fail clause encoded
12609 (expand_omp_atomic_cas): New function.
12610 (expand_omp_atomic_pipeline): Use omp_memory_order_to_fail_memmodel
12612 (expand_omp_atomic): Attempt to optimize atomic compare and exchange
12613 using expand_omp_atomic_cas.
12615 2021-09-10 Aldy Hernandez <aldyh@redhat.com>
12616 Michael Matz <matz@suse.de>
12618 * tree-pass.h (PROP_loop_opts_done): New.
12619 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
12620 Intersect with global range.
12621 * tree-ssa-loop.c (tree_ssa_loop_done): Set PROP_loop_opts_done.
12622 * tree-ssa-threadbackward.c
12623 (back_threader_profitability::profitable_path_p): Disable
12624 threading through latches until after loop optimizations have run.
12626 2021-09-10 David Faust <david.faust@oracle.com>
12628 * doc/invoke.texi: Document BPF -mcpu, -mjmpext, -mjmp32 and -malu32
12631 2021-09-10 David Faust <david.faust@oracle.com>
12633 * config/bpf/bpf-opts.h (bpf_isa_version): New enum.
12634 * config/bpf/bpf-protos.h (bpf_expand_cbranch): New.
12635 * config/bpf/bpf.c (bpf_option_override): Handle -mcpu option.
12636 (bpf_expand_cbranch): New function.
12637 * config/bpf/bpf.md (AM mode iterator): Conditionalize support for SI
12639 (zero_extendsidi2): Only use mov32 instruction if it is available.
12640 (SIM mode iterator): Conditionalize support for SI mode.
12641 (JM mode iterator): New.
12642 (cbranchdi4): Update name, use new JM iterator. Use bpf_expand_cbranch.
12643 (*branch_on_di): Update name, use new JM iterator.
12644 * config/bpf/bpf.opt (mjmpext): New option.
12645 (malu32): Likewise.
12646 (mjmp32): Likewise.
12648 (bpf_isa): New enum.
12650 2021-09-10 David Faust <david.faust@oracle.com>
12652 * config/bpf/bpf.md (zero_extendhidi2): Add new output template
12653 for register-to-register extensions.
12654 (zero_extendqidi2): Likewise.
12656 2021-09-10 Richard Biener <rguenther@suse.de>
12658 PR middle-end/102273
12659 * internal-fn.c (expand_DEFERRED_INIT): Always expand non-SSA vars.
12661 2021-09-10 Richard Biener <rguenther@suse.de>
12663 PR middle-end/102269
12664 * gimplify.c (is_var_need_auto_init): Empty types do not need
12667 2021-09-10 Richard Biener <rguenther@suse.de>
12669 * configure.ac (--with-stabs): Remove.
12670 * configure: Regenerate.
12671 * doc/install.texi: Remove --with-stabs documentation.
12673 2021-09-10 liuhongt <hongtao.liu@intel.com>
12675 * config/i386/avx512fp16intrin.h (_mm512_cmp_ph_mask): New intrinsic.
12677 (_mm512_mask_cmp_ph_mask): Likewise.
12678 (_mm512_cmp_round_ph_mask): Likewise.
12679 (_mm512_mask_cmp_round_ph_mask): Likewise.
12680 (_mm_cmp_sh_mask): Likewise.
12681 (_mm_mask_cmp_sh_mask): Likewise.
12682 (_mm_cmp_round_sh_mask): Likewise.
12683 (_mm_mask_cmp_round_sh_mask): Likewise.
12684 (_mm_comieq_sh): Likewise.
12685 (_mm_comilt_sh): Likewise.
12686 (_mm_comile_sh): Likewise.
12687 (_mm_comigt_sh): Likewise.
12688 (_mm_comige_sh): Likewise.
12689 (_mm_comineq_sh): Likewise.
12690 (_mm_ucomieq_sh): Likewise.
12691 (_mm_ucomilt_sh): Likewise.
12692 (_mm_ucomile_sh): Likewise.
12693 (_mm_ucomigt_sh): Likewise.
12694 (_mm_ucomige_sh): Likewise.
12695 (_mm_ucomineq_sh): Likewise.
12696 (_mm_comi_round_sh): Likewise.
12697 (_mm_comi_sh): Likewise.
12698 * config/i386/avx512fp16vlintrin.h (_mm_cmp_ph_mask): New intrinsic.
12699 (_mm_mask_cmp_ph_mask): Likewise.
12700 (_mm256_cmp_ph_mask): Likewise.
12701 (_mm256_mask_cmp_ph_mask): Likewise.
12702 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12703 * config/i386/i386-builtin.def: Add corresponding new builtins.
12704 * config/i386/i386-expand.c
12705 (ix86_expand_args_builtin): Handle new builtin types.
12706 (ix86_expand_round_builtin): Ditto.
12707 * config/i386/i386.md (ssevecmode): Add HF mode.
12708 (MODEFH): New mode iterator.
12709 * config/i386/sse.md
12710 (V48H_AVX512VL): New mode iterator to support HF vector modes.
12711 Adjust corresponding description.
12712 (ssecmpintprefix): New.
12713 (VI12_AVX512VL): Adjust to support HF vector modes.
12714 (cmp_imm_predicate): Likewise.
12715 (<avx512>_cmp<mode>3<mask_scalar_merge_name><round_saeonly_name>):
12717 (avx512f_vmcmp<mode>3<round_saeonly_name>): Likewise.
12718 (avx512f_vmcmp<mode>3_mask<round_saeonly_name>): Likewise.
12719 (<sse>_<unord>comi<round_saeonly_name>): Likewise.
12721 2021-09-10 liuhongt <hongtao.liu@intel.com>
12723 * config/i386/avx512fp16intrin.h: (_mm512_max_ph): New intrinsic.
12724 (_mm512_mask_max_ph): Likewise.
12725 (_mm512_maskz_max_ph): Likewise.
12726 (_mm512_min_ph): Likewise.
12727 (_mm512_mask_min_ph): Likewise.
12728 (_mm512_maskz_min_ph): Likewise.
12729 (_mm512_max_round_ph): Likewise.
12730 (_mm512_mask_max_round_ph): Likewise.
12731 (_mm512_maskz_max_round_ph): Likewise.
12732 (_mm512_min_round_ph): Likewise.
12733 (_mm512_mask_min_round_ph): Likewise.
12734 (_mm512_maskz_min_round_ph): Likewise.
12735 (_mm_max_sh): Likewise.
12736 (_mm_mask_max_sh): Likewise.
12737 (_mm_maskz_max_sh): Likewise.
12738 (_mm_min_sh): Likewise.
12739 (_mm_mask_min_sh): Likewise.
12740 (_mm_maskz_min_sh): Likewise.
12741 (_mm_max_round_sh): Likewise.
12742 (_mm_mask_max_round_sh): Likewise.
12743 (_mm_maskz_max_round_sh): Likewise.
12744 (_mm_min_round_sh): Likewise.
12745 (_mm_mask_min_round_sh): Likewise.
12746 (_mm_maskz_min_round_sh): Likewise.
12747 * config/i386/avx512fp16vlintrin.h (_mm_max_ph): New intrinsic.
12748 (_mm256_max_ph): Likewise.
12749 (_mm_mask_max_ph): Likewise.
12750 (_mm256_mask_max_ph): Likewise.
12751 (_mm_maskz_max_ph): Likewise.
12752 (_mm256_maskz_max_ph): Likewise.
12753 (_mm_min_ph): Likewise.
12754 (_mm256_min_ph): Likewise.
12755 (_mm_mask_min_ph): Likewise.
12756 (_mm256_mask_min_ph): Likewise.
12757 (_mm_maskz_min_ph): Likewise.
12758 (_mm256_maskz_min_ph): Likewise.
12759 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12760 * config/i386/i386-builtin.def: Add corresponding new builtins.
12761 * config/i386/i386-expand.c
12762 (ix86_expand_args_builtin): Handle new builtin types.
12763 * config/i386/sse.md
12764 (<code><mode>3<mask_name><round_saeonly_name>): Adjust to
12765 support HF vector modes.
12766 (*<code><mode>3<mask_name><round_saeonly_name>): Likewise.
12767 (ieee_<ieee_maxmin><mode>3<mask_name><round_saeonly_name>):
12769 (<sse>_vm<code><mode>3<mask_scalar_name><round_saeonly_scalar_name>):
12771 * config/i386/subst.md (round_saeonly_mode512bit_condition):
12772 Adjust for HF vector modes.
12774 2021-09-10 Liu, Hongtao <hongtao.liu@intel.com>
12776 * config/i386/avx512fp16intrin.h (_mm_add_sh): New intrinsic.
12777 (_mm_mask_add_sh): Likewise.
12778 (_mm_maskz_add_sh): Likewise.
12779 (_mm_sub_sh): Likewise.
12780 (_mm_mask_sub_sh): Likewise.
12781 (_mm_maskz_sub_sh): Likewise.
12782 (_mm_mul_sh): Likewise.
12783 (_mm_mask_mul_sh): Likewise.
12784 (_mm_maskz_mul_sh): Likewise.
12785 (_mm_div_sh): Likewise.
12786 (_mm_mask_div_sh): Likewise.
12787 (_mm_maskz_div_sh): Likewise.
12788 (_mm_add_round_sh): Likewise.
12789 (_mm_mask_add_round_sh): Likewise.
12790 (_mm_maskz_add_round_sh): Likewise.
12791 (_mm_sub_round_sh): Likewise.
12792 (_mm_mask_sub_round_sh): Likewise.
12793 (_mm_maskz_sub_round_sh): Likewise.
12794 (_mm_mul_round_sh): Likewise.
12795 (_mm_mask_mul_round_sh): Likewise.
12796 (_mm_maskz_mul_round_sh): Likewise.
12797 (_mm_div_round_sh): Likewise.
12798 (_mm_mask_div_round_sh): Likewise.
12799 (_mm_maskz_div_round_sh): Likewise.
12800 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12801 * config/i386/i386-builtin.def: Add corresponding new builtins.
12802 * config/i386/i386-expand.c
12803 (ix86_expand_round_builtin): Handle new builtins.
12804 * config/i386/sse.md (VF_128): Change description.
12805 (<sse>_vm<plusminus_insn><mode>3<mask_scalar_name><round_scalar_name>):
12806 Adjust to support HF vector modes.
12807 (<sse>_vm<multdiv_mnemonic><mode>3<mask_scalar_name><round_scalar_name>):
12810 2021-09-10 H.J. Lu <hjl.tools@gmail.com>
12812 * config/i386/i386-expand.c
12813 (ix86_avx256_split_vector_move_misalign): Handle V16HF mode.
12814 * config/i386/i386.c
12815 (ix86_preferred_simd_mode): Handle HF mode.
12816 * config/i386/sse.md (V_256H): New mode iterator.
12817 (avx_vextractf128<mode>): Use it.
12818 (VEC_INIT_MODE): Align vector HFmode condition to vector
12819 HImodes since no real HF instructions are used.
12820 (VEC_INIT_HALF_MODE): Ditto.
12822 (VIHF_AVX512BW): Ditto.
12823 (*vec_extracthf): Ditto.
12824 (VEC_EXTRACT_MODE): Ditto.
12826 2021-09-10 Richard Biener <rguenther@suse.de>
12829 * config/dbx.h: Remove.
12830 * config/dbxcoff.h: Do not define PREFERRED_DEBUGGING_TYPE.
12831 * config/lynx.h: Likewise.
12833 2021-09-10 liuhongt <hongtao.liu@intel.com>
12835 * config/i386/i386-expand.c (ix86_expand_copysign): Expand
12836 right into ANDNOT + AND + IOR, using paradoxical subregs.
12837 (ix86_split_copysign_const): Remove.
12838 (ix86_split_copysign_var): Ditto.
12839 * config/i386/i386-protos.h (ix86_split_copysign_const): Ditto.
12840 (ix86_split_copysign_var): Ditto.
12841 * config/i386/i386.md (@copysign<mode>3_const): Ditto.
12842 (@copysign<mode>3_var): Ditto.
12844 2021-09-09 qing zhao <qing.zhao@oracle.com>
12846 * builtins.c (expand_builtin_memset): Make externally visible.
12847 * builtins.h (expand_builtin_memset): Declare extern.
12848 * common.opt (ftrivial-auto-var-init=): New option.
12849 * doc/extend.texi: Document the uninitialized attribute.
12850 * doc/invoke.texi: Document -ftrivial-auto-var-init.
12851 * flag-types.h (enum auto_init_type): New enumerated type
12853 * gimple-fold.c (clear_padding_type): Add one new parameter.
12854 (clear_padding_union): Likewise.
12855 (clear_padding_emit_loop): Likewise.
12856 (clear_type_padding_in_mask): Likewise.
12857 (gimple_fold_builtin_clear_padding): Handle this new parameter.
12858 * gimplify.c (gimple_add_init_for_auto_var): New function.
12859 (gimple_add_padding_init_for_auto_var): New function.
12860 (is_var_need_auto_init): New function.
12861 (gimplify_decl_expr): Add initialization to automatic variables per
12863 (gimplify_call_expr): Add one new parameter for call to
12864 __builtin_clear_padding.
12865 (gimplify_init_constructor): Add padding initialization in the end.
12866 * internal-fn.c (INIT_PATTERN_VALUE): New macro.
12867 (expand_DEFERRED_INIT): New function.
12868 * internal-fn.def (DEFERRED_INIT): New internal function.
12869 * tree-cfg.c (verify_gimple_call): Verify calls to .DEFERRED_INIT.
12870 * tree-sra.c (generate_subtree_deferred_init): New function.
12871 (scan_function): Avoid setting cannot_scalarize_away_bitmap for
12872 calls to .DEFERRED_INIT.
12873 (sra_modify_deferred_init): New function.
12874 (sra_modify_function_body): Handle calls to DEFERRED_INIT specially.
12875 * tree-ssa-structalias.c (find_func_aliases_for_call): Likewise.
12876 * tree-ssa-uninit.c (warn_uninit): Handle calls to DEFERRED_INIT
12878 (check_defs): Likewise.
12879 (warn_uninitialized_vars): Likewise.
12880 * tree-ssa.c (ssa_undefined_value_p): Likewise.
12881 * tree.c (build_common_builtin_nodes): Build tree node for
12882 BUILT_IN_CLEAR_PADDING when needed.
12884 2021-09-09 Richard Biener <rguenther@suse.de>
12886 * tree-ssa-loop-im.c (fill_always_executed_in_1): Walk
12889 2021-09-09 Richard Biener <rguenther@suse.de>
12891 * tree-ssa-loop-im.c (fill_always_executed_in_1): Integrate
12892 DOM walk from get_loop_body_in_dom_order using a worklist
12895 2021-09-09 liuhongt <hongtao.liu@intel.com>
12897 * config.gcc: Add avx512fp16vlintrin.h.
12898 * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic.
12899 (_mm512_mask_add_ph): Likewise.
12900 (_mm512_maskz_add_ph): Likewise.
12901 (_mm512_sub_ph): Likewise.
12902 (_mm512_mask_sub_ph): Likewise.
12903 (_mm512_maskz_sub_ph): Likewise.
12904 (_mm512_mul_ph): Likewise.
12905 (_mm512_mask_mul_ph): Likewise.
12906 (_mm512_maskz_mul_ph): Likewise.
12907 (_mm512_div_ph): Likewise.
12908 (_mm512_mask_div_ph): Likewise.
12909 (_mm512_maskz_div_ph): Likewise.
12910 (_mm512_add_round_ph): Likewise.
12911 (_mm512_mask_add_round_ph): Likewise.
12912 (_mm512_maskz_add_round_ph): Likewise.
12913 (_mm512_sub_round_ph): Likewise.
12914 (_mm512_mask_sub_round_ph): Likewise.
12915 (_mm512_maskz_sub_round_ph): Likewise.
12916 (_mm512_mul_round_ph): Likewise.
12917 (_mm512_mask_mul_round_ph): Likewise.
12918 (_mm512_maskz_mul_round_ph): Likewise.
12919 (_mm512_div_round_ph): Likewise.
12920 (_mm512_mask_div_round_ph): Likewise.
12921 (_mm512_maskz_div_round_ph): Likewise.
12922 * config/i386/avx512fp16vlintrin.h: New header.
12923 * config/i386/i386-builtin-types.def (V16HF, V8HF, V32HF):
12924 Add new builtin types.
12925 * config/i386/i386-builtin.def: Add corresponding builtins.
12926 * config/i386/i386-expand.c
12927 (ix86_expand_args_builtin): Handle new builtin types.
12928 (ix86_expand_round_builtin): Likewise.
12929 * config/i386/immintrin.h: Include avx512fp16vlintrin.h
12930 * config/i386/sse.md (VFH): New mode_iterator.
12932 (avx512fmaskmode): Add HF vector modes.
12933 (avx512fmaskhalfmode): Likewise.
12934 (<plusminus_insn><mode>3<mask_name><round_name>): Adjust to for
12936 (*<plusminus_insn><mode>3<mask_name><round_name>): Likewise.
12937 (mul<mode>3<mask_name><round_name>): Likewise.
12938 (*mul<mode>3<mask_name><round_name>): Likewise.
12939 (div<mode>3): Likewise.
12940 (<sse>_div<mode>3<mask_name><round_name>): Likewise.
12941 * config/i386/subst.md (SUBST_V): Add HF vector modes.
12942 (SUBST_A): Likewise.
12943 (round_mode512bit_condition): Adjust for V32HFmode.
12945 2021-09-09 liuhongt <hongtao.liu@intel.com>
12948 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
12949 (reduc_plus_scal_v4sf): .. this, New define_expand.
12950 (reduc_plus_scal_v2df): .. and this, New define_expand.
12952 2021-09-09 liuhongt <hongtao.liu@intel.com>
12955 * config/i386/sse.md (*vec_extract<mode><ssescalarmodelower>_valign):
12958 2021-09-08 Jonathan Wakely <jwakely@redhat.com>
12961 * doc/trouble.texi (Copy Assignment): Fix description of
12962 behaviour and fix code in example.
12964 2021-09-08 Segher Boessenkool <segher@kernel.crashing.org>
12967 * config/rs6000/rs6000-logue.c (rs6000_emit_epilogue): For ELFv2 use
12968 r11 instead of r12 for restoring CR.
12970 2021-09-08 Jakub Jelinek <jakub@redhat.com>
12971 liuhongt <hongtao.liu@intel.com>
12974 * config/i386/i386.md (@xorsign<mode>3_1): Remove.
12975 * config/i386/i386-expand.c (ix86_expand_xorsign): Expand right away
12976 into AND with mask and XOR, using paradoxical subregs.
12977 (ix86_split_xorsign): Remove.
12978 * config/i386/i386-protos.h (ix86_split_xorsign): Remove.
12980 2021-09-08 Di Zhao <dizhao@os.amperecomputing.com>
12982 * tree-ssa-sccvn.c (vn_nary_op_insert_into): fix result compare
12984 2021-09-08 Jakub Jelinek <jakub@redhat.com>
12987 * config/i386/i386.md (xorsign<mode>3): If operands[1] is equal to
12988 operands[2], emit abs<mode>2 instead.
12989 (@xorsign<mode>3_1): Add early-clobbers for output operand, enable
12990 first alternative even for avx, add another alternative with
12991 =&Yv <- 0, Yv, Yvm constraints.
12992 * config/i386/i386-expand.c (ix86_split_xorsign): If op0 is equal
12993 to op1, emit vpandn instead.
12995 2021-09-08 liuhongt <hongtao.liu@intel.com>
12997 * config/i386/avx512fp16intrin.h (_mm_set_ph): New intrinsic.
12998 (_mm256_set_ph): Likewise.
12999 (_mm512_set_ph): Likewise.
13000 (_mm_setr_ph): Likewise.
13001 (_mm256_setr_ph): Likewise.
13002 (_mm512_setr_ph): Likewise.
13003 (_mm_set1_ph): Likewise.
13004 (_mm256_set1_ph): Likewise.
13005 (_mm512_set1_ph): Likewise.
13006 (_mm_setzero_ph): Likewise.
13007 (_mm256_setzero_ph): Likewise.
13008 (_mm512_setzero_ph): Likewise.
13009 (_mm_set_sh): Likewise.
13010 (_mm_load_sh): Likewise.
13011 (_mm_store_sh): Likewise.
13012 * config/i386/i386-builtin-types.def (V8HF): New type.
13013 (DEF_FUNCTION_TYPE (V8HF, V8HI)): New builtin function type
13014 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
13015 Support vector HFmodes.
13016 (ix86_expand_vector_init_one_nonzero): Likewise.
13017 (ix86_expand_vector_init_one_var): Likewise.
13018 (ix86_expand_vector_init_interleave): Likewise.
13019 (ix86_expand_vector_init_general): Likewise.
13020 (ix86_expand_vector_set): Likewise.
13021 (ix86_expand_vector_extract): Likewise.
13022 (ix86_expand_vector_init_concat): Likewise.
13023 (ix86_expand_sse_movcc): Handle vector HFmodes.
13024 (ix86_expand_vector_set_var): Ditto.
13025 * config/i386/i386-modes.def: Add HF vector modes in comment.
13026 * config/i386/i386.c (classify_argument): Add HF vector modes.
13027 (ix86_hard_regno_mode_ok): Allow HF vector modes for AVX512FP16.
13028 (ix86_vector_mode_supported_p): Likewise.
13029 (ix86_set_reg_reg_cost): Handle vector HFmode.
13030 (ix86_get_ssemov): Handle vector HFmode.
13031 (function_arg_advance_64): Pass unnamed V16HFmode and V32HFmode
13033 (function_arg_advance_32): Pass V8HF/V16HF/V32HF by sse reg for 32bit
13035 (function_arg_advance_32): Ditto.
13036 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): New.
13037 (VALID_AVX256_REG_OR_OI_MODE): Rename to ..
13038 (VALID_AVX256_REG_OR_OI_VHF_MODE): .. this, and add V16HF.
13039 (VALID_SSE2_REG_VHF_MODE): New.
13040 (VALID_AVX512VL_128_REG_MODE): Add V8HF and TImode.
13041 (SSE_REG_MODE_P): Add vector HFmode.
13042 * config/i386/i386.md (mode): Add HF vector modes.
13043 (MODE_SIZE): Likewise.
13044 (ssemodesuffix): Add ph suffix for HF vector modes.
13045 * config/i386/sse.md (VFH_128): New mode iterator.
13046 (VMOVE): Adjust for HF vector modes.
13048 (V_256_512): Likewise.
13049 (avx512): Likewise.
13050 (avx512fmaskmode): Likewise.
13051 (shuffletype): Likewise.
13052 (sseinsnmode): Likewise.
13053 (ssedoublevecmode): Likewise.
13054 (ssehalfvecmode): Likewise.
13055 (ssehalfvecmodelower): Likewise.
13056 (ssePScmode): Likewise.
13057 (ssescalarmode): Likewise.
13058 (ssescalarmodelower): Likewise.
13059 (sseintprefix): Likewise.
13061 (bcstscalarsuff): Likewise.
13062 (xtg_mode): Likewise.
13063 (VI12HF_AVX512VL): New mode_iterator.
13064 (VF_AVX512FP16): Likewise.
13066 (VIHF_256): Likewise.
13067 (VIHF_AVX512BW): Likewise.
13068 (V16_256): Likewise.
13069 (V32_512): Likewise.
13070 (sseintmodesuffix): New mode_attr.
13071 (sse): Add scalar and vector HFmodes.
13072 (ssescalarmode): Add vector HFmode mapping.
13073 (ssescalarmodesuffix): Add sh suffix for HFmode.
13074 (*<sse>_vm<insn><mode>3): Use VFH_128.
13075 (*<sse>_vm<multdiv_mnemonic><mode>3): Likewise.
13076 (*ieee_<ieee_maxmin><mode>3): Likewise.
13077 (<avx512>_blendm<mode>): New define_insn.
13078 (vec_setv8hf): New define_expand.
13079 (vec_set<mode>_0): New define_insn for HF vector set.
13080 (*avx512fp16_movsh): Likewise.
13081 (avx512fp16_movsh): Likewise.
13082 (vec_extract_lo_v32hi): Rename to ...
13083 (vec_extract_lo_<mode>): ... this, and adjust to allow HF
13085 (vec_extract_hi_v32hi): Likewise.
13086 (vec_extract_hi_<mode>): Likewise.
13087 (vec_extract_lo_v16hi): Likewise.
13088 (vec_extract_lo_<mode>): Likewise.
13089 (vec_extract_hi_v16hi): Likewise.
13090 (vec_extract_hi_<mode>): Likewise.
13091 (vec_set_hi_v16hi): Likewise.
13092 (vec_set_hi_<mode>): Likewise.
13093 (vec_set_lo_v16hi): Likewise.
13094 (vec_set_lo_<mode>): Likewise.
13095 (*vec_extract<mode>_0): New define_insn_and_split for HF
13097 (*vec_extracthf): New define_insn.
13098 (VEC_EXTRACT_MODE): Add HF vector modes.
13099 (PINSR_MODE): Add V8HF.
13100 (sse2p4_1): Likewise.
13101 (pinsr_evex_isa): Likewise.
13102 (<sse2p4_1>_pinsr<ssemodesuffix>): Adjust to support
13103 insert for V8HFmode.
13104 (pbroadcast_evex_isa): Add HF vector modes.
13105 (AVX2_VEC_DUP_MODE): Likewise.
13106 (VEC_INIT_MODE): Likewise.
13107 (VEC_INIT_HALF_MODE): Likewise.
13108 (avx2_pbroadcast<mode>): Adjust to support HF vector mode
13110 (avx2_pbroadcast<mode>_1): Likewise.
13111 (<avx512>_vec_dup<mode>_1): Likewise.
13112 (<avx512>_vec_dup<mode><mask_name>): Likewise.
13113 (<mask_codefor><avx512>_vec_dup_gpr<mode><mask_name>):
13116 2021-09-08 Guo, Xuepeng <xuepeng.guo@intel.com>
13117 H.J. Lu <hongjiu.lu@intel.com>
13118 Liu Hongtao <hongtao.liu@intel.com>
13119 Wang Hongyu <hongyu.wang@intel.com>
13120 Xu Dianhong <dianhong.xu@intel.com>
13122 * common/config/i386/cpuinfo.h (get_available_features):
13123 Detect FEATURE_AVX512FP16.
13124 * common/config/i386/i386-common.c
13125 (OPTION_MASK_ISA_AVX512FP16_SET,
13126 OPTION_MASK_ISA_AVX512FP16_UNSET,
13127 OPTION_MASK_ISA2_AVX512FP16_SET,
13128 OPTION_MASK_ISA2_AVX512FP16_UNSET): New.
13129 (OPTION_MASK_ISA2_AVX512BW_UNSET,
13130 OPTION_MASK_ISA2_AVX512BF16_UNSET): Add AVX512FP16.
13131 (ix86_handle_option): Handle -mavx512fp16.
13132 * common/config/i386/i386-cpuinfo.h (enum processor_features):
13133 Add FEATURE_AVX512FP16.
13134 * common/config/i386/i386-isas.h: Add entry for AVX512FP16.
13135 * config.gcc: Add avx512fp16intrin.h.
13136 * config/i386/avx512fp16intrin.h: New intrinsic header.
13137 * config/i386/cpuid.h: Add bit_AVX512FP16.
13138 * config/i386/i386-builtin-types.def: (FLOAT16): New primitive type.
13139 * config/i386/i386-builtins.c: Support _Float16 type for i386
13141 (ix86_register_float16_builtin_type): New function.
13142 (ix86_float16_type_node): New.
13143 * config/i386/i386-c.c (ix86_target_macros_internal): Define
13145 * config/i386/i386-expand.c (ix86_expand_branch): Support
13147 (ix86_prepare_fp_compare_args): Adjust TARGET_SSE_MATH &&
13148 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
13149 (ix86_expand_fp_movcc): Ditto.
13150 * config/i386/i386-isa.def: Add PTA define for AVX512FP16.
13151 * config/i386/i386-options.c (isa2_opts): Add -mavx512fp16.
13152 (ix86_valid_target_attribute_inner_p): Add avx512fp16 attribute.
13153 * config/i386/i386.c (ix86_get_ssemov): Use
13154 vmovdqu16/vmovw/vmovsh for HFmode/HImode scalar or vector.
13155 (ix86_get_excess_precision): Use
13156 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when TARGET_AVX512FP16
13158 (sse_store_index): Use SFmode cost for HFmode cost.
13159 (inline_memory_move_cost): Add HFmode, and prefer SSE cost over
13160 GPR cost for HFmode.
13161 (ix86_hard_regno_mode_ok): Allow HImode in sse register.
13162 (ix86_mangle_type): Add mangling for _Float16 type.
13163 (inline_secondary_memory_needed): No memory is needed for
13164 16bit movement between gpr and sse reg under
13166 (ix86_multiplication_cost): Adjust TARGET_SSE_MATH &&
13167 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
13168 (ix86_division_cost): Ditto.
13169 (ix86_rtx_costs): Ditto.
13170 (ix86_add_stmt_cost): Ditto.
13171 (ix86_optab_supported_p): Ditto.
13172 * config/i386/i386.h (VALID_AVX512F_SCALAR_MODE): Add HFmode.
13173 (SSE_FLOAT_MODE_SSEMATH_OR_HF_P): Add HFmode.
13174 (PTA_SAPPHIRERAPIDS): Add PTA_AVX512FP16.
13175 * config/i386/i386.md (mode): Add HFmode.
13176 (MODE_SIZE): Add HFmode.
13177 (isa): Add avx512fp16.
13178 (enabled): Handle avx512fp16.
13179 (ssemodesuffix): Add sh suffix for HFmode.
13180 (comm): Add mult, div.
13181 (plusminusmultdiv): New code iterator.
13182 (insn): Add mult, div.
13183 (*movhf_internal): Adjust for avx512fp16 instruction.
13184 (*movhi_internal): Ditto.
13185 (*cmpi<unord>hf): New define_insn for HFmode.
13186 (*ieee_s<ieee_maxmin>hf3): Likewise.
13187 (extendhf<mode>2): Likewise.
13188 (trunc<mode>hf2): Likewise.
13189 (float<floatunssuffix><mode>hf2): Likewise.
13190 (*<insn>hf): Likewise.
13191 (cbranchhf4): New expander.
13192 (movhfcc): Likewise.
13193 (<insn>hf3): Likewise.
13194 (mulhf3): Likewise.
13195 (divhf3): Likewise.
13196 * config/i386/i386.opt: Add mavx512fp16.
13197 * config/i386/immintrin.h: Include avx512fp16intrin.h.
13198 * doc/invoke.texi: Add mavx512fp16.
13199 * doc/extend.texi: Add avx512fp16 Usage Notes.
13201 2021-09-08 liuhongt <hongtao.liu@intel.com>
13203 * common.opt: Support -fexcess-precision=16.
13204 * config/aarch64/aarch64.c (aarch64_excess_precision): Return
13205 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when
13206 EXCESS_PRECISION_TYPE_FLOAT16.
13207 * config/arm/arm.c (arm_excess_precision): Ditto.
13208 * config/i386/i386.c (ix86_get_excess_precision): Ditto.
13209 * config/m68k/m68k.c (m68k_excess_precision): Issue an error
13210 when EXCESS_PRECISION_TYPE_FLOAT16.
13211 * config/s390/s390.c (s390_excess_precision): Ditto.
13212 * coretypes.h (enum excess_precision_type): Add
13213 EXCESS_PRECISION_TYPE_FLOAT16.
13214 * doc/tm.texi (TARGET_C_EXCESS_PRECISION): Update documents.
13215 * doc/tm.texi.in (TARGET_C_EXCESS_PRECISION): Ditto.
13216 * doc/extend.texi (Half-Precision): Document
13217 -fexcess-precision=16.
13218 * flag-types.h (enum excess_precision): Add
13219 EXCESS_PRECISION_FLOAT16.
13220 * target.def (excess_precision): Update document.
13221 * tree.c (excess_precision_type): Set excess_precision_type to
13222 EXCESS_PRECISION_FLOAT16 when -fexcess-precision=16.
13224 2021-09-08 liuhongt <hongtao.liu@intel.com>
13226 * doc/extend.texi: (@node Floating Types): Adjust the wording.
13227 (@node Half-Precision): Ditto.
13229 2021-09-07 Takayuki 'January June' Suwa <jjsuwa_sys3175@yahoo.co.jp>
13232 * config/xtensa/xtensa.c (xtensa_emit_move_sequence): Add
13233 'CONST_INT_P (src)' to the condition of the block that tries to
13234 eliminate literal when loading integer constant.
13236 2021-09-07 David Faust <david.faust@oracle.com>
13238 * doc/extend.texi (BPF Type Attributes): New node.
13239 Document new preserve_access_index attribute.
13240 Document new preserve_access_index builtin.
13241 * doc/invoke.texi: Document -mco-re and -mno-co-re options.
13243 2021-09-07 David Faust <david.faust@oracle.com>
13245 * config/bpf/bpf.c: Adjust includes.
13246 (bpf_handle_preserve_access_index_attribute): New function.
13247 (bpf_attribute_table): Use it here.
13248 (bpf_builtins): Add BPF_BUILTIN_PRESERVE_ACCESS_INDEX.
13249 (bpf_option_override): Handle "-mco-re" option.
13250 (bpf_asm_init_sections): New.
13251 (TARGET_ASM_INIT_SECTIONS): Redefine.
13252 (bpf_file_end): New.
13253 (TARGET_ASM_FILE_END): Redefine.
13254 (bpf_init_builtins): Add "__builtin_preserve_access_index".
13255 (bpf_core_compute, bpf_core_get_index): New.
13256 (is_attr_preserve_access): New.
13257 (bpf_expand_builtin): Handle new builtins.
13258 (bpf_core_newdecl, bpf_core_is_maybe_aggregate_access): New.
13259 (bpf_core_walk): New.
13260 (bpf_resolve_overloaded_builtin): New.
13261 (TARGET_RESOLVE_OVERLOADED_BUILTIN): Redefine.
13262 (handle_attr): New.
13263 (pass_bpf_core_attr): New RTL pass.
13264 * config/bpf/bpf-passes.def: New file.
13265 * config/bpf/bpf-protos.h (make_pass_bpf_core_attr): New.
13266 * config/bpf/coreout.c: New file.
13267 * config/bpf/coreout.h: Likewise.
13268 * config/bpf/t-bpf (TM_H): Add $(srcdir)/config/bpf/coreout.h.
13269 (coreout.o): New rule.
13270 (PASSES_EXTRA): Add $(srcdir)/config/bpf/bpf-passes.def.
13271 * config.gcc (bpf): Add coreout.h to extra_headers.
13272 Add coreout.o to extra_objs.
13273 Add $(srcdir)/config/bpf/coreout.c to target_gtfiles.
13275 2021-09-07 David Faust <david.faust@oracle.com>
13277 * btfout.c (get_btf_id): Function is no longer static.
13278 * ctfc.h: Expose it here.
13280 2021-09-07 David Faust <david.faust@oracle.com>
13282 * ctfc.c (ctf_lookup_tree_type): New function.
13283 * ctfc.h: Likewise.
13285 2021-09-07 David Faust <david.faust@oracle.com>
13287 * ctfc.c (ctf_dtd_lookup): Function is no longer static.
13288 * ctfc.h: Analogous change.
13290 2021-09-07 David Faust <david.faust@oracle.com>
13292 * dwarf2out.c (lookup_type_die): Function is no longer static.
13293 * dwarf2out.h: Expose it here.
13295 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
13297 * dwarf2ctf.c (ctf_debug_finalize): Make it static.
13298 (ctf_debug_early_finish): New definition.
13299 (ctf_debug_finish): Likewise.
13300 * dwarf2ctf.h (ctf_debug_finalize): Remove declaration.
13301 (ctf_debug_early_finish): New declaration.
13302 (ctf_debug_finish): Likewise.
13303 * dwarf2out.c (dwarf2out_finish): Invoke ctf_debug_finish.
13304 (dwarf2out_early_finish): Invoke ctf_debug_early_finish.
13306 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
13308 * config/bpf/bpf.c (bpf_option_override): For BPF backend, disable LTO
13309 support when compiling for CO-RE.
13310 * config/bpf/bpf.opt: Add new command line option -mco-re.
13312 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
13314 * flag-types.h (enum debug_info_type): Add new enum
13315 DINFO_TYPE_BTF_WITH_CORE.
13316 (BTF_WITH_CORE_DEBUG): New bitmask.
13317 * flags.h (btf_with_core_debuginfo_p): New declaration.
13318 * opts.c (btf_with_core_debuginfo_p): New definition.
13320 2021-09-07 Jason Merrill <jason@redhat.com>
13322 * tree.h (error_operand_p): Change to inline function.
13324 2021-09-07 Aldy Hernandez <aldyh@redhat.com>
13326 * tree-ssa-threadedge.c (forwarder_block_p): Rename to...
13327 (empty_block_with_phis_p): ...this.
13328 (potentially_threadable_block): Same.
13329 (jump_threader::thread_through_normal_block): Same.
13331 2021-09-07 Eric Botcazou <ebotcazou@adacore.com>
13334 * dwarf2out.c (mark_base_types): New overloaded function.
13335 (dwarf2out_early_finish): Invoke it on the COMDAT type list as well
13336 as the compilation unit, and call move_marked_base_types afterward.
13338 2021-09-07 H.J. Lu <hjl.tools@gmail.com>
13341 * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse):
13343 (ix86_expand_vector_convert_uns_vsivsf): Likewise.
13345 2021-09-07 Richard Biener <rguenther@suse.de>
13347 PR tree-optimization/102226
13348 * tree-vect-loop.c (vect_transform_cycle_phi): Record
13349 the converted value for the epilogue PHI use.
13351 2021-09-07 Martin Liska <mliska@suse.cz>
13353 PR gcov-profile/80223
13354 * ipa-inline.c (can_inline_edge_p): Similarly to sanitizer
13355 options, do not inline when no_profile_instrument_function
13356 attributes are different in early inliner. It's fine to inline
13357 it after PGO instrumentation.
13359 2021-09-07 Richard Biener <rguenther@suse.de>
13361 PR tree-optimization/101555
13362 * tree-ssa-pre.c (translate_vuse_through_block): Do not
13363 perform an alias walk to determine the validity of the
13364 mem at the start of the block which is already guaranteed
13365 by means of prune_clobbered_mems.
13366 (phi_translate_1): Pass edge to translate_vuse_through_block.
13368 2021-09-07 Xionghu Luo <luoxhu@linux.ibm.com>
13371 * config/rs6000/rs6000.md (fmod<mode>3): New define_expand.
13372 (remainder<mode>3): Likewise.
13374 2021-09-07 YunQiang Su <yunqiang.su@cipunited.com>
13376 * config/mips/mips.c (mips_file_start): add .module for
13379 2021-09-06 Roger Sayle <roger@nextmovesoftware.com>
13381 * wide-int.cc (wi::clz): Reorder tests to ensure the result
13382 is zero for all negative values.
13384 2021-09-06 Tobias Burnus <tobias@codesourcery.com>
13386 * doc/invoke.texi (-foffload-options): Fix @opindex.
13388 2021-09-06 H.J. Lu <hjl.tools@gmail.com>
13391 * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2].
13392 * config/i386/i386.md (@xorsign<mode>3_1): Add non-destructive
13393 source alternative for AVX.
13395 2021-09-06 liuhongt <hongtao.liu@intel.com>
13397 PR middle-end/102182
13398 * optabs.c (expand_fix): Add from1 to avoid from being
13401 2021-09-06 Eric Botcazou <ebotcazou@adacore.com>
13403 * dwarf2out.c (modified_type_die): Deal with all array types earlier
13404 and use local variable consistently throughout the function.
13406 2021-09-06 Jakub Jelinek <jakub@redhat.com>
13408 PR tree-optimization/102207
13409 * match.pd: Don't demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
13410 were promoted from signed to wider unsigned type.
13412 2021-09-06 Andrew Pinski <apinski@marvell.com>
13414 PR tree-optimization/63184
13415 * match.pd: Add simplification of pointer_diff of two pointer_plus
13416 with addr_expr in the first operand of each pointer_plus.
13417 Add simplification of ne/eq of two pointer_plus with addr_expr
13418 in the first operand of each pointer_plus.
13420 2021-09-06 Richard Biener <rguenther@suse.de>
13422 PR tree-optimization/102176
13423 * tree-vect-slp.c (vect_slp_gather_vectorized_scalar_stmts):
13425 (vect_bb_slp_scalar_cost): Use the computed set of
13426 vectorized scalar stmts instead of relying on the out-of-date
13427 and not accurate PURE_SLP_STMT.
13428 (vect_bb_vectorization_profitable_p): Compute the set
13429 of vectorized scalar stmts.
13431 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
13433 * gimple-range-path.cc (path_range_query::range_of_stmt): Remove
13434 GIMPLE_COND special casing.
13435 (path_range_query::range_defined_in_block): Use range_of_stmt
13436 instead of calling fold_range directly.
13438 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
13440 * gimple-range-path.cc (path_range_query::range_of_expr): Set
13441 m_undefined_path when appropriate.
13442 (path_range_query::internal_range_of_expr): Copy from range_of_expr.
13443 (path_range_query::unreachable_path_p): New.
13444 (path_range_query::precompute_ranges): Set m_undefined_path.
13445 * gimple-range-path.h (path_range_query::unreachable_path_p): New.
13446 (path_range_query::internal_range_of_expr): New.
13447 * tree-ssa-threadbackward.c (back_threader::find_taken_edge_cond):
13448 Use unreachable_path_p.
13450 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
13452 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
13453 Remove argument and call find_taken_edge.
13454 (back_threader::resolve_phi): Do not calculate taken edge before
13455 calling maybe_register_path.
13456 (back_threader::find_paths_to_names): Same.
13458 2021-09-05 Jeff Law <jlaw@localhost.localdomain>
13460 * config/h8300/h8300.md (QHSI2 mode iterator): New mode iterator.
13461 * config/h8300/testcompare.md (store_c): Update name, use new
13463 (store_neg_c, store_shifted_c): New patterns.
13465 2021-09-03 Segher Boessenkool <segher@kernel.crashing.org>
13468 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): On ELFv2 use r11
13469 instead of r12 for CR save, in all cases.
13471 2021-09-03 Andrew Pinski <apinski@marvell.com>
13473 * config/aarch64/aarch64-sve-builtins.cc (register_vector_type):
13474 Handle error_mark_node as the type of the type_decl.
13476 2021-09-03 Andrew Pinski <apinski@marvell.com>
13478 * config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
13480 (aarch64_simd_types): Likewise.
13481 (aarch64_simd_intOI_type_node): Likewise.
13482 (aarch64_simd_intCI_type_node): Likewise.
13483 (aarch64_simd_intXI_type_node): Likewise.
13484 * config/aarch64/aarch64.h (aarch64_fp16_type_node): Likewise.
13485 (aarch64_fp16_ptr_type_node): Likewise.
13486 (aarch64_bf16_type_node): Likewise.
13487 (aarch64_bf16_ptr_type_node): Likewise.
13489 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13491 * range-op.cc (operator_minus::op1_op2_relation_effect): Abstract
13493 (minus_op1_op2_relation_effect): ...here.
13494 (class operator_pointer_diff): New.
13495 (operator_pointer_diff::op1_op2_relation_effect): Call
13496 minus_op1_op2_relation_effect.
13497 (integral_table::integral_table): Add entry for POINTER_DIFF_EXPR.
13499 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13501 * tree-ssa-threadbackward.c (back_threader::thread_through_all_blocks):
13502 Add may_peel_loop_headers.
13503 (back_threader_registry::thread_through_all_blocks): Same.
13504 (try_thread_blocks): Pass may_peel_loop_headers argument.
13505 (pass_early_thread_jumps::execute): Same.
13507 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13509 * tree-ssa-threadedge.c (has_phis_p): New.
13510 (forwarder_block_p): New.
13511 (potentially_threadable_block): Call forwarder_block_p.
13512 (jump_threader::thread_around_empty_blocks): Call has_phis_p.
13513 (jump_threader::thread_through_normal_block): Call
13516 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13518 * tree-ssa-threadbackward.c (back_threader::dump): New.
13519 (back_threader::debug): New.
13520 (back_threader_profitability::profitable_path_p): Dump blocks
13521 even if we are bailing early.
13523 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13525 * tree-ssa-threadupdate.c (cancel_thread): New.
13526 (jump_thread_path_registry::thread_block_1): Use cancel_thread.
13527 (jump_thread_path_registry::mark_threaded_blocks): Same.
13528 (jump_thread_path_registry::register_jump_thread): Same.
13530 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13532 * tree-ssa-threadedge.c (jt_state::push): Only call methods for
13533 which objects are available.
13534 (jt_state::pop): Same.
13535 (jt_state::register_equiv): Same.
13536 (jt_state::register_equivs_on_edge): Same.
13538 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13540 * tree-ssa-threadedge.c (jump_threader::thread_across_edge):
13541 Move pop until after a thread is registered.
13543 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13545 * tree-ssa-threadupdate.c (debug): New.
13547 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13549 * gimple-range-trace.cc (push_dump_file::push_dump_file): New.
13550 (push_dump_file::~push_dump_file): New.
13551 (dump_ranger): Change dump_file temporarily while dumping
13553 * gimple-range-trace.h (class push_dump_file): New.
13555 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13557 * gimple-range-trace.cc (debug_seed_ranger): Remove static.
13558 (dump_ranger): Dump function name.
13560 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13562 * gimple-range-path.cc (path_range_query::range_defined_in_block):
13563 Adjust for non-null.
13564 (path_range_query::adjust_for_non_null_uses): New.
13565 (path_range_query::precompute_ranges): Call
13566 adjust_for_non_null_uses.
13567 * gimple-range-path.h: Add m_non_null and
13568 adjust_for_non_null_uses.
13570 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13572 * gimple-range-path.cc (path_range_query::dump): Dump path
13574 (path_range_query::precompute_ranges): Dump entire path.
13576 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13578 * value-relation.cc (relation_oracle::debug): New.
13579 * value-relation.h (relation_oracle::debug): New.
13581 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13583 * tree-ssa-loop-ch.c: Remove unnecessary include file.
13585 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13587 * gimple-range-fold.cc (fold_using_range::postfold_gcond_edges):
13588 Skip statements with no defining BB.
13589 * gimple-range-path.cc (path_range_query::range_defined_in_block):
13590 Do not get confused by statements with no defining BB.
13592 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13594 * gimple-range-fold.cc (adjust_imagpart_expr): Move from
13595 gimple_range_adjustment. Add support for constants.
13596 (adjust_realpart_expr): New.
13597 (gimple_range_adjustment): Move IMAGPART_EXPR code to
13598 adjust_imagpart_expr.
13599 * range-op.cc (integral_table::integral_table): Add entry for
13602 2021-09-03 Jakub Jelinek <jakub@redhat.com>
13604 * omp-expand.c (expand_omp_atomic_pipeline): Use
13605 IFN_ATOMIC_COMPARE_EXCHANGE instead of
13606 BUILT_IN_SYNC_VAL_COMPARE_AND_SWAP_? so that memory order
13609 2021-09-03 Jakub Jelinek <jakub@redhat.com>
13612 * tree.h (DECL_FIELD_ABI_IGNORED): Changed into rvalue only macro
13613 that is false if DECL_BIT_FIELD.
13614 (SET_DECL_FIELD_ABI_IGNORED, DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD,
13615 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD): Define.
13616 * tree-streamer-out.c (pack_ts_decl_common_value_fields): For
13617 DECL_BIT_FIELD stream DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead
13618 of DECL_FIELD_ABI_IGNORED.
13619 * tree-streamer-in.c (unpack_ts_decl_common_value_fields): Use
13620 SET_DECL_FIELD_ABI_IGNORED instead of writing to
13621 DECL_FIELD_ABI_IGNORED and for DECL_BIT_FIELD use
13622 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead.
13623 * lto-streamer-out.c (hash_tree): For DECL_BIT_FIELD hash
13624 DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead of DECL_FIELD_ABI_IGNORED.
13626 2021-09-03 liuhongt <hongtao.liu@intel.com>
13629 * config/i386/amxbf16intrin.h: Remove macro check for __AMX_BF16__.
13630 * config/i386/amxint8intrin.h: Remove macro check for __AMX_INT8__.
13631 * config/i386/amxtileintrin.h: Remove macro check for __AMX_TILE__.
13633 2021-09-02 Martin Sebor <msebor@redhat.com>
13635 PR tree-optimization/17506
13637 * tree-ssa-uninit.c (warn_uninit): Remove conditional guarding note.
13639 2021-09-02 Richard Biener <rguenther@suse.de>
13641 * tree-ssa-loop-im.c (fill_always_executed_in_1): Refine
13642 fix for PR78185 and continue processing when leaving
13643 finite inner loops.
13645 2021-09-02 Jakub Jelinek <jakub@redhat.com>
13647 PR tree-optimization/99591
13648 * match.pd: Demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
13651 2021-09-02 Richard Biener <rguenther@suse.de>
13654 2021-09-02 Richard Biener <rguenther@suse.de>
13656 PR tree-optimization/102155
13657 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
13658 over a part of the RPO array and do not recurse here.
13659 Dump blocks marked as always executed.
13660 (fill_always_executed_in): Walk over the RPO array and
13661 process loops whose header we run into.
13662 (loop_invariant_motion_in_fun): Compute the first RPO
13663 using rev_post_order_and_mark_dfs_back_seme in iteration
13664 order and pass that to fill_always_executed_in.
13666 2021-09-02 liuhongt <hongtao.liu@intel.com>
13668 * config/i386/i386-modes.def (FLOAT_MODE): Define ieee HFmode.
13669 * config/i386/i386.c (enum x86_64_reg_class): Add
13670 X86_64_SSEHF_CLASS.
13671 (merge_classes): Handle X86_64_SSEHF_CLASS.
13672 (examine_argument): Ditto.
13673 (construct_container): Ditto.
13674 (classify_argument): Ditto, and set HFmode/HCmode to
13675 X86_64_SSEHF_CLASS.
13676 (function_value_32): Return _Float16/Complex Float16 by
13678 (function_value_64): Return _Float16/Complex Float16 by SSE
13680 (ix86_print_operand): Handle CONST_DOUBLE HFmode.
13681 (ix86_secondary_reload): Require gpr as intermediate register
13682 to store _Float16 from sse register when sse4 is not
13684 (ix86_libgcc_floating_mode_supported_p): Enable _Float16 under
13686 (ix86_scalar_mode_supported_p): Ditto.
13687 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Defined.
13688 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add HFmode.
13689 (VALID_INT_MODE_P): Add HFmode and HCmode.
13690 * config/i386/i386.md (*pushhf_rex64): New define_insn.
13692 (*movhf_internal): Ditto.
13693 * doc/extend.texi (Half-Precision Floating Point): Document
13696 2021-09-02 Richard Biener <rguenther@suse.de>
13698 PR tree-optimization/102155
13699 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
13700 over a part of the RPO array and do not recurse here.
13701 Dump blocks marked as always executed.
13702 (fill_always_executed_in): Walk over the RPO array and
13703 process loops whose header we run into.
13704 (loop_invariant_motion_in_fun): Compute the first RPO
13705 using rev_post_order_and_mark_dfs_back_seme in iteration
13706 order and pass that to fill_always_executed_in.
13708 2021-09-02 YunQiang Su <syq@debian.org>
13711 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13713 * config/mips/mips.c (mips_module_isa_name): New.
13714 (mips_file_start): Add .module mipsREV to all asm output.
13716 2021-09-01 Jeff Law <jlaw@localhost.localdomain>
13718 PR tree-optimization/102152
13719 * tree-ssa-dom.c (dom_opt_dom_walker::optimize_stmt): Reduce a vector
13720 comparison to a scalar comparison before calling
13721 update_stmt_if_modified.
13723 2021-09-01 Andrew Pinski <apinski@marvell.com>
13726 * config/aarch64/aarch64.c (aarch64_expand_setmem):
13727 Check STRICT_ALIGNMENT before creating an overlapping
13730 2021-09-01 Martin Sebor <msebor@redhat.com>
13732 * gimple-ssa-warn-access.cc (get_size_range): Add argument.
13733 (check_access): Pass additional argument.
13734 (check_memop_access): Remove template and make a member function.
13735 (maybe_check_dealloc_call): Make a pass_waccess member function.
13736 (class pass_waccess): Add, rename, and remove members.
13737 (pass_waccess::pass_waccess): Adjust to name change.
13738 (pass_waccess::~pass_waccess): Same.
13739 (check_alloca): Make a member function.
13740 (check_alloc_size_call): Same.
13741 (check_strcat): Same.
13742 (check_strncat): Same.
13743 (check_stxcpy): Same.
13744 (check_stxncpy): Same.
13745 (check_strncmp): Same.
13746 (maybe_warn_rdwr_sizes): Rename...
13747 (pass_waccess::maybe_check_access_sizes): ...to this.
13748 (pass_waccess::check_call): Adjust to name changes.
13749 (pass_waccess::maybe_check_dealloc_call): Make a pass_waccess member
13751 (pass_waccess::execute): Adjust to name changes.
13752 * gimple-ssa-warn-access.h (check_memop_access): Remove.
13753 * pointer-query.cc (access_ref::phi): Handle null pointer.
13754 (access_ref::inform_access): Same.
13755 (pointer_query::put_ref): Modify a cached value, not a copy of it.
13756 (pointer_query::dump): New function.
13757 (compute_objsize_r): Avoid overwriting access_ref::bndrng. Cache
13759 * pointer-query.h (pointer_query::dump): Declare.
13760 * tree-ssa-strlen.c (get_range): Simplify. Use function query.
13761 (dump_strlen_info): Use function query.
13762 (printf_strlen_execute): Factor code out into pointer_query::put_ref.
13764 2021-09-01 Thomas Schwinge <thomas@codesourcery.com>
13766 * tree.c (walk_tree_1) <OMP_CLAUSE>: Simplify.
13768 2021-09-01 Iain Sandoe <iain@sandoe.co.uk>
13770 * doc/extend.texi: Document unavailable attribute.
13771 * print-tree.c (print_node): Handle unavailable attribute.
13772 * tree-core.h (struct tree_base): Add a bit to carry unavailability.
13773 * tree.c (error_unavailable_use): New.
13774 * tree.h (TREE_UNAVAILABLE): New.
13775 (error_unavailable_use): New.
13777 2021-09-01 Jakub Jelinek <jakub@redhat.com>
13779 PR tree-optimization/102124
13780 * tree-vect-patterns.c (vect_recog_widen_op_pattern): For ORIG_CODE
13781 MINUS_EXPR, if itype is unsigned with smaller precision than type,
13782 add an extra cast to signed variant of itype to ensure sign-extension.
13784 2021-09-01 Martin Liska <mliska@suse.cz>
13786 * graph.c (draw_cfg_node_succ_edges): Do not color fallthru
13787 edges and rather use colors for TRUE and FALSE edges.
13789 2021-09-01 Richard Biener <rguenther@suse.de>
13791 PR tree-optimization/93491
13792 * tree-ssa-pre.c (compute_avail): Set BB_MAY_NOTRETURN
13793 after processing the stmt itself. Do not consider
13794 pure functions possibly not returning. Properly avoid
13795 adding possibly trapping calls to EXP_GEN when there's
13796 a preceding possibly not returning call.
13797 * tree-ssa-sccvn.c (vn_reference_may_trap): Conservatively
13800 2021-09-01 Richard Biener <rguenther@suse.de>
13802 PR tree-optimization/102139
13803 * tree-vectorizer.h (vec_base_alignments): Adjust hash-map
13804 type to record a std::pair of the stmt-info and the innermost
13806 (dr_vec_info::group): New member.
13807 * tree-vect-data-refs.c (vect_record_base_alignment): Adjust.
13808 (vect_compute_data_ref_alignment): Verify the recorded
13809 base alignment can be used.
13810 (data_ref_pair): Remove.
13811 (dr_group_sort_cmp): Adjust.
13812 (vect_analyze_data_ref_accesses): Store the group-ID in the
13813 dr_vec_info and operate on a vector of dr_vec_infos.
13815 2021-09-01 YunQiang Su <yunqiang.su@cipunited.com>
13817 * read-md.c (md_reader::handle_enum): Support value assignment.
13818 * doc/md.texi: Record define_c_enum value assignment support.
13820 2021-09-01 Jakub Jelinek <jakub@redhat.com>
13822 PR tree-optimization/102141
13823 * gimple-ssa-store-merging.c (bswap_view_convert): Add BEFORE
13824 argument. If false, emit stmts after gsi instead of before, and
13826 (bswap_replace): Adjust callers. When converting output of bswap,
13827 emit VIEW_CONVERT preparation stmts after a copy of gsi instead
13830 2021-09-01 liuhongt <hongtao.liu@intel.com>
13832 * emit-rtl.c (validate_subreg): Get rid of all float-int
13835 2021-09-01 liuhongt <hongtao.liu@intel.com>
13838 2021-08-30 liuhongt <hongtao.liu@intel.com>
13840 * expmed.c (extract_bit_field_1): Make sure we're playing with
13841 integral modes before calling extract_integral_bit_field.
13842 (extract_integral_bit_field): Add a parameter of type
13843 scalar_int_mode which corresponds to tmode.
13844 And call extract_and_convert_fixed_bit_field instead of
13845 extract_fixed_bit_field and convert_extracted_bit_field.
13846 (extract_and_convert_fixed_bit_field): New function, it's a
13847 combination of extract_fixed_bit_field and
13848 convert_extracted_bit_field.
13850 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
13852 * tree.c (walk_tree_1) <OMP_CLAUSE_TILE>: Handle three operands.
13854 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
13856 * omp-general.h (omp_is_reference): Rename to...
13857 (omp_privatize_by_reference): ... this. Adjust all users...
13858 * omp-general.c: ... here, ...
13859 * gimplify.c: ... here, ...
13860 * omp-expand.c: ... here, ...
13861 * omp-low.c: ... here.
13863 2021-08-31 Martin Sebor <msebor@redhat.com>
13865 * gimple-ssa-warn-access.cc (maybe_warn_alloc_args_overflow): Test
13866 pointer element for equality to zero, not that of the containing
13869 2021-08-31 Martin Sebor <msebor@redhat.com>
13871 * gcc-rich-location.h (gcc_rich_location): Make ctor explicit.
13873 2021-08-31 Martin Sebor <msebor@redhat.com>
13875 * function.h (function): Add comments.
13876 (get_range_query): Same. Add attribute returns nonnull.
13878 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13880 * expr.c (convert_modes): Don't use subreg_promoted_mode on a
13881 SUBREG if it can't be guaranteed to have SUBREG_PROMOTED_VAR_P set.
13882 Instead use the standard (safer) is_a <scalar_int_mode> idiom.
13884 2021-08-31 Jeff Law <jlaw@localhost.localdomain>
13886 * config.gcc (cris-*-elf, cris-*-none): Remove dbxelf.h from
13888 (m32r-*-elf, m32rle-*-elf, m32r-*-linux): Likewise.
13889 (mn10300-*-*, am33_2.0-*-linux*): Likewise.
13890 (xtensa*-*-elf, xtensa*-*-linux, xtensa*-*-uclinux): Likewise.
13891 (m32c-*-elf*, m32c-*-rtems*): Likewise.
13892 * config/cris/cris.h (DBX_NO_XREFS): Remove.
13893 (DBX_CONTIN_LENGTH, DBX_CONTIN_CHAR): Likewise.
13894 * config/m32r/m32r.h (DBXOUT_SOURCE_LINE): Likewise.
13895 (DBX_DEBUGGING_INFO, DBX_CONTIN_LENGTH): Likewise.
13896 * config/mn10300/mn10300.h (DEFAULT_GDB_EXTENSIONS): Likewise.
13897 * config/mn10300/linux.h (DBX_REGISTER_NAMES): Likewise.
13899 2021-08-31 Marcel Vollweiler <marcel@codesourcery.com>
13901 * gimplify.c (gimplify_scan_omp_clauses): Error handling. 'ancestor' only
13902 allowed on target constructs and only with particular other clauses.
13903 * omp-expand.c (expand_omp_target): Output of 'sorry, not supported' if
13904 'ancestor' is used.
13905 * omp-low.c (check_omp_nesting_restrictions): Error handling. No nested OpenMP
13906 structs when 'ancestor' is used.
13907 (scan_omp_1_stmt): No usage of OpenMP runtime routines in a target region when
13908 'ancestor' is used.
13909 * tree-pretty-print.c (dump_omp_clause): Append 'ancestor'.
13910 * tree.h (OMP_CLAUSE_DEVICE_ANCESTOR): Define macro.
13912 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13914 * expr.c (convert_modes): Preserve SUBREG_PROMOTED_VAR_P when
13915 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
13917 * simplify-rtx.c (simplify_unary_operation_1) [SIGN_EXTEND]:
13918 Likewise, preserve SUBREG_PROMOTED_VAR_P when creating a (wider)
13919 partial subreg from a SUBREG_PROMOTED_VAR_P subreg. Generate
13920 SIGN_EXTEND of the SUBREG_REG when a subreg would be paradoxical.
13921 [ZERO_EXTEND]: Likewise, preserve SUBREG_PROMOTED_VAR_P when
13922 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
13923 subreg. Generate ZERO_EXTEND of the SUBREG_REG when a subreg
13924 would be paradoxical.
13926 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13928 * combine.c (combine_simplify_rtx): Avoid converting an explicit
13929 TRUNCATE into a lowpart SUBREG on !TRULY_NOOP_TRUNCATION targets.
13930 * simplify-rtx.c (simplify_unary_operation_1): Likewise.
13932 2021-08-31 Richard Biener <rguenther@suse.de>
13934 PR tree-optimization/102142
13935 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Fix
13936 condition under which to unset the visited flag.
13938 2021-08-31 Richard Biener <rguenther@suse.de>
13940 PR middle-end/102129
13941 * tree-ssa-ter.c (find_replaceable_in_bb): Do not move
13942 possibly trapping expressions across calls.
13944 2021-08-31 Jakub Jelinek <jakub@redhat.com>
13946 PR tree-optimization/102134
13947 * tree-ssa-ccp.c (bit_value_binop) <case RSHIFT_EXPR>: If sgn is
13948 UNSIGNED and r1val | r1mask has MSB set, ensure lzcount doesn't
13951 2021-08-31 Andrew Pinski <apinski@marvell.com>
13954 * collect-utils.c (setup_signals): New declaration.
13955 * collect-utils.h (setup_signals): New function.
13956 * collect2.c (handler): Delete.
13957 (main): Instead of manually setting up the signals,
13958 just call setup_signals.
13959 * lto-wrapper.c (main): Likewise.
13961 2021-08-31 Andrew Pinski <apinski@marvell.com>
13964 * config/i386/i386-protos.h (x86_output_aligned_bss):
13965 Change align argument to unsigned type.
13966 (x86_elf_aligned_decl_common): Likewise.
13967 * config/i386/i386.c (x86_elf_aligned_decl_common): Likewise.
13968 (x86_output_aligned_bss): Likewise.
13970 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13972 * config/mips/mips.c (mips_module_isa_name): New.
13973 (mips_file_start): Add .module mipsREV to all asm output.
13975 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13977 * config/mips/mips.h (struct mips_cpu_info): Define enum mips_isa;
13978 use enum instead of int for 'isa' member.
13979 * config.gcc, config/mips/mips.c, config/mips/mips-cpus.def,
13980 config/mips/netbsd.h: Replace hardcoded numbers with enum.
13982 2021-08-31 liuhongt <hongtao.liu@intel.com>
13984 * config/i386/sse.md (*<avx512>_ucmp<mode>3_1): Change from
13985 define_split to define_insn_and_split.
13986 (*avx2_eq<mode>3): Removed.
13987 (<avx512>_eq<mode>3<mask_scalar_merge_name>): Adjust pattern
13988 (<avx512>_eq<mode>3<mask_scalar_merge_name>_1): Rename to ..
13989 (*<avx512>_eq<mode>3<mask_scalar_merge_name>_1): .. this, and
13991 (*avx2_gt<mode>3): Removed.
13992 (<avx512>_gt<mode>3<mask_scalar_merge_name>): Change from
13993 define_insn to define_expand, and adjust pattern.
13994 (UNSPEC_MASKED_EQ, UNSPEC_MASKED_GT): Removed.
13996 2021-08-30 David Malcolm <dmalcolm@redhat.com>
13999 * Makefile.in (ANALYZER_OBJS): Add analyzer/call-info.o.
14001 2021-08-30 Jason Merrill <jason@redhat.com>
14003 * doc/invoke.texi: Document -Wmissing-requires.
14005 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
14007 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Remove
14008 TARGET_EXTRA_BUILTINS guard.
14010 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
14012 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Change
14013 initialization of V2DI_type_node and unsigned_V2DI_type_node.
14015 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
14017 * config/rs6000/darwin.h (SUBTARGET_INIT_BUILTINS): Use the new
14018 decl when new_builtins_are_live.
14019 * config/rs6000/rs6000-builtin-new.def (__builtin_cfstring): New
14022 2021-08-30 Pat Haugen <pthaugen@linux.ibm.com>
14024 * config/rs6000/rs6000-cpus.def (ISA_3_1_MASKS_SERVER): Add
14025 OPTION_MASK_P10_FUSION_2STORE.
14026 (POWERPC_MASKS): Likewise.
14027 * config/rs6000/rs6000.c (rs6000_option_override_internal): Enable
14028 store fusion for Power10.
14029 (is_fusable_store): New.
14030 (power10_sched_reorder): Likewise.
14031 (rs6000_sched_reorder): Do Power10 specific reordering.
14032 (rs6000_sched_reorder2): Likewise.
14033 * config/rs6000/rs6000.opt: Add new option.
14035 2021-08-30 Richard Biener <rguenther@suse.de>
14037 PR tree-optimization/102128
14038 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
14039 Move scanning for if-converted scalar code to the caller
14040 and instead delay clearing the visited flag for profitable
14042 (vect_slp_region): Cost all subgraphs before scheduling.
14043 For if-converted BB vectorization scan for scalar COND_EXPRs
14044 and do not vectorize if any found and the cost model is
14047 2021-08-30 Richard Biener <rguenther@suse.de>
14049 * common.opt (fexceptions): Mark
14050 EnabledBy(fnon-call-exceptions).
14051 * doc/invoke.texi (fnon-call-exceptions): Document this
14052 enables -fexceptions.
14054 2021-08-30 Sebastian Huber <sebastian.huber@embedded-brains.de>
14056 * tsystem.h (abort): Define abort() if inhibit_libc is defined and it
14057 is not already defined.
14059 2021-08-30 liuhongt <hongtao.liu@intel.com>
14061 * expmed.c (extract_bit_field_1): Make sure we're playing with
14062 integral modes before calling extract_integral_bit_field.
14063 (extract_integral_bit_field): Add a parameter of type
14064 scalar_int_mode which corresponds to tmode.
14065 And call extract_and_convert_fixed_bit_field instead of
14066 extract_fixed_bit_field and convert_extracted_bit_field.
14067 (extract_and_convert_fixed_bit_field): New function, it's a
14068 combination of extract_fixed_bit_field and
14069 convert_extracted_bit_field.
14071 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
14073 * config/darwin.c (darwin_libc_has_function): Do not run
14074 the checks for x86 or modern Darwin. Make sure that there
14075 is a value set for darwin_macosx_version_min before testing.
14077 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
14079 * config/i386/darwin.h (CLEAR_INSN_CACHE): New.
14081 2021-08-28 Jan Hubicka <hubicka@ucw.cz>
14083 * ipa-modref-tree.h (modref_access_node::merge): Break out
14084 logic combining offsets and logic merging ranges to ...
14085 (modref_access_node::combined_offsets): ... here
14086 (modref_access_node::update2): ... here
14087 (modref_access_node::closer_pair_p): New member function.
14088 (modref_access_node::forced_merge): New member function.
14089 (modref_ref_node::insert): Do merging when table is full.
14091 2021-08-28 YunQiang Su <yunqiang.su@cipunited.com>
14094 * config.gcc: MIPS: use N64 ABI by default if the triple ends
14095 with -gnuabi64, which is used by Debian since 2013.
14097 2021-08-28 Alexandre Oliva <oliva@adacore.com>
14099 * ipa-modref.c (analyze_function): Skip debug stmts.
14100 * tree-inline.c (estimate_num_insn): Consider builtins even
14101 without a cgraph_node.
14103 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
14105 * config/h8300/bitfield.md (cstore<mode>4): Remove expander.
14106 * config/h8300/h8300.c (h8300_expand_branch): Remove function.
14107 * config/h8300/h8300-protos.h (h8300_expand_branch): Remove prototype.
14108 * config/h8300/h8300.md (eqne): New code iterator.
14109 (geultu, geultu_to_c): Similarly.
14110 * config/h8300/testcompare.md (cstore<mode>4): Dummy expander.
14111 (store_c_<mode>, store_c_i_<mode>): New define_insn_and_splits.
14112 (cmp<mode>_c): New pattern.
14114 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
14116 * tree-ssa-dom.c (reduce_vector_comparison_to_scalar_comparison): New
14118 (dom_opt_dom_walker::optimize_stmt): Use it.
14120 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
14122 * config/darwin.c (finalize_ctors): Add a section-start linker-
14124 (finalize_dtors): Likewise.
14125 * config/darwin.h (MIN_LD64_INIT_TERM_START_LABELS): New.
14127 2021-08-27 Bill Schmidt <wschmidt@linux.ibm.com>
14129 * config/rs6000/rs6000-call.c (rs6000-builtins.h): New #include.
14130 (rs6000_init_builtins): Call rs6000_init_generated_builtins. Skip the
14131 old initialization logic when new builtins are enabled.
14132 * config/rs6000/rs6000-gen-builtins.c (write_decls): Rename
14133 rs6000_autoinit_builtins to rs6000_init_generated_builtins.
14134 (write_init_file): Likewise.
14136 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
14138 * configure.ac (darwin2[[0-9]]* | darwin19*): Alter use of
14139 gcc_GAS_CHECK_FEATURE to remove an extraneous parameter.
14140 (amdgcn-* | gcn-*): Likewise.
14142 2021-08-27 Anthony Sharp <anthonysharp15@gmail.com>
14144 * symbol-summary.h: Added missing template keyword.
14146 2021-08-27 Richard Biener <rguenther@suse.de>
14148 PR tree-optimization/45178
14149 * tree-ssa-dce.c (find_obviously_necessary_stmts): For
14150 infinite loops without exit do not mark control dependent
14151 edges of the latch necessary.
14153 2021-08-27 konglin1 <lingling.kong@intel.com>
14156 * config/i386/sse.md (<avx512>scattersi<mode>): Add mask operand to
14158 (<avx512>scattersi<mode>): Likewise.
14159 (*avx512f_scattersi<VI48F:mode>): Merge mask operand to set_dest.
14160 (*avx512f_scatterdi<VI48F:mode>): Likewise.
14162 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
14164 * config/rs6000/rs6000.c (rs6000_builtin_md_vectorized_function): Add
14165 support for built-in functions MISC_BUILTIN_DIVWE, MISC_BUILTIN_DIVWEU,
14166 MISC_BUILTIN_DIVDE, MISC_BUILTIN_DIVDEU, P10_BUILTIN_CFUGED,
14167 P10_BUILTIN_CNTLZDM, P10_BUILTIN_CNTTZDM, P10_BUILTIN_PDEPD and
14168 P10_BUILTIN_PEXTD on Power10.
14170 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
14172 * config/rs6000/rs6000-call.c (builtin_function_type): Add unsigned
14173 signedness for some Power10 bifs.
14175 2021-08-27 David Edelsohn <dje.gcc@gmail.com>
14178 * config/rs6000/rs6000.c (rs6000_adjust_field_align): Use
14179 computed alignment if the entire struct has attribute packed.
14181 2021-08-27 liuhongt <hongtao.liu@intel.com>
14185 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
14186 IX86_BUILTIN_SHUFPD512, IX86_BUILTIN_SHUFPS512,
14187 IX86_BUILTIN_SHUFPD256, IX86_BUILTIN_SHUFPS,
14188 IX86_BUILTIN_SHUFPS256.
14189 (ix86_masked_all_ones): New function.
14191 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
14193 * config/i386/i386.md (*btr<mode>_1): Call force_reg unconditionally.
14194 (conditional moves with memory inputs splitters): Ditto.
14195 * config/i386/sse.md (one_cmpl<mode>2): Simplify.
14197 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
14199 * ipa-modref-tree.h (modref_access_node::try_merge_with): Restart
14200 search after merging.
14202 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
14204 * config/rs6000/rs6000-overload.def: Add remaining overloads.
14206 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
14208 * config/rs6000/rs6000-builtin-new.def: Add cell stanza.
14210 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
14212 * config/rs6000/rs6000-builtin-new.def: Add ieee128-hw, dfp,
14213 crypto, and htm stanzas.
14215 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
14217 * config/rs6000/rs6000-builtin-new.def: Add mma stanza.
14219 2021-08-26 Martin Sebor <msebor@redhat.com>
14221 * tree-ssa-uninit.c (warn_uninit): Refactor and simplify.
14222 (warn_uninit_phi_uses): Remove argument from calls to warn_uninit.
14223 (warn_uninitialized_vars): Same. Reduce visibility of locals.
14224 (warn_uninitialized_phi): Same.
14226 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
14228 * tree-ssa-ccp.c (get_individual_bits): Helper function to
14229 extract the individual bits from a widest_int constant (mask).
14230 (gray_code_bit_flips): New read-only table for efficiently
14231 enumerating permutations/combinations of bits.
14232 (bit_value_binop) [LROTATE_EXPR, RROTATE_EXPR]: Handle rotates
14233 by unknown counts that are guaranteed less than the target
14234 precision and four or fewer unknown bits by enumeration.
14235 [LSHIFT_EXPR, RSHIFT_EXPR]: Likewise, also handle shifts by
14236 enumeration under the same conditions. Handle remaining
14237 shifts as a mask based upon the minimum possible shift value.
14239 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
14240 Richard Biener <rguenther@suse.de>
14242 * match.pd (shift transformations): Remove a redundant
14243 !POINTER_TYPE_P check.
14245 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
14248 * config/i386/i386.md (cmove reg-reg move elimination peephole2s):
14249 Set all_regs to true in the call to replace_rtx.
14251 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
14253 * ipa-modref-tree.c (test_insert_search_collapse): Update test.
14254 * ipa-modref-tree.h (modref_base_node::insert): Be smarter when
14255 hitting --param modref-max-refs limit.
14256 (modref_tree::insert_base): Be smarter when hitting
14257 --param modref-max-bases limit. Add new parameter REF.
14258 (modref_tree::insert): Update.
14259 (modref_tree::merge): Update.
14260 * ipa-modref.c (read_modref_records): Update.
14262 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
14264 * params.opt (modref-max-adjustments): Add full stop.
14266 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
14268 * ipa-modref-tree.h (modref_ref_node::verify): New member
14270 (modref_ref_node::insert): Use it.
14271 (modref_ref_node::try_merge_with): Fix off-by-one error.
14273 2021-08-26 Martin Liska <mliska@suse.cz>
14274 Stefan Kneifel <stefan.kneifel@bluewin.ch>
14276 * cgraph.h (create_version_clone_with_body): Add new parameter.
14277 * cgraphclones.c: Likewise.
14278 * multiple_target.c (create_dispatcher_calls): Do not use
14280 (create_target_clone): Likewise here.
14282 2021-08-26 Jonathan Yong <10walls@gmail.com>
14284 * doc/extend.texi: Add note about reserved priorities
14285 to the constructor attribute.
14287 2021-08-25 Martin Sebor <msebor@redhat.com>
14289 * gimple-range-cache.cc (ssa_global_cache::dump): Avoid printing
14290 range table header alone.
14291 * gimple-range.cc (gimple_ranger::export_global_ranges): Same.
14293 2021-08-25 Jan Hubicka <hubicka@ucw.cz>
14295 * doc/invoke.texi: Document --param modref-max-adjustments.
14296 * ipa-modref-tree.c (test_insert_search_collapse): Update.
14297 (test_merge): Update.
14298 * ipa-modref-tree.h (struct modref_access_node): Add adjustments;
14299 (modref_access_node::operator==): Fix handling of access ranges.
14300 (modref_access_node::contains): Constify parameter; handle also
14301 mismatched parm offsets.
14302 (modref_access_node::update): New function.
14303 (modref_access_node::merge): New function.
14304 (unspecified_modref_access_node): Update constructor.
14305 (modref_ref_node::insert_access): Add record_adjustments parameter;
14307 (modref_ref_node::try_merge_with): New private function.
14308 (modref_tree::insert): New record_adjustments parameter.
14309 (modref_tree::merge): New record_adjustments parameter.
14310 (modref_tree::copy_from): Update.
14311 * ipa-modref.c (dump_access): Dump adjustments field.
14312 (get_access): Update constructor.
14313 (record_access): Update call of insert.
14314 (record_access_lto): Update call of insert.
14315 (merge_call_side_effects): Add record_adjustments parameter.
14316 (get_access_for_fnspec): Update.
14317 (process_fnspec): Update.
14318 (analyze_call): Update.
14319 (analyze_function): Update.
14320 (read_modref_records): Update.
14321 (ipa_merge_modref_summary_after_inlining): Update.
14322 (propagate_unknown_call): Update.
14323 (modref_propagate_in_scc): Update.
14324 * params.opt (param-max-modref-adjustments=): New.
14326 2021-08-25 Michael Meissner <meissner@linux.ibm.com>
14328 * config/rs6000/vsx.md (UNSPEC_XXSPLTIDP): Rename from
14330 (xxspltiw_v4si): Use vecperm type attribute.
14331 (xxspltiw_v4si_inst): Use vecperm type attribute.
14332 (xxspltiw_v4sf_inst): Likewise.
14333 (xxspltidp_v2df): Use vecperm type attribute. Use
14334 UNSPEC_XXSPLTIDP instead of UNSPEC_XXSPLTID.
14335 (xxspltidp_v2df_inst): Likewise.
14336 (xxsplti32dx_v4si): Use vecperm type attribute.
14337 (xxsplti32dx_v4si_inst): Likewise.
14338 (xxsplti32dx_v4sf_inst): Likewise.
14339 (xxblend_<mode>): Likewise.
14340 (xxpermx): Likewise.
14341 (xxpermx_inst): Likewise.
14342 (xxeval): Likewise.
14344 2021-08-25 Lewis Hyatt <lhyatt@gmail.com>
14347 * coretypes.h (typedef diagnostic_input_charset_callback): Declare.
14348 * diagnostic.c (diagnostic_initialize_input_context): New function.
14349 * diagnostic.h (diagnostic_initialize_input_context): Declare.
14350 * input.c (default_charset_callback): New function.
14351 (file_cache::initialize_input_context): New function.
14352 (file_cache_slot::create): Added ability to convert the input
14353 according to the input context.
14354 (file_cache::file_cache): Initialize the new input context.
14355 (class file_cache_slot): Added new m_alloc_offset member.
14356 (file_cache_slot::file_cache_slot): Initialize the new member.
14357 (file_cache_slot::~file_cache_slot): Handle potentially offset buffer.
14358 (file_cache_slot::maybe_grow): Likewise.
14359 (file_cache_slot::needs_read_p): Handle NULL fp, which is now possible.
14360 (file_cache_slot::get_next_line): Likewise.
14361 * input.h (class file_cache): Added input context member.
14363 2021-08-25 Richard Biener <rguenther@suse.de>
14365 PR tree-optimization/102046
14366 * tree-vect-slp.c (vect_build_slp_tree_2): Conservatively
14367 update ->any_pattern when swapping operands.
14369 2021-08-25 Hongyu Wang <hongyu.wang@intel.com>
14372 * config/i386/i386.c (ix86_live_on_entry): Adjust comment.
14373 (ix86_decompose_address): Remove retval check for ASHIFT,
14374 allow non-canonical zero extend if AND mask covers ASHIFT
14376 (ix86_legitimate_address_p): Adjust condition for decompose.
14377 (ix86_rtx_costs): Adjust cost for lea with non-canonical
14379 Co-Authored by: Uros Bizjak <ubizjak@gmail.com>
14381 2021-08-25 Jiufu Guo <guojiufu@linux.ibm.com>
14383 PR tree-optimization/101145
14384 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
14386 (number_of_iterations_lt): Invoke above function.
14387 (adjust_cond_for_loop_until_wrap):
14388 Merge to number_of_iterations_until_wrap.
14389 (number_of_iterations_cond): Update calls to
14390 adjust_cond_for_loop_until_wrap and number_of_iterations_lt.
14392 2021-08-25 konglin1 <lingling.kong@intel.com>
14395 * config/i386/avx512dqintrin.h (_mm512_fpclass_ps_mask): Fix
14396 macro definition at -O0.
14397 (_mm512_mask_fpclass_ps_mask): Ditto.
14399 2021-08-25 Kewen Lin <linkw@linux.ibm.com>
14401 * config/rs6000/altivec.md (vec_unpacku_hi_v16qi): Remove.
14402 (vec_unpacku_hi_v8hi): Likewise.
14403 (vec_unpacku_lo_v16qi): Likewise.
14404 (vec_unpacku_lo_v8hi): Likewise.
14405 (vec_unpacku_hi_<VP_small_lc>): New define_expand.
14406 (vec_unpacku_lo_<VP_small_lc>): Likewise.
14408 2021-08-24 David Edelsohn <dje.gcc@gmail.com>
14410 * config/rs6000/aix.h (SYSTEM_IMPLICIT_EXTERN_C): Delete.
14411 * config/rs6000/aix71.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
14412 * config/rs6000/aix72.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
14413 * config/rs6000/aix73.h (TARGET_AIX_VERSION): Increase to 73.
14415 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14417 PR middle-end/102031
14418 * simplify-rtx.c (simplify_truncation): When comparing precisions
14419 use "subreg_prec" variable, not "subreg_mode".
14421 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14423 * config/rs6000/rs6000-builtin-new.def: Add power10 and power10-64
14426 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14428 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Initialize
14429 various pointer type nodes.
14430 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add enum
14431 values for various pointer types.
14432 (ptr_V16QI_type_node): New macro.
14433 (ptr_V1TI_type_node): New macro.
14434 (ptr_V2DI_type_node): New macro.
14435 (ptr_V2DF_type_node): New macro.
14436 (ptr_V4SI_type_node): New macro.
14437 (ptr_V4SF_type_node): New macro.
14438 (ptr_V8HI_type_node): New macro.
14439 (ptr_unsigned_V16QI_type_node): New macro.
14440 (ptr_unsigned_V1TI_type_node): New macro.
14441 (ptr_unsigned_V8HI_type_node): New macro.
14442 (ptr_unsigned_V4SI_type_node): New macro.
14443 (ptr_unsigned_V2DI_type_node): New macro.
14444 (ptr_bool_V16QI_type_node): New macro.
14445 (ptr_bool_V8HI_type_node): New macro.
14446 (ptr_bool_V4SI_type_node): New macro.
14447 (ptr_bool_V2DI_type_node): New macro.
14448 (ptr_bool_V1TI_type_node): New macro.
14449 (ptr_pixel_type_node): New macro.
14450 (ptr_intQI_type_node): New macro.
14451 (ptr_uintQI_type_node): New macro.
14452 (ptr_intHI_type_node): New macro.
14453 (ptr_uintHI_type_node): New macro.
14454 (ptr_intSI_type_node): New macro.
14455 (ptr_uintSI_type_node): New macro.
14456 (ptr_intDI_type_node): New macro.
14457 (ptr_uintDI_type_node): New macro.
14458 (ptr_intTI_type_node): New macro.
14459 (ptr_uintTI_type_node): New macro.
14460 (ptr_long_integer_type_node): New macro.
14461 (ptr_long_unsigned_type_node): New macro.
14462 (ptr_float_type_node): New macro.
14463 (ptr_double_type_node): New macro.
14464 (ptr_long_double_type_node): New macro.
14465 (ptr_dfloat64_type_node): New macro.
14466 (ptr_dfloat128_type_node): New macro.
14467 (ptr_ieee128_type_node): New macro.
14468 (ptr_ibm128_type_node): New macro.
14469 (ptr_vector_pair_type_node): New macro.
14470 (ptr_vector_quad_type_node): New macro.
14471 (ptr_long_long_integer_type_node): New macro.
14472 (ptr_long_long_unsigned_type_node): New macro.
14474 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14476 * config/rs6000/rs6000-builtin-new.def: Add power9-vector, power9,
14477 and power9-64 stanzas.
14479 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14480 Tom de Vries <tdevries@suse.de>
14482 * config.gcc (nvptx-*-*): Define {c,c++}_target_objs.
14483 * config/nvptx/nvptx-protos.h (nvptx_cpu_cpp_builtins): Prototype.
14484 * config/nvptx/nvptx.h (TARGET_CPU_CPP_BUILTINS): Implement with
14485 a call to the new nvptx_cpu_cpp_builtins function in nvptx-c.c.
14486 * config/nvptx/t-nvptx (nvptx-c.o): New rule.
14487 * config/nvptx/nvptx-c.c: New source file.
14488 (nvptx_cpu_cpp_builtins): Move implementation here.
14490 2021-08-24 Martin Sebor <msebor@redhat.com>
14492 PR middle-end/101600
14493 PR middle-end/101977
14494 * gimple-ssa-warn-access.cc (maybe_warn_for_bound): Tighten up
14495 the phrasing of a warning.
14496 (check_access): Use the remaining size after subtracting any offset
14497 rather than the whole object size.
14498 * pointer-query.cc (access_ref::get_ref): Clear BASE0 flag if it's
14499 clear for any nonnull PHI argument.
14500 (compute_objsize): Clear argument.
14502 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14504 * config/rs6000/rs6000-builtin-new.def: Add power8-vector stanza.
14506 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14508 * config/rs6000/rs6000-builtin-new.def: Add power7 and power7-64
14511 2021-08-24 Andrew MacLeod <amacleod@redhat.com>
14513 * value-relation.cc (rr_transitive_table): New.
14514 (relation_transitive): New.
14515 (value_relation::swap): Remove.
14516 (value_relation::apply_transitive): New.
14517 (relation_oracle::relation_oracle): Allocate a new tmp bitmap.
14518 (relation_oracle::register_relation): Call register_transitives.
14519 (relation_oracle::register_transitives): New.
14520 * value-relation.h (relation_oracle): Add new temporary bitmap and
14523 2021-08-24 H.J. Lu <hjl.tools@gmail.com>
14526 * config/i386/i386-expand.c (ix86_expand_vector_move): Broadcast
14527 from integer to a pseudo vector register.
14529 2021-08-24 Richard Biener <rguenther@suse.de>
14531 PR tree-optimization/100089
14532 * tree-vectorizer.h (vect_slp_bb): Rename to ...
14533 (vect_slp_if_converted_bb): ... this and get the original
14534 loop as new argument.
14535 * tree-vectorizer.c (try_vectorize_loop_1): Revert previous fix,
14536 pass original loop to vect_slp_if_converted_bb.
14537 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
14538 If orig_loop was passed, scan the non-vectorized stmts
14539 for COND_EXPRs and force not profitable if found.
14540 (vect_slp_region): Pass down all SLP instances to costing
14541 if orig_loop was specified.
14542 (vect_slp_bbs): Pass through orig_loop.
14543 (vect_slp_bb): Rename to ...
14544 (vect_slp_if_converted_bb): ... this and get the original
14545 loop as new argument.
14546 (vect_slp_function): Adjust.
14548 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14551 * config/arm/arm.md (attribute arch): Add fix_vlldm.
14552 (arch_enabled): Use it.
14553 * config/arm/vfp.md (lazy_store_multiple_insn): Add alternative to
14554 use when erratum mitigation is needed.
14556 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14559 * config/arm/arm.opt (mfix-cmse-cve-2021-35465): New option.
14560 * doc/invoke.texi (Arm Options): Document it.
14561 * config/arm/arm-cpus.in (quirk_vlldm): New feature bit.
14562 (ALL_QUIRKS): Add quirk_vlldm.
14563 (cortex-m33): Add quirk_vlldm.
14564 (cortex-m35p, cortex-m55): Likewise.
14565 * config/arm/arm.c (arm_option_override): Enable fix_vlldm if
14566 targeting an affected CPU and not explicitly controlled on
14569 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14571 * config/arm/vfp.md (lazy_store_multiple_insn): Rewrite as valid RTL.
14572 (lazy_load_multiple_insn): Likewise.
14574 2021-08-24 liuhongt <hongtao.liu@intel.com>
14577 * config/i386/sse.md (<avx512>_vternlog<mode><sd_maskz_name>):
14578 Enable avx512 embedded broadcast.
14579 (*<avx512>_vternlog<mode>_all): Ditto.
14580 (<avx512>_vternlog<mode>_mask): Ditto.
14582 2021-08-24 liuhongt <hongtao.liu@intel.com>
14585 * config/i386/i386.c (ix86_rtx_costs): Define cost for
14587 * config/i386/i386.h (STRIP_UNARY): New macro.
14588 * config/i386/predicates.md (reg_or_notreg_operand): New
14590 * config/i386/sse.md (*<avx512>_vternlog<mode>_all): New define_insn.
14591 (*<avx512>_vternlog<mode>_1): New pre_reload
14592 define_insn_and_split.
14593 (*<avx512>_vternlog<mode>_2): Ditto.
14594 (*<avx512>_vternlog<mode>_3): Ditto.
14595 (any_logic1,any_logic2): New code iterator.
14596 (logic_op): New code attribute.
14597 (ternlogsuffix): Extend to VNxDF and VNxSF.
14599 2021-08-24 Richard Biener <rguenther@suse.de>
14601 * doc/invoke.texi (vect-inner-loop-cost-factor): Adjust.
14602 * params.opt (--param vect-inner-loop-cost-factor): Adjust
14604 * tree-vect-loop.c (vect_analyze_loop_form): Initialize
14605 inner_loop_cost_factor to the minimum of the estimated number
14606 of iterations of the inner loop and vect-inner-loop-cost-factor.
14608 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14609 Richard Biener <rguenther@suse.de>
14611 * config/i386/i386-features.c (compute_convert_gain): Provide
14612 more accurate values for CONST_INT, when optimizing for size.
14613 * config/i386/i386.c (COSTS_N_BYTES): Move definition from here...
14614 * config/i386/i386.h (COSTS_N_BYTES): to here.
14616 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14617 Jakub Jelinek <jakub@redhat.com>
14619 PR middle-end/102029
14620 * match.pd (shift transformations): Add an additional check for
14621 !POINTER_TYPE_P in the recently added left shift transformation.
14623 2021-08-24 liuhongt <hongtao.liu@intel.com>
14625 PR tree-optimization/100089
14626 * tree-vectorizer.c (try_vectorize_loop_1): Disable slp in
14627 loop vectorizer when cost model is very-cheap.
14629 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14631 * config/rs6000/rs6000-gen-builtins.c (parse_bif_entry): Don't call
14632 asprintf, which is not available on AIX.
14634 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14636 * config.gcc (target_gtfiles): Add ./rs6000-builtins.h.
14637 * config/rs6000/t-rs6000 (EXTRA_GTYPE_DEPS): Set.
14639 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14641 * config.gcc (powerpc*-*-*): Add rs6000-builtins.o to extra_objs.
14642 * config/rs6000/rs6000-gen-builtins.c (main): Close init_file
14644 * config/rs6000/t-rs6000 (rs6000-gen-builtins.o): New target.
14645 (rbtree.o): Likewise.
14646 (rs6000-gen-builtins): Likewise.
14647 (rs6000-builtins.c): Likewise.
14648 (rs6000-builtins.h): Likewise.
14649 (rs6000.o): Add dependency.
14650 (EXTRA_HEADERS): Add rs6000-vecdefines.h.
14651 (rs6000-vecdefines.h): New target.
14652 (rs6000-builtins.o): Likewise.
14653 (rs6000-call.o): Add rs6000-builtins.h as a dependency.
14654 (rs6000-c.o): Likewise.
14656 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14659 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace):
14660 Diagnose buffer overrun.
14661 (safe_inc_pos): Fix overrun detection.
14662 (match_identifier): Diagnose buffer overrun.
14663 (match_integer): Likewise.
14664 (match_to_right_bracket): Likewise.
14666 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14668 * ipa-modref-tree.h (modref_access_node::range_info_useful_p):
14669 Improve range compare.
14670 (modref_access_node::contains): New member function.
14671 (modref_access_node::search): Remove.
14672 (modref_access_node::insert): Be smarter about subaccesses.
14674 2021-08-23 Thomas Schwinge <thomas@codesourcery.com>
14676 * config/i386/i386-options.c (ix86_omp_device_kind_arch_isa)
14677 <omp_device_arch> [ACCEL_COMPILER]: Match "intel_mic".
14678 * config/i386/t-omp-device (omp-device-properties-i386) <arch>:
14681 2021-08-23 Jeff Law <jlaw@localhost.localdomain>
14683 * config/h8300/h8300-protos.h (h8300_expand_epilogue): Add new
14685 * config/h8300/jumpcall.md (call, call_value): Restrict to
14686 !SIBLING_CALL_P cases.
14687 (sibcall, sibcall_value): New patterns & expanders.
14688 * config/h8300/proepi.md (epilogue): Pass new argument to
14689 h8300_expand_epilogue.
14690 (sibcall_epilogue): New expander.
14691 * config/h8300/h8300.c (h8300_expand_epilogue): Handle sibcall
14693 (h8300_ok_for_sibcall_p): New function.
14694 (TARGET_FUNCTION_OK_FOR_SIBCALL): Define.
14696 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14698 * simplify-rtx.c (simplify_unary_operation_1): [TRUNCATE]:
14699 Handle case where the operand is already the desired mode.
14701 2021-08-23 Richard Biener <rguenther@suse.de>
14704 * tree-ssa-structalias.c (ipa_pta_execute): Check in_other_partition
14705 in addition to has_gimple_body.
14707 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14709 PR middle-end/101949
14710 * ipa-modref.c (analyze_ssa_name_flags): Fix merging of
14713 2021-08-23 Martin Liska <mliska@suse.cz>
14715 * doc/invoke.texi: Put the option out of -mxl-mode-app-model
14718 2021-08-23 Richard Biener <rguenther@suse.de>
14720 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
14721 Properly scale the inner loop cost only once.
14723 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14725 * tree-ssa-ccp.c (bit_value_binop) [TRUNC_MOD_EXPR, TRUNC_DIV_EXPR]:
14726 Provide bounds for unsigned (and signed with non-negative operands)
14727 division and modulus.
14729 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14731 * simplify-rtx.c (simplify_truncation): Generalize simplification
14732 of (truncate:A (subreg:B X)).
14733 (simplify_unary_operation_1) [FLOAT_TRUNCATE, FLOAT_EXTEND,
14734 SIGN_EXTEND, ZERO_EXTEND]: Handle cases where the operand
14735 already has the desired machine mode.
14736 (test_scalar_int_ops): Add tests that useless extensions and
14737 truncations are optimized away.
14738 (test_scalar_int_ext_ops): New self-test function to confirm
14739 that truncations of extensions are correctly simplified.
14740 (test_scalar_int_ext_ops2): New self-test function to check
14741 truncations of truncations, extensions of extensions, and
14742 truncations of extensions.
14743 (test_scalar_ops): Call the above two functions with a
14744 representative sampling of integer machine modes.
14746 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14748 * match.pd (shift transformations): Change the sign of an
14749 LSHIFT_EXPR if it reduces the number of explicit conversions.
14751 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14753 PR tree-optimization/86723
14754 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Add
14755 cast64_to_32 argument, set *cast64_to_32 to false, unless n is
14756 non-memory permutation of 64-bit src which only has bytes of
14757 0 or [5..8] and n->range is 4.
14758 (find_bswap_or_nop): Add cast64_to_32 and mask arguments, adjust
14759 find_bswap_or_nop_finalize caller, support bswap with some bytes
14760 zeroed, as long as at least two bytes are not zeroed.
14761 (bswap_replace): Add mask argument and handle masking of bswap
14763 (maybe_optimize_vector_constructor): Adjust find_bswap_or_nop
14764 caller, punt if cast64_to_32 or mask is not all ones.
14765 (pass_optimize_bswap::execute): Adjust find_bswap_or_nop_finalize
14766 caller, for now punt if cast64_to_32.
14768 2021-08-23 Richard Biener <rguenther@suse.de>
14770 PR tree-optimization/79334
14771 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Record
14772 a type also for COMPONENT_REFs.
14773 (vn_reference_may_trap): Check ARRAY_REF with constant index
14774 against the array domain.
14776 2021-08-23 liuhongt <hongtao.liu@intel.com>
14779 * config/i386/sse.md (*avx512f_pshufb_truncv8hiv8qi_1): Add
14780 TARGET_AVX512BW to condition.
14782 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14785 * dwarf2out.c (gen_variable_die): Add DW_AT_location for global
14786 register variables already during early_dwarf if possible.
14788 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
14790 * config/arm/arm_mve.h: Fix __arm_vctp16q return type.
14792 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
14795 * config/arm/arm.opt: Fix typo.
14796 * config/arm/t-rmprofile: Fix typo.
14798 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14800 * tree.h (OMP_CLAUSE_GRAINSIZE_STRICT): Define.
14801 (OMP_CLAUSE_NUM_TASKS_STRICT): Define.
14802 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_GRAINSIZE,
14803 case OMP_CLAUSE_NUM_TASKS>: Print strict: modifier.
14804 * omp-expand.c (expand_task_call): Use GOMP_TASK_FLAG_STRICT in iflags
14805 if either grainsize or num_tasks clause has the strict modifier.
14807 2021-08-23 Martin Liska <mliska@suse.cz>
14809 * dbgcnt.def (DEBUG_COUNTER): New counter.
14810 * gimple.c (gimple_call_arg_flags): Use it in IPA PTA.
14812 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14814 * ipa-modref.c (analyze_ssa_name_flags): Improve handling of return slot.
14816 2021-08-23 Xi Ruoyao <xry111@mengyan1223.wang>
14819 * config/mips/mips-protos.h (mips_msa_output_shift_immediate):
14821 * config/mips/mips.c (mips_msa_output_shift_immediate): New
14823 * config/mips/mips-msa.md (vashl<mode>3, vashr<mode>3,
14824 vlshr<mode>3): Call it.
14826 2021-08-22 Jan Hubicka <hubicka@ucw.cz>
14827 Martin Liska <mliska@suse.cz>
14829 PR middle-end/101949
14830 * ipa-modref.c (analyze_ssa_name_flags): Indirect call implies
14833 2021-08-21 Dragan Mladjenovic <OT_Dragan.Mladjenovic@mediatek.com>
14835 * config/mips/mips.c (mips_function_rodata_section,
14836 TARGET_ASM_FUNCTION_RODATA_SECTION): Removed.
14838 2021-08-21 John David Anglin <danglin@gcc.gnu.org>
14840 * config/pa/pa.c (pa_asm_output_aligned_common): Remove warning.
14842 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14844 * configure.ac (thread-local storage support): Remove tls_first_major
14845 and tls_first_minor. Use "$conftest_s" to check support.
14846 * configure: Regenerate.
14848 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14850 * configure.ac: Fixup formatting.
14852 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14854 * acinclude.m4 (gcc_GAS_CHECK_FEATURE): Remove third argument and ...
14855 * configure.ac: ... update all callers.
14857 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14860 * acinclude.m4 (_gcc_COMPUTE_GAS_VERSION, _gcc_GAS_VERSION_GTE_IFELSE)
14861 (gcc_GAS_VERSION_GTE_IFELSE): Remove.
14862 (gcc_GAS_CHECK_FEATURE): Do not handle in-tree case specially.
14863 * configure.ac: Remove gcc_cv_gas_major_version, gcc_cv_gas_minor_version.
14864 Remove remaining checks for in-tree assembler.
14865 * configure: Regenerate.
14867 2021-08-20 Jeff Law <jlaw@localhost.localdomain>
14869 * config/h8300/h8300.c (shift_alg_hi): Improve arithmetic shift right
14870 by 15 bits for H8/300H and H8/S. Improve logical shifts by 12
14872 (shift_alg_si): Improve arithmetic right shift by 28-30 bits for
14873 H8/300H. Improve arithmetic shift right by 15 bits for H8/S.
14874 Improve logical shifts by 27 bits for H8/S.
14875 (get_shift_alg): Corresponding changes.
14876 (h8300_option_override): Revert to loops for -Os when profitable.
14878 2021-08-20 Richard Biener <rguenther@suse.de>
14880 * tree-vect-data-refs.c (dr_group_sort_cmp): Do not compare
14882 (vect_analyze_data_ref_accesses): Likewise. Assign the BB
14883 index as group_id when dataref_groups were not computed.
14884 * tree-vect-slp.c (vect_slp_bbs): Bump current_group when
14885 we advance to the next BB.
14887 2021-08-20 Jakub Jelinek <jakub@redhat.com>
14889 * omp-builtins.def (BUILT_IN_GOMP_WARNING, BUILT_IN_GOMP_ERROR): New
14892 2021-08-20 Martin Liska <mliska@suse.cz>
14894 PR gcov-profile/89961
14895 * gcov.c (make_gcov_file_name): Rewrite using std::string.
14896 (mangle_name): Simplify, do not use the second argument.
14897 (strip_extention): New function.
14898 (get_md5sum): Likewise.
14899 (get_gcov_intermediate_filename): Handle properly -p and -x
14901 (output_gcov_file): Use string type.
14902 (generate_results): Likewise.
14903 (md5sum_to_hex): Remove.
14905 2021-08-20 Michael Meissner <meissner@linux.ibm.com>
14907 * config/rs6000/altivec.md (UNSPEC_XXEVAL): Move to vsx.md.
14908 (UNSPEC_XXSPLTIW): Move to vsx.md.
14909 (UNSPEC_XXSPLTID): Move to vsx.md.
14910 (UNSPEC_XXSPLTI32DX): Move to vsx.md.
14911 (UNSPEC_XXBLEND): Move to vsx.md.
14912 (UNSPEC_XXPERMX): Move to vsx.md.
14913 (VM3): Move to vsx.md.
14914 (VM3_char): Move to vsx.md.
14915 (xxspltiw_v4si): Move to vsx.md.
14916 (xxspltiw_v4sf): Move to vsx.md.
14917 (xxspltiw_v4sf_inst): Move to vsx.md.
14918 (xxspltidp_v2df): Move to vsx.md.
14919 (xxspltidp_v2df_inst): Move to vsx.md.
14920 (xxsplti32dx_v4si_inst): Move to vsx.md.
14921 (xxsplti32dx_v4sf): Move to vsx.md.
14922 (xxsplti32dx_v4sf_inst): Move to vsx.md.
14923 (xxblend_<mode>): Move to vsx.md.
14924 (xxpermx): Move to vsx.md.
14925 (xxpermx_inst): Move to vsx.md.
14926 * config/rs6000/vsx.md (UNSPEC_XXEVAL): Move from altivec.md.
14927 (UNSPEC_XXSPLTIW): Move from altivec.md.
14928 (UNSPEC_XXSPLTID): Move from altivec.md.
14929 (UNSPEC_XXSPLTI32DX): Move from altivec.md.
14930 (UNSPEC_XXBLEND): Move from altivec.md.
14931 (UNSPEC_XXPERMX): Move from altivec.md.
14932 (VM3): Move from altivec.md.
14933 (VM3_char): Move from altivec.md.
14934 (xxspltiw_v4si): Move from altivec.md.
14935 (xxspltiw_v4sf): Move from altivec.md.
14936 (xxspltiw_v4sf_inst): Move from altivec.md.
14937 (xxspltidp_v2df): Move from altivec.md.
14938 (xxspltidp_v2df_inst): Move from altivec.md.
14939 (xxsplti32dx_v4si_inst): Move from altivec.md.
14940 (xxsplti32dx_v4sf): Move from altivec.md.
14941 (xxsplti32dx_v4sf_inst): Move from altivec.md.
14942 (xxblend_<mode>): Move from altivec.md.
14943 (xxpermx): Move from altivec.md.
14944 (xxpermx_inst): Move from altivec.md.
14946 2021-08-19 Roger Sayle <roger@nextmovesoftware.com>
14948 * tree-vect-generic.c (expand_vector_operations_1): Use either
14949 gimplify_build1 or gimplify_build2 instead of gimple_build_assign
14950 when constructing scalar splat expressions.
14952 2021-08-19 Peter Bergner <bergner@linux.ibm.com>
14955 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Cast
14956 pointer to __vector_pair *.
14958 2021-08-19 Martin Sebor <msebor@redhat.com>
14960 * gimple-range.cc: Add comments.
14961 * gimple-range.h: Same.
14963 2021-08-19 Martin Sebor <msebor@redhat.com>
14965 PR middle-end/101984
14966 * gimple-ssa-warn-access.cc (pass_waccess::execute): Also call
14969 2021-08-19 Jeff Law <jlaw@localhost.localdomain>
14971 * config.gcc (h8300-*-elf*): Do not include dbxelf.h.
14972 (h8300-*-linux*, v850-*-rtems*, v850*-elf*): Likewise.
14973 * config/v850/v850.h (DEFAULT_GDB_EXTENSIONS): Remove.
14975 2021-08-19 Jakub Jelinek <jakub@redhat.com>
14977 PR middle-end/101950
14978 * optabs.c (expand_clrsb_using_clz): New function.
14979 (expand_unop): Use it as another clrsb expansion fallback.
14981 2021-08-19 liuhongt <hongtao.liu@intel.com>
14984 2021-07-28 liuhongt <hongtao.liu@intel.com>
14987 * config/i386/i386.h (processor_costs): Add new member
14989 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
14990 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
14991 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
14992 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
14993 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
14994 nocona_cost, atom_cost, slm_cost, intel_cost,
14995 generic_cost, core_cost): Initialize integer_to_sse same value
14997 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
14998 * config/i386/i386.c (ix86_builtin_vectorization_cost):
14999 Use integer_to_sse instead of sse_op to calculate the cost of
15002 2021-08-18 Iain Sandoe <iain@sandoe.co.uk>
15004 * config.gcc: Include rpath.opt for Darwin.
15005 * config/darwin.h (DRIVER_SELF_SPECS): Handle -rpath.
15007 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
15009 PR bootstrap/101959
15010 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
15013 2021-08-18 Jonathan Wright <jonathan.wright@arm.com>
15015 * config/aarch64/arm_neon.h (vld3_lane_f64): Use float RTL
15016 pattern and type cast.
15017 (vld4_lane_f32): Use float RTL pattern.
15018 (vld4q_lane_f64): Use float type cast.
15020 2021-08-18 Jan Hubicka <hubicka@ucw.cz>
15022 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Check also
15025 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
15027 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor): Extend.
15028 (test_map_of_type_with_ctor_and_dtor_expand): Add function.
15029 (hash_map_tests_c_tests): Call it.
15031 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
15033 * ggc.h (enum ggc_collect): New.
15034 (ggc_collect): Use it.
15035 * ggc-page.c: Adjust.
15036 * ggc-common.c: Likewise.
15037 * ggc-tests.c: Likewise.
15038 * read-rtl-function.c: Likewise.
15039 * selftest-run-tests.c: Likewise.
15040 * doc/gty.texi (Invoking the garbage collector): Likewise.
15042 2021-08-18 liuhongt <hongtao.liu@intel.com>
15045 * config/i386/i386.h (TARGET_V2DF_REDUCTION_PREFER_HADDPD):
15047 * config/i386/sse.md (*sse3_haddv2df3_low): Add
15048 TARGET_V2DF_REDUCTION_PREFER_HADDPD.
15049 (*sse3_hsubv2df3_low): Ditto.
15050 * config/i386/x86-tune.def
15051 (X86_TUNE_V2DF_REDUCTION_PREFER_HADDPD): New tune.
15053 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
15055 * gimple-range-gori.cc (gori_compute::gori_compute): Enable tracing.
15056 (gori_compute::compute_operand_range): Add tracing.
15057 (gori_compute::logical_combine): Ditto.
15058 (gori_compute::compute_logical_operands): Ditto.
15059 (gori_compute::compute_operand1_range): Ditto.
15060 (gori_compute::compute_operand2_range): Ditto.
15061 (gori_compute::outgoing_edge_range_p): Ditto.
15062 * gimple-range-gori.h (class gori_compute): Add range_tracer.
15064 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
15066 * flag-types.h (enum evrp_mode): Adjust evrp-mode values.
15067 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Relocate from.
15068 * gimple-range-trace.h (DEBUG_RANGE_CACHE): Here.
15069 * params.opt (--param=evrp-mode): Adjust options.
15071 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
15073 * Makefile.in (OBJS): Add gimple-range-trace.o.
15074 * gimple-range-cache.h (enable_new_values): Remove unused prototype.
15075 * gimple-range-fold.cc: Adjust headers.
15076 * gimple-range-trace.cc: New.
15077 * gimple-range-trace.h: New.
15078 * gimple-range.cc (gimple_ranger::gimple_ranger): Enable tracer.
15079 (gimple_ranger::range_of_expr): Add tracing.
15080 (gimple_ranger::range_on_entry): Ditto.
15081 (gimple_ranger::range_on_exit): Ditto.
15082 (gimple_ranger::range_on_edge): Ditto.
15083 (gimple_ranger::fold_range_internal): Ditto.
15084 (gimple_ranger::dump_bb): Do not calculate edge range twice.
15085 (trace_ranger::*): Remove.
15086 (enable_ranger): Never create a trace_ranger.
15087 (debug_seed_ranger): Move to gimple-range-trace.cc.
15088 (dump_ranger): Ditto.
15089 (debug_ranger): Ditto.
15090 * gimple-range.h: Include gimple-range-trace.h.
15091 (range_on_entry, range_on_exit): No longer virtual.
15092 (class trace_ranger): Remove.
15093 (DEBUG_RANGE_CACHE): Move to gimple-range-trace.h.
15095 2021-08-17 Martin Sebor <msebor@redhat.com>
15097 PR middle-end/101854
15098 * builtins.c (expand_builtin_alloca): Move warning code to check_alloca
15099 in gimple-ssa-warn-access.cc.
15100 * calls.c (alloc_max_size): Move code to check_alloca.
15101 (get_size_range): Move to pointer-query.cc.
15102 (maybe_warn_alloc_args_overflow): Move to gimple-ssa-warn-access.cc.
15103 (get_attr_nonstring_decl): Move to tree.c.
15104 (fntype_argno_type): Move to gimple-ssa-warn-access.cc.
15105 (append_attrname): Same.
15106 (maybe_warn_rdwr_sizes): Same.
15107 (initialize_argument_information): Move code to
15108 gimple-ssa-warn-access.cc.
15109 * calls.h (maybe_warn_alloc_args_overflow): Move to
15110 gimple-ssa-warn-access.h.
15111 (get_attr_nonstring_decl): Move to tree.h.
15112 (maybe_warn_nonstring_arg): Move to gimple-ssa-warn-access.h.
15113 (enum size_range_flags): Move to pointer-query.h.
15114 (get_size_range): Same.
15115 * gimple-ssa-warn-access.cc (has_location): Remove unused overload
15116 to avoid Clang -Wunused-function.
15117 (get_size_range): Declare static.
15118 (maybe_emit_free_warning): Rename...
15119 (maybe_check_dealloc_call): ...to this for consistency.
15120 (class pass_waccess): Add members.
15121 (pass_waccess::~pass_waccess): Define.
15122 (alloc_max_size): Move here from calls.c.
15123 (maybe_warn_alloc_args_overflow): Same.
15124 (check_alloca): New function.
15125 (check_alloc_size_call): New function.
15126 (check_strncat): Handle another warning flag.
15127 (pass_waccess::check_builtin): Handle alloca.
15128 (fntype_argno_type): Move here from calls.c.
15129 (append_attrname): Same.
15130 (maybe_warn_rdwr_sizes): Same.
15131 (pass_waccess::check_call): Define.
15132 (check_nonstring_args): New function.
15133 (pass_waccess::check): Call new member functions.
15134 (pass_waccess::execute): Enable ranger.
15135 * gimple-ssa-warn-access.h (get_size_range): Move here from calls.h.
15136 (maybe_warn_nonstring_arg): Same.
15137 * gimple-ssa-warn-restrict.c: Remove #include.
15138 * pointer-query.cc (get_size_range): Move here from calls.c.
15139 * pointer-query.h (enum size_range_flags): Same.
15140 (get_size_range): Same.
15141 * tree.c (get_attr_nonstring_decl): Move here from calls.c.
15142 * tree.h (get_attr_nonstring_decl): Move here from calls.h.
15144 2021-08-17 Thomas Schwinge <thomas@codesourcery.com>
15146 * ggc.h (ggc_collect): Add 'force_collect' parameter.
15147 * ggc-page.c (ggc_collect): Use that one instead of global
15148 'ggc_force_collect'. Adjust all users.
15149 * doc/gty.texi (Invoking the garbage collector): Update.
15150 * ggc-internal.h (ggc_force_collect): Remove.
15151 * ggc-common.c (ggc_force_collect): Likewise.
15152 * selftest.h (forcibly_ggc_collect): Remove.
15153 * ggc-tests.c (selftest::forcibly_ggc_collect): Likewise.
15154 * read-rtl-function.c (test_loading_labels): Adjust.
15155 * selftest-run-tests.c (run_tests): Likewise.
15157 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
15159 * config/darwin.c (darwin_file_end): Reset and reclaim the
15160 section names table at the end of compile.
15162 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
15165 * config.in: Regenerate.
15166 * config/i386/darwin.h (EXTRA_ASM_OPTS): New.
15167 (ASM_SPEC): Pass options to disable branch shortening where
15169 * configure: Regenerate.
15170 * configure.ac: Detect versions of 'as' that support the
15171 optimisation which has the bug.
15173 2021-08-17 Richard Biener <rguenther@suse.de>
15175 * optabs-query.c (supports_vec_gather_load_p): Also check
15177 (supports_vec_scatter_store_p): Likewise.
15178 * tree-vect-data-refs.c (vect_gather_scatter_fn_p): Fall
15179 back to masked variants if non-masked are not supported.
15180 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
15181 When we need to use masked gather/scatter but do not have
15182 a mask set up a constant true one.
15183 * tree-vect-stmts.c (vect_check_scalar_mask): Also allow
15184 non-SSA_NAME masks.
15186 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
15188 * tree-ssa-ccp.c (bit_value_binop) [MINUS_EXPR]: Use same
15189 algorithm as PLUS_EXPR to improve subtraction bit bounds.
15190 [POINTER_DIFF_EXPR]: Treat as synonymous with MINUS_EXPR.
15192 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
15194 * tree-ssa-ccp.c (bit_value_mult_const): New helper function to
15195 calculate the mask-value pair result of a multiplication by an
15197 (bit_value_binop) [MULT_EXPR]: Call it from here for
15198 multiplications by (sparse) non-negative constants.
15200 2021-08-17 Christophe Lyon <christophe.lyon@foss.st.com>
15203 * config.gcc (gcc_cv_initfini_array): Leave undefined for
15204 uclinuxfdpiceabi targets.
15206 2021-08-17 Alexandre Oliva <oliva@adacore.com>
15208 * tree-inline.c (maybe_move_debug_stmts_to_successors): Don't
15209 reverse debug stmts.
15211 2021-08-17 Alexandre Oliva <oliva@adacore.com>
15213 * tree-cfg.c (dump_function_to_file): Use fun, not cfun.
15215 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
15217 * config/aarch64/arm_neon.h (__LD4_LANE_FUNC): Delete.
15218 (__LD4Q_LANE_FUNC): Likewise.
15219 (vld4_lane_u8): Define without macro.
15220 (vld4_lane_u16): Likewise.
15221 (vld4_lane_u32): Likewise.
15222 (vld4_lane_u64): Likewise.
15223 (vld4_lane_s8): Likewise.
15224 (vld4_lane_s16): Likewise.
15225 (vld4_lane_s32): Likewise.
15226 (vld4_lane_s64): Likewise.
15227 (vld4_lane_f16): Likewise.
15228 (vld4_lane_f32): Likewise.
15229 (vld4_lane_f64): Likewise.
15230 (vld4_lane_p8): Likewise.
15231 (vld4_lane_p16): Likewise.
15232 (vld4_lane_p64): Likewise.
15233 (vld4q_lane_u8): Likewise.
15234 (vld4q_lane_u16): Likewise.
15235 (vld4q_lane_u32): Likewise.
15236 (vld4q_lane_u64): Likewise.
15237 (vld4q_lane_s8): Likewise.
15238 (vld4q_lane_s16): Likewise.
15239 (vld4q_lane_s32): Likewise.
15240 (vld4q_lane_s64): Likewise.
15241 (vld4q_lane_f16): Likewise.
15242 (vld4q_lane_f32): Likewise.
15243 (vld4q_lane_f64): Likewise.
15244 (vld4q_lane_p8): Likewise.
15245 (vld4q_lane_p16): Likewise.
15246 (vld4q_lane_p64): Likewise.
15247 (vld4_lane_bf16): Likewise.
15248 (vld4q_lane_bf16): Likewise.
15250 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
15252 * config/aarch64/arm_neon.h (__LD3_LANE_FUNC): Delete.
15253 (__LD3Q_LANE_FUNC): Delete.
15254 (vld3_lane_u8): Define without macro.
15255 (vld3_lane_u16): Likewise.
15256 (vld3_lane_u32): Likewise.
15257 (vld3_lane_u64): Likewise.
15258 (vld3_lane_s8): Likewise.
15259 (vld3_lane_s16): Likewise.
15260 (vld3_lane_s32): Likewise.
15261 (vld3_lane_s64): Likewise.
15262 (vld3_lane_f16): Likewise.
15263 (vld3_lane_f32): Likewise.
15264 (vld3_lane_f64): Likewise.
15265 (vld3_lane_p8): Likewise.
15266 (vld3_lane_p16): Likewise.
15267 (vld3_lane_p64): Likewise.
15268 (vld3q_lane_u8): Likewise.
15269 (vld3q_lane_u16): Likewise.
15270 (vld3q_lane_u32): Likewise.
15271 (vld3q_lane_u64): Likewise.
15272 (vld3q_lane_s8): Likewise.
15273 (vld3q_lane_s16): Likewise.
15274 (vld3q_lane_s32): Likewise.
15275 (vld3q_lane_s64): Likewise.
15276 (vld3q_lane_f16): Likewise.
15277 (vld3q_lane_f32): Likewise.
15278 (vld3q_lane_f64): Likewise.
15279 (vld3q_lane_p8): Likewise.
15280 (vld3q_lane_p16): Likewise.
15281 (vld3q_lane_p64): Likewise.
15282 (vld3_lane_bf16): Likewise.
15283 (vld3q_lane_bf16): Likewise.
15285 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
15287 * config/aarch64/arm_neon.h (__LD2_LANE_FUNC): Delete.
15288 (__LD2Q_LANE_FUNC): Likewise.
15289 (vld2_lane_u8): Define without macro.
15290 (vld2_lane_u16): Likewise.
15291 (vld2_lane_u32): Likewise.
15292 (vld2_lane_u64): Likewise.
15293 (vld2_lane_s8): Likewise.
15294 (vld2_lane_s16): Likewise.
15295 (vld2_lane_s32): Likewise.
15296 (vld2_lane_s64): Likewise.
15297 (vld2_lane_f16): Likewise.
15298 (vld2_lane_f32): Likewise.
15299 (vld2_lane_f64): Likewise.
15300 (vld2_lane_p8): Likewise.
15301 (vld2_lane_p16): Likewise.
15302 (vld2_lane_p64): Likewise.
15303 (vld2q_lane_u8): Likewise.
15304 (vld2q_lane_u16): Likewise.
15305 (vld2q_lane_u32): Likewise.
15306 (vld2q_lane_u64): Likewise.
15307 (vld2q_lane_s8): Likewise.
15308 (vld2q_lane_s16): Likewise.
15309 (vld2q_lane_s32): Likewise.
15310 (vld2q_lane_s64): Likewise.
15311 (vld2q_lane_f16): Likewise.
15312 (vld2q_lane_f32): Likewise.
15313 (vld2q_lane_f64): Likewise.
15314 (vld2q_lane_p8): Likewise.
15315 (vld2q_lane_p16): Likewise.
15316 (vld2q_lane_p64): Likewise.
15317 (vld2_lane_bf16): Likewise.
15318 (vld2q_lane_bf16): Likewise.
15320 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
15322 * haifa-sched.c (advance_one_cycle): Output more context-synchronization
15325 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
15327 * haifa-sched.c (enum rfs_decision, rfs_str): Add RFS_AUTOPREF.
15328 (rank_for_schedule): Use it.
15330 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
15332 PR rtl-optimization/91598
15333 * haifa-sched.c (autopref_rank_for_schedule): Prioritize "irrelevant"
15334 insns after memory reads and before memory writes.
15336 2021-08-17 Alistair_Lee <alistair.lee@arm.com>
15338 * rtl.h (CONST_VECTOR_P): New macro.
15339 * config/aarch64/aarch64.c (aarch64_get_sve_pred_bits): Use RTL
15340 code testing macros.
15341 (aarch64_ptrue_all_mode): Likewise.
15342 (aarch64_expand_mov_immediate): Likewise.
15343 (aarch64_const_vec_all_in_range_p): Likewise.
15344 (aarch64_rtx_costs): Likewise.
15345 (aarch64_legitimate_constant_p): Likewise.
15346 (aarch64_simd_valid_immediate): Likewise.
15347 (aarch64_simd_make_constant): Likewise.
15348 (aarch64_convert_mult_to_shift): Likewise.
15349 (aarch64_expand_sve_vec_perm): Likewise.
15350 (aarch64_vec_fpconst_pow_of_2): Likewise.
15352 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
15354 PR tree-optimization/101938
15355 * range-op.cc (operator_abs::op1_range): Special case
15356 -TYPE_MIN_VALUE for flag_wrapv.
15358 2021-08-17 Kewen Lin <linkw@linux.ibm.com>
15360 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Add the cost for
15363 2021-08-17 Jakub Jelinek <jakub@redhat.com>
15365 * tree.def (OMP_SCOPE): New tree code.
15366 * tree.h (OMP_SCOPE_BODY, OMP_SCOPE_CLAUSES): Define.
15367 * tree-nested.c (convert_nonlocal_reference_stmt,
15368 convert_local_reference_stmt, convert_gimple_call): Handle
15370 * tree-pretty-print.c (dump_generic_node): Handle OMP_SCOPE.
15371 * gimple.def (GIMPLE_OMP_SCOPE): New gimple code.
15372 * gimple.c (gimple_build_omp_scope): New function.
15373 (gimple_copy): Handle GIMPLE_OMP_SCOPE.
15374 * gimple.h (gimple_build_omp_scope): Declare.
15375 (gimple_has_substatements): Handle GIMPLE_OMP_SCOPE.
15376 (gimple_omp_scope_clauses, gimple_omp_scope_clauses_ptr,
15377 gimple_omp_scope_set_clauses): New inline functions.
15378 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_SCOPE.
15379 * gimple-pretty-print.c (dump_gimple_omp_scope): New function.
15380 (pp_gimple_stmt_1): Handle GIMPLE_OMP_SCOPE.
15381 * gimple-walk.c (walk_gimple_stmt): Likewise.
15382 * gimple-low.c (lower_stmt): Likewise.
15383 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
15384 (gimplify_scan_omp_clauses): For task reductions, handle OMP_SCOPE
15385 like ORT_WORKSHARE constructs. Adjust diagnostics for %<scope%>
15386 allowing task reductions. Reject inscan reductions on scope.
15387 (omp_find_stores_stmt): Handle GIMPLE_OMP_SCOPE.
15388 (gimplify_omp_workshare, gimplify_expr): Handle OMP_SCOPE.
15389 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_SCOPE.
15390 (estimate_num_insns): Likewise.
15391 * omp-low.c (build_outer_var_ref): Look through GIMPLE_OMP_SCOPE
15392 contexts if var isn't privatized there.
15393 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_SCOPE.
15394 (scan_omp_1_stmt): Likewise.
15395 (maybe_add_implicit_barrier_cancel): Look through outer
15397 (lower_omp_scope): New function.
15398 (lower_omp_task_reductions): Handle OMP_SCOPE.
15399 (lower_omp_1): Handle GIMPLE_OMP_SCOPE.
15400 (diagnose_sb_1, diagnose_sb_2): Likewise.
15401 * omp-expand.c (expand_omp_single): Support also GIMPLE_OMP_SCOPE.
15402 (expand_omp): Handle GIMPLE_OMP_SCOPE.
15403 (omp_make_gimple_edges): Likewise.
15404 * omp-builtins.def (BUILT_IN_GOMP_SCOPE_START): New built-in.
15406 2021-08-17 Richard Biener <rguenther@suse.de>
15408 PR tree-optimization/101925
15409 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Set
15410 reverse on COMPONENT_REF and ARRAY_REF according to
15411 what reverse_storage_order_for_component_p does.
15412 (vn_reference_eq): Compare reversed on reference ops.
15413 (reverse_storage_order_for_component_p): New overload.
15414 (vn_reference_lookup_3): Check reverse_storage_order_for_component_p
15415 on the reference looked up.
15417 2021-08-17 Jeff Law <jlaw@localhost.localdomain>
15419 * config/h8300/h8300.c (shift_alg_si): Avoid loops for most SImode
15420 shifts on the H8/S.
15421 (h8300_option_override): Use loops on H8/S more often when optimizing
15423 (get_shift_alg): Handle new "special" cases on H8/S. Simplify
15424 accordingly. Handle various arithmetic right shifts with special
15425 sequences that we couldn't handle before.
15427 2021-08-16 Jeff Law <jlaw@localhost.localdomain>
15429 * config.gcc (rl78-*-elf*): Do not include dbxelf.h.
15431 2021-08-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
15433 * config/sparc/rtemself.h (SPARC_GCOV_TYPE_SIZE): Define.
15434 * config/sparc/sparc.c (sparc_gcov_type_size): New.
15435 (TARGET_GCOV_TYPE_SIZE): Redefine if SPARC_GCOV_TYPE_SIZE is defined.
15436 * coverage.c (get_gcov_type): Use targetm.gcov_type_size().
15437 * doc/tm.texi (TARGET_GCOV_TYPE_SIZE): Add hook under "Misc".
15438 * doc/tm.texi.in: Regenerate.
15439 * target.def (gcov_type_size): New target hook.
15440 * targhooks.c (default_gcov_type_size): New.
15441 * targhooks.h (default_gcov_type_size): Declare.
15442 * tree-profile.c (gimple_gen_edge_profiler): Use precision of
15444 (gimple_gen_time_profiler): Likewise.
15446 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
15448 * dwarf2out.c (add_scalar_info): Deal with DW_AT_data_bit_offset.
15450 2021-08-16 Tobias Burnus <tobias@codesourcery.com>
15452 PR middle-end/101931
15453 * omp-low.c (omp_runtime_api_call): Update for routines
15454 added in the meanwhile.
15456 2021-08-16 Martin Liska <mliska@suse.cz>
15458 PR tree-optimization/100393
15459 * tree-switch-conversion.c (group_cluster::dump): Use
15460 get_comparison_count.
15461 (jump_table_cluster::find_jump_tables): Pre-compute number of
15462 comparisons and then decrement it. Cache also max_ratio.
15463 (jump_table_cluster::can_be_handled): Change signature.
15464 * tree-switch-conversion.h (get_comparison_count): New.
15466 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
15468 * dwarf2out.c (add_data_member_location_attribute): Use GNAT
15469 encodings only when -fgnat-encodings=all is specified.
15470 (add_bound_info): Likewise.
15471 (add_byte_size_attribute): Likewise.
15472 (gen_member_die): Likewise.
15474 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
15476 * omp-oacc-neuter-broadcast.cc
15477 (execute_omp_oacc_neuter_broadcast): Plug 'par' memory leak.
15479 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
15481 * omp-oacc-neuter-broadcast.cc
15482 (execute_omp_oacc_neuter_broadcast): Clarify memory management for
15485 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
15487 * omp-oacc-neuter-broadcast.cc (field_map): Move variable into...
15488 (execute_omp_oacc_neuter_broadcast): ... here.
15489 (install_var_field, build_receiver_ref, build_sender_ref): Take
15490 'field_map_t *' parameter. Adjust all users.
15491 (worker_single_copy, neuter_worker_single): Take a
15492 'record_field_map_t *' parameter. Adjust all users.
15494 2021-08-16 liuhongt <hongtao.liu@intel.com>
15497 * config/i386/i386.md (ldexp<mode>3): Force operands[1] to
15500 2021-08-16 Martin Liska <mliska@suse.cz>
15503 * multiple_target.c (create_dispatcher_calls): Make default
15504 function local only if it is a definition.
15506 2021-08-16 Martin Liska <mliska@suse.cz>
15509 * ipa-icf-gimple.c (func_checker::compare_ssa_name): Do not
15510 consider equal SSA_NAMEs when one is a param.
15512 2021-08-16 liuhongt <hongtao.liu@intel.com>
15515 * config/i386/i386-expand.c (ix86_expand_vec_perm_vpermt2):
15516 Support vpermi2b for V32QI/V16QImode.
15517 (ix86_extract_perm_from_pool_constant): New function.
15518 (ix86_expand_vec_one_operand_perm_avx512): Support
15519 vpermw/vpermb under TARGET_AVX512BW/TARGET_AVX512VBMI.
15520 (expand_vec_perm_1): Adjust comments for upper.
15521 * config/i386/i386-protos.h (ix86_extract_perm_from_pool_constant):
15523 * config/i386/predicates.md (permvar_truncate_operand): New predicate.
15524 (pshufb_truncv4siv4hi_operand): Ditto.
15525 (pshufb_truncv8hiv8qi_operand): Ditto.
15526 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1):
15527 New pre_reload define_insn_and_split.
15528 (*avx512f_permvar_truncv8siv8hi_1): Ditto.
15529 (*avx512f_vpermvar_truncv8div8si_1): Ditto.
15530 (*avx512f_permvar_truncv32hiv32qi_1): Ditto.
15531 (*avx512f_permvar_truncv16hiv16qi_1): Ditto.
15532 (*avx512f_permvar_truncv4div4si_1): Ditto.
15533 (*avx512f_pshufb_truncv8hiv8qi_1): Ditto.
15534 (*avx512f_pshufb_truncv4siv4hi_1): Ditto.
15535 (*avx512f_pshufd_truncv2div2si_1): Ditto.
15537 2021-08-16 Kito Cheng <kito.cheng@sifive.com>
15539 * config/riscv/multilib-generator: Support code model option for
15541 * doc/install.texi: Add document of new option for
15542 --with-multilib-generator.
15544 2021-08-15 Clément Chigot <clement.chigot@atos.net>
15546 * config/rs6000/rs6000.c (xcoff_tls_exec_model_detected): New.
15547 (rs6000_legitimize_tls_address_aix): Use it.
15548 (rs6000_xcoff_file_end): Add ".ref __tls_get_addr" when
15549 xcoff_tls_exec_model_detected is true.
15551 2021-08-15 Jeff Law <jlaw@localhost.localdomain>
15553 * config/h8300/h8300.c (shift_alg_si): Retune H8/300H shifts
15554 to allow a bit more code growth, saving many dozens of cycles.
15555 (h8300_option_override): Adjust shift_alg_si if optimizing for
15557 (get_shift_alg): Use special + inline shifts for residuals
15560 2021-08-14 Stafford Horne <shorne@gmail.com>
15563 * config/or1k/or1k-opts.h: New file.
15564 * config/or1k/or1k.c (or1k_legitimize_address_1, print_reloc):
15565 Support generating gotha relocations if -mcmodel=large is
15567 * config/or1k/or1k.h (TARGET_CMODEL_SMALL, TARGET_CMODEL_LARGE):
15569 * config/or1k/or1k.opt (mcmodel=): New option.
15570 * doc/invoke.texi (OpenRISC Options): Document mcmodel.
15572 2021-08-14 Martin Sebor <msebor@redhat.com>
15574 PR middle-end/101791
15575 * gimple-ssa-warn-access.cc (new_delete_mismatch_p): Use new argument
15576 to valid_new_delete_pair_p.
15577 * tree.c (valid_new_delete_pair_p): Add argument.
15578 * tree.h (valid_new_delete_pair_p): Same.
15580 2021-08-14 Jakub Jelinek <jakub@redhat.com>
15583 * config/i386/i386-expand.c (expand_vec_perm_broadcast_1)
15584 <case E_V64QImode>: For this mode assert
15585 !TARGET_AVX512BW || d->perm[0] rather than !TARGET_AVX2 || d->perm[0].
15587 2021-08-13 Michael Meissner <meissner@linux.ibm.com>
15590 * config/rs6000/altivec.md (xxeval): Use register_predicate
15591 instead of altivec_register_predicate.
15593 2021-08-13 Martin Sebor <msebor@redhat.com>
15595 PR middle-end/101734
15596 * tree-ssa-uninit.c (maybe_warn_read_write_only): New function.
15597 (maybe_warn_operand): Call it.
15599 2021-08-13 Martin Liska <mliska@suse.cz>
15602 * attribs.c (decl_attributes): Make naked functions "noipa"
15605 2021-08-13 Martin Liska <mliska@suse.cz>
15608 * symtab.c (symtab_node::noninterposable_alias): Do not create
15609 local aliases for target_clone functions as the cloning pass
15612 2021-08-13 Martin Liska <mliska@suse.cz>
15614 * opts.c (LIVE_PATCHING_OPTION): Define.
15615 (control_options_for_live_patching): Use it in error messages.
15617 2021-08-13 Jan Hubicka <hubicka@ucw.cz>
15619 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOREAD.
15620 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
15621 ignore_stores_eaf_flags): New constants.
15622 (remove_useless_eaf_flags): New function.
15623 (eaf_flags_useful_p): Use it.
15624 (deref_flags): Add EAF_NOT_RETURNED if flag is unused;
15626 (modref_lattice::init): Add EAF_NOREAD.
15627 (modref_lattice::add_escape_point): Do not record escape point if
15629 (modref_lattice::merge): EAF_NOESCAPE implies EAF_NODIRECTESCAPE;
15630 use remove_useless_eaf_flags.
15631 (modref_lattice::merge_deref): Use ignore_stores_eaf_flags.
15632 (modref_lattice::merge_direct_load): Add EAF_NOREAD.
15633 (analyze_ssa_name_flags): Fix handling EAF_NOT_RETURNED.
15634 (analyze_parms): Use remove_useless_eaf_flags.
15635 (ipa_merge_modref_summary_after_inlining): Use ignore_stores_eaf_flags.
15636 (modref_merge_call_site_flags): Add caller and ecf_flags parameter;
15637 use remove_useless_eaf_flags.
15638 (modref_propagate_flags_in_scc): Update.
15639 * ipa-modref.h: Turn eaf_flags_t back to char.
15640 * tree-core.h (EAF_NOT_RETURNED): Fix.
15641 (EAF_NOREAD): New constant.
15642 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Check for
15644 * tree-ssa-structalias.c (handle_rhs_call): Handle new flags.
15645 (handle_pure_call): Likewise.
15647 2021-08-12 Jakub Jelinek <jakub@redhat.com>
15649 * tree.def (OMP_MASKED): New tree code.
15650 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_FILTER.
15651 * tree.h (OMP_MASKED_BODY, OMP_MASKED_CLAUSES, OMP_MASKED_COMBINED,
15652 OMP_CLAUSE_FILTER_EXPR): Define.
15653 * tree.c (omp_clause_num_ops): Add OMP_CLAUSE_FILTER entry.
15654 (omp_clause_code_name): Likewise.
15655 (walk_tree_1): Handle OMP_CLAUSE_FILTER.
15656 * tree-nested.c (convert_nonlocal_omp_clauses,
15657 convert_local_omp_clauses): Handle OMP_CLAUSE_FILTER.
15658 (convert_nonlocal_reference_stmt, convert_local_reference_stmt,
15659 convert_gimple_call): Handle GIMPLE_OMP_MASTER.
15660 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_FILTER.
15661 (dump_generic_node): Handle OMP_MASTER.
15662 * gimple.def (GIMPLE_OMP_MASKED): New gimple code.
15663 * gimple.c (gimple_build_omp_masked): New function.
15664 (gimple_copy): Handle GIMPLE_OMP_MASKED.
15665 * gimple.h (gimple_build_omp_masked): Declare.
15666 (gimple_has_substatements): Handle GIMPLE_OMP_MASKED.
15667 (gimple_omp_masked_clauses, gimple_omp_masked_clauses_ptr,
15668 gimple_omp_masked_set_clauses): New inline functions.
15669 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_MASKED.
15670 * gimple-pretty-print.c (dump_gimple_omp_masked): New function.
15671 (pp_gimple_stmt_1): Handle GIMPLE_OMP_MASKED.
15672 * gimple-walk.c (walk_gimple_stmt): Likewise.
15673 * gimple-low.c (lower_stmt): Likewise.
15674 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
15675 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_FILTER. For clauses
15676 that take one expression rather than decl or constant, force
15677 gimplification of that into a SSA_NAME or temporary unless min
15679 (gimplify_adjust_omp_clauses): Handle OMP_CLAUSE_FILTER.
15680 (gimplify_expr): Handle OMP_MASKED.
15681 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_MASKED.
15682 (estimate_num_insns): Likewise.
15683 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_FILTER.
15684 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_MASKED. Adjust
15685 diagnostics for existence of masked construct.
15686 (scan_omp_1_stmt, lower_omp_master, lower_omp_1, diagnose_sb_1,
15687 diagnose_sb_2): Handle GIMPLE_OMP_MASKED.
15688 * omp-expand.c (expand_omp_synch, expand_omp, omp_make_gimple_edges):
15691 2021-08-12 Uroš Bizjak <ubizjak@gmail.com>
15694 * config/i386/i386.md (avx512f_scalef<mode>2): New insn pattern.
15695 (ldexp<mode>3): Use avx512f_scalef<mode>2.
15696 (UNSPEC_SCALEF): Move from ...
15697 * config/i386/sse.md (UNSPEC_SCALEF): ... here.
15699 2021-08-12 Jan Hubicka <hubicka@ucw.cz>
15701 * ipa-split.c (consider_split): Fix condition testing void functions.
15703 2021-08-12 Aldy Hernandez <aldyh@redhat.com>
15705 * doc/invoke.texi: Remove docs for threader-mode param.
15706 * flag-types.h (enum threader_mode): Remove.
15707 * params.opt: Remove threader-mode param.
15708 * tree-ssa-threadbackward.c (class back_threader): Remove
15709 path_is_unreachable_p.
15710 Make find_paths private.
15711 Add maybe_thread and thread_through_all_blocks.
15712 Remove reference marker for m_registry.
15713 Remove reference marker for m_profit.
15714 (back_threader::back_threader): Adjust for registry and profit not
15716 (dump_path): Move down.
15717 (debug): Move down.
15718 (class thread_jumps): Remove.
15719 (class back_threader_registry): Remove m_all_paths.
15721 (thread_jumps::thread_through_all_blocks): Move to back_threader
15723 (fsm_find_thread_path): Remove.
15724 (back_threader::maybe_thread): New.
15725 (back_threader::thread_through_all_blocks): Move from
15727 (back_threader_registry::back_threader_registry): Remove
15729 (back_threader_registry::~back_threader_registry): Remove.
15730 (thread_jumps::find_taken_edge): Remove.
15731 (thread_jumps::check_subpath_and_update_thread_path): Remove.
15732 (thread_jumps::maybe_register_path): Remove.
15733 (thread_jumps::handle_phi): Remove.
15734 (handle_assignment_p): Remove.
15735 (thread_jumps::handle_assignment): Remove.
15736 (thread_jumps::fsm_find_control_statement_thread_paths): Remove.
15737 (thread_jumps::find_jump_threads_backwards): Remove.
15738 (thread_jumps::find_jump_threads_backwards_with_ranger): Remove.
15739 (try_thread_blocks): Rename find_jump_threads_backwards to
15741 (pass_early_thread_jumps::execute): Same.
15743 2021-08-12 Tobias Burnus <tobias@codesourcery.com>
15745 * tree-core.h (omp_clause_proc_bind_kind): Add
15746 OMP_CLAUSE_PROC_BIND_PRIMARY.
15747 * tree-pretty-print.c (dump_omp_clause): Add TODO comment to
15748 change 'master' to 'primary' in proc_bind for OpenMP 5.1.
15750 2021-08-12 Claudiu Zissulescu <claziss@synopsys.com>
15752 * common/config/arc/arc-common.c (arc_option_init_struct): Remove
15753 fno-common reference.
15754 * config/arc/arc.c (arc_override_options): Remove overriding of
15757 2021-08-12 Jakub Jelinek <jakub@redhat.com>
15760 * config/i386/i386-expand.c (ix86_expand_vec_one_operand_perm_avx512):
15761 If d->testing_p, return true after performing checks instead of
15762 actually expanding the insn.
15763 (expand_vec_perm_broadcast_1): Handle V32HImode - assert
15764 !TARGET_AVX512BW and return false.
15766 2021-08-12 Eric Botcazou <ebotcazou@gcc.gnu.org>
15768 * configure.ac (PE linker --disable-dynamicbase support): New check.
15769 * configure: Regenerate.
15770 * config.in: Likewise.
15771 * config/i386/mingw32.h (LINK_SPEC_DISABLE_DYNAMICBASE): New define.
15772 (LINK_SPEC): Use it.
15773 * config/i386/mingw-w64.h (LINK_SPEC_DISABLE_DYNAMICBASE): Likewise.
15774 (LINK_SPEC): Likewise.
15776 2021-08-12 liuhongt <hongtao.liu@intel.com>
15779 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_2): New
15780 post_reload define_insn_and_split.
15781 (*avx512bw_zero_extendv32qiv32hi2_2): Ditto.
15782 (*sse4_1_zero_extendv8qiv8hi2_4): Ditto.
15783 (*avx512f_zero_extendv16hiv16si2_2): Ditto.
15784 (*avx2_zero_extendv8hiv8si2_2): Ditto.
15785 (*sse4_1_zero_extendv4hiv4si2_4): Ditto.
15786 (*avx512f_zero_extendv8siv8di2_2): Ditto.
15787 (*avx2_zero_extendv4siv4di2_2): Ditto.
15788 (*sse4_1_zero_extendv2siv2di2_4): Ditto.
15789 (VI248_256, VI248_512, VI148_512, VI148_256, VI148_128): New
15792 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15794 * config/rs6000/rs6000-builtin-new.def: Add always, power5, and
15797 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15799 * config/rs6000/rs6000-builtin-new.def: Add vsx stanza.
15801 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15803 * config/rs6000/rs6000-builtin-new.def: Finish altivec stanza.
15804 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
15805 initialization of pcvoid_type_node here...
15806 (altivec_init_builtins): ...from here.
15807 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add
15808 RS6000_BTI_const_ptr_void.
15809 (pcvoid_type_node): New macro.
15811 2021-08-11 Richard Biener <rguenther@suse.de>
15814 * tree-ssa-forwprop.c (pass_forwprop::execute): Do not decompose
15815 hard-register accesses.
15817 2021-08-11 Richard Biener <rguenther@suse.de>
15819 * tree-ssa-operands.c (operands_scanner::get_expr_operands):
15820 Do not look at COMPONENT_REF FIELD_DECLs TREE_THIS_VOLATILE
15821 to determine has_volatile_ops.
15823 2021-08-11 Eric Botcazou <ebotcazou@gcc.gnu.org>
15825 * cfgexpand.c (expand_used_vars): Reuse attribs local variable.
15827 2021-08-11 Jan Hubicka <hubicka@ucw.cz>
15828 Alexandre Oliva <oliva@adacore.com>
15830 * ipa-modref.c (modref_lattice::dump): Fix escape_point's min_flags
15832 (modref_lattice::merge_deref): Fix handling of indirect escape points.
15833 (update_escape_summary_1): Likewise.
15834 (update_escape_summary): Likewise.
15835 (ipa_merge_modref_summary_after_inlining): Likewise.
15837 2021-08-11 Richard Biener <rguenther@suse.de>
15839 PR middle-end/101858
15840 * fold-const.c (fold_binary_loc): Guard simplification
15841 of X < (cast) (1 << Y) to integer types.
15843 2021-08-11 Richard Biener <rguenther@suse.de>
15845 PR tree-optimization/101861
15846 * tree-vect-stmts.c (vectorizable_load): Fix error in
15847 previous change with regard to gather vectorization.
15849 2021-08-11 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
15852 * config/arm/arm_neon.h (vdup_n_s8): Replace call to builtin
15854 (vdup_n_s16): Likewise.
15855 (vdup_n_s32): Likewise.
15856 (vdup_n_s64): Likewise.
15857 (vdup_n_u8): Likewise.
15858 (vdup_n_u16): Likewise.
15859 (vdup_n_u32): Likewise.
15860 (vdup_n_u64): Likewise.
15861 (vdup_n_p8): Likewise.
15862 (vdup_n_p16): Likewise.
15863 (vdup_n_p64): Likewise.
15864 (vdup_n_f16): Likewise.
15865 (vdup_n_f32): Likewise.
15866 (vdupq_n_s8): Likewise.
15867 (vdupq_n_s16): Likewise.
15868 (vdupq_n_s32): Likewise.
15869 (vdupq_n_s64): Likewise.
15870 (vdupq_n_u8): Likewise.
15871 (vdupq_n_u16): Likewise.
15872 (vdupq_n_u32): Likewise.
15873 (vdupq_n_u64): Likewise.
15874 (vdupq_n_p8): Likewise.
15875 (vdupq_n_p16): Likewise.
15876 (vdupq_n_p64): Likewise.
15877 (vdupq_n_f16): Likewise.
15878 (vdupq_n_f32): Likewise.
15879 (vmov_n_s8): Replace call to builtin with call to corresponding
15881 (vmov_n_s16): Likewise.
15882 (vmov_n_s32): Likewise.
15883 (vmov_n_s64): Likewise.
15884 (vmov_n_u8): Likewise.
15885 (vmov_n_u16): Likewise.
15886 (vmov_n_u32): Likewise.
15887 (vmov_n_u64): Likewise.
15888 (vmov_n_p8): Likewise.
15889 (vmov_n_p16): Likewise.
15890 (vmov_n_f16): Likewise.
15891 (vmov_n_f32): Likewise.
15892 (vmovq_n_s8): Likewise.
15893 (vmovq_n_s16): Likewise.
15894 (vmovq_n_s32): Likewise.
15895 (vmovq_n_s64): Likewise.
15896 (vmovq_n_u8): Likewise.
15897 (vmovq_n_u16): Likewise.
15898 (vmovq_n_u32): Likewise.
15899 (vmovq_n_u64): Likewise.
15900 (vmovq_n_p8): Likewise.
15901 (vmovq_n_p16): Likewise.
15902 (vmovq_n_f16): Likewise.
15903 (vmovq_n_f32): Likewise.
15904 * config/arm/arm_neon_builtins.def: Remove entries for vdup_n.
15906 2021-08-11 liuhongt <hongtao.liu@intel.com>
15909 * config/i386/i386.md (ldexp<mode>3): Extend to vscalefs[sd]
15910 when TARGET_AVX512F and TARGET_SSE_MATH.
15912 2021-08-10 Jakub Jelinek <jakub@redhat.com>
15915 * config/i386/i386-expand.c (expand_vec_perm_even_odd): Return false
15916 for V32HImode if !TARGET_AVX512BW.
15917 (ix86_vectorize_vec_perm_const) <case E_V32HImode, case E_V64QImode>:
15918 If !TARGET_AVX512BW and TARGET_AVX512F and d.testing_p, don't fail
15919 early, but actually check the permutation.
15921 2021-08-10 Richard Biener <rguenther@suse.de>
15923 PR tree-optimization/101809
15924 * tree-vect-stmts.c (get_load_store_type): Allow emulated
15925 gathers with offset vector nunits being a constant multiple
15926 of the data vector nunits.
15927 (vect_get_gather_scatter_ops): Use the appropriate nunits
15928 for the offset vector defs.
15929 (vectorizable_store): Adjust call to
15930 vect_get_gather_scatter_ops.
15931 (vectorizable_load): Likewise. Handle the case of less
15932 offset vectors than data vectors.
15934 2021-08-10 Jakub Jelinek <jakub@redhat.com>
15937 * config/i386/sse.md (*avx512f_shuf_<shuffletype>64x2_1<mask_name>_1,
15938 *avx512f_shuf_<shuffletype>32x4_1<mask_name>_1): New define_insn
15941 2021-08-10 Richard Biener <rguenther@suse.de>
15943 PR tree-optimization/101801
15944 PR tree-optimization/101819
15945 * tree-vectorizer.h (vect_emulated_vector_p): Declare.
15946 * tree-vect-loop.c (vect_emulated_vector_p): New function.
15947 (vectorizable_reduction): Re-instantiate a check for emulated
15949 * tree-vect-stmts.c (vectorizable_shift): Likewise.
15950 (vectorizable_operation): Likewise. Cost emulated vector
15951 operations according to the scalar sequence synthesized by
15954 2021-08-10 Richard Biener <rguenther@suse.de>
15956 PR middle-end/101824
15957 * tree-nested.c (get_frame_field): Mark the COMPONENT_REF as
15958 volatile in case the variable was.
15960 2021-08-10 H.J. Lu <hjl.tools@gmail.com>
15963 * config/i386/constraints.md (BC): Document for integer SSE
15964 constant all bits set operand.
15965 (BF): New constraint for const floating-point all bits set
15967 * config/i386/i386.c (standard_sse_constant_p): Likewise.
15968 (standard_sse_constant_opcode): Likewise.
15969 * config/i386/sse.md (sseconstm1): New mode attribute.
15970 (mov<mode>_internal): Replace BC with <sseconstm1>.
15972 2021-08-10 liuhongt <hongtao.liu@intel.com>
15974 * config/i386/sse.md (cond_<insn><mode>): New expander.
15975 (VI248_AVX512VLBW): New mode iterator.
15976 * config/i386/predicates.md
15977 (nonimmediate_or_const_vec_dup_operand): New predicate.
15979 2021-08-09 Andrew MacLeod <amacleod@redhat.com>
15981 PR tree-optimization/101741
15982 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Check
15983 type of parameter for toupper/tolower.
15985 2021-08-09 Martin Jambor <mjambor@suse.cz>
15987 PR testsuite/101654
15988 * ipa-prop.c (propagate_controlled_uses): Removed a spurious space.
15990 2021-08-09 Pat Haugen <pthaugen@linux.ibm.com>
15992 * config/rs6000/rs6000.c (is_load_insn1): Verify destination is a
15994 (is_store_insn1): Verify source is a register.
15996 2021-08-09 Uroš Bizjak <ubizjak@gmail.com>
15999 * config/i386/mmx.md (<any_logic:code>v2sf3):
16000 Rename from *mmx_<any_logic:code>v2sf3.
16002 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16004 * config/nvptx/nvptx.c: Cross-reference parts adapted in
16005 'gcc/omp-oacc-neuter-broadcast.cc'.
16006 * omp-low.c: Likewise.
16007 * omp-oacc-neuter-broadcast.cc: Cross-reference parts adapted from
16010 2021-08-09 Julian Brown <julian@codesourcery.com>
16011 Kwok Cheung Yeung <kcy@codesourcery.com>
16012 Thomas Schwinge <thomas@codesourcery.com>
16014 * config/gcn/gcn.c (gcn_init_builtins): Override decls for
16015 BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START,
16016 BUILT_IN_GOACC_SINGLE_COPY_END and BUILT_IN_GOACC_BARRIER.
16017 (gcn_goacc_validate_dims): Turn on worker partitioning unconditionally.
16018 (gcn_fork_join): Update comment.
16019 * config/gcn/gcn.opt (flag_worker_partitioning): Remove.
16020 (macc_experimental_workers): Remove unused option.
16022 2021-08-09 Julian Brown <julian@codesourcery.com>
16023 Nathan Sidwell <nathan@codesourcery.com> (via 'gcc/config/nvptx/nvptx.c' master)
16024 Kwok Cheung Yeung <kcy@codesourcery.com>
16025 Thomas Schwinge <thomas@codesourcery.com>
16027 * Makefile.in (OBJS): Add omp-oacc-neuter-broadcast.o.
16028 * doc/tm.texi.in (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD):
16029 Add documentation hook.
16030 * doc/tm.texi: Regenerate.
16031 * omp-oacc-neuter-broadcast.cc: New file.
16032 * omp-builtins.def (BUILT_IN_GOACC_BARRIER)
16033 (BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START)
16034 (BUILT_IN_GOACC_SINGLE_COPY_END): New builtins.
16035 * passes.def (pass_omp_oacc_neuter_broadcast): Add pass.
16036 * target.def (goacc.create_worker_broadcast_record): Add target
16038 * tree-pass.h (make_pass_omp_oacc_neuter_broadcast): Add
16040 * config/gcn/gcn-protos.h (gcn_goacc_adjust_propagation_record):
16041 Rename prototype to...
16042 (gcn_goacc_create_worker_broadcast_record): ... this.
16043 * config/gcn/gcn-tree.c (gcn_goacc_adjust_propagation_record): Rename
16045 (gcn_goacc_create_worker_broadcast_record): ... this.
16046 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_PROPAGATION_RECORD):
16048 (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD): ... this.
16050 2021-08-09 Tejas Belagod <tejas.belagod@arm.com>
16053 * config/aarch64/aarch64-simd.md (vlshr<mode>3, vashr<mode>3): Use
16054 the right iterator.
16056 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16058 * Makefile.in (GTFILES): Remove '$(srcdir)/omp-offload.c'.
16060 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16062 * builtins.def (DEF_GOACC_BUILTIN, DEF_GOMP_BUILTIN): Don't
16063 consider '-foffload-abi'.
16064 * common.opt (-foffload-abi): Remove 'Var', 'Init'.
16065 * opts.c (common_handle_option) <-foffload-abi> [ACCEL_COMPILER]:
16068 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16070 * optc-gen.awk: Sanity check that 'Init' doesn't appear without
16073 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16075 * omp-builtins.def (BUILT_IN_ACC_GET_DEVICE_TYPE): Remove.
16077 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16079 * doc/gty.texi (Files): Update.
16081 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16083 * doc/gty.texi (Files): Fix GTY header file example.
16085 2021-08-09 Roger Sayle <roger@nextmovesoftware.com>
16087 * tree-ssa-ccp.c (value_mask_to_min_max): Helper function to
16088 determine the upper and lower bounds from a mask-value pair.
16089 (bit_value_unop) [ABS_EXPR, ABSU_EXPR]: Add support for
16090 absolute value and unsigned absolute value expressions.
16091 (bit_value_binop): Initialize *VAL's precision.
16092 [LT_EXPR, LE_EXPR]: Use value_mask_to_min_max to determine
16093 upper and lower bounds of operands. Add LE_EXPR/GE_EXPR
16094 support when the operands are unknown but potentially equal.
16095 [MIN_EXPR, MAX_EXPR]: Support minimum/maximum expressions.
16097 2021-08-09 Bin Cheng <bin.cheng@linux.alibaba.com>
16099 * config/aarch64/aarch64.md
16100 (*extend<SHORT:mode><GPI:mode>2_aarch64): Use %<GPI:w>0.
16102 2021-08-08 Sergei Trofimovich <siarheit@google.com>
16104 * lra-constraints.c: Fix s/otput/output/ typo.
16106 2021-08-06 Martin Sebor <msebor@redhat.com>
16108 * builtins.c (expand_builtin_memchr): Move to gimple-ssa-warn-access.cc.
16109 (expand_builtin_strcat): Same.
16110 (expand_builtin_stpncpy): Same.
16111 (expand_builtin_strncat): Same.
16112 (check_read_access): Same.
16113 (check_memop_access): Same.
16114 (expand_builtin_strlen): Move checks to gimple-ssa-warn-access.cc.
16115 (expand_builtin_strnlen): Same.
16116 (expand_builtin_memcpy): Same.
16117 (expand_builtin_memmove): Same.
16118 (expand_builtin_mempcpy): Same.
16119 (expand_builtin_strcpy): Same.
16120 (expand_builtin_strcpy_args): Same.
16121 (expand_builtin_stpcpy_1): Same.
16122 (expand_builtin_strncpy): Same.
16123 (expand_builtin_memset): Same.
16124 (expand_builtin_bzero): Same.
16125 (expand_builtin_strcmp): Same.
16126 (expand_builtin_strncmp): Same.
16127 (expand_builtin): Remove handlers.
16128 (fold_builtin_strlen): Add a comment.
16129 * builtins.h (check_access): Move to gimple-ssa-warn-access.cc.
16130 * calls.c (maybe_warn_nonstring_arg): Same.
16131 * diagnostic-spec.c (nowarn_spec_t::nowarn_spec_t): Add warning option.
16132 * gimple-fold.c (gimple_fold_builtin_strcpy): Pass argument to callee.
16133 (gimple_fold_builtin_stpcpy): Same.
16134 * gimple-ssa-warn-access.cc (has_location): New function.
16135 (get_location): Same.
16136 (get_callee_fndecl): Same.
16137 (call_nargs): Same.
16139 (warn_string_no_nul): Define.
16140 (unterminated_array): Same.
16141 (check_nul_terminated_array): Same.
16142 (maybe_warn_nonstring_arg): Same.
16143 (maybe_warn_for_bound): Same.
16144 (warn_for_access): Same.
16145 (check_access): Same.
16146 (check_memop_access): Same.
16147 (check_read_access): Same.
16148 (warn_dealloc_offset): Use helper functions.
16149 (maybe_emit_free_warning): Same.
16150 (class pass_waccess): Add members.
16151 (check_strcat): New function.
16152 (check_strncat): New function.
16153 (check_stxcpy): New function.
16154 (check_stxncpy): New function.
16155 (check_strncmp): New function.
16156 (pass_waccess::check_builtin): New function.
16157 (pass_waccess::check): Call it.
16158 * gimple-ssa-warn-access.h (warn_string_no_nul): Move here from
16160 (maybe_warn_for_bound): Same.
16161 (check_access): Same.
16162 (check_memop_access): Same.
16163 (check_read_access): Same.
16164 * pointer-query.h (struct access_data): Define a ctor overload.
16166 2021-08-06 Richard Biener <rguenther@suse.de>
16168 PR tree-optimization/101801
16169 * tree-vectorizer.h (vect_worthwhile_without_simd_p): Rename...
16170 (vect_can_vectorize_without_simd_p): ... to this.
16171 * tree-vect-loop.c (vect_worthwhile_without_simd_p): Rename...
16172 (vect_can_vectorize_without_simd_p): ... to this and fold
16173 in vect_min_worthwhile_factor.
16174 (vect_min_worthwhile_factor): Remove.
16175 (vectorizable_reduction): Adjust and remove the cost part.
16176 * tree-vect-stmts.c (vectorizable_shift): Likewise.
16177 (vectorizable_operation): Likewise.
16179 2021-08-06 Uroš Bizjak <ubizjak@gmail.com>
16182 * config/i386/i386.md (cmove reg-to-reg move elimination peephole2s):
16183 Add general_reg_operand predicate to operand 3.
16185 2021-08-06 Roger Sayle <roger@nextmovesoftware.com>
16187 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern): Use
16188 CFN_BUILT_IN_CLRSB* instead of BUILT_IN_CLRSB* for consistency.
16190 2021-08-06 Tamar Christina <tamar.christina@arm.com>
16192 * config/aarch64/aarch64-sve-builtins.cc (register_svpattern,
16193 register_svprfop): Pass vec<> by pointer.
16194 * langhooks-def.h (lhd_simulate_enum_decl): Likewise.
16195 * langhooks.c (lhd_simulate_enum_decl): Likewise.
16196 * langhooks.h (struct lang_hooks_for_types): Likewise.
16198 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
16200 * config/aarch64/arm_neon.h (vst1_bf16_x2): Use
16201 __builtin_memcpy instead of constructing an additional
16202 __builtin_aarch64_simd_oi one vector at a time.
16203 (vst1q_bf16_x2): Likewise.
16204 (vst1_bf16_x3): Use __builtin_memcpy instead of constructing
16205 an additional __builtin_aarch64_simd_ci one vector at a time.
16206 (vst1q_bf16_x3): Likewise.
16207 (vst1_bf16_x4): Use __builtin_memcpy instead of a union.
16208 (vst1q_bf16_x4): Likewise.
16209 (vst2_bf16): Use __builtin_memcpy instead of constructing an
16210 additional __builtin_aarch64_simd_oi one vector at a time.
16211 (vst2q_bf16): Likewise.
16212 (vst3_bf16): Use __builtin_memcpy instead of constructing an
16213 additional __builtin_aarch64_simd_ci mode one vector at a
16215 (vst3q_bf16): Likewise.
16216 (vst4_bf16): Use __builtin_memcpy instead of constructing an
16217 additional __builtin_aarch64_simd_xi one vector at a time.
16218 (vst4q_bf16): Likewise.
16220 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
16222 * config/aarch64/arm_neon.h (__ST2_LANE_FUNC): Delete.
16223 (__ST2Q_LANE_FUNC): Delete.
16224 (vst2_lane_f16): Use __builtin_memcpy to copy vector
16225 structure instead of constructing __builtin_aarch64_simd_oi
16226 one vector at a time.
16227 (vst2_lane_f32): Likewise.
16228 (vst2_lane_f64): Likewise.
16229 (vst2_lane_p8): Likewise.
16230 (vst2_lane_p16): Likewise.
16231 (vst2_lane_p64): Likewise.
16232 (vst2_lane_s8): Likewise.
16233 (vst2_lane_s16): Likewise.
16234 (vst2_lane_s32): Likewise.
16235 (vst2_lane_s64): Likewise.
16236 (vst2_lane_u8): Likewise.
16237 (vst2_lane_u16): Likewise.
16238 (vst2_lane_u32): Likewise.
16239 (vst2_lane_u64): Likewise.
16240 (vst2_lane_bf16): Likewise.
16241 (vst2q_lane_f16): Use __builtin_memcpy to copy vector
16242 structure instead of using a union.
16243 (vst2q_lane_f32): Likewise.
16244 (vst2q_lane_f64): Likewise.
16245 (vst2q_lane_p8): Likewise.
16246 (vst2q_lane_p16): Likewise.
16247 (vst2q_lane_p64): Likewise.
16248 (vst2q_lane_s8): Likewise.
16249 (vst2q_lane_s16): Likewise.
16250 (vst2q_lane_s32): Likewise.
16251 (vst2q_lane_s64): Likewise.
16252 (vst2q_lane_u8): Likewise.
16253 (vst2q_lane_u16): Likewise.
16254 (vst2q_lane_u32): Likewise.
16255 (vst2q_lane_u64): Likewise.
16256 (vst2q_lane_bf16): Likewise.
16258 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
16260 * config/aarch64/arm_neon.h (__ST3_LANE_FUNC): Delete.
16261 (__ST3Q_LANE_FUNC): Delete.
16262 (vst3_lane_f16): Use __builtin_memcpy to copy vector
16263 structure instead of constructing __builtin_aarch64_simd_ci
16264 one vector at a time.
16265 (vst3_lane_f32): Likewise.
16266 (vst3_lane_f64): Likewise.
16267 (vst3_lane_p8): Likewise.
16268 (vst3_lane_p16): Likewise.
16269 (vst3_lane_p64): Likewise.
16270 (vst3_lane_s8): Likewise.
16271 (vst3_lane_s16): Likewise.
16272 (vst3_lane_s32): Likewise.
16273 (vst3_lane_s64): Likewise.
16274 (vst3_lane_u8): Likewise.
16275 (vst3_lane_u16): Likewise.
16276 (vst3_lane_u32): Likewise.
16277 (vst3_lane_u64): Likewise.
16278 (vst3_lane_bf16): Likewise.
16279 (vst3q_lane_f16): Use __builtin_memcpy to copy vector
16280 structure instead of using a union.
16281 (vst3q_lane_f32): Likewise.
16282 (vst3q_lane_f64): Likewise.
16283 (vst3q_lane_p8): Likewise.
16284 (vst3q_lane_p16): Likewise.
16285 (vst3q_lane_p64): Likewise.
16286 (vst3q_lane_s8): Likewise.
16287 (vst3q_lane_s16): Likewise.
16288 (vst3q_lane_s32): Likewise.
16289 (vst3q_lane_s64): Likewise.
16290 (vst3q_lane_u8): Likewise.
16291 (vst3q_lane_u16): Likewise.
16292 (vst3q_lane_u32): Likewise.
16293 (vst3q_lane_u64): Likewise.
16294 (vst3q_lane_bf16): Likewise.
16296 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
16298 * config/aarch64/arm_neon.h (__ST4_LANE_FUNC): Delete.
16299 (__ST4Q_LANE_FUNC): Delete.
16300 (vst4_lane_f16): Use __builtin_memcpy to copy vector
16301 structure instead of constructing __builtin_aarch64_simd_xi
16302 one vector at a time.
16303 (vst4_lane_f32): Likewise.
16304 (vst4_lane_f64): Likewise.
16305 (vst4_lane_p8): Likewise.
16306 (vst4_lane_p16): Likewise.
16307 (vst4_lane_p64): Likewise.
16308 (vst4_lane_s8): Likewise.
16309 (vst4_lane_s16): Likewise.
16310 (vst4_lane_s32): Likewise.
16311 (vst4_lane_s64): Likewise.
16312 (vst4_lane_u8): Likewise.
16313 (vst4_lane_u16): Likewise.
16314 (vst4_lane_u32): Likewise.
16315 (vst4_lane_u64): Likewise.
16316 (vst4_lane_bf16): Likewise.
16317 (vst4q_lane_f16): Use __builtin_memcpy to copy vector
16318 structure instead of using a union.
16319 (vst4q_lane_f32): Likewise.
16320 (vst4q_lane_f64): Likewise.
16321 (vst4q_lane_p8): Likewise.
16322 (vst4q_lane_p16): Likewise.
16323 (vst4q_lane_p64): Likewise.
16324 (vst4q_lane_s8): Likewise.
16325 (vst4q_lane_s16): Likewise.
16326 (vst4q_lane_s32): Likewise.
16327 (vst4q_lane_s64): Likewise.
16328 (vst4q_lane_u8): Likewise.
16329 (vst4q_lane_u16): Likewise.
16330 (vst4q_lane_u32): Likewise.
16331 (vst4q_lane_u64): Likewise.
16332 (vst4q_lane_bf16): Likewise.
16334 2021-08-06 Martin Liska <mliska@suse.cz>
16336 * config/rs6000/rs6000.c (rs6000_option_override_internal): When
16337 a target option is restored, it can have
16338 rs6000_long_double_type_size set to FLOAT_PRECISION_TFmode
16339 and error should not be emitted.
16341 2021-08-06 Sebastian Huber <sebastian.huber@embedded-brains.de>
16343 * gcov-io.h (gcov_write): Declare.
16344 * gcov-io.c (gcov_write): New.
16345 (gcov_write_counter): Remove.
16346 (gcov_write_tag_length): Likewise.
16347 (gcov_write_summary): Replace gcov_write_tag_length() with calls to
16348 gcov_write_unsigned().
16349 * doc/invoke.texi (fprofile-info-section): Mention
16350 __gcov_info_to_gcda().
16352 2021-08-06 Martin Sebor <msebor@redhat.com>
16354 * dominance.c (prune_bbs_to_update_dominators): Adjust by-value vec
16355 arguments to by-reference.
16356 (iterate_fix_dominators): Same.
16357 * dominance.h (iterate_fix_dominators): Same.
16358 * ipa-prop.h: Call auto_vec::to_vec_legacy.
16359 * tree-data-ref.c (dump_data_dependence_relation): Adjust by-value vec
16360 arguments to by-reference.
16361 (debug_data_dependence_relation): Same.
16362 (dump_data_dependence_relations): Same.
16363 * tree-data-ref.h (debug_data_dependence_relation): Same.
16364 (dump_data_dependence_relations): Same.
16365 * tree-predcom.c (dump_chains): Same.
16366 (initialize_root_vars_lm): Same.
16367 (determine_unroll_factor): Same.
16368 (replace_phis_by_defined_names): Same.
16369 (insert_init_seqs): Same.
16370 (pcom_worker::tree_predictive_commoning_loop): Call
16371 auto_vec::to_vec_legacy.
16372 * tree-ssa-pre.c (insert_into_preds_of_block): Adjust by-value vec
16373 arguments to by-reference.
16374 * tree-ssa-threadbackward.c (populate_worklist): Same.
16375 (back_threader::resolve_def): Same.
16376 * tree-vect-data-refs.c (vect_check_nonzero_value): Same.
16377 (vect_enhance_data_refs_alignment): Same.
16378 (vect_check_lower_bound): Same.
16379 (vect_prune_runtime_alias_test_list): Same.
16380 (vect_permute_store_chain): Same.
16381 * tree-vect-slp-patterns.c (vect_normalize_conj_loc): Same.
16382 * tree-vect-stmts.c (vect_create_vectorized_demotion_stmts): Same.
16383 * tree-vectorizer.h (vect_permute_store_chain): Same.
16384 * vec.c (test_init): New function.
16385 (vec_c_tests): Call new function.
16386 * vec.h (vec): Declare ctors, dtor, and assignment.
16387 (auto_vec::to_vec_legacy): New function.
16388 (vec::copy): Adjust initialization.
16390 2021-08-05 H.J. Lu <hjl.tools@gmail.com>
16393 * config/i386/i386.c (ix86_can_inline_p): Ignore MASK_80387 if
16394 callee only uses GPRs.
16395 * config/i386/ia32intrin.h: Revert commit 5463cee2770.
16396 * config/i386/serializeintrin.h: Revert commit 71958f740f1.
16397 * config/i386/x86gprintrin.h: Add
16398 #pragma GCC target("general-regs-only") and #pragma GCC pop_options
16399 to disable non-GPR ISAs.
16401 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
16403 PR middle-end/101787
16404 * doc/md.texi (cond_ashl, cond_ashr, cond_lshr): Document.
16406 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
16408 * tree-vectorizer.h (vect_is_store_elt_extraction, vect_is_reduction)
16409 (vect_reduc_type, vect_embedded_comparison_type, vect_comparison_type)
16410 (vect_is_extending_load, vect_is_integer_truncation): New functions,
16411 moved from aarch64.c but given different names.
16412 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction)
16413 (aarch64_is_reduction, aarch64_reduc_type)
16414 (aarch64_embedded_comparison_type, aarch64_comparison_type)
16415 (aarch64_extending_load_p, aarch64_integer_truncation_p): Delete
16416 in favor of the above. Update callers accordingly.
16418 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
16421 * config/arm/arm-cpus.in (generic-armv7-a): Add quirk to suppress
16422 writing .cpu directive in asm output.
16423 * config/arm/arm.c (arm_identify_fpu_from_isa): New variable.
16424 (arm_last_printed_arch_string): Delete.
16425 (arm_last_printed_fpu_string): Delete.
16426 (arm_configure_build_target): If use of floating-point/SIMD is
16427 disabled, remove all fp/simd related features from the target ISA.
16428 (last_arm_targ_options): New variable.
16429 (arm_print_asm_arch_directives): Add new parameters. Change order
16430 of emitted directives and handle all cases here.
16431 (arm_file_start): Always call arm_print_asm_arch_directives, move
16432 all generation of .arch/.arch_extension here.
16433 (arm_file_end): Call arm_print_asm_arch.
16434 (arm_declare_function_name): Call arm_print_asm_arch_directives
16435 instead of printing .arch/.fpu directives directly.
16437 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
16439 * config/arm/arm.c (arm_configure_build_target): Don't call
16440 arm_option_reconfigure_globals.
16441 (arm_option_restore): Call arm_option_reconfigure_globals after
16442 reconfiguring the target.
16443 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
16445 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
16447 * config/arm/arm.c (arm_configure_build_target): Ensure the target's
16448 arch_name is always set.
16450 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
16452 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
16453 of vec_select high-half from being added into Neon subtract
16456 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
16458 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
16459 of vec_select high-half from being added into Neon add cost.
16461 2021-08-05 Kewen Lin <linkw@linux.ibm.com>
16463 * cfgloop.h (loops_list::loops_list): Add one optional argument
16464 root and adjust accordingly, update loop tree walking and factor
16466 * cfgloop.c (loops_list::walk_loop_tree): ... this. New function.
16468 2021-08-05 Eric Botcazou <ebotcazou@gcc.gnu.org>
16470 PR tree-optimization/101626
16471 * tree-sra.c (propagate_subaccesses_from_rhs): Do not set the
16472 reverse scalar storage order on a pointer or vector component.
16474 2021-08-05 liuhongt <hongtao.liu@intel.com>
16476 * config/i386/sse.md (cond_<code><mode>): New expander.
16478 2021-08-05 liuhongt <hongtao.liu@intel.com>
16480 * config/i386/sse.md (cond_<code><mode>): New expander.
16482 2021-08-05 liuhongt <hongtao.liu@intel.com>
16484 * config/i386/sse.md (cond_<code><mode>): New expander.
16486 2021-08-04 David Malcolm <dmalcolm@redhat.com>
16489 * Makefile.in (ANALYZER_OBJS): Add analyzer/region-model-asm.o.
16491 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
16494 * config/i386/i386.h (STORE_MAX_PIECES): Allow 16/32/64 bytes
16495 only if TARGET_INTER_UNIT_MOVES_TO_VEC is true.
16497 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
16500 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
16501 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
16502 data with SSE register from one memory location to another.
16504 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16506 * config/s390/s390.c (expand_perm_with_vpdi): New function.
16507 (vectorize_vec_perm_const_1): Call expand_perm_with_vpdi.
16508 * config/s390/vector.md (*vpdi1<mode>, @vpdi1<mode>): Enable a
16509 parameterized expander.
16510 (*vpdi4<mode>, @vpdi4<mode>): Likewise.
16512 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16514 * config/s390/s390.c (MAX_VECT_LEN): Define macro.
16515 (struct expand_vec_perm_d): Define struct.
16516 (expand_perm_with_merge): New function.
16517 (vectorize_vec_perm_const_1): New function.
16518 (s390_vectorize_vec_perm_const): New function.
16519 (TARGET_VECTORIZE_VEC_PERM_CONST): Define target macro.
16521 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16523 * config/s390/vector.md (V_HW_64): Remove mode iterator.
16524 (*vec_load_pair<mode>): Use V_HW_2 instead of V_HW_64.
16525 * config/s390/vx-builtins.md
16526 (vec_scatter_element<V_HW_2:mode>_SI): Use V_HW_2 instead of
16529 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16531 * config/s390/s390.md (UNSPEC_VEC_PERMI): Remove constant
16533 * config/s390/vector.md (*vpdi1<mode>, *vpdi4<mode>): New pattern
16535 * config/s390/vx-builtins.md (*vec_permi<mode>): Emit generic rtx
16536 instead of an unspec.
16538 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16540 * config/s390/s390-modes.def: Add more vector modes to support
16541 concatenation of two vectors.
16542 * config/s390/s390-protos.h (s390_expand_merge_perm_const): Add
16544 (s390_expand_merge): Likewise.
16545 * config/s390/s390.c (s390_expand_merge_perm_const): New function.
16546 (s390_expand_merge): New function.
16547 * config/s390/s390.md (UNSPEC_VEC_MERGEH, UNSPEC_VEC_MERGEL):
16548 Remove constant definitions.
16549 * config/s390/vector.md (V_HW_2): Add mode iterators.
16550 (VI_HW_4, V_HW_4): Rename VI_HW_4 to V_HW_4.
16551 (vec_2x_nelts, vec_2x_wide): New mode attributes.
16552 (*vmrhb, *vmrlb, *vmrhh, *vmrlh, *vmrhf, *vmrlf, *vmrhg, *vmrlg):
16553 New pattern definitions.
16554 (vec_widen_umult_lo_<mode>, vec_widen_umult_hi_<mode>)
16555 (vec_widen_smult_lo_<mode>, vec_widen_smult_hi_<mode>)
16556 (vec_unpacks_lo_v4sf, vec_unpacks_hi_v4sf, vec_unpacks_lo_v2df)
16557 (vec_unpacks_hi_v2df): Adjust expanders to emit non-unspec RTX for
16559 * config/s390/vx-builtins.md (V_HW_4): Remove mode iterator. Now
16561 (vec_mergeh<mode>, vec_mergel<mode>): Use s390_expand_merge to
16562 emit vec merge pattern.
16564 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
16566 * config/aarch64/aarch64.c (aarch64_strip_extend_vec_half):
16568 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent cost of
16569 vec_select high-half from being added into Neon multiply
16571 * rtlanal.c (vec_series_highpart_p): Define.
16572 * rtlanal.h (vec_series_highpart_p): Declare.
16574 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
16576 * config/aarch64/aarch64.c (aarch64_strip_duplicate_vec_elt):
16578 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent
16579 vec_select cost from being added into Neon multiply cost.
16581 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16583 * tree-vect-loop.c (vect_better_loop_vinfo_p): Detect cases in
16584 which old_loop_vinfo is an epilogue loop that handles a constant
16585 number of iterations.
16587 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16589 * tree-vect-loop.c (vect_analyze_loop): Print a dump message
16590 when a reanalyzed loop fails to be cheaper than the current
16593 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16595 * config/aarch64/aarch64.c: Fix a typo.
16597 2021-08-04 Vincent Lefèvre <vincent-gcc@vinc17.net>
16599 PR gcov-profile/101773
16600 * gcov-io.c (gcov_close): Check the return code of fclose.
16602 2021-08-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
16605 * dwarf2out.c (dwarf2out_assembly_start): Emit a dummy
16606 .file statement when needed.
16608 2021-08-04 Richard Biener <rguenther@suse.de>
16610 * tree-vect-data-refs.c (vect_check_gather_scatter):
16611 Include widening conversions only when the result is
16612 still handled by native gather or the current offset
16613 size does not already match the data size.
16614 Also succeed analysis in case there's no native support,
16615 noted by an IFN_LAST ifn and a NULL decl.
16616 (vect_analyze_data_refs): Always consider gathers.
16617 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
16618 Test for no IFN gather rather than decl gather.
16619 * tree-vect-stmts.c (vect_model_load_cost): Pass in the
16620 gather-scatter info and cost emulated gathers accordingly.
16621 (vect_truncate_gather_scatter_offset): Properly test for
16623 (vect_use_strided_gather_scatters_p): Likewise.
16624 (get_load_store_type): Handle emulated gathers and its
16626 (vectorizable_load): Likewise. Emulate them by extracting
16627 scalar offsets, doing scalar loads and a vector construct.
16629 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
16632 * expr.c (op_by_pieces_d::op_by_pieces_d): Add a max_pieces
16633 argument to set m_max_size.
16634 (move_by_pieces_d): Pass MOVE_MAX_PIECES to op_by_pieces_d.
16635 (store_by_pieces_d): Pass STORE_MAX_PIECES to op_by_pieces_d.
16636 (compare_by_pieces_d): Pass COMPARE_MAX_PIECES to op_by_pieces_d.
16638 2021-08-04 Roger Sayle <roger@nextmovesoftware.com>
16639 Marc Glisse <marc.glisse@inria.fr>
16641 * match.pd (bit_ior, bit_xor): Canonicalize (X*C1)|(X*C2) and
16642 (X*C1)^(X*C2) as X*(C1+C2), and related variants, using
16643 tree_nonzero_bits to ensure that operands are bit-wise disjoint.
16645 2021-08-04 Richard Biener <rguenther@suse.de>
16647 * tree-ssa-forwprop.c (pass_forwprop::execute): Split
16648 out code to decompose vector loads ...
16649 (optimize_vector_load): ... here. Generalize it to
16650 handle intermediate widening and TARGET_MEM_REF loads
16651 and apply it to loads with a supported vector mode as well.
16653 2021-08-04 Richard Biener <rguenther@suse.de>
16655 PR tree-optimization/101756
16656 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Make sure
16657 the result of the reduction epilogue is compatible with the original
16660 2021-08-04 liuhongt <hongtao.liu@intel.com>
16663 * config/i386/i386.md (peephole2): Refine predicate from
16664 register_operand to general_reg_operand.
16666 2021-08-04 Aldy Hernandez <aldyh@redhat.com>
16668 * gimple-range-path.h (path_range_query::dump): Mark override.
16670 2021-08-04 Richard Biener <rguenther@suse.de>
16672 PR tree-optimization/101769
16673 * tree-tailcall.c (eliminate_tail_call): Add the created loop
16674 for the first recursion and return it via the new output parameter.
16675 (optimize_tail_call): Pass through new output param.
16676 (tree_optimize_tail_calls_1): After creating all latches,
16677 add the created loop to the loop tree. Do not mark loops for fixup.
16679 2021-08-04 Martin Liska <mliska@suse.cz>
16681 * doc/invoke.texi: Document threader-mode param.
16683 2021-08-04 liuhongt <hongtao.liu@intel.com>
16685 * config/i386/sse.md (cond_fma<mode>): New expander.
16686 (cond_fms<mode>): Ditto.
16687 (cond_fnma<mode>): Ditto.
16688 (cond_fnms<mode>): Ditto.
16690 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
16692 * config/rs6000/vsx.md (*vsx_le_perm_store_<mode>): Use && instead of &.
16694 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
16696 * config/rs6000/constraints.md: Remove "e" from the list of available
16697 constraint characters.
16699 2021-08-03 Eugene Rozenfeld <erozen@microsoft.com>
16701 PR gcov-profile/71672
16702 * auto-profile.c (afdo_indirect_call): Fix setup of the histogram value for indirect calls.
16704 2021-08-03 Paul A. Clarke <pc@us.ibm.com>
16706 * config/rs6000/smmintrin.h (_mm_minpos_epu16): New.
16708 2021-08-03 H.J. Lu <hjl.tools@gmail.com>
16710 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode,
16711 try XMM31 to avoid vzeroupper.
16713 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16715 * doc/invoke.texi: Document -mtune=neoverse-512tvb and
16716 -mcpu=neoverse-512tvb.
16717 * config/aarch64/aarch64-cores.def (neoverse-512tvb): New entry.
16718 * config/aarch64/aarch64-tune.md: Regenerate.
16719 * config/aarch64/aarch64.c (neoverse512tvb_sve_vector_cost)
16720 (neoverse512tvb_sve_issue_info, neoverse512tvb_vec_issue_info)
16721 (neoverse512tvb_vector_cost, neoverse512tvb_tunings): New structures.
16722 (aarch64_adjust_body_cost_sve): Handle -mtune=neoverse-512tvb.
16723 (aarch64_adjust_body_cost): Likewise.
16725 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16727 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Only
16728 record issue information for operations that occur in the
16731 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16733 * config/aarch64/aarch64.c (aarch64_multiply_add_p): Add a vec_flags
16734 parameter. Detect cases in which an Advanced SIMD MLA would almost
16735 certainly require a MOV.
16736 (aarch64_count_ops): Update accordingly.
16738 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16740 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction): New
16741 function, split out from...
16742 (aarch64_detect_vector_stmt_subtype): ...here.
16743 (aarch64_add_stmt_cost): Treat extracting element 0 as free.
16745 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16747 * config/aarch64/aarch64-protos.h (sve_vec_cost):
16748 Add gather_load_x32_cost and gather_load_x64_cost.
16749 * config/aarch64/aarch64.c (generic_sve_vector_cost)
16750 (a64fx_sve_vector_cost, neoversev1_sve_vector_cost): Update
16751 accordingly, using the values given by the scalar_load * number
16752 of elements calculation that we used previously.
16753 (aarch64_detect_vector_stmt_subtype): Use the new fields.
16755 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16757 * config/aarch64/aarch64.c (aarch64_adjust_body_cost_sve): New
16758 function, split out from...
16759 (aarch64_adjust_body_cost): ...here.
16761 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16763 * config/aarch64/fractional-cost.h: New file.
16764 * config/aarch64/aarch64.c: Include <algorithm> (indirectly)
16765 and fractional-cost.h.
16766 (vec_cost_fraction): New typedef.
16767 (aarch64_detect_scalar_stmt_subtype): Use it for statement costs.
16768 (aarch64_detect_vector_stmt_subtype): Likewise.
16769 (aarch64_sve_adjust_stmt_cost, aarch64_adjust_stmt_cost): Likewise.
16770 (aarch64_estimate_min_cycles_per_iter): Use vec_cost_fraction
16772 (aarch64_adjust_body_cost): Likewise.
16773 (aarch64_test_cost_fraction): New function.
16774 (aarch64_run_selftests): Call it.
16776 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16778 * config/aarch64/aarch64-protos.h (tune_params::sve_width): Turn
16780 * config/aarch64/aarch64.c (aarch64_cmp_autovec_modes): Update
16782 (aarch64_estimated_poly_value): Likewise. Use the least significant
16783 set bit for the minimum and likely values. Use the most significant
16784 set bit for the maximum value.
16786 2021-08-03 liuhongt <hongtao.liu@intel.com>
16788 * config/i386/sse.md (cond_<insn><mode>): New expander.
16789 (cond_mul<mode>): Ditto.
16791 2021-08-03 Kewen Lin <linkw@linux.ibm.com>
16793 * tree-cfg.c (move_sese_region_to_fn): Fix typos on dloop.
16795 2021-08-03 liuhongt <hongtao.liu@intel.com>
16797 * config/i386/sse.md (cond_<insn><mode>): New expander.
16798 (cond_mul<mode>): Ditto.
16799 (cond_div<mode>): Ditto.
16801 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16803 * config/i386/i386.c (ix86_finalize_stack_frame_flags): Also
16804 check stack_realign_needed for stack realignment.
16805 (ix86_legitimate_constant_p): Always allow CONST_WIDE_INT smaller
16806 than the largest integer supported by vector register.
16807 * config/i386/i386.h (MAX_MOVE_MAX): New. Set to 64.
16808 (MOVE_MAX): Set to bytes of the largest integer supported by
16810 (STORE_MAX_PIECES): New.
16812 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16814 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
16815 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
16816 data from one memory location to another.
16818 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16820 PR middle-end/90773
16821 * config/i386/i386.c (TARGET_GEN_MEMSET_SCRATCH_RTX): New.
16823 2021-08-02 Aldy Hernandez <aldyh@redhat.com>
16825 PR tree-optimization/101724
16826 * params.opt: Remove --param=threader-iterative.
16827 * tree-ssa-threadbackward.c (pass_thread_jumps::execute): Remove
16830 2021-08-02 Tom de Vries <tdevries@suse.de>
16832 PR middle-end/101665
16833 * doc/extend.texi (nonnull attribute): Improve documentation.
16835 2021-08-02 Andrew Pinski <apinski@marvell.com>
16837 PR rtl-optimization/101683
16838 * rtlanal.c (may_trap_p_1): Handle UNSIGNED_FIX.
16840 2021-08-02 Roger Sayle <roger@nextmovesoftware.com>
16842 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern):
16843 Renamed from cond_removal_in_popcount_clz_ctz_pattern.
16844 Add support for BSWAP, FFS, PARITY and CLRSB builtins.
16845 (tree_ssa_phiopt_worker): Update call to function above.
16847 2021-08-01 H.J. Lu <hjl.tools@gmail.com>
16850 * config/i386/i386.md (bsr_rex64_1_zext): New.
16851 (combine splitter for constant - clzll): Replace gen_bsr_rex64_1
16852 with gen_bsr_rex64_1_zext.
16854 2021-07-31 Jakub Jelinek <jakub@redhat.com>
16857 * config/i386/i386.md (bsr_rex64_1, bsr_1, bsr_zext_1): New
16858 define_insn patterns.
16859 (*bsr_rex64_2, *bsr_2): New define_insn_and_split patterns.
16860 Add combine splitters for constant - clz.
16861 (clz<mode>2): Use a temporary pseudo for bsr result.
16863 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16865 * config/rs6000/smmintrin.h (_mm_floor_pd, _mm_floor_ps,
16866 _mm_floor_sd, _mm_floor_ss): New.
16868 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16870 * config/rs6000/smmintrin.h (_mm_ceil_pd, _mm_ceil_ps,
16871 _mm_ceil_sd, _mm_ceil_ss): New.
16873 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16875 * config/rs6000/smmintrin.h (_mm_blend_pd, _mm_blendv_pd,
16876 _mm_blend_ps, _mm_blendv_ps): New.
16878 2021-07-30 Roger Sayle <roger@nextmovesoftware.com>
16879 Uroš Bizjak <ubizjak@gmail.com>
16881 * config/i386/i386.md (*dec_cmov<mode>): New define_insn_and_split
16882 to generate a conditional move using the carry flag after sub $1.
16883 (peephole2): Eliminate a register-to-register move by inverting
16884 the condition of a conditional move.
16886 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
16888 * config/mmix/mmix.md ("call", "call_value", "*call_real")
16889 ("*call_value_real"): Don't generate rtx mentioning the generic
16890 operands 1 and 2 to "call", and similarly for "call_value".
16891 * config/mmix/mmix.c (mmix_print_operand_punct_valid_p)
16892 (mmix_print_operand): Use '!' instead of 'p'.
16894 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
16896 * doc/md.texi (call): Correct information about operand 2.
16897 * config/mmix/mmix.md ("call", "call_value"): Remove fixed FIXMEs.
16899 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16901 * range-op.cc (operator_trunc_mod::wi_fold): Fold constants.
16903 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16905 * range-op.cc (operator_div::wi_fold): Return UNDEFINED for [0, 0] divisor.
16907 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16909 * gimple-range-cache.cc (*::set_bb_range): Change const basic_block to
16910 const_basic_block.
16911 (*::get_bb_range): Ditto.
16912 (*::bb_range_p): Ditto.
16913 * gimple-range-cache.h: Change prototypes.
16915 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
16917 PR middle-end/90773
16918 * builtins.c (builtin_memcpy_read_str): Change the mode argument
16919 from scalar_int_mode to fixed_size_mode.
16920 (builtin_strncpy_read_str): Likewise.
16921 (gen_memset_value_from_prev): New function.
16922 (builtin_memset_read_str): Change the mode argument from
16923 scalar_int_mode to fixed_size_mode. Use gen_memset_value_from_prev
16924 and support CONST_VECTOR.
16925 (builtin_memset_gen_str): Likewise.
16926 (try_store_by_multiple_pieces): Use by_pieces_constfn to declare
16928 * builtins.h (builtin_strncpy_read_str): Replace scalar_int_mode
16929 with fixed_size_mode.
16930 (builtin_memset_read_str): Likewise.
16931 * expr.c (widest_int_mode_for_size): Renamed to ...
16932 (widest_fixed_size_mode_for_size): Add a bool argument to
16933 indicate if QI vector mode can be used.
16934 (by_pieces_ninsns): Call widest_fixed_size_mode_for_size
16935 instead of widest_int_mode_for_size.
16936 (pieces_addr::adjust): Change the mode argument from
16937 scalar_int_mode to fixed_size_mode.
16938 (op_by_pieces_d): Make m_len read-only. Add a bool member,
16939 m_qi_vector_mode, to indicate that QI vector mode can be used.
16940 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
16941 initialize m_qi_vector_mode. Call widest_fixed_size_mode_for_size
16942 instead of widest_int_mode_for_size.
16943 (op_by_pieces_d::get_usable_mode): Change the mode argument from
16944 scalar_int_mode to fixed_size_mode. Call
16945 widest_fixed_size_mode_for_size instead of
16946 widest_int_mode_for_size.
16947 (op_by_pieces_d::smallest_fixed_size_mode_for_size): New member
16948 function to return the smallest integer or QI vector mode.
16949 (op_by_pieces_d::run): Call widest_fixed_size_mode_for_size
16950 instead of widest_int_mode_for_size. Call
16951 smallest_fixed_size_mode_for_size instead of
16952 smallest_int_mode_for_size.
16953 (store_by_pieces_d::store_by_pieces_d): Add a bool argument to
16954 indicate that QI vector mode can be used and pass it to
16955 op_by_pieces_d::op_by_pieces_d.
16956 (can_store_by_pieces): Call widest_fixed_size_mode_for_size
16957 instead of widest_int_mode_for_size. Pass memsetp to
16958 widest_fixed_size_mode_for_size to support QI vector mode.
16959 Allow all CONST_VECTORs for memset if vec_duplicate is supported.
16960 (store_by_pieces): Pass memsetp to
16961 store_by_pieces_d::store_by_pieces_d.
16962 (clear_by_pieces_1): Removed.
16963 (clear_by_pieces): Replace clear_by_pieces_1 with
16964 builtin_memset_read_str and pass true to store_by_pieces_d to
16965 support vector mode broadcast.
16966 (string_cst_read_str): Change the mode argument from
16967 scalar_int_mode to fixed_size_mode.
16968 * expr.h (by_pieces_constfn): Change scalar_int_mode to
16970 (by_pieces_prev): Likewise.
16971 * rtl.h (lowpart_subreg_regno): New.
16972 * rtlanal.c (lowpart_subreg_regno): New. A wrapper around
16973 simplify_subreg_regno.
16974 * target.def (gen_memset_scratch_rtx): New hook.
16975 * doc/tm.texi.in: Add TARGET_GEN_MEMSET_SCRATCH_RTX.
16976 * doc/tm.texi: Regenerated.
16978 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16981 * config/mips/mips.c (mips_atomic_assign_expand_fenv): Use
16982 TARGET_EXPR instead of MODIFY_EXPR.
16984 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16987 * config/mips/mips-protos.h (mips_expand_vec_cmp_expr): Declare.
16988 * config/mips/mips.c (mips_expand_vec_cmp_expr): New function.
16989 * config/mips/mips-msa.md (vec_cmp<MSA:mode><mode_i>): New
16991 (vec_cmpu<IMSA:mode><mode_i>): New expander.
16993 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
16996 * config/i386/i386-options.c (ix86_option_override_internal):
16997 Don't enable LZCNT/POPCNT if they have been disabled explicitly.
16999 2021-07-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
17002 * config/arm/arm_neon.h (vld1_p64): Replace call to builtin by
17003 explicitly dereferencing __a.
17004 (vld1_s64): Likewise.
17005 (vld1_u64): Likewise.
17006 * config/arm/arm_neon_builtins.def (vld1): Remove entry for di
17007 and change to VAR13.
17009 2021-07-30 Aldy Hernandez <aldyh@redhat.com>
17011 * gimple-loop-versioning.cc (lv_dom_walker::lv_dom_walker): Remove
17012 use of m_range_analyzer.
17013 (loop_versioning::lv_dom_walker::before_dom_children): Same.
17014 (loop_versioning::lv_dom_walker::after_dom_children): Remove.
17015 (loop_versioning::prune_loop_conditions): Replace vr_values use
17016 with range_query interface.
17017 (pass_loop_versioning::execute): Use ranger.
17019 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
17022 * ipa-devirt.c (ipa_odr_read_section): Compare the precision of
17023 enum values, and emit a warning if they mismatch.
17025 2021-07-30 Kewen Lin <linkw@linux.ibm.com>
17027 * cfgloop.h (as_const): New function.
17028 (class loop_iterator): Rename to ...
17029 (class loops_list): ... this.
17030 (loop_iterator::next): Rename to ...
17031 (loops_list::Iter::fill_curr_loop): ... this and adjust.
17032 (loop_iterator::loop_iterator): Rename to ...
17033 (loops_list::loops_list): ... this and adjust.
17034 (loops_list::Iter): New class.
17035 (loops_list::iterator): New type.
17036 (loops_list::const_iterator): New type.
17037 (loops_list::begin): New function.
17038 (loops_list::end): Likewise.
17039 (loops_list::begin const): Likewise.
17040 (loops_list::end const): Likewise.
17041 (FOR_EACH_LOOP): Remove.
17042 (FOR_EACH_LOOP_FN): Remove.
17043 * cfgloop.c (flow_loops_dump): Adjust FOR_EACH_LOOP* with range-based
17044 for loop with loops_list instance.
17045 (sort_sibling_loops): Likewise.
17046 (disambiguate_loops_with_multiple_latches): Likewise.
17047 (verify_loop_structure): Likewise.
17048 * cfgloopmanip.c (create_preheaders): Likewise.
17049 (force_single_succ_latches): Likewise.
17050 * config/aarch64/falkor-tag-collision-avoidance.c
17051 (execute_tag_collision_avoidance): Likewise.
17052 * config/mn10300/mn10300.c (mn10300_scan_for_setlb_lcc): Likewise.
17053 * config/s390/s390.c (s390_adjust_loops): Likewise.
17054 * doc/loop.texi: Likewise.
17055 * gimple-loop-interchange.cc (pass_linterchange::execute): Likewise.
17056 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
17057 * gimple-loop-versioning.cc (loop_versioning::analyze_blocks): Likewise.
17058 (loop_versioning::make_versioning_decisions): Likewise.
17059 * gimple-ssa-split-paths.c (split_paths): Likewise.
17060 * graphite-isl-ast-to-gimple.c (graphite_regenerate_ast_isl): Likewise.
17061 * graphite.c (canonicalize_loop_form): Likewise.
17062 (graphite_transform_loops): Likewise.
17063 * ipa-fnsummary.c (analyze_function_body): Likewise.
17064 * ipa-pure-const.c (analyze_function): Likewise.
17065 * loop-doloop.c (doloop_optimize_loops): Likewise.
17066 * loop-init.c (loop_optimizer_finalize): Likewise.
17067 (fix_loop_structure): Likewise.
17068 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
17069 (move_loop_invariants): Likewise.
17070 * loop-unroll.c (decide_unrolling): Likewise.
17071 (unroll_loops): Likewise.
17072 * modulo-sched.c (sms_schedule): Likewise.
17073 * predict.c (predict_loops): Likewise.
17074 (pass_profile::execute): Likewise.
17075 * profile.c (branch_prob): Likewise.
17076 * sel-sched-ir.c (sel_finish_pipelining): Likewise.
17077 (sel_find_rgns): Likewise.
17078 * tree-cfg.c (replace_loop_annotate): Likewise.
17079 (replace_uses_by): Likewise.
17080 (move_sese_region_to_fn): Likewise.
17081 * tree-if-conv.c (pass_if_conversion::execute): Likewise.
17082 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
17083 * tree-parloops.c (parallelize_loops): Likewise.
17084 * tree-predcom.c (tree_predictive_commoning): Likewise.
17085 * tree-scalar-evolution.c (scev_initialize): Likewise.
17086 (scev_reset): Likewise.
17087 * tree-ssa-dce.c (find_obviously_necessary_stmts): Likewise.
17088 * tree-ssa-live.c (remove_unused_locals): Likewise.
17089 * tree-ssa-loop-ch.c (ch_base::copy_headers): Likewise.
17090 * tree-ssa-loop-im.c (analyze_memory_references): Likewise.
17091 (tree_ssa_lim_initialize): Likewise.
17092 * tree-ssa-loop-ivcanon.c (canonicalize_induction_variables): Likewise.
17093 * tree-ssa-loop-ivopts.c (tree_ssa_iv_optimize): Likewise.
17094 * tree-ssa-loop-manip.c (get_loops_exits): Likewise.
17095 * tree-ssa-loop-niter.c (estimate_numbers_of_iterations): Likewise.
17096 (free_numbers_of_iterations_estimates): Likewise.
17097 * tree-ssa-loop-prefetch.c (tree_ssa_prefetch_arrays): Likewise.
17098 * tree-ssa-loop-split.c (tree_ssa_split_loops): Likewise.
17099 * tree-ssa-loop-unswitch.c (tree_ssa_unswitch_loops): Likewise.
17100 * tree-ssa-loop.c (gate_oacc_kernels): Likewise.
17101 (pass_scev_cprop::execute): Likewise.
17102 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
17103 * tree-ssa-sccvn.c (do_rpo_vn): Likewise.
17104 * tree-ssa-threadupdate.c
17105 (jump_thread_path_registry::thread_through_all_blocks): Likewise.
17106 * tree-vectorizer.c (vectorize_loops): Likewise.
17107 * tree-vrp.c (vrp_asserts::find_assert_locations): Likewise.
17109 2021-07-29 Hans-Peter Nilsson <hp@bitrange.com>
17111 * config/mmix/mmix.c (mmix_function_arg_1): Avoid
17112 generating a VOIDmode register for e.g. the
17113 function_arg_info::end_marker.
17115 2021-07-29 Jeff Law <jeffreyalaw@gmail.com>
17117 * config/h8300/h8300-modes.def: Add CCZ, CCV and CCC, drop CCZNV.
17118 * config/h8300/h8300.md (H8cc mode iterator): Add CCZ.
17119 (cc mode_attr): Similarly.
17120 (ccz subst_attr): Similarly.
17121 * config/h8300/jumpcall.md: Add new patterns for branch-on-bit.
17122 * config/h8300/testcompare.md: Remove various cc0 based patterns
17123 that had been commented out. Add pattern to set CCZ from a bit
17126 2021-07-29 Thomas Schwinge <thomas@codesourcery.com>
17127 Julian Brown <julian@codesourcery.com>
17128 Kwok Cheung Yeung <kcy@codesourcery.com>
17130 * omp-offload.c (oacc_loop_xform_head_tail, oacc_loop_process):
17131 'update_stmt' after modification.
17132 (pass_oacc_loop_designation): New function, extracted out of...
17133 (pass_oacc_device_lower): ... this.
17134 (pass_data_oacc_loop_designation, pass_oacc_loop_designation)
17135 (make_pass_oacc_loop_designation): New.
17136 * passes.def: Add it.
17137 * tree-parloops.c (create_parallel_loop): Adjust.
17138 * tree-pass.h (make_pass_oacc_loop_designation): New.
17140 2021-07-29 Aldy Hernandez <aldyh@redhat.com>
17142 * flag-types.h (enum threader_mode): New.
17143 * params.opt: Add entry for --param=threader-mode.
17144 * tree-ssa-threadbackward.c (THREADER_ITERATIVE_MODE): New.
17145 (class back_threader): New.
17146 (back_threader::back_threader): New.
17147 (back_threader::~back_threader): New.
17148 (back_threader::maybe_register_path): New.
17149 (back_threader::find_taken_edge): New.
17150 (back_threader::find_taken_edge_switch): New.
17151 (back_threader::find_taken_edge_cond): New.
17152 (back_threader::resolve_def): New.
17153 (back_threader::resolve_phi): New.
17154 (back_threader::find_paths_to_names): New.
17155 (back_threader::find_paths): New.
17158 (thread_jumps::find_jump_threads_backwards): Call ranger threader.
17159 (thread_jumps::find_jump_threads_backwards_with_ranger): New.
17160 (pass_thread_jumps::execute): Abstract out code...
17161 (try_thread_blocks): ...here.
17162 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
17163 Abstract out threading candidate code to...
17164 (single_succ_to_potentially_threadable_block): ...here.
17165 * tree-ssa-threadedge.h (single_succ_to_potentially_threadable_block):
17167 * tree-ssa-threadupdate.c (register_jump_thread): Return boolean.
17168 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
17169 Return bool from register_jump_thread.
17171 2021-07-29 Andreas Krebbel <krebbel@linux.ibm.com>
17173 * target.def: in0 and in1 do not need to be registers.
17174 * doc/tm.texi: Regenerate.
17176 2021-07-29 liuhongt <hongtao.liu@intel.com>
17179 * config/i386/i386.c (ix86_widen_mult_cost): New function.
17180 (ix86_add_stmt_cost): Use ix86_widen_mult_cost for
17183 2021-07-29 Jiufu Guo <guojiufu@linux.ibm.com>
17186 * config/rs6000/rs6000.c (TARGET_PREFERRED_DOLOOP_MODE): New hook.
17187 (rs6000_preferred_doloop_mode): New hook.
17188 * doc/tm.texi: Regenerate.
17189 * doc/tm.texi.in: Add hook preferred_doloop_mode.
17190 * target.def (preferred_doloop_mode): New hook.
17191 * targhooks.c (default_preferred_doloop_mode): New hook.
17192 * targhooks.h (default_preferred_doloop_mode): New hook.
17193 * tree-ssa-loop-ivopts.c (compute_doloop_base_on_mode): New function.
17194 (add_iv_candidate_for_doloop): Call targetm.preferred_doloop_mode
17195 and compute_doloop_base_on_mode.
17197 2021-07-28 Martin Sebor <msebor@redhat.com>
17199 PR middle-end/101494
17200 * tree-ssa-uninit.c (maybe_warn_operand): Correct object offset
17201 and size computation.
17203 2021-07-28 Martin Sebor <msebor@redhat.com>
17205 PR middle-end/101601
17206 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Remove
17208 Handle pointers to functions.
17210 2021-07-28 Martin Sebor <msebor@redhat.com>
17212 * Makefile.in (OBJS): Add gimple-ssa-warn-access.o and pointer-query.o.
17213 * attribs.h (fndecl_dealloc_argno): Move fndecl_dealloc_argno to tree.h.
17214 * builtins.c (compute_objsize_r): Move to pointer-query.cc.
17215 (access_ref::access_ref): Same.
17216 (access_ref::phi): Same.
17217 (access_ref::get_ref): Same.
17218 (access_ref::size_remaining): Same.
17219 (access_ref::offset_in_range): Same.
17220 (access_ref::add_offset): Same.
17221 (access_ref::inform_access): Same.
17222 (ssa_name_limit_t::visit_phi): Same.
17223 (ssa_name_limit_t::leave_phi): Same.
17224 (ssa_name_limit_t::next): Same.
17225 (ssa_name_limit_t::next_phi): Same.
17226 (ssa_name_limit_t::~ssa_name_limit_t): Same.
17227 (pointer_query::pointer_query): Same.
17228 (pointer_query::get_ref): Same.
17229 (pointer_query::put_ref): Same.
17230 (pointer_query::flush_cache): Same.
17231 (warn_string_no_nul): Move to gimple-ssa-warn-access.cc.
17232 (check_nul_terminated_array): Same.
17233 (unterminated_array): Same.
17234 (maybe_warn_for_bound): Same.
17235 (check_read_access): Same.
17236 (warn_for_access): Same.
17237 (get_size_range): Same.
17238 (check_access): Same.
17239 (gimple_call_alloc_size): Move to tree.c.
17240 (gimple_parm_array_size): Move to pointer-query.cc.
17241 (get_offset_range): Same.
17242 (gimple_call_return_array): Same.
17243 (handle_min_max_size): Same.
17244 (handle_array_ref): Same.
17245 (handle_mem_ref): Same.
17246 (compute_objsize): Same.
17247 (gimple_call_alloc_p): Move to gimple-ssa-warn-access.cc.
17248 (call_dealloc_argno): Same.
17249 (fndecl_dealloc_argno): Same.
17250 (new_delete_mismatch_p): Same.
17251 (matching_alloc_calls_p): Same.
17252 (warn_dealloc_offset): Same.
17253 (maybe_emit_free_warning): Same.
17254 * builtins.h (check_nul_terminated_array): Move to
17255 gimple-ssa-warn-access.h.
17256 (check_nul_terminated_array): Same.
17257 (warn_string_no_nul): Same.
17258 (unterminated_array): Same.
17259 (class ssa_name_limit_t): Same.
17260 (class pointer_query): Same.
17261 (struct access_ref): Same.
17262 (class range_query): Same.
17263 (struct access_data): Same.
17264 (gimple_call_alloc_size): Same.
17265 (gimple_parm_array_size): Same.
17266 (compute_objsize): Same.
17267 (class access_data): Same.
17268 (maybe_emit_free_warning): Same.
17269 * calls.c (initialize_argument_information): Remove call to
17270 maybe_emit_free_warning.
17271 * gimple-array-bounds.cc: Include new header.
17272 * gimple-fold.c: Same.
17273 * gimple-ssa-sprintf.c: Same.
17274 * gimple-ssa-warn-restrict.c: Same.
17275 * passes.def: Add pass_warn_access.
17276 * tree-pass.h (make_pass_warn_access): Declare.
17277 * tree-ssa-strlen.c: Include new headers.
17278 * tree.c (fndecl_dealloc_argno): Move here from builtins.c.
17279 * tree.h (fndecl_dealloc_argno): Move here from attribs.h.
17280 * gimple-ssa-warn-access.cc: New file.
17281 * gimple-ssa-warn-access.h: New file.
17282 * pointer-query.cc: New file.
17283 * pointer-query.h: New file.
17285 2021-07-28 Jakub Jelinek <jakub@redhat.com>
17287 PR middle-end/101624
17288 * ubsan.c (maybe_instrument_pointer_overflow,
17289 instrument_object_size): Only test DECL_REGISTER on VAR_DECLs,
17290 PARM_DECLs or RESULT_DECLs.
17291 * sanopt.c (maybe_optimize_ubsan_ptr_ifn): Likewise.
17293 2021-07-28 Jakub Jelinek <jakub@redhat.com>
17295 PR middle-end/101642
17296 * match.pd (bswap16 (x) == bswap16 (y)): Cast both operands
17297 to type of bswap16 for comparison.
17298 (bswap16 (x) == cst): Cast bswap16 operand to type of cst.
17300 2021-07-28 Richard Biener <rguenther@suse.de>
17302 PR tree-optimization/101615
17303 * tree-vect-slp.c (vect_optimize_slp): Materialize permutes
17304 at CTOR SLP graph entries.
17306 2021-07-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
17308 * config/aarch64/aarch64.md (*extend<SHORT:mode><GPI:mode>2_aarch64):
17309 Add "r,w" alternative.
17311 2021-07-28 H.J. Lu <hjl.tools@gmail.com>
17314 * config/i386/i386.c (ix86_avx_u128_mode_needed): Don't set
17315 AVX_U128_DIRTY when all bits are zero.
17317 2021-07-28 Richard Biener <rguenther@suse.de>
17319 PR tree-optimization/101615
17320 * tree-vect-slp.c (vect_optimize_slp): Pre-existing vector
17321 external nodes cannot be permuted so make them perm_out 0.
17323 2021-07-28 Andrew Stubbs <ams@codesourcery.com>
17326 * config.in: Regenerate.
17327 * config/gcn/gcn-hsa.h (A_FIJI): New define.
17328 (A_900): New define.
17329 (A_906): New define.
17330 (A_908): New define.
17331 (ASM_SPEC): Use A_FIJI, A_900, A_906 and A_908.
17332 * config/gcn/gcn.c (output_file_start): Adjust attributes according
17333 to the assembler capabilities.
17334 * config/gcn/mkoffload.c (main): Likewise.
17335 * configure: Regenerate.
17336 * configure.ac: Add tests for LLVM assembler attribute features.
17338 2021-07-28 Andrew MacLeod <amacleod@redhat.com>
17340 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Check for
17341 cond_false and cond_true on branches.
17343 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
17345 * config/aarch64/aarch64.c (aarch64_gen_adjusted_ldpstp): Use
17348 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
17350 * alias.c (init_alias_analysis): Don't skip prologue/epilogue.
17352 2021-07-28 Jakub Jelinek <jakub@redhat.com>
17355 * config/i386/sse.md (vashr<mode>3): Split into vashrv8di3 expander
17356 and vashrv4di3 expander, where the latter requires just TARGET_AVX2
17357 and has special !TARGET_AVX512VL expansion.
17358 (vashrv2di3<mask_name>): Rename to ...
17359 (vashrv2di3): ... this. Change condition to TARGET_XOP || TARGET_AVX2
17360 and add special !TARGET_XOP && !TARGET_AVX512VL expansion.
17362 2021-07-28 Martin Uecker <muecker@gwdg.de>
17364 * calls.c (maybe_warn_rdwr_sizes): Correct argument
17365 numbers in warning that were switched.
17367 2021-07-28 Kewen Lin <linkw@linux.ibm.com>
17369 PR tree-optimization/101596
17370 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Fix wrong check
17371 by using new_type's precision instead.
17373 2021-07-28 liuhongt <hongtao.liu@intel.com>
17376 * config/i386/i386.h (processor_costs): Add new member
17378 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
17379 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
17380 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
17381 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
17382 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
17383 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
17384 generic_cost, core_cost): Initialize integer_to_sse same value
17386 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
17387 * config/i386/i386.c (ix86_builtin_vectorization_cost):
17388 Use integer_to_sse instead of sse_op to calculate the cost of
17391 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
17393 * config/rs6000/rs6000-gen-builtins.c (write_ovld_static_init): New
17395 (write_init_file): Call write_ovld_static_init.
17397 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
17399 * config/rs6000/rs6000-gen-builtins.c (write_bif_static_init): New
17401 (write_init_file): Call write_bif_static_init.
17403 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
17405 * config/rs6000/rs6000-gen-builtins.c (typemap): New struct.
17406 (TYPE_MAP_SIZE): New macro.
17407 (type_map): New initialized variable.
17408 (typemap_cmp): New function.
17409 (write_type_node): Likewise.
17410 (write_fntype_init): Implement.
17412 2021-07-27 Martin Sebor <msebor@redhat.com>
17414 PR tree-optimization/101584
17415 * tree-ssa-uninit.c (builtin_call_nomodifying_p): New function.
17416 (check_defs): Call it.
17418 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
17420 * tree-ssa-dom.c (dom_jump_threader_simplifier):
17421 Put avail_exprs_stack in the class, instead of passing it to
17422 jump_threader_simplifier.
17423 (dom_jump_threader_simplifier::simplify): Add state argument.
17424 (dom_opt_dom_walker): Add state.
17425 (pass_dominator::execute): Pass state to threader.
17426 (dom_opt_dom_walker::before_dom_children): Use state.
17427 * tree-ssa-threadedge.c (jump_threader::jump_threader): Replace
17428 arguments by state.
17429 (jump_threader::record_temporary_equivalences_from_phis):
17430 Register equivalences through the state variable.
17431 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
17432 Record ranges in a statement through the state variable.
17433 (jump_threader::simplify_control_stmt_condition): Pass state to
17435 (jump_threader::simplify_control_stmt_condition_1): Same.
17436 (jump_threader::thread_around_empty_blocks): Remove obsolete
17438 (jump_threader::thread_through_normal_block): Record equivalences
17439 on edge through the state variable.
17440 (jump_threader::thread_across_edge): Abstract state pushing.
17441 (jt_state::jt_state): New.
17442 (jt_state::push): New.
17443 (jt_state::pop): New.
17444 (jt_state::register_equiv): New.
17445 (jt_state::record_ranges_from_stmt): New.
17446 (jt_state::register_equivs_on_edge): New.
17447 (jump_threader_simplifier::jump_threader_simplifier): Move from
17449 (jump_threader_simplifier::simplify): Add state argument.
17450 * tree-ssa-threadedge.h (class jt_state): New.
17451 (class jump_threader): Add state to constructor.
17452 (class jump_threader_simplifier): Add state to simplify. Remove
17453 avail_exprs_stack from class.
17454 * tree-vrp.c (vrp_jump_threader_simplifier::simplify): Add state
17456 (vrp_jump_threader::vrp_jump_threader): Add state.
17457 (vrp_jump_threader::~vrp_jump_threader): Clean up state.
17459 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
17461 * Makefile.in (OBJS): Add gimple-range-path.o.
17462 * gimple-range-path.cc: New file.
17463 * gimple-range-path.h: New file.
17465 2021-07-27 Jonathan Wright <jonathan.wright@arm.com>
17467 * config/aarch64/aarch64-simd.md: Push sign/zero-extension
17468 inside vec_duplicate for all patterns.
17469 * simplify-rtx.c (simplify_context::simplify_unary_operation_1):
17470 Push sign/zero-extension inside vec_duplicate.
17472 2021-07-27 Richard Biener <rguenther@suse.de>
17474 PR tree-optimization/101573
17475 * tree-ssa-uninit.c (warn_uninit_phi_uses): New function
17476 looking at uninitialized PHI arg defs in some constrained cases.
17477 (warn_uninitialized_vars): Call it.
17478 (execute_early_warn_uninitialized): Calculate dominators.
17480 2021-07-27 Richard Biener <rguenther@suse.de>
17482 PR tree-optimization/39821
17483 * tree-vect-stmts.c (vect_model_promotion_demotion_cost): Use
17484 vector_stmt for widening arithmetic.
17485 (vectorizable_conversion): Adjust.
17487 2021-07-27 Martin Jambor <mjambor@suse.cz>
17489 * cgraph.h (ipa_replace_map): New field force_load_ref.
17490 * ipa-prop.h (ipa_param_descriptor): Reduce precision of move_cost,
17491 added new flag load_dereferenced, adjusted comments.
17492 (ipa_get_param_dereferenced): New function.
17493 (ipa_set_param_dereferenced): Likewise.
17494 * cgraphclones.c (cgraph_node::create_virtual_clone): Follow it.
17495 * ipa-cp.c: Include gimple.h.
17496 (ipcp_discover_new_direct_edges): Take into account dereferenced flag.
17497 (get_replacement_map): New parameter force_load_ref, set the
17498 appropriate flag in ipa_replace_map if set.
17499 (struct symbol_and_index_together): New type.
17500 (adjust_refs_in_act_callers): New function.
17501 (adjust_references_in_caller): Likewise.
17502 (create_specialized_node): When appropriate, call
17503 adjust_references_in_caller and force only load references.
17504 * ipa-prop.c (load_from_dereferenced_name): New function.
17505 (ipa_analyze_controlled_uses): Also detect loads from a
17506 dereference, harden testing of call statements.
17507 (ipa_write_node_info): Stream the dereferenced flag.
17508 (ipa_read_node_info): Likewise.
17509 (ipa_set_jf_constant): Also create refdesc when jump function
17510 references a variable.
17511 (cgraph_node_for_jfunc): Rename to symtab_node_for_jfunc, work
17512 also on references of variables and return a symtab_node. Adjust
17514 (propagate_controlled_uses): Also remove references to VAR_DECLs.
17516 2021-07-27 Jakub Jelinek <jakub@redhat.com>
17518 PR middle-end/101586
17519 * gimple-fold.c (clear_padding_type): Ignore FIELD_DECLs with byte
17520 positions above or equal to sz except for diagnostics of flexible
17523 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
17525 PR tree-optimization/78888
17526 * gimple-range-fold.cc (get_letter_range): New.
17527 (fold_using_range::range_of_builtin_call): Call get_letter_range.
17529 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
17531 PR tree-optimization/78888
17532 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Add cases
17533 for CFN_BUILT_IN_TOUPPER and CFN_BUILT_IN_TOLOWER.
17535 2021-07-26 Roger Sayle <roger@nextmovesoftware.com>
17536 Marc Glisse <marc.glisse@inria.fr>
17538 * match.pd (rotate): Simplify equality/inequality of rotations.
17539 (bswap): Simplify equality/inequality tests of byte swapping.
17541 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17543 * range-op.cc (operator_bitwise_xor::op1_op2_relation_effect):
17546 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17548 * range-op.cc (operator_lshift::fold_range): Pass rel to
17549 base class fold_range.
17550 (operator_rshift::fold_range): Same.
17552 2021-07-26 Ashimida <ashimida@linux.alibaba.com>
17555 * toplev.h (min_align_loops_log): Remove declaration.
17556 (min_align_jumps_log, min_align_labels_log): Likewise.
17557 (min_align_functions_log): Likewise.
17559 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17561 * tree-vrp.c (vrp_simplify_cond_using_ranges): Rename vr_values
17563 (execute_vrp): Abstract out simplification of conditionals...
17564 (simplify_casted_conds): ...here.
17566 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17568 * gimple-array-bounds.cc (array_bounds_checker::get_value_range):
17569 Add gimple argument.
17570 (array_bounds_checker::check_array_ref): Same.
17571 (array_bounds_checker::check_addr_expr): Same.
17572 (array_bounds_checker::check_array_bounds): Pass statement to
17573 check_array_bounds and check_addr_expr.
17574 * gimple-array-bounds.h (check_array_bounds): Add gimple argument.
17575 (check_addr_expr): Same.
17576 (get_value_range): Same.
17578 2021-07-26 Tamar Christina <tamar.christina@arm.com>
17580 * config/aarch64/aarch64-simd-builtins.def (sdot, udot): Rename to...
17581 (sdot_prod, udot_prod): ... this.
17582 * config/aarch64/aarch64-simd.md (aarch64_<sur>dot<vsi2qi>): Merged
17584 (<sur>dot_prod<vsi2qi>): ... this.
17585 (aarch64_<sur>dot_lane<vsi2qi>, aarch64_<sur>dot_laneq<vsi2qi>):
17586 Change operands order.
17587 (<sur>sadv16qi): Use new operands order.
17588 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32,
17589 vdotq_s32): Use new RTL ordering.
17591 2021-07-26 Tamar Christina <tamar.christina@arm.com>
17593 * config/aarch64/aarch64-builtins.c (TYPES_TERNOP_SUSS,
17594 aarch64_types_ternop_suss_qualifiers): New.
17595 * config/aarch64/aarch64-simd-builtins.def (usdot_prod): Use it.
17596 * config/aarch64/aarch64-simd.md (usdot_prod<vsi2qi>): Re-organize RTL.
17597 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Use it.
17599 2021-07-23 Jakub Jelinek <jakub@redhat.com>
17601 PR rtl-optimization/101562
17602 * expmed.c (store_integral_bit_field): Only use movstrict_optab
17603 if the operand isn't paradoxical.
17605 2021-07-23 Aldy Hernandez <aldyh@redhat.com>
17607 * gimple-array-bounds.h (class array_bounds_checker): Change
17608 ranges type to range_query.
17610 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17612 * config/aarch64/arm_neon.h (vst1_s64_x2): Use
17613 __builtin_memcpy instead of constructing
17614 __builtin_aarch64_simd_oi one vector at a time.
17615 (vst1_u64_x2): Likewise.
17616 (vst1_f64_x2): Likewise.
17617 (vst1_s8_x2): Likewise.
17618 (vst1_p8_x2): Likewise.
17619 (vst1_s16_x2): Likewise.
17620 (vst1_p16_x2): Likewise.
17621 (vst1_s32_x2): Likewise.
17622 (vst1_u8_x2): Likewise.
17623 (vst1_u16_x2): Likewise.
17624 (vst1_u32_x2): Likewise.
17625 (vst1_f16_x2): Likewise.
17626 (vst1_f32_x2): Likewise.
17627 (vst1_p64_x2): Likewise.
17628 (vst1q_s8_x2): Likewise.
17629 (vst1q_p8_x2): Likewise.
17630 (vst1q_s16_x2): Likewise.
17631 (vst1q_p16_x2): Likewise.
17632 (vst1q_s32_x2): Likewise.
17633 (vst1q_s64_x2): Likewise.
17634 (vst1q_u8_x2): Likewise.
17635 (vst1q_u16_x2): Likewise.
17636 (vst1q_u32_x2): Likewise.
17637 (vst1q_u64_x2): Likewise.
17638 (vst1q_f16_x2): Likewise.
17639 (vst1q_f32_x2): Likewise.
17640 (vst1q_f64_x2): Likewise.
17641 (vst1q_p64_x2): Likewise.
17643 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17645 * config/aarch64/arm_neon.h (vst1_s64_x3): Use
17646 __builtin_memcpy instead of constructing
17647 __builtin_aarch64_simd_ci one vector at a time.
17648 (vst1_u64_x3): Likewise.
17649 (vst1_f64_x3): Likewise.
17650 (vst1_s8_x3): Likewise.
17651 (vst1_p8_x3): Likewise.
17652 (vst1_s16_x3): Likewise.
17653 (vst1_p16_x3): Likewise.
17654 (vst1_s32_x3): Likewise.
17655 (vst1_u8_x3): Likewise.
17656 (vst1_u16_x3): Likewise.
17657 (vst1_u32_x3): Likewise.
17658 (vst1_f16_x3): Likewise.
17659 (vst1_f32_x3): Likewise.
17660 (vst1_p64_x3): Likewise.
17661 (vst1q_s8_x3): Likewise.
17662 (vst1q_p8_x3): Likewise.
17663 (vst1q_s16_x3): Likewise.
17664 (vst1q_p16_x3): Likewise.
17665 (vst1q_s32_x3): Likewise.
17666 (vst1q_s64_x3): Likewise.
17667 (vst1q_u8_x3): Likewise.
17668 (vst1q_u16_x3): Likewise.
17669 (vst1q_u32_x3): Likewise.
17670 (vst1q_u64_x3): Likewise.
17671 (vst1q_f16_x3): Likewise.
17672 (vst1q_f32_x3): Likewise.
17673 (vst1q_f64_x3): Likewise.
17674 (vst1q_p64_x3): Likewise.
17676 2021-07-23 H.J. Lu <hjl.tools@gmail.com>
17679 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): Don't return
17680 hard register when LRA is in progress.
17682 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17684 * config/aarch64/arm_neon.h (vst1_s8_x4): Use
17685 __builtin_memcpy instead of using a union.
17686 (vst1q_s8_x4): Likewise.
17687 (vst1_s16_x4): Likewise.
17688 (vst1q_s16_x4): Likewise.
17689 (vst1_s32_x4): Likewise.
17690 (vst1q_s32_x4): Likewise.
17691 (vst1_u8_x4): Likewise.
17692 (vst1q_u8_x4): Likewise.
17693 (vst1_u16_x4): Likewise.
17694 (vst1q_u16_x4): Likewise.
17695 (vst1_u32_x4): Likewise.
17696 (vst1q_u32_x4): Likewise.
17697 (vst1_f16_x4): Likewise.
17698 (vst1q_f16_x4): Likewise.
17699 (vst1_f32_x4): Likewise.
17700 (vst1q_f32_x4): Likewise.
17701 (vst1_p8_x4): Likewise.
17702 (vst1q_p8_x4): Likewise.
17703 (vst1_p16_x4): Likewise.
17704 (vst1q_p16_x4): Likewise.
17705 (vst1_s64_x4): Likewise.
17706 (vst1_u64_x4): Likewise.
17707 (vst1_p64_x4): Likewise.
17708 (vst1q_s64_x4): Likewise.
17709 (vst1q_u64_x4): Likewise.
17710 (vst1q_p64_x4): Likewise.
17711 (vst1_f64_x4): Likewise.
17712 (vst1q_f64_x4): Likewise.
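The entry above replaces union-based type punning with __builtin_memcpy. A minimal sketch of that idiom follows, assuming hypothetical stand-in types (vec8x4, opaque32) and a hypothetical helper (to_opaque); none of these are the real arm_neon.h definitions, and std::memcpy is used in place of the builtin so the sketch builds on any target.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical stand-ins: a tuple of four 8-byte vectors and an opaque
// 32-byte aggregate.  They only loosely mirror int8x8x4_t and the
// compiler's internal aggregate type; they are not the real definitions.
struct vec8x4 { std::int8_t val[4][8]; };
struct opaque32 { unsigned char bytes[32]; };

// Reinterpret the tuple as the opaque aggregate with one block copy,
// instead of writing one union member and reading back another.
inline opaque32 to_opaque (const vec8x4 &v)
{
  static_assert (sizeof (opaque32) == sizeof (vec8x4), "sizes must match");
  opaque32 out;
  std::memcpy (&out, &v, sizeof (out));
  return out;
}
```

A fixed-size memcpy between two objects of equal size is well defined for trivially copyable types, and compilers commonly lower it to plain register moves.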
17714 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17716 * config/aarch64/arm_neon.h (vst2_s64): Use __builtin_memcpy
17717 instead of constructing __builtin_aarch64_simd_oi one vector at a time.
17719 (vst2_u64): Likewise.
17720 (vst2_f64): Likewise.
17721 (vst2_s8): Likewise.
17722 (vst2_p8): Likewise.
17723 (vst2_s16): Likewise.
17724 (vst2_p16): Likewise.
17725 (vst2_s32): Likewise.
17726 (vst2_u8): Likewise.
17727 (vst2_u16): Likewise.
17728 (vst2_u32): Likewise.
17729 (vst2_f16): Likewise.
17730 (vst2_f32): Likewise.
17731 (vst2_p64): Likewise.
17732 (vst2q_s8): Likewise.
17733 (vst2q_p8): Likewise.
17734 (vst2q_s16): Likewise.
17735 (vst2q_p16): Likewise.
17736 (vst2q_s32): Likewise.
17737 (vst2q_s64): Likewise.
17738 (vst2q_u8): Likewise.
17739 (vst2q_u16): Likewise.
17740 (vst2q_u32): Likewise.
17741 (vst2q_u64): Likewise.
17742 (vst2q_f16): Likewise.
17743 (vst2q_f32): Likewise.
17744 (vst2q_f64): Likewise.
17745 (vst2q_p64): Likewise.
17747 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17749 * config/aarch64/arm_neon.h (vst3_s64): Use __builtin_memcpy
17750 instead of constructing __builtin_aarch64_simd_ci one vector at a time.
17752 (vst3_u64): Likewise.
17753 (vst3_f64): Likewise.
17754 (vst3_s8): Likewise.
17755 (vst3_p8): Likewise.
17756 (vst3_s16): Likewise.
17757 (vst3_p16): Likewise.
17758 (vst3_s32): Likewise.
17759 (vst3_u8): Likewise.
17760 (vst3_u16): Likewise.
17761 (vst3_u32): Likewise.
17762 (vst3_f16): Likewise.
17763 (vst3_f32): Likewise.
17764 (vst3_p64): Likewise.
17765 (vst3q_s8): Likewise.
17766 (vst3q_p8): Likewise.
17767 (vst3q_s16): Likewise.
17768 (vst3q_p16): Likewise.
17769 (vst3q_s32): Likewise.
17770 (vst3q_s64): Likewise.
17771 (vst3q_u8): Likewise.
17772 (vst3q_u16): Likewise.
17773 (vst3q_u32): Likewise.
17774 (vst3q_u64): Likewise.
17775 (vst3q_f16): Likewise.
17776 (vst3q_f32): Likewise.
17777 (vst3q_f64): Likewise.
17778 (vst3q_p64): Likewise.
17780 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17782 * config/aarch64/arm_neon.h (vst4_s64): Use __builtin_memcpy
17783 instead of constructing __builtin_aarch64_simd_xi one vector at a time.
17785 (vst4_u64): Likewise.
17786 (vst4_f64): Likewise.
17787 (vst4_s8): Likewise.
17788 (vst4_p8): Likewise.
17789 (vst4_s16): Likewise.
17790 (vst4_p16): Likewise.
17791 (vst4_s32): Likewise.
17792 (vst4_u8): Likewise.
17793 (vst4_u16): Likewise.
17794 (vst4_u32): Likewise.
17795 (vst4_f16): Likewise.
17796 (vst4_f32): Likewise.
17797 (vst4_p64): Likewise.
17798 (vst4q_s8): Likewise.
17799 (vst4q_p8): Likewise.
17800 (vst4q_s16): Likewise.
17801 (vst4q_p16): Likewise.
17802 (vst4q_s32): Likewise.
17803 (vst4q_s64): Likewise.
17804 (vst4q_u8): Likewise.
17805 (vst4q_u16): Likewise.
17806 (vst4q_u32): Likewise.
17807 (vst4q_u64): Likewise.
17808 (vst4q_f16): Likewise.
17809 (vst4q_f32): Likewise.
17810 (vst4q_f64): Likewise.
17811 (vst4q_p64): Likewise.
17813 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17815 * config/aarch64/arm_neon.h (vtbx4_s8): Use __builtin_memcpy
17816 instead of constructing __builtin_aarch64_simd_oi one vector at a time.
17818 (vtbx4_u8): Likewise.
17819 (vtbx4_p8): Likewise.
17821 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17823 * config/aarch64/arm_neon.h (vtbl3_s8): Use __builtin_memcpy
17824 instead of constructing __builtin_aarch64_simd_oi one vector at a time.
17826 (vtbl3_u8): Likewise.
17827 (vtbl3_p8): Likewise.
17828 (vtbl4_s8): Likewise.
17829 (vtbl4_u8): Likewise.
17830 (vtbl4_p8): Likewise.
17832 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17834 * config/aarch64/arm_neon.h (vqtbx2_s8): Use __builtin_memcpy
17835 instead of constructing __builtin_aarch64_simd_oi one vector at a time.
17837 (vqtbx2_u8): Likewise.
17838 (vqtbx2_p8): Likewise.
17839 (vqtbx2q_s8): Likewise.
17840 (vqtbx2q_u8): Likewise.
17841 (vqtbx2q_p8): Likewise.
17842 (vqtbx3_s8): Use __builtin_memcpy instead of constructing
17843 __builtin_aarch64_simd_ci one vector at a time.
17844 (vqtbx3_u8): Likewise.
17845 (vqtbx3_p8): Likewise.
17846 (vqtbx3q_s8): Likewise.
17847 (vqtbx3q_u8): Likewise.
17848 (vqtbx3q_p8): Likewise.
17849 (vqtbx4_s8): Use __builtin_memcpy instead of constructing
17850 __builtin_aarch64_simd_xi one vector at a time.
17851 (vqtbx4_u8): Likewise.
17852 (vqtbx4_p8): Likewise.
17853 (vqtbx4q_s8): Likewise.
17854 (vqtbx4q_u8): Likewise.
17855 (vqtbx4q_p8): Likewise.
17857 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17859 * config/aarch64/arm_neon.h (vqtbl2_s8): Use __builtin_memcpy
17860 instead of constructing __builtin_aarch64_simd_oi one vector at a time.
17862 (vqtbl2_u8): Likewise.
17863 (vqtbl2_p8): Likewise.
17864 (vqtbl2q_s8): Likewise.
17865 (vqtbl2q_u8): Likewise.
17866 (vqtbl2q_p8): Likewise.
17867 (vqtbl3_s8): Use __builtin_memcpy instead of constructing
17868 __builtin_aarch64_simd_ci one vector at a time.
17869 (vqtbl3_u8): Likewise.
17870 (vqtbl3_p8): Likewise.
17871 (vqtbl3q_s8): Likewise.
17872 (vqtbl3q_u8): Likewise.
17873 (vqtbl3q_p8): Likewise.
17874 (vqtbl4_s8): Use __builtin_memcpy instead of constructing
17875 __builtin_aarch64_simd_xi one vector at a time.
17876 (vqtbl4_u8): Likewise.
17877 (vqtbl4_p8): Likewise.
17878 (vqtbl4q_s8): Likewise.
17879 (vqtbl4q_u8): Likewise.
17880 (vqtbl4q_p8): Likewise.
17882 2021-07-23 Haochen Gui <guihaoc@gcc.gnu.org>
17885 * config/rs6000/rs6000.md (cstore<mode>4): Fix wrong fall through.
17887 2021-07-22 Andrew Pinski <apinski@marvell.com>
17889 PR tree-optimization/10153
17890 * tree-tailcall.c (create_tailcall_accumulator):
17891 Don't call fold_convert as the type should be correct already.
17892 (tree_optimize_tail_calls_1): Use build_{one,zero}_cst instead
17893 of integer_{one,zero}_node for the call of create_tailcall_accumulator.
17895 2021-07-22 Aldy Hernandez <aldyh@redhat.com>
17897 * gimple-range-cache.cc (non_null_ref::adjust_range): Replace
17898 varying_p check for null/non-null check.
17900 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17902 PR tree-optimization/101511
17903 * value-relation.cc (relation_oracle::query_relation): Check if ssa1
17904 is in ssa2's equiv set, and don't trap if so.
17906 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17908 PR tree-optimization/101497
17909 * gimple-range-fold.cc (fold_using_range::range_of_cond_expr): Check
17912 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17914 PR tree-optimization/101496
17915 * vr-values.c (simplify_using_ranges::fold_cond): Call range_of_stmt
17916 first, then vrp_visit_cond_stmt.
17918 2021-07-22 liuhongt <hongtao.liu@intel.com>
17920 * config/i386/i386-expand.c
17921 (ix86_broadcast_from_integer_constant): Rename to ..
17922 (ix86_broadcast_from_constant): .. this, and extend it to
17924 (ix86_expand_vector_move): Extend to float mode.
17925 * config/i386/i386-features.c
17926 (replace_constant_pool_with_broadcast): Remove.
17927 (remove_partial_avx_dependency_gate): Ditto.
17928 (constant_pool_broadcast): Ditto.
17929 (class pass_constant_pool_broadcast): Ditto.
17930 (make_pass_constant_pool_broadcast): Ditto.
17931 (remove_partial_avx_dependency): Adjust gate.
17932 * config/i386/i386-passes.def: Remove pass_constant_pool_broadcast.
17933 * config/i386/i386-protos.h
17934 (make_pass_constant_pool_broadcast): Remove.
17936 2021-07-22 liuhongt <hongtao.liu@intel.com>
17938 * config/i386/constraints.md (Wb): New constraint.
17940 * config/i386/i386.md (*ashlhi3_1): Extend to avx512 mask
17942 (*ashlqi3_1): Ditto.
17943 (*<insn><mode>3_1): Split to ..
17944 (*ashr<mode>3_1): this, ...
17945 (*lshr<mode>3_1): and this, also extend this pattern to avx512
17947 (*<insn><mode>3_1): Split to ..
17948 (*ashr<mode>3_1): this, ...
17949 (*lshrqi3_1): and this, also extend this pattern to avx512
17951 (*lshrhi3_1): And this, also extend this pattern to avx512
17953 * config/i386/sse.md (k<code><mode>): New define_split after
17954 it to convert generic shift pattern to mask shift ones.
17956 2021-07-21 Thomas Schwinge <thomas@codesourcery.com>
17957 Joseph Myers <joseph@codesourcery.com>
17958 Cesar Philippidis <cesar@codesourcery.com>
17960 * tree-core.h (omp_clause_code): Add 'OMP_CLAUSE_NOHOST'.
17961 * tree.c (omp_clause_num_ops, omp_clause_code_name, walk_tree_1):
17963 * tree-pretty-print.c (dump_omp_clause): Likewise.
17964 * omp-general.c (oacc_verify_routine_clauses): Likewise.
17965 * gimplify.c (gimplify_scan_omp_clauses)
17966 (gimplify_adjust_omp_clauses): Likewise.
17967 * tree-nested.c (convert_nonlocal_omp_clauses)
17968 (convert_local_omp_clauses): Likewise.
17969 * omp-low.c (scan_sharing_clauses): Likewise.
17970 * omp-offload.c (execute_oacc_device_lower): Update.
17972 2021-07-21 Martin Sebor <msebor@redhat.com>
17974 * tree-ssa-alias.c (walk_aliased_vdefs_1): Fix typos in a comment.
17976 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17978 * config/rs6000/rs6000-gen-builtins.c (write_init_bif_table):
17981 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17983 * config/rs6000/rs6000-gen-builtins.c (write_fntype): New
17985 (write_fntype_init): New stub function.
17986 (write_init_bif_table): Likewise.
17987 (write_init_ovld_table): New function.
17988 (write_init_file): Implement.
17990 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17992 * config/rs6000/rs6000-gen-builtins.c
17993 (write_autogenerated_header): New function.
17994 (write_decls): Likewise.
17995 (write_extern_fntype): New callback function.
17996 (write_header_file): Implement.
17998 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18000 * config/rs6000/rs6000-gen-builtins.c (write_defines_file):
18003 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18005 * config/rs6000/rs6000-gen-builtins.c (complete_vector_type): New
18007 (complete_base_type): Likewise.
18008 (construct_fntype_id): Likewise.
18009 (parse_bif_entry): Call construct_fntype_id.
18010 (parse_ovld_entry): Likewise.
18012 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18014 * config/rs6000/rs6000-gen-builtins.c (ovld_stanza): New struct.
18015 (MAXOVLDSTANZAS): New macro.
18016 (ovld_stanzas): New variable.
18017 (curr_ovld_stanza): Likewise.
18018 (MAXOVLDS): New macro.
18019 (ovlddata): New struct.
18020 (ovlds): New variable.
18021 (curr_ovld): Likewise.
18022 (max_ovld_args): Likewise.
18023 (parse_ovld_entry): New function.
18024 (parse_ovld_stanza): Likewise.
18025 (parse_ovld): Implement.
18027 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18029 * config/rs6000/rs6000-gen-builtins.c (parse_bif_attrs):
18032 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18034 * config/rs6000/rs6000-gen-builtins.c (parse_args): New function.
18035 (parse_prototype): Implement.
18037 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18039 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): New enum.
18040 (curr_bif_stanza): New variable.
18041 (stanza_entry): New struct.
18042 (stanza_map): New initialized variable.
18043 (enable_string): Likewise.
18044 (fnkinds): New enum.
18045 (typelist): New struct.
18046 (attrinfo): Likewise.
18047 (MAXRESTROPNDS): New macro.
18048 (prototype): New struct.
18049 (MAXBIFS): New macro.
18050 (bifdata): New struct.
18051 (bifs): New variable.
18052 (curr_bif): Likewise.
18053 (bif_order): Likewise.
18054 (bif_index): Likewise.
18055 (fatal): New function.
18056 (stanza_name_to_stanza): Likewise.
18057 (parse_bif_attrs): New stub function.
18058 (parse_prototype): Likewise.
18059 (parse_bif_entry): New function.
18060 (parse_bif_stanza): Likewise.
18061 (parse_bif): Implement.
18062 (set_bif_order): New function.
18063 (create_bif_order): Implement.
18065 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18067 * config/rs6000/rs6000-gen-builtins.c (rbtree.h): New #include.
18068 (num_bifs): New variable.
18069 (num_ovld_stanzas): Likewise.
18070 (num_ovlds): Likewise.
18071 (parse_codes): New enum.
18072 (bif_rbt): New variable.
18073 (ovld_rbt): Likewise.
18074 (fntype_rbt): Likewise.
18075 (bifo_rbt): Likewise.
18076 (parse_bif): New stub function.
18077 (create_bif_order): Likewise.
18078 (parse_ovld): Likewise.
18079 (write_header_file): Likewise.
18080 (write_init_file): Likewise.
18081 (write_defines_file): Likewise.
18082 (delete_output_files): New function.
18085 2021-07-21 H.J. Lu <hjl.tools@gmail.com>
18088 * config/i386/i386-builtin.def: Remove OPTION_MASK_ISA_SSE4_2
18089 from CRC32 _builtin functions.
18091 2021-07-21 Sebastian Huber <sebastian.huber@embedded-brains.de>
18093 * coverage.c (build_gcov_info_var_registration): Mark the object placed
18094 in the linker set as referenced so that it does not get optimized away.
18096 2021-07-21 Kito Cheng <kito.cheng@sifive.com>
18099 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
18101 * config.gcc (riscv*-*-*): Detect which python is available.
18103 2021-07-21 Jakub Jelinek <jakub@redhat.com>
18105 PR middle-end/101535
18106 * gimplify.c (omp_check_private): Properly skip ORT_TARGET_DATA
18107 contexts in which decl isn't privatized and for ORT_TARGET return
18108 false if decl is mapped.
18110 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
18112 * gimple-loop-jam.c: Include tree-ssa-sccvn.h.
18113 (tree_loop_unroll_and_jam): Run value-numbering on a loop that
18114 has been successfully unrolled.
18116 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
18118 * tree-ssa-loop-manip.c (determine_exit_conditions): Return a null
18119 exit condition if no tail loop is needed, and if the original exit
18120 condition should therefore be kept as-is.
18121 (tree_transform_and_unroll_loop): Handle that case here too.
18123 2021-07-21 Kewen Lin <linkw@linux.ibm.com>
18125 * tree-data-ref.c (free_dependence_relations): Adjust to pass vec
18127 (free_data_refs): Likewise.
18128 * tree-data-ref.h (free_dependence_relations): Likewise.
18129 (free_data_refs): Likewise.
18130 * tree-predcom.c (struct chain): Use auto_vec instead of vec for
18132 (struct component): Likewise.
18133 (pcom_worker::pcom_worker): Adjust for auto_vec and renaming changes.
18134 (pcom_worker::~pcom_worker): Likewise.
18135 (pcom_worker::release_chain): Adjust as auto_vec changes.
18136 (pcom_worker::loop): Rename to ...
18137 (pcom_worker::m_loop): ... this.
18138 (pcom_worker::datarefs): Rename to ...
18139 (pcom_worker::m_datarefs): ... this. Use auto_vec instead of vec.
18140 (pcom_worker::dependences): Rename to ...
18141 (pcom_worker::m_dependences): ... this. Use auto_vec instead of vec.
18142 (pcom_worker::chains): Rename to ...
18143 (pcom_worker::m_chains): ... this. Use auto_vec instead of vec.
18144 (pcom_worker::looparound_phis): Rename to ...
18145 (pcom_worker::m_looparound_phis): ... this. Use auto_vec instead of
18147 (pcom_worker::cache): Rename to ...
18148 (pcom_worker::m_cache): ... this. Use auto_vec instead of vec.
18149 (pcom_worker::release_chain): Adjust for auto_vec changes.
18150 (pcom_worker::release_chains): Adjust for auto_vec and renaming
18152 (release_component): Remove.
18153 (release_components): Adjust for release_component removal.
18154 (component_of): Adjust to use vec.
18155 (merge_comps): Likewise.
18156 (pcom_worker::aff_combination_dr_offset): Adjust for renaming changes.
18157 (pcom_worker::determine_offset): Likewise.
18158 (class comp_ptrs): Remove.
18159 (pcom_worker::split_data_refs_to_components): Adjust for renaming
18160 changes, for comp_ptrs removal with auto_vec.
18161 (pcom_worker::suitable_component_p): Adjust for renaming changes.
18162 (pcom_worker::filter_suitable_components): Adjust for release_component
18164 (pcom_worker::valid_initializer_p): Adjust for renaming changes.
18165 (pcom_worker::find_looparound_phi): Likewise.
18166 (pcom_worker::add_looparound_copies): Likewise.
18167 (pcom_worker::determine_roots_comp): Likewise.
18168 (pcom_worker::single_nonlooparound_use): Likewise.
18169 (pcom_worker::execute_pred_commoning_chain): Likewise.
18170 (pcom_worker::execute_pred_commoning): Likewise.
18171 (pcom_worker::try_combine_chains): Likewise.
18172 (pcom_worker::prepare_initializers_chain): Likewise.
18173 (pcom_worker::prepare_initializers): Likewise.
18174 (pcom_worker::prepare_finalizers_chain): Likewise.
18175 (pcom_worker::prepare_finalizers): Likewise.
18176 (pcom_worker::tree_predictive_commoning_loop): Likewise.
18178 2021-07-20 Martin Sebor <msebor@redhat.com>
18180 PR middle-end/101397
18181 * builtins.c (gimple_call_return_array): Add argument. Correct
18182 offsets for memchr, mempcpy, stpcpy, and stpncpy.
18183 (compute_objsize_r): Adjust offset computation for argument returning
18186 2021-07-20 Martin Sebor <msebor@redhat.com>
18188 PR middle-end/101300
18189 * tree-ssa-uninit.c (check_defs): Handle UBSAN built-ins.
18191 2021-07-20 Jeff Law <jlaw@localhost.localdomain>
18193 * function.c (assign_parm_setup_block): Use adjust_address instead
18194 of change_address to preserve MEM_EXPR and friends.
18196 2021-07-20 Martin Sebor <msebor@redhat.com>
18198 * cfgloop.h (single_likely_exit): Adjust by-value argument to
18199 by-const-reference.
18200 * cfgloopanal.c (single_likely_exit): Same.
18201 * cgraph.h (struct cgraph_node): Same.
18202 * cgraphclones.c (cgraph_node::create_virtual_clone): Same.
18203 * genautomata.c (merge_states): Same.
18204 * genextract.c (VEC_char_to_string): Same.
18205 * genmatch.c (dt_node::gen_kids_1): Same.
18206 (walk_captures): Adjust by-value argument to by-reference.
18207 * gimple-ssa-store-merging.c (check_no_overlap): Adjust by-value argument
18208 to by-const-reference.
18209 * gimple.c (gimple_build_call_vec): Same.
18210 (gimple_build_call_internal_vec): Same.
18211 (gimple_build_switch): Same.
18212 (sort_case_labels): Same.
18213 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
18215 * gimple.h (gimple_build_call_vec): Adjust by-value argument to
18216 by-const-reference.
18217 (gimple_build_call_internal_vec): Same.
18218 (gimple_build_switch): Same.
18219 (sort_case_labels): Same.
18220 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
18222 * haifa-sched.c (calc_priorities): Adjust by-value argument to
18223 by-const-reference.
18224 (sched_init_luids): Same.
18225 (haifa_init_h_i_d): Same.
18226 * ipa-cp.c (ipa_get_indirect_edge_target_1): Same.
18227 (adjust_callers_for_value_intersection): Adjust by-value argument to
18229 (find_more_scalar_values_for_callers_subset): Adjust by-value argument to
18230 by-const-reference.
18231 (find_more_contexts_for_caller_subset): Same.
18232 (find_aggregate_values_for_callers_subset): Same.
18233 (copy_useful_known_contexts): Same.
18234 * ipa-fnsummary.c (remap_edge_summaries): Same.
18235 (remap_freqcounting_predicate): Same.
18236 * ipa-inline.c (add_new_edges_to_heap): Adjust by-value argument to
18238 * ipa-predicate.c (predicate::remap_after_inlining): Adjust by-value argument
18239 to by-const-reference.
18240 * ipa-predicate.h (predicate::remap_after_inlining): Same.
18241 * ipa-prop.c (ipa_find_agg_cst_for_param): Same.
18242 * ipa-prop.h (ipa_find_agg_cst_for_param): Same.
18243 * ira-build.c (ira_loop_tree_body_rev_postorder): Same.
18244 * read-rtl.c (add_overload_instance): Same.
18245 * rtl.h (native_decode_rtx): Same.
18246 (native_decode_vector_rtx): Same.
18247 * sched-int.h (sched_init_luids): Same.
18248 (haifa_init_h_i_d): Same.
18249 * simplify-rtx.c (native_decode_vector_rtx): Same.
18250 (native_decode_rtx): Same.
18251 * tree-call-cdce.c (gen_shrink_wrap_conditions): Same.
18252 (shrink_wrap_one_built_in_call_with_conds): Same.
18253 (shrink_wrap_conditional_dead_built_in_calls): Same.
18254 * tree-data-ref.c (create_runtime_alias_checks): Same.
18255 (compute_all_dependences): Same.
18256 * tree-data-ref.h (compute_all_dependences): Same.
18257 (create_runtime_alias_checks): Same.
18258 (index_in_loop_nest): Same.
18259 * tree-if-conv.c (mask_exists): Same.
18260 * tree-loop-distribution.c (class loop_distribution): Same.
18261 (loop_distribution::create_rdg_vertices): Same.
18262 (dump_rdg_partitions): Same.
18263 (debug_rdg_partitions): Same.
18264 (partition_contains_all_rw): Same.
18265 (loop_distribution::distribute_loop): Same.
18266 * tree-parloops.c (oacc_entry_exit_ok_1): Same.
18267 (oacc_entry_exit_single_gang): Same.
18268 * tree-ssa-loop-im.c (hoist_memory_references): Same.
18269 (loop_suitable_for_sm): Same.
18270 * tree-ssa-loop-niter.c (bound_index): Same.
18271 * tree-ssa-reassoc.c (update_ops): Same.
18272 (swap_ops_for_binary_stmt): Same.
18273 (rewrite_expr_tree): Same.
18274 (rewrite_expr_tree_parallel): Same.
18275 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Same.
18276 * tree-ssa-sccvn.h (ao_ref_init_from_vn_reference): Same.
18277 * tree-ssa-structalias.c (process_all_all_constraints): Same.
18278 (make_constraints_to): Same.
18279 (handle_lhs_call): Same.
18280 (find_func_aliases_for_builtin_call): Same.
18281 (sort_fieldstack): Same.
18282 (check_for_overlaps): Same.
18283 * tree-vect-loop-manip.c (vect_create_cond_for_align_checks): Same.
18284 (vect_create_cond_for_unequal_addrs): Same.
18285 (vect_create_cond_for_lower_bounds): Same.
18286 (vect_create_cond_for_alias_checks): Same.
18287 * tree-vect-slp-patterns.c (vect_validate_multiplication): Same.
18288 * tree-vect-slp.c (vect_analyze_slp_instance): Same.
18289 (vect_make_slp_decision): Same.
18290 (vect_slp_bbs): Same.
18291 (duplicate_and_interleave): Same.
18292 (vect_transform_slp_perm_load): Same.
18293 (vect_schedule_slp): Same.
18294 * tree-vectorizer.h (vect_transform_slp_perm_load): Same.
18295 (vect_schedule_slp): Same.
18296 (duplicate_and_interleave): Same.
18297 * tree.c (build_vector_from_ctor): Same.
18298 (build_vector): Same.
18299 (check_vector_cst): Same.
18300 (check_vector_cst_duplicate): Same.
18301 (check_vector_cst_fill): Same.
18302 (check_vector_cst_stepped): Same.
18303 * tree.h (build_vector_from_ctor): Same.
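The entry above changes many by-value arguments to by-const-reference. A minimal sketch of the difference follows, assuming std::vector as a stand-in for GCC's own vec template (which has different copy semantics); sum_by_value and sum_by_cref are hypothetical names used only for illustration.

```cpp
#include <cassert>
#include <vector>

// By value: the caller's container is copied on every call.
inline long sum_by_value (std::vector<int> v)
{
  long total = 0;
  for (int x : v)
    total += x;
  return total;
}

// By const reference: no copy is made, and the callee cannot modify
// the caller's container.
inline long sum_by_cref (const std::vector<int> &v)
{
  long total = 0;
  for (int x : v)
    total += x;
  return total;
}
```

Both functions return the same result; only the cost of the call and the guarantee against modification differ.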
18305 2021-07-20 Jakub Jelinek <jakub@redhat.com>
18308 * config/rs6000/rs6000-protos.h (easy_altivec_constant): Change return
18309 type from bool to int.
18310 * config/rs6000/rs6000.c (vspltis_constant): Fix up handling the
18311 EASY_VECTOR_MSB case if either step or copies is not 1.
18312 (vspltis_shifted): Fix comment typo.
18313 (easy_altivec_constant): Change return type from bool to int, instead
18314 of returning true return byte size of the element mode that should be
18315 used to synthesize the constant.
18316 * config/rs6000/predicates.md (easy_vector_constant_msb): Require
18317 that vspltis_shifted is 0, handle the case where easy_altivec_constant
18318 assumes using different vector mode from CONST_VECTOR's mode.
18319 * config/rs6000/altivec.md (easy_vector_constant_msb splitter): Use
18320 easy_altivec_constant to determine mode in which -1 >> -1 should be
18321 performed, use rs6000_expand_vector_init instead of gen_vec_initv4sisi.
18323 2021-07-20 Richard Biener <rguenther@suse.de>
18326 * dwarf2out.h (dwarf_file_data): Add key member.
18327 * dwarf2out.c (dwarf_file_hasher::equal): Compare key.
18328 (dwarf_file_hasher::hash): Hash key.
18329 (lookup_filename): Remap the filename and store it in the
18330 filename member of dwarf_file_data when creating a new
18332 (file_name_acquire): Do not remap the filename again.
18333 (maybe_emit_file): Likewise.
18335 2021-07-20 Jonathan Wright <jonathan.wright@arm.com>
18337 * config/aarch64/aarch64-simd-builtins.def: Use two variant
18338 generators for all TBL/TBX intrinsics and rename to
18339 consistent forms: qtbl[1234] or qtbx[1234].
18340 * config/aarch64/aarch64-simd.md (aarch64_tbl1<mode>): Rename to...
18342 (aarch64_qtbl1<mode>): This.
18343 (aarch64_tbx1<mode>): Rename to...
18344 (aarch64_qtbx1<mode>): This.
18345 (aarch64_tbl2v16qi): Delete.
18346 (aarch64_tbl3<mode>): Rename to...
18347 (aarch64_qtbl2<mode>): This.
18348 (aarch64_tbx4<mode>): Rename to...
18349 (aarch64_qtbx2<mode>): This.
18350 * config/aarch64/aarch64.c (aarch64_expand_vec_perm_1): Use
18351 renamed qtbl1 and qtbl2 RTL patterns.
18352 * config/aarch64/arm_neon.h (vqtbl1_p8): Use renamed qtbl1 RTL pattern.
18354 (vqtbl1_s8): Likewise.
18355 (vqtbl1_u8): Likewise.
18356 (vqtbl1q_p8): Likewise.
18357 (vqtbl1q_s8): Likewise.
18358 (vqtbl1q_u8): Likewise.
18359 (vqtbx1_s8): Use renamed qtbx1 RTL pattern.
18360 (vqtbx1_u8): Likewise.
18361 (vqtbx1_p8): Likewise.
18362 (vqtbx1q_s8): Likewise.
18363 (vqtbx1q_u8): Likewise.
18364 (vqtbx1q_p8): Likewise.
18365 (vtbl1_s8): Use renamed qtbl1 RTL pattern.
18366 (vtbl1_u8): Likewise.
18367 (vtbl1_p8): Likewise.
18368 (vtbl2_s8): Likewise.
18369 (vtbl2_u8): Likewise.
18370 (vtbl2_p8): Likewise.
18371 (vtbl3_s8): Use renamed qtbl2 RTL pattern.
18372 (vtbl3_u8): Likewise.
18373 (vtbl3_p8): Likewise.
18374 (vtbl4_s8): Likewise.
18375 (vtbl4_u8): Likewise.
18376 (vtbl4_p8): Likewise.
18377 (vtbx2_s8): Use renamed qtbx2 RTL pattern.
18378 (vtbx2_u8): Likewise.
18379 (vtbx2_p8): Likewise.
18380 (vqtbl2_s8): Use renamed qtbl2 RTL pattern.
18381 (vqtbl2_u8): Likewise.
18382 (vqtbl2_p8): Likewise.
18383 (vqtbl2q_s8): Likewise.
18384 (vqtbl2q_u8): Likewise.
18385 (vqtbl2q_p8): Likewise.
18386 (vqtbx2_s8): Use renamed qtbx2 RTL pattern.
18387 (vqtbx2_u8): Likewise.
18388 (vqtbx2_p8): Likewise.
18389 (vqtbx2q_s8): Likewise.
18390 (vqtbx2q_u8): Likewise.
18391 (vqtbx2q_p8): Likewise.
18392 (vtbx4_s8): Likewise.
18393 (vtbx4_u8): Likewise.
18394 (vtbx4_p8): Likewise.
18396 2021-07-20 Uroš Bizjak <ubizjak@gmail.com>
18399 * config/i386/sync.md (define_peephole2 atomic_storedi_fpu):
18401 (define_peephole2 atomic_loaddi_fpu): Ditto.
18403 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
18405 * config.gcc (riscv*-*-*): Detect which python is available.
18407 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
18409 * config/rs6000/vsx.md (mulhs_<mode>): Rename to...
18410 (smul<mode>3_highpart): ... this.
18411 (mulhu_<mode>): Rename to...
18412 (umul<mode>3_highpart): ... this.
18413 * config/rs6000/rs6000-builtin.def (MULHS_V2DI, MULHS_V4SI,
18414 MULHU_V2DI, MULHU_V4SI): Adjust.
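The smul<mode>3_highpart and umul<mode>3_highpart names above refer to a multiply that keeps only the upper half of the double-width product. A scalar sketch of that operation for 32-bit elements follows; mulhs32 and mulhu32 are hypothetical helper names, not GCC functions, and the signed variant assumes an arithmetic right shift of negative values (guaranteed only from C++20, though common in practice).

```cpp
#include <cassert>
#include <cstdint>

// Signed multiply highpart: widen to 64 bits, multiply, and keep the
// upper 32 bits of the product.
inline std::int32_t mulhs32 (std::int32_t a, std::int32_t b)
{
  return static_cast<std::int32_t> ((static_cast<std::int64_t> (a) * b) >> 32);
}

// Unsigned multiply highpart: the same computation on unsigned operands.
inline std::uint32_t mulhu32 (std::uint32_t a, std::uint32_t b)
{
  return static_cast<std::uint32_t> ((static_cast<std::uint64_t> (a) * b) >> 32);
}
```

For example, mulhs32 (0x40000000, 4) yields 1, because the full product is 2^32 and only bit 32 and above are kept.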
18416 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
18418 PR tree-optimization/100696
18419 * internal-fn.c (first_commutative_argument): Add info for IFN_MULH.
18420 * internal-fn.def (IFN_MULH): New internal function.
18421 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Add support to
18422 recog normal multiply highpart as IFN_MULH.
18423 * config/i386/i386.c (ix86_add_stmt_cost): Adjust for combined
18426 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
18428 * config/elfos.h (CTF_DEBUGGING_INFO): New definition.
18429 (BTF_DEBUGGING_INFO): Likewise.
18430 * doc/tm.texi.in: Document the new macros.
18431 * doc/tm.texi: Regenerated.
18432 * toplev.c: Guard initialization of debug hooks.
18434 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
18436 * flags.h (ctf_debuginfo_p): New function declaration.
18437 * opts.c (ctf_debuginfo_p): New function definition.
18439 2021-07-19 Andrew Stubbs <ams@codesourcery.com>
18442 * config/gcn/gcn-hsa.h (DRIVER_SELF_SPECS): New.
18443 (ASM_SPEC): Set -mattr for xnack and sram-ecc.
18444 * config/gcn/gcn-opts.h (enum sram_ecc_type): New.
18445 * config/gcn/gcn-valu.md: Add a warning comment.
18446 * config/gcn/gcn.c (gcn_option_override): Add "sorry" for -mxnack.
18447 (output_file_start): Add xnack and sram-ecc state to ".amdgcn_target".
18448 * config/gcn/gcn.md: Add a warning comment.
18449 * config/gcn/gcn.opt: Add -mxnack and -msram-ecc.
18450 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): Remove
18452 (EF_AMDGPU_XNACK): New.
18453 (EF_AMDGPU_SRAM_ECC): New.
18455 (copy_early_debug_info): Use elf_flags.
18456 (main): Handle -mxnack and -msram-ecc options.
18457 * doc/invoke.texi: Document -mxnack and -msram-ecc.
18459 2021-07-19 Andrew Pinski <apinski@marvell.com>
18462 * config/aarch64/aarch64.md (csneg3_uxtw_insn): Rename to ...
18463 (*cs<neg_not_cs>3_uxtw_insn4): ... this, and extend to NEG_NOT.
18465 2021-07-19 Richard Biener <rguenther@suse.de>
18467 PR tree-optimization/101505
18468 * tree-vect-patterns.c (vect_determine_precisions): Walk
18469 PHIs also for loop vectorization.
18471 2021-07-19 Richard Biener <rguenther@suse.de>
18473 * gimple.h (gimple_expr_type): Remove.
18474 * doc/gimple.texi: Remove gimple_expr_type documentation.
18476 2021-07-19 Richard Biener <rguenther@suse.de>
18478 * tree-ssa-sccvn.c (vn_reference_eq): Handle NULL vr->type.
18479 (ao_ref_init_from_vn_reference): Likewise.
18480 (fully_constant_reference): Likewise.
18481 (vn_reference_lookup_call): Do not set vr->type to random
18483 * tree-ssa-pre.c (compute_avail): Do not try to PRE calls
18485 * tree-vect-generic.c (expand_vector_piecewise): Pass in
18486 whether we expanded parallel.
18487 (expand_vector_parallel): Adjust.
18488 (expand_vector_addition): Likewise.
18489 (expand_vector_comparison): Likewise.
18490 (expand_vector_operation): Likewise.
18491 (expand_vector_scalar_condition): Likewise.
18492 (expand_vector_conversion): Likewise.
18494 2021-07-19 Richard Biener <rguenther@suse.de>
18496 * tree-vrp.c (register_edge_assert_for_2): Use the
18498 (vrp_folder::fold_predicate_in): Likewise.
18499 * vr-values.c (gimple_assign_nonzero_p): Likewise.
18500 (vr_values::extract_range_from_comparison): Likewise.
18501 (vr_values::extract_range_from_ubsan_builtin): Use the
18502 type of the first operand.
18503 (vr_values::extract_range_basic): Push down type
18504 computation, use the appropriate LHS.
18505 (vr_values::extract_range_from_assignment): Use the
18508 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
18511 * common/config/i386/i386-common.c (ix86_handle_option): For
18512 -mgeneral-regs-only, enable the GPR only instructions which are
18513 enabled implicitly by SSE ISAs unless they have been disabled
18516 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
18519 * config/i386/i386.c (ix86_check_avx_upper_stores): Moved before
18520 ix86_avx_u128_mode_needed.
18521 (ix86_avx_u128_mode_needed): Return AVX_U128_DIRTY if callee
18522 returns AVX register.
18524 2021-07-17 Jan Hubicka <hubicka@ucw.cz>
18526 * tree-ssa-structalias.c (handle_rhs_call): Support EAF_NOT_RETURNED.
18527 (handle_const_call): Likewise.
18528 (handle_pure_call): Likewise.
18530 2021-07-17 Andrew MacLeod <amacleod@redhat.com>
18532 PR tree-optimization/96542
18533 * range-op.cc (range_operator::wi_fold_in_parts): New.
18534 (range_operator::fold_range): Call wi_fold_in_parts.
18535 (operator_lshift::wi_fold): Fix broken lshift by [0,0].
18536 * range-op.h (wi_fold_in_parts): Add prototype.
18538 2021-07-16 David Malcolm <dmalcolm@redhat.com>
18540 * doc/analyzer.texi: Add __analyzer_dump_state.
18542 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18544 * config/rs6000/rbtree.c: New file.
18545 * config/rs6000/rbtree.h: New file.
18547 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18549 * config/rs6000/rs6000-gen-builtins.c (restriction): New enum.
18550 (typeinfo): Add restr field.
18551 (match_bracketed_pair): New function.
18552 (match_const_restriction): Implement.
18554 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18556 * config/rs6000/rs6000-gen-builtins.c (match_basetype): Implement.
18558 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18560 * config/rs6000/rs6000-gen-builtins.c (void_status): New enum.
18561 (basetype): Likewise.
18562 (typeinfo): Likewise.
18563 (handle_pointer): New function.
18564 (match_basetype): New stub function.
18565 (match_const_restriction): Likewise.
18566 (match_type): New function.
18568 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18570 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace): New
18572 (advance_line): Likewise.
18573 (safe_inc_pos): Likewise.
18574 (match_identifier): Likewise.
18575 (match_integer): Likewise.
18576 (match_to_right_bracket): Likewise.
18578 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18580 * config/rs6000/rs6000-gen-builtins.c (bif_file): New variable.
18581 (ovld_file): Likewise.
18582 (header_file): Likewise.
18583 (init_file): Likewise.
18584 (defines_file): Likewise.
18585 (pgm_path): Likewise.
18586 (bif_path): Likewise.
18587 (ovld_path): Likewise.
18588 (header_path): Likewise.
18589 (init_path): Likewise.
18590 (defines_path): Likewise.
18591 (LINELEN): New macro.
18592 (linebuf): New variable.
18596 (bif_diag): New function.
18597 (ovld_diag): Likewise.
18599 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18601 * config/rs6000/rs6000-builtin-new.def: New.
18602 * config/rs6000/rs6000-overload.def: New.
18604 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18606 * config/rs6000/rs6000-gen-builtins.c: New.
18608 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18610 * Makefile.in (EXTRA_GTYPE_DEPS): New variable.
18611 (s-gtype): Depend on EXTRA_GTYPE_DEPS.
18612 * gengtype-state.c (state_writer::write_state_file_list): Add a
18613 parameter to the fileslist expression for the number of build
18615 (read_state_files_list): Detect build headers and strip the
18616 initial "./" or ".\" from their names.
18617 * gengtype.c (build_headers): New global variable.
18618 (num_build_headers): Likewise.
18619 (open_base_files): Emit #include for each build header.
18620 (main): Detect and count build headers.
18621 * gengtype.h (build_headers): New extern variable.
18622 (num_build_headers): Likewise.
18624 2021-07-16 Richard Biener <rguenther@suse.de>
18626 * gimple-ssa-store-merging.c (verify_symbolic_number_p): Use
18627 the type of the LHS.
18628 (find_bswap_or_nop_1): Likewise.
18629 (find_bswap_or_nop): Likewise.
18630 * tree-vectorizer.h (vect_get_smallest_scalar_type): Adjust
18632 * tree-vect-data-refs.c (vect_get_smallest_scalar_type):
18633 Remove unused parameters, pass in the scalar type. Fix
18634 internal store function handling.
18635 * tree-vect-stmts.c (vect_analyze_stmt): Remove assert.
18636 (vect_get_vector_types_for_stmt): Move down check for
18637 existing vector stmt after we've determined a scalar type.
18638 Pass down the used scalar type to vect_get_smallest_scalar_type.
18639 * tree-vect-generic.c (expand_vector_condition): Use
18640 the type of the LHS.
18641 (expand_vector_scalar_condition): Likewise.
18642 (expand_vector_operations_1): Likewise.
18643 * tree-vect-patterns.c (vect_widened_op_tree): Likewise.
18644 (vect_recog_dot_prod_pattern): Likewise.
18645 (vect_recog_sad_pattern): Likewise.
18646 (vect_recog_widen_op_pattern): Likewise.
18647 (vect_recog_widen_sum_pattern): Likewise.
18648 (vect_recog_mixed_size_cond_pattern): Likewise.
18650 2021-07-16 Jan Hubicka <hubicka@ucw.cz>
18652 * ipa-modref.c (struct escape_entry): Use eaf_flags_t.
18653 (dump_eaf_flags): Dump EAF_NOT_RETURNED.
18654 (eaf_flags_useful_p): Use eaf_flags_t; handle const functions
18655 and EAF_NOT_RETURNED.
18656 (modref_summary::useful_p): Likewise.
18657 (modref_summary_lto::useful_p): Likewise.
18658 (struct modref_summary_lto): Use eaf_flags_t.
18659 (deref_flags): Handle EAF_NOT_RETURNED.
18660 (struct escape_point): Use min_flags.
18661 (modref_lattice::init): Add EAF_NOT_RETURNED.
18662 (merge_call_lhs_flags): Ignore EAF_NOT_RETURNED functions.
18663 (analyze_ssa_name_flags): Clear EAF_NOT_RETURNED on return;
18665 (analyze_parms): Also analyze const functions; update condition on
18667 (modref_write): Update streaming.
18668 (read_section): Update streaming.
18669 (remap_arg_flags): Use eaf_flags_t.
18670 (modref_merge_call_site_flags): Handle EAF_NOT_RETURNED.
18671 * ipa-modref.h (eaf_flags_t): New typedef.
18672 (struct modref_summary): Use eaf_flags_t.
18673 * tree-core.h (EAF_NOT_RETURNED): New constant.
18675 2021-07-16 Richard Biener <rguenther@suse.de>
18677 * gimple-fold.c (gimple_fold_stmt_to_constant_1): Use
18678 the type of the LHS.
18679 (gimple_assign_nonnegative_warnv_p): Likewise.
18680 (gimple_call_nonnegative_warnv_p): Likewise. Return false
18681 if the call has no LHS.
18682 * gimple.c (gimple_could_trap_p_1): Use the type of the LHS.
18683 * tree-eh.c (stmt_could_throw_1_p): Likewise.
18684 * tree-inline.c (insert_init_stmt): Likewise.
18685 * tree-ssa-loop-niter.c (get_val_for): Likewise.
18686 * tree-outof-ssa.c (ssa_is_replaceable_p): Use the type of
18688 * tree-ssa-sccvn.c (init_vn_nary_op_from_stmt): Take a
18689 gassign *. Use the type of the lhs.
18690 (vn_nary_op_lookup_stmt): Adjust.
18691 (vn_nary_op_insert_stmt): Likewise.
18693 2021-07-16 Ilya Leoshkevich <iii@linux.ibm.com>
18695 * config/s390/predicates.md (bras_sym_operand): Accept all
18696 functions in 64-bit mode, use UNSPEC_PLT31.
18697 (larl_operand): Use UNSPEC_PLT31.
18698 * config/s390/s390.c (s390_loadrelative_operand_p): Likewise.
18699 (legitimize_pic_address): Likewise.
18700 (s390_emit_tls_call_insn): Mark __tls_get_offset as function,
18702 (s390_delegitimize_address): Use UNSPEC_PLT31.
18703 (s390_output_addr_const_extra): Likewise.
18704 (print_operand): Add @PLT to TLS calls, handle %K.
18705 (s390_function_profiler): Mark __fentry__/_mcount as function,
18706 use %K, use UNSPEC_PLT31.
18707 (s390_output_mi_thunk): Use only UNSPEC_GOT, use %K.
18708 (s390_emit_call): Use UNSPEC_PLT31.
18709 (s390_emit_tpf_eh_return): Mark __tpf_eh_return as function.
18710 * config/s390/s390.md (UNSPEC_PLT31): Rename from UNSPEC_PLT.
18711 (*movdi_64): Use %K.
18712 (reload_base_64): Likewise.
18713 (*sibcall_brc): Likewise.
18714 (*sibcall_brcl): Likewise.
18715 (*sibcall_value_brc): Likewise.
18716 (*sibcall_value_brcl): Likewise.
18718 (*brasl): Likewise.
18719 (*bras_r): Likewise.
18720 (*brasl_r): Likewise.
18721 (*bras_tls): Likewise.
18722 (*brasl_tls): Likewise.
18723 (main_base_64): Likewise.
18724 (reload_base_64): Likewise.
18725 (@split_stack_call<mode>): Likewise.
18727 2021-07-16 Richard Biener <rguenther@suse.de>
18729 PR tree-optimization/101467
18730 * tree-vect-stmts.c (vect_gen_while): Properly guard
18731 make_temp_ssa_name usage.
18733 2021-07-16 Cooper Qu <cooper.qu@linux.alibaba.com>
18735 * config.gcc: Don't use forked print-sysroot-suffix.sh and
18736 t-sysroot-suffix for C-SKY.
18737 * config/csky/print-sysroot-suffix.sh: Delete.
18738 * config/csky/t-csky-linux: Delete.
18739 * config/csky/t-sysroot-suffix: Define MULTILIB_DIRNAMES
18740 instead of CSKY_MULTILIB_DIRNAMES.
18742 2021-07-16 Richard Biener <rguenther@suse.de>
18744 * tree-vect-loop.c (vect_transform_cycle_phi): Correct sign
18745 conversion issues with the partial reduction of the reused
18746 vector accumulator.
18748 2021-07-16 Richard Biener <rguenther@suse.de>
18750 * config/i386/i386-options.c (ix86_option_override_internal): Set
18751 param_vect_partial_vector_usage to zero if not set.
18753 2021-07-15 Uroš Bizjak <ubizjak@gmail.com>
18756 * config/i386/i386.h (VALID_SSE_REG_MODE): Add TDmode.
18757 (VALID_INT_MODE_P): Add SDmode and DDmode.
18758 Add TDmode for TARGET_64BIT.
18759 (VALID_DFP_MODE_P): Remove.
18760 * config/i386/i386.c (ix86_hard_regno_mode_ok):
18761 Do not use VALID_DFP_MODE_P.
18763 2021-07-15 Andrew MacLeod <amacleod@redhat.com>
18765 * gimple-range-fold.cc (adjust_pointer_diff_expr): Use
18767 (fold_using_range::fold_stmt): Ditto.
18768 (fold_using_range::range_of_range_op): Ditto.
18769 (fold_using_range::range_of_phi): Ditto.
18770 (fold_using_range::range_of_call): Ditto.
18771 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
18772 (fold_using_range::range_of_builtin_call): Ditto.
18773 (fold_using_range::range_of_cond_expr): Ditto.
18774 * gimple-range-fold.h (gimple_range_type): New.
18776 2021-07-15 Martin Sebor <msebor@redhat.com>
18778 PR middle-end/97027
18779 * tree-ssa-strlen.c (handle_assign): New function.
18780 (maybe_warn_overflow): Add argument.
18781 (nonzero_bytes_for_type): New function.
18782 (count_nonzero_bytes): Handle more tree types. Call
18783 nonzero_bytes_for_type.
18784 (count_nonzero_bytes): Handle types.
18785 (handle_store): Handle stores from function calls.
18786 (strlen_check_and_optimize_call): Move code to handle_assign. Call
18787 it for assignments from function calls.
18789 2021-07-15 David Malcolm <dmalcolm@redhat.com>
18794 * doc/invoke.texi: Add -Wanalyzer-use-of-uninitialized-value.
18796 2021-07-15 David Malcolm <dmalcolm@redhat.com>
18798 * doc/invoke.texi (-fdump-analyzer-exploded-paths): New.
18800 2021-07-15 Martin Sebor <msebor@redhat.com>
18804 * fold-const.c (operand_compare::operand_equal_p): Handle OEP_DECL_NAME.
18805 (operand_compare::verify_hash_value): Same.
18806 * tree-core.h (OEP_DECL_NAME): New.
18808 2021-07-15 Martin Jambor <mjambor@suse.cz>
18810 * profile-count.h (profile_count::value): Change the return type to
18812 * gimple-pretty-print.c (dump_gimple_bb_header): Adjust print
18814 * tree-cfg.c (dump_function_to_file): Likewise.
18816 2021-07-15 Bill Schmidt <wschmidt@linux.ibm.com>
18819 * config/rs6000/rs6000-p8swap.c (has_part_mult): New.
18820 (rs6000_analyze_swaps): Insns containing a subreg of a mult are
18823 2021-07-15 Richard Biener <rguenther@suse.de>
18825 * tree-vectorizer.h (vect_gen_while): Match up with
18826 vect_gen_while_not.
18827 * tree-vect-stmts.c (vect_gen_while): Adjust API to that
18828 of vect_gen_while_not.
18829 (vect_gen_while_not): Adjust.
18830 * tree-vect-loop-manip.c (vect_set_loop_controls_directly): Likewise.
18832 2021-07-15 Aldy Hernandez <aldyh@redhat.com>
18834 * gimple-range-cache.cc (non_null_ref::adjust_range): New.
18835 (ranger_cache::range_of_def): Call adjust_range.
18836 (ranger_cache::entry_range): Same.
18837 * gimple-range-cache.h (non_null_ref::adjust_range): New.
18838 * gimple-range.cc (gimple_ranger::range_of_expr): Call
18840 (gimple_ranger::range_on_entry): Same.
18842 2021-07-15 Tamar Christina <tamar.christina@arm.com>
18845 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18847 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
18849 2021-07-15 Tamar Christina <tamar.christina@arm.com>
18852 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18854 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
18855 (sdot_prod, udot_prod): ...These.
18856 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
18857 (aarch64_<sur>dot<vsi2qi>): Rename to...
18858 (<sur>dot_prod<vsi2qi>): ...This.
18859 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
18862 2021-07-15 Jakub Jelinek <jakub@redhat.com>
18864 PR middle-end/101437
18865 * gimplify.c (gimplify_expr): Throw away volatile reads from empty
18866 types even if they have non-BLKmode TYPE_MODE.
18868 2021-07-15 Richard Biener <rguenther@suse.de>
18871 * gcc.c (process_command): Process -gtoggle like process_options
18872 would after parsing options.
18874 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18876 * cfgexpand.c (expand_asm_loc): Adjust.
18877 (expand_asm_stmt): Likewise.
18878 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Likewise.
18879 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
18880 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
18881 * config/avr/avr.c (avr_md_asm_adjust): Likewise.
18882 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
18883 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
18884 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
18885 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
18886 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
18887 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
18888 * config/s390/s390.c (s390_md_asm_adjust): Likewise.
18889 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
18890 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
18891 * doc/tm.texi: Regenerate.
18892 * target.def: Add location argument to md_asm_adjust.
18894 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18896 * tree-diagnostic.c (diagnostic_report_current_function): Use the
18897 diagnostic's location, not input_location.
18899 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18901 * cfgexpand.c (tree_conflicts_with_clobbers_p): Pass location to
18903 (expand_asm_stmt): Likewise.
18905 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
18907 * config/rs6000/rs6000.c (adjacent_mem_locations): Return the lower
18908 addressed memory rtx, if any.
18909 (rs6000_split_multireg_move): Fix code formatting.
18910 Handle MMA build built-ins with operands in adjacent memory locations.
18912 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
18914 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Move to later
18917 2021-07-14 Jason Merrill <jason@redhat.com>
18919 * sel-sched-ir.h (get_all_loop_exits): Use auto_vec.
18921 2021-07-14 Jason Merrill <jason@redhat.com>
18923 * doc/invoke.texi: -fdelete-dead-exceptions is on by default for
18926 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18928 * tree-vect-patterns.c (vect_recog_dot_prod_pattern):
18929 Remove erroneous line.
18931 2021-07-14 Andrew MacLeod <amacleod@redhat.com>
18933 * params.opt (param_evrp_mode): Change default.
18935 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18937 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
18938 (sdot_prod, udot_prod): ...These.
18939 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
18940 (aarch64_<sur>dot<vsi2qi>): Rename to...
18941 (<sur>dot_prod<vsi2qi>): ...This.
18942 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
18945 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18947 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
18949 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18951 * doc/sourcebuild.texi (arm_v8_2a_i8mm_neon_hw): Document.
18953 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18955 * config/arm/neon.md (usdot_prod<vsi2qi>): New.
18957 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18959 * config/aarch64/aarch64-simd.md (aarch64_usdot<vsi2qi>): Rename to...
18960 (usdot_prod<vsi2qi>): ... This.
18961 * config/aarch64/aarch64-simd-builtins.def (usdot): Rename to...
18962 (usdot_prod): ...This.
18963 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Likewise.
18964 * config/aarch64/aarch64-sve.md (@aarch64_<sur>dot_prod<vsi2qi>):
18966 (@<sur>dot_prod<vsi2qi>): ...This.
18967 * config/aarch64/aarch64-sve-builtins-base.cc
18968 (svusdot_impl::expand): Use it.
18970 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18972 * optabs.def (usdot_prod_optab): New.
18973 * doc/md.texi: Document it and clarify other dot prod optabs.
18974 * optabs-tree.h (enum optab_subtype): Add optab_vector_mixed_sign.
18975 * optabs-tree.c (optab_for_tree_code): Support usdot_prod_optab.
18976 * optabs.c (expand_widen_pattern_expr): Likewise.
18977 * tree-cfg.c (verify_gimple_assign_ternary): Likewise.
18978 * tree-vect-loop.c (vectorizable_reduction): Query dot-product kind.
18979 * tree-vect-patterns.c (vect_supportable_direct_optab_p): Take optional
18981 (vect_widened_op_tree): Optionally ignore
18983 (vect_recog_dot_prod_pattern): Support usdot_prod_optab.
18985 2021-07-14 H.J. Lu <hjl.tools@gmail.com>
18988 * config/i386/driver-i386.c (host_detect_local_cpu): Check
18989 "arch [32|64]" and "tune [32|64]" for 32-bit and 64-bit codegen.
18990 Enable UINTR only for 64-bit codegen.
18991 * config/i386/i386-options.c
18992 (ix86_option_override_internal::DEF_PTA): Skip PTA_UINTR if not
18994 * config/i386/i386.h (ARCH_ARG): New.
18995 (CC1_CPU_SPEC): Pass "[arch|tune] 32" for 32-bit codegen and
18996 "[arch|tune] 64" for 64-bit codegen.
18998 2021-07-14 Richard Biener <rguenther@suse.de>
19000 PR tree-optimization/101445
19001 * tree-vect-stmts.c (vectorizable_load): Do the gap adjustment
19002 of the IV in the correct direction for negative stride
19005 2021-07-14 Jakub Jelinek <jakub@redhat.com>
19008 * godump.c (godump_str_hash): New type.
19009 (godump_container::pot_dummy_types): Use string_hash instead of
19010 ptr_hash in the hash_set.
19012 2021-07-14 Richard Biener <rguenther@suse.de>
19014 * tree-vect-loop.c (vect_find_reusable_accumulator): Handle
19015 vector types where the old vector type has a multiple of
19016 the new vector type elements.
19017 (vect_create_partial_epilog): New function, split out from...
19018 (vect_create_epilog_for_reduction): ... here.
19019 (vect_transform_cycle_phi): Reduce the re-used accumulator
19020 to the new vector type.
19022 2021-07-14 Alexandre Oliva <oliva@adacore.com>
19024 * tree-ssa-alias.c (attr_fnspec::verify): Fix index in
19025 non-'t'-sized arg check.
19027 2021-07-14 Alexandre Oliva <oliva@adacore.com>
19029 * tree-cfg.c (cleanup_dead_labels_eh): Update
19030 post_landing_pad label upon change of landing pad block's
19032 (cleanup_dead_labels): Check that a removed label is not that
19035 2021-07-13 Jonathan Wright <jonathan.wright@arm.com>
19037 * combine.c (combine_simplify_rtx): Add vec_select -> subreg
19039 * config/aarch64/aarch64.md (*zero_extend<SHORT:mode><GPI:mode>2_aarch64):
19040 Add Neon to general purpose register case for zero-extend
19042 * config/arm/vfp.md (*arm_movsi_vfp): Remove "*" from *t -> r
19043 case to prevent some cases opting to go through memory.
19044 * cse.c (fold_rtx): Add vec_select -> subreg simplification.
19045 * rtl.c (rtvec_series_p): Define predicate to determine
19046 whether a vector contains a linear series of integers.
19047 * rtl.h (rtvec_series_p): Define.
19048 * rtlanal.c (vec_series_lowpart_p): Define predicate to
19049 determine if a vector selection is equivalent to the low part
19051 * rtlanal.h (vec_series_lowpart_p): Define.
19052 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
19053 Add vec_select -> subreg simplification.
19055 2021-07-13 Paul A. Clarke <pc@us.ibm.com>
19057 * config/rs6000/smmintrin.h (_mm_testz_si128, _mm_testc_si128,
19058 _mm_testnzc_si128, _mm_test_all_ones, _mm_test_all_zeros,
19059 _mm_test_mix_ones_zeros): New.
19061 2021-07-13 Roger Sayle <roger@nextmovesoftware.com>
19062 Richard Biener <rguenther@suse.de>
19064 * gimple.c (gimple_could_trap_p_1): Make S argument a
19065 "const gimple*". Preserve constness in call to
19066 gimple_asm_volatile_p.
19067 (gimple_could_trap_p): Make S argument a "const gimple*".
19068 * gimple.h (gimple_could_trap_p_1, gimple_could_trap_p):
19069 Update function prototypes.
19071 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19073 * tree-vectorizer.h (vect_reusable_accumulator): New structure.
19074 (_loop_vec_info::main_loop_edge): New field.
19075 (_loop_vec_info::skip_main_loop_edge): Likewise.
19076 (_loop_vec_info::skip_this_loop_edge): Likewise.
19077 (_loop_vec_info::reusable_accumulators): Likewise.
19078 (_stmt_vec_info::reduc_scalar_results): Likewise.
19079 (_stmt_vec_info::reused_accumulator): Likewise.
19080 (vect_get_main_loop_result): Declare.
19081 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize
19082 reduc_scalar_inputs.
19083 (vec_info::free_stmt_vec_info): Free reduc_scalar_inputs.
19084 * tree-vect-loop-manip.c (vect_get_main_loop_result): New function.
19085 (vect_do_peeling): Fill an epilogue loop's main_loop_edge,
19086 skip_main_loop_edge and skip_this_loop_edge fields.
19087 * tree-vect-loop.c (INCLUDE_ALGORITHM): Define.
19088 (vect_emit_reduction_init_stmts): New function.
19089 (get_initial_def_for_reduction): Use it.
19090 (get_initial_defs_for_reduction): Likewise. Change the vinfo
19091 parameter to a loop_vec_info.
19092 (vect_create_epilog_for_reduction): Store the scalar results
19093 in the reduc_info. If an epilogue loop is reusing an accumulator
19094 from the main loop, and if the epilogue loop can also be skipped,
19095 try to place the reduction code in the join block. Record
19096 accumulators that could potentially be reused by epilogue loops.
19097 (vect_transform_cycle_phi): When vectorizing epilogue loops,
19098 try to reuse accumulators from the main loop. Record the initial
19099 value in reduc_info for non-SLP reductions too.
19101 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19103 * tree-vect-loop.c (get_initial_def_for_reduction): Remove
19104 adjustment handling. Take the neutral value as an argument,
19105 in place of the code argument.
19106 (vect_transform_cycle_phi): Update accordingly. Handle the
19107 initial values of cond reductions separately from code reductions.
19108 Choose the adjustment here rather than in
19109 get_initial_def_for_reduction. Sink the splat of vec_initial_def.
19111 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19113 * tree-vect-loop.c (neutral_op_for_slp_reduction): Replace with...
19114 (neutral_op_for_reduction): ...this, providing a more general
19116 (vect_create_epilog_for_reduction): Update accordingly.
19117 (vectorizable_reduction): Likewise.
19118 (vect_transform_cycle_phi): Likewise.
19120 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19122 * tree-vect-loop.c (get_initial_def_for_reduction): Take the
19123 reduc_info instead of the original stmt_vec_info.
19124 (vect_transform_cycle_phi): Update accordingly.
19126 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19128 * tree-vect-loop.c (get_initial_defs_for_reduction): Take the
19129 reduc_info as an additional parameter.
19130 (vect_transform_cycle_phi): Update accordingly.
19132 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19134 * tree-vectorizer.h: Include tree-ssa-operands.h.
19135 (vect_phi_initial_value): New function.
19136 * tree-vect-loop.c (neutral_op_for_slp_reduction): Use it.
19137 (get_initial_defs_for_reduction, info_for_reduction): Likewise.
19138 (vect_create_epilog_for_reduction, vectorizable_reduction): Likewise.
19139 (vect_transform_cycle_phi, vectorizable_induction): Likewise.
19141 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19143 * tree-vect-loop.c (vect_create_epilog_for_reduction): Convert
19144 the phi results to vectype after creating them. Remove later
19145 conversion code that thus becomes redundant.
19147 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19149 * tree-vect-loop.c (vect_create_epilog_for_reduction): Replace
19150 the new_phis vector with a reduc_inputs vector. Combine handling
19151 of reduction chains and ncopies > 1.
19153 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19155 * tree-vect-loop.c (vect_create_epilog_for_reduction): Truncate
19156 scalar_results to group_size elements after reducing down from
19157 N*group_size elements. Construct an array_slice of the live-out
19158 stmts and assert that there is one stmt per scalar result.
19160 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19162 * tree-vect-loop.c (vect_create_epilog_for_reduction): Remove
19163 nested_in_vect_loop and use double_reduc everywhere. Remove dead
19164 assignment to "loop".
19166 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19168 * internal-fn.c (vectorized_internal_fn_supported_p): Handle
19169 vector types first. For scalar types, consider both the preferred
19170 vector mode and the alternative vector modes.
19171 * optabs-query.c (can_vec_mask_load_store_p): Use the same
19172 structure as above, in particular using related_vector_mode
19173 for modes provided by autovectorize_vector_modes.
19175 2021-07-13 Jakub Jelinek <jakub@redhat.com>
19176 Richard Biener <rguenther@suse.de>
19178 PR tree-optimization/101419
19179 * tree-pass.h (PROP_objsz): Define.
19180 (make_pass_early_object_sizes): Declare.
19181 * passes.def (pass_all_early_optimizations): Rename pass_object_sizes
19182 there to pass_early_object_sizes, drop parameter.
19183 (pass_all_optimizations): Move pass_object_sizes right after pass_ccp,
19184 drop parameter, move pass_post_ipa_warn right after that.
19185 * tree-object-size.c (pass_object_sizes::execute): Rename to...
19186 (object_sizes_execute): ... this. Add insert_min_max_p argument.
19187 (pass_data_object_sizes): Move after object_sizes_execute.
19188 (pass_object_sizes): Likewise. In execute method call
19189 object_sizes_execute, drop set_pass_param method and insert_min_max_p
19190 non-static data member and its initializer in the ctor.
19191 (pass_data_early_object_sizes, pass_early_object_sizes,
19192 make_pass_early_object_sizes): New.
19193 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Use
19194 (cfun->curr_properties & PROP_objsz) instead of cfun->after_inlining.
19196 2021-07-13 Kito Cheng <kito.cheng@sifive.com>
19199 * config/riscv/constraints.md ("S"): Update description and remove
19201 * doc/md.texi (Machine Constraints): Document the 'S' constraints
19204 2021-07-13 Richard Biener <rguenther@suse.de>
19207 2021-07-12 Richard Biener <rguenther@suse.de>
19209 * tree-vect-slp.c (vect_slp_region): Show the number of
19210 SLP graph entries in the optimization message.
19212 2021-07-13 Michael Meissner <meissner@linux.ibm.com>
19214 * config/rs6000/altivec.md (xxspltiw_v4sf): Change local variable
19216 * config/rs6000/rs6000-protos.h (rs6000_const_f32_to_i32): Change
19217 return type to long.
19218 * config/rs6000/rs6000.c (rs6000_const_f32_to_i32): Change return
19221 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
19223 * gimple-range-fold.cc (fold_using_range::range_of_builtin_ubsan_call):
19224 Query relation between the 2 operands and use it.
19226 2021-07-12 Sergei Trofimovich <siarheit@google.com>
19228 * doc/cfg.texi: Fix s/ei_safe_safe/ei_safe_edge/ typo.
19230 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
19233 * config/i386/predicates.md (vec_setm_sse41_operand):
19234 Rename from vec_setm_operand.
19235 (vec_setm_avx2_operand): New predicate.
19236 * config/i386/sse.md (vec_set<V_128:mode>): Use V_128 mode iterator.
19237 Use vec_setm_sse41_operand as operand 2 predicate.
19238 (vec_set<V_256_512:mode>): New expander.
19239 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_sse41_operand
19240 as operand 2 predicate.
19242 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
19244 PR tree-optimization/101335
19245 * range-op.cc (operator_cast::lhs_op1_relation): Delete.
19247 2021-07-12 Andrew Pinski <apinski@marvell.com>
19249 * tree-ssa-phiopt.c (match_simplify_replacement): Move
19250 insert of the sequence before the movement of the
19251 statement. Check to see if the statement is used
19252 outside of the original phi to see if we should move it.
19254 2021-07-12 Richard Biener <rguenther@suse.de>
19256 * dump-context.h (debug_dump_context::debug_dump_context):
19257 Add FILE * parameter defaulted to stderr.
19258 * dumpfile.c (debug_dump_context::debug_dump_context): Adjust.
19259 * tree-vect-slp.c (dot_slp_tree): New functions.
19261 2021-07-12 Richard Biener <rguenther@suse.de>
19263 PR tree-optimization/101373
19264 * tree-ssa-pre.c (prune_clobbered_mems): Also prune trapping
19265 references when the BB may not return.
19266 (compute_avail): Pass in the function we're working on and
19267 replace cfun references with it. Externally throwing
19268 const calls also possibly terminate the function.
19269 (pass_pre::execute): Pass down the function we're working on.
19270 * gcse.c (compute_hash_table_work): Externally throwing
19271 const/pure calls also need record_last_mem_set_info.
19272 * postreload-gcse.c (record_opr_changes): Looping or externally
19273 throwing const/pure calls also need record_last_mem_set_info.
19275 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
19277 * recog.c (memory_address_addr_space_p): Change the type to bool.
19278 Return true/false instead of 1/0.
19279 (offsettable_memref_p): Ditto.
19280 (offsettable_nonstrict_memref_p): Ditto.
19281 (offsettable_address_addr_space_p): Ditto.
19282 Change the type of addressp indirect function to bool.
19283 * recog.h (memory_address_addr_space_p): Change the type to bool.
19284 (strict_memory_address_addr_space_p): Ditto.
19285 (offsettable_memref_p): Ditto.
19286 (offsettable_nonstrict_memref_p): Ditto.
19287 (offsettable_address_addr_space_p): Ditto.
19288 * reload.c (maybe_memory_address_addr_space_p): Ditto.
19289 (strict_memory_address_addr_space_p): Change the type to bool.
19290 Return true/false instead of 1/0.
19291 (maybe_memory_address_addr_space_p): Change the type to bool.
19293 2021-07-12 Richard Biener <rguenther@suse.de>
19295 * tree-vect-slp.c (vect_slp_region): Show the number of
19296 SLP graph entries in the optimization message.
19298 2021-07-12 Richard Biener <rguenther@suse.de>
19300 PR tree-optimization/101394
19301 * tree-ssa-pre.c (do_pre_regular_insertion): Avoid inserting
19302 copies from abnormals for a full redundancy.
19304 2021-07-12 Richard Biener <rguenther@suse.de>
19306 PR middle-end/101423
19307 * gimple.c (gimple_could_trap_p_1): Internal function calls
19309 * tree-eh.c (tree_could_trap_p): Likewise.
19311 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
19314 * config/arm/arm_neon.h (vmul_n_u32): Replace call to builtin with
19316 (vmulq_n_u32): Likewise.
19317 (vmul_n_f32): Gate __a * __b on __FAST_MATH__.
19318 (vmulq_n_f32): Likewise.
19319 (vmul_n_f16): Likewise.
19320 (vmulq_n_f16): Likewise.
19322 2021-07-12 Martin Liska <mliska@suse.cz>
19324 PR sanitizer/101425
19325 * gcc.c (check_offload_target_name): Call
19326 candidates_list_and_hint only if we have a candidate.
19328 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
19331 * config/arm/neon.md (vec_init): Move to ...
19332 * config/arm/vec-common.md (vec_init): ... here.
19333 Change the pattern's mode to VDQX and gate it on VALID_MVE_MODE.
19335 2021-07-12 Roger Sayle <roger@nextmovesoftware.com>
19337 PR tree-optimization/101403
19338 * match.pd ((T)bswap(X)>>C): Correctly handle cases where
19339 signedness of the shift is not the same as the signedness of
19340 the type extension.
19342 2021-07-09 Roger Sayle <roger@nextmovesoftware.com>
19343 Uroš Bizjak <ubizjak@gmail.com>
19345 * config/i386/i386.md (*divmodsi4_const): Optimize SImode
19346 divmod of a constant numerator with new define_insn_and_split.
19348 2021-07-09 Iain Sandoe <iain@sandoe.co.uk>
19351 * config/i386/i386-expand.c (ix86_expand_call): If a call is
19352 to a non-local-binding, or local but to a public symbol, then
19353 assume that it might be indirected via the lazy symbol binder.
19354 Mark R10 and R11 as clobbered in that case.
19356 2021-07-09 Eric Botcazou <ebotcazou@adacore.com>
19359 * gcc.c (ASM_DEBUG_DWARF_OPTION): Set again to --gdwarf2 in
19360 the case where HAVE_AS_WORKING_DWARF_N_FLAG is not defined
19361 and HAVE_LD_BROKEN_PE_DWARF5 is defined.
19363 2021-07-09 Uroš Bizjak <ubizjak@gmail.com>
19365 * config/i386/i386.md (*udivmodsi4_pow2_zext_1): Limit the
19366 log2 range of operands[3] to [1,31].
19367 (*udivmodsi4_pow2_zext_2): Ditto. Correct insn RTX pattern.
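As an illustration of the arithmetic behind the power-of-two udivmod patterns above, here is a minimal C sketch. The helper names `udiv_pow2` and `umod_pow2` are hypothetical and are not GCC code; they only show that unsigned division and remainder by 2**n reduce to a shift and a mask. The asserted [1,31] range mirrors the one named in the entry; in C, a shift count of 32 on a 32-bit type would be undefined.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: x / 2**log2 for unsigned x, as a right shift.  */
static uint32_t udiv_pow2 (uint32_t x, unsigned log2)
{
  assert (log2 >= 1 && log2 <= 31);  /* same range as in the entry above */
  return x >> log2;
}

/* Hypothetical helper: x % 2**log2 for unsigned x, as a bit mask.  */
static uint32_t umod_pow2 (uint32_t x, unsigned log2)
{
  assert (log2 >= 1 && log2 <= 31);
  return x & ((UINT32_C (1) << log2) - 1);
}
```

For instance, `udiv_pow2 (1000, 3)` and `umod_pow2 (1000, 3)` give the same results as `1000u / 8u` and `1000u % 8u`.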
19369 2021-07-09 Sergei Trofimovich <siarheit@google.com>
19371 * doc/md.texi: Don't split @smallexample in multiple @groups.
19373 2021-07-09 Sergei Trofimovich <siarheit@google.com>
19375 * doc/md.texi: Add missing 'see' word.
19377 2021-07-09 Andrew Pinski <apinski@marvell.com>
19379 * tree-ssa-phiopt.c (phiopt_early_allow): Change arguments
19380 to take sequence and gimple_match_op. Accept the case where
19381 op is a SSA_NAME and one statement in the sequence.
19382 Also allow constants.
19383 (gimple_simplify_phiopt): Always pass a sequence to resimplify.
19384 Update call to phiopt_early_allow. Discard the sequence if not
19387 2021-07-09 Xi Ruoyao <xry111@mengyan1223.wang>
19392 * config/mips/mips.c (mips_const_insns): Use MSA_SUPPORTED_MODE_P
19393 instead of ISA_HAS_MSA.
19394 (mips_expand_vec_unpack): Likewise.
19395 (mips_expand_vector_init): Likewise.
19397 2021-07-09 Kewen Lin <linkw@linux.ibm.com>
19399 * config/rs6000/vsx.md (mods_<mode>): Rename to...
19400 (mod<mode>3): ... this.
19401 (modu_<mode>): Rename to...
19402 (umod<mode>3): ... this.
19403 * config/rs6000/rs6000-builtin.def (MODS_V2DI, MODS_V4SI, MODU_V2DI,
19404 MODU_V4SI): Adjust.
19406 2021-07-08 Jeff Law <jeffreyalaw@gmail.com>
19408 * config/h8300/shiftrotate.md (variable shifts): Expose condition
19409 code handling for the test before the loop.
19411 2021-07-08 Martin Jambor <mjambor@suse.cz>
19414 * ipa-sra.c (class isra_call_summary): New member
19415 m_before_any_store, initialize it in the constructor.
19416 (isra_call_summary::dump): Dump the new field.
19417 (ipa_sra_call_summaries::duplicate): Copy it.
19418 (process_scan_results): Set it.
19419 (isra_write_edge_summary): Stream it.
19420 (isra_read_edge_summary): Likewise.
19421 (param_splitting_across_edge): Only override
19422 safe_to_import_accesses if m_before_any_store is set.
19424 2021-07-08 Martin Sebor <msebor@redhat.com>
19426 PR bootstrap/101374
19427 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref):
19428 Use Object Size Type 0 instead of 1.
19430 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
19432 * tree-vect-loop.c (vectorizable_reduction): Remove always-true
19435 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
19437 * match.pd: Simplify an extend-operate-truncate sequence involving
19440 2021-07-08 Roger Sayle <roger@nextmovesoftware.com>
19441 Richard Biener <rguenther@suse.de>
19443 PR tree-optimization/40210
19444 * match.pd (bswap optimizations): Simplify (bswap(x)>>C1)&C2 as
19445 (x>>C3)&C2 when possible. Simplify bswap(x)>>C1 as ((T)x)>>C2
19446 when possible. Simplify bswap(x)&C1 as (x>>C2)&C1 when 0<=C1<=255.
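The three bswap identities named in this entry can be checked on concrete 32-bit values. The sketch below is not taken from match.pd; `bswap32` is a portable stand-in for `__builtin_bswap32`, `check_bswap_identities` is a hypothetical helper, and the constants are one example instance of each identity chosen for illustration.

```c
#include <assert.h>
#include <stdint.h>

/* Portable 32-bit byte swap, standing in for __builtin_bswap32.  */
static uint32_t bswap32 (uint32_t x)
{
  return (x >> 24) | ((x >> 8) & 0x0000ff00u)
	 | ((x << 8) & 0x00ff0000u) | (x << 24);
}

/* Hypothetical helper: assert one instance of each identity for X.  */
static void check_bswap_identities (uint32_t x)
{
  /* bswap(x) & C1 == (x >> C2) & C1, with C1 = 0xff and C2 = 24.  */
  assert ((bswap32 (x) & 0xffu) == ((x >> 24) & 0xffu));
  /* (bswap(x) >> C1) & C2 == (x >> C3) & C2, with C1 = 8, C2 = 0xff,
     C3 = 16.  */
  assert (((bswap32 (x) >> 8) & 0xffu) == ((x >> 16) & 0xffu));
  /* bswap(x) >> C1 == ((T) x) >> C2, with C1 = 24, T = uint8_t, C2 = 0.  */
  assert ((bswap32 (x) >> 24) == (uint32_t) ((uint8_t) x >> 0));
}
```

In each case the right-hand side reads the wanted byte of `x` directly, so the byte swap itself is no longer needed.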
19448 2021-07-08 Uroš Bizjak <ubizjak@gmail.com>
19451 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
19453 * config/i386/mmx.md (V_32): New mode iterator.
19454 (mov<V_32:mode>): Use V_32 mode iterator.
19455 (*mov<V_32:mode>_internal): Ditto.
19456 (*push<V_32:mode>2_rex64): Ditto.
19457 (*push<V_32:mode>2): Ditto.
19458 (movmisalign<V_32:mode>): Ditto.
19459 (mmx_<any_shiftrt:insn>v1si3): New insn pattern.
19460 (sse4_1_<any_extend:code>v2qiv2hi2): Ditto.
19461 (vec_unpacks_lo_v4qi): New expander.
19462 (vec_unpacks_hi_v4qi): Ditto.
19463 (vec_unpacku_lo_v4qi): Ditto.
19464 (vec_unpacku_hi_v4qi): Ditto.
19465 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add V1SImode.
19466 (VALID_INT_MODE_P): Ditto.
19468 2021-07-08 Michael Meissner <meissner@linux.ibm.com>
19471 * config/rs6000/rs6000.md (udivti3): New insn.
19472 (divti3): New insn.
19473 (umodti3): New insn.
19474 (modti3): New insn.
19476 2021-07-07 Martin Sebor <msebor@redhat.com>
19478 PR tree-optimization/100137
19479 PR tree-optimization/99121
19480 PR tree-optimization/97027
19481 * builtins.c (access_ref::access_ref): Also set offmax.
19482 (access_ref::offset_in_range): Define new function.
19483 (access_ref::add_offset): Set offmax.
19484 (access_ref::inform_access): Handle access_none.
19485 (handle_mem_ref): Clear ostype.
19486 (compute_objsize_r): Handle ASSERT_EXPR.
19487 * builtins.h (struct access_ref): Add offmax member.
19488 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Use
19489 compute_objsize() and simplify.
19491 2021-07-07 Peter Bergner <bergner@linux.ibm.com>
19493 * config/rs6000/rs6000-call.c (mma_init_builtins): Use VSX_BUILTIN_LXVP
19494 and VSX_BUILTIN_STXVP.
19496 2021-07-07 Martin Sebor <msebor@redhat.com>
19499 * config/aarch64/aarch64.c (aarch64_simd_lane_bounds): Remove
19500 a stray %K from error_at() missed in r12-2088.
19502 2021-07-07 Richard Biener <rguenther@suse.de>
19504 PR tree-optimization/99728
19505 * tree-ssa-loop-im.c (gather_mem_refs_stmt): Record
19507 (mem_refs_may_alias_p): Add assert we handled aggregate
19509 (sm_seq_valid_bb): Give up when running into aggregate copies.
19510 (ref_indep_loop_p): Handle aggregate copies as never
19511 being invariant themselves but allow other refs to be
19512 disambiguated against them.
19513 (can_sm_ref_p): Do not try to apply store-motion to aggregate
19516 2021-07-06 Indu Bhagat <indu.bhagat@oracle.com>
19519 * dwarf2ctf.c (ctf_get_AT_data_member_location): Multiply by 8 to get
19522 2021-07-06 Martin Sebor <msebor@redhat.com>
19524 * gimple-pretty-print.c (percent_G_format): Remove.
19525 * tree-diagnostic.c (default_tree_printer): Remove calls.
19526 * tree-pretty-print.c (percent_K_format): Remove.
19527 * tree-pretty-print.h (percent_K_format): Remove.
19529 2021-07-06 Martin Sebor <msebor@redhat.com>
19531 * config/aarch64/aarch64-builtins.c (aarch64_simd_expand_builtin):
19532 Remove %K and use error_at.
19533 (aarch64_expand_fcmla_builtin): Same.
19534 (aarch64_expand_builtin_tme): Same.
19535 (aarch64_expand_builtin_memtag): Same.
19536 * config/arm/arm-builtins.c (arm_expand_acle_builtin): Same.
19537 (arm_expand_builtin): Same.
19538 * config/arm/arm.c (bounds_check): Same.
19540 2021-07-06 Martin Sebor <msebor@redhat.com>
19542 * builtins.c (warn_string_no_nul): Remove %G.
19543 (maybe_warn_for_bound): Same.
19544 (warn_for_access): Same.
19545 (check_access): Same.
19546 (check_strncat_sizes): Same.
19547 (expand_builtin_strncat): Same.
19548 (expand_builtin_strncmp): Same.
19549 (expand_builtin): Same.
19550 (expand_builtin_object_size): Same.
19551 (warn_dealloc_offset): Same.
19552 (maybe_emit_free_warning): Same.
19553 * calls.c (maybe_warn_alloc_args_overflow): Same.
19554 (maybe_warn_nonstring_arg): Same.
19555 (maybe_warn_rdwr_sizes): Same.
19556 * expr.c (expand_expr_real_1): Remove %K.
19557 * gimple-fold.c (gimple_fold_builtin_strncpy): Remove %G.
19558 (gimple_fold_builtin_strncat): Same.
19559 * gimple-ssa-sprintf.c (format_directive): Same.
19560 (handle_printf_call): Same.
19561 * gimple-ssa-warn-alloca.c (pass_walloca::execute): Same.
19562 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
19563 (maybe_diag_access_bounds): Same. Call gimple_location.
19564 (check_bounds_or_overlap): Same.
19565 * trans-mem.c (ipa_tm_scan_irr_block): Remove %K. Simplify.
19566 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Remove %G.
19567 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
19568 (maybe_diag_stxncpy_trunc): Same.
19569 (handle_builtin_stxncpy_strncat): Same.
19570 (maybe_warn_pointless_strcmp): Same.
19571 * tree-ssa-uninit.c (maybe_warn_operand): Same.
19573 2021-07-06 Uroš Bizjak <ubizjak@gmail.com>
19576 * config/i386/predicates.md (vec_setm_operand): Enable
19577 register_operand for TARGET_SSE4_1.
19578 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_operand
19579 as operand 2 predicate. Call ix86_expand_vector_set_var
19580 for non-constant index operand.
19581 (vec_setv4qi): Use vec_setm_mmx_operand as operand 2 predicate.
19582 Call ix86_expand_vector_set_var for non-constant index operand.
19584 2021-07-06 Jeff Law <jeffreyalaw@gmail.com>
19586 * config/h8300/jumpcall.md (*branch): When possible, generate
19587 the comparison in CCZN mode.
19588 * config/h8300/predicates.md (simple_memory_operand): Reject all
19589 auto-increment addressing modes.
19591 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
19593 PR bootstrap/100246
19594 * config/i386/i386.h (struct stringop_algs): Define a CTOR for
19597 2021-07-06 Richard Biener <rguenther@suse.de>
19599 * doc/md.texi (vec_fmaddsub<mode>4): Document.
19600 (vec_fmsubadd<mode>4): Likewise.
19601 * optabs.def (vec_fmaddsub$a4): Add.
19602 (vec_fmsubadd$a4): Likewise.
19603 * internal-fn.def (IFN_VEC_FMADDSUB): Add.
19604 (IFN_VEC_FMSUBADD): Likewise.
19605 * tree-vect-slp-patterns.c (addsub_pattern::recognize):
19606 Refactor to handle IFN_VEC_FMADDSUB and IFN_VEC_FMSUBADD.
19607 (addsub_pattern::build): Likewise.
19608 * tree-vect-slp.c (vect_optimize_slp): CFN_VEC_FMADDSUB
19609 and CFN_VEC_FMSUBADD are not transparent for permutes.
19610 * config/i386/sse.md (vec_fmaddsub<mode>4): New expander.
19611 (vec_fmsubadd<mode>4): Likewise.
19613 2021-07-06 Richard Biener <rguenther@suse.de>
19615 * doc/invoke.texi (fmove-loop-stores): Document.
19616 * common.opt (fmove-loop-stores): New option.
19617 * opts.c (default_options_table): Enable -fmove-loop-stores
19618 at -O1 but not -Og.
19619 * tree-ssa-loop-im.c (pass_lim::execute): Pass
19620 flag_move_loop_stores instead of true to
19621 loop_invariant_motion_in_fun.
19623 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
19625 * doc/install.texi: Document --with-dsymutil.
19627 2021-07-06 Andrew Pinski <apinski@marvell.com>
19629 PR tree-optimization/101256
19630 * dbgcnt.def (phiopt_edge_range): New counter.
19631 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
19632 Check to make sure the new name is defined in the same
19633 bb as the conditional before duplicating range info.
19634 Also add debug counter.
19636 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
19638 PR rtl-optimization/100328
19639 * config/i386/i386-options.c (ix86_option_override_internal):
19640 Set param_ira_consider_dup_in_all_alts to 0.
19642 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
19644 PR rtl-optimization/100328
19645 * doc/invoke.texi (ira-consider-dup-in-all-alts): Document new
19647 * ira.c (ira_get_dup_out_num): Adjust as parameter
19648 param_ira_consider_dup_in_all_alts.
19649 * params.opt (ira-consider-dup-in-all-alts): New.
19650 * ira-conflicts.c (process_regs_for_copy): Add one parameter
19651 single_input_op_has_cstr_p.
19652 (get_freq_for_shuffle_copy): New function.
19653 (add_insn_allocno_copies): Adjust as single_input_op_has_cstr_p.
19654 * ira-int.h (ira_get_dup_out_num): Add one bool parameter.
19656 2021-07-05 Jeff Law <jeffreyalaw@gmail.com>
19658 * config/h8300/shiftrotate.md (shift-by-variable patterns): Update to
19659 generate condition code aware RTL directly.
19661 2021-07-05 Andrew Pinski <apinski@marvell.com>
19663 PR tree-optimization/101039
19664 * match.pd (A CMP 0 ? A : -A): New patterns.
19665 * tree-ssa-phiopt.c (abs_replacement): Delete function.
19666 (tree_ssa_phiopt_worker): Don't call abs_replacement.
19667 Update comment about abs_replacement.
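For reference, the value of each "A CMP 0 ? A : -A" form can be stated in plain C. This is a sketch of the arithmetic only, with hypothetical function names; it does not claim to list exactly which comparisons the new match.pd patterns cover. It assumes A is not INT_MIN, since negating INT_MIN overflows.

```c
#include <assert.h>
#include <stdlib.h>

static int select_ge (int a) { return a >= 0 ? a : -a; }  /* abs (a) */
static int select_lt (int a) { return a < 0 ? a : -a; }   /* -abs (a) */
static int select_eq (int a) { return a == 0 ? a : -a; }  /* -a */
static int select_ne (int a) { return a != 0 ? a : -a; }  /* a */

/* Hypothetical helper: check all four forms for A, A != INT_MIN.  */
static void check_select_forms (int a)
{
  assert (select_ge (a) == abs (a));
  assert (select_lt (a) == -abs (a));
  assert (select_eq (a) == -a);
  assert (select_ne (a) == a);
}
```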
19669 2021-07-05 Andrew Pinski <apinski@marvell.com>
19671 * tree-ssa-phiopt.c (gimple_simplify_phiopt):
19672 If "A ? B : C" fails to simplify, try "(!A) ? C : B".
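The retry described in this entry relies on a simple equivalence: inverting the condition and swapping the arms leaves the selected value unchanged. A minimal C sketch, using hypothetical function names:

```c
#include <assert.h>
#include <stdbool.h>

/* A ? B : C.  */
static int select_direct (bool a, int b, int c) { return a ? b : c; }

/* (!A) ? C : B, which selects the same value.  */
static int select_inverted (bool a, int b, int c) { return !a ? c : b; }

/* Hypothetical helper: both forms agree for either value of A.  */
static void check_inverted_select (int b, int c)
{
  assert (select_direct (true, b, c) == select_inverted (true, b, c));
  assert (select_direct (false, b, c) == select_inverted (false, b, c));
}
```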
19674 2021-07-05 Andrew Pinski <apinski@marvell.com>
19676 * tree-ssa-phiopt.c (match_simplify_replacement):
19677 Add early_p argument. Call gimple_simplify_phiopt
19678 instead of gimple_simplify.
19679 (tree_ssa_phiopt_worker): Update call to
19680 match_simplify_replacement and allow unconditionally.
19681 (phiopt_early_allow): New function.
19682 (gimple_simplify_phiopt): New function.
19684 2021-07-05 Andrew Pinski <apinski@marvell.com>
19686 PR middle-end/101237
19687 * fold-const.c (negate_expr_p): Remove call to element_mode
19688 and TREE_MODE/TREE_TYPE when calling HONOR_SIGNED_ZEROS,
19689 HONOR_SIGN_DEPENDENT_ROUNDING, and HONOR_SNANS.
19690 (fold_negate_expr_1): Likewise.
19691 (const_unop): Likewise.
19692 (fold_cond_expr_with_comparison): Likewise.
19693 (fold_binary_loc): Likewise.
19694 (fold_ternary_loc): Likewise.
19695 (tree_call_nonnegative_warnv_p): Likewise.
19696 * match.pd (-(A + B) -> (-B) - A): Likewise.
19698 2021-07-05 Iain Sandoe <iain@sandoe.co.uk>
19700 * configure.ac: Handle --with-dsymutil in the same way as we
19701 do for the assembler and linker. (DEFAULT_DSYMUTIL): New.
19702 Extract the type and version for the dsymutil configured or
19703 found by the default searches.
19704 * config.in: Regenerated.
19705 * configure: Regenerated.
19706 * collect2.c (do_dsymutil): Handle locating dsymutil in the
19707 same way as for the assembler and linker.
19708 * config/darwin.h (DSYMUTIL): Delete.
19709 * gcc.c: Report a configured dsymutil correctly.
19710 * exec-tool.in: Allow for dsymutil.
19712 2021-07-05 Uroš Bizjak <ubizjak@gmail.com>
19714 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
19715 Handle V4QI and V2HI modes.
19716 (expand_vec_perm_blend): Allow 4-byte vector modes with TARGET_SSE4_1.
19717 Handle V4QI mode. Emit mmx_pblendvb32 for 4-byte modes.
19718 (expand_vec_perm_pshufb): Rewrite to use switch statements.
19719 Handle 4-byte dual operands with TARGET_XOP and single operands
19720 with TARGET_SSSE3. Emit mmx_ppermv32 for TARGET_XOP and
19721 mmx_pshufbv4qi3 for TARGET_SSSE3.
19722 (expand_vec_perm_pblendv): Allow 4-byte vector modes with TARGET_SSE4_1.
19723 (expand_vec_perm_interleave2): Allow 4-byte vector modes.
19724 (expand_vec_perm_pshufb2): Allow 4-byte vector modes with TARGET_SSSE3.
19725 (expand_vec_perm_even_odd_1): Handle V4QI mode.
19726 (expand_vec_perm_broadcast_1): Handle V4QI mode.
19727 (ix86_vectorize_vec_perm_const): Handle V4QI mode.
19728 * config/i386/mmx.md (mmx_ppermv32): New insn pattern.
19729 (mmx_pshufbv4qi3): Ditto.
19730 (*mmx_pblendw32): Ditto.
19731 (*mmx_pblendw64): Rename from *mmx_pblendw.
19732 (mmx_punpckhbw_low): New insn_and_split pattern.
19733 (mmx_punpcklbw_low): Ditto.
19735 2021-07-05 Richard Biener <rguenther@suse.de>
19737 * tree-vect-loop-manip.c (vect_loop_versioning): Do not
19738 set LOOP_C_INFINITE on the vectorized loop.
19740 2021-07-05 Richard Biener <rguenther@suse.de>
19742 PR middle-end/101291
19743 * cfgloopmanip.c (loop_version): Set the loop copy of the
19744 versioned loop to the new loop.
19746 2021-07-04 Iain Sandoe <iain@sandoe.co.uk>
19749 * config.gcc: Ensure that Darwin biarch definitions are
19750 added before i386.h.
19751 * config/i386/darwin.h (TARGET_64BIT): Remove.
19752 (PR80556_WORKAROUND): New.
19753 (REAL_LIBGCC_SPEC): Amend to use PR80556_WORKAROUND.
19754 (DARWIN_SUBARCH_SPEC): New.
19755 * config/i386/darwin32-biarch.h (TARGET_64BIT_DEFAULT,
19756 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
19757 (REAL_LIBGCC_SPEC): Remove.
19758 * config/i386/darwin64-biarch.h (TARGET_64BIT_DEFAULT,
19759 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
19760 (REAL_LIBGCC_SPEC): Remove.
19762 2021-07-03 H.J. Lu <hjl.tools@gmail.com>
19764 PR middle-end/101294
19765 * expr.c (store_constructor): Don't use vec_duplicate on vector.
19767 2021-07-02 Martin Sebor <msebor@redhat.com>
19769 PR middle-end/98871
19770 PR middle-end/98512
19771 * diagnostic.c (get_any_inlining_info): New.
19772 (update_effective_level_from_pragmas): Handle inlining context.
19773 (diagnostic_enabled): Same.
19774 (diagnostic_report_diagnostic): Same.
19775 * diagnostic.h (struct diagnostic_info): Add ctor.
19776 (struct diagnostic_context): Add new member.
19777 * tree-diagnostic.c (set_inlining_locations): New.
19778 (tree_diagnostics_defaults): Set new callback pointer.
19780 2021-07-02 Peter Bergner <bergner@linux.ibm.com>
19782 * config/rs6000/rs6000-builtin.def (BU_MMA_PAIR_LD, BU_MMA_PAIR_ST):
19784 (__builtin_vsx_lxvp, __builtin_vsx_stxvp): New built-ins.
19785 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Expand
19786 lxvp and stxvp built-ins.
19787 (mma_init_builtins): Handle lxvp and stxvp built-ins.
19788 (builtin_function_type): Likewise.
19789 * doc/extend.texi (__builtin_vsx_lxvp, __builtin_vsx_stxvp): Document.
19791 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19793 * config/h8300/h8300-protos.h (compute_a_shift_cc): Accept
19794 additional argument for the code.
19795 * config/h8300/h8300.c (compute_a_shift_cc): Accept additional
19796 argument for the code. Just return if the ZN bits are useful or
19797 not rather than the old style CC_* enums.
19798 * config/h8300/shiftrotate.md (shiftqi_noscratch): Move before
19799 more generic shiftqi patterns.
19800 (shifthi_noscratch, shiftsi_noscratch): Similarly.
19801 (shiftqi_noscratch_set_flags): New pattern.
19802 (shifthi_noscratch_set_flags, shiftsi_noscratch_set_flags): Likewise.
19804 2021-07-02 Andrew MacLeod <amacleod@redhat.com>
19806 PR tree-optimization/101223
19807 * range-op.cc (build_lt): Add -1 for signed values.
19808 (build_gt): Subtract -1 for signed values.
19810 2021-07-02 David Faust <david.faust@oracle.com>
19812 * btfout.c (get_btf_kind): Support BTF_KIND_FLOAT.
19813 (btf_asm_type): Likewise.
19815 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19817 * config/h8300/h8300-protos.h (output_a_shift): Make first argument
19818 an array of rtx rather than a pointer to rtx. Add code argument.
19819 (compute_a_shift_length): Similarly.
19820 * config/h8300/h8300.c (h8300_shift_costs): Adjust now that the
19821 shift itself isn't an operand. Create dummy operand[0] to carry
19822 a mode and pass a suitable rtx code to compute_a_shift_length.
19823 (get_shift_alg): Adjust operand number of clobber in output templates.
19824 (output_a_shift): Make first argument an array of rtx rather than
19825 a pointer to rtx. Add code argument for the type of shift.
19826 Adjust now that the shift itself is no longer an operand.
19827 (compute_a_shift_length): Similarly.
19828 * config/h8300/shiftrotate.md (shiftqi, shifthi, shiftsi): Use an
19829 iterator rather than nshift_operator.
19830 (shiftqi_noscratch, shifthi_noscratch, shiftsi_noscratch): Likewise.
19831 (shiftqi_clobber_flags): Adjust to API changes in output_a_shift
19832 and compute_a_shift_length.
19833 (shiftqi_noscratch_clobber_flags): Likewise.
19834 (shifthi_noscratch_clobber_flags): Likewise.
19835 (shiftsi_noscratch_clobber_flags): Likewise.
19837 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
19840 * config/darwin.h (DSYMUTIL_SPEC): Do not try to run
19841 dsymutil for BTF/CTF.
19843 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
19846 * config/darwin.h (CTF_INFO_SECTION_NAME): Update the
19847 segment to include BTF.
19848 (BTF_INFO_SECTION_NAME): New.
19850 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19852 * config/m32r/m32r-protos.h (call_operand): Adjust return type.
19853 (small_data_operand, memreg_operand, small_insn_p): Likewise.
19854 * config/m32r/m32r.c (call_operand): Adjust return type.
19855 (small_data_operand, memreg_operand): Likewise.
19857 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19859 * config/frv/frv-protos.h (integer_register_operand): Adjust return
19861 (frv_load_operand, gpr_or_fpr_operand, gpr_no_subreg_operand): Likewise.
19862 (fpr_or_int6_operand, gpr_or_int_operand): Likewise.
19863 (gpr_or_int12_operand, gpr_or_int10_operand): Likewise.
19864 (move_source_operand, move_destination_operand): Likewise.
19865 (condexec_source_operand, condexec_dest_operand): Likewise.
19866 (lr_operand, gpr_or_memory_operand, fpr_or_memory_operand): Likewise.
19867 (reg_or_0_operand, fcc_operand, icc_operand, cc_operand): Likewise.
19868 (fcr_operand, icr_operand, cr_operand, call_operand): Likewise.
19869 (fpr_operand, even_reg_operand, odd_reg_operand): Likewise.
19870 (even_gpr_operand, odd_gpr_operand, quad_fpr_operand): Likewise.
19871 (even_fpr_operand, odd_fpr_operand): Likewise.
19872 (dbl_memory_one_insn_operand, dbl_memory_two_insn_operand): Likewise.
19873 (int12_operand, int6_operand, int5_operand, uint5_operand): Likewise.
19874 (uint4_operand, uint1_operand, int_2word_operand): Likewise.
19875 (upper_int16_operand, uint16_operand, symbolic_operand): Likewise.
19876 (relational_operator, float_relational_operator): Likewise.
19877 (ccr_eqne_operator, minmax_operator): Likewise.
19878 (condexec_si_binary_operator, condexec_si_media_operator): Likewise.
19879 (condexec_si_divide_operator, condexec_si_unary_operator): Likewise.
19880 (condexec_sf_conv_operator, condexec_sf_add_operator): Likewise.
19881 (intop_compare_operator, acc_operand, even_acc_operand): Likewise.
19882 (quad_acc_operand, accg_operand): Likewise.
19884 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19886 * config/stormy16/stormy16-protos.h (xstormy16_below_100_symbol): Change
19887 return type to a bool.
19888 (nonimmediate_nonstack_operand): Likewise.
19889 (xstormy16_splittable_below100_operand): Likewise.
19890 * config/stormy16/stormy16.c (xstormy16_below_100_symbol): Fix
19892 (xstormy16_splittable_below100_operand): Likewise.
19894 2021-07-02 Richard Biener <rguenther@suse.de>
19896 PR tree-optimization/101293
19897 * tree-ssa-loop-im.c (mem_ref_hasher::equal): Compare MEM_REF bases
19898 with combined offsets.
19899 (gather_mem_refs_stmt): Hash MEM_REFs as if their offset were
19900 combined with the rest of the offset.
19902 2021-07-02 Eric Botcazou <ebotcazou@adacore.com>
19904 * config/i386/i386.c (asm_preferred_eh_data_format): Always use the
19905 PIC encodings for PE-COFF targets.
19907 2021-07-02 Jakub Jelinek <jakub@redhat.com>
19910 * config/i386/i386-expand.c (ix86_broadcast_from_integer_constant):
19911 Return nullptr for TImode inner mode.
19913 2021-07-02 Richard Biener <rguenther@suse.de>
19915 PR tree-optimization/101280
19916 PR tree-optimization/101173
19917 * gimple-loop-interchange.cc
19918 (tree_loop_interchange::valid_data_dependences): Properly
19919 guard all dependence checks with DDR_REVERSED_P or its
19922 2021-07-02 Hongyu Wang <hongyu.wang@intel.com>
19924 * config/i386/i386-expand.c (ix86_expand_builtin):
19925 Add branch to clear odata when ZF is set for aesdecenc_expand
19926 and wideaesdecenc_expand.
19928 2021-07-02 Eugene Rozenfeld <erozen@microsoft.com>
19930 * config/i386/gcc-auto-profile: Regenerate.
19932 2021-07-02 liuhongt <hongtao.liu@intel.com>
19934 * config/i386/sse.md (trunc<mode><pmov_dst_4>2): Refined to ...
19935 (trunc<mode><pmov_dst_4_lower>2): ... this.
19937 2021-07-01 David Malcolm <dmalcolm@redhat.com>
19939 * diagnostic.h (diagnostic_context::m_file_cache): New field.
19940 * input.c (class fcache): Rename to...
19941 (class file_cache_slot): ...this, making most members private and
19942 prefixing fields with "m_".
19943 (file_cache_slot::get_file_path): New accessor.
19944 (file_cache_slot::get_use_count): New accessor.
19945 (file_cache_slot::missing_trailing_newline_p): New accessor.
19946 (file_cache_slot::inc_use_count): New.
19947 (fcache_buffer_size): Move to...
19948 (file_cache_slot::buffer_size): ...here.
19949 (fcache_line_record_size): Move to...
19950 (file_cache_slot::line_record_size): ...here.
19951 (fcache_tab): Delete, in favor of global_dc->m_file_cache.
19952 (fcache_tab_size): Move to file_cache::num_file_slots.
19953 (diagnostic_file_cache_init): Update for move of fcache_tab
19954 to global_dc->m_file_cache.
19955 (diagnostic_file_cache_fini): Likewise.
19956 (lookup_file_in_cache_tab): Convert to...
19957 (file_cache::lookup_file): ...this.
19958 (diagnostics_file_cache_forcibly_evict_file): Update for move of
19959 fcache_tab to global_dc->m_file_cache, moving most of
19960 implementation to...
19961 (file_cache::forcibly_evict_file): ...this new function and...
19962 (file_cache_slot::evict): ...this new function.
19963 (evicted_cache_tab_entry): Convert to...
19964 (file_cache::evicted_cache_tab_entry): ...this.
19965 (add_file_to_cache_tab): Convert to...
19966 (file_cache::add_file): ...this, moving bulk of implementation
19968 (file_cache_slot::create): ...this new function.
19969 (file_cache::file_cache): New.
19970 (file_cache::~file_cache): New.
19971 (lookup_or_add_file_to_cache_tab): Convert to...
19972 (file_cache::lookup_or_add_file): ...this new function.
19973 (fcache::fcache): Rename to...
19974 (file_cache_slot::file_cache_slot): ...this, adding "m_" prefixes
19976 (fcache::~fcache): Rename to...
19977 (file_cache_slot::~file_cache_slot): ...this, adding "m_" prefixes
19979 (needs_read): Convert to...
19980 (file_cache_slot::needs_read_p): ...this.
19981 (needs_grow): Convert to...
19982 (file_cache_slot::needs_grow_p): ...this.
19983 (maybe_grow): Convert to...
19984 (file_cache_slot::maybe_grow): ...this.
19985 (read_data): Convert to...
19986 (file_cache_slot::read_data): ...this.
19987 (maybe_read_data): Convert to...
19988 (file_cache_slot::maybe_read_data): ...this.
19989 (get_next_line): Convert to...
19990 (file_cache_slot::get_next_line): ...this.
19991 (goto_next_line): Convert to...
19992 (file_cache_slot::goto_next_line): ...this.
19993 (read_line_num): Convert to...
19994 (file_cache_slot::read_line_num): ...this.
19995 (location_get_source_line): Update for moving of globals to
19996 global_dc->m_file_cache.
19997 (location_missing_trailing_newline): Likewise.
19998 * input.h (class file_cache_slot): New forward decl.
19999 (class file_cache): New.
20001 2021-07-01 Michael Meissner <meissner@linux.ibm.com>
20003 * config/rs6000/rs6000.c (rs6000_maybe_emit_fp_cmove): Add IEEE
20004 128-bit floating point conditional move support.
20005 (have_compare_and_set_mask): Add IEEE 128-bit floating point
20007 * config/rs6000/rs6000.md (mov<mode>cc, IEEE128 iterator): New insn.
20008 (mov<mode>cc_p10, IEEE128 iterator): New insn.
20009 (mov<mode>cc_invert_p10, IEEE128 iterator): New insn.
20010 (fpmask<mode>, IEEE128 iterator): New insn.
20011 (xxsel<mode>, IEEE128 iterator): New insn.
20013 2021-07-01 Iain Sandoe <iain@sandoe.co.uk>
20016 * config/darwin.h (CTF_INFO_SECTION_NAME): New.
20018 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
20020 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
20022 * config/i386/i386-protos.h (ix86_expand_vector_init_duplicate):
20024 * config/i386/sse.md (INT_BROADCAST_MODE): New mode iterator.
20025 (vec_duplicate<mode>): New expander.
20027 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
20030 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
20032 (ix86_byte_broadcast): New function.
20033 (ix86_convert_const_wide_int_to_broadcast): Likewise.
20034 (ix86_expand_move): Convert CONST_WIDE_INT to broadcast if mode
20035 size is 16 bytes or bigger.
20036 (ix86_broadcast_from_integer_constant): New function.
20037 (ix86_expand_vector_move): Convert CONST_WIDE_INT and CONST_VECTOR
20038 to broadcast if mode size is 16 bytes or bigger.
20039 * config/i386/i386-protos.h (ix86_gen_scratch_sse_rtx): New
20041 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): New function.
20043 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
20045 * config/i386/predicates.md (ix86_endbr_immediate_operand):
20046 Return true/false instead of 1/0.
20047 (movq_parallel): Ditto.
20049 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
20051 * recog.c (general_operand): Return true/false instead of 1/0.
20052 (register_operand): Ditto.
20053 (immediate_operand): Ditto.
20054 (const_int_operand): Ditto.
20055 (const_scalar_int_operand): Ditto.
20056 (const_double_operand): Ditto.
20057 (push_operand): Ditto.
20058 (pop_operand): Ditto.
20059 (memory_operand): Ditto.
20060 (indirect_operand): Ditto.
20062 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
20064 * genpreds.c (write_predicate_subfunction):
20065 Change the type of written subfunction to bool.
20066 (write_one_predicate_function):
20067 Change the type of written function to bool.
20068 (write_tm_preds_h): Ditto.
20069 * recog.h (*insn_operand_predicate_fn): Change the type to bool.
20070 * recog.c (general_operand): Change the type to bool.
20071 (address_operand): Ditto.
20072 (register_operand): Ditto.
20073 (pmode_register_operand): Ditto.
20074 (scratch_operand): Ditto.
20075 (immediate_operand): Ditto.
20076 (const_int_operand): Ditto.
20077 (const_scalar_int_operand): Ditto.
20078 (const_double_operand): Ditto.
20079 (nonimmediate_operand): Ditto.
20080 (nonmemory_operand): Ditto.
20081 (push_operand): Ditto.
20082 (pop_operand): Ditto.
20083 (memory_operand): Ditto.
20084 (indirect_operand): Ditto.
20085 (ordered_comparison_operator): Ditto.
20086 (comparison_operator): Ditto.
20087 * config/i386/i386-expand.c (ix86_expand_sse_cmp):
20088 Change the type of indirect predicate function to bool.
20089 * config/rs6000/rs6000.c (easy_vector_constant):
20090 Change the type to bool.
20091 * config/mips/mips-protos.h (m16_based_address_p):
20092 Change the type of operand 3 to bool.
20094 2021-07-01 Richard Biener <rguenther@suse.de>
20096 PR tree-optimization/101280
20097 PR tree-optimization/101173
20098 * gimple-loop-interchange.cc
20099 (tree_loop_interchange::valid_data_dependences): Revert
20100 previous change and instead correctly handle DDR_REVERSED_P
20103 2021-07-01 Richard Biener <rguenther@suse.de>
20105 PR tree-optimization/101278
20106 * tree-ssa-dse.c (dse_classify_store): First check for
20107 uses, then ignore stmt for chaining purposes.
20109 2021-07-01 Richard Biener <rguenther@suse.de>
20111 PR tree-optimization/100778
20112 * tree-vect-slp.c (vect_schedule_slp_node): Do not place trapping
20113 vectorized ops ahead of their scalar BB.
20115 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
20118 * config/i386/i386.md (*nabs<dwi>2_doubleword):
20119 New insn_and_split pattern.
20120 (*nabs<dwi>2_1): Ditto.
20121 * config/i386/i386-features.c
20122 (general_scalar_chain::compute_convert_gain):
20123 Handle (NEG (ABS (...))) RTX. Rewrite src code
20124 scanner as switch statement.
20125 (general_scalar_chain::convert_insn):
20126 Handle (NEG (ABS (...))) RTX.
20127 (general_scalar_to_vector_candidate_p):
20128 Detect (NEG (ABS (...))) RTX. Reorder case statements
20129 for (AND (NOT (...) ...)) fallthrough.
20131 2021-07-01 Richard Biener <rguenther@suse.de>
20133 PR tree-optimization/101178
20134 * tree-vect-slp.c (slpg_vertex::materialize): Remove.
20135 (slpg::perm_in): Add.
20136 (slpg::get_perm_in): Remove.
20137 (slpg::get_perm_materialized): Add.
20138 (vect_optimize_slp): Handle VEC_PERM nodes more optimally
20139 during permute propagation and materialization.
20141 2021-07-01 Jakub Jelinek <jakub@redhat.com>
20144 * dwarf2out.c (loc_list_from_tree_1): Handle COMPOUND_LITERAL_EXPR.
20146 2021-07-01 Jakub Jelinek <jakub@redhat.com>
20148 PR middle-end/94366
20149 * omp-low.c (lower_rec_input_clauses): Rename is_fp_and_or to
20150 is_truth_op, set it for TRUTH_*IF_EXPR regardless of new_var's type,
20151 use boolean_type_node instead of integer_type_node as NE_EXPR type.
20152 (lower_reduction_clauses): Likewise.
20154 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
20156 * config/gcn/gcn.c: Include dwarf2.h.
20157 (gcn_addr_space_debug): New function.
20158 (TARGET_ADDR_SPACE_DEBUG): New hook.
20160 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
20162 * common/config/gcn/gcn-common.c
20163 (gcn_option_optimization_table): Change OPT_fomit_frame_pointer to -O3.
20164 * config/gcn/gcn.c (gcn_expand_prologue): Prefer the frame pointer
20166 (gcn_expand_prologue): Prefer the frame pointer when emitting CFI.
20167 (gcn_frame_pointer_rqd): New function.
20168 (TARGET_FRAME_POINTER_REQUIRED): New hook.
20170 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
20172 * config/gcn/gcn.c (move_callee_saved_registers): Emit CFI notes for
20173 prologue register saves.
20174 (gcn_debug_unwind_info): Use UI_DWARF2.
20175 (gcn_dwarf_register_number): Map DWARF_LINK_REGISTER to DWARF PC.
20176 (gcn_dwarf_register_span): DWARF_LINK_REGISTER doesn't span.
20177 * config/gcn/gcn.h (DWARF_FRAME_RETURN_COLUMN): New define.
20178 (DWARF_LINK_REGISTER): New define.
20179 (FIRST_PSEUDO_REGISTER): Increment.
20180 (FIXED_REGISTERS): Add entry for DWARF_LINK_REGISTER.
20181 (CALL_USED_REGISTERS): Likewise.
20182 (REGISTER_NAMES): Likewise.
20184 2021-06-30 Richard Biener <rguenther@suse.de>
20186 PR tree-optimization/101267
20187 * tree-vect-stmts.c (vect_check_scalar_mask): Adjust
20188 API and use SLP compatible interface of vect_is_simple_use.
20189 Reject not vectorized SLP defs for callers that do not support
20191 (vect_check_store_rhs): Handle masked stores and pass down
20192 the appropriate operator index.
20193 (vectorizable_call): Adjust.
20194 (vectorizable_store): Likewise.
20195 (vectorizable_load): Likewise. Handle SLP peculiarity of
20197 (vect_is_simple_use): Remove special-casing of masked stores.
20199 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
20201 * common.opt (foffload): Remove help as Driver only.
20202 * gcc.c (display_help): Add -foffload.
20204 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
20206 * gcc.c (close_at_file, execute): Replace alloca by XALLOCAVEC.
20207 (check_offload_target_name): Fix splitting OFFLOAD_TARGETS into
20208 a candidate list; better inform when no offload target is configured
20209 and fix hint extraction when passed target is not '\0' at [len].
20210 * common.opt (foffload): Add trailing '.'.
20211 (foffload-options): Likewise; fix flag name in the help string.
20213 2021-06-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20216 * config/arm/arm_neon.h: Move vabs intrinsics before vcage_f32.
20217 (vcage_f32): Gate comparison on __FAST_MATH__.
20218 (vcageq_f32): Likewise.
20219 (vcale_f32): Likewise.
20220 (vcaleq_f32): Likewise.
20221 (vcagt_f32): Likewise.
20222 (vcagtq_f32): Likewise.
20223 (vcalt_f32): Likewise.
20224 (vcaltq_f32): Likewise.
20225 (vcage_f16): Likewise.
20226 (vcageq_f16): Likewise.
20227 (vcale_f16): Likewise.
20228 (vcaleq_f16): Likewise.
20229 (vcagt_f16): Likewise.
20230 (vcagtq_f16): Likewise.
20231 (vcalt_f16): Likewise.
20232 (vcaltq_f16): Likewise.
20234 2021-06-30 Richard Biener <rguenther@suse.de>
20236 PR tree-optimization/101264
20237 * tree-vect-slp.c (vect_optimize_slp): Propagate the
20238 computed perm_in to all "any" permute successors
20239 we cannot de-duplicate immediately.
20241 2021-06-30 liuhongt <hongtao.liu@intel.com>
20244 * config/i386/sse.md
20245 (avx512f_sfixupimm<mode><sd_maskz_name><round_saeonly_name>):
20247 (avx512f_sfixupimm<mode><maskz_scalar_name><round_saeonly_name>):
20249 (avx512f_sfixupimm<mode>_mask<round_saeonly_name>): Refined.
20250 * config/i386/subst.md (maskz_scalar): New define_subst.
20251 (maskz_scalar_name): New subst_attr.
20252 (maskz_scalar_op5): Ditto.
20253 (round_saeonly_maskz_scalar_op5): Ditto.
20254 (round_saeonly_maskz_scalar_operand5): Ditto.
20256 2021-06-30 David Edelsohn <dje.gcc@gmail.com>
20258 * config/rs6000/rs6000.c (rs6000_xcoff_section_type_flags):
20259 Increase code CSECT alignment to at least 32 bytes.
20260 * config/rs6000/xcoff.h (TEXT_SECTION_ASM_OP): Add 32 byte
20261 alignment designation.
20263 2021-06-29 Sergei Trofimovich <siarheit@google.com>
20265 * doc/generic.texi: Fix s/net yet/not yet/ typo.
20267 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
20269 PR tree-optimization/101254
20270 * range-op.cc (operator_minus::op1_op2_relation_effect): Check for
20271 wrapping/non-wrapping when setting the result range.
20273 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
20275 * value-query.cc (gimple_range_global): Allow phis.
20277 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
20279 * vr-values.c (vr_values::vrp_stmt_computes_nonzero): Use stmt.
20280 (simplify_using_ranges::op_with_boolean_value_range_p): Add a
20281 statement for location context.
20282 (check_for_binary_op_overflow): Ditto.
20283 (simplify_using_ranges::get_vr_for_comparison): Ditto.
20284 (simplify_using_ranges::compare_name_with_value): Ditto.
20285 (simplify_using_ranges::compare_names): Ditto.
20286 (vrp_evaluate_conditional_warnv_with_ops_using_ranges): Ditto.
20287 (simplify_using_ranges::simplify_truth_ops_using_ranges): Ditto.
20288 (simplify_using_ranges::simplify_min_or_max_using_ranges): Ditto.
20289 (simplify_using_ranges::simplify_internal_call_using_ranges): Ditto.
20290 (simplify_using_ranges::two_valued_val_range_p): Ditto.
20291 (simplify_using_ranges::simplify): Ditto.
20292 * vr-values.h: Adjust prototypes.
20294 2021-06-29 Uroš Bizjak <ubizjak@gmail.com>
20297 * config/i386/mmx.md (vec_addsubv2sf3): New insn pattern.
20299 2021-06-29 Julian Brown <julian@codesourcery.com>
20301 * config/gcn/gcn.c (gcn_init_libfuncs): New function.
20302 (TARGET_INIT_LIBFUNCS): Define target hook using above function.
20303 * config/gcn/gcn.h (UNITS_PER_WORD): Define to 8 for IN_LIBGCC2, 4
20305 (LIBGCC2_UNITS_PER_WORD, BITS_PER_WORD): Remove definitions.
20306 (MAX_FIXED_MODE_SIZE): Change to 128.
20308 2021-06-29 Julian Brown <julian@codesourcery.com>
20310 * config/gcn/gcn.md (UNSPEC_FLBIT_INT): New unspec constant.
20311 (s_mnemonic): Add clrsb.
20312 (gcn_flbit<mode>_int): Add insn pattern for SImode/DImode.
20313 (clrsb<mode>2): Add expander for SImode/DImode.
20315 2021-06-29 Julian Brown <julian@codesourcery.com>
20317 * config/gcn/gcn.md (<su>mulsidi3, <su>mulsidi3_reg, <su>mulsidi3_imm,
20318 muldi3): Add patterns.
20320 2021-06-29 Julian Brown <julian@codesourcery.com>
20322 * config/gcn/gcn.md (<su>mulsi3_highpart): Change to expander.
20323 (<su>mulsi3_highpart_reg, <su>mulsi3_highpart_imm): New patterns.
20325 2021-06-29 Julian Brown <julian@codesourcery.com>
20327 * config/gcn/gcn.md (mulsi3): Make s_mulk_i32 variant clobber SCC.
20329 2021-06-29 Joseph Myers <joseph@codesourcery.com>
20331 * btfout.c, ctfout.c: Include "memmodel.h".
20333 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
20335 * gcc.c (check_offload_target_name): Cast len argument to
20336 %q.*s to 'int'; avoid -Wstringop-truncation warning.
20338 2021-06-29 Richard Biener <rguenther@suse.de>
20340 * tree-vect-slp.c (vect_optimize_slp): Forward propagate
20341 to "any" permute nodes and relax "any" permute propagation
20342 during iterative backward propagation.
20344 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
20347 * common.opt (-foffload=): Update description.
20348 (-foffload-options=): New.
20349 * doc/invoke.texi (C Language Options): Document
20350 -foffload and -foffload-options.
20351 * gcc.c (check_offload_target_name): New, split off from
20352 handle_foffload_option.
20353 (check_foffload_target_names): New.
20354 (handle_foffload_option): Handle -foffload=default.
20355 (driver_handle_option): Update for -foffload-options.
20356 * lto-opts.c (lto_write_options): Use -foffload-options
20357 instead of -foffload.
20358 * lto-wrapper.c (merge_and_complain, append_offload_options):
20360 * opts.c (common_handle_option): Likewise.
20362 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
20364 * doc/invoke.texi (C Language Options): Sort options
20365 alphabetically in optlist and also the description itself.
20366 Remove leftover -fallow-single-precision from the optlist and add
20367 missing -fgnu-tm to it.
20369 2021-06-29 Richard Biener <rguenther@suse.de>
20371 * tree-vect-slp.c (slpg_vertex::visited): Remove.
20372 (vect_slp_perms_eq): Handle -1 permutes.
20373 (vect_optimize_slp): Rewrite permute propagation.
20375 2021-06-29 Jakub Jelinek <jakub@redhat.com>
20378 * match.pd ((intptr_t)x eq/ne CST to x eq/ne (typeof x) CST): Don't
20379 perform the optimization in GENERIC when sanitizing and x has a
20382 2021-06-29 Richard Biener <rguenther@suse.de>
20384 PR tree-optimization/101242
20385 * tree-vect-slp.c (vect_slp_build_vertices): Force-add
20386 PHIs with not represented initial values as leafs.
20388 2021-06-29 Jan-Benedict Glaw <jbglaw@getslash.de>
20390 * config/pdp11/pdp11.h (ASM_OUTPUT_SKIP): Fix signedness warning.
20391 * config/pdp11/pdp11.c (pdp11_asm_print_operand_punct_valid_p): Remove
20392 "register" keyword.
20393 (pdp11_initial_elimination_offset): Remove unused variable.
20394 (pdp11_cmp_length): Ditto.
20395 (pdp11_insn_cost): Ditto, and fix signedness warning.
20397 2021-06-29 David Edelsohn <dje.gcc@gmail.com>
20399 * btfout.c: Include tm_p.h.
20402 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
20404 * config/bpf/bpf.c (bpf_expand_prologue): Do not mark insns as
20406 (bpf_expand_epilogue): Likewise.
20407 * config/bpf/bpf.h (DWARF2_FRAME_INFO): Define to 0.
20408 Do not define DBX_DEBUGGING_INFO.
20410 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
20412 * doc/invoke.texi: Document the CTF and BTF debug info options.
20414 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
20415 David Faust <david.faust@oracle.com>
20416 Jose E. Marchesi <jose.marchesi@oracle.com>
20417 Weimin Pan <weimin.pan@oracle.com>
20419 * Makefile.in: Add ctfc.*, ctfout.c and btfout.c files to
20420 GTFILES. Add new object files.
20421 * common.opt: Add CTF and BTF debug info options.
20422 * btfout.c: New file.
20423 * ctfc.c: Likewise.
20424 * ctfc.h: Likewise.
20425 * ctfout.c: Likewise.
20426 * dwarf2ctf.c: Likewise.
20427 * dwarf2ctf.h: Likewise.
20428 * dwarf2cfi.c (dwarf2out_do_frame): Acknowledge CTF_DEBUG and
20430 * dwarf2out.c (dwarf2out_source_line): Likewise.
20431 (dwarf2out_finish): Skip emitting DWARF if CTF or BTF are to
20433 (debug_format_do_cu): New function.
20434 (dwarf2out_early_finish): Traverse DIEs and emit CTF/BTF for
20436 Include dwarf2ctf.c.
20437 * final.c (dwarf2_debug_info_emitted_p): Acknowledge DWARF-based debug
20439 * flag-types.h (enum debug_info_type): Add CTF_DEBUG and BTF_DEBUG.
20440 (CTF_DEBUG): New bitmask.
20441 (BTF_DEBUG): Likewise.
20442 (enum ctf_debug_info_levels): New enum.
20443 * gengtype.c (open_base_files): Handle ctfc.h.
20444 (main): Handle uint32_t type.
20445 * flags.h (btf_debuginfo_p): New definition.
20446 (dwarf_based_debuginfo_p): Likewise.
20447 * opts.c (debug_type_names): Add entries for CTF and BTF.
20448 (btf_debuginfo_p): New function.
20449 (dwarf_based_debuginfo_p): Likewise.
20450 (common_handle_option): Handle -gctfN and -gbtf options.
20451 (set_debug_level): Set CTF_DEBUG, BTF_DEBUG whenever appropriate.
20452 * toplev.c (process_options): Inform the user and ignore -gctfLEVEL if
20455 2021-06-28 Jose E. Marchesi <jose.marchesi@oracle.com>
20457 * dwarf2out.c (AT_class): Function is no longer static.
20458 (AT_int): Likewise.
20459 (AT_unsigned): Likewise.
20460 (AT_loc): Likewise.
20461 (get_AT): Likewise.
20462 (get_AT_string): Likewise.
20463 (get_AT_flag): Likewise.
20464 (get_AT_unsigned): Likewise.
20465 (get_AT_ref): Likewise.
20466 (new_die_raw): Likewise.
20467 (lookup_decl_die): Likewise.
20468 (base_type_die): Likewise.
20469 (add_name_attribute): Likewise.
20470 (add_AT_int): Likewise.
20471 (add_AT_unsigned): Likewise.
20472 (add_AT_loc): Likewise.
20473 (dw_get_die_tag): New function.
20474 (dw_get_die_child): Likewise.
20475 (dw_get_die_sib): Likewise.
20476 (struct dwarf_file_data): Move from here to dwarf2out.h.
20477 (struct dw_attr_struct): Likewise.
20478 * dwarf2out.h: Analogous changes.
20480 2021-06-28 Martin Jambor <mjambor@suse.cz>
20483 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
20484 members m_dead_stmts and m_dead_ssas.
20485 * ipa-param-manipulation.c
20486 (ipa_param_body_adjustments::mark_dead_statements): New function.
20487 (ipa_param_body_adjustments::common_initialization): Call it on
20488 all removed but not split parameters.
20489 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
20491 (ipa_param_body_adjustments::modify_call_stmt): Remove arguments that
20493 * tree-inline.c (remap_gimple_stmt): Do not copy dead statements, reset
20494 dead debug statements.
20495 (copy_phis_for_bb): Do not copy dead PHI nodes.
20497 2021-06-28 Martin Jambor <mjambor@suse.cz>
20500 * symtab-clones.h (clone_info): Removed member param_adjustments.
20501 * ipa-param-manipulation.h: Adjust initial comment to reflect how we
20502 deal with pass-through splits now.
20503 (ipa_param_performed_split): Removed.
20504 (ipa_param_adjustments::modify_call): Adjusted parameters.
20505 (class ipa_param_body_adjustments): Adjusted parameters of
20506 register_replacement, modify_gimple_stmt and modify_call_stmt.
20507 (ipa_verify_edge_has_no_modifications): Declare.
20508 (ipa_edge_modifications_finalize): Declare.
20509 * cgraph.c (cgraph_edge::redirect_call_stmt_to_callee): Remove
20510 performed_splits processing, pass only edge to padjs->modify_call,
20511 check that call arguments were not modified if they should not have
20513 * cgraphclones.c (cgraph_node::create_clone): Do not copy performed
20515 * ipa-param-manipulation.c (struct pass_through_split_map): New type.
20516 (ipa_edge_modification_info): Likewise.
20517 (ipa_edge_modification_sum): Likewise.
20518 (ipa_edge_modifications): New edge summary.
20519 (ipa_verify_edge_has_no_modifications): New function.
20520 (transitive_split_p): Removed.
20521 (transitive_split_map): Likewise.
20522 (init_transitive_splits): Likewise.
20523 (ipa_param_adjustments::modify_call): Adjusted to use the new edge
20524 summary instead of performed_splits.
20525 (ipa_param_body_adjustments::register_replacement): Drop dummy
20526 parameter, set base_index of the created ipa_param_body_replacement.
20527 (phi_arg_will_live_p): New function.
20528 (ipa_param_body_adjustments::common_initialization): Do not create
20529 IPA_SRA dummy decls.
20530 (simple_tree_swap_info): Removed.
20531 (remap_split_decl_to_dummy): Likewise.
20532 (record_argument_state_1): New function.
20533 (record_argument_state): Likewise.
20534 (ipa_param_body_adjustments::modify_call_stmt): New parameter
20535 orig_stmt. Do not work with dummy decls, save necessary info about
20536 changes to ipa_edge_modifications.
20537 (ipa_param_body_adjustments::modify_gimple_stmt): New parameter
20538 orig_stmt, pass it to modify_call_stmt.
20539 (ipa_param_body_adjustments::modify_cfun_body): Adjust call to
20540 modify_gimple_stmt.
20541 (ipa_edge_modifications_finalize): New function.
20542 * tree-inline.c (remap_gimple_stmt): Pass original statement to
20543 modify_gimple_stmt.
20544 (copy_phis_for_bb): Do not copy dead PHI nodes.
20545 (expand_call_inline): Do not remap performed_splits.
20546 (update_clone_info): Likewise.
20547 * toplev.c: Include ipa-param-manipulation.h.
20548 (toplev::finalize): Call ipa_edge_modifications_finalize.
20550 2021-06-28 Andrew Pinski <apinski@marvell.com>
20552 * tree-ssa-phiopt.c (replace_phi_edge_with_variable): Duplicate range
20553 info if we're the only things setting the target PHI.
20554 (value_replacement): Don't duplicate range here.
20555 (minmax_replacement): Likewise.
20557 2021-06-28 Richard Biener <rguenther@suse.de>
20559 PR tree-optimization/101229
20560 * gimple-walk.c (gimple_walk_op): Handle PHIs.
20562 2021-06-28 Martin Liska <mliska@suse.cz>
20564 * config/v850/v850.c (construct_dispose_instruction): Allocate
20566 (construct_prepare_instruction): Likewise.
20568 2021-06-28 Martin Liska <mliska@suse.cz>
20570 * config/v850/v850.c (v850_option_override): Build default
20572 (v850_can_inline_p): New. Allow MASK_PROLOG_FUNCTION to be
20573 ignored for inlining.
20574 (TARGET_CAN_INLINE_P): New.
20576 2021-06-28 Richard Biener <rguenther@suse.de>
20578 PR tree-optimization/101207
20579 * tree-vect-slp.c (vect_optimize_slp): Do BB reduction
20580 permute eliding for load permutations properly.
20582 2021-06-28 Richard Biener <rguenther@suse.de>
20584 PR tree-optimization/101173
20585 * gimple-loop-interchange.cc
20586 (tree_loop_interchange::valid_data_dependences): Disallow outer
20587 loop dependence distance of zero.
20589 2021-06-28 liuhongt <hongtao.liu@intel.com>
20592 * config/i386/sse.md (*avx_cmp<mode>3_lt): New
20593 define_insn_and_split.
20594 (*avx_cmp<mode>3_ltint): Ditto.
20595 (*avx2_pcmp<mode>3_3): Ditto.
20596 (*avx2_pcmp<mode>3_4): Ditto.
20597 (*avx2_pcmp<mode>3_5): Ditto.
20599 2021-06-28 liuhongt <hongtao.liu@intel.com>
20601 * config/i386/i386-builtin.def (IX86_BUILTIN_BLENDVPD256,
20602 IX86_BUILTIN_BLENDVPS256, IX86_BUILTIN_PBLENDVB256,
20603 IX86_BUILTIN_BLENDVPD, IX86_BUILTIN_BLENDVPS,
20604 IX86_BUILTIN_PBLENDVB128): Replace icode with
20606 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold blendv
20608 * config/i386/sse.md (*<sse4_1_avx2>_pblendvb_lt_subreg_not):
20609 New pre_reload splitter.
20611 2021-06-27 Andrew Pinski <apinski@marvell.com>
20613 PR middle-end/101230
20614 * fold-const.c (fold_ternary_loc): Check
20615 the return value of invert_tree_comparison.
20617 2021-06-27 David Edelsohn <dje.gcc@gmail.com>
20619 * config.gcc: Add SPDX License Identifier.
20620 (powerpc-ibm-aix789): Default to aix73.h.
20621 (powerpc-ibm-aix7.2.*.*): New stanza.
20622 * config/rs6000/aix72.h: Add SPDX License Identifier.
20623 * config/rs6000/aix73.h: New file.
20625 2021-06-26 Jason Merrill <jason@redhat.com>
20627 * except.c: #include "dwarf2.h" instead of "dwarf2out.h".
20629 2021-06-26 Andrew Pinski <apinski@marvell.com>
20631 * genmatch.c (lower_cond): Copy for_subst_vec
20632 for the simplify also.
20633 (lower): Swap the order for lower_for and lower_cond.
20635 2021-06-26 Andrew Pinski <apinski@marvell.com>
20637 * tree-ssa-phiopt.c (match_simplify_replacement): Reset
20638 flow sensitive info on the moved ssa set.
20640 2021-06-26 Andrew Pinski <apinski@marvell.com>
20642 * fold-const.c (fold_cond_expr_with_comparison):
20643 Expand arg0 into comp_code, arg00, and arg01.
20644 (fold_ternary_loc): Use invert_tree_comparison
20645 instead of fold_invert_truthvalue for the case
20646 where we have A CMP B ? C : A.
20648 2021-06-25 Martin Sebor <msebor@redhat.com>
20650 PR middle-end/101216
20651 * calls.c (maybe_warn_rdwr_sizes): Use the no_warning constant.
20653 2021-06-25 Jeff Law <jeffreyalaw@gmail.com>
20655 * config/h8300/h8300.c (select_cc_mode): Handle ASHIFTRT and LSHIFTRT.
20657 2021-06-25 Richard Biener <rguenther@suse.de>
20659 PR tree-optimization/101202
20660 * tree-vect-slp.c (vect_optimize_slp): Explicitly handle
20663 2021-06-25 Richard Biener <rguenther@suse.de>
20665 * tree-vect-slp-patterns.c (addsub_pattern::build): Copy
20666 STMT_VINFO_REDUC_DEF from the original representative.
20668 2021-06-25 Martin Sebor <msebor@redhat.com>
20670 * builtins.c (warn_string_no_nul): Replace uses of TREE_NO_WARNING,
20671 gimple_no_warning_p and gimple_set_no_warning with
20672 warning_suppressed_p, and suppress_warning.
20674 (maybe_warn_for_bound): Same.
20675 (warn_for_access): Same.
20676 (check_access): Same.
20677 (expand_builtin_strncmp): Same.
20678 (fold_builtin_varargs): Same.
20679 * calls.c (maybe_warn_nonstring_arg): Same.
20680 (maybe_warn_rdwr_sizes): Same.
20681 * cfgexpand.c (expand_call_stmt): Same.
20682 * cgraphunit.c (check_global_declaration): Same.
20683 * fold-const.c (fold_undefer_overflow_warnings): Same.
20684 (fold_truth_not_expr): Same.
20685 (fold_unary_loc): Same.
20686 (fold_checksum_tree): Same.
20687 * gimple-array-bounds.cc (array_bounds_checker::check_array_ref): Same.
20688 (array_bounds_checker::check_mem_ref): Same.
20689 (array_bounds_checker::check_addr_expr): Same.
20690 (array_bounds_checker::check_array_bounds): Same.
20691 * gimple-expr.c (copy_var_decl): Same.
20692 * gimple-fold.c (gimple_fold_builtin_strcpy): Same.
20693 (gimple_fold_builtin_strncat): Same.
20694 (gimple_fold_builtin_stxcpy_chk): Same.
20695 (gimple_fold_builtin_stpcpy): Same.
20696 (gimple_fold_builtin_sprintf): Same.
20697 (fold_stmt_1): Same.
20698 * gimple-ssa-isolate-paths.c (diag_returned_locals): Same.
20699 * gimple-ssa-nonnull-compare.c (do_warn_nonnull_compare): Same.
20700 * gimple-ssa-sprintf.c (handle_printf_call): Same.
20701 * gimple-ssa-store-merging.c (imm_store_chain_info::output_merged_store): Same.
20702 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
20703 * gimple-ssa-warn-restrict.h: Adjust declarations.
20704 (maybe_diag_access_bounds): Replace uses of TREE_NO_WARNING,
20705 gimple_no_warning_p and gimple_set_no_warning with
20706 warning_suppressed_p, and suppress_warning.
20707 (check_call): Same.
20708 (check_bounds_or_overlap): Same.
20709 * gimple.c (gimple_build_call_from_tree): Same.
20710 * gimplify.c (gimplify_return_expr): Same.
20711 (gimplify_cond_expr): Same.
20712 (gimplify_modify_expr_complex_part): Same.
20713 (gimplify_modify_expr): Same.
20714 (gimple_push_cleanup): Same.
20715 (gimplify_expr): Same.
20716 * omp-expand.c (expand_omp_for_generic): Same.
20717 (expand_omp_taskloop_for_outer): Same.
20718 * omp-low.c (lower_rec_input_clauses): Same.
20719 (lower_lastprivate_clauses): Same.
20720 (lower_send_clauses): Same.
20721 (lower_omp_target): Same.
20722 * tree-cfg.c (pass_warn_function_return::execute): Same.
20723 * tree-complex.c (create_one_component_var): Same.
20724 * tree-inline.c (remap_gimple_op_r): Same.
20725 (copy_tree_body_r): Same.
20726 (declare_return_variable): Same.
20727 (expand_call_inline): Same.
20728 * tree-nested.c (lookup_field_for_decl): Same.
20729 * tree-sra.c (create_access_replacement): Same.
20730 (generate_subtree_copies): Same.
20731 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Same.
20732 * tree-ssa-forwprop.c (combine_cond_expr_cond): Same.
20733 * tree-ssa-loop-ch.c (ch_base::copy_headers): Same.
20734 * tree-ssa-loop-im.c (execute_sm): Same.
20735 * tree-ssa-phiopt.c (cond_store_replacement): Same.
20736 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
20737 (handle_builtin_strcpy): Same.
20738 (maybe_diag_stxncpy_trunc): Same.
20739 (handle_builtin_stxncpy_strncat): Same.
20740 (handle_builtin_strcat): Same.
20741 * tree-ssa-uninit.c (get_no_uninit_warning): Same.
20742 (set_no_uninit_warning): Same.
20743 (uninit_undefined_value_p): Same.
20744 (warn_uninit): Same.
20745 (maybe_warn_operand): Same.
20746 * tree-vrp.c (compare_values_warnv): Same.
20747 * vr-values.c (vr_values::extract_range_for_var_from_comparison_expr): Same.
20748 (test_for_singularity): Same.
20749 * gimple.h (warning_suppressed_p): New function.
20750 (suppress_warning): Same.
20751 (copy_no_warning): Same.
20752 (gimple_set_block): Call gimple_set_location.
20753 (gimple_set_location): Call copy_warning.
20755 2021-06-25 Martin Sebor <msebor@redhat.com>
20757 * tree.h (warning_suppressed_at, copy_warning,
20758 warning_suppressed_p, suppress_warning): New functions.
20760 2021-06-25 Martin Sebor <msebor@redhat.com>
20762 * Makefile.in (OBJS-libcommon): Add diagnostic-spec.o.
20763 * gengtype.c (open_base_files): Add diagnostic-spec.h.
20764 * diagnostic-spec.c: New file.
20765 * diagnostic-spec.h: New file.
20766 * tree.h (no_warning, all_warnings, suppress_warning_at): New
20768 * warning-control.cc: New file.
20770 2021-06-25 liuhongt <hongtao.liu@intel.com>
20773 * config/i386/i386.c (x86_order_regs_for_local_alloc):
20776 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20778 PR tree-optimization/101189
20779 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Pass
20780 LHS range of condition to postfold routine.
20781 (fold_using_range::postfold_gcond_edges): Only process the TRUE or
20782 FALSE edge if the LHS range supports it being taken.
20783 * gimple-range-fold.h (postfold_gcond_edges): Add range parameter.
20785 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20787 * value-relation.cc (equiv_oracle::dump): Do not dump NULL blocks.
20788 (relation_oracle::find_relation_block): Check correct bitmap.
20789 (relation_oracle::dump): Do not dump NULL blocks.
20791 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20793 * gimple-range-cache.cc (ranger_cache::propagate_cache): Call
20794 range_on_edge instead of manually calculating.
20796 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20798 * range-op.cc: Fix comment.
20800 2021-06-24 Uroš Bizjak <ubizjak@gmail.com>
20803 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
20804 Handle V8QI and V4HI modes.
20805 * config/i386/mmx.md (sse4_1_<any_extend:code>v4qiv4hi2):
20807 (sse4_1_<any_extend:code>v4qiv4hi2): Ditto.
20808 (mmxpackmode): New mode attribute.
20809 (vec_pack_trunc_<mmxpackmode:mode>): New expander.
20810 (mmxunpackmode): New mode attribute.
20811 (vec_unpacks_lo_<mmxunpackmode:mode>): New expander.
20812 (vec_unpacks_hi_<mmxunpackmode:mode>): Ditto.
20813 (vec_unpacku_lo_<mmxunpackmode:mode>): Ditto.
20814 (vec_unpacku_hi_<mmxunpackmode:mode>): Ditto.
20815 * config/i386/i386.md (extsuffix): Move from ...
20816 * config/i386/sse.md: ... here.
20818 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
20820 * dwarf2out.c (dwarf2out_assembly_start): Emit .file 0 marker here...
20821 (dwarf2out_finish): ...instead of here.
20823 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
20825 * configure.ac (--gdwarf-5 option): Use objdump instead of readelf.
20826 (working --gdwarf-4/--gdwarf-5 for all sources): Likewise.
20827 (--gdwarf-4 not refusing generated .debug_line): Adjust for Windows.
20828 * configure: Regenerate.
20830 2021-06-24 Richard Biener <rguenther@suse.de>
20832 * config/i386/sse.md (vec_addsubv4df3, vec_addsubv2df3,
20833 vec_addsubv8sf3, vec_addsubv4sf3): Merge into ...
20834 (vec_addsub<mode>3): ... using a new addsub_cst mode attribute.
20836 2021-06-24 Richard Biener <rguenther@suse.de>
20838 * config/i386/sse.md (avx_addsubv4df3): Rename to
20840 (avx_addsubv8sf3): Rename to vec_addsubv8sf3.
20841 (sse3_addsubv2df3): Rename to vec_addsubv2df3.
20842 (sse3_addsubv4sf3): Rename to vec_addsubv4sf3.
20843 * config/i386/i386-builtin.def: Adjust.
20844 * internal-fn.def (VEC_ADDSUB): New internal optab fn.
20845 * optabs.def (vec_addsub_optab): New optab.
20846 * tree-vect-slp-patterns.c (class addsub_pattern): New.
20847 (slp_patterns): Add addsub_pattern.
20848 * tree-vect-slp.c (vect_optimize_slp): Disable propagation
20849 across CFN_VEC_ADDSUB.
20850 * tree-vectorizer.h (vect_pattern::vect_pattern): Make
20852 * doc/md.texi (vec_addsub<mode>3): Document.
20854 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20856 PR middle-end/101170
20857 * df-scan.c (df_ref_record): For paradoxical big-endian SUBREGs
20858 where regno + subreg_regno_offset wraps around use 0 as starting
20861 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20863 PR middle-end/101172
20864 * stor-layout.c (finish_bitfield_representative): If nextf has
20865 error_mark_node type, set repr type to error_mark_node too.
20867 2021-06-24 Ilya Leoshkevich <iii@linux.ibm.com>
20869 * config/s390/s390.c (s390_function_profiler): Ignore labelno
20871 * config/s390/s390.h (NO_PROFILE_COUNTERS): Define.
20873 2021-06-24 Richard Biener <rguenther@suse.de>
20875 * tree-vect-slp.c (vect_optimize_slp): Do not propagate
20876 across operations that have different semantics on different
20879 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20881 * tree.h (OMP_CLAUSE_MAP_IN_REDUCTION): Document meaning for OpenMP.
20882 * gimplify.c (gimplify_scan_omp_clauses): For OpenMP map clauses
20883 with OMP_CLAUSE_MAP_IN_REDUCTION flag partially defer gimplification
20884 of non-decl OMP_CLAUSE_DECL. For OMP_CLAUSE_IN_REDUCTION on
20885 OMP_TARGET use outer_ctx instead of ctx for placeholders and
20886 initializer/combiner gimplification.
20887 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_MAP_IN_REDUCTION
20888 on target constructs.
20889 (lower_rec_input_clauses): Likewise.
20890 (lower_omp_target): Likewise.
20891 * omp-expand.c (expand_omp_target): Temporarily ignore nowait clause
20892 on target if in_reduction is present.
20894 2021-06-24 Kewen Lin <linkw@linux.ibm.com>
20896 * tree-predcom.c (class pcom_worker): New class.
20897 (release_chain): Renamed to...
20898 (pcom_worker::release_chain): ...this.
20899 (release_chains): Renamed to...
20900 (pcom_worker::release_chains): ...this.
20901 (aff_combination_dr_offset): Renamed to...
20902 (pcom_worker::aff_combination_dr_offset): ...this.
20903 (determine_offset): Renamed to...
20904 (pcom_worker::determine_offset): ...this.
20905 (class comp_ptrs): New class.
20906 (split_data_refs_to_components): Renamed to...
20907 (pcom_worker::split_data_refs_to_components): ...this,
20908 and update with class comp_ptrs.
20909 (suitable_component_p): Renamed to...
20910 (pcom_worker::suitable_component_p): ...this.
20911 (filter_suitable_components): Renamed to...
20912 (pcom_worker::filter_suitable_components): ...this.
20913 (valid_initializer_p): Renamed to...
20914 (pcom_worker::valid_initializer_p): ...this.
20915 (find_looparound_phi): Renamed to...
20916 (pcom_worker::find_looparound_phi): ...this.
20917 (add_looparound_copies): Renamed to...
20918 (pcom_worker::add_looparound_copies): ...this.
20919 (determine_roots_comp): Renamed to...
20920 (pcom_worker::determine_roots_comp): ...this.
20921 (determine_roots): Renamed to...
20922 (pcom_worker::determine_roots): ...this.
20923 (single_nonlooparound_use): Renamed to...
20924 (pcom_worker::single_nonlooparound_use): ...this.
20925 (remove_stmt): Renamed to...
20926 (pcom_worker::remove_stmt): ...this.
20927 (execute_pred_commoning_chain): Renamed to...
20928 (pcom_worker::execute_pred_commoning_chain): ...this.
20929 (execute_pred_commoning): Renamed to...
20930 (pcom_worker::execute_pred_commoning): ...this.
20931 (struct epcc_data): New member worker.
20932 (execute_pred_commoning_cbck): Call execute_pred_commoning
20933 with pcom_worker pointer.
20934 (find_use_stmt): Renamed to...
20935 (pcom_worker::find_use_stmt): ...this.
20936 (find_associative_operation_root): Renamed to...
20937 (pcom_worker::find_associative_operation_root): ...this.
20938 (find_common_use_stmt): Renamed to...
20939 (pcom_worker::find_common_use_stmt): ...this.
20940 (combinable_refs_p): Renamed to...
20941 (pcom_worker::combinable_refs_p): ...this.
20942 (reassociate_to_the_same_stmt): Renamed to...
20943 (pcom_worker::reassociate_to_the_same_stmt): ...this.
20944 (stmt_combining_refs): Renamed to...
20945 (pcom_worker::stmt_combining_refs): ...this.
20946 (combine_chains): Renamed to...
20947 (pcom_worker::combine_chains): ...this.
20948 (try_combine_chains): Renamed to...
20949 (pcom_worker::try_combine_chains): ...this.
20950 (prepare_initializers_chain): Renamed to...
20951 (pcom_worker::prepare_initializers_chain): ...this.
20952 (prepare_initializers): Renamed to...
20953 (pcom_worker::prepare_initializers): ...this.
20954 (prepare_finalizers_chain): Renamed to...
20955 (pcom_worker::prepare_finalizers_chain): ...this.
20956 (prepare_finalizers): Renamed to...
20957 (pcom_worker::prepare_finalizers): ...this.
20958 (tree_predictive_commoning_loop): Renamed to...
20959 (pcom_worker::tree_predictive_commoning_loop): ...this, adjust
20960 some calls and remove some cleanup code.
20961 (tree_predictive_commoning): Adjusted to use pcom_worker instance.
20962 (static variable looparound_phis): Remove.
20963 (static variable name_expansions): Remove.
20965 2021-06-24 Richard Biener <rguenther@suse.de>
20967 * tree-vect-slp.c (slpg_vertex): New struct.
20968 (vect_slp_build_vertices): Adjust.
20969 (vect_optimize_slp): Likewise. Maintain an outgoing permute
20970 and a materialized one.
20972 2021-06-24 Richard Biener <rguenther@suse.de>
20974 PR tree-optimization/101105
20975 * tree-vect-data-refs.c (vect_prune_runtime_alias_test_list):
20976 Only ignore steps when they are equal or scalar order is preserved.
20978 2021-06-24 liuhongt <hongtao.liu@intel.com>
20981 * config/i386/i386-expand.c (ix86_expand_vec_interleave):
20982 Adjust comments for ix86_expand_vecop_qihi2.
20983 (ix86_expand_vecmul_qihi): Renamed to ...
20984 (ix86_expand_vecop_qihi2): Adjust function prototype to
20985 support shift operation, add static to definition.
20986 (ix86_expand_vec_shift_qihi_constant): Add static to definition.
20987 (ix86_expand_vecop_qihi): Call ix86_expand_vecop_qihi2 and
20988 ix86_expand_vec_shift_qihi_constant.
20989 * config/i386/i386-protos.h (ix86_expand_vecmul_qihi): Deleted.
20990 (ix86_expand_vec_shift_qihi_constant): Deleted.
20991 * config/i386/sse.md (VI12_256_512_AVX512VL): New mode
20993 (mulv8qi3): Call ix86_expand_vecop_qihi directly, add
20994 condition TARGET_64BIT.
20995 (mul<mode>3): Ditto.
20996 (<insn><mode>3): Ditto.
20997 (vlshr<mode>3): Extend to support avx512 vlshr.
20998 (v<insn><mode>3): New expander for
21000 (v<insn>v8qi3): Ditto.
21001 (vashrv8hi3<mask_name>): Renamed to ..
21002 (vashr<mode>3): And extend to support V16QImode for avx512.
21003 (vashrv16qi3): Deleted.
21004 (vashrv2di3<mask_name>): Extend expander to support avx512
21007 2021-06-23 Dimitar Dimitrov <dimitar@dinux.eu>
21009 * doc/lto.texi (Design Overview): Update that slim objects are
21012 2021-06-23 Aaron Sawdey <acsawdey@linux.ibm.com>
21014 * config/rs6000/rs6000-cpus.def: Take OPTION_MASK_PCREL_OPT out
21015 of OTHER_POWER10_MASKS so it will not be enabled by default.
21017 2021-06-23 Richard Biener <rguenther@suse.de>
21018 Martin Jambor <mjambor@suse.cz>
21020 * tree-inline.c (setup_one_parameter): Set TREE_READONLY of the
21021 param replacement unconditionally. Adjust comment.
21023 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
21025 * Makefile.in (OBJS): Add gimple-range-fold.o.
21026 * gimple-range-fold.cc: New.
21027 * gimple-range-fold.h: New.
21028 * gimple-range-gori.cc (gimple_range_calc_op1): Move to here.
21029 (gimple_range_calc_op2): Ditto.
21030 * gimple-range-gori.h: Move prototypes to here.
21031 * gimple-range.cc: Adjust include files.
21032 (fur_source::fur_source): Relocate to gimple-range-fold.cc.
21033 (fur_source::get_operand): Ditto.
21034 (fur_source::get_phi_operand): Ditto.
21035 (fur_source::query_relation): Ditto.
21036 (fur_source::register_relation): Ditto.
21037 (class fur_edge): Ditto.
21038 (fur_edge::fur_edge): Ditto.
21039 (fur_edge::get_operand): Ditto.
21040 (fur_edge::get_phi_operand): Ditto.
21041 (fur_stmt::fur_stmt): Ditto.
21042 (fur_stmt::get_operand): Ditto.
21043 (fur_stmt::get_phi_operand): Ditto.
21044 (fur_stmt::query_relation): Ditto.
21045 (class fur_depend): Relocate to gimple-range-fold.h.
21046 (fur_depend::fur_depend): Relocate to gimple-range-fold.cc.
21047 (fur_depend::register_relation): Ditto.
21048 (fur_depend::register_relation): Ditto.
21049 (class fur_list): Ditto.
21050 (fur_list::fur_list): Ditto.
21051 (fur_list::get_operand): Ditto.
21052 (fur_list::get_phi_operand): Ditto.
21053 (fold_range): Ditto.
21054 (adjust_pointer_diff_expr): Ditto.
21055 (gimple_range_adjustment): Ditto.
21056 (gimple_range_base_of_assignment): Ditto.
21057 (gimple_range_operand1): Ditto.
21058 (gimple_range_operand2): Ditto.
21059 (gimple_range_calc_op1): Relocate to gimple-range-gori.cc.
21060 (gimple_range_calc_op2): Ditto.
21061 (fold_using_range::fold_stmt): Relocate to gimple-range-fold.cc.
21062 (fold_using_range::range_of_range_op): Ditto.
21063 (fold_using_range::range_of_address): Ditto.
21064 (fold_using_range::range_of_phi): Ditto.
21065 (fold_using_range::range_of_call): Ditto.
21066 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
21067 (fold_using_range::range_of_builtin_call): Ditto.
21068 (fold_using_range::range_of_cond_expr): Ditto.
21069 (fold_using_range::range_of_ssa_name_with_loop_info): Ditto.
21070 (fold_using_range::relation_fold_and_or): Ditto.
21071 (fold_using_range::postfold_gcond_edges): Ditto.
21072 * gimple-range.h: Add gimple-range-fold.h to include files. Change
21073 GIMPLE_RANGE_STMT_H to GIMPLE_RANGE_H.
21074 (gimple_range_handler): Relocate to gimple-range-fold.h.
21075 (gimple_range_ssa_p): Ditto.
21076 (range_compatible_p): Ditto.
21077 (class fur_source): Ditto.
21078 (class fur_stmt): Ditto.
21079 (class fold_using_range): Ditto.
21080 (gimple_range_calc_op1): Relocate to gimple-range-gori.h.
21081 (gimple_range_calc_op2): Ditto.
21083 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
21085 PR tree-optimization/101148
21086 PR tree-optimization/101014
21087 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
21088 (ranger_cache::~ranger_cache): Adjust.
21089 (ranger_cache::block_range): Check if propagation disallowed.
21090 (ranger_cache::propagate_cache): Disallow propagation if new value
21091 can't be stored properly.
21092 * gimple-range-cache.h (ranger_cache::m_propfail): New member.
21094 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
21096 * gimple-range-cache.cc (class ssa_block_ranges): Adjust prototype.
21097 (sbr_vector::set_bb_range): Return true.
21098 (class sbr_sparse_bitmap): Adjust.
21099 (sbr_sparse_bitmap::set_bb_range): Return value.
21100 (block_range_cache::set_bb_range): Return value.
21101 (ranger_cache::propagate_cache): Use return value to print msg.
21102 * gimple-range-cache.h (class block_range_cache): Adjust.
21104 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
21106 * gimple-range.cc (dump_bb): Use range_on_edge from the cache.
21108 2021-06-23 Jeff Law <jeffreyalaw@gmail.com>
21110 * config/h8300/logical.md (<code><mode>3<ccnz>): Use <cczn>
21111 so this pattern can be used for test/compare removal. Pass
21112 current insn to compute_logical_op_length and output_logical_op.
21113 * config/h8300/h8300.c (compute_logical_op_cc): Remove.
21114 (h8300_and_costs): Add argument to compute_logical_op_length.
21115 (output_logical_op): Add new argument. Use it to determine if the
21116 condition codes are used and adjust the output accordingly.
21117 (compute_logical_op_length): Add new argument and update length
21118 computations when condition codes are used.
21119 * config/h8300/h8300-protos.h (compute_logical_op_length): Update
21121 (output_logical_op): Likewise.
21123 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
21126 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
21127 Handle 64bit modes for TARGET_XOP. Use indirect gen_* functions.
21128 * config/i386/mmx.md (mmx_ppermv64): New insn pattern.
21129 * config/i386/i386.md (unspec): Move UNSPEC_XOP_PERMUTE from ...
21130 * config/i386/sse.md (unspec): ... here.
21132 2021-06-23 Martin Liska <mliska@suse.cz>
21135 * optc-save-gen.awk: Put back arm_fp16_format to
21138 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
21141 * config/i386/i386.md (bsr_rex64): Add zero-flag setting RTX.
21144 (clz<mode>2): Update RTX pattern for additions.
21146 2021-06-23 Jakub Jelinek <jakub@redhat.com>
21148 PR middle-end/101167
21149 * omp-low.c (lower_omp_regimplify_p): Regimplify also PARM_DECLs
21150 and RESULT_DECLs that have DECL_HAS_VALUE_EXPR_P set.
21152 2021-06-22 Sergei Trofimovich <siarheit@google.com>
21154 * doc/rtl.texi: Drop unbalanced parenthesis.
21156 2021-06-22 Richard Biener <rguenther@suse.de>
21158 PR middle-end/101156
21159 * gimplify.c (gimplify_expr): Remove premature incorrect
21162 2021-06-22 Jakub Jelinek <jakub@redhat.com>
21164 PR tree-optimization/101159
21165 * tree-vect-patterns.c (vect_recog_popcount_pattern): Fix some
21168 2021-06-22 Jakub Jelinek <jakub@redhat.com>
21170 PR middle-end/101160
21171 * function.c (assign_parms): For decl_result with TYPE_EMPTY_P type
21172 clear crtl->return_rtx instead of keeping it referencing a pseudo.
21174 2021-06-22 Jakub Jelinek <jakub@redhat.com>
21175 Andrew Pinski <apinski@marvell.com>
21177 PR tree-optimization/101162
21178 * fold-const.c (range_check_type): Handle OFFSET_TYPE like pointer
21181 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21183 * range-op.cc (range_relational_tests): New.
21184 (range_op_tests): Call range_relational_tests.
21186 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21188 * range-op.cc (operator_cast::lhs_op1_relation): New.
21189 (operator_identity::lhs_op1_relation): New.
21191 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21193 * range-op.cc (operator_minus::op1_op2_relation_effect): New.
21195 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21197 * range-op.cc (operator_plus::lhs_op1_relation): New.
21198 (operator_plus::lhs_op2_relation): New.
21200 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21202 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a
21203 relation_oracle if dominators exist.
21204 (ranger_cache::~ranger_cache): Dispose of oracle.
21205 (ranger_cache::dump_bb): Dump oracle.
21206 * gimple-range.cc (fur_source::fur_source): New.
21207 (fur_source::get_operand): Use member query.
21208 (fur_source::get_phi_operand): Use member query.
21209 (fur_source::query_relation): New.
21210 (fur_source::register_dependency): Delete.
21211 (fur_source::register_relation): New.
21212 (fur_edge::fur_edge): Adjust.
21213 (fur_edge::get_phi_operand): Fix comment.
21214 (fur_edge::query): Delete.
21215 (fur_stmt::fur_stmt): Adjust.
21216 (fur_stmt::query): Delete.
21217 (fur_depend::fur_depend): Adjust.
21218 (fur_depend::register_relation): New.
21219 (fur_depend::register_relation): New.
21220 (fur_list::fur_list): Adjust.
21221 (fur_list::get_operand): Use member query.
21222 (fold_using_range::range_of_range_op): Process and query relations.
21223 (fold_using_range::range_of_address): Adjust dependency call.
21224 (fold_using_range::range_of_phi): Ditto.
21225 (gimple_ranger::gimple_ranger): New. Use ranger_cache oracle.
21226 (fold_using_range::relation_fold_and_or): New.
21227 (fold_using_range::postfold_gcond_edges): New.
21228 * gimple-range.h (class gimple_ranger): Adjust.
21229 (class fur_source): Adjust members.
21230 (class fur_stmt): Ditto.
21231 (class fold_using_range): Ditto.
21233 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21235 * range-op.cc (range_operator::wi_fold): Apply relation effect.
21236 (range_operator::fold_range): Adjust and apply relation effect.
21237 (*::fold_range): Add relation parameters.
21238 (*::op1_range): Ditto.
21239 (*::op2_range): Ditto.
21240 (range_operator::lhs_op1_relation): New.
21241 (range_operator::lhs_op2_relation): New.
21242 (range_operator::op1_op2_relation): New.
21243 (range_operator::op1_op2_relation_effect): New.
21244 (relop_early_resolve): New.
21245 (operator_equal::op1_op2_relation): New.
21246 (operator_equal::fold_range): Call relop_early_resolve.
21247 (operator_not_equal::op1_op2_relation): New.
21248 (operator_not_equal::fold_range): Call relop_early_resolve.
21249 (operator_lt::op1_op2_relation): New.
21250 (operator_lt::fold_range): Call relop_early_resolve.
21251 (operator_le::op1_op2_relation): New.
21252 (operator_le::fold_range): Call relop_early_resolve.
21253 (operator_gt::op1_op2_relation): New.
21254 (operator_gt::fold_range): Call relop_early_resolve.
21255 (operator_ge::op1_op2_relation): New.
21256 (operator_ge::fold_range): Call relop_early_resolve.
21257 * range-op.h (class range_operator): Adjust parameters and methods.
21259 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21261 * Makefile.in (OBJS): Add value-relation.o.
21262 * gimple-range.h: Adjust include files.
21263 * tree-data-ref.c: Adjust include file order.
21264 * value-query.cc (range_query::get_value_range): Default to no oracle.
21265 (range_query::query_relation): New.
21266 (range_query::query_relation): New.
21267 * value-query.h (class range_query): Adjust.
21268 * value-relation.cc: New.
21269 * value-relation.h: New.
21271 2021-06-22 Richard Biener <rguenther@suse.de>
21273 PR tree-optimization/101151
21274 * tree-ssa-sink.c (statement_sink_location): Expand irreducible
21277 2021-06-22 Jojo R <rjiejie@linux.alibaba.com>
21279 * config/riscv/riscv.c (thead_c906_tune_info): New.
21280 (riscv_tune_info_table): Use new tune.
21282 2021-06-22 Richard Biener <rguenther@suse.de>
21284 PR tree-optimization/101158
21285 * tree-vect-slp.c (vect_build_slp_tree_1): Move same operand
21286 checking after checking for matching operation.
21288 2021-06-22 Richard Biener <rguenther@suse.de>
21290 PR tree-optimization/101159
21291 * tree-vect-patterns.c (vect_recog_popcount_pattern): Add
21292 missing NULL vectype check.
21294 2021-06-22 Richard Biener <rguenther@suse.de>
21296 PR tree-optimization/101154
21297 * tree-vect-slp.c (vect_build_slp_tree_2): Fix out-of-bound access.
21299 2021-06-22 Jakub Jelinek <jakub@redhat.com>
21302 * config/i386/i386-protos.h (ix86_last_zero_store_uid): Declare.
21303 * config/i386/i386-expand.c (ix86_last_zero_store_uid): New variable.
21304 * config/i386/i386.c (ix86_expand_prologue): Clear it.
21305 * config/i386/i386.md (peephole2s for 1/2/4 stores of const0_rtx):
21306 Remove "" from match_operand. Emit new insns using emit_move_insn and
21307 set ix86_last_zero_store_uid to INSN_UID of the last store.
21308 Add peephole2s for 1/2/4 stores of const0_rtx following previous
21311 2021-06-22 Martin Liska <mliska@suse.cz>
21313 * auto-profile.c (AUTO_PROFILE_VERSION): Bump as string format
21316 2021-06-22 Martin Liska <mliska@suse.cz>
21318 * gcov-io.h: Remove padding entries.
21320 2021-06-22 liuhongt <hongtao.liu@intel.com>
21322 PR tree-optimization/97770
21323 * tree-vect-patterns.c (vect_recog_popcount_pattern):
21325 (vect_recog_func vect_vect_recog_func_ptrs): Add new pattern.
21327 2021-06-22 liuhongt <hongtao.liu@intel.com>
21330 * config/i386/i386-builtin.def (BDESC): Adjust builtin name.
21331 * config/i386/sse.md (<avx512>_expand<mode>_mask): Rename to ..
21332 (expand<mode>_mask): this ..
21333 (*expand<mode>_mask): New pre_reload splitter to transform
21334 v{,p}expand* to vmov* when mask is zero, all ones, or has all
21335 ones in its lower part, otherwise still generate
21338 2021-06-22 liuhongt <hongtao.liu@intel.com>
21341 * config/i386/i386-expand.c
21342 (ix86_expand_special_args_builtin): Keep constm1_operand only
21343 if it satisfies insn's operand predicate.
21345 2021-06-21 Jason Merrill <jason@redhat.com>
21348 * df-scan.c (df_ref_record): Check that regno < endregno.
21349 * function.c (assign_parms, expand_function_end): Do nothing with a
21350 TYPE_EMPTY_P result.
21352 2021-06-21 Richard Biener <rguenther@suse.de>
21354 PR tree-optimization/101120
21355 * tree-vect-data-refs.c (bump_vector_ptr): Fold the
21357 * tree-vect-slp.c (vect_transform_slp_perm_load): Add
21358 DR chain DCE capability.
21359 * tree-vectorizer.h (vect_transform_slp_perm_load): Adjust.
21360 * tree-vect-stmts.c (vectorizable_load): Remove unused
21361 loads in the DR chain for SLP.
21363 2021-06-21 Jakub Jelinek <jakub@redhat.com>
21365 PR inline-asm/100785
21366 * gimplify.c (gimplify_asm_expr): Don't diagnose errors if
21367 output or input operands were already error_mark_node.
21368 * cfgexpand.c (expand_asm_stmt): If errors are emitted,
21369 remove all inputs, outputs and clobbers from the asm and
21370 set template to "".
21372 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
21374 * config/arm/arm_neon.h (vceq_s8): Replace builtin with __a == __b.
21375 (vceq_s16): Likewise.
21376 (vceq_s32): Likewise.
21377 (vceq_u8): Likewise.
21378 (vceq_u16): Likewise.
21379 (vceq_u32): Likewise.
21380 (vceq_p8): Likewise.
21381 (vceqq_s8): Likewise.
21382 (vceqq_s16): Likewise.
21383 (vceqq_s32): Likewise.
21384 (vceqq_u8): Likewise.
21385 (vceqq_u16): Likewise.
21386 (vceqq_u32): Likewise.
21387 (vceqq_p8): Likewise.
21388 (vceq_f32): Gate __a == __b on __FAST_MATH__.
21389 (vceqq_f32): Likewise.
21390 (vceq_f16): Likewise.
21391 (vceqq_f16): Likewise.
21393 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
21396 * config/arm/iterators.md (NEON_VACMP): Remove.
21397 * config/arm/neon.md (neon_vca<cmp_op><mode>): Use GLTE instead of GTGE
21399 (neon_vca<cmp_op><mode>_insn): Likewise.
21400 (neon_vca<cmp_op_unsp><mode>_insn_unspec): Use NEON_VAGLTE instead of
21403 2021-06-21 Richard Biener <rguenther@suse.de>
21405 PR tree-optimization/101121
21406 * tree-vect-slp.c (vect_build_slp_tree_2): Do not fail fatally
21407 when we just lack a stmt with the desired op when doing permutation.
21408 (vect_build_slp_tree): When caching a failed SLP build attempt
21409 assert that at least one lane is marked as not matching.
21411 2021-06-21 liuhongt <hongtao.liu@intel.com>
21414 * config/i386/i386.md (*anddi_1): Disparage slightly the mask
21415 register alternative.
21416 (*and<mode>_1): Ditto.
21418 (*andn<mode>_1): Ditto.
21419 (*<code><mode>_1): Ditto.
21420 (*<code>qi_1): Ditto.
21421 (*one_cmpl<mode>2_1): Ditto.
21422 (*one_cmplsi2_1_zext): Ditto.
21423 (*one_cmplqi2_1): Ditto.
21424 * config/i386/i386.c (x86_order_regs_for_local_alloc): Change
21425 the order of mask registers to be before general registers.
21427 2021-06-21 Roger Sayle <roger@nextmovesoftware.com>
21430 * config/i386/i386.md: New define_peephole2s to shrink writing
21431 1, 2 or 4 consecutive zeros to memory when optimizing for size.
21433 2021-06-18 Jeff Law <jeffreyalaw@gmail.com>
21435 * config/h8300/h8300.c (h8300_select_cc_mode): Handle SYMBOL_REF.
21436 * config/h8300/logical.md (<code><mode>3 logical expander): Generate
21437 more efficient code when the source can be trivially simplified.
21439 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
21441 * gimple-range-cache.cc (ranger_cache::range_of_def): Calculate
21442 a range if global is not available.
21443 (ranger_cache::entry_range): Fallback to range_of_def.
21444 * gimple-range-cache.h (range_of_def): Adjust prototype.
21446 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
21448 PR tree-optimization/101014
21449 * gimple-range-cache.cc (ranger_cache::ranger_cache): Remove poor
21451 (ranger_cache::~ranger_cache): Ditto.
21452 (ranger_cache::enable_new_values): Delete.
21453 (ranger_cache::push_poor_value): Delete.
21454 (ranger_cache::range_of_def): Remove poor value processing.
21455 (ranger_cache::entry_range): Ditto.
21456 (ranger_cache::fill_block_cache): Ditto.
21457 * gimple-range-cache.h (class ranger_cache): Remove poor value members.
21458 * gimple-range.cc (gimple_ranger::range_of_expr): Remove call.
21459 * gimple-range.h (class gimple_ranger): Adjust.
21461 2021-06-18 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
21464 * common/config/arm/arm-common.c (arm_canon_arch_option_1): New function
21465 derived from arm_canon_arch.
21466 (arm_canon_arch_option): Call it.
21467 (arm_canon_arch_multilib_option): New function.
21468 * config/arm/arm-cpus.in (IGNORE_FOR_MULTILIB): New fgroup.
21469 * config/arm/arm.h (arm_canon_arch_multilib_option): New prototype.
21470 (CANON_ARCH_MULTILIB_SPEC_FUNCTION): New macro.
21471 (MULTILIB_ARCH_CANONICAL_SPECS): New macro.
21472 (DRIVER_SELF_SPECS): Add MULTILIB_ARCH_CANONICAL_SPECS.
21473 * config/arm/arm.opt (mlibarch): New option.
21474 * config/arm/t-rmprofile (MULTILIB_MATCHES): For armv8*-m, replace use
21475 of march on RHS with mlibarch.
21477 2021-06-18 Marcel Vollweiler <marcel@codesourcery.com>
21479 * config.in: Regenerate.
21480 * config/gcn/gcn.c (print_operand_address): Fix for global_load assembler
21482 * configure: Regenerate.
21483 * configure.ac: Fix for global_load assembler functions.
21485 2021-06-18 Richard Biener <rguenther@suse.de>
21487 PR tree-optimization/101112
21488 * tree-vect-slp.c (vect_slp_linearize_chain): Fix condition
21489 to lookup a pattern stmt def.
21491 2021-06-18 Jakub Jelinek <jakub@redhat.com>
21493 PR middle-end/101062
21494 * stor-layout.c (finish_bitfield_layout): Don't add bitfield
21495 representatives in QUAL_UNION_TYPE.
21497 2021-06-18 Andrew Pinski <apinski@marvell.com>
21499 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
21500 Add counting of how many times it is done.
21501 (factor_out_conditional_conversion): Likewise.
21502 (match_simplify_replacement): Likewise.
21503 (value_replacement): Likewise.
21504 (spaceship_replacement): Likewise.
21505 (cond_store_replacement): Likewise.
21506 (cond_if_else_store_replacement_1): Likewise.
21507 (hoist_adjacent_loads): Likewise.
21509 2021-06-18 Andrew Pinski <apinski@marvell.com>
21511 * tree-cfg.c (verify_gimple_assign_unary): Reject pointer and offset
21512 types on NEGATE_EXPR, ABS_EXPR, BIT_NOT_EXPR, PAREN_EXPR and CONJ_EXPR.
21513 (verify_gimple_assign_binary): Reject pointer and offset types on
21514 MULT_EXPR, MULT_HIGHPART_EXPR, TRUNC_DIV_EXPR, CEIL_DIV_EXPR,
21515 FLOOR_DIV_EXPR, ROUND_DIV_EXPR, TRUNC_MOD_EXPR, CEIL_MOD_EXPR,
21516 FLOOR_MOD_EXPR, ROUND_MOD_EXPR, RDIV_EXPR, and EXACT_DIV_EXPR.
21518 2021-06-18 Michael Meissner <meissner@linux.ibm.com>
21520 * config/rs6000/rs6000.c (rs6000_emit_minmax): Add support for ISA
21521 3.1 IEEE 128-bit floating point xsmaxcqp/xsmincqp instructions.
21522 * config/rs6000/rs6000.md (s<minmax><mode>3, IEEE128 iterator):
21525 2021-06-17 Aaron Sawdey <acsawdey@linux.ibm.com>
21527 * config/rs6000/genfusion.pl (gen_logical_addsubf): Add
21528 earlyclobber to alts 0/1.
21529 (gen_addadd): Add earlyclobber to alts 0/1.
21530 * config/rs6000/fusion.md: Regenerate file.
21532 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21534 * cfgloopanal.c (get_loop_hot_path): Make path an auto_vec.
21536 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21538 * gimple-range-cache.cc: Comment cleanups.
21539 * gimple-range-gori.cc: Comment cleanups.
21540 * gimple-range.cc: Comment/spacing cleanups.
21541 * value-range.h: Comment cleanups.
21543 2021-06-17 H.J. Lu <hjl.tools@gmail.com>
21546 * calls.c (expand_call): Replace PUSH_ARGS with
21547 targetm.calls.push_argument (0).
21548 (emit_library_call_value_1): Likewise.
21549 * defaults.h (PUSH_ARGS): Removed.
21550 (PUSH_ARGS_REVERSED): Replace PUSH_ARGS with
21551 targetm.calls.push_argument (0).
21552 * expr.c (block_move_libcall_safe_for_call_parm): Likewise.
21553 (emit_push_insn): Pass the number of bytes to push to
21554 targetm.calls.push_argument and pass 0 if ARGS_ADDR is 0.
21555 * hooks.c (hook_bool_uint_true): New.
21556 * hooks.h (hook_bool_uint_true): Likewise.
21557 * rtlanal.c (nonzero_bits1): Replace PUSH_ARGS with
21558 targetm.calls.push_argument (0).
21559 * target.def (push_argument): Add a targetm.calls hook.
21560 * targhooks.c (default_push_argument): New.
21561 * targhooks.h (default_push_argument): Likewise.
21562 * config/bpf/bpf.h (PUSH_ARGS): Removed.
21563 * config/cr16/cr16.c (TARGET_PUSH_ARGUMENT): New.
21564 * config/cr16/cr16.h (PUSH_ARGS): Removed.
21565 * config/i386/i386.c (ix86_push_argument): New.
21566 (TARGET_PUSH_ARGUMENT): Likewise.
21567 * config/i386/i386.h (PUSH_ARGS): Removed.
21568 * config/m32c/m32c.c (TARGET_PUSH_ARGUMENT): New.
21569 * config/m32c/m32c.h (PUSH_ARGS): Removed.
21570 * config/nios2/nios2.h (PUSH_ARGS): Likewise.
21571 * config/pru/pru.h (PUSH_ARGS): Likewise.
21572 * doc/tm.texi.in: Remove PUSH_ARGS documentation. Add
21573 TARGET_PUSH_ARGUMENT hook.
21574 * doc/tm.texi: Regenerated.
21576 2021-06-17 Uroš Bizjak <ubizjak@gmail.com>
21579 * config/i386/i386-expand.c (expand_vector_set_var):
21580 Handle V2SF mode remapping. Pass TARGET_MMX_WITH_SSE to
21581 ix86_expand_vector_init_duplicate.
21582 (ix86_expand_vector_init_duplicate): Emit insv_1 for
21583 QImode for !TARGET_PARTIAL_REG_STALL.
21584 * config/i386/predicates.md (vec_setm_mmx_operand): New predicate.
21585 * config/i386/mmx.md (vec_setv2sf): Use vec_setm_mmx_operand
21586 as operand 2 predicate. Call ix86_expand_vector_set_var
21587 for non-constant index operand.
21588 (vec_setv2si): Ditto.
21589 (vec_setv4hi): Ditto.
21590 (vec_setv8qi): Ditto.
21592 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
21594 PR tree-optimization/100790
21595 * gimple-range.cc (range_of_builtin_call): Cleanup clz and ctz
21598 2021-06-17 Martin Liska <mliska@suse.cz>
21600 * doc/invoke.texi: Use consistently -O1 instead of -O.
21602 2021-06-17 Martin Liska <mliska@suse.cz>
21604 * gcov-io.h: Update documentation entry about string format.
21606 2021-06-17 Marius Hillenbrand <mhillen@linux.ibm.com>
21609 * config/s390/vecintrin.h (vec_doublee): Fix to use
21610 __builtin_s390_vflls.
21611 (vec_floate): Fix to use __builtin_s390_vflrd.
21613 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21615 * dominance.c (get_dominated_to_depth): Return auto_vec<basic_block>.
21616 * dominance.h (get_dominated_to_depth): Likewise.
21617 (get_all_dominated_blocks): Likewise.
21618 * cfgcleanup.c (delete_unreachable_blocks): Adjust.
21619 * gcse.c (hoist_code): Likewise.
21620 * tree-cfg.c (remove_edge_and_dominated_blocks): Likewise.
21621 * tree-parloops.c (oacc_entry_exit_ok): Likewise.
21622 * tree-ssa-dce.c (eliminate_unnecessary_stmts): Likewise.
21623 * tree-ssa-phiprop.c (pass_phiprop::execute): Likewise.
21625 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21627 * dominance.c (get_dominated_by_region): Return auto_vec<basic_block>.
21628 * dominance.h (get_dominated_by_region): Likewise.
21629 * tree-cfg.c (gimple_duplicate_sese_region): Adjust.
21630 (gimple_duplicate_sese_tail): Likewise.
21631 (move_sese_region_to_fn): Likewise.
21633 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21635 * dominance.c (get_dominated_by): Return auto_vec<basic_block>.
21636 * dominance.h (get_dominated_by): Likewise.
21637 * auto-profile.c (afdo_find_equiv_class): Adjust.
21638 * cfgloopmanip.c (duplicate_loop_to_header_edge): Likewise.
21639 * loop-unroll.c (unroll_loop_runtime_iterations): Likewise.
21640 * tree-cfg.c (test_linear_chain): Likewise.
21641 (test_diamond): Likewise.
21643 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21645 * cfgloop.h (get_loop_hot_path): Return auto_vec<basic_block>.
21646 * cfgloopanal.c (get_loop_hot_path): Likewise.
21647 * tree-ssa-loop-ivcanon.c (tree_estimate_loop_size): Likewise.
21649 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21651 * cgraph.c (cgraph_node::collect_callers): Return
21652 auto_vec<cgraph_edge *>.
21653 * cgraph.h (cgraph_node::collect_callers): Likewise.
21654 * ipa-cp.c (create_specialized_node): Adjust.
21655 (decide_about_value): Likewise.
21656 (decide_whether_version_node): Likewise.
21657 * ipa-sra.c (process_isra_node_results): Likewise.
21659 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21661 * vec.h (vl_ptr>::using_auto_storage): Handle null m_vec.
21662 (auto_vec<T, 0>::auto_vec): Define move constructor, and delete copy
21664 (auto_vec<T, 0>::operator=): Define move assignment and delete copy
21667 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
21669 * gimple-range.cc (debug_seed_ranger): New.
21670 (dump_ranger): New.
21671 (debug_ranger): New.
21673 2021-06-17 Richard Biener <rguenther@suse.de>
21675 PR tree-optimization/54400
21676 * tree-vectorizer.h (enum slp_instance_kind): Add
21677 slp_inst_kind_bb_reduc.
21678 (reduction_fn_for_scalar_code): Declare.
21679 * tree-vect-data-refs.c (vect_slp_analyze_instance_dependence):
21680 Check SLP_INSTANCE_KIND instead of looking at the
21682 (vect_slp_analyze_instance_alignment): Likewise.
21683 * tree-vect-loop.c (reduction_fn_for_scalar_code): Export.
21684 * tree-vect-slp.c (vect_slp_linearize_chain): Split out
21685 chain linearization from vect_build_slp_tree_2 and generalize
21686 for the use of BB reduction vectorization.
21687 (vect_build_slp_tree_2): Adjust accordingly.
21688 (vect_optimize_slp): Elide permutes at the root of BB reduction
21690 (vectorizable_bb_reduc_epilogue): New function.
21691 (vect_slp_prune_covered_roots): Likewise.
21692 (vect_slp_analyze_operations): Use them.
21693 (vect_slp_check_for_constructors): Recognize associatable
21694 chains for BB reduction vectorization.
21695 (vectorize_slp_instance_root_stmt): Generate code for the
21696 BB reduction epilogue.
21698 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21700 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Check with
21702 (gori_compute::may_recompute_p): New.
21703 (gori_compute::outgoing_edge_range_p): Perform recomputations.
21704 * gimple-range-gori.h (class gori_compute): Add prototype.
21706 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21708 * gimple-range-cache.cc (ranger_cache::range_on_edge): Always return
21709 true when a range can be calculated.
21710 * gimple-range.cc (gimple_ranger::dump_bb): Check has_edge_range_p.
21712 2021-06-16 Martin Sebor <msebor@redhat.com>
21714 * doc/invoke.texi (-Wmismatched-dealloc, -Wmismatched-new-delete):
21715 Correct documented defaults.
21717 2021-06-16 Andrew MacLeod <amacleod@redhat.com>
21719 * gimple-range-cache.cc (ranger_cache::ranger_cache): Initialize
21720 m_new_value_p directly.
21722 2021-06-16 Uroš Bizjak <ubizjak@gmail.com>
21725 * config/i386/i386-expand.c (expand_vec_perm_2perm_pblendv):
21726 Handle 64bit modes for TARGET_SSE4_1.
21727 (expand_vec_perm_pshufb2): Handle 64bit modes for TARGET_SSSE3.
21728 (expand_vec_perm_even_odd_pack): Handle V4HI mode.
21729 (expand_vec_perm_even_odd_1) <case E_V4HImode>: Expand via
21730 expand_vec_perm_pshufb2 for TARGET_SSSE3 and via
21731 expand_vec_perm_even_odd_pack for TARGET_SSE4_1.
21732 * config/i386/mmx.md (mmx_packusdw): New insn pattern.
21734 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21736 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn<mode>):
21737 Change to an expander that emits the correct instruction
21738 depending on endianness.
21739 (aarch64_<sur><addsub>hn<mode>_insn_le): Define.
21740 (aarch64_<sur><addsub>hn<mode>_insn_be): Define.
21742 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21744 * config/aarch64/aarch64-simd-builtins.def: Split generator
21745 for aarch64_<su>qmovn builtins into scalar and vector
21747 * config/aarch64/aarch64-simd.md (aarch64_<su>qmovn<mode>_insn_le):
21749 (aarch64_<su>qmovn<mode>_insn_be): Define.
21750 (aarch64_<su>qmovn<mode>): Split into scalar and vector
21751 variants. Change vector variant to an expander that emits the
21752 correct instruction depending on endianness.
21754 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21756 * config/aarch64/aarch64-simd-builtins.def: Split generator
21757 for aarch64_sqmovun builtins into scalar and vector variants.
21758 * config/aarch64/aarch64-simd.md (aarch64_sqmovun<mode>):
21759 Split into scalar and vector variants. Change vector variant
21760 to an expander that emits the correct instruction depending
21762 (aarch64_sqmovun<mode>_insn_le): Define.
21763 (aarch64_sqmovun<mode>_insn_be): Define.
21765 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21767 * config/aarch64/aarch64-simd.md (aarch64_xtn<mode>_insn_le):
21768 Define - modeling zero-high-half semantics.
21769 (aarch64_xtn<mode>): Change to an expander that emits the
21770 appropriate instruction depending on endianness.
21771 (aarch64_xtn<mode>_insn_be): Define - modeling zero-high-half
21773 (aarch64_xtn2<mode>_le): Rename to...
21774 (aarch64_xtn2<mode>_insn_le): This.
21775 (aarch64_xtn2<mode>_be): Rename to...
21776 (aarch64_xtn2<mode>_insn_be): This.
21777 (vec_pack_trunc_<mode>): Emit truncation instruction instead
21779 * config/aarch64/iterators.md (Vnarrowd): Add Vnarrowd mode
21780 attribute iterator.
21782 2021-06-16 Martin Jambor <mjambor@suse.cz>
21784 PR tree-optimization/100453
21785 * tree-sra.c (create_access): Disqualify any const candidates
21786 which are written to.
21787 (sra_modify_expr): Do not store sub-replacements back to a const base.
21788 (handle_unscalarized_data_in_subtree): Likewise.
21789 (sra_modify_assign): Likewise. Earlier, use TREE_READONLY test
21790 instead of constant_decl_p.
21792 2021-06-16 Jakub Jelinek <jakub@redhat.com>
21794 PR middle-end/101062
21795 * stor-layout.c (finish_bitfield_representative): For fields in unions
21796 assume nextf is always NULL.
21797 (finish_bitfield_layout): Compute bit field representatives also in
21798 unions, but handle it as if each bitfield was the only field in the
21801 2021-06-16 Richard Biener <rguenther@suse.de>
21803 PR tree-optimization/101088
21804 * tree-ssa-loop-im.c (sm_seq_valid_bb): Only look for
21805 supported refs on edges. Do not assert same ref but
21806 different kind stores are unsupported but mark them so.
21807 (hoist_memory_references): Only look for supported refs
21810 2021-06-16 Roger Sayle <roger@nextmovesoftware.com>
21812 PR rtl-optimization/46235
21813 * config/i386/i386.md: New define_split for bt followed by cmov.
21814 (*bt<mode>_setcqi): New define_insn_and_split for bt followed by setc.
21815 (*bt<mode>_setncqi): New define_insn_and_split for bt then setnc.
21816 (*bt<mode>_setnc<mode>): New define_insn_and_split for bt followed
21817 by setnc with zero extension.
21819 2021-06-16 Richard Biener <rguenther@suse.de>
21821 PR tree-optimization/101083
21822 * tree-vect-slp.c (vect_slp_build_two_operator_nodes): Get
21823 vectype as argument.
21824 (vect_build_slp_tree_2): Adjust.
21826 2021-06-15 Martin Sebor <msebor@redhat.com>
21828 PR middle-end/100876
21829 * builtins.c (gimple_call_return_array): Account for size_t
21830 mangling as either unsigned int or unsigned long.
21832 2021-06-15 Jeff Law <jeffreyalaw@gmail.com>
21834 * compare-elim.c (try_eliminate_compare): Run DCE to clean things
21835 up before eliminating comparisons.
21837 2021-06-15 Aldy Hernandez <aldyh@redhat.com>
21839 * range-op.cc (operator_bitwise_or::wi_fold): Make sure
21840 nonzero|X is nonzero.
21841 (range_op_bitwise_and_tests): Add tests for above.
21843 2021-06-15 Carl Love <cel@us.ibm.com>
21846 * config/rs6000/rs6000-builtin.def (VCMPEQUT): Fix the ICODE for the
21848 (VRLQ, VSLQ, VSRQ, VSRAQ): Remove unused BU_P10_OVERLOAD_2
21851 2021-06-15 Tobias Burnus <tobias@codesourcery.com>
21854 * gimplify.c (enum gimplify_defaultmap_kind): Add GDMK_SCALAR_TARGET.
21855 (struct gimplify_omp_ctx): Extend defaultmap array by one.
21856 (new_omp_context): Init defaultmap[GDMK_SCALAR_TARGET].
21857 (omp_notice_variable): Update type classification for Fortran.
21858 (gimplify_scan_omp_clauses): Update calls for new argument; handle
21859 GDMK_SCALAR_TARGET; for Fortran, GDMK_POINTER avoid GOVD_MAP_0LEN_ARRAY.
21860 * langhooks-def.h (lhd_omp_scalar_p): Add 'ptr_ok' argument.
21861 * langhooks.c (lhd_omp_scalar_p): Likewise.
21862 (LANG_HOOKS_OMP_ALLOCATABLE_P, LANG_HOOKS_OMP_SCALAR_TARGET_P): New.
21863 (LANG_HOOKS_DECLS): Add them.
21864 * langhooks.h (struct lang_hooks_for_decls): Add new hooks, update
21865 omp_scalar_p pointer type to include the new bool argument.
21867 2021-06-15 David Malcolm <dmalcolm@redhat.com>
21869 * doc/analyzer.texi
21870 (Special Functions for Debugging the Analyzer): Add
21871 __analyzer_dump_capacity.
21873 2021-06-15 Jakub Jelinek <jakub@redhat.com>
21876 * expr.c (expand_expr_real_2) <case VEC_PACK_FIX_TRUNC_EXPR,
21877 case VEC_PACK_TRUNC_EXPR>: Clear subtarget when changing mode.
21879 2021-06-15 Richard Biener <rguenther@suse.de>
21881 * cfgloopanal.c (mark_irreducible_loops): Use a dominance
21882 check to identify loop latches.
21883 * cfgloop.c (verify_loop_structure): Likewise.
21884 * loop-init.c (apply_loop_flags): Allow marked irreducible
21885 regions even with multiple latches.
21886 * predict.c (rebuild_frequencies): Simplify.
21888 2021-06-15 Richard Biener <rguenther@suse.de>
21890 * tree-ssa-threadupdate.c
21891 (jump_thread_path_registry::mark_threaded_blocks): Assert we
21892 have marked irreducible regions.
21894 2021-06-14 Martin Sebor <msebor@redhat.com>
21897 * builtins.c (gimple_call_return_array): Check for attribute fn spec.
21898 Handle calls to placement new.
21899 (ndecl_dealloc_argno): Avoid placement delete.
21901 2021-06-14 Peter Bergner <bergner@linux.ibm.com>
21904 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Use
21905 create_tmp_reg_or_ssa_name().
21907 2021-06-14 Andrew MacLeod <amacleod@redhat.com>
21909 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
21910 (ranger_cache::enable_new_values): Set to specified value and
21911 return the old value.
21912 (ranger_cache::disable_new_values): Delete.
21913 (ranger_cache::fill_block_cache): Disable non 1st order derived
21915 * gimple-range-cache.h (ranger_cache): Adjust prototypes.
21916 * gimple-range.cc (gimple_ranger::range_of_expr): Adjust.
21918 2021-06-14 Uroš Bizjak <ubizjak@gmail.com>
21921 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const):
21922 Return true early when testing with V2HImode.
21923 * config/i386/mmx.md (*punpckwd): Split to sse2_pshuflw_1.
21925 2021-06-14 Christophe Lyon <christophe.lyon@linaro.org>
21927 * config/arm/mve.md (mve_vec_unpack<US>_lo_<mode>): New pattern.
21928 (mve_vec_unpack<US>_hi_<mode>): New pattern.
21929 (@mve_vec_pack_trunc_lo_<mode>): New pattern.
21930 (mve_vmovntq_<supf><mode>): Prefix with '@'.
21931 * config/arm/neon.md (vec_unpack<US>_hi_<mode>): Move to
21933 (vec_unpack<US>_lo_<mode>): Likewise.
21934 (vec_pack_trunc_<mode>): Rename to
21935 neon_quad_vec_pack_trunc_<mode>.
21936 * config/arm/vec-common.md (vec_unpack<US>_hi_<mode>): New
21938 (vec_unpack<US>_lo_<mode>): New.
21939 (vec_pack_trunc_<mode>): New.
21941 2021-06-14 Richard Biener <rguenther@suse.de>
21943 PR tree-optimization/100934
21944 * tree-ssa-dom.c (pass_dominator::execute): Properly
21945 mark irreducible regions.
21947 2021-06-14 Martin Liska <mliska@suse.cz>
21949 * doc/invoke.texi: Put r{...} on the same line as @item.
21951 2021-06-14 Martin Liska <mliska@suse.cz>
21953 * doc/invoke.texi: Add missing newline.
21955 2021-06-14 Martin Liska <mliska@suse.cz>
21957 * doc/invoke.texi: Remove '+' characters.
21959 2021-06-14 Claudiu Zissulescu <claziss@synopsys.com>
21961 * config.gcc (arc): Add support for with_cpu option.
21962 * config/arc/arc.h (OPTION_DEFAULT_SPECS): Add fpu.
21964 2021-06-14 Richard Biener <rguenther@suse.de>
21966 PR tree-optimization/101031
21967 * tree-ssa-strlen.c (maybe_invalidate): Increment max_size
21968 instead of size when accounting for a possibly string
21971 2021-06-14 Martin Liska <mliska@suse.cz>
21973 * gimple-ssa-evrp.c (pointer_equiv_analyzer::~pointer_equiv_analyzer): Use delete[].
21975 2021-06-14 Aldy Hernandez <aldyh@redhat.com>
21977 * value-query.cc (gimple_range_global): Call get_range_global
21978 if called after inlining.
21980 2021-06-13 Uroš Bizjak <ubizjak@gmail.com>
21983 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
21984 Emit constant permutation insn directly from here.
21986 2021-06-13 Trevor Saunders <tbsaunde@tbsaunde.org>
21988 * attribs.c (find_attribute_namespace): Iterate over vec<> with
21990 * auto-profile.c (afdo_find_equiv_class): Likewise.
21991 * gcc.c (do_specs_vec): Likewise.
21992 (do_spec_1): Likewise.
21993 (driver::set_up_specs): Likewise.
21994 * gimple-loop-jam.c (any_access_function_variant_p): Likewise.
21995 * gimple-ssa-store-merging.c (compatible_load_p): Likewise.
21996 (imm_store_chain_info::try_coalesce_bswap): Likewise.
21997 (imm_store_chain_info::coalesce_immediate_stores): Likewise.
21998 (get_location_for_stmts): Likewise.
21999 * graphite-poly.c (print_iteration_domains): Likewise.
22000 (free_poly_bb): Likewise.
22001 (remove_gbbs_in_scop): Likewise.
22002 (free_scop): Likewise.
22003 (dump_gbb_cases): Likewise.
22004 (dump_gbb_conditions): Likewise.
22005 (print_pdrs): Likewise.
22006 (print_scop): Likewise.
22007 * ifcvt.c (cond_move_process_if_block): Likewise.
22008 * lower-subreg.c (decompose_multiword_subregs): Likewise.
22009 * regcprop.c (pass_cprop_hardreg::execute): Likewise.
22010 * sanopt.c (sanitize_rewrite_addressable_params): Likewise.
22011 * sel-sched-dump.c (dump_insn_vector): Likewise.
22012 * store-motion.c (store_ops_ok): Likewise.
22013 (store_killed_in_insn): Likewise.
22014 * timevar.c (timer::named_items::print): Likewise.
22015 * tree-cfgcleanup.c (cleanup_control_flow_pre): Likewise.
22016 (cleanup_tree_cfg_noloop): Likewise.
22017 * tree-data-ref.c (dump_data_references): Likewise.
22018 (print_dir_vectors): Likewise.
22019 (print_dist_vectors): Likewise.
22020 (dump_data_dependence_relations): Likewise.
22021 (dump_dist_dir_vectors): Likewise.
22022 (dump_ddrs): Likewise.
22023 (create_runtime_alias_checks): Likewise.
22024 (free_subscripts): Likewise.
22025 (save_dist_v): Likewise.
22026 (save_dir_v): Likewise.
22027 (invariant_access_functions): Likewise.
22028 (same_access_functions): Likewise.
22029 (access_functions_are_affine_or_constant_p): Likewise.
22030 (find_data_references_in_stmt): Likewise.
22031 (graphite_find_data_references_in_stmt): Likewise.
22032 (free_dependence_relations): Likewise.
22033 (free_data_refs): Likewise.
22034 * tree-inline.c (copy_debug_stmts): Likewise.
22035 * tree-into-ssa.c (dump_currdefs): Likewise.
22036 (rewrite_update_phi_arguments): Likewise.
22037 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
22038 * tree-vect-data-refs.c (vect_analyze_possibly_independent_ddr):
22040 (vect_slp_analyze_node_dependences): Likewise.
22041 (vect_slp_analyze_instance_dependence): Likewise.
22042 (vect_record_base_alignments): Likewise.
22043 (vect_get_peeling_costs_all_drs): Likewise.
22044 (vect_peeling_supportable): Likewise.
22045 * tree-vectorizer.c (vec_info::~vec_info): Likewise.
22046 (vec_info::free_stmt_vec_infos): Likewise.
22048 2021-06-13 Jeff Law <jeffreyalaw@gmail.com>
22050 * config/h8300/logical.md (<code>qi3_1<cczn>): New pattern.
22051 (andqi3_1<cczn>): Removed.
22052 (<ors>qi3_1): Do not split for IOR/XOR a single bit.
22053 (H8/SX bit logicals): Split out from other patterns.
22054 * config/h8300/multiply.md (mulqihi3_const<cczn>): Renamed from
22055 mulqihi3_const_clobber_flags.
22056 (mulqihi3<cczn>, mulhisi3_const<cczn>, mulhisi3<cczn>): Similarly
22058 2021-06-13 H.J. Lu <hjl.tools@gmail.com>
22061 * config/i386/i386.c (ix86_expand_prologue): Set red_zone_used
22062 to true if red zone is used.
22063 (ix86_output_indirect_jmp): Replace ix86_red_zone_size with
22064 ix86_red_zone_used.
22065 * config/i386/i386.h (machine_function): Add red_zone_used.
22066 (ix86_red_zone_size): Removed.
22067 (ix86_red_zone_used): New.
22068 * config/i386/i386.md (peephole2 patterns): Replace
22069 ix86_red_zone_size with ix86_red_zone_used.
22071 2021-06-12 Jason Merrill <jason@redhat.com>
22073 * doc/extend.texi (unused variable attribute): Applies to
22074 structure fields as well.
22076 2021-06-12 Eugene Rozenfeld <erozen@microsoft.com>
22078 * auto-profile.c (read_profile): Fix a typo in an error string.
22080 2021-06-11 Thomas Schwinge <thomas@codesourcery.com>
22082 * tree-pretty-print.h (dump_omp_clauses): Add 'bool = true'
22084 * tree-pretty-print.c (dump_omp_clauses): Update.
22085 (dump_generic_node) <OMP_CLAUSE>: Use it.
22087 2021-06-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
22090 * config/arm/arm_mve.h (__arm_vld1q): Change __ARM_mve_coerce(p0,
22091 int8_t const *) to __ARM_mve_coerce1(p0, int8_t *) in the argument for
22092 the polymorphic variants matching code.
22093 (__arm_vld1q_z): Likewise.
22094 (__arm_vld2q): Likewise.
22095 (__arm_vld4q): Likewise.
22096 (__arm_vldrbq_gather_offset): Likewise.
22097 (__arm_vldrbq_gather_offset_z): Likewise.
22099 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
22101 PR tree-optimization/96392
22102 * fold-const.h (tree_expr_maybe_real_minus_zero_p): Fix prototype.
22104 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
22106 PR tree-optimization/96392
22107 * fold-const.c (fold_real_zero_addition_p): Take both arguments
22108 of the addition or subtraction, not just the zero. Use this
22109 other argument in tests for signaling NaNs and signed zeros.
22110 (tree_expr_maybe_real_minus_zero_p): New predicate.
22111 * fold-const.h (fold_real_zero_addition_p): Update prototype.
22112 (tree_expr_maybe_real_minus_zero_p): New function prototype.
22113 * match.pd: Update calls to fold_real_zero_addition_p.
22114 Replace HONOR_NANS with tree_expr_maybe_nan_p.
22115 Replace HONOR_SIGNED_ZEROS with tree_expr_maybe_real_minus_zero_p.
22116 Replace HONOR_SNANS with tree_expr_maybe_signaling_nan_p.
22117 * tree-ssa-reassoc.c (eliminate_using_constants): Update
22118 call to fold_real_zero_addition_p.
22120 2021-06-11 Richard Biener <rguenther@suse.de>
22122 PR tree-optimization/101025
22123 * tree-ssa-loop-im.c (sm_seq_valid_bb): Make sure to process
22124 all refs that require dependence checking.
22126 2021-06-11 Richard Biener <rguenther@suse.de>
22128 PR tree-optimization/101028
22129 * tree-vect-slp.c (vect_build_slp_tree_2): When SLP
22130 reassoc discovery fails fatally, mark appropriate lanes
22133 2021-06-11 Richard Biener <rguenther@suse.de>
22135 PR tree-optimization/101026
22136 * tree-vect-slp.c (vect_build_slp_tree_2): Make sure we
22137 have a representative for the associated chain nodes.
22139 2021-06-11 Jakub Jelinek <jakub@redhat.com>
22141 PR rtl-optimization/101008
22142 * simplify-rtx.c (relational_result): New function.
22143 (simplify_logical_relational_operation,
22144 simplify_relational_operation): Use it.
22146 2021-06-11 Jakub Jelinek <jakub@redhat.com>
22149 * config/i386/sse.md (*vec_concat<mode>_0_1): Require TARGET_SSE2.
22151 2021-06-11 Uroš Bizjak <ubizjak@gmail.com>
22154 * config/i386/i386-expand.c (expand_vec_perm_pshufb): Return
22155 false if the permutation can be implemented with constant
22156 permutation instruction in wider mode.
22157 (canonicalize_vector_int_perm): Move above expand_vec_perm_pshufb.
22158 Handle V8QImode and V4HImode.
22160 2021-06-11 Martin Liska <mliska@suse.cz>
22162 PR gcov-profile/100788
22163 * common.opt: Add new option.
22164 * coverage.c (coverage_begin_function): Emit warning instead of
22165 the internal compiler error.
22166 * doc/invoke.texi: Document the option.
22167 * toplev.c (process_options): Enable it by default.
22169 2021-06-11 Richard Biener <rguenther@suse.de>
22171 PR middle-end/101009
22172 * tree-data-ref.c (build_classic_dist_vector_1): Make sure
22173 to set *init_b to true when we encounter a constant equal
22175 (compute_affine_dependence): Also dump the actual DR_REF.
22177 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
22179 PR tree-optimization/100984
22180 * gimple-ssa-evrp.c (ssa_equiv_stack): Use auto_vec for
22181 replacements table.
22182 (ssa_equiv_stack::~ssa_equiv_stack): Remove.
22184 2021-06-11 Kewen Lin <linkw@linux.ibm.com>
22186 * config/rs6000/rs6000.md
22187 (floatsi<SFDF:mode>2_lfiwax_<QHI:mode>_mem_zext): New
22188 define_insn_and_split.
22190 2021-06-11 Richard Biener <rguenther@suse.de>
22192 * tree-vect-slp.c (vect_build_slp_tree_2): Use stablesort
22193 to sort operands of the associative chain.
22195 2021-06-11 Richard Biener <rguenther@suse.de>
22197 * system.h (gcc_stablesort_r): Declare.
22198 * sort.cc (gcc_sort_r): Support stable sort.
22199 (gcc_stablesort_r): Define.
22200 * vec.h (vec<>::stablesort): Add.
22202 2021-06-10 Uroš Bizjak <ubizjak@gmail.com>
22205 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
22206 Handle V2SF mode. Emit SHUFPS to fixup unpack-high for V2SF mode.
22207 (expand_vec_perm_blend): Handle 64bit modes for TARGET_SSE4_1.
22208 (expand_vec_perm_pshufb): Handle 64bit modes for TARGET_SSSE3.
22209 (expand_vec_perm_pblendv): Handle 64bit modes for TARGET_SSE4_1.
22210 (expand_vec_perm_interleave2): Handle 64bit modes.
22211 (expand_vec_perm_even_odd_pack): Handle V8QI mode.
22212 (expand_vec_perm_even_odd_1): Ditto.
22213 (ix86_vectorize_vec_perm_const): Ditto.
22214 * config/i386/i386.md (UNSPEC_PSHUFB): Move from ...
22215 * config/i386/sse.md: ... here.
22216 * config/i386/mmx.md (*vec_interleave_lowv2sf):
22217 New insn_and_split pattern.
22218 (*vec_interleave_highv2sf): Ditto.
22219 (mmx_pshufbv8qi3): New insn pattern.
22220 (*mmx_pblendw): Ditto.
22222 2021-06-10 Peter Bergner <bergner@linux.ibm.com>
22224 * config/rs6000/rs6000-builtin.def (build_pair): New built-in.
22225 (build_acc): Likewise.
22226 * config/rs6000/rs6000-call.c (mma_expand_builtin): Swap assemble
22227 source operands in little-endian mode.
22228 (rs6000_gimple_fold_mma_builtin): Handle VSX_BUILTIN_BUILD_PAIR.
22229 (mma_init_builtins): Likewise.
22230 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Handle endianness
22231 ordering for the MMA assemble and build source operands.
22232 * doc/extend.texi (__builtin_vsx_build_acc, __builtin_mma_build_pair):
22234 (__builtin_mma_assemble_acc, __builtin_mma_assemble_pair): Remove
22237 2021-06-10 Jeff Law <jeffreyalaw@gmail.com>
22239 * config/h8300/h8300.c (select_cc_mode): Handle MEM. Use
22241 * config/h8300/extensions.md: Replace _clobber_flags patterns
22244 2021-06-10 Robin Dapp <rdapp@linux.ibm.com>
22246 * config/s390/vector.md (vcond_mask_<mode><mode>): Change to
22247 (vcond_mask_<mode><tointvec>): this.
22249 2021-06-10 Andrew Stubbs <ams@codesourcery.com>
22250 Thomas Schwinge <thomas@codesourcery.com>
22252 * omp-builtins.def (BUILT_IN_GOACC_ENTER_EXIT_DATA): Split into...
22253 (BUILT_IN_GOACC_ENTER_DATA, BUILT_IN_GOACC_EXIT_DATA): ... these.
22254 * gimple.h (enum gf_mask): Split
22255 'GF_OMP_TARGET_KIND_OACC_ENTER_EXIT_DATA' into
22256 'GF_OMP_TARGET_KIND_OACC_ENTER_DATA' and
22257 'GF_OMP_TARGET_KIND_OACC_EXIT_DATA'.
22258 (is_gimple_omp_oacc): Update.
22259 * gimple-pretty-print.c (dump_gimple_omp_target): Likewise.
22260 * gimplify.c (gimplify_omp_target_update): Likewise.
22261 * omp-expand.c (expand_omp_target, build_omp_regions_1)
22262 (omp_make_gimple_edges): Likewise.
22263 * omp-low.c (check_omp_nesting_restrictions, lower_omp_target):
22266 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
22268 * value-query.cc (value_query::value_on_edge): Rename name to
22270 (range_query::range_on_edge): Same.
22271 (range_query::value_of_expr): Same.
22272 (range_query::value_on_edge): Same.
22273 * value-query.h (class value_query): Same.
22274 (class range_query): Same.
22276 2021-06-10 Richard Biener <rguenther@suse.de>
22278 PR tree-optimization/101003
22279 * tree-vect-slp.c (vect_build_slp_tree_2): Appropriately
22280 use the pattern stmt defs when linearizing a chain.
22282 2021-06-10 Jakub Jelinek <jakub@redhat.com>
22285 * ifcvt.c (noce_get_alt_condition, noce_try_abs): Use
22286 prev_nonnote_nondebug_insn instead of prev_nonnote_insn.
22288 2021-06-10 Clement Chigot <clement.chigot@atos.net>
22290 * config/rs6000/aix71.h (ASM_CPU_SPEC): Add Power10 directive.
22291 * config/rs6000/aix72.h (ASM_CPU_SPEC): Likewise.
22293 2021-06-09 Andrew Pinski <apinski@marvell.com>
22295 PR tree-optimization/100925
22296 * match.pd (a ? CST1 : CST2): Limit transformations
22297 that would produce a negative to integral types only.
22298 Change !POINTER_TYPE_P to INTEGRAL_TYPE_P also.
22300 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
22303 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
22305 * doc/tm.texi: Correctly update.
22307 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
22309 * doc/tm.texi: Correctly update.
22311 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
22314 * doc/tm.texi.in (Trampolines): Add a missing blank line.
22316 2021-06-09 Paul Eggert <eggert@cs.ucla.edu>
22319 * doc/invoke.texi (Code Gen Options): Document that -fno-trampolines
22320 and -ftrampolines work only with Ada.
22321 * doc/tm.texi.in (Trampolines): Likewise.
22322 * doc/tm.texi: Regenerated.
22324 2021-06-09 Carl Love <cel@us.ibm.com>
22326 * config/rs6000/altivec.h (vec_signextll, vec_signexti, vec_signextq):
22327 Add define for new builtins.
22328 * config/rs6000/altivec.md (altivec_vreveti2): Add define_expand.
22329 * config/rs6000/rs6000-builtin.def (VSIGNEXTI, VSIGNEXTLL): Add
22330 overloaded builtin definitions.
22331 (VSIGNEXTSB2W, VSIGNEXTSH2W, VSIGNEXTSB2D, VSIGNEXTSH2D, VSIGNEXTSW2D,
22332 VSIGNEXTSD2Q): Add builtin expansions.
22333 (SIGNEXT): Add P10 overload definition.
22334 * config/rs6000/rs6000-call.c (P9V_BUILTIN_VEC_VSIGNEXTI, P9V_BUILTIN_VEC_VSIGNEXTLL,
22335 P10_BUILTIN_VEC_SIGNEXT): Add overloaded argument definitions.
22336 * config/rs6000/vsx.md (vsx_sign_extend_v2di_v1ti): Add define_insn.
22337 (vsignextend_v2di_v1ti, vsignextend_qi_<mode>, vsignextend_hi_<mode>,
22338 vsignextend_si_v2di)[VIlong]: Add define_expand.
22339 Make define_insn vsx_sign_extend_si_v2di visible.
22340 * doc/extend.texi: Add documentation for the vec_signexti,
22341 vec_signextll builtins and vec_signextq.
22343 2021-06-09 Carl Love <cel@us.ibm.com>
22345 * config/rs6000/rs6000.c (__fixkfti, __fixunskfti, __floattikf,
22346 __floatuntikf): Names changed to __fixkfti_sw, __fixunskfti_sw,
22347 __floattikf_sw, __floatuntikf_sw respectively.
22348 * config/rs6000/rs6000.md (floatti<mode>2, floatunsti<mode>2,
22349 fix_trunc<mode>ti2, fixuns_trunc<mode>ti2): Add
22350 define_insn for mode IEEE 128.
22352 2021-06-09 Carl Love <cel@us.ibm.com>
22354 * config/rs6000/altivec.md (altivec_vslq, altivec_vsrq):
22355 Rename to altivec_vslq_<mode>, altivec_vsrq_<mode>, mode VEC_TI.
22356 * config/rs6000/vector.md (VEC_TI): Was named VSX_TI in vsx.md.
22357 (vashlv1ti3): Change to vashl<mode>3, mode VEC_TI.
22358 (vlshrv1ti3): Change to vlshr<mode>3, mode VEC_TI.
22359 * config/rs6000/vsx.md (VSX_TI): Remove define_mode_iterator. Update
22360 uses of VSX_TI to VEC_TI.
22362 2021-06-09 Carl Love <cel@us.ibm.com>
22364 * config/rs6000/dfp.md (floattitd2, fixtdti2): New define_insns.
22366 2021-06-09 Carl Love <cel@us.ibm.com>
22368 * config/rs6000/altivec.h (vec_dive, vec_mod): Add define for new
22370 * config/rs6000/altivec.md (UNSPEC_VMULEUD, UNSPEC_VMULESD,
22371 UNSPEC_VMULOUD, UNSPEC_VMULOSD): New unspecs.
22372 (altivec_eqv1ti, altivec_gtv1ti, altivec_gtuv1ti, altivec_vmuleud,
22373 altivec_vmuloud, altivec_vmulesd, altivec_vmulosd, altivec_vrlq,
22374 altivec_vrlqmi, altivec_vrlqmi_inst, altivec_vrlqnm,
22375 altivec_vrlqnm_inst, altivec_vslq, altivec_vsrq, altivec_vsraq,
22376 altivec_vcmpequt_p, altivec_vcmpgtst_p, altivec_vcmpgtut_p): New
22378 (vec_widen_umult_even_v2di, vec_widen_smult_even_v2di,
22379 vec_widen_umult_odd_v2di, vec_widen_smult_odd_v2di, altivec_vrlqmi,
22380 altivec_vrlqnm): New define_expands.
22381 * config/rs6000/rs6000-builtin.def (VCMPEQUT_P, VCMPGTST_P,
22382 VCMPGTUT_P): Add macro expansions.
22383 (BU_P10V_AV_P): Add builtin predicate definition.
22384 (VCMPGTUT, VCMPGTST, VCMPEQUT, CMPNET, CMPGE_1TI,
22385 CMPGE_U1TI, CMPLE_1TI, CMPLE_U1TI, VNOR_V1TI_UNS, VNOR_V1TI, VCMPNET_P,
22386 VCMPAET_P, VMULEUD, VMULESD, VMULOUD, VMULOSD, VRLQ,
22387 VSLQ, VSRQ, VSRAQ, VRLQNM, DIV_V1TI, UDIV_V1TI, DIVES_V1TI, DIVEU_V1TI,
22388 MODS_V1TI, MODU_V1TI, VRLQMI): New macro expansions.
22389 (VRLQ, VSLQ, VSRQ, VSRAQ, DIVE, MOD): New overload expansions.
22390 * config/rs6000/rs6000-call.c (P10_BUILTIN_VCMPEQUT,
22391 P10V_BUILTIN_CMPGE_1TI, P10V_BUILTIN_CMPGE_U1TI,
22392 P10V_BUILTIN_VCMPGTUT, P10V_BUILTIN_VCMPGTST,
22393 P10V_BUILTIN_CMPLE_1TI, P10V_BUILTIN_VCMPLE_U1TI,
22394 P10V_BUILTIN_DIV_V1TI, P10V_BUILTIN_UDIV_V1TI,
22395 P10V_BUILTIN_VMULESD, P10V_BUILTIN_VMULEUD,
22396 P10V_BUILTIN_VMULOSD, P10V_BUILTIN_VMULOUD,
22397 P10V_BUILTIN_VNOR_V1TI, P10V_BUILTIN_VNOR_V1TI_UNS,
22398 P10V_BUILTIN_VRLQ, P10V_BUILTIN_VRLQMI,
22399 P10V_BUILTIN_VRLQNM, P10V_BUILTIN_VSLQ,
22400 P10V_BUILTIN_VSRQ, P10V_BUILTIN_VSRAQ,
22401 P10V_BUILTIN_VCMPGTUT_P, P10V_BUILTIN_VCMPGTST_P,
22402 P10V_BUILTIN_VCMPEQUT_P, P10V_BUILTIN_VCMPGTUT_P,
22403 P10V_BUILTIN_VCMPGTST_P, P10V_BUILTIN_CMPNET,
22404 P10V_BUILTIN_VCMPNET_P, P10V_BUILTIN_VCMPAET_P,
22405 P10V_BUILTIN_DIVES_V1TI, P10V_BUILTIN_MODS_V1TI,
22406 P10V_BUILTIN_MODU_V1TI):
22407 New overloaded definitions.
22408 (rs6000_gimple_fold_builtin) [P10V_BUILTIN_VCMPEQUT,
22409 P10V_BUILTIN_CMPNET, P10V_BUILTIN_CMPGE_1TI,
22410 P10V_BUILTIN_CMPGE_U1TI, P10V_BUILTIN_VCMPGTUT,
22411 P10V_BUILTIN_VCMPGTST, P10V_BUILTIN_CMPLE_1TI,
22412 P10V_BUILTIN_CMPLE_U1TI]: New case statements.
22413 (rs6000_init_builtins) [bool_V1TI_type_node, int_ftype_int_v1ti_v1ti]:
22415 (altivec_init_builtins): New E_V1TImode case statement.
22416 (builtin_function_type)[P10_BUILTIN_128BIT_VMULEUD,
22417 P10_BUILTIN_128BIT_VMULOUD, P10_BUILTIN_128BIT_DIVEU_V1TI,
22418 P10_BUILTIN_128BIT_MODU_V1TI, P10_BUILTIN_CMPGE_U1TI,
22419 P10_BUILTIN_VCMPGTUT, P10_BUILTIN_VCMPEQUT]: New case statements.
22420 * config/rs6000/rs6000.c (rs6000_handle_altivec_attribute) [E_TImode,
22421 E_V1TImode]: New case statements.
22422 * config/rs6000/rs6000.h (rs6000_builtin_type_index): New enum
22423 value RS6000_BTI_bool_V1TI.
22424 * config/rs6000/vector.md (vector_gtv1ti, vector_nltv1ti,
22425 vector_gtuv1ti, vector_nltuv1ti, vector_ngtv1ti, vector_ngtuv1ti,
22426 vector_eq_v1ti_p, vector_ne_v1ti_p, vector_ae_v1ti_p,
22427 vector_gt_v1ti_p, vector_gtu_v1ti_p, vrotlv1ti3, vashlv1ti3,
22428 vlshrv1ti3, vashrv1ti3): New define_expands.
22429 * config/rs6000/vsx.md (UNSPEC_VSX_DIVSQ, UNSPEC_VSX_DIVUQ,
22430 UNSPEC_VSX_DIVESQ, UNSPEC_VSX_DIVEUQ, UNSPEC_VSX_MODSQ,
22431 UNSPEC_VSX_MODUQ): New unspecs.
22432 (mulv2di3, vsx_div_v1ti, vsx_udiv_v1ti, vsx_dives_v1ti,
22433 vsx_diveu_v1ti, vsx_mods_v1ti, vsx_modu_v1ti, xxswapd_v1ti): New
22435 (vcmpnet): New define_expand.
22436 * doc/extend.texi: Add documentation for the new builtins vec_rl,
22437 vec_rlmi, vec_rlnm, vec_sl, vec_sr, vec_sra, vec_mule, vec_mulo,
22438 vec_div, vec_dive, vec_mod, vec_cmpeq, vec_cmpne, vec_cmpgt, vec_cmplt,
22439 vec_cmpge, vec_cmple, vec_all_eq, vec_all_ne, vec_all_gt, vec_all_lt,
22440 vec_all_ge, vec_all_le, vec_any_eq, vec_any_ne, vec_any_gt, vec_any_lt,
22441 vec_any_ge, vec_any_le.
22443 2021-06-09 Carl Love <cel@us.ibm.com>
22445 * config/rs6000/altivec.md (altivec_vrl<VI_char>mi): Fix
22446 bug in argument generation.
22448 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
22450 * config/arm/iterators.md (<supf>): Remove VCLZQ_U, VCLZQ_S.
22452 * config/arm/mve.md (mve_vclzq_<supf><mode>): Add '@' prefix,
22453 remove <supf> iterator.
22454 (mve_vclzq_u<mode>): New.
22455 * config/arm/neon.md (clz<mode>2): Rename to neon_vclz<mode>.
22456 (neon_vclz<mode>): Move to ...
22457 * config/arm/unspecs.md (VCLZQ_U, VCLZQ_S): Remove.
22458 * config/arm/vec-common.md: ... here. Add support for MVE.
22460 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
22462 * config/arm/mve.md (mve_vhaddq_<supf><mode>): Prefix with '@'.
22463 (@mve_vrhaddq_<supf><mode>): Likewise.
22464 * config/arm/neon.md (neon_v<r>hadd<sup><mode>): Likewise.
22465 * config/arm/vec-common.md (avg<mode>3_floor, uavg<mode>3_floor)
22466 (avg<mode>3_ceil, uavg<mode>3_ceil): New patterns.
22468 2021-06-09 imba-tjd <109224573@qq.com>
22470 * doc/invoke.texi: Fix typo.
22472 2021-06-09 Roger Sayle <roger@nextmovesoftware.com>
22474 PR middle-end/53267
22475 * fold-const-call.c (fold_const_call_sss) [CASE_CFN_FMOD]:
22476 Support evaluation of fmod/fmodf/fmodl at compile-time.
22478 2021-06-09 Richard Biener <rguenther@suse.de>
22480 PR tree-optimization/100981
22481 * tree-vect-loop.c (vect_create_epilog_for_reduction): Use
22482 gimple_get_lhs to also handle calls.
22483 * tree-vect-slp-patterns.c (complex_pattern::build): Transfer
22486 2021-06-09 Richard Biener <rguenther@suse.de>
22488 PR tree-optimization/97832
22489 * tree-vectorizer.h (_slp_tree::failed): New.
22490 * tree-vect-slp.c (_slp_tree::_slp_tree): Initialize
22492 (_slp_tree::~_slp_tree): Free failed.
22493 (vect_build_slp_tree): Retain failed nodes and record
22494 matches in them, copying that back out when running
22495 into a cached fail. Dump start and end of discovery.
22496 (dt_sort_cmp): New.
22497 (vect_build_slp_tree_2): Handle associatable chains
22498 together doing more aggressive operand swapping.
22500 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
22503 * config.gcc (gcc_cv_initfini_array): Set to yes for Linux and
22505 * doc/install.texi: Require glibc 2.1 and binutils 2.12 for
22506 Linux and GNU targets.
22508 2021-06-09 Richard Biener <rguenther@suse.de>
22510 * tree-vect-stmts.c (vect_is_simple_use): Always get dt
22513 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
22515 * config/arc/arc.md (loop_end): Change it to
22516 define_insn_and_split.
22518 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
22520 * config/arc/arc.md (maddhisi4): Use VMAC2H instruction.
22521 (machi): New pattern.
22522 (umaddhisi4): Use VMAC2HU instruction.
22523 (umachi): New pattern.
22525 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
22527 * config/arc/arc-protos.h (arc_split_move_p): New prototype.
22528 * config/arc/arc.c (arc_split_move_p): New function.
22529 (arc_split_move): Clean up.
22530 * config/arc/arc.md (movdi_insn): Clean up, use arc_split_move_p.
22531 (movdf_insn): Likewise.
22532 * config/arc/simdext.md (mov<VWH>_insn): Likewise.
22534 2021-06-09 Uroš Bizjak <ubizjak@gmail.com>
22537 * config/i386/i386.c (print_operand_address_as): Rename "no_rip"
22538 argument to "raw". Do not emit segment overrides when "raw" is true.
22540 2021-06-09 Martin Liska <mliska@suse.cz>
22542 * doc/gcov.texi: Create proper JSON files.
22543 * doc/invoke.texi: Remove dots in order to make it a valid
22546 2021-06-09 Xionghu Luo <luoxhu@linux.ibm.com>
22548 * config/rs6000/rs6000-p8swap.c (pattern_is_rotate64): New.
22549 (insn_is_load_p): Use pattern_is_rotate64.
22550 (insn_is_swap_p): Likewise.
22551 (quad_aligned_load_p): Likewise.
22552 (const_load_sequence_p): Likewise.
22553 (replace_swapped_aligned_load): Likewise.
22554 (recombine_lvx_pattern): Likewise.
22555 (recombine_stvx_pattern): Likewise.
22557 2021-06-09 Andrew MacLeod <amacleod@redhat.com>
22559 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Use a
22560 fur_stmt source record.
22561 * gimple-range.cc (fur_source::get_operand): Generic range query.
22562 (fur_source::get_phi_operand): New.
22563 (fur_source::register_dependency): New.
22564 (fur_source::query): New.
22565 (class fur_edge): New. Edge source for operands.
22566 (fur_edge::fur_edge): New.
22567 (fur_edge::get_operand): New.
22568 (fur_edge::get_phi_operand): New.
22569 (fur_edge::query): New.
22570 (fur_stmt::fur_stmt): New.
22571 (fur_stmt::get_operand): New.
22572 (fur_stmt::get_phi_operand): New.
22573 (fur_stmt::query): New.
22574 (class fur_depend): New. Statement source and process dependencies.
22575 (fur_depend::fur_depend): New.
22576 (fur_depend::register_dependency): New.
22577 (class fur_list): New. List source for operands.
22578 (fur_list::fur_list): New.
22579 (fur_list::get_operand): New.
22580 (fur_list::get_phi_operand): New.
22581 (fold_range): New. Instantiate appropriate fur_source class and fold.
22582 (fold_using_range::range_of_range_op): Use new API.
22583 (fold_using_range::range_of_address): Ditto.
22584 (fold_using_range::range_of_phi): Ditto.
22585 (gimple_ranger::fold_range_internal): Use fur_depend class.
22586 (fold_using_range::range_of_ssa_name_with_loop_info): Use new API.
22587 * gimple-range.h (class fur_source): Now a base class.
22588 (class fur_stmt): New.
22589 (fold_range): New prototypes.
22590 (fur_source::fur_source): Delete.
22592 2021-06-08 Andrew Pinski <apinski@marvell.com>
22594 PR tree-optimization/25290
22595 * tree-ssa-phiopt.c (xor_replacement): Delete.
22596 (tree_ssa_phiopt_worker): Delete use of xor_replacement.
22597 (match_simplify_replacement): Allow one cheap preparation
22598 statement that can be moved to before the if.
22600 2021-06-08 Pat Haugen <pthaugen@linux.ibm.com>
22602 * config/rs6000/power10.md (power10-fused-load, power10-fused-store,
22603 power10-fused_alu, power10-fused-vec, power10-fused-branch): New.
22605 2021-06-08 Jeff Law <jeffreyalaw@gmail.com>
22607 * config/h8300/logical.md (andqi3_1): Move BCLR case into define_insn_and_split.
22608 Create length attribute on define_insn_and_split. Only split for cases which we
22610 (andqi3_1<cczn>): Renamed from andqi3_1_clobber_flags. Only handle AND here and
22611 fix length computation.
22612 (b<code><mode>msx): Combine QImode and HImode H8/SX patterns using iterator.
22614 2021-06-08 Richard Biener <rguenther@suse.de>
22616 PR tree-optimization/100923
22617 * tree-ssa-sccvn.c (valueize_refs_1): Take a pointer to
22618 the operand vector to be valueized.
22619 (valueize_refs): Likewise.
22620 (valueize_shared_reference_ops_from_ref): Adjust.
22621 (valueize_shared_reference_ops_from_call): Likewise.
22622 (vn_reference_lookup_3): Likewise.
22623 (vn_reference_lookup_pieces): Likewise. Re-valueize
22624 with honoring availability when we are about to create
22625 the ao_ref and valueized before.
22626 (vn_reference_lookup): Likewise.
22627 (vn_reference_insert_pieces): Adjust.
22629 2021-06-08 Richard Biener <rguenther@suse.de>
22631 * tree-vectorizer.h (_slp_instance::root_stmt): Change to...
22632 (_slp_instance::root_stmts): ... a vector.
22633 (SLP_INSTANCE_ROOT_STMT): Rename to ...
22634 (SLP_INSTANCE_ROOT_STMTS): ... this.
22635 (slp_root::root): Change to...
22636 (slp_root::roots): ... a vector.
22637 (slp_root::slp_root): Adjust.
22638 * tree-vect-slp.c (_slp_instance::location): Adjust.
22639 (vect_free_slp_instance): Release the root stmt vector.
22640 (vect_build_slp_instance): Adjust.
22641 (vect_analyze_slp): Likewise.
22642 (_bb_vec_info::~_bb_vec_info): Likewise.
22643 (vect_slp_analyze_operations): Likewise.
22644 (vect_bb_vectorization_profitable_p): Likewise. Adjust
22645 costs for the root stmt.
22646 (vect_slp_check_for_constructors): Gather all BIT_INSERT_EXPRs
22648 (vect_slp_analyze_bb_1): Simplify by marking all root stmts
22650 (vectorize_slp_instance_root_stmt): Adjust.
22651 (vect_schedule_slp): Likewise.
22653 2021-06-08 Aldy Hernandez <aldyh@redhat.com>
22655 * gimple-ssa-evrp.c (class ssa_equiv_stack): New.
22656 (ssa_equiv_stack::ssa_equiv_stack): New.
22657 (ssa_equiv_stack::~ssa_equiv_stack): New.
22658 (ssa_equiv_stack::enter): New.
22659 (ssa_equiv_stack::leave): New.
22660 (ssa_equiv_stack::push_replacement): New.
22661 (ssa_equiv_stack::get_replacement): New.
22662 (is_pointer_ssa): New.
22663 (class pointer_equiv_analyzer): New.
22664 (pointer_equiv_analyzer::pointer_equiv_analyzer): New.
22665 (pointer_equiv_analyzer::~pointer_equiv_analyzer): New.
22666 (pointer_equiv_analyzer::set_global_equiv): New.
22667 (pointer_equiv_analyzer::set_cond_equiv): New.
22668 (pointer_equiv_analyzer::get_equiv): New.
22669 (pointer_equiv_analyzer::enter): New.
22670 (pointer_equiv_analyzer::leave): New.
22671 (pointer_equiv_analyzer::get_equiv_expr): New.
22672 (pta_valueize): New.
22673 (pointer_equiv_analyzer::visit_stmt): New.
22674 (pointer_equiv_analyzer::visit_edge): New.
22675 (hybrid_folder::value_of_expr): Call PTA.
22676 (hybrid_folder::value_on_edge): Same.
22677 (hybrid_folder::pre_fold_bb): New.
22678 (hybrid_folder::post_fold_bb): New.
22679 (hybrid_folder::pre_fold_stmt): New.
22680 (rvrp_folder::pre_fold_bb): New.
22681 (rvrp_folder::post_fold_bb): New.
22682 (rvrp_folder::pre_fold_stmt): New.
22683 (rvrp_folder::value_of_expr): Call PTA.
22684 (rvrp_folder::value_on_edge): Same.
22686 2021-06-08 Jakub Jelinek <jakub@redhat.com>
22689 * tree-inline.c (copy_tree_body_r): For OMP_CLAUSE_DEPEND don't
22690 check TREE_CODE if OMP_CLAUSE_DECL is NULL.
22692 2021-06-08 Richard Biener <rguenther@suse.de>
22694 PR middle-end/100951
22695 * tree-vect-generic.c (expand_vector_piecewise): Build a
22696 VECTOR_CST if all elements are constant.
22697 (expand_vector_condition): Likewise.
22698 (lower_vec_perm): Likewise.
22699 (expand_vector_conversion): Likewise.
22701 2021-06-08 Martin Liska <mliska@suse.cz>
22703 * doc/invoke.texi: Document new param evrp-sparse-threshold.
22705 2021-06-08 Martin Liska <mliska@suse.cz>
22707 * genautomata.c (create_automata): Fix typo.
22709 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
22711 PR tree-optimization/100794
22712 * tree-predcom.c (tree_predictive_commoning_loop): Add parameter
22713 allow_unroll_p and only allow unrolling when it's true.
22714 (tree_predictive_commoning): Add parameter allow_unroll_p and
22716 (run_tree_predictive_commoning): Likewise.
22717 (pass_predcom::gate): Check flag_tree_loop_vectorize and
22718 global_options_set.x_flag_predictive_commoning.
22719 (pass_predcom::execute): Adjust for allow_unroll_p.
22721 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
22723 * tree-predcom.c (execute_pred_commoning): Remove update_ssa call.
22724 (tree_predictive_commoning_loop): Factor some cleanup code into
22725 lambda function cleanup, remove scev_reset call, and adjust return
22727 (tree_predictive_commoning): Adjust for different changed values,
22728 only set flag TODO_update_ssa_only_virtuals if changed.
22729 (pass_data pass_data_predcom): Remove TODO_update_ssa_only_virtuals
22730 from todo_flags_finish.
22732 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
22734 * gimple-range-cache.cc (class sbr_sparse_bitmap): New.
22735 (sbr_sparse_bitmap::sbr_sparse_bitmap): New.
22736 (sbr_sparse_bitmap::bitmap_set_quad): New.
22737 (sbr_sparse_bitmap::bitmap_get_quad): New.
22738 (sbr_sparse_bitmap::set_bb_range): New.
22739 (sbr_sparse_bitmap::get_bb_range): New.
22740 (sbr_sparse_bitmap::bb_range_p): New.
22741 (block_range_cache::block_range_cache): Initialize bitmap obstack.
22742 (block_range_cache::~block_range_cache): Destruct obstack.
22743 (block_range_cache::set_bb_range): Decide when to utilize the
22744 sparse on entry cache.
22745 * gimple-range-cache.h (block_range_cache): Add bitmap obstack.
22746 * params.opt (-param=evrp-sparse-threshold): New.
22748 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
22750 * bitmap.c (bitmap_set_aligned_chunk): New.
22751 (bitmap_get_aligned_chunk): New.
22752 (test_aligned_chunk): New.
22753 (bitmap_c_tests): Call test_aligned_chunk.
22754 * bitmap.h (bitmap_set_aligned_chunk, bitmap_get_aligned_chunk): New.
22756 2021-06-07 Uroš Bizjak <ubizjak@gmail.com>
22759 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
22761 (ix86_expand_vector_init_one_nonzero): Ditto.
22762 (ix86_expand_vector_init_one_var): Ditto.
22763 (ix86_expand_vector_init_general): Ditto.
22764 * config/i386/mmx.md (vec_initv4qiqi): New expander.
22766 2021-06-07 Jeff Law <jeffreyalaw@gmail.com>
22768 * config/h8300/movepush.md: Change most _clobber_flags
22769 patterns to instead use <cczn> subst.
22770 (movsi_cczn): New pattern with usable CC cases split out.
22771 (movsi_h8sx_cczn): Likewise.
22773 2021-06-07 Martin Liska <mliska@suse.cz>
22775 * common/common-target.def: Split long lines and replace them
22777 * target.def: Likewise.
22778 * doc/tm.texi: Re-generated.
22780 2021-06-07 Jakub Jelinek <jakub@redhat.com>
22783 * fold-const.c (fold_read_from_vector): Return NULL if trying to
22784 read from a CONSTRUCTOR with vector type elements.
22786 2021-06-07 Jakub Jelinek <jakub@redhat.com>
22788 PR middle-end/100898
22789 * tree-inline.c (copy_bb): Only use gimple_call_arg_ptr if memcpy
22790 should copy any arguments. Don't call gimple_call_num_args
22791 on id->call_stmt or call_stmt more than once.
22793 2021-06-07 liuhongt <hongtao.liu@intel.com>
22796 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): Refine
22798 (<insn>v4siv4di2): Delete constraints for define_expand.
22800 2021-06-07 liuhongt <hongtao.liu@intel.com>
22803 * config/i386/i386-expand.c (ix86_expand_builtin): Remove
22804 assignment of cfun->machine->has_explicit_vzeroupper.
22805 * config/i386/i386-features.c
22806 (ix86_add_reg_usage_to_vzerouppers): Delete.
22807 (ix86_add_reg_usage_to_vzeroupper): Ditto.
22808 (rest_of_handle_insert_vzeroupper): Remove
22809 ix86_add_reg_usage_to_vzerouppers, add df_analyze at the end
22811 (gate): Remove cfun->machine->has_explicit_vzeroupper.
22812 * config/i386/i386-protos.h (ix86_expand_avx_vzeroupper):
22814 * config/i386/i386.c (ix86_insn_callee_abi): New function.
22815 (ix86_initialize_callee_abi): Ditto.
22816 (ix86_expand_avx_vzeroupper): Ditto.
22817 (ix86_hard_regno_call_part_clobbered): Adjust for vzeroupper
22819 (TARGET_INSN_CALLEE_ABI): Define as ix86_insn_callee_abi.
22820 (ix86_emit_mode_set): Call ix86_expand_avx_vzeroupper
22822 * config/i386/i386.h (struct GTY(()) machine_function): Delete
22823 has_explicit_vzeroupper.
22824 * config/i386/i386.md (enum unspec): New member
22826 (ABI_DEFAULT, ABI_VZEROUPPER, ABI_UNKNOWN): New
22827 define_constants for insn callee abi index.
22828 * config/i386/predicates.md (vzeroupper_pattern): Adjust.
22829 * config/i386/sse.md (UNSPECV_VZEROUPPER): Deleted.
22830 (avx_vzeroupper): Call ix86_expand_avx_vzeroupper.
22831 (*avx_vzeroupper): Rename to...
22832 (avx_vzeroupper_callee_abi): ...this, and adjust pattern as
22833 call_insn which has a special vzeroupper ABI.
22834 (*avx_vzeroupper_1): Deleted.
22836 2021-06-07 liuhongt <hongtao.liu@intel.com>
22839 * df-scan.c (df_get_call_refs): When call_insn is a fake call,
22840 it won't use stack pointer reg.
22841 * final.c (leaf_function_p): When call_insn is a fake call, it
22842 won't affect caller as a leaf function.
22843 * reg-stack.c (callee_clobbers_any_stack_reg): New.
22844 (subst_stack_regs): When call_insn doesn't clobber any stack
22845 reg, don't clear the arguments.
22846 * rtl.c (shallow_copy_rtx): Don't clear flag used when orig is
22848 * shrink-wrap.c (requires_stack_frame_p): No need for stack
22849 frame for a fake call.
22850 * rtl.h (FAKE_CALL_P): New macro.
22852 2021-06-06 Eric Botcazou <ebotcazou@adacore.com>
22854 * config/sparc/sparc-protos.h (order_regs_for_local_alloc): Rename
22856 (sparc_order_regs_for_local_alloc): ...this.
22857 (sparc_leaf_reg_remap): Declare.
22858 * config/sparc/sparc.h (ADJUST_REG_ALLOC_ORDER): Adjust.
22859 (LEAF_REG_REMAP): Reimplement as call to sparc_leaf_reg_remap.
22860 * config/sparc/sparc.c (leaf_reg_remap): Delete.
22861 (order_regs_for_local_alloc): Rename to...
22862 (sparc_order_regs_for_local_alloc): ...this.
22863 (sparc_leaf_reg_remap): New function.
22864 (sparc_conditional_register_usage): Do not modify leaf_reg_remap.
22866 2021-06-06 David Edelsohn <dje.gcc@gmail.com>
22868 * config/rs6000/rs6000.c (rs6000_xcoff_asm_output_aligned_decl_common):
22869 Use assemble_name to output BSS section name.
22871 2021-06-06 Uroš Bizjak <ubizjak@gmail.com>
22873 * config/i386/constraints.md (Bs):
22874 Remove boolean operators from match_test RTX.
22877 (M): Use "mode" variable instead of GET_MODE (op) in match_test RTX.
22880 2021-06-06 Martin Liska <mliska@suse.cz>
22882 * doc/extend.texi: Add missing @headitem.
22883 * doc/invoke.texi: Likewise.
22884 * doc/objc.texi: Likewise.
22886 2021-06-06 Martin Liska <mliska@suse.cz>
22888 * genhooks.c (emit_findices): Remove unused function.
22889 (emit_documentation): Do not call emit_findices
22890 and do not search for @Fcode directives.
22892 2021-06-06 Martin Liska <mliska@suse.cz>
22894 * doc/invoke.texi: Remove extra character.
22896 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22898 * config/sh/sh.md (doloop_end_split): Fix empty split condition.
22900 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22902 * config/sparc/sparc.md (*snedi<W:mode>_zero_vis3,
22903 *neg_snedi<W:mode>_zero_subxc, *plus_snedi<W:mode>_zero,
22904 *plus_plus_snedi<W:mode>_zero, *minus_snedi<W:mode>_zero,
22905 *minus_minus_snedi<W:mode>_zero): Fix empty split condition.
22907 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22909 * config/or1k/or1k.md (*movdi): Fix empty split condition.
22911 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22913 * config/mips/mips.md (<anonymous>, bswapsi2, bswapdi2): Fix empty
22916 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22918 * config/m68k/m68k.md (*zero_extend_inc, *zero_extend_dec,
22919 *zero_extendsidi2): Fix empty split condition.
22921 2021-06-05 Jeff Law <jeffreyalaw@gmail.com>
22923 * config/h8300/addsub.md: Fix split condition in define_insn_and_split
22925 * config/h8300/bitfield.md: Likewise.
22926 * config/h8300/combiner.md: Likewise.
22927 * config/h8300/divmod.md: Likewise.
22928 * config/h8300/extensions.md: Likewise.
22929 * config/h8300/jumpcall.md: Likewise.
22930 * config/h8300/movepush.md: Likewise.
22931 * config/h8300/multiply.md: Likewise.
22932 * config/h8300/other.md: Likewise.
22933 * config/h8300/shiftrotate.md: Likewise.
22934 * config/h8300/logical.md: Likewise. Fix split pattern to use
22935 code iterator that somehow slipped through.
22937 2021-06-04 Tobias Burnus <tobias@codesourcery.com>
22939 PR middle-end/100905
22940 * tree-nested.c (convert_nonlocal_omp_clauses,
22941 convert_local_omp_clauses): Handle OMP_CLAUSE_BIND.
22943 2021-06-04 Martin Sebor <msebor@redhat.com>
22945 PR middle-end/100732
22946 * gimple-fold.c (gimple_fold_builtin_sprintf): Avoid folding calls
22947 with either source or destination argument of invalid type.
22948 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Avoid checking
22949 calls with arguments of invalid type.
22951 2021-06-04 Martin Sebor <msebor@redhat.com>
22953 * attribs.c (init_attr_rdwr_indices): Use VLA bounds in the expected
22955 (attr_access::vla_bounds): Also handle VLA bounds.
22957 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
22959 * config/i386/predicates.md (GOT_memory_operand):
22960 Implement using match_code RTXes.
22961 (GOT32_symbol_operand): Ditto.
22963 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
22966 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
22968 (ix86_expand_vector_init_general): Ditto.
22969 Use SImode instead of word_mode for logic operations
22970 when GET_MODE_SIZE (mode) < UNITS_PER_WORD.
22971 (expand_vec_perm_even_odd_1): Assert that V2HI mode should be
22972 implemented by expand_vec_perm_1.
22973 (expand_vec_perm_broadcast_1): Assert that V2HI and V4HI modes
22974 should be implemented using standard shuffle patterns.
22975 (ix86_vectorize_vec_perm_const): Handle V2HImode. Add V4HI and
22976 V2HI modes to modes, implementable with shuffle for one operand.
22977 * config/i386/mmx.md (*punpckwd): New insn_and_split pattern.
22978 (*pshufw_1): New insn pattern.
22979 (*vec_dupv2hi): Ditto.
22980 (vec_initv2hihi): New expander.
22982 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
22984 * config/arm/vfp.md (no_literal_pool_df_immediate,
22985 no_literal_pool_sf_immediate): Fix empty split condition.
22987 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
22989 * config/i386/i386.md (*load_tp_x32_zext, *add_tp_x32_zext,
22990 *tls_dynamic_gnu2_combine_32): Fix empty split condition.
22991 * config/i386/sse.md (*<sse2_avx2>_pmovmskb_lt,
22992 *<sse2_avx2>_pmovmskb_zext_lt, *sse2_pmovmskb_ext_lt,
22993 *<sse4_1_avx2>_pblendvb_lt): Likewise.
22995 2021-06-04 Jakub Jelinek <jakub@redhat.com>
22998 * config/i386/i386-expand.c (ix86_expand_vector_init): Handle
22999 concatenation from half-sized modes with TImode elements.
23001 2021-06-04 Claudiu Zissulescu <claziss@synopsys.com>
23003 * config/arc/arc.c (arc_override_options): Disable millicode
23004 thunks when RF16 is on.
23006 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
23008 * config/rs6000/rs6000.h (PROMOTE_MODE): Remove.
23010 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
23012 * config/rs6000/rs6000-call.c (rs6000_promote_function_mode):
23013 Replace PROMOTE_MODE macro with its content.
23015 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
23017 * config/cris/cris.md (*addi_reload): Fix empty split condition.
23019 2021-06-03 Jim Wilson <jimw@sifive.com>
23021 * config.gcc (riscv*-*-*): If --with-riscv-attribute not used,
23022 turn it on for all riscv targets.
23024 2021-06-03 Uroš Bizjak <ubizjak@gmail.com>
23027 * config/i386/i386-expand.c (ix86_expand_vector_set):
23028 Handle V2HI and V4QI modes.
23029 (ix86_expand_vector_extract): Ditto.
23030 * config/i386/mmx.md (*pinsrw): New insn pattern.
23033 (*pextrw_zext): Ditto.
23035 (*pextrb_zext): Ditto.
23036 (vec_setv2hi): New expander.
23037 (vec_extractv2hihi): Ditto.
23038 (vec_setv4qi): Ditto.
23039 (vec_extractv4qiqi): Ditto.
23040 (vec_setv8qi): Enable only for TARGET_SSE4_1.
23041 (vec_extractv8qiqi): Ditto.
23043 2021-06-03 Aaron Sawdey <acsawdey@linux.ibm.com>
23045 * config/rs6000/genfusion.pl (gen_logical_addsubf): Fix input
23046 order to subf instruction.
23047 * config/rs6000/fusion.md: Regenerate.
23049 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
23051 * calls.c (get_size_range): Use range_of_expr instead of
23052 determine_value_range.
23053 * tree-affine.c (expr_to_aff_combination): Same.
23054 * tree-data-ref.c (split_constant_offset): Same.
23055 * tree-vrp.c (determine_value_range_1): Remove.
23056 (determine_value_range): Remove.
23057 * tree-vrp.h (determine_value_range): Remove.
23059 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
23061 * function-tests.c (test_ranges): Call gimple_range_tests.
23062 * gimple-range-cache.cc (ranger_cache::range_of_expr): Pass stmt
23064 * gimple-range.cc (fur_source::get_operand): Do not call
23065 get_tree_range or gimple_range_global.
23067 (get_tree_range): Move to value-query.cc.
23068 Call get_arith_expr_range.
23069 (gimple_ranger::range_of_expr): Add argument to get_tree_range.
23070 Include gimple-range-tests.cc.
23071 * gimple-range.h (fold_range): Add argument.
23072 (get_tree_range): Remove.
23073 * selftest.h (gimple_range_tests): New.
23074 * value-query.cc (global_range_query::range_of_expr): Add
23076 (range_query::get_tree_range): Move from gimple-range.cc.
23077 * value-query.h (class range_query): Add get_tree_range and
23078 get_arith_expr_range. Make fur_source a friend.
23079 * vr-values.c (vr_values::range_of_expr): Pass stmt to
23081 * gimple-range-tests.cc: New file.
23083 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
23085 * gimple-range.cc (gimple_ranger::export_global_ranges): Call
23086 update_global_range.
23087 * value-query.cc (update_global_range): New.
23088 * value-query.h (update_global_range): New.
23090 2021-06-03 David Malcolm <dmalcolm@redhat.com>
23092 * diagnostic-show-locus.c (diagnostic_show_locus): Don't reject
23093 printing the same location twice if there are fix-it hints,
23094 multiple locations, or a label.
23096 2021-06-03 Andre Vieira <andre.simoesdiasvieira@arm.com>
23098 * tree-vect-loop.c (vect_transform_loop): Use main loop's various
23099 thresholds to narrow the upper bound on epilogue iterations.
23101 2021-06-03 Christophe Lyon <christophe.lyon@linaro.org>
23103 * config/arm/mve.md (mve_vabsq_f<mode>): Use 'abs' instead of unspec.
23104 (mve_vabsq_s<mode>): Likewise.
23105 * config/arm/neon.md (abs<mode>2): Rename to neon_abs<mode>2.
23106 * config/arm/unspecs.md (VABSQ_F, VABSQ_S): Delete.
23107 * config/arm/vec-common.md (neg<mode>2): Rename to
23108 <absneg_str><mode>2.
23110 2021-06-03 Claudiu Zissulescu <claziss@synopsys.com>
23112 * common/config/arc/arc-common.c (arc_option_optimization_table):
23113 Remove malign-call.
23114 * config/arc/arc.c (arc_unalign_branch_p): Remove unused function.
23115 * config/arc/arc.h (TARGET_MIXED_CODE): Remove macro.
23116 (INDEX_REG_CLASS): Only refer to GENERAL_REGS.
23117 * config/arc/arc.md (abssi2_mixed): Remove pattern.
23118 * config/arc/arc.opt (munalign-prob-threshold): Mark it obsolete.
23119 (malign-call): Likewise.
23120 (mmixed-code): Likewise.
23121 * doc/invoke.texi (ARC): Update doc.
23123 2021-06-03 Martin Liska <mliska@suse.cz>
23125 * common.opt: Use proper Enum values.
23126 * opts.c (COVERAGE_SANITIZER_OPT): Remove.
23127 (parse_sanitizer_options): Handle only sanitizer_opts.
23128 (common_handle_option): Just assign value.
23130 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
23133 * tree-inline.c (inline_forbidden_p): Remove test on return type.
23135 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
23137 * dwarf2out.c (loc_list_from_tree_1) <FUNCTION_DECL>: Also generate
23138 DW_OP_GNU_variable_value referencing an existing DIE at file scope.
23139 (type_byte_size): Inline into...
23140 (add_byte_size_attribute): ...this and call add_scalar_info.
23142 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
23144 * dwarf2out.c (mem_loc_descriptor) <UDIV>: Fix typo.
23145 (typed_binop_from_tree): New function.
23146 (loc_list_from_tree_1) <EXACT_DIV_EXPR>: For an unsigned type,
23147 turn a divide by a power of 2 into a shift.
23148 <CEIL_DIV_EXPR>: For an unsigned type, use a signed divide if the
23149 size of the mode is lower than DWARF2_ADDR_SIZE; otherwise, do a
23150 typed divide by calling typed_binop_from_tree.
23152 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
23154 * dwarf2out.c (scompare_loc_descriptor): Fix head comment.
23155 (is_handled_procedure_type): Likewise.
23156 (struct loc_descr_context): Add strict_signedness field.
23157 (resolve_args_picking_1): Deal with DW_OP_[GNU_]deref_type,
23158 DW_OP_[GNU_]convert and DW_OP_[GNU_]reinterpret.
23159 (resolve_args_picking): Minor tweak.
23160 (function_to_dwarf_procedure): Initialize strict_signedness field.
23161 (type_byte_size): Likewise.
23162 (field_byte_offset): Likewise.
23163 (gen_descr_array_type_die): Likewise.
23164 (gen_variant_part): Likewise.
23165 (loc_list_from_tree_1) <CALL_EXPR>: Tidy up and set strict_signedness
23166 to true when a context is present before evaluating the arguments.
23167 <COND_EXPR>: Do not generate a useless comparison with zero.
23168 When dereferencing an address, if strict_signedness is true and the
23169 type is small and signed, use DW_OP_deref_type to do the dereference
23170 and then DW_OP_convert to convert back to the generic type.
23172 2021-06-03 Jakub Jelinek <jakub@redhat.com>
23175 * tree-inline.c (copy_tree_body_r): Handle iterators on
23176 OMP_CLAUSE_AFFINITY or OMP_CLAUSE_DEPEND.
23178 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
23180 * config/arc/arc.md (*bbit_di): Remove.
23182 2021-06-02 Christoph Muellner <cmuellner@gcc.gnu.org>
23184 PR rtl-optimization/100264
23185 * ree.c (get_sub_rtx): Ignore SET expressions without register
23186 destinations and remove assertion, as it is not valid anymore
23187 with this new behaviour.
23188 (merge_def_and_ext): Eliminate destination check for register
23189 as such SET expressions can't occur anymore.
23190 (combine_reaching_defs): Likewise.
23192 2021-06-02 Jakub Jelinek <jakub@redhat.com>
23195 * config/xtensa/xtensa.h (LEAF_REG_REMAP): Cast REGNO to int to avoid
23196 -Wtype-limits warnings.
23197 (DWARF_FRAME_REGISTER): Rewrite into ternary operator with addition
23198 in operands to avoid -Wsign-compare warnings.
23200 2021-06-02 Pat Haugen <pthaugen@linux.ibm.com>
23202 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): Use
23205 2021-06-02 Vineet Gupta <vgupta@synopsys.com>
23207 * config/arc/arc.h (TARGET_CPU_DEFAULT): Change to hs38_linux.
23209 2021-06-02 Ilya Leoshkevich <iii@linux.ibm.com>
23211 * config/s390/s390.md (*ashrdi3_31<setcc><cconly>): Use a single
23213 * config/s390/subst.md (cconly_subst): Use a single constraint
23214 in (match_scratch).
23216 2021-06-02 Martin Liska <mliska@suse.cz>
23218 * ipa-icf.h: Use auto_vec for memory_access_types.
23220 2021-06-02 Jeff Law <jeffreyalaw@gmail.com>
23222 * config/h8300/h8300-protos.h (compute_a_shift_length): Drop unused
23223 argument from prototype.
23224 (output_logical_op): Add rtx_code argument.
23225 (compute_logical_op_length): Likewise.
23226 * config/h8300/h8300.c (h8300_and_costs): Pass additional argument
23227 to compute_a_shift_length.
23228 (output_logical_op): New argument with the rtx code rather than
23229 extracting it from an operand. Handle QImode too.
23230 (compute_logical_op_length): Similarly.
23231 (compute_a_shift_length): Drop unused argument.
23232 * config/h8300/h8300.md (logicals): New code iterator.
23233 * config/h8300/logical.md (<code><mode>3 expander): Combine
23234 the "and" expander with the "ior"/"xor" expander.
23235 (bclr<mode>msx): Combine the QI/HI mode patterns.
23236 (<logical><mode>3 insns): Use code iterator rather than match_operator.
23237 Handle QImode as well. Update call to output_logical_op and
23238 compute_logical_op_length to pass in rtx_code.
23239 Fix split condition on all define_insn_and_split patterns.
23240 (one_cmpl<mode>2<cczn>): Use <cczn> to support both clobbering
23241 the flags and setting ZN via existing define_subst.
23242 * config/h8300/shiftrotate.md: Drop unused argument from
23243 calls to compute_a_shift_length.
23244 Signed-off-by: Jeff Law <jeffreyalaw@gmail.com>
23246 2021-06-01 Andrew Pinski <apinski@marvell.com>
23248 PR tree-optimization/25290
23249 * tree-ssa-phiopt.c (match_simplify_replacement):
23251 (tree_ssa_phiopt_worker): Use match_simplify_replacement.
23252 (two_value_replacement): Change the comment about
23253 conditional_replacement.
23254 (conditional_replacement): Delete.
23256 2021-06-01 Andrew Pinski <apinski@marvell.com>
23258 PR tree-optimization/95481
23259 * tree-tailcall.c (find_tail_calls): Handle empty typed
23262 2021-06-01 Andrew Pinski <apinski@marvell.com>
23264 * gimplify.c (zero_sized_field_decl): Delete.
23265 (zero_sized_type): Delete.
23266 (gimplify_init_ctor_eval): Use is_empty_type instead
23267 of zero_sized_field_decl.
23268 (gimplify_modify_expr): Use is_empty_type instead of
23271 2021-06-01 Jason Merrill <jason@redhat.com>
23274 * tree.h (CALL_FROM_NEW_OR_DELETE_P): Adjust comment.
23276 2021-06-01 Jason Merrill <jason@redhat.com>
23279 * diagnostic.h (warning_enabled_at): Declare.
23280 * diagnostic.c (diagnostic_enabled): Factor out from...
23281 (diagnostic_report_diagnostic): ...here.
23282 (warning_enabled_at): New.
23284 2021-06-01 Aldy Hernandez <aldyh@redhat.com>
23286 * gimple-ssa-evrp.c: Enable exporting of global ranges.
23288 2021-06-01 Martin Liska <mliska@suse.cz>
23291 * doc/invoke.texi: Mention that -fgcse-after-reload
23292 is enabled with -O3.
23294 2021-06-01 liuhongt <hongtao.liu@intel.com>
23296 PR tree-optimization/98365
23297 * tree-if-conv.c (strip_nop_cond_scalar_reduction): New function.
23298 (is_cond_scalar_reduction): Handle nop_expr in cond scalar reduction.
23299 (convert_scalar_cond_reduction): Ditto.
23300 (predicate_scalar_phi): Ditto.
23302 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
23304 PR tree-optimization/100781
23305 * gimple-range-cache.cc (ranger_cache::ranger_cache): Enable new
23306 value calculation by default.
23307 (ranger_cache::enable_new_values): New.
23308 (ranger_cache::disable_new_values): New.
23309 (ranger_cache::push_poor_value): Check if new values are allowed.
23310 * gimple-range-cache.h (class ranger_cache): New member/methods.
23311 * gimple-range.cc (gimple_ranger::range_of_expr): Check for debug
23312 statement, and disable/re-enable new value calculation.
23314 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
23316 * gimple-range-cache.cc (ranger_cache::ssa_range_in_bb): Delete.
23317 (ranger_cache::range_of_def): New.
23318 (ranger_cache::entry_range): New.
23319 (ranger_cache::exit_range): New.
23320 (ranger_cache::range_of_expr): Adjust.
23321 (ranger_cache::range_on_edge): Adjust.
23322 (ranger_cache::propagate_cache): Call exit_range directly.
23323 * gimple-range-cache.h (class ranger_cache): Adjust.
23325 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
23327 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust for
23328 gori_compute being a member rather than base class.
23329 derived call to member call.
23330 (ranger_cache::dump): No longer dump gori_map.
23331 (ranger_cache::dump_bb): New.
23332 (ranger_cache::get_non_stale_global_range): Adjust for gori_compute
23333 being a member rather than base class.
23334 (ranger_cache::set_global_range): Ditto.
23335 (ranger_cache::ssa_range_in_bb): Ditto.
23336 (ranger_cache::range_of_expr): New.
23337 (ranger_cache::range_on_edge): New.
23338 (ranger_cache::block_range): Adjust for gori_computes. Debug changes.
23339 (ranger_cache::propagate_cache): Adjust debugging output.
23340 (ranger_cache::fill_block_cache): Adjust for gori_computes. Debug
23342 * gimple-range-cache.h (class ranger_cache): Make gori_compute a
23343 member, and inherit from range_query instead.
23344 (ranger_cache::dump_bb): New. Split from dump.
23345 * gimple-range-gori.cc (gori_compute::ssa_range_in_bb): Delete.
23346 (gori_compute::expr_range_at_stmt): Delete.
23347 (gori_compute::compute_name_range_op): Delete.
23348 (gori_compute::compute_operand_range_switch): Add fur_source.
23349 (gori_compute::compute_operand_range): Add fur_source param, inline
23350 old compute_name_range_op and optimize_logical_operands.
23351 (struct tf_range): Delete.
23352 (gori_compute::logical_combine): Adjust.
23353 (gori_compute::optimize_logical_operands): Delete.
23354 (gori_compute::compute_logical_operands_in_chain): Delete.
23355 (gori_compute::compute_logical_operands): Adjust.
23356 (gori_compute::compute_operand1_range): Adjust to fur_source.
23357 (gori_compute::compute_operand2_range): Ditto.
23358 (gori_compute::compute_operand1_and_operand2_range): Ditto.
23359 (gori_compute::outgoing_edge_range_p): Add range_query parameter,
23360 and adjust to fur_source.
23361 * gimple-range-gori.h (class gori_compute): Simplify and adjust to
23362 range_query and fur_source.
23363 * gimple-range.cc (gimple_ranger::range_on_edge): Query range_on_edge
23364 from the ranger_cache.
23365 (gimple_ranger::fold_range_internal): Adjust to base class change of
23367 (gimple_ranger::dump_bb): Adjust dump.
23368 * gimple-range.h (gimple_ranger): Export gori computes object.
23370 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
23372 PR tree-optimization/100774
23373 * gimple-range-cache.cc (ranger_cache::get_non_stale_global_range):
23374 Constant values are also not stale.
23375 (ranger_cache::set_global_range): Range invariant values should also
23376 have the correct timestamp.
23378 2021-05-31 Martin Liska <mliska@suse.cz>
23380 * tree-streamer-in.c (unpack_ts_function_decl_value_fields):
23381 Unpack FUNCTION_DECL_DECL_TYPE.
23382 * tree-streamer-out.c (pack_ts_function_decl_value_fields):
23383 Stream FUNCTION_DECL_DECL_TYPE instead of
23384 DECL_IS_OPERATOR_NEW_P.
23385 * tree.h (set_function_decl_type): Use FUNCTION_DECL_DECL_TYPE
23387 (DECL_IS_OPERATOR_NEW_P): Likewise.
23388 (DECL_IS_OPERATOR_DELETE_P): Likewise.
23389 (DECL_LAMBDA_FUNCTION_P): Likewise.
23391 2021-05-31 Richard Biener <rguenther@suse.de>
23394 * internal-fn.c (expand_SHUFFLEVECTOR): Define.
23395 * internal-fn.def (SHUFFLEVECTOR): New.
23396 * internal-fn.h (expand_SHUFFLEVECTOR): Declare.
23397 * doc/extend.texi: Document __builtin_shufflevector.
23399 2021-05-31 Peter Bergner <bergner@linux.ibm.com>
23402 * config/rs6000/predicates.md (mma_assemble_input_operand): Allow
23403 indexed form addresses.
23405 2021-05-29 Jeff Law <jlaw@tachyum.com>
23407 * config/h8300/h8300.c (h8300_emit_stack_adjustment): Drop unused
23408 parameter. All callers fixed.
23410 (output_plussi): Add FALLTHRU markers.
23411 (h8300_shift_needs_scratch_p): Add gcc_unreachable marker.
23413 2021-05-29 Jakub Jelinek <jakub@redhat.com>
23415 PR middle-end/99928
23416 * gimplify.c (gimplify_scan_omp_clauses): For taskloop simd
23417 combined with parallel, make sure to add shared clause to
23418 parallel for explicit linear clause.
23420 2021-05-29 Aldy Hernandez <aldyh@redhat.com>
23422 PR tree-optimization/100787
23423 * gimple-ssa-evrp.c: Disable exporting of global ranges.
23425 2021-05-28 Jason Merrill <jason@redhat.com>
23427 * tree-iterator.h (struct tree_stmt_iterator): Add operator++,
23428 operator--, operator*, operator==, and operator!=.
23429 (class tsi_range): New.
23431 2021-05-28 Richard Biener <rguenther@suse.de>
23433 PR tree-optimization/100778
23434 * tree-vect-slp.c (vect_build_slp_tree_1): Prevent possibly
23435 trapping ops in different BBs.
23437 2021-05-28 Richard Biener <rguenther@suse.de>
23440 * tree-inline.c (copy_bb): When processing __builtin_va_arg_pack
23441 copy fntype from original call.
23443 2021-05-28 Martin Liska <mliska@suse.cz>
23445 PR gcov-profile/100751
23446 * doc/gcov.texi: Revert partially a hunk that was wrong.
23448 2021-05-28 Cooper Qu <cooper.qu@linux.alibaba.com>
23450 * config/csky/csky-linux-elf.h (HAVE_sync_compare_and_swapqi):
23452 (HAVE_sync_compare_and_swaphi): Likewise.
23453 (HAVE_sync_compare_and_swapsi): Likewise.
23455 2021-05-28 Jakub Jelinek <jakub@redhat.com>
23457 PR middle-end/99928
23458 * tree.h (OMP_CLAUSE_MAP_IMPLICIT): Define.
23460 2021-05-28 Tobias Burnus <tobias@codesourcery.com>
23462 * gimplify.c (gimplify_omp_affinity): New.
23463 (gimplify_scan_omp_clauses): Call it; remove affinity clause afterwards.
23464 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_AFFINITY.
23465 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_AFFINITY.
23466 * tree.c (omp_clause_num_ops, omp_clause_code_name): Add clause.
23467 (walk_tree_1): Handle OMP_CLAUSE_AFFINITY.
23469 2021-05-28 Joern Rennecke <joern.rennecke@riscy-ip.com>
23470 Richard Biener <rguenther@suse.de>
23472 * match.pd <popcount & / + pattern matching>:
23473 When generating popcount directly fails, try doing it in two halves.
23475 2021-05-28 Bernd Edlinger <bernd.edlinger@hotmail.de>
23477 * Makefile.in (generated_files): Add gimple-match.c and
23480 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
23482 * gensupport.c (alter_predicate_for_insn): Handle MATCH_DUP.
23484 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
23486 * gensupport.c (alter_constraints): Add MATCH_SCRATCH case.
23488 2021-05-28 Kewen Lin <linkw@linux.ibm.com>
23490 PR tree-optimization/99398
23491 * tree-ssa-forwprop.c (simplify_permutation): Optimize some cases
23492 where the fed operands are CTOR/CST and propagated through
23493 VIEW_CONVERT_EXPR. Call vec_perm_indices::new_shrunk_vector.
23494 * vec-perm-indices.c (vec_perm_indices::new_shrunk_vector): New
23496 * vec-perm-indices.h (vec_perm_indices::new_shrunk_vector): New
23499 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
23501 * config/i386/mmx.md (addv2sf3): Do not call
23502 ix86_fixup_binary_operands_no_copy.
23505 (<smaxmin:code>v2sf3): Ditto.
23506 (<plusminus:insn><MMXMODEI:mode>3): Ditto.
23507 (<plusminus:insn><VI_32:mode>3): Remove expander.
23508 (<plusminus:insn><VI_32:mode>3): Rename from
23509 "*<plusminus:insn><VI_32:mode>3".
23510 (mulv4hi): Do not call ix86_fixup_binary_operands_no_copy.
23511 (mulv2hi3): Remove expander.
23512 (mulv2hi3): Rename from *mulv2hi3.
23513 (<s>mulv2hi3_highpart): Remove expander.
23514 (<s>mulv2hi3_highpart): Rename from *<s>mulv2hi3_highpart.
23515 (<smaxmin:code><MMXMODE14:mode>3): Rename from
23516 "*mmx_<smaxmin:code><MMXMODE14:mode>3".
23517 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Remove expander.
23518 (SMAXMIN_MMXMODEI): Remove mode iterator.
23519 (<smaxmin:code>v4hi3): New expander.
23520 (<smaxmin:code>v4qi3): Rename from *<smaxmin:code>v4qi3.
23521 (<smaxmin:code>v2hi3): Rename from *<smaxmin:code>v2hi3.
23522 (<smaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
23523 (SMAXMIN_VI_32): Remove mode iterator.
23524 (<umaxmin:code><MMXMODE24:mode>3): Rename from
23525 "*mmx_<umaxmin:code><MMXMODE24:mode>3".
23526 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Remove expander.
23527 (UMAXMIN_MMXMODEI): Remove mode iterator.
23528 (<umaxmin:code>v8qi3): New expander.
23529 (<umaxmin:code>v4qi3): Rename from *<umaxmin:code>v4qi3.
23530 (<umaxmin:code>v2hi3): Rename from *<umaxmin:code>v2hi3.
23531 (<umaxmin:code><UMAXMIN_VI_32:mode>3): Remove expander.
23532 (UMAXMIN_VI_32): Remove mode iterator.
23533 (<any_shift:insn>v2hi3): Remove expander.
23534 (<any_shift:insn>v2hi3): Rename from *<any_shift:insn>v2hi3.
23535 (<any_logic:code><MMXMODEI:mode>3): Do not call
23536 ix86_fixup_binary_operands_no_copy.
23537 (<any_logic:code><VI_32:mode>3): Remove expander.
23538 (<any_logic:code><VI_32:mode>3): Rename from
23539 "*<any_logic:code><VI_32:mode>3".
23540 (uavg<mode>3_ceil): Do not call ix86_fixup_binary_operands_no_copy.
23541 * config/i386/sse.md (div<VF2:mode>3): Do not call
23542 ix86_fixup_binary_operands_no_copy.
23543 (div<VF1:mode>3): Ditto.
23544 (<maxmin:code><VI8_AVX2_AVX512F:mode>3): Ditto.
23545 (smulhrsv4hi3): Ditto.
23546 (smulhrsv2hi3): Ditto.
23548 2021-05-27 Martin Sebor <msebor@redhat.com>
23550 * ggc.h (gt_ggc_mx): Add overloads for all integers.
23552 * hash-map.h (class hash_map): Add pch_nx_helper overloads for all
23554 (hash_map::operator==): New function.
23556 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
23559 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
23560 For TARGET_XOP bypass SSE comparisons for all supported vector modes.
23561 * config/i386/mmx.md (*xop_maskcmp<MMXMODEI:mode>3): New insn pattern.
23562 (*xop_maskcmp<VI_32:mode>3): Ditto.
23563 (*xop_maskcmp_uns<MMXMODEI:mode>3): Ditto.
23564 (*xop_maskcmp_uns<VI_32:mode>3): Ditto.
23566 2021-05-27 Richard Earnshaw <rearnsha@arm.com>
23569 * config/arm/arm.c (arm_configure_build_target): Remove parameter
23570 opts_set, directly check opts parameters for being non-null.
23571 (arm_option_restore): Update call to arm_configure_build_target.
23572 (arm_option_override): Likewise.
23573 (arm_can_inline_p): Likewise.
23574 (arm_valid_target_attribute_tree): Likewise.
23575 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
23576 * config/arm/arm-protos.h (arm_configure_build_target): Adjust
23579 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
23581 * vr-values.c (simplify_conversion_using_ranges): Use
23582 get_range_query instead of get_global_range_query.
23584 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
23586 * gimple-range.cc (get_range_global): Move to value-query.cc.
23587 (gimple_range_global): Same.
23588 (get_global_range_query): Same.
23589 (global_range_query::range_of_expr): Same.
23590 * gimple-range.h (class global_range_query): Move to
23592 (gimple_range_global): Same.
23593 * tree-ssanames.c (get_range_info): Move to value-query.cc.
23594 (get_ptr_nonnull): Same.
23595 * tree-ssanames.h (get_range_info): Remove.
23596 (get_ptr_nonnull): Remove.
23597 * value-query.cc (get_ssa_name_range_info): Move from
23599 (get_ssa_name_ptr_info_nonnull): Same.
23600 (get_range_global): Move from gimple-range.cc.
23601 (gimple_range_global): Same.
23602 (get_global_range_query): Same.
23603 (global_range_query::range_of_expr): Same.
23604 * value-query.h (class global_range_query): Move from
23606 (gimple_range_global): Same.
23608 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
23611 * config/i386/mmx.md (uavgv4qi3_ceil): New insn pattern.
23612 (uavgv2hi3_ceil): Ditto.
23614 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
23617 * doc/extend.texi (scalar_storage_order): Rephrase slightly.
23619 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23621 * tree-ssanames.c (get_range_info): Merge both copies of
23622 get_range_info into one that works with irange.
23623 * tree-ssanames.h (get_range_info): Remove version that works on
23626 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23628 * builtins.c (check_nul_terminated_array): Convert to get_range_query.
23629 (expand_builtin_strnlen): Same.
23630 (determine_block_size): Same.
23631 * fold-const.c (expr_not_equal_to): Same.
23632 * gimple-fold.c (size_must_be_zero_p): Same.
23633 * gimple-match-head.c: Include gimple-range.h.
23634 * gimple-pretty-print.c (dump_ssaname_info): Convert to get_range_query.
23635 * gimple-ssa-warn-restrict.c
23636 (builtin_memref::extend_offset_range): Same.
23637 * graphite-sese-to-poly.c (add_param_constraints): Same.
23638 * internal-fn.c (get_min_precision): Same.
23639 * ipa-fnsummary.c (set_switch_stmt_execution_predicate): Same.
23640 * ipa-prop.c (ipa_compute_jump_functions_for_edge): Same.
23642 * tree-data-ref.c (split_constant_offset): Same.
23643 (dr_step_indicator): Same.
23644 * tree-dfa.c (get_ref_base_and_extent): Same.
23645 * tree-scalar-evolution.c (iv_can_overflow_p): Same.
23646 * tree-ssa-loop-niter.c (refine_value_range_using_guard): Same.
23647 (determine_value_range): Same.
23648 (record_nonwrapping_iv): Same.
23649 (infer_loop_bounds_from_signedness): Same.
23650 (scev_var_range_cant_overflow): Same.
23651 * tree-ssa-phiopt.c (two_value_replacement): Same.
23652 * tree-ssa-pre.c (insert_into_preds_of_block): Same.
23653 * tree-ssa-reassoc.c (optimize_range_tests_to_bit_test): Same.
23654 * tree-ssa-strlen.c (handle_builtin_stxncpy_strncat): Same.
23656 (dump_strlen_info): Same.
23657 (set_strlen_range): Same.
23658 (maybe_diag_stxncpy_trunc): Same.
23659 (get_len_or_size): Same.
23660 (handle_integral_assign): Same.
23661 * tree-ssa-structalias.c (find_what_p_points_to): Same.
23662 * tree-ssa-uninit.c (find_var_cmp_const): Same.
23663 * tree-switch-conversion.c (bit_test_cluster::emit): Same.
23664 * tree-vect-patterns.c (vect_get_range_info): Same.
23665 (vect_recog_divmod_pattern): Same.
23666 * tree-vrp.c (intersect_range_with_nonzero_bits): Same.
23667 (register_edge_assert_for_2): Same.
23668 (determine_value_range_1): Same.
23669 * tree.c (get_range_pos_neg): Same.
23670 * vr-values.c (vr_values::get_lattice_entry): Same.
23671 (vr_values::update_value_range): Same.
23672 (simplify_conversion_using_ranges): Same.
23674 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23676 * gimple-ssa-warn-alloca.c (alloca_call_type): Use
23677 get_range_query instead of query argument.
23678 (pass_walloca::execute): Enable and disable global ranger.
23680 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23682 * gimple-ssa-evrp.c (rvrp_folder::rvrp_folder): Call
23684 (rvrp_folder::~rvrp_folder): Call disable_ranger.
23685 (hybrid_folder::hybrid_folder): Call enable_ranger.
23686 (hybrid_folder::~hybrid_folder): Call disable_ranger.
23688 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23690 * function.c (allocate_struct_function): Set cfun->x_range_query.
23691 * function.h (struct function): Declare x_range_query.
23692 (get_range_query): New.
23693 (get_global_range_query): New.
23694 * gimple-range-cache.cc (ssa_global_cache::ssa_global_cache):
23695 Remove call to safe_grow_cleared.
23696 * gimple-range.cc (get_range_global): New.
23697 (gimple_range_global): Move from gimple-range.h.
23698 (get_global_range_query): New.
23699 (global_range_query::range_of_expr): New.
23700 (enable_ranger): New.
23701 (disable_ranger): New.
23702 * gimple-range.h (gimple_range_global): Move to gimple-range.cc.
23703 (class global_range_query): New.
23704 (enable_ranger): New.
23705 (disable_ranger): New.
23706 * gimple-ssa-evrp.c (evrp_folder::~evrp_folder): Rename
23707 dump_all_value_ranges to dump.
23708 * tree-vrp.c (vrp_prop::finalize): Same.
23709 * value-query.cc (range_query::dump): New.
23710 * value-query.h (range_query::dump): New.
23711 * vr-values.c (vr_values::dump_all_value_ranges): Rename to...
23712 (vr_values::dump): ...this.
23713 * vr-values.h (class vr_values): Rename dump_all_value_ranges to
23714 dump and make virtual.
23716 2021-05-26 Uroš Bizjak <ubizjak@gmail.com>
23718 * config/i386/i386.c (ix86_autovectorize_vector_modes):
23719 Add V4QImode and V16QImode for TARGET_SSE2.
23720 * doc/sourcebuild.texi (Vector-specific attributes):
23721 Add vect64 and vect32 description.
23723 2021-05-26 Bernd Edlinger <bernd.edlinger@hotmail.de>
23725 * gimple-range-gori.cc (range_def_chain::register_dependency):
23726 Resize m_def_chain when needed.
23728 2021-05-26 Christophe Lyon <christophe.lyon@linaro.org>
23730 * config/arm/mve.md (mve_vaddvq_<supf><mode>): Prefix with '@'.
23731 * config/arm/neon.md (reduc_plus_scal_<mode>): Move to ...
23732 * config/arm/vec-common.md: ... here. Add support for MVE.
23734 2021-05-26 Jakub Jelinek <jakub@redhat.com>
23736 * config/epiphany/epiphany.c (epiphany_print_operand_address): Remove
23738 * config/microblaze/microblaze.c (microblaze_legitimize_address,
23740 microblaze_option_override, print_operand): Likewise.
23741 * config/microblaze/microblaze.md (call_internal_plt,
23742 call_value_intern_plt, call_value_intern): Likewise.
23743 * config/arm/aout.h (ASM_OUTPUT_ALIGN): Likewise.
23744 * config/iq2000/iq2000.md (call_internal1, call_value_internal1,
23745 call_value_multiple_internal1): Likewise.
23746 * config/bfin/bfin.c (symbolic_reference_mentioned_p): Likewise.
23748 2021-05-26 Jan-Benedict Glaw <jbglaw@lug-owl.de>
23750 * config/arc/arc.c (arc_address_cost, arc_print_operand_address,
23751 arc_ccfsm_advance, symbolic_reference_mentioned_p,
23752 arc_raw_symbolic_reference_mentioned_p): Remove register
23755 2021-05-26 Jakub Jelinek <jakub@redhat.com>
23758 * omp-low.c: Include omp-offload.h.
23759 (create_omp_child_function): If current_function_decl has
23760 "omp declare target" attribute and is_gimple_omp_offloaded,
23761 remove that attribute from the copy of attribute list and
23762 add "omp target entrypoint" attribute instead.
23763 (lower_omp_target): Mark .omp_data_sizes.* and .omp_data_kinds.*
23764 variables for offloading if in omp_maybe_offloaded_ctx.
23765 * omp-offload.c (pass_omp_target_link::execute): Nullify second
23766 argument to GOMP_target_data_ext in offloaded code.
23768 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23770 * config/csky/csky.c (csky_can_change_mode_class): Delete.
23771 For csky, HF/SF mode use the low bits of VREGS.
23773 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
23775 * gimplify.c (gimplify_decl_expr): Do not clear TREE_READONLY on a
23776 DECL which is a reference for OMP.
23778 2021-05-26 Martin Liska <mliska@suse.cz>
23780 PR gcov-profile/100751
23781 * doc/gcov.texi: Document that __gcov_dump can be called just
23782 once and that __gcov_reset resets run-time counters.
23784 2021-05-26 Martin Liska <mliska@suse.cz>
23786 * doc/install.texi: Port relevant part from install-old.texi
23787 and re-generate list of CPUs and systems.
23789 2021-05-26 Martin Liska <mliska@suse.cz>
23791 * Makefile.in: Remove it.
23792 * doc/include/fdl.texi: Update next/previous chapters.
23793 * doc/install.texi: Likewise.
23794 * doc/install-old.texi: Removed.
23796 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23798 * config/csky/csky.c (ck810_legitimate_index_p): Support
23799 "base + index" with DF mode.
23800 * config/csky/constraints.md ("Y"): New constraint for memory operands
23801 without index register.
23802 * config/csky/csky_insn_fpuv2.md (fpuv3_movdf): Use "Y" instead of "m"
23803 when mov between memory and general registers, and lower their priority.
23804 * config/csky/csky_insn_fpuv3.md (fpuv2_movdf): Likewise.
23806 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23808 * config/csky/csky.c (TARGET_PROMOTE_PROTOTYPES): Delete.
23810 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23812 * config/csky/csky.md (untyped_call): Emit clobber for return
23813 registers to mark them used.
23815 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23817 * config/csky/csky.md (cskyv2_sextend_ldbs): New.
23819 2021-05-26 Andrew Pinski <apinski@marvell.com>
23821 * match.pd (x < 0 ? ~y : y): New patterns.
23823 2021-05-26 Andrew Pinski <apinski@marvell.com>
23825 * match.pd (A?CST1:CST2): Add simplifications for A?0:+-1, A?+-1:0,
23826 A?POW2:0 and A?0:POW2.
23828 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23830 * gimple-range-gori.cc (class logical_stmt_cache): Delete.
23831 (logical_stmt_cache::logical_stmt_cache): Delete.
23832 (logical_stmt_cache::~logical_stmt_cache): Delete.
23833 (logical_stmt_cache::cache_entry::dump): Delete.
23834 (logical_stmt_cache::get_range): Delete.
23835 (logical_stmt_cache::cached_name): Delete.
23836 (logical_stmt_cache::same_cached_name): Delete.
23837 (logical_stmt_cache::cacheable_p): Delete.
23838 (logical_stmt_cache::slot_diagnostics): Delete.
23839 (logical_stmt_cache::dump): Delete.
23840 (gori_compute_cache::gori_compute_cache): Delete.
23841 (gori_compute_cache::~gori_compute_cache): Delete.
23842 (gori_compute_cache::compute_operand_range): Delete.
23843 (gori_compute_cache::cache_stmt): Delete.
23844 * gimple-range-gori.h (gori_compute::compute_operand_range): Remove
23846 (class gori_compute_cache): Delete.
23848 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23850 * gimple-range.cc (fold_using_range::range_of_range_op): Use m_gori
23852 (fold_using_range::range_of_address): Adjust.
23853 (fold_using_range::range_of_phi): Adjust.
23854 * gimple-range.h (class fur_source): Adjust.
23855 (fur_source::fur_source): Adjust.
23857 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23859 * gimple-range-gori.cc (gori_compute::expr_range_at_stmt): Rename
23860 from expr_range_in_bb and adjust.
23861 (gori_compute::compute_name_range_op): Adjust.
23862 (gori_compute::optimize_logical_operands): Adjust.
23863 (gori_compute::compute_logical_operands_in_chain): Adjust.
23864 (gori_compute::compute_operand1_range): Adjust.
23865 (gori_compute::compute_operand2_range): Adjust.
23866 (gori_compute_cache::cache_stmt): Adjust.
23867 * gimple-range-gori.h (gori_compute): Rename prototype.
23869 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23871 * gimple-range.cc (gimple_ranger::range_of_expr): Non-null should be
23872 checked only after range_of_stmt, not range_on_entry.
23873 (gimple_ranger::range_on_entry): Check for non-null in any
23874 predecessor block, if it is not already non-null.
23875 (gimple_ranger::range_on_exit): Don't check for non-null after
23876 range on entry call.
23877 (gimple_ranger::dump_bb): New. Split from dump.
23878 (gimple_ranger::dump): Adjust.
23879 * gimple-range.h (class gimple_ranger): Adjust.
23881 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23883 * gimple-range-cache.cc (struct range_timestamp): Delete.
23884 (class temporal_cache): Adjust.
23885 (temporal_cache::get_timestamp): Delete.
23886 (temporal_cache::set_dependency): Delete.
23887 (temporal_cache::temporal_value): Adjust.
23888 (temporal_cache::current_p): Take dependencies as params.
23889 (temporal_cache::set_timestamp): Adjust.
23890 (temporal_cache::set_always_current): Adjust.
23891 (ranger_cache::get_non_stale_global_range): Adjust.
23892 (ranger_cache::register_dependency): Delete.
23893 * gimple-range-cache.h (class range_cache): Adjust.
23895 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23897 * gimple-range-gori.cc (range_def_chain::range_def_chain): init
23899 (range_def_chain::~range_def_chain): Dispose of obstack rather than
23900 each individual bitmap.
23901 (range_def_chain::set_import): New.
23902 (range_def_chain::get_imports): New.
23903 (range_def_chain::chain_import_p): New.
23904 (range_def_chain::register_dependency): Rename from build_def_chain
23906 (range_def_chain::def_chain_in_bitmap_p): New.
23907 (range_def_chain::add_def_chain_to_bitmap): New.
23908 (range_def_chain::has_def_chain): Just check first dependence.
23909 (range_def_chain::get_def_chain): Process imports, use generic
23910 register_dependency routine.
23911 (range_def_chain::dump): New.
23912 (gori_map::gori_map): Allocate import list.
23913 (gori_map::~gori_map): Release imports.
23914 (gori_map::exports): Check for past allocated block size.
23915 (gori_map::imports): New.
23916 (gori_map::def_chain_in_export_p): Delete.
23917 (gori_map::is_import_p): New.
23918 (gori_map::maybe_add_gori): Handle imports.
23919 (gori_map::dump): Adjust output, add imports.
23920 (gori_compute::has_edge_range_p): Remove def_chain_in_export call.
23921 (gori_export_iterator::gori_export_iterator): New.
23922 (gori_export_iterator::next): New.
23923 (gori_export_iterator::get_name): New.
23924 * gimple-range-gori.h (range_def_chain): Add imports and direct
23925 dependencies via struct rdc.
23926 (range_def_chain::depend1): New.
23927 (range_def_chain::depend2): New.
23928 (class gori_map): Adjust.
23929 (FOR_EACH_GORI_IMPORT_NAME): New.
23930 (FOR_EACH_GORI_EXPORT_NAME): New.
23931 (class gori_export_iterator): New.
23933 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23935 * gimple-range-cache.cc (ranger_cache::ranger_cache): Move initial
23936 export cache filling to here.
23937 * gimple-range-gori.cc (gori_compute::gori_compute): From here.
23939 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23941 * gimple-range-gori.cc (range_def_chain): Move to gimple-range-gori.h.
23942 (gori_map): Move to gimple-range-gori.h.
23943 (gori_compute::gori_compute): Adjust.
23944 (gori_compute::~gori_compute): Delete.
23945 (gori_compute::compute_operand_range_switch): Adjust.
23946 (gori_compute::compute_operand_range): Adjust.
23947 (gori_compute::compute_logical_operands): Adjust.
23948 (gori_compute::has_edge_range_p): Adjust.
23949 (gori_compute::set_range_invariant): Delete.
23950 (gori_compute::dump): Adjust.
23951 (gori_compute::outgoing_edge_range_p): Adjust.
23952 * gimple-range-gori.h (class range_def_chain): Relocate here.
23953 (class gori_map): Relocate here.
23954 (class gori_compute): Inherit from gori_map, and adjust.
23956 2021-05-25 Aldy Hernandez <aldyh@redhat.com>
23958 * value-range.cc (range_tests_legacy): Use
23959 build_nonstandard_integer_type instead of int and short.
23961 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
23963 * gimplify.c (gimplify_decl_expr): Clear TREE_READONLY on the DECL
23964 when really creating an initialization statement for it.
23966 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
23968 * tree-inline.c (setup_one_parameter): Fix thinko in new condition.
23970 2021-05-25 Kito Cheng <kito.cheng@sifive.com>
23972 * config/riscv/riscv.h (ASM_SPEC): Pass -mno-relax.
23974 2021-05-25 Martin Liska <mliska@suse.cz>
23976 PR tree-optimization/92860
23978 * optc-save-gen.awk: Remove exceptions.
23980 2021-05-25 Martin Liska <mliska@suse.cz>
23982 * asan.h (sanitize_coverage_p): New function.
23983 * doc/extend.texi: Document it.
23984 * fold-const.c (fold_range_test): Use sanitize_flags_p
23985 instead of flag_sanitize_coverage.
23986 (fold_truth_andor): Likewise.
23987 * sancov.c: Likewise.
23988 * tree-ssa-ifcombine.c (ifcombine_ifandif): Likewise.
23989 * ipa-inline.c (sanitize_attrs_match_for_inline_p): Handle
23990 -fsanitize-coverage when inlining.
23992 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
23994 * config/csky/csky-modes.def: Fix copyright.
23996 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
23998 * config/csky/csky-modes.def: Amend copyright.
23999 * config/csky/csky_insn_fpuv2.md: Likewise.
24000 * config/csky/csky_insn_fpuv3.md: Likewise.
24002 2021-05-25 Richard Biener <rguenther@suse.de>
24004 PR middle-end/100727
24005 * calls.c (initialize_argument_information): Explicitly test
24006 for WITH_SIZE_EXPR.
24007 * gimple-expr.c (mark_addressable): Skip outer WITH_SIZE_EXPR.
24009 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
24011 * config/csky/csky.h (FRAME_POINTER_REGNUM): Use
24012 HARD_FRAME_POINTER_REGNUM and FRAME_POINTER_REGNUM instead of
24013 the single definition. The single definition may not work well
24014 at simplify_subreg_regno().
24015 (HARD_FRAME_POINTER_REGNUM): New.
24016 (ELIMINABLE_REGS): Add for HARD_FRAME_POINTER_REGNUM.
24017 * config/csky/csky.c (get_csky_live_regs, csky_can_eliminate,
24018 csky_initial_elimination_offset, csky_expand_prologue,
24019 csky_expand_epilogue): Add for HARD_FRAME_POINTER_REGNUM.
24021 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
24023 * config/csky/csky.c (csky_option_override):
24024 Init csky_arch_isa_features[] in advance, so TARGET_DSP
24025 and TARGET_DIV can be set well.
24027 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
24029 * config/csky/constraints.md ("l", "h"): Delete.
24030 * config/csky/csky.h (reg_class, REG_CLASS_NAMES,
24031 REG_CLASS_CONTENTS): Delete LO_REGS and HI_REGS.
24032 * config/csky/csky.c (regno_reg_class,
24033 csky_secondary_reload, csky_register_move_cost):
24034 Use HILO_REGS instead of LO_REGS and HI_REGS.
24036 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
24038 * config/csky/constraints.md ("W"): New constraint for mem operand
24039 with base reg, index register.
24040 ("Q"): Renamed and modified "csky_valid_fpuv2_mem_operand" to
24041 "csky_valid_mem_constraint_operand" to deal with both "Q" and "W"
24043 ("Dv"): New constraint for const double value that can be used at
24045 * config/csky/csky-modes.def (HFmode): New mode.
24046 * config/csky/csky-protos.h (csky_valid_fpuv2_mem_operand): Rename
24047 to "csky_valid_mem_constraint_operand" and support new constraint
24049 (csky_get_movedouble_length): New.
24050 (fpuv3_output_move): New.
24051 (fpuv3_const_double): New.
24052 * config/csky/csky.c (csky_option_override): New arch CK860 with fpv3.
24053 (decompose_csky_address): Refine.
24054 (csky_print_operand): New "CONST_DOUBLE" operand.
24055 (csky_output_move): Support fpv3 instructions.
24056 (csky_get_movedouble_length): New.
24057 (fpuv3_output_move): New.
24058 (fpuv3_const_double): New.
24059 (csky_emit_compare): Cover float comparison.
24060 (csky_emit_compare_float): Refine.
24061 (csky_valid_fpuv2_mem_operand): Rename to
24062 "csky_valid_mem_constraint_operand" and support new constraint "W".
24063 (ck860_rtx_costs): New.
24064 (csky_rtx_costs): Add the cost calculation of CK860.
24065 (regno_reg_class): New vregs for fpuv3.
24066 (csky_dbx_regno): Likewise.
24067 (csky_cpu_cpp_builtins): New builtin macro for fpuv3.
24068 (csky_conditional_register_usage): Support fpuv3.
24069 (csky_dwarf_register_span): Support fpuv3.
24070 (csky_init_builtins, csky_mangle_type): Support "__fp16" type.
24071 (ck810_legitimate_index_p): Support fp16.
24072 * config/csky/csky.h (TARGET_TLS): Add CK860.
24073 (CSKY_VREG_P, CSKY_VREG_LO_P, CSKY_VREG_HI_P): Support fpuv3.
24074 (TARGET_SINGLE_FPU): Support fpuv3.
24075 (TARGET_SUPPORT_FPV3): New.
24076 (FIRST_PSEUDO_REGISTER): Change to 202 to hold the new fpuv3 registers.
24077 (FIXED_REGISTERS, CALL_REALLY_USED_REGISTERS, REGISTER_NAMES,
24078 REG_CLASS_CONTENTS): Support fpuv3.
24079 * config/csky/csky.md (movsf): Move to csky_insn_fpu.md and refine.
24080 (csky_movsf_fpv2): Likewise.
24081 (ck801_movsf): Likewise.
24082 (csky_movsf): Likewise.
24084 (csky_movdf_fpv2): Likewise.
24085 (ck801_movdf): Likewise.
24086 (csky_movdf): Likewise.
24087 (movsicc): Refine. Use "comparison_operator" instead of
24088 "ordered_comparison_operator".
24089 (addsicc): Likewise.
24090 (CSKY_FIRST_VFP3_REGNUM, CSKY_LAST_VFP3_REGNUM): New constant.
24091 (call_value_internal_vh): New.
24092 * config/csky/csky_cores.def (CK860): New arch and cpu.
24097 * config/csky/csky_insn_fpu.md: Refactor. Separate all float patterns
24098 into emit-patterns and match-patterns, keep the emit-patterns here,
24099 and move the match-patterns to csky_insn_fpuv2.md or
24100 csky_insn_fpuv3.md.
24101 * config/csky/csky_insn_fpuv2.md: New file for fpuv2 instructions.
24102 * config/csky/csky_insn_fpuv3.md: New file and new patterns for fpuv3
24104 * config/csky/csky_isa.def (fcr): New.
24109 (CK860): New definition for ck860.
24110 * config/csky/csky_tables.opt (ck860): New processors ck860,
24111 ck860f. And new arch ck860.
24116 * config/csky/predicates.md (csky_float_comparison_operator): Delete
24117 "geu", "gtu", "leu", "ltu", which will never appear at float comparison.
24118 * config/csky/t-csky-elf: Support 860.
24119 * config/csky/t-csky-linux: Likewise.
24120 * doc/md.texi: Add "Q" and "W" constraints for C-SKY.
24122 2021-05-24 Aaron Sawdey <acsawdey@linux.ibm.com>
24124 * config/rs6000/genfusion.pl (gen_logical_addsubf): Refactor to
24125 add generation of logical-add and add-logical fusion pairs.
24126 * config/rs6000/rs6000-cpus.def: Add new fusion to ISA 3.1 mask
24128 * config/rs6000/rs6000.c (rs6000_option_override_internal): Turn on
24129 logical-add and add-logical fusion by default.
24130 * config/rs6000/rs6000.opt: Add -mpower10-fusion-logical-add and
24131 -mpower10-fusion-add-logical options.
24132 * config/rs6000/fusion.md: Regenerate file.
24134 2021-05-24 Aldy Hernandez <aldyh@redhat.com>
24136 * value-range.cc (irange::legacy_equal_p): Check type when
24137 comparing VR_VARYING types.
24138 (range_tests_legacy): Test comparing VARYING ranges of different
24141 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
24143 * config/aarch64/aarch64.c (neoversen1_tunings):
24144 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
24146 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
24148 * config/aarch64/aarch64.c (aarch64_classify_symbol): Use GOT for
24149 extern weak symbols. Limit symbol offsets for non-GOT symbols with
24152 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
24154 * config/arm/neon.md (vec_load_lanesxi<mode>)
24155 (vec_store_lanesxi<mode>): Move ...
24156 * config/arm/vec-common.md: ... here.
24158 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
24160 * config/arm/neon.md (vec_load_lanesoi<mode>)
24161 (vec_store_lanesoi<mode>): Move ...
24162 * config/arm/vec-common.md: ... here.
24164 2021-05-24 liuhongt <hongtao.liu@intel.com>
24167 * config/i386/i386.c (ix86_gimple_fold_builtin): Replace
24168 stmt with GIMPLE_NOP when lhs doesn't exist.
24170 2021-05-23 Uroš Bizjak <ubizjak@gmail.com>
24173 * config/i386/mmx.md (*push<VI_32:mode>2_rex64):
24174 New instruction pattern.
24175 (*push<VI_32:mode>2): Ditto.
24176 (push splitter for SSE registers): New splitter.
24178 2021-05-23 Andrew Pinski <apinski@marvell.com>
24180 * match.pd ((A & C) != 0 ? D : 0): Limit to non pointer types.
24182 2021-05-22 Aaron Sawdey <acsawdey@linux.ibm.com>
24184 * config/rs6000/genfusion.pl (gen_addadd): Fix incorrect attr types.
24185 * config/rs6000/fusion.md: Regenerate file.
24187 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
24189 * config/rs6000/genfusion.pl (gen_addadd): New function.
24190 * config/rs6000/fusion.md: Regenerate file.
24191 * config/rs6000/rs6000-cpus.def: Add
24192 OPTION_MASK_P10_FUSION_2ADD to masks.
24193 * config/rs6000/rs6000.c (rs6000_option_override_internal):
24194 Handle default value of OPTION_MASK_P10_FUSION_2ADD.
24195 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2add.
24197 2021-05-21 Jakub Jelinek <jakub@redhat.com>
24199 PR middle-end/99928
24200 * tree.h (OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET): Define.
24201 * gimplify.c (enum gimplify_omp_var_data): Fix up
24202 GOVD_MAP_HAS_ATTACHMENTS value, add GOVD_FIRSTPRIVATE_IMPLICIT.
24203 (omp_lastprivate_for_combined_outer_constructs): If combined target
24204 has GOVD_FIRSTPRIVATE_IMPLICIT set for the decl, change it to
24205 GOVD_MAP | GOVD_SEEN.
24206 (gimplify_scan_omp_clauses): Set GOVD_FIRSTPRIVATE_IMPLICIT for
24207 firstprivate clauses with OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT.
24208 (gimplify_adjust_omp_clauses): For firstprivate clauses with
24209 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT either clear that bit and
24210 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET too, or remove it and
24211 let it be replaced by implicit map clause.
24213 2021-05-21 Jakub Jelinek <jakub@redhat.com>
24215 PR middle-end/99928
24216 * gimplify.c (omp_lastprivate_for_combined_outer_constructs): New
24218 (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LASTPRIVATE>: Use it.
24219 (gimplify_omp_for): Likewise.
24221 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24223 PR middle-end/90115
24224 * omp-low.c (oacc_privatization_candidate_p): Reject 'static',
24225 'external' in blocks.
24227 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24229 PR middle-end/90115
24230 * flag-types.h (enum openacc_privatization): New.
24231 * params.opt (-param=openacc-privatization): New.
24232 * doc/invoke.texi (openacc-privatization): Document it.
24233 * omp-general.h (get_openacc_privatization_dump_flags): New
24235 * omp-low.c (oacc_privatization_candidate_p): Add diagnostics.
24236 * omp-offload.c (execute_oacc_device_lower)
24237 <IFN_UNIQUE_OACC_PRIVATE>: Re-work diagnostics.
24238 * target.def (goacc.adjust_private_decl): Add 'location_t'
24240 * doc/tm.texi: Regenerate.
24241 * config/gcn/gcn-protos.h (gcn_goacc_adjust_private_decl): Adjust.
24242 * config/gcn/gcn-tree.c (gcn_goacc_adjust_private_decl): Likewise.
24243 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl):
24244 Likewise. Preserve it for...
24245 (nvptx_goacc_expand_var_decl): ... use here.
24247 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24249 * doc/sourcebuild.texi (Other attributes): Document '__OPTIMIZE__'
24252 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24254 PR middle-end/90115
24255 * omp-low.c (oacc_privatization_candidate_p): New function.
24256 (oacc_privatization_scan_clause_chain)
24257 (oacc_privatization_scan_decl_chain): Use it. Also
24258 'gcc_checking_assert' that we're not seeing duplicates.
24260 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24262 PR middle-end/90115
24263 * omp-offload.c (execute_oacc_device_lower): Skip processing if no
24266 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24268 PR middle-end/90115
24269 * omp-offload.c (execute_oacc_device_lower): Explain.
24271 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24273 PR middle-end/90115
24274 * omp-offload.c (execute_oacc_device_lower)
24275 <IFN_UNIQUE_OACC_PRIVATE>: Diagnose and handle for 'level == -1'
24277 * internal-fn.c (expand_UNIQUE): Don't expect
24278 'IFN_UNIQUE_OACC_PRIVATE'.
24280 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24282 PR middle-end/90115
24283 * omp-low.c (lower_omp_for): Don't evaluate OpenMP 'for' clauses.
24285 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24287 PR middle-end/90115
24288 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl)
24289 (nvptx_goacc_expand_var_decl): Tighten.
24291 2021-05-21 Julian Brown <julian@codesourcery.com>
24292 Chung-Lin Tang <cltang@codesourcery.com>
24293 Thomas Schwinge <thomas@codesourcery.com>
24295 PR middle-end/90115
24296 * doc/tm.texi.in (TARGET_GOACC_EXPAND_VAR_DECL)
24297 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Add documentation hooks.
24298 * doc/tm.texi: Regenerate.
24299 * expr.c (expand_expr_real_1): Expand decls using the
24300 expand_var_decl OpenACC hook if defined.
24301 * internal-fn.c (expand_UNIQUE): Handle IFN_UNIQUE_OACC_PRIVATE.
24302 * internal-fn.h (IFN_UNIQUE_CODES): Add OACC_PRIVATE.
24303 * omp-low.c (omp_context): Add oacc_privatization_candidates
24305 (lower_oacc_reductions): Add PRIVATE_MARKER parameter. Insert
24307 (lower_oacc_head_tail): Add PRIVATE_MARKER parameter. Modify
24308 private marker's gimple call arguments, and pass it to
24309 lower_oacc_reductions.
24310 (oacc_privatization_scan_clause_chain)
24311 (oacc_privatization_scan_decl_chain, lower_oacc_private_marker):
24313 (lower_omp_for, lower_omp_target, lower_omp_1): Use these.
24314 * omp-offload.c (convert.h): Include.
24315 (oacc_loop_xform_head_tail): Treat private-variable markers like
24316 fork/join when transforming head/tail sequences.
24317 (struct var_decl_rewrite_info): Add struct.
24318 (oacc_rewrite_var_decl, is_sync_builtin_call): New functions.
24319 (execute_oacc_device_lower): Support rewriting gang-private
24320 variables using target hook, and fix up addr_expr and var_decl
24322 * target.def (adjust_private_decl, expand_var_decl): New hooks.
24323 * config/gcn/gcn-protos.h (gcn_goacc_adjust_gangprivate_decl):
24325 (gcn_goacc_adjust_private_decl): ...this.
24326 * config/gcn/gcn-tree.c (gcn_goacc_adjust_gangprivate_decl):
24328 (gcn_goacc_adjust_private_decl): ...this. Add LEVEL parameter.
24329 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_GANGPRIVATE_DECL): Rename
24330 definition using gcn_goacc_adjust_gangprivate_decl...
24331 (TARGET_GOACC_ADJUST_PRIVATE_DECL): ...to this, using
24332 gcn_goacc_adjust_private_decl.
24333 * config/nvptx/nvptx.c (tree-pretty-print.h): Include.
24334 (gang_private_shared_size): New global variable.
24335 (gang_private_shared_align): Likewise.
24336 (gang_private_shared_sym): Likewise.
24337 (gang_private_shared_hmap): Likewise.
24338 (nvptx_option_override): Initialize these.
24339 (nvptx_file_end): Output gang_private_shared_sym.
24340 (nvptx_goacc_adjust_private_decl, nvptx_goacc_expand_var_decl):
24342 (nvptx_set_current_function): Clear gang_private_shared_hmap.
24343 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Define hook.
24344 (TARGET_GOACC_EXPAND_VAR_DECL): Likewise.
24346 2021-05-21 H.J. Lu <hjl.tools@gmail.com>
24348 * config/i386/i386-modes.def (MAX_BITSIZE_MODE_ANY_INT): Removed.
24350 2021-05-21 Richard Biener <rguenther@suse.de>
24351 H.J. Lu <hjl.tools@gmail.com>
24353 PR middle-end/90773
24354 * expr.c (expand_constructor): Elide expand_constructor if
24355 move by pieces is preferred.
24357 2021-05-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24359 * config/aarch64/aarch64-builtins.c (aarch64_call_properties):
24360 Take a flag and mode value as arguments.
24361 (aarch64_modifies_global_state_p): Likewise.
24362 (aarch64_reads_global_state_p): Likewise.
24363 (aarch64_could_trap_p): Likewise.
24364 (aarch64_get_attributes): Likewise.
24365 (aarch64_init_simd_builtins): Adjust callsite of above.
24366 (aarch64_init_fcmla_laneq_builtins): Use aarch64_get_attributes to get
24367 function attributes to apply to builtins.
24368 (aarch64_init_crc32_builtins): Likewise.
24369 (aarch64_init_builtin_rsqrt): Likewise.
24371 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
24373 * config/rs6000/rs6000.md (define_attr "type"): Add types for fusion.
24374 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10): Use new fusion types.
24375 (gen_2logical): Use new fusion types.
24376 * config/rs6000/fusion.md: Regenerate.
24378 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
24381 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
24382 Handle V4QI and V2HI modes.
24383 (ix86_expand_sse_movcc): Ditto.
24384 * config/i386/mmx.md (*<sat_plusminus:insn><VI_32:mode>3):
24385 New instruction pattern.
24386 (*eq<VI_32:mode>3): Ditto.
24387 (*gt<VI_32:mode>3): Ditto.
24388 (*xop_pcmov_<VI_32:mode>): Ditto.
24389 (mmx_pblendvb32): Ditto.
24390 (mmx_pblendvb64): Rename from mmx_pblendvb.
24391 (vec_cmp<VI_32:mode><VI_32:mode>): New expander.
24392 (vec_cmpu<VI_32:mode><VI_32:mode>): Ditto.
24393 (vcond<VI_32:mode><VI_32:mode>): Ditto.
24394 (vcondu<VI_32:mode><VI_32:mode>): Ditto.
24395 (vcond_mask_<VI_32:mode><VI_32:mode>): Ditto.
24397 2021-05-21 Jakub Jelinek <jakub@redhat.com>
24399 PR tree-optimization/94589
24400 * tree-ssa-phiopt.c (spaceship_replacement): For integral rhs1 and
24401 rhs2, treat x <= 4 equivalently to x < 5 etc. In cmp1 and cmp2 (if
24402 not the same as cmp3) treat <= the same as < and >= the same as >.
24403 Don't require that cond2_phi_edge is true edge, instead take
24404 false/true edges into account based on cmp1/cmp2 comparison kinds.
24406 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
24409 * config/i386/mmx.md (SMAXMIN_MMXMODEI): New mode iterator.
24410 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Macroize expander
24411 from <smaxmin:code>v4hi3 and <smaxmin:code><MMXMODE14:mode>3
24412 using SMAXMIN_MMXMODEI mode iterator.
24413 (*<smaxmin:code>v4qi3): New insn pattern.
24414 (*<smaxmin:code>v2hi3): Ditto.
24415 (SMAXMIN_VI_32): New mode iterator.
24416 (<smaxmin:code><SMAXMIN_VI_32:mode>3): New expander.
24417 (UMAXMIN_MMXMODEI): New mode iterator.
24418 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Macroize expander
24419 from <umaxmin:code>v8qi3 and <umaxmin:code><MMXMODE24:mode>3
24420 using UMAXMIN_MMXMODEI mode iterator.
24421 (*<umaxmin:code>v4qi3): New insn pattern.
24422 (*<umaxmin:code>v2hi3): Ditto.
24423 (UMAXMIN_VI_32): New mode iterator.
24424 (<umaxmin:code><UMAXMIN_VI_32:mode>3): New expander.
24425 (abs<VI_32:mode>2): New insn pattern.
24426 (ssse3_abs<MMXMODEI:mode>2, abs<MMXMODEI:mode>2): Move from ...
24427 * config/i386/sse.md: ... here.
24429 2021-05-20 Clement Chigot <clement.chigot@atos.net>
24430 David Edelsohn <dje.gcc@gmail.com>
24432 * collect2.c (scan_prog_file): Issue non-fatal warning for
24435 2021-05-20 Jonathan Wakely <jwakely@redhat.com>
24437 * doc/invoke.texi (-Wno-c++11-extensions)
24438 (-Wno-c++14-extensions, -Wno-c++17-extensions)
24439 (-Wno-c++20-extensions, -Wno-c++23-extensions): Document
24442 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
24444 * config/c6x/c6x.c (c6x_output_file_unwind): Use dwarf_debuginfo_p.
24445 * config/darwin.c (darwin_override_options): Likewise.
24446 * config/i386/cygming.h (DBX_REGISTER_NUMBER): Likewise.
24447 * config/i386/darwin.h (DBX_REGISTER_NUMBER): Likewise.
24448 (DWARF2_FRAME_REG_OUT): Likewise.
24449 * config/mips/mips.c (mips_output_filename): Likewise.
24450 * config/rs6000/rs6000.c (rs6000_xcoff_declare_function_name):
24452 (rs6000_dbx_register_number): Likewise.
24453 * dbxout.c: Include flags.h.
24454 * dwarf2cfi.c (cfi_label_required_p): Likewise.
24455 (dwarf2out_do_frame): Likewise.
24456 * except.c: Include flags.h.
24457 * final.c (dwarf2_debug_info_emitted_p): Likewise.
24458 (final_scan_insn_1): Likewise.
24459 * flags.h (dwarf_debuginfo_p): New function declaration.
24460 * opts.c (dwarf_debuginfo_p): New function definition.
24461 * targhooks.c (default_debug_unwind_info): Use dwarf_debuginfo_p.
24462 * toplev.c (process_options): Likewise.
24464 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
24466 * common.opt: Change type to support bitmasks.
24467 * flag-types.h (enum debug_info_type): Rename enumerator constants.
24468 (NO_DEBUG): New bitmask.
24469 (DBX_DEBUG): Likewise.
24470 (DWARF2_DEBUG): Likewise.
24471 (XCOFF_DEBUG): Likewise.
24472 (VMS_DEBUG): Likewise.
24473 (VMS_AND_DWARF2_DEBUG): Likewise.
24474 * flags.h (debug_set_to_format): New function declaration.
24475 (debug_set_count): Likewise.
24476 (debug_set_names): Likewise.
24477 * opts.c (debug_type_masks): Array of bitmasks for debug formats.
24478 (debug_set_to_format): New function definition.
24479 (debug_set_count): Likewise.
24480 (debug_set_names): Likewise.
24481 (set_debug_level): Update access to debug_type_names.
24482 * toplev.c: Likewise.
24484 2021-05-20 Martin Sebor <msebor@redhat.com>
24486 PR middle-end/100684
24487 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Handle C++ lambda.
24489 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
24492 * config/i386/i386.md (isa): Remove x64_bmi.
24493 (enabled): Remove x64_bmi.
24494 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
24495 Remove general register alternative.
24496 (*andnot<VI_32:mode>3): Ditto.
24497 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
24498 (*<any_logic:code><VI_32:mode>3): Ditto.
24500 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
24502 * config/arm/arm.c: Include header files tree-vectorizer.h and
24505 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
24508 * config/i386/mmx.md (Yv_Yw): Revert adding V4QI and V2HI modes.
24509 (*<plusminus:insn><VI_32:mode>3): Use Yw instead of <Yv_Yw> constraint.
24510 (<s>mulv4hi3_highpart): New expander.
24511 (*<s>mulv2hi3_highpart): New insn pattern.
24512 (<s>mulv2hi3_highpart): New expander.
24513 (*<any_shift:insn>v2hi3): New insn pattern.
24514 (<any_shift:insn>v2hi3): New expander.
24515 * config/i386/sse.md (smulhrsv2hi3): New expander.
24516 (*smulhrsv2hi3): New insn pattern.
24518 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
24520 * doc/invoke.texi (vect-inner-loop-cost-factor): Document new
24522 * params.opt (vect-inner-loop-cost-factor): New.
24523 * targhooks.c (default_add_stmt_cost): Replace hardcoded factor
24524 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR, include header file
24525 tree-vectorizer.h and its required ones.
24526 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Replace
24527 hardcoded factor 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR.
24528 * config/arm/arm.c (arm_add_stmt_cost): Likewise.
24529 * config/i386/i386.c (ix86_add_stmt_cost): Likewise.
24530 * config/rs6000/rs6000.c (rs6000_add_stmt_cost): Likewise.
24531 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
24533 (_loop_vec_info::_loop_vec_info): Init inner_loop_cost_factor.
24534 * tree-vectorizer.h (_loop_vec_info): Add inner_loop_cost_factor.
24535 (LOOP_VINFO_INNER_LOOP_COST_FACTOR): New macro.
24537 2021-05-20 Christophe Lyon <christophe.lyon@linaro.org>
24538 Torbjörn Svensson <torbjorn.svensson@st.com>
24541 * doc/cpp.texi (Common Predefined Macros): Document __FILE_NAME__.
24543 2021-05-20 Jakub Jelinek <jakub@redhat.com>
24545 PR middle-end/99928
24546 * gimplify.c (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LINEAR>: For
24547 explicit linear clause when combined with target, make it map(tofrom:)
24548 instead of no clause or firstprivate.
24550 2021-05-20 Jakub Jelinek <jakub@redhat.com>
24552 PR tree-optimization/94589
24553 * match.pd ((X & Y) == X -> (X & ~Y) == 0): Simplify even in presence
24554 of integral conversions.
24556 2021-05-19 Andrew MacLeod <amacleod@redhat.com>
24558 * gimple-range.cc (fur_source::get_operand): New.
24559 (gimple_range_fold): Delete.
24560 (fold_using_range::fold_stmt): Move from gimple_ranger::calc_stmt.
24561 (fold_using_range::range_of_range_op): Move from gimple_ranger.
24562 (fold_using_range::range_of_address): Ditto.
24563 (fold_using_range::range_of_phi): Ditto.
24564 (fold_using_range::range_of_call): Ditto.
24565 (fold_using_range::range_of_builtin_ubsan_call): Move from
24566 range_of_builtin_ubsan_call.
24567 (fold_using_range::range_of_builtin_call): Move from
24568 range_of_builtin_call.
24569 (gimple_ranger::range_of_builtin_call): Delete.
24570 (fold_using_range::range_of_cond_expr): Move from gimple_ranger.
24571 (gimple_ranger::fold_range_internal): New.
24572 (gimple_ranger::range_of_stmt): Use new fold_using_range API.
24573 (fold_using_range::range_of_ssa_name_with_loop_info): Move from
24574 gimple_ranger. Improve ranges of SSA_NAMES when possible.
24575 * gimple-range.h (gimple_ranger): Remove various range_of routines.
24576 (class fur_source): New.
24577 (class fold_using_range): New.
24578 (fur_source::fur_source): New.
24580 * vr-values.c (vr_values::extract_range_basic): Use fold_using_range
24581 instead of range_of_builtin_call.
24583 2021-05-19 Jonathan Wakely <jwakely@redhat.com>
24585 * doc/cpp.texi (Common Predefined Macros): Update documentation
24586 for the __GXX_EXPERIMENTAL_CXX0X__ macro.
24588 2021-05-19 Alex Coplan <alex.coplan@arm.com>
24591 * config/arm/arm.md (nonsecure_call_internal): Always ensure
24592 callee's address is in a register.
24594 2021-05-19 Geng Qi <gengqi@linux.alibaba.com>
24596 * common/config/riscv/riscv-common.c
24597 (riscv_subset_list::parsing_subset_version): Properly parse the letter
24599 (riscv_subset_list::parse_std_ext,
24600 riscv_subset_list::parse_multiletter_ext): Handle errors generated
24601 in riscv_subset_list::parsing_subset_version.
24603 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24605 * config/aarch64/aarch64-simd.md: Use "neon_move_narrow_q"
24606 type attribute in patterns generating XTN(2).
24608 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24610 * config/aarch64/aarch64-simd.md (aarch64_simd_vec_pack_trunc_<mode>):
24611 Remove as duplicate of...
24612 (aarch64_xtn<mode>): This.
24613 (aarch64_xtn2<mode>_le): Move position in file.
24614 (aarch64_xtn2<mode>_be): Move position in file.
24615 (aarch64_xtn2<mode>): Move position in file.
24616 (vec_pack_trunc_<mode>): Define as an expander.
24618 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24620 * config/aarch64/aarch64-simd-builtins.def: Split builtin
24621 generation for aarch64_<sur>q<r>shr<u>n_n<mode> pattern into
24622 separate scalar and vector generators.
24623 * config/aarch64/aarch64-simd.md
24624 (aarch64_<sur>q<r>shr<u>n_n<mode>): Define as an expander and
24626 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_le): This and...
24627 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_be): This.
24628 * config/aarch64/iterators.md: Define SD_HSDI iterator.
24630 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24632 * config/aarch64/aarch64-simd.md: Use UNSPEC_SQXTUN instead
24634 * config/aarch64/iterators.md: Remove UNSPEC_SQXTUN2.
24636 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24638 * config/aarch64/aarch64-simd.md (aarch64_<sur>q<r>shr<u>n2_n<mode>):
24639 Implement as an expand emitting a big/little endian
24640 instruction pattern.
24641 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_le): Define.
24642 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_be): Define.
24644 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24646 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn2<mode>):
24647 Implement as an expand emitting a big/little endian
24648 instruction pattern.
24649 (aarch64_<sur><addsub>hn2<mode>_insn_le): Define.
24650 (aarch64_<sur><addsub>hn2<mode>_insn_be): Define.
24651 * config/aarch64/iterators.md: Remove UNSPEC_[R]ADDHN2 and
24652 UNSPEC_[R]SUBHN2 unspecs and ADDSUBHN2 iterator.
24654 2021-05-19 Richard Biener <rguenther@suse.de>
24656 PR middle-end/100672
24657 * fold-const.c (fold_negate_expr_1): Use element_precision.
24658 (negate_expr_p): Likewise.
24660 2021-05-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
24662 * config/aarch64/iterators.md (SVE_PRED_LOAD): New iterator.
24663 (pred_load): New int attribute.
24664 * config/aarch64/aarch64-sve.md
24665 (aarch64_load_<ANY_EXTEND:optab><SVE_HSDI:mode><SVE_PARTIAL_I:mode>): Use
24666 SVE_PRED_LOAD enum iterator and corresponding pred_load attribute.
24667 * config/aarch64/aarch64-sve-builtins-base.cc (expand): Update call to
24668 code_for_aarch64_load.
24670 2021-05-19 Richard Biener <rguenther@suse.de>
24672 * cfgexpand.c (discover_nonconstant_array_refs_r): Make
24673 sure TARGET_MEM_REF bases are expanded as memory.
24674 * tree-ssa-operands.c (operands_scanner::get_tmr_operands):
24675 Do not mark TARGET_MEM_REF bases addressable.
24676 * tree-ssa.c (non_rewritable_mem_ref_base): Handle
24677 TARGET_MEM_REF bases as never rewritable.
24678 * gimple-walk.c (walk_stmt_load_store_addr_ops): Do not
24679 walk TARGET_MEM_REF bases as address-takens.
24680 * tree-ssa-dce.c (ref_may_be_aliased): Handle TARGET_MEM_REF.
24682 2021-05-19 Richard Biener <rguenther@suse.de>
24684 * builtins.c (get_object_alignment_1): Strip outer
24686 * tree-dfa.c (get_ref_base_and_extent): Handle outer
24687 WITH_SIZE_EXPR for size processing and process the
24689 * tree-ssa-alias.c (ao_ref_base_alias_set): Strip
24690 outer WITH_SIZE_EXPR.
24691 (ao_ref_base_alias_ptr_type): Likewise.
24692 (refs_may_alias_p_2): Allow WITH_SIZE_EXPR in ref->ref
24693 and handle that accordingly, stripping it for the
24694 core alias workers.
24695 * tree.c (get_base_address): Handle WITH_SIZE_EXPR by
24696 looking through it instead of returning NULL.
24698 2021-05-19 Jakub Jelinek <jakub@redhat.com>
24700 PR middle-end/100576
24701 * builtins.c (check_read_access): Convert bound to size_type_node if
24704 2021-05-19 Richard Biener <rguenther@suse.de>
24706 * tree-cfg.c (verify_types_in_gimple_min_lval): Inline...
24707 (verify_types_in_gimple_reference): ... here. Sanitize.
24708 (verify_gimple_call): Verify references in LHS and arguments.
24709 (verify_gimple_assign_single): Reject WITH_SIZE_EXPR.
24711 2021-05-19 Uroš Bizjak <ubizjak@gmail.com>
24713 * config/i386/i386.h (VALID_INT_MODE_P):
24714 Add V8QI, V4HI and V2SI modes for TARGET_64BIT.
24715 * config/i386/i386.md (isa): Add x64_bmi.
24716 (enabled): Handle x64_bmi.
24717 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
24718 Add alternative using 64bit general registers.
24719 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
24721 2021-05-19 Jakub Jelinek <jakub@redhat.com>
24723 PR middle-end/99928
24724 * tree.h (OMP_MASTER_COMBINED): Define.
24725 * gimplify.c (gimplify_scan_omp_clauses): Rewrite lastprivate
24726 handling for outer combined/composite constructs to a loop.
24727 Handle lastprivate on combined target.
24728 (gimplify_expr): Formatting fix.
24730 2021-05-19 Xionghu Luo <luoxhu@linux.ibm.com>
24732 * passes.def: Add sink_code pass before store_merging.
24733 * tree-ssa-sink.c (pass_sink_code::clone): New.
24735 2021-05-18 Bill Schmidt <wschmidt@linux.ibm.com>
24737 * config/rs6000/freebsd64.h (ADJUST_FIELD_ALIGN): Remove call to
24738 rs6000_special_adjust_field_align_p.
24739 * config/rs6000/linux64.h (ADJUST_FIELD_ALIGN): Likewise.
24740 * config/rs6000/rs6000-call.c (rs6000_function_arg_boundary):
24741 Remove ABI warning.
24742 (rs6000_function_arg): Likewise.
24743 * config/rs6000/rs6000-protos.h
24744 (rs6000_special_adjust_field_align_p): Remove prototype.
24745 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align_p):
24747 * config/rs6000/sysv4.h (ADJUST_FIELD_ALIGN): Remove call to
24748 rs6000_special_adjust_field_align_p.
24750 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24753 * config/i386/i386.h (VALID_SSE2_REG_MODE):
24754 Add V4QI and V2HI modes.
24755 (VALID_INT_MODE_P): Ditto.
24756 * config/i386/mmx.md (VI_32): New mode iterator.
24757 (mmxvecsize): Handle V4QI and V2HI.
24759 (mov<VI_32:mode>): New expander.
24760 (*mov<mode>_internal): New insn pattern.
24761 (movmisalign<VI_32:mode>): New expander.
24762 (neg<VI_32:mode>): New expander.
24763 (<plusminus:insn><VI_32:mode>3): New expander.
24764 (*<plusminus:insn><VI_32:mode>3): New insn pattern.
24765 (mulv2hi3): New expander.
24766 (*mulv2hi3): New insn pattern.
24767 (one_cmpl<VI_32:mode>2): New expander.
24768 (*andnot<VI_32:mode>3): New insn pattern.
24769 (<any_logic:code><VI_32:mode>3): New expander.
24770 (*<any_logic:code><VI_32:mode>3): New insn pattern.
24772 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24774 * config/i386/sse.md (<any_extend:insn>v4qiv4di2):
24775 Fix a mode mismatch with operand 1.
24777 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24780 * config/i386/i386-expand.c (split_double_mode): Return
24781 temporary register when simplify_gen_subreg fails with
24782 the high half of the paradoxical subreg.
24784 2021-05-18 Richard Biener <rguenther@suse.de>
24786 * cfgexpand.c (expand_one_var): Pass in forced_stack_var
24787 and honor it when expanding.
24788 (expand_used_vars_for_block): Pass through forced_stack_var.
24789 (expand_used_vars): Likewise.
24790 (discover_nonconstant_array_refs_r): Set bits in
24791 forced_stack_vars instead of marking vars TREE_ADDRESSABLE.
24792 (avoid_type_punning_on_regs): Likewise.
24793 (discover_nonconstant_array_refs): Likewise.
24794 (pass_expand::execute): Create and pass down forced_stack_var
24795 bitmap. For parameters and returns temporarily set
24796 TREE_ADDRESSABLE when expand_function_start.
24798 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
24800 * doc/sourcebuild.texi: Document 'dg-note'.
24802 2021-05-18 Tobias Burnus <tobias@codesourcery.com>
24805 * configure: Regenerate.
24806 * configure.ac (BUILD_CFLAGS, BUILD_CXXFLAGS): Add $(CFLAGS-$@).
24808 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
24810 * gimple.h (is_gimple_omp_oacc): Tighten.
24811 * omp-low.c (check_omp_nesting_restrictions): Adjust.
24813 2021-05-18 Richard Biener <rguenther@suse.de>
24815 * tree-ssa-operands.c (mark_address_taken): Simplify.
24817 2021-05-18 Martin Liska <mliska@suse.cz>
24819 * config/gcn/mkoffload.c (STR): Redefine.
24820 * config/i386/intelmic-mkoffload.c (STR): Likewise.
24821 * config/nvptx/mkoffload.c (STR): Likewise.
24823 2021-05-18 Martin Liska <mliska@suse.cz>
24825 * common/config/aarch64/aarch64-common.c (aarch64_parse_extension):
24826 Use startswith function instead of strncmp.
24827 * common/config/bfin/bfin-common.c (bfin_handle_option): Likewise.
24828 * common/config/riscv/riscv-common.c (riscv_subset_list::parse): Likewise.
24829 * config/aarch64/aarch64-sve-builtins-shapes.cc (parse_type): Likewise.
24830 * config/aarch64/aarch64.c (aarch64_process_one_target_attr): Likewise.
24831 * config/alpha/alpha.c (alpha_elf_section_type_flags): Likewise.
24832 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
24833 * config/arm/arm.c (arm_file_start): Likewise.
24834 (arm_valid_target_attribute_rec): Likewise.
24835 (thumb1_md_asm_adjust): Likewise.
24836 * config/arm/driver-arm.c (host_detect_local_cpu): Likewise.
24837 * config/avr/avr.c (STR_PREFIX_P): Likewise.
24838 (avr_set_current_function): Likewise.
24839 (avr_handle_addr_attribute): Likewise.
24840 (avr_asm_output_aligned_decl_common): Likewise.
24841 (avr_asm_named_section): Likewise.
24842 (avr_section_type_flags): Likewise.
24843 (avr_asm_select_section): Likewise.
24844 * config/c6x/c6x.c (c6x_in_small_data_p): Likewise.
24845 (c6x_section_type_flags): Likewise.
24846 * config/darwin-c.c (darwin_cfstring_ref_p): Likewise.
24847 (darwin_objc_declare_unresolved_class_reference): Likewise.
24848 (darwin_objc_declare_class_definition): Likewise.
24849 * config/darwin.c (indirect_data): Likewise.
24850 (darwin_encode_section_info): Likewise.
24851 (darwin_objc2_section): Likewise.
24852 (darwin_objc1_section): Likewise.
24853 (machopic_select_section): Likewise.
24854 (darwin_globalize_label): Likewise.
24855 (darwin_label_is_anonymous_local_objc_name): Likewise.
24856 (darwin_asm_named_section): Likewise.
24857 (darwin_asm_output_dwarf_offset): Likewise.
24858 * config/frv/frv.c (frv_string_begins_with): Likewise.
24859 (frv_in_small_data_p): Likewise.
24860 * config/gcn/mkoffload.c (STR): Likewise.
24862 * config/i386/i386-builtins.c (get_builtin_code_for_version): Likewise.
24863 * config/i386/i386-options.c (ix86_option_override_internal): Likewise.
24864 * config/i386/i386.c (x86_64_elf_section_type_flags): Likewise.
24865 (ix86_md_asm_adjust): Likewise.
24866 * config/i386/intelmic-mkoffload.c (STR): Likewise.
24867 * config/i386/winnt.c (i386_pe_asm_named_section): Likewise.
24868 (i386_pe_file_end): Likewise.
24869 * config/ia64/ia64.c (ia64_in_small_data_p): Likewise.
24870 (ia64_section_type_flags): Likewise.
24871 * config/mips/driver-native.c (host_detect_local_cpu): Likewise.
24872 * config/mips/mips.c (mips_handle_interrupt_attr): Likewise.
24873 (mips16_stub_function_p): Likewise.
24874 (mips_function_rodata_section): Likewise.
24875 * config/msp430/msp430.c (msp430_mcu_name): Likewise.
24876 (msp430_function_section): Likewise.
24877 (msp430_section_type_flags): Likewise.
24878 (msp430_expand_helper): Likewise.
24879 * config/nios2/nios2.c (nios2_small_section_name_p): Likewise.
24880 (nios2_valid_target_attribute_rec): Likewise.
24881 * config/nvptx/mkoffload.c (process): Likewise.
24883 * config/pa/som.h: Likewise.
24884 * config/pdp11/pdp11.c (pdp11_output_ident): Likewise.
24885 * config/riscv/riscv.c (riscv_elf_select_rtx_section): Likewise.
24886 * config/rs6000/rs6000.c (VTABLE_NAME_P): Likewise.
24887 (rs6000_inner_target_options): Likewise.
24888 * config/s390/driver-native.c (s390_host_detect_local_cpu): Likewise.
24889 * config/sparc/driver-sparc.c (host_detect_local_cpu): Likewise.
24890 * config/vax/vax.c (vax_output_int_move): Likewise.
24891 * config/vms/vms-ld.c (startswith): Likewise.
24892 (process_args): Likewise.
24894 * config/vms/vms.c: Likewise.
24896 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24898 PR rtl-optimization/100590
24899 * regcprop.c (copyprop_hardreg_forward_1): Only DCE dead sets if
24900 they are NONJUMP_INSN_P.
24902 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24905 * function.c (push_dummy_function): Set DECL_ARTIFICIAL and
24906 DECL_ASSEMBLER_NAME on the fn_decl.
24908 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24910 PR tree-optimization/94589
24911 * tree-ssa-phiopt.c (spaceship_replacement): Pattern match
24912 phi result used in (res & ~1) == 0 comparison as res >= 0 as
24913 res == 2 would be UB with -ffinite-math-only.
24915 2021-05-18 Martin Liska <mliska@suse.cz>
24917 * Makefile.in: genversion.o should depend on DATESTAMP.
24919 2021-05-18 Claudiu Zissulescu <claziss@synopsys.com>
24921 * config/arc/simdext.md (negv2si2): Remove round bracket.
24923 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
24925 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Define
24926 _Bool as macro expanding to _Bool.
24928 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
24931 * tree.c (build_reference_type_for_mode)
24932 (build_pointer_type_for_mode): Pick pointer mode if MODE argument
24934 (build_reference_type, build_pointer_type): Invoke
24935 build_*_type_for_mode with VOIDmode.
24937 2021-05-17 Andrew MacLeod <amacleod@redhat.com>
24939 PR tree-optimization/100512
24940 * gimple-range-cache.cc (ranger_cache::set_global_range): Mark const
24941 and non-zero pointer ranges as invariant.
24942 * gimple-range.cc (gimple_ranger::range_of_stmt): Remove pointer
24943 processing from here.
24945 2021-05-17 Tom de Vries <tdevries@suse.de>
24948 * config/nvptx/nvptx-protos.h (nvptx_output_atomic_insn): Declare
24949 * config/nvptx/nvptx.c (nvptx_output_barrier)
24950 (nvptx_output_atomic_insn): New function.
24951 (nvptx_print_operand): Add support for 'B'.
24952 * config/nvptx/nvptx.md: Use nvptx_output_atomic_insn for atomic
24955 2021-05-17 Aldy Hernandez <aldyh@redhat.com>
24957 PR tree-optimization/100349
24958 * vr-values.c (bounds_of_var_in_loop): Bail if scev returns
24961 2021-05-17 Tamar Christina <tamar.christina@arm.com>
24963 * config/aarch64/driver-aarch64.c (DEFAULT_ARCH): New.
24964 (host_detect_local_cpu): Use it.
24966 2021-05-17 Martin Liska <mliska@suse.cz>
24968 * doc/invoke.texi: Add 2 missing dots.
24970 2021-05-17 Marius Hillenbrand <mhillen@linux.ibm.com>
24972 PR bootstrap/100552
24973 * configure.ac: Replace pattern substitution with call to sed.
24974 * configure: Regenerate.
24976 2021-05-17 Richard Biener <rguenther@suse.de>
24978 PR middle-end/100582
24979 * tree.c (array_at_struct_end_p): Get to the base of the
24980 reference before looking for the underlying decl.
24982 2021-05-17 Joern Rennecke <joern.rennecke@embecosm.com>
24984 * genoutput.c (validate_insn_alternatives): Make "wrong number of
24985 alternatives" message more specific, and remove assumption on where
24988 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24990 * config/arm/iterators.md (V16): New iterator.
24991 (VH_cvtto): New iterator.
24992 (v_cmp_result): Added V4HF and V8HF support.
24993 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result>): Use VDQWH.
24994 (vcond<mode><mode>): Likewise.
24995 (vcond_mask_<mode><v_cmp_result>): Likewise.
24996 (vcond<VH_cvtto><mode>): New expander.
24998 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
25000 * config/arm/arm-protos.h (arm_expand_vector_compare): Update
25002 * config/arm/arm.c (arm_expand_vector_compare): Add support for
25004 (arm_expand_vcond): Likewise.
25005 * config/arm/iterators.md (supf): Remove VCMPNEQ_S, VCMPEQQ_S,
25006 VCMPEQQ_N_S, VCMPNEQ_N_S.
25007 (VCMPNEQ, VCMPEQQ, VCMPEQQ_N, VCMPNEQ_N): Remove.
25008 * config/arm/mve.md (@mve_vcmp<mve_cmp_op>q_<mode>): Add '@' prefix.
25009 (@mve_vcmp<mve_cmp_op>q_f<mode>): Likewise.
25010 (@mve_vcmp<mve_cmp_op>q_n_f<mode>): Likewise.
25011 (@mve_vpselq_<supf><mode>): Likewise.
25012 (@mve_vpselq_f<mode>): Likewise.
25013 * config/arm/neon.md (vec_cmp<mode><v_cmp_result>): Enable for MVE
25014 and move to vec-common.md.
25015 (vec_cmpu<mode><mode>): Likewise.
25016 (vcond<mode><mode>): Likewise.
25017 (vcond<V_cvtto><mode>): Likewise.
25018 (vcondu<mode><v_cmp_result>): Likewise.
25019 (vcond_mask_<mode><v_cmp_result>): Likewise.
25020 * config/arm/unspecs.md (VCMPNEQ_U, VCMPNEQ_S, VCMPEQQ_S)
25021 (VCMPEQQ_N_S, VCMPNEQ_N_S, VCMPEQQ_U, VCMPEQQ_N_U, VCMPNEQ_N_U)
25022 (VCMPGEQ_N_S, VCMPGEQ_S, VCMPGTQ_N_S, VCMPGTQ_S, VCMPLEQ_N_S)
25023 (VCMPLEQ_S, VCMPLTQ_N_S, VCMPLTQ_S, VCMPCSQ_N_U, VCMPCSQ_U)
25024 (VCMPHIQ_N_U, VCMPHIQ_U): Remove.
25025 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result>): Moved
25027 (vec_cmpu<mode><mode>): Likewise.
25028 (vcond<mode><mode>): Likewise.
25029 (vcond<V_cvtto><mode>): Likewise.
25030 (vcondu<mode><v_cmp_result>): Likewise.
25031 (vcond_mask_<mode><v_cmp_result>): Likewise. Added unsafe math
25034 2021-05-17 liuhongt <hongtao.liu@intel.com>
25037 * config/i386/i386.c (ix86_gimple_fold_builtin): Use
25038 gsi_insert_seq_before instead.
25040 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
25042 * doc/sourcebuild.texi (arm_qbit_ok): Rename into...
25043 (arm_sat_ok): ...this.
25045 2021-05-17 Martin Liska <mliska@suse.cz>
25047 * lto-wrapper.c (merge_flto_options): Factor out a new function.
25048 (merge_and_complain): Use it.
25049 (run_gcc): Merge also linker command line -flto=foo argument
25052 2021-05-16 Christophe Lyon <christophe.lyon@linaro.org>
25054 * config/arm/arm.h (CPP_SPEC): Remove error message about
25055 -mlittle-endian/-mbig-endian conflict.
25057 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
25059 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
25060 __ROP_PROTECT__ if -mrop-protect is selected.
25062 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
25064 * config/rs6000/rs6000-internal.h (rs6000_stack): Add
25065 rop_hash_save_offset and rop_hash_size.
25066 * config/rs6000/rs6000-logue.c (rs6000_stack_info): Compute
25067 rop_hash_size and rop_hash_save_offset.
25068 (debug_stack_info): Dump rop_hash_save_offset and rop_hash_size.
25069 (rs6000_emit_prologue): Emit hashst[p] in prologue.
25070 (rs6000_emit_epilogue): Emit hashchk[p] in epilogue.
25071 * config/rs6000/rs6000.md (unspec): Add UNSPEC_HASHST and
25073 (hashst): New define_insn.
25074 (hashchk): Likewise.
25076 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
25078 * config/rs6000/rs6000.c (rs6000_option_override_internal):
25079 Disable shrink wrap when inserting ROP-protect instructions.
25080 * config/rs6000/rs6000.opt (mrop-protect): New option.
25081 (mprivileged): Likewise.
25082 * doc/invoke.texi: Document mrop-protect and mprivileged.
25084 2021-05-15 Hans-Peter Nilsson <hp@axis.com>
25086 * reorg.c (fill_slots_from_thread): Reinstate code typoed out in
25089 2021-05-15 Martin Jambor <mjambor@suse.cz>
25092 2021-05-13 Martin Jambor <mjambor@suse.cz>
25094 PR tree-optimization/100453
25095 * tree-sra.c (sra_modify_assign): All const base accesses do not
25096 need refreshing, not just those from decl_pool.
25097 (sra_modify_assign): Do not refresh into a const base decl.
25099 2021-05-15 Jakub Jelinek <jakub@redhat.com>
25101 PR rtl-optimization/100342
25102 * regcprop.c (copy_value): When copying a source reg in a wider
25103 mode than it has recorded for the value, adjust recorded destination
25104 mode too or punt if !REG_CAN_CHANGE_MODE_P.
25106 2021-05-14 Jason Merrill <jason@redhat.com>
25108 * intl.h: Add comments.
25110 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25112 * config/aarch64/aarch64-simd.md
25113 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>_internal): Split into...
25114 (aarch64_sqdmlsl2_lane<mode>_internal): ... This...
25115 (aarch64_sqdmlal2_lane<mode>_internal): ... And this.
25116 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>_internal): Split into ...
25117 (aarch64_sqdmlsl2_laneq<mode>_internal): ... This...
25118 (aarch64_sqdmlal2_laneq<mode>_internal): ... And this.
25119 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>_internal): Split into...
25120 (aarch64_sqdmlsl2_n<mode>_internal): ... This...
25121 (aarch64_sqdmlal2_n<mode>_internal): ... And this.
25123 2021-05-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25126 * config/arm/arm_neon.h (vtst_s8): Replace call to vtst builtin with its
25127 boolean logic equivalent.
25128 (vtst_s16): Likewise.
25129 (vtst_s32): Likewise.
25130 (vtst_u8): Likewise.
25131 (vtst_u16): Likewise.
25132 (vtst_u32): Likewise.
25133 (vtst_p8): Likewise.
25134 (vtst_p16): Likewise.
25135 (vtstq_s8): Likewise.
25136 (vtstq_s16): Likewise.
25137 (vtstq_s32): Likewise.
25138 (vtstq_u8): Likewise.
25139 (vtstq_u16): Likewise.
25140 (vtstq_u32): Likewise.
25141 (vtstq_p8): Likewise.
25142 (vtstq_p16): Likewise.
25143 * config/arm/arm_neon_builtins.def: Remove entry for vtst.
25144 * config/arm/neon.md (neon_vtst<mode>): Remove pattern.
25146 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25148 * config/aarch64/aarch64-simd.md (aarch64_sqdmlal2<mode>): Merge into...
25149 (aarch64_sqdml<SBINQOPS:as>l2<mode>): ... This.
25150 (aarch64_sqdmlsl2<mode>): Delete.
25151 (aarch64_sqdmlal2_lane<mode>): Merge this...
25152 (aarch64_sqdmlsl2_lane<mode>): ... And this...
25153 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>): ... Into this.
25154 (aarch64_sqdmlal2_laneq<mode>): Merge this...
25155 (aarch64_sqdmlsl2_laneq<mode>): ... And this...
25156 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>): ... Into this.
25157 (aarch64_sqdmlal2_n<mode>): Merge this...
25158 (aarch64_sqdmlsl2_n<mode>): ... And this...
25159 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>): ... Into this.
25161 2021-05-13 Martin Sebor <msebor@redhat.com>
25163 PR middle-end/100574
25164 * builtins.c (access_ref::get_ref): Improve detection of PHIs with
25165 all null arguments.
25167 2021-05-13 Martin Sebor <msebor@redhat.com>
25169 PR tree-optimization/93100
25170 PR middle-end/98583
25171 * tree-ssa-uninit.c (check_defs): Exclude intrinsic functions that
25172 don't modify referenced objects.
25174 2021-05-13 Martin Jambor <mjambor@suse.cz>
25176 PR tree-optimization/100453
25177 * tree-sra.c (sra_modify_assign): All const base accesses do not
25178 need refreshing, not just those from decl_pool.
25179 (sra_modify_assign): Do not refresh into a const base decl.
25181 2021-05-13 Martin Liska <mliska@suse.cz>
25183 * tree-ssa-dom.c: Remove m_simplifier.
25185 2021-05-13 Richard Earnshaw <rearnsha@arm.com>
25188 * config/arm/arm.c (arm_canonicalize_comparison): Correctly
25189 canonicalize DImode inequality comparisons against the
25190 maximum integral value.
25192 2021-05-13 Jakub Jelinek <jakub@redhat.com>
25194 PR tree-optimization/98856
25195 * config/i386/i386.c (ix86_shift_rotate_cost): Add CODE argument.
25196 Expect V2DI and V4DI arithmetic right shifts to be emulated.
25197 (ix86_rtx_costs, ix86_add_stmt_cost): Adjust ix86_shift_rotate_cost
25199 * config/i386/i386-expand.c (expand_vec_perm_2perm_interleave,
25200 expand_vec_perm_2perm_pblendv): New functions.
25201 (ix86_expand_vec_perm_const_1): Use them.
25202 * config/i386/sse.md (ashr<mode>3<mask_name>): Rename to ...
25203 (<mask_codefor>ashr<mode>3<mask_name>): ... this.
25204 (ashr<mode>3): New define_expand with VI248_AVX512BW iterator.
25205 (ashrv4di3): New define_expand.
25206 (ashrv2di3): Change condition to TARGET_SSE2, handle !TARGET_XOP
25207 and !TARGET_AVX512VL expansion.
25209 2021-05-13 Uroš Bizjak <ubizjak@gmail.com>
25212 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Force mode
25213 sizes < 16 to a register when constructing vpcmov pattern.
25214 * config/i386/mmx.md (*xop_pcmov_<mode>): Use MMXMODE124 mode.
25216 2021-05-13 Martin Liska <mliska@suse.cz>
25218 * gcov-io.c (gcov_write_block): Remove.
25219 (gcov_write_words): Likewise.
25220 (gcov_read_words): Re-implement using gcov_read_bytes.
25221 (gcov_allocate): Remove.
25222 (GCOV_BLOCK_SIZE): Likewise.
25223 (struct gcov_var): Remove most of the fields.
25224 (gcov_position): Implement with ftell.
25225 (gcov_rewrite): Remove setting of start and offset fields.
25226 (from_file): Re-format.
25227 (gcov_open): Remove setbuf call. It should not be needed.
25228 (gcov_close): Remove internal buffer handling.
25229 (gcov_magic): Use __builtin_bswap32.
25230 (gcov_write_counter): Use directly gcov_write_unsigned.
25231 (gcov_write_string): Use direct fwrite and do not round
25233 (gcov_seek): Use directly fseek.
25234 (gcov_write_tag): Use gcov_write_unsigned directly.
25235 (gcov_write_length): Likewise.
25236 (gcov_write_tag_length): Likewise.
25237 (gcov_read_bytes): Use directly fread.
25238 (gcov_read_unsigned): Use gcov_read_words.
25239 (gcov_read_counter): Likewise.
25240 (gcov_read_string): Use gcov_read_bytes.
25241 * gcov-io.h (GCOV_WORD_SIZE): Adjust to reflect
25242 that size is not in bytes, but words (4B).
25243 (GCOV_TAG_FUNCTION_LENGTH): Likewise.
25244 (GCOV_TAG_ARCS_LENGTH): Likewise.
25245 (GCOV_TAG_ARCS_NUM): Likewise.
25246 (GCOV_TAG_COUNTER_LENGTH): Likewise.
25247 (GCOV_TAG_COUNTER_NUM): Likewise.
25248 (GCOV_TAG_SUMMARY_LENGTH): Likewise.
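
The gcov_magic item above mentions __builtin_bswap32. A minimal sketch of how a byte-swapped magic number could be recognised with that builtin follows. The function name magic_matches and its return convention are hypothetical, and this is not the gcov-io.c code.

```c
#include <stdint.h>

/* Hypothetical sketch, not the gcov-io.c code: returns 1 if MAGIC matches
   EXPECTED as read, -1 if it matches only after a 32-bit byte swap
   (file written with the opposite endianness), and 0 otherwise.  */
static int
magic_matches (uint32_t magic, uint32_t expected)
{
  if (magic == expected)
    return 1;
  if (__builtin_bswap32 (magic) == expected)
    return -1;
  return 0;
}
```

A caller could treat the -1 result as a signal to byte-swap every later word read from the file.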
25250 2021-05-13 liuhongt <hongtao.liu@intel.com>
25253 * config/i386/sse.md (ssedoublevecmode): Add attribute for
25254 V64QI/V32HI/V16SI/V4DI.
25255 (ssehalfvecmode): Add attribute for V2DI/V2DF.
25256 (*vec_concatv4si_0): Extend to VI124_128.
25257 (*vec_concat<mode>_0): New pre-reload splitter.
25258 * config/i386/predicates.md (movq_parallel): New predicate.
25260 2021-05-13 Alexandre Oliva <oliva@adacore.com>
25262 * targhooks.c (default_zero_call_used_regs): Retry using
25263 successfully-zeroed registers as sources.
25265 2021-05-12 Tobias Burnus <tobias@codesourcery.com>
25267 * omp-low.c (finish_taskreg_scan): Use the proper detach decl.
25269 2021-05-12 Aldy Hernandez <aldyh@redhat.com>
25272 * gimple-range.cc (range_of_builtin_call): Skip out on
25273 processing __builtin_clz when varying.
25275 2021-05-12 Tom de Vries <tdevries@suse.de>
25278 * config/nvptx/nvptx-opts.h (enum ptx_version): New enum.
25279 * config/nvptx/nvptx.c (nvptx_file_start): Print .version according
25280 to ptx_version_option.
25281 * config/nvptx/nvptx.h (TARGET_PTX_6_3): Define.
25282 * config/nvptx/nvptx.md (define_insn "nvptx_shuffle<mode>")
25283 (define_insn "nvptx_vote_ballot"): Use sync variant for
25285 * config/nvptx/nvptx.opt (ptx_version): Add enum.
25286 (mptx): Add option.
25287 * doc/invoke.texi (Nvidia PTX Options): Add mptx item.
25289 2021-05-12 Richard Biener <rguenther@suse.de>
25291 PR tree-optimization/100566
25292 * tree-ssa-sccvn.c (dominated_by_p_w_unex): Properly handle
25293 allow_back for all edge queries.
25295 2021-05-12 liuhongt <hongtao.liu@intel.com>
25298 * config/i386/sse.md (<sse4_1_avx2>_pblendvb): Add
25299 splitters for pblendvb of NOT mask register.
25301 2021-05-12 Richard Biener <rguenther@suse.de>
25303 PR tree-optimization/100519
25304 * tree-ssa-reassoc.c (can_associate_p): Split into...
25305 (can_associate_op_p): ... this
25306 (can_associate_type_p): ... and this.
25307 (is_reassociable_op): Call can_associate_op_p.
25308 (break_up_subtract_bb): Call the appropriate predicates.
25309 (reassociate_bb): Likewise.
25311 2021-05-12 Martin Liska <mliska@suse.cz>
25313 * lto-wrapper.c (merge_and_complain): Merge -flto=arg options.
25314 (run_gcc): Use -flto argument detection for merged
25317 2021-05-12 Martin Liska <mliska@suse.cz>
25319 * lto-wrapper.c (print_lto_docs_link): New function.
25320 (run_gcc): Print warning about missing job server detection
25321 after we know NR of partitions. Do the same for -flto{,=1}.
25322 * opts.c (get_option_html_page): Support -flto option.
25324 2021-05-12 Martin Liska <mliska@suse.cz>
25326 * lto-wrapper.c (get_options_from_collect_gcc_options): Change
25328 (append_option): Remove.
25329 (find_option): Rework to use the vector type.
25330 (remove_option): Remove.
25331 (merge_and_complain): Use vectors for cl_decoded_option data
25333 (append_compiler_options): Likewise.
25334 (append_diag_options): Likewise.
25335 (append_linker_options): Likewise.
25336 (append_offload_options): Likewise.
25337 (compile_offload_image): Likewise.
25338 (compile_images_for_offload_targets): Likewise.
25339 (find_and_merge_options): Likewise.
25340 (run_gcc): Likewise.
25342 2021-05-12 Bernd Edlinger <bernd.edlinger@hotmail.de>
25345 * dwarf2out.c (dwarf2out_finish): Set
25346 have_multiple_function_sections with multi-range text_section.
25348 2021-05-12 Martin Liska <mliska@suse.cz>
25350 PR bootstrap/100560
25351 * Makefile.in: Remove version.h from linker command line.
25353 2021-05-12 Richard Biener <rguenther@suse.de>
25355 PR middle-end/100547
25356 * rtl.h (rtvec_alloc): Make argument size_t.
25357 * rtl.c (rtvec_alloc): Verify the count is less than INT_MAX.
25359 2021-05-12 Jakub Jelinek <jakub@redhat.com>
25361 PR middle-end/100508
25362 * cfgexpand.c (expand_debug_expr): For DEBUG_EXPR_DECL with vector
25363 type, don't reuse DECL_RTL if it has different mode, instead force
25364 creation of a new DEBUG_EXPR.
25366 2021-05-12 Jakub Jelinek <jakub@redhat.com>
25367 Marc Glisse <marc.glisse@inria.fr>
25369 PR tree-optimization/94589
25370 * match.pd ((X & Y) == X -> (X & ~Y) == 0,
25371 (X | Y) == Y -> (X & ~Y) == 0): New GIMPLE simplifications.
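
Both simplifications above rest on the same fact: each comparison holds exactly when X has no bits set outside Y. A minimal sketch that checks this exhaustively for 8-bit operands follows. The helper count_rewrite_mismatches is hypothetical and is not part of GCC or match.pd.

```c
/* Hypothetical checker, not part of GCC or match.pd.  Returns the number
   of 8-bit operand pairs for which either rewrite disagrees with its
   original form; 0 means both identities hold on that range.  */
static unsigned
count_rewrite_mismatches (void)
{
  unsigned mismatches = 0;
  for (unsigned x = 0; x < 256; x++)
    for (unsigned y = 0; y < 256; y++)
      {
        int rewritten = (x & ~y) == 0;
        if (((x & y) == x) != rewritten)
          mismatches++;
        if (((x | y) == y) != rewritten)
          mismatches++;
      }
  return mismatches;
}
```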
25373 2021-05-12 Uroš Bizjak <ubizjak@gmail.com>
25376 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Handle V2SF mode.
25377 * config/i386/mmx.md (MMXMODE124): New mode iterator.
25379 (mmxintvecmode): New mode attribute.
25380 (mmxintvecmodelower): Ditto.
25381 (*mmx_maskcmpv2sf3_comm): New insn pattern.
25382 (*mmx_maskcmpv2sf3): Ditto.
25383 (vec_cmpv2sfv2si): New expander.
25384 (vcond<V2FI:mode>v2si): Ditto.
25385 (mmx_blendvps): New insn pattern.
25386 (vcond<MMXMODE124:mode><MMXMODEI:mode>): Also handle V2SFmode.
25387 (vcondu<MMXMODE124:mode><MMXMODEI:mode>): Ditto.
25388 (vcond_mask_<mode><mmxintvecmodelower>): Ditto.
25390 2021-05-11 Martin Sebor <msebor@redhat.com>
25392 PR middle-end/21433
25393 * expr.c (expand_expr_real_1): Replace unreachable code with an assert.
25395 2021-05-11 Richard Biener <rguenther@suse.de>
25397 * gimple-fold.c (gimple_fold_call): Do not call
25398 maybe_fold_reference on call arguments or the static chain.
25399 (fold_stmt_1): Do not call maybe_fold_reference on GIMPLE_ASM
25402 2021-05-11 Martin Liska <mliska@suse.cz>
25404 * builtins.def (DEF_HSAIL_BUILTIN): Remove.
25405 (DEF_HSAIL_ATOMIC_BUILTIN): Likewise.
25406 (DEF_HSAIL_SAT_BUILTIN): Likewise.
25407 (DEF_HSAIL_INTR_BUILTIN): Likewise.
25408 (DEF_HSAIL_CVT_ZEROI_SAT_BUILTIN): Likewise.
25409 * doc/frontends.texi: Remove BRIG.
25410 * doc/install.texi: Likewise.
25411 * doc/invoke.texi: Likewise.
25412 * doc/standards.texi: Likewise.
25413 * brig-builtins.def: Removed.
25414 * brig/ChangeLog: Removed.
25415 * brig/Make-lang.in: Removed.
25416 * brig/brig-builtins.h: Removed.
25417 * brig/brig-c.h: Removed.
25418 * brig/brig-lang.c: Removed.
25419 * brig/brigfrontend/brig-arg-block-handler.cc: Removed.
25420 * brig/brigfrontend/brig-atomic-inst-handler.cc: Removed.
25421 * brig/brigfrontend/brig-basic-inst-handler.cc: Removed.
25422 * brig/brigfrontend/brig-branch-inst-handler.cc: Removed.
25423 * brig/brigfrontend/brig-cmp-inst-handler.cc: Removed.
25424 * brig/brigfrontend/brig-code-entry-handler.cc: Removed.
25425 * brig/brigfrontend/brig-code-entry-handler.h: Removed.
25426 * brig/brigfrontend/brig-comment-handler.cc: Removed.
25427 * brig/brigfrontend/brig-control-handler.cc: Removed.
25428 * brig/brigfrontend/brig-copy-move-inst-handler.cc: Removed.
25429 * brig/brigfrontend/brig-cvt-inst-handler.cc: Removed.
25430 * brig/brigfrontend/brig-fbarrier-handler.cc: Removed.
25431 * brig/brigfrontend/brig-function-handler.cc: Removed.
25432 * brig/brigfrontend/brig-function.cc: Removed.
25433 * brig/brigfrontend/brig-function.h: Removed.
25434 * brig/brigfrontend/brig-inst-mod-handler.cc: Removed.
25435 * brig/brigfrontend/brig-label-handler.cc: Removed.
25436 * brig/brigfrontend/brig-lane-inst-handler.cc: Removed.
25437 * brig/brigfrontend/brig-machine.c: Removed.
25438 * brig/brigfrontend/brig-machine.h: Removed.
25439 * brig/brigfrontend/brig-mem-inst-handler.cc: Removed.
25440 * brig/brigfrontend/brig-module-handler.cc: Removed.
25441 * brig/brigfrontend/brig-queue-inst-handler.cc: Removed.
25442 * brig/brigfrontend/brig-seg-inst-handler.cc: Removed.
25443 * brig/brigfrontend/brig-signal-inst-handler.cc: Removed.
25444 * brig/brigfrontend/brig-to-generic.cc: Removed.
25445 * brig/brigfrontend/brig-to-generic.h: Removed.
25446 * brig/brigfrontend/brig-util.cc: Removed.
25447 * brig/brigfrontend/brig-util.h: Removed.
25448 * brig/brigfrontend/brig-variable-handler.cc: Removed.
25449 * brig/brigfrontend/hsa-brig-format.h: Removed.
25450 * brig/brigfrontend/phsa.h: Removed.
25451 * brig/brigspec.c: Removed.
25452 * brig/config-lang.in: Removed.
25453 * brig/gccbrig.texi: Removed.
25454 * brig/lang-specs.h: Removed.
25455 * brig/lang.opt: Removed.
25457 2021-05-11 Richard Biener <rguenther@suse.de>
25460 * ipa-param-manipulation.c
25461 (ipa_param_body_adjustments::modify_call_stmt): Avoid
25462 altering SSA_NAME_DEF_STMT by adjusting the call's LHS
25463 via gimple_call_lhs_ptr.
25465 2021-05-11 Alex Coplan <alex.coplan@arm.com>
25468 * config/arm/arm.c (cmse_nonsecure_call_inline_register_clear):
25469 Avoid emitting CFA adjusts on the sp if we have the fp.
25471 2021-05-11 Richard Sandiford <richard.sandiford@arm.com>
25473 * config/aarch64/iterators.md (VMUL_CHANGE_NLANES): Delete.
25474 (VMULD): New iterator.
25475 (VCOND): Handle V4HF and V8HF.
25476 (VCONQ): Fix entry for V2SF.
25477 * config/aarch64/aarch64-simd.md (mul_lane<mode>3): Use VMULD
25478 instead of VMUL. Use a 64-bit vector mode for the indexed operand.
25479 (*aarch64_mul3_elt_<vswap_width_name><mode>): Merge with...
25480 (mul_laneq<mode>3): ...this define_insn. Use VMUL instead of VDQSF.
25481 Use a 128-bit vector mode for the indexed operand. Use stype for
25482 the scheduling type.
25484 2021-05-11 Richard Biener <rguenther@suse.de>
25486 * gimple-fold.c (maybe_fold_reference): Only return
25487 is_gimple_min_invariant values.
25489 2021-05-11 Richard Biener <rguenther@suse.de>
25491 PR middle-end/100509
25492 * gimple-fold.c (fold_gimple_assign): Only call
25493 get_symbol_constant_value on register type symbols.
25495 2021-05-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
25496 Joe Ramsay <joe.ramsay@arm.com>
25499 * config/arm/arm_mve.h (__arm_vstrwq_scatter_offset): Fix wrong arguments.
25500 (__arm_vcmpneq): Remove duplicate definition.
25501 (__arm_vstrwq_scatter_offset_p): Likewise.
25502 (__arm_vmaxq_x): Likewise.
25503 (__arm_vmlsdavaq): Likewise.
25504 (__arm_vmlsdavaxq): Likewise.
25505 (__arm_vmlsdavq_p): Likewise.
25506 (__arm_vmlsdavxq_p): Likewise.
25507 (__arm_vrmlaldavhaq): Likewise.
25508 (__arm_vstrbq_p): Likewise.
25509 (__arm_vstrbq_scatter_offset): Likewise.
25510 (__arm_vstrbq_scatter_offset_p): Likewise.
25511 (__arm_vstrdq_scatter_offset): Likewise.
25512 (__arm_vstrdq_scatter_offset_p): Likewise.
25513 (__arm_vstrdq_scatter_shifted_offset): Likewise.
25514 (__arm_vstrdq_scatter_shifted_offset_p): Likewise.
25516 2021-05-11 Jakub Jelinek <jakub@redhat.com>
25518 PR middle-end/100471
25519 * omp-low.c (lower_omp_task_reductions): For OMP_TASKLOOP, if data
25520 is 0, bypass the reduction loop including
25521 GOMP_taskgroup_reduction_unregister call.
25523 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
25525 * config/rs6000/rs6000.c (struct rs6000_cost_data): New member
25526 costing_for_scalar.
25527 (rs6000_density_test): Early return if costing_for_scalar is true.
25528 (rs6000_init_cost): Init costing_for_scalar of rs6000_cost_data.
25530 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
25532 * doc/tm.texi: Regenerated.
25533 * target.def (init_cost): Add new parameter costing_for_scalar.
25534 * targhooks.c (default_init_cost): Adjust for new parameter.
25535 * targhooks.h (default_init_cost): Likewise.
25536 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Likewise.
25537 (vect_compute_single_scalar_iteration_cost): Likewise.
25538 (vect_analyze_loop_2): Likewise.
25539 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Likewise.
25540 (vect_bb_vectorization_profitable_p): Likewise.
25541 * tree-vectorizer.h (init_cost): Likewise.
25542 * config/aarch64/aarch64.c (aarch64_init_cost): Likewise.
25543 * config/i386/i386.c (ix86_init_cost): Likewise.
25544 * config/rs6000/rs6000.c (rs6000_init_cost): Likewise.
25546 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
25548 * config/rs6000/rs6000.c (rs6000_vect_nonmem): Renamed to
25549 vect_nonmem and moved into...
25550 (struct rs6000_cost_data): ...here.
25551 (rs6000_init_cost): Use vect_nonmem of cost_data instead.
25552 (rs6000_add_stmt_cost): Likewise.
25553 (rs6000_finish_cost): Likewise.
25555 2021-05-10 Eric Botcazou <ebotcazou@adacore.com>
25557 * range-op.cc (get_bool_state): Adjust head comment.
25558 (operator_not_equal::op1_range): Fix comment.
25559 (operator_bitwise_xor::op1_range): Remove call to gcc_unreachable.
25561 2021-05-10 Martin Sebor <msebor@redhat.com>
25563 PR middle-end/100425
25564 PR middle-end/100510
25565 * gimple-ssa-warn-alloca.c (pass_walloca::first_time_p): Rename...
25566 (pass_walloca::xlimit_certain_p): ...to this.
25567 (pass_walloca::gate): Execute for any kind of handled warning.
25568 (pass_walloca::execute): Avoid issuing "maybe" and "unbounded"
25569 warnings when xlimit_certain_p is set.
25571 2021-05-10 Pat Haugen <pthaugen@linux.ibm.com>
25573 * config/rs6000/rs6000.c (rs6000_ira_change_pseudo_allocno_class):
25574 Return ALTIVEC_REGS if that is best_class.
25575 (rs6000_compute_pressure_classes): Add ALTIVEC_REGS.
25577 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25579 * config/arm/arm.h (CPP_SPEC): Remove error message about
25582 2021-05-10 Martin Jambor <mjambor@suse.cz>
25584 * ipa-prop.h (IPA_NODE_REF): Removed.
25585 (IPA_NODE_REF_GET_CREATE): Likewise.
25586 (IPA_EDGE_REF): Likewise.
25587 (IPA_EDGE_REF_GET_CREATE): Likewise.
25588 (IS_VALID_JUMP_FUNC_INDEX): Likewise.
25589 * ipa-cp.c (print_all_lattices): Replaced IPA_NODE_REF with a direct
25590 use of ipa_node_params_sum.
25591 (ipcp_versionable_function_p): Likewise.
25592 (push_node_to_stack): Likewise.
25593 (pop_node_from_stack): Likewise.
25594 (set_single_call_flag): Replaced two IPA_NODE_REF with one single
25595 direct use of ipa_node_params_sum.
25596 (initialize_node_lattices): Replaced IPA_NODE_REF with a direct use of
25597 ipa_node_params_sum.
25598 (ipa_context_from_jfunc): Replaced IPA_EDGE_REF with a direct use of
25600 (ipcp_verify_propagated_values): Replaced IPA_NODE_REF with a direct
25601 use of ipa_node_params_sum.
25602 (self_recursively_generated_p): Likewise.
25603 (propagate_scalar_across_jump_function): Likewise.
25604 (propagate_context_across_jump_function): Replaced IPA_EDGE_REF with a
25605 direct use of ipa_edge_args_sum, moved the lookup after the early
25606 exit. Replaced IPA_NODE_REF with a direct use of ipa_node_params_sum.
25607 (propagate_bits_across_jump_function): Replaced IPA_NODE_REF with
25608 direct uses of ipa_node_params_sum.
25609 (propagate_vr_across_jump_function): Likewise.
25610 (propagate_aggregate_lattice): Likewise.
25611 (propagate_aggs_across_jump_function): Likewise.
25612 (propagate_constants_across_call): Likewise, also replaced
25613 IPA_EDGE_REF with a direct use of ipa_edge_args_sum.
25614 (good_cloning_opportunity_p): Replaced IPA_NODE_REF with a direct use
25615 of ipa_node_params_sum.
25616 (estimate_local_effects): Likewise.
25617 (add_all_node_vals_to_toposort): Likewise.
25618 (propagate_constants_topo): Likewise.
25619 (ipcp_propagate_stage): Likewise.
25620 (ipcp_discover_new_direct_edges): Likewise.
25621 (calls_same_node_or_its_all_contexts_clone_p): Likewise.
25622 (cgraph_edge_brings_value_p): Likewise (in both overloaded functions).
25623 (get_info_about_necessary_edges): Likewise.
25624 (want_remove_some_param_p): Likewise.
25625 (create_specialized_node): Likewise.
25626 (self_recursive_pass_through_p): Likewise.
25627 (self_recursive_agg_pass_through_p): Likewise.
25628 (find_more_scalar_values_for_callers_subset): Likewise and also
25629 replaced IPA_EDGE_REF with direct uses of ipa_edge_args_sum, in one
25630 case replacing two of those with a single query.
25631 (find_more_contexts_for_caller_subset): Likewise for the
25632 ipa_polymorphic_call_context overload.
25633 (intersect_aggregates_with_edge): Replaced IPA_EDGE_REF with a direct
25634 use of ipa_edge_args_sum. Replaced IPA_NODE_REF with direct uses of
25635 ipa_node_params_sum.
25636 (find_aggregate_values_for_callers_subset): Likewise, also reusing
25637 results of ipa_edge_args_sum->get.
25638 (cgraph_edge_brings_all_scalars_for_node): Replaced IPA_NODE_REF with
25639 direct uses of ipa_node_params_sum, replaced IPA_EDGE_REF with a
25640 direct use of ipa_edge_args_sum.
25641 (cgraph_edge_brings_all_agg_vals_for_node): Likewise, moved node
25642 summary query after the early exit and reused the result later.
25643 (decide_about_value): Replaced IPA_NODE_REF with a direct use of
25644 ipa_node_params_sum.
25645 (decide_whether_version_node): Likewise. Removed re-querying for
25646 summaries after cloning.
25647 (spread_undeadness): Replaced IPA_NODE_REF with a direct use of
25648 ipa_node_params_sum.
25649 (has_undead_caller_from_outside_scc_p): Likewise, reusing results of
25651 (identify_dead_nodes): Likewise.
25652 (ipcp_store_bits_results): Replaced IPA_NODE_REF with direct uses of
25653 ipa_node_params_sum.
25654 (ipcp_store_vr_results): Likewise.
25655 * ipa-fnsummary.c (evaluate_properties_for_edge): Likewise.
25656 (ipa_fn_summary_t::duplicate): Likewise.
25657 (analyze_function_body): Likewise.
25658 (estimate_calls_size_and_time): Likewise.
25659 (ipa_cached_call_context::duplicate_from): Likewise.
25660 (ipa_call_context::equal_to): Likewise.
25661 (remap_edge_params): Likewise.
25662 (ipa_merge_fn_summary_after_inlining): Likewise.
25663 (inline_read_section): Likewise.
25664 * ipa-icf.c (sem_function::param_used_p): Likewise.
25665 * ipa-modref.c (compute_parm_map): Likewise.
25666 (compute_parm_map): Replaced IPA_EDGE_REF with a direct use of
25668 (get_access_for_fnspec): Replaced IPA_NODE_REF with a direct use of
25669 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
25671 * ipa-profile.c (check_argument_count): Likewise.
25672 * ipa-prop.c (ipa_alloc_node_params): Replaced IPA_NODE_REF_GET_CREATE
25673 with a direct use of ipa_node_params_sum.
25674 (ipa_initialize_node_params): Likewise.
25675 (ipa_print_node_jump_functions_for_edge): Replaced IPA_EDGE_REF with a
25676 direct use of ipa_edge_args_sum and reused the query result.
25677 (ipa_compute_jump_functions_for_edge): Replaced IPA_NODE_REF with a
25678 direct use of ipa_node_params_sum and replaced IPA_EDGE_REF with a
25679 direct use of ipa_edge_args_sum.
25680 (ipa_note_param_call): Replaced IPA_NODE_REF with a direct use of
25681 ipa_node_params_sum and reused the result of the query.
25682 (ipa_analyze_node): Likewise.
25683 (ipa_analyze_controlled_uses): Replaced IPA_NODE_REF with a direct use
25684 of ipa_node_params_sum.
25685 (update_jump_functions_after_inlining): Replaced IPA_EDGE_REF with
25686 direct uses of ipa_edge_args_sum.
25687 (update_indirect_edges_after_inlining): Replaced IPA_NODE_REF with
25688 direct uses of ipa_node_params_sum and replaced IPA_EDGE_REF with a
25689 direct use of ipa_edge_args_sum. Removed superficial re-querying the
25691 (propagate_controlled_uses): Replaced IPA_NODE_REF with direct uses of
25692 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
25694 (ipa_propagate_indirect_call_infos): Replaced IPA_EDGE_REF with a
25695 direct use of ipa_edge_args_sum.
25696 (ipa_edge_args_sum_t::duplicate): Replaced IPA_NODE_REF with a direct
25697 use of ipa_node_params_sum.
25698 (ipa_print_node_params): Likewise.
25699 (ipa_write_node_info): Likewise and also replaced IPA_EDGE_REF with
25700 direct uses of ipa_edge_args_sum.
25701 (ipa_read_edge_info): Replaced IPA_EDGE_REF with a direct use of
25703 (ipa_read_node_info): Replaced IPA_NODE_REF with a direct use of
25704 ipa_node_params_sum.
25705 (ipa_prop_write_jump_functions): Likewise. Move variable node to the
25706 scopes where it is used.
25708 2021-05-10 Uroš Bizjak <ubizjak@gmail.com>
25710 * config/i386/i386-expand.c (ix86_expand_sse_movcc)
25711 <case E_V2SImode>: Force op_true to register.
25713 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25715 * config/arm/iterators.md (MVE_FP_COMPARISONS): New.
25716 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_f<mode>)
25717 (mve_vcmp<mve_cmp_op>q_n_f<mode>): New, merge all vcmp_*f*
25719 (mve_vcmpeqq_f<mode>, mve_vcmpeqq_n_f<mode>, mve_vcmpgeq_f<mode>)
25720 (mve_vcmpgeq_n_f<mode>, mve_vcmpgtq_f<mode>)
25721 (mve_vcmpgtq_n_f<mode>, mve_vcmpleq_f<mode>)
25722 (mve_vcmpleq_n_f<mode>, mve_vcmpltq_f<mode>)
25723 (mve_vcmpltq_n_f<mode>, mve_vcmpneq_f<mode>)
25724 (mve_vcmpneq_n_f<mode>): Remove.
25725 * config/arm/unspecs.md (VCMPEQQ_F, VCMPEQQ_N_F, VCMPGEQ_F)
25726 (VCMPGEQ_N_F, VCMPGTQ_F, VCMPGTQ_N_F, VCMPLEQ_F, VCMPLEQ_N_F)
25727 (VCMPLTQ_F, VCMPLTQ_N_F, VCMPNEQ_F, VCMPNEQ_N_F): Remove.
25729 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25731 * config/arm/iterators.md (MVE_COMPARISONS): New.
25733 (mve_cmp_type): New.
25734 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_<mode>): New, merge all
25736 (mve_vcmpneq_<mode>, mve_vcmpcsq_n_<mode>, mve_vcmpcsq_<mode>)
25737 (mve_vcmpeqq_n_<mode>, mve_vcmpeqq_<mode>, mve_vcmpgeq_n_<mode>)
25738 (mve_vcmpgeq_<mode>, mve_vcmpgtq_n_<mode>, mve_vcmpgtq_<mode>)
25739 (mve_vcmphiq_n_<mode>, mve_vcmphiq_<mode>, mve_vcmpleq_n_<mode>)
25740 (mve_vcmpleq_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
25741 (mve_vcmpneq_n_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
25742 (mve_vcmpneq_n_<mode>): Remove.
25744 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25746 * config/arm/arm_mve.h (__arm_vcmp*): Remove 's' suffix.
25747 * config/arm/arm_mve_builtins.def (vcmp*): Remove 's' suffix.
25748 * config/arm/mve.md (mve_vcmp*): Remove 's' suffix in pattern
25751 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25753 * config/arm/arm_mve_builtins.def (vcmpneq_u): Remove.
25754 (vcmpneq_n_u): Likewise.
25755 (vcmpeqq_u): Likewise.
25756 (vcmpeqq_n_u): Likewise.
25757 * config/arm/iterators.md (supf): Remove VCMPNEQ_U, VCMPEQQ_U,
25758 VCMPEQQ_N_U and VCMPNEQ_N_U.
25759 * config/arm/mve.md (mve_vcmpneq): Remove <supf> iteration.
25760 (mve_vcmpeqq_n): Likewise.
25761 (mve_vcmpeqq): Likewise.
25762 (mve_vcmpneq_n): Likewise.
25764 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25766 * config/arm/arm_mve.h (__arm_vcmpeq*u*, __arm_vcmpne*u*): Call
25767 the 's' version of the builtin.
25769 2021-05-10 Richard Biener <rguenther@suse.de>
25771 PR tree-optimization/100492
25772 * tree-loop-distribution.c (find_seed_stmts_for_distribution):
25773 Find nothing when the loop contains an irreducible region.
25775 2021-05-10 Richard Biener <rguenther@suse.de>
25777 PR middle-end/100464
25779 * gimple-fold.c (canonicalize_constructor_val): Do not set
25782 2021-05-10 Richard Biener <rguenther@suse.de>
25784 PR tree-optimization/100434
25785 * tree-ssa-dse.c (initialize_ao_ref_for_dse): Handle
25787 (dse_optimize_stmt): Handle call LHS by dropping the
25788 LHS or the whole call if it doesn't have other
25790 (pass_dse::execute): Adjust.
25792 2021-05-10 Martin Liska <mliska@suse.cz>
25794 * Makefile.in: Add missing genversion rule.
25796 2021-05-10 Alex Coplan <alex.coplan@arm.com>
25799 * config/arm/mve.md (*mve_mov<mode>): Simplify output code. Use
25800 vldrw.u32 and vstrw.32 for V2D[IF]mode loads and stores.
25802 2021-05-10 Martin Liska <mliska@suse.cz>
25804 * builtins.c (is_builtin_name): Use startswith
25805 function instead of strncmp.
25806 * collect2.c (main): Likewise.
25807 (has_lto_section): Likewise.
25808 (scan_libraries): Likewise.
25809 * coverage.c (coverage_checksum_string): Likewise.
25810 (coverage_init): Likewise.
25811 * dwarf2out.c (is_cxx): Likewise.
25812 (gen_compile_unit_die): Likewise.
25813 * gcc-ar.c (main): Likewise.
25814 * gcc.c (init_spec): Likewise.
25815 (read_specs): Likewise.
25816 (execute): Likewise.
25817 (check_live_switch): Likewise.
25818 * genattrtab.c (write_attr_case): Likewise.
25819 (IS_ATTR_GROUP): Likewise.
25820 * gencfn-macros.c (main): Likewise.
25821 * gengtype.c (type_for_name): Likewise.
25822 (gen_rtx_next): Likewise.
25823 (get_file_langdir): Likewise.
25824 (write_local): Likewise.
25825 * genmatch.c (get_operator): Likewise.
25826 (get_operand_type): Likewise.
25827 (expr::gen_transform): Likewise.
25828 * genoutput.c (validate_optab_operands): Likewise.
25829 * incpath.c (add_sysroot_to_chain): Likewise.
25830 * langhooks.c (lang_GNU_C): Likewise.
25831 (lang_GNU_CXX): Likewise.
25832 (lang_GNU_Fortran): Likewise.
25833 (lang_GNU_OBJC): Likewise.
25834 * lto-wrapper.c (run_gcc): Likewise.
25835 * omp-general.c (omp_max_simt_vf): Likewise.
25836 * omp-low.c (omp_runtime_api_call): Likewise.
25837 * opts-common.c (parse_options_from_collect_gcc_options): Likewise.
25838 * read-rtl-function.c (function_reader::read_rtx_operand_r): Likewise.
25839 * real.c (real_from_string): Likewise.
25840 * selftest.c (assert_str_startswith): Likewise.
25841 * timevar.c (timer::validate_phases): Likewise.
25842 * tree.c (get_file_function_name): Likewise.
25843 * ubsan.c (ubsan_use_new_style_p): Likewise.
25844 * varasm.c (default_function_rodata_section): Likewise.
25845 (incorporeal_function_p): Likewise.
25846 (default_section_type_flags): Likewise.
25847 * system.h (startswith): Define startswith.
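
The entry above replaces open-coded strncmp prefix tests with a startswith helper. A minimal sketch of such a helper follows. Its body is an assumption based on the entry's wording, not a copy of system.h.

```c
#include <stdbool.h>
#include <string.h>

/* Sketch of a prefix test built on strncmp; the exact definition in
   system.h is an assumption here.  Returns true if STR begins with
   PREFIX.  */
static inline bool
startswith (const char *str, const char *prefix)
{
  return strncmp (str, prefix, strlen (prefix)) == 0;
}
```

With this shape, a call such as startswith (arg, "-flto") avoids repeating the prefix length by hand, which is where an open-coded strncmp call is easy to get wrong.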
25849 2021-05-10 Martin Liska <mliska@suse.cz>
25851 * bitmap.h (class auto_bitmap): Remove
25852 __cplusplus >= 201103.
25853 * config/aarch64/aarch64.c: Likewise.
25854 * gimple-ssa-store-merging.c (store_immediate_info::store_immediate_info):
25856 * sbitmap.h: Likewise.
25858 2021-05-10 Martin Liska <mliska@suse.cz>
25860 * Makefile.in: Rename gcov-iov to genversion and depend
25861 on version.h (instead of gcov-iov.h).
25862 * gcov-io.h: Include version.h instead of gcov-iov.h.
25863 * gengtype-state.c (read_state_version): Likewise.
25864 * gcov-iov.c: Moved to...
25865 * genversion.c: ...here.
25866 * lto-streamer.h (LTO_major_version): Define it with
25868 * version.c: Removed.
25869 * version.h: Removed.
25871 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25873 * config/arc/arc.md (UNSPEC_ARC_DMPYWH): Define.
25874 * config/arc/simdext.md (VCT): Add predicates for iterator
25877 (voptab): Likewise.
25878 (vec_widen_<V_US>mult_hi_v4hi): Change pattern predicate.
25879 (<voptab>v2si3): New patterns.
25881 (reduc_plus_scal_v4hi): Likewise.
25882 (reduc_plus_scal_v2si): Likewise.
25883 (vec_duplicatev2si): Likewise.
25884 (vec_duplicatev4hi): Likewise.
25886 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25888 * config/arc/simdext.md: Format and cleanup file.
25890 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25892 * config/arc/simdext.md (movmisalignv2hi): Allow misaligned access
25893 only when munaligned-access option is on.
25894 (movmisalign<mode>): Likewise.
25896 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25898 * common/config/arc/arc-common.c (arc_handle_option): Remove dot
25900 * config/arc/arc.c (arc_reorg): Remove underscore from string.
25902 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25904 * config/arc/arc.h (CLZ_DEFINED_VALUE_AT_ZERO): Define.
25905 (CTZ_DEFINED_VALUE_AT_ZERO): Likewise.
25906 * config/arc/arc.md (clrsbsi2): Cleanup pattern.
25907 (norm_f): Likewise.
25910 (clzsi2): Use fls instruction when available.
25911 (arc_clzsi2): Likewise.
25913 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25915 * config/arc/arc.h (ADDITIONAL_REGISTER_NAMES): Add r26 and r27.
25917 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25919 * doc/extend.texi (__builtin_arc_sr): Swap arguments.
25921 2021-05-10 Bernd Edlinger <bernd.edlinger@hotmail.de>
25923 PR middle-end/100467
25924 * toplev.c (compile_file): Call insn_locations_init before
25925 targetm.asm_out.code_end.
25927 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25930 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25932 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
25934 2021-05-07 Jakub Jelinek <jakub@redhat.com>
25935 Andrew Stubbs <ams@codesourcery.com>
25938 * builtins.c (try_store_by_multiple_pieces): Use force_operand for
25939 emit_move_insn operands.
25941 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
25943 * cfgexpand.c (expand_gimple_basic_block): Do not inherit a current
25944 location for the outgoing edges of an empty block.
25945 * dwarf2out.c (add_subscript_info): Retrieve the bounds and index
25946 type by means of the get_array_descr_info langhook, if it is set and
25947 returns true. Remove obsolete code dealing with unnamed subtypes.
25949 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25951 * gimple-range-cache.cc (ssa_block_ranges): Virtualize.
25952 (sbr_vector): Renamed from ssa_block_cache.
25953 (sbr_vector::sbr_vector): Allocate from obstack and initialize.
25954 (ssa_block_ranges::~ssa_block_ranges): Remove.
25955 (sbr_vector::set_bb_range): Use varying and undefined cached values.
25956 (ssa_block_ranges::set_bb_varying): Remove.
25957 (sbr_vector::get_bb_range): Adjust assert.
25958 (sbr_vector::bb_range_p): Adjust assert.
25959 (~block_range_cache): No freeing loop required.
25960 (block_range_cache::get_block_ranges): Remove.
25961 (block_range_cache::set_bb_range): Inline get_block_ranges.
25962 (block_range_cache::set_bb_varying): Remove.
25963 * gimple-range-cache.h (set_bb_varying): Remove prototype.
25964 * value-range.h (irange_allocator::get_memory): New.
25966 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25968 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Search
25969 dominator tree if available and requested.
25970 (ranger_cache::ssa_range_in_bb): Don't search dom tree here.
25971 (ranger_cache::fill_block_cache): Don't search dom tree here either.
25972 * gimple-range-cache.h (non_null_deref_p): Add dom_search param.
25974 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25976 * gimple-range.cc (gimple_ranger::range_on_exit): Handle block with
25977 only PHI nodes better.
25979 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25981 * gimple-range-edge.h (gimple_outgoing_range): Rename from
25983 (gcond_edge_range): Export prototype.
25984 * gimple-range-edge.cc (gcond_edge_range): New.
25985 (gimple_outgoing_range::edge_range_p): Use gcond_edge_range.
25986 * gimple-range-gori.h (gori_compute): Use gimple_outgoing_range.
25988 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25990 * gimple-range-edge.cc (outgoing_range::calc_switch_ranges): Compute
25991 default range into a temp and allocate only what is needed.
25993 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25995 * range-op.cc (operator_trunc_mod::wi_fold): x % 0 is UNDEFINED.
25997 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25999 * gimple-range.h (gimple_range_global): Pick up parameter initial
26000 values, and use-before defined locals are UNDEFINED.
26002 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
26004 * doc/extend.texi (scalar_storage_order): Mention effect on pointer
26006 * tree.h (reverse_storage_order_for_component_p): Return false if
26007 the type is a pointer.
26009 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
26011 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
26013 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
26016 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
26017 Handle V8QI, V4HI and V2SI modes.
26018 * config/i386/mmx.md (mmx_pblendvb): New insn pattern.
26019 * config/i386/sse.md (unspec): Move UNSPEC_BLENDV ...
26020 * config/i386/i386.md (unspec): ... here.
26022 2021-05-07 Tobias Burnus <tobias@codesourcery.com>
26023 Tom de Vries <tdevries@suse.de>
26025 * omp-low.c (lower_rec_simd_input_clauses): Set max_vf = 1 if
26026 a truth_value_p reduction variable is nonintegral.
26028 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
26031 * config/i386/i386-expand.c (ix86_use_mask_cmp_p):
26032 Return false for mode sizes < 16.
26034 2021-05-07 Jakub Jelinek <jakub@redhat.com>
26037 * config/i386/mmx.md (*xop_pcmov_<mode>): New define_insn.
26039 2021-05-06 Martin Jambor <mjambor@suse.cz>
26041 * ipa-sra.c (ipa_sra_dump_all_summaries): Dump edge summaries even
26042 when there is no function summary.
26043 (ipa_sra_summarize_function): Produce edge summaries even when
26046 2021-05-06 Tom Tromey <tom@tromey.com>
26048 * godump.c (string_hash_eq): Remove.
26049 (go_finish): Use htab_eq_string.
26051 2021-05-06 Tom Tromey <tom@tromey.com>
26053 * gengtype-state.c (read_state): Use htab_eq_string.
26054 (string_eq): Remove.
26056 2021-05-06 Tom Tromey <tom@tromey.com>
26058 * gensupport.c (htab_eq_string): Remove.
26060 2021-05-06 Bernd Edlinger <bernd.edlinger@hotmail.de>
26063 * debug.h (gcc_debug_hooks): Add set_ignored_loc function pointer.
26064 * dwarf2out.h (dw_fde_node::ignored_debug): New data item.
26065 * dbxout.c (dbx_debug_hooks, xcoff_debug_hooks): Add dummy
26066 set_ignored_loc callbacks.
26067 * debug.c (do_nothing_debug_hooks): Likewise.
26068 * vmsdbgout.c (vmsdbg_debug_hooks): Likewise.
26069 * dwarf2out.c (text_section_used, cold_text_section_used): Remove.
26070 (in_text_section_p, last_text_label, last_cold_label,
26071 switch_text_ranges, switch_cold_ranges): New data items.
26072 (dwarf2out_note_section_used): Remove.
26073 (dwarf2out_begin_prologue): Set fde->ignored_debug and
26075 (mark_ignored_debug_section): New helper function.
26076 (dwarf2out_end_epilogue, dwarf2out_switch_text_section): Call
26077 mark_ignored_debug_section.
26078 (dwarf2_debug_hooks): Use dwarf2out_set_ignored_loc.
26079 (dwarf2_lineno_debug_hooks): Use dummy for set_ignored_loc.
26080 (size_of_aranges): Adjust formula for multi-part text ranges size.
26081 (output_aranges): Output multi-part text ranges.
26082 (dwarf2out_set_ignored_loc): New callback function.
26083 (dwarf2out_finish): Output multi-part text ranges.
26084 (dwarf2out_c_finalize): Clear new data items.
26085 * final.c (final_start_function_1): Call set_ignored_loc callback.
26086 (final_scan_insn_1): Likewise.
26087 * ggc-page.c (gt_ggc_mx): New helper function.
26088 * stringpool.c (gt_pch_nx): Likewise.
26090 2021-05-06 Richard Biener <rguenther@suse.de>
26092 * timevar.def (TV_TREE_INSERT_PHI_NODES): Remove.
26093 (TV_TREE_SSA_REWRITE_BLOCKS): Likewise.
26094 (TV_TREE_INTO_SSA): New.
26095 * tree-into-ssa.c (insert_phi_nodes): Do not account separately.
26096 (rewrite_blocks): Likewise.
26097 (pass_data_build_ssa): Account to TV_TREE_INTO_SSA.
26099 2021-05-06 Jakub Jelinek <jakub@redhat.com>
26101 * tree-ssa-phiopt.c (value_replacement, minmax_replacement,
26102 abs_replacement, xor_replacement,
26103 cond_removal_in_popcount_clz_ctz_pattern,
26104 replace_phi_edge_with_variable): Change type of phi argument from
26105 gimple * to gphi *.
26107 2021-05-06 Richard Biener <rguenther@suse.de>
26109 * tree-ssa-loop-split.c (split_loop): Delay updating SSA form.
26110 Output an opt-info message.
26111 (do_split_loop_on_cond): Likewise.
26112 (tree_ssa_split_loops): Update SSA form here.
26114 2021-05-06 Richard Biener <rguenther@suse.de>
26116 * tree-inline.c (tree_function_versioning): Fix DECL_BY_REFERENCE
26117 return variable removal.
26119 2021-05-06 Marius Hillenbrand <mhillen@linux.ibm.com>
26121 * config/s390/s390-builtins.def (O_M5, O1_M5, ...): Remove unused macros.
26122 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
26123 (s390_vec_permi_dbl, s390_vpdi): Use the O3_U2 type for the immediate
26125 * config/s390/s390.c (s390_const_operand_ok): Remove unused
26128 2021-05-06 Jakub Jelinek <jakub@redhat.com>
26130 PR tree-optimization/94589
26131 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Call
26132 spaceship_replacement.
26133 (cond_only_block_p, spaceship_replacement): New functions.
26135 2021-05-06 Richard Biener <rguenther@suse.de>
26138 * tree-emutls.c (gen_emutls_addr): Pass in whether we're
26139 dealing with a debug use and only query existing addresses
26141 (lower_emutls_1): Avoid splitting out addresses for debug
26142 stmts, reset the debug stmt when we fail to find existing
26144 (lower_emutls_phi_arg): Set wi.stmt.
26146 2021-05-06 Christoph Muellner <cmuellner@gcc.gnu.org>
26149 * config/riscv/riscv.c (riscv_block_move_loop): Use cbranch helper.
26150 * config/riscv/riscv.md (cbranch<mode>4): Generate helpers.
26151 (stack_protect_test): Use cbranch helper.
26153 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
26156 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
26157 always return the establisher frame for __builtin_frame_address (0).
26159 2021-05-05 Ivan Sorokin <vanyacpp@gmail.com>
26162 * config/i386/i386-builtins.c (ix86_cpu_model_type_node): New.
26163 (ix86_cpu_model_var): Likewise.
26164 (ix86_cpu_features2_type_node): Likewise.
26165 (ix86_cpu_features2_var): Likewise.
26166 (fold_builtin_cpu): Cache __cpu_model and __cpu_features2 with
26169 2021-05-05 Martin Sebor <msebor@redhat.com>
26171 * passes.def (pass_warn_printf): Run after SSA.
26173 2021-05-05 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
26175 * config/arm/neon.md (neon_vtst_combine<mode>): New pattern.
26176 * config/arm/predicates.md (minus_one_operand): New predicate.
26178 2021-05-05 Jeff Law <jlaw@tachyum.com>
26180 * config/avr/avr.md: Remove references to CC_STATUS_INIT.
26182 2021-05-05 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
26184 PR rtl-optimization/100263
26185 * postreload.c (move2add_valid_value_p): Ensure register can
26188 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
26190 PR rtl-optimization/100411
26191 * cfgcleanup.c (try_crossjump_to_edge): Also skip end of prologue
26192 and beginning of function markers.
26194 2021-05-05 Jeff Law <jlaw@tachyum.com>
26196 * config/cr16/cr16.h (NOTICE_UPDATE_CC): Remove.
26197 * config/cr16/cr16.c (notice_update_cc): Remove.
26198 * config/cr16/cr16-protos.h (notice_update_cc): Remove.
26200 2021-05-05 Uroš Bizjak <ubizjak@gmail.com>
26203 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
26204 Handle V8QI, V4HI and V2SI modes.
26205 * config/i386/i386.c (ix86_build_const_vector): Handle V2SImode.
26206 (ix86_build_signbit_mask): Ditto.
26207 * config/i386/mmx.md (MMXMODE14): New mode iterator.
26208 (<smaxmin:code><MMXMODE14:mode>3): New expander.
26209 (*mmx_<smaxmin:code><MMXMODE14:mode>3): New insn pattern.
26210 (<umaxmin:code><MMXMODE24:mode>3): New expander.
26211 (*mmx_<umaxmin:code><MMXMODE24:mode>3): New insn pattern.
26212 (vec_cmp<MMXMODEI:mode><MMXMODEI:mode>): New expander.
26213 (vec_cmpu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
26214 (vcond<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
26215 (vcondu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
26216 (vcond_mask_<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
26218 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
26220 * dwarf2out.c (loc_list_from_tree_1) <DECL>: During early DWARF, do
26221 not expand the VALUE_EXPR of variables put in the non-local frame.
26222 * gimplify.c (gimplify_type_sizes) <RECORD_TYPE>: If the type is not
26223 to be ignored for debug info, ensure its variable offsets are not.
26225 2021-05-05 Richard Biener <rguenther@suse.de>
26227 PR tree-optimization/79333
26228 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
26229 Fold stmt following SSA edges.
26231 2021-05-05 Richard Biener <rguenther@suse.de>
26233 PR middle-end/100394
26234 * calls.c (expand_call): Preserve possibly throwing calls.
26235 * cfgexpand.c (expand_call_stmt): When a call can throw, signal
26236 RTL expansion there are side-effects.
26237 * tree-ssa-dce.c (mark_stmt_if_obviously_necessary): Simplify,
26238 mark all possibly throwing stmts necessary unless we can elide
26240 * tree-ssa-dse.c (pass_dse::execute): Preserve exceptions unless
26241 -fdelete-dead-exceptions.
26242 * tree.h (DECL_PURE_P): Add note about exceptions.
26244 2021-05-05 Alexandre Oliva <oliva@adacore.com>
26246 * config/i386/vxworks.h (DBX_REGISTER_NUMBER): Make it
26249 2021-05-04 David Edelsohn <dje.gcc@gmail.com>
26251 * config/rs6000/rs6000-call.c (rs6000_output_mi_thunk): Use
26252 get_fnname_from_decl for name of thunk.
26253 * config/rs6000/rs6000.c (rs6000_declare_alias): Use assemble_name
26254 and ASM_OUTPUT_LABEL.
26255 (rs6000_xcoff_declare_function_name): Use assemble_name and
26257 (rs6000_xcoff_declare_object_name): Use ASM_OUTPUT_LABEL.
26258 (rs6000_xcoff_encode_section_info): Don't add mapping class
26259 for aliases. Always add [DS] mapping class to primary
26261 (rs6000_asm_weaken_decl): Don't explicitly add [DS].
26263 2021-05-04 Martin Sebor <msebor@redhat.com>
26265 PR middle-end/100307
26266 * builtins.c (compute_objsize_r): Clear base0 for pointers.
26268 2021-05-04 Jeff Law <jlaw@tachyum.com>
26270 * config/bfin/bfin.h (NOTICE_UPDATE_CC): Remove.
26272 2021-05-04 Segher Boessenkool <segher@kernel.crashing.org>
26274 * caller-save.c: Remove CC0.
26275 * cfgcleanup.c: Remove CC0.
26276 * cfgrtl.c: Remove CC0.
26277 * combine.c: Remove CC0.
26278 * compare-elim.c: Remove CC0.
26279 * conditions.h: Remove CC0.
26280 * config/h8300/h8300.h: Remove CC0.
26281 * config/h8300/h8300-protos.h: Remove CC0.
26282 * config/h8300/peepholes.md: Remove CC0.
26283 * config/i386/x86-tune-sched.c: Remove CC0.
26284 * config/m68k/m68k.c: Remove CC0.
26285 * config/rl78/rl78.c: Remove CC0.
26286 * config/sparc/sparc.c: Remove CC0.
26287 * config/xtensa/xtensa.c: Remove CC0.
26288 (gen_conditional_move): Use pc_rtx instead of cc0_rtx in a piece of
26289 RTL where that is used as a placeholder only.
26290 * cprop.c: Remove CC0.
26291 * cse.c: Remove CC0.
26292 * cselib.c: Remove CC0.
26293 * df-problems.c: Remove CC0.
26294 * df-scan.c: Remove CC0.
26295 * doc/md.texi: Remove CC0. Adjust an example.
26296 * doc/rtl.texi: Remove CC0. Adjust an example.
26297 * doc/tm.texi: Regenerate.
26298 * doc/tm.texi.in: Remove CC0.
26299 * emit-rtl.c: Remove CC0.
26300 * final.c: Remove CC0.
26301 * fwprop.c: Remove CC0.
26302 * gcse-common.c: Remove CC0.
26303 * gcse.c: Remove CC0.
26304 * genattrtab.c: Remove CC0.
26305 * genconfig.c: Remove CC0.
26306 * genemit.c: Remove CC0.
26307 * genextract.c: Remove CC0.
26308 * gengenrtl.c: Remove CC0.
26309 * genrecog.c: Remove CC0.
26310 * haifa-sched.c: Remove CC0.
26311 * ifcvt.c: Remove CC0.
26312 * ira-costs.c: Remove CC0.
26313 * ira.c: Remove CC0.
26314 * jump.c: Remove CC0.
26315 * loop-invariant.c: Remove CC0.
26316 * lra-constraints.c: Remove CC0.
26317 * lra-eliminations.c: Remove CC0.
26318 * optabs.c: Remove CC0.
26319 * postreload-gcse.c: Remove CC0.
26320 * postreload.c: Remove CC0.
26321 * print-rtl.c: Remove CC0.
26322 * read-rtl-function.c: Remove CC0.
26323 * reg-notes.def: Remove CC0.
26324 * reg-stack.c: Remove CC0.
26325 * reginfo.c: Remove CC0.
26326 * regrename.c: Remove CC0.
26327 * reload.c: Remove CC0.
26328 * reload1.c: Remove CC0.
26329 * reorg.c: Remove CC0.
26330 * resource.c: Remove CC0.
26331 * rtl.c: Remove CC0.
26332 * rtl.def: Remove CC0.
26333 * rtl.h: Remove CC0.
26334 * rtlanal.c: Remove CC0.
26335 * sched-deps.c: Remove CC0.
26336 * sched-rgn.c: Remove CC0.
26337 * shrink-wrap.c: Remove CC0.
26338 * simplify-rtx.c: Remove CC0.
26339 * system.h: Remove CC0. Poison NOTICE_UPDATE_CC, CC_STATUS_MDEP_INIT,
26340 CC_STATUS_MDEP, and CC_STATUS.
26341 * target.def: Remove CC0.
26342 * valtrack.c: Remove CC0.
26343 * var-tracking.c: Remove CC0.
26345 2021-05-04 Richard Biener <rguenther@suse.de>
26347 PR tree-optimization/100414
26348 * tree-ssa-phiopt.c (get_non_trapping): Do not compute dominance
26350 (tree_ssa_phiopt_worker): But unconditionally here.
26352 2021-05-04 Tobias Burnus <tobias@codesourcery.com>
26354 * omp-low.c (lower_rec_input_clauses, lower_reduction_clauses): Handle
26355 && and || with floating-point and complex arguments.
26357 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
26359 * tree-inline.c (insert_debug_decl_map): Delete.
26360 (copy_debug_stmt): Minor tweak.
26361 (setup_one_parameter): Do not use a variable if the value is either
26362 a read-only DECL or a non-addressable local variable in the caller.
26363 In this case, insert the debug-only variable in the map manually.
26364 (expand_call_inline): Do not generate a CLOBBER for these values.
26365 * tree-inline.h (debug_map): Minor tweak.
26367 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
26369 * builtins.c (builtin_with_linkage_p): Return true for stp[n]cpy.
26370 * symtab.c (symtab_node::output_to_lto_symbol_table_p): Tidy up.
26372 2021-05-04 Richard Biener <rguenther@suse.de>
26374 PR tree-optimization/100329
26375 * tree-ssa-reassoc.c (can_reassociate_p): Do not reassociate
26377 (insert_stmt_after): Assert we're not running into asm goto.
26379 2021-05-04 Richard Biener <rguenther@suse.de>
26381 PR tree-optimization/100398
26382 * tree-ssa-dse.c (pass_dse::execute): Preserve control
26385 2021-05-04 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
26387 * builtins.c (try_store_by_multiple_pieces): Fix constfun's prototype.
26389 2021-05-04 Alexandre Oliva <oliva@adacore.com>
26391 * builtins.c (try_store_by_multiple_pieces): New.
26392 (expand_builtin_memset_args): Use it. If target_char_cast
26393 fails, proceed as for non-constant val. Pass len's ctz to...
26394 * expr.c (clear_storage_hints): ... this. Try store by
26395 multiple pieces after setmem.
26396 (clear_storage): Adjust.
26397 * expr.h (clear_storage_hints): Likewise.
26398 (try_store_by_multiple_pieces): Declare.
26399 * passes.def: Replace the last copy_prop with ccp.
26401 2021-05-03 Tom de Vries <tdevries@suse.de>
26404 * omp-low.c (lower_rec_input_clauses): Disable SIMT for user-defined
26407 2021-05-03 Richard Biener <rguenther@suse.de>
26409 * tree-ssa-dse.c (dse_classify_store): Track two PHI defs.
26411 2021-05-03 Richard Biener <rguenther@suse.de>
26413 * tree-ssa-dse.c: Do not include domwalk.h but cfganal.h.
26414 (dse_dom_walker): Remove.
26415 (dse_dom_walker::dse_optimize_stmt): Rename...
26416 (dse_optimize_stmt): ... to this, pass in live_bytes sbitmap.
26417 (dse_dom_walker::before_dom_children): Inline ...
26418 (pass_dse::execute): ... here. Perform a reverse program
26421 2021-05-03 H.J. Lu <hjl.tools@gmail.com>
26424 * configure: Regenerated.
26426 2021-05-03 Ilya Leoshkevich <iii@linux.ibm.com>
26429 * config/s390/s390.c (s390_hard_fp_reg_p): New function.
26430 (s390_md_asm_adjust): Handle hard registers.
26432 2021-05-03 Jakub Jelinek <jakub@redhat.com>
26434 PR tree-optimization/100382
26435 * tree-ssa-dse.c: Include tree-eh.h.
26436 (dse_dom_walker::before_dom_children): Don't remove stmts if
26437 stmt_unremovable_because_of_non_call_eh_p is true.
26439 2021-05-02 David Edelsohn <dje.gcc@gmail.com>
26441 * varasm.c (compute_reloc_for_var): Split out from...
26442 (get_variable_section): Use it.
26443 * output.h (compute_reloc_for_var): Declare.
26444 * config/rs6000/rs6000-protos.h
26445 (rs6000_xcoff_asm_output_aligned_decl_common): Change alignment to
26447 * config/rs6000/rs6000.c (rs6000_legitimize_tls_address_aix):
26448 Don't append storage mapping class to symbol.
26449 (rs6000_xcoff_asm_named_section): Add BS and UL mapping classes.
26450 Don't convert TLS BSS to common.
26451 (rs6000_xcoff_unique_section): Don't fall back to select_section.
26452 (rs6000_xcoff_section_type_flags): Add SECTION_BSS if DECL is
26454 (rs6000_xcoff_asm_globalize_decl_name): Don't strip storage
26456 (rs6000_xcoff_asm_output_aligned_decl_common): Align is unsigned int.
26457 If align is 0 from TLS class, use the same rules as varasm.c.
26458 If not common, switch to BSS section manually.
26459 If common, emit appropriate comm or lcomm directive.
26460 (rs6000_xcoff_encode_section_info): Add logic to append all
26461 storage mapping classes.
26462 (rs6000_asm_weaken_decl): Adjust for qualname symbols.
26463 * config/rs6000/xcoff.h (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
26464 rs6000_xcoff_asm_output_aligned_decl_common.
26465 (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
26466 rs6000_xcoff_asm_output_aligned_decl_common.
26467 (ASM_OUTPUT_TLS_COMMON): Use
26468 rs6000_xcoff_asm_output_aligned_decl_common.
26470 2021-05-02 Jakub Jelinek <jakub@redhat.com>
26473 * config/nvptx/nvptx.c (nvptx_sese_pseudo): Use nullptr instead of 0
26474 as first argument of pseudo_node_t constructors.
26476 2021-05-02 Jakub Jelinek <jakub@redhat.com>
26479 * config/i386/t-i386 (TM_H): Add $(srcdir)/config/i386/i386-isa.def.
26481 2021-05-01 Aldy Hernandez <aldyh@redhat.com>
26483 * value-range.cc (DEFINE_INT_RANGE_GC_STUBS): Remove.
26484 (gt_pch_nx (int_range<1> *&)): New.
26485 (gt_ggc_mx (int_range<1> *&)): New.
26486 * value-range.h (class irange): Add GTY support for
26489 2021-05-01 Geng Qi <gengqi@linux.alibaba.com>
26491 * doc/options.texi (Negative): Change either or to both and.
26493 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26495 * config/aarch64/aarch64-simd-builtins.def: Add
26496 float_ml[as][q]_laneq builtin generator macros.
26497 * config/aarch64/aarch64-simd.md (mul_laneq<mode>3): Define.
26498 (aarch64_float_mla_laneq<mode>): Define.
26499 (aarch64_float_mls_laneq<mode>): Define.
26500 * config/aarch64/arm_neon.h (vmla_laneq_f32): Use RTL builtin
26501 instead of GCC vector extensions.
26502 (vmlaq_laneq_f32): Likewise.
26503 (vmls_laneq_f32): Likewise.
26504 (vmlsq_laneq_f32): Likewise.
26506 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26508 * config/aarch64/aarch64-simd-builtins.def: Add
26509 float_ml[as]_lane builtin generator macros.
26510 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt<mode>):
26512 (mul_lane<mode>3): This, and re-order arguments.
26513 (aarch64_float_mla_lane<mode>): Define.
26514 (aarch64_float_mls_lane<mode>): Define.
26515 * config/aarch64/arm_neon.h (vmla_lane_f32): Use RTL builtin
26516 instead of GCC vector extensions.
26517 (vmlaq_lane_f32): Likewise.
26518 (vmls_lane_f32): Likewise.
26519 (vmlsq_lane_f32): Likewise.
26521 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26523 * config/aarch64/aarch64-simd-builtins.def: Add float_ml[as]
26524 builtin generator macros.
26525 * config/aarch64/aarch64-simd.md (aarch64_float_mla<mode>):
26527 (aarch64_float_mls<mode>): Define.
26528 * config/aarch64/arm_neon.h (vmla_f32): Use RTL builtin
26529 instead of relying on GCC vector extensions.
26530 (vmla_f64): Likewise.
26531 (vmlaq_f32): Likewise.
26532 (vmlaq_f64): Likewise.
26533 (vmls_f32): Likewise.
26534 (vmls_f64): Likewise.
26535 (vmlsq_f32): Likewise.
26536 (vmlsq_f64): Likewise.
26537 * config/aarch64/iterators.md: Define VDQF_DF mode iterator.
26539 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26541 * config/aarch64/aarch64-simd-builtins.def: Add
26542 float_ml[as]_n builtin generator macros.
26543 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt_from_dup<mode>):
26545 (mul_n<mode>3): This, and re-order arguments.
26546 (aarch64_float_mla_n<mode>): Define.
26547 (aarch64_float_mls_n<mode>): Define.
26548 * config/aarch64/arm_neon.h (vmla_n_f32): Use RTL builtin
26549 instead of inline asm.
26550 (vmlaq_n_f32): Likewise.
26551 (vmls_n_f32): Likewise.
26552 (vmlsq_n_f32): Likewise.
26554 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26556 * config/aarch64/aarch64-simd-builtins.def: Add pmull[2]
26557 builtin generator macros.
26558 * config/aarch64/aarch64-simd.md (aarch64_pmullv8qi): Define.
26559 (aarch64_pmull_hiv16qi_insn): Define.
26560 (aarch64_pmull_hiv16qi): Define.
26561 * config/aarch64/arm_neon.h (vmull_high_p8): Use RTL builtin
26562 instead of inline asm.
26563 (vmull_p8): Likewise.
26565 2021-04-30 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
26567 * config/avr/avr.md: Adjust peepholes to match and
26568 generate parallels with clobber of REG_CC.
26569 (mov<mode>_insn): Rename to mov<mode>_insn_split.
26570 (*mov<mode>_insn): Rename to mov<mode>_insn.
26572 2021-04-30 David Edelsohn <dje.gcc@gmail.com>
26574 * varasm.c (use_blocks_for_decl_p): Don't use section anchors
26575 for VAR_DECLs if -fdata-sections enabled.
26577 2021-04-30 Michael Meissner <meissner@linux.ibm.com>
26579 PR bootstrap/100327
26580 * config/rs6000/rs6000.c
26581 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Define.
26582 (rs6000_libgcc_floating_mode_supported_p): New target hook.
26584 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
26586 * tree-ssa-threadbackward.c (class thread_jumps): Split out code
26588 (class back_threader_registry): ...to here...
26589 (class back_threader_profitability): ...and here...
26590 (thread_jumps::thread_through_all_blocks): Remove argument.
26591 (back_threader_registry::back_threader_registry): New.
26592 (back_threader_registry::~back_threader_registry): New.
26593 (back_threader_registry::thread_through_all_blocks): New.
26594 (thread_jumps::profitable_jump_thread_path): Move from here...
26595 (back_threader_profitability::profitable_path_p): ...to here.
26596 (thread_jumps::find_taken_edge): New.
26597 (thread_jumps::convert_and_register_current_path): Move...
26598 (back_threader_registry::register_path): ...to here.
26599 (thread_jumps::register_jump_thread_path_if_profitable): Move...
26600 (thread_jumps::maybe_register_path): ...to here.
26601 (thread_jumps::handle_phi): Call find_taken_edge and
26602 maybe_register_path.
26603 (thread_jumps::handle_assignment): Same.
26604 (thread_jumps::fsm_find_control_statement_thread_paths): Remove
26605 tree argument to handle_phi and handle_assignment.
26606 (thread_jumps::find_jump_threads_backwards): Set m_name. Remove
26607 set of m_speed_p and m_max_threaded_paths.
26608 (pass_thread_jumps::execute): Remove second argument from
26609 find_jump_threads_backwards.
26610 (pass_early_thread_jumps::execute): Same.
26612 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
26614 * tree-ssa-dom.c (class dom_jump_threader_simplifier): New.
26615 (class dom_opt_dom_walker): Initialize some class variables.
26616 (pass_dominator::execute): Pass evrp_range_analyzer and
26617 dom_jump_threader_simplifier to dom_opt_dom_walker.
26618 Adjust for some functions moving into classes.
26619 (simplify_stmt_for_jump_threading): Adjust and move to...
26620 (jump_threader_simplifier::simplify): ...here.
26621 (dom_opt_dom_walker::before_dom_children): Adjust for
26622 m_evrp_range_analyzer.
26623 (dom_opt_dom_walker::after_dom_children): Remove x_vr_values hack.
26624 (test_for_singularity): Place in dom_opt_dom_walker class.
26625 (dom_opt_dom_walker::optimize_stmt): The argument
26626 evrp_range_analyzer is now a class field.
26627 * tree-ssa-threadbackward.c (class thread_jumps): Add m_registry.
26628 (thread_jumps::thread_through_all_blocks): New.
26629 (thread_jumps::convert_and_register_current_path): Use m_registry.
26630 (pass_thread_jumps::execute): Adjust for thread_through_all_blocks
26631 being in the threader class.
26632 (pass_early_thread_jumps::execute): Same.
26633 * tree-ssa-threadedge.c (threadedge_initialize_values): Move...
26634 (jump_threader::jump_threader): ...here.
26635 (threadedge_finalize_values): Move...
26636 (jump_threader::~jump_threader): ...here.
26637 (jump_threader::remove_jump_threads_including): New.
26638 (jump_threader::thread_through_all_blocks): New.
26639 (record_temporary_equivalences_from_phis): Move...
26640 (jump_threader::record_temporary_equivalences_from_phis): ...here.
26641 (record_temporary_equivalences_from_stmts_at_dest): Move...
26642 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
26644 (simplify_control_stmt_condition_1): Move to jump_threader class.
26645 (simplify_control_stmt_condition): Move...
26646 (jump_threader::simplify_control_stmt_condition): ...here.
26647 (thread_around_empty_blocks): Move...
26648 (jump_threader::thread_around_empty_blocks): ...here.
26649 (thread_through_normal_block): Move...
26650 (jump_threader::thread_through_normal_block): ...here.
26651 (thread_across_edge): Move...
26652 (jump_threader::thread_across_edge): ...here.
26653 (thread_outgoing_edges): Move...
26654 (jump_threader::thread_outgoing_edges): ...here.
26655 * tree-ssa-threadedge.h: Move externally facing functions...
26656 (class jump_threader): ...here...
26657 (class jump_threader_simplifier): ...and here.
26658 * tree-ssa-threadupdate.c (struct redirection_data): Remove comment.
26659 (jump_thread_path_allocator::jump_thread_path_allocator): New.
26660 (jump_thread_path_allocator::~jump_thread_path_allocator): New.
26661 (jump_thread_path_allocator::allocate_thread_edge): New.
26662 (jump_thread_path_allocator::allocate_thread_path): New.
26663 (jump_thread_path_registry::jump_thread_path_registry): New.
26664 (jump_thread_path_registry::~jump_thread_path_registry): New.
26665 (jump_thread_path_registry::allocate_thread_edge): New.
26666 (jump_thread_path_registry::allocate_thread_path): New.
26667 (dump_jump_thread_path): Make extern.
26668 (debug (const vec<jump_thread_edge *> &path)): New.
26669 (struct removed_edges): Move to tree-ssa-threadupdate.h.
26670 (struct thread_stats_d): Remove.
26671 (remove_ctrl_stmt_and_useless_edges): Make static.
26672 (lookup_redirection_data): Move...
26673 (jump_thread_path_registry::lookup_redirection_data): ...here.
26674 (ssa_redirect_edges): Make static.
26675 (thread_block_1): Move...
26676 (jump_thread_path_registry::thread_block_1): ...here.
26677 (thread_block): Move...
26678 (jump_thread_path_registry::thread_block): ...here.
26679 (thread_through_loop_header): Move...
26680 (jump_thread_path_registry::thread_through_loop_header): ...here.
26681 (mark_threaded_blocks): Move...
26682 (jump_thread_path_registry::mark_threaded_blocks): ...here.
26683 (debug_path): Move...
26684 (jump_thread_path_registry::debug_path): ...here.
26685 (debug_all_paths): Move...
26686 (jump_thread_path_registry::dump): ...here.
26687 (rewire_first_differing_edge): Move...
26688 (jump_thread_path_registry::rewire_first_differing_edge): ...here.
26689 (adjust_paths_after_duplication): Move...
26690 (jump_thread_path_registry::adjust_paths_after_duplication): ...here.
26691 (duplicate_thread_path): Move...
26692 (jump_thread_path_registry::duplicate_thread_path): ...here.
26693 (remove_jump_threads_including): Move...
26694 (jump_thread_path_registry::remove_jump_threads_including): ...here.
26695 (thread_through_all_blocks): Move to...
26696 (jump_thread_path_registry::thread_through_all_blocks): ...here.
26697 (delete_jump_thread_path): Remove.
26698 (register_jump_thread): Move...
26699 (jump_thread_path_registry::register_jump_thread): ...here.
26700 * tree-ssa-threadupdate.h: Move externally facing functions...
26701 (class jump_thread_path_allocator): ...here...
26702 (class jump_thread_path_registry): ...and here.
26703 (thread_through_all_blocks): Remove.
26704 (struct removed_edges): New.
26705 (register_jump_thread): Remove.
26706 (remove_jump_threads_including): Remove.
26707 (delete_jump_thread_path): Remove.
26708 (remove_ctrl_stmt_and_useless_edges): Remove.
26709 (free_dom_edge_info): New prototype.
26710 * tree-vrp.c: Remove x_vr_values hack.
26711 (class vrp_jump_threader_simplifier): New.
26712 (vrp_jump_threader_simplifier::simplify): New.
26713 (vrp_jump_threader::vrp_jump_threader): Adjust method signature.
26714 Remove m_dummy_cond.
26715 Instantiate m_simplifier and m_threader.
26716 (vrp_jump_threader::thread_through_all_blocks): New.
26717 (vrp_jump_threader::simplify_stmt): Remove.
26718 (vrp_jump_threader::after_dom_children): Do not set m_dummy_cond.
26719 Remove x_vr_values hack.
26720 (execute_vrp): Adjust for thread_through_all_blocks being in a
26723 2021-04-30 Christophe Lyon <christophe.lyon@linaro.org>
26725 * genflags.c (gen_insn): Print failed expansion string.
26727 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
26729 * expr.c (alignment_for_piecewise_move): Call mode_for_size
26730 without limit to MAX_FIXED_MODE_SIZE.
26732 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
26734 PR middle-end/90773
26735 * builtins.c (builtin_memset_gen_str): Don't use return from
26736 simplify_gen_subreg.
26738 2021-04-30 Uroš Bizjak <ubizjak@gmail.com>
26741 * config/i386/i386.md (*add<mode>3_carry_0r): New insn pattern.
26742 (*addsi3_carry_zext_0r): Ditto.
26743 (*sub<mode>3_carry_0): Ditto.
26744 (*subsi3_carry_zext_0r): Ditto.
26745 * config/i386/predicates.md (ix86_carry_flag_unset_operator):
26747 * config/i386/i386.c (ix86_rtx_costs) <case PLUS, case MINUS>:
26748 Also consider ix86_carry_flag_unset_operator to calculate
26749 the cost of adc/sbb insn.
26751 2021-04-30 Roman Zhuykov <zhroma@ispras.ru>
26753 PR rtl-optimization/100225
26754 PR rtl-optimization/84878
26755 * modulo-sched.c (sms_schedule): Use note_stores to skip loops
26756 where we have an instruction which touches (writes) any hard
26757 register from df->regular_block_artificial_uses set.
26758 Allow not-single-set instruction only right before basic block
26761 2021-04-30 Geng Qi <gengqi@linux.alibaba.com>
26763 * config/riscv/riscv.opt (march=,mabi=): Negative itself.
26765 2021-04-30 LevyHsu <admin@levyhsu.com>
26767 * config/riscv/riscv.c (riscv_min_arithmetic_precision): New.
26768 * config/riscv/riscv.h (TARGET_MIN_ARITHMETIC_PRECISION): New.
26769 * config/riscv/riscv.md (addv<mode>4, uaddv<mode>4): New.
26770 (subv<mode>4, usubv<mode>4, mulv<mode>4, umulv<mode>4): New.
26772 2021-04-29 Alexandre Oliva <oliva@adacore.com>
26774 * config.gcc: Merged x86 and x86_64 cpu_type-setting cases.
26776 2021-04-29 Alexandre Oliva <oliva@adacore.com>
26778 * config/i386/i386.h (ASM_OUTPUT_MAX_SKIP_PAD): Rename to...
26779 (ASM_OUTPUT_MAX_SKIP_ALIGN): ...this. Enclose in do/while(0).
26780 * config/i386/i386.c: Adjust.
26781 * config/i386/i386.md: Adjust.
26782 * config/i386/darwin.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Drop.
26783 * config/i386/dragonfly.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26784 * config/i386/freebsd.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26785 * config/i386/gas.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26786 * config/i386/gnu-user.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26787 * config/i386/iamcu.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26788 * config/i386/lynx.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26789 * config/i386/netbsd-elf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26790 * config/i386/openbsdelf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26791 * config/i386/x86-64.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26792 (ASM_OUTPUT_MAX_SKIP_PAD): Likewise.
26794 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26796 * config/i386/i386-expand.c (ix86_expand_int_compare):
26797 Swap operands of GTU and LEU comparison to emit carry flag comparison.
26798 * config/i386/i386.md (*add<mode>3_carry_0): Change insn
26799 predicate to allow more combine opportunities with memory operands.
26800 (*sub<mode>3_carry_0): Ditto.
26802 2021-04-29 Richard Sandiford <richard.sandiford@arm.com>
26804 PR rtl-optimization/100303
26805 * rtl-ssa/accesses.cc (function_info::make_use_available): Take a
26806 boolean that indicates whether the use will only be used in
26807 debug instructions. Treat it in the same way that existing
26808 cross-EBB debug references would be handled if so.
26809 (function_info::make_uses_available): Likewise.
26810 * rtl-ssa/functions.h (function_info::make_use_available): Update
26811 prototype accordingly.
26812 (function_info::make_uses_available): Likewise.
26813 * fwprop.c (try_fwprop_subst): Update call accordingly.
26815 2021-04-29 Jeff Law <jlaw@tachyum.com>
26817 * config/nios2/nios2-protos.h (nios2_fpu_insn_enabled): Move outside
26820 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26821 Richard Biener <rguenther@suse.de>
26824 * config/i386/i386-builtin.def (IX86_BUILTIN_MASKLOADPD)
26825 (IX86_BUILTIN_MASKLOADPS, IX86_BUILTIN_MASKLOADPD256)
26826 (IX86_BUILTIN_MASKLOADPS256, IX86_BUILTIN_MASKLOADD)
26827 (IX86_BUILTIN_MASKLOADQ, IX86_BUILTIN_MASKLOADD256)
26828 (IX86_BUILTIN_MASKLOADQ256): Move from SPECIAL_ARGS
26829 to PURE_ARGS category.
26830 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
26831 Handle PURE_ARGS category.
26832 * config/i386/i386-expand.c (ix86_expand_builtin): Ditto.
26834 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
26836 * configure.ac: Check for the presence of sys/locking.h header and
26837 for whether _LK_LOCK is supported by _locking.
26838 * configure: Regenerate.
26839 * config.in: Likewise.
26840 * gcov-io.h: Define GCOV_LOCKED_WITH_LOCKING if HOST_HAS_LK_LOCK.
26841 * gcov-io.c (gcov_open): Add support for GCOV_LOCKED_WITH_LOCKING.
26842 * system.h: Include <sys/locking.h> if HAVE_SYS_LOCKING_H.
26844 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26846 * config/i386/predicates.md (fcmov_comparison_operator):
26847 Do not check for trivial FP comparison operator.
26848 <case GEU, case LTU>: Allow CCGZmode.
26849 <case GTU, case LEU>: Do not allow CCCmode.
26850 (ix86_comparison_operator) <case GTU, case LEU>: Allow only CCmode.
26851 (ix86_carry_flag_operator): Match only LTU and UNLT code.
26852 Do not check for trivial FP comparison operator. Allow CCGZmode.
26854 2021-04-29 Tom de Vries <tdevries@suse.de>
26856 * omp-expand.c (expand_omp_simd): Add step_orig, and replace uses of
26857 fd->loop.step by either step or orig_step.
26859 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
26861 * config/sparc/sparc.c (gen_load_pcrel_sym): Delete.
26862 (load_got_register): Do the PIC dance here.
26863 (sparc_legitimize_tls_address): Simplify.
26864 (sparc_emit_probe_stack_range): Likewise.
26865 (sparc32_initialize_trampoline): Likewise.
26866 (sparc64_initialize_trampoline): Likewise.
26867 * config/sparc/sparc.md (load_pcrel_sym<P:mode>): Add @ marker.
26868 (probe_stack_range<P:mode>): Likewise.
26869 (flush<P:mode>): Likewise.
26870 (tgd_hi22<P:mode>): Likewise.
26871 (tgd_lo10<P:mode>): Likewise.
26872 (tgd_add<P:mode>): Likewise.
26873 (tgd_call<P:mode>): Likewise.
26874 (tldm_hi22<P:mode>): Likewise.
26875 (tldm_lo10<P:mode>): Likewise.
26876 (tldm_add<P:mode>): Likewise.
26877 (tldm_call<P:mode>): Likewise.
26878 (tldo_hix22<P:mode>): Likewise.
26879 (tldo_lox10<P:mode>): Likewise.
26880 (tldo_add<P:mode>): Likewise.
26881 (tie_hi22<P:mode>): Likewise.
26882 (tie_lo10<P:mode>): Likewise.
26883 (tie_add<P:mode>): Likewise.
26884 (tle_hix22<P:mode>): Likewise.
26885 (tle_lox10<P:mode>): Likewise.
26886 (stack_protect_setsi): Rename to...
26887 (stack_protect_set32): ...this.
26888 (stack_protect_setdi): Rename to...
26889 (stack_protect_set64): ...this.
26890 (stack_protect_set): Adjust calls to above.
26891 (stack_protect_testsi): Rename to...
26892 (stack_protect_test32): ...this.
26893 (stack_protect_testdi): Rename to...
26894 (stack_protect_test64): ...this.
26895 (stack_protect_test): Adjust calls to above.
26897 2021-04-29 H.J. Lu <hjl.tools@gmail.com>
26899 PR middle-end/90773
26900 * builtins.c (builtin_memcpy_read_str): Add a dummy argument.
26901 (builtin_strncpy_read_str): Likewise.
26902 (builtin_memset_read_str): Add an argument for the previous RTL
26903 information and generate the new RTL from the previous RTL info.
26904 (builtin_memset_gen_str): Likewise.
26905 * builtins.h (builtin_strncpy_read_str): Update the prototype.
26906 (builtin_memset_read_str): Likewise.
26907 * expr.c (by_pieces_ninsns): If targetm.overlap_op_by_pieces_p()
26908 returns true, round up size and alignment to the widest integer
26909 mode for maximum size.
26910 (pieces_addr::adjust): Add a pointer to by_pieces_prev argument
26911 and pass it to m_constfn.
26912 (op_by_pieces_d): Add m_push and m_overlap_op_by_pieces.
26913 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
26914 initialize m_push. Initialize m_overlap_op_by_pieces with
26915 targetm.overlap_op_by_pieces_p ().
26916 (op_by_pieces_d::run): Pass the previous RTL information to
26917 pieces_addr::adjust and generate overlapping operations if
26918 m_overlap_op_by_pieces is true.
26920 (move_by_pieces_d::move_by_pieces_d): Updated for op_by_pieces_d
26922 (store_by_pieces_d::store_by_pieces_d): Updated for op_by_pieces_d
26924 (can_store_by_pieces): Use by_pieces_constfn on constfun.
26925 (store_by_pieces): Use by_pieces_constfn on constfun. Updated
26926 for op_by_pieces_d change.
26927 (clear_by_pieces_1): Add a dummy argument.
26928 (clear_by_pieces): Updated for op_by_pieces_d change.
26929 (compare_by_pieces_d::compare_by_pieces_d): Likewise.
26930 (string_cst_read_str): Add a dummy argument.
26931 * expr.h (by_pieces_constfn): Add a dummy argument.
26932 (by_pieces_prev): New.
26933 * target.def (overlap_op_by_pieces_p): New target hook.
26934 * config/i386/i386.c (TARGET_OVERLAP_OP_BY_PIECES_P): New.
26935 * doc/tm.texi.in: Add TARGET_OVERLAP_OP_BY_PIECES_P.
26936 * doc/tm.texi: Regenerated.
26938 2021-04-29 Richard Biener <rguenther@suse.de>
26940 PR tree-optimization/100253
26941 * tree-vect-stmts.c (vectorizable_load): Do not assume
26942 element alignment when DR_MISALIGNMENT is -1.
26943 (vectorizable_store): Likewise.
26945 2021-04-29 Jakub Jelinek <jakub@redhat.com>
26948 * config/aarch64/aarch64.c (aarch64_add_offset_1_temporaries): Use
26949 absu_hwi instead of abs_hwi.
26951 2021-04-29 Richard Biener <rguenther@suse.de>
26953 PR middle-end/38474
26954 * tree-ssa-structalias.c (add_graph_edge): Avoid direct
26955 forwarding when indirect forwarding through ESCAPED
26958 2021-04-29 Tom de Vries <tdevries@suse.de>
26961 * internal-fn.c (expand_GOMP_SIMT_ENTER_ALLOC)
26962 (expand_GOMP_SIMT_LAST_LANE, expand_GOMP_SIMT_ORDERED_PRED)
26963 (expand_GOMP_SIMT_VOTE_ANY, expand_GOMP_SIMT_XCHG_BFLY)
26964 (expand_GOMP_SIMT_XCHG_IDX): Ensure target is assigned to.
26966 2021-04-29 Richard Biener <rguenther@suse.de>
26968 PR tree-optimization/99912
26969 * tree-ssa-dse.c (dse_dom_walker::m_need_cfg_cleanup): New.
26970 (dse_dom_walker::todo): Likewise.
26971 (dse_dom_walker::dse_optimize_stmt): Move VDEF check to the
26973 (dse_dom_walker::before_dom_children): Remove trivially
26974 dead SSA defs and schedule CFG cleanup if we removed all
26976 (pass_dse::execute): Get TODO as computed by the DOM walker
26977 and return it. Wipe dominator info earlier.
26979 2021-04-29 Richard Biener <rguenther@suse.de>
26982 * ipa-prop.c (ipcp_modif_dom_walker::before_dom_children):
26983 Track blocks to cleanup EH in new m_need_eh_cleanup.
26984 (ipcp_modif_dom_walker::cleanup_eh): New.
26985 (ipcp_transform_function): Release dominator info before
26988 2021-04-29 Martin Sebor <msebor@redhat.com>
26990 PR middle-end/100250
26991 * attribs.c (attr_access::array_as_string): Avoid dereferencing
26992 a pointer when it's null.
26994 2021-04-29 Martin Sebor <msebor@redhat.com>
26996 * Makefile.in (OBJS): Add ipa-free-lang-data.o.
26997 * ipa-free-lang-data.cc: New file.
26998 * tree.c: Move pass free_lang_data to file above.
26999 (build_array_type_1): Declare extern.
27000 * tree.h (build_array_type_1): Declare.
27002 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27004 * config/aarch64/aarch64-simd-builtins.def: Modify comment to
27005 make consistent with updated RTL pattern.
27006 * config/aarch64/aarch64-simd.md (aarch64_<sur>qmovn<mode>):
27007 Implement using ss_truncate and us_truncate rather than
27009 * config/aarch64/iterators.md: Remove redundant unspecs and
27010 iterator: UNSPEC_[SU]QXTN and SUQMOVN respectively.
27012 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27014 * config/aarch64/arm_acle.h (__attribute__): Make intrinsic
27015 attributes consistent with those defined in arm_neon.h.
27017 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27019 * config/aarch64/arm_fp16.h (__attribute__): Make intrinsic
27020 attributes consistent with those defined in arm_neon.h.
27022 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27024 * config/aarch64/aarch64-simd-builtins.def: Add
27025 float_trunc_rodd builtin generator macros.
27026 * config/aarch64/aarch64-simd.md (aarch64_float_trunc_rodd_df):
27028 (aarch64_float_trunc_rodd_lo_v2sf): Define.
27029 (aarch64_float_trunc_rodd_hi_v4sf_le): Define.
27030 (aarch64_float_trunc_rodd_hi_v4sf_be): Define.
27031 (aarch64_float_trunc_rodd_hi_v4sf): Define.
27032 * config/aarch64/arm_neon.h (vcvtx_f32_f64): Use RTL builtin
27033 instead of inline asm.
27034 (vcvtx_high_f32_f64): Likewise.
27035 (vcvtxd_f32_f64): Likewise.
27036 * config/aarch64/iterators.md: Add FCVTXN unspec.
27038 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27040 * config/aarch64/aarch64-simd-builtins.def: Add tbx1 builtin
27042 * config/aarch64/aarch64-simd.md (aarch64_tbx1<mode>):
27044 * config/aarch64/arm_neon.h (vqtbx1_s8): Use RTL builtin
27045 instead of inline asm.
27046 (vqtbx1_u8): Likewise.
27047 (vqtbx1_p8): Likewise.
27048 (vqtbx1q_s8): Likewise.
27049 (vqtbx1q_u8): Likewise.
27050 (vqtbx1q_p8): Likewise.
27051 (vtbx2_s8): Likewise.
27052 (vtbx2_u8): Likewise.
27053 (vtbx2_p8): Likewise.
27055 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27057 * config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
27059 * config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
27060 instead of inline asm.
27061 (vqtbl1_s8): Likewise.
27062 (vqtbl1_u8): Likewise.
27063 (vqtbl1q_p8): Likewise.
27064 (vqtbl1q_s8): Likewise.
27065 (vqtbl1q_u8): Likewise.
27066 (vtbl1_s8): Likewise.
27067 (vtbl1_u8): Likewise.
27068 (vtbl1_p8): Likewise.
27069 (vtbl2_s8): Likewise.
27070 (vtbl2_u8): Likewise.
27071 (vtbl2_p8): Likewise.
27073 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27075 * config/aarch64/aarch64-simd-builtins.def: Add polynomial
27076 ssri_n builtin generator macro.
27077 * config/aarch64/arm_neon.h (vsri_n_p8): Use RTL builtin
27078 instead of inline asm.
27079 (vsri_n_p16): Likewise.
27080 (vsri_n_p64): Likewise.
27081 (vsriq_n_p8): Likewise.
27082 (vsriq_n_p16): Likewise.
27083 (vsriq_n_p64): Likewise.
27085 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27087 * config/aarch64/aarch64-simd-builtins.def: Use VALLP mode
27088 iterator for polynomial ssli_n builtin generator macro.
27089 * config/aarch64/arm_neon.h (vsli_n_p8): Use RTL builtin
27090 instead of inline asm.
27091 (vsli_n_p16): Likewise.
27092 (vsliq_n_p8): Likewise.
27093 (vsliq_n_p16): Likewise.
27094 * config/aarch64/iterators.md: Define VALLP mode iterator.
27096 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27098 * config/aarch64/aarch64-simd-builtins.def: Use VDQV_L
27099 iterator to generate [su]adalp RTL builtins.
27100 * config/aarch64/aarch64-simd.md: Use VDQV_L iterator in
27101 [su]adalp RTL pattern.
27102 * config/aarch64/arm_neon.h (vpadal_s32): Use RTL builtin
27103 instead of inline asm.
27104 (vpadal_u32): Likewise.
27106 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27108 * config/aarch64/aarch64-simd-builtins.def: Add [su]addlp
27109 builtin generator macros.
27110 * config/aarch64/aarch64-simd.md (aarch64_<su>addlp<mode>):
27112 * config/aarch64/arm_neon.h (vpaddl_s8): Use RTL builtin
27113 instead of inline asm.
27114 (vpaddl_s16): Likewise.
27115 (vpaddl_s32): Likewise.
27116 (vpaddl_u8): Likewise.
27117 (vpaddl_u16): Likewise.
27118 (vpaddl_u32): Likewise.
27119 (vpaddlq_s8): Likewise.
27120 (vpaddlq_s16): Likewise.
27121 (vpaddlq_s32): Likewise.
27122 (vpaddlq_u8): Likewise.
27123 (vpaddlq_u16): Likewise.
27124 (vpaddlq_u32): Likewise.
27125 * config/aarch64/iterators.md: Define [SU]ADDLP unspecs with
27126 appropriate attributes.
27128 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27130 * config/aarch64/aarch64-simd-builtins.def: Use VDQ_I iterator
27131 for aarch64_addp<mode> builtin macro generator.
27132 * config/aarch64/aarch64-simd.md: Use VDQ_I iterator in
27133 aarch64_addp<mode> RTL pattern.
27134 * config/aarch64/arm_neon.h (vpaddq_s8): Use RTL builtin
27135 instead of inline asm.
27136 (vpaddq_s16): Likewise.
27137 (vpaddq_s32): Likewise.
27138 (vpaddq_s64): Likewise.
27139 (vpaddq_u8): Likewise.
27140 (vpaddq_u16): Likewise.
27141 (vpaddq_u32): Likewise.
27142 (vpaddq_u64): Likewise.
27144 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27146 * config/aarch64/aarch64-simd-builtins.def: Add sq[r]dmulh_n
27147 builtin generator macros.
27148 * config/aarch64/aarch64-simd.md (aarch64_sq<r>dmulh_n<mode>):
27150 * config/aarch64/arm_neon.h (vqdmulh_n_s16): Use RTL builtin
27151 instead of inline asm.
27152 (vqdmulh_n_s32): Likewise.
27153 (vqdmulhq_n_s16): Likewise.
27154 (vqdmulhq_n_s32): Likewise.
27155 (vqrdmulh_n_s16): Likewise.
27156 (vqrdmulh_n_s32): Likewise.
27157 (vqrdmulhq_n_s16): Likewise.
27158 (vqrdmulhq_n_s32): Likewise.
27160 2021-04-28 Tobias Burnus <tobias@codesourcery.com>
27162 * doc/install.texi (--enable-offload-defaulted): Document.
27164 2021-04-28 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
27166 * config/avr/avr-dimode.md: Turn existing patterns into
27167 define_insn_and_split style patterns where the splitter
27168 adds a clobber of the condition code register. Drop "cc"
27169 attribute. Add new patterns to match output of
27171 * config/avr/avr-fixed.md: Likewise.
27172 * config/avr/avr.c (cc_reg_rtx): New.
27173 (avr_parallel_insn_from_insns): Adjust insn count
27174 for removal of set of cc0.
27175 (avr_is_casesi_sequence): Likewise.
27176 (avr_casei_sequence_check_operands): Likewise.
27177 (avr_optimize_casesi): Likewise. Also insert
27178 new insns after jump_insn.
27179 (avr_pass_casesi::avr_rest_of_handle_casesi): Adjust
27180 for removal of set of cc0.
27181 (avr_init_expanders): Initialize cc_reg_rtx.
27182 (avr_regno_reg_class): Handle REG_CC.
27183 (cond_string): Remove usage of CC_OVERFLOW_UNUSABLE.
27184 (avr_notice_update_cc): Remove function.
27185 (ret_cond_branch): Remove usage of CC_OVERFLOW_UNUSABLE.
27186 (compare_condition): Adjust for PARALLEL with
27188 (out_shift_with_cnt): Likewise.
27189 (ashlhi3_out): Likewise.
27190 (ashrhi3_out): Likewise.
27191 (lshrhi3_out): Likewise.
27192 (avr_class_max_nregs): Return single reg for REG_CC.
27193 (avr_compare_pattern): Check for REG_CC instead
27195 (avr_reorg_remove_redundant_compare): Likewise.
27196 (avr_reorg): Adjust for PARALLEL with REG_CC clobber.
27197 (avr_hard_regno_nregs): Return single reg for REG_CC.
27198 (avr_hard_regno_mode_ok): Allow only CCmode for REG_CC.
27199 (avr_md_asm_adjust): Clobber REG_CC.
27200 (TARGET_HARD_REGNO_NREGS): Define.
27201 (TARGET_CLASS_MAX_NREGS): Define.
27202 (TARGET_MD_ASM_ADJUST): Define.
27203 * config/avr/avr.h (FIRST_PSEUDO_REGISTER): Adjust
27205 (enum reg_class): Add CC_REG class.
27206 (NOTICE_UPDATE_CC): Remove.
27207 (CC_OVERFLOW_UNUSABLE): Remove.
27208 (CC_NO_CARRY): Remove.
27209 * config/avr/avr.md: Turn existing patterns into
27210 define_insn_and_split style patterns where the splitter
27211 adds a clobber of the condition code register. Drop "cc"
27212 attribute. Add new patterns to match output of
27214 (sez): Remove unused pattern.
27216 2021-04-28 Richard Earnshaw <rearnsha@arm.com>
27219 * config/arm/arm.c (arm_hard_regno_mode_ok): Only allow VPR to be
27222 2021-04-28 Richard Sandiford <richard.sandiford@arm.com>
27225 * config/aarch64/constraints.md (Utq): Require the address to
27226 be valid for both the element mode and for V2DImode.
27228 2021-04-28 Jakub Jelinek <jakub@redhat.com>
27229 Tobias Burnus <tobias@codesourcery.com>
27231 * configure.ac (OFFLOAD_DEFAULTED): AC_DEFINE if offload-defaulted.
27232 * gcc.c (process_command): New variable.
27233 (driver::maybe_putenv_OFFLOAD_TARGETS): If OFFLOAD_DEFAULTED,
27234 set it if -foffload is defaulted.
27235 * lto-wrapper.c (OFFLOAD_TARGET_DEFAULT_ENV): Define.
27236 (compile_offload_image): If OFFLOAD_DEFAULTED and
27237 OFFLOAD_TARGET_DEFAULT is in the environment, don't fail
27238 if corresponding mkoffload can't be found.
27239 (compile_images_for_offload_targets): Likewise. Free and clear
27240 offload_names if no valid offload is found.
27241 * config.in: Regenerate.
27242 * configure: Regenerate.
27244 2021-04-28 Richard Biener <rguenther@suse.de>
27246 PR tree-optimization/100292
27247 * tree-vect-generic.c (expand_vector_condition): Do not fold
27250 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
27252 * config/rs6000/aix.h (SUBTARGET_DRIVER_SELF_SPECS): New.
27253 * config/rs6000/aix64.opt (m64): New.
27256 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
27258 * config/vax/vax.c (print_operand_address, vax_address_cost_1)
27259 (index_term_p): Handle ASHIFT too.
27261 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
27263 * config/vax/builtins.md (jbb<ccss>i<mode>): Remove operand #3.
27264 (sync_lock_test_and_set<mode>): Adjust accordingly.
27265 (sync_lock_release<mode>): Likewise.
27267 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
27269 * config/vax/vax-protos.h (adjacent_operands_p): Remove
27271 * config/vax/vax.c (adjacent_operands_p): Remove.
27273 2021-04-27 Maciej W. Rozycki <macro@linux-mips.org>
27275 * ifcvt.c (dead_or_predicable) [!IFCVT_MODIFY_TESTS]: Fall
27276 through to the non-conditional execution case if getting the
27277 condition for conditional execution has failed.
27279 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
27281 PR middle-end/100284
27282 * gimple.c (gimple_could_trap_p_1): Remove VEC_COND_EXPR test.
27283 * tree-eh.c (operation_could_trap_p): Handle VEC_COND_EXPR rather
27284 than asserting on it.
27286 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
27288 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): Protect
27289 with TARGET_AIX_OS.
27291 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
27294 * calls.c (precompute_register_parameters): Additionally test
27295 targetm.precompute_tls_p to pre-compute argument.
27296 * config/rs6000/aix.h (TARGET_PRECOMPUTE_TLS_P): Define.
27297 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): New.
27298 * target.def (precompute_tls_p): New.
27299 * doc/tm.texi.in (TARGET_PRECOMPUTE_TLS_P): Add hook documentation.
27300 * doc/tm.texi: Regenerated.
27302 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27305 * config/aarch64/aarch64.c (aarch64_print_operand): Cast -UINTVAL
27306 back to HOST_WIDE_INT.
27308 2021-04-27 Bernd Edlinger <bernd.edlinger@hotmail.de>
27311 * simplify-rtx.c (simplify_context::simplify_subreg): Check the
27312 memory alignment for the outer mode.
27314 2021-04-27 H.J. Lu <hjl.tools@gmail.com>
27316 PR middle-end/90773
27317 * expr.c (op_by_pieces_d::get_usable_mode): New member function.
27318 (op_by_pieces_d::run): Change a while loop to a do-while loop.
27320 2021-04-27 Alex Coplan <alex.coplan@arm.com>
27323 * config/arm/arm.c (arm_split_compare_and_swap): Fix up codegen
27324 with negative immediates: ensure we expand cbranchsi4_scratch
27325 correctly and ensure we satisfy its constraints.
27326 * config/arm/sync.md
27327 (@atomic_compare_and_swap<CCSI:arch><NARROW:mode>_1): Don't
27328 attempt to tie two output operands together with constraints;
27329 collapse two alternatives.
27330 (@atomic_compare_and_swap<CCSI:arch><SIDI:mode>_1): Likewise.
27331 * config/arm/thumb1.md (cbranchsi4_neg_late): New.
27333 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27336 * config/aarch64/predicates.md (aarch64_sub_immediate,
27337 aarch64_plus_immediate): Use -UINTVAL instead of -INTVAL.
27338 * config/aarch64/aarch64.md (casesi, rotl<mode>3): Likewise.
27339 * config/aarch64/aarch64.c (aarch64_print_operand,
27340 aarch64_split_atomic_op, aarch64_expand_subvti): Likewise.
27342 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27344 PR tree-optimization/100239
27345 * tree-vect-generic.c (lower_vec_perm): Don't accept constant
27346 permutations with all indices from the first zero element as vec_shl.
27348 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27350 PR rtl-optimization/100254
27351 * cfgcleanup.c (outgoing_edges_match): Check REG_EH_REGION on
27352 last1 and last2 insns rather than BB_END (bb1) and BB_END (bb2) insns.
27354 2021-04-27 Richard Biener <rguenther@suse.de>
27356 PR tree-optimization/99912
27357 * passes.def: Add comment about new TODO_remove_unused_locals.
27358 * tree-stdarg.c (pass_data_stdarg): Run TODO_remove_unused_locals
27361 2021-04-27 Richard Biener <rguenther@suse.de>
27363 PR tree-optimization/99912
27364 * passes.def (pass_all_optimizations): Add pass_dse before
27365 the first pass_dce, move the first pass_dse before the
27366 pass_dce following pass_pre.
27368 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27370 PR tree-optimization/95527
27371 * generic-match-head.c: Include tm.h.
27372 * gimple-match-head.c: Include tm.h.
27373 * match.pd (CLZ == INTEGER_CST): Don't use
27374 #ifdef CLZ_DEFINED_VALUE_AT_ZERO, only test CLZ_DEFINED_VALUE_AT_ZERO
27375 if clz == CFN_CLZ. Add missing val declaration.
27376 (CTZ cmp CST): New simplifications.
27378 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27380 PR tree-optimization/96696
27381 * expr.c (expand_expr_divmod): New function.
27382 (expand_expr_real_2) <case TRUNC_DIV_EXPR>: Use it for truncations and
27383 divisions. Formatting fixes.
27384 <case MULT_EXPR>: Optimize x / y * y as x - x % y if the latter is
27387 2021-04-27 Martin Jambor <mjambor@suse.cz>
27390 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
27391 If removing a call statement LHS SSA name, release it.
27393 2021-04-27 Richard Earnshaw <rearnsha@arm.com>
27396 * config/arm/arm.c (THUMB2_WORK_REGS): Check PIC_OFFSET_TABLE_REGNUM
27397 is valid before including it in the mask.
27399 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
27402 * config/aarch64/aarch64.c (aarch64_comp_type_attributes): Handle
27405 2021-04-27 Richard Biener <rguenther@suse.de>
27407 PR tree-optimization/100051
27408 * tree-ssa-alias.c (indirect_ref_may_alias_decl_p): Add
27409 disambiguator based on access size vs. decl size.
27411 2021-04-27 Richard Biener <rguenther@suse.de>
27413 PR tree-optimization/100278
27414 * tree-ssa-pre.c (compute_avail): Give up when we cannot
27415 adjust TBAA because of mismatching bases.
27417 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27420 * config/i386/i386.md (*<insn><mode>3_mask, *<insn><mode>3_mask_1):
27421 For any_rotate define_insn_split and following splitters, use
27422 SWI iterator instead of SWI48.
27424 2021-04-27 Richard Biener <rguenther@suse.de>
27426 PR tree-optimization/99776
27427 * match.pd (bit_field_ref (ctor)): Relax element extract
27428 type compatibility checks.
27430 2021-04-27 Cui,Lili <lili.cui@intel.com>
27432 * common/config/i386/i386-common.c (processor_names):
27433 Sync processor_names with processor_type.
27434 * config/i386/i386-options.c (processor_cost_table):
27435 Sync processor_cost_table with processor_type.
27437 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27439 * value-range.cc (irange::irange_set_1bit_anti_range): Add assert.
27440 (irange::set): Call irange_set_1bit_anti_range for handling all
27441 1-bit ranges. Fall through on ~[MIN,MAX].
27443 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27445 * value-range.cc (irange::legacy_num_pairs): Remove.
27446 (irange::invert): Change gcc_assert to gcc_checking_assert.
27447 * value-range.h (irange::num_pairs): Adjust for a cached
27448 num_pairs(). Also, rename all gcc_assert's to
27449 gcc_checking_assert's.
27451 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27453 * value-range.cc (irange::operator=): Set m_kind.
27454 (irange::copy_to_legacy): Handle varying and undefined sources
27455 as a legacy copy since they can be easily copied.
27456 (irange::irange_set): Set m_kind.
27457 (irange::irange_set_anti_range): Same.
27458 (irange::set): Rename normalize_min_max to normalize_kind.
27459 (irange::verify_range): Adjust for multi-ranges having the
27461 (irange::irange_union): Set m_kind.
27462 (irange::irange_intersect): Same.
27463 (irange::invert): Same.
27464 * value-range.h (irange::kind): Always return m_kind.
27465 (irange::varying_p): Rename to...
27466 (irange::varying_compatible_p): ...this.
27467 (irange::undefined_p): Only look at m_kind.
27468 (irange::irange): Always set VR_UNDEFINED if applicable.
27469 (irange::set_undefined): Always set VR_UNDEFINED.
27470 (irange::set_varying): Always set m_kind to VR_VARYING.
27471 (irange::normalize_min_max): Rename to...
27472 (irange::normalize_kind): ...this.
27474 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27476 * gimple-ssa-evrp-analyze.c (evrp_range_analyzer::set_ssa_range_info):
27477 Adjust for constant_p including varying_p.
27478 * tree-vrp.c (vrp_prop::finalize): Same.
27479 (determine_value_range): Same.
27480 * vr-values.c (vr_values::range_of_expr): Same.
27481 * value-range.cc (irange::symbolic_p): Do not check varying_p.
27482 (irange::constant_p): Same.
27484 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27486 * value-range.cc (irange::legacy_lower_bound): Replace
27487 !undefined_p check with num_ranges > 0.
27488 (irange::legacy_upper_bound): Same.
27489 * value-range.h (irange::type): Same.
27490 (irange::lower_bound): Same.
27491 (irange::upper_bound): Same.
27493 2021-04-26 Richard Biener <rguenther@suse.de>
27495 PR tree-optimization/99956
27496 * gimple-loop-interchange.cc (compute_access_stride):
27497 Try instantiating the access in a shallower loop nest
27498 if instantiating failed.
27499 (compute_access_strides): Pass adjustable loop_nest
27500 to compute_access_stride.
27502 2021-04-26 Christophe Lyon <christophe.lyon@linaro.org>
27504 * doc/sourcebuild.texi (arm_cmse_hw): Document.
27506 2021-04-26 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27508 * config/aarch64/iterators.md (vwcore): Handle V4BF, V8BF.
27510 2021-04-26 Thomas Schwinge <thomas@codesourcery.com>
27511 Nathan Sidwell <nathan@codesourcery.com>
27512 Tom de Vries <vries@codesourcery.com>
27513 Julian Brown <julian@codesourcery.com>
27514 Kwok Cheung Yeung <kcy@codesourcery.com>
27516 * omp-offload.c (oacc_validate_dims): Implement
27517 '-Wopenacc-parallelism'.
27518 * doc/invoke.texi (-Wopenacc-parallelism): Document.
27520 2021-04-26 Richard Biener <rguenther@suse.de>
27522 * tree-cfg.h (gimplify_build1): Remove.
27523 (gimplify_build2): Likewise.
27524 (gimplify_build3): Likewise.
27525 * tree-cfg.c (gimplify_build1): Move to tree-vect-generic.c.
27526 (gimplify_build2): Likewise.
27527 (gimplify_build3): Likewise.
27528 * tree-vect-generic.c (gimplify_build1): Move from tree-cfg.c.
27530 (gimplify_build2): Likewise.
27531 (gimplify_build3): Likewise.
27532 (tree_vec_extract): Use resimplify with following SSA edges.
27533 (expand_vector_parallel): Avoid passing NULL size/bitpos
27534 to tree_vec_extract.
27535 * expr.c (store_constructor): Deal with zero-element CTORs.
27536 * match.pd (bit_field_ref <vector CTOR>): Make sure to
27537 produce vector constants when possible.
27539 2021-04-26 Richard Biener <rguenther@suse.de>
27541 * tree-complex.c: Include gimple-fold.h.
27542 (expand_complex_addition): Use gimple_build.
27543 (expand_complex_multiplication_components): Likewise.
27544 (expand_complex_multiplication): Likewise.
27545 (expand_complex_div_straight): Likewise.
27546 (expand_complex_div_wide): Likewise.
27547 (expand_complex_division): Likewise.
27548 (expand_complex_conjugate): Likewise.
27549 (expand_complex_comparison): Likewise.
27551 2021-04-26 Richard Biener <rguenther@suse.de>
27553 * tree-ssa-phiopt.c (two_value_replacement): Remove use
27554 of legacy gimplify_buildN API.
27556 2021-04-26 Richard Biener <rguenther@suse.de>
27558 PR tree-optimization/99473
27559 * tree-ssa-phiopt.c (cond_store_replacement): Handle all
27562 2021-04-26 Richard Biener <rguenther@suse.de>
27564 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin):
27565 Use replace_call_with_value.
27567 2021-04-26 Richard Biener <rguenther@suse.de>
27569 * tree-ssa-propagate.h (valid_gimple_rhs_p): Remove.
27570 (update_gimple_call): Likewise.
27571 (update_call_from_tree): Likewise.
27572 * tree-ssa-propagate.c (valid_gimple_rhs_p): Remove.
27573 (valid_gimple_call_p): Likewise.
27574 (move_ssa_defining_stmt_for_defs): Likewise.
27575 (finish_update_gimple_call): Likewise.
27576 (update_gimple_call): Likewise.
27577 (update_call_from_tree): Likewise.
27578 (propagate_tree_value_into_stmt): Use replace_call_with_value.
27579 * gimple-fold.h (update_gimple_call): Declare.
27580 * gimple-fold.c (valid_gimple_rhs_p): Move here from
27581 tree-ssa-propagate.c.
27582 (update_gimple_call): Likewise.
27583 (valid_gimple_call_p): Likewise.
27584 (finish_update_gimple_call): Likewise, and simplify.
27585 (gimplify_and_update_call_from_tree): Implement
27586 update_call_from_tree functionality, avoid excessive
27587 push/pop_gimplify_context.
27588 (gimple_fold_builtin): Use only gimplify_and_update_call_from_tree.
27589 (gimple_fold_call): Likewise.
27590 * gimple-ssa-sprintf.c (try_substitute_return_value): Likewise.
27591 * tree-ssa-ccp.c (ccp_folder::fold_stmt): Likewise.
27592 (pass_fold_builtins::execute): Likewise.
27593 (optimize_stack_restore): Use replace_call_with_value.
27594 * tree-cfg.c (fold_loop_internal_call): Likewise.
27595 * tree-ssa-dce.c (maybe_optimize_arith_overflow): Use
27596 only gimplify_and_update_call_from_tree.
27597 * tree-ssa-strlen.c (handle_builtin_strlen): Likewise.
27598 (handle_builtin_strchr): Likewise.
27599 * tsan.c: Include gimple-fold.h instead of tree-ssa-propagate.h.
27601 2021-04-26 Jakub Jelinek <jakub@redhat.com>
27604 * vmsdbgout.c (ASM_OUTPUT_DEBUG_STRING, vmsdbgout_begin_block,
27605 vmsdbgout_end_block, lookup_filename, vmsdbgout_source_line): Remove
27608 2021-04-25 liuhongt <hongtao.liu@intel.com>
27611 * config/i386/i386-builtin.def (BDESC): Change the icode of
27612 the following builtins to CODE_FOR_nothing.
27613 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
27614 IX86_BUILTIN_PCMPEQB128, IX86_BUILTIN_PCMPEQW128,
27615 IX86_BUILTIN_PCMPEQD128, IX86_BUILTIN_PCMPEQQ,
27616 IX86_BUILTIN_PCMPEQB256, IX86_BUILTIN_PCMPEQW256,
27617 IX86_BUILTIN_PCMPEQD256, IX86_BUILTIN_PCMPEQQ256,
27618 IX86_BUILTIN_PCMPGTB128, IX86_BUILTIN_PCMPGTW128,
27619 IX86_BUILTIN_PCMPGTD128, IX86_BUILTIN_PCMPGTQ,
27620 IX86_BUILTIN_PCMPGTB256, IX86_BUILTIN_PCMPGTW256,
27621 IX86_BUILTIN_PCMPGTD256, IX86_BUILTIN_PCMPGTQ256.
27622 * config/i386/sse.md (avx2_eq<mode>3): Deleted.
27623 (sse2_eq<mode>3): Ditto.
27624 (sse4_1_eqv2di3): Ditto.
27625 (sse2_gt<mode>3): Rename to ...
27626 (*sse2_gt<mode>3): ... this.
27628 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27631 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27634 * config/darwin.c (darwin_binds_local_p): Assume that any
27635 public symbol might be interposed for PIC code. Update function
27636 header comment to reflect current Darwin capability.
27638 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27641 * config/darwin.c (darwin_binds_local_p): Assume that any
27642 public symbol might be interposed for PIC code. Update function
27643 header comment to reflect current Darwin capability.
27645 2021-04-24 Richard Sandiford <richard.sandiford@arm.com>
27647 * doc/sourcebuild.texi: Document no-opts and any-opts target
27650 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
27652 * config/bpf/bpf.h (ASM_OUTPUT_ALIGNED_BSS): Use .type and .lcomm.
27654 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
27656 * config/bpf/bpf.h (FUNCTION_BOUNDARY): Set to 64.
27658 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
27661 * config/i386/i386-options.c (ix86_option_override_internal):
27662 Error out when -m96bit-long-double is used with 64-bit targets.
27663 * config/i386/i386.md (*pushxf_rounded): Remove pattern.
27665 2021-04-23 Martin Liska <mliska@suse.cz>
27667 * lto-wrapper.c: Remove FIXME about usage of
27668 hardware_concurrency. The function is not on par with
27671 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
27674 * config/i386/sync.md (FILD_ATOMIC/FIST_ATOMIC FP load peephole2):
27675 Copy operand 3 to operand 4. Use sse_reg_operand
27676 as operand 3 predicate.
27677 (FILD_ATOMIC/FIST_ATOMIC FP load peephole2 with mem blockage): Ditto.
27678 (LDX_ATOMIC/STX_ATOMIC FP load peephole2): Ditto.
27679 (LDX_ATOMIC/STX_ATOMIC FP load peephole2 with mem blockage): Ditto.
27680 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2):
27681 Copy operand 1 to operand 0.
27682 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2 with mem blockage): Ditto.
27683 (LDX_ATOMIC/STX_ATOMIC FP store peephole2): Ditto.
27684 (LDX_ATOMIC/STX_ATOMIC FP store peephole2 with mem blockage): Ditto.
27686 2021-04-23 Alex Coplan <alex.coplan@arm.com>
27688 PR rtl-optimization/100230
27689 * early-remat.c (early_remat::sort_candidates): Use delete[]
27690 instead of delete for array allocated with new[].
27692 2021-04-23 Richard Biener <rguenther@suse.de>
27694 * genmatch.c (lower_cond): Remove VEC_COND_EXPR special-casing.
27695 (capture_info::capture_info): Likewise.
27696 (capture_info::walk_match): Likewise.
27697 (expr::gen_transform): Likewise.
27698 (dt_simplify::gen_1): Likewise.
27699 * gimple-match-head.c (maybe_resimplify_conditional_op):
27700 Remove VEC_COND_EXPR special-casing.
27701 (gimple_simplify): Likewise.
27702 * gimple.c (gimple_could_trap_p_1): Adjust.
27703 * tree-ssa-pre.c (compute_avail): Allow VEC_COND_EXPR
27704 to participate in PRE.
27706 2021-04-23 Richard Biener <rguenther@suse.de>
27708 * cfganal.c (connect_infinite_loops_to_exit): First call
27709 add_noreturn_fake_exit_edges.
27710 * ipa-sra.c (process_scan_results): Do not call the now redundant
27711 add_noreturn_fake_exit_edges.
27712 * predict.c (tree_estimate_probability): Likewise.
27713 (rebuild_frequencies): Likewise.
27714 * store-motion.c (one_store_motion_pass): Likewise.
27716 2021-04-23 Richard Biener <rguenther@suse.de>
27718 PR tree-optimization/100222
27719 * predict.c (pass_profile::execute): Remove redundant call to
27720 mark_irreducible_loops.
27721 (report_predictor_hitrates): Likewise.
27723 2021-04-23 Richard Biener <rguenther@suse.de>
27725 * tree-ssa-loop-ivopts.c (rewrite_use_nonlinear_expr): Avoid
27726 valid_gimple_rhs_p by instead gimplifying to one.
27728 2021-04-23 Richard Biener <rguenther@suse.de>
27730 PR tree-optimization/99971
27731 * tree-vect-data-refs.c (vect_slp_analyze_node_dependences):
27732 Always use TBAA for loads.
27734 2021-04-23 liuhongt <hongtao.liu@intel.com>
27737 * config/i386/i386-options.c (ix86_option_override_internal):
27738 Clear MASK_AVX256_SPLIT_UNALIGNED_LOAD/STORE in x_target_flags
27739 when X86_TUNE_AVX256_UNALIGNED_LOAD/STORE_OPTIMAL is enabled
27740 by target attribute.
27742 2021-04-23 David Edelsohn <dje.gcc@gmail.com>
27744 * config/rs6000/aix71.h (PREFERRED_DEBUGGING_TYPE): Change to
27746 * config/rs6000/aix72.h (PREFERRED_DEBUGGING_TYPE): Same.
27748 2021-04-22 David Edelsohn <dje.gcc@gmail.com>
27750 * config.gcc (powerpc-ibm-aix6.*): Remove.
27751 * config/rs6000/aix61.h: Delete.
27753 2021-04-22 Martin Liska <mliska@suse.cz>
27755 PR testsuite/100159
27756 PR testsuite/100192
27757 * builtins.c (expand_builtin): Fix typos and missing comments.
27758 * dwarf2out.c (gen_subprogram_die): Likewise.
27759 (gen_struct_or_union_type_die): Likewise.
27761 2021-04-22 Uroš Bizjak <ubizjak@gmail.com>
27764 * config/i386/i386-expand.c (ix86_expand_convert_uns_sidf_sse):
27765 Remove the sign with FE_DOWNWARD, where x - x = -0.0.
27767 2021-04-21 Iain Sandoe <iain@sandoe.co.uk>
27769 * config/i386/darwin.h (TARGET_64BIT): Remove definition
27770 based on TARGET_ISA_64BIT.
27771 (TARGET_64BIT_P): Remove definition based on
27772 TARGET_ISA_64BIT_P().
27774 2021-04-21 Martin Liska <mliska@suse.cz>
27777 2021-04-21 Martin Liska <mliska@suse.cz>
27779 * lto-wrapper.c (cpuset_popcount): Remove.
27780 (init_num_threads): Remove and use hardware_concurrency.
27782 2021-04-21 Martin Liska <mliska@suse.cz>
27785 * main.c (main): Call toplev::finalize in CHECKING_P mode.
27786 * ipa-modref.c (ipa_modref_c_finalize): summaries are NULL
27787 when incremental LTO linking happens.
27789 2021-04-21 Martin Liska <mliska@suse.cz>
27791 * lto-wrapper.c (run_gcc): When -flto=jobserver is used, but the
27792 make jobserver cannot be detected, then use -flto=N fallback.
27794 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
27796 * acinclude.m4 (gcc_AC_INITFINI_ARRAY): When cross-compiling,
27797 default to yes for aarch64-linux-gnu.
27798 * configure: Regenerate.
27800 2021-04-21 Martin Liska <mliska@suse.cz>
27802 * lto-wrapper.c (cpuset_popcount): Remove.
27803 (init_num_threads): Remove and use hardware_concurrency.
27805 2021-04-21 Martin Liska <mliska@suse.cz>
27807 * config/i386/i386.c: Remove superfluous '|| TARGET_MACHO',
27808 which expands to '(... || 0)' and makes clang complain.
27809 * dwarf2out.c (AT_vms_delta): Declare conditionally.
27810 (add_AT_vms_delta): Likewise.
27811 * tree.c (fld_simplified_type): Use the more common pattern
27812 for disabling code (#if 0).
27813 (get_tree_code_name): Likewise.
27814 (verify_type_variant): Likewise.
27816 2021-04-21 Martin Liska <mliska@suse.cz>
27818 * config/i386/i386-expand.c (decide_alignment): Use newly named
27819 macro TARGET_CPU_P.
27820 * config/i386/i386.c (ix86_decompose_address): Likewise.
27821 (ix86_address_cost): Likewise.
27822 (ix86_lea_outperforms): Likewise.
27823 (ix86_avoid_lea_for_addr): Likewise.
27824 (ix86_add_stmt_cost): Likewise.
27825 * config/i386/i386.h (TARGET_*): Remove.
27826 (TARGET_CPU_P): New macro.
27827 * config/i386/i386.md: Use newly named macro TARGET_CPU_P.
27828 * config/i386/x86-tune-sched-atom.c (do_reorder_for_imul): Likewise.
27829 (swap_top_of_ready_list): Likewise.
27830 (ix86_atom_sched_reorder): Likewise.
27831 * config/i386/x86-tune-sched-bd.c (ix86_bd_has_dispatch): Likewise.
27832 * config/i386/x86-tune-sched.c (ix86_adjust_cost): Likewise.
27834 2021-04-21 Martin Liska <mliska@suse.cz>
27836 * config/i386/i386-options.c (TARGET_EXPLICIT_NO_SAHF_P):
27838 (SET_TARGET_NO_SAHF): Likewise.
27839 (TARGET_EXPLICIT_PREFETCH_SSE_P): Likewise.
27840 (SET_TARGET_PREFETCH_SSE): Likewise.
27841 (TARGET_EXPLICIT_NO_TUNE_P): Likewise.
27842 (SET_TARGET_NO_TUNE): Likewise.
27843 (TARGET_EXPLICIT_NO_80387_P): Likewise.
27844 (SET_TARGET_NO_80387): Likewise.
27846 * config/i386/i386.h (TARGET_*): Remove.
27847 * opth-gen.awk: Generate newly used macros.
27849 2021-04-21 Martin Liska <mliska@suse.cz>
27851 * config/i386/i386.h (PTA_*): Remove.
27852 (enum pta_flag): New.
27853 (DEF_PTA): Generate PTA_* values from i386-isa.def.
27854 * config/i386/i386-isa.def: New file.
27856 2021-04-21 Alex Coplan <alex.coplan@arm.com>
27859 * config/aarch64/aarch64-bti-insert.c (aarch64_bti_j_insn_p): New.
27860 (rest_of_insert_bti): Avoid inserting duplicate bti j insns for
27861 jump table targets.
27863 2021-04-21 H.J. Lu <hjl.tools@gmail.com>
27865 * config.gcc: Install mwaitintrin.h for i[34567]86-*-* and
27866 x86_64-*-* targets.
27867 * common/config/i386/i386-common.c (OPTION_MASK_ISA2_MWAIT_SET):
27869 (OPTION_MASK_ISA2_MWAIT_UNSET): Likewise.
27870 (ix86_handle_option): Handle -mmwait.
27871 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
27872 Replace OPTION_MASK_ISA_SSE3 with OPTION_MASK_ISA2_MWAIT on
27873 __builtin_ia32_monitor and __builtin_ia32_mwait.
27874 * config/i386/i386-options.c (isa2_opts): Add -mmwait.
27875 (ix86_valid_target_attribute_inner_p): Likewise.
27876 (ix86_option_override_internal): Enable mwait/monitor
27877 instructions for -msse3.
27878 * config/i386/i386.h (TARGET_MWAIT): New.
27879 (TARGET_MWAIT_P): Likewise.
27880 * config/i386/i386.opt: Add -mmwait.
27881 * config/i386/mwaitintrin.h: New file.
27882 * config/i386/pmmintrin.h: Include <mwaitintrin.h>.
27883 * config/i386/sse.md (sse3_mwait): Replace TARGET_SSE3 with
27885 (@sse3_monitor_<mode>): Likewise.
27886 * config/i386/x86gprintrin.h: Include <mwaitintrin.h>.
27887 * doc/extend.texi: Document mwait target attribute.
27888 * doc/invoke.texi: Document -mmwait.
27890 2021-04-21 Martin Liska <mliska@suse.cz>
27892 * config/i386/i386-options.c (DEF_ENUM): Remove it.
27893 * config/i386/i386-opts.h (DEF_ENUM): Likewise.
27894 * config/i386/stringop.def (DEF_ENUM): Likewise.
27896 2021-04-21 Martin Liska <mliska@suse.cz>
27898 * tree-cfg.c (gimple_verify_flow_info): Use qD instead
27899 of print_generic_expr.
27901 2021-04-21 Jakub Jelinek <jakub@redhat.com>
27903 PR rtl-optimization/100148
27904 * cprop.c (constprop_register): Use next_nondebug_insn instead of
27907 2021-04-21 Martin Liska <mliska@suse.cz>
27910 * cgraphunit.c (cgraph_node::analyze): Remove duplicate
27911 free_dominance_info calls.
27913 2021-04-21 Richard Biener <rguenther@suse.de>
27915 * gimple-fold.c (maybe_fold_reference): Remove is_lhs
27916 parameter (and assume it to be false).
27917 (fold_gimple_assign): Adjust, remove all calls to
27918 maybe_fold_reference that pass is_lhs as true.
27919 (gimple_fold_call): Likewise.
27920 (fold_stmt_1): Likewise.
27922 2021-04-21 Richard Biener <rguenther@suse.de>
27924 * fold-const.c (pedantic_non_lvalue_loc): Remove.
27925 (fold_binary_loc): Adjust.
27926 (fold_ternary_loc): Likewise.
27928 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
27930 PR middle-end/100130
27931 * varasm.c (get_block_for_decl): Make sure that any use of the
27932 retain attribute matches the section's retain flag.
27933 (switch_to_section): Check for retain mismatches even when
27934 changing sections, but do not warn if the given decl is the
27935 section's named.decl.
27936 (output_object_block): Pass the first decl in the block (if any)
27937 to switch_to_section.
27939 2021-04-20 H.J. Lu <hjl.tools@gmail.com>
27941 * config/i386/i386-c.c (ix86_target_macros_internal): Define
27942 __CRC32__ for -mcrc32.
27943 * config/i386/i386-options.c (ix86_option_override_internal):
27944 Enable crc32 instruction for -msse4.2.
27945 * config/i386/i386.md (sse4_2_crc32<mode>): Remove TARGET_SSE4_2
27947 (sse4_2_crc32di): Likewise.
27948 * config/i386/ia32intrin.h: Use crc32 target option for CRC32
27951 2021-04-20 Segher Boessenkool <segher@kernel.crashing.org>
27954 * config/rs6000/rs6000.c (rs6000_machine_from_flags): Do not consider
27957 2021-04-20 Martin Liska <mliska@suse.cz>
27959 * doc/invoke.texi: Fix typo.
27960 * params.opt: Likewise.
27962 2021-04-20 Martin Liska <mliska@suse.cz>
27964 * doc/invoke.texi: Document new param.
27966 2021-04-19 Andrew MacLeod <amacleod@redhat.com>
27968 PR tree-optimization/100081
27969 * gimple-range-cache.h (ranger_cache): Inherit from gori_compute
27970 rather than gori_compute_cache.
27971 * gimple-range-gori.cc (is_gimple_logical_p): Move to top of file.
27972 (range_def_chain::m_logical_depth): New member.
27973 (range_def_chain::range_def_chain): Initialize m_logical_depth.
27974 (range_def_chain::get_def_chain): Don't build defchains through more
27975 than LOGICAL_LIMIT logical expressions.
27976 * params.opt (param_ranger_logical_depth): New.
27978 2021-04-19 Richard Earnshaw <rearnsha@arm.com>
27981 * config/arm/arm.c (arm_configure_build_target): Do not strip
27982 extended FPU/SIMD feature bits from the target ISA when -mfpu
27983 is specified (partial revert of r11-8168).
27985 2021-04-19 Thomas Schwinge <thomas@codesourcery.com>
27987 * params.opt (-param=openacc-kernels=): Add.
27988 * omp-oacc-kernels-decompose.cc
27989 (pass_omp_oacc_kernels_decompose::gate): Use it.
27990 * doc/invoke.texi (-fopenacc-kernels=@var{mode}): Move...
27991 (--param): ... here, 'openacc-kernels'.
27993 2021-04-19 Martin Liska <mliska@suse.cz>
27996 * gengtype.c (finish_root_table): Align function arguments
27997 in between declaration and definition.
27999 2021-04-19 Eric Botcazou <ebotcazou@adacore.com>
28001 * config/i386/winnt.c (i386_pe_seh_cold_init): Properly deal with
28002 frames larger than the SEH maximum frame size.
28004 2021-04-18 Segher Boessenkool <segher@kernel.crashing.org>
28006 PR rtl-optimization/99927
28007 * combine.c (distribute_notes) [REG_UNUSED]: If the register is
28008 already dead, just drop it.
28010 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
28013 * config/i386/winnt-d.c (TARGET_D_TEMPLATES_ALWAYS_COMDAT): Define.
28014 * doc/tm.texi: Regenerate.
28015 * doc/tm.texi.in (D language and ABI): Add @hook for
28016 TARGET_D_TEMPLATES_ALWAYS_COMDAT.
28018 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
28020 * config/darwin-d.c (darwin_d_handle_target_object_format): New
28022 (darwin_d_register_target_info): New function.
28023 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28024 * config/dragonfly-d.c (dragonfly_d_handle_target_object_format): New
28026 (dragonfly_d_register_target_info): New function.
28027 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28028 * config/freebsd-d.c (freebsd_d_handle_target_object_format): New
28030 (freebsd_d_register_target_info): New function.
28031 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28032 * config/glibc-d.c (glibc_d_handle_target_object_format): New
28034 (glibc_d_register_target_info): New function.
28035 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28036 * config/i386/i386-d.c (ix86_d_handle_target_object_format): New
28038 (ix86_d_register_target_info): Add ix86_d_handle_target_object_format
28039 as handler for objectFormat key.
28040 * config/i386/winnt-d.c (winnt_d_handle_target_object_format): New
28042 (winnt_d_register_target_info): New function.
28043 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28044 * config/netbsd-d.c (netbsd_d_handle_target_object_format): New
28046 (netbsd_d_register_target_info): New function.
28047 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28048 * config/openbsd-d.c (openbsd_d_handle_target_object_format): New
28050 (openbsd_d_register_target_info): New function.
28051 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28052 * config/pa/pa-d.c (pa_d_handle_target_object_format): New function.
28053 (pa_d_register_target_info): Add pa_d_handle_target_object_format as
28054 handler for objectFormat key.
28055 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_object_format): New
28057 (rs6000_d_register_target_info): Add
28058 rs6000_d_handle_target_object_format as handler for objectFormat key.
28059 * config/sol2-d.c (solaris_d_handle_target_object_format): New
28061 (solaris_d_register_target_info): New function.
28062 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28064 2021-04-16 Jakub Jelinek <jakub@redhat.com>
28067 * config/aarch64/aarch64.c (aarch64_function_arg_alignment): Change
28068 abi_break argument from bool * to unsigned *, store there the pre-GCC 9
28070 (aarch64_layout_arg, aarch64_gimplify_va_arg_expr): Adjust callers.
28071 (aarch64_function_arg_regno_p): Likewise. Only emit -Wpsabi note if
28072 the old and new alignments differ after applying MIN/MAX to them.
28074 2021-04-16 Tamar Christina <tamar.christina@arm.com>
28077 * config/aarch64/aarch64-sve.md (@aarch64_sve_trn1_conv<mode>): New.
28078 * config/aarch64/aarch64.c (aarch64_expand_sve_const_pred_trn): Use new
28080 * config/aarch64/iterators.md (UNSPEC_TRN1_CONV): New.
28082 2021-04-16 Bill Schmidt <wschmidt@linux.ibm.com>
28084 * doc/extend.texi (PowerPC AltiVec/VSX Built-in Functions): Revise
28085 this section and its subsections.
28087 2021-04-16 Jakub Jelinek <jakub@redhat.com>
28090 * config/aarch64/aarch64.md (*neg_asr_si2_extr, *extrsi5_insn_di): New
28091 define_insn patterns.
28093 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
28095 PR rtl-optimization/98689
28096 * reg-notes.def (UNTYPED_CALL): New note.
28097 * combine.c (distribute_notes): Handle it.
28098 * emit-rtl.c (try_split): Likewise.
28099 * rtlanal.c (rtx_properties::try_to_add_insn): Likewise. Assume
28100 that calls with the note implicitly set all return value registers.
28101 * builtins.c (expand_builtin_apply): Add a REG_UNTYPED_CALL
28104 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
28106 PR rtl-optimization/99596
28107 * rtlanal.c (rtx_properties::try_to_add_insn): Don't add global
28108 register accesses for const calls. Assume that pure functions
28109 can only read from global registers. Ignore cases in which
28110 the stack pointer has been marked global.
28112 2021-04-16 Jakub Jelinek <jakub@redhat.com>
28115 * tree-vect-loop.c (vect_transform_loop): Don't remove just
28116 dead scalar .MASK_LOAD calls, but also dead .COND_* calls - replace
28117 them by their last argument.
28119 2021-04-15 Martin Liska <mliska@suse.cz>
28121 * doc/invoke.texi: Other params don't use it, remove it.
28123 2021-04-15 Richard Biener <rguenther@suse.de>
28125 * gimple-builder.h: Add deprecation note.
28127 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
28130 * attribs.h (restrict_type_identity_attributes_to): Declare.
28131 * attribs.c (restrict_type_identity_attributes_to): New function.
28133 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
28136 * attribs.h (affects_type_identity_attributes): Declare.
28137 * attribs.c (remove_attributes_matching): New function.
28138 (affects_type_identity_attributes): Likewise.
28140 2021-04-15 Jakub Jelinek <jakub@redhat.com>
28143 * config/aarch64/aarch64.md (*<LOGICAL:optab>_<SHIFT:optab><mode>3):
28144 Add combine splitters for *<LOGICAL:optab>_ashl<mode>3 with
28145 ZERO_EXTEND, SIGN_EXTEND or AND.
28147 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
28149 PR rtl-optimization/99929
28150 * rtl.h (same_vector_encodings_p): New function.
28151 * cse.c (exp_equiv_p): Check that CONST_VECTORs have the same encoding.
28152 * cselib.c (rtx_equal_for_cselib_1): Likewise.
28153 * jump.c (rtx_renumbered_equal_p): Likewise.
28154 * lra-constraints.c (operands_match_p): Likewise.
28155 * reload.c (operands_match_p): Likewise.
28156 * rtl.c (rtx_equal_p_cb, rtx_equal_p): Likewise.
28158 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
28160 * print-rtl.c (rtx_writer::print_rtx_operand_codes_E_and_V): Print
28161 more information about variable-length CONST_VECTORs.
28163 2021-04-14 Vladimir N. Makarov <vmakarov@redhat.com>
28165 PR rtl-optimization/100066
28166 * lra-constraints.c (split_reg): Check paradoxical_subreg_p for
28167 ordered modes when choosing splitting mode for hard reg.
28169 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
28172 * config/aarch64/aarch64.c (aarch64_expand_sve_const_vector_sel):
28174 (aarch64_expand_sve_const_vector): Use it for nelts_per_pattern==2.
28176 2021-04-14 Andreas Krebbel <krebbel@linux.ibm.com>
28178 * config/s390/s390-builtins.def (O_M5, O_M12, ...): Add new macros
28179 for mask operand types.
28180 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
28181 (s390_vec_permi_dbl, s390_vpdi): Use the M5 type for the immediate
28183 (s390_vec_msum_u128, s390_vmslg): Use the M12 type for the
28185 * config/s390/s390.c (s390_const_operand_ok): Check the new
28186 operand types and generate a list of valid values.
28188 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
28190 * doc/tm.texi: Regenerate.
28191 * doc/tm.texi.in (D language and ABI): Add @hook for
28192 TARGET_D_REGISTER_OS_TARGET_INFO.
28194 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
28196 * config/aarch64/aarch64-d.c (aarch64_d_handle_target_float_abi): New
28198 (aarch64_d_register_target_info): New function.
28199 * config/aarch64/aarch64-protos.h (aarch64_d_register_target_info):
28201 * config/aarch64/aarch64.h (TARGET_D_REGISTER_CPU_TARGET_INFO):
28203 * config/arm/arm-d.c (arm_d_handle_target_float_abi): New function.
28204 (arm_d_register_target_info): New function.
28205 * config/arm/arm-protos.h (arm_d_register_target_info): Declare.
28206 * config/arm/arm.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28207 * config/i386/i386-d.c (ix86_d_handle_target_float_abi): New function.
28208 (ix86_d_register_target_info): New function.
28209 * config/i386/i386-protos.h (ix86_d_register_target_info): Declare.
28210 * config/i386/i386.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28211 * config/mips/mips-d.c (mips_d_handle_target_float_abi): New function.
28212 (mips_d_register_target_info): New function.
28213 * config/mips/mips-protos.h (mips_d_register_target_info): Declare.
28214 * config/mips/mips.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28215 * config/pa/pa-d.c (pa_d_handle_target_float_abi): New function.
28216 (pa_d_register_target_info): New function.
28217 * config/pa/pa-protos.h (pa_d_register_target_info): Declare.
28218 * config/pa/pa.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28219 * config/riscv/riscv-d.c (riscv_d_handle_target_float_abi): New
28221 (riscv_d_register_target_info): New function.
28222 * config/riscv/riscv-protos.h (riscv_d_register_target_info): Declare.
28223 * config/riscv/riscv.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28224 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_float_abi): New
28226 (rs6000_d_register_target_info): New function.
28227 * config/rs6000/rs6000-protos.h (rs6000_d_register_target_info):
28229 * config/rs6000/rs6000.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28230 * config/s390/s390-d.c (s390_d_handle_target_float_abi): New function.
28231 (s390_d_register_target_info): New function.
28232 * config/s390/s390-protos.h (s390_d_register_target_info): Declare.
28233 * config/s390/s390.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28234 * config/sparc/sparc-d.c (sparc_d_handle_target_float_abi): New
28236 (sparc_d_register_target_info): New function.
28237 * config/sparc/sparc-protos.h (sparc_d_register_target_info): Declare.
28238 * config/sparc/sparc.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28239 * doc/tm.texi: Regenerate.
28240 * doc/tm.texi.in (D language and ABI): Add @hook for
28241 TARGET_D_REGISTER_CPU_TARGET_INFO.
28243 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
28245 * config/i386/i386-d.c (ix86_d_has_stdcall_convention): New function.
28246 * config/i386/i386-protos.h (ix86_d_has_stdcall_convention): Declare.
28247 * config/i386/i386.h (TARGET_D_HAS_STDCALL_CONVENTION): Define.
28248 * doc/tm.texi: Regenerate.
28249 * doc/tm.texi.in (D language and ABI): Add @hook for
28250 TARGET_D_HAS_STDCALL_CONVENTION.
28252 2021-04-14 Richard Biener <rguenther@suse.de>
28254 * tree-cfg.c (verify_gimple_assign_ternary): Verify that
28255 VEC_COND_EXPRs have a gimple_val condition.
28256 * tree-ssa-propagate.c (valid_gimple_rhs_p): VEC_COND_EXPR
28257 can no longer have a GENERIC condition.
28259 2021-04-14 Richard Earnshaw <rearnsha@arm.com>
28262 * config/arm/arm.c (arm_configure_build_target): Strip isa_all_fpbits
28263 from the isa_delta when -mfpu has been used.
28264 (arm_options_perform_arch_sanity_checks): It's the architecture that
28265 lacks an FPU, not the processor.
28267 2021-04-13 Richard Biener <rguenther@suse.de>
28269 PR tree-optimization/100053
28270 * tree-ssa-sccvn.c (vn_nary_op_get_predicated_value): Do
28271 not use optimistic dominance queries for backedges to validate
28273 (dominated_by_p_w_unex): Add parameter to ignore executable
28274 state on backedges.
28275 (rpo_elim::eliminate_avail): Adjust.
28277 2021-04-13 Jakub Jelinek <jakub@redhat.com>
28280 * config/aarch64/aarch64.md (*aarch64_bfxil<mode>_extr,
28281 *aarch64_bfxilsi_extrdi): New define_insn patterns.
28283 2021-04-13 Jakub Jelinek <jakub@redhat.com>
28286 * simplify-rtx.c (simplify_immed_subreg): For MODE_COMPOSITE_P
28287 outermode, return NULL if the result doesn't encode back to the
28288 original byte sequence.
28289 (simplify_gen_subreg): Don't create SUBREGs from constants to
28290 MODE_COMPOSITE_P outermode.
28292 2021-04-12 Jakub Jelinek <jakub@redhat.com>
28294 PR rtl-optimization/99905
28295 * combine.c (expand_compound_operation): If pos + len > modewidth,
28296 perform the right shift by pos in inner_mode and then convert to mode,
28297 instead of trying to simplify a shift of rtx with inner_mode by pos
28298 as if it was a shift in mode.
28300 2021-04-12 Jakub Jelinek <jakub@redhat.com>
28303 * combine.c (simplify_and_const_int_1): Don't optimize varop
28304 away if it has side-effects.
28306 2021-04-12 Martin Liska <mliska@suse.cz>
28308 * doc/extend.texi: Escape @smallexample content.
28310 2021-04-12 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
28312 * config/s390/s390.md ("*movdi_31", "*movdi_64"): Add
28313 alternative in order to load a DFP zero.
28315 2021-04-12 Martin Liska <mliska@suse.cz>
28317 * doc/extend.texi: Be more precise in documentation
28318 of symver attribute.
28320 2021-04-12 Martin Liska <mliska@suse.cz>
28323 * gimplify.c (gimplify_expr): Right now, we unpoison all
28324 variables before a goto <dest>. We should not do it if we are
28327 2021-04-12 Cui,Lili <lili.cui@intel.com>
28329 * common/config/i386/cpuinfo.h (get_intel_cpu): Handle
28331 * common/config/i386/i386-common.c (processor_names): Add
28333 (processor_alias_table): Add rocketlake.
28334 * common/config/i386/i386-cpuinfo.h (processor_subtypes): Add
28335 INTEL_COREI7_ROCKETLAKE.
28336 * config.gcc: Add -march=rocketlake.
28337 * config/i386/i386-c.c (ix86_target_macros_internal): Handle
28339 * config/i386/i386-options.c (m_ROCKETLAKE): Define.
28340 (processor_cost_table): Add rocketlake cost.
28341 * config/i386/i386.h (ix86_size_cost): Define
28343 (processor_type): Add PROCESSOR_ROCKETLAKE.
28344 (PTA_ROCKETLAKE): Ditto.
28345 * doc/extend.texi: Add rocketlake.
28346 * doc/invoke.texi: Add rocketlake.
28348 2021-04-12 Cui,Lili <lili.cui@intel.com>
28350 * config/i386/i386.h (PTA_ALDERLAKE): Change alderlake ISA list.
28351 * config/i386/i386-options.c (m_CORE_AVX2): Add m_ALDERLAKE.
28352 * common/config/i386/cpuinfo.h (get_intel_cpu): Add AlderLake model.
28353 * doc/invoke.texi: Change alderlake ISA list.
28355 2021-04-11 Hafiz Abid Qadeer <abidh@codesourcery.com>
28357 PR middle-end/98088
28358 * omp-expand.c (expand_oacc_collapse_init): Update condition in
28361 2021-04-10 H.J. Lu <hjl.tools@gmail.com>
28364 * config/i386/serializeintrin.h (_serialize): Defined as macro.
28366 2021-04-10 Jakub Jelinek <jakub@redhat.com>
28369 * expr.c (expand_expr_addr_expr_1): Test is_global_var rather than
28370 just TREE_STATIC on COMPOUND_LITERAL_EXPR_DECLs.
28372 2021-04-10 Jakub Jelinek <jakub@redhat.com>
28374 PR middle-end/99989
28375 * gimple-ssa-warn-alloca.c
28376 (alloca_type_and_limit::alloca_type_and_limit): Initialize limit to
28377 0 with integer precision unconditionally.
28379 2021-04-10 Jakub Jelinek <jakub@redhat.com>
28381 PR rtl-optimization/98601
28382 * rtlanal.c (rtx_addr_can_trap_p_1): In the assert, allow unknown size
28383 not just for BLKmode, but also for VOIDmode. For STRICT_ALIGNMENT
28384 unaligned_mems, handle VOIDmode like BLKmode.
28386 2021-04-10 Jan Hubicka <hubicka@ucw.cz>
28389 * tree.c (free_lang_data_in_decl): Do not release body of
28390 declare_variant_alt.
28392 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
28394 * config/aarch64/aarch64.c (aarch64_option_restore): If the
28395 architecture was specified explicitly and the tuning wasn't,
28396 tune for the architecture rather than the configured default CPU.
28398 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
28400 * config/aarch64/aarch64.md (tlsdesc_small_sve_<mode>): Use X30
28401 as the temporary register.
28403 2021-04-09 Martin Liska <mliska@suse.cz>
28405 * doc/extend.texi: Move non-target attributes to the top level.
28407 2021-04-09 Martin Liska <mliska@suse.cz>
28409 * doc/invoke.texi: Document minimum and maximum value of the
28410 argument for both supported compression algorithms.
28412 2021-04-08 David Edelsohn <dje.gcc@gmail.com>
28414 * config/rs6000/rs6000.c (rs6000_xcoff_select_section): Select
28415 TLS BSS before TLS data.
28416 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .comm.
28418 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
28420 * doc/sourcebuild.texi (stdint_types_mbig_endian): Document.
28422 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
28424 * match.pd: Extend vec_cond folds to handle shifts.
28426 2021-04-08 Maciej W. Rozycki <macro@orcam.me.uk>
28428 * config/vax/vax.md: Fix comment for `*bit<mode>' pattern's
28431 2021-04-08 Alex Coplan <alex.coplan@arm.com>
28434 * config/arm/iterators.md (MVE_vecs): New.
28435 (V_elem): Also handle V2DF.
28436 * config/arm/mve.md (*mve_mov<mode>): Rename to ...
28437 (*mve_vdup<mode>): ... this. Remove second alternative since
28438 vec_duplicate of const_int is not canonical RTL, and we don't
28439 want to match symbol_refs.
28440 (*mve_vec_duplicate<mode>): Delete (pattern is redundant).
28442 2021-04-08 Xionghu Luo <luoxhu@linux.ibm.com>
28444 * fold-const.c (fold_single_bit_test): Fix typo.
28445 * print-rtl.c (print_rtx_insn_vec): Call print_rtl_single
28448 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
28450 PR tree-optimization/97513
28451 * tree-vect-slp.c (vect_add_slp_permutation): New function,
28453 (vectorizable_slp_permutation): ...here. Detect cases in which
28454 all VEC_PERM_EXPRs are guaranteed to have the same stepped
28455 permute vector and only generate one permute vector for that case.
28456 Extend that case to handle variable-length vectors.
28458 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
28460 PR tree-optimization/99873
28461 * tree-vect-slp.c (vect_slp_prefer_store_lanes_p): New function.
28462 (vect_build_slp_instance): Don't split store groups that could
28463 use IFN_STORE_LANES.
28465 2021-04-07 Jakub Jelinek <jakub@redhat.com>
28468 * varasm.c (output_constant_pool_contents): Don't strip name encoding
28469 from XSTR (desc->sym, 0) or from label before passing those to
28472 2021-04-07 Richard Biener <rguenther@suse.de>
28474 PR tree-optimization/99954
28475 * tree-loop-distribution.c: Include tree-affine.h.
28476 (generate_memcpy_builtin): Try using tree-affine to prove
28478 (loop_distribution::classify_builtin_ldst): Always classify
28481 2021-04-07 Richard Biener <rguenther@suse.de>
28483 PR tree-optimization/99947
28484 * tree-vect-loop.c (vectorizable_induction): Pre-allocate
28485 steps vector to avoid pushing elements from the reallocated
28488 2021-04-07 Richard Biener <rguenther@suse.de>
28490 * tree-ssa-sccvn.h (print_vn_reference_ops): Declare.
28491 * tree-ssa-pre.c (print_pre_expr): Factor out VN reference operand
28493 * tree-ssa-sccvn.c (print_vn_reference_ops): ... into this new
28495 (debug_vn_reference_ops): New.
28497 2021-04-07 Bin Cheng <bin.cheng@linux.alibaba.com>
28499 PR tree-optimization/98736
28500 * tree-loop-distribution.c
28501 (loop_distribution::bb_top_order_init):
28502 Compute RPO with programming order preserved by calling function
28503 rev_post_order_and_mark_dfs_back_seme.
28505 2021-04-06 Vladimir N. Makarov <vmakarov@redhat.com>
28508 * lra-constraints.c (split_reg): Don't check paradoxical_subreg_p.
28509 * lra-lives.c (clear_sparseset_regnos, regnos_in_sparseset_p): New
28511 (process_bb_lives): Don't update biggest mode of hard reg for
28512 implicit in multi-register group. Use the new functions for
28513 updating dead_set and unused_set by register notes.
28515 2021-04-06 Xianmiao Qu <xianmiao_qu@c-sky.com>
28517 * config/csky/csky_pipeline_ck802.md: Use insn reservation name
28520 2021-04-06 H.J. Lu <hjl.tools@gmail.com>
28522 * config/i386/x86-tune-costs.h (skylake_memcpy): Updated.
28523 (skylake_memset): Likewise.
28524 (skylake_cost): Change CLEAR_RATIO to 17.
28525 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
28526 Replace m_CANNONLAKE, m_ICELAKE_CLIENT, m_ICELAKE_SERVER,
28527 m_TIGERLAKE and m_SAPPHIRERAPIDS with m_SKYLAKE and m_CORE_AVX512.
28529 2021-04-06 Richard Biener <rguenther@suse.de>
28531 PR tree-optimization/99880
28532 * tree-vect-loop.c (maybe_set_vectorized_backedge_value): Only
28533 set vectorized defs of relevant PHIs.
28535 2021-04-06 Richard Biener <rguenther@suse.de>
28537 PR tree-optimization/99924
28538 * tree-vect-slp.c (vect_bb_partition_graph_r): Do not mark
28539 nodes without scalar stmts as visited.
28541 2021-04-06 Alex Coplan <alex.coplan@arm.com>
28544 * config/arm/arm.c (arm_libcall_uses_aapcs_base): Also use base
28545 PCS for [su]fix_optab.
28547 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
28549 * config/darwin.c (machopic_legitimize_pic_address): Check
28550 that the current pic register is one of the hard reg set
28551 before setting liveness.
28553 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
28555 * config/darwin.c (machopic_legitimize_pic_address): Fix
28556 whitespace, remove unused code.
28558 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28560 PR tree-optimization/99882
28561 * gimple-ssa-store-merging.c (bswap_view_convert): Handle val with
28564 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28566 PR rtl-optimization/99863
28567 * dse.c (replace_read): Drop regs_live argument. Instead of
28568 regs_live, use store_insn->fixed_regs_live if non-NULL,
28569 otherwise punt if insns sequence clobbers or sets any hard
28572 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28575 * targhooks.h (default_print_patchable_function_entry_1): Declare.
28576 * targhooks.c (default_print_patchable_function_entry_1): New function,
28577 copied from default_print_patchable_function_entry with an added flags
28579 (default_print_patchable_function_entry): Rewritten into a small
28580 wrapper around default_print_patchable_function_entry_1.
28581 * config/rs6000/rs6000.c (TARGET_ASM_PRINT_PATCHABLE_FUNCTION_ENTRY):
28583 (rs6000_print_patchable_function_entry): New function.
28585 2021-04-02 Eric Botcazou <ebotcazou@adacore.com>
28587 * doc/invoke.texi (fdelete-dead-exceptions): Minor tweak.
28589 2021-04-01 Jason Merrill <jason@redhat.com>
28592 * common.opt: Document v15 and v16.
28594 2021-04-01 Richard Biener <rguenther@suse.de>
28596 PR tree-optimization/99863
28597 * gimplify.c (gimplify_init_constructor): Recompute vector
28600 2021-04-01 Jakub Jelinek <jakub@redhat.com>
28602 * doc/extend.texi (symver attribute): Fix up syntax errors
28605 2021-04-01 Jakub Jelinek <jakub@redhat.com>
28607 PR tree-optimization/96573
28608 * gimple-ssa-store-merging.c (init_symbolic_number): Handle
28609 also pointer types.
28611 2021-04-01 Richard Biener <rguenther@suse.de>
28613 PR tree-optimization/99856
28614 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Promote
28615 precision to vector element precision.
28617 2021-04-01 Martin Jambor <mjambor@suse.cz>
28619 PR tree-optimization/97009
28620 * tree-sra.c (access_or_its_child_written): New function.
28621 (propagate_subaccesses_from_rhs): Use it instead of a simple grp_write
28624 2021-03-31 Jan Hubicka <hubicka@ucw.cz>
28627 * cif-code.def (USES_COMDAT_LOCAL): Make CIF_FINAL_NORMAL.
28629 2021-03-31 Pat Haugen <pthaugen@linux.ibm.com>
28632 * config/rs6000/altivec.md (xxspltiw_v4si, xxspltiw_v4sf_inst,
28633 xxspltidp_v2df_inst, xxsplti32dx_v4si_inst, xxsplti32dx_v4sf_inst,
28634 xxblend_<mode>, xxpermx_inst, xxeval): Mark prefixed.
28635 * config/rs6000/mma.md (mma_<vvi4i4i8>, mma_<avvi4i4i8>,
28636 mma_<vvi4i4i2>, mma_<avvi4i4i2>, mma_<vvi4i4>, mma_<avvi4i4>,
28637 mma_<pvi4i2>, mma_<apvi4i2>, mma_<vvi4i4i4>, mma_<avvi4i4i4>):
28639 * config/rs6000/rs6000.c (rs6000_final_prescan_insn): Adjust test.
28640 * config/rs6000/rs6000.md (define_attr "maybe_prefixed"): New.
28641 (define_attr "prefixed"): Update initializer.
28643 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28646 * dwarf2out.c (debug_ranges_dwo_section): New variable.
28647 (DW_RANGES_IDX_SKELETON): Define.
28648 (struct dw_ranges): Add begin_entry and end_entry members.
28649 (DEBUG_DWO_RNGLISTS_SECTION): Define.
28650 (add_ranges_num): Adjust r initializer for addition of *_entry
28652 (add_ranges_by_labels): For -gsplit-dwarf and force_direct,
28653 set idx to DW_RANGES_IDX_SKELETON.
28654 (use_distinct_base_address_for_range): New function.
28655 (index_rnglists): Don't set r->idx if it is equal to
28656 DW_RANGES_IDX_SKELETON. Initialize r->begin_entry and
28657 r->end_entry for -gsplit-dwarf if those will be needed by
28659 (output_rnglists): Add DWO argument. If true, switch to
28660 debug_ranges_dwo_section rather than debug_ranges_section.
28661 Adjust l1/l2 label indexes. Only output the offset table when
28662 dwo is true and don't include in there the skeleton range
28663 entry if present. For -gsplit-dwarf, skip ranges that belong
28664 to the other rnglists section. Change return type from void
28665 to bool and return true if there are any range entries for
28666 the other section. For dwarf_split_debug_info use
28667 DW_RLE_startx_endx, DW_RLE_startx_length and DW_RLE_base_addressx
28668 entries instead of DW_RLE_start_end, DW_RLE_start_length and
28669 DW_RLE_base_address. Use use_distinct_base_address_for_range.
28670 (init_sections_and_labels): Initialize debug_ranges_dwo_section
28671 if -gsplit-dwarf and DWARF >= 5. Adjust ranges_section_label
28672 and range_base_label indexes.
28673 (dwarf2out_finish): Call index_rnglists earlier before finalizing
28674 .debug_addr. Never emit DW_AT_rnglists_base attribute. For
28675 -gsplit-dwarf and DWARF >= 5 call output_rnglists up to twice
28676 with different dwo arguments.
28677 (dwarf2out_c_finalize): Clear debug_ranges_dwo_section.
28679 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28681 PR tree-optimization/98268
28682 * gimple-fold.c (maybe_canonicalize_mem_ref_addr): Call
28683 recompute_tree_invariant_for_addr_expr after successfully
28684 folding a TARGET_MEM_REF that occurs inside an ADDR_EXPR.
28686 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28688 PR tree-optimization/99726
28689 * tree-data-ref.c (create_intersect_range_checks_index): Bail
28690 out if there is more than one access function SCEV for the loop
28693 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28695 PR rtl-optimization/97141
28696 PR rtl-optimization/98726
28697 * emit-rtl.c (valid_for_const_vector_p): Return true for
28699 * rtx-vector-builder.h (rtx_vector_builder::step): Return a
28700 poly_wide_int instead of a wide_int.
28701 (rtx_vector_builder::apply_set): Take a poly_wide_int instead
28703 * rtx-vector-builder.c (rtx_vector_builder::apply_set): Likewise.
28704 * config/aarch64/aarch64.c (aarch64_legitimate_constant_p): Return
28705 false for CONST_VECTORs that cannot be forced to memory.
28706 * config/aarch64/aarch64-simd.md (mov<mode>): If a CONST_VECTOR
28707 is too complex to force to memory, build it up from individual
28710 2021-03-31 Jan Hubicka <jh@suse.cz>
28713 * cgraph.c (cgraph_node::release_body): Fix overactive check.
28715 2021-03-31 Christophe Lyon <christophe.lyon@linaro.org>
28718 * config/arm/vec-common.md (mul<mode>3): Disable on iwMMXT, except
28721 2021-03-31 H.J. Lu <hjl.tools@gmail.com>
28723 * config/i386/i386-expand.c (expand_set_or_cpymem_via_rep):
28724 For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, don't convert QImode
28726 (decide_alg): For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, use
28727 "rep movsb/stosb" only for known sizes.
28728 * config/i386/i386-options.c (processor_cost_table): Use Ice
28729 Lake cost for Cannon Lake, Ice Lake, Tiger Lake, Sapphire
28730 Rapids and Alder Lake.
28731 * config/i386/i386.h (TARGET_PREFER_KNOWN_REP_MOVSB_STOSB): New.
28732 * config/i386/x86-tune-costs.h (icelake_memcpy): New.
28733 (icelake_memset): Likewise.
28734 (icelake_cost): Likewise.
28735 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
28738 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28741 * config/aarch64/aarch64.c
28742 (aarch64_vectorize_preferred_vector_alignment): Query the size
28743 of the provided SVE vector; do not assume that all SVE vectors
28744 have the same size.
28746 2021-03-31 Jan Hubicka <jh@suse.cz>
28749 * cgraph.c (cgraph_node::release_body): Remove all callers and
28751 * cgraphclones.c (cgraph_node::materialize_clone): Do not do it here.
28752 * cgraphunit.c (cgraph_node::expand): And here.
28754 2021-03-31 Martin Liska <mliska@suse.cz>
28756 * ipa-modref.c (analyze_ssa_name_flags): Fix coding style
28757 and one negated condition.
28759 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28760 Richard Sandiford <richard.sandiford@arm.com>
28763 * config/aarch64/aarch64.md (*add<mode>3_poly_1): Swap Uai and Uav
28764 constraints on operands[2] and similarly 0 and rk constraints
28765 on operands[1] corresponding to that.
28767 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28770 * configure.ac (HAVE_LD_BROKEN_PE_DWARF5): New AC_DEFINE if PECOFF
28771 linker doesn't support DWARF sections new in DWARF5.
28772 * config/i386/i386-options.c (ix86_option_override_internal): Default
28773 to dwarf_version 4 if HAVE_LD_BROKEN_PE_DWARF5 for TARGET_PECOFF
28775 * config.in: Regenerated.
28776 * configure: Regenerated.
28778 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28781 * config/aarch64/aarch64.c (aarch64_analyze_loop_vinfo): Check for
28782 available issue_info before using it.
28784 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28787 * config/aarch64/aarch64.md (sub<mode>3_compare1_imm): Do not allow zero
28790 2021-03-30 Xionghu Luo <luoxhu@linux.ibm.com>
28793 * config/rs6000/altivec.md (altivec_lvsl_reg): Change to ...
28794 (altivec_lvsl_reg_<mode>): ... this.
28795 (altivec_lvsr_reg): Change to ...
28796 (altivec_lvsr_reg_<mode>): ... this.
28797 * config/rs6000/predicates.md (vec_set_index_operand): New.
28798 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
28799 Enable 32bit variable vec_insert for all TARGET_VSX.
28800 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
28801 Enable 32bit variable vec_insert for p9 and above.
28802 (rs6000_expand_vector_set_var_p8): Rename to ...
28803 (rs6000_expand_vector_set_var_p7): ... this.
28804 (rs6000_expand_vector_set): Use TARGET_VSX and adjust assert
28806 * config/rs6000/vector.md (vec_set<mode>): Use vec_set_index_operand.
28807 * config/rs6000/vsx.md (xl_len_r): Use gen_altivec_lvsl_reg_di and
28808 gen_altivec_lvsr_reg_di.
28810 2021-03-30 H.J. Lu <hjl.tools@gmail.com>
28813 * config/i386/ia32intrin.h (__rdtsc): Defined as macro.
28814 (__rdtscp): Likewise.
28816 2021-03-30 Tamar Christina <tamar.christina@arm.com>
28818 PR tree-optimization/99825
28819 * tree-vect-slp-patterns.c (vect_check_evenodd_blend):
28820 Reject lane counts that are not a multiple of 2.
28822 2021-03-30 Richard Earnshaw <rearnsha@arm.com>
28825 * config/arm/arm.c (arm_file_start): Fix emission of
28826 Tag_ABI_VFP_args attribute.
28828 2021-03-30 Richard Biener <rguenther@suse.de>
28830 PR tree-optimization/99824
28831 * stor-layout.c (set_min_and_max_values_for_integral_type):
28832 Assert the precision is within the bounds of
28833 WIDE_INT_MAX_PRECISION.
28834 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Use
28835 the outermost component ref only to lower the access size
28836 and initialize that from the access type.
28838 2021-03-30 Richard Sandiford <richard.sandiford@arm.com>
28841 * config/aarch64/aarch64.md (mov<mode>): Pass multi-instruction
28842 CONST_INTs to aarch64_expand_mov_immediate when called after RA.
28844 2021-03-30 Mihailo Stojanovic <mihailo.stojanovic@typhoon-hil.com>
28846 * config/aarch64/aarch64.md
28847 (<optab>_trunc<fcvt_target><GPI:mode>2): Set the "arch"
28848 attribute to disambiguate between SIMD and FP variants of the
28851 2021-03-29 Jan Hubicka <hubicka@ucw.cz>
28853 * ipa-modref.c (merge_call_lhs_flags): Correct handling of deref.
28854 (analyze_ssa_name_flags): Fix typo in comment.
28856 2021-03-29 Alex Coplan <alex.coplan@arm.com>
28859 * config/aarch64/aarch64-sve-builtins.cc
28860 (function_builder::add_function): Add placeholder_p argument, use
28861 placeholder decls if this is set.
28862 (function_builder::add_unique_function): Instead of conditionally adding
28863 direct overloads, unconditionally add either a direct overload or a
28865 (function_builder::add_overloaded_function): Set placeholder_p if we're
28866 using C++ overloads. Use the obstack for string storage instead
28867 of relying on the tree nodes.
28868 (function_builder::add_overloaded_functions): Don't return early for
28869 m_direct_overloads: we need to add placeholders.
28870 * config/aarch64/aarch64-sve-builtins.h
28871 (function_builder::add_function): Add placeholder_p argument.
28873 2021-03-29 Richard Biener <rguenther@suse.de>
28875 PR tree-optimization/99807
28876 * tree-vect-slp.c (vect_slp_analyze_node_operations_1): Move
28877 assert below VEC_PERM handling.
28879 2021-03-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28882 * config/aarch64/aarch64-simd.md (move_lo_quad_internal_<mode>): Use
28883 aarch64_simd_or_scalar_imm_zero to match zeroes. Remove pattern
28884 matching const_int 0.
28885 (move_lo_quad_internal_be_<mode>): Likewise.
28886 (move_lo_quad_<mode>): Update for the above.
28887 * config/aarch64/iterators.md (VQ_2E): Delete.
28889 2021-03-29 Jakub Jelinek <jakub@redhat.com>
28891 PR tree-optimization/99777
28892 * fold-const.c (extract_muldiv_1): For conversions, punt on casts from
28893 types other than scalar integral types.
28895 2021-03-28 David Edelsohn <dje.gcc@gmail.com>
28897 * config/rs6000/rs6000.c (rs6000_output_dwarf_dtprel): Do not add
28898 XCOFF TLS reloc decorations.
28900 2021-03-28 Gerald Pfeifer <gerald@pfeifer.com>
28902 * doc/analyzer.texi (Analyzer Internals): Update link to
28903 "A Memory Model for Static Analysis of C Programs".
28905 2021-03-26 David Edelsohn <dje.gcc@gmail.com>
28907 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Call function.
28908 * config/rs6000/rs6000-protos.h (rs6000_special_adjust_field_align):
28910 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align): New.
28911 (rs6000_special_round_type_align): Recursively check innermost first
28914 2021-03-26 Jakub Jelinek <jakub@redhat.com>
28917 * dwarf2out.h (struct dw_fde_node): Add rule18 member.
28918 * dwarf2cfi.c (dwarf2out_frame_debug_expr): When handling (set hfp sp)
28919 assignment with drap_reg active, queue reg save for hfp with offset 0
28920 and flush queued reg saves. When handling a push with rule18,
28921 defer queueing reg save for hfp and just assert the offset is 0.
28922 (scan_trace): Assert that fde->rule18 is false.
28924 2021-03-26 Vladimir Makarov <vmakarov@redhat.com>
28927 * ira-costs.c (record_reg_classes): Put case with
28928 CT_RELAXED_MEMORY adjacent to one with CT_MEMORY.
28929 * ira.c (ira_setup_alts): Ditto.
28930 * lra-constraints.c (process_alt_operands): Ditto.
28931 * recog.c (asm_operand_ok): Ditto.
28932 * reload.c (find_reloads): Ditto.
28934 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28936 * config/aarch64/aarch64-protos.h
28937 (cpu_addrcost_table::post_modify_ld3_st3): New member variable.
28938 (cpu_addrcost_table::post_modify_ld4_st4): Likewise.
28939 * config/aarch64/aarch64.c (generic_addrcost_table): Update
28940 accordingly, using the same costs as for post_modify.
28941 (exynosm1_addrcost_table, xgene1_addrcost_table): Likewise.
28942 (thunderx2t99_addrcost_table, thunderx3t110_addrcost_table)
28943 (tsv110_addrcost_table, qdf24xx_addrcost_table): Likewise.
28944 (a64fx_addrcost_table): Likewise.
28945 (neoversev1_addrcost_table): New.
28946 (neoversev1_tunings): Use neoversev1_addrcost_table.
28947 (aarch64_address_cost): Use the new post_modify costs for CImode
28950 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28952 * config/aarch64/aarch64.opt
28953 (-param=aarch64-loop-vect-issue-rate-niters=): New parameter.
28954 * doc/invoke.texi: Document it.
28955 * config/aarch64/aarch64-protos.h (aarch64_base_vec_issue_info)
28956 (aarch64_scalar_vec_issue_info, aarch64_simd_vec_issue_info)
28957 (aarch64_advsimd_vec_issue_info, aarch64_sve_vec_issue_info)
28958 (aarch64_vec_issue_info): New structures.
28959 (cpu_vector_cost): Write comments above the variables rather
28961 (cpu_vector_cost::issue_info): New member variable.
28962 * config/aarch64/aarch64.c: Include gimple-pretty-print.h
28963 and tree-ssa-loop-niter.h.
28964 (generic_vector_cost, a64fx_vector_cost, qdf24xx_vector_cost)
28965 (thunderx_vector_cost, tsv110_vector_cost, cortexa57_vector_cost)
28966 (exynosm1_vector_cost, xgene1_vector_cost, thunderx2t99_vector_cost)
28967 (thunderx3t110_vector_cost): Initialize issue_info to null.
28968 (neoversev1_scalar_issue_info, neoversev1_advsimd_issue_info)
28969 (neoversev1_sve_issue_info, neoversev1_vec_issue_info): New structures.
28970 (neoversev1_vector_cost): Use them.
28971 (aarch64_vec_op_count, aarch64_sve_op_count): New structures.
28972 (aarch64_vector_costs::saw_sve_only_op): New member variable.
28973 (aarch64_vector_costs::num_vector_iterations): Likewise.
28974 (aarch64_vector_costs::scalar_ops): Likewise.
28975 (aarch64_vector_costs::advsimd_ops): Likewise.
28976 (aarch64_vector_costs::sve_ops): Likewise.
28977 (aarch64_vector_costs::seen_loads): Likewise.
28978 (aarch64_simd_vec_costs_for_flags): New function.
28979 (aarch64_analyze_loop_vinfo): Initialize num_vector_iterations.
28980 Count the number of predicate operations required by SVE WHILE
28982 (aarch64_comparison_type, aarch64_multiply_add_p): New functions.
28983 (aarch64_sve_only_stmt_p, aarch64_in_loop_reduction_latency): Likewise.
28984 (aarch64_count_ops): Likewise.
28985 (aarch64_add_stmt_cost): Record whether we see an SVE operation
28986 that cannot currently be implemented using Advanced SIMD.
28987 Record issue information about the scalar, Advanced SIMD
28988 and (where relevant) SVE versions of a loop.
28989 (aarch64_vec_op_count::dump): New function.
28990 (aarch64_sve_op_count::dump): Likewise.
28991 (aarch64_estimate_min_cycles_per_iter): Likewise.
28992 (aarch64_adjust_body_cost): If issue information is available,
28993 try to compare the issue rates of the various loop implementations
28994 and increase or decrease the vector body cost accordingly.
28996 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28998 * config/aarch64/aarch64.c (aarch64_detect_vector_stmt_subtype):
28999 Assume a zero cost for induction phis.
29001 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29003 * config/aarch64/aarch64.c (aarch64_embedded_comparison_type): New
29005 (aarch64_adjust_stmt_cost): Add the costs of embedded scalar and
29006 vector comparisons.
29008 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29010 * config/aarch64/aarch64.c (aarch64_detect_scalar_stmt_subtype):
29012 (aarch64_add_stmt_cost): Call it.
29014 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29016 * config/aarch64/aarch64-tuning-flags.def (matched_vector_throughput):
29017 New tuning parameter.
29018 * config/aarch64/aarch64.c (neoversev1_tunings): Use it.
29019 (aarch64_estimated_sve_vq): New function.
29020 (aarch64_vector_costs::analyzed_vinfo): New member variable.
29021 (aarch64_vector_costs::is_loop): Likewise.
29022 (aarch64_vector_costs::unrolled_advsimd_niters): Likewise.
29023 (aarch64_vector_costs::unrolled_advsimd_stmts): Likewise.
29024 (aarch64_record_potential_advsimd_unrolling): New function.
29025 (aarch64_analyze_loop_vinfo, aarch64_analyze_bb_vinfo): Likewise.
29026 (aarch64_add_stmt_cost): Call aarch64_analyze_loop_vinfo or
29027 aarch64_analyze_bb_vinfo on the first use of a costs structure.
29028 Detect whether we're vectorizing a loop for SVE that might be
29029 completely unrolled if it used Advanced SIMD instead.
29030 (aarch64_adjust_body_cost_for_latency): New function.
29031 (aarch64_finish_cost): Call it.
29033 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29035 * config/aarch64/aarch64.c (aarch64_vector_costs): New structure.
29036 (aarch64_init_cost): New function.
29037 (aarch64_add_stmt_cost): Use aarch64_vector_costs instead of
29038 the default unsigned[3].
29039 (aarch64_finish_cost, aarch64_destroy_cost_data): New functions.
29040 (TARGET_VECTORIZE_INIT_COST): Override.
29041 (TARGET_VECTORIZE_FINISH_COST): Likewise.
29042 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
29044 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29046 * config/aarch64/aarch64.c (neoversev1_advsimd_vector_cost)
29047 (neoversev1_sve_vector_cost): New cost structures.
29048 (neoversev1_vector_cost): Likewise.
29049 (neoversev1_tunings): Use them. Enable use_new_vector_costs.
29051 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29053 * config/aarch64/aarch64-protos.h
29054 (sve_vec_cost::scatter_store_elt_cost): New member variable.
29055 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
29056 accordingly, taking the cost from the cost of a scalar_store.
29057 (a64fx_sve_vector_cost): Likewise.
29058 (aarch64_detect_vector_stmt_subtype): Detect scatter stores.
29060 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29062 * config/aarch64/aarch64-protos.h
29063 (simd_vec_cost::store_elt_extra_cost): New member variable.
29064 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
29065 accordingly, using the vec_to_scalar cost for the new field.
29066 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
29067 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
29068 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
29069 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
29070 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
29071 (thunderx3t110_advsimd_vector_cost): Likewise.
29072 (aarch64_detect_vector_stmt_subtype): Detect single-element stores.
29074 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29076 * config/aarch64/aarch64-protos.h (simd_vec_cost::ld2_st2_permute_cost)
29077 (simd_vec_cost::ld3_st3_permute_cost): New member variables.
29078 (simd_vec_cost::ld4_st4_permute_cost): Likewise.
29079 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
29080 accordingly, using zero for the new costs.
29081 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
29082 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
29083 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
29084 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
29085 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
29086 (thunderx3t110_advsimd_vector_cost): Likewise.
29087 (aarch64_ld234_st234_vectors): New function.
29088 (aarch64_adjust_stmt_cost): Likewise.
29089 (aarch64_add_stmt_cost): Call aarch64_adjust_stmt_cost if using
29090 the new vector costs.
29092 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29094 * config/aarch64/aarch64-protos.h (sve_vec_cost): Turn into a
29095 derived class of simd_vec_cost. Add information about CLAST[AB]
29096 and FADDA instructions.
29097 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
29098 accordingly, using the vec_to_scalar costs for the new fields.
29099 (a64fx_sve_vector_cost): Likewise.
29100 (aarch64_reduc_type): New function.
29101 (aarch64_sve_in_loop_reduction_latency): Likewise.
29102 (aarch64_detect_vector_stmt_subtype): Take a vinfo parameter.
29103 Use aarch64_sve_in_loop_reduction_latency to handle SVE reductions
29104 that occur in the loop body.
29105 (aarch64_add_stmt_cost): Update call accordingly.
29107 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29109 * config/aarch64/aarch64-tuning-flags.def (use_new_vector_costs):
29111 * config/aarch64/aarch64-protos.h (simd_vec_cost): Put comments
29112 above the fields rather than to the right.
29113 (simd_vec_cost::reduc_i8_cost): New member variable.
29114 (simd_vec_cost::reduc_i16_cost): Likewise.
29115 (simd_vec_cost::reduc_i32_cost): Likewise.
29116 (simd_vec_cost::reduc_i64_cost): Likewise.
29117 (simd_vec_cost::reduc_f16_cost): Likewise.
29118 (simd_vec_cost::reduc_f32_cost): Likewise.
29119 (simd_vec_cost::reduc_f64_cost): Likewise.
29120 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
29121 accordingly, using the vec_to_scalar_cost for the new fields.
29122 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
29123 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
29124 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
29125 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
29126 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
29127 (thunderx3t110_advsimd_vector_cost): Likewise.
29128 (aarch64_use_new_vector_costs_p): New function.
29129 (aarch64_simd_vec_costs): New function, split out from...
29130 (aarch64_builtin_vectorization_cost): ...here.
29131 (aarch64_is_reduction): New function.
29132 (aarch64_detect_vector_stmt_subtype): Likewise.
29133 (aarch64_add_stmt_cost): Call aarch64_detect_vector_stmt_subtype if
29134 using the new vector costs.
29136 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
29139 * tree-emutls.c (get_emutls_init_templ_addr): Mark initializer of weak
29140 TLS declarations as public.
29142 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
29144 * config/aarch64/aarch64-d.c (IN_TARGET_CODE): Define.
29145 * config/arm/arm-d.c (IN_TARGET_CODE): Likewise.
29146 * config/i386/i386-d.c (IN_TARGET_CODE): Likewise.
29147 * config/mips/mips-d.c (IN_TARGET_CODE): Likewise.
29148 * config/pa/pa-d.c (IN_TARGET_CODE): Likewise.
29149 * config/riscv/riscv-d.c (IN_TARGET_CODE): Likewise.
29150 * config/rs6000/rs6000-d.c (IN_TARGET_CODE): Likewise.
29151 * config/s390/s390-d.c (IN_TARGET_CODE): Likewise.
29152 * config/sparc/sparc-d.c (IN_TARGET_CODE): Likewise.
29154 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
29157 * config.gcc (*-*-cygwin*): Add winnt-d.o
29158 (*-*-mingw*): Likewise.
29159 * config/i386/cygwin.h (EXTRA_TARGET_D_OS_VERSIONS): New macro.
29160 * config/i386/mingw32.h (EXTRA_TARGET_D_OS_VERSIONS): Likewise.
29161 * config/i386/t-cygming: Add winnt-d.o.
29162 * config/i386/winnt-d.c: New file.
29164 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
29166 * config/freebsd-d.c: Include memmodel.h.
29168 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
29171 * config.gcc (*-*-openbsd*): Add openbsd-d.o.
29172 * config/t-openbsd: Add openbsd-d.o.
29173 * config/openbsd-d.c: New file.
29175 2021-03-25 Stam Markianos-Wright <stam.markianos-wright@arm.com>
29177 PR tree-optimization/96974
29178 * tree-vect-stmts.c (vect_get_vector_types_for_stmt): Replace assert
29179 with graceful exit.
29181 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
29184 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
29188 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
29189 always_inline in system headers.
29191 2021-03-25 Kewen Lin <linkw@linux.ibm.com>
29193 * tree-vect-loop.c (vect_model_reduction_cost): Init inside_cost.
29195 2021-03-25 Jakub Jelinek <jakub@redhat.com>
29198 * tree-core.h (enum operand_equal_flag): Add OEP_ADDRESS_OF_SAME_FIELD.
29199 * fold-const.c (operand_compare::operand_equal_p): Don't compare
29200 field offsets if OEP_ADDRESS_OF_SAME_FIELD.
29202 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
29206 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
29207 always_inline in system headers.
29209 2021-03-25 Richard Biener <rguenther@suse.de>
29211 PR tree-optimization/99746
29212 * tree-vect-slp-patterns.c (complex_pattern::build): Do not mark
29213 the scalar stmt as patterned. Instead set up required things
29216 2021-03-25 Xionghu Luo <luoxhu@linux.ibm.com>
29218 * config/rs6000/rs6000.c (power8_costs): Change l2 cache
29221 2021-03-24 Martin Liska <mliska@suse.cz>
29224 * common/config/i386/i386-common.c (ARRAY_SIZE): Fix off-by-one
29226 * config/i386/i386-options.c (ix86_option_override_internal):
29227 Add run-time assert.
29229 2021-03-24 Martin Jambor <mjambor@suse.cz>
29232 * ipa-cp.c (initialize_node_lattices): Mark as bottom all
29233 parameters with unknown type.
29234 (ipacp_value_safe_for_type): New function.
29235 (propagate_vals_across_arith_jfunc): Verify that the constant type
29236 can be used for a type of the formal parameter.
29237 (propagate_vals_across_ancestor): Likewise.
29238 (propagate_scalar_across_jump_function): Likewise. Pass the type
29239 also to propagate_vals_across_ancestor.
29241 2021-03-24 Christophe Lyon <christophe.lyon@linaro.org>
29244 * config/arm/mve.md (movmisalign<mode>_mve_store): Use Ux
29246 (movmisalign<mode>_mve_load): Likewise.
29248 2021-03-24 Jakub Jelinek <jakub@redhat.com>
29251 * config/arm/vec-common.md (one_cmpl<mode>2, neg<mode>2,
29252 movmisalign<mode>): Disable expanders for TARGET_REALLY_IWMMXT.
29254 2021-03-24 Alexandre Oliva <oliva@adacore.com>
29256 * doc/sourcebuild.texi (sysconf): New effective target.
29258 2021-03-24 Alexandre Oliva <oliva@adacore.com>
29260 * config/i386/predicates.md (reg_or_const_vec_operand): New.
29261 * config/i386/sse.md (ssse3_pshufbv8qi3): Add an expander for
29262 the now *-prefixed insn_and_split, turn the splitter const vec
29263 into an input for the insn, making it an ignored immediate for
29264 non-split cases, and loaded into the scratch register
29267 2021-03-23 Vladimir N. Makarov <vmakarov@redhat.com>
29270 * config/aarch64/constraints.md (Utq, UOb, UOh, UOw, UOd, UOty):
29271 Use define_relaxed_memory_constraint for them.
29273 2021-03-23 Iain Sandoe <iain@sandoe.co.uk>
29276 * config/host-darwin.c (darwin_gt_pch_use_address): Add a
29277 colon to the diagnostic message.
29279 2021-03-23 Ilya Leoshkevich <iii@linux.ibm.com>
29281 * fwprop.c (fwprop_propagation::fwprop_propagation): Look at
29283 (try_fwprop_subst_note): Use set_info instead of insn_info.
29284 (try_fwprop_subst_pattern): Likewise.
29285 (try_fwprop_subst_notes): Likewise.
29286 (try_fwprop_subst): Likewise.
29287 (forward_propagate_subreg): Likewise.
29288 (forward_propagate_and_simplify): Likewise.
29289 (forward_propagate_into): Likewise.
29290 * rtl-ssa/accesses.h (set_info::single_nondebug_use): New
29292 (set_info::single_nondebug_insn_use): Likewise.
29293 (set_info::single_phi_use): Likewise.
29294 * rtl-ssa/member-fns.inl (set_info::single_nondebug_use): New
29296 (set_info::single_nondebug_insn_use): Likewise.
29297 (set_info::single_phi_use): Likewise.
29299 2021-03-23 Christophe Lyon <christophe.lyon@linaro.org>
29301 * doc/sourcebuild.texi (arm_dsp_ok, arm_dsp): Document.
29303 2021-03-23 Jakub Jelinek <jakub@redhat.com>
29306 * config/aarch64/aarch64.c (aarch64_add_offset): Tell
29307 expand_mult to perform an unsigned rather than a signed
29310 2021-03-23 H.J. Lu <hjl.tools@gmail.com>
29313 * config/i386/cpuid.h (__cpuid): Add __volatile__.
29314 (__cpuid_count): Likewise.
29316 2021-03-23 Richard Biener <rguenther@suse.de>
29318 PR tree-optimization/99721
29319 * tree-vect-slp.c (vect_slp_analyze_node_operations):
29320 Make sure we can schedule the node.
29322 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
29324 * config/riscv/riscv.c (riscv_subword): Take endianness into
29325 account when calculating the byte offset.
29327 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
29329 * config/riscv/predicates.md (subreg_lowpart_operator): New predicate.
29330 * config/riscv/riscv.md (*addsi3_extended2, *subsi3_extended2)
29331 (*negsi2_extended2, *mulsi3_extended2, *<optab>si3_mask)
29332 (*<optab>si3_mask_1, *<optab>di3_mask, *<optab>di3_mask_1)
29333 (*<optab>si3_extend_mask, *<optab>si3_extend_mask_1): Use
29334 new predicate "subreg_lowpart_operator".
29336 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
29338 * config/riscv/riscv.c (riscv_swap_instruction): New function
29339 to byteswap an SImode rtx containing an instruction.
29340 (riscv_trampoline_init): Byteswap the generated instructions
29343 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
29345 * common/config/riscv/riscv-common.c
29346 (TARGET_DEFAULT_TARGET_FLAGS): Set default endianness.
29347 * config.gcc (riscv32be-*, riscv64be-*): Set
29348 TARGET_BIG_ENDIAN_DEFAULT to 1.
29349 * config/riscv/elf.h (LINK_SPEC): Change -melf* value
29350 depending on default endianness.
29351 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
29352 * config/riscv/linux.h (LINK_SPEC): Likewise.
29353 * config/riscv/riscv.c (TARGET_DEFAULT_TARGET_FLAGS): Set
29354 default endianness.
29355 * config/riscv/riscv.h (DEFAULT_ENDIAN_SPEC): New macro.
29357 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
29359 * config/riscv/elf.h (LINK_SPEC): Pass linker endianness flag.
29360 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
29361 * config/riscv/linux.h (LINK_SPEC): Likewise.
29362 * config/riscv/riscv.h (ASM_SPEC): Pass -mbig-endian and
29364 (BYTES_BIG_ENDIAN): Handle big endian.
29365 (WORDS_BIG_ENDIAN): Define to BYTES_BIG_ENDIAN.
29366 * config/riscv/riscv.opt (-mbig-endian, -mlittle-endian): New
29368 * doc/invoke.texi (-mbig-endian, -mlittle-endian): Document.
29370 2021-03-23 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
29372 * regcprop.c (find_oldest_value_reg): Ask target whether
29373 different mode is fine for replacement register.
29375 2021-03-23 Aldy Hernandez <aldyh@redhat.com>
29377 PR tree-optimization/99296
29378 * value-range.cc (irange::irange_set_1bit_anti_range): New.
29379 (irange::irange_set_anti_range): Call irange_set_1bit_anti_range.
29380 * value-range.h (irange::irange_set_1bit_anti_range): New.
29382 2021-03-22 Vladimir N. Makarov <vmakarov@redhat.com>
29385 * config/aarch64/constraints.md (UtQ): Use
29386 define_relaxed_memory_constraint for it.
29387 * doc/md.texi (define_relaxed_memory_constraint): Describe it.
29388 * genoutput.c (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
29389 * genpreds.c (constraint_data): Add bitfield is_relaxed_memory.
29390 (have_relaxed_memory_constraints): New static var.
29391 (relaxed_memory_start, relaxed_memory_end): Ditto.
29392 (add_constraint): Add arg is_relaxed_memory. Check name for
29393 relaxed memory. Set up is_relaxed_memory in constraint_data and
29394 have_relaxed_memory_constraints. Adjust calls.
29395 (choose_enum_order): Process relaxed memory.
29396 (write_tm_preds_h): Ditto.
29397 (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
29398 * gensupport.c (process_rtx): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
29399 * ira-costs.c (record_reg_classes): Process CT_RELAXED_MEMORY.
29400 * ira-lives.c (single_reg_class): Use
29401 insn_extra_relaxed_memory_constraint.
29402 * ira.c (ira_setup_alts): Process CT_RELAXED_MEMORY.
29403 * lra-constraints.c (valid_address_p): Use
29404 insn_extra_relaxed_memory_constraint instead of other memory
29406 (process_alt_operands): Process CT_RELAXED_MEMORY.
29407 (curr_insn_transform): Use insn_extra_relaxed_memory_constraint.
29408 * recog.c (asm_operand_ok, preprocess_constraints): Process
29410 * reload.c (find_reloads): Ditto.
29411 * rtl.def (DEFINE_RELAXED_MEMORY_CONSTRAINT): New.
29412 * stmt.c (parse_input_constraint): Use
29413 insn_extra_relaxed_memory_constraint.
29415 2021-03-22 Segher Boessenkool <segher@kernel.crashing.org>
29418 * ubsan.c (ubsan_instrument_float_cast): Don't test for unordered if
29421 2021-03-22 Alex Coplan <alex.coplan@arm.com>
29424 * config/arm/arm-protos.h (neon_make_constant): Add generate
29425 argument to guard emitting insns, default to true.
29426 * config/arm/arm.c (arm_legitimate_constant_p_1): Reject
29427 CONST_VECTORs which neon_make_constant can't handle.
29428 (neon_vdup_constant): Add generate argument, avoid emitting
29429 insns if it's not set.
29430 (neon_make_constant): Plumb new generate argument through.
29431 * config/arm/constraints.md (Ui): New. Use it...
29432 * config/arm/mve.md (*mve_mov<mode>): ... here.
29433 * config/arm/vec-common.md (movv8hf): Use neon_make_constant to
29434 synthesize constants.
29436 2021-03-22 Richard Biener <rguenther@suse.de>
29438 * debug.h: Add deprecation warning.
29440 2021-03-22 Richard Biener <rguenther@suse.de>
29442 PR tree-optimization/99694
29443 * tree-ssa-sccvn.c (visit_phi): Ignore edges with the
29446 2021-03-22 Kito Cheng <kito.cheng@sifive.com>
29449 * config/riscv/riscv.c (riscv_expand_block_move): Get RTL value
29450 after type checking.
29452 2021-03-22 Jakub Jelinek <jakub@redhat.com>
29456 * dwarf2out.c (get_full_len): Use get_precision rather than
29458 (add_const_value_attribute): Make sure add_AT_wide argument has
29459 precision prec rather than some very wide one.
29461 2021-03-22 Kewen Lin <linkw@linux.ibm.com>
29463 * config/rs6000/rs6000.md (*rotldi3_insert_sf,
29464 *mov<SFDF:mode><SFDF2:mode>cc_p9, floatsi<mode>2_lfiwax,
29465 floatsi<mode>2_lfiwax_mem, floatunssi<mode>2_lfiwzx,
29466 floatunssi<mode>2_lfiwzx_mem, *floatsidf2_internal,
29467 *floatunssidf2_internal, fix_trunc<mode>si2_stfiwx,
29468 fix_trunc<mode>si2_internal, fixuns_trunc<mode>si2_stfiwx,
29469 *round32<mode>2_fprs, *roundu32<mode>2_fprs,
29470 *fix_trunc<mode>si2_internal): Fix empty split condition.
29471 * config/rs6000/vsx.md (*vsx_le_undo_permute_<mode>,
29472 vsx_reduc_<VEC_reduc_name>_v2df, vsx_reduc_<VEC_reduc_name>_v4sf,
29473 *vsx_reduc_<VEC_reduc_name>_v2df_scalar,
29474 *vsx_reduc_<VEC_reduc_name>_v4sf_scalar): Likewise.
29476 2021-03-22 Xionghu Luo <luoxhu@linux.ibm.com>
29479 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
29480 Convert idx to DImode.
29481 (rs6000_expand_vector_set_var_p8): Likewise.
29483 2021-03-21 Jakub Jelinek <jakub@redhat.com>
29486 * dwarf2out.c (insert_float): Change return type from void to
29487 unsigned, handle GET_MODE_SIZE (mode) == 2 and return element size.
29488 (mem_loc_descriptor, loc_descriptor, add_const_value_attribute):
29491 2021-03-20 H.J. Lu <hjl.tools@gmail.com>
29494 * config/i386/i386.c (construct_container): Check cfun != NULL
29495 before accessing silent_p.
29497 2021-03-20 Ahamed Husni <ahamedhusni73@gmail.com>
29499 * asan.c: Fix typos in comments.
29501 2021-03-20 Vladimir N. Makarov <vmakarov@redhat.com>
29503 PR rtl-optimization/99680
29504 * lra-constraints.c (skip_contraint_modifiers): Rename to skip_constraint_modifiers.
29505 (process_address_1): Check empty constraint before using
29508 2021-03-19 Pat Haugen <pthaugen@linux.ibm.com>
29510 * config/rs6000/rs6000.c (power10_cost): New.
29511 (rs6000_option_override_internal): Set Power10 costs.
29512 (rs6000_issue_rate): Set Power10 issue rate.
29513 * config/rs6000/power10.md: Rewrite for Power10.
29515 2021-03-19 Vladimir N. Makarov <vmakarov@redhat.com>
29518 * lra-constraints.c (process_address_1): Don't use unknown
29519 constraint for address constraint.
29521 2021-03-19 Iain Sandoe <iain@sandoe.co.uk>
29524 * config.gcc (powerpc-*-darwin8): Delete the reference to
29525 the now removed darwin8.h.
29527 2021-03-19 Olivier Hainque <hainque@adacore.com>
29530 * config/vxworksae.h (VX_CPU_PREFIX): Define.
29532 2021-03-19 John David Anglin <danglin@gcc.gnu.org>
29534 * config/pa/pa.c (import_milli): Use memcpy instead of strncpy.
29536 2021-03-19 Tamar Christina <tamar.christina@arm.com>
29538 PR tree-optimization/99656
29539 * tree-vect-slp-patterns.c (linear_loads_p,
29540 complex_add_pattern::matches, is_eq_or_top,
29541 vect_validate_multiplication, complex_mul_pattern::matches,
29542 complex_fms_pattern::matches): Remove complex_perm_kinds_t.
29543 * tree-vectorizer.h (complex_load_perm_t): Removed.
29544 (slp_tree_to_load_perm_map_t): Use complex_perm_kinds_t instead of
29545 complex_load_perm_t.
29547 2021-03-19 H.J. Lu <hjl.tools@gmail.com>
29550 * config/i386/i386-options.c (ix86_init_machine_status): Set
29552 * config/i386/i386.c (init_cumulative_args): Set silent_p to
29554 (construct_container): Return early for return and argument
29555 errors if silent_p is true.
29556 * config/i386/i386.h (machine_function): Add silent_p.
29558 2021-03-19 Jakub Jelinek <jakub@redhat.com>
29561 * config/arm/constraints.md (Ds): New constraint.
29562 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): Use w,Ds
29563 constraint instead of w,Dm.
29565 2021-03-19 Andrew Stubbs <ams@codesourcery.com>
29567 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Fix quotes
29570 2021-03-19 Eric Botcazou <ebotcazou@adacore.com>
29572 PR middle-end/99641
29573 * fold-const.c (native_encode_initializer) <CONSTRUCTOR>: For an
29574 array type, do the computation of the current position in sizetype.
29576 2021-03-18 Vladimir N. Makarov <vmakarov@redhat.com>
29579 * lra-constraints.c (process_address_1): Use lookup_constraint
29580 only for a single constraint.
29582 2021-03-18 Martin Sebor <msebor@redhat.com>
29584 PR middle-end/99502
29585 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): Rename...
29586 (inbounds_memaccess_p): ...to this. Check the ending offset of
29587 the accessed member.
29589 2021-03-18 Andrew Stubbs <ams@codesourcery.com>
29591 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Add %< and
29592 %> quote markers to error messages.
29593 (gcn_goacc_validate_dims): Likewise.
29594 (gcn_conditional_register_usage): Remove exclamation mark from error
29596 (gcn_vectorize_vec_perm_const): Ensure perm is fully initialized.
29598 2021-03-18 Jan Hubicka <hubicka@ucw.cz>
29600 * config/i386/x86-tune-costs.h (struct processor_costs): Fix costs of
29603 2021-03-18 Sinan Lin <sinan@isrc.iscas.ac.cn>
29604 Kito Cheng <kito.cheng@sifive.com>
29606 * config/riscv/riscv.c (riscv_block_move_straight): Change type
29607 to unsigned HOST_WIDE_INT for parameter and local variable with
29608 HOST_WIDE_INT type.
29609 (riscv_adjust_block_mem): Ditto.
29610 (riscv_block_move_loop): Ditto.
29611 (riscv_expand_block_move): Ditto.
29613 2021-03-18 Nick Clifton <nickc@redhat.com>
29615 * config/v850/v850.c (construct_restore_jr): Increase static
29617 (construct_save_jarl): Likewise.
29618 * config/v850/v850.h (DWARF2_DEBUGGING_INFO): Define.
29620 2021-03-18 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29622 * config/aarch64/aarch64.c (aarch64_adjust_generic_arch_tuning): Define.
29623 (aarch64_override_options_internal): Use it.
29624 (generic_tunings): Add AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS to
29627 2021-03-17 Sandra Loosemore <sandra@codesourcery.com>
29629 * config/nios2/nios2.c (nios2_custom_check_insns): Clean up
29630 error message format issues.
29631 (nios2_option_override): Likewise.
29632 (nios2_expand_fpu_builtin): Likewise.
29633 (nios2_init_custom_builtins): Adjust to avoid bogus strncpy
29634 truncation warning.
29635 (nios2_expand_custom_builtin): More error message format fixes.
29636 (nios2_expand_rdwrctl_builtin): Likewise.
29637 (nios2_expand_rdprs_builtin): Likewise.
29638 (nios2_expand_eni_builtin): Likewise.
29639 (nios2_expand_builtin): Likewise.
29640 (nios2_register_custom_code): Likewise.
29641 (nios2_valid_target_attribute_rec): Likewise.
29642 (nios2_add_insn_asm): Fix uninitialized variable warning.
29644 2021-03-17 Jan Hubicka <jh@suse.cz>
29646 * config/i386/x86-tune-costs.h (struct processor_costs): Update costs
29647 of gather to match reality.
29648 * config/i386/x86-tune.def (X86_TUNE_USE_GATHER): Enable for znver3.
29650 2021-03-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29652 * config/aarch64/aarch64-builtins.c (aarch64_expand_rng_builtin): Use EQ
29653 to compare against CC_REG rather than NE.
29655 2021-03-17 H.J. Lu <hjl.tools@gmail.com>
29658 * config/i386/i386.c (ix86_force_load_from_GOT_p): Support
29659 inline assembly statements.
29660 (ix86_print_operand): Update 'P' handling for -fno-plt.
29662 2021-03-17 Tamar Christina <tamar.christina@arm.com>
29665 * config/aarch64/aarch64.c
29666 (aarch64_simd_clone_compute_vecsize_and_simdlen): Remove unused var.
29668 2021-03-16 Segher Boessenkool <segher@kernel.crashing.org>
29671 * config/rs6000/predicates.md (branch_comparison_operator): Allow
29672 ordered and unordered for CCFPmode, if flag_finite_math_only.
29674 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29677 * config/i386/i386-expand.c (ix86_split_lea_for_addr): Emit a MULT
29678 rather than ASHIFT.
29679 * config/i386/i386.md (mult by 1248 into ashift): New splitter.
29681 2021-03-16 Martin Liska <mliska@suse.cz>
29684 * optc-save-gen.awk: Add flag_ipa_ra to exceptions for
29685 cl_optimization_compare function.
29687 2021-03-16 Ilya Leoshkevich <iii@linux.ibm.com>
29689 * config/s390/s390.c (f_constraint_p): Treat "fv" constraints
29692 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29695 * config/i386/i386.h (struct machine_function): Add
29696 has_explicit_vzeroupper bitfield.
29697 * config/i386/i386-expand.c (ix86_expand_builtin): Set
29698 cfun->machine->has_explicit_vzeroupper when expanding
29699 IX86_BUILTIN_VZEROUPPER.
29700 * config/i386/i386-features.c (rest_of_handle_insert_vzeroupper):
29701 Do the mode switching only when TARGET_VZEROUPPER, expensive
29702 optimizations turned on and not optimizing for size.
29703 (pass_insert_vzeroupper::gate): Enable even when
29704 cfun->machine->has_explicit_vzeroupper is set.
29706 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29709 * config/aarch64/aarch64.c
29710 (aarch64_simd_clone_compute_vecsize_and_simdlen): If not a function
29711 definition, walk TYPE_ARG_TYPES list if non-NULL for argument types
29712 instead of DECL_ARGUMENTS. Ignore types for uniform arguments.
29714 2021-03-15 Richard Biener <rguenther@suse.de>
29716 PR tree-optimization/98834
29717 * tree-ssa-sccvn.c (vn_reference_lookup_3): Handle missing
29718 subsetting by truncating the access size.
29720 2021-03-15 Jan Hubicka <hubicka@ucw.cz>
29722 * config/i386/i386-options.c (processor_cost_table): Add znver3_cost.
29723 * config/i386/x86-tune-costs.h (znver3_cost): New global variable; copy
29726 2021-03-15 Martin Liska <mliska@suse.cz>
29728 * spellcheck.c: Add missing comma in initialization.
29730 2021-03-14 Uroš Bizjak <ubizjak@gmail.com>
29732 * config/i386/sse.md (*vec_extract<mode>): Merge alternative 0 with
29733 alternative 2 and alternative 1 with alternative 3 using
29734 YW register constraint.
29735 (*vec_extract<PEXTR_MODE12:mode>_zext): Merge alternatives
29736 using YW register constraint.
29737 (*vec_extractv16qi_zext): Ditto.
29738 (*vec_extractv4si): Merge alternatives 4 and 5
29739 using Yw register constraint.
29740 (*ssse3_palignr<mode>_perm): Use Yw instead of v for alternative 3.
29742 2021-03-13 Martin Sebor <msebor@redhat.com>
29744 PR tree-optimization/99489
29745 * builtins.c (gimple_call_alloc_size): Fail gracefully when argument
29746 is not a call statement.
29748 2021-03-13 Jakub Jelinek <jakub@redhat.com>
29750 PR tree-optimization/99544
29751 * match.pd (X + (X << C) -> X * (1 + (1 << C))): Don't simplify
29752 if for vector types multiplication can't be done in type's mode.
29754 2021-03-12 Eric Botcazou <ebotcazou@adacore.com>
29757 * config/sparc/constraints.md (w): Rename to...
29758 (W): ... this and ditch previous implementation.
29759 * config/sparc/sparc.md (*movdi_insn_sp64): Replace W with m.
29760 (*movdf_insn_sp64): Likewise.
29761 (*mov<VM64:mode>_insn_sp64): Likewise.
29762 * config/sparc/sync.md (*atomic_compare_and_swap<mode>_1): Replace
29764 (atomic_compare_and_swap_leon3_1): Likewise.
29765 (*atomic_compare_and_swapdi_v8plus): Likewise.
29766 * config/sparc/sparc.c (memory_ok_for_ldd): Remove useless test on
29767 architecture and add missing address validity check during LRA.
29769 2021-03-12 Tobias Burnus <tobias@codesourcery.com>
29772 * gimplify.c (omp_add_variable): Handle NULL_TREE as size
29773 occurring for assumed-size arrays in use_device_{ptr,addr}.
29775 2021-03-12 Jakub Jelinek <jakub@redhat.com>
29778 * config/i386/constraints.md (YW): New internal constraint.
29779 * config/i386/sse.md (v_Yw): Add V4TI, V2TI, V1TI and TI cases.
29780 (*<sse2_avx2>_<insn><mode>3<mask_name>,
29781 *<sse2_avx2>_uavg<mode>3<mask_name>, *abs<mode>2,
29782 *<s>mul<mode>3_highpart<mask_name>): Use <v_Yw> instead of v in
29784 (<sse2_avx2>_psadbw): Use YW instead of v in constraints.
29785 (*avx2_pmaddwd, *sse2_pmaddwd, *<code>v8hi3, *<code>v16qi3,
29786 avx2_pmaddubsw256, ssse3_pmaddubsw128): Merge last two alternatives
29787 into one, use Yw instead of former x,v.
29788 (ashr<mode>3, <insn><mode>3): Use <v_Yw> instead of x in constraints of
29789 the last alternative.
29790 (<sse2_avx2>_packsswb<mask_name>, <sse2_avx2>_packssdw<mask_name>,
29791 <sse2_avx2>_packuswb<mask_name>, <sse4_1_avx2>_packusdw<mask_name>,
29792 *<ssse3_avx2>_pmulhrsw<mode>3<mask_name>, <ssse3_avx2>_palignr<mode>,
29793 <ssse3_avx2>_pshufb<mode>3<mask_name>): Merge last two alternatives
29794 into one, use <v_Yw> instead of former x,v.
29795 (avx2_interleave_highv32qi<mask_name>,
29796 vec_interleave_highv16qi<mask_name>): Use Yw instead of v in
29797 constraints. Add && <mask_avx512bw_condition> to condition.
29798 (avx2_interleave_lowv32qi<mask_name>,
29799 vec_interleave_lowv16qi<mask_name>,
29800 avx2_interleave_highv16hi<mask_name>,
29801 vec_interleave_highv8hi<mask_name>,
29802 avx2_interleave_lowv16hi<mask_name>, vec_interleave_lowv8hi<mask_name>,
29803 avx2_pshuflw_1<mask_name>, sse2_pshuflw_1<mask_name>,
29804 avx2_pshufhw_1<mask_name>, sse2_pshufhw_1<mask_name>,
29805 avx2_<code>v16qiv16hi2<mask_name>, sse4_1_<code>v8qiv8hi2<mask_name>,
29806 *sse4_1_<code>v8qiv8hi2<mask_name>_1, <sse2_avx2>_<insn><mode>3): Use
29807 Yw instead of v in constraints.
29808 * config/i386/mmx.md (Yv_Yw): New define_mode_attr.
29809 (*mmx_<insn><mode>3, mmx_ashr<mode>3, mmx_<insn><mode>3): Use <Yv_Yw>
29810 instead of Yv in constraints.
29811 (*mmx_<insn><mode>3, *mmx_mulv4hi3, *mmx_smulv4hi3_highpart,
29812 *mmx_umulv4hi3_highpart, *mmx_pmaddwd, *mmx_<code>v4hi3,
29813 *mmx_<code>v8qi3, mmx_pack<s_trunsuffix>swb, mmx_packssdw,
29814 mmx_punpckhbw, mmx_punpcklbw, mmx_punpckhwd, mmx_punpcklwd,
29815 *mmx_uavgv8qi3, *mmx_uavgv4hi3, mmx_psadbw): Use Yw instead of Yv in
29817 (*mmx_pinsrw, *mmx_pinsrb, *mmx_pextrw, *mmx_pextrw_zext, *mmx_pextrb,
29818 *mmx_pextrb_zext): Use YW instead of Yv in constraints.
29819 (*mmx_eq<mode>3, mmx_gt<mode>3): Use x instead of Yv in constraints.
29820 (mmx_andnot<mode>3, *mmx_<code><mode>3): Split last alternative into
29821 two, one with just x, another isa avx512vl with v.
29823 2021-03-12 Martin Liska <mliska@suse.cz>
29825 * doc/invoke.texi: Add missing param documentation.
29827 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29830 * Makefile.in (ANALYZER_OBJS): Add analyzer/feasible-graph.o and
29831 analyzer/trimmed-graph.o.
29832 * doc/analyzer.texi (Analyzer Paths): Rewrite description of
29833 feasibility checking to reflect new implementation.
29834 * doc/invoke.texi (-fdump-analyzer-feasibility): Document new
29836 * shortest-paths.h (shortest_paths::get_shortest_distance): New.
29838 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29840 * digraph.cc (selftest::test_shortest_paths): Update
29841 shortest_paths init for new param. Add test of
29842 SPS_TO_GIVEN_TARGET.
29843 * shortest-paths.h (enum shortest_path_sense): New.
29844 (shortest_paths::shortest_paths): Add "sense" param.
29845 Update for renamings. Generalize to use "sense" param.
29846 (shortest_paths::get_shortest_path): Rename param.
29847 (shortest_paths::m_sense): New field.
29848 (shortest_paths::m_prev): Rename...
29849 (shortest_paths::m_best_edge): ...to this.
29850 (shortest_paths::get_shortest_path): Update for renamings.
29851 Conditionalize flipping of path on sense of traversal.
29853 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29855 * digraph.cc (selftest::test_shortest_paths): Add test coverage
29856 for paths from B and C.
29857 * shortest-paths.h (shortest_paths::shortest_paths): Handle
29858 unreachable nodes, rather than asserting.
29860 2021-03-11 David Edelsohn <dje.gcc@gmail.com>
29863 * config/rs6000/rs6000.c (rs6000_xcoff_file_start): Don't create
29864 xcoff_tbss_section_name.
29865 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .lcomm.
29866 * xcoffout.c (xcoff_tbss_section_name): Delete.
29867 * xcoffout.h (xcoff_tbss_section_name): Delete.
29869 2021-03-11 Richard Biener <rguenther@suse.de>
29871 PR tree-optimization/99523
29872 * tree-cfg.c (dump_function_to_file): Dump SSA names
29873 w/o identifier to the decls section as well, not only those
29874 without a VAR_DECL.
29876 2021-03-11 Jakub Jelinek <jakub@redhat.com>
29879 * ipa-icf-gimple.c (func_checker::compare_gimple_call): For internal
29880 function calls with lhs fail if the lhs don't have compatible types.
29882 2021-03-11 Hans-Peter Nilsson <hp@axis.com>
29884 * config/cris/cris.h (HARD_FRAME_POINTER_REGNUM): Define.
29885 Change FRAME_POINTER_REGNUM to correspond to a new faked
29886 register faked_fp, part of GENNONACR_REGS like faked_ap.
29887 (CRIS_FAKED_REGS_CONTENTS): New helper macro.
29888 (FIRST_PSEUDO_REGISTER, FIXED_REGISTERS, CALL_USED_REGISTERS)
29889 (REG_ALLOC_ORDER, REG_CLASS_CONTENTS, REGNO_OK_FOR_BASE_P)
29890 (ELIMINABLE_REGS, REGISTER_NAMES): Adjust accordingly.
29891 * config/cris/cris.md (CRIS_FP_REGNUM): Renumber to new faked
29893 (CRIS_REAL_FP_REGNUM): New constant.
29894 * config/cris/cris.c (cris_reg_saved_in_regsave_area): Check
29895 for HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM.
29896 (cris_initial_elimination_offset): Handle elimination changes
29897 to HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM
29898 and add one from FRAME_POINTER_REGNUM to
29899 HARD_FRAME_POINTER_REGNUM.
29900 (cris_expand_prologue, cris_expand_epilogue): Emit code for
29901 hard_frame_pointer_rtx instead of frame_pointer_rtx.
29903 2021-03-10 David Edelsohn <dje.gcc@gmail.com>
29906 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Add check for DCmode.
29907 * config/rs6000/rs6000.c (rs6000_special_round_type_align): Same.
29909 2021-03-10 Vladimir N. Makarov <vmakarov@redhat.com>
29912 * lra-constraints.c (process_address_1): Don't check unknown
29913 constraint, use X for empty constraint.
29915 2021-03-10 Alex Coplan <alex.coplan@arm.com>
29917 * config/aarch64/aarch64.c (aarch64_vfp_is_call_or_return_candidate):
29918 Fix typo in comment describing "is_ha" argument.
29920 2021-03-10 John David Anglin <danglin@gcc.gnu.org>
29922 * doc/sourcebuild.texi: Document LRA target selector.
29924 2021-03-10 David Malcolm <dmalcolm@redhat.com>
29926 * doc/ux.texi: Add subsection contrasting interactive versus
29927 batch usage of GCC.
29929 2021-03-10 Joel Hutton <joel.hutton@arm.com>
29932 * tree-vect-stmts.c (vectorizable_store): Fix scatter store mask
29934 (vectorizable_load): Fix gather load mask check condition.
29936 2021-03-10 Richard Biener <rguenther@suse.de>
29938 PR tree-optimization/99510
29939 * tree.c (check_aligned_type): Check that the candidate
29940 has TYPE_USER_ALIGN set instead of matching with the
29943 2021-03-10 Eric Botcazou <ebotcazou@adacore.com>
29945 * config/sparc/sparc.c (sparc_regmode_natural_size): Return 4 for
29946 float and vector integer modes only if the mode is not larger.
29948 2021-03-10 Hans-Peter Nilsson <hp@axis.com>
29950 * config/cris/cris.h (DWARF_FRAME_REGISTERS): Define.
29952 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29954 * ira.c (ira_setup_alts, ira_get_dup_out_num): Process digital
29956 * ira-lives.c (single_reg_class): Ditto.
29958 2021-03-09 Sebastian Huber <sebastian.huber@embedded-brains.de>
29960 * config.gcc (aarch64-*-rtems*): Include general rtems.h after
29961 the architecture-specific rtems.h.
29962 (aarch64-*-rtems*): Likewise.
29963 (arm*-*-rtems*): Likewise.
29964 (epiphany-*-rtems*): Likewise.
29965 (riscv*-*-rtems*): Likewise.
29967 2021-03-09 Jakub Jelinek <jakub@redhat.com>
29969 PR tree-optimization/99305
29970 * tree-ssa-phiopt.c (conditional_replacement): Test integer_pow2p
29971 before integer_all_onesp instead of vice versa.
29973 2021-03-09 Richard Earnshaw <rearnsha@arm.com>
29975 * common/config/arm/arm-common.c (arm_config_default): Change type
29976 of 'i' to unsigned.
29978 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29981 * lra-constraints.c (process_address_1): Process constraint 'g'
29982 separately and digital constraints containing more than one digit.
29984 2021-03-09 Nick Clifton <nickc@redhat.com>
29986 * config/rx/rx.h (DBX_DEBUGGING_INFO): Define.
29987 (DWARF2_DEBUGGING_INFO): Define.
29989 2021-03-09 Eric Botcazou <ebotcazou@adacore.com>
29992 * calls.c (initialize_argument_information): When the argument
29993 is passed by reference, do not make a copy in a thunk only if
29994 the argument is already in memory. Remove redundant test for
29995 the case of callee copy.
29997 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
30000 * lra-constraints.c (process_address_1): Process 0..9 constraints
30001 in process_address_1.
30003 2021-03-09 Andreas Krebbel <krebbel@linux.ibm.com>
30005 * config/s390/s390.c (struct s390_processor processor_table):
30006 Binutils name string must not be empty.
30008 2021-03-09 Claudiu Zissulescu <claziss@synopsys.com>
30010 * config/arc/arc.c (arc_attr_type): Remove function.
30012 2021-03-09 Martin Liska <mliska@suse.cz>
30015 * config/i386/i386-options.c (ix86_option_override_internal):
30016 Set isa_flags for OPTS argument and not for the global
30019 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
30021 * config/rs6000/predicates.md (ds_form_mem_operand): Check
30024 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
30027 * config/rs6000/predicates.md (ds_form_mem_operand): New
30029 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10): Use
30030 ds_form_mem_operand in ld/lwa patterns.
30031 * config/rs6000/fusion.md: Regenerate file.
30033 2021-03-08 Martin Sebor <msebor@redhat.com>
30035 PR middle-end/98266
30036 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): New function.
30037 (array_bounds_checker::check_array_bounds): Call it.
30039 2021-03-08 Martin Sebor <msebor@redhat.com>
30041 PR middle-end/97631
30042 * tree-ssa-strlen.c (maybe_warn_overflow): Test rawmem.
30043 (handle_builtin_stxncpy_strncat): Rename locals. Determine
30044 destination size from allocation calls. Issue a more appropriate
30046 (handle_builtin_memcpy): Pass true as rawmem to maybe_warn_overflow.
30047 (handle_builtin_memset): Same.
30049 2021-03-08 Peter Bergner <bergner@linux.ibm.com>
30052 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_permute): Add an assert
30053 to ensure we do not have an Altivec style address.
30054 * config/rs6000/vsx.md (*vsx_le_perm_load_<mode>): Disable if passed
30055 an Altivec style address.
30056 (*vsx_le_perm_store_<mode>): Likewise.
30057 (splitters after *vsx_le_perm_store_<mode>): Likewise.
30058 (vsx_load_<mode>): Disable special expander if passed an Altivec
30060 (vsx_store_<mode>): Likewise.
30062 2021-03-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30065 * config/aarch64/predicates.md (aarch64_simd_shift_imm_vec_qi): Define.
30066 (aarch64_simd_shift_imm_vec_hi): Likewise.
30067 (aarch64_simd_shift_imm_vec_si): Likewise.
30068 (aarch64_simd_shift_imm_vec_di): Likewise.
30069 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le): Use
30070 predicate from above.
30071 (aarch64_shrn<mode>_insn_be): Likewise.
30072 (aarch64_rshrn<mode>_insn_le): Likewise.
30073 (aarch64_rshrn<mode>_insn_be): Likewise.
30074 (aarch64_shrn2<mode>_insn_le): Likewise.
30075 (aarch64_shrn2<mode>_insn_be): Likewise.
30076 (aarch64_rshrn2<mode>_insn_le): Likewise.
30077 (aarch64_rshrn2<mode>_insn_be): Likewise.
30079 2021-03-08 Vladimir N. Makarov <vmakarov@redhat.com>
30082 * lra-constraints.c (skip_contraint_modifiers): New function.
30083 (process_address_1): Use it before lookup_constraint call.
30085 2021-03-08 Martin Liska <mliska@suse.cz>
30088 * config/i386/i386-options.c (ix86_option_override_internal):
30089 Enable UINTR and HRESET for -march that supports it.
30091 2021-03-08 Ilya Leoshkevich <iii@linux.ibm.com>
30093 * config/s390/s390.c (f_constraint_p): New function.
30094 (s390_md_asm_adjust): Implement TARGET_MD_ASM_ADJUST.
30095 (TARGET_MD_ASM_ADJUST): Likewise.
30097 2021-03-08 Tobias Burnus <tobias@codesourcery.com>
30100 * tree-nested.c (convert_local_reference_stmt): Avoid calling
30101 lookup_field_for_decl for Fortran module (= namespace context).
30103 2021-03-08 Andreas Krebbel <krebbel@linux.ibm.com>
30105 * config/s390/s390.c (s390_expand_vec_compare): Implement <0
30106 comparison with arithmetic right shift.
30107 (s390_expand_vcond): No need for a force_reg anymore.
30108 s390_vec_compare will do it.
30109 * config/s390/vector.md ("vec_cmp<mode><tointvec>"): Accept also
30110 immediate operands.
30112 2021-03-07 Jakub Jelinek <jakub@redhat.com>
30115 * config/i386/constraints.md (Yw): Use SSE_REGS if TARGET_SSE
30116 but TARGET_AVX512BW or TARGET_AVX512VL is not set. Adjust description
30118 * config/i386/sse.md (v_Yw): New define_mode_attr.
30119 (*<insn><mode>3, *mul<mode>3<mask_name>, *avx2_<code><mode>3,
30120 *sse4_1_<code><mode>3<mask_name>): Use <v_Yw> instead of v
30122 * config/i386/mmx.md (mmx_pshufw_1, *vec_dupv4hi): Use Yw instead of
30123 xYw in constraints.
30125 2021-03-06 Julian Brown <julian@codesourcery.com>
30127 * tree-pretty-print.c (dump_generic_node): Emit non-generic
30128 address space info for aggregates.
30130 2021-03-06 Hans-Peter Nilsson <hp@axis.com>
30132 * config/cris/cris.h (MAX_FIXED_MODE_SIZE): Don't define.
30134 2021-03-05 Jakub Jelinek <jakub@redhat.com>
30136 PR middle-end/99322
30137 * tree-cfg.c (bb_to_omp_idx): New variable.
30138 (execute_build_cfg): Release the bb_to_omp_idx vector after
30139 cleanup_tree_cfg returns.
30140 (handle_abnormal_edges): Remove bb_to_omp_idx argument, adjust
30141 for bb_to_omp_idx being a vec<int> instead of pointer to array
30143 (make_edges): Remove bb_to_omp_idx local variable, don't pass
30144 it to handle_abnormal_edges, adjust for bb_to_omp_idx being a
30145 vec<int> instead of pointer to array of ints and don't free/release
30147 (remove_bb): When removing a bb and placing forced label somewhere
30148 else, ensure it is put into the same OpenMP region during cfg
30149 pass if possible or to entry successor as fallback. Unregister
30150 bb from bb_to_omp_idx.
30152 2021-03-05 Vladimir N. Makarov <vmakarov@redhat.com>
30155 * lra-constraints.c (process_address_1): Skip decomposing address
30156 for asm insn operand with unknown constraint.
30158 2021-03-05 Martin Jambor <mjambor@suse.cz>
30161 * cgraph.c (cgraph_edge::set_call_stmt): Do not update all
30162 corresponding speculative edges if we are about to resolve
30163 speculation. Make edge direct (and so resolve speculations) before
30164 removing it from call_site_hash.
30165 (cgraph_edge::make_direct): Relax the initial assert to allow calling
30166 the function on speculative direct edges.
30168 2021-03-05 Eric Botcazou <ebotcazou@adacore.com>
30170 PR rtl-optimization/99376
30171 * rtlanal.c (nonzero_bits1) <arithmetic operators>: If the number
30172 of low-order zero bits is too large, set the result to 0 directly.
30174 2021-03-04 Jakub Jelinek <jakub@redhat.com>
30176 PR middle-end/93235
30177 * expmed.c (store_bit_field_using_insv): Return false if xop0 is a
30178 SUBREG and a SUBREG to op_mode can't be created.
30180 2021-03-04 Alex Coplan <alex.coplan@arm.com>
30183 * config/aarch64/aarch64-sve-builtins.cc
30184 (function_resolver::require_vector_type): Handle error_mark_node.
30186 2021-03-04 Ilya Leoshkevich <iii@linux.ibm.com>
30188 * cfgexpand.c (expand_asm_loc): Pass new parameter.
30189 (expand_asm_stmt): Likewise.
30190 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Add new
30192 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
30193 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
30194 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
30195 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
30196 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
30197 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
30198 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
30199 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
30200 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
30201 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
30202 * doc/tm.texi (md_asm_adjust): Likewise.
30203 * target.def (md_asm_adjust): Likewise.
30205 2021-03-04 Richard Biener <rguenther@suse.de>
30207 PR middle-end/97855
30208 * tree-pretty-print.c: Poison pp_printf.
30209 (dump_decl_name): Avoid use of pp_printf.
30210 (dump_block_node): Likewise.
30211 (dump_generic_node): Likewise.
30213 2021-03-04 Martin Sebor <msebor@redhat.com>
30215 PR middle-end/96963
30216 PR middle-end/94655
30217 * builtins.c (handle_array_ref): New helper.
30218 (handle_mem_ref): New helper.
30219 (compute_objsize_r): Factor out ARRAY_REF and MEM_REF handling
30220 into new helper functions. Correct a workaround for vectorized
30223 2021-03-03 Pat Haugen <pthaugen@linux.ibm.com>
30225 * config/rs6000/dfp.md (extendddtd2, trunctddd2, *cmp<mode>_internal1,
30226 floatditd2, ftrunc<mode>2, fix<mode>di2, dfp_ddedpd_<mode>,
30227 dfp_denbcd_<mode>, dfp_dxex_<mode>, dfp_diex_<mode>,
30228 *dfp_sgnfcnc_<mode>, dfp_dscli_<mode>, dfp_dscri_<mode>): Update size
30229 attribute for Power10.
30230 * config/rs6000/mma.md (*movoo): Likewise.
30231 * config/rs6000/rs6000.md (define_attr "size"): Add 256.
30232 (define_mode_attr bits): Add DD/TD modes.
30233 * config/rs6000/sync.md (load_quadpti, store_quadpti, load_lockedpti,
30234 store_conditionalpti): Update size attribute for Power10.
30236 2021-03-03 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
30239 * config/sparc/t-sparc (tree-ssanames.o-warn): Don't error for
30240 -Wuninitialized, -Wmaybe-uninitialized.
30241 (wide-int.o-warn): Likewise.
30243 2021-03-03 Richard Earnshaw <rearnsha@arm.com>
30245 * common/config/arm/arm-common.c: Include configargs.h.
30246 (arm_config_default): New function.
30247 (arm_target_mode): Renamed from arm_target_thumb_only. Handle
30248 processors that do not support Thumb. Take into account the
30249 --with-mode configuration setting for selecting the default.
30250 * config/arm/arm.h (OPTION_DEFAULT_SPECS): Remove entry for 'mode'.
30251 (TARGET_MODE_SPEC_FUNCTIONS): Update for function name change.
30253 2021-03-03 Martin Liska <mliska@suse.cz>
30255 PR gcov-profile/97461
30256 * gcov-io.h (GCOV_PREALLOCATED_KVP): Remove.
30258 2021-03-03 Eric Botcazou <ebotcazou@adacore.com>
30261 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
30262 point back the hard frame pointer to its default location when the
30263 frame is larger than SEH_MAX_FRAME_SIZE.
30265 2021-03-03 Jakub Jelinek <jakub@redhat.com>
30268 * config/i386/predicates.md (logic_operator): New define_predicate.
30269 * config/i386/i386.md (mov + mem using comm arith peephole2):
30270 Punt if operands[1] is EXT_REX_SSE_REGNO_P, AVX512BW is not enabled
30271 and the inner mode is [QH]Imode.
30273 2021-03-03 Jakub Jelinek <jakub@redhat.com>
30276 * dwarf2out.c (dw_loc_list_struct): Add end_entry member.
30277 (new_loc_list): Clear end_entry.
30278 (output_loc_list): Only use DW_LLE_startx_length for -gsplit-dwarf
30279 if HAVE_AS_LEB128, otherwise use DW_LLE_startx_endx. Fix comment
30281 (index_location_lists): For dwarf_version >= 5 without HAVE_AS_LEB128,
30282 initialize also end_entry.
30284 2021-03-03 Jakub Jelinek <jakub@redhat.com>
30287 * cfgrtl.c (fixup_partitions): When changing some bbs from hot to cold
30288 partitions, if in non-layout mode after reorder_blocks also move
30289 affected blocks to ensure a single partition transition.
30291 2021-03-03 Jason Merrill <jason@redhat.com>
30294 * cgraphunit.c (process_function_and_variable_attributes): Don't
30295 warn about flatten on an alias if the target also has it.
30296 * cgraph.h (symtab_node::get_alias_target_tree): New.
30298 2021-03-02 David Edelsohn <dje.gcc@gmail.com>
30300 * config/rs6000/rs6000.md (tls_get_tpointer_internal): Prepend
30301 period to symbol name.
30302 (tls_get_addr_internal<mode>): Same.
30304 2021-03-02 David Malcolm <dmalcolm@redhat.com>
30307 * diagnostic-show-locus.c
30308 (selftest::test_one_liner_many_fixits_2): Fix accidental usage of
30311 2021-03-02 Martin Sebor <msebor@redhat.com>
30313 PR middle-end/99276
30314 * builtins.c (warn_for_access): Remove stray warning text.
30316 2021-03-02 Martin Sebor <msebor@redhat.com>
30318 PR middle-end/99295
30319 * doc/extend.texi (attribute malloc): Reword and clarify nonaliasing
30322 2021-03-02 Jakub Jelinek <jakub@redhat.com>
30325 * dwarf2out.c (output_macinfo_op): Use DW_MACRO_*_str* even with
30326 -gdwarf-5 -gstrict-dwarf. For -gsplit-dwarf -gdwarf-5 use
30327 DW_MACRO_*_strx instead of DW_MACRO_*_strp. Handle
30328 DW_MACRO_define_strx and DW_MACRO_undef_strx.
30329 (save_macinfo_strings): Use DW_MACRO_*_str* even with
30330 -gdwarf-5 -gstrict-dwarf. Handle DW_MACRO_define_strx and
30331 DW_MACRO_undef_strx.
30333 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
30335 * config/s390/s390-builtin-types.def (BT_FN_V4SF_V8HI_UINT): New
30337 (BT_FN_V8HI_V8HI_UINT): Likewise.
30338 (BT_FN_V8HI_V4SF_V4SF_UINT): Likewise.
30339 * config/s390/s390-builtins.def (B_NNPA): New macro definition.
30340 (s390_vclfnhs, s390_vclfnls, s390_vcrnfs, s390_vcfn, s390_vcnf):
30341 New builtin definitions.
30342 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Bump
30343 vector extension version.
30344 * config/s390/s390.c (s390_expand_builtin): Check if builtins are
30345 available with current -march level.
30346 * config/s390/s390.md (UNSPEC_NNPA_VCLFNHS_V8HI)
30347 (UNSPEC_NNPA_VCLFNLS_V8HI, UNSPEC_NNPA_VCRNFS_V8HI)
30348 (UNSPEC_NNPA_VCFN_V8HI, UNSPEC_NNPA_VCNF_V8HI): New constants.
30349 * config/s390/vecintrin.h (vec_extend_to_fp32_hi): New macro.
30350 (vec_extend_to_fp32_lo): Likewise.
30351 (vec_round_from_fp32): Likewise.
30352 (vec_convert_to_fp16): Likewise.
30353 (vec_convert_from_fp16): Likewise.
30354 * config/s390/vx-builtins.md (vclfnhs_v8hi): New insn pattern.
30355 (vclfnls_v8hi): Likewise.
30356 (vcrnfs_v8hi): Likewise.
30357 (vcfn_v8hi): Likewise.
30358 (vcnf_v8hi): Likewise.
30360 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
30362 * common/config/s390/s390-common.c (processor_flags_table): New entry.
30363 * config.gcc: Enable arch14 for --with-arch and --with-tune.
30364 * config/s390/driver-native.c (s390_host_detect_local_cpu): Pick
30365 arch14 for unknown CPU models.
30366 * config/s390/s390-opts.h (enum processor_type): Add PROCESSOR_ARCH14.
30367 * config/s390/s390.c (s390_issue_rate): Add case for PROCESSOR_ARCH14.
30368 (s390_get_sched_attrmask): Likewise.
30369 (s390_get_unit_mask): Likewise.
30370 * config/s390/s390.h (enum processor_flags): Add PF_NNPA and PF_ARCH14.
30371 (TARGET_CPU_ARCH14, TARGET_CPU_ARCH14_P, TARGET_CPU_NNPA)
30372 (TARGET_CPU_NNPA_P, TARGET_ARCH14, TARGET_ARCH14_P, TARGET_NNPA)
30373 (TARGET_NNPA_P): New macro definitions.
30374 * config/s390/s390.md ("cpu_facility", "enabled"): Add arch14 and nnpa.
30375 * config/s390/s390.opt: Add PROCESSOR_ARCH14.
30377 2021-03-02 Jakub Jelinek <jakub@redhat.com>
30379 PR middle-end/95757
30380 * tree-vrp.c (register_edge_assert_for): Remove superfluous ()s around
30381 condition. Call register_edge_assert_for_1 for == 0, != 0, == 1 and
30382 != 1 comparisons if name is lhs of a comparison.
30384 2021-03-01 Iain Sandoe <iain@sandoe.co.uk>
30388 * config/darwin-protos.h (darwin_should_restore_cfa_state): New.
30389 * config/darwin.c (darwin_should_restore_cfa_state): New.
30390 * config/darwin.h (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New.
30391 * doc/tm.texi: Regenerated.
30392 * doc/tm.texi.in: Document TARGET_ASM_SHOULD_RESTORE_CFA_STATE.
30393 * dwarf2cfi.c (connect_traces): If the target requests, restore
30394 the CFA expression after a DW_CFA_restore.
30395 * target.def (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New hook.
30397 2021-03-01 Martin Liska <mliska@suse.cz>
30400 * optc-save-gen.awk: Add 4 more exceptions.
30402 2021-03-01 Nathan Sidwell <nathan@acm.org>
30405 * tree.h (TYPE_ALIGN_RAW): New accessor.
30406 (TYPE_ALIGN): Use it.
30408 2021-03-01 Jan Hubicka <jh@suse.cz>
30411 * ipa-fnsummary.c (compute_fn_summary): Fix sanity check.
30413 2021-03-01 Eric Botcazou <ebotcazou@adacore.com>
30416 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
30417 point the hard frame pointer to the SSE register save area instead
30418 of the general register save area. Perform only minimal adjustment
30419 for small frames if it is initially not correctly aligned.
30420 (ix86_expand_prologue): Remove early saves for a SEH target.
30421 * config/i386/winnt.c (struct seh_frame_state): Document constraint.
30423 2021-02-28 Jakub Jelinek <jakub@redhat.com>
30426 * ipa.c (symbol_table::remove_unreachable_nodes): Fix a comment
30427 typo - referneced -> referenced.
30428 * tree.c (component_ref_size): Fix comment typo -
30429 refernce -> reference.
30430 * tree-ssa-alias.c (access_path_may_continue_p): Fix comment typo -
30431 traling -> trailing.
30432 (aliasing_component_refs_p): Fix comment typos -
30433 refernce -> reference and refernece -> reference and
30434 traling -> trailing.
30435 (nonoverlapping_refs_since_match_p): Fix comment typo -
30436 referneces -> references.
30437 * doc/invoke.texi (--param modref-max-bases): Fix a typo -
30438 referneces -> references.
30440 2021-02-27 Iain Sandoe <iain@sandoe.co.uk>
30442 * config/host-darwin.c (darwin_gt_pch_use_address): Modify
30443 diagnostic message to avoid use of a contraction and format
30446 2021-02-27 Jakub Jelinek <jakub@redhat.com>
30449 * gcse.c (gcse_or_cprop_is_too_expensive): Use %wu instead of
30450 HOST_WIDE_INT_PRINT_UNSIGNED in warning format string.
30451 * ipa-devirt.c (ipa_odr_read_section): Use %wd instead of
30452 HOST_WIDE_INT_PRINT_DEC in inform format string. Fix comment
30455 2021-02-26 Richard Biener <rguenther@suse.de>
30457 PR middle-end/99281
30458 * expr.c (store_field): For calls with return-slot optimization
30459 and addressable return type expand the store directly.
30461 2021-02-26 Richard Biener <rguenther@suse.de>
30464 * builtins.c (warn_string_no_nul): Fix diagnostic formatting.
30466 2021-02-26 Peter Bergner <bergner@linux.ibm.com>
30469 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Replace assert
30472 2021-02-26 Aaron Sawdey <acsawdey@linux.ibm.com>
30474 * config.gcc: Add rs6000-pcrel-opt.o.
30475 * config/rs6000/rs6000-pcrel-opt.c: New file.
30476 * config/rs6000/pcrel-opt.md: New file.
30477 * config/rs6000/predicates.md: Add d_form_memory predicate.
30478 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_PCREL_OPT.
30479 * config/rs6000/rs6000-passes.def: Add pass_pcrel_opt.
30480 * config/rs6000/rs6000-protos.h: Add reg_to_non_prefixed(),
30481 pcrel_opt_valid_mem_p(), output_pcrel_opt_reloc(),
30482 and make_pass_pcrel_opt().
30483 * config/rs6000/rs6000.c (reg_to_non_prefixed): Make global.
30484 (rs6000_option_override_internal): Add pcrel-opt.
30485 (rs6000_delegitimize_address): Support pcrel-opt.
30486 (rs6000_opt_masks): Add pcrel-opt.
30487 (pcrel_opt_valid_mem_p): New function.
30488 (reg_to_non_prefixed): Make global.
30489 (rs6000_asm_output_opcode): Reset prepend_p_to_next_insn.
30490 (output_pcrel_opt_reloc): New function.
30491 * config/rs6000/rs6000.md (loads_extern_addr): New attr.
30492 (pcrel_extern_addr): Set loads_extern_addr.
30493 Add include for pcrel-opt.md.
30494 * config/rs6000/rs6000.opt: Add -mpcrel-opt.
30495 * config/rs6000/t-rs6000: Add rules for pcrel-opt.c and
30498 2021-02-26 YunQiang Su <yunqiang.su@cipunited.com>
30501 * config/mips/mips.c (mips_expand_ext_as_unaligned_load):
30502 If TARGET_64BIT and dest is SUBREG, check the width; if it is
30503 equal to SImode, use an SImode operation, just like what we are
30506 2021-02-26 Marek Polacek <polacek@redhat.com>
30508 * builtins.c (warn_for_access): Fix typos.
30510 2021-02-25 Iain Sandoe <iain@sandoe.co.uk>
30512 * config/aarch64/aarch64.md (<optab>_rol<mode>3): Add a '#'
30513 mark in front of the immediate quantity.
30514 (<optab>_rolsi3_uxtw): Likewise.
30516 2021-02-25 Richard Earnshaw <rearnsha@arm.com>
30519 * config/arm/thumb2.md (nonsecure_call_reg_thumb2_fpcxt): New pattern.
30520 (nonsecure_call_value_reg_thumb2_fpcxt): Likewise.
30521 (nonsecure_call_reg_thumb2): Restrict to using r4 for the callee
30522 address and disable when the FPCXT is not available.
30523 (nonsecure_call_value_reg_thumb2): Likewise.
30525 2021-02-25 Nathan Sidwell <nathan@acm.org>
30528 * doc/invoke.texi (flang-info-module-cmi): Renamed option.
30530 2021-02-25 Tamar Christina <tamar.christina@arm.com>
30532 * tree-vect-slp.c (optimize_load_redistribution_1): Abort on NULL nodes.
30534 2021-02-25 Richard Biener <rguenther@suse.de>
30536 PR tree-optimization/99253
30537 * tree-vect-loop.c (check_reduction_path): First compute
30538 code, then verify out-of-loop uses.
30540 2021-02-25 Jakub Jelinek <jakub@redhat.com>
30543 * match.pd ((T)(A) + CST -> (T)(A + CST)): Add :s to convert.
30545 2021-02-25 Jakub Jelinek <jakub@redhat.com>
30547 PR tree-optimization/80635
30548 * tree-vrp.c (vrp_simplify_cond_using_ranges): Also handle
30549 VIEW_CONVERT_EXPR if modes are the same, innerop is integral and
30550 has mode precision.
30552 2021-02-25 Richard Biener <rguenther@suse.de>
30554 * tree-vect-slp.c (optimize_load_redistribution_1): Delay
30555 load_map population.
30556 (vect_match_slp_patterns_2): Revert part of last change.
30557 (vect_analyze_slp): Do not interleave optimize_load_redistribution
30558 with pattern detection but do it afterwards. Dump the
30559 whole SLP graph after pattern recognition and load
30560 redistribution optimization finished.
30562 2021-02-24 Jakub Jelinek <jakub@redhat.com>
30565 * omp-low.c (struct omp_context): Add teams_nested_p and
30566 nonteams_nested_p members.
30567 (scan_omp_target): Diagnose teams nested inside of target with other
30568 directives strictly nested inside of the same target.
30569 (check_omp_nesting_restrictions): Set ctx->teams_nested_p or
30570 ctx->nonteams_nested_p as needed.
30572 2021-02-24 Vladimir N. Makarov <vmakarov@redhat.com>
30574 PR inline-asm/99123
30575 * lra-constraints.c (uses_hard_regs_p): Don't use decompose_mem_address.
30577 2021-02-24 Hans-Peter Nilsson <hp@axis.com>
30579 * config/cris/cris.c (cris_expand_prologue): Set
30580 current_function_static_stack_size, if flag_stack_usage_info.
30582 2021-02-24 Pat Haugen <pthaugen@linux.ibm.com>
30584 * config/rs6000/rs6000.c (next_insn_prefixed_p): Rename.
30585 (rs6000_final_prescan_insn): Adjust.
30586 (rs6000_asm_output_opcode): Likewise.
30588 2021-02-24 Martin Sebor <msebor@redhat.com>
30590 PR middle-end/97172
30591 * attribs.c (attr_access::free_lang_data): Clear attribute arg spec
30592 from function arguments.
30594 2021-02-24 Tamar Christina <tamar.christina@arm.com>
30596 PR tree-optimization/99220
30597 * tree-vect-slp.c (optimize_load_redistribution_1): Remove
30598 node from cache when it's about to be deleted.
30600 2021-02-24 Jakub Jelinek <jakub@redhat.com>
30602 PR tree-optimization/99225
30603 * fold-const.c (fold_binary_loc) <case NE_EXPR>: In (x & (1 << y)) != 0
30604 to ((x >> y) & 1) != 0 simplifications use build_one_cst instead of
30605 build_int_cst (..., 1). Formatting fixes.
30607 2021-02-24 Tamar Christina <tamar.christina@arm.com>
30609 PR tree-optimization/99149
30610 * tree-vect-slp-patterns.c (vect_detect_pair_op): Don't recreate the
30612 (vect_slp_reset_pattern): Remove.
30613 (complex_fma_pattern::matches): Remove call to vect_slp_reset_pattern.
30614 (complex_mul_pattern::build, complex_fma_pattern::build,
30615 complex_fms_pattern::build): Fix ref counts.
30616 * tree-vect-slp.c (vect_free_slp_tree): Undo SLP only pattern relevancy
30617 when node is being deleted.
30618 (vect_match_slp_patterns_2): Correct result of cache hit on patterns.
30619 (vect_schedule_slp): Invalidate SLP_TREE_REPRESENTATIVE of removed
30621 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize value.
30623 2021-02-24 Matthias Klose <doko@ubuntu.com>
30626 2020-12-07 Matthias Klose <doko@ubuntu.com>
30628 * genextract.c (print_header): Undefine ENABLE_RTL_CHECKING
30629 and ENABLE_RTL_FLAG_CHECKING.
30631 2021-02-24 Richard Biener <rguenther@suse.de>
30634 * builtins.c (fold_builtin_next_arg): Avoid NULL arg.
30636 2021-02-23 Peter Bergner <bergner@linux.ibm.com>
30638 * config/rs6000/mma.md (mma_assemble_pair): Rename from this...
30639 (vsx_assemble_pair): ...to this.
30640 (*mma_assemble_pair): Rename from this...
30641 (*vsx_assemble_pair): ...to this.
30642 (mma_disassemble_pair): Rename from this...
30643 (vsx_disassemble_pair): ...to this.
30644 (*mma_disassemble_pair): Rename from this...
30645 (*vsx_disassemble_pair): ...to this.
30646 * config/rs6000/rs6000-builtin.def (BU_MMA_V2, BU_MMA_V3,
30647 BU_COMPAT): New macros.
30648 (mma_assemble_pair): Rename from this...
30649 (vsx_assemble_pair): ...to this.
30650 (mma_disassemble_pair): Rename from this...
30651 (vsx_disassemble_pair): ...to this.
30652 (mma_assemble_pair): New compatibility built-in.
30653 (mma_disassemble_pair): Likewise.
30654 * config/rs6000/rs6000-call.c (struct builtin_compatibility): New.
30655 (RS6000_BUILTIN_COMPAT): Define.
30656 (bdesc_compat): New.
30657 (mma_expand_builtin): Use VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
30658 (rs6000_gimple_fold_mma_builtin): Use MMA_BUILTIN_DISASSEMBLE_PAIR
30659 and VSX_BUILTIN_ASSEMBLE_PAIR.
30660 (rs6000_init_builtins): Register compatibility built-ins.
30661 (mma_init_builtins): Use VSX_BUILTIN_ASSEMBLE_PAIR,
30662 VSX_BUILTIN_ASSEMBLE_PAIR_INTERNAL, VSX_BUILTIN_DISASSEMBLE_PAIR and
30663 VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
30664 * doc/extend.texi (__builtin_mma_assemble_pair): Rename from this...
30665 (__builtin_vsx_assemble_pair): ...to this.
30666 (__builtin_mma_disassemble_pair): Rename from this...
30667 (__builtin_vsx_disassemble_pair): ...to this.
30669 2021-02-23 Martin Liska <mliska@suse.cz>
30672 * ipa-icf.c (sem_variable::merge): Do not merge 2 variables
30673 with different alignment. That leads to an invalid red zone
30674 size allocated at run time.
30676 2021-02-23 Jakub Jelinek <jakub@redhat.com>
30678 PR tree-optimization/99204
30679 * fold-const.c (fold_read_from_constant_string): Check that
30680 tree_fits_uhwi_p (index) rather than just that index is INTEGER_CST.
30682 2021-02-23 Segher Boessenkool <segher@kernel.crashing.org>
30683 Kewen Lin <linkw@gcc.gnu.org>
30685 * config/rs6000/rs6000.md (*rotl<mode>3_insert_3): Renamed to...
30686 (rotl<mode>3_insert_3): ...this.
30687 (plus_ior_xor): New code_iterator.
30688 (define_split for GPR rl*imi): New splitter.
30689 * config/rs6000/vsx.md (vsx_init_v4si): Use gen_rotldi3_insert_3
30690 for integer merging.
30692 2021-02-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30694 * config/aarch64/aarch64-tuning-flags.def (cse_sve_vl_constants):
30696 * config/aarch64/aarch64.md (add<mode>3): Force CONST_POLY_INT immediates
30697 into a register when the above is enabled.
30698 * config/aarch64/aarch64.c (neoversev1_tunings):
30699 AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
30700 (aarch64_rtx_costs): Use AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
30702 2021-02-22 Hans-Peter Nilsson <hp@axis.com>
30704 * config/cris/cris.c (cris_print_operand) <'T'>: Change
30705 valid operand from an addi mult-value to a shift-value.
30706 * config/cris/cris.md (*addi): Change expression of scaled
30707 operand from mult to ashift.
30708 * config/cris/cris.md (*addi_reload): New insn_and_split.
30710 2021-02-22 John David Anglin <danglin@gcc.gnu.org>
30713 * config/pa/pa.c (TARGET_ASM_CAN_OUTPUT_MI_THUNK): Define as
30714 hook_bool_const_tree_hwi_hwi_const_tree_true.
30715 (pa_asm_output_mi_thunk): Add support for nonzero vcall_offset.
30717 2021-02-22 Andre Vieira <andre.simoesdiasvieira@arm.com>
30719 PR rtl-optimization/98791
30720 * ira-conflicts.c (process_regs_for_copy): Don't create allocno copies
30721 for unordered modes.
30723 2021-02-22 Martin Liska <mliska@suse.cz>
30725 * tree-inline.c (inline_forbidden_p): Set
30726 inline_forbidden_reason.
30728 2021-02-22 Richard Biener <rguenther@suse.de>
30730 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Dump
30733 2021-02-22 Richard Biener <rguenther@suse.de>
30735 PR tree-optimization/99165
30736 * gimple-ssa-store-merging.c (pass_store_merging::process_store):
30737 Accumulate changed to ret.
30739 2021-02-21 Uros Bizjak <ubizjak@gmail.com>
30742 2020-12-09 Uroš Bizjak <ubizjak@gmail.com>
30744 * config/i386/i386.h (REG_ALLOC_ORDER): Remove
30746 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
30749 * config/s390/vector.md (trunctf<DFP_ALL:mode>2_vr): New
30751 (trunctf<DFP_ALL:mode>2): Likewise.
30752 (trunctdtf2_vr): Likewise.
30753 (trunctdtf2): Likewise.
30754 (extend<DFP_ALL:mode>tf2_vr): Likewise.
30755 (extend<DFP_ALL:mode>tf2): Likewise.
30756 (extendtftd2_vr): Likewise.
30757 (extendtftd2): Likewise.
30759 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
30761 * config/s390/vector.md (*fprx2_to_tf): Rename to fprx2_to_tf,
30762 add memory alternative.
30763 (tf_to_fprx2): New pattern.
30765 2021-02-19 Martin Sebor <msebor@redhat.com>
30768 * attribs.c (init_attr_rdwr_indices): Guard vblist use.
30769 (attr_access::free_lang_data): Remove a spurious test.
30771 2021-02-19 Nathan Sidwell <nathan@acm.org>
30773 * doc/invoke.texi (flang-info-module-read): Document.
30775 2021-02-19 Martin Liska <mliska@suse.cz>
30777 PR translation/99167
30778 * params.opt: Fix typo.
30780 2021-02-19 Richard Biener <rguenther@suse.de>
30782 PR middle-end/99122
30783 * tree-inline.c (inline_forbidden_p): Do not inline functions
30784 with VLA arguments or return value.
30786 2021-02-19 Jakub Jelinek <jakub@redhat.com>
30789 * config/arm/arm.md (*stack_protect_combined_set_insn,
30790 *stack_protect_combined_test_insn): If force_const_mem result
30791 is not valid general operand, force its address into the destination
30794 2021-02-19 Jakub Jelinek <jakub@redhat.com>
30797 * tree-cfg.c (gimple_merge_blocks): If bb a starts with eh landing
30798 pad or non-local label, put FORCED_LABELs from bb b after that label
30799 rather than before it.
30801 2021-02-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
30804 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3): Use
30805 expand_vector_broadcast to emit the vec_duplicate operand.
30807 2021-02-18 Vladimir N. Makarov <vmakarov@redhat.com>
30809 PR rtl-optimization/96264
30810 * lra-remat.c (reg_overlap_for_remat_p): Check also output insn
30813 2021-02-18 H.J. Lu <hjl.tools@gmail.com>
30816 * varasm.c (get_section): Replace SUPPORTS_SHF_GNU_RETAIN with
30817 looking up the retain attribute.
30818 (resolve_unique_section): Likewise.
30819 (get_variable_section): Likewise.
30820 (switch_to_section): Likewise. Warn when a symbol without the
30821 retain attribute and a symbol with the retain attribute are
30822 placed in the section with the same name, instead of the used
30824 * doc/extend.texi: Document the "retain" attribute.
30826 2021-02-18 Nathan Sidwell <nathan@acm.org>
30829 * doc/invoke.texi (flang-info-include-translate): Document header
30832 2021-02-18 Richard Biener <rguenther@suse.de>
30834 PR middle-end/99122
30835 * ipa-fnsummary.c (analyze_function_body): Set
30836 CIF_FUNCTION_NOT_INLINABLE for VLA parameter calls.
30837 * tree-inline.c (insert_init_debug_bind): Pass NULL for
30838 error_mark_node values.
30839 (force_value_to_type): Do not build V_C_Es for WITH_SIZE_EXPR
30841 (setup_one_parameter): Delay force_value_to_type until when
30844 2021-02-18 Hans-Peter Nilsson <hp@axis.com>
30846 PR tree-optimization/99142
30847 * match.pd (clz cmp 0): Gate replacement on single_use of clz result.
30849 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30851 * wide-int-bitmask.h (wide_int_bitmask::wide_int_bitmask (),
30852 wide_int_bitmask::wide_int_bitmask (uint64_t),
30853 wide_int_bitmask::wide_int_bitmask (uint64_t, uint64_t),
30854 wide_int_bitmask::operator ~ () const,
30855 wide_int_bitmask::operator | (wide_int_bitmask) const,
30856 wide_int_bitmask::operator & (wide_int_bitmask) const): Use constexpr
30858 * config/i386/i386.h (PTA_3DNOW, PTA_3DNOW_A, PTA_64BIT, PTA_ABM,
30859 PTA_AES, PTA_AVX, PTA_BMI, PTA_CX16, PTA_F16C, PTA_FMA, PTA_FMA4,
30860 PTA_FSGSBASE, PTA_LWP, PTA_LZCNT, PTA_MMX, PTA_MOVBE, PTA_NO_SAHF,
30861 PTA_PCLMUL, PTA_POPCNT, PTA_PREFETCH_SSE, PTA_RDRND, PTA_SSE, PTA_SSE2,
30862 PTA_SSE3, PTA_SSE4_1, PTA_SSE4_2, PTA_SSE4A, PTA_SSSE3, PTA_TBM,
30863 PTA_XOP, PTA_AVX2, PTA_BMI2, PTA_RTM, PTA_HLE, PTA_PRFCHW, PTA_RDSEED,
30864 PTA_ADX, PTA_FXSR, PTA_XSAVE, PTA_XSAVEOPT, PTA_AVX512F, PTA_AVX512ER,
30865 PTA_AVX512PF, PTA_AVX512CD, PTA_NO_TUNE, PTA_SHA, PTA_PREFETCHWT1,
30866 PTA_CLFLUSHOPT, PTA_XSAVEC, PTA_XSAVES, PTA_AVX512DQ, PTA_AVX512BW,
30867 PTA_AVX512VL, PTA_AVX512IFMA, PTA_AVX512VBMI, PTA_CLWB, PTA_MWAITX,
30868 PTA_CLZERO, PTA_NO_80387, PTA_PKU, PTA_AVX5124VNNIW, PTA_AVX5124FMAPS,
30869 PTA_AVX512VPOPCNTDQ, PTA_SGX, PTA_AVX512VNNI, PTA_GFNI, PTA_VAES,
30870 PTA_AVX512VBMI2, PTA_VPCLMULQDQ, PTA_AVX512BITALG, PTA_RDPID,
30871 PTA_PCONFIG, PTA_WBNOINVD, PTA_AVX512VP2INTERSECT, PTA_PTWRITE,
30872 PTA_AVX512BF16, PTA_WAITPKG, PTA_MOVDIRI, PTA_MOVDIR64B, PTA_ENQCMD,
30873 PTA_CLDEMOTE, PTA_SERIALIZE, PTA_TSXLDTRK, PTA_AMX_TILE, PTA_AMX_INT8,
30874 PTA_AMX_BF16, PTA_UINTR, PTA_HRESET, PTA_KL, PTA_WIDEKL, PTA_AVXVNNI,
30875 PTA_X86_64_BASELINE, PTA_X86_64_V2, PTA_X86_64_V3, PTA_X86_64_V4,
30876 PTA_CORE2, PTA_NEHALEM, PTA_WESTMERE, PTA_SANDYBRIDGE, PTA_IVYBRIDGE,
30877 PTA_HASWELL, PTA_BROADWELL, PTA_SKYLAKE, PTA_SKYLAKE_AVX512,
30878 PTA_CASCADELAKE, PTA_COOPERLAKE, PTA_CANNONLAKE, PTA_ICELAKE_CLIENT,
30879 PTA_ICELAKE_SERVER, PTA_TIGERLAKE, PTA_SAPPHIRERAPIDS, PTA_ALDERLAKE,
30880 PTA_KNL, PTA_BONNELL, PTA_SILVERMONT, PTA_GOLDMONT, PTA_GOLDMONT_PLUS,
30881 PTA_TREMONT, PTA_KNM): Use constexpr instead of const.
30883 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30885 PR middle-end/99109
30886 * gimple-array-bounds.cc (build_zero_elt_array_type): Rename to ...
30887 (build_printable_array_type): ... this. Add nelts argument. For
30888 overaligned eltype, use TYPE_MAIN_VARIANT (eltype) instead. If
30889 nelts, call build_array_type_nelts.
30890 (array_bounds_checker::check_mem_ref): Use build_printable_array_type
30891 instead of build_zero_elt_array_type and build_array_type_nelts.
30893 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30896 * config/i386/i386.c (distance_non_agu_define): Don't call
30897 extract_insn_cached here.
30898 (ix86_lea_outperforms): Save and restore recog_data around call
30899 to distance_non_agu_define and distance_agu_use.
30900 (ix86_ok_to_clobber_flags): Remove.
30901 (ix86_avoid_lea_for_add): Don't call ix86_ok_to_clobber_flags.
30902 (ix86_avoid_lea_for_addr): Likewise. Adjust function comment.
30903 * config/i386/i386.md (*lea<mode>): Change from define_insn_and_split
30904 into define_insn. Move the splitting to define_peephole2 and
30905 check there using peep2_regno_dead_p if FLAGS_REG is dead.
30907 2021-02-17 Julian Brown <julian@codesourcery.com>
30909 * gimplify.c (gimplify_scan_omp_clauses): Handle ATTACH_DETACH
30912 2021-02-17 Xi Ruoyao <xry111@mengyan1223.wang>
30915 * config/mips/mips.c (mips_symbol_insns): Do not use
30916 MSA_SUPPORTED_MODE_P if mode is MAX_MACHINE_MODE.
30918 2021-02-16 Vladimir N. Makarov <vmakarov@redhat.com>
30920 PR inline-asm/98096
30921 * stmt.c (resolve_operand_name_1): Take inout operands into account
30922 for access to labels by names.
30923 * doc/extend.texi: Describe counting operands for accessing labels.
30925 2021-02-16 Richard Biener <rguenther@suse.de>
30927 PR tree-optimization/38474
30928 * tree-ssa-structalias.c (variable_info::address_taken): New.
30929 (new_var_info): Initialize address_taken.
30930 (process_constraint): Set address_taken.
30931 (solve_constraints): Use the new address_taken flag rather
30932 than is_reg_var for sorting variables.
30933 (dump_constraint): Dump the variable number if the name
30936 2021-02-16 Jakub Jelinek <jakub@redhat.com>
30939 * tree-vect-stmts.c (vectorizable_simd_clone_call): For num_calls != 1
30940 multiply by 4096 and for inbranch by 8192.
30941 * config/i386/i386.c (ix86_simd_clone_usable): For TARGET_AVX512F,
30942 return 3, 2 or 1 for mangle letters 'b', 'c' or 'd'.
30944 2021-02-15 Maya Rashish <coypu@sdf.org>
30946 * config/aarch64/aarch64.c (aarch64_init_builtins):
30947 Call SUBTARGET_INIT_BUILTINS.
30949 2021-02-15 Peter Bergner <bergner@linux.ibm.com>
30951 PR rtl-optimization/98872
30952 * init-regs.c (initialize_uninitialized_regs): Skip initialization
30953 if CONST0_RTX is NULL.
30955 2021-02-15 Richard Sandiford <richard.sandiford@arm.com>
30957 PR rtl-optimization/98863
30958 * rtl-ssa/functions.h (function_info::bb_live_out_info): Delete.
30959 (function_info::build_info): Turn into a declaration, moving the
30960 definition to internals.h.
30961 (function_info::bb_walker): Declare.
30962 (function_info::create_reg_use): Likewise.
30963 (function_info::calculate_potential_phi_regs): Take a build_info
30965 (function_info::place_phis, function_info::create_ebbs): Declare.
30966 (function_info::calculate_ebb_live_in_for_debug): Likewise.
30967 (function_info::populate_backedge_phis): Delete.
30968 (function_info::start_block, function_info::end_block): Declare.
30969 (function_info::populate_phi_inputs): Delete.
30970 (function_info::m_potential_phi_regs): Move information to build_info.
30971 * rtl-ssa/internals.h: New file.
30972 (function_info::bb_phi_info): New class.
30973 (function_info::build_info): Moved from functions.h.
30974 Add a constructor and destructor.
30975 (function_info::build_info::ebb_use): Delete.
30976 (function_info::build_info::ebb_def): Likewise.
30977 (function_info::build_info::bb_live_out): Likewise.
30978 (function_info::build_info::tmp_ebb_live_in_for_debug): New variable.
30979 (function_info::build_info::potential_phi_regs): Likewise.
30980 (function_info::build_info::potential_phi_regs_for_debug): Likewise.
30981 (function_info::build_info::ebb_def_regs): Likewise.
30982 (function_info::build_info::bb_phis): Likewise.
30983 (function_info::build_info::bb_mem_live_out): Likewise.
30984 (function_info::build_info::bb_to_rpo): Likewise.
30985 (function_info::build_info::def_stack): Likewise.
30986 (function_info::build_info::old_def_stack_limit): Likewise.
30987 * rtl-ssa/internals.inl (function_info::build_info::record_reg_def):
30988 Remove the regno argument. Push the previous definition onto the
30989 definition stack where necessary.
30990 * rtl-ssa/accesses.cc: Include internals.h.
30991 * rtl-ssa/changes.cc: Likewise.
30992 * rtl-ssa/blocks.cc: Likewise.
30993 (function_info::build_info::build_info): Define.
30994 (function_info::build_info::~build_info): Likewise.
30995 (function_info::bb_walker): New class.
30996 (function_info::bb_walker::bb_walker): Define.
30997 (function_info::add_live_out_use): Convert a logarithmic-complexity
30998 test into a linear one. Allow the same definition to be passed
31000 (function_info::calculate_potential_phi_regs): Moved from
31001 functions.cc. Take a build_info parameter and store the
31002 information there instead.
31003 (function_info::place_phis): New function.
31004 (function_info::add_entry_block_defs): Update call to record_reg_def.
31005 (function_info::calculate_ebb_live_in_for_debug): New function.
31006 (function_info::add_phi_nodes): Use bb_phis to decide which
31007 registers need phi nodes and initialize ebb_def_regs accordingly.
31008 Do not add degenerate phis here.
31009 (function_info::add_artificial_accesses): Use create_reg_use.
31010 Assert that all definitions are listed in the DF LR sets.
31011 Update call to record_reg_def.
31012 (function_info::record_block_live_out): Record live-out register
31013 values in the phis of successor blocks. Use the live-out set
31014 when processing the last block in an EBB, instead of always
31015 using the live-in sets of successor blocks. AND the live sets
31016 with the set of registers that have been defined in the EBB,
31017 rather than with all potential phi registers. Cope correctly
31018 with branches back to the start of the current EBB.
31019 (function_info::start_block): New function.
31020 (function_info::end_block): Likewise.
31021 (function_info::populate_phi_inputs): Likewise.
31022 (function_info::create_ebbs): Likewise.
31023 (function_info::process_all_blocks): Rewrite into a multi-phase
31025 * rtl-ssa/functions.cc: Include internals.h.
31026 (function_info::calculate_potential_phi_regs): Move to blocks.cc.
31027 (function_info::init_function_data): Remove caller.
31028 * rtl-ssa/insns.cc: Include internals.h.
31029 (function_info::create_reg_use): New function. Lazily add any
31030 degenerate phis needed by the linear RPO view.
31031 (function_info::record_use): Use create_reg_use. When processing
31032 debug uses, use potential_phi_regs and test it before checking
31033 whether the register is live on entry to the current EBB. Lazily
31034 calculate ebb_live_in_for_debug.
31035 (function_info::record_call_clobbers): Update call to record_reg_def.
31036 (function_info::record_def): Likewise.
31038 2021-02-15 Martin Liska <mliska@suse.cz>
31040 * toplev.c (init_asm_output): Free output of
31041 gen_command_line_string function.
31042 (process_options): Likewise.
31044 2021-02-15 Martin Liska <mliska@suse.cz>
31046 * params.opt: Add 2 missing Param keywords.
31048 2021-02-15 Eric Botcazou <ebotcazou@adacore.com>
31050 * df-core.c (df_worklist_dataflow_doublequeue): Use proper cast.
31052 2021-02-15 Jakub Jelinek <jakub@redhat.com>
31054 PR tree-optimization/99079
31055 * match.pd (A % (pow2pcst << N) -> A & ((pow2pcst << N) - 1)): Remove
31056 useless tree_nop_conversion_p (type, TREE_TYPE (@3)) check. Instead
31057 require both type and TREE_TYPE (@1) to be integral types and either
31058 type having smaller or equal precision, or TREE_TYPE (@1) being
31059 unsigned type, or type being signed type. If TREE_TYPE (@1)
31060 doesn't have wrapping overflow, perform the subtraction of one in
31063 2021-02-14 Jan Hubicka <hubicka@ucw.cz>
31064 Richard Biener <rguenther@suse.de>
31067 * ipa-reference.c (ipa_init): Only conditionally initialize
31068 reference_vars_to_consider.
31069 (propagate): Conditionally deinitialize reference_vars_to_consider.
31070 (ipa_reference_write_optimization_summary): Sanity check that
31071 reference_vars_to_consider is not allocated.
31073 2021-02-13 Levy Hsu <admin@levyhsu.com>
31076 * config/riscv/riscv-shorten-memrefs.c (pass_shorten_memrefs): Add
31077 extend parameter to get_si_mem_base_reg declaration.
31078 (get_si_mem_base_reg): Add extend parameter. Set it.
31079 (analyze): Pass extend arg to get_si_mem_base_reg.
31080 (transform): Likewise. Use it when rewriting mems.
31081 * config/riscv/riscv.c (riscv_legitimize_move): Check for subword
31082 loads and emit sign/zero extending load followed by subreg move.
31084 2021-02-13 Jim Wilson <jimw@sifive.com>
31087 * config/riscv/riscv.c (riscv_compressed_lw_address_p): Drop early
31088 exit when !reload_completed. Only perform check for compressed reg
31089 if reload_completed.
31090 (riscv_rtx_costs): In MEM case, when optimizing for size and
31091 shorten memrefs, if not compressible, then increase cost.
31093 2021-02-13 Jakub Jelinek <jakub@redhat.com>
31095 PR rtl-optimization/98439
31096 * recog.c (pass_split_before_regstack::gate): Enable even when
31097 pass_split_before_sched2 is enabled if -fselective-scheduling2 is
31100 2021-02-13 Jakub Jelinek <jakub@redhat.com>
31103 * config/i386/mmx.md (*mmx_pshufd_1): Add a combine splitter for
31104 swap of V2SImode elements in memory into DImode memory rotate by 32.
31106 2021-02-12 Martin Sebor <msebor@redhat.com>
31108 * tree-pretty-print.c (print_generic_expr_to_str): Update comment.
31110 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
31112 * rtl-ssa/accesses.cc (function_info::make_use_available): Use
31113 m_temp_obstack rather than m_obstack to allocate the temporary use.
31115 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
31117 * df-problems.c (df_lr_bb_local_compute): Treat partial definitions
31118 as read-modify operations.
31120 2021-02-12 Richard Biener <rguenther@suse.de>
31122 PR middle-end/38474
31123 * ipa-fnsummary.c (unmodified_parm_1): Only walk when
31124 fbi->aa_walk_budget is bigger than zero. Update
31125 fbi->aa_walk_budget.
31126 (param_change_prob): Likewise.
31127 * ipa-prop.c (detect_type_change_from_memory_writes):
31128 Properly account walk_aliased_vdefs.
31129 (parm_preserved_before_stmt_p): Canonicalize updates.
31130 (parm_ref_data_preserved_p): Likewise.
31131 (parm_ref_data_pass_through_p): Likewise.
31132 (determine_known_aggregate_parts): Account own alias queries.
31134 2021-02-12 Martin Liska <mliska@suse.cz>
31136 * opts-common.c (decode_cmdline_option): Release werror_arg.
31137 * opts.c (gen_producer_string): Release output of
31138 gen_command_line_string.
31140 2021-02-12 Richard Biener <rguenther@suse.de>
31142 PR tree-optimization/38474
31143 * params.opt (-param=max-store-chains-to-track=): New param.
31144 (-param=max-stores-to-track=): Likewise.
31145 * doc/invoke.texi (max-store-chains-to-track): Document.
31146 (max-stores-to-track): Likewise.
31147 * gimple-ssa-store-merging.c (pass_store_merging::m_n_chains):
31149 (pass_store_merging::m_n_stores): Likewise.
31150 (pass_store_merging::terminate_and_process_chain): Update
31151 m_n_stores and m_n_chains.
31152 (pass_store_merging::process_store): Likewise. Terminate
31153 oldest chains if the number of stores or chains gets too large.
31154 (imm_store_chain_info::terminate_and_process_chain): Dump
31157 2021-02-11 Eric Botcazou <ebotcazou@adacore.com>
31159 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
31160 the cold section, emit a nop before the directive if the previous
31161 active instruction can throw.
31163 2021-02-11 Peter Bergner <bergner@linux.ibm.com>
31166 * config/rs6000/predicates.md (mma_assemble_input_operand): Restrict
31167 memory addresses that are legal for quad word accesses.
31169 2021-02-11 Andrea Corallo <andrea.corallo@arm.com>
31172 * config/arm/thumb2.md (*doloop_end_internal): Generate
31173 alternative sequence to handle long range branches.
31175 2021-02-11 Joel Hutton <joel.hutton@arm.com>
31177 PR tree-optimization/98772
31178 * optabs-tree.c (supportable_half_widening_operation): New function
31179 to check for supportable V8QI->V8HI widening patterns.
31180 * optabs-tree.h (supportable_half_widening_operation): New function.
31181 * tree-vect-stmts.c (vect_create_half_widening_stmts): New function
31182 to create promotion stmts for V8QI->V8HI widening patterns.
31183 (vectorizable_conversion): Add case for V8QI->V8HI.
31185 2021-02-11 Richard Biener <rguenther@suse.de>
31187 * sparseset.h (SPARSESET_ELT_BITS): Remove.
31188 (SPARSESET_ELT_TYPE): Use unsigned int.
31189 * fwprop.c: Do not include sparseset.h.
31191 2021-02-10 Jakub Jelinek <jakub@redhat.com>
31194 * varasm.c (declare_weak): For -fsyntax-only, allow even
31195 TREE_ASM_WRITTEN function decls.
31197 2021-02-10 Jakub Jelinek <jakub@redhat.com>
31200 * config/i386/sse.md (fix<fixunssuffix>_truncv2sfv2di2,
31201 <insn>v8qiv8hi2, <insn>v8qiv8si2, <insn>v4qiv4si2, <insn>v4hiv4si2,
31202 <insn>v8qiv8di2, <insn>v4qiv4di2, <insn>v2qiv2di2, <insn>v4hiv4di2,
31203 <insn>v2hiv2di2, <insn>v2siv2di2): Force operands[1] into REG before
31204 calling simplify_gen_subreg on it.
31206 2021-02-10 Martin Liska <mliska@suse.cz>
31208 * config/nvptx/nvptx.c (nvptx_option_override): Use
31209 flag_patchable_function_entry instead of the removed
31210 function_entry_patch_area_size.
31212 2021-02-10 Martin Liska <mliska@suse.cz>
31214 PR tree-optimization/99002
31215 PR tree-optimization/99026
31216 * gimple-if-to-switch.cc (if_chain::is_beneficial): Fix memory
31217 leak when adjacent cases are merged.
31218 * tree-switch-conversion.c (switch_decision_tree::analyze_switch_statement): Use
31220 (make_pass_lower_switch): Remove trailing whitespace.
31221 * tree-switch-conversion.h (release_clusters): New.
31223 2021-02-10 Richard Biener <rguenther@suse.de>
31225 PR rtl-optimization/99054
31226 * cfgrtl.c (rtl-optimization/99054): Return an auto_vec.
31227 (fixup_partitions): Adjust.
31228 (rtl_verify_edges): Likewise.
31230 2021-02-10 Jakub Jelinek <jakub@redhat.com>
31232 PR middle-end/99007
31233 * gimplify.c (gimplify_scan_omp_clauses): For MEM_REF on reductions,
31234 temporarily disable gimplify_ctxp->into_ssa around gimplify_expr
31237 2021-02-10 Richard Biener <rguenther@suse.de>
31240 * ipa-pure-const.c (propagate_malloc): Use an auto_vec<>
31243 2021-02-10 Richard Biener <rguenther@suse.de>
31245 PR tree-optimization/99024
31246 * tree-vect-loop.c (_loop_vec_info::~_loop_vec_info): Only
31247 clear loop->aux if it is associated with the destroyed loop_vinfo.
31249 2021-02-10 Martin Liska <mliska@suse.cz>
31251 PR tree-optimization/99002
31252 * gimple-if-to-switch.cc (find_conditions): Fix memory leak
31255 2021-02-10 Martin Liska <mliska@suse.cz>
31258 * ipa-icf.c (sem_item::add_reference): Fix memory leak when
31259 a reference exists.
31261 2021-02-10 Jakub Jelinek <jakub@redhat.com>
31264 * dwarf2out.c (prune_unused_types_walk): Mark DW_TAG_variable DIEs
31265 at class scope for DWARF5+.
31267 2021-02-09 Eric Botcazou <ebotcazou@adacore.com>
31269 PR rtl-optimization/96015
31270 * reorg.c (skip_consecutive_labels): Minor comment tweaks.
31271 (relax_delay_slots): When deleting a jump to the next active
31272 instruction over a barrier, first delete the barrier if the
31273 jump is the only way to reach the target label.
31275 2021-02-09 Andre Vieira <andre.simoesdiasvieira@arm.com>
31277 * config/aarch64/aarch64-cost-tables.h: Add entries for vect.mul.
31278 * config/aarch64/aarch64.c (aarch64_rtx_mult_cost): Use vect.mul for
31279 vector multiplies and vect.alu for SSRA.
31280 * config/arm/aarch-common-protos.h (struct vector_cost_table): Define
31281 vect.mul cost field.
31282 * config/arm/aarch-cost-tables.h: Add entries for vect.mul.
31283 * config/arm/arm.c: Likewise.
31285 2021-02-09 Richard Biener <rguenther@suse.de>
31287 PR tree-optimization/98863
31288 * tree-ssa-sccvn.h (vn_avail::next_undo): Add.
31289 * tree-ssa-sccvn.c (last_pushed_avail): New global.
31290 (rpo_elim::eliminate_push_avail): Chain pushed avails.
31291 (unwind_state::avail_top): Add.
31292 (do_unwind): Rewrite unwinding of avail entries.
31293 (do_rpo_vn): Initialize last_pushed_avail and
31294 avail_top of the undo state.
31296 2021-02-09 Jakub Jelinek <jakub@redhat.com>
31298 PR middle-end/99004
31299 * calls.c (maybe_warn_rdwr_sizes): Change s0 and s1 type from
31300 const char * to char * and free those pointers after use.
31302 2021-02-09 Richard Biener <rguenther@suse.de>
31304 PR tree-optimization/99017
31305 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Allow
31306 zero vector cost entries.
31308 2021-02-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
31310 PR middle-end/98974
31311 * tree-vect-stmts.c (vectorizable_condition): Remove shadow vec_num
31312 parameter in vectorizable_condition.
31314 2021-02-08 Richard Biener <rguenther@suse.de>
31317 * tree.c (walk_tree_1): Walk VECTOR_CST elements.
31319 2021-02-08 Martin Liska <mliska@suse.cz>
31322 * cfgexpand.c (pass_expand::execute): Parse per-function option
31323 flag_patchable_function_entry and use it.
31324 * common.opt: Remove function_entry_patch_area_size and
31325 function_entry_patch_area_start global variables.
31326 * opts.c (parse_and_check_patch_area): New function.
31327 (common_handle_option): Use it.
31328 * opts.h (parse_and_check_patch_area): New function.
31329 * toplev.c (process_options): Parse and use
31330 function_entry_patch_area_size.
31332 2021-02-08 Martin Sebor <msebor@redhat.com>
31334 * doc/extend.texi (attribute malloc): Correct typos.
31336 2021-02-05 Nathan Sidwell <nathan@acm.org>
31339 * gcc.c (driver::maybe_run_linker): Check for input file
31340 accessibility if not linking.
31342 2021-02-05 Richard Biener <rguenther@suse.de>
31344 PR tree-optimization/98855
31345 * tree-vectorizer.h (add_stmt_cost): New overload.
31346 * tree-vect-slp.c (li_cost_vec_cmp): New.
31347 (vect_bb_slp_scalar_cost): Cost individual loop regions
31348 separately. Account for the scalar instance root stmt.
31350 2021-02-05 Tom de Vries <tdevries@suse.de>
31353 * tree-switch-conversion.c (jump_table_cluster::emit): Add loc
31355 (bit_test_cluster::emit): Reuse location_t for newly created
31357 (switch_decision_tree::try_switch_expansion): Preserve
31359 * tree-switch-conversion.h: Change function signatures.
31361 2021-02-05 Jakub Jelinek <jakub@redhat.com>
31364 * config/i386/i386-options.c (m_NONE, m_ALL): Define.
31365 * config/i386/x86-tune.def (X86_TUNE_BRANCH_PREDICTION_HINTS,
31366 X86_TUNE_PROMOTE_QI_REGS): Use m_NONE instead of 0U.
31367 (X86_TUNE_QIMODE_MATH): Use m_ALL instead of ~0U.
31369 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31371 * config/aarch64/aarch64-simd-builtins.def (get_high): Define builtin.
31372 * config/aarch64/aarch64-simd.md (aarch64_get_high<mode>): Define.
31373 * config/aarch64/arm_neon.h (__GET_HIGH): Delete.
31374 (vget_high_f16): Reimplement using new builtin.
31375 (vget_high_f32): Likewise.
31376 (vget_high_f64): Likewise.
31377 (vget_high_p8): Likewise.
31378 (vget_high_p16): Likewise.
31379 (vget_high_p64): Likewise.
31380 (vget_high_s8): Likewise.
31381 (vget_high_s16): Likewise.
31382 (vget_high_s32): Likewise.
31383 (vget_high_s64): Likewise.
31384 (vget_high_u8): Likewise.
31385 (vget_high_u16): Likewise.
31386 (vget_high_u32): Likewise.
31387 (vget_high_u64): Likewise.
31389 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31391 * config/aarch64/aarch64-simd-builtins.def (get_low): Define builtin.
31392 * config/aarch64/aarch64-simd.md (aarch64_get_low<mode>): Define.
31393 * config/aarch64/arm_neon.h (__GET_LOW): Delete.
31394 (vget_low_f16): Reimplement using new builtin.
31395 (vget_low_f32): Likewise.
31396 (vget_low_f64): Likewise.
31397 (vget_low_p8): Likewise.
31398 (vget_low_p16): Likewise.
31399 (vget_low_p64): Likewise.
31400 (vget_low_s8): Likewise.
31401 (vget_low_s16): Likewise.
31402 (vget_low_s32): Likewise.
31403 (vget_low_s64): Likewise.
31404 (vget_low_u8): Likewise.
31405 (vget_low_u16): Likewise.
31406 (vget_low_u32): Likewise.
31407 (vget_low_u64): Likewise.
31409 2021-02-05 Kito Cheng <kito.cheng@sifive.com>
31411 * gcc.c (print_multilib_info): Check all required arguments are provided
31414 2021-02-05 liuhongt <hongtao.liu@intel.com>
31417 * config/i386/i386-expand.c (ix86_expand_sse_cmp): Don't
31418 generate integer mask comparison for 128/256-bits vector when
31419 op_true/op_false is NULL_RTX or CONSTM1_RTX/CONST0_RTX. Also
31420 delete redundant !maskcmp condition.
31421 (ix86_expand_int_vec_cmp): Ditto but no redundant deletion
31423 (ix86_expand_sse_movcc): Delete definition of maskcmp, add the
31424 condition directly to if (maskcmp), add extra check for
31425 cmpmode, it should be MODE_INT.
31426 (ix86_expand_fp_vec_cmp): Pass NULL to ix86_expand_sse_cmp's
31427 parameters op_true/op_false.
31428 (ix86_use_mask_cmp_p): New.
31430 2021-02-05 liuhongt <hongtao.liu@intel.com>
31433 * config/i386/x86-tune.def (X86_TUNE_AVX256_UNALIGNED_LOAD_OPTIMAL):
31434 Remove m_GENERIC from ~list.
31435 (X86_TUNE_AVX256_UNALIGNED_STORE_OPTIMAL): Ditto.
31437 2021-02-04 David Malcolm <dmalcolm@redhat.com>
31440 * diagnostic-show-locus.c (compatible_locations_p): Require
31441 locations in the same macro map to be either both from the
31442 macro definition, or both from the macro arguments.
31444 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
31446 * config/aarch64/aarch64-simd-builtins.def: Add
31447 [su]mull_hi_lane[q] builtin generator macros.
31448 * config/aarch64/aarch64-simd.md
31449 (aarch64_<su>mull_hi_lane<mode>_insn): Define.
31450 (aarch64_<su>mull_hi_lane<mode>): Define.
31451 (aarch64_<su>mull_hi_laneq<mode>_insn): Define.
31452 (aarch64_<su>mull_hi_laneq<mode>): Define.
31453 * config/aarch64/arm_neon.h (vmull_high_lane_s16): Use RTL
31454 builtin instead of inline asm.
31455 (vmull_high_lane_s32): Likewise.
31456 (vmull_high_lane_u16): Likewise.
31457 (vmull_high_lane_u32): Likewise.
31458 (vmull_high_laneq_s16): Likewise.
31459 (vmull_high_laneq_s32): Likewise.
31460 (vmull_high_laneq_u16): Likewise.
31461 (vmull_high_laneq_u32): Likewise.
31463 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
31465 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_hi_n
31466 builtin generator macros.
31467 * config/aarch64/aarch64-simd.md
31468 (aarch64_<su>mull_hi_n<mode>_insn): Define.
31469 (aarch64_<su>mull_hi_n<mode>): Define.
31470 * config/aarch64/arm_neon.h (vmull_high_n_s16): Use RTL builtin
31471 instead of inline asm.
31472 (vmull_high_n_s32): Likewise.
31473 (vmull_high_n_u16): Likewise.
31474 (vmull_high_n_u32): Likewise.
31476 2021-02-04 Richard Biener <rguenther@suse.de>
31478 PR tree-optimization/98855
31479 * tree-vect-loop.c (vectorizable_phi): Do not cost
31480 single-argument PHIs.
31481 * tree-vect-slp.c (vect_bb_slp_scalar_cost): Likewise.
31482 * tree-vect-stmts.c (vectorizable_bswap): Also perform
31483 costing for SLP operation.
31485 2021-02-04 Martin Liska <mliska@suse.cz>
31487 * doc/extend.texi: Mention -mprefer-vector-width in target
31490 2021-02-03 Martin Sebor <msebor@redhat.com>
31492 PR tree-optimization/98937
31493 * tree-ssa-strlen.c (strlen_dom_walker::~strlen_dom_walker): Define.
31494 Flush pointer_query cache.
31496 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
31498 * config/rs6000/genfusion.pl (gen_2logical): Add missing
31499 fixes based on patch review.
31500 * config/rs6000/fusion.md: Regenerate file.
31502 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
31504 * config/rs6000/t-rs6000: Comment out auto generation of
31507 2021-02-03 Andrew Stubbs <ams@codesourcery.com>
31509 * config/gcn/gcn-opts.h (enum processor_type): Add PROCESSOR_GFX908.
31510 * config/gcn/gcn.c (gcn_omp_device_kind_arch_isa): Add gfx908.
31511 (output_file_start): Add gfx908.
31512 * config/gcn/gcn.opt (gpu_type): Add gfx908.
31513 * config/gcn/t-gcn-hsa (MULTILIB_OPTIONS): Add march=gfx908.
31514 (MULTILIB_DIRNAMES): Add gfx908.
31515 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): New define.
31516 (main): Recognize gfx908.
31517 * config/gcn/t-omp-device: Add gfx908.
31519 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31521 * config/aarch64/aarch64-simd-builtins.def: Add
31522 [su]mlsl_hi_lane[q] builtin macro generators.
31523 * config/aarch64/aarch64-simd.md
31524 (aarch64_<su>mlsl_hi_lane<mode>_insn): Define.
31525 (aarch64_<su>mlsl_hi_lane<mode>): Define.
31526 (aarch64_<su>mlsl_hi_laneq<mode>_insn): Define.
31527 (aarch64_<su>mlsl_hi_laneq<mode>): Define.
31528 * config/aarch64/arm_neon.h (vmlsl_high_lane_s16): Use RTL
31529 builtin instead of inline asm.
31530 (vmlsl_high_lane_s32): Likewise.
31531 (vmlsl_high_lane_u16): Likewise.
31532 (vmlsl_high_lane_u32): Likewise.
31533 (vmlsl_high_laneq_s16): Likewise.
31534 (vmlsl_high_laneq_s32): Likewise.
31535 (vmlsl_high_laneq_u16): Likewise.
31536 (vmlsl_high_laneq_u32): Likewise.
31537 (vmlal_high_laneq_u32): Likewise.
31539 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31541 * config/aarch64/aarch64-simd-builtins.def: Add
31542 [su]mlal_hi_lane[q] builtin generator macros.
31543 * config/aarch64/aarch64-simd.md
31544 (aarch64_<su>mlal_hi_lane<mode>_insn): Define.
31545 (aarch64_<su>mlal_hi_lane<mode>): Define.
31546 (aarch64_<su>mlal_hi_laneq<mode>_insn): Define.
31547 (aarch64_<su>mlal_hi_laneq<mode>): Define.
31548 * config/aarch64/arm_neon.h (vmlal_high_lane_s16): Use RTL
31549 builtin instead of inline asm.
31550 (vmlal_high_lane_s32): Likewise.
31551 (vmlal_high_lane_u16): Likewise.
31552 (vmlal_high_lane_u32): Likewise.
31553 (vmlal_high_laneq_s16): Likewise.
31554 (vmlal_high_laneq_s32): Likewise.
31555 (vmlal_high_laneq_u16): Likewise.
31556 (vmlal_high_laneq_u32): Likewise.
31558 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31560 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_hi_n
31561 builtin generator macros.
31562 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_hi_n<mode>_insn):
31564 (aarch64_<su>mlsl_hi_n<mode>): Define.
31565 * config/aarch64/arm_neon.h (vmlsl_high_n_s16): Use RTL builtin
31566 instead of inline asm.
31567 (vmlsl_high_n_s32): Likewise.
31568 (vmlsl_high_n_u16): Likewise.
31569 (vmlsl_high_n_u32): Likewise.
31571 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31573 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_hi_n
31574 builtin generator macros.
31575 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_hi_n<mode>_insn):
31577 (aarch64_<su>mlal_hi_n<mode>): Define.
31578 * config/aarch64/arm_neon.h (vmlal_high_n_s16): Use RTL builtin
31579 instead of inline asm.
31580 (vmlal_high_n_s32): Likewise.
31581 (vmlal_high_n_u16): Likewise.
31582 (vmlal_high_n_u32): Likewise.
31584 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31586 * config/aarch64/aarch64-simd-builtins.def: Add RTL builtin
31588 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal_hi<mode>):
31590 (aarch64_<su>mlal_hi<mode>_insn): This.
31591 (aarch64_<su>mlal_hi<mode>): Define.
31592 * config/aarch64/arm_neon.h (vmlal_high_s8): Use RTL builtin
31593 instead of inline asm.
31594 (vmlal_high_s16): Likewise.
31595 (vmlal_high_s32): Likewise.
31596 (vmlal_high_u8): Likewise.
31597 (vmlal_high_u16): Likewise.
31598 (vmlal_high_u32): Likewise.
31600 2021-02-03 Ilya Leoshkevich <iii@linux.ibm.com>
31602 * lra-spills.c (remove_pseudos): Call lra_update_insn_recog_data()
31603 after calling alter_subreg() on a (mem).
31605 2021-02-03 Martin Liska <mliska@suse.cz>
31608 * lto-streamer-out.c (produce_lto_section): Fill up missing
31610 * lto-streamer.h (struct lto_section): Add _padding field.
31612 2021-02-03 Richard Biener <rguenther@suse.de>
31614 * lto-streamer.c (lto_get_section_name): Free temporary
31616 * tree-loop-distribution.c
31617 (loop_distribution::merge_dep_scc_partitions): Free edge data.
31619 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31621 PR middle-end/97487
31622 * ifcvt.c (noce_can_force_operand): New function.
31623 (noce_emit_move_insn): Use it.
31624 (noce_try_sign_mask): Likewise. Formatting fix.
31626 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31628 PR middle-end/97971
31629 * lra-constraints.c (process_alt_operands): For inline asm, don't call
31630 fatal_insn, but instead return false.
31632 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31634 PR tree-optimization/98287
31635 * config/i386/mmx.md (<insn><mode>3): For shifts don't enable expander
31638 2021-02-03 Tamar Christina <tamar.christina@arm.com>
31640 PR tree-optimization/98928
31641 * tree-vect-loop.c (vect_analyze_loop_2): Change
31642 STMT_VINFO_SLP_VECT_ONLY to STMT_VINFO_SLP_VECT_ONLY_PATTERN.
31643 * tree-vect-slp-patterns.c (complex_pattern::build): Likewise.
31644 * tree-vectorizer.h (STMT_VINFO_SLP_VECT_ONLY_PATTERN): New.
31645 (class _stmt_vec_info): Add slp_vect_pattern_only_p.
31647 2021-02-02 Richard Biener <rguenther@suse.de>
31649 * gimple-loop-interchange.cc (prepare_data_references):
31651 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
31652 * tree-ssa-loop-im.c (hoist_memory_references): Likewise.
31653 * tree-vect-stmts.c (vectorizable_condition): Do not
31655 (vectorizable_comparison): Likewise.
31657 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31659 * config/aarch64/aarch64-simd-builtins.def (ursqrte): Define builtin.
31660 * config/aarch64/aarch64-simd.md (aarch64_ursqrte<mode>): New pattern.
31661 * config/aarch64/arm_neon.h (vrsqrte_u32): Reimplement using builtin.
31662 (vrsqrteq_u32): Likewise.
31664 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31666 * config/aarch64/aarch64-simd-builtins.def (sqxtun2): Define builtin.
31667 * config/aarch64/aarch64-simd.md (aarch64_sqxtun2<mode>_le): Define.
31668 (aarch64_sqxtun2<mode>_be): Likewise.
31669 (aarch64_sqxtun2<mode>): Likewise.
31670 * config/aarch64/arm_neon.h (vqmovun_high_s16): Reimplement using builtin.
31671 (vqmovun_high_s32): Likewise.
31672 (vqmovun_high_s64): Likewise.
31673 * config/aarch64/iterators.md (UNSPEC_SQXTUN2): Define.
31675 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31677 * config/aarch64/aarch64-simd-builtins.def (bfdot_lane, bfdot_laneq): Use
31679 (bfmlalb_lane, bfmlalt_lane, bfmlalb_lane_q, bfmlalt_lane_q): Use FP flags.
31681 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31683 * config/aarch64/aarch64-simd-builtins.def (fcmla_lane0, fcmla_lane90,
31684 fcmla_lane180, fcmla_lane270, fcmlaq_lane0, fcmlaq_lane90, fcmlaq_lane180,
31685 fcmlaq_lane270, scvtf, ucvtf, fcvtzs, fcvtzu, scvtfsi, scvtfdi, ucvtfsi,
31686 ucvtfdi, fcvtzshf, fcvtzuhf, fmlal_lane_low, fmlsl_lane_low,
31687 fmlal_laneq_low, fmlsl_laneq_low, fmlalq_lane_low, fmlslq_lane_low,
31688 fmlalq_laneq_low, fmlslq_laneq_low, fmlal_lane_high, fmlsl_lane_high,
31689 fmlal_laneq_high, fmlsl_laneq_high, fmlalq_lane_high, fmlslq_lane_high,
31690 fmlalq_laneq_high, fmlslq_laneq_high): Use FP flags.
31692 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31694 * config/aarch64/aarch64-builtins.c (FLAG_LOAD): Define.
31695 * config/aarch64/aarch64-simd-builtins.def (ld1x2, ld2, ld3, ld4, ld2r,
31696 ld3r, ld4r, ld1, ld1x3, ld1x4): Use LOAD flags.
31698 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31700 * config/aarch64/aarch64-simd-builtins.def (combine, zip1, zip2,
31701 uzp1, uzp2, trn1, trn2, simd_bsl): Use AUTO_FP flags.
31703 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31705 * config/aarch64/aarch64-simd-builtins.def (clrsb, clz, ctz, popcount,
31706 vec_smult_lane_, vec_smlal_lane_, vec_smult_laneq_, vec_smlal_laneq_,
31707 vec_umult_lane_, vec_umlal_lane_, vec_umult_laneq_, vec_umlal_laneq_,
31708 ashl, sshl, ushl, srshl, urshl, sdot_lane, udot_lane, sdot_laneq,
31709 udot_laneq, usdot_lane, usdot_laneq, sudot_lane, sudot_laneq, ashr,
31710 ashr_simd, lshr, lshr_simd, srshr_n, urshr_n, ssra_n, usra_n, srsra_n,
31711 ursra_n, sshll_n, ushll_n, sshll2_n, ushll2_n, ssri_n, usri_n, ssli_n,
31712 ssli_n, usli_n, bswap, rbit, simd_bsl, eor3q, rax1q, xarq, bcaxq): Use
31713 NONE builtin flags.
31715 2021-02-02 Jakub Jelinek <jakub@redhat.com>
31717 PR tree-optimization/98848
31718 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Punt if
31719 STMT_VINFO_DEF_TYPE (last_stmt_info) is vect_reduction_def.
31721 2021-02-02 Kito Cheng <kito.cheng@sifive.com>
31724 * expr.c: Check mode before calling store_expr.
31726 2021-02-02 Christophe Lyon <christophe.lyon@linaro.org>
31728 * config/arm/iterators.md (supf): Remove VORNQ_S and VORNQ_U.
31730 * config/arm/mve.md (mve_vornq_s<mode>): New entry for vorn
31731 instruction using expression ior.
31732 (mve_vornq_u<mode>): New expander.
31733 (mve_vornq_f<mode>): Use ior code instead of unspec.
31734 * config/arm/unspecs.md (VORNQ_S, VORNQ_U, VORNQ_F): Remove.
31736 2021-02-02 Alexandre Oliva <oliva@adacore.com>
31738 * tree-nested.c (convert_nonlocal_reference_op): Move
31739 current_function_decl restore after re-gimplification.
31740 (convert_local_reference_op): Likewise.
31742 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31744 * config/aarch64/aarch64-simd-builtins.def (rshrn, rshrn2):
31746 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>_insn_le):
31748 (aarch64_rshrn<mode>_insn_be): Likewise.
31749 (aarch64_rshrn<mode>): Likewise.
31750 (aarch64_rshrn2<mode>_insn_le): Likewise.
31751 (aarch64_rshrn2<mode>_insn_be): Likewise.
31752 (aarch64_rshrn2<mode>): Likewise.
31753 * config/aarch64/aarch64.md (unspec): Add UNSPEC_RSHRN.
31754 * config/aarch64/arm_neon.h (vrshrn_high_n_s16): Reimplement
31756 (vrshrn_high_n_s32): Likewise.
31757 (vrshrn_high_n_s64): Likewise.
31758 (vrshrn_high_n_u16): Likewise.
31759 (vrshrn_high_n_u32): Likewise.
31760 (vrshrn_high_n_u64): Likewise.
31761 (vrshrn_n_s16): Likewise.
31762 (vrshrn_n_s32): Likewise.
31763 (vrshrn_n_s64): Likewise.
31764 (vrshrn_n_u16): Likewise.
31765 (vrshrn_n_u32): Likewise.
31766 (vrshrn_n_u64): Likewise.
31768 2021-02-01 Sergei Trofimovich <siarheit@google.com>
31770 PR tree-optimization/98499
31771 * ipa-modref.c (analyze_ssa_name_flags): Treat RVO
31772 conservatively and assume all possible side-effects.
31774 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31776 * config/aarch64/aarch64-simd-builtins.def (vec_unpacks_hi,
31777 vec_unpacku_hi_): Define builtins.
31778 * config/aarch64/arm_neon.h (vmovl_high_s8): Reimplement using
31780 (vmovl_high_s16): Likewise.
31781 (vmovl_high_s32): Likewise.
31782 (vmovl_high_u8): Likewise.
31783 (vmovl_high_u16): Likewise.
31784 (vmovl_high_u32): Likewise.
31786 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31788 * config/aarch64/aarch64-simd-builtins.def (sabdl, uabdl):
31790 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl<mode>): New
31792 * config/aarch64/aarch64.md (unspec): Define UNSPEC_SABDL,
31794 * config/aarch64/arm_neon.h (vabdl_s8): Reimplement using
31796 (vabdl_s16): Likewise.
31797 (vabdl_s32): Likewise.
31798 (vabdl_u8): Likewise.
31799 (vabdl_u16): Likewise.
31800 (vabdl_u32): Likewise.
31801 * config/aarch64/iterators.md (ABDL): New int iterator.
31802 (sur): Handle UNSPEC_SABDL, UNSPEC_UABDL.
31804 2021-02-01 Martin Sebor <msebor@redhat.com>
31806 * tree.h (BLOCK_VARS): Add comment.
31807 (BLOCK_SUBBLOCKS): Same.
31808 (BLOCK_SUPERCONTEXT): Same.
31809 (BLOCK_ABSTRACT_ORIGIN): Same.
31810 (inlined_function_outer_scope_p): Same.
31812 2021-02-01 Martin Sebor <msebor@redhat.com>
31814 PR middle-end/97172
31815 * attribs.c (attr_access::free_lang_data): Define new function.
31816 * attribs.h (attr_access::free_lang_data): Declare new function.
31818 2021-02-01 Richard Biener <rguenther@suse.de>
31820 * vec.h (auto_vec::auto_vec): Add memory stat parameters
31822 * bitmap.h (auto_bitmap::auto_bitmap): Likewise.
31824 2021-02-01 Tamar Christina <tamar.christina@arm.com>
31826 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>,
31827 aarch64_<su>mlsl<mode>, aarch64_<su>mlsl_n<mode>): Flip mult operands.
31829 2021-02-01 Richard Biener <rguenther@suse.de>
31831 PR rtl-optimization/98863
31832 * config/i386/i386-features.c (convert_scalars_to_vector):
31833 Set DF_RD_PRUNE_DEAD_DEFS.
31835 2021-01-31 Eric Botcazou <ebotcazou@adacore.com>
31837 * system.h (SIZE_MAX): Define if not already defined.
31839 2021-01-30 Aaron Sawdey <acsawdey@linux.ibm.com>
31841 * config/rs6000/genfusion.pl (gen_2logical): New function to
31842 generate patterns for logical-logical fusion.
31843 * config/rs6000/fusion.md: Regenerated patterns.
31844 * config/rs6000/rs6000-cpus.def: Add
31845 OPTION_MASK_P10_FUSION_2LOGICAL.
31846 * config/rs6000/rs6000.c (rs6000_option_override_internal):
31847 Enable logical-logical fusion for p10.
31848 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2logical.
31850 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
31852 * config/rs6000/rs6000.opt: Add periods to new AIX options.
31854 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
31856 * config/rs6000/rs6000.opt (mabi=vec-extabi): New.
31857 (mabi=vec-default): New.
31858 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
31859 __EXTABI__ for AIX Vector extended ABI.
31860 * config/rs6000/rs6000.c (rs6000_debug_reg_global): Print AIX Vector
31862 (conditional_register_usage): If AIX vec_extabi enabled, vs20-vs31
31864 * doc/invoke.texi (PowerPC mabi): Add AIX vec-extabi and vec-default.
31866 2021-01-30 Jakub Jelinek <jakub@redhat.com>
31868 * config/i386/i386-features.c (remove_partial_avx_dependency): Clear
31869 DF_DEFER_INSN_RESCAN after calling df_process_deferred_rescans.
31871 2021-01-29 Vladimir N. Makarov <vmakarov@redhat.com>
31874 * lra-constraints.c (in_class_p): Don't narrow class only for REG
31877 2021-01-29 Will Schmidt <will_schmidt@vnet.ibm.com>
31879 * config/rs6000/rs6000-call.c (rs6000_expand_binop_builtin): Add
31880 clauses for CODE_FOR_vsx_xvcvuxddp_scale and
31881 CODE_FOR_vsx_xvcvsxddp_scale to the parameter checking code.
31883 2021-01-29 Andrew MacLeod <amacleod@redhat.com>
31885 PR tree-optimization/98866
31886 * gimple-range-gori.h (gori_compute::set_range_invariant): New.
31887 * gimple-range-gori.cc (gori_map::set_range_invariant): New.
31888 (gori_map::m_maybe_invariant): Rename from all_outgoing.
31889 (gori_map::gori_map): Rename all_outgoing to m_maybe_invariant.
31890 (gori_map::is_export_p): Ditto.
31891 (gori_map::calculate_gori): Ditto.
31892 (gori_compute::set_range_invariant): New.
31893 * gimple-range.cc (gimple_ranger::range_of_stmt): Set range
31894 invariant for pointers evaluating to [1, +INF].
31896 2021-01-29 Richard Biener <rguenther@suse.de>
31898 PR rtl-optimization/98863
31899 * config/i386/i386-features.c (remove_partial_avx_dependency):
31900 Do not perform DF analysis.
31901 (pass_data_remove_partial_avx_dependency): Remove
31904 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
31906 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_n
31907 builtin generator macros.
31908 * config/aarch64/aarch64-simd.md (aarch64_<su>mull_n<mode>):
31910 * config/aarch64/arm_neon.h (vmull_n_s16): Use RTL builtin
31911 instead of inline asm.
31912 (vmull_n_s32): Likewise.
31913 (vmull_n_u16): Likewise.
31914 (vmull_n_u32): Likewise.
31916 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31918 * config/aarch64/aarch64-simd-builtins.def (sabdl2, uabdl2):
31920 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl2<mode>_3):
31922 (aarch64_<sur>abdl2<mode>): ... This.
31923 (<sur>sadv16qi): Adjust use of above.
31924 * config/aarch64/arm_neon.h (vabdl_high_s8): Reimplement using
31926 (vabdl_high_s16): Likewise.
31927 (vabdl_high_s32): Likewise.
31928 (vabdl_high_u8): Likewise.
31929 (vabdl_high_u16): Likewise.
31930 (vabdl_high_u32): Likewise.
31932 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31934 * config/aarch64/aarch64-simd-builtins.def (sabal2): Define
31936 (uabal2): Likewise.
31937 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal2<mode>): New
31939 * config/aarch64/aarch64.md (unspec): Add UNSPEC_SABAL2 and
31941 * config/aarch64/arm_neon.h (vabal_high_s8): Reimplement using
31943 (vabal_high_s16): Likewise.
31944 (vabal_high_s32): Likewise.
31945 (vabal_high_u8): Likewise.
31946 (vabal_high_u16): Likewise.
31947 (vabal_high_u32): Likewise.
31948 * config/aarch64/iterators.md (ABAL2): New mode iterator.
31949 (sur): Handle UNSPEC_SABAL2, UNSPEC_UABAL2.
31951 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31953 * config/aarch64/aarch64-simd-builtins.def (sabal): Define
31956 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal<mode>_4):
31958 (aarch64_<sur>abal<mode>): ... This
31959 (<sur>sadv16qi): Adjust use of the above.
31960 * config/aarch64/arm_neon.h (vabal_s8): Reimplement using
31962 (vabal_s16): Likewise.
31963 (vabal_s32): Likewise.
31964 (vabal_u8): Likewise.
31965 (vabal_u16): Likewise.
31966 (vabal_u32): Likewise.
31968 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31970 * config/aarch64/aarch64-simd-builtins.def (saddlv, uaddlv):
31972 * config/aarch64/aarch64-simd.md (aarch64_<su>addlv<mode>):
31974 * config/aarch64/arm_neon.h (vaddlv_s8): Reimplement using
31976 (vaddlv_s16): Likewise.
31977 (vaddlv_u8): Likewise.
31978 (vaddlv_u16): Likewise.
31979 (vaddlvq_s8): Likewise.
31980 (vaddlvq_s16): Likewise.
31981 (vaddlvq_s32): Likewise.
31982 (vaddlvq_u8): Likewise.
31983 (vaddlvq_u16): Likewise.
31984 (vaddlvq_u32): Likewise.
31985 (vaddlv_s32): Likewise.
31986 (vaddlv_u32): Likewise.
31987 * config/aarch64/iterators.md (VDQV_L): New mode iterator.
31988 (unspec): Add UNSPEC_SADDLV, UNSPEC_UADDLV.
31989 (Vwstype): New mode attribute.
31991 (VWIDE_S): Likewise.
31992 (USADDLV): New int iterator.
31993 (su): Handle UNSPEC_SADDLV, UNSPEC_UADDLV.
31995 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
31997 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_lane[q]
31998 builtin generator macros.
31999 * config/aarch64/aarch64-simd.md (aarch64_vec_<su>mlsl_lane<Qlane>):
32001 * config/aarch64/arm_neon.h (vmlsl_lane_s16): Use RTL builtin
32002 instead of inline asm.
32003 (vmlsl_lane_s32): Likewise.
32004 (vmlsl_lane_u16): Likewise.
32005 (vmlsl_lane_u32): Likewise.
32006 (vmlsl_laneq_s16): Likewise.
32007 (vmlsl_laneq_s32): Likewise.
32008 (vmlsl_laneq_u16): Likewise.
32009 (vmlsl_laneq_u32): Likewise.
32011 2021-01-29 Richard Biener <rguenther@suse.de>
32013 * doc/invoke.texi (--param max-gcse-memory): Document unit
32015 * gcse.c (gcse_or_cprop_is_too_expensive): Adjust.
32016 * params.opt (--param max-gcse-memory): Adjust default and
32017 document unit of size.
32019 2021-01-29 Richard Biener <rguenther@suse.de>
32021 PR rtl-optimization/98863
32022 * gcse.c (gcse_or_cprop_is_too_expensive): Use unsigned
32023 HOST_WIDE_INT for the memory estimate.
32025 2021-01-29 Bin Cheng <bin.cheng@linux.alibaba.com>
32026 Richard Biener <rguenther@suse.de>
32028 PR tree-optimization/97627
32029 * tree-ssa-loop-niter.c (number_of_iterations_exit_assumptions):
32030 Do not analyze fake edges.
32032 2021-01-29 Richard Biener <rguenther@suse.de>
32034 PR rtl-optimization/98144
32035 * df.h (df_mir_bb_info): Add con_visited member.
32036 * df-problems.c (df_mir_alloc): Initialize con_visited,
32037 do not fully populate IN and OUT.
32038 (df_mir_reset): Likewise.
32039 (df_mir_confluence_0): Set con_visited.
32040 (df_mir_confluence_n): Properly handle implicitly
32041 fully populated IN and OUT as designated by con_visited
32042 and update con_visited accordingly.
32044 2021-01-29 Jakub Jelinek <jakub@redhat.com>
32047 * config/arm/vec-common.md (mve_vshlq_<supf><mode>,
32048 vashl<mode>3, vashr<mode>3, vlshr<mode>3): Add
32049 && !TARGET_REALLY_IWMMXT to conditions.
32051 2021-01-29 Jakub Jelinek <jakub@redhat.com>
32054 * cfgbuild.c (find_bb_boundaries): Reset debug_insn when seeing
32057 2021-01-28 Marek Polacek <polacek@redhat.com>
32060 * stor-layout.c (finalize_type_size): If we reset TYPE_USER_ALIGN in
32061 the main variant, maybe reset it in its variants too.
32062 * tree.c (check_base_type): Return true only if TYPE_USER_ALIGN match.
32063 (check_aligned_type): Check if TYPE_USER_ALIGN match.
32065 2021-01-28 Christophe Lyon <christophe.lyon@linaro.org>
32068 * config/arm/arm.c (arm_rtx_costs_internal): Adjust cost of vector
32069 of constant zero for comparisons.
32071 2021-01-28 Michael Meissner <meissner@linux.ibm.com>
32073 * config/rs6000/rs6000.c (rs6000_mangle_decl_assembler_name): Add
32074 support for mapping built-in function names for long double
32075 built-in functions if long double is IEEE 128-bit.
32077 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
32079 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_n
32080 builtin generator macros.
32081 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_n<mode>):
32083 * config/aarch64/arm_neon.h (vmlsl_n_s16): Use RTL builtin
32084 instead of inline asm.
32085 (vmlsl_n_s32): Likewise.
32086 (vmlsl_n_u16): Likewise.
32087 (vmlsl_n_u32): Likewise.
32089 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
32091 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_n
32092 builtin generator macros.
32093 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>):
32095 * config/aarch64/arm_neon.h (vmlal_n_s16): Use RTL builtin
32096 instead of inline asm.
32097 (vmlal_n_s32): Likewise.
32098 (vmlal_n_u16): Likewise.
32099 (vmlal_n_u32): Likewise.
32101 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32103 * config/aarch64/aarch64-simd-builtins.def (shrn2): Define
32105 * config/aarch64/aarch64-simd.md (aarch64_shrn2<mode>_insn_le):
32107 (aarch64_shrn2<mode>_insn_be): Likewise.
32108 (aarch64_shrn2<mode>): Likewise.
32109 * config/aarch64/arm_neon.h (vshrn_high_n_s16): Reimplement
32111 (vshrn_high_n_s32): Likewise.
32112 (vshrn_high_n_s64): Likewise.
32113 (vshrn_high_n_u16): Likewise.
32114 (vshrn_high_n_u32): Likewise.
32115 (vshrn_high_n_u64): Likewise.
32117 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32119 * config/aarch64/aarch64-simd-builtins.def (shrn): Define
32121 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le):
32123 (aarch64_shrn<mode>_insn_be): Likewise.
32124 (aarch64_shrn<mode>): Likewise.
32125 * config/aarch64/arm_neon.h (vshrn_n_s16): Reimplement using
32127 (vshrn_n_s32): Likewise.
32128 (vshrn_n_s64): Likewise.
32129 (vshrn_n_u16): Likewise.
32130 (vshrn_n_u32): Likewise.
32131 (vshrn_n_u64): Likewise.
32132 * config/aarch64/iterators.md (vn_mode): New mode attribute.
32134 2021-01-28 Richard Biener <rguenther@suse.de>
32136 PR rtl-optimization/80960
32137 * dse.c (check_mem_read_rtx): Call get_addr on the
32140 2021-01-28 Xionghu Luo <luoxhu@linux.ibm.com>
32141 David Edelsohn <dje.gcc@gmail.com>
32144 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
32145 Don't generate VIEW_CONVERT_EXPR for fcode ALTIVEC_BUILTIN_VEC_INSERT
32147 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
32149 * config/rs6000/rs6000.c (rs6000_expand_vector_set): Remove the
32150 wrapper call rs6000_expand_vector_set_var for cleanup. Call
32151 rs6000_expand_vector_set_var_p9 and rs6000_expand_vector_set_var_p8
32153 (rs6000_expand_vector_set_var): Delete.
32154 (rs6000_expand_vector_set_var_p9): Make static.
32155 (rs6000_expand_vector_set_var_p8): Make static.
32157 2021-01-28 Xing GUO <higuoxing@gmail.com>
32159 * common/config/riscv/riscv-common.c
32160 (riscv_subset_list::parsing_subset_version): Fix -march option parsing
32161 when `p` extension exists.
32163 2021-01-27 Vladimir N. Makarov <vmakarov@redhat.com>
32165 PR rtl-optimization/97684
32166 * ira.c (ira): Call ira_set_pseudo_classes before
32167 update_equiv_regs when it is necessary.
32169 2021-01-27 Jakub Jelinek <jakub@redhat.com>
32172 * config/aarch64/aarch64.md (*aarch64_bfxilsi_uxtw): Use
32173 %w0, %w1 and %2 instead of %0, %1 and %2.
32175 2021-01-27 Aaron Sawdey <acsawdey@linux.ibm.com>
32177 * config/rs6000/genfusion.pl: New script to generate
32178 define_insn_and_split patterns so combine can arrange fused
32179 instructions next to each other.
32180 * config/rs6000/fusion.md: New file, generated fused instruction
32181 patterns for combine.
32182 * config/rs6000/predicates.md (const_m1_to_1_operand): New predicate.
32183 (non_update_memory_operand): New predicate.
32184 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_P10_FUSION and
32185 OPTION_MASK_P10_FUSION_LD_CMPI to ISA_3_1_MASKS_SERVER and
32187 * config/rs6000/rs6000-protos.h (address_is_non_pfx_d_or_x): Add
32189 * config/rs6000/rs6000.c (rs6000_option_override_internal):
32190 Automatically set OPTION_MASK_P10_FUSION and
32191 OPTION_MASK_P10_FUSION_LD_CMPI if target is power10.
32192 (rs6000_opt_masks): Allow -mpower10-fusion
32193 in function attributes.
32194 (address_is_non_pfx_d_or_x): New function.
32195 * config/rs6000/rs6000.h: Add MASK_P10_FUSION.
32196 * config/rs6000/rs6000.md: Include fusion.md.
32197 * config/rs6000/rs6000.opt: Add -mpower10-fusion
32198 and -mpower10-fusion-ld-cmpi.
32199 * config/rs6000/t-rs6000: Add dependencies involving fusion.md.
32201 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
32203 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal
32204 builtin generator macros.
32205 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal<mode>):
32207 (aarch64_<su>mlal<mode>): This.
32208 * config/aarch64/arm_neon.h (vmlal_s8): Use RTL builtin
32209 instead of inline asm.
32210 (vmlal_s16): Likewise.
32211 (vmlal_s32): Likewise.
32212 (vmlal_u8): Likewise.
32213 (vmlal_u16): Likewise.
32214 (vmlal_u32): Likewise.
32216 2021-01-27 Richard Biener <rguenther@suse.de>
32218 PR tree-optimization/98854
32219 * tree-vect-slp.c (vect_build_slp_tree_2): Also build
32220 PHIs from scalars when the number of CTORs matches the
32221 number of children.
32223 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
32225 * config/aarch64/aarch64-simd-builtins.def: Add mls_n builtin
32227 * config/aarch64/aarch64-simd.md (*aarch64_mls_elt_merge<mode>):
32229 (aarch64_mls_n<mode>): This.
32230 * config/aarch64/arm_neon.h (vmls_n_s16): Use RTL builtin
32232 (vmls_n_s32): Likewise.
32233 (vmls_n_u16): Likewise.
32234 (vmls_n_u32): Likewise.
32235 (vmlsq_n_s16): Likewise.
32236 (vmlsq_n_s32): Likewise.
32237 (vmlsq_n_u16): Likewise.
32238 (vmlsq_n_u32): Likewise.
32240 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
32242 * config/aarch64/aarch64-simd-builtins.def: Add mls builtin
32244 * config/aarch64/arm_neon.h (vmls_s8): Use RTL builtin rather
32246 (vmls_s16): Likewise.
32247 (vmls_s32): Likewise.
32248 (vmls_u8): Likewise.
32249 (vmls_u16): Likewise.
32250 (vmls_u32): Likewise.
32251 (vmlsq_s8): Likewise.
32252 (vmlsq_s16): Likewise.
32253 (vmlsq_s32): Likewise.
32254 (vmlsq_u8): Likewise.
32255 (vmlsq_u16): Likewise.
32256 (vmlsq_u32): Likewise.
32258 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
32260 * config/aarch64/aarch64-simd-builtins.def: Add mla_n builtin
32262 * config/aarch64/aarch64-simd.md (*aarch64_mla_elt_merge<mode>):
32264 (aarch64_mla_n<mode>): This.
32265 * config/aarch64/arm_neon.h (vmla_n_s16): Use RTL builtin
32267 (vmla_n_s32): Likewise.
32268 (vmla_n_u16): Likewise.
32269 (vmla_n_u32): Likewise.
32270 (vmlaq_n_s16): Likewise.
32271 (vmlaq_n_s32): Likewise.
32272 (vmlaq_n_u16): Likewise.
32273 (vmlaq_n_u32): Likewise.
32275 2021-01-27 liuhongt <hongtao.liu@intel.com>
32278 * config/i386/sse.md (sse2_gt<mode>3): Drop !TARGET_XOP in condition.
32279 (*sse2_eq<mode>3): Ditto.
32281 2021-01-27 Jakub Jelinek <jakub@redhat.com>
32283 * tree-pass.h (PROP_trees): Rename to ...
32284 (PROP_gimple): ... this.
32285 * cfgexpand.c (pass_data_expand): Replace PROP_trees with PROP_gimple.
32286 * passes.c (execute_function_dump, execute_function_todo,
32287 execute_one_ipa_transform_pass, execute_one_pass): Likewise.
32288 * varpool.c (ctor_for_folding): Likewise.
32290 2021-01-27 Jakub Jelinek <jakub@redhat.com>
32292 PR tree-optimization/97260
32293 * varpool.c: Include tree-pass.h.
32294 (ctor_for_folding): In GENERIC return DECL_INITIAL for TREE_READONLY
32295 non-TREE_SIDE_EFFECTS automatic variables.
32297 2021-01-26 Paul Fee <paul.f.fee@gmail.com>
32299 * doc/cpp.texi (__cplusplus): Document value for -std=c++23
32301 * doc/invoke.texi: Document -std=c++23 and -std=gnu++23.
32302 * dwarf2out.c (highest_c_language): Recognise C++20 and C++23.
32303 (gen_compile_unit_die): Recognise C++23.
32305 2021-01-26 Jakub Jelinek <jakub@redhat.com>
32308 * dwarf2asm.c (dw2_assemble_integer): Cast DWARF2_ADDR_SIZE to int
32311 2021-01-26 Jakub Jelinek <jakub@redhat.com>
32314 * config/aarch64/aarch64.c (aarch64_mask_and_shift_for_ubfiz_p):
32315 Use UINTVAL (shft_amnt) and UINTVAL (mask) instead of INTVAL (shft_amnt)
32316 and INTVAL (mask). Add && INTVAL (mask) > 0 condition.
32318 2021-01-26 Richard Biener <rguenther@suse.de>
32320 * gimple-pretty-print.c (dump_binary_rhs): Handle
32321 VEC_WIDEN_{PLUS,MINUS}_{LO,HI}_EXPR.
32323 2021-01-26 Richard Biener <rguenther@suse.de>
32325 PR middle-end/98726
32326 * tree.h (vector_cst_int_elt): Remove.
32327 * tree.c (vector_cst_int_elt): Use poly_wide_int for computations,
32330 2021-01-26 Andrew Stubbs <ams@codesourcery.com>
32332 * config/gcn/gcn.c (gcn_expand_reduc_scalar): Use move instructions
32333 for V64DFmode min/max reductions.
32335 2021-01-26 Jakub Jelinek <jakub@redhat.com>
32337 * dwarf2asm.c (dw2_assemble_integer): Handle size twice as large
32338 as DWARF2_ADDR_SIZE if x is not a scalar int by emitting it as
32339 two halves, one with x and the other with const0_rtx, ordered
32340 depending on endianity.
32342 2021-01-26 Alexandre Oliva <oliva@adacore.com>
32344 * gimplify.c (gimplify_decl_expr): Skip asan marking calls for
32345 temporaries not seen in binding block, and not about to be
32346 added as gimple variables.
32348 2021-01-25 Martin Sebor <msebor@redhat.com>
32351 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Adjust warning text.
32353 2021-01-25 Martin Liska <mliska@suse.cz>
32355 * value-prof.c (get_nth_most_common_value): Use %s instead
32358 2021-01-25 Jakub Jelinek <jakub@redhat.com>
32361 * configure.ac (HAVE_AS_GDWARF_5_DEBUG_FLAG): Only define if
32362 readelf -wi is able to read the emitted .debug_info back.
32363 * configure: Regenerated.
32365 2021-01-25 Martin Liska <mliska@suse.cz>
32367 PR gcov-profile/98739
32368 * common.opt: Add missing sign symbol.
32369 * value-prof.c (get_nth_most_common_value): Restore handling
32370 of PROFILE_REPRODUCIBILITY_PARALLEL_RUNS and
32371 PROFILE_REPRODUCIBILITY_MULTITHREADED.
32373 2021-01-25 Richard Biener <rguenther@suse.de>
32375 PR middle-end/98807
32376 * tree.c (vector_element_bits): Always use precision of
32377 the element type for boolean vectors.
32379 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
32381 * config/rtems.h (STARTFILE_SPEC): Remove qnolinkcmds.
32382 (ENDFILE_SPEC): Evaluate qnolinkcmds.
32384 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
32386 * config/rtems.h (STARTFILE_SPEC): Remove nostdlib and
32387 nostartfiles handling since this is already done by
32388 LINK_COMMAND_SPEC. Evaluate qnolinkcmds.
32389 (ENDFILE_SPEC): Remove nostdlib and nostartfiles handling since this
32390 is already done by LINK_COMMAND_SPEC.
32391 (LIB_SPECS): Remove nostdlib and nodefaultlibs handling since
32392 this is already done by LINK_COMMAND_SPEC. Remove qnolinkcmds
32395 2021-01-25 Jakub Jelinek <jakub@redhat.com>
32398 * fold-const-call.c (host_size_t_cst_p): Renamed to ...
32399 (size_t_cst_p): ... this. Check and store unsigned HOST_WIDE_INT
32400 value rather than host size_t.
32401 (fold_const_call): Change type of s2 from size_t to
32402 unsigned HOST_WIDE_INT. Use size_t_cst_p instead of
32403 host_size_t_cst_p. For strncmp calls, pass MIN (s2, SIZE_MAX)
32404 instead of s2 as last argument.
32406 2021-01-25 Tamar Christina <tamar.christina@arm.com>
32408 * config/arm/iterators.md (rotsplit1, rotsplit2, conj_op, fcmac1,
32409 VCMLA_OP, VCMUL_OP): New.
32410 * config/arm/mve.md (mve_vcmlaq<mve_rot><mode>): Support vec_dup 0.
32411 * config/arm/neon.md (cmul<conj_op><mode>3): New.
32412 * config/arm/unspecs.md (UNSPEC_VCMLA_CONJ, UNSPEC_VCMLA180_CONJ,
32413 UNSPEC_VCMUL_CONJ): New.
32414 * config/arm/vec-common.md (cmul<conj_op><mode>3, arm_vcmla<rot><mode>,
32415 cml<fcmac1><conj_op><mode>4): New.
32417 2021-01-23 Jakub Jelinek <jakub@redhat.com>
32420 * config/rs6000/mmintrin.h (__m64): Add __may_alias__ attribute.
32422 2021-01-22 Jonathan Wright <jonathan.wright@arm.com>
32424 * config/aarch64/aarch64-simd-builtins.def: Add mla builtin
32426 * config/aarch64/arm_neon.h (vmla_s8): Use RTL builtin rather
32428 (vmla_s16): Likewise.
32429 (vmla_s32): Likewise.
32430 (vmla_u8): Likewise.
32431 (vmla_u16): Likewise.
32432 (vmla_u32): Likewise.
32433 (vmlaq_s8): Likewise.
32434 (vmlaq_s16): Likewise.
32435 (vmlaq_s32): Likewise.
32436 (vmlaq_u8): Likewise.
32437 (vmlaq_u16): Likewise.
32438 (vmlaq_u32): Likewise.
32440 2021-01-22 David Malcolm <dmalcolm@redhat.com>
32442 * doc/invoke.texi (GCC_EXTRA_DIAGNOSTIC_OUTPUT): Add @findex
32445 2021-01-22 Jakub Jelinek <jakub@redhat.com>
32448 * dwarf2out.c (output_file_names): For -gdwarf-5, if there are no
32449 filenames to emit, still emit the required 0 index directory and
32450 filename entries that match DW_AT_comp_dir and DW_AT_name of the
32453 2021-01-22 Marek Polacek <polacek@redhat.com>
32456 * doc/invoke.texi: Update C++ ABI Version 15 description.
32458 2021-01-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32460 PR tree-optimization/98766
32461 * tree-ssa-math-opts.c (convert_mult_to_fma): Use maybe_le when
32462 comparing against type size with param_avoid_fma_max_bits.
32464 2021-01-22 Richard Biener <rguenther@suse.de>
32466 PR middle-end/98793
32467 * tree.c (vector_element_bits): Key single-bit bool vector on
32468 integer mode rather than not vector mode.
32470 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
32473 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
32474 Generate ARRAY_REF(VIEW_CONVERT_EXPR) for P8 and later
32476 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): Update
32477 to call different path for P8 and P9.
32478 (rs6000_expand_vector_set_var_p9): New function.
32479 (rs6000_expand_vector_set_var_p8): New function.
32481 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
32485 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
32486 Adjust variable index vec_insert from address dereference to
32487 ARRAY_REF(VIEW_CONVERT_EXPR) tree expression.
32488 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
32490 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): New function.
32492 2021-01-22 Martin Liska <mliska@suse.cz>
32494 PR gcov-profile/98739
32495 * profile.c (compute_value_histograms): Drop time profile for
32496 -fprofile-reproducible=multithreaded.
32498 2021-01-22 Nathan Sidwell <nathan@acm.org>
32500 * gcc.c (process_command): Don't check OPT_SPECIAL_input_file
32503 2021-01-22 Richard Biener <rguenther@suse.de>
32505 PR middle-end/98773
32506 * tree-data-ref.c (initialize_matrix_A): Revert previous
32507 change, retaining failing on HOST_WIDE_INT_MIN CHREC_RIGHT.
32509 2021-01-22 Jakub Jelinek <jakub@redhat.com>
32511 PR tree-optimization/90248
32512 * match.pd (X cmp 0.0 ? 1.0 : -1.0 -> copysign(1, +-X),
32513 X cmp 0.0 ? -1.0 : +1.0 -> copysign(1, -+X)): Remove
32515 (X * (X cmp 0.0 ? 1.0 : -1.0) -> +-abs(X),
32516 X * (X cmp 0.0 ? -1.0 : 1.0) -> +-abs(X)): New simplifications.
32518 2021-01-22 Jakub Jelinek <jakub@redhat.com>
32520 PR tree-optimization/98255
32521 * tree-dfa.c (get_ref_base_and_extent): For ARRAY_REFs, sign
32522 extend index - low_bound from sizetype's precision rather than index
32524 (get_addr_base_and_unit_offset_1): Likewise.
32525 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Likewise.
32526 * gimple-fold.c (fold_const_aggregate_ref_1): Likewise.
32528 2021-01-22 Richard Biener <rguenther@suse.de>
32530 PR tree-optimization/98786
32531 * tree-ssa-phiopt.c (factor_out_conditional_conversion): Avoid
32532 adding new uses of abnormals. Verify we deal with a conditional
32535 2021-01-22 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
32538 * optc-save-gen.awk: Add arm_fp16_format to checked_options.
32540 2021-01-22 liuhongt <hongtao.liu@intel.com>
32544 * config/i386/sse.md (VI_128_256): New mode iterator.
32545 (*avx_cmp<mode>3_1, *avx_cmp<mode>3_2, *avx_cmp<mode>3_3,
32546 *avx_cmp<mode>3_4, *avx2_eq<mode>3, *avx2_pcmp<mode>3_1,
32547 *avx2_pcmp<mode>3_2, *avx2_gt<mode>3): New
32548 define_insn_and_split to lower avx512 vector comparison to avx
32549 version when dest is vector.
32550 (*<avx512>_cmp<mode>3, *<avx512>_cmp<mode>3, *<avx512>_ucmp<mode>3):
32551 define_insn_and_split for negating the comparison result.
32552 * config/i386/predicates.md (float_vector_all_ones_operand):
32554 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Use
32555 general NOT operator without UNSPEC_MASKOP.
32557 2021-01-21 Vladimir N. Makarov <vmakarov@redhat.com>
32559 PR rtl-optimization/98777
32560 * lra-int.h (lra_pmode_pseudo): New extern.
32561 * lra.c (lra_pmode_pseudo): New global.
32563 * lra-eliminations.c (eliminate_regs_in_insn): Use it.
32565 2021-01-21 Ilya Leoshkevich <iii@linux.ibm.com>
32567 * fwprop.c (fwprop_propagation::classify_result): Allow
32568 (subreg (mem)) simplifications.
32570 2021-01-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32572 * config/aarch64/aarch64-simd.md (aarch64_sqdml<SBINQOPS:as>l<mode>):
32574 (aarch64_sqdmlal<mode>): ... This...
32575 (aarch64_sqdmlsl<mode>): ... And this.
32576 (aarch64_sqdml<SBINQOPS:as>l_lane<mode>): Split into...
32577 (aarch64_sqdmlal_lane<mode>): ... This...
32578 (aarch64_sqdmlsl_lane<mode>): ... And this.
32579 (aarch64_sqdml<SBINQOPS:as>l_laneq<mode>): Split into...
32580 (aarch64_sqdmlsl_laneq<mode>): ... This...
32581 (aarch64_sqdmlal_laneq<mode>): ... And this.
32582 (aarch64_sqdml<SBINQOPS:as>l_n<mode>): Split into...
32583 (aarch64_sqdmlsl_n<mode>): ... This...
32584 (aarch64_sqdmlal_n<mode>): ... And this.
32585 (aarch64_sqdml<SBINQOPS:as>l2<mode>_internal): Split into...
32586 (aarch64_sqdmlal2<mode>_internal): ... This...
32587 (aarch64_sqdmlsl2<mode>_internal): ... And this.
32589 2021-01-21 Christophe Lyon <christophe.lyon@linaro.org>
32591 * config/arm/arm_mve.h (__arm_vcmpneq_s8): Fix return type.
32593 2021-01-21 Andrea Corallo <andrea.corallo@arm.com>
32596 * doc/sourcebuild.texi (arm_thumb2_no_arm_v8_1_lob): Document.
32598 2021-01-21 liuhongt <hongtao.liu@intel.com>
32600 PR rtl-optimization/98694
32601 * regcprop.c (copy_value): If SRC had been assigned a mode
32602 narrower than the copy, we can't link DEST into the chain even if
32603 they have the same hard_regno_nregs (i.e. HImode/SImode in i386
32606 2021-01-20 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32608 * config/aarch64/aarch64-simd.md (aarch64_get_lane<mode>):
32609 Convert to define_insn_and_split. Split into simple move when moving
32612 2021-01-20 Segher Boessenkool <segher@kernel.crashing.org>
32614 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_store): Change assert.
32615 Adjust comment. Simplify code.
32617 2021-01-20 Jakub Jelinek <jakub@redhat.com>
32620 * dwarf2out.c (reset_indirect_string): Also reset indirect strings
32621 with DW_FORM_line_strp form.
32622 (prune_unused_types_update_strings): Don't add into debug_str_hash
32623 indirect strings with DW_FORM_line_strp form.
32624 (adjust_name_comp_dir): New function.
32625 (dwarf2out_finish): Call it on CU DIEs after resetting
32626 debug_line_str_hash.
32628 2021-01-20 Vladimir N. Makarov <vmakarov@redhat.com>
32630 PR rtl-optimization/98722
32631 * lra-eliminations.c (eliminate_regs_in_insn): Check that target
32632 has no 3-op add insn to transform insns containing two pluses.
32634 2021-01-20 Richard Biener <rguenther@suse.de>
32636 * hwint.h (add_hwi): New function.
32637 (mul_hwi): Likewise.
32638 * tree-data-ref.c (initialize_matrix_A): Properly translate
32639 tree constants and avoid HOST_WIDE_INT_MIN.
32640 (lambda_matrix_row_add): Avoid undefined integer overflow
32641 and return true on such overflow.
32642 (lambda_matrix_right_hermite): Handle overflow from
32643 lambda_matrix_row_add gracefully. Simplify previous fix.
32644 (analyze_subscript_affine_affine): Likewise.
32646 2021-01-20 Eugene Rozenfeld <erozen@microsoft.com>
32648 PR tree-optimization/96674
32649 * match.pd: New patterns: x < y || y == XXX_MIN --> x <= y - 1
32650 x >= y && y != XXX_MIN --> x > y - 1
32652 2021-01-20 Richard Sandiford <richard.sandiford@arm.com>
32654 PR tree-optimization/98535
32655 * tree-vect-slp.c (duplicate_and_interleave): Use quick_grow_cleared.
32656 If the high and low permutes are the same, remove the high permutes
32657 from the working set and only continue with the low ones.
32659 2021-01-20 Jakub Jelinek <jakub@redhat.com>
32661 PR tree-optimization/98721
32662 * builtins.c (access_ref::inform_access): Don't assume
32663 SSA_NAME_IDENTIFIER must be non-NULL. Print messages about
32664 object whenever allocfn is NULL, rather than only when DECL_P
32665 is true. Use %qE instead of %qD for that. Formatting fixes.
32667 2021-01-20 Richard Biener <rguenther@suse.de>
32669 PR tree-optimization/98758
32670 * tree-data-ref.c (int_divides_p): Use lambda_int arguments.
32671 (lambda_matrix_right_hermite): Avoid undefinedness with
32672 signed integer abs and multiplication.
32673 (analyze_subscript_affine_affine): Use lambda_int.
32675 2021-01-20 David Malcolm <dmalcolm@redhat.com>
32678 * dwarf2out.c (output_line_info): Rename static variable
32679 "generation", moving it out of the function to...
32680 (output_line_info_generation): New.
32681 (init_sections_and_labels): Likewise, renaming the variable to...
32682 (init_sections_and_labels_generation): New.
32683 (dwarf2out_c_finalize): Reset the new variables.
32685 2021-01-19 Martin Sebor <msebor@redhat.com>
32687 PR middle-end/98664
32688 * tree-ssa-live.c (remove_unused_scope_block_p): Keep scopes for
32689 all functions, even if they're not declared artificial or inline.
32690 * tree.c (tree_inlined_location): Use macro expansion location
32691 only if scope traversal fails to expose one.
32693 2021-01-19 Richard Sandiford <richard.sandiford@arm.com>
32695 PR rtl-optimization/92294
32696 * alias.c (compare_base_symbol_refs): Take an extra parameter
32697 and add the distance between two symbols to it. Enshrine in
32698 comments that -1 means "either 0 or 1, but we can't tell
32699 which at compile time".
32700 (memrefs_conflict_p): Update call accordingly.
32701 (rtx_equal_for_memref_p): Likewise. Take the distance between symbols
32704 2021-01-19 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32706 * config/aarch64/aarch64-simd-builtins.def (sqshl, uqshl,
32707 sqrshl, uqrshl, sqadd, uqadd, sqsub, uqsub, suqadd, usqadd, sqmovn,
32708 uqmovn, sqxtn2, uqxtn2, sqabs, sqneg, sqdmlal, sqdmlsl, sqdmlal_lane,
32709 sqdmlsl_lane, sqdmlal_laneq, sqdmlsl_laneq, sqdmlal_n, sqdmlsl_n,
32710 sqdmlal2, sqdmlsl2, sqdmlal2_lane, sqdmlsl2_lane, sqdmlal2_laneq,
32711 sqdmlsl2_laneq, sqdmlal2_n, sqdmlsl2_n, sqdmull, sqdmull_lane,
32712 sqdmull_laneq, sqdmull_n, sqdmull2, sqdmull2_lane, sqdmull2_laneq,
32713 sqdmull2_n, sqdmulh, sqrdmulh, sqdmulh_lane, sqdmulh_laneq,
32714 sqrdmulh_lane, sqrdmulh_laneq, sqshrun_n, sqrshrun_n, sqshrn_n,
32715 uqshrn_n, sqrshrn_n, uqrshrn_n, sqshlu_n, sqshl_n, uqshl_n, sqrdmlah,
32716 sqrdmlsh, sqrdmlah_lane, sqrdmlsh_lane, sqrdmlah_laneq, sqrdmlsh_laneq,
32717 sqmovun): Use NONE flags.
32719 2021-01-19 Richard Biener <rguenther@suse.de>
32722 * ipa-modref.c (analyze_stmt): Only record a summary for a
32725 2021-01-19 Richard Biener <rguenther@suse.de>
32727 PR middle-end/98638
32728 * tree-ssanames.c (fini_ssanames): Zero SSA_NAME_DEF_STMT.
32730 2021-01-19 Daniel Hellstrom <daniel@gaisler.com>
32732 * config/sparc/rtemself.h (TARGET_OS_CPP_BUILTINS): Add
32733 built-in define __FIX_LEON3FT_TN0018.
32735 2021-01-19 Richard Biener <rguenther@suse.de>
32738 * tree-inline.c (tree_function_versioning): Set input_location
32739 to UNKNOWN_LOCATION throughout the function.
32741 2021-01-19 Tobias Burnus <tobias@codesourcery.com>
32744 * omp-low.c (lower_omp_target): Handle nonpointer is_device_ptr.
32746 2021-01-19 Martin Jambor <mjambor@suse.cz>
32749 * ipa-sra.c (ssa_name_only_returned_p): New parameter fun. Check
32750 whether non-call exceptions allow removal of a statement.
32751 (isra_analyze_call): Pass the appropriate function to
32752 ssa_name_only_returned_p.
32754 2021-01-19 Geng Qi <gengqi@linux.alibaba.com>
32756 * config/riscv/arch-canonicalize (longext_sort): New function for
32757 sorting 'multi-letter'.
32758 * config/riscv/multilib-generator: Adjusting the loop of 'alt' in
32759 'alts'. The 'arch' may not be the first of 'alts'.
32760 (_expand_combination): Add underscore for the 'ext' without '*'.
32761 This is because a single-letter extension can always be treated well
32762 with a '_' prefix, but it cannot be separated out if it is appended
32765 2021-01-18 Vladimir N. Makarov <vmakarov@redhat.com>
32768 * ira.c (ira): Skip abnormal critical edge splitting.
32770 2021-01-18 Jakub Jelinek <jakub@redhat.com>
32772 PR tree-optimization/98727
32773 * tree-ssa-math-opts.c (match_arith_overflow): Fix up computation of
32774 second .MUL_OVERFLOW operand for signed multiplication with overflow
32775 checking if the second operand of multiplication is not constant.
32777 2021-01-18 David Edelsohn <dje.gcc@gmail.com>
32779 * doc/invoke.texi (-gdwarf): TPF defaults to version 2 and AIX
32780 defaults to version 4.
32782 2021-01-18 David Malcolm <dmalcolm@redhat.com>
32784 * attribs.h (fndecl_dealloc_argno): New decl.
32785 * builtins.c (call_dealloc_argno): Split out second half of
32787 (fndecl_dealloc_argno): New.
32788 * doc/extend.texi (Common Function Attributes): Document the
32789 interaction between the analyzer and the malloc attribute.
32790 * doc/invoke.texi (Static Analyzer Options): Likewise.
32792 2021-01-17 David Edelsohn <dje.gcc@gmail.com>
32794 * config/rs6000/aix71.h (SUBTARGET_OVERRIDE_OPTIONS): Override
32795 dwarf_version to 4.
32796 * config/rs6000/aix72.h (SUBTARGET_OVERRIDE_OPTIONS): Same.
32798 2021-01-17 Martin Jambor <mjambor@suse.cz>
32801 * cgraph.c (clone_of_p): Check also former_clone_of as we climb
32804 2021-01-17 Mark Wielaard <mark@klomp.org>
32806 * common.opt (gdwarf-): Init(5).
32807 * doc/invoke.texi (-gdwarf): Document default to 5.
32809 2021-01-16 Kwok Cheung Yeung <kcy@codesourcery.com>
32811 * builtin-types.def
32812 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT): Rename
32814 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR):
32815 ...this. Add extra argument.
32816 * gimplify.c (omp_default_clause): Ensure that event handle is
32817 firstprivate in a task region.
32818 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_DETACH.
32819 (gimplify_adjust_omp_clauses): Likewise.
32820 * omp-builtins.def (BUILT_IN_GOMP_TASK): Change function type to
32821 BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR.
32822 * omp-expand.c (expand_task_call): Add GOMP_TASK_FLAG_DETACH to flags
32823 if detach clause specified. Add detach argument when generating
32825 * omp-low.c (scan_sharing_clauses): Setup data environment for detach
32827 (finish_taskreg_scan): Move field for variable containing the event
32828 handle to the front of the struct.
32829 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_DETACH. Fix
32831 * tree-nested.c (convert_nonlocal_omp_clauses): Handle
32832 OMP_CLAUSE_DETACH clause.
32833 (convert_local_omp_clauses): Handle OMP_CLAUSE_DETACH clause.
32834 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_DETACH.
32835 * tree.c (omp_clause_num_ops): Add entry for OMP_CLAUSE_DETACH.
32837 (omp_clause_code_name): Add entry for OMP_CLAUSE_DETACH. Fix
32839 (walk_tree_1): Handle OMP_CLAUSE_DETACH.
32841 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32843 * config/nios2/t-rtems: Reset all MULTILIB_* variables. Shorten
32844 multilib directory names. Use MULTILIB_REQUIRED instead of
32845 MULTILIB_EXCEPTIONS. Add -mhw-mul -mhw-mulx -mhw-div
32846 -mcustom-fpu-cfg=fph2 multilib.
32848 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32850 * config/nios2/nios2.c (NIOS2_FPU_CONFIG_NUM): Adjust value.
32851 (nios2_init_fpu_configs): Provide register values for new
32852 -mcustom-fpu-cfg=fph2 option variant.
32853 * doc/invoke.texi (-mcustom-fpu-cfg=fph2): Document new option
32856 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32858 * config/nios2/nios2.c (nios2_custom_check_insns): Remove
32859 custom instruction warnings.
32861 2021-01-16 Jakub Jelinek <jakub@redhat.com>
32863 PR tree-optimization/96669
32864 * match.pd ((CST << x) & 1 -> x == 0): New simplification.
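A small check of the simplification above, written as a sketch rather than as the match.pd rule itself. It assumes CST is odd (the entry does not state the condition) and uses a hypothetical constant and hypothetical helper names; shift counts are limited to 0..15 so the shift is well defined for any conforming `unsigned`.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical odd constant standing in for CST.  */
#define ODD_CST 5u

/* Original form: the low bit of the shifted constant.  */
static bool shifted_low_bit (unsigned x) { return ((ODD_CST << x) & 1u) != 0; }

/* Simplified form.  */
static bool is_zero (unsigned x) { return x == 0; }

/* Return the number of shift counts in [0, 15] on which the two
   forms disagree.  */
int
count_pr96669_mismatches (void)
{
  int bad = 0;
  for (unsigned x = 0; x < 16; x++)
    if (shifted_low_bit (x) != is_zero (x))
      bad++;
  return bad;
}

/* Convenience wrapper for use from a test harness.  */
void
check_pr96669 (void)
{
  assert (count_pr96669_mismatches () == 0);
}
```

Any nonzero shift moves a zero into bit 0, so only `x == 0` leaves the low bit of an odd constant set.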
32866 2021-01-16 Jakub Jelinek <jakub@redhat.com>
32868 PR tree-optimization/96271
32869 * passes.def: Pass false argument to first two pass_cd_dce
32870 instances and true to last instance. Add comment that
32871 last instance rewrites no longer addressed locals.
32872 * tree-ssa-dce.c (pass_cd_dce): Add update_address_taken_p member and
32874 (pass_cd_dce::set_pass_param): New method.
32875 (pass_cd_dce::execute): Return TODO_update_address_taken from
32876 last cd_dce instance.
32878 2021-01-15 Carl Love <cel@us.ibm.com>
32880 * config/rs6000/altivec.h (vec_mulh, vec_div, vec_dive, vec_mod):
32882 * config/rs6000/altivec.md (VIlong): Move define to file vsx.md.
32883 * config/rs6000/rs6000-builtin.def (DIVES_V4SI, DIVES_V2DI,
32884 DIVEU_V4SI, DIVEU_V2DI, DIVS_V4SI, DIVS_V2DI, DIVU_V4SI,
32885 DIVU_V2DI, MODS_V2DI, MODS_V4SI, MODU_V2DI, MODU_V4SI,
32886 MULHS_V2DI, MULHS_V4SI, MULHU_V2DI, MULHU_V4SI, MULLD_V2DI):
32887 Add builtin define.
32888 (MULH, DIVE, MOD): Add new BU_P10_OVERLOAD_2 definitions.
32889 * config/rs6000/rs6000-call.c (VSX_BUILTIN_VEC_DIV,
32890 VSX_BUILTIN_VEC_DIVE, P10_BUILTIN_VEC_MOD, P10_BUILTIN_VEC_MULH):
32891 New overloaded definitions.
32892 (builtin_function_type) [P10V_BUILTIN_DIVEU_V4SI,
32893 P10V_BUILTIN_DIVEU_V2DI, P10V_BUILTIN_DIVU_V4SI,
32894 P10V_BUILTIN_DIVU_V2DI, P10V_BUILTIN_MODU_V2DI,
32895 P10V_BUILTIN_MODU_V4SI, P10V_BUILTIN_MULHU_V2DI,
32896 P10V_BUILTIN_MULHU_V4SI]: Add case
32897 statement for builtins.
32898 * config/rs6000/rs6000.md (bits): Add new attribute sizes V4SI, V2DI.
32899 * config/rs6000/vsx.md (VIlong): Moved from config/rs6000/altivec.md.
32900 (UNSPEC_VDIVES, UNSPEC_VDIVEU): New unspec definitions.
32901 (vsx_mul_v2di): Add if TARGET_POWER10 statement.
32902 (vsx_udiv_v2di): Add if TARGET_POWER10 statement.
32903 (dives_<mode>, diveu_<mode>, div<mode>3, uvdiv<mode>3,
32904 mods_<mode>, modu_<mode>, mulhs_<mode>, mulhu_<mode>, mulv2di3):
32905 Add define_insn, mode is VIlong.
32906 * doc/extend.texi (vec_mulh, vec_mul, vec_div, vec_dive, vec_mod):
32907 Add builtin descriptions.
32909 2021-01-15 Eric Botcazou <ebotcazou@adacore.com>
32911 * final.c (final_start_function_1): Reset force_source_line.
32913 2021-01-15 Jakub Jelinek <jakub@redhat.com>
32915 PR tree-optimization/96669
32916 * match.pd (((1 << A) & 1) != 0 -> A == 0,
32917 ((1 << A) & 1) == 0 -> A != 0): Generalize for 1s replaced by
32918 possibly different power of two constants and to right shift too.
32920 2021-01-15 Jakub Jelinek <jakub@redhat.com>
32922 PR tree-optimization/96681
32923 * match.pd ((x < 0) ^ (y < 0) to (x ^ y) < 0): New simplification.
32924 ((x >= 0) ^ (y >= 0) to (x ^ y) < 0): Likewise.
32925 ((x < 0) ^ (y >= 0) to (x ^ y) >= 0): Likewise.
32926 ((x >= 0) ^ (y < 0) to (x ^ y) >= 0): Likewise.
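The four PR tree-optimization/96681 sign-test rewrites above can be checked exhaustively on a narrow type. This is an illustrative sketch assuming 8-bit signed operands; the function names are hypothetical and are not part of GCC.

```c
#include <assert.h>
#include <stdint.h>

/* Return the number of (x, y) pairs on which any of the four
   rewrites disagrees with its original expression.  */
int
count_pr96681_mismatches (void)
{
  int bad = 0;
  for (int i = INT8_MIN; i <= INT8_MAX; i++)
    for (int j = INT8_MIN; j <= INT8_MAX; j++)
      {
        int8_t x = (int8_t) i, y = (int8_t) j;
        /* Operands promote to int; sign extension keeps the sign bit,
           so (x ^ y) is negative exactly when the signs differ.  */
        int differ = (x ^ y) < 0;
        int same = (x ^ y) >= 0;

        if (((x < 0) ^ (y < 0)) != differ)
          bad++;
        if (((x >= 0) ^ (y >= 0)) != differ)
          bad++;
        if (((x < 0) ^ (y >= 0)) != same)
          bad++;
        if (((x >= 0) ^ (y < 0)) != same)
          bad++;
      }
  return bad;
}

/* Convenience wrapper for use from a test harness.  */
void
check_pr96681 (void)
{
  assert (count_pr96681_mismatches () == 0);
}
```

Each rewrite replaces two comparisons and an XOR with one XOR and one comparison.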
32928 2021-01-15 Alexandre Oliva <oliva@adacore.com>
32930 * opts.c (gen_command_line_string): Exclude -dumpbase-ext.
32932 2021-01-15 Tamar Christina <tamar.christina@arm.com>
32934 * config/aarch64/aarch64-simd.md (cml<fcmac1><conj_op><mode>4,
32935 cmul<conj_op><mode>3): New.
32936 * config/aarch64/iterators.md (UNSPEC_FCMUL,
32937 UNSPEC_FCMUL180, UNSPEC_FCMLA_CONJ, UNSPEC_FCMLA180_CONJ,
32938 UNSPEC_CMLA_CONJ, UNSPEC_CMLA180_CONJ, UNSPEC_CMUL, UNSPEC_CMUL180,
32939 FCMLA_OP, FCMUL_OP, conj_op, rotsplit1, rotsplit2, fcmac1, sve_rot1,
32940 sve_rot2, SVE2_INT_CMLA_OP, SVE2_INT_CMUL_OP, SVE2_INT_CADD_OP): New.
32941 (rot): Add UNSPEC_FCMUL, UNSPEC_FCMUL180.
32942 (rot_op): Renamed to conj_op.
32943 * config/aarch64/aarch64-sve.md (cml<fcmac1><conj_op><mode>4,
32944 cmul<conj_op><mode>3): New.
32945 * config/aarch64/aarch64-sve2.md (cml<fcmac1><conj_op><mode>4,
32946 cmul<conj_op><mode>3): New.
32948 2021-01-15 David Malcolm <dmalcolm@redhat.com>
32952 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
32953 Escape the tempfile name when constructing the expected output.
32955 2021-01-15 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32957 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlsl_hi<mode>):
32959 (aarch64_<su>mlsl_hi<mode>): ... This.
32960 (aarch64_<su>mlsl_hi<mode>): Define.
32961 (*aarch64_<su>mlsl<mode>): Rename to...
32962 (aarch64_<su>mlsl<mode>): ... This.
32963 * config/aarch64/aarch64-simd-builtins.def (smlsl, umlsl,
32964 smlsl_hi, umlsl_hi): Define builtins.
32965 * config/aarch64/arm_neon.h (vmlsl_high_s8, vmlsl_high_s16,
32966 vmlsl_high_s32, vmlsl_high_u8, vmlsl_high_u16, vmlsl_high_u32,
32967 vmlsl_s8, vmlsl_s16, vmlsl_s32, vmlsl_u8,
32968 vmlsl_u16, vmlsl_u32): Reimplement with builtins.
32970 2021-01-15 Uroš Bizjak <ubizjak@gmail.com>
32972 * config/i386/i386-c.c (ix86_target_macros):
32973 Use cpp_define_formatted for __SIZEOF_FLOAT80__ definition.
32975 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32978 * config.gcc (aarch64*-*-*): Add aarch64-cc-fusion.o to extra_objs.
32979 * Makefile.in (RTL_SSA_H): New variable.
32980 * config/aarch64/t-aarch64 (aarch64-cc-fusion.o): New rule.
32981 * config/aarch64/aarch64-protos.h (make_pass_cc_fusion): Declare.
32982 * config/aarch64/aarch64-passes.def: Add pass_cc_fusion after
32984 * config/aarch64/aarch64-cc-fusion.cc: New file.
32986 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32988 * recog.h (insn_change_watermark::~insn_change_watermark): Avoid
32989 calling cancel_changes for changes that no longer exist.
32991 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32993 * rtl-ssa/functions.h (function_info::ref_defs): Rename to...
32994 (function_info::reg_defs): ...this.
32995 * rtl-ssa/member-fns.inl (function_info::ref_defs): Rename to...
32996 (function_info::reg_defs): ...this.
32998 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33001 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
33003 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33006 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33009 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
33011 2021-01-15 Richard Biener <rguenther@suse.de>
33013 PR tree-optimization/96376
33014 * tree-vect-stmts.c (get_load_store_type): Disregard alignment
33015 for VMAT_INVARIANT.
33017 2021-01-15 Martin Liska <mliska@suse.cz>
33019 * doc/install.texi: Document that some tests need pytest module.
33020 * doc/sourcebuild.texi: Likewise.
33022 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33025 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
33027 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33029 * config/arm/mve.md (mve_vshrq_n_s<mode>_imm): New entry.
33030 (mve_vshrq_n_u<mode>_imm): Likewise.
33031 * config/arm/neon.md (vashr<mode>3, vlshr<mode>3): Move to ...
33032 * config/arm/vec-common.md: ... here.
33034 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33036 * config/arm/mve.md (mve_vshlq_<supf><mode>): Move to
33038 * config/arm/neon.md (vashl<mode>3): Delete.
33039 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): New.
33040 (vasl<mode>3): New expander.
33042 2021-01-15 Richard Biener <rguenther@suse.de>
33044 PR tree-optimization/98685
33045 * tree-vect-slp.c (vect_schedule_slp_node): Refactor handling
33046 of vector extern defs.
33048 2021-01-14 David Malcolm <dmalcolm@redhat.com>
33051 * diagnostic.c (diagnostic_kind_text): Break out this array
33053 (diagnostic_build_prefix): ...here.
33054 (fancy_abort): Detect when diagnostic_initialize has not yet been
33055 called and fall back to a minimal implementation of printing the
33056 ICE, rather than segfaulting in internal_error.
33058 2021-01-14 David Malcolm <dmalcolm@redhat.com>
33060 * diagnostic.c (diagnostic_initialize): Eliminate
33061 parseable_fixits_p in favor of initializing extra_output_kind from
33062 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
33063 (convert_column_unit): New function, split out from...
33064 (diagnostic_converted_column): ...this.
33065 (print_parseable_fixits): Add "column_unit" and "tabstop" params.
33066 Use them to call convert_column_unit on the column values.
33067 (diagnostic_report_diagnostic): Eliminate conditional on
33068 parseable_fixits_p in favor of a switch statement on
33069 extra_output_kind, passing the appropriate values to the new
33070 params of print_parseable_fixits.
33071 (selftest::test_print_parseable_fixits_none): Update for new
33072 params of print_parseable_fixits.
33073 (selftest::test_print_parseable_fixits_insert): Likewise.
33074 (selftest::test_print_parseable_fixits_remove): Likewise.
33075 (selftest::test_print_parseable_fixits_replace): Likewise.
33076 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
33078 (selftest::diagnostic_c_tests): Call it.
33079 * diagnostic.h (enum diagnostics_extra_output_kind): New.
33080 (diagnostic_context::parseable_fixits_p): Delete field in favor
33082 (diagnostic_context::extra_output_kind): ...this new field.
33083 * doc/invoke.texi (Environment Variables): Add
33084 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
33085 * opts.c (common_handle_option): Update handling of
33086 OPT_fdiagnostics_parseable_fixits for change to diagnostic_context
33089 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33091 * tree-vect-slp-patterns.c (class complex_operations_pattern,
33092 complex_operations_pattern::matches,
33093 complex_operations_pattern::recognize,
33094 complex_operations_pattern::build): New.
33095 (slp_patterns): Use it.
33097 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33099 * internal-fn.def (COMPLEX_FMS, COMPLEX_FMS_CONJ): New.
33100 * optabs.def (cmls_optab, cmls_conj_optab): New.
33101 * doc/md.texi: Document them.
33102 * tree-vect-slp-patterns.c (class complex_fms_pattern,
33103 complex_fms_pattern::matches, complex_fms_pattern::recognize,
33104 complex_fms_pattern::build): New.
33106 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33108 * internal-fn.def (COMPLEX_FMA, COMPLEX_FMA_CONJ): New.
33109 * optabs.def (cmla_optab, cmla_conj_optab): New.
33110 * doc/md.texi: Document them.
33111 * tree-vect-slp-patterns.c (vect_match_call_p,
33112 class complex_fma_pattern, vect_slp_reset_pattern,
33113 complex_fma_pattern::matches, complex_fma_pattern::recognize,
33114 complex_fma_pattern::build): New.
33116 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33118 * internal-fn.def (COMPLEX_MUL, COMPLEX_MUL_CONJ): New.
33119 * optabs.def (cmul_optab, cmul_conj_optab): New.
33120 * doc/md.texi: Document them.
33121 * tree-vect-slp-patterns.c (vect_match_call_complex_mla,
33122 vect_normalize_conj_loc, is_eq_or_top, vect_validate_multiplication,
33123 vect_build_combine_node, class complex_mul_pattern,
33124 complex_mul_pattern::matches, complex_mul_pattern::recognize,
33125 complex_mul_pattern::build): New.
33127 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33129 * tree-vect-slp.c (optimize_load_redistribution_1): New.
33130 (optimize_load_redistribution, vect_is_slp_load_node): New.
33131 (vect_match_slp_patterns): Use it.
33133 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33135 * tree-vect-slp-patterns.c (complex_add_pattern::build):
33138 2021-01-14 Thomas Schwinge <thomas@codesourcery.com>
33140 * config/gcn/mkoffload.c (main): Create an offload image only in
33141 64-bit configurations.
33143 2021-01-14 H.J. Lu <hjl.tools@gmail.com>
33146 * config/i386/i386-options.c (ix86_option_override_internal):
33147 Issue an error for -fcf-protection with CF_BRANCH when compiling
33148 for 32-bit non-TARGET_CMOV targets.
33150 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
33153 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p):
33154 Remove declaration and initialization of shadow variable "ret".
33155 (ix86_option_override_internal): Remove declaration of
33156 shadow variable "i". Redeclare shadowed variable to unsigned.
33157 * common/config/i386/i386-common.c (pta_size): Redeclare to unsigned.
33158 * config/i386/i386-builtins.c (get_builtin_code_for_version):
33159 Update for redeclaration.
33160 * config/i386/i386.h (pta_size): Ditto.
33162 2021-01-14 Richard Biener <rguenther@suse.de>
33164 PR tree-optimization/98674
33165 * tree-data-ref.c (base_supports_access_fn_components_p): New.
33166 (initialize_data_dependence_relation): For two bases without
33167 possible access fns resort to type size equality when determining
33168 shape compatibility.
33170 2021-01-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
33173 * config/arm/arm_neon.h: Replace calls to __builtin_vcge* by
33174 <=, >= operators in vcle and vcge intrinsics respectively.
33175 * config/arm/arm_neon_builtins.def: Remove entry for
33178 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
33181 * config/i386/i386-options.c (ix86_function_specific_save):
33182 Remove redundant assignment to opts->x_ix86_branch_cost.
33183 * config/i386/i386.c (ix86_prefetch_sse):
33184 Rename from x86_prefetch_sse. Update all uses.
33185 * config/i386/i386.h: Update for rename.
33186 * config/i386/i386-options.h: Ditto.
33188 2021-01-14 Jakub Jelinek <jakub@redhat.com>
33191 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3,
33192 *sse4_1_zero_extendv4hiv4si2_3, *sse4_1_zero_extendv2siv2di2_3):
33193 Use Bm instead of m for non-avx. Add isa attribute.
33195 2021-01-14 Jakub Jelinek <jakub@redhat.com>
33197 PR tree-optimization/96688
33198 * match.pd (~(X >> Y) -> ~X >> Y): New simplification if
33199 ~X can be simplified.
33201 2021-01-14 Richard Sandiford <richard.sandiford@arm.com>
33203 * tree-vect-stmts.c (vect_model_load_cost): Account for unused
33204 IFN_LOAD_LANES results.
33206 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33208 * config/aarch64/aarch64-simd.md (aarch64_<su>xtl<mode>):
33210 (aarch64_xtn<mode>): Likewise.
33211 * config/aarch64/aarch64-simd-builtins.def (sxtl, uxtl, xtn):
33214 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
33216 (vmovl_s16): Likewise.
33217 (vmovl_s32): Likewise.
33218 (vmovl_u8): Likewise.
33219 (vmovl_u16): Likewise.
33220 (vmovl_u32): Likewise.
33221 (vmovn_s16): Likewise.
33222 (vmovn_s32): Likewise.
33223 (vmovn_s64): Likewise.
33224 (vmovn_u16): Likewise.
33225 (vmovn_u32): Likewise.
33226 (vmovn_u64): Likewise.
33228 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33230 * config/aarch64/aarch64-simd.md (aarch64_<su>qxtn2<mode>_le):
33232 (aarch64_<su>qxtn2<mode>_be): Likewise.
33233 (aarch64_<su>qxtn2<mode>): Likewise.
33234 * config/aarch64/aarch64-simd-builtins.def (sqxtn2, uqxtn2):
33236 * config/aarch64/iterators.md (SAT_TRUNC): Define code_iterator.
33237 (su): Handle ss_truncate and us_truncate.
33238 * config/aarch64/arm_neon.h (vqmovn_high_s16): Reimplement using
33240 (vqmovn_high_s32): Likewise.
33241 (vqmovn_high_s64): Likewise.
33242 (vqmovn_high_u16): Likewise.
33243 (vqmovn_high_u32): Likewise.
33244 (vqmovn_high_u64): Likewise.
33246 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33248 * config/aarch64/aarch64-simd.md (aarch64_xtn2<mode>_le):
33250 (aarch64_xtn2<mode>_be): Likewise.
33251 (aarch64_xtn2<mode>): Likewise.
33252 * config/aarch64/aarch64-simd-builtins.def (xtn2): Define
33254 * config/aarch64/arm_neon.h (vmovn_high_s16): Reimplement using
33256 (vmovn_high_s32): Likewise.
33257 (vmovn_high_s64): Likewise.
33258 (vmovn_high_u16): Likewise.
33259 (vmovn_high_u32): Likewise.
33260 (vmovn_high_u64): Likewise.
33262 2021-01-13 Stafford Horne <shorne@gmail.com>
33264 * config/or1k/or1k.h (ASM_PREFERRED_EH_DATA_FORMAT): New macro.
33266 2021-01-13 Stafford Horne <shorne@gmail.com>
33268 * config/or1k/linux.h (TARGET_ASM_FILE_END): Define macro.
33270 2021-01-13 Stafford Horne <shorne@gmail.com>
33272 * config/or1k/or1k.h (TARGET_CPU_CPP_BUILTINS): Add builtin
33273 define for __or1k_hard_float__.
33275 2021-01-13 Stafford Horne <shorne@gmail.com>
33277 * config/or1k/or1k.h (NO_PROFILE_COUNTERS): Define as 1.
33278 (PROFILE_HOOK): Define to call _mcount.
33279 (FUNCTION_PROFILER): Change from abort to no-op.
33281 2021-01-13 Jakub Jelinek <jakub@redhat.com>
33283 PR tree-optimization/96691
33284 * match.pd ((~X | C) ^ D -> (X | C) ^ (~D ^ C),
33285 (~X & C) ^ D -> (X & C) ^ (D ^ C)): New simplifications if
33286 (~D ^ C) or (D ^ C) can be simplified.
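Both PR tree-optimization/96691 identities above are purely bitwise, so they can be confirmed by brute force. The sketch below is only a check of the algebra, not of when match.pd applies the rule; it assumes `unsigned` operands and uses hypothetical function names.

```c
#include <assert.h>

/* Return the number of (x, c, d) triples in [0, 63]^3 on which either
   identity fails.  The identities hold bit by bit, so a small operand
   range already exercises every bit combination.  */
int
count_pr96691_mismatches (void)
{
  int bad = 0;
  for (unsigned x = 0; x < 64; x++)
    for (unsigned c = 0; c < 64; c++)
      for (unsigned d = 0; d < 64; d++)
        {
          /* (~X | C) ^ D -> (X | C) ^ (~D ^ C)  */
          if (((~x | c) ^ d) != ((x | c) ^ (~d ^ c)))
            bad++;
          /* (~X & C) ^ D -> (X & C) ^ (D ^ C)  */
          if (((~x & c) ^ d) != ((x & c) ^ (d ^ c)))
            bad++;
        }
  return bad;
}

/* Convenience wrapper for use from a test harness.  */
void
check_pr96691 (void)
{
  assert (count_pr96691_mismatches () == 0);
}
```

When C and D are both constants, `~D ^ C` or `D ^ C` folds at compile time, which removes the bitwise NOT on X.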
33288 2021-01-13 Richard Biener <rguenther@suse.de>
33290 PR tree-optimization/92645
33291 * match.pd (BIT_FIELD_REF to conversion): Delay canonicalization
33292 until after vector lowering.
33294 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
33296 * config/aarch64/aarch64-sve.md (fnma<mode>4): Extend from SVE_FULL_I
33298 (@aarch64_pred_fnma<mode>, cond_fnma<mode>, *cond_fnma<mode>_2)
33299 (*cond_fnma<mode>_4, *cond_fnma<mode>_any): Likewise.
33301 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
33303 * config/aarch64/aarch64-sve.md (fma<mode>4): Extend from SVE_FULL_I
33305 (@aarch64_pred_fma<mode>, cond_fma<mode>, *cond_fma<mode>_2)
33306 (*cond_fma<mode>_4, *cond_fma<mode>_any): Likewise.
33308 2021-01-13 Richard Biener <rguenther@suse.de>
33310 PR tree-optimization/92645
33311 * tree-vect-slp.c (vect_build_slp_tree_1): Relax supported
33312 BIT_FIELD_REF argument.
33313 (vect_build_slp_tree_2): Record the desired vector type
33314 on the external vector def.
33315 (vectorizable_slp_permutation): Handle required punning
33316 of existing vector defs.
33318 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
33320 * rtl-ssa/accesses.h (def_lookup): Fix order of comparison results.
33322 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
33324 * config/sh/sh.md (movsf_ie): Remove operands[2] test.
33326 2021-01-13 Samuel Thibault <samuel.thibault@ens-lyon.org>
33328 * config.gcc [$target == *-*-gnu*]: Enable
33329 'default_gnu_indirect_function'.
33331 2021-01-13 Jakub Jelinek <jakub@redhat.com>
33334 * optabs.c (expand_vec_perm_const): Don't force v0 and v1 into
33335 registers before calling targetm.vectorize.vec_perm_const, only after
33337 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const): Handle
33338 two argument permutation when one operand is zero vector and only
33339 after that force operands into registers.
33340 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_1): New
33341 define_insn_and_split pattern.
33342 (*avx512bw_zero_extendv32qiv32hi2_1): Likewise.
33343 (*avx512f_zero_extendv16hiv16si2_1): Likewise.
33344 (*avx2_zero_extendv8hiv8si2_1): Likewise.
33345 (*avx512f_zero_extendv8siv8di2_1): Likewise.
33346 (*avx2_zero_extendv4siv4di2_1): Likewise.
33347 * config/mips/mips.c (mips_vectorize_vec_perm_const): Force operands
33349 * config/arm/arm.c (arm_vectorize_vec_perm_const): Likewise.
33350 * config/sparc/sparc.c (sparc_vectorize_vec_perm_const): Likewise.
33351 * config/ia64/ia64.c (ia64_vectorize_vec_perm_const): Likewise.
33352 * config/aarch64/aarch64.c (aarch64_vectorize_vec_perm_const): Likewise.
33353 * config/rs6000/rs6000.c (rs6000_vectorize_vec_perm_const): Likewise.
33354 * config/gcn/gcn.c (gcn_vectorize_vec_perm_const): Likewise. Use std::swap.
33356 2021-01-13 Martin Liska <mliska@suse.cz>
33358 PR tree-optimization/98455
33359 * gimple-if-to-switch.cc (condition_info::record_phi_mapping):
33360 Record also virtual PHIs.
33361 (pass_if_to_switch::execute): Return TODO_cleanup_cfg only
33364 2021-01-13 Jonathan Wakely <jwakely@redhat.com>
33366 * doc/invoke.texi (C++ Modules): Fix typos.
33368 2021-01-13 Richard Biener <rguenther@suse.de>
33370 PR tree-optimization/98640
33371 * tree-ssa-sccvn.c (visit_nary_op): Do not try to
33372 handle plus or minus from a truncated operand to be
33375 2021-01-13 Jakub Jelinek <jakub@redhat.com>
33378 * config/i386/i386.md (*btr<mode>_1, *btr<mode>_2): New
33379 define_insn_and_split patterns.
33380 (splitter after *btr<mode>_2): New splitter.
33382 2021-01-13 Martin Liska <mliska@suse.cz>
33385 * cgraphunit.c (analyze_functions): Remove dead code.
33387 2021-01-13 Qian Jianhua <qianjh@cn.fujitsu.com>
33389 * config/aarch64/aarch64-cost-tables.h (a64fx_extra_costs): New.
33390 * config/aarch64/aarch64.c (a64fx_addrcost_table): New.
33391 (a64fx_regmove_cost, a64fx_vector_cost): New.
33392 (a64fx_tunings): Use the new added cost tables.
33394 2021-01-13 Jakub Jelinek <jakub@redhat.com>
33397 * config/i386/predicates.md (pmovzx_parallel): New predicate.
33398 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): New
33399 define_insn_and_split pattern.
33400 (*sse4_1_zero_extendv4hiv4si2_3): Likewise.
33401 (*sse4_1_zero_extendv2siv2di2_3): Likewise.
33403 2021-01-13 Julian Brown <julian@codesourcery.com>
33405 * config/gcn/gcn.c (gcn_conditional_register_usage): Remove dead code
33406 to fix v0 register.
33408 2021-01-13 Julian Brown <julian@codesourcery.com>
33410 * config/gcn/gcn.c (gcn_md_reorg): Fix case where EXEC reg is live
33413 2021-01-13 Julian Brown <julian@codesourcery.com>
33415 * config/gcn/gcn-valu.md (recip<mode>2<exec>, recip<mode>2): Use unspec
33416 for reciprocal-approximation instructions.
33417 (div<mode>3): Use fused multiply-accumulate operations for reciprocal
33418 refinement and division result.
33419 * config/gcn/gcn.md (UNSPEC_RCP): New unspec constant.
33421 2021-01-13 Julian Brown <julian@codesourcery.com>
33423 * config/gcn/gcn-valu.md (subdf): Rename to...
33426 2021-01-12 Martin Liska <mliska@suse.cz>
33428 * gcov.c (source_info::debug): Fix printf format for 32-bit hosts.
33430 2021-01-12 Andrea Corallo <andrea.corallo@arm.com>
33432 * function-abi.h: Fix typo.
33434 2021-01-12 Christophe Lyon <christophe.lyon@linaro.org>
33438 * config/arm/arm.h (ARM_HAVE_NEON_V8QI_LDST): New macro.
33439 (ARM_HAVE_NEON_V16QI_LDST, ARM_HAVE_NEON_V4HI_LDST): Likewise.
33440 (ARM_HAVE_NEON_V8HI_LDST, ARM_HAVE_NEON_V2SI_LDST): Likewise.
33441 (ARM_HAVE_NEON_V4SI_LDST, ARM_HAVE_NEON_V4HF_LDST): Likewise.
33442 (ARM_HAVE_NEON_V8HF_LDST, ARM_HAVE_NEON_V4BF_LDST): Likewise.
33443 (ARM_HAVE_NEON_V8BF_LDST, ARM_HAVE_NEON_V2SF_LDST): Likewise.
33444 (ARM_HAVE_NEON_V4SF_LDST, ARM_HAVE_NEON_DI_LDST): Likewise.
33445 (ARM_HAVE_NEON_V2DI_LDST): Likewise.
33446 (ARM_HAVE_V8QI_LDST, ARM_HAVE_V16QI_LDST): Likewise.
33447 (ARM_HAVE_V4HI_LDST, ARM_HAVE_V8HI_LDST): Likewise.
33448 (ARM_HAVE_V2SI_LDST, ARM_HAVE_V4SI_LDST, ARM_HAVE_V4HF_LDST): Likewise.
33449 (ARM_HAVE_V8HF_LDST, ARM_HAVE_V4BF_LDST, ARM_HAVE_V8BF_LDST): Likewise.
33450 (ARM_HAVE_V2SF_LDST, ARM_HAVE_V4SF_LDST, ARM_HAVE_DI_LDST): Likewise.
33451 (ARM_HAVE_V2DI_LDST): Likewise.
33452 * config/arm/mve.md (*movmisalign<mode>_mve_store): New pattern.
33453 (*movmisalign<mode>_mve_load): New pattern.
33454 * config/arm/neon.md (movmisalign<mode>): Move to ...
33455 * config/arm/vec-common.md: ... here.
33457 2021-01-12 Vladimir N. Makarov <vmakarov@redhat.com>
33460 * lra-eliminations.c (eliminate_regs_in_insn): Add transformation
33461 of pattern 'plus (plus (hard reg, const), pseudo)'.
33463 2021-01-12 Richard Biener <rguenther@suse.de>
33465 PR tree-optimization/98550
33466 * tree-vect-slp.c (vect_record_max_nunits): Check whether
33467 the group size is a multiple of the vector element count.
33468 (vect_build_slp_tree_1): When we need to fail because
33469 the vector type chosen causes unrolling do so lazily
33470 without affecting matches only at the end to guide group splitting.
33472 2021-01-12 Martin Liska <mliska@suse.cz>
33475 * optc-save-gen.awk: Compare also n_target_save vars with
33478 2021-01-12 Martin Liska <mliska@suse.cz>
33480 * gcov.c (source_info::debug): New.
33481 (print_usage): Add --debug (-D) option.
33482 (process_args): Likewise.
33483 (generate_results): Call src->debug after
33484 accumulate_line_counts.
33485 (read_graph_file): Properly assign id for EXIT_BLOCK.
33486 * profile.c (branch_prob): Dump function body before it is
33489 2021-01-12 Jakub Jelinek <jakub@redhat.com>
33491 PR tree-optimization/98629
33492 * tree-ssa-math-opts.c (arith_overflow_check_p): Don't update use_stmt
33493 unless returning non-zero.
33495 2021-01-12 Jakub Jelinek <jakub@redhat.com>
33497 PR tree-optimization/95731
33498 * tree-ssa-reassoc.c (optimize_range_tests_cmp_bitwise): Also optimize
33499 x < 0 && y < 0 && z < 0 into (x | y | z) < 0 for signed x, y, z.
33500 (optimize_range_tests): Call optimize_range_tests_cmp_bitwise
33501 only after optimize_range_tests_var_bound.
33503 2021-01-12 Jakub Jelinek <jakub@redhat.com>
33505 * configure.ac: Ensure c/Make-lang.in comes first in @all_lang_makefrags@.
33506 * configure: Regenerated.
33508 2021-01-12 liuhongt <hongtao.liu@intel.com>
33511 * config/i386/i386-builtins.h (BUILTIN_DESC_SWAP_OPERANDS):
33513 * config/i386/i386-expand.c (ix86_expand_sse_comi): Delete
33516 2021-01-12 Alexandre Oliva <oliva@adacore.com>
33518 * ssa-iterators.h (end_imm_use_stmt_traverse): Forward
33520 (auto_end_imm_use_stmt_traverse): New struct.
33521 (FOR_EACH_IMM_USE_STMT): Use it.
33522 (BREAK_FROM_IMM_USE_STMT, RETURN_FROM_IMM_USE_STMT): Remove,
33524 * gimple-ssa-strength-reduction.c: ... here, ...
33525 * graphite-scop-detection.c: ... here, ...
33526 * ipa-modref.c, ipa-pure-const.c, ipa-sra.c: ... here, ...
33527 * tree-predcom.c, tree-ssa-ccp.c: ... here, ...
33528 * tree-ssa-dce.c, tree-ssa-dse.c: ... here, ...
33529 * tree-ssa-loop-ivopts.c, tree-ssa-math-opts.c: ... here, ...
33530 * tree-ssa-phiprop.c, tree-ssa.c: ... here, ...
33531 * tree-vect-slp.c: ... and here, ...
33532 * doc/tree-ssa.texi: ... and the example here.
33534 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33536 * config/aarch64/aarch64-sve.md (sdiv_pow2<mode>3): Extend from
33537 SVE_FULL_I to SVE_I. Generate an UNSPEC_PRED_X.
33538 (*sdiv_pow2<mode>3): New pattern.
33539 (@cond_<sve_int_op><mode>): Extend from SVE_FULL_I to SVE_I.
33540 Wrap the ASRD in an UNSPEC_PRED_X.
33541 (*cond_<sve_int_op><mode>_2): Likewise. Replace the UNSPEC_PRED_X
33542 predicate with a constant PTRUE, if it isn't already.
33543 (*cond_<sve_int_op><mode>_z): Replace with...
33544 (*cond_<sve_int_op><mode>_any): ...this new pattern.
33546 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33548 * config/aarch64/aarch64-sve.md (*cond_bic<mode>_2): Extend from
33549 SVE_FULL_I to SVE_I.
33550 (*cond_bic<mode>_any): Likewise.
33552 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33554 * config/aarch64/aarch64-sve.md (<su>mul<mode>3_highpart)
33555 (@aarch64_pred_<MUL_HIGHPART:optab><mode>): Extend from SVE_FULL_I
33558 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33560 * config/aarch64/aarch64-sve.md (<su>abd<mode>_3): Extend from
33561 SVE_FULL_I to SVE_I.
33562 (*aarch64_cond_<su>abd<mode>_2): Likewise.
33563 (*aarch64_cond_<su>abd<mode>_any): Likewise.
33564 (@aarch64_pred_<su>abd<mode>): Likewise. Use UNSPEC_PRED_X
33565 for the max and min but not for the minus.
33566 (*aarch64_cond_<su>abd<mode>_3): New pattern.
33568 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33570 * config/aarch64/iterators.md (SVE_24I): New iterator.
33571 * config/aarch64/aarch64-sve.md (*aarch64_adr<mode>_shift): Extend from
33572 SVE_FULL_SDI to SVE_24I. Use containers rather than elements.
33574 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33576 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_BINARY:optab><mode>)
33577 (*cond_<SVE_INT_BINARY:optab><mode>_2): Extend from SVE_FULL_I
33579 (*cond_<SVE_INT_BINARY:optab><mode>_3): Likewise.
33580 (*cond_<SVE_INT_BINARY:optab><mode>_any): Likewise.
33581 (*cond_<SVE_INT_BINARY:optab><mode>_2_const): Likewise.
33582 (*cond_<SVE_INT_BINARY:optab><mode>_any_const): Likewise.
33584 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33586 * config/aarch64/aarch64-sve.md (<SVE_INT_BINARY_IMM:optab><mode>3)
33587 (@aarch64_pred_<SVE_INT_BINARY_IMM:optab><mode>)
33588 (*post_ra_<SVE_INT_BINARY_IMM:optab><mode>3): Extend from SVE_FULL_I
33591 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33593 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3)
33594 (v<ASHIFT:optab><mode>3, @aarch64_pred_<optab><mode>)
33595 (*post_ra_v<ASHIFT:optab><mode>3): Extend from SVE_FULL_I to SVE_I.
33597 2021-01-11 Martin Liska <mliska@suse.cz>
33600 * symtab-clones.h (clone_info::release): Release
33601 symtab::m_clones with ggc_delete as it's a GGC memory.
33603 2021-01-11 Matthias Klose <doko@ubuntu.com>
33605 * Makefile.in (LINK_PROGRESS): Show the link target.
33607 2021-01-11 Richard Biener <rguenther@suse.de>
33609 PR tree-optimization/91403
33610 * tree-vect-data-refs.c (vect_analyze_group_access_1): Cap
33611 single-element interleaving group size at 4096 elements.
33613 2021-01-11 Richard Biener <rguenther@suse.de>
33615 PR tree-optimization/98526
33616 * tree-vect-loop.c (vect_model_reduction_cost): Remove costing
33617 of the actual reduction op for the regular case.
33618 (vectorizable_reduction): Cost the stmts
33619 vect_transform_reduction produces here.
33621 2021-01-11 Andreas Krebbel <krebbel@linux.ibm.com>
33623 * tree-ssa-forwprop.c (simplify_vector_constructor): For
33624 big-endian, use UNPACK[_FLOAT]_HI.
33626 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33628 * tree-vect-slp-patterns.c (class complex_pattern,
33629 class complex_add_pattern): Add parameters to matches.
33630 (complex_add_pattern::build): Free memory.
33631 (complex_add_pattern::matches): Move validation to end of match.
33632 (complex_add_pattern::recognize): Likewise.
33634 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33636 * tree-vect-slp-patterns.c (linear_loads_p): Fix externals.
33638 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33640 * tree-vect-slp-patterns.c (is_linear_load_p): Fix ambiguity.
33642 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33644 PR tree-optimization/95867
33645 * tree-ssa-math-opts.h: New header.
33646 * tree-ssa-math-opts.c: Include tree-ssa-math-opts.h.
33647 (powi_as_mults): No longer static. Use build_one_cst instead of
33648 build_real. Formatting fix.
33649 * tree-ssa-reassoc.c: Include tree-ssa-math-opts.h.
33650 (attempt_builtin_powi): Handle multiplication reassociation without
33651 powi_fndecl using powi_as_mults.
33652 (reassociate_bb): For integral types don't require
33653 -funsafe-math-optimizations to call attempt_builtin_powi.
33655 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33657 PR tree-optimization/95852
33658 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): Change
33659 mul_stmts parameter type to vec<gimple *> &. Before cond_stmt
33660 allow in the bb any of the stmts in that vector, div_stmt and
33661 up to 3 cast stmts.
33662 (arith_cast_equal_p): New function.
33663 (arith_overflow_check_p): Add cast_stmt argument, handle signed
33664 multiply overflow checks.
33665 (match_arith_overflow): Adjust caller. Handle signed multiply
33668 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33670 PR tree-optimization/95852
33671 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): New function.
33672 (uaddsub_overflow_check_p): Renamed to ...
33673 (arith_overflow_check_p): ... this. Handle also multiplication
33674 with overflow check.
33675 (match_uaddsub_overflow): Renamed to ...
33676 (match_arith_overflow): ... this. Add cfg_changed argument. Handle
33677 also multiplication with overflow check. Adjust function comment.
33678 (math_opts_dom_walker::after_dom_children): Adjust callers. Call
33679 match_arith_overflow also for MULT_EXPR.
33681 2021-01-11 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33683 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
33684 __builtin_convertvector.
33685 (vmovl_s16): Likewise.
33686 (vmovl_s32): Likewise.
33687 (vmovl_u8): Likewise.
33688 (vmovl_u16): Likewise.
33689 (vmovl_u32): Likewise.
33690 (vmovn_s16): Likewise.
33691 (vmovn_s32): Likewise.
33692 (vmovn_s64): Likewise.
33693 (vmovn_u16): Likewise.
33694 (vmovn_u32): Likewise.
33695 (vmovn_u64): Likewise.
33697 2021-01-11 Martin Liska <mliska@suse.cz>
33699 * gimple-if-to-switch.cc (struct condition_info): Use auto_var.
33700 (if_chain::is_beneficial): Delete clusters.
33701 (find_conditions): Make second argument of conditions_in_bbs a
33702 pointer so that we have control over its lifetime.
33703 (pass_if_to_switch::execute): Delete them.
33705 2021-01-11 Kewen Lin <linkw@linux.ibm.com>
33707 * ira.c (move_unallocated_pseudos): Check other_reg and skip if
33710 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33712 * config/vax/vax.md (cc): Remove mode attribute.
33713 (subst_<cc>, subst_f<cc>): Rename to...
33714 (subst_<mode>, subst_f<VAXccnz:mode>): ... these respectively.
33715 (*cbranch<VAXint:mode>4_<VAXcc:mode>): Update for `cc' removal.
33716 (*cbranch<VAXfp:mode>4_<VAXccnz:mode>): Likewise.
33717 (*branch_<mode>, *branch_<mode>_reversed): Likewise.
33719 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33721 * config/vax/vax.md (subst_f<cc>): Add mode to operands and
33722 `const_double_zero'.
33724 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33726 * config/pdp11/pdp11.md (PDPfp): New mode iterator.
33727 (fcc_cc, fcc_ccnz): Use it. Add mode to `const_double_zero' and
33730 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33732 * genemit.c (gen_exp) <CONST_DOUBLE>: Handle `const_double_zero'
33734 * read-rtl.c (rtx_reader::read_rtx_code): Handle machine mode
33735 with `const_double_zero'.
33736 * doc/rtl.texi (Constant Expression Types): Document it.
33738 2021-01-09 Jakub Jelinek <jakub@redhat.com>
33741 * tree-cfg.c (verify_gimple_assign_binary): Allow lhs of
33742 POINTER_DIFF_EXPR to be any integral type.
33744 2021-01-09 Jakub Jelinek <jakub@redhat.com>
33746 PR rtl-optimization/98603
33747 * function.c (instantiate_virtual_regs_in_insn): For asm goto
33748 with impossible constraints, drop all SETs, CLOBBERs, drop PARALLEL
33749 if any, set ASM_OPERANDS mode to VOIDmode and change
33750 ASM_OPERANDS_OUTPUT_CONSTRAINT and ASM_OPERANDS_OUTPUT_IDX.
33752 2021-01-09 Alexandre Oliva <oliva@gnu.org>
33755 * final.c (notice_source_line): Narrow down the condition to
33756 skip a line-0 marker.
33758 2021-01-08 Sergei Trofimovich <siarheit@google.com>
33760 * ipa-modref.c (merge_call_side_effects): Fix
33761 linebreak split by reordering two print calls.
33763 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
33765 * config/s390/vector.md (*tf_to_fprx2_0): Rename from
33766 "*mov_tf_to_fprx2_0" for consistency, fix constraint.
33767 (*tf_to_fprx2_1): Rename from "*mov_tf_to_fprx2_1" for
33768 consistency, fix constraint.
33770 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
33772 * config/s390/s390-c.c (s390_def_or_undef_macro): Accept
33773 callables instead of mask values.
33774 (struct target_flag_set_p): New predicate.
33775 (s390_cpu_cpp_builtins_internal): Define or undefine
33776 __LONG_DOUBLE_VX__ macro.
33778 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
33781 * config/i386/i386.c (x86_function_profiler): Use R10 and R11
33782 to call mcount in large model with PIC for NO_PROFILE_COUNTERS
33785 2021-01-08 Richard Biener <rguenther@suse.de>
33787 * tree-ssa-sccvn.c (pass_fre::execute): Reset the SCEV hash table.
33789 2021-01-08 Richard Biener <rguenther@suse.de>
33791 * tree-vect-slp.c (scalar_stmts_to_slp_tree_map_t): Fix.
33792 (vect_build_slp_tree): On cache hit release the matched
33793 scalar stmts vector.
33794 * tree-vect-stmts.c (vectorizable_store): Properly free
33795 vec_oprnds before possibly gathering them again.
33797 2021-01-08 Richard Biener <rguenther@suse.de>
33799 PR tree-optimization/98544
33800 * tree-vect-slp.c (vect_optimize_slp): Always materialize
33801 permutes at a permute node.
33803 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
33806 * config/i386/i386.c (x86_function_profiler): Use R10 to call
33807 mcount in large model. Sorry for large model with PIC.
33809 2021-01-08 Jakub Jelinek <jakub@redhat.com>
33812 * config/i386/i386.opt (ix86_cmodel, ix86_incoming_stack_boundary_arg,
33813 ix86_pmode, ix86_preferred_stack_boundary_arg, ix86_regparm,
33814 ix86_veclibabi_type): Remove x_ prefix, use TargetVariable instead of
33815 TargetSave and initialize for variables with enum types.
33816 (mfentry, mstack-protector-guard-reg=, mstack-protector-guard-offset=,
33817 mstack-protector-guard-symbol=): Add Save.
33818 * config/i386/i386-options.c (ix86_function_specific_save,
33819 ix86_function_specific_restore): Don't save or restore x_ix86_cmodel,
33820 x_ix86_incoming_stack_boundary_arg, x_ix86_pmode,
33821 x_ix86_preferred_stack_boundary_arg, x_ix86_regparm,
33822 x_ix86_veclibabi_type.
33824 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
33826 * config/aarch64/aarch64-sve.md (*cnot<mode>): Extend from
33827 SVE_FULL_I to SVE_I.
33828 (*cond_cnot<mode>_2, *cond_cnot<mode>_any): Likewise.
33830 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
33832 * config/aarch64/aarch64-sve.md (*cond_uxt<mode>_2): Extend from
33833 SVE_FULL_I to SVE_I.
33834 (*cond_uxt<mode>_any): Likewise.
33836 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33838 * config/aarch64/iterators.md (Vwhalf): New iterator.
33839 * config/aarch64/aarch64-simd.md (aarch64_<sur>adalp<mode>_3):
33841 (aarch64_<sur>adalp<mode>): ... This. Make more
33843 (<sur>sadv16qi): Adjust callsite of the above.
33844 * config/aarch64/aarch64-simd-builtins.def (sadalp, uadalp): New
33846 * config/aarch64/arm_neon.h (vpadal_s8): Reimplement using
33848 (vpadal_s16): Likewise.
33849 (vpadal_u8): Likewise.
33850 (vpadal_u16): Likewise.
33851 (vpadalq_s8): Likewise.
33852 (vpadalq_s16): Likewise.
33853 (vpadalq_s32): Likewise.
33854 (vpadalq_u8): Likewise.
33855 (vpadalq_u16): Likewise.
33856 (vpadalq_u32): Likewise.
33858 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33860 * config/aarch64/aarch64-simd.md (aarch64_<su>abd<mode>_3):
33862 (aarch64_<su>abd<mode>): ... This.
33863 (<sur>sadv16qi): Adjust callsite of the above.
33864 * config/aarch64/aarch64-simd-builtins.def (sabd, uabd): Define
33866 * config/aarch64/arm_neon.h (vabd_s8): Reimplement using
33868 (vabd_s16): Likewise.
33869 (vabd_s32): Likewise.
33870 (vabd_u8): Likewise.
33871 (vabd_u16): Likewise.
33872 (vabd_u32): Likewise.
33873 (vabdq_s8): Likewise.
33874 (vabdq_s16): Likewise.
33875 (vabdq_s32): Likewise.
33876 (vabdq_u8): Likewise.
33877 (vabdq_u16): Likewise.
33878 (vabdq_u32): Likewise.
33880 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33882 * config/aarch64/aarch64-simd-builtins.def (saba, uaba): Define
33884 * config/aarch64/arm_neon.h (vaba_s8): Implement using builtin.
33885 (vaba_s16): Likewise.
33886 (vaba_s32): Likewise.
33887 (vaba_u8): Likewise.
33888 (vaba_u16): Likewise.
33889 (vaba_u32): Likewise.
33890 (vabaq_s8): Likewise.
33891 (vabaq_s16): Likewise.
33892 (vabaq_s32): Likewise.
33893 (vabaq_u8): Likewise.
33894 (vabaq_u16): Likewise.
33895 (vabaq_u32): Likewise.
33897 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33899 * config/aarch64/aarch64-simd.md (aba<mode>_3): Rename to...
33900 (aarch64_<su>aba<mode>): ... This. Handle uaba as well.
33901 Change RTL pattern to match.
33903 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
33905 * common/config/riscv/riscv-common.c (riscv_current_subset_list): New.
33906 * config/riscv/riscv-c.c (riscv-subset.h): New.
33907 (INCLUDE_STRING): Define.
33908 (riscv_cpu_cpp_builtins): Add new style architecture extension
33910 * config/riscv/riscv-subset.h (riscv_subset_list::begin): New.
33911 (riscv_subset_list::end): New.
33912 (riscv_current_subset_list): New.
33914 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
33916 * common/config/riscv/riscv-common.c (RISCV_DONT_CARE_VERSION):
33917 Move to riscv-subset.h.
33918 (struct riscv_subset_t): Ditto.
33919 (class riscv_subset_list): Ditto.
33920 * config/riscv/riscv-subset.h (RISCV_DONT_CARE_VERSION): Move
33921 from riscv-common.c.
33922 (struct riscv_subset_t): Ditto.
33923 (class riscv_subset_list): Ditto.
33924 * config/riscv/t-riscv ($(common_out_file)): Add file
33927 2021-01-07 Jakub Jelinek <jakub@redhat.com>
33930 * config/i386/i386.md (*bmi_blsi_<mode>_cmp, *bmi_blsi_<mode>_ccno):
33931 New define_insn patterns.
33933 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33935 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_UNARY:optab><mode>)
33936 (*cond_<SVE_INT_UNARY:optab><mode>_2): Extend from SVE_FULL_I to SVE_I.
33937 (*cond_<SVE_INT_UNARY:optab><mode>_any): Likewise.
33939 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33941 PR tree-optimization/98560
33942 * internal-fn.def (IFN_VCONDU, IFN_VCONDEQ): Use type vec_cond.
33943 * internal-fn.c (vec_cond_mask_direct): Get the data mode from
33945 (vec_cond_direct): Likewise argument 2.
33946 (vec_condu_direct, vec_condeq_direct): Delete.
33947 (expand_vect_cond_optab_fn): Rename to...
33948 (expand_vec_cond_optab_fn): ...this, replacing old macro.
33949 (expand_vec_condu_optab_fn, expand_vec_condeq_optab_fn): Delete.
33950 (expand_vect_cond_mask_optab_fn): Rename to...
33951 (expand_vec_cond_mask_optab_fn): ...this, replacing old macro.
33952 (direct_vec_cond_mask_optab_supported_p): Treat the optab as a
33954 (direct_vec_cond_optab_supported_p): Likewise.
33955 (direct_vec_condu_optab_supported_p): Delete.
33956 (direct_vec_condeq_optab_supported_p): Delete.
33957 * gimple-isel.cc: Include internal-fn.h.
33958 (gimple_expand_vec_cond_expr): Check that IFN_VCONDEQ is supported
33961 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33963 PR tree-optimization/98560
33964 * gimple-isel.cc (gimple_expand_vec_cond_expr): If we fail to use
33965 IFN_VCOND{,U,EQ}, fall back on IFN_VCOND_MASK.
33967 2021-01-07 Uroš Bizjak <ubizjak@gmail.com>
33969 * config/i386/i386.md (insn): Merge from plusminus_insn, shift_insn,
33970 rotate_insn and optab code attributes.
33971 Update all uses to merged code attribute.
33972 * config/i386/sse.md: Update all uses to merged code attribute.
33973 * config/i386/mmx.md: Update all uses to merged code attribute.
33975 2021-01-07 Jakub Jelinek <jakub@redhat.com>
33977 PR tree-optimization/98568
33978 * gimple-ssa-store-merging.c (bswap_view_convert): New function.
33979 (bswap_replace): Use it.
33981 2021-01-06 Vladimir N. Makarov <vmakarov@redhat.com>
33983 PR rtl-optimization/97978
33984 * lra-int.h (lra_hard_reg_split_p): New external.
33985 * lra.c (lra_hard_reg_split_p): New global.
33986 (lra): Set up lra_hard_reg_split_p after splitting a hard reg.
33987 * lra-assigns.c (lra_assign): Don't check allocation correctness
33988 after hard reg splitting.
33990 2021-01-06 Martin Sebor <msebor@redhat.com>
33993 * builtins.c (new_delete_mismatch_p): New overload.
33994 (new_delete_mismatch_p (tree, tree)): Call it.
33996 2021-01-06 Alexandre Oliva <oliva@adacore.com>
33998 * Makefile.in (T_GLIMITS_H): New.
33999 (stmp-int-hdrs): Depend on it, use it.
34000 * config/t-vxworks (T_GLIMITS_H): Override it.
34001 (vxw-glimits.h): New.
34003 2021-01-06 Richard Biener <rguenther@suse.de>
34005 PR tree-optimization/98513
34006 * value-range.cc (intersect_ranges): Compare the upper bounds
34007 for the expected relation.
34009 2021-01-06 Gerald Pfeifer <gerald@pfeifer.com>
34012 2020-12-28 Gerald Pfeifer <gerald@pfeifer.com>
34014 * doc/standards.texi (HSAIL): Remove section.
34016 2021-01-05 Samuel Thibault <samuel.thibault@ens-lyon.org>
34018 * configure: Re-generate.
34020 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34022 * doc/invoke.texi (-std=c++20): Adjust for the publication of
34023 ISO 14882:2020 standard.
34024 * doc/standards.texi: Likewise.
34026 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34028 PR tree-optimization/94802
34029 * expr.h (maybe_optimize_sub_cmp_0): Declare.
34030 * expr.c: Include tree-pretty-print.h and flags.h.
34031 (maybe_optimize_sub_cmp_0): New function.
34032 (do_store_flag): Use it.
34033 * cfgexpand.c (expand_gimple_cond): Likewise.
34035 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
34037 * mux-utils.h (pointer_mux::m_ptr): Tweak description of contents.
34038 * rtlanal.c (simple_regno_set): Tweak description to clarify the
34041 2021-01-05 Richard Biener <rguenther@suse.de>
34043 PR tree-optimization/98516
34044 * tree-vect-slp.c (vect_optimize_slp): Permute the incoming
34045 lanes when materializing on a VEC_PERM node.
34046 (vectorizable_slp_permutation): Dump the permute properly.
34048 2021-01-05 Richard Biener <rguenther@suse.de>
34050 * tree-vect-slp.c (vect_slp_region): Move debug counter
34051 to cover individual subgraphs.
34053 2021-01-05 Richard Biener <rguenther@suse.de>
34055 PR tree-optimization/98428
34056 * tree-vect-slp.c (vect_build_slp_tree_1): Properly reject
34057 vector lane extracts for loop vectorization.
34059 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34061 PR tree-optimization/98514
34062 * tree-ssa-reassoc.c (bb_rank): Change type from long * to
34064 (operand_rank): Change type from hash_map<tree, long> to
34065 hash_map<tree, int64_t>.
34066 (phi_rank): Change return type from long to int64_t.
34067 (loop_carried_phi): Change block_rank variable type from long to
34069 (propagate_rank): Change return type, rank parameter type and
34070 op_rank variable type from long to int64_t.
34071 (find_operand_rank): Change return type from long to int64_t
34072 and change slot variable type from long * to int64_t *.
34073 (insert_operand_rank): Change rank parameter type from long to
34075 (get_rank): Change return type and rank variable type from long to
34076 int64_t. Use PRId64 instead of ld to print the rank.
34077 (init_reassoc): Change rank variable type from long to int64_t
34078 and adjust correspondingly bb_rank and operand_rank initialization.
34080 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34082 PR tree-optimization/96928
34083 * tree-ssa-phiopt.c (xor_replacement): New function.
34084 (tree_ssa_phiopt_worker): Call it.
34086 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34088 PR tree-optimization/96930
34089 * match.pd ((A / (1 << B)) -> (A >> B)): If A is extended
34090 from narrower value which has the same type as 1 << B, perform
34091 the right shift on the narrower value followed by extension.
34093 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34095 PR tree-optimization/96239
34096 * gimple-ssa-store-merging.c (maybe_optimize_vector_constructor): New
34098 (get_status_for_store_merging): Don't return BB_INVALID for blocks
34099 with potential bswap optimizable CONSTRUCTORs.
34100 (pass_store_merging::execute): Optimize vector CONSTRUCTORs with bswap
34103 2021-01-05 Richard Biener <rguenther@suse.de>
34105 PR tree-optimization/98381
34106 * tree.c (vector_element_bits): Properly compute bool vector
34108 * tree-vect-loop.c (vectorizable_live_operation): Properly
34109 compute the last lane bit offset.
34111 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
34114 * config/i386/sse.md (sse_cvtps2pi): Redefine as define_insn_and_split.
34115 Clear the top 64 bits of the input XMM register.
34116 (sse_cvttps2pi): Ditto.
34118 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
34121 * config/i386/xopintrin.h (_mm256_cmov_si256): New.
34123 2021-01-05 H.J. Lu <hjl.tools@gmail.com>
34126 * config/i386/xmmintrin.h (_mm_extract_pi16): Cast to unsigned
34129 2021-01-05 Claudiu Zissulescu <claziss@synopsys.com>
34131 * config/arc/arc.md (maddsidi4_split): Use ACC_REG_FIRST.
34132 (umaddsidi4_split): Likewise.
34134 2021-01-05 liuhongt <hongtao.liu@intel.com>
34137 * config/i386/sse.md (*sse2_pmovskb_zexthisi): New
34138 define_insn_and_split for zero_extend of subreg HI of pmovskb
34140 (*sse2_pmovskb_zexthisi): Add new combine splitters for
34141 zero_extend of not of subreg HI of pmovskb result.
34143 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
34146 * explow.c (convert_memory_address_addr_space_1): Handle UNSPECs
34148 * config/aarch64/aarch64.c (aarch64_expand_mov_immediate): Use
34149 convert_memory_address to convert symbolic immediates to ptr_mode
34150 before forcing them to memory.
34152 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
34154 PR rtl-optimization/97144
34155 * recog.c (constrain_operands): Initialize matching_operand
34156 for each alternative, rather than only doing it once.
34158 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
34160 PR rtl-optimization/98403
34161 * rtl-ssa/changes.cc (function_info::finalize_new_accesses): Explain
34162 why we don't remove call clobbers.
34163 (function_info::apply_changes_to_insn): Don't attempt to add
34164 call clobbers here.
34166 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
34168 PR tree-optimization/98371
34169 * tree-vect-loop.c (vect_reanalyze_as_main_loop): New function.
34170 (vect_analyze_loop): If an epilogue loop appears to be cheaper
34171 than the main loop, re-analyze it as a main loop before adopting
34174 2021-01-05 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
34177 * configure.ac (NETLIBS): Determine using AX_LIB_SOCKET_NSL.
34178 * aclocal.m4, configure: Regenerate.
34179 * Makefile.in (NETLIBS): Define.
34180 (BACKEND): Remove $(CODYLIB).
34182 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34184 PR rtl-optimization/98334
34185 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
34186 Optimize (X - 1) * Y + Y to X * Y or (X + 1) * Y - Y to X * Y.
34188 2021-01-05 Bernd Edlinger <bernd.edlinger@hotmail.de>
34190 * tree-inline.c (expand_call_inline): Restore input_location.
34191 Return result from recursive call.
34193 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
34195 PR tree-optimization/95401
34196 * config/aarch64/aarch64-sve-builtins.cc
34197 (gimple_folder::load_store_cookie): Use bits rather than bytes
34198 for the alignment argument to IFN_MASK_LOAD and IFN_MASK_STORE.
34199 * gimple-fold.c (gimple_fold_mask_load_store_mem_ref): Likewise.
34200 * tree-vect-stmts.c (vectorizable_store): Likewise.
34201 (vectorizable_load): Likewise.
34203 2021-01-04 Richard Biener <rguenther@suse.de>
34205 PR tree-optimization/98308
34206 * tree-vect-stmts.c (vectorizable_load): Set invariant mask
34209 2021-01-04 Jakub Jelinek <jakub@redhat.com>
34211 PR tree-optimization/95771
34212 * tree-ssa-loop-niter.c (number_of_iterations_popcount): Handle types
34213 with precision smaller than int's precision and types with precision
34214 twice as large as long long. Formatting fixes.
34216 2021-01-04 Richard Biener <rguenther@suse.de>
34218 PR tree-optimization/98464
34219 * tree-ssa-sccvn.c (vn_valueize_for_srt): Rename from ...
34220 (vn_valueize_wrapper): ... this. Temporarily adjust vn_context_bb.
34221 (process_bb): Adjust.
34223 2021-01-04 Matthew Malcomson <matthew.malcomson@arm.com>
34226 * doc/invoke.texi (-fsanitize=address): Fix wording describing
34227 clash with -fsanitize=hwaddress.
34229 2021-01-04 Richard Biener <rguenther@suse.de>
34231 PR tree-optimization/98282
34232 * tree-ssa-sccvn.c (vn_get_stmt_kind): Classify tcc_reference on
34233 invariants as VN_NARY.
34235 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
34238 * config/aarch64/aarch64-simd.md (aarch64_combine<mode>): Accept
34239 aarch64_simd_reg_or_zero for operand 2. Use the combinez patterns
34240 to handle zero operands.
34242 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
34244 * config/aarch64/aarch64.c (offset_6bit_signed_scaled_p): New function.
34245 (offset_6bit_unsigned_scaled_p): Fix typo in comment.
34246 (aarch64_sve_prefetch_operand_p): Accept MUL VLs in the range
34249 2021-01-04 Richard Biener <rguenther@suse.de>
34251 PR tree-optimization/98393
34252 * tree-vect-slp.c (vect_build_slp_tree): Properly zero matches
34253 when hitting the limit.
34255 2021-01-04 Richard Biener <rguenther@suse.de>
34257 PR tree-optimization/98291
34258 * tree-vect-loop.c (vectorizable_reduction): Bypass
34259 associativity check for SLP reductions with VF 1.
34261 2021-01-04 Jakub Jelinek <jakub@redhat.com>
34263 PR tree-optimization/96782
34264 * match.pd (x == ~x -> false, x != ~x -> true): New simplifications.
34266 2021-01-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
34268 * collect-utils.c (collect_execute): Check dumppfx.
34269 * collect2.c (maybe_run_lto_and_relink, do_link): Pass atsuffix
34270 to collect_execute.
34271 (do_link): Add new parameter atsuffix.
34272 (main): Handle -dumpdir option. Skip one argument for
34273 -o, -isystem and -B options.
34274 * gcc.c (make_at_file): New helper function.
34275 (close_at_file): Use it.
34277 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34279 * config/darwin.h (MIN_LD64_NO_COAL_SECTS): Adjust.
34280 Amend handling for LD64_VERSION fallback defaults.
34282 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34284 * config.gcc: Compute default version information
34285 from the configured target. Likewise defaults for
34287 * config/darwin10.h: Removed.
34288 * config/darwin12.h: Removed.
34289 * config/darwin9.h: Removed.
34290 * config/rs6000/darwin8.h: Removed.
34292 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34294 * config/darwin9.h (ASM_OUTPUT_ALIGNED_COMMON): Delete.
34296 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34298 * config/darwin9.h (STACK_CHECK_STATIC_BUILTIN): Move from here ...
34299 * config/darwin.h (STACK_CHECK_STATIC_BUILTIN): ... to here.
34301 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34303 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move from
34305 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): ... to here.
34307 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34309 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move the spec
34310 for the Darwin10 unwinder stub from here ...
34311 * config/darwin.h (LINK_COMMAND_SPEC_A): ... to here.
34313 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34315 * config/darwin.h (DSYMUTIL_SPEC): Default to DWARF.
34316 (ASM_DEBUG_SPEC): Only define if the assembler supports
34318 (PREFERRED_DEBUGGING_TYPE): Default to DWARF.
34319 (DARWIN_PREFER_DWARF): Define.
34320 * config/darwin9.h (PREFERRED_DEBUGGING_TYPE): Remove.
34321 (DARWIN_PREFER_DWARF): Likewise.
34322 (DSYMUTIL_SPEC): Likewise.
34323 (COLLECT_RUN_DSYMUTIL): Likewise.
34324 (ASM_DEBUG_SPEC): Likewise.
34325 (ASM_DEBUG_OPTION_SPEC): Likewise.
34327 2021-01-02 Jan Hubicka <jh@suse.cz>
34329 * cfg.c (free_block): ggc_free bb.
34331 2021-01-01 Jakub Jelinek <jakub@redhat.com>
34333 * gcc.c (process_command): Update copyright notice dates.
34334 * gcov-dump.c (print_version): Ditto.
34335 * gcov.c (print_version): Ditto.
34336 * gcov-tool.c (print_version): Ditto.
34337 * gengtype.c (create_file): Ditto.
34338 * doc/cpp.texi: Bump @copying's copyright year.
34339 * doc/cppinternals.texi: Ditto.
34340 * doc/gcc.texi: Ditto.
34341 * doc/gccint.texi: Ditto.
34342 * doc/gcov.texi: Ditto.
34343 * doc/install.texi: Ditto.
34344 * doc/invoke.texi: Ditto.
34346 2021-01-01 Jakub Jelinek <jakub@redhat.com>
34348 * ChangeLog-2020: Rotate ChangeLog. New file.
34351 Copyright (C) 2021 Free Software Foundation, Inc.
34353 Copying and distribution of this file, with or without modification,
34354 are permitted in any medium without royalty provided the copyright
34355 notice and this notice are preserved.