1 2021-12-12 Jonathan Wakely <jwakely@redhat.com>
3 * Makefile.in: Remove unique-ptr-tests.o.
4 * selftest-run-tests.c (selftest::run_tests): Remove
5 unique_ptr_tests_cc_tests.
6 * selftest.h (unique_ptr_tests_cc_tests): Remove.
7 * system.h: Check INCLUDE_MEMORY instead of INCLUDE_UNIQUE_PTR
8 and include <memory> instead of "unique-ptr.h".
9 * unique-ptr-tests.cc: Removed.
11 2021-12-12 Roger Sayle <roger@nextmovesoftware.com>
13 * config/nvptx/nvptx-opts.h (ptx_isa): Add PTX_ISA_SM53 ISA level
15 * config/nvptx/nvptx.opt: Add sm_53 to -misa.
16 * config/nvptx/nvptx-modes.def: Add support for HFmode.
17 * config/nvptx/nvptx.h (TARGET_SM53):
18 New helper macro to conditionalize functionality on target ISA.
19 * config/nvptx/nvptx-c.c (nvptx_cpu_cpp_builtins): Add __PTX_SM__
20 support for the new ISA levels.
21 * config/nvptx/nvptx.c (nvtx_ptx_type_from_mode): Support new HFmode
22 with the ".f16" suffix/qualifier.
23 (nvptx_file_start): Add support for TARGET_SM53.
24 (nvptx_omp_device_kind_arch_isa): Add support for TARGET_SM53
25 and tweak TARGET_SM35.
26 (nvptx_scalar_mode_supported_p): Target hook with conditional
27 HFmode support on TARGET_SM53 and higher.
28 (nvptx_libgcc_floating_mode_supported_p): Likewise.
29 (TARGET_SCALAR_MODE_SUPPORTED_P): Use nvptx_scalar_mode_supported_p.
30 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Likewise, use new hook.
31 * config/nvptx/nvptx.md (*movhf_insn): New define_insn.
32 (movhf): New define_expand for HFmode moves.
33 (addhf3, subhf3, mulhf, extendhf<mode>2, trunc<mode>hf2): New
34 instructions conditional on TARGET_SM53 (i.e. -misa=sm_53).
36 2021-12-12 Jan Hubicka <hubicka@ucw.cz>
39 * ipa-modref.c (modref_access_analysis::analyze): Terminate BB
40 analysis on NULL memory access.
41 * ipa-pure-const.c (analyze_function): Likewise.
43 2021-12-11 Jan Hubicka <hubicka@ucw.cz>
45 * ipa-profile.c (ipa_profile): Do not update hot bb threshold.
47 2021-12-11 Jan Hubicka <hubicka@ucw.cz>
49 * ipa-modref.c (get_modref_function_summary): Use ultimate_alias_target.
50 (ignore_edge): Likewise.
51 (compute_parm_map): Likewise.
52 (modref_propagate_in_scc): Likewise.
53 (modref_propagate_flags_in_scc): Likewise.
55 2021-12-10 Jason Merrill <jason@redhat.com>
57 * symtab.c (symtab_node::equal_address_to): Fix comment typo.
59 2021-12-10 Doug Rupp <rupp@adacore.com>
61 * config/vxworks.h (LINK_SPEC): Remove %(link_target).
62 Change %{v:-v} to %{v:-V}.
64 2021-12-10 Olivier Hainque <hainque@adacore.com>
66 * config/t-vxworks: Remove assignment to STMP_FIXINC.
68 2021-12-10 Martin Liska <mliska@suse.cz>
70 * params.opt: Add missing dot.
72 2021-12-10 Roger Sayle <roger@nextmovesoftware.com>
75 * ipa-modref-tree.h (useful_for_kill_p): Zero width accesses aren't
76 useful for kill tracking.
78 2021-12-10 Andrew Stubbs <ams@codesourcery.com>
80 * config/gcn/mkoffload.c (process_asm): Process the variable table
81 completely differently.
82 (process_obj): Encode the varaible data differently.
84 2021-12-10 Joel Hutton <joel.hutton@arm.com>
86 PR tree-optimization/103523
87 * tree-vect-loop.c (vectorizable_induction): Check for
88 PLUS_EXPR/MINUS_EXPR support.
90 2021-12-10 Cui,Lili <lili.cui@intel.com>
92 * config/i386/i386.c (ix86_vector_costs::add_stmt_cost): Remove Tremont.
94 2021-12-09 Jan Hubicka <hubicka@ucw.cz>
96 * doc/invoke.texi (max-inline-functions-called-once-loop-depth,
97 max-inline-functions-called-once-insns): New parameters.
98 * ipa-inline.c (check_callers): Handle
99 param_inline_functions_called_once_loop_depth and
100 param_inline_functions_called_once_insns.
101 (edge_badness): Fix linebreaks.
102 * params.opt (param=max-inline-functions-called-once-loop-depth,
103 param=max-inline-functions-called-once-insn): New params.
105 2021-12-09 Martin Sebor <msebor@redhat.com>
107 PR tree-optimization/103215
108 * pointer-query.cc (access_ref::merge_ref): Extend the offset and
109 size of the merged object instead of using the larger.
111 2021-12-09 Martin Sebor <msebor@redhat.com>
114 * doc/extend.texi (attribute access): Adjust.
115 * gimple-ssa-warn-access.cc (pass_waccess::maybe_check_access_sizes):
116 Treat access mode none on a void* argument as expecting as few as
119 2021-12-09 Jakub Jelinek <jakub@redhat.com>
122 * config/aarch64/aarch64-sve-builtins.cc (gt_pch_nx): Change type of
123 second argument from function with 2 pointer arguments to function
124 with 3 pointer arguments.
126 2021-12-09 Olivier Hainque <hainque@adacore.com>
128 * config/aarch64/aarch64-vxworks.h (TARGET_OS_CPP_BUILTINS):
129 Use VX_CPU_PREFIX in CPU definitions.
131 2021-12-09 Martin Sebor <msebor@redhat.com>
133 * pointer-query.cc (access_ref::dump): Define new function
134 (pointer_query::dump): Call it.
135 * pointer-query.h (access_ref::dump): Declare new function.
137 2021-12-09 Martin Sebor <msebor@redhat.com>
139 * pointer-query.cc (compute_objsize_r): Add an argument.
140 (gimple_call_return_array): Pass a new argument to compute_objsize_r.
141 (access_ref::merge_ref): Same.
142 (access_ref::inform_access): Add an argument and use it.
143 (access_data::access_data): Initialize new member.
144 (handle_min_max_size): Pass a new argument to compute_objsize_r.
145 (handle_decl): New function.
146 (handle_array_ref): Pass a new argument to compute_objsize_r.
147 Avoid incrementing deref.
148 (set_component_ref_size): New function.
149 (handle_component_ref): New function.
150 (handle_mem_ref): Pass a new argument to compute_objsize_r.
151 Only increment deref after successfully computing object size.
152 (handle_ssa_name): New function.
153 (compute_objsize_r): Move code into helpers and call them.
154 (compute_objsize): Pass a new argument to compute_objsize_r.
155 * pointer-query.h (access_ref::inform_access): Add an argument.
156 (access_data::ostype): New member.
158 2021-12-09 Martin Sebor <msebor@redhat.com>
160 * pointer-query.cc (access_ref::merge_ref): Define new function.
161 (access_ref::get_ref): Move code into merge_ref and call it.
162 * pointer-query.h (access_ref::merge_ref): Declare new function.
164 2021-12-09 Martin Sebor <msebor@redhat.com>
166 * gimple-ssa-warn-restrict.c (builtin_access::builtin_access): Pass
167 GIMPLE statement to compute_objsize.
168 * pointer-query.cc (compute_objsize): Add a statement argument.
169 * pointer-query.h (compute_objsize): Define a new overload.
171 2021-12-09 Martin Sebor <msebor@redhat.com>
173 * gimple-ssa-warn-access.cc (check_access): Adjust to member name
175 (pass_waccess::check_strncmp): Same.
176 * pointer-query.cc (access_ref::access_ref): Remove arguments.
178 (access_data::access_data): Define new ctors.
179 (access_data::set_bound): Define new member function.
180 (compute_objsize_r): Remove unnecessary code.
181 * pointer-query.h (struct access_ref): Remove ctor arguments.
182 (struct access_data): Declare ctor overloads.
183 (access_data::dst_bndrng): New member.
184 (access_data::src_bndrng): New member.
186 2021-12-09 Martin Sebor <msebor@redhat.com>
189 * pointer-query.cc (gimple_call_return_array): Call compute_objsize_r.
191 2021-12-09 Olivier Hainque <hainque@adacore.com>
192 Rasmus Villemoes <rv@rasmusvillemoes.dk>
194 * Makefile.in (T_STDINT_GCC_H): New variable, path to
195 stdint-gcc.h that a target configuration may override when
196 use_gcc_stdint is "provide".
197 (stmp-int-hdrs): Depend on it and copy that for
199 * config.gcc (vxworks): Revert to use_gcc_stdint=provide.
200 * config/t-vxworks (T_STDINT_GCC_H): Define, as vxw-stdint-gcc.h.
201 (vxw-stdint-gcc.h): New target, produced from the original
203 (vxw-glimits.h): Use an automatic variable to designate the
204 first and only prerequisite.
205 * config/vxworks/stdint.h: Remove.
207 2021-12-09 Iain Sandoe <iain@sandoe.co.uk>
210 * config/host-darwin.c (SAFE_ALLOC_SIZE): Remove.
211 (darwin_gt_pch_get_address): Rework for relocatable PCH.
212 (darwin_gt_pch_use_address): Likewise.
214 2021-12-09 Jakub Jelinek <jakub@redhat.com>
217 * config/host-darwin.c (darwin_gt_pch_use_address): When reading
218 manually the file into mapped area, update mapped_addr as
219 an automatic variable rather than addr which is a reference parameter.
220 * config/host-hpux.c (hpux_gt_pch_use_address): When reading
221 manually the file into mapped area, update addr as
222 an automatic variable rather than base which is a reference parameter.
224 2021-12-09 Jakub Jelinek <jakub@redhat.com>
227 * coretypes.h (gt_pointer_operator): Use 3 pointer arguments instead
229 * gengtype.c (struct walk_type_data): Add in_nested_ptr argument.
230 (walk_type): Temporarily set d->in_nested_ptr around nested_ptr
232 (write_types_local_user_process_field): Pass a new middle pointer
233 to gt_pointer_operator op calls, if d->in_nested_ptr pass there
234 address of d->prev_val[2], otherwise NULL.
235 (write_types_local_process_field): Likewise.
236 * ggc-common.c (relocate_ptrs): Add real_ptr_p argument. If equal
237 to ptr_p, do nothing, otherwise if NULL remember ptr_p's
238 or if non-NULL real_ptr_p's corresponding new address in
240 (reloc_addrs_vec): New variable.
241 (compare_ptr, read_uleb128, write_uleb128): New functions.
242 (gt_pch_save): When iterating over objects through relocate_ptrs,
243 save current i into state.ptrs_i. Sort reloc_addrs_vec and emit
244 it as uleb128 of differences between pointer addresses into the
246 (gt_pch_restore): Allow restoring of PCH to a different address
247 than the preferred one, in that case adjust global pointers by bias
248 and also adjust by bias addresses read from the relocation table
249 as uleb128 differences. Otherwise fseek over it. Perform
250 gt_pch_restore_stringpool only after adjusting callbacks and for
251 callback adjustments also take into account the bias.
252 (default_gt_pch_use_address): Change type of first argument from
254 (mmap_gt_pch_use_address): Likewise.
255 * ggc-tests.c (gt_pch_nx): Pass NULL as new middle argument to op.
256 * hash-map.h (hash_map::pch_nx_helper): Likewise.
257 (gt_pch_nx): Likewise.
258 * hash-set.h (gt_pch_nx): Likewise.
259 * hash-table.h (gt_pch_nx): Likewise.
260 * hash-traits.h (ggc_remove::pch_nx): Likewise.
261 * hosthooks-def.h (default_gt_pch_use_address): Change type of first
262 argument from void * to void *&.
263 (mmap_gt_pch_use_address): Likewise.
264 * hosthooks.h (struct host_hooks): Change type of first argument of
265 gt_pch_use_address hook from void * to void *&.
266 * machmode.h (gt_pch_nx): Expect a callback with 3 pointers instead of
267 two in the middle argument.
268 * poly-int.h (gt_pch_nx): Likewise.
269 * stringpool.c (gt_pch_nx): Pass NULL as new middle argument to op.
270 * tree-cfg.c (gt_pch_nx): Likewise, except for LOCATION_BLOCK pass
271 the same &(block) twice.
272 * value-range.h (gt_pch_nx): Pass NULL as new middle argument to op.
273 * vec.h (gt_pch_nx): Likewise.
274 * wide-int.h (gt_pch_nx): Likewise.
275 * config/host-darwin.c (darwin_gt_pch_use_address): Change type of
276 first argument from void * to void *&.
277 * config/host-darwin.h (darwin_gt_pch_use_address): Likewise.
278 * config/host-hpux.c (hpux_gt_pch_use_address): Likewise.
279 * config/host-linux.c (linux_gt_pch_use_address): Likewise. If
280 it couldn't succeed to mmap at the preferred location, set base
281 to the actual one. Update addr in the manual reading loop instead of
283 * config/host-netbsd.c (netbsd_gt_pch_use_address): Change type of
284 first argument from void * to void *&.
285 * config/host-openbsd.c (openbsd_gt_pch_use_address): Likewise.
286 * config/host-solaris.c (sol_gt_pch_use_address): Likewise.
287 * config/i386/host-mingw32.c (mingw32_gt_pch_use_address): Likewise.
288 * config/rs6000/rs6000-gen-builtins.c (write_init_file): Pass NULL
289 as new middle argument to op in the generated code.
290 * doc/gty.texi: Adjust samples for the addition of middle pointer
291 to gt_pointer_operator callback.
293 2021-12-09 Alexandre Oliva <oliva@adacore.com>
296 * reg-stack.c (convert_regs_1): Move any_malformed_asm
298 (reg_to_stack): ... here.
300 2021-12-09 Alexandre Oliva <oliva@adacore.com>
303 * expr.c (emit_move_multi_word): Skip clobber during lra.
305 2021-12-09 Alexandre Oliva <oliva@adacore.com>
307 PR tree-optimization/103024
309 * gimple-harden-conditionals.cc (non_eh_succ_edge): New.
310 (pass_harden_compares::execute): Accept 1-bit integral types,
311 and cope with throwing compares.
313 2021-12-08 Iain Sandoe <iain@sandoe.co.uk>
315 * config/darwin.h (DARWIN_PIE_SPEC): Add -no_pie when
316 linking mdynamic-no-pic code on macOS > 10.7.
318 2021-12-08 Dimitar Dimitrov <dimitar@dinux.eu>
320 * config/pru/pru.c (pru_section_type_flags): New function.
321 (TARGET_SECTION_TYPE_FLAGS): Wire it.
323 2021-12-08 David Faust <david.faust@oracle.com>
325 * config/bpf/bpf.c (handle_attr_preserve): Avoid calling
326 is_gimple_assign with a NULL pointer.
328 2021-12-08 Martin Liska <mliska@suse.cz>
330 * profile-count.c (profile_count::dump): Add function
331 that can dump to a provided buffer.
332 (profile_probability::dump): Likewise.
333 * profile-count.h: Likewise.
334 * tree-ssa-loop-unswitch.c (tree_unswitch_single_loop):
335 Use dump_printf_loc infrastructure.
336 (tree_unswitch_outer_loop): Likewise.
337 (find_loop_guard): Likewise.
338 (hoist_guard): Likewise.
340 2021-12-08 Chung-Lin Tang <cltang@codesourcery.com>
342 * gimplify.c (extract_base_bit_offset): Add 'tree *offsetp' parameter,
343 accomodate case where 'offset' return of get_inner_reference is
345 (is_or_contains_p): Further robustify conditions.
346 (omp_target_reorder_clauses): In alloc/to/from sorting phase, also
347 move following GOMP_MAP_ALWAYS_POINTER maps along. Add new sorting
348 phase where we make sure pointers with an attach/detach map are ordered
350 (gimplify_scan_omp_clauses): Add modifications to avoid creating
351 GOMP_MAP_STRUCT and associated alloc map for attach/detach maps.
353 2021-12-08 Roger Sayle <roger@nextmovesoftware.com>
355 * config/nvptx/nvptx.md (*extend_trunc_<mode>2_qi,
356 *extend_trunc_<mode>2_hi, *extend_trunc_di2_si): New insns.
357 Use cvt to perform sign-extension of truncation in one step.
359 2021-12-08 Chung-Lin Tang <cltang@codesourcery.com>
362 * gimplify.c ("tree-hash-traits.h"): Add include.
363 (gimplify_scan_omp_clauses): Change struct_map_to_clause to type
364 hash_map<tree_operand, tree> *. Adjust struct map handling to handle
365 cases of *A and A->B expressions. Under !DECL_P case of
366 GOMP_CLAUSE_MAP handling, add STRIP_NOPS for indir_p case, add to
367 struct_deref_set for map(*ptr_to_struct) cases. Add MEM_REF case when
368 handling component_ref_p case. Add unshare_expr and gimplification
369 when created GOMP_MAP_STRUCT is not a DECL. Add code to add
370 firstprivate pointer for *pointer-to-struct case.
371 (gimplify_adjust_omp_clauses): Move GOMP_MAP_STRUCT removal code for
372 exit data directives code to earlier position.
373 * omp-low.c (lower_omp_target):
374 Handle GOMP_MAP_ATTACH_ZERO_LENGTH_ARRAY_SECTION, and
375 GOMP_MAP_POINTER_TO_ZERO_LENGTH_ARRAY_SECTION map kinds.
376 * tree-pretty-print.c (dump_omp_clause): Likewise.
378 2021-12-08 Andrew Stubbs <ams@codesourcery.com>
379 Hafiz Abid Qadeer <abidh@codesourcery.com>
381 * dwarf2cfi.c (dw_stack_pointer_regnum): Change type to struct cfa_reg.
382 (dw_frame_pointer_regnum): Likewise.
383 (new_cfi_row): Use set_by_dwreg.
384 (get_cfa_from_loc_descr): Use set_by_dwreg. Support register spans.
385 handle DW_OP_bregx with DW_OP_breg{0-31}. Support DW_OP_lit*,
386 DW_OP_const*, DW_OP_minus, DW_OP_shl and DW_OP_plus.
387 (lookup_cfa_1): Use set_by_dwreg.
388 (def_cfa_0): Update for cfa_reg and support register spans.
389 (reg_save): Change sreg parameter to struct cfa_reg. Support register
391 (dwf_cfa_reg): New function.
392 (dwarf2out_flush_queued_reg_saves): Use dwf_cfa_reg instead of
394 (dwarf2out_frame_debug_def_cfa): Likewise.
395 (dwarf2out_frame_debug_adjust_cfa): Likewise.
396 (dwarf2out_frame_debug_cfa_offset): Likewise. Update reg_save usage.
397 (dwarf2out_frame_debug_cfa_register): Likewise.
398 (dwarf2out_frame_debug_expr): Likewise.
399 (create_pseudo_cfg): Use set_by_dwreg.
400 (initial_return_save): Use set_by_dwreg and dwf_cfa_reg,
401 (create_cie_data): Use dwf_cfa_reg.
402 (execute_dwarf2_frame): Use dwf_cfa_reg.
403 (dump_cfi_row): Use set_by_dwreg.
404 * dwarf2out.c (build_span_loc, build_breg_loc): New function.
405 (build_cfa_loc): Support register spans.
406 (build_cfa_aligned_loc): Update cfa_reg usage.
407 (convert_cfa_to_fb_loc_list): Use set_by_dwreg.
408 * dwarf2out.h (struct cfa_reg): New type.
409 (struct dw_cfa_location): Use struct cfa_reg.
410 (build_span_loc): New prototype.
412 2021-12-08 Haochen Jiang <haochen.jiang@intel.com>
416 (*<sse4_1>_blendv<ssefltmodesuffix><avxsizesuffix>_not_ltint):
417 Add new define_insn_and_split.
419 2021-12-08 Alexandre Oliva <oliva@adacore.com>
422 * gimple-harden-conditionals.cc (detach_value): Use memory if
423 general regs won't do.
425 2021-12-07 Martin Liska <mliska@suse.cz>
428 * config/s390/s390.c (s390_valid_target_attribute_inner_p):
429 Use new enum CLVC_INTEGER.
430 * opt-functions.awk: Use new CLVC_INTEGER.
431 * opts-common.c (set_option): Likewise.
432 (option_enabled): Return -1,0,1 for CLVC_INTEGER.
433 (get_option_state): Use new CLVC_INTEGER.
434 (control_warning_option): Likewise.
435 * opts.h (enum cl_var_type): Likewise.
437 2021-12-07 H.J. Lu <hjl.tools@gmail.com>
440 * config/i386/i386.c (ix86_call_use_plt_p): Check FUNCTION_DECL
441 before calling cgraph_node::get.
443 2021-12-07 Richard Biener <rguenther@suse.de>
445 PR tree-optimization/103596
446 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
447 Note we are not propagating into a PHI argument to may_propagate_copy.
448 * tree-ssa-propagate.h (may_propagate_copy): Add
449 argument specifying whether we propagate into a PHI arg.
450 * tree-ssa-propagate.c (may_propagate_copy): Likewise.
451 When not doing so we can replace an abnormal with
453 (may_propagate_into_stmt): Update may_propagate_copy calls.
454 (replace_exp_1): Move propagation checking code to
455 propagate_value and rename to ...
456 (replace_exp): ... this and elide previous wrapper.
457 (propagate_value): Perform checking with adjusted
458 may_propagate_copy call and dispatch to replace_exp.
460 2021-12-07 Matthias Kretz <m.kretz@gsi.de>
462 * hash-map.h (hash_map::traverse): Let both overloads behave the
464 * predict.c (assert_is_empty): Return true, thus not changing
467 2021-12-07 YunQiang Su <yunqiang.su@cipunited.com>
469 * config/mips/mips.h (ISA_HAS_UNALIGNED_ACCESS, STRICT_ALIGNMENT):
470 R6 can unaligned access.
471 * config/mips/mips.md (movmisalign<mode>): Likewise.
472 * config/mips/mips.opt: add -m(no-)unaligned-access
473 * doc/invoke.texi: Likewise.
475 2021-12-07 Eugene Rozenfeld <erozen@microsoft.com>
477 * auto-profile.c (afdo_propagate_edge): Improve count propagation algorithm.
479 2021-12-06 Paul A. Clarke <pc@us.ibm.com>
482 * config/rs6000/xmmintrin.h (_mm_movemask_ps): Replace "vector" with
485 2021-12-06 Jose E. Marchesi <jose.marchesi@oracle.com>
487 * config/bpf/bpf.c (bpf_handle_preserve_access_index_attribute):
488 Mark arguments `args' and flags' as unused.
489 (bpf_core_newdecl): Remove unused local `newdecl'.
490 (bpf_core_newdecl): Remove unused argument `loc'.
491 (ctfc_debuginfo_early_finish_p): Remove unused function.
492 (TARGET_CTFC_DEBUGINFO_EARLY_FINISH_P): Remove definition.
493 (bpf_core_walk): Do not pass a location to bpf_core_newdecl.
495 2021-12-06 Richard Sandiford <richard.sandiford@arm.com>
497 * gimple-range-edge.cc (gimple_outgoing_range::edge_range_p): Add
498 a shortcut for blocks with single successors.
499 * gimple-range-gori.cc (gori_map::calculate_gori): Likewise.
501 2021-12-06 Richard Sandiford <richard.sandiford@arm.com>
503 * value-range.cc (irange::irange_union): Use quick_push rather
504 than safe_push. Use widest_int rather than wide_int. Avoid
505 assigning wi::to_* results to wide*_int temporaries.
507 2021-12-06 Andrew MacLeod <amacleod@redhat.com>
509 * gimple-range-cache.cc (ranger_cache::fill_block_cache): Check for
510 a range from dominators before filling the cache.
511 (ranger_cache::range_from_dom): New.
512 * gimple-range-cache.h (ranger_cache::range_from_dom): Add prototype.
514 2021-12-06 Andrew MacLeod <amacleod@redhat.com>
516 * gimple-range-gori.h (class gori_compute):: Add prototypes.
517 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Add alternate
518 API for basic block. Call for edge alterantive.
519 (gori_compute::may_recompute_p): Ditto.
521 2021-12-06 Richard Biener <rguenther@suse.de>
523 PR tree-optimization/103581
524 * tree-vect-stmts.c (vect_build_gather_load_calls): Properly
525 guard all the AVX512 mask cases.
527 2021-12-06 Richard Biener <rguenther@suse.de>
529 PR tree-optimization/103544
530 * tree-vect-slp.c (vect_analyze_slp): Only add a SLP reduction
531 opportunity if the stmt in question is the reduction root.
532 (dot_slp_tree): Add missing check for NULL child.
534 2021-12-06 Jakub Jelinek <jakub@redhat.com>
537 * config/avr/avr.c (avr_output_data_section_asm_op,
538 avr_output_bss_section_asm_op): Change argument type from const void *
541 2021-12-06 Tamar Christina <tamar.christina@arm.com>
543 PR rtl-optimization/103404
544 * cse.c (find_sets_in_insn): Don't select elements out of a V1 mode
547 2021-12-06 Hongtao Liu <Hongtao.liu@intel.com>
548 Uroš Bizjak <ubizjak@gmail.com>
551 * config/i386/i386.c (ix86_preferred_reload_class): Allow
552 integer regs when moves between register units are cheap.
553 * config/i386/i386.h (INT_SSE_CLASS_P): New.
555 2021-12-05 David Edelsohn <dje.gcc@gmail.com>
557 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
558 AIX math builtin initialization before new_builtins_are_live.
560 2021-12-04 Jakub Jelinek <jakub@redhat.com>
562 * config/i386/x86-tune.def (X86_TUNE_PARTIAL_REG_DEPENDENCY): Fix
563 comment typo, Preffer -> prefer.
564 * ipa-modref-tree.c (modref_access_node::closer_pair_p): Likewise.
566 2021-12-04 Alexandre Oliva <oliva@adacore.com>
568 PR rtl-optimization/103028
569 * ifcvt.c (find_cond_trap): Validate new insns more strictly
572 2021-12-03 Martin Liska <mliska@suse.cz>
573 Peter Bergner <bergner@linux.ibm.com>
576 * config/rs6000/rs6000.c (rs6000_option_override_internal): Move the
577 disabling of shrink-wrapping when using -mrop-protect from here...
578 (rs6000_override_options_after_change): ...to here.
580 2021-12-03 H.J. Lu <hjl.tools@gmail.com>
583 * config/i386/i386-expand.c (ix86_expand_builtin): Pass PVW_NONE
584 and PVW_NONE to ix86_target_string.
585 * config/i386/i386-options.c (ix86_target_string): Add arguments
586 for move_max and store_max.
587 (ix86_target_string::add_vector_width): New lambda.
588 (ix86_debug_options): Pass ix86_move_max and ix86_store_max to
590 (ix86_function_specific_print): Pass ptr->x_ix86_move_max and
591 ptr->x_ix86_store_max to ix86_target_string.
592 (ix86_valid_target_attribute_tree): Handle x_ix86_move_max and
594 (ix86_option_override_internal): Set the default x_ix86_move_max
595 and x_ix86_store_max.
596 * config/i386/i386-options.h (ix86_target_string): Add
597 prefer_vector_width and prefer_vector_width.
598 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): Removed.
599 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
600 (MOVE_MAX): Use 64 if ix86_move_max or ix86_store_max ==
601 PVW_AVX512. Use 32 if ix86_move_max or ix86_store_max >=
603 (STORE_MAX_PIECES): Use 64 if ix86_store_max == PVW_AVX512.
604 Use 32 if ix86_store_max >= PVW_AVX256.
605 * config/i386/i386.opt: Add -mmove-max=bits and -mstore-max=bits.
606 * config/i386/x86-tune.def (X86_TUNE_AVX512_MOVE_BY_PIECES): New.
607 (X86_TUNE_AVX512_STORE_BY_PIECES): Likewise.
608 * doc/invoke.texi: Document -mmove-max=bits and -mstore-max=bits.
610 2021-12-03 Bill Schmidt <wschmidt@linux.ibm.com>
612 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Fix builtin
615 2021-12-03 SiYu Wu <siyu@isrc.iscas.ac.cn>
617 * common/config/riscv/riscv-common.c (riscv_implied_info):
618 Add K-ext related entry.
619 (riscv_supported_std_ext): Add 'k'.
620 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add 'k'.
621 (IMPLIED_EXT): Add K-ext related entry.
623 2021-12-03 SiYu Wu <siyu@isrc.iscas.ac.cn>
625 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
627 * config/riscv/riscv-opts.h (MASK_ZBKB): New.
637 (TARGET_ZBKB): Ditto.
638 (TARGET_ZBKC): Ditto.
639 (TARGET_ZBKX): Ditto.
640 (TARGET_ZKNE): Ditto.
641 (TARGET_ZKND): Ditto.
642 (TARGET_ZKNH): Ditto.
644 (TARGET_ZKSED): Ditto.
645 (TARGET_ZKSH): Ditto.
647 * config/riscv/riscv.opt (riscv_zk_subext): New.
649 2021-12-03 Tamar Christina <tamar.christina@arm.com>
651 * tree-vect-stmts.c (prepare_load_store_mask): Rename to...
652 (prepare_vec_mask): ...This and record operations that have already been
654 (vectorizable_call): Use it.
655 (vectorizable_operation): Likewise.
656 (vectorizable_store): Likewise.
657 (vectorizable_load): Likewise.
658 * tree-vectorizer.h (class _loop_vec_info): Add vec_cond_masked_set.
659 (vec_cond_masked_set_type, tree_cond_mask_hash): New.
661 2021-12-03 H.J. Lu <hjl.tools@gmail.com>
665 * target.def (ifunc_ref_local_ok): Add a target hook.
666 * varasm.c (default_binds_local_p_3): Force indirect function
667 resolver non-local only if targetm.ifunc_ref_local_ok returns
669 * config/i386/i386-expand.c (ix86_expand_call): Call
670 ix86_call_use_plt_p to check if PLT should be used.
671 * config/i386/i386-protos.h (ix86_call_use_plt_p): New.
672 * config/i386/i386.c (output_pic_addr_const): Call
673 ix86_call_use_plt_p to check if "@PLT" is needed.
674 (ix86_call_use_plt_p): New.
675 (TARGET_IFUNC_REF_LOCAL_OK): New.
676 * doc/tm.texi.in: Add TARGET_IFUNC_REF_LOCAL_OK.
677 * doc/tm.texi: Regenerated.
679 2021-12-03 Jakub Jelinek <jakub@redhat.com>
681 * attribs.h (simple_cst_list_equal): Declare.
682 * attribs.c (simple_cst_list_equal): No longer static.
683 * config/i386/i386-options.c (target_attribute_cache): New variable.
684 (ix86_valid_target_attribute_p): Cache DECL_FUNCTION_SPECIFIC_TARGET
685 and DECL_FUNCTION_SPECIFIC_OPTIMIZATION based on args.
687 2021-12-03 Jakub Jelinek <jakub@redhat.com>
690 * ggc.h (gt_pch_note_callback): Declare.
691 * gengtype.h (enum typekind): Add TYPE_CALLBACK.
692 (callback_type): Declare.
693 * gengtype.c (dbgprint_count_type_at): Handle TYPE_CALLBACK.
694 (callback_type): New variable.
695 (process_gc_options): Add CALLBACK argument, handle callback
697 (set_gc_used_type): Adjust process_gc_options caller, if callback,
698 set type to &callback_type.
699 (output_mangled_typename): Handle TYPE_CALLBACK.
700 (walk_type): Likewise. Handle callback option.
701 (write_types_process_field): Handle TYPE_CALLBACK.
702 (write_types_local_user_process_field): Likewise.
703 (write_types_local_process_field): Likewise.
704 (write_root): Likewise.
705 (dump_typekind): Likewise.
706 (dump_type): Likewise.
707 * gengtype-state.c (type_lineloc): Handle TYPE_CALLBACK.
708 (state_writer::write_state_callback_type): New method.
709 (state_writer::write_state_type): Handle TYPE_CALLBACK.
710 (read_state_callback_type): New function.
711 (read_state_type): Handle TYPE_CALLBACK.
712 * ggc-common.c (callback_vec): New variable.
713 (gt_pch_note_callback): New function.
714 (gt_pch_save): Stream out gt_pch_save function address and relocation
716 (gt_pch_restore): Stream in saved gt_pch_save function address and
717 relocation table and apply relocations if needed.
718 * doc/gty.texi (callback): Document new GTY option.
719 * varasm.c (get_unnamed_section): Change callback argument's type and
720 last argument's type from const void * to const char *.
721 (output_section_asm_op): Change argument's type from const void *
722 to const char *, remove unnecessary cast.
723 * tree-core.h (struct tree_translation_unit_decl): Drop GTY((skip))
724 from language member.
725 * output.h (unnamed_section_callback): Change argument type from
726 const void * to const char *.
727 (struct unnamed_section): Use GTY((callback)) instead of GTY((skip))
728 for callback member. Change data member type from const void *
730 (struct noswitch_section): Use GTY((callback)) instead of GTY((skip))
732 (get_unnamed_section): Change callback argument's type and
733 last argument's type from const void * to const char *.
734 (output_section_asm_op): Change argument's type from const void *
736 * config/avr/avr.c (avr_output_progmem_section_asm_op): Likewise.
737 Remove unneeded cast.
738 * config/darwin.c (output_objc_section_asm_op): Change argument's type
739 from const void * to const char *.
740 * config/pa/pa.c (som_output_text_section_asm_op): Likewise.
741 (som_output_comdat_data_section_asm_op): Likewise.
742 * config/rs6000/rs6000.c (rs6000_elf_output_toc_section_asm_op):
744 (rs6000_xcoff_output_readonly_section_asm_op): Likewise. Instead
745 of dereferencing directive hardcode variable names and decide based on
746 whether directive is NULL or not.
747 (rs6000_xcoff_output_readwrite_section_asm_op): Change argument's type
748 from const void * to const char *.
749 (rs6000_xcoff_output_tls_section_asm_op): Likewise. Instead
750 of dereferencing directive hardcode variable names and decide based on
751 whether directive is NULL or not.
752 (rs6000_xcoff_output_toc_section_asm_op): Change argument's type
753 from const void * to const char *.
754 (rs6000_xcoff_asm_init_sections): Adjust get_unnamed_section callers.
756 2021-12-03 Richard Purdie <richard.purdie@linuxfoundation.org>
758 * Makefile.in: Fix "argument list too long" from install-plugins.
760 2021-12-03 Matt Jacobson <mhjacobson@me.com>
762 * config.gcc: For the AVR target, populate TM_MULTILIB_CONFIG.
763 * config/avr/genmultilib.awk: Add ability to filter generated multilib
765 * config/avr/t-avr: Pass TM_MULTILIB_CONFIG to genmultilib.awk.
766 * configure.ac: Update help string for --with-multilib-list.
767 * configure: Regenerate.
769 2021-12-02 Ilya Lipnitskiy <ilya.lipnitskiy@gmail.com>
771 * configure: Regenerate.
772 * configure.ac: Define TARGET_LIBC_GNUSTACK on musl.
774 2021-12-02 Iain Sandoe <iain@sandoe.co.uk>
777 * config/host-darwin.c (TRY_EMPTY_VM_SPACE,
778 SAFE_ALLOC_SIZE): New.
779 (darwin_gt_pch_get_address): Rewrite to use nominated
780 memory segments rather than part of the compiler __DATA
782 (darwin_gt_pch_use_address): Likewise.
784 2021-12-02 Gerald Pfeifer <gerald@pfeifer.com>
786 * doc/install.texi (*-*-freebsd*): Remove references to
787 FreeBSD 1 and FreeBSD 2.
789 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
791 PR rtl-optimization/103437
792 * ira-color.c (setup_allocno_priorities): Switch off backup code
793 for overflow if compiler has __builtin_smul_overflow. Use <
794 for comparison with -INT_MAX.
796 2021-12-02 Bill Schmidt <wschmidt@linux.ibm.com>
798 * config/rs6000/altivec.h: Delete a number of #defines that are now
799 superfluous. Alphabetize. Include rs6000-vecdefines.h. Include some
801 * config/rs6000/rs6000-builtin-new.def (CMPB): Flag as no32bit.
802 (BPERMD): Flag as 32bit (needing special handling for 32-bit).
803 (UNPACK_TD): Return unsigned long long instead of unsigned long.
804 (GET_TEXASR): Return unsigned long instead of unsigned long long.
805 (GET_TEXASRU): Likewise.
806 (GET_TFHAR): Likewise.
807 (GET_TFIAR): Likewise.
808 (SET_TEXASR): Pass unsigned long instead of unsigned long long.
809 (SET_TEXASRU): Likewise.
810 (SET_TFHAR): Likewise.
811 (SET_TFIAR): Likewise.
812 (TABORTDC): Likewise.
813 (TABORTDCI): Likewise.
814 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): Fix error
815 handling for no32bit. Add 32bit handling for RS6000_BIF_BPERMD.
816 * config/rs6000/rs6000-gen-builtins.c (write_init_file): Initialize
817 new_builtins_are_live to 1.
819 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
821 PR rtl-optimization/103437
822 * ira-color.c (setup_allocno_priorities): Use long long
823 multiplication as backup for overflow processing.
825 2021-12-02 Tamar Christina <tamar.christina@arm.com>
827 * config/aarch64/aarch64-protos.h (aarch64_gen_shareable_zero): New.
828 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>,
829 aarch64_rshrn2<mode>): Generate rounding half-ing add when appropriate.
830 * config/aarch64/aarch64.c (aarch64_gen_shareable_zero): New.
832 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
834 PR rtl-optimization/103437
835 * ira-color.c (setup_allocno_priorities): Process multiplication
838 2021-12-02 Martin Liska <mliska@suse.cz>
840 PR gcov-profile/96092
841 * common.opt: New option.
842 * coverage.c (coverage_begin_function): Emit filename with
843 remap_profile_filename.
844 * doc/invoke.texi: Document the new option.
845 * file-prefix-map.c (add_profile_prefix_map): New.
846 (remap_profile_filename): Likewise.
847 * file-prefix-map.h (add_profile_prefix_map): Likewise.
848 (remap_profile_filename): Likewise.
849 * lto-opts.c (lto_write_options): Handle
850 OPT_fprofile_prefix_map_.
851 * opts-global.c (handle_common_deferred_options): Likewise.
852 * opts.c (common_handle_option): Likewise.
853 (gen_command_line_string): Likewise.
854 * profile.c (output_location): Emit filename with
855 remap_profile_filename.
857 2021-12-02 Richard Biener <rguenther@suse.de>
860 * internal-fn.c (expand_DEFERRED_INIT): When the base
861 of the LHS is a decl with matching constant size use
862 that as the initialization target instead of an
863 eventual VLA typed one.
865 2021-12-02 Richard Biener <rguenther@suse.de>
867 PR tree-optimization/103527
868 * tree-vect-stmts.c (vect_build_gather_load_calls): Always
869 use a truth type for building the vector mask.
871 2021-12-02 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
873 * config/vxworks.h (LTO_PLUGIN_SPEC): Adapt to corrected
874 spelling of -linker-output-auto-nolto-rel.
876 2021-12-01 Uroš Bizjak <ubizjak@gmail.com>
879 * config/i386/sse.md (VI2F): Remove mode iterator.
880 (VI2F_256_512): New mode iterator.
881 (vec_set<V8_128:mode>_0): New insn pattern.
882 (vec_set<VI2F_256_512:mode>_0>): Rename from vec_set<VI2F:mode>mode.
883 Use VI2F_256_512 mode iterator instead of VI2F.
884 (*axv512fp16_movsh): Remove.
885 (<sse2p4_1>_pinsr<ssemodesuffix>): Add (x,x,x) AVX2 alternative.
886 Do not disable V8HF mode insn on AVX2 targets.
887 (pinsrw -> pbroadcast + pblendw peephole2): New peephole.
888 (pinsrw -> pbroadcast + pblendw splitter): New post-reload splitter.
889 * config/i386/i386.md (extendhfsf): Call gen_vec_setv8hf_0.
890 * config/i386/i386-expand.c (ix86_expand_vector_set)
891 <case E_V8HFmode>: Use vec_merge path for TARGET_AVX2.
893 2021-12-01 Roger Sayle <roger@nextmovesoftware.com>
894 Richard Biener <rguenther@suse.de>
896 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
897 Check if simplify_using_initial_conditions allows us to
898 simplify the expression for may_be_zero.
899 * match.pd (X != C ? -X : -C -> -X): New transform.
900 (X != C ? ~X : ~C -> ~X): Likewise.
901 ((X+1) > Y ? -X : 1 -> X >= Y ? -X : 1): Likewise.
903 2021-12-01 Peter Bergner <bergner@linux.ibm.com>
906 * gimplify.c (is_var_need_auto_init): Handle opaque types.
908 2021-12-01 Iain Buclaw <ibuclaw@gdcproject.org>
910 * common.opt (static-libphobos): Add option.
911 * config/darwin.h (LINK_SPEC): Substitute -lgphobos with libgphobos.a
912 when linking statically.
913 * gcc.c (driver_handle_option): Set -static-libphobos as always valid.
915 2021-12-01 Jason Merrill <jason@redhat.com>
918 * fold-const.c (maybe_nonzero_address): Use get_create or get
919 depending on folding_initializer.
920 (fold_binary_initializer_loc): New.
921 * fold-const.h (fold_binary_initializer_loc): Declare.
922 * varasm.c (mark_weak): Don't use the decl location.
923 * doc/invoke.texi: Document -fconstexpr-fp-except.
925 2021-12-01 Bill Schmidt <wschmidt@linux.ibm.com>
928 * config/rs6000/rs6000-c.c (rs6000-builtins.h): Stop including.
929 (rs6000-internal.h): Include.
930 (altivec_resolve_new_overloaded_builtin): Move call to
931 rs6000_invalid_new_builtin here from rs6000_new_builtin_decl.
932 * config/rs6000/rs6000-call.c (rs6000-builtins.h): Stop including.
933 (rs6000_invalid_new_builtin): Remove static qualifier.
934 (rs6000_new_builtin_decl): Remove test for supported builtin.
935 * config/rs6000/rs6000-internal.h (rs6000-builtins.h): Include.
936 (rs6000_invalid_new_builtin): Declare.
937 * config/rs6000/rs6000.c (rs6000-builtins.h): Don't include.
939 2021-12-01 Aldy Hernandez <aldyh@redhat.com>
941 PR tree-optimization/103409
942 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
943 Do all the work with just one ssa_global_cache.
944 * gimple-range-path.h: Remove m_tmp_phi_cache.
946 2021-12-01 Richard Sandiford <richard.sandiford@arm.com>
948 PR tree-optimization/103517
949 * tree-vect-slp.c (vect_build_slp_tree_1): When allowing two
950 different component references, check the codes of both them,
951 rather than just the first.
953 2021-12-01 Andrew MacLeod <amacleod@redhat.com>
955 PR tree-optimization/103464
956 * gimple-range.cc (gimple_ranger::prefill_name): Process phis also.
957 (gimple_ranger::prefill_stmt_dependencies): Ditto.
959 2021-12-01 Iain Buclaw <ibuclaw@gdcproject.org>
961 * doc/install.texi (Prerequisites): Add note that D front end now
962 requires GDC installed in order to bootstrap.
963 (Building): Add D compiler section, referencing prerequisites.
965 2021-12-01 Jakub Jelinek <jakub@redhat.com>
967 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE,
968 X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY,
969 X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Formatting fixes.
970 (X86_TUNE_USE_GATHER): Put m_GENERIC last for consistency.
972 2021-12-01 Jakub Jelinek <jakub@redhat.com>
974 PR rtl-optimization/102356
975 * rtl.h (simplify_context): Add assoc_count member and
976 max_assoc_count static member.
977 * simplify-rtx.c (simplify_associative_operation): Don't reassociate
978 more than max_assoc_count times within one outermost simplify_* call.
979 * dwarf2out.c (mem_loc_descriptor): Optimize binary operation
980 with both operands the same using DW_OP_dup.
982 2021-12-01 Tamar Christina <tamar.christina@arm.com>
984 * match.pd: Move below pattern that rewrites to EQ, NE.
985 * tree.c (bitmask_inv_cst_vector_p): Correct do .. while indentation.
987 2021-12-01 Siddhesh Poyarekar <siddhesh@gotplt.org>
988 Martin Liška <mliska@suse.cz>
990 PR tree-optimization/103456
991 * tree-object-size.c (merge_object_sizes): Update osi->changed
992 only if object_sizes_set succeeded.
994 2021-11-30 liuhongt <hongtao.liu@intel.com>
998 * config/i386/i386.md (*x86_64_shld_1): Set_attr
999 length_immediate to 1.
1000 (*x86_shld_1): Ditto.
1001 (*x86_64_shrd_1): Ditto.
1002 (*x86_shrd_1): Ditto.
1004 2021-11-30 Bill Schmidt <wschmidt@linux.ibm.com>
1006 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
1008 (altivec_resolve_new_overloaded_builtin): Likewise.
1010 2021-11-30 Martin Jambor <mjambor@suse.cz>
1013 * ipa-sra.c (scan_function): Also check ECF_LOOPING_CONST_OR_PURE flag.
1015 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1017 PR tree-optimization/103494
1018 * tree-vect-stmts.c (vect_get_gather_scatter_ops): Remove ncopies
1019 argument and calculate ncopies from gs_info->offset_vectype
1021 (vectorizable_store, vectorizable_load): Update accordingly.
1023 2021-11-30 Iain Buclaw <ibuclaw@gdcproject.org>
1025 * Makefile.in (GDC): New variable.
1026 (GDCFLAGS): New variable.
1027 * configure: Regenerate.
1028 * configure.ac: Add call to ACX_PROG_GDC. Substitute GDCFLAGS.
1030 2021-11-30 Martin Liska <mliska@suse.cz>
1031 Martin Jambor <mjambor@suse.cz>
1034 * ipa-param-manipulation.c
1035 (ipa_param_body_adjustments::prepare_debug_expressions): Be
1036 careful about hash_map reallocating itself. Simpify a return
1037 which always returns true.
1039 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
1041 PR tree-optimization/103440
1042 * gimple-range-fold.cc (fold_using_range::range_of_phi): Continue
1043 normal param processing for equiv params.
1045 2021-11-30 Richard Biener <rguenther@suse.de>
1047 * config/i386/i386.c (ix86_shift_rotate_cost): Remove
1049 * tree-chrec.c (evolution_function_is_invariant_rec_p):
1051 * tree-if-conv.c (if_convertible_stmt_p): Likewise.
1052 * tree-ssa-pre.c (fully_constant_expression): Likewise.
1053 * tree-vrp.c (operand_less_p): Likewise.
1054 * reload.c (reg_overlap_mentioned_for_reload_p): Remove
1055 unreachable gcc_unreachable ().
1056 * sel-sched-ir.h (bb_next_bb): Likewise.
1057 * varasm.c (compare_constant): Likewise.
1059 2021-11-30 Richard Biener <rguenther@suse.de>
1061 PR tree-optimization/103489
1062 * tree-vect-loop.c (vectorizable_phi): Verify argument
1063 vector type compatibility to mitigate bool pattern recog
1066 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
1068 * omp-offload.c (oacc_loop_auto_partitions): Remove erroneous
1069 "Orphan reductions cannot have gang partitioning" handling.
1071 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
1073 * omp-offload.c (oacc_loop_process): Implement "gang reduction on
1074 an orphan loop" checking.
1076 2021-11-30 Cesar Philippidis <cesar@codesourcery.com>
1077 Thomas Schwinge <thomas@codesourcery.com>
1079 * omp-general.h (enum oacc_loop_flags): Add OLF_REDUCTION enum.
1080 * omp-low.c (lower_oacc_head_mark): Use it to mark OpenACC
1082 * omp-offload.c (oacc_loop_auto_partitions): Don't assign gang
1083 level parallelism to orphan reductions.
1085 2021-11-30 Aldy Hernandez <aldyh@redhat.com>
1087 PR tree-optimization/103451
1088 * range-op.cc (operator_div::wi_fold): Remove
1089 can_throw_non_call_exceptions special case.
1090 * tree-ssa-sink.c (sink_code_in_bb): Same.
1092 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1094 * tree-vect-slp.c (arg1_arg4_map): New variable.
1095 (vect_get_operand_map): Handle IFN_MASK_GATHER_LOAD.
1096 (vect_build_slp_tree_1): Likewise.
1097 (vect_build_slp_tree_2): Likewise.
1098 * tree-vect-stmts.c (vectorizable_load): Expect the mask to be
1099 the last SLP child node rather than the first.
1101 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1103 * tree-if-conv.c: Include tree-eh.h.
1104 (predicate_statements): Remove pe argument. Don't hoist
1106 (combine_blocks): Remove pe argument.
1107 (ifcvt_available_on_edge_p, ifcvt_can_hoist): New functions.
1108 (ifcvt_hoist_invariants): Likewise.
1109 (tree_if_conversion): Update call to combine_blocks. Call
1110 ifcvt_hoist_invariants after VN.
1112 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1114 PR tree-optimization/102467
1115 * doc/sourcebuild.texi (vect_gather_load_ifn): Document.
1116 * tree-vect-data-refs.c (vect_analyze_data_ref_dependence):
1117 Commonize safelen handling. Punt for anything involving
1118 gathers and scatters unless safelen says otherwise.
1119 * tree-vect-slp.c (arg1_map): New variable.
1120 (vect_get_operand_map): Handle IFN_GATHER_LOAD.
1121 (vect_build_slp_tree_1): Likewise.
1122 (vect_build_slp_tree_2): Likewise.
1123 (compatible_calls_p): If vect_get_operand_map returns nonnull,
1124 check that any skipped arguments are equal.
1125 (vect_slp_analyze_node_operations_1): Tighten reduction check.
1126 * tree-vect-stmts.c (check_load_store_for_partial_vectors): Take
1127 an ncopies argument.
1128 (vect_get_gather_scatter_ops): Take slp_node and ncopies arguments.
1130 (vectorizable_store, vectorizable_load): Adjust accordingly.
1132 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1134 * gimple.h (gimple_num_args, gimple_arg, gimple_arg_ptr): New
1136 * tree-vect-slp.c (cond_expr_maps, arg2_map): New variables.
1137 (vect_get_operand_map): New function.
1138 (vect_get_and_check_slp_defs): Fix outdated comment.
1139 Use vect_get_operand_map and new gimple argument accessors.
1140 (vect_build_slp_tree_2): Likewise.
1142 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1144 * tree-vect-slp.c (vect_build_slp_tree_1): Use code_helper
1145 to record the operations performed by statements, only using
1146 CALL_EXPR for things that don't map to built-in or internal
1147 functions. For shifts, require all shift amounts to be equal
1148 if optab_vector is not supported but optab_scalar is.
1150 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1152 * tree-vect-data-refs.c (vect_check_gather_scatter): Continue
1153 processing conversions if the current offset is a pointer.
1155 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1157 * tree-vectorizer.h (vect_is_reduction): Use STMT_VINFO_REDUC_IDX.
1159 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1161 * genopinit.c (main): Turn supports_vec_gather_load and
1162 supports_vec_scatter_store into signed char arrays and remove
1163 supports_vec_gather_load_cached and supports_vec_scatter_store_cached.
1164 * optabs-query.c (supports_vec_convert_optab_p): Add a mode parameter.
1165 If the mode is not VOIDmode, test only for that mode.
1166 (supports_vec_gather_load_p): Likewise.
1167 (supports_vec_scatter_store_p): Likewise.
1168 * optabs-query.h (supports_vec_gather_load_p): Likewise.
1169 (supports_vec_scatter_store_p): Likewise.
1170 * tree-vect-data-refs.c (vect_check_gather_scatter): Pass the
1171 vector mode to supports_vec_gather_load_p and
1172 supports_vec_scatter_store_p.
1174 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1176 * internal-fn.c (commutative_binary_fn_p): Handle IFN_ADD_OVERFLOW
1177 and IFN_MUL_OVERFLOW.
1179 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1181 * internal-fn.c (commutative_binary_fn_p): Handle IFN_UBSAN_CHECK_ADD
1182 and IFN_UBSAN_CHECK_MUL.
1184 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1186 * internal-fn.c (commutative_binary_fn_p): Handle IFN_COMPLEX_MUL.
1188 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1190 * gimple-fold.c: Include internal-fn.h.
1191 (fold_stmt_1): If a function maps to an internal one, use
1192 first_commutative_argument to canonicalize the order of
1193 commutative arguments.
1194 * gimple-match-head.c (gimple_resimplify2, gimple_resimplify3)
1195 (gimple_resimplify4, gimple_resimplify5): Extend commutativity
1196 checks to functions.
1198 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1200 * doc/md.texi (reduc_fmin_scal_@var{m}): Document.
1201 (reduc_fmax_scal_@var{m}): Likewise.
1202 * optabs.def (reduc_fmax_scal_optab): New optab.
1203 (reduc_fmin_scal_optab): Likewise
1204 * internal-fn.def (REDUC_FMAX, REDUC_FMIN): New functions.
1205 * tree-vect-loop.c (reduction_fn_for_scalar_code): Handle
1206 CASE_CFN_FMAX and CASE_CFN_FMIN.
1207 (neutral_op_for_reduction): Likewise.
1208 (needs_fold_left_reduction_p): Likewise.
1209 * config/aarch64/iterators.md (FMAXMINV): New iterator.
1210 (fmaxmin): Handle UNSPEC_FMAXNMV and UNSPEC_FMINNMV.
1211 * config/aarch64/aarch64-simd.md (reduc_<optab>_scal_<mode>): Fix
1213 (reduc_<fmaxmin>_scal_<mode>): New pattern.
1214 * config/aarch64/aarch64-sve.md (reduc_<fmaxmin>_scal_<mode>):
1217 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1219 * builtins.h (associated_internal_fn): Declare overload that
1220 takes a (combined_cfn, return type) pair.
1221 * builtins.c (associated_internal_fn): Split new overload out
1222 of original fndecl version. Also provide an overload that takes
1223 a (combined_cfn, return type) pair.
1224 * internal-fn.h (commutative_binary_fn_p): Declare.
1225 (commutative_ternary_fn_p): Likewise.
1226 (associative_binary_fn_p): Likewise.
1227 * internal-fn.c (commutative_binary_fn_p, commutative_ternary_fn_p):
1228 New functions, split out from...
1229 (first_commutative_argument): ...here.
1230 (associative_binary_fn_p): New function.
1231 * gimple-match.h (code_helper): Add a constructor that takes
1233 (commutative_binary_op_p): Declare.
1234 (commutative_ternary_op_p): Likewise.
1235 (first_commutative_argument): Likewise.
1236 (associative_binary_op_p): Likewise.
1237 (canonicalize_code): Likewise.
1238 (directly_supported_p): Likewise.
1239 (get_conditional_internal_fn): Likewise.
1240 (gimple_build): New overloads that takes a code_helper.
1241 * gimple-fold.c (gimple_build): Likewise.
1242 * gimple-match-head.c (commutative_binary_op_p): New function.
1243 (commutative_ternary_op_p): Likewise.
1244 (first_commutative_argument): Likewise.
1245 (associative_binary_op_p): Likewise.
1246 (canonicalize_code): Likewise.
1247 (directly_supported_p): Likewise.
1248 (get_conditional_internal_fn): Likewise.
1249 * tree-vectorizer.h: Include gimple-match.h.
1250 (neutral_op_for_reduction): Take a code_helper instead of a tree_code.
1251 (needs_fold_left_reduction_p): Likewise.
1252 (reduction_fn_for_scalar_code): Likewise.
1253 (vect_can_vectorize_without_simd_p): Declare a nNew overload that takes
1255 * tree-vect-loop.c: Include case-cfn-macros.h.
1256 (fold_left_reduction_fn): Take a code_helper instead of a tree_code.
1257 (reduction_fn_for_scalar_code): Likewise.
1258 (neutral_op_for_reduction): Likewise.
1259 (needs_fold_left_reduction_p): Likewise.
1260 (use_mask_by_cond_expr_p): Likewise.
1261 (build_vect_cond_expr): Likewise.
1262 (vect_create_partial_epilog): Likewise. Use gimple_build rather
1263 than gimple_build_assign.
1264 (check_reduction_path): Handle calls and operate on code_helpers
1265 rather than tree_codes.
1266 (vect_is_simple_reduction): Likewise.
1267 (vect_model_reduction_cost): Likewise.
1268 (vect_find_reusable_accumulator): Likewise.
1269 (vect_create_epilog_for_reduction): Likewise.
1270 (vect_transform_cycle_phi): Likewise.
1271 (vectorizable_reduction): Likewise. Make more use of
1273 (vect_transform_reduction): Use gimple_extract_op but expect
1274 a tree_code for now.
1275 (vect_can_vectorize_without_simd_p): New overload that takes
1277 * tree-vect-stmts.c (vectorizable_call): Handle reductions in
1279 * tree-vect-patterns.c (vect_mark_pattern_stmts): Use
1280 gimple_extract_op when updating STMT_VINFO_REDUC_IDX.
1282 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1284 * gimple-match.h (code_helper): Provide == and != overloads.
1285 (code_helper::operator tree_code): Make explicit.
1286 (code_helper::operator combined_fn): Likewise.
1287 * gimple-match-head.c (convert_conditional_op): Use explicit
1288 conversions where necessary.
1289 (gimple_resimplify1, gimple_resimplify2, gimple_resimplify3): Likewise.
1290 (maybe_push_res_to_seq, gimple_simplify): Likewise.
1291 * gimple-fold.c (replace_stmt_with_simplification): Likewise.
1293 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
1295 * gimple-match.h (code_helper): Add functions for querying whether
1296 the code represents an internal_fn or a built_in_function.
1297 Provide explicit conversion operators for both cases.
1298 (gimple_extract_op): Declare.
1299 * gimple-match-head.c (gimple_extract): New function, extracted from...
1300 (gimple_simplify): ...here.
1301 (gimple_extract_op): New function.
1303 2021-11-30 Eric Botcazou <ebotcazou@adacore.com>
1306 * config/i386/i386.c (ix86_output_call_insn): Beef up comment about
1307 nops emitted with SEH.
1308 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
1309 the cold section, emit a nop before the directive if the previous
1310 active instruction is a call.
1312 2021-11-30 Roger Sayle <roger@nextmovesoftware.com>
1313 Uroš Bizjak <ubizjak@gmail.com>
1316 * config/i386/sse.md (define_split): New splitters to simplify
1317 not;vec_duplicate;and as vec_duplicate;andn.
1319 2021-11-30 Richard Biener <rguenther@suse.de>
1321 PR middle-end/103485
1322 * match.pd (-((int)x >> (prec - 1)) to (unsigned)x >> (prec - 1)):
1323 Use VIEW_CONVERT_EXPR for vectors.
1325 2021-11-30 Richard Biener <rguenther@suse.de>
1327 * cfgrtl.c (skip_insns_after_block): Refactor code to
1328 be more easily readable.
1329 * expr.c (op_by_pieces_d::run): Remove unreachable
1331 * sched-deps.c (sched_analyze): Remove unreachable
1333 * sel-sched-ir.c (in_same_ebb_p): Likewise.
1334 * tree-ssa-alias.c (nonoverlapping_refs_since_match_p):
1335 Remove unreachable code.
1336 * tree-vect-slp.c (vectorize_slp_instance_root_stmt):
1337 Refactor to avoid unreachable loop iteration.
1338 * tree.c (walk_tree_1): Remove unreachable break.
1339 * vec-perm-indices.c (vec_perm_indices::series_p): Remove
1342 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
1345 * config/rs6000/rs6000-call.c (rs6000_builtin_decl): Remove builtin mask
1348 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
1350 * config/rs6000/rs6000.c
1351 (rs6000_cost_data::update_target_cost_per_stmt): Adjust the way to
1352 compute extra penalized cost. Remove useless parameter.
1353 (rs6000_cost_data::rs6000_add_stmt_cost): Adjust the call to function
1354 update_target_cost_per_stmt.
1356 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
1359 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
1361 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
1362 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
1363 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
1364 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
1365 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
1366 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
1367 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
1368 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
1370 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
1372 PR tree-optimization/103467
1373 * gimple-range-gori.cc (range_def_chain::register_dependency): Don't
1374 use an object reference after a potential resize.
1376 2021-11-29 Uroš Bizjak <ubizjak@gmail.com>
1379 * config/i386/i386.md (*movhi_internal): Introduce (*v,C) alternative.
1380 Do not allocate non-GPR registers. Optimize xmm->xmm moves when
1381 optimizing for size. Fix vpinsrw insn template.
1382 (*movhf_internal): Fix pinsrw and pextrw insn templates for
1383 AVX targets. Use sselog1 type instead of sselog. Optimize GPR moves.
1384 Optimize xmm->xmm moves for TARGET_SSE_PARTIAL_REG_DEPENDENCY
1385 and TARGET_SSE_SPLIT_REGS targets.
1387 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
1389 * configure: Re-regenerate.
1391 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
1394 * Makefile.in: Substitute CTAGS, ETAGS, and CSCOPE
1395 variables. Use ETAGS variable in TAGS target.
1396 * configure: Regenerate.
1397 * configure.ac: Allow CTAGS, ETAGS, and CSCOPE
1398 variables to be overridden.
1400 2021-11-29 Paul A. Clarke <pc@us.ibm.com>
1402 * config/rs6000/xmmintrin.h (_mm_movemask_ps): Use vec_extractm
1404 * config/rs6000/emmintrin.h (_mm_movemask_pd): Likewise.
1405 (_mm_movemask_epi8): Likewise.
1407 2021-11-29 Richard Biener <rguenther@suse.de>
1409 * read-rtl-function.c (function_reader::read_rtx_operand):
1410 Return only after resetting m_in_call_function_usage.
1412 2021-11-29 Roger Sayle <roger@nextmovesoftware.com>
1414 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti): Perform the
1415 conversion via V2DImode using vec_extractv2didi on TARGET_SSE2.
1416 * config/i386/sse.md (rotlv1ti3, rotrv1ti3): Change constraint
1417 on QImode shift amounts from const_int_operand to general_operand.
1419 2021-11-29 Richard Biener <rguenther@suse.de>
1421 * tree.h (reverse_storage_order_for_component_p): Remove
1422 spurious gcc_unreachable.
1423 * cfganal.c (dfs_find_deadend): Likewise.
1424 * fold-const-call.c (fold_const_logb): Likewise.
1425 (fold_const_significand): Likewise.
1426 * gimple-ssa-store-merging.c (lhs_valid_for_store_merging_p):
1429 2021-11-29 Richard Biener <rguenther@suse.de>
1431 * vec.c (qsort_chk): Do not return the void return value
1432 from the noreturn qsort_chk_error.
1433 * ccmp.c (expand_ccmp_expr_1): Remove unreachable return.
1434 * df-scan.c (df_ref_equal_p): Likewise.
1435 * dwarf2out.c (is_base_type): Likewise.
1436 (add_const_value_attribute): Likewise.
1437 * fixed-value.c (fixed_arithmetic): Likewise.
1438 * gimple-fold.c (gimple_fold_builtin_fputs): Likewise.
1439 * gimple-ssa-strength-reduction.c (stmt_cost): Likewise.
1440 * graphite-isl-ast-to-gimple.c
1441 (gcc_expression_from_isl_expr_op): Likewise.
1442 (gcc_expression_from_isl_expression): Likewise.
1443 * ipa-fnsummary.c (will_be_nonconstant_expr_predicate):
1445 * lto-streamer-in.c (lto_input_mode_table): Likewise.
1447 2021-11-29 liuhongt <hongtao.liu@intel.com>
1450 * config/i386/i386.c (inline_secondary_memory_needed): HImode
1451 move between GPR and SSE registers is supported under
1452 TARGET_SSE2 and above.
1453 * config/i386/i386.md (extendhfsf2): Optimize expander.
1454 (truncsfhf2): Ditto.
1455 * config/i386/sse.md (sse2p4_1): Adjust attr for V8HFmode to
1456 align with V8HImode.
1458 2021-11-29 liuhongt <hongtao.liu@intel.com>
1462 * config/i386/i386.c (ix86_secondary_reload): Without
1463 TARGET_SSE4_1, General register is needed to move HImode from
1464 sse register to memory.
1465 * config/i386/sse.md (*vec_extrachf): Use %vpextrw instead of
1466 pextrw in output templates.
1467 * config/i386/i386.md (movhi_internal): Ditto, also fix typo of
1468 MEM_P (operands[1]) and adjust mode/prefix/type attribute for
1469 alternatives related to sse register.
1471 2021-11-29 Richard Biener <rguenther@suse.de>
1473 PR tree-optimization/103458
1474 * tree-ssa-dce.c (make_forwarders_with_degenerate_phis): Do not
1475 create forwarders for blocks with abnormal predecessors.
1477 2021-11-29 Richard Biener <rguenther@suse.de>
1479 * gimple-predicate-analysis.cc (can_be_invalidated_p):
1480 Restore semantics to the one before the split from
1483 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
1485 * profile.c: Include sreal.h
1486 (struct bb_stats): New.
1487 (cmp_stats): New function.
1488 (compute_branch_probabilities): Output bb stats.
1490 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
1492 * cfghooks.c: Include sreal.h, profile.h.
1493 (profile_record_check_consistency): Fix checking of count counsistency;
1494 record also dynamic mismatches.
1495 * cfgrtl.c (rtl_account_profile_record): Similarly.
1496 * tree-cfg.c (gimple_account_profile_record): Likewise.
1497 * cfghooks.h (struct profile_record): Remove num_mismatched_freq_in,
1498 num_mismatched_freq_out, turn time to double, add
1499 dyn_mismatched_prob_out, dyn_mismatched_count_in,
1500 num_mismatched_prob_out; remove num_mismatched_count_out.
1501 * passes.c (account_profile_1): New function.
1502 (account_profile_in_list): New function.
1503 (pass_manager::dump_profile_report): Rewrite.
1504 (execute_one_ipa_transform_pass): Check profile consistency after
1506 (execute_all_ipa_transforms): Remove cfun test; record all transform
1508 (execute_one_pass): Fix collecting of profile stats.
1510 2021-11-27 Jakub Jelinek <jakub@redhat.com>
1512 PR middle-end/103431
1513 * config/i386/i386.md (x86_64_shld, x86_shld, x86_64_shrd, x86_shrd):
1514 Change insn pattern to accurately describe the instructions.
1515 (*x86_64_shld_1, *x86_shld_1, *x86_64_shrd_1, *x86_shrd_1): New
1516 define_insn patterns.
1517 (*x86_64_shld_2, *x86_shld_2, *x86_64_shrd_2, *x86_shrd_2): New
1518 define_insn_and_split patterns.
1519 (*ashl<dwi>3_doubleword_mask, *ashl<dwi>3_doubleword_mask_1,
1520 *<insn><dwi>3_doubleword_mask, *<insn><dwi>3_doubleword_mask_1,
1521 ix86_rotl<dwi>3_doubleword, ix86_rotr<dwi>3_doubleword): Adjust
1522 splitters for x86_{,64_}sh{l,r}d pattern changes.
1524 2021-11-27 Jakub Jelinek <jakub@redhat.com>
1526 PR tree-optimization/103435
1527 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Avoid UB if
1528 n->range - rsize == 8, just clear both *cmpnop and *cmpxchg in that
1531 2021-11-27 Martin Jambor <mjambor@suse.cz>
1534 * ipa-prop.c (ipcp_transform_function): Call
1535 delete_unreachable_blocks_update_callgraph instead of returning
1538 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
1540 * tree-object-size.c (object_sizes_grow, object_sizes_release,
1541 object_sizes_unknown_p, object_sizes_get, object_size_set_force,
1542 object_sizes_set): New functions.
1543 (addr_object_size, compute_builtin_object_size,
1544 expr_object_size, call_object_size, unknown_object_size,
1545 merge_object_sizes, plus_stmt_object_size,
1546 cond_expr_object_size, collect_object_sizes_for,
1547 check_for_plus_in_loops_1, init_object_sizes,
1548 fini_object_sizes): Adjust.
1550 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
1552 * tree-object-size.c: New enum.
1553 (object_sizes, computed, addr_object_size,
1554 compute_builtin_object_size, expr_object_size, call_object_size,
1555 merge_object_sizes, plus_stmt_object_size,
1556 collect_object_sizes_for, init_object_sizes, fini_object_sizes,
1557 object_sizes_execute): Replace magic numbers with enums.
1559 2021-11-26 Roger Sayle <roger@nextmovesoftware.com>
1561 * tree-ssa-loop-ivopts.c (cand_value_at): Take a class
1562 tree_niter_desc* argument instead of just a tree for NITER.
1563 If we require the iv candidate value at the end of the final
1564 loop iteration, try using the original loop bound as the
1565 NITER for sufficiently simple loops.
1566 (may_eliminate_iv): Update (only) call to cand_value_at.
1568 2021-11-26 Jan Hubicka <hubicka@ucw.cz>
1570 * ipa-modref.c (analyze_function): Drop parameter F and dump
1571 cgraph node name rather than cfun name.
1572 (modref_generate): Update.
1573 (modref_summaries::insert):Update.
1574 (modref_summaries_lto::insert):Update.
1575 (pass_modref::execute):Update.
1576 (ipa_merge_modref_summary_after_inlining): Improve combining of
1579 2021-11-26 Jan Hubicka <jh@suse.cz>
1582 * ipa-modref.c (update_escape_summary_1): Fix handling of min_flags.
1584 2021-11-26 konglin1 <lingling.kong@intel.com>
1587 * config/i386/i386.c (ix86_can_change_mode_class): Allow 16 bit data in XMM register
1589 * config/i386/i386.md (extendhfsf2): Add extenndhfsf2 for TARGET_F16C.
1590 (extendhfdf2): Restrict extendhfdf for TARGET_AVX512FP16 only.
1591 (*extendhf<mode>2): Rename from extendhf<mode>2.
1592 (truncsfhf2): Likewise.
1593 (truncdfhf2): Likewise.
1594 (*trunc<mode>2): Likewise.
1596 2021-11-26 liuhongt <hongtao.liu@intel.com>
1598 PR middle-end/103419
1599 * match.pd: Fix typo, use the type of second parameter, not
1602 2021-11-25 Jan Hubicka <jh@suse.cz>
1604 * ipa-cp.c (ipa_value_range_from_jfunc): Remove forgotten early return.
1606 2021-11-25 Roger Sayle <roger@nextmovesoftware.com>
1608 PR middle-end/103406
1609 * match.pd (minus @0 @0): Check tree_expr_maybe_infinite_p.
1611 2021-11-25 Martin Jambor <mjambor@suse.cz>
1614 * ipa-prop.h (ipa_get_param): New overload. Move bits of the existing
1616 * ipa-param-manipulation.h (ipa_param_adjustments): New member
1617 function get_updated_index_or_split.
1618 * ipa-param-manipulation.c
1619 (ipa_param_adjustments::get_updated_index_or_split): New function.
1620 * ipa-prop.c (adjust_agg_replacement_values): Reimplement, add
1621 capability to identify scalarized parameters and perform substitution
1623 (ipcp_transform_function): Create descriptors earlier, handle new
1624 return values of adjust_agg_replacement_values.
1626 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1628 * gimple-range-path.cc (path_range_query::compute_ranges_defined): Remove.
1629 (path_range_query::compute_ranges_in_block): Revert to bitmap order.
1630 * gimple-range-path.h: Remove compute_ranges_defined.
1632 2021-11-25 Andrew Stubbs <ams@codesourcery.com>
1635 * config/gcn/gcn.c (move_callee_saved_registers): Ensure that the
1636 number of spilled registers is counted correctly.
1638 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
1640 * ipa-prop.h (ipa_node_params::ipa_node_params): Initialize
1643 2021-11-25 Andrew MacLeod <amacleod@redhat.com>
1645 PR tree-optimization/103359
1646 * gimple-range-fold.cc (fold_using_range::range_of_phi): If arg is
1647 equivalent to def, don't initially include it's range.
1649 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
1651 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not check
1652 gimple_static_chain.
1654 2021-11-25 Richard Biener <rguenther@suse.de>
1656 * cgraphunit.c (symbol_table::output_weakrefs): Remove
1658 (get_alias_symbol): Remove now unused function.
1660 2021-11-25 Richard Biener <rguenther@suse.de>
1662 * cfgrtl.c (rtl_verify_fallthru): Do not stop verifying
1664 (skip_insns_after_block): Remove unreachable break and continue.
1666 2021-11-25 Richard Biener <rguenther@suse.de>
1668 * cfgexpand.c (label_rtx_for_bb): Remove dead loop construct.
1670 2021-11-25 Richard Biener <rguenther@suse.de>
1672 * regset.h (REG_SET_EMPTY_P): New macro.
1673 * cfgcleanup.c (thread_jump): Use REG_SET_EMPTY_P.
1675 2021-11-25 Martin Liska <mliska@suse.cz>
1677 * doc/invoke.texi: Use @option for -Wuninitialized.
1679 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1681 PR tree-optimization/103254
1682 * gimple-range-path.cc (path_range_query::compute_ranges): Move
1683 exported boolean code...
1684 (path_range_query::compute_imports): ...here.
1686 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1688 PR tree-optimization/103254
1689 * gimple-range-path.cc (path_range_query::compute_ranges_defined): New
1690 (path_range_query::compute_ranges_in_block): Move to
1691 compute_ranges_defined.
1692 * gimple-range-path.h (compute_ranges_defined): New.
1694 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1696 PR tree-optimization/103417
1697 * match.pd ((X & Y) CMP 0): Only handle eq and ne. Commonalize
1700 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1702 PR tree-optimization/103376
1703 * gimple-ssa-store-merging.c (perform_symbolic_merge): For
1704 BIT_IOR_EXPR, if masked1 && masked2 && masked1 != masked2, don't
1705 punt, but set the corresponding result byte to MARKER_BYTE_UNKNOWN.
1706 For BIT_XOR_EXPR similarly and if masked1 == masked2 and the
1707 byte isn't MARKER_BYTE_UNKNOWN, set the corresponding result byte to
1710 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1713 * doc/invoke.texi (-Wcomma-subscript): Document that for
1714 -std=c++20 the option isn't enabled by default with -Wno-deprecated
1715 but for -std=c++23 it is.
1717 2021-11-24 Jan Hubicka <hubicka@ucw.cz>
1719 * ipa-modref.c (implicit_eaf_flags_for_edge_and_arg): Break out from...
1720 (modref_merge_call_site_flags): ... here.
1721 (ipa_merge_modref_summary_after_inlining): Use it.
1723 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
1725 * cfgloop.c (verify_loop_structure): Reduce scope of
1726 'class loop *loop' variable.
1727 * ipa-fnsummary.c (analyze_function_body): Likewise.
1728 * loop-init.c (fix_loop_structure): Likewise.
1729 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
1730 * predict.c (predict_loops): Likewise.
1731 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
1732 * tree-vectorizer.c (pass_vectorize::execute): Likewise.
1734 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1736 PR tree-optimization/103231
1737 * gimple-range.cc (gimple_ranger::gimple_ranger): Create stmt stack.
1738 (gimple_ranger::gimple_ranger): Delete stmt stack.
1739 (gimple_ranger::range_of_stmt): Process depenedencies if they have no
1741 (gimple_ranger::prefill_name): New.
1742 (gimple_ranger::prefill_stmt_dependencies): New.
1743 * gimple-range.h (class gimple_ranger): Add prototypes.
1745 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1747 * gimple-range-cache.cc (ranger_cache::get_global_range): Always
1748 return a range, return if it came from the cache or not.
1749 (get_non_stale_global_range): Rename to get_global_range, and return
1750 the temporal state in a flag.
1751 * gimple-range-cache.h (get_non_stale_global_range): Rename and adjust.
1752 * gimple-range.cc (gimple_ranger::range_of_expr): No need to query
1754 (gimple_ranger::range_of_stmt): Adjust for global cache temporal state
1757 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1759 * gimple-range.cc (gimple_ranger::range_on_edge): Call trailer when
1760 a constant is encountered to terminate the trace.
1762 2021-11-24 Maciej W. Rozycki <macro@embecosm.com>
1764 PR middle-end/103059
1765 * reload.c (find_reloads_address_1): Also accept the ASHIFT form
1766 of indexed addressing.
1767 (find_reloads): Adjust accordingly.
1769 2021-11-24 Richard Biener <rguenther@suse.de>
1770 Jan Hubicka <jh@suse.cz>
1772 PR tree-optimization/103168
1773 * ipa-modref.h (struct modref_summary): Add load_accesses.
1774 * ipa-modref.c (modref_summary::finalize): Initialize load_accesses.
1775 * tree-ssa-sccvn.c (visit_reference_op_call): Use modref
1776 info to walk the virtual use->def chain to CSE const/pure
1777 function calls possibly reading from memory.
1779 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
1781 PR middle-end/103244
1782 * gimplify.c (gimplify_adjust_omp_clauses): Restore previous
1785 2021-11-24 Richard Biener <rguenther@suse.de>
1787 PR middle-end/103193
1788 * match.pd: Avoid canonicalizing (le/ge @0 @0) to (eq @0 @0)
1789 with NaNs and -ftrapping-math.
1791 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1793 PR middle-end/103384
1794 * omp-general.c (omp_context_selector_matches): For ACCEL_COMPILER,
1795 return 0 for kind(host) and continue for kind(nohost).
1797 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1799 PR middle-end/103365
1800 * attribs.h (lookup_attribute): Allow attr_name to start with
1801 underscore, as long as canonicalize_attr_name returns false.
1802 (lookup_attribute_by_prefix): Don't call get_attribute_name twice.
1803 * attribs.c (extract_attribute_substring): Reimplement using
1804 canonicalize_attr_name.
1805 (register_scoped_attribute): Change gcc_assert into
1806 gcc_checking_assert, verify !canonicalize_attr_name rather than
1807 that str.str doesn't start with '_'.
1809 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1811 PR tree-optimization/103376
1812 * gimple-ssa-store-merging.c (perform_symbolic_merge): Add CODE
1813 argument. If CODE is not BIT_IOR_EXPR, ensure that one of masked1
1815 (find_bswap_or_nop_1, find_bswap_or_nop,
1816 imm_store_chain_info::try_coalesce_bswap): Adjust
1817 perform_symbolic_merge callers.
1819 2021-11-24 Richard Biener <rguenther@suse.de>
1821 * tree-ssa-loop-ivopts.c (find_givs): Take loop body as
1822 argument instead of re-computing it.
1823 (find_interesting_uses): Likewise.
1824 (find_induction_variables): Pass through loop body.
1825 (tree_ssa_iv_optimize_loop): Pass down loop body.
1827 2021-11-24 Tamar Christina <tamar.christina@arm.com>
1829 * tree-ssa-phiopt.c (spaceship_replacement): Handle new canonical
1832 2021-11-24 Tamar Christina <tamar.christina@arm.com>
1834 * tree.c (bitmask_inv_cst_vector_p): New.
1835 * tree.h (bitmask_inv_cst_vector_p): New.
1836 * match.pd: Use it in new bitmask compare pattern.
1838 2021-11-24 Jason Merrill <jason@redhat.com>
1840 * timevar.h (class auto_cond_timevar): New.
1842 2021-11-24 Hongtao Liu <hongtao.liu@intel.com>
1843 H.J. Lu <hongjiu.lu@intel.com>
1845 PR tree-optimization/103194
1846 * match.pd (gimple_nop_atomic_bit_test_and_p): Extended to
1848 * tree-ssa-ccp.c (gimple_nop_convert): Declare.
1849 (optimize_atomic_bit_test_and): Enhance
1850 optimize_atomic_bit_test_and to handle truncation.
1852 2021-11-23 Martin Sebor <msebor@redhat.com>
1855 * Makefile.in (OBJS): Add gimple-warn-recursion.o.
1856 * common.opt: Add -Winfinite-recursion.
1857 * doc/invoke.texi (-Winfinite-recursion): Document.
1858 * passes.def (pass_warn_recursion): Schedule a new pass.
1859 * tree-pass.h (make_pass_warn_recursion): Declare.
1860 * gimple-warn-recursion.c: New file.
1862 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
1864 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Change
1865 error messages for ENB_P8V and ENB_P9V.
1867 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
1869 * config/rs6000/rs6000-builtin-new.def: Add power6-64 stanza. Move
1870 CMPB to power6-64 stanza.
1871 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Handle
1873 (rs6000_new_builtin_is_supported): Likewise.
1874 (rs6000_expand_new_builtin): Likewise. Clean up formatting.
1875 (rs6000_init_builtins): Handle ENB_P6_64 case.
1876 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): Add BSTZ_P6_64.
1877 (stanza_map): Add entry mapping power6-64 to BSTZ_P6_64.
1878 (enable_string): Add "ENB_P6_64".
1879 (write_decls): Add ENB_P6_64 to bif_enable enum.
1881 2021-11-23 Jan Hubicka <jh@suse.cz>
1883 * ipa-modref-tree.h (struct modref_tree): Remove max_bases, max_refs
1885 (modref_tree::modref_tree): Remove parametr.
1886 (modref_tree::insert_base): Add max_bases parameter.
1887 (modref_tree::insert): Add max_bases, max_refs, max_accesses
1889 (modref_tree::insert): New member function.
1890 (modref_tree::merge): Add max_bases, max_refs, max_accesses
1892 (modref_tree::insert): New member function.
1893 * ipa-modref-tree.c (test_insert_search_collapse): Update.
1894 (test_merge): Update.
1895 * ipa-modref.c (dump_records): Don't dump max_refs and max_bases.
1896 (dump_lto_records): Likewise.
1897 (modref_summary::finalize): Fix whitespace.
1898 (get_modref_function_summary): Likewise.
1899 (modref_access_analysis::record_access): Update.
1900 (modref_access_analysis::record_access_lto): Update.
1901 (modref_access_analysis::process_fnspec): Update.
1902 (analyze_function): Update.
1903 (modref_summaries::duplicate): Update.
1904 (modref_summaries_lto::duplicate): Update.
1905 (write_modref_records): Update.
1906 (read_modref_records): Update.
1907 (read_section): Update.
1908 (propagate_unknown_call): Update.
1909 (modref_propagate_in_scc): Update.
1910 (ipa_merge_modref_summary_after_inlining): Update.
1912 2021-11-23 Martin Liska <mliska@suse.cz>
1914 * doc/invoke.texi: Remove 2 more duplicite param descriptions.
1916 2021-11-23 Richard Biener <rguenther@suse.de>
1918 PR tree-optimization/103361
1919 * gimple-loop-jam.c (adjust_unroll_factor): Use lambda_int
1920 for the dependence distance.
1921 * tree-data-ref.c (print_lambda_vector): Properly print a lambda_int.
1923 2021-11-23 Jakub Jelinek <jakub@redhat.com>
1925 * tree-inline.h (struct copy_body_data): Remove
1926 transform_lang_insert_block member.
1927 * tree-inline.c (remap_block): Don't call
1928 id->transform_lang_insert_block.
1929 (optimize_inline_calls, copy_gimple_seq_and_replace_locals,
1930 tree_function_versioning, maybe_inline_call_in_expr,
1931 copy_fn): Don't initialize id.transform_lang_insert_block.
1932 * gimplify.c (gimplify_omp_loop): Likewise.
1934 2021-11-23 Jan Hubicka <hubicka@ucw.cz>
1936 PR tree-optimization/103335
1937 * tree-ssa-dse.c (valid_ao_ref_for_dse): Rename to ...
1938 (valid_ao_ref_kill_for_dse): ... this; do not check that boundaries
1939 are divisible by BITS_PER_UNIT.
1940 (get_byte_aligned_range_containing_ref): New function.
1941 (get_byte_aligned_range_contained_in_ref): New function.
1942 (normalize_ref): Rename to ...
1943 (get_byte_range): ... this one; handle accesses not aligned to byte
1944 boundary; return range in bytes rater than updating ao_ref.
1945 (clear_live_bytes_for_ref): Take write ref by reference; simplify using
1947 (setup_live_bytes_from_ref): Likewise.
1948 (clear_bytes_written_by): Update.
1949 (live_bytes_read): Update.
1950 (dse_classify_store): Simplify tech before live_bytes_read checks.
1952 2021-11-23 Andrew Pinski <apinski@marvell.com>
1954 PR tree-optimization/102216
1955 * gimple-fold.c (fold_stmt_1): Add canonicalization
1956 of "&MEM[ssa_n, CST]" to "ssa_n p+ CST", note this
1957 can only be done if !in_place.
1959 2021-11-23 Jakub Jelinek <jakub@redhat.com>
1961 PR middle-end/102431
1962 * gimplify.c (replace_reduction_placeholders): Remove.
1963 (note_no_context_vars): New function.
1964 (gimplify_omp_loop): For OMP_PARALLEL's BIND_EXPR create a new
1965 BLOCK. Use copy_tree_body_r with walk_tree instead of unshare_expr
1966 and replace_reduction_placeholders for duplication of
1967 OMP_CLAUSE_REDUCTION_{INIT,MERGE} expressions. Ensure all mentioned
1968 automatic vars have DECL_CONTEXT set to non-NULL before doing so
1969 and reset it afterwards for those vars and their corresponding
1972 2021-11-23 Haochen Gui <guihaoc@gcc.gnu.org>
1975 * config/rs6000/altivec.md (altivec_vreve<mode>2 for VEC_K): Use
1976 xxbrq for v16qi, xxbrq + xxbrh for v8hi and xxbrq + xxbrw for v4si
1977 or v4sf when p9_vector is set.
1978 (altivec_vreve<mode>2 for VEC_64): Defined. Implemented by xxswapd.
1980 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
1982 PR tree-optimization/102232
1983 * match.pd (x * (1 + y / x) - y) -> (x - y % x): New optimization.
1985 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
1987 PR tree-optimization/96779
1988 * match.pd (-x == x) -> (x == 0): New optimization.
1990 2021-11-22 Roger Sayle <roger@nextmovesoftware.com>
1992 PR tree-optimization/98953
1993 PR tree-optimization/103345
1994 * gimple-ssa-store-merging.c (find_bswap_or_nop_1): Handle
1995 BIT_XOR_EXPR and PLUS_EXPR the same as BIT_IOR_EXPR.
1996 (pass_optimize_bswap::execute): Likewise.
1998 2021-11-22 Martin Liska <mliska@suse.cz>
2000 * doc/invoke.texi: Remove duplicate documentation for 3 params.
2002 2021-11-22 Richard Biener <rguenther@suse.de>
2004 PR tree-optimization/103351
2005 * tree-ssa-dce.c (sort_phi_args): Sort after e->dest_idx as
2008 2021-11-22 liuhongt <hongtao.liu@intel.com>
2011 * config/i386/constraints.md (Bk): New
2012 define_memory_constraint.
2013 * config/i386/i386-protos.h (ix86_gpr_tls_address_pattern_p):
2015 * config/i386/i386.c (ix86_gpr_tls_address_pattern_p): New
2017 * config/i386/i386.md (*movsi_internal): Don't allow
2018 mask/sse/mmx move in TLS code sequences.
2019 (*movdi_internal): Ditto.
2021 2021-11-22 Kewen Lin <linkw@linux.ibm.com>
2023 * config/xtensa/xtensa.md (movdi_internal, movdf_internal): Fix split
2026 2021-11-21 Jakub Jelinek <jakub@redhat.com>
2029 * config/i386/i386-options.c (ix86_valid_target_attribute_p): If
2030 fndecl already has DECL_FUNCTION_SPECIFIC_TARGET, use that as base
2031 instead of target_option_default_node.
2033 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
2036 * ipa-modref.c (parm_map_for_arg): Rename to ...
2037 (parm_map_for_ptr): .. this one; handle static chain and calls to
2039 (modref_access_analysis::get_access): Use parm_map_for_ptr.
2040 (modref_access_analysis::process_fnspec): Update.
2041 (modref_access_analysis::analyze_load): Update.
2042 (modref_access_analysis::analyze_store): Update.
2044 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
2046 * ipa-modref.c (ignore_nondeterminism_p): Move earlier in source
2048 (ignore_retval_p): Likewise.
2049 (ignore_stores_p): Likewise.
2050 (parm_map_for_arg): Likewise.
2051 (class modref_access_analysis): New class.
2052 (modref_access_analysis::set_side_effects): New member function.
2053 (modref_access_analysis::set_nondeterministic): New member function.
2054 (get_access): Turn to ...
2055 (modref_access_analysis::get_access): ... this one.
2056 (record_access): Turn to ...
2057 (modref_access_analysis::record_access): ... this one.
2058 (record_access_lto): Turn to ...
2059 (modref_access_analysis::record_access_lto): ... This one.
2060 (record_access_p): Turn to ...
2061 (modref_access_analysis::record_access_p): ... This one
2062 (modref_access_analysis::record_unknown_load): New member function.
2063 (modref_access_analysis::record_unknown_store): New member function.
2064 (get_access_for_fnspec): Turn to ...
2065 (modref_access_analysis::get_access_for_fnspec): ... this one.
2066 (merge_call_side_effects): Turn to ...
2067 (moderf_access_analysis::merge_call_side_effects): Turn to ...
2068 (collapse_loads): Move later in source code.
2069 (collapse_stores): Move later in source code.
2070 (process_fnspec): Turn to ...
2071 (modref_access_analysis::process_fnspec): ... this one.
2072 (analyze_call): Turn to ...
2073 (modref_access_analysis::analyze_call): ... this one.
2074 (struct summary_ptrs): Remove.
2075 (analyze_load): Turn to ...
2076 (modref_access_analysis::analyze_load): ... this one.
2077 (analyze_store): Turn to ...
2078 (modref_access_analysis::analyze_store): ... this one.
2079 (analyze_stmt): Turn to ...
2080 (modref_access_analysis::analyze_stmt): ... This one.
2081 (remove_summary): Remove.
2082 (modref_access_analysis::propagate): Break out from ...
2083 (modref_access_analysis::analyze): Break out from ...
2084 (analyze_function): ... here.
2086 2021-11-21 Roger Sayle <roger@nextmovesoftware.com>
2087 Robin Dapp <rdapp@linux.ibm.com>
2090 * tree-ssa-math-opts.c (convert_mult_to_widen): Recognize
2091 signed WIDEN_MULT_EXPR if the target supports umul_widen_optab.
2093 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
2096 * ipa-modref.c (ignore_nondeterminism_p): Allow looping pure/cont.
2097 (merge_call_side_effects): Improve debug output.
2099 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
2102 * ipa-pure-const.c (propagate_pure_const): Fix merging of loping flag.
2104 2021-11-20 Jeff Law <jeffreyalaw@gmail.com>
2106 PR tree-optimization/103226
2107 * config/bfin/bfin.md (doloop pattern, splitter and expander): Clobber
2110 2021-11-20 Andrew Pinski <apinski@marvell.com>
2112 PR tree-optimization/103220
2113 * match.pd ((type) X bitop CST): Don't check if CST
2114 fits into the type if only the sign changes.
2116 2021-11-20 Alexandre Oliva <oliva@adacore.com>
2118 PR tree-optimization/102988
2119 * gimple-harden-conditionals.cc (detach_value): Copy SSA_NAME
2120 without decl sharing.
2122 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
2124 * doc/invoke.texi: Remove whitespace after an @option.
2126 2021-11-19 Paul A. Clarke <pc@us.ibm.com>
2128 * config/rs6000/emmintrin.h (_mm_sad_epu8): Use vec_absd when
2129 _ARCH_PWR9, optimize vec_sum2s when LE.
2131 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
2134 * config/darwin-driver.c (darwin_driver_init): Handle exported
2135 symbols and symbol lists (suppress automatic export of the TLS
2137 * config/darwin.c (darwin_rename_builtins): Remove workaround.
2138 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): Likewise.
2139 (REAL_LIBGCC_SPEC): Handle revised library uses.
2140 * config/darwin.opt (nodefaultexport): New.
2141 * config/i386/darwin.h (PR80556_WORKAROUND): Remove.
2142 * config/i386/darwin32-biarch.h (PR80556_WORKAROUND): Likewise.
2143 * config/i386/darwin64-biarch.h (PR80556_WORKAROUND): Likewise.
2145 2021-11-19 Martin Jambor <mjambor@suse.cz>
2147 * opts.c (default_options_table): Switch off
2148 flag_semantic_interposition at Ofast.
2149 * doc/invoke.texi (Optimize Options): Document that Ofast switches off
2150 -fsemantic-interposition.
2152 2021-11-19 Jan Hubicka <hubicka@ucw.cz>
2154 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not guard modref
2155 by !gimple_call_chain.
2157 2021-11-19 Martin Sebor <msebor@redhat.com>
2161 * doc/invoke.texi (-Waddress): Update.
2163 2021-11-19 Andrew MacLeod <amacleod@redhat.com>
2165 PR tree-optimization/103254
2166 * gimple-range-gori.cc (range_def_chain::get_def_chain): Limit the
2167 depth for all statements with multple ssa names.
2169 2021-11-19 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
2171 * config/s390/s390.md (define_peephole2): Variable insn points
2172 to the first matched insn. Use peep2_next_insn(1) to refer to
2173 the second matched insn.
2175 2021-11-19 Tamar Christina <tamar.christina@arm.com>
2177 PR tree-optimization/103311
2179 * tree-vect-slp-patterns.c (vect_validate_multiplication): Fix CONJ
2180 test to new codegen.
2181 (complex_mul_pattern::matches): Move check downwards.
2183 2021-11-19 Martin Liska <mliska@suse.cz>
2186 2021-11-19 Martin Liska <mliska@suse.cz>
2188 * cfgexpand.c (pass_expand::execute): Use option directly.
2189 * function.c (allocate_struct_function): Likewise.
2190 * gimple-low.c (lower_function_body): Likewise.
2191 (lower_stmt): Likewise.
2192 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
2193 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
2194 * ipa-split.c (split_function): Likewise.
2195 * lto-streamer-in.c (input_function): Likewise.
2196 * sese.c (sese_insert_phis_for_liveouts): Likewise.
2197 * ssa-iterators.h (num_imm_uses): Likewise.
2198 * tree-cfg.c (make_blocks): Likewise.
2199 (gimple_merge_blocks): Likewise.
2200 * tree-inline.c (tree_function_versioning): Likewise.
2201 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
2202 * tree-sra.c (analyze_access_subtree): Likewise.
2203 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
2204 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
2205 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
2206 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
2207 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
2208 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
2209 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
2210 (target_for_debug_bind): Likewise.
2211 (insert_debug_temp_for_var_def): Likewise.
2212 (insert_debug_temps_for_defs): Likewise.
2213 (reset_debug_uses): Likewise.
2214 * tree-ssanames.c (release_ssa_name_fn): Likewise.
2215 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
2216 (adjust_debug_stmts): Likewise.
2217 (adjust_phi_and_debug_stmts): Likewise.
2218 (vect_do_peeling): Likewise.
2219 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
2220 (vect_transform_loop): Likewise.
2221 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
2222 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
2223 (MAY_HAVE_DEBUG_STMTS): Use options directly.
2225 2021-11-19 Giuliano Belinassi <gbelinassi@suse.de>
2227 * gcc.c (process_command): Skip dumpdir override if file is a
2229 * doc/invoke.texi: Update -dumpdir documentation.
2231 2021-11-19 Andrew Pinski <apinski@marvell.com>
2233 PR tree-optimization/103314
2234 * match.pd ((type) X op CST): Restrict the equal
2235 TYPE_PRECISION case to GIMPLE only.
2237 2021-11-19 Martin Liska <mliska@suse.cz>
2240 * ipa-modref-tree.h (struct modref_parm_map): Add default
2242 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Use it.
2244 2021-11-19 Richard Biener <rguenther@suse.de>
2246 PR middle-end/103248
2247 * tree-eh.c (operation_could_trap_helper_p): Properly handle
2248 fixed-point RDIV_EXPR.
2250 2021-11-19 Richard Biener <rguenther@suse.de>
2252 PR tree-optimization/102436
2253 * tree-ssa-loop-im.c (execute_sm_if_changed): Add mode
2254 to just create the if structure and return the then block.
2255 (execute_sm): Add flag to indicate the var will re-use
2257 (hoist_memory_references): Support a single conditional
2258 block with all stores as special case.
2260 2021-11-19 Andrew Pinski <apinski@marvell.com>
2262 PR tree-optimization/103317
2263 * tree-ssa-phiopt.c (minmax_replacement): For the non empty
2264 middle bb case, check to make sure it has a single predecessor.
2266 2021-11-19 Andrew Pinski <apinski@marvell.com>
2268 PR tree-optimization/103257
2270 ((m1 >/</>=/<= m2) * d -> (m1 >/</>=/<= m2) ? d : 0):
2271 Disable until !canonicalize_math_p ().
2273 2021-11-19 Marek Polacek <polacek@redhat.com>
2277 * doc/invoke.texi: Update documentation for -Wuninitialized.
2278 * tree.c (stabilize_reference): Set location.
2280 2021-11-19 liuhongt <hongtao.liu@intel.com>
2283 * config/i386/x86-tune-costs.h (skylake_cost): Reduce cost of
2284 storing 256/512-bit SSE register to be equal to cost of
2285 unaligned store to avoid odd alignment peeling.
2286 (icelake_cost): Ditto.
2288 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
2290 * config/rs6000/predicates.md (current_file_function_operand):
2291 Add flag_semantic_interposition to call of decl_replaceable_p.
2293 2021-11-18 Martin Liska <mliska@suse.cz>
2295 * ipa-modref.c (analyze_function): Do not execute the code
2296 only if dump_file != NULL.
2298 2021-11-18 Martin Liska <mliska@suse.cz>
2300 * ipa-modref.c (analyze_function): Use fnode instead of repeated
2301 cgraph_node::get (current_function_decl).
2303 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
2305 * cgraph.c (cgraph_node::get_availability): Update call of
2307 (cgraph_node::verify_node): Verify that semantic_interposition flag
2309 * cgraph.h: (symtab_node): Add semantic_interposition flag.
2310 * cgraphclones.c (set_new_clone_decl_and_node_flags): Clear
2311 semantic_interposition flag.
2312 * cgraphunit.c (cgraph_node::finalize_function): Set
2313 semantic_interposition flag.
2314 (cgraph_node::add_new_function): Likewise.
2315 (varpool_node::finalize_decl): Likewise.
2316 (cgraph_node::create_wrapper): Likewise.
2317 * common.opt (fsemantic-interposition): Turn to optimization node.
2318 * lto-cgraph.c (lto_output_node): Stream semantic_interposition.
2319 (lto_output_varpool_node): Likewise.
2320 (input_overwrite_node): Likewise.
2321 (input_varpool_node): Likewise.
2322 * symtab.c (symtab_node::dump_base): Dump new flag.
2323 * varasm.c (decl_replaceable_p): Add semantic_interposition_p
2325 * varasm.h (decl_replaceable_p): Update declaration.
2326 * varpool.c (varpool_node::ctor_useable_for_folding_p):
2327 Use semantic_interposition flag.
2328 (varpool_node::get_availability): Likewise.
2329 (varpool_node::create_alias): Copy semantic_interposition flag.
2331 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
2334 * ipa-modref.c (modref_eaf_analysis::merge_call_lhs_flags): Unused
2335 parameter may still be returned.
2336 (modref_eaf_analysis::analyze_ssa_name): Call merge_call_lhs_flags
2337 even for unused function args.
2339 2021-11-18 Tamar Christina <tamar.christina@arm.com>
2341 PR tree-optimization/103311
2342 * tree-vect-slp-patterns.c (complex_mul_pattern::matches,
2343 complex_fms_pattern::matches): Check for multiplications.
2345 2021-11-18 H.J. Lu <hjl.tools@gmail.com>
2348 * config/i386/i386.c (ix86_output_jmp_thunk_or_indirect): Emit
2349 CS prefix for -mindirect-branch-cs-prefix.
2350 (ix86_output_indirect_branch_via_reg): Likewise.
2351 * config/i386/i386.opt: Add -mindirect-branch-cs-prefix.
2352 * doc/invoke.texi: Document -mindirect-branch-cs-prefix.
2354 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
2356 * config/rs6000/driver-rs6000.c (detect_processor_aix): Add
2359 2021-11-18 Tamar Christina <tamar.christina@arm.com>
2361 PR tree-optimization/103253
2362 * tree-ssa-math-opts.c (convert_mult_to_fma): Check for LHS.
2364 2021-11-18 Matthias Kretz <m.kretz@gsi.de>
2366 * doc/extend.texi: Document __builtin_assoc_barrier.
2368 2021-11-18 Martin Liska <mliska@suse.cz>
2370 * cfgexpand.c (pass_expand::execute): Use option directly.
2371 * function.c (allocate_struct_function): Likewise.
2372 * gimple-low.c (lower_function_body): Likewise.
2373 (lower_stmt): Likewise.
2374 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
2375 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
2376 * ipa-split.c (split_function): Likewise.
2377 * lto-streamer-in.c (input_function): Likewise.
2378 * sese.c (sese_insert_phis_for_liveouts): Likewise.
2379 * ssa-iterators.h (num_imm_uses): Likewise.
2380 * tree-cfg.c (make_blocks): Likewise.
2381 (gimple_merge_blocks): Likewise.
2382 * tree-inline.c (tree_function_versioning): Likewise.
2383 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
2384 * tree-sra.c (analyze_access_subtree): Likewise.
2385 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
2386 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
2387 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
2388 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
2389 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
2390 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
2391 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
2392 (target_for_debug_bind): Likewise.
2393 (insert_debug_temp_for_var_def): Likewise.
2394 (insert_debug_temps_for_defs): Likewise.
2395 (reset_debug_uses): Likewise.
2396 * tree-ssanames.c (release_ssa_name_fn): Likewise.
2397 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
2398 (adjust_debug_stmts): Likewise.
2399 (adjust_phi_and_debug_stmts): Likewise.
2400 (vect_do_peeling): Likewise.
2401 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
2402 (vect_transform_loop): Likewise.
2403 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
2404 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
2405 (MAY_HAVE_DEBUG_STMTS): Use options directly.
2407 2021-11-18 Richard Biener <rguenther@suse.de>
2409 PR tree-optimization/103277
2410 * tree-ssa-dse.c (need_ab_cleanup): New.
2411 (dse_optimize_redundant_stores): Adjust.
2412 (delete_dead_or_redundant_assignment): Get extra
2413 need_ab_cleanup argument and set when abnormal cleanup is
2415 (dse_optimize_call): Adjust.
2416 (dse_optimize_stmt): Likewise.
2417 (pass_dse::execute): Allocate and deallocate need_ab_cleanup.
2418 Perform abnormal cleanup.
2419 * tree-ssa-dse.h (delete_dead_or_redundant_assignment): Adjust.
2421 2021-11-18 Hongyu Wang <hongyu.wang@intel.com>
2423 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
2424 Adjust generated cfg to avoid infinite loop.
2426 2021-11-18 konglin1 <lingling.kong@intel.com>
2428 * config/i386/avx512fp16intrin.h (_mm512_mul_pch): Add alias for _mm512_fmul_pch.
2429 (_mm512_mask_mul_pch): Likewise.
2430 (_mm512_maskz_mul_pch): Likewise.
2431 (_mm512_mul_round_pch): Likewise.
2432 (_mm512_mask_mul_round_pch): Likewise.
2433 (_mm512_maskz_mul_round_pch): Likewise.
2434 (_mm512_cmul_pch): Likewise.
2435 (_mm512_mask_cmul_pch): Likewise.
2436 (_mm512_maskz_cmul_pch): Likewise.
2437 (_mm512_cmul_round_pch): Likewise.
2438 (_mm512_mask_cmul_round_pch): Likewise.
2439 (_mm512_maskz_cmul_round_pch): Likewise.
2440 (_mm_mul_sch): Likewise.
2441 (_mm_mask_mul_sch): Likewise.
2442 (_mm_maskz_mul_sch): Likewise.
2443 (_mm_mul_round_sch): Likewise.
2444 (_mm_mask_mul_round_sch): Likewise.
2445 (_mm_maskz_mul_round_sch): Likewise.
2446 (_mm_cmul_sch): Likewise.
2447 (_mm_mask_cmul_sch): Likewise.
2448 (_mm_maskz_cmul_sch): Likewise.
2449 (_mm_cmul_round_sch): Likewise.
2450 (_mm_mask_cmul_round_sch): Likewise.
2451 (_mm_maskz_cmul_round_sch): Likewise.
2452 * config/i386/avx512fp16vlintrin.h (_mm_mul_pch): Likewise.
2453 (_mm_mask_mul_pch): Likewise.
2454 (_mm_maskz_mul_pch): Likewise.
2455 (_mm256_mul_pch): Likewise.
2456 (_mm256_mask_mul_pch): Likewise.
2457 (_mm256_maskz_mul_pch): Likewise.
2458 (_mm_cmul_pch): Likewise.
2459 (_mm_mask_cmul_pch): Likewise.
2460 (_mm_maskz_cmul_pch): Likewise.
2461 (_mm256_cmul_pch): Likewise.
2462 (_mm256_mask_cmul_pch): Likewise.
2463 (_mm256_maskz_cmul_pch): Likewise.
2465 2021-11-17 Andrew Pinski <apinski@marvell.com>
2467 PR tree-optimization/103228
2468 PR tree-optimization/55177
2469 * match.pd ((type) X bitop CST): Also do this
2470 transformation for nop conversions.
2472 2021-11-17 Martin Sebor <msebor@redhat.com>
2474 PR tree-optimization/102759
2475 * gimple-array-bounds.cc (build_printable_array_type): Move...
2476 * gimple-ssa-warn-access.cc (build_printable_array_type): Avoid
2477 pathological function redeclarations that remove a previously
2479 Improve formatting of function arguments in informational notes.
2480 * pointer-query.cc (build_printable_array_type): ...to here.
2481 * pointer-query.h (build_printable_array_type): Declared.
2483 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
2486 * config/i386/i386-opts.h (harden_sls): New enum.
2487 * config/i386/i386.c (output_indirect_thunk): Mitigate against
2488 SLS for function return.
2489 (ix86_output_function_return): Likewise.
2490 (ix86_output_jmp_thunk_or_indirect): Mitigate against indirect
2492 (ix86_output_indirect_jmp): Likewise.
2493 (ix86_output_call_insn): Likewise.
2494 * config/i386/i386.opt: Add -mharden-sls=.
2495 * doc/invoke.texi: Document -mharden-sls=.
2497 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
2500 * config/i386/i386.c (ix86_code_end): Remove "%!" before ret.
2501 (ix86_output_function_return): Likewise.
2502 * config/i386/i386.md (simple_return_pop_internal): Likewise.
2504 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
2507 * ipa-modref.c (read_modref_records): Fix streaminig in of every_access
2510 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
2512 * config/i386/i386.c (indirect_thunks_used): Redefine as HARD_REG_SET.
2513 (ix86_code_end): Use TEST_HARD_REG_BIT on indirect_thunks_used.
2514 (ix86_output_indirect_branch_via_reg): Use SET_HARD_REG_BIT
2515 on indirect_thunks_used.
2516 (ix86_output_indirect_function_return): Ditto.
2518 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
2520 * ipa-modref-tree.c: Include cgraph.h and tree-streamer.h.
2521 (modref_access_node::stream_out): New member function.
2522 (modref_access_node::stream_in): New member function.
2523 * ipa-modref-tree.h (modref_access_node::stream_out,
2524 modref_access_node::stream_in): Declare.
2525 * ipa-modref.c (modref_summary_lto::useful_p): Free useless kills.
2526 (modref_summary_lto::dump): Dump kills.
2527 (analyze_store): Record kills for LTO
2528 (analyze_stmt): Likewise.
2529 (modref_summaries_lto::duplicate): Duplicate kills.
2530 (write_modref_records): Use new stream_out member function.
2531 (read_modref_records): Likewise.
2532 (modref_write): Stream out kills.
2533 (read_section): Stream in kills
2534 (remap_kills): New function.
2535 (update_signature): Use it.
2537 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
2539 * config/i386/i386.h (LEGACY_SSE_REGNO_P): New predicate.
2540 (SSE_REGNO_P): Use LEGACY_SSE_REGNO_P predicate.
2541 * config/i386/i386.c (zero_all_vector_registers):
2542 Use LEGACY_SSE_REGNO_P predicate.
2543 (ix86_register_priority): Use REX_INT_REGNO_P, REX_SSE_REGNO_P
2544 and EXT_REG_SSE_REGNO_P predicates.
2545 (ix86_hard_regno_call_part_clobbered): Use REX_SSE_REGNO_P
2546 and LEGACY_SSE_REGNO_P predicates.
2548 2021-11-17 Jason Merrill <jason@redhat.com>
2550 * doc/invoke.texi (C++ Dialect Options): Document
2551 -fimplicit-constexpr.
2553 2021-11-17 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2555 * config/aarch64/aarch64-modes.def (VECTOR_MODE): New V8DI mode.
2556 * config/aarch64/aarch64.c (aarch64_hard_regno_mode_ok): Handle
2558 * config/aarch64/iterators.md (define_mode_attr nunits): Add entry
2561 2021-11-17 Martin Uecker <uecker@gcc.gnu.org>
2565 * gimplify.c (gimplify_var_or_parm_decl): Update comment.
2566 (gimplify_compound_lval): Gimplify base expression first.
2567 (gimplify_target_expr): Add comment.
2569 2021-11-17 Jakub Jelinek <jakub@redhat.com>
2571 PR tree-optimization/103192
2572 * tree-ssa-loop-im.c (move_computations_worker): Use
2573 reset_flow_sensitive_info instead of manually clearing
2574 SSA_NAME_RANGE_INFO and do it for all SSA_NAMEs, not just ones
2575 with integral types.
2577 2021-11-17 Jakub Jelinek <jakub@redhat.com>
2579 PR tree-optimization/103255
2580 * gimple-range-fold.cc (fold_using_range::range_of_address): Return
2581 range_nonzero rather than unadjusted base's range. Formatting fixes.
2583 2021-11-17 Richard Sandiford <richard.sandiford@arm.com>
2585 * doc/md.texi (cond_fmin@var{mode}, cond_fmax@var{mode}): Document.
2586 * optabs.def (cond_fmin_optab, cond_fmax_optab): New optabs.
2587 * internal-fn.def (COND_FMIN, COND_FMAX): New functions.
2588 * internal-fn.c (first_commutative_argument): Handle them.
2589 (FOR_EACH_COND_FN_PAIR): Likewise.
2590 * match.pd (UNCOND_BINARY, COND_BINARY): Likewise.
2591 * config/aarch64/aarch64-sve.md (cond_<fmaxmin><mode>): New
2594 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
2596 * config/i386/i386.md (*add<dwi>3_doubleword, *addv<dwi>4_doubleword,
2597 *addv<dwi>4_doubleword_1, *sub<dwi>3_doubleword,
2598 *subv<dwi>4_doubleword, *subv<dwi>4_doubleword_1,
2599 *add<dwi>3_doubleword_cc_overflow_1, *divmodsi4_const,
2600 *neg<dwi>2_doubleword, *tls_dynamic_gnu2_combine_64_<mode>): Fix split
2603 2021-11-17 Andrew Pinski <apinski@marvell.com>
2605 PR tree-optimization/103288
2606 * tree-ssa-phiopt.c (value_replacement): Return early if middle
2607 block has more than one pred.
2609 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
2611 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
2612 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
2613 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
2614 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
2615 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
2616 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
2617 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
2618 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
2620 2021-11-17 Marek Polacek <polacek@redhat.com>
2622 PR preprocessor/103026
2623 * doc/invoke.texi: Document -Wbidi-chars.
2625 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
2628 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Fix clearing
2631 2021-11-16 Jan Hubicka <jh@suse.cz>
2633 * ipa-modref.c (get_modref_function_summary): Declare.
2634 * ipa-modref.h (get_modref_function_summary): New function.
2635 * tree-ssa-dse.c (clear_live_bytes_for_ref): Break out from ...
2636 (clear_bytes_written_by): ... here; also clear memory killed by
2639 2021-11-16 Iain Sandoe <iain@sandoe.co.uk>
2641 * ggc-common.c (gt_pch_save): If we cannot find a suitable
2642 memory segment for save, then error-out, do not try to
2644 (gt_pch_restore): Save the existing line table, and when
2645 the replacement is being read, use that when constructing
2648 2021-11-16 Peter Bergner <bergner@linux.ibm.com>
2651 * config/rs6000/mma.md (*vsx_assemble_pair): Add early-clobber for
2653 (*mma_assemble_acc): Likewise.
2655 2021-11-16 Martin Sebor <msebor@redhat.com>
2657 PR tree-optimization/102960
2658 * gimple-fold.c (get_range_strlen): Take bitmap as an argument rather
2659 than a pointer to it.
2660 (get_range_strlen_tree): Same. Remove bitmap allocation. Use
2662 (get_maxval_strlen): Use an auto_bitmap.
2663 * tree-ssa-strlen.c (get_range_strlen_dynamic): Factor out PHI
2665 (get_range_strlen_phi): ...into this function.
2666 Avoid assuming maximum string length is constant
2667 (printf_strlen_execute): Dump pointer query cache contents when
2668 details are requisted.
2670 2021-11-16 Jason Merrill <jason@redhat.com>
2672 * langhooks.h (struct lang_hooks): Adjust comment.
2673 * print-tree.c (print_node): Also call print_xnode hook for
2676 2021-11-16 Andrew Pinski <apinski@marvell.com>
2678 PR tree-optimization/103218
2679 * match.pd: New pattern for "((type)(a<0)) << SIGNBITOFA".
2681 2021-11-16 Claudiu Zissulescu <claziss@synopsys.com>
2683 * config/arc/arc.md (maddhisi4): Use a single move to accumulator.
2684 (umaddhisi4): Likewise.
2685 (machi): Update pattern.
2688 2021-11-16 Richard Biener <rguenther@suse.de>
2690 PR tree-optimization/102880
2691 * tree-ssa-dce.c (sort_phi_args): New function.
2692 (make_forwarders_with_degenerate_phis): Likewise.
2693 (perform_tree_ssa_dce): Call
2694 make_forwarders_with_degenerate_phis.
2696 2021-11-16 Richard Biener <rguenther@suse.de>
2698 PR tree-optimization/102880
2699 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Push
2700 single_pred (bb1) condition to places that really need it.
2701 (match_simplify_replacement): Likewise.
2702 (value_replacement): Likewise.
2703 (replace_phi_edge_with_variable): Deal with extra edges
2706 2021-11-16 Martin Jambor <mjambor@suse.cz>
2708 * cfgexpand.c (expand_gimple_basic_block): Use build_debug_expr_decl,
2709 add a fixme note about the mode assignment perhaps being unnecessary.
2710 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
2712 (ipa_param_body_adjustments::mark_dead_statements): Likewise.
2713 (ipa_param_body_adjustments::reset_debug_stmts): Likewise.
2714 * tree-inline.c (remap_ssa_name): Likewise.
2715 (tree_function_versioning): Likewise.
2716 * tree-into-ssa.c (rewrite_debug_stmt_uses): Likewise.
2717 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
2718 * tree-ssa.c (insert_debug_temp_for_var_def): Likewise.
2720 2021-11-16 Jakub Jelinek <jakub@redhat.com>
2722 PR tree-optimization/103208
2723 * omp-expand.c (expand_omp_build_cond): New function.
2724 (expand_omp_for_init_counts, expand_omp_for_init_vars,
2725 expand_omp_for_static_nochunk, expand_omp_for_static_chunk): Use it.
2727 2021-11-16 Jakub Jelinek <jakub@redhat.com>
2729 PR tree-optimization/102009
2730 * gimple-ssa-warn-access.cc (pass_waccess::check_alloc_size_call):
2731 Punt if any of alloc_size arguments is out of bounds vs. number of
2734 2021-11-16 Roger Sayle <roger@nextmovesoftware.com>
2736 * config/i386/i386.md (*bmi2_rorx<mode3>_1): Make conditional
2737 on !optimize_function_for_size_p.
2738 (*<any_rotate><mode>3_1): Add preferred_for_size attribute.
2739 (define_splits): Conditionalize on !optimize_function_for_size_p.
2740 (*bmi2_rorxsi3_1_zext): Likewise.
2741 (*<any_rotate>si2_1_zext): Add preferred_for_size attribute.
2742 (define_splits): Conditionalize on !optimize_function_for_size_p.
2744 2021-11-16 Jan Hubicka <jh@suse.cz>
2747 * ipa-modref.c (merge_call_side_effects): Fix uninitialized
2750 2021-11-16 Andrew Pinski <apinski@marvell.com>
2752 PR tree-optimization/103245
2753 * match.pd: Combine the abs pattern matching using multiplication.
2754 Adding optional nop_convert too.
2756 2021-11-16 H.J. Lu <hjl.tools@gmail.com>
2758 PR middle-end/103268
2759 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Add a missing
2762 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2764 * gimple-fold.c (gimple_fold_builtin_strncat): Use ranges to
2765 determine if it is safe to transform to strcat.
2766 (gimple_fold_builtin_snprintf): Likewise.
2768 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2770 * gimple-fold.c (known_lower): New function.
2771 (gimple_fold_builtin_strncat_chk,
2772 gimple_fold_builtin_memory_chk, gimple_fold_builtin_stxcpy_chk,
2773 gimple_fold_builtin_stxncpy_chk,
2774 gimple_fold_builtin_snprintf_chk,
2775 gimple_fold_builtin_sprintf_chk): Use it.
2777 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2779 * gimple-fold.c (dump_transformation): New function.
2780 (gimple_fold_builtin_stxcpy_chk,
2781 gimple_fold_builtin_stxncpy_chk): Use it. Simplify to
2782 BUILT_IN_STRNCPY if return value is not used.
2784 2021-11-15 H.J. Lu <hjl.tools@gmail.com>
2786 PR middle-end/103184
2787 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Check optab
2788 before transforming equivalent, but slighly different cases to
2789 their canonical forms.
2791 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
2794 * config/darwin.h (TARGET_DTORS_FROM_CXA_ATEXIT): New.
2795 * doc/tm.texi: Regenerated.
2796 * doc/tm.texi.in: Add TARGET_DTORS_FROM_CXA_ATEXIT hook.
2797 * ipa.c (cgraph_build_static_cdtor_1): Return the built
2799 (build_cxa_atexit_decl): New.
2800 (build_dso_handle_decl): New.
2801 (build_cxa_dtor_registrations): New.
2802 (compare_cdtor_tu_order): New.
2803 (build_cxa_atexit_fns): New.
2804 (ipa_cdtor_merge): If dtors_from_cxa_atexit is set,
2805 process the DTORs/CTORs accordingly.
2806 (pass_ipa_cdtor_merge::gate): Also run if
2807 dtors_from_cxa_atexit is set.
2808 * target.def (dtors_from_cxa_atexit): New hook.
2810 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
2812 * config.in: Regenerate.
2813 * configure: Regenerate.
2814 * configure.ac: Test ld64 for -platform-version support.
2816 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
2818 * diagnostic-spec.h (typedef xint_hash_t)
2819 (typedef xint_hash_map_t): Replace with...
2820 (typedef nowarn_map_t): ... this.
2821 (nowarn_map): Adjust.
2822 * diagnostic-spec.c (nowarn_map, suppress_warning_at): Likewise.
2824 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
2826 * profile.c (branch_prob): Use 'location_hash' for
2829 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2831 PR tree-optimization/103207
2832 * value-range.cc (irange::set): Drop overflow.
2834 2021-11-15 Jakub Jelinek <jakub@redhat.com>
2836 * gimplify.c (optimize_target_teams): Only add OMP_CLAUSE_THREAD_LIMIT
2837 to OMP_TARGET_CLAUSES if it isn't there already.
2839 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2841 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
2842 Remove useless code.
2843 (path_range_query::ssa_defined_in_bb): New.
2844 (path_range_query::ssa_range_in_phi): Avoid fold_range call that
2845 could trigger additional lookups.
2846 Do not use the cache for ARGs defined in this block.
2847 (path_range_query::compute_ranges_in_block): Use ssa_defined_in_bb.
2848 (path_range_query::maybe_register_phi_relation): Same.
2849 (path_range_query::range_of_stmt): Adjust comment.
2850 * gimple-range-path.h (ssa_defined_in_bb): New.
2852 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2854 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
2855 Default to global range if nothing found.
2857 2021-11-15 Richard Biener <rguenther@suse.de>
2859 PR tree-optimization/103237
2860 * tree-vect-loop.c (vect_is_simple_reduction): Fail for
2861 double reductions with multiple inner loop LC PHI nodes.
2863 2021-11-15 Hongyu Wang <hongyu.wang@intel.com>
2866 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
2867 New expand function.
2868 * config/i386/i386-options.c (ix86_target_string): Add
2869 -mrelax-cmpxchg-loop flag.
2870 (ix86_valid_target_attribute_inner_p): Likewise.
2871 * config/i386/i386-protos.h (ix86_expand_atomic_fetch_op_loop):
2872 New expand function prototype.
2873 * config/i386/i386.opt: Add -mrelax-cmpxchg-loop.
2874 * config/i386/sync.md (atomic_fetch_<logic><mode>): New expander
2876 (atomic_<logic>_fetch<mode>): Likewise.
2877 (atomic_fetch_nand<mode>): Likewise.
2878 (atomic_nand_fetch<mode>): Likewise.
2879 (atomic_fetch_<logic><mode>): New expander for DI,TI modes.
2880 (atomic_<logic>_fetch<mode>): Likewise.
2881 (atomic_fetch_nand<mode>): Likewise.
2882 (atomic_nand_fetch<mode>): Likewise.
2883 * doc/invoke.texi: Document -mrelax-cmpxchg-loop.
2885 2021-11-15 Richard Biener <rguenther@suse.de>
2887 PR tree-optimization/103219
2888 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Use single_exit
2889 to determine the exit for the VN region.
2891 2021-11-15 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
2893 * tree-ssa-loop.c (pass_vectorize): Move to tree-vectorizer.c.
2894 (pass_data_vectorize): Likewise.
2895 (make_pass_vectorize): Likewise.
2896 * tree-vectorizer.c (vectorize_loops): Merge with
2897 pass_vectorize::execute and replace cfun occurences with fun param.
2898 (adjust_simduid_builtins): Add fun param, replace cfun occurences with
2899 fun, and adjust callers approrpiately.
2900 (note_simd_array_uses): Likewise.
2901 (vect_loop_dist_alias_call): Likewise.
2902 (set_uid_loop_bbs): Likewise.
2903 (vect_transform_loops): Likewise.
2904 (try_vectorize_loop_1): Likewise.
2905 (try_vectorize_loop): Likewise.
2907 2021-11-15 Jakub Jelinek <jakub@redhat.com>
2910 * config/i386/sync.md (atomic_bit_test_and_set<mode>,
2911 atomic_bit_test_and_complement<mode>,
2912 atomic_bit_test_and_reset<mode>): Use OPTAB_WIDEN instead of
2915 2021-11-15 Maciej W. Rozycki <macro@embecosm.com>
2917 * config/vax/vax.c (vax_lra_p): New prototype and function.
2918 (TARGET_LRA_P): Wire it.
2919 * config/vax/vax.opt (mlra): New option.
2920 * doc/invoke.texi (Option Summary, VAX Options): Document the
2923 2021-11-14 Jan Hubicka <jh@suse.cz>
2925 * ipa-modref.h (struct modref_summary): Add nondeterministic
2926 and calls_interposable flags.
2927 * ipa-modref.c (modref_summary::modref_summary): Initialize new flags.
2928 (modref_summary::useful_p): Check new flags.
2929 (struct modref_summary_lto): Add nondeterministic and
2930 calls_interposable flags.
2931 (modref_summary_lto::modref_summary_lto): Initialize new flags.
2932 (modref_summary_lto::useful_p): Check new flags.
2933 (modref_summary::dump): Dump new flags.
2934 (modref_summary_lto::dump): Dump new flags.
2935 (ignore_nondeterminism_p): New function.
2936 (merge_call_side_effects): Merge new flags.
2937 (process_fnspec): Likewise.
2938 (analyze_load): Volatile access is nondeterministic.
2939 (analyze_store): Liekwise.
2940 (analyze_stmt): Volatile ASM is nondeterministic.
2941 (analyze_function): Clear new flags.
2942 (modref_summaries::duplicate): Duplicate new flags.
2943 (modref_summaries_lto::duplicate): Duplicate new flags.
2944 (modref_write): Stream new flags.
2945 (read_section): Stream new flags.
2946 (propagate_unknown_call): Update new flags.
2947 (modref_propagate_in_scc): Propagate new flags.
2948 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Check
2950 * tree-ssa-structalias.c (determine_global_memory_access):
2953 2021-11-14 Maciej W. Rozycki <macro@embecosm.com>
2955 * config/vax/vax.h (SET_RATIO): New macro.
2956 * config/vax/vax.md (UNSPEC_SETMEM_FILL): New constant.
2957 (setmemhi): New expander.
2958 (setmemhi1): New insn and splitter.
2959 (*setmemhi1): New insn.
2961 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
2963 * ipa-modref-tree.c (modref_access_node::update_for_kills): New
2965 (modref_access_node::merge_for_kills): Likewise.
2966 (modref_access_node::insert_kill): Likewise.
2967 * ipa-modref-tree.h (modref_access_node::update_for_kills,
2968 modref_access_node::merge_for_kills, modref_access_node::insert_kill):
2970 (modref_access_node::useful_for_kill): New member function.
2971 * ipa-modref.c (modref_summary::useful_p): Release useless kills.
2972 (lto_modref_summary): Add kills.
2973 (modref_summary::dump): Dump kills.
2974 (record_access): Add mdoref_access_node parameter.
2975 (record_access_lto): Likewise.
2976 (merge_call_side_effects): Merge kills.
2977 (analyze_call): Add ALWAYS_EXECUTED param and pass it around.
2978 (struct summary_ptrs): Add always_executed filed.
2979 (analyze_load): Update.
2980 (analyze_store): Update; record kills.
2981 (analyze_stmt): Add always_executed; record kills in clobbers.
2982 (analyze_function): Track always_executed.
2983 (modref_summaries::duplicate): Duplicate kills.
2984 (update_signature): Release kills.
2985 * ipa-modref.h (struct modref_summary): Add kills.
2986 * tree-ssa-alias.c (alias_stats): Add kill stats.
2987 (dump_alias_stats): Dump kill stats.
2988 (store_kills_ref_p): Break out from ...
2989 (stmt_kills_ref_p): Use it; handle modref info based kills.
2991 2021-11-14 Aldy Hernandez <aldyh@redhat.com>
2993 PR tree-optimization/103229
2994 * gimple-range-cache.cc (ssa_global_cache::clear): Do not pass
2995 null value to memset.
2997 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
2999 * ipa-modref-tree.c (modref_access_node::get_call_arg): New member
3001 (modref_access_node::get_ao_ref): Likewise.
3002 * ipa-modref-tree.h (modref_access_node::get_call_arg): Declare.
3003 (modref_access_node::get_ao_ref): Declare.
3004 * tree-ssa-alias.c (modref_may_conflict): Use new accessors.
3005 * tree-ssa-dse.c (dse_optimize_call): Use new accessors.
3007 2021-11-13 Jan Hubicka <jh@suse.cz>
3010 * dbgcnt.def (ipa_attr): New counters.
3011 * ipa-pure-const.c: Include dbgcnt.c
3012 (ipa_make_function_const): Use debug counter.
3013 (ipa_make_function_pure): Likewise.
3014 (propagate_pure_const): Fix bug in my previous change.
3016 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
3018 * ipa-modref-tree.c (modref_access_node::range_info_useful_p):
3019 Offline from ipa-modref-tree.h.
3020 (modref_access_node::dump): Move from ipa-modref.c; make member
3022 * ipa-modref-tree.h (modref_access_node::range_info_useful_p.
3023 modref_access_node::dump): Declare.
3024 * ipa-modref.c (dump_access): Remove.
3025 (dump_records): Update.
3026 (dump_lto_records): Update.
3027 (record_access): Update.
3028 (record_access_lto): Update.
3030 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
3032 * ipa-modref.c (modref_summary::modref_summary): Clear new flags.
3033 (modref_summary::dump): Dump try_dse.
3034 (modref_summary::finalize): Add FUN attribute; compute try-dse.
3035 (analyze_function): Update.
3036 (read_section): Update.
3037 (update_signature): Update.
3038 (pass_ipa_modref::execute): Update.
3039 * ipa-modref.h (struct modref_summary):
3040 * tree-ssa-alias.c (ao_ref_init_from_ptr_and_range): Export.
3041 * tree-ssa-alias.h (ao_ref_init_from_ptr_and_range): Declare.
3042 * tree-ssa-dse.c (dse_optimize_call): New function.
3043 (dse_optimize_stmt): Use it.
3045 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
3047 * ipa-modref-tree.c: Move #if CHECKING_P to proper place.
3049 2021-11-13 Jan Hubicka <jh@suse.cz>
3052 (struct modref_access_node): Move longer member functions to
3054 (modref_ref_node::try_merge_with): Turn into modreef_acces_node member
3056 * ipa-modref-tree.c (modref_access_node::contains): Move here
3057 from ipa-modref-tree.h.
3058 (modref_access_node::update): Likewise.
3059 (modref_access_node::merge): Likewise.
3060 (modref_access_node::closer_pair_p): Likewise.
3061 (modref_access_node::forced_merge): Likewise.
3062 (modref_access_node::update2): Likewise.
3063 (modref_access_node::combined_offsets): Likewise.
3064 (modref_access_node::try_merge_with): Likewise.
3065 (modref_access_node::insert): Likewise.
3067 2021-11-13 Jan Hubicka <jh@suse.cz>
3069 * ipa-modref.c (modref_summary::global_memory_read_p): Remove.
3070 (modref_summary::global_memory_written_p): Remove.
3071 (modref_summary::dump): Dump new flags.
3072 (modref_summary::finalize): New member function.
3073 (analyze_function): Call it.
3074 (read_section): Call it.
3075 (update_signature): Call it.
3076 (pass_ipa_modref::execute): Call it.
3077 * ipa-modref.h (struct modref_summary): Remove
3078 global_memory_read_p and global_memory_written_p.
3079 Add global_memory_read, global_memory_written.
3080 * tree-ssa-structalias.c (determine_global_memory_access):
3083 2021-11-13 Jan Hubicka <jh@suse.cz>
3085 * ipa-fnsummary.c (compute_fn_summary): Use type_attribut_allowed_p
3086 * ipa-param-manipulation.c
3087 (ipa_param_adjustments::type_attribute_allowed_p):
3088 New member function.
3089 (drop_type_attribute_if_params_changed_p): New function.
3090 (build_adjusted_function_type): Use it.
3091 * ipa-param-manipulation.h: Add type_attribute_allowed_p.
3093 2021-11-13 David Malcolm <dmalcolm@redhat.com>
3095 * doc/invoke.texi (Static Analyzer Options): Add
3096 -Wno-analyzer-tainted-allocation-size,
3097 -Wno-analyzer-tainted-divisor, -Wno-analyzer-tainted-offset, and
3098 -Wno-analyzer-tainted-size to list. Add
3099 -Wanalyzer-tainted-allocation-size, -Wanalyzer-tainted-divisor,
3100 -Wanalyzer-tainted-offset, and -Wanalyzer-tainted-size to list
3101 of options effectively enabled by -fanalyzer.
3102 (-Wanalyzer-tainted-allocation-size): New.
3103 (-Wanalyzer-tainted-array-index): Tweak wording; add link to CWE.
3104 (-Wanalyzer-tainted-divisor): New.
3105 (-Wanalyzer-tainted-offset): New.
3106 (-Wanalyzer-tainted-size): New.
3108 2021-11-13 Jan Hubicka <jh@suse.cz>
3110 * attr-fnspec.h (attr_fnspec::arg_eaf_flags): Break out from ...
3111 * gimple.c (gimple_call_arg_flags): ... here.
3112 * ipa-modref.c (analyze_parms): Record flags known from fnspec.
3113 (modref_merge_call_site_flags): Use arg_eaf_flags.
3115 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
3117 PR tree-optimization/103222
3118 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
3120 (path_range_query::compute_ranges_in_block): Call
3121 compute_ranges_in_phis.
3122 * gimple-range-path.h (path_range_query::compute_ranges_in_phis):
3125 2021-11-13 Jan Hubicka <jh@suse.cz>
3127 * ipa-fnsummary.c (compute_fn_summary): Do not give up on signature
3128 changes on "fn spec" attribute; give up on varadic types.
3129 * ipa-param-manipulation.c: Include attribs.h.
3130 (build_adjusted_function_type): New parameter ARG_MODIFIED; if it is
3131 true remove "fn spec" attribute.
3132 (ipa_param_adjustments::build_new_function_type): Update.
3133 (ipa_param_body_adjustments::modify_formal_parameters): update.
3134 * ipa-sra.c: Include attribs.h.
3135 (ipa_sra_preliminary_function_checks): Do not check for TYPE_ATTRIBUTES.
3137 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
3139 * gimple-range-path.cc (path_range_query::path_range_query): Merge
3141 (path_range_query::import_p): Move from header file.
3142 (path_range_query::~path_range_query): Adjust for combined ctors.
3143 * gimple-range-path.h: Merge ctors.
3144 (path_range_query::import_p): Move to .cc file.
3146 2021-11-13 Jan Hubicka <jh@suse.cz>
3148 PR tree-optimization/103182
3149 * ipa-modref.c (callee_to_caller_flags): Fix merging of flags.
3150 (modref_eaf_analysis::analyze_ssa_name): Fix merging of flags.
3152 2021-11-12 Stafford Horne <shorne@gmail.com>
3154 * config/or1k/or1k-protos.h (or1k_profile_hook): New function.
3155 * config/or1k/or1k.h (PROFILE_HOOK): Change macro to reference
3156 new function or1k_profile_hook.
3157 * config/or1k/or1k.c (struct machine_function): Add new field
3158 set_mcount_arg_insn.
3159 (or1k_profile_hook): New function.
3160 (or1k_init_pic_reg): Update to inject pic rtx after _mcount arg
3162 (or1k_frame_pointer_required): Frame pointer no longer needed
3165 2021-11-12 Jan Hubicka <jh@suse.cz>
3167 PR tree-optimization/103209
3168 * tree-ssa-structalias.c (find_func_aliases_for_call): Fix
3169 use of handle_rhs_call
3171 2021-11-12 Aldy Hernandez <aldyh@redhat.com>
3173 PR tree-optimization/103202
3174 * gimple-range-path.cc
3175 (path_range_query::compute_ranges_in_block): Solve PHI imports first.
3177 2021-11-12 Jan Hubicka <jh@suse.cz>
3179 * ipa-pure-const.c (propagate_pure_const): Remove redundant check;
3180 fix call of ipa_make_function_const and ipa_make_function_pure.
3182 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3184 * config/aarch64/aarch64.c (aarch64_vector_op::n_advsimd_ops): Delete.
3185 (aarch64_vector_op::m_seen_loads): Likewise.
3186 (aarch64_vector_costs::aarch64_vector_costs): Don't push to
3188 (aarch64_vector_op::count_ops): Remove vectype and factor parameters.
3189 Remove code that tries to predict different vec_flags from the
3191 (aarch64_vector_costs::add_stmt_cost): Update accordingly.
3192 Remove m_advsimd_ops handling.
3194 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3196 * config/aarch64/aarch64.c (aarch64_vector_costs::m_saw_sve_only_op)
3197 (aarch64_sve_only_stmt_p): Delete.
3198 (aarch64_vector_costs::prefer_unrolled_loop): New function,
3199 extracted from adjust_body_cost.
3200 (aarch64_vector_costs::better_main_loop_than_p): New function,
3201 using heuristics extracted from adjust_body_cost and
3202 adjust_body_cost_sve.
3203 (aarch64_vector_costs::adjust_body_cost_sve): Remove
3204 advsimd_cycles_per_iter and could_use_advsimd parameters.
3205 Update after changes above.
3206 (aarch64_vector_costs::adjust_body_cost): Update after changes above.
3208 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3210 * config/aarch64/aarch64.c (aarch64_vec_op_count::m_vf_factor):
3211 New member variable.
3212 (aarch64_vec_op_count::aarch64_vec_op_count): Add a parameter for it.
3213 (aarch64_vec_op_count::vf_factor): New function.
3214 (aarch64_vector_costs::aarch64_vector_costs): When costing for
3215 neoverse-512tvb, pass a vf_factor of 2 for the Neoverse V1 version
3217 (aarch64_vector_costs::adjust_body_cost): Read the vf factor
3218 instead of hard-coding 2.
3220 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3222 * config/aarch64/aarch64.c
3223 (aarch64_vec_op_count::rename_cycles_per_iter): New function.
3224 (aarch64_vec_op_count::min_nonpred_cycles_per_iter): Likewise.
3225 (aarch64_vec_op_count::min_pred_cycles_per_iter): Likewise.
3226 (aarch64_vec_op_count::min_cycles_per_iter): Likewise.
3227 (aarch64_vec_op_count::dump): Move earlier in file. Dump the
3228 above properties too.
3229 (aarch64_estimate_min_cycles_per_iter): Delete.
3230 (adjust_body_cost): Use aarch64_vec_op_count::min_cycles_per_iter
3231 instead of aarch64_estimate_min_cycles_per_iter. Rely on the dump
3232 routine to print CPI estimates.
3233 (adjust_body_cost_sve): Likewise. Use the other functions above
3234 instead of doing the work inline.
3236 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3238 * config/aarch64/aarch64.c (aarch64_vec_op_count): Allow default
3240 (aarch64_vec_op_count::base_issue_info): Remove handling of null
3242 (aarch64_vec_op_count::simd_issue_info): Likewise.
3243 (aarch64_vec_op_count::sve_issue_info): Likewise.
3244 (aarch64_vector_costs::m_ops): Turn into a vector.
3245 (aarch64_vector_costs::m_advsimd_ops): Likewise.
3246 (aarch64_vector_costs::aarch64_vector_costs): Add entries to
3247 the vectors based on aarch64_tune_params.
3248 (aarch64_vector_costs::analyze_loop_vinfo): Update the pred_ops
3249 of all entries in m_ops.
3250 (aarch64_vector_costs::add_stmt_cost): Call count_ops for all
3252 (aarch64_estimate_min_cycles_per_iter): Remove issue_info
3253 parameter and get the information from the ops instead.
3254 (aarch64_vector_costs::adjust_body_cost_sve): Take a
3255 aarch64_vec_issue_info instead of a aarch64_vec_op_count.
3256 (aarch64_vector_costs::adjust_body_cost): Update call accordingly.
3257 Exit earlier if m_ops is empty for either cost structure.
3259 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3261 * config/aarch64/aarch64.c (aarch64_vector_costs::m_scalar_ops)
3262 (aarch64_vector_costs::m_sve_ops): Replace with...
3263 (aarch64_vector_costs::m_ops): ...this.
3264 (aarch64_vector_costs::analyze_loop_vinfo): Update accordingly.
3265 (aarch64_vector_costs::adjust_body_cost_sve): Likewise.
3266 (aarch64_vector_costs::aarch64_vector_costs): Likewise.
3267 Initialize m_vec_flags here rather than in add_stmt_cost.
3268 (aarch64_vector_costs::count_ops): Test for scalar reductions too.
3269 Allow vectype to be null.
3270 (aarch64_vector_costs::add_stmt_cost): Call count_ops for scalar
3271 code too. Don't require vectype to be nonnull.
3272 (aarch64_vector_costs::adjust_body_cost): Take the loop_vec_info
3273 and scalar costs as parameters. Use the scalar costs to determine
3274 the cycles per iteration of the scalar loop, then multiply it
3275 by the estimated VF.
3276 (aarch64_vector_costs::finish_cost): Update call accordingly.
3278 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3280 * config/aarch64/aarch64.c (aarch64_dr_type): New function.
3281 (aarch64_vector_costs::count_ops): Use it rather than the
3282 vectype to determine floatness.
3284 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3286 * config/aarch64/aarch64.c (aarch64_sve_in_loop_reduction_latency):
3287 Remove vectype parameter and get floatness from the type of the
3289 (arch64_in_loop_reduction_latency): Likewise.
3290 (aarch64_detect_vector_stmt_subtype): Update caller.
3291 (aarch64_vector_costs::count_ops): Likewise.
3293 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3295 * config/aarch64/aarch64.c (aarch64_sve_op_count): Fold into...
3296 (aarch64_vec_op_count): ...this. Add a constructor.
3297 (aarch64_vec_op_count::vec_flags): New function.
3298 (aarch64_vec_op_count::base_issue_info): Likewise.
3299 (aarch64_vec_op_count::simd_issue_info): Likewise.
3300 (aarch64_vec_op_count::sve_issue_info): Likewise.
3301 (aarch64_vec_op_count::m_issue_info): New member variable.
3302 (aarch64_vec_op_count::m_vec_flags): Likewise.
3303 (aarch64_vector_costs): Add a constructor.
3304 (aarch64_vector_costs::m_sve_ops): Change type to aarch64_vec_op_count.
3305 (aarch64_vector_costs::aarch64_vector_costs): New function.
3306 Initialize m_scalar_ops, m_advsimd_ops and m_sve_ops.
3307 (aarch64_vector_costs::count_ops): Remove vec_flags and
3308 issue_info parameters, using the new aarch64_vec_op_count
3310 (aarch64_vector_costs::add_stmt_cost): Update call accordingly.
3311 (aarch64_sve_op_count::dump): Fold into...
3312 (aarch64_vec_op_count::dump): ..here.
3314 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
3316 * config/aarch64/aarch64.c: Include tree-dfa.h.
3317 (aarch64_check_consecutive_mems): New function that takes MEM_EXPR
3318 and MEM_OFFSET into account.
3319 (aarch64_swap_ldrstr_operands): Use it.
3320 (aarch64_operands_ok_for_ldpstp): Likewise. Check that the
3321 address of the second memory doesn't depend on the result of
3324 2021-11-12 Jan Hubicka <hubicka@ucw.cz>
3326 * ipa-pure-const.c (ipa_make_function_pure): Fix exit condition.
3328 2021-11-12 Jan Hubicka <jh@suse.cz>
3330 PR tree-optimization/103175
3331 * ipa-modref.c (modref_lattice::merge): Add sanity check.
3332 (callee_to_caller_flags): Make flags adjustment sane.
3333 (modref_eaf_analysis::analyze_ssa_name): Likewise.
3335 2021-11-12 Jan Hubicka <jh@suse.cz>
3338 * ipa-modref.c (analyze_function, modref_propagate_in_scc): Do
3339 not mark pure/const function if there are side-effects.
3341 2021-11-12 Chung-Lin Tang <cltang@codesourcery.com>
3343 * tree.h (OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P): New access macro for
3344 'implicit' bit, using 'base.deprecated_flag' field of tree_node.
3345 * tree-pretty-print.c (dump_omp_clause): Add support for printing
3346 implicit attribute in tree dumping.
3347 * gimplify.c (gimplify_adjust_omp_clauses_1):
3348 Set OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P to 1 if map clause is implicitly
3350 (gimplify_adjust_omp_clauses): Adjust place of adding implicitly created
3351 clauses, from simple append, to starting of list, after non-map clauses.
3352 * omp-low.c (lower_omp_target): Add GOMP_MAP_IMPLICIT bits into kind
3353 values passed to libgomp for implicit maps.
3355 2021-11-12 Jakub Jelinek <jakub@redhat.com>
3357 * omp-builtins.def (BUILT_IN_GOMP_TEAMS): Remove.
3358 (BUILT_IN_GOMP_TEAMS4): New.
3359 * builtin-types.def (BT_FN_VOID_UINT_UINT): Remove.
3360 (BT_FN_BOOL_UINT_UINT_UINT_BOOL): New.
3361 * omp-low.c (lower_omp_teams): Use GOMP_teams4 instead of
3362 GOMP_teams, pass to it also num_teams lower-bound expression
3363 or a dup of upper-bound if it is missing and a flag whether
3364 it is the first call or not.
3366 2021-11-12 Martin Liska <mliska@suse.cz>
3368 PR tree-optimization/102497
3369 * gimple-predicate-analysis.cc (add_pred): Remove unused
3372 2021-11-12 Richard Biener <rguenther@suse.de>
3374 PR tree-optimization/103204
3375 * tree-ssa-sccvn.c (valueize_refs_1): Re-valueize the
3376 top operand after folding in an address.
3378 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3380 * gimple-range-path.cc (path_range_query::path_range_query): New
3381 ctor without a ranger.
3382 (path_range_query::~path_range_query): Free ranger if necessary.
3383 (path_range_query::range_on_path_entry): Adjust m_ranger for pointer.
3384 (path_range_query::ssa_range_in_phi): Same.
3385 (path_range_query::compute_ranges_in_block): Same.
3386 (path_range_query::compute_imports): Same.
3387 (path_range_query::compute_ranges): Same.
3388 (path_range_query::range_of_stmt): Same.
3389 (path_range_query::compute_outgoing_relations): Same.
3390 * gimple-range-path.h (class path_range_query): New ctor.
3391 * tree-ssa-loop-ch.c (ch_base::copy_headers): Remove gimple_ranger
3392 as path_range_query allocates one.
3393 * tree-ssa-threadbackward.c (class back_threader): Remove m_ranger.
3394 (back_threader::~back_threader): Same.
3396 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3398 * tree-ssa-threadbackward.c
3399 (back_threader_profitability::profitable_path_p): Remove loop
3400 crossing restriction.
3402 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3404 * ipa-modref.c (analyze_function): Do pure/const discovery, return
3406 (pass_modref::execute): If pure/const is discovered fixup cfg.
3407 (ignore_edge): Do not ignore pure/const edges.
3408 (modref_propagate_in_scc): Do pure/const discovery, return true if
3409 cdtor was promoted pure/const.
3410 (pass_ipa_modref::execute): If needed remove unreachable functions.
3411 * ipa-pure-const.c (warn_function_noreturn): Fix whitespace.
3412 (warn_function_cold): Likewise.
3413 (skip_function_for_local_pure_const): Move earlier.
3414 (ipa_make_function_const): Break out from ...
3415 (ipa_make_function_pure): Break out from ...
3416 (propagate_pure_const): ... here.
3417 (pass_local_pure_const::execute): Use it.
3418 * ipa-utils.h (ipa_make_function_const): Declare.
3419 (ipa_make_function_pure): Declare.
3420 * passes.def: Move early modref after pure-const.
3422 2021-11-11 David Malcolm <dmalcolm@redhat.com>
3425 * diagnostic-show-locus.c (def_policy): Use def_tabstop.
3427 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3429 * config/aarch64/aarch64-builtins.c (TYPES_COMBINE): Delete.
3430 (TYPES_COMBINEP): Delete.
3431 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3432 qualified builtins for vcombine_* intrinsics.
3433 * config/aarch64/arm_neon.h (vcombine_s8): Remove unnecessary
3435 (vcombine_s16): Likewise.
3436 (vcombine_s32): Likewise.
3437 (vcombine_f32): Likewise.
3438 (vcombine_u8): Use type-qualified builtin and remove casts.
3439 (vcombine_u16): Likewise.
3440 (vcombine_u32): Likewise.
3441 (vcombine_u64): Likewise.
3442 (vcombine_p8): Likewise.
3443 (vcombine_p16): Likewise.
3444 (vcombine_p64): Likewise.
3445 (vcombine_bf16): Remove unnecessary cast.
3446 * config/aarch64/iterators.md (VD_I): New mode iterator.
3447 (VDC_P): New mode iterator.
3449 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3451 * config/aarch64/aarch64-builtins.c (TYPES_LOAD1_U): Define.
3452 (TYPES_LOAD1_P): Define.
3453 (TYPES_STORE1_U): Define.
3454 (TYPES_STORE1P): Rename to...
3455 (TYPES_STORE1_P): This.
3456 (get_mem_type_for_load_store): Add unsigned and poly types.
3457 (aarch64_general_gimple_fold_builtin): Add unsigned and poly
3458 type-qualified builtin declarations.
3459 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3460 qualified builtins for LD1/ST1.
3461 * config/aarch64/arm_neon.h (vld1_p8): Use type-qualified
3462 builtin and remove cast.
3463 (vld1_p16): Likewise.
3464 (vld1_u8): Likewise.
3465 (vld1_u16): Likewise.
3466 (vld1_u32): Likewise.
3467 (vld1q_p8): Likewise.
3468 (vld1q_p16): Likewise.
3469 (vld1q_p64): Likewise.
3470 (vld1q_u8): Likewise.
3471 (vld1q_u16): Likewise.
3472 (vld1q_u32): Likewise.
3473 (vld1q_u64): Likewise.
3474 (vst1_p8): Likewise.
3475 (vst1_p16): Likewise.
3476 (vst1_u8): Likewise.
3477 (vst1_u16): Likewise.
3478 (vst1_u32): Likewise.
3479 (vst1q_p8): Likewise.
3480 (vst1q_p16): Likewise.
3481 (vst1q_p64): Likewise.
3482 (vst1q_u8): Likewise.
3483 (vst1q_u16): Likewise.
3484 (vst1q_u32): Likewise.
3485 (vst1q_u64): Likewise.
3486 * config/aarch64/iterators.md (VALLP_NO_DI): New iterator.
3488 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3490 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3491 builtins for vector reduction.
3492 * config/aarch64/arm_neon.h (vaddv_u8): Use type-qualified
3493 builtin and remove casts.
3494 (vaddv_u16): Likewise.
3495 (vaddv_u32): Likewise.
3496 (vaddvq_u8): Likewise.
3497 (vaddvq_u16): Likewise.
3498 (vaddvq_u32): Likewise.
3499 (vaddvq_u64): Likewise.
3501 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3503 * config/aarch64/aarch64-simd-builtins.def:
3504 * config/aarch64/arm_neon.h (vpaddq_u8): Use type-qualified
3505 builtin and remove casts.
3506 (vpaddq_u16): Likewise.
3507 (vpaddq_u32): Likewise.
3508 (vpaddq_u64): Likewise.
3509 (vpadd_u8): Likewise.
3510 (vpadd_u16): Likewise.
3511 (vpadd_u32): Likewise.
3512 (vpaddd_u64): Likewise.
3514 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3516 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3517 builtins for [r]subhn[2].
3518 * config/aarch64/arm_neon.h (vsubhn_s16): Remove unnecessary
3520 (vsubhn_s32): Likewise.
3521 (vsubhn_s64): Likewise.
3522 (vsubhn_u16): Use type-qualified builtin and remove casts.
3523 (vsubhn_u32): Likewise.
3524 (vsubhn_u64): Likewise.
3525 (vrsubhn_s16): Remove unnecessary cast.
3526 (vrsubhn_s32): Likewise.
3527 (vrsubhn_s64): Likewise.
3528 (vrsubhn_u16): Use type-qualified builtin and remove casts.
3529 (vrsubhn_u32): Likewise.
3530 (vrsubhn_u64): Likewise.
3531 (vrsubhn_high_s16): Remove unnecessary cast.
3532 (vrsubhn_high_s32): Likewise.
3533 (vrsubhn_high_s64): Likewise.
3534 (vrsubhn_high_u16): Use type-qualified builtin and remove
3536 (vrsubhn_high_u32): Likewise.
3537 (vrsubhn_high_u64): Likewise.
3538 (vsubhn_high_s16): Remove unnecessary cast.
3539 (vsubhn_high_s32): Likewise.
3540 (vsubhn_high_s64): Likewise.
3541 (vsubhn_high_u16): Use type-qualified builtin and remove
3543 (vsubhn_high_u32): Likewise.
3544 (vsubhn_high_u64): Likewise.
3546 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3548 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3549 builtins for [r]addhn[2].
3550 * config/aarch64/arm_neon.h (vaddhn_s16): Remove unnecessary
3552 (vaddhn_s32): Likewise.
3553 (vaddhn_s64): Likewise.
3554 (vaddhn_u16): Use type-qualified builtin and remove casts.
3555 (vaddhn_u32): Likewise.
3556 (vaddhn_u64): Likewise.
3557 (vraddhn_s16): Remove unnecessary cast.
3558 (vraddhn_s32): Likewise.
3559 (vraddhn_s64): Likewise.
3560 (vraddhn_u16): Use type-qualified builtin and remove casts.
3561 (vraddhn_u32): Likewise.
3562 (vraddhn_u64): Likewise.
3563 (vaddhn_high_s16): Remove unnecessary cast.
3564 (vaddhn_high_s32): Likewise.
3565 (vaddhn_high_s64): Likewise.
3566 (vaddhn_high_u16): Use type-qualified builtin and remove
3568 (vaddhn_high_u32): Likewise.
3569 (vaddhn_high_u64): Likewise.
3570 (vraddhn_high_s16): Remove unnecessary cast.
3571 (vraddhn_high_s32): Likewise.
3572 (vraddhn_high_s64): Likewise.
3573 (vraddhn_high_u16): Use type-qualified builtin and remove
3575 (vraddhn_high_u32): Likewise.
3576 (vraddhn_high_u64): Likewise.
3578 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3580 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3581 qualifiers in generator macros for uhsub builtins.
3582 * config/aarch64/arm_neon.h (vhsub_s8): Remove unnecessary
3584 (vhsub_s16): Likewise.
3585 (vhsub_s32): Likewise.
3586 (vhsub_u8): Use type-qualified builtin and remove casts.
3587 (vhsub_u16): Likewise.
3588 (vhsub_u32): Likewise.
3589 (vhsubq_s8): Remove unnecessary cast.
3590 (vhsubq_s16): Likewise.
3591 (vhsubq_s32): Likewise.
3592 (vhsubq_u8): Use type-qualified builtin and remove casts.
3593 (vhsubq_u16): Likewise.
3594 (vhsubq_u32): Likewise.
3596 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3598 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3599 qualifiers in generator macros for u[r]hadd builtins.
3600 * config/aarch64/arm_neon.h (vhadd_s8): Remove unnecessary
3602 (vhadd_s16): Likewise.
3603 (vhadd_s32): Likewise.
3604 (vhadd_u8): Use type-qualified builtin and remove casts.
3605 (vhadd_u16): Likewise.
3606 (vhadd_u32): Likewise.
3607 (vhaddq_s8): Remove unnecessary cast.
3608 (vhaddq_s16): Likewise.
3609 (vhaddq_s32): Likewise.
3610 (vhaddq_u8): Use type-qualified builtin and remove casts.
3611 (vhaddq_u16): Likewise.
3612 (vhaddq_u32): Likewise.
3613 (vrhadd_s8): Remove unnecessary cast.
3614 (vrhadd_s16): Likewise.
3615 (vrhadd_s32): Likewise.
3616 (vrhadd_u8): Use type-qualified builtin and remove casts.
3617 (vrhadd_u16): Likewise.
3618 (vrhadd_u32): Likewise.
3619 (vrhaddq_s8): Remove unnecessary cast.
3620 (vrhaddq_s16): Likewise.
3621 (vrhaddq_s32): Likewise.
3622 (vrhaddq_u8): Use type-wualified builtin and remove casts.
3623 (vrhaddq_u16): Likewise.
3624 (vrhaddq_u32): Likewise.
3626 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3628 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3629 qualifiers in generator macros for usub[lw][2] builtins.
3630 * config/aarch64/arm_neon.h (vsubl_s8): Remove unnecessary
3632 (vsubl_s16): Likewise.
3633 (vsubl_s32): Likewise.
3634 (vsubl_u8): Use type-qualified builtin and remove casts.
3635 (vsubl_u16): Likewise.
3636 (vsubl_u32): Likewise.
3637 (vsubl_high_s8): Remove unnecessary cast.
3638 (vsubl_high_s16): Likewise.
3639 (vsubl_high_s32): Likewise.
3640 (vsubl_high_u8): Use type-qualified builtin and remove casts.
3641 (vsubl_high_u16): Likewise.
3642 (vsubl_high_u32): Likewise.
3643 (vsubw_s8): Remove unnecessary casts.
3644 (vsubw_s16): Likewise.
3645 (vsubw_s32): Likewise.
3646 (vsubw_u8): Use type-qualified builtin and remove casts.
3647 (vsubw_u16): Likewise.
3648 (vsubw_u32): Likewise.
3649 (vsubw_high_s8): Remove unnecessary cast.
3650 (vsubw_high_s16): Likewise.
3651 (vsubw_high_s32): Likewise.
3652 (vsubw_high_u8): Use type-qualified builtin and remove casts.
3653 (vsubw_high_u16): Likewise.
3654 (vsubw_high_u32): Likewise.
3656 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3658 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3659 qualifiers in generator macros for uadd[lw][2] builtins.
3660 * config/aarch64/arm_neon.h (vaddl_s8): Remove unnecessary
3662 (vaddl_s16): Likewise.
3663 (vaddl_s32): Likewise.
3664 (vaddl_u8): Use type-qualified builtin and remove casts.
3665 (vaddl_u16): Likewise.
3666 (vaddl_u32): Likewise.
3667 (vaddl_high_s8): Remove unnecessary cast.
3668 (vaddl_high_s16): Likewise.
3669 (vaddl_high_s32): Likewise.
3670 (vaddl_high_u8): Use type-qualified builtin and remove casts.
3671 (vaddl_high_u16): Likewise.
3672 (vaddl_high_u32): Likewise.
3673 (vaddw_s8): Remove unnecessary cast.
3674 (vaddw_s16): Likewise.
3675 (vaddw_s32): Likewise.
3676 (vaddw_u8): Use type-qualified builtin and remove casts.
3677 (vaddw_u16): Likewise.
3678 (vaddw_u32): Likewise.
3679 (vaddw_high_s8): Remove unnecessary cast.
3680 (vaddw_high_s16): Likewise.
3681 (vaddw_high_s32): Likewise.
3682 (vaddw_high_u8): Use type-qualified builtin and remove casts.
3683 (vaddw_high_u16): Likewise.
3684 (vaddw_high_u32): Likewise.
3686 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3688 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3689 qualified builtins for [R]SHRN[2].
3690 * config/aarch64/arm_neon.h (vshrn_n_u16): Use type-qualified
3691 builtin and remove casts.
3692 (vshrn_n_u32): Likewise.
3693 (vshrn_n_u64): Likewise.
3694 (vrshrn_high_n_u16): Likewise.
3695 (vrshrn_high_n_u32): Likewise.
3696 (vrshrn_high_n_u64): Likewise.
3697 (vrshrn_n_u16): Likewise.
3698 (vrshrn_n_u32): Likewise.
3699 (vrshrn_n_u64): Likewise.
3700 (vshrn_high_n_u16): Likewise.
3701 (vshrn_high_n_u32): Likewise.
3702 (vshrn_high_n_u64): Likewise.
3704 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3706 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3707 type-qualified builtins for XTN[2].
3708 * config/aarch64/arm_neon.h (vmovn_high_u16): Use type-
3709 qualified builtin and remove casts.
3710 (vmovn_high_u32): Likewise.
3711 (vmovn_high_u64): Likewise.
3712 (vmovn_u16): Likewise.
3713 (vmovn_u32): Likewise.
3714 (vmovn_u64): Likewise.
3716 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3718 * config/aarch64/aarch64-simd-builtins.def: Use poly type
3719 qualifier in builtin generator macros.
3720 * config/aarch64/arm_neon.h (vmul_p8): Use type-qualified
3721 builtin and remove casts.
3722 (vmulq_p8): Likewise.
3723 (vmull_high_p8): Likewise.
3724 (vmull_p8): Likewise.
3726 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3728 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3729 qualified builtin generators for unsigned MLA/MLS intrinsics.
3730 * config/aarch64/arm_neon.h (vmla_n_u16): Use type-qualified
3732 (vmla_n_u32): Likewise.
3733 (vmla_u8): Likewise.
3734 (vmla_u16): Likewise.
3735 (vmla_u32): Likewise.
3736 (vmlaq_n_u16): Likewise.
3737 (vmlaq_n_u32): Likewise.
3738 (vmlaq_u8): Likewise.
3739 (vmlaq_u16): Likewise.
3740 (vmlaq_u32): Likewise.
3741 (vmls_n_u16): Likewise.
3742 (vmls_n_u32): Likewise.
3743 (vmls_u8): Likewise.
3744 (vmls_u16): Likewise.
3745 (vmls_u32): Likewise.
3746 (vmlsq_n_u16): Likewise.
3747 (vmlsq_n_u32): Likewise.
3748 (vmlsq_u8): Likewise.
3749 (vmlsq_u16): Likewise.
3750 (vmlsq_u32): Likewise.
3752 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3754 * ipa-modref.c (modref_summary::useful_p): Check also for side-effects
3755 with looping const/pure.
3756 (modref_summary_lto::useful_p): Likewise.
3757 (merge_call_side_effects): Merge side effects before early exit
3759 (process_fnspec): Also handle pure functions.
3760 (analyze_call): Do not early exit on looping pure const.
3761 (propagate_unknown_call): Also handle nontrivial SCC as side-effect.
3762 (modref_propagate_in_scc): Update.
3764 2021-11-11 Richard Biener <rguenther@suse.de>
3766 PR tree-optimization/103190
3767 * tree-ssa-reassoc.c (insert_stmt_after): Only assert on asm goto.
3769 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3771 * gimple-range-path.cc (path_range_query::add_copies_to_imports):
3773 (path_range_query::compute_imports): ...this. Adapt it so it can
3774 be passed the imports bitmap instead of working on m_imports.
3775 (path_range_query::compute_ranges): Call compute_imports in all
3776 cases unless an imports bitmap is passed.
3777 * gimple-range-path.h (path_range_query::compute_imports): New.
3778 (path_range_query::add_copies_to_imports): Remove.
3779 * tree-ssa-threadbackward.c (back_threader::resolve_def): Remove.
3780 (back_threader::find_paths_to_names): Inline resolve_def.
3781 (back_threader::find_paths): Call compute_imports.
3782 (back_threader::resolve_phi): Adjust comment.
3784 2021-11-11 Richard Biener <rguenther@suse.de>
3786 PR tree-optimization/103188
3787 * tree-ssa-loop-ch.c (should_duplicate_loop_header_p):
3788 Remove query parameter, split out check for size
3790 (ch_base::m_ranger, cb_base::m_query): Remove.
3791 (ch_base::copy_headers): Split processing loop into
3792 analysis around which we allocate and use ranger and
3793 transform where we do not.
3794 (pass_ch::execute): Do not allocate/free ranger here.
3795 (pass_ch_vect::execute): Likewise.
3797 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3799 * ipa-pure-const.c (propagate_pure_const): Self recursion is
3802 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3804 * cgraph.c (set_noreturn_flag_1): New function.
3805 (cgraph_node::set_noreturn_flag): New member function
3806 * cgraph.h (cgraph_node::set_noreturn_flags): Declare.
3807 * ipa-pure-const.c (pass_local_pure_const::execute): Use it.
3809 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3810 Richard Biener <rguenther@suse.de>
3812 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): Resolve
3813 statically to the edge remaining in the loop.
3815 2021-11-11 Richard Biener <rguenther@suse.de>
3817 PR middle-end/103181
3818 * tree-eh.c (operation_could_trap_helper_p): Properly
3819 check vector constants for a zero element for integer
3820 division. Separate floating point and integer division code.
3822 2021-11-11 Jakub Jelinek <jakub@redhat.com>
3825 * dwarf2out.c (field_byte_offset): Do the PCC_BITFIELD_TYPE_MATTERS
3826 handling only for DECL_BIT_FIELD_TYPE decls.
3828 2021-11-11 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
3831 * config/aarch64/aarch64.c (aarch64_process_target_attr): Check if
3832 token is arch extension without leading '+' and emit appropriate
3833 diagnostic for the same.
3835 2021-11-11 Jakub Jelinek <jakub@redhat.com>
3837 * tree.h (OMP_CLAUSE_NUM_TEAMS_EXPR): Rename to ...
3838 (OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR): ... this.
3839 (OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR): Define.
3840 * tree.c (omp_clause_num_ops): Increase num ops for
3841 OMP_CLAUSE_NUM_TEAMS to 2.
3842 * tree-pretty-print.c (dump_omp_clause): Print optional lower bound
3843 for OMP_CLAUSE_NUM_TEAMS.
3844 * gimplify.c (gimplify_scan_omp_clauses): Gimplify
3845 OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR if non-NULL.
3846 (optimize_target_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR instead
3847 of OMP_CLAUSE_NUM_TEAMS_EXPR. Handle OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR.
3848 * omp-low.c (lower_omp_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR
3849 instead of OMP_CLAUSE_NUM_TEAMS_EXPR.
3850 * omp-expand.c (expand_teams_call, get_target_arguments): Likewise.
3852 2021-11-11 Richard Biener <rguenther@suse.de>
3854 * cfganal.c (find_pdom): Remove.
3855 (control_dependences::find_control_dependence): Remove
3856 special-casing of entry block, call get_immediate_dominator
3858 * gimple-predicate-analysis.cc (find_pdom): Remove.
3859 (find_dom): Likewise.
3860 (find_control_equiv_block): Call get_immediate_dominator
3862 (compute_control_dep_chain): Likewise.
3863 (predicate::init_from_phi_def): Likewise.
3865 2021-11-11 Richard Biener <rguenther@suse.de>
3867 * cfganal.h (control_dependences::control_dependence_map):
3869 (control_dependences::m_bitmaps): New.
3870 * cfganal.c (control_dependences::set_control_dependence_map_bit):
3872 (control_dependences::clear_control_dependence_bitmap):
3874 (control_dependences::find_control_dependence): Do not
3875 find_edge for the abnormal edge test.
3876 (control_dependences::control_dependences): Instead do not
3877 add abnormal edges to the edge list. Adjust.
3878 (control_dependences::~control_dependences): Likewise.
3879 (control_dependences::get_edges_dependent_on): Likewise.
3880 * function-tests.c: Include bitmap.h.
3882 2021-11-11 Kewen Lin <linkw@linux.ibm.com>
3884 * doc/invoke.texi: Change references to "future cpu" to "power10",
3885 "-mcpu=future" to "-mcpu=power10". Adjust words for float128.
3887 2021-11-11 Cui,Lili <lili.cui@intel.com>
3889 * config/i386/i386-options.c (m_CORE_AVX2): Remove Alderlake
3891 (processor_cost_table): Use alderlake_cost for Alderlake.
3892 * config/i386/i386.c (ix86_sched_init_global): Handle Alderlake.
3893 * config/i386/x86-tune-costs.h (struct processor_costs): Add alderlake
3895 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Alderlake
3897 (ix86_adjust_cost): Handle Alderlake.
3898 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE): Enable for Alderlake.
3899 (X86_TUNE_PARTIAL_REG_DEPENDENCY): Likewise.
3900 (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY): Likewise.
3901 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): Likewise.
3902 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
3903 (X86_TUNE_MEMORY_MISMATCH_STALL): Likewise.
3904 (X86_TUNE_USE_LEAVE): Likewise.
3905 (X86_TUNE_PUSH_MEMORY): Likewise.
3906 (X86_TUNE_USE_INCDEC): Likewise.
3907 (X86_TUNE_INTEGER_DFMODE_MOVES): Likewise.
3908 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
3909 (X86_TUNE_USE_SAHF): Likewise.
3910 (X86_TUNE_USE_BT): Likewise.
3911 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
3912 (X86_TUNE_ONE_IF_CONV_INSN): Likewise.
3913 (X86_TUNE_AVOID_MFENCE): Likewise.
3914 (X86_TUNE_USE_SIMODE_FIOP): Likewise.
3915 (X86_TUNE_EXT_80387_CONSTANTS): Likewise.
3916 (X86_TUNE_SSE_UNALIGNED_LOAD_OPTIMAL): Likewise.
3917 (X86_TUNE_SSE_UNALIGNED_STORE_OPTIMAL): Likewise.
3918 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
3919 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
3920 (X86_TUNE_AVOID_4BYTE_PREFIXES): Likewise.
3921 (X86_TUNE_USE_GATHER): Disable for Alderlake.
3922 (X86_TUNE_AVX256_MOVE_BY_PIECES): Likewise.
3923 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
3925 2021-11-11 liuhongt <hongtao.liu@intel.com>
3928 * config/i386/sse.md (V_128_256): Extend to V8HF/V16HF.
3929 (avxsizesuffix): Ditto.
3931 2021-11-11 Kito Cheng <kito.cheng@sifive.com>
3933 * common/config/riscv/riscv-common.c (riscv_subset_list::to_string): Fix
3934 wrong marco checking.
3936 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3938 PR tree-optimization/102906
3939 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): New.
3940 (should_duplicate_loop_header_p): Call entry_loop_condition_is_static.
3941 (class ch_base): Add m_ranger and m_query.
3942 (ch_base::copy_headers): Pass m_query to
3943 entry_loop_condition_is_static.
3944 (pass_ch::execute): Allocate and deallocate m_ranger and
3946 (pass_ch_vect::execute): Same.
3948 2021-11-10 Andrew Pinski <apinski@marvell.com>
3951 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>):
3952 Use vwcore iterator for the r constraint output string.
3954 2021-11-10 qing zhao <qing.zhao@oracle.com>
3956 * internal-fn.c (expand_DEFERRED_INIT): Apply pattern initialization
3957 only when have_insn_for return true for the mode. Fix a memory leak.
3959 2021-11-10 Christophe Lyon <christophe.lyon@foss.st.com>
3961 * config/arm/arm.c (cortexa9_extra_costs, cortexa8_extra_costs,
3962 cortexa5_extra_costs, cortexa7_extra_costs,
3963 cortexa12_extra_costs, cortexa15_extra_costs, v7m_extra_costs):
3964 Initialize movi, dup and extract costing fields.
3966 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3968 * gimple-range-path.cc (path_range_query::path_range_query): Do
3970 (path_range_query::dump): Change m_path uses to non-pointer.
3971 (path_range_query::defined_outside_path): Same.
3972 (path_range_query::set_path): Same.
3973 (path_range_query::add_copies_to_imports): Same.
3974 (path_range_query::range_of_stmt): Same.
3975 (path_range_query::compute_outgoing_relations): Same.
3976 (path_range_query::compute_ranges): Imports are now optional.
3977 Implement overload that takes an edge.
3978 * gimple-range-path.h (class path_range_query): Make imports
3979 optional for compute_ranges. Add compute_ranges(edge) overload.
3980 Make m_path an auto_vec instead of a pointer and adjust
3983 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3985 * tree-vectorizer.h (struct scalar_cond_masked_key): Add inverted_p.
3986 (default_hash_traits<scalar_conf_masked_key>): Likewise.
3987 * tree-vect-stmts.c (vectorizable_condition): Check if inverse of mask
3989 * tree-vectorizer.c (scalar_cond_masked_key::get_cond_ops_from_tree):
3990 Register mask inverses.
3992 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3994 * tree-vectorizer.c (vectorize_loops): Do local CSE through RPVN upon
3995 successful vectorization.
3997 2021-11-10 Andrew MacLeod <amacleod@redhat.com>
3999 * gimple-range-cache.cc (sbr_vector::grow): New.
4000 (sbr_vector::set_bb_range): Call grow.
4001 (sbr_vector::get_bb_range): Same.
4002 (sbr_vector::bb_range_p): Remove assert.
4004 2021-11-10 Tamar Christina <tamar.christina@arm.com>
4006 * config/aarch64/aarch64-simd.md (*aarch64_topbits_shuffle<mode>_le
4007 ,*aarch64_topbits_shuffle<mode>_be): Remove.
4009 2021-11-10 Jan Hubicka <jh@suse.cz>
4011 * ipa-modref.c: Include tree-eh.h
4012 (modref_summary::modref_summary): Initialize side_effects.
4013 (struct modref_summary_lto): New bool field side_effects.
4014 (modref_summary_lto::modref_summary_lto): Initialize side_effects.
4015 (modref_summary::dump): Dump side_effects.
4016 (modref_summary_lto::dump): Dump side_effects.
4017 (merge_call_side_effects): Merge side effects.
4018 (process_fnspec): Calls to non-const/pure or looping
4019 function is a side effect.
4020 (analyze_call): Self-recursion is a side-effect; handle
4022 (analyze_load): Watch for volatile and throwing memory.
4023 (analyze_store): Likewise.
4024 (analyze_stmt): Watch for volatitle asm.
4025 (analyze_function): Handle side_effects.
4026 (modref_summaries::duplicate): Duplicate side_effects.
4027 (modref_summaries_lto::duplicate): Likewise.
4028 (modref_write): Stream side_effects.
4029 (read_section): Likewise.
4030 (update_signature): Update.
4031 (propagate_unknown_call): Handle side_effects.
4032 (modref_propagate_in_scc): Likewise.
4033 * ipa-modref.h (struct modref_summary): Add side_effects.
4034 * ipa-pure-const.c (special_builtin_state): Rename to ...
4035 (builtin_safe_for_const_function_p): ... this one.
4036 (check_call): Update.
4037 (finite_function_p): Break out from ...
4038 (propagate_pure_const): ... here
4039 * ipa-utils.h (finite_function): Declare.
4041 2021-11-10 Lucas A. M. Magalhães <lamm@linux.ibm.com>
4043 * config.gcc (powerpc*-*-*): Remove -rpath from
4044 --with-advance-toolchain.
4046 2021-11-10 Marek Polacek <polacek@redhat.com>
4049 * attribs.c (struct scoped_attributes): Add a bool member.
4050 (lookup_scoped_attribute_spec): Forward declare.
4051 (register_scoped_attributes): New bool parameter, defaulted to
4053 (handle_ignored_attributes_option): New function.
4054 (free_attr_data): New function.
4055 (init_attributes): Call handle_ignored_attributes_option.
4056 (attr_namespace_ignored_p): New function.
4057 (decl_attributes): Check attr_namespace_ignored_p before
4059 * attribs.h (free_attr_data): Declare.
4060 (register_scoped_attributes): Adjust declaration.
4061 (handle_ignored_attributes_option): Declare.
4062 (canonicalize_attr_name): New function template.
4063 (canonicalize_attr_name): Use it.
4064 * common.opt (Wattributes=): New option with a variable.
4065 * doc/extend.texi: Document #pragma GCC diagnostic ignored_attributes.
4066 * doc/invoke.texi: Document -Wno-attributes=.
4067 * opts.c (common_handle_option) <case OPT_Wattributes_>: Handle.
4068 * plugin.h (register_scoped_attributes): Adjust declaration.
4069 * toplev.c (compile_file): Call free_attr_data.
4071 2021-11-10 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
4073 * config/arm/arm-cpus.in (cortex-a710): New CPU.
4074 * config/arm/arm-tables.opt: Regenerate.
4075 * config/arm/arm-tune.md: Regenerate.
4076 * doc/invoke.texi: Update docs.
4078 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
4080 * config/aarch64/aarch64-builtins.c
4081 (aarch64_general_gimple_fold_builtin): Mark argument as unused.
4083 2021-11-10 Martin Liska <mliska@suse.cz>
4085 * lto-wrapper.c (merge_and_complain): Make the first argument
4088 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4090 * config/aarch64/iterators.md (optab): Use fmax_nan instead of
4091 smax_nan and fmin_nan instead of smin_nan.
4092 (maxmin_uns): Rename to...
4093 (fmaxmin): ...this and make the same changes. Remove entries
4094 unrelated to fmax* and fmin*.
4095 * config/aarch64/aarch64.md (<maxmin_uns><mode>3): Rename to...
4096 (<fmaxmin><mode>3): ...this.
4097 * config/aarch64/aarch64-simd.md (aarch64_<maxmin_uns>p<mode>):
4099 (aarch64_<optab>p<mode>): ...this.
4100 (<maxmin_uns><mode>3): Rename to...
4101 (<fmaxmin><mode>3): ...this.
4102 (reduc_<maxmin_uns>_scal_<mode>): Rename to...
4103 (reduc_<optab>_scal_<mode>): ...this and update gen* call.
4104 (aarch64_reduc_<maxmin_uns>_internal<mode>): Rename to...
4105 (aarch64_reduc_<optab>_internal<mode>): ...this.
4106 (aarch64_reduc_<maxmin_uns>_internalv2si): Rename to...
4107 (aarch64_reduc_<optab>_internalv2si): ...this.
4108 * config/aarch64/aarch64-sve.md (<maxmin_uns><mode>3): Rename to...
4109 (<fmaxmin><mode>3): ...this.
4110 * config/aarch64/aarch64-simd-builtins.def (smax_nan, smin_nan)
4112 (fmax_nan, fmin_nan): ...this.
4113 * config/aarch64/arm_neon.h (vmax_f32, vmax_f64, vmaxq_f32, vmaxq_f64)
4114 (vmin_f32, vmin_f64, vminq_f32, vminq_f64, vmax_f16, vmaxq_f16)
4115 (vmin_f16, vminq_f16): Update accordingly.
4117 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4119 * tree-vectorizer.h (vector_costs::finish_cost): Take the
4120 corresponding scalar costs as a parameter.
4121 (finish_cost): Likewise.
4122 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost)
4123 (vect_estimate_min_profitable_iters): Update accordingly.
4124 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Likewise.
4125 * tree-vectorizer.c (vector_costs::finish_cost): Likewise.
4126 * config/aarch64/aarch64.c (aarch64_vector_costs::finish_cost):
4128 * config/rs6000/rs6000.c (rs6000_cost_data::finish_cost): Likewise.
4130 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4132 * tree-vectorizer.h (_loop_vec_info::scalar_costs): New member
4134 (_loop_vec_info::single_scalar_iteration_cost): Delete.
4135 (LOOP_VINFO_SINGLE_SCALAR_ITERATION_COST): Delete.
4136 (vector_costs::total_cost): New function.
4137 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
4138 after above changes.
4139 (_loop_vec_info::~_loop_vec_info): Delete scalar_costs.
4140 (vect_compute_single_scalar_iteration_cost): Store the costs
4141 in loop_vinfo->scalar_costs.
4142 (vect_estimate_min_profitable_iters): Get the scalar cost from
4143 loop_vinfo->scalar_costs.
4145 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4147 * tree-vectorizer.h (vector_costs::better_main_loop_than_p)
4148 (vector_costs::better_epilogue_loop_than_p)
4149 (vector_costs::compare_inside_loop_cost)
4150 (vector_costs::compare_outside_loop_cost): Likewise.
4151 * tree-vectorizer.c (vector_costs::better_main_loop_than_p)
4152 (vector_costs::better_epilogue_loop_than_p)
4153 (vector_costs::compare_inside_loop_cost)
4154 (vector_costs::compare_outside_loop_cost): New functions,
4155 containing code moved from...
4156 * tree-vect-loop.c (vect_better_loop_vinfo_p): ...here.
4158 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4160 * tree-vectorizer.h (_loop_vec_info): Remove vec_outside_cost
4161 and vec_inside_cost.
4162 (vector_costs::outside_cost): New function.
4163 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
4165 (vect_estimate_min_profitable_iters): Likewise.
4166 (vect_better_loop_vinfo_p): Get the inside and outside costs
4167 from the loop_vec_infos' vector_costs.
4169 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
4171 * tree-vectorizer.h (vec_info::target_cost_data): Replace with...
4172 (_loop_vec_info::vector_costs): ...this.
4173 (LOOP_VINFO_TARGET_COST_DATA): Delete.
4174 * tree-vectorizer.c (vec_info::vec_info): Remove target_cost_data
4176 (vec_info::~vec_info): Remove corresponding delete.
4177 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Initialize
4178 vector_costs to null.
4179 (_loop_vec_info::~_loop_vec_info): Delete vector_costs.
4180 (vect_analyze_loop_operations): Update after above changes.
4181 (vect_analyze_loop_2): Likewise.
4182 (vect_estimate_min_profitable_iters): Likewise.
4183 * tree-vect-slp.c (vect_slp_analyze_operations): Likewise.
4185 2021-11-10 Jan Hubicka <hubicka@ucw.cz>
4187 * tree-core.h (EAF_DIRECT): Remove.
4188 (EAF_NOCLOBBER): Remove.
4189 (EAF_UNUSED): Remove.
4190 (EAF_NOESCAPE): Remove.
4191 (EAF_NO_DIRECT_CLOBBER): New.
4192 (EAF_NO_INDIRECT_CLOBBER): New.
4193 (EAF_NODIRECTESCAPE): Remove.
4194 (EAF_NO_DIRECT_ESCAPE): New.
4195 (EAF_NO_INDIRECT_ESCAPE): New.
4196 (EAF_NOT_RETURNED): Remove.
4197 (EAF_NOT_RETURNED_INDIRECTLY): New.
4198 (EAF_NOREAD): Remove.
4199 (EAF_NO_DIRECT_READ): New.
4200 (EAF_NO_INDIRECT_READ): New.
4201 * gimple.c (gimple_call_arg_flags): Update for new flags.
4202 (gimple_call_retslot_flags): Update for new flags.
4203 * ipa-modref.c (dump_eaf_flags): Likewise.
4204 (remove_useless_eaf_flags): Likewise.
4205 (deref_flags): Likewise.
4206 (modref_lattice::init): Likewise.
4207 (modref_lattice::merge): Likewise.
4208 (modref_lattice::merge_direct_load): Likewise.
4209 (modref_lattice::merge_direct_store): Likewise.
4210 (modref_eaf_analysis::merge_call_lhs_flags): Likewise.
4211 (callee_to_caller_flags): Likewise.
4212 (modref_eaf_analysis::analyze_ssa_name): Likewise.
4213 (modref_eaf_analysis::propagate): Likewise.
4214 (modref_merge_call_site_flags): Likewise.
4215 * ipa-modref.h (interposable_eaf_flags): Likewise.
4216 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1) Likewise.
4217 * tree-ssa-structalias.c (handle_call_arg): Likewise.
4218 (handle_rhs_call): Likewise.
4219 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Likewise.
4221 2021-11-10 Tamar Christina <tamar.christina@arm.com>
4223 * tree-vect-slp-patterns.c (complex_mul_pattern::matches): Remove l1node.
4225 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
4227 * config/aarch64/aarch64-builtins.c
4228 (aarch64_general_gimple_fold_builtin): Change pointer alignment and
4231 2021-11-10 Jan Hubicka <jh@suse.cz>
4233 * ipa-modref-tree.h (modref_tree::remap_params): Fix off-by-one error.
4235 2021-11-10 H.J. Lu <hongjiu.lu@intel.com>
4236 Hongtao Liu <hongtao.liu@intel.com>
4238 PR middle-end/102566
4239 * match.pd (nop_atomic_bit_test_and_p): New match.
4240 * tree-ssa-ccp.c (convert_atomic_bit_not): New function.
4241 (gimple_nop_atomic_bit_test_and_p): New prototype.
4242 (optimize_atomic_bit_test_and): Transform equivalent, but slighly
4243 different cases to their canonical forms.
4245 2021-11-10 liuhongt <hongtao.liu@intel.com>
4247 PR tree-optimization/103126
4248 * tree-vect-loop.c (neutral_op_for_reduction): Remove static.
4249 * tree-vectorizer.h (neutral_op_for_reduction): Declare.
4250 * tree-if-conv.c : Include tree-vectorizer.h.
4251 (is_cond_scalar_reduction): Handle
4252 BIT_XOR_EXPR/BIT_IOR_EXPR/BIT_AND_EXPR.
4253 (convert_scalar_cond_reduction): Ditto.
4255 2021-11-10 konglin1 <lingling.kong@intel.com>
4257 * config/i386/sse.md (cmul<conj_op><mode>3): add new define_expand.
4258 (cmla<conj_op><mode>4): Likewise
4260 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
4262 * tree-ssa-threadedge.c: Do not include
4263 gimple-ssa-evrp-analyze.h.
4264 * value-pointer-equiv.cc: Same.
4266 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
4268 * tree-ssa-threadbackward.c
4269 (back_threader::maybe_register_path_dump): Abstract path dumping...
4270 (dump_path): ...here.
4271 (back_threader::resolve_phi): Call dump_path.
4274 2021-11-10 konglin1 <lingling.kong@intel.com>
4276 * config/i386/sse.md (fma_<complexpairopname>_<mode>_pair):
4277 Add new define_insn.
4278 (fma_<mode>_fmaddc_bcst): Add new define_insn_and_split.
4279 (fma_<mode>_fcmaddc_bcst): Likewise
4281 2021-11-10 liuhongt <hongtao.liu@intel.com>
4284 * match.pd: Simplify (trunc)fmax/fmin((extend)a, (extend)b) to
4287 2021-11-10 Andrew Pinski <apinski@marvell.com>
4290 * config/aarch64/aarch64.c (aarch64_evpc_ins): Don't use target
4291 as an input, use original one.
4293 2021-11-10 Sandra Loosemore <sandra@codesourcery.com>
4295 * config/nios2/nios2.c (nios2_can_inline_p): New.
4296 (TARGET_CAN_INLINE_P): Define.
4298 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
4300 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin):
4301 Disable gimple fold for RS6000_BIF_{XVMINDP,XVMINSP,VMINFP} and
4302 RS6000_BIF_{XVMAXDP,XVMAXSP,VMAXFP} when fast-math is not set.
4303 (lxvrse_expand_builtin): Modify the expansion for sign extension.
4304 All extensions are done within VSX registers.
4306 2021-11-09 Andrew Pinski <apinski@marvell.com>
4308 PR tree-optimization/10352
4309 * match.pd: Remove check of TYPE_PRECISION for
4312 2021-11-09 Andrew MacLeod <amacleod@redhat.com>
4314 * function.c (allocate_struct_function): Don't set x_range_query.
4315 * function.h (get_range_query): Move to value-query.h.
4316 * gimple-range.cc (enable_ranger): Check that query is currently NULL.
4317 (disable_ranger): Clear function current query field.
4318 * value-query.cc (get_global_range_query): Relocate to:
4319 * value-query.h (get_global_range_query): Here and inline.
4320 (get_range_query): Relocate here from function.h.
4322 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4324 * tree-ssa-threadbackward.c
4325 (back_threader::maybe_register_path_dump): New.
4326 (back_threader::maybe_register_path): Call maybe_register_path_dump.
4328 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4330 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
4331 Return NULL when unprofitable.
4333 2021-11-09 Martin Jambor <mjambor@suse.cz>
4335 * tree.h (build_debug_expr_decl): Declare.
4336 * tree.c (build_debug_expr_decl): New function.
4337 * cfgexpand.c (avoid_deep_ter_for_debug): Use build_debug_expr_decl
4338 instead of building a DEBUG_EXPR_DECL.
4339 * ipa-param-manipulation.c
4340 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
4341 * omp-simd-clone.c (ipa_simd_modify_stmt_ops): Likewise.
4342 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Likewise.
4343 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
4344 * tree-ssa-reassoc.c (make_new_ssa_for_def): Likewise.
4346 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
4348 * config/rs6000/rs6000-call.c (rs6000_type_string): New function.
4349 (def_builtin): Change debug formatting for easier parsing and
4350 include more information.
4351 (rs6000_init_builtins): Add dump of autogenerated builtins.
4352 (altivec_init_builtins): Dump __builtin_altivec_mask_for_load for
4355 2021-11-09 Marek Polacek <polacek@redhat.com>
4357 * ipa-modref.c (analyze_parms): Fix a typo.
4359 2021-11-09 Kito Cheng <kito.cheng@sifive.com>
4362 * common/config/riscv/riscv-common.c (multi_letter_subset_rank): Remove
4363 assertion for Z*-ext.
4365 2021-11-09 Jan Hubicka <hubicka@ucw.cz>
4367 * ipa-modref.c (analyze_parms): Add past_flags, past_retslot_flags
4368 and past_static_chain; merge past summary with current one.
4369 (analyze_function): Update.
4371 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4373 * doc/invoke.texi (Invoking GCC): Document --param=threader-debug.
4375 2021-11-09 Martin Liska <mliska@suse.cz>
4377 * print-rtl.c (rtx_writer::rtx_writer): Make the compilation
4378 conditional based on
4379 * print-rtl.h (class rtx_writer): Likewise.
4381 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
4383 * input.h (location_hash): Use 'BUILTINS_LOCATION' as spare value
4384 for 'Deleted'. Turn into a '#define'.
4386 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4388 * gimple-range-path.cc (path_range_query::dump): Clean up.
4389 (path_range_query::compute_ranges): Same.
4390 * value-relation.cc (path_oracle::dump): Same.
4392 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4394 * dumpfile.c (dump_options): Remove TDF_THREADING entry.
4395 * dumpfile.h (enum dump_flag): Remove TDF_THREADING and adjust
4397 * flag-types.h (enum threader_debug): New.
4398 * gimple-range-path.cc (DEBUG_SOLVER): Use param_threader_debug.
4399 * params.opt: Add entry for --param=threader-debug=.
4401 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
4403 PR middle-end/101204
4405 * diagnostic-spec.h (typedef xint_hash_t): Turn into...
4406 (struct xint_hash_t): ... this.
4407 * doc/gty.texi: Update.
4409 2021-11-09 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
4411 * config/arm/arm-cpus.in (armv9): New define.
4412 (ARMv9a): New group.
4413 (armv9-a): New arch definition.
4414 * config/arm/arm-tables.opt: Regenerate.
4415 * config/arm/arm.h (BASE_ARCH_9A): New arch enum value.
4416 * config/arm/t-aprofile: Added armv9-a and armv9+simd.
4417 * config/arm/t-arm-elf: Added arm9-a, v9_fps and all_v9_archs
4418 to MULTILIB_MATCHES.
4419 * config/arm/t-multilib: Added v9_a_nosimd_variants and
4420 v9_a_simd_variants to MULTILIB_MATCHES.
4421 * doc/invoke.texi: Update docs.
4423 2021-11-09 Martin Jambor <mjambor@suse.cz>
4426 * ipa-param-manipulation.c (replace_with_mapped_expr): Early
4427 return with error_mark_mode when part of expression is mapped to
4429 (ipa_param_body_adjustments::remap_with_debug_expressions): Set
4430 mapped value to NULL if walk_tree returns error_mark_mode.
4432 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
4434 * tree-ssa-strlen.c (compare_nonzero_chars): Pass statement
4436 (get_addr_stridx): Same.
4438 (get_range_strlen_dynamic): Same.
4439 (handle_builtin_strlen): Same.
4440 (handle_builtin_strchr): Same.
4441 (handle_builtin_strcpy): Same.
4442 (maybe_diag_stxncpy_trunc): Same.
4443 (handle_builtin_stxncpy_strncat): Same.
4444 (handle_builtin_memcpy): Same.
4445 (handle_builtin_strcat): Same.
4446 (handle_alloc_call): Same.
4447 (handle_builtin_memset): Same.
4448 (handle_builtin_string_cmp): Same.
4449 (handle_pointer_plus): Same.
4450 (count_nonzero_bytes_addr): Same.
4451 (count_nonzero_bytes): Same.
4452 (handle_store): Same.
4453 (fold_strstr_to_strncmp): Same.
4454 (handle_integral_assign): Same.
4455 (check_and_optimize_stmt): Same.
4456 (class strlen_dom_walker): Replace evrp with ranger.
4457 (strlen_dom_walker::before_dom_children): Remove evrp.
4458 (strlen_dom_walker::after_dom_children): Remove evrp.
4459 * gimple-ssa-warn-access.cc (maybe_check_access_sizes):
4460 Restrict sprintf output.
4462 2021-11-09 Martin Liska <mliska@suse.cz>
4464 * genconditions.c (write_one_condition): Add const qualifier
4467 2021-11-09 Jeff Law <jeffreyalaw@gmail.com>
4469 * match.pd: New pattern to simplify (1 << n) & M ==/!= 0 for M
4472 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
4474 * config/rs6000/rs6000-builtin-new.def (VEC_INIT_V16QI): Use
4475 escape-newline support.
4476 (VEC_INIT_V4SI): Likewise.
4477 (VEC_INIT_V8HI): Likewise.
4478 (PACK_V1TI): Likewise.
4480 (VFIRSTMISMATCHOREOSINDEX_V16QI): Likewise.
4481 (VFIRSTMISMATCHOREOSINDEX_V8HI): Likewise.
4482 (VFIRSTMISMATCHOREOSINDEX_V4SI): Likewise.
4486 (FMAF128_ODD): Likewise.
4487 (VSCEQPUO): Likewise.
4489 (VSIEQPF): Likewise.
4490 (VSTDCQP): Likewise.
4491 (PACK_TD): Likewise.
4492 (TABORTDC): Likewise.
4493 (TABORTDCI): Likewise.
4494 (SE_LXVRBX): Likewise.
4495 (SE_LXVRHX): Likewise.
4496 (SE_LXVRWX): Likewise.
4497 (SE_LXVRDX): Likewise.
4498 (VREPLACE_UN_UV2DI): Likewise.
4499 (VREPLACE_UN_UV4SI): Likewise.
4500 (VREPLACE_UN_V2DI): Likewise.
4501 (VREPLACE_ELT_UV2DI): Likewise.
4502 (VREPLACE_ELT_V2DI): Likewise.
4503 (ZE_LXVRBX): Likewise.
4504 (ZE_LXVRHX): Likewise.
4505 (ZE_LXVRWX): Likewise.
4506 (ZE_LXVRDX): Likewise.
4508 (CNTLZDM): Likewise.
4509 (CNTTZDM): Likewise.
4512 (PMXVBF16GER2): Likewise.
4513 (PMXVBF16GER2_INTERNAL): Likewise.
4514 (PMXVBF16GER2NN): Likewise.
4515 (PMXVBF16GER2NN_INTERNAL): Likewise.
4516 (PMXVBF16GER2NP): Likewise.
4517 (PMXVBF16GER2NP_INTERNAL): Likewise.
4518 (PMXVBF16GER2PN): Likewise.
4519 (PMXVBF16GER2PN_INTERNAL): Likewise.
4520 (PMXVBF16GER2PP): Likewise.
4521 (PMXVBF16GER2PP_INTERNAL): Likewise.
4522 (PMXVF16GER2): Likewise.
4523 (PMXVF16GER2_INTERNAL): Likewise.
4524 (PMXVF16GER2NN): Likewise.
4525 (PMXVF16GER2NN_INTERNAL): Likewise.
4526 (PMXVF16GER2NP): Likewise.
4527 (PMXVF16GER2NP_INTERNAL): Likewise.
4528 (PMXVF16GER2PN): Likewise.
4529 (PMXVF16GER2PN_INTERNAL): Likewise.
4530 (PMXVF16GER2PP): Likewise.
4531 (PMXVF16GER2PP_INTERNAL): Likewise.
4532 (PMXVF32GER_INTERNAL): Likewise.
4533 (PMXVF32GERNN): Likewise.
4534 (PMXVF32GERNN_INTERNAL): Likewise.
4535 (PMXVF32GERNP): Likewise.
4536 (PMXVF32GERNP_INTERNAL): Likewise.
4537 (PMXVF32GERPN): Likewise.
4538 (PMXVF32GERPN_INTERNAL): Likewise.
4539 (PMXVF32GERPP): Likewise.
4540 (PMXVF32GERPP_INTERNAL): Likewise.
4541 (PMXVF64GER): Likewise.
4542 (PMXVF64GER_INTERNAL): Likewise.
4543 (PMXVF64GERNN): Likewise.
4544 (PMXVF64GERNN_INTERNAL): Likewise.
4545 (PMXVF64GERNP): Likewise.
4546 (PMXVF64GERNP_INTERNAL): Likewise.
4547 (PMXVF64GERPN): Likewise.
4548 (PMXVF64GERPN_INTERNAL): Likewise.
4549 (PMXVF64GERPP): Likewise.
4550 (PMXVF64GERPP_INTERNAL): Likewise.
4551 (PMXVI16GER2): Likewise.
4552 (PMXVI16GER2_INTERNAL): Likewise.
4553 (PMXVI16GER2PP): Likewise.
4554 (PMXVI16GER2PP_INTERNAL): Likewise.
4555 (PMXVI16GER2S): Likewise.
4556 (PMXVI16GER2S_INTERNAL): Likewise.
4557 (PMXVI16GER2SPP): Likewise.
4558 (PMXVI16GER2SPP_INTERNAL): Likewise.
4559 (PMXVI4GER8): Likewise.
4560 (PMXVI4GER8_INTERNAL): Likewise.
4561 (PMXVI4GER8PP): Likewise.
4562 (PMXVI4GER8PP_INTERNAL): Likewise.
4563 (PMXVI8GER4): Likewise.
4564 (PMXVI8GER4_INTERNAL): Likewise.
4565 (PMXVI8GER4PP): Likewise.
4566 (PMXVI8GER4PP_INTERNAL): Likewise.
4567 (PMXVI8GER4SPP): Likewise.
4568 (PMXVI8GER4SPP_INTERNAL): Likewise.
4569 * config/rs6000/rs6000-gen-builtins.c (MAXLINES): New macro.
4570 (linebuf): Increase size.
4571 (lines): New variable.
4572 (lastline): Likewise.
4573 (real_line_pos): New function.
4574 (diag): Change signature.
4575 (bif_diag): Change signature; support escape-newline handling.
4576 (ovld_diag): Likewise.
4577 (fatal): Move earlier.
4578 (consume_whitespace): Adjust diag call.
4579 (advance_line): Add escape-newline handling; call fatal.
4580 (safe_inc_pos): Adjust diag call.
4581 (match_identifier): Likewise.
4582 (match_integer): Likewise.
4583 (match_to_right_bracket): Call fatal instead of diag; adjust diag
4585 (match_basetype): Adjust diag calls.
4586 (match_bracketed_pair): Likewise.
4587 (match_const_restriction): Likewise.
4588 (match_type): Likewise.
4589 (parse_args): Likewise.
4590 (parse_bif_attrs): Likewise.
4591 (complete_vector_type): Likewise.
4592 (complete_base_type): Likewise.
4593 (parse_prototype): Likewise.
4594 (parse_bif_entry): Likewise.
4595 (parse_bif_stanza): Likewise.
4596 (parse_ovld_entry): Likewise.
4597 (parse_ovld_stanza): Likewise.
4598 (main): Allocate buffers for lines[].
4600 2021-11-08 Eric Botcazou <ebotcazou@adacore.com>
4602 * config/i386/i386.c (legitimize_pic_address): Adjust comment and
4603 use the REG argument on the CM_LARGE_PIC code path as well.
4604 * config/i386/predicates.md (gotoff_operand): Do not treat VxWorks
4605 specially with the large code models.
4607 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4609 * gimple.c (gimple_call_static_chain_flags): Revert the workaround
4610 allowing interposition since issues with binds_to_local_def were
4613 2021-11-08 Andrew MacLeod <amacleod@redhat.com>
4615 PR tree-optimization/103122
4616 * gimple-range.cc (gimple_ranger::range_of_expr): Request the cache
4617 entry with "calulate new values" set to false.
4619 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4621 * builtins.c (is_simple_builtin): Add builitin_dwarf_cfa
4622 and builtin_return_address.
4623 (builtin_fnspec): Annotate builtin_return,
4624 bulitin_eh_pointer, builtin_eh_filter, builtin_unwind_resume,
4625 builtin_cxa_end_cleanup, builtin_eh_copy_values,
4626 builtin_frame_address, builtin_apply_args,
4627 builtin_asan_before_dynamic_init, builtin_asan_after_dynamic_init,
4628 builtin_prefetch, builtin_dwarf_cfa, builtin_return_addrss
4630 * ipa-pure-const.c (special_builtin_state): Add builtin_dwarf_cfa
4631 and builtin_return_address.
4633 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4635 PR tree-optimization/103177
4636 * passes.def: Move uncprop after pure/const and modref.
4638 2021-11-08 Martin Jambor <mjambor@suse.cz>
4642 * tree-inline.c (remap_gimple_stmt): Unshare the expression without
4643 location before invoking remap_with_debug_expressions on it.
4644 * ipa-param-manipulation.c
4645 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
4647 2021-11-08 David Edelsohn <dje.gcc@gmail.com>
4649 * config/rs6000/vsx.md (vsx_splat_v4si_di): Revert "wa"
4652 2021-11-08 Richard Biener <rguenther@suse.de>
4654 * cfgloop.c (verify_loop_structure): Use a temporary BB flag
4655 instead of an sbitmap to cache irreducible state.
4657 2021-11-08 Aldy Hernandez <aldyh@redhat.com>
4659 PR tree-optimization/103120
4660 * gimple-range-path.cc (path_range_query::range_defined_in_block):
4661 Bail if there's a cache entry.
4663 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
4665 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Use
4666 rs6000_builtin_decls_x when appropriate.
4667 (add_condition_to_bb): Likewise.
4668 (rs6000_atomic_assign_expand_fenv): Likewise.
4670 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
4672 * config/rs6000/rs6000-call.c (rs6000_new_builtin_decl): New function.
4673 (rs6000_builtin_decl): Call it.
4675 2021-11-08 Martin Liska <mliska@suse.cz>
4677 * cgraph.c (cgraph_node::dump): Dump it from decl.
4679 2021-11-08 Martin Liska <mliska@suse.cz>
4681 PR gcov-profile/100520
4682 * coverage.c (coverage_compute_profile_id): Strip .gk when
4683 compare debug is used.
4684 * system.h (endswith): New function.
4686 2021-11-08 Martin Liska <mliska@suse.cz>
4688 * cgraph.c (cgraph_node::dump): Dump static_chain_decl.
4690 2021-11-08 Thomas Schwinge <thomas@codesourcery.com>
4692 * config/rs6000/rbtree.c: Fix 'Copyright (C) 2020-21' into '2020-2021'
4693 * config/rs6000/rbtree.h: Likewise.
4694 * config/rs6000/rs6000-builtin-new.def: Likewise.
4695 * config/rs6000/rs6000-gen-builtins.c: Likewise.
4696 * config/rs6000/rs6000-overload.def: Likewise.
4698 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
4700 * config/aarch64/aarch64.c (aarch64_ldn_stn_vectors): New function.
4701 (aarch64_address_cost): Use it instead of testing for CImode and
4704 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
4706 * genmodes.c (emit_insn_modes_h): Define NUM_MODE_* macros.
4707 * expmed.h (NUM_MODE_INT): Delete in favor of genmodes definitions.
4708 (NUM_MODE_PARTIAL_INT, NUM_MODE_VECTOR_INT): Likewise.
4709 * real.h (real_format_for_mode): Use NUM_MODE_FLOAT and
4710 NUM_MODE_DECIMAL_FLOAT.
4711 (REAL_MODE_FORMAT): Likewise.
4713 2021-11-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
4715 * tree-vect-loop.c (vect_better_loop_vinfo_p): Change how epilogue loop
4718 2021-11-08 Richard Biener <rguenther@suse.de>
4720 * tree-vectorizer.h (vect_create_loop_vinfo): Add main_loop_info
4722 * tree-vect-loop.c (vect_create_loop_vinfo): Likewise. Set
4723 LOOP_VINFO_ORIG_LOOP_INFO and conditionalize set of
4724 LOOP_VINFO_NITERS_ASSUMPTIONS.
4725 (vect_analyze_loop_1): Adjust.
4726 (vect_analyze_loop): Move loop constraint setting and
4727 SCEV/niter reset here from vect_create_loop_vinfo to perform
4729 (vect_analyze_loop_form): Move dumping of symbolic niters
4730 here from vect_create_loop_vinfo.
4732 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4734 * ipa-modref-tree.h (enum modref_special_parms): New enum.
4735 (struct modref_access_node): update for special parms.
4736 (struct modref_ref_node): Likewise.
4737 (struct modref_parm_map): Likewise.
4738 (struct modref_tree): Likewise.
4739 * ipa-modref.c (dump_access): Likewise.
4740 (get_access): Detect static chain.
4741 (parm_map_for_arg): Take tree as arg instead of
4743 (merge_call_side_effects): Compute map for static chain.
4744 (process_fnspec): Update.
4745 (struct escape_point): Remove retslot_arg and static_chain_arg.
4746 (analyze_parms): Update.
4747 (compute_parm_map): Update.
4748 (propagate_unknown_call): Update.
4749 (modref_propagate_in_scc): Update.
4750 (modref_merge_call_site_flags): Update.
4751 (ipa_merge_modref_summary_after_inlining): Update.
4752 * tree-ssa-alias.c (modref_may_conflict): Handle static chain.
4753 * ipa-modref-tree.c (test_merge): Update.
4755 2021-11-08 Haochen Gui <guihaoc@gcc.gnu.org>
4757 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin): Disable
4758 gimple fold for VSX_BUILTIN_XVMINDP, ALTIVEC_BUILTIN_VMINFP,
4759 VSX_BUILTIN_XVMAXDP, ALTIVEC_BUILTIN_VMAXFP when fast-math is not
4762 2021-11-08 liuhongt <hongtao.liu@intel.com>
4764 PR tree-optimization/103077
4765 * doc/invoke.texi (Options That Control Optimization):
4766 Update documentation for -ftree-loop-vectorize and
4767 -ftree-slp-vectorize which are enabled by default at -02.
4769 2021-11-08 liuhongt <hongtao.liu@intel.com>
4772 * match.pd (Simplifcation (trunc)copysign((extend)a, (extend)b)
4773 to .COPYSIGN (a, b)): Add !HONOR_SNANS.
4775 2021-11-08 liuhongt <hongtao.liu@intel.com>
4778 * match.pd: Simplify
4779 (trunc)fma ((extend)a, (extend)b, (extend)c) to IFN_FMA (a, b,
4780 c) under flag_unsafe_math_optimizations.
4782 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4785 * params.opt (modref-max-depth): Add range.
4786 (modref-max-adjustments): Fix range.
4788 2021-11-07 Aldy Hernandez <aldyh@redhat.com>
4790 * tree-pass.h (make_pass_vrp_threader): Remove.
4791 * tree-ssa-threadbackward.c
4792 (back_threader_profitability::profitable_path_p): Remove
4793 ASSERT_EXPR references.
4794 * tree-ssa-threadedge.c (jt_state::register_equivs_stmt): Same.
4795 * tree-vrp.c (vrp_folder::simplify_casted_conds): Same.
4796 (execute_vrp): Same.
4797 (class hybrid_threader): Remove.
4798 (hybrid_threader::hybrid_threader): Remove.
4799 (hybrid_threader::~hybrid_threader): Remove.
4800 (hybrid_threader::before_dom_children): Remove.
4801 (hybrid_threader::after_dom_children): Remove.
4802 (execute_vrp_threader): Remove.
4803 (class pass_vrp_threader): Remove.
4804 (make_pass_vrp_threader): Remove.
4806 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4808 * gimple.c (gimple_call_arg_flags): Use interposable_eaf_flags.
4809 (gimple_call_retslot_flags): Likewise.
4810 (gimple_call_static_chain_flags): Likewise.
4811 * ipa-modref.c (remove_useless_eaf_flags): Do not remove everything for
4813 (modref_summary::useful_p): Likewise.
4814 (modref_summary_lto::useful_p): Likewise.
4815 (analyze_parms): Do not give up on NOVOPS.
4816 (analyze_function): When dumping report chnages in EAF flags
4817 between IPA and local pass.
4818 (modref_merge_call_site_flags): Compute implicit eaf flags
4819 based on callee ecf_flags and fnspec; if the function does not
4820 bind to current defs use interposable_eaf_flags.
4821 (modref_propagate_flags_in_scc): Update.
4822 * ipa-modref.h (interposable_eaf_flags): New function.
4824 2021-11-07 Bill Schmidt <wschmidt@linux.ibm.com>
4826 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): New
4828 (rs6000_invalid_new_builtin): New function.
4829 (rs6000_expand_builtin): Call rs6000_expand_new_builtin.
4830 (rs6000_expand_ldst_mask): New function.
4831 (new_cpu_expand_builtin): Likewise.
4832 (elemrev_icode): Likewise.
4833 (ldv_expand_builtin): Likewise.
4834 (lxvrse_expand_builtin): Likewise.
4835 (lxvrze_expand_builtin): Likewise.
4836 (stv_expand_builtin): Likewise.
4837 (new_mma_expand_builtin): Likewise.
4838 (new_htm_spr_num): Likewise.
4839 (new_htm_expand_builtin): Likewise.
4840 (rs6000_expand_new_builtin): Likewise.
4841 (rs6000_init_builtins): Initialize altivec_builtin_mask_for_load.
4843 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4845 * ipa-modref.c (modref_lattice): Add do_dataflow,
4846 changed and propagate_to fields.
4847 (modref_lattice::release): Free propagate_to
4848 (modref_lattice::merge): Do not give up early on unknown
4850 (modref_lattice::merge_deref): Likewise.
4851 (modref_eaf_analysis): Update toplevel comment.
4852 (modref_eaf_analysis::analyze_ssa_name): Record postponned ssa names;
4853 do optimistic dataflow initialization.
4854 (modref_eaf_analysis::merge_with_ssa_name): Build dataflow graph.
4855 (modref_eaf_analysis::propagate): New member function.
4856 (analyze_parms): Update to new API of modref_eaf_analysis.
4858 2021-11-06 Jan Hubicka <hubicka@ucw.cz>
4860 * cgraph.h (cgraph_node::can_be_discarded_p): Do not
4861 return true on functions from other partition.
4863 2021-11-06 David Edelsohn <dje.gcc@gmail.com>
4865 * config/rs6000/vsx.md (vsx_splat_v4si): Change constraints to "wa".
4866 (vsx_splat_v4si_di): Change constraint to "wa".
4868 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
4870 PR tree-optimization/103061
4871 * value-relation.cc (path_oracle::path_oracle): Initialize
4873 (path_oracle::killing_def): Set m_killed_defs.
4874 (path_oracle::query_relation): Do not look at the root oracle for
4876 * value-relation.h (class path_oracle): Add m_killed_defs.
4878 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
4880 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
4881 Remove gotos and other cleanups.
4883 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
4886 * ipa-modref-tree.h (modref_tree::insert): Do nothing for
4887 paradoxical and zero sized accesses.
4889 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
4892 * ipa-modref-tree.h (struct modref_access_node): Avoid left shift
4895 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
4897 * config/darwin.h (ASM_GENERATE_INTERNAL_LABEL): Add LTRAMP
4898 to the list of symbol prefixes that must be made linker-
4901 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
4903 * config.host: Add support for aarch64-*-darwin.
4904 * config/aarch64/host-aarch64-darwin.c: New file.
4905 * config/aarch64/x-darwin: New file.
4907 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
4909 PR tree-optimization/103093
4910 * gimple-range-gori.cc (range_def_chain::get_imports): Remove assert.
4912 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
4914 PR tree-optimization/102943
4915 * gimple-range-cache.cc (class update_list): New.
4916 (update_list::add): Replace add_to_update.
4917 (update_list::pop): New.
4918 (ranger_cache::ranger_cache): Adjust.
4919 (ranger_cache::~ranger_cache): Adjust.
4920 (ranger_cache::add_to_update): Delete.
4921 (ranger_cache::propagate_cache): Adjust to new class.
4922 (ranger_cache::propagate_updated_value): Ditto.
4923 (ranger_cache::fill_block_cache): Ditto.
4924 * gimple-range-cache.h (class ranger_cache): Adjust to update class.
4926 2021-11-05 Richard Biener <rguenther@suse.de>
4928 * tree-vect-loop.c (vect_analyze_loop): Remove obsolete
4929 comment and expand on another one. Combine nested if.
4931 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
4934 * config/pa/pa-modes.def: Add OImode integer type.
4935 * config/pa/pa.c (pa_scalar_mode_supported_p): Allow TImode
4937 * config/pa/pa.h (MIN_UNITS_PER_WORD) Define to MIN_UNITS_PER_WORD
4938 to UNITS_PER_WORD if IN_LIBGCC2.
4939 * config/pa/pa.md (addti3, addvti3, subti3, subvti3, negti2,
4940 negvti2, ashlti3, shrpd_internal): New patterns.
4941 Change some multi instruction types to multi.
4943 2021-11-05 Jakub Jelinek <jakub@redhat.com>
4946 * config/i386/i386.h
4947 (stringop_algs::stringop_strategy::stringop_strategy): Make the ctor
4950 2021-11-05 Wilco Dijkstra <wdijkstr@arm.com>
4953 * config/aarch64/aarch64.c (aarch64_mov_operand_p): Strip the salt
4955 * config/aarch64/constraints.md: Support const in Usw.
4957 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
4959 * config/pa/pa.h (PREFERRED_DEBUGGING_TYPE): Define to DWARF2_DEBUG.
4960 * config/pa/pa64-hpux.h (PREFERRED_DEBUGGING_TYPE): Remove define.
4962 2021-11-05 Richard Biener <rguenther@suse.de>
4964 * tree-vectorizer.h (vec_info_shared::n_stmts): Add.
4965 (LOOP_VINFO_N_STMTS): Likewise.
4966 (vec_info_for_bb): Remove unused function.
4967 * tree-vectorizer.c (vec_info_shared::vec_info_shared):
4968 Initialize n_stmts member.
4969 * tree-vect-loop.c: Remove INCLUDE_FUNCTIONAL.
4970 (vect_create_loop_vinfo): Do not set loop->aux.
4971 (vect_analyze_loop_2): Do not get n_stmts as argument,
4972 instead use LOOP_VINFO_N_STMTS. Set LOOP_VINFO_VECTORIZABLE_P
4974 (vect_analyze_loop_1): Remove callback, get the mode iterator
4975 and autodetected_vector_mode as argument, advancing the
4976 iterator and initializing autodetected_vector_mode here.
4977 (vect_analyze_loop): Split analysis loop into two, first
4978 processing main loops only and then epilogues.
4980 2021-11-05 Martin Jambor <mjambor@suse.cz>
4982 * ipa-prop.c (compute_complex_assign_jump_func): Remove
4983 unnecessary check for RECORD_TYPE.
4985 2021-11-05 Gerald Pfeifer <gerald@pfeifer.com>
4987 * doc/install.texi2html: Do not generate old.html any longer.
4989 2021-11-05 Martin Liska <mliska@suse.cz>
4992 * opts.c (finish_options): Reset flag_gtoggle when it is used.
4994 2021-11-05 Jakub Jelinek <jakub@redhat.com>
4997 * dwarf2out.c (add_const_value_attribute): Add MODE argument, use it
4998 in CONST_WIDE_INT handling. Adjust recursive calls.
4999 (add_location_or_const_value_attribute): Pass DECL_MODE (decl) to
5000 new add_const_value_attribute argument.
5001 (tree_add_const_value_attribute): Pass TYPE_MODE (type) to new
5002 add_const_value_attribute argument.
5004 2021-11-05 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
5006 * config/vx-common.h: Test value of TARGET_VXWORKS7 rather
5009 2021-11-05 Richard Biener <rguenther@suse.de>
5011 * tree-vectorizer.h (struct vect_loop_form_info): New.
5012 (vect_analyze_loop_form): Adjust.
5013 (vect_create_loop_vinfo): New.
5014 * tree-parloops.c (gather_scalar_reductions): Adjust for
5015 vect_analyze_loop_form API change.
5016 * tree-vect-loop.c: Include <functional>.
5017 (vect_analyze_loop_form_1): Rename to vect_analyze_loop_form,
5018 take struct vect_loop_form_info as output parameter and adjust.
5019 (vect_analyze_loop_form): Rename to vect_create_loop_vinfo and
5020 split out call to the original vect_analyze_loop_form_1.
5021 (vect_reanalyze_as_main_loop): Rename to...
5022 (vect_analyze_loop_1): ... this, factor out the call to
5023 vect_analyze_loop_form and generalize to be able to use it twice ...
5024 (vect_analyze_loop): ... here. Perform vect_analyze_loop_form
5027 2021-11-05 Xionghu Luo <luoxhu@linux.ibm.com>
5030 * config/rs6000/fusion.md: Regenerate.
5031 * config/rs6000/genfusion.pl: Fix incorrect clobber constraint.
5033 2021-11-04 Andreas Krebbel <krebbel@linux.ibm.com>
5035 * config/s390/s390.h (STACK_CHECK_MOVING_SP): New macro
5038 2021-11-04 Tamar Christina <tamar.christina@arm.com>
5040 * config/aarch64/aarch64-builtins.c
5041 (aarch64_general_gimple_fold_builtin): Add ashl, sshl, ushl, ashr,
5042 ashr_simd, lshr, lshr_simd.
5043 * config/aarch64/aarch64-simd-builtins.def (lshr): Use USHIFTIMM.
5044 * config/aarch64/arm_neon.h (vshr_n_u8, vshr_n_u16, vshr_n_u32,
5045 vshrq_n_u8, vshrq_n_u16, vshrq_n_u32, vshrq_n_u64): Fix type hack.
5047 2021-11-04 Tamar Christina <tamar.christina@arm.com>
5049 * match.pd: New negate+shift pattern.
5051 2021-11-04 Andrew MacLeod <amacleod@redhat.com>
5053 PR tree-optimization/103079
5054 * gimple-range-gori.cc (gimple_range_calc_op1): Treat undefined as
5056 (gimple_range_calc_op2): Ditto.
5058 2021-11-04 Martin Jambor <mjambor@suse.cz>
5061 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
5062 members remap_with_debug_expressions, m_dead_ssa_debug_equiv,
5063 m_dead_stmt_debug_equiv and prepare_debug_expressions. Added
5064 parameter to mark_dead_statements.
5065 * ipa-param-manipulation.c: Include tree-phinodes.h and cfgexpand.h.
5066 (ipa_param_body_adjustments::mark_dead_statements): New parameter
5067 debugstack, push into it all SSA names used in debug statements,
5068 produce m_dead_ssa_debug_equiv mapping for the removed param.
5069 (replace_with_mapped_expr): New function.
5070 (ipa_param_body_adjustments::remap_with_debug_expressions): Likewise.
5071 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
5072 (ipa_param_body_adjustments::common_initialization): Gather and
5073 procecc SSA which will be removed but are in debug statements. Simplify.
5074 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
5076 * tree-inline.c (remap_gimple_stmt): Create a debug bind when possible
5077 when avoiding a copy of an unnecessary statement. Remap removed SSA
5078 names in existing debug statements.
5079 (tree_function_versioning): Do not create DEBUG_EXPR_DECL for removed
5080 parameters if we have already done so.
5082 2021-11-04 Jan Hubicka <hubicka@ucw.cz>
5085 * gimple.c (gimple_call_static_chain_flags): Handle case when
5086 nested function does not bind locally.
5088 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5090 * config/aarch64/aarch64.c (aarch64_function_value): Generate
5091 a register rtx for Neon vector-tuple modes.
5092 (aarch64_layout_arg): Likewise.
5094 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5096 * lower-subreg.c (simple_move): Prevent decomposition if
5097 modes are not tieable.
5099 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5100 Richard Sandiford <richard.sandiford@arm.com>
5102 * config/aarch64/aarch64-builtins.c (v2x8qi_UP): Define.
5103 (v2x4hi_UP): Likewise.
5104 (v2x4hf_UP): Likewise.
5105 (v2x4bf_UP): Likewise.
5106 (v2x2si_UP): Likewise.
5107 (v2x2sf_UP): Likewise.
5108 (v2x1di_UP): Likewise.
5109 (v2x1df_UP): Likewise.
5110 (v2x16qi_UP): Likewise.
5111 (v2x8hi_UP): Likewise.
5112 (v2x8hf_UP): Likewise.
5113 (v2x8bf_UP): Likewise.
5114 (v2x4si_UP): Likewise.
5115 (v2x4sf_UP): Likewise.
5116 (v2x2di_UP): Likewise.
5117 (v2x2df_UP): Likewise.
5118 (v3x8qi_UP): Likewise.
5119 (v3x4hi_UP): Likewise.
5120 (v3x4hf_UP): Likewise.
5121 (v3x4bf_UP): Likewise.
5122 (v3x2si_UP): Likewise.
5123 (v3x2sf_UP): Likewise.
5124 (v3x1di_UP): Likewise.
5125 (v3x1df_UP): Likewise.
5126 (v3x16qi_UP): Likewise.
5127 (v3x8hi_UP): Likewise.
5128 (v3x8hf_UP): Likewise.
5129 (v3x8bf_UP): Likewise.
5130 (v3x4si_UP): Likewise.
5131 (v3x4sf_UP): Likewise.
5132 (v3x2di_UP): Likewise.
5133 (v3x2df_UP): Likewise.
5134 (v4x8qi_UP): Likewise.
5135 (v4x4hi_UP): Likewise.
5136 (v4x4hf_UP): Likewise.
5137 (v4x4bf_UP): Likewise.
5138 (v4x2si_UP): Likewise.
5139 (v4x2sf_UP): Likewise.
5140 (v4x1di_UP): Likewise.
5141 (v4x1df_UP): Likewise.
5142 (v4x16qi_UP): Likewise.
5143 (v4x8hi_UP): Likewise.
5144 (v4x8hf_UP): Likewise.
5145 (v4x8bf_UP): Likewise.
5146 (v4x4si_UP): Likewise.
5147 (v4x4sf_UP): Likewise.
5148 (v4x2di_UP): Likewise.
5149 (v4x2df_UP): Likewise.
5150 (TYPES_GETREGP): Delete.
5151 (TYPES_SETREGP): Likewise.
5152 (TYPES_LOADSTRUCT_U): Define.
5153 (TYPES_LOADSTRUCT_P): Likewise.
5154 (TYPES_LOADSTRUCT_LANE_U): Likewise.
5155 (TYPES_LOADSTRUCT_LANE_P): Likewise.
5156 (TYPES_STORE1P): Move for consistency.
5157 (TYPES_STORESTRUCT_U): Define.
5158 (TYPES_STORESTRUCT_P): Likewise.
5159 (TYPES_STORESTRUCT_LANE_U): Likewise.
5160 (TYPES_STORESTRUCT_LANE_P): Likewise.
5161 (aarch64_simd_tuple_types): Define.
5162 (aarch64_lookup_simd_builtin_type): Handle tuple type lookup.
5163 (aarch64_init_simd_builtin_functions): Update frontend lookup
5164 for builtin functions after handling arm_neon.h pragma.
5165 (register_tuple_type): Manually set modes of single-integer
5166 tuple types. Record tuple types.
5167 * config/aarch64/aarch64-modes.def
5168 (ADV_SIMD_D_REG_STRUCT_MODES): Define D-register tuple modes.
5169 (ADV_SIMD_Q_REG_STRUCT_MODES): Define Q-register tuple modes.
5170 (SVE_MODES): Give single-vector modes priority over vector-
5172 (VECTOR_MODES_WITH_PREFIX): Set partial-vector mode order to
5173 be after all single-vector modes.
5174 * config/aarch64/aarch64-simd-builtins.def: Update builtin
5175 generator macros to reflect modifications to the backend
5177 * config/aarch64/aarch64-simd.md (aarch64_simd_ld2<mode>):
5178 Use vector-tuple mode iterator and rename to...
5179 (aarch64_simd_ld2<vstruct_elt>): This.
5180 (aarch64_simd_ld2r<mode>): Use vector-tuple mode iterator and
5182 (aarch64_simd_ld2r<vstruct_elt>): This.
5183 (aarch64_vec_load_lanesoi_lane<mode>): Use vector-tuple mode
5184 iterator and rename to...
5185 (aarch64_vec_load_lanes<mode>_lane<vstruct_elt>): This.
5186 (vec_load_lanesoi<mode>): Use vector-tuple mode iterator and
5188 (vec_load_lanes<mode><vstruct_elt>): This.
5189 (aarch64_simd_st2<mode>): Use vector-tuple mode iterator and
5191 (aarch64_simd_st2<vstruct_elt>): This.
5192 (aarch64_vec_store_lanesoi_lane<mode>): Use vector-tuple mode
5193 iterator and rename to...
5194 (aarch64_vec_store_lanes<mode>_lane<vstruct_elt>): This.
5195 (vec_store_lanesoi<mode>): Use vector-tuple mode iterator and
5197 (vec_store_lanes<mode><vstruct_elt>): This.
5198 (aarch64_simd_ld3<mode>): Use vector-tuple mode iterator and
5200 (aarch64_simd_ld3<vstruct_elt>): This.
5201 (aarch64_simd_ld3r<mode>): Use vector-tuple mode iterator and
5203 (aarch64_simd_ld3r<vstruct_elt>): This.
5204 (aarch64_vec_load_lanesci_lane<mode>): Use vector-tuple mode
5205 iterator and rename to...
5206 (vec_load_lanesci<mode>): This.
5207 (aarch64_simd_st3<mode>): Use vector-tuple mode iterator and
5209 (aarch64_simd_st3<vstruct_elt>): This.
5210 (aarch64_vec_store_lanesci_lane<mode>): Use vector-tuple mode
5211 iterator and rename to...
5212 (vec_store_lanesci<mode>): This.
5213 (aarch64_simd_ld4<mode>): Use vector-tuple mode iterator and
5215 (aarch64_simd_ld4<vstruct_elt>): This.
5216 (aarch64_simd_ld4r<mode>): Use vector-tuple mode iterator and
5218 (aarch64_simd_ld4r<vstruct_elt>): This.
5219 (aarch64_vec_load_lanesxi_lane<mode>): Use vector-tuple mode
5220 iterator and rename to...
5221 (vec_load_lanesxi<mode>): This.
5222 (aarch64_simd_st4<mode>): Use vector-tuple mode iterator and
5224 (aarch64_simd_st4<vstruct_elt>): This.
5225 (aarch64_vec_store_lanesxi_lane<mode>): Use vector-tuple mode
5226 iterator and rename to...
5227 (vec_store_lanesxi<mode>): This.
5228 (mov<mode>): Define for Neon vector-tuple modes.
5229 (aarch64_ld1x3<VALLDIF:mode>): Use vector-tuple mode iterator
5231 (aarch64_ld1x3<vstruct_elt>): This.
5232 (aarch64_ld1_x3_<mode>): Use vector-tuple mode iterator and
5234 (aarch64_ld1_x3_<vstruct_elt>): This.
5235 (aarch64_ld1x4<VALLDIF:mode>): Use vector-tuple mode iterator
5237 (aarch64_ld1x4<vstruct_elt>): This.
5238 (aarch64_ld1_x4_<mode>): Use vector-tuple mode iterator and
5240 (aarch64_ld1_x4_<vstruct_elt>): This.
5241 (aarch64_st1x2<VALLDIF:mode>): Use vector-tuple mode iterator
5243 (aarch64_st1x2<vstruct_elt>): This.
5244 (aarch64_st1_x2_<mode>): Use vector-tuple mode iterator and
5246 (aarch64_st1_x2_<vstruct_elt>): This.
5247 (aarch64_st1x3<VALLDIF:mode>): Use vector-tuple mode iterator
5249 (aarch64_st1x3<vstruct_elt>): This.
5250 (aarch64_st1_x3_<mode>): Use vector-tuple mode iterator and
5252 (aarch64_st1_x3_<vstruct_elt>): This.
5253 (aarch64_st1x4<VALLDIF:mode>): Use vector-tuple mode iterator
5255 (aarch64_st1x4<vstruct_elt>): This.
5256 (aarch64_st1_x4_<mode>): Use vector-tuple mode iterator and
5258 (aarch64_st1_x4_<vstruct_elt>): This.
5259 (*aarch64_mov<mode>): Define for vector-tuple modes.
5260 (*aarch64_be_mov<mode>): Likewise.
5261 (aarch64_ld<VSTRUCT:nregs>r<VALLDIF:mode>): Use vector-tuple
5262 mode iterator and rename to...
5263 (aarch64_ld<nregs>r<vstruct_elt>): This.
5264 (aarch64_ld2<mode>_dreg): Use vector-tuple mode iterator and
5266 (aarch64_ld2<vstruct_elt>_dreg): This.
5267 (aarch64_ld3<mode>_dreg): Use vector-tuple mode iterator and
5269 (aarch64_ld3<vstruct_elt>_dreg): This.
5270 (aarch64_ld4<mode>_dreg): Use vector-tuple mode iterator and
5272 (aarch64_ld4<vstruct_elt>_dreg): This.
5273 (aarch64_ld<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
5274 iterator and rename to...
5275 (aarch64_ld<nregs><vstruct_elt>): Use vector-tuple mode
5276 iterator and rename to...
5277 (aarch64_ld<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
5278 (aarch64_ld1x2<VQ:mode>): Delete.
5279 (aarch64_ld1x2<VDC:mode>): Use vector-tuple mode iterator and
5281 (aarch64_ld1x2<vstruct_elt>): This.
5282 (aarch64_ld<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
5283 tuple mode iterator and rename to...
5284 (aarch64_ld<nregs>_lane<vstruct_elt>): This.
5285 (aarch64_get_dreg<VSTRUCT:mode><VDC:mode>): Delete.
5286 (aarch64_get_qreg<VSTRUCT:mode><VQ:mode>): Likewise.
5287 (aarch64_st2<mode>_dreg): Use vector-tuple mode iterator and
5289 (aarch64_st2<vstruct_elt>_dreg): This.
5290 (aarch64_st3<mode>_dreg): Use vector-tuple mode iterator and
5292 (aarch64_st3<vstruct_elt>_dreg): This.
5293 (aarch64_st4<mode>_dreg): Use vector-tuple mode iterator and
5295 (aarch64_st4<vstruct_elt>_dreg): This.
5296 (aarch64_st<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
5297 iterator and rename to...
5298 (aarch64_st<nregs><vstruct_elt>): This.
5299 (aarch64_st<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
5300 iterator and rename to aarch64_st<nregs><vstruct_elt>.
5301 (aarch64_st<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
5302 tuple mode iterator and rename to...
5303 (aarch64_st<nregs>_lane<vstruct_elt>): This.
5304 (aarch64_set_qreg<VSTRUCT:mode><VQ:mode>): Delete.
5305 (aarch64_simd_ld1<mode>_x2): Use vector-tuple mode iterator
5307 (aarch64_simd_ld1<vstruct_elt>_x2): This.
5308 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
5309 Refactor to include new vector-tuple modes.
5310 (aarch64_classify_vector_mode): Add cases for new vector-
5312 (aarch64_advsimd_partial_struct_mode_p): Define.
5313 (aarch64_advsimd_full_struct_mode_p): Likewise.
5314 (aarch64_advsimd_vector_array_mode): Likewise.
5315 (aarch64_sve_data_mode): Change location in file.
5316 (aarch64_array_mode): Handle case of Neon vector-tuple modes.
5317 (aarch64_hard_regno_nregs): Handle case of partial Neon
5319 (aarch64_classify_address): Refactor to include handling of
5320 Neon vector-tuple modes.
5321 (aarch64_print_operand): Print "d" for "%R" for a partial
5322 Neon vector structure.
5323 (aarch64_expand_vec_perm_1): Use new vector-tuple mode.
5324 (aarch64_modes_tieable_p): Prevent tieing Neon partial struct
5325 modes with scalar machines modes larger than 8 bytes.
5326 (aarch64_can_change_mode_class): Don't allow changes between
5327 partial and full Neon vector-structure modes.
5328 * config/aarch64/arm_neon.h (vst2_lane_f16): Use updated
5329 builtin and remove boiler-plate code for opaque mode.
5330 (vst2_lane_f32): Likewise.
5331 (vst2_lane_f64): Likewise.
5332 (vst2_lane_p8): Likewise.
5333 (vst2_lane_p16): Likewise.
5334 (vst2_lane_p64): Likewise.
5335 (vst2_lane_s8): Likewise.
5336 (vst2_lane_s16): Likewise.
5337 (vst2_lane_s32): Likewise.
5338 (vst2_lane_s64): Likewise.
5339 (vst2_lane_u8): Likewise.
5340 (vst2_lane_u16): Likewise.
5341 (vst2_lane_u32): Likewise.
5342 (vst2_lane_u64): Likewise.
5343 (vst2q_lane_f16): Likewise.
5344 (vst2q_lane_f32): Likewise.
5345 (vst2q_lane_f64): Likewise.
5346 (vst2q_lane_p8): Likewise.
5347 (vst2q_lane_p16): Likewise.
5348 (vst2q_lane_p64): Likewise.
5349 (vst2q_lane_s8): Likewise.
5350 (vst2q_lane_s16): Likewise.
5351 (vst2q_lane_s32): Likewise.
5352 (vst2q_lane_s64): Likewise.
5353 (vst2q_lane_u8): Likewise.
5354 (vst2q_lane_u16): Likewise.
5355 (vst2q_lane_u32): Likewise.
5356 (vst2q_lane_u64): Likewise.
5357 (vst3_lane_f16): Likewise.
5358 (vst3_lane_f32): Likewise.
5359 (vst3_lane_f64): Likewise.
5360 (vst3_lane_p8): Likewise.
5361 (vst3_lane_p16): Likewise.
5362 (vst3_lane_p64): Likewise.
5363 (vst3_lane_s8): Likewise.
5364 (vst3_lane_s16): Likewise.
5365 (vst3_lane_s32): Likewise.
5366 (vst3_lane_s64): Likewise.
5367 (vst3_lane_u8): Likewise.
5368 (vst3_lane_u16): Likewise.
5369 (vst3_lane_u32): Likewise.
5370 (vst3_lane_u64): Likewise.
5371 (vst3q_lane_f16): Likewise.
5372 (vst3q_lane_f32): Likewise.
5373 (vst3q_lane_f64): Likewise.
5374 (vst3q_lane_p8): Likewise.
5375 (vst3q_lane_p16): Likewise.
5376 (vst3q_lane_p64): Likewise.
5377 (vst3q_lane_s8): Likewise.
5378 (vst3q_lane_s16): Likewise.
5379 (vst3q_lane_s32): Likewise.
5380 (vst3q_lane_s64): Likewise.
5381 (vst3q_lane_u8): Likewise.
5382 (vst3q_lane_u16): Likewise.
5383 (vst3q_lane_u32): Likewise.
5384 (vst3q_lane_u64): Likewise.
5385 (vst4_lane_f16): Likewise.
5386 (vst4_lane_f32): Likewise.
5387 (vst4_lane_f64): Likewise.
5388 (vst4_lane_p8): Likewise.
5389 (vst4_lane_p16): Likewise.
5390 (vst4_lane_p64): Likewise.
5391 (vst4_lane_s8): Likewise.
5392 (vst4_lane_s16): Likewise.
5393 (vst4_lane_s32): Likewise.
5394 (vst4_lane_s64): Likewise.
5395 (vst4_lane_u8): Likewise.
5396 (vst4_lane_u16): Likewise.
5397 (vst4_lane_u32): Likewise.
5398 (vst4_lane_u64): Likewise.
5399 (vst4q_lane_f16): Likewise.
5400 (vst4q_lane_f32): Likewise.
5401 (vst4q_lane_f64): Likewise.
5402 (vst4q_lane_p8): Likewise.
5403 (vst4q_lane_p16): Likewise.
5404 (vst4q_lane_p64): Likewise.
5405 (vst4q_lane_s8): Likewise.
5406 (vst4q_lane_s16): Likewise.
5407 (vst4q_lane_s32): Likewise.
5408 (vst4q_lane_s64): Likewise.
5409 (vst4q_lane_u8): Likewise.
5410 (vst4q_lane_u16): Likewise.
5411 (vst4q_lane_u32): Likewise.
5412 (vst4q_lane_u64): Likewise.
5413 (vtbl3_s8): Likewise.
5414 (vtbl3_u8): Likewise.
5415 (vtbl3_p8): Likewise.
5416 (vtbl4_s8): Likewise.
5417 (vtbl4_u8): Likewise.
5418 (vtbl4_p8): Likewise.
5419 (vld1_u8_x3): Likewise.
5420 (vld1_s8_x3): Likewise.
5421 (vld1_u16_x3): Likewise.
5422 (vld1_s16_x3): Likewise.
5423 (vld1_u32_x3): Likewise.
5424 (vld1_s32_x3): Likewise.
5425 (vld1_u64_x3): Likewise.
5426 (vld1_s64_x3): Likewise.
5427 (vld1_f16_x3): Likewise.
5428 (vld1_f32_x3): Likewise.
5429 (vld1_f64_x3): Likewise.
5430 (vld1_p8_x3): Likewise.
5431 (vld1_p16_x3): Likewise.
5432 (vld1_p64_x3): Likewise.
5433 (vld1q_u8_x3): Likewise.
5434 (vld1q_s8_x3): Likewise.
5435 (vld1q_u16_x3): Likewise.
5436 (vld1q_s16_x3): Likewise.
5437 (vld1q_u32_x3): Likewise.
5438 (vld1q_s32_x3): Likewise.
5439 (vld1q_u64_x3): Likewise.
5440 (vld1q_s64_x3): Likewise.
5441 (vld1q_f16_x3): Likewise.
5442 (vld1q_f32_x3): Likewise.
5443 (vld1q_f64_x3): Likewise.
5444 (vld1q_p8_x3): Likewise.
5445 (vld1q_p16_x3): Likewise.
5446 (vld1q_p64_x3): Likewise.
5447 (vld1_u8_x2): Likewise.
5448 (vld1_s8_x2): Likewise.
5449 (vld1_u16_x2): Likewise.
5450 (vld1_s16_x2): Likewise.
5451 (vld1_u32_x2): Likewise.
5452 (vld1_s32_x2): Likewise.
5453 (vld1_u64_x2): Likewise.
5454 (vld1_s64_x2): Likewise.
5455 (vld1_f16_x2): Likewise.
5456 (vld1_f32_x2): Likewise.
5457 (vld1_f64_x2): Likewise.
5458 (vld1_p8_x2): Likewise.
5459 (vld1_p16_x2): Likewise.
5460 (vld1_p64_x2): Likewise.
5461 (vld1q_u8_x2): Likewise.
5462 (vld1q_s8_x2): Likewise.
5463 (vld1q_u16_x2): Likewise.
5464 (vld1q_s16_x2): Likewise.
5465 (vld1q_u32_x2): Likewise.
5466 (vld1q_s32_x2): Likewise.
5467 (vld1q_u64_x2): Likewise.
5468 (vld1q_s64_x2): Likewise.
5469 (vld1q_f16_x2): Likewise.
5470 (vld1q_f32_x2): Likewise.
5471 (vld1q_f64_x2): Likewise.
5472 (vld1q_p8_x2): Likewise.
5473 (vld1q_p16_x2): Likewise.
5474 (vld1q_p64_x2): Likewise.
5475 (vld1_s8_x4): Likewise.
5476 (vld1q_s8_x4): Likewise.
5477 (vld1_s16_x4): Likewise.
5478 (vld1q_s16_x4): Likewise.
5479 (vld1_s32_x4): Likewise.
5480 (vld1q_s32_x4): Likewise.
5481 (vld1_u8_x4): Likewise.
5482 (vld1q_u8_x4): Likewise.
5483 (vld1_u16_x4): Likewise.
5484 (vld1q_u16_x4): Likewise.
5485 (vld1_u32_x4): Likewise.
5486 (vld1q_u32_x4): Likewise.
5487 (vld1_f16_x4): Likewise.
5488 (vld1q_f16_x4): Likewise.
5489 (vld1_f32_x4): Likewise.
5490 (vld1q_f32_x4): Likewise.
5491 (vld1_p8_x4): Likewise.
5492 (vld1q_p8_x4): Likewise.
5493 (vld1_p16_x4): Likewise.
5494 (vld1q_p16_x4): Likewise.
5495 (vld1_s64_x4): Likewise.
5496 (vld1_u64_x4): Likewise.
5497 (vld1_p64_x4): Likewise.
5498 (vld1q_s64_x4): Likewise.
5499 (vld1q_u64_x4): Likewise.
5500 (vld1q_p64_x4): Likewise.
5501 (vld1_f64_x4): Likewise.
5502 (vld1q_f64_x4): Likewise.
5503 (vld2_s64): Likewise.
5504 (vld2_u64): Likewise.
5505 (vld2_f64): Likewise.
5506 (vld2_s8): Likewise.
5507 (vld2_p8): Likewise.
5508 (vld2_p64): Likewise.
5509 (vld2_s16): Likewise.
5510 (vld2_p16): Likewise.
5511 (vld2_s32): Likewise.
5512 (vld2_u8): Likewise.
5513 (vld2_u16): Likewise.
5514 (vld2_u32): Likewise.
5515 (vld2_f16): Likewise.
5516 (vld2_f32): Likewise.
5517 (vld2q_s8): Likewise.
5518 (vld2q_p8): Likewise.
5519 (vld2q_s16): Likewise.
5520 (vld2q_p16): Likewise.
5521 (vld2q_p64): Likewise.
5522 (vld2q_s32): Likewise.
5523 (vld2q_s64): Likewise.
5524 (vld2q_u8): Likewise.
5525 (vld2q_u16): Likewise.
5526 (vld2q_u32): Likewise.
5527 (vld2q_u64): Likewise.
5528 (vld2q_f16): Likewise.
5529 (vld2q_f32): Likewise.
5530 (vld2q_f64): Likewise.
5531 (vld3_s64): Likewise.
5532 (vld3_u64): Likewise.
5533 (vld3_f64): Likewise.
5534 (vld3_s8): Likewise.
5535 (vld3_p8): Likewise.
5536 (vld3_s16): Likewise.
5537 (vld3_p16): Likewise.
5538 (vld3_s32): Likewise.
5539 (vld3_u8): Likewise.
5540 (vld3_u16): Likewise.
5541 (vld3_u32): Likewise.
5542 (vld3_f16): Likewise.
5543 (vld3_f32): Likewise.
5544 (vld3_p64): Likewise.
5545 (vld3q_s8): Likewise.
5546 (vld3q_p8): Likewise.
5547 (vld3q_s16): Likewise.
5548 (vld3q_p16): Likewise.
5549 (vld3q_s32): Likewise.
5550 (vld3q_s64): Likewise.
5551 (vld3q_u8): Likewise.
5552 (vld3q_u16): Likewise.
5553 (vld3q_u32): Likewise.
5554 (vld3q_u64): Likewise.
5555 (vld3q_f16): Likewise.
5556 (vld3q_f32): Likewise.
5557 (vld3q_f64): Likewise.
5558 (vld3q_p64): Likewise.
5559 (vld4_s64): Likewise.
5560 (vld4_u64): Likewise.
5561 (vld4_f64): Likewise.
5562 (vld4_s8): Likewise.
5563 (vld4_p8): Likewise.
5564 (vld4_s16): Likewise.
5565 (vld4_p16): Likewise.
5566 (vld4_s32): Likewise.
5567 (vld4_u8): Likewise.
5568 (vld4_u16): Likewise.
5569 (vld4_u32): Likewise.
5570 (vld4_f16): Likewise.
5571 (vld4_f32): Likewise.
5572 (vld4_p64): Likewise.
5573 (vld4q_s8): Likewise.
5574 (vld4q_p8): Likewise.
5575 (vld4q_s16): Likewise.
5576 (vld4q_p16): Likewise.
5577 (vld4q_s32): Likewise.
5578 (vld4q_s64): Likewise.
5579 (vld4q_u8): Likewise.
5580 (vld4q_u16): Likewise.
5581 (vld4q_u32): Likewise.
5582 (vld4q_u64): Likewise.
5583 (vld4q_f16): Likewise.
5584 (vld4q_f32): Likewise.
5585 (vld4q_f64): Likewise.
5586 (vld4q_p64): Likewise.
5587 (vld2_dup_s8): Likewise.
5588 (vld2_dup_s16): Likewise.
5589 (vld2_dup_s32): Likewise.
5590 (vld2_dup_f16): Likewise.
5591 (vld2_dup_f32): Likewise.
5592 (vld2_dup_f64): Likewise.
5593 (vld2_dup_u8): Likewise.
5594 (vld2_dup_u16): Likewise.
5595 (vld2_dup_u32): Likewise.
5596 (vld2_dup_p8): Likewise.
5597 (vld2_dup_p16): Likewise.
5598 (vld2_dup_p64): Likewise.
5599 (vld2_dup_s64): Likewise.
5600 (vld2_dup_u64): Likewise.
5601 (vld2q_dup_s8): Likewise.
5602 (vld2q_dup_p8): Likewise.
5603 (vld2q_dup_s16): Likewise.
5604 (vld2q_dup_p16): Likewise.
5605 (vld2q_dup_s32): Likewise.
5606 (vld2q_dup_s64): Likewise.
5607 (vld2q_dup_u8): Likewise.
5608 (vld2q_dup_u16): Likewise.
5609 (vld2q_dup_u32): Likewise.
5610 (vld2q_dup_u64): Likewise.
5611 (vld2q_dup_f16): Likewise.
5612 (vld2q_dup_f32): Likewise.
5613 (vld2q_dup_f64): Likewise.
5614 (vld2q_dup_p64): Likewise.
5615 (vld3_dup_s64): Likewise.
5616 (vld3_dup_u64): Likewise.
5617 (vld3_dup_f64): Likewise.
5618 (vld3_dup_s8): Likewise.
5619 (vld3_dup_p8): Likewise.
5620 (vld3_dup_s16): Likewise.
5621 (vld3_dup_p16): Likewise.
5622 (vld3_dup_s32): Likewise.
5623 (vld3_dup_u8): Likewise.
5624 (vld3_dup_u16): Likewise.
5625 (vld3_dup_u32): Likewise.
5626 (vld3_dup_f16): Likewise.
5627 (vld3_dup_f32): Likewise.
5628 (vld3_dup_p64): Likewise.
5629 (vld3q_dup_s8): Likewise.
5630 (vld3q_dup_p8): Likewise.
5631 (vld3q_dup_s16): Likewise.
5632 (vld3q_dup_p16): Likewise.
5633 (vld3q_dup_s32): Likewise.
5634 (vld3q_dup_s64): Likewise.
5635 (vld3q_dup_u8): Likewise.
5636 (vld3q_dup_u16): Likewise.
5637 (vld3q_dup_u32): Likewise.
5638 (vld3q_dup_u64): Likewise.
5639 (vld3q_dup_f16): Likewise.
5640 (vld3q_dup_f32): Likewise.
5641 (vld3q_dup_f64): Likewise.
5642 (vld3q_dup_p64): Likewise.
5643 (vld4_dup_s64): Likewise.
5644 (vld4_dup_u64): Likewise.
5645 (vld4_dup_f64): Likewise.
5646 (vld4_dup_s8): Likewise.
5647 (vld4_dup_p8): Likewise.
5648 (vld4_dup_s16): Likewise.
5649 (vld4_dup_p16): Likewise.
5650 (vld4_dup_s32): Likewise.
5651 (vld4_dup_u8): Likewise.
5652 (vld4_dup_u16): Likewise.
5653 (vld4_dup_u32): Likewise.
5654 (vld4_dup_f16): Likewise.
5655 (vld4_dup_f32): Likewise.
5656 (vld4_dup_p64): Likewise.
5657 (vld4q_dup_s8): Likewise.
5658 (vld4q_dup_p8): Likewise.
5659 (vld4q_dup_s16): Likewise.
5660 (vld4q_dup_p16): Likewise.
5661 (vld4q_dup_s32): Likewise.
5662 (vld4q_dup_s64): Likewise.
5663 (vld4q_dup_u8): Likewise.
5664 (vld4q_dup_u16): Likewise.
5665 (vld4q_dup_u32): Likewise.
5666 (vld4q_dup_u64): Likewise.
5667 (vld4q_dup_f16): Likewise.
5668 (vld4q_dup_f32): Likewise.
5669 (vld4q_dup_f64): Likewise.
5670 (vld4q_dup_p64): Likewise.
5671 (vld2_lane_u8): Likewise.
5672 (vld2_lane_u16): Likewise.
5673 (vld2_lane_u32): Likewise.
5674 (vld2_lane_u64): Likewise.
5675 (vld2_lane_s8): Likewise.
5676 (vld2_lane_s16): Likewise.
5677 (vld2_lane_s32): Likewise.
5678 (vld2_lane_s64): Likewise.
5679 (vld2_lane_f16): Likewise.
5680 (vld2_lane_f32): Likewise.
5681 (vld2_lane_f64): Likewise.
5682 (vld2_lane_p8): Likewise.
5683 (vld2_lane_p16): Likewise.
5684 (vld2_lane_p64): Likewise.
5685 (vld2q_lane_u8): Likewise.
5686 (vld2q_lane_u16): Likewise.
5687 (vld2q_lane_u32): Likewise.
5688 (vld2q_lane_u64): Likewise.
5689 (vld2q_lane_s8): Likewise.
5690 (vld2q_lane_s16): Likewise.
5691 (vld2q_lane_s32): Likewise.
5692 (vld2q_lane_s64): Likewise.
5693 (vld2q_lane_f16): Likewise.
5694 (vld2q_lane_f32): Likewise.
5695 (vld2q_lane_f64): Likewise.
5696 (vld2q_lane_p8): Likewise.
5697 (vld2q_lane_p16): Likewise.
5698 (vld2q_lane_p64): Likewise.
5699 (vld3_lane_u8): Likewise.
5700 (vld3_lane_u16): Likewise.
5701 (vld3_lane_u32): Likewise.
5702 (vld3_lane_u64): Likewise.
5703 (vld3_lane_s8): Likewise.
5704 (vld3_lane_s16): Likewise.
5705 (vld3_lane_s32): Likewise.
5706 (vld3_lane_s64): Likewise.
5707 (vld3_lane_f16): Likewise.
5708 (vld3_lane_f32): Likewise.
5709 (vld3_lane_f64): Likewise.
5710 (vld3_lane_p8): Likewise.
5711 (vld3_lane_p16): Likewise.
5712 (vld3_lane_p64): Likewise.
5713 (vld3q_lane_u8): Likewise.
5714 (vld3q_lane_u16): Likewise.
5715 (vld3q_lane_u32): Likewise.
5716 (vld3q_lane_u64): Likewise.
5717 (vld3q_lane_s8): Likewise.
5718 (vld3q_lane_s16): Likewise.
5719 (vld3q_lane_s32): Likewise.
5720 (vld3q_lane_s64): Likewise.
5721 (vld3q_lane_f16): Likewise.
5722 (vld3q_lane_f32): Likewise.
5723 (vld3q_lane_f64): Likewise.
5724 (vld3q_lane_p8): Likewise.
5725 (vld3q_lane_p16): Likewise.
5726 (vld3q_lane_p64): Likewise.
5727 (vld4_lane_u8): Likewise.
5728 (vld4_lane_u16): Likewise.
5729 (vld4_lane_u32): Likewise.
5730 (vld4_lane_u64): Likewise.
5731 (vld4_lane_s8): Likewise.
5732 (vld4_lane_s16): Likewise.
5733 (vld4_lane_s32): Likewise.
5734 (vld4_lane_s64): Likewise.
5735 (vld4_lane_f16): Likewise.
5736 (vld4_lane_f32): Likewise.
5737 (vld4_lane_f64): Likewise.
5738 (vld4_lane_p8): Likewise.
5739 (vld4_lane_p16): Likewise.
5740 (vld4_lane_p64): Likewise.
5741 (vld4q_lane_u8): Likewise.
5742 (vld4q_lane_u16): Likewise.
5743 (vld4q_lane_u32): Likewise.
5744 (vld4q_lane_u64): Likewise.
5745 (vld4q_lane_s8): Likewise.
5746 (vld4q_lane_s16): Likewise.
5747 (vld4q_lane_s32): Likewise.
5748 (vld4q_lane_s64): Likewise.
5749 (vld4q_lane_f16): Likewise.
5750 (vld4q_lane_f32): Likewise.
5751 (vld4q_lane_f64): Likewise.
5752 (vld4q_lane_p8): Likewise.
5753 (vld4q_lane_p16): Likewise.
5754 (vld4q_lane_p64): Likewise.
5755 (vqtbl2_s8): Likewise.
5756 (vqtbl2_u8): Likewise.
5757 (vqtbl2_p8): Likewise.
5758 (vqtbl2q_s8): Likewise.
5759 (vqtbl2q_u8): Likewise.
5760 (vqtbl2q_p8): Likewise.
5761 (vqtbl3_s8): Likewise.
5762 (vqtbl3_u8): Likewise.
5763 (vqtbl3_p8): Likewise.
5764 (vqtbl3q_s8): Likewise.
5765 (vqtbl3q_u8): Likewise.
5766 (vqtbl3q_p8): Likewise.
5767 (vqtbl4_s8): Likewise.
5768 (vqtbl4_u8): Likewise.
5769 (vqtbl4_p8): Likewise.
5770 (vqtbl4q_s8): Likewise.
5771 (vqtbl4q_u8): Likewise.
5772 (vqtbl4q_p8): Likewise.
5773 (vqtbx2_s8): Likewise.
5774 (vqtbx2_u8): Likewise.
5775 (vqtbx2_p8): Likewise.
5776 (vqtbx2q_s8): Likewise.
5777 (vqtbx2q_u8): Likewise.
5778 (vqtbx2q_p8): Likewise.
5779 (vqtbx3_s8): Likewise.
5780 (vqtbx3_u8): Likewise.
5781 (vqtbx3_p8): Likewise.
5782 (vqtbx3q_s8): Likewise.
5783 (vqtbx3q_u8): Likewise.
5784 (vqtbx3q_p8): Likewise.
5785 (vqtbx4_s8): Likewise.
5786 (vqtbx4_u8): Likewise.
5787 (vqtbx4_p8): Likewise.
5788 (vqtbx4q_s8): Likewise.
5789 (vqtbx4q_u8): Likewise.
5790 (vqtbx4q_p8): Likewise.
5791 (vst1_s64_x2): Likewise.
5792 (vst1_u64_x2): Likewise.
5793 (vst1_f64_x2): Likewise.
5794 (vst1_s8_x2): Likewise.
5795 (vst1_p8_x2): Likewise.
5796 (vst1_s16_x2): Likewise.
5797 (vst1_p16_x2): Likewise.
5798 (vst1_s32_x2): Likewise.
5799 (vst1_u8_x2): Likewise.
5800 (vst1_u16_x2): Likewise.
5801 (vst1_u32_x2): Likewise.
5802 (vst1_f16_x2): Likewise.
5803 (vst1_f32_x2): Likewise.
5804 (vst1_p64_x2): Likewise.
5805 (vst1q_s8_x2): Likewise.
5806 (vst1q_p8_x2): Likewise.
5807 (vst1q_s16_x2): Likewise.
5808 (vst1q_p16_x2): Likewise.
5809 (vst1q_s32_x2): Likewise.
5810 (vst1q_s64_x2): Likewise.
5811 (vst1q_u8_x2): Likewise.
5812 (vst1q_u16_x2): Likewise.
5813 (vst1q_u32_x2): Likewise.
5814 (vst1q_u64_x2): Likewise.
5815 (vst1q_f16_x2): Likewise.
5816 (vst1q_f32_x2): Likewise.
5817 (vst1q_f64_x2): Likewise.
5818 (vst1q_p64_x2): Likewise.
5819 (vst1_s64_x3): Likewise.
5820 (vst1_u64_x3): Likewise.
5821 (vst1_f64_x3): Likewise.
5822 (vst1_s8_x3): Likewise.
5823 (vst1_p8_x3): Likewise.
5824 (vst1_s16_x3): Likewise.
5825 (vst1_p16_x3): Likewise.
5826 (vst1_s32_x3): Likewise.
5827 (vst1_u8_x3): Likewise.
5828 (vst1_u16_x3): Likewise.
5829 (vst1_u32_x3): Likewise.
5830 (vst1_f16_x3): Likewise.
5831 (vst1_f32_x3): Likewise.
5832 (vst1_p64_x3): Likewise.
5833 (vst1q_s8_x3): Likewise.
5834 (vst1q_p8_x3): Likewise.
5835 (vst1q_s16_x3): Likewise.
5836 (vst1q_p16_x3): Likewise.
5837 (vst1q_s32_x3): Likewise.
5838 (vst1q_s64_x3): Likewise.
5839 (vst1q_u8_x3): Likewise.
5840 (vst1q_u16_x3): Likewise.
5841 (vst1q_u32_x3): Likewise.
5842 (vst1q_u64_x3): Likewise.
5843 (vst1q_f16_x3): Likewise.
5844 (vst1q_f32_x3): Likewise.
5845 (vst1q_f64_x3): Likewise.
5846 (vst1q_p64_x3): Likewise.
5847 (vst1_s8_x4): Likewise.
5848 (vst1q_s8_x4): Likewise.
5849 (vst1_s16_x4): Likewise.
5850 (vst1q_s16_x4): Likewise.
5851 (vst1_s32_x4): Likewise.
5852 (vst1q_s32_x4): Likewise.
5853 (vst1_u8_x4): Likewise.
5854 (vst1q_u8_x4): Likewise.
5855 (vst1_u16_x4): Likewise.
5856 (vst1q_u16_x4): Likewise.
5857 (vst1_u32_x4): Likewise.
5858 (vst1q_u32_x4): Likewise.
5859 (vst1_f16_x4): Likewise.
5860 (vst1q_f16_x4): Likewise.
5861 (vst1_f32_x4): Likewise.
5862 (vst1q_f32_x4): Likewise.
5863 (vst1_p8_x4): Likewise.
5864 (vst1q_p8_x4): Likewise.
5865 (vst1_p16_x4): Likewise.
5866 (vst1q_p16_x4): Likewise.
5867 (vst1_s64_x4): Likewise.
5868 (vst1_u64_x4): Likewise.
5869 (vst1_p64_x4): Likewise.
5870 (vst1q_s64_x4): Likewise.
5871 (vst1q_u64_x4): Likewise.
5872 (vst1q_p64_x4): Likewise.
5873 (vst1_f64_x4): Likewise.
5874 (vst1q_f64_x4): Likewise.
5875 (vst2_s64): Likewise.
5876 (vst2_u64): Likewise.
5877 (vst2_f64): Likewise.
5878 (vst2_s8): Likewise.
5879 (vst2_p8): Likewise.
5880 (vst2_s16): Likewise.
5881 (vst2_p16): Likewise.
5882 (vst2_s32): Likewise.
5883 (vst2_u8): Likewise.
5884 (vst2_u16): Likewise.
5885 (vst2_u32): Likewise.
5886 (vst2_f16): Likewise.
5887 (vst2_f32): Likewise.
5888 (vst2_p64): Likewise.
5889 (vst2q_s8): Likewise.
5890 (vst2q_p8): Likewise.
5891 (vst2q_s16): Likewise.
5892 (vst2q_p16): Likewise.
5893 (vst2q_s32): Likewise.
5894 (vst2q_s64): Likewise.
5895 (vst2q_u8): Likewise.
5896 (vst2q_u16): Likewise.
5897 (vst2q_u32): Likewise.
5898 (vst2q_u64): Likewise.
5899 (vst2q_f16): Likewise.
5900 (vst2q_f32): Likewise.
5901 (vst2q_f64): Likewise.
5902 (vst2q_p64): Likewise.
5903 (vst3_s64): Likewise.
5904 (vst3_u64): Likewise.
5905 (vst3_f64): Likewise.
5906 (vst3_s8): Likewise.
5907 (vst3_p8): Likewise.
5908 (vst3_s16): Likewise.
5909 (vst3_p16): Likewise.
5910 (vst3_s32): Likewise.
5911 (vst3_u8): Likewise.
5912 (vst3_u16): Likewise.
5913 (vst3_u32): Likewise.
5914 (vst3_f16): Likewise.
5915 (vst3_f32): Likewise.
5916 (vst3_p64): Likewise.
5917 (vst3q_s8): Likewise.
5918 (vst3q_p8): Likewise.
5919 (vst3q_s16): Likewise.
5920 (vst3q_p16): Likewise.
5921 (vst3q_s32): Likewise.
5922 (vst3q_s64): Likewise.
5923 (vst3q_u8): Likewise.
5924 (vst3q_u16): Likewise.
5925 (vst3q_u32): Likewise.
5926 (vst3q_u64): Likewise.
5927 (vst3q_f16): Likewise.
5928 (vst3q_f32): Likewise.
5929 (vst3q_f64): Likewise.
5930 (vst3q_p64): Likewise.
5931 (vst4_s64): Likewise.
5932 (vst4_u64): Likewise.
5933 (vst4_f64): Likewise.
5934 (vst4_s8): Likewise.
5935 (vst4_p8): Likewise.
5936 (vst4_s16): Likewise.
5937 (vst4_p16): Likewise.
5938 (vst4_s32): Likewise.
5939 (vst4_u8): Likewise.
5940 (vst4_u16): Likewise.
5941 (vst4_u32): Likewise.
5942 (vst4_f16): Likewise.
5943 (vst4_f32): Likewise.
5944 (vst4_p64): Likewise.
5945 (vst4q_s8): Likewise.
5946 (vst4q_p8): Likewise.
5947 (vst4q_s16): Likewise.
5948 (vst4q_p16): Likewise.
5949 (vst4q_s32): Likewise.
5950 (vst4q_s64): Likewise.
5951 (vst4q_u8): Likewise.
5952 (vst4q_u16): Likewise.
5953 (vst4q_u32): Likewise.
5954 (vst4q_u64): Likewise.
5955 (vst4q_f16): Likewise.
5956 (vst4q_f32): Likewise.
5957 (vst4q_f64): Likewise.
5958 (vst4q_p64): Likewise.
5959 (vtbx4_s8): Likewise.
5960 (vtbx4_u8): Likewise.
5961 (vtbx4_p8): Likewise.
5962 (vld1_bf16_x2): Likewise.
5963 (vld1q_bf16_x2): Likewise.
5964 (vld1_bf16_x3): Likewise.
5965 (vld1q_bf16_x3): Likewise.
5966 (vld1_bf16_x4): Likewise.
5967 (vld1q_bf16_x4): Likewise.
5968 (vld2_bf16): Likewise.
5969 (vld2q_bf16): Likewise.
5970 (vld2_dup_bf16): Likewise.
5971 (vld2q_dup_bf16): Likewise.
5972 (vld3_bf16): Likewise.
5973 (vld3q_bf16): Likewise.
5974 (vld3_dup_bf16): Likewise.
5975 (vld3q_dup_bf16): Likewise.
5976 (vld4_bf16): Likewise.
5977 (vld4q_bf16): Likewise.
5978 (vld4_dup_bf16): Likewise.
5979 (vld4q_dup_bf16): Likewise.
5980 (vst1_bf16_x2): Likewise.
5981 (vst1q_bf16_x2): Likewise.
5982 (vst1_bf16_x3): Likewise.
5983 (vst1q_bf16_x3): Likewise.
5984 (vst1_bf16_x4): Likewise.
5985 (vst1q_bf16_x4): Likewise.
5986 (vst2_bf16): Likewise.
5987 (vst2q_bf16): Likewise.
5988 (vst3_bf16): Likewise.
5989 (vst3q_bf16): Likewise.
5990 (vst4_bf16): Likewise.
5991 (vst4q_bf16): Likewise.
5992 (vld2_lane_bf16): Likewise.
5993 (vld2q_lane_bf16): Likewise.
5994 (vld3_lane_bf16): Likewise.
5995 (vld3q_lane_bf16): Likewise.
5996 (vld4_lane_bf16): Likewise.
5997 (vld4q_lane_bf16): Likewise.
5998 (vst2_lane_bf16): Likewise.
5999 (vst2q_lane_bf16): Likewise.
6000 (vst3_lane_bf16): Likewise.
6001 (vst3q_lane_bf16): Likewise.
6002 (vst4_lane_bf16): Likewise.
6003 (vst4q_lane_bf16): Likewise.
6004 * config/aarch64/geniterators.sh: Modify iterator regex to
6005 match new vector-tuple modes.
6006 * config/aarch64/iterators.md (insn_count): Extend mode
6007 attribute with vector-tuple type information.
6009 (Vendreg): Likewise.
6012 (VSTRUCT_2D): New mode iterator.
6013 (VSTRUCT_2DNX): Likewise.
6014 (VSTRUCT_2DX): Likewise.
6015 (VSTRUCT_2Q): Likewise.
6016 (VSTRUCT_2QD): Likewise.
6017 (VSTRUCT_3D): Likewise.
6018 (VSTRUCT_3DNX): Likewise.
6019 (VSTRUCT_3DX): Likewise.
6020 (VSTRUCT_3Q): Likewise.
6021 (VSTRUCT_3QD): Likewise.
6022 (VSTRUCT_4D): Likewise.
6023 (VSTRUCT_4DNX): Likewise.
6024 (VSTRUCT_4DX): Likewise.
6025 (VSTRUCT_4Q): Likewise.
6026 (VSTRUCT_4QD): Likewise.
6027 (VSTRUCT_D): Likewise.
6028 (VSTRUCT_Q): Likewise.
6029 (VSTRUCT_QD): Likewise.
6030 (VSTRUCT_ELT): New mode attribute.
6031 (vstruct_elt): Likewise.
6032 * genmodes.c (VECTOR_MODE): Add default prefix and order
6034 (VECTOR_MODE_WITH_PREFIX): Define.
6035 (make_vector_mode): Add mode prefix and order parameters.
6037 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
6039 * expmed.c (extract_bit_field_1): Ensure modes are tieable.
6041 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
6043 * expr.c (emit_group_load_1): Remove historic workaround.
6045 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
6047 * config/aarch64/aarch64-builtins.c (aarch64_init_simd_builtins):
6048 Factor out main loop to...
6049 (aarch64_init_simd_builtin_functions): This new function.
6050 (register_tuple_type): Define.
6051 (aarch64_scalar_builtin_type_p): Define.
6052 (handle_arm_neon_h): Define.
6053 * config/aarch64/aarch64-c.c (aarch64_pragma_aarch64): Handle
6054 pragma for arm_neon.h.
6055 * config/aarch64/aarch64-protos.h (aarch64_advsimd_struct_mode_p):
6057 (handle_arm_neon_h): Likewise.
6058 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
6059 Remove static modifier.
6060 * config/aarch64/arm_neon.h (target): Remove Neon vector
6061 structure type definitions.
6063 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
6065 PR tree-optimization/102943
6066 * gimple-range-path.cc (path_range_query::range_on_path_entry):
6067 Prefer range_of_expr unless there are no statements in the BB.
6069 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
6071 PR tree-optimization/102943
6072 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
6073 Avoid duplicate calculation of paths.
6075 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
6077 PR tree-optimization/102943
6078 * gimple-range-path.cc (path_range_query::compute_phi_relations):
6079 Only compute relations for SSA names in the import list.
6080 (path_range_query::compute_outgoing_relations): Same.
6081 * gimple-range-path.h (path_range_query::import_p): New.
6083 2021-11-04 Richard Biener <rguenther@suse.de>
6085 PR rtl-optimization/103075
6086 * simplify-rtx.c (exact_int_to_float_conversion_p): Return
6087 false for a VOIDmode operand.
6089 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
6091 * config/aarch64/aarch64.c (aarch64_vector_costs): Make member
6092 variables private and add "m_" to their names. Remove is_loop.
6093 (aarch64_record_potential_advsimd_unrolling): Replace with...
6094 (aarch64_vector_costs::record_potential_advsimd_unrolling): ...this.
6095 (aarch64_analyze_loop_vinfo): Replace with...
6096 (aarch64_vector_costs::analyze_loop_vinfo): ...this.
6097 Move initialization of (m_)vec_flags to add_stmt_cost.
6098 (aarch64_analyze_bb_vinfo): Delete.
6099 (aarch64_count_ops): Replace with...
6100 (aarch64_vector_costs::count_ops): ...this.
6101 (aarch64_vector_costs::add_stmt_cost): Set m_vec_flags,
6102 using m_costing_for_scalar to test whether we're costing
6103 scalar or vector code.
6104 (aarch64_adjust_body_cost_sve): Replace with...
6105 (aarch64_vector_costs::adjust_body_cost_sve): ...this.
6106 (aarch64_adjust_body_cost): Replace with...
6107 (aarch64_vector_costs::adjust_body_cost): ...this.
6108 (aarch64_vector_costs::finish_cost): Use m_vinfo instead of is_loop.
6110 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
6112 * target.def (targetm.vectorize.init_cost): Replace with...
6113 (targetm.vectorize.create_costs): ...this.
6114 (targetm.vectorize.add_stmt_cost): Delete.
6115 (targetm.vectorize.finish_cost): Likewise.
6116 (targetm.vectorize.destroy_cost_data): Likewise.
6117 * doc/tm.texi.in (TARGET_VECTORIZE_INIT_COST): Replace with...
6118 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
6119 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
6120 (TARGET_VECTORIZE_FINISH_COST): Likewise.
6121 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
6122 * doc/tm.texi: Regenerate.
6123 * tree-vectorizer.h (vec_info::vec_info): Remove target_cost_data
6125 (vec_info::target_cost_data): Change from a void * to a vector_costs *.
6126 (vector_costs): New class.
6127 (init_cost): Take a vec_info and return a vector_costs.
6128 (dump_stmt_cost): Remove data parameter.
6129 (add_stmt_cost): Replace vinfo and data parameters with a vector_costs.
6130 (add_stmt_costs): Likewise.
6131 (finish_cost): Replace data parameter with a vector_costs.
6132 (destroy_cost_data): Delete.
6133 * tree-vectorizer.c (dump_stmt_cost): Remove data argument and
6135 (vec_info::vec_info): Remove the target_cost_data parameter and
6136 initialize the member variable to null instead.
6137 (vec_info::~vec_info): Delete target_cost_data instead of calling
6139 (vector_costs::add_stmt_cost): New function.
6140 (vector_costs::finish_cost): Likewise.
6141 (vector_costs::record_stmt_cost): Likewise.
6142 (vector_costs::adjust_cost_for_freq): Likewise.
6143 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
6144 call to vec_info::vec_info.
6145 (vect_compute_single_scalar_iteration_cost): Update after above
6146 changes to costing interface.
6147 (vect_analyze_loop_operations): Likewise.
6148 (vect_estimate_min_profitable_iters): Likewise.
6149 (vect_analyze_loop_2): Initialize LOOP_VINFO_TARGET_COST_DATA
6150 at the start_over point, where it needs to be recreated after
6151 trying without slp. Update retry code accordingly.
6152 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Update call
6153 to vec_info::vec_info.
6154 (vect_slp_analyze_operation): Update after above changes to costing
6156 (vect_bb_vectorization_profitable_p): Likewise.
6157 * targhooks.h (default_init_cost): Replace with...
6158 (default_vectorize_create_costs): ...this.
6159 (default_add_stmt_cost): Delete.
6160 (default_finish_cost, default_destroy_cost_data): Likewise.
6161 * targhooks.c (default_init_cost): Replace with...
6162 (default_vectorize_create_costs): ...this.
6163 (default_add_stmt_cost): Delete, moving logic to vector_costs instead.
6164 (default_finish_cost, default_destroy_cost_data): Delete.
6165 * config/aarch64/aarch64.c (aarch64_vector_costs): Inherit from
6166 vector_costs. Add a constructor.
6167 (aarch64_init_cost): Replace with...
6168 (aarch64_vectorize_create_costs): ...this.
6169 (aarch64_add_stmt_cost): Replace with...
6170 (aarch64_vector_costs::add_stmt_cost): ...this. Use record_stmt_cost
6171 to adjust the cost for inner loops.
6172 (aarch64_finish_cost): Replace with...
6173 (aarch64_vector_costs::finish_cost): ...this.
6174 (aarch64_destroy_cost_data): Delete.
6175 (TARGET_VECTORIZE_INIT_COST): Replace with...
6176 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
6177 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
6178 (TARGET_VECTORIZE_FINISH_COST): Likewise.
6179 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
6180 * config/i386/i386.c (ix86_vector_costs): New structure.
6181 (ix86_init_cost): Replace with...
6182 (ix86_vectorize_create_costs): ...this.
6183 (ix86_add_stmt_cost): Replace with...
6184 (ix86_vector_costs::add_stmt_cost): ...this. Use adjust_cost_for_freq
6185 to adjust the cost for inner loops.
6186 (ix86_finish_cost, ix86_destroy_cost_data): Delete.
6187 (TARGET_VECTORIZE_INIT_COST): Replace with...
6188 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
6189 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
6190 (TARGET_VECTORIZE_FINISH_COST): Likewise.
6191 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
6192 * config/rs6000/rs6000.c (TARGET_VECTORIZE_INIT_COST): Replace with...
6193 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
6194 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
6195 (TARGET_VECTORIZE_FINISH_COST): Likewise.
6196 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
6197 (rs6000_cost_data): Inherit from vector_costs.
6198 Add a constructor. Drop loop_info, cost and costing_for_scalar
6199 in favor of the corresponding vector_costs member variables.
6200 Add "m_" to the names of the remaining member variables and
6202 (rs6000_density_test): Replace with...
6203 (rs6000_cost_data::density_test): ...this.
6204 (rs6000_init_cost): Replace with...
6205 (rs6000_vectorize_create_costs): ...this.
6206 (rs6000_update_target_cost_per_stmt): Replace with...
6207 (rs6000_cost_data::update_target_cost_per_stmt): ...this.
6208 (rs6000_add_stmt_cost): Replace with...
6209 (rs6000_cost_data::add_stmt_cost): ...this. Use adjust_cost_for_freq
6210 to adjust the cost for inner loops.
6211 (rs6000_adjust_vect_cost_per_loop): Replace with...
6212 (rs6000_cost_data::adjust_vect_cost_per_loop): ...this.
6213 (rs6000_finish_cost): Replace with...
6214 (rs6000_cost_data::finish_cost): ...this. Group loop code
6215 into a single if statement and pass the loop_vinfo down to
6217 (rs6000_destroy_cost_data): Delete.
6219 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
6221 PR tree-optimization/103062
6222 PR tree-optimization/103062
6223 * value-pointer-equiv.cc (ssa_equiv_stack::ssa_equiv_stack):
6224 Increase size of allocation by 1.
6225 (ssa_equiv_stack::push_replacement): Grow as needed.
6226 (ssa_equiv_stack::get_replacement): Same.
6227 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
6228 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Remove delete.
6229 (pointer_equiv_analyzer::set_global_equiv): Grow as needed.
6230 (pointer_equiv_analyzer::get_equiv): Same.
6231 (pointer_equiv_analyzer::get_equiv_expr): Remove const.
6232 * value-pointer-equiv.h (class pointer_equiv_analyzer): Remove
6233 const markers. Use auto_vec instead of tree *.
6235 2021-11-04 Richard Biener <rguenther@suse.de>
6237 * tree-ssa-sccvn.c (vn_nary_op_insert_into): Remove always
6238 true parameter and inline valueization.
6239 (vn_nary_op_lookup_1): Inline valueization from ...
6240 (vn_nary_op_compute_hash): ... here and remove it here.
6241 * tree-ssa-pre.c (phi_translate_1): Do not valueize
6242 before vn_nary_lookup_pieces.
6243 (get_representative_for): Mark created SSA representatives
6246 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
6248 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): Assert
6249 that the operand has a vector mode. Use subreg_lowpart_offset
6250 to test whether an index corresponds to the low part.
6252 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
6254 * read-rtl.c: Remove dead !GENERATOR_FILE block.
6255 * read-rtl-function.c (function_reader::consolidate_singletons):
6256 Generate canonical CONST_VECTORs.
6258 2021-11-04 liuhongt <hongtao.liu@intel.com>
6261 * config/i386/predicates.md (reg_or_notreg_operand): Rename to ..
6262 (regmem_or_bitnot_regmem_operand): .. and extend to handle
6264 * config/i386/sse.md (*<avx512>_vpternlog<mode>_1): Force_reg
6265 the operands which are required to be register_operand.
6266 (*<avx512>_vpternlog<mode>_2): Ditto.
6267 (*<avx512>_vpternlog<mode>_3): Ditto.
6268 (*<avx512>_vternlog<mode>_all): Disallow embeded broadcast for
6269 vector HFmodes since it's not a real AVX512FP16 instruction.
6271 2021-11-04 liuhongt <hongtao.liu@intel.com>
6274 * match.pd: simplify (trunc)copysign((extend)a, (extend)b) to
6275 .COPYSIGN (a,b) when a and b are same type as the truncation
6276 type and has less precision than extend type.
6278 2021-11-04 Richard Biener <rguenther@suse.de>
6280 * doc/generic.texi: Update TARGET_MEM_REF and MEM_REF
6283 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
6285 * config/i386/sse.md (VI2_AVX512VNNIBW): New mode iterator.
6286 (VI1_AVX512VNNI): Likewise.
6287 (SDOT_VPDP_SUF): New mode_attr.
6290 (sdot_prod<mode>): Use VI2_AVX512F iterator, expand to
6291 vpdpwssd when VNNI targets available.
6292 (usdot_prod<mode>): New expander for vector QImode.
6294 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
6296 * config/i386/amxtileintrin.h (_tile_loadd_internal): Add
6297 parentheses to base and stride.
6298 (_tile_stream_loadd_internal): Likewise.
6299 (_tile_stored_internal): Likewise.
6301 2021-11-03 Maciej W. Rozycki <macro@embecosm.com>
6303 * config/riscv/riscv.c (riscv_class_max_nregs): Swap the
6304 arguments to `reg_class_subset_p'.
6306 2021-11-03 Joseph Myers <joseph@codesourcery.com>
6309 * fold-const.c (fold_init): New function.
6310 * fold-const.h (fold_init): New prototype.
6312 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
6314 * params.opt (param_vrp2_mode): Make ranger the default for VRP2.
6316 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
6318 * gimple-range-fold.cc (fold_using_range::range_of_phi): Don't import
6319 a range from edge if arg == phidef.
6321 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
6323 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Test
6324 for constant before any other processing.
6326 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
6328 * flag-types.h (RANGER_DEBUG_ALL): Fix values.
6330 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
6332 * gimple-range.cc (gimple_ranger::gimple_ranger): Initialize current_bb.
6333 (gimple_ranger::range_of_expr): Pick up range_on_entry when there is
6334 no explcit context and current_bb is set.
6335 (gimple_ranger::fold_stmt): New.
6336 * gimple-range.h (current_bb, fold_stmt): New.
6337 * tree-vrp.c (rvrp_folder::fold_stmt): Call ranger's fold_stmt.
6339 2021-11-03 Richard Biener <rguenther@suse.de>
6341 PR tree-optimization/102970
6342 * tree-ssa-pre.c (phi_translate_1): Drop clique and base
6343 when translating a MEM_REF over a backedge.
6345 2021-11-03 Philipp Tomsich <philipp.tomsich@vrull.eu>
6347 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New Ampere-1 core.
6348 * config/aarch64/aarch64-tune.md: Regenerate.
6349 * config/aarch64/aarch64-cost-tables.h: Add extra costs for Ampere-1.
6350 * config/aarch64/aarch64.c: Add tuning structures for Ampere-1.
6351 * doc/invoke.texi: Add documentation for Ampere-1 core.
6353 2021-11-03 Wilco Dijkstra <wdijkstr@arm.com>
6355 * config/aarch64/aarch64.md (movsi): Add alternative for GOT accesses.
6357 (ldr_got_small_<mode>): Remove pattern.
6358 (ldr_got_small_sidi): Likewise.
6359 * config/aarch64/aarch64.c (aarch64_load_symref_appropriately): Keep
6360 GOT accesses as moves.
6361 (aarch64_print_operand): Correctly print got_lo12 in L specifier.
6362 (aarch64_mov_operand_p): Make GOT accesses valid move operands.
6363 * config/aarch64/constraints.md: Add new constraint Usw for GOT access.
6365 2021-11-03 Martin Liska <mliska@suse.cz>
6367 * gcov.c (read_line): Remove dead variable.
6369 2021-11-03 Martin Liska <mliska@suse.cz>
6372 * ipa-fnsummary.c (edge_predicate_pool): Rename predicate class to ipa_predicate.
6373 (ipa_fn_summary::account_size_time): Likewise.
6374 (edge_set_predicate): Likewise.
6375 (set_hint_predicate): Likewise.
6376 (add_freqcounting_predicate): Likewise.
6377 (evaluate_conditions_for_known_args): Likewise.
6378 (evaluate_properties_for_edge): Likewise.
6379 (remap_freqcounting_preds_after_dup): Likewise.
6380 (ipa_fn_summary_t::duplicate): Likewise.
6381 (set_cond_stmt_execution_predicate): Likewise.
6382 (set_switch_stmt_execution_predicate): Likewise.
6383 (compute_bb_predicates): Likewise.
6384 (will_be_nonconstant_expr_predicate): Likewise.
6385 (will_be_nonconstant_predicate): Likewise.
6386 (phi_result_unknown_predicate): Likewise.
6387 (predicate_for_phi_result): Likewise.
6388 (analyze_function_body): Likewise.
6389 (compute_fn_summary): Likewise.
6390 (summarize_calls_size_and_time): Likewise.
6391 (estimate_calls_size_and_time): Likewise.
6392 (ipa_call_context::estimate_size_and_time): Likewise.
6393 (remap_edge_summaries): Likewise.
6394 (remap_freqcounting_predicate): Likewise.
6395 (ipa_merge_fn_summary_after_inlining): Likewise.
6396 (ipa_update_overall_fn_summary): Likewise.
6397 (read_ipa_call_summary): Likewise.
6398 (inline_read_section): Likewise.
6399 * ipa-fnsummary.h (struct ipa_freqcounting_predicate): Likewise.
6400 * ipa-predicate.c (predicate::add_clause): Likewise.
6401 (ipa_predicate::add_clause): Likewise.
6402 (predicate::or_with): Likewise.
6403 (ipa_predicate::or_with): Likewise.
6404 (predicate::evaluate): Likewise.
6405 (ipa_predicate::evaluate): Likewise.
6406 (predicate::probability): Likewise.
6407 (ipa_predicate::probability): Likewise.
6408 (dump_condition): Likewise.
6409 (dump_clause): Likewise.
6410 (predicate::dump): Likewise.
6411 (ipa_predicate::dump): Likewise.
6412 (predicate::debug): Likewise.
6413 (ipa_predicate::debug): Likewise.
6414 (predicate::remap_after_duplication): Likewise.
6415 (ipa_predicate::remap_after_duplication): Likewise.
6416 (predicate::remap_after_inlining): Likewise.
6417 (ipa_predicate::remap_after_inlining): Likewise.
6418 (predicate::stream_in): Likewise.
6419 (ipa_predicate::stream_in): Likewise.
6420 (predicate::stream_out): Likewise.
6421 (ipa_predicate::stream_out): Likewise.
6422 (add_condition): Likewise.
6423 * ipa-predicate.h (class predicate): Likewise.
6424 (class ipa_predicate): Likewise.
6425 (add_condition): Likewise.
6427 2021-11-03 Richard Biener <rguenther@suse.de>
6429 * bitmap.h (bitmap_bit_p): Change the return type to bool.
6430 * bitmap.c (bitmap_bit_p): Likewise.
6431 * sbitmap.h (bitmap_bit_p): Likewise.
6432 (bitmap_set_bit): Return whether the bit changed.
6433 (bitmap_clear_bit): Likewise.
6434 * tree-ssa.c (verify_vssa): Make use of the changed state
6435 from bitmap_set_bit.
6437 2021-11-03 Richard Biener <rguenther@suse.de>
6439 PR middle-end/103033
6440 * internal-fn.c (expand_DEFERRED_INIT): Elide the
6441 native_interpret_expr path in favor of folding the
6442 VIEW_CONVERT_EXPR generated when punning the RHS.
6444 2021-11-03 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
6446 * config/s390/s390.c (s390_loop_unroll_adjust): In case of early
6449 2021-11-03 Jan Hubicka <hubicka@ucw.cz>
6452 * ipa-modref.c (callee_to_caller_flags): New function.
6453 (modref_eaf_analysis::analyze_ssa_name): Use it.
6454 (ipa_merge_modref_summary_after_inlining): Fix whitespace.
6456 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
6458 * ipa-modref.c (modref_eaf_analysis::analyze_ssa_name): Revert
6461 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
6462 Uroš Bizjak <ubizjak@gmail.com>
6464 * config/i386/i386.md (<any_rotate>ti3): Provide expansion for
6465 rotations by non-constant amounts.
6467 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
6469 * ipa-modref.c: Fix anonymous namespace placement.
6470 (class modref_eaf_analysis): New class.
6471 (analyze_ssa_name_flags): Turn to ...
6472 (modref_eaf_analysis::analyze_ssa_name): ... this one.
6473 (merge_call_lhs_flags): Turn to ...
6474 (modref_eaf_analysis::merge_call_lhs_flags): .. this one
6475 (modref_eaf_analysis::merge_with_ssa_name): New member function.
6476 (record_escape_points): Turn to ...
6477 (modref_eaf_analysis::record_escape_points): ... this one.
6478 (analyze_parms): Updat
6479 (ipa_merge_modref_summary_after_inlining): Move to the end of file.
6481 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
6483 * gimple.c (gimple_call_static_chain_flags): New function.
6484 * gimple.h (gimple_call_static_chain_flags): Declare
6485 * ipa-modref.c (modref_summary::modref_summary): Initialize
6487 (modref_summary_lto::modref_summary_lto): Likewise.
6488 (modref_summary::useful_p): Test static_chain_flags.
6489 (modref_summary_lto::useful_p): Likewise.
6490 (struct modref_summary_lto): Add static_chain_flags.
6491 (modref_summary::dump): Dump static_chain_flags.
6492 (modref_summary_lto::dump): Likewise.
6493 (struct escape_point): Add static_cahin_arg.
6494 (analyze_ssa_name_flags): Use gimple_call_static_chain_flags.
6495 (analyze_parms): Handle static chains.
6496 (modref_summaries::duplicate): Duplicate static_chain_flags.
6497 (modref_summaries_lto::duplicate): Likewise.
6498 (modref_write): Stream static_chain_flags.
6499 (read_section): Likewise.
6500 (modref_merge_call_site_flags): Handle static_chain_flags.
6501 * ipa-modref.h (struct modref_summary): Add static_chain_flags.
6502 * tree-ssa-structalias.c (handle_rhs_call): Use
6503 gimple_static_chain_flags.
6505 2021-11-02 Richard Biener <rguenther@suse.de>
6507 PR tree-optimization/103029
6508 * tree-vect-loop-manip.c (vect_loop_versioning): Ensure
6509 the PHI nodes in the loop maintain their original operand
6512 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
6514 * tree-core.h (EAF_NOT_RETURNED_DIRECTLY): New flag.
6515 (EAF_NOREAD): Renumber.
6516 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOT_RETURNED_DIRECTLY.
6517 (remove_useless_eaf_flags): Handle EAF_NOT_RETURNED_DIRECTLY
6518 (deref_flags): Likewise.
6519 (modref_lattice::init): Likewise.
6520 (modref_lattice::merge): Likewise.
6521 (merge_call_lhs_flags): Likewise.
6522 (analyze_ssa_name_flags): Likewise.
6523 (modref_merge_call_site_flags): Likewise.
6524 * tree-ssa-structalias.c (handle_call_arg): Likewise.
6526 2021-11-02 Maciej W. Rozycki <macro@embecosm.com>
6528 * config/riscv/riscv.c (riscv_rtx_costs): Correct a CONST_INT_P
6529 check and remove an unused local variable with shNadd/shNadd.uw
6532 2021-11-02 Tamar Christina <tamar.christina@arm.com>
6534 PR tree-optimization/103007
6535 * tree-vect-slp-patterns.c (complex_fms_pattern::matches): Add elem
6538 2021-11-02 Richard Biener <rguenther@suse.de>
6540 PR middle-end/103038
6541 * fold-const.c (native_interpret_expr): Handle OFFSET_TYPE.
6542 (can_native_interpret_type_p): Likewise.
6543 * internal-fn.c (expand_DEFERRED_INIT): View-convert the
6544 RHS if the LHS is an SSA name.
6546 2021-11-02 Richard Sandiford <richard.sandiford@arm.com>
6548 * langhooks.h (lang_hooks_for_types::simulate_record_decl): New hook.
6549 * langhooks-def.h (lhd_simulate_record_decl): Declare.
6550 (LANG_HOOKS_SIMULATE_RECORD_DECL): Define.
6551 (LANG_HOOKS_FOR_TYPES_INITIALIZER): Include it.
6552 * langhooks.c (lhd_simulate_record_decl): New function.
6554 2021-11-02 Jakub Jelinek <jakub@redhat.com>
6557 * config/i386/i386.c (ix86_vector_mode_supported_p): Reject vector
6558 modes with TImode inner mode if 32-bit.
6560 2021-11-02 liuhongt <hongtao.liu@intel.com>
6562 * doc/sourcebuild.texi (vect_slp_v4qi_store_unalign,
6563 vect_slp_v2hi_store_unalign, vect_slp_v4hi_store_unalign,
6564 vect_slp_v4si_store_unalign): Document efficient target.
6565 (vect_slp_v4qi_store_unalign_1, vect_slp_v8qi_store_unalign_1,
6566 vect_slp_v16qi_store_unalign_1): Ditto.
6567 (vect_slp_v2hi_store_align,vect_slp_v2qi_store_align,
6568 vect_slp_v2si_store_align, vect_slp_v4qi_store_align): Ditto.
6569 (struct_4char_block_move, struct_8char_block_move,
6570 struct_16char_block_move): Ditto.
6572 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
6573 Jakub Jelinek <jakub@redhat.com>
6576 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti,
6577 ix86_expand_ti_to_v1ti): New helper functions.
6578 (ix86_expand_v1ti_shift): Check if the amount operand is an
6579 integer constant, and expand as a TImode shift if it isn't.
6580 (ix86_expand_v1ti_rotate): Check if the amount operand is an
6581 integer constant, and expand as a TImode rotate if it isn't.
6582 (ix86_expand_v1ti_ashiftrt): New function to expand arithmetic
6583 right shifts of V1TImode quantities.
6584 * config/i386/i386-protos.h (ix86_expand_v1ti_ashift): Prototype.
6585 * config/i386/sse.md (ashlv1ti3, lshrv1ti3): Change constraints
6586 to QImode general_operand, and let the helper functions lower
6587 shifts by non-constant operands, as TImode shifts. Make
6588 conditional on TARGET_64BIT.
6589 (ashrv1ti3): New expander calling ix86_expand_v1ti_ashiftrt.
6590 (rotlv1ti3, rotrv1ti3): Change shift operand to QImode.
6591 Make conditional on TARGET_64BIT.
6593 2021-11-02 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
6595 * config/s390/s390.md ("*cc_to_int", "tabort", "*tabort_1",
6596 "*tabort_1_plus"): Remove operands non-null check.
6598 2021-11-01 Martin Liska <mliska@suse.cz>
6600 * opt-functions.awk: Add new sanity checking.
6601 * optc-gen.awk: Add new argument to integer_range_info.
6602 * params.opt: Update 2 params which have negative IntegerRange.
6604 2021-11-01 qing zhao <qing.zhao@oracle.com>
6606 * gimplify.c (gimplify_decl_expr): Do not add call to
6607 __builtin_clear_padding when a variable is a gimple register
6608 or it might not have padding.
6609 (gimplify_init_constructor): Likewise.
6611 2021-11-01 Tamar Christina <tamar.christina@arm.com>
6613 * config/arm/aarch-common-protos.h (struct vector_cost_table): Add
6614 movi, dup and extract costing fields.
6615 * config/aarch64/aarch64-cost-tables.h (qdf24xx_extra_costs,
6616 thunderx_extra_costs, thunderx2t99_extra_costs,
6617 thunderx3t110_extra_costs, tsv110_extra_costs, a64fx_extra_costs): Use
6619 * config/arm/aarch-cost-tables.h (generic_extra_costs,
6620 cortexa53_extra_costs, cortexa57_extra_costs, cortexa76_extra_costs,
6621 exynosm1_extra_costs, xgene1_extra_costs): Likewise
6622 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>): Add r->w dup.
6623 * config/aarch64/aarch64.c (aarch64_rtx_costs): Add extra costs.
6625 2021-11-01 Tamar Christina <tamar.christina@arm.com>
6627 * cse.c (add_to_set): New.
6628 (find_sets_in_insn): Register constants in sets.
6629 (canonicalize_insn): Use auto_vec instead.
6630 (cse_insn): Try materializing using vec_dup.
6631 * rtl.h (simplify_context::simplify_gen_vec_select,
6632 simplify_gen_vec_select): New.
6633 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): New.
6635 2021-11-01 David Malcolm <dmalcolm@redhat.com>
6637 * common.opt (fdiagnostics-escape-format=): New.
6638 (diagnostics_escape_format): New enum.
6639 (DIAGNOSTICS_ESCAPE_FORMAT_UNICODE): New enum value.
6640 (DIAGNOSTICS_ESCAPE_FORMAT_BYTES): Likewise.
6641 * diagnostic-format-json.cc (json_end_diagnostic): Add
6642 "escape-source" attribute.
6643 * diagnostic-show-locus.c
6644 (exploc_with_display_col::exploc_with_display_col): Replace
6645 "tabstop" param with a cpp_char_column_policy and add an "aspect"
6646 param. Use these to compute m_display_col accordingly.
6647 (struct char_display_policy): New struct.
6648 (layout::m_policy): New field.
6649 (layout::m_escape_on_output): New field.
6650 (def_policy): New function.
6651 (make_range): Update for changes to exploc_with_display_col ctor.
6652 (default_print_decoded_ch): New.
6653 (width_per_escaped_byte): New.
6654 (escape_as_bytes_width): New.
6655 (escape_as_bytes_print): New.
6656 (escape_as_unicode_width): New.
6657 (escape_as_unicode_print): New.
6659 (layout::layout): Initialize new fields. Update m_exploc ctor
6660 call for above change to ctor.
6661 (layout::maybe_add_location_range): Update for changes to
6662 exploc_with_display_col ctor.
6663 (layout::calculate_x_offset_display): Update for change to
6665 (layout::print_source_line): Pass policy
6666 to cpp_display_width_computation. Capture cpp_decoded_char when
6667 calling process_next_codepoint. Move printing of source code to
6668 m_policy.m_print_cb.
6669 (line_label::line_label): Pass in policy rather than context.
6670 (layout::print_any_labels): Update for change to line_label ctor.
6671 (get_affected_range): Pass in policy rather than context, updating
6672 calls to location_compute_display_column accordingly.
6673 (get_printed_columns): Likewise, also for cpp_display_width.
6674 (correction::correction): Pass in policy rather than tabstop.
6675 (correction::compute_display_cols): Pass m_policy rather than
6676 m_tabstop to cpp_display_width.
6677 (correction::m_tabstop): Replace with...
6678 (correction::m_policy): ...this.
6679 (line_corrections::line_corrections): Pass in policy rather than
6681 (line_corrections::m_context): Replace with...
6682 (line_corrections::m_policy): ...this.
6683 (line_corrections::add_hint): Update to use m_policy rather than
6685 (line_corrections::add_hint): Likewise.
6686 (layout::print_trailing_fixits): Likewise.
6687 (selftest::test_display_widths): New.
6688 (selftest::test_layout_x_offset_display_utf8): Update to use
6689 policy rather than tabstop.
6690 (selftest::test_one_liner_labels_utf8): Add test of escaping
6692 (selftest::test_diagnostic_show_locus_one_liner_utf8): Update to
6693 use policy rather than tabstop.
6694 (selftest::test_overlapped_fixit_printing): Likewise.
6695 (selftest::test_overlapped_fixit_printing_utf8): Likewise.
6696 (selftest::test_overlapped_fixit_printing_2): Likewise.
6697 (selftest::test_tab_expansion): Likewise.
6698 (selftest::test_escaping_bytes_1): New.
6699 (selftest::test_escaping_bytes_2): New.
6700 (selftest::diagnostic_show_locus_c_tests): Call the new tests.
6701 * diagnostic.c (diagnostic_initialize): Initialize
6702 context->escape_format.
6703 (convert_column_unit): Update to use default character width policy.
6704 (selftest::test_diagnostic_get_location_text): Likewise.
6705 * diagnostic.h (enum diagnostics_escape_format): New enum.
6706 (diagnostic_context::escape_format): New field.
6707 * doc/invoke.texi (-fdiagnostics-escape-format=): New option.
6708 (-fdiagnostics-format=): Add "escape-source" attribute to examples
6709 of JSON output, and document it.
6710 * input.c (location_compute_display_column): Pass in "policy"
6711 rather than "tabstop", passing to
6712 cpp_byte_column_to_display_column.
6713 (selftest::test_cpp_utf8): Update to use cpp_char_column_policy.
6714 * input.h (class cpp_char_column_policy): New forward decl.
6715 (location_compute_display_column): Pass in "policy" rather than
6717 * opts.c (common_handle_option): Handle
6718 OPT_fdiagnostics_escape_format_.
6719 * selftest.c (temp_source_file::temp_source_file): New ctor
6720 overload taking a size_t.
6721 * selftest.h (temp_source_file::temp_source_file): Likewise.
6723 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
6725 * dbgcnt.def: Add debug counter for back_thread[12] and
6726 back_threadfull[12].
6727 * passes.def: Pass "first" argument to each back threading pass.
6728 * tree-ssa-threadbackward.c (back_threader::back_threader): Add
6730 (back_threader::debug_counter): New.
6731 (back_threader::maybe_register_path): Call debug_counter.
6733 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
6735 * tree-ssa-threadbackward.c (BT_NONE): New.
6738 (back_threader::back_threader): Add flags.
6739 Move loop initialization here.
6740 (back_threader::~back_threader): New.
6741 (back_threader::find_taken_edge_switch): Change solver and ranger
6743 (back_threader::find_taken_edge_cond): Same.
6744 (back_threader::find_paths_to_names): Same.
6745 (back_threader::find_paths): Same.
6746 (back_threader::dump): Same.
6747 (try_thread_blocks): Merge into thread_blocks.
6748 (back_threader::thread_blocks): New.
6749 (do_early_thread_jumps): Merge into thread_blocks.
6750 (do_thread_jumps): Merge into thread_blocks.
6751 (back_threader::thread_through_all_blocks): Remove.
6753 2021-11-01 Andrew MacLeod <amacleod@redhat.com>
6755 PR tree-optimization/103003
6756 * value-relation.cc (dom_oracle::register_relation): If the 2
6757 ssa names are the same, don't register any relation.
6759 2021-11-01 Dan Li <ashimida@linux.alibaba.com>
6761 * config/aarch64/aarch64.c (aarch64_expand_epilogue): Remove
6762 redundant check for calls_eh_return.
6763 * config/aarch64/aarch64.md (*do_return): Likewise.
6765 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
6767 * cfghooks.c (cfg_hook_duplicate_loop_to_header_edge): Rename
6768 duplicate_loop_to_header_edge to
6769 duplicate_loop_body_to_header_edge.
6770 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
6771 * cfghooks.h (struct cfg_hooks): Likewise.
6772 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
6773 * cfgloopmanip.c (duplicate_loop_body_to_header_edge): Likewise.
6774 (clone_loop_to_header_edge): Likewise.
6775 * cfgloopmanip.h (duplicate_loop_body_to_header_edge): Likewise.
6776 * cfgrtl.c (struct cfg_hooks): Likewise.
6777 * doc/loop.texi: Likewise.
6778 * loop-unroll.c (unroll_loop_constant_iterations): Likewise.
6779 (unroll_loop_runtime_iterations): Likewise.
6780 (unroll_loop_stupid): Likewise.
6781 (apply_opt_in_copies): Likewise.
6782 * tree-cfg.c (struct cfg_hooks): Likewise.
6783 * tree-ssa-loop-ivcanon.c (try_unroll_loop_completely): Likewise.
6784 (try_peel_loop): Likewise.
6785 * tree-ssa-loop-manip.c (copy_phi_node_args): Likewise.
6786 (gimple_duplicate_loop_body_to_header_edge): Likewise.
6787 (tree_transform_and_unroll_loop): Likewise.
6788 * tree-ssa-loop-manip.h (gimple_duplicate_loop_body_to_header_edge):
6791 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
6793 * cfgloopmanip.c (loop_version): Refactor loopify to
6794 loop_version. Move condition generation after loopify.
6796 * cfgloopmanip.h (loopify): Delete.
6798 2021-10-31 Jan Hubicka <hubicka@ucw.cz>
6800 * ipa-fnsummary.c: Include tree-dfa.h.
6801 (points_to_local_or_readonly_memory_p): Return true on return
6803 * ipa-modref.c (analyze_ssa_name_flags): Fix handling of copy
6806 2021-10-30 Tobias Burnus <tobias@codesourcery.com>
6808 PR middle-end/102972
6809 * omp-low.c (omp_runtime_api_call): Use DECL_ASSEMBLER_NAME to get
6810 internal Fortran name; new permit_num_teams arg to permit
6811 omp_get_num_teams and omp_get_team_num.
6812 (scan_omp_1_stmt): Update call to it, add missing call for
6813 reverse offload, and check for strictly nested API calls in teams.
6815 2021-10-30 Jakub Jelinek <jakub@redhat.com>
6817 * gimplify.c (gimplify_omp_for): Diagnose threadprivate iterators.
6819 2021-10-29 David Malcolm <dmalcolm@redhat.com>
6821 * selftest.c (assert_streq): Add newlines when emitting non-equal
6824 2021-10-29 David Malcolm <dmalcolm@redhat.com>
6826 * Makefile.in: Fix syntax for reference to LIBDEPS in
6829 2021-10-29 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
6831 * doc/install.texi: Bump required minimum DejaGnu version.
6833 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
6834 Andrew MacLeod <amacleod@redhat.com>
6836 * value-relation.cc (path_oracle::killing_def): Add a
6837 self-equivalence so we don't look to the root oracle.
6839 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
6841 * passes.def: Replace the pass_thread_jumps before VRP* with
6842 pass_thread_jumps_full. Remove all pass_vrp_threader instances.
6843 * tree-ssa-threadbackward.c (pass_data_thread_jumps_full):
6844 Remove hyphen from "thread-full" name.
6846 2021-10-29 Andrew MacLeod <amacleod@redhat.com>
6848 PR tree-optimization/102983
6849 * gimple-range-cache.h (propagate_updated_value): Make public.
6850 * gimple-range.cc (gimple_ranger::range_of_stmt): Propagate exports
6851 when processing gcond stmts.
6853 2021-10-29 Jan Hubicka <hubicka@ucw.cz>
6855 * gimple.c (gimple_call_retslot_flags): New function.
6856 * gimple.h (gimple_call_retslot_flags): Declare.
6857 * ipa-modref.c: Include tree-cfg.h.
6858 (struct escape_entry): Turn parm_index to signed.
6859 (modref_summary_lto::modref_summary_lto): Add retslot_flags.
6860 (modref_summary::modref_summary): Initialize retslot_flags.
6861 (struct modref_summary_lto): Likewise.
6862 (modref_summary::useful_p): Check retslot_flags.
6863 (modref_summary_lto::useful_p): Likewise.
6864 (modref_summary::dump): Dump retslot_flags.
6865 (modref_summary_lto::dump): Likewise.
6866 (struct escape_point): Add hidden_args enum.
6867 (analyze_ssa_name_flags): Ignore return slot return;
6868 use gimple_call_retslot_flags.
6869 (record_escape_points): Break out from ...
6870 (analyze_parms): ... here; handle retslot_flags.
6871 (modref_summaries::duplicate): Duplicate retslot_flags.
6872 (modref_summaries_lto::duplicate): Likewise.
6873 (modref_write_escape_summary): Stream parm_index as signed.
6874 (modref_read_escape_summary): Likewise.
6875 (modref_write): Stream retslot_flags.
6876 (read_section): Likewise.
6877 (struct escape_map): Fix typo in comment.
6878 (update_escape_summary_1): Fix whitespace.
6879 (ipa_merge_modref_summary_after_inlining): Drop retslot_flags.
6880 (modref_merge_call_site_flags): Merge retslot_flags.
6881 * ipa-modref.h (struct modref_summary): Add retslot_flags.
6882 * tree-ssa-structalias.c (handle_rhs_call): Handle retslot_flags.
6884 2021-10-29 Tamar Christina <tamar.christina@arm.com>
6886 PR tree-optimization/102977
6887 * tree-vect-slp-patterns.c (vect_match_call_p): Remove.
6888 (vect_detect_pair_op): Add crosslane check.
6889 (vect_match_call_complex_mla): Remove.
6890 (class complex_mul_pattern): Update comment.
6891 (complex_mul_pattern::matches): Update detection.
6892 (class complex_fma_pattern): Remove.
6893 (complex_fma_pattern::matches): Remove.
6894 (complex_fma_pattern::recognize): Remove.
6895 (complex_fma_pattern::build): Remove.
6896 (class complex_fms_pattern): Update comment.
6897 (complex_fms_pattern::matches): Remove.
6898 (complex_operations_pattern::recognize): Remove complex_fma_pattern
6900 2021-10-29 Jakub Jelinek <jakub@redhat.com>
6902 * gimple-fold.c (gimple_fold_builtin_memset): Copy over location from
6905 2021-10-29 Xionghu Luo <luoxhu@linux.ibm.com>
6908 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const): Add
6909 patterns match and emit for VSX xxpermdi.
6911 2021-10-29 liuhongt <hongtao.liu@intel.com>
6914 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF): New
6916 (V16HF_FTYPE_V16HF): Ditto.
6917 (V32HF_FTYPE_V32HF): Ditto.
6918 (V8HF_FTYPE_V8HF_ROUND): Ditto.
6919 (V16HF_FTYPE_V16HF_ROUND): Ditto.
6920 (V32HF_FTYPE_V32HF_ROUND): Ditto.
6921 * config/i386/i386-builtin.def ( IX86_BUILTIN_FLOORPH,
6922 IX86_BUILTIN_CEILPH, IX86_BUILTIN_TRUNCPH,
6923 IX86_BUILTIN_FLOORPH256, IX86_BUILTIN_CEILPH256,
6924 IX86_BUILTIN_TRUNCPH256, IX86_BUILTIN_FLOORPH512,
6925 IX86_BUILTIN_CEILPH512, IX86_BUILTIN_TRUNCPH512): New builtin.
6926 * config/i386/i386-builtins.c
6927 (ix86_builtin_vectorized_function): Enable vectorization for
6928 HFmode FLOOR/CEIL/TRUNC operation.
6929 * config/i386/i386-expand.c (ix86_expand_args_builtin): Handle
6931 * config/i386/sse.md (rint<mode>2, nearbyint<mode>2): Extend
6934 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6935 Andrew MacLeod <amacleod@redhat.com>
6937 * value-relation.cc (path_oracle::killing_def): Walk the
6938 equivalency list and remove SSA from any equivalencies.
6940 2021-10-28 Stafford Horne <shorne@gmail.com>
6942 * config/or1k/or1k.h (PROFILE_HOOK): Add return address argument
6945 2021-10-28 Jakub Jelinek <jakub@redhat.com>
6947 PR tree-optimization/102951
6948 * fold-const.h (address_compare): Declare.
6949 * fold-const.c (address_compare): New function.
6950 * match.pd (cmp (convert1?@2 addr@0) (convert2? addr@1)): Use
6951 address_compare helper.
6952 (minmax cmp (convert1?@2 addr@0) (convert2?@3 addr@1)): New
6955 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
6957 * vr-values.c (simplify_using_ranges::fold_cond): Change fold message.
6959 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
6961 PR tree-optimization/102940
6962 * tree-vrp.c (execute_ranger_vrp): Reset scev.
6964 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
6966 * config/nios2/linux.h (MUSL_DYNAMIC_LINKER): Add musl linker
6968 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
6970 * configure: Regenerate.
6971 * configure.ac: Use CPPFLAGS_FOR_BUILD for GMPINC
6973 2021-10-28 Eric Botcazou <ebotcazou@adacore.com>
6975 * doc/invoke.texi (%X): Remove obsolete reference to -Wl.
6977 2021-10-28 Richard Biener <rguenther@suse.de>
6980 * fold-const.c (fold_convert_const): Avoid int to float
6981 constant folding with -frounding-math and inexact result.
6982 * simplify-rtx.c (simplify_const_unary_operation): Likewise
6983 for both float and unsigned_float.
6985 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6987 * tree-ssa-threadbackward.c
6988 (back_threader::find_taken_edge_switch): Use find_case_label_range
6989 instead of find_taken_edge.
6991 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6993 * tree-ssa-threadbackward.c (class back_threader_registry):
6994 Inherit from back_jt_path_registry.
6995 (back_threader_registry::thread_through_all_blocks): Remove.
6996 (back_threader_registry::register_path): Remove
6997 m_lowlevel_registry prefix.
6999 2021-10-28 Richard Biener <rguenther@suse.de>
7002 * fold-const.c (fold_convert_const_real_from_real): Honor
7003 -frounding-math if the conversion is not exact.
7004 * simplify-rtx.c (simplify_const_unary_operation): Do not
7005 simplify FLOAT_TRUNCATE with sign dependent rounding.
7007 2021-10-28 Richard Biener <rguenther@suse.de>
7009 PR tree-optimization/102949
7010 * tree-vect-stmts.c (ensure_base_align): Look at the
7011 dr_info of a group leader and assert we are looking at
7012 one with analyzed alignment.
7014 2021-10-28 Kewen Lin <linkw@linux.ibm.com>
7017 * config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Consider
7018 V1T1 mode for unaligned load and store.
7020 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
7022 * config/riscv/riscv.md (zero_extendsidi2_internal): Allow ZBB
7025 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
7027 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add `i` to
7030 2021-10-28 Alexandre Oliva <oliva@adacore.com>
7032 * common.opt (fharden-compares): New.
7033 (fharden-conditional-branches): New.
7034 * doc/invoke.texi: Document new options.
7035 * gimple-harden-conditionals.cc: New.
7036 * Makefile.in (OBJS): Build it.
7037 * passes.def: Add new passes.
7038 * tree-pass.h (make_pass_harden_compares): Declare.
7039 (make_pass_harden_conditional_branches): Declare.
7041 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
7044 * config/rs6000/altivec.md: Add vsx register constraints.
7045 * config/rs6000/vsx.md (vsx_xxsel<mode>): Delete.
7046 (vsx_xxsel<mode>2): Likewise.
7047 (vsx_xxsel<mode>3): Likewise.
7048 (vsx_xxsel<mode>4): Likewise.
7050 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
7053 * config/rs6000/altivec.md (*altivec_vsel<mode>): Change to ...
7054 (altivec_vsel<mode>): ... this and update define.
7055 (*altivec_vsel<mode>_uns): Delete.
7056 (altivec_vsel<mode>2): New define_insn.
7057 (altivec_vsel<mode>3): Likewise.
7058 (altivec_vsel<mode>4): Likewise.
7059 * config/rs6000/rs6000-call.c (altivec_expand_vec_sel_builtin): New.
7060 (altivec_expand_builtin): Call altivec_expand_vec_sel_builtin to expand
7062 * config/rs6000/rs6000.c (rs6000_emit_vector_cond_expr): Use bit-wise
7063 selection instead of per element.
7064 * config/rs6000/vector.md:
7065 * config/rs6000/vsx.md (*vsx_xxsel<mode>): Change to ...
7066 (vsx_xxsel<mode>): ... this and update define.
7067 (*vsx_xxsel<mode>_uns): Delete.
7068 (vsx_xxsel<mode>2): New define_insn.
7069 (vsx_xxsel<mode>3): Likewise.
7070 (vsx_xxsel<mode>4): Likewise.
7072 2021-10-28 Hongyu Wang <hongyu.wang@intel.com>
7074 * config/i386/i386.c (use_rsqrt_p): Add mode parameter, enable
7075 HFmode rsqrt without TARGET_SSE_MATH.
7076 (ix86_optab_supported_p): Refactor rint, adjust floor, ceil,
7077 btrunc condition to be restricted by -ftrapping-math, adjust
7078 use_rsqrt_p function call.
7079 * config/i386/i386.md (rcphf2): New define_insn.
7080 (rsqrthf2): Likewise.
7081 * config/i386/sse.md (div<mode>3): Change VF2H to VF2.
7082 (div<mode>3): New expander for HF mode.
7083 (rsqrt<mode>2): Likewise.
7084 (*avx512fp16_vmrcpv8hf2): New define_insn for rpad pass.
7085 (*avx512fp16_vmrsqrtv8hf2): Likewise.
7087 2021-10-27 Saagar Jha <saagar@saagarjha.com>
7089 * config.gcc: Adjust for Darwin21.
7090 * config/darwin-c.c (macosx_version_as_macro): Likewise.
7091 * config/darwin-driver.c (validate_macosx_version_min):
7093 (darwin_find_version_from_kernel): Likewise.
7095 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
7097 * gimple-range-path.cc
7098 (path_range_query::range_defined_in_block): Call killing_def.
7100 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
7102 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Dump
7103 operands as well as relation.
7104 * gimple-range-path.cc
7105 (path_range_query::compute_ranges_in_block): Compute PHI relations
7106 first. Compute outgoing relations at the end.
7107 (path_range_query::compute_ranges): Remove call to compute_relations.
7108 (path_range_query::compute_relations): Remove.
7109 (path_range_query::maybe_register_phi_relation): New.
7110 (path_range_query::compute_phi_relations): Abstract out
7111 registering one PHI relation to...
7112 (path_range_query::compute_outgoing_relations): ...here.
7113 * gimple-range-path.h (class path_range_query): Remove
7115 Add maybe_register_phi_relation.
7117 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
7118 Andrew MacLeod <amacleod@redhat.com>
7120 * value-relation.cc (path_oracle::killing_def): Kill second
7123 2021-10-27 Martin Jambor <mjambor@suse.cz>
7125 * ipa-cp.c (good_cloning_opportunity_p): Decide whether to use
7126 profile feedback depending on their local availability.
7128 2021-10-27 Martin Jambor <mjambor@suse.cz>
7130 * params.opt (param_ipa_cp_profile_count_base): New parameter.
7131 * doc/invoke.texi (Optimize Options): Add entry for
7132 ipa-cp-profile-count-base.
7133 * ipa-cp.c (max_count): Replace with base_count, replace all
7134 occurrences too, unless otherwise stated.
7135 (ipcp_cloning_candidate_p): identify mostly-directly called
7136 functions based on their counts, not max_count.
7137 (compare_edge_profile_counts): New function.
7138 (ipcp_propagate_stage): Instead of setting max_count, find the
7139 appropriate edge count in a sorted vector of counts of eligible
7140 edges and make it the base_count.
7142 2021-10-27 Martin Jambor <mjambor@suse.cz>
7144 * ipa-cp.c (struct caller_statistics): New fields rec_count_sum,
7145 n_nonrec_calls and itself, document all fields.
7146 (init_caller_stats): Initialize the above new fields.
7147 (gather_caller_stats): Gather self-recursive counts and calls number.
7148 (get_info_about_necessary_edges): Gather counts of self-recursive and
7149 other edges bringing in the requested value separately.
7150 (dump_profile_updates): Rework to dump info about a single node only.
7151 (lenient_count_portion_handling): New function.
7152 (struct gather_other_count_struct): New type.
7153 (gather_count_of_non_rec_edges): New function.
7154 (struct desc_incoming_count_struct): New type.
7155 (analyze_clone_icoming_counts): New function.
7156 (adjust_clone_incoming_counts): Likewise.
7157 (update_counts_for_self_gen_clones): Likewise.
7158 (update_profiling_info): Rewritten.
7159 (update_specialized_profile): Adjust call to dump_profile_updates.
7160 (create_specialized_node): Do not update profiling info.
7161 (decide_about_value): New parameter self_gen_clones, either push new
7162 clones into it or updat their profile counts. For self-recursively
7163 generated values, use a portion of the node count instead of count
7164 from self-recursive edges to estimate goodness.
7165 (decide_whether_version_node): Gather clones for self-generated values
7166 in a new vector, update their profiles at once at the end.
7168 2021-10-27 Richard Biener <rguenther@suse.de>
7170 * tree-vectorizer.c (vect_transform_loops): New function,
7172 (try_vectorize_loop_1): ... here. Simplify as epilogues
7173 are now fully handled in the split part.
7175 2021-10-27 Jakub Jelinek <jakub@redhat.com>
7177 * omp-expand.c (expand_omp_for_init_counts): Handle non-rectangular
7178 iterators with pointer types.
7179 (expand_omp_for_init_vars, extract_omp_for_update_vars): Likewise.
7181 2021-10-26 Martin Sebor <msebor@redhat.com>
7183 PR tree-optimization/102238
7184 PR tree-optimization/102919
7185 * gimple-ssa-sprintf.c (get_string_length): Add an argument.
7186 (array_elt_at_offset): Move to pointer-query.
7187 (set_aggregate_size_and_offset): New function.
7188 (field_at_offset): Move to pointer-query.
7189 (get_origin_and_offset): Rename...
7190 (get_origin_and_offset_r): this. Add an argument. Make aggregate
7191 handling more robust.
7192 (get_origin_and_offset): New.
7193 (alias_offset): Add an argument.
7194 (format_string): Use subobject size determined by get_origin_and_offset.
7195 * pointer-query.cc (field_at_offset): Move from gimple-ssa-sprintf.c.
7196 Improve/correct handling of aggregates.
7197 (array_elt_at_offset): Same.
7198 * pointer-query.h (field_at_offset): Declare.
7199 (array_elt_at_offset): Declare.
7201 2021-10-26 Martin Sebor <msebor@redhat.com>
7203 * builtins.c (check_strncat_sizes): Pass access_data ctor additional
7205 (expand_builtin_memcmp): Move code to gimple-ssa-warn-access.cc.
7206 (expand_builtin_fork_or_exec): Same.
7207 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Pass
7208 compute_objsize additional arguments.
7209 (inbounds_memaccess_p): Same.
7210 (array_bounds_checker::check_array_bounds): Add an assert. Stash
7211 statement in a member.
7212 (check_array_bounds_dom_walker::before_dom_children): Same.
7213 * gimple-array-bounds.h (array_bounds_checker::m_stmt): New member.
7214 * gimple-ssa-sprintf.c (get_destination_size): Add an argument.
7215 (handle_printf_call): Pass a new argument.
7216 * gimple-ssa-warn-access.cc (get_size_range): Add an argument.
7217 (check_access): Add an argument and pass it along to callees.
7218 (check_read_access): Make a member function.
7219 (pass_waccess::check_strcat): Pass access_data ctor additional
7221 (pass_waccess::check_strncat): Same.
7222 (pass_waccess::check_stxcpy): Same.
7223 (pass_waccess::check_stxncpy): Same.
7224 (pass_waccess::check_strncmp): Same.
7225 (pass_waccess::check_read_access): Same.
7226 (pass_waccess::check_builtin): Same.
7227 (pass_waccess::maybe_check_access_sizes): Same.
7228 (pass_waccess::maybe_check_dealloc_call): Same.
7229 * gimple-ssa-warn-access.h (check_read_access): Declare a new
7231 * pointer-query.cc (compute_objsize_r): Add an argument.
7232 (gimple_call_return_array): Same.
7233 (gimple_call_alloc_size): Same.
7234 (access_ref::access_ref): Same.
7235 (access_ref::get_ref): Same.
7236 (pointer_query::get_ref): Same.
7237 (handle_min_max_size): Pass an arguments to callees.
7238 (handle_array_ref): Add an argument.
7239 (handle_mem_ref): Same.
7240 (compute_objsize): Same.
7241 * pointer-query.h (struct access_ref): Adjust signatures.
7242 (struct access_data): Same.
7243 (gimple_call_alloc_size): Add an argument.
7244 (gimple_parm_array_size): Same.
7245 (compute_objsize): Same.
7246 * tree-ssa-strlen.c (strlen_pass::adjust_last_stmt): Pass an additional
7247 argument to compute_objsize.
7248 (strlen_pass::maybe_warn_overflow): Same.
7249 (maybe_diag_stxncpy_trunc): Same.
7251 2021-10-26 Martin Sebor <msebor@redhat.com>
7253 PR middle-end/102453
7254 * gimple-ssa-warn-access.cc (pass_waccess::check_atomic_builtin): New.
7255 (pass_waccess::check_atomic_builtin): Call it.
7257 2021-10-26 Vladimir N. Makarov <vmakarov@redhat.com>
7259 PR rtl-optimization/102842
7260 * lra-constraints.c (match_reload): Ignore out in checking values
7262 (curr_insn_transform): Collect outputs before doing reloads of operands.
7264 2021-10-26 Paul A. Clarke <pc@us.ibm.com>
7267 * config/rs6000/x86intrin.h: Move some included headers to new
7268 headers. Include new immintrin.h instead of those headers.
7269 * config/rs6000/immintrin.h: New.
7270 * config/rs6000/x86gprintrin.h: New.
7271 * config.gcc (powerpc*-*-*): Add new headers to extra_headers.
7273 2021-10-26 Richard Biener <rguenther@suse.de>
7275 * tree-vectorizer.h (vect_create_addr_base_for_vector_ref):
7276 Remove byte_offset parameter.
7277 (vect_create_data_ref_ptr): Likewise.
7278 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
7280 (vect_create_data_ref_ptr): Likewise.
7281 * tree-vect-stmts.c (vectorizable_store): Adjust.
7282 (vectorizable_load): Likewise.
7284 2021-10-26 Richard Biener <rguenther@suse.de>
7286 PR tree-optimization/96109
7287 * tree-vectorizer.h (dr_misalignment): Add optional offset
7289 * tree-vect-data-refs.c (dr_misalignment): Likewise. Remove
7290 offset applied for negative stride accesses.
7291 (vect_enhance_data_refs_alignment): Compute negative stride
7292 access offset and pass it to dr_misalignment.
7293 * tree-vect-stmts.c (get_negative_load_store_type): Pass
7294 negative offset to dr_misalignment.
7295 (get_group_load_store_type): Likewise.
7296 (get_load_store_type): Likewise.
7297 (vectorizable_store): Remove asserts about alignment.
7298 (vectorizable_load): Likewise.
7300 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
7302 PR tree-optimization/102897
7303 * tree-ssa-forwprop.c (simplify_permutation): Remove a wrong assertion.
7305 2021-10-26 Richard Biener <rguenther@suse.de>
7307 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
7308 Take offset in bytes.
7309 (vect_create_data_ref_ptr): Likewise.
7310 * tree-vect-loop-manip.c (get_misalign_in_elems): Multiply
7311 offset by element size.
7312 (vect_create_cond_for_align_checks): Likewise.
7313 * tree-vect-stmts.c (get_negative_load_store_type): Likewise.
7314 (vectorizable_load): Remove duplicate leftover from merge
7317 2021-10-26 Roger Sayle <roger@nextmovesoftware.com>
7319 * config/i386/i386-expand.c (ix86_expand_v1ti_shift): New helper
7320 function to expand V1TI mode logical shifts by integer constants.
7321 (ix86_expand_v1ti_rotate): New helper function to expand V1TI
7322 mode rotations by integer constants.
7323 * config/i386/i386-protos.h (ix86_expand_v1ti_shift,
7324 ix86_expand_v1ti_rotate): Prototype new functions here.
7325 * config/i386/sse.md (ashlv1ti3, lshrv1ti3, rotlv1ti3, rotrv1ti3):
7326 New TARGET_SSE2 expanders to implement V1TI shifts and rotations.
7328 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
7330 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
7331 Avoid threading circular paths.
7333 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
7335 * tree-ssa-threadbackward.c (back_threader::resolve_phi):
7336 Attempt to resolve all incoming paths to a PHI.
7337 (back_threader::resolve_def): Always return true for PHIs.
7339 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
7341 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
7342 Always try to resolve path without looking back.
7343 * tree-ssa-threadupdate.c (dump_jump_thread): Indidicate whether
7344 edge is a back edge.
7346 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
7348 PR tree-optimization/102789
7349 * tree-vect-loop-manip.c (vect_update_inits_of_drs): Do not
7350 update inits of simd_lane_access.
7352 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
7354 * tree-vrp.c (vrp_simplify_cond_using_ranges): Add return type and
7355 move to vr-values.c.
7356 (simplify_casted_conds): Move to vrp_folder class.
7357 (execute_vrp): Call via vrp_folder now.
7358 * vr-values.c (simplify_cond_using_ranges_1): Call simplify_casted_cond.
7359 (simplify_using_ranges::simplify_casted_cond): Relocate from tree-vrp.c.
7360 * vr-values.h (simplify_casted_cond): Add prototype.
7362 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
7364 * tree-vrp.c (rvrp_folder::fold_stmt): If simplification fails, try
7367 2021-10-25 Paul A. Clarke <pc@us.ibm.com>
7369 * config/rs6000/smmintrin.h (_mm_testz_si128): Add "extern" to
7371 (_mm_testc_si128): Likewise.
7372 (_mm_testnzc_si128): Likewise.
7373 (_mm_blend_ps): Likewise.
7374 (_mm_blendv_ps): Likewise.
7375 (_mm_blend_pd): Likewise.
7376 (_mm_blendv_pd): Likewise.
7377 (_mm_ceil_pd): Likewise.
7378 (_mm_ceil_sd): Likewise.
7379 (_mm_ceil_ps): Likewise.
7380 (_mm_ceil_ss): Likewise.
7381 (_mm_floor_pd): Likewise.
7382 (_mm_floor_sd): Likewise.
7383 (_mm_floor_ps): Likewise.
7384 (_mm_floor_ss): Likewise.
7385 (_mm_minpos_epu16): Likewise.
7386 (_mm_mul_epi32): Likewise.
7387 (_mm_cvtepi8_epi16): Likewise.
7388 (_mm_packus_epi32): Likewise.
7389 (_mm_cmpgt_epi64): Likewise.
7391 2021-10-25 Roger Sayle <roger@nextmovesoftware.com>
7393 * simplify-rtx.c (simplify_binary_operation_1) [SS_ASHIFT]: Simplify
7394 shifts of the mode's smin_value and smax_value when the bit count
7395 operand doesn't have side-effects.
7396 [US_ASHIFT]: Likewise, simplify shifts of the mode's umax_value
7397 when the bit count operand doesn't have side-effects.
7398 (simplify_const_binary_operation) [SS_ASHIFT, US_ASHIFT]: Perform
7399 compile-time evaluation of saturating left shifts with constant
7402 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
7404 * gimple-range.cc (gimple_ranger::export_global_ranges): Remove check
7407 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
7409 * flag-types.h (enum ranger_debug): Adjust values.
7410 * params.opt (ranger_debug): Ditto.
7412 2021-10-25 Martin Jambor <mjambor@suse.cz>
7414 PR tree-optimization/102886
7415 * tree-sra.c (totally_scalarize_subtree): Fix the out of
7418 2021-10-25 Andrew Pinski <apinski@marvell.com>
7420 * tree-ssa-dce.c (simple_dce_from_worklist):
7421 Check stmt_unremovable_because_of_non_call_eh_p also
7422 before removing the statement.
7424 2021-10-25 Richard Biener <rguenther@suse.de>
7426 PR tree-optimization/102905
7427 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
7428 Use vect_supportable_dr_alignment again to determine whether
7429 an access is supported when not aligned.
7431 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
7433 * config/riscv/riscv.c (riscv_rtx_costs): Handle cost model
7436 2021-10-25 Jim Wilson <jimw@sifive.com>
7437 Kito Cheng <kito.cheng@sifive.com>
7438 Jia-Wei Chen <jiawei@iscas.ac.cn>
7439 Shi-Hua Liao <shihua@iscas.ac.cn>
7441 * config/riscv/bitmanip.md (shiftm1): New.
7442 (*bset<mode>): Ditto.
7443 (*bset<mode>_mask): Ditto.
7444 (*bset<mode>_1): Ditto.
7445 (*bset<mode>_1_mask): Ditto.
7446 (*bseti<mode>): Ditto.
7447 (*bclr<mode>): Ditto.
7448 (*bclri<mode>): Ditto.
7449 (*binv<mode>): Ditto.
7450 (*binvi<mode>): Ditto.
7451 (*bext<mode>): Ditto.
7453 * config/riscv/predicates.md (splittable_const_int_operand):
7455 (single_bit_mask_operand): New.
7456 (not_single_bit_mask_operand): Ditto.
7457 (const31_operand): Ditto.
7458 (const63_operand): Ditto.
7459 * config/riscv/riscv.c (riscv_build_integer_1): Handle bseti.
7460 (riscv_output_move): Ditto.
7461 (riscv_print_operand): Handle new operand type: T and S.
7462 * config/riscv/riscv.h (SINGLE_BIT_MASK_OPERAND): New.
7464 2021-10-25 Jim Wilson <jimw@sifive.com>
7466 * config/riscv/riscv.c (riscv_build_integer_1): Build integer
7469 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
7471 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
7473 (riscv_rtx_costs): Ditto.
7475 2021-10-25 Jim Wilson <jimw@sifive.com>
7476 Kito Cheng <kito.cheng@sifive.com>
7477 Jia-Wei Chen <jiawei@iscas.ac.cn>
7479 * config/riscv/bitmanip.md (bitmanip_bitwise): New.
7480 (bitmanip_minmax): New.
7481 (clz_ctz_pcnt): New.
7482 (bitmanip_optab): New.
7483 (bitmanip_insn): New.
7484 (*<optab>_not<mode>): New.
7485 (*xor_not<mode>): New.
7486 (<bitmanip_optab>si2): New.
7487 (*<bitmanip_optab>disi2): New.
7488 (<bitmanip_optab>di2): New.
7489 (*zero_extendhi<GPR:mode>2_bitmanip): New.
7490 (*extend<SHORT:mode><SUPERQI:mode>2_zbb): New.
7491 (*zero_extendhi<GPR:mode>2_zbb): New.
7494 (rotrsi3_sext): New.
7497 (rotlsi3_sext): New.
7498 (bswap<mode>2): New.
7499 (<bitmanip_optab><mode>3): New.
7500 * config/riscv/riscv.md (type): Add rotate.
7501 (zero_extendhi<GPR:mode>2): Change to define_expand pattern.
7502 (*zero_extendhi<GPR:mode>2): New.
7503 (extend<SHORT:mode><SUPERQI:mode>2): Change to define_expand pattern.
7504 (*extend<SHORT:mode><SUPERQI:mode>2): New.
7506 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
7508 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
7510 (riscv_rtx_costs): Ditto.
7512 2021-10-25 Jim Wilson <jimw@sifive.com>
7513 Kito Cheng <kito.cheng@sifive.com>
7514 Jia-Wei Chen <jiawei@iscas.ac.cn>
7516 * config/riscv/bitmanip.md (*zero_extendsidi2_bitmanip): New.
7521 (riscv_rtx_costs): Ditto.
7522 * config/riscv/riscv.md: Include bitmanip.md
7523 (type): Add bitmanip bype.
7524 (zero_extendsidi2): Change to define_expand pattern.
7525 (*zero_extendsidi2_internal): New.
7526 (zero_extendsidi2_shifted): Disable for ZBA.
7528 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
7530 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
7531 Add zba, zbb, zbc and zbs.
7532 (riscv_ext_flag_table): Ditto.
7533 * config/riscv/riscv-opts.h (MASK_ZBA): New.
7537 (TARGET_ZBA): Ditto.
7538 (TARGET_ZBB): Ditto.
7539 (TARGET_ZBC): Ditto.
7540 (TARGET_ZBS): Ditto.
7541 * config/riscv/riscv.opt (riscv_zb_subext): New.
7543 2021-10-25 liuhongt <hongtao.liu@intel.com>
7546 * match.pd: Simplify (_Float16) sqrtf((float) a) to .SQRT(a)
7547 when direct_internal_fn_supported_p, similar for sqrt/sqrtl.
7549 2021-10-25 Richard Biener <rguenther@suse.de>
7551 PR tree-optimization/102920
7552 * tree-ssa-sccvn.h (expressions_equal_p): Add argument
7553 controlling VN_TOP matching behavior.
7554 * tree-ssa-sccvn.c (expressions_equal_p): Likewise.
7555 (vn_phi_eq): Do not optimistically match VN_TOP.
7557 2021-10-25 konglin1 <lingling.kong@intel.com>
7559 * config/i386/sse.md (fma_<mode>_fadd_fmul): Add new
7560 define_insn_and_split.
7561 (fma_<mode>_fadd_fcmul):Likewise
7562 (fma_<complexopname>_<mode>_fma_zero):Likewise
7564 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
7566 * config/pa/pa-d.c (pa_d_handle_target_float_abi): Don't check
7567 TARGET_DISABLE_FPREGS.
7568 * config/pa/pa.c (fix_range): Use MASK_SOFT_FLOAT instead of
7569 MASK_DISABLE_FPREGS.
7570 (hppa_rtx_costs): Don't check TARGET_DISABLE_FPREGS. Adjust
7571 cost of hardware integer multiplication.
7572 (pa_conditional_register_usage): Don't check TARGET_DISABLE_FPREGS.
7573 * config/pa/pa.h (INT14_OK_STRICT): Likewise.
7574 * config/pa/pa.md: Don't check TARGET_DISABLE_FPREGS. Check
7575 TARGET_SOFT_FLOAT in patterns that use xmpyu instruction.
7576 * config/pa/pa.opt (mdisable-fpregs): Change target mask to
7577 SOFT_FLOAT. Revise comment.
7578 (msoft-float): New option.
7580 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
7582 * config/pa/pa.md: Don't use 'G' constraint in integer move patterns.
7584 2021-10-24 Gerald Pfeifer <gerald@pfeifer.com>
7586 * doc/install.texi (Specific): Remove obsolete details
7587 around GNU/Linux on Itanium.
7588 (Specific): Remove reference to Windows for Itanium.
7590 2021-10-23 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
7592 * config/i386/x86-tune-sched-bd.c (dispatch_group): Commentary
7595 2021-10-23 Jan Hubicka <hubicka@ucw.cz>
7597 * tree-ssa-structalias.c (compute_points_to_sets): Cleanup.
7599 2021-10-23 Roger Sayle <roger@nextmovesoftware.com>
7601 * config/i386/sse.md (<any_logic>v1ti3): New define_insn to
7602 implement V1TImode AND, IOR and XOR on TARGET_SSE2 (and above).
7603 (one_cmplv1ti2): New define expand.
7605 2021-10-22 Eric Gallager <egallager@gcc.gnu.org>
7608 * Makefile.in: Handle dvidir and install-dvi target.
7609 * configure: Regenerate.
7610 * configure.ac: Add install-dvi to target_list.
7612 2021-10-22 Gerald Pfeifer <gerald@pfeifer.com>
7614 * doc/install.texi (Binaries): Convert mingw-w64.org to https.
7617 2021-10-22 Richard Biener <rguenther@suse.de>
7619 PR tree-optimization/102893
7620 * tree-ssa-dce.c (find_obviously_necessary_stmts): Fix the
7621 test for an exit edge.
7623 2021-10-22 Aldy Hernandez <aldyh@redhat.com>
7624 Andrew MacLeod <amacleod@redhat.com>
7626 * gimple-range-path.cc (path_range_query::compute_phi_relations):
7627 Kill any global relations we may know before registering a new
7629 * value-relation.cc (path_oracle::killing_def): New.
7630 * value-relation.h (path_oracle::killing_def): New.
7632 2021-10-22 Richard Biener <rguenther@suse.de>
7635 * tree-ssa-sccvn.c (vn_phi_insert): For undefined SSA args
7637 (vn_phi_lookup): Likewise.
7639 2021-10-21 H.J. Lu <hjl.tools@gmail.com>
7642 * doc/invoke.texi: Document -fcf-protection requires i686 or
7645 2021-10-21 Eric Botcazou <ebotcazou@adacore.com>
7647 PR middle-end/102764
7648 * cfgexpand.c (expand_gimple_basic_block): Robustify latest change.
7650 2021-10-21 Jonathan Wright <jonathan.wright@arm.com>
7652 * config/aarch64/arm_neon.h (__STRUCTN): Delete function
7653 macro and all invocations.
7655 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7657 * doc/invoke.texi (ranger-debug): Document.
7658 * flag-types.h (enum ranger_debug): New.
7659 (enum evrp_mode): Remove debug values.
7660 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Use new debug flag.
7661 * gimple-range-gori.cc (gori_compute::gori_compute): Ditto.
7662 * gimple-range.cc (gimple_ranger::gimple_ranger): Ditto.
7663 * gimple-ssa-evrp.c (hybrid_folder::choose_value): Ditto.
7664 (execute_early_vrp): Use evrp-mode directly.
7665 * params.opt (enum evrp_mode): Remove debug values.
7666 (ranger-debug): New.
7667 (ranger-logical-depth): Relocate to be in alphabetical order.
7669 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7671 * doc/invoke.texi: (vrp1-mode, vrp2-mode): Document.
7672 * flag-types.h: (enum vrp_mode): New.
7673 * params.opt: (vrp1-mode, vrp2-mode): New.
7674 * tree-vrp.c (vrp_pass_num): New.
7675 (pass_vrp::pass_vrp): Set pass number.
7676 (pass_vrp::execute): Choose which VRP mode to execute.
7678 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7680 * gimple-ssa-evrp.c (class rvrp_folder): Move to tree-vrp.c.
7681 (execute_early_vrp): For ranger only mode, invoke ranger_vrp.
7682 * tree-vrp.c (class rvrp_folder): Relocate here.
7683 (execute_ranger_vrp): New.
7684 * tree-vrp.h (execute_ranger_vrp): Export.
7686 2021-10-21 Martin Liska <mliska@suse.cz>
7690 * opts.c (finish_options): Process flag_var_tracking* options
7691 here as they can be adjusted by optimize attribute.
7692 Process also flag_syntax_only and flag_gtoggle.
7693 * toplev.c (process_options): Remove it here.
7694 * common.opt: Make debug_nonbind_markers_p as PerFunction
7695 attribute as it depends on optimization level.
7697 2021-10-21 Martin Jambor <mjambor@suse.cz>
7699 PR tree-optimization/102505
7700 * tree-sra.c (totally_scalarize_subtree): Check that the
7701 encountered field fits within the acces we would like to put it
7704 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
7706 * tree-ssa-threadbackward.c
7707 (back_threader::maybe_register_path): Remove circular paths check.
7709 2021-10-21 Richard Biener <rguenther@suse.de>
7711 * toplev.c (process_options): Move the initial debug_hooks
7713 (toplev::main): ... before the call of the post_options
7716 2021-10-21 Richard Biener <rguenther@suse.de>
7718 PR tree-optimization/102847
7719 * tree-vect-stmts.c (vect_model_load_cost): Add the scalar
7720 load cost in the prologue for VMAT_INVARIANT.
7722 2021-10-21 Richard Biener <rguenther@suse.de>
7724 PR tree-optimization/102847
7725 * tree-vect-stmts.c (vect_model_load_cost): Explicitely
7726 handle VMAT_INVARIANT as a splat in the prologue.
7728 2021-10-21 Hongyu Wang <hongyu.wang@intel.com>
7731 * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector
7732 move to use the same logic as HImode.
7734 2021-10-21 Richard Biener <rguenther@suse.de>
7736 * tree-vect-slp.c (vect_build_slp_tree_1): Remove
7737 superfluous gimple_call_nothrow_p check.
7739 2021-10-21 Andrew Pinski <apinski@marvell.com>
7741 * tree-cfg.c (maybe_remove_writeonly_store): Add dce_ssa_names argument.
7742 Mark the ssa-name of the rhs as one to be removed.
7743 (execute_fixup_cfg): Update call to maybe_remove_writeonly_store.
7744 Call simple_dce_from_worklist at the end to a simple dce.
7746 2021-10-21 Andrew Pinski <apinski@marvell.com>
7748 * tree-cfg.c (maybe_remove_writeonly_store): New function
7749 factored out from ...
7750 (execute_fixup_cfg): Here. Call maybe_remove_writeonly_store.
7752 2021-10-21 Andrew Pinski <apinski@marvell.com>
7754 * tree-cfg.c (execute_fixup_cfg): Remove comment
7755 about standalone pass.
7757 2021-10-21 Andrew Pinski <apinski@marvell.com>
7759 * tree-cfg.c (execute_fixup_cfg): Output when the statement
7760 is removed when it is a write only var.
7762 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
7764 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
7765 Avoid threading circular paths.
7767 2021-10-20 Alex Coplan <alex.coplan@arm.com>
7769 * calls.c (initialize_argument_information): Remove some dead
7770 code, remove handling for function_arg returning const_int.
7771 * doc/tm.texi: Delete documentation for unused target hooks.
7772 * doc/tm.texi.in: Likewise.
7773 * target.def (load_bounds_for_arg): Delete.
7774 (store_bounds_for_arg): Delete.
7775 (load_returned_bounds): Delete.
7776 (store_returned_bounds): Delete.
7777 * targhooks.c (default_load_bounds_for_arg): Delete.
7778 (default_store_bounds_for_arg): Delete.
7779 (default_load_returned_bounds): Delete.
7780 (default_store_returned_bounds): Delete.
7781 * targhooks.h (default_load_bounds_for_arg): Delete.
7782 (default_store_bounds_for_arg): Delete.
7783 (default_load_returned_bounds): Delete.
7784 (default_store_returned_bounds): Delete.
7786 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
7788 * doc/extend.texi (Basic Asm): Clarify that asm is not an
7790 * doc/invoke.texi (-fno-asm): Fix description for C++.
7792 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
7794 * doc/install.texi: Remove link to old.html
7796 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7798 * config/aarch64/aarch64-simd.md (*aarch64_cmtst_same_<mode>): New.
7800 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7802 * config/aarch64/aarch64-simd.md (*aarch64_narrow_trunc<mode>): New.
7804 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7806 * config/aarch64/aarch64-simd.md (aarch64_simd_ashr<mode>): Add case cmp
7808 * config/aarch64/constraints.md (D1): New.
7810 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7812 * config/aarch64/aarch64-simd.md
7813 (*aarch64_<srn_op>topbits_shuffle<mode>_le): New.
7814 (*aarch64_topbits_shuffle<mode>_le): New.
7815 (*aarch64_<srn_op>topbits_shuffle<mode>_be): New.
7816 (*aarch64_topbits_shuffle<mode>_be): New.
7817 * config/aarch64/predicates.md
7818 (aarch64_simd_shift_imm_vec_exact_top): New.
7820 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7822 * config/aarch64/aarch64-simd.md (*aarch64_<srn_op>shrn<mode>_vect,
7823 *aarch64_<srn_op>shrn<mode>2_vect_le,
7824 *aarch64_<srn_op>shrn<mode>2_vect_be): New.
7825 * config/aarch64/iterators.md (srn_op): New.
7827 2021-10-20 Chung-Lin Tang <cltang@codesourcery.com>
7829 * omp-low.c (omp_copy_decl_2): For !ctx, use record_vars to add new copy
7831 (scan_sharing_clauses): Place copy of OMP_CLAUSE_IN_REDUCTION decl in
7832 ctx->outer instead of ctx.
7834 2021-10-20 Martin Liska <mliska@suse.cz>
7837 2021-10-19 Martin Liska <mliska@suse.cz>
7840 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
7841 * system.h (strip_whilespaces): New function.
7843 2021-10-20 Martin Liska <mliska@suse.cz>
7846 2021-10-19 Martin Liska <mliska@suse.cz>
7849 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
7852 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7854 * config/aarch64/aarch64.c (aarch64_case_values_threshold):
7855 Change to 8 with -Os, 11 otherwise.
7857 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7859 * config/aarch64/aarch64.c (neoversev1_tunings):
7860 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
7861 (neoversen2_tunings): Likewise.
7863 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7866 * config/aarch64/aarch64.md (frint_pattern): Update comment.
7867 * config/aarch64/aarch64-simd-builtins.def: Change frintn to roundeven.
7868 * config/aarch64/arm_fp16.h: Change frintn to roundeven.
7869 * config/aarch64/arm_neon.h: Likewise.
7870 * config/aarch64/iterators.md (frint_pattern): Use roundeven for FRINTN.
7872 2021-10-20 Martin Liska <mliska@suse.cz>
7874 * config/arm/arm.c (arm_unwind_emit_sequence): Do not declare
7875 already declared global variable.
7876 (arm_unwind_emit_set): Use out_file as function argument.
7877 (arm_unwind_emit): Likewise.
7878 * config/darwin.c (machopic_output_data_section_indirection): Likewise.
7879 (machopic_output_stub_indirection): Likewise.
7880 (machopic_output_indirection): Likewise.
7881 (machopic_finish): Likewise.
7882 * config/i386/i386.c (ix86_asm_output_function_label): Likewise.
7883 * config/i386/winnt.c (i386_pe_seh_unwind_emit): Likewise.
7884 * config/ia64/ia64.c (process_epilogue): Likewise.
7885 (process_cfa_adjust_cfa): Likewise.
7886 (process_cfa_register): Likewise.
7887 (process_cfa_offset): Likewise.
7888 (ia64_asm_unwind_emit): Likewise.
7889 * config/s390/s390.c (s390_asm_output_function_label): Likewise.
7891 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
7893 * config/aarch64/aarch64-builtins.c
7894 (aarch64_general_gimple_fold_builtin):
7895 lower vld1 and vst1 variants of the neon builtins
7896 * config/aarch64/aarch64-protos.h:
7897 (aarch64_general_gimple_fold_builtin): Add gsi parameter.
7898 * config/aarch64/aarch64.c (aarch64_general_gimple_fold_builtin):
7901 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
7903 * match.pd: Generate IFN_TRUNC.
7905 2021-10-20 Richard Biener <rguenther@suse.de>
7907 PR tree-optimization/102853
7908 * tree-data-ref.c (split_constant_offset_1): Bail out
7909 immediately if the expression traps on overflow.
7911 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7913 * tree-ssa-threadbackward.c (back_threader::~back_threader): Remove.
7915 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7917 * tree-ssa-threadupdate.c (back_jt_path_registry::adjust_paths_after_duplication):
7918 Remove superflous debugging message.
7919 (back_jt_path_registry::duplicate_thread_path): Same.
7921 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7923 * tree-ssa-threadbackward.c (back_threader_registry::back_threader_registry):
7925 (back_threader_registry::register_path): Remove m_threaded_paths.
7927 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7929 PR tree-optimization/102814
7930 * doc/invoke.texi: Document --param=max-fsm-thread-length.
7931 * params.opt: Add --param=max-fsm-thread-length.
7932 * tree-ssa-threadbackward.c
7933 (back_threader_profitability::profitable_path_p): Fail on paths
7934 longer than max-fsm-thread-length.
7936 2021-10-20 Eric Botcazou <ebotcazou@adacore.com>
7938 PR middle-end/102764
7939 * cfgexpand.c (expand_gimple_basic_block): Disregard a final debug
7940 statement to reset the current location for the outgoing edges.
7942 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7943 Richard Biener <rguenther@suse.de>
7945 * tree-ssa-threadupdate.c (cancel_thread): Dump threading reason
7946 on the same line as the threading cancellation.
7947 (jt_path_registry::cancel_invalid_paths): Avoid rotating loops.
7948 Avoid threading through loop headers where the path remains in the
7951 2021-10-20 Siddhesh Poyarekar <siddhesh@gotplt.org>
7953 * tree-object-size.c (unknown): Make into a function. Adjust
7955 (unknown_object_size): Simplify implementation.
7957 2021-10-20 Hongtao Liu <hongtao.liu@intel.com>
7958 Kewen Lin <linkw@linux.ibm.com>
7960 * doc/sourcebuild.texi (Effective-Target Keywords): Document
7961 vect_slp_v2qi_store, vect_slp_v4qi_store, vect_slp_v8qi_store,
7962 vect_slp_v16qi_store, vect_slp_v2hi_store,
7963 vect_slp_v4hi_store, vect_slp_v2si_store, vect_slp_v4si_store.
7965 2021-10-19 Jonathan Wakely <jwakely@redhat.com>
7967 * doc/extend.texi (Basic PowerPC Built-in Functions): Fix typo.
7969 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
7973 * config/rs6000/emmintrin.h: Guard POWER8 intrinsics.
7974 * config/rs6000/pmmintrin.h: Same.
7975 * config/rs6000/smmintrin.h: Same.
7976 * config/rs6000/tmmintrin.h: Same.
7978 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
7980 * config.gcc (extra_headers): Add nmmintrin.h.
7982 2021-10-19 Richard Biener <rguenther@suse.de>
7984 * tree-vectorizer.h (vect_supportable_dr_alignment): Add
7985 misalignment parameter.
7986 * tree-vect-data-refs.c (vect_get_peeling_costs_all_drs):
7987 Do not change DR_MISALIGNMENT in place, instead pass the
7988 adjusted misalignment to vect_supportable_dr_alignment.
7989 (vect_peeling_supportable): Likewise.
7990 (vect_peeling_hash_get_lowest_cost): Adjust.
7991 (vect_enhance_data_refs_alignment): Likewise.
7992 (vect_vfa_access_size): Likewise.
7993 (vect_supportable_dr_alignment): Add misalignment
7994 parameter and simplify.
7995 * tree-vect-stmts.c (get_negative_load_store_type): Adjust.
7996 (get_group_load_store_type): Likewise.
7997 (get_load_store_type): Likewise.
7999 2021-10-19 Clément Chigot <clement.chigot@atos.net>
8001 * config/rs6000/rs6000.c (rs6000_xcoff_file_end): Move
8002 __tls_get_addr reference to .text csect.
8004 2021-10-19 Martin Liska <mliska@suse.cz>
8007 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
8010 2021-10-19 Richard Biener <rguenther@suse.de>
8012 * tree-vectorizer.h (vect_get_store_cost): Adjust signature.
8013 (vect_get_load_cost): Likewise.
8014 * tree-vect-data-refs.c (vect_get_data_access_cost): Get
8015 alignment support scheme and misalignment as arguments
8017 (vect_get_peeling_costs_all_drs): Compute that info here
8018 and note that we shouldn't need to.
8019 * tree-vect-stmts.c (vect_model_store_cost): Get
8020 alignment support scheme and misalignment as arguments.
8021 (vect_get_store_cost): Likewise.
8022 (vect_model_load_cost): Likewise.
8023 (vect_get_load_cost): Likewise.
8024 (vectorizable_store): Pass down alignment support scheme
8025 and misalignment to costing.
8026 (vectorizable_load): Likewise.
8028 2021-10-19 Richard Biener <rguenther@suse.de>
8030 * tree-vect-stmts.c (get_negative_load_store_type): Add
8031 offset output parameter and initialize it.
8032 (get_group_load_store_type): Likewise.
8033 (get_load_store_type): Likewise.
8034 (vectorizable_store): Use offset as computed by
8035 get_load_store_type.
8036 (vectorizable_load): Likewise.
8038 2021-10-19 Richard Biener <rguenther@suse.de>
8040 PR tree-optimization/102827
8041 * tree-if-conv.c (predicate_statements): Add pe parameter
8042 and use that edge to insert invariant stmts on.
8043 (combine_blocks): Pass through pe.
8044 (tree_if_conversion): Compute the edge to insert invariant
8045 stmts on and pass it along.
8047 2021-10-19 Roger Sayle <roger@nextmovesoftware.com>
8050 * config/bfin/bfin.md (addsubv2hi3, subaddv2hi3, ssaddsubv2hi3,
8051 sssubaddv2hi3): Swap the order of operators in vec_concat.
8053 2021-10-19 Xionghu Luo <luoxhu@linux.ibm.com>
8055 * config/rs6000/altivec.md (*altivec_vmrghb_internal): Delete.
8056 (altivec_vmrghb_direct): New.
8057 (*altivec_vmrghh_internal): Delete.
8058 (altivec_vmrghh_direct): New.
8059 (*altivec_vmrghw_internal): Delete.
8060 (altivec_vmrghw_direct_<mode>): New.
8061 (altivec_vmrghw_direct): Delete.
8062 (*altivec_vmrglb_internal): Delete.
8063 (altivec_vmrglb_direct): New.
8064 (*altivec_vmrglh_internal): Delete.
8065 (altivec_vmrglh_direct): New.
8066 (*altivec_vmrglw_internal): Delete.
8067 (altivec_vmrglw_direct_<mode>): New.
8068 (altivec_vmrglw_direct): Delete.
8069 * config/rs6000/rs6000-p8swap.c (rtx_is_swappable_p): Adjust.
8070 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const):
8072 * config/rs6000/vsx.md (vsx_xxmrghw_<mode>): Adjust.
8073 (vsx_xxmrglw_<mode>): Adjust.
8075 2021-10-19 Aldy Hernandez <aldyh@redhat.com>
8077 * passes.def: Change threading comment before pass_ccp pass.
8079 2021-10-19 Haochen Gui <guihaoc@gcc.gnu.org>
8081 * config/rs6000/rs6000-call.c (altivec_expand_lxvr_builtin):
8082 Modify the expansion for sign extension. All extensions are done
8083 within VSX registers.
8085 2021-10-19 Richard Biener <rguenther@suse.de>
8087 * tree-vect-stmts.c (get_group_load_store_type): Add
8088 misalignment output parameter and initialize it.
8089 (get_group_load_store_type): Likewise.
8090 (vectorizable_store): Remove now redundant queries.
8091 (vectorizable_load): Likewise.
8093 2021-10-19 Richard Biener <rguenther@suse.de>
8095 * tree-vectorizer.h (vect_supportable_dr_alignment): Remove
8096 check_aligned argument.
8097 * tree-vect-data-refs.c (vect_supportable_dr_alignment):
8099 (vect_peeling_hash_insert): Add supportable_if_not_aligned
8100 argument and do not call vect_supportable_dr_alignment here.
8101 (vect_peeling_supportable): Adjust.
8102 (vect_enhance_data_refs_alignment): Compute whether the
8103 access is supported with different alignment here and
8104 pass that down to vect_peeling_hash_insert.
8105 (vect_vfa_access_size): Adjust.
8106 * tree-vect-stmts.c (vect_get_store_cost): Likewise.
8107 (vect_get_load_cost): Likewise.
8108 (get_negative_load_store_type): Likewise.
8109 (get_group_load_store_type): Likewise.
8110 (get_load_store_type): Likewise.
8112 2021-10-19 Martin Liska <mliska@suse.cz>
8115 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
8116 * system.h (strip_whilespaces): New function.
8118 2021-10-19 dianhong xu <dianhong.xu@intel.com>
8120 * config/i386/avx512fp16intrin.h:
8121 (_mm512_set1_pch): New intrinsic.
8122 * config/i386/avx512fp16vlintrin.h:
8123 (_mm256_set1_pch): New intrinsic.
8124 (_mm_set1_pch): Ditto.
8126 2021-10-18 Andrew MacLeod <amacleod@redhat.com>
8128 PR tree-optimization/102796
8129 * gimple-range.cc (gimple_ranger::range_on_edge): Process EH edges
8130 normally. Return get_tree_range for non gimple_range_ssa_p names.
8131 (gimple_ranger::range_of_stmt): Use get_tree_range for non
8132 gimple_range_ssa_p names.
8134 2021-10-18 Uroš Bizjak <ubizjak@gmail.com>
8137 * config/i386/i386.c (ix86_print_operand_address):
8138 Error out for non-address_operand asm operands.
8140 2021-10-18 Richard Biener <rguenther@suse.de>
8142 * tree-vect-data-refs.c (vect_peeling_hash_insert): Do
8143 not auto-convert dr_alignment_support to bool.
8144 (vect_peeling_supportable): Likewise.
8145 (vect_enhance_data_refs_alignment): Likewise.
8146 (vect_supportable_dr_alignment): Commonize read/write case.
8147 * tree-vect-stmts.c (vect_get_store_cost): Use
8148 dr_alignment_support, not int, for the vect_supportable_dr_alignment
8150 (vect_get_load_cost): Likewise.
8152 2021-10-18 Siddhesh Poyarekar <siddhesh@gotplt.org>
8154 * tree-object-size.c (object_sizes_execute): Consolidate LHS
8155 null check and do it early.
8157 2021-10-18 Richard Biener <rguenther@suse.de>
8159 * tree-vect-stmts.c (vectorizable_store): Use the
8160 computed alignment scheme instead of querying
8163 2021-10-18 Richard Biener <rguenther@suse.de>
8165 * tree-vect-stmts.c (vectorizable_store): Do not recompute
8166 alignment scheme already determined by get_load_store_type.
8168 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
8170 * tree-ssa-threadbackward.c (class pass_thread_jumps_full):
8171 Clone corresponding pass.
8173 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
8175 * combine.c (recog_for_combine): For an unrecognized move/set of
8176 a constant, try force_const_mem to place it in the constant pool.
8177 * cse.c (constant_pool_entries_cost, constant_pool_entries_regcost):
8178 Delete global variables (that are no longer assigned a cost value).
8179 (cse_insn): Simplify logic for deciding whether to place a folded
8180 constant in the constant pool using force_const_mem.
8181 (cse_main): Remove zero initialization of constant_pool_entries_cost
8182 and constant_pool_entries_regcost.
8183 * config/i386/i386.c (ix86_rtx_costs): Make memory accesses
8184 fractionally more expensive, when optimizing for speed.
8186 2021-10-18 Martin Liska <mliska@suse.cz>
8188 PR gcov-profile/102746
8189 PR gcov-profile/102747
8190 * gcov.c (main): Return return_code.
8191 (output_gcov_file): Mark return_code when error happens.
8192 (generate_results): Likewise.
8193 (read_graph_file): Likewise.
8194 (read_count_file): Likewise.
8196 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
8198 * config/bfin/bfin.md (define_constants): Remove UNSPEC_ONES.
8199 (define_insn "ones"): Replace UNSPEC_ONES with a truncate of
8200 a popcount, allowing compile-time evaluation/simplification.
8201 (popcountsi2, popcounthi2): New expanders using a "ones" insn.
8203 2021-10-18 Richard Biener <rguenther@suse.de>
8205 PR tree-optimization/102788
8206 * tree-vect-patterns.c (vect_init_pattern_stmt): Allow
8208 (vect_pattern_recog_1): Likewise.
8209 (vect_recog_bool_pattern): Continue matching the pattern
8210 even if we do not have a vector type for a conversion
8213 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
8215 * simplify-rtx.c (simplify_const_unary_operation) [SS_NEG, SS_ABS]:
8216 Evalute SS_NEG and SS_ABS of a constant argument.
8218 2021-10-18 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
8221 * gimple-match-head.c (try_conditional_simplification): Add case for single operand.
8222 * internal-fn.def: Add entry for COND_NEG internal function.
8223 * internal-fn.c (FOR_EACH_CODE_MAPPING): Add entry for
8224 NEGATE_EXPR, COND_NEG mapping.
8225 * optabs.def: Add entry for cond_neg_optab.
8226 * match.pd (UNCOND_UNARY, COND_UNARY): New operator lists.
8227 (vec_cond COND (foo A) B) -> (IFN_COND_FOO COND A B): New pattern.
8228 (vec_cond COND B (foo A)) -> (IFN_COND_FOO ~COND A B): Likewise.
8230 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
8232 * tree-ssa-strlen.c (class strlen_pass): Rename from
8234 (handle_builtin_stxncpy_strncat): Move to strlen_pass.
8235 (handle_assign): Same.
8236 (adjust_last_stmt): Same.
8237 (maybe_warn_overflow): Same.
8238 (handle_builtin_strlen): Same.
8239 (handle_builtin_strchr): Same.
8240 (handle_builtin_strcpy): Same.
8241 (handle_builtin_strncat): Same.
8242 (handle_builtin_stxncpy_strncat): Same.
8243 (handle_builtin_memcpy): Same.
8244 (handle_builtin_strcat): Same.
8245 (handle_alloc_call): Same.
8246 (handle_builtin_memset): Same.
8247 (handle_builtin_memcmp): Same.
8248 (get_len_or_size): Same.
8249 (strxcmp_eqz_result): Same.
8250 (handle_builtin_string_cmp): Same.
8251 (handle_pointer_plus): Same.
8252 (count_nonzero_bytes_addr): Same.
8253 (count_nonzero_bytes): Same.
8254 (handle_store): Same.
8255 (strlen_check_and_optimize_call): Same.
8256 (handle_integral_assign): Same.
8257 (check_and_optimize_stmt): Same.
8258 (printf_strlen_execute): Rename strlen_dom_walker to strlen_pass.
8260 2021-10-18 Richard Biener <rguenther@suse.de>
8262 PR tree-optimization/102798
8263 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
8264 Only copy points-to info to newly generated SSA names.
8266 2021-10-18 Martin Liska <mliska@suse.cz>
8268 * dbgcnt.c (dbg_cnt_process_opt): Remove unused but set variable.
8269 * gcov.c (get_cycles_count): Likewise.
8270 * lto-compress.c (lto_compression_zlib): Likewise.
8271 (lto_uncompression_zlib): Likewise.
8272 * targhooks.c (default_pch_valid_p): Likewise.
8274 2021-10-17 Aldy Hernandez <aldyh@redhat.com>
8276 * tree-pass.h (make_pass_thread_jumps_full): New.
8277 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Inline.
8278 (try_thread_blocks): Add resolve and speed arguments.
8279 (pass_thread_jumps::execute): Inline.
8280 (do_early_thread_jumps): New.
8281 (do_thread_jumps): New.
8282 (make_pass_thread_jumps): Move.
8283 (pass_early_thread_jumps::gate): Inline.
8284 (pass_early_thread_jumps::execute): Inline.
8285 (class pass_thread_jumps_full): New.
8287 2021-10-16 Piotr Kubaj <pkubaj@FreeBSD.org>
8289 * configure.ac: Treat powerpc64*-*-freebsd* the same as
8290 powerpc64-*-freebsd*.
8291 * configure: Regenerate.
8293 2021-10-16 H.J. Lu <hjl.tools@gmail.com>
8295 * value-query.cc (get_ssa_name_ptr_info_nonnull): Change
8296 set_ptr_nonull to set_ptr_nonnull in comments.
8298 2021-10-16 Jan Hubicka <hubicka@ucw.cz>
8300 PR tree-optimization/102720
8301 * tree-ssa-structalias.c (compute_points_to_sets): Fix producing
8302 of call used and clobbered sets.
8304 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
8306 * config/darwin.h (LINK_COMMAND_SPEC_A): Update 'r' handling to
8307 skip gomp and itm when r or nodefaultlibs is given.
8308 (DSYMUTIL_SPEC): Do not call dsymutil for '-r' link lines.
8309 Update ordering of exclusions, remove duplicate 'v' addition
8310 (collect2 will add this from the main command line).
8312 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
8314 * config/darwin-driver.c (darwin_driver_init): Revise comments, handle
8315 filelist and framework options in specs instead of code.
8316 * config/darwin.h (SUBTARGET_DRIVER_SELF_SPECS): Update to handle link
8317 specs that are really driver ones.
8318 (DARWIN_CC1_SPEC): Likewise.
8319 (CPP_SPEC): Likewise.
8320 (SYSROOT_SPEC): Append space.
8321 (LINK_SYSROOT_SPEC): Remove most driver link specs.
8322 (STANDARD_STARTFILE_PREFIX_2): Update link-related specs.
8323 (STARTFILE_SPEC): Likewise.
8324 (ASM_MMACOSX_VERSION_MIN_SPEC): Fix line wrap.
8325 (ASM_SPEC): Update driver-related specs.
8326 (ASM_FINAL_SPEC): Likewise.
8327 * config/darwin.opt: Remove now unused option aliases.
8328 * config/i386/darwin.h (EXTRA_ASM_OPTS): Ensure space after opt.
8329 (ASM_SPEC): Update driver-related specs.
8331 2021-10-15 Roger Sayle <roger@nextmovesoftware.com>
8333 * config/i386/i386.c (ix86_hardreg_mov_ok): For vector modes,
8334 allow standard_sse_constant_p immediate constants.
8336 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
8338 * config.gcc: Add tm-dwarf2.h to tm_d-file.
8340 2021-10-15 Andrew MacLeod <amacleod@redhat.com>
8342 * gimple-range-fold.h (gimple_range_ssa_p): Don't process names
8343 that occur in abnormal phis.
8344 * gimple-range.cc (gimple_ranger::range_on_edge): Return false for
8345 abnormal and EH edges.
8346 * gimple-ssa-evrp.c (rvrp_folder::value_of_expr): Ditto.
8347 (rvrp_folder::value_on_edge): Ditto.
8348 (rvrp_folder::value_of_stmt): Ditto.
8349 (hybrid_folder::value_of_expr): Ditto for ranger queries.
8350 (hybrid_folder::value_on_edge): Ditto.
8351 (hybrid_folder::value_of_stmt): Ditto.
8352 * value-query.cc (gimple_range_global): Always return a range if
8353 the type is supported.
8355 2021-10-15 John David Anglin <danglin@gcc.gnu.org>
8357 * config/pa/pa.md: Consistently use "rG" constraint for copy
8358 instruction in move patterns.
8360 2021-10-15 Andrew Stubbs <ams@codesourcery.com>
8362 * config/gcn/gcn-hsa.h (S_FIJI): Set unconditionally.
8365 * config/gcn/gcn.c: Hard code SRAM ECC settings for old architectures.
8366 * config/gcn/mkoffload.c (ELFABIVERSION_AMDGPU_HSA): Rename to ...
8367 (ELFABIVERSION_AMDGPU_HSA_V3): ... this.
8368 (ELFABIVERSION_AMDGPU_HSA_V4): New.
8369 (SET_SRAM_ECC_UNSUPPORTED): New.
8370 (copy_early_debug_info): Create elf flags to match the other objects.
8371 (main): Just let the attribute flags pass through.
8373 2021-10-15 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8375 * tree-loop-distribution.c (reduction_var_overflows_first):
8376 Pass the type of reduction_var as first argument as it is also
8377 done for the load type.
8378 (loop_distribution::transform_reduction_loop): Add missing
8379 TREE_TYPE while determining precission of reduction_var.
8381 2021-10-15 Richard Biener <rguenther@suse.de>
8383 * defaults.h (PREFERRED_DEBUGGING_TYPE): Choose DWARF2_DEBUG
8385 * toplev.c (process_options): Warn when STABS debugging is
8386 enabled but not the preferred format.
8387 * config/pa/som.h (PREFERRED_DEBUGGING_TYPE): Define to
8389 * config/pdp11/pdp11.h (PREFERRED_DEBUGGING_TYPE): Likewise.
8391 2021-10-15 Richard Biener <rguenther@suse.de>
8394 * tree-inline.c (copy_bb): Avoid underflowing nargs.
8396 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
8398 * config/i386/i386-expand.c (ix86_expand_vec_perm): Convert
8399 HFmode input operand to HImode.
8400 (ix86_vectorize_vec_perm_const): Likewise.
8401 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1_hf):
8403 (*avx512f_permvar_truncv8siv8hi_1_hf):
8406 2021-10-15 Richard Biener <rguenther@suse.de>
8408 PR middle-end/102682
8409 * expmed.c (store_bit_field_1): Ensure a LHS subreg would
8410 not create a paradoxical subreg.
8412 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
8414 * config/i386/i386-expand.c (ix86_expand_vector_init):
8415 For half_vector concat for HFmode, handle them like HImode.
8417 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8419 * tree-ssa-threadbackward.c (class back_threader): Add m_resolve.
8420 (back_threader::back_threader): Same.
8421 (back_threader::resolve_phi): Try to solve without looking back if
8423 (back_threader::find_paths_to_names): Same.
8424 (try_thread_blocks): Pass resolve argument to back threader.
8425 (pass_early_thread_jumps::execute): Same.
8427 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8429 * doc/invoke.texi: Remove max-fsm-thread-length,
8430 max-fsm-thread-paths, and fsm-maximum-phi-arguments.
8432 * tree-ssa-threadbackward.c (back_threader::back_threader): Remove
8434 (back_threader_registry::back_threader_registry): Same.
8435 (back_threader_profitability::profitable_path_p): Remove
8436 param_max_fsm_thread-length.
8437 (back_threader_registry::register_path): Remove
8438 m_max_allowable_paths.
8440 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8442 * tree-ssa-threadbackward.c (class back_threader): Make m_imports
8444 (back_threader::~back_threader): Do not release m_path.
8446 2021-10-14 Andrew MacLeod <amacleod@redhat.com>
8448 PR tree-optimization/102738
8449 * vr-values.c (simplify_using_ranges::simplify): Handle RSHIFT_EXPR.
8451 2021-10-14 Kwok Cheung Yeung <kcy@codesourcery.com>
8453 * omp-general.c (omp_check_context_selector): Move from c-omp.c.
8454 (omp_mark_declare_variant): Move from c-omp.c.
8455 (omp_context_name_list_prop): Update for Fortran strings.
8456 * omp-general.h (omp_check_context_selector): New prototype.
8457 (omp_mark_declare_variant): New prototype.
8459 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
8461 * config/rs6000/rs6000.c (rs6000_density_test): Move early
8462 exit test further up the function.
8464 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
8466 * config/arm/arm.c (arm_add_stmt_cost): Delete.
8467 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
8469 2021-10-14 Martin Jambor <mjambor@suse.cz>
8471 * doc/invoke.texi (Optimize Options): Add entry for
8472 ipa-cp-recursive-freq-factor.
8474 2021-10-14 Tamar Christina <tamar.christina@arm.com>
8476 * match.pd: New rule.
8478 2021-10-14 Jan Hubicka <hubicka@ucw.cz>
8481 * ipa-modref-tree.h (modref_access_node::update2):
8482 Also check that parm_offset is unchanged.
8483 (modref_ref_node::insert_access): Fix updating of
8486 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8488 * tree-ssa-threadbackward.c (back_threader::resolve_phi): Add
8491 2021-10-14 Richard Biener <rguenther@suse.de>
8493 PR tree-optimization/102659
8494 * tree-if-conv.c (if_convertible_gimple_assign_stmt_p): Also
8495 rewrite pointer typed undefined overflow operations.
8496 (predicate_statements): Likewise. Make sure to emit invariant
8497 conversions in the preheader.
8498 * tree-vectorizer.c (vect_loop_vectorized_call): Look through
8499 non-empty preheaders.
8500 * tree-data-ref.c (dr_analyze_indices): Strip useless
8501 conversions to the MEM_REF base type.
8503 2021-10-14 Martin Liska <mliska@suse.cz>
8505 * common.opt: Stop using AUTODETECT_VALUE
8506 and use EnabledBy where possible.
8507 * opts.c: Enable OPT_fvar_tracking with optimize >= 1.
8508 * toplev.c (AUTODETECT_VALUE): Remove macro.
8509 (process_options): Simplify by using EnabledBy and
8510 OPT_fvar_tracking. Use OPTION_SET_P macro instead of
8513 2021-10-14 Jonathan Wright <jonathan.wright@arm.com>
8515 * config/aarch64/arm_neon.h (vld1_s8_x3): Use signed type for
8517 (vld1_s32_x3): Likewise.
8519 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8521 PR tree-optimization/102736
8522 PR tree-optimization/102736
8523 * gimple-range-path.cc (path_range_query::range_on_path_entry):
8524 Assert that the requested range is defined outside the path.
8525 (path_range_query::ssa_range_in_phi): Do not call
8526 range_on_path_entry for SSA names that are defined within the
8529 2021-10-14 Iain Sandoe <iain@sandoe.co.uk>
8531 * config/darwin-driver.c (darwin_find_version_from_kernel):
8532 Quote internal identifiers and avoid contractions in
8534 (darwin_default_min_version): Likewise.
8535 (darwin_driver_init): Likewise.
8537 2021-10-14 Martin Jambor <mjambor@suse.cz>
8539 * params.opt (ipa-cp-recursive-freq-factor): New.
8540 * ipa-cp.c (ipcp_value): Switch to inline initialization. New members
8541 scc_no, self_recursion_generated_level, same_scc and
8542 self_recursion_generated_p.
8543 (ipcp_lattice::add_value): Replaced parameter unlimited with
8544 same_lat_gen_level, usit it determine limit of values and store it to
8546 (ipcp_lattice<valtype>::print): Dump the new fileds.
8547 (allocate_and_init_ipcp_value): Take same_lat_gen_level as a new
8548 parameter and store it to the new value.
8549 (self_recursively_generated_p): Removed.
8550 (propagate_vals_across_arith_jfunc): Use self_recursion_generated_p
8551 instead of self_recursively_generated_p, store self generation level
8553 (value_topo_info<valtype>::add_val): Set scc_no.
8554 (value_topo_info<valtype>::propagate_effects): Multiply frequencies of
8555 recursively feeding values and self generated values by appropriate
8558 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
8560 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Remove
8561 redundant test for flag_vect_cost_model.
8563 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8565 * bitmap.c (debug): New overloaded function for auto_bitmaps.
8566 * bitmap.h (debug): Same.
8568 2021-10-14 Hongyu Wang <hongyu.wang@intel.com>
8570 * config/i386/avx512fp16intrin.h (_mm512_mask_fcmadd_pch):
8571 Adjust builtin call.
8572 (_mm512_mask3_fcmadd_pch): Likewise.
8573 (_mm512_mask_fmadd_pch): Likewise
8574 (_mm512_mask3_fmadd_pch): Likewise
8575 (_mm512_mask_fcmadd_round_pch): Likewise
8576 (_mm512_mask3_fcmadd_round_pch): Likewise
8577 (_mm512_mask_fmadd_round_pch): Likewise
8578 (_mm512_mask3_fmadd_round_pch): Likewise
8579 (_mm_mask_fcmadd_sch): Likewise
8580 (_mm_mask3_fcmadd_sch): Likewise
8581 (_mm_mask_fmadd_sch): Likewise
8582 (_mm_mask3_fmadd_sch): Likewise
8583 (_mm_mask_fcmadd_round_sch): Likewise
8584 (_mm_mask3_fcmadd_round_sch): Likewise
8585 (_mm_mask_fmadd_round_sch): Likewise
8586 (_mm_mask3_fmadd_round_sch): Likewise
8587 (_mm_fcmadd_round_sch): Likewise
8588 * config/i386/avx512fp16vlintrin.h (_mm_mask_fmadd_pch):
8589 Adjust builtin call.
8590 (_mm_mask3_fmadd_pch): Likewise
8591 (_mm256_mask_fmadd_pch): Likewise
8592 (_mm256_mask3_fmadd_pch): Likewise
8593 (_mm_mask_fcmadd_pch): Likewise
8594 (_mm_mask3_fcmadd_pch): Likewise
8595 (_mm256_mask_fcmadd_pch): Likewise
8596 (_mm256_mask3_fcmadd_pch): Likewise
8597 * config/i386/i386-builtin.def: Add mask3 builtin for complex
8598 fma, and adjust mask_builtin to corresponding expander.
8599 * config/i386/i386-expand.c (ix86_expand_round_builtin):
8600 Skip eraseing embedded rounding for expanders that emits
8602 * config/i386/sse.md (complexmove): New mode_attr.
8603 (<avx512>_fmaddc_<mode>_mask1<round_expand_name>): New expander.
8604 (<avx512>_fcmaddc_<mode>_mask1<round_expand_name>): Likewise.
8605 (avx512fp16_fmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
8606 (avx512fp16_fcmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
8607 (avx512fp16_fcmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
8608 (avx512fp16_fmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
8609 * config/i386/subst.md (round_embedded_complex): New subst.
8611 2021-10-14 John David Anglin <danglin@gcc.gnu.org>
8613 * config/pa/pa.md (cbranchsf4): Disable if TARGET_SOFT_FLOAT.
8614 (cbranchdf4): Likewise.
8615 Add missing move patterns for TARGET_SOFT_FLOAT.
8617 2021-10-13 Roger Sayle <roger@nextmovesoftware.com>
8619 * config/i386/i386-expand.c (ix86_expand_vector_move): Use a
8620 pseudo intermediate when moving a SUBREG into a hard register,
8621 by checking ix86_hardreg_mov_ok.
8622 (ix86_expand_vector_extract): Store zero-extended SImode
8623 intermediate in a pseudo, then set target using a SUBREG_PROMOTED
8625 * config/i386/sse.md (mov<VMOVE>_internal): Prevent CSE creating
8626 complex (SUBREG) sets of (vector) hard registers before reload, by
8627 checking ix86_hardreg_mov_ok.
8629 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
8631 * ctfc.h (enum ctf_dtu_d_union_enum): Remove redundant comma.
8633 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
8635 * dwarf2ctf.c (gen_ctf_array_type): Fix typo in comment.
8637 2021-10-13 Martin Sebor <msebor@redhat.com>
8639 PR middle-end/102630
8640 * pointer-query.cc (compute_objsize_r): Handle named address spaces.
8642 2021-10-13 Iain Sandoe <iain@sandoe.co.uk>
8644 * collect2.c (is_lto_object_file): Release simple-object
8645 resources, close files.
8647 2021-10-13 Andre Vieira <andre.simoesdiasvieira@arm.com>
8649 * config/arm/arm.c (thumb2_legitimate_address_p): Use VALID_MVE_MODE
8650 when checking mve addressing modes.
8651 (mve_vector_mem_operand): Fix the way we handle pre, post and offset
8653 (arm_print_operand): Fix printing of POST_ and PRE_MODIFY.
8654 * config/arm/mve.md: Use mve_memory_operand predicate everywhere where
8655 there is a single Ux constraint.
8657 2021-10-13 John David Anglin <danglin@gcc.gnu.org>
8659 * config/pa/pa.md (muldi3): Add support for inlining 64-bit
8660 multiplication on 32-bit PA 1.1 and 2.0 targets.
8662 2021-10-13 Martin Liska <mliska@suse.cz>
8664 PR gcov-profile/90364
8665 * coverage.c (build_info): Emit checksum to the global variable.
8666 (build_info_type): Add new field for checksum.
8667 (coverage_obj_finish): Pass object_checksum.
8668 (coverage_init): Use 0 as checksum for .gcno files.
8669 * gcov-dump.c (dump_gcov_file): Dump also new checksum field.
8670 * gcov.c (read_graph_file): Read also checksum.
8671 * doc/invoke.texi: Document the behaviour change.
8673 2021-10-13 Richard Biener <rguenther@suse.de>
8675 * gimple-iterator.h (gsi_iterator_update): Add GSI_LAST_NEW_STMT,
8676 start at integer value 2.
8677 * gimple-iterator.c (gsi_insert_seq_nodes_before): Update
8678 the iterator for GSI_LAST_NEW_STMT.
8679 (gsi_insert_seq_nodes_after): Likewise.
8680 * tree-if-conv.c (predicate_statements): Use GSI_LAST_NEW_STMT.
8681 * tree-ssa.c (execute_update_addresses_taken): Correct bogus
8682 arguments to gsi_replace.
8684 2021-10-13 Martin Liska <mliska@suse.cz>
8687 * common.opt: Use EnabledBy instead of detection in
8688 finish_options and process_options.
8689 * opts.c (finish_options): Remove handling of
8690 x_flag_unroll_all_loops.
8691 * toplev.c (process_options): Likewise for flag_web and
8692 flag_rename_registers.
8694 2021-10-13 Richard Biener <rguenther@suse.de>
8696 PR tree-optimization/102659
8697 * tree-if-conv.c (need_to_rewrite_undefined): New flag.
8698 (if_convertible_gimple_assign_stmt_p): Mark the loop for
8699 rewrite when stmts with undefined behavior on integer
8701 (combine_blocks): Predicate also when we need to rewrite stmts.
8702 (predicate_statements): Rewrite affected stmts to something
8703 with well-defined behavior on overflow.
8704 (tree_if_conversion): Initialize need_to_rewrite_undefined.
8706 2021-10-13 Richard Biener <rguenther@suse.de>
8709 * ipa-sra.c (ptr_parm_has_nonarg_uses): Fix volatileness
8712 2021-10-13 Jose E. Marchesi <jose.marchesi@oracle.com>
8714 * dwarf2ctf.c: Fix typo in comment.
8716 2021-10-12 Bill Schmidt <wschmidt@linux.ibm.com>
8719 * config/rs6000/altivec.h (vec_cpsgn): Swap operand order.
8720 * config/rs6000/rs6000-overload.def (VEC_COPYSIGN): Use SKIP to
8721 avoid generating an automatic #define of vec_cpsgn. Use the
8722 correct built-in for V4SFmode that doesn't depend on VSX.
8724 2021-10-12 Uroš Bizjak <ubizjak@gmail.com>
8728 * config/i386/i386.md (*add<mode>_1_slp): Rewrite as
8729 define_insn_and_split pattern. Add alternative 1 and split it
8730 post reload to insert operand 1 into the low part of operand 0.
8731 (*sub<mode>_1_slp): Ditto.
8732 (*and<mode>_1_slp): Ditto.
8733 (*<any_or:code><mode>_1_slp): Ditto.
8734 (*ashl<mode>3_1_slp): Ditto.
8735 (*<any_shiftrt:insn><mode>3_1_slp): Ditto.
8736 (*<any_rotate:insn><mode>3_1_slp): Ditto.
8737 (*neg<mode>_1_slp): New insn_and_split pattern.
8738 (*one_cmpl<mode>_1_slp): Ditto.
8740 2021-10-12 David Edelsohn <dje.gcc@gmail.com>
8742 * doc/install.texi: Update MinGW and mingw-64 Binaries
8745 2021-10-12 Daniel Le Duc Khoi Nguyen <greenrecyclebin@gmail.com>
8747 * doc/extend.texi (Common Variable Attributes): Fix typos in
8748 alloc_size documentation.
8750 2021-10-12 Richard Biener <rguenther@suse.de>
8752 PR tree-optimization/102696
8753 * tree-vect-slp.c (vect_build_slp_tree_2): Properly mark
8754 the tree fatally failed when we reject a BIT_FIELD_REF.
8756 2021-10-12 Richard Biener <rguenther@suse.de>
8758 PR tree-optimization/102572
8759 * tree-vect-stmts.c (vect_build_gather_load_calls): When
8760 gathering the vectorized defs for the mask pass in the
8761 desired mask vector type so invariants will be handled
8764 2021-10-12 Tamar Christina <tamar.christina@arm.com>
8766 * config/aarch64/aarch64-sve.md (*fcm<cmp_op><mode>_bic_combine,
8767 *fcm<cmp_op><mode>_nor_combine, *fcmuo<mode>_bic_combine,
8768 *fcmuo<mode>_nor_combine): New.
8770 2021-10-12 Eric Botcazou <ebotcazou@adacore.com>
8773 * config/sparc/sparc-modes.def (OI): New integer mode.
8775 2021-10-12 Jakub Jelinek <jakub@redhat.com>
8777 * gimple-fold.h (clear_padding_type_may_have_padding_p): Declare.
8778 * gimple-fold.c (clear_padding_type_may_have_padding_p): No longer
8781 2021-10-12 Jakub Jelinek <jakub@redhat.com>
8783 * tree-vectorizer.h (loop_cost_model): New function.
8784 (unlimited_cost_model): Use it.
8785 * tree-vect-loop.c (vect_analyze_loop_costing): Use loop_cost_model
8786 call instead of flag_vect_cost_model.
8787 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): Likewise.
8788 (vect_prune_runtime_alias_test_list): Likewise. Also use it instead
8789 of flag_simd_cost_model.
8791 2021-10-12 liuhongt <hongtao.liu@intel.com>
8794 * config/i386/i386-expand.c (emit_reduc_half): Handle
8796 * config/i386/mmx.md (reduc_<code>_scal_v4qi): New expander.
8797 (reduc_plus_scal_v4qi): Ditto.
8799 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8801 * config/rs6000/smmintrin.h (_mm_cmpeq_epi64, _mm_cmpgt_epi64,
8802 _mm_mullo_epi32, _mm_mul_epi32, _mm_packus_epi32): New.
8803 * config/rs6000/nmmintrin.h: Copy from i386, tweak to suit.
8805 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8807 * config/rs6000/smmintrin.h (_mm_cvtepi8_epi16, _mm_cvtepi8_epi32,
8808 _mm_cvtepi8_epi64, _mm_cvtepi16_epi32, _mm_cvtepi16_epi64,
8809 _mm_cvtepi32_epi64, _mm_cvtepu8_epi16, _mm_cvtepu8_epi32,
8810 _mm_cvtepu8_epi64, _mm_cvtepu16_epi32, _mm_cvtepu16_epi64,
8811 _mm_cvtepu32_epi64): New.
8813 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8815 * config/rs6000/smmintrin.h (_mm_test_all_zeros,
8816 _mm_test_all_ones, _mm_test_mix_ones_zeros): Rewrite as macro.
8818 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8820 * config/rs6000/smmintrin.h (_mm_min_epi8, _mm_min_epu16,
8821 _mm_min_epi32, _mm_min_epu32, _mm_max_epi8, _mm_max_epu16,
8822 _mm_max_epi32, _mm_max_epu32): New.
8824 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
8826 * ipa-modref-tree.h (struct modref_access_node): Revert
8828 (struct modref_ref_node): Likewise.
8830 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
8832 * ipa-modref-tree.h (modref_tree::global_access_p): New member
8835 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
8836 ignore_stores_eaf_flags): Move to ipa-modref.h
8837 (remove_useless_eaf_flags): Remove early exit on NOCLOBBER.
8838 (modref_summary::global_memory_read_p): New member function.
8839 (modref_summary::global_memory_written_p): New member function.
8840 * ipa-modref.h (modref_summary::global_memory_read_p,
8841 modref_summary::global_memory_written_p): Declare.
8842 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
8843 ignore_stores_eaf_flags): move here.
8844 * tree-ssa-structalias.c: Include ipa-modref-tree.h, ipa-modref.h
8846 (handle_rhs_call): Rewrite.
8847 (handle_call_arg): New function.
8848 (determine_global_memory_access): New function.
8849 (handle_const_call): Remove
8850 (handle_pure_call): Remove
8851 (find_func_aliases_for_call): Update use of handle_rhs_call.
8852 (compute_points_to_sets): Handle global memory acccesses
8855 2021-10-11 Diane Meirowitz <diane.meirowitz@oracle.com>
8857 * doc/invoke.texi: Add link to UndefinedBehaviorSanitizer
8858 documentation, mention UBSAN_OPTIONS, similar to what is done
8859 for AddressSanitizer.
8861 2021-10-11 Richard Biener <rguenther@suse.de>
8863 PR middle-end/102683
8864 * internal-fn.c (expand_DEFERRED_INIT): Check for mode
8865 availability before building an integer type for storage
8868 2021-10-11 Richard Biener <rguenther@suse.de>
8870 PR middle-end/101480
8871 * gimple.c (gimple_call_fnspec): Do not mark operator new/delete
8874 2021-10-11 Martin Liska <mliska@suse.cz>
8876 * common.opt: Remove Init(2) for some options.
8877 * toplev.c (process_options): Do not use AUTODETECT_VALUE, but
8878 use rather OPTION_SET_P.
8880 2021-10-11 Martin Liska <mliska@suse.cz>
8882 * common.opt: Remove usage of IRA_REGION_AUTODETECT.
8883 * flag-types.h (enum ira_region): Likewise.
8884 * toplev.c (process_options): Use OPTION_SET_P instead of
8885 IRA_REGION_AUTODETECT.
8887 2021-10-11 Jakub Jelinek <jakub@redhat.com>
8889 * omp-low.c (omp_runtime_api_call): Handle omp_get_max_teams,
8890 omp_[sg]et_teams_thread_limit and omp_set_num_teams.
8892 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8894 * config/s390/s390-protos.h (s390_rawmemchr): Add prototype.
8895 * config/s390/s390.c (s390_rawmemchr): New function.
8896 * config/s390/s390.md (rawmemchr<SINT:mode>): New expander.
8897 * config/s390/vector.md (@vec_vfees<mode>): Basically a copy of
8898 the pattern vfees<mode> from vx-builtins.md.
8899 * config/s390/vx-builtins.md (*vfees<mode>): Remove.
8901 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8903 * builtins.c (get_memory_rtx): Change to external linkage.
8904 * builtins.h (get_memory_rtx): Add function prototype.
8905 * doc/md.texi (rawmemchr<mode>): Document.
8906 * internal-fn.c (expand_RAWMEMCHR): Define.
8907 * internal-fn.def (RAWMEMCHR): Add.
8908 * optabs.def (rawmemchr_optab): Add.
8909 * tree-loop-distribution.c (find_single_drs): Change return code
8910 behaviour by also returning true if no single store was found
8912 (loop_distribution::classify_partition): Respect the new return
8913 code behaviour of function find_single_drs.
8914 (loop_distribution::execute): Call new function
8915 transform_reduction_loop in order to replace rawmemchr or strlen
8916 like loops by calls into builtins.
8917 (generate_reduction_builtin_1): New function.
8918 (generate_rawmemchr_builtin): New function.
8919 (generate_strlen_builtin_1): New function.
8920 (generate_strlen_builtin): New function.
8921 (generate_strlen_builtin_using_rawmemchr): New function.
8922 (reduction_var_overflows_first): New function.
8923 (determine_reduction_stmt_1): New function.
8924 (determine_reduction_stmt): New function.
8925 (loop_distribution::transform_reduction_loop): New function.
8927 2021-10-11 Martin Liska <mliska@suse.cz>
8929 * tree.c (cl_option_hasher::hash): Use cl_optimization_hash
8930 and remove legacy hashing code.
8932 2021-10-11 Kito Cheng <kito.cheng@sifive.com>
8935 * builtins.c (maybe_emit_call_builtin___clear_cache): Allow
8936 CONST_INT for BEGIN and END, and use gcc_assert rather than
8939 2021-10-10 Jakub Jelinek <jakub@redhat.com>
8942 * var-tracking.c (add_stores): For cselib_sp_derived_value_p values
8943 use MO_VAL_SET if loc is not sp.
8945 2021-10-10 Andrew Pinski <apinski@marvell.com>
8947 PR tree-optimization/102622
8948 * match.pd: Swap the order of a?pow2cst:0 and a?-1:0 transformations.
8949 Swap the order of a?0:pow2cst and a?0:-1 transformations.
8951 2021-10-09 liuhongt <hongtao.liu@intel.com>
8954 * config/i386/i386-expand.c (ix86_valid_mask_cmp_mode): Handle
8956 (ix86_use_mask_cmp_p): Ditto.
8957 (ix86_expand_sse_movcc): Ditto.
8958 * config/i386/i386.md (setcc_hf_mask): New define_insn.
8959 (movhf_mask): Ditto.
8960 (UNSPEC_MOVCC_MASK): New unspec.
8961 * config/i386/sse.md (UNSPEC_PCMP): Move to i386.md.
8963 2021-10-08 Vladimir N. Makarov <vmakarov@redhat.com>
8965 PR rtl-optimization/102627
8966 * lra-constraints.c (split_reg): Use at least natural mode of hard reg.
8968 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
8970 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Grow
8973 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
8975 * value-range.cc (irange::debug): New.
8976 * value-range.h (irange::debug): New.
8978 2021-10-08 Richard Sandiford <richard.sandiford@arm.com>
8980 PR tree-optimization/102385
8981 * predict.h (change_edge_frequency): Declare.
8982 * predict.c (change_edge_frequency): New function.
8983 * tree-ssa-loop-manip.h (tree_transform_and_unroll_loop): Remove
8985 (tree_unroll_loop): Likewise.
8986 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Update accordingly.
8987 * tree-predcom.c (pcom_worker::tree_predictive_commoning_loop):
8989 * tree-ssa-loop-prefetch.c (loop_prefetch_arrays): Likewise.
8990 * tree-ssa-loop-manip.c (tree_unroll_loop): Likewise.
8991 (tree_transform_and_unroll_loop): Likewise. Use single_dom_exit
8992 to retrieve the exit edges. Make all the old profile update code
8993 conditional on !single_loop_p -- the case it was written for --
8994 and use a different approach for the single-loop case.
8996 2021-10-08 Martin Liska <mliska@suse.cz>
8998 * config/alpha/alpha.c (alpha_option_override): Use new macro
9000 * config/arc/arc.c (arc_override_options): Likewise.
9001 * config/arm/arm.c (arm_option_override): Likewise.
9002 * config/bfin/bfin.c (bfin_load_pic_reg): Likewise.
9003 * config/c6x/c6x.c (c6x_option_override): Likewise.
9004 * config/csky/csky.c: Likewise.
9005 * config/darwin.c (darwin_override_options): Likewise.
9006 * config/frv/frv.c (frv_option_override): Likewise.
9007 * config/i386/djgpp.h: Likewise.
9008 * config/i386/i386.c (ix86_stack_protect_guard): Likewise.
9009 (ix86_max_noce_ifcvt_seq_cost): Likewise.
9010 * config/ia64/ia64.c (ia64_option_override): Likewise.
9011 (ia64_override_options_after_change): Likewise.
9012 * config/m32c/m32c.c (m32c_option_override): Likewise.
9013 * config/m32r/m32r.c (m32r_init): Likewise.
9014 * config/m68k/m68k.c (m68k_option_override): Likewise.
9015 * config/microblaze/microblaze.c (microblaze_option_override): Likewise.
9016 * config/mips/mips.c (mips_option_override): Likewise.
9017 * config/nios2/nios2.c (nios2_option_override): Likewise.
9018 * config/nvptx/nvptx.c (nvptx_option_override): Likewise.
9019 * config/pa/pa.c (pa_option_override): Likewise.
9020 * config/riscv/riscv.c (riscv_option_override): Likewise.
9021 * config/rs6000/aix71.h: Likewise.
9022 * config/rs6000/aix72.h: Likewise.
9023 * config/rs6000/aix73.h: Likewise.
9024 * config/rs6000/rs6000.c (darwin_rs6000_override_options): Likewise.
9025 (rs6000_override_options_after_change): Likewise.
9026 (rs6000_linux64_override_options): Likewise.
9027 (glibc_supports_ieee_128bit): Likewise.
9028 (rs6000_option_override_internal): Likewise.
9029 (rs6000_file_start): Likewise.
9030 (rs6000_darwin_file_start): Likewise.
9031 * config/rs6000/rtems.h: Likewise.
9032 * config/rs6000/sysv4.h: Likewise.
9033 * config/rs6000/vxworks.h (SUB3TARGET_OVERRIDE_OPTIONS): Likewise.
9034 * config/s390/s390.c (s390_option_override): Likewise.
9035 * config/sh/linux.h: Likewise.
9036 * config/sh/netbsd-elf.h (while): Likewise.
9037 * config/sh/sh.c (sh_option_override): Likewise.
9038 * config/sol2.c (solaris_override_options): Likewise.
9039 * config/sparc/sparc.c (sparc_option_override): Likewise.
9040 * config/tilegx/tilegx.c (tilegx_option_override): Likewise.
9041 * config/visium/visium.c (visium_option_override): Likewise.
9042 * config/vxworks.c (vxworks_override_options): Likewise.
9043 * lto-opts.c (lto_write_options): Likewise.
9044 * omp-expand.c (expand_omp_simd): Likewise.
9045 * omp-general.c (omp_max_vf): Likewise.
9046 * omp-offload.c (oacc_xform_loop): Likewise.
9047 * opts.h (OPTION_SET_P): Likewise.
9048 * targhooks.c (default_max_noce_ifcvt_seq_cost): Likewise.
9049 * toplev.c (process_options): Likewise.
9050 * tree-predcom.c: Likewise.
9051 * tree-sra.c (analyze_all_variable_accesses): Likewise.
9053 2021-10-08 liuhongt <hongtao.liu@intel.com>
9056 * config/i386/i386.c (ix86_optab_supported_p):
9057 Return true for HFmode.
9058 * match.pd: Simplify (_Float16) ceil ((double) x) to
9059 __builtin_ceilf16 (a) when a is _Float16 type and
9060 direct_internal_fn_supported_p.
9062 2021-10-08 liuhongt <hongtao.liu@intel.com>
9065 * config/i386/i386-expand.c (emit_reduc_half): Hanlde V4HImode.
9066 * config/i386/mmx.md (reduc_plus_scal_v4hi): New.
9067 (reduc_<code>_scal_v4hi): New.
9069 2021-10-08 liuhongt <hongtao.liu@intel.com>
9071 * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize).
9072 * doc/invoke.texi (Options That Control Optimization): Update
9074 * opts.c (default_options_table): Enable auto-vectorization at
9075 O2 with very-cheap cost model.
9076 (finish_options): Use cheap cost model for
9077 explicit -ftree{,-loop}-vectorize.
9079 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
9081 * ctfc.c (ctfc_delete_container): Free hash table contents.
9083 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
9085 * toplev.c (process_options): Do not warn for GNU GIMPLE.
9087 2021-10-07 Siddhesh Poyarekar <siddhesh@gotplt.org>
9089 * tree-object-size.c (addr_object_size,
9090 compute_builtin_object_size): Drop PDECL and POFF arguments.
9091 (addr_object_size): Adjust calls.
9092 * tree-object-size.h (compute_builtin_object_size): Drop PDECL
9095 2021-10-07 Roger Sayle <roger@nextmovesoftware.com>
9097 * rtl.def (SMUL_HIGHPART, UMUL_HIGHPART): New RTX codes for
9098 representing signed and unsigned high-part multiplication resp.
9099 * simplify-rtx.c (simplify_binary_operation_1) [SMUL_HIGHPART,
9100 UMUL_HIGHPART]: Simplify high-part multiplications by zero.
9101 [SS_PLUS, US_PLUS, SS_MINUS, US_MINUS, SS_MULT, US_MULT,
9102 SS_DIV, US_DIV]: Similar simplifications for saturating
9104 (simplify_const_binary_operation) [SS_PLUS, US_PLUS, SS_MINUS,
9105 US_MINUS, SS_MULT, US_MULT, SMUL_HIGHPART, UMUL_HIGHPART]:
9106 Implement compile-time evaluation for constant operands.
9107 * dwarf2out.c (mem_loc_descriptor): Skip SMUL_HIGHPART and
9109 * doc/rtl.texi (smul_highpart, umul_highpart): Document RTX codes.
9110 * doc/md.texi (smul@var{m}3_highpart, umul@var{m3}_highpart):
9111 Mention the new smul_highpart and umul_highpart RTX codes.
9112 * doc/invoke.texi: Silence @xref "compilation" warnings.
9114 2021-10-07 Martin Jambor <mjambor@suse.cz>
9117 * ipa-prop.c (ipa_edge_args_sum_t::duplicate): Also handle the
9118 case when the source reference description corresponds to a
9119 referance taken in a function src->caller is inlined to.
9121 2021-10-07 Jan Hubicka <hubicka@ucw.cz>
9124 * ipa-modref-tree.h (modref_access_node::contains_p): Handle offsets
9126 (modref_access_node::try_merge_with): Add sanity check that there
9127 are no redundant entries in the list.
9129 2021-10-07 Richard Biener <rguenther@suse.de>
9131 PR tree-optimization/102608
9132 * tree-ssa-sccvn.c (visit_stmt): Drop .DEFERRED_INIT to
9135 2021-10-07 Martin Liska <mliska@suse.cz>
9137 * toplev.c (toplev::main): Make
9138 save_opt_decoded_options a pointer type
9139 * toplev.h: Likewise.
9141 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
9143 * config/gcn/gcn-valu.md (gather<mode>_insn_2offsets<exec>): Apply
9144 HAVE_GCN_ASM_GLOBAL_LOAD_FIXED.
9145 (scatter<mode>_insn_2offsets<exec_scatter>): Likewise.
9147 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
9149 * config/gcn/gcn-hsa.h (SRAMOPT): Include the whole option string.
9150 Adjust for new -msram-ecc=any behaviour.
9151 (ASM_SPEC): Adjust -mxnack and -msram-ecc usage.
9152 * config/gcn/gcn.c (output_file_start): Implement -msram-ecc=any.
9153 * config/gcn/mkoffload.c (EF_AMDGPU_XNACK): Rename to ...
9154 (EF_AMDGPU_XNACK_V3): ... this.
9155 (EF_AMDGPU_SRAM_ECC): Rename to ...
9156 (EF_AMDGPU_SRAM_ECC_V3): ... this.
9157 (EF_AMDGPU_FEATURE_XNACK_V4): New.
9158 (EF_AMDGPU_FEATURE_XNACK_UNSUPPORTED_V4): New.
9159 (EF_AMDGPU_FEATURE_XNACK_ANY_V4): New.
9160 (EF_AMDGPU_FEATURE_XNACK_OFF_V4): New.
9161 (EF_AMDGPU_FEATURE_XNACK_ON_V4): New.
9162 (EF_AMDGPU_FEATURE_SRAMECC_V4): New.
9163 (EF_AMDGPU_FEATURE_SRAMECC_UNSUPPORTED_V4): New.
9164 (EF_AMDGPU_FEATURE_SRAMECC_ANY_V4): New.
9165 (EF_AMDGPU_FEATURE_SRAMECC_OFF_V4): New.
9166 (EF_AMDGPU_FEATURE_SRAMECC_ON_V4): New.
9167 (SET_XNACK_ON): New.
9168 (SET_XNACK_OFF): New.
9170 (SET_SRAM_ECC_ON): New.
9171 (SET_SRAM_ECC_ANY): New.
9172 (SET_SRAM_ECC_OFF): New.
9173 (TEST_SRAM_ECC_ANY): New.
9174 (TEST_SRAM_ECC_ON): New.
9175 (main): Implement HSACOv4 and -msram-ecc=any.
9177 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
9179 * config.in: Regenerate.
9180 * config/gcn/gcn-hsa.h (X_FIJI): New macro.
9184 (A_FIJI): Rename to ...
9186 (A_900): Rename to ...
9188 (A_906): Rename to ...
9190 (A_908): Rename to ...
9192 (SRAMOPT): New macro.
9193 (ASM_SPEC): Adjust xnack option usage.
9194 * config/gcn/gcn.c (output_file_start): Adjust amdgcn_target usage.
9195 * configure: Regenerate.
9196 * configure.ac: Detect LLVM assembler dialect.
9198 2021-10-07 Richard Biener <rguenther@suse.de>
9200 * tree-pretty-print.c (dump_generic_node): Do not elide
9201 printing '&' when dumping with -gimple.
9203 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
9205 * gimple-range-cache.cc (non_null_ref::adjust_range): Call new
9207 * gimple-range-fold.cc (adjust_pointer_diff_expr): Ditto.
9208 (adjust_imagpart_expr): Ditto.
9209 * value-range.cc (irange::irange_intersect): Call new routine if
9210 RHS is a single pair.
9211 (irange::intersect): New wide_int version.
9212 * value-range.h (class irange): New prototype.
9214 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
9216 * gimple-range-edge.cc (gimple_outgoing_range::gimple_outgoing_range):
9217 Add parameter to limit size when recognizing switches.
9218 (gimple_outgoing_range::edge_range_p): Check size limit.
9219 * gimple-range-edge.h (gimple_outgoing_range): Add size field.
9220 * gimple-range-gori.cc (gori_map::calculate_gori): Ignore switches
9221 that exceed the size limit.
9222 (gori_compute::gori_compute): Add initializer.
9223 * params.opt (evrp-switch-limit): New.
9224 * doc/invoke.texi: Update docs.
9226 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
9228 * value-range.h (irange::set_varying): Use TYPE_MIN_VALUE and
9229 TYPE_MAX_VALUE instead of creating new trees when possible.
9231 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
9233 * gimple-range-cache.cc (non_null_ref::adjust_range): Check for
9234 zero and non-zero more efficently.
9236 2021-10-06 Richard Biener <rguenther@suse.de>
9239 * dumpfile.h (TDF_GIMPLE_VAL): New.
9240 (dump_flag): Re-order and adjust TDF_* flags. Make
9241 the enum uint32_t. Use std::underlying_type in the
9243 (optgroup_flag): Likewise for the operator overloads.
9244 * tree-pretty-print.c (dump_generic_node): Wrap ADDR_EXPR
9245 in _Literal if TDF_GIMPLE_VAL.
9246 * gimple-pretty-print.c (dump_gimple_assign): Add
9247 TDF_GIMPLE_VAL to flags when dumping operands where only
9248 is_gimple_val are allowed.
9249 (dump_gimple_cond): Likewise.
9251 2021-10-06 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
9253 * gimple-isel.cc (gimple_expand_vec_cond_expr): Remove redundant if
9256 2021-10-05 qing zhao <qing.zhao@oracle.com>
9258 PR middle-end/102359
9259 * gimplify.c (gimplify_decl_expr): Not add initialization for an
9260 auto variable when it has been initialized by frontend.
9262 2021-10-05 Aldy Hernandez <aldyh@redhat.com>
9264 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
9267 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
9269 * common/config/avr/avr-common.c (avr_handle_option): Mark
9270 argument as ATTRIBUTE_UNUSED.
9272 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
9274 * config/lm32/uclinux-elf.h (LINK_GCC_C_SEQUENCE_SPEC):
9275 Undefine before redefinition.
9277 2021-10-05 Richard Biener <rguenther@suse.de>
9279 * toplev.c (no_backend): Remove global var.
9280 (process_options): Pass in no_backend, move post_options
9281 langhook call to toplev::main.
9282 (do_compile): Pass in no_backend, move process_options call
9284 (toplev::run_self_tests): Check no_backend at the caller.
9285 (toplev::main): Call post_options and process_options
9286 split out from do_compile, do self-tests only if
9287 no_backend is initialized.
9289 2021-10-05 Richard Biener <rguenther@suse.de>
9291 * tree-cfg.c (dump_function_to_file): Dump the UID of the
9292 function as part of the name when requested.
9293 * tree-pretty-print.c (dump_function_name): Dump the UID when
9294 requested and the langhook produced the actual name.
9296 2021-10-05 Richard Biener <rguenther@suse.de>
9298 PR middle-end/102587
9299 PR middle-end/102285
9300 * internal-fn.c (expand_DEFERRED_INIT): Fall back to
9301 zero-initialization as last resort, use the constant
9302 size as given by the DEFERRED_INIT argument to build
9305 2021-10-04 Marek Polacek <polacek@redhat.com>
9308 * doc/invoke.texi: Document -Warray-compare.
9310 2021-10-04 Richard Biener <rguenther@suse.de>
9312 * gimplify.c (is_var_need_auto_init): DECL_HARD_REGISTER
9313 variables are not to be initialized.
9315 2021-10-04 Richard Biener <rguenther@suse.de>
9317 * expr.h (non_mem_decl_p): Declare.
9318 (mem_ref_refers_to_non_mem_p): Likewise.
9319 * expr.c (non_mem_decl_p): Export.
9320 (mem_ref_refers_to_non_mem_p): Likewise.
9321 * internal-fn.c (expand_DEFERRED_INIT): Do not expand the LHS
9322 but check the base with mem_ref_refers_to_non_mem_p
9325 2021-10-04 Richard Biener <rguenther@suse.de>
9327 PR tree-optimization/102570
9328 * tree-ssa-sccvn.h (vn_reference_op_struct): Document
9329 we are using clique for the internal function code.
9330 * tree-ssa-sccvn.c (vn_reference_op_eq): Compare the
9331 internal function code.
9332 (print_vn_reference_ops): Print the internal function code.
9333 (vn_reference_op_compute_hash): Hash it.
9334 (copy_reference_ops_from_call): Record it.
9335 (visit_stmt): Remove the restriction around internal function
9337 (fully_constant_vn_reference_p): Use fold_const_call and handle
9339 (vn_reference_eq): Compare call return types.
9340 * tree-ssa-pre.c (create_expression_by_pieces): Handle
9341 generating calls to internal functions.
9342 (compute_avail): Remove the restriction around internal function
9345 2021-10-04 Aldy Hernandez <aldyh@redhat.com>
9347 PR tree-optimization/102560
9348 * gimple-ssa-warn-alloca.c (alloca_call_type): Remove static
9349 marker for invalid_range.
9351 2021-10-04 Richard Biener <rguenther@suse.de>
9353 PR middle-end/102587
9354 * internal-fn.c (expand_DEFERRED_INIT): Guard register
9355 initialization path an avoid initializing VLA registers
9358 2021-10-04 Eric Botcazou <ebotcazou@adacore.com>
9360 * config/rs6000/vxworks.h (TARGET_INIT_LIBFUNCS): Delete.
9362 2021-10-03 Martin Liska <mliska@suse.cz>
9364 * toplev.c (toplev::main): Check opt_index if it is a part
9367 2021-10-02 Aldy Hernandez <aldyh@redhat.com>
9369 PR tree-optimization/102563
9370 * range-op.cc (operator_lshift::op1_range): Do not clobber
9373 2021-10-02 Martin Liska <mliska@suse.cz>
9375 * toplev.c (toplev::main): save_decoded_options[0] is program
9376 name and so it should be skipped.
9378 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
9380 PR tree-optimization/102546
9381 * range-op.cc (operator_lshift::op1_range): Teach range-ops that
9382 X << Y is non-zero implies X is also non-zero.
9384 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9386 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
9388 * config/aarch64/aarch64-tune.md: Regenerate.
9389 * doc/invoke.texi: Update docs.
9391 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9393 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
9395 * config/aarch64/aarch64-tune.md: Regenerate.
9396 * doc/invoke.texi: Update docs.
9398 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9400 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
9402 * config/aarch64/aarch64-tune.md: Regenerate.
9403 * doc/invoke.texi: Update docs.
9405 2021-10-01 Martin Sebor <msebor@redhat.com>
9408 * doc/invoke.texi (-Waddress): Update.
9409 * gengtype.c (write_types): Avoid -Waddress.
9410 * poly-int.h (POLY_SET_COEFF): Avoid using null.
9412 2021-10-01 John David Anglin <danglin@gcc.gnu.org>
9415 * config/pa/pa.c (pa_option_override): Default to dwarf version 4
9418 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9420 * config/aarch64/aarch64.h (AARCH64_FL_V9): Update value.
9422 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
9424 * gimple-range-path.cc (path_range_query::compute_ranges): Use
9426 * gimple-range-path.h (class path_range_query): Remove shadowed
9428 (path_range_query::get_path_oracle): New.
9430 2021-10-01 Jakub Jelinek <jakub@redhat.com>
9431 Richard Biener <rguenther@suse.de>
9434 * doc/invoke.texi (-fsanitize=integer-divide-by-zero): Remove
9435 INT_MIN / -1 division detection from here ...
9436 (-fsanitize=signed-integer-overflow): ... and add it here.
9438 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9440 * config/aarch64/aarch64-arches.def (AARCH64_ARCH): Added
9442 * config/aarch64/aarch64.h (AARCH64_FL_V9): New.
9443 (AARCH64_FL_FOR_ARCH9): New flags for Armv9-A.
9444 (AARCH64_ISA_V9): New ISA flag.
9445 * doc/invoke.texi: Update docs.
9447 2021-10-01 Martin Liska <mliska@suse.cz>
9449 * toplev.c (toplev::main): Save decoded optimization options.
9450 * toplev.h (save_opt_decoded_options): New.
9451 * doc/extend.texi: Be more clear about optimize and target
9454 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
9456 * explow.c: Include langhooks.h.
9457 (set_stack_check_libfunc): Build a proper function type.
9459 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
9462 * config/i386/i386.c (legitimate_pic_address_disp_p): For PE-COFF do
9463 not return true for external weak function symbols in medium model.
9465 2021-10-01 Jakub Jelinek <jakub@redhat.com>
9467 * tree.h (OMP_CLAUSE_ORDER_REPRODUCIBLE): Define.
9468 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_ORDER>: Print
9469 reproducible: for OMP_CLAUSE_ORDER_REPRODUCIBLE.
9470 * omp-general.c (omp_extract_for_data): If OMP_CLAUSE_ORDER is seen
9471 without OMP_CLAUSE_ORDER_UNCONSTRAINED, overwrite sched_kind to
9472 OMP_CLAUSE_SCHEDULE_STATIC.
9474 2021-10-01 Richard Biener <rguenther@suse.de>
9476 PR middle-end/102518
9477 * tree-inline.c (setup_one_parameter): Avoid substituting
9478 an invariant into contexts where a GIMPLE register is not valid.
9480 2021-09-30 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9482 * config/arm/arm-cpus.in: Add Cortex-R52+ CPU.
9483 * config/arm/arm-tables.opt: Regenerate.
9484 * config/arm/arm-tune.md: Regenerate.
9485 * doc/invoke.texi: Update docs.
9487 2021-09-30 Uroš Bizjak <ubizjak@gmail.com>
9490 * config/i386/i386.md
9491 (sign_extend:WIDE (any_logic:NARROW (memory, immediate)) splitters):
9494 2021-09-30 Tobias Burnus <tobias@codesourcery.com>
9496 * omp-low.c (omp_runtime_api_call): Add omp_aligned_{,c}alloc and
9497 omp_{c,re}alloc, fix omp_alloc/omp_free.
9499 2021-09-30 Martin Liska <mliska@suse.cz>
9501 * defaults.h (ASM_OUTPUT_ASCII): Do not hide global variable
9502 asm_out_file and stream directly to MYFILE.
9504 2021-09-30 Richard Biener <rguenther@suse.de>
9506 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
9507 Restore and fix condition under which we apply npeel to
9508 the DRs misalignment value.
9510 2021-09-30 Richard Biener <rguenther@suse.de>
9512 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
9513 Fix npeel check for variable amount of peeling.
9515 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
9517 * lto-wrapper.c (run_gcc): Plug snprintf overflow.
9519 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
9521 * gimple-range.cc (gimple_ranger::debug): New.
9522 * gimple-range.h (class gimple_ranger): Add debug.
9524 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
9526 PR middle-end/102519
9527 * tree-vrp.c (hybrid_threader::~hybrid_threader): Free m_query.
9529 2021-09-29 Indu Bhagat <indu.bhagat@oracle.com>
9532 * btfout.c (GTY): Add GTY (()) albeit for cosmetic only purpose.
9533 (btf_finalize): Empty the hash_map btf_var_ids.
9535 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
9537 * tree-vrp.c (thread_through_all_blocks): Return bool.
9538 (execute_vrp_threader): Return TODO_* flags.
9539 (pass_data_vrp_threader): Set todo_flags_finish to 0.
9541 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
9543 * timevar.def (TV_TREE_VRP_THREADER): New.
9544 * tree-vrp.c: Use TV_TREE_VRP_THREADER for VRP threader pass.
9546 2021-09-29 David Faust <david.faust@oracle.com>
9548 * config.gcc (bpf-*-*): Do not overwrite extra_headers.
9550 2021-09-29 Jonathan Wright <jonathan.wright@arm.com>
9552 * config/aarch64/aarch64-builtins.c (TYPES_BINOP_PPU): Define
9553 new type qualifier enum.
9554 (TYPES_TERNOP_SSSU): Likewise.
9555 (TYPES_TERNOP_PPPU): Likewise.
9556 * config/aarch64/aarch64-simd-builtins.def: Define PPU, SSU,
9557 PPPU and SSSU builtin generator macros for qtbl1 and qtbx1
9559 * config/aarch64/arm_neon.h (vqtbl1_p8): Use type-qualified
9560 builtin and remove casts.
9561 (vqtbl1_s8): Likewise.
9562 (vqtbl1q_p8): Likewise.
9563 (vqtbl1q_s8): Likewise.
9564 (vqtbx1_s8): Likewise.
9565 (vqtbx1_p8): Likewise.
9566 (vqtbx1q_s8): Likewise.
9567 (vqtbx1q_p8): Likewise.
9568 (vtbl1_p8): Likewise.
9569 (vtbl2_p8): Likewise.
9570 (vtbx2_p8): Likewise.
9572 2021-09-29 Richard Biener <rguenther@suse.de>
9574 * tree-vect-data-refs.c (vect_dr_misalign_for_aligned_access):
9576 (vect_update_misalignment_for_peel): Use it to update
9577 misaligned to the value necessary for an aligned access.
9578 (vect_get_peeling_costs_all_drs): Likewise.
9579 (vect_enhance_data_refs_alignment): Likewise.
9581 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9583 * config/aarch64/aarch64.c (aarch64_expand_cpymem): Count number of
9584 emitted operations and adjust heuristic for code size.
9586 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9588 * config/aarch64/aarch64.c (aarch64_expand_setmem): Count number of
9589 emitted operations and adjust heuristic for code size.
9591 2021-09-29 Jakub Jelinek <jakub@redhat.com>
9593 PR middle-end/102504
9594 * gimplify.c (gimplify_scan_omp_clauses): Use omp_check_private even
9595 in OMP_SCOPE clauses, not just on worksharing construct clauses.
9597 2021-09-28 Geng Qi <gengqi@linux.alibaba.com>
9599 * config/riscv/riscv.md (mulv<mode>4): Call gen_smul<mode>3_highpart.
9600 (<u>mulditi3): Call <su>muldi3_highpart.
9601 (<u>muldi3_highpart): Rename to <su>muldi3_highpart.
9602 (<u>mulsidi3): Call <su>mulsi3_highpart.
9603 (<u>mulsi3_highpart): Rename to <su>mulsi3_highpart.
9605 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
9607 * config/darwin.h (DSYMUTIL_SPEC): Recognize D sources.
9609 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
9611 * config/rs6000/darwin.h (FIXED_R13): Add for PPC64.
9612 (FIRST_SAVED_GP_REGNO): Save from R13 even when it is one
9615 2021-09-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9617 * config/aarch64/aarch64.h (AARCH64_FL_LS64): Define
9618 (AARCH64_FL_V8_7): Likewise.
9619 (AARCH64_FL_FOR_ARCH8_7): Likewise.
9620 * config/aarch64/aarch64-arches.def (armv8.7-a): Define.
9621 * config/aarch64/aarch64-option-extensions.def (ls64): Define.
9622 * doc/invoke.texi: Document the above.
9624 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9626 * dbgcnt.c (dbg_cnt_counter): New.
9627 * dbgcnt.h (dbg_cnt_counter): New.
9628 * dumpfile.c (dump_options): Add entry for TDF_THREADING.
9629 * dumpfile.h (enum dump_flag): Add TDF_THREADING.
9630 * gimple-range-path.cc (DEBUG_SOLVER): Use TDF_THREADING.
9631 * tree-ssa-threadupdate.c (dump_jump_thread_path): Dump out
9634 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9636 * cfgcleanup.c (pass_jump::execute): Check
9637 flag_expensive_optimizations.
9638 (pass_jump_after_combine::gate): Same.
9639 * doc/invoke.texi (-fthread-jumps): Enable for -O1.
9640 * opts.c (default_options_table): Enable -fthread-jumps at -O1.
9641 * tree-ssa-threadupdate.c
9642 (fwd_jt_path_registry::remove_jump_threads_including): Bail unless
9645 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
9647 * tree-ssa-reassoc.c (biased_names): New global.
9648 (propagate_bias_p): New function.
9649 (loop_carried_phi): Remove.
9650 (propagate_rank): Propagate bias along single uses.
9651 (get_rank): Update biased_names when needed.
9653 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
9655 * passes.def (pass_reassoc): Rename parameter to early_p.
9656 * tree-ssa-reassoc.c (reassoc_bias_loop_carried_phi_ranks_p):
9658 (phi_rank): Don't bias loop-carried phi ranks
9659 before vectorization pass.
9660 (execute_reassoc): Add bias_loop_carried_phi_ranks_p parameter.
9661 (pass_reassoc::pass_reassoc): Add bias_loop_carried_phi_ranks_p
9663 (pass_reassoc::set_param): Set bias_loop_carried_phi_ranks_p
9665 (pass_reassoc::execute): Pass bias_loop_carried_phi_ranks_p to
9667 (pass_reassoc::bias_loop_carried_phi_ranks_p): New member.
9669 2021-09-28 Jakub Jelinek <jakub@redhat.com>
9672 * config/i386/i386.c (standard_80387_constant_p): Don't recognize
9673 special 80387 instruction XFmode constants if flag_rounding_math.
9675 2021-09-28 Richard Biener <rguenther@suse.de>
9677 PR tree-optimization/100112
9678 * tree-ssa-sccvn.c (visit_reference_op_load): Record the
9679 referece into the hashtable twice in case last_vuse is
9680 different from the original vuse on the stmt.
9682 2021-09-28 Jakub Jelinek <jakub@redhat.com>
9684 PR middle-end/102492
9685 * gimplify.c (gimplify_adjust_omp_clauses_1): Don't call the
9686 omp_finish_clause langhook on implicitly added OMP_CLAUSE_PRIVATE
9687 clauses on SIMD constructs.
9689 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9691 PR tree-optimization/102511
9692 * gimple-range-path.cc (path_range_query::range_on_path_entry):
9693 Return VARYING when nothing found.
9695 2021-09-28 Hongyu Wang <hongyu.wang@intel.com>
9698 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): Add
9700 (VALID_SSE2_REG_VHF_MODE): Add V4HFmode and V2HFmode.
9701 (VALID_MMX_REG_MODE): Add V4HFmode.
9702 (SSE_REG_MODE_P): Replace VALID_AVX512FP16_REG_MODE with
9703 vector mode condition.
9704 * config/i386/i386.c (classify_argument): Parse V4HF/V2HF
9706 (function_arg_32): Add V4HFmode.
9707 (function_arg_advance_32): Likewise.
9708 * config/i386/i386.md (mode): Add V4HF/V2HF.
9709 (MODE_SIZE): Likewise.
9710 * config/i386/mmx.md (MMXMODE): Add V4HF mode.
9711 (V_32): Add V2HF mode.
9712 (VHF_32_64): New mode iterator.
9713 (*mov<mode>_internal): Adjust sse alternatives to support
9715 (*mov<mode>_internal): Adjust sse alternatives to support
9717 (<insn><mode>3): New define_insn for add/sub/mul/div.
9719 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9721 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Check
9723 (pass_early_thread_jumps::gate): Same.
9724 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
9725 Return if !flag_thread_jumps.
9726 * tree-ssa-threadupdate.c
9727 (jt_path_registry::register_jump_thread): Assert that
9728 flag_thread_jumps is true.
9730 2021-09-28 liuhongt <hongtao.liu@intel.com>
9733 (simplify_context::simplify_binary_operation_1): Relax
9734 condition of simplifying (vec_concat:M (vec_select op0
9735 index0)(vec_select op1 index1)) to allow different modes
9736 between op0 and M, but have same inner mode.
9738 2021-09-28 liuhongt <hongtao.liu@intel.com>
9740 * config/i386/i386-expand.c (emit_reduc_half): Handle
9741 V8HF/V16HF/V32HFmode.
9742 * config/i386/sse.md (REDUC_SSE_PLUS_MODE): Add V8HF.
9743 (REDUC_SSE_SMINMAX_MODE): Ditto.
9744 (REDUC_PLUS_MODE): Add V16HF and V32HF.
9745 (REDUC_SMINMAX_MODE): Ditto.
9747 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9749 * gimple-range-path.cc
9750 (path_range_query::precompute_ranges_in_block): Rename to...
9751 (path_range_query::compute_ranges_in_block): ...this.
9752 (path_range_query::precompute_ranges): Rename to...
9753 (path_range_query::compute_ranges): ...this.
9754 (path_range_query::precompute_relations): Rename to...
9755 (path_range_query::compute_relations): ...this.
9756 (path_range_query::precompute_phi_relations): Rename to...
9757 (path_range_query::compute_phi_relations): ...this.
9758 * gimple-range-path.h: Rename precompute* to compute*.
9759 * tree-ssa-threadbackward.c
9760 (back_threader::find_taken_edge_switch): Same.
9761 (back_threader::find_taken_edge_cond): Same.
9762 * tree-ssa-threadedge.c
9763 (hybrid_jt_simplifier::compute_ranges_from_state): Same.
9764 (hybrid_jt_state::register_equivs_stmt): Inline...
9765 * tree-ssa-threadedge.h: ...here.
9767 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9769 * tree-vrp.c (lhs_of_dominating_assert): Remove.
9770 (class vrp_jt_state): Remove.
9771 (class vrp_jt_simplifier): Remove.
9772 (vrp_jt_simplifier::simplify): Remove.
9773 (class vrp_jump_threader): Remove.
9774 (vrp_jump_threader::vrp_jump_threader): Remove.
9775 (vrp_jump_threader::~vrp_jump_threader): Remove.
9776 (vrp_jump_threader::before_dom_children): Remove.
9777 (vrp_jump_threader::after_dom_children): Remove.
9779 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9781 * passes.def (pass_vrp_threader): New.
9782 * tree-pass.h (make_pass_vrp_threader): Add make_pass_vrp_threader.
9783 * tree-ssa-threadedge.c (hybrid_jt_state::register_equivs_stmt): New.
9784 (hybrid_jt_simplifier::hybrid_jt_simplifier): New.
9785 (hybrid_jt_simplifier::simplify): New.
9786 (hybrid_jt_simplifier::compute_ranges_from_state): New.
9787 * tree-ssa-threadedge.h (class hybrid_jt_state): New.
9788 (class hybrid_jt_simplifier): New.
9789 * tree-vrp.c (execute_vrp): Remove ASSERT_EXPR based jump
9791 (class hybrid_threader): New.
9792 (hybrid_threader::hybrid_threader): New.
9793 (hybrid_threader::~hybrid_threader): New.
9794 (hybrid_threader::before_dom_children): New.
9795 (hybrid_threader::after_dom_children): New.
9796 (execute_vrp_threader): New.
9797 (class pass_vrp_threader): New.
9798 (make_pass_vrp_threader): New.
9800 2021-09-27 Martin Liska <mliska@suse.cz>
9802 * output.h (enum section_flag): New.
9803 (SECTION_FORGET): Remove.
9804 (SECTION_ENTSIZE): Make it (1UL << 8) - 1.
9805 (SECTION_STYLE_MASK): Define it based on other enum
9807 * varasm.c (switch_to_section): Remove unused handling of
9810 2021-09-27 Martin Liska <mliska@suse.cz>
9812 * common.opt: Add new variable flag_default_complex_method.
9813 * opts.c (finish_options): Handle flags related to
9814 x_flag_complex_method.
9815 * toplev.c (process_options): Remove option handling related
9816 to flag_complex_method.
9818 2021-09-27 Richard Biener <rguenther@suse.de>
9820 PR middle-end/102450
9821 * gimple-fold.c (gimple_fold_builtin_memory_op): Avoid using
9822 type_for_size, instead use int_mode_for_size.
9824 2021-09-27 Andrew Pinski <apinski@marvell.com>
9827 * gimplify.c (gimplify_save_expr): Return early
9828 if the type of val is error_mark_node.
9830 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9832 * tree-ssanames.c (ssa_name_has_boolean_range): Use
9835 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9837 * gimple-ssa-evrp-analyze.h (class evrp_range_analyzer): Remove
9838 vrp_visit_cond_stmt.
9839 * tree-ssa-dom.c (cprop_operand): Convert to range_query API.
9840 (cprop_into_stmt): Same.
9841 (dom_opt_dom_walker::optimize_stmt): Same.
9843 2021-09-27 Richard Biener <rguenther@suse.de>
9845 PR tree-optimization/97351
9846 PR tree-optimization/97352
9847 PR tree-optimization/82426
9848 * tree-vectorizer.h (dr_misalignment): Add vector type
9850 (aligned_access_p): Likewise.
9851 (known_alignment_for_access_p): Likewise.
9852 (vect_supportable_dr_alignment): Likewise.
9853 (vect_known_alignment_in_bytes): Likewise. Refactor.
9854 (DR_MISALIGNMENT): Remove.
9855 (vect_update_shared_vectype): Likewise.
9856 * tree-vect-data-refs.c (dr_misalignment): Refactor, handle
9857 a vector type with larger alignment requirement and apply
9858 the negative step adjustment here.
9859 (vect_calculate_target_alignment): Remove.
9860 (vect_compute_data_ref_alignment): Get explicit vector type
9861 argument, do not apply a negative step alignment adjustment
9863 (vect_slp_analyze_node_alignment): Re-analyze alignment
9864 when we re-visit the DR with a bigger desired alignment but
9865 keep more precise results from smaller alignments.
9866 * tree-vect-slp.c (vect_update_shared_vectype): Remove.
9867 (vect_slp_analyze_node_operations_1): Do not update the
9868 shared vector type on stmts.
9869 * tree-vect-stmts.c (vect_analyze_stmt): Push/pop the
9870 vector type of an SLP node to the representative stmt-info.
9871 (vect_transform_stmt): Likewise.
9873 2021-09-27 liuhongt <hongtao.liu@intel.com>
9876 2021-09-09 liuhongt <hongtao.liu@intel.com>
9879 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
9880 (reduc_plus_scal_v4sf): .. this, New define_expand.
9881 (reduc_plus_scal_v2df): .. and this, New define_expand.
9883 2021-09-26 liuhongt <hongtao.liu@intel.com>
9885 * doc/extend.texi (Half-Precision): Remove storage only
9886 description for _Float16 w/o avx512fp16.
9888 2021-09-25 Dimitar Dimitrov <dimitar@dinux.eu>
9890 * config/pru/constraints.md (Rrio): New constraint.
9891 * config/pru/predicates.md (regio_operand): New predicate.
9892 * config/pru/pru-pragma.c (pru_register_pragmas): Register
9893 the __regio_symbol address space.
9894 * config/pru/pru-protos.h (pru_symref2ioregno): Declaration.
9895 * config/pru/pru.c (pru_symref2ioregno): New helper function.
9896 (pru_legitimate_address_p): Remove.
9897 (pru_addr_space_legitimate_address_p): Use the address space
9899 (pru_nongeneric_pointer_addrspace): New helper function.
9900 (pru_insert_attributes): New function to validate __regio_symbol
9902 (TARGET_INSERT_ATTRIBUTES): New macro.
9903 (TARGET_LEGITIMATE_ADDRESS_P): Remove.
9904 (TARGET_ADDR_SPACE_LEGITIMATE_ADDRESS_P): New macro.
9905 * config/pru/pru.h (enum reg_class): Add REGIO_REGS class.
9906 * config/pru/pru.md (*regio_readsi): New pattern to read I/O
9908 (*regio_nozext_writesi): New pattern to write to I/O registers.
9909 (*regio_zext_write_r30<EQS0:mode>): Ditto.
9910 * doc/extend.texi: Document the new PRU Named Address Space.
9912 2021-09-24 Patrick Palka <ppalka@redhat.com>
9916 * real.c (encode_ieee_double): Avoid unwanted sign extension.
9917 (encode_ieee_quad): Likewise.
9919 2021-09-24 Vladimir Makarov <vmakarov@redhat.com>
9921 PR rtl-optimization/102147
9922 * ira-build.c (ira_conflict_vector_profitable_p): Make
9923 profitability calculation independent of host compiler pointer and
9926 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
9928 * gimple-range-path.cc (path_range_query::path_range_query):
9929 Move debugging header...
9930 (path_range_query::precompute_ranges): ...here.
9931 (path_range_query::internal_range_of_expr): Do not call
9932 range_on_path_entry if NAME is defined in the current block.
9934 2021-09-24 Richard Biener <rguenther@suse.de>
9936 * cfghooks.c (verify_flow_info): Verify unallocated BB and
9937 edge flags are not set.
9939 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
9941 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
9943 (jt_path_registry::register_jump_thread): Call
9944 cancel_invalid_paths.
9945 * tree-ssa-threadupdate.h (class jt_path_registry): Add
9946 cancel_invalid_paths.
9948 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
9950 PR tree-optimization/102400
9951 * tree-ssa-sccvn.c (vn_reference_insert_pieces): Initialize
9952 result_vdef to zero value.
9954 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
9956 PR tree-optimization/102451
9957 * tree-ssa-dse.c (delete_dead_or_redundant_call): Record bb of stmt
9960 2021-09-24 Hongyu Wang <hongyu.wang@intel.com>
9962 * config/i386/sse.md (cond_<insn><mode>): Extend to support
9964 (cond_mul<mode>): Likewise.
9965 (cond_div<mode>): Likewise.
9966 (cond_<code><mode>): Likewise.
9967 (cond_fma<mode>): Likewise.
9968 (cond_fms<mode>): Likewise.
9969 (cond_fnma<mode>): Likewise.
9970 (cond_fnms<mode>): Likewise.
9972 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
9974 PR tree-optimization/102463
9975 * gimple-range-fold.cc (fold_using_range::relation_fold_and_or): If
9976 there is no range-ops handler, don't look for a relation.
9978 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
9980 * gimple-range-cache.cc (ranger_cache::ranger_cache): Take
9981 non-executable_edge flag as parameter.
9982 * gimple-range-cache.h (ranger_cache): Adjust prototype.
9983 * gimple-range-gori.cc (gori_compute::gori_compute): Take
9984 non-executable_edge flag as parameter.
9985 (gori_compute::outgoing_edge_range_p): Check new flag.
9986 * gimple-range-gori.h (gori_compute): Adjust prototype.
9987 * gimple-range.cc (gimple_ranger::gimple_ranger): Create new flag.
9988 (gimple_ranger::range_on_edge): Check new flag.
9989 * gimple-range.h (gimple_ranger::non_executable_edge_flag): New.
9990 * gimple-ssa-evrp.c (rvrp_folder): Pass ranger flag to simplifer.
9991 (hybrid_folder::hybrid_folder): Set ranger non-executable flag value.
9992 (hybrid_folder::fold_stmt): Set flag value in the simplifer.
9993 * vr-values.c (simplify_using_ranges::set_and_propagate_unexecutable):
9994 Use not_executable flag if provided inmstead of EDGE_EXECUTABLE.
9995 (simplify_using_ranges::simplify_switch_using_ranges): Clear
9996 EDGE_EXECUTABLE like it originally did.
9997 (simplify_using_ranges::cleanup_edges_and_switches): Clear any
9998 NON_EXECUTABLE flags.
9999 (simplify_using_ranges::simplify_using_ranges): Adjust.
10000 * vr-values.h (class simplify_using_ranges): Adjust.
10001 (simplify_using_ranges::set_range_query): Add non-executable flag param.
10003 2021-09-23 Bill Schmidt <wschmidt@linux.ibm.com>
10006 * config/rs6000/rs6000-call.c (rs6000_aggregate_candidate): Detect
10007 zero-width bit fields and return indicator.
10008 (rs6000_discover_homogeneous_aggregate): Diagnose when the
10009 presence of a zero-width bit field changes parameter passing in
10012 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
10014 * gimple-range-fold.cc (fold_using_range::range_of_phi):
10015 Remove dominator check.
10017 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
10019 * gimple-range-path.cc (path_range_query::precompute_relations):
10020 Hoist edge calculations before using EDGE_SUCC.
10022 2021-09-23 Jonathan Wakely <jwakely@redhat.com>
10024 * configure.ac: Fix --with-multilib-list description.
10025 * configure: Regenerate.
10027 2021-09-23 Richard Biener <rguenther@suse.de>
10029 PR tree-optimization/102448
10030 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
10031 Clear alignment info copied from DR_PTR_INFO.
10033 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
10035 * config/i386/i386-expand.c (ix86_use_mask_cmp_p): Enable
10037 * config/i386/sse.md (sseintvecmodelower): Add HF vector modes.
10038 (<avx512>_store<mode>_mask): Extend to support HF vector modes.
10039 (vec_cmp<mode><avx512fmaskmodelower>): Likewise.
10040 (vcond_mask_<mode><avx512fmaskmodelower>): Likewise.
10041 (vcond<mode><mode>): New expander.
10042 (vcond<mode><sseintvecmodelower>): Likewise.
10043 (vcond<sseintvecmodelower><mode>): Likewise.
10044 (vcondu<mode><sseintvecmodelower>): Likewise.
10046 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
10048 * config/i386/sse.md (extend<ssePHmodelower><mode>2):
10050 (extendv4hf<mode>2): Likewise.
10051 (extendv2hfv2df2): Likewise.
10052 (trunc<mode><ssePHmodelower>2): Likewise.
10053 (avx512fp16_vcvt<castmode>2ph_<mode>): Rename to ...
10054 (trunc<mode>v4hf2): ... this, and drop constraints.
10055 (avx512fp16_vcvtpd2ph_v2df): Rename to ...
10056 (truncv2dfv2hf2): ... this, and likewise.
10058 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
10060 * config/i386/sse.md (float<floatunssuffix><mode><ssePHmodelower>2):
10062 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>):
10064 (float<floatunssuffix><mode>v4hf2): ... this, and drop constraints.
10065 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Rename to ...
10066 (float<floatunssuffix>v2div2hf2): ... this, and likewise.
10068 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
10070 * config/i386/i386.md (fix<fixunssuffix>_trunchf<mode>2): New expander.
10071 (fixuns_trunchfhi2): Likewise.
10072 (*fixuns_trunchfsi2zext): New define_insn.
10073 * config/i386/sse.md (ssePHmodelower): New mode_attr.
10074 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
10075 New expander for same element vector fix_truncate.
10076 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
10077 Likewise for V4HF to V4SI/V4DI fix_truncate.
10078 (fix<fixunssuffix>_truncv2hfv2di2):
10079 Likeise for V2HF to V2DI fix_truncate.
10081 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
10083 * config/i386/i386.md (<code>hf3): New expander.
10085 2021-09-23 liuhongt <hongtao.liu@intel.com>
10087 * config/i386/sse.md (FMAMODEM): extend to handle FP16.
10088 (VFH_SF_AVX512VL): Extend to handle HFmode.
10089 (VF_SF_AVX512VL): Deleted.
10091 2021-09-23 liuhongt <hongtao.liu@intel.com>
10093 * config/i386/i386.md (rinthf2): New expander.
10094 (nearbyinthf2): New expander.
10096 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
10098 * tree-ssa-dom.c (class dom_jump_threader_simplifier): Rename...
10099 (class dom_jt_state): ...this and provide virtual overrides.
10100 (dom_jt_state::register_equiv): New.
10101 (class dom_jt_simplifier): Rename from
10102 dom_jump_threader_simplifier.
10103 (dom_jump_threader_simplifier::simplify): Rename...
10104 (dom_jt_simplifier::simplify): ...to this.
10105 (pass_dominator::execute): Use dom_jt_simplifier and
10107 * tree-ssa-threadedge.c (jump_threader::jump_threader):
10109 (jt_state::register_equivs_stmt): Abstract out...
10110 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
10112 (jump_threader::thread_around_empty_blocks): Update state.
10113 (jump_threader::thread_through_normal_block): Same.
10114 (jt_state::jt_state): Remove.
10115 (jt_state::push): Remove pass specific bits. Keep block vector
10117 (jt_state::append_path): New.
10118 (jt_state::pop): Remove pass specific bits.
10119 (jt_state::register_equiv): Same.
10120 (jt_state::record_ranges_from_stmt): Same.
10121 (jt_state::register_equivs_on_edge): Same. Rename...
10122 (jt_state::register_equivs_edge): ...to this.
10123 (jt_state::dump): New.
10124 (jt_state::debug): New.
10125 (jump_threader_simplifier::simplify): Remove.
10126 (jt_state::get_path): New.
10127 * tree-ssa-threadedge.h (class jt_simplifier): Make into a base
10128 class. Expose common functionality as virtual methods.
10129 (class jump_threader_simplifier): Same. Rename...
10130 (class jt_simplifier): ...to this.
10131 * tree-vrp.c (class vrp_jump_threader_simplifier): Rename...
10132 (class vrp_jt_simplifier): ...to this. Provide pass specific
10134 (class vrp_jt_state): New.
10135 (vrp_jump_threader_simplifier::simplify): Rename...
10136 (vrp_jt_simplifier::simplify): ...to this. Inline code from
10137 what used to be the base class.
10138 (vrp_jump_threader::vrp_jump_threader): Use vrp_jt_state and
10141 2021-09-22 Tobias Burnus <tobias@codesourcery.com>
10144 * doc/invoke.texi (-Wno-missing-include-dirs.): Document Fortran
10147 2021-09-22 Roger Sayle <roger@nextmovesoftware.com>
10148 Richard Biener <rguenther@suse.de>
10150 * match.pd (negation simplifications): Implement some negation
10151 folding transformations from fold-const.c's fold_negate_expr.
10152 * tree-ssa-sccvn.c (vn_nary_build_or_lookup_1): Add a SIMPLIFY
10153 argument, to control whether the op should be simplified prior
10154 to looking up/assigning a value number.
10155 (vn_nary_build_or_lookup): Update call to vn_nary_build_or_lookup_1.
10156 (vn_nary_simplify): Likewise.
10157 (visit_nary_op): Likewise, but when constructing a NEGATE_EXPR
10158 now call vn_nary_build_or_lookup_1 disabling simplification.
10160 2021-09-22 Jiufu Guo <guojiufu@linux.ibm.com>
10162 PR tree-optimization/102087
10163 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
10164 Update bound/cmp/control for niter.
10166 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
10168 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
10169 Move check for non-empty BB here.
10170 (fur_source::register_outgoing_edges): ...from here.
10172 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
10174 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
10175 Remove call to improve_range_with_equivs.
10176 (path_range_query::improve_range_with_equivs): Remove
10177 * gimple-range-path.h: Remove improve_range_with_equivs.
10179 2021-09-22 dianhong xu <dianhong.xu@intel.com>
10181 * config/i386/avx512fp16intrin.h:
10182 (_mm512_mask_blend_ph): New intrinsic.
10183 (_mm512_permutex2var_ph): Ditto.
10184 (_mm512_permutexvar_ph): Ditto.
10185 * config/i386/avx512fp16vlintrin.h:
10186 (_mm256_mask_blend_ph): New intrinsic.
10187 (_mm256_permutex2var_ph): Ditto.
10188 (_mm256_permutexvar_ph): Ditto.
10189 (_mm_mask_blend_ph): Ditto.
10190 (_mm_permutex2var_ph): Ditto.
10191 (_mm_permutexvar_ph): Ditto.
10193 2021-09-22 dianhong xu <dianhong.xu@intel.com>
10195 * config/i386/avx512fp16intrin.h: Add new intrinsics.
10196 (_mm512_conj_pch): New intrinsic.
10197 (_mm512_mask_conj_pch): Ditto.
10198 (_mm512_maskz_conj_pch): Ditto.
10199 * config/i386/avx512fp16vlintrin.h: Add new intrinsics.
10200 (_mm256_conj_pch): New intrinsic.
10201 (_mm256_mask_conj_pch): Ditto.
10202 (_mm256_maskz_conj_pch): Ditto.
10203 (_mm_conj_pch): Ditto.
10204 (_mm_mask_conj_pch): Ditto.
10205 (_mm_maskz_conj_pch): Ditto.
10207 2021-09-22 dianhong xu <dianhong.xu@intel.com>
10209 * config/i386/avx512fp16intrin.h (_MM512_REDUCE_OP): New macro
10210 (_mm512_reduce_add_ph): New intrinsic.
10211 (_mm512_reduce_mul_ph): Ditto.
10212 (_mm512_reduce_min_ph): Ditto.
10213 (_mm512_reduce_max_ph): Ditto.
10214 * config/i386/avx512fp16vlintrin.h
10215 (_MM256_REDUCE_OP/_MM_REDUCE_OP): New macro.
10216 (_mm256_reduce_add_ph): New intrinsic.
10217 (_mm256_reduce_mul_ph): Ditto.
10218 (_mm256_reduce_min_ph): Ditto.
10219 (_mm256_reduce_max_ph): Ditto.
10220 (_mm_reduce_add_ph): Ditto.
10221 (_mm_reduce_mul_ph): Ditto.
10222 (_mm_reduce_min_ph): Ditto.
10223 (_mm_reduce_max_ph): Ditto.
10225 2021-09-22 dianhong xu <dianhong.xu@intel.com>
10227 * config/i386/avx512fp16intrin.h (__m512h_u, __m256h_u,
10228 __m128h_u): New typedef.
10229 (_mm512_load_ph): New intrinsic.
10230 (_mm256_load_ph): Ditto.
10231 (_mm_load_ph): Ditto.
10232 (_mm512_loadu_ph): Ditto.
10233 (_mm256_loadu_ph): Ditto.
10234 (_mm_loadu_ph): Ditto.
10235 (_mm512_store_ph): Ditto.
10236 (_mm256_store_ph): Ditto.
10237 (_mm_store_ph): Ditto.
10238 (_mm512_storeu_ph): Ditto.
10239 (_mm256_storeu_ph): Ditto.
10240 (_mm_storeu_ph): Ditto.
10241 (_mm512_abs_ph): Ditto.
10242 * config/i386/avx512fp16vlintrin.h
10243 (_mm_abs_ph): Ditto.
10244 (_mm256_abs_ph): Ditto.
10246 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
10248 * config/s390/tpf.md (prologue_tpf, epilogue_tpf): Add cc clobber.
10250 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
10253 * config/s390/s390.c (s390_expand_insv): Emit a normal move if it
10254 is actually a full copy of the source operand into the target.
10255 Don't emit a strict low part move if source and target mode match.
10257 2021-09-22 Jakub Jelinek <jakub@redhat.com>
10259 PR middle-end/102415
10260 * omp-expand.c (expand_omp_single): If region->exit is NULL,
10261 assert region->entry is GIMPLE_OMP_SCOPE region and return.
10263 2021-09-22 Jakub Jelinek <jakub@redhat.com>
10265 * tree.h (OMP_CLAUSE_ALLOCATE_ALIGN): Define.
10266 * tree.c (omp_clause_num_ops): Change number of OMP_CLAUSE_ALLOCATE
10267 arguments from 2 to 3.
10268 * tree-pretty-print.c (dump_omp_clause): Print allocator() around
10269 allocate clause allocator and print align if present.
10270 * omp-low.c (scan_sharing_clauses): Force allocate_map entry even
10271 for omp_default_mem_alloc if align modifier is present. If align
10272 modifier is present, use TREE_LIST to encode both allocator and
10274 (lower_private_allocate, lower_rec_input_clauses, create_task_copyfn):
10275 Handle align modifier on allocator clause if present.
10277 2021-09-22 liuhongt <hongtao.liu@intel.com>
10279 * config/i386/i386.md (define_attr "isa"): Add
10281 (define_attr "enabled"): Correspond fma_or_avx512vl to
10282 TARGET_FMA || TARGET_AVX512VL.
10283 * config/i386/mmx.md (fmav2sf4): Extend to AVX512 fma.
10285 (fnmav2sf4): Ditto.
10286 (fnmsv2sf4): Ditto.
10288 2021-09-22 liuhongt <hongtao.liu@intel.com>
10290 * config/i386/i386.md (cstorehf3): New define_expand.
10292 2021-09-22 liuhongt <hongtao.liu@intel.com>
10294 * config/i386/i386.md (<rounding_insn>hf2): New expander.
10295 (sse4_1_round<mode>2): Extend from MODEF to MODEFH.
10296 * config/i386/sse.md (*sse4_1_round<ssescalarmodesuffix>):
10297 Extend from VF_128 to VFH_128.
10299 2021-09-22 liuhongt <hongtao.liu@intel.com>
10301 * config/i386/i386-features.c (i386-features.c): Handle
10303 * config/i386/i386.md (sqrthf2): New expander.
10304 (*sqrthf2): New define_insn.
10305 * config/i386/sse.md
10306 (*<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
10309 2021-09-22 liuhongt <hongtao.liu@intel.com>
10311 * config/i386/avx512fp16intrin.h (_mm_mask_fcmadd_sch):
10313 (_mm_mask3_fcmadd_sch): Likewise.
10314 (_mm_maskz_fcmadd_sch): Likewise.
10315 (_mm_fcmadd_sch): Likewise.
10316 (_mm_mask_fmadd_sch): Likewise.
10317 (_mm_mask3_fmadd_sch): Likewise.
10318 (_mm_maskz_fmadd_sch): Likewise.
10319 (_mm_fmadd_sch): Likewise.
10320 (_mm_mask_fcmadd_round_sch): Likewise.
10321 (_mm_mask3_fcmadd_round_sch): Likewise.
10322 (_mm_maskz_fcmadd_round_sch): Likewise.
10323 (_mm_fcmadd_round_sch): Likewise.
10324 (_mm_mask_fmadd_round_sch): Likewise.
10325 (_mm_mask3_fmadd_round_sch): Likewise.
10326 (_mm_maskz_fmadd_round_sch): Likewise.
10327 (_mm_fmadd_round_sch): Likewise.
10328 (_mm_fcmul_sch): Likewise.
10329 (_mm_mask_fcmul_sch): Likewise.
10330 (_mm_maskz_fcmul_sch): Likewise.
10331 (_mm_fmul_sch): Likewise.
10332 (_mm_mask_fmul_sch): Likewise.
10333 (_mm_maskz_fmul_sch): Likewise.
10334 (_mm_fcmul_round_sch): Likewise.
10335 (_mm_mask_fcmul_round_sch): Likewise.
10336 (_mm_maskz_fcmul_round_sch): Likewise.
10337 (_mm_fmul_round_sch): Likewise.
10338 (_mm_mask_fmul_round_sch): Likewise.
10339 (_mm_maskz_fmul_round_sch): Likewise.
10340 * config/i386/i386-builtin.def: Add corresponding new builtins.
10341 * config/i386/sse.md
10342 (avx512fp16_fmaddcsh_v8hf_maskz<round_expand_name>): New expander.
10343 (avx512fp16_fcmaddcsh_v8hf_maskz<round_expand_name>): Ditto.
10344 (avx512fp16_fma_<complexopname>sh_v8hf<mask_scalarcz_name><round_scalarcz_name>):
10346 (avx512fp16_<complexopname>sh_v8hf_mask<round_name>): Ditto.
10347 (avx512fp16_<complexopname>sh_v8hf<mask_scalarc_name><round_scalarcz_name>):
10349 * config/i386/subst.md (mask_scalarcz_name): New.
10350 (mask_scalarc_name): Ditto.
10351 (mask_scalarc_operand3): Ditto.
10352 (mask_scalarcz_operand4): Ditto.
10353 (round_scalarcz_name): Ditto.
10354 (round_scalarc_mask_operand3): Ditto.
10355 (round_scalarcz_mask_operand4): Ditto.
10356 (round_scalarc_mask_op3): Ditto.
10357 (round_scalarcz_mask_op4): Ditto.
10358 (round_scalarcz_constraint): Ditto.
10359 (round_scalarcz_nimm_predicate): Ditto.
10360 (mask_scalarcz): Ditto.
10361 (mask_scalarc): Ditto.
10362 (round_scalarcz): Ditto.
10364 2021-09-22 liuhongt <hongtao.liu@intel.com>
10366 * config/i386/avx512fp16intrin.h (_mm512_fcmadd_pch):
10368 (_mm512_mask_fcmadd_pch): Likewise.
10369 (_mm512_mask3_fcmadd_pch): Likewise.
10370 (_mm512_maskz_fcmadd_pch): Likewise.
10371 (_mm512_fmadd_pch): Likewise.
10372 (_mm512_mask_fmadd_pch): Likewise.
10373 (_mm512_mask3_fmadd_pch): Likewise.
10374 (_mm512_maskz_fmadd_pch): Likewise.
10375 (_mm512_fcmadd_round_pch): Likewise.
10376 (_mm512_mask_fcmadd_round_pch): Likewise.
10377 (_mm512_mask3_fcmadd_round_pch): Likewise.
10378 (_mm512_maskz_fcmadd_round_pch): Likewise.
10379 (_mm512_fmadd_round_pch): Likewise.
10380 (_mm512_mask_fmadd_round_pch): Likewise.
10381 (_mm512_mask3_fmadd_round_pch): Likewise.
10382 (_mm512_maskz_fmadd_round_pch): Likewise.
10383 (_mm512_fcmul_pch): Likewise.
10384 (_mm512_mask_fcmul_pch): Likewise.
10385 (_mm512_maskz_fcmul_pch): Likewise.
10386 (_mm512_fmul_pch): Likewise.
10387 (_mm512_mask_fmul_pch): Likewise.
10388 (_mm512_maskz_fmul_pch): Likewise.
10389 (_mm512_fcmul_round_pch): Likewise.
10390 (_mm512_mask_fcmul_round_pch): Likewise.
10391 (_mm512_maskz_fcmul_round_pch): Likewise.
10392 (_mm512_fmul_round_pch): Likewise.
10393 (_mm512_mask_fmul_round_pch): Likewise.
10394 (_mm512_maskz_fmul_round_pch): Likewise.
10395 * config/i386/avx512fp16vlintrin.h (_mm_fmadd_pch):
10397 (_mm_mask_fmadd_pch): Likewise.
10398 (_mm_mask3_fmadd_pch): Likewise.
10399 (_mm_maskz_fmadd_pch): Likewise.
10400 (_mm256_fmadd_pch): Likewise.
10401 (_mm256_mask_fmadd_pch): Likewise.
10402 (_mm256_mask3_fmadd_pch): Likewise.
10403 (_mm256_maskz_fmadd_pch): Likewise.
10404 (_mm_fcmadd_pch): Likewise.
10405 (_mm_mask_fcmadd_pch): Likewise.
10406 (_mm_mask3_fcmadd_pch): Likewise.
10407 (_mm_maskz_fcmadd_pch): Likewise.
10408 (_mm256_fcmadd_pch): Likewise.
10409 (_mm256_mask_fcmadd_pch): Likewise.
10410 (_mm256_mask3_fcmadd_pch): Likewise.
10411 (_mm256_maskz_fcmadd_pch): Likewise.
10412 (_mm_fmul_pch): Likewise.
10413 (_mm_mask_fmul_pch): Likewise.
10414 (_mm_maskz_fmul_pch): Likewise.
10415 (_mm256_fmul_pch): Likewise.
10416 (_mm256_mask_fmul_pch): Likewise.
10417 (_mm256_maskz_fmul_pch): Likewise.
10418 (_mm_fcmul_pch): Likewise.
10419 (_mm_mask_fcmul_pch): Likewise.
10420 (_mm_maskz_fcmul_pch): Likewise.
10421 (_mm256_fcmul_pch): Likewise.
10422 (_mm256_mask_fcmul_pch): Likewise.
10423 (_mm256_maskz_fcmul_pch): Likewise.
10424 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF_V8HF_V8HF,
10425 V8HF_FTYPE_V16HF_V16HF_V16HF, V16HF_FTYPE_V16HF_V16HF_V16HF_UQI,
10426 V32HF_FTYPE_V32HF_V32HF_V32HF_INT,
10427 V32HF_FTYPE_V32HF_V32HF_V32HF_UHI_INT): Add new builtin types.
10428 * config/i386/i386-builtin.def: Add new builtins.
10429 * config/i386/i386-expand.c: Handle new builtin types.
10430 * config/i386/subst.md (SUBST_CV): New.
10431 (maskc_name): Ditto.
10432 (maskc_operand3): Ditto.
10434 (sdc_maskz_name): Ditto.
10435 (sdc_mask_op4): Ditto.
10436 (sdc_mask_op5): Ditto.
10437 (sdc_mask_mode512bit_condition): Ditto.
10439 (round_maskc_operand3): Ditto.
10440 (round_sdc_mask_operand4): Ditto.
10441 (round_maskc_op3): Ditto.
10442 (round_sdc_mask_op4): Ditto.
10443 (round_saeonly_sdc_mask_operand5): Ditto.
10444 * config/i386/sse.md (unspec): Add complex fma unspecs.
10445 (avx512fmaskcmode): New.
10446 (UNSPEC_COMPLEX_F_C_MA): Ditto.
10447 (UNSPEC_COMPLEX_F_C_MUL): Ditto.
10448 (complexopname): Ditto.
10449 (<avx512>_fmaddc_<mode>_maskz<round_expand_name>): New expander.
10450 (<avx512>_fcmaddc_<mode>_maskz<round_expand_name>): Ditto.
10451 (fma_<complexopname>_<mode><sdc_maskz_name><round_name>): New
10453 (<avx512>_<complexopname>_<mode>_mask<round_name>): Ditto.
10454 (<avx512>_<complexopname>_<mode><maskc_name><round_name>): Ditto.
10456 2021-09-22 Kewen Lin <linkw@linux.ibm.com>
10458 * config/rs6000/rs6000.opt (rs6000-density-pct-threshold,
10459 rs6000-density-size-threshold, rs6000-density-penalty,
10460 rs6000-density-load-pct-threshold,
10461 rs6000-density-load-num-threshold): New parameter.
10462 * config/rs6000/rs6000.c (rs6000_density_test): Adjust with
10463 corresponding parameters.
10465 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10467 * gimple-range-path.cc (path_range_query::defined_outside_path):
10469 (path_range_query::range_on_path_entry): New.
10470 (path_range_query::internal_range_of_expr): Resolve unknowns
10472 (path_range_query::improve_range_with_equivs): New.
10473 (path_range_query::ssa_range_in_phi): Resolve unknowns with
10475 * gimple-range-path.h (class path_range_query): Add
10476 defined_outside_path, range_on_path_entry, and
10477 improve_range_with_equivs.
10479 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10481 * gimple-range-path.cc (path_range_query::add_to_imports): New.
10482 (path_range_query::add_copies_to_imports): New.
10483 (path_range_query::precompute_ranges): Call
10484 add_copies_to_imports.
10485 * gimple-range-path.h (class path_range_query): Add prototypes
10486 for add_copies_to_imports and add_to_imports.
10488 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10490 * gimple-range-path.cc (path_range_query::range_defined_in_block):
10491 Remove useless code.
10493 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10495 * gimple-range-fold.h (class fur_source): Make oracle protected.
10496 * gimple-range-path.cc (path_range_query::path_range_query): Add
10497 resolve argument. Initialize oracle.
10498 (path_range_query::~path_range_query): Delete oracle.
10499 (path_range_query::range_of_stmt): Adapt to use relations.
10500 (path_range_query::precompute_ranges): Pre-compute relations.
10501 (class jt_fur_source): New
10502 (jt_fur_source::jt_fur_source): New.
10503 (jt_fur_source::register_relation): New.
10504 (jt_fur_source::query_relation): New.
10505 (path_range_query::precompute_relations): New.
10506 (path_range_query::precompute_phi_relations): New.
10507 * gimple-range-path.h (path_range_query): Add resolve argument.
10508 Add oracle, precompute_relations, precompute_phi_relations.
10509 * tree-ssa-threadbackward.c (back_threader::back_threader): Pass
10510 resolve argument to solver.
10512 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10514 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
10515 Rename postfold_gcond_edges to register_outgoing_edges and
10517 (fold_using_range::postfold_gcond_edges): Rename...
10518 (fur_source::register_outgoing_edges): ...to this.
10519 * gimple-range-fold.h (postfold_gcond_edges): Rename to
10520 register_outgoing_edges and move to fur_source.
10522 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10524 * gimple-range-fold.cc (fold_using_range::range_of_phi): Check
10525 dom_info_available_p.
10527 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10529 * gimple-range-cache.cc (non_null_ref::non_null_ref): Use create
10530 and quick_grow_cleared instead of safe_grow_cleared.
10532 2021-09-21 Thomas Schwinge <thomas@codesourcery.com>
10535 * omp-oacc-neuter-broadcast.cc (oacc_do_neutering): Evaluate
10536 'random ()' to '0'.
10538 2021-09-21 Richard Earnshaw <rearnsha@arm.com>
10540 * configure.ac: Detect when the assembler supports new-style
10541 architecture extensions.
10542 * common/config/arm/arm-common.c (arm_rewrite_mcpu): Return
10543 the full CPU string if the assembler can grok it.
10544 (arm_rewrite_march): Likewise but for the architecture.
10545 * config.in: Regenerate.
10546 * configure: Regenerate.
10548 2021-09-21 Richard Biener <rguenther@suse.de>
10550 PR tree-optimization/102421
10551 * tree-vect-loop.c (vect_dissolve_slp_only_groups): Copy and
10552 adjust alignment info.
10554 2021-09-21 Kewen Lin <linkw@linux.ibm.com>
10556 * ipa-fnsummary.c (ipa_fn_summary_write): Remove inconsistent
10557 bitfield stream out.
10559 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
10561 * gimple-range-fold.cc (fold_using_range::range_of_phi): Ignore
10562 undefined edges, apply an equivalence if appropriate.
10563 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Return
10564 UNDEFINED if EDGE_EXECUTABLE is not set.
10565 * gimple-range.cc (gimple_ranger::gimple_ranger): Set all edges
10566 as EXECUTABLE upon startup.
10567 (gimple_ranger::range_on_edge): Return UNDEFINED for edges without
10568 EDGE_EXECUTABLE set.
10569 * vr-values.c (set_and_propagate_unexecutable): New.
10570 (simplify_using_ranges::fold_cond): Call set_and_propagate.
10571 (simplify_using_ranges::simplify_switch_using_ranges): Ditto.
10572 * vr-values.h: Add prototype.
10574 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
10576 * value-relation.cc (equiv_oracle::register_initial_def): New.
10577 (equiv_oracle::register_relation): Call register_initial_def.
10578 (equiv_oracle::add_equiv_to_block): New. Split register_relation.
10579 (relation_oracle::register_stmt): Check def block of PHI arguments.
10580 * value-relation.h (equiv_oracle): Add new prototypes.
10582 2021-09-20 Matthias Kretz <m.kretz@gsi.de>
10584 * cppbuiltin.c (define_builtin_macros_for_compilation_flags):
10585 Define __RECIPROCAL_MATH__, __NO_SIGNED_ZEROS__,
10586 __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, and
10587 __ROUNDING_MATH__ according to their corresponding flags.
10588 * doc/cpp.texi: Document __RECIPROCAL_MATH__,
10589 __NO_SIGNED_ZEROS__, __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__,
10590 and __ROUNDING_MATH__.
10592 2021-09-20 Richard Biener <rguenther@suse.de>
10594 * tree-vect-stmts.c (vectorizable_load): Use the vectype
10597 2021-09-20 Richard Biener <rguenther@suse.de>
10599 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
10600 Do not compute alignment of the vectorized access here.
10602 2021-09-20 Richard Biener <rguenther@suse.de>
10604 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
10605 Store -1 for runtime alias peeling iterations.
10607 2021-09-20 Richard Biener <rguenther@suse.de>
10609 * config.gcc: Obsolete hppa[12]*-*-hpux10* and hppa[12]*-*-hpux11*.
10611 2021-09-20 Thomas Schwinge <thomas@codesourcery.com>
10613 * input.c (string_concat_db::record_string_concatenation)
10614 (string_concat_db::get_string_concatenation): Skip for
10615 'RESERVED_LOCATION_P'.
10617 2021-09-20 Richard Biener <rguenther@suse.de>
10619 PR tree-optimization/65206
10620 * tree-data-ref.h (struct data_reference): Add alt_indices,
10622 * tree-data-ref.c (free_data_ref): Release alt_indices.
10623 (dr_analyze_indices): Work on struct indices and get DR_REF as tree.
10624 (create_data_ref): Adjust.
10625 (initialize_data_dependence_relation): Split into head
10626 and tail. When the base objects fail to match up try
10627 again with pointer-based analysis of indices.
10628 * tree-vectorizer.c (vec_info_shared::check_datarefs): Do
10629 not compare the lazily computed alternate set of indices.
10631 2021-09-20 Iain Sandoe <iain@sandoe.co.uk>
10633 * gcc.c: Test for execute OK when we find the
10634 programs for assembler linker and dsymutil and those
10635 were specified at configure-time.
10637 2021-09-19 Martin Sebor <msebor@redhat.com>
10639 PR middle-end/102403
10640 * gimple-predicate-analysis.cc (predicate::init_from_control_deps):
10641 Correct a function pre/postcondition.
10643 2021-09-19 Martin Sebor <msebor@redhat.com>
10645 PR middle-end/102243
10646 * tree-ssa-strlen.c (get_range): Handle null cfun.
10648 2021-09-19 Iain Sandoe <iain@sandoe.co.uk>
10650 * config/darwin.h (LINK_COMMAND_SPEC_A): Use Darwin10
10651 unwinder shim as a convenience library.
10653 2021-09-19 Andrew Pinski <apinski@marvell.com>
10655 * doc/install.texi: Add note about
10656 binutils 2.35 is required for LTO usage.
10658 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
10660 * tree-ssa-threadbackward.c
10661 (back_threader_registry::register_path): Use push_edge.
10662 * tree-ssa-threadedge.c
10663 (jump_threader::thread_around_empty_blocks): Same.
10664 (jump_threader::thread_through_normal_block): Same.
10665 (jump_threader::thread_across_edge): Same. Also, use auto_bitmap.
10667 * tree-ssa-threadupdate.c
10668 (jt_path_registry::allocate_thread_edge): Remove.
10669 (jt_path_registry::push_edge): New.
10670 (dump_jump_thread_path): Make static.
10671 * tree-ssa-threadupdate.h (allocate_thread_edge): Remove.
10674 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
10676 * gimple-range-path.cc (path_range_query::path_range_query): Add
10678 (path_range_query::dump): Remove extern declaration of dump_ranger.
10679 * gimple-range-trace.cc (dump_ranger): Add DEBUG_FUNCTION marker.
10680 * gimple-range-trace.h (dump_ranger): Add prototype.
10682 2021-09-19 John Ericson <git@JohnEricson.me>
10684 * gcc.c (find_a_program): New function, factored out of...
10685 (find_a_file): Here.
10686 (execute): Use find_a_program when looking for programs rather
10689 2021-09-19 Matwey V. Kornilov <matwey.kornilov@gmail.com>
10691 * config/avr/avr-mcus.def: Add atmega324pb.
10692 * doc/avr-mmcu.texi: Corresponding changes.
10694 2021-09-19 Roger Sayle <roger@nextmovesoftware.com>
10696 PR middle-end/88173
10697 * match.pd (cmp @0 REAL_CST@1): When @0 is also REAL_CST, apply
10698 the same transformations as to @1. For comparisons against NaN,
10699 don't check HONOR_SNANS but confirm that neither operand is a
10702 2021-09-19 Benjamin Peterson <benjamin@locrian.net>
10704 * attribs.c (make_unique_name): Delete.
10705 * attribs.h (make_unique_name): Delete.
10707 2021-09-19 Andrew Pinski <apinski@marvell.com>
10709 * lra-constraints.c (check_and_process_move): Assert
10710 that dclass and sclass are greater than or equal to NO_REGS.
10712 2021-09-18 Jakub Jelinek <jakub@redhat.com>
10714 * tree.h (OMP_CLAUSE_ORDER_UNCONSTRAINED): Define.
10715 * tree-pretty-print.c (dump_omp_clause): Print unconstrained:
10716 for OMP_CLAUSE_ORDER_UNCONSTRAINED.
10718 2021-09-18 liuhongt <hongtao.liu@intel.com>
10720 * config/i386/i386-features.c (remove_partial_avx_dependency):
10721 Restrict TARGET_USE_VECTOR_FP_CONVERTS and
10722 TARGET_USE_VECTOR_CONVERTS to conversion instructions only.
10724 2021-09-18 Jakub Jelinek <jakub@redhat.com>
10726 * gimplify.c (omp_default_clause): For C/C++ default({,first}private),
10727 if file/namespace scope variable doesn't have predetermined sharing,
10728 treat it as if there was default(none).
10730 2021-09-18 liuhongt <hongtao.liu@intel.com>
10732 * config/i386/avx512fp16intrin.h (_mm_fmadd_sh):
10734 (_mm_mask_fmadd_sh): Likewise.
10735 (_mm_mask3_fmadd_sh): Likewise.
10736 (_mm_maskz_fmadd_sh): Likewise.
10737 (_mm_fmadd_round_sh): Likewise.
10738 (_mm_mask_fmadd_round_sh): Likewise.
10739 (_mm_mask3_fmadd_round_sh): Likewise.
10740 (_mm_maskz_fmadd_round_sh): Likewise.
10741 (_mm_fnmadd_sh): Likewise.
10742 (_mm_mask_fnmadd_sh): Likewise.
10743 (_mm_mask3_fnmadd_sh): Likewise.
10744 (_mm_maskz_fnmadd_sh): Likewise.
10745 (_mm_fnmadd_round_sh): Likewise.
10746 (_mm_mask_fnmadd_round_sh): Likewise.
10747 (_mm_mask3_fnmadd_round_sh): Likewise.
10748 (_mm_maskz_fnmadd_round_sh): Likewise.
10749 (_mm_fmsub_sh): Likewise.
10750 (_mm_mask_fmsub_sh): Likewise.
10751 (_mm_mask3_fmsub_sh): Likewise.
10752 (_mm_maskz_fmsub_sh): Likewise.
10753 (_mm_fmsub_round_sh): Likewise.
10754 (_mm_mask_fmsub_round_sh): Likewise.
10755 (_mm_mask3_fmsub_round_sh): Likewise.
10756 (_mm_maskz_fmsub_round_sh): Likewise.
10757 (_mm_fnmsub_sh): Likewise.
10758 (_mm_mask_fnmsub_sh): Likewise.
10759 (_mm_mask3_fnmsub_sh): Likewise.
10760 (_mm_maskz_fnmsub_sh): Likewise.
10761 (_mm_fnmsub_round_sh): Likewise.
10762 (_mm_mask_fnmsub_round_sh): Likewise.
10763 (_mm_mask3_fnmsub_round_sh): Likewise.
10764 (_mm_maskz_fnmsub_round_sh): Likewise.
10765 * config/i386/i386-builtin-types.def
10766 (V8HF_FTYPE_V8HF_V8HF_V8HF_UQI_INT): New builtin type.
10767 * config/i386/i386-builtin.def: Add new builtins.
10768 * config/i386/i386-expand.c: Handle new builtin type.
10769 * config/i386/sse.md (fmai_vmfmadd_<mode><round_name>):
10770 Ajdust to support FP16.
10771 (fmai_vmfmsub_<mode><round_name>): Ditto.
10772 (fmai_vmfnmadd_<mode><round_name>): Ditto.
10773 (fmai_vmfnmsub_<mode><round_name>): Ditto.
10774 (*fmai_fmadd_<mode>): Ditto.
10775 (*fmai_fmsub_<mode>): Ditto.
10776 (*fmai_fnmadd_<mode><round_name>): Ditto.
10777 (*fmai_fnmsub_<mode><round_name>): Ditto.
10778 (avx512f_vmfmadd_<mode>_mask<round_name>): Ditto.
10779 (avx512f_vmfmadd_<mode>_mask3<round_name>): Ditto.
10780 (avx512f_vmfmadd_<mode>_maskz<round_expand_name>): Ditto.
10781 (avx512f_vmfmadd_<mode>_maskz_1<round_name>): Ditto.
10782 (*avx512f_vmfmsub_<mode>_mask<round_name>): Ditto.
10783 (avx512f_vmfmsub_<mode>_mask3<round_name>): Ditto.
10784 (*avx512f_vmfmsub_<mode>_maskz_1<round_name>): Ditto.
10785 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
10786 (*avx512f_vmfnmsub_<mode>_mask3<round_name>): Ditto.
10787 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
10788 (*avx512f_vmfnmadd_<mode>_mask<round_name>): Renamed to ...
10789 (avx512f_vmfnmadd_<mode>_mask<round_name>) ... this, and
10790 adjust to support FP16.
10791 (avx512f_vmfnmadd_<mode>_mask3<round_name>): Ditto.
10792 (avx512f_vmfnmadd_<mode>_maskz_1<round_name>): Ditto.
10793 (avx512f_vmfnmadd_<mode>_maskz<round_expand_name>): New
10796 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
10798 * config/i386/sse.md (avx512fmaskmodelower): Extend to support
10800 (maskload<mode><avx512fmaskmodelower>): Ditto.
10801 (maskstore<mode><avx512fmaskmodelower>): Ditto.
10803 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
10805 * config/i386/i386-expand.c (ix86_expand_fp_absneg_operator):
10807 (ix86_expand_copysign): Ditto.
10808 (ix86_expand_xorsign): Ditto.
10809 * config/i386/i386.c (ix86_build_const_vector): Handle HF vector
10811 (ix86_build_signbit_mask): Ditto.
10812 (ix86_can_change_mode_class): Ditto.
10813 * config/i386/i386.md
10814 (SSEMODEF): Add HFmode.
10815 (ssevecmodef): Ditto.
10816 (<code>hf2): New define_expand.
10817 (*<code>hf2_1): New define_insn_and_split.
10818 (copysign<mode>): Extend to support HFmode under AVX512FP16.
10819 (xorsign<mode>): Ditto.
10820 * config/i386/sse.md (VFB): New mode iterator.
10821 (VFB_128_256): Ditto.
10823 (sseintvecmode2): Support HF vector mode.
10824 (<code><mode>2): Use new mode iterator.
10825 (*<code><mode>2): Ditto.
10826 (copysign<mode>3): Ditto.
10827 (xorsign<mode>3): Ditto.
10828 (<code><mode>3<mask_name>): Ditto.
10829 (<code><mode>3<mask_name>): Ditto.
10830 (<sse>_andnot<mode>3<mask_name>): Adjust for HF vector mode.
10831 (<sse>_andnot<mode>3<mask_name>): Ditto.
10832 (*<code><mode>3<mask_name>): Ditto.
10833 (*<code><mode>3<mask_name>): Ditto.
10835 2021-09-18 liuhongt <hongtao.liu@intel.com>
10837 * config/i386/avx512fp16intrin.h (_mm512_mask_fmadd_ph):
10839 (_mm512_mask3_fmadd_ph): Likewise.
10840 (_mm512_maskz_fmadd_ph): Likewise.
10841 (_mm512_fmadd_round_ph): Likewise.
10842 (_mm512_mask_fmadd_round_ph): Likewise.
10843 (_mm512_mask3_fmadd_round_ph): Likewise.
10844 (_mm512_maskz_fmadd_round_ph): Likewise.
10845 (_mm512_fnmadd_ph): Likewise.
10846 (_mm512_mask_fnmadd_ph): Likewise.
10847 (_mm512_mask3_fnmadd_ph): Likewise.
10848 (_mm512_maskz_fnmadd_ph): Likewise.
10849 (_mm512_fnmadd_round_ph): Likewise.
10850 (_mm512_mask_fnmadd_round_ph): Likewise.
10851 (_mm512_mask3_fnmadd_round_ph): Likewise.
10852 (_mm512_maskz_fnmadd_round_ph): Likewise.
10853 (_mm512_fmsub_ph): Likewise.
10854 (_mm512_mask_fmsub_ph): Likewise.
10855 (_mm512_mask3_fmsub_ph): Likewise.
10856 (_mm512_maskz_fmsub_ph): Likewise.
10857 (_mm512_fmsub_round_ph): Likewise.
10858 (_mm512_mask_fmsub_round_ph): Likewise.
10859 (_mm512_mask3_fmsub_round_ph): Likewise.
10860 (_mm512_maskz_fmsub_round_ph): Likewise.
10861 (_mm512_fnmsub_ph): Likewise.
10862 (_mm512_mask_fnmsub_ph): Likewise.
10863 (_mm512_mask3_fnmsub_ph): Likewise.
10864 (_mm512_maskz_fnmsub_ph): Likewise.
10865 (_mm512_fnmsub_round_ph): Likewise.
10866 (_mm512_mask_fnmsub_round_ph): Likewise.
10867 (_mm512_mask3_fnmsub_round_ph): Likewise.
10868 (_mm512_maskz_fnmsub_round_ph): Likewise.
10869 * config/i386/avx512fp16vlintrin.h (_mm256_fmadd_ph):
10871 (_mm256_mask_fmadd_ph): Likewise.
10872 (_mm256_mask3_fmadd_ph): Likewise.
10873 (_mm256_maskz_fmadd_ph): Likewise.
10874 (_mm_fmadd_ph): Likewise.
10875 (_mm_mask_fmadd_ph): Likewise.
10876 (_mm_mask3_fmadd_ph): Likewise.
10877 (_mm_maskz_fmadd_ph): Likewise.
10878 (_mm256_fnmadd_ph): Likewise.
10879 (_mm256_mask_fnmadd_ph): Likewise.
10880 (_mm256_mask3_fnmadd_ph): Likewise.
10881 (_mm256_maskz_fnmadd_ph): Likewise.
10882 (_mm_fnmadd_ph): Likewise.
10883 (_mm_mask_fnmadd_ph): Likewise.
10884 (_mm_mask3_fnmadd_ph): Likewise.
10885 (_mm_maskz_fnmadd_ph): Likewise.
10886 (_mm256_fmsub_ph): Likewise.
10887 (_mm256_mask_fmsub_ph): Likewise.
10888 (_mm256_mask3_fmsub_ph): Likewise.
10889 (_mm256_maskz_fmsub_ph): Likewise.
10890 (_mm_fmsub_ph): Likewise.
10891 (_mm_mask_fmsub_ph): Likewise.
10892 (_mm_mask3_fmsub_ph): Likewise.
10893 (_mm_maskz_fmsub_ph): Likewise.
10894 (_mm256_fnmsub_ph): Likewise.
10895 (_mm256_mask_fnmsub_ph): Likewise.
10896 (_mm256_mask3_fnmsub_ph): Likewise.
10897 (_mm256_maskz_fnmsub_ph): Likewise.
10898 (_mm_fnmsub_ph): Likewise.
10899 (_mm_mask_fnmsub_ph): Likewise.
10900 (_mm_mask3_fnmsub_ph): Likewise.
10901 (_mm_maskz_fnmsub_ph): Likewise.
10902 * config/i386/i386-builtin.def: Add corresponding new builtins.
10903 * config/i386/sse.md
10904 (<avx512>_fmadd_<mode>_maskz<round_expand_name>): Adjust to
10905 support HF vector modes.
10906 (<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name><round_name>):
10908 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
10909 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
10910 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
10911 (<avx512>_fmadd_<mode>_mask<round_name>): Ditto.
10912 (<avx512>_fmadd_<mode>_mask3<round_name>): Ditto.
10913 (<avx512>_fmsub_<mode>_maskz<round_expand_name>): Ditto.
10914 (<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name><round_name>):
10916 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
10917 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
10918 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
10919 (<avx512>_fmsub_<mode>_mask<round_name>): Ditto.
10920 (<avx512>_fmsub_<mode>_mask3<round_name>): Ditto.
10921 (<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name><round_name>):
10923 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
10924 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
10925 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
10926 (<avx512>_fnmadd_<mode>_mask<round_name>): Ditto.
10927 (<avx512>_fnmadd_<mode>_mask3<round_name>): Ditto.
10928 (<avx512>_fnmsub_<mode>_maskz<round_expand_name>): Ditto.
10929 (<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name><round_name>):
10931 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
10932 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
10933 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
10934 (<avx512>_fnmsub_<mode>_mask<round_name>): Ditto.
10935 (<avx512>_fnmsub_<mode>_mask3<round_name>): Ditto.
10937 2021-09-18 liuhongt <hongtao.liu@intel.com>
10939 * config/i386/avx512fp16intrin.h (_mm512_fmaddsub_ph):
10941 (_mm512_mask_fmaddsub_ph): Likewise.
10942 (_mm512_mask3_fmaddsub_ph): Likewise.
10943 (_mm512_maskz_fmaddsub_ph): Likewise.
10944 (_mm512_fmaddsub_round_ph): Likewise.
10945 (_mm512_mask_fmaddsub_round_ph): Likewise.
10946 (_mm512_mask3_fmaddsub_round_ph): Likewise.
10947 (_mm512_maskz_fmaddsub_round_ph): Likewise.
10948 (_mm512_mask_fmsubadd_ph): Likewise.
10949 (_mm512_mask3_fmsubadd_ph): Likewise.
10950 (_mm512_maskz_fmsubadd_ph): Likewise.
10951 (_mm512_fmsubadd_round_ph): Likewise.
10952 (_mm512_mask_fmsubadd_round_ph): Likewise.
10953 (_mm512_mask3_fmsubadd_round_ph): Likewise.
10954 (_mm512_maskz_fmsubadd_round_ph): Likewise.
10955 * config/i386/avx512fp16vlintrin.h (_mm256_fmaddsub_ph):
10957 (_mm256_mask_fmaddsub_ph): Likewise.
10958 (_mm256_mask3_fmaddsub_ph): Likewise.
10959 (_mm256_maskz_fmaddsub_ph): Likewise.
10960 (_mm_fmaddsub_ph): Likewise.
10961 (_mm_mask_fmaddsub_ph): Likewise.
10962 (_mm_mask3_fmaddsub_ph): Likewise.
10963 (_mm_maskz_fmaddsub_ph): Likewise.
10964 (_mm256_fmsubadd_ph): Likewise.
10965 (_mm256_mask_fmsubadd_ph): Likewise.
10966 (_mm256_mask3_fmsubadd_ph): Likewise.
10967 (_mm256_maskz_fmsubadd_ph): Likewise.
10968 (_mm_fmsubadd_ph): Likewise.
10969 (_mm_mask_fmsubadd_ph): Likewise.
10970 (_mm_mask3_fmsubadd_ph): Likewise.
10971 (_mm_maskz_fmsubadd_ph): Likewise.
10972 * config/i386/i386-builtin.def: Add corresponding new builtins.
10973 * config/i386/sse.md (VFH_SF_AVX512VL): New mode iterator.
10974 * (<avx512>_fmsubadd_<mode>_maskz<round_expand_name>): New expander.
10975 * (<avx512>_fmaddsub_<mode>_maskz<round_expand_name>): Use
10977 * (<sd_mask_codefor>fma_fmaddsub_<mode><sd_maskz_name><round_name>):
10979 * (<avx512>_fmaddsub_<mode>_mask<round_name>): Ditto.
10980 * (<avx512>_fmaddsub_<mode>_mask3<round_name>): Ditto.
10981 * (<sd_mask_codefor>fma_fmsubadd_<mode><sd_maskz_name><round_name>):
10983 * (<avx512>_fmsubadd_<mode>_mask<round_name>): Ditto.
10984 * (<avx512>_fmsubadd_<mode>_mask3<round_name>): Ditto.
10986 2021-09-18 liuhongt <hongtao.liu@intel.com>
10989 * config/i386/i386.c (ix86_print_operand): Handle
10990 V8HF/V16HF/V32HFmode.
10991 * config/i386/i386.h (VALID_BCST_MODE_P): Add HFmode.
10992 * config/i386/sse.md (avx512bcst): Remove.
10994 2021-09-17 Martin Sebor <msebor@redhat.com>
10996 * Makefile.in (OBJS): Add gimple-predicate-analysis.o.
10997 * tree-ssa-uninit.c (max_phi_args): Move to gimple-predicate-analysis.
10998 (MASK_SET_BIT, MASK_TEST_BIT, MASK_EMPTY): Same.
10999 (check_defs): Add comment.
11000 (can_skip_redundant_opnd): Update comment.
11001 (compute_uninit_opnds_pos): Adjust to namespace change.
11002 (find_pdom): Move to gimple-predicate-analysis.cc.
11004 (struct uninit_undef_val_t): New.
11005 (is_non_loop_exit_postdominating): Move to gimple-predicate-analysis.cc.
11006 (find_control_equiv_block): Same.
11007 (MAX_NUM_CHAINS, MAX_CHAIN_LEN, MAX_POSTDOM_CHECK): Same.
11008 (MAX_SWITCH_CASES): Same.
11009 (compute_control_dep_chain): Same.
11010 (find_uninit_use): Use predicate analyzer.
11011 (struct pred_info): Move to gimple-predicate-analysis.
11012 (convert_control_dep_chain_into_preds): Same.
11013 (find_predicates): Same.
11014 (collect_phi_def_edges): Same.
11015 (warn_uninitialized_phi): Use predicate analyzer.
11016 (find_def_preds): Move to gimple-predicate-analysis.
11017 (dump_pred_info): Same.
11018 (dump_pred_chain): Same.
11019 (dump_predicates): Same.
11020 (destroy_predicate_vecs): Remove.
11021 (execute_late_warn_uninitialized): New.
11022 (get_cmp_code): Move to gimple-predicate-analysis.
11023 (is_value_included_in): Same.
11024 (value_sat_pred_p): Same.
11025 (find_matching_predicate_in_rest_chains): Same.
11026 (is_use_properly_guarded): Same.
11027 (prune_uninit_phi_opnds): Same.
11028 (find_var_cmp_const): Same.
11029 (use_pred_not_overlap_with_undef_path_pred): Same.
11030 (pred_equal_p): Same.
11031 (is_neq_relop_p): Same.
11032 (is_neq_zero_form_p): Same.
11033 (pred_expr_equal_p): Same.
11034 (is_pred_expr_subset_of): Same.
11035 (is_pred_chain_subset_of): Same.
11036 (is_included_in): Same.
11037 (is_superset_of): Same.
11038 (pred_neg_p): Same.
11039 (simplify_pred): Same.
11040 (simplify_preds_2): Same.
11041 (simplify_preds_3): Same.
11042 (simplify_preds_4): Same.
11043 (simplify_preds): Same.
11045 (push_to_worklist): Same.
11046 (get_pred_info_from_cmp): Same.
11047 (is_degenerated_phi): Same.
11048 (normalize_one_pred_1): Same.
11049 (normalize_one_pred): Same.
11050 (normalize_one_pred_chain): Same.
11051 (normalize_preds): Same.
11052 (can_one_predicate_be_invalidated_p): Same.
11053 (can_chain_union_be_invalidated_p): Same.
11054 (uninit_uses_cannot_happen): Same.
11055 (pass_late_warn_uninitialized::execute): Define.
11056 * gimple-predicate-analysis.cc: New file.
11057 * gimple-predicate-analysis.h: New file.
11059 2021-09-17 Julian Brown <julian@codesourcery.com>
11061 * config/gcn/gcn.c (gimple.h): Include.
11062 (gcn_fork_join): Emit barrier for worker-level joins.
11063 * omp-oacc-neuter-broadcast.cc (find_local_vars_to_propagate): Add
11064 writes_gang_private bitmap parameter. Set bit for blocks
11065 containing gang-private variable writes.
11066 (worker_single_simple): Don't emit barrier after predicated block.
11067 (worker_single_copy): Don't emit barrier if we're not broadcasting
11068 anything and the block contains no gang-private writes.
11069 (neuter_worker_single): Don't predicate blocks that only contain
11070 NOPs or internal marker functions. Pass has_gang_private_write
11071 argument to worker_single_copy.
11072 (oacc_do_neutering): Add writes_gang_private bitmap handling.
11074 2021-09-17 Julian Brown <julian@codesourcery.com>
11076 * config/gcn/gcn-protos.h
11077 (gcn_goacc_create_worker_broadcast_record): Update prototype.
11078 * config/gcn/gcn-tree.c (gcn_goacc_get_worker_red_decl): Use
11079 preallocated block of LDS memory. Do not cache/share decls for
11080 reduction temporaries between invocations.
11081 (gcn_goacc_reduction_teardown): Unshare VAR on second use.
11082 (gcn_goacc_create_worker_broadcast_record): Add OFFSET parameter
11083 and return temporary LDS space at that offset. Return pointer in
11085 * config/gcn/gcn.c (acc_lds_size, gang_private_hwm, lds_allocs):
11087 (ACC_LDS_SIZE): Define as acc_lds_size.
11088 (gcn_init_machine_status): Don't initialise lds_allocated,
11089 lds_allocs, reduc_decls fields of machine function struct.
11090 (gcn_option_override): Handle default size for gang-private
11091 variables and -mgang-private-size option.
11092 (gcn_expand_prologue): Use LDS_SIZE instead of LDS_SIZE-1 when
11093 initialising M0_REG.
11094 (gcn_shared_mem_layout): New function.
11095 (gcn_print_lds_decl): Update comment. Use global lds_allocs map and
11096 gang_private_hwm variable.
11097 (TARGET_GOACC_SHARED_MEM_LAYOUT): Define target hook.
11098 * config/gcn/gcn.h (machine_function): Remove lds_allocated,
11099 lds_allocs, reduc_decls. Add reduction_base, reduction_limit.
11100 * config/gcn/gcn.opt (gang_private_size_opt): New global.
11101 (mgang-private-size=): New option.
11102 * doc/tm.texi.in (TARGET_GOACC_SHARED_MEM_LAYOUT): Place
11103 documentation hook.
11104 * doc/tm.texi: Regenerate.
11105 * omp-oacc-neuter-broadcast.cc (targhooks.h, diagnostic-core.h):
11107 (build_sender_ref): Handle sender_decl being pointer.
11108 (worker_single_copy): Add PLACEMENT and ISOLATE_BROADCASTS
11109 parameters. Pass placement argument to
11110 create_worker_broadcast_record hook invocations. Handle
11111 sender_decl being pointer and isolate_broadcasts inserting extra
11113 (blk_offset_map_t): Add typedef.
11114 (neuter_worker_single): Add BLK_OFFSET_MAP parameter. Pass
11115 preallocated range to worker_single_copy call.
11116 (dfs_broadcast_reachable_1): New function.
11117 (idx_decl_pair_t, used_range_vec_t): New typedefs.
11118 (sort_size_descending): New function.
11119 (addr_range): New class.
11120 (splay_tree_compare_addr_range, splay_tree_free_key)
11121 (first_fit_range, merge_ranges_1, merge_ranges): New functions.
11122 (execute_omp_oacc_neuter_broadcast): Rename to...
11123 (oacc_do_neutering): ... this. Add BOUNDS_LO, BOUNDS_HI
11124 parameters. Arrange layout of shared memory for broadcast
11126 (execute_omp_oacc_neuter_broadcast): New function.
11127 (pass_omp_oacc_neuter_broadcast::gate): Remove num_workers==1
11128 handling from here. Enable pass for all OpenACC routines in order
11129 to call shared memory-layout hook.
11130 * target.def (create_worker_broadcast_record): Add OFFSET
11132 (shared_mem_layout): New hook.
11134 2021-09-17 Julian Brown <julian@codesourcery.com>
11135 Thomas Schwinge <thomas@codesourcery.com>
11137 * omp-oacc-neuter-broadcast.cc
11138 (pass_omp_oacc_neuter_broadcast::gate): Disable if num_workers is
11140 (execute_omp_oacc_neuter_broadcast): Adjust.
11142 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
11144 * value-relation.cc (class equiv_chain): Move to header file.
11145 (path_oracle::path_oracle): New.
11146 (path_oracle::~path_oracle): New.
11147 (path_oracle::register_relation): New.
11148 (path_oracle::query_relation): New.
11149 (path_oracle::reset_path): New.
11150 (path_oracle::dump): New.
11151 * value-relation.h (class equiv_chain): Move to here.
11152 (class path_oracle): New.
11154 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
11156 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a DOM
11158 * gimple-range-fold.cc (fur_depend::register_relation): Use
11159 register_stmt/edge routines.
11160 * value-relation.cc (equiv_chain::find): Relocate from equiv_oracle.
11161 (equiv_oracle::equiv_oracle): Create self equivalence cache.
11162 (equiv_oracle::~equiv_oracle): Release same.
11163 (equiv_oracle::equiv_set): Return entry from self equiv cache if there
11164 are no equivalences.
11165 (equiv_oracle::find_equiv_block): Move list find to equiv_chain.
11166 (equiv_oracle::register_relation): Rename from register_equiv.
11167 (relation_chain_head::find_relation): Relocate from dom_oracle.
11168 (relation_oracle::register_stmt): New.
11169 (relation_oracle::register_edge): New.
11170 (dom_oracle::*): Rename from relation_oracle.
11171 (dom_oracle::register_relation): Adjust to call equiv_oracle.
11172 (dom_oracle::set_one_relation): Split from register_relation.
11173 (dom_oracle::register_transitives): Consolidate 2 methods.
11174 (dom_oracle::find_relation_block): Move core to relation_chain.
11175 (dom_oracle::query_relation): Rename from find_relation_dom and adjust.
11176 * value-relation.h (class relation_oracle): New pure virtual base.
11177 (class equiv_oracle): Inherit from relation_oracle and adjust.
11178 (class dom_oracle): Rename from old relation_oracle and adjust.
11180 2021-09-17 Martin Sebor <msebor@redhat.com>
11182 PR middle-end/102200
11183 * pointer-query.cc (access_ref::inform_access): Handle MIN/MAX_EXPR.
11184 (handle_min_max_size): Change argument. Store original SSA_NAME for
11185 operands to potentially distinct (sub)objects.
11186 (compute_objsize_r): Adjust call to the above.
11188 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
11190 * config/rs6000/rs6000.c (rs6000-builtins.h): New include.
11191 (rs6000_new_builtin_vectorized_function): New function.
11192 (rs6000_new_builtin_md_vectorized_function): Likewise.
11193 (rs6000_builtin_vectorized_function): Call
11194 rs6000_new_builtin_vectorized_function.
11195 (rs6000_builtin_md_vectorized_function): Call
11196 rs6000_new_builtin_md_vectorized_function.
11198 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
11200 * config/rs6000/rs6000-builtin-new.def (ASSEMBLE_ACC): Add mmaint flag.
11201 (ASSEMBLE_PAIR): Likewise.
11202 (BUILD_ACC): Likewise.
11203 (DISASSEMBLE_ACC): Likewise.
11204 (DISASSEMBLE_PAIR): Likewise.
11205 (PMXVBF16GER2): Likewise.
11206 (PMXVBF16GER2NN): Likewise.
11207 (PMXVBF16GER2NP): Likewise.
11208 (PMXVBF16GER2PN): Likewise.
11209 (PMXVBF16GER2PP): Likewise.
11210 (PMXVF16GER2): Likewise.
11211 (PMXVF16GER2NN): Likewise.
11212 (PMXVF16GER2NP): Likewise.
11213 (PMXVF16GER2PN): Likewise.
11214 (PMXVF16GER2PP): Likewise.
11215 (PMXVF32GER): Likewise.
11216 (PMXVF32GERNN): Likewise.
11217 (PMXVF32GERNP): Likewise.
11218 (PMXVF32GERPN): Likewise.
11219 (PMXVF32GERPP): Likewise.
11220 (PMXVF64GER): Likewise.
11221 (PMXVF64GERNN): Likewise.
11222 (PMXVF64GERNP): Likewise.
11223 (PMXVF64GERPN): Likewise.
11224 (PMXVF64GERPP): Likewise.
11225 (PMXVI16GER2): Likewise.
11226 (PMXVI16GER2PP): Likewise.
11227 (PMXVI16GER2S): Likewise.
11228 (PMXVI16GER2SPP): Likewise.
11229 (PMXVI4GER8): Likewise.
11230 (PMXVI4GER8PP): Likewise.
11231 (PMXVI8GER4): Likewise.
11232 (PMXVI8GER4PP): Likewise.
11233 (PMXVI8GER4SPP): Likewise.
11234 (XVBF16GER2): Likewise.
11235 (XVBF16GER2NN): Likewise.
11236 (XVBF16GER2NP): Likewise.
11237 (XVBF16GER2PN): Likewise.
11238 (XVBF16GER2PP): Likewise.
11239 (XVF16GER2): Likewise.
11240 (XVF16GER2NN): Likewise.
11241 (XVF16GER2NP): Likewise.
11242 (XVF16GER2PN): Likewise.
11243 (XVF16GER2PP): Likewise.
11244 (XVF32GER): Likewise.
11245 (XVF32GERNN): Likewise.
11246 (XVF32GERNP): Likewise.
11247 (XVF32GERPN): Likewise.
11248 (XVF32GERPP): Likewise.
11249 (XVF64GER): Likewise.
11250 (XVF64GERNN): Likewise.
11251 (XVF64GERNP): Likewise.
11252 (XVF64GERPN): Likewise.
11253 (XVF64GERPP): Likewise.
11254 (XVI16GER2): Likewise.
11255 (XVI16GER2PP): Likewise.
11256 (XVI16GER2S): Likewise.
11257 (XVI16GER2SPP): Likewise.
11258 (XVI4GER8): Likewise.
11259 (XVI4GER8PP): Likewise.
11260 (XVI8GER4): Likewise.
11261 (XVI8GER4PP): Likewise.
11262 (XVI8GER4SPP): Likewise.
11263 (XXMFACC): Likewise.
11264 (XXMTACC): Likewise.
11265 (XXSETACCZ): Likewise.
11266 (ASSEMBLE_PAIR_V): Likewise.
11267 (BUILD_PAIR): Likewise.
11268 (DISASSEMBLE_PAIR_V): Likewise.
11271 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_mma_builtin):
11272 Handle RS6000_BIF_LXVP and RS6000_BIF_STXVP.
11273 * config/rs6000/rs6000-gen-builtins.c (attrinfo): Add ismmaint.
11274 (parse_bif_attrs): Handle ismmaint.
11275 (write_decls): Add bif_mmaint_bit and bif_is_mmaint.
11276 (write_bif_static_init): Handle ismmaint.
11278 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
11280 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin): New
11282 (rs6000_gimple_fold_builtin): Call rs6000_gimple_fold_new_builtin.
11283 (rs6000_new_builtin_valid_without_lhs): New function.
11284 (rs6000_gimple_fold_new_mma_builtin): Likewise.
11285 (rs6000_gimple_fold_new_builtin): Likewise.
11287 2021-09-17 Thomas Schwinge <thomas@codesourcery.com>
11289 * hash-table.h (hash_table<Descriptor, Lazy, Allocator>::expand):
11290 Destruct stale Value objects.
11291 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
11294 2021-09-17 Roger Sayle <roger@nextmovesoftware.com>
11297 * match.pd (shift optimizations): Disable recent sign-changing
11298 optimization for shifts by zero, these will be folded later.
11300 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
11302 * config/rs6000/rs6000-builtin-new.def (__builtin_mffsl): Move from
11303 [power9] to [always].
11305 2021-09-17 Richard Biener <rguenther@suse.de>
11307 * tree-vect-stmts.c (vectorizable_load): Do not frob
11310 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
11312 * config/i386/i386-features.c (remove_partial_avx_dependency):
11313 Also check TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY and
11314 and TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY before generating
11316 * config/i386/i386.h (TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY):
11318 (TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
11319 * config/i386/i386.md (SSE FP to FP splitters): Replace
11320 TARGET_SSE_PARTIAL_REG_DEPENDENCY with
11321 TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY.
11322 (SSE INT to FP splitter): Replace TARGET_SSE_PARTIAL_REG_DEPENDENCY
11323 with TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY.
11324 * config/i386/x86-tune.def
11325 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): New.
11326 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
11328 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
11331 * config/i386/i386-features.c (remove_partial_avx_dependency):
11332 Check TARGET_USE_VECTOR_FP_CONVERTS and TARGET_USE_VECTOR_CONVERTS
11333 before generating vxorps.
11335 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
11337 * config/i386/i386-options.c (processor_cost_table): Use
11338 tremont_cost for Tremont.
11339 * config/i386/x86-tune-costs.h (tremont_memcpy): New.
11340 (tremont_memset): Likewise.
11341 (tremont_cost): Likewise.
11342 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
11343 Enable for Tremont.
11345 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
11347 * common/config/i386/i386-common.c: Use Haswell scheduling model
11349 * config/i386/i386.c (ix86_sched_init_global): Prepare for Tremont
11351 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Tremont
11353 (ix86_adjust_cost): Handle Tremont.
11354 * config/i386/x86-tune.def (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY):
11355 Enable for Tremont.
11356 (X86_TUNE_USE_LEAVE): Likewise.
11357 (X86_TUNE_PUSH_MEMORY): Likewise.
11358 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
11359 (X86_TUNE_USE_CLTD): Likewise.
11360 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
11361 (X86_TUNE_AVOID_MFENCE): Likewise.
11362 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
11363 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
11364 (X86_TUNE_ACCUMULATE_OUTGOING_ARGS): Disable for Tremont.
11365 (X86_TUNE_FOUR_JUMP_LIMIT): Likewise.
11366 (X86_TUNE_OPT_AGU): Likewise.
11367 (X86_TUNE_AVOID_LEA_FOR_ADDR): Likewise.
11368 (X86_TUNE_AVOID_MEM_OPND_FOR_CMOVE): Likewise.
11369 (X86_TUNE_EXPAND_ABS): Likewise.
11370 (X86_TUNE_SPLIT_MEM_OPND_FOR_FP_CONVERTS): Likewise.
11371 (X86_TUNE_SLOW_PSHUFB): Likewise.
11373 2021-09-17 Eric Botcazou <ebotcazou@adacore.com>
11375 PR rtl-optimization/102306
11376 * combine.c (try_combine): Abort the combination if we are about to
11377 duplicate volatile references.
11379 2021-09-17 liuhongt <hongtao.liu@intel.com>
11381 * config/i386/avx512fp16intrin.h (_mm_undefined_ph):
11383 (_mm256_undefined_ph): Likewise.
11384 (_mm512_undefined_ph): Likewise.
11385 (_mm_cvtsh_h): Likewise.
11386 (_mm256_cvtsh_h): Likewise.
11387 (_mm512_cvtsh_h): Likewise.
11388 (_mm512_castph_ps): Likewise.
11389 (_mm512_castph_pd): Likewise.
11390 (_mm512_castph_si512): Likewise.
11391 (_mm512_castph512_ph128): Likewise.
11392 (_mm512_castph512_ph256): Likewise.
11393 (_mm512_castph128_ph512): Likewise.
11394 (_mm512_castph256_ph512): Likewise.
11395 (_mm512_zextph128_ph512): Likewise.
11396 (_mm512_zextph256_ph512): Likewise.
11397 (_mm512_castps_ph): Likewise.
11398 (_mm512_castpd_ph): Likewise.
11399 (_mm512_castsi512_ph): Likewise.
11400 * config/i386/avx512fp16vlintrin.h (_mm_castph_ps):
11402 (_mm256_castph_ps): Likewise.
11403 (_mm_castph_pd): Likewise.
11404 (_mm256_castph_pd): Likewise.
11405 (_mm_castph_si128): Likewise.
11406 (_mm256_castph_si256): Likewise.
11407 (_mm_castps_ph): Likewise.
11408 (_mm256_castps_ph): Likewise.
11409 (_mm_castpd_ph): Likewise.
11410 (_mm256_castpd_ph): Likewise.
11411 (_mm_castsi128_ph): Likewise.
11412 (_mm256_castsi256_ph): Likewise.
11413 (_mm256_castph256_ph128): Likewise.
11414 (_mm256_castph128_ph256): Likewise.
11415 (_mm256_zextph128_ph256): Likewise.
11417 2021-09-17 liuhongt <hongtao.liu@intel.com>
11419 * config/i386/avx512fp16intrin.h (_mm_cvtsh_ss):
11421 (_mm_mask_cvtsh_ss): Likewise.
11422 (_mm_maskz_cvtsh_ss): Likewise.
11423 (_mm_cvtsh_sd): Likewise.
11424 (_mm_mask_cvtsh_sd): Likewise.
11425 (_mm_maskz_cvtsh_sd): Likewise.
11426 (_mm_cvt_roundsh_ss): Likewise.
11427 (_mm_mask_cvt_roundsh_ss): Likewise.
11428 (_mm_maskz_cvt_roundsh_ss): Likewise.
11429 (_mm_cvt_roundsh_sd): Likewise.
11430 (_mm_mask_cvt_roundsh_sd): Likewise.
11431 (_mm_maskz_cvt_roundsh_sd): Likewise.
11432 (_mm_cvtss_sh): Likewise.
11433 (_mm_mask_cvtss_sh): Likewise.
11434 (_mm_maskz_cvtss_sh): Likewise.
11435 (_mm_cvtsd_sh): Likewise.
11436 (_mm_mask_cvtsd_sh): Likewise.
11437 (_mm_maskz_cvtsd_sh): Likewise.
11438 (_mm_cvt_roundss_sh): Likewise.
11439 (_mm_mask_cvt_roundss_sh): Likewise.
11440 (_mm_maskz_cvt_roundss_sh): Likewise.
11441 (_mm_cvt_roundsd_sh): Likewise.
11442 (_mm_mask_cvt_roundsd_sh): Likewise.
11443 (_mm_maskz_cvt_roundsd_sh): Likewise.
11444 * config/i386/i386-builtin-types.def
11445 (V8HF_FTYPE_V2DF_V8HF_V8HF_UQI_INT,
11446 V8HF_FTYPE_V4SF_V8HF_V8HF_UQI_INT,
11447 V2DF_FTYPE_V8HF_V2DF_V2DF_UQI_INT,
11448 V4SF_FTYPE_V8HF_V4SF_V4SF_UQI_INT): Add new builtin types.
11449 * config/i386/i386-builtin.def: Add corrresponding new builtins.
11450 * config/i386/i386-expand.c: Handle new builtin types.
11451 * config/i386/sse.md (VF48_128): New mode iterator.
11452 (avx512fp16_vcvtsh2<ssescalarmodesuffix><mask_scalar_name><round_saeonly_scalar_name>):
11454 (avx512fp16_vcvt<ssescalarmodesuffix>2sh<mask_scalar_name><round_scalar_name>):
11457 2021-09-17 liuhongt <hongtao.liu@intel.com>
11459 * config/i386/avx512fp16intrin.h (_mm512_cvtph_pd):
11461 (_mm512_mask_cvtph_pd): Likewise.
11462 (_mm512_maskz_cvtph_pd): Likewise.
11463 (_mm512_cvt_roundph_pd): Likewise.
11464 (_mm512_mask_cvt_roundph_pd): Likewise.
11465 (_mm512_maskz_cvt_roundph_pd): Likewise.
11466 (_mm512_cvtxph_ps): Likewise.
11467 (_mm512_mask_cvtxph_ps): Likewise.
11468 (_mm512_maskz_cvtxph_ps): Likewise.
11469 (_mm512_cvtx_roundph_ps): Likewise.
11470 (_mm512_mask_cvtx_roundph_ps): Likewise.
11471 (_mm512_maskz_cvtx_roundph_ps): Likewise.
11472 (_mm512_cvtxps_ph): Likewise.
11473 (_mm512_mask_cvtxps_ph): Likewise.
11474 (_mm512_maskz_cvtxps_ph): Likewise.
11475 (_mm512_cvtx_roundps_ph): Likewise.
11476 (_mm512_mask_cvtx_roundps_ph): Likewise.
11477 (_mm512_maskz_cvtx_roundps_ph): Likewise.
11478 (_mm512_cvtpd_ph): Likewise.
11479 (_mm512_mask_cvtpd_ph): Likewise.
11480 (_mm512_maskz_cvtpd_ph): Likewise.
11481 (_mm512_cvt_roundpd_ph): Likewise.
11482 (_mm512_mask_cvt_roundpd_ph): Likewise.
11483 (_mm512_maskz_cvt_roundpd_ph): Likewise.
11484 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_pd):
11486 (_mm_mask_cvtph_pd): Likewise.
11487 (_mm_maskz_cvtph_pd): Likewise.
11488 (_mm256_cvtph_pd): Likewise.
11489 (_mm256_mask_cvtph_pd): Likewise.
11490 (_mm256_maskz_cvtph_pd): Likewise.
11491 (_mm_cvtxph_ps): Likewise.
11492 (_mm_mask_cvtxph_ps): Likewise.
11493 (_mm_maskz_cvtxph_ps): Likewise.
11494 (_mm256_cvtxph_ps): Likewise.
11495 (_mm256_mask_cvtxph_ps): Likewise.
11496 (_mm256_maskz_cvtxph_ps): Likewise.
11497 (_mm_cvtxps_ph): Likewise.
11498 (_mm_mask_cvtxps_ph): Likewise.
11499 (_mm_maskz_cvtxps_ph): Likewise.
11500 (_mm256_cvtxps_ph): Likewise.
11501 (_mm256_mask_cvtxps_ph): Likewise.
11502 (_mm256_maskz_cvtxps_ph): Likewise.
11503 (_mm_cvtpd_ph): Likewise.
11504 (_mm_mask_cvtpd_ph): Likewise.
11505 (_mm_maskz_cvtpd_ph): Likewise.
11506 (_mm256_cvtpd_ph): Likewise.
11507 (_mm256_mask_cvtpd_ph): Likewise.
11508 (_mm256_maskz_cvtpd_ph): Likewise.
11509 * config/i386/i386-builtin.def: Add corresponding new builtins.
11510 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11511 * config/i386/i386-expand.c: Handle new builtin types.
11512 * config/i386/sse.md
11513 (VF4_128_8_256): New.
11514 (VF48H_AVX512VL): Ditto.
11515 (ssePHmode): Add HF vector modes.
11516 (castmode): Add new convertable modes.
11517 (qq2phsuff): Ditto.
11518 (ph2pssuffix): New.
11519 (avx512fp16_vcvt<castmode>2ph_<mode><mask_name><round_name>): Ditto.
11520 (avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
11521 (*avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
11522 (avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
11523 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
11524 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask_1): Ditto.
11525 (avx512fp16_float_extend_ph<mode>2<mask_name><round_saeonly_name>):
11527 (avx512fp16_float_extend_ph<mode>2<mask_name>): Ditto.
11528 (*avx512fp16_float_extend_ph<mode>2_load<mask_name>): Ditto.
11529 (avx512fp16_float_extend_phv2df2<mask_name>): Ditto.
11530 (*avx512fp16_float_extend_phv2df2_load<mask_name>): Ditto.
11532 2021-09-17 liuhongt <hongtao.liu@intel.com>
11534 * config/i386/avx512fp16intrin.h (_mm_cvttsh_i32):
11536 (_mm_cvttsh_u32): Likewise.
11537 (_mm_cvtt_roundsh_i32): Likewise.
11538 (_mm_cvtt_roundsh_u32): Likewise.
11539 (_mm_cvttsh_i64): Likewise.
11540 (_mm_cvttsh_u64): Likewise.
11541 (_mm_cvtt_roundsh_i64): Likewise.
11542 (_mm_cvtt_roundsh_u64): Likewise.
11543 * config/i386/i386-builtin.def: Add corresponding new builtins.
11544 * config/i386/sse.md
11545 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<round_saeonly_name>):
11548 2021-09-17 liuhongt <hongtao.liu@intel.com>
11550 * config/i386/avx512fp16intrin.h (_mm512_cvttph_epi32):
11552 (_mm512_mask_cvttph_epi32): Likewise.
11553 (_mm512_maskz_cvttph_epi32): Likewise.
11554 (_mm512_cvtt_roundph_epi32): Likewise.
11555 (_mm512_mask_cvtt_roundph_epi32): Likewise.
11556 (_mm512_maskz_cvtt_roundph_epi32): Likewise.
11557 (_mm512_cvttph_epu32): Likewise.
11558 (_mm512_mask_cvttph_epu32): Likewise.
11559 (_mm512_maskz_cvttph_epu32): Likewise.
11560 (_mm512_cvtt_roundph_epu32): Likewise.
11561 (_mm512_mask_cvtt_roundph_epu32): Likewise.
11562 (_mm512_maskz_cvtt_roundph_epu32): Likewise.
11563 (_mm512_cvttph_epi64): Likewise.
11564 (_mm512_mask_cvttph_epi64): Likewise.
11565 (_mm512_maskz_cvttph_epi64): Likewise.
11566 (_mm512_cvtt_roundph_epi64): Likewise.
11567 (_mm512_mask_cvtt_roundph_epi64): Likewise.
11568 (_mm512_maskz_cvtt_roundph_epi64): Likewise.
11569 (_mm512_cvttph_epu64): Likewise.
11570 (_mm512_mask_cvttph_epu64): Likewise.
11571 (_mm512_maskz_cvttph_epu64): Likewise.
11572 (_mm512_cvtt_roundph_epu64): Likewise.
11573 (_mm512_mask_cvtt_roundph_epu64): Likewise.
11574 (_mm512_maskz_cvtt_roundph_epu64): Likewise.
11575 (_mm512_cvttph_epi16): Likewise.
11576 (_mm512_mask_cvttph_epi16): Likewise.
11577 (_mm512_maskz_cvttph_epi16): Likewise.
11578 (_mm512_cvtt_roundph_epi16): Likewise.
11579 (_mm512_mask_cvtt_roundph_epi16): Likewise.
11580 (_mm512_maskz_cvtt_roundph_epi16): Likewise.
11581 (_mm512_cvttph_epu16): Likewise.
11582 (_mm512_mask_cvttph_epu16): Likewise.
11583 (_mm512_maskz_cvttph_epu16): Likewise.
11584 (_mm512_cvtt_roundph_epu16): Likewise.
11585 (_mm512_mask_cvtt_roundph_epu16): Likewise.
11586 (_mm512_maskz_cvtt_roundph_epu16): Likewise.
11587 * config/i386/avx512fp16vlintrin.h (_mm_cvttph_epi32):
11589 (_mm_mask_cvttph_epi32): Likewise.
11590 (_mm_maskz_cvttph_epi32): Likewise.
11591 (_mm256_cvttph_epi32): Likewise.
11592 (_mm256_mask_cvttph_epi32): Likewise.
11593 (_mm256_maskz_cvttph_epi32): Likewise.
11594 (_mm_cvttph_epu32): Likewise.
11595 (_mm_mask_cvttph_epu32): Likewise.
11596 (_mm_maskz_cvttph_epu32): Likewise.
11597 (_mm256_cvttph_epu32): Likewise.
11598 (_mm256_mask_cvttph_epu32): Likewise.
11599 (_mm256_maskz_cvttph_epu32): Likewise.
11600 (_mm_cvttph_epi64): Likewise.
11601 (_mm_mask_cvttph_epi64): Likewise.
11602 (_mm_maskz_cvttph_epi64): Likewise.
11603 (_mm256_cvttph_epi64): Likewise.
11604 (_mm256_mask_cvttph_epi64): Likewise.
11605 (_mm256_maskz_cvttph_epi64): Likewise.
11606 (_mm_cvttph_epu64): Likewise.
11607 (_mm_mask_cvttph_epu64): Likewise.
11608 (_mm_maskz_cvttph_epu64): Likewise.
11609 (_mm256_cvttph_epu64): Likewise.
11610 (_mm256_mask_cvttph_epu64): Likewise.
11611 (_mm256_maskz_cvttph_epu64): Likewise.
11612 (_mm_cvttph_epi16): Likewise.
11613 (_mm_mask_cvttph_epi16): Likewise.
11614 (_mm_maskz_cvttph_epi16): Likewise.
11615 (_mm256_cvttph_epi16): Likewise.
11616 (_mm256_mask_cvttph_epi16): Likewise.
11617 (_mm256_maskz_cvttph_epi16): Likewise.
11618 (_mm_cvttph_epu16): Likewise.
11619 (_mm_mask_cvttph_epu16): Likewise.
11620 (_mm_maskz_cvttph_epu16): Likewise.
11621 (_mm256_cvttph_epu16): Likewise.
11622 (_mm256_mask_cvttph_epu16): Likewise.
11623 (_mm256_maskz_cvttph_epu16): Likewise.
11624 * config/i386/i386-builtin.def: Add new builtins.
11625 * config/i386/sse.md
11626 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name><round_saeonly_name>):
11628 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name>): Ditto.
11629 (*avx512fp16_fix<fixunssuffix>_trunc<mode>2_load<mask_name>): Ditto.
11630 (avx512fp16_fix<fixunssuffix>_truncv2di2<mask_name>): Ditto.
11631 (avx512fp16_fix<fixunssuffix>_truncv2di2_load<mask_name>): Ditto.
11633 2021-09-17 liuhongt <hongtao.liu@intel.com>
11635 * config/i386/avx512fp16intrin.h (_mm_cvtsh_i32): New intrinsic.
11636 (_mm_cvtsh_u32): Likewise.
11637 (_mm_cvt_roundsh_i32): Likewise.
11638 (_mm_cvt_roundsh_u32): Likewise.
11639 (_mm_cvtsh_i64): Likewise.
11640 (_mm_cvtsh_u64): Likewise.
11641 (_mm_cvt_roundsh_i64): Likewise.
11642 (_mm_cvt_roundsh_u64): Likewise.
11643 (_mm_cvti32_sh): Likewise.
11644 (_mm_cvtu32_sh): Likewise.
11645 (_mm_cvt_roundi32_sh): Likewise.
11646 (_mm_cvt_roundu32_sh): Likewise.
11647 (_mm_cvti64_sh): Likewise.
11648 (_mm_cvtu64_sh): Likewise.
11649 (_mm_cvt_roundi64_sh): Likewise.
11650 (_mm_cvt_roundu64_sh): Likewise.
11651 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11652 * config/i386/i386-builtin.def: Add corresponding new builtins.
11653 * config/i386/i386-expand.c (ix86_expand_round_builtin):
11654 Handle new builtin types.
11655 * config/i386/sse.md
11656 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix><round_name>):
11658 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix>_2): Likewise.
11659 (avx512fp16_vcvt<floatsuffix>si2sh<rex64namesuffix><round_name>): Likewise.
11661 2021-09-16 Bill Schmidt <wschmidt@linux.ibm.com>
11663 * config/rs6000/rs6000-c.c (rs6000-builtins.h): New include.
11664 (altivec_resolve_new_overloaded_builtin): New forward decl.
11665 (rs6000_new_builtin_type_compatible): New function.
11666 (altivec_resolve_overloaded_builtin): Call
11667 altivec_resolve_new_overloaded_builtin.
11668 (altivec_build_new_resolved_builtin): New function.
11669 (altivec_resolve_new_overloaded_builtin): Likewise.
11670 * config/rs6000/rs6000-call.c (rs6000_new_builtin_is_supported):
11672 * config/rs6000/rs6000-gen-builtins.c (write_decls): Remove _p from
11673 name of rs6000_new_builtin_is_supported.
11675 2021-09-16 Uroš Bizjak <ubizjak@gmail.com>
11677 * config/i386/i386-protos.h (ix86_decompose_address):
11678 Change return type to bool.
11679 * config/i386/i386.c (ix86_decompose_address): Ditto.
11681 2021-09-16 Tobias Burnus <tobias@codesourcery.com>
11684 * config/rs6000/t-rs6000 (build/rs6000-gen-builtins.o, build/rbtree.o):
11685 Added 'build/' to target, use build/%.o rule.
11686 (build/rs6000-gen-builtins$(build_exeext)): Add 'build/' and
11687 '$(build_exeext)' to target and 'build/' for the *.o files.
11688 (rs6000-builtins.c): Update for those changes; run rs6000-gen-builtins
11691 2021-09-16 Martin Jambor <mjambor@suse.cz>
11693 * cgraph.c (cgraph_node::dump): Do not check caller count sums if
11694 the body has been removed. Remove trailing whitespace.
11696 2021-09-16 Richard Biener <rguenther@suse.de>
11698 PR middle-end/102360
11699 * internal-fn.c (expand_DEFERRED_INIT): Make pattern-init
11700 of non-memory more robust.
11702 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11704 * config/sparc/sparc-opts.h (enum sparc_processor_type): Add LEON5
11705 * config/sparc/sparc.c (struct processor_costs): Add LEON5 costs
11706 (leon5_adjust_cost): Increase cost of store with data dependency
11707 on ALU instruction and FPU anti-dependencies.
11708 (sparc_option_override): Add LEON5 costs
11709 (sparc_adjust_cost): Add LEON5 cost adjustments
11710 * config/sparc/sparc.h: Add LEON5
11711 * config/sparc/sparc.md: Include LEON5 scheduling information
11712 * config/sparc/sparc.opt: Add LEON5
11713 * doc/invoke.texi: Add LEON5
11714 * config/sparc/leon5.md: New file.
11716 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11718 * config/sparc/sparc.md (stack_protect_set32): Add NOP to prevent
11719 sensitive sequence for B2BST errata workaround.
11721 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11723 * config/sparc/sparc.c (sparc_do_work_around_errata): Do not begin
11724 functions with atomic instruction in the UT700 errata workaround.
11726 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11728 * config/sparc/sparc.c (next_active_non_empty_insn): New function
11729 that returns next active non empty assembly instruction.
11730 (sparc_do_work_around_errata): Use new function.
11732 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11734 * config/sparc/sparc.c (store_insn_p): Add predicate for store
11736 (load_insn_p): Add predicate for load attributes.
11737 (sparc_do_work_around_errata): Use new predicates.
11739 2021-09-16 Andreas Larsson <andreas@gaisler.com>
11741 * config/sparc/sparc.c (dump_target_flag_bits): Print bit names for
11744 2021-09-16 Martin Liska <mliska@suse.cz>
11746 * config/mips/netbsd.h: Fix typo in name of a macro.
11748 2021-09-16 liuhongt <hongtao.liu@intel.com>
11750 PR middle-end/102080
11751 * match.pd: Check mask type when doing cond_op related gimple
11753 * tree.c (is_truth_type_for): New function.
11754 * tree.h (is_truth_type_for): New declaration.
11756 2021-09-16 liuhongt <hongtao.liu@intel.com>
11758 * config/i386/avx512fp16intrin.h (_mm512_cvtepi32_ph): New
11760 (_mm512_mask_cvtepi32_ph): Likewise.
11761 (_mm512_maskz_cvtepi32_ph): Likewise.
11762 (_mm512_cvt_roundepi32_ph): Likewise.
11763 (_mm512_mask_cvt_roundepi32_ph): Likewise.
11764 (_mm512_maskz_cvt_roundepi32_ph): Likewise.
11765 (_mm512_cvtepu32_ph): Likewise.
11766 (_mm512_mask_cvtepu32_ph): Likewise.
11767 (_mm512_maskz_cvtepu32_ph): Likewise.
11768 (_mm512_cvt_roundepu32_ph): Likewise.
11769 (_mm512_mask_cvt_roundepu32_ph): Likewise.
11770 (_mm512_maskz_cvt_roundepu32_ph): Likewise.
11771 (_mm512_cvtepi64_ph): Likewise.
11772 (_mm512_mask_cvtepi64_ph): Likewise.
11773 (_mm512_maskz_cvtepi64_ph): Likewise.
11774 (_mm512_cvt_roundepi64_ph): Likewise.
11775 (_mm512_mask_cvt_roundepi64_ph): Likewise.
11776 (_mm512_maskz_cvt_roundepi64_ph): Likewise.
11777 (_mm512_cvtepu64_ph): Likewise.
11778 (_mm512_mask_cvtepu64_ph): Likewise.
11779 (_mm512_maskz_cvtepu64_ph): Likewise.
11780 (_mm512_cvt_roundepu64_ph): Likewise.
11781 (_mm512_mask_cvt_roundepu64_ph): Likewise.
11782 (_mm512_maskz_cvt_roundepu64_ph): Likewise.
11783 (_mm512_cvtepi16_ph): Likewise.
11784 (_mm512_mask_cvtepi16_ph): Likewise.
11785 (_mm512_maskz_cvtepi16_ph): Likewise.
11786 (_mm512_cvt_roundepi16_ph): Likewise.
11787 (_mm512_mask_cvt_roundepi16_ph): Likewise.
11788 (_mm512_maskz_cvt_roundepi16_ph): Likewise.
11789 (_mm512_cvtepu16_ph): Likewise.
11790 (_mm512_mask_cvtepu16_ph): Likewise.
11791 (_mm512_maskz_cvtepu16_ph): Likewise.
11792 (_mm512_cvt_roundepu16_ph): Likewise.
11793 (_mm512_mask_cvt_roundepu16_ph): Likewise.
11794 (_mm512_maskz_cvt_roundepu16_ph): Likewise.
11795 * config/i386/avx512fp16vlintrin.h (_mm_cvtepi32_ph): New
11797 (_mm_mask_cvtepi32_ph): Likewise.
11798 (_mm_maskz_cvtepi32_ph): Likewise.
11799 (_mm256_cvtepi32_ph): Likewise.
11800 (_mm256_mask_cvtepi32_ph): Likewise.
11801 (_mm256_maskz_cvtepi32_ph): Likewise.
11802 (_mm_cvtepu32_ph): Likewise.
11803 (_mm_mask_cvtepu32_ph): Likewise.
11804 (_mm_maskz_cvtepu32_ph): Likewise.
11805 (_mm256_cvtepu32_ph): Likewise.
11806 (_mm256_mask_cvtepu32_ph): Likewise.
11807 (_mm256_maskz_cvtepu32_ph): Likewise.
11808 (_mm_cvtepi64_ph): Likewise.
11809 (_mm_mask_cvtepi64_ph): Likewise.
11810 (_mm_maskz_cvtepi64_ph): Likewise.
11811 (_mm256_cvtepi64_ph): Likewise.
11812 (_mm256_mask_cvtepi64_ph): Likewise.
11813 (_mm256_maskz_cvtepi64_ph): Likewise.
11814 (_mm_cvtepu64_ph): Likewise.
11815 (_mm_mask_cvtepu64_ph): Likewise.
11816 (_mm_maskz_cvtepu64_ph): Likewise.
11817 (_mm256_cvtepu64_ph): Likewise.
11818 (_mm256_mask_cvtepu64_ph): Likewise.
11819 (_mm256_maskz_cvtepu64_ph): Likewise.
11820 (_mm_cvtepi16_ph): Likewise.
11821 (_mm_mask_cvtepi16_ph): Likewise.
11822 (_mm_maskz_cvtepi16_ph): Likewise.
11823 (_mm256_cvtepi16_ph): Likewise.
11824 (_mm256_mask_cvtepi16_ph): Likewise.
11825 (_mm256_maskz_cvtepi16_ph): Likewise.
11826 (_mm_cvtepu16_ph): Likewise.
11827 (_mm_mask_cvtepu16_ph): Likewise.
11828 (_mm_maskz_cvtepu16_ph): Likewise.
11829 (_mm256_cvtepu16_ph): Likewise.
11830 (_mm256_mask_cvtepu16_ph): Likewise.
11831 (_mm256_maskz_cvtepu16_ph): Likewise.
11832 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11833 * config/i386/i386-builtin.def: Add corresponding new builtins.
11834 * config/i386/i386-expand.c
11835 (ix86_expand_args_builtin): Handle new builtin types.
11836 (ix86_expand_round_builtin): Ditto.
11837 * config/i386/i386-modes.def: Declare V2HF and V6HF.
11838 * config/i386/sse.md (VI2H_AVX512VL): New.
11839 (qq2phsuff): Ditto.
11840 (sseintvecmode): Add HF vector modes.
11841 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode><mask_name><round_name>):
11843 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
11844 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
11845 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
11846 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
11847 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask_1): Ditto.
11848 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
11849 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
11850 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
11851 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
11852 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask_1): Ditto.
11853 * config/i386/subst.md (round_qq2phsuff): New subst_attr.
11855 2021-09-16 liuhongt <hongtao.liu@intel.com>
11857 * config/i386/avx512fp16intrin.h (_mm512_cvtph_epi32):
11859 (_mm512_mask_cvtph_epi32): Likewise.
11860 (_mm512_maskz_cvtph_epi32): Likewise.
11861 (_mm512_cvt_roundph_epi32): Likewise.
11862 (_mm512_mask_cvt_roundph_epi32): Likewise.
11863 (_mm512_maskz_cvt_roundph_epi32): Likewise.
11864 (_mm512_cvtph_epu32): Likewise.
11865 (_mm512_mask_cvtph_epu32): Likewise.
11866 (_mm512_maskz_cvtph_epu32): Likewise.
11867 (_mm512_cvt_roundph_epu32): Likewise.
11868 (_mm512_mask_cvt_roundph_epu32): Likewise.
11869 (_mm512_maskz_cvt_roundph_epu32): Likewise.
11870 (_mm512_cvtph_epi64): Likewise.
11871 (_mm512_mask_cvtph_epi64): Likewise.
11872 (_mm512_maskz_cvtph_epi64): Likewise.
11873 (_mm512_cvt_roundph_epi64): Likewise.
11874 (_mm512_mask_cvt_roundph_epi64): Likewise.
11875 (_mm512_maskz_cvt_roundph_epi64): Likewise.
11876 (_mm512_cvtph_epu64): Likewise.
11877 (_mm512_mask_cvtph_epu64): Likewise.
11878 (_mm512_maskz_cvtph_epu64): Likewise.
11879 (_mm512_cvt_roundph_epu64): Likewise.
11880 (_mm512_mask_cvt_roundph_epu64): Likewise.
11881 (_mm512_maskz_cvt_roundph_epu64): Likewise.
11882 (_mm512_cvtph_epi16): Likewise.
11883 (_mm512_mask_cvtph_epi16): Likewise.
11884 (_mm512_maskz_cvtph_epi16): Likewise.
11885 (_mm512_cvt_roundph_epi16): Likewise.
11886 (_mm512_mask_cvt_roundph_epi16): Likewise.
11887 (_mm512_maskz_cvt_roundph_epi16): Likewise.
11888 (_mm512_cvtph_epu16): Likewise.
11889 (_mm512_mask_cvtph_epu16): Likewise.
11890 (_mm512_maskz_cvtph_epu16): Likewise.
11891 (_mm512_cvt_roundph_epu16): Likewise.
11892 (_mm512_mask_cvt_roundph_epu16): Likewise.
11893 (_mm512_maskz_cvt_roundph_epu16): Likewise.
11894 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_epi32):
11896 (_mm_mask_cvtph_epi32): Likewise.
11897 (_mm_maskz_cvtph_epi32): Likewise.
11898 (_mm256_cvtph_epi32): Likewise.
11899 (_mm256_mask_cvtph_epi32): Likewise.
11900 (_mm256_maskz_cvtph_epi32): Likewise.
11901 (_mm_cvtph_epu32): Likewise.
11902 (_mm_mask_cvtph_epu32): Likewise.
11903 (_mm_maskz_cvtph_epu32): Likewise.
11904 (_mm256_cvtph_epu32): Likewise.
11905 (_mm256_mask_cvtph_epu32): Likewise.
11906 (_mm256_maskz_cvtph_epu32): Likewise.
11907 (_mm_cvtph_epi64): Likewise.
11908 (_mm_mask_cvtph_epi64): Likewise.
11909 (_mm_maskz_cvtph_epi64): Likewise.
11910 (_mm256_cvtph_epi64): Likewise.
11911 (_mm256_mask_cvtph_epi64): Likewise.
11912 (_mm256_maskz_cvtph_epi64): Likewise.
11913 (_mm_cvtph_epu64): Likewise.
11914 (_mm_mask_cvtph_epu64): Likewise.
11915 (_mm_maskz_cvtph_epu64): Likewise.
11916 (_mm256_cvtph_epu64): Likewise.
11917 (_mm256_mask_cvtph_epu64): Likewise.
11918 (_mm256_maskz_cvtph_epu64): Likewise.
11919 (_mm_cvtph_epi16): Likewise.
11920 (_mm_mask_cvtph_epi16): Likewise.
11921 (_mm_maskz_cvtph_epi16): Likewise.
11922 (_mm256_cvtph_epi16): Likewise.
11923 (_mm256_mask_cvtph_epi16): Likewise.
11924 (_mm256_maskz_cvtph_epi16): Likewise.
11925 (_mm_cvtph_epu16): Likewise.
11926 (_mm_mask_cvtph_epu16): Likewise.
11927 (_mm_maskz_cvtph_epu16): Likewise.
11928 (_mm256_cvtph_epu16): Likewise.
11929 (_mm256_mask_cvtph_epu16): Likewise.
11930 (_mm256_maskz_cvtph_epu16): Likewise.
11931 * config/i386/i386-builtin-types.def: Add new builtin types.
11932 * config/i386/i386-builtin.def: Add new builtins.
11933 * config/i386/i386-expand.c
11934 (ix86_expand_args_builtin): Handle new builtin types.
11935 (ix86_expand_round_builtin): Ditto.
11936 * config/i386/sse.md (sseintconvert): New.
11937 (ssePHmode): Ditto.
11938 (UNSPEC_US_FIX_NOTRUNC): Ditto.
11939 (sseintconvertsignprefix): Ditto.
11940 (avx512fp16_vcvtph2<sseintconvertsignprefix><sseintconvert>_<mode><mask_name><round_name>):
11943 2021-09-16 liuhongt <hongtao.liu@intel.com>
11945 * config/i386/avx512fp16intrin.h: (_mm_cvtsi16_si128):
11947 (_mm_cvtsi128_si16): Likewise.
11948 (_mm_mask_load_sh): Likewise.
11949 (_mm_maskz_load_sh): Likewise.
11950 (_mm_mask_store_sh): Likewise.
11951 (_mm_move_sh): Likewise.
11952 (_mm_mask_move_sh): Likewise.
11953 (_mm_maskz_move_sh): Likewise.
11954 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11955 * config/i386/i386-builtin.def: Add corresponding new builtins.
11956 * config/i386/i386-expand.c
11957 (ix86_expand_special_args_builtin): Handle new builtin types.
11958 (ix86_expand_vector_init_one_nonzero): Adjust for FP16 target.
11959 * config/i386/sse.md (VI2F): New mode iterator.
11960 (vec_set<mode>_0): Use new mode iterator.
11961 (avx512f_mov<ssescalarmodelower>_mask): Adjust for HF vector mode.
11962 (avx512f_store<mode>_mask): Ditto.
11964 2021-09-16 Kewen Lin <linkw@linux.ibm.com>
11966 * config/rs6000/rs6000.opt (-mtoc-fusion): Remove.
11968 2021-09-15 David Edelsohn <dje.gcc@gmail.com>
11970 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
11971 Proceed if no symbol summary or the symbol alias flag is false.
11973 2021-09-15 Jakub Jelinek <jakub@redhat.com>
11977 * varasm.c (output_constructor_regular_field): Instead of assertion
11978 that array_size_for_constructor result is equal to size of
11979 TREE_TYPE (local->val) in bytes, assert that the type size is greater
11980 or equal to array_size_for_constructor result and use type size as
11983 2021-09-15 Martin Liska <mliska@suse.cz>
11986 * config/i386/vxworks.h: Use new macro TARGET_CPU_P.
11988 2021-09-15 Martin Liska <mliska@suse.cz>
11991 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
11992 Check that we have a symbol summary for a symbol.
11994 2021-09-15 Richard Biener <rguenther@suse.de>
11997 * config/rs6000/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
11998 to inherit from elfos.h
12000 2021-09-15 liuhongt <hongtao.liu@intel.com>
12003 * config/i386/i386-expand.c
12004 (ix86_expand_vector_init_interleave): Use puncklwd to pack 2
12006 (ix86_expand_vector_set): Use blendw instead of pinsrw.
12007 * config/i386/i386.c (ix86_can_change_mode_class): Adjust for
12008 AVX512FP16 which supports 16bit vector load.
12009 * config/i386/sse.md (avx512bw_interleave_highv32hi<mask_name>):
12011 (avx512bw_interleave_high<mode><mask_name>): .. this, and
12012 extend to V32HFmode.
12013 (avx2_interleave_highv16hi<mask_name>): Rename to ..
12014 (avx2_interleave_high<mode><mask_name>): .. this, and extend
12016 (vec_interleave_highv8hi<mask_name>): Rename to ..
12017 (vec_interleave_high<mode><mask_name>): .. this, and extend to V8HFmode.
12018 (<mask_codefor>avx512bw_interleave_lowv32hi<mask_name>):
12020 (<mask_codefor>avx512bw_interleave_low<mode><mask_name>):
12021 this, and extend to V32HFmode.
12022 (avx2_interleave_lowv16hi<mask_name>): Rename to ..
12023 (avx2_interleave_low<mode><mask_name>): .. this, and extend to V16HFmode.
12024 (vec_interleave_lowv8hi<mask_name>): Rename to ..
12025 (vec_interleave_low<mode><mask_name>): .. this, and extend to V8HFmode.
12026 (sse4_1_pblendw): Rename to ..
12027 (sse4_1_pblend<blendsuf>): .. this, and extend to V8HFmode.
12028 (avx2_pblendph): New define_expand.
12029 (<sse2p4_1>_pinsr<ssemodesuffix>): Refactor, use
12030 sseintmodesuffix instead of ssemodesuffix.
12031 (blendsuf): New mode attr.
12033 2021-09-15 Richard Biener <rguenther@suse.de>
12035 * tree-vectorizer.h (dr_misalignment): Move out of line.
12036 (dr_target_alignment): New.
12037 (DR_TARGET_ALIGNMENT): Wrap dr_target_alignment.
12038 (set_dr_target_alignment): New.
12039 (SET_DR_TARGET_ALIGNMENT): Wrap set_dr_target_alignment.
12040 * tree-vect-data-refs.c (dr_misalignment): Compute and
12041 return the group members misalignment.
12042 (vect_compute_data_ref_alignment): Use SET_DR_TARGET_ALIGNMENT.
12043 (vect_analyze_data_refs_alignment): Compute alignment only
12044 for the first element of a DR group.
12045 (vect_slp_analyze_node_alignment): Likewise.
12047 2021-09-15 Hongyu Wang <hongyu.wang@intel.com>
12049 * config/i386/avx512fp16intrin.h: Adjust all builtin calls.
12050 * config/i386/avx512fp16vlintrin.h: Likewise.
12051 * config/i386/i386-builtin.def: Adjust builtin name and
12052 enumeration to match AVX512F style.
12054 2021-09-15 Richard Biener <rguenther@suse.de>
12056 PR tree-optimization/102318
12057 * tree-vect-loop.c (vect_transform_cycle_phi): Revert
12058 previous change and do the mode conversion separately from
12059 the sign conversion.
12061 2021-09-15 Hongtao Liu <hongtao.liu@intel.com>
12062 Peter Cordes <peter@cordes.ca>
12065 * config/i386/sse.md (extract_suf): Add V8SF/V8SI/V4DF/V4DI.
12066 (*vec_extract<mode><ssescalarmodelower>_valign): Output
12067 vextract{i,f}{32x4,64x2} instruction when byte_offset % 16 ==
12070 2021-09-15 Richard Biener <rguenther@suse.de>
12072 * config.gcc: Remove vax-*-openbsd* configuration.
12074 2021-09-15 Richard Biener <rguenther@suse.de>
12076 * config.gcc: Remove m68k-openbsd.
12078 2021-09-15 Max Filippov <jcmvbkbc@gmail.com>
12081 * config/xtensa/t-xtensa (TM_H): Add include/xtensa-config.h.
12083 2021-09-14 Peter Bergner <bergner@linux.ibm.com>
12085 * config/rs6000/mma.md (unspec): Delete UNSPEC_MMA_XXSETACCZ.
12086 (unspecv): Add UNSPECV_MMA_XXSETACCZ.
12087 (*mma_xxsetaccz): Delete.
12088 (mma_xxsetaccz): Change to define_insn. Remove operand 1.
12089 Use UNSPECV_MMA_XXSETACCZ. Update comment.
12090 * config/rs6000/rs6000.c (rs6000_rtx_costs): Use UNSPECV_MMA_XXSETACCZ.
12092 2021-09-14 Iain Sandoe <iain@sandoe.co.uk>
12094 * Makefile.in: Remove variables related to applying no-PIE
12095 to the exes on $build.
12096 * configure: Regenerate.
12097 * configure.ac: Remove configuration related to applying
12098 no-PIE to the exes on $build.
12100 2021-09-14 Claudiu Zissulescu <claziss@synopsys.com>
12102 * config/arc/arc.md (doloop_end): Add missing mode.
12103 (loop_end): Likewise.
12105 2021-09-14 Jakub Jelinek <jakub@redhat.com>
12107 * gimplify.c (goa_stabilize_expr): Add depth argument, propagate
12108 it to recursive calls, for depth above 7 just gimplify or return.
12109 Perform a test even for MODIFY_EXPR, ADDR_EXPR, COMPOUND_EXPR with
12110 __builtin_clear_padding and TARGET_EXPR.
12111 (gimplify_omp_atomic): Adjust goa_stabilize_expr callers.
12113 2021-09-14 liuhongt <hongtao.liu@intel.com>
12115 * config/i386/avx512fp16intrin.h (_mm_fpclass_sh_mask):
12117 (_mm_mask_fpclass_sh_mask): Likewise.
12118 (_mm512_mask_fpclass_ph_mask): Likewise.
12119 (_mm512_fpclass_ph_mask): Likewise.
12120 (_mm_getexp_sh): Likewise.
12121 (_mm_mask_getexp_sh): Likewise.
12122 (_mm_maskz_getexp_sh): Likewise.
12123 (_mm512_getexp_ph): Likewise.
12124 (_mm512_mask_getexp_ph): Likewise.
12125 (_mm512_maskz_getexp_ph): Likewise.
12126 (_mm_getexp_round_sh): Likewise.
12127 (_mm_mask_getexp_round_sh): Likewise.
12128 (_mm_maskz_getexp_round_sh): Likewise.
12129 (_mm512_getexp_round_ph): Likewise.
12130 (_mm512_mask_getexp_round_ph): Likewise.
12131 (_mm512_maskz_getexp_round_ph): Likewise.
12132 (_mm_getmant_sh): Likewise.
12133 (_mm_mask_getmant_sh): Likewise.
12134 (_mm_maskz_getmant_sh): Likewise.
12135 (_mm512_getmant_ph): Likewise.
12136 (_mm512_mask_getmant_ph): Likewise.
12137 (_mm512_maskz_getmant_ph): Likewise.
12138 (_mm_getmant_round_sh): Likewise.
12139 (_mm_mask_getmant_round_sh): Likewise.
12140 (_mm_maskz_getmant_round_sh): Likewise.
12141 (_mm512_getmant_round_ph): Likewise.
12142 (_mm512_mask_getmant_round_ph): Likewise.
12143 (_mm512_maskz_getmant_round_ph): Likewise.
12144 * config/i386/avx512fp16vlintrin.h (_mm_mask_fpclass_ph_mask):
12146 (_mm_fpclass_ph_mask): Likewise.
12147 (_mm256_mask_fpclass_ph_mask): Likewise.
12148 (_mm256_fpclass_ph_mask): Likewise.
12149 (_mm256_getexp_ph): Likewise.
12150 (_mm256_mask_getexp_ph): Likewise.
12151 (_mm256_maskz_getexp_ph): Likewise.
12152 (_mm_getexp_ph): Likewise.
12153 (_mm_mask_getexp_ph): Likewise.
12154 (_mm_maskz_getexp_ph): Likewise.
12155 (_mm256_getmant_ph): Likewise.
12156 (_mm256_mask_getmant_ph): Likewise.
12157 (_mm256_maskz_getmant_ph): Likewise.
12158 (_mm_getmant_ph): Likewise.
12159 (_mm_mask_getmant_ph): Likewise.
12160 (_mm_maskz_getmant_ph): Likewise.
12161 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12162 * config/i386/i386-builtin.def: Add corresponding new builtins.
12163 * config/i386/i386-expand.c
12164 (ix86_expand_args_builtin): Handle new builtin types.
12165 (ix86_expand_round_builtin): Ditto.
12166 * config/i386/sse.md (vecmemsuffix): Add HF vector modes.
12167 (<avx512>_getexp<mode><mask_name><round_saeonly_name>): Adjust
12168 to support HF vector modes.
12169 (avx512f_sgetexp<mode><mask_scalar_name><round_saeonly_scalar_name):
12171 (avx512dq_fpclass<mode><mask_scalar_merge_name>): Ditto.
12172 (avx512dq_vmfpclass<mode><mask_scalar_merge_name>): Ditto.
12173 (<avx512>_getmant<mode><mask_name><round_saeonly_name>): Ditto.
12174 (avx512f_vgetmant<mode><mask_scalar_name><round_saeonly_scalar_name>):
12177 2021-09-14 liuhongt <hongtao.liu@intel.com>
12179 * config/i386/avx512fp16intrin.h (_mm512_reduce_ph):
12181 (_mm512_mask_reduce_ph): Likewise.
12182 (_mm512_maskz_reduce_ph): Likewise.
12183 (_mm512_reduce_round_ph): Likewise.
12184 (_mm512_mask_reduce_round_ph): Likewise.
12185 (_mm512_maskz_reduce_round_ph): Likewise.
12186 (_mm_reduce_sh): Likewise.
12187 (_mm_mask_reduce_sh): Likewise.
12188 (_mm_maskz_reduce_sh): Likewise.
12189 (_mm_reduce_round_sh): Likewise.
12190 (_mm_mask_reduce_round_sh): Likewise.
12191 (_mm_maskz_reduce_round_sh): Likewise.
12192 (_mm512_roundscale_ph): Likewise.
12193 (_mm512_mask_roundscale_ph): Likewise.
12194 (_mm512_maskz_roundscale_ph): Likewise.
12195 (_mm512_roundscale_round_ph): Likewise.
12196 (_mm512_mask_roundscale_round_ph): Likewise.
12197 (_mm512_maskz_roundscale_round_ph): Likewise.
12198 (_mm_roundscale_sh): Likewise.
12199 (_mm_mask_roundscale_sh): Likewise.
12200 (_mm_maskz_roundscale_sh): Likewise.
12201 (_mm_roundscale_round_sh): Likewise.
12202 (_mm_mask_roundscale_round_sh): Likewise.
12203 (_mm_maskz_roundscale_round_sh): Likewise.
12204 * config/i386/avx512fp16vlintrin.h: (_mm_reduce_ph):
12206 (_mm_mask_reduce_ph): Likewise.
12207 (_mm_maskz_reduce_ph): Likewise.
12208 (_mm256_reduce_ph): Likewise.
12209 (_mm256_mask_reduce_ph): Likewise.
12210 (_mm256_maskz_reduce_ph): Likewise.
12211 (_mm_roundscale_ph): Likewise.
12212 (_mm_mask_roundscale_ph): Likewise.
12213 (_mm_maskz_roundscale_ph): Likewise.
12214 (_mm256_roundscale_ph): Likewise.
12215 (_mm256_mask_roundscale_ph): Likewise.
12216 (_mm256_maskz_roundscale_ph): Likewise.
12217 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12218 * config/i386/i386-builtin.def: Add corresponding new builtins.
12219 * config/i386/i386-expand.c
12220 (ix86_expand_args_builtin): Handle new builtin types.
12221 (ix86_expand_round_builtin): Ditto.
12222 * config/i386/sse.md (<mask_codefor>reducep<mode><mask_name>):
12224 (<mask_codefor>reducep<mode><mask_name><round_saeonly_name>):
12225 ... this, and adjust for round operands.
12226 (reduces<mode><mask_scalar_name>): Likewise, with ...
12227 (reduces<mode><mask_scalar_name><round_saeonly_scalar_name):
12229 (<avx512>_rndscale<mode><mask_name><round_saeonly_name>):
12230 Adjust for HF vector modes.
12231 (avx512f_rndscale<mode><mask_scalar_name><round_saeonly_scalar_name>):
12233 (*avx512f_rndscale<mode><round_saeonly_name>): Ditto.
12235 2021-09-14 liuhongt <hongtao.liu@intel.com>
12237 * config/i386/avx512fp16intrin.h: (_mm512_rcp_ph):
12239 (_mm512_mask_rcp_ph): Likewise.
12240 (_mm512_maskz_rcp_ph): Likewise.
12241 (_mm_rcp_sh): Likewise.
12242 (_mm_mask_rcp_sh): Likewise.
12243 (_mm_maskz_rcp_sh): Likewise.
12244 (_mm512_scalef_ph): Likewise.
12245 (_mm512_mask_scalef_ph): Likewise.
12246 (_mm512_maskz_scalef_ph): Likewise.
12247 (_mm512_scalef_round_ph): Likewise.
12248 (_mm512_mask_scalef_round_ph): Likewise.
12249 (_mm512_maskz_scalef_round_ph): Likewise.
12250 (_mm_scalef_sh): Likewise.
12251 (_mm_mask_scalef_sh): Likewise.
12252 (_mm_maskz_scalef_sh): Likewise.
12253 (_mm_scalef_round_sh): Likewise.
12254 (_mm_mask_scalef_round_sh): Likewise.
12255 (_mm_maskz_scalef_round_sh): Likewise.
12256 * config/i386/avx512fp16vlintrin.h (_mm_rcp_ph):
12258 (_mm256_rcp_ph): Likewise.
12259 (_mm_mask_rcp_ph): Likewise.
12260 (_mm256_mask_rcp_ph): Likewise.
12261 (_mm_maskz_rcp_ph): Likewise.
12262 (_mm256_maskz_rcp_ph): Likewise.
12263 (_mm_scalef_ph): Likewise.
12264 (_mm256_scalef_ph): Likewise.
12265 (_mm_mask_scalef_ph): Likewise.
12266 (_mm256_mask_scalef_ph): Likewise.
12267 (_mm_maskz_scalef_ph): Likewise.
12268 (_mm256_maskz_scalef_ph): Likewise.
12269 * config/i386/i386-builtin.def: Add new builtins.
12270 * config/i386/sse.md (VFH_AVX512VL): New.
12271 (avx512fp16_rcp<mode>2<mask_name>): Ditto.
12272 (avx512fp16_vmrcpv8hf2<mask_scalar_name>): Ditto.
12273 (avx512f_vmscalef<mode><mask_scalar_name><round_scalar_name>):
12274 Adjust to support HF vector modes.
12275 (<avx512>_scalef<mode><mask_name><round_name>): Ditto.
12277 2021-09-14 liuhongt <hongtao.liu@intel.com>
12279 * config/i386/avx512fp16intrin.h: (_mm512_sqrt_ph):
12281 (_mm512_mask_sqrt_ph): Likewise.
12282 (_mm512_maskz_sqrt_ph): Likewise.
12283 (_mm512_sqrt_round_ph): Likewise.
12284 (_mm512_mask_sqrt_round_ph): Likewise.
12285 (_mm512_maskz_sqrt_round_ph): Likewise.
12286 (_mm512_rsqrt_ph): Likewise.
12287 (_mm512_mask_rsqrt_ph): Likewise.
12288 (_mm512_maskz_rsqrt_ph): Likewise.
12289 (_mm_rsqrt_sh): Likewise.
12290 (_mm_mask_rsqrt_sh): Likewise.
12291 (_mm_maskz_rsqrt_sh): Likewise.
12292 (_mm_sqrt_sh): Likewise.
12293 (_mm_mask_sqrt_sh): Likewise.
12294 (_mm_maskz_sqrt_sh): Likewise.
12295 (_mm_sqrt_round_sh): Likewise.
12296 (_mm_mask_sqrt_round_sh): Likewise.
12297 (_mm_maskz_sqrt_round_sh): Likewise.
12298 * config/i386/avx512fp16vlintrin.h (_mm_sqrt_ph): New intrinsic.
12299 (_mm256_sqrt_ph): Likewise.
12300 (_mm_mask_sqrt_ph): Likewise.
12301 (_mm256_mask_sqrt_ph): Likewise.
12302 (_mm_maskz_sqrt_ph): Likewise.
12303 (_mm256_maskz_sqrt_ph): Likewise.
12304 (_mm_rsqrt_ph): Likewise.
12305 (_mm256_rsqrt_ph): Likewise.
12306 (_mm_mask_rsqrt_ph): Likewise.
12307 (_mm256_mask_rsqrt_ph): Likewise.
12308 (_mm_maskz_rsqrt_ph): Likewise.
12309 (_mm256_maskz_rsqrt_ph): Likewise.
12310 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12311 * config/i386/i386-builtin.def: Add corresponding new builtins.
12312 * config/i386/i386-expand.c
12313 (ix86_expand_args_builtin): Handle new builtins.
12314 (ix86_expand_round_builtin): Ditto.
12315 * config/i386/sse.md (VF_AVX512FP16VL): New.
12316 (sqrt<mode>2): Adjust for HF vector modes.
12317 (<sse>_sqrt<mode>2<mask_name><round_name>): Likewise.
12318 (<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
12320 (<sse>_rsqrt<mode>2<mask_name>): New.
12321 (avx512fp16_vmrsqrtv8hf2<mask_scalar_name>): Likewise.
12323 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
12325 PR bootstrap/101574
12326 * diagnostic-spec.c (warning_suppressed_at, copy_warning): Handle
12327 'RESERVED_LOCATION_P' locations.
12328 * warning-control.cc (get_nowarn_spec, suppress_warning)
12329 (copy_warning): Likewise.
12331 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
12333 * diagnostic-spec.h (typedef xint_hash_t): Use 'location_t' instead of...
12334 (typedef key_type_t): ... this. Remove.
12335 (nowarn_map): Document.
12336 * diagnostic-spec.c (nowarn_map): Likewise.
12337 * warning-control.cc (convert_to_key): Evolve functions into...
12338 (get_location): ... these. Adjust all users.
12340 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
12342 * warning-control.cc (copy_warning): Remove 'nowarn_map' setup.
12344 2021-09-13 Jason Merrill <jason@redhat.com>
12346 * params.opt: Add destructive-interference-size and
12347 constructive-interference-size.
12348 * doc/invoke.texi: Document them.
12349 * config/aarch64/aarch64.c (aarch64_override_options_internal):
12351 * config/arm/arm.c (arm_option_override): Set them.
12352 * config/i386/i386-options.c (ix86_option_override_internal):
12355 2021-09-13 Martin Liska <mliska@suse.cz>
12356 H.J. Lu <hjl.tools@gmail.com>
12359 * common/config/i386/cpuinfo.h (cpu_indicator_init): Add support
12360 for x86-64 micro levels for __builtin_cpu_supports.
12361 * common/config/i386/i386-cpuinfo.h (enum feature_priority):
12362 Add priorities for the micro-arch levels.
12363 (enum processor_features): Add new features.
12364 * common/config/i386/i386-isas.h: Add micro-arch features.
12365 * config/i386/i386-builtins.c (get_builtin_code_for_version):
12366 Support the micro-arch levels by callsing
12367 __builtin_cpu_supports.
12368 * doc/extend.texi: Document that the levels are support by
12369 __builtin_cpu_supports.
12371 2021-09-13 Andrew Pinski <apinski@marvell.com>
12374 * config/aarch64/aarch64-builtins.c (aarch64_fold_builtin_lane_check):
12376 (aarch64_general_fold_builtin): Handle AARCH64_SIMD_BUILTIN_LANE_CHECK.
12377 (aarch64_general_gimple_fold_builtin): Likewise.
12379 2021-09-13 Andrew Pinski <apinski@marvell.com>
12381 * config.gcc: Add m32r-*-linux* and m32rle-*-linux*
12382 to the Unsupported targets list.
12383 Remove support for m32r-*-linux* and m32rle-*-linux*.
12384 * config/m32r/linux.h: Removed.
12385 * config/m32r/t-linux: Removed.
12387 2021-09-13 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
12390 * config/aarch64/aarch64.c (aarch64_classify_address): Don't allow
12391 register index for SVE predicate modes.
12393 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
12395 * tree-ssa-threadbackward.c
12396 (back_threader_profitability::profitable_path_p): Remove FSM
12398 (back_threader_registry::register_path): Same.
12399 * tree-ssa-threadedge.c
12400 (jump_threader::simplify_control_stmt_condition): Same.
12401 * tree-ssa-threadupdate.c (jt_path_registry::jt_path_registry):
12402 Add backedge_threads argument.
12403 (fwd_jt_path_registry::fwd_jt_path_registry): Pass
12404 backedge_threads argument.
12405 (back_jt_path_registry::back_jt_path_registry): Same.
12406 (dump_jump_thread_path): Adjust for FSM removal.
12407 (back_jt_path_registry::rewire_first_differing_edge): Same.
12408 (back_jt_path_registry::adjust_paths_after_duplication): Same.
12409 (back_jt_path_registry::update_cfg): Same.
12410 (jt_path_registry::register_jump_thread): Same.
12411 * tree-ssa-threadupdate.h (enum jump_thread_edge_type): Remove
12413 (class back_jt_path_registry): Add backedge_threads to
12416 2021-09-13 Martin Liska <mliska@suse.cz>
12419 * asan.h (sanitize_coverage_p): Handle when fn == NULL.
12421 2021-09-13 H.J. Lu <hjl.tools@gmail.com>
12424 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): New.
12425 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
12426 (MOVE_MAX): Check TARGET_AVX256_MOVE_BY_PIECES and
12427 TARGET_AVX256_STORE_BY_PIECES instead of
12428 TARGET_AVX256_SPLIT_UNALIGNED_LOAD and
12429 TARGET_AVX256_SPLIT_UNALIGNED_STORE.
12430 (STORE_MAX_PIECES): Check TARGET_AVX256_STORE_BY_PIECES instead
12431 of TARGET_AVX256_SPLIT_UNALIGNED_STORE.
12432 * config/i386/x86-tune.def (X86_TUNE_AVX256_MOVE_BY_PIECES): New.
12433 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
12435 2021-09-13 liuhongt <hongtao.liu@intel.com>
12437 PR bootstrap/102302
12438 * expmed.c (extract_bit_field_using_extv): Use
12439 gen_lowpart_if_possible instead of gen_lowpart to avoid ICE.
12441 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
12443 * Makefile.in (OBJS): Add value-pointer-equiv.o.
12444 * gimple-ssa-evrp.c (class ssa_equiv_stack): Move to
12445 value-pointer-equiv.*.
12446 (ssa_equiv_stack::ssa_equiv_stack): Same.
12447 (ssa_equiv_stack::enter): Same.
12448 (ssa_equiv_stack::leave): Same.
12449 (ssa_equiv_stack::push_replacement): Same.
12450 (ssa_equiv_stack::get_replacement): Same.
12451 (is_pointer_ssa): Same.
12452 (class pointer_equiv_analyzer): Same.
12453 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
12454 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Same.
12455 (pointer_equiv_analyzer::set_global_equiv): Same.
12456 (pointer_equiv_analyzer::set_cond_equiv): Same.
12457 (pointer_equiv_analyzer::get_equiv): Same.
12458 (pointer_equiv_analyzer::enter): Same.
12459 (pointer_equiv_analyzer::leave): Same.
12460 (pointer_equiv_analyzer::get_equiv_expr): Same.
12461 (pta_valueize): Same.
12462 (pointer_equiv_analyzer::visit_stmt): Same.
12463 (pointer_equiv_analyzer::visit_edge): Same.
12464 (hybrid_folder::value_of_expr): Same.
12465 (hybrid_folder::value_on_edge): Same.
12466 * value-pointer-equiv.cc: New file.
12467 * value-pointer-equiv.h: New file.
12469 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
12472 * gimple-fold.c (gimple_fold_builtin_memory_op): Allow folding
12473 memcpy if the size is not more than MOVE_MAX * MOVE_RATIO.
12475 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
12478 * config/arm/arm.md (movmisaligndi): New define_expand.
12479 * config/arm/vec-common.md (movmisalign<mode>): Iterate over VDQ mode.
12481 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
12484 * emit-rtl.c (gen_highpart): Use adjust_address to handle
12485 MEM rather than calling simplify_gen_subreg.
12487 2021-09-13 Jan-Benedict Glaw <jbglaw@ług-owl.de>
12489 * config/alpha/vms.h (INIT_CUMULATIVE_ARGS): Wrap multi-statment
12490 define into a block.
12492 2021-09-13 Richard Biener <rguenther@suse.de>
12494 * config/darwin.h (DARWIN_PREFER_DWARF): Do not define.
12495 * config/i386/darwin.h (PREFERRED_DEBUGGING_TYPE): Do not
12496 change based on DARWIN_PREFER_DWARF not being defined.
12498 2021-09-13 Richard Biener <rguenther@suse.de>
12500 * config/i386/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
12501 to inherit from elfos.h
12503 2021-09-13 Richard Biener <rguenther@suse.de>
12505 * config.gcc: Add cr16-*-* to the list of obsoleted targets.
12507 2021-09-13 Richard Biener <rguenther@suse.de>
12509 * config/avr/elf.h (PREFERRED_DEBUGGING_TYPE): Remove
12510 override, pick up DWARF2_DEBUG define from elfos.h
12512 2021-09-13 Richard Biener <rguenther@suse.de>
12514 * config/rx/rx.h (PREFERRED_DEBUGGING_TYPE): Always define to
12517 2021-09-13 Richard Biener <rguenther@suse.de>
12519 * config/alpha/vms.h (PREFERRED_DEBUGGING_TYPE): Define to
12522 2021-09-13 Richard Biener <rguenther@suse.de>
12524 * config/i386/cygming.h: Always default to DWARF2 debugging.
12525 Do not define DBX_DEBUGGING_INFO, that's done via dbxcoff.h
12527 * doc/install.texi: Document binutils 2.16 as minimum
12528 requirement for mingw.
12530 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
12532 * config/rs6000/rs6000.c (struct rs6000_cost_data): New members
12533 nstmts, nloads and extra_ctor_cost.
12534 (rs6000_density_test): Add load density related heuristics. Do
12535 extra costing on vector construction statements if need.
12536 (rs6000_init_cost): Init new members.
12537 (rs6000_update_target_cost_per_stmt): New function.
12538 (rs6000_add_stmt_cost): Factor vect_nonmem hunk out to function
12539 rs6000_update_target_cost_per_stmt and call it.
12541 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
12543 * config/rs6000/rs6000.c (struct rs6000_cost_data): Remove typedef.
12544 (rs6000_init_cost): Adjust.
12546 2021-09-13 liuhongt <hongtao.liu@intel.com>
12548 * config/i386/i386.md: (UNSPEC_COPYSIGN): Remove.
12549 (UNSPEC_XORSIGN): Ditto.
12551 2021-09-12 Roger Sayle <roger@nextmovesoftware.com>
12553 * expr.c (convert_move): Preserve SUBREG_PROMOTED_VAR_P when
12554 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
12557 2021-09-11 Aldy Hernandez <aldyh@redhat.com>
12559 * tree-ssa-threadbackward.c (class back_threader_registry): Use
12560 back_jt_path_registry.
12561 * tree-ssa-threadedge.c (jump_threader::jump_threader): Use
12562 fwd_jt_path_registry.
12563 * tree-ssa-threadedge.h (class jump_threader): Same..
12564 * tree-ssa-threadupdate.c
12565 (jump_thread_path_registry::jump_thread_path_registry): Rename...
12566 (jt_path_registry::jt_path_registry): ...to this.
12567 (jump_thread_path_registry::~jump_thread_path_registry): Rename...
12568 (jt_path_registry::~jt_path_registry): ...this.
12569 (fwd_jt_path_registry::fwd_jt_path_registry): New.
12570 (fwd_jt_path_registry::~fwd_jt_path_registry): New.
12571 (jump_thread_path_registry::allocate_thread_edge): Rename...
12572 (jt_path_registry::allocate_thread_edge): ...to this.
12573 (jump_thread_path_registry::allocate_thread_path): Rename...
12574 (jt_path_registry::allocate_thread_path): ...to this.
12575 (jump_thread_path_registry::lookup_redirection_data): Rename...
12576 (fwd_jt_path_registry::lookup_redirection_data): ...to this.
12577 (jump_thread_path_registry::thread_block_1): Rename...
12578 (fwd_jt_path_registry::thread_block_1): ...to this.
12579 (jump_thread_path_registry::thread_block): Rename...
12580 (fwd_jt_path_registry::thread_block): ...to this.
12581 (jt_path_registry::thread_through_loop_header): Rename...
12582 (fwd_jt_path_registry::thread_through_loop_header): ...to this.
12583 (jump_thread_path_registry::mark_threaded_blocks): Rename...
12584 (fwd_jt_path_registry::mark_threaded_blocks): ...to this.
12585 (jump_thread_path_registry::debug_path): Rename...
12586 (jt_path_registry::debug_path): ...to this.
12587 (jump_thread_path_registry::dump): Rename...
12588 (jt_path_registry::debug): ...to this.
12589 (jump_thread_path_registry::rewire_first_differing_edge): Rename...
12590 (back_jt_path_registry::rewire_first_differing_edge): ...to this.
12591 (jump_thread_path_registry::adjust_paths_after_duplication): Rename...
12592 (back_jt_path_registry::adjust_paths_after_duplication): ...to this.
12593 (jump_thread_path_registry::duplicate_thread_path): Rename...
12594 (back_jt_path_registry::duplicate_thread_path): ...to this. Also,
12595 drop ill-formed candidates.
12596 (jump_thread_path_registry::remove_jump_threads_including): Rename...
12597 (fwd_jt_path_registry::remove_jump_threads_including): ...to this.
12598 (jt_path_registry::thread_through_all_blocks): New.
12599 (back_jt_path_registry::update_cfg): New.
12600 (fwd_jt_path_registry::update_cfg): New.
12601 (jump_thread_path_registry::register_jump_thread): Rename...
12602 (jt_path_registry::register_jump_thread): ...to this.
12603 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
12605 (class jt_path_registry): ...here.
12606 (class fwd_jt_path_registry): New.
12607 (class back_jt_path_registry): New.
12609 2021-09-10 liuhongt <hongtao.liu@intel.com>
12612 2021-09-01 liuhongt <hongtao.liu@intel.com>
12614 * emit-rtl.c (validate_subreg): Get rid of all float-int
12617 2021-09-10 Jakub Jelinek <jakub@redhat.com>
12619 * tree-core.h (enum omp_memory_order): Add OMP_MEMORY_ORDER_MASK,
12620 OMP_FAIL_MEMORY_ORDER_UNSPECIFIED, OMP_FAIL_MEMORY_ORDER_RELAXED,
12621 OMP_FAIL_MEMORY_ORDER_ACQUIRE, OMP_FAIL_MEMORY_ORDER_RELEASE,
12622 OMP_FAIL_MEMORY_ORDER_ACQ_REL, OMP_FAIL_MEMORY_ORDER_SEQ_CST and
12623 OMP_FAIL_MEMORY_ORDER_MASK enumerators.
12624 (OMP_FAIL_MEMORY_ORDER_SHIFT): Define.
12625 * gimple-pretty-print.c (dump_gimple_omp_atomic_load,
12626 dump_gimple_omp_atomic_store): Print [weak] for weak atomic
12628 * gimple.h (enum gf_mask): Change GF_OMP_ATOMIC_MEMORY_ORDER
12629 to 6-bit mask, adjust GF_OMP_ATOMIC_NEED_VALUE value and add
12630 GF_OMP_ATOMIC_WEAK.
12631 (gimple_omp_atomic_weak_p, gimple_omp_atomic_set_weak): New inline
12633 * tree.h (OMP_ATOMIC_WEAK): Define.
12634 * tree-pretty-print.c (dump_omp_atomic_memory_order): Adjust for
12635 fail memory order being encoded in the same enum and also print
12636 fail clause if present.
12637 (dump_generic_node): Print weak clause if OMP_ATOMIC_WEAK.
12638 * gimplify.c (goa_stabilize_expr): Add target_expr and rhs arguments,
12639 handle pre_p == NULL case as a test mode that only returns value
12640 but doesn't change gimplify nor change anything otherwise, adjust
12641 recursive calls, add MODIFY_EXPR, ADDR_EXPR, COND_EXPR, TARGET_EXPR
12642 and CALL_EXPR handling, adjust COMPOUND_EXPR handling for
12643 __builtin_clear_padding calls, for !rhs gimplify as lvalue rather
12645 (gimplify_omp_atomic): Adjust goa_stabilize_expr caller. Handle
12646 COND_EXPR rhs. Set weak flag on gimple load/store for
12648 * omp-expand.c (omp_memory_order_to_fail_memmodel): New function.
12649 (omp_memory_order_to_memmodel): Adjust for fail clause encoded
12651 (expand_omp_atomic_cas): New function.
12652 (expand_omp_atomic_pipeline): Use omp_memory_order_to_fail_memmodel
12654 (expand_omp_atomic): Attempt to optimize atomic compare and exchange
12655 using expand_omp_atomic_cas.
12657 2021-09-10 Aldy Hernandez <aldyh@redhat.com>
12658 Michael Matz <matz@suse.de>
12660 * tree-pass.h (PROP_loop_opts_done): New.
12661 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
12662 Intersect with global range.
12663 * tree-ssa-loop.c (tree_ssa_loop_done): Set PROP_loop_opts_done.
12664 * tree-ssa-threadbackward.c
12665 (back_threader_profitability::profitable_path_p): Disable
12666 threading through latches until after loop optimizations have run.
12668 2021-09-10 David Faust <david.faust@oracle.com>
12670 * doc/invoke.texi: Document BPF -mcpu, -mjmpext, -mjmp32 and -malu32
12673 2021-09-10 David Faust <david.faust@oracle.com>
12675 * config/bpf/bpf-opts.h (bpf_isa_version): New enum.
12676 * config/bpf/bpf-protos.h (bpf_expand_cbranch): New.
12677 * config/bpf/bpf.c (bpf_option_override): Handle -mcpu option.
12678 (bpf_expand_cbranch): New function.
12679 * config/bpf/bpf.md (AM mode iterator): Conditionalize support for SI
12681 (zero_extendsidi2): Only use mov32 instruction if it is available.
12682 (SIM mode iterator): Conditionalize support for SI mode.
12683 (JM mode iterator): New.
12684 (cbranchdi4): Update name, use new JM iterator. Use bpf_expand_cbranch.
12685 (*branch_on_di): Update name, use new JM iterator.
12686 * config/bpf/bpf.opt: (mjmpext): New option.
12687 (malu32): Likewise.
12688 (mjmp32): Likewise.
12690 (bpf_isa): New enum.
12692 2021-09-10 David Faust <david.faust@oracle.com>
12694 * config/bpf/bpf.md (zero_extendhidi2): Add new output template
12695 for register-to-register extensions.
12696 (zero_extendqidi2): Likewise.
12698 2021-09-10 Richard Biener <rguenther@suse.de>
12700 PR middle-end/102273
12701 * internal-fn.c (expand_DEFERRED_INIT): Always expand non-SSA vars.
12703 2021-09-10 Richard Biener <rguenther@suse.de>
12705 PR middle-end/102269
12706 * gimplify.c (is_var_need_auto_init): Empty types do not need
12709 2021-09-10 Richard Biener <rguenther@suse.de>
12711 * configure.ac (--with-stabs): Remove.
12712 * configure: Regenerate.
12713 * doc/install.texi: Remove --with-stabs documentation.
12715 2021-09-10 liuhongt <hongtao.liu@intel.com>
12717 * config/i386/avx512fp16intrin.h: (_mm512_cmp_ph_mask):
12719 (_mm512_mask_cmp_ph_mask): Likewise.
12720 (_mm512_cmp_round_ph_mask): Likewise.
12721 (_mm512_mask_cmp_round_ph_mask): Likewise.
12722 (_mm_cmp_sh_mask): Likewise.
12723 (_mm_mask_cmp_sh_mask): Likewise.
12724 (_mm_cmp_round_sh_mask): Likewise.
12725 (_mm_mask_cmp_round_sh_mask): Likewise.
12726 (_mm_comieq_sh): Likewise.
12727 (_mm_comilt_sh): Likewise.
12728 (_mm_comile_sh): Likewise.
12729 (_mm_comigt_sh): Likewise.
12730 (_mm_comige_sh): Likewise.
12731 (_mm_comineq_sh): Likewise.
12732 (_mm_ucomieq_sh): Likewise.
12733 (_mm_ucomilt_sh): Likewise.
12734 (_mm_ucomile_sh): Likewise.
12735 (_mm_ucomigt_sh): Likewise.
12736 (_mm_ucomige_sh): Likewise.
12737 (_mm_ucomineq_sh): Likewise.
12738 (_mm_comi_round_sh): Likewise.
12739 (_mm_comi_sh): Likewise.
12740 * config/i386/avx512fp16vlintrin.h (_mm_cmp_ph_mask): New intrinsic.
12741 (_mm_mask_cmp_ph_mask): Likewise.
12742 (_mm256_cmp_ph_mask): Likewise.
12743 (_mm256_mask_cmp_ph_mask): Likewise.
12744 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12745 * config/i386/i386-builtin.def: Add corresponding new builtins.
12746 * config/i386/i386-expand.c
12747 (ix86_expand_args_builtin): Handle new builtin types.
12748 (ix86_expand_round_builtin): Ditto.
12749 * config/i386/i386.md (ssevecmode): Add HF mode.
12750 (MODEFH): New mode iterator.
12751 * config/i386/sse.md
12752 (V48H_AVX512VL): New mode iterator to support HF vector modes.
12753 Ajdust corresponding description.
12754 (ssecmpintprefix): New.
12755 (VI12_AVX512VL): Adjust to support HF vector modes.
12756 (cmp_imm_predicate): Likewise.
12757 (<avx512>_cmp<mode>3<mask_scalar_merge_name><round_saeonly_name>):
12759 (avx512f_vmcmp<mode>3<round_saeonly_name>): Likewise.
12760 (avx512f_vmcmp<mode>3_mask<round_saeonly_name>): Likewise.
12761 (<sse>_<unord>comi<round_saeonly_name>): Likewise.
12763 2021-09-10 liuhongt <hongtao.liu@intel.com>
12765 * config/i386/avx512fp16intrin.h: (_mm512_max_ph): New intrinsic.
12766 (_mm512_mask_max_ph): Likewise.
12767 (_mm512_maskz_max_ph): Likewise.
12768 (_mm512_min_ph): Likewise.
12769 (_mm512_mask_min_ph): Likewise.
12770 (_mm512_maskz_min_ph): Likewise.
12771 (_mm512_max_round_ph): Likewise.
12772 (_mm512_mask_max_round_ph): Likewise.
12773 (_mm512_maskz_max_round_ph): Likewise.
12774 (_mm512_min_round_ph): Likewise.
12775 (_mm512_mask_min_round_ph): Likewise.
12776 (_mm512_maskz_min_round_ph): Likewise.
12777 (_mm_max_sh): Likewise.
12778 (_mm_mask_max_sh): Likewise.
12779 (_mm_maskz_max_sh): Likewise.
12780 (_mm_min_sh): Likewise.
12781 (_mm_mask_min_sh): Likewise.
12782 (_mm_maskz_min_sh): Likewise.
12783 (_mm_max_round_sh): Likewise.
12784 (_mm_mask_max_round_sh): Likewise.
12785 (_mm_maskz_max_round_sh): Likewise.
12786 (_mm_min_round_sh): Likewise.
12787 (_mm_mask_min_round_sh): Likewise.
12788 (_mm_maskz_min_round_sh): Likewise.
12789 * config/i386/avx512fp16vlintrin.h (_mm_max_ph): New intrinsic.
12790 (_mm256_max_ph): Likewise.
12791 (_mm_mask_max_ph): Likewise.
12792 (_mm256_mask_max_ph): Likewise.
12793 (_mm_maskz_max_ph): Likewise.
12794 (_mm256_maskz_max_ph): Likewise.
12795 (_mm_min_ph): Likewise.
12796 (_mm256_min_ph): Likewise.
12797 (_mm_mask_min_ph): Likewise.
12798 (_mm256_mask_min_ph): Likewise.
12799 (_mm_maskz_min_ph): Likewise.
12800 (_mm256_maskz_min_ph): Likewise.
12801 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12802 * config/i386/i386-builtin.def: Add corresponding new builtins.
12803 * config/i386/i386-expand.c
12804 (ix86_expand_args_builtin): Handle new builtin types.
12805 * config/i386/sse.md
12806 (<code><mode>3<mask_name><round_saeonly_name>): Adjust to
12807 support HF vector modes.
12808 (*<code><mode>3<mask_name><round_saeonly_name>): Likewise.
12809 (ieee_<ieee_maxmin><mode>3<mask_name><round_saeonly_name>):
12811 (<sse>_vm<code><mode>3<mask_scalar_name><round_saeonly_scalar_name>):
12813 * config/i386/subst.md (round_saeonly_mode512bit_condition):
12814 Adjust for HF vector modes.
12816 2021-09-10 Liu, Hongtao <hongtao.liu@intel.com>
12818 * config/i386/avx512fp16intrin.h (_mm_add_sh): New intrinsic.
12819 (_mm_mask_add_sh): Likewise.
12820 (_mm_maskz_add_sh): Likewise.
12821 (_mm_sub_sh): Likewise.
12822 (_mm_mask_sub_sh): Likewise.
12823 (_mm_maskz_sub_sh): Likewise.
12824 (_mm_mul_sh): Likewise.
12825 (_mm_mask_mul_sh): Likewise.
12826 (_mm_maskz_mul_sh): Likewise.
12827 (_mm_div_sh): Likewise.
12828 (_mm_mask_div_sh): Likewise.
12829 (_mm_maskz_div_sh): Likewise.
12830 (_mm_add_round_sh): Likewise.
12831 (_mm_mask_add_round_sh): Likewise.
12832 (_mm_maskz_add_round_sh): Likewise.
12833 (_mm_sub_round_sh): Likewise.
12834 (_mm_mask_sub_round_sh): Likewise.
12835 (_mm_maskz_sub_round_sh): Likewise.
12836 (_mm_mul_round_sh): Likewise.
12837 (_mm_mask_mul_round_sh): Likewise.
12838 (_mm_maskz_mul_round_sh): Likewise.
12839 (_mm_div_round_sh): Likewise.
12840 (_mm_mask_div_round_sh): Likewise.
12841 (_mm_maskz_div_round_sh): Likewise.
12842 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12843 * config/i386/i386-builtin.def: Add corresponding new builtins.
12844 * config/i386/i386-expand.c
12845 (ix86_expand_round_builtin): Handle new builtins.
12846 * config/i386/sse.md (VF_128): Change description.
12847 (<sse>_vm<plusminus_insn><mode>3<mask_scalar_name><round_scalar_name>):
12848 Adjust to support HF vector modes.
12849 (<sse>_vm<multdiv_mnemonic><mode>3<mask_scalar_name><round_scalar_name>):
12852 2021-09-10 H.J. Lu <hjl.tools@gmail.com>
12854 * config/i386/i386-expand.c
12855 (ix86_avx256_split_vector_move_misalign): Handle V16HF mode.
12856 * config/i386/i386.c
12857 (ix86_preferred_simd_mode): Handle HF mode.
12858 * config/i386/sse.md (V_256H): New mode iterator.
12859 (avx_vextractf128<mode>): Use it.
12860 (VEC_INIT_MODE): Align vector HFmode condition to vector
12861 HImodes since there're no real HF instruction used.
12862 (VEC_INIT_HALF_MODE): Ditto.
12864 (VIHF_AVX512BW): Ditto.
12865 (*vec_extracthf): Ditto.
12866 (VEC_EXTRACT_MODE): Ditto.
12868 2021-09-10 Richard Biener <rguenther@suse.de>
12871 * config/dbx.h: Remove.
12872 * config/dbxcoff.h: Do not define PREFERRED_DEBUGGING_TYPE.
12873 * config/lynx.h: Likewise.
12875 2021-09-10 liuhongt <hongtao.liu@intel.com>
12877 * config/i386/i386-expand.c (ix86_expand_copysign): Expand
12878 right into ANDNOT + AND + IOR, using paradoxical subregs.
12879 (ix86_split_copysign_const): Remove.
12880 (ix86_split_copysign_var): Ditto.
12881 * config/i386/i386-protos.h (ix86_split_copysign_const): Dotto.
12882 (ix86_split_copysign_var): Ditto.
12883 * config/i386/i386.md (@copysign<mode>3_const): Ditto.
12884 (@copysign<mode>3_var): Ditto.
12886 2021-09-09 qing zhao <qing.zhao@oracle.com>
12888 * builtins.c (expand_builtin_memset): Make external visible.
12889 * builtins.h (expand_builtin_memset): Declare extern.
12890 * common.opt (ftrivial-auto-var-init=): New option.
12891 * doc/extend.texi: Document the uninitialized attribute.
12892 * doc/invoke.texi: Document -ftrivial-auto-var-init.
12893 * flag-types.h (enum auto_init_type): New enumerated type
12895 * gimple-fold.c (clear_padding_type): Add one new parameter.
12896 (clear_padding_union): Likewise.
12897 (clear_padding_emit_loop): Likewise.
12898 (clear_type_padding_in_mask): Likewise.
12899 (gimple_fold_builtin_clear_padding): Handle this new parameter.
12900 * gimplify.c (gimple_add_init_for_auto_var): New function.
12901 (gimple_add_padding_init_for_auto_var): New function.
12902 (is_var_need_auto_init): New function.
12903 (gimplify_decl_expr): Add initialization to automatic variables per
12905 (gimplify_call_expr): Add one new parameter for call to
12906 __builtin_clear_padding.
12907 (gimplify_init_constructor): Add padding initialization in the end.
12908 * internal-fn.c (INIT_PATTERN_VALUE): New macro.
12909 (expand_DEFERRED_INIT): New function.
12910 * internal-fn.def (DEFERRED_INIT): New internal function.
12911 * tree-cfg.c (verify_gimple_call): Verify calls to .DEFERRED_INIT.
12912 * tree-sra.c (generate_subtree_deferred_init): New function.
12913 (scan_function): Avoid setting cannot_scalarize_away_bitmap for
12914 calls to .DEFERRED_INIT.
12915 (sra_modify_deferred_init): New function.
12916 (sra_modify_function_body): Handle calls to DEFERRED_INIT specially.
12917 * tree-ssa-structalias.c (find_func_aliases_for_call): Likewise.
12918 * tree-ssa-uninit.c (warn_uninit): Handle calls to DEFERRED_INIT
12920 (check_defs): Likewise.
12921 (warn_uninitialized_vars): Likewise.
12922 * tree-ssa.c (ssa_undefined_value_p): Likewise.
12923 * tree.c (build_common_builtin_nodes): Build tree node for
12924 BUILT_IN_CLEAR_PADDING when needed.
12926 2021-09-09 Richard Biener <rguenther@suse.de>
12928 * tree-ssa-loop-im.c (fill_always_executed_in_1): Walk
12931 2021-09-09 Richard Biener <rguenther@suse.de>
12933 * tree-ssa-loop-im.c (fill_always_executed_in_1): Integrate
12934 DOM walk from get_loop_body_in_dom_order using a worklist
12937 2021-09-09 liuhongt <hongtao.liu@intel.com>
12939 * config.gcc: Add avx512fp16vlintrin.h.
12940 * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic.
12941 (_mm512_mask_add_ph): Likewise.
12942 (_mm512_maskz_add_ph): Likewise.
12943 (_mm512_sub_ph): Likewise.
12944 (_mm512_mask_sub_ph): Likewise.
12945 (_mm512_maskz_sub_ph): Likewise.
12946 (_mm512_mul_ph): Likewise.
12947 (_mm512_mask_mul_ph): Likewise.
12948 (_mm512_maskz_mul_ph): Likewise.
12949 (_mm512_div_ph): Likewise.
12950 (_mm512_mask_div_ph): Likewise.
12951 (_mm512_maskz_div_ph): Likewise.
12952 (_mm512_add_round_ph): Likewise.
12953 (_mm512_mask_add_round_ph): Likewise.
12954 (_mm512_maskz_add_round_ph): Likewise.
12955 (_mm512_sub_round_ph): Likewise.
12956 (_mm512_mask_sub_round_ph): Likewise.
12957 (_mm512_maskz_sub_round_ph): Likewise.
12958 (_mm512_mul_round_ph): Likewise.
12959 (_mm512_mask_mul_round_ph): Likewise.
12960 (_mm512_maskz_mul_round_ph): Likewise.
12961 (_mm512_div_round_ph): Likewise.
12962 (_mm512_mask_div_round_ph): Likewise.
12963 (_mm512_maskz_div_round_ph): Likewise.
12964 * config/i386/avx512fp16vlintrin.h: New header.
12965 * config/i386/i386-builtin-types.def (V16HF, V8HF, V32HF):
12966 Add new builtin types.
12967 * config/i386/i386-builtin.def: Add corresponding builtins.
12968 * config/i386/i386-expand.c
12969 (ix86_expand_args_builtin): Handle new builtin types.
12970 (ix86_expand_round_builtin): Likewise.
12971 * config/i386/immintrin.h: Include avx512fp16vlintrin.h
12972 * config/i386/sse.md (VFH): New mode_iterator.
12974 (avx512fmaskmode): Add HF vector modes.
12975 (avx512fmaskhalfmode): Likewise.
12976 (<plusminus_insn><mode>3<mask_name><round_name>): Adjust to for
12978 (*<plusminus_insn><mode>3<mask_name><round_name>): Likewise.
12979 (mul<mode>3<mask_name><round_name>): Likewise.
12980 (*mul<mode>3<mask_name><round_name>): Likewise.
12981 (div<mode>3): Likewise.
12982 (<sse>_div<mode>3<mask_name><round_name>): Likewise.
12983 * config/i386/subst.md (SUBST_V): Add HF vector modes.
12984 (SUBST_A): Likewise.
12985 (round_mode512bit_condition): Adjust for V32HFmode.
12987 2021-09-09 liuhongt <hongtao.liu@intel.com>
12990 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
12991 (reduc_plus_scal_v4sf): .. this, New define_expand.
12992 (reduc_plus_scal_v2df): .. and this, New define_expand.
12994 2021-09-09 liuhongt <hongtao.liu@intel.com>
12997 * config/i386/sse.md (*vec_extract<mode><ssescalarmodelower>_valign):
13000 2021-09-08 Jonathan Wakely <jwakely@redhat.com>
13003 * doc/trouble.texi (Copy Assignment): Fix description of
13004 behaviour and fix code in example.
13006 2021-09-08 Segher Boessenkool <segher@kernel.crashing.org>
13009 * config/rs6000/rs6000-logue.c (rs6000_emit_epilogue): For ELFv2 use
13010 r11 instead of r12 for restoring CR.
13012 2021-09-08 Jakub Jelinek <jakub@redhat.com>
13013 liuhongt <hongtao.liu@intel.com>
13016 * config/i386/i386.md (@xorsign<mode>3_1): Remove.
13017 * config/i386/i386-expand.c (ix86_expand_xorsign): Expand right away
13018 into AND with mask and XOR, using paradoxical subregs.
13019 (ix86_split_xorsign): Remove.
13020 * config/i386/i386-protos.h (ix86_split_xorsign): Remove.
13022 2021-09-08 Di Zhao <dizhao@os.amperecomputing.com>
13024 * tree-ssa-sccvn.c (vn_nary_op_insert_into): fix result compare
13026 2021-09-08 Jakub Jelinek <jakub@redhat.com>
13029 * config/i386/i386.md (xorsign<mode>3): If operands[1] is equal to
13030 operands[2], emit abs<mode>2 instead.
13031 (@xorsign<mode>3_1): Add early-clobbers for output operand, enable
13032 first alternative even for avx, add another alternative with
13033 =&Yv <- 0, Yv, Yvm constraints.
13034 * config/i386/i386-expand.c (ix86_split_xorsign): If op0 is equal
13035 to op1, emit vpandn instead.
13037 2021-09-08 liuhongt <hongtao.liu@intel.com>
13039 * config/i386/avx512fp16intrin.h (_mm_set_ph): New intrinsic.
13040 (_mm256_set_ph): Likewise.
13041 (_mm512_set_ph): Likewise.
13042 (_mm_setr_ph): Likewise.
13043 (_mm256_setr_ph): Likewise.
13044 (_mm512_setr_ph): Likewise.
13045 (_mm_set1_ph): Likewise.
13046 (_mm256_set1_ph): Likewise.
13047 (_mm512_set1_ph): Likewise.
13048 (_mm_setzero_ph): Likewise.
13049 (_mm256_setzero_ph): Likewise.
13050 (_mm512_setzero_ph): Likewise.
13051 (_mm_set_sh): Likewise.
13052 (_mm_load_sh): Likewise.
13053 (_mm_store_sh): Likewise.
13054 * config/i386/i386-builtin-types.def (V8HF): New type.
13055 (DEF_FUNCTION_TYPE (V8HF, V8HI)): New builtin function type
13056 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
13057 Support vector HFmodes.
13058 (ix86_expand_vector_init_one_nonzero): Likewise.
13059 (ix86_expand_vector_init_one_var): Likewise.
13060 (ix86_expand_vector_init_interleave): Likewise.
13061 (ix86_expand_vector_init_general): Likewise.
13062 (ix86_expand_vector_set): Likewise.
13063 (ix86_expand_vector_extract): Likewise.
13064 (ix86_expand_vector_init_concat): Likewise.
13065 (ix86_expand_sse_movcc): Handle vector HFmodes.
13066 (ix86_expand_vector_set_var): Ditto.
13067 * config/i386/i386-modes.def: Add HF vector modes in comment.
13068 * config/i386/i386.c (classify_argument): Add HF vector modes.
13069 (ix86_hard_regno_mode_ok): Allow HF vector modes for AVX512FP16.
13070 (ix86_vector_mode_supported_p): Likewise.
13071 (ix86_set_reg_reg_cost): Handle vector HFmode.
13072 (ix86_get_ssemov): Handle vector HFmode.
13073 (function_arg_advance_64): Pass unamed V16HFmode and V32HFmode
13075 (function_arg_advance_32): Pass V8HF/V16HF/V32HF by sse reg for 32bit
13077 (function_arg_advance_32): Ditto.
13078 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): New.
13079 (VALID_AVX256_REG_OR_OI_MODE): Rename to ..
13080 (VALID_AVX256_REG_OR_OI_VHF_MODE): .. this, and add V16HF.
13081 (VALID_SSE2_REG_VHF_MODE): New.
13082 (VALID_AVX512VL_128_REG_MODE): Add V8HF and TImode.
13083 (SSE_REG_MODE_P): Add vector HFmode.
13084 * config/i386/i386.md (mode): Add HF vector modes.
13085 (MODE_SIZE): Likewise.
13086 (ssemodesuffix): Add ph suffix for HF vector modes.
13087 * config/i386/sse.md (VFH_128): New mode iterator.
13088 (VMOVE): Adjust for HF vector modes.
13090 (V_256_512): Likewise.
13091 (avx512): Likewise.
13092 (avx512fmaskmode): Likewise.
13093 (shuffletype): Likewise.
13094 (sseinsnmode): Likewise.
13095 (ssedoublevecmode): Likewise.
13096 (ssehalfvecmode): Likewise.
13097 (ssehalfvecmodelower): Likewise.
13098 (ssePScmode): Likewise.
13099 (ssescalarmode): Likewise.
13100 (ssescalarmodelower): Likewise.
13101 (sseintprefix): Likewise.
13103 (bcstscalarsuff): Likewise.
13104 (xtg_mode): Likewise.
13105 (VI12HF_AVX512VL): New mode_iterator.
13106 (VF_AVX512FP16): Likewise.
13108 (VIHF_256): Likewise.
13109 (VIHF_AVX512BW): Likewise.
13110 (V16_256): Likewise.
13111 (V32_512): Likewise.
13112 (sseintmodesuffix): New mode_attr.
13113 (sse): Add scalar and vector HFmodes.
13114 (ssescalarmode): Add vector HFmode mapping.
13115 (ssescalarmodesuffix): Add sh suffix for HFmode.
13116 (*<sse>_vm<insn><mode>3): Use VFH_128.
13117 (*<sse>_vm<multdiv_mnemonic><mode>3): Likewise.
13118 (*ieee_<ieee_maxmin><mode>3): Likewise.
13119 (<avx512>_blendm<mode>): New define_insn.
13120 (vec_setv8hf): New define_expand.
13121 (vec_set<mode>_0): New define_insn for HF vector set.
13122 (*avx512fp16_movsh): Likewise.
13123 (avx512fp16_movsh): Likewise.
13124 (vec_extract_lo_v32hi): Rename to ...
13125 (vec_extract_lo_<mode>): ... this, and adjust to allow HF
13127 (vec_extract_hi_v32hi): Likewise.
13128 (vec_extract_hi_<mode>): Likewise.
13129 (vec_extract_lo_v16hi): Likewise.
13130 (vec_extract_lo_<mode>): Likewise.
13131 (vec_extract_hi_v16hi): Likewise.
13132 (vec_extract_hi_<mode>): Likewise.
13133 (vec_set_hi_v16hi): Likewise.
13134 (vec_set_hi_<mode>): Likewise.
13135 (vec_set_lo_v16hi): Likewise.
13136 (vec_set_lo_<mode>): Likewise.
13137 (*vec_extract<mode>_0): New define_insn_and_split for HF
13139 (*vec_extracthf): New define_insn.
13140 (VEC_EXTRACT_MODE): Add HF vector modes.
13141 (PINSR_MODE): Add V8HF.
13142 (sse2p4_1): Likewise.
13143 (pinsr_evex_isa): Likewise.
13144 (<sse2p4_1>_pinsr<ssemodesuffix>): Adjust to support
13145 insert for V8HFmode.
13146 (pbroadcast_evex_isa): Add HF vector modes.
13147 (AVX2_VEC_DUP_MODE): Likewise.
13148 (VEC_INIT_MODE): Likewise.
13149 (VEC_INIT_HALF_MODE): Likewise.
13150 (avx2_pbroadcast<mode>): Adjust to support HF vector mode
13152 (avx2_pbroadcast<mode>_1): Likewise.
13153 (<avx512>_vec_dup<mode>_1): Likewise.
13154 (<avx512>_vec_dup<mode><mask_name>): Likewise.
13155 (<mask_codefor><avx512>_vec_dup_gpr<mode><mask_name>):
13158 2021-09-08 Guo, Xuepeng <xuepeng.guo@intel.com>
13159 H.J. Lu <hongjiu.lu@intel.com>
13160 Liu Hongtao <hongtao.liu@intel.com>
13161 Wang Hongyu <hongyu.wang@intel.com>
13162 Xu Dianhong <dianhong.xu@intel.com>
13164 * common/config/i386/cpuinfo.h (get_available_features):
13165 Detect FEATURE_AVX512FP16.
13166 * common/config/i386/i386-common.c
13167 (OPTION_MASK_ISA_AVX512FP16_SET,
13168 OPTION_MASK_ISA_AVX512FP16_UNSET,
13169 OPTION_MASK_ISA2_AVX512FP16_SET,
13170 OPTION_MASK_ISA2_AVX512FP16_UNSET): New.
13171 (OPTION_MASK_ISA2_AVX512BW_UNSET,
13172 OPTION_MASK_ISA2_AVX512BF16_UNSET): Add AVX512FP16.
13173 (ix86_handle_option): Handle -mavx512fp16.
13174 * common/config/i386/i386-cpuinfo.h (enum processor_features):
13175 Add FEATURE_AVX512FP16.
13176 * common/config/i386/i386-isas.h: Add entry for AVX512FP16.
13177 * config.gcc: Add avx512fp16intrin.h.
13178 * config/i386/avx512fp16intrin.h: New intrinsic header.
13179 * config/i386/cpuid.h: Add bit_AVX512FP16.
13180 * config/i386/i386-builtin-types.def: (FLOAT16): New primitive type.
13181 * config/i386/i386-builtins.c: Support _Float16 type for i386
13183 (ix86_register_float16_builtin_type): New function.
13184 (ix86_float16_type_node): New.
13185 * config/i386/i386-c.c (ix86_target_macros_internal): Define
13187 * config/i386/i386-expand.c (ix86_expand_branch): Support
13189 (ix86_prepare_fp_compare_args): Adjust TARGET_SSE_MATH &&
13190 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
13191 (ix86_expand_fp_movcc): Ditto.
13192 * config/i386/i386-isa.def: Add PTA define for AVX512FP16.
13193 * config/i386/i386-options.c (isa2_opts): Add -mavx512fp16.
13194 (ix86_valid_target_attribute_inner_p): Add avx512fp16 attribute.
13195 * config/i386/i386.c (ix86_get_ssemov): Use
13196 vmovdqu16/vmovw/vmovsh for HFmode/HImode scalar or vector.
13197 (ix86_get_excess_precision): Use
13198 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when TARGET_AVX512FP16
13200 (sse_store_index): Use SFmode cost for HFmode cost.
13201 (inline_memory_move_cost): Add HFmode, and perfer SSE cost over
13202 GPR cost for HFmode.
13203 (ix86_hard_regno_mode_ok): Allow HImode in sse register.
13204 (ix86_mangle_type): Add manlging for _Float16 type.
13205 (inline_secondary_memory_needed): No memory is needed for
13206 16bit movement between gpr and sse reg under
13208 (ix86_multiplication_cost): Adjust TARGET_SSE_MATH &&
13209 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
13210 (ix86_division_cost): Ditto.
13211 (ix86_rtx_costs): Ditto.
13212 (ix86_add_stmt_cost): Ditto.
13213 (ix86_optab_supported_p): Ditto.
13214 * config/i386/i386.h (VALID_AVX512F_SCALAR_MODE): Add HFmode.
13215 (SSE_FLOAT_MODE_SSEMATH_OR_HF_P): Add HFmode.
13216 (PTA_SAPPHIRERAPIDS): Add PTA_AVX512FP16.
13217 * config/i386/i386.md (mode): Add HFmode.
13218 (MODE_SIZE): Add HFmode.
13219 (isa): Add avx512fp16.
13220 (enabled): Handle avx512fp16.
13221 (ssemodesuffix): Add sh suffix for HFmode.
13222 (comm): Add mult, div.
13223 (plusminusmultdiv): New code iterator.
13224 (insn): Add mult, div.
13225 (*movhf_internal): Adjust for avx512fp16 instruction.
13226 (*movhi_internal): Ditto.
13227 (*cmpi<unord>hf): New define_insn for HFmode.
13228 (*ieee_s<ieee_maxmin>hf3): Likewise.
13229 (extendhf<mode>2): Likewise.
13230 (trunc<mode>hf2): Likewise.
13231 (float<floatunssuffix><mode>hf2): Likewise.
13232 (*<insn>hf): Likewise.
13233 (cbranchhf4): New expander.
13234 (movhfcc): Likewise.
13235 (<insn>hf3): Likewise.
13236 (mulhf3): Likewise.
13237 (divhf3): Likewise.
13238 * config/i386/i386.opt: Add mavx512fp16.
13239 * config/i386/immintrin.h: Include avx512fp16intrin.h.
13240 * doc/invoke.texi: Add mavx512fp16.
13241 * doc/extend.texi: Add avx512fp16 Usage Notes.
13243 2021-09-08 liuhongt <hongtao.liu@intel.com>
13245 * common.opt: Support -fexcess-precision=16.
13246 * config/aarch64/aarch64.c (aarch64_excess_precision): Return
13247 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when
13248 EXCESS_PRECISION_TYPE_FLOAT16.
13249 * config/arm/arm.c (arm_excess_precision): Ditto.
13250 * config/i386/i386.c (ix86_get_excess_precision): Ditto.
13251 * config/m68k/m68k.c (m68k_excess_precision): Issue an error
13252 when EXCESS_PRECISION_TYPE_FLOAT16.
13253 * config/s390/s390.c (s390_excess_precision): Ditto.
13254 * coretypes.h (enum excess_precision_type): Add
13255 EXCESS_PRECISION_TYPE_FLOAT16.
13256 * doc/tm.texi (TARGET_C_EXCESS_PRECISION): Update documents.
13257 * doc/tm.texi.in (TARGET_C_EXCESS_PRECISION): Ditto.
13258 * doc/extend.texi (Half-Precision): Document
13259 -fexcess-precision=16.
13260 * flag-types.h (enum excess_precision): Add
13261 EXCESS_PRECISION_FLOAT16.
13262 * target.def (excess_precision): Update document.
13263 * tree.c (excess_precision_type): Set excess_precision_type to
13264 EXCESS_PRECISION_FLOAT16 when -fexcess-precision=16.
13266 2021-09-08 liuhongt <hongtao.liu@intel.com>
13268 * doc/extend.texi: (@node Floating Types): Adjust the wording.
13269 (@node Half-Precision): Ditto.
13271 2021-09-07 Takayuki 'January June' Suwa <jjsuwa_sys3175@yahoo.co.jp>
13274 * config/xtensa/xtensa.c (xtensa_emit_move_sequence): Add
13275 'CONST_INT_P (src)' to the condition of the block that tries to
13276 eliminate literal when loading integer contant.
13278 2021-09-07 David Faust <david.faust@oracle.com>
13280 * doc/extend.texi (BPF Type Attributes) New node.
13281 Document new preserve_access_index attribute.
13282 Document new preserve_access_index builtin.
13283 * doc/invoke.texi: Document -mco-re and -mno-co-re options.
13285 2021-09-07 David Faust <david.faust@oracle.com>
13287 * config/bpf/bpf.c: Adjust includes.
13288 (bpf_handle_preserve_access_index_attribute): New function.
13289 (bpf_attribute_table): Use it here.
13290 (bpf_builtins): Add BPF_BUILTIN_PRESERVE_ACCESS_INDEX.
13291 (bpf_option_override): Handle "-mco-re" option.
13292 (bpf_asm_init_sections): New.
13293 (TARGET_ASM_INIT_SECTIONS): Redefine.
13294 (bpf_file_end): New.
13295 (TARGET_ASM_FILE_END): Redefine.
13296 (bpf_init_builtins): Add "__builtin_preserve_access_index".
13297 (bpf_core_compute, bpf_core_get_index): New.
13298 (is_attr_preserve_access): New.
13299 (bpf_expand_builtin): Handle new builtins.
13300 (bpf_core_newdecl, bpf_core_is_maybe_aggregate_access): New.
13301 (bpf_core_walk): New.
13302 (bpf_resolve_overloaded_builtin): New.
13303 (TARGET_RESOLVE_OVERLOADED_BUILTIN): Redefine.
13304 (handle_attr): New.
13305 (pass_bpf_core_attr): New RTL pass.
13306 * config/bpf/bpf-passes.def: New file.
13307 * config/bpf/bpf-protos.h (make_pass_bpf_core_attr): New.
13308 * config/bpf/coreout.c: New file.
13309 * config/bpf/coreout.h: Likewise.
13310 * config/bpf/t-bpf (TM_H): Add $(srcdir)/config/bpf/coreout.h.
13311 (coreout.o): New rule.
13312 (PASSES_EXTRA): Add $(srcdir)/config/bpf/bpf-passes.def.
13313 * config.gcc (bpf): Add coreout.h to extra_headers.
13314 Add coreout.o to extra_objs.
13315 Add $(srcdir)/config/bpf/coreout.c to target_gtfiles.
13317 2021-09-07 David Faust <david.faust@oracle.com>
13319 * btfout.c (get_btf_id): Function is no longer static.
13320 * ctfc.h: Expose it here.
13322 2021-09-07 David Faust <david.faust@oracle.com>
13324 * ctfc.c (ctf_lookup_tree_type): New function.
13325 * ctfc.h: Likewise.
13327 2021-09-07 David Faust <david.faust@oracle.com>
13329 * ctfc.c (ctf_dtd_lookup): Function is no longer static.
13330 * ctfc.h: Analogous change.
13332 2021-09-07 David Faust <david.faust@oracle.com>
13334 * dwarf2out.c (lookup_type_die): Function is no longer static.
13335 * dwarf2out.h: Expose it here.
13337 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
13339 * dwarf2ctf.c (ctf_debug_finalize): Make it static.
13340 (ctf_debug_early_finish): New definition.
13341 (ctf_debug_finish): Likewise.
13342 * dwarf2ctf.h (ctf_debug_finalize): Remove declaration.
13343 (ctf_debug_early_finish): New declaration.
13344 (ctf_debug_finish): Likewise.
13345 * dwarf2out.c (dwarf2out_finish): Invoke ctf_debug_finish.
13346 (dwarf2out_early_finish): Invoke ctf_debug_early_finish.
13348 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
13350 * config/bpf/bpf.c (bpf_option_override): For BPF backend, disable LTO
13351 support when compiling for CO-RE.
13352 * config/bpf/bpf.opt: Add new command line option -mco-re.
13354 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
13356 * flag-types.h (enum debug_info_type): Add new enum
13357 DINFO_TYPE_BTF_WITH_CORE.
13358 (BTF_WITH_CORE_DEBUG): New bitmask.
13359 * flags.h (btf_with_core_debuginfo_p): New declaration.
13360 * opts.c (btf_with_core_debuginfo_p): New definition.
13362 2021-09-07 Jason Merrill <jason@redhat.com>
13364 * tree.h (error_operand_p): Change to inline function.
13366 2021-09-07 Aldy Hernandez <aldyh@redhat.com>
13368 * tree-ssa-threadedge.c (forwarder_block_p): Rename to...
13369 (empty_block_with_phis_p): ...this.
13370 (potentially_threadable_block): Same.
13371 (jump_threader::thread_through_normal_block): Same.
13373 2021-09-07 Eric Botcazou <ebotcazou@adacore.com>
13376 * dwarf2out.c (mark_base_types): New overloaded function.
13377 (dwarf2out_early_finish): Invoke it on the COMDAT type list as well
13378 as the compilation unit, and call move_marked_base_types afterward.
13380 2021-09-07 H.J. Lu <hjl.tools@gmail.com>
13383 * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse):
13385 (ix86_expand_vector_convert_uns_vsivsf): Likewise.
13387 2021-09-07 Richard Biener <rguenther@suse.de>
13389 PR tree-optimization/102226
13390 * tree-vect-loop.c (vect_transform_cycle_phi): Record
13391 the converted value for the epilogue PHI use.
13393 2021-09-07 Martin Liska <mliska@suse.cz>
13395 PR gcov-profile/80223
13396 * ipa-inline.c (can_inline_edge_p): Similarly to sanitizer
13397 options, do not inline when no_profile_instrument_function
13398 attributes are different in early inliner. It's fine to inline
13399 it after PGO instrumentation.
13401 2021-09-07 Richard Biener <rguenther@suse.de>
13403 PR tree-optimization/101555
13404 * tree-ssa-pre.c (translate_vuse_through_block): Do not
13405 perform an alias walk to determine the validity of the
13406 mem at the start of the block which is already guaranteed
13407 by means of prune_clobbered_mems.
13408 (phi_translate_1): Pass edge to translate_vuse_through_block.
13410 2021-09-07 Xionghu Luo <luoxhu@linux.ibm.com>
13413 * config/rs6000/rs6000.md (fmod<mode>3): New define_expand.
13414 (remainder<mode>3): Likewise.
13416 2021-09-07 YunQiang Su <yunqiang.su@cipunited.com>
13418 * config/mips/mips.c (mips_file_start): add .module for
13421 2021-09-06 Roger Sayle <roger@nextmovesoftware.com>
13423 * wide-int.cc (wi::clz): Reorder tests to ensure the result
13424 is zero for all negative values.
13426 2021-09-06 Tobias Burnus <tobias@codesourcery.com>
13428 * doc/invoke.texi (-foffload-options): Fix @opindex.
13430 2021-09-06 H.J. Lu <hjl.tools@gmail.com>
13433 * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2].
13434 * config/i386/i386.md (@xorsign<mode>3_1): Add non-destructive
13435 source alternative for AVX.
13437 2021-09-06 liuhongt <hongtao.liu@intel.com>
13439 PR middle-end/102182
13440 * optabs.c (expand_fix): Add from1 to avoid from being
13443 2021-09-06 Eric Botcazou <ebotcazou@adacore.com>
13445 * dwarf2out.c (modified_type_die): Deal with all array types earlier
13446 and use local variable consistently throughout the function.
13448 2021-09-06 Jakub Jelinek <jakub@redhat.com>
13450 PR tree-optimization/102207
13451 * match.pd: Don't demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
13452 were promoted from signed to wider unsigned type.
13454 2021-09-06 Andrew Pinski <apinski@marvell.com>
13456 PR tree-optimization/63184
13457 * match.pd: Add simplification of pointer_diff of two pointer_plus
13458 with addr_expr in the first operand of each pointer_plus.
13459 Add simplificatoin of ne/eq of two pointer_plus with addr_expr
13460 in the first operand of each pointer_plus.
13462 2021-09-06 Richard Biener <rguenther@suse.de>
13464 PR tree-optimization/102176
13465 * tree-vect-slp.c (vect_slp_gather_vectorized_scalar_stmts):
13467 (vect_bb_slp_scalar_cost): Use the computed set of
13468 vectorized scalar stmts instead of relying on the out-of-date
13469 and not accurate PURE_SLP_STMT.
13470 (vect_bb_vectorization_profitable_p): Compute the set
13471 of vectorized scalar stmts.
13473 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
13475 * gimple-range-path.cc (path_range_query::range_of_stmt): Remove
13476 GIMPLE_COND special casing.
13477 (path_range_query::range_defined_in_block): Use range_of_stmt
13478 instead of calling fold_range directly.
13480 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
13482 * gimple-range-path.cc (path_range_query::range_of_expr): Set
13483 m_undefined_path when appropriate.
13484 (path_range_query::internal_range_of_expr): Copy from range_of_expr.
13485 (path_range_query::unreachable_path_p): New.
13486 (path_range_query::precompute_ranges): Set m_undefined_path.
13487 * gimple-range-path.h (path_range_query::unreachable_path_p): New.
13488 (path_range_query::internal_range_of_expr): New.
13489 * tree-ssa-threadbackward.c (back_threader::find_taken_edge_cond):
13490 Use unreachable_path_p.
13492 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
13494 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
13495 Remove argument and call find_taken_edge.
13496 (back_threader::resolve_phi): Do not calculate taken edge before
13497 calling maybe_register_path.
13498 (back_threader::find_paths_to_names): Same.
13500 2021-09-05 Jeff Law <jlaw@localhost.localdomain>
13502 * config/h8300/h8300.md (QHSI2 mode iterator): New mode iterator.
13503 * config/h8300/testcompare.md (store_c): Update name, use new
13505 (store_neg_c, store_shifted_c): New patterns.
13507 2021-09-03 Segher Boessenkool <segher@kernel.crashing.org>
13510 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): On ELFv2 use r11
13511 instead of r12 for CR save, in all cases.
13513 2021-09-03 Andrew Pinski <apinski@marvell.com>
13515 * config/aarch64/aarch64-sve-builtins.cc (register_vector_type):
13516 Handle error_mark_node as the type of the type_decl.
13518 2021-09-03 Andrew Pinski <apinski@marvell.com>
13520 * config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
13522 (aarch64_simd_types): Likewise.
13523 (aarch64_simd_intOI_type_node): Likewise.
13524 (aarch64_simd_intCI_type_node): Likewise.
13525 (aarch64_simd_intXI_type_node): Likewise.
13526 * config/aarch64/aarch64.h (aarch64_fp16_type_node): Likewise.
13527 (aarch64_fp16_ptr_type_node): Likewise.
13528 (aarch64_bf16_type_node): Likewise.
13529 (aarch64_bf16_ptr_type_node): Likewise.
13531 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13533 * range-op.cc (operator_minus::op1_op2_relation_effect): Abstract
13535 (minus_op1_op2_relation_effect): ...here.
13536 (class operator_pointer_diff): New.
13537 (operator_pointer_diff::op1_op2_relation_effect): Call
13538 minus_op1_op2_relation_effect.
13539 (integral_table::integral_table): Add entry for POINTER_DIFF_EXPR.
13541 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13543 * tree-ssa-threadbackward.c (back_threader::thread_through_all_blocks):
13544 Add may_peel_loop_headers.
13545 (back_threader_registry::thread_through_all_blocks): Same.
13546 (try_thread_blocks): Pass may_peel_loop_headers argument.
13547 (pass_early_thread_jumps::execute): Same.
13549 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13551 * tree-ssa-threadedge.c (has_phis_p): New.
13552 (forwarder_block_p): New.
13553 (potentially_threadable_block): Call forwarder_block_p.
13554 (jump_threader::thread_around_empty_blocks): Call has_phis_p.
13555 (jump_threader::thread_through_normal_block): Call
13558 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13560 * tree-ssa-threadbackward.c (back_threader::dump): New.
13561 (back_threader::debug): New.
13562 (back_threader_profitability::profitable_path_p): Dump blocks
13563 even if we are bailing early.
13565 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13567 * tree-ssa-threadupdate.c (cancel_thread): New.
13568 (jump_thread_path_registry::thread_block_1): Use cancel_thread.
13569 (jump_thread_path_registry::mark_threaded_blocks): Same.
13570 (jump_thread_path_registry::register_jump_thread): Same.
13572 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13574 * tree-ssa-threadedge.c (jt_state::push): Only call methods for
13575 which objects are available.
13576 (jt_state::pop): Same.
13577 (jt_state::register_equiv): Same.
13578 (jt_state::register_equivs_on_edge): Same.
13580 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13582 * tree-ssa-threadedge.c (jump_threader::thread_across_edge):
13583 Move pop until after a thread is registered.
13585 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13587 * tree-ssa-threadupdate.c (debug): New.
13589 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13591 * gimple-range-trace.cc (push_dump_file::push_dump_file): New.
13592 (push_dump_file::~push_dump_file): New.
13593 (dump_ranger): Change dump_file temporarily while dumping
13595 * gimple-range-trace.h (class push_dump_file): New.
13597 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13599 * gimple-range-trace.cc (debug_seed_ranger): Remove static.
13600 (dump_ranger): Dump function name.
13602 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13604 * gimple-range-path.cc (path_range_query::range_defined_in_block):
13605 Adjust for non-null.
13606 (path_range_query::adjust_for_non_null_uses): New.
13607 (path_range_query::precompute_ranges): Call
13608 adjust_for_non_null_uses.
13609 * gimple-range-path.h: Add m_non_null and
13610 adjust_for_non_null_uses.
13612 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13614 * gimple-range-path.cc (path_range_query::dump): Dump path
13616 (path_range_query::precompute_ranges): Dump entire path.
13618 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13620 * value-relation.cc (relation_oracle::debug): New.
13621 * value-relation.h (relation_oracle::debug): New.
13623 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13625 * tree-ssa-loop-ch.c: Remove unnecessary include file.
13627 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13629 * gimple-range-fold.cc (fold_using_range::postfold_gcond_edges):
13630 Skip statements with no defining BB.
13631 * gimple-range-path.cc (path_range_query::range_defined_in_block):
13632 Do not get confused by statements with no defining BB.
13634 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13636 * gimple-range-fold.cc (adjust_imagpart_expr): Move from
13637 gimple_range_adjustment. Add support for constants.
13638 (adjust_realpart_expr): New.
13639 (gimple_range_adjustment): Move IMAGPART_EXPR code to
13640 adjust_imagpart_expr.
13641 * range-op.cc (integral_table::integral_table): Add entry for
13644 2021-09-03 Jakub Jelinek <jakub@redhat.com>
13646 * omp-expand.c (expand_omp_atomic_pipeline): Use
13647 IFN_ATOMIC_COMPARE_EXCHANGE instead of
13648 BUILT_IN_SYNC_VAL_COMPARE_AND_SWAP_? so that memory order
13651 2021-09-03 Jakub Jelinek <jakub@redhat.com>
13654 * tree.h (DECL_FIELD_ABI_IGNORED): Changed into rvalue only macro
13655 that is false if DECL_BIT_FIELD.
13656 (SET_DECL_FIELD_ABI_IGNORED, DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD,
13657 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD): Define.
13658 * tree-streamer-out.c (pack_ts_decl_common_value_fields): For
13659 DECL_BIT_FIELD stream DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead
13660 of DECL_FIELD_ABI_IGNORED.
13661 * tree-streamer-in.c (unpack_ts_decl_common_value_fields): Use
13662 SET_DECL_FIELD_ABI_IGNORED instead of writing to
13663 DECL_FIELD_ABI_IGNORED and for DECL_BIT_FIELD use
13664 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead.
13665 * lto-streamer-out.c (hash_tree): For DECL_BIT_FIELD hash
13666 DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead of DECL_FIELD_ABI_IGNORED.
13668 2021-09-03 liuhongt <hongtao.liu@intel.com>
13671 * config/i386/amxbf16intrin.h : Remove macro check for __AMX_BF16__.
13672 * config/i386/amxint8intrin.h : Remove macro check for __AMX_INT8__.
13673 * config/i386/amxtileintrin.h : Remove macro check for __AMX_TILE__.
13675 2021-09-02 Martin Sebor <msebor@redhat.com>
13677 PR tree-optimization/17506
13679 * tree-ssa-uninit.c (warn_uninit): Remove conditional guarding note.
13681 2021-09-02 Richard Biener <rguenther@suse.de>
13683 * tree-ssa-loop-im.c (fill_always_executed_in_1): Refine
13684 fix for PR78185 and continue processing when leaving
13685 finite inner loops.
13687 2021-09-02 Jakub Jelinek <jakub@redhat.com>
13689 PR tree-optimization/99591
13690 * match.pd: Demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
13693 2021-09-02 Richard Biener <rguenther@suse.de>
13696 2021-09-02 Richard Biener <rguenther@suse.de>
13698 PR tree-optimization/102155
13699 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
13700 over a part of the RPO array and do not recurse here.
13701 Dump blocks marked as always executed.
13702 (fill_always_executed_in): Walk over the RPO array and
13703 process loops whose header we run into.
13704 (loop_invariant_motion_in_fun): Compute the first RPO
13705 using rev_post_order_and_mark_dfs_back_seme in iteration
13706 order and pass that to fill_always_executed_in.
13708 2021-09-02 liuhongt <hongtao.liu@intel.com>
13710 * config/i386/i386-modes.def (FLOAT_MODE): Define ieee HFmode.
13711 * config/i386/i386.c (enum x86_64_reg_class): Add
13712 X86_64_SSEHF_CLASS.
13713 (merge_classes): Handle X86_64_SSEHF_CLASS.
13714 (examine_argument): Ditto.
13715 (construct_container): Ditto.
13716 (classify_argument): Ditto, and set HFmode/HCmode to
13717 X86_64_SSEHF_CLASS.
13718 (function_value_32): Return _FLoat16/Complex Float16 by
13720 (function_value_64): Return _Float16/Complex Float16 by SSE
13722 (ix86_print_operand): Handle CONST_DOUBLE HFmode.
13723 (ix86_secondary_reload): Require gpr as intermediate register
13724 to store _Float16 from sse register when sse4 is not
13726 (ix86_libgcc_floating_mode_supported_p): Enable _FLoat16 under
13728 (ix86_scalar_mode_supported_p): Ditto.
13729 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Defined.
13730 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add HFmode.
13731 (VALID_INT_MODE_P): Add HFmode and HCmode.
13732 * config/i386/i386.md (*pushhf_rex64): New define_insn.
13734 (*movhf_internal): Ditto.
13735 * doc/extend.texi (Half-Precision Floating Point): Documemt
13738 2021-09-02 Richard Biener <rguenther@suse.de>
13740 PR tree-optimization/102155
13741 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
13742 over a part of the RPO array and do not recurse here.
13743 Dump blocks marked as always executed.
13744 (fill_always_executed_in): Walk over the RPO array and
13745 process loops whose header we run into.
13746 (loop_invariant_motion_in_fun): Compute the first RPO
13747 using rev_post_order_and_mark_dfs_back_seme in iteration
13748 order and pass that to fill_always_executed_in.
13750 2021-09-02 YunQiang Su <syq@debian.org>
13753 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13755 * config/mips/mips.c (mips_module_isa_name): New.
13756 mips_file_start: add .module mipsREV to all asm output
13758 2021-09-01 Jeff Law <jlaw@localhost.localdomain>
13760 PR tree-optimization/102152
13761 * tree-ssa-dom.c (dom_opt_dom_walker::optimize_stmt): Reduce a vector
13762 comparison to a scalar comparison before calling
13763 update_stmt_if_modified.
13765 2021-09-01 Andrew Pinski <apinski@marvell.com>
13768 * config/aarch64/aarch64.c (aarch64_expand_setmem):
13769 Check STRICT_ALIGNMENT before creating an overlapping
13772 2021-09-01 Martin Sebor <msebor@redhat.com>
13774 * gimple-ssa-warn-access.cc (get_size_range): Add argument.
13775 (check_access): Pass additional argument.
13776 (check_memop_access): Remove template and make a member function.
13777 (maybe_check_dealloc_call): Make a pass_waccess member function.
13778 (class pass_waccess): Add, rename, and remove members.
13779 (pass_waccess::pass_waccess): Adjust to name change.
13780 (pass_waccess::~pass_waccess): Same.
13781 (check_alloca): Make a member function.
13782 (check_alloc_size_call): Same.
13783 (check_strcat): Same.
13784 (check_strncat): Same.
13785 (check_stxcpy): Same.
13786 (check_stxncpy): Same.
13787 (check_strncmp): Same.
13788 (maybe_warn_rdwr_sizes): Rename...
13789 (pass_waccess::maybe_check_access_sizes): ...to this.
13790 (pass_waccess::check_call): Adjust to name changes.
13791 (pass_waccess::maybe_check_dealloc_call): Make a pass_waccess member
13793 (pass_waccess::execute): Adjust to name changes.
13794 * gimple-ssa-warn-access.h (check_memop_access): Remove.
13795 * pointer-query.cc (access_ref::phi): Handle null pointer.
13796 (access_ref::inform_access): Same.
13797 (pointer_query::put_ref): Modify a cached value, not a copy of it.
13798 (pointer_query::dump): New function.
13799 (compute_objsize_r): Avoid overwriting access_ref::bndrng. Cache
13801 * pointer-query.h (pointer_query::dump): Declare.
13802 * tree-ssa-strlen.c (get_range): Simplify. Use function query.
13803 (dump_strlen_info): Use function query.
13804 (printf_strlen_execute): Factor code out into pointer_query::put_ref.
13806 2021-09-01 Thomas Schwinge <thomas@codesourcery.com>
13808 * tree.c (walk_tree_1) <OMP_CLAUSE>: Simplify.
13810 2021-09-01 Iain Sandoe <iain@sandoe.co.uk>
13812 * doc/extend.texi: Document unavailable attribute.
13813 * print-tree.c (print_node): Handle unavailable attribute.
13814 * tree-core.h (struct tree_base): Add a bit to carry unavailability.
13815 * tree.c (error_unavailable_use): New.
13816 * tree.h (TREE_UNAVAILABLE): New.
13817 (error_unavailable_use): New.
13819 2021-09-01 Jakub Jelinek <jakub@redhat.com>
13821 PR tree-optimization/102124
13822 * tree-vect-patterns.c (vect_recog_widen_op_pattern): For ORIG_CODE
13823 MINUS_EXPR, if itype is unsigned with smaller precision than type,
13824 add an extra cast to signed variant of itype to ensure sign-extension.
13826 2021-09-01 Martin Liska <mliska@suse.cz>
13828 * graph.c (draw_cfg_node_succ_edges): Do not color fallthru
13829 edges and rather use colors for TRUE and FALSE edges.
13831 2021-09-01 Richard Biener <rguenther@suse.de>
13833 PR tree-optimization/93491
13834 * tree-ssa-pre.c (compute_avail): Set BB_MAY_NOTRETURN
13835 after processing the stmt itself. Do not consider
13836 pure functions possibly not returning. Properly avoid
13837 adding possibly trapping calls to EXP_GEN when there's
13838 a preceeding possibly not returning call.
13839 * tree-ssa-sccvn.c (vn_reference_may_trap): Conservatively
13842 2021-09-01 Richard Biener <rguenther@suse.de>
13844 PR tree-optimization/102139
13845 * tree-vectorizer.h (vec_base_alignments): Adjust hash-map
13846 type to record a std::pair of the stmt-info and the innermost
13848 (dr_vec_info::group): New member.
13849 * tree-vect-data-refs.c (vect_record_base_alignment): Adjust.
13850 (vect_compute_data_ref_alignment): Verify the recorded
13851 base alignment can be used.
13852 (data_ref_pair): Remove.
13853 (dr_group_sort_cmp): Adjust.
13854 (vect_analyze_data_ref_accesses): Store the group-ID in the
13855 dr_vec_info and operate on a vector of dr_vec_infos.
13857 2021-09-01 YunQiang Su <yunqiang.su@cipunited.com>
13859 * read-md.c (md_reader::handle_enum): support value assignation.
13860 * doc/md.texi: record define_c_enum value assignation support.
13862 2021-09-01 Jakub Jelinek <jakub@redhat.com>
13864 PR tree-optimization/102141
13865 * gimple-ssa-store-merging.c (bswap_view_convert): Add BEFORE
13866 argument. If false, emit stmts after gsi instead of before, and
13868 (bswap_replace): Adjust callers. When converting output of bswap,
13869 emit VIEW_CONVERT prepratation stmts after a copy of gsi instead
13872 2021-09-01 liuhongt <hongtao.liu@intel.com>
13874 * emit-rtl.c (validate_subreg): Get rid of all float-int
13877 2021-09-01 liuhongt <hongtao.liu@intel.com>
13880 2021-08-30 liuhongt <hongtao.liu@intel.com>
13882 * expmed.c (extract_bit_field_1): Make sure we're playing with
13883 integral modes before call extract_integral_bit_field.
13884 (extract_integral_bit_field): Add a parameter of type
13885 scalar_int_mode which corresponds to of tmode.
13886 And call extract_and_convert_fixed_bit_field instead of
13887 extract_fixed_bit_field and convert_extracted_bit_field.
13888 (extract_and_convert_fixed_bit_field): New function, it's a
13889 combination of extract_fixed_bit_field and
13890 convert_extracted_bit_field.
13892 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
13894 * tree.c (walk_tree_1) <OMP_CLAUSE_TILE>: Handle three operands.
13896 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
13898 * omp-general.h (omp_is_reference): Rename to...
13899 (omp_privatize_by_reference): ... this. Adjust all users...
13900 * omp-general.c: ... here, ...
13901 * gimplify.c: ... here, ...
13902 * omp-expand.c: ... here, ...
13903 * omp-low.c: ... here.
13905 2021-08-31 Martin Sebor <msebor@redhat.com>
13907 * gimple-ssa-warn-access.cc (maybe_warn_alloc_args_overflow): Test
13908 pointer element for equality to zero, not that of the cotaining
13911 2021-08-31 Martin Sebor <msebor@redhat.com>
13913 * gcc-rich-location.h (gcc_rich_location): Make ctor explicit.
13915 2021-08-31 Martin Sebor <msebor@redhat.com>
13917 * function.h (function): Add comments.
13918 (get_range_query): Same. Add attribute returns nonnull.
13920 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13922 * expr.c (convert_modes): Don't use subreg_promoted_mode on a
13923 SUBREG if it can't be guaranteed to a SUBREG_PROMOTED_VAR_P set.
13924 Instead use the standard (safer) is_a <scalar_int_mode> idiom.
13926 2021-08-31 Jeff Law <jlaw@localhost.localdomain>
13928 * config.gcc (cris-*-elf, cris-*-none): Remove dbxelf.h from
13930 (m32r-*-elf, m32rle-*-elf, m32r-*-linux): Likewise.
13931 (mn10300-*-*, am33_2.0-*-linux*): Likewise.
13932 (xtensa*-*-elf, xtensa*-*-linux, xtensa*-*-uclinux): Likewise.
13933 (m32c-*-elf*, m32c-*-rtems*): Likewise.
13934 * config/cris/cris.h (DBX_NO_XREFS): Remove.
13935 (DBX_CONTIN_LENGTH, DBX_CONTIN_CHAR): Likewise.
13936 * config/m32r/m32r.h (DBXOUT_SOURCE_LINE): Likewise.
13937 (DBX_DEBUGGING_INFO, DBX_CONTIN_LENGTH): Likewise.
13938 * config/mn10300/mn10300.h (DEFAULT_GDB_EXTENSIONS): Likewise.
13939 * config/mn10300/linux.h (DBX_REGISTER_NAMES): Likewise.
13941 2021-08-31 Marcel Vollweiler <marcel@codesourcery.com>
13943 * gimplify.c (gimplify_scan_omp_clauses): Error handling. 'ancestor' only
13944 allowed on target constructs and only with particular other clauses.
13945 * omp-expand.c (expand_omp_target): Output of 'sorry, not supported' if
13946 'ancestor' is used.
13947 * omp-low.c (check_omp_nesting_restrictions): Error handling. No nested OpenMP
13948 structs when 'ancestor' is used.
13949 (scan_omp_1_stmt): No usage of OpenMP runtime routines in a target region when
13950 'ancestor' is used.
13951 * tree-pretty-print.c (dump_omp_clause): Append 'ancestor'.
13952 * tree.h (OMP_CLAUSE_DEVICE_ANCESTOR): Define macro.
13954 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13956 * expr.c (convert_modes): Preserve SUBREG_PROMOTED_VAR_P when
13957 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
13959 * simplify-rtx.c (simplify_unary_operation_1) [SIGN_EXTEND]:
13960 Likewise, preserve SUBREG_PROMOTED_VAR_P when creating a (wider)
13961 partial subreg from a SUBREG_PROMOTED_VAR_P subreg. Generate
13962 SIGN_EXTEND of the SUBREG_REG when a subreg would be paradoxical.
13963 [ZERO_EXTEND]: Likewise, preserve SUBREG_PROMOTED_VAR_P when
13964 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
13965 subreg. Generate ZERO_EXTEND of the SUBREG_REG when a subreg
13966 would be paradoxical.
13968 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13970 * combine.c (combine_simplify_rtx): Avoid converting an explicit
13971 TRUNCATE into a lowpart SUBREG on !TRULY_NOOP_TRUNCATION targets.
13972 * simplify-rtx.c (simplify_unary_operation_1): Likewise.
13974 2021-08-31 Richard Biener <rguenther@suse.de>
13976 PR tree-optimization/102142
13977 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Fix
13978 condition under which to unset the visited flag.
13980 2021-08-31 Richard Biener <rguenther@suse.de>
13982 PR middle-end/102129
13983 * tree-ssa-ter.c (find_replaceable_in_bb): Do not move
13984 possibly trapping expressions across calls.
13986 2021-08-31 Jakub Jelinek <jakub@redhat.com>
13988 PR tree-optimization/102134
13989 * tree-ssa-ccp.c (bit_value_binop) <case RSHIFT_EXPR>: If sgn is
13990 UNSIGNED and r1val | r1mask has MSB set, ensure lzcount doesn't
13993 2021-08-31 Andrew Pinski <apinski@marvell.com>
13996 * collect-utils.c (setup_signals): New declaration.
13997 * collect-utils.h (setup_signals): New function.
13998 * collect2.c (handler): Delete.
13999 (main): Instead of manually setting up the signals,
14000 just call setup_signals.
14001 * lto-wrapper.c (main): Likewise.
14003 2021-08-31 Andrew Pinski <apinski@marvell.com>
14006 * config/i386/i386-protos.h (x86_output_aligned_bss):
14007 Change align argument to unsigned type.
14008 (x86_elf_aligned_decl_common): Likewise.
14009 * config/i386/i386.c (x86_elf_aligned_decl_common): Likewise.
14010 (x86_output_aligned_bss): Likewise.
14012 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
14014 * config/mips/mips.c (mips_module_isa_name): New.
14015 mips_file_start: add .module mipsREV to all asm output
14017 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
14019 * config/mips/mips.h (struct mips_cpu_info): define enum mips_isa;
14020 use enum instead of int for 'isa' member.
14021 * config.gcc, config/mips/mips.c, config/mips/mips-cpus.def,
14022 config/mips/netbsd.h: replace hardcoded numbers with enum.
14024 2021-08-31 liuhongt <hongtao.liu@intel.com>
14026 * config/i386/sse.md (*<avx512>_ucmp<mode>3_1): Change from
14027 define_split to define_insn_and_split.
14028 (*avx2_eq<mode>3): Removed.
14029 (<avx512>_eq<mode>3<mask_scalar_merge_name>): Adjust pattern
14030 (<avx512>_eq<mode>3<mask_scalar_merge_name>_1): Rename to ..
14031 (*<avx512>_eq<mode>3<mask_scalar_merge_name>_1): .. this, and
14033 (*avx2_gt<mode>3): Removed.
14034 (<avx512>_gt<mode>3<mask_scalar_merge_name>): Change from
14035 define_insn to define_expand, and adjust pattern.
14036 (UNSPEC_MASKED_EQ, UNSPEC_MASKED_GT): Removed.
14038 2021-08-30 David Malcolm <dmalcolm@redhat.com>
14041 * Makefile.in (ANALYZER_OBJS): Add analyzer/call-info.o.
14043 2021-08-30 Jason Merrill <jason@redhat.com>
14045 * doc/invoke.texi: Document -Wmissing-requires.
14047 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
14049 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Remove
14050 TARGET_EXTRA_BUILTINS guard.
14052 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
14054 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Change
14055 initialization of V2DI_type_node and unsigned_V2DI_type_node.
14057 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
14059 * config/rs6000/darwin.h (SUBTARGET_INIT_BUILTINS): Use the new
14060 decl when new_builtins_are_live.
14061 * config/rs6000/rs6000-builtin-new.def (__builtin_cfstring): New
14064 2021-08-30 Pat Haugen <pthaugen@linux.ibm.com>
14066 * config/rs6000/rs6000-cpus.def (ISA_3_1_MASKS_SERVER): Add
14067 OPTION_MASK_P10_FUSION_2STORE.
14068 (POWERPC_MASKS): Likewise.
14069 * config/rs6000/rs6000.c (rs6000_option_override_internal): Enable
14070 store fusion for Power10.
14071 (is_fusable_store): New.
14072 (power10_sched_reorder): Likewise.
14073 (rs6000_sched_reorder): Do Power10 specific reordering.
14074 (rs6000_sched_reorder2): Likewise.
14075 * config/rs6000/rs6000.opt: Add new option.
14077 2021-08-30 Richard Biener <rguenther@suse.de>
14079 PR tree-optimization/102128
14080 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
14081 Move scanning for if-converted scalar code to the caller
14082 and instead delay clearing the visited flag for profitable
14084 (vect_slp_region): Cost all subgraphs before scheduling.
14085 For if-converted BB vectorization scan for scalar COND_EXPRs
14086 and do not vectorize if any found and the cost model is
14089 2021-08-30 Richard Biener <rguenther@suse.de>
14091 * common.opt (fexceptions): Mark
14092 EnabledBy(fnon-call-exceptions).
14093 * doc/invoke.texi (fnon-call-exceptions): Document this
14094 enables -fexceptions.
14096 2021-08-30 Sebastian Huber <sebastian.huber@embedded-brains.de>
14098 * tsystem.h (abort): Define abort() if inhibit_libc is defined and it
14099 is not already defined.
14101 2021-08-30 liuhongt <hongtao.liu@intel.com>
14103 * expmed.c (extract_bit_field_1): Make sure we're playing with
14104 integral modes before call extract_integral_bit_field.
14105 (extract_integral_bit_field): Add a parameter of type
14106 scalar_int_mode which corresponds to of tmode.
14107 And call extract_and_convert_fixed_bit_field instead of
14108 extract_fixed_bit_field and convert_extracted_bit_field.
14109 (extract_and_convert_fixed_bit_field): New function, it's a
14110 combination of extract_fixed_bit_field and
14111 convert_extracted_bit_field.
14113 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
14115 * config/darwin.c (darwin_libc_has_function): Do not run
14116 the checks for x86 or modern Darwin. Make sure that there
14117 is a value set for darwin_macosx_version_min before testing.
14119 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
14121 * config/i386/darwin.h (CLEAR_INSN_CACHE): New.
14123 2021-08-28 Jan Hubicka <hubicka@ucw.cz>
14125 * ipa-modref-tree.h (modref_access_node::merge): Break out
14126 logic combining offsets and logic merging ranges to ...
14127 (modref_access_node::combined_offsets): ... here
14128 (modref_access_node::update2): ... here
14129 (modref_access_node::closer_pair_p): New member function.
14130 (modref_access_node::forced_merge): New member function.
14131 (modre_ref_node::insert): Do merging when table is full.
14133 2021-08-28 YunQiang Su <yunqiang.su@cipunited.com>
14136 * config.gcc: MIPS: use N64 ABI by default if the triple end
14137 with -gnuabi64, which is used by Debian since 2013.
14139 2021-08-28 Alexandre Oliva <oliva@adacore.com>
14141 * ipa-modref.c (analyze_function): Skip debug stmts.
14142 * tree-inline.c (estimate_num_insn): Consider builtins even
14143 without a cgraph_node.
14145 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
14147 * config/h8300/bitfield.md (cstore<mode>4): Remove expander.
14148 * config/h8300/h8300.c (h8300_expand_branch): Remove function.
14149 * config/h8300/h8300-protos.h (h8300_expadn_branch): Remove prototype.
14150 * config/h8300/h8300.md (eqne): New code iterator.
14151 (geultu, geultu_to_c): Similarly.
14152 * config/h8300/testcompare.md (cstore<mode>4): Dummy expander.
14153 (store_c_<mode>, store_c_i_<mode>): New define_insn_and_splits
14154 (cmp<mode>_c): New pattern
14156 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
14158 * tree-ssa-dom.c (reduce_vector_comparison_to_scalar_comparison): New
14160 (dom_opt_dom_walker::optimize_stmt): Use it.
14162 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
14164 * config/darwin.c (finalize_ctors): Add a section-start linker-
14166 (finalize_dtors): Likewise.
14167 * config/darwin.h (MIN_LD64_INIT_TERM_START_LABELS): New.
14169 2021-08-27 Bill Schmidt <wschmidt@linux.ibm.com>
14171 * config/rs6000/rs6000-call.c (rs6000-builtins.h): New #include.
14172 (rs6000_init_builtins): Call rs6000_init_generated_builtins. Skip the
14173 old initialization logic when new builtins are enabled.
14174 * config/rs6000/rs6000-gen-builtins.c (write_decls): Rename
14175 rs6000_autoinit_builtins to rs6000_init_generated_builtins.
14176 (write_init_file): Likewise.
14178 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
14180 * configure.ac (darwin2[[0-9]]* | darwin19*): Alter use of
14181 gcc_GAS_CHECK_FEATURE to remove an extraneous parameter.
14182 (amdgcn-* | gcn-*) Likewise.
14184 2021-08-27 Anthony Sharp <anthonysharp15@gmail.com>
14186 * symbol-summary.h: Added missing template keyword.
14188 2021-08-27 Richard Biener <rguenther@suse.de>
14190 PR tree-optimization/45178
14191 * tree-ssa-dce.c (find_obviously_necessary_stmts): For
14192 infinite loops without exit do not mark control dependent
14193 edges of the latch necessary.
14195 2021-08-27 konglin1 <lingling.kong@intel.com>
14198 * config/i386/sse.md: (<avx512>scattersi<mode>): Add mask operand to
14200 (<avx512>scattersi<mode>): Likewise.
14201 (*avx512f_scattersi<VI48F:mode>): Merge mask operand to set_dest.
14202 (*avx512f_scatterdi<VI48F:mode>): Likewise
14204 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
14206 * config/rs6000/rs6000.c (rs6000_builtin_md_vectorized_function): Add
14207 support for built-in functions MISC_BUILTIN_DIVWE, MISC_BUILTIN_DIVWEU,
14208 MISC_BUILTIN_DIVDE, MISC_BUILTIN_DIVDEU, P10_BUILTIN_CFUGED,
14209 P10_BUILTIN_CNTLZDM, P10_BUILTIN_CNTTZDM, P10_BUILTIN_PDEPD and
14210 P10_BUILTIN_PEXTD on Power10.
14212 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
14214 * config/rs6000/rs6000-call.c (builtin_function_type): Add unsigned
14215 signedness for some Power10 bifs.
14217 2021-08-27 David Edelsohn <dje.gcc@gmail.com>
14220 * config/rs6000/rs6000.c (rs6000_adjust_field_align): Use
14221 computed alignment if the entire struct has attribute packed.
14223 2021-08-27 liuhongt <hongtao.liu@intel.com>
14227 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
14228 IX86_BUILTIN_SHUFPD512, IX86_BUILTIN_SHUFPS512,
14229 IX86_BUILTIN_SHUFPD256, IX86_BUILTIN_SHUFPS,
14230 IX86_BUILTIN_SHUFPS256.
14231 (ix86_masked_all_ones): New function.
14233 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
14235 * config/i386/i386.md (*btr<mode>_1): Call force_reg unconditionally.
14236 (conditional moves with memory inputs splitters): Ditto.
14237 * config/i386/sse.md (one_cmpl<mode>2): Simplify.
14239 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
14241 * ipa-modref-tree.h (modref_access_node::try_merge_with): Restart
14242 search after merging.
14244 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
14246 * config/rs6000/rs6000-overload.def: Add remaining overloads.
14248 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
14250 * config/rs6000/rs6000-builtin-new.def: Add cell stanza.
14252 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
14254 * config/rs6000/rs6000-builtin-new.def: Add ieee128-hw, dfp,
14255 crypto, and htm stanzas.
14257 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
14259 * config/rs6000/rs6000-builtin-new.def: Add mma stanza.
14261 2021-08-26 Martin Sebor <msebor@redhat.com>
14263 * tree-ssa-uninit.c (warn_uninit): Refactor and simplify.
14264 (warn_uninit_phi_uses): Remove argument from calls to warn_uninit.
14265 (warn_uninitialized_vars): Same. Reduce visibility of locals.
14266 (warn_uninitialized_phi): Same.
14268 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
14270 * tree-ssa-ccp.c (get_individual_bits): Helper function to
14271 extract the individual bits from a widest_int constant (mask).
14272 (gray_code_bit_flips): New read-only table for effiently
14273 enumerating permutations/combinations of bits.
14274 (bit_value_binop) [LROTATE_EXPR, RROTATE_EXPR]: Handle rotates
14275 by unknown counts that are guaranteed less than the target
14276 precision and four or fewer unknown bits by enumeration.
14277 [LSHIFT_EXPR, RSHIFT_EXPR]: Likewise, also handle shifts by
14278 enumeration under the same conditions. Handle remaining
14279 shifts as a mask based upon the minimum possible shift value.
14281 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
14282 Richard Biener <rguenther@suse.de>
14284 * match.pd (shift transformations): Remove a redundant
14285 !POINTER_TYPE_P check.
14287 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
14290 * config/i386/i386.md (cmove reg-reg move elimination peephole2s):
14291 Set all_regs to true in the call to replace_rtx.
14293 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
14295 * ipa-modref-tree.c (test_insert_search_collapse): Update test.
14296 * ipa-modref-tree.h (modref_base_node::insert): Be smarter when
14297 hiting --param modref-max-refs limit.
14298 (modref_tree:insert_base): Be smarter when hitting
14299 --param modref-max-bases limit. Add new parameter REF.
14300 (modref_tree:insert): Update.
14301 (modref_tree:merge): Update.
14302 * ipa-modref.c (read_modref_records): Update.
14304 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
14306 * params.opt: (modref-max-adjustments): Add full stop.
14308 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
14310 * ipa-modref-tree.h (modref_ref_node::verify): New member
14312 (modref_ref_node::insert): Use it.
14313 (modref_ref_node::try_mere_with): Fix off by one error.
14315 2021-08-26 Martin Liska <mliska@suse.cz>
14316 Stefan Kneifel <stefan.kneifel@bluewin.ch>
14318 * cgraph.h (create_version_clone_with_body): Add new parameter.
14319 * cgraphclones.c: Likewise.
14320 * multiple_target.c (create_dispatcher_calls): Do not use
14322 (create_target_clone): Likewise here.
14324 2021-08-26 Jonathan Yong <10walls@gmail.com>
14326 * doc/extend.texi: Add note about reserved priorities
14327 to the constructor attribute.
14329 2021-08-25 Martin Sebor <msebor@redhat.com>
14331 * gimple-range-cache.cc (ssa_global_cache::dump): Avoid printing
14332 range table header alone.
14333 * gimple-range.cc (gimple_ranger::export_global_ranges): Same.
14335 2021-08-25 Jan Hubicka <hubicka@ucw.cz>
14337 * doc/invoke.texi: Document --param modref-max-adjustments.
14338 * ipa-modref-tree.c (test_insert_search_collapse): Update.
14339 (test_merge): Update.
14340 * ipa-modref-tree.h (struct modref_access_node): Add adjustments;
14341 (modref_access_node::operator==): Fix handling of access ranges.
14342 (modref_access_node::contains): Constify parameter; handle also
14343 mismatched parm offsets.
14344 (modref_access_node::update): New function.
14345 (modref_access_node::merge): New function.
14346 (unspecified_modref_access_node): Update constructor.
14347 (modref_ref_node::insert_access): Add record_adjustments parameter;
14349 (modref_ref_node::try_merge_with): New private function.
14350 (modref_tree::insert): New record_adjustments parameter.
14351 (modref_tree::merge): New record_adjustments parameter.
14352 (modref_tree::copy_from): Update.
14353 * ipa-modref.c (dump_access): Dump adjustments field.
14354 (get_access): Update constructor.
14355 (record_access): Update call of insert.
14356 (record_access_lto): Update call of insert.
14357 (merge_call_side_effects): Add record_adjustments parameter.
14358 (get_access_for_fnspec): Update.
14359 (process_fnspec): Update.
14360 (analyze_call): Update.
14361 (analyze_function): Update.
14362 (read_modref_records): Update.
14363 (ipa_merge_modref_summary_after_inlining): Update.
14364 (propagate_unknown_call): Update.
14365 (modref_propagate_in_scc): Update.
14366 * params.opt (param-max-modref-adjustments=): New.
14368 2021-08-25 Michael Meissner <meissner@linux.ibm.com>
14370 * config/rs6000/vsx.md (UNSPEC_XXSPLTIDP): Rename from
14372 (xxspltiw_v4si): Use vecperm type attribute.
14373 (xxspltiw_v4si_inst): Use vecperm type attribute.
14374 (xxspltiw_v4sf_inst): Likewise.
14375 (xxspltidp_v2df): Use vecperm type attribute. Use
14376 UNSPEC_XXSPLTIDP instead of UNSPEC_XXSPLTID.
14377 (xxspltidp_v2df_inst): Likewise.
14378 (xxsplti32dx_v4si): Use vecperm type attribute.
14379 (xxsplti32dx_v4si_inst): Likewise.
14380 (xxsplti32dx_v4sf_inst): Likewise.
14381 (xxblend_<mode>): Likewise.
14382 (xxpermx): Likewise.
14383 (xxpermx_inst): Likewise.
14384 (xxeval): Likewise.
14386 2021-08-25 Lewis Hyatt <lhyatt@gmail.com>
14389 * coretypes.h (typedef diagnostic_input_charset_callback): Declare.
14390 * diagnostic.c (diagnostic_initialize_input_context): New function.
14391 * diagnostic.h (diagnostic_initialize_input_context): Declare.
14392 * input.c (default_charset_callback): New function.
14393 (file_cache::initialize_input_context): New function.
14394 (file_cache_slot::create): Added ability to convert the input
14395 according to the input context.
14396 (file_cache::file_cache): Initialize the new input context.
14397 (class file_cache_slot): Added new m_alloc_offset member.
14398 (file_cache_slot::file_cache_slot): Initialize the new member.
14399 (file_cache_slot::~file_cache_slot): Handle potentially offset buffer.
14400 (file_cache_slot::maybe_grow): Likewise.
14401 (file_cache_slot::needs_read_p): Handle NULL fp, which is now possible.
14402 (file_cache_slot::get_next_line): Likewise.
14403 * input.h (class file_cache): Added input context member.
14405 2021-08-25 Richard Biener <rguenther@suse.de>
14407 PR tree-optimization/102046
14408 * tree-vect-slp.c (vect_build_slp_tree_2): Conservatively
14409 update ->any_pattern when swapping operands.
14411 2021-08-25 Hongyu Wang <hongyu.wang@intel.com>
14414 * config/i386/i386.c (ix86_live_on_entry): Adjust comment.
14415 (ix86_decompose_address): Remove retval check for ASHIFT,
14416 allow non-canonical zero extend if AND mask covers ASHIFT
14418 (ix86_legitimate_address_p): Adjust condition for decompose.
14419 (ix86_rtx_costs): Adjust cost for lea with non-canonical
14421 Co-Authored by: Uros Bizjak <ubizjak@gmail.com>
14423 2021-08-25 Jiufu Guo <guojiufu@linux.ibm.com>
14425 PR tree-optimization/101145
14426 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
14428 (number_of_iterations_lt): Invoke above function.
14429 (adjust_cond_for_loop_until_wrap):
14430 Merge to number_of_iterations_until_wrap.
14431 (number_of_iterations_cond): Update invokes for
14432 adjust_cond_for_loop_until_wrap and number_of_iterations_lt.
14434 2021-08-25 konglin1 <lingling.kong@intel.com>
14437 * config/i386/avx512dqintrin.h (_mm512_fpclass_ps_mask): Fix
14438 macro define in O0.
14439 (_mm512_mask_fpclass_ps_mask): Ditto.
14441 2021-08-25 Kewen Lin <linkw@linux.ibm.com>
14443 * config/rs6000/altivec.md (vec_unpacku_hi_v16qi): Remove.
14444 (vec_unpacku_hi_v8hi): Likewise.
14445 (vec_unpacku_lo_v16qi): Likewise.
14446 (vec_unpacku_lo_v8hi): Likewise.
14447 (vec_unpacku_hi_<VP_small_lc>): New define_expand.
14448 (vec_unpacku_lo_<VP_small_lc>): Likewise.
14450 2021-08-24 David Edelsohn <dje.gcc@gmail.com>
14452 * config/rs6000/aix.h (SYSTEM_IMPLICIT_EXTERN_C): Delete.
14453 * config/rs6000/aix71.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
14454 * config/rs6000/aix72.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
14455 * config/rs6000/aix73.h (TARGET_AIX_VERSION): Increase to 73.
14457 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14459 PR middle-end/102031
14460 * simplify-rtx.c (simplify_truncation): When comparing precisions
14461 use "subreg_prec" variable, not "subreg_mode".
14463 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14465 * config/rs6000/rs6000-builtin-new.def: Add power10 and power10-64
14468 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14470 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Initialize
14471 various pointer type nodes.
14472 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add enum
14473 values for various pointer types.
14474 (ptr_V16QI_type_node): New macro.
14475 (ptr_V1TI_type_node): New macro.
14476 (ptr_V2DI_type_node): New macro.
14477 (ptr_V2DF_type_node): New macro.
14478 (ptr_V4SI_type_node): New macro.
14479 (ptr_V4SF_type_node): New macro.
14480 (ptr_V8HI_type_node): New macro.
14481 (ptr_unsigned_V16QI_type_node): New macro.
14482 (ptr_unsigned_V1TI_type_node): New macro.
14483 (ptr_unsigned_V8HI_type_node): New macro.
14484 (ptr_unsigned_V4SI_type_node): New macro.
14485 (ptr_unsigned_V2DI_type_node): New macro.
14486 (ptr_bool_V16QI_type_node): New macro.
14487 (ptr_bool_V8HI_type_node): New macro.
14488 (ptr_bool_V4SI_type_node): New macro.
14489 (ptr_bool_V2DI_type_node): New macro.
14490 (ptr_bool_V1TI_type_node): New macro.
14491 (ptr_pixel_type_node): New macro.
14492 (ptr_intQI_type_node): New macro.
14493 (ptr_uintQI_type_node): New macro.
14494 (ptr_intHI_type_node): New macro.
14495 (ptr_uintHI_type_node): New macro.
14496 (ptr_intSI_type_node): New macro.
14497 (ptr_uintSI_type_node): New macro.
14498 (ptr_intDI_type_node): New macro.
14499 (ptr_uintDI_type_node): New macro.
14500 (ptr_intTI_type_node): New macro.
14501 (ptr_uintTI_type_node): New macro.
14502 (ptr_long_integer_type_node): New macro.
14503 (ptr_long_unsigned_type_node): New macro.
14504 (ptr_float_type_node): New macro.
14505 (ptr_double_type_node): New macro.
14506 (ptr_long_double_type_node): New macro.
14507 (ptr_dfloat64_type_node): New macro.
14508 (ptr_dfloat128_type_node): New macro.
14509 (ptr_ieee128_type_node): New macro.
14510 (ptr_ibm128_type_node): New macro.
14511 (ptr_vector_pair_type_node): New macro.
14512 (ptr_vector_quad_type_node): New macro.
14513 (ptr_long_long_integer_type_node): New macro.
14514 (ptr_long_long_unsigned_type_node): New macro.
14516 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14518 * config/rs6000/rs6000-builtin-new.def: Add power9-vector, power9,
14519 and power9-64 stanzas.
14521 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14522 Tom de Vries <tdevries@suse.de>
14524 * config.gcc (nvptx-*-*): Define {c,c++}_target_objs.
14525 * config/nvptx/nvptx-protos.h (nvptx_cpu_cpp_builtins): Prototype.
14526 * config/nvptx/nvptx.h (TARGET_CPU_CPP_BUILTINS): Implement with
14527 a call to the new nvptx_cpu_cpp_builtins function in nvptx-c.c.
14528 * config/nvptx/t-nvptx (nvptx-c.o): New rule.
14529 * config/nvptx/nvptx-c.c: New source file.
14530 (nvptx_cpu_cpp_builtins): Move implementation here.
14532 2021-08-24 Martin Sebor <msebor@redhat.com>
14534 PR middle-end/101600
14535 PR middle-end/101977
14536 * gimple-ssa-warn-access.cc (maybe_warn_for_bound): Tighten up
14537 the phrasing of a warning.
14538 (check_access): Use the remaining size after subtracting any offset
14539 rather than the whole object size.
14540 * pointer-query.cc (access_ref::get_ref): Clear BASE0 flag if it's
14541 clear for any nonnull PHI argument.
14542 (compute_objsize): Clear argument.
14544 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14546 * config/rs6000/rs6000-builtin-new.def: Add power8-vector stanza.
14548 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14550 * config/rs6000/rs6000-builtin-new.def: Add power7 and power7-64
14553 2021-08-24 Andrew MacLeod <amacleod@redhat.com>
14555 * value-relation.cc (rr_transitive_table): New.
14556 (relation_transitive): New.
14557 (value_relation::swap): Remove.
14558 (value_relation::apply_transitive): New.
14559 (relation_oracle::relation_oracle): Allocate a new tmp bitmap.
14560 (relation_oracle::register_relation): Call register_transitives.
14561 (relation_oracle::register_transitives): New.
14562 * value-relation.h (relation_oracle): Add new temporary bitmap and
14565 2021-08-24 H.J. Lu <hjl.tools@gmail.com>
14568 * config/i386/i386-expand.c (ix86_expand_vector_move): Broadcast
14569 from integer to a pseudo vector register.
14571 2021-08-24 Richard Biener <rguenther@suse.de>
14573 PR tree-optimization/100089
14574 * tree-vectorizer.h (vect_slp_bb): Rename to ...
14575 (vect_slp_if_converted_bb): ... this and get the original
14576 loop as new argument.
14577 * tree-vectorizer.c (try_vectorize_loop_1): Revert previous fix,
14578 pass original loop to vect_slp_if_converted_bb.
14579 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
14580 If orig_loop was passed scan the not vectorized stmts
14581 for COND_EXPRs and force not profitable if found.
14582 (vect_slp_region): Pass down all SLP instances to costing
14583 if orig_loop was specified.
14584 (vect_slp_bbs): Pass through orig_loop.
14585 (vect_slp_bb): Rename to ...
14586 (vect_slp_if_converted_bb): ... this and get the original
14587 loop as new argument.
14588 (vect_slp_function): Adjust.
14590 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14593 * config/arm/arm.md (attribute arch): Add fix_vlldm.
14594 (arch_enabled): Use it.
14595 * config/arm/vfp.md (lazy_store_multiple_insn): Add alternative to
14596 use when erratum mitigation is needed.
14598 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14601 * config/arm/arm.opt (mfix-cmse-cve-2021-35465): New option.
14602 * doc/invoke.texi (Arm Options): Document it.
14603 * config/arm/arm-cpus.in (quirk_vlldm): New feature bit.
14604 (ALL_QUIRKS): Add quirk_vlldm.
14605 (cortex-m33): Add quirk_vlldm.
14606 (cortex-m35p, cortex-m55): Likewise.
14607 * config/arm/arm.c (arm_option_override): Enable fix_vlldm if
14608 targetting an affected CPU and not explicitly controlled on
14611 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14613 * config/arm/vfp.md (lazy_store_multiple_insn): Rewrite as valid RTL.
14614 (lazy_load_multiple_insn): Likewise.
14616 2021-08-24 liuhongt <hongtao.liu@intel.com>
14619 * config/i386/sse.md (<avx512>_vternlog<mode><sd_maskz_name>):
14620 Enable avx512 embedded broadcast.
14621 (*<avx512>_vternlog<mode>_all): Ditto.
14622 (<avx512>_vternlog<mode>_mask): Ditto.
14624 2021-08-24 liuhongt <hongtao.liu@intel.com>
14627 * config/i386/i386.c (ix86_rtx_costs): Define cost for
14629 * config/i386/i386.h (STRIP_UNARY): New macro.
14630 * config/i386/predicates.md (reg_or_notreg_operand): New
14632 * config/i386/sse.md (*<avx512>_vternlog<mode>_all): New define_insn.
14633 (*<avx512>_vternlog<mode>_1): New pre_reload
14634 define_insn_and_split.
14635 (*<avx512>_vternlog<mode>_2): Ditto.
14636 (*<avx512>_vternlog<mode>_3): Ditto.
14637 (any_logic1,any_logic2): New code iterator.
14638 (logic_op): New code attribute.
14639 (ternlogsuffix): Extend to VNxDF and VNxSF.
14641 2021-08-24 Richard Biener <rguenther@suse.de>
14643 * doc/invoke.texi (vect-inner-loop-cost-factor): Adjust.
14644 * params.opt (--param vect-inner-loop-cost-factor): Adjust
14646 * tree-vect-loop.c (vect_analyze_loop_form): Initialize
14647 inner_loop_cost_factor to the minimum of the estimated number
14648 of iterations of the inner loop and vect-inner-loop-cost-factor.
14650 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14651 Richard Biener <rguenther@suse.de>
14653 * config/i386/i386-features.c (compute_convert_gain): Provide
14654 more accurate values for CONST_INT, when optimizing for size.
14655 * config/i386/i386.c (COSTS_N_BYTES): Move definition from here...
14656 * config/i386/i386.h (COSTS_N_BYTES): to here.
14658 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14659 Jakub Jelinek <jakub@redhat.com>
14661 PR middle-end/102029
14662 * match.pd (shift transformations): Add an additional check for
14663 !POINTER_TYPE_P in the recently added left shift transformation.
14665 2021-08-24 liuhongt <hongtao.liu@intel.com>
14667 PR tree-optimization/100089
14668 * tree-vectorizer.c (try_vectorize_loop_1): Disable slp in
14669 loop vectorizer when cost model is very-cheap.
14671 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14673 * config/rs6000/rs6000-gen-builtins.c (parse_bif_entry): Don't call
14674 asprintf, which is not available on AIX.
14676 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14678 * config.gcc (target_gtfiles): Add ./rs6000-builtins.h.
14679 * config/rs6000/t-rs6000 (EXTRA_GTYPE_DEPS): Set.
14681 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14683 * config.gcc (powerpc*-*-*): Add rs6000-builtins.o to extra_objs.
14684 * config/rs6000/rs6000-gen-builtins.c (main): Close init_file
14686 * config/rs6000/t-rs6000 (rs6000-gen-builtins.o): New target.
14687 (rbtree.o): Likewise.
14688 (rs6000-gen-builtins): Likewise.
14689 (rs6000-builtins.c): Likewise.
14690 (rs6000-builtins.h): Likewise.
14691 (rs6000.o): Add dependency.
14692 (EXTRA_HEADERS): Add rs6000-vecdefines.h.
14693 (rs6000-vecdefines.h): New target.
14694 (rs6000-builtins.o): Likewise.
14695 (rs6000-call.o): Add rs6000-builtins.h as a dependency.
14696 (rs6000-c.o): Likewise.
14698 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14701 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace):
14702 Diagnose buffer overrun.
14703 (safe_inc_pos): Fix overrun detection.
14704 (match_identifier): Diagnose buffer overrun.
14705 (match_integer): Likewise.
14706 (match_to_right_bracket): Likewise.
14708 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14710 * ipa-modref-tree.h (modref_access_node::range_info_useful_p):
14711 Improve range compare.
14712 (modref_access_node::contains): New member function.
14713 (modref_access_node::search): Remove.
14714 (modref_access_node::insert): Be smarter about subaccesses.
14716 2021-08-23 Thomas Schwinge <thomas@codesourcery.com>
14718 * config/i386/i386-options.c (ix86_omp_device_kind_arch_isa)
14719 <omp_device_arch> [ACCEL_COMPILER]: Match "intel_mic".
14720 * config/i386/t-omp-device (omp-device-properties-i386) <arch>:
14723 2021-08-23 Jeff Law <jlaw@localhost.localdomain>
14725 * config/h8300/h8300-protos.h (h8300_expand_epilogue): Add new
14727 * config/h8300/jumpcall.md (call, call_value): Restrict to
14728 !SIBLING_CALL_P cases.
14729 (subcall, sibcall_value): New patterns & expanders.
14730 * config/h8300/proepi.md (epilogue): Pass new argument to
14731 h8300_expand_epilogue.
14732 (sibcall_epilogue): New expander.
14733 * config/h8300/h8300.c (h8300_expand_epilogue): Handle sibcall
14735 (h8300_ok_for_sibcall_p): New function.
14736 (TARGET_FUNCTION_OK_FOR_SIBCALL): define.
14738 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14740 * simplify-rtx.c (simplify_unary_operation_1): [TRUNCATE]:
14741 Handle case where the operand is already the desired mode.
14743 2021-08-23 Richard Biener <rguenther@suse.de>
14746 * tree-ssa-structalias.c (ipa_pta_execute): Check in_other_partition
14747 in addition to has_gimple_body.
14749 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14751 PR middle-end/101949
14752 * ipa-modref.c (analyze_ssa_name_flags): Fix merging of
14755 2021-08-23 Martin Liska <mliska@suse.cz>
14757 * doc/invoke.texi: Put the option out of -mxl-mode-app-model
14760 2021-08-23 Richard Biener <rguenther@suse.de>
14762 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
14763 Properly scale the inner loop cost only once.
14765 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14767 * tree-ssa-ccp.c (bit_value_binop) [TRUNC_MOD_EXPR, TRUNC_DIV_EXPR]:
14768 Provide bounds for unsigned (and signed with non-negative operands)
14769 division and modulus.
14771 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14773 * simplify-rtx.c (simplify_truncation): Generalize simplification
14774 of (truncate:A (subreg:B X)).
14775 (simplify_unary_operation_1) [FLOAT_TRUNCATE, FLOAT_EXTEND,
14776 SIGN_EXTEND, ZERO_EXTEND]: Handle cases where the operand
14777 already has the desired machine mode.
14778 (test_scalar_int_ops): Add tests that useless extensions and
14779 truncations are optimized away.
14780 (test_scalar_int_ext_ops): New self-test function to confirm
14781 that truncations of extensions are correctly simplified.
14782 (test_scalar_int_ext_ops2): New self-test function to check
14783 truncations of truncations, extensions of extensions, and
14784 truncations of extensions.
14785 (test_scalar_ops): Call the above two functions with a
14786 representative sampling of integer machine modes.
14788 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14790 * match.pd (shift transformations): Change the sign of an
14791 LSHIFT_EXPR if it reduces the number of explicit conversions.
14793 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14795 PR tree-optimization/86723
14796 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Add
14797 cast64_to_32 argument, set *cast64_to_32 to false, unless n is
14798 non-memory permutation of 64-bit src which only has bytes of
14799 0 or [5..8] and n->range is 4.
14800 (find_bswap_or_nop): Add cast64_to_32 and mask arguments, adjust
14801 find_bswap_or_nop_finalize caller, support bswap with some bytes
14802 zeroed, as long as at least two bytes are not zeroed.
14803 (bswap_replace): Add mask argument and handle masking of bswap
14805 (maybe_optimize_vector_constructor): Adjust find_bswap_or_nop
14806 caller, punt if cast64_to_32 or mask is not all ones.
14807 (pass_optimize_bswap::execute): Adjust find_bswap_or_nop_finalize
14808 caller, for now punt if cast64_to_32.
14810 2021-08-23 Richard Biener <rguenther@suse.de>
14812 PR tree-optimization/79334
14813 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Record
14814 a type also for COMPONENT_REFs.
14815 (vn_reference_may_trap): Check ARRAY_REF with constant index
14816 against the array domain.
14818 2021-08-23 liuhongt <hongtao.liu@intel.com>
14821 * config/i386/sse.md (*avx512f_pshufb_truncv8hiv8qi_1): Add
14822 TARGET_AVX512BW to condition.
14824 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14827 * dwarf2out.c (gen_variable_die): Add DW_AT_location for global
14828 register variables already during early_dwarf if possible.
14830 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
14832 * config/arm/arm_mve.h: Fix __arm_vctp16q return type.
14834 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
14837 * config/arm/arm.opt: Fix typo.
14838 * config/arm/t-rmprofile: Fix typo.
14840 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14842 * tree.h (OMP_CLAUSE_GRAINSIZE_STRICT): Define.
14843 (OMP_CLAUSE_NUM_TASKS_STRICT): Define.
14844 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_GRAINSIZE,
14845 case OMP_CLAUSE_NUM_TASKS>: Print strict: modifier.
14846 * omp-expand.c (expand_task_call): Use GOMP_TASK_FLAG_STRICT in iflags
14847 if either grainsize or num_tasks clause has the strict modifier.
14849 2021-08-23 Martin Liska <mliska@suse.cz>
14851 * dbgcnt.def (DEBUG_COUNTER): New counter.
14852 * gimple.c (gimple_call_arg_flags): Use it in IPA PTA.
14854 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14856 * ipa-modref.c (analyze_ssa_name_flags): Improve handling of return slot.
14858 2021-08-23 Xi Ruoyao <xry111@mengyan1223.wang>
14861 * config/mips/mips-protos.h (mips_msa_output_shift_immediate):
14863 * config/mips/mips.c (mips_msa_output_shift_immediate): New
14865 * config/mips/mips-msa.md (vashl<mode>3, vashr<mode>3,
14866 vlshr<mode>3): Call it.
14868 2021-08-22 Jan Hubicka <hubicka@ucw.cz>
14869 Martin Liska <mliska@suse.cz>
14871 PR middle-end/101949
14872 * ipa-modref.c (analyze_ssa_name_flags): Indirect call implies
14875 2021-08-21 Dragan Mladjenovic <OT_Dragan.Mladjenovic@mediatek.com>
14877 * config/mips/mips.c (mips_function_rodata_section,
14878 TARGET_ASM_FUNCTION_RODATA_SECTION): Removed.
14880 2021-08-21 John David Anglin <danglin@gcc.gnu.org>
14882 * config/pa/pa.c (pa_asm_output_aligned_common): Remove warning.
14884 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14886 * configure.ac (thread-local storage support): Remove tls_first_major
14887 and tls_first_minor. Use "$conftest_s" to check support.
14888 * configure: Regenerate.
14890 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14892 * configure.ac: Fixup formatting.
14894 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14896 * acinclude.m4 (gcc_GAS_CHECK_FEATURE): Remove third argument and ...
14897 * configure.ac: ... update all callers.
14899 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14902 * acinclude.m4 (_gcc_COMPUTE_GAS_VERSION, _gcc_GAS_VERSION_GTE_IFELSE)
14903 (gcc_GAS_VERSION_GTE_IFELSE): Remove.
14904 (gcc_GAS_CHECK_FEATURE): Do not handle in-tree case specially.
14905 * configure.ac: Remove gcc_cv_gas_major_version, gcc_cv_gas_minor_version.
14906 Remove remaining checks for in-tree assembler.
14907 * configure: Regenerate.
14909 2021-08-20 Jeff Law <jlaw@localhost.localdomain>
14911 * config/h8300/h8300.c (shift_alg_hi): Improve arithmetic shift right
14912 by 15 bits for H8/300H and H8/S. Improve logical shifts by 12
14914 (shift_alg_si): Improve arithmetic right shift by 28-30 bits for
14915 H8/300H. Improve arithmetic shift right by 15 bits for H8/S.
14916 Improve logical shifts by 27 bits for H8/S.
14917 (get_shift_alg): Corresponding changes.
14918 (h8300_option_override): Revert to loops for -Os when profitable.
14920 2021-08-20 Richard Biener <rguenther@suse.de>
14922 * tree-vect-data-refs.c (dr_group_sort_cmp): Do not compare
14924 (vect_analyze_data_ref_accesses): Likewise. Assign the BB
14925 index as group_id when dataref_groups were not computed.
14926 * tree-vect-slp.c (vect_slp_bbs): Bump current_group when
14927 we advace to the next BB.
14929 2021-08-20 Jakub Jelinek <jakub@redhat.com>
14931 * omp-builtins.def (BUILT_IN_GOMP_WARNING, BUILT_IN_GOMP_ERROR): New
14934 2021-08-20 Martin Liska <mliska@suse.cz>
14936 PR gcov-profile/89961
14937 * gcov.c (make_gcov_file_name): Rewrite using std::string.
14938 (mangle_name): Simplify, do not used the second argument.
14939 (strip_extention): New function.
14940 (get_md5sum): Likewise.
14941 (get_gcov_intermediate_filename): Handle properly -p and -x
14943 (output_gcov_file): Use string type.
14944 (generate_results): Likewise.
14945 (md5sum_to_hex): Remove.
14947 2021-08-20 Michael Meissner <meissner@linux.ibm.com>
14949 * config/rs6000/altivec.md (UNSPEC_XXEVAL): Move to vsx.md.
14950 (UNSPEC_XXSPLTIW): Move to vsx.md.
14951 (UNSPEC_XXSPLTID): Move to vsx.md.
14952 (UNSPEC_XXSPLTI32DX): Move to vsx.md.
14953 (UNSPEC_XXBLEND): Move to vsx.md.
14954 (UNSPEC_XXPERMX): Move to vsx.md.
14955 (VM3): Move to vsx.md.
14956 (VM3_char): Move to vsx.md.
14957 (xxspltiw_v4si): Move to vsx.md.
14958 (xxspltiw_v4sf): Move to vsx.md.
14959 (xxspltiw_v4sf_inst): Move to vsx.md.
14960 (xxspltidp_v2df): Move to vsx.md.
14961 (xxspltidp_v2df_inst): Move to vsx.md.
14962 (xxsplti32dx_v4si_inst): Move to vsx.md.
14963 (xxsplti32dx_v4sf): Move to vsx.md.
14964 (xxsplti32dx_v4sf_inst): Move to vsx.md.
14965 (xxblend_<mode>): Move to vsx.md.
14966 (xxpermx): Move to vsx.md.
14967 (xxpermx_inst): Move to vsx.md.
14968 * config/rs6000/vsx.md (UNSPEC_XXEVAL): Move from altivec.md.
14969 (UNSPEC_XXSPLTIW): Move from altivec.md.
14970 (UNSPEC_XXSPLTID): Move from altivec.md.
14971 (UNSPEC_XXSPLTI32DX): Move from altivec.md.
14972 (UNSPEC_XXBLEND): Move from altivec.md.
14973 (UNSPEC_XXPERMX): Move from altivec.md.
14974 (VM3): Move from altivec.md.
14975 (VM3_char): Move from altivec.md.
14976 (xxspltiw_v4si): Move from altivec.md.
14977 (xxspltiw_v4sf): Move from altivec.md.
14978 (xxspltiw_v4sf_inst): Move from altivec.md.
14979 (xxspltidp_v2df): Move from altivec.md.
14980 (xxspltidp_v2df_inst): Move from altivec.md.
14981 (xxsplti32dx_v4si_inst): Move from altivec.md.
14982 (xxsplti32dx_v4sf): Move from altivec.md.
14983 (xxsplti32dx_v4sf_inst): Move from altivec.md.
14984 (xxblend_<mode>): Move from altivec.md.
14985 (xxpermx): Move from altivec.md.
14986 (xxpermx_inst): Move from altivec.md.
14988 2021-08-19 Roger Sayle <roger@nextmovesoftware.com>
14990 * tree-vect-generic.c (expand_vector_operations_1): Use either
14991 gimplify_build1 or gimplify_build2 instead of gimple_build_assign
14992 when constructing scalar splat expressions.
14994 2021-08-19 Peter Bergner <bergner@linux.ibm.com>
14997 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Cast
14998 pointer to __vector_pair *.
15000 2021-08-19 Martin Sebor <msebor@redhat.com>
15002 * gimple-range.cc: Add comments.
15003 * gimple-range.h: Same.
15005 2021-08-19 Martin Sebor <msebor@redhat.com>
15007 PR middle-end/101984
15008 * gimple-ssa-warn-access.cc (pass_waccess::execute): Also call
15011 2021-08-19 Jeff Law <jlaw@localhost.localdomain>
15013 * config.gcc (h8300-*-elf*): Do not include dbxelf.h.
15014 (h8300-*-linux*, v850-*-rtems*, v850*-elf*): Likewise.
15015 * config/v850/v850.h (DEFAULT_GDB_EXTENSIONS): Remove.
15017 2021-08-19 Jakub Jelinek <jakub@redhat.com>
15019 PR middle-end/101950
15020 * optabs.c (expand_clrsb_using_clz): New function.
15021 (expand_unop): Use it as another clrsb expansion fallback.
15023 2021-08-19 liuhongt <hongtao.liu@intel.com>
15026 2021-07-28 liuhongt <hongtao.liu@intel.com>
15029 * config/i386/i386.h (processor_costs): Add new member
15031 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
15032 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
15033 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
15034 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
15035 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
15036 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
15037 generic_cost, core_cost): Initialize integer_to_sse same value
15039 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
15040 * config/i386/i386.c (ix86_builtin_vectorization_cost):
15041 Use integer_to_sse instead of sse_op to calculate the cost of
15044 2021-08-18 Iain Sandoe <iain@sandoe.co.uk>
15046 * config.gcc: Include rpath.opt for Darwin.
15047 * config/darwin.h (DRIVER_SELF_SPECS): Handle -rpath.
15049 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
15051 PR bootstrap/101959
15052 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
15055 2021-08-18 Jonathan Wright <jonathan.wright@arm.com>
15057 * config/aarch64/arm_neon.h (vld3_lane_f64): Use float RTL
15058 pattern and type cast.
15059 (vld4_lane_f32): Use float RTL pattern.
15060 (vld4q_lane_f64): Use float type cast.
15062 2021-08-18 Jan Hubicka <hubicka@ucw.cz>
15064 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Check also
15067 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
15069 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor): Extend.
15070 (test_map_of_type_with_ctor_and_dtor_expand): Add function.
15071 (hash_map_tests_c_tests): Call it.
15073 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
15075 * ggc.h (enum ggc_collect): New.
15076 (ggc_collect): Use it.
15077 * ggc-page.c: Adjust.
15078 * ggc-common.c: Likewise.
15079 * ggc-tests.c: Likewise.
15080 * read-rtl-function.c: Likewise.
15081 * selftest-run-tests.c: Likewise.
15082 * doc/gty.texi (Invoking the garbage collector): Likewise.
15084 2021-08-18 liuhongt <hongtao.liu@intel.com>
15087 * config/i386/i386.h (TARGET_V2DF_REDUCTION_PREFER_HADDPD):
15089 * config/i386/sse.md (*sse3_haddv2df3_low): Add
15090 TARGET_V2DF_REDUCTION_PREFER_HADDPD.
15091 (*sse3_hsubv2df3_low): Ditto.
15092 * config/i386/x86-tune.def
15093 (X86_TUNE_V2DF_REDUCTION_PREFER_HADDPD): New tune.
15095 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
15097 * gimple-range-gori.cc (gori_compute::gori_compute): Enable tracing.
15098 (gori_compute::compute_operand_range): Add tracing.
15099 (gori_compute::logical_combine): Ditto.
15100 (gori_compute::compute_logical_operands): Ditto.
15101 (gori_compute::compute_operand1_range): Ditto.
15102 (gori_compute::compute_operand2_range): Ditto.
15103 (gori_compute::outgoing_edge_range_p): Ditto.
15104 * gimple-range-gori.h (class gori_compute): Add range_tracer.
15106 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
15108 * flag-types.h (enum evrp_mode): Adjust evrp-mode values.
15109 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Relocate from.
15110 * gimple-range-trace.h (DEBUG_RANGE_CACHE): Here.
15111 * params.opt (--param=evrp-mode): Adjust options.
15113 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
15115 * Makefile.in (OBJS): Add gimple-range-trace.o.
15116 * gimple-range-cache.h (enable_new_values): Remove unused prototype.
15117 * gimple-range-fold.cc: Adjust headers.
15118 * gimple-range-trace.cc: New.
15119 * gimple-range-trace.h: New.
15120 * gimple-range.cc (gimple_ranger::gimple_ranger): Enable tracer.
15121 (gimple_ranger::range_of_expr): Add tracing.
15122 (gimple_ranger::range_on_entry): Ditto.
15123 (gimple_ranger::range_on_exit): Ditto.
15124 (gimple_ranger::range_on_edge): Ditto.
15125 (gimple_ranger::fold_range_internal): Ditto.
15126 (gimple_ranger::dump_bb): Do not calculate edge range twice.
15127 (trace_ranger::*): Remove.
15128 (enable_ranger): Never create a trace_ranger.
15129 (debug_seed_ranger): Move to gimple-range-trace.cc.
15130 (dump_ranger): Ditto.
15131 (debug_ranger): Ditto.
15132 * gimple-range.h: Include gimple-range-trace.h.
15133 (range_on_entry, range_on_exit): No longer virtual.
15134 (class trace_ranger): Remove.
15135 (DEBUG_RANGE_CACHE): Move to gimple-range-trace.h.
15137 2021-08-17 Martin Sebor <msebor@redhat.com>
15139 PR middle-end/101854
15140 * builtins.c (expand_builtin_alloca): Move warning code to check_alloca
15141 in gimple-ssa-warn-access.cc.
15142 * calls.c (alloc_max_size): Move code to check_alloca.
15143 (get_size_range): Move to pointer-query.cc.
15144 (maybe_warn_alloc_args_overflow): Move to gimple-ssa-warn-access.cc.
15145 (get_attr_nonstring_decl): Move to tree.c.
15146 (fntype_argno_type): Move to gimple-ssa-warn-access.cc.
15147 (append_attrname): Same.
15148 (maybe_warn_rdwr_sizes): Same.
15149 (initialize_argument_information): Move code to
15150 gimple-ssa-warn-access.cc.
15151 * calls.h (maybe_warn_alloc_args_overflow): Move to
15152 gimple-ssa-warn-access.h.
15153 (get_attr_nonstring_decl): Move to tree.h.
15154 (maybe_warn_nonstring_arg): Move to gimple-ssa-warn-access.h.
15155 (enum size_range_flags): Move to pointer-query.h.
15156 (get_size_range): Same.
15157 * gimple-ssa-warn-access.cc (has_location): Remove unused overload
15158 to avoid Clang -Wunused-function.
15159 (get_size_range): Declare static.
15160 (maybe_emit_free_warning): Rename...
15161 (maybe_check_dealloc_call): ...to this for consistency.
15162 (class pass_waccess): Add members.
15163 (pass_waccess::~pass_waccess): Defined.
15164 (alloc_max_size): Move here from calls.c.
15165 (maybe_warn_alloc_args_overflow): Same.
15166 (check_alloca): New function.
15167 (check_alloc_size_call): New function.
15168 (check_strncat): Handle another warning flag.
15169 (pass_waccess::check_builtin): Handle alloca.
15170 (fntype_argno_type): Move here from calls.c.
15171 (append_attrname): Same.
15172 (maybe_warn_rdwr_sizes): Same.
15173 (pass_waccess::check_call): Define.
15174 (check_nonstring_args): New function.
15175 (pass_waccess::check): Call new member functions.
15176 (pass_waccess::execute): Enable ranger.
15177 * gimple-ssa-warn-access.h (get_size_range): Move here from calls.h.
15178 (maybe_warn_nonstring_arg): Same.
15179 * gimple-ssa-warn-restrict.c: Remove #include.
15180 * pointer-query.cc (get_size_range): Move here from calls.c.
15181 * pointer-query.h (enum size_range_flags): Same.
15182 (get_size_range): Same.
15183 * tree.c (get_attr_nonstring_decl): Move here from calls.c.
15184 * tree.h (get_attr_nonstring_decl): Move here from calls.h.
15186 2021-08-17 Thomas Schwinge <thomas@codesourcery.com>
15188 * ggc.h (ggc_collect): Add 'force_collect' parameter.
15189 * ggc-page.c (ggc_collect): Use that one instead of global
15190 'ggc_force_collect'. Adjust all users.
15191 * doc/gty.texi (Invoking the garbage collector): Update.
15192 * ggc-internal.h (ggc_force_collect): Remove.
15193 * ggc-common.c (ggc_force_collect): Likewise.
15194 * selftest.h (forcibly_ggc_collect): Remove.
15195 * ggc-tests.c (selftest::forcibly_ggc_collect): Likewise.
15196 * read-rtl-function.c (test_loading_labels): Adjust.
15197 * selftest-run-tests.c (run_tests): Likewise.
15199 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
15201 * config/darwin.c (darwin_file_end): Reset and reclaim the
15202 section names table at the end of compile.
15204 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
15207 * config.in: Regenerate.
15208 * config/i386/darwin.h (EXTRA_ASM_OPTS): New
15209 (ASM_SPEC): Pass options to disable branch shortening where
15211 * configure: Regenerate.
15212 * configure.ac: Detect versions of 'as' that support the
15213 optimisation which has the bug.
15215 2021-08-17 Richard Biener <rguenther@suse.de>
15217 * optabs-query.c (supports_vec_gather_load_p): Also check
15219 (supports_vec_scatter_store_p): Likewise.
15220 * tree-vect-data-refs.c (vect_gather_scatter_fn_p): Fall
15221 back to masked variants if non-masked are not supported.
15222 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
15223 When we need to use masked gather/scatter but do not have
15224 a mask set up a constant true one.
15225 * tree-vect-stmts.c (vect_check_scalar_mask): Also allow
15226 non-SSA_NAME masks.
15228 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
15230 * tree-ssa-ccp.c (bit_value_binop) [MINUS_EXPR]: Use same
15231 algorithm as PLUS_EXPR to improve subtraction bit bounds.
15232 [POINTER_DIFF_EXPR]: Treat as synonymous with MINUS_EXPR.
15234 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
15236 * tree-ssa-ccp.c (bit_value_mult_const): New helper function to
15237 calculate the mask-value pair result of a multiplication by an
15239 (bit_value_binop) [MULT_EXPR]: Call it from here for
15240 multiplications by (sparse) non-negative constants.
15242 2021-08-17 Christophe Lyon <christophe.lyon@foss.st.com>
15245 * config.gcc (gcc_cv_initfini_array): Leave undefined for
15246 uclinuxfdpiceabi targets.
15248 2021-08-17 Alexandre Oliva <oliva@adacore.com>
15250 * tree-inline.c (maybe_move_debug_stmts_to_successors): Don't
15251 reverse debug stmts.
15253 2021-08-17 Alexandre Oliva <oliva@adacore.com>
15255 * tree-cfg.c (dump_function_to_file): Use fun, not cfun.
15257 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
15259 * config/aarch64/arm_neon.h (__LD4_LANE_FUNC): Delete.
15260 (__LD4Q_LANE_FUNC): Likewise.
15261 (vld4_lane_u8): Define without macro.
15262 (vld4_lane_u16): Likewise.
15263 (vld4_lane_u32): Likewise.
15264 (vld4_lane_u64): Likewise.
15265 (vld4_lane_s8): Likewise.
15266 (vld4_lane_s16): Likewise.
15267 (vld4_lane_s32): Likewise.
15268 (vld4_lane_s64): Likewise.
15269 (vld4_lane_f16): Likewise.
15270 (vld4_lane_f32): Likewise.
15271 (vld4_lane_f64): Likewise.
15272 (vld4_lane_p8): Likewise.
15273 (vld4_lane_p16): Likewise.
15274 (vld4_lane_p64): Likewise.
15275 (vld4q_lane_u8): Likewise.
15276 (vld4q_lane_u16): Likewise.
15277 (vld4q_lane_u32): Likewise.
15278 (vld4q_lane_u64): Likewise.
15279 (vld4q_lane_s8): Likewise.
15280 (vld4q_lane_s16): Likewise.
15281 (vld4q_lane_s32): Likewise.
15282 (vld4q_lane_s64): Likewise.
15283 (vld4q_lane_f16): Likewise.
15284 (vld4q_lane_f32): Likewise.
15285 (vld4q_lane_f64): Likewise.
15286 (vld4q_lane_p8): Likewise.
15287 (vld4q_lane_p16): Likewise.
15288 (vld4q_lane_p64): Likewise.
15289 (vld4_lane_bf16): Likewise.
15290 (vld4q_lane_bf16): Likewise.
15292 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
15294 * config/aarch64/arm_neon.h (__LD3_LANE_FUNC): Delete.
15295 (__LD3Q_LANE_FUNC): Delete.
15296 (vld3_lane_u8): Define without macro.
15297 (vld3_lane_u16): Likewise.
15298 (vld3_lane_u32): Likewise.
15299 (vld3_lane_u64): Likewise.
15300 (vld3_lane_s8): Likewise.
15301 (vld3_lane_s16): Likewise.
15302 (vld3_lane_s32): Likewise.
15303 (vld3_lane_s64): Likewise.
15304 (vld3_lane_f16): Likewise.
15305 (vld3_lane_f32): Likewise.
15306 (vld3_lane_f64): Likewise.
15307 (vld3_lane_p8): Likewise.
15308 (vld3_lane_p16): Likewise.
15309 (vld3_lane_p64): Likewise.
15310 (vld3q_lane_u8): Likewise.
15311 (vld3q_lane_u16): Likewise.
15312 (vld3q_lane_u32): Likewise.
15313 (vld3q_lane_u64): Likewise.
15314 (vld3q_lane_s8): Likewise.
15315 (vld3q_lane_s16): Likewise.
15316 (vld3q_lane_s32): Likewise.
15317 (vld3q_lane_s64): Likewise.
15318 (vld3q_lane_f16): Likewise.
15319 (vld3q_lane_f32): Likewise.
15320 (vld3q_lane_f64): Likewise.
15321 (vld3q_lane_p8): Likewise.
15322 (vld3q_lane_p16): Likewise.
15323 (vld3q_lane_p64): Likewise.
15324 (vld3_lane_bf16): Likewise.
15325 (vld3q_lane_bf16): Likewise.
15327 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
15329 * config/aarch64/arm_neon.h (__LD2_LANE_FUNC): Delete.
15330 (__LD2Q_LANE_FUNC): Likewise.
15331 (vld2_lane_u8): Define without macro.
15332 (vld2_lane_u16): Likewise.
15333 (vld2_lane_u32): Likewise.
15334 (vld2_lane_u64): Likewise.
15335 (vld2_lane_s8): Likewise.
15336 (vld2_lane_s16): Likewise.
15337 (vld2_lane_s32): Likewise.
15338 (vld2_lane_s64): Likewise.
15339 (vld2_lane_f16): Likewise.
15340 (vld2_lane_f32): Likewise.
15341 (vld2_lane_f64): Likewise.
15342 (vld2_lane_p8): Likewise.
15343 (vld2_lane_p16): Likewise.
15344 (vld2_lane_p64): Likewise.
15345 (vld2q_lane_u8): Likewise.
15346 (vld2q_lane_u16): Likewise.
15347 (vld2q_lane_u32): Likewise.
15348 (vld2q_lane_u64): Likewise.
15349 (vld2q_lane_s8): Likewise.
15350 (vld2q_lane_s16): Likewise.
15351 (vld2q_lane_s32): Likewise.
15352 (vld2q_lane_s64): Likewise.
15353 (vld2q_lane_f16): Likewise.
15354 (vld2q_lane_f32): Likewise.
15355 (vld2q_lane_f64): Likewise.
15356 (vld2q_lane_p8): Likewise.
15357 (vld2q_lane_p16): Likewise.
15358 (vld2q_lane_p64): Likewise.
15359 (vld2_lane_bf16): Likewise.
15360 (vld2q_lane_bf16): Likewise.
15362 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
15364 * haifa-sched.c (advance_one_cycle): Output more context-synchronization
15367 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
15369 * haifa-sched.c (enum rfs_decision, rfs_str): Add RFS_AUTOPREF.
15370 (rank_for_schedule): Use it.
15372 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
15374 PR rtl-optimization/91598
15375 * haifa-sched.c (autopref_rank_for_schedule): Prioritize "irrelevant"
15376 insns after memory reads and before memory writes.
15378 2021-08-17 Alistair_Lee <alistair.lee@arm.com>
15380 * rtl.h (CONST_VECTOR_P): New macro.
15381 * config/aarch64/aarch64.c (aarch64_get_sve_pred_bits): Use RTL
15382 code testing macros.
15383 (aarch64_ptrue_all_mode): Likewise.
15384 (aarch64_expand_mov_immediate): Likewise.
15385 (aarch64_const_vec_all_in_range_p): Likewise.
15386 (aarch64_rtx_costs): Likewise.
15387 (aarch64_legitimate_constant_p): Likewise.
15388 (aarch64_simd_valid_immediate): Likewise.
15389 (aarch64_simd_make_constant): Likewise.
15390 (aarch64_convert_mult_to_shift): Likewise.
15391 (aarch64_expand_sve_vec_perm): Likewise.
15392 (aarch64_vec_fpconst_pow_of_2): Likewise.
15394 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
15396 PR tree-optimization/101938
15397 * range-op.cc (operator_abs::op1_range): Special case
15398 -TYPE_MIN_VALUE for flag_wrapv.
15400 2021-08-17 Kewen Lin <linkw@linux.ibm.com>
15402 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Add the cost for
15405 2021-08-17 Jakub Jelinek <jakub@redhat.com>
15407 * tree.def (OMP_SCOPE): New tree code.
15408 * tree.h (OMP_SCOPE_BODY, OMP_SCOPE_CLAUSES): Define.
15409 * tree-nested.c (convert_nonlocal_reference_stmt,
15410 convert_local_reference_stmt, convert_gimple_call): Handle
15412 * tree-pretty-print.c (dump_generic_node): Handle OMP_SCOPE.
15413 * gimple.def (GIMPLE_OMP_SCOPE): New gimple code.
15414 * gimple.c (gimple_build_omp_scope): New function.
15415 (gimple_copy): Handle GIMPLE_OMP_SCOPE.
15416 * gimple.h (gimple_build_omp_scope): Declare.
15417 (gimple_has_substatements): Handle GIMPLE_OMP_SCOPE.
15418 (gimple_omp_scope_clauses, gimple_omp_scope_clauses_ptr,
15419 gimple_omp_scope_set_clauses): New inline functions.
15420 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_SCOPE.
15421 * gimple-pretty-print.c (dump_gimple_omp_scope): New function.
15422 (pp_gimple_stmt_1): Handle GIMPLE_OMP_SCOPE.
15423 * gimple-walk.c (walk_gimple_stmt): Likewise.
15424 * gimple-low.c (lower_stmt): Likewise.
15425 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
15426 (gimplify_scan_omp_clauses): For task reductions, handle OMP_SCOPE
15427 like ORT_WORKSHARE constructs. Adjust diagnostics for %<scope%>
15428 allowing task reductions. Reject inscan reductions on scope.
15429 (omp_find_stores_stmt): Handle GIMPLE_OMP_SCOPE.
15430 (gimplify_omp_workshare, gimplify_expr): Handle OMP_SCOPE.
15431 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_SCOPE.
15432 (estimate_num_insns): Likewise.
15433 * omp-low.c (build_outer_var_ref): Look through GIMPLE_OMP_SCOPE
15434 contexts if var isn't privatized there.
15435 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_SCOPE.
15436 (scan_omp_1_stmt): Likewise.
15437 (maybe_add_implicit_barrier_cancel): Look through outer
15439 (lower_omp_scope): New function.
15440 (lower_omp_task_reductions): Handle OMP_SCOPE.
15441 (lower_omp_1): Handle GIMPLE_OMP_SCOPE.
15442 (diagnose_sb_1, diagnose_sb_2): Likewise.
15443 * omp-expand.c (expand_omp_single): Support also GIMPLE_OMP_SCOPE.
15444 (expand_omp): Handle GIMPLE_OMP_SCOPE.
15445 (omp_make_gimple_edges): Likewise.
15446 * omp-builtins.def (BUILT_IN_GOMP_SCOPE_START): New built-in.
15448 2021-08-17 Richard Biener <rguenther@suse.de>
15450 PR tree-optimization/101925
15451 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Set
15452 reverse on COMPONENT_REF and ARRAY_REF according to
15453 what reverse_storage_order_for_component_p does.
15454 (vn_reference_eq): Compare reversed on reference ops.
15455 (reverse_storage_order_for_component_p): New overload.
15456 (vn_reference_lookup_3): Check reverse_storage_order_for_component_p
15457 on the reference looked up.
15459 2021-08-17 Jeff Law <jlaw@localhost.localdomain>
15461 * config/h8300/h8300.c (shift_alg_si): Avoid loops for most SImode
15462 shifts on the H8/S.
15463 (h8300_option_override): Use loops on H8/S more often when optimizing
15465 (get_shift_alg): Handle new "special" cases on H8/S. Simplify
15466 accordingly. Handle various arithmetic right shifts with special
15467 sequences that we couldn't handle before.
15469 2021-08-16 Jeff Law <jlaw@localhost.localdomain>
15471 * config.gcc (rl78-*-elf*): Do not include dbxelf.h.
15473 2021-08-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
15475 * config/sparc/rtemself.h (SPARC_GCOV_TYPE_SIZE): Define.
15476 * config/sparc/sparc.c (sparc_gcov_type_size): New.
15477 (TARGET_GCOV_TYPE_SIZE): Redefine if SPARC_GCOV_TYPE_SIZE is defined.
15478 * coverage.c (get_gcov_type): Use targetm.gcov_type_size().
15479 * doc/tm.texi (TARGET_GCOV_TYPE_SIZE): Add hook under "Misc".
15480 * doc/tm.texi.in: Regenerate.
15481 * target.def (gcov_type_size): New target hook.
15482 * targhooks.c (default_gcov_type_size): New.
15483 * targhooks.h (default_gcov_type_size): Declare.
15484 * tree-profile.c (gimple_gen_edge_profiler): Use precision of
15486 (gimple_gen_time_profiler): Likewise.
15488 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
15490 * dwarf2out.c (add_scalar_info): Deal with DW_AT_data_bit_offset.
15492 2021-08-16 Tobias Burnus <tobias@codesourcery.com>
15494 PR middle-end/101931
15495 * omp-low.c (omp_runtime_api_call): Update for routines
15496 added in the meanwhile.
15498 2021-08-16 Martin Liska <mliska@suse.cz>
15500 PR tree-optimization/100393
15501 * tree-switch-conversion.c (group_cluster::dump): Use
15502 get_comparison_count.
15503 (jump_table_cluster::find_jump_tables): Pre-compute number of
15504 comparisons and then decrement it. Cache also max_ratio.
15505 (jump_table_cluster::can_be_handled): Change signature.
15506 * tree-switch-conversion.h (get_comparison_count): New.
15508 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
15510 * dwarf2out.c (add_data_member_location_attribute): Use GNAT
15511 encodings only when -fgnat-encodings=all is specified.
15512 (add_bound_info): Likewise.
15513 (add_byte_size_attribute): Likewise.
15514 (gen_member_die): Likewise.
15516 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
15518 * omp-oacc-neuter-broadcast.cc
15519 (execute_omp_oacc_neuter_broadcast): Plug 'par' memory leak.
15521 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
15523 * omp-oacc-neuter-broadcast.cc
15524 (execute_omp_oacc_neuter_broadcast): Clarify memory management for
15527 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
15529 * omp-oacc-neuter-broadcast.cc (field_map): Move variable into...
15530 (execute_omp_oacc_neuter_broadcast): ... here.
15531 (install_var_field, build_receiver_ref, build_sender_ref): Take
15532 'field_map_t *' parameter. Adjust all users.
15533 (worker_single_copy, neuter_worker_single): Take a
15534 'record_field_map_t *' parameter. Adjust all users.
15536 2021-08-16 liuhongt <hongtao.liu@intel.com>
15539 * config/i386/i386.md (ldexp<mode>3): Force operands[1] to
15542 2021-08-16 Martin Liska <mliska@suse.cz>
15545 * multiple_target.c (create_dispatcher_calls): Make default
15546 function local only if it is a definition.
15548 2021-08-16 Martin Liska <mliska@suse.cz>
15551 * ipa-icf-gimple.c (func_checker::compare_ssa_name): Do not
15552 consider equal SSA_NAMEs when one is a param.
15554 2021-08-16 liuhongt <hongtao.liu@intel.com>
15557 * config/i386/i386-expand.c (ix86_expand_vec_perm_vpermt2):
15558 Support vpermi2b for V32QI/V16QImode.
15559 (ix86_extract_perm_from_pool_constant): New function.
15560 (ix86_expand_vec_one_operand_perm_avx512): Support
15561 vpermw/vpermb under TARGET_AVX512BW/TARGET_AVX512VBMI.
15562 (expand_vec_perm_1): Adjust comments for upper.
15563 * config/i386/i386-protos.h (ix86_extract_perm_from_pool_constant):
15565 * config/i386/predicates.md (permvar_truncate_operand): New predicate.
15566 (pshufb_truncv4siv4hi_operand): Ditto.
15567 (pshufb_truncv8hiv8qi_operand): Ditto.
15568 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1):
15569 New pre_reload define_insn_and_split.
15570 (*avx512f_permvar_truncv8siv8hi_1): Ditto.
15571 (*avx512f_vpermvar_truncv8div8si_1): Ditto.
15572 (*avx512f_permvar_truncv32hiv32qi_1): Ditto.
15573 (*avx512f_permvar_truncv16hiv16qi_1): Ditto.
15574 (*avx512f_permvar_truncv4div4si_1): Ditto.
15575 (*avx512f_pshufb_truncv8hiv8qi_1): Ditto.
15576 (*avx512f_pshufb_truncv4siv4hi_1): Ditto.
15577 (*avx512f_pshufd_truncv2div2si_1): Ditto.
15579 2021-08-16 Kito Cheng <kito.cheng@sifive.com>
15581 * config/riscv/multilib-generator: Support code model option for
15583 * doc/install.texi: Add document of new option for
15584 --with-multilib-generator.
15586 2021-08-15 Clément Chigot <clement.chigot@atos.net>
15588 * config/rs6000/rs6000.c (xcoff_tls_exec_model_detected): New.
15589 (rs6000_legitimize_tls_address_aix): Use it.
15590 (rs6000_xcoff_file_end): Add ".ref __tls_get_addr" when
15591 xcoff_tls_exec_model_detected is true.
15593 2021-08-15 Jeff Law <jlaw@localhost.localdomain>
15595 * config/h8300/h8300.c (shift_alg_si): Retune H8/300H shifts
15596 to allow a bit more code growth, saving many dozens of cycles.
15597 (h8300_option_override): Adjus shift_alg_si if optimizing for
15599 (get_shift_alg): Use special + inline shifts for residuals
15602 2021-08-14 Stafford Horne <shorne@gmail.com>
15605 * config/or1k/or1k-opts.h: New file.
15606 * config/or1k/or1k.c (or1k_legitimize_address_1, print_reloc):
15607 Support generating gotha relocations if -mcmodel=large is
15609 * config/or1k/or1k.h (TARGET_CMODEL_SMALL, TARGET_CMODEL_LARGE):
15611 * config/or1k/or1k.opt (mcmodel=): New option.
15612 * doc/invoke.texi (OpenRISC Options): Document mcmodel.
15614 2021-08-14 Martin Sebor <msebor@redhat.com>
15616 PR middle-end/101791
15617 * gimple-ssa-warn-access.cc (new_delete_mismatch_p): Use new argument
15618 to valid_new_delete_pair_p.
15619 * tree.c (valid_new_delete_pair_p): Add argument.
15620 * tree.h (valid_new_delete_pair_p): Same.
15622 2021-08-14 Jakub Jelinek <jakub@redhat.com>
15625 * config/i386/i386-expand.c (expand_vec_perm_broadcast_1)
15626 <case E_V64QImode>: For this mode assert
15627 !TARGET_AVX512BW || d->perm[0] rather than !TARGET_AVX2 || d->perm[0].
15629 2021-08-13 Michael Meissner <meissner@linux.ibm.com>
15632 * config/rs6000/altivec.md (xxeval): Use register_predicate
15633 instead of altivec_register_predicate.
15635 2021-08-13 Martin Sebor <msebor@redhat.com>
15637 PR middle-end/101734
15638 * tree-ssa-uninit.c (maybe_warn_read_write_only): New function.
15639 (maybe_warn_operand): Call it.
15641 2021-08-13 Martin Liska <mliska@suse.cz>
15644 * attribs.c (decl_attributes): Make naked functions "noipa"
15647 2021-08-13 Martin Liska <mliska@suse.cz>
15650 * symtab.c (symtab_node::noninterposable_alias): Do not create
15651 local aliases for target_clone functions as the clonning pass
15654 2021-08-13 Martin Liska <mliska@suse.cz>
15656 * opts.c (LIVE_PATCHING_OPTION): Define.
15657 (control_options_for_live_patching): Use it in error messages.
15659 2021-08-13 Jan Hubicka <hubicka@ucw.cz>
15661 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOREAD.
15662 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
15663 ignore_stores_eaf_flags): New constants.
15664 (remove_useless_eaf_flags): New function.
15665 (eaf_flags_useful_p): Use it.
15666 (deref_flags): Add EAF_NOT_RETURNED if flag is unused;
15668 (modref_lattice::init): Add EAF_NOREAD.
15669 (modref_lattice::add_escape_point): Do not reacord escape point if
15671 (modref_lattice::merge): EAF_NOESCAPE implies EAF_NODIRECTESCAPE;
15672 use remove_useless_eaf_flags.
15673 (modref_lattice::merge_deref): Use ignore_stores_eaf_flags.
15674 (modref_lattice::merge_direct_load): Add EAF_NOREAD
15675 (analyze_ssa_name_flags): Fix handling EAF_NOT_RETURNED
15676 (analyze_parms): Use remove_useless_eaf_flags.
15677 (ipa_merge_modref_summary_after_inlining): Use ignore_stores_eaf_flags.
15678 (modref_merge_call_site_flags): Add caller and ecf_flags parameter;
15679 use remove_useless_eaf_flags.
15680 (modref_propagate_flags_in_scc): Update.
15681 * ipa-modref.h: Turn eaf_flags_t back to char.
15682 * tree-core.h (EAF_NOT_RETURNED): Fix.
15683 (EAF_NOREAD): New constant
15684 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1): Check for
15686 * tree-ssa-structalias.c (handle_rhs_call): Handle new flags.
15687 (handle_pure_call): Likewise.
15689 2021-08-12 Jakub Jelinek <jakub@redhat.com>
15691 * tree.def (OMP_MASKED): New tree code.
15692 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_FILTER.
15693 * tree.h (OMP_MASKED_BODY, OMP_MASKED_CLAUSES, OMP_MASKED_COMBINED,
15694 OMP_CLAUSE_FILTER_EXPR): Define.
15695 * tree.c (omp_clause_num_ops): Add OMP_CLAUSE_FILTER entry.
15696 (omp_clause_code_name): Likewise.
15697 (walk_tree_1): Handle OMP_CLAUSE_FILTER.
15698 * tree-nested.c (convert_nonlocal_omp_clauses,
15699 convert_local_omp_clauses): Handle OMP_CLAUSE_FILTER.
15700 (convert_nonlocal_reference_stmt, convert_local_reference_stmt,
15701 convert_gimple_call): Handle GIMPLE_OMP_MASTER.
15702 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_FILTER.
15703 (dump_generic_node): Handle OMP_MASTER.
15704 * gimple.def (GIMPLE_OMP_MASKED): New gimple code.
15705 * gimple.c (gimple_build_omp_masked): New function.
15706 (gimple_copy): Handle GIMPLE_OMP_MASKED.
15707 * gimple.h (gimple_build_omp_masked): Declare.
15708 (gimple_has_substatements): Handle GIMPLE_OMP_MASKED.
15709 (gimple_omp_masked_clauses, gimple_omp_masked_clauses_ptr,
15710 gimple_omp_masked_set_clauses): New inline functions.
15711 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_MASKED.
15712 * gimple-pretty-print.c (dump_gimple_omp_masked): New function.
15713 (pp_gimple_stmt_1): Handle GIMPLE_OMP_MASKED.
15714 * gimple-walk.c (walk_gimple_stmt): Likewise.
15715 * gimple-low.c (lower_stmt): Likewise.
15716 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
15717 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_FILTER. For clauses
15718 that take one expression rather than decl or constant, force
15719 gimplification of that into a SSA_NAME or temporary unless min
15721 (gimplify_adjust_omp_clauses): Handle OMP_CLAUSE_FILTER.
15722 (gimplify_expr): Handle OMP_MASKED.
15723 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_MASKED.
15724 (estimate_num_insns): Likewise.
15725 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_FILTER.
15726 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_MASKED. Adjust
15727 diagnostics for existence of masked construct.
15728 (scan_omp_1_stmt, lower_omp_master, lower_omp_1, diagnose_sb_1,
15729 diagnose_sb_2): Handle GIMPLE_OMP_MASKED.
15730 * omp-expand.c (expand_omp_synch, expand_omp, omp_make_gimple_edges):
15733 2021-08-12 Uroš Bizjak <ubizjak@gmail.com>
15736 * config/i386/i386.md (avx512f_scalef<mode>2): New insn pattern.
15737 (ldexp<mode>3): Use avx512f_scalef<mode>2.
15738 (UNSPEC_SCALEF): Move from ...
15739 * config/i386/sse.md (UNSPEC_SCALEF): ... here.
15741 2021-08-12 Jan Hubicka <hubicka@ucw.cz>
15743 * ipa-split.c (consider_split): Fix condition testing void functions.
15745 2021-08-12 Aldy Hernandez <aldyh@redhat.com>
15747 * doc/invoke.texi: Remove docs for threader-mode param.
15748 * flag-types.h (enum threader_mode): Remove.
15749 * params.opt: Remove threader-mode param.
15750 * tree-ssa-threadbackward.c (class back_threader): Remove
15751 path_is_unreachable_p.
15752 Make find_paths private.
15753 Add maybe_thread and thread_through_all_blocks.
15754 Remove reference marker for m_registry.
15755 Remove reference marker for m_profit.
15756 (back_threader::back_threader): Adjust for registry and profit not
15758 (dump_path): Move down.
15759 (debug): Move down.
15760 (class thread_jumps): Remove.
15761 (class back_threader_registry): Remove m_all_paths.
15763 (thread_jumps::thread_through_all_blocks): Move to back_threader
15765 (fsm_find_thread_path): Remove
15766 (back_threader::maybe_thread): New.
15767 (back_threader::thread_through_all_blocks): Move from
15769 (back_threader_registry::back_threader_registry): Remove
15771 (back_threader_registry::~back_threader_registry): Remove.
15772 (thread_jumps::find_taken_edge): Remove.
15773 (thread_jumps::check_subpath_and_update_thread_path): Remove.
15774 (thread_jumps::maybe_register_path): Remove.
15775 (thread_jumps::handle_phi): Remove.
15776 (handle_assignment_p): Remove.
15777 (thread_jumps::handle_assignment): Remove.
15778 (thread_jumps::fsm_find_control_statement_thread_paths): Remove.
15779 (thread_jumps::find_jump_threads_backwards): Remove.
15780 (thread_jumps::find_jump_threads_backwards_with_ranger): Remove.
15781 (try_thread_blocks): Rename find_jump_threads_backwards to
15783 (pass_early_thread_jumps::execute): Same.
15785 2021-08-12 Tobias Burnus <tobias@codesourcery.com>
15787 * tree-core.h (omp_clause_proc_bind_kind): Add
15788 OMP_CLAUSE_PROC_BIND_PRIMARY.
15789 * tree-pretty-print.c (dump_omp_clause): Add TODO comment to
15790 change 'master' to 'primary' in proc_bind for OpenMP 5.1.
15792 2021-08-12 Claudiu Zissulescu <claziss@synopsys.com>
15794 * common/config/arc/arc-common.c (arc_option_init_struct): Remove
15795 fno-common reference.
15796 * config/arc/arc.c (arc_override_options): Remove overriding of
15799 2021-08-12 Jakub Jelinek <jakub@redhat.com>
15802 * config/i386/i386-expand.c (ix86_expand_vec_one_operand_perm_avx512):
15803 If d->testing_p, return true after performing checks instead of
15804 actually expanding the insn.
15805 (expand_vec_perm_broadcast_1): Handle V32HImode - assert
15806 !TARGET_AVX512BW and return false.
15808 2021-08-12 Eric Botcazou <ebotcazou@gcc.gnu.org>
15810 * configure.ac (PE linker --disable-dynamicbase support): New check.
15811 * configure: Regenerate.
15812 * config.in: Likewise.
15813 * config/i386/mingw32.h (LINK_SPEC_DISABLE_DYNAMICBASE): New define.
15814 (LINK_SPEC): Use it.
15815 * config/i386/mingw-w64.h (LINK_SPEC_DISABLE_DYNAMICBASE): Likewise.
15816 (LINK_SPEC): Likewise.
15818 2021-08-12 liuhongt <hongtao.liu@intel.com>
15821 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_2): New
15822 post_reload define_insn_and_split.
15823 (*avx512bw_zero_extendv32qiv32hi2_2): Ditto.
15824 (*sse4_1_zero_extendv8qiv8hi2_4): Ditto.
15825 (*avx512f_zero_extendv16hiv16si2_2): Ditto.
15826 (*avx2_zero_extendv8hiv8si2_2): Ditto.
15827 (*sse4_1_zero_extendv4hiv4si2_4): Ditto.
15828 (*avx512f_zero_extendv8siv8di2_2): Ditto.
15829 (*avx2_zero_extendv4siv4di2_2): Ditto.
15830 (*sse4_1_zero_extendv2siv2di2_4): Ditto.
15831 (VI248_256, VI248_512, VI148_512, VI148_256, VI148_128): New
15834 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15836 * config/rs6000/rs6000-builtin-new.def: Add always, power5, and
15839 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15841 * config/rs6000/rs6000-builtin-new.def: Add vsx stanza.
15843 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15845 * config/rs6000/rs6000-builtin-new.def: Finish altivec stanza.
15846 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
15847 initialization of pcvoid_type_node here...
15848 (altivec_init_builtins): ...from here.
15849 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add
15850 RS6000_BTI_const_ptr_void.
15851 (pcvoid_type_node): New macro.
15853 2021-08-11 Richard Biener <rguenther@suse.de>
15856 * tree-ssa-forwprop.c (pass_forwprop::execute): Do not decompose
15857 hard-register accesses.
15859 2021-08-11 Richard Biener <rguenther@suse.de>
15861 * tree-ssa-operands.c (operands_scanner::get_expr_operands):
15862 Do not look at COMPONENT_REF FIELD_DECLs TREE_THIS_VOLATILE
15863 to determine has_volatile_ops.
15865 2021-08-11 Eric Botcazou <ebotcazou@gcc.gnu.org>
15867 * cfgexpand.c (expand_used_vars): Reuse attribs local variable.
15869 2021-08-11 Jan Hubicka <hubicka@ucw.cz>
15870 Alexandre Oliva <oliva@adacore.com>
15872 * ipa-modref.c (modref_lattice::dump): Fix escape_point's min_flags
15874 (modref_lattice::merge_deref): Fix handling of indirect scape points.
15875 (update_escape_summary_1): Likewise.
15876 (update_escape_summary): Likewise.
15877 (ipa_merge_modref_summary_after_inlining): Likewise.
15879 2021-08-11 Richard Biener <rguenther@suse.de>
15881 PR middle-end/101858
15882 * fold-const.c (fold_binary_loc): Guard simplification
15883 of X < (cast) (1 << Y) to integer types.
15885 2021-08-11 Richard Biener <rguenther@suse.de>
15887 PR tree-optimization/101861
15888 * tree-vect-stmts.c (vectorizable_load): Fix error in
15889 previous change with regard to gather vectorization.
15891 2021-08-11 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
15894 * config/arm/arm_neon.h (vdup_n_s8): Replace call to builtin
15896 (vdup_n_s16): Likewise.
15897 (vdup_n_s32): Likewise.
15898 (vdup_n_s64): Likewise.
15899 (vdup_n_u8): Likewise.
15900 (vdup_n_u16): Likewise.
15901 (vdup_n_u32): Likewise.
15902 (vdup_n_u64): Likewise.
15903 (vdup_n_p8): Likewise.
15904 (vdup_n_p16): Likewise.
15905 (vdup_n_p64): Likewise.
15906 (vdup_n_f16): Likewise.
15907 (vdup_n_f32): Likewise.
15908 (vdupq_n_s8): Likewise.
15909 (vdupq_n_s16): Likewise.
15910 (vdupq_n_s32): Likewise.
15911 (vdupq_n_s64): Likewise.
15912 (vdupq_n_u8): Likewise.
15913 (vdupq_n_u16): Likewise.
15914 (vdupq_n_u32): Likewise.
15915 (vdupq_n_u64): Likewise.
15916 (vdupq_n_p8): Likewise.
15917 (vdupq_n_p16): Likewise.
15918 (vdupq_n_p64): Likewise.
15919 (vdupq_n_f16): Likewise.
15920 (vdupq_n_f32): Likewise.
15921 (vmov_n_s8): Replace call to builtin with call to corresponding
15923 (vmov_n_s16): Likewise.
15924 (vmov_n_s32): Likewise.
15925 (vmov_n_s64): Likewise.
15926 (vmov_n_u8): Likewise.
15927 (vmov_n_u16): Likewise.
15928 (vmov_n_u32): Likewise.
15929 (vmov_n_u64): Likewise.
15930 (vmov_n_p8): Likewise.
15931 (vmov_n_p16): Likewise.
15932 (vmov_n_f16): Likewise.
15933 (vmov_n_f32): Likewise.
15934 (vmovq_n_s8): Likewise.
15935 (vmovq_n_s16): Likewise.
15936 (vmovq_n_s32): Likewise.
15937 (vmovq_n_s64): Likewise.
15938 (vmovq_n_u8): Likewise.
15939 (vmovq_n_u16): Likewise.
15940 (vmovq_n_u32): Likewise.
15941 (vmovq_n_u64): Likewise.
15942 (vmovq_n_p8): Likewise.
15943 (vmovq_n_p16): Likewise.
15944 (vmovq_n_f16): Likewise.
15945 (vmovq_n_f32): Likewise.
15946 * config/arm/arm_neon_builtins.def: Remove entries for vdup_n.
15948 2021-08-11 liuhongt <hongtao.liu@intel.com>
15951 * config/i386/i386.md (ldexp<mode>3): Extend to vscalefs[sd]
15952 when TARGET_AVX512F and TARGET_SSE_MATH.
15954 2021-08-10 Jakub Jelinek <jakub@redhat.com>
15957 * config/i386/i386-expand.c (expand_vec_perm_even_odd): Return false
15958 for V32HImode if !TARGET_AVX512BW.
15959 (ix86_vectorize_vec_perm_const) <case E_V32HImode, case E_V64QImode>:
15960 If !TARGET_AVX512BW and TARGET_AVX512F and d.testing_p, don't fail
15961 early, but actually check the permutation.
15963 2021-08-10 Richard Biener <rguenther@suse.de>
15965 PR tree-optimization/101809
15966 * tree-vect-stmts.c (get_load_store_type): Allow emulated
15967 gathers with offset vector nunits being a constant multiple
15968 of the data vector nunits.
15969 (vect_get_gather_scatter_ops): Use the appropriate nunits
15970 for the offset vector defs.
15971 (vectorizable_store): Adjust call to
15972 vect_get_gather_scatter_ops.
15973 (vectorizable_load): Likewise. Handle the case of less
15974 offset vectors than data vectors.
15976 2021-08-10 Jakub Jelinek <jakub@redhat.com>
15979 * config/i386/sse.md (*avx512f_shuf_<shuffletype>64x2_1<mask_name>_1,
15980 *avx512f_shuf_<shuffletype>32x4_1<mask_name>_1): New define_insn
15983 2021-08-10 Richard Biener <rguenther@suse.de>
15985 PR tree-optimization/101801
15986 PR tree-optimization/101819
15987 * tree-vectorizer.h (vect_emulated_vector_p): Declare.
15988 * tree-vect-loop.c (vect_emulated_vector_p): New function.
15989 (vectorizable_reduction): Re-instantiate a check for emulated
15991 * tree-vect-stmts.c (vectorizable_shift): Likewise.
15992 (vectorizable_operation): Likewise. Cost emulated vector
15993 operations according to the scalar sequence synthesized by
15996 2021-08-10 Richard Biener <rguenther@suse.de>
15998 PR middle-end/101824
15999 * tree-nested.c (get_frame_field): Mark the COMPONENT_REF as
16000 volatile in case the variable was.
16002 2021-08-10 H.J. Lu <hjl.tools@gmail.com>
16005 * config/i386/constraints.md (BC): Document for integer SSE
16006 constant all bits set operand.
16007 (BF): New constraint for const floating-point all bits set
16009 * config/i386/i386.c (standard_sse_constant_p): Likewise.
16010 (standard_sse_constant_opcode): Likewise.
16011 * config/i386/sse.md (sseconstm1): New mode attribute.
16012 (mov<mode>_internal): Replace BC with <sseconstm1>.
16014 2021-08-10 liuhongt <hongtao.liu@intel.com>
16016 * config/i386/sse.md (cond_<insn><mode>): New expander.
16017 (VI248_AVX512VLBW): New mode iterator.
16018 * config/i386/predicates.md
16019 (nonimmediate_or_const_vec_dup_operand): New predicate.
16021 2021-08-09 Andrew MacLeod <amacleod@redhat.com>
16023 PR tree-optimization/101741
16024 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Check
16025 type of parameter for toupper/tolower.
16027 2021-08-09 Martin Jambor <mjambor@suse.cz>
16029 PR testsuite/101654
16030 * ipa-prop.c (propagate_controlled_uses): Removed a spurious space.
16032 2021-08-09 Pat Haugen <pthaugen@linux.ibm.com>
16034 * config/rs6000/rs6000.c (is_load_insn1): Verify destination is a
16036 (is_store_insn1): Verify source is a register.
16038 2021-08-09 Uroš Bizjak <ubizjak@gmail.com>
16041 * config/i386/mmx.md (<any_logic:code>v2sf3):
16042 Rename from *mmx_<any_logic:code>v2sf3
16044 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16046 * config/nvptx/nvptx.c: Cross-reference parts adapted in
16047 'gcc/omp-oacc-neuter-broadcast.cc'.
16048 * omp-low.c: Likewise.
16049 * omp-oacc-neuter-broadcast.cc: Cross-reference parts adapted from
16052 2021-08-09 Julian Brown <julian@codesourcery.com>
16053 Kwok Cheung Yeung <kcy@codesourcery.com>
16054 Thomas Schwinge <thomas@codesourcery.com>
16056 * config/gcn/gcn.c (gcn_init_builtins): Override decls for
16057 BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START,
16058 BUILT_IN_GOACC_SINGLE_COPY_END and BUILT_IN_GOACC_BARRIER.
16059 (gcn_goacc_validate_dims): Turn on worker partitioning unconditionally.
16060 (gcn_fork_join): Update comment.
16061 * config/gcn/gcn.opt (flag_worker_partitioning): Remove.
16062 (macc_experimental_workers): Remove unused option.
16064 2021-08-09 Julian Brown <julian@codesourcery.com>
16065 Nathan Sidwell <nathan@codesourcery.com> (via 'gcc/config/nvptx/nvptx.c' master)
16066 Kwok Cheung Yeung <kcy@codesourcery.com>
16067 Thomas Schwinge <thomas@codesourcery.com>
16069 * Makefile.in (OBJS): Add omp-oacc-neuter-broadcast.o.
16070 * doc/tm.texi.in (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD):
16071 Add documentation hook.
16072 * doc/tm.texi: Regenerate.
16073 * omp-oacc-neuter-broadcast.cc: New file.
16074 * omp-builtins.def (BUILT_IN_GOACC_BARRIER)
16075 (BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START)
16076 (BUILT_IN_GOACC_SINGLE_COPY_END): New builtins.
16077 * passes.def (pass_omp_oacc_neuter_broadcast): Add pass.
16078 * target.def (goacc.create_worker_broadcast_record): Add target
16080 * tree-pass.h (make_pass_omp_oacc_neuter_broadcast): Add
16082 * config/gcn/gcn-protos.h (gcn_goacc_adjust_propagation_record):
16083 Rename prototype to...
16084 (gcn_goacc_create_worker_broadcast_record): ... this.
16085 * config/gcn/gcn-tree.c (gcn_goacc_adjust_propagation_record): Rename
16087 (gcn_goacc_create_worker_broadcast_record): ... this.
16088 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_PROPAGATION_RECORD):
16090 (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD): ... this.
16092 2021-08-09 Tejas Belagod <tejas.belagod@arm.com>
16095 * config/aarch64/aarch64-simd.md (vlshr<mode>3, vashr<mode>3): Use
16096 the right iterator.
16098 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16100 * Makefile.in (GTFILES): Remove '$(srcdir)/omp-offload.c'.
16102 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16104 * builtins.def (DEF_GOACC_BUILTIN, DEF_GOMP_BUILTIN): Don't
16105 consider '-foffload-abi'.
16106 * common.opt (-foffload-abi): Remove 'Var', 'Init'.
16107 * opts.c (common_handle_option) <-foffload-abi> [ACCEL_COMPILER]:
16110 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16112 * optc-gen.awk: Sanity check that 'Init' doesn't appear without
16115 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16117 * omp-builtins.def (BUILT_IN_ACC_GET_DEVICE_TYPE): Remove.
16119 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16121 * doc/gty.texi (Files): Update.
16123 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
16125 * doc/gty.texi (Files): Fix GTY header file example.
16127 2021-08-09 Roger Sayle <roger@nextmovesoftware.com>
16129 * tree-ssa-ccp.c (value_mask_to_min_max): Helper function to
16130 determine the upper and lower bounds from a mask-value pair.
16131 (bit_value_unop) [ABS_EXPR, ABSU_EXPR]: Add support for
16132 absolute value and unsigned absolute value expressions.
16133 (bit_value_binop): Initialize *VAL's precision.
16134 [LT_EXPR, LE_EXPR]: Use value_mask_to_min_max to determine
16135 upper and lower bounds of operands. Add LE_EXPR/GE_EXPR
16136 support when the operands are unknown but potentially equal.
16137 [MIN_EXPR, MAX_EXPR]: Support minimum/maximum expressions.
16139 2021-08-09 Bin Cheng <bin.cheng@linux.alibaba.com>
16141 * config/aarch64/aarch64.md
16142 (*extend<SHORT:mode><GPI:mode>2_aarch64): Use %<GPI:w>0.
16144 2021-08-08 Sergei Trofimovich <siarheit@google.com>
16146 * lra-constraints.c: Fix s/otput/output/ typo.
16148 2021-08-06 Martin Sebor <msebor@redhat.com>
16150 * builtins.c (expand_builtin_memchr): Move to gimple-ssa-warn-access.cc.
16151 (expand_builtin_strcat): Same.
16152 (expand_builtin_stpncpy): Same.
16153 (expand_builtin_strncat): Same.
16154 (check_read_access): Same.
16155 (check_memop_access): Same.
16156 (expand_builtin_strlen): Move checks to gimple-ssa-warn-access.cc.
16157 (expand_builtin_strnlen): Same.
16158 (expand_builtin_memcpy): Same.
16159 (expand_builtin_memmove): Same.
16160 (expand_builtin_mempcpy): Same.
16161 (expand_builtin_strcpy): Same.
16162 (expand_builtin_strcpy_args): Same.
16163 (expand_builtin_stpcpy_1): Same.
16164 (expand_builtin_strncpy): Same.
16165 (expand_builtin_memset): Same.
16166 (expand_builtin_bzero): Same.
16167 (expand_builtin_strcmp): Same.
16168 (expand_builtin_strncmp): Same.
16169 (expand_builtin): Remove handlers.
16170 (fold_builtin_strlen): Add a comment.
16171 * builtins.h (check_access): Move to gimple-ssa-warn-access.cc.
16172 * calls.c (maybe_warn_nonstring_arg): Same.
16173 * diagnostic-spec.c (nowarn_spec_t::nowarn_spec_t): Add warning option.
16174 * gimple-fold.c (gimple_fold_builtin_strcpy): Pass argument to callee.
16175 (gimple_fold_builtin_stpcpy): Same.
16176 * gimple-ssa-warn-access.cc (has_location): New function.
16177 (get_location): Same.
16178 (get_callee_fndecl): Same.
16179 (call_nargs): Same.
16181 (warn_string_no_nul): Define.
16182 (unterminated_array): Same.
16183 (check_nul_terminated_array): Same.
16184 (maybe_warn_nonstring_arg): Same.
16185 (maybe_warn_for_bound): Same.
16186 (warn_for_access): Same.
16187 (check_access): Same.
16188 (check_memop_access): Same.
16189 (check_read_access): Same.
16190 (warn_dealloc_offset): Use helper functions.
16191 (maybe_emit_free_warning): Same.
16192 (class pass_waccess): Add members.
16193 (check_strcat): New function.
16194 (check_strncat): New function.
16195 (check_stxcpy): New function.
16196 (check_stxncpy): New function.
16197 (check_strncmp): New function.
16198 (pass_waccess::check_builtin): New function.
16199 (pass_waccess::check): Call it.
16200 * gimple-ssa-warn-access.h (warn_string_no_nul): Move here from
16202 (maybe_warn_for_bound): Same.
16203 (check_access): Same.
16204 (check_memop_access): Same.
16205 (check_read_access): Same.
16206 * pointer-query.h (struct access_data): Define a ctor overload.
16208 2021-08-06 Richard Biener <rguenther@suse.de>
16210 PR tree-optimization/101801
16211 * tree-vectorizer.h (vect_worthwhile_without_simd_p): Rename...
16212 (vect_can_vectorize_without_simd_p): ... to this.
16213 * tree-vect-loop.c (vect_worthwhile_without_simd_p): Rename...
16214 (vect_can_vectorize_without_simd_p): ... to this and fold
16215 in vect_min_worthwhile_factor.
16216 (vect_min_worthwhile_factor): Remove.
16217 (vectorizable_reduction): Adjust and remove the cost part.
16218 * tree-vect-stmts.c (vectorizable_shift): Likewise.
16219 (vectorizable_operation): Likewise.
16221 2021-08-06 Uroš Bizjak <ubizjak@gmail.com>
16224 * config/i386/i386.md (cmove reg-to-reg move elimination peephole2s):
16225 Add general_gr_operand predicate to operand 3.
16227 2021-08-06 Roger Sayle <roger@nextmovesoftware.com>
16229 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern): Use
16230 CFN_BUILT_IN_CLRSB* instead of BUILT_IN_CLRSB* for consistency.
16232 2021-08-06 Tamar Christina <tamar.christina@arm.com>
16234 * config/aarch64/aarch64-sve-builtins.cc (register_svpattern,
16235 register_svprfop): Pass vec<> by pointer.
16236 * langhooks-def.h (lhd_simulate_enum_decl): Likewise.
16237 * langhooks.c (lhd_simulate_enum_decl): Likewise.
16238 * langhooks.h (struct lang_hooks_for_types): Likewise.
16240 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
16242 * config/aarch64/arm_neon.h (vst1_bf16_x2): Use
16243 __builtin_memcpy instead of constructing an additional
16244 __builtin_aarch64_simd_oi one vector at a time.
16245 (vst1q_bf16_x2): Likewise.
16246 (vst1_bf16_x3): Use __builtin_memcpy instead of constructing
16247 an additional __builtin_aarch64_simd_ci one vector at a time.
16248 (vst1q_bf16_x3): Likewise.
16249 (vst1_bf16_x4): Use __builtin_memcpy instead of a union.
16250 (vst1q_bf16_x4): Likewise.
16251 (vst2_bf16): Use __builtin_memcpy instead of constructing an
16252 additional __builtin_aarch64_simd_oi one vector at a time.
16253 (vst2q_bf16): Likewise.
16254 (vst3_bf16): Use __builtin_memcpy instead of constructing an
16255 additional __builtin_aarch64_simd_ci mode one vector at a
16257 (vst3q_bf16): Likewise.
16258 (vst4_bf16): Use __builtin_memcpy instead of constructing an
16259 additional __builtin_aarch64_simd_xi one vector at a time.
16260 (vst4q_bf16): Likewise.
16262 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
16264 * config/aarch64/arm_neon.h (__ST2_LANE_FUNC): Delete.
16265 (__ST2Q_LANE_FUNC): Delete.
16266 (vst2_lane_f16): Use __builtin_memcpy to copy vector
16267 structure instead of constructing __builtin_aarch64_simd_oi
16268 one vector at a time.
16269 (vst2_lane_f32): Likewise.
16270 (vst2_lane_f64): Likewise.
16271 (vst2_lane_p8): Likewise.
16272 (vst2_lane_p16): Likewise.
16273 (vst2_lane_p64): Likewise.
16274 (vst2_lane_s8): Likewise.
16275 (vst2_lane_s16): Likewise.
16276 (vst2_lane_s32): Likewise.
16277 (vst2_lane_s64): Likewise.
16278 (vst2_lane_u8): Likewise.
16279 (vst2_lane_u16): Likewise.
16280 (vst2_lane_u32): Likewise.
16281 (vst2_lane_u64): Likewise.
16282 (vst2_lane_bf16): Likewise.
16283 (vst2q_lane_f16): Use __builtin_memcpy to copy vector
16284 structure instead of using a union.
16285 (vst2q_lane_f32): Likewise.
16286 (vst2q_lane_f64): Likewise.
16287 (vst2q_lane_p8): Likewise.
16288 (vst2q_lane_p16): Likewise.
16289 (vst2q_lane_p64): Likewise.
16290 (vst2q_lane_s8): Likewise.
16291 (vst2q_lane_s16): Likewise.
16292 (vst2q_lane_s32): Likewise.
16293 (vst2q_lane_s64): Likewise.
16294 (vst2q_lane_u8): Likewise.
16295 (vst2q_lane_u16): Likewise.
16296 (vst2q_lane_u32): Likewise.
16297 (vst2q_lane_u64): Likewise.
16298 (vst2q_lane_bf16): Likewise.
16300 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
16302 * config/aarch64/arm_neon.h (__ST3_LANE_FUNC): Delete.
16303 (__ST3Q_LANE_FUNC): Delete.
16304 (vst3_lane_f16): Use __builtin_memcpy to copy vector
16305 structure instead of constructing __builtin_aarch64_simd_ci
16306 one vector at a time.
16307 (vst3_lane_f32): Likewise.
16308 (vst3_lane_f64): Likewise.
16309 (vst3_lane_p8): Likewise.
16310 (vst3_lane_p16): Likewise.
16311 (vst3_lane_p64): Likewise.
16312 (vst3_lane_s8): Likewise.
16313 (vst3_lane_s16): Likewise.
16314 (vst3_lane_s32): Likewise.
16315 (vst3_lane_s64): Likewise.
16316 (vst3_lane_u8): Likewise.
16317 (vst3_lane_u16): Likewise.
16318 (vst3_lane_u32): Likewise.
16319 (vst3_lane_u64): Likewise.
16320 (vst3_lane_bf16): Likewise.
16321 (vst3q_lane_f16): Use __builtin_memcpy to copy vector
16322 structure instead of using a union.
16323 (vst3q_lane_f32): Likewise.
16324 (vst3q_lane_f64): Likewise.
16325 (vst3q_lane_p8): Likewise.
16326 (vst3q_lane_p16): Likewise.
16327 (vst3q_lane_p64): Likewise.
16328 (vst3q_lane_s8): Likewise.
16329 (vst3q_lane_s16): Likewise.
16330 (vst3q_lane_s32): Likewise.
16331 (vst3q_lane_s64): Likewise.
16332 (vst3q_lane_u8): Likewise.
16333 (vst3q_lane_u16): Likewise.
16334 (vst3q_lane_u32): Likewise.
16335 (vst3q_lane_u64): Likewise.
16336 (vst3q_lane_bf16): Likewise.
16338 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
16340 * config/aarch64/arm_neon.h (__ST4_LANE_FUNC): Delete.
16341 (__ST4Q_LANE_FUNC): Delete.
16342 (vst4_lane_f16): Use __builtin_memcpy to copy vector
16343 structure instead of constructing __builtin_aarch64_simd_xi
16344 one vector at a time.
16345 (vst4_lane_f32): Likewise.
16346 (vst4_lane_f64): Likewise.
16347 (vst4_lane_p8): Likewise.
16348 (vst4_lane_p16): Likewise.
16349 (vst4_lane_p64): Likewise.
16350 (vst4_lane_s8): Likewise.
16351 (vst4_lane_s16): Likewise.
16352 (vst4_lane_s32): Likewise.
16353 (vst4_lane_s64): Likewise.
16354 (vst4_lane_u8): Likewise.
16355 (vst4_lane_u16): Likewise.
16356 (vst4_lane_u32): Likewise.
16357 (vst4_lane_u64): Likewise.
16358 (vst4_lane_bf16): Likewise.
16359 (vst4q_lane_f16): Use __builtin_memcpy to copy vector
16360 structure instead of using a union.
16361 (vst4q_lane_f32): Likewise.
16362 (vst4q_lane_f64): Likewise.
16363 (vst4q_lane_p8): Likewise.
16364 (vst4q_lane_p16): Likewise.
16365 (vst4q_lane_p64): Likewise.
16366 (vst4q_lane_s8): Likewise.
16367 (vst4q_lane_s16): Likewise.
16368 (vst4q_lane_s32): Likewise.
16369 (vst4q_lane_s64): Likewise.
16370 (vst4q_lane_u8): Likewise.
16371 (vst4q_lane_u16): Likewise.
16372 (vst4q_lane_u32): Likewise.
16373 (vst4q_lane_u64): Likewise.
16374 (vst4q_lane_bf16): Likewise.
16376 2021-08-06 Martin Liska <mliska@suse.cz>
16378 * config/rs6000/rs6000.c (rs6000_option_override_internal): When
16379 a target option is restored, it can have
16380 rs6000_long_double_type_size set to FLOAT_PRECISION_TFmode
16381 and error should not be emitted.
16383 2021-08-06 Sebastian Huber <sebastian.huber@embedded-brains.de>
16385 * gcov-io.h (gcov_write): Declare.
16386 * gcov-io.c (gcov_write): New.
16387 (gcov_write_counter): Remove.
16388 (gcov_write_tag_length): Likewise.
16389 (gcov_write_summary): Replace gcov_write_tag_length() with calls to
16390 gcov_write_unsigned().
16391 * doc/invoke.texi (fprofile-info-section): Mention
16392 __gcov_info_to_gdca().
16394 2021-08-06 Martin Sebor <msebor@redhat.com>
16396 * dominance.c (prune_bbs_to_update_dominators): Adjust by-value vec
16397 arguments to by-reference.
16398 (iterate_fix_dominators): Same.
16399 * dominance.h (iterate_fix_dominators): Same.
16400 * ipa-prop.h: Call auto_vec::to_vec_legacy.
16401 * tree-data-ref.c (dump_data_dependence_relation): Adjust by-value vec
16402 arguments to by-reference.
16403 (debug_data_dependence_relation): Same.
16404 (dump_data_dependence_relations): Same.
16405 * tree-data-ref.h (debug_data_dependence_relation): Same.
16406 (dump_data_dependence_relations): Same.
16407 * tree-predcom.c (dump_chains): Same.
16408 (initialize_root_vars_lm): Same.
16409 (determine_unroll_factor): Same.
16410 (replace_phis_by_defined_names): Same.
16411 (insert_init_seqs): Same.
16412 (pcom_worker::tree_predictive_commoning_loop): Call
16413 auto_vec::to_vec_legacy.
16414 * tree-ssa-pre.c (insert_into_preds_of_block): Adjust by-value vec
16415 arguments to by-reference.
16416 * tree-ssa-threadbackward.c (populate_worklist): Same.
16417 (back_threader::resolve_def): Same.
16418 * tree-vect-data-refs.c (vect_check_nonzero_value): Same.
16419 (vect_enhance_data_refs_alignment): Same.
16420 (vect_check_lower_bound): Same.
16421 (vect_prune_runtime_alias_test_list): Same.
16422 (vect_permute_store_chain): Same.
16423 * tree-vect-slp-patterns.c (vect_normalize_conj_loc): Same.
16424 * tree-vect-stmts.c (vect_create_vectorized_demotion_stmts): Same.
16425 * tree-vectorizer.h (vect_permute_store_chain): Same.
16426 * vec.c (test_init): New function.
16427 (vec_c_tests): Call new function.
16428 * vec.h (vec): Declare ctors, dtor, and assignment.
16429 (auto_vec::vec_to_legacy): New function.
16430 (vec::copy): Adjust initialization.
16432 2021-08-05 H.J. Lu <hjl.tools@gmail.com>
16435 * config/i386/i386.c (ix86_can_inline_p): Ignore MASK_80387 if
16436 callee only uses GPRs.
16437 * config/i386/ia32intrin.h: Revert commit 5463cee2770.
16438 * config/i386/serializeintrin.h: Revert commit 71958f740f1.
16439 * config/i386/x86gprintrin.h: Add
16440 #pragma GCC target("general-regs-only") and #pragma GCC pop_options
16441 to disable non-GPR ISAs.
16443 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
16445 PR middle-end/101787
16446 * doc/md.texi (cond_ashl, cond_ashr, cond_lshr): Document.
16448 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
16450 * tree-vectorizer.h (vect_is_store_elt_extraction, vect_is_reduction)
16451 (vect_reduc_type, vect_embedded_comparison_type, vect_comparison_type)
16452 (vect_is_extending_load, vect_is_integer_truncation): New functions,
16453 moved from aarch64.c but given different names.
16454 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction)
16455 (aarch64_is_reduction, aarch64_reduc_type)
16456 (aarch64_embedded_comparison_type, aarch64_comparison_type)
16457 (aarch64_extending_load_p, aarch64_integer_truncation_p): Delete
16458 in favor of the above. Update callers accordingly.
16460 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
16463 * config/arm/arm-cpus.in (generic-armv7-a): Add quirk to suppress
16464 writing .cpu directive in asm output.
16465 * config/arm/arm.c (arm_identify_fpu_from_isa): New variable.
16466 (arm_last_printed_arch_string): Delete.
16467 (arm_last-printed_fpu_string): Delete.
16468 (arm_configure_build_target): If use of floating-point/SIMD is
16469 disabled, remove all fp/simd related features from the target ISA.
16470 (last_arm_targ_options): New variable.
16471 (arm_print_asm_arch_directives): Add new parameters. Change order
16472 of emitted directives and handle all cases here.
16473 (arm_file_start): Always call arm_print_asm_arch_directives, move
16474 all generation of .arch/.arch_extension here.
16475 (arm_file_end): Call arm_print_asm_arch.
16476 (arm_declare_function_name): Call arm_print_asm_arch_directives
16477 instead of printing .arch/.fpu directives directly.
16479 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
16481 * config/arm/arm.c (arm_configure_build_target): Don't call
16482 arm_option_reconfigure_globals.
16483 (arm_option_restore): Call arm_option_reconfigure_globals after
16484 reconfiguring the target.
16485 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
16487 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
16489 * config/arm/arm.c (arm_configure_build_target): Ensure the target's
16490 arch_name is always set.
16492 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
16494 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
16495 of vec_select high-half from being added into Neon subtract
16498 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
16500 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
16501 of vec_select high-half from being added into Neon add cost.
16503 2021-08-05 Kewen Lin <linkw@linux.ibm.com>
16505 * cfgloop.h (loops_list::loops_list): Add one optional argument
16506 root and adjust accordingly, update loop tree walking and factor
16508 * cfgloop.c (loops_list::walk_loop_tree): ... this. New function.
16510 2021-08-05 Eric Botcazou <ebotcazou@gcc.gnu.org>
16512 PR tree-optimization/101626
16513 * tree-sra.c (propagate_subaccesses_from_rhs): Do not set the
16514 reverse scalar storage order on a pointer or vector component.
16516 2021-08-05 liuhongt <hongtao.liu@intel.com>
16518 * config/i386/sse.md (cond_<code><mode>): New expander.
16520 2021-08-05 liuhongt <hongtao.liu@intel.com>
16522 * config/i386/sse.md (cond_<code><mode>): New expander.
16524 2021-08-05 liuhongt <hongtao.liu@intel.com>
16526 * config/i386/sse.md (cond_<code><mode>): New expander.
16528 2021-08-04 David Malcolm <dmalcolm@redhat.com>
16531 * Makefile.in (ANALYZER_OBJS): Add analyzer/region-model-asm.o.
16533 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
16536 * config/i386/i386.h (STORE_MAX_PIECES): Allow 16/32/64 bytes
16537 only if TARGET_INTER_UNIT_MOVES_TO_VEC is true.
16539 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
16542 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
16543 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
16544 data with SSE register from one memory location to another.
16546 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16548 * config/s390/s390.c (expand_perm_with_vpdi): New function.
16549 (vectorize_vec_perm_const_1): Call expand_perm_with_vpdi.
16550 * config/s390/vector.md (*vpdi1<mode>, @vpdi1<mode>): Enable a
16551 parameterized expander.
16552 (*vpdi4<mode>, @vpdi4<mode>): Likewise.
16554 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16556 * config/s390/s390.c (MAX_VECT_LEN): Define macro.
16557 (struct expand_vec_perm_d): Define struct.
16558 (expand_perm_with_merge): New function.
16559 (vectorize_vec_perm_const_1): New function.
16560 (s390_vectorize_vec_perm_const): New function.
16561 (TARGET_VECTORIZE_VEC_PERM_CONST): Define target macro.
16563 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16565 * config/s390/vector.md (V_HW_64): Remove mode iterator.
16566 (*vec_load_pair<mode>): Use V_HW_2 instead of V_HW_64.
16567 * config/s390/vx-builtins.md
16568 (vec_scatter_element<V_HW_2:mode>_SI): Use V_HW_2 instead of
16571 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16573 * config/s390/s390.md (UNSPEC_VEC_PERMI): Remove constant
16575 * config/s390/vector.md (*vpdi1<mode>, *vpdi4<mode>): New pattern
16577 * config/s390/vx-builtins.md (*vec_permi<mode>): Emit generic rtx
16578 instead of an unspec.
16580 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16582 * config/s390/s390-modes.def: Add more vector modes to support
16583 concatenation of two vectors.
16584 * config/s390/s390-protos.h (s390_expand_merge_perm_const): Add
16586 (s390_expand_merge): Likewise.
16587 * config/s390/s390.c (s390_expand_merge_perm_const): New function.
16588 (s390_expand_merge): New function.
16589 * config/s390/s390.md (UNSPEC_VEC_MERGEH, UNSPEC_VEC_MERGEL):
16590 Remove constant definitions.
16591 * config/s390/vector.md (V_HW_2): Add mode iterators.
16592 (VI_HW_4, V_HW_4): Rename VI_HW_4 to V_HW_4.
16593 (vec_2x_nelts, vec_2x_wide): New mode attributes.
16594 (*vmrhb, *vmrlb, *vmrhh, *vmrlh, *vmrhf, *vmrlf, *vmrhg, *vmrlg):
16595 New pattern definitions.
16596 (vec_widen_umult_lo_<mode>, vec_widen_umult_hi_<mode>)
16597 (vec_widen_smult_lo_<mode>, vec_widen_smult_hi_<mode>)
16598 (vec_unpacks_lo_v4sf, vec_unpacks_hi_v4sf, vec_unpacks_lo_v2df)
16599 (vec_unpacks_hi_v2df): Adjust expanders to emit non-unspec RTX for
16601 * config/s390/vx-builtins.md (V_HW_4): Remove mode iterator. Now
16603 (vec_mergeh<mode>, vec_mergel<mode>): Use s390_expand_merge to
16604 emit vec merge pattern.
16606 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
16608 * config/aarch64/aarch64.c (aarch64_strip_extend_vec_half):
16610 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent cost of
16611 vec_select high-half from being added into Neon multiply
16613 * rtlanal.c (vec_series_highpart_p): Define.
16614 * rtlanal.h (vec_series_highpart_p): Declare.
16616 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
16618 * config/aarch64/aarch64.c (aarch64_strip_duplicate_vec_elt):
16620 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent
16621 vec_select cost from being added into Neon multiply cost.
16623 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16625 * tree-vect-loop.c (vect_better_loop_vinfo_p): Detect cases in
16626 which old_loop_vinfo is an epilogue loop that handles a constant
16627 number of iterations.
16629 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16631 * tree-vect-loop.c (vect_analyze_loop): Print a dump message
16632 when a reanalyzed loop fails to be cheaper than the current
16635 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16637 * config/aarch64/aarch64.c: Fix a typo.
16639 2021-08-04 Vincent Lefèvre <vincent-gcc@vinc17.net>
16641 PR gcov-profile/101773
16642 * gcov-io.c (gcov_close): Check return code of a fclose.
16644 2021-08-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
16647 * dwarf2out.c (dwarf2out_assembly_start): Emit a dummy
16648 .file statement when needed.
16650 2021-08-04 Richard Biener <rguenther@suse.de>
16652 * tree-vect-data-refs.c (vect_check_gather_scatter):
16653 Include widening conversions only when the result is
16654 still handed by native gather or the current offset
16655 size not already matches the data size.
16656 Also succeed analysis in case there's no native support,
16657 noted by a IFN_LAST ifn and a NULL decl.
16658 (vect_analyze_data_refs): Always consider gathers.
16659 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
16660 Test for no IFN gather rather than decl gather.
16661 * tree-vect-stmts.c (vect_model_load_cost): Pass in the
16662 gather-scatter info and cost emulated gathers accordingly.
16663 (vect_truncate_gather_scatter_offset): Properly test for
16665 (vect_use_strided_gather_scatters_p): Likewise.
16666 (get_load_store_type): Handle emulated gathers and its
16668 (vectorizable_load): Likewise. Emulate them by extracting
16669 scalar offsets, doing scalar loads and a vector construct.
16671 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
16674 * expr.c (op_by_pieces_d::op_by_pieces_d): Add a max_pieces
16675 argument to set m_max_size.
16676 (move_by_pieces_d): Pass MOVE_MAX_PIECES to op_by_pieces_d.
16677 (store_by_pieces_d): Pass STORE_MAX_PIECES to op_by_pieces_d.
16678 (compare_by_pieces_d): Pass COMPARE_MAX_PIECES to op_by_pieces_d.
16680 2021-08-04 Roger Sayle <roger@nextmovesoftware.com>
16681 Marc Glisse <marc.glisse@inria.fr>
16683 * match.pd (bit_ior, bit_xor): Canonicalize (X*C1)|(X*C2) and
16684 (X*C1)^(X*C2) as X*(C1+C2), and related variants, using
16685 tree_nonzero_bits to ensure that operands are bit-wise disjoint.
16687 2021-08-04 Richard Biener <rguenther@suse.de>
16689 * tree-ssa-forwprop.c (pass_forwprop::execute): Split
16690 out code to decompose vector loads ...
16691 (optimize_vector_load): ... here. Generalize it to
16692 handle intermediate widening and TARGET_MEM_REF loads
16693 and apply it to loads with a supported vector mode as well.
16695 2021-08-04 Richard Biener <rguenther@suse.de>
16697 PR tree-optimization/101756
16698 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Make sure
16699 the result of the reduction epilogue is compatible to the original
16702 2021-08-04 liuhongt <hongtao.liu@intel.com>
16705 * config/i386/i386.md (peephole2): Refine predicate from
16706 register_operand to general_reg_operand.
16708 2021-08-04 Aldy Hernandez <aldyh@redhat.com>
16710 * gimple-range-path.h (path_range_query::dump): Mark override.
16712 2021-08-04 Richard Biener <rguenther@suse.de>
16714 PR tree-optimization/101769
16715 * tree-tailcall.c (eliminate_tail_call): Add the created loop
16716 for the first recursion and return it via the new output parameter.
16717 (optimize_tail_call): Pass through new output param.
16718 (tree_optimize_tail_calls_1): After creating all latches,
16719 add the created loop to the loop tree. Do not mark loops for fixup.
16721 2021-08-04 Martin Liska <mliska@suse.cz>
16723 * doc/invoke.texi: Document threader-mode param.
16725 2021-08-04 liuhongt <hongtao.liu@intel.com>
16727 * config/i386/sse.md (cond_fma<mode>): New expander.
16728 (cond_fms<mode>): Ditto.
16729 (cond_fnma<mode>): Ditto.
16730 (cond_fnms<mode>): Ditto.
16732 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
16734 * config/rs6000/vsx.md (*vsx_le_perm_store_<mode>): Use && instead of &.
16736 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
16738 * config/rs6000/constraints.md: Remove "e" from the list of available
16739 constraint characters.
16741 2021-08-03 Eugene Rozenfeld <erozen@microsoft.com>
16743 PR gcov-profile/71672
16744 * auto-profile.c (afdo_indirect_call): Fix setup of the historgram value for indirect calls.
16746 2021-08-03 Paul A. Clarke <pc@us.ibm.com>
16748 * config/rs6000/smmintrin.h (_mm_minpos_epu16): New.
16750 2021-08-03 H.J. Lu <hjl.tools@gmail.com>
16752 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode,
16753 try XMM31 to avoid vzeroupper.
16755 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16757 * doc/invoke.texi: Document -mtune=neoverse-512tvb and
16758 -mcpu=neoverse-512tvb.
16759 * config/aarch64/aarch64-cores.def (neoverse-512tvb): New entry.
16760 * config/aarch64/aarch64-tune.md: Regenerate.
16761 * config/aarch64/aarch64.c (neoverse512tvb_sve_vector_cost)
16762 (neoverse512tvb_sve_issue_info, neoverse512tvb_vec_issue_info)
16763 (neoverse512tvb_vector_cost, neoverse512tvb_tunings): New structures.
16764 (aarch64_adjust_body_cost_sve): Handle -mtune=neoverse-512tvb.
16765 (aarch64_adjust_body_cost): Likewise.
16767 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16769 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Only
16770 record issue information for operations that occur in the
16773 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16775 * config/aarch64/aarch64.c (aarch64_multiply_add_p): Add a vec_flags
16776 parameter. Detect cases in which an Advanced SIMD MLA would almost
16777 certainly require a MOV.
16778 (aarch64_count_ops): Update accordingly.
16780 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16782 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction): New
16783 function, split out from...
16784 (aarch64_detect_vector_stmt_subtype): ...here.
16785 (aarch64_add_stmt_cost): Treat extracting element 0 as free.
16787 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16789 * config/aarch64/aarch64-protos.h (sve_vec_cost):
16790 Add gather_load_x32_cost and gather_load_x64_cost.
16791 * config/aarch64/aarch64.c (generic_sve_vector_cost)
16792 (a64fx_sve_vector_cost, neoversev1_sve_vector_cost): Update
16793 accordingly, using the values given by the scalar_load * number
16794 of elements calculation that we used previously.
16795 (aarch64_detect_vector_stmt_subtype): Use the new fields.
16797 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16799 * config/aarch64/aarch64.c (aarch64_adjust_body_cost_sve): New
16800 function, split out from...
16801 (aarch64_adjust_body_cost): ...here.
16803 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16805 * config/aarch64/fractional-cost.h: New file.
16806 * config/aarch64/aarch64.c: Include <algorithm> (indirectly)
16807 and cost_fraction.h.
16808 (vec_cost_fraction): New typedef.
16809 (aarch64_detect_scalar_stmt_subtype): Use it for statement costs.
16810 (aarch64_detect_vector_stmt_subtype): Likewise.
16811 (aarch64_sve_adjust_stmt_cost, aarch64_adjust_stmt_cost): Likewise.
16812 (aarch64_estimate_min_cycles_per_iter): Use vec_cost_fraction
16814 (aarch64_adjust_body_cost): Likewise.
16815 (aarch64_test_cost_fraction): New function.
16816 (aarch64_run_selftests): Call it.
16818 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16820 * config/aarch64/aarch64-protos.h (tune_params::sve_width): Turn
16822 * config/aarch64/aarch64.c (aarch64_cmp_autovec_modes): Update
16824 (aarch64_estimated_poly_value): Likewise. Use the least significant
16825 set bit for the minimum and likely values. Use the most significant
16826 set bit for the maximum value.
16828 2021-08-03 liuhongt <hongtao.liu@intel.com>
16830 * config/i386/sse.md (cond_<insn><mode>): New expander.
16831 (cond_mul<mode>): Ditto.
16833 2021-08-03 Kewen Lin <linkw@linux.ibm.com>
16835 * tree-cfg.c (move_sese_region_to_fn): Fix typos on dloop.
16837 2021-08-03 liuhongt <hongtao.liu@intel.com>
16839 * config/i386/sse.md (cond_<insn><mode>):New expander.
16840 (cond_mul<mode>): Ditto.
16841 (cond_div<mode>): Ditto.
16843 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16845 * config/i386/i386.c (ix86_finalize_stack_frame_flags): Also
16846 check stack_realign_needed for stack realignment.
16847 (ix86_legitimate_constant_p): Always allow CONST_WIDE_INT smaller
16848 than the largest integer supported by vector register.
16849 * config/i386/i386.h (MAX_MOVE_MAX): New. Set to 64.
16850 (MOVE_MAX): Set to bytes of the largest integer supported by
16852 (STORE_MAX_PIECES): New.
16854 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16856 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
16857 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
16858 data from one memory location to another.
16860 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16862 PR middle-end/90773
16863 * config/i386/i386.c (TARGET_GEN_MEMSET_SCRATCH_RTX): New.
16865 2021-08-02 Aldy Hernandez <aldyh@redhat.com>
16867 PR tree-optimization/101724
16868 * params.opt: Remove --param=threader-iterative.
16869 * tree-ssa-threadbackward.c (pass_thread_jumps::execute): Remove
16872 2021-08-02 Tom de Vries <tdevries@suse.de>
16874 PR middle-end/101665
16875 * doc/extend.texi (nonnull attribute): Improve documentation.
16877 2021-08-02 Andrew Pinski <apinski@marvell.com>
16879 PR rtl-optimization/101683
16880 * rtlanal.c (may_trap_p_1): Handle UNSIGNED_FIX.
16882 2021-08-02 Roger Sayle <roger@nextmovesoftware.com>
16884 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern):
16885 Renamed from cond_removal_in_popcount_clz_ctz_pattern.
16886 Add support for BSWAP, FFS, PARITY and CLRSB builtins.
16887 (tree_ssa_phiop_worker): Update call to function above.
16889 2021-08-01 H.J. Lu <hjl.tools@gmail.com>
16892 * config/i386/i386.md (bsr_rex64_1_zext): New.
16893 (combine splitter for constant - clzll): Replace gen_bsr_rex64_1
16894 with gen_bsr_rex64_1_zext.
16896 2021-07-31 Jakub Jelinek <jakub@redhat.com>
16899 * config/i386/i386.md (bsr_rex64_1, bsr_1, bsr_zext_1): New
16900 define_insn patterns.
16901 (*bsr_rex64_2, *bsr_2): New define_insn_and_split patterns.
16902 Add combine splitters for constant - clz.
16903 (clz<mode>2): Use a temporary pseudo for bsr result.
16905 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16907 * config/rs6000/smmintrin.h (_mm_floor_pd, _mm_floor_ps,
16908 _mm_floor_sd, _mm_floor_ss): New.
16910 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16912 * config/rs6000/smmintrin.h (_mm_ceil_pd, _mm_ceil_ps,
16913 _mm_ceil_sd, _mm_ceil_ss): New.
16915 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16917 * config/rs6000/smmintrin.h (_mm_blend_pd, _mm_blendv_pd,
16918 _mm_blend_ps, _mm_blendv_ps): New.
16920 2021-07-30 Roger Sayle <roger@nextmovesoftware.com>
16921 Uroš Bizjak <ubizjak@gmail.com>
16923 * config/i386/i386.md (*dec_cmov<mode>): New define_insn_and_split
16924 to generate a conditional move using the carry flag after sub $1.
16925 (peephole2): Eliminate a register-to-register move by inverting
16926 the condition of a conditional move.
16928 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
16930 * config/mmix/mmix.md ("call", "call_value", "*call_real")
16931 ("*call_value_real"): Don't generate rtx mentioning the generic
16932 operands 1 and 2 to "call", and similarly for "call_value".
16933 * config/mmix/mmix.c (mmix_print_operand_punct_valid_p)
16934 (mmix_print_operand): Use '!' instead of 'p'.
16936 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
16938 * doc/md.texi (call): Correct information about operand 2.
16939 * config/mmix/mmix.md ("call", "call_value"): Remove fixed FIXMEs.
16941 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16943 * range-op.cc (operator_trunc_mod::wi_fold): Fold constants.
16945 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16947 * range-op.cc (operator_div::wi_fold): Return UNDEFINED for [0, 0] divisor.
16949 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16951 * gimple-range-cache.cc (*::set_bb_range): Change const basic_block to
16952 const_basic_block..
16953 (*::get_bb_range): Ditto.
16954 (*::bb_range_p): Ditto.
16955 * gimple-range-cache.h: Change prototypes.
16957 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
16959 PR middle-end/90773
16960 * builtins.c (builtin_memcpy_read_str): Change the mode argument
16961 from scalar_int_mode to fixed_size_mode.
16962 (builtin_strncpy_read_str): Likewise.
16963 (gen_memset_value_from_prev): New function.
16964 (builtin_memset_read_str): Change the mode argument from
16965 scalar_int_mode to fixed_size_mode. Use gen_memset_value_from_prev
16966 and support CONST_VECTOR.
16967 (builtin_memset_gen_str): Likewise.
16968 (try_store_by_multiple_pieces): Use by_pieces_constfn to declare
16970 * builtins.h (builtin_strncpy_read_str): Replace scalar_int_mode
16971 with fixed_size_mode.
16972 (builtin_memset_read_str): Likewise.
16973 * expr.c (widest_int_mode_for_size): Renamed to ...
16974 (widest_fixed_size_mode_for_size): Add a bool argument to
16975 indicate if QI vector mode can be used.
16976 (by_pieces_ninsns): Call widest_fixed_size_mode_for_size
16977 instead of widest_int_mode_for_size.
16978 (pieces_addr::adjust): Change the mode argument from
16979 scalar_int_mode to fixed_size_mode.
16980 (op_by_pieces_d): Make m_len read-only. Add a bool member,
16981 m_qi_vector_mode, to indicate that QI vector mode can be used.
16982 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
16983 initialize m_qi_vector_mode. Call widest_fixed_size_mode_for_size
16984 instead of widest_int_mode_for_size.
16985 (op_by_pieces_d::get_usable_mode): Change the mode argument from
16986 scalar_int_mode to fixed_size_mode. Call
16987 widest_fixed_size_mode_for_size instead of
16988 widest_int_mode_for_size.
16989 (op_by_pieces_d::smallest_fixed_size_mode_for_size): New member
16990 function to return the smallest integer or QI vector mode.
16991 (op_by_pieces_d::run): Call widest_fixed_size_mode_for_size
16992 instead of widest_int_mode_for_size. Call
16993 smallest_fixed_size_mode_for_size instead of
16994 smallest_int_mode_for_size.
16995 (store_by_pieces_d::store_by_pieces_d): Add a bool argument to
16996 indicate that QI vector mode can be used and pass it to
16997 op_by_pieces_d::op_by_pieces_d.
16998 (can_store_by_pieces): Call widest_fixed_size_mode_for_size
16999 instead of widest_int_mode_for_size. Pass memsetp to
17000 widest_fixed_size_mode_for_size to support QI vector mode.
17001 Allow all CONST_VECTORs for memset if vec_duplicate is supported.
17002 (store_by_pieces): Pass memsetp to
17003 store_by_pieces_d::store_by_pieces_d.
17004 (clear_by_pieces_1): Removed.
17005 (clear_by_pieces): Replace clear_by_pieces_1 with
17006 builtin_memset_read_str and pass true to store_by_pieces_d to
17007 support vector mode broadcast.
17008 (string_cst_read_str): Change the mode argument from
17009 scalar_int_mode to fixed_size_mode.
17010 * expr.h (by_pieces_constfn): Change scalar_int_mode to
17012 (by_pieces_prev): Likewise.
17013 * rtl.h (lowpart_subreg_regno): New.
17014 * rtlanal.c (lowpart_subreg_regno): New. A wrapper around
17015 simplify_subreg_regno.
17016 * target.def (gen_memset_scratch_rtx): New hook.
17017 * doc/tm.texi.in: Add TARGET_GEN_MEMSET_SCRATCH_RTX.
17018 * doc/tm.texi: Regenerated.
17020 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
17023 * config/mips/mips.c (mips_atomic_assign_expand_fenv): Use
17024 TARGET_EXPR instead of MODIFY_EXPR.
17026 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
17029 * config/mips/mips-protos.h (mips_expand_vec_cmp_expr): Declare.
17030 * config/mips/mips.c (mips_expand_vec_cmp_expr): New function.
17031 * config/mips/mips-msa.md (vec_cmp<MSA:mode><mode_i>): New
17033 (vec_cmpu<IMSA:mode><mode_i>): New expander.
17035 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
17038 * config/i386/i386-options.c (ix86_option_override_internal):
17039 Don't enable LZCNT/POPCNT if they have been disabled explicitly.
17041 2021-07-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
17044 * config/arm/arm_neon.h (vld1_p64): Replace call to builtin by
17045 explicitly dereferencing __a.
17046 (vld1_s64): Likewise.
17047 (vld1_u64): Likewise.
17048 * config/arm/arm_neon_builtins.def (vld1): Remove entry for di
17049 and change to VAR13.
17051 2021-07-30 Aldy Hernandez <aldyh@redhat.com>
17053 * gimple-loop-versioning.cc (lv_dom_walker::lv_dom_walker): Remove
17054 use of m_range_analyzer.
17055 (loop_versioning::lv_dom_walker::before_dom_children): Same.
17056 (loop_versioning::lv_dom_walker::after_dom_children): Remove.
17057 (loop_versioning::prune_loop_conditions): Replace vr_values use
17058 with range_query interface.
17059 (pass_loop_versioning::execute): Use ranger.
17061 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
17064 * ipa-devirt.c (ipa_odr_read_section): Compare the precision of
17065 enum values, and emit a warning if they mismatch.
17067 2021-07-30 Kewen Lin <linkw@linux.ibm.com>
17069 * cfgloop.h (as_const): New function.
17070 (class loop_iterator): Rename to ...
17071 (class loops_list): ... this.
17072 (loop_iterator::next): Rename to ...
17073 (loops_list::Iter::fill_curr_loop): ... this and adjust.
17074 (loop_iterator::loop_iterator): Rename to ...
17075 (loops_list::loops_list): ... this and adjust.
17076 (loops_list::Iter): New class.
17077 (loops_list::iterator): New type.
17078 (loops_list::const_iterator): New type.
17079 (loops_list::begin): New function.
17080 (loops_list::end): Likewise.
17081 (loops_list::begin const): Likewise.
17082 (loops_list::end const): Likewise.
17083 (FOR_EACH_LOOP): Remove.
17084 (FOR_EACH_LOOP_FN): Remove.
17085 * cfgloop.c (flow_loops_dump): Adjust FOR_EACH_LOOP* with range-based
17086 for loop with loops_list instance.
17087 (sort_sibling_loops): Likewise.
17088 (disambiguate_loops_with_multiple_latches): Likewise.
17089 (verify_loop_structure): Likewise.
17090 * cfgloopmanip.c (create_preheaders): Likewise.
17091 (force_single_succ_latches): Likewise.
17092 * config/aarch64/falkor-tag-collision-avoidance.c
17093 (execute_tag_collision_avoidance): Likewise.
17094 * config/mn10300/mn10300.c (mn10300_scan_for_setlb_lcc): Likewise.
17095 * config/s390/s390.c (s390_adjust_loops): Likewise.
17096 * doc/loop.texi: Likewise.
17097 * gimple-loop-interchange.cc (pass_linterchange::execute): Likewise.
17098 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
17099 * gimple-loop-versioning.cc (loop_versioning::analyze_blocks): Likewise.
17100 (loop_versioning::make_versioning_decisions): Likewise.
17101 * gimple-ssa-split-paths.c (split_paths): Likewise.
17102 * graphite-isl-ast-to-gimple.c (graphite_regenerate_ast_isl): Likewise.
17103 * graphite.c (canonicalize_loop_form): Likewise.
17104 (graphite_transform_loops): Likewise.
17105 * ipa-fnsummary.c (analyze_function_body): Likewise.
17106 * ipa-pure-const.c (analyze_function): Likewise.
17107 * loop-doloop.c (doloop_optimize_loops): Likewise.
17108 * loop-init.c (loop_optimizer_finalize): Likewise.
17109 (fix_loop_structure): Likewise.
17110 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
17111 (move_loop_invariants): Likewise.
17112 * loop-unroll.c (decide_unrolling): Likewise.
17113 (unroll_loops): Likewise.
17114 * modulo-sched.c (sms_schedule): Likewise.
17115 * predict.c (predict_loops): Likewise.
17116 (pass_profile::execute): Likewise.
17117 * profile.c (branch_prob): Likewise.
17118 * sel-sched-ir.c (sel_finish_pipelining): Likewise.
17119 (sel_find_rgns): Likewise.
17120 * tree-cfg.c (replace_loop_annotate): Likewise.
17121 (replace_uses_by): Likewise.
17122 (move_sese_region_to_fn): Likewise.
17123 * tree-if-conv.c (pass_if_conversion::execute): Likewise.
17124 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
17125 * tree-parloops.c (parallelize_loops): Likewise.
17126 * tree-predcom.c (tree_predictive_commoning): Likewise.
17127 * tree-scalar-evolution.c (scev_initialize): Likewise.
17128 (scev_reset): Likewise.
17129 * tree-ssa-dce.c (find_obviously_necessary_stmts): Likewise.
17130 * tree-ssa-live.c (remove_unused_locals): Likewise.
17131 * tree-ssa-loop-ch.c (ch_base::copy_headers): Likewise.
17132 * tree-ssa-loop-im.c (analyze_memory_references): Likewise.
17133 (tree_ssa_lim_initialize): Likewise.
17134 * tree-ssa-loop-ivcanon.c (canonicalize_induction_variables): Likewise.
17135 * tree-ssa-loop-ivopts.c (tree_ssa_iv_optimize): Likewise.
17136 * tree-ssa-loop-manip.c (get_loops_exits): Likewise.
17137 * tree-ssa-loop-niter.c (estimate_numbers_of_iterations): Likewise.
17138 (free_numbers_of_iterations_estimates): Likewise.
17139 * tree-ssa-loop-prefetch.c (tree_ssa_prefetch_arrays): Likewise.
17140 * tree-ssa-loop-split.c (tree_ssa_split_loops): Likewise.
17141 * tree-ssa-loop-unswitch.c (tree_ssa_unswitch_loops): Likewise.
17142 * tree-ssa-loop.c (gate_oacc_kernels): Likewise.
17143 (pass_scev_cprop::execute): Likewise.
17144 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
17145 * tree-ssa-sccvn.c (do_rpo_vn): Likewise.
17146 * tree-ssa-threadupdate.c
17147 (jump_thread_path_registry::thread_through_all_blocks): Likewise.
17148 * tree-vectorizer.c (vectorize_loops): Likewise.
17149 * tree-vrp.c (vrp_asserts::find_assert_locations): Likewise.
17151 2021-07-29 Hans-Peter Nilsson <hp@bitrange.com>
17153 * config/mmix/mmix.c (mmix_function_arg_1): Avoid
17154 generating a VOIDmode register for e.g the
17155 function_arg_info::end_marker.
17157 2021-07-29 Jeff Law <jeffreyalaw@gmail.com>
17159 * config/h8300/h8300-modes.def: Add CCZ, CCV and CCC, drop CCZNV.
17160 * config/h8300/h8300.md (H8cc mode iterator): Add CCZ.
17161 (cc mode_attr): Similarly.
17162 (ccz subst_attr): Similarly.
17163 * config/h8300/jumpcall.md: Add new patterns for branch-on-bit.
17164 * config/h8300/testcompare.md: Remove various cc0 based patterns
17165 that had been commented out. Add pattern to set CCZ from a bit
17168 2021-07-29 Thomas Schwinge <thomas@codesourcery.com>
17169 Julian Brown <julian@codesourcery.com>
17170 Kwok Cheung Yeung <kcy@codesourcery.com>
17172 * omp-offload.c (oacc_loop_xform_head_tail, oacc_loop_process):
17173 'update_stmt' after modification.
17174 (pass_oacc_loop_designation): New function, extracted out of...
17175 (pass_oacc_device_lower): ... this.
17176 (pass_data_oacc_loop_designation, pass_oacc_loop_designation)
17177 (make_pass_oacc_loop_designation): New
17178 * passes.def: Add it.
17179 * tree-parloops.c (create_parallel_loop): Adjust.
17180 * tree-pass.h (make_pass_oacc_loop_designation): New.
17182 2021-07-29 Aldy Hernandez <aldyh@redhat.com>
17184 * flag-types.h (enum threader_mode): New.
17185 * params.opt: Add entry for --param=threader-mode.
17186 * tree-ssa-threadbackward.c (THREADER_ITERATIVE_MODE): New.
17187 (class back_threader): New.
17188 (back_threader::back_threader): New.
17189 (back_threader::~back_threader): New.
17190 (back_threader::maybe_register_path): New.
17191 (back_threader::find_taken_edge): New.
17192 (back_threader::find_taken_edge_switch): New.
17193 (back_threader::find_taken_edge_cond): New.
17194 (back_threader::resolve_def): New.
17195 (back_threader::resolve_phi): New.
17196 (back_threader::find_paths_to_names): New.
17197 (back_threader::find_paths): New.
17200 (thread_jumps::find_jump_threads_backwards): Call ranger threader.
17201 (thread_jumps::find_jump_threads_backwards_with_ranger): New.
17202 (pass_thread_jumps::execute): Abstract out code...
17203 (try_thread_blocks): ...here.
17204 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
17205 Abstract out threading candidate code to...
17206 (single_succ_to_potentially_threadable_block): ...here.
17207 * tree-ssa-threadedge.h (single_succ_to_potentially_threadable_block):
17209 * tree-ssa-threadupdate.c (register_jump_thread): Return boolean.
17210 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
17211 Return bool from register_jump_thread.
17213 2021-07-29 Andreas Krebbel <krebbel@linux.ibm.com>
17215 * target.def: in0 and in1 do not need to be registers.
17216 * doc/tm.texi: Regenerate.
17218 2021-07-29 liuhongt <hongtao.liu@intel.com>
17221 * config/i386/i386.c (ix86_widen_mult_cost): New function.
17222 (ix86_add_stmt_cost): Use ix86_widen_mult_cost for
17225 2021-07-29 Jiufu Guo <guojiufu@linux.ibm.com>
17228 * config/rs6000/rs6000.c (TARGET_PREFERRED_DOLOOP_MODE): New hook.
17229 (rs6000_preferred_doloop_mode): New hook.
17230 * doc/tm.texi: Regenerate.
17231 * doc/tm.texi.in: Add hook preferred_doloop_mode.
17232 * target.def (preferred_doloop_mode): New hook.
17233 * targhooks.c (default_preferred_doloop_mode): New hook.
17234 * targhooks.h (default_preferred_doloop_mode): New hook.
17235 * tree-ssa-loop-ivopts.c (compute_doloop_base_on_mode): New function.
17236 (add_iv_candidate_for_doloop): Call targetm.preferred_doloop_mode
17237 and compute_doloop_base_on_mode.
17239 2021-07-28 Martin Sebor <msebor@redhat.com>
17241 PR middle-end/101494
17242 * tree-ssa-uninit.c (maybe_warn_operand): Correct object offset
17243 and size computation.
17245 2021-07-28 Martin Sebor <msebor@redhat.com>
17247 PR middle-end/101601
17248 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Remove
17250 Handle pointers to functions.
17252 2021-07-28 Martin Sebor <msebor@redhat.com>
17254 * Makefile.in (OBJS): Add gimple-ssa-warn-access.o and pointer-query.o.
17255 * attribs.h (fndecl_dealloc_argno): Move fndecl_dealloc_argno to tree.h.
17256 * builtins.c (compute_objsize_r): Move to pointer-query.cc.
17257 (access_ref::access_ref): Same.
17258 (access_ref::phi): Same.
17259 (access_ref::get_ref): Same.
17260 (access_ref::size_remaining): Same.
17261 (access_ref::offset_in_range): Same.
17262 (access_ref::add_offset): Same.
17263 (access_ref::inform_access): Same.
17264 (ssa_name_limit_t::visit_phi): Same.
17265 (ssa_name_limit_t::leave_phi): Same.
17266 (ssa_name_limit_t::next): Same.
17267 (ssa_name_limit_t::next_phi): Same.
17268 (ssa_name_limit_t::~ssa_name_limit_t): Same.
17269 (pointer_query::pointer_query): Same.
17270 (pointer_query::get_ref): Same.
17271 (pointer_query::put_ref): Same.
17272 (pointer_query::flush_cache): Same.
17273 (warn_string_no_nul): Move to gimple-ssa-warn-access.cc.
17274 (check_nul_terminated_array): Same.
17275 (unterminated_array): Same.
17276 (maybe_warn_for_bound): Same.
17277 (check_read_access): Same.
17278 (warn_for_access): Same.
17279 (get_size_range): Same.
17280 (check_access): Same.
17281 (gimple_call_alloc_size): Move to tree.c.
17282 (gimple_parm_array_size): Move to pointer-query.cc.
17283 (get_offset_range): Same.
17284 (gimple_call_return_array): Same.
17285 (handle_min_max_size): Same.
17286 (handle_array_ref): Same.
17287 (handle_mem_ref): Same.
17288 (compute_objsize): Same.
17289 (gimple_call_alloc_p): Move to gimple-ssa-warn-access.cc.
17290 (call_dealloc_argno): Same.
17291 (fndecl_dealloc_argno): Same.
17292 (new_delete_mismatch_p): Same.
17293 (matching_alloc_calls_p): Same.
17294 (warn_dealloc_offset): Same.
17295 (maybe_emit_free_warning): Same.
17296 * builtins.h (check_nul_terminated_array): Move to
17297 gimple-ssa-warn-access.h.
17298 (check_nul_terminated_array): Same.
17299 (warn_string_no_nul): Same.
17300 (unterminated_array): Same.
17301 (class ssa_name_limit_t): Same.
17302 (class pointer_query): Same.
17303 (struct access_ref): Same.
17304 (class range_query): Same.
17305 (struct access_data): Same.
17306 (gimple_call_alloc_size): Same.
17307 (gimple_parm_array_size): Same.
17308 (compute_objsize): Same.
17309 (class access_data): Same.
17310 (maybe_emit_free_warning): Same.
17311 * calls.c (initialize_argument_information): Remove call to
17312 maybe_emit_free_warning.
17313 * gimple-array-bounds.cc: Include new header..
17314 * gimple-fold.c: Same.
17315 * gimple-ssa-sprintf.c: Same.
17316 * gimple-ssa-warn-restrict.c: Same.
17317 * passes.def: Add pass_warn_access.
17318 * tree-pass.h (make_pass_warn_access): Declare.
17319 * tree-ssa-strlen.c: Include new headers.
17320 * tree.c (fndecl_dealloc_argno): Move here from builtins.c.
17321 * tree.h (fndecl_dealloc_argno): Move here from attribs.h.
17322 * gimple-ssa-warn-access.cc: New file.
17323 * gimple-ssa-warn-access.h: New file.
17324 * pointer-query.cc: New file.
17325 * pointer-query.h: New file.
17327 2021-07-28 Jakub Jelinek <jakub@redhat.com>
17329 PR middle-end/101624
17330 * ubsan.c (maybe_instrument_pointer_overflow,
17331 instrument_object_size): Only test DECL_REGISTER on VAR_DECLs,
17332 PARM_DECLs or RESULT_DECLs.
17333 * sanopt.c (maybe_optimize_ubsan_ptr_ifn): Likewise.
17335 2021-07-28 Jakub Jelinek <jakub@redhat.com>
17337 PR middle-end/101642
17338 * match.pd (bswap16 (x) == bswap16 (y)): Cast both operands
17339 to type of bswap16 for comparison.
17340 (bswap16 (x) == cst): Cast bswap16 operand to type of cst.
17342 2021-07-28 Richard Biener <rguenther@suse.de>
17344 PR tree-optimization/101615
17345 * tree-vect-slp.c (vect_optimize_slp): Materialize permutes
17346 at CTOR SLP graph entries.
17348 2021-07-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
17350 * config/aarch64/aarch64.md (*extend<SHORT:mode><GPI:mode>2_aarch64):
17351 Add "r,w" alternative.
17353 2021-07-28 H.J. Lu <hjl.tools@gmail.com>
17356 * config/i386/i386.c (ix86_avx_u128_mode_needed): Don't set
17357 AVX_U128_DIRTY when all bits are zero.
17359 2021-07-28 Richard Biener <rguenther@suse.de>
17361 PR tree-optimization/101615
17362 * tree-vect-slp.c (vect_optimize_slp): Pre-existing vector
17363 external nodes cannot be permuted so make them perm_out 0.
17365 2021-07-28 Andrew Stubbs <ams@codesourcery.com>
17368 * config.in: Regenerate.
17369 * config/gcn/gcn-hsa.h (A_FIJI): New define.
17370 (A_900): New define.
17371 (A_906): New define.
17372 (A_908): New define.
17373 (ASM_SPEC): Use A_FIJI, A_900, A_906 and A_908.
17374 * config/gcn/gcn.c (output_file_start): Adjust attributes according
17375 to the assembler capabilities.
17376 * config/gcn/mkoffload.c (main): Likewise.
17377 * configure: Regenerate.
17378 * configure.ac: Add tests for LLVM assembler attribute features.
17380 2021-07-28 Andrew MacLeod <amacleod@redhat.com>
17382 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Check for
17383 cond_false and cond_true on branches.
17385 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
17387 * config/aarch64/aarch64.c (aarch64_gen_adjusted_ldpstp): use
17390 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
17392 * alias.c (init_alias_analysis): Don't skip prologue/epilogue.
17394 2021-07-28 Jakub Jelinek <jakub@redhat.com>
17397 * config/i386/sse.md (vashr<mode>3): Split into vashrv8di3 expander
17398 and vashrv4di3 expander, where the latter requires just TARGET_AVX2
17399 and has special !TARGET_AVX512VL expansion.
17400 (vashrv2di3<mask_name>): Rename to ...
17401 (vashrv2di3): ... this. Change condition to TARGET_XOP || TARGET_AVX2
17402 and add special !TARGET_XOP && !TARGET_AVX512VL expansion.
17404 2021-07-28 Martin Uecker <muecker@gwdg.de>
17406 * calls.c (maybe_warn_rdwr_sizes): Correct argument
17407 numbers in warning that were switched.
17409 2021-07-28 Kewen Lin <linkw@linux.ibm.com>
17411 PR tree-optimization/101596
17412 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Fix wrong check
17413 by using new_type's precision instead.
17415 2021-07-28 liuhongt <hongtao.liu@intel.com>
17418 * config/i386/i386.h (processor_costs): Add new member
17420 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
17421 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
17422 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
17423 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
17424 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
17425 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
17426 generic_cost, core_cost): Initialize integer_to_sse same value
17428 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
17429 * config/i386/i386.c (ix86_builtin_vectorization_cost):
17430 Use integer_to_sse instead of sse_op to calculate the cost of
17433 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
17435 * config/rs6000/rs6000-gen-builtins.c (write_ovld_static_init): New
17437 (write_init_file): Call write_ovld_static_init.
17439 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
17441 * config/rs6000/rs6000-gen-builtins.c (write_bif_static_init): New
17443 (write_init_file): Call write_bif_static_init.
17445 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
17447 * config/rs6000/rs6000-gen-builtins.c (typemap): New struct.
17448 (TYPE_MAP_SIZE): New macro.
17449 (type_map): New initialized variable.
17450 (typemap_cmp): New function.
17451 (write_type_node): Likewise.
17452 (write_fntype_init): Implement.
17454 2021-07-27 Martin Sebor <msebor@redhat.com>
17456 PR tree-optimization/101584
17457 * tree-ssa-uninit.c (builtin_call_nomodifying_p): New function.
17458 (check_defs): Call it.
17460 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
17462 * tree-ssa-dom.c (dom_jump_threader_simplifier):
17463 Put avail_exprs_stack in the class, instead of passing it to
17464 jump_threader_simplifier.
17465 (dom_jump_threader_simplifier::simplify): Add state argument.
17466 (dom_opt_dom_walker): Add state.
17467 (pass_dominator::execute): Pass state to threader.
17468 (dom_opt_dom_walker::before_dom_children): Use state.
17469 * tree-ssa-threadedge.c (jump_threader::jump_threader): Replace
17470 arguments by state.
17471 (jump_threader::record_temporary_equivalences_from_phis):
17472 Register equivalences through the state variable.
17473 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
17474 Record ranges in a statement through the state variable.
17475 (jump_threader::simplify_control_stmt_condition): Pass state to
17477 (jump_threader::simplify_control_stmt_condition_1): Same.
17478 (jump_threader::thread_around_empty_blocks): Remove obsolete
17480 (jump_threader::thread_through_normal_block): Record equivalences
17481 on edge through the state variable.
17482 (jump_threader::thread_across_edge): Abstract state pushing.
17483 (jt_state::jt_state): New.
17484 (jt_state::push): New.
17485 (jt_state::pop): New.
17486 (jt_state::register_equiv): New.
17487 (jt_state::record_ranges_from_stmt): New.
17488 (jt_state::register_equivs_on_edge): New.
17489 (jump_threader_simplifier::jump_threader_simplifier): Move from
17491 (jump_threader_simplifier::simplify): Add state argument.
17492 * tree-ssa-threadedge.h (class jt_state): New.
17493 (class jump_threader): Add state to constructor.
17494 (class jump_threader_simplifier): Add state to simplify. Remove
17495 avail_exprs_stack from class.
17496 * tree-vrp.c (vrp_jump_threader_simplifier::simplify): Add state
17498 (vrp_jump_threader::vrp_jump_threader): Add state.
17499 (vrp_jump_threader::~vrp_jump_threader): Cleanup state.
17501 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
17503 * Makefile.in (OBJS): Add gimple-range-path.o.
17504 * gimple-range-path.cc: New file.
17505 * gimple-range-path.h: New file.
17507 2021-07-27 Jonathan Wright <jonathan.wright@arm.com>
17509 * config/aarch64/aarch64-simd.md: Push sign/zero-extension
17510 inside vec_duplicate for all patterns.
17511 * simplify-rtx.c (simplify_context::simplify_unary_operation_1):
17512 Push sign/zero-extension inside vec_duplicate.
17514 2021-07-27 Richard Biener <rguenther@suse.de>
17516 PR tree-optimization/101573
17517 * tree-ssa-uninit.c (warn_uninit_phi_uses): New function
17518 looking at uninitialized PHI arg defs in some constrained cases.
17519 (warn_uninitialized_vars): Call it.
17520 (execute_early_warn_uninitialized): Calculate dominators.
17522 2021-07-27 Richard Biener <rguenther@suse.de>
17524 PR tree-optimization/39821
17525 * tree-vect-stmts.c (vect_model_promotion_demotion_cost): Use
17526 vector_stmt for widening arithmetic.
17527 (vectorizable_conversion): Adjust.
17529 2021-07-27 Martin Jambor <mjambor@suse.cz>
17531 * cgraph.h (ipa_replace_map): New field force_load_ref.
17532 * ipa-prop.h (ipa_param_descriptor): Reduce precision of move_cost,
17533 aded new flag load_dereferenced, adjusted comments.
17534 (ipa_get_param_dereferenced): New function.
17535 (ipa_set_param_dereferenced): Likewise.
17536 * cgraphclones.c (cgraph_node::create_virtual_clone): Follow it.
17537 * ipa-cp.c: Include gimple.h.
17538 (ipcp_discover_new_direct_edges): Take into account dereferenced flag.
17539 (get_replacement_map): New parameter force_load_ref, set the
17540 appropriate flag in ipa_replace_map if set.
17541 (struct symbol_and_index_together): New type.
17542 (adjust_refs_in_act_callers): New function.
17543 (adjust_references_in_caller): Likewise.
17544 (create_specialized_node): When appropriate, call
17545 adjust_references_in_caller and force only load references.
17546 * ipa-prop.c (load_from_dereferenced_name): New function.
17547 (ipa_analyze_controlled_uses): Also detect loads from a
17548 dereference, harden testing of call statements.
17549 (ipa_write_node_info): Stream the dereferenced flag.
17550 (ipa_read_node_info): Likewise.
17551 (ipa_set_jf_constant): Also create refdesc when jump function
17552 references a variable.
17553 (cgraph_node_for_jfunc): Rename to symtab_node_for_jfunc, work
17554 also on references of variables and return a symtab_node. Adjust
17556 (propagate_controlled_uses): Also remove references to VAR_DECLs.
17558 2021-07-27 Jakub Jelinek <jakub@redhat.com>
17560 PR middle-end/101586
17561 * gimple-fold.c (clear_padding_type): Ignore FIELD_DECLs with byte
17562 positions above or equal to sz except for diagnostics of flexible
17565 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
17567 PR tree-optimization/78888
17568 * gimple-range-fold.cc (get_letter_range): New.
17569 (fold_using_range::range_of_builtin_call): Call get_letter_range.
17571 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
17573 PR tree-optimization/78888
17574 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Add cases
17575 for CFN_BUILT_IN_TOUPPER and CFN_BUILT_IN_TOLOWER.
17577 2021-07-26 Roger Sayle <roger@nextmovesoftware.com>
17578 Marc Glisse <marc.glisse@inria.fr>
17580 * match.pd (rotate): Simplify equality/inequality of rotations.
17581 (bswap): Simplify equality/inequality tests of byte swapping.
17583 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17585 * range-op.cc (operator_bitwise_xor::op1_op2_relation_effect):
17588 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17590 * range-op.cc (operator_lshift::fold_range): Pass rel to
17591 base class fold_range.
17592 (operator_rshift::fold_range): Same.
17594 2021-07-26 Ashimida <ashimida@linux.alibaba.com>
17597 * toplev.h (min_align_loops_log): Remove declaration.
17598 (min_align_jumps_log, min_align_labels_log): Likewise.
17599 (min_align_functions_log): Likewise.
17601 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17603 * tree-vrp.c (vrp_simplify_cond_using_ranges): Rename vr_values
17605 (execute_vrp): Abstract out simplification of conditionals...
17606 (simplify_casted_conds): ...here.
17608 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17610 * gimple-array-bounds.cc (array_bounds_checker::get_value_range):
17611 Add gimple argument.
17612 (array_bounds_checker::check_array_ref): Same.
17613 (array_bounds_checker::check_addr_expr): Same.
17614 (array_bounds_checker::check_array_bounds): Pass statement to
17615 check_array_bounds and check_addr_expr.
17616 * gimple-array-bounds.h (check_array_bounds): Add gimple argument.
17617 (check_addr_expr): Same.
17618 (get_value_range): Same.
17620 2021-07-26 Tamar Christina <tamar.christina@arm.com>
17622 * config/aarch64/aarch64-simd-builtins.def (sdot, udot): Rename to..
17623 (sdot_prod, udot_prod): ... This.
17624 * config/aarch64/aarch64-simd.md (aarch64_<sur>dot<vsi2qi>): Merged
17626 (<sur>dot_prod<vsi2qi>): ... this.
17627 (aarch64_<sur>dot_lane<vsi2qi>, aarch64_<sur>dot_laneq<vsi2qi>):
17628 Change operands order.
17629 (<sur>sadv16qi): Use new operands order.
17630 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32,
17631 vdotq_s32): Use new RTL ordering.
17633 2021-07-26 Tamar Christina <tamar.christina@arm.com>
17635 * config/aarch64/aarch64-builtins.c (TYPES_TERNOP_SUSS,
17636 aarch64_types_ternop_suss_qualifiers): New.
17637 * config/aarch64/aarch64-simd-builtins.def (usdot_prod): Use it.
17638 * config/aarch64/aarch64-simd.md (usdot_prod<vsi2qi>): Re-organize RTL.
17639 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Use it.
17641 2021-07-23 Jakub Jelinek <jakub@redhat.com>
17643 PR rtl-optimization/101562
17644 * expmed.c (store_integral_bit_field): Only use movstrict_optab
17645 if the operand isn't paradoxical.
17647 2021-07-23 Aldy Hernandez <aldyh@redhat.com>
17649 * gimple-array-bounds.h (class array_bounds_checker): Change
17650 ranges type to range_query.
17652 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17654 * config/aarch64/arm_neon.h (vst1_s64_x2): Use
17655 __builtin_memcpy instead of constructing
17656 __builtin_aarch64_simd_oi one vector at a time.
17657 (vst1_u64_x2): Likewise.
17658 (vst1_f64_x2): Likewise.
17659 (vst1_s8_x2): Likewise.
17660 (vst1_p8_x2): Likewise.
17661 (vst1_s16_x2): Likewise.
17662 (vst1_p16_x2): Likewise.
17663 (vst1_s32_x2): Likewise.
17664 (vst1_u8_x2): Likewise.
17665 (vst1_u16_x2): Likewise.
17666 (vst1_u32_x2): Likewise.
17667 (vst1_f16_x2): Likewise.
17668 (vst1_f32_x2): Likewise.
17669 (vst1_p64_x2): Likewise.
17670 (vst1q_s8_x2): Likewise.
17671 (vst1q_p8_x2): Likewise.
17672 (vst1q_s16_x2): Likewise.
17673 (vst1q_p16_x2): Likewise.
17674 (vst1q_s32_x2): Likewise.
17675 (vst1q_s64_x2): Likewise.
17676 (vst1q_u8_x2): Likewise.
17677 (vst1q_u16_x2): Likewise.
17678 (vst1q_u32_x2): Likewise.
17679 (vst1q_u64_x2): Likewise.
17680 (vst1q_f16_x2): Likewise.
17681 (vst1q_f32_x2): Likewise.
17682 (vst1q_f64_x2): Likewise.
17683 (vst1q_p64_x2): Likewise.
17685 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17687 * config/aarch64/arm_neon.h (vst1_s64_x3): Use
17688 __builtin_memcpy instead of constructing
17689 __builtin_aarch64_simd_ci one vector at a time.
17690 (vst1_u64_x3): Likewise.
17691 (vst1_f64_x3): Likewise.
17692 (vst1_s8_x3): Likewise.
17693 (vst1_p8_x3): Likewise.
17694 (vst1_s16_x3): Likewise.
17695 (vst1_p16_x3): Likewise.
17696 (vst1_s32_x3): Likewise.
17697 (vst1_u8_x3): Likewise.
17698 (vst1_u16_x3): Likewise.
17699 (vst1_u32_x3): Likewise.
17700 (vst1_f16_x3): Likewise.
17701 (vst1_f32_x3): Likewise.
17702 (vst1_p64_x3): Likewise.
17703 (vst1q_s8_x3): Likewise.
17704 (vst1q_p8_x3): Likewise.
17705 (vst1q_s16_x3): Likewise.
17706 (vst1q_p16_x3): Likewise.
17707 (vst1q_s32_x3): Likewise.
17708 (vst1q_s64_x3): Likewise.
17709 (vst1q_u8_x3): Likewise.
17710 (vst1q_u16_x3): Likewise.
17711 (vst1q_u32_x3): Likewise.
17712 (vst1q_u64_x3): Likewise.
17713 (vst1q_f16_x3): Likewise.
17714 (vst1q_f32_x3): Likewise.
17715 (vst1q_f64_x3): Likewise.
17716 (vst1q_p64_x3): Likewise.
17718 2021-07-23 H.J. Lu <hjl.tools@gmail.com>
17721 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): Don't return
17722 hard register when LRA is in progress.
17724 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17726 * config/aarch64/arm_neon.h (vst1_s8_x4): Use
17727 __builtin_memcpy instead of using a union.
17728 (vst1q_s8_x4): Likewise.
17729 (vst1_s16_x4): Likewise.
17730 (vst1q_s16_x4): Likewise.
17731 (vst1_s32_x4): Likewise.
17732 (vst1q_s32_x4): Likewise.
17733 (vst1_u8_x4): Likewise.
17734 (vst1q_u8_x4): Likewise.
17735 (vst1_u16_x4): Likewise.
17736 (vst1q_u16_x4): Likewise.
17737 (vst1_u32_x4): Likewise.
17738 (vst1q_u32_x4): Likewise.
17739 (vst1_f16_x4): Likewise.
17740 (vst1q_f16_x4): Likewise.
17741 (vst1_f32_x4): Likewise.
17742 (vst1q_f32_x4): Likewise.
17743 (vst1_p8_x4): Likewise.
17744 (vst1q_p8_x4): Likewise.
17745 (vst1_p16_x4): Likewise.
17746 (vst1q_p16_x4): Likewise.
17747 (vst1_s64_x4): Likewise.
17748 (vst1_u64_x4): Likewise.
17749 (vst1_p64_x4): Likewise.
17750 (vst1q_s64_x4): Likewise.
17751 (vst1q_u64_x4): Likewise.
17752 (vst1q_p64_x4): Likewise.
17753 (vst1_f64_x4): Likewise.
17754 (vst1q_f64_x4): Likewise.
17756 2021-07-23 Jonathan Wrightt <jonathan.wright@arm.com>
17758 * config/aarch64/arm_neon.h (vst2_s64): Use __builtin_memcpy
17759 instead of constructing __builtin_aarch64_simd_oi one vector
17761 (vst2_u64): Likewise.
17762 (vst2_f64): Likewise.
17763 (vst2_s8): Likewise.
17764 (vst2_p8): Likewise.
17765 (vst2_s16): Likewise.
17766 (vst2_p16): Likewise.
17767 (vst2_s32): Likewise.
17768 (vst2_u8): Likewise.
17769 (vst2_u16): Likewise.
17770 (vst2_u32): Likewise.
17771 (vst2_f16): Likewise.
17772 (vst2_f32): Likewise.
17773 (vst2_p64): Likewise.
17774 (vst2q_s8): Likewise.
17775 (vst2q_p8): Likewise.
17776 (vst2q_s16): Likewise.
17777 (vst2q_p16): Likewise.
17778 (vst2q_s32): Likewise.
17779 (vst2q_s64): Likewise.
17780 (vst2q_u8): Likewise.
17781 (vst2q_u16): Likewise.
17782 (vst2q_u32): Likewise.
17783 (vst2q_u64): Likewise.
17784 (vst2q_f16): Likewise.
17785 (vst2q_f32): Likewise.
17786 (vst2q_f64): Likewise.
17787 (vst2q_p64): Likewise.
17789 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17791 * config/aarch64/arm_neon.h (vst3_s64): Use __builtin_memcpy
17792 instead of constructing __builtin_aarch64_simd_ci one vector
17794 (vst3_u64): Likewise.
17795 (vst3_f64): Likewise.
17796 (vst3_s8): Likewise.
17797 (vst3_p8): Likewise.
17798 (vst3_s16): Likewise.
17799 (vst3_p16): Likewise.
17800 (vst3_s32): Likewise.
17801 (vst3_u8): Likewise.
17802 (vst3_u16): Likewise.
17803 (vst3_u32): Likewise.
17804 (vst3_f16): Likewise.
17805 (vst3_f32): Likewise.
17806 (vst3_p64): Likewise.
17807 (vst3q_s8): Likewise.
17808 (vst3q_p8): Likewise.
17809 (vst3q_s16): Likewise.
17810 (vst3q_p16): Likewise.
17811 (vst3q_s32): Likewise.
17812 (vst3q_s64): Likewise.
17813 (vst3q_u8): Likewise.
17814 (vst3q_u16): Likewise.
17815 (vst3q_u32): Likewise.
17816 (vst3q_u64): Likewise.
17817 (vst3q_f16): Likewise.
17818 (vst3q_f32): Likewise.
17819 (vst3q_f64): Likewise.
17820 (vst3q_p64): Likewise.
17822 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17824 * config/aarch64/arm_neon.h (vst4_s64): Use __builtin_memcpy
17825 instead of constructing __builtin_aarch64_simd_xi one vector
17827 (vst4_u64): Likewise.
17828 (vst4_f64): Likewise.
17829 (vst4_s8): Likewise.
17830 (vst4_p8): Likewise.
17831 (vst4_s16): Likewise.
17832 (vst4_p16): Likewise.
17833 (vst4_s32): Likewise.
17834 (vst4_u8): Likewise.
17835 (vst4_u16): Likewise.
17836 (vst4_u32): Likewise.
17837 (vst4_f16): Likewise.
17838 (vst4_f32): Likewise.
17839 (vst4_p64): Likewise.
17840 (vst4q_s8): Likewise.
17841 (vst4q_p8): Likewise.
17842 (vst4q_s16): Likewise.
17843 (vst4q_p16): Likewise.
17844 (vst4q_s32): Likewise.
17845 (vst4q_s64): Likewise.
17846 (vst4q_u8): Likewise.
17847 (vst4q_u16): Likewise.
17848 (vst4q_u32): Likewise.
17849 (vst4q_u64): Likewise.
17850 (vst4q_f16): Likewise.
17851 (vst4q_f32): Likewise.
17852 (vst4q_f64): Likewise.
17853 (vst4q_p64): Likewise.
17855 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17857 * config/aarch64/arm_neon.h (vtbx4_s8): Use __builtin_memcpy
17858 instead of constructing __builtin_aarch64_simd_oi one vector
17860 (vtbx4_u8): Likewise.
17861 (vtbx4_p8): Likewise.
17863 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17865 * config/aarch64/arm_neon.h (vtbl3_s8): Use __builtin_memcpy
17866 instead of constructing __builtin_aarch64_simd_oi one vector
17868 (vtbl3_u8): Likewise.
17869 (vtbl3_p8): Likewise.
17870 (vtbl4_s8): Likewise.
17871 (vtbl4_u8): Likewise.
17872 (vtbl4_p8): Likewise.
17874 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17876 * config/aarch64/arm_neon.h (vqtbx2_s8): Use __builtin_memcpy
17877 instead of constructing __builtin_aarch64_simd_oi one vector
17879 (vqtbx2_u8): Likewise.
17880 (vqtbx2_p8): Likewise.
17881 (vqtbx2q_s8): Likewise.
17882 (vqtbx2q_u8): Likewise.
17883 (vqtbx2q_p8): Likewise.
17884 (vqtbx3_s8): Use __builtin_memcpy instead of constructing
17885 __builtin_aarch64_simd_ci one vector at a time.
17886 (vqtbx3_u8): Likewise.
17887 (vqtbx3_p8): Likewise.
17888 (vqtbx3q_s8): Likewise.
17889 (vqtbx3q_u8): Likewise.
17890 (vqtbx3q_p8): Likewise.
17891 (vqtbx4_s8): Use __builtin_memcpy instead of constructing
17892 __builtin_aarch64_simd_xi one vector at a time.
17893 (vqtbx4_u8): Likewise.
17894 (vqtbx4_p8): Likewise.
17895 (vqtbx4q_s8): Likewise.
17896 (vqtbx4q_u8): Likewise.
17897 (vqtbx4q_p8): Likewise.
17899 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17901 * config/aarch64/arm_neon.h (vqtbl2_s8): Use __builtin_memcpy
17902 instead of constructing __builtin_aarch64_simd_oi one vector
17904 (vqtbl2_u8): Likewise.
17905 (vqtbl2_p8): Likewise.
17906 (vqtbl2q_s8): Likewise.
17907 (vqtbl2q_u8): Likewise.
17908 (vqtbl2q_p8): Likewise.
17909 (vqtbl3_s8): Use __builtin_memcpy instead of constructing
17910 __builtin_aarch64_simd_ci one vector at a time.
17911 (vqtbl3_u8): Likewise.
17912 (vqtbl3_p8): Likewise.
17913 (vqtbl3q_s8): Likewise.
17914 (vqtbl3q_u8): Likewise.
17915 (vqtbl3q_p8): Likewise.
17916 (vqtbl4_s8): Use __builtin_memcpy instead of constructing
17917 __builtin_aarch64_simd_xi one vector at a time.
17918 (vqtbl4_u8): Likewise.
17919 (vqtbl4_p8): Likewise.
17920 (vqtbl4q_s8): Likewise.
17921 (vqtbl4q_u8): Likewise.
17922 (vqtbl4q_p8): Likewise.
17924 2021-07-23 Haochen Gui <guihaoc@gcc.gnu.org>
17927 * config/rs6000/rs6000.md (cstore<mode>4): Fix wrong fall through.
17929 2021-07-22 Andrew Pinski <apinski@marvell.com>
17931 PR tree-optimization/10153
17932 * tree-tailcall.c (create_tailcall_accumulator):
17933 Don't call fold_convert as the type should be correct already.
17934 (tree_optimize_tail_calls_1): Use build_{one,zero}_cst instead
17935 of integer_{one,zero}_node for the call of create_tailcall_accumulator.
17937 2021-07-22 Aldy Hernandez <aldyh@redhat.com>
17939 * gimple-range-cache.cc (non_null_ref::adjust_range): Replace
17940 varying_p check for null/non-null check.
17942 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17944 PR tree-optimization/101511
17945 * value-relation.cc (relation_oracle::query_relation): Check if ssa1
17946 is in ssa2's equiv set, and don't trap if so.
17948 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17950 PR tree-optimization/101497
17951 * gimple-range-fold.cc (fold_using_range::range_of_cond_expr): Check
17954 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17956 PR tree-optimization/101496
17957 * vr-values.c (simplify_using_ranges::fold_cond): Call range_of_stmt
17958 first, then vrp_visit_cond_Stmt.
17960 2021-07-22 liuhongt <hongtao.liu@intel.com>
17962 * config/i386/i386-expand.c
17963 (ix86_broadcast_from_integer_constant): Rename to ..
17964 (ix86_broadcast_from_constant): .. this, and extend it to
17966 (ix86_expand_vector_move): Extend to float mode.
17967 * config/i386/i386-features.c
17968 (replace_constant_pool_with_broadcast): Remove.
17969 (remove_partial_avx_dependency_gate): Ditto.
17970 (constant_pool_broadcast): Ditto.
17971 (class pass_constant_pool_broadcast): Ditto.
17972 (make_pass_constant_pool_broadcast): Ditto.
17973 (remove_partial_avx_dependency): Adjust gate.
17974 * config/i386/i386-passes.def: Remove pass_constant_pool_broadcast.
17975 * config/i386/i386-protos.h
17976 (make_pass_constant_pool_broadcast): Remove.
17978 2021-07-22 liuhongt <hongtao.liu@intel.com>
17980 * config/i386/constraints.md (Wb): New constraint.
17982 * config/i386/i386.md (*ashlhi3_1): Extend to avx512 mask
17984 (*ashlqi3_1): Ditto.
17985 (*<insn><mode>3_1): Split to ..
17986 (*ashr<mode>3_1): this, ...
17987 (*lshr<mode>3_1): and this, also extend this pattern to avx512
17989 (*<insn><mode>3_1): Split to ..
17990 (*ashr<mode>3_1): this, ...
17991 (*lshrqi3_1): and this, also extend this pattern to avx512
17993 (*lshrhi3_1): And this, also extend this pattern to avx512
17995 * config/i386/sse.md (k<code><mode>): New define_split after
17996 it to convert generic shift pattern to mask shift ones.
17998 2021-07-21 Thomas Schwinge <thomas@codesourcery.com>
17999 Joseph Myers <joseph@codesourcery.com>
18000 Cesar Philippidis <cesar@codesourcery.com>
18002 * tree-core.h (omp_clause_code): Add 'OMP_CLAUSE_NOHOST'.
18003 * tree.c (omp_clause_num_ops, omp_clause_code_name, walk_tree_1):
18005 * tree-pretty-print.c (dump_omp_clause): Likewise.
18006 * omp-general.c (oacc_verify_routine_clauses): Likewise.
18007 * gimplify.c (gimplify_scan_omp_clauses)
18008 (gimplify_adjust_omp_clauses): Likewise.
18009 * tree-nested.c (convert_nonlocal_omp_clauses)
18010 (convert_local_omp_clauses): Likewise.
18011 * omp-low.c (scan_sharing_clauses): Likewise.
18012 * omp-offload.c (execute_oacc_device_lower): Update.
18014 2021-07-21 Martin Sebor <msebor@redhat.com>
18016 * tree-ssa-alias.c (walk_aliased_vdefs_1): Fix typos in a comment.
18018 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18020 * config/rs6000/rs6000-gen-builtins.c (write_init_bif_table):
18023 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18025 * config/rs6000/rs6000-gen-builtins.c (write_fntype): New
18027 (write_fntype_init): New stub function.
18028 (write_init_bif_table): Likewise.
18029 (write_init_ovld_table): New function.
18030 (write_init_file): Implement.
18032 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18034 * config/rs6000/rs6000-gen-builtins.c
18035 (write_autogenerated_header): New function.
18036 (write_decls): Likewise.
18037 (write_extern_fntype): New callback function.
18038 (write_header_file): Implement.
18040 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18042 * config/rs6000/rs6000-gen-builtins.c (write_defines_file):
18045 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18047 * config/rs6000/rs6000-gen-builtins.c (complete_vector_type): New
18049 (complete_base_type): Likewise.
18050 (construct_fntype_id): Likewise.
18051 (parse_bif_entry): Call contruct_fntype_id.
18052 (parse_ovld_entry): Likewise.
18054 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18056 * config/rs6000/rs6000-gen-builtins.c (ovld_stanza): New struct.
18057 (MAXOVLDSTANZAS): New macro.
18058 (ovld_stanzas): New variable.
18059 (curr_ovld_stanza): Likewise.
18060 (MAXOVLDS): New macro.
18061 (ovlddata): New struct.
18062 (ovlds): New variable.
18063 (curr_ovld): Likewise.
18064 (max_ovld_args): Likewise.
18065 (parse_ovld_entry): New function.
18066 (parse_ovld_stanza): Likewise.
18067 (parse_ovld): Implement.
18069 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18071 * config/rs6000/rs6000-gen-builtins.c (parse_bif_attrs):
18074 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18076 * config/rs6000/rs6000-gen-builtins.c (parse_args): New function.
18077 (parse_prototype): Implement.
18079 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18081 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): New enum.
18082 (curr_bif_stanza): New variable.
18083 (stanza_entry): New struct.
18084 (stanza_map): New initialized variable.
18085 (enable_string): Likewise.
18086 (fnkinds): New enum.
18087 (typelist): New struct.
18088 (attrinfo): Likewise.
18089 (MAXRESTROPNDS): New macro.
18090 (prototype): New struct.
18091 (MAXBIFS): New macro.
18092 (bifdata): New struct.
18093 (bifs): New variable.
18094 (curr_bif): Likewise.
18095 (bif_order): Likewise.
18096 (bif_index): Likewise.
18097 (fatal): New function.
18098 (stanza_name_to_stanza): Likewise.
18099 (parse_bif_attrs): New stub function.
18100 (parse_prototype): Likewise.
18101 (parse_bif_entry): New function.
18102 (parse_bif_stanza): Likewise.
18103 (parse_bif): Implement.
18104 (set_bif_order): New function.
18105 (create_bif_order): Implement.
18107 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
18109 * config/rs6000/rs6000-gen-builtins.c (rbtree.h): New #include.
18110 (num_bifs): New variable.
18111 (num_ovld_stanzas): Likewise.
18112 (num_ovlds): Likewise.
18113 (parse_codes): New enum.
18114 (bif_rbt): New variable.
18115 (ovld_rbt): Likewise.
18116 (fntype_rbt): Likewise.
18117 (bifo_rbt): Likewise.
18118 (parse_bif): New stub function.
18119 (create_bif_order): Likewise.
18120 (parse_ovld): Likewise.
18121 (write_header_file): Likewise.
18122 (write_init_file): Likewise.
18123 (write_defines_file): Likewise.
18124 (delete_output_files): New function.
18127 2021-07-21 H.J. Lu <hjl.tools@gmail.com>
18130 * config/i386/i386-builtin.def: Remove OPTION_MASK_ISA_SSE4_2
18131 from CRC32 _builtin functions.
18133 2021-07-21 Sebastian Huber <sebastian.huber@embedded-brains.de>
18135 * coverage.c (build_gcov_info_var_registration): Mark the object placed
18136 in the linker set as referenced so that it does not get optimized away.
18138 2021-07-21 Kito Cheng <kito.cheng@sifive.com>
18141 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
18143 * config.gcc (riscv*-*-*): Detect which python is available.
18145 2021-07-21 Jakub Jelinek <jakub@redhat.com>
18147 PR middle-end/101535
18148 * gimplify.c (omp_check_private): Properly skip ORT_TARGET_DATA
18149 contexts in which decl isn't privatized and for ORT_TARGET return
18150 false if decl is mapped.
18152 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
18154 * gimple-loop-jam.c: Include tree-ssa-sccvn.h.
18155 (tree_loop_unroll_and_jam): Run value-numbering on a loop that
18156 has been successfully unrolled.
18158 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
18160 * tree-ssa-loop-manip.c (determine_exit_conditions): Return a null
18161 exit condition if no tail loop is needed, and if the original exit
18162 condition should therefore be kept as-is.
18163 (tree_transform_and_unroll_loop): Handle that case here too.
18165 2021-07-21 Kewen Lin <linkw@linux.ibm.com>
18167 * tree-data-ref.c (free_dependence_relations): Adjust to pass vec
18169 (free_data_refs): Likewise.
18170 * tree-data-ref.h (free_dependence_relations): Likewise.
18171 (free_data_refs): Likewise.
18172 * tree-predcom.c (struct chain): Use auto_vec instead of vec for
18174 (struct component): Likewise.
18175 (pcom_worker::pcom_worker): Adjust for auto_vec and renaming changes.
18176 (pcom_worker::~pcom_worker): Likewise.
18177 (pcom_worker::release_chain): Adjust as auto_vec changes.
18178 (pcom_worker::loop): Rename to ...
18179 (pcom_worker::m_loop): ... this.
18180 (pcom_worker::datarefs): Rename to ...
18181 (pcom_worker::m_datarefs): ... this. Use auto_vec instead of vec.
18182 (pcom_worker::dependences): Rename to ...
18183 (pcom_worker::m_dependences): ... this. Use auto_vec instead of vec.
18184 (pcom_worker::chains): Rename to ...
18185 (pcom_worker::m_chains): ... this. Use auto_vec instead of vec.
18186 (pcom_worker::looparound_phis): Rename to ...
18187 (pcom_worker::m_looparound_phis): ... this. Use auto_vec instead of
18189 (pcom_worker::cache): Rename to ...
18190 (pcom_worker::m_cache): ... this. Use auto_vec instead of vec.
18191 (pcom_worker::release_chain): Adjust for auto_vec changes.
18192 (pcom_worker::release_chains): Adjust for auto_vec and renaming
18194 (release_component): Remove.
18195 (release_components): Adjust for release_component removal.
18196 (component_of): Adjust to use vec.
18197 (merge_comps): Likewise.
18198 (pcom_worker::aff_combination_dr_offset): Adjust for renaming changes.
18199 (pcom_worker::determine_offset): Likewise.
18200 (class comp_ptrs): Remove.
18201 (pcom_worker::split_data_refs_to_components): Adjust for renaming
18202 changes, for comp_ptrs removal with auto_vec.
18203 (pcom_worker::suitable_component_p): Adjust for renaming changes.
18204 (pcom_worker::filter_suitable_components): Adjust for release_component
18206 (pcom_worker::valid_initializer_p): Adjust for renaming changes.
18207 (pcom_worker::find_looparound_phi): Likewise.
18208 (pcom_worker::add_looparound_copies): Likewise.
18209 (pcom_worker::determine_roots_comp): Likewise.
18210 (pcom_worker::single_nonlooparound_use): Likewise.
18211 (pcom_worker::execute_pred_commoning_chain): Likewise.
18212 (pcom_worker::execute_pred_commoning): Likewise.
18213 (pcom_worker::try_combine_chains): Likewise.
18214 (pcom_worker::prepare_initializers_chain): Likewise.
18215 (pcom_worker::prepare_initializers): Likewise.
18216 (pcom_worker::prepare_finalizers_chain): Likewise.
18217 (pcom_worker::prepare_finalizers): Likewise.
18218 (pcom_worker::tree_predictive_commoning_loop): Likewise.
18220 2021-07-20 Martin Sebor <msebor@redhat.com>
18222 PR middle-end/101397
18223 * builtins.c (gimple_call_return_array): Add argument. Correct
18224 offsets for memchr, mempcpy, stpcpy, and stpncpy.
18225 (compute_objsize_r): Adjust offset computation for argument returning
18228 2021-07-20 Martin Sebor <msebor@redhat.com>
18230 PR middle-end/101300
18231 * tree-ssa-uninit.c (check_defs): Handle UBSAN built-ins.
18233 2021-07-20 Jeff Law <jlaw@localhost.localdomain>
18235 * function.c (assign_parm_setup_block): Use adjust_address instead
18236 of change_address to preserve MEM_EXPR and friends.
18238 2021-07-20 Martin Sebor <msebor@redhat.com>
18240 * cfgloop.h (single_likely_exit): Adjust by-value argument to
18241 by-const-reference.
18242 * cfgloopanal.c (single_likely_exit): Same.
18243 * cgraph.h (struct cgraph_node): Same.
18244 * cgraphclones.c (cgraph_node::create_virtual_clone): Same.
18245 * genautomata.c (merge_states): Same.
18246 * genextract.c (VEC_char_to_string): Same.
18247 * genmatch.c (dt_node::gen_kids_1): Same.
18248 (walk_captures): Adjust by-value argument to by-reference.
18249 * gimple-ssa-store-merging.c (check_no_overlap): Adjust by-value argument
18250 to by-const-reference.
18251 * gimple.c (gimple_build_call_vec): Same.
18252 (gimple_build_call_internal_vec): Same.
18253 (gimple_build_switch): Same.
18254 (sort_case_labels): Same.
18255 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
18257 * gimple.h (gimple_build_call_vec): Adjust by-value argument to
18258 by-const-reference.
18259 (gimple_build_call_internal_vec): Same.
18260 (gimple_build_switch): Same.
18261 (sort_case_labels): Same.
18262 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
18264 * haifa-sched.c (calc_priorities): Adjust by-value argument to
18265 by-const-reference.
18266 (sched_init_luids): Same.
18267 (haifa_init_h_i_d): Same.
18268 * ipa-cp.c (ipa_get_indirect_edge_target_1): Same.
18269 (adjust_callers_for_value_intersection): Adjust by-value argument to
18271 (find_more_scalar_values_for_callers_subset): Adjust by-value argument to
18272 by-const-reference.
18273 (find_more_contexts_for_caller_subset): Same.
18274 (find_aggregate_values_for_callers_subset): Same.
18275 (copy_useful_known_contexts): Same.
18276 * ipa-fnsummary.c (remap_edge_summaries): Same.
18277 (remap_freqcounting_predicate): Same.
18278 * ipa-inline.c (add_new_edges_to_heap): Adjust by-value argument to
18280 * ipa-predicate.c (predicate::remap_after_inlining): Adjust by-value argument
18281 to by-const-reference.
18282 * ipa-predicate.h (predicate::remap_after_inlining): Same.
18283 * ipa-prop.c (ipa_find_agg_cst_for_param): Same.
18284 * ipa-prop.h (ipa_find_agg_cst_for_param): Same.
18285 * ira-build.c (ira_loop_tree_body_rev_postorder): Same.
18286 * read-rtl.c (add_overload_instance): Same.
18287 * rtl.h (native_decode_rtx): Same.
18288 (native_decode_vector_rtx): Same.
18289 * sched-int.h (sched_init_luids): Same.
18290 (haifa_init_h_i_d): Same.
18291 * simplify-rtx.c (native_decode_vector_rtx): Same.
18292 (native_decode_rtx): Same.
18293 * tree-call-cdce.c (gen_shrink_wrap_conditions): Same.
18294 (shrink_wrap_one_built_in_call_with_conds): Same.
18295 (shrink_wrap_conditional_dead_built_in_calls): Same.
18296 * tree-data-ref.c (create_runtime_alias_checks): Same.
18297 (compute_all_dependences): Same.
18298 * tree-data-ref.h (compute_all_dependences): Same.
18299 (create_runtime_alias_checks): Same.
18300 (index_in_loop_nest): Same.
18301 * tree-if-conv.c (mask_exists): Same.
18302 * tree-loop-distribution.c (class loop_distribution): Same.
18303 (loop_distribution::create_rdg_vertices): Same.
18304 (dump_rdg_partitions): Same.
18305 (debug_rdg_partitions): Same.
18306 (partition_contains_all_rw): Same.
18307 (loop_distribution::distribute_loop): Same.
18308 * tree-parloops.c (oacc_entry_exit_ok_1): Same.
18309 (oacc_entry_exit_single_gang): Same.
18310 * tree-ssa-loop-im.c (hoist_memory_references): Same.
18311 (loop_suitable_for_sm): Same.
18312 * tree-ssa-loop-niter.c (bound_index): Same.
18313 * tree-ssa-reassoc.c (update_ops): Same.
18314 (swap_ops_for_binary_stmt): Same.
18315 (rewrite_expr_tree): Same.
18316 (rewrite_expr_tree_parallel): Same.
18317 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Same.
18318 * tree-ssa-sccvn.h (ao_ref_init_from_vn_reference): Same.
18319 * tree-ssa-structalias.c (process_all_all_constraints): Same.
18320 (make_constraints_to): Same.
18321 (handle_lhs_call): Same.
18322 (find_func_aliases_for_builtin_call): Same.
18323 (sort_fieldstack): Same.
18324 (check_for_overlaps): Same.
18325 * tree-vect-loop-manip.c (vect_create_cond_for_align_checks): Same.
18326 (vect_create_cond_for_unequal_addrs): Same.
18327 (vect_create_cond_for_lower_bounds): Same.
18328 (vect_create_cond_for_alias_checks): Same.
18329 * tree-vect-slp-patterns.c (vect_validate_multiplication): Same.
18330 * tree-vect-slp.c (vect_analyze_slp_instance): Same.
18331 (vect_make_slp_decision): Same.
18332 (vect_slp_bbs): Same.
18333 (duplicate_and_interleave): Same.
18334 (vect_transform_slp_perm_load): Same.
18335 (vect_schedule_slp): Same.
18336 * tree-vectorizer.h (vect_transform_slp_perm_load): Same.
18337 (vect_schedule_slp): Same.
18338 (duplicate_and_interleave): Same.
18339 * tree.c (build_vector_from_ctor): Same.
18340 (build_vector): Same.
18341 (check_vector_cst): Same.
18342 (check_vector_cst_duplicate): Same.
18343 (check_vector_cst_fill): Same.
18344 (check_vector_cst_stepped): Same.
18345 * tree.h (build_vector_from_ctor): Same.
18347 2021-07-20 Jakub Jelinek <jakub@redhat.com>
18350 * config/rs6000/rs6000-protos.h (easy_altivec_constant): Change return
18351 type from bool to int.
18352 * config/rs6000/rs6000.c (vspltis_constant): Fix up handling the
18353 EASY_VECTOR_MSB case if either step or copies is not 1.
18354 (vspltis_shifted): Fix comment typo.
18355 (easy_altivec_constant): Change return type from bool to int, instead
18356 of returning true return byte size of the element mode that should be
18357 used to synthetize the constant.
18358 * config/rs6000/predicates.md (easy_vector_constant_msb): Require
18359 that vspltis_shifted is 0, handle the case where easy_altivec_constant
18360 assumes using different vector mode from CONST_VECTOR's mode.
18361 * config/rs6000/altivec.md (easy_vector_constant_msb splitter): Use
18362 easy_altivec_constant to determine mode in which -1 >> -1 should be
18363 performed, use rs6000_expand_vector_init instead of gen_vec_initv4sisi.
18365 2021-07-20 Richard Biener <rguenther@suse.de>
18368 * dwarf2out.h (dwarf_file_data): Add key member.
18369 * dwarf2out.c (dwarf_file_hasher::equal): Compare key.
18370 (dwarf_file_hasher::hash): Hash key.
18371 (lookup_filename): Remap the filename and store it in the
18372 filename member of dwarf_file_data when creating a new
18374 (file_name_acquire): Do not remap the filename again.
18375 (maybe_emit_file): Likewise.
18377 2021-07-20 Jonathan Wright <jonathan.wright@arm.com>
18379 * config/aarch64/aarch64-simd-builtins.def: Use two variant
18380 generators for all TBL/TBX intrinsics and rename to
18381 consistent forms: qtbl[1234] or qtbx[1234].
18382 * config/aarch64/aarch64-simd.md (aarch64_tbl1<mode>):
18384 (aarch64_qtbl1<mode>): This.
18385 (aarch64_tbx1<mode>): Rename to...
18386 (aarch64_qtbx1<mode>): This.
18387 (aarch64_tbl2v16qi): Delete.
18388 (aarch64_tbl3<mode>): Rename to...
18389 (aarch64_qtbl2<mode>): This.
18390 (aarch64_tbx4<mode>): Rename to...
18391 (aarch64_qtbx2<mode>): This.
18392 * config/aarch64/aarch64.c (aarch64_expand_vec_perm_1): Use
18393 renamed qtbl1 and qtbl2 RTL patterns.
18394 * config/aarch64/arm_neon.h (vqtbl1_p8): Use renamed qtbl1
18396 (vqtbl1_s8): Likewise.
18397 (vqtbl1_u8): Likewise.
18398 (vqtbl1q_p8): Likewise.
18399 (vqtbl1q_s8): Likewise.
18400 (vqtbl1q_u8): Likewise.
18401 (vqtbx1_s8): Use renamed qtbx1 RTL pattern.
18402 (vqtbx1_u8): Likewise.
18403 (vqtbx1_p8): Likewise.
18404 (vqtbx1q_s8): Likewise.
18405 (vqtbx1q_u8): Likewise.
18406 (vqtbx1q_p8): Likewise.
18407 (vtbl1_s8): Use renamed qtbl1 RTL pattern.
18408 (vtbl1_u8): Likewise.
18409 (vtbl1_p8): Likewise.
18410 (vtbl2_s8): Likewise
18411 (vtbl2_u8): Likewise.
18412 (vtbl2_p8): Likewise.
18413 (vtbl3_s8): Use renamed qtbl2 RTL pattern.
18414 (vtbl3_u8): Likewise.
18415 (vtbl3_p8): Likewise.
18416 (vtbl4_s8): Likewise.
18417 (vtbl4_u8): Likewise.
18418 (vtbl4_p8): Likewise.
18419 (vtbx2_s8): Use renamed qtbx2 RTL pattern.
18420 (vtbx2_u8): Likewise.
18421 (vtbx2_p8): Likewise.
18422 (vqtbl2_s8): Use renamed qtbl2 RTL pattern.
18423 (vqtbl2_u8): Likewise.
18424 (vqtbl2_p8): Likewise.
18425 (vqtbl2q_s8): Likewise.
18426 (vqtbl2q_u8): Likewise.
18427 (vqtbl2q_p8): Likewise.
18428 (vqtbx2_s8): Use renamed qtbx2 RTL pattern.
18429 (vqtbx2_u8): Likewise.
18430 (vqtbx2_p8): Likewise.
18431 (vqtbx2q_s8): Likewise.
18432 (vqtbx2q_u8): Likewise.
18433 (vqtbx2q_p8): Likewise.
18434 (vtbx4_s8): Likewise.
18435 (vtbx4_u8): Likewise.
18436 (vtbx4_p8): Likewise.
18438 2021-07-20 Uroš Bizjak <ubizjak@gmail.com>
18441 * config/i386/sync.md (define_peephole2 atomic_storedi_fpu):
18443 (define_peephole2 atomic_loaddi_fpu): Ditto.
18445 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
18447 * config.gcc (riscv*-*-*): Detect which python is available.
18449 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
18451 * config/rs6000/vsx.md (mulhs_<mode>): Rename to...
18452 (smul<mode>3_highpart): ... this.
18453 (mulhu_<mode>): Rename to...
18454 (umul<mode>3_highpart): ... this.
18455 * config/rs6000/rs6000-builtin.def (MULHS_V2DI, MULHS_V4SI,
18456 MULHU_V2DI, MULHU_V4SI): Adjust.
18458 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
18460 PR tree-optimization/100696
18461 * internal-fn.c (first_commutative_argument): Add info for IFN_MULH.
18462 * internal-fn.def (IFN_MULH): New internal function.
18463 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Add support to
18464 recog normal multiply highpart as IFN_MULH.
18465 * config/i386/i386.c (ix86_add_stmt_cost): Adjust for combined
18468 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
18470 * config/elfos.h (CTF_DEBUGGING_INFO): New definition.
18471 (BTF_DEBUGGING_INFO): Likewise.
18472 * doc/tm.texi.in: Document the new macros.
18473 * doc/tm.texi: Regenerated.
18474 * toplev.c: Guard initialization of debug hooks.
18476 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
18478 * flags.h (ctf_debuginfo_p): New function declaration.
18479 * opts.c (ctf_debuginfo_p): New function definition.
18481 2021-07-19 Andrew Stubbs <ams@codesourcery.com>
18484 * config/gcn/gcn-hsa.h (DRIVER_SELF_SPECS): New.
18485 (ASM_SPEC): Set -mattr for xnack and sram-ecc.
18486 * config/gcn/gcn-opts.h (enum sram_ecc_type): New.
18487 * config/gcn/gcn-valu.md: Add a warning comment.
18488 * config/gcn/gcn.c (gcn_option_override): Add "sorry" for -mxnack.
18489 (output_file_start): Add xnack and sram-ecc state to ".amdgcn_target".
18490 * config/gcn/gcn.md: Add a warning comment.
18491 * config/gcn/gcn.opt: Add -mxnack and -msram-ecc.
18492 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): Remove
18494 (EF_AMDGPU_XNACK): New.
18495 (EF_AMDGPU_SRAM_ECC): New.
18497 (copy_early_debug_info): Use elf_flags.
18498 (main): Handle -mxnack and -msram-ecc options.
18499 * doc/invoke.texi: Document -mxnack and -msram-ecc.
18501 2021-07-19 Andrew Pinski <apinski@marvell.com>
18504 * config/aarch64/aarch64.md (csneg3_uxtw_insn): Rename to ...
18505 (*cs<neg_not_cs>3_uxtw_insn4): and extend to NEG_NOT.
18507 2021-07-19 Richard Biener <rguenther@suse.de>
18509 PR tree-optimization/101505
18510 * tree-vect-patterns.c (vect_determine_precisions): Walk
18511 PHIs also for loop vectorization.
18513 2021-07-19 Richard Biener <rguenther@suse.de>
18515 * gimple.h (gimple_expr_type): Remove.
18516 * doc/gimple.texi: Remove gimple_expr_type documentation.
18518 2021-07-19 Richard Biener <rguenther@suse.de>
18520 * tree-ssa-sccvn.c (vn_reference_eq): Handle NULL vr->type.
18521 (ao_ref_init_from_vn_reference): Likewise.
18522 (fully_constant_reference): Likewise.
18523 (vn_reference_lookup_call): Do not set vr->type to random
18525 * tree-ssa-pre.c (compute_avail): Do not try to PRE calls
18527 * tree-vect-generic.c (expand_vector_piecewise): Pass in
18528 whether we expanded parallel.
18529 (expand_vector_parallel): Adjust.
18530 (expand_vector_addition): Likewise.
18531 (expand_vector_comparison): Likewise.
18532 (expand_vector_operation): Likewise.
18533 (expand_vector_scalar_condition): Likewise.
18534 (expand_vector_conversion): Likewise.
18536 2021-07-19 Richard Biener <rguenther@suse.de>
18538 * tree-vrp.c (register_edge_assert_for_2): Use the
18540 (vrp_folder::fold_predicate_in): Likewise.
18541 * vr-values.c (gimple_assign_nonzero_p): Likewise.
18542 (vr_values::extract_range_from_comparison): Likewise.
18543 (vr_values::extract_range_from_ubsan_builtin): Use the
18544 type of the first operand.
18545 (vr_values::extract_range_basic): Push down type
18546 computation, use the appropriate LHS.
18547 (vr_values::extract_range_from_assignment): Use the
18550 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
18553 * common/config/i386/i386-common.c (ix86_handle_option): For
18554 -mgeneral-regs-only, enable the GPR only instructions which are
18555 enabled implicitly by SSE ISAs unless they have been disabled
18558 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
18561 * config/i386/i386.c (ix86_check_avx_upper_stores): Moved before
18562 ix86_avx_u128_mode_needed.
18563 (ix86_avx_u128_mode_needed): Return AVX_U128_DIRTY if callee
18564 returns AVX register.
18566 2021-07-17 Jan Hubicka <hubicka@ucw.cz>
18568 * tree-ssa-structalias.c (handle_rhs_call): Support EAF_NOT_RETURNED.
18569 (handle_const_call): Liekise
18570 (handle_pure_call): Liekise
18572 2021-07-17 Andrew MacLeod <amacleod@redhat.com>
18574 PR tree-optimization/96542
18575 * range-op.cc (range_operator::wi_fold_in_parts): New.
18576 (range_operator::fold_range): Call wi_fold_in_parts.
18577 (operator_lshift::wi_fold): Fix broken lshift by [0,0].
18578 * range-op.h (wi_fold_in_parts): Add prototype.
18580 2021-07-16 David Malcolm <dmalcolm@redhat.com>
18582 * doc/analyzer.texi: Add __analyzer_dump_state.
18584 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18586 * config/rs6000/rbtree.c: New file.
18587 * config/rs6000/rbtree.h: New file.
18589 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18591 * config/rs6000/rs6000-gen-builtins.c (restriction): New enum.
18592 (typeinfo): Add restr field.
18593 (match_bracketed_pair): New function.
18594 (match_const_restriction): Implement.
18596 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18598 * config/rs6000/rs6000-gen-builtins.c (match_basetype): Implement.
18600 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18602 * config/rs6000/rs6000-gen-builtins.c (void_status): New enum.
18603 (basetype): Likewise.
18604 (typeinfo): Likewise.
18605 (handle_pointer): New function.
18606 (match_basetype): New stub function.
18607 (match_const_restriction): Likewise.
18608 (match_type): New function.
18610 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18612 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace): New
18614 (advance_line): Likewise.
18615 (safe_inc_pos): Likewise.
18616 (match_identifier): Likewise.
18617 (match_integer): Likewise.
18618 (match_to_right_bracket): Likewise.
18620 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18622 * config/rs6000/rs6000-gen-builtins.c (bif_file): New variable.
18623 (ovld_file): Likewise.
18624 (header_file): Likewise.
18625 (init_file): Likewise.
18626 (defines_file): Likewise.
18627 (pgm_path): Likewise.
18628 (bif_path): Likewise.
18629 (ovld_path): Likewise.
18630 (header_path): Likewise.
18631 (init_path): Likewise.
18632 (defines_path): Likewise.
18633 (LINELEN): New macro.
18634 (linebuf): New variable.
18638 (bif_diag): New function.
18639 (ovld_diag): Likewise.
18641 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18643 * config/rs6000/rs6000-builtin-new.def: New.
18644 * config/rs6000/rs6000-overload.def: New.
18646 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18648 * config/rs6000/rs6000-gen-builtins.c: New.
18650 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18652 * Makefile.in (EXTRA_GTYPE_DEPS): New variable.
18653 (s-gtype): Depend on EXTRA_GTYPE_DEPS.
18654 * gengtype-state.c (state_writer::write_state_file_list): Add a
18655 parameter to the fileslist expression for the number of build
18657 (read_state_files_list): Detect build headers and strip the
18658 initial "./" or ".\" from their names.
18659 * gengtype.c (build_headers): New global variable.
18660 (num_build_headers): Likewise.
18661 (open_base_files): Emit #include for each build header.
18662 (main): Detect and count build headers.
18663 * gengtype.h (build_headers): New extern variable.
18664 (num_build_headers): Likewise.
18666 2021-07-16 Richard Biener <rguenther@suse.de>
18668 * gimple-ssa-store-merging.c (verify_symbolic_number_p): Use
18669 the type of the LHS.
18670 (find_bswap_or_nop_1): Likewise.
18671 (find_bswap_or_nop): Likewise.
18672 * tree-vectorizer.h (vect_get_smallest_scalar_type): Adjust
18674 * tree-vect-data-refs.c (vect_get_smallest_scalar_type):
18675 Remove unused parameters, pass in the scalar type. Fix
18676 internal store function handling.
18677 * tree-vect-stmts.c (vect_analyze_stmt): Remove assert.
18678 (vect_get_vector_types_for_stmt): Move down check for
18679 existing vector stmt after we've determined a scalar type.
18680 Pass down the used scalar type to vect_get_smallest_scalar_type.
18681 * tree-vect-generic.c (expand_vector_condition): Use
18682 the type of the LHS.
18683 (expand_vector_scalar_condition): Likewise.
18684 (expand_vector_operations_1): Likewise.
18685 * tree-vect-patterns.c (vect_widened_op_tree): Likewise.
18686 (vect_recog_dot_prod_pattern): Likewise.
18687 (vect_recog_sad_pattern): Likewise.
18688 (vect_recog_widen_op_pattern): Likewise.
18689 (vect_recog_widen_sum_pattern): Likewise.
18690 (vect_recog_mixed_size_cond_pattern): Likewise.
18692 2021-07-16 Jan Hubicka <hubicka@ucw.cz>
18694 * ipa-modref.c (struct escape_entry): Use eaf_fleags_t.
18695 (dump_eaf_flags): Dump EAF_NOT_RETURNED
18696 (eaf_flags_useful_p): Use eaf_fleags_t; handle const functions
18697 and EAF_NOT_RETURNED.
18698 (modref_summary::useful_p): Likewise.
18699 (modref_summary_lto::useful_p): Likewise.
18700 (struct) modref_summary_lto: Use eaf_fleags_t.
18701 (deref_flags): Handle EAF_NOT_RETURNED.
18702 (struct escape_point): Use min_flags.
18703 (modref_lattice::init): Add EAF_NOT_RETURNED.
18704 (merge_call_lhs_flags): Ignore EAF_NOT_RETURNED functions
18705 (analyze_ssa_name_flags): Clear EAF_NOT_RETURNED on return;
18707 (analyze_parms): Also analyze const functions; update conition on
18709 (modref_write): Update streaming.
18710 (read_section): Update streaming.
18711 (remap_arg_flags): Use eaf_flags_t.
18712 (modref_merge_call_site_flags): Hanlde EAF_NOT_RETURNED.
18713 * ipa-modref.h: (eaf_flags_t): New typedef.
18714 (struct modref_summary): Use eaf_flags_t.
18715 * tree-core.h (EAF_NOT_RETURNED): New constant.
18717 2021-07-16 Richard Biener <rguenther@suse.de>
18719 * gimple-fold.c (gimple_fold_stmt_to_constant_1): Use
18720 the type of the LHS.
18721 (gimple_assign_nonnegative_warnv_p): Likewise.
18722 (gimple_call_nonnegative_warnv_p): Likewise. Return false
18723 if the call has no LHS.
18724 * gimple.c (gimple_could_trap_p_1): Use the type of the LHS.
18725 * tree-eh.c (stmt_could_throw_1_p): Likewise.
18726 * tree-inline.c (insert_init_stmt): Likewise.
18727 * tree-ssa-loop-niter.c (get_val_for): Likewise.
18728 * tree-outof-ssa.c (ssa_is_replaceable_p): Use the type of
18730 * tree-ssa-sccvn.c (init_vn_nary_op_from_stmt): Take a
18731 gassign *. Use the type of the lhs.
18732 (vn_nary_op_lookup_stmt): Adjust.
18733 (vn_nary_op_insert_stmt): Likewise.
18735 2021-07-16 Ilya Leoshkevich <iii@linux.ibm.com>
18737 * config/s390/predicates.md (bras_sym_operand): Accept all
18738 functions in 64-bit mode, use UNSPEC_PLT31.
18739 (larl_operand): Use UNSPEC_PLT31.
18740 * config/s390/s390.c (s390_loadrelative_operand_p): Likewise.
18741 (legitimize_pic_address): Likewise.
18742 (s390_emit_tls_call_insn): Mark __tls_get_offset as function,
18744 (s390_delegitimize_address): Use UNSPEC_PLT31.
18745 (s390_output_addr_const_extra): Likewise.
18746 (print_operand): Add @PLT to TLS calls, handle %K.
18747 (s390_function_profiler): Mark __fentry__/_mcount as function,
18748 use %K, use UNSPEC_PLT31.
18749 (s390_output_mi_thunk): Use only UNSPEC_GOT, use %K.
18750 (s390_emit_call): Use UNSPEC_PLT31.
18751 (s390_emit_tpf_eh_return): Mark __tpf_eh_return as function.
18752 * config/s390/s390.md (UNSPEC_PLT31): Rename from UNSPEC_PLT.
18753 (*movdi_64): Use %K.
18754 (reload_base_64): Likewise.
18755 (*sibcall_brc): Likewise.
18756 (*sibcall_brcl): Likewise.
18757 (*sibcall_value_brc): Likewise.
18758 (*sibcall_value_brcl): Likewise.
18760 (*brasl): Likewise.
18761 (*bras_r): Likewise.
18762 (*brasl_r): Likewise.
18763 (*bras_tls): Likewise.
18764 (*brasl_tls): Likewise.
18765 (main_base_64): Likewise.
18766 (reload_base_64): Likewise.
18767 (@split_stack_call<mode>): Likewise.
18769 2021-07-16 Richard Biener <rguenther@suse.de>
18771 PR tree-optimization/101467
18772 * tree-vect-stmts.c (vect_gen_while): Properly guard
18773 make_temp_ssa_name usage.
18775 2021-07-16 Cooper Qu <cooper.qu@linux.alibaba.com>
18777 * config.gcc: Don't use forked print-sysroot-suffix.sh and
18778 t-sysroot-suffix for C-SKY.
18779 * config/csky/print-sysroot-suffix.sh: Delete.
18780 * config/csky/t-csky-linux: Delete.
18781 * config/csky/t-sysroot-suffix: Define MULTILIB_DIRNAMES
18782 instead of CSKY_MULTILIB_DIRNAMES.
18784 2021-07-16 Richard Biener <rguenther@suse.de>
18786 * tree-vect-loop.c (vect_transform_cycle_phi): Correct sign
18787 conversion issues with the partial reduction of the reused
18788 vector accumulator.
18790 2021-07-16 Richard Biener <rguenther@suse.de>
18792 * config/i386/i386-options.c (ix86_option_override_internal): Set
18793 param_vect_partial_vector_usage to zero if not set.
18795 2021-07-15 Uroš Bizjak <ubizjak@gmail.com>
18798 * config/i386/i386.h (VALID_SSE_REG_MODE): Add TDmode.
18799 (VALID_INT_MODE_P): Add SDmode and DDmode.
18800 Add TDmode for TARGET_64BIT.
18801 (VALID_DFP_MODE_P): Remove.
18802 * config/i386/i386.c (ix86_hard_regno_mode_ok):
18803 Do not use VALID_DFP_MODE_P.
18805 2021-07-15 Andrew MacLeod <amacleod@redhat.com>
18807 * gimple-range-fold.cc (adjust_pointer_diff_expr): Use
18809 (fold_using_range::fold_stmt): Ditto.
18810 (fold_using_range::range_of_range_op): Ditto.
18811 (fold_using_range::range_of_phi): Ditto.
18812 (fold_using_range::range_of_call): Ditto.
18813 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
18814 (fold_using_range::range_of_builtin_call): Ditto.
18815 (fold_using_range::range_of_cond_expr): Ditto.
18816 * gimple-range-fold.h (gimple_range_type): New.
18818 2021-07-15 Martin Sebor <msebor@redhat.com>
18820 PR middle-end/97027
18821 * tree-ssa-strlen.c (handle_assign): New function.
18822 (maybe_warn_overflow): Add argument.
18823 (nonzero_bytes_for_type): New function.
18824 (count_nonzero_bytes): Handle more tree types. Call
18825 nonzero_bytes_for_tye.
18826 (count_nonzero_bytes): Handle types.
18827 (handle_store): Handle stores from function calls.
18828 (strlen_check_and_optimize_call): Move code to handle_assign. Call
18829 it for assignments from function calls.
18831 2021-07-15 David Malcolm <dmalcolm@redhat.com>
18836 * doc/invoke.texi: Add -Wanalyzer-use-of-uninitialized-value.
18838 2021-07-15 David Malcolm <dmalcolm@redhat.com>
18840 * doc/invoke.texi (-fdump-analyzer-exploded-paths): New.
18842 2021-07-15 Martin Sebor <msebor@redhat.com>
18846 * fold-const.c (operand_compare::operand_equal_p): Handle OEP_DECL_NAME.
18847 (operand_compare::verify_hash_value): Same.
18848 * tree-core.h (OEP_DECL_NAME): New.
18850 2021-07-15 Martin Jambor <mjambor@suse.cz>
18852 * profile-count.h (profile_count::value): Change the return type to
18854 * gimple-pretty-print.c (dump_gimple_bb_header): Adjust print
18856 * tree-cfg.c (dump_function_to_file): Likewise.
18858 2021-07-15 Bill Schmidt <wschmidt@linux.ibm.com>
18861 * config/rs6000/rs6000-p8swap.c (has_part_mult): New.
18862 (rs6000_analyze_swaps): Insns containing a subreg of a mult are
18865 2021-07-15 Richard Biener <rguenther@suse.de>
18867 * tree-vectorizer.h (vect_gen_while): Match up with
18868 vect_gen_while_not.
18869 * tree-vect-stmts.c (vect_gen_while): Adjust API to that
18870 of vect_gen_while_not.
18871 (vect_gen_while_not): Adjust.
18872 * tree-vect-loop-manip.c (vect_set_loop_controls_directly): Likewise.
18874 2021-07-15 Aldy Hernandez <aldyh@redhat.com>
18876 * gimple-range-cache.cc (non_null_ref::adjust_range): New.
18877 (ranger_cache::range_of_def): Call adjust_range.
18878 (ranger_cache::entry_range): Same.
18879 * gimple-range-cache.h (non_null_ref::adjust_range): New.
18880 * gimple-range.cc (gimple_ranger::range_of_expr): Call
18882 (gimple_ranger::range_on_entry): Same.
18884 2021-07-15 Tamar Christina <tamar.christina@arm.com>
18887 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18889 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
18891 2021-07-15 Tamar Christina <tamar.christina@arm.com>
18894 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18896 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
18897 (sdot_prod, udot_prod): ...These.
18898 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
18899 (aarch64_<sur>dot<vsi2qi>): Rename to...
18900 (<sur>dot_prod<vsi2qi>): ...This.
18901 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
18904 2021-07-15 Jakub Jelinek <jakub@redhat.com>
18906 PR middle-end/101437
18907 * gimplify.c (gimplify_expr): Throw away volatile reads from empty
18908 types even if they have non-BLKmode TYPE_MODE.
18910 2021-07-15 Richard Biener <rguenther@suse.de>
18913 * gcc.c (process_command): Process -gtoggle like process_options
18914 would after parsing options.
18916 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18918 * cfgexpand.c (expand_asm_loc): Adjust.
18919 (expand_asm_stmt): Likewise.
18920 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Likewise.
18921 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
18922 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
18923 * config/avr/avr.c (avr_md_asm_adjust): Likewise.
18924 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
18925 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
18926 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
18927 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
18928 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
18929 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
18930 * config/s390/s390.c (s390_md_asm_adjust): Likewise.
18931 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
18932 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
18933 * doc/tm.texi: Regenerate.
18934 * target.def: Add location argument to md_asm_adjust.
18936 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18938 * tree-diagnostic.c (diagnostic_report_current_function): Use the
18939 diagnostic's location, not input_location.
18941 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18943 * cfgexpand.c (tree_conflicts_with_clobbers_p): Pass location to
18945 (expand_asm_stmt): Likewise.
18947 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
18949 * config/rs6000/rs6000.c (adjacent_mem_locations): Return the lower
18950 addressed memory rtx, if any.
18951 (rs6000_split_multireg_move): Fix code formatting.
18952 Handle MMA build built-ins with operands in adjacent memory locations.
18954 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
18956 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Move to later
18959 2021-07-14 Jason Merrill <jason@redhat.com>
18961 * sel-sched-ir.h (get_all_loop_exits): Use auto_vec.
18963 2021-07-14 Jason Merrill <jason@redhat.com>
18965 * doc/invoke.texi: -fdelete-dead-exceptions is on by default for
18968 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18970 * tree-vect-patterns.c (vect_recog_dot_prod_pattern):
18971 Remove erroneous line.
18973 2021-07-14 Andrew MacLeod <amacleod@redhat.com>
18975 * params.opt (param_evrp_mode): Change default.
18977 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18979 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
18980 (sdot_prod, udot_prod): ...These.
18981 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
18982 (aarch64_<sur>dot<vsi2qi>): Rename to...
18983 (<sur>dot_prod<vsi2qi>): ...This.
18984 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
18987 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18989 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
18991 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18993 * doc/sourcebuild.texi (arm_v8_2a_i8mm_neon_hw): Document.
18995 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18997 * config/arm/neon.md (usdot_prod<vsi2qi>): New.
18999 2021-07-14 Tamar Christina <tamar.christina@arm.com>
19001 * config/aarch64/aarch64-simd.md (aarch64_usdot<vsi2qi>): Rename to...
19002 (usdot_prod<vsi2qi>): ... This.
19003 * config/aarch64/aarch64-simd-builtins.def (usdot): Rename to...
19004 (usdot_prod): ...This.
19005 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Likewise.
19006 * config/aarch64/aarch64-sve.md (@aarch64_<sur>dot_prod<vsi2qi>):
19008 (@<sur>dot_prod<vsi2qi>): ...This.
19009 * config/aarch64/aarch64-sve-builtins-base.cc
19010 (svusdot_impl::expand): Use it.
19012 2021-07-14 Tamar Christina <tamar.christina@arm.com>
19014 * optabs.def (usdot_prod_optab): New.
19015 * doc/md.texi: Document it and clarify other dot prod optabs.
19016 * optabs-tree.h (enum optab_subtype): Add optab_vector_mixed_sign.
19017 * optabs-tree.c (optab_for_tree_code): Support usdot_prod_optab.
19018 * optabs.c (expand_widen_pattern_expr): Likewise.
19019 * tree-cfg.c (verify_gimple_assign_ternary): Likewise.
19020 * tree-vect-loop.c (vectorizable_reduction): Query dot-product kind.
19021 * tree-vect-patterns.c (vect_supportable_direct_optab_p): Take optional
19023 (vect_widened_op_tree): Optionally ignore
19025 (vect_recog_dot_prod_pattern): Support usdot_prod_optab.
19027 2021-07-14 H.J. Lu <hjl.tools@gmail.com>
19030 * config/i386/driver-i386.c (host_detect_local_cpu): Check
19031 "arch [32|64]" and "tune [32|64]" for 32-bit and 64-bit codegen.
19032 Enable UINTR only for 64-bit codegen.
19033 * config/i386/i386-options.c
19034 (ix86_option_override_internal::DEF_PTA): Skip PTA_UINTR if not
19036 * config/i386/i386.h (ARCH_ARG): New.
19037 (CC1_CPU_SPEC): Pass "[arch|tune] 32" for 32-bit codegen and
19038 "[arch|tune] 64" for 64-bit codegen.
19040 2021-07-14 Richard Biener <rguenther@suse.de>
19042 PR tree-optimization/101445
19043 * tree-vect-stmts.c (vectorizable_load): Do the gap adjustment
19044 of the IV in the correct direction for negative stride
19047 2021-07-14 Jakub Jelinek <jakub@redhat.com>
19050 * godump.c (godump_str_hash): New type.
19051 (godump_container::pot_dummy_types): Use string_hash instead of
19052 ptr_hash in the hash_set.
19054 2021-07-14 Richard Biener <rguenther@suse.de>
19056 * tree-vect-loop.c (vect_find_reusable_accumulator): Handle
19057 vector types where the old vector type has a multiple of
19058 the new vector type elements.
19059 (vect_create_partial_epilog): New function, split out from...
19060 (vect_create_epilog_for_reduction): ... here.
19061 (vect_transform_cycle_phi): Reduce the re-used accumulator
19062 to the new vector type.
19064 2021-07-14 Alexandre Oliva <oliva@adacore.com>
19066 * tree-ssa-alias.c (attr_fnspec::verify): Fix index in
19067 non-'t'-sized arg check.
19069 2021-07-14 Alexandre Oliva <oliva@adacore.com>
19071 * tree-cfg.c (cleanup_dead_labels_eh): Update
19072 post_landing_pad label upon change of landing pad block's
19074 (cleanup_dead_labels): Check that a removed label is not that
19077 2021-07-13 Jonathan Wright <jonathan.wright@arm.com>
19079 * combine.c (combine_simplify_rtx): Add vec_select -> subreg
19081 * config/aarch64/aarch64.md (*zero_extend<SHORT:mode><GPI:mode>2_aarch64):
19082 Add Neon to general purpose register case for zero-extend
19084 * config/arm/vfp.md (*arm_movsi_vfp): Remove "*" from *t -> r
19085 case to prevent some cases opting to go through memory.
19086 * cse.c (fold_rtx): Add vec_select -> subreg simplification.
19087 * rtl.c (rtvec_series_p): Define predicate to determine
19088 whether a vector contains a linear series of integers.
19089 * rtl.h (rtvec_series_p): Define.
19090 * rtlanal.c (vec_series_lowpart_p): Define predicate to
19091 determine if a vector selection is equivalent to the low part
19093 * rtlanal.h (vec_series_lowpart_p): Define.
19094 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
19095 Add vec_select -> subreg simplification.
19097 2021-07-13 Paul A. Clarke <pc@us.ibm.com>
19099 * config/rs6000/smmintrin.h (_mm_testz_si128, _mm_testc_si128,
19100 _mm_testnzc_si128, _mm_test_all_ones, _mm_test_all_zeros,
19101 _mm_test_mix_ones_zeros): New.
19103 2021-07-13 Roger Sayle <roger@nextmovesoftware.com>
19104 Richard Biener <rguenther@suse.de>
19106 * gimple.c (gimple_could_trap_p_1): Make S argument a
19107 "const gimple*". Preserve constness in call to
19108 gimple_asm_volatile_p.
19109 (gimple_could_trap_p): Make S argument a "const gimple*".
19110 * gimple.h (gimple_could_trap_p_1, gimple_could_trap_p):
19111 Update function prototypes.
19113 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19115 * tree-vectorizer.h (vect_reusable_accumulator): New structure.
19116 (_loop_vec_info::main_loop_edge): New field.
19117 (_loop_vec_info::skip_main_loop_edge): Likewise.
19118 (_loop_vec_info::skip_this_loop_edge): Likewise.
19119 (_loop_vec_info::reusable_accumulators): Likewise.
19120 (_stmt_vec_info::reduc_scalar_results): Likewise.
19121 (_stmt_vec_info::reused_accumulator): Likewise.
19122 (vect_get_main_loop_result): Declare.
19123 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize
19124 reduc_scalar_inputs.
19125 (vec_info::free_stmt_vec_info): Free reduc_scalar_inputs.
19126 * tree-vect-loop-manip.c (vect_get_main_loop_result): New function.
19127 (vect_do_peeling): Fill an epilogue loop's main_loop_edge,
19128 skip_main_loop_edge and skip_this_loop_edge fields.
19129 * tree-vect-loop.c (INCLUDE_ALGORITHM): Define.
19130 (vect_emit_reduction_init_stmts): New function.
19131 (get_initial_def_for_reduction): Use it.
19132 (get_initial_defs_for_reduction): Likewise. Change the vinfo
19133 parameter to a loop_vec_info.
19134 (vect_create_epilog_for_reduction): Store the scalar results
19135 in the reduc_info. If an epilogue loop is reusing an accumulator
19136 from the main loop, and if the epilogue loop can also be skipped,
19137 try to place the reduction code in the join block. Record
19138 accumulators that could potentially be reused by epilogue loops.
19139 (vect_transform_cycle_phi): When vectorizing epilogue loops,
19140 try to reuse accumulators from the main loop. Record the initial
19141 value in reduc_info for non-SLP reductions too.
19143 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19145 * tree-vect-loop.c (get_initial_def_for_reduction): Remove
19146 adjustment handling. Take the neutral value as an argument,
19147 in place of the code argument.
19148 (vect_transform_cycle_phi): Update accordingly. Handle the
19149 initial values of cond reductions separately from code reductions.
19150 Choose the adjustment here rather than in
19151 get_initial_def_for_reduction. Sink the splat of vec_initial_def.
19153 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19155 * tree-vect-loop.c (neutral_op_for_slp_reduction): Replace with...
19156 (neutral_op_for_reduction): ...this, providing a more general
19158 (vect_create_epilog_for_reduction): Update accordingly.
19159 (vectorizable_reduction): Likewise.
19160 (vect_transform_cycle_phi): Likewise.
19162 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19164 * tree-vect-loop.c (get_initial_def_for_reduction): Take the
19165 reduc_info instead of the original stmt_vec_info.
19166 (vect_transform_cycle_phi): Update accordingly.
19168 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19170 * tree-vect-loop.c (get_initial_defs_for_reduction): Take the
19171 reduc_info as an additional parameter.
19172 (vect_transform_cycle_phi): Update accordingly.
19174 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19176 * tree-vectorizer.h: Include tree-ssa-operands.h.
19177 (vect_phi_initial_value): New function.
19178 * tree-vect-loop.c (neutral_op_for_slp_reduction): Use it.
19179 (get_initial_defs_for_reduction, info_for_reduction): Likewise.
19180 (vect_create_epilog_for_reduction, vectorizable_reduction): Likewise.
19181 (vect_transform_cycle_phi, vectorizable_induction): Likewise.
19183 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19185 * tree-vect-loop.c (vect_create_epilog_for_reduction): Convert
19186 the phi results to vectype after creating them. Remove later
19187 conversion code that thus becomes redundant.
19189 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19191 * tree-vect-loop.c (vect_create_epilog_for_reduction): Replace
19192 the new_phis vector with a reduc_inputs vector. Combine handling
19193 of reduction chains and ncopies > 1.
19195 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19197 * tree-vect-loop.c (vect_create_epilog_for_reduction): Truncate
19198 scalar_results to group_size elements after reducing down from
19199 N*group_size elements. Construct an array_slice of the live-out
19200 stmts and assert that there is one stmt per scalar result.
19202 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19204 * tree-vect-loop.c (vect_create_epilog_for_reduction): Remove
19205 nested_in_vect_loop and use double_reduc everywhere. Remove dead
19206 assignment to "loop".
19208 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
19210 * internal-fn.c (vectorized_internal_fn_supported_p): Handle
19211 vector types first. For scalar types, consider both the preferred
19212 vector mode and the alternative vector modes.
19213 * optabs-query.c (can_vec_mask_load_store_p): Use the same
19214 structure as above, in particular using related_vector_mode
19215 for modes provided by autovectorize_vector_modes.
19217 2021-07-13 Jakub Jelinek <jakub@redhat.com>
19218 Richard Biener <rguenther@suse.de>
19220 PR tree-optimization/101419
19221 * tree-pass.h (PROP_objsz): Define.
19222 (make_pass_early_object_sizes): Declare.
19223 * passes.def (pass_all_early_optimizations): Rename pass_object_sizes
19224 there to pass_early_object_sizes, drop parameter.
19225 (pass_all_optimizations): Move pass_object_sizes right after pass_ccp,
19226 drop parameter, move pass_post_ipa_warn right after that.
19227 * tree-object-size.c (pass_object_sizes::execute): Rename to...
19228 (object_sizes_execute): ... this. Add insert_min_max_p argument.
19229 (pass_data_object_sizes): Move after object_sizes_execute.
19230 (pass_object_sizes): Likewise. In execute method call
19231 object_sizes_execute, drop set_pass_param method and insert_min_max_p
19232 non-static data member and its initializer in the ctor.
19233 (pass_data_early_object_sizes, pass_early_object_sizes,
19234 make_pass_early_object_sizes): New.
19235 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Use
19236 (cfun->curr_properties & PROP_objsz) instead of cfun->after_inlining.
19238 2021-07-13 Kito Cheng <kito.cheng@sifive.com>
19241 * config/riscv/constraints.md ("S"): Update description and remove
19243 * doc/md.texi (Machine Constraints): Document the 'S' constraints
19246 2021-07-13 Richard Biener <rguenther@suse.de>
19249 2021-07-12 Richard Biener <rguenther@suse.de>
19251 * tree-vect-slp.c (vect_slp_region): Show the number of
19252 SLP graph entries in the optimization message.
19254 2021-07-13 Michael Meissner <meissner@linux.ibm.com>
19256 * config/rs6000/altivec.md (xxspltiw_v4sf): Change local variable
19258 * config/rs6000/rs6000-protos.h (rs6000_const_f32_to_i32): Change
19259 return type to long.
19260 * config/rs6000/rs6000.c (rs6000_const_f32_to_i32): Change return
19263 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
19265 * gimple-range-fold.cc (fold_using_range::range_of_builtin_ubsan_call):
19266 Query relation between the 2 operands and use it.
19268 2021-07-12 Sergei Trofimovich <siarheit@google.com>
19270 * doc/cfg.texi: Fix s/ei_safe_safe/ei_safe_edge/ typo.
19272 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
19275 * config/i386/predicates.md (vec_setm_sse41_operand):
19276 Rename from vec_setm_operand.
19277 (vec_setm_avx2_operand): New predicate.
19278 * config/i386/sse.md (vec_set<V_128:mode>): Use V_128 mode iterator.
19279 Use vec_setm_sse41_operand as operand 2 predicate.
19280 (vec_set<V_256_512:mode): New expander.
19281 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_sse41_operand
19282 as operand 2 predicate.
19284 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
19286 PR tree-optimization/101335
19287 * range-op.cc (operator_cast::lhs_op1_relation): Delete.
19289 2021-07-12 Andrew Pinski <apinski@marvell.com>
19291 * tree-ssa-phiopt.c (match_simplify_replacement): Move
19292 insert of the sequence before the movement of the
19293 statement. Check if to see if the statement is used
19294 outside of the original phi to see if we should move it.
19296 2021-07-12 Richard Biener <rguenther@suse.de>
19298 * dump-context.h (debug_dump_context::debug_dump_context):
19299 Add FILE * parameter defaulted to stderr.
19300 * dumpfile.c (debug_dump_context::debug_dump_context): Adjust.
19301 * tree-vect-slp.c (dot_slp_tree): New functions.
19303 2021-07-12 Richard Biener <rguenther@suse.de>
19305 PR tree-optimization/101373
19306 * tree-ssa-pre.c (prune_clobbered_mems): Also prune trapping
19307 references when the BB may not return.
19308 (compute_avail): Pass in the function we're working on and
19309 replace cfun references with it. Externally throwing
19310 const calls also possibly terminate the function.
19311 (pass_pre::execute): Pass down the function we're working on.
19312 * gcse.c (compute_hash_table_work): Externally throwing
19313 const/pure calls also need record_last_mem_set_info.
19314 * postreload-gcse.c (record_opr_changes): Looping or externally
19315 throwing const/pure calls also need record_last_mem_set_info.
19317 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
19319 * recog.c (memory_address_addr_space_p): Change the type to bool.
19320 Return true/false instead of 1/0.
19321 (offsettable_memref_p): Ditto.
19322 (offsettable_nonstrict_memref_p): Ditto.
19323 (offsettable_address_addr_space_p): Ditto.
19324 Change the type of addressp indirect function to bool.
19325 * recog.h (memory_address_addr_space_p): Change the type to bool.
19326 (strict_memory_address_addr_space_p): Ditto.
19327 (offsettable_memref_p): Ditto.
19328 (offsettable_nonstrict_memref_p): Ditto.
19329 (offsettable_address_addr_space_p): Ditto.
19330 * reload.c (maybe_memory_address_addr_space_p): Ditto.
19331 (strict_memory_address_addr_space_p): Change the type to bool.
19332 Return true/false instead of 1/0.
19333 (maybe_memory_address_addr_space_p): Change the type to bool.
19335 2021-07-12 Richard Biener <rguenther@suse.de>
19337 * tree-vect-slp.c (vect_slp_region): Show the number of
19338 SLP graph entries in the optimization message.
19340 2021-07-12 Richard Biener <rguenther@suse.de>
19342 PR tree-optimization/101394
19343 * tree-ssa-pre.c (do_pre_regular_insertion): Avoid inserting
19344 copies from abnormals for a full redundancy.
19346 2021-07-12 Richard Biener <rguenther@suse.de>
19348 PR middle-end/101423
19349 * gimple.c (gimple_could_trap_p_1): Internal function calls
19351 * tree-eh.c (tree_could_trap_p): Likewise.
19353 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
19356 * config/arm/arm_neon.h (vmul_n_u32): Replace call to builtin with
19358 (vmulq_n_u32): Likewise.
19359 (vmul_n_f32): Gate __a * __b on __FAST_MATH__.
19360 (vmulq_n_f32): Likewise.
19361 (vmul_n_f16): Likewise.
19362 (vmulq_n_f16): Likewise.
19364 2021-07-12 Martin Liska <mliska@suse.cz>
19366 PR sanitizer/101425
19367 * gcc.c (check_offload_target_name): Call
19368 candidates_list_and_hint only if we have a candidate.
19370 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
19373 * config/arm/neon.md (vec_init): Move to ...
19374 * config/arm/vec-common.md (vec_init): ... here.
19375 Change the pattern's mode to VDQX and gate it on VALID_MVE_MODE.
19377 2021-07-12 Roger Sayle <roger@nextmovesoftware.com>
19379 PR tree-optimization/101403
19380 * match.pd ((T)bswap(X)>>C): Correctly handle cases where
19381 signedness of the shift is not the same as the signedness of
19382 the type extension.
19384 2021-07-09 Roger Sayle <roger@nextmovesoftware.com>
19385 Uroš Bizjak <ubizjak@gmail.com>
19387 * config/i386/i386.md (*divmodsi4_const): Optimize SImode
19388 divmod of a constant numerator with new define_insn_and_split.
19390 2021-07-09 Iain Sandoe <iain@sandoe.co.uk>
19393 * config/i386/i386-expand.c (ix86_expand_call): If a call is
19394 to a non-local-binding, or local but to a public symbol, then
19395 assume that it might be indirected via the lazy symbol binder.
19396 Mark R10 and R10 as clobbered in that case.
19398 2021-07-09 Eric Botcazou <ebotcazou@adacore.com>
19401 * gcc.c (ASM_DEBUG_DWARF_OPTION): Set again to --gdwarf2 in
19402 the case where HAVE_AS_WORKING_DWARF_N_FLAG is not defined
19403 and HAVE_LD_BROKEN_PE_DWARF5 is defined.
19405 2021-07-09 Uroš Bizjak <ubizjak@gmail.com>
19407 * config/i386/i386.md (*udivmodsi4_pow2_zext_1): Limit the
19408 log2 range of operands[3] to [1,31].
19409 (*udivmodsi4_pow2_zext_2): Ditto. Correct insn RTX pattern.
19411 2021-07-09 Sergei Trofimovich <siarheit@google.com>
19413 * doc/md.texi: Don't split @smallexample in multiple @groups.
19415 2021-07-09 Sergei Trofimovich <siarheit@google.com>
19417 * doc/md.texi: Add missing 'see' word.
19419 2021-07-09 Andrew Pinski <apinski@marvell.com>
19421 * tree-ssa-phiopt.c (phiopt_early_allow): Change arguments
19422 to take sequence and gimple_match_op. Accept the case where
19423 op is a SSA_NAME and one statement in the sequence.
19424 Also allow constants.
19425 (gimple_simplify_phiopt): Always pass a sequence to resimplify.
19426 Update call to phiopt_early_allow. Discard the sequence if not
19429 2021-07-09 Xi Ruoyao <xry111@mengyan1223.wang>
19434 * config/mips/mips.c (mips_const_insns): Use MSA_SUPPORTED_MODE_P
19435 instead of ISA_HAS_MSA.
19436 (mips_expand_vec_unpack): Likewise.
19437 (mips_expand_vector_init): Likewise.
19439 2021-07-09 Kewen Lin <linkw@linux.ibm.com>
19441 * config/rs6000/vsx.md (mods_<mode>): Rename to...
19442 (mod<mode>3): ... this.
19443 (modu_<mode>): Rename to...
19444 (umod<mode>3): ... this.
19445 * config/rs6000/rs6000-builtin.def (MODS_V2DI, MODS_V4SI, MODU_V2DI,
19446 MODU_V4SI): Adjust.
19448 2021-07-08 Jeff Law <jeffreyalaw@gmail.com>
19450 * config/h8300/shiftrotate.md (variable shifts): Expose condition
19451 code handling for the test before the loop.
19453 2021-07-08 Martin Jambor <mjambor@suse.cz>
19456 * ipa-sra.c (class isra_call_summary): New member
19457 m_before_any_store, initialize it in the constructor.
19458 (isra_call_summary::dump): Dump the new field.
19459 (ipa_sra_call_summaries::duplicate): Copy it.
19460 (process_scan_results): Set it.
19461 (isra_write_edge_summary): Stream it.
19462 (isra_read_edge_summary): Likewise.
19463 (param_splitting_across_edge): Only override
19464 safe_to_import_accesses if m_before_any_store is set.
19466 2021-07-08 Martin Sebor <msebor@redhat.com>
19468 PR bootstrap/101374
19469 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref):
19470 Use Object Size Type 0 instead of 1.
19472 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
19474 * tree-vect-loop.c (vectorizable_reduction): Remove always-true
19477 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
19479 * match.pd: Simplify an extend-operate-truncate sequence involving
19482 2021-07-08 Roger Sayle <roger@nextmovesoftware.com>
19483 Richard Biener <rguenther@suse.de>
19485 PR tree-optimization/40210
19486 * match.pd (bswap optimizations): Simplify (bswap(x)>>C1)&C2 as
19487 (x>>C3)&C2 when possible. Simplify bswap(x)>>C1 as ((T)x)>>C2
19488 when possible. Simplify bswap(x)&C1 as (x>>C2)&C1 when 0<=C1<=255.
19490 2021-07-08 Uroš Bizjak <ubizjak@gmail.com>
19493 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
19495 * config/i386/mmx.md (V_32): New mode iterator.
19496 (mov<V_32:mode>): Use V_32 mode iterator.
19497 (*mov<V_32:mode>_internal): Ditto.
19498 (*push<V_32:mode>2_rex64): Ditto.
19499 (*push<V_32:mode>2): Ditto.
19500 (movmisalign<V_32:mode>): Ditto.
19501 (mmx_<any_shiftrt:insn>v1si3): New insn pattern.
19502 (sse4_1_<any_extend:code>v2qiv2hi2): Ditto.
19503 (vec_unpacks_lo_v4qi): New expander.
19504 (vec_unpacks_hi_v4qi): Ditto.
19505 (vec_unpacku_lo_v4qi): Ditto.
19506 (vec_unpacku_hi_v4qi): Ditto.
19507 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add V1SImode.
19508 (VALID_INT_MODE_P): Ditto.
19510 2021-07-08 Michael Meissner <meissner@linux.ibm.com>
19513 * config/rs6000/rs6000.md (udivti3): New insn.
19514 (divti3): New insn.
19515 (umodti3): New insn.
19516 (modti3): New insn.
19518 2021-07-07 Martin Sebor <msebor@redhat.com>
19520 PR tree-optimization/100137
19521 PR tree-optimization/99121
19522 PR tree-optimization/97027
19523 * builtins.c (access_ref::access_ref): Also set offmax.
19524 (access_ref::offset_in_range): Define new function.
19525 (access_ref::add_offset): Set offmax.
19526 (access_ref::inform_access): Handle access_none.
19527 (handle_mem_ref): Clear ostype.
19528 (compute_objsize_r): Handle ASSERT_EXPR.
19529 * builtins.h (struct access_ref): Add offmax member.
19530 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Use
19531 compute_objsize() and simplify.
19533 2021-07-07 Peter Bergner <bergner@linux.ibm.com>
19535 * config/rs6000/rs6000-call.c (mma_init_builtins): Use VSX_BUILTIN_LXVP
19536 and VSX_BUILTIN_STXVP.
19538 2021-07-07 Martin Sebor <msebor@redhat.com>
19541 * config/aarch64/aarch64.c (aarch64_simd_lane_bounds): Remove
19542 a stray %K from error_at() missed in r12-2088.
19544 2021-07-07 Richard Biener <rguenther@suse.de>
19546 PR tree-optimization/99728
19547 * tree-ssa-loop-im.c (gather_mem_refs_stmt): Record
19549 (mem_refs_may_alias_p): Add assert we handled aggregate
19551 (sm_seq_valid_bb): Give up when running into aggregate copies.
19552 (ref_indep_loop_p): Handle aggregate copies as never
19553 being invariant themselves but allow other refs to be
19554 disambiguated against them.
19555 (can_sm_ref_p): Do not try to apply store-motion to aggregate
19558 2021-07-06 Indu Bhagat <indu.bhagat@oracle.com>
19561 * dwarf2ctf.c (ctf_get_AT_data_member_location): Multiply by 8 to get
19564 2021-07-06 Martin Sebor <msebor@redhat.com>
19566 * gimple-pretty-print.c (percent_G_format): Remove.
19567 * tree-diagnostic.c (default_tree_printer): Remove calls.
19568 * tree-pretty-print.c (percent_K_format): Remove.
19569 * tree-pretty-print.h (percent_K_format): Remove.
19571 2021-07-06 Martin Sebor <msebor@redhat.com>
19573 * config/aarch64/aarch64-builtins.c (aarch64_simd_expand_builtin):
19574 Remove %K and use error_at.
19575 (aarch64_expand_fcmla_builtin): Same.
19576 (aarch64_expand_builtin_tme): Same.
19577 (aarch64_expand_builtin_memtag): Same.
19578 * config/arm/arm-builtins.c (arm_expand_acle_builtin): Same.
19579 (arm_expand_builtin): Same.
19580 * config/arm/arm.c (bounds_check): Same.
19582 2021-07-06 Martin Sebor <msebor@redhat.com>
19584 * builtins.c (warn_string_no_nul): Remove %G.
19585 (maybe_warn_for_bound): Same.
19586 (warn_for_access): Same.
19587 (check_access): Same.
19588 (check_strncat_sizes): Same.
19589 (expand_builtin_strncat): Same.
19590 (expand_builtin_strncmp): Same.
19591 (expand_builtin): Same.
19592 (expand_builtin_object_size): Same.
19593 (warn_dealloc_offset): Same.
19594 (maybe_emit_free_warning): Same.
19595 * calls.c (maybe_warn_alloc_args_overflow): Same.
19596 (maybe_warn_nonstring_arg): Same.
19597 (maybe_warn_rdwr_sizes): Same.
19598 * expr.c (expand_expr_real_1): Remove %K.
19599 * gimple-fold.c (gimple_fold_builtin_strncpy): Remove %G.
19600 (gimple_fold_builtin_strncat): Same.
19601 * gimple-ssa-sprintf.c (format_directive): Same.
19602 (handle_printf_call): Same.
19603 * gimple-ssa-warn-alloca.c (pass_walloca::execute): Same.
19604 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
19605 (maybe_diag_access_bounds): Same. Call gimple_location.
19606 (check_bounds_or_overlap): Same.
19607 * trans-mem.c (ipa_tm_scan_irr_block): Remove %K. Simplify.
19608 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Remove %G.
19609 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
19610 (maybe_diag_stxncpy_trunc): Same.
19611 (handle_builtin_stxncpy_strncat): Same.
19612 (maybe_warn_pointless_strcmp): Same.
19613 * tree-ssa-uninit.c (maybe_warn_operand): Same.
19615 2021-07-06 Uroš Bizjak <ubizjak@gmail.com>
19618 * config/i386/predicates.md (vec_setm_operand): Enable
19619 register_operand for TARGET_SSE4_1.
19620 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_operand
19621 as operand 2 predicate. Call ix86_expand_vector_set_var
19622 for non-constant index operand.
19623 (vec_setv4qi): Use vec_setm_mmx_operand as operand 2 predicate.
19624 Call ix86_expand_vector_set_var for non-constant index operand.
19626 2021-07-06 Jeff Law <jeffreyalaw@gmail.com>
19628 * config/h8300/jumpcall.md (*branch): When possible, generate
19629 the comparison in CCZN mode.
19630 * config/h8300/predicates.md (simple_memory_operand): Reject all
19631 auto-increment addressing modes.
19633 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
19635 PR bootstrap/100246
19636 * config/i386/i386.h (struct stringop_algs): Define a CTOR for
19639 2021-07-06 Richard Biener <rguenther@suse.de>
19641 * doc/md.texi (vec_fmaddsub<mode>4): Document.
19642 (vec_fmsubadd<mode>4): Likewise.
19643 * optabs.def (vec_fmaddsub$a4): Add.
19644 (vec_fmsubadd$a4): Likewise.
19645 * internal-fn.def (IFN_VEC_FMADDSUB): Add.
19646 (IFN_VEC_FMSUBADD): Likewise.
19647 * tree-vect-slp-patterns.c (addsub_pattern::recognize):
19648 Refactor to handle IFN_VEC_FMADDSUB and IFN_VEC_FMSUBADD.
19649 (addsub_pattern::build): Likewise.
19650 * tree-vect-slp.c (vect_optimize_slp): CFN_VEC_FMADDSUB
19651 and CFN_VEC_FMSUBADD are not transparent for permutes.
19652 * config/i386/sse.md (vec_fmaddsub<mode>4): New expander.
19653 (vec_fmsubadd<mode>4): Likewise.
19655 2021-07-06 Richard Biener <rguenther@suse.de>
19657 * doc/invoke.texi (fmove-loop-stores): Document.
19658 * common.opt (fmove-loop-stores): New option.
19659 * opts.c (default_options_table): Enable -fmove-loop-stores
19660 at -O1 but not -Og.
19661 * tree-ssa-loop-im.c (pass_lim::execute): Pass
19662 flag_move_loop_stores instead of true to
19663 loop_invariant_motion_in_fun.
19665 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
19667 * doc/install.texi: Document --with-dsymutil.
19669 2021-07-06 Andrew Pinski <apinski@marvell.com>
19671 PR tree-optimization/101256
19672 * dbgcnt.def (phiopt_edge_range): New counter.
19673 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
19674 Check to make sure the new name is defined in the same
19675 bb as the conditional before duplicating range info.
19676 Also add debug counter.
19678 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
19680 PR rtl-optimization/100328
19681 * config/i386/i386-options.c (ix86_option_override_internal):
19682 Set param_ira_consider_dup_in_all_alts to 0.
19684 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
19686 PR rtl-optimization/100328
19687 * doc/invoke.texi (ira-consider-dup-in-all-alts): Document new
19689 * ira.c (ira_get_dup_out_num): Adjust as parameter
19690 param_ira_consider_dup_in_all_alts.
19691 * params.opt (ira-consider-dup-in-all-alts): New.
19692 * ira-conflicts.c (process_regs_for_copy): Add one parameter
19693 single_input_op_has_cstr_p.
19694 (get_freq_for_shuffle_copy): New function.
19695 (add_insn_allocno_copies): Adjust as single_input_op_has_cstr_p.
19696 * ira-int.h (ira_get_dup_out_num): Add one bool parameter.
19698 2021-07-05 Jeff Law <jeffreyalaw@gmail.com>
19700 * config/h8300/shiftrotate.md (shift-by-variable patterns): Update to
19701 generate condition code aware RTL directly.
19703 2021-07-05 Andrew Pinski <apinski@marvell.com>
19705 PR tree-optimization/101039
19706 * match.pd (A CMP 0 ? A : -A): New patterns.
19707 * tree-ssa-phiopt.c (abs_replacement): Delete function.
19708 (tree_ssa_phiopt_worker): Don't call abs_replacement.
19709 Update comment about abs_replacement.
19711 2021-07-05 Andrew Pinski <apinski@marvell.com>
19713 * tree-ssa-phiopt.c (gimple_simplify_phiopt):
19714 If "A ? B : C" fails to simplify, try "(!A) ? C : B".
19716 2021-07-05 Andrew Pinski <apinski@marvell.com>
19718 * tree-ssa-phiopt.c (match_simplify_replacement):
19719 Add early_p argument. Call gimple_simplify_phiopt
19720 instead of gimple_simplify.
19721 (tree_ssa_phiopt_worker): Update call to
19722 match_simplify_replacement and allow unconditionally.
19723 (phiopt_early_allow): New function.
19724 (gimple_simplify_phiopt): New function.
19726 2021-07-05 Andrew Pinski <apinski@marvell.com>
19728 PR middle-end/101237
19729 * fold-const.c (negate_expr_p): Remove call to element_mode
19730 and TREE_MODE/TREE_TYPE when calling HONOR_SIGNED_ZEROS,
19731 HONOR_SIGN_DEPENDENT_ROUNDING, and HONOR_SNANS.
19732 (fold_negate_expr_1): Likewise.
19733 (const_unop): Likewise.
19734 (fold_cond_expr_with_comparison): Likewise.
19735 (fold_binary_loc): Likewise.
19736 (fold_ternary_loc): Likewise.
19737 (tree_call_nonnegative_warnv_p): Likewise.
19738 * match.pd (-(A + B) -> (-B) - A): Likewise.
19740 2021-07-05 Iain Sandoe <iain@sandoe.co.uk>
19742 * configure.ac: Handle --with-dsymutil in the same way as we
19743 do for the assembler and linker. (DEFAULT_DSYMUTIL): New.
19744 Extract the type and version for the dsymutil configured or
19745 found by the default searches.
19746 * config.in: Regenerated.
19747 * configure: Regenerated.
19748 * collect2.c (do_dsymutil): Handle locating dsymutil in the
19749 same way as for the assembler and linker.
19750 * config/darwin.h (DSYMUTIL): Delete.
19751 * gcc.c: Report a configured dsymutil correctly.
19752 * exec-tool.in: Allow for dsymutil.
19754 2021-07-05 Uroš Bizjak <ubizjak@gmail.com>
19756 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
19757 Handle V4QI and V2HI modes.
19758 (expand_vec_perm_blend): Allow 4-byte vector modes with TARGET_SSE4_1.
19759 Handle V4QI mode. Emit mmx_pblendvb32 for 4-byte modes.
19760 (expand_vec_perm_pshufb): Rewrite to use switch statemets.
19761 Handle 4-byte dual operands with TARGET_XOP and single operands
19762 with TARGET_SSSE3. Emit mmx_ppermv32 for TARGET_XOP and
19763 mmx_pshufbv4qi3 for TARGET_SSSE3.
19764 (expand_vec_perm_pblendv): Allow 4-byte vector modes with TARGET_SSE4_1.
19765 (expand_vec_perm_interleave2): Allow 4-byte vector modes.
19766 (expand_vec_perm_pshufb2): Allow 4-byte vector modes with TARGET_SSSE3.
19767 (expand_vec_perm_even_odd_1): Handle V4QI mode.
19768 (expand_vec_perm_broadcast_1): Handle V4QI mode.
19769 (ix86_vectorize_vec_perm_const): Handle V4QI mode.
19770 * config/i386/mmx.md (mmx_ppermv32): New insn pattern.
19771 (mmx_pshufbv4qi3): Ditto.
19772 (*mmx_pblendw32): Ditto.
19773 (*mmx_pblendw64): Rename from *mmx_pblendw.
19774 (mmx_punpckhbw_low): New insn_and_split pattern.
19775 (mmx_punpcklbw_low): Ditto.
19777 2021-07-05 Richard Biener <rguenther@suse.de>
19779 * tree-vect-loop-manip.c (vect_loop_versioning): Do not
19780 set LOOP_C_INFINITE on the vectorized loop.
19782 2021-07-05 Richard Biener <rguenther@suse.de>
19784 PR middle-end/101291
19785 * cfgloopmanip.c (loop_version): Set the loop copy of the
19786 versioned loop to the new loop.
19788 2021-07-04 Iain Sandoe <iain@sandoe.co.uk>
19791 * config.gcc: Ensure that Darwin biarch definitions are
19792 added before i386.h.
19793 * config/i386/darwin.h (TARGET_64BIT): Remove.
19794 (PR80556_WORKAROUND): New.
19795 (REAL_LIBGCC_SPEC): Amend to use PR80556_WORKAROUND.
19796 (DARWIN_SUBARCH_SPEC): New.
19797 * config/i386/darwin32-biarch.h (TARGET_64BIT_DEFAULT,
19798 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
19799 (REAL_LIBGCC_SPEC): Remove.
19800 * config/i386/darwin64-biarch.h (TARGET_64BIT_DEFAULT,
19801 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
19802 (REAL_LIBGCC_SPEC): Remove.
19804 2021-07-03 H.J. Lu <hjl.tools@gmail.com>
19806 PR middle-end/101294
19807 * expr.c (store_constructor): Don't use vec_duplicate on vector.
19809 2021-07-02 Martin Sebor <msebor@redhat.com>
19811 PR middle-end/98871
19812 PR middle-end/98512
19813 * diagnostic.c (get_any_inlining_info): New.
19814 (update_effective_level_from_pragmas): Handle inlining context.
19815 (diagnostic_enabled): Same.
19816 (diagnostic_report_diagnostic): Same.
19817 * diagnostic.h (struct diagnostic_info): Add ctor.
19818 (struct diagnostic_context): Add new member.
19819 * tree-diagnostic.c (set_inlining_locations): New.
19820 (tree_diagnostics_defaults): Set new callback pointer.
19822 2021-07-02 Peter Bergner <bergner@linux.ibm.com>
19824 * config/rs6000/rs6000-builtin.def (BU_MMA_PAIR_LD, BU_MMA_PAIR_ST):
19826 (__builtin_vsx_lxvp, __builtin_vsx_stxvp): New built-ins.
19827 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Expand
19828 lxvp and stxvp built-ins.
19829 (mma_init_builtins): Handle lxvp and stxvp built-ins.
19830 (builtin_function_type): Likewise.
19831 * doc/extend.texi (__builtin_vsx_lxvp, __builtin_mma_stxvp): Document.
19833 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19835 * config/h8300/h8300-protos.h (compute_a_shift_cc): Accept
19836 additional argument for the code.
19837 * config/h8300/h8300.c (compute_a_shift_cc): Accept additional
19838 argument for the code. Just return if the ZN bits are useful or
19839 not rather than the old style CC_* enums.
19840 * config/h8300/shiftrotate.md (shiftqi_noscratch): Move before
19841 more generic shiftqi patterns.
19842 (shifthi_noscratch, shiftsi_noscratch): Similarly.
19843 (shiftqi_noscratch_set_flags): New pattern.
19844 (shifthi_noscratch_set_flags, shiftsi_noscratch_set_flags): Likewise.
19846 2021-07-02 Andrew MacLeod <amacleod@redhat.com>
19848 PR tree-optimization/101223
19849 * range-op.cc (build_lt): Add -1 for signed values.
19850 (built_gt): Subtract -1 for signed values.
19852 2021-07-02 David Faust <david.faust@oracle.com>
19854 * btfout.c (get_btf_kind): Support BTF_KIND_FLOAT.
19855 (btf_asm_type): Likewise.
19857 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19859 * config/h8300/h8300-protos.h (output_a_shift): Make first argument
19860 an array of rtx rather than a pointer to rtx. Add code argument.
19861 (compute_a_shift_length): Similarly.
19862 * config/h8300/h8300.c (h8300_shift_costs): Adjust now that the
19863 shift itself isn't an operand. Create dummy operand[0] to carry
19864 a mode and pass a suitable rtx code to compute_a_shift_length.
19865 (get_shift_alg): Adjust operand number of clobber in output templates.
19866 (output_a_shift): Make first argument an array of rtx rather than
19867 a pointer to rtx. Add code argument for the type of shift.
19868 Adjust now that the shift itself is no longer an operand.
19869 (compute_a_shift_length): Similarly.
19870 * config/h8300/shiftrotate.md (shiftqi, shifthi, shiftsi): Use an
19871 iterator rather than nshift_operator.
19872 (shiftqi_noscratch, shifthi_noscratch, shiftsi_noscratch): Likewise.
19873 (shiftqi_clobber_flags): Adjust to API changes in output_a_shift
19874 and compute_a_shift_length.
19875 (shiftqi_noscratch_clobber_flags): Likewise.
19876 (shifthi_noscratch_clobber_flags): Likewise.
19877 (shiftsi_noscratch_clobber_flags): Likewise.
19879 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
19882 * config/darwin.h (DSYMUTIL_SPEC): Do not try to run
19883 dsymutil for BTF/CTF.
19885 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
19888 * config/darwin.h (CTF_INFO_SECTION_NAME): Update the
19889 segment to include BTF.
19890 (BTF_INFO_SECTION_NAME): New.
19892 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19894 * config/m32r/m32r-protos.h (call_operand): Adjust return type.
19895 (small_data_operand, memreg_operand, small_insn_p): Likewise.
19896 * config/m32r/m32r.c (call_operand): Adjust return type.
19897 (small_data_operand, memreg_operand): Likewise.
19899 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19901 * config/frv/frv-protos.h (integer_register_operand): Adjust return
19903 (frv_load_operand, gpr_or_fpr_operand, gpr_no_subreg_operand): Likewise.
19904 (fpr_or_int6_operand, gpr_or_int_operand); Likewise.
19905 (gpr_or_int12_operand, gpr_or_int10_operand); Likewise.
19906 (move_source_operand, move_destination_operand): Likewise.
19907 (condexec_source_operand, condexec_dest_operand): Likewise.
19908 (lr_operand, gpr_or_memory_operand, fpr_or_memory_operand): Likewise.
19909 (reg_or_0_operand, fcc_operand, icc_operand, cc_operand): Likewise.
19910 (fcr_operand, icr_operand, cr_operand, call_operand): Likewise.
19911 (fpr_operand, even_reg_operand, odd_reg_operand): Likewise.
19912 (even_gpr_operand, odd_gpr_operand, quad_fpr_operand): Likewise.
19913 (even_fpr_operand, odd_fpr_operand): Likewise.
19914 (dbl_memory_one_insn_operand, dbl_memory_two_insn_operand): Likewise.
19915 (int12_operand, int6_operand, int5_operand, uint5_operand): Likewise.
19916 (uint4_operand, uint1_operand, int_2word_operand): Likewise
19917 (upper_int16_operand, uint16_operand, symbolic_operand): Likewise.
19918 (relational_operator, float_relational_operator): Likewise.
19919 (ccr_eqne_operator, minmax_operator): Likewise.
19920 (condexec_si_binary_operator, condexec_si_media_operator): Likewise.
19921 (condexec_si_divide_operator, condexec_si_unary_operator): Likewise.
19922 (condexec_sf_conv_operator, condexec_sf_add_operator): Likewise.
19923 (intop_compare_operator, acc_operand, even_acc_operand): Likewise.
19924 (quad_acc_operand, accg_operand): Likewise.
19926 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19928 * config/stormy16/stormy16-protos.h (xstormy16_below_100_symbol): Change
19929 return type to a bool.
19930 (nonimmediate_nonstack_operand): Likewise.
19931 (xstormy16_splittable_below100_operand): Likewise.
19932 * config/stormy16/stormy16.c (xstormy16_below_100_symbol): Fix
19934 (xstormy16_splittable_below100_operand): Likewise.
19936 2021-07-02 Richard Biener <rguenther@suse.de>
19938 PR tree-optimization/101293
19939 * tree-ssa-loop-im.c (mem_ref_hasher::equal): Compare MEM_REF bases
19940 with combined offsets.
19941 (gather_mem_refs_stmt): Hash MEM_REFs as if their offset were
19942 combined with the rest of the offset.
19944 2021-07-02 Eric Botcazou <ebotcazou@adacore.com>
19946 * config/i386/i386.c (asm_preferred_eh_data_format): Always use the
19947 PIC encodings for PE-COFF targets.
19949 2021-07-02 Jakub Jelinek <jakub@redhat.com>
19952 * config/i386/i386-expand.c (ix86_broadcast_from_integer_constant):
19953 Return nullptr for TImode inner mode.
19955 2021-07-02 Richard Biener <rguenther@suse.de>
19957 PR tree-optimization/101280
19958 PR tree-optimization/101173
19959 * gimple-loop-interchange.cc
19960 (tree_loop_interchange::valid_data_dependences): Properly
19961 guard all dependence checks with DDR_REVERSED_P or its
19964 2021-07-02 Hongyu Wang <hongyu.wang@intel.com>
19966 * config/i386/i386-expand.c (ix86_expand_builtin):
19967 Add branch to clear odata when ZF is set for asedecenc_expand
19968 and wideaesdecenc_expand.
19970 2021-07-02 Eugene Rozenfeld <erozen@microsoft.com>
19972 * config/i386/gcc-auto-profile: regenerate
19974 2021-07-02 liuhongt <hongtao.liu@intel.com>
19976 * config/i386/sse.md (trunc<mode><pmov_dst_4>2): Refined to ..
19977 (trunc<mode><pmov_dst_4_lower>2): this.
19979 2021-07-01 David Malcolm <dmalcolm@redhat.com>
19981 * diagnostic.h (diagnostic_context::m_file_cache): New field.
19982 * input.c (class fcache): Rename to...
19983 (class file_cache_slot): ...this, making most members private and
19984 prefixing fields with "m_".
19985 (file_cache_slot::get_file_path): New accessor.
19986 (file_cache_slot::get_use_count): New accessor.
19987 (file_cache_slot::missing_trailing_newline_p): New accessor.
19988 (file_cache_slot::inc_use_count): New.
19989 (fcache_buffer_size): Move to...
19990 (file_cache_slot::buffer_size): ...here.
19991 (fcache_line_record_size): Move to...
19992 (file_cache_slot::line_record_size): ...here.
19993 (fcache_tab): Delete, in favor of global_dc->m_file_cache.
19994 (fcache_tab_size): Move to file_cache::num_file_slots.
19995 (diagnostic_file_cache_init): Update for move of fcache_tab
19996 to global_dc->m_file_cache.
19997 (diagnostic_file_cache_fini): Likewise.
19998 (lookup_file_in_cache_tab): Convert to...
19999 (file_cache::lookup_file): ...this.
20000 (diagnostics_file_cache_forcibly_evict_file): Update for move of
20001 fcache_tab to global_dc->m_file_cache, moving most of
20002 implementation to...
20003 (file_cache::forcibly_evict_file): ...this new function and...
20004 (file_cache_slot::evict): ...this new function.
20005 (evicted_cache_tab_entry): Convert to...
20006 (file_cache::evicted_cache_tab_entry): ...this.
20007 (add_file_to_cache_tab): Convert to...
20008 (file_cache::add_file): ...this, moving bulk of implementation
20010 (file_cache_slot::create): ..this new function.
20011 (file_cache::file_cache): New.
20012 (file_cache::~file_cache): New.
20013 (lookup_or_add_file_to_cache_tab): Convert to...
20014 (file_cache::lookup_or_add_file): ..this new function.
20015 (fcache::fcache): Rename to...
20016 (file_cache_slot::file_cache_slot): ...this, adding "m_" prefixes
20018 (fcache::~fcache): Rename to...
20019 (file_cache_slot::~file_cache_slot): ...this, adding "m_" prefixes
20021 (needs_read): Convert to...
20022 (file_cache_slot::needs_read_p): ...this.
20023 (needs_grow): Convert to...
20024 (file_cache_slot::needs_grow_p): ...this.
20025 (maybe_grow): Convert to...
20026 (file_cache_slot::maybe_grow): ...this.
20027 (read_data): Convert to...
20028 (file_cache_slot::read_data): ...this.
20029 (maybe_read_data): Convert to...
20030 (file_cache_slot::maybe_read_data): ...this.
20031 (get_next_line): Convert to...
20032 (file_cache_slot::get_next_line): ...this.
20033 (goto_next_line): Convert to...
20034 (file_cache_slot::goto_next_line): ...this.
20035 (read_line_num): Convert to...
20036 (file_cache_slot::read_line_num): ...this.
20037 (location_get_source_line): Update for moving of globals to
20038 global_dc->m_file_cache.
20039 (location_missing_trailing_newline): Likewise.
20040 * input.h (class file_cache_slot): New forward decl.
20041 (class file_cache): New.
20043 2021-07-01 Michael Meissner <meissner@linux.ibm.com>
20045 * config/rs6000/rs6000.c (rs6000_maybe_emit_fp_cmove): Add IEEE
20046 128-bit floating point conditional move support.
20047 (have_compare_and_set_mask): Add IEEE 128-bit floating point
20049 * config/rs6000/rs6000.md (mov<mode>cc, IEEE128 iterator): New insn.
20050 (mov<mode>cc_p10, IEEE128 iterator): New insn.
20051 (mov<mode>cc_invert_p10, IEEE128 iterator): New insn.
20052 (fpmask<mode>, IEEE128 iterator): New insn.
20053 (xxsel<mode>, IEEE128 iterator): New insn.
20055 2021-07-01 Iain Sandoe <iain@sandoe.co.uk>
20058 * config/darwin.h (CTF_INFO_SECTION_NAME): New.
20060 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
20062 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
20064 * config/i386/i386-protos.h (ix86_expand_vector_init_duplicate):
20066 * config/i386/sse.md (INT_BROADCAST_MODE): New mode iterator.
20067 (vec_duplicate<mode>): New expander.
20069 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
20072 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
20074 (ix86_byte_broadcast): New function.
20075 (ix86_convert_const_wide_int_to_broadcast): Likewise.
20076 (ix86_expand_move): Convert CONST_WIDE_INT to broadcast if mode
20077 size is 16 bytes or bigger.
20078 (ix86_broadcast_from_integer_constant): New function.
20079 (ix86_expand_vector_move): Convert CONST_WIDE_INT and CONST_VECTOR
20080 to broadcast if mode size is 16 bytes or bigger.
20081 * config/i386/i386-protos.h (ix86_gen_scratch_sse_rtx): New
20083 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): New function.
20085 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
20087 * config/i386/predicates.md (ix86_endbr_immediate_operand):
20088 Return true/false instead of 1/0.
20089 (movq_parallel): Ditto.
20091 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
20093 * recog.c (general_operand): Return true/false instead of 1/0.
20094 (register_operand): Ditto.
20095 (immediate_operand): Ditto.
20096 (const_int_operand): Ditto.
20097 (const_scalar_int_operand): Ditto.
20098 (const_double_operand): Ditto.
20099 (push_operand): Ditto.
20100 (pop_operand): Ditto.
20101 (memory_operand): Ditto.
20102 (indirect_operand): Ditto.
20104 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
20106 * genpreds.c (write_predicate_subfunction):
20107 Change the type of written subfunction to bool.
20108 (write_one_predicate_function):
20109 Change the type of written function to bool.
20110 (write_tm_preds_h): Ditto.
20111 * recog.h (*insn_operand_predicate_fn): Change the type to bool.
20112 * recog.c (general_operand): Change the type to bool.
20113 (address_operand): Ditto.
20114 (register_operand): Ditto.
20115 (pmode_register_operand): Ditto.
20116 (scratch_operand): Ditto.
20117 (immediate_operand): Ditto.
20118 (const_int_operand): Ditto.
20119 (const_scalar_int_operand): Ditto.
20120 (const_double_operand): Ditto.
20121 (nonimmediate_operand): Ditto.
20122 (nonmemory_operand): Ditto.
20123 (push_operand): Ditto.
20124 (pop_operand): Ditto.
20125 (memory_operand): Ditto.
20126 (indirect_operand): Ditto.
20127 (ordered_comparison_operator): Ditto.
20128 (comparison_operator): Ditto.
20129 * config/i386/i386-expand.c (ix86_expand_sse_cmp):
20130 Change the type of indirect predicate function to bool.
20131 * config/rs6000/rs6000.c (easy_vector_constant):
20132 Change the type to bool.
20133 * config/mips/mips-protos.h (m16_based_address_p):
20134 Change the type of operand 3 to bool.
20136 2021-07-01 Richard Biener <rguenther@suse.de>
20138 PR tree-optimization/101280
20139 PR tree-optimization/101173
20140 * gimple-loop-interchange.cc
20141 (tree_loop_interchange::valid_data_dependences): Revert
20142 previous change and instead correctly handle DDR_REVERSED_P
20145 2021-07-01 Richard Biener <rguenther@suse.de>
20147 PR tree-optimization/101278
20148 * tree-ssa-dse.c (dse_classify_store): First check for
20149 uses, then ignore stmt for chaining purposes.
20151 2021-07-01 Richard Biener <rguenther@suse.de>
20153 PR tree-optimization/100778
20154 * tree-vect-slp.c (vect_schedule_slp_node): Do not place trapping
20155 vectorized ops ahead of their scalar BB.
20157 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
20160 * config/i386/i386.md (*nabs<dwi>2_doubleword):
20161 New insn_and_split pattern.
20162 (*nabs<dwi>2_1): Ditto.
20163 * config/i386/i386-features.c
20164 (general_scalar_chain::compute_convert_gain):
20165 Handle (NEG (ABS (...))) RTX. Rewrite src code
20166 scanner as switch statement.
20167 (general_scalar_chain::convert_insn):
20168 Handle (NEG (ABS (...))) RTX.
20169 (general_scalar_to_vector_candidate_p):
20170 Detect (NEG (ABS (...))) RTX. Reorder case statements
20171 for (AND (NOT (...) ...)) fallthrough.
20173 2021-07-01 Richard Biener <rguenther@suse.de>
20175 PR tree-optimization/101178
20176 * tree-vect-slp.c (slpg_vertex::materialize): Remove.
20177 (slpg::perm_in): Add.
20178 (slpg::get_perm_in): Remove.
20179 (slpg::get_perm_materialized): Add.
20180 (vect_optimize_slp): Handle VEC_PERM nodes more optimally
20181 during permute propagation and materialization.
20183 2021-07-01 Jakub Jelinek <jakub@redhat.com>
20186 * dwarf2out.c (loc_list_from_tree_1): Handle COMPOUND_LITERAL_EXPR.
20188 2021-07-01 Jakub Jelinek <jakub@redhat.com>
20190 PR middle-end/94366
20191 * omp-low.c (lower_rec_input_clauses): Rename is_fp_and_or to
20192 is_truth_op, set it for TRUTH_*IF_EXPR regardless of new_var's type,
20193 use boolean_type_node instead of integer_type_node as NE_EXPR type.
20194 (lower_reduction_clauses): Likewise.
20196 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
20198 * config/gcn/gcn.c: Include dwarf2.h.
20199 (gcn_addr_space_debug): New function.
20200 (TARGET_ADDR_SPACE_DEBUG): New hook.
20202 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
20204 * common/config/gcn/gcn-common.c
20205 (gcn_option_optimization_table): Change OPT_fomit_frame_pointer to -O3.
20206 * config/gcn/gcn.c (gcn_expand_prologue): Prefer the frame pointer
20208 (gcn_expand_prologue): Prefer the frame pointer when emitting CFI.
20209 (gcn_frame_pointer_rqd): New function.
20210 (TARGET_FRAME_POINTER_REQUIRED): New hook.
20212 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
20214 * config/gcn/gcn.c (move_callee_saved_registers): Emit CFI notes for
20215 prologue register saves.
20216 (gcn_debug_unwind_info): Use UI_DWARF2.
20217 (gcn_dwarf_register_number): Map DWARF_LINK_REGISTER to DWARF PC.
20218 (gcn_dwarf_register_span): DWARF_LINK_REGISTER doesn't span.
20219 * config/gcn/gcn.h: (DWARF_FRAME_RETURN_COLUMN): New define.
20220 (DWARF_LINK_REGISTER): New define.
20221 (FIRST_PSEUDO_REGISTER): Increment.
20222 (FIXED_REGISTERS): Add entry for DWARF_LINK_REGISTER.
20223 (CALL_USED_REGISTERS): Likewise.
20224 (REGISTER_NAMES): Likewise.
20226 2021-06-30 Richard Biener <rguenther@suse.de>
20228 PR tree-optimization/101267
20229 * tree-vect-stmts.c (vect_check_scalar_mask): Adjust
20230 API and use SLP compatible interface of vect_is_simple_use.
20231 Reject not vectorized SLP defs for callers that do not support
20233 (vect_check_store_rhs): Handle masked stores and pass down
20234 the appropriate operator index.
20235 (vectorizable_call): Adjust.
20236 (vectorizable_store): Likewise.
20237 (vectorizable_load): Likewise. Handle SLP pecularity of
20239 (vect_is_simple_use): Remove special-casing of masked stores.
20241 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
20243 * common.opt (foffload): Remove help as Driver only.
20244 * gcc.c (display_help): Add -foffload.
20246 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
20248 * gcc.c (close_at_file, execute): Replace alloca by XALLOCAVEC.
20249 (check_offload_target_name): Fix splitting OFFLOAD_TARGETS into
20250 a candidate list; better inform no offload target is configured
20251 and fix hint extraction when passed target is not '\0' at [len].
20252 * common.opt (foffload): Add tailing '.'.
20253 (foffload-options): Likewise; fix flag name in the help string.
20255 2021-06-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20258 * config/arm/arm_neon.h: Move vabs intrinsics before vcage_f32.
20259 (vcage_f32): Gate comparison on __FAST_MATH__.
20260 (vcageq_f32): Likewise.
20261 (vcale_f32): Likewise.
20262 (vcaleq_f32): Likewise.
20263 (vcagt_f32): Likewise.
20264 (vcagtq_f32): Likewise.
20265 (vcalt_f32): Likewise.
20266 (vcaltq_f32): Likewise.
20267 (vcage_f16): Likewise.
20268 (vcageq_f16): Likewise.
20269 (vcale_f16): Likewise.
20270 (vcaleq_f16): Likewise.
20271 (vcagt_f16): Likewise.
20272 (vcagtq_f16): Likewise.
20273 (vcalt_f16): Likewise.
20274 (vcaltq_f16): Likewise.
20276 2021-06-30 Richard Biener <rguenther@suse.de>
20278 PR tree-optimization/101264
20279 * tree-vect-slp.c (vect_optimize_slp): Propagate the
20280 computed perm_in to all "any" permute successors
20281 we cannot de-duplicate immediately.
20283 2021-06-30 liuhongt <hongtao.liu@intel.com>
20286 * config/i386/sse.md
20287 (avx512f_sfixupimm<mode><sd_maskz_name><round_saeonly_name>):
20289 (avx512f_sfixupimm<mode><maskz_scalar_name><round_saeonly_name>):
20291 (avx512f_sfixupimm<mode>_mask<round_saeonly_name>"): Refined.
20292 * config/i386/subst.md (maskz_scalar): New define_subst.
20293 (maskz_scalar_name): New subst_attr.
20294 (maskz_scalar_op5): Ditto.
20295 (round_saeonly_maskz_scalar_op5): Ditto.
20296 (round_saeonly_maskz_scalar_operand5): Ditto.
20298 2021-06-30 David Edelsohn <dje.gcc@gmail.com>
20300 * config/rs6000/rs6000.c (rs6000_xcoff_section_type_flags):
20301 Increase code CSECT alignment to at least 32 bytes.
20302 * config/rs6000/xcoff.h (TEXT_SECTION_ASM_OP): Add 32 byte
20303 alignment designation.
20305 2021-06-29 Sergei Trofimovich <siarheit@google.com>
20307 * doc/generic.texi: Fix s/net yet/not yet/ typo.
20309 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
20311 PR tree-optimization/101254
20312 * range-op.cc (operator_minus::op1_op2_relation_effect): Check for
20313 wrapping/non-wrapping when setting the result range.
20315 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
20317 * value-query.cc (gimple_range_global): Allow phis.
20319 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
20321 * vr-values.c (vr_values::vrp_stmt_computes_nonzero): Use stmt.
20322 (simplify_using_ranges::op_with_boolean_value_range_p): Add a
20323 statement for location context.
20324 (check_for_binary_op_overflow): Ditto.
20325 (simplify_using_ranges::get_vr_for_comparison): Ditto.
20326 (simplify_using_ranges::compare_name_with_value): Ditto.
20327 (simplify_using_ranges::compare_names): Ditto.
20328 (vrp_evaluate_conditional_warnv_with_ops_using_ranges): Ditto.
20329 (simplify_using_ranges::simplify_truth_ops_using_ranges): Ditto.
20330 (simplify_using_ranges::simplify_min_or_max_using_ranges): Ditto.
20331 (simplify_using_ranges::simplify_internal_call_using_ranges): Ditto.
20332 (simplify_using_ranges::two_valued_val_range_p): Ditto.
20333 (simplify_using_ranges::simplify): Ditto.
20334 * vr-values.h: Adjust prototypes.
20336 2021-06-29 Uroš Bizjak <ubizjak@gmail.com>
20339 * config/i386/mmx.md (vec_addsubv2sf3): New insn pattern.
20341 2021-06-29 Julian Brown <julian@codesourcery.com>
20343 * config/gcn/gcn.c (gcn_init_libfuncs): New function.
20344 (TARGET_INIT_LIBFUNCS): Define target hook using above function.
20345 * config/gcn/gcn.h (UNITS_PER_WORD): Define to 8 for IN_LIBGCC2, 4
20347 (LIBGCC2_UNITS_PER_WORD, BITS_PER_WORD): Remove definitions.
20348 (MAX_FIXED_MODE_SIZE): Change to 128.
20350 2021-06-29 Julian Brown <julian@codesourcery.com>
20352 * config/gcn/gcn.md (UNSPEC_FLBIT_INT): New unspec constant.
20353 (s_mnemonic): Add clrsb.
20354 (gcn_flbit<mode>_int): Add insn pattern for SImode/DImode.
20355 (clrsb<mode>2): Add expander for SImode/DImode.
20357 2021-06-29 Julian Brown <julian@codesourcery.com>
20359 * config/gcn/gcn.md (<su>mulsidi3, <su>mulsidi3_reg, <su>mulsidi3_imm,
20360 muldi3): Add patterns.
20362 2021-06-29 Julian Brown <julian@codesourcery.com>
20364 * config/gcn/gcn.md (<su>mulsi3_highpart): Change to expander.
20365 (<su>mulsi3_highpart_reg, <su>mulsi3_highpart_imm): New patterns.
20367 2021-06-29 Julian Brown <julian@codesourcery.com>
20369 * config/gcn/gcn.md (mulsi3): Make s_mulk_i32 variant clobber SCC.
20371 2021-06-29 Joseph Myers <joseph@codesourcery.com>
20373 * btfout.c, ctfout.c: Include "memmodel.h".
20375 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
20377 * gcc.c (check_offload_target_name): Cast len argument to
20378 %q.*s to 'int'; avoid -Wstringop-truncation warning.
20380 2021-06-29 Richard Biener <rguenther@suse.de>
20382 * tree-vect-slp.c (vect_optimize_slp): Forward propagate
20383 to "any" permute nodes and relax "any" permute proapgation
20384 during iterative backward propagation.
20386 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
20389 * common.opt (-foffload=): Update description.
20390 (-foffload-options=): New.
20391 * doc/invoke.texi (C Language Options): Document
20392 -foffload and -foffload-options.
20393 * gcc.c (check_offload_target_name): New, split off from
20394 handle_foffload_option.
20395 (check_foffload_target_names): New.
20396 (handle_foffload_option): Handle -foffload=default.
20397 (driver_handle_option): Update for -foffload-options.
20398 * lto-opts.c (lto_write_options): Use -foffload-options
20399 instead of -foffload.
20400 * lto-wrapper.c (merge_and_complain, append_offload_options):
20402 * opts.c (common_handle_option): Likewise.
20404 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
20406 * doc/invoke.texi (C Language Options): Sort options
20407 alphabetically in optlist and also the description itself.
20408 Remove leftover -fallow-single-precision from and add missing
20409 -fgnu-tm to the optlist.
20411 2021-06-29 Richard Biener <rguenther@suse.de>
20413 * tree-vect-slp.c (slpg_vertex::visited): Remove.
20414 (vect_slp_perms_eq): Handle -1 permutes.
20415 (vect_optimize_slp): Rewrite permute propagation.
20417 2021-06-29 Jakub Jelinek <jakub@redhat.com>
20420 * match.pd ((intptr_t)x eq/ne CST to x eq/ne (typeof x) CST): Don't
20421 perform the optimization in GENERIC when sanitizing and x has a
20424 2021-06-29 Richard Biener <rguenther@suse.de>
20426 PR tree-optimization/101242
20427 * tree-vect-slp.c (vect_slp_build_vertices): Force-add
20428 PHIs with not represented initial values as leafs.
20430 2021-06-29 Jan-Benedict Glaw <jbglaw@getslash.de>
20432 * config/pdp11/pdp11.h (ASM_OUTPUT_SKIP): Fix signedness warning.
20433 * config/pdp11/pdp11.c (pdp11_asm_print_operand_punct_valid_p): Remove
20434 "register" keyword.
20435 (pdp11_initial_elimination_offset) Remove unused variable.
20436 (pdp11_cmp_length) Ditto.
20437 (pdp11_insn_cost): Ditto, and fix signedness warning.
20439 2021-06-29 David Edelsohn <dje.gcc@gmail.com>
20441 * btfout.c: Include tm_p.h.
20444 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
20446 * config/bpf/bpf.c (bpf_expand_prologue): Do not mark insns as
20448 (bpf_expand_epilogue): Likewise.
20449 * config/bpf/bpf.h (DWARF2_FRAME_INFO): Define to 0.
20450 Do not define DBX_DEBUGGING_INFO.
20452 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
20454 * doc/invoke.texi: Document the CTF and BTF debug info options.
20456 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
20457 David Faust <david.faust@oracle.com>
20458 Jose E. Marchesi <jose.marchesi@oracle.com>
20459 Weimin Pan <weimin.pan@oracle.com>
20461 * Makefile.in: Add ctfc.*, ctfout.c and btfout.c files to
20462 GTFILES. Add new object files.
20463 * common.opt: Add CTF and BTF debug info options.
20464 * btfout.c: New file.
20465 * ctfc.c: Likewise.
20466 * ctfc.h: Likewise.
20467 * ctfout.c: Likewise.
20468 * dwarf2ctf.c: Likewise.
20469 * dwarf2ctf.h: Likewise.
20470 * dwarf2cfi.c (dwarf2out_do_frame): Acknowledge CTF_DEBUG and
20472 * dwarf2out.c (dwarf2out_source_line): Likewise.
20473 (dwarf2out_finish): Skip emitting DWARF if CTF or BTF are to
20475 (debug_format_do_cu): New function.
20476 (dwarf2out_early_finish): Traverse DIEs and emit CTF/BTF for
20478 Include dwarf2ctf.c.
20479 * final.c (dwarf2_debug_info_emitted_p): Acknowledge DWARF-based debug
20481 * flag-types.h (enum debug_info_type): Add CTF_DEBUG and BTF_DEBUG.
20482 (CTF_DEBUG): New bitmask.
20483 (BTF_DEBUG): Likewise.
20484 (enum ctf_debug_info_levels): New enum.
20485 * gengtype.c (open_base_files): Handle ctfc.h.
20486 (main): Handle uint32_t type.
20487 * flags.h (btf_debuginfo_p): New definition.
20488 (dwarf_based_debuginfo_p): Likewise.
20489 * opts.c (debug_type_names): Add entries for CTF and BTF.
20490 (btf_debuginfo_p): New function.
20491 (dwarf_based_debuginfo_p): Likewise.
20492 (common_handle_option): Handle -gctfN and -gbtf options.
20493 (set_debug_level): Set CTF_DEBUG, BTF_DEBUG whenever appropriate.
20494 * toplev.c (process_options): Inform the user and ignore -gctfLEVEL if
20497 2021-06-28 Jose E. Marchesi <jose.marchesi@oracle.com>
20499 * dwarf2out.c (AT_class): Function is no longer static.
20500 (AT_int): Likewise.
20501 (AT_unsigned): Likewise.
20502 (AT_loc): Likewise.
20503 (get_AT): Likewise.
20504 (get_AT_string): Likewise.
20505 (get_AT_flag): Likewise.
20506 (get_AT_unsigned): Likewise.
20507 (get_AT_ref): Likewise.
20508 (new_die_raw): Likewise.
20509 (lookup_decl_die): Likewise.
20510 (base_type_die): Likewise.
20511 (add_name_attribute): Likewise.
20512 (add_AT_int): Likewise.
20513 (add_AT_unsigned): Likewise.
20514 (add_AT_loc): Likewise.
20515 (dw_get_die_tag): New function.
20516 (dw_get_die_child): Likewise.
20517 (dw_get_die_sib): Likewise.
20518 (struct dwarf_file_data): Move from here to dwarf2out.h
20519 (struct dw_attr_struct): Likewise.
20520 * dwarf2out.h: Analogous changes.
20522 2021-06-28 Martin Jambor <mjambor@suse.cz>
20525 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
20526 members m_dead_stmts and m_dead_ssas.
20527 * ipa-param-manipulation.c
20528 (ipa_param_body_adjustments::mark_dead_statements): New function.
20529 (ipa_param_body_adjustments::common_initialization): Call it on
20530 all removed but not split parameters.
20531 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
20533 (ipa_param_body_adjustments::modify_call_stmt): Remove arguments that
20535 * tree-inline.c (remap_gimple_stmt): Do not copy dead statements, reset
20536 dead debug statements.
20537 (copy_phis_for_bb): Do not copy dead PHI nodes.
20539 2021-06-28 Martin Jambor <mjambor@suse.cz>
20542 * symtab-clones.h (clone_info): Removed member param_adjustments.
20543 * ipa-param-manipulation.h: Adjust initial comment to reflect how we
20544 deal with pass-through splits now.
20545 (ipa_param_performed_split): Removed.
20546 (ipa_param_adjustments::modify_call): Adjusted parameters.
20547 (class ipa_param_body_adjustments): Adjusted parameters of
20548 register_replacement, modify_gimple_stmt and modify_call_stmt.
20549 (ipa_verify_edge_has_no_modifications): Declare.
20550 (ipa_edge_modifications_finalize): Declare.
20551 * cgraph.c (cgraph_edge::redirect_call_stmt_to_callee): Remove
20552 performed_splits processing, pas only edge to padjs->modify_call,
20553 check that call arguments were not modified if they should not have
20555 * cgraphclones.c (cgraph_node::create_clone): Do not copy performed
20557 * ipa-param-manipulation.c (struct pass_through_split_map): New type.
20558 (ipa_edge_modification_info): Likewise.
20559 (ipa_edge_modification_sum): Likewise.
20560 (ipa_edge_modifications): New edge summary.
20561 (ipa_verify_edge_has_no_modifications): New function.
20562 (transitive_split_p): Removed.
20563 (transitive_split_map): Likewise.
20564 (init_transitive_splits): Likewise.
20565 (ipa_param_adjustments::modify_call): Adjusted to use the new edge
20566 summary instead of performed_splits.
20567 (ipa_param_body_adjustments::register_replacement): Drop dummy
20568 parameter, set base_index of the created ipa_param_body_replacement.
20569 (phi_arg_will_live_p): New function.
20570 (ipa_param_body_adjustments::common_initialization): Do not create
20571 IPA_SRA dummy decls.
20572 (simple_tree_swap_info): Removed.
20573 (remap_split_decl_to_dummy): Likewise.
20574 (record_argument_state_1): New function.
20575 (record_argument_state): Likewise.
20576 (ipa_param_body_adjustments::modify_call_stmt): New parameter
20577 orig_stmt. Do not work with dummy decls, save necessary info about
20578 changes to ipa_edge_modifications.
20579 (ipa_param_body_adjustments::modify_gimple_stmt): New parameter
20580 orig_stmt, pass it to modify_call_stmt.
20581 (ipa_param_body_adjustments::modify_cfun_body): Adjust call to
20582 modify_gimple_stmt.
20583 (ipa_edge_modifications_finalize): New function.
20584 * tree-inline.c (remap_gimple_stmt): Pass original statement to
20585 modify_gimple_stmt.
20586 (copy_phis_for_bb): Do not copy dead PHI nodes.
20587 (expand_call_inline): Do not remap performed_splits.
20588 (update_clone_info): Likewise.
20589 * toplev.c: Include ipa-param-manipulation.h.
20590 (toplev::finalize): Call ipa_edge_modifications_finalize.
20592 2021-06-28 Andrew Pinski <apinski@marvell.com>
20594 * tree-ssa-phiopt.c (replace_phi_edge_with_variable): Duplicate range
20595 info if we're the only things setting the target PHI.
20596 (value_replacement): Don't duplicate range here.
20597 (minmax_replacement): Likewise.
20599 2021-06-28 Richard Biener <rguenther@suse.de>
20601 PR tree-optimization/101229
20602 * gimple-walk.c (gimple_walk_op): Handle PHIs.
20604 2021-06-28 Martin Liska <mliska@suse.cz>
20606 * config/v850/v850.c (construct_dispose_instruction): Allocate
20608 (construct_prepare_instruction): Likewise.
20610 2021-06-28 Martin Liska <mliska@suse.cz>
20612 * config/v850/v850.c (v850_option_override): Build default
20614 (v850_can_inline_p): New. Allow MASK_PROLOG_FUNCTION to be
20615 ignored for inlining.
20616 (TARGET_CAN_INLINE_P): New.
20618 2021-06-28 Richard Biener <rguenther@suse.de>
20620 PR tree-optimization/101207
20621 * tree-vect-slp.c (vect_optimize_slp): Do BB reduction
20622 permute eliding for load permutations properly.
20624 2021-06-28 Richard Biener <rguenther@suse.de>
20626 PR tree-optimization/101173
20627 * gimple-loop-interchange.cc
20628 (tree_loop_interchange::valid_data_dependences): Disallow outer
20629 loop dependence distance of zero.
20631 2021-06-28 liuhongt <hongtao.liu@intel.com>
20634 * config/i386/sse.md (*avx_cmp<mode>3_lt): New
20635 define_insn_and_split.
20636 (*avx_cmp<mode>3_ltint): Ditto.
20637 (*avx2_pcmp<mode>3_3): Ditto.
20638 (*avx2_pcmp<mode>3_4): Ditto.
20639 (*avx2_pcmp<mode>3_5): Ditto.
20641 2021-06-28 liuhongt <hongtao.liu@intel.com>
20643 * config/i386/i386-builtin.def (IX86_BUILTIN_BLENDVPD256,
20644 IX86_BUILTIN_BLENDVPS256, IX86_BUILTIN_PBLENDVB256,
20645 IX86_BUILTIN_BLENDVPD, IX86_BUILTIN_BLENDVPS,
20646 IX86_BUILTIN_PBLENDVB128): Replace icode with
20648 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold blendv
20650 * config/i386/sse.md (*<sse4_1_avx2>_pblendvb_lt_subreg_not):
20651 New pre_reload splitter.
20653 2021-06-27 Andrew Pinski <apinski@marvell.com>
20655 PR middle-end/101230
20656 * fold-const.c (fold_ternary_loc): Check
20657 the return value of invert_tree_comparison.
20659 2021-06-27 David Edelsohn <dje.gcc@gmail.com>
20661 * config.gcc: Add SPDX License Identifier.
20662 (powerpc-ibm-aix789): Default to aix73.h.
20663 (powerpc-ibm-aix7.2.*.*): New stanza.
20664 * config/rs6000/aix72.h: Add SPDX License Identifier.
20665 * config/rs6000/aix73.h: New file.
20667 2021-06-26 Jason Merrill <jason@redhat.com>
20669 * except.c: #include "dwarf2.h" instead of "dwarf2out.h".
20671 2021-06-26 Andrew Pinski <apinski@marvell.com>
20673 * genmatch.c (lower_cond): Copy for_subst_vec
20674 for the simplify also.
20675 (lower): Swap the order for lower_for and lower_cond.
20677 2021-06-26 Andrew Pinski <apinski@marvell.com>
20679 * tree-ssa-phiopt.c (match_simplify_replacement): Reset
20680 flow senatitive info on the moved ssa set.
20682 2021-06-26 Andrew Pinski <apinski@marvell.com>
20684 * fold-const.c (fold_cond_expr_with_comparison):
20685 Exand arg0 into comp_code, arg00, and arg01.
20686 (fold_ternary_loc): Use invert_tree_comparison
20687 instead of fold_invert_truthvalue for the case
20688 where we have A CMP B ? C : A.
20690 2021-06-25 Martin Sebor <msebor@redhat.com>
20692 PR middle-end/101216
20693 * calls.c (maybe_warn_rdwr_sizes): Use the no_warning constant.
20695 2021-06-25 Jeff Law <jeffreyalaw@gmail.com>
20697 * config/h8300/h8300.c (select_cc_mode): Handle ASHIFTRT and LSHIFTRT.
20699 2021-06-25 Richard Biener <rguenther@suse.de>
20701 PR tree-optimization/101202
20702 * tree-vect-slp.c (vect_optimize_slp): Explicitely handle
20705 2021-06-25 Richard Biener <rguenther@suse.de>
20707 * tree-vect-slp-patterns.c (addsub_pattern::build): Copy
20708 STMT_VINFO_REDUC_DEF from the original representative.
20710 2021-06-25 Martin Sebor <msebor@redhat.com>
20712 * builtins.c (warn_string_no_nul): Replace uses of TREE_NO_WARNING,
20713 gimple_no_warning_p and gimple_set_no_warning with
20714 warning_suppressed_p, and suppress_warning.
20716 (maybe_warn_for_bound): Same.
20717 (warn_for_access): Same.
20718 (check_access): Same.
20719 (expand_builtin_strncmp): Same.
20720 (fold_builtin_varargs): Same.
20721 * calls.c (maybe_warn_nonstring_arg): Same.
20722 (maybe_warn_rdwr_sizes): Same.
20723 * cfgexpand.c (expand_call_stmt): Same.
20724 * cgraphunit.c (check_global_declaration): Same.
20725 * fold-const.c (fold_undefer_overflow_warnings): Same.
20726 (fold_truth_not_expr): Same.
20727 (fold_unary_loc): Same.
20728 (fold_checksum_tree): Same.
20729 * gimple-array-bounds.cc (array_bounds_checker::check_array_ref): Same.
20730 (array_bounds_checker::check_mem_ref): Same.
20731 (array_bounds_checker::check_addr_expr): Same.
20732 (array_bounds_checker::check_array_bounds): Same.
20733 * gimple-expr.c (copy_var_decl): Same.
20734 * gimple-fold.c (gimple_fold_builtin_strcpy): Same.
20735 (gimple_fold_builtin_strncat): Same.
20736 (gimple_fold_builtin_stxcpy_chk): Same.
20737 (gimple_fold_builtin_stpcpy): Same.
20738 (gimple_fold_builtin_sprintf): Same.
20739 (fold_stmt_1): Same.
20740 * gimple-ssa-isolate-paths.c (diag_returned_locals): Same.
20741 * gimple-ssa-nonnull-compare.c (do_warn_nonnull_compare): Same.
20742 * gimple-ssa-sprintf.c (handle_printf_call): Same.
20743 * gimple-ssa-store-merging.c (imm_store_chain_info::output_merged_store): Same.
20744 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
20745 * gimple-ssa-warn-restrict.h: Adjust declarations.
20746 (maybe_diag_access_bounds): Replace uses of TREE_NO_WARNING,
20747 gimple_no_warning_p and gimple_set_no_warning with
20748 warning_suppressed_p, and suppress_warning.
20749 (check_call): Same.
20750 (check_bounds_or_overlap): Same.
20751 * gimple.c (gimple_build_call_from_tree): Same.
20752 * gimplify.c (gimplify_return_expr): Same.
20753 (gimplify_cond_expr): Same.
20754 (gimplify_modify_expr_complex_part): Same.
20755 (gimplify_modify_expr): Same.
20756 (gimple_push_cleanup): Same.
20757 (gimplify_expr): Same.
20758 * omp-expand.c (expand_omp_for_generic): Same.
20759 (expand_omp_taskloop_for_outer): Same.
20760 * omp-low.c (lower_rec_input_clauses): Same.
20761 (lower_lastprivate_clauses): Same.
20762 (lower_send_clauses): Same.
20763 (lower_omp_target): Same.
20764 * tree-cfg.c (pass_warn_function_return::execute): Same.
20765 * tree-complex.c (create_one_component_var): Same.
20766 * tree-inline.c (remap_gimple_op_r): Same.
20767 (copy_tree_body_r): Same.
20768 (declare_return_variable): Same.
20769 (expand_call_inline): Same.
20770 * tree-nested.c (lookup_field_for_decl): Same.
20771 * tree-sra.c (create_access_replacement): Same.
20772 (generate_subtree_copies): Same.
20773 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Same.
20774 * tree-ssa-forwprop.c (combine_cond_expr_cond): Same.
20775 * tree-ssa-loop-ch.c (ch_base::copy_headers): Same.
20776 * tree-ssa-loop-im.c (execute_sm): Same.
20777 * tree-ssa-phiopt.c (cond_store_replacement): Same.
20778 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
20779 (handle_builtin_strcpy): Same.
20780 (maybe_diag_stxncpy_trunc): Same.
20781 (handle_builtin_stxncpy_strncat): Same.
20782 (handle_builtin_strcat): Same.
20783 * tree-ssa-uninit.c (get_no_uninit_warning): Same.
20784 (set_no_uninit_warning): Same.
20785 (uninit_undefined_value_p): Same.
20786 (warn_uninit): Same.
20787 (maybe_warn_operand): Same.
20788 * tree-vrp.c (compare_values_warnv): Same.
20789 * vr-values.c (vr_values::extract_range_for_var_from_comparison_expr): Same.
20790 (test_for_singularity): Same.
20791 * gimple.h (warning_suppressed_p): New function.
20792 (suppress_warning): Same.
20793 (copy_no_warning): Same.
20794 (gimple_set_block): Call gimple_set_location.
20795 (gimple_set_location): Call copy_warning.
20797 2021-06-25 Martin Sebor <msebor@redhat.com>
20799 * tree.h (warning_suppressed_at, copy_warning,
20800 warning_suppressed_p, suppress_warning): New functions.
20802 2021-06-25 Martin Sebor <msebor@redhat.com>
20804 * Makefile.in (OBJS-libcommon): Add diagnostic-spec.o.
20805 * gengtype.c (open_base_files): Add diagnostic-spec.h.
20806 * diagnostic-spec.c: New file.
20807 * diagnostic-spec.h: New file.
20808 * tree.h (no_warning, all_warnings, suppress_warning_at): New
20810 * warning-control.cc: New file.
20812 2021-06-25 liuhongt <hongtao.liu@intel.com>
20815 * config/i386/i386.c (x86_order_regs_for_local_alloc):
20818 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20820 PR tree-optimization/101189
20821 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Pass
20822 LHS range of condition to postfold routine.
20823 (fold_using_range::postfold_gcond_edges): Only process the TRUE or
20824 FALSE edge if the LHS range supports it being taken.
20825 * gimple-range-fold.h (postfold_gcond_edges): Add range parameter.
20827 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20829 * value-relation.cc (equiv_oracle::dump): Do not dump NULL blocks.
20830 (relation_oracle::find_relation_block): Check correct bitmap.
20831 (relation_oracle::dump): Do not dump NULL blocks.
20833 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20835 * gimple-range-cache.cc (ranger_cache::propagate_cache): Call
20836 range_on_edge instead of manually calculating.
20838 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20840 * range-op.cc: Fix comment.
20842 2021-06-24 Uroš Bizjak <ubizjak@gmail.com>
20845 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
20846 Handle V8QI and V4HI modes.
20847 * config/i386/mmx.md (sse4_1_<any_extend:code>v4qiv4hi2):
20849 (sse4_1_<any_extend:code>v4qiv4hi2): Ditto.
20850 (mmxpackmode): New mode attribute.
20851 (vec_pack_trunc_<mmxpackmode:mode>): New expander.
20852 (mmxunpackmode): New mode attribute.
20853 (vec_unpacks_lo_<mmxunpackmode:mode>): New expander.
20854 (vec_unpacks_hi_<mmxunpackmode:mode>): Ditto.
20855 (vec_unpacku_lo_<mmxunpackmode:mode>): Ditto.
20856 (vec_unpacku_hi_<mmxunpackmode:mode>): Ditto.
20857 * config/i386/i386.md (extsuffix): Move from ...
20858 * config/i386/sse.md: ... here.
20860 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
20862 * dwarf2out.c (dwarf2out_assembly_start): Emit .file 0 marker here..
20863 (dwarf2out_finish): ...instead of here.
20865 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
20867 * configure.ac (--gdwarf-5 option): Use objdump instead of readelf.
20868 (working --gdwarf-4/--gdwarf-5 for all sources): Likewise.
20869 (--gdwarf-4 not refusing generated .debug_line): Adjust for Windows.
20870 * configure: Regenerate.
20872 2021-06-24 Richard Biener <rguenther@suse.de>
20874 * config/i386/sse.md (vec_addsubv4df3, vec_addsubv2df3,
20875 vec_addsubv8sf3, vec_addsubv4sf3): Merge into ...
20876 (vec_addsub<mode>3): ... using a new addsub_cst mode attribute.
20878 2021-06-24 Richard Biener <rguenther@suse.de>
20880 * config/i386/sse.md (avx_addsubv4df3): Rename to
20882 (avx_addsubv8sf3): Rename to vec_addsubv8sf3.
20883 (sse3_addsubv2df3): Rename to vec_addsubv2df3.
20884 (sse3_addsubv4sf3): Rename to vec_addsubv4sf3.
20885 * config/i386/i386-builtin.def: Adjust.
20886 * internal-fn.def (VEC_ADDSUB): New internal optab fn.
20887 * optabs.def (vec_addsub_optab): New optab.
20888 * tree-vect-slp-patterns.c (class addsub_pattern): New.
20889 (slp_patterns): Add addsub_pattern.
20890 * tree-vect-slp.c (vect_optimize_slp): Disable propagation
20891 across CFN_VEC_ADDSUB.
20892 * tree-vectorizer.h (vect_pattern::vect_pattern): Make
20894 * doc/md.texi (vec_addsub<mode>3): Document.
20896 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20898 PR middle-end/101170
20899 * df-scan.c (df_ref_record): For paradoxical big-endian SUBREGs
20900 where regno + subreg_regno_offset wraps around use 0 as starting
20903 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20905 PR middle-end/101172
20906 * stor-layout.c (finish_bitfield_representative): If nextf has
20907 error_mark_node type, set repr type to error_mark_node too.
20909 2021-06-24 Ilya Leoshkevich <iii@linux.ibm.com>
20911 * config/s390/s390.c (s390_function_profiler): Ignore labelno
20913 * config/s390/s390.h (NO_PROFILE_COUNTERS): Define.
20915 2021-06-24 Richard Biener <rguenther@suse.de>
20917 * tree-vect-slp.c (vect_optimize_slp): Do not propagate
20918 across operations that have different semantics on different
20921 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20923 * tree.h (OMP_CLAUSE_MAP_IN_REDUCTION): Document meaning for OpenMP.
20924 * gimplify.c (gimplify_scan_omp_clauses): For OpenMP map clauses
20925 with OMP_CLAUSE_MAP_IN_REDUCTION flag partially defer gimplification
20926 of non-decl OMP_CLAUSE_DECL. For OMP_CLAUSE_IN_REDUCTION on
20927 OMP_TARGET user outer_ctx instead of ctx for placeholders and
20928 initializer/combiner gimplification.
20929 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_MAP_IN_REDUCTION
20930 on target constructs.
20931 (lower_rec_input_clauses): Likewise.
20932 (lower_omp_target): Likewise.
20933 * omp-expand.c (expand_omp_target): Temporarily ignore nowait clause
20934 on target if in_reduction is present.
20936 2021-06-24 Kewen Lin <linkw@linux.ibm.com>
20938 * tree-predcom.c (class pcom_worker): New class.
20939 (release_chain): Renamed to...
20940 (pcom_worker::release_chain): ...this.
20941 (release_chains): Renamed to...
20942 (pcom_worker::release_chains): ...this.
20943 (aff_combination_dr_offset): Renamed to...
20944 (pcom_worker::aff_combination_dr_offset): ...this.
20945 (determine_offset): Renamed to...
20946 (pcom_worker::determine_offset): ...this.
20947 (class comp_ptrs): New class.
20948 (split_data_refs_to_components): Renamed to...
20949 (pcom_worker::split_data_refs_to_components): ...this,
20950 and update with class comp_ptrs.
20951 (suitable_component_p): Renamed to...
20952 (pcom_worker::suitable_component_p): ...this.
20953 (filter_suitable_components): Renamed to...
20954 (pcom_worker::filter_suitable_components): ...this.
20955 (valid_initializer_p): Renamed to...
20956 (pcom_worker::valid_initializer_p): ...this.
20957 (find_looparound_phi): Renamed to...
20958 (pcom_worker::find_looparound_phi): ...this.
20959 (add_looparound_copies): Renamed to...
20960 (pcom_worker::add_looparound_copies): ...this.
20961 (determine_roots_comp): Renamed to...
20962 (pcom_worker::determine_roots_comp): ...this.
20963 (determine_roots): Renamed to...
20964 (pcom_worker::determine_roots): ...this.
20965 (single_nonlooparound_use): Renamed to...
20966 (pcom_worker::single_nonlooparound_use): ...this.
20967 (remove_stmt): Renamed to...
20968 (pcom_worker::remove_stmt): ...this.
20969 (execute_pred_commoning_chain): Renamed to...
20970 (pcom_worker::execute_pred_commoning_chain): ...this.
20971 (execute_pred_commoning): Renamed to...
20972 (pcom_worker::execute_pred_commoning): ...this.
20973 (struct epcc_data): New member worker.
20974 (execute_pred_commoning_cbck): Call execute_pred_commoning
20975 with pcom_worker pointer.
20976 (find_use_stmt): Renamed to...
20977 (pcom_worker::find_use_stmt): ...this.
20978 (find_associative_operation_root): Renamed to...
20979 (pcom_worker::find_associative_operation_root): ...this.
20980 (find_common_use_stmt): Renamed to...
20981 (pcom_worker::find_common_use_stmt): ...this.
20982 (combinable_refs_p): Renamed to...
20983 (pcom_worker::combinable_refs_p): ...this.
20984 (reassociate_to_the_same_stmt): Renamed to...
20985 (pcom_worker::reassociate_to_the_same_stmt): ...this.
20986 (stmt_combining_refs): Renamed to...
20987 (pcom_worker::stmt_combining_refs): ...this.
20988 (combine_chains): Renamed to...
20989 (pcom_worker::combine_chains): ...this.
20990 (try_combine_chains): Renamed to...
20991 (pcom_worker::try_combine_chains): ...this.
20992 (prepare_initializers_chain): Renamed to...
20993 (pcom_worker::prepare_initializers_chain): ...this.
20994 (prepare_initializers): Renamed to...
20995 (pcom_worker::prepare_initializers): ...this.
20996 (prepare_finalizers_chain): Renamed to...
20997 (pcom_worker::prepare_finalizers_chain): ...this.
20998 (prepare_finalizers): Renamed to...
20999 (pcom_worker::prepare_finalizers): ...this.
21000 (tree_predictive_commoning_loop): Renamed to...
21001 (pcom_worker::tree_predictive_commoning_loop): ...this, adjust
21002 some calls and remove some cleanup code.
21003 (tree_predictive_commoning): Adjusted to use pcom_worker instance.
21004 (static variable looparound_phis): Remove.
21005 (static variable name_expansions): Remove.
21007 2021-06-24 Richard Biener <rguenther@suse.de>
21009 * tree-vect-slp.c (slpg_vertex): New struct.
21010 (vect_slp_build_vertices): Adjust.
21011 (vect_optimize_slp): Likewise. Maintain an outgoing permute
21012 and a materialized one.
21014 2021-06-24 Richard Biener <rguenther@suse.de>
21016 PR tree-optimization/101105
21017 * tree-vect-data-refs.c (vect_prune_runtime_alias_test_list):
21018 Only ignore steps when they are equal or scalar order is preserved.
21020 2021-06-24 liuhongt <hongtao.liu@intel.com>
21023 * config/i386/i386-expand.c (ix86_expand_vec_interleave):
21024 Adjust comments for ix86_expand_vecop_qihi2.
21025 (ix86_expand_vecmul_qihi): Renamed to ..
21026 (ix86_expand_vecop_qihi2): Adjust function prototype to
21027 support shift operation, add static to definition.
21028 (ix86_expand_vec_shift_qihi_constant): Add static to definition.
21029 (ix86_expand_vecop_qihi): Call ix86_expand_vecop_qihi2 and
21030 ix86_expand_vec_shift_qihi_constant.
21031 * config/i386/i386-protos.h (ix86_expand_vecmul_qihi): Deleted.
21032 (ix86_expand_vec_shift_qihi_constant): Deleted.
21033 * config/i386/sse.md (VI12_256_512_AVX512VL): New mode
21035 (mulv8qi3): Call ix86_expand_vecop_qihi directly, add
21036 condition TARGET_64BIT.
21037 (mul<mode>3): Ditto.
21038 (<insn><mode>3): Ditto.
21039 (vlshr<mode>3): Extend to support avx512 vlshr.
21040 (v<insn><mode>3): New expander for
21042 (v<insn>v8qi3): Ditto.
21043 (vashrv8hi3<mask_name>): Renamed to ..
21044 (vashr<mode>3): And extend to support V16QImode for avx512.
21045 (vashrv16qi3): Deleted.
21046 (vashrv2di3<mask_name>): Extend expander to support avx512
21049 2021-06-23 Dimitar Dimitrov <dimitar@dinux.eu>
21051 * doc/lto.texi (Design Overview): Update that slim objects are
21054 2021-06-23 Aaron Sawdey <acsawdey@linux.ibm.com>
21056 * config/rs6000/rs6000-cpus.def: Take OPTION_MASK_PCREL_OPT out
21057 of OTHER_POWER10_MASKS so it will not be enabled by default.
21059 2021-06-23 Richard Biener <rguenther@suse.de>
21060 Martin Jambor <mjambor@suse.cz>
21062 * tree-inline.c (setup_one_parameter): Set TREE_READONLY of the
21063 param replacement unconditionally. Adjust comment.
21065 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
21067 * Makefile.in (OBJS): Add gimple-range-fold.o
21068 * gimple-range-fold.cc: New.
21069 * gimple-range-fold.h: New.
21070 * gimple-range-gori.cc (gimple_range_calc_op1): Move to here.
21071 (gimple_range_calc_op2): Ditto.
21072 * gimple-range-gori.h: Move prototypes to here.
21073 * gimple-range.cc: Adjust include files.
21074 (fur_source:fur_source): Relocate to gimple-range-fold.cc.
21075 (fur_source::get_operand): Ditto.
21076 (fur_source::get_phi_operand): Ditto.
21077 (fur_source::query_relation): Ditto.
21078 (fur_source::register_relation): Ditto.
21079 (class fur_edge): Ditto.
21080 (fur_edge::fur_edge): Ditto.
21081 (fur_edge::get_operand): Ditto.
21082 (fur_edge::get_phi_operand): Ditto.
21083 (fur_stmt::fur_stmt): Ditto.
21084 (fur_stmt::get_operand): Ditto.
21085 (fur_stmt::get_phi_operand): Ditto.
21086 (fur_stmt::query_relation): Ditto.
21087 (class fur_depend): Relocate to gimple-range-fold.h.
21088 (fur_depend::fur_depend): Relocate to gimple-range-fold.cc.
21089 (fur_depend::register_relation): Ditto.
21090 (fur_depend::register_relation): Ditto.
21091 (class fur_list): Ditto.
21092 (fur_list::fur_list): Ditto.
21093 (fur_list::get_operand): Ditto.
21094 (fur_list::get_phi_operand): Ditto.
21095 (fold_range): Ditto.
21096 (adjust_pointer_diff_expr): Ditto.
21097 (gimple_range_adjustment): Ditto.
21098 (gimple_range_base_of_assignment): Ditto.
21099 (gimple_range_operand1): Ditto.
21100 (gimple_range_operand2): Ditto.
21101 (gimple_range_calc_op1): Relocate to gimple-range-gori.cc.
21102 (gimple_range_calc_op2): Ditto.
21103 (fold_using_range::fold_stmt): Relocate to gimple-range-fold.cc.
21104 (fold_using_range::range_of_range_op): Ditto.
21105 (fold_using_range::range_of_address): Ditto.
21106 (fold_using_range::range_of_phi): Ditto.
21107 (fold_using_range::range_of_call): Ditto.
21108 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
21109 (fold_using_range::range_of_builtin_call): Ditto.
21110 (fold_using_range::range_of_cond_expr): Ditto.
21111 (fold_using_range::range_of_ssa_name_with_loop_info): Ditto.
21112 (fold_using_range::relation_fold_and_or): Ditto.
21113 (fold_using_range::postfold_gcond_edges): Ditto.
21114 * gimple-range.h: Add gimple-range-fold.h to include files. Change
21115 GIMPLE_RANGE_STMT_H to GIMPLE_RANGE_H.
21116 (gimple_range_handler): Relocate to gimple-range-fold.h.
21117 (gimple_range_ssa_p): Ditto.
21118 (range_compatible_p): Ditto.
21119 (class fur_source): Ditto.
21120 (class fur_stmt): Ditto.
21121 (class fold_using_range): Ditto.
21122 (gimple_range_calc_op1): Relocate to gimple-range-gori.h
21123 (gimple_range_calc_op2): Ditto.
21125 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
21127 PR tree-optimization/101148
21128 PR tree-optimization/101014
21129 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
21130 (ranger_cache::~ranger_cache): Adjust.
21131 (ranger_cache::block_range): Check if propagation disallowed.
21132 (ranger_cache::propagate_cache): Disallow propagation if new value
21133 can't be stored properly.
21134 * gimple-range-cache.h (ranger_cache::m_propfail): New member.
21136 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
21138 * gimple-range-cache.cc (class ssa_block_ranges): Adjust prototype.
21139 (sbr_vector::set_bb_range): Return true.
21140 (class sbr_sparse_bitmap): Adjust.
21141 (sbr_sparse_bitmap::set_bb_range): Return value.
21142 (block_range_cache::set_bb_range): Return value.
21143 (ranger_cache::propagate_cache): Use return value to print msg.
21144 * gimple-range-cache.h (class block_range_cache): Adjust.
21146 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
21148 * gimple-range.cc (dump_bb): Use range_on_edge from the cache.
21150 2021-06-23 Jeff Law <jeffreyalaw@gmail.com>
21152 * config/h8300/logical.md (<code><mode>3<ccnz>): Use <cczn>
21153 so this pattern can be used for test/compare removal. Pass
21154 current insn to compute_logical_op_length and output_logical_op.
21155 * config/h8300/h8300.c (compute_logical_op_cc): Remove.
21156 (h8300_and_costs): Add argument to compute_logical_op_length.
21157 (output_logical_op): Add new argument. Use it to determine if the
21158 condition codes are used and adjust the output accordingly.
21159 (compute_logical_op_length): Add new argument and update length
21160 computations when condition codes are used.
21161 * config/h8300/h8300-protos.h (compute_logical_op_length): Update
21163 (output_logical_op): Likewise.
21165 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
21168 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
21169 Handle 64bit modes for TARGET_XOP. Use indirect gen_* functions.
21170 * config/i386/mmx.md (mmx_ppermv64): New insn pattern.
21171 * config/i386/i386.md (unspec): Move UNSPEC_XOP_PERMUTE from ...
21172 * config/i386/sse.md (unspec): ... here.
21174 2021-06-23 Martin Liska <mliska@suse.cz>
21177 * optc-save-gen.awk: Put back arm_fp16_format to
21180 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
21183 * config/i386/i386.md (bsr_rex64): Add zero-flag setting RTX.
21186 (clz<mode>2): Update RTX pattern for additions.
21188 2021-06-23 Jakub Jelinek <jakub@redhat.com>
21190 PR middle-end/101167
21191 * omp-low.c (lower_omp_regimplify_p): Regimplify also PARM_DECLs
21192 and RESULT_DECLs that have DECL_HAS_VALUE_EXPR_P set.
21194 2021-06-22 Sergei Trofimovich <siarheit@google.com>
21196 * doc/rtl.texi: drop unbalanced parenthesis.
21198 2021-06-22 Richard Biener <rguenther@suse.de>
21200 PR middle-end/101156
21201 * gimplify.c (gimplify_expr): Remove premature incorrect
21204 2021-06-22 Jakub Jelinek <jakub@redhat.com>
21206 PR tree-optimization/101159
21207 * tree-vect-patterns.c (vect_recog_popcount_pattern): Fix some
21210 2021-06-22 Jakub Jelinek <jakub@redhat.com>
21212 PR middle-end/101160
21213 * function.c (assign_parms): For decl_result with TYPE_EMPTY_P type
21214 clear crtl->return_rtx instead of keeping it referencing a pseudo.
21216 2021-06-22 Jakub Jelinek <jakub@redhat.com>
21217 Andrew Pinski <apinski@marvell.com>
21219 PR tree-optimization/101162
21220 * fold-const.c (range_check_type): Handle OFFSET_TYPE like pointer
21223 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21225 * range-op.cc (range_relational_tests): New.
21226 (range_op_tests): Call range_relational_tests.
21228 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21230 * range-op.cc (operator_cast::lhs_op1_relation): New.
21231 (operator_identity::lhs_op1_relation): Mew.
21233 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21235 * range-op.cc (operator_minus::op1_op2_relation_effect): New.
21237 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21239 * range-op.cc (operator_plus::lhs_op1_relation): New.
21240 (operator_plus::lhs_op2_relation): New.
21242 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21244 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a
21245 relation_oracle if dominators exist.
21246 (ranger_cache::~ranger_cache): Dispose of oracle.
21247 (ranger_cache::dump_bb): Dump oracle.
21248 * gimple-range.cc (fur_source::fur_source): New.
21249 (fur_source::get_operand): Use mmeber query.
21250 (fur_source::get_phi_operand): Use member_query.
21251 (fur_source::query_relation): New.
21252 (fur_source::register_dependency): Delete.
21253 (fur_source::register_relation): New.
21254 (fur_edge::fur_edge): Adjust.
21255 (fur_edge::get_phi_operand): Fix comment.
21256 (fur_edge::query): Delete.
21257 (fur_stmt::fur_stmt): Adjust.
21258 (fur_stmt::query): Delete.
21259 (fur_depend::fur_depend): Adjust.
21260 (fur_depend::register_relation): New.
21261 (fur_depend::register_relation): New.
21262 (fur_list::fur_list): Adjust.
21263 (fur_list::get_operand): Use member query.
21264 (fold_using_range::range_of_range_op): Process and query relations.
21265 (fold_using_range::range_of_address): Adjust dependency call.
21266 (fold_using_range::range_of_phi): Ditto.
21267 (gimple_ranger::gimple_ranger): New. Use ranger_ache oracle.
21268 (fold_using_range::relation_fold_and_or): New.
21269 (fold_using_range::postfold_gcond_edges): New.
21270 * gimple-range.h (class gimple_ranger): Adjust.
21271 (class fur_source): Adjust members.
21272 (class fur_stmt): Ditto.
21273 (class fold_using_range): Ditto.
21275 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21277 * range-op.cc (range_operator::wi_fold): Apply relation effect.
21278 (range_operator::fold_range): Adjust and apply relation effect.
21279 (*::fold_range): Add relation parameters.
21280 (*::op1_range): Ditto.
21281 (*::op2_range): Ditto.
21282 (range_operator::lhs_op1_relation): New.
21283 (range_operator::lhs_op2_relation): New.
21284 (range_operator::op1_op2_relation): New.
21285 (range_operator::op1_op2_relation_effect): New.
21286 (relop_early_resolve): New.
21287 (operator_equal::op1_op2_relation): New.
21288 (operator_equal::fold_range): Call relop_early_resolve.
21289 (operator_not_equal::op1_op2_relation): New.
21290 (operator_not_equal::fold_range): Call relop_early_resolve.
21291 (operator_lt::op1_op2_relation): New.
21292 (operator_lt::fold_range): Call relop_early_resolve.
21293 (operator_le::op1_op2_relation): New.
21294 (operator_le::fold_range): Call relop_early_resolve.
21295 (operator_gt::op1_op2_relation): New.
21296 (operator_gt::fold_range): Call relop_early_resolve.
21297 (operator_ge::op1_op2_relation): New.
21298 (operator_ge::fold_range): Call relop_early_resolve.
21299 * range-op.h (class range_operator): Adjust parameters and methods.
21301 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
21303 * Makefile.in (OBJS): Add value-relation.o.
21304 * gimple-range.h: Adjust include files.
21305 * tree-data-ref.c: Adjust include file order.
21306 * value-query.cc (range_query::get_value_range): Default to no oracle.
21307 (range_query::query_relation): New.
21308 (range_query::query_relation): New.
21309 * value-query.h (class range_query): Adjust.
21310 * value-relation.cc: New.
21311 * value-relation.h: New.
21313 2021-06-22 Richard Biener <rguenther@suse.de>
21315 PR tree-optimization/101151
21316 * tree-ssa-sink.c (statement_sink_location): Expand irreducible
21319 2021-06-22 Jojo R <rjiejie@linux.alibaba.com>
21321 * config/riscv/riscv.c (thead_c906_tune_info): New.
21322 (riscv_tune_info_table): Use new tune.
21324 2021-06-22 Richard Biener <rguenther@suse.de>
21326 PR tree-optimization/101158
21327 * tree-vect-slp.c (vect_build_slp_tree_1): Move same operand
21328 checking after checking for matching operation.
21330 2021-06-22 Richard Biener <rguenther@suse.de>
21332 PR tree-optimization/101159
21333 * tree-vect-patterns.c (vect_recog_popcount_pattern): Add
21334 missing NULL vectype check.
21336 2021-06-22 Richard Biener <rguenther@suse.de>
21338 PR tree-optimization/101154
21339 * tree-vect-slp.c (vect_build_slp_tree_2): Fix out-of-bound access.
21341 2021-06-22 Jakub Jelinek <jakub@redhat.com>
21344 * config/i386/i386-protos.h (ix86_last_zero_store_uid): Declare.
21345 * config/i386/i386-expand.c (ix86_last_zero_store_uid): New variable.
21346 * config/i386/i386.c (ix86_expand_prologue): Clear it.
21347 * config/i386/i386.md (peephole2s for 1/2/4 stores of const0_rtx):
21348 Remove "" from match_operand. Emit new insns using emit_move_insn and
21349 set ix86_last_zero_store_uid to INSN_UID of the last store.
21350 Add peephole2s for 1/2/4 stores of const0_rtx following previous
21353 2021-06-22 Martin Liska <mliska@suse.cz>
21355 * auto-profile.c (AUTO_PROFILE_VERSION): Bump as string format
21358 2021-06-22 Martin Liska <mliska@suse.cz>
21360 * gcov-io.h: Remove padding entries.
21362 2021-06-22 liuhongt <hongtao.liu@intel.com>
21364 PR tree-optimization/97770
21365 * tree-vect-patterns.c (vect_recog_popcount_pattern):
21367 (vect_recog_func vect_vect_recog_func_ptrs): Add new pattern.
21369 2021-06-22 liuhongt <hongtao.liu@intel.com>
21372 * config/i386/i386-builtin.def (BDESC): Adjust builtin name.
21373 * config/i386/sse.md (<avx512>_expand<mode>_mask): Rename to ..
21374 (expand<mode>_mask): this ..
21375 (*expand<mode>_mask): New pre_reload splitter to transform
21376 v{,p}expand* to vmov* when mask is zero, all ones, or has all
21377 ones in it's lower part, otherwise still generate
21380 2021-06-22 liuhongt <hongtao.liu@intel.com>
21383 * config/i386/i386-expand.c
21384 (ix86_expand_special_args_builtin): Keep constm1_operand only
21385 if it satisfies insn's operand predicate.
21387 2021-06-21 Jason Merrill <jason@redhat.com>
21390 * df-scan.c (df_ref_record): Check that regno < endregno.
21391 * function.c (assign_parms, expand_function_end): Do nothing with a
21392 TYPE_EMPTY_P result.
21394 2021-06-21 Richard Biener <rguenther@suse.de>
21396 PR tree-optimization/101120
21397 * tree-vect-data-refs.c (bump_vector_ptr): Fold the
21399 * tree-vect-slp.c (vect_transform_slp_perm_load): Add
21400 DR chain DCE capability.
21401 * tree-vectorizer.h (vect_transform_slp_perm_load): Adjust.
21402 * tree-vect-stmts.c (vectorizable_load): Remove unused
21403 loads in the DR chain for SLP.
21405 2021-06-21 Jakub Jelinek <jakub@redhat.com>
21407 PR inline-asm/100785
21408 * gimplify.c (gimplify_asm_expr): Don't diagnose errors if
21409 output or input operands were already error_mark_node.
21410 * cfgexpand.c (expand_asm_stmt): If errors are emitted,
21411 remove all inputs, outputs and clobbers from the asm and
21412 set template to "".
21414 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
21416 * config/arm/arm_neon.h (vceq_s8): Replace builtin with __a == __b.
21417 (vceq_s16): Likewise.
21418 (vceq_s32): Likewise.
21419 (vceq_u8): Likewise.
21420 (vceq_u16): Likewise.
21421 (vceq_u32): Likewise.
21422 (vceq_p8): Likewise.
21423 (vceqq_s8): Likewise.
21424 (vceqq_s16): Likewise.
21425 (vceqq_s32): Likewise.
21426 (vceqq_u8): Likewise.
21427 (vceqq_u16): Likewise.
21428 (vceqq_u32): Likewise.
21429 (vceqq_p8): Likewise.
21430 (vceq_f32): Gate __a == __b on __FAST_MATH__.
21431 (vceqq_f32): Likewise.
21432 (vceq_f16): Likewise.
21433 (vceqq_f16): Likewise.
21435 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
21438 * config/arm/iterators.md (NEON_VACMP): Remove.
21439 * config/arm/neon.md (neon_vca<cmp_op><mode>): Use GLTE instead of GTGE
21441 (neon_vca<cmp_op><mode>_insn): Likewise.
21442 (neon_vca<cmp_op_unsp><mode>_insn_unspec): Use NEON_VAGLTE instead of
21445 2021-06-21 Richard Biener <rguenther@suse.de>
21447 PR tree-optimization/101121
21448 * tree-vect-slp.c (vect_build_slp_tree_2): To not fail fatally
21449 when we just lack a stmt with the desired op when doing permutation.
21450 (vect_build_slp_tree): When caching a failed SLP build attempt
21451 assert that at least one lane is marked as not matching.
21453 2021-06-21 liuhongt <hongtao.liu@intel.com>
21456 * config/i386/i386.md: (*anddi_1): Disparage slightly the mask
21457 register alternative.
21458 (*and<mode>_1): Ditto.
21460 (*andn<mode>_1): Ditto.
21461 (*<code><mode>_1): Ditto.
21462 (*<code>qi_1): Ditto.
21463 (*one_cmpl<mode>2_1): Ditto.
21464 (*one_cmplsi2_1_zext): Ditto.
21465 (*one_cmplqi2_1): Ditto.
21466 * config/i386/i386.c (x86_order_regs_for_local_alloc): Change
21467 the order of mask registers to be before general registers.
21469 2021-06-21 Roger Sayle <roger@nextmovesoftware.com>
21472 * config/i386/i386.md: New define_peephole2s to shrink writing
21473 1, 2 or 4 consecutive zeros to memory when optimizing for size.
21475 2021-06-18 Jeff Law <jeffreyalaw@gmail.com>
21477 * config/h8300/h8300.c (h8300_select_cc_mode): Handle SYMBOL_REF.
21478 * config/h8300/logical.md (<code><mode>3 logcial expander): Generate
21479 more efficient code when the source can be trivially simplified.
21481 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
21483 * gimple-range-cache.cc (ranger_cache::range_of_def): Calculate
21484 a range if global is not available.
21485 (ranger_cache::entry_range): Fallback to range_of_def.
21486 * gimple-range-cache.h (range_of_def): Adjust prototype.
21488 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
21490 PR tree-optimization/101014
21491 * gimple-range-cache.cc (ranger_cache::ranger_cache): Remove poor
21493 (ranger_cache::~ranger_cache): Ditto.
21494 (ranger_cache::enable_new_values): Delete.
21495 (ranger_cache::push_poor_value): Delete.
21496 (ranger_cache::range_of_def): Remove poor value processing.
21497 (ranger_cache::entry_range): Ditto.
21498 (ranger_cache::fill_block_cache): Ditto.
21499 * gimple-range-cache.h (class ranger_cache): Remove poor value members.
21500 * gimple-range.cc (gimple_ranger::range_of_expr): Remove call.
21501 * gimple-range.h (class gimple_ranger): Adjust.
21503 2021-06-18 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
21506 * common/config/arm/arm-common.c (arm_canon_arch_option_1): New function
21507 derived from arm_canon_arch.
21508 (arm_canon_arch_option): Call it.
21509 (arm_canon_arch_multilib_option): New function.
21510 * config/arm/arm-cpus.in (IGNORE_FOR_MULTILIB): New fgroup.
21511 * config/arm/arm.h (arm_canon_arch_multilib_option): New prototype.
21512 (CANON_ARCH_MULTILIB_SPEC_FUNCTION): New macro.
21513 (MULTILIB_ARCH_CANONICAL_SPECS): New macro.
21514 (DRIVER_SELF_SPECS): Add MULTILIB_ARCH_CANONICAL_SPECS.
21515 * config/arm/arm.opt (mlibarch): New option.
21516 * config/arm/t-rmprofile (MULTILIB_MATCHES): For armv8*-m, replace use
21517 of march on RHS with mlibarch.
21519 2021-06-18 Marcel Vollweiler <marcel@codesourcery.com>
21521 * config.in: Regenerate.
21522 * config/gcn/gcn.c (print_operand_address): Fix for global_load assembler
21524 * configure: Regenerate.
21525 * configure.ac: Fix for global_load assembler functions.
21527 2021-06-18 Richard Biener <rguenther@suse.de>
21529 PR tree-optimization/101112
21530 * tree-vect-slp.c (vect_slp_linearize_chain): Fix condition
21531 to lookup a pattern stmt def.
21533 2021-06-18 Jakub Jelinek <jakub@redhat.com>
21535 PR middle-end/101062
21536 * stor-layout.c (finish_bitfield_layout): Don't add bitfield
21537 representatives in QUAL_UNION_TYPE.
21539 2021-06-18 Andrew Pinski <apinski@marvell.com>
21541 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
21542 Add counting of how many times it is done.
21543 (factor_out_conditional_conversion): Likewise.
21544 (match_simplify_replacement): Likewise.
21545 (value_replacement): Likewise.
21546 (spaceship_replacement): Likewise.
21547 (cond_store_replacement): Likewise.
21548 (cond_if_else_store_replacement_1): Likewise.
21549 (hoist_adjacent_loads): Likewise.
21551 2021-06-18 Andrew Pinski <apinski@marvell.com>
21553 * tree-cfg.c (verify_gimple_assign_unary): Reject point and offset
21554 types on NEGATE_EXPR, ABS_EXPR, BIT_NOT_EXPR, PAREN_EXPR and CNONJ_EXPR.
21555 (verify_gimple_assign_binary): Reject point and offset types on
21556 MULT_EXPR, MULT_HIGHPART_EXPR, TRUNC_DIV_EXPR, CEIL_DIV_EXPR,
21557 FLOOR_DIV_EXPR, ROUND_DIV_EXPR, TRUNC_MOD_EXPR, CEIL_MOD_EXPR,
21558 FLOOR_MOD_EXPR, ROUND_MOD_EXPR, RDIV_EXPR, and EXACT_DIV_EXPR.
21560 2021-06-18 Michael Meissner <meissner@linux.ibm.com>
21562 * config/rs6000/rs6000.c (rs6000_emit_minmax): Add support for ISA
21563 3.1 IEEE 128-bit floating point xsmaxcqp/xsmincqp instructions.
21564 * config/rs6000/rs6000.md (s<minmax><mode>3, IEEE128 iterator):
21567 2021-06-17 Aaron Sawdey <acsawdey@linux.ibm.com>
21569 * config/rs6000/genfusion.pl (gen_logical_addsubf): Add
21570 earlyclobber to alts 0/1.
21571 (gen_addadd): Add earlyclobber to alts 0/1.
21572 * config/rs6000/fusion.md: Regenerate file.
21574 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21576 * cfgloopanal.c (get_loop_hot_path): Make path an auto_vec.
21578 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21580 * gimple-range-cache.cc: Comment cleanups.
21581 * gimple-range-gori.cc: Comment cleanups.
21582 * gimple-range.cc: Comment/spacing cleanups
21583 * value-range.h: Comment cleanups.
21585 2021-06-17 H.J. Lu <hjl.tools@gmail.com>
21588 * calls.c (expand_call): Replace PUSH_ARGS with
21589 targetm.calls.push_argument (0).
21590 (emit_library_call_value_1): Likewise.
21591 * defaults.h (PUSH_ARGS): Removed.
21592 (PUSH_ARGS_REVERSED): Replace PUSH_ARGS with
21593 targetm.calls.push_argument (0).
21594 * expr.c (block_move_libcall_safe_for_call_parm): Likewise.
21595 (emit_push_insn): Pass the number bytes to push to
21596 targetm.calls.push_argument and pass 0 if ARGS_ADDR is 0.
21597 * hooks.c (hook_bool_uint_true): New.
21598 * hooks.h (hook_bool_uint_true): Likewise.
21599 * rtlanal.c (nonzero_bits1): Replace PUSH_ARGS with
21600 targetm.calls.push_argument (0).
21601 * target.def (push_argument): Add a targetm.calls hook.
21602 * targhooks.c (default_push_argument): New.
21603 * targhooks.h (default_push_argument): Likewise.
21604 * config/bpf/bpf.h (PUSH_ARGS): Removed.
21605 * config/cr16/cr16.c (TARGET_PUSH_ARGUMENT): New.
21606 * config/cr16/cr16.h (PUSH_ARGS): Removed.
21607 * config/i386/i386.c (ix86_push_argument): New.
21608 (TARGET_PUSH_ARGUMENT): Likewise.
21609 * config/i386/i386.h (PUSH_ARGS): Removed.
21610 * config/m32c/m32c.c (TARGET_PUSH_ARGUMENT): New.
21611 * config/m32c/m32c.h (PUSH_ARGS): Removed.
21612 * config/nios2/nios2.h (PUSH_ARGS): Likewise.
21613 * config/pru/pru.h (PUSH_ARGS): Likewise.
21614 * doc/tm.texi.in: Remove PUSH_ARGS documentation. Add
21615 TARGET_PUSH_ARGUMENT hook.
21616 * doc/tm.texi: Regenerated.
21618 2021-06-17 Uroš Bizjak <ubizjak@gmail.com>
21621 * config/i386/i386-expand.c (expand_vector_set_var):
21622 Handle V2FS mode remapping. Pass TARGET_MMX_WITH_SSE to
21623 ix86_expand_vector_init_duplicate.
21624 (ix86_expand_vector_init_duplicate): Emit insv_1 for
21625 QImode for !TARGET_PARTIAL_REG_STALL.
21626 * config/i386/predicates.md (vec_setm_mmx_operand): New predicate.
21627 * config/i386/mmx.md (vec_setv2sf): Use vec_setm_mmx_operand
21628 as operand 2 predicate. Call ix86_expand_vector_set_var
21629 for non-constant index operand.
21630 (vec_setv2si): Ditto.
21631 (vec_setv4hi): Ditto.
21632 (vec_setv8qi): ditto.
21634 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
21636 PR tree-optimization/100790
21637 * gimple-range.cc (range_of_builtin_call): Cleanup clz and ctz
21640 2021-06-17 Martin Liska <mliska@suse.cz>
21642 * doc/invoke.texi: Use consistently -O1 instead of -O.
21644 2021-06-17 Martin Liska <mliska@suse.cz>
21646 * gcov-io.h: Update documentation entry about string format.
21648 2021-06-17 Marius Hillenbrand <mhillen@linux.ibm.com>
21651 * config/s390/vecintrin.h (vec_doublee): Fix to use
21652 __builtin_s390_vflls.
21653 (vec_floate): Fix to use __builtin_s390_vflrd.
21655 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21657 * dominance.c (get_dominated_to_depth): Return auto_vec<basic_block>.
21658 * dominance.h (get_dominated_to_depth): Likewise.
21659 (get_all_dominated_blocks): Likewise.
21660 * cfgcleanup.c (delete_unreachable_blocks): Adjust.
21661 * gcse.c (hoist_code): Likewise.
21662 * tree-cfg.c (remove_edge_and_dominated_blocks): Likewise.
21663 * tree-parloops.c (oacc_entry_exit_ok): Likewise.
21664 * tree-ssa-dce.c (eliminate_unnecessary_stmts): Likewise.
21665 * tree-ssa-phiprop.c (pass_phiprop::execute): Likewise.
21667 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21669 * dominance.c (get_dominated_by_region): Return auto_vec<basic_block>.
21670 * dominance.h (get_dominated_by_region): Likewise.
21671 * tree-cfg.c (gimple_duplicate_sese_region): Adjust.
21672 (gimple_duplicate_sese_tail): Likewise.
21673 (move_sese_region_to_fn): Likewise.
21675 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21677 * dominance.c (get_dominated_by): Return auto_vec<basic_block>.
21678 * dominance.h (get_dominated_by): Likewise.
21679 * auto-profile.c (afdo_find_equiv_class): Adjust.
21680 * cfgloopmanip.c (duplicate_loop_to_header_edge): Likewise.
21681 * loop-unroll.c (unroll_loop_runtime_iterations): Likewise.
21682 * tree-cfg.c (test_linear_chain): Likewise.
21683 (test_diamond): Likewise.
21685 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21687 * cfgloop.h (get_loop_hot_path): Return auto_vec<basic_block>.
21688 * cfgloopanal.c (get_loop_hot_path): Likewise.
21689 * tree-ssa-loop-ivcanon.c (tree_estimate_loop_size): Likewise.
21691 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21693 * cgraph.c (cgraph_node::collect_callers): Return
21694 auto_vec<cgraph_edge *>.
21695 * cgraph.h (cgraph_node::collect_callers): Likewise.
21696 * ipa-cp.c (create_specialized_node): Adjust.
21697 (decide_about_value): Likewise.
21698 (decide_whether_version_node): Likewise.
21699 * ipa-sra.c (process_isra_node_results): Likewise.
21701 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21703 * vec.h (vl_ptr>::using_auto_storage): Handle null m_vec.
21704 (auto_vec<T, 0>::auto_vec): Define move constructor, and delete copy
21706 (auto_vec<T, 0>::operator=): Define move assignment and delete copy
21709 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
21711 * gimple-range.cc (debug_seed_ranger): New.
21712 (dump_ranger): New.
21713 (debug_ranger): New.
21715 2021-06-17 Richard Biener <rguenther@suse.de>
21717 PR tree-optimization/54400
21718 * tree-vectorizer.h (enum slp_instance_kind): Add
21719 slp_inst_kind_bb_reduc.
21720 (reduction_fn_for_scalar_code): Declare.
21721 * tree-vect-data-refs.c (vect_slp_analyze_instance_dependence):
21722 Check SLP_INSTANCE_KIND instead of looking at the
21724 (vect_slp_analyze_instance_alignment): Likewise.
21725 * tree-vect-loop.c (reduction_fn_for_scalar_code): Export.
21726 * tree-vect-slp.c (vect_slp_linearize_chain): Split out
21727 chain linearization from vect_build_slp_tree_2 and generalize
21728 for the use of BB reduction vectorization.
21729 (vect_build_slp_tree_2): Adjust accordingly.
21730 (vect_optimize_slp): Elide permutes at the root of BB reduction
21732 (vectorizable_bb_reduc_epilogue): New function.
21733 (vect_slp_prune_covered_roots): Likewise.
21734 (vect_slp_analyze_operations): Use them.
21735 (vect_slp_check_for_constructors): Recognize associatable
21736 chains for BB reduction vectorization.
21737 (vectorize_slp_instance_root_stmt): Generate code for the
21738 BB reduction epilogue.
21740 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21742 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Check with
21744 (gori_compute::may_recompute_p): New.
21745 (gori_compute::outgoing_edge_range_p): Perform recomputations.
21746 * gimple-range-gori.h (class gori_compute): Add prototype.
21748 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21750 * gimple-range-cache.cc (ranger_cache::range_on_edge): Always return
21751 true when a range can be calculated.
21752 * gimple-range.cc (gimple_ranger::dump_bb): Check has_edge_range_p.
21754 2021-06-16 Martin Sebor <msebor@redhat.com>
21756 * doc/invoke.texi (-Wmismatched-dealloc, -Wmismatched-new-delete):
21757 Correct documented defaults.
21759 2021-06-16 Andrew MacLeod <amacleod@redhat.com>
21761 * gimple-range-cache.cc (ranger_cache::ranger_cache): Initialize
21762 m_new_value_p directly.
21764 2021-06-16 Uroš Bizjak <ubizjak@gmail.com>
21767 * config/i386/i386-expand.c (expand_vec_perm_2perm_pblendv):
21768 Handle 64bit modes for TARGET_SSE4_1.
21769 (expand_vec_perm_pshufb2): Handle 64bit modes for TARGET_SSSE3.
21770 (expand_vec_perm_even_odd_pack): Handle V4HI mode.
21771 (expand_vec_perm_even_odd_1) <case E_V4HImode>: Expand via
21772 expand_vec_perm_pshufb2 for TARGET_SSSE3 and via
21773 expand_vec_perm_even_odd_pack for TARGET_SSE4_1.
21774 * config/i386/mmx.md (mmx_packusdw): New insn pattern.
21776 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21778 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn<mode>):
21779 Change to an expander that emits the correct instruction
21780 depending on endianness.
21781 (aarch64_<sur><addsub>hn<mode>_insn_le): Define.
21782 (aarch64_<sur><addsub>hn<mode>_insn_be): Define.
21784 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21786 * config/aarch64/aarch64-simd-builtins.def: Split generator
21787 for aarch64_<su>qmovn builtins into scalar and vector
21789 * config/aarch64/aarch64-simd.md (aarch64_<su>qmovn<mode>_insn_le):
21791 (aarch64_<su>qmovn<mode>_insn_be): Define.
21792 (aarch64_<su>qmovn<mode>): Split into scalar and vector
21793 variants. Change vector variant to an expander that emits the
21794 correct instruction depending on endianness.
21796 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21798 * config/aarch64/aarch64-simd-builtins.def: Split generator
21799 for aarch64_sqmovun builtins into scalar and vector variants.
21800 * config/aarch64/aarch64-simd.md (aarch64_sqmovun<mode>):
21801 Split into scalar and vector variants. Change vector variant
21802 to an expander that emits the correct instruction depending
21804 (aarch64_sqmovun<mode>_insn_le): Define.
21805 (aarch64_sqmovun<mode>_insn_be): Define.
21807 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21809 * config/aarch64/aarch64-simd.md (aarch64_xtn<mode>_insn_le):
21810 Define - modeling zero-high-half semantics.
21811 (aarch64_xtn<mode>): Change to an expander that emits the
21812 appropriate instruction depending on endianness.
21813 (aarch64_xtn<mode>_insn_be): Define - modeling zero-high-half
21815 (aarch64_xtn2<mode>_le): Rename to...
21816 (aarch64_xtn2<mode>_insn_le): This.
21817 (aarch64_xtn2<mode>_be): Rename to...
21818 (aarch64_xtn2<mode>_insn_be): This.
21819 (vec_pack_trunc_<mode>): Emit truncation instruction instead
21821 * config/aarch64/iterators.md (Vnarrowd): Add Vnarrowd mode
21822 attribute iterator.
21824 2021-06-16 Martin Jambor <mjambor@suse.cz>
21826 PR tree-optimization/100453
21827 * tree-sra.c (create_access): Disqualify any const candidates
21828 which are written to.
21829 (sra_modify_expr): Do not store sub-replacements back to a const base.
21830 (handle_unscalarized_data_in_subtree): Likewise.
21831 (sra_modify_assign): Likewise. Earlier, use TREE_READONLy test
21832 instead of constant_decl_p.
21834 2021-06-16 Jakub Jelinek <jakub@redhat.com>
21836 PR middle-end/101062
21837 * stor-layout.c (finish_bitfield_representative): For fields in unions
21838 assume nextf is always NULL.
21839 (finish_bitfield_layout): Compute bit field representatives also in
21840 unions, but handle it as if each bitfield was the only field in the
21843 2021-06-16 Richard Biener <rguenther@suse.de>
21845 PR tree-optimization/101088
21846 * tree-ssa-loop-im.c (sm_seq_valid_bb): Only look for
21847 supported refs on edges. Do not assert same ref but
21848 different kind stores are unsuported but mark them so.
21849 (hoist_memory_references): Only look for supported refs
21852 2021-06-16 Roger Sayle <roger@nextmovesoftware.com>
21854 PR rtl-optimization/46235
21855 * config/i386/i386.md: New define_split for bt followed by cmov.
21856 (*bt<mode>_setcqi): New define_insn_and_split for bt followed by setc.
21857 (*bt<mode>_setncqi): New define_insn_and_split for bt then setnc.
21858 (*bt<mode>_setnc<mode>): New define_insn_and_split for bt followed
21859 by setnc with zero extension.
21861 2021-06-16 Richard Biener <rguenther@suse.de>
21863 PR tree-optimization/101083
21864 * tree-vect-slp.c (vect_slp_build_two_operator_nodes): Get
21865 vectype as argument.
21866 (vect_build_slp_tree_2): Adjust.
21868 2021-06-15 Martin Sebor <msebor@redhat.com>
21870 PR middle-end/100876
21871 * builtins.c: (gimple_call_return_array): Account for size_t
21872 mangling as either unsigned int or unsigned long
21874 2021-06-15 Jeff Law <jeffreyalaw@gmail.com>
21876 * compare-elim.c (try_eliminate_compare): Run DCE to clean things
21877 up before eliminating comparisons.
21879 2021-06-15 Aldy Hernandez <aldyh@redhat.com>
21881 * range-op.cc (operator_bitwise_or::wi_fold): Make sure
21882 nonzero|X is nonzero.
21883 (range_op_bitwise_and_tests): Add tests for above.
21885 2021-06-15 Carl Love <cel@us.ibm.com>
21888 * config/rs6000/rs6000-builtin.def (VCMPEQUT): Fix the ICODE for the
21890 (VRLQ, VSLQ, VSRQ, VSRAQ): Remove unused BU_P10_OVERLOAD_2
21893 2021-06-15 Tobias Burnus <tobias@codesourcery.com>
21896 * gimplify.c (enum gimplify_defaultmap_kind): Add GDMK_SCALAR_TARGET.
21897 (struct gimplify_omp_ctx): Extend defaultmap array by one.
21898 (new_omp_context): Init defaultmap[GDMK_SCALAR_TARGET].
21899 (omp_notice_variable): Update type classification for Fortran.
21900 (gimplify_scan_omp_clauses): Update calls for new argument; handle
21901 GDMK_SCALAR_TARGET; for Fortran, GDMK_POINTER avoid GOVD_MAP_0LEN_ARRAY.
21902 * langhooks-def.h (lhd_omp_scalar_p): Add 'ptr_ok' argument.
21903 * langhooks.c (lhd_omp_scalar_p): Likewise.
21904 (LANG_HOOKS_OMP_ALLOCATABLE_P, LANG_HOOKS_OMP_SCALAR_TARGET_P): New.
21905 (LANG_HOOKS_DECLS): Add them.
21906 * langhooks.h (struct lang_hooks_for_decls): Add new hooks, update
21907 omp_scalar_p pointer type to include the new bool argument.
21909 2021-06-15 David Malcolm <dmalcolm@redhat.com>
21911 * doc/analyzer.texi
21912 (Special Functions for Debugging the Analyzer): Add
21913 __analyzer_dump_capacity.
21915 2021-06-15 Jakub Jelinek <jakub@redhat.com>
21918 * expr.c (expand_expr_real_2) <case VEC_PACK_FIX_TRUNC_EXPR,
21919 case VEC_PACK_TRUNC_EXPR>: Clear subtarget when changing mode.
21921 2021-06-15 Richard Biener <rguenther@suse.de>
21923 * cfgloopanal.c (mark_irreducible_loops): Use a dominance
21924 check to identify loop latches.
21925 * cfgloop.c (verify_loop_structure): Likewise.
21926 * loop-init.c (apply_loop_flags): Allow marked irreducible
21927 regions even with multiple latches.
21928 * predict.c (rebuild_frequencies): Simplify.
21930 2021-06-15 Richard Biener <rguenther@suse.de>
21932 * tree-ssa-threadupdate.c
21933 (jump_thread_path_registry::mark_threaded_blocks): Assert we
21934 have marked irreducible regions.
21936 2021-06-14 Martin Sebor <msebor@redhat.com>
21939 * builtins.c (gimple_call_return_array): Check for attribute fn spec.
21940 Handle calls to placement new.
21941 (ndecl_dealloc_argno): Avoid placement delete.
21943 2021-06-14 Peter Bergner <bergner@linux.ibm.com>
21946 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Use
21947 create_tmp_reg_or_ssa_name().
21949 2021-06-14 Andrew MacLeod <amacleod@redhat.com>
21951 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
21952 (ranger_cache::enable_new_values): Set to specified value and
21953 return the old value.
21954 (ranger_cache::disable_new_values): Delete.
21955 (ranger_cache::fill_block_cache): Disable non 1st order derived
21957 * gimple-range-cache.h (ranger_cache): Adjust prototypes.
21958 * gimple-range.cc (gimple_ranger::range_of_expr): Adjust.
21960 2021-06-14 Uroš Bizjak <ubizjak@gmail.com>
21963 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const):
21964 Return true early when testing with V2HImode.
21965 * config/i386/mmx.md (*punpckwd): Split to sse2_pshuflw_1.
21967 2021-06-14 Christophe Lyon <christophe.lyon@linaro.org>
21969 * config/arm/mve.md (mve_vec_unpack<US>_lo_<mode>): New pattern.
21970 (mve_vec_unpack<US>_hi_<mode>): New pattern.
21971 (@mve_vec_pack_trunc_lo_<mode>): New pattern.
21972 (mve_vmovntq_<supf><mode>): Prefix with '@'.
21973 * config/arm/neon.md (vec_unpack<US>_hi_<mode>): Move to
21975 (vec_unpack<US>_lo_<mode>): Likewise.
21976 (vec_pack_trunc_<mode>): Rename to
21977 neon_quad_vec_pack_trunc_<mode>.
21978 * config/arm/vec-common.md (vec_unpack<US>_hi_<mode>): New
21980 (vec_unpack<US>_lo_<mode>): New.
21981 (vec_pack_trunc_<mode>): New.
21983 2021-06-14 Richard Biener <rguenther@suse.de>
21985 PR tree-optimization/100934
21986 * tree-ssa-dom.c (pass_dominator::execute): Properly
21987 mark irreducible regions.
21989 2021-06-14 Martin Liska <mliska@suse.cz>
21991 * doc/invoke.texi: Put r{...} on the same line as @item.
21993 2021-06-14 Martin Liska <mliska@suse.cz>
21995 * doc/invoke.texi: Add missing newline.
21997 2021-06-14 Martin Liska <mliska@suse.cz>
21999 * doc/invoke.texi: Remove '+' charasters.
22001 2021-06-14 Claudiu Zissulescu <claziss@synopsys.com>
22003 * config.gcc (arc): Add support for with_cpu option.
22004 * config/arc/arc.h (OPTION_DEFAULT_SPECS): Add fpu.
22006 2021-06-14 Richard Biener <rguenther@suse.de>
22008 PR tree-optimization/101031
22009 * tree-ssa-strlen.c (maybe_invalidate): Increment max_size
22010 instead of size when accounting for a possibly string
22013 2021-06-14 Martin Liska <mliska@suse.cz>
22015 * gimple-ssa-evrp.c (pointer_equiv_analyzer::~pointer_equiv_analyzer): Use delete[].
22017 2021-06-14 Aldy Hernandez <aldyh@redhat.com>
22019 * value-query.cc (gimple_range_global): Call get_range_global
22020 if called after inlining.
22022 2021-06-13 Uroš Bizjak <ubizjak@gmail.com>
22025 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
22026 Emit constant permutation insn directly from here.
22028 2021-06-13 Trevor Saunders <tbsaunde@tbsaunde.org>
22030 * attribs.c (find_attribute_namespace): Iterate over vec<> with
22032 * auto-profile.c (afdo_find_equiv_class): Likewise.
22033 * gcc.c (do_specs_vec): Likewise.
22034 (do_spec_1): Likewise.
22035 (driver::set_up_specs): Likewise.
22036 * gimple-loop-jam.c (any_access_function_variant_p): Likewise.
22037 * gimple-ssa-store-merging.c (compatible_load_p): Likewise.
22038 (imm_store_chain_info::try_coalesce_bswap): Likewise.
22039 (imm_store_chain_info::coalesce_immediate_stores): Likewise.
22040 (get_location_for_stmts): Likewise.
22041 * graphite-poly.c (print_iteration_domains): Likewise.
22042 (free_poly_bb): Likewise.
22043 (remove_gbbs_in_scop): Likewise.
22044 (free_scop): Likewise.
22045 (dump_gbb_cases): Likewise.
22046 (dump_gbb_conditions): Likewise.
22047 (print_pdrs): Likewise.
22048 (print_scop): Likewise.
22049 * ifcvt.c (cond_move_process_if_block): Likewise.
22050 * lower-subreg.c (decompose_multiword_subregs): Likewise.
22051 * regcprop.c (pass_cprop_hardreg::execute): Likewise.
22052 * sanopt.c (sanitize_rewrite_addressable_params): Likewise.
22053 * sel-sched-dump.c (dump_insn_vector): Likewise.
22054 * store-motion.c (store_ops_ok): Likewise.
22055 (store_killed_in_insn): Likewise.
22056 * timevar.c (timer::named_items::print): Likewise.
22057 * tree-cfgcleanup.c (cleanup_control_flow_pre): Likewise.
22058 (cleanup_tree_cfg_noloop): Likewise.
22059 * tree-data-ref.c (dump_data_references): Likewise.
22060 (print_dir_vectors): Likewise.
22061 (print_dist_vectors): Likewise.
22062 (dump_data_dependence_relations): Likewise.
22063 (dump_dist_dir_vectors): Likewise.
22064 (dump_ddrs): Likewise.
22065 (create_runtime_alias_checks): Likewise.
22066 (free_subscripts): Likewise.
22067 (save_dist_v): Likewise.
22068 (save_dir_v): Likewise.
22069 (invariant_access_functions): Likewise.
22070 (same_access_functions): Likewise.
22071 (access_functions_are_affine_or_constant_p): Likewise.
22072 (find_data_references_in_stmt): Likewise.
22073 (graphite_find_data_references_in_stmt): Likewise.
22074 (free_dependence_relations): Likewise.
22075 (free_data_refs): Likewise.
22076 * tree-inline.c (copy_debug_stmts): Likewise.
22077 * tree-into-ssa.c (dump_currdefs): Likewise.
22078 (rewrite_update_phi_arguments): Likewise.
22079 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
22080 * tree-vect-data-refs.c (vect_analyze_possibly_independent_ddr):
22082 (vect_slp_analyze_node_dependences): Likewise.
22083 (vect_slp_analyze_instance_dependence): Likewise.
22084 (vect_record_base_alignments): Likewise.
22085 (vect_get_peeling_costs_all_drs): Likewise.
22086 (vect_peeling_supportable): Likewise.
22087 * tree-vectorizer.c (vec_info::~vec_info): Likewise.
22088 (vec_info::free_stmt_vec_infos): Likewise.
22090 2021-06-13 Jeff Law <jeffreyalaw@gmail.com>
22092 * config/h8300/logical.md (<code>qi3_1<cczn>): New pattern.
22093 (andqi3_1<cczn>): Removed.
22094 (<ors>qi3_1): Do not split for IOR/XOR a single bit.
22095 (H8/SX bit logicals): Split out from other patterns.
22096 * config/h8300/multiply.md (mulqihi3_const<cczn>): Renamed from
22097 mulqihi3_const_clobber_flags.
22098 (mulqihi3<cczn>, mulhisi3_const<cczn>, mulhisi3<cczn>): Similarly
22100 2021-06-13 H.J. Lu <hjl.tools@gmail.com>
22103 * config/i386/i386.c (ix86_expand_prologue): Set red_zone_used
22104 to true if red zone is used.
22105 (ix86_output_indirect_jmp): Replace ix86_red_zone_size with
22106 ix86_red_zone_used.
22107 * config/i386/i386.h (machine_function): Add red_zone_used.
22108 (ix86_red_zone_size): Removed.
22109 (ix86_red_zone_used): New.
22110 * config/i386/i386.md (peephole2 patterns): Replace
22111 ix86_red_zone_size with ix86_red_zone_used.
22113 2021-06-12 Jason Merrill <jason@redhat.com>
22115 * doc/extend.texi (unused variable attribute): Applies to
22116 structure fields as well.
22118 2021-06-12 Eugene Rozenfeld <erozen@microsoft.com>
22120 * auto-profile.c (read_profile): fix a typo in an error string
22122 2021-06-11 Thomas Schwinge <thomas@codesourcery.com>
22124 * tree-pretty-print.h (dump_omp_clauses): Add 'bool = true'
22126 * tree-pretty-print.c (dump_omp_clauses): Update.
22127 (dump_generic_node) <OMP_CLAUSE>: Use it.
22129 2021-06-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
22132 * config/arm/arm_mve.h (__arm_vld1q): Change __ARM_mve_coerce(p0,
22133 int8_t const *) to __ARM_mve_coerce1(p0, int8_t *) in the argument for
22134 the polymorphic variants matching code.
22135 (__arm_vld1q_z): Likewise.
22136 (__arm_vld2q): Likewise.
22137 (__arm_vld4q): Likewise.
22138 (__arm_vldrbq_gather_offset): Likewise.
22139 (__arm_vldrbq_gather_offset_z): Likewise.
22141 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
22143 PR tree-optimization/96392
22144 * fold-const.h (tree_expr_maybe_real_minus_zero_p): Fix prototype.
22146 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
22148 PR tree-optimization/96392
22149 * fold-const.c (fold_real_zero_addition_p): Take both arguments
22150 of the addition or subtraction, not just the zero. Use this
22151 other argument in tests for signaling NaNs and signed zeros.
22152 (tree_expr_maybe_real_minus_zero_p): New predicate.
22153 * fold-const.h (fold_real_zero_addition_p): Update prototype.
22154 (tree_expr_maybe_real_minus_zero_p): New function prototype.
22155 * match.pd: Update calls to fold_real_zero_addition_p.
22156 Replace HONOR_NANS with tree_expr_maybe_nan_p.
22157 Replace HONOR_SIGNED_ZEROS with tree_expr_maybe_real_minus_zero_p.
22158 Replace HONOR_SNANS with tree_expr_maybe_signaling_nan_p.
22159 * tree-ssa-reassoc.c (eliminate_using_constants): Update
22160 call to fold_real_zero_addition_p.
22162 2021-06-11 Richard Biener <rguenther@suse.de>
22164 PR tree-optimization/101025
22165 * tree-ssa-loop-im.c (sm_seq_valid_bb): Make sure to process
22166 all refs that require dependence checking.
22168 2021-06-11 Richard Biener <rguenther@suse.de>
22170 PR tree-optimization/101028
22171 * tree-vect-slp.c (vect_build_slp_tree_2): When SLP
22172 reassoc discovery fails fatally, mark appropriate lanes
22175 2021-06-11 Richard Biener <rguenther@suse.de>
22177 PR tree-optimization/101026
22178 * tree-vect-slp.c (vect_build_slp_tree_2): Make sure we
22179 have a representative for the associated chain nodes.
22181 2021-06-11 Jakub Jelinek <jakub@redhat.com>
22183 PR rtl-optimization/101008
22184 * simplify-rtx.c (relational_result): New function.
22185 (simplify_logical_relational_operation,
22186 simplify_relational_operation): Use it.
22188 2021-06-11 Jakub Jelinek <jakub@redhat.com>
22191 * config/i386/sse.md (*vec_concat<mode>_0_1): Require TARGET_SSE2.
22193 2021-06-11 Uroš Bizjak <ubizjak@gmail.com>
22196 * config/i386/i386-expand.c (expand_vec_perm_pshufb): Return
22197 false if the permutation can be implemented with constant
22198 permutation instruction in wider mode.
22199 (canonicalize_vector_int_perm): Move above expand_vec_perm_pshufb.
22200 Handle V8QImode and V4HImode.
22202 2021-06-11 Martin Liska <mliska@suse.cz>
22204 PR gcov-profile/100788
22205 * common.opt: Add new option.
22206 * coverage.c (coverage_begin_function): Emit warning instead on
22207 the internal compiler error.
22208 * doc/invoke.texi: Document the option.
22209 * toplev.c (process_options): Enable it by default.
22211 2021-06-11 Richard Biener <rguenther@suse.de>
22213 PR middle-end/101009
22214 * tree-data-ref.c (build_classic_dist_vector_1): Make sure
22215 to set *init_b to true when we encounter a constant equal
22217 (compute_affine_dependence): Also dump the actual DR_REF.
22219 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
22221 PR tree-optimization/100984
22222 * gimple-ssa-evrp.c (ssa_equiv_stack): Use auto_vec for
22223 replacements table.
22224 (ssa_equiv_stack::~ssa_equiv_stack): Remove.
22226 2021-06-11 Kewen Lin <linkw@linux.ibm.com>
22228 * config/rs6000/rs6000.md
22229 (floatsi<SFDF:mode>2_lfiwax_<QHI:mode>_mem_zext): New
22230 define_insn_and_split.
22232 2021-06-11 Richard Biener <rguenther@suse.de>
22234 * tree-vect-slp.c (vect_build_slp_tree_2): Use stablesort
22235 to sort operands of the associative chain.
22237 2021-06-11 Richard Biener <rguenther@suse.de>
22239 * system.h (gcc_stablesort_r): Declare.
22240 * sort.cc (gcc_sort_r): Support stable sort.
22241 (gcc_stablesort_r): Define.
22242 * vec.h (vec<>::stablesort): Add.
22244 2021-06-10 Uroš Bizjak <ubizjak@gmail.com>
22247 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
22248 Handle V2SF mode. Emit SHUFPS to fixup unpack-high for V2SF mode.
22249 (expand_vec_perm_blend): Handle 64bit modes for TARGET_SSE4_1.
22250 (expand_vec_perm_pshufb): Handle 64bit modes for TARGET_SSSE3.
22251 (expand_vec_perm_pblendv): Handle 64bit modes for TARGET_SSE4_1.
22252 (expand_vec_perm_interleave2): Handle 64bit modes.
22253 (expand_vec_perm_even_odd_pack): Handle V8QI mode.
22254 (expand_vec_perm_even_odd_1): Ditto.
22255 (ix86_vectorize_vec_perm_const): Ditto.
22256 * config/i386/i386.md (UNSPEC_PSHUFB): Move from ...
22257 * config/i386/sse.md: ... here.
22258 * config/i386/mmx.md (*vec_interleave_lowv2sf):
22259 New insn_and_split pattern.
22260 (*vec_interleave_highv2sf): Ditto.
22261 (mmx_pshufbv8qi3): New insn pattern.
22262 (*mmx_pblendw): Ditto.
22264 2021-06-10 Peter Bergner <bergner@linux.ibm.com>
22266 * config/rs6000/rs6000-builtin.def (build_pair): New built-in.
22267 (build_acc): Likewise.
22268 * config/rs6000/rs6000-call.c (mma_expand_builtin): Swap assemble
22269 source operands in little-endian mode.
22270 (rs6000_gimple_fold_mma_builtin): Handle VSX_BUILTIN_BUILD_PAIR.
22271 (mma_init_builtins): Likewise.
22272 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Handle endianness
22273 ordering for the MMA assemble and build source operands.
22274 * doc/extend.texi (__builtin_vsx_build_acc, __builtin_mma_build_pair):
22276 (__builtin_mma_assemble_acc, __builtin_mma_assemble_pair): Remove
22279 2021-06-10 Jeff Law <jeffreyalaw@gmail.com>
22281 * config/h8300/h8300.c (select_cc_mode): Handle MEM. Use
22283 * config/h8300/extensions.md: Replace _clobber_flags patterns
22286 2021-06-10 Robin Dapp <rdapp@linux.ibm.com>
22288 * config/s390/vector.md (vcond_mask_<mode><mode>): Change to
22289 (vcond_mask_<mode><tointvec>): this.
22291 2021-06-10 Andrew Stubbs <ams@codesourcery.com>
22292 Thomas Schwinge <thomas@codesourcery.com>
22294 * omp-builtins.def (BUILT_IN_GOACC_ENTER_EXIT_DATA): Split into...
22295 (BUILT_IN_GOACC_ENTER_DATA, BUILT_IN_GOACC_EXIT_DATA): ... these.
22296 * gimple.h (enum gf_mask): Split
22297 'GF_OMP_TARGET_KIND_OACC_ENTER_EXIT_DATA' into
22298 'GF_OMP_TARGET_KIND_OACC_ENTER_DATA' and
22299 'GF_OMP_TARGET_KIND_OACC_EXIT_DATA'.
22300 (is_gimple_omp_oacc): Update.
22301 * gimple-pretty-print.c (dump_gimple_omp_target): Likewise.
22302 * gimplify.c (gimplify_omp_target_update): Likewise.
22303 * omp-expand.c (expand_omp_target, build_omp_regions_1)
22304 (omp_make_gimple_edges): Likewise.
22305 * omp-low.c (check_omp_nesting_restrictions, lower_omp_target):
22308 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
22310 * value-query.cc (value_query::value_on_edge): Rename name to
22312 (range_query::range_on_edge): Same.
22313 (range_query::value_of_expr): Same.
22314 (range_query::value_on_edge): Same.
22315 * value-query.h (class value_query): Same.
22316 (class range_query): Same.
22318 2021-06-10 Richard Biener <rguenther@suse.de>
22320 PR tree-optimization/101003
22321 * tree-vect-slp.c (vect_build_slp_tree_2): Appropriately
22322 use the pattern stmt defs when linearizing a chain.
22324 2021-06-10 Jakub Jelinek <jakub@redhat.com>
22327 * ifcvt.c (noce_get_alt_condition, noce_try_abs): Use
22328 prev_nonnote_nondebug_insn instead of prev_nonnote_insn.
22330 2021-06-10 Clement Chigot <clement.chigot@atos.net>
22332 * config/rs6000/aix71.h (ASM_CPU_SPEC): Add Power10 directive.
22333 * config/rs6000/aix72.h (ASM_CPU_SPEC): Likewise.
22335 2021-06-09 Andrew Pinski <apinski@marvell.com>
22337 PR tree-optimization/100925
22338 * match.pd (a ? CST1 : CST2): Limit transformations
22339 that would produce a negative to integeral types only.
22340 Change !POINTER_TYPE_P to INTEGRAL_TYPE_P also.
22342 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
22345 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
22347 * doc/tm.texi: Correctly update.
22349 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
22351 * doc/tm.texi: Correctly update.
22353 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
22356 * doc/tm.texi.in (Trampolines): Add a missing blank line.
22358 2021-06-09 Paul Eggert <eggert@cs.ucla.edu>
22361 * doc/invoke.texi (Code Gen Options); Document that -fno-trampolines
22362 and -ftrampolines work only with Ada.
22363 * doc/tm.texi.in (Trampolines): Likewise.
22364 * doc/tm.texi: Regenerated.
22366 2021-06-09 Carl Love <cel@us.ibm.com>
22368 * config/rs6000/altivec.h (vec_signextll, vec_signexti, vec_signextq):
22369 Add define for new builtins.
22370 * config/rs6000/altivec.md(altivec_vreveti2): Add define_expand.
22371 * config/rs6000/rs6000-builtin.def (VSIGNEXTI, VSIGNEXTLL): Add
22372 overloaded builtin definitions.
22373 (VSIGNEXTSB2W, VSIGNEXTSH2W, VSIGNEXTSB2D, VSIGNEXTSH2D,VSIGNEXTSW2D,
22374 VSIGNEXTSD2Q): Add builtin expansions.
22375 (SIGNEXT): Add P10 overload definition.
22376 * config/rs6000/rs6000-call.c (P9V_BUILTIN_VEC_VSIGNEXTI, P9V_BUILTIN_VEC_VSIGNEXTLL,
22377 P10_BUILTIN_VEC_SIGNEXT): Add overloaded argument definitions.
22378 * config/rs6000/vsx.md (vsx_sign_extend_v2di_v1ti): Add define_insn.
22379 (vsignextend_v2di_v1ti, vsignextend_qi_<mode>, vsignextend_hi_<mode>,
22380 vsignextend_si_v2di)[VIlong]: Add define_expand.
22381 Make define_insn vsx_sign_extend_si_v2di visible.
22382 * doc/extend.texi: Add documentation for the vec_signexti,
22383 vec_signextll builtins and vec_signextq.
22385 2021-06-09 Carl Love <cel@us.ibm.com>
22387 * config/rs6000/rs6000.c (__fixkfti, __fixunskfti, __floattikf,
22388 __floatuntikf): Names changed to __fixkfti_sw, __fixunskfti_sw,
22389 __floattikf_sw, __floatuntikf_sw respectively.
22390 * config/rs6000/rs6000.md (floatti<mode>2, floatunsti<mode>2,
22391 fix_trunc<mode>ti2, fixuns_trunc<mode>ti2): Add
22392 define_insn for mode IEEE 128.
22394 2021-06-09 Carl Love <cel@us.ibm.com>
22396 * config/rs6000/altivec.md (altivec_vslq, altivec_vsrq):
22397 Rename to altivec_vslq_<mode>, altivec_vsrq_<mode>, mode VEC_TI.
22398 * config/rs6000/vector.md (VEC_TI): Was named VSX_TI in vsx.md.
22399 (vashlv1ti3): Change to vashl<mode>3, mode VEC_TI.
22400 (vlshrv1ti3): Change to vlshr<mode>3, mode VEC_TI.
22401 * config/rs6000/vsx.md (VSX_TI): Remove define_mode_iterator. Update
22402 uses of VSX_TI to VEC_TI.
22404 2021-06-09 Carl Love <cel@us.ibm.com>
22406 * config/rs6000/dfp.md (floattitd2, fixtdti2): New define_insns.
22408 2021-06-09 Carl Love <cel@us.ibm.com>
22410 * config/rs6000/altivec.h (vec_dive, vec_mod): Add define for new
22412 * config/rs6000/altivec.md (UNSPEC_VMULEUD, UNSPEC_VMULESD,
22413 UNSPEC_VMULOUD, UNSPEC_VMULOSD): New unspecs.
22414 (altivec_eqv1ti, altivec_gtv1ti, altivec_gtuv1ti, altivec_vmuleud,
22415 altivec_vmuloud, altivec_vmulesd, altivec_vmulosd, altivec_vrlq,
22416 altivec_vrlqmi, altivec_vrlqmi_inst, altivec_vrlqnm,
22417 altivec_vrlqnm_inst, altivec_vslq, altivec_vsrq, altivec_vsraq,
22418 altivec_vcmpequt_p, altivec_vcmpgtst_p, altivec_vcmpgtut_p): New
22420 (vec_widen_umult_even_v2di, vec_widen_smult_even_v2di,
22421 vec_widen_umult_odd_v2di, vec_widen_smult_odd_v2di, altivec_vrlqmi,
22422 altivec_vrlqnm): New define_expands.
22423 * config/rs6000/rs6000-builtin.def (VCMPEQUT_P, VCMPGTST_P,
22424 VCMPGTUT_P): Add macro expansions.
22425 (BU_P10V_AV_P): Add builtin predicate definition.
22426 (VCMPGTUT, VCMPGTST, VCMPEQUT, CMPNET, CMPGE_1TI,
22427 CMPGE_U1TI, CMPLE_1TI, CMPLE_U1TI, VNOR_V1TI_UNS, VNOR_V1TI, VCMPNET_P,
22428 VCMPAET_P, VMULEUD, VMULESD, VMULOUD, VMULOSD, VRLQ,
22429 VSLQ, VSRQ, VSRAQ, VRLQNM, DIV_V1TI, UDIV_V1TI, DIVES_V1TI, DIVEU_V1TI,
22430 MODS_V1TI, MODU_V1TI, VRLQMI): New macro expansions.
22431 (VRLQ, VSLQ, VSRQ, VSRAQ, DIVE, MOD): New overload expansions.
22432 * config/rs6000/rs6000-call.c (P10_BUILTIN_VCMPEQUT,
22433 P10V_BUILTIN_CMPGE_1TI, P10V_BUILTIN_CMPGE_U1TI,
22434 P10V_BUILTIN_VCMPGTUT, P10V_BUILTIN_VCMPGTST,
22435 P10V_BUILTIN_CMPLE_1TI, P10V_BUILTIN_VCMPLE_U1TI,
22436 P10V_BUILTIN_DIV_V1TI, P10V_BUILTIN_UDIV_V1TI,
22437 P10V_BUILTIN_VMULESD, P10V_BUILTIN_VMULEUD,
22438 P10V_BUILTIN_VMULOSD, P10V_BUILTIN_VMULOUD,
22439 P10V_BUILTIN_VNOR_V1TI, P10V_BUILTIN_VNOR_V1TI_UNS,
22440 P10V_BUILTIN_VRLQ, P10V_BUILTIN_VRLQMI,
22441 P10V_BUILTIN_VRLQNM, P10V_BUILTIN_VSLQ,
22442 P10V_BUILTIN_VSRQ, P10V_BUILTIN_VSRAQ,
22443 P10V_BUILTIN_VCMPGTUT_P, P10V_BUILTIN_VCMPGTST_P,
22444 P10V_BUILTIN_VCMPEQUT_P, P10V_BUILTIN_VCMPGTUT_P,
22445 P10V_BUILTIN_VCMPGTST_P, P10V_BUILTIN_CMPNET,
22446 P10V_BUILTIN_VCMPNET_P, P10V_BUILTIN_VCMPAET_P,
22447 P10V_BUILTIN_DIVES_V1TI, P10V_BUILTIN_MODS_V1TI,
22448 P10V_BUILTIN_MODU_V1TI):
22449 New overloaded definitions.
22450 (rs6000_gimple_fold_builtin) [P10V_BUILTIN_VCMPEQUT,
22451 P10V_BUILTIN_CMPNET, P10V_BUILTIN_CMPGE_1TI,
22452 P10V_BUILTIN_CMPGE_U1TI, P10V_BUILTIN_VCMPGTUT,
22453 P10V_BUILTIN_VCMPGTST, P10V_BUILTIN_CMPLE_1TI,
22454 P10V_BUILTIN_CMPLE_U1TI]: New case statements.
22455 (rs6000_init_builtins) [bool_V1TI_type_node, int_ftype_int_v1ti_v1ti]:
22457 (altivec_init_builtins): New E_V1TImode case statement.
22458 (builtin_function_type)[P10_BUILTIN_128BIT_VMULEUD,
22459 P10_BUILTIN_128BIT_VMULOUD, P10_BUILTIN_128BIT_DIVEU_V1TI,
22460 P10_BUILTIN_128BIT_MODU_V1TI, P10_BUILTIN_CMPGE_U1TI,
22461 P10_BUILTIN_VCMPGTUT, P10_BUILTIN_VCMPEQUT]: New case statements.
22462 * config/rs6000/rs6000.c (rs6000_handle_altivec_attribute) [E_TImode,
22463 E_V1TImode]: New case statements.
22464 * config/rs6000/rs6000.h (rs6000_builtin_type_index): New enum
22465 value RS6000_BTI_bool_V1TI.
22466 * config/rs6000/vector.md (vector_gtv1ti,vector_nltv1ti,
22467 vector_gtuv1ti, vector_nltuv1ti, vector_ngtv1ti, vector_ngtuv1ti,
22468 vector_eq_v1ti_p, vector_ne_v1ti_p, vector_ae_v1ti_p,
22469 vector_gt_v1ti_p, vector_gtu_v1ti_p, vrotlv1ti3, vashlv1ti3,
22470 vlshrv1ti3, vashrv1ti3): New define_expands.
22471 * config/rs6000/vsx.md (UNSPEC_VSX_DIVSQ, UNSPEC_VSX_DIVUQ,
22472 UNSPEC_VSX_DIVESQ, UNSPEC_VSX_DIVEUQ, UNSPEC_VSX_MODSQ,
22473 UNSPEC_VSX_MODUQ): New unspecs.
22474 (mulv2di3, vsx_div_v1ti, vsx_udiv_v1ti, vsx_dives_v1ti,
22475 vsx_diveu_v1ti, vsx_mods_v1ti, vsx_modu_v1ti, xxswapd_v1ti): New
22477 (vcmpnet): New define_expand.
22478 * doc/extend.texi: Add documentation for the new builtins vec_rl,
22479 vec_rlmi, vec_rlnm, vec_sl, vec_sr, vec_sra, vec_mule, vec_mulo,
22480 vec_div, vec_dive, vec_mod, vec_cmpeq, vec_cmpne, vec_cmpgt, vec_cmplt,
22481 vec_cmpge, vec_cmple, vec_all_eq, vec_all_ne, vec_all_gt, vec_all_lt,
22482 vec_all_ge, vec_all_le, vec_any_eq, vec_any_ne, vec_any_gt, vec_any_lt,
22483 vec_any_ge, vec_any_le.
22485 2021-06-09 Carl Love <cel@us.ibm.com>
22487 * config/rs6000/altivec.md (altivec_vrl<VI_char>mi): Fix
22488 bug in argument generation.
22490 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
22492 * config/arm/iterators.md (<supf>): Remove VCLZQ_U, VCLZQ_S.
22494 * config/arm/mve.md (mve_vclzq_<supf><mode>): Add '@' prefix,
22495 remove <supf> iterator.
22496 (mve_vclzq_u<mode>): New.
22497 * config/arm/neon.md (clz<mode>2): Rename to neon_vclz<mode>.
22498 (neon_vclz<mode): Move to ...
22499 * config/arm/unspecs.md (VCLZQ_U, VCLZQ_S): Remove.
22500 * config/arm/vec-common.md: ... here. Add support for MVE.
22502 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
22504 * config/arm/mve.md (mve_vhaddq_<supf><mode>): Prefix with '@'.
22505 (@mve_vrhaddq_<supf><mode): Likewise.
22506 * config/arm/neon.md (neon_v<r>hadd<sup><mode>): Likewise.
22507 * config/arm/vec-common.md (avg<mode>3_floor, uavg<mode>3_floor)
22508 (avg<mode>3_ceil", uavg<mode>3_ceil): New patterns.
22510 2021-06-09 imba-tjd <109224573@qq.com>
22512 * doc/invoke.texi: Fix typo.
22514 2021-06-09 Roger Sayle <roger@nextmovesoftware.com>
22516 PR middle-end/53267
22517 * fold-const-call.c (fold_const_call_sss) [CASE_CFN_FMOD]:
22518 Support evaluation of fmod/fmodf/fmodl at compile-time.
22520 2021-06-09 Richard Biener <rguenther@suse.de>
22522 PR tree-optimization/100981
22523 * tree-vect-loop.c (vect_create_epilog_for_reduction): Use
22524 gimple_get_lhs to also handle calls.
22525 * tree-vect-slp-patterns.c (complex_pattern::build): Transfer
22528 2021-06-09 Richard Biener <rguenther@suse.de>
22530 PR tree-optimization/97832
22531 * tree-vectorizer.h (_slp_tree::failed): New.
22532 * tree-vect-slp.c (_slp_tree::_slp_tree): Initialize
22534 (_slp_tree::~_slp_tree): Free failed.
22535 (vect_build_slp_tree): Retain failed nodes and record
22536 matches in them, copying that back out when running
22537 into a cached fail. Dump start and end of discovery.
22538 (dt_sort_cmp): New.
22539 (vect_build_slp_tree_2): Handle associatable chains
22540 together doing more aggressive operand swapping.
22542 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
22545 * config.gcc (gcc_cv_initfini_array): Set to yes for Linux and
22547 * doc/install.texi: Require glibc 2.1 and binutils 2.12 for
22548 Linux and GNU targets.
22550 2021-06-09 Richard Biener <rguenther@suse.de>
22552 * tree-vect-stmts.c (vect_is_simple_use): Always get dt
22555 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
22557 * config/arc/arc.md (loop_end): Change it to
22558 define_insn_and_split.
22560 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
22562 * config/arc/arc.md (maddhisi4): Use VMAC2H instruction.
22563 (machi): New pattern.
22564 (umaddhisi4): Use VMAC2HU instruction.
22565 (umachi): New pattern.
22567 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
22569 * config/arc/arc-protos.h (arc_split_move_p): New prototype.
22570 * config/arc/arc.c (arc_split_move_p): New function.
22571 (arc_split_move): Clean up.
22572 * config/arc/arc.md (movdi_insn): Clean up, use arc_split_move_p.
22573 (movdf_insn): Likewise.
22574 * config/arc/simdext.md (mov<VWH>_insn): Likewise.
22576 2021-06-09 Uroš Bizjak <ubizjak@gmail.com>
22579 * config/i386/i386.c (print_operand_address_as): Rename "no_rip"
22580 argument to "raw". Do not emit segment overrides when "raw" is true.
22582 2021-06-09 Martin Liska <mliska@suse.cz>
22584 * doc/gcov.texi: Create a proper JSON files.
22585 * doc/invoke.texi: Remove dots in order to make it a valid
22588 2021-06-09 Xionghu Luo <luoxhu@linux.ibm.com>
22590 * config/rs6000/rs6000-p8swap.c (pattern_is_rotate64): New.
22591 (insn_is_load_p): Use pattern_is_rotate64.
22592 (insn_is_swap_p): Likewise.
22593 (quad_aligned_load_p): Likewise.
22594 (const_load_sequence_p): Likewise.
22595 (replace_swapped_aligned_load): Likewise.
22596 (recombine_lvx_pattern): Likewise.
22597 (recombine_stvx_pattern): Likewise.
22599 2021-06-09 Andrew MacLeod <amacleod@redhat.com>
22601 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Use a
22602 fur_stmt source record.
22603 * gimple-range.cc (fur_source::get_operand): Generic range query.
22604 (fur_source::get_phi_operand): New.
22605 (fur_source::register_dependency): New.
22606 (fur_source::query): New.
22607 (class fur_edge): New. Edge source for operands.
22608 (fur_edge::fur_edge): New.
22609 (fur_edge::get_operand): New.
22610 (fur_edge::get_phi_operand): New.
22611 (fur_edge::query): New.
22612 (fur_stmt::fur_stmt): New.
22613 (fur_stmt::get_operand): New.
22614 (fur_stmt::get_phi_operand): New.
22615 (fur_stmt::query): New.
22616 (class fur_depend): New. Statement source and process dependencies.
22617 (fur_depend::fur_depend): New.
22618 (fur_depend::register_dependency): New.
22619 (class fur_list): New. List source for operands.
22620 (fur_list::fur_list): New.
22621 (fur_list::get_operand): New.
22622 (fur_list::get_phi_operand): New.
22623 (fold_range): New. Instantiate appropriate fur_source class and fold.
22624 (fold_using_range::range_of_range_op): Use new API.
22625 (fold_using_range::range_of_address): Ditto.
22626 (fold_using_range::range_of_phi): Ditto.
22627 (imple_ranger::fold_range_internal): Use fur_depend class.
22628 (fold_using_range::range_of_ssa_name_with_loop_info): Use new API.
22629 * gimple-range.h (class fur_source): Now a base class.
22630 (class fur_stmt): New.
22631 (fold_range): New prototypes.
22632 (fur_source::fur_source): Delete.
22634 2021-06-08 Andrew Pinski <apinski@marvell.com>
22636 PR tree-optimization/25290
22637 * tree-ssa-phiopt.c (xor_replacement): Delete.
22638 (tree_ssa_phiopt_worker): Delete use of xor_replacement.
22639 (match_simplify_replacement): Allow one cheap preparation
22640 statement that can be moved to before the if.
22642 2021-06-08 Pat Haugen <pthaugen@linux.ibm.com>
22644 * config/rs6000/power10.md (power10-fused-load, power10-fused-store,
22645 power10-fused_alu, power10-fused-vec, power10-fused-branch): New.
22647 2021-06-08 Jeff Law <jeffreyalaw@gmail.com>
22649 * config/h8300/logical.md (andqi3_1): Move BCLR case into define_insn_and_split.
22650 Create length attribute on define_insn_and_split. Only split for cases which we
22652 (andqi3_1<cczn>): Renamed from andqi3_1_clobber_flags. Only handle AND here and
22653 fix length computation.
22654 (b<code><mode>msx): Combine QImode and HImode H8/SX patterns using iterator.
22656 2021-06-08 Richard Biener <rguenther@suse.de>
22658 PR tree-optimization/100923
22659 * tree-ssa-sccvn.c (valueize_refs_1): Take a pointer to
22660 the operand vector to be valueized.
22661 (valueize_refs): Likewise.
22662 (valueize_shared_reference_ops_from_ref): Adjust.
22663 (valueize_shared_reference_ops_from_call): Likewise.
22664 (vn_reference_lookup_3): Likewise.
22665 (vn_reference_lookup_pieces): Likewise. Re-valueize
22666 with honoring availability when we are about to create
22667 the ao_ref and valueized before.
22668 (vn_reference_lookup): Likewise.
22669 (vn_reference_insert_pieces): Adjust.
22671 2021-06-08 Richard Biener <rguenther@suse.de>
22673 * tree-vectorizer.h (_slp_instance::root_stmt): Change to...
22674 (_slp_instance::root_stmts): ... a vector.
22675 (SLP_INSTANCE_ROOT_STMT): Rename to ...
22676 (SLP_INSTANCE_ROOT_STMTS): ... this.
22677 (slp_root::root): Change to...
22678 (slp_root::roots): ... a vector.
22679 (slp_root::slp_root): Adjust.
22680 * tree-vect-slp.c (_slp_instance::location): Adjust.
22681 (vect_free_slp_instance): Release the root stmt vector.
22682 (vect_build_slp_instance): Adjust.
22683 (vect_analyze_slp): Likewise.
22684 (_bb_vec_info::~_bb_vec_info): Likewise.
22685 (vect_slp_analyze_operations): Likewise.
22686 (vect_bb_vectorization_profitable_p): Likewise. Adjust
22687 costs for the root stmt.
22688 (vect_slp_check_for_constructors): Gather all BIT_INSERT_EXPRs
22690 (vect_slp_analyze_bb_1): Simplify by marking all root stmts
22692 (vectorize_slp_instance_root_stmt): Adjust.
22693 (vect_schedule_slp): Likewise.
22695 2021-06-08 Aldy Hernandez <aldyh@redhat.com>
22697 * gimple-ssa-evrp.c (class ssa_equiv_stack): New.
22698 (ssa_equiv_stack::ssa_equiv_stack): New.
22699 (ssa_equiv_stack::~ssa_equiv_stack): New.
22700 (ssa_equiv_stack::enter): New.
22701 (ssa_equiv_stack::leave): New.
22702 (ssa_equiv_stack::push_replacement): New.
22703 (ssa_equiv_stack::get_replacement): New.
22704 (is_pointer_ssa): New.
22705 (class pointer_equiv_analyzer): New.
22706 (pointer_equiv_analyzer::pointer_equiv_analyzer): New.
22707 (pointer_equiv_analyzer::~pointer_equiv_analyzer): New.
22708 (pointer_equiv_analyzer::set_global_equiv): New.
22709 (pointer_equiv_analyzer::set_cond_equiv): New.
22710 (pointer_equiv_analyzer::get_equiv): New.
22711 (pointer_equiv_analyzer::enter): New.
22712 (pointer_equiv_analyzer::leave): New.
22713 (pointer_equiv_analyzer::get_equiv_expr): New.
22714 (pta_valueize): New.
22715 (pointer_equiv_analyzer::visit_stmt): New.
22716 (pointer_equiv_analyzer::visit_edge): New.
22717 (hybrid_folder::value_of_expr): Call PTA.
22718 (hybrid_folder::value_on_edge): Same.
22719 (hybrid_folder::pre_fold_bb): New.
22720 (hybrid_folder::post_fold_bb): New.
22721 (hybrid_folder::pre_fold_stmt): New.
22722 (rvrp_folder::pre_fold_bb): New.
22723 (rvrp_folder::post_fold_bb): New.
22724 (rvrp_folder::pre_fold_stmt): New.
22725 (rvrp_folder::value_of_expr): Call PTA.
22726 (rvrp_folder::value_on_edge): Same.
22728 2021-06-08 Jakub Jelinek <jakub@redhat.com>
22731 * tree-inline.c (copy_tree_body_r): For OMP_CLAUSE_DEPEND don't
22732 check TREE_CODE if OMP_CLAUSE_DECL is NULL.
22734 2021-06-08 Richard Biener <rguenther@suse.de>
22736 PR middle-end/100951
22737 * tree-vect-generic.c (expand_vector_piecewise): Build a
22738 VECTOR_CST if all elements are constant.
22739 (expand_vector_condition): Likewise.
22740 (lower_vec_perm): Likewise.
22741 (expand_vector_conversion): Likewise.
22743 2021-06-08 Martin Liska <mliska@suse.cz>
22745 * doc/invoke.texi: Document new param evrp-sparse-threshold.
22747 2021-06-08 Martin Liska <mliska@suse.cz>
22749 * genautomata.c (create_automata): Fix typo.
22751 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
22753 PR tree-optimization/100794
22754 * tree-predcom.c (tree_predictive_commoning_loop): Add parameter
22755 allow_unroll_p and only allow unrolling when it's true.
22756 (tree_predictive_commoning): Add parameter allow_unroll_p and
22758 (run_tree_predictive_commoning): Likewise.
22759 (pass_predcom::gate): Check flag_tree_loop_vectorize and
22760 global_options_set.x_flag_predictive_commoning.
22761 (pass_predcom::execute): Adjust for allow_unroll_p.
22763 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
22765 * tree-predcom.c (execute_pred_commoning): Remove update_ssa call.
22766 (tree_predictive_commoning_loop): Factor some cleanup stuffs into
22767 lambda function cleanup, remove scev_reset call, and adjust return
22769 (tree_predictive_commoning): Adjust for different changed values,
22770 only set flag TODO_update_ssa_only_virtuals if changed.
22771 (pass_data pass_data_predcom): Remove TODO_update_ssa_only_virtuals
22772 from todo_flags_finish.
22774 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
22776 * gimple-range-cache.cc (class sbr_sparse_bitmap): New.
22777 (sbr_sparse_bitmap::sbr_sparse_bitmap): New.
22778 (sbr_sparse_bitmap::bitmap_set_quad): New.
22779 (sbr_sparse_bitmap::bitmap_get_quad): New.
22780 (sbr_sparse_bitmap::set_bb_range): New.
22781 (sbr_sparse_bitmap::get_bb_range): New.
22782 (sbr_sparse_bitmap::bb_range_p): New.
22783 (block_range_cache::block_range_cache): initialize bitmap obstack.
22784 (block_range_cache::~block_range_cache): Destruct obstack.
22785 (block_range_cache::set_bb_range): Decide when to utilze the
22786 sparse on entry cache.
22787 * gimple-range-cache.h (block_range_cache): Add bitmap obstack.
22788 * params.opt (-param=evrp-sparse-threshold): New.
22790 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
22792 * bitmap.c (bitmap_set_aligned_chunk): New.
22793 (bitmap_get_aligned_chunk): New.
22794 (test_aligned_chunk): New.
22795 (bitmap_c_tests): Call test_aligned_chunk.
22796 * bitmap.h (bitmap_set_aligned_chunk, bitmap_get_aligned_chunk): New.
22798 2021-06-07 Uroš Bizjak <ubizjak@gmail.com>
22801 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
22803 (ix86_expand_vector_init_one_nonzero): Ditto.
22804 (ix86_expand_vector_init_one_var): Ditto.
22805 (ix86_expand_vector_init_general): Ditto.
22806 * config/i386/mmx.md (vec_initv4qiqi): New expander.
22808 2021-06-07 Jeff Law <jeffreyalaw@gmail.com>
22810 * config/h8300/movepush.md: Change most _clobber_flags
22811 patterns to instead use <cczn> subst.
22812 (movsi_cczn): New pattern with usable CC cases split out.
22813 (movsi_h8sx_cczn): Likewise.
22815 2021-06-07 Martin Liska <mliska@suse.cz>
22817 * common/common-target.def: Split long lines and replace them
22819 * target.def: Likewise.
22820 * doc/tm.texi: Re-generated.
22822 2021-06-07 Jakub Jelinek <jakub@redhat.com>
22825 * fold-const.c (fold_read_from_vector): Return NULL if trying to
22826 read from a CONSTRUCTOR with vector type elements.
22828 2021-06-07 Jakub Jelinek <jakub@redhat.com>
22830 PR middle-end/100898
22831 * tree-inline.c (copy_bb): Only use gimple_call_arg_ptr if memcpy
22832 should copy any arguments. Don't call gimple_call_num_args
22833 on id->call_stmt or call_stmt more than once.
22835 2021-06-07 liuhongt <hongtao.liu@intel.com>
22838 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): Refine
22840 (<insn>v4siv4di2): Delete constraints for define_expand.
22842 2021-06-07 liuhongt <hongtao.liu@intel.com>
22845 * config/i386/i386-expand.c (ix86_expand_builtin): Remove
22846 assignment of cfun->machine->has_explicit_vzeroupper.
22847 * config/i386/i386-features.c
22848 (ix86_add_reg_usage_to_vzerouppers): Delete.
22849 (ix86_add_reg_usage_to_vzeroupper): Ditto.
22850 (rest_of_handle_insert_vzeroupper): Remove
22851 ix86_add_reg_usage_to_vzerouppers, add df_analyze at the end
22853 (gate): Remove cfun->machine->has_explicit_vzeroupper.
22854 * config/i386/i386-protos.h (ix86_expand_avx_vzeroupper):
22856 * config/i386/i386.c (ix86_insn_callee_abi): New function.
22857 (ix86_initialize_callee_abi): Ditto.
22858 (ix86_expand_avx_vzeroupper): Ditto.
22859 (ix86_hard_regno_call_part_clobbered): Adjust for vzeroupper
22861 (TARGET_INSN_CALLEE_ABI): Define as ix86_insn_callee_abi.
22862 (ix86_emit_mode_set): Call ix86_expand_avx_vzeroupper
22864 * config/i386/i386.h (struct GTY(()) machine_function): Delete
22865 has_explicit_vzeroupper.
22866 * config/i386/i386.md (enum unspec): New member
22868 (ABI_DEFAULT,ABI_VZEROUPPER,ABI_UNKNOWN): New
22869 define_constants for insn callee abi index.
22870 * config/i386/predicates.md (vzeroupper_pattern): Adjust.
22871 * config/i386/sse.md (UNSPECV_VZEROUPPER): Deleted.
22872 (avx_vzeroupper): Call ix86_expand_avx_vzeroupper.
22873 (*avx_vzeroupper): Rename to ..
22874 (avx_vzeroupper_callee_abi): .. this, and adjust pattern as
22875 call_insn which has a special vzeroupper ABI.
22876 (*avx_vzeroupper_1): Deleted.
22878 2021-06-07 liuhongt <hongtao.liu@intel.com>
22881 * df-scan.c (df_get_call_refs): When call_insn is a fake call,
22882 it won't use stack pointer reg.
22883 * final.c (leaf_function_p): When call_insn is a fake call, it
22884 won't affect caller as a leaf function.
22885 * reg-stack.c (callee_clobbers_any_stack_reg): New.
22886 (subst_stack_regs): When call_insn doesn't clobber any stack
22887 reg, don't clear the arguments.
22888 * rtl.c (shallow_copy_rtx): Don't clear flag used when orig is
22890 * shrink-wrap.c (requires_stack_frame_p): No need for stack
22891 frame for a fake call.
22892 * rtl.h (FAKE_CALL_P): New macro.
22894 2021-06-06 Eric Botcazou <ebotcazou@adacore.com>
22896 * config/sparc/sparc-protos.h (order_regs_for_local_alloc): Rename
22898 (sparc_order_regs_for_local_alloc): ...this.
22899 (sparc_leaf_reg_remap): Declare.
22900 * config/sparc/sparc.h (ADJUST_REG_ALLOC_ORDER): Adjust.
22901 (LEAF_REG_REMAP): Reimplement as call to sparc_leaf_reg_remap.
22902 * config/sparc/sparc.c (leaf_reg_remap): Delete.
22903 (order_regs_for_local_alloc): Rename to...
22904 (sparc_order_regs_for_local_alloc): ...this.
22905 (sparc_leaf_reg_remap): New function.
22906 (sparc_conditional_register_usage): Do not modify leaf_reg_remap.
22908 2021-06-06 David Edelsohn <dje.gcc@gmail.com>
22910 * config/rs6000/rs6000.c (rs6000_xcoff_asm_output_aligned_decl_common):
22911 Use assemble_name to output BSS section name.
22913 2021-06-06 Uroš Bizjak <ubizjak@gmail.com>
22915 * config/i386/constraints.md (Bs):
22916 Remove boolean operators from match_test RTX.
22919 (M): Use "mode" variable instead of GET_MODE (op) in match_test RTX.
22922 2021-06-06 Martin Liska <mliska@suse.cz>
22924 * doc/extend.texi: Add missing @headitem.
22925 * doc/invoke.texi: Likewise.
22926 * doc/objc.texi: Likewise.
22928 2021-06-06 Martin Liska <mliska@suse.cz>
22930 * genhooks.c (emit_findices): Remove unused function.
22931 (emit_documentation): Do not call emit_findices
22932 and do not search for @Fcode directives.
22934 2021-06-06 Martin Liska <mliska@suse.cz>
22936 * doc/invoke.texi: Remove extra character.
22938 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22940 * config/sh/sh.md (doloop_end_split): Fix empty split condition.
22942 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22944 * config/sparc/sparc.md (*snedi<W:mode>_zero_vis3,
22945 *neg_snedi<W:mode>_zero_subxc, *plus_snedi<W:mode>_zero,
22946 *plus_plus_snedi<W:mode>_zero, *minus_snedi<W:mode>_zero,
22947 *minus_minus_snedi<W:mode>_zero): Fix empty split condition.
22949 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22951 * config/or1k/or1k.md (*movdi): Fix empty split condition.
22953 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22955 * config/mips/mips.md (<anonymous>, bswapsi2, bswapdi2): Fix empty
22958 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22960 * config/m68k/m68k.md (*zero_extend_inc, *zero_extend_dec,
22961 *zero_extendsidi2): Fix empty split condition.
22963 2021-06-05 Jeff Law <jeffreyalaw@gmail.com>
22965 * config/h8300/addsub.md: Fix split condition in define_insn_and_split
22967 * config/h8300/bitfield.md: Likewise.
22968 * config/h8300/combiner.md: Likewise.
22969 * config/h8300/divmod.md: Likewise.
22970 * config/h8300/extensions.md: Likewise.
22971 * config/h8300/jumpcall.md: Likewise.
22972 * config/h8300/movepush.md: Likewise.
22973 * config/h8300/multiply.md: Likewise.
22974 * config/h8300/other.md: Likewise.
22975 * config/h8300/shiftrotate.md: Likewise.
22976 * config/h8300/logical.md: Likewise. Fix split pattern to use
22977 code iterator that somehow slipped through.
22979 2021-06-04 Tobias Burnus <tobias@codesourcery.com>
22981 PR middle-end/100905
22982 * tree-nested.c (convert_nonlocal_omp_clauses,
22983 convert_local_omp_clauses): Handle OMP_CLAUSE_BIND.
22985 2021-06-04 Martin Sebor <msebor@redhat.com>
22987 PR middle-end/100732
22988 * gimple-fold.c (gimple_fold_builtin_sprintf): Avoid folding calls
22989 with either source or destination argument of invalid type.
22990 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Avoid checking
22991 calls with arguments of invalid type.
22993 2021-06-04 Martin Sebor <msebor@redhat.com>
22995 * attribs.c (init_attr_rdwr_indices): Use VLA bounds in the expected
22997 (attr_access::vla_bounds): Also handle VLA bounds.
22999 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
23001 * config/i386/predicates.md (GOT_memory_operand):
23002 Implement using match_code RTXes.
23003 (GOT32_symbol_operand): Ditto.
23005 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
23008 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
23010 (ix86_expand_vector_init_general): Ditto.
23011 Use SImode instead of word_mode for logic operations
23012 when GET_MODE_SIZE (mode) < UNITS_PER_WORD.
23013 (expand_vec_perm_even_odd_1): Assert that V2HI mode should be
23014 implemented by expand_vec_perm_1.
23015 (expand_vec_perm_broadcast_1): Assert that V2HI and V4HI modes
23016 should be implemented using standard shuffle patterns.
23017 (ix86_vectorize_vec_perm_const): Handle V2HImode. Add V4HI and
23018 V2HI modes to modes, implementable with shuffle for one operand.
23019 * config/i386/mmx.md (*punpckwd): New insn_and_split pattern.
23020 (*pshufw_1): New insn pattern.
23021 (*vec_dupv2hi): Ditto.
23022 (vec_initv2hihi): New expander.
23024 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
23026 * config/arm/vfp.md (no_literal_pool_df_immediate,
23027 no_literal_pool_sf_immediate): Fix empty split condition.
23029 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
23031 * config/i386/i386.md (*load_tp_x32_zext, *add_tp_x32_zext,
23032 *tls_dynamic_gnu2_combine_32): Fix empty split condition.
23033 * config/i386/sse.md (*<sse2_avx2>_pmovmskb_lt,
23034 *<sse2_avx2>_pmovmskb_zext_lt, *sse2_pmovmskb_ext_lt,
23035 *<sse4_1_avx2>_pblendvb_lt): Likewise.
23037 2021-06-04 Jakub Jelinek <jakub@redhat.com>
23040 * config/i386/i386-expand.c (ix86_expand_vector_init): Handle
23041 concatenation from half-sized modes with TImode elements.
23043 2021-06-04 Claudiu Zissulescu <claziss@synopsys.com>
23045 * config/arc/arc.c (arc_override_options): Disable millicode
23046 thunks when RF16 is on.
23048 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
23050 * config/rs6000/rs6000.h (PROMOTE_MODE): Remove.
23052 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
23054 * config/rs6000/rs6000-call.c (rs6000_promote_function_mode):
23055 Replace PROMOTE_MODE marco with its content.
23057 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
23059 * config/cris/cris.md (*addi_reload): Fix empty split condition.
23061 2021-06-03 Jim Wilson <jimw@sifive.com>
23063 * config.gcc (riscv*-*-*): If --with-riscv-attribute not used,
23064 turn it on for all riscv targets.
23066 2021-06-03 Uroš Bizjak <ubizjak@gmail.com>
23069 * config/i386/i386-expand.c (ix86_expand_vector_set):
23070 Handle V2HI and V4QI modes.
23071 (ix86_expand_vector_extract): Ditto.
23072 * config/i386/mmx.md (*pinsrw): New insn pattern.
23075 (*pextrw_zext): Ditto.
23077 (*pextrb_zext): Ditto.
23078 (vec_setv2hi): New expander.
23079 (vec_extractv2hihi): Ditto.
23080 (vec_setv4qi): Ditto.
23081 (vec_extractv4qiqi): Ditto.
23082 (vec_setv8qi): Enable only for TARGET_SSE4_1.
23083 (vec_extractv8qiqi): Ditto.
23085 2021-06-03 Aaron Sawdey <acsawdey@linux.ibm.com>
23087 * config/rs6000/genfusion.pl (gen_logical_addsubf): Fix input
23088 order to subf instruction.
23089 * config/rs6000/fusion.md: Regenerate.
23091 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
23093 * calls.c (get_size_range): Use range_of_expr instead of
23094 determine_value_range.
23095 * tree-affine.c (expr_to_aff_combination): Same.
23096 * tree-data-ref.c (split_constant_offset): Same.
23097 * tree-vrp.c (determine_value_range_1): Remove.
23098 (determine_value_range): Remove.
23099 * tree-vrp.h (determine_value_range): Remove.
23101 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
23103 * function-tests.c (test_ranges): Call gimple_range_tests.
23104 * gimple-range-cache.cc (ranger_cache::range_of_expr): Pass stmt
23106 * gimple-range.cc (fur_source::get_operand): Do not call
23107 get_tree_range or gimple_range_global.
23109 (get_tree_range): Move to value-query.cc.
23110 Call get_arith_expr_range.
23111 (gimple_ranger::range_of_expr): Add argument to get_tree_range.
23112 Include gimple-range-tests.cc.
23113 * gimple-range.h (fold_range): Add argument.
23114 (get_tree_range): Remove.
23115 * selftest.h (gimple_range_tests): New.
23116 * value-query.cc (global_range_query::range_of_expr): Add
23118 (range_query::get_tree_range): Move from gimple-range.cc.
23119 * value-query.h (class range_query): Add get_tree_range and
23120 get_arith_expr_range. Make fur_source a friend.
23121 * vr-values.c (vr_values::range_of_expr): Pass stmt to
23123 * gimple-range-tests.cc: New file.
23125 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
23127 * gimple-range.cc (gimple_ranger::export_global_ranges): Call
23128 update_global_range.
23129 * value-query.cc (update_global_range): New.
23130 * value-query.h (update_global_range): New.
23132 2021-06-03 David Malcolm <dmalcolm@redhat.com>
23134 * diagnostic-show-locus.c (diagnostic_show_locus): Don't reject
23135 printing the same location twice if there are fix-it hints,
23136 multiple locations, or a label.
23138 2021-06-03 Andre Vieira <andre.simoesdiasvieira@arm.com>
23140 * tree-vect-loop.c (vect_transform_loop): Use main loop's various'
23141 thresholds to narrow the upper bound on epilogue iterations.
23143 2021-06-03 Christophe Lyon <christophe.lyon@linaro.org>
23145 * config/arm/mve.md (mve_vabsq_f<mode>): Use 'abs' instead of unspec.
23146 (mve_vabsq_s<mode>): Likewise.
23147 * config/arm/neon.md (abs<mode>2): Rename to neon_abs<mode>2.
23148 * config/arm/unspecs.md (VABSQ_F, VABSQ_S): Delete.
23149 * config/arm/vec-common.md (neg<mode>2): Rename to
23150 <absneg_str><mode>2.
23152 2021-06-03 Claudiu Zissulescu <claziss@synopsys.com>
23154 * common/config/arc/arc-common.c (arc_option_optimization_table):
23155 Remove malign-call.
23156 * config/arc/arc.c (arc_unalign_branch_p): Remove unused function.
23157 * config/arc/arc.h (TARGET_MIXED_CODE): Remove macro.
23158 (INDEX_REG_CLASS): Only refer to GENERAL_REGS.
23159 * config/arc/arc.md (abssi2_mixed): Remove pattern.
23160 * config/arc/arc.opt (munalign-prob-threshold): Mark it obsolete.
23161 (malign-call): Likewise.
23162 (mmixed-code): Likewise.
23163 * doc/invoke.texi (ARC): Update doc.
23165 2021-06-03 Martin Liska <mliska@suse.cz>
23167 * common.opt: Use proper Enum values.
23168 * opts.c (COVERAGE_SANITIZER_OPT): Remove.
23169 (parse_sanitizer_options): Handle only sanitizer_opts.
23170 (common_handle_option): Just assign value.
23172 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
23175 * tree-inline.c (inline_forbidden_p): Remove test on return type.
23177 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
23179 * dwarf2out.c (loc_list_from_tree_1) <FUNCTION_DECL>: Also generate
23180 DW_OP_GNU_variable_value referencing an existing DIE at file scope.
23181 (type_byte_size): Inline into...
23182 (add_byte_size_attribute): ...this and call add_scalar_info.
23184 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
23186 * dwarf2out.c (mem_loc_descriptor) <UDIV>: Fix typo.
23187 (typed_binop_from_tree): New function.
23188 (loc_list_from_tree_1) <EXACT_DIV_EXPR>: For an unsigned type,
23189 turn a divide by a power of 2 into a shift.
23190 <CEIL_DIV_EXPR>: For an unsigned type, use a signed divide if the
23191 size of the mode is lower than DWARF2_ADDR_SIZE; otherwise, do a
23192 typed divide by calling typed_binop_from_tree.
23194 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
23196 * dwarf2out.c (scompare_loc_descriptor): Fix head comment.
23197 (is_handled_procedure_type): Likewise.
23198 (struct loc_descr_context): Add strict_signedness field.
23199 (resolve_args_picking_1): Deal with DW_OP_[GNU_]deref_type,
23200 DW_OP_[GNU_]convert and DW_OP_[GNU_]reinterpret.
23201 (resolve_args_picking): Minor tweak.
23202 (function_to_dwarf_procedure): Initialize strict_signedness field.
23203 (type_byte_size): Likewise.
23204 (field_byte_offset): Likewise.
23205 (gen_descr_array_type_die): Likewise.
23206 (gen_variant_part): Likewise.
23207 (loc_list_from_tree_1) <CALL_EXPR>: Tidy up and set strict_signedness
23208 to true when a context is present before evaluating the arguments.
23209 <COND_EXPR>: Do not generate a useless comparison with zero.
23210 When dereferencing an address, if strict_signedness is true and the
23211 type is small and signed, use DW_OP_deref_type to do the dereference
23212 and then DW_OP_convert to convert back to the generic type.
23214 2021-06-03 Jakub Jelinek <jakub@redhat.com>
23217 * tree-inline.c (copy_tree_body_r): Handle iterators on
23218 OMP_CLAUSE_AFFINITY or OMP_CLAUSE_DEPEND.
23220 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
23222 * config/arc/arc.md (*bbit_di): Remove.
23224 2021-06-02 Christoph Muellner <cmuellner@gcc.gnu.org>
23226 PR rtl-optimization/100264
23227 * ree.c (get_sub_rtx): Ignore SET expressions without register
23228 destinations and remove assertion, as it is not valid anymore
23229 with this new behaviour.
23230 (merge_def_and_ext): Eliminate destination check for register
23231 as such SET expressions can't occur anymore.
23232 (combine_reaching_defs): Likewise.
23234 2021-06-02 Jakub Jelinek <jakub@redhat.com>
23237 * config/xtensa/xtensa.h (LEAF_REG_REMAP): Cast REGNO to int to avoid
23238 -Wtype-limits warnings.
23239 (DWARF_FRAME_REGISTER): Rewrite into ternary operator with addition
23240 in operands to avoid -Wsign-compare warnings.
23242 2021-06-02 Pat Haugen <pthaugen@linux.ibm.com>
23244 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): Use
23247 2021-06-02 Vineet Gupta <vgupta@synopsys.com>
23249 * config/arc/arc.h (TARGET_CPU_DEFAULT): Change to hs38_linux.
23251 2021-06-02 Ilya Leoshkevich <iii@linux.ibm.com>
23253 * config/s390/s390.md(*ashrdi3_31<setcc><cconly>): Use a single
23255 * config/s390/subst.md(cconly_subst): Use a single constraint
23256 in (match_scratch).
23258 2021-06-02 Martin Liska <mliska@suse.cz>
23260 * ipa-icf.h: Use auto_vec for memory_access_types.
23262 2021-06-02 Jeff Law <jeffreyalaw@gmail.com>
23264 * config/h8300/h8300-protos.h (compute_a_shift_length): Drop unused
23265 argument from prototype.
23266 (output_logical_op): Add rtx_code argument.
23267 (compute_logical_op_length): Likewise.
23268 * config/h8300/h8300.c (h8300_and_costs): Pass additional argument
23269 to compute_a_shift_length.
23270 (output_logical_op); New argument with the rtx code rather than
23271 extracting it from an operand. Handle QImode too.
23272 (compute_logical_op_length): Similary.
23273 (compute_a_shift_length): Drop unused argument.
23274 * config/h8300/h8300.md (logicals): New code iterator.
23275 * config/h8300/logical.md (<code><mode>3 expander): Combine
23276 the "and" expander with the "ior"/"xor" expander.
23277 (bclr<mode>msx): Combine the QI/HI mode patterns.
23278 (<logical><mode>3 insns): Use code iterator rather than match_operator.
23279 Handle QImode as well. Update call to output_logical_op and
23280 compute_logical_op_length to pass in rtx_code
23281 Fix split condition on all define_insn_and_split patterns.
23282 (one_cmpl<mode>2<cczn>): Use <cczn> to support both clobbering
23283 the flags and setting ZN via existing define_subst.
23284 * config/h8300/shiftrotate.md: Drop unused argument from
23285 calls to compute_a_shift_length.
23286 Signed-off-by: Jeff Law <jeffreyalaw@gmail.com>
23288 2021-06-01 Andrew Pinski <apinski@marvell.com>
23290 PR tree-optimization/25290
23291 * tree-ssa-phiopt.c (match_simplify_replacement):
23293 (tree_ssa_phiopt_worker): Use match_simplify_replacement.
23294 (two_value_replacement): Change the comment about
23295 conditional_replacement.
23296 (conditional_replacement): Delete.
23298 2021-06-01 Andrew Pinski <apinski@marvell.com>
23300 PR tree-optimization/95481
23301 * tree-tailcall.c (find_tail_calls): Handle empty typed
23304 2021-06-01 Andrew Pinski <apinski@marvell.com>
23306 * gimplify.c (zero_sized_field_decl): Delete
23307 (zero_sized_type): Delete
23308 (gimplify_init_ctor_eval): Use is_empty_type instead
23309 of zero_sized_field_decl.
23310 (gimplify_modify_expr): Use is_empty_type instead of
23313 2021-06-01 Jason Merrill <jason@redhat.com>
23316 * tree.h (CALL_FROM_NEW_OR_DELETE_P): Adjust comment.
23318 2021-06-01 Jason Merrill <jason@redhat.com>
23321 * diagnostic.h (warning_enabled_at): Declare.
23322 * diagnostic.c (diagnostic_enabled): Factor out from...
23323 (diagnostic_report_diagnostic): ...here.
23324 (warning_enabled_at): New.
23326 2021-06-01 Aldy Hernandez <aldyh@redhat.com>
23328 * gimple-ssa-evrp.c: Enable exporting of global ranges.
23330 2021-06-01 Martin Liska <mliska@suse.cz>
23333 * doc/invoke.texi: Mention that -fgcse-after-reload
23334 is enabled with -O3.
23336 2021-06-01 liuhongt <hongtao.liu@intel.com>
23338 PR tree-optimization/98365
23339 * tree-if-conv.c (strip_nop_cond_scalar_reduction): New function.
23340 (is_cond_scalar_reduction): Handle nop_expr in cond scalar reduction.
23341 (convert_scalar_cond_reduction): Ditto.
23342 (predicate_scalar_phi): Ditto.
23344 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
23346 PR tree-optimization/100781
23347 * gimple-range-cache.cc (ranger_cache::ranger_cache): Enable new
23348 value calculation by default.
23349 (ranger_cache::enable_new_values): New.
23350 (ranger_cache::disable_new_values): New.
23351 (ranger_cache::push_poor_value): Check if new values are allowed.
23352 * gimple-range-cache.h (class ranger_cache): New member/methods.
23353 * gimple-range.cc (gimple_ranger::range_of_expr): Check for debug
23354 statement, and disable/renable new value calculation.
23356 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
23358 * gimple-range-cache.cc (ranger_cache::ssa_range_in_bb): Delete.
23359 (ranger_cache::range_of_def): New.
23360 (ranger_cache::entry_range): New.
23361 (ranger_cache::exit_range): New.
23362 (ranger_cache::range_of_expr): Adjust.
23363 (ranger_cache::range_on_edge): Adjust.
23364 (ranger_cache::propagate_cache): Call exit_range directly.
23365 * gimple-range-cache.h (class ranger_cache): Adjust.
23367 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
23369 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust for
23370 gori_compute being a member rather than base class.
23371 dervied call to member call.
23372 (ranger_cache::dump): No longer dump gori_map.
23373 (ranger_cache::dump_bb): New.
23374 (ranger_cache::get_non_stale_global_range): Adjust for gori_compute
23375 being a member rather than base class.
23376 (ranger_cache::set_global_range): Ditto.
23377 (ranger_cache::ssa_range_in_bb): Ditto.
23378 (ranger_cache::range_of_expr): New.
23379 (ranger_cache::range_on_edge): New.
23380 (ranger_cache::block_range): Adjust for gori_computes. Debug changes.
23381 (ranger_cache::propagate_cache): Adjust debugging output.
23382 (ranger_cache::fill_block_cache): Adjust for gori_computes. Debug
23384 * gimple-range-cache.h (class ranger_cache): Make gori_compute a
23385 member, and inherit from range_query instead.
23386 (ranger_cache::dump_bb): New. split from dump.
23387 * gimple-range-gori.cc (gori_compute::ssa_range_in_bb): Delete.
23388 (gori_compute::expr_range_at_stmt): Delete.
23389 (gori_compute::compute_name_range_op): Delete.
23390 (gori_compute::compute_operand_range_switch): Add fur_source.
23391 (gori_compute::compute_operand_range): Add fur_source param, inline
23392 old compute_name_range_op and optimize_logical_operands.
23393 (struct tf_range): Delete.
23394 (gori_compute::logical_combine): Adjust
23395 (gori_compute::optimize_logical_operands): Delete.
23396 (gori_compute::compute_logical_operands_in_chain): Delete.
23397 (gori_compute::compute_logical_operands): Adjust.
23398 (gori_compute::compute_operand1_range): Adjust to fur_source.
23399 (gori_compute::compute_operand2_range): Ditto.
23400 (gori_compute::compute_operand1_and_operand2_range): Ditto.
23401 (gori_compute::outgoing_edge_range_p): Add range_query parameter,
23402 and adjust to fur_source.
23403 * gimple-range-gori.h (class gori_compute): Simplify and adjust to
23404 range_query and fur_source.
23405 * gimple-range.cc (gimple_ranger::range_on_edge): Query range_on_edge
23406 from the ranger_cache..
23407 (gimple_ranger::fold_range_internal): Adjust to base class change of
23409 (gimple_ranger::dump_bb): Adjust dump.
23410 * gimple-range.h (gimple_ranger):export gori computes object.
23412 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
23414 PR tree-optimization/100774
23415 * gimple-range-cache.cc (ranger_cache::get_non_stale_global_range):
23416 Constant values are also not stale.
23417 (ranger_cache::set_global_range): Range invariant values should also
23418 have the correct timestamp.
23420 2021-05-31 Martin Liska <mliska@suse.cz>
23422 * tree-streamer-in.c (unpack_ts_function_decl_value_fields):
23423 Unpack FUNCTION_DECL_DECL_TYPE.
23424 * tree-streamer-out.c (pack_ts_function_decl_value_fields):
23425 Stream FUNCTION_DECL_DECL_TYPE instead of
23426 DECL_IS_OPERATOR_NEW_P.
23427 * tree.h (set_function_decl_type): Use FUNCTION_DECL_DECL_TYPE
23429 (DECL_IS_OPERATOR_NEW_P): Likewise.
23430 (DECL_IS_OPERATOR_DELETE_P): Likewise.
23431 (DECL_LAMBDA_FUNCTION_P): Likewise.
23433 2021-05-31 Richard Biener <rguenther@suse.de>
23436 * internal-fn.c (expand_SHUFFLEVECTOR): Define.
23437 * internal-fn.def (SHUFFLEVECTOR): New.
23438 * internal-fn.h (expand_SHUFFLEVECTOR): Declare.
23439 * doc/extend.texi: Document __builtin_shufflevector.
23441 2021-05-31 Peter Bergner <bergner@linux.ibm.com>
23444 * config/rs6000/predicates.md(mma_assemble_input_operand): Allow
23445 indexed form addresses.
23447 2021-05-29 Jeff Law <jlaw@tachyum.com>
23449 * config/h8300/h8300.c (h8300_emit_stack_adjustment): Drop unused
23450 parameter. Call callers fixed.
23452 (output_plussi): Add FALLTHRU markers.
23453 (h8300_shift_needs_scratch_p): Add gcc_unreachable marker.
23455 2021-05-29 Jakub Jelinek <jakub@redhat.com>
23457 PR middle-end/99928
23458 * gimplify.c (gimplify_scan_omp_clauses): For taskloop simd
23459 combined with parallel, make sure to add shared clause to
23460 parallel for explicit linear clause.
23462 2021-05-29 Aldy Hernandez <aldyh@redhat.com>
23464 PR tree-optimization/100787
23465 * gimple-ssa-evrp.c: Disable exporting of global ranges.
23467 2021-05-28 Jason Merrill <jason@redhat.com>
23469 * tree-iterator.h (struct tree_stmt_iterator): Add operator++,
23470 operator--, operator*, operator==, and operator!=.
23471 (class tsi_range): New.
23473 2021-05-28 Richard Biener <rguenther@suse.de>
23475 PR tree-optimization/100778
23476 * tree-vect-slp.c (vect_build_slp_tree_1): Prevent possibly
23477 trapping ops in different BBs.
23479 2021-05-28 Richard Biener <rguenther@suse.de>
23482 * tree-inline.c (copy_bb): When processing __builtin_va_arg_pack
23483 copy fntype from original call.
23485 2021-05-28 Martin Liska <mliska@suse.cz>
23487 PR gcov-profile/100751
23488 * doc/gcov.texi: Revert partially a hunk that was wrong.
23490 2021-05-28 Cooper Qu <cooper.qu@linux.alibaba.com>
23492 * config/csky/csky-linux-elf.h (HAVE_sync_compare_and_swapqi):
23494 (HAVE_sync_compare_and_swaphi): Likewise.
23495 (HAVE_sync_compare_and_swapsi): Likewise.
23497 2021-05-28 Jakub Jelinek <jakub@redhat.com>
23499 PR middle-end/99928
23500 * tree.h (OMP_CLAUSE_MAP_IMPLICIT): Define.
23502 2021-05-28 Tobias Burnus <tobias@codesourcery.com>
23504 * gimplify.c (gimplify_omp_affinity): New.
23505 (gimplify_scan_omp_clauses): Call it; remove affinity clause afterwards.
23506 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_AFFINITY.
23507 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_AFFINITY.
23508 * tree.c (omp_clause_num_ops, omp_clause_code_name): Add clause.
23509 (walk_tree_1): Handle OMP_CLAUSE_AFFINITY.
23511 2021-05-28 Joern Rennecke <joern.rennecke@riscy-ip.com>
23512 Richard Biener <rguenther@suse.de>
23514 * match.pd <popcount & / + pattern matching>:
23515 When generating popcount directly fails, try doing it in two halves.
23517 2021-05-28 Bernd Edlinger <bernd.edlinger@hotmail.de>
23519 * Makefile.in (generated_files): Add gimple-match.c and
23522 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
23524 * gensupport.c (alter_predicate_for_insn): Handle MATCH_DUP.
23526 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
23528 * gensupport.c (alter_constraints): Add MATCH_SCRATCH case.
23530 2021-05-28 Kewen Lin <linkw@linux.ibm.com>
23532 PR tree-optimization/99398
23533 * tree-ssa-forwprop.c (simplify_permutation): Optimize some cases
23534 where the fed operands are CTOR/CST and propagated through
23535 VIEW_CONVERT_EXPR. Call vec_perm_indices::new_shrunk_vector.
23536 * vec-perm-indices.c (vec_perm_indices::new_shrunk_vector): New
23538 * vec-perm-indices.h (vec_perm_indices::new_shrunk_vector): New
23541 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
23543 * config/i386/mmx.md (addv2sf3): Do not call
23544 ix86_fixup_binary_operands_no_copy.
23547 (<smaxmin:code>v2sf3): Ditto.
23548 (<plusminus:insn><MMXMODEI:mode>3): Ditto.
23549 (<plusminus:insn><VI_32:mode>3): Remove expander.
23550 (<plusminus:insn><VI_32:mode>3): Rename from
23551 "*<plusminus:insn><VI_32:mode>3".
23552 (mulv4hi): Do not call ix86_fixup_binary_operands_no_copy.
23553 (mulv2hi3): Remove expander.
23554 (mulv2hi3): Rename from *mulv2hi3.
23555 (<s>mulv2hi3_highpart): Remove expander.
23556 (<s>mulv2hi3_highpart): Rename from *<s>mulv2hi3_highpart.
23557 (<smaxmin:code><MMXMODE14:mode>3): Rename from
23558 "*mmx_<smaxmin:code><MMXMODE14:mode>3".
23559 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Remove expander.
23560 (SMAXMIN_MMXMODEI): Remove mode iterator.
23561 (<smaxmin:code>v4hi3): New expander.
23562 (<smaxmin:code>v4qi3): Rename from *<smaxmin:code>v4qi3.
23563 (<smaxmin:code>v2hi3): Rename from *<smaxmin:code>v2hi3.
23564 (<smaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
23565 (SMAXMIN_VI_32): Remove mode iterator.
23566 (<umaxmin:code><MMXMODE24:mode>3): Rename from
23567 "*mmx_<umaxmin:code><MMXMODE24:mode>3".
23568 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Remove expander.
23569 (UMAXMIN_MMXMODEI): Remove mode iterator.
23570 (<umaxmin:code>v8qi3): New expander.
23571 (<umaxmin:code>v4qi3): Rename from *<umaxmin:code>v4qi3.
23572 (<umaxmin:code>v2hi3): Rename from *<umaxmin:code>v2hi3.
23573 (<umaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
23574 (UMAXMIN_VI_32): Remove mode iterator.
23575 (<any_shift:insn>v2hi3): Remove expander.
23576 (<any_shift:insn>v2hi3): Rename from *<any_shift:insn>v2hi3.
23577 (<any_logic:code><MMXMODEI:mode>3): Do not call
23578 ix86_fixup_binary_operands_no_copy.
23579 (<any_logic:code><VI_32:mode>3): Remove expander.
23580 (<any_logic:code><VI_32:mode>3): Rename from
23581 "*<any_logic:code><VI_32:mode>3".
23582 (uavg<mode>3_ceil): Do not call ix86_fixup_binary_operands_no_copy.
23583 * config/i386/sse.md (div<VF2:mode>3): Do not call
23584 ix86_fixup_binary_operands_no_copy.
23585 (div<VF1:mode>3): Ditto.
23586 (<maxmin:code><VI8_AVX2_AVX512F:mode>3): Ditto.
23587 (smulhrsv4hi3): Ditto.
23588 (smulhrsv2hi3): Ditto.
23590 2021-05-27 Martin Sebor <msebor@redhat.com>
23592 * ggc.h (gt_ggc_mx): Add overloads for all integers.
23594 * hash-map.h (class hash_map): Add pch_nx_helper overloads for all
23596 (hash_map::operator==): New function.
23598 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
23601 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
23602 For TARGET_XOP bypass SSE comparisons for all supported vector modes.
23603 * config/i386/mmx.md (*xop_maskcmp<MMXMODEI:mode>3): New insn pattern.
23604 (*xop_maskcmp<VI_32:mode>3): Ditto.
23605 (*xop_maskcmp_uns<MMXMODEI:mode>3): Ditto.
23606 (*xop_maskcmp_uns<VI_32:mode>3): Ditto.
23608 2021-05-27 Richard Earnshaw <rearnsha@arm.com>
23611 * config/arm/arm.c (arm_configure_build_target): Remove parameter
23612 opts_set, directly check opts parameters for being non-null.
23613 (arm_option_restore): Update call to arm_configure_build_target.
23614 (arm_option_override): Likewise.
23615 (arm_can_inline_p): Likewise.
23616 (arm_valid_target_attribute_tree): Likewise.
23617 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
23618 * config/arm/arm-protos.h (arm_configure_build_target): Adjust
23621 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
23623 * vr-values.c (simplify_conversion_using_ranges): Use
23624 get_range_query instead of get_global_range_query.
23626 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
23628 * gimple-range.cc (get_range_global): Move to value-query.cc.
23629 (gimple_range_global): Same.
23630 (get_global_range_query): Same.
23631 (global_range_query::range_of_expr): Same.
23632 * gimple-range.h (class global_range_query): Move to
23634 (gimple_range_global): Same.
23635 * tree-ssanames.c (get_range_info): Move to value-query.cc.
23636 (get_ptr_nonnull): Same.
23637 * tree-ssanames.h (get_range_info): Remove.
23638 (get_ptr_nonnull): Remove.
23639 * value-query.cc (get_ssa_name_range_info): Move from
23641 (get_ssa_name_ptr_info_nonnull): Same.
23642 (get_range_global): Move from gimple-range.cc.
23643 (gimple_range_global): Same.
23644 (get_global_range_query): Same.
23645 (global_range_query::range_of_expr): Same.
23646 * value-query.h (class global_range_query): Move from
23648 (gimple_range_global): Same.
23650 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
23653 * config/i386/mmx.md (uavgv4qi3_ceil): New insn pattern.
23654 (uavgv2hi3_ceil): Ditto.
23656 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
23659 * doc/extend.texi (scalar_storage_order): Rephrase slightly.
23661 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23663 * tree-ssanames.c (get_range_info): Merge both copies of
23664 get_range_info into one that works with irange.
23665 * tree-ssanames.h (get_range_info): Remove version that works on
23668 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23670 * builtins.c (check_nul_terminated_array): Convert to get_range_query.
23671 (expand_builtin_strnlen): Same.
23672 (determine_block_size): Same.
23673 * fold-const.c (expr_not_equal_to): Same.
23674 * gimple-fold.c (size_must_be_zero_p): Same.
23675 * gimple-match-head.c: Include gimple-range.h.
23676 * gimple-pretty-print.c (dump_ssaname_info): Convert to get_range_query.
23677 * gimple-ssa-warn-restrict.c
23678 (builtin_memref::extend_offset_range): Same.
23679 * graphite-sese-to-poly.c (add_param_constraints): Same.
23680 * internal-fn.c (get_min_precision): Same.
23681 * ipa-fnsummary.c (set_switch_stmt_execution_predicate): Same.
23682 * ipa-prop.c (ipa_compute_jump_functions_for_edge): Same.
23684 * tree-data-ref.c (split_constant_offset): Same.
23685 (dr_step_indicator): Same.
23686 * tree-dfa.c (get_ref_base_and_extent): Same.
23687 * tree-scalar-evolution.c (iv_can_overflow_p): Same.
23688 * tree-ssa-loop-niter.c (refine_value_range_using_guard): Same.
23689 (determine_value_range): Same.
23690 (record_nonwrapping_iv): Same.
23691 (infer_loop_bounds_from_signedness): Same.
23692 (scev_var_range_cant_overflow): Same.
23693 * tree-ssa-phiopt.c (two_value_replacement): Same.
23694 * tree-ssa-pre.c (insert_into_preds_of_block): Same.
23695 * tree-ssa-reassoc.c (optimize_range_tests_to_bit_test): Same.
23696 * tree-ssa-strlen.c (handle_builtin_stxncpy_strncat): Same.
23698 (dump_strlen_info): Same.
23699 (set_strlen_range): Same.
23700 (maybe_diag_stxncpy_trunc): Same.
23701 (get_len_or_size): Same.
23702 (handle_integral_assign): Same.
23703 * tree-ssa-structalias.c (find_what_p_points_to): Same.
23704 * tree-ssa-uninit.c (find_var_cmp_const): Same.
23705 * tree-switch-conversion.c (bit_test_cluster::emit): Same.
23706 * tree-vect-patterns.c (vect_get_range_info): Same.
23707 (vect_recog_divmod_pattern): Same.
23708 * tree-vrp.c (intersect_range_with_nonzero_bits): Same.
23709 (register_edge_assert_for_2): Same.
23710 (determine_value_range_1): Same.
23711 * tree.c (get_range_pos_neg): Same.
23712 * vr-values.c (vr_values::get_lattice_entry): Same.
23713 (vr_values::update_value_range): Same.
23714 (simplify_conversion_using_ranges): Same.
23716 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23718 * gimple-ssa-warn-alloca.c (alloca_call_type): Use
23719 get_range_query instead of query argument.
23720 (pass_walloca::execute): Enable and disable global ranger.
23722 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23724 * gimple-ssa-evrp.c (rvrp_folder::rvrp_folder): Call
23726 (rvrp_folder::~rvrp_folder): Call disable_ranger.
23727 (hybrid_folder::hybrid_folder): Call enable_ranger.
23728 (hybrid_folder::~hybrid_folder): Call disable_ranger.
23730 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23732 * function.c (allocate_struct_function): Set cfun->x_range_query.
23733 * function.h (struct function): Declare x_range_query.
23734 (get_range_query): New.
23735 (get_global_range_query): New.
23736 * gimple-range-cache.cc (ssa_global_cache::ssa_global_cache):
23737 Remove call to safe_grow_cleared.
23738 * gimple-range.cc (get_range_global): New.
23739 (gimple_range_global): Move from gimple-range.h.
23740 (get_global_range_query): New.
23741 (global_range_query::range_of_expr): New.
23742 (enable_ranger): New.
23743 (disable_ranger): New.
23744 * gimple-range.h (gimple_range_global): Move to gimple-range.cc.
23745 (class global_range_query): New.
23746 (enable_ranger): New.
23747 (disable_ranger): New.
23748 * gimple-ssa-evrp.c (evrp_folder::~evrp_folder): Rename
23749 dump_all_value_ranges to dump.
23750 * tree-vrp.c (vrp_prop::finalize): Same.
23751 * value-query.cc (range_query::dump): New.
23752 * value-query.h (range_query::dump): New.
23753 * vr-values.c (vr_values::dump_all_value_ranges): Rename to...
23754 (vr_values::dump): ...this.
23755 * vr-values.h (class vr_values): Rename dump_all_value_ranges to
23756 dump and make virtual.
23758 2021-05-26 Uroš Bizjak <ubizjak@gmail.com>
23760 * config/i386/i386.c (ix86_autovectorize_vector_modes):
23761 Add V4QImode and V16QImode for TARGET_SSE2.
23762 * doc/sourcebuild.texi (Vector-specific attributes):
23763 Add vect64 and vect32 description.
23765 2021-05-26 Bernd Edlinger <bernd.edlinger@hotmail.de>
23767 * gimple-range-gori.cc (range_def_chain::register_dependency):
23768 Resize m_def_chain when needed.
23770 2021-05-26 Christophe Lyon <christophe.lyon@linaro.org>
23772 * config/arm/mve.md (mve_vaddvq_<supf><mode>): Prefix with '@'.
23773 * config/arm/neon.md (reduc_plus_scal_<mode>): Move to ..
23774 * config/arm/vec-common.md: .. here. Add support for MVE.
23776 2021-05-26 Jakub Jelinek <jakub@redhat.com>
23778 * config/epiphany/epiphany.c (epiphany_print_operand_address): Remove
23780 * config/microblaze/microblaze.c (microblaze_legitimize_address,
23782 microblaze_option_override, print_operand): Likewise.
23783 * config/microblaze/microblaze.md (call_internal_plt,
23784 call_value_intern_plt, call_value_intern): Likewise.
23785 * config/arm/aout.h (ASM_OUTPUT_ALIGN): Likewise.
23786 * config/iq2000/iq2000.md (call_internal1, call_value_internal1,
23787 call_value_multiple_internal1): Likewise.
23788 * config/bfin/bfin.c (symbolic_reference_mentioned_p): Likewise.
23790 2021-05-26 Jan-Benedict Glaw <jbglaw@lug-owl.de>
23792 * config/arc/arc.c (arc_address_cost, arc_print_operand_address,
23793 arc_ccfsm_advance, symbolic_reference_mentioned_p,
23794 arc_raw_symbolic_reference_mentioned_p): Remove register
23797 2021-05-26 Jakub Jelinek <jakub@redhat.com>
23800 * omp-low.c: Include omp-offload.h.
23801 (create_omp_child_function): If current_function_decl has
23802 "omp declare target" attribute and is_gimple_omp_offloaded,
23803 remove that attribute from the copy of attribute list and
23804 add "omp target entrypoint" attribute instead.
23805 (lower_omp_target): Mark .omp_data_sizes.* and .omp_data_kinds.*
23806 variables for offloading if in omp_maybe_offloaded_ctx.
23807 * omp-offload.c (pass_omp_target_link::execute): Nullify second
23808 argument to GOMP_target_data_ext in offloaded code.
23810 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23812 * config/csky/csky.c (csky_can_change_mode_class): Delete.
23813 For csky, HF/SF mode use the low bits of VREGS.
23815 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
23817 * gimplify.c (gimplify_decl_expr): Do not clear TREE_READONLY on a
23818 DECL which is a reference for OMP.
23820 2021-05-26 Martin Liska <mliska@suse.cz>
23822 PR gcov-profile/100751
23823 * doc/gcov.texi: Document that __gcov_dump can be called just
23824 once and that __gcov_reset resets run-time counters.
23826 2021-05-26 Martin Liska <mliska@suse.cz>
23828 * doc/install.texi: Port relevant part from install-old.texi
23829 and re-generate list of CPUs and systems.
23831 2021-05-26 Martin Liska <mliska@suse.cz>
23833 * Makefile.in: Remove it.
23834 * doc/include/fdl.texi: Update next/previous chapters.
23835 * doc/install.texi: Likewise.
23836 * doc/install-old.texi: Removed.
23838 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23840 * config/csky/csky.c (ck810_legitimate_index_p): Support
23841 "base + index" with DF mode.
23842 * config/csky/constraints.md ("Y"): New constraint for memory operands
23843 without index register.
23844 * config/csky/csky_insn_fpuv2.md (fpuv3_movdf): Use "Y" instead of "m"
23845 when mov between memory and general registers, and lower their priority.
23846 * config/csky/csky_insn_fpuv3.md (fpuv2_movdf): Likewise.
23848 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23850 * config/csky/csky.c (TARGET_PROMOTE_PROTOTYPES): Delete.
23852 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23854 * config/csky/csky.md (untyped_call): Emit clobber for return
23855 registers to mark them used.
23857 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23859 * config/csky/csky.md (cskyv2_sextend_ldbs): New.
23861 2021-05-26 Andrew Pinski <apinski@marvell.com>
23863 * match.pd (x < 0 ? ~y : y): New patterns.
23865 2021-05-26 Andrew Pinski <apinski@marvell.com>
23867 * match.pd (A?CST1:CST2): Add simplifcations for A?0:+-1, A?+-1:0,
23868 A?POW2:0 and A?0:POW2.
23870 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23872 * gimple-range-gori.cc (class logical_stmt_cache): Delete
23873 (logical_stmt_cache::logical_stmt_cache ): Delete.
23874 (logical_stmt_cache::~logical_stmt_cache): Delete.
23875 (logical_stmt_cache::cache_entry::dump): Delete.
23876 (logical_stmt_cache::get_range): Delete.
23877 (logical_stmt_cache::cached_name ): Delete.
23878 (logical_stmt_cache::same_cached_name): Delete.
23879 (logical_stmt_cache::cacheable_p): Delete.
23880 (logical_stmt_cache::slot_diagnostics ): Delete.
23881 (logical_stmt_cache::dump): Delete.
23882 (gori_compute_cache::gori_compute_cache): Delete.
23883 (gori_compute_cache::~gori_compute_cache): Delete.
23884 (gori_compute_cache::compute_operand_range): Delete.
23885 (gori_compute_cache::cache_stmt): Delete.
23886 * gimple-range-gori.h (gori_compute::compute_operand_range): Remove
23888 (class gori_compute_cache): Delete.
23890 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23892 * gimple-range.cc (fold_using_range::range_of_range_op): Use m_gori
23894 (fold_using_range::range_of_address): Adjust.
23895 (fold_using_range::range_of_phi): Adjust.
23896 * gimple-range.h (class fur_source): Adjust.
23897 (fur_source::fur_source): Adjust.
23899 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23901 * gimple-range-gori.cc (gori_compute::expr_range_at_stmt): Rename
23902 from expr_range_in_bb and adjust.
23903 (gori_compute::compute_name_range_op): Adjust.
23904 (gori_compute::optimize_logical_operands): Adjust.
23905 (gori_compute::compute_logical_operands_in_chain): Adjust.
23906 (gori_compute::compute_operand1_range): Adjust.
23907 (gori_compute::compute_operand2_range): Adjust.
23908 (ori_compute_cache::cache_stmt): Adjust.
23909 * gimple-range-gori.h (gori_compute): Rename prototype.
23911 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23913 * gimple-range.cc (gimple_ranger::range_of_expr): Non-null should be
23914 checked only after range_of_stmt, not range_on_entry.
23915 (gimple_ranger::range_on_entry): Check for non-null in any
23916 predecessor block, if it is not already non-null.
23917 (gimple_ranger::range_on_exit): DOnt check for non-null after
23918 range on entry call.
23919 (gimple_ranger::dump_bb): New. Split from dump.
23920 (gimple_ranger::dump): Adjust.
23921 * gimple-range.h (class gimple_ranger): Adjust.
23923 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23925 * gimple-range-cache.cc (struct range_timestamp): Delete.
23926 (class temporal_cache): Adjust.
23927 (temporal_cache::get_timestamp): Delete.
23928 (temporal_cache::set_dependency): Delete.
23929 (temporal_cache::temporal_value): Adjust.
23930 (temporal_cache::current_p): Take dependencies as params.
23931 (temporal_cache::set_timestamp): Adjust.
23932 (temporal_cache::set_always_current): Adjust.
23933 (ranger_cache::get_non_stale_global_range): Adjust.
23934 (ranger_cache::register_dependency): Delete.
23935 * gimple-range-cache.h (class range_cache): Adjust.
23937 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23939 * gimple-range-gori.cc (range_def_chain::range_def_chain): init
23941 (range_def_chain::~range_def_chain): Dispose of obstack rather than
23942 each individual bitmap.
23943 (range_def_chain::set_import): New.
23944 (range_def_chain::get_imports): New.
23945 (range_def_chain::chain_import_p): New.
23946 (range_def_chain::register_dependency): Rename from build_def_chain
23948 (range_def_chain::def_chain_in_bitmap_p): New.
23949 (range_def_chain::add_def_chain_to_bitmap): New.
23950 (range_def_chain::has_def_chain): Just check first depenedence.
23951 (range_def_chain::get_def_chain): Process imports, use generic
23952 register_dependency routine.
23953 (range_def_chain::dump): New.
23954 (gori_map::gori_map): Allocate import list.
23955 (gori_map::~gori_map): Release imports.
23956 (gori_map::exports): Check for past allocated block size.
23957 (gori_map::imports): New.
23958 (gori_map::def_chain_in_export_p): Delete.
23959 (gori_map::is_import_p): New.
23960 (gori_map::maybe_add_gori): Handle imports.
23961 (gori_map::dump): Adjust output, add imports.
23962 (gori_compute::has_edge_range_p): Remove def_chain_in_export call.
23963 (gori_export_iterator::gori_export_iterator): New.
23964 (gori_export_iterator::next): New.
23965 (gori_export_iterator::get_name): New.
23966 * gimple-range-gori.h (range_def_chain): Add imports and direct
23967 dependecies via struct rdc.
23968 (range_def_chain::depend1): New.
23969 (range_def_chain::depend2): New.
23970 (class gori_map): Adjust.
23971 (FOR_EACH_GORI_IMPORT_NAME): New.
23972 (FOR_EACH_GORI_EXPORT_NAME): New.
23973 (class gori_export_iterator): New.
23975 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23977 * gimple-range-cache.cc (ranger_cache::ranger_cache): Move initial
23978 export cache filling to here.
23979 * gimple-range-gori.cc (gori_compute::gori_compute) : From Here.
23981 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23983 * gimple-range-gori.cc (range_def_chain): Move to gimple-range-gori.h.
23984 (gori_map): Move to gimple-range-gori.h.
23985 (gori_compute::gori_compute): Adjust.
23986 (gori_compute::~gori_compute): Delete.
23987 (gori_compute::compute_operand_range_switch): Adjust.
23988 (gori_compute::compute_operand_range): Adjust.
23989 (gori_compute::compute_logical_operands): Adjust.
23990 (gori_compute::has_edge_range_p ): Adjust.
23991 (gori_compute::set_range_invariant): Delete.
23992 (gori_compute::dump): Adjust.
23993 (gori_compute::outgoing_edge_range_p): Adjust.
23994 * gimple-range-gori.h (class range_def_chain): Relocate here.
23995 (class gori_map): Relocate here.
23996 (class gori_compute): Inherit from gori_map, and adjust.
23998 2021-05-25 Aldy Hernandez <aldyh@redhat.com>
24000 * value-range.cc (range_tests_legacy): Use
24001 build_nonstandard_integer_type instead of int and short.
24003 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
24005 * gimplify.c (gimplify_decl_expr): Clear TREE_READONLY on the DECL
24006 when really creating an initialization statement for it.
24008 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
24010 * tree-inline.c (setup_one_parameter): Fix thinko in new condition.
24012 2021-05-25 Kito Cheng <kito.cheng@sifive.com>
24014 * config/riscv/riscv.h (ASM_SPEC): Pass -mno-relax.
24016 2021-05-25 Martin Liska <mliska@suse.cz>
24018 PR tree-optimization/92860
24020 * optc-save-gen.awk: Remove exceptions.
24022 2021-05-25 Martin Liska <mliska@suse.cz>
24024 * asan.h (sanitize_coverage_p): New function.
24025 * doc/extend.texi: Document it.
24026 * fold-const.c (fold_range_test): Use sanitize_flags_p
24027 instead of flag_sanitize_coverage.
24028 (fold_truth_andor): Likewise.
24029 * sancov.c: Likewise.
24030 * tree-ssa-ifcombine.c (ifcombine_ifandif): Likewise.
24031 * ipa-inline.c (sanitize_attrs_match_for_inline_p): Handle
24032 -fsanitize-coverage when inlining.
24034 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
24036 * config/csky/csky-modes.def : Fix copyright.
24038 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
24040 * config/csky/csky-modes.def : Amend copyright.
24041 * config/csky/csky_insn_fpuv2.md : Likewise.
24042 * config/csky/csky_insn_fpuv3.md : Likewise.
24044 2021-05-25 Richard Biener <rguenther@suse.de>
24046 PR middle-end/100727
24047 * calls.c (initialize_argument_information): Explicitely test
24048 for WITH_SIZE_EXPR.
24049 * gimple-expr.c (mark_addressable): Skip outer WITH_SIZE_EXPR.
24051 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
24053 * config/csky/csky.h (FRAME_POINTER_REGNUM): Use
24054 HARD_FRAME_POINTER_REGNUM and FRAME_POINTER_REGNUM instead of
24055 the signle definition. The signle definition may not work well
24056 at simplify_subreg_regno().
24057 (HARD_FRAME_POINTER_REGNUM): New.
24058 (ELIMINABLE_REGS): Add for HARD_FRAME_POINTER_REGNUM.
24059 * config/csky/csky.c (get_csky_live_regs, csky_can_eliminate,
24060 csky_initial_elimination_offset, csky_expand_prologue,
24061 csky_expand_epilogue): Add for HARD_FRAME_POINTER_REGNUM.
24063 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
24065 * config/csky/csky.c (csky_option_override):
24066 Init csky_arch_isa_features[] in advance, so TARGET_DSP
24067 and TARGET_DIV can be set well.
24069 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
24071 * config/csky/constraints.md ("l", "h"): Delete.
24072 * config/csky/csky.h (reg_class, REG_CLASS_NAMES,
24073 REG_CLASS_CONTENTS): Delete LO_REGS and HI_REGS.
24074 * config/csky/csky.c (regno_reg_classm,
24075 csky_secondary_reload, csky_register_move_cost):
24076 Use HILO_REGS instead of LO_REGS and HI_REGS.
24078 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
24080 * config/csky/constraints.md ("W"): New constriant for mem operand
24081 with base reg, index register.
24082 ("Q"): Renamed and modified "csky_valid_fpuv2_mem_operand" to
24083 "csky_valid_mem_constraint_operand" to deal with both "Q" and "W"
24085 ("Dv"): New constraint for const double value that can be used at
24087 * config/csky/csky-modes.def (HFmode): New mode.
24088 * config/csky/csky-protos.h (csky_valid_fpuv2_mem_operand): Rename
24089 to "csky_valid_mem_constraint_operand" and support new constraint
24091 (csky_get_movedouble_length): New.
24092 (fpuv3_output_move): New.
24093 (fpuv3_const_double): New.
24094 * config/csky/csky.c (csky_option_override): New arch CK860 with fpv3.
24095 (decompose_csky_address): Refine.
24096 (csky_print_operand): New "CONST_DOUBLE" operand.
24097 (csky_output_move): Support fpv3 instructions.
24098 (csky_get_movedouble_length): New.
24099 (fpuv3_output_move): New.
24100 (fpuv3_const_double): New.
24101 (csky_emit_compare): Cover float comparsion.
24102 (csky_emit_compare_float): Refine.
24103 (csky_vaild_fpuv2_mem_operand): Rename to
24104 "csky_valid_mem_constraint_operand" and support new constraint "W".
24105 (ck860_rtx_costs): New.
24106 (csky_rtx_costs): Add the cost calculation of CK860.
24107 (regno_reg_class): New vregs for fpuv3.
24108 (csky_dbx_regno): Likewise.
24109 (csky_cpu_cpp_builtins): New builtin macro for fpuv3.
24110 (csky_conditional_register_usage): Suporrot fpuv3.
24111 (csky_dwarf_register_span): Suporrot fpuv3.
24112 (csky_init_builtins, csky_mangle_type): Support "__fp16" type.
24113 (ck810_legitimate_index_p): Support fp16.
24114 * config/csky/csky.h (TARGET_TLS): ADD CK860.
24115 (CSKY_VREG_P, CSKY_VREG_LO_P, CSKY_VREG_HI_P): Support fpuv3.
24116 (TARGET_SINGLE_FPU): Support fpuv3.
24117 (TARGET_SUPPORT_FPV3): New.
24118 (FIRST_PSEUDO_REGISTER): Change to 202 to hold the new fpuv3 registers.
24119 (FIXED_REGISTERS, CALL_REALLY_USED_REGISTERS, REGISTER_NAMES,
24120 REG_CLASS_CONTENTS): Support fpuv3.
24121 * config/csky/csky.md (movsf): Move to cksy_insn_fpu.md and refine.
24122 (csky_movsf_fpv2): Likewise.
24123 (ck801_movsf): Likewise.
24124 (csky_movsf): Likewise.
24126 (csky_movdf_fpv2): Likewise.
24127 (ck801_movdf): Likewise.
24128 (csky_movdf): Likewise.
24129 (movsicc): Refine. Use "comparison_operatior" instead of
24130 "ordered_comparison_operatior".
24131 (addsicc): Likewise.
24132 (CSKY_FIRST_VFP3_REGNUM, CSKY_LAST_VFP3_REGNUM): New constant.
24133 (call_value_internal_vh): New.
24134 * config/csky/csky_cores.def (CK860): New arch and cpu.
24139 * config/csky/csky_insn_fpu.md: Refactor. Separate all float patterns
24140 into emit-patterns and match-patterns, remain the emit-patterns here,
24141 and move the match-patterns to csky_insn_fpuv2.md or
24142 csky_insn_fpuv3.md.
24143 * config/csky/csky_insn_fpuv2.md: New file for fpuv2 instructions.
24144 * config/csky/csky_insn_fpuv3.md: New file and new patterns for fpuv3
24146 * config/csky/csky_isa.def (fcr): New.
24151 (CK860): New definition for ck860.
24152 * config/csky/csky_tables.opt (ck860): New processors ck860,
24153 ck860f. And new arch ck860.
24158 * config/csky/predicates.md (csky_float_comparsion_operator): Delete
24159 "geu", "gtu", "leu", "ltu", which will never appear at float comparison.
24160 * config/csky/t-csky-elf: Support 860.
24161 * config/csky/t-csky-linux: Likewise.
24162 * doc/md.texi: Add "Q" and "W" constraints for C-SKY.
24164 2021-05-24 Aaron Sawdey <acsawdey@linux.ibm.com>
24166 * config/rs6000/genfusion.pl (gen_logical_addsubf): Refactor to
24167 add generation of logical-add and add-logical fusion pairs.
24168 * config/rs6000/rs6000-cpus.def: Add new fusion to ISA 3.1 mask
24170 * config/rs6000/rs6000.c (rs6000_option_override_internal): Turn on
24171 logical-add and add-logical fusion by default.
24172 * config/rs6000/rs6000.opt: Add -mpower10-fusion-logical-add and
24173 -mpower10-fusion-add-logical options.
24174 * config/rs6000/fusion.md: Regenerate file.
24176 2021-05-24 Aldy Hernandez <aldyh@redhat.com>
24178 * value-range.cc (irange::legacy_equal_p): Check type when
24179 comparing VR_VARYING types.
24180 (range_tests_legacy): Test comparing VARYING ranges of different
24183 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
24185 * config/aarch64/aarch64.c (neoversen1_tunings):
24186 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
24188 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
24190 * config/aarch64/aarch64.c (aarch64_classify_symbol): Use GOT for
24191 extern weak symbols. Limit symbol offsets for non-GOT symbols with
24194 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
24196 * config/arm/neon.md (vec_load_lanesxi<mode>)
24197 (vec_store_lanexoi<mode>): Move ...
24198 * config/arm/vec-common.md: here.
24200 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
24202 * config/arm/neon.md (vec_load_lanesoi<mode>)
24203 (vec_store_lanesoi<mode>): Move ...
24204 * config/arm/vec-common.md: here.
24206 2021-05-24 liuhongt <hongtao.liu@intel.com>
24209 * config/i386/i386.c (ix86_gimple_fold_builtin): Replacing
24210 stmt with GIMPLE_NOP when lhs doesn't exist.
24212 2021-05-23 Uroš Bizjak <ubizjak@gmail.com>
24215 * config/i386/mmx.md (*push<VI_32:mode>2_rex64):
24216 New instruction pattern.
24217 (*push<VI_32:mode>2): Ditto.
24218 (push splitter for SSE registers): New splitter.
24220 2021-05-23 Andrew Pinski <apinski@marvell.com>
24222 * match.pd ((A & C) != 0 ? D : 0): Limit to non pointer types.
24224 2021-05-22 Aaron Sawdey <acsawdey@linux.ibm.com>
24226 * config/rs6000/genfusion.pl (gen_addadd): Fix incorrect attr types.
24227 * config/rs6000/fusion.md: Regenerate file.
24229 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
24231 * config/rs6000/genfusion.pl (gen_addadd): New function.
24232 * config/rs6000/fusion.md: Regenerate file.
24233 * config/rs6000/rs6000-cpus.def: Add
24234 OPTION_MASK_P10_FUSION_2ADD to masks.
24235 * config/rs6000/rs6000.c (rs6000_option_override_internal):
24236 Handle default value of OPTION_MASK_P10_FUSION_2ADD.
24237 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2add.
24239 2021-05-21 Jakub Jelinek <jakub@redhat.com>
24241 PR middle-end/99928
24242 * tree.h (OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET): Define.
24243 * gimplify.c (enum gimplify_omp_var_data): Fix up
24244 GOVD_MAP_HAS_ATTACHMENTS value, add GOVD_FIRSTPRIVATE_IMPLICIT.
24245 (omp_lastprivate_for_combined_outer_constructs): If combined target
24246 has GOVD_FIRSTPRIVATE_IMPLICIT set for the decl, change it to
24247 GOVD_MAP | GOVD_SEEN.
24248 (gimplify_scan_omp_clauses): Set GOVD_FIRSTPRIVATE_IMPLICIT for
24249 firstprivate clauses with OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT.
24250 (gimplify_adjust_omp_clauses): For firstprivate clauses with
24251 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT either clear that bit and
24252 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET too, or remove it and
24253 let it be replaced by implicit map clause.
24255 2021-05-21 Jakub Jelinek <jakub@redhat.com>
24257 PR middle-end/99928
24258 * gimplify.c (omp_lastprivate_for_combined_outer_constructs): New
24260 (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LASTPRIVATE>: Use it.
24261 (gimplify_omp_for): Likewise.
24263 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24265 PR middle-end/90115
24266 * omp-low.c (oacc_privatization_candidate_p): Reject 'static',
24267 'external' in blocks.
24269 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24271 PR middle-end/90115
24272 * flag-types.h (enum openacc_privatization): New.
24273 * params.opt (-param=openacc-privatization): New.
24274 * doc/invoke.texi (openacc-privatization): Document it.
24275 * omp-general.h (get_openacc_privatization_dump_flags): New
24277 * omp-low.c (oacc_privatization_candidate_p): Add diagnostics.
24278 * omp-offload.c (execute_oacc_device_lower)
24279 <IFN_UNIQUE_OACC_PRIVATE>: Re-work diagnostics.
24280 * target.def (goacc.adjust_private_decl): Add 'location_t'
24282 * doc/tm.texi: Regenerate.
24283 * config/gcn/gcn-protos.h (gcn_goacc_adjust_private_decl): Adjust.
24284 * config/gcn/gcn-tree.c (gcn_goacc_adjust_private_decl): Likewise.
24285 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl):
24286 Likewise. Preserve it for...
24287 (nvptx_goacc_expand_var_decl): ... use here.
24289 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24291 * doc/sourcebuild.texi (Other attributes): Document '__OPTIMIZE__'
24294 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24296 PR middle-end/90115
24297 * omp-low.c (oacc_privatization_candidate_p): New function.
24298 (oacc_privatization_scan_clause_chain)
24299 (oacc_privatization_scan_decl_chain): Use it. Also
24300 'gcc_checking_assert' that we're not seeing duplicates.
24302 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24304 PR middle-end/90115
24305 * omp-offload.c (execute_oacc_device_lower): Skip processing if no
24308 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24310 PR middle-end/90115
24311 * omp-offload.c (execute_oacc_device_lower): Explain.
24313 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24315 PR middle-end/90115
24316 * omp-offload.c (execute_oacc_device_lower)
24317 <IFN_UNIQUE_OACC_PRIVATE>: Diagnose and handle for 'level == -1'
24319 * internal-fn.c (expand_UNIQUE): Don't expect
24320 'IFN_UNIQUE_OACC_PRIVATE'.
24322 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24324 PR middle-end/90115
24325 * omp-low.c (lower_omp_for): Don't evaluate OpenMP 'for' clauses.
24327 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
24329 PR middle-end/90115
24330 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl)
24331 (nvptx_goacc_expand_var_decl): Tighten.
24333 2021-05-21 Julian Brown <julian@codesourcery.com>
24334 Chung-Lin Tang <cltang@codesourcery.com>
24335 Thomas Schwinge <thomas@codesourcery.com>
24337 PR middle-end/90115
24338 * doc/tm.texi.in (TARGET_GOACC_EXPAND_VAR_DECL)
24339 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Add documentation hooks.
24340 * doc/tm.texi: Regenerate.
24341 * expr.c (expand_expr_real_1): Expand decls using the
24342 expand_var_decl OpenACC hook if defined.
24343 * internal-fn.c (expand_UNIQUE): Handle IFN_UNIQUE_OACC_PRIVATE.
24344 * internal-fn.h (IFN_UNIQUE_CODES): Add OACC_PRIVATE.
24345 * omp-low.c (omp_context): Add oacc_privatization_candidates
24347 (lower_oacc_reductions): Add PRIVATE_MARKER parameter. Insert
24349 (lower_oacc_head_tail): Add PRIVATE_MARKER parameter. Modify
24350 private marker's gimple call arguments, and pass it to
24351 lower_oacc_reductions.
24352 (oacc_privatization_scan_clause_chain)
24353 (oacc_privatization_scan_decl_chain, lower_oacc_private_marker):
24355 (lower_omp_for, lower_omp_target, lower_omp_1): Use these.
24356 * omp-offload.c (convert.h): Include.
24357 (oacc_loop_xform_head_tail): Treat private-variable markers like
24358 fork/join when transforming head/tail sequences.
24359 (struct var_decl_rewrite_info): Add struct.
24360 (oacc_rewrite_var_decl, is_sync_builtin_call): New functions.
24361 (execute_oacc_device_lower): Support rewriting gang-private
24362 variables using target hook, and fix up addr_expr and var_decl
24364 * target.def (adjust_private_decl, expand_var_decl): New hooks.
24365 * config/gcn/gcn-protos.h (gcn_goacc_adjust_gangprivate_decl):
24367 (gcn_goacc_adjust_private_decl): ...this.
24368 * config/gcn/gcn-tree.c (gcn_goacc_adjust_gangprivate_decl):
24370 (gcn_goacc_adjust_private_decl): ...this. Add LEVEL parameter.
24371 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_GANGPRIVATE_DECL): Rename
24372 definition using gcn_goacc_adjust_gangprivate_decl...
24373 (TARGET_GOACC_ADJUST_PRIVATE_DECL): ...to this, using
24374 gcn_goacc_adjust_private_decl.
24375 * config/nvptx/nvptx.c (tree-pretty-print.h): Include.
24376 (gang_private_shared_size): New global variable.
24377 (gang_private_shared_align): Likewise.
24378 (gang_private_shared_sym): Likewise.
24379 (gang_private_shared_hmap): Likewise.
24380 (nvptx_option_override): Initialize these.
24381 (nvptx_file_end): Output gang_private_shared_sym.
24382 (nvptx_goacc_adjust_private_decl, nvptx_goacc_expand_var_decl):
24384 (nvptx_set_current_function): Clear gang_private_shared_hmap.
24385 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Define hook.
24386 (TARGET_GOACC_EXPAND_VAR_DECL): Likewise.
24388 2021-05-21 H.J. Lu <hjl.tools@gmail.com>
24390 * config/i386/i386-modes.def (MAX_BITSIZE_MODE_ANY_INT): Removed.
24392 2021-05-21 Richard Biener <rguenther@suse.de>
24393 H.J. Lu <hjl.tools@gmail.com>
24395 PR middle-end/90773
24396 * expr.c (expand_constructor): Elide expand_constructor if
24397 move by pieces is preferred.
24399 2021-05-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24401 * config/aarch64/aarch64-builtins.c (aarch64_call_properties):
24402 Take a flag and mode value as arguments.
24403 (aarch64_modifies_global_state_p): Likewise.
24404 (aarch64_reads_global_state_p): Likewise.
24405 (aarch64_could_trap_p): Likewise.
24406 (aarch64_get_attributes): Likewise.
24407 (aarch64_init_simd_builtins): Adjust callsite of above.
24408 (aarch64_init_fcmla_laneq_builtins): Use aarch64_get_attributes to get
24409 function attributes to apply to builtins.
24410 (aarch64_init_crc32_builtins): Likewise.
24411 (aarch64_init_builtin_rsqrt): Likewise.
24413 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
24415 * config/rs6000/rs6000.md (define_attr "type"): Add types for fusion.
24416 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10): Use new fusion types.
24417 (gen_2logical): Use new fusion types.
24418 * config/rs6000/fusion.md: Regenerate.
24420 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
24423 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
24424 Handle V4QI and V2HI modes.
24425 (ix86_expand_sse_movcc): Ditto.
24426 * config/i386/mmx.md (*<sat_plusminus:insn><VI_32:mode>3):
24427 New instruction pattern.
24428 (*eq<VI_32:mode>3): Ditto.
24429 (*gt<VI_32:mode>3): Ditto.
24430 (*xop_pcmov_<VI_32:mode>): Ditto.
24431 (mmx_pblendvb32): Ditto.
24432 (mmx_pblendvb64): Rename from mmx_pblendvb.
24433 (vec_cmp<VI_32:mode><VI_32:mode>): New expander.
24434 (vec_cmpu<VI_32:mode><VI_32:mode>): Ditto.
24435 (vcond<VI_32:mode><VI_32:mode>): Ditto.
24436 (vcondu<VI_32:mode><VI_32:mode>): Ditto.
24437 (vcond_mask_<VI_32:mode><VI_32:mode>): Ditto.
24439 2021-05-21 Jakub Jelinek <jakub@redhat.com>
24441 PR tree-optimization/94589
24442 * tree-ssa-phiopt.c (spaceship_replacement): For integral rhs1 and
24443 rhs2, treat x <= 4 equivalently to x < 5 etc. In cmp1 and cmp2 (if
24444 not the same as cmp3) treat <= the same as < and >= the same as >.
24445 Don't require that cond2_phi_edge is true edge, instead take
24446 false/true edges into account based on cmp1/cmp2 comparison kinds.
24448 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
24451 * config/i386/mmx.md (SMAXMIN_MMXMODEI): New mode iterator.
24452 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Macroize expander
24453 from <smaxmin:code>v4hi3> and <smaxmin:code><MMXMODE14:mode>3
24454 using SMAXMIN_MMXMODEI mode iterator.
24455 (*<smaxmin:code>v4qi3): New insn pattern.
24456 (*<smaxmin:code>v2hi3): Ditto.
24457 (SMAXMIN_VI_32): New mode iterator.
24458 (<smaxmin:code><SMAXMIN_VI_32>mode3): New expander.
24459 (UMAXMIN_MMXMODEI): New mode iterator.
24460 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Macroize expander
24461 from <umaxmin:code>v8qi3> and <umaxmin:code><MMXMODE24:mode>3
24462 using UMAXMIN_MMXMODEI mode iterator.
24463 (*<umaxmin:code>v4qi3): New insn pattern.
24464 (*<umaxmin:code>v2hi3): Ditto.
24465 (UMAXMIN_VI_32): New mode iterator.
24466 (<umaxmin:code><UMAXMIN_VI_32>mode3): New expander.
24467 (abs<VI_32:mode>2): New insn pattern.
24468 (ssse3_abs<MMXMODEI:mode>2, abs<MMXMODEI:mode>2): Move from ...
24469 * config/i386/sse.md: ... here.
24471 2021-05-20 Clement Chigot <clement.chigot@atos.net>
24472 David Edelsohn <dje.gcc@gmail.com>
24474 * collect2.c (scan_prog_file): Issue non-fatal warning for
24477 2021-05-20 Jonathan Wakely <jwakely@redhat.com>
24479 * doc/invoke.texi (-Wno-c++11-extensions)
24480 (-Wno-c++14-extensions, -Wno-c++17-extensions)
24481 (-Wno-c++20-extensions, -Wno-c++23-extensions): Document
24484 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
24486 * config/c6x/c6x.c (c6x_output_file_unwind): Use dwarf_debuginfo_p.
24487 * config/darwin.c (darwin_override_options): Likewise.
24488 * config/i386/cygming.h (DBX_REGISTER_NUMBER): Likewise.
24489 * config/i386/darwin.h (DBX_REGISTER_NUMBER): Likewise.
24490 (DWARF2_FRAME_REG_OUT): Likewise.
24491 * config/mips/mips.c (mips_output_filename): Likewise.
24492 * config/rs6000/rs6000.c (rs6000_xcoff_declare_function_name):
24494 (rs6000_dbx_register_number): Likewise.
24495 * dbxout.c: Include flags.h.
24496 * dwarf2cfi.c (cfi_label_required_p): Likewise.
24497 (dwarf2out_do_frame): Likewise.
24498 * except.c: Include flags.h.
24499 * final.c (dwarf2_debug_info_emitted_p): Likewise.
24500 (final_scan_insn_1): Likewise.
24501 * flags.h (dwarf_debuginfo_p): New function declaration.
24502 * opts.c (dwarf_debuginfo_p): New function definition.
24503 * targhooks.c (default_debug_unwind_info): Use dwarf_debuginfo_p.
24504 * toplev.c (process_options): Likewise.
24506 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
24508 * common.opt: Change type to support bitmasks.
24509 * flag-types.h (enum debug_info_type): Rename enumerator constants.
24510 (NO_DEBUG): New bitmask.
24511 (DBX_DEBUG): Likewise.
24512 (DWARF2_DEBUG): Likewise.
24513 (XCOFF_DEBUG): Likewise.
24514 (VMS_DEBUG): Likewise.
24515 (VMS_AND_DWARF2_DEBUG): Likewise.
24516 * flags.h (debug_set_to_format): New function declaration.
24517 (debug_set_count): Likewise.
24518 (debug_set_names): Likewise.
24519 * opts.c (debug_type_masks): Array of bitmasks for debug formats.
24520 (debug_set_to_format): New function definition.
24521 (debug_set_count): Likewise.
24522 (debug_set_names): Likewise.
24523 (set_debug_level): Update access to debug_type_names.
24524 * toplev.c: Likewise.
24526 2021-05-20 Martin Sebor <msebor@redhat.com>
24528 PR middle-end/100684
24529 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Handle C++ lambda.
24531 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
24534 * config/i386/i386.md (isa): Remove x64_bmi.
24535 (enabled): Remove x64_bmi.
24536 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
24537 Remove general register alternative.
24538 (*andnot<VI_32:mode>3): Ditto.
24539 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
24540 (*<any_logic:code><VI_32:mode>3): Ditto.
24542 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
24544 * config/arm/arm.c: Include head files tree-vectorizer.h and
24547 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
24550 * config/i386/mmx.md (Yv_Yw): Revert adding V4QI and V2HI modes.
24551 (*<plusminus:insn><VI_32:mode>3): Use Yw instad of <Yv_Yw> constrint.
24552 (<s>mulv4hi3_highpart): New expander.
24553 (*<s>mulv2hi3_highpart): New insn pattern.
24554 (<s>mulv2hi3_higpart): New expander.
24555 (*<any_shift:insn>v2hi3): New insn pattern.
24556 (<any_shift:insn>v2hi3): New expander.
24557 * config/i386/sse.md (smulhrsv2hi3): New expander.
24558 (*smulhrsv2hi3): New insn pattern.
24560 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
24562 * doc/invoke.texi (vect-inner-loop-cost-factor): Document new
24564 * params.opt (vect-inner-loop-cost-factor): New.
24565 * targhooks.c (default_add_stmt_cost): Replace hardcoded factor
24566 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR, include head file
24567 tree-vectorizer.h and its required ones.
24568 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Replace
24569 hardcoded factor 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR.
24570 * config/arm/arm.c (arm_add_stmt_cost): Likewise.
24571 * config/i386/i386.c (ix86_add_stmt_cost): Likewise.
24572 * config/rs6000/rs6000.c (rs6000_add_stmt_cost): Likewise.
24573 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
24575 (_loop_vec_info::_loop_vec_info): Init inner_loop_cost_factor.
24576 * tree-vectorizer.h (_loop_vec_info): Add inner_loop_cost_factor.
24577 (LOOP_VINFO_INNER_LOOP_COST_FACTOR): New macro.
24579 2021-05-20 Christophe Lyon <christophe.lyon@linaro.org>
24580 Torbjörn Svensson <torbjorn.svensson@st.com>
24583 * doc/cpp.texi (Common Predefined Macros): Document __FILE_NAME__.
24585 2021-05-20 Jakub Jelinek <jakub@redhat.com>
24587 PR middle-end/99928
24588 * gimplify.c (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LINEAR>: For
24589 explicit linear clause when combined with target, make it map(tofrom:)
24590 instead of no clause or firstprivate.
24592 2021-05-20 Jakub Jelinek <jakub@redhat.com>
24594 PR tree-optimization/94589
24595 * match.pd ((X & Y) == X -> (X & ~Y) == 0): Simplify even in presence
24596 of integral conversions.
24598 2021-05-19 Andrew MacLeod <amacleod@redhat.com>
24600 * gimple-range.cc (fur_source::get_operand): New.
24601 (gimple_range_fold): Delete.
24602 (fold_using_range::fold_stmt): Move from gimple_ranger::calc_stmt.
24603 (fold_using_range::range_of_range_op): Move from gimple_ranger.
24604 (fold_using_range::range_of_address): Ditto.
24605 (fold_using_range::range_of_phi): Ditto.
24606 (fold_using_range::range_of_call): Ditto.
24607 (fold_using_range::range_of_builtin_ubsan_call): Move from
24608 range_of_builtin_ubsan_call.
24609 (fold_using_range::range_of_builtin_call): Move from
24610 range_of_builtin_call.
24611 (gimple_ranger::range_of_builtin_call): Delete.
24612 (fold_using_range::range_of_cond_expr): Move from gimple_ranger.
24613 (gimple_ranger::fold_range_internal): New.
24614 (gimple_ranger::range_of_stmt): Use new fold_using_range API.
24615 (fold_using_range::range_of_ssa_name_with_loop_info): Move from
24616 gimple_ranger. Improve ranges of SSA_NAMES when possible.
24617 * gimple-range.h (gimple_ranger): Remove various range_of routines.
24618 (class fur_source): New.
24619 (class fold_using_range): New.
24620 (fur_source::fur_source): New.
24622 * vr-values.c (vr_values::extract_range_basic): Use fold_using_range
24623 instead of range_of_builtin_call.
24625 2021-05-19 Jonathan Wakely <jwakely@redhat.com>
24627 * doc/cpp.texi (Common Predefined Macros): Update documentation
24628 for the __GXX_EXPERIMENTAL_CXX0X__ macro.
24630 2021-05-19 Alex Coplan <alex.coplan@arm.com>
24633 * config/arm/arm.md (nonsecure_call_internal): Always ensure
24634 callee's address is in a register.
24636 2021-05-19 Geng Qi <gengqi@linux.alibaba.com>
24638 * common/config/riscv/riscv-common.c
24639 (riscv_subset_list::parsing_subset_version): Properly parse the letter
24641 (riscv_subset_list::parse_std_ext,
24642 riscv_subset_list::parse_multiletter_ext): To handle errors generated
24643 in riscv_subset_list::parsing_subset_version.
24645 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24647 * config/aarch64/aarch64-simd.md: Use "neon_move_narrow_q"
24648 type attribute in patterns generating XTN(2).
24650 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24652 * config/aarch64/aarch64-simd.md (aarch64_simd_vec_pack_trunc_<mode>):
24653 Remove as duplicate of...
24654 (aarch64_xtn<mode>): This.
24655 (aarch64_xtn2<mode>_le): Move position in file.
24656 (aarch64_xtn2<mode>_be): Move position in file.
24657 (aarch64_xtn2<mode>): Move position in file.
24658 (vec_pack_trunc_<mode>): Define as an expander.
24660 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24662 * config/aarch64/aarch64-simd-builtins.def: Split builtin
24663 generation for aarch64_<sur>q<r>shr<u>n_n<mode> pattern into
24664 separate scalar and vector generators.
24665 * config/aarch64/aarch64-simd.md
24666 (aarch64_<sur>q<r>shr<u>n_n<mode>): Define as an expander and
24668 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_le): This and...
24669 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_be): This.
24670 * config/aarch64/iterators.md: Define SD_HSDI iterator.
24672 2021-05-19 Jonathn Wright <jonathan.wright@arm.com>
24674 * config/aarch64/aarch64-simd.md: Use UNSPEC_SQXTUN instead
24676 * config/aarch64/iterators.md: Remove UNSPEC_SQXTUN2.
24678 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24680 * config/aarch64/aarch64-simd.md (aarch64_<sur>q<r>shr<u>n2_n<mode>):
24681 Implement as an expand emitting a big/little endian
24682 instruction pattern.
24683 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_le): Define.
24684 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_be): Define.
24686 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24688 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn2<mode>):
24689 Implement as an expand emitting a big/little endian
24690 instruction pattern.
24691 (aarch64_<sur><addsub>hn2<mode>_insn_le): Define.
24692 (aarch64_<sur><addsub>hn2<mode>_insn_be): Define.
24693 * config/aarch64/iterators.md: Remove UNSPEC_[R]ADDHN2 and
24694 UNSPEC_[R]SUBHN2 unspecs and ADDSUBHN2 iterator.
24696 2021-05-19 Richard Biener <rguenther@suse.de>
24698 PR middle-end/100672
24699 * fold-const.c (fold_negate_expr_1): Use element_precision.
24700 (negate_expr_p): Likewise.
24702 2021-05-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
24704 * config/aarch64/iterators.md (SVE_PRED_LOAD): New iterator.
24705 (pred_load): New int attribute.
24706 * config/aarch64/aarch64-sve.md
24707 (aarch64_load_<ANY_EXTEND:optab><SVE_HSDI:mode><SVE_PARTIAL_I:mode>): Use
24708 SVE_PRED_LOAD enum iterator and corresponding pred_load attribute.
24709 * config/aarch64/aarch64-sve-builtins-base.cc (expand): Update call to
24710 code_for_aarch64_load.
24712 2021-05-19 Richard Biener <rguenther@suse.de>
24714 * cfgexpand.c (discover_nonconstant_array_refs_r): Make
24715 sure TARGET_MEM_REF bases are expanded as memory.
24716 * tree-ssa-operands.c (operands_scanner::get_tmr_operands):
24717 Do not mark TARGET_MEM_REF bases addressable.
24718 * tree-ssa.c (non_rewritable_mem_ref_base): Handle
24719 TARGET_MEM_REF bases as never rewritable.
24720 * gimple-walk.c (walk_stmt_load_store_addr_ops): Do not
24721 walk TARGET_MEM_REF bases as address-takens.
24722 * tree-ssa-dce.c (ref_may_be_aliased): Handle TARGET_MEM_REF.
24724 2021-05-19 Richard Biener <rguenther@suse.de>
24726 * builtins.c (get_object_alignment_1): Strip outer
24728 * tree-dfa.c (get_ref_base_and_extent): Handle outer
24729 WITH_SIZE_EXPR for size processing and process the
24731 * tree-ssa-alias.c (ao_ref_base_alias_set): Strip
24732 outer WITH_SIZE_EXPR.
24733 (ao_ref_base_alias_ptr_type): Likewise.
24734 (refs_may_alias_p_2): Allow WITH_SIZE_EXPR in ref->ref
24735 and handle that accordingly, stripping it for the
24736 core alias workers.
24737 * tree.c (get_base_address): Handle WITH_SIZE_EXPR by
24738 looking through it instead of returning NULL.
24740 2021-05-19 Jakub Jelinek <jakub@redhat.com>
24742 PR middle-end/100576
24743 * builtins.c (check_read_access): Convert bound to size_type_node if
24746 2021-05-19 Richard Biener <rguenther@suse.de>
24748 * tree-cfg.c (verify_types_in_gimple_min_lval): Inline...
24749 (verify_types_in_gimple_reference): ... here. Sanitize.
24750 (verify_gimple_call): Verify references in LHS and arguments.
24751 (verify_gimple_assign_single): Reject WITH_SIZE_EXPR.
24753 2021-05-19 Uroš Bizjak <ubizjak@gmail.com>
24755 * config/i386/i386.h (VALID_INT_MODE_P):
24756 Add V8QI, V4HI and V2SI modes for TARGET_64BIT.
24757 * config/i386/i386.md (isa): Add x64_bmi.
24758 (enabled): Handle x64_bmi.
24759 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
24760 Add alternative using 64bit general registers.
24761 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
24763 2021-05-19 Jakub Jelinek <jakub@redhat.com>
24765 PR middle-end/99928
24766 * tree.h (OMP_MASTER_COMBINED): Define.
24767 * gimplify.c (gimplify_scan_omp_clauses): Rewrite lastprivate
24768 handling for outer combined/composite constructs to a loop.
24769 Handle lastprivate on combined target.
24770 (gimplify_expr): Formatting fix.
24772 2021-05-19 Xionghu Luo <luoxhu@linux.ibm.com>
24774 * passes.def: Add sink_code pass before store_merging.
24775 * tree-ssa-sink.c (pass_sink_code:clone): New.
24777 2021-05-18 Bill Schmidt <wschmidt@linux.ibm.com>
24779 * config/rs6000/freebsd64.h (ADJUST_FIELD_ALIGN): Remove call to
24780 rs6000_special_adjust_field_align_p.
24781 * config/rs6000/linux64.h (ADJUST_FIELD_ALIGN): Likewise.
24782 * config/rs6000/rs6000-call.c (rs6000_function_arg_boundary):
24783 Remove ABI warning.
24784 (rs6000_function_arg): Likewise.
24785 * config/rs6000/rs6000-protos.h
24786 (rs6000_special_adjust_field_align_p): Remove prototype.
24787 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align_p):
24789 * config/rs6000/sysv4.h (ADJUST_FIELD_ALIGN): Remove call to
24790 rs6000_special_adjust_field_align_p.
24792 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24795 * config/i386/i386.h (VALID_SSE2_REG_MODE):
24796 Add V4QI and V2HI modes.
24797 (VALID_INT_MODE_P): Ditto.
24798 * config/i386/mmx.md (VI_32): New mode iterator.
24799 (mmxvecsize): Handle V4QI and V2HI.
24801 (mov<VI_32:mode>): New expander.
24802 (*mov<mode>_internal): New insn pattern.
24803 (movmisalign<VI_32:mode>): New expander.
24804 (neg<VI_32:mode>): New expander.
24805 (<plusminus:insn><VI_32:mode>3): New expander.
24806 (*<plusminus:insn><VI_32:mode>3): New insn pattern.
24807 (mulv2hi3): New expander.
24808 (*mulv2hi3): New insn pattern.
24809 (one_cmpl<VI_32:mode>2): New expander.
24810 (*andnot<VI_32:mode>3): New insn pattern.
24811 (<any_logic:code><VI_32:mode>3): New expander.
24812 (*<any_logic:code><VI_32:mode>3): New insn pattern.
24814 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24816 * config/i386/sse.md (<any_extend:insn>v4qiv4di2):
24817 Fix a mode mismatch with operand 1.
24819 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24822 * config/i386/i386-expand.c (split_double_mode): Return
24823 temporary register when simplify_gen_subreg fails with
24824 the high half od the paradoxical subreg.
24826 2021-05-18 Richard Biener <rguenther@suse.de>
24828 * cfgexpand.c (expand_one_var): Pass in forced_stack_var
24829 and honor it when expanding.
24830 (expand_used_vars_for_block): Pass through forced_stack_var.
24831 (expand_used_vars): Likewise.
24832 (discover_nonconstant_array_refs_r): Set bits in
24833 forced_stack_vars instead of marking vars TREE_ADDRESSABLE.
24834 (avoid_type_punning_on_regs): Likewise.
24835 (discover_nonconstant_array_refs): Likewise.
24836 (pass_expand::execute): Create and pass down forced_stack_var
24837 bitmap. For parameters and returns temporarily set
24838 TREE_ADDRESSABLE when expand_function_start.
24840 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
24842 * doc/sourcebuild.texi: Document 'dg-note'.
24844 2021-05-18 Tobias Burnus <tobias@codesourcery.com>
24847 * configure: Regenerate.
24848 * configure.ac (BUILD_CFLAG, BUILD_CXXFLAGS): Add $(CFLAGS-$@).
24850 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
24852 * gimple.h (is_gimple_omp_oacc): Tighten.
24853 * omp-low.c (check_omp_nesting_restrictions): Adjust.
24855 2021-05-18 Richard Biener <rguenther@suse.de>
24857 * tree-ssa-operands.c (mark_address_taken): Simplify.
24859 2021-05-18 Martin Liska <mliska@suse.cz>
24861 * config/gcn/mkoffload.c (STR): Redefine.
24862 * config/i386/intelmic-mkoffload.c (STR): Likewise.
24863 * config/nvptx/mkoffload.c (STR): Likewise.
24865 2021-05-18 Martin Liska <mliska@suse.cz>
24867 * common/config/aarch64/aarch64-common.c (aarch64_parse_extension):
24868 Use startswith function instead of strncmp.
24869 * common/config/bfin/bfin-common.c (bfin_handle_option): Likewise.
24870 * common/config/riscv/riscv-common.c (riscv_subset_list::parse): Likewise.
24871 * config/aarch64/aarch64-sve-builtins-shapes.cc (parse_type): Likewise.
24872 * config/aarch64/aarch64.c (aarch64_process_one_target_attr): Likewise.
24873 * config/alpha/alpha.c (alpha_elf_section_type_flags): Likewise.
24874 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
24875 * config/arm/arm.c (arm_file_start): Likewise.
24876 (arm_valid_target_attribute_rec): Likewise.
24877 (thumb1_md_asm_adjust): Likewise.
24878 * config/arm/driver-arm.c (host_detect_local_cpu): Likewise.
24879 * config/avr/avr.c (STR_PREFIX_P): Likewise.
24880 (avr_set_current_function): Likewise.
24881 (avr_handle_addr_attribute): Likewise.
24882 (avr_asm_output_aligned_decl_common): Likewise.
24883 (avr_asm_named_section): Likewise.
24884 (avr_section_type_flags): Likewise.
24885 (avr_asm_select_section): Likewise.
24886 * config/c6x/c6x.c (c6x_in_small_data_p): Likewise.
24887 (c6x_section_type_flags): Likewise.
24888 * config/darwin-c.c (darwin_cfstring_ref_p): Likewise.
24889 (darwin_objc_declare_unresolved_class_reference): Likewise.
24890 (darwin_objc_declare_class_definition): Likewise.
24891 * config/darwin.c (indirect_data): Likewise.
24892 (darwin_encode_section_info): Likewise.
24893 (darwin_objc2_section): Likewise.
24894 (darwin_objc1_section): Likewise.
24895 (machopic_select_section): Likewise.
24896 (darwin_globalize_label): Likewise.
24897 (darwin_label_is_anonymous_local_objc_name): Likewise.
24898 (darwin_asm_named_section): Likewise.
24899 (darwin_asm_output_dwarf_offset): Likewise.
24900 * config/frv/frv.c (frv_string_begins_with): Likewise.
24901 (frv_in_small_data_p): Likewise.
24902 * config/gcn/mkoffload.c (STR): Likewise.
24904 * config/i386/i386-builtins.c (get_builtin_code_for_version): Likewise.
24905 * config/i386/i386-options.c (ix86_option_override_internal): Likewise.
24906 * config/i386/i386.c (x86_64_elf_section_type_flags): Likewise.
24907 (ix86_md_asm_adjust): Likewise.
24908 * config/i386/intelmic-mkoffload.c (STR): Likewise.
24909 * config/i386/winnt.c (i386_pe_asm_named_section): Likewise.
24910 (i386_pe_file_end): Likewise.
24911 * config/ia64/ia64.c (ia64_in_small_data_p): Likewise.
24912 (ia64_section_type_flags): Likewise.
24913 * config/mips/driver-native.c (host_detect_local_cpu): Likewise.
24914 * config/mips/mips.c (mips_handle_interrupt_attr): Likewise.
24915 (mips16_stub_function_p): Likewise.
24916 (mips_function_rodata_section): Likewise.
24917 * config/msp430/msp430.c (msp430_mcu_name): Likewise.
24918 (msp430_function_section): Likewise.
24919 (msp430_section_type_flags): Likewise.
24920 (msp430_expand_helper): Likewise.
24921 * config/nios2/nios2.c (nios2_small_section_name_p): Likewise.
24922 (nios2_valid_target_attribute_rec): Likewise.
24923 * config/nvptx/mkoffload.c (process): Likewise.
24925 * config/pa/som.h: Likewise.
24926 * config/pdp11/pdp11.c (pdp11_output_ident): Likewise.
24927 * config/riscv/riscv.c (riscv_elf_select_rtx_section): Likewise.
24928 * config/rs6000/rs6000.c (VTABLE_NAME_P): Likewise.
24929 (rs6000_inner_target_options): Likewise.
24930 * config/s390/driver-native.c (s390_host_detect_local_cpu): Likewise.
24931 * config/sparc/driver-sparc.c (host_detect_local_cpu): Likewise.
24932 * config/vax/vax.c (vax_output_int_move): Likewise.
24933 * config/vms/vms-ld.c (startswith): Likewise.
24934 (process_args): Likewise.
24936 * config/vms/vms.c: Likewise.
24938 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24940 PR rtl-optimization/100590
24941 * regcprop.c (copyprop_hardreg_forward_1): Only DCE dead sets if
24942 they are NONJUMP_INSN_P.
24944 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24947 * function.c (push_dummy_function): Set DECL_ARTIFICIAL and
24948 DECL_ASSEMBLER_NAME on the fn_decl.
24950 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24952 PR tree-optimization/94589
24953 * tree-ssa-phiopt.c (spaceship_replacement): Pattern match
24954 phi result used in (res & ~1) == 0 comparison as res >= 0 as
24955 res == 2 would be UB with -ffinite-math-only.
24957 2021-05-18 Martin Liska <mliska@suse.cz>
24959 * Makefile.in: genversion.o should depend on DATESTAMP.
24961 2021-05-18 Claudiu Zissulescu <claziss@synopsys.com>
24963 * config/arc/simdext.md (negv2si2): Remove round bracket.
24965 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
24967 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Define
24968 _Bool as macro expanding to _Bool.
24970 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
24973 * tree.c (build_reference_type_for_mode)
24974 (build_pointer_type_for_mode): Pick pointer mode if MODE argument
24976 (build_reference_type, build_pointer_type): Invoke
24977 build_*_type_for_mode with VOIDmode.
24979 2021-05-17 Andrew MacLeod <amacleod@redhat.com>
24981 PR tree-optimization/100512
24982 * gimple-range-cache.cc (ranger_cache::set_global_range): Mark const
24983 and non-zero pointer ranges as invariant.
24984 * gimple-range.cc (gimple_ranger::range_of_stmt): Remove pointer
24985 processing from here.
24987 2021-05-17 Tom de Vries <tdevries@suse.de>
24990 * config/nvptx/nvptx-protos.h (nvptx_output_atomic_insn): Declare
24991 * config/nvptx/nvptx.c (nvptx_output_barrier)
24992 (nvptx_output_atomic_insn): New function.
24993 (nvptx_print_operand): Add support for 'B'.
24994 * config/nvptx/nvptx.md: Use nvptx_output_atomic_insn for atomic
24997 2021-05-17 Aldy Hernandez <aldyh@redhat.com>
24999 PR tree-optimization/100349
25000 * vr-values.c (bounds_of_var_in_loop): Bail if scev returns
25003 2021-05-17 Tamar Christina <tamar.christina@arm.com>
25005 * config/aarch64/driver-aarch64.c (DEFAULT_ARCH): New.
25006 (host_detect_local_cpu): Use it.
25008 2021-05-17 Martin Liska <mliska@suse.cz>
25010 * doc/invoke.texi: Add 2 missing dots.
25012 2021-05-17 Marius Hillenbrand <mhillen@linux.ibm.com>
25014 PR bootstrap/100552
25015 * configure.ac: Replace pattern substitution with call to sed.
25016 * configure: Regenerate.
25018 2021-05-17 Richard Biener <rguenther@suse.de>
25020 PR middle-end/100582
25021 * tree.c (array_at_struct_end_p): Get to the base of the
25022 reference before looking for the underlying decl.
25024 2021-05-17 Joern Rennecke <joern.rennecke@embecosm.com>
25026 * genoutput.c (validate_insn_alternatives) Make "wrong number of
25027 alternatives" message more specific, and remove assumption on where
25030 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
25032 * config/arm/iterators.md (V16): New iterator.
25033 (VH_cvtto): New iterator.
25034 (v_cmp_result): Added V4HF and V8HF support.
25035 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result>): Use VDQWH.
25036 (vcond<mode><mode>): Likewise.
25037 (vcond_mask_<mode><v_cmp_result>): Likewise.
25038 (vcond<VH_cvtto><mode>): New expander.
25040 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
25042 * config/arm/arm-protos.h (arm_expand_vector_compare): Update
25044 * config/arm/arm.c (arm_expand_vector_compare): Add support for
25046 (arm_expand_vcond): Likewise.
25047 * config/arm/iterators.md (supf): Remove VCMPNEQ_S, VCMPEQQ_S,
25048 VCMPEQQ_N_S, VCMPNEQ_N_S.
25049 (VCMPNEQ, VCMPEQQ, VCMPEQQ_N, VCMPNEQ_N): Remove.
25050 * config/arm/mve.md (@mve_vcmp<mve_cmp_op>q_<mode>): Add '@' prefix.
25051 (@mve_vcmp<mve_cmp_op>q_f<mode>): Likewise.
25052 (@mve_vcmp<mve_cmp_op>q_n_f<mode>): Likewise.
25053 (@mve_vpselq_<supf><mode>): Likewise.
25054 (@mve_vpselq_f<mode>"): Likewise.
25055 * config/arm/neon.md (vec_cmp<mode><v_cmp_result): Enable for MVE
25056 and move to vec-common.md.
25057 (vec_cmpu<mode><mode>): Likewise.
25058 (vcond<mode><mode>): Likewise.
25059 (vcond<V_cvtto><mode>): Likewise.
25060 (vcondu<mode><v_cmp_result>): Likewise.
25061 (vcond_mask_<mode><v_cmp_result>): Likewise.
25062 * config/arm/unspecs.md (VCMPNEQ_U, VCMPNEQ_S, VCMPEQQ_S)
25063 (VCMPEQQ_N_S, VCMPNEQ_N_S, VCMPEQQ_U, CMPEQQ_N_U, VCMPNEQ_N_U)
25064 (VCMPGEQ_N_S, VCMPGEQ_S, VCMPGTQ_N_S, VCMPGTQ_S, VCMPLEQ_N_S)
25065 (VCMPLEQ_S, VCMPLTQ_N_S, VCMPLTQ_S, VCMPCSQ_N_U, VCMPCSQ_U)
25066 (VCMPHIQ_N_U, VCMPHIQ_U): Remove.
25067 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result): Moved
25069 (vec_cmpu<mode><mode>): Likewise.
25070 (vcond<mode><mode>): Likewise.
25071 (vcond<V_cvtto><mode>): Likewise.
25072 (vcondu<mode><v_cmp_result>): Likewise.
25073 (vcond_mask_<mode><v_cmp_result>): Likewise. Added unsafe math
25076 2021-05-17 liuhongt <hongtao.liu@intel.com>
25079 * config/i386/i386.c (ix86_gimple_fold_builtin): Use
25080 gsi_insert_seq_before instead.
25082 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
25084 * doc/sourcebuild.texi (arm_qbit_ok): Rename into...
25085 (arm_sat_ok): ...this.
25087 2021-05-17 Martin Liska <mliska@suse.cz>
25089 * lto-wrapper.c (merge_flto_options): Factor out a new function.
25090 (merge_and_complain): Use it.
25091 (run_gcc): Merge also linker command line -flto=foo argument
25094 2021-05-16 Christophe Lyon <christophe.lyon@linaro.org>
25096 * config/arm/arm.h (CPP_SPEC): Remove error message about
25097 -mlittle-endian/-mbig-endian conflict.
25099 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
25101 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
25102 __ROP_PROTECT__ if -mrop-protect is selected.
25104 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
25106 * config/rs6000/rs6000-internal.h (rs6000_stack): Add
25107 rop_hash_save_offset and rop_hash_size.
25108 * config/rs6000/rs6000-logue.c (rs6000_stack_info): Compute
25109 rop_hash_size and rop_hash_save_offset.
25110 (debug_stack_info): Dump rop_hash_save_offset and rop_hash_size.
25111 (rs6000_emit_prologue): Emit hashst[p] in prologue.
25112 (rs6000_emit_epilogue): Emit hashchk[p] in epilogue.
25113 * config/rs6000/rs6000.md (unspec): Add UNSPEC_HASHST and
25115 (hashst): New define_insn.
25116 (hashchk): Likewise.
25118 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
25120 * config/rs6000/rs6000.c (rs6000_option_override_internal):
25121 Disable shrink wrap when inserting ROP-protect instructions.
25122 * config/rs6000/rs6000.opt (mrop-protect): New option.
25123 (mprivileged): Likewise.
25124 * doc/invoke.texi: Document mrop-protect and mprivileged.
25126 2021-05-15 Hans-Peter Nilsson <hp@axis.com>
25128 * reorg.c (fill_slots_from_thread): Reinstate code typoed out in
25131 2021-05-15 Martin Jambor <mjambor@suse.cz>
25134 2021-05-13 Martin Jambor <mjambor@suse.cz>
25136 PR tree-optimization/100453
25137 * tree-sra.c (sra_modify_assign): All const base accesses do not
25138 need refreshing, not just those from decl_pool.
25139 (sra_modify_assign): Do not refresh into a const base decl.
25141 2021-05-15 Jakub Jelinek <jakub@redhat.com>
25143 PR rtl-optimization/100342
25144 * regcprop.c (copy_value): When copying a source reg in a wider
25145 mode than it has recorded for the value, adjust recorded destination
25146 mode too or punt if !REG_CAN_CHANGE_MODE_P.
25148 2021-05-14 Jason Merrill <jason@redhat.com>
25150 * intl.h: Add comments.
25152 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25154 * config/aarch64/aarch64-simd.md
25155 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>_internal): Split into...
25156 (aarch64_sqdmlsl2_lane<mode>_internal): ... This...
25157 (aarch64_sqdmlal2_lane<mode>_internal): ... And this.
25158 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>_internal): Split into ...
25159 (aarch64_sqdmlsl2_laneq<mode>_internal): ... This...
25160 (aarch64_sqdmlal2_laneq<mode>_internal): ... And this.
25161 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>_internal): Split into...
25162 (aarch64_sqdmlsl2_n<mode>_internal): ... This...
25163 (aarch64_sqdmlal2_n<mode>_internal): ... And this.
25165 2021-05-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25168 * config/arm/arm_neon.h (vtst_s8): Replace call to vtst builtin with it's
25169 boolean logic equivalent.
25170 (vtst_s16): Likewise.
25171 (vtst_s32): Likewise.
25172 (vtst_u8): Likewise.
25173 (vtst_u16): Likewise.
25174 (vtst_u32): Likewise.
25175 (vtst_p8): Likewise.
25176 (vtst_p16): Likewise.
25177 (vtstq_s8): Likewise.
25178 (vtstq_s16): Likewise.
25179 (vtstq_s32): Likewise.
25180 (vtstq_u8): Likewise.
25181 (vtstq_u16): Likewise.
25182 (vtstq_u32): Likewise.
25183 (vtstq_p8): Likewise.
25184 (vtstq_p16): Likewise.
25185 * config/arm/arm_neon_builtins.def: Remove entry for vtst.
25186 * config/arm/neon.md (neon_vtst<mode>): Remove pattern.
25188 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25190 * config/aarch64/aarch64-simd.md (aarch64_sqdmlal2<mode>): Merge into...
25191 (aarch64_sqdml<SBINQOPS:as>l2<mode>): ... This.
25192 (aarch64_sqdmlsl2<mode>): Delete.
25193 (aarch64_sqdmlal2_lane<mode>): Merge this...
25194 (aarch64_sqdmlsl2_lane<mode>): ... And this...
25195 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>): ... Into this.
25196 (aarch64_sqdmlal2_laneq<mode>): Merge this...
25197 (aarch64_sqdmlsl2_laneq<mode>): ... And this...
25198 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>): ... Into this.
25199 (aarch64_sqdmlal2_n<mode>): Merge this...
25200 (aarch64_sqdmlsl2_n<mode>): ... And this...
25201 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>): ... Into this.
25203 2021-05-13 Martin Sebor <msebor@redhat.com>
25205 PR middle-end/100574
25206 * builtins.c (access_ref::get_ref): Improve detection of PHIs with
25207 all null arguments.
25209 2021-05-13 Martin Sebor <msebor@redhat.com>
25211 PR tree-optimization/93100
25212 PR middle-end/98583
25213 * tree-ssa-uninit.c (check_defs): Exclude intrinsic functions that
25214 don't modify referenced objects.
25216 2021-05-13 Martin Jambor <mjambor@suse.cz>
25218 PR tree-optimization/100453
25219 * tree-sra.c (sra_modify_assign): All const base accesses do not
25220 need refreshing, not just those from decl_pool.
25221 (sra_modify_assign): Do not refresh into a const base decl.
25223 2021-05-13 Martin Liska <mliska@suse.cz>
25225 * tree-ssa-dom.c: Remove m_simplifier.
25227 2021-05-13 Richard Earnshaw <rearnsha@arm.com>
25230 * config/arm/arm.c (arm_canonicalize_comparison): Correctly
25231 canonicalize DImode inequality comparisons against the
25232 maximum integral value.
25234 2021-05-13 Jakub Jelinek <jakub@redhat.com>
25236 PR tree-optimization/98856
25237 * config/i386/i386.c (ix86_shift_rotate_cost): Add CODE argument.
25238 Expect V2DI and V4DI arithmetic right shifts to be emulated.
25239 (ix86_rtx_costs, ix86_add_stmt_cost): Adjust ix86_shift_rotate_cost
25241 * config/i386/i386-expand.c (expand_vec_perm_2perm_interleave,
25242 expand_vec_perm_2perm_pblendv): New functions.
25243 (ix86_expand_vec_perm_const_1): Use them.
25244 * config/i386/sse.md (ashr<mode>3<mask_name>): Rename to ...
25245 (<mask_codefor>ashr<mode>3<mask_name>): ... this.
25246 (ashr<mode>3): New define_expand with VI248_AVX512BW iterator.
25247 (ashrv4di3): New define_expand.
25248 (ashrv2di3): Change condition to TARGET_SSE2, handle !TARGET_XOP
25249 and !TARGET_AVX512VL expansion.
25251 2021-05-13 Uroš Bizjak <ubizjak@gmail.com>
25254 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Force mode
25255 sizes < 16 to a register when constructing vpcmov pattern.
25256 * config/i386/mmx.md (*xop_pcmov_<mode>): Use MMXMODE124 mode.
25258 2021-05-13 Martin Liska <mliska@suse.cz>
25260 * gcov-io.c (gcov_write_block): Remove.
25261 (gcov_write_words): Likewise.
25262 (gcov_read_words): Re-implement using gcov_read_bytes.
25263 (gcov_allocate): Remove.
25264 (GCOV_BLOCK_SIZE): Likewise.
25265 (struct gcov_var): Remove most of the fields.
25266 (gcov_position): Implement with ftell.
25267 (gcov_rewrite): Remove setting of start and offset fields.
25268 (from_file): Re-format.
25269 (gcov_open): Remove setbuf call. It should not be needed.
25270 (gcov_close): Remove internal buffer handling.
25271 (gcov_magic): Use __builtin_bswap32.
25272 (gcov_write_counter): Use directly gcov_write_unsigned.
25273 (gcov_write_string): Use direct fwrite and do not round
25275 (gcov_seek): Use directly fseek.
25276 (gcov_write_tag): Use gcov_write_unsigned directly.
25277 (gcov_write_length): Likewise.
25278 (gcov_write_tag_length): Likewise.
25279 (gcov_read_bytes): Use directly fread.
25280 (gcov_read_unsigned): Use gcov_read_words.
25281 (gcov_read_counter): Likewise.
25282 (gcov_read_string): Use gcov_read_bytes.
25283 * gcov-io.h (GCOV_WORD_SIZE): Adjust to reflect
25284 that size is not in bytes, but words (4B).
25285 (GCOV_TAG_FUNCTION_LENGTH): Likewise.
25286 (GCOV_TAG_ARCS_LENGTH): Likewise.
25287 (GCOV_TAG_ARCS_NUM): Likewise.
25288 (GCOV_TAG_COUNTER_LENGTH): Likewise.
25289 (GCOV_TAG_COUNTER_NUM): Likewise.
25290 (GCOV_TAG_SUMMARY_LENGTH): Likewise.
25292 2021-05-13 liuhongt <hongtao.liu@intel.com>
25295 * config/i386/sse.md (ssedoublevecmode): Add attribute for
25296 V64QI/V32HI/V16SI/V4DI.
25297 (ssehalfvecmode): Add attribute for V2DI/V2DF.
25298 (*vec_concatv4si_0): Extend to VI124_128.
25299 (*vec_concat<mode>_0): New pre-reload splitter.
25300 * config/i386/predicates.md (movq_parallel): New predicate.
25302 2021-05-13 Alexandre Oliva <oliva@adacore.com>
25304 * targhooks.c (default_zero_call_used_regs): Retry using
25305 successfully-zeroed registers as sources.
25307 2021-05-12 Tobias Burnus <tobias@codesourcery.com>
25309 * omp-low.c (finish_taskreg_scan): Use the proper detach decl.
25311 2021-05-12 Aldy Hernandez <aldyh@redhat.com>
25314 * gimple-range.cc (range_of_builtin_call): Skip out on
25315 processing __builtin_clz when varying.
25317 2021-05-12 Tom de Vries <tdevries@suse.de>
25320 * config/nvptx/nvptx-opts.h (enum ptx_version): New enum.
25321 * config/nvptx/nvptx.c (nvptx_file_start): Print .version according
25322 to ptx_version_option.
25323 * config/nvptx/nvptx.h (TARGET_PTX_6_3): Define.
25324 * config/nvptx/nvptx.md (define_insn "nvptx_shuffle<mode>")
25325 (define_insn "nvptx_vote_ballot"): Use sync variant for
25327 * config/nvptx/nvptx.opt (ptx_version): Add enum.
25328 (mptx): Add option.
25329 * doc/invoke.texi (Nvidia PTX Options): Add mptx item.
25331 2021-05-12 Richard Biener <rguenther@suse.de>
25333 PR tree-optimization/100566
25334 * tree-ssa-sccvn.c (dominated_by_p_w_unex): Properly handle
25335 allow_back for all edge queries.
25337 2021-05-12 liuhongt <hongtao.liu@intel.com>
25340 * config/i386/sse.md (<sse4_1_avx2>_pblendvb): Add
25341 splitters for pblendvb of NOT mask register.
25343 2021-05-12 Richard Biener <rguenther@suse.de>
25345 PR tree-optimization/100519
25346 * tree-ssa-reassoc.c (can_associate_p): Split into...
25347 (can_associate_op_p): ... this
25348 (can_associate_type_p): ... and this.
25349 (is_reassociable_op): Call can_associate_op_p.
25350 (break_up_subtract_bb): Call the appropriate predicates.
25351 (reassociate_bb): Likewise.
25353 2021-05-12 Martin Liska <mliska@suse.cz>
25355 * lto-wrapper.c (merge_and_complain): Merge -flto=arg options.
25356 (run_gcc): Use -flto argument detection for merged
25359 2021-05-12 Martin Liska <mliska@suse.cz>
25361 * lto-wrapper.c (print_lto_docs_link): New function.
25362 (run_gcc): Print warning about missing job server detection
25363 after we know NR of partitions. Do the same for -flto{,=1}.
25364 * opts.c (get_option_html_page): Support -flto option.
25366 2021-05-12 Martin Liska <mliska@suse.cz>
25368 * lto-wrapper.c (get_options_from_collect_gcc_options): Change
25370 (append_option): Remove.
25371 (find_option): Rework to use the vector type.
25372 (remove_option): Remove.
25373 (merge_and_complain): Use vectors for cl_decoded_option data
25375 (append_compiler_options): Likewise.
25376 (append_diag_options): Likewise.
25377 (append_linker_options): Likewise.
25378 (append_offload_options): Likewise.
25379 (compile_offload_image): Likewise.
25380 (compile_images_for_offload_targets): Likewise.
25381 (find_and_merge_options): Likewise.
25382 (run_gcc): Likewise.
25384 2021-05-12 Bernd Edlinger <bernd.edlinger@hotmail.de>
25387 * dwarf2out.c (dwarf2out_finish): Set
25388 have_multiple_function_sections with multi-range text_section.
25390 2021-05-12 Martin Liska <mliska@suse.cz>
25392 PR bootstrap/100560
25393 * Makefile.in: Remove version.h from linker command line.
25395 2021-05-12 Richard Biener <rguenther@suse.de>
25397 PR middle-end/100547
25398 * rtl.h (rtvec_alloc): Make argument size_t.
25399 * rtl.c (rtvec_alloc): Verify the count is less than INT_MAX.
25401 2021-05-12 Jakub Jelinek <jakub@redhat.com>
25403 PR middle-end/100508
25404 * cfgexpand.c (expand_debug_expr): For DEBUG_EXPR_DECL with vector
25405 type, don't reuse DECL_RTL if it has different mode, instead force
25406 creation of a new DEBUG_EXPR.
25408 2021-05-12 Jakub Jelinek <jakub@redhat.com>
25409 Marc Glisse <marc.glisse@inria.fr>
25411 PR tree-optimization/94589
25412 * match.pd ((X & Y) == X -> (X & ~Y) == 0,
25413 (X | Y) == Y -> (X & ~Y) == 0): New GIMPLE simplifications.
25415 2021-05-12 Uroš Bizjak <ubizjak@gmail.com>
25418 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Handle V2SF mode.
25419 * config/i386/mmx.md (MMXMODE124): New mode iterator.
25421 (mmxintvecmode): New mode attribute.
25422 (mmxintvecmodelower): Ditto.
25423 (*mmx_maskcmpv2sf3_comm): New insn pattern.
25424 (*mmx_maskcmpv2sf3): Ditto.
25425 (vec_cmpv2sfv2si): New expander.
25426 (vcond<V2FI:mode>v2si): Ditto.
25427 (mmx_vlendvps): New insn pattern.
25428 (vcond<MMXMODE124:mode><MMXMODEI:mode>): Also handle V2SFmode.
25429 (vcondu<MMXMODE124:mode><MMXMODEI:mode>): Ditto.
25430 (vcond_mask_<mode><mmxintvecmodelower>): Ditto.
25432 2021-05-11 Martin Sebor <msebor@redhat.com>
25434 PR middle-end/21433
25435 * expr.c (expand_expr_real_1): Replace unreachable code with an assert.
25437 2021-05-11 Richard Biener <rguenther@suse.de>
25439 * gimple-fold.c (gimple_fold_call): Do not call
25440 maybe_fold_reference on call arguments or the static chain.
25441 (fold_stmt_1): Do not call maybe_fold_reference on GIMPLE_ASM
25444 2021-05-11 Martin Liska <mliska@suse.cz>
25446 * builtins.def (DEF_HSAIL_BUILTIN): Remove.
25447 (DEF_HSAIL_ATOMIC_BUILTIN): Likewise.
25448 (DEF_HSAIL_SAT_BUILTIN): Likewise.
25449 (DEF_HSAIL_INTR_BUILTIN): Likewise.
25450 (DEF_HSAIL_CVT_ZEROI_SAT_BUILTIN): Likewise.
25451 * doc/frontends.texi: Remove BRIG.
25452 * doc/install.texi: Likewise.
25453 * doc/invoke.texi: Likewise.
25454 * doc/standards.texi: Likewise.
25455 * brig-builtins.def: Removed.
25456 * brig/ChangeLog: Removed.
25457 * brig/Make-lang.in: Removed.
25458 * brig/brig-builtins.h: Removed.
25459 * brig/brig-c.h: Removed.
25460 * brig/brig-lang.c: Removed.
25461 * brig/brigfrontend/brig-arg-block-handler.cc: Removed.
25462 * brig/brigfrontend/brig-atomic-inst-handler.cc: Removed.
25463 * brig/brigfrontend/brig-basic-inst-handler.cc: Removed.
25464 * brig/brigfrontend/brig-branch-inst-handler.cc: Removed.
25465 * brig/brigfrontend/brig-cmp-inst-handler.cc: Removed.
25466 * brig/brigfrontend/brig-code-entry-handler.cc: Removed.
25467 * brig/brigfrontend/brig-code-entry-handler.h: Removed.
25468 * brig/brigfrontend/brig-comment-handler.cc: Removed.
25469 * brig/brigfrontend/brig-control-handler.cc: Removed.
25470 * brig/brigfrontend/brig-copy-move-inst-handler.cc: Removed.
25471 * brig/brigfrontend/brig-cvt-inst-handler.cc: Removed.
25472 * brig/brigfrontend/brig-fbarrier-handler.cc: Removed.
25473 * brig/brigfrontend/brig-function-handler.cc: Removed.
25474 * brig/brigfrontend/brig-function.cc: Removed.
25475 * brig/brigfrontend/brig-function.h: Removed.
25476 * brig/brigfrontend/brig-inst-mod-handler.cc: Removed.
25477 * brig/brigfrontend/brig-label-handler.cc: Removed.
25478 * brig/brigfrontend/brig-lane-inst-handler.cc: Removed.
25479 * brig/brigfrontend/brig-machine.c: Removed.
25480 * brig/brigfrontend/brig-machine.h: Removed.
25481 * brig/brigfrontend/brig-mem-inst-handler.cc: Removed.
25482 * brig/brigfrontend/brig-module-handler.cc: Removed.
25483 * brig/brigfrontend/brig-queue-inst-handler.cc: Removed.
25484 * brig/brigfrontend/brig-seg-inst-handler.cc: Removed.
25485 * brig/brigfrontend/brig-signal-inst-handler.cc: Removed.
25486 * brig/brigfrontend/brig-to-generic.cc: Removed.
25487 * brig/brigfrontend/brig-to-generic.h: Removed.
25488 * brig/brigfrontend/brig-util.cc: Removed.
25489 * brig/brigfrontend/brig-util.h: Removed.
25490 * brig/brigfrontend/brig-variable-handler.cc: Removed.
25491 * brig/brigfrontend/hsa-brig-format.h: Removed.
25492 * brig/brigfrontend/phsa.h: Removed.
25493 * brig/brigspec.c: Removed.
25494 * brig/config-lang.in: Removed.
25495 * brig/gccbrig.texi: Removed.
25496 * brig/lang-specs.h: Removed.
25497 * brig/lang.opt: Removed.
25499 2021-05-11 Richard Biener <rguenther@suse.de>
25502 * ipa-param-manipulation.c
25503 (ipa_param_body_adjustments::modify_call_stmt): Avoid
25504 altering SSA_NAME_DEF_STMT by adjusting the calls LHS
25505 via gimple_call_lhs_ptr.
25507 2021-05-11 Alex Coplan <alex.coplan@arm.com>
25510 * config/arm/arm.c (cmse_nonsecure_call_inline_register_clear):
25511 Avoid emitting CFA adjusts on the sp if we have the fp.
25513 2021-05-11 Richard Sandiford <richard.sandiford@arm.com>
25515 * config/aarch64/iterators.md (VMUL_CHANGE_NLANES): Delete.
25516 (VMULD): New iterator.
25517 (VCOND): Handle V4HF and V8HF.
25518 (VCONQ): Fix entry for V2SF.
25519 * config/aarch64/aarch64-simd.md (mul_lane<mode>3): Use VMULD
25520 instead of VMUL. Use a 64-bit vector mode for the indexed operand.
25521 (*aarch64_mul3_elt_<vswap_width_name><mode>): Merge with...
25522 (mul_laneq<mode>3): ...this define_insn. Use VMUL instead of VDQSF.
25523 Use a 128-bit vector mode for the indexed operand. Use stype for
25524 the scheduling type.
25526 2021-05-11 Richard Biener <rguenther@suse.de>
25528 * gimple-fold.c (maybe_fold_reference): Only return
25529 is_gimple_min_invariant values.
25531 2021-05-11 Richard Biener <rguenther@suse.de>
25533 PR middle-end/100509
25534 * gimple-fold.c (fold_gimple_assign): Only call
25535 get_symbol_constant_value on register type symbols.
25537 2021-05-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
25538 Joe Ramsay <joe.ramsay@arm.com>
25541 * config/arm/arm_mve.h (__arm_vstrwq_scatter_offset): Fix wrong arguments.
25542 (__arm_vcmpneq): Remove duplicate definition.
25543 (__arm_vstrwq_scatter_offset_p): Likewise.
25544 (__arm_vmaxq_x): Likewise.
25545 (__arm_vmlsdavaq): Likewise.
25546 (__arm_vmlsdavaxq): Likewise.
25547 (__arm_vmlsdavq_p): Likewise.
25548 (__arm_vmlsdavxq_p): Likewise.
25549 (__arm_vrmlaldavhaq): Likewise.
25550 (__arm_vstrbq_p): Likewise.
25551 (__arm_vstrbq_scatter_offset): Likewise.
25552 (__arm_vstrbq_scatter_offset_p): Likewise.
25553 (__arm_vstrdq_scatter_offset): Likewise.
25554 (__arm_vstrdq_scatter_offset_p): Likewise.
25555 (__arm_vstrdq_scatter_shifted_offset): Likewise.
25556 (__arm_vstrdq_scatter_shifted_offset_p): Likewise.
25558 2021-05-11 Jakub Jelinek <jakub@redhat.com>
25560 PR middle-end/100471
25561 * omp-low.c (lower_omp_task_reductions): For OMP_TASKLOOP, if data
25562 is 0, bypass the reduction loop including
25563 GOMP_taskgroup_reduction_unregister call.
25565 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
25567 * config/rs6000/rs6000.c (struct rs6000_cost_data): New member
25568 costing_for_scalar.
25569 (rs6000_density_test): Early return if costing_for_scalar is true.
25570 (rs6000_init_cost): Init costing_for_scalar of rs6000_cost_data.
25572 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
25574 * doc/tm.texi: Regenerated.
25575 * target.def (init_cost): Add new parameter costing_for_scalar.
25576 * targhooks.c (default_init_cost): Adjust for new parameter.
25577 * targhooks.h (default_init_cost): Likewise.
25578 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Likewise.
25579 (vect_compute_single_scalar_iteration_cost): Likewise.
25580 (vect_analyze_loop_2): Likewise.
25581 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Likewise.
25582 (vect_bb_vectorization_profitable_p): Likewise.
25583 * tree-vectorizer.h (init_cost): Likewise.
25584 * config/aarch64/aarch64.c (aarch64_init_cost): Likewise.
25585 * config/i386/i386.c (ix86_init_cost): Likewise.
25586 * config/rs6000/rs6000.c (rs6000_init_cost): Likewise.
25588 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
25590 * config/rs6000/rs6000.c (rs6000_vect_nonmem): Renamed to
25591 vect_nonmem and moved into...
25592 (struct rs6000_cost_data): ...here.
25593 (rs6000_init_cost): Use vect_nonmem of cost_data instead.
25594 (rs6000_add_stmt_cost): Likewise.
25595 (rs6000_finish_cost): Likewise.
25597 2021-05-10 Eric Botcazou <ebotcazou@adacore.com>
25599 * range-op.cc (get_bool_state): Adjust head comment.
25600 (operator_not_equal::op1_range): Fix comment.
25601 (operator_bitwise_xor::op1_range): Remove call to gcc_unreachable.
25603 2021-05-10 Martin Sebor <msebor@redhat.com>
25605 PR middle-end/100425
25606 PR middle-end/100510
25607 * gimple-ssa-warn-alloca.c (pass_walloca::firast_time_p): Rename...
25608 (pass_walloca::xlimit_certain_p): ...to this.
25609 (pass_walloca::gate): Execute for any kind of handled warning.
25610 (pass_walloca::execute): Avoid issuing "maybe" and "unbounded"
25611 warnings when xlimit_certain_p is set.
25613 2021-05-10 Pat Haugen <pthaugen@linux.ibm.com>
25615 * config/rs6000/rs6000.c (rs6000_ira_change_pseudo_allocno_class):
25616 Return ALTIVEC_REGS if that is best_class.
25617 (rs6000_compute_pressure_classes): Add ALTIVEC_REGS.
25619 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25621 * config/arm/arm.h (CPP_SPEC): Remove error message about
25624 2021-05-10 Martin Jambor <mjambor@suse.cz>
25626 * ipa-prop.h (IPA_NODE_REF): Removed.
25627 (IPA_NODE_REF_GET_CREATE): Likewise.
25628 (IPA_EDGE_REF): Likewise.
25629 (IPA_EDGE_REF_GET_CREATE): Likewise.
25630 (IS_VALID_JUMP_FUNC_INDEX): Likewise.
25631 * ipa-cp.c (print_all_lattices): Replaced IPA_NODE_REF with a direct
25632 use of ipa_node_params_sum.
25633 (ipcp_versionable_function_p): Likewise.
25634 (push_node_to_stack): Likewise.
25635 (pop_node_from_stack): Likewise.
25636 (set_single_call_flag): Replaced two IPA_NODE_REF with one single
25637 direct use of ipa_node_params_sum.
25638 (initialize_node_lattices): Replaced IPA_NODE_REF with a direct use of
25639 ipa_node_params_sum.
25640 (ipa_context_from_jfunc): Replaced IPA_EDGE_REF with a direct use of
25642 (ipcp_verify_propagated_values): Replaced IPA_NODE_REF with a direct
25643 use of ipa_node_params_sum.
25644 (self_recursively_generated_p): Likewise.
25645 (propagate_scalar_across_jump_function): Likewise.
25646 (propagate_context_across_jump_function): Replaced IPA_EDGE_REF with a
25647 direct use of ipa_edge_args_sum, moved the lookup after the early
25648 exit. Replaced IPA_NODE_REF with a direct use of ipa_node_params_sum.
25649 (propagate_bits_across_jump_function): Replaced IPA_NODE_REF with
25650 direct uses of ipa_node_params_sum.
25651 (propagate_vr_across_jump_function): Likewise.
25652 (propagate_aggregate_lattice): Likewise.
25653 (propagate_aggs_across_jump_function): Likewise.
25654 (propagate_constants_across_call): Likewise, also replaced
25655 IPA_EDGE_REF with a direct use of ipa_edge_args_sum.
25656 (good_cloning_opportunity_p): Replaced IPA_NODE_REF with a direct use
25657 of ipa_node_params_sum.
25658 (estimate_local_effects): Likewise.
25659 (add_all_node_vals_to_toposort): Likewise.
25660 (propagate_constants_topo): Likewise.
25661 (ipcp_propagate_stage): Likewise.
25662 (ipcp_discover_new_direct_edges): Likewise.
25663 (calls_same_node_or_its_all_contexts_clone_p): Likewise.
25664 (cgraph_edge_brings_value_p): Likewise (in both overloaded functions).
25665 (get_info_about_necessary_edges): Likewise.
25666 (want_remove_some_param_p): Likewise.
25667 (create_specialized_node): Likewise.
25668 (self_recursive_pass_through_p): Likewise.
25669 (self_recursive_agg_pass_through_p): Likewise.
25670 (find_more_scalar_values_for_callers_subset): Likewise and also
25671 replaced IPA_EDGE_REF with direct uses of ipa_edge_args_sum, in one
25672 case replacing two of those with a single query.
25673 (find_more_contexts_for_caller_subset): Likewise for the
25674 ipa_polymorphic_call_context overload.
25675 (intersect_aggregates_with_edge): Replaced IPA_EDGE_REF with a direct
25676 use of ipa_edge_args_sum. Replaced IPA_NODE_REF with direct uses of
25677 ipa_node_params_sum.
25678 (find_aggregate_values_for_callers_subset): Likewise, also reusing
25679 results of ipa_edge_args_sum->get.
25680 (cgraph_edge_brings_all_scalars_for_node): Replaced IPA_NODE_REF with
25681 direct uses of ipa_node_params_sum, replaced IPA_EDGE_REF with a
25682 direct use of ipa_edge_args_sum.
25683 (cgraph_edge_brings_all_agg_vals_for_node): Likewise, moved node
25684 summary query after the early exit and reused the result later.
25685 (decide_about_value): Replaced IPA_NODE_REF with a direct use of
25686 ipa_node_params_sum.
25687 (decide_whether_version_node): Likewise. Removed re-querying for
25688 summaries after cloning.
25689 (spread_undeadness): Replaced IPA_NODE_REF with a direct use of
25690 ipa_node_params_sum.
25691 (has_undead_caller_from_outside_scc_p): Likewise, reusing results of
25693 (identify_dead_nodes): Likewise.
25694 (ipcp_store_bits_results): Replaced IPA_NODE_REF with direct uses of
25695 ipa_node_params_sum.
25696 (ipcp_store_vr_results): Likewise.
25697 * ipa-fnsummary.c (evaluate_properties_for_edge): Likewise.
25698 (ipa_fn_summary_t::duplicate): Likewise.
25699 (analyze_function_body): Likewise.
25700 (estimate_calls_size_and_time): Likewise.
25701 (ipa_cached_call_context::duplicate_from): Likewise.
25702 (ipa_call_context::equal_to): Likewise.
25703 (remap_edge_params): Likewise.
25704 (ipa_merge_fn_summary_after_inlining): Likewise.
25705 (inline_read_section): Likewise.
25706 * ipa-icf.c (sem_function::param_used_p): Likewise.
25707 * ipa-modref.c (compute_parm_map): Likewise.
25708 (compute_parm_map): Replaced IPA_EDGE_REF with a direct use of
25710 (get_access_for_fnspec): Replaced IPA_NODE_REF with a direct use of
25711 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
25713 * ipa-profile.c (check_argument_count): Likewise.
25714 * ipa-prop.c (ipa_alloc_node_params): Replaced IPA_NODE_REF_GET_CREATE
25715 with a direct use of ipa_node_params_sum.
25716 (ipa_initialize_node_params): Likewise.
25717 (ipa_print_node_jump_functions_for_edge): Replaced IPA_EDGE_REF with a
25718 direct use of ipa_edge_args_sum and reused the query result.
25719 (ipa_compute_jump_functions_for_edge): Replaced IPA_NODE_REF with a
25720 direct use of ipa_node_params_sum and replaced IPA_EDGE_REF with a
25721 direct use of ipa_edge_args_sum.
25722 (ipa_note_param_call): Replaced IPA_NODE_REF with a direct use of
25723 ipa_node_params_sum and reused the result of the query.
25724 (ipa_analyze_node): Likewise.
25725 (ipa_analyze_controlled_uses): Replaced IPA_NODE_REF with a direct use
25726 of ipa_node_params_sum.
25727 (update_jump_functions_after_inlining): Replaced IPA_EDGE_REF with
25728 direct uses of ipa_edge_args_sum.
25729 (update_indirect_edges_after_inlining): Replaced IPA_NODE_REF with
25730 direct uses of ipa_node_params_sum and replaced IPA_EDGE_REF with a
25731 direct use of ipa_edge_args_sum. Removed superficial re-querying the
25733 (propagate_controlled_uses): Replaced IPA_NODE_REF with direct uses of
25734 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
25736 (ipa_propagate_indirect_call_infos): Replaced IPA_EDGE_REF with a
25737 direct use of ipa_edge_args_sum.
25738 (ipa_edge_args_sum_t::duplicate): Replaced IPA_NODE_REF with a direct
25739 use of ipa_node_params_sum.
25740 (ipa_print_node_params): Likewise.
25741 (ipa_write_node_info): Likewise and also replaced IPA_EDGE_REF with
25742 direct uses of ipa_edge_args_sum.
25743 (ipa_read_edge_info): Replaced IPA_EDGE_REF with a direct use of
25745 (ipa_read_node_info): Replaced IPA_NODE_REF with a direct use of
25746 ipa_node_params_sum.
25747 (ipa_prop_write_jump_functions): Likewise. Move variable node to the
25748 scopes where it is used.
25750 2021-05-10 Uroš Bizjak <ubizjak@gmail.com>
25752 * config/i386/i386-expand.c (ix86_expand_sse_movcc)
25753 <case E_V2SImode>: Force op_true to register.
25755 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25757 * config/arm/iterators.md (MVE_FP_COMPARISONS): New.
25758 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_f<mode>)
25759 (mve_vcmp<mve_cmp_op>q_n_f<mode>): New, merge all vcmp_*f*
25761 (mve_vcmpeqq_f<mode>, mve_vcmpeqq_n_f<mode>, mve_vcmpgeq_f<mode>)
25762 (mve_vcmpgeq_n_f<mode>, mve_vcmpgtq_f<mode>)
25763 (mve_vcmpgtq_n_f<mode>, mve_vcmpleq_f<mode>)
25764 (mve_vcmpleq_n_f<mode>, mve_vcmpltq_f<mode>)
25765 (mve_vcmpltq_n_f<mode>, mve_vcmpneq_f<mode>)
25766 (mve_vcmpneq_n_f<mode>): Remove.
25767 * config/arm/unspecs.md (VCMPEQQ_F, VCMPEQQ_N_F, VCMPGEQ_F)
25768 (VCMPGEQ_N_F, VCMPGTQ_F, VCMPGTQ_N_F, VCMPLEQ_F, VCMPLEQ_N_F)
25769 (VCMPLTQ_F, VCMPLTQ_N_F, VCMPNEQ_F, VCMPNEQ_N_F): Remove.
25771 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25773 * config/arm/iterators.md (MVE_COMPARISONS): New.
25775 (mve_cmp_type): New.
25776 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_<mode>): New, merge all
25778 (mve_vcmpneq_<mode>, mve_vcmpcsq_n_<mode>, mve_vcmpcsq_<mode>)
25779 (mve_vcmpeqq_n_<mode>, mve_vcmpeqq_<mode>, mve_vcmpgeq_n_<mode>)
25780 (mve_vcmpgeq_<mode>, mve_vcmpgtq_n_<mode>, mve_vcmpgtq_<mode>)
25781 (mve_vcmphiq_n_<mode>, mve_vcmphiq_<mode>, mve_vcmpleq_n_<mode>)
25782 (mve_vcmpleq_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
25783 (mve_vcmpneq_n_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
25784 (mve_vcmpneq_n_<mode>): Remove.
25786 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25788 * config/arm/arm_mve.h (__arm_vcmp*): Remove 's' suffix.
25789 * config/arm/arm_mve_builtins.def (vcmp*): Remove 's' suffix.
25790 * config/arm/mve.md (mve_vcmp*): Remove 's' suffix in pattern
25793 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25795 * config/arm/arm_mve_builtins.def (vcmpneq_u): Remove.
25796 (vcmpneq_n_u): Likewise.
25797 (vcmpeqq_u,): Likewise.
25798 (vcmpeqq_n_u): Likewise.
25799 * config/arm/iterators.md (supf): Remove VCMPNEQ_U, VCMPEQQ_U,
25800 VCMPEQQ_N_U and VCMPNEQ_N_U.
25801 * config/arm/mve.md (mve_vcmpneq): Remove <supf> iteration.
25802 (mve_vcmpeqq_n): Likewise.
25803 (mve_vcmpeqq): Likewise.
25804 (mve_vcmpneq_n): Likewise.
25806 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25808 * config/arm/arm_mve.h (__arm_vcmpeq*u*, __arm_vcmpne*u*): Call
25809 the 's' version of the builtin.
25811 2021-05-10 Richard Biener <rguenther@suse.de>
25813 PR tree-optimization/100492
25814 * tree-loop-distribution.c (find_seed_stmts_for_distribution):
25815 Find nothing when the loop contains an irreducible region.
25817 2021-05-10 Richard Biener <rguenther@suse.de>
25819 PR middle-end/100464
25821 * gimple-fold.c (canonicalize_constructor_val): Do not set
25824 2021-05-10 Richard Biener <rguenther@suse.de>
25826 PR tree-optimization/100434
25827 * tree-ssa-dse.c (initialize_ao_ref_for_dse): Handle
25829 (dse_optimize_stmt): Handle call LHS by dropping the
25830 LHS or the whole call if it doesn't have other
25832 (pass_dse::execute): Adjust.
25834 2021-05-10 Martin Liska <mliska@suse.cz>
25836 * Makefile.in: Add missing genversion rule.
25838 2021-05-10 Alex Coplan <alex.coplan@arm.com>
25841 * config/arm/mve.md (*mve_mov<mode>): Simplify output code. Use
25842 vldrw.u32 and vstrw.32 for V2D[IF]mode loads and stores.
25844 2021-05-10 Martin Liska <mliska@suse.cz>
25846 * builtins.c (is_builtin_name): Use startswith
25847 function instead of strncmp.
25848 * collect2.c (main): Likewise.
25849 (has_lto_section): Likewise.
25850 (scan_libraries): Likewise.
25851 * coverage.c (coverage_checksum_string): Likewise.
25852 (coverage_init): Likewise.
25853 * dwarf2out.c (is_cxx): Likewise.
25854 (gen_compile_unit_die): Likewise.
25855 * gcc-ar.c (main): Likewise.
25856 * gcc.c (init_spec): Likewise.
25857 (read_specs): Likewise.
25858 (execute): Likewise.
25859 (check_live_switch): Likewise.
25860 * genattrtab.c (write_attr_case): Likewise.
25861 (IS_ATTR_GROUP): Likewise.
25862 * gencfn-macros.c (main): Likewise.
25863 * gengtype.c (type_for_name): Likewise.
25864 (gen_rtx_next): Likewise.
25865 (get_file_langdir): Likewise.
25866 (write_local): Likewise.
25867 * genmatch.c (get_operator): Likewise.
25868 (get_operand_type): Likewise.
25869 (expr::gen_transform): Likewise.
25870 * genoutput.c (validate_optab_operands): Likewise.
25871 * incpath.c (add_sysroot_to_chain): Likewise.
25872 * langhooks.c (lang_GNU_C): Likewise.
25873 (lang_GNU_CXX): Likewise.
25874 (lang_GNU_Fortran): Likewise.
25875 (lang_GNU_OBJC): Likewise.
25876 * lto-wrapper.c (run_gcc): Likewise.
25877 * omp-general.c (omp_max_simt_vf): Likewise.
25878 * omp-low.c (omp_runtime_api_call): Likewise.
25879 * opts-common.c (parse_options_from_collect_gcc_options): Likewise.
25880 * read-rtl-function.c (function_reader::read_rtx_operand_r): Likewise.
25881 * real.c (real_from_string): Likewise.
25882 * selftest.c (assert_str_startswith): Likewise.
25883 * timevar.c (timer::validate_phases): Likewise.
25884 * tree.c (get_file_function_name): Likewise.
25885 * ubsan.c (ubsan_use_new_style_p): Likewise.
25886 * varasm.c (default_function_rodata_section): Likewise.
25887 (incorporeal_function_p): Likewise.
25888 (default_section_type_flags): Likewise.
25889 * system.h (startswith): Define startswith.
25891 2021-05-10 Martin Liska <mliska@suse.cz>
25893 * bitmap.h (class auto_bitmap): Remove
25894 __cplusplus >= 201103.
25895 * config/aarch64/aarch64.c: Likewise.
25896 * gimple-ssa-store-merging.c (store_immediate_info::store_immediate_info):
25898 * sbitmap.h: Likewise.
25900 2021-05-10 Martin Liska <mliska@suse.cz>
25902 * Makefile.in: Rename gcov-iov to genversion and depend
25903 on version.h (instead of gcov-iov.h).
25904 * gcov-io.h: Include version.h instread of gcov-iov.h.
25905 * gengtype-state.c (read_state_version): Likewise.
25906 * gcov-iov.c: Moved to...
25907 * genversion.c: ...here.
25908 * lto-streamer.h (LTO_major_version): Define it with
25910 * version.c: Removed.
25911 * version.h: Removed.
25913 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25915 * config/arc/arc.md (UNSPEC_ARC_DMPYWH): Define.
25916 * config/arc/simdext.md (VCT): Add predicates for iterator
25919 (voptab): Likewise.
25920 (vec_widen_<V_US>mult_hi_v4hi): Change pattern predicate.
25921 (<voptab>v2si3): New patterns.
25923 (reduc_plus_scal_v4hi): Likewise.
25924 (reduc_plus_scal_v2si): Likewise.
25925 (vec_duplicatev2si): Likewise.
25926 (vec_duplicatev4hi): Likewise.
25928 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25930 * config/arc/simdext.md: Format and cleanup file.
25932 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25934 * config/arc/simdext.md (movmisalignv2hi): Allow misaligned access
25935 only when munaligned-access option is on.
25936 (movmisalign<mode>): Likewise.
25938 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25940 * common/config/arc/arc-common.c (arc_handle_option): Remove dot
25942 * config/arc/arc.c (arc_reorg): Remove underscore from string.
25944 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25946 * config/arc/arc.h (CLZ_DEFINED_VALUE_AT_ZERO): Define.
25947 (CTZ_DEFINED_VALUE_AT_ZERO): Likewise.
25948 * config/arc/arc.md (clrsbsi2): Cleanup pattern.
25949 (norm_f): Likewise.
25952 (clzsi2): Use fls instruction when available.
25953 (arc_clzsi2): Likewise.
25955 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25957 * config/arc/arc.h (ADDITIONAL_REGISTER_NAMES): Add r26 and r27.
25959 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25961 * doc/extend.texi (__builtin_arc_sr): Swap arguments.
25963 2021-05-10 Bernd Edlinger <bernd.edlinger@hotmail.de>
25965 PR middle-end/100467
25966 * toplev.c (compile_file): Call insn_locations_init before
25967 targetm.asm_out.code_end.
25969 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25972 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25974 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
25976 2021-05-07 Jakub Jelinek <jakub@redhat.com>
25977 Andrew Stubbs <amd@codesourcery.com>
25980 * builtins.c (try_store_by_multiple_pieces): Use force_operand for
25981 emit_move_insn operands.
25983 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
25985 * cfgexpand.c (expand_gimple_basic_block): Do not inherit a current
25986 location for the outgoing edges of an empty block.
25987 * dwarf2out.c (add_subscript_info): Retrieve the bounds and index
25988 type by means of the get_array_descr_info langhook, if it is set and
25989 returns true. Remove obsolete code dealing with unnamed subtypes.
25991 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25993 * gimple-range-cache.cc (ssa_block_ranges): Virtualize.
25994 (sbr_vector): Renamed from ssa_block_cache.
25995 (sbr_vector::sbr_vector): Allocate from obstack abd initialize.
25996 (ssa_block_ranges::~ssa_block_ranges): Remove.
25997 (sbr_vector::set_bb_range): Use varying and undefined cached values.
25998 (ssa_block_ranges::set_bb_varying): Remove.
25999 (sbr_vector::get_bb_range): Adjust assert.
26000 (sbr_vector::bb_range_p): Adjust assert.
26001 (~block_range_cache): No freeing loop required.
26002 (block_range_cache::get_block_ranges): Remove.
26003 (block_range_cache::set_bb_range): Inline get_block_ranges.
26004 (block_range_cache::set_bb_varying): Remove.
26005 * gimple-range-cache.h (set_bb_varying): Remove prototype.
26006 * value-range.h (irange_allocator::get_memory): New.
26008 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
26010 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Search
26011 dominator tree is available and requested.
26012 (ranger_cache::ssa_range_in_bb): Don't search dom tree here.
26013 (ranger_cache::fill_block_cache): Don't search dom tree here either.
26014 * gimple-range-cache.h (non_null_deref_p): Add dom_search param.
26016 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
26018 * gimple-range.cc (gimple_ranger::range_on_exit): Handle block with
26019 only PHI nodes better.
26021 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
26023 * gimple-range-edge.h (gimple_outgoing_range): Rename from
26025 (gcond_edge_range): Export prototype.
26026 * gimple-range-edge.cc (gcond_edge_range): New.
26027 (gimple_outgoing_range::edge_range_p): Use gcond_edge_range.
26028 * gimple-range-gori.h (gori_compute): Use gimple_outgoing_range.
26030 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
26032 * gimple-range-edge.cc (outgoing_range::calc_switch_ranges): Compute
26033 default range into a temp and allocate only what is needed.
26035 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
26037 * range-op.cc (operator_trunc_mod::wi_fold): x % 0 is UNDEFINED.
26039 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
26041 * gimple-range.h (gimple_range_global): Pick up parameter initial
26042 values, and use-before defined locals are UNDEFINED.
26044 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
26046 * doc/extend.texi (scalar_storage_order): Mention effect on pointer
26048 * tree.h (reverse_storage_order_for_component_p): Return false if
26049 the type is a pointer.
26051 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
26053 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
26055 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
26058 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
26059 Handle V8QI, V4HI and V2SI modes.
26060 * config/i386/mmx.md (mmx_pblendvb): New insn pattern.
26061 * config/i386/sse.md (unspec): Move UNSPEC_BLENDV ...
26062 * config/i386/i386.md (unspec): ... here.
26064 2021-05-07 Tobias Burnus <tobias@codesourcery.com>
26065 Tom de Vries <tdevries@suse.de>
26067 * omp-low.c (lower_rec_simd_input_clauses): Set max_vf = 1 if
26068 a truth_value_p reduction variable is nonintegral.
26070 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
26073 * config/i386/i386-expand.c (ix86_use_mask_cmp_p):
26074 Return false for mode sizes < 16.
26076 2021-05-07 Jakub Jelinek <jakub@redhat.com>
26079 * config/i386/mmx.md (*xop_pcmov_<mode>): New define_insn.
26081 2021-05-06 Martin Jambor <mjambor@suse.cz>
26083 * ipa-sra.c (ipa_sra_dump_all_summaries): Dump edge summaries even
26084 when there is no function summary.
26085 (ipa_sra_summarize_function): produce edge summaries even when
26088 2021-05-06 Tom Tromey <tom@tromey.com>
26090 * godump.c (string_hash_eq): Remove.
26091 (go_finish): Use htab_eq_string.
26093 2021-05-06 Tom Tromey <tom@tromey.com>
26095 * gengtype-state.c (read_state): Use htab_eq_string.
26096 (string_eq): Remove.
26098 2021-05-06 Tom Tromey <tom@tromey.com>
26100 * gensupport.c (htab_eq_string): Remove.
26102 2021-05-06 Bernd Edlinger <bernd.edlinger@hotmail.de>
26105 * debug.h (gcc_debug_hooks): Add set_ignored_loc function pointer.
26106 * dwarf2out.h (dw_fde_node::ignored_debug): New data item.
26107 * dbxout.c (dbx_debug_hooks, xcoff_debug_hooks): Add dummy
26108 set_ignored_loc callbacks.
26109 * debug.c (do_nothing_debug_hooks): Likewise.
26110 * vmsdbgout.c (vmsdbg_debug_hooks): Likewise.
26111 * dwarf2out.c (text_section_used, cold_text_section_used): Remove.
26112 (in_text_section_p, last_text_label, last_cold_label,
26113 switch_text_ranges, switch_cold_ranges): New data items.
26114 (dwarf2out_note_section_used): Remove.
26115 (dwarf2out_begin_prologue): Set fde->ignored_debug and
26117 (mark_ignored_debug_section): New helper function.
26118 (dwarf2out_end_epilogue, dwarf2out_switch_text_section): Call
26119 mark_ignored_debug_section.
26120 (dwarf2_debug_hooks): Use dwarf2out_set_ignored_loc.
26121 (dwarf2_lineno_debug_hooks): Use dummy for set_ignored_loc.
26122 (size_of_aranges): Adjust formula for multi-part text ranges size.
26123 (output_aranges): Output multi-part text ranges.
26124 (dwarf2out_set_ignored_loc): New callback function.
26125 (dwarf2out_finish): Output multi-part text ranges.
26126 (dwarf2out_c_finalize): Clear new data items.
26127 * final.c (final_start_function_1): Call set_ignored_loc callback.
26128 (final_scan_insn_1): Likewise.
26129 * ggc-page.c (gt_ggc_mx): New helper function.
26130 * stringpool.c (gt_pch_nx): Likewise.
26132 2021-05-06 Richard Biener <rguenther@suse.de>
26134 * timevar.def (TV_TREE_INSERT_PHI_NODES): Remove.
26135 (TV_TREE_SSA_REWRITE_BLOCKS): Likewise.
26136 (TV_TREE_INTO_SSA): New.
26137 * tree-into-ssa.c (insert_phi_nodes): Do not account separately.
26138 (rewrite_blocks): Likewise.
26139 (pass_data_build_ssa): Account to TV_TREE_INTO_SSA.
26141 2021-05-06 Jakub Jelinek <jakub@redhat.com>
26143 * tree-ssa-phiopt.c (value_replacement, minmax_replacement,
26144 abs_replacement, xor_replacement,
26145 cond_removal_in_popcount_clz_ctz_pattern,
26146 replace_phi_edge_with_variable): Change type of phi argument from
26147 gimple * to gphi *.
26149 2021-05-06 Richard Biener <rguenther@suse.de>
26151 * tree-ssa-loop-split.c (split_loop): Delay updating SSA form.
26152 Output an opt-info message.
26153 (do_split_loop_on_cond): Likewise.
26154 (tree_ssa_split_loops): Update SSA form here.
26156 2021-05-06 Richard Biener <rguenther@suse.de>
26158 * tree-inline.c (tree_function_versioning): Fix DECL_BY_REFERENCE
26159 return variable removal.
26161 2021-05-06 Marius Hillenbrand <mhillen@linux.ibm.com>
26163 * config/s390/s390-builtins.def (O_M5, O1_M5, ...): Remove unused macros.
26164 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
26165 (s390_vec_permi_dbl, s390_vpdi): Use the O3_U2 type for the immediate
26167 * config/s390/s390.c (s390_const_operand_ok): Remove unused
26170 2021-05-06 Jakub Jelinek <jakub@redhat.com>
26172 PR tree-optimization/94589
26173 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Call
26174 spaceship_replacement.
26175 (cond_only_block_p, spaceship_replacement): New functions.
26177 2021-05-06 Richard Biener <rguenther@suse.de>
26180 * tree-emutls.c (gen_emutls_addr): Pass in whether we're
26181 dealing with a debug use and only query existing addresses
26183 (lower_emutls_1): Avoid splitting out addresses for debug
26184 stmts, reset the debug stmt when we fail to find existing
26186 (lower_emutls_phi_arg): Set wi.stmt.
26188 2021-05-06 Christoph Muellner <cmuellner@gcc.gnu.org>
26191 * config/riscv/riscv.c (riscv_block_move_loop): Use cbranch helper.
26192 * config/riscv/riscv.md (cbranch<mode>4): Generate helpers.
26193 (stack_protect_test): Use cbranch helper.
26195 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
26198 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
26199 always return the establisher frame for __builtin_frame_address (0).
26201 2021-05-05 Ivan Sorokin <vanyacpp@gmail.com>
26204 * config/i386/i386-builtins.c (ix86_cpu_model_type_node): New.
26205 (ix86_cpu_model_var): Likewise.
26206 (ix86_cpu_features2_type_node): Likewise.
26207 (ix86_cpu_features2_var): Likewise.
26208 (fold_builtin_cpu): Cache __cpu_model and __cpu_features2 with
26211 2021-05-05 Martin Sebor <msebor@redhat.com>
26213 * passes.def (pass_warn_printf): Run after SSA.
26215 2021-05-05 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
26217 * config/arm/neon.md (neon_vtst_combine<mode>): New pattern.
26218 * config/arm/predicates.md (minus_one_operand): New predicate.
26220 2021-05-05 Jeff Law <jlaw@tachyum.com>
26222 * config/avr/avr.md: Remove references to CC_STATUS_INIT.
26224 2021-05-05 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
26226 PR rtl-optimization/100263
26227 * postreload.c (move2add_valid_value_p): Ensure register can
26230 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
26232 PR rtl-optimization/100411
26233 * cfgcleanup.c (try_crossjump_to_edge): Also skip end of prologue
26234 and beginning of function markers.
26236 2021-05-05 Jeff Law <jlaw@tachyum.com>
26238 * config/cr16/cr16.h (NOTICE_UPDATE_CC): Remove.
26239 * config/cr16/cr16.c (notice_update_cc): Remove.
26240 * config/cr16/cr16-protos.h (notice_update_cc): Remove.
26242 2021-05-05 Uroš Bizjak <ubizjak@gmail.com>
26245 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
26246 Handle V8QI, V4HI and V2SI modes.
26247 * config/i386/i386.c (ix86_build_const_vector): Handle V2SImode.
26248 (ix86_build_signbit_mask): Ditto.
26249 * config/i386/mmx.md (MMXMODE14): New mode iterator.
26250 (<smaxmin:code><MMXMODE14:mode>3): New expander.
26251 (*mmx_<smaxmin:code><MMXMODE14:mode>3): New insn pattern.
26252 (<umaxmin:code><MMXMODE24:mode>3): New expander.
26253 (*mmx_<umaxmin:code><MMXMODE24:mode>3): New insn pattern.
26254 (vec_cmp<MMXMODEI:mode><MMXMODEI:mode>): New expander.
26255 (vec_cmpu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
26256 (vcond<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
26257 (vcondu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
26258 (vcond_mask_<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
26260 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
26262 * dwarf2out.c (loc_list_from_tree_1) <DECL>: During early DWARF, do
26263 not expand the VALUE_EXPR of variables put in the non-local frame.
26264 * gimplify.c (gimplify_type_sizes) <RECORD_TYPE>: If the type is not
26265 to be ignored for debug info, ensure its variable offsets are not.
26267 2021-05-05 Richard Biener <rguenther@suse.de>
26269 PR tree-optimization/79333
26270 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
26271 Fold stmt following SSA edges.
26273 2021-05-05 Richard Biener <rguenther@suse.de>
26275 PR middle-end/100394
26276 * calls.c (expand_call): Preserve possibly throwing calls.
26277 * cfgexpand.c (expand_call_stmt): When a call can throw signal
26278 RTL expansion there are side-effects.
26279 * tree-ssa-dce.c (mark_stmt_if_obviously_necessary): Simplify,
26280 mark all possibly throwing stmts necessary unless we can elide
26282 * tree-ssa-dse.c (pass_dse::execute): Preserve exceptions unless
26283 -fdelete-dead-exceptions.
26284 * tree.h (DECL_PURE_P): Add note about exceptions.
26286 2021-05-05 Alexandre Oliva <oliva@adacore.com>
26288 * config/i386/vxworks.h (DBX_REGISTER_NUMBER): Make it
26291 2021-05-04 David Edelsohn <dje.gcc@gmail.com>
26293 * config/rs6000/rs6000-call.c (rs6000_output_mi_thunk): Use
26294 get_fnname_from_decl for name of thunk.
26295 * config/rs6000/rs6000.c (rs6000_declare_alias): Use assemble_name
26296 and ASM_OUTPUT_LABEL.
26297 (rs6000_xcoff_declare_function_name): Use assemble_name and
26299 (rs6000_xcoff_declare_object_name): Use ASM_OUTPUT_LABEL.
26300 (rs6000_xcoff_encode_section_info): Don't add mapping class
26301 for aliases. Always add [DS] mapping class to primary
26303 (rs6000_asm_weaken_decl): Don't explicitly add [DS].
26305 2021-05-04 Martin Sebor <msebor@redhat.com>
26307 PR middle-end/100307
26308 * builtins.c (compute_objsize_r): Clear base0 for pointers.
26310 2021-05-04 Jeff Law <jlaw@tachyum.com>
26312 * config/bfin/bfin.h (NOTICE_UPDATE_CC): Remove.
26314 2021-05-04 Segher Boessenkool <segher@kernel.crashing.org>
26316 * caller-save.c: Remove CC0.
26317 * cfgcleanup.c: Remove CC0.
26318 * cfgrtl.c: Remove CC0.
26319 * combine.c: Remove CC0.
26320 * compare-elim.c: Remove CC0.
26321 * conditions.h: Remove CC0.
26322 * config/h8300/h8300.h: Remove CC0.
26323 * config/h8300/h8300-protos.h: Remove CC0.
26324 * config/h8300/peepholes.md: Remove CC0.
26325 * config/i386/x86-tune-sched.c: Remove CC0.
26326 * config/m68k/m68k.c: Remove CC0.
26327 * config/rl78/rl78.c: Remove CC0.
26328 * config/sparc/sparc.c: Remove CC0.
26329 * config/xtensa/xtensa.c: Remove CC0.
26330 (gen_conditional_move): Use pc_rtx instead of cc0_rtx in a piece of
26331 RTL where that is used as a placeholder only.
26332 * cprop.c: Remove CC0.
26333 * cse.c: Remove CC0.
26334 * cselib.c: Remove CC0.
26335 * df-problems.c: Remove CC0.
26336 * df-scan.c: Remove CC0.
26337 * doc/md.texi: Remove CC0. Adjust an example.
26338 * doc/rtl.texi: Remove CC0. Adjust an example.
26339 * doc/tm.texi: Regenerate.
26340 * doc/tm.texi.in: Remove CC0.
26341 * emit-rtl.c: Remove CC0.
26342 * final.c: Remove CC0.
26343 * fwprop.c: Remove CC0.
26344 * gcse-common.c: Remove CC0.
26345 * gcse.c: Remove CC0.
26346 * genattrtab.c: Remove CC0.
26347 * genconfig.c: Remove CC0.
26348 * genemit.c: Remove CC0.
26349 * genextract.c: Remove CC0.
26350 * gengenrtl.c: Remove CC0.
26351 * genrecog.c: Remove CC0.
26352 * haifa-sched.c: Remove CC0.
26353 * ifcvt.c: Remove CC0.
26354 * ira-costs.c: Remove CC0.
26355 * ira.c: Remove CC0.
26356 * jump.c: Remove CC0.
26357 * loop-invariant.c: Remove CC0.
26358 * lra-constraints.c: Remove CC0.
26359 * lra-eliminations.c: Remove CC0.
26360 * optabs.c: Remove CC0.
26361 * postreload-gcse.c: Remove CC0.
26362 * postreload.c: Remove CC0.
26363 * print-rtl.c: Remove CC0.
26364 * read-rtl-function.c: Remove CC0.
26365 * reg-notes.def: Remove CC0.
26366 * reg-stack.c: Remove CC0.
26367 * reginfo.c: Remove CC0.
26368 * regrename.c: Remove CC0.
26369 * reload.c: Remove CC0.
26370 * reload1.c: Remove CC0.
26371 * reorg.c: Remove CC0.
26372 * resource.c: Remove CC0.
26373 * rtl.c: Remove CC0.
26374 * rtl.def: Remove CC0.
26375 * rtl.h: Remove CC0.
26376 * rtlanal.c: Remove CC0.
26377 * sched-deps.c: Remove CC0.
26378 * sched-rgn.c: Remove CC0.
26379 * shrink-wrap.c: Remove CC0.
26380 * simplify-rtx.c: Remove CC0.
26381 * system.h: Remove CC0. Poison NOTICE_UPDATE_CC, CC_STATUS_MDEP_INIT,
26382 CC_STATUS_MDEP, and CC_STATUS.
26383 * target.def: Remove CC0.
26384 * valtrack.c: Remove CC0.
26385 * var-tracking.c: Remove CC0.
26387 2021-05-04 Richard Biener <rguenther@suse.de>
26389 PR tree-optimization/100414
26390 * tree-ssa-phiopt.c (get_non_trapping): Do not compute dominance
26392 (tree_ssa_phiopt_worker): But unconditionally here.
26394 2021-05-04 Tobias Burnus <tobias@codesourcery.com>
26396 * omp-low.c (lower_rec_input_clauses, lower_reduction_clauses): Handle
26397 && and || with floating-point and complex arguments.
26399 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
26401 * tree-inline.c (insert_debug_decl_map): Delete.
26402 (copy_debug_stmt): Minor tweak.
26403 (setup_one_parameter): Do not use a variable if the value is either
26404 a read-only DECL or a non-addressable local variable in the caller.
26405 In this case, insert the debug-only variable in the map manually.
26406 (expand_call_inline): Do not generate a CLOBBER for these values.
26407 * tree-inline.h (debug_map): Minor tweak.
26409 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
26411 * builtins.c (builtin_with_linkage_p): Return true for stp[n]cpy.
26412 * symtab.c (symtab_node::output_to_lto_symbol_table_p): Tidy up.
26414 2021-05-04 Richard Biener <rguenther@suse.de>
26416 PR tree-optimization/100329
26417 * tree-ssa-reassoc.c (can_reassociate_p): Do not reassociate
26419 (insert_stmt_after): Assert we're not running into asm goto.
26421 2021-05-04 Richard Biener <rguenther@suse.de>
26423 PR tree-optimization/100398
26424 * tree-ssa-dse.c (pass_dse::execute): Preserve control
26427 2021-05-04 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
26429 * builtins.c (try_store_by_multiple_pieces): Fix constfun's prototype.
26431 2021-05-04 Alexandre Oliva <oliva@adacore.com>
26433 * builtins.c (try_store_by_multiple_pieces): New.
26434 (expand_builtin_memset_args): Use it. If target_char_cast
26435 fails, proceed as for non-constant val. Pass len's ctz to...
26436 * expr.c (clear_storage_hints): ... this. Try store by
26437 multiple pieces after setmem.
26438 (clear_storage): Adjust.
26439 * expr.h (clear_storage_hints): Likewise.
26440 (try_store_by_multiple_pieces): Declare.
26441 * passes.def: Replace the last copy_prop with ccp.
26443 2021-05-03 Tom de Vries <tdevries@suse.de>
26446 * omp-low.c (lower_rec_input_clauses): Disable SIMT for user-defined
26449 2021-05-03 Richard Biener <rguenther@suse.de>
26451 * tree-ssa-dse.c (dse_classify_store): Track two PHI defs.
26453 2021-05-03 Richard Biener <rguenther@suse.de>
26455 * tree-ssa-dse.c: Do not include domwalk.h but cfganal.h.
26456 (dse_dom_walker): Remove.
26457 (dse_dom_walker::dse_optimize_stmt): Rename...
26458 (dse_optimize_stmt): ... to this, pass in live_bytes sbitmap.
26459 (dse_dom_walker::before_dom_children): Inline ...
26460 (pass_dse::execute): ... here. Perform a reverse program
26463 2021-05-03 H.J. Lu <hjl.tools@gmail.com>
26466 * configure: Regenerated.
26468 2021-05-03 Ilya Leoshkevich <iii@linux.ibm.com>
26471 * config/s390/s390.c (s390_hard_fp_reg_p): New function.
26472 (s390_md_asm_adjust): Handle hard registers.
26474 2021-05-03 Jakub Jelinek <jakub@redhat.com>
26476 PR tree-optimization/100382
26477 * tree-ssa-dse.c: Include tree-eh.h.
26478 (dse_dom_walker::before_dom_children): Don't remove stmts if
26479 stmt_unremovable_because_of_non_call_eh_p is true.
26481 2021-05-02 David Edelsohn <dje.gcc@gmail.com>
26483 * varasm.c (compute_reloc_for_var): Split out from...
26484 (get_variable_section): Use it.
26485 * output.h (compute_reloc_for_var): Declare.
26486 * config/rs6000/rs6000-protos.h
26487 (rs6000_xcoff_asm_output_aligned_decl_common): Change alignment to
26489 * config/rs6000/rs6000.c (rs6000_legitimize_tls_address_aix):
26490 Don't append storage mapping class to symbol.
26491 (rs6000_xcoff_asm_named_section): Add BS and UL mapping classes.
26492 Don't convert TLS BSS to common.
26493 (rs6000_xcoff_unique_section): Don't fall back to select_secton.
26494 (rs6000_xcoff_section_type_flags): Add SECTION_BSS if DECL is
26496 (rs6000_xcoff_asm_globalize_decl_name): Don't strip storage
26498 (rs6000_xcoff_asm_output_aligned_decl_common): Align is unsigned int.
26499 If align is 0 from TLS class, use the same rules as varasm.c
26500 If not common, switch to BSS section manually.
26501 If common, emit appropriate comm or lcomm directive.
26502 (rs6000_xcoff_encode_section_info): Add logic to append all
26503 storage mapping classes.
26504 (rs6000_asm_weaken_decl): Adjust for qualname symbols.
26505 * config/rs6000/xcoff.h (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
26506 rs6000_xcoff_asm_output_aligned_decl_common.
26507 (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
26508 rs6000_xcoff_asm_output_aligned_decl_common.
26509 (ASM_OUTPUT_TLS_COMMON): Use
26510 rs6000_xcoff_asm_output_aligned_decl_common.
26512 2021-05-02 Jakub Jelinek <jakub@redhat.com>
26515 * config/nvptx/nvptx.c (nvptx_sese_pseudo): Use nullptr instead of 0
26516 as first argument of pseudo_node_t constructors.
26518 2021-05-02 Jakub Jelinek <jakub@redhat.com>
26521 * config/i386/t-i386 (TM_H): Add $(srcdir)/config/i386/i386-isa.def.
26523 2021-05-01 Aldy Hernandez <aldyh@redhat.com>
26525 * value-range.cc (DEFINE_INT_RANGE_GC_STUBS): Remove.
26526 (gt_pch_nx (int_range<1> *&)): New.
26527 (gt_ggc_mx (int_range<1> *&)): New.
26528 * value-range.h (class irange): Add GTY support for
26531 2021-05-01 Geng Qi <gengqi@linux.alibaba.com>
26533 * doc/options.texi (Negative): Change either or to both and.
26535 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26537 * config/aarch64/aarch64-simd-builtins.def: Add
26538 float_ml[as][q]_laneq builtin generator macros.
26539 * config/aarch64/aarch64-simd.md (mul_laneq<mode>3): Define.
26540 (aarch64_float_mla_laneq<mode>): Define.
26541 (aarch64_float_mls_laneq<mode>): Define.
26542 * config/aarch64/arm_neon.h (vmla_laneq_f32): Use RTL builtin
26543 instead of GCC vector extensions.
26544 (vmlaq_laneq_f32): Likewise.
26545 (vmls_laneq_f32): Likewise.
26546 (vmlsq_laneq_f32): Likewise.
26548 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26550 * config/aarch64/aarch64-simd-builtins.def: Add
26551 float_ml[as]_lane builtin generator macros.
26552 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt<mode>):
26554 (mul_lane<mode>3): This, and re-order arguments.
26555 (aarch64_float_mla_lane<mode>): Define.
26556 (aarch64_float_mls_lane<mode>): Define.
26557 * config/aarch64/arm_neon.h (vmla_lane_f32): Use RTL builtin
26558 instead of GCC vector extensions.
26559 (vmlaq_lane_f32): Likewise.
26560 (vmls_lane_f32): Likewise.
26561 (vmlsq_lane_f32): Likewise.
26563 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26565 * config/aarch64/aarch64-simd-builtins.def: Add float_ml[as]
26566 builtin generator macros.
26567 * config/aarch64/aarch64-simd.md (aarch64_float_mla<mode>):
26569 (aarch64_float_mls<mode>): Define.
26570 * config/aarch64/arm_neon.h (vmla_f32): Use RTL builtin
26571 instead of relying on GCC vector extensions.
26572 (vmla_f64): Likewise.
26573 (vmlaq_f32): Likewise.
26574 (vmlaq_f64): Likewise.
26575 (vmls_f32): Likewise.
26576 (vmls_f64): Likewise.
26577 (vmlsq_f32): Likewise.
26578 (vmlsq_f64): Likewise.
26579 * config/aarch64/iterators.md: Define VDQF_DF mode iterator.
26581 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26583 * config/aarch64/aarch64-simd-builtins.def: Add
26584 float_ml[as]_n_builtin generator macros.
26585 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt_from_dup<mode>):
26587 (mul_n<mode>3): This, and re-order arguments.
26588 (aarch64_float_mla_n<mode>): Define.
26589 (aarch64_float_mls_n<mode>): Define.
26590 * config/aarch64/arm_neon.h (vmla_n_f32): Use RTL builtin
26591 instead of inline asm.
26592 (vmlaq_n_f32): Likewise.
26593 (vmls_n_f32): Likewise.
26594 (vmlsq_n_f32): Likewise.
26596 2021-04-30 Jonathan Wright <joanthan.wright@arm.com>
26598 * config/aarch64/aarch64-simd-builtins.def: Add pmull[2]
26599 builtin generator macros.
26600 * config/aarch64/aarch64-simd.md (aarch64_pmullv8qi): Define.
26601 (aarch64_pmull_hiv16qi_insn): Define.
26602 (aarch64_pmull_hiv16qi): Define.
26603 * config/aarch64/arm_neon.h (vmull_high_p8): Use RTL builtin
26604 instead of inline asm.
26605 (vmull_p8): Likewise.
26607 2021-04-30 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
26609 * config/avr/avr.md: Adjust peepholes to match and
26610 generate parallels with clobber of REG_CC.
26611 (mov<mode>_insn): Rename to mov<mode>_insn_split.
26612 (*mov<mode>_insn): Rename to mov<mode>_insn.
26614 2021-04-30 David Edelsohn <dje.gcc@gmail.com>
26616 * varasm.c (use_blocks_for_decl_p): Don't use section anchors
26617 for VAR_DECLs if -fdata-sections enabled.
26619 2021-04-30 Michael Meissner <meissner@linux.ibm.com>
26621 PR bootstrap/100327
26622 * config/rs6000/rs6000.c
26623 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Define.
26624 (rs6000_libgcc_floating_mode_supported_p): New target hook.
26626 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
26628 * tree-ssa-threadbackward.c (class thread_jumps): Split out code
26630 (class back_threader_registry): ...to here...
26631 (class back_threader_profitability): ...and here...
26632 (thread_jumps::thread_through_all_blocks): Remove argument.
26633 (back_threader_registry::back_threader_registry): New.
26634 (back_threader_registry::~back_threader_registry): New.
26635 (back_threader_registry::thread_through_all_blocks): New.
26636 (thread_jumps::profitable_jump_thread_path): Move from here...
26637 (back_threader_profitability::profitable_path_p): ...to here.
26638 (thread_jumps::find_taken_edge): New.
26639 (thread_jumps::convert_and_register_current_path): Move...
26640 (back_threader_registry::register_path): ...to here.
26641 (thread_jumps::register_jump_thread_path_if_profitable): Move...
26642 (thread_jumps::maybe_register_path): ...to here.
26643 (thread_jumps::handle_phi): Call find_taken_edge and
26644 maybe_register_path.
26645 (thread_jumps::handle_assignment): Same.
26646 (thread_jumps::fsm_find_control_statement_thread_paths): Remove
26647 tree argument to handle_phi and handle_assignment.
26648 (thread_jumps::find_jump_threads_backwards): Set m_name. Remove
26649 set of m_speed_p and m_max_threaded_paths.
26650 (pass_thread_jumps::execute): Remove second argument from
26651 find_jump_threads_backwards.
26652 (pass_early_thread_jumps::execute): Same.
26654 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
26656 * tree-ssa-dom.c (class dom_jump_threader_simplifier): New.
26657 (class dom_opt_dom_walker): Initialize some class variables.
26658 (pass_dominator::execute): Pass evrp_range_analyzer and
26659 dom_jump_threader_simplifier to dom_opt_dom_walker.
26660 Adjust for some functions moving into classes.
26661 (simplify_stmt_for_jump_threading): Adjust and move to...
26662 (jump_threader_simplifier::simplify): ...here.
26663 (dom_opt_dom_walker::before_dom_children): Adjust for
26664 m_evrp_range_analyzer.
26665 (dom_opt_dom_walker::after_dom_children): Remove x_vr_values hack.
26666 (test_for_singularity): Place in dom_opt_dom_walker class.
26667 (dom_opt_dom_walker::optimize_stmt): The argument
26668 evrp_range_analyzer is now a class field.
26669 * tree-ssa-threadbackward.c (class thread_jumps): Add m_registry.
26670 (thread_jumps::thread_through_all_blocks): New.
26671 (thread_jumps::convert_and_register_current_path): Use m_registry.
26672 (pass_thread_jumps::execute): Adjust for thread_through_all_blocks
26673 being in the threader class.
26674 (pass_early_thread_jumps::execute): Same.
26675 * tree-ssa-threadedge.c (threadedge_initialize_values): Move...
26676 (jump_threader::jump_threader): ...here.
26677 (threadedge_finalize_values): Move...
26678 (jump_threader::~jump_threader): ...here.
26679 (jump_threader::remove_jump_threads_including): New.
26680 (jump_threader::thread_through_all_blocks): New.
26681 (record_temporary_equivalences_from_phis): Move...
26682 (jump_threader::record_temporary_equivalences_from_phis): ...here.
26683 (record_temporary_equivalences_from_stmts_at_dest): Move...
26684 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
26686 (simplify_control_stmt_condition_1): Move to jump_threader class.
26687 (simplify_control_stmt_condition): Move...
26688 (jump_threader::simplify_control_stmt_condition): ...here.
26689 (thread_around_empty_blocks): Move...
26690 (jump_threader::thread_around_empty_blocks): ...here.
26691 (thread_through_normal_block): Move...
26692 (jump_threader::thread_through_normal_block): ...here.
26693 (thread_across_edge): Move...
26694 (jump_threader::thread_across_edge): ...here.
26695 (thread_outgoing_edges): Move...
26696 (jump_threader::thread_outgoing_edges): ...here.
26697 * tree-ssa-threadedge.h: Move externally facing functings...
26698 (class jump_threader): ...here...
26699 (class jump_threader_simplifier): ...and here.
26700 * tree-ssa-threadupdate.c (struct redirection_data): Remove comment.
26701 (jump_thread_path_allocator::jump_thread_path_allocator): New.
26702 (jump_thread_path_allocator::~jump_thread_path_allocator): New.
26703 (jump_thread_path_allocator::allocate_thread_edge): New.
26704 (jump_thread_path_allocator::allocate_thread_path): New.
26705 (jump_thread_path_registry::jump_thread_path_registry): New.
26706 (jump_thread_path_registry::~jump_thread_path_registry): New.
26707 (jump_thread_path_registry::allocate_thread_edge): New.
26708 (jump_thread_path_registry::allocate_thread_path): New.
26709 (dump_jump_thread_path): Make extern.
26710 (debug (const vec<jump_thread_edge *> &path)): New.
26711 (struct removed_edges): Move to tree-ssa-threadupdate.h.
26712 (struct thread_stats_d): Remove.
26713 (remove_ctrl_stmt_and_useless_edges): Make static.
26714 (lookup_redirection_data): Move...
26715 (jump_thread_path_registry::lookup_redirection_data): ...here.
26716 (ssa_redirect_edges): Make static.
26717 (thread_block_1): Move...
26718 (jump_thread_path_registry::thread_block_1): ...here.
26719 (thread_block): Move...
26720 (jump_thread_path_registry::thread_block): ...here.
26721 (thread_through_loop_header): Move...
26722 (jump_thread_path_registry::thread_through_loop_header): ...here.
26723 (mark_threaded_blocks): Move...
26724 (jump_thread_path_registry::mark_threaded_blocks): ...here.
26725 (debug_path): Move...
26726 (jump_thread_path_registry::debug_path): ...here.
26727 (debug_all_paths): Move...
26728 (jump_thread_path_registry::dump): ..here.
26729 (rewire_first_differing_edge): Move...
26730 (jump_thread_path_registry::rewire_first_differing_edge): ...here.
26731 (adjust_paths_after_duplication): Move...
26732 (jump_thread_path_registry::adjust_paths_after_duplication): ...here.
26733 (duplicate_thread_path): Move...
26734 (jump_thread_path_registry::duplicate_thread_path): ..here.
26735 (remove_jump_threads_including): Move...
26736 (jump_thread_path_registry::remove_jump_threads_including): ...here.
26737 (thread_through_all_blocks): Move to...
26738 (jump_thread_path_registry::thread_through_all_blocks): ...here.
26739 (delete_jump_thread_path): Remove.
26740 (register_jump_thread): Move...
26741 (jump_thread_path_registry::register_jump_thread): ...here.
26742 * tree-ssa-threadupdate.h: Move externally facing functions...
26743 (class jump_thread_path_allocator): ...here...
26744 (class jump_thread_path_registry): ...and here.
26745 (thread_through_all_blocks): Remove.
26746 (struct removed_edges): New.
26747 (register_jump_thread): Remove.
26748 (remove_jump_threads_including): Remove.
26749 (delete_jump_thread_path): Remove.
26750 (remove_ctrl_stmt_and_useless_edges): Remove.
26751 (free_dom_edge_info): New prototype.
26752 * tree-vrp.c: Remove x_vr_values hack.
26753 (class vrp_jump_threader_simplifier): New.
26754 (vrp_jump_threader_simplifier::simplify): New.
26755 (vrp_jump_threader::vrp_jump_threader): Adjust method signature.
26756 Remove m_dummy_cond.
26757 Instantiate m_simplifier and m_threader.
26758 (vrp_jump_threader::thread_through_all_blocks): New.
26759 (vrp_jump_threader::simplify_stmt): Remove.
26760 (vrp_jump_threader::after_dom_children): Do not set m_dummy_cond.
26761 Remove x_vr_values hack.
26762 (execute_vrp): Adjust for thread_through_all_blocks being in a
26765 2021-04-30 Christophe Lyon <christophe.lyon@linaro.org>
26767 * genflags.c (gen_insn): Print failed expansion string.
26769 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
26771 * expr.c (alignment_for_piecewise_move): Call mode_for_size
26772 without limit to MAX_FIXED_MODE_SIZE.
26774 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
26776 PR middle-end/90773
26777 * builtins.c (builtin_memset_gen_str): Don't use return from
26778 simplify_gen_subreg.
26780 2021-04-30 Uroš Bizjak <ubizjak@gmail.com>
26783 * config/i386/i386.md (*add<mode>3_carry_0r): New insn pattern.
26784 (*addsi3_carry_zext_0r): Ditto.
26785 (*sub<mode>3_carry_0): Ditto.
26786 (*subsi3_carry_zext_0r): Ditto.
26787 * config/i386/predicates.md (ix86_carry_flag_unset_operator):
26789 * config/i386/i386.c (ix86_rtx_costs) <case PLUS, case MINUS>:
26790 Also consider ix86_carry_flag_unset_operator to calculate
26791 the cost of adc/sbb insn.
26793 2021-04-30 Roman Zhuykov <zhroma@ispras.ru>
26795 PR rtl-optimization/100225
26796 PR rtl-optimization/84878
26797 * modulo-sched.c (sms_schedule): Use note_stores to skip loops
26798 where we have an instruction which touches (writes) any hard
26799 register from df->regular_block_artificial_uses set.
26800 Allow not-single-set instruction only right before basic block
26803 2021-04-30 Geng Qi <gengqi@linux.alibaba.com>
26805 * config/riscv/riscv.opt (march=,mabi=): Negative itself.
26807 2021-04-30 LevyHsu <admin@levyhsu.com>
26809 * config/riscv/riscv.c (riscv_min_arithmetic_precision): New.
26810 * config/riscv/riscv.h (TARGET_MIN_ARITHMETIC_PRECISION): New.
26811 * config/riscv/riscv.md (addv<mode>4, uaddv<mode>4): New.
26812 (subv<mode>4, usubv<mode>4, mulv<mode>4, umulv<mode>4): New.
26814 2021-04-29 Alexandre Oliva <oliva@adacore.com>
26816 * config.gcc: Merged x86 and x86_64 cpu_type-setting cases.
26818 2021-04-29 Alexandre Oliva <oliva@adacore.com>
26820 * config/i386/i386.h (ASM_OUTPUT_MAX_SKIP_PAD): Rename to...
26821 (ASM_OUTPUT_MAX_SKIP_ALIGN): ... this. Enclose in do/while(0).
26822 * config/i386/i386.c: Adjust.
26823 * config/i386/i386.md: Adjust.
26824 * config/i386/darwin.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Drop.
26825 * config/i386/dragonfly.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26826 * config/i386/freebsd.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26827 * config/i386/gas.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26828 * config/i386/gnu-user.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26829 * config/i386/iamcu.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26830 * config/i386/lynx.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26831 * config/i386/netbsd-elf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26832 * config/i386/openbsdelf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26833 * config/i386/x86-64.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26834 (ASM_OUTPUT_MAX_SKIP_PAD): Likewise.
26836 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26838 * config/i386/i386-expand.c (ix86_expand_int_compare):
26839 Swap operands of GTU and LEU comparison to emit carry flag comparison.
26840 * config/i386/i386.md (*add<mode>3_carry_0): Change insn
26841 predicate to allow more combine opportunities with memory operands.
26842 (*sub<mode>3_carry_0): Ditto.
26844 2021-04-29 Richard Sandiford <richard.sandiford@arm.com>
26846 PR rtl-optimization/100303
26847 * rtl-ssa/accesses.cc (function_info::make_use_available): Take a
26848 boolean that indicates whether the use will only be used in
26849 debug instructions. Treat it in the same way that existing
26850 cross-EBB debug references would be handled if so.
26851 (function_info::make_uses_available): Likewise.
26852 * rtl-ssa/functions.h (function_info::make_uses_available): Update
26853 prototype accordingly.
26854 (function_info::make_uses_available): Likewise.
26855 * fwprop.c (try_fwprop_subst): Update call accordingly.
26857 2021-04-29 Jeff Law <jlaw@tachyum.com>
26859 * config/nios2/nios2-protos.h (nios2_fpu_insn_enabled): Move outside
26862 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26863 Richard Biener <rguenther@suse.de>
26866 * config/i386/i386-builtin.def (IX86_BUILTIN_MASKLOADPD)
26867 (IX86_BUILTIN_MASKLOADPS, IX86_BUILTIN_MASKLOADPD256)
26868 (IX86_BUILTIN_MASKLOADPS256, IX86_BUILTIN_MASKLOADD)
26869 (IX86_BUILTIN_MASKLOADQ, IX86_BUILTIN_MASKLOADD256)
26870 (IX86_BUILTIN_MASKLOADQ256): Move from SPECIAL_ARGS
26871 to PURE_ARGS category.
26872 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
26873 Handle PURE_ARGS category.
26874 * config/i386/i386-expand.c (ix86_expand_builtin): Ditto.
26876 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
26878 * configure.ac: Check for the presence of sys/locking.h header and
26879 for whether _LK_LOCK is supported by _locking.
26880 * configure: Regenerate.
26881 * config.in: Likewise.
26882 * gcov-io.h: Define GCOV_LOCKED_WITH_LOCKING if HOST_HAS_LK_LOCK.
26883 * gcov-io.c (gcov_open): Add support for GCOV_LOCKED_WITH_LOCKING.
26884 * system.h: Include <sys/locking.h> if HAVE_SYS_LOCKING_H.
26886 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26888 * config/i386/predicates.md (fcmov_comparison_operator):
26889 Do not check for trivial FP comparison operator.
26890 <case GEU, case LTU>: Allow CCGZmode.
26891 <case GTU, case LEU>: Do not allow CCCmode.
26892 (ix86_comparison_operator) <case GTU, case LEU>: Allow only CCmode.
26893 (ix86_carry_flag_operator): Match only LTU and UNLT code.
26894 Do not check for trivial FP comparison operator. Allow CCGZmode.
26896 2021-04-29 Tom de Vries <tdevries@suse.de>
26898 * omp-expand.c (expand_omp_simd): Add step_orig, and replace uses of
26899 fd->loop.step by either step or orig_step.
26901 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
26903 * config/sparc/sparc.c (gen_load_pcrel_sym): Delete.
26904 (load_got_register): Do the PIC dance here.
26905 (sparc_legitimize_tls_address): Simplify.
26906 (sparc_emit_probe_stack_range): Likewise.
26907 (sparc32_initialize_trampoline): Likewise.
26908 (sparc64_initialize_trampoline): Likewise.
26909 * config/sparc/sparc.md (load_pcrel_sym<P:mode>): Add @ marker.
26910 (probe_stack_range<P:mode>): Likewise.
26911 (flush<P:mode>): Likewise.
26912 (tgd_hi22<P:mode>): Likewise.
26913 (tgd_lo10<P:mode>): Likewise.
26914 (tgd_add<P:mode>): Likewise.
26915 (tgd_call<P:mode>): Likewise.
26916 (tldm_hi22<P:mode>): Likewise.
26917 (tldm_lo10<P:mode>): Likewise.
26918 (tldm_add<P:mode>): Likewise.
26919 (tldm_call<P:mode>): Likewise.
26920 (tldo_hix22<P:mode>): Likewise.
26921 (tldo_lox10<P:mode>): Likewise.
26922 (tldo_add<P:mode>): Likewise.
26923 (tie_hi22<P:mode>): Likewise.
26924 (tie_lo10<P:mode>): Likewise.
26925 (tie_add<P:mode>): Likewise.
26926 (tle_hix22<P:mode>): Likewise.
26927 (tle_lox10<P:mode>): Likewise.
26928 (stack_protect_setsi): Rename to...
26929 (stack_protect_set32): ...this.
26930 (stack_protect_setdi): Rename to...
26931 (stack_protect_set64): ...this.
26932 (stack_protect_set): Adjust calls to above.
26933 (stack_protect_testsi): Rename to...
26934 (stack_protect_test32): ...this.
26935 (stack_protect_testdi): Rename to...
26936 (stack_protect_test64): ...this.
26937 (stack_protect_test): Adjust calls to above.
26939 2021-04-29 H.J. Lu <hjl.tools@gmail.com>
26941 PR middle-end/90773
26942 * builtins.c (builtin_memcpy_read_str): Add a dummy argument.
26943 (builtin_strncpy_read_str): Likewise.
26944 (builtin_memset_read_str): Add an argument for the previous RTL
26945 information and generate the new RTL from the previous RTL info.
26946 (builtin_memset_gen_str): Likewise.
26947 * builtins.h (builtin_strncpy_read_str): Update the prototype.
26948 (builtin_memset_read_str): Likewise.
26949 * expr.c (by_pieces_ninsns): If targetm.overlap_op_by_pieces_p()
26950 returns true, round up size and alignment to the widest integer
26951 mode for maximum size.
26952 (pieces_addr::adjust): Add a pointer to by_pieces_prev argument
26953 and pass it to m_constfn.
26954 (op_by_pieces_d): Add m_push and m_overlap_op_by_pieces.
26955 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
26956 initialize m_push. Initialize m_overlap_op_by_pieces with
26957 targetm.overlap_op_by_pieces_p ().
26958 (op_by_pieces_d::run): Pass the previous RTL information to
26959 pieces_addr::adjust and generate overlapping operations if
26960 m_overlap_op_by_pieces is true.
26962 (move_by_pieces_d::move_by_pieces_d): Updated for op_by_pieces_d
26964 (store_by_pieces_d::store_by_pieces_d): Updated for op_by_pieces_d
26966 (can_store_by_pieces): Use by_pieces_constfn on constfun.
26967 (store_by_pieces): Use by_pieces_constfn on constfun. Updated
26968 for op_by_pieces_d change.
26969 (clear_by_pieces_1): Add a dummy argument.
26970 (clear_by_pieces): Updated for op_by_pieces_d change.
26971 (compare_by_pieces_d::compare_by_pieces_d): Likewise.
26972 (string_cst_read_str): Add a dummy argument.
26973 * expr.h (by_pieces_constfn): Add a dummy argument.
26974 (by_pieces_prev): New.
26975 * target.def (overlap_op_by_pieces_p): New target hook.
26976 * config/i386/i386.c (TARGET_OVERLAP_OP_BY_PIECES_P): New.
26977 * doc/tm.texi.in: Add TARGET_OVERLAP_OP_BY_PIECES_P.
26978 * doc/tm.texi: Regenerated.
26980 2021-04-29 Richard Biener <rguenther@suse.de>
26982 PR tree-optimization/100253
26983 * tree-vect-stmts.c (vectorizable_load): Do not assume
26984 element alignment when DR_MISALIGNMENT is -1.
26985 (vectorizable_store): Likewise.
26987 2021-04-29 Jakub Jelinek <jakub@redhat.com>
26990 * config/aarch64/aarch64.c (aarch64_add_offset_1_temporaries): Use
26991 absu_hwi instead of abs_hwi.
26993 2021-04-29 Richard Biener <rguenther@suse.de>
26995 PR middle-end/38474
26996 * tree-ssa-structalias.c (add_graph_edge): Avoid direct
26997 forwarding when indirect forwarding through ESCAPED
27000 2021-04-29 Tom de Vries <tdevries@suse.de>
27003 * internal-fn.c (expand_GOMP_SIMT_ENTER_ALLOC)
27004 (expand_GOMP_SIMT_LAST_LANE, expand_GOMP_SIMT_ORDERED_PRED)
27005 (expand_GOMP_SIMT_VOTE_ANY, expand_GOMP_SIMT_XCHG_BFLY)
27006 (expand_GOMP_SIMT_XCHG_IDX): Ensure target is assigned to.
27008 2021-04-29 Richard Biener <rguenther@suse.de>
27010 PR tree-optimization/99912
27011 * tree-ssa-dse.c (dse_dom_walker::m_need_cfg_cleanup): New.
27012 (dse_dom_walker::todo): Likewise.
27013 (dse_dom_walker::dse_optimize_stmt): Move VDEF check to the
27015 (dse_dom_walker::before_dom_children): Remove trivially
27016 dead SSA defs and schedule CFG cleanup if we removed all
27018 (pass_dse::execute): Get TODO as computed by the DOM walker
27019 and return it. Wipe dominator info earlier.
27021 2021-04-29 Richard Biener <rguenther@suse.de>
27024 * ipa-prop.c (ipcp_modif_dom_walker::before_dom_children):
27025 Track blocks to cleanup EH in new m_need_eh_cleanup.
27026 (ipcp_modif_dom_walker::cleanup_eh): New.
27027 (ipcp_transform_function): Release dominator info before
27030 2021-04-29 Martin Sebor <msebor@redhat.com>
27032 PR middle-end/100250
27033 * attribs.c (attr_access::array_as_string): Avoid dereferencing
27034 a pointer when it's null.
27036 2021-04-29 Martin Sebor <msebor@redhat.com>
27038 * Makefile.in (OBJS): Add ipa-free-lang-data.o.
27039 * ipa-free-lang-data.cc: New file.
27040 * tree.c: Move pass free_lang_data to file above.
27041 (build_array_type_1): Declare extern.
27042 * tree.h (build_array_type_1): Declare.
27044 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27046 * config/aarch64/aarch64-simd-builtins.def: Modify comment to
27047 make consistent with updated RTL pattern.
27048 * config/aarch64/aarch64-simd.md (aarch64_<sur>qmovn<mode>):
27049 Implement using ss_truncate and us_truncate rather than
27051 * config/aarch64/iterators.md: Remove redundant unspecs and
27052 iterator: UNSPEC_[SU]QXTN and SUQMOVN respectively.
27054 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27056 * config/aarch64/arm_acle.h (__attribute__): Make intrinsic
27057 attributes consistent with those defined in arm_neon.h.
27059 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27061 * config/aarch64/arm_fp16.h (__attribute__): Make intrinsic
27062 attributes consistent with those defined in arm_neon.h.
27064 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27066 * config/aarch64/aarch64-simd-builtins.def: Add
27067 float_trunc_rodd builtin generator macros.
27068 * config/aarch64/aarch64-simd.md (aarch64_float_trunc_rodd_df):
27070 (aarch64_float_trunc_rodd_lo_v2sf): Define.
27071 (aarch64_float_trunc_rodd_hi_v4sf_le): Define.
27072 (aarch64_float_trunc_rodd_hi_v4sf_be): Define.
27073 (aarch64_float_trunc_rodd_hi_v4sf): Define.
27074 * config/aarch64/arm_neon.h (vcvtx_f32_f64): Use RTL builtin
27075 instead of inline asm.
27076 (vcvtx_high_f32_f64): Likewise.
27077 (vcvtxd_f32_f64): Likewise.
27078 * config/aarch64/iterators.md: Add FCVTXN unspec.
27080 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27082 * config/aarch64/aarch64-simd-builtins.def: Add tbx1 builtin
27084 * config/aarch64/aarch64-simd.md (aarch64_tbx1<mode>):
27086 * config/aarch64/arm_neon.h (vqtbx1_s8): USE RTL builtin
27087 instead of inline asm.
27088 (vqtbx1_u8): Likewise.
27089 (vqtbx1_p8): Likewise.
27090 (vqtbx1q_s8): Likewise.
27091 (vqtbx1q_u8): Likewise.
27092 (vqtbx1q_p8): Likewise.
27093 (vtbx2_s8): Likewise.
27094 (vtbx2_u8): Likewise.
27095 (vtbx2_p8): Likewise.
27097 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27099 * config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
27101 * config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
27102 instead of inline asm.
27103 (vqtbl1_s8): Likewise.
27104 (vqtbl1_u8): Likewise.
27105 (vqtbl1q_p8): Likewise.
27106 (vqtbl1q_s8): Likewise.
27107 (vqtbl1q_u8): Likewise.
27108 (vtbl1_s8): Likewise.
27109 (vtbl1_u8): Likewise.
27110 (vtbl1_p8): Likewise.
27111 (vtbl2_s8): Likewise.
27112 (vtbl2_u8): Likewise.
27113 (vtbl2_p8): Likewise.
27115 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27117 * config/aarch64/aarch64-simd-builtins.def: Add polynomial
27118 ssri_n buitin generator macro.
27119 * config/aarch64/arm_neon.h (vsri_n_p8): Use RTL builtin
27120 instead of inline asm.
27121 (vsri_n_p16): Likewise.
27122 (vsri_n_p64): Likewise.
27123 (vsriq_n_p8): Likewise.
27124 (vsriq_n_p16): Likewise.
27125 (vsriq_n_p64): Likewise.
27127 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27129 * config/aarch64/aarch64-simd-builtins.def: Use VALLP mode
27130 iterator for polynomial ssli_n builtin generator macro.
27131 * config/aarch64/arm_neon.h (vsli_n_p8): Use RTL builtin
27132 instead of inline asm.
27133 (vsli_n_p16): Likewise.
27134 (vsliq_n_p8): Likewise.
27135 (vsliq_n_p16): Likewise.
27136 * config/aarch64/iterators.md: Define VALLP mode iterator.
27138 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27140 * config/aarch64/aarch64-simd-builtins.def: Use VDQV_L
27141 iterator to generate [su]adalp RTL builtins.
27142 * config/aarch64/aarch64-simd.md: Use VDQV_L iterator in
27143 [su]adalp RTL pattern.
27144 * config/aarch64/arm_neon.h (vpadal_s32): Use RTL builtin
27145 instead of inline asm.
27146 (vpadal_u32): Likewise.
27148 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27150 * config/aarch64/aarch64-simd-builtins.def: Add [su]addlp
27151 builtin generator macros.
27152 * config/aarch64/aarch64-simd.md (aarch64_<su>addlp<mode>):
27154 * config/aarch64/arm_neon.h (vpaddl_s8): Use RTL builtin
27155 instead of inline asm.
27156 (vpaddl_s16): Likewise.
27157 (vpaddl_s32): Likewise.
27158 (vpaddl_u8): Likewise.
27159 (vpaddl_u16): Likewise.
27160 (vpaddl_u32): Likewise.
27161 (vpaddlq_s8): Likewise.
27162 (vpaddlq_s16): Likewise.
27163 (vpaddlq_s32): Likewise.
27164 (vpaddlq_u8): Likewise.
27165 (vpaddlq_u16): Likewise.
27166 (vpaddlq_u32): Liwewise.
27167 * config/aarch64/iterators.md: Define [SU]ADDLP unspecs with
27168 appropriate attributes.
27170 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27172 * config/aarch64/aarch64-simd-builtins.def: Use VDQ_I iterator
27173 for aarch64_addp<mode> builtin macro generator.
27174 * config/aarch64/aarch64-simd.md: Use VDQ_I iterator in
27175 aarch64_addp<mode> RTL pattern.
27176 * config/aarch64/arm_neon.h (vpaddq_s8): Use RTL builtin
27177 instead of inline asm.
27178 (vpaddq_s16): Likewise.
27179 (vpaddq_s32): Likewise.
27180 (vpaddq_s64): Likewise.
27181 (vpaddq_u8): Likewise.
27182 (vpaddq_u16): Likewise.
27183 (vpaddq_u32): Likewise.
27184 (vpaddq_u64): Likewise.
27186 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
27188 * config/aarch64/aarch64-simd-builtins.def: Add sq[r]dmulh_n
27189 builtin generator macros.
27190 * config/aarch64/aarch64-simd.md (aarch64_sq<r>dmulh_n<mode>):
27192 * config/aarch64/arm_neon.h (vqdmulh_n_s16): Use RTL builtin
27193 instead of inline asm.
27194 (vqdmulh_n_s32): Likewise.
27195 (vqdmulhq_n_s16): Likewise.
27196 (vqdmulhq_n_s32): Likewise.
27197 (vqrdmulh_n_s16): Likewise.
27198 (vqrdmulh_n_s32): Likewise.
27199 (vqrdmulhq_n_s16): Likewise.
27200 (vqrdmulhq_n_s32): Likewise.
27202 2021-04-28 Tobias Burnus <tobias@codesourcery.com>
27204 * doc/install.texi (--enable-offload-defaulted): Document.
27206 2021-04-28 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
27208 * config/avr/avr-dimode.md: Turn existing patterns into
27209 define_insn_and_split style patterns where the splitter
27210 adds a clobber of the condition code register. Drop "cc"
27211 attribute. Add new patterns to match output of
27213 * config/avr/avr-fixed.md: Likewise.
27214 * config/avr/avr.c (cc_reg_rtx): New.
27215 (avr_parallel_insn_from_insns): Adjust insn count
27216 for removal of set of cc0.
27217 (avr_is_casesi_sequence): Likewise.
27218 (avr_casei_sequence_check_operands): Likewise.
27219 (avr_optimize_casesi): Likewise. Also insert
27220 new insns after jump_insn.
27221 (avr_pass_casesi::avr_rest_of_handle_casesi): Adjust
27222 for removal of set of cc0.
27223 (avr_init_expanders): Initialize cc_reg_rtx.
27224 (avr_regno_reg_class): Handle REG_CC.
27225 (cond_string): Remove usage of CC_OVERFLOW_UNUSABLE.
27226 (avr_notice_update_cc): Remove function.
27227 (ret_cond_branch): Remove usage of CC_OVERFLOW_UNUSABLE.
27228 (compare_condition): Adjust for PARALLEL with
27230 (out_shift_with_cnt): Likewise.
27231 (ashlhi3_out): Likewise.
27232 (ashrhi3_out): Likewise.
27233 (lshrhi3_out): Likewise.
27234 (avr_class_max_nregs): Return single reg for REG_CC.
27235 (avr_compare_pattern): Check for REG_CC instead
27237 (avr_reorg_remove_redundant_compare): Likewise.
27238 (avr_reorg):Adjust for PARALLEL with REG_CC clobber.
27239 (avr_hard_regno_nregs): Return single reg for REG_CC.
27240 (avr_hard_regno_mode_ok): Allow only CCmode for REG_CC.
27241 (avr_md_asm_adjust): Clobber REG_CC.
27242 (TARGET_HARD_REGNO_NREGS): Define.
27243 (TARGET_CLASS_MAX_NREGS): Define.
27244 (TARGET_MD_ASM_ADJUST): Define.
27245 * config/avr/avr.h (FIRST_PSEUDO_REGISTER): Adjust
27247 (enum reg_class): Add CC_REG class.
27248 (NOTICE_UPDATE_CC): Remove.
27249 (CC_OVERFLOW_UNUSABLE): Remove.
27250 (CC_NO_CARRY): Remove.
27251 * config/avr/avr.md: Turn existing patterns into
27252 define_insn_and_split style patterns where the splitter
27253 adds a clobber of the condition code register. Drop "cc"
27254 attribute. Add new patterns to match output of
27256 (sez): Remove unused pattern.
27258 2021-04-28 Richard Earnshaw <rearnsha@arm.com>
27261 * config/arm/arm.c (arm_hard_regno_mode_ok): Only allow VPR to be
27264 2021-04-28 Richard Sandiford <richard.sandiford@arm.com>
27267 * config/aarch64/constraints.md (Utq): Require the address to
27268 be valid for both the element mode and for V2DImode.
27270 2021-04-28 Jakub Jelinek <jakub@redhat.com>
27271 Tobias Burnus <tobias@codesourcery.com>
27273 * configure.ac (OFFLOAD_DEFAULTED): AC_DEFINE if offload-defaulted.
27274 * gcc.c (process_command): New variable.
27275 (driver::maybe_putenv_OFFLOAD_TARGETS): If OFFLOAD_DEFAULTED,
27276 set it if -foffload is defaulted.
27277 * lto-wrapper.c (OFFLOAD_TARGET_DEFAULT_ENV): Define.
27278 (compile_offload_image): If OFFLOAD_DEFAULTED and
27279 OFFLOAD_TARGET_DEFAULT is in the environment, don't fail
27280 if corresponding mkoffload can't be found.
27281 (compile_images_for_offload_targets): Likewise. Free and clear
27282 offload_names if no valid offload is found.
27283 * config.in: Regenerate.
27284 * configure: Regenerate.
27286 2021-04-28 Richard Biener <rguenther@suse.de>
27288 PR tree-optimization/100292
27289 * tree-vect-generic.c (expand_vector_condition): Do not fold
27292 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
27294 * config/rs6000/aix.h (SUBTARGET_DRIVER_SELF_SPECS): New.
27295 * config/rs6000/aix64.opt (m64): New.
27298 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
27300 * config/vax/vax.c (print_operand_address, vax_address_cost_1)
27301 (index_term_p): Handle ASHIFT too.
27303 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
27305 * config/vax/builtins.md (jbb<ccss>i<mode>): Remove operand #3.
27306 (sync_lock_test_and_set<mode>): Adjust accordingly.
27307 (sync_lock_release<mode>): Likewise.
27309 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
27311 * config/vax/vax-protos.h (adjacent_operands_p): Remove
27313 * config/vax/vax.c (adjacent_operands_p): Remove.
27315 2021-04-27 Maciej W. Rozycki <macro@linux-mips.org>
27317 * ifcvt.c (dead_or_predicable) [!IFCVT_MODIFY_TESTS]: Fall
27318 through to the non-conditional execution case if getting the
27319 condition for conditional execution has failed.
27321 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
27323 PR middle-end/100284
27324 * gimple.c (gimple_could_trap_p_1): Remove VEC_COND_EXPR test.
27325 * tree-eh.c (operation_could_trap_p): Handle VEC_COND_EXPR rather
27326 than asserting on it.
27328 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
27330 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): Protect
27331 with TARGET_AIX_OS.
27333 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
27336 * calls.c (precompute_register_parameters): Additionally test
27337 targetm.precompute_tls_p to pre-compute argument.
27338 * config/rs6000/aix.h (TARGET_PRECOMPUTE_TLS_P): Define.
27339 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): New.
27340 * target.def (precompute_tls_p): New.
27341 * doc/tm.texi.in (TARGET_PRECOMPUTE_TLS_P): Add hook documentation.
27342 * doc/tm.texi: Regenerated.
27344 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27347 * config/aarch64/aarch64.c (aarch64_print_operand): Cast -UINTVAL
27348 back to HOST_WIDE_INT.
27350 2021-04-27 Bernd Edlinger <bernd.edlinger@hotmail.de>
27353 * simplify-rtx.c (simplify_context::simplify_subreg): Check the
27354 memory alignment for the outer mode.
27356 2021-04-27 H.J. Lu <hjl.tools@gmail.com>
27358 PR middle-end/90773
27359 * expr.c (op_by_pieces_d::get_usable_mode): New member function.
27360 (op_by_pieces_d::run): Cange a while loop to a do-while loop.
27362 2021-04-27 Alex Coplan <alex.coplan@arm.com>
27365 * config/arm/arm.c (arm_split_compare_and_swap): Fix up codegen
27366 with negative immediates: ensure we expand cbranchsi4_scratch
27367 correctly and ensure we satisfy its constraints.
27368 * config/arm/sync.md
27369 (@atomic_compare_and_swap<CCSI:arch><NARROW:mode>_1): Don't
27370 attempt to tie two output operands together with constraints;
27371 collapse two alternatives.
27372 (@atomic_compare_and_swap<CCSI:arch><SIDI:mode>_1): Likewise.
27373 * config/arm/thumb1.md (cbranchsi4_neg_late): New.
27375 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27378 * config/aarch64/predicates.md (aarch64_sub_immediate,
27379 aarch64_plus_immediate): Use -UINTVAL instead of -INTVAL.
27380 * config/aarch64/aarch64.md (casesi, rotl<mode>3): Likewise.
27381 * config/aarch64/aarch64.c (aarch64_print_operand,
27382 aarch64_split_atomic_op, aarch64_expand_subvti): Likewise.
27384 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27386 PR tree-optimization/100239
27387 * tree-vect-generic.c (lower_vec_perm): Don't accept constant
27388 permutations with all indices from the first zero element as vec_shl.
27390 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27392 PR rtl-optimization/100254
27393 * cfgcleanup.c (outgoing_edges_match): Check REG_EH_REGION on
27394 last1 and last2 insns rather than BB_END (bb1) and BB_END (bb2) insns.
27396 2021-04-27 Richard Biener <rguenther@suse.de>
27398 PR tree-optimization/99912
27399 * passes.def: Add comment about new TODO_remove_unused_locals.
27400 * tree-stdarg.c (pass_data_stdarg): Run TODO_remove_unused_locals
27403 2021-04-27 Richard Biener <rguenther@suse.de>
27405 PR tree-optimization/99912
27406 * passes.def (pass_all_optimizations): Add pass_dse before
27407 the first pass_dce, move the first pass_dse before the
27408 pass_dce following pass_pre.
27410 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27412 PR tree-optimization/95527
27413 * generic-match-head.c: Include tm.h.
27414 * gimple-match-head.c: Include tm.h.
27415 * match.pd (CLZ == INTEGER_CST): Don't use
27416 #ifdef CLZ_DEFINED_VALUE_AT_ZERO, only test CLZ_DEFINED_VALUE_AT_ZERO
27417 if clz == CFN_CLZ. Add missing val declaration.
27418 (CTZ cmp CST): New simplifications.
27420 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27422 PR tree-optimization/96696
27423 * expr.c (expand_expr_divmod): New function.
27424 (expand_expr_real_2) <case TRUNC_DIV_EXPR>: Use it for truncations and
27425 divisions. Formatting fixes.
27426 <case MULT_EXPR>: Optimize x / y * y as x - x % y if the latter is
27429 2021-04-27 Martin Jambor <mjambor@suse.cz>
27432 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
27433 If removing a call statement LHS SSA name, release it.
27435 2021-04-27 Richard Earnshaw <rearnsha@arm.com>
27438 * config/arm/arm.c (THUMB2_WORK_REGS): Check PIC_OFFSET_TABLE_REGNUM
27439 is valid before including it in the mask.
27441 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
27444 * config/aarch64/aarch64.c (aarch64_comp_type_attributes): Handle
27447 2021-04-27 Richard Biener <rguenther@suse.de>
27449 PR tree-optimization/100051
27450 * tree-ssa-alias.c (indirect_ref_may_alias_decl_p): Add
27451 disambiguator based on access size vs. decl size.
27453 2021-04-27 Richard Biener <rguenther@suse.de>
27455 PR tree-optimization/100278
27456 * tree-ssa-pre.c (compute_avail): Give up when we cannot
27457 adjust TBAA beacuse of mismatching bases.
27459 2021-04-27 Jakub Jelinek <jakub@redhat.com>
27462 * config/i386/i386.md (*<insn><mode>3_mask, *<insn><mode>3_mask_1):
27463 For any_rotate define_insn_split and following splitters, use
27464 SWI iterator instead of SWI48.
27466 2021-04-27 Richard Biener <rguenther@suse.de>
27468 PR tree-optimization/99776
27469 * match.pd (bit_field_ref (ctor)): Relax element extract
27470 type compatibility checks.
27472 2021-04-27 Cui,Lili <lili.cui@intel.com>
27474 * common/config/i386/i386-common.c (processor_names):
27475 Sync processor_names with processor_type.
27476 * config/i386/i386-options.c (processor_cost_table):
27477 Sync processor_cost_table with processor_type.
27479 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27481 * value-range.cc (irange::irange_set_1bit_anti_range): Add assert.
27482 (irange::set): Call irange_set_1bit_anti_range for handling all
27483 1-bit ranges. Fall through on ~[MIN,MAX].
27485 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27487 * value-range.cc (irange::legacy_num_pairs): Remove.
27488 (irange::invert): Change gcc_assert to gcc_checking_assert.
27489 * value-range.h (irange::num_pairs): Adjust for a cached
27490 num_pairs(). Also, rename all gcc_assert's to
27491 gcc_checking_assert's.
27493 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27495 * value-range.cc (irange::operator=): Set m_kind.
27496 (irange::copy_to_legacy): Handle varying and undefined sources
27497 as a legacy copy since they can be easily copied.
27498 (irange::irange_set): Set m_kind.
27499 (irange::irange_set_anti_range): Same.
27500 (irange::set): Rename normalize_min_max to normalize_kind.
27501 (irange::verify_range): Adjust for multi-ranges having the
27503 (irange::irange_union): Set m_kind.
27504 (irange::irange_intersect): Same.
27505 (irange::invert): Same.
27506 * value-range.h (irange::kind): Always return m_kind.
27507 (irange::varying_p): Rename to...
27508 (irange::varying_comptaible_p): ...this.
27509 (irange::undefined_p): Only look at m_kind.
27510 (irange::irange): Always set VR_UNDEFINED if applicable.
27511 (irange::set_undefined): Always set VR_UNDEFINED.
27512 (irange::set_varying): Always set m_kind to VR_VARYING.
27513 (irange::normalize_min_max): Rename to...
27514 (irange::normalize_kind): ...this.
27516 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27518 * gimple-ssa-evrp-analyze.c (evrp_range_analyzer::set_ssa_range_info):
27519 Adjust for constant_p including varying_p.
27520 * tree-vrp.c (vrp_prop::finalize): Same.
27521 (determine_value_range): Same.
27522 * vr-values.c (vr_values::range_of_expr): Same.
27523 * value-range.cc (irange::symbolic_p): Do not check varying_p.
27524 (irange::constant_p): Same.
27526 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27528 * value-range.cc (irange::legacy_lower_bound): Replace
27529 !undefined_p check with num_ranges > 0.
27530 (irange::legacy_upper_bound): Same.
27531 * value-range.h (irange::type): Same.
27532 (irange::lower_bound): Same.
27533 (irange::upper_bound): Same.
27535 2021-04-26 Richard Biener <rguenther@suse.de>
27537 PR tree-optimization/99956
27538 * gimple-loop-interchange.cc (compute_access_stride):
27539 Try instantiating the access in a shallower loop nest
27540 if instantiating failed.
27541 (compute_access_strides): Pass adjustable loop_nest
27542 to compute_access_stride.
27544 2021-04-26 Christophe Lyon <christophe.lyon@linaro.org>
27546 * doc/sourcebuild.texi (arm_cmse_hw): Document.
27548 2021-04-26 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27550 * config/aarch64/iterators.md (vwcore): Handle V4BF, V8BF.
27552 2021-04-26 Thomas Schwinge <thomas@codesourcery.com>
27553 Nathan Sidwell <nathan@codesourcery.com>
27554 Tom de Vries <vries@codesourcery.com>
27555 Julian Brown <julian@codesourcery.com>
27556 Kwok Cheung Yeung <kcy@codesourcery.com>
27558 * omp-offload.c (oacc_validate_dims): Implement
27559 '-Wopenacc-parallelism'.
27560 * doc/invoke.texi (-Wopenacc-parallelism): Document.
27562 2021-04-26 Richard Biener <rguenther@suse.de>
27564 * tree-cfg.h (gimplify_build1): Remove.
27565 (gimplify_build2): Likewise.
27566 (gimplify_build3): Likewise.
27567 * tree-cfg.c (gimplify_build1): Move to tree-vect-generic.c.
27568 (gimplify_build2): Likewise.
27569 (gimplify_build3): Likewise.
27570 * tree-vect-generic.c (gimplify_build1): Move from tree-cfg.c.
27572 (gimplify_build2): Likewise.
27573 (gimplify_build3): Likewise.
27574 (tree_vec_extract): Use resimplify with following SSA edges.
27575 (expand_vector_parallel): Avoid passing NULL size/bitpos
27576 to tree_vec_extract.
27577 * expr.c (store_constructor): Deal with zero-element CTORs.
27578 * match.pd (bit_field_ref <vector CTOR>): Make sure to
27579 produce vector constants when possible.
27581 2021-04-26 Richard Biener <rguenther@suse.de>
27583 * tree-complex.c: Include gimple-fold.h.
27584 (expand_complex_addition): Use gimple_build.
27585 (expand_complex_multiplication_components): Likewise.
27586 (expand_complex_multiplication): Likewise.
27587 (expand_complex_div_straight): Likewise.
27588 (expand_complex_div_wide): Likewise.
27589 (expand_complex_division): Likewise.
27590 (expand_complex_conjugate): Likewise.
27591 (expand_complex_comparison): Likewise.
27593 2021-04-26 Richard Biener <rguenther@suse.de>
27595 * tree-ssa-phiopt.c (two_value_replacement): Remove use
27596 of legacy gimplify_buildN API.
27598 2021-04-26 Richard Biener <rguenther@suse.de>
27600 PR tree-optimization/99473
27601 * tree-ssa-phiopt.c (cond_store_replacement): Handle all
27604 2021-04-26 Richard Biener <rguenther@suse.de>
27606 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin):
27607 Use replace_call_with_value.
27609 2021-04-26 Richard Biener <rguenther@suse.de>
27611 * tree-ssa-propagate.h (valid_gimple_rhs_p): Remove.
27612 (update_gimple_call): Likewise.
27613 (update_call_from_tree): Likewise.
27614 * tree-ssa-propagate.c (valid_gimple_rhs_p): Remove.
27615 (valid_gimple_call_p): Likewise.
27616 (move_ssa_defining_stmt_for_defs): Likewise.
27617 (finish_update_gimple_call): Likewise.
27618 (update_gimple_call): Likewise.
27619 (update_call_from_tree): Likewise.
27620 (propagate_tree_value_into_stmt): Use replace_call_with_value.
27621 * gimple-fold.h (update_gimple_call): Declare.
27622 * gimple-fold.c (valid_gimple_rhs_p): Move here from
27623 tree-ssa-propagate.c.
27624 (update_gimple_call): Likewise.
27625 (valid_gimple_call_p): Likewise.
27626 (finish_update_gimple_call): Likewise, and simplify.
27627 (gimplify_and_update_call_from_tree): Implement
27628 update_call_from_tree functionality, avoid excessive
27629 push/pop_gimplify_context.
27630 (gimple_fold_builtin): Use only gimplify_and_update_call_from_tree.
27631 (gimple_fold_call): Likewise.
27632 * gimple-ssa-sprintf.c (try_substitute_return_value): Likewise.
27633 * tree-ssa-ccp.c (ccp_folder::fold_stmt): Likewise.
27634 (pass_fold_builtins::execute): Likewise.
27635 (optimize_stack_restore): Use replace_call_with_value.
27636 * tree-cfg.c (fold_loop_internal_call): Likewise.
27637 * tree-ssa-dce.c (maybe_optimize_arith_overflow): Use
27638 only gimplify_and_update_call_from_tree.
27639 * tree-ssa-strlen.c (handle_builtin_strlen): Likewise.
27640 (handle_builtin_strchr): Likewise.
27641 * tsan.c: Include gimple-fold.h instead of tree-ssa-propagate.h.
27643 2021-04-26 Jakub Jelinek <jakub@redhat.com>
27646 * vmsdbgout.c (ASM_OUTPUT_DEBUG_STRING, vmsdbgout_begin_block,
27647 vmsdbgout_end_block, lookup_filename, vmsdbgout_source_line): Remove
27650 2021-04-25 liuhongt <hongtao.liu@intel.com>
27653 * config/i386/i386-builtin.def (BDESC): Change the icode of
27654 the following builtins to CODE_FOR_nothing.
27655 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
27656 IX86_BUILTIN_PCMPEQB128, IX86_BUILTIN_PCMPEQW128,
27657 IX86_BUILTIN_PCMPEQD128, IX86_BUILTIN_PCMPEQQ,
27658 IX86_BUILTIN_PCMPEQB256, IX86_BUILTIN_PCMPEQW256,
27659 IX86_BUILTIN_PCMPEQD256, IX86_BUILTIN_PCMPEQQ256,
27660 IX86_BUILTIN_PCMPGTB128, IX86_BUILTIN_PCMPGTW128,
27661 IX86_BUILTIN_PCMPGTD128, IX86_BUILTIN_PCMPGTQ,
27662 IX86_BUILTIN_PCMPGTB256, IX86_BUILTIN_PCMPGTW256,
27663 IX86_BUILTIN_PCMPGTD256, IX86_BUILTIN_PCMPGTQ256.
27664 * config/i386/sse.md (avx2_eq<mode>3): Deleted.
27665 (sse2_eq<mode>3): Ditto.
27666 (sse4_1_eqv2di3): Ditto.
27667 (sse2_gt<mode>3): Rename to ..
27668 (*sse2_gt<mode>3): .. this.
27670 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27673 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27676 * config/darwin.c (darwin_binds_local_p): Assume that any
27677 public symbol might be interposed for PIC code. Update function
27678 header comment to reflect current Darwin capability.
27680 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27683 * config/darwin.c (darwin_binds_local_p): Assume that any
27684 public symbol might be interposed for PIC code. Update function
27685 header comment to reflect current Darwin capability.
27687 2021-04-24 Richard Sandiford <richard.sandiford@arm.com>
27689 * doc/sourcebuild.texi: Document no-opts and any-opts target
27692 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
27694 * config/bpf/bpf.h (ASM_OUTPUT_ALIGNED_BSS): Use .type and .lcomm.
27696 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
27698 * config/bpf/bpf.h (FUNCTION_BOUNDARY): Set to 64.
27700 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
27703 * config/i386/i386-options.c (ix86_option_override_internal):
27704 Error out when -m96bit-long-double is used with 64bit targets.
27705 * config/i386/i386.md (*pushxf_rounded): Remove pattern.
27707 2021-04-23 Martin Liska <mliska@suse.cz>
27709 * lto-wrapper.c: Remove FIXME about usage of
27710 hardware_concurrency. The function is not on par with
27713 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
27716 * config/i386/sync.md (FILD_ATOMIC/FIST_ATOMIC FP load peephole2):
27717 Copy operand 3 to operand 4. Use sse_reg_operand
27718 as operand 3 predicate.
27719 (FILD_ATOMIC/FIST_ATOMIC FP load peephole2 with mem blockage): Ditto.
27720 (LDX_ATOMIC/STX_ATOMIC FP load peephole2): Ditto.
27721 (LDX_ATOMIC/LDX_ATOMIC FP load peephole2 with mem blockage): Ditto.
27722 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2):
27723 Copy operand 1 to operand 0.
27724 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2 with mem blockage): Ditto.
27725 (LDX_ATOMIC/STX_ATOMIC FP store peephole2): Ditto.
27726 (LDX_ATOMIC/LDX_ATOMIC FP store peephole2 with mem blockage): Ditto.
27728 2021-04-23 Alex Coplan <alex.coplan@arm.com>
27730 PR rtl-optimization/100230
27731 * early-remat.c (early_remat::sort_candidates): Use delete[]
27732 instead of delete for array allocated with new[].
27734 2021-04-23 Richard Biener <rguenther@suse.de>
27736 * genmatch.c (lower_cond): Remove VEC_COND_EXPR special-casing.
27737 (capture_info::capture_info): Likewise.
27738 (capture_info::walk_match): Likewise.
27739 (expr::gen_transform): Likewise.
27740 (dt_simplify::gen_1): Likewise.
27741 * gimple-match-head.c (maybe_resimplify_conditional_op):
27742 Remove VEC_COND_EXPR special-casing.
27743 (gimple_simplify): Likewise.
27744 * gimple.c (gimple_could_trap_p_1): Adjust.
27745 * tree-ssa-pre.c (compute_avail): Allow VEC_COND_EXPR
27746 to participate in PRE.
27748 2021-04-23 Richard Biener <rguenther@suse.de>
27750 * cfganal.c (connect_infinite_loops_to_exit): First call
27751 add_noreturn_fake_exit_edges.
27752 * ipa-sra.c (process_scan_results): Do not call the now redundant
27753 add_noreturn_fake_exit_edges.
27754 * predict.c (tree_estimate_probability): Likewise.
27755 (rebuild_frequencies): Likewise.
27756 * store-motion.c (one_store_motion_pass): Likewise.
27758 2021-04-23 Richard Biener <rguenther@suse.de>
27760 PR tree-optimization/100222
27761 * predict.c (pass_profile::execute): Remove redundant call to
27762 mark_irreducible_loops.
27763 (report_predictor_hitrates): Likewise.
27765 2021-04-23 Richard Biener <rguenther@suse.de>
27767 * tree-ssa-loop-ivopts.c (rewrite_use_nonlinear_expr): Avoid
27768 valid_gimple_rhs_p by instead gimplifying to one.
27770 2021-04-23 Richard Biener <rguenther@suse.de>
27772 PR tree-optimization/99971
27773 * tree-vect-data-refs.c (vect_slp_analyze_node_dependences):
27774 Always use TBAA for loads.
27776 2021-04-23 liuhongt <hongtao.liu@intel.com>
27779 * config/i386/i386-options.c (ix86_option_override_internal):
27780 Clear MASK_AVX256_SPLIT_UNALIGNED_LOAD/STORE in x_target_flags
27781 when X86_TUNE_AVX256_UNALIGNED_LOAD/STORE_OPTIMAL is enabled
27782 by target attribute.
27784 2021-04-23 David Edelsohn <dje.gcc@gmail.com>
27786 * config/rs6000/aix71.h (PREFERRED_DEBUGGING_TYPE): Change to
27788 * config/rs6000/aix72.h (PREFERRED_DEBUGGING_TYPE): Same.
27790 2021-04-22 David Edelsohn <dje.gcc@gmail.com>
27792 * config.gcc (powerpc-ibm-aix6.*): Remove.
27793 * config/rs6000/aix61.h: Delete.
27795 2021-04-22 Martin Liska <mliska@suse.cz>
27797 PR testsuite/100159
27798 PR testsuite/100192
27799 * builtins.c (expand_builtin): Fix typos and missing comments.
27800 * dwarf2out.c (gen_subprogram_die): Likewise.
27801 (gen_struct_or_union_type_die): Likewise.
27803 2021-04-22 Uroš Bizjak <ubizjak@gmail.com>
27806 * config/i386/i386-expand.c (ix86_expand_convert_uns_sidf_sse):
27807 Remove the sign with FE_DOWNWARD, where x - x = -0.0.
27809 2021-04-21 Iain Sandoe <iain@sandoe.co.uk>
27811 * config/i386/darwin.h (TARGET_64BIT): Remove definition
27812 based on TARGET_ISA_64BIT.
27813 (TARGET_64BIT_P): Remove definition based on
27814 TARGET_ISA_64BIT_P().
27816 2021-04-21 Martin Liska <mliska@suse.cz>
27819 2021-04-21 Martin Liska <mliska@suse.cz>
27821 * lto-wrapper.c (cpuset_popcount): Remove.
27822 (init_num_threads): Remove and use hardware_concurrency.
27824 2021-04-21 Martin Liska <mliska@suse.cz>
27827 * main.c (main): Call toplev::finalize in CHECKING_P mode.
27828 * ipa-modref.c (ipa_modref_c_finalize): summaries are NULL
27829 when incremental LTO linking happens.
27831 2021-04-21 Martin Liska <mliska@suse.cz>
27833 * lto-wrapper.c (run_gcc): When -flto=jobserver is used, but the
27834 makeserver cannot be detected, then use -flto=N fallback.
27836 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
27838 * acinclude.m4 (gcc_AC_INITFINI_ARRAY): When cross-compiling,
27839 default to yes for aarch64-linux-gnu.
27840 * configure: Regenerate.
27842 2021-04-21 Martin Liska <mliska@suse.cz>
27844 * lto-wrapper.c (cpuset_popcount): Remove.
27845 (init_num_threads): Remove and use hardware_concurrency.
27847 2021-04-21 Martin Liska <mliska@suse.cz>
27849 * config/i386/i386.c: Remove superfluous || TARGET_MACHO
27850 which remains to be '(... || 0)' and clang complains about it.
27851 * dwarf2out.c (AT_vms_delta): Declare conditionally.
27852 (add_AT_vms_delta): Likewise.
27853 * tree.c (fld_simplified_type): Use rather more common pattern
27854 for disabling of something (#if 0).
27855 (get_tree_code_name): Likewise.
27856 (verify_type_variant): Likewise.
27858 2021-04-21 Martin Liska <mliska@suse.cz>
27860 * config/i386/i386-expand.c (decide_alignment): Use newly named
27861 macro TARGET_CPU_P.
27862 * config/i386/i386.c (ix86_decompose_address): Likewise.
27863 (ix86_address_cost): Likewise.
27864 (ix86_lea_outperforms): Likewise.
27865 (ix86_avoid_lea_for_addr): Likewise.
27866 (ix86_add_stmt_cost): Likewise.
27867 * config/i386/i386.h (TARGET_*): Remove.
27868 (TARGET_CPU_P): New macro.
27869 * config/i386/i386.md: Use newly named macro TARGET_CPU_P.
27870 * config/i386/x86-tune-sched-atom.c (do_reorder_for_imul): Likewise.
27871 (swap_top_of_ready_list): Likewise.
27872 (ix86_atom_sched_reorder): Likewise.
27873 * config/i386/x86-tune-sched-bd.c (ix86_bd_has_dispatch): Likewise.
27874 * config/i386/x86-tune-sched.c (ix86_adjust_cost): Likewise.
27876 2021-04-21 Martin Liska <mliska@suse.cz>
27878 * config/i386/i386-options.c (TARGET_EXPLICIT_NO_SAHF_P):
27880 (SET_TARGET_NO_SAHF): Likewise.
27881 (TARGET_EXPLICIT_PREFETCH_SSE_P): Likewise.
27882 (SET_TARGET_PREFETCH_SSE): Likewise.
27883 (TARGET_EXPLICIT_NO_TUNE_P): Likewise.
27884 (SET_TARGET_NO_TUNE): Likewise.
27885 (TARGET_EXPLICIT_NO_80387_P): Likewise.
27886 (SET_TARGET_NO_80387): Likewise.
27888 * config/i386/i386.h (TARGET_*): Remove.
27889 * opth-gen.awk: Generate new used macros.
27891 2021-04-21 Martin Liska <mliska@suse.cz>
27893 * config/i386/i386.h (PTA_*): Remove.
27894 (enum pta_flag): New.
27895 (DEF_PTA): Generate PTA_* values from i386-isa.def.
27896 * config/i386/i386-isa.def: New file.
27898 2021-04-21 Alex Coplan <alex.coplan@arm.com>
27901 * config/aarch64/aarch64-bti-insert.c (aarch64_bti_j_insn_p): New.
27902 (rest_of_insert_bti): Avoid inserting duplicate bti j insns for
27903 jump table targets.
27905 2021-04-21 H.J. Lu <hjl.tools@gmail.com>
27907 * config.gcc: Install mwaitintrin.h for i[34567]86-*-* and
27908 x86_64-*-* targets.
27909 * common/config/i386/i386-common.c (OPTION_MASK_ISA2_MWAIT_SET):
27911 (OPTION_MASK_ISA2_MWAIT_UNSET): Likewise.
27912 (ix86_handle_option): Handle -mmwait.
27913 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
27914 Replace OPTION_MASK_ISA_SSE3 with OPTION_MASK_ISA2_MWAIT on
27915 __builtin_ia32_monitor and __builtin_ia32_mwait.
27916 * config/i386/i386-options.c (isa2_opts): Add -mmwait.
27917 (ix86_valid_target_attribute_inner_p): Likewise.
27918 (ix86_option_override_internal): Enable mwait/monitor
27919 instructions for -msse3.
27920 * config/i386/i386.h (TARGET_MWAIT): New.
27921 (TARGET_MWAIT_P): Likewise.
27922 * config/i386/i386.opt: Add -mmwait.
27923 * config/i386/mwaitintrin.h: New file.
27924 * config/i386/pmmintrin.h: Include <mwaitintrin.h>.
27925 * config/i386/sse.md (sse3_mwait): Replace TARGET_SSE3 with
27927 (@sse3_monitor_<mode>): Likewise.
27928 * config/i386/x86gprintrin.h: Include <mwaitintrin.h>.
27929 * doc/extend.texi: Document mwait target attribute.
27930 * doc/invoke.texi: Document -mmwait.
27932 2021-04-21 Martin Liska <mliska@suse.cz>
27934 * config/i386/i386-options.c (DEF_ENUM): Remove it.
27935 * config/i386/i386-opts.h (DEF_ENUM): Likewise.
27936 * config/i386/stringop.def (DEF_ENUM): Likewise.
27938 2021-04-21 Martin Liska <mliska@suse.cz>
27940 * tree-cfg.c (gimple_verify_flow_info): Use qD instead
27941 of print_generic_expr.
27943 2021-04-21 Jakub Jelinek <jakub@redhat.com>
27945 PR rtl-optimization/100148
27946 * cprop.c (constprop_register): Use next_nondebug_insn instead of
27949 2021-04-21 Martin Liska <mliska@suse.cz>
27952 * cgraphunit.c (cgraph_node::analyze): Remove duplicate
27953 free_dominance_info calls.
27955 2021-04-21 Richard Biener <rguenther@suse.de>
27957 * gimple-fold.c (maybe_fold_reference): Remove is_lhs
27958 parameter (and assume it to be false).
27959 (fold_gimple_assign): Adjust, remove all callers of
27960 maybe_fold_reference calling it with is_lhs true.
27961 (gimple_fold_call): Likewise.
27962 (fold_stmt_1): Likewise.
27964 2021-04-21 Richard Biener <rguenther@suse.de>
27966 * fold-const.c (pedantic_non_lvalue_loc): Remove.
27967 (fold_binary_loc): Adjust.
27968 (fold_ternary_loc): Likewise.
27970 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
27972 PR middle-end/100130
27973 * varasm.c (get_block_for_decl): Make sure that any use of the
27974 retain attribute matches the section's retain flag.
27975 (switch_to_section): Check for retain mismatches even when
27976 changing sections, but do not warn if the given decl is the
27977 section's named.decl.
27978 (output_object_block): Pass the first decl in the block (if any)
27979 to switch_to_section.
27981 2021-04-20 H.J. Lu <hjl.tools@gmail.com>
27983 * config/i386/i386-c.c (ix86_target_macros_internal): Define
27984 __CRC32__ for -mcrc32.
27985 * config/i386/i386-options.c (ix86_option_override_internal):
27986 Enable crc32 instruction for -msse4.2.
27987 * config/i386/i386.md (sse4_2_crc32<mode>): Remove TARGET_SSE4_2
27989 (sse4_2_crc32di): Likewise.
27990 * config/i386/ia32intrin.h: Use crc32 target option for CRC32
27993 2021-04-20 Segher Boessenkool <segher@kernel.crashing.org>
27996 * config/rs6000/rs6000.c (rs6000_machine_from_flags): Do not consider
27999 2021-04-20 Martin Liska <mliska@suse.cz>
28001 * doc/invoke.texi: Fix typo.
28002 * params.opt: Likewise.
28004 2021-04-20 Martin Liska <mliska@suse.cz>
28006 * doc/invoke.texi: Document new param.
28008 2021-04-19 Andrew MacLeod <amacleod@redhat.com>
28010 PR tree-optimization/100081
28011 * gimple-range-cache.h (ranger_cache): Inherit from gori_compute
28012 rather than gori_compute_cache.
28013 * gimple-range-gori.cc (is_gimple_logical_p): Move to top of file.
28014 (range_def_chain::m_logical_depth): New member.
28015 (range_def_chain::range_def_chain): Initialize m_logical_depth.
28016 (range_def_chain::get_def_chain): Don't build defchains through more
28017 than LOGICAL_LIMIT logical expressions.
28018 * params.opt (param_ranger_logical_depth): New.
28020 2021-04-19 Richard Earnshaw <rearnsha@arm.com>
28023 * config/arm/arm.c (arm_configure_build_target): Do not strip
28024 extended FPU/SIMD feature bits from the target ISA when -mfpu
28025 is specified (partial revert of r11-8168).
28027 2021-04-19 Thomas Schwinge <thomas@codesourcery.com>
28029 * params.opt (-param=openacc-kernels=): Add.
28030 * omp-oacc-kernels-decompose.cc
28031 (pass_omp_oacc_kernels_decompose::gate): Use it.
28032 * doc/invoke.texi (-fopenacc-kernels=@var{mode}): Move...
28033 (--param): ... here, 'openacc-kernels'.
28035 2021-04-19 Martin Liska <mliska@suse.cz>
28038 * gengtype.c (finish_root_table): Align function arguments
28039 in between declaration and definition.
28041 2021-04-19 Eric Botcazou <ebotcazou@adacore.com>
28043 * config/i386/winnt.c (i386_pe_seh_cold_init): Properly deal with
28044 frames larger than the SEH maximum frame size.
28046 2021-04-18 Segher Boessenkool <segher@kernel.crashing.org>
28048 PR rtl-optimization/99927
28049 * combine.c (distribute_notes) [REG_UNUSED]: If the register already
28050 is dead, just drop it.
28052 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
28055 * config/i386/winnt-d.c (TARGET_D_TEMPLATES_ALWAYS_COMDAT): Define.
28056 * doc/tm.texi: Regenerate.
28057 * doc/tm.texi.in (D language and ABI): Add @hook for
28058 TARGET_D_TEMPLATES_ALWAYS_COMDAT.
28060 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
28062 * config/darwin-d.c (darwin_d_handle_target_object_format): New
28064 (darwin_d_register_target_info): New function.
28065 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28066 * config/dragonfly-d.c (dragonfly_d_handle_target_object_format): New
28068 (dragonfly_d_register_target_info): New function.
28069 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28070 * config/freebsd-d.c (freebsd_d_handle_target_object_format): New
28072 (freebsd_d_register_target_info): New function.
28073 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28074 * config/glibc-d.c (glibc_d_handle_target_object_format): New
28076 (glibc_d_register_target_info): New function.
28077 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28078 * config/i386/i386-d.c (ix86_d_handle_target_object_format): New
28080 (ix86_d_register_target_info): Add ix86_d_handle_target_object_format
28081 as handler for objectFormat key.
28082 * config/i386/winnt-d.c (winnt_d_handle_target_object_format): New
28084 (winnt_d_register_target_info): New function.
28085 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28086 * config/netbsd-d.c (netbsd_d_handle_target_object_format): New
28088 (netbsd_d_register_target_info): New function.
28089 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28090 * config/openbsd-d.c (openbsd_d_handle_target_object_format): New
28092 (openbsd_d_register_target_info): New function.
28093 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28094 * config/pa/pa-d.c (pa_d_handle_target_object_format): New function.
28095 (pa_d_register_target_info): Add pa_d_handle_target_object_format as
28096 handler for objectFormat key.
28097 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_object_format): New
28099 (rs6000_d_register_target_info): Add
28100 rs6000_d_handle_target_object_format as handler for objectFormat key.
28101 * config/sol2-d.c (solaris_d_handle_target_object_format): New
28103 (solaris_d_register_target_info): New function.
28104 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
28106 2021-04-16 Jakub Jelinek <jakub@redhat.com>
28109 * config/aarch64/aarch64.c (aarch64_function_arg_alignment): Change
28110 abi_break argument from bool * to unsigned *, store there the pre-GCC 9
28112 (aarch64_layout_arg, aarch64_gimplify_va_arg_expr): Adjust callers.
28113 (aarch64_function_arg_regno_p): Likewise. Only emit -Wpsabi note if
28114 the old and new alignment after applying MIN/MAX to it is different.
28116 2021-04-16 Tamar Christina <tamar.christina@arm.com>
28119 * config/aarch64/aarch64-sve.md (@aarch64_sve_trn1_conv<mode>): New.
28120 * config/aarch64/aarch64.c (aarch64_expand_sve_const_pred_trn): Use new
28122 * config/aarch64/iterators.md (UNSPEC_TRN1_CONV): New.
28124 2021-04-16 Bill Schmidt <wschmidt@linux.ibm.com>
28126 * doc/extend.texi (PowerPC AltiVec/VSX Built-in Functions): Revise
28127 this section and its subsections.
28129 2021-04-16 Jakub Jelinek <jakub@redhat.com>
28132 * config/aarch64/aarch64.md (*neg_asr_si2_extr, *extrsi5_insn_di): New
28133 define_insn patterns.
28135 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
28137 PR rtl-optimization/98689
28138 * reg-notes.def (UNTYPED_CALL): New note.
28139 * combine.c (distribute_notes): Handle it.
28140 * emit-rtl.c (try_split): Likewise.
28141 * rtlanal.c (rtx_properties::try_to_add_insn): Likewise. Assume
28142 that calls with the note implicitly set all return value registers.
28143 * builtins.c (expand_builtin_apply): Add a REG_UNTYPED_CALL
28146 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
28148 PR rtl-optimization/99596
28149 * rtlanal.c (rtx_properties::try_to_add_insn): Don't add global
28150 register accesses for const calls. Assume that pure functions
28151 can only read from global registers. Ignore cases in which
28152 the stack pointer has been marked global.
28154 2021-04-16 Jakub Jelinek <jakub@redhat.com>
28157 * tree-vect-loop.c (vect_transform_loop): Don't remove just
28158 dead scalar .MASK_LOAD calls, but also dead .COND_* calls - replace
28159 them by their last argument.
28161 2021-04-15 Martin Liska <mliska@suse.cz>
28163 * doc/invoke.texi: Other params don't use it, remove it.
28165 2021-04-15 Richard Biener <rguenther@suse.de>
28167 * gimple-builder.h: Add deprecation note.
28169 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
28172 * attribs.h (restrict_type_identity_attributes_to): Declare.
28173 * attribs.c (restrict_type_identity_attributes_to): New function.
28175 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
28178 * attribs.h (affects_type_identity_attributes): Declare.
28179 * attribs.c (remove_attributes_matching): New function.
28180 (affects_type_identity_attributes): Likewise.
28182 2021-04-15 Jakub Jelinek <jakub@redhat.com>
28185 * config/aarch64/aarch64.md (*<LOGICAL:optab>_<SHIFT:optab><mode>3):
28186 Add combine splitters for *<LOGICAL:optab>_ashl<mode>3 with
28187 ZERO_EXTEND, SIGN_EXTEND or AND.
28189 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
28191 PR rtl-optimization/99929
28192 * rtl.h (same_vector_encodings_p): New function.
28193 * cse.c (exp_equiv_p): Check that CONST_VECTORs have the same encoding.
28194 * cselib.c (rtx_equal_for_cselib_1): Likewise.
28195 * jump.c (rtx_renumbered_equal_p): Likewise.
28196 * lra-constraints.c (operands_match_p): Likewise.
28197 * reload.c (operands_match_p): Likewise.
28198 * rtl.c (rtx_equal_p_cb, rtx_equal_p): Likewise.
28200 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
28202 * print-rtl.c (rtx_writer::print_rtx_operand_codes_E_and_V): Print
28203 more information about variable-length CONST_VECTORs.
28205 2021-04-14 Vladimir N. Makarov <vmakarov@redhat.com>
28207 PR rtl-optimization/100066
28208 * lra-constraints.c (split_reg): Check paradoxical_subreg_p for
28209 ordered modes when choosing splitting mode for hard reg.
28211 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
28214 * config/aarch64/aarch64.c (aarch64_expand_sve_const_vector_sel):
28216 (aarch64_expand_sve_const_vector): Use it for nelts_per_pattern==2.
28218 2021-04-14 Andreas Krebbel <krebbel@linux.ibm.com>
28220 * config/s390/s390-builtins.def (O_M5, O_M12, ...): Add new macros
28221 for mask operand types.
28222 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
28223 (s390_vec_permi_dbl, s390_vpdi): Use the M5 type for the immediate
28225 (s390_vec_msum_u128, s390_vmslg): Use the M12 type for the
28227 * config/s390/s390.c (s390_const_operand_ok): Check the new
28228 operand types and generate a list of valid values.
28230 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
28232 * doc/tm.texi: Regenerate.
28233 * doc/tm.texi.in (D language and ABI): Add @hook for
28234 TARGET_D_REGISTER_OS_TARGET_INFO.
28236 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
28238 * config/aarch64/aarch64-d.c (aarch64_d_handle_target_float_abi): New
28240 (aarch64_d_register_target_info): New function.
28241 * config/aarch64/aarch64-protos.h (aarch64_d_register_target_info):
28243 * config/aarch64/aarch64.h (TARGET_D_REGISTER_CPU_TARGET_INFO):
28245 * config/arm/arm-d.c (arm_d_handle_target_float_abi): New function.
28246 (arm_d_register_target_info): New function.
28247 * config/arm/arm-protos.h (arm_d_register_target_info): Declare.
28248 * config/arm/arm.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28249 * config/i386/i386-d.c (ix86_d_handle_target_float_abi): New function.
28250 (ix86_d_register_target_info): New function.
28251 * config/i386/i386-protos.h (ix86_d_register_target_info): Declare.
28252 * config/i386/i386.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28253 * config/mips/mips-d.c (mips_d_handle_target_float_abi): New function.
28254 (mips_d_register_target_info): New function.
28255 * config/mips/mips-protos.h (mips_d_register_target_info): Declare.
28256 * config/mips/mips.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28257 * config/pa/pa-d.c (pa_d_handle_target_float_abi): New function.
28258 (pa_d_register_target_info): New function.
28259 * config/pa/pa-protos.h (pa_d_register_target_info): Declare.
28260 * config/pa/pa.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28261 * config/riscv/riscv-d.c (riscv_d_handle_target_float_abi): New
28263 (riscv_d_register_target_info): New function.
28264 * config/riscv/riscv-protos.h (riscv_d_register_target_info): Declare.
28265 * config/riscv/riscv.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28266 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_float_abi): New
28268 (rs6000_d_register_target_info): New function.
28269 * config/rs6000/rs6000-protos.h (rs6000_d_register_target_info):
28271 * config/rs6000/rs6000.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28272 * config/s390/s390-d.c (s390_d_handle_target_float_abi): New function.
28273 (s390_d_register_target_info): New function.
28274 * config/s390/s390-protos.h (s390_d_register_target_info): Declare.
28275 * config/s390/s390.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28276 * config/sparc/sparc-d.c (sparc_d_handle_target_float_abi): New
28278 (sparc_d_register_target_info): New function.
28279 * config/sparc/sparc-protos.h (sparc_d_register_target_info): Declare.
28280 * config/sparc/sparc.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
28281 * doc/tm.texi: Regenerate.
28282 * doc/tm.texi.in (D language and ABI): Add @hook for
28283 TARGET_D_REGISTER_CPU_TARGET_INFO.
28285 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
28287 * config/i386/i386-d.c (ix86_d_has_stdcall_convention): New function.
28288 * config/i386/i386-protos.h (ix86_d_has_stdcall_convention): Declare.
28289 * config/i386/i386.h (TARGET_D_HAS_STDCALL_CONVENTION): Define.
28290 * doc/tm.texi: Regenerate.
28291 * doc/tm.texi.in (D language and ABI): Add @hook for
28292 TARGET_D_HAS_STDCALL_CONVENTION.
28294 2021-04-14 Richard Biener <rguenther@suse.de>
28296 * tree-cfg.c (verify_gimple_assign_ternary): Verify that
28297 VEC_COND_EXPRs have a gimple_val condition.
28298 * tree-ssa-propagate.c (valid_gimple_rhs_p): VEC_COND_EXPR
28299 can no longer have a GENERIC condition.
28301 2021-04-14 Richard Earnshaw <rearnsha@arm.com>
28304 * config/arm/arm.c (arm_configure_build_target): Strip isa_all_fpbits
28305 from the isa_delta when -mfpu has been used.
28306 (arm_options_perform_arch_sanity_checks): It's the architecture that
28307 lacks an FPU not the processor.
28309 2021-04-13 Richard Biener <rguenther@suse.de>
28311 PR tree-optimization/100053
28312 * tree-ssa-sccvn.c (vn_nary_op_get_predicated_value): Do
28313 not use optimistic dominance queries for backedges to validate
28315 (dominated_by_p_w_unex): Add parameter to ignore executable
28316 state on backedges.
28317 (rpo_elim::eliminate_avail): Adjust.
28319 2021-04-13 Jakub Jelinek <jakub@redhat.com>
28322 * config/aarch64/aarch64.md (*aarch64_bfxil<mode>_extr,
28323 *aarch64_bfxilsi_extrdi): New define_insn patterns.
28325 2021-04-13 Jakub Jelinek <jakub@redhat.com>
28328 * simplify-rtx.c (simplify_immed_subreg): For MODE_COMPOSITE_P
28329 outermode, return NULL if the result doesn't encode back to the
28330 original byte sequence.
28331 (simplify_gen_subreg): Don't create SUBREGs from constants to
28332 MODE_COMPOSITE_P outermode.
28334 2021-04-12 Jakub Jelinek <jakub@redhat.com>
28336 PR rtl-optimization/99905
28337 * combine.c (expand_compound_operation): If pos + len > modewidth,
28338 perform the right shift by pos in inner_mode and then convert to mode,
28339 instead of trying to simplify a shift of rtx with inner_mode by pos
28340 as if it was a shift in mode.
28342 2021-04-12 Jakub Jelinek <jakub@redhat.com>
28345 * combine.c (simplify_and_const_int_1): Don't optimize varop
28346 away if it has side-effects.
28348 2021-04-12 Martin Liska <mliska@suse.cz>
28350 * doc/extend.texi: Escape @smallexample content.
28352 2021-04-12 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
28354 * config/s390/s390.md ("*movdi_31", "*movdi_64"): Add
28355 alternative in order to load a DFP zero.
28357 2021-04-12 Martin Liska <mliska@suse.cz>
28359 * doc/extend.texi: Be more precise in documentation
28360 of symver attribute.
28362 2021-04-12 Martin Liska <mliska@suse.cz>
28365 * gimplify.c (gimplify_expr): Right now, we unpoison all
28366 variables before a goto <dest>. We should not do it if we are
28369 2021-04-12 Cui,Lili <lili.cui@intel.com>
28371 * common/config/i386/cpuinfo.h (get_intel_cpu): Handle
28373 * common/config/i386/i386-common.c (processor_names): Add
28375 (processor_alias_table): Add rocketlake.
28376 * common/config/i386/i386-cpuinfo.h (processor_subtypes): Add
28377 INTEL_COREI7_ROCKETLAKE.
28378 * config.gcc: Add -march=rocketlake.
28379 * config/i386/i386-c.c (ix86_target_macros_internal): Handle
28381 * config/i386/i386-options.c (m_ROCKETLAKE) : Define.
28382 (processor_cost_table): Add rocketlake cost.
28383 * config/i386/i386.h (ix86_size_cost) : Define
28385 (processor_type) : Add PROCESSOR_ROCKETLAKE.
28386 (PTA_ROCKETLAKE): Ditto.
28387 * doc/extend.texi: Add rocketlake.
28388 * doc/invoke.texi: Add rocketlake.
28390 2021-04-12 Cui,Lili <lili.cui@intel.com>
28392 * config/i386/i386.h (PTA_ALDERLAKE): Change alderlake ISA list.
28393 * config/i386/i386-options.c (m_CORE_AVX2): Add m_ALDERLAKE.
28394 * common/config/i386/cpuinfo.h (get_intel_cpu): Add AlderLake model.
28395 * doc/invoke.texi: Change alderlake ISA list.
28397 2021-04-11 Hafiz Abid Qadeer <abidh@codesourcery.com>
28399 PR middle-end/98088
28400 * omp-expand.c (expand_oacc_collapse_init): Update condition in
28403 2021-04-10 H.J. Lu <hjl.tools@gmail.com>
28406 * config/i386/serializeintrin.h (_serialize): Defined as macro.
28408 2021-04-10 Jakub Jelinek <jakub@redhat.com>
28411 * expr.c (expand_expr_addr_expr_1): Test is_global_var rather than
28412 just TREE_STATIC on COMPOUND_LITERAL_EXPR_DECLs.
28414 2021-04-10 Jakub Jelinek <jakub@redhat.com>
28416 PR middle-end/99989
28417 * gimple-ssa-warn-alloca.c
28418 (alloca_type_and_limit::alloca_type_and_limit): Initialize limit to
28419 0 with integer precision unconditionally.
28421 2021-04-10 Jakub Jelinek <jakub@redhat.com>
28423 PR rtl-optimization/98601
28424 * rtlanal.c (rtx_addr_can_trap_p_1): Allow in assert unknown size
28425 not just for BLKmode, but also for VOIDmode. For STRICT_ALIGNMENT
28426 unaligned_mems handle VOIDmode like BLKmode.
28428 2021-04-10 Jan Hubicka <hubicka@ucw.cz>
28431 * tree.c (free_lang_data_in_decl): Do not release body of
28432 declare_variant_alt.
28434 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
28436 * config/aarch64/aarch64.c (aarch64_option_restore): If the
28437 architecture was specified explicitly and the tuning wasn't,
28438 tune for the architecture rather than the configured default CPU.
28440 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
28442 * config/aarch64/aarch64.md (tlsdesc_small_sve_<mode>): Use X30
28443 as the temporary register.
28445 2021-04-09 Martin Liska <mliska@suse.cz>
28447 * doc/extend.texi: Move non-target attributes on the top level.
28449 2021-04-09 Martin Liska <mliska@suse.cz>
28451 * doc/invoke.texi: Document minimum and maximum value of the
28452 argument for both supported compression algorithms.
28454 2021-04-08 David Edelsohn <dje.gcc@gmail.com>
28456 * config/rs6000/rs6000.c (rs6000_xcoff_select_section): Select
28457 TLS BSS before TLS data.
28458 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .comm.
28460 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
28462 * doc/sourcebuild.texi (stdint_types_mbig_endian): Document.
28464 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
28466 * match.pd: Extend vec_cond folds to handle shifts.
28468 2021-04-08 Maciej W. Rozycki <macro@orcam.me.uk>
28470 * config/vax/vax.md: Fix comment for `*bit<mode>' pattern's
28473 2021-04-08 Alex Coplan <alex.coplan@arm.com>
28476 * config/arm/iterators.md (MVE_vecs): New.
28477 (V_elem): Also handle V2DF.
28478 * config/arm/mve.md (*mve_mov<mode>): Rename to ...
28479 (*mve_vdup<mode>): ... this. Remove second alternative since
28480 vec_duplicate of const_int is not canonical RTL, and we don't
28481 want to match symbol_refs.
28482 (*mve_vec_duplicate<mode>): Delete (pattern is redundant).
28484 2021-04-08 Xionghu Luo <luoxhu@linux.ibm.com>
28486 * fold-const.c (fold_single_bit_test): Fix typo.
28487 * print-rtl.c (print_rtx_insn_vec): Call print_rtl_single
28490 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
28492 PR tree-optimization/97513
28493 * tree-vect-slp.c (vect_add_slp_permutation): New function,
28495 (vectorizable_slp_permutation): ...here. Detect cases in which
28496 all VEC_PERM_EXPRs are guaranteed to have the same stepped
28497 permute vector and only generate one permute vector for that case.
28498 Extend that case to handle variable-length vectors.
28500 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
28502 PR tree-optimization/99873
28503 * tree-vect-slp.c (vect_slp_prefer_store_lanes_p): New function.
28504 (vect_build_slp_instance): Don't split store groups that could
28505 use IFN_STORE_LANES.
28507 2021-04-07 Jakub Jelinek <jakub@redhat.com>
28510 * varasm.c (output_constant_pool_contents): Don't strip name encoding
28511 from XSTR (desc->sym, 0) or from label before passing those to
28514 2021-04-07 Richard Biener <rguenther@suse.de>
28516 PR tree-optimization/99954
28517 * tree-loop-distribution.c: Include tree-affine.h.
28518 (generate_memcpy_builtin): Try using tree-affine to prove
28520 (loop_distribution::classify_builtin_ldst): Always classify
28523 2021-04-07 Richard Biener <rguenther@suse.de>
28525 PR tree-optimization/99947
28526 * tree-vect-loop.c (vectorizable_induction): Pre-allocate
28527 steps vector to avoid pushing elements from the reallocated
28530 2021-04-07 Richard Biener <rguenther@suse.de>
28532 * tree-ssa-sccvn.h (print_vn_reference_ops): Declare.
28533 * tree-ssa-pre.c (print_pre_expr): Factor out VN reference operand
28535 * tree-ssa-sccvn.c (print_vn_reference_ops): ... into this new
28537 (debug_vn_reference_ops): New.
28539 2021-04-07 Bin Cheng <bin.cheng@linux.alibaba.com>
28541 PR tree-optimization/98736
28542 * tree-loop-distribution.c
28543 * (loop_distribution::bb_top_order_init):
28544 Compute RPO with programing order preserved by calling function
28545 rev_post_order_and_mark_dfs_back_seme.
28547 2021-04-06 Vladimir N. Makarov <vmakarov@redhat.com>
28550 * lra-constraints.c (split_reg): Don't check paradoxical_subreg_p.
28551 * lra-lives.c (clear_sparseset_regnos, regnos_in_sparseset_p): New
28553 (process_bb_lives): Don't update biggest mode of hard reg for
28554 implicit in multi-register group. Use the new functions for
28555 updating dead_set and unused_set by register notes.
28557 2021-04-06 Xianmiao Qu <xianmiao_qu@c-sky.com>
28559 * config/csky/csky_pipeline_ck802.md : Use insn reservation name
28562 2021-04-06 H.J. Lu <hjl.tools@gmail.com>
28564 * config/i386/x86-tune-costs.h (skylake_memcpy): Updated.
28565 (skylake_memset): Likewise.
28566 (skylake_cost): Change CLEAR_RATIO to 17.
28567 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
28568 Replace m_CANNONLAKE, m_ICELAKE_CLIENT, m_ICELAKE_SERVER,
28569 m_TIGERLAKE and m_SAPPHIRERAPIDS with m_SKYLAKE and m_CORE_AVX512.
28571 2021-04-06 Richard Biener <rguenther@suse.de>
28573 PR tree-optimization/99880
28574 * tree-vect-loop.c (maybe_set_vectorized_backedge_value): Only
28575 set vectorized defs of relevant PHIs.
28577 2021-04-06 Richard Biener <rguenther@suse.de>
28579 PR tree-optimization/99924
28580 * tree-vect-slp.c (vect_bb_partition_graph_r): Do not mark
28581 nodes w/o scalar stmts as visited.
28583 2021-04-06 Alex Coplan <alex.coplan@arm.com>
28586 * config/arm/arm.c (arm_libcall_uses_aapcs_base): Also use base
28587 PCS for [su]fix_optab.
28589 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
28591 * config/darwin.c (machopic_legitimize_pic_address): Check
28592 that the current pic register is one of the hard reg set
28593 before setting liveness.
28595 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
28597 * config/darwin.c (machopic_legitimize_pic_address): Fix
28598 whitespace, remove unused code.
28600 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28602 PR tree-optimization/99882
28603 * gimple-ssa-store-merging.c (bswap_view_convert): Handle val with
28606 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28608 PR rtl-optimization/99863
28609 * dse.c (replace_read): Drop regs_live argument. Instead of
28610 regs_live, use store_insn->fixed_regs_live if non-NULL,
28611 otherwise punt if insns sequence clobbers or sets any hard
28614 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28617 * targhooks.h (default_print_patchable_function_entry_1): Declare.
28618 * targhooks.c (default_print_patchable_function_entry_1): New function,
28619 copied from default_print_patchable_function_entry with an added flags
28621 (default_print_patchable_function_entry): Rewritten into a small
28622 wrapper around default_print_patchable_function_entry_1.
28623 * config/rs6000/rs6000.c (TARGET_ASM_PRINT_PATCHABLE_FUNCTION_ENTRY):
28625 (rs6000_print_patchable_function_entry): New function.
28627 2021-04-02 Eric Botcazou <ebotcazou@adacore.com>
28629 * doc/invoke.texi (fdelete-dead-exceptions): Minor tweak.
28631 2021-04-01 Jason Merrill <jason@redhat.com>
28634 * common.opt: Document v15 and v16.
28636 2021-04-01 Richard Biener <rguenther@suse.de>
28638 PR tree-optimization/99863
28639 * gimplify.c (gimplify_init_constructor): Recompute vector
28642 2021-04-01 Jakub Jelinek <jakub@redhat.com>
28644 * doc/extend.texi (symver attribute): Fix up syntax errors
28647 2021-04-01 Jakub Jelinek <jakub@redhat.com>
28649 PR tree-optimization/96573
28650 * gimple-ssa-store-merging.c (init_symbolic_number): Handle
28651 also pointer types.
28653 2021-04-01 Richard Biener <rguenther@suse.de>
28655 PR tree-optimization/99856
28656 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Promote
28657 precision to vector element precision.
28659 2021-04-01 Martin Jambor <mjambor@suse.cz>
28661 PR tree-optimization/97009
28662 * tree-sra.c (access_or_its_child_written): New function.
28663 (propagate_subaccesses_from_rhs): Use it instead of a simple grp_write
28666 2021-03-31 Jan Hubicka <hubicka@ucw.cz>
28669 * cif-code.def (USES_COMDAT_LOCAL): Make CIF_FINAL_NORMAL.
28671 2021-03-31 Pat Haugen <pthaugen@linux.ibm.com>
28674 * config/rs6000/altivec.md (xxspltiw_v4si, xxspltiw_v4sf_inst,
28675 xxspltidp_v2df_inst, xxsplti32dx_v4si_inst, xxsplti32dx_v4sf_inst,
28676 xxblend_<mode>, xxpermx_inst, xxeval): Mark prefixed.
28677 * config/rs6000/mma.md (mma_<vvi4i4i8>, mma_<avvi4i4i8>,
28678 mma_<vvi4i4i2>, mma_<avvi4i4i2>, mma_<vvi4i4>, mma_<avvi4i4>,
28679 mma_<pvi4i2>, mma_<apvi4i2>, mma_<vvi4i4i4>, mma_<avvi4i4i4>):
28681 * config/rs6000/rs6000.c (rs6000_final_prescan_insn): Adjust test.
28682 * config/rs6000/rs6000.md (define_attr "maybe_prefixed"): New.
28683 (define_attr "prefixed"): Update initializer.
28685 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28688 * dwarf2out.c (debug_ranges_dwo_section): New variable.
28689 (DW_RANGES_IDX_SKELETON): Define.
28690 (struct dw_ranges): Add begin_entry and end_entry members.
28691 (DEBUG_DWO_RNGLISTS_SECTION): Define.
28692 (add_ranges_num): Adjust r initializer for addition of *_entry
28694 (add_ranges_by_labels): For -gsplit-dwarf and force_direct,
28695 set idx to DW_RANGES_IDX_SKELETON.
28696 (use_distinct_base_address_for_range): New function.
28697 (index_rnglists): Don't set r->idx if it is equal to
28698 DW_RANGES_IDX_SKELETON. Initialize r->begin_entry and
28699 r->end_entry for -gsplit-dwarf if those will be needed by
28701 (output_rnglists): Add DWO argument. If true, switch to
28702 debug_ranges_dwo_section rather than debug_ranges_section.
28703 Adjust l1/l2 label indexes. Only output the offset table when
28704 dwo is true and don't include in there the skeleton range
28705 entry if present. For -gsplit-dwarf, skip ranges that belong
28706 to the other rnglists section. Change return type from void
28707 to bool and return true if there are any range entries for
28708 the other section. For dwarf_split_debug_info use
28709 DW_RLE_startx_endx, DW_RLE_startx_length and DW_RLE_base_addressx
28710 entries instead of DW_RLE_start_end, DW_RLE_start_length and
28711 DW_RLE_base_address. Use use_distinct_base_address_for_range.
28712 (init_sections_and_labels): Initialize debug_ranges_dwo_section
28713 if -gsplit-dwarf and DWARF >= 5. Adjust ranges_section_label
28714 and range_base_label indexes.
28715 (dwarf2out_finish): Call index_rnglists earlier before finalizing
28716 .debug_addr. Never emit DW_AT_rnglists_base attribute. For
28717 -gsplit-dwarf and DWARF >= 5 call output_rnglists up to twice
28718 with different dwo arguments.
28719 (dwarf2out_c_finalize): Clear debug_ranges_dwo_section.
28721 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28723 PR tree-optimization/98268
28724 * gimple-fold.c (maybe_canonicalize_mem_ref_addr): Call
28725 recompute_tree_invariant_for_addr_expr after successfully
28726 folding a TARGET_MEM_REF that occurs inside an ADDR_EXPR.
28728 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28730 PR tree-optimization/99726
28731 * tree-data-ref.c (create_intersect_range_checks_index): Bail
28732 out if there is more than one access function SCEV for the loop
28735 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28737 PR rtl-optimization/97141
28738 PR rtl-optimization/98726
28739 * emit-rtl.c (valid_for_const_vector_p): Return true for
28741 * rtx-vector-builder.h (rtx_vector_builder::step): Return a
28742 poly_wide_int instead of a wide_int.
28743 (rtx_vector_builder::apply_set): Take a poly_wide_int instead
28745 * rtx-vector-builder.c (rtx_vector_builder::apply_set): Likewise.
28746 * config/aarch64/aarch64.c (aarch64_legitimate_constant_p): Return
28747 false for CONST_VECTORs that cannot be forced to memory.
28748 * config/aarch64/aarch64-simd.md (mov<mode>): If a CONST_VECTOR
28749 is too complex to force to memory, build it up from individual
28752 2021-03-31 Jan Hubicka <jh@suse.cz>
28755 * cgraph.c (cgraph_node::release_body): Fix overactive check.
28757 2021-03-31 Christophe Lyon <christophe.lyon@linaro.org>
28760 * config/arm/vec-common.md (mul<mode>3): Disable on iwMMXT, expect
28763 2021-03-31 H.J. Lu <hjl.tools@gmail.com>
28765 * config/i386/i386-expand.c (expand_set_or_cpymem_via_rep):
28766 For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, don't convert QImode
28768 (decide_alg): For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, use
28769 "rep movsb/stosb" only for known sizes.
28770 * config/i386/i386-options.c (processor_cost_table): Use Ice
28771 Lake cost for Cannon Lake, Ice Lake, Tiger Lake, Sapphire
28772 Rapids and Alder Lake.
28773 * config/i386/i386.h (TARGET_PREFER_KNOWN_REP_MOVSB_STOSB): New.
28774 * config/i386/x86-tune-costs.h (icelake_memcpy): New.
28775 (icelake_memset): Likewise.
28776 (icelake_cost): Likewise.
28777 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
28780 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28783 * config/aarch64/aarch64.c
28784 (aarch64_vectorize_preferred_vector_alignment): Query the size
28785 of the provided SVE vector; do not assume that all SVE vectors
28786 have the same size.
28788 2021-03-31 Jan Hubicka <jh@suse.cz>
28791 * cgraph.c (cgraph_node::release_body): Remove all callers and
28793 * cgraphclones.c (cgraph_node::materialize_clone): Do not do it here.
28794 * cgraphunit.c (cgraph_node::expand): And here.
28796 2021-03-31 Martin Liska <mliska@suse.cz>
28798 * ipa-modref.c (analyze_ssa_name_flags): Fix coding style
28799 and one negated condition.
28801 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28802 Richard Sandiford <richard.sandiford@arm.com>
28805 * config/aarch64/aarch64.md (*add<mode>3_poly_1): Swap Uai and Uav
28806 constraints on operands[2] and similarly 0 and rk constraints
28807 on operands[1] corresponding to that.
28809 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28812 * configure.ac (HAVE_LD_BROKEN_PE_DWARF5): New AC_DEFINE if PECOFF
28813 linker doesn't support DWARF sections new in DWARF5.
28814 * config/i386/i386-options.c (ix86_option_override_internal): Default
28815 to dwarf_version 4 if HAVE_LD_BROKEN_PE_DWARF5 for TARGET_PECOFF
28817 * config.in: Regenerated.
28818 * configure: Regenerated.
28820 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28823 * config/aarch64/aarch64.c (aarch64_analyze_loop_vinfo): Check for
28824 available issue_info before using it.
28826 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28829 * config/aarch64/aarch64.md (sub<mode>3_compare1_imm): Do not allow zero
28832 2021-03-30 Xionghu Luo <luoxhu@linux.ibm.com>
28835 * config/rs6000/altivec.md (altivec_lvsl_reg): Change to ...
28836 (altivec_lvsl_reg_<mode>): ... this.
28837 (altivec_lvsr_reg): Change to ...
28838 (altivec_lvsr_reg_<mode>): ... this.
28839 * config/rs6000/predicates.md (vec_set_index_operand): New.
28840 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
28841 Enable 32bit variable vec_insert for all TARGET_VSX.
28842 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
28843 Enable 32bit variable vec_insert for p9 and above.
28844 (rs6000_expand_vector_set_var_p8): Rename to ...
28845 (rs6000_expand_vector_set_var_p7): ... this.
28846 (rs6000_expand_vector_set): Use TARGET_VSX and adjust assert
28848 * config/rs6000/vector.md (vec_set<mode>): Use vec_set_index_operand.
28849 * config/rs6000/vsx.md (xl_len_r): Use gen_altivec_lvsl_reg_di and
28850 gen_altivec_lvsr_reg_di.
28852 2021-03-30 H.J. Lu <hjl.tools@gmail.com>
28855 * config/i386/ia32intrin.h (__rdtsc): Defined as macro.
28856 (__rdtscp): Likewise.
28858 2021-03-30 Tamar Christina <tamar.christina@arm.com>
28860 PR tree-optimization/99825
28861 * tree-vect-slp-patterns.c (vect_check_evenodd_blend):
28862 Reject non-mult 2 lanes.
28864 2021-03-30 Richard Earnshaw <rearnsha@arm.com>
28867 * config/arm/arm.c (arm_file_start): Fix emission of
28868 Tag_ABI_VFP_args attribute.
28870 2021-03-30 Richard Biener <rguenther@suse.de>
28872 PR tree-optimization/99824
28873 * stor-layout.c (set_min_and_max_values_for_integral_type):
28874 Assert the precision is within the bounds of
28875 WIDE_INT_MAX_PRECISION.
28876 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Use
28877 the outermost component ref only to lower the access size
28878 and initialize that from the access type.
28880 2021-03-30 Richard Sandiford <richard.sandiford@arm.com>
28883 * config/aarch64/aarch64.md (mov<mode>): Pass multi-instruction
28884 CONST_INTs to aarch64_expand_mov_immediate when called after RA.
28886 2021-03-30 Mihailo Stojanovic <mihailo.stojanovic@typhoon-hil.com>
28888 * config/aarch64/aarch64.md
28889 (<optab>_trunc<fcvt_target><GPI:mode>2): Set the "arch"
28890 attribute to disambiguate between SIMD and FP variants of the
28893 2021-03-29 Jan Hubicka <hubicka@ucw.cz>
28895 * ipa-modref.c (merge_call_lhs_flags): Correct handling of deref.
28896 (analyze_ssa_name_flags): Fix typo in comment.
28898 2021-03-29 Alex Coplan <alex.coplan@arm.com>
28901 * config/aarch64/aarch64-sve-builtins.cc
28902 (function_builder::add_function): Add placeholder_p argument, use
28903 placeholder decls if this is set.
28904 (function_builder::add_unique_function): Instead of conditionally adding
28905 direct overloads, unconditionally add either a direct overload or a
28907 (function_builder::add_overloaded_function): Set placeholder_p if we're
28908 using C++ overloads. Use the obstack for string storage instead
28909 of relying on the tree nodes.
28910 (function_builder::add_overloaded_functions): Don't return early for
28911 m_direct_overloads: we need to add placeholders.
28912 * config/aarch64/aarch64-sve-builtins.h
28913 (function_builder::add_function): Add placeholder_p argument.
28915 2021-03-29 Richard Biener <rguenther@suse.de>
28917 PR tree-optimization/99807
28918 * tree-vect-slp.c (vect_slp_analyze_node_operations_1): Move
28919 assert below VEC_PERM handling.
28921 2021-03-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28924 * config/aarch64/aarch64-simd.md (move_lo_quad_internal_<mode>): Use
28925 aarch64_simd_or_scalar_imm_zero to match zeroes. Remove pattern
28926 matching const_int 0.
28927 (move_lo_quad_internal_be_<mode>): Likewise.
28928 (move_lo_quad_<mode>): Update for the above.
28929 * config/aarch64/iterators.md (VQ_2E): Delete.
28931 2021-03-29 Jakub Jelinek <jakub@redhat.com>
28933 PR tree-optimization/99777
28934 * fold-const.c (extract_muldiv_1): For conversions, punt on casts from
28935 types other than scalar integral types.
28937 2021-03-28 David Edelsohn <dje.gcc@gmail.com>
28939 * config/rs6000/rs6000.c (rs6000_output_dwarf_dtprel): Do not add
28940 XCOFF TLS reloc decorations.
28942 2021-03-28 Gerald Pfeifer <gerald@pfeifer.com>
28944 * doc/analyzer.texi (Analyzer Internals): Update link to
28945 "A Memory Model for Static Analysis of C Programs".
28947 2021-03-26 David Edelsohn <dje.gcc@gmail.com>
28949 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Call function.
28950 * config/rs6000/rs6000-protos.h (rs6000_special_adjust_field_align):
28952 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align): New.
28953 (rs6000_special_round_type_align): Recursively check innermost first
28956 2021-03-26 Jakub Jelinek <jakub@redhat.com>
28959 * dwarf2out.h (struct dw_fde_node): Add rule18 member.
28960 * dwarf2cfi.c (dwarf2out_frame_debug_expr): When handling (set hfp sp)
28961 assignment with drap_reg active, queue reg save for hfp with offset 0
28962 and flush queued reg saves. When handling a push with rule18,
28963 defer queueing reg save for hfp and just assert the offset is 0.
28964 (scan_trace): Assert that fde->rule18 is false.
28966 2021-03-26 Vladimir Makarov <vmakarov@redhat.com>
28969 * ira-costs.c (record_reg_classes): Put case with
28970 CT_RELAXED_MEMORY adjacent to one with CT_MEMORY.
28971 * ira.c (ira_setup_alts): Ditto.
28972 * lra-constraints.c (process_alt_operands): Ditto.
28973 * recog.c (asm_operand_ok): Ditto.
28974 * reload.c (find_reloads): Ditto.
28976 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28978 * config/aarch64/aarch64-protos.h
28979 (cpu_addrcost_table::post_modify_ld3_st3): New member variable.
28980 (cpu_addrcost_table::post_modify_ld4_st4): Likewise.
28981 * config/aarch64/aarch64.c (generic_addrcost_table): Update
28982 accordingly, using the same costs as for post_modify.
28983 (exynosm1_addrcost_table, xgene1_addrcost_table): Likewise.
28984 (thunderx2t99_addrcost_table, thunderx3t110_addrcost_table):
28985 (tsv110_addrcost_table, qdf24xx_addrcost_table): Likewise.
28986 (a64fx_addrcost_table): Likewise.
28987 (neoversev1_addrcost_table): New.
28988 (neoversev1_tunings): Use neoversev1_addrcost_table.
28989 (aarch64_address_cost): Use the new post_modify costs for CImode
28992 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28994 * config/aarch64/aarch64.opt
28995 (-param=aarch64-loop-vect-issue-rate-niters=): New parameter.
28996 * doc/invoke.texi: Document it.
28997 * config/aarch64/aarch64-protos.h (aarch64_base_vec_issue_info)
28998 (aarch64_scalar_vec_issue_info, aarch64_simd_vec_issue_info)
28999 (aarch64_advsimd_vec_issue_info, aarch64_sve_vec_issue_info)
29000 (aarch64_vec_issue_info): New structures.
29001 (cpu_vector_cost): Write comments above the variables rather
29003 (cpu_vector_cost::issue_info): New member variable.
29004 * config/aarch64/aarch64.c: Include gimple-pretty-print.h
29005 and tree-ssa-loop-niter.h.
29006 (generic_vector_cost, a64fx_vector_cost, qdf24xx_vector_cost)
29007 (thunderx_vector_cost, tsv110_vector_cost, cortexa57_vector_cost)
29008 (exynosm1_vector_cost, xgene1_vector_cost, thunderx2t99_vector_cost)
29009 (thunderx3t110_vector_cost): Initialize issue_info to null.
29010 (neoversev1_scalar_issue_info, neoversev1_advsimd_issue_info)
29011 (neoversev1_sve_issue_info, neoversev1_vec_issue_info): New structures.
29012 (neoversev1_vector_cost): Use them.
29013 (aarch64_vec_op_count, aarch64_sve_op_count): New structures.
29014 (aarch64_vector_costs::saw_sve_only_op): New member variable.
29015 (aarch64_vector_costs::num_vector_iterations): Likewise.
29016 (aarch64_vector_costs::scalar_ops): Likewise.
29017 (aarch64_vector_costs::advsimd_ops): Likewise.
29018 (aarch64_vector_costs::sve_ops): Likewise.
29019 (aarch64_vector_costs::seen_loads): Likewise.
29020 (aarch64_simd_vec_costs_for_flags): New function.
29021 (aarch64_analyze_loop_vinfo): Initialize num_vector_iterations.
29022 Count the number of predicate operations required by SVE WHILE
29024 (aarch64_comparison_type, aarch64_multiply_add_p): New functions.
29025 (aarch64_sve_only_stmt_p, aarch64_in_loop_reduction_latency): Likewise.
29026 (aarch64_count_ops): Likewise.
29027 (aarch64_add_stmt_cost): Record whether see an SVE operation
29028 that cannot currently be implementing using Advanced SIMD.
29029 Record issue information about the scalar, Advanced SIMD
29030 and (where relevant) SVE versions of a loop.
29031 (aarch64_vec_op_count::dump): New function.
29032 (aarch64_sve_op_count::dump): Likewise.
29033 (aarch64_estimate_min_cycles_per_iter): Likewise.
29034 (aarch64_adjust_body_cost): If issue information is available,
29035 try to compare the issue rates of the various loop implementations
29036 and increase or decrease the vector body cost accordingly.
29038 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29040 * config/aarch64/aarch64.c (aarch64_detect_vector_stmt_subtype):
29041 Assume a zero cost for induction phis.
29043 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29045 * config/aarch64/aarch64.c (aarch64_embedded_comparison_type): New
29047 (aarch64_adjust_stmt_cost): Add the costs of embedded scalar and
29048 vector comparisons.
29050 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29052 * config/aarch64/aarch64.c (aarch64_detect_scalar_stmt_subtype):
29054 (aarch64_add_stmt_cost): Call it.
29056 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29058 * config/aarch64/aarch64-tuning-flags.def (matched_vector_throughput):
29059 New tuning parameter.
29060 * config/aarch64/aarch64.c (neoversev1_tunings): Use it.
29061 (aarch64_estimated_sve_vq): New function.
29062 (aarch64_vector_costs::analyzed_vinfo): New member variable.
29063 (aarch64_vector_costs::is_loop): Likewise.
29064 (aarch64_vector_costs::unrolled_advsimd_niters): Likewise.
29065 (aarch64_vector_costs::unrolled_advsimd_stmts): Likewise.
29066 (aarch64_record_potential_advsimd_unrolling): New function.
29067 (aarch64_analyze_loop_vinfo, aarch64_analyze_bb_vinfo): Likewise.
29068 (aarch64_add_stmt_cost): Call aarch64_analyze_loop_vinfo or
29069 aarch64_analyze_bb_vinfo on the first use of a costs structure.
29070 Detect whether we're vectorizing a loop for SVE that might be
29071 completely unrolled if it used Advanced SIMD instead.
29072 (aarch64_adjust_body_cost_for_latency): New function.
29073 (aarch64_finish_cost): Call it.
29075 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29077 * config/aarch64/aarch64.c (aarch64_vector_costs): New structure.
29078 (aarch64_init_cost): New function.
29079 (aarch64_add_stmt_cost): Use aarch64_vector_costs instead of
29080 the default unsigned[3].
29081 (aarch64_finish_cost, aarch64_destroy_cost_data): New functions.
29082 (TARGET_VECTORIZE_INIT_COST): Override.
29083 (TARGET_VECTORIZE_FINISH_COST): Likewise.
29084 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
29086 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29088 * config/aarch64/aarch64.c (neoversev1_advsimd_vector_cost)
29089 (neoversev1_sve_vector_cost): New cost structures.
29090 (neoversev1_vector_cost): Likewise.
29091 (neoversev1_tunings): Use them. Enable use_new_vector_costs.
29093 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29095 * config/aarch64/aarch64-protos.h
29096 (sve_vec_cost::scatter_store_elt_cost): New member variable.
29097 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
29098 accordingly, taking the cost from the cost of a scalar_store.
29099 (a64fx_sve_vector_cost): Likewise.
29100 (aarch64_detect_vector_stmt_subtype): Detect scatter stores.
29102 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29104 * config/aarch64/aarch64-protos.h
29105 (simd_vec_cost::store_elt_extra_cost): New member variable.
29106 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
29107 accordingly, using the vec_to_scalar cost for the new field.
29108 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
29109 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
29110 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
29111 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
29112 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
29113 (thunderx3t110_advsimd_vector_cost): Likewise.
29114 (aarch64_detect_vector_stmt_subtype): Detect single-element stores.
29116 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29118 * config/aarch64/aarch64-protos.h (simd_vec_cost::ld2_st2_permute_cost)
29119 (simd_vec_cost::ld3_st3_permute_cost): New member variables.
29120 (simd_vec_cost::ld4_st4_permute_cost): Likewise.
29121 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
29122 accordingly, using zero for the new costs.
29123 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
29124 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
29125 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
29126 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
29127 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
29128 (thunderx3t110_advsimd_vector_cost): Likewise.
29129 (aarch64_ld234_st234_vectors): New function.
29130 (aarch64_adjust_stmt_cost): Likewise.
29131 (aarch64_add_stmt_cost): Call aarch64_adjust_stmt_cost if using
29132 the new vector costs.
29134 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29136 * config/aarch64/aarch64-protos.h (sve_vec_cost): Turn into a
29137 derived class of simd_vec_cost. Add information about CLAST[AB]
29138 and FADDA instructions.
29139 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
29140 accordingly, using the vec_to_scalar costs for the new fields.
29141 (a64fx_sve_vector_cost): Likewise.
29142 (aarch64_reduc_type): New function.
29143 (aarch64_sve_in_loop_reduction_latency): Likewise.
29144 (aarch64_detect_vector_stmt_subtype): Take a vinfo parameter.
29145 Use aarch64_sve_in_loop_reduction_latency to handle SVE reductions
29146 that occur in the loop body.
29147 (aarch64_add_stmt_cost): Update call accordingly.
29149 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
29151 * config/aarch64/aarch64-tuning-flags.def (use_new_vector_costs):
29153 * config/aarch64/aarch64-protos.h (simd_vec_cost): Put comments
29154 above the fields rather than to the right.
29155 (simd_vec_cost::reduc_i8_cost): New member variable.
29156 (simd_vec_cost::reduc_i16_cost): Likewise.
29157 (simd_vec_cost::reduc_i32_cost): Likewise.
29158 (simd_vec_cost::reduc_i64_cost): Likewise.
29159 (simd_vec_cost::reduc_f16_cost): Likewise.
29160 (simd_vec_cost::reduc_f32_cost): Likewise.
29161 (simd_vec_cost::reduc_f64_cost): Likewise.
29162 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
29163 accordingly, using the vec_to_scalar_cost for the new fields.
29164 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
29165 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
29166 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
29167 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
29168 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
29169 (thunderx3t110_advsimd_vector_cost): Likewise.
29170 (aarch64_use_new_vector_costs_p): New function.
29171 (aarch64_simd_vec_costs): New function, split out from...
29172 (aarch64_builtin_vectorization_cost): ...here.
29173 (aarch64_is_reduction): New function.
29174 (aarch64_detect_vector_stmt_subtype): Likewise.
29175 (aarch64_add_stmt_cost): Call aarch64_detect_vector_stmt_subtype if
29176 using the new vector costs.
29178 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
29181 * tree-emutls.c (get_emutls_init_templ_addr): Mark initializer of weak
29182 TLS declarations as public.
29184 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
29186 * config/aarch64/aarch64-d.c (IN_TARGET_CODE): Define.
29187 * config/arm/arm-d.c (IN_TARGET_CODE): Likewise.
29188 * config/i386/i386-d.c (IN_TARGET_CODE): Likewise.
29189 * config/mips/mips-d.c (IN_TARGET_CODE): Likewise.
29190 * config/pa/pa-d.c (IN_TARGET_CODE): Likewise.
29191 * config/riscv/riscv-d.c (IN_TARGET_CODE): Likewise.
29192 * config/rs6000/rs6000-d.c (IN_TARGET_CODE): Likewise.
29193 * config/s390/s390-d.c (IN_TARGET_CODE): Likewise.
29194 * config/sparc/sparc-d.c (IN_TARGET_CODE): Likewise.
29196 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
29199 * config.gcc (*-*-cygwin*): Add winnt-d.o
29200 (*-*-mingw*): Likewise.
29201 * config/i386/cygwin.h (EXTRA_TARGET_D_OS_VERSIONS): New macro.
29202 * config/i386/mingw32.h (EXTRA_TARGET_D_OS_VERSIONS): Likewise.
29203 * config/i386/t-cygming: Add winnt-d.o.
29204 * config/i386/winnt-d.c: New file.
29206 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
29208 * config/freebsd-d.c: Include memmodel.h.
29210 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
29213 * config.gcc (*-*-openbsd*): Add openbsd-d.o.
29214 * config/t-openbsd: Add openbsd-d.o.
29215 * config/openbsd-d.c: New file.
29217 2021-03-25 Stam Markianos-Wright <stam.markianos-wright@arm.com>
29219 PR tree-optimization/96974
29220 * tree-vect-stmts.c (vect_get_vector_types_for_stmt): Replace assert
29221 with graceful exit.
29223 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
29226 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
29230 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
29231 always_inline in system headers.
29233 2021-03-25 Kewen Lin <linkw@linux.ibm.com>
29235 * tree-vect-loop.c (vect_model_reduction_cost): Init inside_cost.
29237 2021-03-25 Jakub Jelinek <jakub@redhat.com>
29240 * tree-core.h (enum operand_equal_flag): Add OEP_ADDRESS_OF_SAME_FIELD.
29241 * fold-const.c (operand_compare::operand_equal_p): Don't compare
29242 field offsets if OEP_ADDRESS_OF_SAME_FIELD.
29244 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
29248 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
29249 always_inline in system headers.
29251 2021-03-25 Richard Biener <rguenther@suse.de>
29253 PR tree-optimization/99746
29254 * tree-vect-slp-patterns.c (complex_pattern::build): Do not mark
29255 the scalar stmt as patterned. Instead set up required things
29258 2021-03-25 Xionghu Luo <luoxhu@linux.ibm.com>
29260 * config/rs6000/rs6000.c (power8_costs): Change l2 cache
29263 2021-03-24 Martin Liska <mliska@suse.cz>
29266 * common/config/i386/i386-common.c (ARRAY_SIZE): Fix off-by-one
29268 * config/i386/i386-options.c (ix86_option_override_internal):
29269 Add run-time assert.
29271 2021-03-24 Martin Jambor <mjambor@suse.cz>
29274 * ipa-cp.c (initialize_node_lattices): Mark as bottom all
29275 parameters with unknown type.
29276 (ipacp_value_safe_for_type): New function.
29277 (propagate_vals_across_arith_jfunc): Verify that the constant type
29278 can be used for a type of the formal parameter.
29279 (propagate_vals_across_ancestor): Likewise.
29280 (propagate_scalar_across_jump_function): Likewise. Pass the type
29281 also to propagate_vals_across_ancestor.
29283 2021-03-24 Christophe Lyon <christophe.lyon@linaro.org>
29286 * config/arm/mve.md (movmisalign<mode>_mve_store): Use Ux
29288 (movmisalign<mode>_mve_load): Likewise.
29290 2021-03-24 Jakub Jelinek <jakub@redhat.com>
29293 * config/arm/vec-common.md (one_cmpl<mode>2, neg<mode>2,
29294 movmisalign<mode>): Disable expanders for TARGET_REALLY_IWMMXT.
29296 2021-03-24 Alexandre Oliva <oliva@adacore.com>
29298 * doc/sourcebuild.texi (sysconf): New effective target.
29300 2021-03-24 Alexandre Oliva <oliva@adacore.com>
29302 * config/i386/predicates.md (reg_or_const_vec_operand): New.
29303 * config/i386/sse.md (ssse3_pshufbv8qi3): Add an expander for
29304 the now *-prefixed insn_and_split, turn the splitter const vec
29305 into an input for the insn, making it an ignored immediate for
29306 non-split cases, and loaded into the scratch register
29309 2021-03-23 Vladimir N. Makarov <vmakarov@redhat.com>
29312 * config/aarch64/constraints.md (Utq, UOb, UOh, UOw, UOd, UOty):
29313 Use define_relaxed_memory_constraint for them.
29315 2021-03-23 Iain Sandoe <iain@sandoe.co.uk>
29318 * config/host-darwin.c (darwin_gt_pch_use_address): Add a
29319 colon to the diagnostic message.
29321 2021-03-23 Ilya Leoshkevich <iii@linux.ibm.com>
29323 * fwprop.c (fwprop_propagation::fwprop_propagation): Look at
29325 (try_fwprop_subst_note): Use set_info instead of insn_info.
29326 (try_fwprop_subst_pattern): Likewise.
29327 (try_fwprop_subst_notes): Likewise.
29328 (try_fwprop_subst): Likewise.
29329 (forward_propagate_subreg): Likewise.
29330 (forward_propagate_and_simplify): Likewise.
29331 (forward_propagate_into): Likewise.
29332 * rtl-ssa/accesses.h (set_info::single_nondebug_use) New
29334 (set_info::single_nondebug_insn_use): Likewise.
29335 (set_info::single_phi_use): Likewise.
29336 * rtl-ssa/member-fns.inl (set_info::single_nondebug_use) New
29338 (set_info::single_nondebug_insn_use): Likewise.
29339 (set_info::single_phi_use): Likewise.
29341 2021-03-23 Christophe Lyon <christophe.lyon@linaro.org>
29343 * doc/sourcebuild.texi (arm_dsp_ok, arm_dsp): Document.
29345 2021-03-23 Jakub Jelinek <jakub@redhat.com>
29348 * config/aarch64/aarch64.c (aarch64_add_offset): Tell
29349 expand_mult to perform an unsigned rather than a signed
29352 2021-03-23 H.J. Lu <hjl.tools@gmail.com>
29355 * config/i386/cpuid.h (__cpuid): Add __volatile__.
29356 (__cpuid_count): Likewise.
29358 2021-03-23 Richard Biener <rguenther@suse.de>
29360 PR tree-optimization/99721
29361 * tree-vect-slp.c (vect_slp_analyze_node_operations):
29362 Make sure we can schedule the node.
29364 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
29366 * config/riscv/riscv.c (riscv_subword): Take endianness into
29367 account when calculating the byte offset.
29369 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
29371 * config/riscv/predicates.md (subreg_lowpart_operator): New predicate
29372 * config/riscv/riscv.md (*addsi3_extended2, *subsi3_extended2)
29373 (*negsi2_extended2, *mulsi3_extended2, *<optab>si3_mask)
29374 (*<optab>si3_mask_1, *<optab>di3_mask, *<optab>di3_mask_1)
29375 (*<optab>si3_extend_mask, *<optab>si3_extend_mask_1): Use
29376 new predicate "subreg_lowpart_operator"
29378 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
29380 * config/riscv/riscv.c (riscv_swap_instruction): New function
29381 to byteswap an SImode rtx containing an instruction.
29382 (riscv_trampoline_init): Byteswap the generated instructions
29385 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
29387 * common/config/riscv/riscv-common.c
29388 (TARGET_DEFAULT_TARGET_FLAGS): Set default endianness.
29389 * config.gcc (riscv32be-*, riscv64be-*): Set
29390 TARGET_BIG_ENDIAN_DEFAULT to 1.
29391 * config/riscv/elf.h (LINK_SPEC): Change -melf* value
29392 depending on default endianness.
29393 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
29394 * config/riscv/linux.h (LINK_SPEC): Likewise.
29395 * config/riscv/riscv.c (TARGET_DEFAULT_TARGET_FLAGS): Set
29396 default endianness.
29397 * config/riscv/riscv.h (DEFAULT_ENDIAN_SPEC): New macro.
29399 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
29401 * config/riscv/elf.h (LINK_SPEC): Pass linker endianness flag.
29402 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
29403 * config/riscv/linux.h (LINK_SPEC): Likewise.
29404 * config/riscv/riscv.h (ASM_SPEC): Pass -mbig-endian and
29406 (BYTES_BIG_ENDIAN): Handle big endian.
29407 (WORDS_BIG_ENDIAN): Define to BYTES_BIG_ENDIAN.
29408 * config/riscv/riscv.opt (-mbig-endian, -mlittle-endian): New
29410 * doc/invoke.texi (-mbig-endian, -mlittle-endian): Document.
29412 2021-03-23 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
29414 * regcprop.c (find_oldest_value_reg): Ask target whether
29415 different mode is fine for replacement register.
29417 2021-03-23 Aldy Hernandez <aldyh@redhat.com>
29419 PR tree-optimization/99296
29420 * value-range.cc (irange::irange_set_1bit_anti_range): New.
29421 (irange::irange_set_anti_range): Call irange_set_1bit_anti_range
29422 * value-range.h (irange::irange_set_1bit_anti_range): New.
29424 2021-03-22 Vladimir N. Makarov <vmakarov@redhat.com>
29427 * config/aarch64/constraints.md (UtQ): Use
29428 define_relaxed_memory_constraint for it.
29429 * doc/md.texi (define_relaxed_memory_constraint): Describe it.
29430 * genoutput.c (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
29431 * genpreds.c (constraint_data): Add bitfield is_relaxed_memory.
29432 (have_relaxed_memory_constraints): New static var.
29433 (relaxed_memory_start, relaxed_memory_end): Ditto.
29434 (add_constraint): Add arg is_relaxed_memory. Check name for
29435 relaxed memory. Set up is_relaxed_memory in constraint_data and
29436 have_relaxed_memory_constraints. Adjust calls.
29437 (choose_enum_order): Process relaxed memory.
29438 (write_tm_preds_h): Ditto.
29439 (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
29440 * gensupport.c (process_rtx): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
29441 * ira-costs.c (record_reg_classes): Process CT_RELAXED_MEMORY.
29442 * ira-lives.c (single_reg_class): Use
29443 insn_extra_relaxed_memory_constraint.
29444 * ira.c (ira_setup_alts): CT_RELAXED_MEMORY.
29445 * lra-constraints.c (valid_address_p): Use
29446 insn_extra_relaxed_memory_constraint instead of other memory
29448 (process_alt_operands): Process CT_RELAXED_MEMORY.
29449 (curr_insn_transform): Use insn_extra_relaxed_memory_constraint.
29450 * recog.c (asm_operand_ok, preprocess_constraints): Process
29452 * reload.c (find_reloads): Ditto.
29453 * rtl.def (DEFINE_RELAXED_MEMORY_CONSTRAINT): New.
29454 * stmt.c (parse_input_constraint): Use
29455 insn_extra_relaxed_memory_constraint.
29457 2021-03-22 Segher Boessenkool <segher@kernel.crashing.org>
29460 * ubsan.c (ubsan_instrument_float_cast): Don't test for unordered if
29463 2021-03-22 Alex Coplan <alex.coplan@arm.com>
29466 * config/arm/arm-protos.h (neon_make_constant): Add generate
29467 argument to guard emitting insns, default to true.
29468 * config/arm/arm.c (arm_legitimate_constant_p_1): Reject
29469 CONST_VECTORs which neon_make_constant can't handle.
29470 (neon_vdup_constant): Add generate argument, avoid emitting
29471 insns if it's not set.
29472 (neon_make_constant): Plumb new generate argument through.
29473 * config/arm/constraints.md (Ui): New. Use it...
29474 * config/arm/mve.md (*mve_mov<mode>): ... here.
29475 * config/arm/vec-common.md (movv8hf): Use neon_make_constant to
29476 synthesize constants.
29478 2021-03-22 Richard Biener <rguenther@suse.de>
29480 * debug.h: Add deprecation warning.
29482 2021-03-22 Richard Biener <rguenther@suse.de>
29484 PR tree-optimization/99694
29485 * tree-ssa-sccvn.c (visit_phi): Ignore edges with the
29488 2021-03-22 Kito Cheng <kito.cheng@sifive.com>
29491 * config/riscv/riscv.c (riscv_expand_block_move): Get RTL value
29492 after type checking.
29494 2021-03-22 Jakub Jelinek <jakub@redhat.com>
29498 * dwarf2out.c (get_full_len): Use get_precision rather than
29500 (add_const_value_attribute): Make sure add_AT_wide argument has
29501 precision prec rather than some very wide one.
29503 2021-03-22 Kewen Lin <linkw@linux.ibm.com>
29505 * config/rs6000/rs6000.md (*rotldi3_insert_sf,
29506 *mov<SFDF:mode><SFDF2:mode>cc_p9, floatsi<mode>2_lfiwax,
29507 floatsi<mode>2_lfiwax_mem, floatunssi<mode>2_lfiwzx,
29508 floatunssi<mode>2_lfiwzx_mem, *floatsidf2_internal,
29509 *floatunssidf2_internal, fix_trunc<mode>si2_stfiwx,
29510 fix_trunc<mode>si2_internal, fixuns_trunc<mode>si2_stfiwx,
29511 *round32<mode>2_fprs, *roundu32<mode>2_fprs,
29512 *fix_trunc<mode>si2_internal): Fix empty split condition.
29513 * config/rs6000/vsx.md (*vsx_le_undo_permute_<mode>,
29514 vsx_reduc_<VEC_reduc_name>_v2df, vsx_reduc_<VEC_reduc_name>_v4sf,
29515 *vsx_reduc_<VEC_reduc_name>_v2df_scalar,
29516 *vsx_reduc_<VEC_reduc_name>_v4sf_scalar): Likewise.
29518 2021-03-22 Xionghu Luo <luoxhu@linux.ibm.com>
29521 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
29522 Convert idx to DImode.
29523 (rs6000_expand_vector_set_var_p8): Likewise.
29525 2021-03-21 Jakub Jelinek <jakub@redhat.com>
29528 * dwarf2out.c (insert_float): Change return type from void to
29529 unsigned, handle GET_MODE_SIZE (mode) == 2 and return element size.
29530 (mem_loc_descriptor, loc_descriptor, add_const_value_attribute):
29533 2021-03-20 H.J. Lu <hjl.tools@gmail.com>
29536 * config/i386/i386.c (construct_container): Check cfun != NULL
29537 before accessing silent_p.
29539 2021-03-20 Ahamed Husni <ahamedhusni73@gmail.com>
29541 * asan.c: Fix typos in comments.
29543 2021-03-20 Vladimir N. Makarov <vmakarov@redhat.com>
29545 PR rtl-optimization/99680
29546 * lra-constraints.c (skip_contraint_modifiers): Rename to skip_constraint_modifiers.
29547 (process_address_1): Check empty constraint before using
29550 2021-03-19 Pat Haugen <pthaugen@linux.ibm.com>
29552 * config/rs6000/rs6000.c (power10_cost): New.
29553 (rs6000_option_override_internal): Set Power10 costs.
29554 (rs6000_issue_rate): Set Power10 issue rate.
29555 * config/rs6000/power10.md: Rewrite for Power10.
29557 2021-03-19 Vladimir N. Makarov <vmakarov@redhat.com>
29560 * lra-constraints.c (process_address_1): Don't use unknown
29561 constraint for address constraint.
29563 2021-03-19 Iain Sandoe <iain@sandoe.co.uk>
29566 * config.gcc (powerpc-*-darwin8): Delete the reference to
29567 the now removed darwin8.h.
29569 2021-03-19 Olivier Hainque <hainque@adacore.com>
29572 * config/vxworksae.h (VX_CPU_PREFIX): Define.
29574 2021-03-19 John David Anglin <danglin@gcc.gnu.org>
29576 * config/pa/pa.c (import_milli): Use memcpy instead of strncpy.
29578 2021-03-19 Tamar Christina <tamar.christina@arm.com>
29580 PR tree-optimization/99656
29581 * tree-vect-slp-patterns.c (linear_loads_p,
29582 complex_add_pattern::matches, is_eq_or_top,
29583 vect_validate_multiplication, complex_mul_pattern::matches,
29584 complex_fms_pattern::matches): Remove complex_perm_kinds_t.
29585 * tree-vectorizer.h: (complex_load_perm_t): Removed.
29586 (slp_tree_to_load_perm_map_t): Use complex_perm_kinds_t instead of
29587 complex_load_perm_t.
29589 2021-03-19 H.J. Lu <hjl.tools@gmail.com>
29592 * config/i386/i386-options.c (ix86_init_machine_status): Set
29594 * config/i386/i386.c (init_cumulative_args): Set silent_p to
29596 (construct_container): Return early for return and argument
29597 errors if silent_p is true.
29598 * config/i386/i386.h (machine_function): Add silent_p.
29600 2021-03-19 Jakub Jelinek <jakub@redhat.com>
29603 * config/arm/constraints.md (Ds): New constraint.
29604 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): Use w,Ds
29605 constraint instead of w,Dm.
29607 2021-03-19 Andrew Stubbs <ams@codesourcery.com>
29609 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Fix quotes
29612 2021-03-19 Eric Botcazou <ebotcazou@adacore.com>
29614 PR middle-end/99641
29615 * fold-const.c (native_encode_initializer) <CONSTRUCTOR>: For an
29616 array type, do the computation of the current position in sizetype.
29618 2021-03-18 Vladimir N. Makarov <vmakarov@redhat.com>
29621 * lra-constraints.c (process_address_1): Use lookup_constraint
29622 only for a single constraint.
29624 2021-03-18 Martin Sebor <msebor@redhat.com>
29626 PR middle-end/99502
29627 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): Rename...
29628 (inbounds_memaccess_p): ...to this. Check the ending offset of
29629 the accessed member.
29631 2021-03-18 Andrew Stubbs <ams@codesourcery.com>
29633 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Add %< and
29634 %> quote markers to error messages.
29635 (gcn_goacc_validate_dims): Likewise.
29636 (gcn_conditional_register_usage): Remove exclaimation mark from error
29638 (gcn_vectorize_vec_perm_const): Ensure perm is fully uninitialized.
29640 2021-03-18 Jan Hubicka <hubicka@ucw.cz>
29642 * config/i386/x86-tune-costs.h (struct processor_costs): Fix costs of
29645 2021-03-18 Sinan Lin <sinan@isrc.iscas.ac.cn>
29646 Kito Cheng <kito.cheng@sifive.com>
29648 * config/riscv/riscv.c (riscv_block_move_straight): Change type
29649 to unsigned HOST_WIDE_INT for parameter and local variable with
29650 HOST_WIDE_INT type.
29651 (riscv_adjust_block_mem): Ditto.
29652 (riscv_block_move_loop): Ditto.
29653 (riscv_expand_block_move): Ditto.
29655 2021-03-18 Nick Clifton <nickc@redhat.com>
29657 * config/v850/v850.c (construct_restore_jr): Increase static
29659 (construct_save_jarl): Likewise.
29660 * config/v850/v850.h (DWARF2_DEBUGGING_INFO): Define.
29662 2021-03-18 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29664 * config/aarch64/aarch64.c (aarch64_adjust_generic_arch_tuning): Define.
29665 (aarch64_override_options_internal): Use it.
29666 (generic_tunings): Add AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS to
29669 2021-03-17 Sandra Loosemore <sandra@codesourcery.com>
29671 * config/nios2/nios2.c (nios2_custom_check_insns): Clean up
29672 error message format issues.
29673 (nios2_option_override): Likewise.
29674 (nios2_expand_fpu_builtin): Likewise.
29675 (nios2_init_custom_builtins): Adjust to avoid bogus strncpy
29676 truncation warning.
29677 (nios2_expand_custom_builtin): More error message format fixes.
29678 (nios2_expand_rdwrctl_builtin): Likewise.
29679 (nios2_expand_rdprs_builtin): Likewise.
29680 (nios2_expand_eni_builtin): Likewise.
29681 (nios2_expand_builtin): Likewise.
29682 (nios2_register_custom_code): Likewise.
29683 (nios2_valid_target_attribute_rec): Likewise.
29684 (nios2_add_insn_asm): Fix uninitialized variable warning.
29686 2021-03-17 Jan Hubicka <jh@suse.cz>
29688 * config/i386/x86-tune-costs.h (struct processor_costs): Update costs
29689 of gather to match reality.
29690 * config/i386/x86-tune.def (X86_TUNE_USE_GATHER): Enable for znver3.
29692 2021-03-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29694 * config/aarch64/aarch64-builtins.c (aarch64_expand_rng_builtin): Use EQ
29695 to compare against CC_REG rather than NE.
29697 2021-03-17 H.J. Lu <hjl.tools@gmail.com>
29700 * config/i386/i386.c (ix86_force_load_from_GOT_p): Support
29701 inline assembly statements.
29702 (ix86_print_operand): Update 'P' handling for -fno-plt.
29704 2021-03-17 Tamar Christina <tamar.christina@arm.com>
29707 * config/aarch64/aarch64.c
29708 (aarch64_simd_clone_compute_vecsize_and_simdlen): Remove unused var.
29710 2021-03-16 Segher Boessenkool <segher@kernel.crashing.org>
29713 * config/rs6000/predicates.md (branch_comparison_operator): Allow
29714 ordered and unordered for CCFPmode, if flag_finite_math_only.
29716 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29719 * config/i386/i386-expand.c (ix86_split_lea_for_addr): Emit a MULT
29720 rather than ASHIFT.
29721 * config/i386/i386.md (mult by 1248 into ashift): New splitter.
29723 2021-03-16 Martin Liska <mliska@suse.cz>
29726 * optc-save-gen.awk: Add flag_ipa_ra to exceptions for
29727 cl_optimization_compare function.
29729 2021-03-16 Ilya Leoshkevich <iii@linux.ibm.com>
29731 * config/s390/s390.c (f_constraint_p): Treat "fv" constraints
29734 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29737 * config/i386/i386.h (struct machine_function): Add
29738 has_explicit_vzeroupper bitfield.
29739 * config/i386/i386-expand.c (ix86_expand_builtin): Set
29740 cfun->machine->has_explicit_vzeroupper when expanding
29741 IX86_BUILTIN_VZEROUPPER.
29742 * config/i386/i386-features.c (rest_of_handle_insert_vzeroupper):
29743 Do the mode switching only when TARGET_VZEROUPPER, expensive
29744 optimizations turned on and not optimizing for size.
29745 (pass_insert_vzeroupper::gate): Enable even when
29746 cfun->machine->has_explicit_vzeroupper is set.
29748 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29751 * config/aarch64/aarch64.c
29752 (aarch64_simd_clone_compute_vecsize_and_simdlen): If not a function
29753 definition, walk TYPE_ARG_TYPES list if non-NULL for argument types
29754 instead of DECL_ARGUMENTS. Ignore types for uniform arguments.
29756 2021-03-15 Richard Biener <rguenther@suse.de>
29758 PR tree-optimization/98834
29759 * tree-ssa-sccvn.c (vn_reference_lookup_3): Handle missing
29760 subsetting by truncating the access size.
29762 2021-03-15 Jan Hubicka <hubicka@ucw.cz>
29764 * config/i386/i386-options.c (processor_cost_table): Add znver3_cost.
29765 * config/i386/x86-tune-costs.h (znver3_cost): New gobal variable; copy
29768 2021-03-15 Martin Liska <mliska@suse.cz>
29770 * spellcheck.c: Add missing comma in initialization.
29772 2021-03-14 Uroš Bizjak <ubizjak@gmail.com>
29774 * config/i386/sse.md (*vec_extract<mode>): Merge alternative 0 with
29775 alternative 2 and alternative 1 with alternative 3 using
29776 YW register constraint.
29777 (*vec_extract<PEXTR_MODE12:mode>_zext): Merge alternatives
29778 using YW register constraint.
29779 (*vec_extractv16qi_zext): Ditto.
29780 (*vec_extractv4si): Merge alternatives 4 and 5
29781 using Yw register constraint.
29782 (*ssse3_palignr<mode>_perm): Use Yw instead of v for alternative 3.
29784 2021-03-13 Martin Sebor <msebor@redhat.com>
29786 PR tree-optimization/99489
29787 * builtins.c (gimple_call_alloc_size): Fail gracefully when argument
29788 is not a call statement.
29790 2021-03-13 Jakub Jelinek <jakub@redhat.com>
29792 PR tree-optimization/99544
29793 * match.pd (X + (X << C) -> X * (1 + (1 << C))): Don't simplify
29794 if for vector types multiplication can't be done in type's mode.
29796 2021-03-12 Eric Botcazou <ebotcazou@adacore.com>
29799 * config/sparc/constraints.md (w): Rename to...
29800 (W): ... this and ditch previous implementation.
29801 * config/sparc/sparc.md (*movdi_insn_sp64): Replace W with m.
29802 (*movdf_insn_sp64): Likewise.
29803 (*mov<VM64:mode>_insn_sp64): Likewise.
29804 * config/sparc/sync.md (*atomic_compare_and_swap<mode>_1): Replace
29806 (atomic_compare_and_swap_leon3_1): Likewise.
29807 (*atomic_compare_and_swapdi_v8plus): Likewise.
29808 * config/sparc/sparc.c (memory_ok_for_ldd): Remove useless test on
29809 architecture and add missing address validity check during LRA.
29811 2021-03-12 Tobias Burnus <tobias@codesourcery.com>
29814 * gimplify.c (omp_add_variable): Handle NULL_TREE as size
29815 occuring for assumed-size arrays in use_device_{ptr,addr}.
29817 2021-03-12 Jakub Jelinek <jakub@redhat.com>
29820 * config/i386/constraints.md (YW): New internal constraint.
29821 * config/i386/sse.md (v_Yw): Add V4TI, V2TI, V1TI and TI cases.
29822 (*<sse2_avx2>_<insn><mode>3<mask_name>,
29823 *<sse2_avx2>_uavg<mode>3<mask_name>, *abs<mode>2,
29824 *<s>mul<mode>3_highpart<mask_name>): Use <v_Yw> instead of v in
29826 (<sse2_avx2>_psadbw): Use YW instead of v in constraints.
29827 (*avx2_pmaddwd, *sse2_pmaddwd, *<code>v8hi3, *<code>v16qi3,
29828 avx2_pmaddubsw256, ssse3_pmaddubsw128): Merge last two alternatives
29829 into one, use Yw instead of former x,v.
29830 (ashr<mode>3, <insn><mode>3): Use <v_Yw> instead of x in constraints of
29831 the last alternative.
29832 (<sse2_avx2>_packsswb<mask_name>, <sse2_avx2>_packssdw<mask_name>,
29833 <sse2_avx2>_packuswb<mask_name>, <sse4_1_avx2>_packusdw<mask_name>,
29834 *<ssse3_avx2>_pmulhrsw<mode>3<mask_name>, <ssse3_avx2>_palignr<mode>,
29835 <ssse3_avx2>_pshufb<mode>3<mask_name>): Merge last two alternatives
29836 into one, use <v_Yw> instead of former x,v.
29837 (avx2_interleave_highv32qi<mask_name>,
29838 vec_interleave_highv16qi<mask_name>): Use Yw instead of v in
29839 constraints. Add && <mask_avx512bw_condition> to condition.
29840 (avx2_interleave_lowv32qi<mask_name>,
29841 vec_interleave_lowv16qi<mask_name>,
29842 avx2_interleave_highv16hi<mask_name>,
29843 vec_interleave_highv8hi<mask_name>,
29844 avx2_interleave_lowv16hi<mask_name>, vec_interleave_lowv8hi<mask_name>,
29845 avx2_pshuflw_1<mask_name>, sse2_pshuflw_1<mask_name>,
29846 avx2_pshufhw_1<mask_name>, sse2_pshufhw_1<mask_name>,
29847 avx2_<code>v16qiv16hi2<mask_name>, sse4_1_<code>v8qiv8hi2<mask_name>,
29848 *sse4_1_<code>v8qiv8hi2<mask_name>_1, <sse2_avx2>_<insn><mode>3): Use
29849 Yw instead of v in constraints.
29850 * config/i386/mmx.md (Yv_Yw): New define_mode_attr.
29851 (*mmx_<insn><mode>3, mmx_ashr<mode>3, mmx_<insn><mode>3): Use <Yv_Yw>
29852 instead of Yv in constraints.
29853 (*mmx_<insn><mode>3, *mmx_mulv4hi3, *mmx_smulv4hi3_highpart,
29854 *mmx_umulv4hi3_highpart, *mmx_pmaddwd, *mmx_<code>v4hi3,
29855 *mmx_<code>v8qi3, mmx_pack<s_trunsuffix>swb, mmx_packssdw,
29856 mmx_punpckhbw, mmx_punpcklbw, mmx_punpckhwd, mmx_punpcklwd,
29857 *mmx_uavgv8qi3, *mmx_uavgv4hi3, mmx_psadbw): Use Yw instead of Yv in
29859 (*mmx_pinsrw, *mmx_pinsrb, *mmx_pextrw, *mmx_pextrw_zext, *mmx_pextrb,
29860 *mmx_pextrb_zext): Use YW instead of Yv in constraints.
29861 (*mmx_eq<mode>3, mmx_gt<mode>3): Use x instead of Yv in constraints.
29862 (mmx_andnot<mode>3, *mmx_<code><mode>3): Split last alternative into
29863 two, one with just x, another isa avx512vl with v.
29865 2021-03-12 Martin Liska <mliska@suse.cz>
29867 * doc/invoke.texi: Add missing param documentation.
29869 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29872 * Makefile.in (ANALYZER_OBJS): Add analyzer/feasible-graph.o and
29873 analyzer/trimmed-graph.o.
29874 * doc/analyzer.texi (Analyzer Paths): Rewrite description of
29875 feasibility checking to reflect new implementation.
29876 * doc/invoke.texi (-fdump-analyzer-feasibility): Document new
29878 * shortest-paths.h (shortest_paths::get_shortest_distance): New.
29880 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29882 * digraph.cc (selftest::test_shortest_paths): Update
29883 shortest_paths init for new param. Add test of
29884 SPS_TO_GIVEN_TARGET.
29885 * shortest-paths.h (enum shortest_path_sense): New.
29886 (shortest_paths::shortest_paths): Add "sense" param.
29887 Update for renamings. Generalize to use "sense" param.
29888 (shortest_paths::get_shortest_path): Rename param.
29889 (shortest_paths::m_sense): New field.
29890 (shortest_paths::m_prev): Rename...
29891 (shortest_paths::m_best_edge): ...to this.
29892 (shortest_paths::get_shortest_path): Update for renamings.
29893 Conditionalize flipping of path on sense of traversal.
29895 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29897 * digraph.cc (selftest::test_shortest_paths): Add test coverage
29898 for paths from B and C.
29899 * shortest-paths.h (shortest_paths::shortest_paths): Handle
29900 unreachable nodes, rather than asserting.
29902 2021-03-11 David Edelsohn <dje.gcc@gmail.com>
29905 * config/rs6000/rs6000.c (rs6000_xcoff_file_start): Don't create
29906 xcoff_tbss_section_name.
29907 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .lcomm.
29908 * xcoffout.c (xcoff_tbss_section_name): Delete.
29909 * xcoffout.h (xcoff_tbss_section_name): Delete.
29911 2021-03-11 Richard Biener <rguenther@suse.de>
29913 PR tree-optimization/99523
29914 * tree-cfg.c (dump_function_to_file): Dump SSA names
29915 w/o identifier to the decls section as well, not only those
29916 without a VAR_DECL.
29918 2021-03-11 Jakub Jelinek <jakub@redhat.com>
29921 * ipa-icf-gimple.c (func_checker::compare_gimple_call): For internal
29922 function calls with lhs fail if the lhs don't have compatible types.
29924 2021-03-11 Hans-Peter Nilsson <hp@axis.com>
29926 * config/cris/cris.h (HARD_FRAME_POINTER_REGNUM): Define.
29927 Change FRAME_POINTER_REGNUM to correspond to a new faked
29928 register faked_fp, part of GENNONACR_REGS like faked_ap.
29929 (CRIS_FAKED_REGS_CONTENTS): New helper macro.
29930 (FIRST_PSEUDO_REGISTER, FIXED_REGISTERS, CALL_USED_REGISTERS):
29931 (REG_ALLOC_ORDER, REG_CLASS_CONTENTS, REGNO_OK_FOR_BASE_P)
29932 (ELIMINABLE_REGS, REGISTER_NAMES): Adjust accordingly.
29933 * config/cris/cris.md (CRIS_FP_REGNUM): Renumber to new faked
29935 (CRIS_REAL_FP_REGNUM): New constant.
29936 * config/cris/cris.c (cris_reg_saved_in_regsave_area): Check
29937 for HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM.
29938 (cris_initial_elimination_offset): Handle elimination changes
29939 to HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM
29940 and add one from FRAME_POINTER_REGNUM to
29941 HARD_FRAME_POINTER_REGNUM.
29942 (cris_expand_prologue, cris_expand_epilogue): Emit code for
29943 hard_frame_pointer_rtx instead of frame_pointer_rtx.
29945 2021-03-10 David Edelsohn <dje.gcc@gmail.com>
29948 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Add check for DCmode.
29949 * config/rs6000/rs6000.c (rs6000_special_round_type_align): Same.
29951 2021-03-10 Vladimir N. Makarov <vmakarov@redhat.com>
29954 * lra-constraints.c (process_address_1): Don't check unknown
29955 constraint, use X for empty constraint.
29957 2021-03-10 Alex Coplan <alex.coplan@arm.com>
29959 * config/aarch64/aarch64.c (aarch64_vfp_is_call_or_return_candidate):
29960 Fix typo in comment describing "is_ha" argument.
29962 2021-03-10 John David Anglin <danglin@gcc.gnu.org>
29964 * doc/sourcebuild.texi: Document LRA target selector.
29966 2021-03-10 David Malcolm <dmalcolm@redhat.com>
29968 * doc/ux.texi: Add subsection contrasting interactive versus
29969 batch usage of GCC.
29971 2021-03-10 Joel Hutton <joel.hutton@arm.com>
29974 * tree-vect-stmts.c (vectorizable_store): Fix scatter store mask
29976 (vectorizable_load): Fix gather load mask check condition.
29978 2021-03-10 Richard Biener <rguenther@suse.de>
29980 PR tree-optimization/99510
29981 * tree.c (check_aligned_type): Check that the candidate
29982 has TYPE_USER_ALIGN set instead of matching with the
29985 2021-03-10 Eric Botcazou <ebotcazou@adacore.com>
29987 * config/sparc/sparc.c (sparc_regmode_natural_size): Return 4 for
29988 float and vector integer modes only if the mode is not larger.
29990 2021-03-10 Hans-Peter Nilsson <hp@axis.com>
29992 * config/cris/cris.h (DWARF_FRAME_REGISTERS): Define.
29994 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29996 * ira.c (ira_setup_alts, ira_get_dup_out_num): Process digital
29998 * ira-lives.c (single_reg_class): Ditto.
30000 2021-03-09 Sebastian Huber <sebastian.huber@embedded-brains.de>
30002 * config.gcc (aarch64-*-rtems*): Include general rtems.h after
30003 the architecture-specific rtems.h.
30004 (aarch64-*-rtems*): Likewise.
30005 (arm*-*-rtems*): Likewise.
30006 (epiphany-*-rtems*): Likewise.
30007 (riscv*-*-rtems*): Likewise.
30009 2021-03-09 Jakub Jelinek <jakub@redhat.com>
30011 PR tree-optimization/99305
30012 * tree-ssa-phiopt.c (conditional_replacement): Test integer_pow2p
30013 before integer_all_onesp instead of vice versa.
30015 2021-03-09 Richard Earnshaw <rearnsha@arm.com>
30017 * common/config/arm/arm-common.c (arm_config_default): Change type
30018 of 'i' to unsigned.
30020 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
30023 * lra-constraints.c (process_address_1): Process constraint 'g'
30024 separately and digital constraints containing more one digit.
30026 2021-03-09 Nick Clifton <nickc@redhat.com>
30028 * config/rx/rx.h (DBX_DEBUGGING_INFO): Define.
30029 (DWARF"_DEBUGGING_INFO): Define.
30031 2021-03-09 Eric Botcazou <ebotcazou@adacore.com>
30034 * calls.c (initialize_argument_information): When the argument
30035 is passed by reference, do not make a copy in a thunk only if
30036 the argument is already in memory. Remove redundant test for
30037 the case of callee copy.
30039 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
30042 * lra-constraints.c (process_address_1): Process 0..9 constraints
30043 in process_address_1.
30045 2021-03-09 Andreas Krebbel <krebbel@linux.ibm.com>
30047 * config/s390/s390.c (struct s390_processor processor_table):
30048 Binutils name string must not be empty.
30050 2021-03-09 Claudiu Zissulescu <claziss@synopsys.com>
30052 * config/arc/arc.c (arc_attr_type): Remove function.
30054 2021-03-09 Martin Liska <mliska@suse.cz>
30057 * config/i386/i386-options.c (ix86_option_override_internal):
30058 Set isa_flags for OPTS argument and not for the global
30061 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
30063 * config/rs6000/predicates.md (ds_form_mem_operand): Check
30066 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
30069 * config/rs6000/predicates.md (ds_form_mem_operand) New
30071 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10) Use
30072 ds_form_mem_operand in ld/lwa patterns.
30073 * config/rs6000/fusion.md: Regenerate file.
30075 2021-03-08 Martin Sebor <msebor@redhat.com>
30077 PR middle-end/98266
30078 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): New function.
30079 (array_bounds_checker::check_array_bounds): Call it.
30081 2021-03-08 Martin Sebor <msebor@redhat.com>
30083 PR middle-end/97631
30084 * tree-ssa-strlen.c (maybe_warn_overflow): Test rawmem.
30085 (handle_builtin_stxncpy_strncat): Rename locals. Determine
30086 destination size from allocation calls. Issue a more appropriate
30088 (handle_builtin_memcpy): Pass true as rawmem to maybe_warn_overflow.
30089 (handle_builtin_memset): Same.
30091 2021-03-08 Peter Bergner <bergner@linux.ibm.com>
30094 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_permute): Add an assert
30095 to ensure we do not have an Altivec style address.
30096 * config/rs6000/vsx.md (*vsx_le_perm_load_<mode>): Disable if passed
30097 an Altivec style address.
30098 (*vsx_le_perm_store_<mode>): Likewise.
30099 (splitters after *vsx_le_perm_store_<mode>): Likewise.
30100 (vsx_load_<mode>): Disable special expander if passed an Altivec
30102 (vsx_store_<mode>): Likewise.
30104 2021-03-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30107 * config/aarch64/predicates.md (aarch64_simd_shift_imm_vec_qi): Define.
30108 (aarch64_simd_shift_imm_vec_hi): Likewise.
30109 (aarch64_simd_shift_imm_vec_si): Likewise.
30110 (aarch64_simd_shift_imm_vec_di): Likewise.
30111 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le): Use
30112 predicate from above.
30113 (aarch64_shrn<mode>_insn_be): Likewise.
30114 (aarch64_rshrn<mode>_insn_le): Likewise.
30115 (aarch64_rshrn<mode>_insn_be): Likewise.
30116 (aarch64_shrn2<mode>_insn_le): Likewise.
30117 (aarch64_shrn2<mode>_insn_be): Likewise.
30118 (aarch64_rshrn2<mode>_insn_le): Likewise.
30119 (aarch64_rshrn2<mode>_insn_be): Likewise.
30121 2021-03-08 Vladimir N. Makarov <vmakarov@redhat.com>
30124 * lra-constraints.c (skip_contraint_modifiers): New function.
30125 (process_address_1): Use it before lookup_constraint call.
30127 2021-03-08 Martin Liska <mliska@suse.cz>
30130 * config/i386/i386-options.c (ix86_option_override_internal):
30131 Enable UINTR and HRESET for -march that supports it.
30133 2021-03-08 Ilya Leoshkevich <iii@linux.ibm.com>
30135 * config/s390/s390.c (f_constraint_p): New function.
30136 (s390_md_asm_adjust): Implement TARGET_MD_ASM_ADJUST.
30137 (TARGET_MD_ASM_ADJUST): Likewise.
30139 2021-03-08 Tobias Burnus <tobias@codesourcery.com>
30142 * tree-nested.c (convert_local_reference_stmt): Avoid calling
30143 lookup_field_for_decl for Fortran module (= namespace context).
30145 2021-03-08 Andreas Krebbel <krebbel@linux.ibm.com>
30147 * config/s390/s390.c (s390_expand_vec_compare): Implement <0
30148 comparison with arithmetic right shift.
30149 (s390_expand_vcond): No need for a force_reg anymore.
30150 s390_vec_compare will do it.
30151 * config/s390/vector.md ("vec_cmp<mode><tointvec>"): Accept also
30152 immediate operands.
30154 2021-03-07 Jakub Jelinek <jakub@redhat.com>
30157 * config/i386/constraints.md (Yw): Use SSE_REGS if TARGET_SSE
30158 but TARGET_AVX512BW or TARGET_AVX512VL is not set. Adjust description
30160 * config/i386/sse.md (v_Yw): New define_mode_attr.
30161 (*<insn><mode>3, *mul<mode>3<mask_name>, *avx2_<code><mode>3,
30162 *sse4_1_<code><mode>3<mask_name>): Use <v_Yw> instead of v
30164 * config/i386/mmx.md (mmx_pshufw_1, *vec_dupv4hi): Use Yw instead of
30165 xYw in constraints.
30167 2021-03-06 Julian Brown <julian@codesourcery.com>
30169 * tree-pretty-print.c (dump_generic_node): Emit non-generic
30170 address space info for aggregates.
30172 2021-03-06 Hans-Peter Nilsson <hp@axis.com>
30174 * config/cris/cris.h (MAX_FIXED_MODE_SIZE): Don't define.
30176 2021-03-05 Jakub Jelinek <jakub@redhat.com>
30178 PR middle-end/99322
30179 * tree-cfg.c (bb_to_omp_idx): New variable.
30180 (execute_build_cfg): Release the bb_to_omp_idx vector after
30181 cleanup_tree_cfg returns.
30182 (handle_abnormal_edges): Remove bb_to_omp_idx argument, adjust
30183 for bb_to_omp_idx being a vec<int> instead of pointer to array
30185 (make_edges): Remove bb_to_omp_idx local variable, don't pass
30186 it to handle_abnormal_edges, adjust for bb_to_omp_idx being a
30187 vec<int> instead of pointer to array of ints and don't free/release
30189 (remove_bb): When removing a bb and placing forced label somewhere
30190 else, ensure it is put into the same OpenMP region during cfg
30191 pass if possible or to entry successor as fallback. Unregister
30192 bb from bb_to_omp_idx.
30194 2021-03-05 Vladimir N. Makarov <vmakarov@redhat.com>
30197 * lra-constraints.c (process_address_1): Skip decomposing address
30198 for asm insn operand with unknown constraint.
30200 2021-03-05 Martin Jambor <mjambor@suse.cz>
30203 * cgraph.c (cgraph_edge::set_call_stmt): Do not update all
30204 corresponding speculative edges if we are about to resolve
30205 sepculation. Make edge direct (and so resolve speculations) before
30206 removing it from call_site_hash.
30207 (cgraph_edge::make_direct): Relax the initial assert to allow calling
30208 the function on speculative direct edges.
30210 2021-03-05 Eric Botcazou <ebotcazou@adacore.com>
30212 PR rtl-optimization/99376
30213 * rtlanal.c (nonzero_bits1) <arithmetic operators>: If the number
30214 of low-order zero bits is too large, set the result to 0 directly.
30216 2021-03-04 Jakub Jelinek <jakub@redhat.com>
30218 PR middle-end/93235
30219 * expmed.c (store_bit_field_using_insv): Return false of xop0 is a
30220 SUBREG and a SUBREG to op_mode can't be created.
30222 2021-03-04 Alex Coplan <alex.coplan@arm.com>
30225 * config/aarch64/aarch64-sve-builtins.cc
30226 (function_resolver::require_vector_type): Handle error_mark_node.
30228 2021-03-04 Ilya Leoshkevich <iii@linux.ibm.com>
30230 * cfgexpand.c (expand_asm_loc): Pass new parameter.
30231 (expand_asm_stmt): Likewise.
30232 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Add new
30234 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
30235 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
30236 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
30237 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
30238 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
30239 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
30240 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
30241 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
30242 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
30243 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
30244 * doc/tm.texi (md_asm_adjust): Likewise.
30245 * target.def (md_asm_adjust): Likewise.
30247 2021-03-04 Richard Biener <rguenther@suse.de>
30249 PR middle-end/97855
30250 * tree-pretty-print.c: Poison pp_printf.
30251 (dump_decl_name): Avoid use of pp_printf.
30252 (dump_block_node): Likewise.
30253 (dump_generic_node): Likewise.
30255 2021-03-04 Martin Sebor <msebor@redhat.com>
30257 PR middle-end/96963
30258 PR middle-end/94655
30259 * builtins.c (handle_array_ref): New helper.
30260 (handle_mem_ref): New helper.
30261 (compute_objsize_r): Factor out ARRAY_REF and MEM_REF handling
30262 into new helper functions. Correct a workaround for vectorized
30265 2021-03-03 Pat Haugen <pthaugen@linux.ibm.com>
30267 * config/rs6000/dfp.md (extendddtd2, trunctddd2, *cmp<mode>_internal1,
30268 floatditd2, ftrunc<mode>2, fix<mode>di2, dfp_ddedpd_<mode>,
30269 dfp_denbcd_<mode>, dfp_dxex_<mode>, dfp_diex_<mode>,
30270 *dfp_sgnfcnc_<mode>, dfp_dscli_<mode>, dfp_dscri_<mode>): Update size
30271 attribute for Power10.
30272 * config/rs6000/mma.md (*movoo): Likewise.
30273 * config/rs6000/rs6000.md (define_attr "size"): Add 256.
30274 (define_mode_attr bits): Add DD/TD modes.
30275 * config/rs6000/sync.md (load_quadpti, store_quadpti, load_lockedpti,
30276 store_conditionalpti): Update size attribute for Power10.
30278 2021-03-03 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
30281 * config/sparc/t-sparc (tree-ssanames.o-warn): Don't error for
30282 -Wuninitialized, -Wmaybe-uninitialized.
30283 (wide-int.o-warn): Likewise.
30285 2021-03-03 Richard Earnshaw <rearnsha@arm.com>
30287 * common/config/arm/arm-common.c: Include configargs.h.
30288 (arm_config_default): New function.
30289 (arm_target_mode): Renamed from arm_target_thumb_only. Handle
30290 processors that do not support Thumb. Take into account the
30291 --with-mode configuration setting for selecting the default.
30292 * config/arm/arm.h (OPTION_DEFAULT_SPECS): Remove entry for 'mode'.
30293 (TARGET_MODE_SPEC_FUNCTIONS): Update for function name change.
30295 2021-03-03 Martin Liska <mliska@suse.cz>
30297 PR gcov-profile/97461
30298 * gcov-io.h (GCOV_PREALLOCATED_KVP): Remove.
30300 2021-03-03 Eric Botcazou <ebotcazou@adacore.com>
30303 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
30304 point back the hard frame pointer to its default location when the
30305 frame is larger than SEH_MAX_FRAME_SIZE.
30307 2021-03-03 Jakub Jelinek <jakub@redhat.com>
30310 * config/i386/predicates.md (logic_operator): New define_predicate.
30311 * config/i386/i386.md (mov + mem using comm arith peephole2):
30312 Punt if operands[1] is EXT_REX_SSE_REGNO_P, AVX512BW is not enabled
30313 and the inner mode is [QH]Imode.
30315 2021-03-03 Jakub Jelinek <jakub@redhat.com>
30318 * dwarf2out.c (dw_loc_list_struct): Add end_entry member.
30319 (new_loc_list): Clear end_entry.
30320 (output_loc_list): Only use DW_LLE_startx_length for -gsplit-dwarf
30321 if HAVE_AS_LEB128, otherwise use DW_LLE_startx_endx. Fix comment
30323 (index_location_lists): For dwarf_version >= 5 without HAVE_AS_LEB128,
30324 initialize also end_entry.
30326 2021-03-03 Jakub Jelinek <jakub@redhat.com>
30329 * cfgrtl.c (fixup_partitions): When changing some bbs from hot to cold
30330 partitions, if in non-layout mode after reorder_blocks also move
30331 affected blocks to ensure a single partition transition.
30333 2021-03-03 Jason Merrill <jason@redhat.com>
30336 * cgraphunit.c (process_function_and_variable_attributes): Don't
30337 warn about flatten on an alias if the target also has it.
30338 * cgraph.h (symtab_node::get_alias_target_tree): New.
30340 2021-03-02 David Edelsohn <dje.gcc@gmail.com>
30342 * config/rs6000/rs6000.md (tls_get_tpointer_internal): Prepend
30343 period to symbol name.
30344 (tls_get_addr_internal<mode>): Same.
30346 2021-03-02 David Malcolm <dmalcolm@redhat.com>
30349 * diagnostic-show-locus.c
30350 (selftest::test_one_liner_many_fixits_2): Fix accidental usage of
30353 2021-03-02 Martin Sebor <msebor@redhat.com>
30355 PR middle-end/99276
30356 * builtins.c (warn_for_access): Remove stray warning text.
30358 2021-03-02 Martin Sebor <msebor@redhat.com>
30360 PR middle-end/99295
30361 * doc/extend.texi (attribute malloc): Reword and clarify nonaliasing
30364 2021-03-02 Jakub Jelinek <jakub@redhat.com>
30367 * dwarf2out.c (output_macinfo_op): Use DW_MACRO_*_str* even with
30368 -gdwarf-5 -gstrict-dwarf. For -gsplit-dwarf -gdwarf-5 use
30369 DW_MACRO_*_strx instead of DW_MACRO_*_strp. Handle
30370 DW_MACRO_define_strx and DW_MACRO_undef_strx.
30371 (save_macinfo_strings): Use DW_MACRO_*_str* even with
30372 -gdwarf-5 -gstrict-dwarf. Handle DW_MACRO_define_strx and
30373 DW_MACRO_undef_strx.
30375 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
30377 * config/s390/s390-builtin-types.def (BT_FN_V4SF_V8HI_UINT): New
30379 (BT_FN_V8HI_V8HI_UINT): Likewise.
30380 (BT_FN_V8HI_V4SF_V4SF_UINT): Likewise.
30381 * config/s390/s390-builtins.def (B_NNPA): New macro definition.
30382 (s390_vclfnhs, s390_vclfnls, s390_vcrnfs, s390_vcfn, s390_vcnf):
30383 New builtin definitions.
30384 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Bump
30385 vector extension version.
30386 * config/s390/s390.c (s390_expand_builtin): Check if builtins are
30387 available with current -march level.
30388 * config/s390/s390.md (UNSPEC_NNPA_VCLFNHS_V8HI)
30389 (UNSPEC_NNPA_VCLFNLS_V8HI, UNSPEC_NNPA_VCRNFS_V8HI)
30390 (UNSPEC_NNPA_VCFN_V8HI, UNSPEC_NNPA_VCNF_V8HI): New constants.
30391 * config/s390/vecintrin.h (vec_extend_to_fp32_hi): New macro.
30392 (vec_extend_to_fp32_lo): Likewise.
30393 (vec_round_from_fp32): Likewise.
30394 (vec_convert_to_fp16): Likewise.
30395 (vec_convert_from_fp16): Likewise.
30396 * config/s390/vx-builtins.md (vclfnhs_v8hi): New insn pattern.
30397 (vclfnls_v8hi): Likewise.
30398 (vcrnfs_v8hi): Likewise.
30399 (vcfn_v8hi): Likewise.
30400 (vcnf_v8hi): Likewise.
30402 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
30404 * common/config/s390/s390-common.c (processor_flags_table): New entry.
30405 * config.gcc: Enable arch14 for --with-arch and --with-tune.
30406 * config/s390/driver-native.c (s390_host_detect_local_cpu): Pick
30407 arch14 for unknown CPU models.
30408 * config/s390/s390-opts.h (enum processor_type): Add PROCESSOR_ARCH14.
30409 * config/s390/s390.c (s390_issue_rate): Add case for PROCESSOR_ARCH14.
30410 (s390_get_sched_attrmask): Likewise.
30411 (s390_get_unit_mask): Likewise.
30412 * config/s390/s390.h (enum processor_flags): Add PF_NNPA and PF_ARCH14.
30413 (TARGET_CPU_ARCH14, TARGET_CPU_ARCH14_P, TARGET_CPU_NNPA)
30414 (TARGET_CPU_NNPA_P, TARGET_ARCH14, TARGET_ARCH14_P, TARGET_NNPA)
30415 (TARGET_NNPA_P): New macro definitions.
30416 * config/s390/s390.md ("cpu_facility", "enabled"): Add arch14 and nnpa.
30417 * config/s390/s390.opt: Add PROCESSOR_ARCH14.
30419 2021-03-02 Jakub Jelinek <jakub@redhat.com>
30421 PR middle-end/95757
30422 * tree-vrp.c (register_edge_assert_for): Remove superfluous ()s around
30423 condition. Call register_edge_assert_for_1 for == 0, != 0, == 1 and
30424 != 1 comparisons if name is lhs of a comparison.
30426 2021-03-01 Iain Sandoe <iain@sandoe.co.uk>
30430 * config/darwin-protos.h (darwin_should_restore_cfa_state): New.
30431 * config/darwin.c (darwin_should_restore_cfa_state): New.
30432 * config/darwin.h (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New.
30433 * doc/tm.texi: Regenerated.
30434 * doc/tm.texi.in: Document TARGET_ASM_SHOULD_RESTORE_CFA_STATE.
30435 * dwarf2cfi.c (connect_traces): If the target requests, restore
30436 the CFA expression after a DW_CFA_restore.
30437 * target.def (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New hook.
30439 2021-03-01 Martin Liska <mliska@suse.cz>
30442 * optc-save-gen.awk: Add 4 more exceptions.
30444 2021-03-01 Nathan Sidwell <nathan@acm.org>
30447 * tree.h (TYPE_ALIGN_RAW): New accessor.
30448 (TYPE_ALIGN): Use it.
30450 2021-03-01 Jan Hubicka <jh@suse.cz>
30453 * ipa-fnsummary.c (compute_fn_summary): Fix sanity check.
30455 2021-03-01 Eric Botcazou <ebotcazou@adacore.com>
30458 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
30459 point the hard frame pointer to the SSE register save area instead
30460 of the general register save area. Perform only minimal adjustment
30461 for small frames if it is initially not correctly aligned.
30462 (ix86_expand_prologue): Remove early saves for a SEH target.
30463 * config/i386/winnt.c (struct seh_frame_state): Document constraint.
30465 2021-02-28 Jakub Jelinek <jakub@redhat.com>
30468 * ipa.c (symbol_table::remove_unreachable_nodes): Fix a comment
30469 typo - referneced -> referenced.
30470 * tree.c (component_ref_size): Fix comment typo -
30471 refernce -> reference.
30472 * tree-ssa-alias.c (access_path_may_continue_p): Fix comment typo -
30473 traling -> trailing.
30474 (aliasing_component_refs_p): Fix comment typos -
30475 refernce -> reference and refernece -> reference and
30476 traling -> trailing.
30477 (nonoverlapping_refs_since_match_p): Fix comment typo -
30478 referneces -> references.
30479 * doc/invoke.texi (--param modref-max-bases): Fix a typo -
30480 referneces -> references.
30482 2021-02-27 Iain Sandoe <iain@sandoe.co.uk>
30484 * config/host-darwin.c (darwin_gt_pch_use_address): Modify
30485 diagnostic message to avoid use of a contraction and format
30488 2021-02-27 Jakub Jelinek <jakub@redhat.com>
30491 * gcse.c (gcse_or_cprop_is_too_expensive): Use %wu instead of
30492 HOST_WIDE_INT_PRINT_UNSIGNED in warning format string.
30493 * ipa-devirt.c (ipa_odr_read_section): Use %wd instead of
30494 HOST_WIDE_INT_PRINT_DEC in inform format string. Fix comment
30497 2021-02-26 Richard Biener <rguenther@suse.de>
30499 PR middle-end/99281
30500 * expr.c (store_field): For calls with return-slot optimization
30501 and addressable return type expand the store directly.
30503 2021-02-26 Richard Biener <rguenther@suse.de>
30506 * builtins.c (warn_string_no_nul): Fix diagnostic formatting.
30508 2021-02-26 Peter Bergner <bergner@linux.ibm.com>
30511 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Replace assert
30514 2021-02-26 Aaron Sawdey <acsawdey@linux.ibm.com>
30516 * config.gcc: Add rs6000-pcrel-opt.o.
30517 * config/rs6000/rs6000-pcrel-opt.c: New file.
30518 * config/rs6000/pcrel-opt.md: New file.
30519 * config/rs6000/predicates.md: Add d_form_memory predicate.
30520 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_PCREL_OPT.
30521 * config/rs6000/rs6000-passes.def: Add pass_pcrel_opt.
30522 * config/rs6000/rs6000-protos.h: Add reg_to_non_prefixed(),
30523 pcrel_opt_valid_mem_p(), output_pcrel_opt_reloc(),
30524 and make_pass_pcrel_opt().
30525 * config/rs6000/rs6000.c (reg_to_non_prefixed): Make global.
30526 (rs6000_option_override_internal): Add pcrel-opt.
30527 (rs6000_delegitimize_address): Support pcrel-opt.
30528 (rs6000_opt_masks): Add pcrel-opt.
30529 (pcrel_opt_valid_mem_p): New function.
30530 (reg_to_non_prefixed): Make global.
30531 (rs6000_asm_output_opcode): Reset prepend_p_to_next_insn.
30532 (output_pcrel_opt_reloc): New function.
30533 * config/rs6000/rs6000.md (loads_extern_addr): New attr.
30534 (pcrel_extern_addr): Set loads_extern_addr.
30535 Add include for pcrel-opt.md.
30536 * config/rs6000/rs6000.opt: Add -mpcrel-opt.
30537 * config/rs6000/t-rs6000: Add rules for pcrel-opt.c and
30540 2021-02-26 YunQiang Su <yunqiang.su@cipunited.com>
30543 * config/mips/mips.c (mips_expand_ext_as_unaligned_load):
30544 If TARGET_64BIT and dest is SUBREG, we check the width, if it
30545 equal to SImode, we use SImode operation, just like what we are
30548 2021-02-26 Marek Polacek <polacek@redhat.com>
30550 * builtins.c (warn_for_access): Fix typos.
30552 2021-02-25 Iain Sandoe <iain@sandoe.co.uk>
30554 * config/aarch64/aarch64.md (<optab>_rol<mode>3): Add a '#'
30555 mark in front of the immediate quantity.
30556 (<optab>_rolsi3_uxtw): Likewise.
30558 2021-02-25 Richard Earnshaw <rearnsha@arm.com>
30561 * config/arm/thumb2.md (nonsecure_call_reg_thumb2_fpcxt): New pattern.
30562 (nonsecure_call_value_reg_thumb2_fpcxt): Likewise.
30563 (nonsecure_call_reg_thumb2): Restrict to using r4 for the callee
30564 address and disable when the FPCXT is not available.
30565 (nonsecure_call_value_reg_thumb2): Likewise.
30567 2021-02-25 Nathan Sidwell <nathan@acm.org>
30570 * doc/invoke.texi (flang-info-module-cmi): Renamed option.
30572 2021-02-25 Tamar Christina <tamar.christina@arm.com>
30574 * tree-vect-slp.c (optimize_load_redistribution_1): Abort on NULL nodes.
30576 2021-02-25 Richard Biener <rguenther@suse.de>
30578 PR tree-optimization/99253
30579 * tree-vect-loop.c (check_reduction_path): First compute
30580 code, then verify out-of-loop uses.
30582 2021-02-25 Jakub Jelinek <jakub@redhat.com>
30585 * match.pd ((T)(A) + CST -> (T)(A + CST)): Add :s to convert.
30587 2021-02-25 Jakub Jelinek <jakub@redhat.com>
30589 PR tree-optimization/80635
30590 * tree-vrp.c (vrp_simplify_cond_using_ranges): Also handle
30591 VIEW_CONVERT_EXPR if modes are the same, innerop is integral and
30592 has mode precision.
30594 2021-02-25 Richard Biener <rguenther@suse.de>
30596 * tree-vect-slp.c (optimize_load_redistribution_1): Delay
30597 load_map population.
30598 (vect_match_slp_patterns_2): Revert part of last change.
30599 (vect_analyze_slp): Do not interleave optimize_load_redistribution
30600 with pattern detection but do it afterwards. Dump the
30601 whole SLP graph after pattern recognition and load
30602 redistribution optimization finished.
30604 2021-02-24 Jakub Jelinek <jakub@redhat.com>
30607 * omp-low.c (struct omp_context): Add teams_nested_p and
30608 nonteams_nested_p members.
30609 (scan_omp_target): Diagnose teams nested inside of target with other
30610 directives strictly nested inside of the same target.
30611 (check_omp_nesting_restrictions): Set ctx->teams_nested_p or
30612 ctx->nonteams_nested_p as needed.
30614 2021-02-24 Vladimir N. Makarov <vmakarov@redhat.com>
30616 PR inline-asm/99123
30617 * lra-constraints.c (uses_hard_regs_p): Don't use decompose_mem_address.
30619 2021-02-24 Hans-Peter Nilsson <hp@axis.com>
30621 * config/cris/cris.c (cris_expand_prologue): Set
30622 current_function_static_stack_size, if flag_stack_usage_info.
30624 2021-02-24 Pat Haugen <pthaugen@linux.ibm.com>
30626 * config/rs6000/rs6000.c (next_insn_prefixed_p): Rename.
30627 (rs6000_final_prescan_insn): Adjust.
30628 (rs6000_asm_output_opcode): Likewise.
30630 2021-02-24 Martin Sebor <msebor@redhat.com>
30632 PR middle-end/97172
30633 * attribs.c (attr_access::free_lang_data): Clear attribute arg spec
30634 from function arguments.
30636 2021-02-24 Tamar Christina <tamar.christina@arm.com>
30638 PR tree-optimization/99220
30639 * tree-vect-slp.c (optimize_load_redistribution_1): Remove
30640 node from cache when it's about to be deleted.
30642 2021-02-24 Jakub Jelinek <jakub@redhat.com>
30644 PR tree-optimization/99225
30645 * fold-const.c (fold_binary_loc) <case NE_EXPR>: In (x & (1 << y)) != 0
30646 to ((x >> y) & 1) != 0 simplifications use build_one_cst instead of
30647 build_int_cst (..., 1). Formatting fixes.
30649 2021-02-24 Tamar Christina <tamar.christina@arm.com>
30651 PR tree-optimization/99149
30652 * tree-vect-slp-patterns.c (vect_detect_pair_op): Don't recreate the
30654 (vect_slp_reset_pattern): Remove.
30655 (complex_fma_pattern::matches): Remove call to vect_slp_reset_pattern.
30656 (complex_mul_pattern::build, complex_fma_pattern::build,
30657 complex_fms_pattern::build): Fix ref counts.
30658 * tree-vect-slp.c (vect_free_slp_tree): Undo SLP only pattern relevancy
30659 when node is being deleted.
30660 (vect_match_slp_patterns_2): Correct result of cache hit on patterns.
30661 (vect_schedule_slp): Invalidate SLP_TREE_REPRESENTATIVE of removed
30663 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize value.
30665 2021-02-24 Matthias Klose <doko@ubuntu.com>
30668 2020-12-07 Matthias Klose <doko@ubuntu.com>
30670 * genextract.c (print_header): Undefine ENABLE_RTL_CHECKING
30671 and ENABLE_RTL_FLAG_CHECKING.
30673 2021-02-24 Richard Biener <rguenther@suse.de>
30676 * builtins.c (fold_builtin_next_arg): Avoid NULL arg.
30678 2021-02-23 Peter Bergner <bergner@linux.ibm.com>
30680 * config/rs6000/mma.md (mma_assemble_pair): Rename from this...
30681 (vsx_assemble_pair): ...to this.
30682 (*mma_assemble_pair): Rename from this...
30683 (*vsx_assemble_pair): ...to this.
30684 (mma_disassemble_pair): Rename from this...
30685 (vsx_disassemble_pair): ...to this.
30686 (*mma_disassemble_pair): Rename from this...
30687 (*vsx_disassemble_pair): ...to this.
30688 * config/rs6000/rs6000-builtin.def (BU_MMA_V2, BU_MMA_V3,
30689 BU_COMPAT): New macros.
30690 (mma_assemble_pair): Rename from this...
30691 (vsx_assemble_pair): ...to this.
30692 (mma_disassemble_pair): Rename from this...
30693 (vsx_disassemble_pair): ...to this.
30694 (mma_assemble_pair): New compatibility built-in.
30695 (mma_disassemble_pair): Likewise.
30696 * config/rs6000/rs6000-call.c (struct builtin_compatibility): New.
30697 (RS6000_BUILTIN_COMPAT): Define.
30698 (bdesc_compat): New.
30699 (mma_expand_builtin): Use VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
30700 (rs6000_gimple_fold_mma_builtin): Use MMA_BUILTIN_DISASSEMBLE_PAIR
30701 and VSX_BUILTIN_ASSEMBLE_PAIR.
30702 (rs6000_init_builtins): Register compatibility built-ins.
30703 (mma_init_builtins): Use VSX_BUILTIN_ASSEMBLE_PAIR,
30704 VSX_BUILTIN_ASSEMBLE_PAIR_INTERNAL, VSX_BUILTIN_DISASSEMBLE_PAIR and
30705 VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
30706 * doc/extend.texi (__builtin_mma_assemble_pair): Rename from this...
30707 (__builtin_vsx_assemble_pair): ...to this.
30708 (__builtin_mma_disassemble_pair): Rename from this...
30709 (__builtin_vsx_disassemble_pair): ...to this.
30711 2021-02-23 Martin Liska <mliska@suse.cz>
30714 * ipa-icf.c (sem_variable::merge): Do not merge 2 variables
30715 with different alignment. That leads to an invalid red zone
30716 size allocated in runtime.
30718 2021-02-23 Jakub Jelinek <jakub@redhat.com>
30720 PR tree-optimization/99204
30721 * fold-const.c (fold_read_from_constant_string): Check that
30722 tree_fits_uhwi_p (index) rather than just that index is INTEGER_CST.
30724 2021-02-23 Segher Boessenkool <segher@kernel.crashing.org>
30725 Kewen Lin <linkw@gcc.gnu.org>
30727 * config/rs6000/rs6000.md (*rotl<mode>3_insert_3): Renamed to...
30728 (rotl<mode>3_insert_3): ...this.
30729 (plus_ior_xor): New code_iterator.
30730 (define_split for GPR rl*imi): New splitter.
30731 * config/rs6000/vsx.md (vsx_init_v4si): Use gen_rotldi3_insert_3
30732 for integer merging.
30734 2021-02-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30736 * config/aarch64/aarch64-tuning-flags.def (cse_sve_vl_constants):
30738 * config/aarch64/aarch64.md (add<mode>3): Force CONST_POLY_INT immediates
30739 into a register when the above is enabled.
30740 * config/aarch64/aarch64.c (neoversev1_tunings):
30741 AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
30742 (aarch64_rtx_costs): Use AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
30744 2021-02-22 Hans-Peter Nilsson <hp@axis.com>
30746 * config/cris/cris.c (cris_print_operand) <'T'>: Change
30747 valid operand from is now an addi mult-value to shift-value.
30748 * config/cris/cris.md (*addi): Change expression of scaled
30749 operand from mult to ashift.
30750 * config/cris/cris.md (*addi_reload): New insn_and_split.
30752 2021-02-22 John David Anglin <danglin@gcc.gnu.org>
30755 * config/pa/pa.c (TARGET_ASM_CAN_OUTPUT_MI_THUNK): Define as
30756 hook_bool_const_tree_hwi_hwi_const_tree_true.
30757 (pa_asm_output_mi_thunk): Add support for nonzero vcall_offset.
30759 2021-02-22 Andre Vieira <andre.simoesdiasvieira@arm.com>
30761 PR rtl-optimization/98791
30762 * ira-conflicts.c (process_regs_for_copy): Don't create allocno copies
30763 for unordered modes.
30765 2021-02-22 Martin Liska <mliska@suse.cz>
30767 * tree-inline.c (inline_forbidden_p): Set
30768 inline_forbidden_reason.
30770 2021-02-22 Richard Biener <rguenther@suse.de>
30772 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Dump
30775 2021-02-22 Richard Biener <rguenther@suse.de>
30777 PR tree-optimization/99165
30778 * gimple-ssa-store-merging.c (pass_store_merging::process_store):
30779 Accumulate changed to ret.
30781 2021-02-21 Uros Bizjak <ubizjak@gmail.com>
30784 2020-12-09 Uroš Bizjak <ubizjak@gmail.com>
30786 * config/i386/i386.h (REG_ALLOC_ORDER): Remove
30788 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
30791 * config/s390/vector.md (trunctf<DFP_ALL:mode>2_vr): New
30793 (trunctf<DFP_ALL:mode>2): Likewise.
30794 (trunctdtf2_vr): Likewise.
30795 (trunctdtf2): Likewise.
30796 (extend<DFP_ALL:mode>tf2_vr): Likewise.
30797 (extend<DFP_ALL:mode>tf2): Likewise.
30798 (extendtftd2_vr): Likewise.
30799 (extendtftd2): Likewise.
30801 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
30803 * config/s390/vector.md (*fprx2_to_tf): Rename to fprx2_to_tf,
30804 add memory alternative.
30805 (tf_to_fprx2): New pattern.
30807 2021-02-19 Martin Sebor <msebor@redhat.com>
30810 * attribs.c (init_attr_rdwr_indices): Guard vblist use.
30811 (attr_access::free_lang_data): Remove a spurious test.
30813 2021-02-19 Nathan Sidwell <nathan@acm.org>
30815 * doc/invoke.texi (flang-info-module-read): Document.
30817 2021-02-19 Martin Liska <mliska@suse.cz>
30819 PR translation/99167
30820 * params.opt: Fix typo.
30822 2021-02-19 Richard Biener <rguenther@suse.de>
30824 PR middle-end/99122
30825 * tree-inline.c (inline_forbidden_p): Do not inline functions
30826 with VLA arguments or return value.
30828 2021-02-19 Jakub Jelinek <jakub@redhat.com>
30831 * config/arm/arm.md (*stack_protect_combined_set_insn,
30832 *stack_protect_combined_test_insn): If force_const_mem result
30833 is not valid general operand, force its address into the destination
30836 2021-02-19 Jakub Jelinek <jakub@redhat.com>
30839 * tree-cfg.c (gimple_merge_blocks): If bb a starts with eh landing
30840 pad or non-local label, put FORCED_LABELs from bb b after that label
30841 rather than before it.
30843 2021-02-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
30846 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3): Use
30847 expand_vector_broadcast' to emit the vec_duplicate operand.
30849 2021-02-18 Vladimir N. Makarov <vmakarov@redhat.com>
30851 PR rtl-optimization/96264
30852 * lra-remat.c (reg_overlap_for_remat_p): Check also output insn
30855 2021-02-18 H.J. Lu <hjl.tools@gmail.com>
30858 * varasm.c (get_section): Replace SUPPORTS_SHF_GNU_RETAIN with
30859 looking up the retain attribute.
30860 (resolve_unique_section): Likewise.
30861 (get_variable_section): Likewise.
30862 (switch_to_section): Likewise. Warn when a symbol without the
30863 retain attribute and a symbol with the retain attribute are
30864 placed in the section with the same name, instead of the used
30866 * doc/extend.texi: Document the "retain" attribute.
30868 2021-02-18 Nathan Sidwell <nathan@acm.org>
30871 * doc/invoke.texi (flang-info-include-translate): Document header
30874 2021-02-18 Richard Biener <rguenther@suse.de>
30876 PR middle-end/99122
30877 * ipa-fnsummary.c (analyze_function_body): Set
30878 CIF_FUNCTION_NOT_INLINABLE for VLA parameter calls.
30879 * tree-inline.c (insert_init_debug_bind): Pass NULL for
30880 error_mark_node values.
30881 (force_value_to_type): Do not build V_C_Es for WITH_SIZE_EXPR
30883 (setup_one_parameter): Delay force_value_to_type until when
30886 2021-02-18 Hans-Peter Nilsson <hp@axis.com>
30888 PR tree-optimization/99142
30889 * match.pd (clz cmp 0): Gate replacement on single_use of clz result.
30891 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30893 * wide-int-bitmask.h (wide_int_bitmask::wide_int_bitmask (),
30894 wide_int_bitmask::wide_int_bitmask (uint64_t),
30895 wide_int_bitmask::wide_int_bitmask (uint64_t, uint64_t),
30896 wide_int_bitmask::operator ~ () const,
30897 wide_int_bitmask::operator | (wide_int_bitmask) const,
30898 wide_int_bitmask::operator & (wide_int_bitmask) const): Use constexpr
30900 * config/i386/i386.h (PTA_3DNOW, PTA_3DNOW_A, PTA_64BIT, PTA_ABM,
30901 PTA_AES, PTA_AVX, PTA_BMI, PTA_CX16, PTA_F16C, PTA_FMA, PTA_FMA4,
30902 PTA_FSGSBASE, PTA_LWP, PTA_LZCNT, PTA_MMX, PTA_MOVBE, PTA_NO_SAHF,
30903 PTA_PCLMUL, PTA_POPCNT, PTA_PREFETCH_SSE, PTA_RDRND, PTA_SSE, PTA_SSE2,
30904 PTA_SSE3, PTA_SSE4_1, PTA_SSE4_2, PTA_SSE4A, PTA_SSSE3, PTA_TBM,
30905 PTA_XOP, PTA_AVX2, PTA_BMI2, PTA_RTM, PTA_HLE, PTA_PRFCHW, PTA_RDSEED,
30906 PTA_ADX, PTA_FXSR, PTA_XSAVE, PTA_XSAVEOPT, PTA_AVX512F, PTA_AVX512ER,
30907 PTA_AVX512PF, PTA_AVX512CD, PTA_NO_TUNE, PTA_SHA, PTA_PREFETCHWT1,
30908 PTA_CLFLUSHOPT, PTA_XSAVEC, PTA_XSAVES, PTA_AVX512DQ, PTA_AVX512BW,
30909 PTA_AVX512VL, PTA_AVX512IFMA, PTA_AVX512VBMI, PTA_CLWB, PTA_MWAITX,
30910 PTA_CLZERO, PTA_NO_80387, PTA_PKU, PTA_AVX5124VNNIW, PTA_AVX5124FMAPS,
30911 PTA_AVX512VPOPCNTDQ, PTA_SGX, PTA_AVX512VNNI, PTA_GFNI, PTA_VAES,
30912 PTA_AVX512VBMI2, PTA_VPCLMULQDQ, PTA_AVX512BITALG, PTA_RDPID,
30913 PTA_PCONFIG, PTA_WBNOINVD, PTA_AVX512VP2INTERSECT, PTA_PTWRITE,
30914 PTA_AVX512BF16, PTA_WAITPKG, PTA_MOVDIRI, PTA_MOVDIR64B, PTA_ENQCMD,
30915 PTA_CLDEMOTE, PTA_SERIALIZE, PTA_TSXLDTRK, PTA_AMX_TILE, PTA_AMX_INT8,
30916 PTA_AMX_BF16, PTA_UINTR, PTA_HRESET, PTA_KL, PTA_WIDEKL, PTA_AVXVNNI,
30917 PTA_X86_64_BASELINE, PTA_X86_64_V2, PTA_X86_64_V3, PTA_X86_64_V4,
30918 PTA_CORE2, PTA_NEHALEM, PTA_WESTMERE, PTA_SANDYBRIDGE, PTA_IVYBRIDGE,
30919 PTA_HASWELL, PTA_BROADWELL, PTA_SKYLAKE, PTA_SKYLAKE_AVX512,
30920 PTA_CASCADELAKE, PTA_COOPERLAKE, PTA_CANNONLAKE, PTA_ICELAKE_CLIENT,
30921 PTA_ICELAKE_SERVER, PTA_TIGERLAKE, PTA_SAPPHIRERAPIDS, PTA_ALDERLAKE,
30922 PTA_KNL, PTA_BONNELL, PTA_SILVERMONT, PTA_GOLDMONT, PTA_GOLDMONT_PLUS,
30923 PTA_TREMONT, PTA_KNM): Use constexpr instead of const.
30925 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30927 PR middle-end/99109
30928 * gimple-array-bounds.cc (build_zero_elt_array_type): Rename to ...
30929 (build_printable_array_type): ... this. Add nelts argument. For
30930 overaligned eltype, use TYPE_MAIN_VARIANT (eltype) instead. If
30931 nelts, call build_array_type_nelts.
30932 (array_bounds_checker::check_mem_ref): Use build_printable_array_type
30933 instead of build_zero_elt_array_type and build_array_type_nelts.
30935 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30938 * config/i386/i386.c (distance_non_agu_define): Don't call
30939 extract_insn_cached here.
30940 (ix86_lea_outperforms): Save and restore recog_data around call
30941 to distance_non_agu_define and distance_agu_use.
30942 (ix86_ok_to_clobber_flags): Remove.
30943 (ix86_avoid_lea_for_add): Don't call ix86_ok_to_clobber_flags.
30944 (ix86_avoid_lea_for_addr): Likewise. Adjust function comment.
30945 * config/i386/i386.md (*lea<mode>): Change from define_insn_and_split
30946 into define_insn. Move the splitting to define_peephole2 and
30947 check there using peep2_regno_dead_p if FLAGS_REG is dead.
30949 2021-02-17 Julian Brown <julian@codesourcery.com>
30951 * gimplify.c (gimplify_scan_omp_clauses): Handle ATTACH_DETACH
30954 2021-02-17 Xi Ruoyao <xry111@mengyan1223.wang>
30957 * config/mips/mips.c (mips_symbol_insns): Do not use
30958 MSA_SUPPORTED_MODE_P if mode is MAX_MACHINE_MODE.
30960 2021-02-16 Vladimir N. Makarov <vmakarov@redhat.com>
30962 PR inline-asm/98096
30963 * stmt.c (resolve_operand_name_1): Take inout operands into account
30964 for access to labels by names.
30965 * doc/extend.texi: Describe counting operands for accessing labels.
30967 2021-02-16 Richard Biener <rguenther@suse.de>
30969 PR tree-optimization/38474
30970 * tree-ssa-structalias.c (variable_info::address_taken): New.
30971 (new_var_info): Initialize address_taken.
30972 (process_constraint): Set address_taken.
30973 (solve_constraints): Use the new address_taken flag rather
30974 than is_reg_var for sorting variables.
30975 (dump_constraint): Dump the variable number if the name
30978 2021-02-16 Jakub Jelinek <jakub@redhat.com>
30981 * tree-vect-stmts.c (vectorizable_simd_clone_call): For num_calls != 1
30982 multiply by 4096 and for inbranch by 8192.
30983 * config/i386/i386.c (ix86_simd_clone_usable): For TARGET_AVX512F,
30984 return 3, 2 or 1 for mangle letters 'b', 'c' or 'd'.
30986 2021-02-15 Maya Rashish <coypu@sdf.org>
30988 * config/aarch64/aarch64.c (aarch64_init_builtins):
30989 Call SUBTARGET_INIT_BUILTINS.
30991 2021-02-15 Peter Bergner <bergner@linux.ibm.com>
30993 PR rtl-optimization/98872
30994 * init-regs.c (initialize_uninitialized_regs): Skip initialization
30995 if CONST0_RTX is NULL.
30997 2021-02-15 Richard Sandiford <richard.sandiford@arm.com>
30999 PR rtl-optimization/98863
31000 * rtl-ssa/functions.h (function_info::bb_live_out_info): Delete.
31001 (function_info::build_info): Turn into a declaration, moving the
31002 definition to internals.h.
31003 (function_info::bb_walker): Declare.
31004 (function_info::create_reg_use): Likewise.
31005 (function_info::calculate_potential_phi_regs): Take a build_info
31007 (function_info::place_phis, function_info::create_ebbs): Declare.
31008 (function_info::calculate_ebb_live_in_for_debug): Likewise.
31009 (function_info::populate_backedge_phis): Delete.
31010 (function_info::start_block, function_info::end_block): Declare.
31011 (function_info::populate_phi_inputs): Delete.
31012 (function_info::m_potential_phi_regs): Move information to build_info.
31013 * rtl-ssa/internals.h: New file.
31014 (function_info::bb_phi_info): New class.
31015 (function_info::build_info): Moved from functions.h.
31016 Add a constructor and destructor.
31017 (function_info::build_info::ebb_use): Delete.
31018 (function_info::build_info::ebb_def): Likewise.
31019 (function_info::build_info::bb_live_out): Likewise.
31020 (function_info::build_info::tmp_ebb_live_in_for_debug): New variable.
31021 (function_info::build_info::potential_phi_regs): Likewise.
31022 (function_info::build_info::potential_phi_regs_for_debug): Likewise.
31023 (function_info::build_info::ebb_def_regs): Likewise.
31024 (function_info::build_info::bb_phis): Likewise.
31025 (function_info::build_info::bb_mem_live_out): Likewise.
31026 (function_info::build_info::bb_to_rpo): Likewise.
31027 (function_info::build_info::def_stack): Likewise.
31028 (function_info::build_info::old_def_stack_limit): Likewise.
31029 * rtl-ssa/internals.inl (function_info::build_info::record_reg_def):
31030 Remove the regno argument. Push the previous definition onto the
31031 definition stack where necessary.
31032 * rtl-ssa/accesses.cc: Include internals.h.
31033 * rtl-ssa/changes.cc: Likewise.
31034 * rtl-ssa/blocks.cc: Likewise.
31035 (function_info::build_info::build_info): Define.
31036 (function_info::build_info::~build_info): Likewise.
31037 (function_info::bb_walker): New class.
31038 (function_info::bb_walker::bb_walker): Define.
31039 (function_info::add_live_out_use): Convert a logarithmic-complexity
31040 test into a linear one. Allow the same definition to be passed
31042 (function_info::calculate_potential_phi_regs): Moved from
31043 functions.cc. Take a build_info parameter and store the
31044 information there instead.
31045 (function_info::place_phis): New function.
31046 (function_info::add_entry_block_defs): Update call to record_reg_def.
31047 (function_info::calculate_ebb_live_in_for_debug): New function.
31048 (function_info::add_phi_nodes): Use bb_phis to decide which
31049 registers need phi nodes and initialize ebb_def_regs accordingly.
31050 Do not add degenerate phis here.
31051 (function_info::add_artificial_accesses): Use create_reg_use.
31052 Assert that all definitions are listed in the DF LR sets.
31053 Update call to record_reg_def.
31054 (function_info::record_block_live_out): Record live-out register
31055 values in the phis of successor blocks. Use the live-out set
31056 when processing the last block in an EBB, instead of always
31057 using the live-in sets of successor blocks. AND the live sets
31058 with the set of registers that have been defined in the EBB,
31059 rather than with all potential phi registers. Cope correctly
31060 with branches back to the start of the current EBB.
31061 (function_info::start_block): New function.
31062 (function_info::end_block): Likewise.
31063 (function_info::populate_phi_inputs): Likewise.
31064 (function_info::create_ebbs): Likewise.
31065 (function_info::process_all_blocks): Rewrite into a multi-phase
31067 * rtl-ssa/functions.cc: Include internals.h.
31068 (function_info::calculate_potential_phi_regs): Move to blocks.cc.
31069 (function_info::init_function_data): Remove caller.
31070 * rtl-ssa/insns.cc: Include internals.h
31071 (function_info::create_reg_use): New function. Lazily any
31072 degenerate phis needed by the linear RPO view.
31073 (function_info::record_use): Use create_reg_use. When processing
31074 debug uses, use potential_phi_regs and test it before checking
31075 whether the register is live on entry to the current EBB. Lazily
31076 calculate ebb_live_in_for_debug.
31077 (function_info::record_call_clobbers): Update call to record_reg_def.
31078 (function_info::record_def): Likewise.
31080 2021-02-15 Martin Liska <mliska@suse.cz>
31082 * toplev.c (init_asm_output): Free output of
31083 gen_command_line_string function.
31084 (process_options): Likewise.
31086 2021-02-15 Martin Liska <mliska@suse.cz>
31088 * params.opt: Add 2 missing Param keywords.
31090 2021-02-15 Eric Botcazou <ebotcazou@adacore.com>
31092 * df-core.c (df_worklist_dataflow_doublequeue): Use proper cast.
31094 2021-02-15 Jakub Jelinek <jakub@redhat.com>
31096 PR tree-optimization/99079
31097 * match.pd (A % (pow2pcst << N) -> A & ((pow2pcst << N) - 1)): Remove
31098 useless tree_nop_conversion_p (type, TREE_TYPE (@3)) check. Instead
31099 require both type and TREE_TYPE (@1) to be integral types and either
31100 type having smaller or equal precision, or TREE_TYPE (@1) being
31101 unsigned type, or type being signed type. If TREE_TYPE (@1)
31102 doesn't have wrapping overflow, perform the subtraction of one in
31105 2021-02-14 Jan Hubicka <hubicka@ucw.cz>
31106 Richard Biener <rguether@suse.de>
31109 * ipa-reference.c (ipa_init): Only conditinally initialize
31110 reference_vars_to_consider.
31111 (propagate): Conditionally deninitialize reference_vars_to_consider.
31112 (ipa_reference_write_optimization_summary): Sanity check that
31113 reference_vars_to_consider is not allocated.
31115 2021-02-13 Levy Hsu <admin@levyhsu.com>
31118 * config/riscv/riscv-shorten-memrefs.c (pass_shorten_memrefs): Add
31119 extend parameter to get_si_mem_base_reg declaration.
31120 (get_si_mem_base_reg): Add extend parameter. Set it.
31121 (analyze): Pass extend arg to get_si_mem_base_reg.
31122 (transform): Likewise. Use it when rewriting mems.
31123 * config/riscv/riscv.c (riscv_legitimize_move): Check for subword
31124 loads and emit sign/zero extending load followed by subreg move.
31126 2021-02-13 Jim Wilson <jimw@sifive.com>
31129 * config/riscv/riscv.c (riscv_compressed_lw_address_p): Drop early
31130 exit when !reload_completed. Only perform check for compressed reg
31131 if reload_completed.
31132 (riscv_rtx_costs): In MEM case, when optimizing for size and
31133 shorten memrefs, if not compressible, then increase cost.
31135 2021-02-13 Jakub Jelinek <jakub@redhat.com>
31137 PR rtl-optimization/98439
31138 * recog.c (pass_split_before_regstack::gate): Enable even when
31139 pass_split_before_sched2 is enabled if -fselective-scheduling2 is
31142 2021-02-13 Jakub Jelinek <jakub@redhat.com>
31145 * config/i386/mmx.md (*mmx_pshufd_1): Add a combine splitter for
31146 swap of V2SImode elements in memory into DImode memory rotate by 32.
31148 2021-02-12 Martin Sebor <msebor@redhat.com>
31150 * tree-pretty-print.c (print_generic_expr_to_str): Update comment.
31152 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
31154 * rtl-ssa/accesses.cc (function_info::make_use_available): Use
31155 m_temp_obstack rather than m_obstack to allocate the temporary use.
31157 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
31159 * df-problems.c (df_lr_bb_local_compute): Treat partial definitions
31160 as read-modify operations.
31162 2021-02-12 Richard Biener <rguenther@suse.de>
31164 PR middle-end/38474
31165 * ipa-fnsummary.c (unmodified_parm_1): Only walk when
31166 fbi->aa_walk_budget is bigger than zero. Update
31167 fbi->aa_walk_budget.
31168 (param_change_prob): Likewise.
31169 * ipa-prop.c (detect_type_change_from_memory_writes):
31170 Properly account walk_aliased_vdefs.
31171 (parm_preserved_before_stmt_p): Canonicalize updates.
31172 (parm_ref_data_preserved_p): Likewise.
31173 (parm_ref_data_pass_through_p): Likewise.
31174 (determine_known_aggregate_parts): Account own alias queries.
31176 2021-02-12 Martin Liska <mliska@suse.cz>
31178 * opts-common.c (decode_cmdline_option): Release werror_arg.
31179 * opts.c (gen_producer_string): Release output of
31180 gen_command_line_string.
31182 2021-02-12 Richard Biener <rguenther@suse.de>
31184 PR tree-optimization/38474
31185 * params.opt (-param=max-store-chains-to-track=): New param.
31186 (-param=max-stores-to-track=): Likewise.
31187 * doc/invoke.texi (max-store-chains-to-track): Document.
31188 (max-stores-to-track): Likewise.
31189 * gimple-ssa-store-merging.c (pass_store_merging::m_n_chains):
31191 (pass_store_merging::m_n_stores): Likewise.
31192 (pass_store_merging::terminate_and_process_chain): Update
31193 m_n_stores and m_n_chains.
31194 (pass_store_merging::process_store): Likewise. Terminate
31195 oldest chains if the number of stores or chains get too large.
31196 (imm_store_chain_info::terminate_and_process_chain): Dump
31199 2021-02-11 Eric Botcazou <ebotcazou@adacore.com>
31201 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
31202 the cold section, emit a nop before the directive if the previous
31203 active instruction can throw.
31205 2021-02-11 Peter Bergner <bergner@linux.ibm.com>
31208 * config/rs6000/predicates.md (mma_assemble_input_operand): Restrict
31209 memory addresses that are legal for quad word accesses.
31211 2021-02-11 Andrea Corallo <andrea.corallo@arm.com>
31214 * config/arm/thumb2.md (*doloop_end_internal): Generate
31215 alternative sequence to handle long range branches.
31217 2021-02-11 Joel Hutton <joel.hutton@arm.com>
31219 PR tree-optimization/98772
31220 * optabs-tree.c (supportable_half_widening_operation): New function
31221 to check for supportable V8QI->V8HI widening patterns.
31222 * optabs-tree.h (supportable_half_widening_operation): New function.
31223 * tree-vect-stmts.c (vect_create_half_widening_stmts): New function
31224 to create promotion stmts for V8QI->V8HI widening patterns.
31225 (vectorizable_conversion): Add case for V8QI->V8HI.
31227 2021-02-11 Richard Biener <rguenther@suse.de>
31229 * sparseset.h (SPARSESET_ELT_BITS): Remove.
31230 (SPARSESET_ELT_TYPE): Use unsigned int.
31231 * fwprop.c: Do not include sparseset.h.
31233 2021-02-10 Jakub Jelinek <jakub@redhat.com>
31236 * varasm.c (declare_weak): For -fsyntax-only, allow even
31237 TREE_ASM_WRITTEN function decls.
31239 2021-02-10 Jakub Jelinek <jakub@redhat.com>
31242 * config/i386/sse.md (fix<fixunssuffix>_truncv2sfv2di2,
31243 <insn>v8qiv8hi2, <insn>v8qiv8si2, <insn>v4qiv4si2, <insn>v4hiv4si2,
31244 <insn>v8qiv8di2, <insn>v4qiv4di2, <insn>v2qiv2di2, <insn>v4hiv4di2,
31245 <insn>v2hiv2di2, <insn>v2siv2di2): Force operands[1] into REG before
31246 calling simplify_gen_subreg on it.
31248 2021-02-10 Martin Liska <mliska@suse.cz>
31250 * config/nvptx/nvptx.c (nvptx_option_override): Use
31251 flag_patchable_function_entry instead of the removed
31252 function_entry_patch_area_size.
31254 2021-02-10 Martin Liska <mliska@suse.cz>
31256 PR tree-optimization/99002
31257 PR tree-optimization/99026
31258 * gimple-if-to-switch.cc (if_chain::is_beneficial): Fix memory
31259 leak when adjacent cases are merged.
31260 * tree-switch-conversion.c (switch_decision_tree::analyze_switch_statement): Use
31262 (make_pass_lower_switch): Remove trailing whitespace.
31263 * tree-switch-conversion.h (release_clusters): New.
31265 2021-02-10 Richard Biener <rguenther@suse.de>
31267 PR rtl-optimization/99054
31268 * cfgrtl.c (rtl-optimization/99054): Return an auto_vec.
31269 (fixup_partitions): Adjust.
31270 (rtl_verify_edges): Likewise.
31272 2021-02-10 Jakub Jelinek <jakub@redhat.com>
31274 PR middle-end/99007
31275 * gimplify.c (gimplify_scan_omp_clauses): For MEM_REF on reductions,
31276 temporarily disable gimplify_ctxp->into_ssa around gimplify_expr
31279 2021-02-10 Richard Biener <rguenther@suse.de>
31282 * ipa-pure-const.c (propagate_malloc): Use an auto_vec<>
31285 2021-02-10 Richard Biener <rguenther@suse.de>
31287 PR tree-optimization/99024
31288 * tree-vect-loop.c (_loop_vec_info::~_loop_vec_info): Only
31289 clear loop->aux if it is associated with the destroyed loop_vinfo.
31291 2021-02-10 Martin Liska <mliska@suse.cz>
31293 PR tree-optimization/99002
31294 * gimple-if-to-switch.cc (find_conditions): Fix memory leak
31297 2021-02-10 Martin Liska <mliska@suse.cz>
31300 * ipa-icf.c (sem_item::add_reference): Fix memory leak when
31301 a reference exists.
31303 2021-02-10 Jakub Jelinek <jakub@redhat.com>
31306 * dwarf2out.c (prune_unused_types_walk): Mark DW_TAG_variable DIEs
31307 at class scope for DWARF5+.
31309 2021-02-09 Eric Botcazou <ebotcazou@adacore.com>
31311 PR rtl-optimization/96015
31312 * reorg.c (skip_consecutive_labels): Minor comment tweaks.
31313 (relax_delay_slots): When deleting a jump to the next active
31314 instruction over a barrier, first delete the barrier if the
31315 jump is the only way to reach the target label.
31317 2021-02-09 Andre Vieira <andre.simoesdiasvieira@arm.com>
31319 * config/aarch64/aarch64-cost-tables.h: Add entries for vect.mul.
31320 * config/aarch64/aarch64.c (aarch64_rtx_mult_cost): Use vect.mul for
31321 vector multiplies and vect.alu for SSRA.
31322 * config/arm/aarch-common-protos.h (struct vector_cost_table): Define
31323 vect.mul cost field.
31324 * config/arm/aarch-cost-tables.h: Add entries for vect.mul.
31325 * config/arm/arm.c: Likewise.
31327 2021-02-09 Richard Biener <rguenther@suse.de>
31329 PR tree-optimization/98863
31330 * tree-ssa-sccvn.h (vn_avail::next_undo): Add.
31331 * tree-ssa-sccvn.c (last_pushed_avail): New global.
31332 (rpo_elim::eliminate_push_avail): Chain pushed avails.
31333 (unwind_state::avail_top): Add.
31334 (do_unwind): Rewrite unwinding of avail entries.
31335 (do_rpo_vn): Initialize last_pushed_avail and
31336 avail_top of the undo state.
31338 2021-02-09 Jakub Jelinek <jakub@redhat.com>
31340 PR middle-end/99004
31341 * calls.c (maybe_warn_rdwr_sizes): Change s0 and s1 type from
31342 const char * to char * and free those pointers after use.
31344 2021-02-09 Richard Biener <rguenther@suse.de>
31346 PR tree-optimization/99017
31347 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Allow
31348 zero vector cost entries.
31350 2021-02-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
31352 PR middle-end/98974
31353 * tree-vect-stmts.c (vectorizable_condition): Remove shadow vec_num
31354 parameter in vectorizable_condition.
31356 2021-02-08 Richard Biener <rguenther@suse.de>
31359 * tree.c (walk_tree_1): Walk VECTOR_CST elements.
31361 2021-02-08 Martin Liska <mliska@suse.cz>
31364 * cfgexpand.c (pass_expand::execute): Parse per-function option
31365 flag_patchable_function_entry and use it.
31366 * common.opt: Remove function_entry_patch_area_size and
31367 function_entry_patch_area_start global variables.
31368 * opts.c (parse_and_check_patch_area): New function.
31369 (common_handle_option): Use it.
31370 * opts.h (parse_and_check_patch_area): New function.
31371 * toplev.c (process_options): Parse and use
31372 function_entry_patch_area_size.
31374 2021-02-08 Martin Sebor <msebor@redhat.com>
31376 * doc/extend.texi (attribute malloc): Correct typos.
31378 2021-02-05 Nathan Sidwell <nathan@acm.org>
31381 * gcc.c (driver::maybe_run_linker): Check for input file
31382 accessibility if not linking.
31384 2021-02-05 Richard Biener <rguenther@suse.de>
31386 PR tree-optimization/98855
31387 * tree-vectorizer.h (add_stmt_cost): New overload.
31388 * tree-vect-slp.c (li_cost_vec_cmp): New.
31389 (vect_bb_slp_scalar_cost): Cost individual loop regions
31390 separately. Account for the scalar instance root stmt.
31392 2021-02-05 Tom de Vries <tdevries@suse.de>
31395 * tree-switch-conversion.c (jump_table_cluster::emit): Add loc
31397 (bit_test_cluster::emit): Reuse location_t for newly created
31399 (switch_decision_tree::try_switch_expansion): Preserve
31401 * tree-switch-conversion.h: Change function signatures.
31403 2021-02-05 Jakub Jelinek <jakub@redhat.com>
31406 * config/i386/i386-options.c (m_NONE, m_ALL): Define.
31407 * config/i386/x86-tune.def (X86_TUNE_BRANCH_PREDICTION_HINTS,
31408 X86_TUNE_PROMOTE_QI_REGS): Use m_NONE instead of 0U.
31409 (X86_TUNE_QIMODE_MATH): Use m_ALL instead of ~0U.
31411 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31413 * config/aarch64/aarch64-simd-builtins.def (get_high): Define builtin.
31414 * config/aarch64/aarch64-simd.md (aarch64_get_high<mode>): Define.
31415 * config/aarch64/arm_neon.h (__GET_HIGH): Delete.
31416 (vget_high_f16): Reimplement using new builtin.
31417 (vget_high_f32): Likewise.
31418 (vget_high_f64): Likewise.
31419 (vget_high_p8): Likewise.
31420 (vget_high_p16): Likewise.
31421 (vget_high_p64): Likewise.
31422 (vget_high_s8): Likewise.
31423 (vget_high_s16): Likewise.
31424 (vget_high_s32): Likewise.
31425 (vget_high_s64): Likewise.
31426 (vget_high_u8): Likewise.
31427 (vget_high_u16): Likewise.
31428 (vget_high_u32): Likewise.
31429 (vget_high_u64): Likewise.
31431 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31433 * config/aarch64/aarch64-simd-builtins.def (get_low): Define builtin.
31434 * config/aarch64/aarch64-simd.md (aarch64_get_low<mode>): Define.
31435 * config/aarch64/arm_neon.h (__GET_LOW): Delete.
31436 (vget_low_f16): Reimplement using new builtin.
31437 (vget_low_f32): Likewise.
31438 (vget_low_f64): Likewise.
31439 (vget_low_p8): Likewise.
31440 (vget_low_p16): Likewise.
31441 (vget_low_p64): Likewise.
31442 (vget_low_s8): Likewise.
31443 (vget_low_s16): Likewise.
31444 (vget_low_s32): Likewise.
31445 (vget_low_s64): Likewise.
31446 (vget_low_u8): Likewise.
31447 (vget_low_u16): Likewise.
31448 (vget_low_u32): Likewise.
31449 (vget_low_u64): Likewise.
31451 2021-02-05 Kito Cheng <kito.cheng@sifive.com>
31453 * gcc.c (print_multilib_info): Check all required argument is provided
31456 2021-02-05 liuhongt <hongtao.liu@intel.com>
31459 * config/i386/i386-expand.c (ix86_expand_sse_cmp): Don't
31460 generate integer mask comparison for 128/256-bits vector when
31461 op_true/op_false is NULL_RTX or CONSTM1_RTX/CONST0_RTX. Also
31462 delete redundant !maskcmp condition.
31463 (ix86_expand_int_vec_cmp): Ditto but no redundant deletion
31465 (ix86_expand_sse_movcc): Delete definition of maskcmp, add the
31466 condition directly to if (maskcmp), add extra check for
31467 cmpmode, it should be MODE_INT.
31468 (ix86_expand_fp_vec_cmp): Pass NULL to ix86_expand_sse_cmp's
31469 parameters op_true/op_false.
31470 (ix86_use_mask_cmp_p): New.
31472 2021-02-05 liuhongt <hongtao.liu@intel.com>
31475 * config/i386/x86-tune.def (X86_TUNE_AVX256_UNALIGNED_LOAD_OPTIMAL):
31476 Remove m_GENERIC from ~list.
31477 (X86_TUNE_AVX256_UNALIGNED_STORE_OPTIMAL): Ditto.
31479 2021-02-04 David Malcolm <dmalcolm@redhat.com>
31482 * diagnostic-show-locus.c (compatible_locations_p): Require
31483 locations in the same macro map to be either both from the
31484 macro definition, or both from the macro arguments.
31486 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
31488 * config/aarch64/aarch64-simd-builtins.def: Add
31489 [su]mull_hi_lane[q] builtin generator macros.
31490 * config/aarch64/aarch64-simd.md
31491 (aarch64_<su>mull_hi_lane<mode>_insn): Define.
31492 (aarch64_<su>mull_hi_lane<mode>): Define.
31493 (aarch64_<su>mull_hi_laneq<mode>_insn): Define.
31494 (aarch64_<su>mull_hi_laneq<mode>): Define.
31495 * config/aarch64/arm_neon.h (vmull_high_lane_s16): Use RTL
31496 builtin instead of inline asm.
31497 (vmull_high_lane_s32): Likewise.
31498 (vmull_high_lane_u16): Likewise.
31499 (vmull_high_lane_u32): Likewise.
31500 (vmull_high_laneq_s16): Likewise.
31501 (vmull_high_laneq_s32): Likewise.
31502 (vmull_high_laneq_u16): Likewise.
31503 (vmull_high_laneq_u32): Liekwise.
31505 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
31507 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_hi_n
31508 builtin generator macros.
31509 * config/aarch64/aarch64-simd.md
31510 (aarch64_<su>mull_hi_n<mode>_insn): Define.
31511 (aarch64_<su>mull_hi_n<mode>): Define.
31512 * config/aarch64/arm_neon.h (vmull_high_n_s16): Use RTL builtin
31513 instead of inline asm.
31514 (vmull_high_n_s32): Likewise.
31515 (vmull_high_n_u16): Likewise.
31516 (vmull_high_n_u32): Likewise.
31518 2021-02-04 Richard Biener <rguenther@suse.de>
31520 PR tree-optimization/98855
31521 * tree-vect-loop.c (vectorizable_phi): Do not cost
31522 single-argument PHIs.
31523 * tree-vect-slp.c (vect_bb_slp_scalar_cost): Likewise.
31524 * tree-vect-stmts.c (vectorizable_bswap): Also perform
31525 costing for SLP operation.
31527 2021-02-04 Martin Liska <mliska@suse.cz>
31529 * doc/extend.texi: Mention -mprefer-vector-width in target
31532 2021-02-03 Martin Sebor <msebor@redhat.com>
31534 PR tree-optimization/98937
31535 * tree-ssa-strlen.c (strlen_dom_walker::~strlen_dom_walker): Define.
31536 Flush pointer_query cache.
31538 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
31540 * config/rs6000/genfusion.pl (gen_2logical): Add missing
31541 fixes based on patch review.
31542 * config/rs6000/fusion.md: Regenerate file.
31544 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
31546 * config/rs6000/t-rs6000: Comment out auto generation of
31549 2021-02-03 Andrew Stubbs <ams@codesourcery.com>
31551 * config/gcn/gcn-opts.h (enum processor_type): Add PROCESSOR_GFX908.
31552 * config/gcn/gcn.c (gcn_omp_device_kind_arch_isa): Add gfx908.
31553 (output_file_start): Add gfx908.
31554 * config/gcn/gcn.opt (gpu_type): Add gfx908.
31555 * config/gcn/t-gcn-hsa (MULTILIB_OPTIONS): Add march=gfx908.
31556 (MULTILIB_DIRNAMES): Add gfx908.
31557 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): New define.
31558 (main): Recognize gfx908.
31559 * config/gcn/t-omp-device: Add gfx908.
31561 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31563 * config/aarch64/aarch64-simd-builtins.def: Add
31564 [su]mlsl_hi_lane[q] builtin macro generators.
31565 * config/aarch64/aarch64-simd.md
31566 (aarch64_<su>mlsl_hi_lane<mode>_insn): Define.
31567 (aarch64_<su>mlsl_hi_lane<mode>): Define.
31568 (aarch64_<su>mlsl_hi_laneq<mode>_insn): Define.
31569 (aarch64_<su>mlsl_hi_laneq<mode>): Define.
31570 * config/aarch64/arm_neon.h (vmlsl_high_lane_s16): Use RTL
31571 builtin instead of inline asm.
31572 (vmlsl_high_lane_s32): Likewise.
31573 (vmlsl_high_lane_u16): Likewise.
31574 (vmlsl_high_lane_u32): Likewise.
31575 (vmlsl_high_laneq_s16): Likewise.
31576 (vmlsl_high_laneq_s32): Likewise.
31577 (vmlsl_high_laneq_u16): Likewise.
31578 (vmlsl_high_laneq_u32): Likewise.
31579 (vmlal_high_laneq_u32): Likewise.
31581 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31583 * config/aarch64/aarch64-simd-builtins.def: Add
31584 [su]mlal_hi_lane[q] builtin generator macros.
31585 * config/aarch64/aarch64-simd.md
31586 (aarch64_<su>mlal_hi_lane<mode>_insn): Define.
31587 (aarch64_<su>mlal_hi_lane<mode>): Define.
31588 (aarch64_<su>mlal_hi_laneq<mode>_insn): Define.
31589 (aarch64_<su>mlal_hi_laneq<mode>): Define.
31590 * config/aarch64/arm_neon.h (vmlal_high_lane_s16): Use RTL
31591 builtin instead of inline asm.
31592 (vmlal_high_lane_s32): Likewise.
31593 (vmlal_high_lane_u16): Likewise.
31594 (vmlal_high_lane_u32): Likewise.
31595 (vmlal_high_laneq_s16): Likewise.
31596 (vmlal_high_laneq_s32): Likewise.
31597 (vmlal_high_laneq_u16): Likewise.
31598 (vmlal_high_laneq_u32): Likewise.
31600 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31602 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_hi_n
31603 builtin generator macros.
31604 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_hi_n<mode>_insn):
31606 (aarch64_<su>mlsl_hi_n<mode>): Define.
31607 * config/aarch64/arm_neon.h (vmlsl_high_n_s16): Use RTL builtin
31608 instead of inline asm.
31609 (vmlsl_high_n_s32): Likewise.
31610 (vmlsl_high_n_u16): Likewise.
31611 (vmlsl_high_n_u32): Likewise.
31613 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31615 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_hi_n
31616 builtin generator macros.
31617 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_hi_n<mode>_insn):
31619 (aarch64_<su>mlal_hi_n<mode>): Define.
31620 * config/aarch64/arm_neon.h (vmlal_high_n_s16): Use RTL builtin
31621 instead of inline asm.
31622 (vmlal_high_n_s32): Likewise.
31623 (vmlal_high_n_u16): Likewise.
31624 (vmlal_high_n_u32): Likewise.
31626 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31628 * config/aarch64/aarch64-simd-builtins.def: Add RTL builtin
31630 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal_hi<mode>):
31632 (aarch64_<su>mlal_hi<mode>_insn): This.
31633 (aarch64_<su>mlal_hi<mode>): Define.
31634 * config/aarch64/arm_neon.h (vmlal_high_s8): Use RTL builtin
31635 instead of inline asm.
31636 (vmlal_high_s16): Likewise.
31637 (vmlal_high_s32): Likewise.
31638 (vmlal_high_u8): Likewise.
31639 (vmlal_high_u16): Likewise.
31640 (vmlal_high_u32): Likewise.
31642 2021-02-03 Ilya Leoshkevich <iii@linux.ibm.com>
31644 * lra-spills.c (remove_pseudos): Call lra_update_insn_recog_data()
31645 after calling alter_subreg() on a (mem).
31647 2021-02-03 Martin Liska <mliska@suse.cz>
31650 * lto-streamer-out.c (produce_lto_section): Fill up missing
31652 * lto-streamer.h (struct lto_section): Add _padding field.
31654 2021-02-03 Richard Biener <rguenther@suse.de>
31656 * lto-streamer.c (lto_get_section_name): Free temporary
31658 * tree-loop-distribution.c
31659 (loop_distribution::merge_dep_scc_partitions): Free edge data.
31661 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31663 PR middle-end/97487
31664 * ifcvt.c (noce_can_force_operand): New function.
31665 (noce_emit_move_insn): Use it.
31666 (noce_try_sign_mask): Likewise. Formatting fix.
31668 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31670 PR middle-end/97971
31671 * lra-constraints.c (process_alt_operands): For inline asm, don't call
31672 fatal_insn, but instead return false.
31674 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31676 PR tree-optimization/98287
31677 * config/i386/mmx.md (<insn><mode>3): For shifts don't enable expander
31680 2021-02-03 Tamar Christina <tamar.christina@arm.com>
31682 PR tree-optimization/98928
31683 * tree-vect-loop.c (vect_analyze_loop_2): Change
31684 STMT_VINFO_SLP_VECT_ONLY to STMT_VINFO_SLP_VECT_ONLY_PATTERN.
31685 * tree-vect-slp-patterns.c (complex_pattern::build): Likewise.
31686 * tree-vectorizer.h (STMT_VINFO_SLP_VECT_ONLY_PATTERN): New.
31687 (class _stmt_vec_info): Add slp_vect_pattern_only_p.
31689 2021-02-02 Richard Biener <rguenther@suse.de>
31691 * gimple-loop-interchange.cc (prepare_data_references):
31693 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
31694 * tree-ssa-loop-im.c (hoist_memory_references): Likewise.
31695 * tree-vect-stmts.c (vectorizable_condition): Do not
31697 (vectorizable_comparison): Likewise.
31699 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31701 * config/aarch64/aarch64-simd-builtins.def (ursqrte): Define builtin.
31702 * config/aarch64/aarch64-simd.md (aarch64_ursqrte<mode>): New pattern.
31703 * config/aarch64/arm_neon.h (vrsqrte_u32): Reimplement using builtin.
31704 (vrsqrteq_u32): Likewise.
31706 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31708 * config/aarch64/aarch64-simd-builtins.def (sqxtun2): Define builtin.
31709 * config/aarch64/aarch64-simd.md (aarch64_sqxtun2<mode>_le): Define.
31710 (aarch64_sqxtun2<mode>_be): Likewise.
31711 (aarch64_sqxtun2<mode>): Likewise.
31712 * config/aarch64/arm_neon.h (vqmovun_high_s16): Reimplement using builtin.
31713 (vqmovun_high_s32): Likewise.
31714 (vqmovun_high_s64): Likewise.
31715 * config/aarch64/iterators.md (UNSPEC_SQXTUN2): Define.
31717 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31719 * config/aarch64/aarch64-simd-builtins.def (bfdot_lane, bfdot_laneq): Use
31721 (bfmlalb_lane, bfmlalt_lane, bfmlalb_lane_q, bfmlalt_lane_q): Use FP flags.
31723 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31725 * config/aarch64/aarch64-simd-builtins.def (fcmla_lane0, fcmla_lane90,
31726 fcmla_lane180, fcmla_lane270, fcmlaq_lane0, fcmlaq_lane90, fcmlaq_lane180,
31727 fcmlaq_lane270, scvtf, ucvtf, fcvtzs, fcvtzu, scvtfsi, scvtfdi, ucvtfsi,
31728 ucvtfdi, fcvtzshf, fcvtzuhf, fmlal_lane_low, fmlsl_lane_low,
31729 fmlal_laneq_low, fmlsl_laneq_low, fmlalq_lane_low, fmlslq_lane_low,
31730 fmlalq_laneq_low, fmlslq_laneq_low, fmlal_lane_high, fmlsl_lane_high,
31731 fmlal_laneq_high, fmlsl_laneq_high, fmlalq_lane_high, fmlslq_lane_high,
31732 fmlalq_laneq_high, fmlslq_laneq_high): Use FP flags.
31734 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31736 * config/aarch64/aarch64-builtins.c (FLAG_LOAD): Define.
31737 * config/aarch64/aarch64-simd-builtins.def (ld1x2, ld2, ld3, ld4, ld2r,
31738 ld3r, ld4r, ld1, ld1x3, ld1x4): Use LOAD flags.
31740 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31742 * config/aarch64/aarch64-simd-builtins.def (combine, zip1, zip2,
31743 uzp1, uzp2, trn1, trn2, simd_bsl): Use AUTO_FP flags.
31745 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31747 * config/aarch64/aarch64-simd-builtins.def (clrsb, clz, ctz, popcount,
31748 vec_smult_lane_, vec_smlal_lane_, vec_smult_laneq_, vec_smlal_laneq_,
31749 vec_umult_lane_, vec_umlal_lane_, vec_umult_laneq_, vec_umlal_laneq_,
31750 ashl, sshl, ushl, srshl, urshl, sdot_lane, udot_lane, sdot_laneq,
31751 udot_laneq, usdot_lane, usdot_laneq, sudot_lane, sudot_laneq, ashr,
31752 ashr_simd, lshr, lshr_simd, srshr_n, urshr_n, ssra_n, usra_n, srsra_n,
31753 ursra_n, sshll_n, ushll_n, sshll2_n, ushll2_n, ssri_n, usri_n, ssli_n,
31754 ssli_n, usli_n, bswap, rbit, simd_bsl, eor3q, rax1q, xarq, bcaxq): Use
31755 NONE builtin flags.
31757 2021-02-02 Jakub Jelinek <jakub@redhat.com>
31759 PR tree-optimization/98848
31760 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Punt if
31761 STMT_VINFO_DEF_TYPE (last_stmt_info) is vect_reduction_def.
31763 2021-02-02 Kito Cheng <kito.cheng@sifive.com>
31766 * expr.c: Check mode before calling store_expr.
31768 2021-02-02 Christophe Lyon <christophe.lyon@linaro.org>
31770 * config/arm/iterators.md (supf): Remove VORNQ_S and VORNQ_U.
31772 * config/arm/mve.md (mve_vornq_s<mode>): New entry for vorn
31773 instruction using expression ior.
31774 (mve_vornq_u<mode>): New expander.
31775 (mve_vornq_f<mode>): Use ior code instead of unspec.
31776 * config/arm/unspecs.md (VORNQ_S, VORNQ_U, VORNQ_F): Remove.
31778 2021-02-02 Alexandre Oliva <oliva@adacore.com>
31780 * tree-nested.c (convert_nonlocal_reference_op): Move
31781 current_function_decl restore after re-gimplification.
31782 (convert_local_reference_op): Likewise.
31784 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31786 * config/aarch64/aarch64-simd-builtins.def (rshrn, rshrn2):
31788 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>_insn_le):
31790 (aarch64_rshrn<mode>_insn_be): Likewise.
31791 (aarch64_rshrn<mode>): Likewise.
31792 (aarch64_rshrn2<mode>_insn_le): Likewise.
31793 (aarch64_rshrn2<mode>_insn_be): Likewise.
31794 (aarch64_rshrn2<mode>): Likewise.
31795 * config/aarch64/aarch64.md (unspec): Add UNSPEC_RSHRN.
31796 * config/aarch64/arm_neon.h (vrshrn_high_n_s16): Reimplement
31798 (vrshrn_high_n_s32): Likewise.
31799 (vrshrn_high_n_s64): Likewise.
31800 (vrshrn_high_n_u16): Likewise.
31801 (vrshrn_high_n_u32): Likewise.
31802 (vrshrn_high_n_u64): Likewise.
31803 (vrshrn_n_s16): Likewise.
31804 (vrshrn_n_s32): Likewise.
31805 (vrshrn_n_s64): Likewise.
31806 (vrshrn_n_u16): Likewise.
31807 (vrshrn_n_u32): Likewise.
31808 (vrshrn_n_u64): Likewise.
31810 2021-02-01 Sergei Trofimovich <siarheit@google.com>
31812 PR tree-optimization/98499
31813 * ipa-modref.c (analyze_ssa_name_flags): treat RVO
31814 conservatively and assume all possible side-effects.
31816 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31818 * config/aarch64/aarch64-simd-builtins.def (vec_unpacks_hi,
31819 vec_unpacku_hi_): Define builtins.
31820 * config/aarch64/arm_neon.h (vmovl_high_s8): Reimplement using
31822 (vmovl_high_s16): Likewise.
31823 (vmovl_high_s32): Likewise.
31824 (vmovl_high_u8): Likewise.
31825 (vmovl_high_u16): Likewise.
31826 (vmovl_high_u32): Likewise.
31828 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31830 * config/aarch64/aarch64-simd-builtins.def (sabdl, uabdl):
31832 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl<mode>): New
31834 * config/aarch64/aarch64.md (unspec): Define UNSPEC_SABDL,
31836 * config/aarch64/arm_neon.h (vabdl_s8): Reimplemet using
31838 (vabdl_s16): Likewise.
31839 (vabdl_s32): Likewise.
31840 (vabdl_u8): Likewise.
31841 (vabdl_u16): Likewise.
31842 (vabdl_u32): Likewise.
31843 * config/aarch64/iterators.md (ABDL): New int iterator.
31844 (sur): Handle UNSPEC_SABDL, UNSPEC_UABDL.
31846 2021-02-01 Martin Sebor <msebor@redhat.com>
31848 * tree.h (BLOCK_VARS): Add comment.
31849 (BLOCK_SUBBLOCKS): Same.
31850 (BLOCK_SUPERCONTEXT): Same.
31851 (BLOCK_ABSTRACT_ORIGIN): Same.
31852 (inlined_function_outer_scope_p): Same.
31854 2021-02-01 Martin Sebor <msebor@redhat.com>
31856 PR middle-end/97172
31857 * attribs.c (attr_access::free_lang_data): Define new function.
31858 * attribs.h (attr_access::free_lang_data): Declare new function.
31860 2021-02-01 Richard Biener <rguenther@suse.de>
31862 * vec.h (auto_vec::auto_vec): Add memory stat parameters
31864 * bitmap.h (auto_bitmap::auto_bitmap): Likewise.
31866 2021-02-01 Tamar Christina <tamar.christina@arm.com>
31868 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>,
31869 aarch64_<su>mlsl<mode>, aarch64_<su>mlsl_n<mode>): Flip mult operands.
31871 2021-02-01 Richard Biener <rguenther@suse.de>
31873 PR rtl-optimization/98863
31874 * config/i386/i386-features.c (convert_scalars_to_vector):
31875 Set DF_RD_PRUNE_DEAD_DEFS.
31877 2021-01-31 Eric Botcazou <ebotcazou@adacore.com>
31879 * system.h (SIZE_MAX): Define if not already defined.
31881 2021-01-30 Aaron Sawdey <acsawdey@linux.ibm.com>
31883 * config/rs6000/genfusion.pl (gen_2logical): New function to
31884 generate patterns for logical-logical fusion.
31885 * config/rs6000/fusion.md: Regenerated patterns.
31886 * config/rs6000/rs6000-cpus.def: Add
31887 OPTION_MASK_P10_FUSION_2LOGICAL.
31888 * config/rs6000/rs6000.c (rs6000_option_override_internal):
31889 Enable logical-logical fusion for p10.
31890 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2logical.
31892 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
31894 * config/rs6000/rs6000.opt: Add periods to new AIX options.
31896 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
31898 * config/rs6000/rs6000.opt (mabi=vec-extabi): New.
31899 (mabi=vec-default): New.
31900 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
31901 __EXTABI__ for AIX Vector extended ABI.
31902 * config/rs6000/rs6000.c (rs6000_debug_reg_global): Print AIX Vector
31904 (conditional_register_usage): If AIX vec_extabi enabled, vs20-vs31
31906 * doc/invoke.texi (PowerPC mabi): Add AIX vec-extabi and vec-default.
31908 2021-01-30 Jakub Jelinek <jakub@redhat.com>
31910 * config/i386/i386-features.c (remove_partial_avx_dependency): Clear
31911 DF_DEFER_INSN_RESCAN after calling df_process_deferred_rescans.
31913 2021-01-29 Vladimir N. Makarov <vmakarov@redhat.com>
31916 * lra-constraints.c (in_class_p): Don't narrow class only for REG
31919 2021-01-29 Will Schmidt <will_schmidt@vnet.ibm.com>
31921 * config/rs6000/rs6000-call.c (rs6000_expand_binup_builtin): Add
31922 clauses for CODE_FOR_vsx_xvcvuxddp_scale and
31923 CODE_FOR_vsx_xvcvsxddp_scale to the parameter checking code.
31925 2021-01-29 Andrew MacLeod <amacleod@redhat.com>
31927 PR tree-optimization/98866
31928 * gimple-range-gori.h (gori_compute:set_range_invariant): New.
31929 * gimple-range-gori.cc (gori_map::set_range_invariant): New.
31930 (gori_map::m_maybe_invariant): Rename from all_outgoing.
31931 (gori_map::gori_map): Rename all_outgoing to m_maybe_invariant.
31932 (gori_map::is_export_p): Ditto.
31933 (gori_map::calculate_gori): Ditto.
31934 (gori_compute::set_range_invariant): New.
31935 * gimple-range.cc (gimple_ranger::range_of_stmt): Set range
31936 invariant for pointers evaluating to [1, +INF].
31938 2021-01-29 Richard Biener <rguenther@suse.de>
31940 PR rtl-optimization/98863
31941 * config/i386/i386-features.c (remove_partial_avx_dependency):
31942 Do not perform DF analysis.
31943 (pass_data_remove_partial_avx_dependency): Remove
31946 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
31948 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_n
31949 builtin generator macros.
31950 * config/aarch64/aarch64-simd.md (aarch64_<su>mull_n<mode>):
31952 * config/aarch64/arm_neon.h (vmull_n_s16): Use RTL builtin
31953 instead of inline asm.
31954 (vmull_n_s32): Likewise.
31955 (vmull_n_u16): Likewise.
31956 (vmull_n_u32): Likewise.
31958 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31960 * config/aarch64/aarch64-simd-builtins.def (sabdl2, uabdl2):
31962 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl2<mode>_3):
31964 (aarch64_<sur>abdl2<mode>): ... This.
31965 (<sur>sadv16qi): Adjust use of above.
31966 * config/aarch64/arm_neon.h (vabdl_high_s8): Reimplement using
31968 (vabdl_high_s16): Likewise.
31969 (vabdl_high_s32): Likewise.
31970 (vabdl_high_u8): Likewise.
31971 (vabdl_high_u16): Likewise.
31972 (vabdl_high_u32): Likewise.
31974 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31976 * config/aarch64/aarch64-simd-builtins.def (sabal2): Define
31978 (uabal2): Likewise.
31979 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal2<mode>): New
31981 * config/aarch64/aarch64.md (unspec): Add UNSPEC_SABAL2 and
31983 * config/aarch64/arm_neon.h (vabal_high_s8): Reimplement using
31985 (vabal_high_s16): Likewise.
31986 (vabal_high_s32): Likewise.
31987 (vabal_high_u8): Likewise.
31988 (vabal_high_u16): Likewise.
31989 (vabal_high_u32): Likewise.
31990 * config/aarch64/iterators.md (ABAL2): New mode iterator.
31991 (sur): Handle UNSPEC_SABAL2, UNSPEC_UABAL2.
31993 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31995 * config/aarch64/aarch64-simd-builtins.def (sabal): Define
31998 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal<mode>_4):
32000 (aarch64_<sur>abal<mode>): ... This
32001 (<sur>sadv16qi): Adust use of the above.
32002 * config/aarch64/arm_neon.h (vabal_s8): Reimplement using
32004 (vabal_s16): Likewise.
32005 (vabal_s32): Likewise.
32006 (vabal_u8): Likewise.
32007 (vabal_u16): Likewise.
32008 (vabal_u32): Likewise.
32010 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32012 * config/aarch64/aarch64-simd-builtins.def (saddlv, uaddlv):
32014 * config/aarch64/aarch64-simd.md (aarch64_<su>addlv<mode>):
32016 * config/aarch64/arm_neon.h (vaddlv_s8): Reimplement using
32018 (vaddlv_s16): Likewise.
32019 (vaddlv_u8): Likewise.
32020 (vaddlv_u16): Likewise.
32021 (vaddlvq_s8): Likewise.
32022 (vaddlvq_s16): Likewise.
32023 (vaddlvq_s32): Likewise.
32024 (vaddlvq_u8): Likewise.
32025 (vaddlvq_u16): Likewise.
32026 (vaddlvq_u32): Likewise.
32027 (vaddlv_s32): Likewise.
32028 (vaddlv_u32): Likewise.
32029 * config/aarch64/iterators.md (VDQV_L): New mode iterator.
32030 (unspec): Add UNSPEC_SADDLV, UNSPEC_UADDLV.
32031 (Vwstype): New mode attribute.
32033 (VWIDE_S): Likewise.
32034 (USADDLV): New int iterator.
32035 (su): Handle UNSPEC_SADDLV, UNSPEC_UADDLV.
32037 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
32039 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_lane[q]
32040 builtin generator macros.
32041 * config/aarch64/aarch64-simd.md (aarch64_vec_<su>mlsl_lane<Qlane>):
32043 * config/aarch64/arm_neon.h (vmlsl_lane_s16): Use RTL builtin
32044 instead of inline asm.
32045 (vmlsl_lane_s32): Likewise.
32046 (vmlsl_lane_u16): Likewise.
32047 (vmlsl_lane_u32): Likewise.
32048 (vmlsl_laneq_s16): Likewise.
32049 (vmlsl_laneq_s32): Likewise.
32050 (vmlsl_laneq_u16): Likewise.
32051 (vmlsl_laneq_u32): Likewise.
32053 2021-01-29 Richard Biener <rguenther@suse.de>
32055 * doc/invoke.texi (--param max-gcse-memory): Document unit
32057 * gcse.c (gcse_or_cprop_is_too_expensive): Adjust.
32058 * params.opt (--param max-gcse-memory): Adjust default and
32059 document unit of size.
32061 2021-01-29 Richard Biener <rguenther@suse.de>
32063 PR rtl-optimization/98863
32064 * gcse.c (gcse_or_cprop_is_too_expensive): Use unsigned
32065 HOST_WIDE_INT for the memory estimate.
32067 2021-01-29 Bin Cheng <bin.cheng@linux.alibaba.com>
32068 Richard Biener <rguenther@suse.de>
32070 PR tree-optimization/97627
32071 * tree-ssa-loop-niter.c (number_of_iterations_exit_assumptions):
32072 Do not analyze fake edges.
32074 2021-01-29 Richard Biener <rguenther@suse.de>
32076 PR rtl-optimization/98144
32077 * df.h (df_mir_bb_info): Add con_visited member.
32078 * df-problems.c (df_mir_alloc): Initialize con_visited,
32079 do not fully populate IN and OUT.
32080 (df_mir_reset): Likewise.
32081 (df_mir_confluence_0): Set con_visited.
32082 (df_mir_confluence_n): Properly handle implicitely
32083 fully populated IN and OUT as designated by con_visited
32084 and update con_visited accordingly.
32086 2021-01-29 Jakub Jelinek <jakub@redhat.com>
32089 * config/arm/vec-common.md (mve_vshlq_<supf><mode>,
32090 vashl<mode>3, vashr<mode>3, vlshr<mode>3): Add
32091 && !TARGET_REALLY_IWMMXT to conditions.
32093 2021-01-29 Jakub Jelinek <jakub@redhat.com>
32096 * cfgbuild.c (find_bb_boundaries): Reset debug_insn when seeing
32099 2021-01-28 Marek Polacek <polacek@redhat.com>
32102 * stor-layout.c (finalize_type_size): If we reset TYPE_USER_ALIGN in
32103 the main variant, maybe reset it in its variants too.
32104 * tree.c (check_base_type): Return true only if TYPE_USER_ALIGN match.
32105 (check_aligned_type): Check if TYPE_USER_ALIGN match.
32107 2021-01-28 Christophe Lyon <christophe.lyon@linaro.org>
32110 * config/arm/arm.c (arm_rtx_costs_internal): Adjust cost of vector
32111 of constant zero for comparisons.
32113 2021-01-28 Michael Meissner <meissner@linux.ibm.com>
32115 * config/rs6000/rs6000.c (rs6000_mangle_decl_assembler_name): Add
32116 support for mapping built-in function names for long double
32117 built-in functions if long double is IEEE 128-bit.
32119 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
32121 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_n
32122 builtin generator macros.
32123 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_n<mode>):
32125 * config/aarch64/arm_neon.h (vmlsl_n_s16): Use RTL builtin
32126 instead of inline asm.
32127 (vmlsl_n_s32): Likewise.
32128 (vmlsl_n_u16): Likewise.
32129 (vmlsl_n_u32): Likewise.
32131 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
32133 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_n
32134 builtin generator macros.
32135 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>):
32137 * config/aarch64/arm_neon.h (vmlal_n_s16): Use RTL builtin
32138 instead of inline asm.
32139 (vmlal_n_s32): Likewise.
32140 (vmlal_n_u16): Likewise.
32141 (vmlal_n_u32): Likewise.
32143 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32145 * config/aarch64/aarch64-simd-builtins.def (shrn2): Define
32147 * config/aarch64/aarch64-simd.md (aarch64_shrn2<mode>_insn_le):
32149 (aarch64_shrn2<mode>_insn_be): Likewise.
32150 (aarch64_shrn2<mode>): Likewise.
32151 * config/aarch64/arm_neon.h (vshrn_high_n_s16): Reimlplement
32153 (vshrn_high_n_s32): Likewise.
32154 (vshrn_high_n_s64): Likewise.
32155 (vshrn_high_n_u16): Likewise.
32156 (vshrn_high_n_u32): Likewise.
32157 (vshrn_high_n_u64): Likewise.
32159 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32161 * config/aarch64/aarch64-simd-builtins.def (shrn): Define
32163 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le):
32165 (aarch64_shrn<mode>_insn_be): Likewise.
32166 (aarch64_shrn<mode>): Likewise.
32167 * config/aarch64/arm_neon.h (vshrn_n_s16): Reimplement using
32169 (vshrn_n_s32): Likewise.
32170 (vshrn_n_s64): Likewise.
32171 (vshrn_n_u16): Likewise.
32172 (vshrn_n_u32): Likewise.
32173 (vshrn_n_u64): Likewise.
32174 * config/aarch64/iterators.md (vn_mode): New mode attribute.
32176 2021-01-28 Richard Biener <rguenther@suse.de>
32178 PR rtl-optimization/80960
32179 * dse.c (check_mem_read_rtx): Call get_addr on the
32182 2021-01-28 Xionghu Luo <luoxhu@linux.ibm.com>
32183 David Edelsohn <dje.gcc@gmail.com>
32186 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
32187 Don't generate VIEW_CONVERT_EXPR for fcode ALTIVEC_BUILTIN_VEC_INSERT
32189 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
32191 * config/rs6000/rs6000.c (rs6000_expand_vector_set): Remove the
32192 wrapper call rs6000_expand_vector_set_var for cleanup. Call
32193 rs6000_expand_vector_set_var_p9 and rs6000_expand_vector_set_var_p8
32195 (rs6000_expand_vector_set_var): Delete.
32196 (rs6000_expand_vector_set_var_p9): Make static.
32197 (rs6000_expand_vector_set_var_p8): Make static.
32199 2021-01-28 Xing GUO <higuoxing@gmail.com>
32201 * common/config/riscv/riscv-common.c
32202 (riscv_subset_list::parsing_subset_version): Fix -march option parsing
32203 when `p` extension exists.
32205 2021-01-27 Vladimir N. Makarov <vmakarov@redhat.com>
32207 PR rtl-optimization/97684
32208 * ira.c (ira): Call ira_set_pseudo_classes before
32209 update_equiv_regs when it is necessary.
32211 2021-01-27 Jakub Jelinek <jakub@redhat.com>
32214 * config/aarch64/aarch64.md (*aarch64_bfxilsi_uxtw): Use
32215 %w0, %w1 and %2 instead of %0, %1 and %2.
32217 2021-01-27 Aaron Sawdey <acsawdey@linux.ibm.com>
32219 * config/rs6000/genfusion.pl: New script to generate
32220 define_insn_and_split patterns so combine can arrange fused
32221 instructions next to each other.
32222 * config/rs6000/fusion.md: New file, generated fused instruction
32223 patterns for combine.
32224 * config/rs6000/predicates.md (const_m1_to_1_operand): New predicate.
32225 (non_update_memory_operand): New predicate.
32226 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_P10_FUSION and
32227 OPTION_MASK_P10_FUSION_LD_CMPI to ISA_3_1_MASKS_SERVER and
32229 * config/rs6000/rs6000-protos.h (address_is_non_pfx_d_or_x): Add
32231 * config/rs6000/rs6000.c (rs6000_option_override_internal):
32232 Automatically set OPTION_MASK_P10_FUSION and
32233 OPTION_MASK_P10_FUSION_LD_CMPI if target is power10.
32234 (rs600_opt_masks): Allow -mpower10-fusion
32235 in function attributes.
32236 (address_is_non_pfx_d_or_x): New function.
32237 * config/rs6000/rs6000.h: Add MASK_P10_FUSION.
32238 * config/rs6000/rs6000.md: Include fusion.md.
32239 * config/rs6000/rs6000.opt: Add -mpower10-fusion
32240 and -mpower10-fusion-ld-cmpi.
32241 * config/rs6000/t-rs6000: Add dependencies involving fusion.md.
32243 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
32245 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal
32246 builtin generator macros.
32247 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal<mode>):
32249 (aarch64_<su>mlal<mode>): This.
32250 * config/aarch64/arm_neon.h (vmlal_s8): Use RTL builtin
32251 instead of inline asm.
32252 (vmlal_s16): Likewise.
32253 (vmlal_s32): Likewise.
32254 (vmlal_u8): Likewise.
32255 (vmlal_u16): Likewise.
32256 (vmlal_u32): Likewise.
32258 2021-01-27 Richard Biener <rguenther@suse.de>
32260 PR tree-optimization/98854
32261 * tree-vect-slp.c (vect_build_slp_tree_2): Also build
32262 PHIs from scalars when the number of CTORs matches the
32263 number of children.
32265 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
32267 * config/aarch64/aarch64-simd-builtins.def: Add mls_n builtin
32269 * config/aarch64/aarch64-simd.md (*aarch64_mls_elt_merge<mode>):
32271 (aarch64_mls_n<mode>): This.
32272 * config/aarch64/arm_neon.h (vmls_n_s16): Use RTL builtin
32274 (vmls_n_s32): Likewise.
32275 (vmls_n_u16): Likewise.
32276 (vmls_n_u32): Likewise.
32277 (vmlsq_n_s16): Likewise.
32278 (vmlsq_n_s32): Likewise.
32279 (vmlsq_n_u16): Likewise.
32280 (vmlsq_n_u32): Likewise.
32282 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
32284 * config/aarch64/aarch64-simd-builtins.def: Add mls builtin
32286 * config/aarch64/arm_neon.h (vmls_s8): Use RTL builtin rather
32288 (vmls_s16): Likewise.
32289 (vmls_s32): Likewise.
32290 (vmls_u8): Likewise.
32291 (vmls_u16): Likewise.
32292 (vmls_u32): Likewise.
32293 (vmlsq_s8): Likewise.
32294 (vmlsq_s16): Likewise.
32295 (vmlsq_s32): Likewise.
32296 (vmlsq_u8): Likewise.
32297 (vmlsq_u16): Likewise.
32298 (vmlsq_u32): Likewise.
32300 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
32302 * config/aarch64/aarch64-simd-builtins.def: Add mla_n builtin
32304 * config/aarch64/aarch64-simd.md (*aarch64_mla_elt_merge<mode>):
32306 (aarch64_mla_n<mode>): This.
32307 * config/aarch64/arm_neon.h (vmla_n_s16): Use RTL builtin
32309 (vmla_n_s32): Likewise.
32310 (vmla_n_u16): Likewise.
32311 (vmla_n_u32): Likewise.
32312 (vmlaq_n_s16): Likewise.
32313 (vmlaq_n_s32): Likewise.
32314 (vmlaq_n_u16): Likewise.
32315 (vmlaq_n_u32): Likewise.
32317 2021-01-27 liuhongt <hongtao.liu@intel.com>
32320 * config/i386/sse.md (sse2_gt<mode>3): Drop !TARGET_XOP in condition.
32321 (*sse2_eq<mode>3): Ditto.
32323 2021-01-27 Jakub Jelinek <jakub@redhat.com>
32325 * tree-pass.h (PROP_trees): Rename to ...
32326 (PROP_gimple): ... this.
32327 * cfgexpand.c (pass_data_expand): Replace PROP_trees with PROP_gimple.
32328 * passes.c (execute_function_dump, execute_function_todo,
32329 execute_one_ipa_transform_pass, execute_one_pass): Likewise.
32330 * varpool.c (ctor_for_folding): Likewise.
32332 2021-01-27 Jakub Jelinek <jakub@redhat.com>
32334 PR tree-optimization/97260
32335 * varpool.c: Include tree-pass.h.
32336 (ctor_for_folding): In GENERIC return DECL_INITIAL for TREE_READONLY
32337 non-TREE_SIDE_EFFECTS automatic variables.
32339 2021-01-26 Paul Fee <paul.f.fee@gmail.com>
32341 * doc/cpp.texi (__cplusplus): Document value for -std=c++23
32343 * doc/invoke.texi: Document -std=c++23 and -std=gnu++23.
32344 * dwarf2out.c (highest_c_language): Recognise C++20 and C++23.
32345 (gen_compile_unit_die): Recognise C++23.
32347 2021-01-26 Jakub Jelinek <jakub@redhat.com>
32350 * dwarf2asm.c (dw2_assemble_integer): Cast DWARF2_ADDR_SIZE to int
32353 2021-01-26 Jakub Jelinek <jakub@redhat.com>
32356 * config/aarch64/aarch64.c (aarch64_mask_and_shift_for_ubfiz_p):
32357 Use UINTVAL (shft_amnt) and UINTVAL (mask) instead of INTVAL (shft_amnt)
32358 and INTVAL (mask). Add && INTVAL (mask) > 0 condition.
32360 2021-01-26 Richard Biener <rguenther@suse.de>
32362 * gimple-pretty-print.c (dump_binary_rhs): Handle
32363 VEC_WIDEN_{PLUS,MINUS}_{LO,HI}_EXPR.
32365 2021-01-26 Richard Biener <rguenther@suse.de>
32367 PR middle-end/98726
32368 * tree.h (vector_cst_int_elt): Remove.
32369 * tree.c (vector_cst_int_elt): Use poly_wide_int for computations,
32372 2021-01-26 Andrew Stubbs <ams@codesourcery.com>
32374 * config/gcn/gcn.c (gcn_expand_reduc_scalar): Use move instructions
32375 for V64DFmode min/max reductions.
32377 2021-01-26 Jakub Jelinek <jakub@redhat.com>
32379 * dwarf2asm.c (dw2_assemble_integer): Handle size twice as large
32380 as DWARF2_ADDR_SIZE if x is not a scalar int by emitting it as
32381 two halves, one with x and the other with const0_rtx, ordered
32382 depending on endianity.
32384 2021-01-26 Alexandre Oliva <oliva@adacore.com>
32386 * gimplify.c (gimplify_decl_expr): Skip asan marking calls for
32387 temporaries not seen in binding block, and not about to be
32388 added as gimple variables.
32390 2021-01-25 Martin Sebor <msebor@redhat.com>
32393 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Adjust warning text.
32395 2021-01-25 Martin Liska <mliska@suse.cz>
32397 * value-prof.c (get_nth_most_common_value): Use %s instead
32400 2021-01-25 Jakub Jelinek <jakub@redhat.com>
32403 * configure.ac (HAVE_AS_GDWARF_5_DEBUG_FLAG): Only define if
32404 readelf -wi is able to read the emitted .debug_info back.
32405 * configure: Regenerated.
32407 2021-01-25 Martin Liska <mliska@suse.cz>
32409 PR gcov-profile/98739
32410 * common.opt: Add missing sign symbol.
32411 * value-prof.c (get_nth_most_common_value): Restore handling
32412 of PROFILE_REPRODUCIBILITY_PARALLEL_RUNS and
32413 PROFILE_REPRODUCIBILITY_MULTITHREADED.
32415 2021-01-25 Richard Biener <rguenther@suse.de>
32417 PR middle-end/98807
32418 * tree.c (vector_element_bits): Always use precision of
32419 the element type for boolean vectors.
32421 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
32423 * config/rtems.h (STARTFILE_SPEC): Remove qnolinkcmds.
32424 (ENDFILE_SPEC): Evaluate qnolinkcmds.
32426 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
32428 * config/rtems.h (STARTFILE_SPEC): Remove nostdlib and
32429 nostartfiles handling since this is already done by
32430 LINK_COMMAND_SPEC. Evaluate qnolinkcmds.
32431 (ENDFILE_SPEC): Remove nostdlib and nostartfiles handling since this
32432 is already done by LINK_COMMAND_SPEC.
32433 (LIB_SPECS): Remove nostdlib and nodefaultlibs handling since
32434 this is already done by LINK_COMMAND_SPEC. Remove qnolinkcmds
32437 2021-01-25 Jakub Jelinek <jakub@redhat.com>
32440 * fold-const-call.c (host_size_t_cst_p): Renamed to ...
32441 (size_t_cst_p): ... this. Check and store unsigned HOST_WIDE_INT
32442 value rather than host size_t.
32443 (fold_const_call): Change type of s2 from size_t to
32444 unsigned HOST_WIDE_INT. Use size_t_cst_p instead of
32445 host_size_t_cst_p. For strncmp calls, pass MIN (s2, SIZE_MAX)
32446 instead of s2 as last argument.
32448 2021-01-25 Tamar Christina <tamar.christina@arm.com>
32450 * config/arm/iterators.md (rotsplit1, rotsplit2, conj_op, fcmac1,
32451 VCMLA_OP, VCMUL_OP): New.
32452 * config/arm/mve.md (mve_vcmlaq<mve_rot><mode>): Support vec_dup 0.
32453 * config/arm/neon.md (cmul<conj_op><mode>3): New.
32454 * config/arm/unspecs.md (UNSPEC_VCMLA_CONJ, UNSPEC_VCMLA180_CONJ,
32455 UNSPEC_VCMUL_CONJ): New.
32456 * config/arm/vec-common.md (cmul<conj_op><mode>3, arm_vcmla<rot><mode>,
32457 cml<fcmac1><conj_op><mode>4): New.
32459 2021-01-23 Jakub Jelinek <jakub@redhat.com>
32462 * config/rs6000/mmintrin.h (__m64): Add __may_alias__ attribute.
32464 2021-01-22 Jonathan Wright <jonathan.wright@arm.com>
32466 * config/aarch64/aarch64-simd-builtins.def: Add mla builtin
32468 * config/aarch64/arm_neon.h (vmla_s8): Use RTL builtin rather
32470 (vmla_s16): Likewise.
32471 (vmla_s32): Likewise.
32472 (vmla_u8): Likewise.
32473 (vmla_u16): Likewise.
32474 (vmla_u32): Likewise.
32475 (vmlaq_s8): Likewise.
32476 (vmlaq_s16): Likewise.
32477 (vmlaq_s32): Likewise.
32478 (vmlaq_u8): Likewise.
32479 (vmlaq_u16): Likewise.
32480 (vmlaq_u32): Likewise.
32482 2021-01-22 David Malcolm <dmalcolm@redhat.com>
32484 * doc/invoke.texi (GCC_EXTRA_DIAGNOSTIC_OUTPUT): Add @findex
32487 2021-01-22 Jakub Jelinek <jakub@redhat.com>
32490 * dwarf2out.c (output_file_names): For -gdwarf-5, if there are no
32491 filenames to emit, still emit the required 0 index directory and
32492 filename entries that match DW_AT_comp_dir and DW_AT_name of the
32495 2021-01-22 Marek Polacek <polacek@redhat.com>
32498 * doc/invoke.texi: Update C++ ABI Version 15 description.
32500 2021-01-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32502 PR tree-optimization/98766
32503 * tree-ssa-math-opts.c (convert_mult_to_fma): Use maybe_le when
32504 comparing against type size with param_avoid_fma_max_bits.
32506 2021-01-22 Richard Biener <rguenther@suse.de>
32508 PR middle-end/98793
32509 * tree.c (vector_element_bits): Key single-bit bool vector on
32510 integer mode rather than not vector mode.
32512 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
32515 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
32516 Generate ARRAY_REF(VIEW_CONVERT_EXPR) for P8 and later
32518 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): Update
32519 to call different path for P8 and P9.
32520 (rs6000_expand_vector_set_var_p9): New function.
32521 (rs6000_expand_vector_set_var_p8): New function.
32523 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
32527 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
32528 Ajdust variable index vec_insert from address dereference to
32529 ARRAY_REF(VIEW_CONVERT_EXPR) tree expression.
32530 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
32532 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): New function.
32534 2021-01-22 Martin Liska <mliska@suse.cz>
32536 PR gcov-profile/98739
32537 * profile.c (compute_value_histograms): Drop time profile for
32538 -fprofile-reproducible=multithreaded.
32540 2021-01-22 Nathan Sidwell <nathan@acm.org>
32542 * gcc.c (process_command): Don't check OPT_SPECIAL_input_file
32545 2021-01-22 Richard Biener <rguenther@suse.de>
32547 PR middle-end/98773
32548 * tree-data-ref.c (initalize_matrix_A): Revert previous
32549 change, retaining failing on HOST_WIDE_INT_MIN CHREC_RIGHT.
32551 2021-01-22 Jakub Jelinek <jakub@redhat.com>
32553 PR tree-optimization/90248
32554 * match.pd (X cmp 0.0 ? 1.0 : -1.0 -> copysign(1, +-X),
32555 X cmp 0.0 ? -1.0 : +1.0 -> copysign(1, -+X)): Remove
32557 (X * (X cmp 0.0 ? 1.0 : -1.0) -> +-abs(X),
32558 X * (X cmp 0.0 ? -1.0 : 1.0) -> +-abs(X)): New simplifications.
32560 2021-01-22 Jakub Jelinek <jakub@redhat.com>
32562 PR tree-optimization/98255
32563 * tree-dfa.c (get_ref_base_and_extent): For ARRAY_REFs, sign
32564 extend index - low_bound from sizetype's precision rather than index
32566 (get_addr_base_and_unit_offset_1): Likewise.
32567 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Likewise.
32568 * gimple-fold.c (fold_const_aggregate_ref_1): Likewise.
32570 2021-01-22 Richard Biener <rguenther@suse.de>
32572 PR tree-optimization/98786
32573 * tree-ssa-phiopt.c (factor_out_conditional_conversion): Avoid
32574 adding new uses of abnormals. Verify we deal with a conditional
32577 2021-01-22 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
32580 * optc-save-gen.awk: Add arm_fp16_format to checked_options.
32582 2021-01-22 liuhongt <hongtao.liu@intel.com>
32586 * config/i386/sse.md (VI_128_256): New mode iterator.
32587 (*avx_cmp<mode>3_1, *avx_cmp<mode>3_2, *avx_cmp<mode>3_3,
32588 *avx_cmp<mode>3_4, *avx2_eq<mode>3, *avx2_pcmp<mode>3_1,
32589 *avx2_pcmp<mode>3_2, *avx2_gt<mode>3): New
32590 define_insn_and_split to lower avx512 vector comparison to avx
32591 version when dest is vector.
32592 (*<avx512>_cmp<mode>3,*<avx512>_cmp<mode>3,*<avx512>_ucmp<mode>3):
32593 define_insn_and_split for negating the comparison result.
32594 * config/i386/predicates.md (float_vector_all_ones_operand):
32596 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Use
32597 general NOT operator without UNSPEC_MASKOP.
32599 2021-01-21 Vladimir N. Makarov <vmakarov@redhat.com>
32601 PR rtl-optimization/98777
32602 * lra-int.h (lra_pmode_pseudo): New extern.
32603 * lra.c (lra_pmode_pseudo): New global.
32605 * lra-eliminations.c (eliminate_regs_in_insn): Use it.
32607 2021-01-21 Ilya Leoshkevich <iii@linux.ibm.com>
32609 * fwprop.c (fwprop_propagation::classify_result): Allow
32610 (subreg (mem)) simplifications.
32612 2021-01-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32614 * config/aarch64/aarch64-simd.md (aarch64_sqdml<SBINQOPS:as>l<mode>):
32616 (aarch64_sqdmlal<mode>): ... This...
32617 (aarch64_sqdmlsl<mode>): ... And this.
32618 (aarch64_sqdml<SBINQOPS:as>l_lane<mode>): Split into...
32619 (aarch64_sqdmlal_lane<mode>): ... This...
32620 (aarch64_sqdmlsl_lane<mode>): ... And this.
32621 (aarch64_sqdml<SBINQOPS:as>l_laneq<mode>): Split into...
32622 (aarch64_sqdmlsl_laneq<mode>): ... This...
32623 (aarch64_sqdmlal_laneq<mode>): ... And this.
32624 (aarch64_sqdml<SBINQOPS:as>l_n<mode>): Split into...
32625 (aarch64_sqdmlsl_n<mode>): ... This...
32626 (aarch64_sqdmlal_n<mode>): ... And this.
32627 (aarch64_sqdml<SBINQOPS:as>l2<mode>_internal): Split into...
32628 (aarch64_sqdmlal2<mode>_internal): ... This...
32629 (aarch64_sqdmlsl2<mode>_internal): ... And this.
32631 2021-01-21 Christophe Lyon <christophe.lyon@linaro.org>
32633 * config/arm/arm_mve.h (__arm_vcmpneq_s8): Fix return type.
32635 2021-01-21 Andrea Corallo <andrea.corallo@arm.com>
32638 * doc/sourcebuild.texi (arm_thumb2_no_arm_v8_1_lob): Document.
32640 2021-01-21 liuhongt <hongtao.liu@intel.com>
32642 PR rtl-optimization/98694
32643 * regcprop.c (copy_value): If SRC had been assigned a mode
32644 narrower than the copy, we can't link DEST into the chain even
32645 they have same hard_regno_nregs(i.e. HImode/SImode in i386
32648 2021-01-20 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32650 * config/aarch64/aarch64-simd.md (aarch64_get_lane<mode>):
32651 Convert to define_insn_and_split. Split into simple move when moving
32654 2021-01-20 Segher Boessenkool <segher@kernel.crashing.org>
32656 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_store): Change assert.
32657 Adjust comment. Simplify code.
32659 2021-01-20 Jakub Jelinek <jakub@redhat.com>
32662 * dwarf2out.c (reset_indirect_string): Also reset indirect strings
32663 with DW_FORM_line_strp form.
32664 (prune_unused_types_update_strings): Don't add into debug_str_hash
32665 indirect strings with DW_FORM_line_strp form.
32666 (adjust_name_comp_dir): New function.
32667 (dwarf2out_finish): Call it on CU DIEs after resetting
32668 debug_line_str_hash.
32670 2021-01-20 Vladimir N. Makarov <vmakarov@redhat.com>
32672 PR rtl-optimization/98722
32673 * lra-eliminations.c (eliminate_regs_in_insn): Check that target
32674 has no 3-op add insn to transform insns containing two pluses.
32676 2021-01-20 Richard Biener <rguenther@suse.de>
32678 * hwint.h (add_hwi): New function.
32679 (mul_hwi): Likewise.
32680 * tree-data-ref.c (initialize_matrix_A): Properly translate
32681 tree constants and avoid HOST_WIDE_INT_MIN.
32682 (lambda_matrix_row_add): Avoid undefined integer overflow
32683 and return true on such overflow.
32684 (lambda_matrix_right_hermite): Handle overflow from
32685 lambda_matrix_row_add gracefully. Simplify previous fix.
32686 (analyze_subscript_affine_affine): Likewise.
32688 2021-01-20 Eugene Rozenfeld <erozen@microsoft.com>
32690 PR tree-optimization/96674
32691 * match.pd: New patterns: x < y || y == XXX_MIN --> x <= y - 1
32692 x >= y && y != XXX_MIN --> x > y - 1
32694 2021-01-20 Richard Sandiford <richard.sandiford@arm.com>
32696 PR tree-optimization/98535
32697 * tree-vect-slp.c (duplicate_and_interleave): Use quick_grow_cleared.
32698 If the high and low permutes are the same, remove the high permutes
32699 from the working set and only continue with the low ones.
32701 2021-01-20 Jakub Jelinek <jakub@redhat.com>
32703 PR tree-optimization/98721
32704 * builtins.c (access_ref::inform_access): Don't assume
32705 SSA_NAME_IDENTIFIER must be non-NULL. Print messages about
32706 object whenever allocfn is NULL, rather than only when DECL_P
32707 is true. Use %qE instead of %qD for that. Formatting fixes.
32709 2021-01-20 Richard Biener <rguenther@suse.de>
32711 PR tree-optimization/98758
32712 * tree-data-ref.c (int_divides_p): Use lambda_int arguments.
32713 (lambda_matrix_right_hermite): Avoid undefinedness with
32714 signed integer abs and multiplication.
32715 (analyze_subscript_affine_affine): Use lambda_int.
32717 2021-01-20 David Malcolm <dmalcolm@redhat.com>
32720 * dwarf2out.c (output_line_info): Rename static variable
32721 "generation", moving it out of the function to...
32722 (output_line_info_generation): New.
32723 (init_sections_and_labels): Likewise, renaming the variable to...
32724 (init_sections_and_labels_generation): New.
32725 (dwarf2out_c_finalize): Reset the new variables.
32727 2021-01-19 Martin Sebor <msebor@redhat.com>
32729 PR middle-end/98664
32730 * tree-ssa-live.c (remove_unused_scope_block_p): Keep scopes for
32731 all functions, even if they're not declared artificial or inline.
32732 * tree.c (tree_inlined_location): Use macro expansion location
32733 only if scope traversal fails to expose one.
32735 2021-01-19 Richard Sandiford <richard.sandiford@arm.com>
32737 PR rtl-optimization/92294
32738 * alias.c (compare_base_symbol_refs): Take an extra parameter
32739 and add the distance between two symbols to it. Enshrine in
32740 comments that -1 means "either 0 or 1, but we can't tell
32741 which at compile time".
32742 (memrefs_conflict_p): Update call accordingly.
32743 (rtx_equal_for_memref_p): Likewise. Take the distance between symbols
32746 2021-01-19 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32748 * config/aarch64/aarch64-simd-builtins.def (sqshl, uqshl,
32749 sqrshl, uqrshl, sqadd, uqadd, sqsub, uqsub, suqadd, usqadd, sqmovn,
32750 uqmovn, sqxtn2, uqxtn2, sqabs, sqneg, sqdmlal, sqdmlsl, sqdmlal_lane,
32751 sqdmlsl_lane, sqdmlal_laneq, sqdmlsl_laneq, sqdmlal_n, sqdmlsl_n,
32752 sqdmlal2, sqdmlsl2, sqdmlal2_lane, sqdmlsl2_lane, sqdmlal2_laneq,
32753 sqdmlsl2_laneq, sqdmlal2_n, sqdmlsl2_n, sqdmull, sqdmull_lane,
32754 sqdmull_laneq, sqdmull_n, sqdmull2, sqdmull2_lane, sqdmull2_laneq,
32755 sqdmull2_n, sqdmulh, sqrdmulh, sqdmulh_lane, sqdmulh_laneq,
32756 sqrdmulh_lane, sqrdmulh_laneq, sqshrun_n, sqrshrun_n, sqshrn_n,
32757 uqshrn_n, sqrshrn_n, uqrshrn_n, sqshlu_n, sqshl_n, uqshl_n, sqrdmlah,
32758 sqrdmlsh, sqrdmlah_lane, sqrdmlsh_lane, sqrdmlah_laneq, sqrdmlsh_laneq,
32759 sqmovun): Use NONE flags.
32761 2021-01-19 Richard Biener <rguenther@suse.de>
32764 * ipa-modref.c (analyze_stmt): Only record a summary for a
32767 2021-01-19 Richard Biener <rguenther@suse.de>
32769 PR middle-end/98638
32770 * tree-ssanames.c (fini_ssanames): Zero SSA_NAME_DEF_STMT.
32772 2021-01-19 Daniel Hellstrom <daniel@gaisler.com>
32774 * config/sparc/rtemself.h (TARGET_OS_CPP_BUILTINS): Add
32775 built-in define __FIX_LEON3FT_TN0018.
32777 2021-01-19 Richard Biener <rguenther@suse.de>
32780 * tree-inline.c (tree_function_versioning): Set input_location
32781 to UNKNOWN_LOCATION throughout the function.
32783 2021-01-19 Tobias Burnus <tobias@codesourcery.com>
32786 * omp-low.c (lower_omp_target): Handle nonpointer is_device_ptr.
32788 2021-01-19 Martin Jambor <mjambor@suse.cz>
32791 * ipa-sra.c (ssa_name_only_returned_p): New parameter fun. Check
32792 whether non-call exceptions allow removal of a statement.
32793 (isra_analyze_call): Pass the appropriate function to
32794 ssa_name_only_returned_p.
32796 2021-01-19 Geng Qi <gengqi@linux.alibaba.com>
32798 * config/riscv/arch-canonicalize (longext_sort): New function for
32799 sorting 'multi-letter'.
32800 * config/riscv/multilib-generator: Adjusting the loop of 'alt' in
32801 'alts'. The 'arch' may not be the first of 'alts'.
32802 (_expand_combination): Add underline for the 'ext' without '*'.
32803 This is because, a single-letter extension can always be treated well
32804 with a '_' prefix, but it cannot be separated out if it is appended
32807 2021-01-18 Vladimir N. Makarov <vmakarov@redhat.com>
32810 * ira.c (ira): Skip abnormal critical edge splitting.
32812 2021-01-18 Jakub Jelinek <jakub@redhat.com>
32814 PR tree-optimization/98727
32815 * tree-ssa-math-opts.c (match_arith_overflow): Fix up computation of
32816 second .MUL_OVERFLOW operand for signed multiplication with overflow
32817 checking if the second operand of multiplication is not constant.
32819 2021-01-18 David Edelsohn <dje.gcc@gmail.com>
32821 * doc/invoke.texi (-gdwarf): TPF defaults to version 2 and AIX
32822 defaults to version 4.
32824 2021-01-18 David Malcolm <dmalcolm@redhat.com>
32826 * attribs.h (fndecl_dealloc_argno): New decl.
32827 * builtins.c (call_dealloc_argno): Split out second half of
32829 (fndecl_dealloc_argno): New.
32830 * doc/extend.texi (Common Function Attributes): Document the
32831 interaction between the analyzer and the malloc attribute.
32832 * doc/invoke.texi (Static Analyzer Options): Likewise.
32834 2021-01-17 David Edelsohn <dje.gcc@gmail.com>
32836 * config/rs6000/aix71.h (SUBTARGET_OVERRIDE_OPTIONS): Override
32837 dwarf_version to 4.
32838 * config/rs6000/aix72.h (SUBTARGET_OVERRIDE_OPTIONS): Same.
32840 2021-01-17 Martin Jambor <mjambor@suse.cz>
32843 * cgraph.c (clone_of_p): Check also former_clone_of as we climb
32846 2021-01-17 Mark Wielaard <mark@klomp.org>
32848 * common.opt (gdwarf-): Init(5).
32849 * doc/invoke.texi (-gdwarf): Document default to 5.
32851 2021-01-16 Kwok Cheung Yeung <kcy@codesourcery.com>
32853 * builtin-types.def
32854 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT): Rename
32856 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR):
32857 ...this. Add extra argument.
32858 * gimplify.c (omp_default_clause): Ensure that event handle is
32859 firstprivate in a task region.
32860 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_DETACH.
32861 (gimplify_adjust_omp_clauses): Likewise.
32862 * omp-builtins.def (BUILT_IN_GOMP_TASK): Change function type to
32863 BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR.
32864 * omp-expand.c (expand_task_call): Add GOMP_TASK_FLAG_DETACH to flags
32865 if detach clause specified. Add detach argument when generating
32867 * omp-low.c (scan_sharing_clauses): Setup data environment for detach
32869 (finish_taskreg_scan): Move field for variable containing the event
32870 handle to the front of the struct.
32871 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_DETACH. Fix
32873 * tree-nested.c (convert_nonlocal_omp_clauses): Handle
32874 OMP_CLAUSE_DETACH clause.
32875 (convert_local_omp_clauses): Handle OMP_CLAUSE_DETACH clause.
32876 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_DETACH.
32877 * tree.c (omp_clause_num_ops): Add entry for OMP_CLAUSE_DETACH.
32879 (omp_clause_code_name): Add entry for OMP_CLAUSE_DETACH. Fix
32881 (walk_tree_1): Handle OMP_CLAUSE_DETACH.
32883 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32885 * config/nios2/t-rtems: Reset all MULTILIB_* variables. Shorten
32886 multilib directory names. Use MULTILIB_REQUIRED instead of
32887 MULTILIB_EXCEPTIONS. Add -mhw-mul -mhw-mulx -mhw-div
32888 -mcustom-fpu-cfg=fph2 multilib.
32890 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32892 * config/nios2/nios2.c (NIOS2_FPU_CONFIG_NUM): Adjust value.
32893 (nios2_init_fpu_configs): Provide register values for new
32894 -mcustom-fpu-cfg=fph2 option variant.
32895 * doc/invoke.texi (-mcustom-fpu-cfg=fph2): Document new option
32898 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32900 * config/nios2/nios2.c (nios2_custom_check_insns): Remove
32901 custom instruction warnings.
32903 2021-01-16 Jakub Jelinek <jakub@redhat.com>
32905 PR tree-optimization/96669
32906 * match.pd ((CST << x) & 1 -> x == 0): New simplification.
32908 2021-01-16 Jakub Jelinek <jakub@redhat.com>
32910 PR tree-optimization/96271
32911 * passes.def: Pass false argument to first two pass_cd_dce
32912 instances and true to last instance. Add comment that
32913 last instance rewrites no longer addressed locals.
32914 * tree-ssa-dce.c (pass_cd_dce): Add update_address_taken_p member and
32916 (pass_cd_dce::set_pass_param): New method.
32917 (pass_cd_dce::execute): Return TODO_update_address_taken from
32918 last cd_dce instance.
32920 2021-01-15 Carl Love <cel@us.ibm.com>
32922 * config/rs6000/altivec.h (vec_mulh, vec_div, vec_dive, vec_mod):
32924 * config/rs6000/altivec.md (VIlong): Move define to file vsx.md.
32925 * config/rs6000/rs6000-builtin.def (DIVES_V4SI, DIVES_V2DI,
32926 DIVEU_V4SI, DIVEU_V2DI, DIVS_V4SI, DIVS_V2DI, DIVU_V4SI,
32927 DIVU_V2DI, MODS_V2DI, MODS_V4SI, MODU_V2DI, MODU_V4SI,
32928 MULHS_V2DI, MULHS_V4SI, MULHU_V2DI, MULHU_V4SI, MULLD_V2DI):
32929 Add builtin define.
32930 (MULH, DIVE, MOD): Add new BU_P10_OVERLOAD_2 definitions.
32931 * config/rs6000/rs6000-call.c (VSX_BUILTIN_VEC_DIV,
32932 VSX_BUILTIN_VEC_DIVE, P10_BUILTIN_VEC_MOD, P10_BUILTIN_VEC_MULH):
32933 New overloaded definitions.
32934 (builtin_function_type) [P10V_BUILTIN_DIVEU_V4SI,
32935 P10V_BUILTIN_DIVEU_V2DI, P10V_BUILTIN_DIVU_V4SI,
32936 P10V_BUILTIN_DIVU_V2DI, P10V_BUILTIN_MODU_V2DI,
32937 P10V_BUILTIN_MODU_V4SI, P10V_BUILTIN_MULHU_V2DI,
32938 P10V_BUILTIN_MULHU_V4SI]: Add case
32939 statement for builtins.
32940 * config/rs6000/rs6000.md (bits): Add new attribute sizes V4SI, V2DI.
32941 * config/rs6000/vsx.md (VIlong): Moved from config/rs6000/altivec.md.
32942 (UNSPEC_VDIVES, UNSPEC_VDIVEU): New unspec definitions.
32943 (vsx_mul_v2di): Add if TARGET_POWER10 statement.
32944 (vsx_udiv_v2di): Add if TARGET_POWER10 statement.
32945 (dives_<mode>, diveu_<mode>, div<mode>3, uvdiv<mode>3,
32946 mods_<mode>, modu_<mode>, mulhs_<mode>, mulhu_<mode>, mulv2di3):
32947 Add define_insn, mode is VIlong.
32948 * doc/extend.texi (vec_mulh, vec_mul, vec_div, vec_dive, vec_mod):
32949 Add builtin descriptions.
32951 2021-01-15 Eric Botcazou <ebotcazou@adacore.com>
32953 * final.c (final_start_function_1): Reset force_source_line.
32955 2021-01-15 Jakub Jelinek <jakub@redhat.com>
32957 PR tree-optimization/96669
32958 * match.pd (((1 << A) & 1) != 0 -> A == 0,
32959 ((1 << A) & 1) == 0 -> A != 0): Generalize for 1s replaced by
32960 possibly different power of two constants and to right shift too.
32962 2021-01-15 Jakub Jelinek <jakub@redhat.com>
32964 PR tree-optimization/96681
32965 * match.pd ((x < 0) ^ (y < 0) to (x ^ y) < 0): New simplification.
32966 ((x >= 0) ^ (y >= 0) to (x ^ y) < 0): Likewise.
32967 ((x < 0) ^ (y >= 0) to (x ^ y) >= 0): Likewise.
32968 ((x >= 0) ^ (y < 0) to (x ^ y) >= 0): Likewise.
32970 2021-01-15 Alexandre Oliva <oliva@adacore.com>
32972 * opts.c (gen_command_line_string): Exclude -dumpbase-ext.
32974 2021-01-15 Tamar Christina <tamar.christina@arm.com>
32976 * config/aarch64/aarch64-simd.md (cml<fcmac1><conj_op><mode>4,
32977 cmul<conj_op><mode>3): New.
32978 * config/aarch64/iterators.md (UNSPEC_FCMUL,
32979 UNSPEC_FCMUL180, UNSPEC_FCMLA_CONJ, UNSPEC_FCMLA180_CONJ,
32980 UNSPEC_CMLA_CONJ, UNSPEC_CMLA180_CONJ, UNSPEC_CMUL, UNSPEC_CMUL180,
32981 FCMLA_OP, FCMUL_OP, conj_op, rotsplit1, rotsplit2, fcmac1, sve_rot1,
32982 sve_rot2, SVE2_INT_CMLA_OP, SVE2_INT_CMUL_OP, SVE2_INT_CADD_OP): New.
32983 (rot): Add UNSPEC_FCMUL, UNSPEC_FCMUL180.
32984 (rot_op): Renamed to conj_op.
32985 * config/aarch64/aarch64-sve.md (cml<fcmac1><conj_op><mode>4,
32986 cmul<conj_op><mode>3): New.
32987 * config/aarch64/aarch64-sve2.md (cml<fcmac1><conj_op><mode>4,
32988 cmul<conj_op><mode>3): New.
32990 2021-01-15 David Malcolm <dmalcolm@redhat.com>
32994 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
32995 Escape the tempfile name when constructing the expected output.
32997 2021-01-15 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32999 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlsl_hi<mode>):
33001 (aarch64_<su>mlsl_hi<mode>): ... This.
33002 (aarch64_<su>mlsl_hi<mode>): Define.
33003 (*aarch64_<su>mlsl<mode): Rename to...
33004 (aarch64_<su>mlsl<mode): ... This.
33005 * config/aarch64/aarch64-simd-builtins.def (smlsl, umlsl,
33006 smlsl_hi, umlsl_hi): Define builtins.
33007 * config/aarch64/arm_neon.h (vmlsl_high_s8, vmlsl_high_s16,
33008 vmlsl_high_s32, vmlsl_high_u8, vmlsl_high_u16, vmlsl_high_u32,
33009 vmlsl_s8, vmlsl_s16, vmlsl_s32, vmlsl_u8,
33010 vmlsl_u16, vmlsl_u32): Reimplement with builtins.
33012 2021-01-15 Uroš Bizjak <ubizjak@gmail.com>
33014 * config/i386/i386-c.c (ix86_target_macros):
33015 Use cpp_define_formatted for __SIZEOF_FLOAT80__ definition.
33017 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
33020 * config.gcc (aarch64*-*-*): Add aarch64-cc-fusion.o to extra_objs.
33021 * Makefile.in (RTL_SSA_H): New variable.
33022 * config/aarch64/t-aarch64 (aarch64-cc-fusion.o): New rule.
33023 * config/aarch64/aarch64-protos.h (make_pass_cc_fusion): Declare.
33024 * config/aarch64/aarch64-passes.def: Add pass_cc_fusion after
33026 * config/aarch64/aarch64-cc-fusion.cc: New file.
33028 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
33030 * recog.h (insn_change_watermark::~insn_change_watermark): Avoid
33031 calling cancel_changes for changes that no longer exist.
33033 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
33035 * rtl-ssa/functions.h (function_info::ref_defs): Rename to...
33036 (function_info::reg_defs): ...this.
33037 * rtl-ssa/member-fns.inl (function_info::ref_defs): Rename to...
33038 (function_info::reg_defs): ...this.
33040 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33043 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
33045 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33048 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33051 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
33053 2021-01-15 Richard Biener <rguenther@suse.de>
33055 PR tree-optimization/96376
33056 * tree-vect-stmts.c (get_load_store_type): Disregard alignment
33057 for VMAT_INVARIANT.
33059 2021-01-15 Martin Liska <mliska@suse.cz>
33061 * doc/install.texi: Document that some tests need pytest module.
33062 * doc/sourcebuild.texi: Likewise.
33064 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33067 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
33069 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33071 * config/arm/mve.md (mve_vshrq_n_s<mode>_imm): New entry.
33072 (mve_vshrq_n_u<mode>_imm): Likewise.
33073 * config/arm/neon.md (vashr<mode>3, vlshr<mode>3): Move to ...
33074 * config/arm/vec-common.md: ... here.
33076 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
33078 * config/arm/mve.md (mve_vshlq_<supf><mode>): Move to
33080 * config/arm/neon.md (vashl<mode>3): Delete.
33081 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): New.
33082 (vasl<mode>3): New expander.
33084 2021-01-15 Richard Biener <rguenther@suse.de>
33086 PR tree-optimization/98685
33087 * tree-vect-slp.c (vect_schedule_slp_node): Refactor handling
33088 of vector extern defs.
33090 2021-01-14 David Malcolm <dmalcolm@redhat.com>
33093 * diagnostic.c (diagnostic_kind_text): Break out this array
33095 (diagnostic_build_prefix): ...here.
33096 (fancy_abort): Detect when diagnostic_initialize has not yet been
33097 called and fall back to a minimal implementation of printing the
33098 ICE, rather than segfaulting in internal_error.
33100 2021-01-14 David Malcolm <dmalcolm@redhat.com>
33102 * diagnostic.c (diagnostic_initialize): Eliminate
33103 parseable_fixits_p in favor of initializing extra_output_kind from
33104 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
33105 (convert_column_unit): New function, split out from...
33106 (diagnostic_converted_column): ...this.
33107 (print_parseable_fixits): Add "column_unit" and "tabstop" params.
33108 Use them to call convert_column_unit on the column values.
33109 (diagnostic_report_diagnostic): Eliminate conditional on
33110 parseable_fixits_p in favor of a switch statement on
33111 extra_output_kind, passing the appropriate values to the new
33112 params of print_parseable_fixits.
33113 (selftest::test_print_parseable_fixits_none): Update for new
33114 params of print_parseable_fixits.
33115 (selftest::test_print_parseable_fixits_insert): Likewise.
33116 (selftest::test_print_parseable_fixits_remove): Likewise.
33117 (selftest::test_print_parseable_fixits_replace): Likewise.
33118 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
33120 (selftest::diagnostic_c_tests): Call it.
33121 * diagnostic.h (enum diagnostics_extra_output_kind): New.
33122 (diagnostic_context::parseable_fixits_p): Delete field in favor
33124 (diagnostic_context::extra_output_kind): ...this new field.
33125 * doc/invoke.texi (Environment Variables): Add
33126 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
33127 * opts.c (common_handle_option): Update handling of
33128 OPT_fdiagnostics_parseable_fixits for change to diagnostic_context
33131 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33133 * tree-vect-slp-patterns.c (class complex_operations_pattern,
33134 complex_operations_pattern::matches,
33135 complex_operations_pattern::recognize,
33136 complex_operations_pattern::build): New.
33137 (slp_patterns): Use it.
33139 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33141 * internal-fn.def (COMPLEX_FMS, COMPLEX_FMS_CONJ): New.
33142 * optabs.def (cmls_optab, cmls_conj_optab): New.
33143 * doc/md.texi: Document them.
33144 * tree-vect-slp-patterns.c (class complex_fms_pattern,
33145 complex_fms_pattern::matches, complex_fms_pattern::recognize,
33146 complex_fms_pattern::build): New.
33148 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33150 * internal-fn.def (COMPLEX_FMA, COMPLEX_FMA_CONJ): New.
33151 * optabs.def (cmla_optab, cmla_conj_optab): New.
33152 * doc/md.texi: Document them.
33153 * tree-vect-slp-patterns.c (vect_match_call_p,
33154 class complex_fma_pattern, vect_slp_reset_pattern,
33155 complex_fma_pattern::matches, complex_fma_pattern::recognize,
33156 complex_fma_pattern::build): New.
33158 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33160 * internal-fn.def (COMPLEX_MUL, COMPLEX_MUL_CONJ): New.
33161 * optabs.def (cmul_optab, cmul_conj_optab): New.
33162 * doc/md.texi: Document them.
33163 * tree-vect-slp-patterns.c (vect_match_call_complex_mla,
33164 vect_normalize_conj_loc, is_eq_or_top, vect_validate_multiplication,
33165 vect_build_combine_node, class complex_mul_pattern,
33166 complex_mul_pattern::matches, complex_mul_pattern::recognize,
33167 complex_mul_pattern::build): New.
33169 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33171 * tree-vect-slp.c (optimize_load_redistribution_1): New.
33172 (optimize_load_redistribution, vect_is_slp_load_node): New.
33173 (vect_match_slp_patterns): Use it.
33175 2021-01-14 Tamar Christina <tamar.christina@arm.com>
33177 * tree-vect-slp-patterns.c (complex_add_pattern::build):
33180 2021-01-14 Thomas Schwinge <thomas@codesourcery.com>
33182 * config/gcn/mkoffload.c (main): Create an offload image only in
33183 64-bit configurations.
33185 2021-01-14 H.J. Lu <hjl.tools@gmail.com>
33188 * config/i386/i386-options.c (ix86_option_override_internal):
33189 Issue an error for -fcf-protection with CF_BRANCH when compiling
33190 for 32-bit non-TARGET_CMOV targets.
33192 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
33195 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p):
33196 Remove declaration and initialization of shadow variable "ret".
33197 (ix86_option_override_internal): Remove delcaration of
33198 shadow variable "i". Redeclare shadowed variable to unsigned.
33199 * common/config/i386/i386-common.c (pta_size): Redeclare to unsigned.
33200 * config/i386/i386-builtins.c (get_builtin_code_for_version):
33201 Update for redeclaration.
33202 * config/i386/i386.h (pta_size): Ditto.
33204 2021-01-14 Richard Biener <rguenther@suse.de>
33206 PR tree-optimization/98674
33207 * tree-data-ref.c (base_supports_access_fn_components_p): New.
33208 (initialize_data_dependence_relation): For two bases without
33209 possible access fns resort to type size equality when determining
33210 shape compatibility.
33212 2021-01-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
33215 * config/arm/arm_neon.h: Replace calls to __builtin_vcge* by
33216 <=, >= operators in vcle and vcge intrinsics respectively.
33217 * config/arm/arm_neon_builtins.def: Remove entry for
33220 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
33223 * config/i386/i386-options.c (ix86_function_specific_save):
33224 Remove redundant assignment to opts->x_ix86_branch_cost.
33225 * config/i386/i386.c (ix86_prefetch_sse):
33226 Rename from x86_prefetch_sse. Update all uses.
33227 * config/i386/i386.h: Update for rename.
33228 * config/i386/i386-options.h: Ditto.
33230 2021-01-14 Jakub Jelinek <jakub@redhat.com>
33233 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3,
33234 *sse4_1_zero_extendv4hiv4si2_3, *sse4_1_zero_extendv2siv2di2_3):
33235 Use Bm instead of m for non-avx. Add isa attribute.
33237 2021-01-14 Jakub Jelinek <jakub@redhat.com>
33239 PR tree-optimization/96688
33240 * match.pd (~(X >> Y) -> ~X >> Y): New simplification if
33241 ~X can be simplified.
33243 2021-01-14 Richard Sandiford <richard.sandiford@arm.com>
33245 * tree-vect-stmts.c (vect_model_load_cost): Account for unused
33246 IFN_LOAD_LANES results.
33248 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33250 * config/aarch64/aarch64-simd.md (aarch64_<su>xtl<mode>):
33252 (aarch64_xtn<mode>): Likewise.
33253 * config/aarch64/aarch64-simd-builtins.def (sxtl, uxtl, xtn):
33256 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
33258 (vmovl_s16): Likewise.
33259 (vmovl_s32): Likewise.
33260 (vmovl_u8): Likewise.
33261 (vmovl_u16): Likewise.
33262 (vmovl_u32): Likewise.
33263 (vmovn_s16): Likewise.
33264 (vmovn_s32): Likewise.
33265 (vmovn_s64): Likewise.
33266 (vmovn_u16): Likewise.
33267 (vmovn_u32): Likewise.
33268 (vmovn_u64): Likewise.
33270 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33272 * config/aarch64/aarch64-simd.md (aarch64_<su>qxtn2<mode>_le):
33274 (aarch64_<su>qxtn2<mode>_be): Likewise.
33275 (aarch64_<su>qxtn2<mode>): Likewise.
33276 * config/aarch64/aarch64-simd-builtins.def (sqxtn2, uqxtn2):
33278 * config/aarch64/iterators.md (SAT_TRUNC): Define code_iterator.
33279 (su): Handle ss_truncate and us_truncate.
33280 * config/aarch64/arm_neon.h (vqmovn_high_s16): Reimplement using
33282 (vqmovn_high_s32): Likewise.
33283 (vqmovn_high_s64): Likewise.
33284 (vqmovn_high_u16): Likewise.
33285 (vqmovn_high_u32): Likewise.
33286 (vqmovn_high_u64): Likewise.
33288 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33290 * config/aarch64/aarch64-simd.md (aarch64_xtn2<mode>_le):
33292 (aarch64_xtn2<mode>_be): Likewise.
33293 (aarch64_xtn2<mode>): Likewise.
33294 * config/aarch64/aarch64-simd-builtins.def (xtn2): Define
33296 * config/aarch64/arm_neon.h (vmovn_high_s16): Reimplement using
33298 (vmovn_high_s32): Likewise.
33299 (vmovn_high_s64): Likewise.
33300 (vmovn_high_u16): Likewise.
33301 (vmovn_high_u32): Likewise.
33302 (vmovn_high_u64): Likewise.
33304 2021-01-13 Stafford Horne <shorne@gmail.com>
33306 * config/or1k/or1k.h (ASM_PREFERRED_EH_DATA_FORMAT): New macro.
33308 2021-01-13 Stafford Horne <shorne@gmail.com>
33310 * config/or1k/linux.h (TARGET_ASM_FILE_END): Define macro.
33312 2021-01-13 Stafford Horne <shorne@gmail.com>
33314 * config/or1k/or1k.h (TARGET_CPU_CPP_BUILTINS): Add builtin
33315 define for __or1k_hard_float__.
33317 2021-01-13 Stafford Horne <shorne@gmail.com>
33319 * config/or1k/or1k.h (NO_PROFILE_COUNTERS): Define as 1.
33320 (PROFILE_HOOK): Define to call _mcount.
33321 (FUNCTION_PROFILER): Change from abort to no-op.
33323 2021-01-13 Jakub Jelinek <jakub@redhat.com>
33325 PR tree-optimization/96691
33326 * match.pd ((~X | C) ^ D -> (X | C) ^ (~D ^ C),
33327 (~X & C) ^ D -> (X & C) ^ (D ^ C)): New simplifications if
33328 (~D ^ C) or (D ^ C) can be simplified.
33330 2021-01-13 Richard Biener <rguenther@suse.de>
33332 PR tree-optimization/92645
33333 * match.pd (BIT_FIELD_REF to conversion): Delay canonicalization
33334 until after vector lowering.
33336 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
33338 * config/aarch64/aarch64-sve.md (fnma<mode>4): Extend from SVE_FULL_I
33340 (@aarch64_pred_fnma<mode>, cond_fnma<mode>, *cond_fnma<mode>_2)
33341 (*cond_fnma<mode>_4, *cond_fnma<mode>_any): Likewise.
33343 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
33345 * config/aarch64/aarch64-sve.md (fma<mode>4): Extend from SVE_FULL_I
33347 (@aarch64_pred_fma<mode>, cond_fma<mode>, *cond_fma<mode>_2)
33348 (*cond_fma<mode>_4, *cond_fma<mode>_any): Likewise.
33350 2021-01-13 Richard Biener <rguenther@suse.de>
33352 PR tree-optimization/92645
33353 * tree-vect-slp.c (vect_build_slp_tree_1): Relax supported
33354 BIT_FIELD_REF argument.
33355 (vect_build_slp_tree_2): Record the desired vector type
33356 on the external vector def.
33357 (vectorizable_slp_permutation): Handle required punning
33358 of existing vector defs.
33360 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
33362 * rtl-ssa/accesses.h (def_lookup): Fix order of comparison results.
33364 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
33366 * config/sh/sh.md (movsf_ie): Remove operands[2] test.
33368 2021-01-13 Samuel Thibault <samuel.thibault@ens-lyon.org>
33370 * config.gcc [$target == *-*-gnu*]: Enable
33371 'default_gnu_indirect_function'.
33373 2021-01-13 Jakub Jelinek <jakub@redhat.com>
33376 * optabs.c (expand_vec_perm_const): Don't force v0 and v1 into
33377 registers before calling targetm.vectorize.vec_perm_const, only after
33379 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const): Handle
33380 two argument permutation when one operand is zero vector and only
33381 after that force operands into registers.
33382 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_1): New
33383 define_insn_and_split pattern.
33384 (*avx512bw_zero_extendv32qiv32hi2_1): Likewise.
33385 (*avx512f_zero_extendv16hiv16si2_1): Likewise.
33386 (*avx2_zero_extendv8hiv8si2_1): Likewise.
33387 (*avx512f_zero_extendv8siv8di2_1): Likewise.
33388 (*avx2_zero_extendv4siv4di2_1): Likewise.
33389 * config/mips/mips.c (mips_vectorize_vec_perm_const): Force operands
33391 * config/arm/arm.c (arm_vectorize_vec_perm_const): Likewise.
33392 * config/sparc/sparc.c (sparc_vectorize_vec_perm_const): Likewise.
33393 * config/ia64/ia64.c (ia64_vectorize_vec_perm_const): Likewise.
33394 * config/aarch64/aarch64.c (aarch64_vectorize_vec_perm_const): Likewise.
33395 * config/rs6000/rs6000.c (rs6000_vectorize_vec_perm_const): Likewise.
33396 * config/gcn/gcn.c (gcn_vectorize_vec_perm_const): Likewise. Use std::swap.
33398 2021-01-13 Martin Liska <mliska@suse.cz>
33400 PR tree-optimization/98455
33401 * gimple-if-to-switch.cc (condition_info::record_phi_mapping):
33402 Record also virtual PHIs.
33403 (pass_if_to_switch::execute): Return TODO_cleanup_cfg only
33406 2021-01-13 Jonathan Wakely <jwakely@redhat.com>
33408 * doc/invoke.texi (C++ Modules): Fix typos.
33410 2021-01-13 Richard Biener <rguenther@suse.de>
33412 PR tree-optimization/98640
33413 * tree-ssa-sccvn.c (visit_nary_op): Do not try to
33414 handle plus or minus from a truncated operand to be
33417 2021-01-13 Jakub Jelinek <jakub@redhat.com>
33420 * config/i386/i386.md (*btr<mode>_1, *btr<mode>_2): New
33421 define_insn_and_split patterns.
33422 (splitter after *btr<mode>_2): New splitter.
33424 2021-01-13 Martin Liska <mliska@suse.cz>
33427 * cgraphunit.c (analyze_functions): Remove dead code.
33429 2021-01-13 Qian Jianhua <qianjh@cn.fujitsu.com>
33431 * config/aarch64/aarch64-cost-tables.h (a64fx_extra_costs): New.
33432 * config/aarch64/aarch64.c (a64fx_addrcost_table): New.
33433 (a64fx_regmove_cost, a64fx_vector_cost): New.
33434 (a64fx_tunings): Use the new added cost tables.
33436 2021-01-13 Jakub Jelinek <jakub@redhat.com>
33439 * config/i386/predicates.md (pmovzx_parallel): New predicate.
33440 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): New
33441 define_insn_and_split pattern.
33442 (*sse4_1_zero_extendv4hiv4si2_3): Likewise.
33443 (*sse4_1_zero_extendv2siv2di2_3): Likewise.
33445 2021-01-13 Julian Brown <julian@codesourcery.com>
33447 * config/gcn/gcn.c (gcn_conditional_register_usage): Remove dead code
33448 to fix v0 register.
33450 2021-01-13 Julian Brown <julian@codesourcery.com>
33452 * config/gcn/gcn.c (gcn_md_reorg): Fix case where EXEC reg is live
33455 2021-01-13 Julian Brown <julian@codesourcery.com>
33457 * config/gcn/gcn-valu.md (recip<mode>2<exec>, recip<mode>2): Use unspec
33458 for reciprocal-approximation instructions.
33459 (div<mode>3): Use fused multiply-accumulate operations for reciprocal
33460 refinement and division result.
33461 * config/gcn/gcn.md (UNSPEC_RCP): New unspec constant.
33463 2021-01-13 Julian Brown <julian@codesourcery.com>
33465 * config/gcn/gcn-valu.md (subdf): Rename to...
33468 2021-01-12 Martin Liska <mliska@suse.cz>
33470 * gcov.c (source_info::debug): Fix printf format for 32-bit hosts.
33472 2021-01-12 Andrea Corallo <andrea.corallo@arm.com>
33474 * function-abi.h: Fix typo.
33476 2021-01-12 Christophe Lyon <christophe.lyon@linaro.org>
33480 * config/arm/arm.h (ARM_HAVE_NEON_V8QI_LDST): New macro.
33481 (ARM_HAVE_NEON_V16QI_LDST, ARM_HAVE_NEON_V4HI_LDST): Likewise.
33482 (ARM_HAVE_NEON_V8HI_LDST, ARM_HAVE_NEON_V2SI_LDST): Likewise.
33483 (ARM_HAVE_NEON_V4SI_LDST, ARM_HAVE_NEON_V4HF_LDST): Likewise.
33484 (ARM_HAVE_NEON_V8HF_LDST, ARM_HAVE_NEON_V4BF_LDST): Likewise.
33485 (ARM_HAVE_NEON_V8BF_LDST, ARM_HAVE_NEON_V2SF_LDST): Likewise.
33486 (ARM_HAVE_NEON_V4SF_LDST, ARM_HAVE_NEON_DI_LDST): Likewise.
33487 (ARM_HAVE_NEON_V2DI_LDST): Likewise.
33488 (ARM_HAVE_V8QI_LDST, ARM_HAVE_V16QI_LDST): Likewise.
33489 (ARM_HAVE_V4HI_LDST, ARM_HAVE_V8HI_LDST): Likewise.
33490 (ARM_HAVE_V2SI_LDST, ARM_HAVE_V4SI_LDST, ARM_HAVE_V4HF_LDST): Likewise.
33491 (ARM_HAVE_V8HF_LDST, ARM_HAVE_V4BF_LDST, ARM_HAVE_V8BF_LDST): Likewise.
33492 (ARM_HAVE_V2SF_LDST, ARM_HAVE_V4SF_LDST, ARM_HAVE_DI_LDST): Likewise.
33493 (ARM_HAVE_V2DI_LDST): Likewise.
33494 * config/arm/mve.md (*movmisalign<mode>_mve_store): New pattern.
33495 (*movmisalign<mode>_mve_load): New pattern.
33496 * config/arm/neon.md (movmisalign<mode>): Move to ...
33497 * config/arm/vec-common.md: ... here.
33499 2021-01-12 Vladimir N. Makarov <vmakarov@redhat.com>
33502 * lra-eliminations.c (eliminate_regs_in_insn): Add transformation
33503 of pattern 'plus (plus (hard reg, const), pseudo)'.
33505 2021-01-12 Richard Biener <rguenther@suse.de>
33507 PR tree-optimization/98550
33508 * tree-vect-slp.c (vect_record_max_nunits): Check whether
33509 the group size is a multiple of the vector element count.
33510 (vect_build_slp_tree_1): When we need to fail because
33511 the vector type choosen causes unrolling do so lazily
33512 without affecting matches only at the end to guide group splitting.
33514 2021-01-12 Martin Liska <mliska@suse.cz>
33517 * optc-save-gen.awk: Compare also n_target_save vars with
33520 2021-01-12 Martin Liska <mliska@suse.cz>
33522 * gcov.c (source_info::debug): New.
33523 (print_usage): Add --debug (-D) option.
33524 (process_args): Likewise.
33525 (generate_results): Call src->debug after
33526 accumulate_line_counts.
33527 (read_graph_file): Properly assign id for EXIT_BLOCK.
33528 * profile.c (branch_prob): Dump function body before it is
33531 2021-01-12 Jakub Jelinek <jakub@redhat.com>
33533 PR tree-optimization/98629
33534 * tree-ssa-math-opts.c (arith_overflow_check_p): Don't update use_stmt
33535 unless returning non-zero.
33537 2021-01-12 Jakub Jelinek <jakub@redhat.com>
33539 PR tree-optimization/95731
33540 * tree-ssa-reassoc.c (optimize_range_tests_cmp_bitwise): Also optimize
33541 x < 0 && y < 0 && z < 0 into (x | y | z) < 0 for signed x, y, z.
33542 (optimize_range_tests): Call optimize_range_tests_cmp_bitwise
33543 only after optimize_range_tests_var_bound.
33545 2021-01-12 Jakub Jelinek <jakub@redhat.com>
33547 * configure.ac: Ensure c/Make-lang.in comes first in @all_lang_makefrags@.
33548 * configure: Regenerated.
33550 2021-01-12 liuhongt <hongtao.liu@intel.com>
33553 * config/i386/i386-builtins.h (BUILTIN_DESC_SWAP_OPERANDS):
33555 * config/i386/i386-expand.c (ix86_expand_sse_comi): Delete
33558 2021-01-12 Alexandre Oliva <oliva@adacore.com>
33560 * ssa-iterators.h (end_imm_use_stmt_traverse): Forward
33562 (auto_end_imm_use_stmt_traverse): New struct.
33563 (FOR_EACH_IMM_USE_STMT): Use it.
33564 (BREAK_FROM_IMM_USE_STMT, RETURN_FROM_IMM_USE_STMT): Remove,
33566 * gimple-ssa-strength-reduction.c: ... here, ...
33567 * graphite-scop-detection.c: ... here, ...
33568 * ipa-modref.c, ipa-pure-const.c, ipa-sra.c: ... here, ...
33569 * tree-predcom.c, tree-ssa-ccp.c: ... here, ...
33570 * tree-ssa-dce.c, tree-ssa-dse.c: ... here, ...
33571 * tree-ssa-loop-ivopts.c, tree-ssa-math-opts.c: ... here, ...
33572 * tree-ssa-phiprop.c, tree-ssa.c: ... here, ...
33573 * tree-vect-slp.c: ... and here, ...
33574 * doc/tree-ssa.texi: ... and the example here.
33576 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33578 * config/aarch64/aarch64-sve.md (sdiv_pow2<mode>3): Extend from
33579 SVE_FULL_I to SVE_I. Generate an UNSPEC_PRED_X.
33580 (*sdiv_pow2<mode>3): New pattern.
33581 (@cond_<sve_int_op><mode>): Extend from SVE_FULL_I to SVE_I.
33582 Wrap the ASRD in an UNSPEC_PRED_X.
33583 (*cond_<sve_int_op><mode>_2): Likewise. Replace the UNSPEC_PRED_X
33584 predicate with a constant PTRUE, if it isn't already.
33585 (*cond_<sve_int_op><mode>_z): Replace with...
33586 (*cond_<sve_int_op><mode>_any): ...this new pattern.
33588 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33590 * config/aarch64/aarch64-sve.md (*cond_bic<mode>_2): Extend from
33591 SVE_FULL_I to SVE_I.
33592 (*cond_bic<mode>_any): Likewise.
33594 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33596 * config/aarch64/aarch64-sve.md (<su>mul<mode>3_highpart)
33597 (@aarch64_pred_<MUL_HIGHPART:optab><mode>): Extend from SVE_FULL_I
33600 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33602 * config/aarch64/aarch64-sve.md (<su>abd<mode>_3): Extend from
33603 SVE_FULL_I to SVE_I.
33604 (*aarch64_cond_<su>abd<mode>_2): Likewise.
33605 (*aarch64_cond_<su>abd<mode>_any): Likewise.
33606 (@aarch64_pred_<su>abd<mode>): Likewise. Use UNSPEC_PRED_X
33607 for the max and min but not for the minus.
33608 (*aarch64_cond_<su>abd<mode>_3): New pattern.
33610 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33612 * config/aarch64/iterators.md (SVE_24I): New iterator.
33613 * config/aarch64/aarch64-sve.md (*aarch64_adr<mode>_shift): Extend from
33614 SVE_FULL_SDI to SVE_24I. Use containers rather than elements.
33616 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33618 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_BINARY:optab><mode>)
33619 (*cond_<SVE_INT_BINARY:optab><mode>_2): Extend from SVE_FULL_I
33621 (*cond_<SVE_INT_BINARY:optab><mode>_3): Likewise.
33622 (*cond_<SVE_INT_BINARY:optab><mode>_any): Likewise.
33623 (*cond_<SVE_INT_BINARY:optab><mode>_2_const): Likewise.
33624 (*cond_<SVE_INT_BINARY:optab><mode>_any_const): Likewise.
33626 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33628 * config/aarch64/aarch64-sve.md (<SVE_INT_BINARY_IMM:optab><mode>3)
33629 (@aarch64_pred_<SVE_INT_BINARY_IMM:optab><mode>)
33630 (*post_ra_<SVE_INT_BINARY_IMM:optab><mode>3): Extend from SVE_FULL_I
33633 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33635 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3)
33636 (v<ASHIFT:optab><mode>3, @aarch64_pred_<optab><mode>)
33637 (*post_ra_v<ASHIFT:optab><mode>3): Extend from SVE_FULL_I to SVE_I.
33639 2021-01-11 Martin Liska <mliska@suse.cz>
33642 * symtab-clones.h (clone_info::release): Release
33643 symtab::m_clones with ggc_delete as it's a GGC memory.
33645 2021-01-11 Matthias Klose <doko@ubuntu.com>
33647 * Makefile.in (LINK_PROGRESS): Show the link target.
33649 2021-01-11 Richard Biener <rguenther@suse.de>
33651 PR tree-optimization/91403
33652 * tree-vect-data-refs.c (vect_analyze_group_access_1): Cap
33653 single-element interleaving group size at 4096 elements.
33655 2021-01-11 Richard Biener <rguenther@suse.de>
33657 PR tree-optimization/98526
33658 * tree-vect-loop.c (vect_model_reduction_cost): Remove costing
33659 of the actual reduction op for the regular case.
33660 (vectorizable_reduction): Cost the stmts
33661 vect_transform_reduction produces here.
33663 2021-01-11 Andreas Krebbel <krebbel@linux.ibm.com>
33665 * tree-ssa-forwprop.c (simplify_vector_constructor): For
33666 big-endian, use UNPACK[_FLOAT]_HI.
33668 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33670 * tree-vect-slp-patterns.c (class complex_pattern,
33671 class complex_add_pattern): Add parameters to matches.
33672 (complex_add_pattern::build): Free memory.
33673 (complex_add_pattern::matches): Move validation end of match.
33674 (complex_add_pattern::recognize): Likewise.
33676 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33678 * tree-vect-slp-patterns.c (linear_loads_p): Fix externals.
33680 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33682 * tree-vect-slp-patterns.c (is_linear_load_p): Fix ambiguity.
33684 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33686 PR tree-optimization/95867
33687 * tree-ssa-math-opts.h: New header.
33688 * tree-ssa-math-opts.c: Include tree-ssa-math-opts.h.
33689 (powi_as_mults): No longer static. Use build_one_cst instead of
33690 build_real. Formatting fix.
33691 * tree-ssa-reassoc.c: Include tree-ssa-math-opts.h.
33692 (attempt_builtin_powi): Handle multiplication reassociation without
33693 powi_fndecl using powi_as_mults.
33694 (reassociate_bb): For integral types don't require
33695 -funsafe-math-optimizations to call attempt_builtin_powi.
33697 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33699 PR tree-optimization/95852
33700 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): Change
33701 mul_stmts parameter type to vec<gimple *> &. Before cond_stmt
33702 allow in the bb any of the stmts in that vector, div_stmt and
33703 up to 3 cast stmts.
33704 (arith_cast_equal_p): New function.
33705 (arith_overflow_check_p): Add cast_stmt argument, handle signed
33706 multiply overflow checks.
33707 (match_arith_overflow): Adjust caller. Handle signed multiply
33710 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33712 PR tree-optimization/95852
33713 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): New function.
33714 (uaddsub_overflow_check_p): Renamed to ...
33715 (arith_overflow_check_p): ... this. Handle also multiplication
33716 with overflow check.
33717 (match_uaddsub_overflow): Renamed to ...
33718 (match_arith_overflow): ... this. Add cfg_changed argument. Handle
33719 also multiplication with overflow check. Adjust function comment.
33720 (math_opts_dom_walker::after_dom_children): Adjust callers. Call
33721 match_arith_overflow also for MULT_EXPR.
33723 2021-01-11 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33725 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
33726 __builtin_convertvector.
33727 (vmovl_s16): Likewise.
33728 (vmovl_s32): Likewise.
33729 (vmovl_u8): Likewise.
33730 (vmovl_u16): Likewise.
33731 (vmovl_u32): Likewise.
33732 (vmovn_s16): Likewise.
33733 (vmovn_s32): Likewise.
33734 (vmovn_s64): Likewise.
33735 (vmovn_u16): Likewise.
33736 (vmovn_u32): Likewise.
33737 (vmovn_u64): Likewise.
33739 2021-01-11 Martin Liska <mliska@suse.cz>
33741 * gimple-if-to-switch.cc (struct condition_info): Use auto_var.
33742 (if_chain::is_beneficial): Delete clusters
33743 (find_conditions): Make second argument of conditions_in_bbs a
33744 pointer so that we control over it's lifetime.
33745 (pass_if_to_switch::execute): Delete them.
33747 2021-01-11 Kewen Lin <linkw@linux.ibm.com>
33749 * ira.c (move_unallocated_pseudos): Check other_reg and skip if
33752 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33754 * config/vax/vax.md (cc): Remove mode attribute.
33755 (subst_<cc>, subst_f<cc>): Rename to...
33756 (subst_<mode>, subst_f<VAXccnz:mode>): ... these respectively.
33757 (*cbranch<VAXint:mode>4_<VAXcc:mode>): Update for `cc' removal.
33758 (*cbranch<VAXfp:mode>4_<VAXccnz:mode>): Likewise.
33759 (*branch_<mode>, *branch_<mode>_reversed): Likewise.
33761 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33763 * config/vax/vax.md (subst_f<cc>): Add mode to operands and
33764 `const_double_zero'.
33766 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33768 * config/pdp11/pdp11.md (PDPfp): New mode iterator.
33769 (fcc_cc, fcc_ccnz): Use it. Add mode to `const_double_zero' and
33772 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33774 * genemit.c (gen_exp) <CONST_DOUBLE>: Handle `const_double_zero'
33776 * read-rtl.c (rtx_reader::read_rtx_code): Handle machine mode
33777 with `const_double_zero'.
33778 * doc/rtl.texi (Constant Expression Types): Document it.
33780 2021-01-09 Jakub Jelinek <jakub@redhat.com>
33783 * tree-cfg.c (verify_gimple_assign_binary): Allow lhs of
33784 POINTER_DIFF_EXPR to be any integral type.
33786 2021-01-09 Jakub Jelinek <jakub@redhat.com>
33788 PR rtl-optimization/98603
33789 * function.c (instantiate_virtual_regs_in_insn): For asm goto
33790 with impossible constraints, drop all SETs, CLOBBERs, drop PARALLEL
33791 if any, set ASM_OPERANDS mode to VOIDmode and change
33792 ASM_OPERANDS_OUTPUT_CONSTRAINT and ASM_OPERANDS_OUTPUT_IDX.
33794 2021-01-09 Alexandre Oliva <oliva@gnu.org>
33797 * final.c (notice_source_line): Narrow down the condition to
33798 skip a line-0 marker.
33800 2021-01-08 Sergei Trofimovich <siarheit@google.com>
33802 * ipa-modref.c (merge_call_side_effects): Fix
33803 linebreak split by reordering two print calls.
33805 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
33807 * config/s390/vector.md (*tf_to_fprx2_0): Rename from
33808 "*mov_tf_to_fprx2_0" for consistency, fix constraint.
33809 (*tf_to_fprx2_1): Rename from "*mov_tf_to_fprx2_1" for
33810 consistency, fix constraint.
33812 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
33814 * config/s390/s390-c.c (s390_def_or_undef_macro): Accept
33815 callables instead of mask values.
33816 (struct target_flag_set_p): New predicate.
33817 (s390_cpu_cpp_builtins_internal): Define or undefine
33818 __LONG_DOUBLE_VX__ macro.
33820 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
33823 * config/i386/i386.c (x86_function_profiler): Use R10 and R11
33824 to call mcount in large model with PIC for NO_PROFILE_COUNTERS
33827 2021-01-08 Richard Biener <rguenther@suse.de>
33829 * tree-ssa-sccvn.c (pass_fre::execute): Reset the SCEV hash table.
33831 2021-01-08 Richard Biener <rguenther@suse.de>
33833 * tree-vect-slp.c (scalar_stmts_to_slp_tree_map_t): Fix.
33834 (vect_build_slp_tree): On cache hit release the matched
33835 scalar stmts vector.
33836 * tree-vect-stmts.c (vectorizable_store): Properly free
33837 vec_oprnds before possibly gathering them again.
33839 2021-01-08 Richard Biener <rguenther@suse.de>
33841 PR tree-optimization/98544
33842 * tree-vect-slp.c (vect_optimize_slp): Always materialize
33843 permutes at a permute node.
33845 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
33848 * config/i386/i386.c (x86_function_profiler): Use R10 to call
33849 mcount in large model. Sorry for large model with PIC.
33851 2021-01-08 Jakub Jelinek <jakub@redhat.com>
33854 * config/i386/i386.opt (ix86_cmodel, ix86_incoming_stack_boundary_arg,
33855 ix86_pmode, ix86_preferred_stack_boundary_arg, ix86_regparm,
33856 ix86_veclibabi_type): Remove x_ prefix, use TargetVariable instead of
33857 TargetSave and initialize for variables with enum types.
33858 (mfentry, mstack-protector-guard-reg=, mstack-protector-guard-offset=,
33859 mstack-protector-guard-symbol=): Add Save.
33860 * config/i386/i386-options.c (ix86_function_specific_save,
33861 ix86_function_specific_restore): Don't save or restore x_ix86_cmodel,
33862 x_ix86_incoming_stack_boundary_arg, x_ix86_pmode,
33863 x_ix86_preferred_stack_boundary_arg, x_ix86_regparm,
33864 x_ix86_veclibabi_type.
33866 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
33868 * config/aarch64/aarch64-sve.md (*cnot<mode>): Extend from
33869 SVE_FULL_I to SVE_I.
33870 (*cond_cnot<mode>_2, *cond_cnot<mode>_any): Likewise.
33872 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
33874 * config/aarch64/aarch64-sve.md (*cond_uxt<mode>_2): Extend from
33875 SVE_FULL_I to SVE_I.
33876 (*cond_uxt<mode>_any): Likewise.
33878 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33880 * config/aarch64/iterators.md (Vwhalf): New iterator.
33881 * config/aarch64/aarch64-simd.md (aarch64_<sur>adalp<mode>_3):
33883 (aarch64_<sur>adalp<mode>): ... This. Make more
33885 (<sur>sadv16qi): Adjust callsite of the above.
33886 * config/aarch64/aarch64-simd-builtins.def (sadalp, uadalp): New
33888 * config/aarch64/arm_neon.h (vpadal_s8): Reimplement using
33890 (vpadal_s16): Likewise.
33891 (vpadal_u8): Likewise.
33892 (vpadal_u16): Likewise.
33893 (vpadalq_s8): Likewise.
33894 (vpadalq_s16): Likewise.
33895 (vpadalq_s32): Likewise.
33896 (vpadalq_u8): Likewise.
33897 (vpadalq_u16): Likewise.
33898 (vpadalq_u32): Likewise.
33900 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33902 * config/aarch64/aarch64-simd.md (aarch64_<su>abd<mode>_3):
33904 (aarch64_<su>abd<mode>): ... This.
33905 (<sur>sadv16qi): Adjust callsite of the above.
33906 * config/aarch64/aarch64-simd-builtins.def (sabd, uabd): Define
33908 * config/aarch64/arm_neon.h (vabd_s8): Reimplement using
33910 (vabd_s16): Likewise.
33911 (vabd_s32): Likewise.
33912 (vabd_u8): Likewise.
33913 (vabd_u16): Likewise.
33914 (vabd_u32): Likewise.
33915 (vabdq_s8): Likewise.
33916 (vabdq_s16): Likewise.
33917 (vabdq_s32): Likewise.
33918 (vabdq_u8): Likewise.
33919 (vabdq_u16): Likewise.
33920 (vabdq_u32): Likewise.
33922 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33924 * config/aarch64/aarch64-simd-builtins.def (saba, uaba): Define
33926 * config/aarch64/arm_neon.h (vaba_s8): Implement using builtin.
33927 (vaba_s16): Likewise.
33928 (vaba_s32): Likewise.
33929 (vaba_u8): Likewise.
33930 (vaba_u16): Likewise.
33931 (vaba_u32): Likewise.
33932 (vabaq_s8): Likewise.
33933 (vabaq_s16): Likewise.
33934 (vabaq_s32): Likewise.
33935 (vabaq_u8): Likewise.
33936 (vabaq_u16): Likewise.
33937 (vabaq_u32): Likewise.
33939 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33941 * config/aarch64/aarch64-simd.md (aba<mode>_3): Rename to...
33942 (aarch64_<su>aba<mode>): ... This. Handle uaba as well.
33943 Change RTL pattern to match.
33945 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
33947 * common/config/riscv/riscv-common.c (riscv_current_subset_list): New.
33948 * config/riscv/riscv-c.c (riscv-subset.h): New.
33949 (INCLUDE_STRING): Define.
33950 (riscv_cpu_cpp_builtins): Add new style architecture extension
33952 * config/riscv/riscv-subset.h (riscv_subset_list::begin): New.
33953 (riscv_subset_list::end): New.
33954 (riscv_current_subset_list): New.
33956 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
33958 * common/config/riscv/riscv-common.c (RISCV_DONT_CARE_VERSION):
33959 Move to riscv-subset.h.
33960 (struct riscv_subset_t): Ditto.
33961 (class riscv_subset_list): Ditto.
33962 * config/riscv/riscv-subset.h (RISCV_DONT_CARE_VERSION): Move
33963 from riscv-common.c.
33964 (struct riscv_subset_t): Ditto.
33965 (class riscv_subset_list): Ditto.
33966 * config/riscv/t-riscv ($(common_out_file)): Add file
33969 2021-01-07 Jakub Jelinek <jakub@redhat.com>
33972 * config/i386/i386.md (*bmi_blsi_<mode>_cmp, *bmi_blsi_<mode>_ccno):
33973 New define_insn patterns.
33975 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33977 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_UNARY:optab><mode>)
33978 (*cond_<SVE_INT_UNARY:optab><mode>_2): Extend from SVE_FULL_I to SVE_I.
33979 (*cond_<SVE_INT_UNARY:optab><mode>_any): Likewise.
33981 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33983 PR tree-optimization/98560
33984 * internal-fn.def (IFN_VCONDU, IFN_VCONDEQ): Use type vec_cond.
33985 * internal-fn.c (vec_cond_mask_direct): Get the data mode from
33987 (vec_cond_direct): Likewise argument 2.
33988 (vec_condu_direct, vec_condeq_direct): Delete.
33989 (expand_vect_cond_optab_fn): Rename to...
33990 (expand_vec_cond_optab_fn): ...this, replacing old macro.
33991 (expand_vec_condu_optab_fn, expand_vec_condeq_optab_fn): Delete.
33992 (expand_vect_cond_mask_optab_fn): Rename to...
33993 (expand_vec_cond_mask_optab_fn): ...this, replacing old macro.
33994 (direct_vec_cond_mask_optab_supported_p): Treat the optab as a
33996 (direct_vec_cond_optab_supported_p): Likewise.
33997 (direct_vec_condu_optab_supported_p): Delete.
33998 (direct_vec_condeq_optab_supported_p): Delete.
33999 * gimple-isel.cc: Include internal-fn.h.
34000 (gimple_expand_vec_cond_expr): Check that IFN_VCONDEQ is supported
34003 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
34005 PR tree-optimization/98560
34006 * gimple-isel.cc (gimple_expand_vec_cond_expr): If we fail to use
34007 IFN_VCOND{,U,EQ}, fall back on IFN_VCOND_MASK.
34009 2021-01-07 Uroš Bizjak <ubizjak@gmail.com>
34011 * config/i386/i386.md (insn): Merge from plusminus_insn, shift_insn,
34012 rotate_insn and optab code attributes.
34013 Update all uses to merged code attribute.
34014 * config/i386/sse.md: Update all uses to merged code attribute.
34015 * config/i386/mmx.md: Update all uses to merged code attribute.
34017 2021-01-07 Jakub Jelinek <jakub@redhat.com>
34019 PR tree-optimization/98568
34020 * gimple-ssa-store-merging.c (bswap_view_convert): New function.
34021 (bswap_replace): Use it.
34023 2021-01-06 Vladimir N. Makarov <vmakarov@redhat.com>
34025 PR rtl-optimization/97978
34026 * lra-int.h (lra_hard_reg_split_p): New external.
34027 * lra.c (lra_hard_reg_split_p): New global.
34028 (lra): Set up lra_hard_reg_split_p after splitting a hard reg.
34029 * lra-assigns.c (lra_assign): Don't check allocation correctness
34030 after hard reg splitting.
34032 2021-01-06 Martin Sebor <msebor@redhat.com>
34035 * builtins.c (new_delete_mismatch_p): New overload.
34036 (new_delete_mismatch_p (tree, tree)): Call it.
34038 2021-01-06 Alexandre Oliva <oliva@adacore.com>
34040 * Makefile.in (T_GLIMITS_H): New.
34041 (stmp-int-hdrs): Depend on it, use it.
34042 * config/t-vxworks (T_GLIMITS_H): Override it.
34043 (vxw-glimits.h): New.
34045 2021-01-06 Richard Biener <rguenther@suse.de>
34047 PR tree-optimization/98513
34048 * value-range.cc (intersect_ranges): Compare the upper bounds
34049 for the expected relation.
34051 2021-01-06 Gerald Pfeifer <gerald@pfeifer.com>
34054 2020-12-28 Gerald Pfeifer <gerald@pfeifer.com>
34056 * doc/standards.texi (HSAIL): Remove section.
34058 2021-01-05 Samuel Thibault <samuel.thibault@ens-lyon.org>
34060 * configure: Re-generate.
34062 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34064 * doc/invoke.texi (-std=c++20): Adjust for the publication of
34065 ISO 14882:2020 standard.
34066 * doc/standards.texi: Likewise.
34068 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34070 PR tree-optimization/94802
34071 * expr.h (maybe_optimize_sub_cmp_0): Declare.
34072 * expr.c: Include tree-pretty-print.h and flags.h.
34073 (maybe_optimize_sub_cmp_0): New function.
34074 (do_store_flag): Use it.
34075 * cfgexpand.c (expand_gimple_cond): Likewise.
34077 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
34079 * mux-utils.h (pointer_mux::m_ptr): Tweak description of contents.
34080 * rtlanal.c (simple_regno_set): Tweak description to clarify the
34083 2021-01-05 Richard Biener <rguenther@suse.de>
34085 PR tree-optimization/98516
34086 * tree-vect-slp.c (vect_optimize_slp): Permute the incoming
34087 lanes when materializing on a VEC_PERM node.
34088 (vectorizable_slp_permutation): Dump the permute properly.
34090 2021-01-05 Richard Biener <rguenther@suse.de>
34092 * tree-vect-slp.c (vect_slp_region): Move debug counter
34093 to cover individual subgraphs.
34095 2021-01-05 Richard Biener <rguenther@suse.de>
34097 PR tree-optimization/98428
34098 * tree-vect-slp.c (vect_build_slp_tree_1): Properly reject
34099 vector lane extracts for loop vectorization.
34101 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34103 PR tree-optimization/98514
34104 * tree-ssa-reassoc.c (bb_rank): Change type from long * to
34106 (operand_rank): Change type from hash_map<tree, long> to
34107 hash_map<tree, int64_t>.
34108 (phi_rank): Change return type from long to int64_t.
34109 (loop_carried_phi): Change block_rank variable type from long to
34111 (propagate_rank): Change return type, rank parameter type and
34112 op_rank variable type from long to int64_t.
34113 (find_operand_rank): Change return type from long to int64_t
34114 and change slot variable type from long * to int64_t *.
34115 (insert_operand_rank): Change rank parameter type from long to
34117 (get_rank): Change return type and rank variable type from long to
34118 int64_t. Use PRId64 instead of ld to print the rank.
34119 (init_reassoc): Change rank variable type from long to int64_t
34120 and adjust correspondingly bb_rank and operand_rank initialization.
34122 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34124 PR tree-optimization/96928
34125 * tree-ssa-phiopt.c (xor_replacement): New function.
34126 (tree_ssa_phiopt_worker): Call it.
34128 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34130 PR tree-optimization/96930
34131 * match.pd ((A / (1 << B)) -> (A >> B)): If A is extended
34132 from narrower value which has the same type as 1 << B, perform
34133 the right shift on the narrower value followed by extension.
34135 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34137 PR tree-optimization/96239
34138 * gimple-ssa-store-merging.c (maybe_optimize_vector_constructor): New
34140 (get_status_for_store_merging): Don't return BB_INVALID for blocks
34141 with potential bswap optimizable CONSTRUCTORs.
34142 (pass_store_merging::execute): Optimize vector CONSTRUCTORs with bswap
34145 2021-01-05 Richard Biener <rguenther@suse.de>
34147 PR tree-optimization/98381
34148 * tree.c (vector_element_bits): Properly compute bool vector
34150 * tree-vect-loop.c (vectorizable_live_operation): Properly
34151 compute the last lane bit offset.
34153 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
34156 * config/i386/sse.md (sse_cvtps2pi): Redefine as define_insn_and_split.
34157 Clear the top 64 bytes of the input XMM register.
34158 (sse_cvttps2pi): Ditto.
34160 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
34163 * config/i386/xopintrin.h (_mm256_cmov_si256): New.
34165 2021-01-05 H.J. Lu <hjl.tools@gmail.com>
34168 * config/i386/xmmintrin.h (_mm_extract_pi16): Cast to unsigned
34171 2021-01-05 Claudiu Zissulescu <claziss@synopsys.com>
34173 * config/arc/arc.md (maddsidi4_split): Use ACC_REG_FIRST.
34174 (umaddsidi4_split): Likewise.
34176 2021-01-05 liuhongt <hongtao.liu@intel.com>
34179 * config/i386/sse.md (*sse2_pmovskb_zexthisi): New
34180 define_insn_and_split for zero_extend of subreg HI of pmovskb
34182 (*sse2_pmovskb_zexthisi): Add new combine splitters for
34183 zero_extend of not of subreg HI of pmovskb result.
34185 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
34188 * explow.c (convert_memory_address_addr_space_1): Handle UNSPECs
34190 * config/aarch64/aarch64.c (aarch64_expand_mov_immediate): Use
34191 convert_memory_address to convert symbolic immediates to ptr_mode
34192 before forcing them to memory.
34194 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
34196 PR rtl-optimization/97144
34197 * recog.c (constrain_operands): Initialize matching_operand
34198 for each alternative, rather than only doing it once.
34200 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
34202 PR rtl-optimization/98403
34203 * rtl-ssa/changes.cc (function_info::finalize_new_accesses): Explain
34204 why we don't remove call clobbers.
34205 (function_info::apply_changes_to_insn): Don't attempt to add
34206 call clobbers here.
34208 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
34210 PR tree-optimization/98371
34211 * tree-vect-loop.c (vect_reanalyze_as_main_loop): New function.
34212 (vect_analyze_loop): If an epilogue loop appears to be cheaper
34213 than the main loop, re-analyze it as a main loop before adopting
34216 2021-01-05 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
34219 * configure.ac (NETLIBS): Determine using AX_LIB_SOCKET_NSL.
34220 * aclocal.m4, configure: Regenerate.
34221 * Makefile.in (NETLIBS): Define.
34222 (BACKEND): Remove $(CODYLIB).
34224 2021-01-05 Jakub Jelinek <jakub@redhat.com>
34226 PR rtl-optimization/98334
34227 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
34228 Optimize (X - 1) * Y + Y to X * Y or (X + 1) * Y - Y to X * Y.
34230 2021-01-05 Bernd Edlinger <bernd.edlinger@hotmail.de>
34232 * tree-inline.c (expand_call_inline): Restore input_location.
34233 Return result from recursive call.
34235 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
34237 PR tree-optimization/95401
34238 * config/aarch64/aarch64-sve-builtins.cc
34239 (gimple_folder::load_store_cookie): Use bits rather than bytes
34240 for the alignment argument to IFN_MASK_LOAD and IFN_MASK_STORE.
34241 * gimple-fold.c (gimple_fold_mask_load_store_mem_ref): Likewise.
34242 * tree-vect-stmts.c (vectorizable_store): Likewise.
34243 (vectorizable_load): Likewise.
34245 2021-01-04 Richard Biener <rguenther@suse.de>
34247 PR tree-optimization/98308
34248 * tree-vect-stmts.c (vectorizable_load): Set invariant mask
34251 2021-01-04 Jakub Jelinek <jakub@redhat.com>
34253 PR tree-optimization/95771
34254 * tree-ssa-loop-niter.c (number_of_iterations_popcount): Handle types
34255 with precision smaller than int's precision and types with precision
34256 twice as large as long long. Formatting fixes.
34258 2021-01-04 Richard Biener <rguenther@suse.de>
34260 PR tree-optimization/98464
34261 * tree-ssa-sccvn.c (vn_valueize_for_srt): Rename from ...
34262 (vn_valueize_wrapper): ... this. Temporarily adjust vn_context_bb.
34263 (process_bb): Adjust.
34265 2021-01-04 Matthew Malcomson <matthew.malcomson@arm.com>
34268 * doc/invoke.texi (-fsanitize=address): Fix wording describing
34269 clash with -fsanitize=hwaddress.
34271 2021-01-04 Richard Biener <rguenther@suse.de>
34273 PR tree-optimization/98282
34274 * tree-ssa-sccvn.c (vn_get_stmt_kind): Classify tcc_reference on
34275 invariants as VN_NARY.
34277 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
34280 * config/aarch64/aarch64-simd.md (aarch64_combine<mode>): Accept
34281 aarch64_simd_reg_or_zero for operand 2. Use the combinez patterns
34282 to handle zero operands.
34284 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
34286 * config/aarch64/aarch64.c (offset_6bit_signed_scaled_p): New function.
34287 (offset_6bit_unsigned_scaled_p): Fix typo in comment.
34288 (aarch64_sve_prefetch_operand_p): Accept MUL VLs in the range
34291 2021-01-04 Richard Biener <rguenther@suse.de>
34293 PR tree-optimization/98393
34294 * tree-vect-slp.c (vect_build_slp_tree): Properly zero matches
34295 when hitting the limit.
34297 2021-01-04 Richard Biener <rguenther@suse.de>
34299 PR tree-optimization/98291
34300 * tree-vect-loop.c (vectorizable_reduction): Bypass
34301 associativity check for SLP reductions with VF 1.
34303 2021-01-04 Jakub Jelinek <jakub@redhat.com>
34305 PR tree-optimization/96782
34306 * match.pd (x == ~x -> false, x != ~x -> true): New simplifications.
34308 2021-01-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
34310 * collect-utils.c (collect_execute): Check dumppfx.
34311 * collect2.c (maybe_run_lto_and_relink, do_link): Pass atsuffix
34312 to collect_execute.
34313 (do_link): Add new parameter atsuffix.
34314 (main): Handle -dumpdir option. Skip one argument for
34315 -o, -isystem and -B options.
34316 * gcc.c (make_at_file): New helper function.
34317 (close_at_file): Use it.
34319 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34321 * config/darwin.h (MIN_LD64_NO_COAL_SECTS): Adjust.
34322 Amend handling for LD64_VERSION fallback defaults.
34324 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34326 * config.gcc: Compute default version information
34327 from the configured target. Likewise defaults for
34329 * config/darwin10.h: Removed.
34330 * config/darwin12.h: Removed.
34331 * config/darwin9.h: Removed.
34332 * config/rs6000/darwin8.h: Removed.
34334 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34336 * config/darwin9.h (ASM_OUTPUT_ALIGNED_COMMON): Delete.
34338 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34340 * config/darwin9.h (STACK_CHECK_STATIC_BUILTIN): Move from here..
34341 * config/darwin.h (STACK_CHECK_STATIC_BUILTIN): .. to here.
34343 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34345 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move from
34347 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): ... to here.
34349 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34351 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move the spec
34352 for the Darwin10 unwinder stub from here ...
34353 * config/darwin.h (LINK_COMMAND_SPEC_A): ... to here.
34355 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
34357 * config/darwin.h (DSYMUTIL_SPEC): Default to DWARF
34358 (ASM_DEBUG_SPEC):Only define if the assembler supports
34360 (PREFERRED_DEBUGGING_TYPE): Default to DWARF.
34361 (DARWIN_PREFER_DWARF): Define.
34362 * config/darwin9.h (PREFERRED_DEBUGGING_TYPE): Remove.
34363 (DARWIN_PREFER_DWARF): Likewise
34364 (DSYMUTIL_SPEC): Likewise.
34365 (COLLECT_RUN_DSYMUTIL): Likewise.
34366 (ASM_DEBUG_SPEC): Likewise.
34367 (ASM_DEBUG_OPTION_SPEC): Likewise.
34369 2021-01-02 Jan Hubicka <jh@suse.cz>
34371 * cfg.c (free_block): ggc_free bb.
34373 2021-01-01 Jakub Jelinek <jakub@redhat.com>
34375 * gcc.c (process_command): Update copyright notice dates.
34376 * gcov-dump.c (print_version): Ditto.
34377 * gcov.c (print_version): Ditto.
34378 * gcov-tool.c (print_version): Ditto.
34379 * gengtype.c (create_file): Ditto.
34380 * doc/cpp.texi: Bump @copying's copyright year.
34381 * doc/cppinternals.texi: Ditto.
34382 * doc/gcc.texi: Ditto.
34383 * doc/gccint.texi: Ditto.
34384 * doc/gcov.texi: Ditto.
34385 * doc/install.texi: Ditto.
34386 * doc/invoke.texi: Ditto.
34388 2021-01-01 Jakub Jelinek <jakub@redhat.com>
34390 * ChangeLog-2020: Rotate ChangeLog. New file.
34393 Copyright (C) 2021 Free Software Foundation, Inc.
34395 Copying and distribution of this file, with or without modification,
34396 are permitted in any medium without royalty provided the copyright
34397 notice and this notice are preserved.