1 2021-12-06 Paul A. Clarke <pc@us.ibm.com>
4 * config/rs6000/xmmintrin.h (_mm_movemask_ps): Replace "vector" with
7 2021-12-06 Jose E. Marchesi <jose.marchesi@oracle.com>
9 * config/bpf/bpf.c (bpf_handle_preserve_access_index_attribute):
10 Mark arguments `args' and flags' as unused.
11 (bpf_core_newdecl): Remove unused local `newdecl'.
12 (bpf_core_newdecl): Remove unused argument `loc'.
13 (ctfc_debuginfo_early_finish_p): Remove unused function.
14 (TARGET_CTFC_DEBUGINFO_EARLY_FINISH_P): Remove definition.
15 (bpf_core_walk): Do not pass a location to bpf_core_newdecl.
17 2021-12-06 Richard Sandiford <richard.sandiford@arm.com>
19 * gimple-range-edge.cc (gimple_outgoing_range::edge_range_p): Add
20 a shortcut for blocks with single successors.
21 * gimple-range-gori.cc (gori_map::calculate_gori): Likewise.
23 2021-12-06 Richard Sandiford <richard.sandiford@arm.com>
25 * value-range.cc (irange::irange_union): Use quick_push rather
26 than safe_push. Use widest_int rather than wide_int. Avoid
27 assigning wi::to_* results to wide*_int temporaries.
29 2021-12-06 Andrew MacLeod <amacleod@redhat.com>
31 * gimple-range-cache.cc (ranger_cache::fill_block_cache): Check for
32 a range from dominators before filling the cache.
33 (ranger_cache::range_from_dom): New.
34 * gimple-range-cache.h (ranger_cache::range_from_dom): Add prototype.
36 2021-12-06 Andrew MacLeod <amacleod@redhat.com>
38 * gimple-range-gori.h (class gori_compute):: Add prototypes.
39 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Add alternate
40 API for basic block. Call for edge alterantive.
41 (gori_compute::may_recompute_p): Ditto.
43 2021-12-06 Richard Biener <rguenther@suse.de>
45 PR tree-optimization/103581
46 * tree-vect-stmts.c (vect_build_gather_load_calls): Properly
47 guard all the AVX512 mask cases.
49 2021-12-06 Richard Biener <rguenther@suse.de>
51 PR tree-optimization/103544
52 * tree-vect-slp.c (vect_analyze_slp): Only add a SLP reduction
53 opportunity if the stmt in question is the reduction root.
54 (dot_slp_tree): Add missing check for NULL child.
56 2021-12-06 Jakub Jelinek <jakub@redhat.com>
59 * config/avr/avr.c (avr_output_data_section_asm_op,
60 avr_output_bss_section_asm_op): Change argument type from const void *
63 2021-12-06 Tamar Christina <tamar.christina@arm.com>
65 PR rtl-optimization/103404
66 * cse.c (find_sets_in_insn): Don't select elements out of a V1 mode
69 2021-12-06 Hongtao Liu <Hongtao.liu@intel.com>
70 Uroš Bizjak <ubizjak@gmail.com>
73 * config/i386/i386.c (ix86_preferred_reload_class): Allow
74 integer regs when moves between register units are cheap.
75 * config/i386/i386.h (INT_SSE_CLASS_P): New.
77 2021-12-05 David Edelsohn <dje.gcc@gmail.com>
79 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
80 AIX math builtin initialization before new_builtins_are_live.
82 2021-12-04 Jakub Jelinek <jakub@redhat.com>
84 * config/i386/x86-tune.def (X86_TUNE_PARTIAL_REG_DEPENDENCY): Fix
85 comment typo, Preffer -> prefer.
86 * ipa-modref-tree.c (modref_access_node::closer_pair_p): Likewise.
88 2021-12-04 Alexandre Oliva <oliva@adacore.com>
90 PR rtl-optimization/103028
91 * ifcvt.c (find_cond_trap): Validate new insns more strictly
94 2021-12-03 Martin Liska <mliska@suse.cz>
95 Peter Bergner <bergner@linux.ibm.com>
98 * config/rs6000/rs6000.c (rs6000_option_override_internal): Move the
99 disabling of shrink-wrapping when using -mrop-protect from here...
100 (rs6000_override_options_after_change): ...to here.
102 2021-12-03 H.J. Lu <hjl.tools@gmail.com>
105 * config/i386/i386-expand.c (ix86_expand_builtin): Pass PVW_NONE
106 and PVW_NONE to ix86_target_string.
107 * config/i386/i386-options.c (ix86_target_string): Add arguments
108 for move_max and store_max.
109 (ix86_target_string::add_vector_width): New lambda.
110 (ix86_debug_options): Pass ix86_move_max and ix86_store_max to
112 (ix86_function_specific_print): Pass ptr->x_ix86_move_max and
113 ptr->x_ix86_store_max to ix86_target_string.
114 (ix86_valid_target_attribute_tree): Handle x_ix86_move_max and
116 (ix86_option_override_internal): Set the default x_ix86_move_max
117 and x_ix86_store_max.
118 * config/i386/i386-options.h (ix86_target_string): Add
119 prefer_vector_width and prefer_vector_width.
120 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): Removed.
121 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
122 (MOVE_MAX): Use 64 if ix86_move_max or ix86_store_max ==
123 PVW_AVX512. Use 32 if ix86_move_max or ix86_store_max >=
125 (STORE_MAX_PIECES): Use 64 if ix86_store_max == PVW_AVX512.
126 Use 32 if ix86_store_max >= PVW_AVX256.
127 * config/i386/i386.opt: Add -mmove-max=bits and -mstore-max=bits.
128 * config/i386/x86-tune.def (X86_TUNE_AVX512_MOVE_BY_PIECES): New.
129 (X86_TUNE_AVX512_STORE_BY_PIECES): Likewise.
130 * doc/invoke.texi: Document -mmove-max=bits and -mstore-max=bits.
132 2021-12-03 Bill Schmidt <wschmidt@linux.ibm.com>
134 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Fix builtin
137 2021-12-03 SiYu Wu <siyu@isrc.iscas.ac.cn>
139 * common/config/riscv/riscv-common.c (riscv_implied_info):
140 Add K-ext related entry.
141 (riscv_supported_std_ext): Add 'k'.
142 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add 'k'.
143 (IMPLIED_EXT): Add K-ext related entry.
145 2021-12-03 SiYu Wu <siyu@isrc.iscas.ac.cn>
147 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
149 * config/riscv/riscv-opts.h (MASK_ZBKB): New.
159 (TARGET_ZBKB): Ditto.
160 (TARGET_ZBKC): Ditto.
161 (TARGET_ZBKX): Ditto.
162 (TARGET_ZKNE): Ditto.
163 (TARGET_ZKND): Ditto.
164 (TARGET_ZKNH): Ditto.
166 (TARGET_ZKSED): Ditto.
167 (TARGET_ZKSH): Ditto.
169 * config/riscv/riscv.opt (riscv_zk_subext): New.
171 2021-12-03 Tamar Christina <tamar.christina@arm.com>
173 * tree-vect-stmts.c (prepare_load_store_mask): Rename to...
174 (prepare_vec_mask): ...This and record operations that have already been
176 (vectorizable_call): Use it.
177 (vectorizable_operation): Likewise.
178 (vectorizable_store): Likewise.
179 (vectorizable_load): Likewise.
180 * tree-vectorizer.h (class _loop_vec_info): Add vec_cond_masked_set.
181 (vec_cond_masked_set_type, tree_cond_mask_hash): New.
183 2021-12-03 H.J. Lu <hjl.tools@gmail.com>
187 * target.def (ifunc_ref_local_ok): Add a target hook.
188 * varasm.c (default_binds_local_p_3): Force indirect function
189 resolver non-local only if targetm.ifunc_ref_local_ok returns
191 * config/i386/i386-expand.c (ix86_expand_call): Call
192 ix86_call_use_plt_p to check if PLT should be used.
193 * config/i386/i386-protos.h (ix86_call_use_plt_p): New.
194 * config/i386/i386.c (output_pic_addr_const): Call
195 ix86_call_use_plt_p to check if "@PLT" is needed.
196 (ix86_call_use_plt_p): New.
197 (TARGET_IFUNC_REF_LOCAL_OK): New.
198 * doc/tm.texi.in: Add TARGET_IFUNC_REF_LOCAL_OK.
199 * doc/tm.texi: Regenerated.
201 2021-12-03 Jakub Jelinek <jakub@redhat.com>
203 * attribs.h (simple_cst_list_equal): Declare.
204 * attribs.c (simple_cst_list_equal): No longer static.
205 * config/i386/i386-options.c (target_attribute_cache): New variable.
206 (ix86_valid_target_attribute_p): Cache DECL_FUNCTION_SPECIFIC_TARGET
207 and DECL_FUNCTION_SPECIFIC_OPTIMIZATION based on args.
209 2021-12-03 Jakub Jelinek <jakub@redhat.com>
212 * ggc.h (gt_pch_note_callback): Declare.
213 * gengtype.h (enum typekind): Add TYPE_CALLBACK.
214 (callback_type): Declare.
215 * gengtype.c (dbgprint_count_type_at): Handle TYPE_CALLBACK.
216 (callback_type): New variable.
217 (process_gc_options): Add CALLBACK argument, handle callback
219 (set_gc_used_type): Adjust process_gc_options caller, if callback,
220 set type to &callback_type.
221 (output_mangled_typename): Handle TYPE_CALLBACK.
222 (walk_type): Likewise. Handle callback option.
223 (write_types_process_field): Handle TYPE_CALLBACK.
224 (write_types_local_user_process_field): Likewise.
225 (write_types_local_process_field): Likewise.
226 (write_root): Likewise.
227 (dump_typekind): Likewise.
228 (dump_type): Likewise.
229 * gengtype-state.c (type_lineloc): Handle TYPE_CALLBACK.
230 (state_writer::write_state_callback_type): New method.
231 (state_writer::write_state_type): Handle TYPE_CALLBACK.
232 (read_state_callback_type): New function.
233 (read_state_type): Handle TYPE_CALLBACK.
234 * ggc-common.c (callback_vec): New variable.
235 (gt_pch_note_callback): New function.
236 (gt_pch_save): Stream out gt_pch_save function address and relocation
238 (gt_pch_restore): Stream in saved gt_pch_save function address and
239 relocation table and apply relocations if needed.
240 * doc/gty.texi (callback): Document new GTY option.
241 * varasm.c (get_unnamed_section): Change callback argument's type and
242 last argument's type from const void * to const char *.
243 (output_section_asm_op): Change argument's type from const void *
244 to const char *, remove unnecessary cast.
245 * tree-core.h (struct tree_translation_unit_decl): Drop GTY((skip))
246 from language member.
247 * output.h (unnamed_section_callback): Change argument type from
248 const void * to const char *.
249 (struct unnamed_section): Use GTY((callback)) instead of GTY((skip))
250 for callback member. Change data member type from const void *
252 (struct noswitch_section): Use GTY((callback)) instead of GTY((skip))
254 (get_unnamed_section): Change callback argument's type and
255 last argument's type from const void * to const char *.
256 (output_section_asm_op): Change argument's type from const void *
258 * config/avr/avr.c (avr_output_progmem_section_asm_op): Likewise.
259 Remove unneeded cast.
260 * config/darwin.c (output_objc_section_asm_op): Change argument's type
261 from const void * to const char *.
262 * config/pa/pa.c (som_output_text_section_asm_op): Likewise.
263 (som_output_comdat_data_section_asm_op): Likewise.
264 * config/rs6000/rs6000.c (rs6000_elf_output_toc_section_asm_op):
266 (rs6000_xcoff_output_readonly_section_asm_op): Likewise. Instead
267 of dereferencing directive hardcode variable names and decide based on
268 whether directive is NULL or not.
269 (rs6000_xcoff_output_readwrite_section_asm_op): Change argument's type
270 from const void * to const char *.
271 (rs6000_xcoff_output_tls_section_asm_op): Likewise. Instead
272 of dereferencing directive hardcode variable names and decide based on
273 whether directive is NULL or not.
274 (rs6000_xcoff_output_toc_section_asm_op): Change argument's type
275 from const void * to const char *.
276 (rs6000_xcoff_asm_init_sections): Adjust get_unnamed_section callers.
278 2021-12-03 Richard Purdie <richard.purdie@linuxfoundation.org>
280 * Makefile.in: Fix "argument list too long" from install-plugins.
282 2021-12-03 Matt Jacobson <mhjacobson@me.com>
284 * config.gcc: For the AVR target, populate TM_MULTILIB_CONFIG.
285 * config/avr/genmultilib.awk: Add ability to filter generated multilib
287 * config/avr/t-avr: Pass TM_MULTILIB_CONFIG to genmultilib.awk.
288 * configure.ac: Update help string for --with-multilib-list.
289 * configure: Regenerate.
291 2021-12-02 Ilya Lipnitskiy <ilya.lipnitskiy@gmail.com>
293 * configure: Regenerate.
294 * configure.ac: Define TARGET_LIBC_GNUSTACK on musl.
296 2021-12-02 Iain Sandoe <iain@sandoe.co.uk>
299 * config/host-darwin.c (TRY_EMPTY_VM_SPACE,
300 SAFE_ALLOC_SIZE): New.
301 (darwin_gt_pch_get_address): Rewrite to use nominated
302 memory segments rather than part of the compiler __DATA
304 (darwin_gt_pch_use_address): Likewise.
306 2021-12-02 Gerald Pfeifer <gerald@pfeifer.com>
308 * doc/install.texi (*-*-freebsd*): Remove references to
309 FreeBSD 1 and FreeBSD 2.
311 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
313 PR rtl-optimization/103437
314 * ira-color.c (setup_allocno_priorities): Switch off backup code
315 for overflow if compiler has __builtin_smul_overflow. Use <
316 for comparison with -INT_MAX.
318 2021-12-02 Bill Schmidt <wschmidt@linux.ibm.com>
320 * config/rs6000/altivec.h: Delete a number of #defines that are now
321 superfluous. Alphabetize. Include rs6000-vecdefines.h. Include some
323 * config/rs6000/rs6000-builtin-new.def (CMPB): Flag as no32bit.
324 (BPERMD): Flag as 32bit (needing special handling for 32-bit).
325 (UNPACK_TD): Return unsigned long long instead of unsigned long.
326 (GET_TEXASR): Return unsigned long instead of unsigned long long.
327 (GET_TEXASRU): Likewise.
328 (GET_TFHAR): Likewise.
329 (GET_TFIAR): Likewise.
330 (SET_TEXASR): Pass unsigned long instead of unsigned long long.
331 (SET_TEXASRU): Likewise.
332 (SET_TFHAR): Likewise.
333 (SET_TFIAR): Likewise.
334 (TABORTDC): Likewise.
335 (TABORTDCI): Likewise.
336 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): Fix error
337 handling for no32bit. Add 32bit handling for RS6000_BIF_BPERMD.
338 * config/rs6000/rs6000-gen-builtins.c (write_init_file): Initialize
339 new_builtins_are_live to 1.
341 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
343 PR rtl-optimization/103437
344 * ira-color.c (setup_allocno_priorities): Use long long
345 multiplication as backup for overflow processing.
347 2021-12-02 Tamar Christina <tamar.christina@arm.com>
349 * config/aarch64/aarch64-protos.h (aarch64_gen_shareable_zero): New.
350 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>,
351 aarch64_rshrn2<mode>): Generate rounding half-ing add when appropriate.
352 * config/aarch64/aarch64.c (aarch64_gen_shareable_zero): New.
354 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
356 PR rtl-optimization/103437
357 * ira-color.c (setup_allocno_priorities): Process multiplication
360 2021-12-02 Martin Liska <mliska@suse.cz>
362 PR gcov-profile/96092
363 * common.opt: New option.
364 * coverage.c (coverage_begin_function): Emit filename with
365 remap_profile_filename.
366 * doc/invoke.texi: Document the new option.
367 * file-prefix-map.c (add_profile_prefix_map): New.
368 (remap_profile_filename): Likewise.
369 * file-prefix-map.h (add_profile_prefix_map): Likewise.
370 (remap_profile_filename): Likewise.
371 * lto-opts.c (lto_write_options): Handle
372 OPT_fprofile_prefix_map_.
373 * opts-global.c (handle_common_deferred_options): Likewise.
374 * opts.c (common_handle_option): Likewise.
375 (gen_command_line_string): Likewise.
376 * profile.c (output_location): Emit filename with
377 remap_profile_filename.
379 2021-12-02 Richard Biener <rguenther@suse.de>
382 * internal-fn.c (expand_DEFERRED_INIT): When the base
383 of the LHS is a decl with matching constant size use
384 that as the initialization target instead of an
385 eventual VLA typed one.
387 2021-12-02 Richard Biener <rguenther@suse.de>
389 PR tree-optimization/103527
390 * tree-vect-stmts.c (vect_build_gather_load_calls): Always
391 use a truth type for building the vector mask.
393 2021-12-02 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
395 * config/vxworks.h (LTO_PLUGIN_SPEC): Adapt to corrected
396 spelling of -linker-output-auto-nolto-rel.
398 2021-12-01 Uroš Bizjak <ubizjak@gmail.com>
401 * config/i386/sse.md (VI2F): Remove mode iterator.
402 (VI2F_256_512): New mode iterator.
403 (vec_set<V8_128:mode>_0): New insn pattern.
404 (vec_set<VI2F_256_512:mode>_0>): Rename from vec_set<VI2F:mode>mode.
405 Use VI2F_256_512 mode iterator instead of VI2F.
406 (*axv512fp16_movsh): Remove.
407 (<sse2p4_1>_pinsr<ssemodesuffix>): Add (x,x,x) AVX2 alternative.
408 Do not disable V8HF mode insn on AVX2 targets.
409 (pinsrw -> pbroadcast + pblendw peephole2): New peephole.
410 (pinsrw -> pbroadcast + pblendw splitter): New post-reload splitter.
411 * config/i386/i386.md (extendhfsf): Call gen_vec_setv8hf_0.
412 * config/i386/i386-expand.c (ix86_expand_vector_set)
413 <case E_V8HFmode>: Use vec_merge path for TARGET_AVX2.
415 2021-12-01 Roger Sayle <roger@nextmovesoftware.com>
416 Richard Biener <rguenther@suse.de>
418 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
419 Check if simplify_using_initial_conditions allows us to
420 simplify the expression for may_be_zero.
421 * match.pd (X != C ? -X : -C -> -X): New transform.
422 (X != C ? ~X : ~C -> ~X): Likewise.
423 ((X+1) > Y ? -X : 1 -> X >= Y ? -X : 1): Likewise.
425 2021-12-01 Peter Bergner <bergner@linux.ibm.com>
428 * gimplify.c (is_var_need_auto_init): Handle opaque types.
430 2021-12-01 Iain Buclaw <ibuclaw@gdcproject.org>
432 * common.opt (static-libphobos): Add option.
433 * config/darwin.h (LINK_SPEC): Substitute -lgphobos with libgphobos.a
434 when linking statically.
435 * gcc.c (driver_handle_option): Set -static-libphobos as always valid.
437 2021-12-01 Jason Merrill <jason@redhat.com>
440 * fold-const.c (maybe_nonzero_address): Use get_create or get
441 depending on folding_initializer.
442 (fold_binary_initializer_loc): New.
443 * fold-const.h (fold_binary_initializer_loc): Declare.
444 * varasm.c (mark_weak): Don't use the decl location.
445 * doc/invoke.texi: Document -fconstexpr-fp-except.
447 2021-12-01 Bill Schmidt <wschmidt@linux.ibm.com>
450 * config/rs6000/rs6000-c.c (rs6000-builtins.h): Stop including.
451 (rs6000-internal.h): Include.
452 (altivec_resolve_new_overloaded_builtin): Move call to
453 rs6000_invalid_new_builtin here from rs6000_new_builtin_decl.
454 * config/rs6000/rs6000-call.c (rs6000-builtins.h): Stop including.
455 (rs6000_invalid_new_builtin): Remove static qualifier.
456 (rs6000_new_builtin_decl): Remove test for supported builtin.
457 * config/rs6000/rs6000-internal.h (rs6000-builtins.h): Include.
458 (rs6000_invalid_new_builtin): Declare.
459 * config/rs6000/rs6000.c (rs6000-builtins.h): Don't include.
461 2021-12-01 Aldy Hernandez <aldyh@redhat.com>
463 PR tree-optimization/103409
464 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
465 Do all the work with just one ssa_global_cache.
466 * gimple-range-path.h: Remove m_tmp_phi_cache.
468 2021-12-01 Richard Sandiford <richard.sandiford@arm.com>
470 PR tree-optimization/103517
471 * tree-vect-slp.c (vect_build_slp_tree_1): When allowing two
472 different component references, check the codes of both them,
473 rather than just the first.
475 2021-12-01 Andrew MacLeod <amacleod@redhat.com>
477 PR tree-optimization/103464
478 * gimple-range.cc (gimple_ranger::prefill_name): Process phis also.
479 (gimple_ranger::prefill_stmt_dependencies): Ditto.
481 2021-12-01 Iain Buclaw <ibuclaw@gdcproject.org>
483 * doc/install.texi (Prerequisites): Add note that D front end now
484 requires GDC installed in order to bootstrap.
485 (Building): Add D compiler section, referencing prerequisites.
487 2021-12-01 Jakub Jelinek <jakub@redhat.com>
489 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE,
490 X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY,
491 X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Formatting fixes.
492 (X86_TUNE_USE_GATHER): Put m_GENERIC last for consistency.
494 2021-12-01 Jakub Jelinek <jakub@redhat.com>
496 PR rtl-optimization/102356
497 * rtl.h (simplify_context): Add assoc_count member and
498 max_assoc_count static member.
499 * simplify-rtx.c (simplify_associative_operation): Don't reassociate
500 more than max_assoc_count times within one outermost simplify_* call.
501 * dwarf2out.c (mem_loc_descriptor): Optimize binary operation
502 with both operands the same using DW_OP_dup.
504 2021-12-01 Tamar Christina <tamar.christina@arm.com>
506 * match.pd: Move below pattern that rewrites to EQ, NE.
507 * tree.c (bitmask_inv_cst_vector_p): Correct do .. while indentation.
509 2021-12-01 Siddhesh Poyarekar <siddhesh@gotplt.org>
510 Martin Liška <mliska@suse.cz>
512 PR tree-optimization/103456
513 * tree-object-size.c (merge_object_sizes): Update osi->changed
514 only if object_sizes_set succeeded.
516 2021-11-30 liuhongt <hongtao.liu@intel.com>
520 * config/i386/i386.md (*x86_64_shld_1): Set_attr
521 length_immediate to 1.
522 (*x86_shld_1): Ditto.
523 (*x86_64_shrd_1): Ditto.
524 (*x86_shrd_1): Ditto.
526 2021-11-30 Bill Schmidt <wschmidt@linux.ibm.com>
528 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
530 (altivec_resolve_new_overloaded_builtin): Likewise.
532 2021-11-30 Martin Jambor <mjambor@suse.cz>
535 * ipa-sra.c (scan_function): Also check ECF_LOOPING_CONST_OR_PURE flag.
537 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
539 PR tree-optimization/103494
540 * tree-vect-stmts.c (vect_get_gather_scatter_ops): Remove ncopies
541 argument and calculate ncopies from gs_info->offset_vectype
543 (vectorizable_store, vectorizable_load): Update accordingly.
545 2021-11-30 Iain Buclaw <ibuclaw@gdcproject.org>
547 * Makefile.in (GDC): New variable.
548 (GDCFLAGS): New variable.
549 * configure: Regenerate.
550 * configure.ac: Add call to ACX_PROG_GDC. Substitute GDCFLAGS.
552 2021-11-30 Martin Liska <mliska@suse.cz>
553 Martin Jambor <mjambor@suse.cz>
556 * ipa-param-manipulation.c
557 (ipa_param_body_adjustments::prepare_debug_expressions): Be
558 careful about hash_map reallocating itself. Simpify a return
559 which always returns true.
561 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
563 PR tree-optimization/103440
564 * gimple-range-fold.cc (fold_using_range::range_of_phi): Continue
565 normal param processing for equiv params.
567 2021-11-30 Richard Biener <rguenther@suse.de>
569 * config/i386/i386.c (ix86_shift_rotate_cost): Remove
571 * tree-chrec.c (evolution_function_is_invariant_rec_p):
573 * tree-if-conv.c (if_convertible_stmt_p): Likewise.
574 * tree-ssa-pre.c (fully_constant_expression): Likewise.
575 * tree-vrp.c (operand_less_p): Likewise.
576 * reload.c (reg_overlap_mentioned_for_reload_p): Remove
577 unreachable gcc_unreachable ().
578 * sel-sched-ir.h (bb_next_bb): Likewise.
579 * varasm.c (compare_constant): Likewise.
581 2021-11-30 Richard Biener <rguenther@suse.de>
583 PR tree-optimization/103489
584 * tree-vect-loop.c (vectorizable_phi): Verify argument
585 vector type compatibility to mitigate bool pattern recog
588 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
590 * omp-offload.c (oacc_loop_auto_partitions): Remove erroneous
591 "Orphan reductions cannot have gang partitioning" handling.
593 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
595 * omp-offload.c (oacc_loop_process): Implement "gang reduction on
596 an orphan loop" checking.
598 2021-11-30 Cesar Philippidis <cesar@codesourcery.com>
599 Thomas Schwinge <thomas@codesourcery.com>
601 * omp-general.h (enum oacc_loop_flags): Add OLF_REDUCTION enum.
602 * omp-low.c (lower_oacc_head_mark): Use it to mark OpenACC
604 * omp-offload.c (oacc_loop_auto_partitions): Don't assign gang
605 level parallelism to orphan reductions.
607 2021-11-30 Aldy Hernandez <aldyh@redhat.com>
609 PR tree-optimization/103451
610 * range-op.cc (operator_div::wi_fold): Remove
611 can_throw_non_call_exceptions special case.
612 * tree-ssa-sink.c (sink_code_in_bb): Same.
614 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
616 * tree-vect-slp.c (arg1_arg4_map): New variable.
617 (vect_get_operand_map): Handle IFN_MASK_GATHER_LOAD.
618 (vect_build_slp_tree_1): Likewise.
619 (vect_build_slp_tree_2): Likewise.
620 * tree-vect-stmts.c (vectorizable_load): Expect the mask to be
621 the last SLP child node rather than the first.
623 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
625 * tree-if-conv.c: Include tree-eh.h.
626 (predicate_statements): Remove pe argument. Don't hoist
628 (combine_blocks): Remove pe argument.
629 (ifcvt_available_on_edge_p, ifcvt_can_hoist): New functions.
630 (ifcvt_hoist_invariants): Likewise.
631 (tree_if_conversion): Update call to combine_blocks. Call
632 ifcvt_hoist_invariants after VN.
634 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
636 PR tree-optimization/102467
637 * doc/sourcebuild.texi (vect_gather_load_ifn): Document.
638 * tree-vect-data-refs.c (vect_analyze_data_ref_dependence):
639 Commonize safelen handling. Punt for anything involving
640 gathers and scatters unless safelen says otherwise.
641 * tree-vect-slp.c (arg1_map): New variable.
642 (vect_get_operand_map): Handle IFN_GATHER_LOAD.
643 (vect_build_slp_tree_1): Likewise.
644 (vect_build_slp_tree_2): Likewise.
645 (compatible_calls_p): If vect_get_operand_map returns nonnull,
646 check that any skipped arguments are equal.
647 (vect_slp_analyze_node_operations_1): Tighten reduction check.
648 * tree-vect-stmts.c (check_load_store_for_partial_vectors): Take
650 (vect_get_gather_scatter_ops): Take slp_node and ncopies arguments.
652 (vectorizable_store, vectorizable_load): Adjust accordingly.
654 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
656 * gimple.h (gimple_num_args, gimple_arg, gimple_arg_ptr): New
658 * tree-vect-slp.c (cond_expr_maps, arg2_map): New variables.
659 (vect_get_operand_map): New function.
660 (vect_get_and_check_slp_defs): Fix outdated comment.
661 Use vect_get_operand_map and new gimple argument accessors.
662 (vect_build_slp_tree_2): Likewise.
664 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
666 * tree-vect-slp.c (vect_build_slp_tree_1): Use code_helper
667 to record the operations performed by statements, only using
668 CALL_EXPR for things that don't map to built-in or internal
669 functions. For shifts, require all shift amounts to be equal
670 if optab_vector is not supported but optab_scalar is.
672 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
674 * tree-vect-data-refs.c (vect_check_gather_scatter): Continue
675 processing conversions if the current offset is a pointer.
677 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
679 * tree-vectorizer.h (vect_is_reduction): Use STMT_VINFO_REDUC_IDX.
681 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
683 * genopinit.c (main): Turn supports_vec_gather_load and
684 supports_vec_scatter_store into signed char arrays and remove
685 supports_vec_gather_load_cached and supports_vec_scatter_store_cached.
686 * optabs-query.c (supports_vec_convert_optab_p): Add a mode parameter.
687 If the mode is not VOIDmode, test only for that mode.
688 (supports_vec_gather_load_p): Likewise.
689 (supports_vec_scatter_store_p): Likewise.
690 * optabs-query.h (supports_vec_gather_load_p): Likewise.
691 (supports_vec_scatter_store_p): Likewise.
692 * tree-vect-data-refs.c (vect_check_gather_scatter): Pass the
693 vector mode to supports_vec_gather_load_p and
694 supports_vec_scatter_store_p.
696 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
698 * internal-fn.c (commutative_binary_fn_p): Handle IFN_ADD_OVERFLOW
699 and IFN_MUL_OVERFLOW.
701 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
703 * internal-fn.c (commutative_binary_fn_p): Handle IFN_UBSAN_CHECK_ADD
704 and IFN_UBSAN_CHECK_MUL.
706 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
708 * internal-fn.c (commutative_binary_fn_p): Handle IFN_COMPLEX_MUL.
710 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
712 * gimple-fold.c: Include internal-fn.h.
713 (fold_stmt_1): If a function maps to an internal one, use
714 first_commutative_argument to canonicalize the order of
715 commutative arguments.
716 * gimple-match-head.c (gimple_resimplify2, gimple_resimplify3)
717 (gimple_resimplify4, gimple_resimplify5): Extend commutativity
720 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
722 * doc/md.texi (reduc_fmin_scal_@var{m}): Document.
723 (reduc_fmax_scal_@var{m}): Likewise.
724 * optabs.def (reduc_fmax_scal_optab): New optab.
725 (reduc_fmin_scal_optab): Likewise
726 * internal-fn.def (REDUC_FMAX, REDUC_FMIN): New functions.
727 * tree-vect-loop.c (reduction_fn_for_scalar_code): Handle
728 CASE_CFN_FMAX and CASE_CFN_FMIN.
729 (neutral_op_for_reduction): Likewise.
730 (needs_fold_left_reduction_p): Likewise.
731 * config/aarch64/iterators.md (FMAXMINV): New iterator.
732 (fmaxmin): Handle UNSPEC_FMAXNMV and UNSPEC_FMINNMV.
733 * config/aarch64/aarch64-simd.md (reduc_<optab>_scal_<mode>): Fix
735 (reduc_<fmaxmin>_scal_<mode>): New pattern.
736 * config/aarch64/aarch64-sve.md (reduc_<fmaxmin>_scal_<mode>):
739 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
741 * builtins.h (associated_internal_fn): Declare overload that
742 takes a (combined_cfn, return type) pair.
743 * builtins.c (associated_internal_fn): Split new overload out
744 of original fndecl version. Also provide an overload that takes
745 a (combined_cfn, return type) pair.
746 * internal-fn.h (commutative_binary_fn_p): Declare.
747 (commutative_ternary_fn_p): Likewise.
748 (associative_binary_fn_p): Likewise.
749 * internal-fn.c (commutative_binary_fn_p, commutative_ternary_fn_p):
750 New functions, split out from...
751 (first_commutative_argument): ...here.
752 (associative_binary_fn_p): New function.
753 * gimple-match.h (code_helper): Add a constructor that takes
755 (commutative_binary_op_p): Declare.
756 (commutative_ternary_op_p): Likewise.
757 (first_commutative_argument): Likewise.
758 (associative_binary_op_p): Likewise.
759 (canonicalize_code): Likewise.
760 (directly_supported_p): Likewise.
761 (get_conditional_internal_fn): Likewise.
762 (gimple_build): New overloads that takes a code_helper.
763 * gimple-fold.c (gimple_build): Likewise.
764 * gimple-match-head.c (commutative_binary_op_p): New function.
765 (commutative_ternary_op_p): Likewise.
766 (first_commutative_argument): Likewise.
767 (associative_binary_op_p): Likewise.
768 (canonicalize_code): Likewise.
769 (directly_supported_p): Likewise.
770 (get_conditional_internal_fn): Likewise.
771 * tree-vectorizer.h: Include gimple-match.h.
772 (neutral_op_for_reduction): Take a code_helper instead of a tree_code.
773 (needs_fold_left_reduction_p): Likewise.
774 (reduction_fn_for_scalar_code): Likewise.
775 (vect_can_vectorize_without_simd_p): Declare a nNew overload that takes
777 * tree-vect-loop.c: Include case-cfn-macros.h.
778 (fold_left_reduction_fn): Take a code_helper instead of a tree_code.
779 (reduction_fn_for_scalar_code): Likewise.
780 (neutral_op_for_reduction): Likewise.
781 (needs_fold_left_reduction_p): Likewise.
782 (use_mask_by_cond_expr_p): Likewise.
783 (build_vect_cond_expr): Likewise.
784 (vect_create_partial_epilog): Likewise. Use gimple_build rather
785 than gimple_build_assign.
786 (check_reduction_path): Handle calls and operate on code_helpers
787 rather than tree_codes.
788 (vect_is_simple_reduction): Likewise.
789 (vect_model_reduction_cost): Likewise.
790 (vect_find_reusable_accumulator): Likewise.
791 (vect_create_epilog_for_reduction): Likewise.
792 (vect_transform_cycle_phi): Likewise.
793 (vectorizable_reduction): Likewise. Make more use of
795 (vect_transform_reduction): Use gimple_extract_op but expect
797 (vect_can_vectorize_without_simd_p): New overload that takes
799 * tree-vect-stmts.c (vectorizable_call): Handle reductions in
801 * tree-vect-patterns.c (vect_mark_pattern_stmts): Use
802 gimple_extract_op when updating STMT_VINFO_REDUC_IDX.
804 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
806 * gimple-match.h (code_helper): Provide == and != overloads.
807 (code_helper::operator tree_code): Make explicit.
808 (code_helper::operator combined_fn): Likewise.
809 * gimple-match-head.c (convert_conditional_op): Use explicit
810 conversions where necessary.
811 (gimple_resimplify1, gimple_resimplify2, gimple_resimplify3): Likewise.
812 (maybe_push_res_to_seq, gimple_simplify): Likewise.
813 * gimple-fold.c (replace_stmt_with_simplification): Likewise.
815 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
817 * gimple-match.h (code_helper): Add functions for querying whether
818 the code represents an internal_fn or a built_in_function.
819 Provide explicit conversion operators for both cases.
820 (gimple_extract_op): Declare.
821 * gimple-match-head.c (gimple_extract): New function, extracted from...
822 (gimple_simplify): ...here.
823 (gimple_extract_op): New function.
825 2021-11-30 Eric Botcazou <ebotcazou@adacore.com>
828 * config/i386/i386.c (ix86_output_call_insn): Beef up comment about
829 nops emitted with SEH.
830 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
831 the cold section, emit a nop before the directive if the previous
832 active instruction is a call.
834 2021-11-30 Roger Sayle <roger@nextmovesoftware.com>
835 Uroš Bizjak <ubizjak@gmail.com>
838 * config/i386/sse.md (define_split): New splitters to simplify
839 not;vec_duplicate;and as vec_duplicate;andn.
841 2021-11-30 Richard Biener <rguenther@suse.de>
844 * match.pd (-((int)x >> (prec - 1)) to (unsigned)x >> (prec - 1)):
845 Use VIEW_CONVERT_EXPR for vectors.
847 2021-11-30 Richard Biener <rguenther@suse.de>
849 * cfgrtl.c (skip_insns_after_block): Refactor code to
850 be more easily readable.
851 * expr.c (op_by_pieces_d::run): Remove unreachable
853 * sched-deps.c (sched_analyze): Remove unreachable
855 * sel-sched-ir.c (in_same_ebb_p): Likewise.
856 * tree-ssa-alias.c (nonoverlapping_refs_since_match_p):
857 Remove unreachable code.
858 * tree-vect-slp.c (vectorize_slp_instance_root_stmt):
859 Refactor to avoid unreachable loop iteration.
860 * tree.c (walk_tree_1): Remove unreachable break.
861 * vec-perm-indices.c (vec_perm_indices::series_p): Remove
864 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
867 * config/rs6000/rs6000-call.c (rs6000_builtin_decl): Remove builtin mask
870 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
872 * config/rs6000/rs6000.c
873 (rs6000_cost_data::update_target_cost_per_stmt): Adjust the way to
874 compute extra penalized cost. Remove useless parameter.
875 (rs6000_cost_data::rs6000_add_stmt_cost): Adjust the call to function
876 update_target_cost_per_stmt.
878 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
881 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
883 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
884 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
885 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
886 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
887 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
888 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
889 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
890 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
892 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
894 PR tree-optimization/103467
895 * gimple-range-gori.cc (range_def_chain::register_dependency): Don't
896 use an object reference after a potential resize.
898 2021-11-29 Uroš Bizjak <ubizjak@gmail.com>
901 * config/i386/i386.md (*movhi_internal): Introduce (*v,C) alternative.
902 Do not allocate non-GPR registers. Optimize xmm->xmm moves when
903 optimizing for size. Fix vpinsrw insn template.
904 (*movhf_internal): Fix pinsrw and pextrw insn templates for
905 AVX targets. Use sselog1 type instead of sselog. Optimize GPR moves.
906 Optimize xmm->xmm moves for TARGET_SSE_PARTIAL_REG_DEPENDENCY
907 and TARGET_SSE_SPLIT_REGS targets.
909 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
911 * configure: Re-regenerate.
913 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
916 * Makefile.in: Substitute CTAGS, ETAGS, and CSCOPE
917 variables. Use ETAGS variable in TAGS target.
918 * configure: Regenerate.
919 * configure.ac: Allow CTAGS, ETAGS, and CSCOPE
920 variables to be overridden.
922 2021-11-29 Paul A. Clarke <pc@us.ibm.com>
924 * config/rs6000/xmmintrin.h (_mm_movemask_ps): Use vec_extractm
926 * config/rs6000/emmintrin.h (_mm_movemask_pd): Likewise.
927 (_mm_movemask_epi8): Likewise.
929 2021-11-29 Richard Biener <rguenther@suse.de>
931 * read-rtl-function.c (function_reader::read_rtx_operand):
932 Return only after resetting m_in_call_function_usage.
934 2021-11-29 Roger Sayle <roger@nextmovesoftware.com>
936 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti): Perform the
937 conversion via V2DImode using vec_extractv2didi on TARGET_SSE2.
938 * config/i386/sse.md (rotlv1ti3, rotrv1ti3): Change constraint
939 on QImode shift amounts from const_int_operand to general_operand.
941 2021-11-29 Richard Biener <rguenther@suse.de>
943 * tree.h (reverse_storage_order_for_component_p): Remove
944 spurious gcc_unreachable.
945 * cfganal.c (dfs_find_deadend): Likewise.
946 * fold-const-call.c (fold_const_logb): Likewise.
947 (fold_const_significand): Likewise.
948 * gimple-ssa-store-merging.c (lhs_valid_for_store_merging_p):
951 2021-11-29 Richard Biener <rguenther@suse.de>
953 * vec.c (qsort_chk): Do not return the void return value
954 from the noreturn qsort_chk_error.
955 * ccmp.c (expand_ccmp_expr_1): Remove unreachable return.
956 * df-scan.c (df_ref_equal_p): Likewise.
957 * dwarf2out.c (is_base_type): Likewise.
958 (add_const_value_attribute): Likewise.
959 * fixed-value.c (fixed_arithmetic): Likewise.
960 * gimple-fold.c (gimple_fold_builtin_fputs): Likewise.
961 * gimple-ssa-strength-reduction.c (stmt_cost): Likewise.
962 * graphite-isl-ast-to-gimple.c
963 (gcc_expression_from_isl_expr_op): Likewise.
964 (gcc_expression_from_isl_expression): Likewise.
965 * ipa-fnsummary.c (will_be_nonconstant_expr_predicate):
967 * lto-streamer-in.c (lto_input_mode_table): Likewise.
969 2021-11-29 liuhongt <hongtao.liu@intel.com>
972 * config/i386/i386.c (inline_secondary_memory_needed): HImode
973 move between GPR and SSE registers is supported under
974 TARGET_SSE2 and above.
975 * config/i386/i386.md (extendhfsf2): Optimize expander.
977 * config/i386/sse.md (sse2p4_1): Adjust attr for V8HFmode to
980 2021-11-29 liuhongt <hongtao.liu@intel.com>
984 * config/i386/i386.c (ix86_secondary_reload): Without
985 TARGET_SSE4_1, General register is needed to move HImode from
986 sse register to memory.
987 * config/i386/sse.md (*vec_extrachf): Use %vpextrw instead of
988 pextrw in output templates.
989 * config/i386/i386.md (movhi_internal): Ditto, also fix typo of
990 MEM_P (operands[1]) and adjust mode/prefix/type attribute for
991 alternatives related to sse register.
993 2021-11-29 Richard Biener <rguenther@suse.de>
995 PR tree-optimization/103458
996 * tree-ssa-dce.c (make_forwarders_with_degenerate_phis): Do not
997 create forwarders for blocks with abnormal predecessors.
999 2021-11-29 Richard Biener <rguenther@suse.de>
1001 * gimple-predicate-analysis.cc (can_be_invalidated_p):
1002 Restore semantics to the one before the split from
1005 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
1007 * profile.c: Include sreal.h
1008 (struct bb_stats): New.
1009 (cmp_stats): New function.
1010 (compute_branch_probabilities): Output bb stats.
1012 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
1014 * cfghooks.c: Include sreal.h, profile.h.
1015 (profile_record_check_consistency): Fix checking of count counsistency;
1016 record also dynamic mismatches.
1017 * cfgrtl.c (rtl_account_profile_record): Similarly.
1018 * tree-cfg.c (gimple_account_profile_record): Likewise.
1019 * cfghooks.h (struct profile_record): Remove num_mismatched_freq_in,
1020 num_mismatched_freq_out, turn time to double, add
1021 dyn_mismatched_prob_out, dyn_mismatched_count_in,
1022 num_mismatched_prob_out; remove num_mismatched_count_out.
1023 * passes.c (account_profile_1): New function.
1024 (account_profile_in_list): New function.
1025 (pass_manager::dump_profile_report): Rewrite.
1026 (execute_one_ipa_transform_pass): Check profile consistency after
1028 (execute_all_ipa_transforms): Remove cfun test; record all transform
1030 (execute_one_pass): Fix collecting of profile stats.
1032 2021-11-27 Jakub Jelinek <jakub@redhat.com>
1034 PR middle-end/103431
1035 * config/i386/i386.md (x86_64_shld, x86_shld, x86_64_shrd, x86_shrd):
1036 Change insn pattern to accurately describe the instructions.
1037 (*x86_64_shld_1, *x86_shld_1, *x86_64_shrd_1, *x86_shrd_1): New
1038 define_insn patterns.
1039 (*x86_64_shld_2, *x86_shld_2, *x86_64_shrd_2, *x86_shrd_2): New
1040 define_insn_and_split patterns.
1041 (*ashl<dwi>3_doubleword_mask, *ashl<dwi>3_doubleword_mask_1,
1042 *<insn><dwi>3_doubleword_mask, *<insn><dwi>3_doubleword_mask_1,
1043 ix86_rotl<dwi>3_doubleword, ix86_rotr<dwi>3_doubleword): Adjust
1044 splitters for x86_{,64_}sh{l,r}d pattern changes.
1046 2021-11-27 Jakub Jelinek <jakub@redhat.com>
1048 PR tree-optimization/103435
1049 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Avoid UB if
1050 n->range - rsize == 8, just clear both *cmpnop and *cmpxchg in that
1053 2021-11-27 Martin Jambor <mjambor@suse.cz>
1056 * ipa-prop.c (ipcp_transform_function): Call
1057 delete_unreachable_blocks_update_callgraph instead of returning
1060 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
1062 * tree-object-size.c (object_sizes_grow, object_sizes_release,
1063 object_sizes_unknown_p, object_sizes_get, object_size_set_force,
1064 object_sizes_set): New functions.
1065 (addr_object_size, compute_builtin_object_size,
1066 expr_object_size, call_object_size, unknown_object_size,
1067 merge_object_sizes, plus_stmt_object_size,
1068 cond_expr_object_size, collect_object_sizes_for,
1069 check_for_plus_in_loops_1, init_object_sizes,
1070 fini_object_sizes): Adjust.
1072 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
1074 * tree-object-size.c: New enum.
1075 (object_sizes, computed, addr_object_size,
1076 compute_builtin_object_size, expr_object_size, call_object_size,
1077 merge_object_sizes, plus_stmt_object_size,
1078 collect_object_sizes_for, init_object_sizes, fini_object_sizes,
1079 object_sizes_execute): Replace magic numbers with enums.
1081 2021-11-26 Roger Sayle <roger@nextmovesoftware.com>
1083 * tree-ssa-loop-ivopts.c (cand_value_at): Take a class
1084 tree_niter_desc* argument instead of just a tree for NITER.
1085 If we require the iv candidate value at the end of the final
1086 loop iteration, try using the original loop bound as the
1087 NITER for sufficiently simple loops.
1088 (may_eliminate_iv): Update (only) call to cand_value_at.
1090 2021-11-26 Jan Hubicka <hubicka@ucw.cz>
1092 * ipa-modref.c (analyze_function): Drop parameter F and dump
1093 cgraph node name rather than cfun name.
1094 (modref_generate): Update.
1095 (modref_summaries::insert):Update.
1096 (modref_summaries_lto::insert):Update.
1097 (pass_modref::execute):Update.
1098 (ipa_merge_modref_summary_after_inlining): Improve combining of
1101 2021-11-26 Jan Hubicka <jh@suse.cz>
1104 * ipa-modref.c (update_escape_summary_1): Fix handling of min_flags.
1106 2021-11-26 konglin1 <lingling.kong@intel.com>
1109 * config/i386/i386.c (ix86_can_change_mode_class): Allow 16 bit data in XMM register
1111 * config/i386/i386.md (extendhfsf2): Add extenndhfsf2 for TARGET_F16C.
1112 (extendhfdf2): Restrict extendhfdf for TARGET_AVX512FP16 only.
1113 (*extendhf<mode>2): Rename from extendhf<mode>2.
1114 (truncsfhf2): Likewise.
1115 (truncdfhf2): Likewise.
1116 (*trunc<mode>2): Likewise.
1118 2021-11-26 liuhongt <hongtao.liu@intel.com>
1120 PR middle-end/103419
1121 * match.pd: Fix typo, use the type of second parameter, not
1124 2021-11-25 Jan Hubicka <jh@suse.cz>
1126 * ipa-cp.c (ipa_value_range_from_jfunc): Remove forgotten early return.
1128 2021-11-25 Roger Sayle <roger@nextmovesoftware.com>
1130 PR middle-end/103406
1131 * match.pd (minus @0 @0): Check tree_expr_maybe_infinite_p.
1133 2021-11-25 Martin Jambor <mjambor@suse.cz>
1136 * ipa-prop.h (ipa_get_param): New overload. Move bits of the existing
1138 * ipa-param-manipulation.h (ipa_param_adjustments): New member
1139 function get_updated_index_or_split.
1140 * ipa-param-manipulation.c
1141 (ipa_param_adjustments::get_updated_index_or_split): New function.
1142 * ipa-prop.c (adjust_agg_replacement_values): Reimplement, add
1143 capability to identify scalarized parameters and perform substitution
1145 (ipcp_transform_function): Create descriptors earlier, handle new
1146 return values of adjust_agg_replacement_values.
1148 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1150 * gimple-range-path.cc (path_range_query::compute_ranges_defined): Remove.
1151 (path_range_query::compute_ranges_in_block): Revert to bitmap order.
1152 * gimple-range-path.h: Remove compute_ranges_defined.
1154 2021-11-25 Andrew Stubbs <ams@codesourcery.com>
1157 * config/gcn/gcn.c (move_callee_saved_registers): Ensure that the
1158 number of spilled registers is counted correctly.
1160 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
1162 * ipa-prop.h (ipa_node_params::ipa_node_params): Initialize
1165 2021-11-25 Andrew MacLeod <amacleod@redhat.com>
1167 PR tree-optimization/103359
1168 * gimple-range-fold.cc (fold_using_range::range_of_phi): If arg is
1169 equivalent to def, don't initially include it's range.
1171 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
1173 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not check
1174 gimple_static_chain.
1176 2021-11-25 Richard Biener <rguenther@suse.de>
1178 * cgraphunit.c (symbol_table::output_weakrefs): Remove
1180 (get_alias_symbol): Remove now unused function.
1182 2021-11-25 Richard Biener <rguenther@suse.de>
1184 * cfgrtl.c (rtl_verify_fallthru): Do not stop verifying
1186 (skip_insns_after_block): Remove unreachable break and continue.
1188 2021-11-25 Richard Biener <rguenther@suse.de>
1190 * cfgexpand.c (label_rtx_for_bb): Remove dead loop construct.
1192 2021-11-25 Richard Biener <rguenther@suse.de>
1194 * regset.h (REG_SET_EMPTY_P): New macro.
1195 * cfgcleanup.c (thread_jump): Use REG_SET_EMPTY_P.
1197 2021-11-25 Martin Liska <mliska@suse.cz>
1199 * doc/invoke.texi: Use @option for -Wuninitialized.
1201 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1203 PR tree-optimization/103254
1204 * gimple-range-path.cc (path_range_query::compute_ranges): Move
1205 exported boolean code...
1206 (path_range_query::compute_imports): ...here.
1208 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1210 PR tree-optimization/103254
1211 * gimple-range-path.cc (path_range_query::compute_ranges_defined): New
1212 (path_range_query::compute_ranges_in_block): Move to
1213 compute_ranges_defined.
1214 * gimple-range-path.h (compute_ranges_defined): New.
1216 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1218 PR tree-optimization/103417
1219 * match.pd ((X & Y) CMP 0): Only handle eq and ne. Commonalize
1222 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1224 PR tree-optimization/103376
1225 * gimple-ssa-store-merging.c (perform_symbolic_merge): For
1226 BIT_IOR_EXPR, if masked1 && masked2 && masked1 != masked2, don't
1227 punt, but set the corresponding result byte to MARKER_BYTE_UNKNOWN.
1228 For BIT_XOR_EXPR similarly and if masked1 == masked2 and the
1229 byte isn't MARKER_BYTE_UNKNOWN, set the corresponding result byte to
1232 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1235 * doc/invoke.texi (-Wcomma-subscript): Document that for
1236 -std=c++20 the option isn't enabled by default with -Wno-deprecated
1237 but for -std=c++23 it is.
1239 2021-11-24 Jan Hubicka <hubicka@ucw.cz>
1241 * ipa-modref.c (implicit_eaf_flags_for_edge_and_arg): Break out from...
1242 (modref_merge_call_site_flags): ... here.
1243 (ipa_merge_modref_summary_after_inlining): Use it.
1245 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
1247 * cfgloop.c (verify_loop_structure): Reduce scope of
1248 'class loop *loop' variable.
1249 * ipa-fnsummary.c (analyze_function_body): Likewise.
1250 * loop-init.c (fix_loop_structure): Likewise.
1251 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
1252 * predict.c (predict_loops): Likewise.
1253 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
1254 * tree-vectorizer.c (pass_vectorize::execute): Likewise.
1256 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1258 PR tree-optimization/103231
1259 * gimple-range.cc (gimple_ranger::gimple_ranger): Create stmt stack.
1260 (gimple_ranger::gimple_ranger): Delete stmt stack.
1261 (gimple_ranger::range_of_stmt): Process depenedencies if they have no
1263 (gimple_ranger::prefill_name): New.
1264 (gimple_ranger::prefill_stmt_dependencies): New.
1265 * gimple-range.h (class gimple_ranger): Add prototypes.
1267 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1269 * gimple-range-cache.cc (ranger_cache::get_global_range): Always
1270 return a range, return if it came from the cache or not.
1271 (get_non_stale_global_range): Rename to get_global_range, and return
1272 the temporal state in a flag.
1273 * gimple-range-cache.h (get_non_stale_global_range): Rename and adjust.
1274 * gimple-range.cc (gimple_ranger::range_of_expr): No need to query
1276 (gimple_ranger::range_of_stmt): Adjust for global cache temporal state
1279 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1281 * gimple-range.cc (gimple_ranger::range_on_edge): Call trailer when
1282 a constant is encountered to terminate the trace.
1284 2021-11-24 Maciej W. Rozycki <macro@embecosm.com>
1286 PR middle-end/103059
1287 * reload.c (find_reloads_address_1): Also accept the ASHIFT form
1288 of indexed addressing.
1289 (find_reloads): Adjust accordingly.
1291 2021-11-24 Richard Biener <rguenther@suse.de>
1292 Jan Hubicka <jh@suse.cz>
1294 PR tree-optimization/103168
1295 * ipa-modref.h (struct modref_summary): Add load_accesses.
1296 * ipa-modref.c (modref_summary::finalize): Initialize load_accesses.
1297 * tree-ssa-sccvn.c (visit_reference_op_call): Use modref
1298 info to walk the virtual use->def chain to CSE const/pure
1299 function calls possibly reading from memory.
1301 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
1303 PR middle-end/103244
1304 * gimplify.c (gimplify_adjust_omp_clauses): Restore previous
1307 2021-11-24 Richard Biener <rguenther@suse.de>
1309 PR middle-end/103193
1310 * match.pd: Avoid canonicalizing (le/ge @0 @0) to (eq @0 @0)
1311 with NaNs and -ftrapping-math.
1313 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1315 PR middle-end/103384
1316 * omp-general.c (omp_context_selector_matches): For ACCEL_COMPILER,
1317 return 0 for kind(host) and continue for kind(nohost).
1319 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1321 PR middle-end/103365
1322 * attribs.h (lookup_attribute): Allow attr_name to start with
1323 underscore, as long as canonicalize_attr_name returns false.
1324 (lookup_attribute_by_prefix): Don't call get_attribute_name twice.
1325 * attribs.c (extract_attribute_substring): Reimplement using
1326 canonicalize_attr_name.
1327 (register_scoped_attribute): Change gcc_assert into
1328 gcc_checking_assert, verify !canonicalize_attr_name rather than
1329 that str.str doesn't start with '_'.
1331 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1333 PR tree-optimization/103376
1334 * gimple-ssa-store-merging.c (perform_symbolic_merge): Add CODE
1335 argument. If CODE is not BIT_IOR_EXPR, ensure that one of masked1
1337 (find_bswap_or_nop_1, find_bswap_or_nop,
1338 imm_store_chain_info::try_coalesce_bswap): Adjust
1339 perform_symbolic_merge callers.
1341 2021-11-24 Richard Biener <rguenther@suse.de>
1343 * tree-ssa-loop-ivopts.c (find_givs): Take loop body as
1344 argument instead of re-computing it.
1345 (find_interesting_uses): Likewise.
1346 (find_induction_variables): Pass through loop body.
1347 (tree_ssa_iv_optimize_loop): Pass down loop body.
1349 2021-11-24 Tamar Christina <tamar.christina@arm.com>
1351 * tree-ssa-phiopt.c (spaceship_replacement): Handle new canonical
1354 2021-11-24 Tamar Christina <tamar.christina@arm.com>
1356 * tree.c (bitmask_inv_cst_vector_p): New.
1357 * tree.h (bitmask_inv_cst_vector_p): New.
1358 * match.pd: Use it in new bitmask compare pattern.
1360 2021-11-24 Jason Merrill <jason@redhat.com>
1362 * timevar.h (class auto_cond_timevar): New.
1364 2021-11-24 Hongtao Liu <hongtao.liu@intel.com>
1365 H.J. Lu <hongjiu.lu@intel.com>
1367 PR tree-optimization/103194
1368 * match.pd (gimple_nop_atomic_bit_test_and_p): Extended to
1370 * tree-ssa-ccp.c (gimple_nop_convert): Declare.
1371 (optimize_atomic_bit_test_and): Enhance
1372 optimize_atomic_bit_test_and to handle truncation.
1374 2021-11-23 Martin Sebor <msebor@redhat.com>
1377 * Makefile.in (OBJS): Add gimple-warn-recursion.o.
1378 * common.opt: Add -Winfinite-recursion.
1379 * doc/invoke.texi (-Winfinite-recursion): Document.
1380 * passes.def (pass_warn_recursion): Schedule a new pass.
1381 * tree-pass.h (make_pass_warn_recursion): Declare.
1382 * gimple-warn-recursion.c: New file.
1384 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
1386 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Change
1387 error messages for ENB_P8V and ENB_P9V.
1389 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
1391 * config/rs6000/rs6000-builtin-new.def: Add power6-64 stanza. Move
1392 CMPB to power6-64 stanza.
1393 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Handle
1395 (rs6000_new_builtin_is_supported): Likewise.
1396 (rs6000_expand_new_builtin): Likewise. Clean up formatting.
1397 (rs6000_init_builtins): Handle ENB_P6_64 case.
1398 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): Add BSTZ_P6_64.
1399 (stanza_map): Add entry mapping power6-64 to BSTZ_P6_64.
1400 (enable_string): Add "ENB_P6_64".
1401 (write_decls): Add ENB_P6_64 to bif_enable enum.
1403 2021-11-23 Jan Hubicka <jh@suse.cz>
1405 * ipa-modref-tree.h (struct modref_tree): Remove max_bases, max_refs
1407 (modref_tree::modref_tree): Remove parametr.
1408 (modref_tree::insert_base): Add max_bases parameter.
1409 (modref_tree::insert): Add max_bases, max_refs, max_accesses
1411 (modref_tree::insert): New member function.
1412 (modref_tree::merge): Add max_bases, max_refs, max_accesses
1414 (modref_tree::insert): New member function.
1415 * ipa-modref-tree.c (test_insert_search_collapse): Update.
1416 (test_merge): Update.
1417 * ipa-modref.c (dump_records): Don't dump max_refs and max_bases.
1418 (dump_lto_records): Likewise.
1419 (modref_summary::finalize): Fix whitespace.
1420 (get_modref_function_summary): Likewise.
1421 (modref_access_analysis::record_access): Update.
1422 (modref_access_analysis::record_access_lto): Update.
1423 (modref_access_analysis::process_fnspec): Update.
1424 (analyze_function): Update.
1425 (modref_summaries::duplicate): Update.
1426 (modref_summaries_lto::duplicate): Update.
1427 (write_modref_records): Update.
1428 (read_modref_records): Update.
1429 (read_section): Update.
1430 (propagate_unknown_call): Update.
1431 (modref_propagate_in_scc): Update.
1432 (ipa_merge_modref_summary_after_inlining): Update.
1434 2021-11-23 Martin Liska <mliska@suse.cz>
1436 * doc/invoke.texi: Remove 2 more duplicite param descriptions.
1438 2021-11-23 Richard Biener <rguenther@suse.de>
1440 PR tree-optimization/103361
1441 * gimple-loop-jam.c (adjust_unroll_factor): Use lambda_int
1442 for the dependence distance.
1443 * tree-data-ref.c (print_lambda_vector): Properly print a lambda_int.
1445 2021-11-23 Jakub Jelinek <jakub@redhat.com>
1447 * tree-inline.h (struct copy_body_data): Remove
1448 transform_lang_insert_block member.
1449 * tree-inline.c (remap_block): Don't call
1450 id->transform_lang_insert_block.
1451 (optimize_inline_calls, copy_gimple_seq_and_replace_locals,
1452 tree_function_versioning, maybe_inline_call_in_expr,
1453 copy_fn): Don't initialize id.transform_lang_insert_block.
1454 * gimplify.c (gimplify_omp_loop): Likewise.
1456 2021-11-23 Jan Hubicka <hubicka@ucw.cz>
1458 PR tree-optimization/103335
1459 * tree-ssa-dse.c (valid_ao_ref_for_dse): Rename to ...
1460 (valid_ao_ref_kill_for_dse): ... this; do not check that boundaries
1461 are divisible by BITS_PER_UNIT.
1462 (get_byte_aligned_range_containing_ref): New function.
1463 (get_byte_aligned_range_contained_in_ref): New function.
1464 (normalize_ref): Rename to ...
1465 (get_byte_range): ... this one; handle accesses not aligned to byte
1466 boundary; return range in bytes rater than updating ao_ref.
1467 (clear_live_bytes_for_ref): Take write ref by reference; simplify using
1469 (setup_live_bytes_from_ref): Likewise.
1470 (clear_bytes_written_by): Update.
1471 (live_bytes_read): Update.
1472 (dse_classify_store): Simplify tech before live_bytes_read checks.
1474 2021-11-23 Andrew Pinski <apinski@marvell.com>
1476 PR tree-optimization/102216
1477 * gimple-fold.c (fold_stmt_1): Add canonicalization
1478 of "&MEM[ssa_n, CST]" to "ssa_n p+ CST", note this
1479 can only be done if !in_place.
1481 2021-11-23 Jakub Jelinek <jakub@redhat.com>
1483 PR middle-end/102431
1484 * gimplify.c (replace_reduction_placeholders): Remove.
1485 (note_no_context_vars): New function.
1486 (gimplify_omp_loop): For OMP_PARALLEL's BIND_EXPR create a new
1487 BLOCK. Use copy_tree_body_r with walk_tree instead of unshare_expr
1488 and replace_reduction_placeholders for duplication of
1489 OMP_CLAUSE_REDUCTION_{INIT,MERGE} expressions. Ensure all mentioned
1490 automatic vars have DECL_CONTEXT set to non-NULL before doing so
1491 and reset it afterwards for those vars and their corresponding
1494 2021-11-23 Haochen Gui <guihaoc@gcc.gnu.org>
1497 * config/rs6000/altivec.md (altivec_vreve<mode>2 for VEC_K): Use
1498 xxbrq for v16qi, xxbrq + xxbrh for v8hi and xxbrq + xxbrw for v4si
1499 or v4sf when p9_vector is set.
1500 (altivec_vreve<mode>2 for VEC_64): Defined. Implemented by xxswapd.
1502 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
1504 PR tree-optimization/102232
1505 * match.pd (x * (1 + y / x) - y) -> (x - y % x): New optimization.
1507 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
1509 PR tree-optimization/96779
1510 * match.pd (-x == x) -> (x == 0): New optimization.
1512 2021-11-22 Roger Sayle <roger@nextmovesoftware.com>
1514 PR tree-optimization/98953
1515 PR tree-optimization/103345
1516 * gimple-ssa-store-merging.c (find_bswap_or_nop_1): Handle
1517 BIT_XOR_EXPR and PLUS_EXPR the same as BIT_IOR_EXPR.
1518 (pass_optimize_bswap::execute): Likewise.
1520 2021-11-22 Martin Liska <mliska@suse.cz>
1522 * doc/invoke.texi: Remove duplicate documentation for 3 params.
1524 2021-11-22 Richard Biener <rguenther@suse.de>
1526 PR tree-optimization/103351
1527 * tree-ssa-dce.c (sort_phi_args): Sort after e->dest_idx as
1530 2021-11-22 liuhongt <hongtao.liu@intel.com>
1533 * config/i386/constraints.md (Bk): New
1534 define_memory_constraint.
1535 * config/i386/i386-protos.h (ix86_gpr_tls_address_pattern_p):
1537 * config/i386/i386.c (ix86_gpr_tls_address_pattern_p): New
1539 * config/i386/i386.md (*movsi_internal): Don't allow
1540 mask/sse/mmx move in TLS code sequences.
1541 (*movdi_internal): Ditto.
1543 2021-11-22 Kewen Lin <linkw@linux.ibm.com>
1545 * config/xtensa/xtensa.md (movdi_internal, movdf_internal): Fix split
1548 2021-11-21 Jakub Jelinek <jakub@redhat.com>
1551 * config/i386/i386-options.c (ix86_valid_target_attribute_p): If
1552 fndecl already has DECL_FUNCTION_SPECIFIC_TARGET, use that as base
1553 instead of target_option_default_node.
1555 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
1558 * ipa-modref.c (parm_map_for_arg): Rename to ...
1559 (parm_map_for_ptr): .. this one; handle static chain and calls to
1561 (modref_access_analysis::get_access): Use parm_map_for_ptr.
1562 (modref_access_analysis::process_fnspec): Update.
1563 (modref_access_analysis::analyze_load): Update.
1564 (modref_access_analysis::analyze_store): Update.
1566 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
1568 * ipa-modref.c (ignore_nondeterminism_p): Move earlier in source
1570 (ignore_retval_p): Likewise.
1571 (ignore_stores_p): Likewise.
1572 (parm_map_for_arg): Likewise.
1573 (class modref_access_analysis): New class.
1574 (modref_access_analysis::set_side_effects): New member function.
1575 (modref_access_analysis::set_nondeterministic): New member function.
1576 (get_access): Turn to ...
1577 (modref_access_analysis::get_access): ... this one.
1578 (record_access): Turn to ...
1579 (modref_access_analysis::record_access): ... this one.
1580 (record_access_lto): Turn to ...
1581 (modref_access_analysis::record_access_lto): ... This one.
1582 (record_access_p): Turn to ...
1583 (modref_access_analysis::record_access_p): ... This one
1584 (modref_access_analysis::record_unknown_load): New member function.
1585 (modref_access_analysis::record_unknown_store): New member function.
1586 (get_access_for_fnspec): Turn to ...
1587 (modref_access_analysis::get_access_for_fnspec): ... this one.
1588 (merge_call_side_effects): Turn to ...
1589 (moderf_access_analysis::merge_call_side_effects): Turn to ...
1590 (collapse_loads): Move later in source code.
1591 (collapse_stores): Move later in source code.
1592 (process_fnspec): Turn to ...
1593 (modref_access_analysis::process_fnspec): ... this one.
1594 (analyze_call): Turn to ...
1595 (modref_access_analysis::analyze_call): ... this one.
1596 (struct summary_ptrs): Remove.
1597 (analyze_load): Turn to ...
1598 (modref_access_analysis::analyze_load): ... this one.
1599 (analyze_store): Turn to ...
1600 (modref_access_analysis::analyze_store): ... this one.
1601 (analyze_stmt): Turn to ...
1602 (modref_access_analysis::analyze_stmt): ... This one.
1603 (remove_summary): Remove.
1604 (modref_access_analysis::propagate): Break out from ...
1605 (modref_access_analysis::analyze): Break out from ...
1606 (analyze_function): ... here.
1608 2021-11-21 Roger Sayle <roger@nextmovesoftware.com>
1609 Robin Dapp <rdapp@linux.ibm.com>
1612 * tree-ssa-math-opts.c (convert_mult_to_widen): Recognize
1613 signed WIDEN_MULT_EXPR if the target supports umul_widen_optab.
1615 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
1618 * ipa-modref.c (ignore_nondeterminism_p): Allow looping pure/cont.
1619 (merge_call_side_effects): Improve debug output.
1621 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
1624 * ipa-pure-const.c (propagate_pure_const): Fix merging of loping flag.
1626 2021-11-20 Jeff Law <jeffreyalaw@gmail.com>
1628 PR tree-optimization/103226
1629 * config/bfin/bfin.md (doloop pattern, splitter and expander): Clobber
1632 2021-11-20 Andrew Pinski <apinski@marvell.com>
1634 PR tree-optimization/103220
1635 * match.pd ((type) X bitop CST): Don't check if CST
1636 fits into the type if only the sign changes.
1638 2021-11-20 Alexandre Oliva <oliva@adacore.com>
1640 PR tree-optimization/102988
1641 * gimple-harden-conditionals.cc (detach_value): Copy SSA_NAME
1642 without decl sharing.
1644 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
1646 * doc/invoke.texi: Remove whitespace after an @option.
1648 2021-11-19 Paul A. Clarke <pc@us.ibm.com>
1650 * config/rs6000/emmintrin.h (_mm_sad_epu8): Use vec_absd when
1651 _ARCH_PWR9, optimize vec_sum2s when LE.
1653 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
1656 * config/darwin-driver.c (darwin_driver_init): Handle exported
1657 symbols and symbol lists (suppress automatic export of the TLS
1659 * config/darwin.c (darwin_rename_builtins): Remove workaround.
1660 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): Likewise.
1661 (REAL_LIBGCC_SPEC): Handle revised library uses.
1662 * config/darwin.opt (nodefaultexport): New.
1663 * config/i386/darwin.h (PR80556_WORKAROUND): Remove.
1664 * config/i386/darwin32-biarch.h (PR80556_WORKAROUND): Likewise.
1665 * config/i386/darwin64-biarch.h (PR80556_WORKAROUND): Likewise.
1667 2021-11-19 Martin Jambor <mjambor@suse.cz>
1669 * opts.c (default_options_table): Switch off
1670 flag_semantic_interposition at Ofast.
1671 * doc/invoke.texi (Optimize Options): Document that Ofast switches off
1672 -fsemantic-interposition.
1674 2021-11-19 Jan Hubicka <hubicka@ucw.cz>
1676 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not guard modref
1677 by !gimple_call_chain.
1679 2021-11-19 Martin Sebor <msebor@redhat.com>
1683 * doc/invoke.texi (-Waddress): Update.
1685 2021-11-19 Andrew MacLeod <amacleod@redhat.com>
1687 PR tree-optimization/103254
1688 * gimple-range-gori.cc (range_def_chain::get_def_chain): Limit the
1689 depth for all statements with multple ssa names.
1691 2021-11-19 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
1693 * config/s390/s390.md (define_peephole2): Variable insn points
1694 to the first matched insn. Use peep2_next_insn(1) to refer to
1695 the second matched insn.
1697 2021-11-19 Tamar Christina <tamar.christina@arm.com>
1699 PR tree-optimization/103311
1701 * tree-vect-slp-patterns.c (vect_validate_multiplication): Fix CONJ
1702 test to new codegen.
1703 (complex_mul_pattern::matches): Move check downwards.
1705 2021-11-19 Martin Liska <mliska@suse.cz>
1708 2021-11-19 Martin Liska <mliska@suse.cz>
1710 * cfgexpand.c (pass_expand::execute): Use option directly.
1711 * function.c (allocate_struct_function): Likewise.
1712 * gimple-low.c (lower_function_body): Likewise.
1713 (lower_stmt): Likewise.
1714 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
1715 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
1716 * ipa-split.c (split_function): Likewise.
1717 * lto-streamer-in.c (input_function): Likewise.
1718 * sese.c (sese_insert_phis_for_liveouts): Likewise.
1719 * ssa-iterators.h (num_imm_uses): Likewise.
1720 * tree-cfg.c (make_blocks): Likewise.
1721 (gimple_merge_blocks): Likewise.
1722 * tree-inline.c (tree_function_versioning): Likewise.
1723 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
1724 * tree-sra.c (analyze_access_subtree): Likewise.
1725 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
1726 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1727 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
1728 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
1729 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
1730 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
1731 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
1732 (target_for_debug_bind): Likewise.
1733 (insert_debug_temp_for_var_def): Likewise.
1734 (insert_debug_temps_for_defs): Likewise.
1735 (reset_debug_uses): Likewise.
1736 * tree-ssanames.c (release_ssa_name_fn): Likewise.
1737 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
1738 (adjust_debug_stmts): Likewise.
1739 (adjust_phi_and_debug_stmts): Likewise.
1740 (vect_do_peeling): Likewise.
1741 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
1742 (vect_transform_loop): Likewise.
1743 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
1744 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
1745 (MAY_HAVE_DEBUG_STMTS): Use options directly.
1747 2021-11-19 Giuliano Belinassi <gbelinassi@suse.de>
1749 * gcc.c (process_command): Skip dumpdir override if file is a
1751 * doc/invoke.texi: Update -dumpdir documentation.
1753 2021-11-19 Andrew Pinski <apinski@marvell.com>
1755 PR tree-optimization/103314
1756 * match.pd ((type) X op CST): Restrict the equal
1757 TYPE_PRECISION case to GIMPLE only.
1759 2021-11-19 Martin Liska <mliska@suse.cz>
1762 * ipa-modref-tree.h (struct modref_parm_map): Add default
1764 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Use it.
1766 2021-11-19 Richard Biener <rguenther@suse.de>
1768 PR middle-end/103248
1769 * tree-eh.c (operation_could_trap_helper_p): Properly handle
1770 fixed-point RDIV_EXPR.
1772 2021-11-19 Richard Biener <rguenther@suse.de>
1774 PR tree-optimization/102436
1775 * tree-ssa-loop-im.c (execute_sm_if_changed): Add mode
1776 to just create the if structure and return the then block.
1777 (execute_sm): Add flag to indicate the var will re-use
1779 (hoist_memory_references): Support a single conditional
1780 block with all stores as special case.
1782 2021-11-19 Andrew Pinski <apinski@marvell.com>
1784 PR tree-optimization/103317
1785 * tree-ssa-phiopt.c (minmax_replacement): For the non empty
1786 middle bb case, check to make sure it has a single predecessor.
1788 2021-11-19 Andrew Pinski <apinski@marvell.com>
1790 PR tree-optimization/103257
1792 ((m1 >/</>=/<= m2) * d -> (m1 >/</>=/<= m2) ? d : 0):
1793 Disable until !canonicalize_math_p ().
1795 2021-11-19 Marek Polacek <polacek@redhat.com>
1799 * doc/invoke.texi: Update documentation for -Wuninitialized.
1800 * tree.c (stabilize_reference): Set location.
1802 2021-11-19 liuhongt <hongtao.liu@intel.com>
1805 * config/i386/x86-tune-costs.h (skylake_cost): Reduce cost of
1806 storing 256/512-bit SSE register to be equal to cost of
1807 unaligned store to avoid odd alignment peeling.
1808 (icelake_cost): Ditto.
1810 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
1812 * config/rs6000/predicates.md (current_file_function_operand):
1813 Add flag_semantic_interposition to call of decl_replaceable_p.
1815 2021-11-18 Martin Liska <mliska@suse.cz>
1817 * ipa-modref.c (analyze_function): Do not execute the code
1818 only if dump_file != NULL.
1820 2021-11-18 Martin Liska <mliska@suse.cz>
1822 * ipa-modref.c (analyze_function): Use fnode instead of repeated
1823 cgraph_node::get (current_function_decl).
1825 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
1827 * cgraph.c (cgraph_node::get_availability): Update call of
1829 (cgraph_node::verify_node): Verify that semantic_interposition flag
1831 * cgraph.h: (symtab_node): Add semantic_interposition flag.
1832 * cgraphclones.c (set_new_clone_decl_and_node_flags): Clear
1833 semantic_interposition flag.
1834 * cgraphunit.c (cgraph_node::finalize_function): Set
1835 semantic_interposition flag.
1836 (cgraph_node::add_new_function): Likewise.
1837 (varpool_node::finalize_decl): Likewise.
1838 (cgraph_node::create_wrapper): Likewise.
1839 * common.opt (fsemantic-interposition): Turn to optimization node.
1840 * lto-cgraph.c (lto_output_node): Stream semantic_interposition.
1841 (lto_output_varpool_node): Likewise.
1842 (input_overwrite_node): Likewise.
1843 (input_varpool_node): Likewise.
1844 * symtab.c (symtab_node::dump_base): Dump new flag.
1845 * varasm.c (decl_replaceable_p): Add semantic_interposition_p
1847 * varasm.h (decl_replaceable_p): Update declaration.
1848 * varpool.c (varpool_node::ctor_useable_for_folding_p):
1849 Use semantic_interposition flag.
1850 (varpool_node::get_availability): Likewise.
1851 (varpool_node::create_alias): Copy semantic_interposition flag.
1853 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
1856 * ipa-modref.c (modref_eaf_analysis::merge_call_lhs_flags): Unused
1857 parameter may still be returned.
1858 (modref_eaf_analysis::analyze_ssa_name): Call merge_call_lhs_flags
1859 even for unused function args.
1861 2021-11-18 Tamar Christina <tamar.christina@arm.com>
1863 PR tree-optimization/103311
1864 * tree-vect-slp-patterns.c (complex_mul_pattern::matches,
1865 complex_fms_pattern::matches): Check for multiplications.
1867 2021-11-18 H.J. Lu <hjl.tools@gmail.com>
1870 * config/i386/i386.c (ix86_output_jmp_thunk_or_indirect): Emit
1871 CS prefix for -mindirect-branch-cs-prefix.
1872 (ix86_output_indirect_branch_via_reg): Likewise.
1873 * config/i386/i386.opt: Add -mindirect-branch-cs-prefix.
1874 * doc/invoke.texi: Document -mindirect-branch-cs-prefix.
1876 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
1878 * config/rs6000/driver-rs6000.c (detect_processor_aix): Add
1881 2021-11-18 Tamar Christina <tamar.christina@arm.com>
1883 PR tree-optimization/103253
1884 * tree-ssa-math-opts.c (convert_mult_to_fma): Check for LHS.
1886 2021-11-18 Matthias Kretz <m.kretz@gsi.de>
1888 * doc/extend.texi: Document __builtin_assoc_barrier.
1890 2021-11-18 Martin Liska <mliska@suse.cz>
1892 * cfgexpand.c (pass_expand::execute): Use option directly.
1893 * function.c (allocate_struct_function): Likewise.
1894 * gimple-low.c (lower_function_body): Likewise.
1895 (lower_stmt): Likewise.
1896 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
1897 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
1898 * ipa-split.c (split_function): Likewise.
1899 * lto-streamer-in.c (input_function): Likewise.
1900 * sese.c (sese_insert_phis_for_liveouts): Likewise.
1901 * ssa-iterators.h (num_imm_uses): Likewise.
1902 * tree-cfg.c (make_blocks): Likewise.
1903 (gimple_merge_blocks): Likewise.
1904 * tree-inline.c (tree_function_versioning): Likewise.
1905 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
1906 * tree-sra.c (analyze_access_subtree): Likewise.
1907 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
1908 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1909 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
1910 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
1911 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
1912 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
1913 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
1914 (target_for_debug_bind): Likewise.
1915 (insert_debug_temp_for_var_def): Likewise.
1916 (insert_debug_temps_for_defs): Likewise.
1917 (reset_debug_uses): Likewise.
1918 * tree-ssanames.c (release_ssa_name_fn): Likewise.
1919 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
1920 (adjust_debug_stmts): Likewise.
1921 (adjust_phi_and_debug_stmts): Likewise.
1922 (vect_do_peeling): Likewise.
1923 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
1924 (vect_transform_loop): Likewise.
1925 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
1926 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
1927 (MAY_HAVE_DEBUG_STMTS): Use options directly.
1929 2021-11-18 Richard Biener <rguenther@suse.de>
1931 PR tree-optimization/103277
1932 * tree-ssa-dse.c (need_ab_cleanup): New.
1933 (dse_optimize_redundant_stores): Adjust.
1934 (delete_dead_or_redundant_assignment): Get extra
1935 need_ab_cleanup argument and set when abnormal cleanup is
1937 (dse_optimize_call): Adjust.
1938 (dse_optimize_stmt): Likewise.
1939 (pass_dse::execute): Allocate and deallocate need_ab_cleanup.
1940 Perform abnormal cleanup.
1941 * tree-ssa-dse.h (delete_dead_or_redundant_assignment): Adjust.
1943 2021-11-18 Hongyu Wang <hongyu.wang@intel.com>
1945 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
1946 Adjust generated cfg to avoid infinite loop.
1948 2021-11-18 konglin1 <lingling.kong@intel.com>
1950 * config/i386/avx512fp16intrin.h (_mm512_mul_pch): Add alias for _mm512_fmul_pch.
1951 (_mm512_mask_mul_pch): Likewise.
1952 (_mm512_maskz_mul_pch): Likewise.
1953 (_mm512_mul_round_pch): Likewise.
1954 (_mm512_mask_mul_round_pch): Likewise.
1955 (_mm512_maskz_mul_round_pch): Likewise.
1956 (_mm512_cmul_pch): Likewise.
1957 (_mm512_mask_cmul_pch): Likewise.
1958 (_mm512_maskz_cmul_pch): Likewise.
1959 (_mm512_cmul_round_pch): Likewise.
1960 (_mm512_mask_cmul_round_pch): Likewise.
1961 (_mm512_maskz_cmul_round_pch): Likewise.
1962 (_mm_mul_sch): Likewise.
1963 (_mm_mask_mul_sch): Likewise.
1964 (_mm_maskz_mul_sch): Likewise.
1965 (_mm_mul_round_sch): Likewise.
1966 (_mm_mask_mul_round_sch): Likewise.
1967 (_mm_maskz_mul_round_sch): Likewise.
1968 (_mm_cmul_sch): Likewise.
1969 (_mm_mask_cmul_sch): Likewise.
1970 (_mm_maskz_cmul_sch): Likewise.
1971 (_mm_cmul_round_sch): Likewise.
1972 (_mm_mask_cmul_round_sch): Likewise.
1973 (_mm_maskz_cmul_round_sch): Likewise.
1974 * config/i386/avx512fp16vlintrin.h (_mm_mul_pch): Likewise.
1975 (_mm_mask_mul_pch): Likewise.
1976 (_mm_maskz_mul_pch): Likewise.
1977 (_mm256_mul_pch): Likewise.
1978 (_mm256_mask_mul_pch): Likewise.
1979 (_mm256_maskz_mul_pch): Likewise.
1980 (_mm_cmul_pch): Likewise.
1981 (_mm_mask_cmul_pch): Likewise.
1982 (_mm_maskz_cmul_pch): Likewise.
1983 (_mm256_cmul_pch): Likewise.
1984 (_mm256_mask_cmul_pch): Likewise.
1985 (_mm256_maskz_cmul_pch): Likewise.
1987 2021-11-17 Andrew Pinski <apinski@marvell.com>
1989 PR tree-optimization/103228
1990 PR tree-optimization/55177
1991 * match.pd ((type) X bitop CST): Also do this
1992 transformation for nop conversions.
1994 2021-11-17 Martin Sebor <msebor@redhat.com>
1996 PR tree-optimization/102759
1997 * gimple-array-bounds.cc (build_printable_array_type): Move...
1998 * gimple-ssa-warn-access.cc (build_printable_array_type): Avoid
1999 pathological function redeclarations that remove a previously
2001 Improve formatting of function arguments in informational notes.
2002 * pointer-query.cc (build_printable_array_type): ...to here.
2003 * pointer-query.h (build_printable_array_type): Declared.
2005 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
2008 * config/i386/i386-opts.h (harden_sls): New enum.
2009 * config/i386/i386.c (output_indirect_thunk): Mitigate against
2010 SLS for function return.
2011 (ix86_output_function_return): Likewise.
2012 (ix86_output_jmp_thunk_or_indirect): Mitigate against indirect
2014 (ix86_output_indirect_jmp): Likewise.
2015 (ix86_output_call_insn): Likewise.
2016 * config/i386/i386.opt: Add -mharden-sls=.
2017 * doc/invoke.texi: Document -mharden-sls=.
2019 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
2022 * config/i386/i386.c (ix86_code_end): Remove "%!" before ret.
2023 (ix86_output_function_return): Likewise.
2024 * config/i386/i386.md (simple_return_pop_internal): Likewise.
2026 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
2029 * ipa-modref.c (read_modref_records): Fix streaminig in of every_access
2032 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
2034 * config/i386/i386.c (indirect_thunks_used): Redefine as HARD_REG_SET.
2035 (ix86_code_end): Use TEST_HARD_REG_BIT on indirect_thunks_used.
2036 (ix86_output_indirect_branch_via_reg): Use SET_HARD_REG_BIT
2037 on indirect_thunks_used.
2038 (ix86_output_indirect_function_return): Ditto.
2040 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
2042 * ipa-modref-tree.c: Include cgraph.h and tree-streamer.h.
2043 (modref_access_node::stream_out): New member function.
2044 (modref_access_node::stream_in): New member function.
2045 * ipa-modref-tree.h (modref_access_node::stream_out,
2046 modref_access_node::stream_in): Declare.
2047 * ipa-modref.c (modref_summary_lto::useful_p): Free useless kills.
2048 (modref_summary_lto::dump): Dump kills.
2049 (analyze_store): Record kills for LTO
2050 (analyze_stmt): Likewise.
2051 (modref_summaries_lto::duplicate): Duplicate kills.
2052 (write_modref_records): Use new stream_out member function.
2053 (read_modref_records): Likewise.
2054 (modref_write): Stream out kills.
2055 (read_section): Stream in kills
2056 (remap_kills): New function.
2057 (update_signature): Use it.
2059 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
2061 * config/i386/i386.h (LEGACY_SSE_REGNO_P): New predicate.
2062 (SSE_REGNO_P): Use LEGACY_SSE_REGNO_P predicate.
2063 * config/i386/i386.c (zero_all_vector_registers):
2064 Use LEGACY_SSE_REGNO_P predicate.
2065 (ix86_register_priority): Use REX_INT_REGNO_P, REX_SSE_REGNO_P
2066 and EXT_REG_SSE_REGNO_P predicates.
2067 (ix86_hard_regno_call_part_clobbered): Use REX_SSE_REGNO_P
2068 and LEGACY_SSE_REGNO_P predicates.
2070 2021-11-17 Jason Merrill <jason@redhat.com>
2072 * doc/invoke.texi (C++ Dialect Options): Document
2073 -fimplicit-constexpr.
2075 2021-11-17 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2077 * config/aarch64/aarch64-modes.def (VECTOR_MODE): New V8DI mode.
2078 * config/aarch64/aarch64.c (aarch64_hard_regno_mode_ok): Handle
2080 * config/aarch64/iterators.md (define_mode_attr nunits): Add entry
2083 2021-11-17 Martin Uecker <uecker@gcc.gnu.org>
2087 * gimplify.c (gimplify_var_or_parm_decl): Update comment.
2088 (gimplify_compound_lval): Gimplify base expression first.
2089 (gimplify_target_expr): Add comment.
2091 2021-11-17 Jakub Jelinek <jakub@redhat.com>
2093 PR tree-optimization/103192
2094 * tree-ssa-loop-im.c (move_computations_worker): Use
2095 reset_flow_sensitive_info instead of manually clearing
2096 SSA_NAME_RANGE_INFO and do it for all SSA_NAMEs, not just ones
2097 with integral types.
2099 2021-11-17 Jakub Jelinek <jakub@redhat.com>
2101 PR tree-optimization/103255
2102 * gimple-range-fold.cc (fold_using_range::range_of_address): Return
2103 range_nonzero rather than unadjusted base's range. Formatting fixes.
2105 2021-11-17 Richard Sandiford <richard.sandiford@arm.com>
2107 * doc/md.texi (cond_fmin@var{mode}, cond_fmax@var{mode}): Document.
2108 * optabs.def (cond_fmin_optab, cond_fmax_optab): New optabs.
2109 * internal-fn.def (COND_FMIN, COND_FMAX): New functions.
2110 * internal-fn.c (first_commutative_argument): Handle them.
2111 (FOR_EACH_COND_FN_PAIR): Likewise.
2112 * match.pd (UNCOND_BINARY, COND_BINARY): Likewise.
2113 * config/aarch64/aarch64-sve.md (cond_<fmaxmin><mode>): New
2116 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
2118 * config/i386/i386.md (*add<dwi>3_doubleword, *addv<dwi>4_doubleword,
2119 *addv<dwi>4_doubleword_1, *sub<dwi>3_doubleword,
2120 *subv<dwi>4_doubleword, *subv<dwi>4_doubleword_1,
2121 *add<dwi>3_doubleword_cc_overflow_1, *divmodsi4_const,
2122 *neg<dwi>2_doubleword, *tls_dynamic_gnu2_combine_64_<mode>): Fix split
2125 2021-11-17 Andrew Pinski <apinski@marvell.com>
2127 PR tree-optimization/103288
2128 * tree-ssa-phiopt.c (value_replacement): Return early if middle
2129 block has more than one pred.
2131 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
2133 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
2134 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
2135 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
2136 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
2137 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
2138 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
2139 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
2140 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
2142 2021-11-17 Marek Polacek <polacek@redhat.com>
2144 PR preprocessor/103026
2145 * doc/invoke.texi: Document -Wbidi-chars.
2147 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
2150 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Fix clearing
2153 2021-11-16 Jan Hubicka <jh@suse.cz>
2155 * ipa-modref.c (get_modref_function_summary): Declare.
2156 * ipa-modref.h (get_modref_function_summary): New function.
2157 * tree-ssa-dse.c (clear_live_bytes_for_ref): Break out from ...
2158 (clear_bytes_written_by): ... here; also clear memory killed by
2161 2021-11-16 Iain Sandoe <iain@sandoe.co.uk>
2163 * ggc-common.c (gt_pch_save): If we cannot find a suitable
2164 memory segment for save, then error-out, do not try to
2166 (gt_pch_restore): Save the existing line table, and when
2167 the replacement is being read, use that when constructing
2170 2021-11-16 Peter Bergner <bergner@linux.ibm.com>
2173 * config/rs6000/mma.md (*vsx_assemble_pair): Add early-clobber for
2175 (*mma_assemble_acc): Likewise.
2177 2021-11-16 Martin Sebor <msebor@redhat.com>
2179 PR tree-optimization/102960
2180 * gimple-fold.c (get_range_strlen): Take bitmap as an argument rather
2181 than a pointer to it.
2182 (get_range_strlen_tree): Same. Remove bitmap allocation. Use
2184 (get_maxval_strlen): Use an auto_bitmap.
2185 * tree-ssa-strlen.c (get_range_strlen_dynamic): Factor out PHI
2187 (get_range_strlen_phi): ...into this function.
2188 Avoid assuming maximum string length is constant
2189 (printf_strlen_execute): Dump pointer query cache contents when
2190 details are requisted.
2192 2021-11-16 Jason Merrill <jason@redhat.com>
2194 * langhooks.h (struct lang_hooks): Adjust comment.
2195 * print-tree.c (print_node): Also call print_xnode hook for
2198 2021-11-16 Andrew Pinski <apinski@marvell.com>
2200 PR tree-optimization/103218
2201 * match.pd: New pattern for "((type)(a<0)) << SIGNBITOFA".
2203 2021-11-16 Claudiu Zissulescu <claziss@synopsys.com>
2205 * config/arc/arc.md (maddhisi4): Use a single move to accumulator.
2206 (umaddhisi4): Likewise.
2207 (machi): Update pattern.
2210 2021-11-16 Richard Biener <rguenther@suse.de>
2212 PR tree-optimization/102880
2213 * tree-ssa-dce.c (sort_phi_args): New function.
2214 (make_forwarders_with_degenerate_phis): Likewise.
2215 (perform_tree_ssa_dce): Call
2216 make_forwarders_with_degenerate_phis.
2218 2021-11-16 Richard Biener <rguenther@suse.de>
2220 PR tree-optimization/102880
2221 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Push
2222 single_pred (bb1) condition to places that really need it.
2223 (match_simplify_replacement): Likewise.
2224 (value_replacement): Likewise.
2225 (replace_phi_edge_with_variable): Deal with extra edges
2228 2021-11-16 Martin Jambor <mjambor@suse.cz>
2230 * cfgexpand.c (expand_gimple_basic_block): Use build_debug_expr_decl,
2231 add a fixme note about the mode assignment perhaps being unnecessary.
2232 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
2234 (ipa_param_body_adjustments::mark_dead_statements): Likewise.
2235 (ipa_param_body_adjustments::reset_debug_stmts): Likewise.
2236 * tree-inline.c (remap_ssa_name): Likewise.
2237 (tree_function_versioning): Likewise.
2238 * tree-into-ssa.c (rewrite_debug_stmt_uses): Likewise.
2239 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
2240 * tree-ssa.c (insert_debug_temp_for_var_def): Likewise.
2242 2021-11-16 Jakub Jelinek <jakub@redhat.com>
2244 PR tree-optimization/103208
2245 * omp-expand.c (expand_omp_build_cond): New function.
2246 (expand_omp_for_init_counts, expand_omp_for_init_vars,
2247 expand_omp_for_static_nochunk, expand_omp_for_static_chunk): Use it.
2249 2021-11-16 Jakub Jelinek <jakub@redhat.com>
2251 PR tree-optimization/102009
2252 * gimple-ssa-warn-access.cc (pass_waccess::check_alloc_size_call):
2253 Punt if any of alloc_size arguments is out of bounds vs. number of
2256 2021-11-16 Roger Sayle <roger@nextmovesoftware.com>
2258 * config/i386/i386.md (*bmi2_rorx<mode3>_1): Make conditional
2259 on !optimize_function_for_size_p.
2260 (*<any_rotate><mode>3_1): Add preferred_for_size attribute.
2261 (define_splits): Conditionalize on !optimize_function_for_size_p.
2262 (*bmi2_rorxsi3_1_zext): Likewise.
2263 (*<any_rotate>si2_1_zext): Add preferred_for_size attribute.
2264 (define_splits): Conditionalize on !optimize_function_for_size_p.
2266 2021-11-16 Jan Hubicka <jh@suse.cz>
2269 * ipa-modref.c (merge_call_side_effects): Fix uninitialized
2272 2021-11-16 Andrew Pinski <apinski@marvell.com>
2274 PR tree-optimization/103245
2275 * match.pd: Combine the abs pattern matching using multiplication.
2276 Adding optional nop_convert too.
2278 2021-11-16 H.J. Lu <hjl.tools@gmail.com>
2280 PR middle-end/103268
2281 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Add a missing
2284 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2286 * gimple-fold.c (gimple_fold_builtin_strncat): Use ranges to
2287 determine if it is safe to transform to strcat.
2288 (gimple_fold_builtin_snprintf): Likewise.
2290 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2292 * gimple-fold.c (known_lower): New function.
2293 (gimple_fold_builtin_strncat_chk,
2294 gimple_fold_builtin_memory_chk, gimple_fold_builtin_stxcpy_chk,
2295 gimple_fold_builtin_stxncpy_chk,
2296 gimple_fold_builtin_snprintf_chk,
2297 gimple_fold_builtin_sprintf_chk): Use it.
2299 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2301 * gimple-fold.c (dump_transformation): New function.
2302 (gimple_fold_builtin_stxcpy_chk,
2303 gimple_fold_builtin_stxncpy_chk): Use it. Simplify to
2304 BUILT_IN_STRNCPY if return value is not used.
2306 2021-11-15 H.J. Lu <hjl.tools@gmail.com>
2308 PR middle-end/103184
2309 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Check optab
2310 before transforming equivalent, but slighly different cases to
2311 their canonical forms.
2313 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
2316 * config/darwin.h (TARGET_DTORS_FROM_CXA_ATEXIT): New.
2317 * doc/tm.texi: Regenerated.
2318 * doc/tm.texi.in: Add TARGET_DTORS_FROM_CXA_ATEXIT hook.
2319 * ipa.c (cgraph_build_static_cdtor_1): Return the built
2321 (build_cxa_atexit_decl): New.
2322 (build_dso_handle_decl): New.
2323 (build_cxa_dtor_registrations): New.
2324 (compare_cdtor_tu_order): New.
2325 (build_cxa_atexit_fns): New.
2326 (ipa_cdtor_merge): If dtors_from_cxa_atexit is set,
2327 process the DTORs/CTORs accordingly.
2328 (pass_ipa_cdtor_merge::gate): Also run if
2329 dtors_from_cxa_atexit is set.
2330 * target.def (dtors_from_cxa_atexit): New hook.
2332 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
2334 * config.in: Regenerate.
2335 * configure: Regenerate.
2336 * configure.ac: Test ld64 for -platform-version support.
2338 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
2340 * diagnostic-spec.h (typedef xint_hash_t)
2341 (typedef xint_hash_map_t): Replace with...
2342 (typedef nowarn_map_t): ... this.
2343 (nowarn_map): Adjust.
2344 * diagnostic-spec.c (nowarn_map, suppress_warning_at): Likewise.
2346 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
2348 * profile.c (branch_prob): Use 'location_hash' for
2351 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2353 PR tree-optimization/103207
2354 * value-range.cc (irange::set): Drop overflow.
2356 2021-11-15 Jakub Jelinek <jakub@redhat.com>
2358 * gimplify.c (optimize_target_teams): Only add OMP_CLAUSE_THREAD_LIMIT
2359 to OMP_TARGET_CLAUSES if it isn't there already.
2361 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2363 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
2364 Remove useless code.
2365 (path_range_query::ssa_defined_in_bb): New.
2366 (path_range_query::ssa_range_in_phi): Avoid fold_range call that
2367 could trigger additional lookups.
2368 Do not use the cache for ARGs defined in this block.
2369 (path_range_query::compute_ranges_in_block): Use ssa_defined_in_bb.
2370 (path_range_query::maybe_register_phi_relation): Same.
2371 (path_range_query::range_of_stmt): Adjust comment.
2372 * gimple-range-path.h (ssa_defined_in_bb): New.
2374 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2376 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
2377 Default to global range if nothing found.
2379 2021-11-15 Richard Biener <rguenther@suse.de>
2381 PR tree-optimization/103237
2382 * tree-vect-loop.c (vect_is_simple_reduction): Fail for
2383 double reductions with multiple inner loop LC PHI nodes.
2385 2021-11-15 Hongyu Wang <hongyu.wang@intel.com>
2388 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
2389 New expand function.
2390 * config/i386/i386-options.c (ix86_target_string): Add
2391 -mrelax-cmpxchg-loop flag.
2392 (ix86_valid_target_attribute_inner_p): Likewise.
2393 * config/i386/i386-protos.h (ix86_expand_atomic_fetch_op_loop):
2394 New expand function prototype.
2395 * config/i386/i386.opt: Add -mrelax-cmpxchg-loop.
2396 * config/i386/sync.md (atomic_fetch_<logic><mode>): New expander
2398 (atomic_<logic>_fetch<mode>): Likewise.
2399 (atomic_fetch_nand<mode>): Likewise.
2400 (atomic_nand_fetch<mode>): Likewise.
2401 (atomic_fetch_<logic><mode>): New expander for DI,TI modes.
2402 (atomic_<logic>_fetch<mode>): Likewise.
2403 (atomic_fetch_nand<mode>): Likewise.
2404 (atomic_nand_fetch<mode>): Likewise.
2405 * doc/invoke.texi: Document -mrelax-cmpxchg-loop.
2407 2021-11-15 Richard Biener <rguenther@suse.de>
2409 PR tree-optimization/103219
2410 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Use single_exit
2411 to determine the exit for the VN region.
2413 2021-11-15 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
2415 * tree-ssa-loop.c (pass_vectorize): Move to tree-vectorizer.c.
2416 (pass_data_vectorize): Likewise.
2417 (make_pass_vectorize): Likewise.
2418 * tree-vectorizer.c (vectorize_loops): Merge with
2419 pass_vectorize::execute and replace cfun occurences with fun param.
2420 (adjust_simduid_builtins): Add fun param, replace cfun occurences with
2421 fun, and adjust callers approrpiately.
2422 (note_simd_array_uses): Likewise.
2423 (vect_loop_dist_alias_call): Likewise.
2424 (set_uid_loop_bbs): Likewise.
2425 (vect_transform_loops): Likewise.
2426 (try_vectorize_loop_1): Likewise.
2427 (try_vectorize_loop): Likewise.
2429 2021-11-15 Jakub Jelinek <jakub@redhat.com>
2432 * config/i386/sync.md (atomic_bit_test_and_set<mode>,
2433 atomic_bit_test_and_complement<mode>,
2434 atomic_bit_test_and_reset<mode>): Use OPTAB_WIDEN instead of
2437 2021-11-15 Maciej W. Rozycki <macro@embecosm.com>
2439 * config/vax/vax.c (vax_lra_p): New prototype and function.
2440 (TARGET_LRA_P): Wire it.
2441 * config/vax/vax.opt (mlra): New option.
2442 * doc/invoke.texi (Option Summary, VAX Options): Document the
2445 2021-11-14 Jan Hubicka <jh@suse.cz>
2447 * ipa-modref.h (struct modref_summary): Add nondeterministic
2448 and calls_interposable flags.
2449 * ipa-modref.c (modref_summary::modref_summary): Initialize new flags.
2450 (modref_summary::useful_p): Check new flags.
2451 (struct modref_summary_lto): Add nondeterministic and
2452 calls_interposable flags.
2453 (modref_summary_lto::modref_summary_lto): Initialize new flags.
2454 (modref_summary_lto::useful_p): Check new flags.
2455 (modref_summary::dump): Dump new flags.
2456 (modref_summary_lto::dump): Dump new flags.
2457 (ignore_nondeterminism_p): New function.
2458 (merge_call_side_effects): Merge new flags.
2459 (process_fnspec): Likewise.
2460 (analyze_load): Volatile access is nondeterministic.
2461 (analyze_store): Liekwise.
2462 (analyze_stmt): Volatile ASM is nondeterministic.
2463 (analyze_function): Clear new flags.
2464 (modref_summaries::duplicate): Duplicate new flags.
2465 (modref_summaries_lto::duplicate): Duplicate new flags.
2466 (modref_write): Stream new flags.
2467 (read_section): Stream new flags.
2468 (propagate_unknown_call): Update new flags.
2469 (modref_propagate_in_scc): Propagate new flags.
2470 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Check
2472 * tree-ssa-structalias.c (determine_global_memory_access):
2475 2021-11-14 Maciej W. Rozycki <macro@embecosm.com>
2477 * config/vax/vax.h (SET_RATIO): New macro.
2478 * config/vax/vax.md (UNSPEC_SETMEM_FILL): New constant.
2479 (setmemhi): New expander.
2480 (setmemhi1): New insn and splitter.
2481 (*setmemhi1): New insn.
2483 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
2485 * ipa-modref-tree.c (modref_access_node::update_for_kills): New
2487 (modref_access_node::merge_for_kills): Likewise.
2488 (modref_access_node::insert_kill): Likewise.
2489 * ipa-modref-tree.h (modref_access_node::update_for_kills,
2490 modref_access_node::merge_for_kills, modref_access_node::insert_kill):
2492 (modref_access_node::useful_for_kill): New member function.
2493 * ipa-modref.c (modref_summary::useful_p): Release useless kills.
2494 (lto_modref_summary): Add kills.
2495 (modref_summary::dump): Dump kills.
2496 (record_access): Add mdoref_access_node parameter.
2497 (record_access_lto): Likewise.
2498 (merge_call_side_effects): Merge kills.
2499 (analyze_call): Add ALWAYS_EXECUTED param and pass it around.
2500 (struct summary_ptrs): Add always_executed filed.
2501 (analyze_load): Update.
2502 (analyze_store): Update; record kills.
2503 (analyze_stmt): Add always_executed; record kills in clobbers.
2504 (analyze_function): Track always_executed.
2505 (modref_summaries::duplicate): Duplicate kills.
2506 (update_signature): Release kills.
2507 * ipa-modref.h (struct modref_summary): Add kills.
2508 * tree-ssa-alias.c (alias_stats): Add kill stats.
2509 (dump_alias_stats): Dump kill stats.
2510 (store_kills_ref_p): Break out from ...
2511 (stmt_kills_ref_p): Use it; handle modref info based kills.
2513 2021-11-14 Aldy Hernandez <aldyh@redhat.com>
2515 PR tree-optimization/103229
2516 * gimple-range-cache.cc (ssa_global_cache::clear): Do not pass
2517 null value to memset.
2519 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
2521 * ipa-modref-tree.c (modref_access_node::get_call_arg): New member
2523 (modref_access_node::get_ao_ref): Likewise.
2524 * ipa-modref-tree.h (modref_access_node::get_call_arg): Declare.
2525 (modref_access_node::get_ao_ref): Declare.
2526 * tree-ssa-alias.c (modref_may_conflict): Use new accessors.
2527 * tree-ssa-dse.c (dse_optimize_call): Use new accessors.
2529 2021-11-13 Jan Hubicka <jh@suse.cz>
2532 * dbgcnt.def (ipa_attr): New counters.
2533 * ipa-pure-const.c: Include dbgcnt.c
2534 (ipa_make_function_const): Use debug counter.
2535 (ipa_make_function_pure): Likewise.
2536 (propagate_pure_const): Fix bug in my previous change.
2538 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2540 * ipa-modref-tree.c (modref_access_node::range_info_useful_p):
2541 Offline from ipa-modref-tree.h.
2542 (modref_access_node::dump): Move from ipa-modref.c; make member
2544 * ipa-modref-tree.h (modref_access_node::range_info_useful_p.
2545 modref_access_node::dump): Declare.
2546 * ipa-modref.c (dump_access): Remove.
2547 (dump_records): Update.
2548 (dump_lto_records): Update.
2549 (record_access): Update.
2550 (record_access_lto): Update.
2552 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2554 * ipa-modref.c (modref_summary::modref_summary): Clear new flags.
2555 (modref_summary::dump): Dump try_dse.
2556 (modref_summary::finalize): Add FUN attribute; compute try-dse.
2557 (analyze_function): Update.
2558 (read_section): Update.
2559 (update_signature): Update.
2560 (pass_ipa_modref::execute): Update.
2561 * ipa-modref.h (struct modref_summary):
2562 * tree-ssa-alias.c (ao_ref_init_from_ptr_and_range): Export.
2563 * tree-ssa-alias.h (ao_ref_init_from_ptr_and_range): Declare.
2564 * tree-ssa-dse.c (dse_optimize_call): New function.
2565 (dse_optimize_stmt): Use it.
2567 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2569 * ipa-modref-tree.c: Move #if CHECKING_P to proper place.
2571 2021-11-13 Jan Hubicka <jh@suse.cz>
2574 (struct modref_access_node): Move longer member functions to
2576 (modref_ref_node::try_merge_with): Turn into modreef_acces_node member
2578 * ipa-modref-tree.c (modref_access_node::contains): Move here
2579 from ipa-modref-tree.h.
2580 (modref_access_node::update): Likewise.
2581 (modref_access_node::merge): Likewise.
2582 (modref_access_node::closer_pair_p): Likewise.
2583 (modref_access_node::forced_merge): Likewise.
2584 (modref_access_node::update2): Likewise.
2585 (modref_access_node::combined_offsets): Likewise.
2586 (modref_access_node::try_merge_with): Likewise.
2587 (modref_access_node::insert): Likewise.
2589 2021-11-13 Jan Hubicka <jh@suse.cz>
2591 * ipa-modref.c (modref_summary::global_memory_read_p): Remove.
2592 (modref_summary::global_memory_written_p): Remove.
2593 (modref_summary::dump): Dump new flags.
2594 (modref_summary::finalize): New member function.
2595 (analyze_function): Call it.
2596 (read_section): Call it.
2597 (update_signature): Call it.
2598 (pass_ipa_modref::execute): Call it.
2599 * ipa-modref.h (struct modref_summary): Remove
2600 global_memory_read_p and global_memory_written_p.
2601 Add global_memory_read, global_memory_written.
2602 * tree-ssa-structalias.c (determine_global_memory_access):
2605 2021-11-13 Jan Hubicka <jh@suse.cz>
2607 * ipa-fnsummary.c (compute_fn_summary): Use type_attribut_allowed_p
2608 * ipa-param-manipulation.c
2609 (ipa_param_adjustments::type_attribute_allowed_p):
2610 New member function.
2611 (drop_type_attribute_if_params_changed_p): New function.
2612 (build_adjusted_function_type): Use it.
2613 * ipa-param-manipulation.h: Add type_attribute_allowed_p.
2615 2021-11-13 David Malcolm <dmalcolm@redhat.com>
2617 * doc/invoke.texi (Static Analyzer Options): Add
2618 -Wno-analyzer-tainted-allocation-size,
2619 -Wno-analyzer-tainted-divisor, -Wno-analyzer-tainted-offset, and
2620 -Wno-analyzer-tainted-size to list. Add
2621 -Wanalyzer-tainted-allocation-size, -Wanalyzer-tainted-divisor,
2622 -Wanalyzer-tainted-offset, and -Wanalyzer-tainted-size to list
2623 of options effectively enabled by -fanalyzer.
2624 (-Wanalyzer-tainted-allocation-size): New.
2625 (-Wanalyzer-tainted-array-index): Tweak wording; add link to CWE.
2626 (-Wanalyzer-tainted-divisor): New.
2627 (-Wanalyzer-tainted-offset): New.
2628 (-Wanalyzer-tainted-size): New.
2630 2021-11-13 Jan Hubicka <jh@suse.cz>
2632 * attr-fnspec.h (attr_fnspec::arg_eaf_flags): Break out from ...
2633 * gimple.c (gimple_call_arg_flags): ... here.
2634 * ipa-modref.c (analyze_parms): Record flags known from fnspec.
2635 (modref_merge_call_site_flags): Use arg_eaf_flags.
2637 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
2639 PR tree-optimization/103222
2640 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
2642 (path_range_query::compute_ranges_in_block): Call
2643 compute_ranges_in_phis.
2644 * gimple-range-path.h (path_range_query::compute_ranges_in_phis):
2647 2021-11-13 Jan Hubicka <jh@suse.cz>
2649 * ipa-fnsummary.c (compute_fn_summary): Do not give up on signature
2650 changes on "fn spec" attribute; give up on varadic types.
2651 * ipa-param-manipulation.c: Include attribs.h.
2652 (build_adjusted_function_type): New parameter ARG_MODIFIED; if it is
2653 true remove "fn spec" attribute.
2654 (ipa_param_adjustments::build_new_function_type): Update.
2655 (ipa_param_body_adjustments::modify_formal_parameters): update.
2656 * ipa-sra.c: Include attribs.h.
2657 (ipa_sra_preliminary_function_checks): Do not check for TYPE_ATTRIBUTES.
2659 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
2661 * gimple-range-path.cc (path_range_query::path_range_query): Merge
2663 (path_range_query::import_p): Move from header file.
2664 (path_range_query::~path_range_query): Adjust for combined ctors.
2665 * gimple-range-path.h: Merge ctors.
2666 (path_range_query::import_p): Move to .cc file.
2668 2021-11-13 Jan Hubicka <jh@suse.cz>
2670 PR tree-optimization/103182
2671 * ipa-modref.c (callee_to_caller_flags): Fix merging of flags.
2672 (modref_eaf_analysis::analyze_ssa_name): Fix merging of flags.
2674 2021-11-12 Stafford Horne <shorne@gmail.com>
2676 * config/or1k/or1k-protos.h (or1k_profile_hook): New function.
2677 * config/or1k/or1k.h (PROFILE_HOOK): Change macro to reference
2678 new function or1k_profile_hook.
2679 * config/or1k/or1k.c (struct machine_function): Add new field
2680 set_mcount_arg_insn.
2681 (or1k_profile_hook): New function.
2682 (or1k_init_pic_reg): Update to inject pic rtx after _mcount arg
2684 (or1k_frame_pointer_required): Frame pointer no longer needed
2687 2021-11-12 Jan Hubicka <jh@suse.cz>
2689 PR tree-optimization/103209
2690 * tree-ssa-structalias.c (find_func_aliases_for_call): Fix
2691 use of handle_rhs_call
2693 2021-11-12 Aldy Hernandez <aldyh@redhat.com>
2695 PR tree-optimization/103202
2696 * gimple-range-path.cc
2697 (path_range_query::compute_ranges_in_block): Solve PHI imports first.
2699 2021-11-12 Jan Hubicka <jh@suse.cz>
2701 * ipa-pure-const.c (propagate_pure_const): Remove redundant check;
2702 fix call of ipa_make_function_const and ipa_make_function_pure.
2704 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2706 * config/aarch64/aarch64.c (aarch64_vector_op::n_advsimd_ops): Delete.
2707 (aarch64_vector_op::m_seen_loads): Likewise.
2708 (aarch64_vector_costs::aarch64_vector_costs): Don't push to
2710 (aarch64_vector_op::count_ops): Remove vectype and factor parameters.
2711 Remove code that tries to predict different vec_flags from the
2713 (aarch64_vector_costs::add_stmt_cost): Update accordingly.
2714 Remove m_advsimd_ops handling.
2716 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2718 * config/aarch64/aarch64.c (aarch64_vector_costs::m_saw_sve_only_op)
2719 (aarch64_sve_only_stmt_p): Delete.
2720 (aarch64_vector_costs::prefer_unrolled_loop): New function,
2721 extracted from adjust_body_cost.
2722 (aarch64_vector_costs::better_main_loop_than_p): New function,
2723 using heuristics extracted from adjust_body_cost and
2724 adjust_body_cost_sve.
2725 (aarch64_vector_costs::adjust_body_cost_sve): Remove
2726 advsimd_cycles_per_iter and could_use_advsimd parameters.
2727 Update after changes above.
2728 (aarch64_vector_costs::adjust_body_cost): Update after changes above.
2730 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2732 * config/aarch64/aarch64.c (aarch64_vec_op_count::m_vf_factor):
2733 New member variable.
2734 (aarch64_vec_op_count::aarch64_vec_op_count): Add a parameter for it.
2735 (aarch64_vec_op_count::vf_factor): New function.
2736 (aarch64_vector_costs::aarch64_vector_costs): When costing for
2737 neoverse-512tvb, pass a vf_factor of 2 for the Neoverse V1 version
2739 (aarch64_vector_costs::adjust_body_cost): Read the vf factor
2740 instead of hard-coding 2.
2742 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2744 * config/aarch64/aarch64.c
2745 (aarch64_vec_op_count::rename_cycles_per_iter): New function.
2746 (aarch64_vec_op_count::min_nonpred_cycles_per_iter): Likewise.
2747 (aarch64_vec_op_count::min_pred_cycles_per_iter): Likewise.
2748 (aarch64_vec_op_count::min_cycles_per_iter): Likewise.
2749 (aarch64_vec_op_count::dump): Move earlier in file. Dump the
2750 above properties too.
2751 (aarch64_estimate_min_cycles_per_iter): Delete.
2752 (adjust_body_cost): Use aarch64_vec_op_count::min_cycles_per_iter
2753 instead of aarch64_estimate_min_cycles_per_iter. Rely on the dump
2754 routine to print CPI estimates.
2755 (adjust_body_cost_sve): Likewise. Use the other functions above
2756 instead of doing the work inline.
2758 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2760 * config/aarch64/aarch64.c (aarch64_vec_op_count): Allow default
2762 (aarch64_vec_op_count::base_issue_info): Remove handling of null
2764 (aarch64_vec_op_count::simd_issue_info): Likewise.
2765 (aarch64_vec_op_count::sve_issue_info): Likewise.
2766 (aarch64_vector_costs::m_ops): Turn into a vector.
2767 (aarch64_vector_costs::m_advsimd_ops): Likewise.
2768 (aarch64_vector_costs::aarch64_vector_costs): Add entries to
2769 the vectors based on aarch64_tune_params.
2770 (aarch64_vector_costs::analyze_loop_vinfo): Update the pred_ops
2771 of all entries in m_ops.
2772 (aarch64_vector_costs::add_stmt_cost): Call count_ops for all
2774 (aarch64_estimate_min_cycles_per_iter): Remove issue_info
2775 parameter and get the information from the ops instead.
2776 (aarch64_vector_costs::adjust_body_cost_sve): Take a
2777 aarch64_vec_issue_info instead of a aarch64_vec_op_count.
2778 (aarch64_vector_costs::adjust_body_cost): Update call accordingly.
2779 Exit earlier if m_ops is empty for either cost structure.
2781 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2783 * config/aarch64/aarch64.c (aarch64_vector_costs::m_scalar_ops)
2784 (aarch64_vector_costs::m_sve_ops): Replace with...
2785 (aarch64_vector_costs::m_ops): ...this.
2786 (aarch64_vector_costs::analyze_loop_vinfo): Update accordingly.
2787 (aarch64_vector_costs::adjust_body_cost_sve): Likewise.
2788 (aarch64_vector_costs::aarch64_vector_costs): Likewise.
2789 Initialize m_vec_flags here rather than in add_stmt_cost.
2790 (aarch64_vector_costs::count_ops): Test for scalar reductions too.
2791 Allow vectype to be null.
2792 (aarch64_vector_costs::add_stmt_cost): Call count_ops for scalar
2793 code too. Don't require vectype to be nonnull.
2794 (aarch64_vector_costs::adjust_body_cost): Take the loop_vec_info
2795 and scalar costs as parameters. Use the scalar costs to determine
2796 the cycles per iteration of the scalar loop, then multiply it
2797 by the estimated VF.
2798 (aarch64_vector_costs::finish_cost): Update call accordingly.
2800 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2802 * config/aarch64/aarch64.c (aarch64_dr_type): New function.
2803 (aarch64_vector_costs::count_ops): Use it rather than the
2804 vectype to determine floatness.
2806 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2808 * config/aarch64/aarch64.c (aarch64_sve_in_loop_reduction_latency):
2809 Remove vectype parameter and get floatness from the type of the
2811 (arch64_in_loop_reduction_latency): Likewise.
2812 (aarch64_detect_vector_stmt_subtype): Update caller.
2813 (aarch64_vector_costs::count_ops): Likewise.
2815 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2817 * config/aarch64/aarch64.c (aarch64_sve_op_count): Fold into...
2818 (aarch64_vec_op_count): ...this. Add a constructor.
2819 (aarch64_vec_op_count::vec_flags): New function.
2820 (aarch64_vec_op_count::base_issue_info): Likewise.
2821 (aarch64_vec_op_count::simd_issue_info): Likewise.
2822 (aarch64_vec_op_count::sve_issue_info): Likewise.
2823 (aarch64_vec_op_count::m_issue_info): New member variable.
2824 (aarch64_vec_op_count::m_vec_flags): Likewise.
2825 (aarch64_vector_costs): Add a constructor.
2826 (aarch64_vector_costs::m_sve_ops): Change type to aarch64_vec_op_count.
2827 (aarch64_vector_costs::aarch64_vector_costs): New function.
2828 Initialize m_scalar_ops, m_advsimd_ops and m_sve_ops.
2829 (aarch64_vector_costs::count_ops): Remove vec_flags and
2830 issue_info parameters, using the new aarch64_vec_op_count
2832 (aarch64_vector_costs::add_stmt_cost): Update call accordingly.
2833 (aarch64_sve_op_count::dump): Fold into...
2834 (aarch64_vec_op_count::dump): ..here.
2836 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2838 * config/aarch64/aarch64.c: Include tree-dfa.h.
2839 (aarch64_check_consecutive_mems): New function that takes MEM_EXPR
2840 and MEM_OFFSET into account.
2841 (aarch64_swap_ldrstr_operands): Use it.
2842 (aarch64_operands_ok_for_ldpstp): Likewise. Check that the
2843 address of the second memory doesn't depend on the result of
2846 2021-11-12 Jan Hubicka <hubicka@ucw.cz>
2848 * ipa-pure-const.c (ipa_make_function_pure): Fix exit condition.
2850 2021-11-12 Jan Hubicka <jh@suse.cz>
2852 PR tree-optimization/103175
2853 * ipa-modref.c (modref_lattice::merge): Add sanity check.
2854 (callee_to_caller_flags): Make flags adjustment sane.
2855 (modref_eaf_analysis::analyze_ssa_name): Likewise.
2857 2021-11-12 Jan Hubicka <jh@suse.cz>
2860 * ipa-modref.c (analyze_function, modref_propagate_in_scc): Do
2861 not mark pure/const function if there are side-effects.
2863 2021-11-12 Chung-Lin Tang <cltang@codesourcery.com>
2865 * tree.h (OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P): New access macro for
2866 'implicit' bit, using 'base.deprecated_flag' field of tree_node.
2867 * tree-pretty-print.c (dump_omp_clause): Add support for printing
2868 implicit attribute in tree dumping.
2869 * gimplify.c (gimplify_adjust_omp_clauses_1):
2870 Set OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P to 1 if map clause is implicitly
2872 (gimplify_adjust_omp_clauses): Adjust place of adding implicitly created
2873 clauses, from simple append, to starting of list, after non-map clauses.
2874 * omp-low.c (lower_omp_target): Add GOMP_MAP_IMPLICIT bits into kind
2875 values passed to libgomp for implicit maps.
2877 2021-11-12 Jakub Jelinek <jakub@redhat.com>
2879 * omp-builtins.def (BUILT_IN_GOMP_TEAMS): Remove.
2880 (BUILT_IN_GOMP_TEAMS4): New.
2881 * builtin-types.def (BT_FN_VOID_UINT_UINT): Remove.
2882 (BT_FN_BOOL_UINT_UINT_UINT_BOOL): New.
2883 * omp-low.c (lower_omp_teams): Use GOMP_teams4 instead of
2884 GOMP_teams, pass to it also num_teams lower-bound expression
2885 or a dup of upper-bound if it is missing and a flag whether
2886 it is the first call or not.
2888 2021-11-12 Martin Liska <mliska@suse.cz>
2890 PR tree-optimization/102497
2891 * gimple-predicate-analysis.cc (add_pred): Remove unused
2894 2021-11-12 Richard Biener <rguenther@suse.de>
2896 PR tree-optimization/103204
2897 * tree-ssa-sccvn.c (valueize_refs_1): Re-valueize the
2898 top operand after folding in an address.
2900 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2902 * gimple-range-path.cc (path_range_query::path_range_query): New
2903 ctor without a ranger.
2904 (path_range_query::~path_range_query): Free ranger if necessary.
2905 (path_range_query::range_on_path_entry): Adjust m_ranger for pointer.
2906 (path_range_query::ssa_range_in_phi): Same.
2907 (path_range_query::compute_ranges_in_block): Same.
2908 (path_range_query::compute_imports): Same.
2909 (path_range_query::compute_ranges): Same.
2910 (path_range_query::range_of_stmt): Same.
2911 (path_range_query::compute_outgoing_relations): Same.
2912 * gimple-range-path.h (class path_range_query): New ctor.
2913 * tree-ssa-loop-ch.c (ch_base::copy_headers): Remove gimple_ranger
2914 as path_range_query allocates one.
2915 * tree-ssa-threadbackward.c (class back_threader): Remove m_ranger.
2916 (back_threader::~back_threader): Same.
2918 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2920 * tree-ssa-threadbackward.c
2921 (back_threader_profitability::profitable_path_p): Remove loop
2922 crossing restriction.
2924 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2926 * ipa-modref.c (analyze_function): Do pure/const discovery, return
2928 (pass_modref::execute): If pure/const is discovered fixup cfg.
2929 (ignore_edge): Do not ignore pure/const edges.
2930 (modref_propagate_in_scc): Do pure/const discovery, return true if
2931 cdtor was promoted pure/const.
2932 (pass_ipa_modref::execute): If needed remove unreachable functions.
2933 * ipa-pure-const.c (warn_function_noreturn): Fix whitespace.
2934 (warn_function_cold): Likewise.
2935 (skip_function_for_local_pure_const): Move earlier.
2936 (ipa_make_function_const): Break out from ...
2937 (ipa_make_function_pure): Break out from ...
2938 (propagate_pure_const): ... here.
2939 (pass_local_pure_const::execute): Use it.
2940 * ipa-utils.h (ipa_make_function_const): Declare.
2941 (ipa_make_function_pure): Declare.
2942 * passes.def: Move early modref after pure-const.
2944 2021-11-11 David Malcolm <dmalcolm@redhat.com>
2947 * diagnostic-show-locus.c (def_policy): Use def_tabstop.
2949 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2951 * config/aarch64/aarch64-builtins.c (TYPES_COMBINE): Delete.
2952 (TYPES_COMBINEP): Delete.
2953 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2954 qualified builtins for vcombine_* intrinsics.
2955 * config/aarch64/arm_neon.h (vcombine_s8): Remove unnecessary
2957 (vcombine_s16): Likewise.
2958 (vcombine_s32): Likewise.
2959 (vcombine_f32): Likewise.
2960 (vcombine_u8): Use type-qualified builtin and remove casts.
2961 (vcombine_u16): Likewise.
2962 (vcombine_u32): Likewise.
2963 (vcombine_u64): Likewise.
2964 (vcombine_p8): Likewise.
2965 (vcombine_p16): Likewise.
2966 (vcombine_p64): Likewise.
2967 (vcombine_bf16): Remove unnecessary cast.
2968 * config/aarch64/iterators.md (VD_I): New mode iterator.
2969 (VDC_P): New mode iterator.
2971 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2973 * config/aarch64/aarch64-builtins.c (TYPES_LOAD1_U): Define.
2974 (TYPES_LOAD1_P): Define.
2975 (TYPES_STORE1_U): Define.
2976 (TYPES_STORE1P): Rename to...
2977 (TYPES_STORE1_P): This.
2978 (get_mem_type_for_load_store): Add unsigned and poly types.
2979 (aarch64_general_gimple_fold_builtin): Add unsigned and poly
2980 type-qualified builtin declarations.
2981 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2982 qualified builtins for LD1/ST1.
2983 * config/aarch64/arm_neon.h (vld1_p8): Use type-qualified
2984 builtin and remove cast.
2985 (vld1_p16): Likewise.
2986 (vld1_u8): Likewise.
2987 (vld1_u16): Likewise.
2988 (vld1_u32): Likewise.
2989 (vld1q_p8): Likewise.
2990 (vld1q_p16): Likewise.
2991 (vld1q_p64): Likewise.
2992 (vld1q_u8): Likewise.
2993 (vld1q_u16): Likewise.
2994 (vld1q_u32): Likewise.
2995 (vld1q_u64): Likewise.
2996 (vst1_p8): Likewise.
2997 (vst1_p16): Likewise.
2998 (vst1_u8): Likewise.
2999 (vst1_u16): Likewise.
3000 (vst1_u32): Likewise.
3001 (vst1q_p8): Likewise.
3002 (vst1q_p16): Likewise.
3003 (vst1q_p64): Likewise.
3004 (vst1q_u8): Likewise.
3005 (vst1q_u16): Likewise.
3006 (vst1q_u32): Likewise.
3007 (vst1q_u64): Likewise.
3008 * config/aarch64/iterators.md (VALLP_NO_DI): New iterator.
3010 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3012 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3013 builtins for vector reduction.
3014 * config/aarch64/arm_neon.h (vaddv_u8): Use type-qualified
3015 builtin and remove casts.
3016 (vaddv_u16): Likewise.
3017 (vaddv_u32): Likewise.
3018 (vaddvq_u8): Likewise.
3019 (vaddvq_u16): Likewise.
3020 (vaddvq_u32): Likewise.
3021 (vaddvq_u64): Likewise.
3023 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3025 * config/aarch64/aarch64-simd-builtins.def:
3026 * config/aarch64/arm_neon.h (vpaddq_u8): Use type-qualified
3027 builtin and remove casts.
3028 (vpaddq_u16): Likewise.
3029 (vpaddq_u32): Likewise.
3030 (vpaddq_u64): Likewise.
3031 (vpadd_u8): Likewise.
3032 (vpadd_u16): Likewise.
3033 (vpadd_u32): Likewise.
3034 (vpaddd_u64): Likewise.
3036 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3038 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3039 builtins for [r]subhn[2].
3040 * config/aarch64/arm_neon.h (vsubhn_s16): Remove unnecessary
3042 (vsubhn_s32): Likewise.
3043 (vsubhn_s64): Likewise.
3044 (vsubhn_u16): Use type-qualified builtin and remove casts.
3045 (vsubhn_u32): Likewise.
3046 (vsubhn_u64): Likewise.
3047 (vrsubhn_s16): Remove unnecessary cast.
3048 (vrsubhn_s32): Likewise.
3049 (vrsubhn_s64): Likewise.
3050 (vrsubhn_u16): Use type-qualified builtin and remove casts.
3051 (vrsubhn_u32): Likewise.
3052 (vrsubhn_u64): Likewise.
3053 (vrsubhn_high_s16): Remove unnecessary cast.
3054 (vrsubhn_high_s32): Likewise.
3055 (vrsubhn_high_s64): Likewise.
3056 (vrsubhn_high_u16): Use type-qualified builtin and remove
3058 (vrsubhn_high_u32): Likewise.
3059 (vrsubhn_high_u64): Likewise.
3060 (vsubhn_high_s16): Remove unnecessary cast.
3061 (vsubhn_high_s32): Likewise.
3062 (vsubhn_high_s64): Likewise.
3063 (vsubhn_high_u16): Use type-qualified builtin and remove
3065 (vsubhn_high_u32): Likewise.
3066 (vsubhn_high_u64): Likewise.
3068 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3070 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3071 builtins for [r]addhn[2].
3072 * config/aarch64/arm_neon.h (vaddhn_s16): Remove unnecessary
3074 (vaddhn_s32): Likewise.
3075 (vaddhn_s64): Likewise.
3076 (vaddhn_u16): Use type-qualified builtin and remove casts.
3077 (vaddhn_u32): Likewise.
3078 (vaddhn_u64): Likewise.
3079 (vraddhn_s16): Remove unnecessary cast.
3080 (vraddhn_s32): Likewise.
3081 (vraddhn_s64): Likewise.
3082 (vraddhn_u16): Use type-qualified builtin and remove casts.
3083 (vraddhn_u32): Likewise.
3084 (vraddhn_u64): Likewise.
3085 (vaddhn_high_s16): Remove unnecessary cast.
3086 (vaddhn_high_s32): Likewise.
3087 (vaddhn_high_s64): Likewise.
3088 (vaddhn_high_u16): Use type-qualified builtin and remove
3090 (vaddhn_high_u32): Likewise.
3091 (vaddhn_high_u64): Likewise.
3092 (vraddhn_high_s16): Remove unnecessary cast.
3093 (vraddhn_high_s32): Likewise.
3094 (vraddhn_high_s64): Likewise.
3095 (vraddhn_high_u16): Use type-qualified builtin and remove
3097 (vraddhn_high_u32): Likewise.
3098 (vraddhn_high_u64): Likewise.
3100 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3102 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3103 qualifiers in generator macros for uhsub builtins.
3104 * config/aarch64/arm_neon.h (vhsub_s8): Remove unnecessary
3106 (vhsub_s16): Likewise.
3107 (vhsub_s32): Likewise.
3108 (vhsub_u8): Use type-qualified builtin and remove casts.
3109 (vhsub_u16): Likewise.
3110 (vhsub_u32): Likewise.
3111 (vhsubq_s8): Remove unnecessary cast.
3112 (vhsubq_s16): Likewise.
3113 (vhsubq_s32): Likewise.
3114 (vhsubq_u8): Use type-qualified builtin and remove casts.
3115 (vhsubq_u16): Likewise.
3116 (vhsubq_u32): Likewise.
3118 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3120 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3121 qualifiers in generator macros for u[r]hadd builtins.
3122 * config/aarch64/arm_neon.h (vhadd_s8): Remove unnecessary
3124 (vhadd_s16): Likewise.
3125 (vhadd_s32): Likewise.
3126 (vhadd_u8): Use type-qualified builtin and remove casts.
3127 (vhadd_u16): Likewise.
3128 (vhadd_u32): Likewise.
3129 (vhaddq_s8): Remove unnecessary cast.
3130 (vhaddq_s16): Likewise.
3131 (vhaddq_s32): Likewise.
3132 (vhaddq_u8): Use type-qualified builtin and remove casts.
3133 (vhaddq_u16): Likewise.
3134 (vhaddq_u32): Likewise.
3135 (vrhadd_s8): Remove unnecessary cast.
3136 (vrhadd_s16): Likewise.
3137 (vrhadd_s32): Likewise.
3138 (vrhadd_u8): Use type-qualified builtin and remove casts.
3139 (vrhadd_u16): Likewise.
3140 (vrhadd_u32): Likewise.
3141 (vrhaddq_s8): Remove unnecessary cast.
3142 (vrhaddq_s16): Likewise.
3143 (vrhaddq_s32): Likewise.
3144 (vrhaddq_u8): Use type-wualified builtin and remove casts.
3145 (vrhaddq_u16): Likewise.
3146 (vrhaddq_u32): Likewise.
3148 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3150 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3151 qualifiers in generator macros for usub[lw][2] builtins.
3152 * config/aarch64/arm_neon.h (vsubl_s8): Remove unnecessary
3154 (vsubl_s16): Likewise.
3155 (vsubl_s32): Likewise.
3156 (vsubl_u8): Use type-qualified builtin and remove casts.
3157 (vsubl_u16): Likewise.
3158 (vsubl_u32): Likewise.
3159 (vsubl_high_s8): Remove unnecessary cast.
3160 (vsubl_high_s16): Likewise.
3161 (vsubl_high_s32): Likewise.
3162 (vsubl_high_u8): Use type-qualified builtin and remove casts.
3163 (vsubl_high_u16): Likewise.
3164 (vsubl_high_u32): Likewise.
3165 (vsubw_s8): Remove unnecessary casts.
3166 (vsubw_s16): Likewise.
3167 (vsubw_s32): Likewise.
3168 (vsubw_u8): Use type-qualified builtin and remove casts.
3169 (vsubw_u16): Likewise.
3170 (vsubw_u32): Likewise.
3171 (vsubw_high_s8): Remove unnecessary cast.
3172 (vsubw_high_s16): Likewise.
3173 (vsubw_high_s32): Likewise.
3174 (vsubw_high_u8): Use type-qualified builtin and remove casts.
3175 (vsubw_high_u16): Likewise.
3176 (vsubw_high_u32): Likewise.
3178 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3180 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3181 qualifiers in generator macros for uadd[lw][2] builtins.
3182 * config/aarch64/arm_neon.h (vaddl_s8): Remove unnecessary
3184 (vaddl_s16): Likewise.
3185 (vaddl_s32): Likewise.
3186 (vaddl_u8): Use type-qualified builtin and remove casts.
3187 (vaddl_u16): Likewise.
3188 (vaddl_u32): Likewise.
3189 (vaddl_high_s8): Remove unnecessary cast.
3190 (vaddl_high_s16): Likewise.
3191 (vaddl_high_s32): Likewise.
3192 (vaddl_high_u8): Use type-qualified builtin and remove casts.
3193 (vaddl_high_u16): Likewise.
3194 (vaddl_high_u32): Likewise.
3195 (vaddw_s8): Remove unnecessary cast.
3196 (vaddw_s16): Likewise.
3197 (vaddw_s32): Likewise.
3198 (vaddw_u8): Use type-qualified builtin and remove casts.
3199 (vaddw_u16): Likewise.
3200 (vaddw_u32): Likewise.
3201 (vaddw_high_s8): Remove unnecessary cast.
3202 (vaddw_high_s16): Likewise.
3203 (vaddw_high_s32): Likewise.
3204 (vaddw_high_u8): Use type-qualified builtin and remove casts.
3205 (vaddw_high_u16): Likewise.
3206 (vaddw_high_u32): Likewise.
3208 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3210 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3211 qualified builtins for [R]SHRN[2].
3212 * config/aarch64/arm_neon.h (vshrn_n_u16): Use type-qualified
3213 builtin and remove casts.
3214 (vshrn_n_u32): Likewise.
3215 (vshrn_n_u64): Likewise.
3216 (vrshrn_high_n_u16): Likewise.
3217 (vrshrn_high_n_u32): Likewise.
3218 (vrshrn_high_n_u64): Likewise.
3219 (vrshrn_n_u16): Likewise.
3220 (vrshrn_n_u32): Likewise.
3221 (vrshrn_n_u64): Likewise.
3222 (vshrn_high_n_u16): Likewise.
3223 (vshrn_high_n_u32): Likewise.
3224 (vshrn_high_n_u64): Likewise.
3226 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3228 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3229 type-qualified builtins for XTN[2].
3230 * config/aarch64/arm_neon.h (vmovn_high_u16): Use type-
3231 qualified builtin and remove casts.
3232 (vmovn_high_u32): Likewise.
3233 (vmovn_high_u64): Likewise.
3234 (vmovn_u16): Likewise.
3235 (vmovn_u32): Likewise.
3236 (vmovn_u64): Likewise.
3238 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3240 * config/aarch64/aarch64-simd-builtins.def: Use poly type
3241 qualifier in builtin generator macros.
3242 * config/aarch64/arm_neon.h (vmul_p8): Use type-qualified
3243 builtin and remove casts.
3244 (vmulq_p8): Likewise.
3245 (vmull_high_p8): Likewise.
3246 (vmull_p8): Likewise.
3248 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3250 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3251 qualified builtin generators for unsigned MLA/MLS intrinsics.
3252 * config/aarch64/arm_neon.h (vmla_n_u16): Use type-qualified
3254 (vmla_n_u32): Likewise.
3255 (vmla_u8): Likewise.
3256 (vmla_u16): Likewise.
3257 (vmla_u32): Likewise.
3258 (vmlaq_n_u16): Likewise.
3259 (vmlaq_n_u32): Likewise.
3260 (vmlaq_u8): Likewise.
3261 (vmlaq_u16): Likewise.
3262 (vmlaq_u32): Likewise.
3263 (vmls_n_u16): Likewise.
3264 (vmls_n_u32): Likewise.
3265 (vmls_u8): Likewise.
3266 (vmls_u16): Likewise.
3267 (vmls_u32): Likewise.
3268 (vmlsq_n_u16): Likewise.
3269 (vmlsq_n_u32): Likewise.
3270 (vmlsq_u8): Likewise.
3271 (vmlsq_u16): Likewise.
3272 (vmlsq_u32): Likewise.
3274 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3276 * ipa-modref.c (modref_summary::useful_p): Check also for side-effects
3277 with looping const/pure.
3278 (modref_summary_lto::useful_p): Likewise.
3279 (merge_call_side_effects): Merge side effects before early exit
3281 (process_fnspec): Also handle pure functions.
3282 (analyze_call): Do not early exit on looping pure const.
3283 (propagate_unknown_call): Also handle nontrivial SCC as side-effect.
3284 (modref_propagate_in_scc): Update.
3286 2021-11-11 Richard Biener <rguenther@suse.de>
3288 PR tree-optimization/103190
3289 * tree-ssa-reassoc.c (insert_stmt_after): Only assert on asm goto.
3291 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3293 * gimple-range-path.cc (path_range_query::add_copies_to_imports):
3295 (path_range_query::compute_imports): ...this. Adapt it so it can
3296 be passed the imports bitmap instead of working on m_imports.
3297 (path_range_query::compute_ranges): Call compute_imports in all
3298 cases unless an imports bitmap is passed.
3299 * gimple-range-path.h (path_range_query::compute_imports): New.
3300 (path_range_query::add_copies_to_imports): Remove.
3301 * tree-ssa-threadbackward.c (back_threader::resolve_def): Remove.
3302 (back_threader::find_paths_to_names): Inline resolve_def.
3303 (back_threader::find_paths): Call compute_imports.
3304 (back_threader::resolve_phi): Adjust comment.
3306 2021-11-11 Richard Biener <rguenther@suse.de>
3308 PR tree-optimization/103188
3309 * tree-ssa-loop-ch.c (should_duplicate_loop_header_p):
3310 Remove query parameter, split out check for size
3312 (ch_base::m_ranger, cb_base::m_query): Remove.
3313 (ch_base::copy_headers): Split processing loop into
3314 analysis around which we allocate and use ranger and
3315 transform where we do not.
3316 (pass_ch::execute): Do not allocate/free ranger here.
3317 (pass_ch_vect::execute): Likewise.
3319 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3321 * ipa-pure-const.c (propagate_pure_const): Self recursion is
3324 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3326 * cgraph.c (set_noreturn_flag_1): New function.
3327 (cgraph_node::set_noreturn_flag): New member function
3328 * cgraph.h (cgraph_node::set_noreturn_flags): Declare.
3329 * ipa-pure-const.c (pass_local_pure_const::execute): Use it.
3331 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3332 Richard Biener <rguenther@suse.de>
3334 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): Resolve
3335 statically to the edge remaining in the loop.
3337 2021-11-11 Richard Biener <rguenther@suse.de>
3339 PR middle-end/103181
3340 * tree-eh.c (operation_could_trap_helper_p): Properly
3341 check vector constants for a zero element for integer
3342 division. Separate floating point and integer division code.
3344 2021-11-11 Jakub Jelinek <jakub@redhat.com>
3347 * dwarf2out.c (field_byte_offset): Do the PCC_BITFIELD_TYPE_MATTERS
3348 handling only for DECL_BIT_FIELD_TYPE decls.
3350 2021-11-11 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
3353 * config/aarch64/aarch64.c (aarch64_process_target_attr): Check if
3354 token is arch extension without leading '+' and emit appropriate
3355 diagnostic for the same.
3357 2021-11-11 Jakub Jelinek <jakub@redhat.com>
3359 * tree.h (OMP_CLAUSE_NUM_TEAMS_EXPR): Rename to ...
3360 (OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR): ... this.
3361 (OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR): Define.
3362 * tree.c (omp_clause_num_ops): Increase num ops for
3363 OMP_CLAUSE_NUM_TEAMS to 2.
3364 * tree-pretty-print.c (dump_omp_clause): Print optional lower bound
3365 for OMP_CLAUSE_NUM_TEAMS.
3366 * gimplify.c (gimplify_scan_omp_clauses): Gimplify
3367 OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR if non-NULL.
3368 (optimize_target_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR instead
3369 of OMP_CLAUSE_NUM_TEAMS_EXPR. Handle OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR.
3370 * omp-low.c (lower_omp_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR
3371 instead of OMP_CLAUSE_NUM_TEAMS_EXPR.
3372 * omp-expand.c (expand_teams_call, get_target_arguments): Likewise.
3374 2021-11-11 Richard Biener <rguenther@suse.de>
3376 * cfganal.c (find_pdom): Remove.
3377 (control_dependences::find_control_dependence): Remove
3378 special-casing of entry block, call get_immediate_dominator
3380 * gimple-predicate-analysis.cc (find_pdom): Remove.
3381 (find_dom): Likewise.
3382 (find_control_equiv_block): Call get_immediate_dominator
3384 (compute_control_dep_chain): Likewise.
3385 (predicate::init_from_phi_def): Likewise.
3387 2021-11-11 Richard Biener <rguenther@suse.de>
3389 * cfganal.h (control_dependences::control_dependence_map):
3391 (control_dependences::m_bitmaps): New.
3392 * cfganal.c (control_dependences::set_control_dependence_map_bit):
3394 (control_dependences::clear_control_dependence_bitmap):
3396 (control_dependences::find_control_dependence): Do not
3397 find_edge for the abnormal edge test.
3398 (control_dependences::control_dependences): Instead do not
3399 add abnormal edges to the edge list. Adjust.
3400 (control_dependences::~control_dependences): Likewise.
3401 (control_dependences::get_edges_dependent_on): Likewise.
3402 * function-tests.c: Include bitmap.h.
3404 2021-11-11 Kewen Lin <linkw@linux.ibm.com>
3406 * doc/invoke.texi: Change references to "future cpu" to "power10",
3407 "-mcpu=future" to "-mcpu=power10". Adjust words for float128.
3409 2021-11-11 Cui,Lili <lili.cui@intel.com>
3411 * config/i386/i386-options.c (m_CORE_AVX2): Remove Alderlake
3413 (processor_cost_table): Use alderlake_cost for Alderlake.
3414 * config/i386/i386.c (ix86_sched_init_global): Handle Alderlake.
3415 * config/i386/x86-tune-costs.h (struct processor_costs): Add alderlake
3417 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Alderlake
3419 (ix86_adjust_cost): Handle Alderlake.
3420 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE): Enable for Alderlake.
3421 (X86_TUNE_PARTIAL_REG_DEPENDENCY): Likewise.
3422 (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY): Likewise.
3423 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): Likewise.
3424 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
3425 (X86_TUNE_MEMORY_MISMATCH_STALL): Likewise.
3426 (X86_TUNE_USE_LEAVE): Likewise.
3427 (X86_TUNE_PUSH_MEMORY): Likewise.
3428 (X86_TUNE_USE_INCDEC): Likewise.
3429 (X86_TUNE_INTEGER_DFMODE_MOVES): Likewise.
3430 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
3431 (X86_TUNE_USE_SAHF): Likewise.
3432 (X86_TUNE_USE_BT): Likewise.
3433 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
3434 (X86_TUNE_ONE_IF_CONV_INSN): Likewise.
3435 (X86_TUNE_AVOID_MFENCE): Likewise.
3436 (X86_TUNE_USE_SIMODE_FIOP): Likewise.
3437 (X86_TUNE_EXT_80387_CONSTANTS): Likewise.
3438 (X86_TUNE_SSE_UNALIGNED_LOAD_OPTIMAL): Likewise.
3439 (X86_TUNE_SSE_UNALIGNED_STORE_OPTIMAL): Likewise.
3440 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
3441 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
3442 (X86_TUNE_AVOID_4BYTE_PREFIXES): Likewise.
3443 (X86_TUNE_USE_GATHER): Disable for Alderlake.
3444 (X86_TUNE_AVX256_MOVE_BY_PIECES): Likewise.
3445 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
3447 2021-11-11 liuhongt <hongtao.liu@intel.com>
3450 * config/i386/sse.md (V_128_256): Extend to V8HF/V16HF.
3451 (avxsizesuffix): Ditto.
3453 2021-11-11 Kito Cheng <kito.cheng@sifive.com>
3455 * common/config/riscv/riscv-common.c (riscv_subset_list::to_string): Fix
3456 wrong marco checking.
3458 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3460 PR tree-optimization/102906
3461 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): New.
3462 (should_duplicate_loop_header_p): Call entry_loop_condition_is_static.
3463 (class ch_base): Add m_ranger and m_query.
3464 (ch_base::copy_headers): Pass m_query to
3465 entry_loop_condition_is_static.
3466 (pass_ch::execute): Allocate and deallocate m_ranger and
3468 (pass_ch_vect::execute): Same.
3470 2021-11-10 Andrew Pinski <apinski@marvell.com>
3473 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>):
3474 Use vwcore iterator for the r constraint output string.
3476 2021-11-10 qing zhao <qing.zhao@oracle.com>
3478 * internal-fn.c (expand_DEFERRED_INIT): Apply pattern initialization
3479 only when have_insn_for return true for the mode. Fix a memory leak.
3481 2021-11-10 Christophe Lyon <christophe.lyon@foss.st.com>
3483 * config/arm/arm.c (cortexa9_extra_costs, cortexa8_extra_costs,
3484 cortexa5_extra_costs, cortexa7_extra_costs,
3485 cortexa12_extra_costs, cortexa15_extra_costs, v7m_extra_costs):
3486 Initialize movi, dup and extract costing fields.
3488 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3490 * gimple-range-path.cc (path_range_query::path_range_query): Do
3492 (path_range_query::dump): Change m_path uses to non-pointer.
3493 (path_range_query::defined_outside_path): Same.
3494 (path_range_query::set_path): Same.
3495 (path_range_query::add_copies_to_imports): Same.
3496 (path_range_query::range_of_stmt): Same.
3497 (path_range_query::compute_outgoing_relations): Same.
3498 (path_range_query::compute_ranges): Imports are now optional.
3499 Implement overload that takes an edge.
3500 * gimple-range-path.h (class path_range_query): Make imports
3501 optional for compute_ranges. Add compute_ranges(edge) overload.
3502 Make m_path an auto_vec instead of a pointer and adjust
3505 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3507 * tree-vectorizer.h (struct scalar_cond_masked_key): Add inverted_p.
3508 (default_hash_traits<scalar_conf_masked_key>): Likewise.
3509 * tree-vect-stmts.c (vectorizable_condition): Check if inverse of mask
3511 * tree-vectorizer.c (scalar_cond_masked_key::get_cond_ops_from_tree):
3512 Register mask inverses.
3514 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3516 * tree-vectorizer.c (vectorize_loops): Do local CSE through RPVN upon
3517 successful vectorization.
3519 2021-11-10 Andrew MacLeod <amacleod@redhat.com>
3521 * gimple-range-cache.cc (sbr_vector::grow): New.
3522 (sbr_vector::set_bb_range): Call grow.
3523 (sbr_vector::get_bb_range): Same.
3524 (sbr_vector::bb_range_p): Remove assert.
3526 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3528 * config/aarch64/aarch64-simd.md (*aarch64_topbits_shuffle<mode>_le
3529 ,*aarch64_topbits_shuffle<mode>_be): Remove.
3531 2021-11-10 Jan Hubicka <jh@suse.cz>
3533 * ipa-modref.c: Include tree-eh.h
3534 (modref_summary::modref_summary): Initialize side_effects.
3535 (struct modref_summary_lto): New bool field side_effects.
3536 (modref_summary_lto::modref_summary_lto): Initialize side_effects.
3537 (modref_summary::dump): Dump side_effects.
3538 (modref_summary_lto::dump): Dump side_effects.
3539 (merge_call_side_effects): Merge side effects.
3540 (process_fnspec): Calls to non-const/pure or looping
3541 function is a side effect.
3542 (analyze_call): Self-recursion is a side-effect; handle
3544 (analyze_load): Watch for volatile and throwing memory.
3545 (analyze_store): Likewise.
3546 (analyze_stmt): Watch for volatitle asm.
3547 (analyze_function): Handle side_effects.
3548 (modref_summaries::duplicate): Duplicate side_effects.
3549 (modref_summaries_lto::duplicate): Likewise.
3550 (modref_write): Stream side_effects.
3551 (read_section): Likewise.
3552 (update_signature): Update.
3553 (propagate_unknown_call): Handle side_effects.
3554 (modref_propagate_in_scc): Likewise.
3555 * ipa-modref.h (struct modref_summary): Add side_effects.
3556 * ipa-pure-const.c (special_builtin_state): Rename to ...
3557 (builtin_safe_for_const_function_p): ... this one.
3558 (check_call): Update.
3559 (finite_function_p): Break out from ...
3560 (propagate_pure_const): ... here
3561 * ipa-utils.h (finite_function): Declare.
3563 2021-11-10 Lucas A. M. Magalhães <lamm@linux.ibm.com>
3565 * config.gcc (powerpc*-*-*): Remove -rpath from
3566 --with-advance-toolchain.
3568 2021-11-10 Marek Polacek <polacek@redhat.com>
3571 * attribs.c (struct scoped_attributes): Add a bool member.
3572 (lookup_scoped_attribute_spec): Forward declare.
3573 (register_scoped_attributes): New bool parameter, defaulted to
3575 (handle_ignored_attributes_option): New function.
3576 (free_attr_data): New function.
3577 (init_attributes): Call handle_ignored_attributes_option.
3578 (attr_namespace_ignored_p): New function.
3579 (decl_attributes): Check attr_namespace_ignored_p before
3581 * attribs.h (free_attr_data): Declare.
3582 (register_scoped_attributes): Adjust declaration.
3583 (handle_ignored_attributes_option): Declare.
3584 (canonicalize_attr_name): New function template.
3585 (canonicalize_attr_name): Use it.
3586 * common.opt (Wattributes=): New option with a variable.
3587 * doc/extend.texi: Document #pragma GCC diagnostic ignored_attributes.
3588 * doc/invoke.texi: Document -Wno-attributes=.
3589 * opts.c (common_handle_option) <case OPT_Wattributes_>: Handle.
3590 * plugin.h (register_scoped_attributes): Adjust declaration.
3591 * toplev.c (compile_file): Call free_attr_data.
3593 2021-11-10 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
3595 * config/arm/arm-cpus.in (cortex-a710): New CPU.
3596 * config/arm/arm-tables.opt: Regenerate.
3597 * config/arm/arm-tune.md: Regenerate.
3598 * doc/invoke.texi: Update docs.
3600 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
3602 * config/aarch64/aarch64-builtins.c
3603 (aarch64_general_gimple_fold_builtin): Mark argument as unused.
3605 2021-11-10 Martin Liska <mliska@suse.cz>
3607 * lto-wrapper.c (merge_and_complain): Make the first argument
3610 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3612 * config/aarch64/iterators.md (optab): Use fmax_nan instead of
3613 smax_nan and fmin_nan instead of smin_nan.
3614 (maxmin_uns): Rename to...
3615 (fmaxmin): ...this and make the same changes. Remove entries
3616 unrelated to fmax* and fmin*.
3617 * config/aarch64/aarch64.md (<maxmin_uns><mode>3): Rename to...
3618 (<fmaxmin><mode>3): ...this.
3619 * config/aarch64/aarch64-simd.md (aarch64_<maxmin_uns>p<mode>):
3621 (aarch64_<optab>p<mode>): ...this.
3622 (<maxmin_uns><mode>3): Rename to...
3623 (<fmaxmin><mode>3): ...this.
3624 (reduc_<maxmin_uns>_scal_<mode>): Rename to...
3625 (reduc_<optab>_scal_<mode>): ...this and update gen* call.
3626 (aarch64_reduc_<maxmin_uns>_internal<mode>): Rename to...
3627 (aarch64_reduc_<optab>_internal<mode>): ...this.
3628 (aarch64_reduc_<maxmin_uns>_internalv2si): Rename to...
3629 (aarch64_reduc_<optab>_internalv2si): ...this.
3630 * config/aarch64/aarch64-sve.md (<maxmin_uns><mode>3): Rename to...
3631 (<fmaxmin><mode>3): ...this.
3632 * config/aarch64/aarch64-simd-builtins.def (smax_nan, smin_nan)
3634 (fmax_nan, fmin_nan): ...this.
3635 * config/aarch64/arm_neon.h (vmax_f32, vmax_f64, vmaxq_f32, vmaxq_f64)
3636 (vmin_f32, vmin_f64, vminq_f32, vminq_f64, vmax_f16, vmaxq_f16)
3637 (vmin_f16, vminq_f16): Update accordingly.
3639 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3641 * tree-vectorizer.h (vector_costs::finish_cost): Take the
3642 corresponding scalar costs as a parameter.
3643 (finish_cost): Likewise.
3644 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost)
3645 (vect_estimate_min_profitable_iters): Update accordingly.
3646 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Likewise.
3647 * tree-vectorizer.c (vector_costs::finish_cost): Likewise.
3648 * config/aarch64/aarch64.c (aarch64_vector_costs::finish_cost):
3650 * config/rs6000/rs6000.c (rs6000_cost_data::finish_cost): Likewise.
3652 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3654 * tree-vectorizer.h (_loop_vec_info::scalar_costs): New member
3656 (_loop_vec_info::single_scalar_iteration_cost): Delete.
3657 (LOOP_VINFO_SINGLE_SCALAR_ITERATION_COST): Delete.
3658 (vector_costs::total_cost): New function.
3659 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
3660 after above changes.
3661 (_loop_vec_info::~_loop_vec_info): Delete scalar_costs.
3662 (vect_compute_single_scalar_iteration_cost): Store the costs
3663 in loop_vinfo->scalar_costs.
3664 (vect_estimate_min_profitable_iters): Get the scalar cost from
3665 loop_vinfo->scalar_costs.
3667 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3669 * tree-vectorizer.h (vector_costs::better_main_loop_than_p)
3670 (vector_costs::better_epilogue_loop_than_p)
3671 (vector_costs::compare_inside_loop_cost)
3672 (vector_costs::compare_outside_loop_cost): Likewise.
3673 * tree-vectorizer.c (vector_costs::better_main_loop_than_p)
3674 (vector_costs::better_epilogue_loop_than_p)
3675 (vector_costs::compare_inside_loop_cost)
3676 (vector_costs::compare_outside_loop_cost): New functions,
3677 containing code moved from...
3678 * tree-vect-loop.c (vect_better_loop_vinfo_p): ...here.
3680 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3682 * tree-vectorizer.h (_loop_vec_info): Remove vec_outside_cost
3683 and vec_inside_cost.
3684 (vector_costs::outside_cost): New function.
3685 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
3687 (vect_estimate_min_profitable_iters): Likewise.
3688 (vect_better_loop_vinfo_p): Get the inside and outside costs
3689 from the loop_vec_infos' vector_costs.
3691 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3693 * tree-vectorizer.h (vec_info::target_cost_data): Replace with...
3694 (_loop_vec_info::vector_costs): ...this.
3695 (LOOP_VINFO_TARGET_COST_DATA): Delete.
3696 * tree-vectorizer.c (vec_info::vec_info): Remove target_cost_data
3698 (vec_info::~vec_info): Remove corresponding delete.
3699 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Initialize
3700 vector_costs to null.
3701 (_loop_vec_info::~_loop_vec_info): Delete vector_costs.
3702 (vect_analyze_loop_operations): Update after above changes.
3703 (vect_analyze_loop_2): Likewise.
3704 (vect_estimate_min_profitable_iters): Likewise.
3705 * tree-vect-slp.c (vect_slp_analyze_operations): Likewise.
3707 2021-11-10 Jan Hubicka <hubicka@ucw.cz>
3709 * tree-core.h (EAF_DIRECT): Remove.
3710 (EAF_NOCLOBBER): Remove.
3711 (EAF_UNUSED): Remove.
3712 (EAF_NOESCAPE): Remove.
3713 (EAF_NO_DIRECT_CLOBBER): New.
3714 (EAF_NO_INDIRECT_CLOBBER): New.
3715 (EAF_NODIRECTESCAPE): Remove.
3716 (EAF_NO_DIRECT_ESCAPE): New.
3717 (EAF_NO_INDIRECT_ESCAPE): New.
3718 (EAF_NOT_RETURNED): Remove.
3719 (EAF_NOT_RETURNED_INDIRECTLY): New.
3720 (EAF_NOREAD): Remove.
3721 (EAF_NO_DIRECT_READ): New.
3722 (EAF_NO_INDIRECT_READ): New.
3723 * gimple.c (gimple_call_arg_flags): Update for new flags.
3724 (gimple_call_retslot_flags): Update for new flags.
3725 * ipa-modref.c (dump_eaf_flags): Likewise.
3726 (remove_useless_eaf_flags): Likewise.
3727 (deref_flags): Likewise.
3728 (modref_lattice::init): Likewise.
3729 (modref_lattice::merge): Likewise.
3730 (modref_lattice::merge_direct_load): Likewise.
3731 (modref_lattice::merge_direct_store): Likewise.
3732 (modref_eaf_analysis::merge_call_lhs_flags): Likewise.
3733 (callee_to_caller_flags): Likewise.
3734 (modref_eaf_analysis::analyze_ssa_name): Likewise.
3735 (modref_eaf_analysis::propagate): Likewise.
3736 (modref_merge_call_site_flags): Likewise.
3737 * ipa-modref.h (interposable_eaf_flags): Likewise.
3738 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1) Likewise.
3739 * tree-ssa-structalias.c (handle_call_arg): Likewise.
3740 (handle_rhs_call): Likewise.
3741 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Likewise.
3743 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3745 * tree-vect-slp-patterns.c (complex_mul_pattern::matches): Remove l1node.
3747 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
3749 * config/aarch64/aarch64-builtins.c
3750 (aarch64_general_gimple_fold_builtin): Change pointer alignment and
3753 2021-11-10 Jan Hubicka <jh@suse.cz>
3755 * ipa-modref-tree.h (modref_tree::remap_params): Fix off-by-one error.
3757 2021-11-10 H.J. Lu <hongjiu.lu@intel.com>
3758 Hongtao Liu <hongtao.liu@intel.com>
3760 PR middle-end/102566
3761 * match.pd (nop_atomic_bit_test_and_p): New match.
3762 * tree-ssa-ccp.c (convert_atomic_bit_not): New function.
3763 (gimple_nop_atomic_bit_test_and_p): New prototype.
3764 (optimize_atomic_bit_test_and): Transform equivalent, but slighly
3765 different cases to their canonical forms.
3767 2021-11-10 liuhongt <hongtao.liu@intel.com>
3769 PR tree-optimization/103126
3770 * tree-vect-loop.c (neutral_op_for_reduction): Remove static.
3771 * tree-vectorizer.h (neutral_op_for_reduction): Declare.
3772 * tree-if-conv.c : Include tree-vectorizer.h.
3773 (is_cond_scalar_reduction): Handle
3774 BIT_XOR_EXPR/BIT_IOR_EXPR/BIT_AND_EXPR.
3775 (convert_scalar_cond_reduction): Ditto.
3777 2021-11-10 konglin1 <lingling.kong@intel.com>
3779 * config/i386/sse.md (cmul<conj_op><mode>3): add new define_expand.
3780 (cmla<conj_op><mode>4): Likewise
3782 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3784 * tree-ssa-threadedge.c: Do not include
3785 gimple-ssa-evrp-analyze.h.
3786 * value-pointer-equiv.cc: Same.
3788 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3790 * tree-ssa-threadbackward.c
3791 (back_threader::maybe_register_path_dump): Abstract path dumping...
3792 (dump_path): ...here.
3793 (back_threader::resolve_phi): Call dump_path.
3796 2021-11-10 konglin1 <lingling.kong@intel.com>
3798 * config/i386/sse.md (fma_<complexpairopname>_<mode>_pair):
3799 Add new define_insn.
3800 (fma_<mode>_fmaddc_bcst): Add new define_insn_and_split.
3801 (fma_<mode>_fcmaddc_bcst): Likewise
3803 2021-11-10 liuhongt <hongtao.liu@intel.com>
3806 * match.pd: Simplify (trunc)fmax/fmin((extend)a, (extend)b) to
3809 2021-11-10 Andrew Pinski <apinski@marvell.com>
3812 * config/aarch64/aarch64.c (aarch64_evpc_ins): Don't use target
3813 as an input, use original one.
3815 2021-11-10 Sandra Loosemore <sandra@codesourcery.com>
3817 * config/nios2/nios2.c (nios2_can_inline_p): New.
3818 (TARGET_CAN_INLINE_P): Define.
3820 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
3822 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin):
3823 Disable gimple fold for RS6000_BIF_{XVMINDP,XVMINSP,VMINFP} and
3824 RS6000_BIF_{XVMAXDP,XVMAXSP,VMAXFP} when fast-math is not set.
3825 (lxvrse_expand_builtin): Modify the expansion for sign extension.
3826 All extensions are done within VSX registers.
3828 2021-11-09 Andrew Pinski <apinski@marvell.com>
3830 PR tree-optimization/10352
3831 * match.pd: Remove check of TYPE_PRECISION for
3834 2021-11-09 Andrew MacLeod <amacleod@redhat.com>
3836 * function.c (allocate_struct_function): Don't set x_range_query.
3837 * function.h (get_range_query): Move to value-query.h.
3838 * gimple-range.cc (enable_ranger): Check that query is currently NULL.
3839 (disable_ranger): Clear function current query field.
3840 * value-query.cc (get_global_range_query): Relocate to:
3841 * value-query.h (get_global_range_query): Here and inline.
3842 (get_range_query): Relocate here from function.h.
3844 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3846 * tree-ssa-threadbackward.c
3847 (back_threader::maybe_register_path_dump): New.
3848 (back_threader::maybe_register_path): Call maybe_register_path_dump.
3850 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3852 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
3853 Return NULL when unprofitable.
3855 2021-11-09 Martin Jambor <mjambor@suse.cz>
3857 * tree.h (build_debug_expr_decl): Declare.
3858 * tree.c (build_debug_expr_decl): New function.
3859 * cfgexpand.c (avoid_deep_ter_for_debug): Use build_debug_expr_decl
3860 instead of building a DEBUG_EXPR_DECL.
3861 * ipa-param-manipulation.c
3862 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
3863 * omp-simd-clone.c (ipa_simd_modify_stmt_ops): Likewise.
3864 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Likewise.
3865 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
3866 * tree-ssa-reassoc.c (make_new_ssa_for_def): Likewise.
3868 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
3870 * config/rs6000/rs6000-call.c (rs6000_type_string): New function.
3871 (def_builtin): Change debug formatting for easier parsing and
3872 include more information.
3873 (rs6000_init_builtins): Add dump of autogenerated builtins.
3874 (altivec_init_builtins): Dump __builtin_altivec_mask_for_load for
3877 2021-11-09 Marek Polacek <polacek@redhat.com>
3879 * ipa-modref.c (analyze_parms): Fix a typo.
3881 2021-11-09 Kito Cheng <kito.cheng@sifive.com>
3884 * common/config/riscv/riscv-common.c (multi_letter_subset_rank): Remove
3885 assertion for Z*-ext.
3887 2021-11-09 Jan Hubicka <hubicka@ucw.cz>
3889 * ipa-modref.c (analyze_parms): Add past_flags, past_retslot_flags
3890 and past_static_chain; merge past summary with current one.
3891 (analyze_function): Update.
3893 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3895 * doc/invoke.texi (Invoking GCC): Document --param=threader-debug.
3897 2021-11-09 Martin Liska <mliska@suse.cz>
3899 * print-rtl.c (rtx_writer::rtx_writer): Make the compilation
3900 conditional based on
3901 * print-rtl.h (class rtx_writer): Likewise.
3903 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
3905 * input.h (location_hash): Use 'BUILTINS_LOCATION' as spare value
3906 for 'Deleted'. Turn into a '#define'.
3908 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3910 * gimple-range-path.cc (path_range_query::dump): Clean up.
3911 (path_range_query::compute_ranges): Same.
3912 * value-relation.cc (path_oracle::dump): Same.
3914 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3916 * dumpfile.c (dump_options): Remove TDF_THREADING entry.
3917 * dumpfile.h (enum dump_flag): Remove TDF_THREADING and adjust
3919 * flag-types.h (enum threader_debug): New.
3920 * gimple-range-path.cc (DEBUG_SOLVER): Use param_threader_debug.
3921 * params.opt: Add entry for --param=threader-debug=.
3923 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
3925 PR middle-end/101204
3927 * diagnostic-spec.h (typedef xint_hash_t): Turn into...
3928 (struct xint_hash_t): ... this.
3929 * doc/gty.texi: Update.
3931 2021-11-09 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
3933 * config/arm/arm-cpus.in (armv9): New define.
3934 (ARMv9a): New group.
3935 (armv9-a): New arch definition.
3936 * config/arm/arm-tables.opt: Regenerate.
3937 * config/arm/arm.h (BASE_ARCH_9A): New arch enum value.
3938 * config/arm/t-aprofile: Added armv9-a and armv9+simd.
3939 * config/arm/t-arm-elf: Added arm9-a, v9_fps and all_v9_archs
3940 to MULTILIB_MATCHES.
3941 * config/arm/t-multilib: Added v9_a_nosimd_variants and
3942 v9_a_simd_variants to MULTILIB_MATCHES.
3943 * doc/invoke.texi: Update docs.
3945 2021-11-09 Martin Jambor <mjambor@suse.cz>
3948 * ipa-param-manipulation.c (replace_with_mapped_expr): Early
3949 return with error_mark_mode when part of expression is mapped to
3951 (ipa_param_body_adjustments::remap_with_debug_expressions): Set
3952 mapped value to NULL if walk_tree returns error_mark_mode.
3954 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3956 * tree-ssa-strlen.c (compare_nonzero_chars): Pass statement
3958 (get_addr_stridx): Same.
3960 (get_range_strlen_dynamic): Same.
3961 (handle_builtin_strlen): Same.
3962 (handle_builtin_strchr): Same.
3963 (handle_builtin_strcpy): Same.
3964 (maybe_diag_stxncpy_trunc): Same.
3965 (handle_builtin_stxncpy_strncat): Same.
3966 (handle_builtin_memcpy): Same.
3967 (handle_builtin_strcat): Same.
3968 (handle_alloc_call): Same.
3969 (handle_builtin_memset): Same.
3970 (handle_builtin_string_cmp): Same.
3971 (handle_pointer_plus): Same.
3972 (count_nonzero_bytes_addr): Same.
3973 (count_nonzero_bytes): Same.
3974 (handle_store): Same.
3975 (fold_strstr_to_strncmp): Same.
3976 (handle_integral_assign): Same.
3977 (check_and_optimize_stmt): Same.
3978 (class strlen_dom_walker): Replace evrp with ranger.
3979 (strlen_dom_walker::before_dom_children): Remove evrp.
3980 (strlen_dom_walker::after_dom_children): Remove evrp.
3981 * gimple-ssa-warn-access.cc (maybe_check_access_sizes):
3982 Restrict sprintf output.
3984 2021-11-09 Martin Liska <mliska@suse.cz>
3986 * genconditions.c (write_one_condition): Add const qualifier
3989 2021-11-09 Jeff Law <jeffreyalaw@gmail.com>
3991 * match.pd: New pattern to simplify (1 << n) & M ==/!= 0 for M
3994 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
3996 * config/rs6000/rs6000-builtin-new.def (VEC_INIT_V16QI): Use
3997 escape-newline support.
3998 (VEC_INIT_V4SI): Likewise.
3999 (VEC_INIT_V8HI): Likewise.
4000 (PACK_V1TI): Likewise.
4002 (VFIRSTMISMATCHOREOSINDEX_V16QI): Likewise.
4003 (VFIRSTMISMATCHOREOSINDEX_V8HI): Likewise.
4004 (VFIRSTMISMATCHOREOSINDEX_V4SI): Likewise.
4008 (FMAF128_ODD): Likewise.
4009 (VSCEQPUO): Likewise.
4011 (VSIEQPF): Likewise.
4012 (VSTDCQP): Likewise.
4013 (PACK_TD): Likewise.
4014 (TABORTDC): Likewise.
4015 (TABORTDCI): Likewise.
4016 (SE_LXVRBX): Likewise.
4017 (SE_LXVRHX): Likewise.
4018 (SE_LXVRWX): Likewise.
4019 (SE_LXVRDX): Likewise.
4020 (VREPLACE_UN_UV2DI): Likewise.
4021 (VREPLACE_UN_UV4SI): Likewise.
4022 (VREPLACE_UN_V2DI): Likewise.
4023 (VREPLACE_ELT_UV2DI): Likewise.
4024 (VREPLACE_ELT_V2DI): Likewise.
4025 (ZE_LXVRBX): Likewise.
4026 (ZE_LXVRHX): Likewise.
4027 (ZE_LXVRWX): Likewise.
4028 (ZE_LXVRDX): Likewise.
4030 (CNTLZDM): Likewise.
4031 (CNTTZDM): Likewise.
4034 (PMXVBF16GER2): Likewise.
4035 (PMXVBF16GER2_INTERNAL): Likewise.
4036 (PMXVBF16GER2NN): Likewise.
4037 (PMXVBF16GER2NN_INTERNAL): Likewise.
4038 (PMXVBF16GER2NP): Likewise.
4039 (PMXVBF16GER2NP_INTERNAL): Likewise.
4040 (PMXVBF16GER2PN): Likewise.
4041 (PMXVBF16GER2PN_INTERNAL): Likewise.
4042 (PMXVBF16GER2PP): Likewise.
4043 (PMXVBF16GER2PP_INTERNAL): Likewise.
4044 (PMXVF16GER2): Likewise.
4045 (PMXVF16GER2_INTERNAL): Likewise.
4046 (PMXVF16GER2NN): Likewise.
4047 (PMXVF16GER2NN_INTERNAL): Likewise.
4048 (PMXVF16GER2NP): Likewise.
4049 (PMXVF16GER2NP_INTERNAL): Likewise.
4050 (PMXVF16GER2PN): Likewise.
4051 (PMXVF16GER2PN_INTERNAL): Likewise.
4052 (PMXVF16GER2PP): Likewise.
4053 (PMXVF16GER2PP_INTERNAL): Likewise.
4054 (PMXVF32GER_INTERNAL): Likewise.
4055 (PMXVF32GERNN): Likewise.
4056 (PMXVF32GERNN_INTERNAL): Likewise.
4057 (PMXVF32GERNP): Likewise.
4058 (PMXVF32GERNP_INTERNAL): Likewise.
4059 (PMXVF32GERPN): Likewise.
4060 (PMXVF32GERPN_INTERNAL): Likewise.
4061 (PMXVF32GERPP): Likewise.
4062 (PMXVF32GERPP_INTERNAL): Likewise.
4063 (PMXVF64GER): Likewise.
4064 (PMXVF64GER_INTERNAL): Likewise.
4065 (PMXVF64GERNN): Likewise.
4066 (PMXVF64GERNN_INTERNAL): Likewise.
4067 (PMXVF64GERNP): Likewise.
4068 (PMXVF64GERNP_INTERNAL): Likewise.
4069 (PMXVF64GERPN): Likewise.
4070 (PMXVF64GERPN_INTERNAL): Likewise.
4071 (PMXVF64GERPP): Likewise.
4072 (PMXVF64GERPP_INTERNAL): Likewise.
4073 (PMXVI16GER2): Likewise.
4074 (PMXVI16GER2_INTERNAL): Likewise.
4075 (PMXVI16GER2PP): Likewise.
4076 (PMXVI16GER2PP_INTERNAL): Likewise.
4077 (PMXVI16GER2S): Likewise.
4078 (PMXVI16GER2S_INTERNAL): Likewise.
4079 (PMXVI16GER2SPP): Likewise.
4080 (PMXVI16GER2SPP_INTERNAL): Likewise.
4081 (PMXVI4GER8): Likewise.
4082 (PMXVI4GER8_INTERNAL): Likewise.
4083 (PMXVI4GER8PP): Likewise.
4084 (PMXVI4GER8PP_INTERNAL): Likewise.
4085 (PMXVI8GER4): Likewise.
4086 (PMXVI8GER4_INTERNAL): Likewise.
4087 (PMXVI8GER4PP): Likewise.
4088 (PMXVI8GER4PP_INTERNAL): Likewise.
4089 (PMXVI8GER4SPP): Likewise.
4090 (PMXVI8GER4SPP_INTERNAL): Likewise.
4091 * config/rs6000/rs6000-gen-builtins.c (MAXLINES): New macro.
4092 (linebuf): Increase size.
4093 (lines): New variable.
4094 (lastline): Likewise.
4095 (real_line_pos): New function.
4096 (diag): Change signature.
4097 (bif_diag): Change signature; support escape-newline handling.
4098 (ovld_diag): Likewise.
4099 (fatal): Move earlier.
4100 (consume_whitespace): Adjust diag call.
4101 (advance_line): Add escape-newline handling; call fatal.
4102 (safe_inc_pos): Adjust diag call.
4103 (match_identifier): Likewise.
4104 (match_integer): Likewise.
4105 (match_to_right_bracket): Call fatal instead of diag; adjust diag
4107 (match_basetype): Adjust diag calls.
4108 (match_bracketed_pair): Likewise.
4109 (match_const_restriction): Likewise.
4110 (match_type): Likewise.
4111 (parse_args): Likewise.
4112 (parse_bif_attrs): Likewise.
4113 (complete_vector_type): Likewise.
4114 (complete_base_type): Likewise.
4115 (parse_prototype): Likewise.
4116 (parse_bif_entry): Likewise.
4117 (parse_bif_stanza): Likewise.
4118 (parse_ovld_entry): Likewise.
4119 (parse_ovld_stanza): Likewise.
4120 (main): Allocate buffers for lines[].
4122 2021-11-08 Eric Botcazou <ebotcazou@adacore.com>
4124 * config/i386/i386.c (legitimize_pic_address): Adjust comment and
4125 use the REG argument on the CM_LARGE_PIC code path as well.
4126 * config/i386/predicates.md (gotoff_operand): Do not treat VxWorks
4127 specially with the large code models.
4129 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4131 * gimple.c (gimple_call_static_chain_flags): Revert the workaround
4132 allowing interposition since issues with binds_to_local_def were
4135 2021-11-08 Andrew MacLeod <amacleod@redhat.com>
4137 PR tree-optimization/103122
4138 * gimple-range.cc (gimple_ranger::range_of_expr): Request the cache
4139 entry with "calulate new values" set to false.
4141 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4143 * builtins.c (is_simple_builtin): Add builitin_dwarf_cfa
4144 and builtin_return_address.
4145 (builtin_fnspec): Annotate builtin_return,
4146 bulitin_eh_pointer, builtin_eh_filter, builtin_unwind_resume,
4147 builtin_cxa_end_cleanup, builtin_eh_copy_values,
4148 builtin_frame_address, builtin_apply_args,
4149 builtin_asan_before_dynamic_init, builtin_asan_after_dynamic_init,
4150 builtin_prefetch, builtin_dwarf_cfa, builtin_return_addrss
4152 * ipa-pure-const.c (special_builtin_state): Add builtin_dwarf_cfa
4153 and builtin_return_address.
4155 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4157 PR tree-optimization/103177
4158 * passes.def: Move uncprop after pure/const and modref.
4160 2021-11-08 Martin Jambor <mjambor@suse.cz>
4164 * tree-inline.c (remap_gimple_stmt): Unshare the expression without
4165 location before invoking remap_with_debug_expressions on it.
4166 * ipa-param-manipulation.c
4167 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
4169 2021-11-08 David Edelsohn <dje.gcc@gmail.com>
4171 * config/rs6000/vsx.md (vsx_splat_v4si_di): Revert "wa"
4174 2021-11-08 Richard Biener <rguenther@suse.de>
4176 * cfgloop.c (verify_loop_structure): Use a temporary BB flag
4177 instead of an sbitmap to cache irreducible state.
4179 2021-11-08 Aldy Hernandez <aldyh@redhat.com>
4181 PR tree-optimization/103120
4182 * gimple-range-path.cc (path_range_query::range_defined_in_block):
4183 Bail if there's a cache entry.
4185 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
4187 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Use
4188 rs6000_builtin_decls_x when appropriate.
4189 (add_condition_to_bb): Likewise.
4190 (rs6000_atomic_assign_expand_fenv): Likewise.
4192 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
4194 * config/rs6000/rs6000-call.c (rs6000_new_builtin_decl): New function.
4195 (rs6000_builtin_decl): Call it.
4197 2021-11-08 Martin Liska <mliska@suse.cz>
4199 * cgraph.c (cgraph_node::dump): Dump it from decl.
4201 2021-11-08 Martin Liska <mliska@suse.cz>
4203 PR gcov-profile/100520
4204 * coverage.c (coverage_compute_profile_id): Strip .gk when
4205 compare debug is used.
4206 * system.h (endswith): New function.
4208 2021-11-08 Martin Liska <mliska@suse.cz>
4210 * cgraph.c (cgraph_node::dump): Dump static_chain_decl.
4212 2021-11-08 Thomas Schwinge <thomas@codesourcery.com>
4214 * config/rs6000/rbtree.c: Fix 'Copyright (C) 2020-21' into '2020-2021'
4215 * config/rs6000/rbtree.h: Likewise.
4216 * config/rs6000/rs6000-builtin-new.def: Likewise.
4217 * config/rs6000/rs6000-gen-builtins.c: Likewise.
4218 * config/rs6000/rs6000-overload.def: Likewise.
4220 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
4222 * config/aarch64/aarch64.c (aarch64_ldn_stn_vectors): New function.
4223 (aarch64_address_cost): Use it instead of testing for CImode and
4226 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
4228 * genmodes.c (emit_insn_modes_h): Define NUM_MODE_* macros.
4229 * expmed.h (NUM_MODE_INT): Delete in favor of genmodes definitions.
4230 (NUM_MODE_PARTIAL_INT, NUM_MODE_VECTOR_INT): Likewise.
4231 * real.h (real_format_for_mode): Use NUM_MODE_FLOAT and
4232 NUM_MODE_DECIMAL_FLOAT.
4233 (REAL_MODE_FORMAT): Likewise.
4235 2021-11-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
4237 * tree-vect-loop.c (vect_better_loop_vinfo_p): Change how epilogue loop
4240 2021-11-08 Richard Biener <rguenther@suse.de>
4242 * tree-vectorizer.h (vect_create_loop_vinfo): Add main_loop_info
4244 * tree-vect-loop.c (vect_create_loop_vinfo): Likewise. Set
4245 LOOP_VINFO_ORIG_LOOP_INFO and conditionalize set of
4246 LOOP_VINFO_NITERS_ASSUMPTIONS.
4247 (vect_analyze_loop_1): Adjust.
4248 (vect_analyze_loop): Move loop constraint setting and
4249 SCEV/niter reset here from vect_create_loop_vinfo to perform
4251 (vect_analyze_loop_form): Move dumping of symbolic niters
4252 here from vect_create_loop_vinfo.
4254 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4256 * ipa-modref-tree.h (enum modref_special_parms): New enum.
4257 (struct modref_access_node): update for special parms.
4258 (struct modref_ref_node): Likewise.
4259 (struct modref_parm_map): Likewise.
4260 (struct modref_tree): Likewise.
4261 * ipa-modref.c (dump_access): Likewise.
4262 (get_access): Detect static chain.
4263 (parm_map_for_arg): Take tree as arg instead of
4265 (merge_call_side_effects): Compute map for static chain.
4266 (process_fnspec): Update.
4267 (struct escape_point): Remove retslot_arg and static_chain_arg.
4268 (analyze_parms): Update.
4269 (compute_parm_map): Update.
4270 (propagate_unknown_call): Update.
4271 (modref_propagate_in_scc): Update.
4272 (modref_merge_call_site_flags): Update.
4273 (ipa_merge_modref_summary_after_inlining): Update.
4274 * tree-ssa-alias.c (modref_may_conflict): Handle static chain.
4275 * ipa-modref-tree.c (test_merge): Update.
4277 2021-11-08 Haochen Gui <guihaoc@gcc.gnu.org>
4279 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin): Disable
4280 gimple fold for VSX_BUILTIN_XVMINDP, ALTIVEC_BUILTIN_VMINFP,
4281 VSX_BUILTIN_XVMAXDP, ALTIVEC_BUILTIN_VMAXFP when fast-math is not
4284 2021-11-08 liuhongt <hongtao.liu@intel.com>
4286 PR tree-optimization/103077
4287 * doc/invoke.texi (Options That Control Optimization):
4288 Update documentation for -ftree-loop-vectorize and
4289 -ftree-slp-vectorize which are enabled by default at -02.
4291 2021-11-08 liuhongt <hongtao.liu@intel.com>
4294 * match.pd (Simplifcation (trunc)copysign((extend)a, (extend)b)
4295 to .COPYSIGN (a, b)): Add !HONOR_SNANS.
4297 2021-11-08 liuhongt <hongtao.liu@intel.com>
4300 * match.pd: Simplify
4301 (trunc)fma ((extend)a, (extend)b, (extend)c) to IFN_FMA (a, b,
4302 c) under flag_unsafe_math_optimizations.
4304 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4307 * params.opt (modref-max-depth): Add range.
4308 (modref-max-adjustments): Fix range.
4310 2021-11-07 Aldy Hernandez <aldyh@redhat.com>
4312 * tree-pass.h (make_pass_vrp_threader): Remove.
4313 * tree-ssa-threadbackward.c
4314 (back_threader_profitability::profitable_path_p): Remove
4315 ASSERT_EXPR references.
4316 * tree-ssa-threadedge.c (jt_state::register_equivs_stmt): Same.
4317 * tree-vrp.c (vrp_folder::simplify_casted_conds): Same.
4318 (execute_vrp): Same.
4319 (class hybrid_threader): Remove.
4320 (hybrid_threader::hybrid_threader): Remove.
4321 (hybrid_threader::~hybrid_threader): Remove.
4322 (hybrid_threader::before_dom_children): Remove.
4323 (hybrid_threader::after_dom_children): Remove.
4324 (execute_vrp_threader): Remove.
4325 (class pass_vrp_threader): Remove.
4326 (make_pass_vrp_threader): Remove.
4328 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4330 * gimple.c (gimple_call_arg_flags): Use interposable_eaf_flags.
4331 (gimple_call_retslot_flags): Likewise.
4332 (gimple_call_static_chain_flags): Likewise.
4333 * ipa-modref.c (remove_useless_eaf_flags): Do not remove everything for
4335 (modref_summary::useful_p): Likewise.
4336 (modref_summary_lto::useful_p): Likewise.
4337 (analyze_parms): Do not give up on NOVOPS.
4338 (analyze_function): When dumping report chnages in EAF flags
4339 between IPA and local pass.
4340 (modref_merge_call_site_flags): Compute implicit eaf flags
4341 based on callee ecf_flags and fnspec; if the function does not
4342 bind to current defs use interposable_eaf_flags.
4343 (modref_propagate_flags_in_scc): Update.
4344 * ipa-modref.h (interposable_eaf_flags): New function.
4346 2021-11-07 Bill Schmidt <wschmidt@linux.ibm.com>
4348 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): New
4350 (rs6000_invalid_new_builtin): New function.
4351 (rs6000_expand_builtin): Call rs6000_expand_new_builtin.
4352 (rs6000_expand_ldst_mask): New function.
4353 (new_cpu_expand_builtin): Likewise.
4354 (elemrev_icode): Likewise.
4355 (ldv_expand_builtin): Likewise.
4356 (lxvrse_expand_builtin): Likewise.
4357 (lxvrze_expand_builtin): Likewise.
4358 (stv_expand_builtin): Likewise.
4359 (new_mma_expand_builtin): Likewise.
4360 (new_htm_spr_num): Likewise.
4361 (new_htm_expand_builtin): Likewise.
4362 (rs6000_expand_new_builtin): Likewise.
4363 (rs6000_init_builtins): Initialize altivec_builtin_mask_for_load.
4365 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4367 * ipa-modref.c (modref_lattice): Add do_dataflow,
4368 changed and propagate_to fields.
4369 (modref_lattice::release): Free propagate_to
4370 (modref_lattice::merge): Do not give up early on unknown
4372 (modref_lattice::merge_deref): Likewise.
4373 (modref_eaf_analysis): Update toplevel comment.
4374 (modref_eaf_analysis::analyze_ssa_name): Record postponned ssa names;
4375 do optimistic dataflow initialization.
4376 (modref_eaf_analysis::merge_with_ssa_name): Build dataflow graph.
4377 (modref_eaf_analysis::propagate): New member function.
4378 (analyze_parms): Update to new API of modref_eaf_analysis.
4380 2021-11-06 Jan Hubicka <hubicka@ucw.cz>
4382 * cgraph.h (cgraph_node::can_be_discarded_p): Do not
4383 return true on functions from other partition.
4385 2021-11-06 David Edelsohn <dje.gcc@gmail.com>
4387 * config/rs6000/vsx.md (vsx_splat_v4si): Change constraints to "wa".
4388 (vsx_splat_v4si_di): Change constraint to "wa".
4390 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
4392 PR tree-optimization/103061
4393 * value-relation.cc (path_oracle::path_oracle): Initialize
4395 (path_oracle::killing_def): Set m_killed_defs.
4396 (path_oracle::query_relation): Do not look at the root oracle for
4398 * value-relation.h (class path_oracle): Add m_killed_defs.
4400 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
4402 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
4403 Remove gotos and other cleanups.
4405 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
4408 * ipa-modref-tree.h (modref_tree::insert): Do nothing for
4409 paradoxical and zero sized accesses.
4411 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
4414 * ipa-modref-tree.h (struct modref_access_node): Avoid left shift
4417 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
4419 * config/darwin.h (ASM_GENERATE_INTERNAL_LABEL): Add LTRAMP
4420 to the list of symbol prefixes that must be made linker-
4423 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
4425 * config.host: Add support for aarch64-*-darwin.
4426 * config/aarch64/host-aarch64-darwin.c: New file.
4427 * config/aarch64/x-darwin: New file.
4429 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
4431 PR tree-optimization/103093
4432 * gimple-range-gori.cc (range_def_chain::get_imports): Remove assert.
4434 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
4436 PR tree-optimization/102943
4437 * gimple-range-cache.cc (class update_list): New.
4438 (update_list::add): Replace add_to_update.
4439 (update_list::pop): New.
4440 (ranger_cache::ranger_cache): Adjust.
4441 (ranger_cache::~ranger_cache): Adjust.
4442 (ranger_cache::add_to_update): Delete.
4443 (ranger_cache::propagate_cache): Adjust to new class.
4444 (ranger_cache::propagate_updated_value): Ditto.
4445 (ranger_cache::fill_block_cache): Ditto.
4446 * gimple-range-cache.h (class ranger_cache): Adjust to update class.
4448 2021-11-05 Richard Biener <rguenther@suse.de>
4450 * tree-vect-loop.c (vect_analyze_loop): Remove obsolete
4451 comment and expand on another one. Combine nested if.
4453 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
4456 * config/pa/pa-modes.def: Add OImode integer type.
4457 * config/pa/pa.c (pa_scalar_mode_supported_p): Allow TImode
4459 * config/pa/pa.h (MIN_UNITS_PER_WORD) Define to MIN_UNITS_PER_WORD
4460 to UNITS_PER_WORD if IN_LIBGCC2.
4461 * config/pa/pa.md (addti3, addvti3, subti3, subvti3, negti2,
4462 negvti2, ashlti3, shrpd_internal): New patterns.
4463 Change some multi instruction types to multi.
4465 2021-11-05 Jakub Jelinek <jakub@redhat.com>
4468 * config/i386/i386.h
4469 (stringop_algs::stringop_strategy::stringop_strategy): Make the ctor
4472 2021-11-05 Wilco Dijkstra <wdijkstr@arm.com>
4475 * config/aarch64/aarch64.c (aarch64_mov_operand_p): Strip the salt
4477 * config/aarch64/constraints.md: Support const in Usw.
4479 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
4481 * config/pa/pa.h (PREFERRED_DEBUGGING_TYPE): Define to DWARF2_DEBUG.
4482 * config/pa/pa64-hpux.h (PREFERRED_DEBUGGING_TYPE): Remove define.
4484 2021-11-05 Richard Biener <rguenther@suse.de>
4486 * tree-vectorizer.h (vec_info_shared::n_stmts): Add.
4487 (LOOP_VINFO_N_STMTS): Likewise.
4488 (vec_info_for_bb): Remove unused function.
4489 * tree-vectorizer.c (vec_info_shared::vec_info_shared):
4490 Initialize n_stmts member.
4491 * tree-vect-loop.c: Remove INCLUDE_FUNCTIONAL.
4492 (vect_create_loop_vinfo): Do not set loop->aux.
4493 (vect_analyze_loop_2): Do not get n_stmts as argument,
4494 instead use LOOP_VINFO_N_STMTS. Set LOOP_VINFO_VECTORIZABLE_P
4496 (vect_analyze_loop_1): Remove callback, get the mode iterator
4497 and autodetected_vector_mode as argument, advancing the
4498 iterator and initializing autodetected_vector_mode here.
4499 (vect_analyze_loop): Split analysis loop into two, first
4500 processing main loops only and then epilogues.
4502 2021-11-05 Martin Jambor <mjambor@suse.cz>
4504 * ipa-prop.c (compute_complex_assign_jump_func): Remove
4505 unnecessary check for RECORD_TYPE.
4507 2021-11-05 Gerald Pfeifer <gerald@pfeifer.com>
4509 * doc/install.texi2html: Do not generate old.html any longer.
4511 2021-11-05 Martin Liska <mliska@suse.cz>
4514 * opts.c (finish_options): Reset flag_gtoggle when it is used.
4516 2021-11-05 Jakub Jelinek <jakub@redhat.com>
4519 * dwarf2out.c (add_const_value_attribute): Add MODE argument, use it
4520 in CONST_WIDE_INT handling. Adjust recursive calls.
4521 (add_location_or_const_value_attribute): Pass DECL_MODE (decl) to
4522 new add_const_value_attribute argument.
4523 (tree_add_const_value_attribute): Pass TYPE_MODE (type) to new
4524 add_const_value_attribute argument.
4526 2021-11-05 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
4528 * config/vx-common.h: Test value of TARGET_VXWORKS7 rather
4531 2021-11-05 Richard Biener <rguenther@suse.de>
4533 * tree-vectorizer.h (struct vect_loop_form_info): New.
4534 (vect_analyze_loop_form): Adjust.
4535 (vect_create_loop_vinfo): New.
4536 * tree-parloops.c (gather_scalar_reductions): Adjust for
4537 vect_analyze_loop_form API change.
4538 * tree-vect-loop.c: Include <functional>.
4539 (vect_analyze_loop_form_1): Rename to vect_analyze_loop_form,
4540 take struct vect_loop_form_info as output parameter and adjust.
4541 (vect_analyze_loop_form): Rename to vect_create_loop_vinfo and
4542 split out call to the original vect_analyze_loop_form_1.
4543 (vect_reanalyze_as_main_loop): Rename to...
4544 (vect_analyze_loop_1): ... this, factor out the call to
4545 vect_analyze_loop_form and generalize to be able to use it twice ...
4546 (vect_analyze_loop): ... here. Perform vect_analyze_loop_form
4549 2021-11-05 Xionghu Luo <luoxhu@linux.ibm.com>
4552 * config/rs6000/fusion.md: Regenerate.
4553 * config/rs6000/genfusion.pl: Fix incorrect clobber constraint.
4555 2021-11-04 Andreas Krebbel <krebbel@linux.ibm.com>
4557 * config/s390/s390.h (STACK_CHECK_MOVING_SP): New macro
4560 2021-11-04 Tamar Christina <tamar.christina@arm.com>
4562 * config/aarch64/aarch64-builtins.c
4563 (aarch64_general_gimple_fold_builtin): Add ashl, sshl, ushl, ashr,
4564 ashr_simd, lshr, lshr_simd.
4565 * config/aarch64/aarch64-simd-builtins.def (lshr): Use USHIFTIMM.
4566 * config/aarch64/arm_neon.h (vshr_n_u8, vshr_n_u16, vshr_n_u32,
4567 vshrq_n_u8, vshrq_n_u16, vshrq_n_u32, vshrq_n_u64): Fix type hack.
4569 2021-11-04 Tamar Christina <tamar.christina@arm.com>
4571 * match.pd: New negate+shift pattern.
4573 2021-11-04 Andrew MacLeod <amacleod@redhat.com>
4575 PR tree-optimization/103079
4576 * gimple-range-gori.cc (gimple_range_calc_op1): Treat undefined as
4578 (gimple_range_calc_op2): Ditto.
4580 2021-11-04 Martin Jambor <mjambor@suse.cz>
4583 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
4584 members remap_with_debug_expressions, m_dead_ssa_debug_equiv,
4585 m_dead_stmt_debug_equiv and prepare_debug_expressions. Added
4586 parameter to mark_dead_statements.
4587 * ipa-param-manipulation.c: Include tree-phinodes.h and cfgexpand.h.
4588 (ipa_param_body_adjustments::mark_dead_statements): New parameter
4589 debugstack, push into it all SSA names used in debug statements,
4590 produce m_dead_ssa_debug_equiv mapping for the removed param.
4591 (replace_with_mapped_expr): New function.
4592 (ipa_param_body_adjustments::remap_with_debug_expressions): Likewise.
4593 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
4594 (ipa_param_body_adjustments::common_initialization): Gather and
4595 procecc SSA which will be removed but are in debug statements. Simplify.
4596 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
4598 * tree-inline.c (remap_gimple_stmt): Create a debug bind when possible
4599 when avoiding a copy of an unnecessary statement. Remap removed SSA
4600 names in existing debug statements.
4601 (tree_function_versioning): Do not create DEBUG_EXPR_DECL for removed
4602 parameters if we have already done so.
4604 2021-11-04 Jan Hubicka <hubicka@ucw.cz>
4607 * gimple.c (gimple_call_static_chain_flags): Handle case when
4608 nested function does not bind locally.
4610 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4612 * config/aarch64/aarch64.c (aarch64_function_value): Generate
4613 a register rtx for Neon vector-tuple modes.
4614 (aarch64_layout_arg): Likewise.
4616 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4618 * lower-subreg.c (simple_move): Prevent decomposition if
4619 modes are not tieable.
4621 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4622 Richard Sandiford <richard.sandiford@arm.com>
4624 * config/aarch64/aarch64-builtins.c (v2x8qi_UP): Define.
4625 (v2x4hi_UP): Likewise.
4626 (v2x4hf_UP): Likewise.
4627 (v2x4bf_UP): Likewise.
4628 (v2x2si_UP): Likewise.
4629 (v2x2sf_UP): Likewise.
4630 (v2x1di_UP): Likewise.
4631 (v2x1df_UP): Likewise.
4632 (v2x16qi_UP): Likewise.
4633 (v2x8hi_UP): Likewise.
4634 (v2x8hf_UP): Likewise.
4635 (v2x8bf_UP): Likewise.
4636 (v2x4si_UP): Likewise.
4637 (v2x4sf_UP): Likewise.
4638 (v2x2di_UP): Likewise.
4639 (v2x2df_UP): Likewise.
4640 (v3x8qi_UP): Likewise.
4641 (v3x4hi_UP): Likewise.
4642 (v3x4hf_UP): Likewise.
4643 (v3x4bf_UP): Likewise.
4644 (v3x2si_UP): Likewise.
4645 (v3x2sf_UP): Likewise.
4646 (v3x1di_UP): Likewise.
4647 (v3x1df_UP): Likewise.
4648 (v3x16qi_UP): Likewise.
4649 (v3x8hi_UP): Likewise.
4650 (v3x8hf_UP): Likewise.
4651 (v3x8bf_UP): Likewise.
4652 (v3x4si_UP): Likewise.
4653 (v3x4sf_UP): Likewise.
4654 (v3x2di_UP): Likewise.
4655 (v3x2df_UP): Likewise.
4656 (v4x8qi_UP): Likewise.
4657 (v4x4hi_UP): Likewise.
4658 (v4x4hf_UP): Likewise.
4659 (v4x4bf_UP): Likewise.
4660 (v4x2si_UP): Likewise.
4661 (v4x2sf_UP): Likewise.
4662 (v4x1di_UP): Likewise.
4663 (v4x1df_UP): Likewise.
4664 (v4x16qi_UP): Likewise.
4665 (v4x8hi_UP): Likewise.
4666 (v4x8hf_UP): Likewise.
4667 (v4x8bf_UP): Likewise.
4668 (v4x4si_UP): Likewise.
4669 (v4x4sf_UP): Likewise.
4670 (v4x2di_UP): Likewise.
4671 (v4x2df_UP): Likewise.
4672 (TYPES_GETREGP): Delete.
4673 (TYPES_SETREGP): Likewise.
4674 (TYPES_LOADSTRUCT_U): Define.
4675 (TYPES_LOADSTRUCT_P): Likewise.
4676 (TYPES_LOADSTRUCT_LANE_U): Likewise.
4677 (TYPES_LOADSTRUCT_LANE_P): Likewise.
4678 (TYPES_STORE1P): Move for consistency.
4679 (TYPES_STORESTRUCT_U): Define.
4680 (TYPES_STORESTRUCT_P): Likewise.
4681 (TYPES_STORESTRUCT_LANE_U): Likewise.
4682 (TYPES_STORESTRUCT_LANE_P): Likewise.
4683 (aarch64_simd_tuple_types): Define.
4684 (aarch64_lookup_simd_builtin_type): Handle tuple type lookup.
4685 (aarch64_init_simd_builtin_functions): Update frontend lookup
4686 for builtin functions after handling arm_neon.h pragma.
4687 (register_tuple_type): Manually set modes of single-integer
4688 tuple types. Record tuple types.
4689 * config/aarch64/aarch64-modes.def
4690 (ADV_SIMD_D_REG_STRUCT_MODES): Define D-register tuple modes.
4691 (ADV_SIMD_Q_REG_STRUCT_MODES): Define Q-register tuple modes.
4692 (SVE_MODES): Give single-vector modes priority over vector-
4694 (VECTOR_MODES_WITH_PREFIX): Set partial-vector mode order to
4695 be after all single-vector modes.
4696 * config/aarch64/aarch64-simd-builtins.def: Update builtin
4697 generator macros to reflect modifications to the backend
4699 * config/aarch64/aarch64-simd.md (aarch64_simd_ld2<mode>):
4700 Use vector-tuple mode iterator and rename to...
4701 (aarch64_simd_ld2<vstruct_elt>): This.
4702 (aarch64_simd_ld2r<mode>): Use vector-tuple mode iterator and
4704 (aarch64_simd_ld2r<vstruct_elt>): This.
4705 (aarch64_vec_load_lanesoi_lane<mode>): Use vector-tuple mode
4706 iterator and rename to...
4707 (aarch64_vec_load_lanes<mode>_lane<vstruct_elt>): This.
4708 (vec_load_lanesoi<mode>): Use vector-tuple mode iterator and
4710 (vec_load_lanes<mode><vstruct_elt>): This.
4711 (aarch64_simd_st2<mode>): Use vector-tuple mode iterator and
4713 (aarch64_simd_st2<vstruct_elt>): This.
4714 (aarch64_vec_store_lanesoi_lane<mode>): Use vector-tuple mode
4715 iterator and rename to...
4716 (aarch64_vec_store_lanes<mode>_lane<vstruct_elt>): This.
4717 (vec_store_lanesoi<mode>): Use vector-tuple mode iterator and
4719 (vec_store_lanes<mode><vstruct_elt>): This.
4720 (aarch64_simd_ld3<mode>): Use vector-tuple mode iterator and
4722 (aarch64_simd_ld3<vstruct_elt>): This.
4723 (aarch64_simd_ld3r<mode>): Use vector-tuple mode iterator and
4725 (aarch64_simd_ld3r<vstruct_elt>): This.
4726 (aarch64_vec_load_lanesci_lane<mode>): Use vector-tuple mode
4727 iterator and rename to...
4728 (vec_load_lanesci<mode>): This.
4729 (aarch64_simd_st3<mode>): Use vector-tuple mode iterator and
4731 (aarch64_simd_st3<vstruct_elt>): This.
4732 (aarch64_vec_store_lanesci_lane<mode>): Use vector-tuple mode
4733 iterator and rename to...
4734 (vec_store_lanesci<mode>): This.
4735 (aarch64_simd_ld4<mode>): Use vector-tuple mode iterator and
4737 (aarch64_simd_ld4<vstruct_elt>): This.
4738 (aarch64_simd_ld4r<mode>): Use vector-tuple mode iterator and
4740 (aarch64_simd_ld4r<vstruct_elt>): This.
4741 (aarch64_vec_load_lanesxi_lane<mode>): Use vector-tuple mode
4742 iterator and rename to...
4743 (vec_load_lanesxi<mode>): This.
4744 (aarch64_simd_st4<mode>): Use vector-tuple mode iterator and
4746 (aarch64_simd_st4<vstruct_elt>): This.
4747 (aarch64_vec_store_lanesxi_lane<mode>): Use vector-tuple mode
4748 iterator and rename to...
4749 (vec_store_lanesxi<mode>): This.
4750 (mov<mode>): Define for Neon vector-tuple modes.
4751 (aarch64_ld1x3<VALLDIF:mode>): Use vector-tuple mode iterator
4753 (aarch64_ld1x3<vstruct_elt>): This.
4754 (aarch64_ld1_x3_<mode>): Use vector-tuple mode iterator and
4756 (aarch64_ld1_x3_<vstruct_elt>): This.
4757 (aarch64_ld1x4<VALLDIF:mode>): Use vector-tuple mode iterator
4759 (aarch64_ld1x4<vstruct_elt>): This.
4760 (aarch64_ld1_x4_<mode>): Use vector-tuple mode iterator and
4762 (aarch64_ld1_x4_<vstruct_elt>): This.
4763 (aarch64_st1x2<VALLDIF:mode>): Use vector-tuple mode iterator
4765 (aarch64_st1x2<vstruct_elt>): This.
4766 (aarch64_st1_x2_<mode>): Use vector-tuple mode iterator and
4768 (aarch64_st1_x2_<vstruct_elt>): This.
4769 (aarch64_st1x3<VALLDIF:mode>): Use vector-tuple mode iterator
4771 (aarch64_st1x3<vstruct_elt>): This.
4772 (aarch64_st1_x3_<mode>): Use vector-tuple mode iterator and
4774 (aarch64_st1_x3_<vstruct_elt>): This.
4775 (aarch64_st1x4<VALLDIF:mode>): Use vector-tuple mode iterator
4777 (aarch64_st1x4<vstruct_elt>): This.
4778 (aarch64_st1_x4_<mode>): Use vector-tuple mode iterator and
4780 (aarch64_st1_x4_<vstruct_elt>): This.
4781 (*aarch64_mov<mode>): Define for vector-tuple modes.
4782 (*aarch64_be_mov<mode>): Likewise.
4783 (aarch64_ld<VSTRUCT:nregs>r<VALLDIF:mode>): Use vector-tuple
4784 mode iterator and rename to...
4785 (aarch64_ld<nregs>r<vstruct_elt>): This.
4786 (aarch64_ld2<mode>_dreg): Use vector-tuple mode iterator and
4788 (aarch64_ld2<vstruct_elt>_dreg): This.
4789 (aarch64_ld3<mode>_dreg): Use vector-tuple mode iterator and
4791 (aarch64_ld3<vstruct_elt>_dreg): This.
4792 (aarch64_ld4<mode>_dreg): Use vector-tuple mode iterator and
4794 (aarch64_ld4<vstruct_elt>_dreg): This.
4795 (aarch64_ld<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
4796 iterator and rename to...
4797 (aarch64_ld<nregs><vstruct_elt>): Use vector-tuple mode
4798 iterator and rename to...
4799 (aarch64_ld<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
4800 (aarch64_ld1x2<VQ:mode>): Delete.
4801 (aarch64_ld1x2<VDC:mode>): Use vector-tuple mode iterator and
4803 (aarch64_ld1x2<vstruct_elt>): This.
4804 (aarch64_ld<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
4805 tuple mode iterator and rename to...
4806 (aarch64_ld<nregs>_lane<vstruct_elt>): This.
4807 (aarch64_get_dreg<VSTRUCT:mode><VDC:mode>): Delete.
4808 (aarch64_get_qreg<VSTRUCT:mode><VQ:mode>): Likewise.
4809 (aarch64_st2<mode>_dreg): Use vector-tuple mode iterator and
4811 (aarch64_st2<vstruct_elt>_dreg): This.
4812 (aarch64_st3<mode>_dreg): Use vector-tuple mode iterator and
4814 (aarch64_st3<vstruct_elt>_dreg): This.
4815 (aarch64_st4<mode>_dreg): Use vector-tuple mode iterator and
4817 (aarch64_st4<vstruct_elt>_dreg): This.
4818 (aarch64_st<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
4819 iterator and rename to...
4820 (aarch64_st<nregs><vstruct_elt>): This.
4821 (aarch64_st<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
4822 iterator and rename to aarch64_st<nregs><vstruct_elt>.
4823 (aarch64_st<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
4824 tuple mode iterator and rename to...
4825 (aarch64_st<nregs>_lane<vstruct_elt>): This.
4826 (aarch64_set_qreg<VSTRUCT:mode><VQ:mode>): Delete.
4827 (aarch64_simd_ld1<mode>_x2): Use vector-tuple mode iterator
4829 (aarch64_simd_ld1<vstruct_elt>_x2): This.
4830 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
4831 Refactor to include new vector-tuple modes.
4832 (aarch64_classify_vector_mode): Add cases for new vector-
4834 (aarch64_advsimd_partial_struct_mode_p): Define.
4835 (aarch64_advsimd_full_struct_mode_p): Likewise.
4836 (aarch64_advsimd_vector_array_mode): Likewise.
4837 (aarch64_sve_data_mode): Change location in file.
4838 (aarch64_array_mode): Handle case of Neon vector-tuple modes.
4839 (aarch64_hard_regno_nregs): Handle case of partial Neon
4841 (aarch64_classify_address): Refactor to include handling of
4842 Neon vector-tuple modes.
4843 (aarch64_print_operand): Print "d" for "%R" for a partial
4844 Neon vector structure.
4845 (aarch64_expand_vec_perm_1): Use new vector-tuple mode.
4846 (aarch64_modes_tieable_p): Prevent tieing Neon partial struct
4847 modes with scalar machines modes larger than 8 bytes.
4848 (aarch64_can_change_mode_class): Don't allow changes between
4849 partial and full Neon vector-structure modes.
4850 * config/aarch64/arm_neon.h (vst2_lane_f16): Use updated
4851 builtin and remove boiler-plate code for opaque mode.
4852 (vst2_lane_f32): Likewise.
4853 (vst2_lane_f64): Likewise.
4854 (vst2_lane_p8): Likewise.
4855 (vst2_lane_p16): Likewise.
4856 (vst2_lane_p64): Likewise.
4857 (vst2_lane_s8): Likewise.
4858 (vst2_lane_s16): Likewise.
4859 (vst2_lane_s32): Likewise.
4860 (vst2_lane_s64): Likewise.
4861 (vst2_lane_u8): Likewise.
4862 (vst2_lane_u16): Likewise.
4863 (vst2_lane_u32): Likewise.
4864 (vst2_lane_u64): Likewise.
4865 (vst2q_lane_f16): Likewise.
4866 (vst2q_lane_f32): Likewise.
4867 (vst2q_lane_f64): Likewise.
4868 (vst2q_lane_p8): Likewise.
4869 (vst2q_lane_p16): Likewise.
4870 (vst2q_lane_p64): Likewise.
4871 (vst2q_lane_s8): Likewise.
4872 (vst2q_lane_s16): Likewise.
4873 (vst2q_lane_s32): Likewise.
4874 (vst2q_lane_s64): Likewise.
4875 (vst2q_lane_u8): Likewise.
4876 (vst2q_lane_u16): Likewise.
4877 (vst2q_lane_u32): Likewise.
4878 (vst2q_lane_u64): Likewise.
4879 (vst3_lane_f16): Likewise.
4880 (vst3_lane_f32): Likewise.
4881 (vst3_lane_f64): Likewise.
4882 (vst3_lane_p8): Likewise.
4883 (vst3_lane_p16): Likewise.
4884 (vst3_lane_p64): Likewise.
4885 (vst3_lane_s8): Likewise.
4886 (vst3_lane_s16): Likewise.
4887 (vst3_lane_s32): Likewise.
4888 (vst3_lane_s64): Likewise.
4889 (vst3_lane_u8): Likewise.
4890 (vst3_lane_u16): Likewise.
4891 (vst3_lane_u32): Likewise.
4892 (vst3_lane_u64): Likewise.
4893 (vst3q_lane_f16): Likewise.
4894 (vst3q_lane_f32): Likewise.
4895 (vst3q_lane_f64): Likewise.
4896 (vst3q_lane_p8): Likewise.
4897 (vst3q_lane_p16): Likewise.
4898 (vst3q_lane_p64): Likewise.
4899 (vst3q_lane_s8): Likewise.
4900 (vst3q_lane_s16): Likewise.
4901 (vst3q_lane_s32): Likewise.
4902 (vst3q_lane_s64): Likewise.
4903 (vst3q_lane_u8): Likewise.
4904 (vst3q_lane_u16): Likewise.
4905 (vst3q_lane_u32): Likewise.
4906 (vst3q_lane_u64): Likewise.
4907 (vst4_lane_f16): Likewise.
4908 (vst4_lane_f32): Likewise.
4909 (vst4_lane_f64): Likewise.
4910 (vst4_lane_p8): Likewise.
4911 (vst4_lane_p16): Likewise.
4912 (vst4_lane_p64): Likewise.
4913 (vst4_lane_s8): Likewise.
4914 (vst4_lane_s16): Likewise.
4915 (vst4_lane_s32): Likewise.
4916 (vst4_lane_s64): Likewise.
4917 (vst4_lane_u8): Likewise.
4918 (vst4_lane_u16): Likewise.
4919 (vst4_lane_u32): Likewise.
4920 (vst4_lane_u64): Likewise.
4921 (vst4q_lane_f16): Likewise.
4922 (vst4q_lane_f32): Likewise.
4923 (vst4q_lane_f64): Likewise.
4924 (vst4q_lane_p8): Likewise.
4925 (vst4q_lane_p16): Likewise.
4926 (vst4q_lane_p64): Likewise.
4927 (vst4q_lane_s8): Likewise.
4928 (vst4q_lane_s16): Likewise.
4929 (vst4q_lane_s32): Likewise.
4930 (vst4q_lane_s64): Likewise.
4931 (vst4q_lane_u8): Likewise.
4932 (vst4q_lane_u16): Likewise.
4933 (vst4q_lane_u32): Likewise.
4934 (vst4q_lane_u64): Likewise.
4935 (vtbl3_s8): Likewise.
4936 (vtbl3_u8): Likewise.
4937 (vtbl3_p8): Likewise.
4938 (vtbl4_s8): Likewise.
4939 (vtbl4_u8): Likewise.
4940 (vtbl4_p8): Likewise.
4941 (vld1_u8_x3): Likewise.
4942 (vld1_s8_x3): Likewise.
4943 (vld1_u16_x3): Likewise.
4944 (vld1_s16_x3): Likewise.
4945 (vld1_u32_x3): Likewise.
4946 (vld1_s32_x3): Likewise.
4947 (vld1_u64_x3): Likewise.
4948 (vld1_s64_x3): Likewise.
4949 (vld1_f16_x3): Likewise.
4950 (vld1_f32_x3): Likewise.
4951 (vld1_f64_x3): Likewise.
4952 (vld1_p8_x3): Likewise.
4953 (vld1_p16_x3): Likewise.
4954 (vld1_p64_x3): Likewise.
4955 (vld1q_u8_x3): Likewise.
4956 (vld1q_s8_x3): Likewise.
4957 (vld1q_u16_x3): Likewise.
4958 (vld1q_s16_x3): Likewise.
4959 (vld1q_u32_x3): Likewise.
4960 (vld1q_s32_x3): Likewise.
4961 (vld1q_u64_x3): Likewise.
4962 (vld1q_s64_x3): Likewise.
4963 (vld1q_f16_x3): Likewise.
4964 (vld1q_f32_x3): Likewise.
4965 (vld1q_f64_x3): Likewise.
4966 (vld1q_p8_x3): Likewise.
4967 (vld1q_p16_x3): Likewise.
4968 (vld1q_p64_x3): Likewise.
4969 (vld1_u8_x2): Likewise.
4970 (vld1_s8_x2): Likewise.
4971 (vld1_u16_x2): Likewise.
4972 (vld1_s16_x2): Likewise.
4973 (vld1_u32_x2): Likewise.
4974 (vld1_s32_x2): Likewise.
4975 (vld1_u64_x2): Likewise.
4976 (vld1_s64_x2): Likewise.
4977 (vld1_f16_x2): Likewise.
4978 (vld1_f32_x2): Likewise.
4979 (vld1_f64_x2): Likewise.
4980 (vld1_p8_x2): Likewise.
4981 (vld1_p16_x2): Likewise.
4982 (vld1_p64_x2): Likewise.
4983 (vld1q_u8_x2): Likewise.
4984 (vld1q_s8_x2): Likewise.
4985 (vld1q_u16_x2): Likewise.
4986 (vld1q_s16_x2): Likewise.
4987 (vld1q_u32_x2): Likewise.
4988 (vld1q_s32_x2): Likewise.
4989 (vld1q_u64_x2): Likewise.
4990 (vld1q_s64_x2): Likewise.
4991 (vld1q_f16_x2): Likewise.
4992 (vld1q_f32_x2): Likewise.
4993 (vld1q_f64_x2): Likewise.
4994 (vld1q_p8_x2): Likewise.
4995 (vld1q_p16_x2): Likewise.
4996 (vld1q_p64_x2): Likewise.
4997 (vld1_s8_x4): Likewise.
4998 (vld1q_s8_x4): Likewise.
4999 (vld1_s16_x4): Likewise.
5000 (vld1q_s16_x4): Likewise.
5001 (vld1_s32_x4): Likewise.
5002 (vld1q_s32_x4): Likewise.
5003 (vld1_u8_x4): Likewise.
5004 (vld1q_u8_x4): Likewise.
5005 (vld1_u16_x4): Likewise.
5006 (vld1q_u16_x4): Likewise.
5007 (vld1_u32_x4): Likewise.
5008 (vld1q_u32_x4): Likewise.
5009 (vld1_f16_x4): Likewise.
5010 (vld1q_f16_x4): Likewise.
5011 (vld1_f32_x4): Likewise.
5012 (vld1q_f32_x4): Likewise.
5013 (vld1_p8_x4): Likewise.
5014 (vld1q_p8_x4): Likewise.
5015 (vld1_p16_x4): Likewise.
5016 (vld1q_p16_x4): Likewise.
5017 (vld1_s64_x4): Likewise.
5018 (vld1_u64_x4): Likewise.
5019 (vld1_p64_x4): Likewise.
5020 (vld1q_s64_x4): Likewise.
5021 (vld1q_u64_x4): Likewise.
5022 (vld1q_p64_x4): Likewise.
5023 (vld1_f64_x4): Likewise.
5024 (vld1q_f64_x4): Likewise.
5025 (vld2_s64): Likewise.
5026 (vld2_u64): Likewise.
5027 (vld2_f64): Likewise.
5028 (vld2_s8): Likewise.
5029 (vld2_p8): Likewise.
5030 (vld2_p64): Likewise.
5031 (vld2_s16): Likewise.
5032 (vld2_p16): Likewise.
5033 (vld2_s32): Likewise.
5034 (vld2_u8): Likewise.
5035 (vld2_u16): Likewise.
5036 (vld2_u32): Likewise.
5037 (vld2_f16): Likewise.
5038 (vld2_f32): Likewise.
5039 (vld2q_s8): Likewise.
5040 (vld2q_p8): Likewise.
5041 (vld2q_s16): Likewise.
5042 (vld2q_p16): Likewise.
5043 (vld2q_p64): Likewise.
5044 (vld2q_s32): Likewise.
5045 (vld2q_s64): Likewise.
5046 (vld2q_u8): Likewise.
5047 (vld2q_u16): Likewise.
5048 (vld2q_u32): Likewise.
5049 (vld2q_u64): Likewise.
5050 (vld2q_f16): Likewise.
5051 (vld2q_f32): Likewise.
5052 (vld2q_f64): Likewise.
5053 (vld3_s64): Likewise.
5054 (vld3_u64): Likewise.
5055 (vld3_f64): Likewise.
5056 (vld3_s8): Likewise.
5057 (vld3_p8): Likewise.
5058 (vld3_s16): Likewise.
5059 (vld3_p16): Likewise.
5060 (vld3_s32): Likewise.
5061 (vld3_u8): Likewise.
5062 (vld3_u16): Likewise.
5063 (vld3_u32): Likewise.
5064 (vld3_f16): Likewise.
5065 (vld3_f32): Likewise.
5066 (vld3_p64): Likewise.
5067 (vld3q_s8): Likewise.
5068 (vld3q_p8): Likewise.
5069 (vld3q_s16): Likewise.
5070 (vld3q_p16): Likewise.
5071 (vld3q_s32): Likewise.
5072 (vld3q_s64): Likewise.
5073 (vld3q_u8): Likewise.
5074 (vld3q_u16): Likewise.
5075 (vld3q_u32): Likewise.
5076 (vld3q_u64): Likewise.
5077 (vld3q_f16): Likewise.
5078 (vld3q_f32): Likewise.
5079 (vld3q_f64): Likewise.
5080 (vld3q_p64): Likewise.
5081 (vld4_s64): Likewise.
5082 (vld4_u64): Likewise.
5083 (vld4_f64): Likewise.
5084 (vld4_s8): Likewise.
5085 (vld4_p8): Likewise.
5086 (vld4_s16): Likewise.
5087 (vld4_p16): Likewise.
5088 (vld4_s32): Likewise.
5089 (vld4_u8): Likewise.
5090 (vld4_u16): Likewise.
5091 (vld4_u32): Likewise.
5092 (vld4_f16): Likewise.
5093 (vld4_f32): Likewise.
5094 (vld4_p64): Likewise.
5095 (vld4q_s8): Likewise.
5096 (vld4q_p8): Likewise.
5097 (vld4q_s16): Likewise.
5098 (vld4q_p16): Likewise.
5099 (vld4q_s32): Likewise.
5100 (vld4q_s64): Likewise.
5101 (vld4q_u8): Likewise.
5102 (vld4q_u16): Likewise.
5103 (vld4q_u32): Likewise.
5104 (vld4q_u64): Likewise.
5105 (vld4q_f16): Likewise.
5106 (vld4q_f32): Likewise.
5107 (vld4q_f64): Likewise.
5108 (vld4q_p64): Likewise.
5109 (vld2_dup_s8): Likewise.
5110 (vld2_dup_s16): Likewise.
5111 (vld2_dup_s32): Likewise.
5112 (vld2_dup_f16): Likewise.
5113 (vld2_dup_f32): Likewise.
5114 (vld2_dup_f64): Likewise.
5115 (vld2_dup_u8): Likewise.
5116 (vld2_dup_u16): Likewise.
5117 (vld2_dup_u32): Likewise.
5118 (vld2_dup_p8): Likewise.
5119 (vld2_dup_p16): Likewise.
5120 (vld2_dup_p64): Likewise.
5121 (vld2_dup_s64): Likewise.
5122 (vld2_dup_u64): Likewise.
5123 (vld2q_dup_s8): Likewise.
5124 (vld2q_dup_p8): Likewise.
5125 (vld2q_dup_s16): Likewise.
5126 (vld2q_dup_p16): Likewise.
5127 (vld2q_dup_s32): Likewise.
5128 (vld2q_dup_s64): Likewise.
5129 (vld2q_dup_u8): Likewise.
5130 (vld2q_dup_u16): Likewise.
5131 (vld2q_dup_u32): Likewise.
5132 (vld2q_dup_u64): Likewise.
5133 (vld2q_dup_f16): Likewise.
5134 (vld2q_dup_f32): Likewise.
5135 (vld2q_dup_f64): Likewise.
5136 (vld2q_dup_p64): Likewise.
5137 (vld3_dup_s64): Likewise.
5138 (vld3_dup_u64): Likewise.
5139 (vld3_dup_f64): Likewise.
5140 (vld3_dup_s8): Likewise.
5141 (vld3_dup_p8): Likewise.
5142 (vld3_dup_s16): Likewise.
5143 (vld3_dup_p16): Likewise.
5144 (vld3_dup_s32): Likewise.
5145 (vld3_dup_u8): Likewise.
5146 (vld3_dup_u16): Likewise.
5147 (vld3_dup_u32): Likewise.
5148 (vld3_dup_f16): Likewise.
5149 (vld3_dup_f32): Likewise.
5150 (vld3_dup_p64): Likewise.
5151 (vld3q_dup_s8): Likewise.
5152 (vld3q_dup_p8): Likewise.
5153 (vld3q_dup_s16): Likewise.
5154 (vld3q_dup_p16): Likewise.
5155 (vld3q_dup_s32): Likewise.
5156 (vld3q_dup_s64): Likewise.
5157 (vld3q_dup_u8): Likewise.
5158 (vld3q_dup_u16): Likewise.
5159 (vld3q_dup_u32): Likewise.
5160 (vld3q_dup_u64): Likewise.
5161 (vld3q_dup_f16): Likewise.
5162 (vld3q_dup_f32): Likewise.
5163 (vld3q_dup_f64): Likewise.
5164 (vld3q_dup_p64): Likewise.
5165 (vld4_dup_s64): Likewise.
5166 (vld4_dup_u64): Likewise.
5167 (vld4_dup_f64): Likewise.
5168 (vld4_dup_s8): Likewise.
5169 (vld4_dup_p8): Likewise.
5170 (vld4_dup_s16): Likewise.
5171 (vld4_dup_p16): Likewise.
5172 (vld4_dup_s32): Likewise.
5173 (vld4_dup_u8): Likewise.
5174 (vld4_dup_u16): Likewise.
5175 (vld4_dup_u32): Likewise.
5176 (vld4_dup_f16): Likewise.
5177 (vld4_dup_f32): Likewise.
5178 (vld4_dup_p64): Likewise.
5179 (vld4q_dup_s8): Likewise.
5180 (vld4q_dup_p8): Likewise.
5181 (vld4q_dup_s16): Likewise.
5182 (vld4q_dup_p16): Likewise.
5183 (vld4q_dup_s32): Likewise.
5184 (vld4q_dup_s64): Likewise.
5185 (vld4q_dup_u8): Likewise.
5186 (vld4q_dup_u16): Likewise.
5187 (vld4q_dup_u32): Likewise.
5188 (vld4q_dup_u64): Likewise.
5189 (vld4q_dup_f16): Likewise.
5190 (vld4q_dup_f32): Likewise.
5191 (vld4q_dup_f64): Likewise.
5192 (vld4q_dup_p64): Likewise.
5193 (vld2_lane_u8): Likewise.
5194 (vld2_lane_u16): Likewise.
5195 (vld2_lane_u32): Likewise.
5196 (vld2_lane_u64): Likewise.
5197 (vld2_lane_s8): Likewise.
5198 (vld2_lane_s16): Likewise.
5199 (vld2_lane_s32): Likewise.
5200 (vld2_lane_s64): Likewise.
5201 (vld2_lane_f16): Likewise.
5202 (vld2_lane_f32): Likewise.
5203 (vld2_lane_f64): Likewise.
5204 (vld2_lane_p8): Likewise.
5205 (vld2_lane_p16): Likewise.
5206 (vld2_lane_p64): Likewise.
5207 (vld2q_lane_u8): Likewise.
5208 (vld2q_lane_u16): Likewise.
5209 (vld2q_lane_u32): Likewise.
5210 (vld2q_lane_u64): Likewise.
5211 (vld2q_lane_s8): Likewise.
5212 (vld2q_lane_s16): Likewise.
5213 (vld2q_lane_s32): Likewise.
5214 (vld2q_lane_s64): Likewise.
5215 (vld2q_lane_f16): Likewise.
5216 (vld2q_lane_f32): Likewise.
5217 (vld2q_lane_f64): Likewise.
5218 (vld2q_lane_p8): Likewise.
5219 (vld2q_lane_p16): Likewise.
5220 (vld2q_lane_p64): Likewise.
5221 (vld3_lane_u8): Likewise.
5222 (vld3_lane_u16): Likewise.
5223 (vld3_lane_u32): Likewise.
5224 (vld3_lane_u64): Likewise.
5225 (vld3_lane_s8): Likewise.
5226 (vld3_lane_s16): Likewise.
5227 (vld3_lane_s32): Likewise.
5228 (vld3_lane_s64): Likewise.
5229 (vld3_lane_f16): Likewise.
5230 (vld3_lane_f32): Likewise.
5231 (vld3_lane_f64): Likewise.
5232 (vld3_lane_p8): Likewise.
5233 (vld3_lane_p16): Likewise.
5234 (vld3_lane_p64): Likewise.
5235 (vld3q_lane_u8): Likewise.
5236 (vld3q_lane_u16): Likewise.
5237 (vld3q_lane_u32): Likewise.
5238 (vld3q_lane_u64): Likewise.
5239 (vld3q_lane_s8): Likewise.
5240 (vld3q_lane_s16): Likewise.
5241 (vld3q_lane_s32): Likewise.
5242 (vld3q_lane_s64): Likewise.
5243 (vld3q_lane_f16): Likewise.
5244 (vld3q_lane_f32): Likewise.
5245 (vld3q_lane_f64): Likewise.
5246 (vld3q_lane_p8): Likewise.
5247 (vld3q_lane_p16): Likewise.
5248 (vld3q_lane_p64): Likewise.
5249 (vld4_lane_u8): Likewise.
5250 (vld4_lane_u16): Likewise.
5251 (vld4_lane_u32): Likewise.
5252 (vld4_lane_u64): Likewise.
5253 (vld4_lane_s8): Likewise.
5254 (vld4_lane_s16): Likewise.
5255 (vld4_lane_s32): Likewise.
5256 (vld4_lane_s64): Likewise.
5257 (vld4_lane_f16): Likewise.
5258 (vld4_lane_f32): Likewise.
5259 (vld4_lane_f64): Likewise.
5260 (vld4_lane_p8): Likewise.
5261 (vld4_lane_p16): Likewise.
5262 (vld4_lane_p64): Likewise.
5263 (vld4q_lane_u8): Likewise.
5264 (vld4q_lane_u16): Likewise.
5265 (vld4q_lane_u32): Likewise.
5266 (vld4q_lane_u64): Likewise.
5267 (vld4q_lane_s8): Likewise.
5268 (vld4q_lane_s16): Likewise.
5269 (vld4q_lane_s32): Likewise.
5270 (vld4q_lane_s64): Likewise.
5271 (vld4q_lane_f16): Likewise.
5272 (vld4q_lane_f32): Likewise.
5273 (vld4q_lane_f64): Likewise.
5274 (vld4q_lane_p8): Likewise.
5275 (vld4q_lane_p16): Likewise.
5276 (vld4q_lane_p64): Likewise.
5277 (vqtbl2_s8): Likewise.
5278 (vqtbl2_u8): Likewise.
5279 (vqtbl2_p8): Likewise.
5280 (vqtbl2q_s8): Likewise.
5281 (vqtbl2q_u8): Likewise.
5282 (vqtbl2q_p8): Likewise.
5283 (vqtbl3_s8): Likewise.
5284 (vqtbl3_u8): Likewise.
5285 (vqtbl3_p8): Likewise.
5286 (vqtbl3q_s8): Likewise.
5287 (vqtbl3q_u8): Likewise.
5288 (vqtbl3q_p8): Likewise.
5289 (vqtbl4_s8): Likewise.
5290 (vqtbl4_u8): Likewise.
5291 (vqtbl4_p8): Likewise.
5292 (vqtbl4q_s8): Likewise.
5293 (vqtbl4q_u8): Likewise.
5294 (vqtbl4q_p8): Likewise.
5295 (vqtbx2_s8): Likewise.
5296 (vqtbx2_u8): Likewise.
5297 (vqtbx2_p8): Likewise.
5298 (vqtbx2q_s8): Likewise.
5299 (vqtbx2q_u8): Likewise.
5300 (vqtbx2q_p8): Likewise.
5301 (vqtbx3_s8): Likewise.
5302 (vqtbx3_u8): Likewise.
5303 (vqtbx3_p8): Likewise.
5304 (vqtbx3q_s8): Likewise.
5305 (vqtbx3q_u8): Likewise.
5306 (vqtbx3q_p8): Likewise.
5307 (vqtbx4_s8): Likewise.
5308 (vqtbx4_u8): Likewise.
5309 (vqtbx4_p8): Likewise.
5310 (vqtbx4q_s8): Likewise.
5311 (vqtbx4q_u8): Likewise.
5312 (vqtbx4q_p8): Likewise.
5313 (vst1_s64_x2): Likewise.
5314 (vst1_u64_x2): Likewise.
5315 (vst1_f64_x2): Likewise.
5316 (vst1_s8_x2): Likewise.
5317 (vst1_p8_x2): Likewise.
5318 (vst1_s16_x2): Likewise.
5319 (vst1_p16_x2): Likewise.
5320 (vst1_s32_x2): Likewise.
5321 (vst1_u8_x2): Likewise.
5322 (vst1_u16_x2): Likewise.
5323 (vst1_u32_x2): Likewise.
5324 (vst1_f16_x2): Likewise.
5325 (vst1_f32_x2): Likewise.
5326 (vst1_p64_x2): Likewise.
5327 (vst1q_s8_x2): Likewise.
5328 (vst1q_p8_x2): Likewise.
5329 (vst1q_s16_x2): Likewise.
5330 (vst1q_p16_x2): Likewise.
5331 (vst1q_s32_x2): Likewise.
5332 (vst1q_s64_x2): Likewise.
5333 (vst1q_u8_x2): Likewise.
5334 (vst1q_u16_x2): Likewise.
5335 (vst1q_u32_x2): Likewise.
5336 (vst1q_u64_x2): Likewise.
5337 (vst1q_f16_x2): Likewise.
5338 (vst1q_f32_x2): Likewise.
5339 (vst1q_f64_x2): Likewise.
5340 (vst1q_p64_x2): Likewise.
5341 (vst1_s64_x3): Likewise.
5342 (vst1_u64_x3): Likewise.
5343 (vst1_f64_x3): Likewise.
5344 (vst1_s8_x3): Likewise.
5345 (vst1_p8_x3): Likewise.
5346 (vst1_s16_x3): Likewise.
5347 (vst1_p16_x3): Likewise.
5348 (vst1_s32_x3): Likewise.
5349 (vst1_u8_x3): Likewise.
5350 (vst1_u16_x3): Likewise.
5351 (vst1_u32_x3): Likewise.
5352 (vst1_f16_x3): Likewise.
5353 (vst1_f32_x3): Likewise.
5354 (vst1_p64_x3): Likewise.
5355 (vst1q_s8_x3): Likewise.
5356 (vst1q_p8_x3): Likewise.
5357 (vst1q_s16_x3): Likewise.
5358 (vst1q_p16_x3): Likewise.
5359 (vst1q_s32_x3): Likewise.
5360 (vst1q_s64_x3): Likewise.
5361 (vst1q_u8_x3): Likewise.
5362 (vst1q_u16_x3): Likewise.
5363 (vst1q_u32_x3): Likewise.
5364 (vst1q_u64_x3): Likewise.
5365 (vst1q_f16_x3): Likewise.
5366 (vst1q_f32_x3): Likewise.
5367 (vst1q_f64_x3): Likewise.
5368 (vst1q_p64_x3): Likewise.
5369 (vst1_s8_x4): Likewise.
5370 (vst1q_s8_x4): Likewise.
5371 (vst1_s16_x4): Likewise.
5372 (vst1q_s16_x4): Likewise.
5373 (vst1_s32_x4): Likewise.
5374 (vst1q_s32_x4): Likewise.
5375 (vst1_u8_x4): Likewise.
5376 (vst1q_u8_x4): Likewise.
5377 (vst1_u16_x4): Likewise.
5378 (vst1q_u16_x4): Likewise.
5379 (vst1_u32_x4): Likewise.
5380 (vst1q_u32_x4): Likewise.
5381 (vst1_f16_x4): Likewise.
5382 (vst1q_f16_x4): Likewise.
5383 (vst1_f32_x4): Likewise.
5384 (vst1q_f32_x4): Likewise.
5385 (vst1_p8_x4): Likewise.
5386 (vst1q_p8_x4): Likewise.
5387 (vst1_p16_x4): Likewise.
5388 (vst1q_p16_x4): Likewise.
5389 (vst1_s64_x4): Likewise.
5390 (vst1_u64_x4): Likewise.
5391 (vst1_p64_x4): Likewise.
5392 (vst1q_s64_x4): Likewise.
5393 (vst1q_u64_x4): Likewise.
5394 (vst1q_p64_x4): Likewise.
5395 (vst1_f64_x4): Likewise.
5396 (vst1q_f64_x4): Likewise.
5397 (vst2_s64): Likewise.
5398 (vst2_u64): Likewise.
5399 (vst2_f64): Likewise.
5400 (vst2_s8): Likewise.
5401 (vst2_p8): Likewise.
5402 (vst2_s16): Likewise.
5403 (vst2_p16): Likewise.
5404 (vst2_s32): Likewise.
5405 (vst2_u8): Likewise.
5406 (vst2_u16): Likewise.
5407 (vst2_u32): Likewise.
5408 (vst2_f16): Likewise.
5409 (vst2_f32): Likewise.
5410 (vst2_p64): Likewise.
5411 (vst2q_s8): Likewise.
5412 (vst2q_p8): Likewise.
5413 (vst2q_s16): Likewise.
5414 (vst2q_p16): Likewise.
5415 (vst2q_s32): Likewise.
5416 (vst2q_s64): Likewise.
5417 (vst2q_u8): Likewise.
5418 (vst2q_u16): Likewise.
5419 (vst2q_u32): Likewise.
5420 (vst2q_u64): Likewise.
5421 (vst2q_f16): Likewise.
5422 (vst2q_f32): Likewise.
5423 (vst2q_f64): Likewise.
5424 (vst2q_p64): Likewise.
5425 (vst3_s64): Likewise.
5426 (vst3_u64): Likewise.
5427 (vst3_f64): Likewise.
5428 (vst3_s8): Likewise.
5429 (vst3_p8): Likewise.
5430 (vst3_s16): Likewise.
5431 (vst3_p16): Likewise.
5432 (vst3_s32): Likewise.
5433 (vst3_u8): Likewise.
5434 (vst3_u16): Likewise.
5435 (vst3_u32): Likewise.
5436 (vst3_f16): Likewise.
5437 (vst3_f32): Likewise.
5438 (vst3_p64): Likewise.
5439 (vst3q_s8): Likewise.
5440 (vst3q_p8): Likewise.
5441 (vst3q_s16): Likewise.
5442 (vst3q_p16): Likewise.
5443 (vst3q_s32): Likewise.
5444 (vst3q_s64): Likewise.
5445 (vst3q_u8): Likewise.
5446 (vst3q_u16): Likewise.
5447 (vst3q_u32): Likewise.
5448 (vst3q_u64): Likewise.
5449 (vst3q_f16): Likewise.
5450 (vst3q_f32): Likewise.
5451 (vst3q_f64): Likewise.
5452 (vst3q_p64): Likewise.
5453 (vst4_s64): Likewise.
5454 (vst4_u64): Likewise.
5455 (vst4_f64): Likewise.
5456 (vst4_s8): Likewise.
5457 (vst4_p8): Likewise.
5458 (vst4_s16): Likewise.
5459 (vst4_p16): Likewise.
5460 (vst4_s32): Likewise.
5461 (vst4_u8): Likewise.
5462 (vst4_u16): Likewise.
5463 (vst4_u32): Likewise.
5464 (vst4_f16): Likewise.
5465 (vst4_f32): Likewise.
5466 (vst4_p64): Likewise.
5467 (vst4q_s8): Likewise.
5468 (vst4q_p8): Likewise.
5469 (vst4q_s16): Likewise.
5470 (vst4q_p16): Likewise.
5471 (vst4q_s32): Likewise.
5472 (vst4q_s64): Likewise.
5473 (vst4q_u8): Likewise.
5474 (vst4q_u16): Likewise.
5475 (vst4q_u32): Likewise.
5476 (vst4q_u64): Likewise.
5477 (vst4q_f16): Likewise.
5478 (vst4q_f32): Likewise.
5479 (vst4q_f64): Likewise.
5480 (vst4q_p64): Likewise.
5481 (vtbx4_s8): Likewise.
5482 (vtbx4_u8): Likewise.
5483 (vtbx4_p8): Likewise.
5484 (vld1_bf16_x2): Likewise.
5485 (vld1q_bf16_x2): Likewise.
5486 (vld1_bf16_x3): Likewise.
5487 (vld1q_bf16_x3): Likewise.
5488 (vld1_bf16_x4): Likewise.
5489 (vld1q_bf16_x4): Likewise.
5490 (vld2_bf16): Likewise.
5491 (vld2q_bf16): Likewise.
5492 (vld2_dup_bf16): Likewise.
5493 (vld2q_dup_bf16): Likewise.
5494 (vld3_bf16): Likewise.
5495 (vld3q_bf16): Likewise.
5496 (vld3_dup_bf16): Likewise.
5497 (vld3q_dup_bf16): Likewise.
5498 (vld4_bf16): Likewise.
5499 (vld4q_bf16): Likewise.
5500 (vld4_dup_bf16): Likewise.
5501 (vld4q_dup_bf16): Likewise.
5502 (vst1_bf16_x2): Likewise.
5503 (vst1q_bf16_x2): Likewise.
5504 (vst1_bf16_x3): Likewise.
5505 (vst1q_bf16_x3): Likewise.
5506 (vst1_bf16_x4): Likewise.
5507 (vst1q_bf16_x4): Likewise.
5508 (vst2_bf16): Likewise.
5509 (vst2q_bf16): Likewise.
5510 (vst3_bf16): Likewise.
5511 (vst3q_bf16): Likewise.
5512 (vst4_bf16): Likewise.
5513 (vst4q_bf16): Likewise.
5514 (vld2_lane_bf16): Likewise.
5515 (vld2q_lane_bf16): Likewise.
5516 (vld3_lane_bf16): Likewise.
5517 (vld3q_lane_bf16): Likewise.
5518 (vld4_lane_bf16): Likewise.
5519 (vld4q_lane_bf16): Likewise.
5520 (vst2_lane_bf16): Likewise.
5521 (vst2q_lane_bf16): Likewise.
5522 (vst3_lane_bf16): Likewise.
5523 (vst3q_lane_bf16): Likewise.
5524 (vst4_lane_bf16): Likewise.
5525 (vst4q_lane_bf16): Likewise.
5526 * config/aarch64/geniterators.sh: Modify iterator regex to
5527 match new vector-tuple modes.
5528 * config/aarch64/iterators.md (insn_count): Extend mode
5529 attribute with vector-tuple type information.
5531 (Vendreg): Likewise.
5534 (VSTRUCT_2D): New mode iterator.
5535 (VSTRUCT_2DNX): Likewise.
5536 (VSTRUCT_2DX): Likewise.
5537 (VSTRUCT_2Q): Likewise.
5538 (VSTRUCT_2QD): Likewise.
5539 (VSTRUCT_3D): Likewise.
5540 (VSTRUCT_3DNX): Likewise.
5541 (VSTRUCT_3DX): Likewise.
5542 (VSTRUCT_3Q): Likewise.
5543 (VSTRUCT_3QD): Likewise.
5544 (VSTRUCT_4D): Likewise.
5545 (VSTRUCT_4DNX): Likewise.
5546 (VSTRUCT_4DX): Likewise.
5547 (VSTRUCT_4Q): Likewise.
5548 (VSTRUCT_4QD): Likewise.
5549 (VSTRUCT_D): Likewise.
5550 (VSTRUCT_Q): Likewise.
5551 (VSTRUCT_QD): Likewise.
5552 (VSTRUCT_ELT): New mode attribute.
5553 (vstruct_elt): Likewise.
5554 * genmodes.c (VECTOR_MODE): Add default prefix and order
5556 (VECTOR_MODE_WITH_PREFIX): Define.
5557 (make_vector_mode): Add mode prefix and order parameters.
5559 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5561 * expmed.c (extract_bit_field_1): Ensure modes are tieable.
5563 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5565 * expr.c (emit_group_load_1): Remove historic workaround.
5567 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5569 * config/aarch64/aarch64-builtins.c (aarch64_init_simd_builtins):
5570 Factor out main loop to...
5571 (aarch64_init_simd_builtin_functions): This new function.
5572 (register_tuple_type): Define.
5573 (aarch64_scalar_builtin_type_p): Define.
5574 (handle_arm_neon_h): Define.
5575 * config/aarch64/aarch64-c.c (aarch64_pragma_aarch64): Handle
5576 pragma for arm_neon.h.
5577 * config/aarch64/aarch64-protos.h (aarch64_advsimd_struct_mode_p):
5579 (handle_arm_neon_h): Likewise.
5580 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
5581 Remove static modifier.
5582 * config/aarch64/arm_neon.h (target): Remove Neon vector
5583 structure type definitions.
5585 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5587 PR tree-optimization/102943
5588 * gimple-range-path.cc (path_range_query::range_on_path_entry):
5589 Prefer range_of_expr unless there are no statements in the BB.
5591 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5593 PR tree-optimization/102943
5594 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
5595 Avoid duplicate calculation of paths.
5597 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5599 PR tree-optimization/102943
5600 * gimple-range-path.cc (path_range_query::compute_phi_relations):
5601 Only compute relations for SSA names in the import list.
5602 (path_range_query::compute_outgoing_relations): Same.
5603 * gimple-range-path.h (path_range_query::import_p): New.
5605 2021-11-04 Richard Biener <rguenther@suse.de>
5607 PR rtl-optimization/103075
5608 * simplify-rtx.c (exact_int_to_float_conversion_p): Return
5609 false for a VOIDmode operand.
5611 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5613 * config/aarch64/aarch64.c (aarch64_vector_costs): Make member
5614 variables private and add "m_" to their names. Remove is_loop.
5615 (aarch64_record_potential_advsimd_unrolling): Replace with...
5616 (aarch64_vector_costs::record_potential_advsimd_unrolling): ...this.
5617 (aarch64_analyze_loop_vinfo): Replace with...
5618 (aarch64_vector_costs::analyze_loop_vinfo): ...this.
5619 Move initialization of (m_)vec_flags to add_stmt_cost.
5620 (aarch64_analyze_bb_vinfo): Delete.
5621 (aarch64_count_ops): Replace with...
5622 (aarch64_vector_costs::count_ops): ...this.
5623 (aarch64_vector_costs::add_stmt_cost): Set m_vec_flags,
5624 using m_costing_for_scalar to test whether we're costing
5625 scalar or vector code.
5626 (aarch64_adjust_body_cost_sve): Replace with...
5627 (aarch64_vector_costs::adjust_body_cost_sve): ...this.
5628 (aarch64_adjust_body_cost): Replace with...
5629 (aarch64_vector_costs::adjust_body_cost): ...this.
5630 (aarch64_vector_costs::finish_cost): Use m_vinfo instead of is_loop.
5632 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5634 * target.def (targetm.vectorize.init_cost): Replace with...
5635 (targetm.vectorize.create_costs): ...this.
5636 (targetm.vectorize.add_stmt_cost): Delete.
5637 (targetm.vectorize.finish_cost): Likewise.
5638 (targetm.vectorize.destroy_cost_data): Likewise.
5639 * doc/tm.texi.in (TARGET_VECTORIZE_INIT_COST): Replace with...
5640 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5641 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5642 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5643 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5644 * doc/tm.texi: Regenerate.
5645 * tree-vectorizer.h (vec_info::vec_info): Remove target_cost_data
5647 (vec_info::target_cost_data): Change from a void * to a vector_costs *.
5648 (vector_costs): New class.
5649 (init_cost): Take a vec_info and return a vector_costs.
5650 (dump_stmt_cost): Remove data parameter.
5651 (add_stmt_cost): Replace vinfo and data parameters with a vector_costs.
5652 (add_stmt_costs): Likewise.
5653 (finish_cost): Replace data parameter with a vector_costs.
5654 (destroy_cost_data): Delete.
5655 * tree-vectorizer.c (dump_stmt_cost): Remove data argument and
5657 (vec_info::vec_info): Remove the target_cost_data parameter and
5658 initialize the member variable to null instead.
5659 (vec_info::~vec_info): Delete target_cost_data instead of calling
5661 (vector_costs::add_stmt_cost): New function.
5662 (vector_costs::finish_cost): Likewise.
5663 (vector_costs::record_stmt_cost): Likewise.
5664 (vector_costs::adjust_cost_for_freq): Likewise.
5665 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
5666 call to vec_info::vec_info.
5667 (vect_compute_single_scalar_iteration_cost): Update after above
5668 changes to costing interface.
5669 (vect_analyze_loop_operations): Likewise.
5670 (vect_estimate_min_profitable_iters): Likewise.
5671 (vect_analyze_loop_2): Initialize LOOP_VINFO_TARGET_COST_DATA
5672 at the start_over point, where it needs to be recreated after
5673 trying without slp. Update retry code accordingly.
5674 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Update call
5675 to vec_info::vec_info.
5676 (vect_slp_analyze_operation): Update after above changes to costing
5678 (vect_bb_vectorization_profitable_p): Likewise.
5679 * targhooks.h (default_init_cost): Replace with...
5680 (default_vectorize_create_costs): ...this.
5681 (default_add_stmt_cost): Delete.
5682 (default_finish_cost, default_destroy_cost_data): Likewise.
5683 * targhooks.c (default_init_cost): Replace with...
5684 (default_vectorize_create_costs): ...this.
5685 (default_add_stmt_cost): Delete, moving logic to vector_costs instead.
5686 (default_finish_cost, default_destroy_cost_data): Delete.
5687 * config/aarch64/aarch64.c (aarch64_vector_costs): Inherit from
5688 vector_costs. Add a constructor.
5689 (aarch64_init_cost): Replace with...
5690 (aarch64_vectorize_create_costs): ...this.
5691 (aarch64_add_stmt_cost): Replace with...
5692 (aarch64_vector_costs::add_stmt_cost): ...this. Use record_stmt_cost
5693 to adjust the cost for inner loops.
5694 (aarch64_finish_cost): Replace with...
5695 (aarch64_vector_costs::finish_cost): ...this.
5696 (aarch64_destroy_cost_data): Delete.
5697 (TARGET_VECTORIZE_INIT_COST): Replace with...
5698 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5699 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5700 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5701 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5702 * config/i386/i386.c (ix86_vector_costs): New structure.
5703 (ix86_init_cost): Replace with...
5704 (ix86_vectorize_create_costs): ...this.
5705 (ix86_add_stmt_cost): Replace with...
5706 (ix86_vector_costs::add_stmt_cost): ...this. Use adjust_cost_for_freq
5707 to adjust the cost for inner loops.
5708 (ix86_finish_cost, ix86_destroy_cost_data): Delete.
5709 (TARGET_VECTORIZE_INIT_COST): Replace with...
5710 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5711 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5712 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5713 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5714 * config/rs6000/rs6000.c (TARGET_VECTORIZE_INIT_COST): Replace with...
5715 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5716 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5717 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5718 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5719 (rs6000_cost_data): Inherit from vector_costs.
5720 Add a constructor. Drop loop_info, cost and costing_for_scalar
5721 in favor of the corresponding vector_costs member variables.
5722 Add "m_" to the names of the remaining member variables and
5724 (rs6000_density_test): Replace with...
5725 (rs6000_cost_data::density_test): ...this.
5726 (rs6000_init_cost): Replace with...
5727 (rs6000_vectorize_create_costs): ...this.
5728 (rs6000_update_target_cost_per_stmt): Replace with...
5729 (rs6000_cost_data::update_target_cost_per_stmt): ...this.
5730 (rs6000_add_stmt_cost): Replace with...
5731 (rs6000_cost_data::add_stmt_cost): ...this. Use adjust_cost_for_freq
5732 to adjust the cost for inner loops.
5733 (rs6000_adjust_vect_cost_per_loop): Replace with...
5734 (rs6000_cost_data::adjust_vect_cost_per_loop): ...this.
5735 (rs6000_finish_cost): Replace with...
5736 (rs6000_cost_data::finish_cost): ...this. Group loop code
5737 into a single if statement and pass the loop_vinfo down to
5739 (rs6000_destroy_cost_data): Delete.
5741 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5743 PR tree-optimization/103062
5744 PR tree-optimization/103062
5745 * value-pointer-equiv.cc (ssa_equiv_stack::ssa_equiv_stack):
5746 Increase size of allocation by 1.
5747 (ssa_equiv_stack::push_replacement): Grow as needed.
5748 (ssa_equiv_stack::get_replacement): Same.
5749 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
5750 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Remove delete.
5751 (pointer_equiv_analyzer::set_global_equiv): Grow as needed.
5752 (pointer_equiv_analyzer::get_equiv): Same.
5753 (pointer_equiv_analyzer::get_equiv_expr): Remove const.
5754 * value-pointer-equiv.h (class pointer_equiv_analyzer): Remove
5755 const markers. Use auto_vec instead of tree *.
5757 2021-11-04 Richard Biener <rguenther@suse.de>
5759 * tree-ssa-sccvn.c (vn_nary_op_insert_into): Remove always
5760 true parameter and inline valueization.
5761 (vn_nary_op_lookup_1): Inline valueization from ...
5762 (vn_nary_op_compute_hash): ... here and remove it here.
5763 * tree-ssa-pre.c (phi_translate_1): Do not valueize
5764 before vn_nary_lookup_pieces.
5765 (get_representative_for): Mark created SSA representatives
5768 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5770 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): Assert
5771 that the operand has a vector mode. Use subreg_lowpart_offset
5772 to test whether an index corresponds to the low part.
5774 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5776 * read-rtl.c: Remove dead !GENERATOR_FILE block.
5777 * read-rtl-function.c (function_reader::consolidate_singletons):
5778 Generate canonical CONST_VECTORs.
5780 2021-11-04 liuhongt <hongtao.liu@intel.com>
5783 * config/i386/predicates.md (reg_or_notreg_operand): Rename to ..
5784 (regmem_or_bitnot_regmem_operand): .. and extend to handle
5786 * config/i386/sse.md (*<avx512>_vpternlog<mode>_1): Force_reg
5787 the operands which are required to be register_operand.
5788 (*<avx512>_vpternlog<mode>_2): Ditto.
5789 (*<avx512>_vpternlog<mode>_3): Ditto.
5790 (*<avx512>_vternlog<mode>_all): Disallow embeded broadcast for
5791 vector HFmodes since it's not a real AVX512FP16 instruction.
5793 2021-11-04 liuhongt <hongtao.liu@intel.com>
5796 * match.pd: simplify (trunc)copysign((extend)a, (extend)b) to
5797 .COPYSIGN (a,b) when a and b are same type as the truncation
5798 type and has less precision than extend type.
5800 2021-11-04 Richard Biener <rguenther@suse.de>
5802 * doc/generic.texi: Update TARGET_MEM_REF and MEM_REF
5805 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
5807 * config/i386/sse.md (VI2_AVX512VNNIBW): New mode iterator.
5808 (VI1_AVX512VNNI): Likewise.
5809 (SDOT_VPDP_SUF): New mode_attr.
5812 (sdot_prod<mode>): Use VI2_AVX512F iterator, expand to
5813 vpdpwssd when VNNI targets available.
5814 (usdot_prod<mode>): New expander for vector QImode.
5816 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
5818 * config/i386/amxtileintrin.h (_tile_loadd_internal): Add
5819 parentheses to base and stride.
5820 (_tile_stream_loadd_internal): Likewise.
5821 (_tile_stored_internal): Likewise.
5823 2021-11-03 Maciej W. Rozycki <macro@embecosm.com>
5825 * config/riscv/riscv.c (riscv_class_max_nregs): Swap the
5826 arguments to `reg_class_subset_p'.
5828 2021-11-03 Joseph Myers <joseph@codesourcery.com>
5831 * fold-const.c (fold_init): New function.
5832 * fold-const.h (fold_init): New prototype.
5834 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5836 * params.opt (param_vrp2_mode): Make ranger the default for VRP2.
5838 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5840 * gimple-range-fold.cc (fold_using_range::range_of_phi): Don't import
5841 a range from edge if arg == phidef.
5843 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5845 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Test
5846 for constant before any other processing.
5848 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5850 * flag-types.h (RANGER_DEBUG_ALL): Fix values.
5852 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5854 * gimple-range.cc (gimple_ranger::gimple_ranger): Initialize current_bb.
5855 (gimple_ranger::range_of_expr): Pick up range_on_entry when there is
5856 no explcit context and current_bb is set.
5857 (gimple_ranger::fold_stmt): New.
5858 * gimple-range.h (current_bb, fold_stmt): New.
5859 * tree-vrp.c (rvrp_folder::fold_stmt): Call ranger's fold_stmt.
5861 2021-11-03 Richard Biener <rguenther@suse.de>
5863 PR tree-optimization/102970
5864 * tree-ssa-pre.c (phi_translate_1): Drop clique and base
5865 when translating a MEM_REF over a backedge.
5867 2021-11-03 Philipp Tomsich <philipp.tomsich@vrull.eu>
5869 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New Ampere-1 core.
5870 * config/aarch64/aarch64-tune.md: Regenerate.
5871 * config/aarch64/aarch64-cost-tables.h: Add extra costs for Ampere-1.
5872 * config/aarch64/aarch64.c: Add tuning structures for Ampere-1.
5873 * doc/invoke.texi: Add documentation for Ampere-1 core.
5875 2021-11-03 Wilco Dijkstra <wdijkstr@arm.com>
5877 * config/aarch64/aarch64.md (movsi): Add alternative for GOT accesses.
5879 (ldr_got_small_<mode>): Remove pattern.
5880 (ldr_got_small_sidi): Likewise.
5881 * config/aarch64/aarch64.c (aarch64_load_symref_appropriately): Keep
5882 GOT accesses as moves.
5883 (aarch64_print_operand): Correctly print got_lo12 in L specifier.
5884 (aarch64_mov_operand_p): Make GOT accesses valid move operands.
5885 * config/aarch64/constraints.md: Add new constraint Usw for GOT access.
5887 2021-11-03 Martin Liska <mliska@suse.cz>
5889 * gcov.c (read_line): Remove dead variable.
5891 2021-11-03 Martin Liska <mliska@suse.cz>
5894 * ipa-fnsummary.c (edge_predicate_pool): Rename predicate class to ipa_predicate.
5895 (ipa_fn_summary::account_size_time): Likewise.
5896 (edge_set_predicate): Likewise.
5897 (set_hint_predicate): Likewise.
5898 (add_freqcounting_predicate): Likewise.
5899 (evaluate_conditions_for_known_args): Likewise.
5900 (evaluate_properties_for_edge): Likewise.
5901 (remap_freqcounting_preds_after_dup): Likewise.
5902 (ipa_fn_summary_t::duplicate): Likewise.
5903 (set_cond_stmt_execution_predicate): Likewise.
5904 (set_switch_stmt_execution_predicate): Likewise.
5905 (compute_bb_predicates): Likewise.
5906 (will_be_nonconstant_expr_predicate): Likewise.
5907 (will_be_nonconstant_predicate): Likewise.
5908 (phi_result_unknown_predicate): Likewise.
5909 (predicate_for_phi_result): Likewise.
5910 (analyze_function_body): Likewise.
5911 (compute_fn_summary): Likewise.
5912 (summarize_calls_size_and_time): Likewise.
5913 (estimate_calls_size_and_time): Likewise.
5914 (ipa_call_context::estimate_size_and_time): Likewise.
5915 (remap_edge_summaries): Likewise.
5916 (remap_freqcounting_predicate): Likewise.
5917 (ipa_merge_fn_summary_after_inlining): Likewise.
5918 (ipa_update_overall_fn_summary): Likewise.
5919 (read_ipa_call_summary): Likewise.
5920 (inline_read_section): Likewise.
5921 * ipa-fnsummary.h (struct ipa_freqcounting_predicate): Likewise.
5922 * ipa-predicate.c (predicate::add_clause): Likewise.
5923 (ipa_predicate::add_clause): Likewise.
5924 (predicate::or_with): Likewise.
5925 (ipa_predicate::or_with): Likewise.
5926 (predicate::evaluate): Likewise.
5927 (ipa_predicate::evaluate): Likewise.
5928 (predicate::probability): Likewise.
5929 (ipa_predicate::probability): Likewise.
5930 (dump_condition): Likewise.
5931 (dump_clause): Likewise.
5932 (predicate::dump): Likewise.
5933 (ipa_predicate::dump): Likewise.
5934 (predicate::debug): Likewise.
5935 (ipa_predicate::debug): Likewise.
5936 (predicate::remap_after_duplication): Likewise.
5937 (ipa_predicate::remap_after_duplication): Likewise.
5938 (predicate::remap_after_inlining): Likewise.
5939 (ipa_predicate::remap_after_inlining): Likewise.
5940 (predicate::stream_in): Likewise.
5941 (ipa_predicate::stream_in): Likewise.
5942 (predicate::stream_out): Likewise.
5943 (ipa_predicate::stream_out): Likewise.
5944 (add_condition): Likewise.
5945 * ipa-predicate.h (class predicate): Likewise.
5946 (class ipa_predicate): Likewise.
5947 (add_condition): Likewise.
5949 2021-11-03 Richard Biener <rguenther@suse.de>
5951 * bitmap.h (bitmap_bit_p): Change the return type to bool.
5952 * bitmap.c (bitmap_bit_p): Likewise.
5953 * sbitmap.h (bitmap_bit_p): Likewise.
5954 (bitmap_set_bit): Return whether the bit changed.
5955 (bitmap_clear_bit): Likewise.
5956 * tree-ssa.c (verify_vssa): Make use of the changed state
5957 from bitmap_set_bit.
5959 2021-11-03 Richard Biener <rguenther@suse.de>
5961 PR middle-end/103033
5962 * internal-fn.c (expand_DEFERRED_INIT): Elide the
5963 native_interpret_expr path in favor of folding the
5964 VIEW_CONVERT_EXPR generated when punning the RHS.
5966 2021-11-03 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
5968 * config/s390/s390.c (s390_loop_unroll_adjust): In case of early
5971 2021-11-03 Jan Hubicka <hubicka@ucw.cz>
5974 * ipa-modref.c (callee_to_caller_flags): New function.
5975 (modref_eaf_analysis::analyze_ssa_name): Use it.
5976 (ipa_merge_modref_summary_after_inlining): Fix whitespace.
5978 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5980 * ipa-modref.c (modref_eaf_analysis::analyze_ssa_name): Revert
5983 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
5984 Uroš Bizjak <ubizjak@gmail.com>
5986 * config/i386/i386.md (<any_rotate>ti3): Provide expansion for
5987 rotations by non-constant amounts.
5989 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5991 * ipa-modref.c: Fix anonymous namespace placement.
5992 (class modref_eaf_analysis): New class.
5993 (analyze_ssa_name_flags): Turn to ...
5994 (modref_eaf_analysis::analyze_ssa_name): ... this one.
5995 (merge_call_lhs_flags): Turn to ...
5996 (modref_eaf_analysis::merge_call_lhs_flags): .. this one
5997 (modref_eaf_analysis::merge_with_ssa_name): New member function.
5998 (record_escape_points): Turn to ...
5999 (modref_eaf_analysis::record_escape_points): ... this one.
6000 (analyze_parms): Updat
6001 (ipa_merge_modref_summary_after_inlining): Move to the end of file.
6003 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
6005 * gimple.c (gimple_call_static_chain_flags): New function.
6006 * gimple.h (gimple_call_static_chain_flags): Declare
6007 * ipa-modref.c (modref_summary::modref_summary): Initialize
6009 (modref_summary_lto::modref_summary_lto): Likewise.
6010 (modref_summary::useful_p): Test static_chain_flags.
6011 (modref_summary_lto::useful_p): Likewise.
6012 (struct modref_summary_lto): Add static_chain_flags.
6013 (modref_summary::dump): Dump static_chain_flags.
6014 (modref_summary_lto::dump): Likewise.
6015 (struct escape_point): Add static_cahin_arg.
6016 (analyze_ssa_name_flags): Use gimple_call_static_chain_flags.
6017 (analyze_parms): Handle static chains.
6018 (modref_summaries::duplicate): Duplicate static_chain_flags.
6019 (modref_summaries_lto::duplicate): Likewise.
6020 (modref_write): Stream static_chain_flags.
6021 (read_section): Likewise.
6022 (modref_merge_call_site_flags): Handle static_chain_flags.
6023 * ipa-modref.h (struct modref_summary): Add static_chain_flags.
6024 * tree-ssa-structalias.c (handle_rhs_call): Use
6025 gimple_static_chain_flags.
6027 2021-11-02 Richard Biener <rguenther@suse.de>
6029 PR tree-optimization/103029
6030 * tree-vect-loop-manip.c (vect_loop_versioning): Ensure
6031 the PHI nodes in the loop maintain their original operand
6034 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
6036 * tree-core.h (EAF_NOT_RETURNED_DIRECTLY): New flag.
6037 (EAF_NOREAD): Renumber.
6038 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOT_RETURNED_DIRECTLY.
6039 (remove_useless_eaf_flags): Handle EAF_NOT_RETURNED_DIRECTLY
6040 (deref_flags): Likewise.
6041 (modref_lattice::init): Likewise.
6042 (modref_lattice::merge): Likewise.
6043 (merge_call_lhs_flags): Likewise.
6044 (analyze_ssa_name_flags): Likewise.
6045 (modref_merge_call_site_flags): Likewise.
6046 * tree-ssa-structalias.c (handle_call_arg): Likewise.
6048 2021-11-02 Maciej W. Rozycki <macro@embecosm.com>
6050 * config/riscv/riscv.c (riscv_rtx_costs): Correct a CONST_INT_P
6051 check and remove an unused local variable with shNadd/shNadd.uw
6054 2021-11-02 Tamar Christina <tamar.christina@arm.com>
6056 PR tree-optimization/103007
6057 * tree-vect-slp-patterns.c (complex_fms_pattern::matches): Add elem
6060 2021-11-02 Richard Biener <rguenther@suse.de>
6062 PR middle-end/103038
6063 * fold-const.c (native_interpret_expr): Handle OFFSET_TYPE.
6064 (can_native_interpret_type_p): Likewise.
6065 * internal-fn.c (expand_DEFERRED_INIT): View-convert the
6066 RHS if the LHS is an SSA name.
6068 2021-11-02 Richard Sandiford <richard.sandiford@arm.com>
6070 * langhooks.h (lang_hooks_for_types::simulate_record_decl): New hook.
6071 * langhooks-def.h (lhd_simulate_record_decl): Declare.
6072 (LANG_HOOKS_SIMULATE_RECORD_DECL): Define.
6073 (LANG_HOOKS_FOR_TYPES_INITIALIZER): Include it.
6074 * langhooks.c (lhd_simulate_record_decl): New function.
6076 2021-11-02 Jakub Jelinek <jakub@redhat.com>
6079 * config/i386/i386.c (ix86_vector_mode_supported_p): Reject vector
6080 modes with TImode inner mode if 32-bit.
6082 2021-11-02 liuhongt <hongtao.liu@intel.com>
6084 * doc/sourcebuild.texi (vect_slp_v4qi_store_unalign,
6085 vect_slp_v2hi_store_unalign, vect_slp_v4hi_store_unalign,
6086 vect_slp_v4si_store_unalign): Document efficient target.
6087 (vect_slp_v4qi_store_unalign_1, vect_slp_v8qi_store_unalign_1,
6088 vect_slp_v16qi_store_unalign_1): Ditto.
6089 (vect_slp_v2hi_store_align,vect_slp_v2qi_store_align,
6090 vect_slp_v2si_store_align, vect_slp_v4qi_store_align): Ditto.
6091 (struct_4char_block_move, struct_8char_block_move,
6092 struct_16char_block_move): Ditto.
6094 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
6095 Jakub Jelinek <jakub@redhat.com>
6098 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti,
6099 ix86_expand_ti_to_v1ti): New helper functions.
6100 (ix86_expand_v1ti_shift): Check if the amount operand is an
6101 integer constant, and expand as a TImode shift if it isn't.
6102 (ix86_expand_v1ti_rotate): Check if the amount operand is an
6103 integer constant, and expand as a TImode rotate if it isn't.
6104 (ix86_expand_v1ti_ashiftrt): New function to expand arithmetic
6105 right shifts of V1TImode quantities.
6106 * config/i386/i386-protos.h (ix86_expand_v1ti_ashift): Prototype.
6107 * config/i386/sse.md (ashlv1ti3, lshrv1ti3): Change constraints
6108 to QImode general_operand, and let the helper functions lower
6109 shifts by non-constant operands, as TImode shifts. Make
6110 conditional on TARGET_64BIT.
6111 (ashrv1ti3): New expander calling ix86_expand_v1ti_ashiftrt.
6112 (rotlv1ti3, rotrv1ti3): Change shift operand to QImode.
6113 Make conditional on TARGET_64BIT.
6115 2021-11-02 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
6117 * config/s390/s390.md ("*cc_to_int", "tabort", "*tabort_1",
6118 "*tabort_1_plus"): Remove operands non-null check.
6120 2021-11-01 Martin Liska <mliska@suse.cz>
6122 * opt-functions.awk: Add new sanity checking.
6123 * optc-gen.awk: Add new argument to integer_range_info.
6124 * params.opt: Update 2 params which have negative IntegerRange.
6126 2021-11-01 qing zhao <qing.zhao@oracle.com>
6128 * gimplify.c (gimplify_decl_expr): Do not add call to
6129 __builtin_clear_padding when a variable is a gimple register
6130 or it might not have padding.
6131 (gimplify_init_constructor): Likewise.
6133 2021-11-01 Tamar Christina <tamar.christina@arm.com>
6135 * config/arm/aarch-common-protos.h (struct vector_cost_table): Add
6136 movi, dup and extract costing fields.
6137 * config/aarch64/aarch64-cost-tables.h (qdf24xx_extra_costs,
6138 thunderx_extra_costs, thunderx2t99_extra_costs,
6139 thunderx3t110_extra_costs, tsv110_extra_costs, a64fx_extra_costs): Use
6141 * config/arm/aarch-cost-tables.h (generic_extra_costs,
6142 cortexa53_extra_costs, cortexa57_extra_costs, cortexa76_extra_costs,
6143 exynosm1_extra_costs, xgene1_extra_costs): Likewise
6144 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>): Add r->w dup.
6145 * config/aarch64/aarch64.c (aarch64_rtx_costs): Add extra costs.
6147 2021-11-01 Tamar Christina <tamar.christina@arm.com>
6149 * cse.c (add_to_set): New.
6150 (find_sets_in_insn): Register constants in sets.
6151 (canonicalize_insn): Use auto_vec instead.
6152 (cse_insn): Try materializing using vec_dup.
6153 * rtl.h (simplify_context::simplify_gen_vec_select,
6154 simplify_gen_vec_select): New.
6155 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): New.
6157 2021-11-01 David Malcolm <dmalcolm@redhat.com>
6159 * common.opt (fdiagnostics-escape-format=): New.
6160 (diagnostics_escape_format): New enum.
6161 (DIAGNOSTICS_ESCAPE_FORMAT_UNICODE): New enum value.
6162 (DIAGNOSTICS_ESCAPE_FORMAT_BYTES): Likewise.
6163 * diagnostic-format-json.cc (json_end_diagnostic): Add
6164 "escape-source" attribute.
6165 * diagnostic-show-locus.c
6166 (exploc_with_display_col::exploc_with_display_col): Replace
6167 "tabstop" param with a cpp_char_column_policy and add an "aspect"
6168 param. Use these to compute m_display_col accordingly.
6169 (struct char_display_policy): New struct.
6170 (layout::m_policy): New field.
6171 (layout::m_escape_on_output): New field.
6172 (def_policy): New function.
6173 (make_range): Update for changes to exploc_with_display_col ctor.
6174 (default_print_decoded_ch): New.
6175 (width_per_escaped_byte): New.
6176 (escape_as_bytes_width): New.
6177 (escape_as_bytes_print): New.
6178 (escape_as_unicode_width): New.
6179 (escape_as_unicode_print): New.
6181 (layout::layout): Initialize new fields. Update m_exploc ctor
6182 call for above change to ctor.
6183 (layout::maybe_add_location_range): Update for changes to
6184 exploc_with_display_col ctor.
6185 (layout::calculate_x_offset_display): Update for change to
6187 (layout::print_source_line): Pass policy
6188 to cpp_display_width_computation. Capture cpp_decoded_char when
6189 calling process_next_codepoint. Move printing of source code to
6190 m_policy.m_print_cb.
6191 (line_label::line_label): Pass in policy rather than context.
6192 (layout::print_any_labels): Update for change to line_label ctor.
6193 (get_affected_range): Pass in policy rather than context, updating
6194 calls to location_compute_display_column accordingly.
6195 (get_printed_columns): Likewise, also for cpp_display_width.
6196 (correction::correction): Pass in policy rather than tabstop.
6197 (correction::compute_display_cols): Pass m_policy rather than
6198 m_tabstop to cpp_display_width.
6199 (correction::m_tabstop): Replace with...
6200 (correction::m_policy): ...this.
6201 (line_corrections::line_corrections): Pass in policy rather than
6203 (line_corrections::m_context): Replace with...
6204 (line_corrections::m_policy): ...this.
6205 (line_corrections::add_hint): Update to use m_policy rather than
6207 (line_corrections::add_hint): Likewise.
6208 (layout::print_trailing_fixits): Likewise.
6209 (selftest::test_display_widths): New.
6210 (selftest::test_layout_x_offset_display_utf8): Update to use
6211 policy rather than tabstop.
6212 (selftest::test_one_liner_labels_utf8): Add test of escaping
6214 (selftest::test_diagnostic_show_locus_one_liner_utf8): Update to
6215 use policy rather than tabstop.
6216 (selftest::test_overlapped_fixit_printing): Likewise.
6217 (selftest::test_overlapped_fixit_printing_utf8): Likewise.
6218 (selftest::test_overlapped_fixit_printing_2): Likewise.
6219 (selftest::test_tab_expansion): Likewise.
6220 (selftest::test_escaping_bytes_1): New.
6221 (selftest::test_escaping_bytes_2): New.
6222 (selftest::diagnostic_show_locus_c_tests): Call the new tests.
6223 * diagnostic.c (diagnostic_initialize): Initialize
6224 context->escape_format.
6225 (convert_column_unit): Update to use default character width policy.
6226 (selftest::test_diagnostic_get_location_text): Likewise.
6227 * diagnostic.h (enum diagnostics_escape_format): New enum.
6228 (diagnostic_context::escape_format): New field.
6229 * doc/invoke.texi (-fdiagnostics-escape-format=): New option.
6230 (-fdiagnostics-format=): Add "escape-source" attribute to examples
6231 of JSON output, and document it.
6232 * input.c (location_compute_display_column): Pass in "policy"
6233 rather than "tabstop", passing to
6234 cpp_byte_column_to_display_column.
6235 (selftest::test_cpp_utf8): Update to use cpp_char_column_policy.
6236 * input.h (class cpp_char_column_policy): New forward decl.
6237 (location_compute_display_column): Pass in "policy" rather than
6239 * opts.c (common_handle_option): Handle
6240 OPT_fdiagnostics_escape_format_.
6241 * selftest.c (temp_source_file::temp_source_file): New ctor
6242 overload taking a size_t.
6243 * selftest.h (temp_source_file::temp_source_file): Likewise.
6245 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
6247 * dbgcnt.def: Add debug counter for back_thread[12] and
6248 back_threadfull[12].
6249 * passes.def: Pass "first" argument to each back threading pass.
6250 * tree-ssa-threadbackward.c (back_threader::back_threader): Add
6252 (back_threader::debug_counter): New.
6253 (back_threader::maybe_register_path): Call debug_counter.
6255 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
6257 * tree-ssa-threadbackward.c (BT_NONE): New.
6260 (back_threader::back_threader): Add flags.
6261 Move loop initialization here.
6262 (back_threader::~back_threader): New.
6263 (back_threader::find_taken_edge_switch): Change solver and ranger
6265 (back_threader::find_taken_edge_cond): Same.
6266 (back_threader::find_paths_to_names): Same.
6267 (back_threader::find_paths): Same.
6268 (back_threader::dump): Same.
6269 (try_thread_blocks): Merge into thread_blocks.
6270 (back_threader::thread_blocks): New.
6271 (do_early_thread_jumps): Merge into thread_blocks.
6272 (do_thread_jumps): Merge into thread_blocks.
6273 (back_threader::thread_through_all_blocks): Remove.
6275 2021-11-01 Andrew MacLeod <amacleod@redhat.com>
6277 PR tree-optimization/103003
6278 * value-relation.cc (dom_oracle::register_relation): If the 2
6279 ssa names are the same, don't register any relation.
6281 2021-11-01 Dan Li <ashimida@linux.alibaba.com>
6283 * config/aarch64/aarch64.c (aarch64_expand_epilogue): Remove
6284 redundant check for calls_eh_return.
6285 * config/aarch64/aarch64.md (*do_return): Likewise.
6287 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
6289 * cfghooks.c (cfg_hook_duplicate_loop_to_header_edge): Rename
6290 duplicate_loop_to_header_edge to
6291 duplicate_loop_body_to_header_edge.
6292 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
6293 * cfghooks.h (struct cfg_hooks): Likewise.
6294 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
6295 * cfgloopmanip.c (duplicate_loop_body_to_header_edge): Likewise.
6296 (clone_loop_to_header_edge): Likewise.
6297 * cfgloopmanip.h (duplicate_loop_body_to_header_edge): Likewise.
6298 * cfgrtl.c (struct cfg_hooks): Likewise.
6299 * doc/loop.texi: Likewise.
6300 * loop-unroll.c (unroll_loop_constant_iterations): Likewise.
6301 (unroll_loop_runtime_iterations): Likewise.
6302 (unroll_loop_stupid): Likewise.
6303 (apply_opt_in_copies): Likewise.
6304 * tree-cfg.c (struct cfg_hooks): Likewise.
6305 * tree-ssa-loop-ivcanon.c (try_unroll_loop_completely): Likewise.
6306 (try_peel_loop): Likewise.
6307 * tree-ssa-loop-manip.c (copy_phi_node_args): Likewise.
6308 (gimple_duplicate_loop_body_to_header_edge): Likewise.
6309 (tree_transform_and_unroll_loop): Likewise.
6310 * tree-ssa-loop-manip.h (gimple_duplicate_loop_body_to_header_edge):
6313 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
6315 * cfgloopmanip.c (loop_version): Refactor loopify to
6316 loop_version. Move condition generation after loopify.
6318 * cfgloopmanip.h (loopify): Delete.
6320 2021-10-31 Jan Hubicka <hubicka@ucw.cz>
6322 * ipa-fnsummary.c: Include tree-dfa.h.
6323 (points_to_local_or_readonly_memory_p): Return true on return
6325 * ipa-modref.c (analyze_ssa_name_flags): Fix handling of copy
6328 2021-10-30 Tobias Burnus <tobias@codesourcery.com>
6330 PR middle-end/102972
6331 * omp-low.c (omp_runtime_api_call): Use DECL_ASSEMBLER_NAME to get
6332 internal Fortran name; new permit_num_teams arg to permit
6333 omp_get_num_teams and omp_get_team_num.
6334 (scan_omp_1_stmt): Update call to it, add missing call for
6335 reverse offload, and check for strictly nested API calls in teams.
6337 2021-10-30 Jakub Jelinek <jakub@redhat.com>
6339 * gimplify.c (gimplify_omp_for): Diagnose threadprivate iterators.
6341 2021-10-29 David Malcolm <dmalcolm@redhat.com>
6343 * selftest.c (assert_streq): Add newlines when emitting non-equal
6346 2021-10-29 David Malcolm <dmalcolm@redhat.com>
6348 * Makefile.in: Fix syntax for reference to LIBDEPS in
6351 2021-10-29 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
6353 * doc/install.texi: Bump required minimum DejaGnu version.
6355 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
6356 Andrew MacLeod <amacleod@redhat.com>
6358 * value-relation.cc (path_oracle::killing_def): Add a
6359 self-equivalence so we don't look to the root oracle.
6361 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
6363 * passes.def: Replace the pass_thread_jumps before VRP* with
6364 pass_thread_jumps_full. Remove all pass_vrp_threader instances.
6365 * tree-ssa-threadbackward.c (pass_data_thread_jumps_full):
6366 Remove hyphen from "thread-full" name.
6368 2021-10-29 Andrew MacLeod <amacleod@redhat.com>
6370 PR tree-optimization/102983
6371 * gimple-range-cache.h (propagate_updated_value): Make public.
6372 * gimple-range.cc (gimple_ranger::range_of_stmt): Propagate exports
6373 when processing gcond stmts.
6375 2021-10-29 Jan Hubicka <hubicka@ucw.cz>
6377 * gimple.c (gimple_call_retslot_flags): New function.
6378 * gimple.h (gimple_call_retslot_flags): Declare.
6379 * ipa-modref.c: Include tree-cfg.h.
6380 (struct escape_entry): Turn parm_index to signed.
6381 (modref_summary_lto::modref_summary_lto): Add retslot_flags.
6382 (modref_summary::modref_summary): Initialize retslot_flags.
6383 (struct modref_summary_lto): Likewise.
6384 (modref_summary::useful_p): Check retslot_flags.
6385 (modref_summary_lto::useful_p): Likewise.
6386 (modref_summary::dump): Dump retslot_flags.
6387 (modref_summary_lto::dump): Likewise.
6388 (struct escape_point): Add hidden_args enum.
6389 (analyze_ssa_name_flags): Ignore return slot return;
6390 use gimple_call_retslot_flags.
6391 (record_escape_points): Break out from ...
6392 (analyze_parms): ... here; handle retslot_flags.
6393 (modref_summaries::duplicate): Duplicate retslot_flags.
6394 (modref_summaries_lto::duplicate): Likewise.
6395 (modref_write_escape_summary): Stream parm_index as signed.
6396 (modref_read_escape_summary): Likewise.
6397 (modref_write): Stream retslot_flags.
6398 (read_section): Likewise.
6399 (struct escape_map): Fix typo in comment.
6400 (update_escape_summary_1): Fix whitespace.
6401 (ipa_merge_modref_summary_after_inlining): Drop retslot_flags.
6402 (modref_merge_call_site_flags): Merge retslot_flags.
6403 * ipa-modref.h (struct modref_summary): Add retslot_flags.
6404 * tree-ssa-structalias.c (handle_rhs_call): Handle retslot_flags.
6406 2021-10-29 Tamar Christina <tamar.christina@arm.com>
6408 PR tree-optimization/102977
6409 * tree-vect-slp-patterns.c (vect_match_call_p): Remove.
6410 (vect_detect_pair_op): Add crosslane check.
6411 (vect_match_call_complex_mla): Remove.
6412 (class complex_mul_pattern): Update comment.
6413 (complex_mul_pattern::matches): Update detection.
6414 (class complex_fma_pattern): Remove.
6415 (complex_fma_pattern::matches): Remove.
6416 (complex_fma_pattern::recognize): Remove.
6417 (complex_fma_pattern::build): Remove.
6418 (class complex_fms_pattern): Update comment.
6419 (complex_fms_pattern::matches): Remove.
6420 (complex_operations_pattern::recognize): Remove complex_fma_pattern
6422 2021-10-29 Jakub Jelinek <jakub@redhat.com>
6424 * gimple-fold.c (gimple_fold_builtin_memset): Copy over location from
6427 2021-10-29 Xionghu Luo <luoxhu@linux.ibm.com>
6430 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const): Add
6431 patterns match and emit for VSX xxpermdi.
6433 2021-10-29 liuhongt <hongtao.liu@intel.com>
6436 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF): New
6438 (V16HF_FTYPE_V16HF): Ditto.
6439 (V32HF_FTYPE_V32HF): Ditto.
6440 (V8HF_FTYPE_V8HF_ROUND): Ditto.
6441 (V16HF_FTYPE_V16HF_ROUND): Ditto.
6442 (V32HF_FTYPE_V32HF_ROUND): Ditto.
6443 * config/i386/i386-builtin.def ( IX86_BUILTIN_FLOORPH,
6444 IX86_BUILTIN_CEILPH, IX86_BUILTIN_TRUNCPH,
6445 IX86_BUILTIN_FLOORPH256, IX86_BUILTIN_CEILPH256,
6446 IX86_BUILTIN_TRUNCPH256, IX86_BUILTIN_FLOORPH512,
6447 IX86_BUILTIN_CEILPH512, IX86_BUILTIN_TRUNCPH512): New builtin.
6448 * config/i386/i386-builtins.c
6449 (ix86_builtin_vectorized_function): Enable vectorization for
6450 HFmode FLOOR/CEIL/TRUNC operation.
6451 * config/i386/i386-expand.c (ix86_expand_args_builtin): Handle
6453 * config/i386/sse.md (rint<mode>2, nearbyint<mode>2): Extend
6456 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6457 Andrew MacLeod <amacleod@redhat.com>
6459 * value-relation.cc (path_oracle::killing_def): Walk the
6460 equivalency list and remove SSA from any equivalencies.
6462 2021-10-28 Stafford Horne <shorne@gmail.com>
6464 * config/or1k/or1k.h (PROFILE_HOOK): Add return address argument
6467 2021-10-28 Jakub Jelinek <jakub@redhat.com>
6469 PR tree-optimization/102951
6470 * fold-const.h (address_compare): Declare.
6471 * fold-const.c (address_compare): New function.
6472 * match.pd (cmp (convert1?@2 addr@0) (convert2? addr@1)): Use
6473 address_compare helper.
6474 (minmax cmp (convert1?@2 addr@0) (convert2?@3 addr@1)): New
6477 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
6479 * vr-values.c (simplify_using_ranges::fold_cond): Change fold message.
6481 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
6483 PR tree-optimization/102940
6484 * tree-vrp.c (execute_ranger_vrp): Reset scev.
6486 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
6488 * config/nios2/linux.h (MUSL_DYNAMIC_LINKER): Add musl linker
6490 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
6492 * configure: Regenerate.
6493 * configure.ac: Use CPPFLAGS_FOR_BUILD for GMPINC
6495 2021-10-28 Eric Botcazou <ebotcazou@adacore.com>
6497 * doc/invoke.texi (%X): Remove obsolete reference to -Wl.
6499 2021-10-28 Richard Biener <rguenther@suse.de>
6502 * fold-const.c (fold_convert_const): Avoid int to float
6503 constant folding with -frounding-math and inexact result.
6504 * simplify-rtx.c (simplify_const_unary_operation): Likewise
6505 for both float and unsigned_float.
6507 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6509 * tree-ssa-threadbackward.c
6510 (back_threader::find_taken_edge_switch): Use find_case_label_range
6511 instead of find_taken_edge.
6513 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6515 * tree-ssa-threadbackward.c (class back_threader_registry):
6516 Inherit from back_jt_path_registry.
6517 (back_threader_registry::thread_through_all_blocks): Remove.
6518 (back_threader_registry::register_path): Remove
6519 m_lowlevel_registry prefix.
6521 2021-10-28 Richard Biener <rguenther@suse.de>
6524 * fold-const.c (fold_convert_const_real_from_real): Honor
6525 -frounding-math if the conversion is not exact.
6526 * simplify-rtx.c (simplify_const_unary_operation): Do not
6527 simplify FLOAT_TRUNCATE with sign dependent rounding.
6529 2021-10-28 Richard Biener <rguenther@suse.de>
6531 PR tree-optimization/102949
6532 * tree-vect-stmts.c (ensure_base_align): Look at the
6533 dr_info of a group leader and assert we are looking at
6534 one with analyzed alignment.
6536 2021-10-28 Kewen Lin <linkw@linux.ibm.com>
6539 * config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Consider
6540 V1T1 mode for unaligned load and store.
6542 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
6544 * config/riscv/riscv.md (zero_extendsidi2_internal): Allow ZBB
6547 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
6549 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add `i` to
6552 2021-10-28 Alexandre Oliva <oliva@adacore.com>
6554 * common.opt (fharden-compares): New.
6555 (fharden-conditional-branches): New.
6556 * doc/invoke.texi: Document new options.
6557 * gimple-harden-conditionals.cc: New.
6558 * Makefile.in (OBJS): Build it.
6559 * passes.def: Add new passes.
6560 * tree-pass.h (make_pass_harden_compares): Declare.
6561 (make_pass_harden_conditional_branches): Declare.
6563 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
6566 * config/rs6000/altivec.md: Add vsx register constraints.
6567 * config/rs6000/vsx.md (vsx_xxsel<mode>): Delete.
6568 (vsx_xxsel<mode>2): Likewise.
6569 (vsx_xxsel<mode>3): Likewise.
6570 (vsx_xxsel<mode>4): Likewise.
6572 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
6575 * config/rs6000/altivec.md (*altivec_vsel<mode>): Change to ...
6576 (altivec_vsel<mode>): ... this and update define.
6577 (*altivec_vsel<mode>_uns): Delete.
6578 (altivec_vsel<mode>2): New define_insn.
6579 (altivec_vsel<mode>3): Likewise.
6580 (altivec_vsel<mode>4): Likewise.
6581 * config/rs6000/rs6000-call.c (altivec_expand_vec_sel_builtin): New.
6582 (altivec_expand_builtin): Call altivec_expand_vec_sel_builtin to expand
6584 * config/rs6000/rs6000.c (rs6000_emit_vector_cond_expr): Use bit-wise
6585 selection instead of per element.
6586 * config/rs6000/vector.md:
6587 * config/rs6000/vsx.md (*vsx_xxsel<mode>): Change to ...
6588 (vsx_xxsel<mode>): ... this and update define.
6589 (*vsx_xxsel<mode>_uns): Delete.
6590 (vsx_xxsel<mode>2): New define_insn.
6591 (vsx_xxsel<mode>3): Likewise.
6592 (vsx_xxsel<mode>4): Likewise.
6594 2021-10-28 Hongyu Wang <hongyu.wang@intel.com>
6596 * config/i386/i386.c (use_rsqrt_p): Add mode parameter, enable
6597 HFmode rsqrt without TARGET_SSE_MATH.
6598 (ix86_optab_supported_p): Refactor rint, adjust floor, ceil,
6599 btrunc condition to be restricted by -ftrapping-math, adjust
6600 use_rsqrt_p function call.
6601 * config/i386/i386.md (rcphf2): New define_insn.
6602 (rsqrthf2): Likewise.
6603 * config/i386/sse.md (div<mode>3): Change VF2H to VF2.
6604 (div<mode>3): New expander for HF mode.
6605 (rsqrt<mode>2): Likewise.
6606 (*avx512fp16_vmrcpv8hf2): New define_insn for rpad pass.
6607 (*avx512fp16_vmrsqrtv8hf2): Likewise.
6609 2021-10-27 Saagar Jha <saagar@saagarjha.com>
6611 * config.gcc: Adjust for Darwin21.
6612 * config/darwin-c.c (macosx_version_as_macro): Likewise.
6613 * config/darwin-driver.c (validate_macosx_version_min):
6615 (darwin_find_version_from_kernel): Likewise.
6617 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6619 * gimple-range-path.cc
6620 (path_range_query::range_defined_in_block): Call killing_def.
6622 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6624 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Dump
6625 operands as well as relation.
6626 * gimple-range-path.cc
6627 (path_range_query::compute_ranges_in_block): Compute PHI relations
6628 first. Compute outgoing relations at the end.
6629 (path_range_query::compute_ranges): Remove call to compute_relations.
6630 (path_range_query::compute_relations): Remove.
6631 (path_range_query::maybe_register_phi_relation): New.
6632 (path_range_query::compute_phi_relations): Abstract out
6633 registering one PHI relation to...
6634 (path_range_query::compute_outgoing_relations): ...here.
6635 * gimple-range-path.h (class path_range_query): Remove
6637 Add maybe_register_phi_relation.
6639 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6640 Andrew MacLeod <amacleod@redhat.com>
6642 * value-relation.cc (path_oracle::killing_def): Kill second
6645 2021-10-27 Martin Jambor <mjambor@suse.cz>
6647 * ipa-cp.c (good_cloning_opportunity_p): Decide whether to use
6648 profile feedback depending on their local availability.
6650 2021-10-27 Martin Jambor <mjambor@suse.cz>
6652 * params.opt (param_ipa_cp_profile_count_base): New parameter.
6653 * doc/invoke.texi (Optimize Options): Add entry for
6654 ipa-cp-profile-count-base.
6655 * ipa-cp.c (max_count): Replace with base_count, replace all
6656 occurrences too, unless otherwise stated.
6657 (ipcp_cloning_candidate_p): identify mostly-directly called
6658 functions based on their counts, not max_count.
6659 (compare_edge_profile_counts): New function.
6660 (ipcp_propagate_stage): Instead of setting max_count, find the
6661 appropriate edge count in a sorted vector of counts of eligible
6662 edges and make it the base_count.
6664 2021-10-27 Martin Jambor <mjambor@suse.cz>
6666 * ipa-cp.c (struct caller_statistics): New fields rec_count_sum,
6667 n_nonrec_calls and itself, document all fields.
6668 (init_caller_stats): Initialize the above new fields.
6669 (gather_caller_stats): Gather self-recursive counts and calls number.
6670 (get_info_about_necessary_edges): Gather counts of self-recursive and
6671 other edges bringing in the requested value separately.
6672 (dump_profile_updates): Rework to dump info about a single node only.
6673 (lenient_count_portion_handling): New function.
6674 (struct gather_other_count_struct): New type.
6675 (gather_count_of_non_rec_edges): New function.
6676 (struct desc_incoming_count_struct): New type.
6677 (analyze_clone_icoming_counts): New function.
6678 (adjust_clone_incoming_counts): Likewise.
6679 (update_counts_for_self_gen_clones): Likewise.
6680 (update_profiling_info): Rewritten.
6681 (update_specialized_profile): Adjust call to dump_profile_updates.
6682 (create_specialized_node): Do not update profiling info.
6683 (decide_about_value): New parameter self_gen_clones, either push new
6684 clones into it or updat their profile counts. For self-recursively
6685 generated values, use a portion of the node count instead of count
6686 from self-recursive edges to estimate goodness.
6687 (decide_whether_version_node): Gather clones for self-generated values
6688 in a new vector, update their profiles at once at the end.
6690 2021-10-27 Richard Biener <rguenther@suse.de>
6692 * tree-vectorizer.c (vect_transform_loops): New function,
6694 (try_vectorize_loop_1): ... here. Simplify as epilogues
6695 are now fully handled in the split part.
6697 2021-10-27 Jakub Jelinek <jakub@redhat.com>
6699 * omp-expand.c (expand_omp_for_init_counts): Handle non-rectangular
6700 iterators with pointer types.
6701 (expand_omp_for_init_vars, extract_omp_for_update_vars): Likewise.
6703 2021-10-26 Martin Sebor <msebor@redhat.com>
6705 PR tree-optimization/102238
6706 PR tree-optimization/102919
6707 * gimple-ssa-sprintf.c (get_string_length): Add an argument.
6708 (array_elt_at_offset): Move to pointer-query.
6709 (set_aggregate_size_and_offset): New function.
6710 (field_at_offset): Move to pointer-query.
6711 (get_origin_and_offset): Rename...
6712 (get_origin_and_offset_r): this. Add an argument. Make aggregate
6713 handling more robust.
6714 (get_origin_and_offset): New.
6715 (alias_offset): Add an argument.
6716 (format_string): Use subobject size determined by get_origin_and_offset.
6717 * pointer-query.cc (field_at_offset): Move from gimple-ssa-sprintf.c.
6718 Improve/correct handling of aggregates.
6719 (array_elt_at_offset): Same.
6720 * pointer-query.h (field_at_offset): Declare.
6721 (array_elt_at_offset): Declare.
6723 2021-10-26 Martin Sebor <msebor@redhat.com>
6725 * builtins.c (check_strncat_sizes): Pass access_data ctor additional
6727 (expand_builtin_memcmp): Move code to gimple-ssa-warn-access.cc.
6728 (expand_builtin_fork_or_exec): Same.
6729 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Pass
6730 compute_objsize additional arguments.
6731 (inbounds_memaccess_p): Same.
6732 (array_bounds_checker::check_array_bounds): Add an assert. Stash
6733 statement in a member.
6734 (check_array_bounds_dom_walker::before_dom_children): Same.
6735 * gimple-array-bounds.h (array_bounds_checker::m_stmt): New member.
6736 * gimple-ssa-sprintf.c (get_destination_size): Add an argument.
6737 (handle_printf_call): Pass a new argument.
6738 * gimple-ssa-warn-access.cc (get_size_range): Add an argument.
6739 (check_access): Add an argument and pass it along to callees.
6740 (check_read_access): Make a member function.
6741 (pass_waccess::check_strcat): Pass access_data ctor additional
6743 (pass_waccess::check_strncat): Same.
6744 (pass_waccess::check_stxcpy): Same.
6745 (pass_waccess::check_stxncpy): Same.
6746 (pass_waccess::check_strncmp): Same.
6747 (pass_waccess::check_read_access): Same.
6748 (pass_waccess::check_builtin): Same.
6749 (pass_waccess::maybe_check_access_sizes): Same.
6750 (pass_waccess::maybe_check_dealloc_call): Same.
6751 * gimple-ssa-warn-access.h (check_read_access): Declare a new
6753 * pointer-query.cc (compute_objsize_r): Add an argument.
6754 (gimple_call_return_array): Same.
6755 (gimple_call_alloc_size): Same.
6756 (access_ref::access_ref): Same.
6757 (access_ref::get_ref): Same.
6758 (pointer_query::get_ref): Same.
6759 (handle_min_max_size): Pass an arguments to callees.
6760 (handle_array_ref): Add an argument.
6761 (handle_mem_ref): Same.
6762 (compute_objsize): Same.
6763 * pointer-query.h (struct access_ref): Adjust signatures.
6764 (struct access_data): Same.
6765 (gimple_call_alloc_size): Add an argument.
6766 (gimple_parm_array_size): Same.
6767 (compute_objsize): Same.
6768 * tree-ssa-strlen.c (strlen_pass::adjust_last_stmt): Pass an additional
6769 argument to compute_objsize.
6770 (strlen_pass::maybe_warn_overflow): Same.
6771 (maybe_diag_stxncpy_trunc): Same.
6773 2021-10-26 Martin Sebor <msebor@redhat.com>
6775 PR middle-end/102453
6776 * gimple-ssa-warn-access.cc (pass_waccess::check_atomic_builtin): New.
6777 (pass_waccess::check_atomic_builtin): Call it.
6779 2021-10-26 Vladimir N. Makarov <vmakarov@redhat.com>
6781 PR rtl-optimization/102842
6782 * lra-constraints.c (match_reload): Ignore out in checking values
6784 (curr_insn_transform): Collect outputs before doing reloads of operands.
6786 2021-10-26 Paul A. Clarke <pc@us.ibm.com>
6789 * config/rs6000/x86intrin.h: Move some included headers to new
6790 headers. Include new immintrin.h instead of those headers.
6791 * config/rs6000/immintrin.h: New.
6792 * config/rs6000/x86gprintrin.h: New.
6793 * config.gcc (powerpc*-*-*): Add new headers to extra_headers.
6795 2021-10-26 Richard Biener <rguenther@suse.de>
6797 * tree-vectorizer.h (vect_create_addr_base_for_vector_ref):
6798 Remove byte_offset parameter.
6799 (vect_create_data_ref_ptr): Likewise.
6800 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
6802 (vect_create_data_ref_ptr): Likewise.
6803 * tree-vect-stmts.c (vectorizable_store): Adjust.
6804 (vectorizable_load): Likewise.
6806 2021-10-26 Richard Biener <rguenther@suse.de>
6808 PR tree-optimization/96109
6809 * tree-vectorizer.h (dr_misalignment): Add optional offset
6811 * tree-vect-data-refs.c (dr_misalignment): Likewise. Remove
6812 offset applied for negative stride accesses.
6813 (vect_enhance_data_refs_alignment): Compute negative stride
6814 access offset and pass it to dr_misalignment.
6815 * tree-vect-stmts.c (get_negative_load_store_type): Pass
6816 negative offset to dr_misalignment.
6817 (get_group_load_store_type): Likewise.
6818 (get_load_store_type): Likewise.
6819 (vectorizable_store): Remove asserts about alignment.
6820 (vectorizable_load): Likewise.
6822 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
6824 PR tree-optimization/102897
6825 * tree-ssa-forwprop.c (simplify_permutation): Remove a wrong assertion.
6827 2021-10-26 Richard Biener <rguenther@suse.de>
6829 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
6830 Take offset in bytes.
6831 (vect_create_data_ref_ptr): Likewise.
6832 * tree-vect-loop-manip.c (get_misalign_in_elems): Multiply
6833 offset by element size.
6834 (vect_create_cond_for_align_checks): Likewise.
6835 * tree-vect-stmts.c (get_negative_load_store_type): Likewise.
6836 (vectorizable_load): Remove duplicate leftover from merge
6839 2021-10-26 Roger Sayle <roger@nextmovesoftware.com>
6841 * config/i386/i386-expand.c (ix86_expand_v1ti_shift): New helper
6842 function to expand V1TI mode logical shifts by integer constants.
6843 (ix86_expand_v1ti_rotate): New helper function to expand V1TI
6844 mode rotations by integer constants.
6845 * config/i386/i386-protos.h (ix86_expand_v1ti_shift,
6846 ix86_expand_v1ti_rotate): Prototype new functions here.
6847 * config/i386/sse.md (ashlv1ti3, lshrv1ti3, rotlv1ti3, rotrv1ti3):
6848 New TARGET_SSE2 expanders to implement V1TI shifts and rotations.
6850 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6852 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
6853 Avoid threading circular paths.
6855 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6857 * tree-ssa-threadbackward.c (back_threader::resolve_phi):
6858 Attempt to resolve all incoming paths to a PHI.
6859 (back_threader::resolve_def): Always return true for PHIs.
6861 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6863 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
6864 Always try to resolve path without looking back.
6865 * tree-ssa-threadupdate.c (dump_jump_thread): Indidicate whether
6866 edge is a back edge.
6868 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
6870 PR tree-optimization/102789
6871 * tree-vect-loop-manip.c (vect_update_inits_of_drs): Do not
6872 update inits of simd_lane_access.
6874 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6876 * tree-vrp.c (vrp_simplify_cond_using_ranges): Add return type and
6877 move to vr-values.c.
6878 (simplify_casted_conds): Move to vrp_folder class.
6879 (execute_vrp): Call via vrp_folder now.
6880 * vr-values.c (simplify_cond_using_ranges_1): Call simplify_casted_cond.
6881 (simplify_using_ranges::simplify_casted_cond): Relocate from tree-vrp.c.
6882 * vr-values.h (simplify_casted_cond): Add prototype.
6884 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6886 * tree-vrp.c (rvrp_folder::fold_stmt): If simplification fails, try
6889 2021-10-25 Paul A. Clarke <pc@us.ibm.com>
6891 * config/rs6000/smmintrin.h (_mm_testz_si128): Add "extern" to
6893 (_mm_testc_si128): Likewise.
6894 (_mm_testnzc_si128): Likewise.
6895 (_mm_blend_ps): Likewise.
6896 (_mm_blendv_ps): Likewise.
6897 (_mm_blend_pd): Likewise.
6898 (_mm_blendv_pd): Likewise.
6899 (_mm_ceil_pd): Likewise.
6900 (_mm_ceil_sd): Likewise.
6901 (_mm_ceil_ps): Likewise.
6902 (_mm_ceil_ss): Likewise.
6903 (_mm_floor_pd): Likewise.
6904 (_mm_floor_sd): Likewise.
6905 (_mm_floor_ps): Likewise.
6906 (_mm_floor_ss): Likewise.
6907 (_mm_minpos_epu16): Likewise.
6908 (_mm_mul_epi32): Likewise.
6909 (_mm_cvtepi8_epi16): Likewise.
6910 (_mm_packus_epi32): Likewise.
6911 (_mm_cmpgt_epi64): Likewise.
6913 2021-10-25 Roger Sayle <roger@nextmovesoftware.com>
6915 * simplify-rtx.c (simplify_binary_operation_1) [SS_ASHIFT]: Simplify
6916 shifts of the mode's smin_value and smax_value when the bit count
6917 operand doesn't have side-effects.
6918 [US_ASHIFT]: Likewise, simplify shifts of the mode's umax_value
6919 when the bit count operand doesn't have side-effects.
6920 (simplify_const_binary_operation) [SS_ASHIFT, US_ASHIFT]: Perform
6921 compile-time evaluation of saturating left shifts with constant
6924 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6926 * gimple-range.cc (gimple_ranger::export_global_ranges): Remove check
6929 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6931 * flag-types.h (enum ranger_debug): Adjust values.
6932 * params.opt (ranger_debug): Ditto.
6934 2021-10-25 Martin Jambor <mjambor@suse.cz>
6936 PR tree-optimization/102886
6937 * tree-sra.c (totally_scalarize_subtree): Fix the out of
6940 2021-10-25 Andrew Pinski <apinski@marvell.com>
6942 * tree-ssa-dce.c (simple_dce_from_worklist):
6943 Check stmt_unremovable_because_of_non_call_eh_p also
6944 before removing the statement.
6946 2021-10-25 Richard Biener <rguenther@suse.de>
6948 PR tree-optimization/102905
6949 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
6950 Use vect_supportable_dr_alignment again to determine whether
6951 an access is supported when not aligned.
6953 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6955 * config/riscv/riscv.c (riscv_rtx_costs): Handle cost model
6958 2021-10-25 Jim Wilson <jimw@sifive.com>
6959 Kito Cheng <kito.cheng@sifive.com>
6960 Jia-Wei Chen <jiawei@iscas.ac.cn>
6961 Shi-Hua Liao <shihua@iscas.ac.cn>
6963 * config/riscv/bitmanip.md (shiftm1): New.
6964 (*bset<mode>): Ditto.
6965 (*bset<mode>_mask): Ditto.
6966 (*bset<mode>_1): Ditto.
6967 (*bset<mode>_1_mask): Ditto.
6968 (*bseti<mode>): Ditto.
6969 (*bclr<mode>): Ditto.
6970 (*bclri<mode>): Ditto.
6971 (*binv<mode>): Ditto.
6972 (*binvi<mode>): Ditto.
6973 (*bext<mode>): Ditto.
6975 * config/riscv/predicates.md (splittable_const_int_operand):
6977 (single_bit_mask_operand): New.
6978 (not_single_bit_mask_operand): Ditto.
6979 (const31_operand): Ditto.
6980 (const63_operand): Ditto.
6981 * config/riscv/riscv.c (riscv_build_integer_1): Handle bseti.
6982 (riscv_output_move): Ditto.
6983 (riscv_print_operand): Handle new operand type: T and S.
6984 * config/riscv/riscv.h (SINGLE_BIT_MASK_OPERAND): New.
6986 2021-10-25 Jim Wilson <jimw@sifive.com>
6988 * config/riscv/riscv.c (riscv_build_integer_1): Build integer
6991 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6993 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
6995 (riscv_rtx_costs): Ditto.
6997 2021-10-25 Jim Wilson <jimw@sifive.com>
6998 Kito Cheng <kito.cheng@sifive.com>
6999 Jia-Wei Chen <jiawei@iscas.ac.cn>
7001 * config/riscv/bitmanip.md (bitmanip_bitwise): New.
7002 (bitmanip_minmax): New.
7003 (clz_ctz_pcnt): New.
7004 (bitmanip_optab): New.
7005 (bitmanip_insn): New.
7006 (*<optab>_not<mode>): New.
7007 (*xor_not<mode>): New.
7008 (<bitmanip_optab>si2): New.
7009 (*<bitmanip_optab>disi2): New.
7010 (<bitmanip_optab>di2): New.
7011 (*zero_extendhi<GPR:mode>2_bitmanip): New.
7012 (*extend<SHORT:mode><SUPERQI:mode>2_zbb): New.
7013 (*zero_extendhi<GPR:mode>2_zbb): New.
7016 (rotrsi3_sext): New.
7019 (rotlsi3_sext): New.
7020 (bswap<mode>2): New.
7021 (<bitmanip_optab><mode>3): New.
7022 * config/riscv/riscv.md (type): Add rotate.
7023 (zero_extendhi<GPR:mode>2): Change to define_expand pattern.
7024 (*zero_extendhi<GPR:mode>2): New.
7025 (extend<SHORT:mode><SUPERQI:mode>2): Change to define_expand pattern.
7026 (*extend<SHORT:mode><SUPERQI:mode>2): New.
7028 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
7030 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
7032 (riscv_rtx_costs): Ditto.
7034 2021-10-25 Jim Wilson <jimw@sifive.com>
7035 Kito Cheng <kito.cheng@sifive.com>
7036 Jia-Wei Chen <jiawei@iscas.ac.cn>
7038 * config/riscv/bitmanip.md (*zero_extendsidi2_bitmanip): New.
7043 (riscv_rtx_costs): Ditto.
7044 * config/riscv/riscv.md: Include bitmanip.md
7045 (type): Add bitmanip bype.
7046 (zero_extendsidi2): Change to define_expand pattern.
7047 (*zero_extendsidi2_internal): New.
7048 (zero_extendsidi2_shifted): Disable for ZBA.
7050 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
7052 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
7053 Add zba, zbb, zbc and zbs.
7054 (riscv_ext_flag_table): Ditto.
7055 * config/riscv/riscv-opts.h (MASK_ZBA): New.
7059 (TARGET_ZBA): Ditto.
7060 (TARGET_ZBB): Ditto.
7061 (TARGET_ZBC): Ditto.
7062 (TARGET_ZBS): Ditto.
7063 * config/riscv/riscv.opt (riscv_zb_subext): New.
7065 2021-10-25 liuhongt <hongtao.liu@intel.com>
7068 * match.pd: Simplify (_Float16) sqrtf((float) a) to .SQRT(a)
7069 when direct_internal_fn_supported_p, similar for sqrt/sqrtl.
7071 2021-10-25 Richard Biener <rguenther@suse.de>
7073 PR tree-optimization/102920
7074 * tree-ssa-sccvn.h (expressions_equal_p): Add argument
7075 controlling VN_TOP matching behavior.
7076 * tree-ssa-sccvn.c (expressions_equal_p): Likewise.
7077 (vn_phi_eq): Do not optimistically match VN_TOP.
7079 2021-10-25 konglin1 <lingling.kong@intel.com>
7081 * config/i386/sse.md (fma_<mode>_fadd_fmul): Add new
7082 define_insn_and_split.
7083 (fma_<mode>_fadd_fcmul):Likewise
7084 (fma_<complexopname>_<mode>_fma_zero):Likewise
7086 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
7088 * config/pa/pa-d.c (pa_d_handle_target_float_abi): Don't check
7089 TARGET_DISABLE_FPREGS.
7090 * config/pa/pa.c (fix_range): Use MASK_SOFT_FLOAT instead of
7091 MASK_DISABLE_FPREGS.
7092 (hppa_rtx_costs): Don't check TARGET_DISABLE_FPREGS. Adjust
7093 cost of hardware integer multiplication.
7094 (pa_conditional_register_usage): Don't check TARGET_DISABLE_FPREGS.
7095 * config/pa/pa.h (INT14_OK_STRICT): Likewise.
7096 * config/pa/pa.md: Don't check TARGET_DISABLE_FPREGS. Check
7097 TARGET_SOFT_FLOAT in patterns that use xmpyu instruction.
7098 * config/pa/pa.opt (mdisable-fpregs): Change target mask to
7099 SOFT_FLOAT. Revise comment.
7100 (msoft-float): New option.
7102 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
7104 * config/pa/pa.md: Don't use 'G' constraint in integer move patterns.
7106 2021-10-24 Gerald Pfeifer <gerald@pfeifer.com>
7108 * doc/install.texi (Specific): Remove obsolete details
7109 around GNU/Linux on Itanium.
7110 (Specific): Remove reference to Windows for Itanium.
7112 2021-10-23 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
7114 * config/i386/x86-tune-sched-bd.c (dispatch_group): Commentary
7117 2021-10-23 Jan Hubicka <hubicka@ucw.cz>
7119 * tree-ssa-structalias.c (compute_points_to_sets): Cleanup.
7121 2021-10-23 Roger Sayle <roger@nextmovesoftware.com>
7123 * config/i386/sse.md (<any_logic>v1ti3): New define_insn to
7124 implement V1TImode AND, IOR and XOR on TARGET_SSE2 (and above).
7125 (one_cmplv1ti2): New define expand.
7127 2021-10-22 Eric Gallager <egallager@gcc.gnu.org>
7130 * Makefile.in: Handle dvidir and install-dvi target.
7131 * configure: Regenerate.
7132 * configure.ac: Add install-dvi to target_list.
7134 2021-10-22 Gerald Pfeifer <gerald@pfeifer.com>
7136 * doc/install.texi (Binaries): Convert mingw-w64.org to https.
7139 2021-10-22 Richard Biener <rguenther@suse.de>
7141 PR tree-optimization/102893
7142 * tree-ssa-dce.c (find_obviously_necessary_stmts): Fix the
7143 test for an exit edge.
7145 2021-10-22 Aldy Hernandez <aldyh@redhat.com>
7146 Andrew MacLeod <amacleod@redhat.com>
7148 * gimple-range-path.cc (path_range_query::compute_phi_relations):
7149 Kill any global relations we may know before registering a new
7151 * value-relation.cc (path_oracle::killing_def): New.
7152 * value-relation.h (path_oracle::killing_def): New.
7154 2021-10-22 Richard Biener <rguenther@suse.de>
7157 * tree-ssa-sccvn.c (vn_phi_insert): For undefined SSA args
7159 (vn_phi_lookup): Likewise.
7161 2021-10-21 H.J. Lu <hjl.tools@gmail.com>
7164 * doc/invoke.texi: Document -fcf-protection requires i686 or
7167 2021-10-21 Eric Botcazou <ebotcazou@adacore.com>
7169 PR middle-end/102764
7170 * cfgexpand.c (expand_gimple_basic_block): Robustify latest change.
7172 2021-10-21 Jonathan Wright <jonathan.wright@arm.com>
7174 * config/aarch64/arm_neon.h (__STRUCTN): Delete function
7175 macro and all invocations.
7177 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7179 * doc/invoke.texi (ranger-debug): Document.
7180 * flag-types.h (enum ranger_debug): New.
7181 (enum evrp_mode): Remove debug values.
7182 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Use new debug flag.
7183 * gimple-range-gori.cc (gori_compute::gori_compute): Ditto.
7184 * gimple-range.cc (gimple_ranger::gimple_ranger): Ditto.
7185 * gimple-ssa-evrp.c (hybrid_folder::choose_value): Ditto.
7186 (execute_early_vrp): Use evrp-mode directly.
7187 * params.opt (enum evrp_mode): Remove debug values.
7188 (ranger-debug): New.
7189 (ranger-logical-depth): Relocate to be in alphabetical order.
7191 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7193 * doc/invoke.texi: (vrp1-mode, vrp2-mode): Document.
7194 * flag-types.h: (enum vrp_mode): New.
7195 * params.opt: (vrp1-mode, vrp2-mode): New.
7196 * tree-vrp.c (vrp_pass_num): New.
7197 (pass_vrp::pass_vrp): Set pass number.
7198 (pass_vrp::execute): Choose which VRP mode to execute.
7200 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7202 * gimple-ssa-evrp.c (class rvrp_folder): Move to tree-vrp.c.
7203 (execute_early_vrp): For ranger only mode, invoke ranger_vrp.
7204 * tree-vrp.c (class rvrp_folder): Relocate here.
7205 (execute_ranger_vrp): New.
7206 * tree-vrp.h (execute_ranger_vrp): Export.
7208 2021-10-21 Martin Liska <mliska@suse.cz>
7212 * opts.c (finish_options): Process flag_var_tracking* options
7213 here as they can be adjusted by optimize attribute.
7214 Process also flag_syntax_only and flag_gtoggle.
7215 * toplev.c (process_options): Remove it here.
7216 * common.opt: Make debug_nonbind_markers_p as PerFunction
7217 attribute as it depends on optimization level.
7219 2021-10-21 Martin Jambor <mjambor@suse.cz>
7221 PR tree-optimization/102505
7222 * tree-sra.c (totally_scalarize_subtree): Check that the
7223 encountered field fits within the acces we would like to put it
7226 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
7228 * tree-ssa-threadbackward.c
7229 (back_threader::maybe_register_path): Remove circular paths check.
7231 2021-10-21 Richard Biener <rguenther@suse.de>
7233 * toplev.c (process_options): Move the initial debug_hooks
7235 (toplev::main): ... before the call of the post_options
7238 2021-10-21 Richard Biener <rguenther@suse.de>
7240 PR tree-optimization/102847
7241 * tree-vect-stmts.c (vect_model_load_cost): Add the scalar
7242 load cost in the prologue for VMAT_INVARIANT.
7244 2021-10-21 Richard Biener <rguenther@suse.de>
7246 PR tree-optimization/102847
7247 * tree-vect-stmts.c (vect_model_load_cost): Explicitely
7248 handle VMAT_INVARIANT as a splat in the prologue.
7250 2021-10-21 Hongyu Wang <hongyu.wang@intel.com>
7253 * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector
7254 move to use the same logic as HImode.
7256 2021-10-21 Richard Biener <rguenther@suse.de>
7258 * tree-vect-slp.c (vect_build_slp_tree_1): Remove
7259 superfluous gimple_call_nothrow_p check.
7261 2021-10-21 Andrew Pinski <apinski@marvell.com>
7263 * tree-cfg.c (maybe_remove_writeonly_store): Add dce_ssa_names argument.
7264 Mark the ssa-name of the rhs as one to be removed.
7265 (execute_fixup_cfg): Update call to maybe_remove_writeonly_store.
7266 Call simple_dce_from_worklist at the end to a simple dce.
7268 2021-10-21 Andrew Pinski <apinski@marvell.com>
7270 * tree-cfg.c (maybe_remove_writeonly_store): New function
7271 factored out from ...
7272 (execute_fixup_cfg): Here. Call maybe_remove_writeonly_store.
7274 2021-10-21 Andrew Pinski <apinski@marvell.com>
7276 * tree-cfg.c (execute_fixup_cfg): Remove comment
7277 about standalone pass.
7279 2021-10-21 Andrew Pinski <apinski@marvell.com>
7281 * tree-cfg.c (execute_fixup_cfg): Output when the statement
7282 is removed when it is a write only var.
7284 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
7286 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
7287 Avoid threading circular paths.
7289 2021-10-20 Alex Coplan <alex.coplan@arm.com>
7291 * calls.c (initialize_argument_information): Remove some dead
7292 code, remove handling for function_arg returning const_int.
7293 * doc/tm.texi: Delete documentation for unused target hooks.
7294 * doc/tm.texi.in: Likewise.
7295 * target.def (load_bounds_for_arg): Delete.
7296 (store_bounds_for_arg): Delete.
7297 (load_returned_bounds): Delete.
7298 (store_returned_bounds): Delete.
7299 * targhooks.c (default_load_bounds_for_arg): Delete.
7300 (default_store_bounds_for_arg): Delete.
7301 (default_load_returned_bounds): Delete.
7302 (default_store_returned_bounds): Delete.
7303 * targhooks.h (default_load_bounds_for_arg): Delete.
7304 (default_store_bounds_for_arg): Delete.
7305 (default_load_returned_bounds): Delete.
7306 (default_store_returned_bounds): Delete.
7308 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
7310 * doc/extend.texi (Basic Asm): Clarify that asm is not an
7312 * doc/invoke.texi (-fno-asm): Fix description for C++.
7314 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
7316 * doc/install.texi: Remove link to old.html
7318 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7320 * config/aarch64/aarch64-simd.md (*aarch64_cmtst_same_<mode>): New.
7322 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7324 * config/aarch64/aarch64-simd.md (*aarch64_narrow_trunc<mode>): New.
7326 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7328 * config/aarch64/aarch64-simd.md (aarch64_simd_ashr<mode>): Add case cmp
7330 * config/aarch64/constraints.md (D1): New.
7332 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7334 * config/aarch64/aarch64-simd.md
7335 (*aarch64_<srn_op>topbits_shuffle<mode>_le): New.
7336 (*aarch64_topbits_shuffle<mode>_le): New.
7337 (*aarch64_<srn_op>topbits_shuffle<mode>_be): New.
7338 (*aarch64_topbits_shuffle<mode>_be): New.
7339 * config/aarch64/predicates.md
7340 (aarch64_simd_shift_imm_vec_exact_top): New.
7342 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7344 * config/aarch64/aarch64-simd.md (*aarch64_<srn_op>shrn<mode>_vect,
7345 *aarch64_<srn_op>shrn<mode>2_vect_le,
7346 *aarch64_<srn_op>shrn<mode>2_vect_be): New.
7347 * config/aarch64/iterators.md (srn_op): New.
7349 2021-10-20 Chung-Lin Tang <cltang@codesourcery.com>
7351 * omp-low.c (omp_copy_decl_2): For !ctx, use record_vars to add new copy
7353 (scan_sharing_clauses): Place copy of OMP_CLAUSE_IN_REDUCTION decl in
7354 ctx->outer instead of ctx.
7356 2021-10-20 Martin Liska <mliska@suse.cz>
7359 2021-10-19 Martin Liska <mliska@suse.cz>
7362 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
7363 * system.h (strip_whilespaces): New function.
7365 2021-10-20 Martin Liska <mliska@suse.cz>
7368 2021-10-19 Martin Liska <mliska@suse.cz>
7371 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
7374 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7376 * config/aarch64/aarch64.c (aarch64_case_values_threshold):
7377 Change to 8 with -Os, 11 otherwise.
7379 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7381 * config/aarch64/aarch64.c (neoversev1_tunings):
7382 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
7383 (neoversen2_tunings): Likewise.
7385 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7388 * config/aarch64/aarch64.md (frint_pattern): Update comment.
7389 * config/aarch64/aarch64-simd-builtins.def: Change frintn to roundeven.
7390 * config/aarch64/arm_fp16.h: Change frintn to roundeven.
7391 * config/aarch64/arm_neon.h: Likewise.
7392 * config/aarch64/iterators.md (frint_pattern): Use roundeven for FRINTN.
7394 2021-10-20 Martin Liska <mliska@suse.cz>
7396 * config/arm/arm.c (arm_unwind_emit_sequence): Do not declare
7397 already declared global variable.
7398 (arm_unwind_emit_set): Use out_file as function argument.
7399 (arm_unwind_emit): Likewise.
7400 * config/darwin.c (machopic_output_data_section_indirection): Likewise.
7401 (machopic_output_stub_indirection): Likewise.
7402 (machopic_output_indirection): Likewise.
7403 (machopic_finish): Likewise.
7404 * config/i386/i386.c (ix86_asm_output_function_label): Likewise.
7405 * config/i386/winnt.c (i386_pe_seh_unwind_emit): Likewise.
7406 * config/ia64/ia64.c (process_epilogue): Likewise.
7407 (process_cfa_adjust_cfa): Likewise.
7408 (process_cfa_register): Likewise.
7409 (process_cfa_offset): Likewise.
7410 (ia64_asm_unwind_emit): Likewise.
7411 * config/s390/s390.c (s390_asm_output_function_label): Likewise.
7413 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
7415 * config/aarch64/aarch64-builtins.c
7416 (aarch64_general_gimple_fold_builtin):
7417 lower vld1 and vst1 variants of the neon builtins
7418 * config/aarch64/aarch64-protos.h:
7419 (aarch64_general_gimple_fold_builtin): Add gsi parameter.
7420 * config/aarch64/aarch64.c (aarch64_general_gimple_fold_builtin):
7423 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
7425 * match.pd: Generate IFN_TRUNC.
7427 2021-10-20 Richard Biener <rguenther@suse.de>
7429 PR tree-optimization/102853
7430 * tree-data-ref.c (split_constant_offset_1): Bail out
7431 immediately if the expression traps on overflow.
7433 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7435 * tree-ssa-threadbackward.c (back_threader::~back_threader): Remove.
7437 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7439 * tree-ssa-threadupdate.c (back_jt_path_registry::adjust_paths_after_duplication):
7440 Remove superflous debugging message.
7441 (back_jt_path_registry::duplicate_thread_path): Same.
7443 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7445 * tree-ssa-threadbackward.c (back_threader_registry::back_threader_registry):
7447 (back_threader_registry::register_path): Remove m_threaded_paths.
7449 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7451 PR tree-optimization/102814
7452 * doc/invoke.texi: Document --param=max-fsm-thread-length.
7453 * params.opt: Add --param=max-fsm-thread-length.
7454 * tree-ssa-threadbackward.c
7455 (back_threader_profitability::profitable_path_p): Fail on paths
7456 longer than max-fsm-thread-length.
7458 2021-10-20 Eric Botcazou <ebotcazou@adacore.com>
7460 PR middle-end/102764
7461 * cfgexpand.c (expand_gimple_basic_block): Disregard a final debug
7462 statement to reset the current location for the outgoing edges.
7464 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7465 Richard Biener <rguenther@suse.de>
7467 * tree-ssa-threadupdate.c (cancel_thread): Dump threading reason
7468 on the same line as the threading cancellation.
7469 (jt_path_registry::cancel_invalid_paths): Avoid rotating loops.
7470 Avoid threading through loop headers where the path remains in the
7473 2021-10-20 Siddhesh Poyarekar <siddhesh@gotplt.org>
7475 * tree-object-size.c (unknown): Make into a function. Adjust
7477 (unknown_object_size): Simplify implementation.
7479 2021-10-20 Hongtao Liu <hongtao.liu@intel.com>
7480 Kewen Lin <linkw@linux.ibm.com>
7482 * doc/sourcebuild.texi (Effective-Target Keywords): Document
7483 vect_slp_v2qi_store, vect_slp_v4qi_store, vect_slp_v8qi_store,
7484 vect_slp_v16qi_store, vect_slp_v2hi_store,
7485 vect_slp_v4hi_store, vect_slp_v2si_store, vect_slp_v4si_store.
7487 2021-10-19 Jonathan Wakely <jwakely@redhat.com>
7489 * doc/extend.texi (Basic PowerPC Built-in Functions): Fix typo.
7491 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
7495 * config/rs6000/emmintrin.h: Guard POWER8 intrinsics.
7496 * config/rs6000/pmmintrin.h: Same.
7497 * config/rs6000/smmintrin.h: Same.
7498 * config/rs6000/tmmintrin.h: Same.
7500 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
7502 * config.gcc (extra_headers): Add nmmintrin.h.
7504 2021-10-19 Richard Biener <rguenther@suse.de>
7506 * tree-vectorizer.h (vect_supportable_dr_alignment): Add
7507 misalignment parameter.
7508 * tree-vect-data-refs.c (vect_get_peeling_costs_all_drs):
7509 Do not change DR_MISALIGNMENT in place, instead pass the
7510 adjusted misalignment to vect_supportable_dr_alignment.
7511 (vect_peeling_supportable): Likewise.
7512 (vect_peeling_hash_get_lowest_cost): Adjust.
7513 (vect_enhance_data_refs_alignment): Likewise.
7514 (vect_vfa_access_size): Likewise.
7515 (vect_supportable_dr_alignment): Add misalignment
7516 parameter and simplify.
7517 * tree-vect-stmts.c (get_negative_load_store_type): Adjust.
7518 (get_group_load_store_type): Likewise.
7519 (get_load_store_type): Likewise.
7521 2021-10-19 Clément Chigot <clement.chigot@atos.net>
7523 * config/rs6000/rs6000.c (rs6000_xcoff_file_end): Move
7524 __tls_get_addr reference to .text csect.
7526 2021-10-19 Martin Liska <mliska@suse.cz>
7529 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
7532 2021-10-19 Richard Biener <rguenther@suse.de>
7534 * tree-vectorizer.h (vect_get_store_cost): Adjust signature.
7535 (vect_get_load_cost): Likewise.
7536 * tree-vect-data-refs.c (vect_get_data_access_cost): Get
7537 alignment support scheme and misalignment as arguments
7539 (vect_get_peeling_costs_all_drs): Compute that info here
7540 and note that we shouldn't need to.
7541 * tree-vect-stmts.c (vect_model_store_cost): Get
7542 alignment support scheme and misalignment as arguments.
7543 (vect_get_store_cost): Likewise.
7544 (vect_model_load_cost): Likewise.
7545 (vect_get_load_cost): Likewise.
7546 (vectorizable_store): Pass down alignment support scheme
7547 and misalignment to costing.
7548 (vectorizable_load): Likewise.
7550 2021-10-19 Richard Biener <rguenther@suse.de>
7552 * tree-vect-stmts.c (get_negative_load_store_type): Add
7553 offset output parameter and initialize it.
7554 (get_group_load_store_type): Likewise.
7555 (get_load_store_type): Likewise.
7556 (vectorizable_store): Use offset as computed by
7557 get_load_store_type.
7558 (vectorizable_load): Likewise.
7560 2021-10-19 Richard Biener <rguenther@suse.de>
7562 PR tree-optimization/102827
7563 * tree-if-conv.c (predicate_statements): Add pe parameter
7564 and use that edge to insert invariant stmts on.
7565 (combine_blocks): Pass through pe.
7566 (tree_if_conversion): Compute the edge to insert invariant
7567 stmts on and pass it along.
7569 2021-10-19 Roger Sayle <roger@nextmovesoftware.com>
7572 * config/bfin/bfin.md (addsubv2hi3, subaddv2hi3, ssaddsubv2hi3,
7573 sssubaddv2hi3): Swap the order of operators in vec_concat.
7575 2021-10-19 Xionghu Luo <luoxhu@linux.ibm.com>
7577 * config/rs6000/altivec.md (*altivec_vmrghb_internal): Delete.
7578 (altivec_vmrghb_direct): New.
7579 (*altivec_vmrghh_internal): Delete.
7580 (altivec_vmrghh_direct): New.
7581 (*altivec_vmrghw_internal): Delete.
7582 (altivec_vmrghw_direct_<mode>): New.
7583 (altivec_vmrghw_direct): Delete.
7584 (*altivec_vmrglb_internal): Delete.
7585 (altivec_vmrglb_direct): New.
7586 (*altivec_vmrglh_internal): Delete.
7587 (altivec_vmrglh_direct): New.
7588 (*altivec_vmrglw_internal): Delete.
7589 (altivec_vmrglw_direct_<mode>): New.
7590 (altivec_vmrglw_direct): Delete.
7591 * config/rs6000/rs6000-p8swap.c (rtx_is_swappable_p): Adjust.
7592 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const):
7594 * config/rs6000/vsx.md (vsx_xxmrghw_<mode>): Adjust.
7595 (vsx_xxmrglw_<mode>): Adjust.
7597 2021-10-19 Aldy Hernandez <aldyh@redhat.com>
7599 * passes.def: Change threading comment before pass_ccp pass.
7601 2021-10-19 Haochen Gui <guihaoc@gcc.gnu.org>
7603 * config/rs6000/rs6000-call.c (altivec_expand_lxvr_builtin):
7604 Modify the expansion for sign extension. All extensions are done
7605 within VSX registers.
7607 2021-10-19 Richard Biener <rguenther@suse.de>
7609 * tree-vect-stmts.c (get_group_load_store_type): Add
7610 misalignment output parameter and initialize it.
7611 (get_group_load_store_type): Likewise.
7612 (vectorizable_store): Remove now redundant queries.
7613 (vectorizable_load): Likewise.
7615 2021-10-19 Richard Biener <rguenther@suse.de>
7617 * tree-vectorizer.h (vect_supportable_dr_alignment): Remove
7618 check_aligned argument.
7619 * tree-vect-data-refs.c (vect_supportable_dr_alignment):
7621 (vect_peeling_hash_insert): Add supportable_if_not_aligned
7622 argument and do not call vect_supportable_dr_alignment here.
7623 (vect_peeling_supportable): Adjust.
7624 (vect_enhance_data_refs_alignment): Compute whether the
7625 access is supported with different alignment here and
7626 pass that down to vect_peeling_hash_insert.
7627 (vect_vfa_access_size): Adjust.
7628 * tree-vect-stmts.c (vect_get_store_cost): Likewise.
7629 (vect_get_load_cost): Likewise.
7630 (get_negative_load_store_type): Likewise.
7631 (get_group_load_store_type): Likewise.
7632 (get_load_store_type): Likewise.
7634 2021-10-19 Martin Liska <mliska@suse.cz>
7637 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
7638 * system.h (strip_whilespaces): New function.
7640 2021-10-19 dianhong xu <dianhong.xu@intel.com>
7642 * config/i386/avx512fp16intrin.h:
7643 (_mm512_set1_pch): New intrinsic.
7644 * config/i386/avx512fp16vlintrin.h:
7645 (_mm256_set1_pch): New intrinsic.
7646 (_mm_set1_pch): Ditto.
7648 2021-10-18 Andrew MacLeod <amacleod@redhat.com>
7650 PR tree-optimization/102796
7651 * gimple-range.cc (gimple_ranger::range_on_edge): Process EH edges
7652 normally. Return get_tree_range for non gimple_range_ssa_p names.
7653 (gimple_ranger::range_of_stmt): Use get_tree_range for non
7654 gimple_range_ssa_p names.
7656 2021-10-18 Uroš Bizjak <ubizjak@gmail.com>
7659 * config/i386/i386.c (ix86_print_operand_address):
7660 Error out for non-address_operand asm operands.
7662 2021-10-18 Richard Biener <rguenther@suse.de>
7664 * tree-vect-data-refs.c (vect_peeling_hash_insert): Do
7665 not auto-convert dr_alignment_support to bool.
7666 (vect_peeling_supportable): Likewise.
7667 (vect_enhance_data_refs_alignment): Likewise.
7668 (vect_supportable_dr_alignment): Commonize read/write case.
7669 * tree-vect-stmts.c (vect_get_store_cost): Use
7670 dr_alignment_support, not int, for the vect_supportable_dr_alignment
7672 (vect_get_load_cost): Likewise.
7674 2021-10-18 Siddhesh Poyarekar <siddhesh@gotplt.org>
7676 * tree-object-size.c (object_sizes_execute): Consolidate LHS
7677 null check and do it early.
7679 2021-10-18 Richard Biener <rguenther@suse.de>
7681 * tree-vect-stmts.c (vectorizable_store): Use the
7682 computed alignment scheme instead of querying
7685 2021-10-18 Richard Biener <rguenther@suse.de>
7687 * tree-vect-stmts.c (vectorizable_store): Do not recompute
7688 alignment scheme already determined by get_load_store_type.
7690 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
7692 * tree-ssa-threadbackward.c (class pass_thread_jumps_full):
7693 Clone corresponding pass.
7695 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7697 * combine.c (recog_for_combine): For an unrecognized move/set of
7698 a constant, try force_const_mem to place it in the constant pool.
7699 * cse.c (constant_pool_entries_cost, constant_pool_entries_regcost):
7700 Delete global variables (that are no longer assigned a cost value).
7701 (cse_insn): Simplify logic for deciding whether to place a folded
7702 constant in the constant pool using force_const_mem.
7703 (cse_main): Remove zero initialization of constant_pool_entries_cost
7704 and constant_pool_entries_regcost.
7705 * config/i386/i386.c (ix86_rtx_costs): Make memory accesses
7706 fractionally more expensive, when optimizing for speed.
7708 2021-10-18 Martin Liska <mliska@suse.cz>
7710 PR gcov-profile/102746
7711 PR gcov-profile/102747
7712 * gcov.c (main): Return return_code.
7713 (output_gcov_file): Mark return_code when error happens.
7714 (generate_results): Likewise.
7715 (read_graph_file): Likewise.
7716 (read_count_file): Likewise.
7718 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7720 * config/bfin/bfin.md (define_constants): Remove UNSPEC_ONES.
7721 (define_insn "ones"): Replace UNSPEC_ONES with a truncate of
7722 a popcount, allowing compile-time evaluation/simplification.
7723 (popcountsi2, popcounthi2): New expanders using a "ones" insn.
7725 2021-10-18 Richard Biener <rguenther@suse.de>
7727 PR tree-optimization/102788
7728 * tree-vect-patterns.c (vect_init_pattern_stmt): Allow
7730 (vect_pattern_recog_1): Likewise.
7731 (vect_recog_bool_pattern): Continue matching the pattern
7732 even if we do not have a vector type for a conversion
7735 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7737 * simplify-rtx.c (simplify_const_unary_operation) [SS_NEG, SS_ABS]:
7738 Evalute SS_NEG and SS_ABS of a constant argument.
7740 2021-10-18 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
7743 * gimple-match-head.c (try_conditional_simplification): Add case for single operand.
7744 * internal-fn.def: Add entry for COND_NEG internal function.
7745 * internal-fn.c (FOR_EACH_CODE_MAPPING): Add entry for
7746 NEGATE_EXPR, COND_NEG mapping.
7747 * optabs.def: Add entry for cond_neg_optab.
7748 * match.pd (UNCOND_UNARY, COND_UNARY): New operator lists.
7749 (vec_cond COND (foo A) B) -> (IFN_COND_FOO COND A B): New pattern.
7750 (vec_cond COND B (foo A)) -> (IFN_COND_FOO ~COND A B): Likewise.
7752 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
7754 * tree-ssa-strlen.c (class strlen_pass): Rename from
7756 (handle_builtin_stxncpy_strncat): Move to strlen_pass.
7757 (handle_assign): Same.
7758 (adjust_last_stmt): Same.
7759 (maybe_warn_overflow): Same.
7760 (handle_builtin_strlen): Same.
7761 (handle_builtin_strchr): Same.
7762 (handle_builtin_strcpy): Same.
7763 (handle_builtin_strncat): Same.
7764 (handle_builtin_stxncpy_strncat): Same.
7765 (handle_builtin_memcpy): Same.
7766 (handle_builtin_strcat): Same.
7767 (handle_alloc_call): Same.
7768 (handle_builtin_memset): Same.
7769 (handle_builtin_memcmp): Same.
7770 (get_len_or_size): Same.
7771 (strxcmp_eqz_result): Same.
7772 (handle_builtin_string_cmp): Same.
7773 (handle_pointer_plus): Same.
7774 (count_nonzero_bytes_addr): Same.
7775 (count_nonzero_bytes): Same.
7776 (handle_store): Same.
7777 (strlen_check_and_optimize_call): Same.
7778 (handle_integral_assign): Same.
7779 (check_and_optimize_stmt): Same.
7780 (printf_strlen_execute): Rename strlen_dom_walker to strlen_pass.
7782 2021-10-18 Richard Biener <rguenther@suse.de>
7784 PR tree-optimization/102798
7785 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
7786 Only copy points-to info to newly generated SSA names.
7788 2021-10-18 Martin Liska <mliska@suse.cz>
7790 * dbgcnt.c (dbg_cnt_process_opt): Remove unused but set variable.
7791 * gcov.c (get_cycles_count): Likewise.
7792 * lto-compress.c (lto_compression_zlib): Likewise.
7793 (lto_uncompression_zlib): Likewise.
7794 * targhooks.c (default_pch_valid_p): Likewise.
7796 2021-10-17 Aldy Hernandez <aldyh@redhat.com>
7798 * tree-pass.h (make_pass_thread_jumps_full): New.
7799 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Inline.
7800 (try_thread_blocks): Add resolve and speed arguments.
7801 (pass_thread_jumps::execute): Inline.
7802 (do_early_thread_jumps): New.
7803 (do_thread_jumps): New.
7804 (make_pass_thread_jumps): Move.
7805 (pass_early_thread_jumps::gate): Inline.
7806 (pass_early_thread_jumps::execute): Inline.
7807 (class pass_thread_jumps_full): New.
7809 2021-10-16 Piotr Kubaj <pkubaj@FreeBSD.org>
7811 * configure.ac: Treat powerpc64*-*-freebsd* the same as
7812 powerpc64-*-freebsd*.
7813 * configure: Regenerate.
7815 2021-10-16 H.J. Lu <hjl.tools@gmail.com>
7817 * value-query.cc (get_ssa_name_ptr_info_nonnull): Change
7818 set_ptr_nonull to set_ptr_nonnull in comments.
7820 2021-10-16 Jan Hubicka <hubicka@ucw.cz>
7822 PR tree-optimization/102720
7823 * tree-ssa-structalias.c (compute_points_to_sets): Fix producing
7824 of call used and clobbered sets.
7826 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7828 * config/darwin.h (LINK_COMMAND_SPEC_A): Update 'r' handling to
7829 skip gomp and itm when r or nodefaultlibs is given.
7830 (DSYMUTIL_SPEC): Do not call dsymutil for '-r' link lines.
7831 Update ordering of exclusions, remove duplicate 'v' addition
7832 (collect2 will add this from the main command line).
7834 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7836 * config/darwin-driver.c (darwin_driver_init): Revise comments, handle
7837 filelist and framework options in specs instead of code.
7838 * config/darwin.h (SUBTARGET_DRIVER_SELF_SPECS): Update to handle link
7839 specs that are really driver ones.
7840 (DARWIN_CC1_SPEC): Likewise.
7841 (CPP_SPEC): Likewise.
7842 (SYSROOT_SPEC): Append space.
7843 (LINK_SYSROOT_SPEC): Remove most driver link specs.
7844 (STANDARD_STARTFILE_PREFIX_2): Update link-related specs.
7845 (STARTFILE_SPEC): Likewise.
7846 (ASM_MMACOSX_VERSION_MIN_SPEC): Fix line wrap.
7847 (ASM_SPEC): Update driver-related specs.
7848 (ASM_FINAL_SPEC): Likewise.
7849 * config/darwin.opt: Remove now unused option aliases.
7850 * config/i386/darwin.h (EXTRA_ASM_OPTS): Ensure space after opt.
7851 (ASM_SPEC): Update driver-related specs.
7853 2021-10-15 Roger Sayle <roger@nextmovesoftware.com>
7855 * config/i386/i386.c (ix86_hardreg_mov_ok): For vector modes,
7856 allow standard_sse_constant_p immediate constants.
7858 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7860 * config.gcc: Add tm-dwarf2.h to tm_d-file.
7862 2021-10-15 Andrew MacLeod <amacleod@redhat.com>
7864 * gimple-range-fold.h (gimple_range_ssa_p): Don't process names
7865 that occur in abnormal phis.
7866 * gimple-range.cc (gimple_ranger::range_on_edge): Return false for
7867 abnormal and EH edges.
7868 * gimple-ssa-evrp.c (rvrp_folder::value_of_expr): Ditto.
7869 (rvrp_folder::value_on_edge): Ditto.
7870 (rvrp_folder::value_of_stmt): Ditto.
7871 (hybrid_folder::value_of_expr): Ditto for ranger queries.
7872 (hybrid_folder::value_on_edge): Ditto.
7873 (hybrid_folder::value_of_stmt): Ditto.
7874 * value-query.cc (gimple_range_global): Always return a range if
7875 the type is supported.
7877 2021-10-15 John David Anglin <danglin@gcc.gnu.org>
7879 * config/pa/pa.md: Consistently use "rG" constraint for copy
7880 instruction in move patterns.
7882 2021-10-15 Andrew Stubbs <ams@codesourcery.com>
7884 * config/gcn/gcn-hsa.h (S_FIJI): Set unconditionally.
7887 * config/gcn/gcn.c: Hard code SRAM ECC settings for old architectures.
7888 * config/gcn/mkoffload.c (ELFABIVERSION_AMDGPU_HSA): Rename to ...
7889 (ELFABIVERSION_AMDGPU_HSA_V3): ... this.
7890 (ELFABIVERSION_AMDGPU_HSA_V4): New.
7891 (SET_SRAM_ECC_UNSUPPORTED): New.
7892 (copy_early_debug_info): Create elf flags to match the other objects.
7893 (main): Just let the attribute flags pass through.
7895 2021-10-15 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
7897 * tree-loop-distribution.c (reduction_var_overflows_first):
7898 Pass the type of reduction_var as first argument as it is also
7899 done for the load type.
7900 (loop_distribution::transform_reduction_loop): Add missing
7901 TREE_TYPE while determining precission of reduction_var.
7903 2021-10-15 Richard Biener <rguenther@suse.de>
7905 * defaults.h (PREFERRED_DEBUGGING_TYPE): Choose DWARF2_DEBUG
7907 * toplev.c (process_options): Warn when STABS debugging is
7908 enabled but not the preferred format.
7909 * config/pa/som.h (PREFERRED_DEBUGGING_TYPE): Define to
7911 * config/pdp11/pdp11.h (PREFERRED_DEBUGGING_TYPE): Likewise.
7913 2021-10-15 Richard Biener <rguenther@suse.de>
7916 * tree-inline.c (copy_bb): Avoid underflowing nargs.
7918 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
7920 * config/i386/i386-expand.c (ix86_expand_vec_perm): Convert
7921 HFmode input operand to HImode.
7922 (ix86_vectorize_vec_perm_const): Likewise.
7923 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1_hf):
7925 (*avx512f_permvar_truncv8siv8hi_1_hf):
7928 2021-10-15 Richard Biener <rguenther@suse.de>
7930 PR middle-end/102682
7931 * expmed.c (store_bit_field_1): Ensure a LHS subreg would
7932 not create a paradoxical subreg.
7934 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
7936 * config/i386/i386-expand.c (ix86_expand_vector_init):
7937 For half_vector concat for HFmode, handle them like HImode.
7939 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7941 * tree-ssa-threadbackward.c (class back_threader): Add m_resolve.
7942 (back_threader::back_threader): Same.
7943 (back_threader::resolve_phi): Try to solve without looking back if
7945 (back_threader::find_paths_to_names): Same.
7946 (try_thread_blocks): Pass resolve argument to back threader.
7947 (pass_early_thread_jumps::execute): Same.
7949 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7951 * doc/invoke.texi: Remove max-fsm-thread-length,
7952 max-fsm-thread-paths, and fsm-maximum-phi-arguments.
7954 * tree-ssa-threadbackward.c (back_threader::back_threader): Remove
7956 (back_threader_registry::back_threader_registry): Same.
7957 (back_threader_profitability::profitable_path_p): Remove
7958 param_max_fsm_thread-length.
7959 (back_threader_registry::register_path): Remove
7960 m_max_allowable_paths.
7962 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7964 * tree-ssa-threadbackward.c (class back_threader): Make m_imports
7966 (back_threader::~back_threader): Do not release m_path.
7968 2021-10-14 Andrew MacLeod <amacleod@redhat.com>
7970 PR tree-optimization/102738
7971 * vr-values.c (simplify_using_ranges::simplify): Handle RSHIFT_EXPR.
7973 2021-10-14 Kwok Cheung Yeung <kcy@codesourcery.com>
7975 * omp-general.c (omp_check_context_selector): Move from c-omp.c.
7976 (omp_mark_declare_variant): Move from c-omp.c.
7977 (omp_context_name_list_prop): Update for Fortran strings.
7978 * omp-general.h (omp_check_context_selector): New prototype.
7979 (omp_mark_declare_variant): New prototype.
7981 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7983 * config/rs6000/rs6000.c (rs6000_density_test): Move early
7984 exit test further up the function.
7986 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7988 * config/arm/arm.c (arm_add_stmt_cost): Delete.
7989 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
7991 2021-10-14 Martin Jambor <mjambor@suse.cz>
7993 * doc/invoke.texi (Optimize Options): Add entry for
7994 ipa-cp-recursive-freq-factor.
7996 2021-10-14 Tamar Christina <tamar.christina@arm.com>
7998 * match.pd: New rule.
8000 2021-10-14 Jan Hubicka <hubicka@ucw.cz>
8003 * ipa-modref-tree.h (modref_access_node::update2):
8004 Also check that parm_offset is unchanged.
8005 (modref_ref_node::insert_access): Fix updating of
8008 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8010 * tree-ssa-threadbackward.c (back_threader::resolve_phi): Add
8013 2021-10-14 Richard Biener <rguenther@suse.de>
8015 PR tree-optimization/102659
8016 * tree-if-conv.c (if_convertible_gimple_assign_stmt_p): Also
8017 rewrite pointer typed undefined overflow operations.
8018 (predicate_statements): Likewise. Make sure to emit invariant
8019 conversions in the preheader.
8020 * tree-vectorizer.c (vect_loop_vectorized_call): Look through
8021 non-empty preheaders.
8022 * tree-data-ref.c (dr_analyze_indices): Strip useless
8023 conversions to the MEM_REF base type.
8025 2021-10-14 Martin Liska <mliska@suse.cz>
8027 * common.opt: Stop using AUTODETECT_VALUE
8028 and use EnabledBy where possible.
8029 * opts.c: Enable OPT_fvar_tracking with optimize >= 1.
8030 * toplev.c (AUTODETECT_VALUE): Remove macro.
8031 (process_options): Simplify by using EnabledBy and
8032 OPT_fvar_tracking. Use OPTION_SET_P macro instead of
8035 2021-10-14 Jonathan Wright <jonathan.wright@arm.com>
8037 * config/aarch64/arm_neon.h (vld1_s8_x3): Use signed type for
8039 (vld1_s32_x3): Likewise.
8041 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8043 PR tree-optimization/102736
8044 PR tree-optimization/102736
8045 * gimple-range-path.cc (path_range_query::range_on_path_entry):
8046 Assert that the requested range is defined outside the path.
8047 (path_range_query::ssa_range_in_phi): Do not call
8048 range_on_path_entry for SSA names that are defined within the
8051 2021-10-14 Iain Sandoe <iain@sandoe.co.uk>
8053 * config/darwin-driver.c (darwin_find_version_from_kernel):
8054 Quote internal identifiers and avoid contractions in
8056 (darwin_default_min_version): Likewise.
8057 (darwin_driver_init): Likewise.
8059 2021-10-14 Martin Jambor <mjambor@suse.cz>
8061 * params.opt (ipa-cp-recursive-freq-factor): New.
8062 * ipa-cp.c (ipcp_value): Switch to inline initialization. New members
8063 scc_no, self_recursion_generated_level, same_scc and
8064 self_recursion_generated_p.
8065 (ipcp_lattice::add_value): Replaced parameter unlimited with
8066 same_lat_gen_level, usit it determine limit of values and store it to
8068 (ipcp_lattice<valtype>::print): Dump the new fileds.
8069 (allocate_and_init_ipcp_value): Take same_lat_gen_level as a new
8070 parameter and store it to the new value.
8071 (self_recursively_generated_p): Removed.
8072 (propagate_vals_across_arith_jfunc): Use self_recursion_generated_p
8073 instead of self_recursively_generated_p, store self generation level
8075 (value_topo_info<valtype>::add_val): Set scc_no.
8076 (value_topo_info<valtype>::propagate_effects): Multiply frequencies of
8077 recursively feeding values and self generated values by appropriate
8080 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
8082 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Remove
8083 redundant test for flag_vect_cost_model.
8085 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
8087 * bitmap.c (debug): New overloaded function for auto_bitmaps.
8088 * bitmap.h (debug): Same.
8090 2021-10-14 Hongyu Wang <hongyu.wang@intel.com>
8092 * config/i386/avx512fp16intrin.h (_mm512_mask_fcmadd_pch):
8093 Adjust builtin call.
8094 (_mm512_mask3_fcmadd_pch): Likewise.
8095 (_mm512_mask_fmadd_pch): Likewise
8096 (_mm512_mask3_fmadd_pch): Likewise
8097 (_mm512_mask_fcmadd_round_pch): Likewise
8098 (_mm512_mask3_fcmadd_round_pch): Likewise
8099 (_mm512_mask_fmadd_round_pch): Likewise
8100 (_mm512_mask3_fmadd_round_pch): Likewise
8101 (_mm_mask_fcmadd_sch): Likewise
8102 (_mm_mask3_fcmadd_sch): Likewise
8103 (_mm_mask_fmadd_sch): Likewise
8104 (_mm_mask3_fmadd_sch): Likewise
8105 (_mm_mask_fcmadd_round_sch): Likewise
8106 (_mm_mask3_fcmadd_round_sch): Likewise
8107 (_mm_mask_fmadd_round_sch): Likewise
8108 (_mm_mask3_fmadd_round_sch): Likewise
8109 (_mm_fcmadd_round_sch): Likewise
8110 * config/i386/avx512fp16vlintrin.h (_mm_mask_fmadd_pch):
8111 Adjust builtin call.
8112 (_mm_mask3_fmadd_pch): Likewise
8113 (_mm256_mask_fmadd_pch): Likewise
8114 (_mm256_mask3_fmadd_pch): Likewise
8115 (_mm_mask_fcmadd_pch): Likewise
8116 (_mm_mask3_fcmadd_pch): Likewise
8117 (_mm256_mask_fcmadd_pch): Likewise
8118 (_mm256_mask3_fcmadd_pch): Likewise
8119 * config/i386/i386-builtin.def: Add mask3 builtin for complex
8120 fma, and adjust mask_builtin to corresponding expander.
8121 * config/i386/i386-expand.c (ix86_expand_round_builtin):
8122 Skip eraseing embedded rounding for expanders that emits
8124 * config/i386/sse.md (complexmove): New mode_attr.
8125 (<avx512>_fmaddc_<mode>_mask1<round_expand_name>): New expander.
8126 (<avx512>_fcmaddc_<mode>_mask1<round_expand_name>): Likewise.
8127 (avx512fp16_fmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
8128 (avx512fp16_fcmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
8129 (avx512fp16_fcmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
8130 (avx512fp16_fmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
8131 * config/i386/subst.md (round_embedded_complex): New subst.
8133 2021-10-14 John David Anglin <danglin@gcc.gnu.org>
8135 * config/pa/pa.md (cbranchsf4): Disable if TARGET_SOFT_FLOAT.
8136 (cbranchdf4): Likewise.
8137 Add missing move patterns for TARGET_SOFT_FLOAT.
8139 2021-10-13 Roger Sayle <roger@nextmovesoftware.com>
8141 * config/i386/i386-expand.c (ix86_expand_vector_move): Use a
8142 pseudo intermediate when moving a SUBREG into a hard register,
8143 by checking ix86_hardreg_mov_ok.
8144 (ix86_expand_vector_extract): Store zero-extended SImode
8145 intermediate in a pseudo, then set target using a SUBREG_PROMOTED
8147 * config/i386/sse.md (mov<VMOVE>_internal): Prevent CSE creating
8148 complex (SUBREG) sets of (vector) hard registers before reload, by
8149 checking ix86_hardreg_mov_ok.
8151 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
8153 * ctfc.h (enum ctf_dtu_d_union_enum): Remove redundant comma.
8155 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
8157 * dwarf2ctf.c (gen_ctf_array_type): Fix typo in comment.
8159 2021-10-13 Martin Sebor <msebor@redhat.com>
8161 PR middle-end/102630
8162 * pointer-query.cc (compute_objsize_r): Handle named address spaces.
8164 2021-10-13 Iain Sandoe <iain@sandoe.co.uk>
8166 * collect2.c (is_lto_object_file): Release simple-object
8167 resources, close files.
8169 2021-10-13 Andre Vieira <andre.simoesdiasvieira@arm.com>
8171 * config/arm/arm.c (thumb2_legitimate_address_p): Use VALID_MVE_MODE
8172 when checking mve addressing modes.
8173 (mve_vector_mem_operand): Fix the way we handle pre, post and offset
8175 (arm_print_operand): Fix printing of POST_ and PRE_MODIFY.
8176 * config/arm/mve.md: Use mve_memory_operand predicate everywhere where
8177 there is a single Ux constraint.
8179 2021-10-13 John David Anglin <danglin@gcc.gnu.org>
8181 * config/pa/pa.md (muldi3): Add support for inlining 64-bit
8182 multiplication on 32-bit PA 1.1 and 2.0 targets.
8184 2021-10-13 Martin Liska <mliska@suse.cz>
8186 PR gcov-profile/90364
8187 * coverage.c (build_info): Emit checksum to the global variable.
8188 (build_info_type): Add new field for checksum.
8189 (coverage_obj_finish): Pass object_checksum.
8190 (coverage_init): Use 0 as checksum for .gcno files.
8191 * gcov-dump.c (dump_gcov_file): Dump also new checksum field.
8192 * gcov.c (read_graph_file): Read also checksum.
8193 * doc/invoke.texi: Document the behaviour change.
8195 2021-10-13 Richard Biener <rguenther@suse.de>
8197 * gimple-iterator.h (gsi_iterator_update): Add GSI_LAST_NEW_STMT,
8198 start at integer value 2.
8199 * gimple-iterator.c (gsi_insert_seq_nodes_before): Update
8200 the iterator for GSI_LAST_NEW_STMT.
8201 (gsi_insert_seq_nodes_after): Likewise.
8202 * tree-if-conv.c (predicate_statements): Use GSI_LAST_NEW_STMT.
8203 * tree-ssa.c (execute_update_addresses_taken): Correct bogus
8204 arguments to gsi_replace.
8206 2021-10-13 Martin Liska <mliska@suse.cz>
8209 * common.opt: Use EnabledBy instead of detection in
8210 finish_options and process_options.
8211 * opts.c (finish_options): Remove handling of
8212 x_flag_unroll_all_loops.
8213 * toplev.c (process_options): Likewise for flag_web and
8214 flag_rename_registers.
8216 2021-10-13 Richard Biener <rguenther@suse.de>
8218 PR tree-optimization/102659
8219 * tree-if-conv.c (need_to_rewrite_undefined): New flag.
8220 (if_convertible_gimple_assign_stmt_p): Mark the loop for
8221 rewrite when stmts with undefined behavior on integer
8223 (combine_blocks): Predicate also when we need to rewrite stmts.
8224 (predicate_statements): Rewrite affected stmts to something
8225 with well-defined behavior on overflow.
8226 (tree_if_conversion): Initialize need_to_rewrite_undefined.
8228 2021-10-13 Richard Biener <rguenther@suse.de>
8231 * ipa-sra.c (ptr_parm_has_nonarg_uses): Fix volatileness
8234 2021-10-13 Jose E. Marchesi <jose.marchesi@oracle.com>
8236 * dwarf2ctf.c: Fix typo in comment.
8238 2021-10-12 Bill Schmidt <wschmidt@linux.ibm.com>
8241 * config/rs6000/altivec.h (vec_cpsgn): Swap operand order.
8242 * config/rs6000/rs6000-overload.def (VEC_COPYSIGN): Use SKIP to
8243 avoid generating an automatic #define of vec_cpsgn. Use the
8244 correct built-in for V4SFmode that doesn't depend on VSX.
8246 2021-10-12 Uroš Bizjak <ubizjak@gmail.com>
8250 * config/i386/i386.md (*add<mode>_1_slp): Rewrite as
8251 define_insn_and_split pattern. Add alternative 1 and split it
8252 post reload to insert operand 1 into the low part of operand 0.
8253 (*sub<mode>_1_slp): Ditto.
8254 (*and<mode>_1_slp): Ditto.
8255 (*<any_or:code><mode>_1_slp): Ditto.
8256 (*ashl<mode>3_1_slp): Ditto.
8257 (*<any_shiftrt:insn><mode>3_1_slp): Ditto.
8258 (*<any_rotate:insn><mode>3_1_slp): Ditto.
8259 (*neg<mode>_1_slp): New insn_and_split pattern.
8260 (*one_cmpl<mode>_1_slp): Ditto.
8262 2021-10-12 David Edelsohn <dje.gcc@gmail.com>
8264 * doc/install.texi: Update MinGW and mingw-64 Binaries
8267 2021-10-12 Daniel Le Duc Khoi Nguyen <greenrecyclebin@gmail.com>
8269 * doc/extend.texi (Common Variable Attributes): Fix typos in
8270 alloc_size documentation.
8272 2021-10-12 Richard Biener <rguenther@suse.de>
8274 PR tree-optimization/102696
8275 * tree-vect-slp.c (vect_build_slp_tree_2): Properly mark
8276 the tree fatally failed when we reject a BIT_FIELD_REF.
8278 2021-10-12 Richard Biener <rguenther@suse.de>
8280 PR tree-optimization/102572
8281 * tree-vect-stmts.c (vect_build_gather_load_calls): When
8282 gathering the vectorized defs for the mask pass in the
8283 desired mask vector type so invariants will be handled
8286 2021-10-12 Tamar Christina <tamar.christina@arm.com>
8288 * config/aarch64/aarch64-sve.md (*fcm<cmp_op><mode>_bic_combine,
8289 *fcm<cmp_op><mode>_nor_combine, *fcmuo<mode>_bic_combine,
8290 *fcmuo<mode>_nor_combine): New.
8292 2021-10-12 Eric Botcazou <ebotcazou@adacore.com>
8295 * config/sparc/sparc-modes.def (OI): New integer mode.
8297 2021-10-12 Jakub Jelinek <jakub@redhat.com>
8299 * gimple-fold.h (clear_padding_type_may_have_padding_p): Declare.
8300 * gimple-fold.c (clear_padding_type_may_have_padding_p): No longer
8303 2021-10-12 Jakub Jelinek <jakub@redhat.com>
8305 * tree-vectorizer.h (loop_cost_model): New function.
8306 (unlimited_cost_model): Use it.
8307 * tree-vect-loop.c (vect_analyze_loop_costing): Use loop_cost_model
8308 call instead of flag_vect_cost_model.
8309 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): Likewise.
8310 (vect_prune_runtime_alias_test_list): Likewise. Also use it instead
8311 of flag_simd_cost_model.
8313 2021-10-12 liuhongt <hongtao.liu@intel.com>
8316 * config/i386/i386-expand.c (emit_reduc_half): Handle
8318 * config/i386/mmx.md (reduc_<code>_scal_v4qi): New expander.
8319 (reduc_plus_scal_v4qi): Ditto.
8321 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8323 * config/rs6000/smmintrin.h (_mm_cmpeq_epi64, _mm_cmpgt_epi64,
8324 _mm_mullo_epi32, _mm_mul_epi32, _mm_packus_epi32): New.
8325 * config/rs6000/nmmintrin.h: Copy from i386, tweak to suit.
8327 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8329 * config/rs6000/smmintrin.h (_mm_cvtepi8_epi16, _mm_cvtepi8_epi32,
8330 _mm_cvtepi8_epi64, _mm_cvtepi16_epi32, _mm_cvtepi16_epi64,
8331 _mm_cvtepi32_epi64, _mm_cvtepu8_epi16, _mm_cvtepu8_epi32,
8332 _mm_cvtepu8_epi64, _mm_cvtepu16_epi32, _mm_cvtepu16_epi64,
8333 _mm_cvtepu32_epi64): New.
8335 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8337 * config/rs6000/smmintrin.h (_mm_test_all_zeros,
8338 _mm_test_all_ones, _mm_test_mix_ones_zeros): Rewrite as macro.
8340 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8342 * config/rs6000/smmintrin.h (_mm_min_epi8, _mm_min_epu16,
8343 _mm_min_epi32, _mm_min_epu32, _mm_max_epi8, _mm_max_epu16,
8344 _mm_max_epi32, _mm_max_epu32): New.
8346 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
8348 * ipa-modref-tree.h (struct modref_access_node): Revert
8350 (struct modref_ref_node): Likewise.
8352 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
8354 * ipa-modref-tree.h (modref_tree::global_access_p): New member
8357 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
8358 ignore_stores_eaf_flags): Move to ipa-modref.h
8359 (remove_useless_eaf_flags): Remove early exit on NOCLOBBER.
8360 (modref_summary::global_memory_read_p): New member function.
8361 (modref_summary::global_memory_written_p): New member function.
8362 * ipa-modref.h (modref_summary::global_memory_read_p,
8363 modref_summary::global_memory_written_p): Declare.
8364 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
8365 ignore_stores_eaf_flags): move here.
8366 * tree-ssa-structalias.c: Include ipa-modref-tree.h, ipa-modref.h
8368 (handle_rhs_call): Rewrite.
8369 (handle_call_arg): New function.
8370 (determine_global_memory_access): New function.
8371 (handle_const_call): Remove
8372 (handle_pure_call): Remove
8373 (find_func_aliases_for_call): Update use of handle_rhs_call.
8374 (compute_points_to_sets): Handle global memory acccesses
8377 2021-10-11 Diane Meirowitz <diane.meirowitz@oracle.com>
8379 * doc/invoke.texi: Add link to UndefinedBehaviorSanitizer
8380 documentation, mention UBSAN_OPTIONS, similar to what is done
8381 for AddressSanitizer.
8383 2021-10-11 Richard Biener <rguenther@suse.de>
8385 PR middle-end/102683
8386 * internal-fn.c (expand_DEFERRED_INIT): Check for mode
8387 availability before building an integer type for storage
8390 2021-10-11 Richard Biener <rguenther@suse.de>
8392 PR middle-end/101480
8393 * gimple.c (gimple_call_fnspec): Do not mark operator new/delete
8396 2021-10-11 Martin Liska <mliska@suse.cz>
8398 * common.opt: Remove Init(2) for some options.
8399 * toplev.c (process_options): Do not use AUTODETECT_VALUE, but
8400 use rather OPTION_SET_P.
8402 2021-10-11 Martin Liska <mliska@suse.cz>
8404 * common.opt: Remove usage of IRA_REGION_AUTODETECT.
8405 * flag-types.h (enum ira_region): Likewise.
8406 * toplev.c (process_options): Use OPTION_SET_P instead of
8407 IRA_REGION_AUTODETECT.
8409 2021-10-11 Jakub Jelinek <jakub@redhat.com>
8411 * omp-low.c (omp_runtime_api_call): Handle omp_get_max_teams,
8412 omp_[sg]et_teams_thread_limit and omp_set_num_teams.
8414 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8416 * config/s390/s390-protos.h (s390_rawmemchr): Add prototype.
8417 * config/s390/s390.c (s390_rawmemchr): New function.
8418 * config/s390/s390.md (rawmemchr<SINT:mode>): New expander.
8419 * config/s390/vector.md (@vec_vfees<mode>): Basically a copy of
8420 the pattern vfees<mode> from vx-builtins.md.
8421 * config/s390/vx-builtins.md (*vfees<mode>): Remove.
8423 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8425 * builtins.c (get_memory_rtx): Change to external linkage.
8426 * builtins.h (get_memory_rtx): Add function prototype.
8427 * doc/md.texi (rawmemchr<mode>): Document.
8428 * internal-fn.c (expand_RAWMEMCHR): Define.
8429 * internal-fn.def (RAWMEMCHR): Add.
8430 * optabs.def (rawmemchr_optab): Add.
8431 * tree-loop-distribution.c (find_single_drs): Change return code
8432 behaviour by also returning true if no single store was found
8434 (loop_distribution::classify_partition): Respect the new return
8435 code behaviour of function find_single_drs.
8436 (loop_distribution::execute): Call new function
8437 transform_reduction_loop in order to replace rawmemchr or strlen
8438 like loops by calls into builtins.
8439 (generate_reduction_builtin_1): New function.
8440 (generate_rawmemchr_builtin): New function.
8441 (generate_strlen_builtin_1): New function.
8442 (generate_strlen_builtin): New function.
8443 (generate_strlen_builtin_using_rawmemchr): New function.
8444 (reduction_var_overflows_first): New function.
8445 (determine_reduction_stmt_1): New function.
8446 (determine_reduction_stmt): New function.
8447 (loop_distribution::transform_reduction_loop): New function.
8449 2021-10-11 Martin Liska <mliska@suse.cz>
8451 * tree.c (cl_option_hasher::hash): Use cl_optimization_hash
8452 and remove legacy hashing code.
8454 2021-10-11 Kito Cheng <kito.cheng@sifive.com>
8457 * builtins.c (maybe_emit_call_builtin___clear_cache): Allow
8458 CONST_INT for BEGIN and END, and use gcc_assert rather than
8461 2021-10-10 Jakub Jelinek <jakub@redhat.com>
8464 * var-tracking.c (add_stores): For cselib_sp_derived_value_p values
8465 use MO_VAL_SET if loc is not sp.
8467 2021-10-10 Andrew Pinski <apinski@marvell.com>
8469 PR tree-optimization/102622
8470 * match.pd: Swap the order of a?pow2cst:0 and a?-1:0 transformations.
8471 Swap the order of a?0:pow2cst and a?0:-1 transformations.
8473 2021-10-09 liuhongt <hongtao.liu@intel.com>
8476 * config/i386/i386-expand.c (ix86_valid_mask_cmp_mode): Handle
8478 (ix86_use_mask_cmp_p): Ditto.
8479 (ix86_expand_sse_movcc): Ditto.
8480 * config/i386/i386.md (setcc_hf_mask): New define_insn.
8481 (movhf_mask): Ditto.
8482 (UNSPEC_MOVCC_MASK): New unspec.
8483 * config/i386/sse.md (UNSPEC_PCMP): Move to i386.md.
8485 2021-10-08 Vladimir N. Makarov <vmakarov@redhat.com>
8487 PR rtl-optimization/102627
8488 * lra-constraints.c (split_reg): Use at least natural mode of hard reg.
8490 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
8492 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Grow
8495 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
8497 * value-range.cc (irange::debug): New.
8498 * value-range.h (irange::debug): New.
8500 2021-10-08 Richard Sandiford <richard.sandiford@arm.com>
8502 PR tree-optimization/102385
8503 * predict.h (change_edge_frequency): Declare.
8504 * predict.c (change_edge_frequency): New function.
8505 * tree-ssa-loop-manip.h (tree_transform_and_unroll_loop): Remove
8507 (tree_unroll_loop): Likewise.
8508 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Update accordingly.
8509 * tree-predcom.c (pcom_worker::tree_predictive_commoning_loop):
8511 * tree-ssa-loop-prefetch.c (loop_prefetch_arrays): Likewise.
8512 * tree-ssa-loop-manip.c (tree_unroll_loop): Likewise.
8513 (tree_transform_and_unroll_loop): Likewise. Use single_dom_exit
8514 to retrieve the exit edges. Make all the old profile update code
8515 conditional on !single_loop_p -- the case it was written for --
8516 and use a different approach for the single-loop case.
8518 2021-10-08 Martin Liska <mliska@suse.cz>
8520 * config/alpha/alpha.c (alpha_option_override): Use new macro
8522 * config/arc/arc.c (arc_override_options): Likewise.
8523 * config/arm/arm.c (arm_option_override): Likewise.
8524 * config/bfin/bfin.c (bfin_load_pic_reg): Likewise.
8525 * config/c6x/c6x.c (c6x_option_override): Likewise.
8526 * config/csky/csky.c: Likewise.
8527 * config/darwin.c (darwin_override_options): Likewise.
8528 * config/frv/frv.c (frv_option_override): Likewise.
8529 * config/i386/djgpp.h: Likewise.
8530 * config/i386/i386.c (ix86_stack_protect_guard): Likewise.
8531 (ix86_max_noce_ifcvt_seq_cost): Likewise.
8532 * config/ia64/ia64.c (ia64_option_override): Likewise.
8533 (ia64_override_options_after_change): Likewise.
8534 * config/m32c/m32c.c (m32c_option_override): Likewise.
8535 * config/m32r/m32r.c (m32r_init): Likewise.
8536 * config/m68k/m68k.c (m68k_option_override): Likewise.
8537 * config/microblaze/microblaze.c (microblaze_option_override): Likewise.
8538 * config/mips/mips.c (mips_option_override): Likewise.
8539 * config/nios2/nios2.c (nios2_option_override): Likewise.
8540 * config/nvptx/nvptx.c (nvptx_option_override): Likewise.
8541 * config/pa/pa.c (pa_option_override): Likewise.
8542 * config/riscv/riscv.c (riscv_option_override): Likewise.
8543 * config/rs6000/aix71.h: Likewise.
8544 * config/rs6000/aix72.h: Likewise.
8545 * config/rs6000/aix73.h: Likewise.
8546 * config/rs6000/rs6000.c (darwin_rs6000_override_options): Likewise.
8547 (rs6000_override_options_after_change): Likewise.
8548 (rs6000_linux64_override_options): Likewise.
8549 (glibc_supports_ieee_128bit): Likewise.
8550 (rs6000_option_override_internal): Likewise.
8551 (rs6000_file_start): Likewise.
8552 (rs6000_darwin_file_start): Likewise.
8553 * config/rs6000/rtems.h: Likewise.
8554 * config/rs6000/sysv4.h: Likewise.
8555 * config/rs6000/vxworks.h (SUB3TARGET_OVERRIDE_OPTIONS): Likewise.
8556 * config/s390/s390.c (s390_option_override): Likewise.
8557 * config/sh/linux.h: Likewise.
8558 * config/sh/netbsd-elf.h (while): Likewise.
8559 * config/sh/sh.c (sh_option_override): Likewise.
8560 * config/sol2.c (solaris_override_options): Likewise.
8561 * config/sparc/sparc.c (sparc_option_override): Likewise.
8562 * config/tilegx/tilegx.c (tilegx_option_override): Likewise.
8563 * config/visium/visium.c (visium_option_override): Likewise.
8564 * config/vxworks.c (vxworks_override_options): Likewise.
8565 * lto-opts.c (lto_write_options): Likewise.
8566 * omp-expand.c (expand_omp_simd): Likewise.
8567 * omp-general.c (omp_max_vf): Likewise.
8568 * omp-offload.c (oacc_xform_loop): Likewise.
8569 * opts.h (OPTION_SET_P): Likewise.
8570 * targhooks.c (default_max_noce_ifcvt_seq_cost): Likewise.
8571 * toplev.c (process_options): Likewise.
8572 * tree-predcom.c: Likewise.
8573 * tree-sra.c (analyze_all_variable_accesses): Likewise.
8575 2021-10-08 liuhongt <hongtao.liu@intel.com>
8578 * config/i386/i386.c (ix86_optab_supported_p):
8579 Return true for HFmode.
8580 * match.pd: Simplify (_Float16) ceil ((double) x) to
8581 __builtin_ceilf16 (a) when a is _Float16 type and
8582 direct_internal_fn_supported_p.
8584 2021-10-08 liuhongt <hongtao.liu@intel.com>
8587 * config/i386/i386-expand.c (emit_reduc_half): Hanlde V4HImode.
8588 * config/i386/mmx.md (reduc_plus_scal_v4hi): New.
8589 (reduc_<code>_scal_v4hi): New.
8591 2021-10-08 liuhongt <hongtao.liu@intel.com>
8593 * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize).
8594 * doc/invoke.texi (Options That Control Optimization): Update
8596 * opts.c (default_options_table): Enable auto-vectorization at
8597 O2 with very-cheap cost model.
8598 (finish_options): Use cheap cost model for
8599 explicit -ftree{,-loop}-vectorize.
8601 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
8603 * ctfc.c (ctfc_delete_container): Free hash table contents.
8605 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
8607 * toplev.c (process_options): Do not warn for GNU GIMPLE.
8609 2021-10-07 Siddhesh Poyarekar <siddhesh@gotplt.org>
8611 * tree-object-size.c (addr_object_size,
8612 compute_builtin_object_size): Drop PDECL and POFF arguments.
8613 (addr_object_size): Adjust calls.
8614 * tree-object-size.h (compute_builtin_object_size): Drop PDECL
8617 2021-10-07 Roger Sayle <roger@nextmovesoftware.com>
8619 * rtl.def (SMUL_HIGHPART, UMUL_HIGHPART): New RTX codes for
8620 representing signed and unsigned high-part multiplication resp.
8621 * simplify-rtx.c (simplify_binary_operation_1) [SMUL_HIGHPART,
8622 UMUL_HIGHPART]: Simplify high-part multiplications by zero.
8623 [SS_PLUS, US_PLUS, SS_MINUS, US_MINUS, SS_MULT, US_MULT,
8624 SS_DIV, US_DIV]: Similar simplifications for saturating
8626 (simplify_const_binary_operation) [SS_PLUS, US_PLUS, SS_MINUS,
8627 US_MINUS, SS_MULT, US_MULT, SMUL_HIGHPART, UMUL_HIGHPART]:
8628 Implement compile-time evaluation for constant operands.
8629 * dwarf2out.c (mem_loc_descriptor): Skip SMUL_HIGHPART and
8631 * doc/rtl.texi (smul_highpart, umul_highpart): Document RTX codes.
8632 * doc/md.texi (smul@var{m}3_highpart, umul@var{m3}_highpart):
8633 Mention the new smul_highpart and umul_highpart RTX codes.
8634 * doc/invoke.texi: Silence @xref "compilation" warnings.
8636 2021-10-07 Martin Jambor <mjambor@suse.cz>
8639 * ipa-prop.c (ipa_edge_args_sum_t::duplicate): Also handle the
8640 case when the source reference description corresponds to a
8641 referance taken in a function src->caller is inlined to.
8643 2021-10-07 Jan Hubicka <hubicka@ucw.cz>
8646 * ipa-modref-tree.h (modref_access_node::contains_p): Handle offsets
8648 (modref_access_node::try_merge_with): Add sanity check that there
8649 are no redundant entries in the list.
8651 2021-10-07 Richard Biener <rguenther@suse.de>
8653 PR tree-optimization/102608
8654 * tree-ssa-sccvn.c (visit_stmt): Drop .DEFERRED_INIT to
8657 2021-10-07 Martin Liska <mliska@suse.cz>
8659 * toplev.c (toplev::main): Make
8660 save_opt_decoded_options a pointer type
8661 * toplev.h: Likewise.
8663 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8665 * config/gcn/gcn-valu.md (gather<mode>_insn_2offsets<exec>): Apply
8666 HAVE_GCN_ASM_GLOBAL_LOAD_FIXED.
8667 (scatter<mode>_insn_2offsets<exec_scatter>): Likewise.
8669 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8671 * config/gcn/gcn-hsa.h (SRAMOPT): Include the whole option string.
8672 Adjust for new -msram-ecc=any behaviour.
8673 (ASM_SPEC): Adjust -mxnack and -msram-ecc usage.
8674 * config/gcn/gcn.c (output_file_start): Implement -msram-ecc=any.
8675 * config/gcn/mkoffload.c (EF_AMDGPU_XNACK): Rename to ...
8676 (EF_AMDGPU_XNACK_V3): ... this.
8677 (EF_AMDGPU_SRAM_ECC): Rename to ...
8678 (EF_AMDGPU_SRAM_ECC_V3): ... this.
8679 (EF_AMDGPU_FEATURE_XNACK_V4): New.
8680 (EF_AMDGPU_FEATURE_XNACK_UNSUPPORTED_V4): New.
8681 (EF_AMDGPU_FEATURE_XNACK_ANY_V4): New.
8682 (EF_AMDGPU_FEATURE_XNACK_OFF_V4): New.
8683 (EF_AMDGPU_FEATURE_XNACK_ON_V4): New.
8684 (EF_AMDGPU_FEATURE_SRAMECC_V4): New.
8685 (EF_AMDGPU_FEATURE_SRAMECC_UNSUPPORTED_V4): New.
8686 (EF_AMDGPU_FEATURE_SRAMECC_ANY_V4): New.
8687 (EF_AMDGPU_FEATURE_SRAMECC_OFF_V4): New.
8688 (EF_AMDGPU_FEATURE_SRAMECC_ON_V4): New.
8689 (SET_XNACK_ON): New.
8690 (SET_XNACK_OFF): New.
8692 (SET_SRAM_ECC_ON): New.
8693 (SET_SRAM_ECC_ANY): New.
8694 (SET_SRAM_ECC_OFF): New.
8695 (TEST_SRAM_ECC_ANY): New.
8696 (TEST_SRAM_ECC_ON): New.
8697 (main): Implement HSACOv4 and -msram-ecc=any.
8699 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8701 * config.in: Regenerate.
8702 * config/gcn/gcn-hsa.h (X_FIJI): New macro.
8706 (A_FIJI): Rename to ...
8708 (A_900): Rename to ...
8710 (A_906): Rename to ...
8712 (A_908): Rename to ...
8714 (SRAMOPT): New macro.
8715 (ASM_SPEC): Adjust xnack option usage.
8716 * config/gcn/gcn.c (output_file_start): Adjust amdgcn_target usage.
8717 * configure: Regenerate.
8718 * configure.ac: Detect LLVM assembler dialect.
8720 2021-10-07 Richard Biener <rguenther@suse.de>
8722 * tree-pretty-print.c (dump_generic_node): Do not elide
8723 printing '&' when dumping with -gimple.
8725 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8727 * gimple-range-cache.cc (non_null_ref::adjust_range): Call new
8729 * gimple-range-fold.cc (adjust_pointer_diff_expr): Ditto.
8730 (adjust_imagpart_expr): Ditto.
8731 * value-range.cc (irange::irange_intersect): Call new routine if
8732 RHS is a single pair.
8733 (irange::intersect): New wide_int version.
8734 * value-range.h (class irange): New prototype.
8736 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8738 * gimple-range-edge.cc (gimple_outgoing_range::gimple_outgoing_range):
8739 Add parameter to limit size when recognizing switches.
8740 (gimple_outgoing_range::edge_range_p): Check size limit.
8741 * gimple-range-edge.h (gimple_outgoing_range): Add size field.
8742 * gimple-range-gori.cc (gori_map::calculate_gori): Ignore switches
8743 that exceed the size limit.
8744 (gori_compute::gori_compute): Add initializer.
8745 * params.opt (evrp-switch-limit): New.
8746 * doc/invoke.texi: Update docs.
8748 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8750 * value-range.h (irange::set_varying): Use TYPE_MIN_VALUE and
8751 TYPE_MAX_VALUE instead of creating new trees when possible.
8753 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8755 * gimple-range-cache.cc (non_null_ref::adjust_range): Check for
8756 zero and non-zero more efficently.
8758 2021-10-06 Richard Biener <rguenther@suse.de>
8761 * dumpfile.h (TDF_GIMPLE_VAL): New.
8762 (dump_flag): Re-order and adjust TDF_* flags. Make
8763 the enum uint32_t. Use std::underlying_type in the
8765 (optgroup_flag): Likewise for the operator overloads.
8766 * tree-pretty-print.c (dump_generic_node): Wrap ADDR_EXPR
8767 in _Literal if TDF_GIMPLE_VAL.
8768 * gimple-pretty-print.c (dump_gimple_assign): Add
8769 TDF_GIMPLE_VAL to flags when dumping operands where only
8770 is_gimple_val are allowed.
8771 (dump_gimple_cond): Likewise.
8773 2021-10-06 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
8775 * gimple-isel.cc (gimple_expand_vec_cond_expr): Remove redundant if
8778 2021-10-05 qing zhao <qing.zhao@oracle.com>
8780 PR middle-end/102359
8781 * gimplify.c (gimplify_decl_expr): Not add initialization for an
8782 auto variable when it has been initialized by frontend.
8784 2021-10-05 Aldy Hernandez <aldyh@redhat.com>
8786 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
8789 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
8791 * common/config/avr/avr-common.c (avr_handle_option): Mark
8792 argument as ATTRIBUTE_UNUSED.
8794 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
8796 * config/lm32/uclinux-elf.h (LINK_GCC_C_SEQUENCE_SPEC):
8797 Undefine before redefinition.
8799 2021-10-05 Richard Biener <rguenther@suse.de>
8801 * toplev.c (no_backend): Remove global var.
8802 (process_options): Pass in no_backend, move post_options
8803 langhook call to toplev::main.
8804 (do_compile): Pass in no_backend, move process_options call
8806 (toplev::run_self_tests): Check no_backend at the caller.
8807 (toplev::main): Call post_options and process_options
8808 split out from do_compile, do self-tests only if
8809 no_backend is initialized.
8811 2021-10-05 Richard Biener <rguenther@suse.de>
8813 * tree-cfg.c (dump_function_to_file): Dump the UID of the
8814 function as part of the name when requested.
8815 * tree-pretty-print.c (dump_function_name): Dump the UID when
8816 requested and the langhook produced the actual name.
8818 2021-10-05 Richard Biener <rguenther@suse.de>
8820 PR middle-end/102587
8821 PR middle-end/102285
8822 * internal-fn.c (expand_DEFERRED_INIT): Fall back to
8823 zero-initialization as last resort, use the constant
8824 size as given by the DEFERRED_INIT argument to build
8827 2021-10-04 Marek Polacek <polacek@redhat.com>
8830 * doc/invoke.texi: Document -Warray-compare.
8832 2021-10-04 Richard Biener <rguenther@suse.de>
8834 * gimplify.c (is_var_need_auto_init): DECL_HARD_REGISTER
8835 variables are not to be initialized.
8837 2021-10-04 Richard Biener <rguenther@suse.de>
8839 * expr.h (non_mem_decl_p): Declare.
8840 (mem_ref_refers_to_non_mem_p): Likewise.
8841 * expr.c (non_mem_decl_p): Export.
8842 (mem_ref_refers_to_non_mem_p): Likewise.
8843 * internal-fn.c (expand_DEFERRED_INIT): Do not expand the LHS
8844 but check the base with mem_ref_refers_to_non_mem_p
8847 2021-10-04 Richard Biener <rguenther@suse.de>
8849 PR tree-optimization/102570
8850 * tree-ssa-sccvn.h (vn_reference_op_struct): Document
8851 we are using clique for the internal function code.
8852 * tree-ssa-sccvn.c (vn_reference_op_eq): Compare the
8853 internal function code.
8854 (print_vn_reference_ops): Print the internal function code.
8855 (vn_reference_op_compute_hash): Hash it.
8856 (copy_reference_ops_from_call): Record it.
8857 (visit_stmt): Remove the restriction around internal function
8859 (fully_constant_vn_reference_p): Use fold_const_call and handle
8861 (vn_reference_eq): Compare call return types.
8862 * tree-ssa-pre.c (create_expression_by_pieces): Handle
8863 generating calls to internal functions.
8864 (compute_avail): Remove the restriction around internal function
8867 2021-10-04 Aldy Hernandez <aldyh@redhat.com>
8869 PR tree-optimization/102560
8870 * gimple-ssa-warn-alloca.c (alloca_call_type): Remove static
8871 marker for invalid_range.
8873 2021-10-04 Richard Biener <rguenther@suse.de>
8875 PR middle-end/102587
8876 * internal-fn.c (expand_DEFERRED_INIT): Guard register
8877 initialization path an avoid initializing VLA registers
8880 2021-10-04 Eric Botcazou <ebotcazou@adacore.com>
8882 * config/rs6000/vxworks.h (TARGET_INIT_LIBFUNCS): Delete.
8884 2021-10-03 Martin Liska <mliska@suse.cz>
8886 * toplev.c (toplev::main): Check opt_index if it is a part
8889 2021-10-02 Aldy Hernandez <aldyh@redhat.com>
8891 PR tree-optimization/102563
8892 * range-op.cc (operator_lshift::op1_range): Do not clobber
8895 2021-10-02 Martin Liska <mliska@suse.cz>
8897 * toplev.c (toplev::main): save_decoded_options[0] is program
8898 name and so it should be skipped.
8900 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
8902 PR tree-optimization/102546
8903 * range-op.cc (operator_lshift::op1_range): Teach range-ops that
8904 X << Y is non-zero implies X is also non-zero.
8906 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8908 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8910 * config/aarch64/aarch64-tune.md: Regenerate.
8911 * doc/invoke.texi: Update docs.
8913 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8915 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8917 * config/aarch64/aarch64-tune.md: Regenerate.
8918 * doc/invoke.texi: Update docs.
8920 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8922 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8924 * config/aarch64/aarch64-tune.md: Regenerate.
8925 * doc/invoke.texi: Update docs.
8927 2021-10-01 Martin Sebor <msebor@redhat.com>
8930 * doc/invoke.texi (-Waddress): Update.
8931 * gengtype.c (write_types): Avoid -Waddress.
8932 * poly-int.h (POLY_SET_COEFF): Avoid using null.
8934 2021-10-01 John David Anglin <danglin@gcc.gnu.org>
8937 * config/pa/pa.c (pa_option_override): Default to dwarf version 4
8940 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8942 * config/aarch64/aarch64.h (AARCH64_FL_V9): Update value.
8944 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
8946 * gimple-range-path.cc (path_range_query::compute_ranges): Use
8948 * gimple-range-path.h (class path_range_query): Remove shadowed
8950 (path_range_query::get_path_oracle): New.
8952 2021-10-01 Jakub Jelinek <jakub@redhat.com>
8953 Richard Biener <rguenther@suse.de>
8956 * doc/invoke.texi (-fsanitize=integer-divide-by-zero): Remove
8957 INT_MIN / -1 division detection from here ...
8958 (-fsanitize=signed-integer-overflow): ... and add it here.
8960 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8962 * config/aarch64/aarch64-arches.def (AARCH64_ARCH): Added
8964 * config/aarch64/aarch64.h (AARCH64_FL_V9): New.
8965 (AARCH64_FL_FOR_ARCH9): New flags for Armv9-A.
8966 (AARCH64_ISA_V9): New ISA flag.
8967 * doc/invoke.texi: Update docs.
8969 2021-10-01 Martin Liska <mliska@suse.cz>
8971 * toplev.c (toplev::main): Save decoded optimization options.
8972 * toplev.h (save_opt_decoded_options): New.
8973 * doc/extend.texi: Be more clear about optimize and target
8976 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
8978 * explow.c: Include langhooks.h.
8979 (set_stack_check_libfunc): Build a proper function type.
8981 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
8984 * config/i386/i386.c (legitimate_pic_address_disp_p): For PE-COFF do
8985 not return true for external weak function symbols in medium model.
8987 2021-10-01 Jakub Jelinek <jakub@redhat.com>
8989 * tree.h (OMP_CLAUSE_ORDER_REPRODUCIBLE): Define.
8990 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_ORDER>: Print
8991 reproducible: for OMP_CLAUSE_ORDER_REPRODUCIBLE.
8992 * omp-general.c (omp_extract_for_data): If OMP_CLAUSE_ORDER is seen
8993 without OMP_CLAUSE_ORDER_UNCONSTRAINED, overwrite sched_kind to
8994 OMP_CLAUSE_SCHEDULE_STATIC.
8996 2021-10-01 Richard Biener <rguenther@suse.de>
8998 PR middle-end/102518
8999 * tree-inline.c (setup_one_parameter): Avoid substituting
9000 an invariant into contexts where a GIMPLE register is not valid.
9002 2021-09-30 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
9004 * config/arm/arm-cpus.in: Add Cortex-R52+ CPU.
9005 * config/arm/arm-tables.opt: Regenerate.
9006 * config/arm/arm-tune.md: Regenerate.
9007 * doc/invoke.texi: Update docs.
9009 2021-09-30 Uroš Bizjak <ubizjak@gmail.com>
9012 * config/i386/i386.md
9013 (sign_extend:WIDE (any_logic:NARROW (memory, immediate)) splitters):
9016 2021-09-30 Tobias Burnus <tobias@codesourcery.com>
9018 * omp-low.c (omp_runtime_api_call): Add omp_aligned_{,c}alloc and
9019 omp_{c,re}alloc, fix omp_alloc/omp_free.
9021 2021-09-30 Martin Liska <mliska@suse.cz>
9023 * defaults.h (ASM_OUTPUT_ASCII): Do not hide global variable
9024 asm_out_file and stream directly to MYFILE.
9026 2021-09-30 Richard Biener <rguenther@suse.de>
9028 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
9029 Restore and fix condition under which we apply npeel to
9030 the DRs misalignment value.
9032 2021-09-30 Richard Biener <rguenther@suse.de>
9034 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
9035 Fix npeel check for variable amount of peeling.
9037 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
9039 * lto-wrapper.c (run_gcc): Plug snprintf overflow.
9041 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
9043 * gimple-range.cc (gimple_ranger::debug): New.
9044 * gimple-range.h (class gimple_ranger): Add debug.
9046 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
9048 PR middle-end/102519
9049 * tree-vrp.c (hybrid_threader::~hybrid_threader): Free m_query.
9051 2021-09-29 Indu Bhagat <indu.bhagat@oracle.com>
9054 * btfout.c (GTY): Add GTY (()) albeit for cosmetic only purpose.
9055 (btf_finalize): Empty the hash_map btf_var_ids.
9057 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
9059 * tree-vrp.c (thread_through_all_blocks): Return bool.
9060 (execute_vrp_threader): Return TODO_* flags.
9061 (pass_data_vrp_threader): Set todo_flags_finish to 0.
9063 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
9065 * timevar.def (TV_TREE_VRP_THREADER): New.
9066 * tree-vrp.c: Use TV_TREE_VRP_THREADER for VRP threader pass.
9068 2021-09-29 David Faust <david.faust@oracle.com>
9070 * config.gcc (bpf-*-*): Do not overwrite extra_headers.
9072 2021-09-29 Jonathan Wright <jonathan.wright@arm.com>
9074 * config/aarch64/aarch64-builtins.c (TYPES_BINOP_PPU): Define
9075 new type qualifier enum.
9076 (TYPES_TERNOP_SSSU): Likewise.
9077 (TYPES_TERNOP_PPPU): Likewise.
9078 * config/aarch64/aarch64-simd-builtins.def: Define PPU, SSU,
9079 PPPU and SSSU builtin generator macros for qtbl1 and qtbx1
9081 * config/aarch64/arm_neon.h (vqtbl1_p8): Use type-qualified
9082 builtin and remove casts.
9083 (vqtbl1_s8): Likewise.
9084 (vqtbl1q_p8): Likewise.
9085 (vqtbl1q_s8): Likewise.
9086 (vqtbx1_s8): Likewise.
9087 (vqtbx1_p8): Likewise.
9088 (vqtbx1q_s8): Likewise.
9089 (vqtbx1q_p8): Likewise.
9090 (vtbl1_p8): Likewise.
9091 (vtbl2_p8): Likewise.
9092 (vtbx2_p8): Likewise.
9094 2021-09-29 Richard Biener <rguenther@suse.de>
9096 * tree-vect-data-refs.c (vect_dr_misalign_for_aligned_access):
9098 (vect_update_misalignment_for_peel): Use it to update
9099 misaligned to the value necessary for an aligned access.
9100 (vect_get_peeling_costs_all_drs): Likewise.
9101 (vect_enhance_data_refs_alignment): Likewise.
9103 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9105 * config/aarch64/aarch64.c (aarch64_expand_cpymem): Count number of
9106 emitted operations and adjust heuristic for code size.
9108 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9110 * config/aarch64/aarch64.c (aarch64_expand_setmem): Count number of
9111 emitted operations and adjust heuristic for code size.
9113 2021-09-29 Jakub Jelinek <jakub@redhat.com>
9115 PR middle-end/102504
9116 * gimplify.c (gimplify_scan_omp_clauses): Use omp_check_private even
9117 in OMP_SCOPE clauses, not just on worksharing construct clauses.
9119 2021-09-28 Geng Qi <gengqi@linux.alibaba.com>
9121 * config/riscv/riscv.md (mulv<mode>4): Call gen_smul<mode>3_highpart.
9122 (<u>mulditi3): Call <su>muldi3_highpart.
9123 (<u>muldi3_highpart): Rename to <su>muldi3_highpart.
9124 (<u>mulsidi3): Call <su>mulsi3_highpart.
9125 (<u>mulsi3_highpart): Rename to <su>mulsi3_highpart.
9127 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
9129 * config/darwin.h (DSYMUTIL_SPEC): Recognize D sources.
9131 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
9133 * config/rs6000/darwin.h (FIXED_R13): Add for PPC64.
9134 (FIRST_SAVED_GP_REGNO): Save from R13 even when it is one
9137 2021-09-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9139 * config/aarch64/aarch64.h (AARCH64_FL_LS64): Define
9140 (AARCH64_FL_V8_7): Likewise.
9141 (AARCH64_FL_FOR_ARCH8_7): Likewise.
9142 * config/aarch64/aarch64-arches.def (armv8.7-a): Define.
9143 * config/aarch64/aarch64-option-extensions.def (ls64): Define.
9144 * doc/invoke.texi: Document the above.
9146 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9148 * dbgcnt.c (dbg_cnt_counter): New.
9149 * dbgcnt.h (dbg_cnt_counter): New.
9150 * dumpfile.c (dump_options): Add entry for TDF_THREADING.
9151 * dumpfile.h (enum dump_flag): Add TDF_THREADING.
9152 * gimple-range-path.cc (DEBUG_SOLVER): Use TDF_THREADING.
9153 * tree-ssa-threadupdate.c (dump_jump_thread_path): Dump out
9156 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9158 * cfgcleanup.c (pass_jump::execute): Check
9159 flag_expensive_optimizations.
9160 (pass_jump_after_combine::gate): Same.
9161 * doc/invoke.texi (-fthread-jumps): Enable for -O1.
9162 * opts.c (default_options_table): Enable -fthread-jumps at -O1.
9163 * tree-ssa-threadupdate.c
9164 (fwd_jt_path_registry::remove_jump_threads_including): Bail unless
9167 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
9169 * tree-ssa-reassoc.c (biased_names): New global.
9170 (propagate_bias_p): New function.
9171 (loop_carried_phi): Remove.
9172 (propagate_rank): Propagate bias along single uses.
9173 (get_rank): Update biased_names when needed.
9175 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
9177 * passes.def (pass_reassoc): Rename parameter to early_p.
9178 * tree-ssa-reassoc.c (reassoc_bias_loop_carried_phi_ranks_p):
9180 (phi_rank): Don't bias loop-carried phi ranks
9181 before vectorization pass.
9182 (execute_reassoc): Add bias_loop_carried_phi_ranks_p parameter.
9183 (pass_reassoc::pass_reassoc): Add bias_loop_carried_phi_ranks_p
9185 (pass_reassoc::set_param): Set bias_loop_carried_phi_ranks_p
9187 (pass_reassoc::execute): Pass bias_loop_carried_phi_ranks_p to
9189 (pass_reassoc::bias_loop_carried_phi_ranks_p): New member.
9191 2021-09-28 Jakub Jelinek <jakub@redhat.com>
9194 * config/i386/i386.c (standard_80387_constant_p): Don't recognize
9195 special 80387 instruction XFmode constants if flag_rounding_math.
9197 2021-09-28 Richard Biener <rguenther@suse.de>
9199 PR tree-optimization/100112
9200 * tree-ssa-sccvn.c (visit_reference_op_load): Record the
9201 referece into the hashtable twice in case last_vuse is
9202 different from the original vuse on the stmt.
9204 2021-09-28 Jakub Jelinek <jakub@redhat.com>
9206 PR middle-end/102492
9207 * gimplify.c (gimplify_adjust_omp_clauses_1): Don't call the
9208 omp_finish_clause langhook on implicitly added OMP_CLAUSE_PRIVATE
9209 clauses on SIMD constructs.
9211 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9213 PR tree-optimization/102511
9214 * gimple-range-path.cc (path_range_query::range_on_path_entry):
9215 Return VARYING when nothing found.
9217 2021-09-28 Hongyu Wang <hongyu.wang@intel.com>
9220 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): Add
9222 (VALID_SSE2_REG_VHF_MODE): Add V4HFmode and V2HFmode.
9223 (VALID_MMX_REG_MODE): Add V4HFmode.
9224 (SSE_REG_MODE_P): Replace VALID_AVX512FP16_REG_MODE with
9225 vector mode condition.
9226 * config/i386/i386.c (classify_argument): Parse V4HF/V2HF
9228 (function_arg_32): Add V4HFmode.
9229 (function_arg_advance_32): Likewise.
9230 * config/i386/i386.md (mode): Add V4HF/V2HF.
9231 (MODE_SIZE): Likewise.
9232 * config/i386/mmx.md (MMXMODE): Add V4HF mode.
9233 (V_32): Add V2HF mode.
9234 (VHF_32_64): New mode iterator.
9235 (*mov<mode>_internal): Adjust sse alternatives to support
9237 (*mov<mode>_internal): Adjust sse alternatives to support
9239 (<insn><mode>3): New define_insn for add/sub/mul/div.
9241 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9243 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Check
9245 (pass_early_thread_jumps::gate): Same.
9246 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
9247 Return if !flag_thread_jumps.
9248 * tree-ssa-threadupdate.c
9249 (jt_path_registry::register_jump_thread): Assert that
9250 flag_thread_jumps is true.
9252 2021-09-28 liuhongt <hongtao.liu@intel.com>
9255 (simplify_context::simplify_binary_operation_1): Relax
9256 condition of simplifying (vec_concat:M (vec_select op0
9257 index0)(vec_select op1 index1)) to allow different modes
9258 between op0 and M, but have same inner mode.
9260 2021-09-28 liuhongt <hongtao.liu@intel.com>
9262 * config/i386/i386-expand.c (emit_reduc_half): Handle
9263 V8HF/V16HF/V32HFmode.
9264 * config/i386/sse.md (REDUC_SSE_PLUS_MODE): Add V8HF.
9265 (REDUC_SSE_SMINMAX_MODE): Ditto.
9266 (REDUC_PLUS_MODE): Add V16HF and V32HF.
9267 (REDUC_SMINMAX_MODE): Ditto.
9269 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9271 * gimple-range-path.cc
9272 (path_range_query::precompute_ranges_in_block): Rename to...
9273 (path_range_query::compute_ranges_in_block): ...this.
9274 (path_range_query::precompute_ranges): Rename to...
9275 (path_range_query::compute_ranges): ...this.
9276 (path_range_query::precompute_relations): Rename to...
9277 (path_range_query::compute_relations): ...this.
9278 (path_range_query::precompute_phi_relations): Rename to...
9279 (path_range_query::compute_phi_relations): ...this.
9280 * gimple-range-path.h: Rename precompute* to compute*.
9281 * tree-ssa-threadbackward.c
9282 (back_threader::find_taken_edge_switch): Same.
9283 (back_threader::find_taken_edge_cond): Same.
9284 * tree-ssa-threadedge.c
9285 (hybrid_jt_simplifier::compute_ranges_from_state): Same.
9286 (hybrid_jt_state::register_equivs_stmt): Inline...
9287 * tree-ssa-threadedge.h: ...here.
9289 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9291 * tree-vrp.c (lhs_of_dominating_assert): Remove.
9292 (class vrp_jt_state): Remove.
9293 (class vrp_jt_simplifier): Remove.
9294 (vrp_jt_simplifier::simplify): Remove.
9295 (class vrp_jump_threader): Remove.
9296 (vrp_jump_threader::vrp_jump_threader): Remove.
9297 (vrp_jump_threader::~vrp_jump_threader): Remove.
9298 (vrp_jump_threader::before_dom_children): Remove.
9299 (vrp_jump_threader::after_dom_children): Remove.
9301 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9303 * passes.def (pass_vrp_threader): New.
9304 * tree-pass.h (make_pass_vrp_threader): Add make_pass_vrp_threader.
9305 * tree-ssa-threadedge.c (hybrid_jt_state::register_equivs_stmt): New.
9306 (hybrid_jt_simplifier::hybrid_jt_simplifier): New.
9307 (hybrid_jt_simplifier::simplify): New.
9308 (hybrid_jt_simplifier::compute_ranges_from_state): New.
9309 * tree-ssa-threadedge.h (class hybrid_jt_state): New.
9310 (class hybrid_jt_simplifier): New.
9311 * tree-vrp.c (execute_vrp): Remove ASSERT_EXPR based jump
9313 (class hybrid_threader): New.
9314 (hybrid_threader::hybrid_threader): New.
9315 (hybrid_threader::~hybrid_threader): New.
9316 (hybrid_threader::before_dom_children): New.
9317 (hybrid_threader::after_dom_children): New.
9318 (execute_vrp_threader): New.
9319 (class pass_vrp_threader): New.
9320 (make_pass_vrp_threader): New.
9322 2021-09-27 Martin Liska <mliska@suse.cz>
9324 * output.h (enum section_flag): New.
9325 (SECTION_FORGET): Remove.
9326 (SECTION_ENTSIZE): Make it (1UL << 8) - 1.
9327 (SECTION_STYLE_MASK): Define it based on other enum
9329 * varasm.c (switch_to_section): Remove unused handling of
9332 2021-09-27 Martin Liska <mliska@suse.cz>
9334 * common.opt: Add new variable flag_default_complex_method.
9335 * opts.c (finish_options): Handle flags related to
9336 x_flag_complex_method.
9337 * toplev.c (process_options): Remove option handling related
9338 to flag_complex_method.
9340 2021-09-27 Richard Biener <rguenther@suse.de>
9342 PR middle-end/102450
9343 * gimple-fold.c (gimple_fold_builtin_memory_op): Avoid using
9344 type_for_size, instead use int_mode_for_size.
9346 2021-09-27 Andrew Pinski <apinski@marvell.com>
9349 * gimplify.c (gimplify_save_expr): Return early
9350 if the type of val is error_mark_node.
9352 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9354 * tree-ssanames.c (ssa_name_has_boolean_range): Use
9357 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9359 * gimple-ssa-evrp-analyze.h (class evrp_range_analyzer): Remove
9360 vrp_visit_cond_stmt.
9361 * tree-ssa-dom.c (cprop_operand): Convert to range_query API.
9362 (cprop_into_stmt): Same.
9363 (dom_opt_dom_walker::optimize_stmt): Same.
9365 2021-09-27 Richard Biener <rguenther@suse.de>
9367 PR tree-optimization/97351
9368 PR tree-optimization/97352
9369 PR tree-optimization/82426
9370 * tree-vectorizer.h (dr_misalignment): Add vector type
9372 (aligned_access_p): Likewise.
9373 (known_alignment_for_access_p): Likewise.
9374 (vect_supportable_dr_alignment): Likewise.
9375 (vect_known_alignment_in_bytes): Likewise. Refactor.
9376 (DR_MISALIGNMENT): Remove.
9377 (vect_update_shared_vectype): Likewise.
9378 * tree-vect-data-refs.c (dr_misalignment): Refactor, handle
9379 a vector type with larger alignment requirement and apply
9380 the negative step adjustment here.
9381 (vect_calculate_target_alignment): Remove.
9382 (vect_compute_data_ref_alignment): Get explicit vector type
9383 argument, do not apply a negative step alignment adjustment
9385 (vect_slp_analyze_node_alignment): Re-analyze alignment
9386 when we re-visit the DR with a bigger desired alignment but
9387 keep more precise results from smaller alignments.
9388 * tree-vect-slp.c (vect_update_shared_vectype): Remove.
9389 (vect_slp_analyze_node_operations_1): Do not update the
9390 shared vector type on stmts.
9391 * tree-vect-stmts.c (vect_analyze_stmt): Push/pop the
9392 vector type of an SLP node to the representative stmt-info.
9393 (vect_transform_stmt): Likewise.
9395 2021-09-27 liuhongt <hongtao.liu@intel.com>
9398 2021-09-09 liuhongt <hongtao.liu@intel.com>
9401 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
9402 (reduc_plus_scal_v4sf): .. this, New define_expand.
9403 (reduc_plus_scal_v2df): .. and this, New define_expand.
9405 2021-09-26 liuhongt <hongtao.liu@intel.com>
9407 * doc/extend.texi (Half-Precision): Remove storage only
9408 description for _Float16 w/o avx512fp16.
9410 2021-09-25 Dimitar Dimitrov <dimitar@dinux.eu>
9412 * config/pru/constraints.md (Rrio): New constraint.
9413 * config/pru/predicates.md (regio_operand): New predicate.
9414 * config/pru/pru-pragma.c (pru_register_pragmas): Register
9415 the __regio_symbol address space.
9416 * config/pru/pru-protos.h (pru_symref2ioregno): Declaration.
9417 * config/pru/pru.c (pru_symref2ioregno): New helper function.
9418 (pru_legitimate_address_p): Remove.
9419 (pru_addr_space_legitimate_address_p): Use the address space
9421 (pru_nongeneric_pointer_addrspace): New helper function.
9422 (pru_insert_attributes): New function to validate __regio_symbol
9424 (TARGET_INSERT_ATTRIBUTES): New macro.
9425 (TARGET_LEGITIMATE_ADDRESS_P): Remove.
9426 (TARGET_ADDR_SPACE_LEGITIMATE_ADDRESS_P): New macro.
9427 * config/pru/pru.h (enum reg_class): Add REGIO_REGS class.
9428 * config/pru/pru.md (*regio_readsi): New pattern to read I/O
9430 (*regio_nozext_writesi): New pattern to write to I/O registers.
9431 (*regio_zext_write_r30<EQS0:mode>): Ditto.
9432 * doc/extend.texi: Document the new PRU Named Address Space.
9434 2021-09-24 Patrick Palka <ppalka@redhat.com>
9438 * real.c (encode_ieee_double): Avoid unwanted sign extension.
9439 (encode_ieee_quad): Likewise.
9441 2021-09-24 Vladimir Makarov <vmakarov@redhat.com>
9443 PR rtl-optimization/102147
9444 * ira-build.c (ira_conflict_vector_profitable_p): Make
9445 profitability calculation independent of host compiler pointer and
9448 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
9450 * gimple-range-path.cc (path_range_query::path_range_query):
9451 Move debugging header...
9452 (path_range_query::precompute_ranges): ...here.
9453 (path_range_query::internal_range_of_expr): Do not call
9454 range_on_path_entry if NAME is defined in the current block.
9456 2021-09-24 Richard Biener <rguenther@suse.de>
9458 * cfghooks.c (verify_flow_info): Verify unallocated BB and
9459 edge flags are not set.
9461 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
9463 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
9465 (jt_path_registry::register_jump_thread): Call
9466 cancel_invalid_paths.
9467 * tree-ssa-threadupdate.h (class jt_path_registry): Add
9468 cancel_invalid_paths.
9470 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
9472 PR tree-optimization/102400
9473 * tree-ssa-sccvn.c (vn_reference_insert_pieces): Initialize
9474 result_vdef to zero value.
9476 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
9478 PR tree-optimization/102451
9479 * tree-ssa-dse.c (delete_dead_or_redundant_call): Record bb of stmt
9482 2021-09-24 Hongyu Wang <hongyu.wang@intel.com>
9484 * config/i386/sse.md (cond_<insn><mode>): Extend to support
9486 (cond_mul<mode>): Likewise.
9487 (cond_div<mode>): Likewise.
9488 (cond_<code><mode>): Likewise.
9489 (cond_fma<mode>): Likewise.
9490 (cond_fms<mode>): Likewise.
9491 (cond_fnma<mode>): Likewise.
9492 (cond_fnms<mode>): Likewise.
9494 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
9496 PR tree-optimization/102463
9497 * gimple-range-fold.cc (fold_using_range::relation_fold_and_or): If
9498 there is no range-ops handler, don't look for a relation.
9500 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
9502 * gimple-range-cache.cc (ranger_cache::ranger_cache): Take
9503 non-executable_edge flag as parameter.
9504 * gimple-range-cache.h (ranger_cache): Adjust prototype.
9505 * gimple-range-gori.cc (gori_compute::gori_compute): Take
9506 non-executable_edge flag as parameter.
9507 (gori_compute::outgoing_edge_range_p): Check new flag.
9508 * gimple-range-gori.h (gori_compute): Adjust prototype.
9509 * gimple-range.cc (gimple_ranger::gimple_ranger): Create new flag.
9510 (gimple_ranger::range_on_edge): Check new flag.
9511 * gimple-range.h (gimple_ranger::non_executable_edge_flag): New.
9512 * gimple-ssa-evrp.c (rvrp_folder): Pass ranger flag to simplifer.
9513 (hybrid_folder::hybrid_folder): Set ranger non-executable flag value.
9514 (hybrid_folder::fold_stmt): Set flag value in the simplifer.
9515 * vr-values.c (simplify_using_ranges::set_and_propagate_unexecutable):
9516 Use not_executable flag if provided inmstead of EDGE_EXECUTABLE.
9517 (simplify_using_ranges::simplify_switch_using_ranges): Clear
9518 EDGE_EXECUTABLE like it originally did.
9519 (simplify_using_ranges::cleanup_edges_and_switches): Clear any
9520 NON_EXECUTABLE flags.
9521 (simplify_using_ranges::simplify_using_ranges): Adjust.
9522 * vr-values.h (class simplify_using_ranges): Adjust.
9523 (simplify_using_ranges::set_range_query): Add non-executable flag param.
9525 2021-09-23 Bill Schmidt <wschmidt@linux.ibm.com>
9528 * config/rs6000/rs6000-call.c (rs6000_aggregate_candidate): Detect
9529 zero-width bit fields and return indicator.
9530 (rs6000_discover_homogeneous_aggregate): Diagnose when the
9531 presence of a zero-width bit field changes parameter passing in
9534 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9536 * gimple-range-fold.cc (fold_using_range::range_of_phi):
9537 Remove dominator check.
9539 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9541 * gimple-range-path.cc (path_range_query::precompute_relations):
9542 Hoist edge calculations before using EDGE_SUCC.
9544 2021-09-23 Jonathan Wakely <jwakely@redhat.com>
9546 * configure.ac: Fix --with-multilib-list description.
9547 * configure: Regenerate.
9549 2021-09-23 Richard Biener <rguenther@suse.de>
9551 PR tree-optimization/102448
9552 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
9553 Clear alignment info copied from DR_PTR_INFO.
9555 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9557 * config/i386/i386-expand.c (ix86_use_mask_cmp_p): Enable
9559 * config/i386/sse.md (sseintvecmodelower): Add HF vector modes.
9560 (<avx512>_store<mode>_mask): Extend to support HF vector modes.
9561 (vec_cmp<mode><avx512fmaskmodelower>): Likewise.
9562 (vcond_mask_<mode><avx512fmaskmodelower>): Likewise.
9563 (vcond<mode><mode>): New expander.
9564 (vcond<mode><sseintvecmodelower>): Likewise.
9565 (vcond<sseintvecmodelower><mode>): Likewise.
9566 (vcondu<mode><sseintvecmodelower>): Likewise.
9568 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9570 * config/i386/sse.md (extend<ssePHmodelower><mode>2):
9572 (extendv4hf<mode>2): Likewise.
9573 (extendv2hfv2df2): Likewise.
9574 (trunc<mode><ssePHmodelower>2): Likewise.
9575 (avx512fp16_vcvt<castmode>2ph_<mode>): Rename to ...
9576 (trunc<mode>v4hf2): ... this, and drop constraints.
9577 (avx512fp16_vcvtpd2ph_v2df): Rename to ...
9578 (truncv2dfv2hf2): ... this, and likewise.
9580 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9582 * config/i386/sse.md (float<floatunssuffix><mode><ssePHmodelower>2):
9584 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>):
9586 (float<floatunssuffix><mode>v4hf2): ... this, and drop constraints.
9587 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Rename to ...
9588 (float<floatunssuffix>v2div2hf2): ... this, and likewise.
9590 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9592 * config/i386/i386.md (fix<fixunssuffix>_trunchf<mode>2): New expander.
9593 (fixuns_trunchfhi2): Likewise.
9594 (*fixuns_trunchfsi2zext): New define_insn.
9595 * config/i386/sse.md (ssePHmodelower): New mode_attr.
9596 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
9597 New expander for same element vector fix_truncate.
9598 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
9599 Likewise for V4HF to V4SI/V4DI fix_truncate.
9600 (fix<fixunssuffix>_truncv2hfv2di2):
9601 Likeise for V2HF to V2DI fix_truncate.
9603 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9605 * config/i386/i386.md (<code>hf3): New expander.
9607 2021-09-23 liuhongt <hongtao.liu@intel.com>
9609 * config/i386/sse.md (FMAMODEM): extend to handle FP16.
9610 (VFH_SF_AVX512VL): Extend to handle HFmode.
9611 (VF_SF_AVX512VL): Deleted.
9613 2021-09-23 liuhongt <hongtao.liu@intel.com>
9615 * config/i386/i386.md (rinthf2): New expander.
9616 (nearbyinthf2): New expander.
9618 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9620 * tree-ssa-dom.c (class dom_jump_threader_simplifier): Rename...
9621 (class dom_jt_state): ...this and provide virtual overrides.
9622 (dom_jt_state::register_equiv): New.
9623 (class dom_jt_simplifier): Rename from
9624 dom_jump_threader_simplifier.
9625 (dom_jump_threader_simplifier::simplify): Rename...
9626 (dom_jt_simplifier::simplify): ...to this.
9627 (pass_dominator::execute): Use dom_jt_simplifier and
9629 * tree-ssa-threadedge.c (jump_threader::jump_threader):
9631 (jt_state::register_equivs_stmt): Abstract out...
9632 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
9634 (jump_threader::thread_around_empty_blocks): Update state.
9635 (jump_threader::thread_through_normal_block): Same.
9636 (jt_state::jt_state): Remove.
9637 (jt_state::push): Remove pass specific bits. Keep block vector
9639 (jt_state::append_path): New.
9640 (jt_state::pop): Remove pass specific bits.
9641 (jt_state::register_equiv): Same.
9642 (jt_state::record_ranges_from_stmt): Same.
9643 (jt_state::register_equivs_on_edge): Same. Rename...
9644 (jt_state::register_equivs_edge): ...to this.
9645 (jt_state::dump): New.
9646 (jt_state::debug): New.
9647 (jump_threader_simplifier::simplify): Remove.
9648 (jt_state::get_path): New.
9649 * tree-ssa-threadedge.h (class jt_simplifier): Make into a base
9650 class. Expose common functionality as virtual methods.
9651 (class jump_threader_simplifier): Same. Rename...
9652 (class jt_simplifier): ...to this.
9653 * tree-vrp.c (class vrp_jump_threader_simplifier): Rename...
9654 (class vrp_jt_simplifier): ...to this. Provide pass specific
9656 (class vrp_jt_state): New.
9657 (vrp_jump_threader_simplifier::simplify): Rename...
9658 (vrp_jt_simplifier::simplify): ...to this. Inline code from
9659 what used to be the base class.
9660 (vrp_jump_threader::vrp_jump_threader): Use vrp_jt_state and
9663 2021-09-22 Tobias Burnus <tobias@codesourcery.com>
9666 * doc/invoke.texi (-Wno-missing-include-dirs.): Document Fortran
9669 2021-09-22 Roger Sayle <roger@nextmovesoftware.com>
9670 Richard Biener <rguenther@suse.de>
9672 * match.pd (negation simplifications): Implement some negation
9673 folding transformations from fold-const.c's fold_negate_expr.
9674 * tree-ssa-sccvn.c (vn_nary_build_or_lookup_1): Add a SIMPLIFY
9675 argument, to control whether the op should be simplified prior
9676 to looking up/assigning a value number.
9677 (vn_nary_build_or_lookup): Update call to vn_nary_build_or_lookup_1.
9678 (vn_nary_simplify): Likewise.
9679 (visit_nary_op): Likewise, but when constructing a NEGATE_EXPR
9680 now call vn_nary_build_or_lookup_1 disabling simplification.
9682 2021-09-22 Jiufu Guo <guojiufu@linux.ibm.com>
9684 PR tree-optimization/102087
9685 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
9686 Update bound/cmp/control for niter.
9688 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
9690 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
9691 Move check for non-empty BB here.
9692 (fur_source::register_outgoing_edges): ...from here.
9694 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
9696 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
9697 Remove call to improve_range_with_equivs.
9698 (path_range_query::improve_range_with_equivs): Remove
9699 * gimple-range-path.h: Remove improve_range_with_equivs.
9701 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9703 * config/i386/avx512fp16intrin.h:
9704 (_mm512_mask_blend_ph): New intrinsic.
9705 (_mm512_permutex2var_ph): Ditto.
9706 (_mm512_permutexvar_ph): Ditto.
9707 * config/i386/avx512fp16vlintrin.h:
9708 (_mm256_mask_blend_ph): New intrinsic.
9709 (_mm256_permutex2var_ph): Ditto.
9710 (_mm256_permutexvar_ph): Ditto.
9711 (_mm_mask_blend_ph): Ditto.
9712 (_mm_permutex2var_ph): Ditto.
9713 (_mm_permutexvar_ph): Ditto.
9715 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9717 * config/i386/avx512fp16intrin.h: Add new intrinsics.
9718 (_mm512_conj_pch): New intrinsic.
9719 (_mm512_mask_conj_pch): Ditto.
9720 (_mm512_maskz_conj_pch): Ditto.
9721 * config/i386/avx512fp16vlintrin.h: Add new intrinsics.
9722 (_mm256_conj_pch): New intrinsic.
9723 (_mm256_mask_conj_pch): Ditto.
9724 (_mm256_maskz_conj_pch): Ditto.
9725 (_mm_conj_pch): Ditto.
9726 (_mm_mask_conj_pch): Ditto.
9727 (_mm_maskz_conj_pch): Ditto.
9729 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9731 * config/i386/avx512fp16intrin.h (_MM512_REDUCE_OP): New macro
9732 (_mm512_reduce_add_ph): New intrinsic.
9733 (_mm512_reduce_mul_ph): Ditto.
9734 (_mm512_reduce_min_ph): Ditto.
9735 (_mm512_reduce_max_ph): Ditto.
9736 * config/i386/avx512fp16vlintrin.h
9737 (_MM256_REDUCE_OP/_MM_REDUCE_OP): New macro.
9738 (_mm256_reduce_add_ph): New intrinsic.
9739 (_mm256_reduce_mul_ph): Ditto.
9740 (_mm256_reduce_min_ph): Ditto.
9741 (_mm256_reduce_max_ph): Ditto.
9742 (_mm_reduce_add_ph): Ditto.
9743 (_mm_reduce_mul_ph): Ditto.
9744 (_mm_reduce_min_ph): Ditto.
9745 (_mm_reduce_max_ph): Ditto.
9747 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9749 * config/i386/avx512fp16intrin.h (__m512h_u, __m256h_u,
9750 __m128h_u): New typedef.
9751 (_mm512_load_ph): New intrinsic.
9752 (_mm256_load_ph): Ditto.
9753 (_mm_load_ph): Ditto.
9754 (_mm512_loadu_ph): Ditto.
9755 (_mm256_loadu_ph): Ditto.
9756 (_mm_loadu_ph): Ditto.
9757 (_mm512_store_ph): Ditto.
9758 (_mm256_store_ph): Ditto.
9759 (_mm_store_ph): Ditto.
9760 (_mm512_storeu_ph): Ditto.
9761 (_mm256_storeu_ph): Ditto.
9762 (_mm_storeu_ph): Ditto.
9763 (_mm512_abs_ph): Ditto.
9764 * config/i386/avx512fp16vlintrin.h
9765 (_mm_abs_ph): Ditto.
9766 (_mm256_abs_ph): Ditto.
9768 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
9770 * config/s390/tpf.md (prologue_tpf, epilogue_tpf): Add cc clobber.
9772 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
9775 * config/s390/s390.c (s390_expand_insv): Emit a normal move if it
9776 is actually a full copy of the source operand into the target.
9777 Don't emit a strict low part move if source and target mode match.
9779 2021-09-22 Jakub Jelinek <jakub@redhat.com>
9781 PR middle-end/102415
9782 * omp-expand.c (expand_omp_single): If region->exit is NULL,
9783 assert region->entry is GIMPLE_OMP_SCOPE region and return.
9785 2021-09-22 Jakub Jelinek <jakub@redhat.com>
9787 * tree.h (OMP_CLAUSE_ALLOCATE_ALIGN): Define.
9788 * tree.c (omp_clause_num_ops): Change number of OMP_CLAUSE_ALLOCATE
9789 arguments from 2 to 3.
9790 * tree-pretty-print.c (dump_omp_clause): Print allocator() around
9791 allocate clause allocator and print align if present.
9792 * omp-low.c (scan_sharing_clauses): Force allocate_map entry even
9793 for omp_default_mem_alloc if align modifier is present. If align
9794 modifier is present, use TREE_LIST to encode both allocator and
9796 (lower_private_allocate, lower_rec_input_clauses, create_task_copyfn):
9797 Handle align modifier on allocator clause if present.
9799 2021-09-22 liuhongt <hongtao.liu@intel.com>
9801 * config/i386/i386.md (define_attr "isa"): Add
9803 (define_attr "enabled"): Correspond fma_or_avx512vl to
9804 TARGET_FMA || TARGET_AVX512VL.
9805 * config/i386/mmx.md (fmav2sf4): Extend to AVX512 fma.
9810 2021-09-22 liuhongt <hongtao.liu@intel.com>
9812 * config/i386/i386.md (cstorehf3): New define_expand.
9814 2021-09-22 liuhongt <hongtao.liu@intel.com>
9816 * config/i386/i386.md (<rounding_insn>hf2): New expander.
9817 (sse4_1_round<mode>2): Extend from MODEF to MODEFH.
9818 * config/i386/sse.md (*sse4_1_round<ssescalarmodesuffix>):
9819 Extend from VF_128 to VFH_128.
9821 2021-09-22 liuhongt <hongtao.liu@intel.com>
9823 * config/i386/i386-features.c (i386-features.c): Handle
9825 * config/i386/i386.md (sqrthf2): New expander.
9826 (*sqrthf2): New define_insn.
9827 * config/i386/sse.md
9828 (*<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
9831 2021-09-22 liuhongt <hongtao.liu@intel.com>
9833 * config/i386/avx512fp16intrin.h (_mm_mask_fcmadd_sch):
9835 (_mm_mask3_fcmadd_sch): Likewise.
9836 (_mm_maskz_fcmadd_sch): Likewise.
9837 (_mm_fcmadd_sch): Likewise.
9838 (_mm_mask_fmadd_sch): Likewise.
9839 (_mm_mask3_fmadd_sch): Likewise.
9840 (_mm_maskz_fmadd_sch): Likewise.
9841 (_mm_fmadd_sch): Likewise.
9842 (_mm_mask_fcmadd_round_sch): Likewise.
9843 (_mm_mask3_fcmadd_round_sch): Likewise.
9844 (_mm_maskz_fcmadd_round_sch): Likewise.
9845 (_mm_fcmadd_round_sch): Likewise.
9846 (_mm_mask_fmadd_round_sch): Likewise.
9847 (_mm_mask3_fmadd_round_sch): Likewise.
9848 (_mm_maskz_fmadd_round_sch): Likewise.
9849 (_mm_fmadd_round_sch): Likewise.
9850 (_mm_fcmul_sch): Likewise.
9851 (_mm_mask_fcmul_sch): Likewise.
9852 (_mm_maskz_fcmul_sch): Likewise.
9853 (_mm_fmul_sch): Likewise.
9854 (_mm_mask_fmul_sch): Likewise.
9855 (_mm_maskz_fmul_sch): Likewise.
9856 (_mm_fcmul_round_sch): Likewise.
9857 (_mm_mask_fcmul_round_sch): Likewise.
9858 (_mm_maskz_fcmul_round_sch): Likewise.
9859 (_mm_fmul_round_sch): Likewise.
9860 (_mm_mask_fmul_round_sch): Likewise.
9861 (_mm_maskz_fmul_round_sch): Likewise.
9862 * config/i386/i386-builtin.def: Add corresponding new builtins.
9863 * config/i386/sse.md
9864 (avx512fp16_fmaddcsh_v8hf_maskz<round_expand_name>): New expander.
9865 (avx512fp16_fcmaddcsh_v8hf_maskz<round_expand_name>): Ditto.
9866 (avx512fp16_fma_<complexopname>sh_v8hf<mask_scalarcz_name><round_scalarcz_name>):
9868 (avx512fp16_<complexopname>sh_v8hf_mask<round_name>): Ditto.
9869 (avx512fp16_<complexopname>sh_v8hf<mask_scalarc_name><round_scalarcz_name>):
9871 * config/i386/subst.md (mask_scalarcz_name): New.
9872 (mask_scalarc_name): Ditto.
9873 (mask_scalarc_operand3): Ditto.
9874 (mask_scalarcz_operand4): Ditto.
9875 (round_scalarcz_name): Ditto.
9876 (round_scalarc_mask_operand3): Ditto.
9877 (round_scalarcz_mask_operand4): Ditto.
9878 (round_scalarc_mask_op3): Ditto.
9879 (round_scalarcz_mask_op4): Ditto.
9880 (round_scalarcz_constraint): Ditto.
9881 (round_scalarcz_nimm_predicate): Ditto.
9882 (mask_scalarcz): Ditto.
9883 (mask_scalarc): Ditto.
9884 (round_scalarcz): Ditto.
9886 2021-09-22 liuhongt <hongtao.liu@intel.com>
9888 * config/i386/avx512fp16intrin.h (_mm512_fcmadd_pch):
9890 (_mm512_mask_fcmadd_pch): Likewise.
9891 (_mm512_mask3_fcmadd_pch): Likewise.
9892 (_mm512_maskz_fcmadd_pch): Likewise.
9893 (_mm512_fmadd_pch): Likewise.
9894 (_mm512_mask_fmadd_pch): Likewise.
9895 (_mm512_mask3_fmadd_pch): Likewise.
9896 (_mm512_maskz_fmadd_pch): Likewise.
9897 (_mm512_fcmadd_round_pch): Likewise.
9898 (_mm512_mask_fcmadd_round_pch): Likewise.
9899 (_mm512_mask3_fcmadd_round_pch): Likewise.
9900 (_mm512_maskz_fcmadd_round_pch): Likewise.
9901 (_mm512_fmadd_round_pch): Likewise.
9902 (_mm512_mask_fmadd_round_pch): Likewise.
9903 (_mm512_mask3_fmadd_round_pch): Likewise.
9904 (_mm512_maskz_fmadd_round_pch): Likewise.
9905 (_mm512_fcmul_pch): Likewise.
9906 (_mm512_mask_fcmul_pch): Likewise.
9907 (_mm512_maskz_fcmul_pch): Likewise.
9908 (_mm512_fmul_pch): Likewise.
9909 (_mm512_mask_fmul_pch): Likewise.
9910 (_mm512_maskz_fmul_pch): Likewise.
9911 (_mm512_fcmul_round_pch): Likewise.
9912 (_mm512_mask_fcmul_round_pch): Likewise.
9913 (_mm512_maskz_fcmul_round_pch): Likewise.
9914 (_mm512_fmul_round_pch): Likewise.
9915 (_mm512_mask_fmul_round_pch): Likewise.
9916 (_mm512_maskz_fmul_round_pch): Likewise.
9917 * config/i386/avx512fp16vlintrin.h (_mm_fmadd_pch):
9919 (_mm_mask_fmadd_pch): Likewise.
9920 (_mm_mask3_fmadd_pch): Likewise.
9921 (_mm_maskz_fmadd_pch): Likewise.
9922 (_mm256_fmadd_pch): Likewise.
9923 (_mm256_mask_fmadd_pch): Likewise.
9924 (_mm256_mask3_fmadd_pch): Likewise.
9925 (_mm256_maskz_fmadd_pch): Likewise.
9926 (_mm_fcmadd_pch): Likewise.
9927 (_mm_mask_fcmadd_pch): Likewise.
9928 (_mm_mask3_fcmadd_pch): Likewise.
9929 (_mm_maskz_fcmadd_pch): Likewise.
9930 (_mm256_fcmadd_pch): Likewise.
9931 (_mm256_mask_fcmadd_pch): Likewise.
9932 (_mm256_mask3_fcmadd_pch): Likewise.
9933 (_mm256_maskz_fcmadd_pch): Likewise.
9934 (_mm_fmul_pch): Likewise.
9935 (_mm_mask_fmul_pch): Likewise.
9936 (_mm_maskz_fmul_pch): Likewise.
9937 (_mm256_fmul_pch): Likewise.
9938 (_mm256_mask_fmul_pch): Likewise.
9939 (_mm256_maskz_fmul_pch): Likewise.
9940 (_mm_fcmul_pch): Likewise.
9941 (_mm_mask_fcmul_pch): Likewise.
9942 (_mm_maskz_fcmul_pch): Likewise.
9943 (_mm256_fcmul_pch): Likewise.
9944 (_mm256_mask_fcmul_pch): Likewise.
9945 (_mm256_maskz_fcmul_pch): Likewise.
9946 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF_V8HF_V8HF,
9947 V8HF_FTYPE_V16HF_V16HF_V16HF, V16HF_FTYPE_V16HF_V16HF_V16HF_UQI,
9948 V32HF_FTYPE_V32HF_V32HF_V32HF_INT,
9949 V32HF_FTYPE_V32HF_V32HF_V32HF_UHI_INT): Add new builtin types.
9950 * config/i386/i386-builtin.def: Add new builtins.
9951 * config/i386/i386-expand.c: Handle new builtin types.
9952 * config/i386/subst.md (SUBST_CV): New.
9953 (maskc_name): Ditto.
9954 (maskc_operand3): Ditto.
9956 (sdc_maskz_name): Ditto.
9957 (sdc_mask_op4): Ditto.
9958 (sdc_mask_op5): Ditto.
9959 (sdc_mask_mode512bit_condition): Ditto.
9961 (round_maskc_operand3): Ditto.
9962 (round_sdc_mask_operand4): Ditto.
9963 (round_maskc_op3): Ditto.
9964 (round_sdc_mask_op4): Ditto.
9965 (round_saeonly_sdc_mask_operand5): Ditto.
9966 * config/i386/sse.md (unspec): Add complex fma unspecs.
9967 (avx512fmaskcmode): New.
9968 (UNSPEC_COMPLEX_F_C_MA): Ditto.
9969 (UNSPEC_COMPLEX_F_C_MUL): Ditto.
9970 (complexopname): Ditto.
9971 (<avx512>_fmaddc_<mode>_maskz<round_expand_name>): New expander.
9972 (<avx512>_fcmaddc_<mode>_maskz<round_expand_name>): Ditto.
9973 (fma_<complexopname>_<mode><sdc_maskz_name><round_name>): New
9975 (<avx512>_<complexopname>_<mode>_mask<round_name>): Ditto.
9976 (<avx512>_<complexopname>_<mode><maskc_name><round_name>): Ditto.
9978 2021-09-22 Kewen Lin <linkw@linux.ibm.com>
9980 * config/rs6000/rs6000.opt (rs6000-density-pct-threshold,
9981 rs6000-density-size-threshold, rs6000-density-penalty,
9982 rs6000-density-load-pct-threshold,
9983 rs6000-density-load-num-threshold): New parameter.
9984 * config/rs6000/rs6000.c (rs6000_density_test): Adjust with
9985 corresponding parameters.
9987 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9989 * gimple-range-path.cc (path_range_query::defined_outside_path):
9991 (path_range_query::range_on_path_entry): New.
9992 (path_range_query::internal_range_of_expr): Resolve unknowns
9994 (path_range_query::improve_range_with_equivs): New.
9995 (path_range_query::ssa_range_in_phi): Resolve unknowns with
9997 * gimple-range-path.h (class path_range_query): Add
9998 defined_outside_path, range_on_path_entry, and
9999 improve_range_with_equivs.
10001 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10003 * gimple-range-path.cc (path_range_query::add_to_imports): New.
10004 (path_range_query::add_copies_to_imports): New.
10005 (path_range_query::precompute_ranges): Call
10006 add_copies_to_imports.
10007 * gimple-range-path.h (class path_range_query): Add prototypes
10008 for add_copies_to_imports and add_to_imports.
10010 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10012 * gimple-range-path.cc (path_range_query::range_defined_in_block):
10013 Remove useless code.
10015 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10017 * gimple-range-fold.h (class fur_source): Make oracle protected.
10018 * gimple-range-path.cc (path_range_query::path_range_query): Add
10019 resolve argument. Initialize oracle.
10020 (path_range_query::~path_range_query): Delete oracle.
10021 (path_range_query::range_of_stmt): Adapt to use relations.
10022 (path_range_query::precompute_ranges): Pre-compute relations.
10023 (class jt_fur_source): New
10024 (jt_fur_source::jt_fur_source): New.
10025 (jt_fur_source::register_relation): New.
10026 (jt_fur_source::query_relation): New.
10027 (path_range_query::precompute_relations): New.
10028 (path_range_query::precompute_phi_relations): New.
10029 * gimple-range-path.h (path_range_query): Add resolve argument.
10030 Add oracle, precompute_relations, precompute_phi_relations.
10031 * tree-ssa-threadbackward.c (back_threader::back_threader): Pass
10032 resolve argument to solver.
10034 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10036 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
10037 Rename postfold_gcond_edges to register_outgoing_edges and
10039 (fold_using_range::postfold_gcond_edges): Rename...
10040 (fur_source::register_outgoing_edges): ...to this.
10041 * gimple-range-fold.h (postfold_gcond_edges): Rename to
10042 register_outgoing_edges and move to fur_source.
10044 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10046 * gimple-range-fold.cc (fold_using_range::range_of_phi): Check
10047 dom_info_available_p.
10049 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
10051 * gimple-range-cache.cc (non_null_ref::non_null_ref): Use create
10052 and quick_grow_cleared instead of safe_grow_cleared.
10054 2021-09-21 Thomas Schwinge <thomas@codesourcery.com>
10057 * omp-oacc-neuter-broadcast.cc (oacc_do_neutering): Evaluate
10058 'random ()' to '0'.
10060 2021-09-21 Richard Earnshaw <rearnsha@arm.com>
10062 * configure.ac: Detect when the assembler supports new-style
10063 architecture extensions.
10064 * common/config/arm/arm-common.c (arm_rewrite_mcpu): Return
10065 the full CPU string if the assembler can grok it.
10066 (arm_rewrite_march): Likewise but for the architecture.
10067 * config.in: Regenerate.
10068 * configure: Regenerate.
10070 2021-09-21 Richard Biener <rguenther@suse.de>
10072 PR tree-optimization/102421
10073 * tree-vect-loop.c (vect_dissolve_slp_only_groups): Copy and
10074 adjust alignment info.
10076 2021-09-21 Kewen Lin <linkw@linux.ibm.com>
10078 * ipa-fnsummary.c (ipa_fn_summary_write): Remove inconsistent
10079 bitfield stream out.
10081 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
10083 * gimple-range-fold.cc (fold_using_range::range_of_phi): Ignore
10084 undefined edges, apply an equivalence if appropriate.
10085 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Return
10086 UNDEFINED if EDGE_EXECUTABLE is not set.
10087 * gimple-range.cc (gimple_ranger::gimple_ranger): Set all edges
10088 as EXECUTABLE upon startup.
10089 (gimple_ranger::range_on_edge): Return UNDEFINED for edges without
10090 EDGE_EXECUTABLE set.
10091 * vr-values.c (set_and_propagate_unexecutable): New.
10092 (simplify_using_ranges::fold_cond): Call set_and_propagate.
10093 (simplify_using_ranges::simplify_switch_using_ranges): Ditto.
10094 * vr-values.h: Add prototype.
10096 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
10098 * value-relation.cc (equiv_oracle::register_initial_def): New.
10099 (equiv_oracle::register_relation): Call register_initial_def.
10100 (equiv_oracle::add_equiv_to_block): New. Split register_relation.
10101 (relation_oracle::register_stmt): Check def block of PHI arguments.
10102 * value-relation.h (equiv_oracle): Add new prototypes.
10104 2021-09-20 Matthias Kretz <m.kretz@gsi.de>
10106 * cppbuiltin.c (define_builtin_macros_for_compilation_flags):
10107 Define __RECIPROCAL_MATH__, __NO_SIGNED_ZEROS__,
10108 __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, and
10109 __ROUNDING_MATH__ according to their corresponding flags.
10110 * doc/cpp.texi: Document __RECIPROCAL_MATH__,
10111 __NO_SIGNED_ZEROS__, __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__,
10112 and __ROUNDING_MATH__.
10114 2021-09-20 Richard Biener <rguenther@suse.de>
10116 * tree-vect-stmts.c (vectorizable_load): Use the vectype
10119 2021-09-20 Richard Biener <rguenther@suse.de>
10121 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
10122 Do not compute alignment of the vectorized access here.
10124 2021-09-20 Richard Biener <rguenther@suse.de>
10126 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
10127 Store -1 for runtime alias peeling iterations.
10129 2021-09-20 Richard Biener <rguenther@suse.de>
10131 * config.gcc: Obsolete hppa[12]*-*-hpux10* and hppa[12]*-*-hpux11*.
10133 2021-09-20 Thomas Schwinge <thomas@codesourcery.com>
10135 * input.c (string_concat_db::record_string_concatenation)
10136 (string_concat_db::get_string_concatenation): Skip for
10137 'RESERVED_LOCATION_P'.
10139 2021-09-20 Richard Biener <rguenther@suse.de>
10141 PR tree-optimization/65206
10142 * tree-data-ref.h (struct data_reference): Add alt_indices,
10144 * tree-data-ref.c (free_data_ref): Release alt_indices.
10145 (dr_analyze_indices): Work on struct indices and get DR_REF as tree.
10146 (create_data_ref): Adjust.
10147 (initialize_data_dependence_relation): Split into head
10148 and tail. When the base objects fail to match up try
10149 again with pointer-based analysis of indices.
10150 * tree-vectorizer.c (vec_info_shared::check_datarefs): Do
10151 not compare the lazily computed alternate set of indices.
10153 2021-09-20 Iain Sandoe <iain@sandoe.co.uk>
10155 * gcc.c: Test for execute OK when we find the
10156 programs for assembler linker and dsymutil and those
10157 were specified at configure-time.
10159 2021-09-19 Martin Sebor <msebor@redhat.com>
10161 PR middle-end/102403
10162 * gimple-predicate-analysis.cc (predicate::init_from_control_deps):
10163 Correct a function pre/postcondition.
10165 2021-09-19 Martin Sebor <msebor@redhat.com>
10167 PR middle-end/102243
10168 * tree-ssa-strlen.c (get_range): Handle null cfun.
10170 2021-09-19 Iain Sandoe <iain@sandoe.co.uk>
10172 * config/darwin.h (LINK_COMMAND_SPEC_A): Use Darwin10
10173 unwinder shim as a convenience library.
10175 2021-09-19 Andrew Pinski <apinski@marvell.com>
10177 * doc/install.texi: Add note about
10178 binutils 2.35 is required for LTO usage.
10180 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
10182 * tree-ssa-threadbackward.c
10183 (back_threader_registry::register_path): Use push_edge.
10184 * tree-ssa-threadedge.c
10185 (jump_threader::thread_around_empty_blocks): Same.
10186 (jump_threader::thread_through_normal_block): Same.
10187 (jump_threader::thread_across_edge): Same. Also, use auto_bitmap.
10189 * tree-ssa-threadupdate.c
10190 (jt_path_registry::allocate_thread_edge): Remove.
10191 (jt_path_registry::push_edge): New.
10192 (dump_jump_thread_path): Make static.
10193 * tree-ssa-threadupdate.h (allocate_thread_edge): Remove.
10196 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
10198 * gimple-range-path.cc (path_range_query::path_range_query): Add
10200 (path_range_query::dump): Remove extern declaration of dump_ranger.
10201 * gimple-range-trace.cc (dump_ranger): Add DEBUG_FUNCTION marker.
10202 * gimple-range-trace.h (dump_ranger): Add prototype.
10204 2021-09-19 John Ericson <git@JohnEricson.me>
10206 * gcc.c (find_a_program): New function, factored out of...
10207 (find_a_file): Here.
10208 (execute): Use find_a_program when looking for programs rather
10211 2021-09-19 Matwey V. Kornilov <matwey.kornilov@gmail.com>
10213 * config/avr/avr-mcus.def: Add atmega324pb.
10214 * doc/avr-mmcu.texi: Corresponding changes.
10216 2021-09-19 Roger Sayle <roger@nextmovesoftware.com>
10218 PR middle-end/88173
10219 * match.pd (cmp @0 REAL_CST@1): When @0 is also REAL_CST, apply
10220 the same transformations as to @1. For comparisons against NaN,
10221 don't check HONOR_SNANS but confirm that neither operand is a
10224 2021-09-19 Benjamin Peterson <benjamin@locrian.net>
10226 * attribs.c (make_unique_name): Delete.
10227 * attribs.h (make_unique_name): Delete.
10229 2021-09-19 Andrew Pinski <apinski@marvell.com>
10231 * lra-constraints.c (check_and_process_move): Assert
10232 that dclass and sclass are greater than or equal to NO_REGS.
10234 2021-09-18 Jakub Jelinek <jakub@redhat.com>
10236 * tree.h (OMP_CLAUSE_ORDER_UNCONSTRAINED): Define.
10237 * tree-pretty-print.c (dump_omp_clause): Print unconstrained:
10238 for OMP_CLAUSE_ORDER_UNCONSTRAINED.
10240 2021-09-18 liuhongt <hongtao.liu@intel.com>
10242 * config/i386/i386-features.c (remove_partial_avx_dependency):
10243 Restrict TARGET_USE_VECTOR_FP_CONVERTS and
10244 TARGET_USE_VECTOR_CONVERTS to conversion instructions only.
10246 2021-09-18 Jakub Jelinek <jakub@redhat.com>
10248 * gimplify.c (omp_default_clause): For C/C++ default({,first}private),
10249 if file/namespace scope variable doesn't have predetermined sharing,
10250 treat it as if there was default(none).
10252 2021-09-18 liuhongt <hongtao.liu@intel.com>
10254 * config/i386/avx512fp16intrin.h (_mm_fmadd_sh):
10256 (_mm_mask_fmadd_sh): Likewise.
10257 (_mm_mask3_fmadd_sh): Likewise.
10258 (_mm_maskz_fmadd_sh): Likewise.
10259 (_mm_fmadd_round_sh): Likewise.
10260 (_mm_mask_fmadd_round_sh): Likewise.
10261 (_mm_mask3_fmadd_round_sh): Likewise.
10262 (_mm_maskz_fmadd_round_sh): Likewise.
10263 (_mm_fnmadd_sh): Likewise.
10264 (_mm_mask_fnmadd_sh): Likewise.
10265 (_mm_mask3_fnmadd_sh): Likewise.
10266 (_mm_maskz_fnmadd_sh): Likewise.
10267 (_mm_fnmadd_round_sh): Likewise.
10268 (_mm_mask_fnmadd_round_sh): Likewise.
10269 (_mm_mask3_fnmadd_round_sh): Likewise.
10270 (_mm_maskz_fnmadd_round_sh): Likewise.
10271 (_mm_fmsub_sh): Likewise.
10272 (_mm_mask_fmsub_sh): Likewise.
10273 (_mm_mask3_fmsub_sh): Likewise.
10274 (_mm_maskz_fmsub_sh): Likewise.
10275 (_mm_fmsub_round_sh): Likewise.
10276 (_mm_mask_fmsub_round_sh): Likewise.
10277 (_mm_mask3_fmsub_round_sh): Likewise.
10278 (_mm_maskz_fmsub_round_sh): Likewise.
10279 (_mm_fnmsub_sh): Likewise.
10280 (_mm_mask_fnmsub_sh): Likewise.
10281 (_mm_mask3_fnmsub_sh): Likewise.
10282 (_mm_maskz_fnmsub_sh): Likewise.
10283 (_mm_fnmsub_round_sh): Likewise.
10284 (_mm_mask_fnmsub_round_sh): Likewise.
10285 (_mm_mask3_fnmsub_round_sh): Likewise.
10286 (_mm_maskz_fnmsub_round_sh): Likewise.
10287 * config/i386/i386-builtin-types.def
10288 (V8HF_FTYPE_V8HF_V8HF_V8HF_UQI_INT): New builtin type.
10289 * config/i386/i386-builtin.def: Add new builtins.
10290 * config/i386/i386-expand.c: Handle new builtin type.
10291 * config/i386/sse.md (fmai_vmfmadd_<mode><round_name>):
10292 Ajdust to support FP16.
10293 (fmai_vmfmsub_<mode><round_name>): Ditto.
10294 (fmai_vmfnmadd_<mode><round_name>): Ditto.
10295 (fmai_vmfnmsub_<mode><round_name>): Ditto.
10296 (*fmai_fmadd_<mode>): Ditto.
10297 (*fmai_fmsub_<mode>): Ditto.
10298 (*fmai_fnmadd_<mode><round_name>): Ditto.
10299 (*fmai_fnmsub_<mode><round_name>): Ditto.
10300 (avx512f_vmfmadd_<mode>_mask<round_name>): Ditto.
10301 (avx512f_vmfmadd_<mode>_mask3<round_name>): Ditto.
10302 (avx512f_vmfmadd_<mode>_maskz<round_expand_name>): Ditto.
10303 (avx512f_vmfmadd_<mode>_maskz_1<round_name>): Ditto.
10304 (*avx512f_vmfmsub_<mode>_mask<round_name>): Ditto.
10305 (avx512f_vmfmsub_<mode>_mask3<round_name>): Ditto.
10306 (*avx512f_vmfmsub_<mode>_maskz_1<round_name>): Ditto.
10307 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
10308 (*avx512f_vmfnmsub_<mode>_mask3<round_name>): Ditto.
10309 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
10310 (*avx512f_vmfnmadd_<mode>_mask<round_name>): Renamed to ...
10311 (avx512f_vmfnmadd_<mode>_mask<round_name>) ... this, and
10312 adjust to support FP16.
10313 (avx512f_vmfnmadd_<mode>_mask3<round_name>): Ditto.
10314 (avx512f_vmfnmadd_<mode>_maskz_1<round_name>): Ditto.
10315 (avx512f_vmfnmadd_<mode>_maskz<round_expand_name>): New
10318 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
10320 * config/i386/sse.md (avx512fmaskmodelower): Extend to support
10322 (maskload<mode><avx512fmaskmodelower>): Ditto.
10323 (maskstore<mode><avx512fmaskmodelower>): Ditto.
10325 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
10327 * config/i386/i386-expand.c (ix86_expand_fp_absneg_operator):
10329 (ix86_expand_copysign): Ditto.
10330 (ix86_expand_xorsign): Ditto.
10331 * config/i386/i386.c (ix86_build_const_vector): Handle HF vector
10333 (ix86_build_signbit_mask): Ditto.
10334 (ix86_can_change_mode_class): Ditto.
10335 * config/i386/i386.md
10336 (SSEMODEF): Add HFmode.
10337 (ssevecmodef): Ditto.
10338 (<code>hf2): New define_expand.
10339 (*<code>hf2_1): New define_insn_and_split.
10340 (copysign<mode>): Extend to support HFmode under AVX512FP16.
10341 (xorsign<mode>): Ditto.
10342 * config/i386/sse.md (VFB): New mode iterator.
10343 (VFB_128_256): Ditto.
10345 (sseintvecmode2): Support HF vector mode.
10346 (<code><mode>2): Use new mode iterator.
10347 (*<code><mode>2): Ditto.
10348 (copysign<mode>3): Ditto.
10349 (xorsign<mode>3): Ditto.
10350 (<code><mode>3<mask_name>): Ditto.
10351 (<code><mode>3<mask_name>): Ditto.
10352 (<sse>_andnot<mode>3<mask_name>): Adjust for HF vector mode.
10353 (<sse>_andnot<mode>3<mask_name>): Ditto.
10354 (*<code><mode>3<mask_name>): Ditto.
10355 (*<code><mode>3<mask_name>): Ditto.
10357 2021-09-18 liuhongt <hongtao.liu@intel.com>
10359 * config/i386/avx512fp16intrin.h (_mm512_mask_fmadd_ph):
10361 (_mm512_mask3_fmadd_ph): Likewise.
10362 (_mm512_maskz_fmadd_ph): Likewise.
10363 (_mm512_fmadd_round_ph): Likewise.
10364 (_mm512_mask_fmadd_round_ph): Likewise.
10365 (_mm512_mask3_fmadd_round_ph): Likewise.
10366 (_mm512_maskz_fmadd_round_ph): Likewise.
10367 (_mm512_fnmadd_ph): Likewise.
10368 (_mm512_mask_fnmadd_ph): Likewise.
10369 (_mm512_mask3_fnmadd_ph): Likewise.
10370 (_mm512_maskz_fnmadd_ph): Likewise.
10371 (_mm512_fnmadd_round_ph): Likewise.
10372 (_mm512_mask_fnmadd_round_ph): Likewise.
10373 (_mm512_mask3_fnmadd_round_ph): Likewise.
10374 (_mm512_maskz_fnmadd_round_ph): Likewise.
10375 (_mm512_fmsub_ph): Likewise.
10376 (_mm512_mask_fmsub_ph): Likewise.
10377 (_mm512_mask3_fmsub_ph): Likewise.
10378 (_mm512_maskz_fmsub_ph): Likewise.
10379 (_mm512_fmsub_round_ph): Likewise.
10380 (_mm512_mask_fmsub_round_ph): Likewise.
10381 (_mm512_mask3_fmsub_round_ph): Likewise.
10382 (_mm512_maskz_fmsub_round_ph): Likewise.
10383 (_mm512_fnmsub_ph): Likewise.
10384 (_mm512_mask_fnmsub_ph): Likewise.
10385 (_mm512_mask3_fnmsub_ph): Likewise.
10386 (_mm512_maskz_fnmsub_ph): Likewise.
10387 (_mm512_fnmsub_round_ph): Likewise.
10388 (_mm512_mask_fnmsub_round_ph): Likewise.
10389 (_mm512_mask3_fnmsub_round_ph): Likewise.
10390 (_mm512_maskz_fnmsub_round_ph): Likewise.
10391 * config/i386/avx512fp16vlintrin.h (_mm256_fmadd_ph):
10393 (_mm256_mask_fmadd_ph): Likewise.
10394 (_mm256_mask3_fmadd_ph): Likewise.
10395 (_mm256_maskz_fmadd_ph): Likewise.
10396 (_mm_fmadd_ph): Likewise.
10397 (_mm_mask_fmadd_ph): Likewise.
10398 (_mm_mask3_fmadd_ph): Likewise.
10399 (_mm_maskz_fmadd_ph): Likewise.
10400 (_mm256_fnmadd_ph): Likewise.
10401 (_mm256_mask_fnmadd_ph): Likewise.
10402 (_mm256_mask3_fnmadd_ph): Likewise.
10403 (_mm256_maskz_fnmadd_ph): Likewise.
10404 (_mm_fnmadd_ph): Likewise.
10405 (_mm_mask_fnmadd_ph): Likewise.
10406 (_mm_mask3_fnmadd_ph): Likewise.
10407 (_mm_maskz_fnmadd_ph): Likewise.
10408 (_mm256_fmsub_ph): Likewise.
10409 (_mm256_mask_fmsub_ph): Likewise.
10410 (_mm256_mask3_fmsub_ph): Likewise.
10411 (_mm256_maskz_fmsub_ph): Likewise.
10412 (_mm_fmsub_ph): Likewise.
10413 (_mm_mask_fmsub_ph): Likewise.
10414 (_mm_mask3_fmsub_ph): Likewise.
10415 (_mm_maskz_fmsub_ph): Likewise.
10416 (_mm256_fnmsub_ph): Likewise.
10417 (_mm256_mask_fnmsub_ph): Likewise.
10418 (_mm256_mask3_fnmsub_ph): Likewise.
10419 (_mm256_maskz_fnmsub_ph): Likewise.
10420 (_mm_fnmsub_ph): Likewise.
10421 (_mm_mask_fnmsub_ph): Likewise.
10422 (_mm_mask3_fnmsub_ph): Likewise.
10423 (_mm_maskz_fnmsub_ph): Likewise.
10424 * config/i386/i386-builtin.def: Add corresponding new builtins.
10425 * config/i386/sse.md
10426 (<avx512>_fmadd_<mode>_maskz<round_expand_name>): Adjust to
10427 support HF vector modes.
10428 (<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name><round_name>):
10430 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
10431 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
10432 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
10433 (<avx512>_fmadd_<mode>_mask<round_name>): Ditto.
10434 (<avx512>_fmadd_<mode>_mask3<round_name>): Ditto.
10435 (<avx512>_fmsub_<mode>_maskz<round_expand_name>): Ditto.
10436 (<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name><round_name>):
10438 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
10439 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
10440 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
10441 (<avx512>_fmsub_<mode>_mask<round_name>): Ditto.
10442 (<avx512>_fmsub_<mode>_mask3<round_name>): Ditto.
10443 (<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name><round_name>):
10445 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
10446 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
10447 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
10448 (<avx512>_fnmadd_<mode>_mask<round_name>): Ditto.
10449 (<avx512>_fnmadd_<mode>_mask3<round_name>): Ditto.
10450 (<avx512>_fnmsub_<mode>_maskz<round_expand_name>): Ditto.
10451 (<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name><round_name>):
10453 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
10454 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
10455 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
10456 (<avx512>_fnmsub_<mode>_mask<round_name>): Ditto.
10457 (<avx512>_fnmsub_<mode>_mask3<round_name>): Ditto.
10459 2021-09-18 liuhongt <hongtao.liu@intel.com>
10461 * config/i386/avx512fp16intrin.h (_mm512_fmaddsub_ph):
10463 (_mm512_mask_fmaddsub_ph): Likewise.
10464 (_mm512_mask3_fmaddsub_ph): Likewise.
10465 (_mm512_maskz_fmaddsub_ph): Likewise.
10466 (_mm512_fmaddsub_round_ph): Likewise.
10467 (_mm512_mask_fmaddsub_round_ph): Likewise.
10468 (_mm512_mask3_fmaddsub_round_ph): Likewise.
10469 (_mm512_maskz_fmaddsub_round_ph): Likewise.
10470 (_mm512_mask_fmsubadd_ph): Likewise.
10471 (_mm512_mask3_fmsubadd_ph): Likewise.
10472 (_mm512_maskz_fmsubadd_ph): Likewise.
10473 (_mm512_fmsubadd_round_ph): Likewise.
10474 (_mm512_mask_fmsubadd_round_ph): Likewise.
10475 (_mm512_mask3_fmsubadd_round_ph): Likewise.
10476 (_mm512_maskz_fmsubadd_round_ph): Likewise.
10477 * config/i386/avx512fp16vlintrin.h (_mm256_fmaddsub_ph):
10479 (_mm256_mask_fmaddsub_ph): Likewise.
10480 (_mm256_mask3_fmaddsub_ph): Likewise.
10481 (_mm256_maskz_fmaddsub_ph): Likewise.
10482 (_mm_fmaddsub_ph): Likewise.
10483 (_mm_mask_fmaddsub_ph): Likewise.
10484 (_mm_mask3_fmaddsub_ph): Likewise.
10485 (_mm_maskz_fmaddsub_ph): Likewise.
10486 (_mm256_fmsubadd_ph): Likewise.
10487 (_mm256_mask_fmsubadd_ph): Likewise.
10488 (_mm256_mask3_fmsubadd_ph): Likewise.
10489 (_mm256_maskz_fmsubadd_ph): Likewise.
10490 (_mm_fmsubadd_ph): Likewise.
10491 (_mm_mask_fmsubadd_ph): Likewise.
10492 (_mm_mask3_fmsubadd_ph): Likewise.
10493 (_mm_maskz_fmsubadd_ph): Likewise.
10494 * config/i386/i386-builtin.def: Add corresponding new builtins.
10495 * config/i386/sse.md (VFH_SF_AVX512VL): New mode iterator.
10496 * (<avx512>_fmsubadd_<mode>_maskz<round_expand_name>): New expander.
10497 * (<avx512>_fmaddsub_<mode>_maskz<round_expand_name>): Use
10499 * (<sd_mask_codefor>fma_fmaddsub_<mode><sd_maskz_name><round_name>):
10501 * (<avx512>_fmaddsub_<mode>_mask<round_name>): Ditto.
10502 * (<avx512>_fmaddsub_<mode>_mask3<round_name>): Ditto.
10503 * (<sd_mask_codefor>fma_fmsubadd_<mode><sd_maskz_name><round_name>):
10505 * (<avx512>_fmsubadd_<mode>_mask<round_name>): Ditto.
10506 * (<avx512>_fmsubadd_<mode>_mask3<round_name>): Ditto.
10508 2021-09-18 liuhongt <hongtao.liu@intel.com>
10511 * config/i386/i386.c (ix86_print_operand): Handle
10512 V8HF/V16HF/V32HFmode.
10513 * config/i386/i386.h (VALID_BCST_MODE_P): Add HFmode.
10514 * config/i386/sse.md (avx512bcst): Remove.
10516 2021-09-17 Martin Sebor <msebor@redhat.com>
10518 * Makefile.in (OBJS): Add gimple-predicate-analysis.o.
10519 * tree-ssa-uninit.c (max_phi_args): Move to gimple-predicate-analysis.
10520 (MASK_SET_BIT, MASK_TEST_BIT, MASK_EMPTY): Same.
10521 (check_defs): Add comment.
10522 (can_skip_redundant_opnd): Update comment.
10523 (compute_uninit_opnds_pos): Adjust to namespace change.
10524 (find_pdom): Move to gimple-predicate-analysis.cc.
10526 (struct uninit_undef_val_t): New.
10527 (is_non_loop_exit_postdominating): Move to gimple-predicate-analysis.cc.
10528 (find_control_equiv_block): Same.
10529 (MAX_NUM_CHAINS, MAX_CHAIN_LEN, MAX_POSTDOM_CHECK): Same.
10530 (MAX_SWITCH_CASES): Same.
10531 (compute_control_dep_chain): Same.
10532 (find_uninit_use): Use predicate analyzer.
10533 (struct pred_info): Move to gimple-predicate-analysis.
10534 (convert_control_dep_chain_into_preds): Same.
10535 (find_predicates): Same.
10536 (collect_phi_def_edges): Same.
10537 (warn_uninitialized_phi): Use predicate analyzer.
10538 (find_def_preds): Move to gimple-predicate-analysis.
10539 (dump_pred_info): Same.
10540 (dump_pred_chain): Same.
10541 (dump_predicates): Same.
10542 (destroy_predicate_vecs): Remove.
10543 (execute_late_warn_uninitialized): New.
10544 (get_cmp_code): Move to gimple-predicate-analysis.
10545 (is_value_included_in): Same.
10546 (value_sat_pred_p): Same.
10547 (find_matching_predicate_in_rest_chains): Same.
10548 (is_use_properly_guarded): Same.
10549 (prune_uninit_phi_opnds): Same.
10550 (find_var_cmp_const): Same.
10551 (use_pred_not_overlap_with_undef_path_pred): Same.
10552 (pred_equal_p): Same.
10553 (is_neq_relop_p): Same.
10554 (is_neq_zero_form_p): Same.
10555 (pred_expr_equal_p): Same.
10556 (is_pred_expr_subset_of): Same.
10557 (is_pred_chain_subset_of): Same.
10558 (is_included_in): Same.
10559 (is_superset_of): Same.
10560 (pred_neg_p): Same.
10561 (simplify_pred): Same.
10562 (simplify_preds_2): Same.
10563 (simplify_preds_3): Same.
10564 (simplify_preds_4): Same.
10565 (simplify_preds): Same.
10567 (push_to_worklist): Same.
10568 (get_pred_info_from_cmp): Same.
10569 (is_degenerated_phi): Same.
10570 (normalize_one_pred_1): Same.
10571 (normalize_one_pred): Same.
10572 (normalize_one_pred_chain): Same.
10573 (normalize_preds): Same.
10574 (can_one_predicate_be_invalidated_p): Same.
10575 (can_chain_union_be_invalidated_p): Same.
10576 (uninit_uses_cannot_happen): Same.
10577 (pass_late_warn_uninitialized::execute): Define.
10578 * gimple-predicate-analysis.cc: New file.
10579 * gimple-predicate-analysis.h: New file.
10581 2021-09-17 Julian Brown <julian@codesourcery.com>
10583 * config/gcn/gcn.c (gimple.h): Include.
10584 (gcn_fork_join): Emit barrier for worker-level joins.
10585 * omp-oacc-neuter-broadcast.cc (find_local_vars_to_propagate): Add
10586 writes_gang_private bitmap parameter. Set bit for blocks
10587 containing gang-private variable writes.
10588 (worker_single_simple): Don't emit barrier after predicated block.
10589 (worker_single_copy): Don't emit barrier if we're not broadcasting
10590 anything and the block contains no gang-private writes.
10591 (neuter_worker_single): Don't predicate blocks that only contain
10592 NOPs or internal marker functions. Pass has_gang_private_write
10593 argument to worker_single_copy.
10594 (oacc_do_neutering): Add writes_gang_private bitmap handling.
10596 2021-09-17 Julian Brown <julian@codesourcery.com>
10598 * config/gcn/gcn-protos.h
10599 (gcn_goacc_create_worker_broadcast_record): Update prototype.
10600 * config/gcn/gcn-tree.c (gcn_goacc_get_worker_red_decl): Use
10601 preallocated block of LDS memory. Do not cache/share decls for
10602 reduction temporaries between invocations.
10603 (gcn_goacc_reduction_teardown): Unshare VAR on second use.
10604 (gcn_goacc_create_worker_broadcast_record): Add OFFSET parameter
10605 and return temporary LDS space at that offset. Return pointer in
10607 * config/gcn/gcn.c (acc_lds_size, gang_private_hwm, lds_allocs):
10609 (ACC_LDS_SIZE): Define as acc_lds_size.
10610 (gcn_init_machine_status): Don't initialise lds_allocated,
10611 lds_allocs, reduc_decls fields of machine function struct.
10612 (gcn_option_override): Handle default size for gang-private
10613 variables and -mgang-private-size option.
10614 (gcn_expand_prologue): Use LDS_SIZE instead of LDS_SIZE-1 when
10615 initialising M0_REG.
10616 (gcn_shared_mem_layout): New function.
10617 (gcn_print_lds_decl): Update comment. Use global lds_allocs map and
10618 gang_private_hwm variable.
10619 (TARGET_GOACC_SHARED_MEM_LAYOUT): Define target hook.
10620 * config/gcn/gcn.h (machine_function): Remove lds_allocated,
10621 lds_allocs, reduc_decls. Add reduction_base, reduction_limit.
10622 * config/gcn/gcn.opt (gang_private_size_opt): New global.
10623 (mgang-private-size=): New option.
10624 * doc/tm.texi.in (TARGET_GOACC_SHARED_MEM_LAYOUT): Place
10625 documentation hook.
10626 * doc/tm.texi: Regenerate.
10627 * omp-oacc-neuter-broadcast.cc (targhooks.h, diagnostic-core.h):
10629 (build_sender_ref): Handle sender_decl being pointer.
10630 (worker_single_copy): Add PLACEMENT and ISOLATE_BROADCASTS
10631 parameters. Pass placement argument to
10632 create_worker_broadcast_record hook invocations. Handle
10633 sender_decl being pointer and isolate_broadcasts inserting extra
10635 (blk_offset_map_t): Add typedef.
10636 (neuter_worker_single): Add BLK_OFFSET_MAP parameter. Pass
10637 preallocated range to worker_single_copy call.
10638 (dfs_broadcast_reachable_1): New function.
10639 (idx_decl_pair_t, used_range_vec_t): New typedefs.
10640 (sort_size_descending): New function.
10641 (addr_range): New class.
10642 (splay_tree_compare_addr_range, splay_tree_free_key)
10643 (first_fit_range, merge_ranges_1, merge_ranges): New functions.
10644 (execute_omp_oacc_neuter_broadcast): Rename to...
10645 (oacc_do_neutering): ... this. Add BOUNDS_LO, BOUNDS_HI
10646 parameters. Arrange layout of shared memory for broadcast
10648 (execute_omp_oacc_neuter_broadcast): New function.
10649 (pass_omp_oacc_neuter_broadcast::gate): Remove num_workers==1
10650 handling from here. Enable pass for all OpenACC routines in order
10651 to call shared memory-layout hook.
10652 * target.def (create_worker_broadcast_record): Add OFFSET
10654 (shared_mem_layout): New hook.
10656 2021-09-17 Julian Brown <julian@codesourcery.com>
10657 Thomas Schwinge <thomas@codesourcery.com>
10659 * omp-oacc-neuter-broadcast.cc
10660 (pass_omp_oacc_neuter_broadcast::gate): Disable if num_workers is
10662 (execute_omp_oacc_neuter_broadcast): Adjust.
10664 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
10666 * value-relation.cc (class equiv_chain): Move to header file.
10667 (path_oracle::path_oracle): New.
10668 (path_oracle::~path_oracle): New.
10669 (path_oracle::register_relation): New.
10670 (path_oracle::query_relation): New.
10671 (path_oracle::reset_path): New.
10672 (path_oracle::dump): New.
10673 * value-relation.h (class equiv_chain): Move to here.
10674 (class path_oracle): New.
10676 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
10678 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a DOM
10680 * gimple-range-fold.cc (fur_depend::register_relation): Use
10681 register_stmt/edge routines.
10682 * value-relation.cc (equiv_chain::find): Relocate from equiv_oracle.
10683 (equiv_oracle::equiv_oracle): Create self equivalence cache.
10684 (equiv_oracle::~equiv_oracle): Release same.
10685 (equiv_oracle::equiv_set): Return entry from self equiv cache if there
10686 are no equivalences.
10687 (equiv_oracle::find_equiv_block): Move list find to equiv_chain.
10688 (equiv_oracle::register_relation): Rename from register_equiv.
10689 (relation_chain_head::find_relation): Relocate from dom_oracle.
10690 (relation_oracle::register_stmt): New.
10691 (relation_oracle::register_edge): New.
10692 (dom_oracle::*): Rename from relation_oracle.
10693 (dom_oracle::register_relation): Adjust to call equiv_oracle.
10694 (dom_oracle::set_one_relation): Split from register_relation.
10695 (dom_oracle::register_transitives): Consolidate 2 methods.
10696 (dom_oracle::find_relation_block): Move core to relation_chain.
10697 (dom_oracle::query_relation): Rename from find_relation_dom and adjust.
10698 * value-relation.h (class relation_oracle): New pure virtual base.
10699 (class equiv_oracle): Inherit from relation_oracle and adjust.
10700 (class dom_oracle): Rename from old relation_oracle and adjust.
10702 2021-09-17 Martin Sebor <msebor@redhat.com>
10704 PR middle-end/102200
10705 * pointer-query.cc (access_ref::inform_access): Handle MIN/MAX_EXPR.
10706 (handle_min_max_size): Change argument. Store original SSA_NAME for
10707 operands to potentially distinct (sub)objects.
10708 (compute_objsize_r): Adjust call to the above.
10710 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10712 * config/rs6000/rs6000.c (rs6000-builtins.h): New include.
10713 (rs6000_new_builtin_vectorized_function): New function.
10714 (rs6000_new_builtin_md_vectorized_function): Likewise.
10715 (rs6000_builtin_vectorized_function): Call
10716 rs6000_new_builtin_vectorized_function.
10717 (rs6000_builtin_md_vectorized_function): Call
10718 rs6000_new_builtin_md_vectorized_function.
10720 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10722 * config/rs6000/rs6000-builtin-new.def (ASSEMBLE_ACC): Add mmaint flag.
10723 (ASSEMBLE_PAIR): Likewise.
10724 (BUILD_ACC): Likewise.
10725 (DISASSEMBLE_ACC): Likewise.
10726 (DISASSEMBLE_PAIR): Likewise.
10727 (PMXVBF16GER2): Likewise.
10728 (PMXVBF16GER2NN): Likewise.
10729 (PMXVBF16GER2NP): Likewise.
10730 (PMXVBF16GER2PN): Likewise.
10731 (PMXVBF16GER2PP): Likewise.
10732 (PMXVF16GER2): Likewise.
10733 (PMXVF16GER2NN): Likewise.
10734 (PMXVF16GER2NP): Likewise.
10735 (PMXVF16GER2PN): Likewise.
10736 (PMXVF16GER2PP): Likewise.
10737 (PMXVF32GER): Likewise.
10738 (PMXVF32GERNN): Likewise.
10739 (PMXVF32GERNP): Likewise.
10740 (PMXVF32GERPN): Likewise.
10741 (PMXVF32GERPP): Likewise.
10742 (PMXVF64GER): Likewise.
10743 (PMXVF64GERNN): Likewise.
10744 (PMXVF64GERNP): Likewise.
10745 (PMXVF64GERPN): Likewise.
10746 (PMXVF64GERPP): Likewise.
10747 (PMXVI16GER2): Likewise.
10748 (PMXVI16GER2PP): Likewise.
10749 (PMXVI16GER2S): Likewise.
10750 (PMXVI16GER2SPP): Likewise.
10751 (PMXVI4GER8): Likewise.
10752 (PMXVI4GER8PP): Likewise.
10753 (PMXVI8GER4): Likewise.
10754 (PMXVI8GER4PP): Likewise.
10755 (PMXVI8GER4SPP): Likewise.
10756 (XVBF16GER2): Likewise.
10757 (XVBF16GER2NN): Likewise.
10758 (XVBF16GER2NP): Likewise.
10759 (XVBF16GER2PN): Likewise.
10760 (XVBF16GER2PP): Likewise.
10761 (XVF16GER2): Likewise.
10762 (XVF16GER2NN): Likewise.
10763 (XVF16GER2NP): Likewise.
10764 (XVF16GER2PN): Likewise.
10765 (XVF16GER2PP): Likewise.
10766 (XVF32GER): Likewise.
10767 (XVF32GERNN): Likewise.
10768 (XVF32GERNP): Likewise.
10769 (XVF32GERPN): Likewise.
10770 (XVF32GERPP): Likewise.
10771 (XVF64GER): Likewise.
10772 (XVF64GERNN): Likewise.
10773 (XVF64GERNP): Likewise.
10774 (XVF64GERPN): Likewise.
10775 (XVF64GERPP): Likewise.
10776 (XVI16GER2): Likewise.
10777 (XVI16GER2PP): Likewise.
10778 (XVI16GER2S): Likewise.
10779 (XVI16GER2SPP): Likewise.
10780 (XVI4GER8): Likewise.
10781 (XVI4GER8PP): Likewise.
10782 (XVI8GER4): Likewise.
10783 (XVI8GER4PP): Likewise.
10784 (XVI8GER4SPP): Likewise.
10785 (XXMFACC): Likewise.
10786 (XXMTACC): Likewise.
10787 (XXSETACCZ): Likewise.
10788 (ASSEMBLE_PAIR_V): Likewise.
10789 (BUILD_PAIR): Likewise.
10790 (DISASSEMBLE_PAIR_V): Likewise.
10793 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_mma_builtin):
10794 Handle RS6000_BIF_LXVP and RS6000_BIF_STXVP.
10795 * config/rs6000/rs6000-gen-builtins.c (attrinfo): Add ismmaint.
10796 (parse_bif_attrs): Handle ismmaint.
10797 (write_decls): Add bif_mmaint_bit and bif_is_mmaint.
10798 (write_bif_static_init): Handle ismmaint.
10800 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10802 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin): New
10804 (rs6000_gimple_fold_builtin): Call rs6000_gimple_fold_new_builtin.
10805 (rs6000_new_builtin_valid_without_lhs): New function.
10806 (rs6000_gimple_fold_new_mma_builtin): Likewise.
10807 (rs6000_gimple_fold_new_builtin): Likewise.
10809 2021-09-17 Thomas Schwinge <thomas@codesourcery.com>
10811 * hash-table.h (hash_table<Descriptor, Lazy, Allocator>::expand):
10812 Destruct stale Value objects.
10813 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
10816 2021-09-17 Roger Sayle <roger@nextmovesoftware.com>
10819 * match.pd (shift optimizations): Disable recent sign-changing
10820 optimization for shifts by zero, these will be folded later.
10822 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10824 * config/rs6000/rs6000-builtin-new.def (__builtin_mffsl): Move from
10825 [power9] to [always].
10827 2021-09-17 Richard Biener <rguenther@suse.de>
10829 * tree-vect-stmts.c (vectorizable_load): Do not frob
10832 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10834 * config/i386/i386-features.c (remove_partial_avx_dependency):
10835 Also check TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY and
10836 and TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY before generating
10838 * config/i386/i386.h (TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY):
10840 (TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
10841 * config/i386/i386.md (SSE FP to FP splitters): Replace
10842 TARGET_SSE_PARTIAL_REG_DEPENDENCY with
10843 TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY.
10844 (SSE INT to FP splitter): Replace TARGET_SSE_PARTIAL_REG_DEPENDENCY
10845 with TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY.
10846 * config/i386/x86-tune.def
10847 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): New.
10848 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
10850 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10853 * config/i386/i386-features.c (remove_partial_avx_dependency):
10854 Check TARGET_USE_VECTOR_FP_CONVERTS and TARGET_USE_VECTOR_CONVERTS
10855 before generating vxorps.
10857 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10859 * config/i386/i386-options.c (processor_cost_table): Use
10860 tremont_cost for Tremont.
10861 * config/i386/x86-tune-costs.h (tremont_memcpy): New.
10862 (tremont_memset): Likewise.
10863 (tremont_cost): Likewise.
10864 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
10865 Enable for Tremont.
10867 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10869 * common/config/i386/i386-common.c: Use Haswell scheduling model
10871 * config/i386/i386.c (ix86_sched_init_global): Prepare for Tremont
10873 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Tremont
10875 (ix86_adjust_cost): Handle Tremont.
10876 * config/i386/x86-tune.def (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY):
10877 Enable for Tremont.
10878 (X86_TUNE_USE_LEAVE): Likewise.
10879 (X86_TUNE_PUSH_MEMORY): Likewise.
10880 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
10881 (X86_TUNE_USE_CLTD): Likewise.
10882 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
10883 (X86_TUNE_AVOID_MFENCE): Likewise.
10884 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
10885 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
10886 (X86_TUNE_ACCUMULATE_OUTGOING_ARGS): Disable for Tremont.
10887 (X86_TUNE_FOUR_JUMP_LIMIT): Likewise.
10888 (X86_TUNE_OPT_AGU): Likewise.
10889 (X86_TUNE_AVOID_LEA_FOR_ADDR): Likewise.
10890 (X86_TUNE_AVOID_MEM_OPND_FOR_CMOVE): Likewise.
10891 (X86_TUNE_EXPAND_ABS): Likewise.
10892 (X86_TUNE_SPLIT_MEM_OPND_FOR_FP_CONVERTS): Likewise.
10893 (X86_TUNE_SLOW_PSHUFB): Likewise.
10895 2021-09-17 Eric Botcazou <ebotcazou@adacore.com>
10897 PR rtl-optimization/102306
10898 * combine.c (try_combine): Abort the combination if we are about to
10899 duplicate volatile references.
10901 2021-09-17 liuhongt <hongtao.liu@intel.com>
10903 * config/i386/avx512fp16intrin.h (_mm_undefined_ph):
10905 (_mm256_undefined_ph): Likewise.
10906 (_mm512_undefined_ph): Likewise.
10907 (_mm_cvtsh_h): Likewise.
10908 (_mm256_cvtsh_h): Likewise.
10909 (_mm512_cvtsh_h): Likewise.
10910 (_mm512_castph_ps): Likewise.
10911 (_mm512_castph_pd): Likewise.
10912 (_mm512_castph_si512): Likewise.
10913 (_mm512_castph512_ph128): Likewise.
10914 (_mm512_castph512_ph256): Likewise.
10915 (_mm512_castph128_ph512): Likewise.
10916 (_mm512_castph256_ph512): Likewise.
10917 (_mm512_zextph128_ph512): Likewise.
10918 (_mm512_zextph256_ph512): Likewise.
10919 (_mm512_castps_ph): Likewise.
10920 (_mm512_castpd_ph): Likewise.
10921 (_mm512_castsi512_ph): Likewise.
10922 * config/i386/avx512fp16vlintrin.h (_mm_castph_ps):
10924 (_mm256_castph_ps): Likewise.
10925 (_mm_castph_pd): Likewise.
10926 (_mm256_castph_pd): Likewise.
10927 (_mm_castph_si128): Likewise.
10928 (_mm256_castph_si256): Likewise.
10929 (_mm_castps_ph): Likewise.
10930 (_mm256_castps_ph): Likewise.
10931 (_mm_castpd_ph): Likewise.
10932 (_mm256_castpd_ph): Likewise.
10933 (_mm_castsi128_ph): Likewise.
10934 (_mm256_castsi256_ph): Likewise.
10935 (_mm256_castph256_ph128): Likewise.
10936 (_mm256_castph128_ph256): Likewise.
10937 (_mm256_zextph128_ph256): Likewise.
10939 2021-09-17 liuhongt <hongtao.liu@intel.com>
10941 * config/i386/avx512fp16intrin.h (_mm_cvtsh_ss):
10943 (_mm_mask_cvtsh_ss): Likewise.
10944 (_mm_maskz_cvtsh_ss): Likewise.
10945 (_mm_cvtsh_sd): Likewise.
10946 (_mm_mask_cvtsh_sd): Likewise.
10947 (_mm_maskz_cvtsh_sd): Likewise.
10948 (_mm_cvt_roundsh_ss): Likewise.
10949 (_mm_mask_cvt_roundsh_ss): Likewise.
10950 (_mm_maskz_cvt_roundsh_ss): Likewise.
10951 (_mm_cvt_roundsh_sd): Likewise.
10952 (_mm_mask_cvt_roundsh_sd): Likewise.
10953 (_mm_maskz_cvt_roundsh_sd): Likewise.
10954 (_mm_cvtss_sh): Likewise.
10955 (_mm_mask_cvtss_sh): Likewise.
10956 (_mm_maskz_cvtss_sh): Likewise.
10957 (_mm_cvtsd_sh): Likewise.
10958 (_mm_mask_cvtsd_sh): Likewise.
10959 (_mm_maskz_cvtsd_sh): Likewise.
10960 (_mm_cvt_roundss_sh): Likewise.
10961 (_mm_mask_cvt_roundss_sh): Likewise.
10962 (_mm_maskz_cvt_roundss_sh): Likewise.
10963 (_mm_cvt_roundsd_sh): Likewise.
10964 (_mm_mask_cvt_roundsd_sh): Likewise.
10965 (_mm_maskz_cvt_roundsd_sh): Likewise.
10966 * config/i386/i386-builtin-types.def
10967 (V8HF_FTYPE_V2DF_V8HF_V8HF_UQI_INT,
10968 V8HF_FTYPE_V4SF_V8HF_V8HF_UQI_INT,
10969 V2DF_FTYPE_V8HF_V2DF_V2DF_UQI_INT,
10970 V4SF_FTYPE_V8HF_V4SF_V4SF_UQI_INT): Add new builtin types.
10971 * config/i386/i386-builtin.def: Add corrresponding new builtins.
10972 * config/i386/i386-expand.c: Handle new builtin types.
10973 * config/i386/sse.md (VF48_128): New mode iterator.
10974 (avx512fp16_vcvtsh2<ssescalarmodesuffix><mask_scalar_name><round_saeonly_scalar_name>):
10976 (avx512fp16_vcvt<ssescalarmodesuffix>2sh<mask_scalar_name><round_scalar_name>):
10979 2021-09-17 liuhongt <hongtao.liu@intel.com>
10981 * config/i386/avx512fp16intrin.h (_mm512_cvtph_pd):
10983 (_mm512_mask_cvtph_pd): Likewise.
10984 (_mm512_maskz_cvtph_pd): Likewise.
10985 (_mm512_cvt_roundph_pd): Likewise.
10986 (_mm512_mask_cvt_roundph_pd): Likewise.
10987 (_mm512_maskz_cvt_roundph_pd): Likewise.
10988 (_mm512_cvtxph_ps): Likewise.
10989 (_mm512_mask_cvtxph_ps): Likewise.
10990 (_mm512_maskz_cvtxph_ps): Likewise.
10991 (_mm512_cvtx_roundph_ps): Likewise.
10992 (_mm512_mask_cvtx_roundph_ps): Likewise.
10993 (_mm512_maskz_cvtx_roundph_ps): Likewise.
10994 (_mm512_cvtxps_ph): Likewise.
10995 (_mm512_mask_cvtxps_ph): Likewise.
10996 (_mm512_maskz_cvtxps_ph): Likewise.
10997 (_mm512_cvtx_roundps_ph): Likewise.
10998 (_mm512_mask_cvtx_roundps_ph): Likewise.
10999 (_mm512_maskz_cvtx_roundps_ph): Likewise.
11000 (_mm512_cvtpd_ph): Likewise.
11001 (_mm512_mask_cvtpd_ph): Likewise.
11002 (_mm512_maskz_cvtpd_ph): Likewise.
11003 (_mm512_cvt_roundpd_ph): Likewise.
11004 (_mm512_mask_cvt_roundpd_ph): Likewise.
11005 (_mm512_maskz_cvt_roundpd_ph): Likewise.
11006 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_pd):
11008 (_mm_mask_cvtph_pd): Likewise.
11009 (_mm_maskz_cvtph_pd): Likewise.
11010 (_mm256_cvtph_pd): Likewise.
11011 (_mm256_mask_cvtph_pd): Likewise.
11012 (_mm256_maskz_cvtph_pd): Likewise.
11013 (_mm_cvtxph_ps): Likewise.
11014 (_mm_mask_cvtxph_ps): Likewise.
11015 (_mm_maskz_cvtxph_ps): Likewise.
11016 (_mm256_cvtxph_ps): Likewise.
11017 (_mm256_mask_cvtxph_ps): Likewise.
11018 (_mm256_maskz_cvtxph_ps): Likewise.
11019 (_mm_cvtxps_ph): Likewise.
11020 (_mm_mask_cvtxps_ph): Likewise.
11021 (_mm_maskz_cvtxps_ph): Likewise.
11022 (_mm256_cvtxps_ph): Likewise.
11023 (_mm256_mask_cvtxps_ph): Likewise.
11024 (_mm256_maskz_cvtxps_ph): Likewise.
11025 (_mm_cvtpd_ph): Likewise.
11026 (_mm_mask_cvtpd_ph): Likewise.
11027 (_mm_maskz_cvtpd_ph): Likewise.
11028 (_mm256_cvtpd_ph): Likewise.
11029 (_mm256_mask_cvtpd_ph): Likewise.
11030 (_mm256_maskz_cvtpd_ph): Likewise.
11031 * config/i386/i386-builtin.def: Add corresponding new builtins.
11032 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11033 * config/i386/i386-expand.c: Handle new builtin types.
11034 * config/i386/sse.md
11035 (VF4_128_8_256): New.
11036 (VF48H_AVX512VL): Ditto.
11037 (ssePHmode): Add HF vector modes.
11038 (castmode): Add new convertable modes.
11039 (qq2phsuff): Ditto.
11040 (ph2pssuffix): New.
11041 (avx512fp16_vcvt<castmode>2ph_<mode><mask_name><round_name>): Ditto.
11042 (avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
11043 (*avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
11044 (avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
11045 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
11046 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask_1): Ditto.
11047 (avx512fp16_float_extend_ph<mode>2<mask_name><round_saeonly_name>):
11049 (avx512fp16_float_extend_ph<mode>2<mask_name>): Ditto.
11050 (*avx512fp16_float_extend_ph<mode>2_load<mask_name>): Ditto.
11051 (avx512fp16_float_extend_phv2df2<mask_name>): Ditto.
11052 (*avx512fp16_float_extend_phv2df2_load<mask_name>): Ditto.
11054 2021-09-17 liuhongt <hongtao.liu@intel.com>
11056 * config/i386/avx512fp16intrin.h (_mm_cvttsh_i32):
11058 (_mm_cvttsh_u32): Likewise.
11059 (_mm_cvtt_roundsh_i32): Likewise.
11060 (_mm_cvtt_roundsh_u32): Likewise.
11061 (_mm_cvttsh_i64): Likewise.
11062 (_mm_cvttsh_u64): Likewise.
11063 (_mm_cvtt_roundsh_i64): Likewise.
11064 (_mm_cvtt_roundsh_u64): Likewise.
11065 * config/i386/i386-builtin.def: Add corresponding new builtins.
11066 * config/i386/sse.md
11067 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<round_saeonly_name>):
11070 2021-09-17 liuhongt <hongtao.liu@intel.com>
11072 * config/i386/avx512fp16intrin.h (_mm512_cvttph_epi32):
11074 (_mm512_mask_cvttph_epi32): Likewise.
11075 (_mm512_maskz_cvttph_epi32): Likewise.
11076 (_mm512_cvtt_roundph_epi32): Likewise.
11077 (_mm512_mask_cvtt_roundph_epi32): Likewise.
11078 (_mm512_maskz_cvtt_roundph_epi32): Likewise.
11079 (_mm512_cvttph_epu32): Likewise.
11080 (_mm512_mask_cvttph_epu32): Likewise.
11081 (_mm512_maskz_cvttph_epu32): Likewise.
11082 (_mm512_cvtt_roundph_epu32): Likewise.
11083 (_mm512_mask_cvtt_roundph_epu32): Likewise.
11084 (_mm512_maskz_cvtt_roundph_epu32): Likewise.
11085 (_mm512_cvttph_epi64): Likewise.
11086 (_mm512_mask_cvttph_epi64): Likewise.
11087 (_mm512_maskz_cvttph_epi64): Likewise.
11088 (_mm512_cvtt_roundph_epi64): Likewise.
11089 (_mm512_mask_cvtt_roundph_epi64): Likewise.
11090 (_mm512_maskz_cvtt_roundph_epi64): Likewise.
11091 (_mm512_cvttph_epu64): Likewise.
11092 (_mm512_mask_cvttph_epu64): Likewise.
11093 (_mm512_maskz_cvttph_epu64): Likewise.
11094 (_mm512_cvtt_roundph_epu64): Likewise.
11095 (_mm512_mask_cvtt_roundph_epu64): Likewise.
11096 (_mm512_maskz_cvtt_roundph_epu64): Likewise.
11097 (_mm512_cvttph_epi16): Likewise.
11098 (_mm512_mask_cvttph_epi16): Likewise.
11099 (_mm512_maskz_cvttph_epi16): Likewise.
11100 (_mm512_cvtt_roundph_epi16): Likewise.
11101 (_mm512_mask_cvtt_roundph_epi16): Likewise.
11102 (_mm512_maskz_cvtt_roundph_epi16): Likewise.
11103 (_mm512_cvttph_epu16): Likewise.
11104 (_mm512_mask_cvttph_epu16): Likewise.
11105 (_mm512_maskz_cvttph_epu16): Likewise.
11106 (_mm512_cvtt_roundph_epu16): Likewise.
11107 (_mm512_mask_cvtt_roundph_epu16): Likewise.
11108 (_mm512_maskz_cvtt_roundph_epu16): Likewise.
11109 * config/i386/avx512fp16vlintrin.h (_mm_cvttph_epi32):
11111 (_mm_mask_cvttph_epi32): Likewise.
11112 (_mm_maskz_cvttph_epi32): Likewise.
11113 (_mm256_cvttph_epi32): Likewise.
11114 (_mm256_mask_cvttph_epi32): Likewise.
11115 (_mm256_maskz_cvttph_epi32): Likewise.
11116 (_mm_cvttph_epu32): Likewise.
11117 (_mm_mask_cvttph_epu32): Likewise.
11118 (_mm_maskz_cvttph_epu32): Likewise.
11119 (_mm256_cvttph_epu32): Likewise.
11120 (_mm256_mask_cvttph_epu32): Likewise.
11121 (_mm256_maskz_cvttph_epu32): Likewise.
11122 (_mm_cvttph_epi64): Likewise.
11123 (_mm_mask_cvttph_epi64): Likewise.
11124 (_mm_maskz_cvttph_epi64): Likewise.
11125 (_mm256_cvttph_epi64): Likewise.
11126 (_mm256_mask_cvttph_epi64): Likewise.
11127 (_mm256_maskz_cvttph_epi64): Likewise.
11128 (_mm_cvttph_epu64): Likewise.
11129 (_mm_mask_cvttph_epu64): Likewise.
11130 (_mm_maskz_cvttph_epu64): Likewise.
11131 (_mm256_cvttph_epu64): Likewise.
11132 (_mm256_mask_cvttph_epu64): Likewise.
11133 (_mm256_maskz_cvttph_epu64): Likewise.
11134 (_mm_cvttph_epi16): Likewise.
11135 (_mm_mask_cvttph_epi16): Likewise.
11136 (_mm_maskz_cvttph_epi16): Likewise.
11137 (_mm256_cvttph_epi16): Likewise.
11138 (_mm256_mask_cvttph_epi16): Likewise.
11139 (_mm256_maskz_cvttph_epi16): Likewise.
11140 (_mm_cvttph_epu16): Likewise.
11141 (_mm_mask_cvttph_epu16): Likewise.
11142 (_mm_maskz_cvttph_epu16): Likewise.
11143 (_mm256_cvttph_epu16): Likewise.
11144 (_mm256_mask_cvttph_epu16): Likewise.
11145 (_mm256_maskz_cvttph_epu16): Likewise.
11146 * config/i386/i386-builtin.def: Add new builtins.
11147 * config/i386/sse.md
11148 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name><round_saeonly_name>):
11150 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name>): Ditto.
11151 (*avx512fp16_fix<fixunssuffix>_trunc<mode>2_load<mask_name>): Ditto.
11152 (avx512fp16_fix<fixunssuffix>_truncv2di2<mask_name>): Ditto.
11153 (avx512fp16_fix<fixunssuffix>_truncv2di2_load<mask_name>): Ditto.
11155 2021-09-17 liuhongt <hongtao.liu@intel.com>
11157 * config/i386/avx512fp16intrin.h (_mm_cvtsh_i32): New intrinsic.
11158 (_mm_cvtsh_u32): Likewise.
11159 (_mm_cvt_roundsh_i32): Likewise.
11160 (_mm_cvt_roundsh_u32): Likewise.
11161 (_mm_cvtsh_i64): Likewise.
11162 (_mm_cvtsh_u64): Likewise.
11163 (_mm_cvt_roundsh_i64): Likewise.
11164 (_mm_cvt_roundsh_u64): Likewise.
11165 (_mm_cvti32_sh): Likewise.
11166 (_mm_cvtu32_sh): Likewise.
11167 (_mm_cvt_roundi32_sh): Likewise.
11168 (_mm_cvt_roundu32_sh): Likewise.
11169 (_mm_cvti64_sh): Likewise.
11170 (_mm_cvtu64_sh): Likewise.
11171 (_mm_cvt_roundi64_sh): Likewise.
11172 (_mm_cvt_roundu64_sh): Likewise.
11173 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11174 * config/i386/i386-builtin.def: Add corresponding new builtins.
11175 * config/i386/i386-expand.c (ix86_expand_round_builtin):
11176 Handle new builtin types.
11177 * config/i386/sse.md
11178 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix><round_name>):
11180 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix>_2): Likewise.
11181 (avx512fp16_vcvt<floatsuffix>si2sh<rex64namesuffix><round_name>): Likewise.
11183 2021-09-16 Bill Schmidt <wschmidt@linux.ibm.com>
11185 * config/rs6000/rs6000-c.c (rs6000-builtins.h): New include.
11186 (altivec_resolve_new_overloaded_builtin): New forward decl.
11187 (rs6000_new_builtin_type_compatible): New function.
11188 (altivec_resolve_overloaded_builtin): Call
11189 altivec_resolve_new_overloaded_builtin.
11190 (altivec_build_new_resolved_builtin): New function.
11191 (altivec_resolve_new_overloaded_builtin): Likewise.
11192 * config/rs6000/rs6000-call.c (rs6000_new_builtin_is_supported):
11194 * config/rs6000/rs6000-gen-builtins.c (write_decls): Remove _p from
11195 name of rs6000_new_builtin_is_supported.
11197 2021-09-16 Uroš Bizjak <ubizjak@gmail.com>
11199 * config/i386/i386-protos.h (ix86_decompose_address):
11200 Change return type to bool.
11201 * config/i386/i386.c (ix86_decompose_address): Ditto.
11203 2021-09-16 Tobias Burnus <tobias@codesourcery.com>
11206 * config/rs6000/t-rs6000 (build/rs6000-gen-builtins.o, build/rbtree.o):
11207 Added 'build/' to target, use build/%.o rule.
11208 (build/rs6000-gen-builtins$(build_exeext)): Add 'build/' and
11209 '$(build_exeext)' to target and 'build/' for the *.o files.
11210 (rs6000-builtins.c): Update for those changes; run rs6000-gen-builtins
11213 2021-09-16 Martin Jambor <mjambor@suse.cz>
11215 * cgraph.c (cgraph_node::dump): Do not check caller count sums if
11216 the body has been removed. Remove trailing whitespace.
11218 2021-09-16 Richard Biener <rguenther@suse.de>
11220 PR middle-end/102360
11221 * internal-fn.c (expand_DEFERRED_INIT): Make pattern-init
11222 of non-memory more robust.
11224 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11226 * config/sparc/sparc-opts.h (enum sparc_processor_type): Add LEON5
11227 * config/sparc/sparc.c (struct processor_costs): Add LEON5 costs
11228 (leon5_adjust_cost): Increase cost of store with data dependency
11229 on ALU instruction and FPU anti-dependencies.
11230 (sparc_option_override): Add LEON5 costs
11231 (sparc_adjust_cost): Add LEON5 cost adjustments
11232 * config/sparc/sparc.h: Add LEON5
11233 * config/sparc/sparc.md: Include LEON5 scheduling information
11234 * config/sparc/sparc.opt: Add LEON5
11235 * doc/invoke.texi: Add LEON5
11236 * config/sparc/leon5.md: New file.
11238 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11240 * config/sparc/sparc.md (stack_protect_set32): Add NOP to prevent
11241 sensitive sequence for B2BST errata workaround.
11243 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11245 * config/sparc/sparc.c (sparc_do_work_around_errata): Do not begin
11246 functions with atomic instruction in the UT700 errata workaround.
11248 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11250 * config/sparc/sparc.c (next_active_non_empty_insn): New function
11251 that returns next active non empty assembly instruction.
11252 (sparc_do_work_around_errata): Use new function.
11254 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11256 * config/sparc/sparc.c (store_insn_p): Add predicate for store
11258 (load_insn_p): Add predicate for load attributes.
11259 (sparc_do_work_around_errata): Use new predicates.
11261 2021-09-16 Andreas Larsson <andreas@gaisler.com>
11263 * config/sparc/sparc.c (dump_target_flag_bits): Print bit names for
11266 2021-09-16 Martin Liska <mliska@suse.cz>
11268 * config/mips/netbsd.h: Fix typo in name of a macro.
11270 2021-09-16 liuhongt <hongtao.liu@intel.com>
11272 PR middle-end/102080
11273 * match.pd: Check mask type when doing cond_op related gimple
11275 * tree.c (is_truth_type_for): New function.
11276 * tree.h (is_truth_type_for): New declaration.
11278 2021-09-16 liuhongt <hongtao.liu@intel.com>
11280 * config/i386/avx512fp16intrin.h (_mm512_cvtepi32_ph): New
11282 (_mm512_mask_cvtepi32_ph): Likewise.
11283 (_mm512_maskz_cvtepi32_ph): Likewise.
11284 (_mm512_cvt_roundepi32_ph): Likewise.
11285 (_mm512_mask_cvt_roundepi32_ph): Likewise.
11286 (_mm512_maskz_cvt_roundepi32_ph): Likewise.
11287 (_mm512_cvtepu32_ph): Likewise.
11288 (_mm512_mask_cvtepu32_ph): Likewise.
11289 (_mm512_maskz_cvtepu32_ph): Likewise.
11290 (_mm512_cvt_roundepu32_ph): Likewise.
11291 (_mm512_mask_cvt_roundepu32_ph): Likewise.
11292 (_mm512_maskz_cvt_roundepu32_ph): Likewise.
11293 (_mm512_cvtepi64_ph): Likewise.
11294 (_mm512_mask_cvtepi64_ph): Likewise.
11295 (_mm512_maskz_cvtepi64_ph): Likewise.
11296 (_mm512_cvt_roundepi64_ph): Likewise.
11297 (_mm512_mask_cvt_roundepi64_ph): Likewise.
11298 (_mm512_maskz_cvt_roundepi64_ph): Likewise.
11299 (_mm512_cvtepu64_ph): Likewise.
11300 (_mm512_mask_cvtepu64_ph): Likewise.
11301 (_mm512_maskz_cvtepu64_ph): Likewise.
11302 (_mm512_cvt_roundepu64_ph): Likewise.
11303 (_mm512_mask_cvt_roundepu64_ph): Likewise.
11304 (_mm512_maskz_cvt_roundepu64_ph): Likewise.
11305 (_mm512_cvtepi16_ph): Likewise.
11306 (_mm512_mask_cvtepi16_ph): Likewise.
11307 (_mm512_maskz_cvtepi16_ph): Likewise.
11308 (_mm512_cvt_roundepi16_ph): Likewise.
11309 (_mm512_mask_cvt_roundepi16_ph): Likewise.
11310 (_mm512_maskz_cvt_roundepi16_ph): Likewise.
11311 (_mm512_cvtepu16_ph): Likewise.
11312 (_mm512_mask_cvtepu16_ph): Likewise.
11313 (_mm512_maskz_cvtepu16_ph): Likewise.
11314 (_mm512_cvt_roundepu16_ph): Likewise.
11315 (_mm512_mask_cvt_roundepu16_ph): Likewise.
11316 (_mm512_maskz_cvt_roundepu16_ph): Likewise.
11317 * config/i386/avx512fp16vlintrin.h (_mm_cvtepi32_ph): New
11319 (_mm_mask_cvtepi32_ph): Likewise.
11320 (_mm_maskz_cvtepi32_ph): Likewise.
11321 (_mm256_cvtepi32_ph): Likewise.
11322 (_mm256_mask_cvtepi32_ph): Likewise.
11323 (_mm256_maskz_cvtepi32_ph): Likewise.
11324 (_mm_cvtepu32_ph): Likewise.
11325 (_mm_mask_cvtepu32_ph): Likewise.
11326 (_mm_maskz_cvtepu32_ph): Likewise.
11327 (_mm256_cvtepu32_ph): Likewise.
11328 (_mm256_mask_cvtepu32_ph): Likewise.
11329 (_mm256_maskz_cvtepu32_ph): Likewise.
11330 (_mm_cvtepi64_ph): Likewise.
11331 (_mm_mask_cvtepi64_ph): Likewise.
11332 (_mm_maskz_cvtepi64_ph): Likewise.
11333 (_mm256_cvtepi64_ph): Likewise.
11334 (_mm256_mask_cvtepi64_ph): Likewise.
11335 (_mm256_maskz_cvtepi64_ph): Likewise.
11336 (_mm_cvtepu64_ph): Likewise.
11337 (_mm_mask_cvtepu64_ph): Likewise.
11338 (_mm_maskz_cvtepu64_ph): Likewise.
11339 (_mm256_cvtepu64_ph): Likewise.
11340 (_mm256_mask_cvtepu64_ph): Likewise.
11341 (_mm256_maskz_cvtepu64_ph): Likewise.
11342 (_mm_cvtepi16_ph): Likewise.
11343 (_mm_mask_cvtepi16_ph): Likewise.
11344 (_mm_maskz_cvtepi16_ph): Likewise.
11345 (_mm256_cvtepi16_ph): Likewise.
11346 (_mm256_mask_cvtepi16_ph): Likewise.
11347 (_mm256_maskz_cvtepi16_ph): Likewise.
11348 (_mm_cvtepu16_ph): Likewise.
11349 (_mm_mask_cvtepu16_ph): Likewise.
11350 (_mm_maskz_cvtepu16_ph): Likewise.
11351 (_mm256_cvtepu16_ph): Likewise.
11352 (_mm256_mask_cvtepu16_ph): Likewise.
11353 (_mm256_maskz_cvtepu16_ph): Likewise.
11354 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11355 * config/i386/i386-builtin.def: Add corresponding new builtins.
11356 * config/i386/i386-expand.c
11357 (ix86_expand_args_builtin): Handle new builtin types.
11358 (ix86_expand_round_builtin): Ditto.
11359 * config/i386/i386-modes.def: Declare V2HF and V6HF.
11360 * config/i386/sse.md (VI2H_AVX512VL): New.
11361 (qq2phsuff): Ditto.
11362 (sseintvecmode): Add HF vector modes.
11363 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode><mask_name><round_name>):
11365 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
11366 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
11367 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
11368 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
11369 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask_1): Ditto.
11370 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
11371 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
11372 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
11373 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
11374 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask_1): Ditto.
11375 * config/i386/subst.md (round_qq2phsuff): New subst_attr.
11377 2021-09-16 liuhongt <hongtao.liu@intel.com>
11379 * config/i386/avx512fp16intrin.h (_mm512_cvtph_epi32):
11381 (_mm512_mask_cvtph_epi32): Likewise.
11382 (_mm512_maskz_cvtph_epi32): Likewise.
11383 (_mm512_cvt_roundph_epi32): Likewise.
11384 (_mm512_mask_cvt_roundph_epi32): Likewise.
11385 (_mm512_maskz_cvt_roundph_epi32): Likewise.
11386 (_mm512_cvtph_epu32): Likewise.
11387 (_mm512_mask_cvtph_epu32): Likewise.
11388 (_mm512_maskz_cvtph_epu32): Likewise.
11389 (_mm512_cvt_roundph_epu32): Likewise.
11390 (_mm512_mask_cvt_roundph_epu32): Likewise.
11391 (_mm512_maskz_cvt_roundph_epu32): Likewise.
11392 (_mm512_cvtph_epi64): Likewise.
11393 (_mm512_mask_cvtph_epi64): Likewise.
11394 (_mm512_maskz_cvtph_epi64): Likewise.
11395 (_mm512_cvt_roundph_epi64): Likewise.
11396 (_mm512_mask_cvt_roundph_epi64): Likewise.
11397 (_mm512_maskz_cvt_roundph_epi64): Likewise.
11398 (_mm512_cvtph_epu64): Likewise.
11399 (_mm512_mask_cvtph_epu64): Likewise.
11400 (_mm512_maskz_cvtph_epu64): Likewise.
11401 (_mm512_cvt_roundph_epu64): Likewise.
11402 (_mm512_mask_cvt_roundph_epu64): Likewise.
11403 (_mm512_maskz_cvt_roundph_epu64): Likewise.
11404 (_mm512_cvtph_epi16): Likewise.
11405 (_mm512_mask_cvtph_epi16): Likewise.
11406 (_mm512_maskz_cvtph_epi16): Likewise.
11407 (_mm512_cvt_roundph_epi16): Likewise.
11408 (_mm512_mask_cvt_roundph_epi16): Likewise.
11409 (_mm512_maskz_cvt_roundph_epi16): Likewise.
11410 (_mm512_cvtph_epu16): Likewise.
11411 (_mm512_mask_cvtph_epu16): Likewise.
11412 (_mm512_maskz_cvtph_epu16): Likewise.
11413 (_mm512_cvt_roundph_epu16): Likewise.
11414 (_mm512_mask_cvt_roundph_epu16): Likewise.
11415 (_mm512_maskz_cvt_roundph_epu16): Likewise.
11416 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_epi32):
11418 (_mm_mask_cvtph_epi32): Likewise.
11419 (_mm_maskz_cvtph_epi32): Likewise.
11420 (_mm256_cvtph_epi32): Likewise.
11421 (_mm256_mask_cvtph_epi32): Likewise.
11422 (_mm256_maskz_cvtph_epi32): Likewise.
11423 (_mm_cvtph_epu32): Likewise.
11424 (_mm_mask_cvtph_epu32): Likewise.
11425 (_mm_maskz_cvtph_epu32): Likewise.
11426 (_mm256_cvtph_epu32): Likewise.
11427 (_mm256_mask_cvtph_epu32): Likewise.
11428 (_mm256_maskz_cvtph_epu32): Likewise.
11429 (_mm_cvtph_epi64): Likewise.
11430 (_mm_mask_cvtph_epi64): Likewise.
11431 (_mm_maskz_cvtph_epi64): Likewise.
11432 (_mm256_cvtph_epi64): Likewise.
11433 (_mm256_mask_cvtph_epi64): Likewise.
11434 (_mm256_maskz_cvtph_epi64): Likewise.
11435 (_mm_cvtph_epu64): Likewise.
11436 (_mm_mask_cvtph_epu64): Likewise.
11437 (_mm_maskz_cvtph_epu64): Likewise.
11438 (_mm256_cvtph_epu64): Likewise.
11439 (_mm256_mask_cvtph_epu64): Likewise.
11440 (_mm256_maskz_cvtph_epu64): Likewise.
11441 (_mm_cvtph_epi16): Likewise.
11442 (_mm_mask_cvtph_epi16): Likewise.
11443 (_mm_maskz_cvtph_epi16): Likewise.
11444 (_mm256_cvtph_epi16): Likewise.
11445 (_mm256_mask_cvtph_epi16): Likewise.
11446 (_mm256_maskz_cvtph_epi16): Likewise.
11447 (_mm_cvtph_epu16): Likewise.
11448 (_mm_mask_cvtph_epu16): Likewise.
11449 (_mm_maskz_cvtph_epu16): Likewise.
11450 (_mm256_cvtph_epu16): Likewise.
11451 (_mm256_mask_cvtph_epu16): Likewise.
11452 (_mm256_maskz_cvtph_epu16): Likewise.
11453 * config/i386/i386-builtin-types.def: Add new builtin types.
11454 * config/i386/i386-builtin.def: Add new builtins.
11455 * config/i386/i386-expand.c
11456 (ix86_expand_args_builtin): Handle new builtin types.
11457 (ix86_expand_round_builtin): Ditto.
11458 * config/i386/sse.md (sseintconvert): New.
11459 (ssePHmode): Ditto.
11460 (UNSPEC_US_FIX_NOTRUNC): Ditto.
11461 (sseintconvertsignprefix): Ditto.
11462 (avx512fp16_vcvtph2<sseintconvertsignprefix><sseintconvert>_<mode><mask_name><round_name>):
11465 2021-09-16 liuhongt <hongtao.liu@intel.com>
11467 * config/i386/avx512fp16intrin.h: (_mm_cvtsi16_si128):
11469 (_mm_cvtsi128_si16): Likewise.
11470 (_mm_mask_load_sh): Likewise.
11471 (_mm_maskz_load_sh): Likewise.
11472 (_mm_mask_store_sh): Likewise.
11473 (_mm_move_sh): Likewise.
11474 (_mm_mask_move_sh): Likewise.
11475 (_mm_maskz_move_sh): Likewise.
11476 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11477 * config/i386/i386-builtin.def: Add corresponding new builtins.
11478 * config/i386/i386-expand.c
11479 (ix86_expand_special_args_builtin): Handle new builtin types.
11480 (ix86_expand_vector_init_one_nonzero): Adjust for FP16 target.
11481 * config/i386/sse.md (VI2F): New mode iterator.
11482 (vec_set<mode>_0): Use new mode iterator.
11483 (avx512f_mov<ssescalarmodelower>_mask): Adjust for HF vector mode.
11484 (avx512f_store<mode>_mask): Ditto.
11486 2021-09-16 Kewen Lin <linkw@linux.ibm.com>
11488 * config/rs6000/rs6000.opt (-mtoc-fusion): Remove.
11490 2021-09-15 David Edelsohn <dje.gcc@gmail.com>
11492 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
11493 Proceed if no symbol summary or the symbol alias flag is false.
11495 2021-09-15 Jakub Jelinek <jakub@redhat.com>
11499 * varasm.c (output_constructor_regular_field): Instead of assertion
11500 that array_size_for_constructor result is equal to size of
11501 TREE_TYPE (local->val) in bytes, assert that the type size is greater
11502 or equal to array_size_for_constructor result and use type size as
11505 2021-09-15 Martin Liska <mliska@suse.cz>
11508 * config/i386/vxworks.h: Use new macro TARGET_CPU_P.
11510 2021-09-15 Martin Liska <mliska@suse.cz>
11513 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
11514 Check that we have a symbol summary for a symbol.
11516 2021-09-15 Richard Biener <rguenther@suse.de>
11519 * config/rs6000/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
11520 to inherit from elfos.h
11522 2021-09-15 liuhongt <hongtao.liu@intel.com>
11525 * config/i386/i386-expand.c
11526 (ix86_expand_vector_init_interleave): Use puncklwd to pack 2
11528 (ix86_expand_vector_set): Use blendw instead of pinsrw.
11529 * config/i386/i386.c (ix86_can_change_mode_class): Adjust for
11530 AVX512FP16 which supports 16bit vector load.
11531 * config/i386/sse.md (avx512bw_interleave_highv32hi<mask_name>):
11533 (avx512bw_interleave_high<mode><mask_name>): .. this, and
11534 extend to V32HFmode.
11535 (avx2_interleave_highv16hi<mask_name>): Rename to ..
11536 (avx2_interleave_high<mode><mask_name>): .. this, and extend
11538 (vec_interleave_highv8hi<mask_name>): Rename to ..
11539 (vec_interleave_high<mode><mask_name>): .. this, and extend to V8HFmode.
11540 (<mask_codefor>avx512bw_interleave_lowv32hi<mask_name>):
11542 (<mask_codefor>avx512bw_interleave_low<mode><mask_name>):
11543 this, and extend to V32HFmode.
11544 (avx2_interleave_lowv16hi<mask_name>): Rename to ..
11545 (avx2_interleave_low<mode><mask_name>): .. this, and extend to V16HFmode.
11546 (vec_interleave_lowv8hi<mask_name>): Rename to ..
11547 (vec_interleave_low<mode><mask_name>): .. this, and extend to V8HFmode.
11548 (sse4_1_pblendw): Rename to ..
11549 (sse4_1_pblend<blendsuf>): .. this, and extend to V8HFmode.
11550 (avx2_pblendph): New define_expand.
11551 (<sse2p4_1>_pinsr<ssemodesuffix>): Refactor, use
11552 sseintmodesuffix instead of ssemodesuffix.
11553 (blendsuf): New mode attr.
11555 2021-09-15 Richard Biener <rguenther@suse.de>
11557 * tree-vectorizer.h (dr_misalignment): Move out of line.
11558 (dr_target_alignment): New.
11559 (DR_TARGET_ALIGNMENT): Wrap dr_target_alignment.
11560 (set_dr_target_alignment): New.
11561 (SET_DR_TARGET_ALIGNMENT): Wrap set_dr_target_alignment.
11562 * tree-vect-data-refs.c (dr_misalignment): Compute and
11563 return the group members misalignment.
11564 (vect_compute_data_ref_alignment): Use SET_DR_TARGET_ALIGNMENT.
11565 (vect_analyze_data_refs_alignment): Compute alignment only
11566 for the first element of a DR group.
11567 (vect_slp_analyze_node_alignment): Likewise.
11569 2021-09-15 Hongyu Wang <hongyu.wang@intel.com>
11571 * config/i386/avx512fp16intrin.h: Adjust all builtin calls.
11572 * config/i386/avx512fp16vlintrin.h: Likewise.
11573 * config/i386/i386-builtin.def: Adjust builtin name and
11574 enumeration to match AVX512F style.
11576 2021-09-15 Richard Biener <rguenther@suse.de>
11578 PR tree-optimization/102318
11579 * tree-vect-loop.c (vect_transform_cycle_phi): Revert
11580 previous change and do the mode conversion separately from
11581 the sign conversion.
11583 2021-09-15 Hongtao Liu <hongtao.liu@intel.com>
11584 Peter Cordes <peter@cordes.ca>
11587 * config/i386/sse.md (extract_suf): Add V8SF/V8SI/V4DF/V4DI.
11588 (*vec_extract<mode><ssescalarmodelower>_valign): Output
11589 vextract{i,f}{32x4,64x2} instruction when byte_offset % 16 ==
11592 2021-09-15 Richard Biener <rguenther@suse.de>
11594 * config.gcc: Remove vax-*-openbsd* configuration.
11596 2021-09-15 Richard Biener <rguenther@suse.de>
11598 * config.gcc: Remove m68k-openbsd.
11600 2021-09-15 Max Filippov <jcmvbkbc@gmail.com>
11603 * config/xtensa/t-xtensa (TM_H): Add include/xtensa-config.h.
11605 2021-09-14 Peter Bergner <bergner@linux.ibm.com>
11607 * config/rs6000/mma.md (unspec): Delete UNSPEC_MMA_XXSETACCZ.
11608 (unspecv): Add UNSPECV_MMA_XXSETACCZ.
11609 (*mma_xxsetaccz): Delete.
11610 (mma_xxsetaccz): Change to define_insn. Remove operand 1.
11611 Use UNSPECV_MMA_XXSETACCZ. Update comment.
11612 * config/rs6000/rs6000.c (rs6000_rtx_costs): Use UNSPECV_MMA_XXSETACCZ.
11614 2021-09-14 Iain Sandoe <iain@sandoe.co.uk>
11616 * Makefile.in: Remove variables related to applying no-PIE
11617 to the exes on $build.
11618 * configure: Regenerate.
11619 * configure.ac: Remove configuration related to applying
11620 no-PIE to the exes on $build.
11622 2021-09-14 Claudiu Zissulescu <claziss@synopsys.com>
11624 * config/arc/arc.md (doloop_end): Add missing mode.
11625 (loop_end): Likewise.
11627 2021-09-14 Jakub Jelinek <jakub@redhat.com>
11629 * gimplify.c (goa_stabilize_expr): Add depth argument, propagate
11630 it to recursive calls, for depth above 7 just gimplify or return.
11631 Perform a test even for MODIFY_EXPR, ADDR_EXPR, COMPOUND_EXPR with
11632 __builtin_clear_padding and TARGET_EXPR.
11633 (gimplify_omp_atomic): Adjust goa_stabilize_expr callers.
11635 2021-09-14 liuhongt <hongtao.liu@intel.com>
11637 * config/i386/avx512fp16intrin.h (_mm_fpclass_sh_mask):
11639 (_mm_mask_fpclass_sh_mask): Likewise.
11640 (_mm512_mask_fpclass_ph_mask): Likewise.
11641 (_mm512_fpclass_ph_mask): Likewise.
11642 (_mm_getexp_sh): Likewise.
11643 (_mm_mask_getexp_sh): Likewise.
11644 (_mm_maskz_getexp_sh): Likewise.
11645 (_mm512_getexp_ph): Likewise.
11646 (_mm512_mask_getexp_ph): Likewise.
11647 (_mm512_maskz_getexp_ph): Likewise.
11648 (_mm_getexp_round_sh): Likewise.
11649 (_mm_mask_getexp_round_sh): Likewise.
11650 (_mm_maskz_getexp_round_sh): Likewise.
11651 (_mm512_getexp_round_ph): Likewise.
11652 (_mm512_mask_getexp_round_ph): Likewise.
11653 (_mm512_maskz_getexp_round_ph): Likewise.
11654 (_mm_getmant_sh): Likewise.
11655 (_mm_mask_getmant_sh): Likewise.
11656 (_mm_maskz_getmant_sh): Likewise.
11657 (_mm512_getmant_ph): Likewise.
11658 (_mm512_mask_getmant_ph): Likewise.
11659 (_mm512_maskz_getmant_ph): Likewise.
11660 (_mm_getmant_round_sh): Likewise.
11661 (_mm_mask_getmant_round_sh): Likewise.
11662 (_mm_maskz_getmant_round_sh): Likewise.
11663 (_mm512_getmant_round_ph): Likewise.
11664 (_mm512_mask_getmant_round_ph): Likewise.
11665 (_mm512_maskz_getmant_round_ph): Likewise.
11666 * config/i386/avx512fp16vlintrin.h (_mm_mask_fpclass_ph_mask):
11668 (_mm_fpclass_ph_mask): Likewise.
11669 (_mm256_mask_fpclass_ph_mask): Likewise.
11670 (_mm256_fpclass_ph_mask): Likewise.
11671 (_mm256_getexp_ph): Likewise.
11672 (_mm256_mask_getexp_ph): Likewise.
11673 (_mm256_maskz_getexp_ph): Likewise.
11674 (_mm_getexp_ph): Likewise.
11675 (_mm_mask_getexp_ph): Likewise.
11676 (_mm_maskz_getexp_ph): Likewise.
11677 (_mm256_getmant_ph): Likewise.
11678 (_mm256_mask_getmant_ph): Likewise.
11679 (_mm256_maskz_getmant_ph): Likewise.
11680 (_mm_getmant_ph): Likewise.
11681 (_mm_mask_getmant_ph): Likewise.
11682 (_mm_maskz_getmant_ph): Likewise.
11683 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11684 * config/i386/i386-builtin.def: Add corresponding new builtins.
11685 * config/i386/i386-expand.c
11686 (ix86_expand_args_builtin): Handle new builtin types.
11687 (ix86_expand_round_builtin): Ditto.
11688 * config/i386/sse.md (vecmemsuffix): Add HF vector modes.
11689 (<avx512>_getexp<mode><mask_name><round_saeonly_name>): Adjust
11690 to support HF vector modes.
11691 (avx512f_sgetexp<mode><mask_scalar_name><round_saeonly_scalar_name):
11693 (avx512dq_fpclass<mode><mask_scalar_merge_name>): Ditto.
11694 (avx512dq_vmfpclass<mode><mask_scalar_merge_name>): Ditto.
11695 (<avx512>_getmant<mode><mask_name><round_saeonly_name>): Ditto.
11696 (avx512f_vgetmant<mode><mask_scalar_name><round_saeonly_scalar_name>):
11699 2021-09-14 liuhongt <hongtao.liu@intel.com>
11701 * config/i386/avx512fp16intrin.h (_mm512_reduce_ph):
11703 (_mm512_mask_reduce_ph): Likewise.
11704 (_mm512_maskz_reduce_ph): Likewise.
11705 (_mm512_reduce_round_ph): Likewise.
11706 (_mm512_mask_reduce_round_ph): Likewise.
11707 (_mm512_maskz_reduce_round_ph): Likewise.
11708 (_mm_reduce_sh): Likewise.
11709 (_mm_mask_reduce_sh): Likewise.
11710 (_mm_maskz_reduce_sh): Likewise.
11711 (_mm_reduce_round_sh): Likewise.
11712 (_mm_mask_reduce_round_sh): Likewise.
11713 (_mm_maskz_reduce_round_sh): Likewise.
11714 (_mm512_roundscale_ph): Likewise.
11715 (_mm512_mask_roundscale_ph): Likewise.
11716 (_mm512_maskz_roundscale_ph): Likewise.
11717 (_mm512_roundscale_round_ph): Likewise.
11718 (_mm512_mask_roundscale_round_ph): Likewise.
11719 (_mm512_maskz_roundscale_round_ph): Likewise.
11720 (_mm_roundscale_sh): Likewise.
11721 (_mm_mask_roundscale_sh): Likewise.
11722 (_mm_maskz_roundscale_sh): Likewise.
11723 (_mm_roundscale_round_sh): Likewise.
11724 (_mm_mask_roundscale_round_sh): Likewise.
11725 (_mm_maskz_roundscale_round_sh): Likewise.
11726 * config/i386/avx512fp16vlintrin.h: (_mm_reduce_ph):
11728 (_mm_mask_reduce_ph): Likewise.
11729 (_mm_maskz_reduce_ph): Likewise.
11730 (_mm256_reduce_ph): Likewise.
11731 (_mm256_mask_reduce_ph): Likewise.
11732 (_mm256_maskz_reduce_ph): Likewise.
11733 (_mm_roundscale_ph): Likewise.
11734 (_mm_mask_roundscale_ph): Likewise.
11735 (_mm_maskz_roundscale_ph): Likewise.
11736 (_mm256_roundscale_ph): Likewise.
11737 (_mm256_mask_roundscale_ph): Likewise.
11738 (_mm256_maskz_roundscale_ph): Likewise.
11739 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11740 * config/i386/i386-builtin.def: Add corresponding new builtins.
11741 * config/i386/i386-expand.c
11742 (ix86_expand_args_builtin): Handle new builtin types.
11743 (ix86_expand_round_builtin): Ditto.
11744 * config/i386/sse.md (<mask_codefor>reducep<mode><mask_name>):
11746 (<mask_codefor>reducep<mode><mask_name><round_saeonly_name>):
11747 ... this, and adjust for round operands.
11748 (reduces<mode><mask_scalar_name>): Likewise, with ...
11749 (reduces<mode><mask_scalar_name><round_saeonly_scalar_name):
11751 (<avx512>_rndscale<mode><mask_name><round_saeonly_name>):
11752 Adjust for HF vector modes.
11753 (avx512f_rndscale<mode><mask_scalar_name><round_saeonly_scalar_name>):
11755 (*avx512f_rndscale<mode><round_saeonly_name>): Ditto.
11757 2021-09-14 liuhongt <hongtao.liu@intel.com>
11759 * config/i386/avx512fp16intrin.h: (_mm512_rcp_ph):
11761 (_mm512_mask_rcp_ph): Likewise.
11762 (_mm512_maskz_rcp_ph): Likewise.
11763 (_mm_rcp_sh): Likewise.
11764 (_mm_mask_rcp_sh): Likewise.
11765 (_mm_maskz_rcp_sh): Likewise.
11766 (_mm512_scalef_ph): Likewise.
11767 (_mm512_mask_scalef_ph): Likewise.
11768 (_mm512_maskz_scalef_ph): Likewise.
11769 (_mm512_scalef_round_ph): Likewise.
11770 (_mm512_mask_scalef_round_ph): Likewise.
11771 (_mm512_maskz_scalef_round_ph): Likewise.
11772 (_mm_scalef_sh): Likewise.
11773 (_mm_mask_scalef_sh): Likewise.
11774 (_mm_maskz_scalef_sh): Likewise.
11775 (_mm_scalef_round_sh): Likewise.
11776 (_mm_mask_scalef_round_sh): Likewise.
11777 (_mm_maskz_scalef_round_sh): Likewise.
11778 * config/i386/avx512fp16vlintrin.h (_mm_rcp_ph):
11780 (_mm256_rcp_ph): Likewise.
11781 (_mm_mask_rcp_ph): Likewise.
11782 (_mm256_mask_rcp_ph): Likewise.
11783 (_mm_maskz_rcp_ph): Likewise.
11784 (_mm256_maskz_rcp_ph): Likewise.
11785 (_mm_scalef_ph): Likewise.
11786 (_mm256_scalef_ph): Likewise.
11787 (_mm_mask_scalef_ph): Likewise.
11788 (_mm256_mask_scalef_ph): Likewise.
11789 (_mm_maskz_scalef_ph): Likewise.
11790 (_mm256_maskz_scalef_ph): Likewise.
11791 * config/i386/i386-builtin.def: Add new builtins.
11792 * config/i386/sse.md (VFH_AVX512VL): New.
11793 (avx512fp16_rcp<mode>2<mask_name>): Ditto.
11794 (avx512fp16_vmrcpv8hf2<mask_scalar_name>): Ditto.
11795 (avx512f_vmscalef<mode><mask_scalar_name><round_scalar_name>):
11796 Adjust to support HF vector modes.
11797 (<avx512>_scalef<mode><mask_name><round_name>): Ditto.
11799 2021-09-14 liuhongt <hongtao.liu@intel.com>
11801 * config/i386/avx512fp16intrin.h: (_mm512_sqrt_ph):
11803 (_mm512_mask_sqrt_ph): Likewise.
11804 (_mm512_maskz_sqrt_ph): Likewise.
11805 (_mm512_sqrt_round_ph): Likewise.
11806 (_mm512_mask_sqrt_round_ph): Likewise.
11807 (_mm512_maskz_sqrt_round_ph): Likewise.
11808 (_mm512_rsqrt_ph): Likewise.
11809 (_mm512_mask_rsqrt_ph): Likewise.
11810 (_mm512_maskz_rsqrt_ph): Likewise.
11811 (_mm_rsqrt_sh): Likewise.
11812 (_mm_mask_rsqrt_sh): Likewise.
11813 (_mm_maskz_rsqrt_sh): Likewise.
11814 (_mm_sqrt_sh): Likewise.
11815 (_mm_mask_sqrt_sh): Likewise.
11816 (_mm_maskz_sqrt_sh): Likewise.
11817 (_mm_sqrt_round_sh): Likewise.
11818 (_mm_mask_sqrt_round_sh): Likewise.
11819 (_mm_maskz_sqrt_round_sh): Likewise.
11820 * config/i386/avx512fp16vlintrin.h (_mm_sqrt_ph): New intrinsic.
11821 (_mm256_sqrt_ph): Likewise.
11822 (_mm_mask_sqrt_ph): Likewise.
11823 (_mm256_mask_sqrt_ph): Likewise.
11824 (_mm_maskz_sqrt_ph): Likewise.
11825 (_mm256_maskz_sqrt_ph): Likewise.
11826 (_mm_rsqrt_ph): Likewise.
11827 (_mm256_rsqrt_ph): Likewise.
11828 (_mm_mask_rsqrt_ph): Likewise.
11829 (_mm256_mask_rsqrt_ph): Likewise.
11830 (_mm_maskz_rsqrt_ph): Likewise.
11831 (_mm256_maskz_rsqrt_ph): Likewise.
11832 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11833 * config/i386/i386-builtin.def: Add corresponding new builtins.
11834 * config/i386/i386-expand.c
11835 (ix86_expand_args_builtin): Handle new builtins.
11836 (ix86_expand_round_builtin): Ditto.
11837 * config/i386/sse.md (VF_AVX512FP16VL): New.
11838 (sqrt<mode>2): Adjust for HF vector modes.
11839 (<sse>_sqrt<mode>2<mask_name><round_name>): Likewise.
11840 (<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
11842 (<sse>_rsqrt<mode>2<mask_name>): New.
11843 (avx512fp16_vmrsqrtv8hf2<mask_scalar_name>): Likewise.
11845 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11847 PR bootstrap/101574
11848 * diagnostic-spec.c (warning_suppressed_at, copy_warning): Handle
11849 'RESERVED_LOCATION_P' locations.
11850 * warning-control.cc (get_nowarn_spec, suppress_warning)
11851 (copy_warning): Likewise.
11853 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11855 * diagnostic-spec.h (typedef xint_hash_t): Use 'location_t' instead of...
11856 (typedef key_type_t): ... this. Remove.
11857 (nowarn_map): Document.
11858 * diagnostic-spec.c (nowarn_map): Likewise.
11859 * warning-control.cc (convert_to_key): Evolve functions into...
11860 (get_location): ... these. Adjust all users.
11862 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11864 * warning-control.cc (copy_warning): Remove 'nowarn_map' setup.
11866 2021-09-13 Jason Merrill <jason@redhat.com>
11868 * params.opt: Add destructive-interference-size and
11869 constructive-interference-size.
11870 * doc/invoke.texi: Document them.
11871 * config/aarch64/aarch64.c (aarch64_override_options_internal):
11873 * config/arm/arm.c (arm_option_override): Set them.
11874 * config/i386/i386-options.c (ix86_option_override_internal):
11877 2021-09-13 Martin Liska <mliska@suse.cz>
11878 H.J. Lu <hjl.tools@gmail.com>
11881 * common/config/i386/cpuinfo.h (cpu_indicator_init): Add support
11882 for x86-64 micro levels for __builtin_cpu_supports.
11883 * common/config/i386/i386-cpuinfo.h (enum feature_priority):
11884 Add priorities for the micro-arch levels.
11885 (enum processor_features): Add new features.
11886 * common/config/i386/i386-isas.h: Add micro-arch features.
11887 * config/i386/i386-builtins.c (get_builtin_code_for_version):
11888 Support the micro-arch levels by callsing
11889 __builtin_cpu_supports.
11890 * doc/extend.texi: Document that the levels are support by
11891 __builtin_cpu_supports.
11893 2021-09-13 Andrew Pinski <apinski@marvell.com>
11896 * config/aarch64/aarch64-builtins.c (aarch64_fold_builtin_lane_check):
11898 (aarch64_general_fold_builtin): Handle AARCH64_SIMD_BUILTIN_LANE_CHECK.
11899 (aarch64_general_gimple_fold_builtin): Likewise.
11901 2021-09-13 Andrew Pinski <apinski@marvell.com>
11903 * config.gcc: Add m32r-*-linux* and m32rle-*-linux*
11904 to the Unsupported targets list.
11905 Remove support for m32r-*-linux* and m32rle-*-linux*.
11906 * config/m32r/linux.h: Removed.
11907 * config/m32r/t-linux: Removed.
11909 2021-09-13 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
11912 * config/aarch64/aarch64.c (aarch64_classify_address): Don't allow
11913 register index for SVE predicate modes.
11915 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
11917 * tree-ssa-threadbackward.c
11918 (back_threader_profitability::profitable_path_p): Remove FSM
11920 (back_threader_registry::register_path): Same.
11921 * tree-ssa-threadedge.c
11922 (jump_threader::simplify_control_stmt_condition): Same.
11923 * tree-ssa-threadupdate.c (jt_path_registry::jt_path_registry):
11924 Add backedge_threads argument.
11925 (fwd_jt_path_registry::fwd_jt_path_registry): Pass
11926 backedge_threads argument.
11927 (back_jt_path_registry::back_jt_path_registry): Same.
11928 (dump_jump_thread_path): Adjust for FSM removal.
11929 (back_jt_path_registry::rewire_first_differing_edge): Same.
11930 (back_jt_path_registry::adjust_paths_after_duplication): Same.
11931 (back_jt_path_registry::update_cfg): Same.
11932 (jt_path_registry::register_jump_thread): Same.
11933 * tree-ssa-threadupdate.h (enum jump_thread_edge_type): Remove
11935 (class back_jt_path_registry): Add backedge_threads to
11938 2021-09-13 Martin Liska <mliska@suse.cz>
11941 * asan.h (sanitize_coverage_p): Handle when fn == NULL.
11943 2021-09-13 H.J. Lu <hjl.tools@gmail.com>
11946 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): New.
11947 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
11948 (MOVE_MAX): Check TARGET_AVX256_MOVE_BY_PIECES and
11949 TARGET_AVX256_STORE_BY_PIECES instead of
11950 TARGET_AVX256_SPLIT_UNALIGNED_LOAD and
11951 TARGET_AVX256_SPLIT_UNALIGNED_STORE.
11952 (STORE_MAX_PIECES): Check TARGET_AVX256_STORE_BY_PIECES instead
11953 of TARGET_AVX256_SPLIT_UNALIGNED_STORE.
11954 * config/i386/x86-tune.def (X86_TUNE_AVX256_MOVE_BY_PIECES): New.
11955 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
11957 2021-09-13 liuhongt <hongtao.liu@intel.com>
11959 PR bootstrap/102302
11960 * expmed.c (extract_bit_field_using_extv): Use
11961 gen_lowpart_if_possible instead of gen_lowpart to avoid ICE.
11963 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
11965 * Makefile.in (OBJS): Add value-pointer-equiv.o.
11966 * gimple-ssa-evrp.c (class ssa_equiv_stack): Move to
11967 value-pointer-equiv.*.
11968 (ssa_equiv_stack::ssa_equiv_stack): Same.
11969 (ssa_equiv_stack::enter): Same.
11970 (ssa_equiv_stack::leave): Same.
11971 (ssa_equiv_stack::push_replacement): Same.
11972 (ssa_equiv_stack::get_replacement): Same.
11973 (is_pointer_ssa): Same.
11974 (class pointer_equiv_analyzer): Same.
11975 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
11976 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Same.
11977 (pointer_equiv_analyzer::set_global_equiv): Same.
11978 (pointer_equiv_analyzer::set_cond_equiv): Same.
11979 (pointer_equiv_analyzer::get_equiv): Same.
11980 (pointer_equiv_analyzer::enter): Same.
11981 (pointer_equiv_analyzer::leave): Same.
11982 (pointer_equiv_analyzer::get_equiv_expr): Same.
11983 (pta_valueize): Same.
11984 (pointer_equiv_analyzer::visit_stmt): Same.
11985 (pointer_equiv_analyzer::visit_edge): Same.
11986 (hybrid_folder::value_of_expr): Same.
11987 (hybrid_folder::value_on_edge): Same.
11988 * value-pointer-equiv.cc: New file.
11989 * value-pointer-equiv.h: New file.
11991 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11994 * gimple-fold.c (gimple_fold_builtin_memory_op): Allow folding
11995 memcpy if the size is not more than MOVE_MAX * MOVE_RATIO.
11997 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
12000 * config/arm/arm.md (movmisaligndi): New define_expand.
12001 * config/arm/vec-common.md (movmisalign<mode>): Iterate over VDQ mode.
12003 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
12006 * emit-rtl.c (gen_highpart): Use adjust_address to handle
12007 MEM rather than calling simplify_gen_subreg.
12009 2021-09-13 Jan-Benedict Glaw <jbglaw@ług-owl.de>
12011 * config/alpha/vms.h (INIT_CUMULATIVE_ARGS): Wrap multi-statment
12012 define into a block.
12014 2021-09-13 Richard Biener <rguenther@suse.de>
12016 * config/darwin.h (DARWIN_PREFER_DWARF): Do not define.
12017 * config/i386/darwin.h (PREFERRED_DEBUGGING_TYPE): Do not
12018 change based on DARWIN_PREFER_DWARF not being defined.
12020 2021-09-13 Richard Biener <rguenther@suse.de>
12022 * config/i386/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
12023 to inherit from elfos.h
12025 2021-09-13 Richard Biener <rguenther@suse.de>
12027 * config.gcc: Add cr16-*-* to the list of obsoleted targets.
12029 2021-09-13 Richard Biener <rguenther@suse.de>
12031 * config/avr/elf.h (PREFERRED_DEBUGGING_TYPE): Remove
12032 override, pick up DWARF2_DEBUG define from elfos.h
12034 2021-09-13 Richard Biener <rguenther@suse.de>
12036 * config/rx/rx.h (PREFERRED_DEBUGGING_TYPE): Always define to
12039 2021-09-13 Richard Biener <rguenther@suse.de>
12041 * config/alpha/vms.h (PREFERRED_DEBUGGING_TYPE): Define to
12044 2021-09-13 Richard Biener <rguenther@suse.de>
12046 * config/i386/cygming.h: Always default to DWARF2 debugging.
12047 Do not define DBX_DEBUGGING_INFO, that's done via dbxcoff.h
12049 * doc/install.texi: Document binutils 2.16 as minimum
12050 requirement for mingw.
12052 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
12054 * config/rs6000/rs6000.c (struct rs6000_cost_data): New members
12055 nstmts, nloads and extra_ctor_cost.
12056 (rs6000_density_test): Add load density related heuristics. Do
12057 extra costing on vector construction statements if need.
12058 (rs6000_init_cost): Init new members.
12059 (rs6000_update_target_cost_per_stmt): New function.
12060 (rs6000_add_stmt_cost): Factor vect_nonmem hunk out to function
12061 rs6000_update_target_cost_per_stmt and call it.
12063 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
12065 * config/rs6000/rs6000.c (struct rs6000_cost_data): Remove typedef.
12066 (rs6000_init_cost): Adjust.
12068 2021-09-13 liuhongt <hongtao.liu@intel.com>
12070 * config/i386/i386.md: (UNSPEC_COPYSIGN): Remove.
12071 (UNSPEC_XORSIGN): Ditto.
12073 2021-09-12 Roger Sayle <roger@nextmovesoftware.com>
12075 * expr.c (convert_move): Preserve SUBREG_PROMOTED_VAR_P when
12076 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
12079 2021-09-11 Aldy Hernandez <aldyh@redhat.com>
12081 * tree-ssa-threadbackward.c (class back_threader_registry): Use
12082 back_jt_path_registry.
12083 * tree-ssa-threadedge.c (jump_threader::jump_threader): Use
12084 fwd_jt_path_registry.
12085 * tree-ssa-threadedge.h (class jump_threader): Same..
12086 * tree-ssa-threadupdate.c
12087 (jump_thread_path_registry::jump_thread_path_registry): Rename...
12088 (jt_path_registry::jt_path_registry): ...to this.
12089 (jump_thread_path_registry::~jump_thread_path_registry): Rename...
12090 (jt_path_registry::~jt_path_registry): ...this.
12091 (fwd_jt_path_registry::fwd_jt_path_registry): New.
12092 (fwd_jt_path_registry::~fwd_jt_path_registry): New.
12093 (jump_thread_path_registry::allocate_thread_edge): Rename...
12094 (jt_path_registry::allocate_thread_edge): ...to this.
12095 (jump_thread_path_registry::allocate_thread_path): Rename...
12096 (jt_path_registry::allocate_thread_path): ...to this.
12097 (jump_thread_path_registry::lookup_redirection_data): Rename...
12098 (fwd_jt_path_registry::lookup_redirection_data): ...to this.
12099 (jump_thread_path_registry::thread_block_1): Rename...
12100 (fwd_jt_path_registry::thread_block_1): ...to this.
12101 (jump_thread_path_registry::thread_block): Rename...
12102 (fwd_jt_path_registry::thread_block): ...to this.
12103 (jt_path_registry::thread_through_loop_header): Rename...
12104 (fwd_jt_path_registry::thread_through_loop_header): ...to this.
12105 (jump_thread_path_registry::mark_threaded_blocks): Rename...
12106 (fwd_jt_path_registry::mark_threaded_blocks): ...to this.
12107 (jump_thread_path_registry::debug_path): Rename...
12108 (jt_path_registry::debug_path): ...to this.
12109 (jump_thread_path_registry::dump): Rename...
12110 (jt_path_registry::debug): ...to this.
12111 (jump_thread_path_registry::rewire_first_differing_edge): Rename...
12112 (back_jt_path_registry::rewire_first_differing_edge): ...to this.
12113 (jump_thread_path_registry::adjust_paths_after_duplication): Rename...
12114 (back_jt_path_registry::adjust_paths_after_duplication): ...to this.
12115 (jump_thread_path_registry::duplicate_thread_path): Rename...
12116 (back_jt_path_registry::duplicate_thread_path): ...to this. Also,
12117 drop ill-formed candidates.
12118 (jump_thread_path_registry::remove_jump_threads_including): Rename...
12119 (fwd_jt_path_registry::remove_jump_threads_including): ...to this.
12120 (jt_path_registry::thread_through_all_blocks): New.
12121 (back_jt_path_registry::update_cfg): New.
12122 (fwd_jt_path_registry::update_cfg): New.
12123 (jump_thread_path_registry::register_jump_thread): Rename...
12124 (jt_path_registry::register_jump_thread): ...to this.
12125 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
12127 (class jt_path_registry): ...here.
12128 (class fwd_jt_path_registry): New.
12129 (class back_jt_path_registry): New.
12131 2021-09-10 liuhongt <hongtao.liu@intel.com>
12134 2021-09-01 liuhongt <hongtao.liu@intel.com>
12136 * emit-rtl.c (validate_subreg): Get rid of all float-int
12139 2021-09-10 Jakub Jelinek <jakub@redhat.com>
12141 * tree-core.h (enum omp_memory_order): Add OMP_MEMORY_ORDER_MASK,
12142 OMP_FAIL_MEMORY_ORDER_UNSPECIFIED, OMP_FAIL_MEMORY_ORDER_RELAXED,
12143 OMP_FAIL_MEMORY_ORDER_ACQUIRE, OMP_FAIL_MEMORY_ORDER_RELEASE,
12144 OMP_FAIL_MEMORY_ORDER_ACQ_REL, OMP_FAIL_MEMORY_ORDER_SEQ_CST and
12145 OMP_FAIL_MEMORY_ORDER_MASK enumerators.
12146 (OMP_FAIL_MEMORY_ORDER_SHIFT): Define.
12147 * gimple-pretty-print.c (dump_gimple_omp_atomic_load,
12148 dump_gimple_omp_atomic_store): Print [weak] for weak atomic
12150 * gimple.h (enum gf_mask): Change GF_OMP_ATOMIC_MEMORY_ORDER
12151 to 6-bit mask, adjust GF_OMP_ATOMIC_NEED_VALUE value and add
12152 GF_OMP_ATOMIC_WEAK.
12153 (gimple_omp_atomic_weak_p, gimple_omp_atomic_set_weak): New inline
12155 * tree.h (OMP_ATOMIC_WEAK): Define.
12156 * tree-pretty-print.c (dump_omp_atomic_memory_order): Adjust for
12157 fail memory order being encoded in the same enum and also print
12158 fail clause if present.
12159 (dump_generic_node): Print weak clause if OMP_ATOMIC_WEAK.
12160 * gimplify.c (goa_stabilize_expr): Add target_expr and rhs arguments,
12161 handle pre_p == NULL case as a test mode that only returns value
12162 but doesn't change gimplify nor change anything otherwise, adjust
12163 recursive calls, add MODIFY_EXPR, ADDR_EXPR, COND_EXPR, TARGET_EXPR
12164 and CALL_EXPR handling, adjust COMPOUND_EXPR handling for
12165 __builtin_clear_padding calls, for !rhs gimplify as lvalue rather
12167 (gimplify_omp_atomic): Adjust goa_stabilize_expr caller. Handle
12168 COND_EXPR rhs. Set weak flag on gimple load/store for
12170 * omp-expand.c (omp_memory_order_to_fail_memmodel): New function.
12171 (omp_memory_order_to_memmodel): Adjust for fail clause encoded
12173 (expand_omp_atomic_cas): New function.
12174 (expand_omp_atomic_pipeline): Use omp_memory_order_to_fail_memmodel
12176 (expand_omp_atomic): Attempt to optimize atomic compare and exchange
12177 using expand_omp_atomic_cas.
12179 2021-09-10 Aldy Hernandez <aldyh@redhat.com>
12180 Michael Matz <matz@suse.de>
12182 * tree-pass.h (PROP_loop_opts_done): New.
12183 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
12184 Intersect with global range.
12185 * tree-ssa-loop.c (tree_ssa_loop_done): Set PROP_loop_opts_done.
12186 * tree-ssa-threadbackward.c
12187 (back_threader_profitability::profitable_path_p): Disable
12188 threading through latches until after loop optimizations have run.
12190 2021-09-10 David Faust <david.faust@oracle.com>
12192 * doc/invoke.texi: Document BPF -mcpu, -mjmpext, -mjmp32 and -malu32
12195 2021-09-10 David Faust <david.faust@oracle.com>
12197 * config/bpf/bpf-opts.h (bpf_isa_version): New enum.
12198 * config/bpf/bpf-protos.h (bpf_expand_cbranch): New.
12199 * config/bpf/bpf.c (bpf_option_override): Handle -mcpu option.
12200 (bpf_expand_cbranch): New function.
12201 * config/bpf/bpf.md (AM mode iterator): Conditionalize support for SI
12203 (zero_extendsidi2): Only use mov32 instruction if it is available.
12204 (SIM mode iterator): Conditionalize support for SI mode.
12205 (JM mode iterator): New.
12206 (cbranchdi4): Update name, use new JM iterator. Use bpf_expand_cbranch.
12207 (*branch_on_di): Update name, use new JM iterator.
12208 * config/bpf/bpf.opt: (mjmpext): New option.
12209 (malu32): Likewise.
12210 (mjmp32): Likewise.
12212 (bpf_isa): New enum.
12214 2021-09-10 David Faust <david.faust@oracle.com>
12216 * config/bpf/bpf.md (zero_extendhidi2): Add new output template
12217 for register-to-register extensions.
12218 (zero_extendqidi2): Likewise.
12220 2021-09-10 Richard Biener <rguenther@suse.de>
12222 PR middle-end/102273
12223 * internal-fn.c (expand_DEFERRED_INIT): Always expand non-SSA vars.
12225 2021-09-10 Richard Biener <rguenther@suse.de>
12227 PR middle-end/102269
12228 * gimplify.c (is_var_need_auto_init): Empty types do not need
12231 2021-09-10 Richard Biener <rguenther@suse.de>
12233 * configure.ac (--with-stabs): Remove.
12234 * configure: Regenerate.
12235 * doc/install.texi: Remove --with-stabs documentation.
12237 2021-09-10 liuhongt <hongtao.liu@intel.com>
12239 * config/i386/avx512fp16intrin.h: (_mm512_cmp_ph_mask):
12241 (_mm512_mask_cmp_ph_mask): Likewise.
12242 (_mm512_cmp_round_ph_mask): Likewise.
12243 (_mm512_mask_cmp_round_ph_mask): Likewise.
12244 (_mm_cmp_sh_mask): Likewise.
12245 (_mm_mask_cmp_sh_mask): Likewise.
12246 (_mm_cmp_round_sh_mask): Likewise.
12247 (_mm_mask_cmp_round_sh_mask): Likewise.
12248 (_mm_comieq_sh): Likewise.
12249 (_mm_comilt_sh): Likewise.
12250 (_mm_comile_sh): Likewise.
12251 (_mm_comigt_sh): Likewise.
12252 (_mm_comige_sh): Likewise.
12253 (_mm_comineq_sh): Likewise.
12254 (_mm_ucomieq_sh): Likewise.
12255 (_mm_ucomilt_sh): Likewise.
12256 (_mm_ucomile_sh): Likewise.
12257 (_mm_ucomigt_sh): Likewise.
12258 (_mm_ucomige_sh): Likewise.
12259 (_mm_ucomineq_sh): Likewise.
12260 (_mm_comi_round_sh): Likewise.
12261 (_mm_comi_sh): Likewise.
12262 * config/i386/avx512fp16vlintrin.h (_mm_cmp_ph_mask): New intrinsic.
12263 (_mm_mask_cmp_ph_mask): Likewise.
12264 (_mm256_cmp_ph_mask): Likewise.
12265 (_mm256_mask_cmp_ph_mask): Likewise.
12266 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12267 * config/i386/i386-builtin.def: Add corresponding new builtins.
12268 * config/i386/i386-expand.c
12269 (ix86_expand_args_builtin): Handle new builtin types.
12270 (ix86_expand_round_builtin): Ditto.
12271 * config/i386/i386.md (ssevecmode): Add HF mode.
12272 (MODEFH): New mode iterator.
12273 * config/i386/sse.md
12274 (V48H_AVX512VL): New mode iterator to support HF vector modes.
12275 Ajdust corresponding description.
12276 (ssecmpintprefix): New.
12277 (VI12_AVX512VL): Adjust to support HF vector modes.
12278 (cmp_imm_predicate): Likewise.
12279 (<avx512>_cmp<mode>3<mask_scalar_merge_name><round_saeonly_name>):
12281 (avx512f_vmcmp<mode>3<round_saeonly_name>): Likewise.
12282 (avx512f_vmcmp<mode>3_mask<round_saeonly_name>): Likewise.
12283 (<sse>_<unord>comi<round_saeonly_name>): Likewise.
12285 2021-09-10 liuhongt <hongtao.liu@intel.com>
12287 * config/i386/avx512fp16intrin.h: (_mm512_max_ph): New intrinsic.
12288 (_mm512_mask_max_ph): Likewise.
12289 (_mm512_maskz_max_ph): Likewise.
12290 (_mm512_min_ph): Likewise.
12291 (_mm512_mask_min_ph): Likewise.
12292 (_mm512_maskz_min_ph): Likewise.
12293 (_mm512_max_round_ph): Likewise.
12294 (_mm512_mask_max_round_ph): Likewise.
12295 (_mm512_maskz_max_round_ph): Likewise.
12296 (_mm512_min_round_ph): Likewise.
12297 (_mm512_mask_min_round_ph): Likewise.
12298 (_mm512_maskz_min_round_ph): Likewise.
12299 (_mm_max_sh): Likewise.
12300 (_mm_mask_max_sh): Likewise.
12301 (_mm_maskz_max_sh): Likewise.
12302 (_mm_min_sh): Likewise.
12303 (_mm_mask_min_sh): Likewise.
12304 (_mm_maskz_min_sh): Likewise.
12305 (_mm_max_round_sh): Likewise.
12306 (_mm_mask_max_round_sh): Likewise.
12307 (_mm_maskz_max_round_sh): Likewise.
12308 (_mm_min_round_sh): Likewise.
12309 (_mm_mask_min_round_sh): Likewise.
12310 (_mm_maskz_min_round_sh): Likewise.
12311 * config/i386/avx512fp16vlintrin.h (_mm_max_ph): New intrinsic.
12312 (_mm256_max_ph): Likewise.
12313 (_mm_mask_max_ph): Likewise.
12314 (_mm256_mask_max_ph): Likewise.
12315 (_mm_maskz_max_ph): Likewise.
12316 (_mm256_maskz_max_ph): Likewise.
12317 (_mm_min_ph): Likewise.
12318 (_mm256_min_ph): Likewise.
12319 (_mm_mask_min_ph): Likewise.
12320 (_mm256_mask_min_ph): Likewise.
12321 (_mm_maskz_min_ph): Likewise.
12322 (_mm256_maskz_min_ph): Likewise.
12323 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12324 * config/i386/i386-builtin.def: Add corresponding new builtins.
12325 * config/i386/i386-expand.c
12326 (ix86_expand_args_builtin): Handle new builtin types.
12327 * config/i386/sse.md
12328 (<code><mode>3<mask_name><round_saeonly_name>): Adjust to
12329 support HF vector modes.
12330 (*<code><mode>3<mask_name><round_saeonly_name>): Likewise.
12331 (ieee_<ieee_maxmin><mode>3<mask_name><round_saeonly_name>):
12333 (<sse>_vm<code><mode>3<mask_scalar_name><round_saeonly_scalar_name>):
12335 * config/i386/subst.md (round_saeonly_mode512bit_condition):
12336 Adjust for HF vector modes.
12338 2021-09-10 Liu, Hongtao <hongtao.liu@intel.com>
12340 * config/i386/avx512fp16intrin.h (_mm_add_sh): New intrinsic.
12341 (_mm_mask_add_sh): Likewise.
12342 (_mm_maskz_add_sh): Likewise.
12343 (_mm_sub_sh): Likewise.
12344 (_mm_mask_sub_sh): Likewise.
12345 (_mm_maskz_sub_sh): Likewise.
12346 (_mm_mul_sh): Likewise.
12347 (_mm_mask_mul_sh): Likewise.
12348 (_mm_maskz_mul_sh): Likewise.
12349 (_mm_div_sh): Likewise.
12350 (_mm_mask_div_sh): Likewise.
12351 (_mm_maskz_div_sh): Likewise.
12352 (_mm_add_round_sh): Likewise.
12353 (_mm_mask_add_round_sh): Likewise.
12354 (_mm_maskz_add_round_sh): Likewise.
12355 (_mm_sub_round_sh): Likewise.
12356 (_mm_mask_sub_round_sh): Likewise.
12357 (_mm_maskz_sub_round_sh): Likewise.
12358 (_mm_mul_round_sh): Likewise.
12359 (_mm_mask_mul_round_sh): Likewise.
12360 (_mm_maskz_mul_round_sh): Likewise.
12361 (_mm_div_round_sh): Likewise.
12362 (_mm_mask_div_round_sh): Likewise.
12363 (_mm_maskz_div_round_sh): Likewise.
12364 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12365 * config/i386/i386-builtin.def: Add corresponding new builtins.
12366 * config/i386/i386-expand.c
12367 (ix86_expand_round_builtin): Handle new builtins.
12368 * config/i386/sse.md (VF_128): Change description.
12369 (<sse>_vm<plusminus_insn><mode>3<mask_scalar_name><round_scalar_name>):
12370 Adjust to support HF vector modes.
12371 (<sse>_vm<multdiv_mnemonic><mode>3<mask_scalar_name><round_scalar_name>):
12374 2021-09-10 H.J. Lu <hjl.tools@gmail.com>
12376 * config/i386/i386-expand.c
12377 (ix86_avx256_split_vector_move_misalign): Handle V16HF mode.
12378 * config/i386/i386.c
12379 (ix86_preferred_simd_mode): Handle HF mode.
12380 * config/i386/sse.md (V_256H): New mode iterator.
12381 (avx_vextractf128<mode>): Use it.
12382 (VEC_INIT_MODE): Align vector HFmode condition to vector
12383 HImodes since there're no real HF instruction used.
12384 (VEC_INIT_HALF_MODE): Ditto.
12386 (VIHF_AVX512BW): Ditto.
12387 (*vec_extracthf): Ditto.
12388 (VEC_EXTRACT_MODE): Ditto.
12390 2021-09-10 Richard Biener <rguenther@suse.de>
12393 * config/dbx.h: Remove.
12394 * config/dbxcoff.h: Do not define PREFERRED_DEBUGGING_TYPE.
12395 * config/lynx.h: Likewise.
12397 2021-09-10 liuhongt <hongtao.liu@intel.com>
12399 * config/i386/i386-expand.c (ix86_expand_copysign): Expand
12400 right into ANDNOT + AND + IOR, using paradoxical subregs.
12401 (ix86_split_copysign_const): Remove.
12402 (ix86_split_copysign_var): Ditto.
12403 * config/i386/i386-protos.h (ix86_split_copysign_const): Dotto.
12404 (ix86_split_copysign_var): Ditto.
12405 * config/i386/i386.md (@copysign<mode>3_const): Ditto.
12406 (@copysign<mode>3_var): Ditto.
12408 2021-09-09 qing zhao <qing.zhao@oracle.com>
12410 * builtins.c (expand_builtin_memset): Make external visible.
12411 * builtins.h (expand_builtin_memset): Declare extern.
12412 * common.opt (ftrivial-auto-var-init=): New option.
12413 * doc/extend.texi: Document the uninitialized attribute.
12414 * doc/invoke.texi: Document -ftrivial-auto-var-init.
12415 * flag-types.h (enum auto_init_type): New enumerated type
12417 * gimple-fold.c (clear_padding_type): Add one new parameter.
12418 (clear_padding_union): Likewise.
12419 (clear_padding_emit_loop): Likewise.
12420 (clear_type_padding_in_mask): Likewise.
12421 (gimple_fold_builtin_clear_padding): Handle this new parameter.
12422 * gimplify.c (gimple_add_init_for_auto_var): New function.
12423 (gimple_add_padding_init_for_auto_var): New function.
12424 (is_var_need_auto_init): New function.
12425 (gimplify_decl_expr): Add initialization to automatic variables per
12427 (gimplify_call_expr): Add one new parameter for call to
12428 __builtin_clear_padding.
12429 (gimplify_init_constructor): Add padding initialization in the end.
12430 * internal-fn.c (INIT_PATTERN_VALUE): New macro.
12431 (expand_DEFERRED_INIT): New function.
12432 * internal-fn.def (DEFERRED_INIT): New internal function.
12433 * tree-cfg.c (verify_gimple_call): Verify calls to .DEFERRED_INIT.
12434 * tree-sra.c (generate_subtree_deferred_init): New function.
12435 (scan_function): Avoid setting cannot_scalarize_away_bitmap for
12436 calls to .DEFERRED_INIT.
12437 (sra_modify_deferred_init): New function.
12438 (sra_modify_function_body): Handle calls to DEFERRED_INIT specially.
12439 * tree-ssa-structalias.c (find_func_aliases_for_call): Likewise.
12440 * tree-ssa-uninit.c (warn_uninit): Handle calls to DEFERRED_INIT
12442 (check_defs): Likewise.
12443 (warn_uninitialized_vars): Likewise.
12444 * tree-ssa.c (ssa_undefined_value_p): Likewise.
12445 * tree.c (build_common_builtin_nodes): Build tree node for
12446 BUILT_IN_CLEAR_PADDING when needed.
12448 2021-09-09 Richard Biener <rguenther@suse.de>
12450 * tree-ssa-loop-im.c (fill_always_executed_in_1): Walk
12453 2021-09-09 Richard Biener <rguenther@suse.de>
12455 * tree-ssa-loop-im.c (fill_always_executed_in_1): Integrate
12456 DOM walk from get_loop_body_in_dom_order using a worklist
12459 2021-09-09 liuhongt <hongtao.liu@intel.com>
12461 * config.gcc: Add avx512fp16vlintrin.h.
12462 * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic.
12463 (_mm512_mask_add_ph): Likewise.
12464 (_mm512_maskz_add_ph): Likewise.
12465 (_mm512_sub_ph): Likewise.
12466 (_mm512_mask_sub_ph): Likewise.
12467 (_mm512_maskz_sub_ph): Likewise.
12468 (_mm512_mul_ph): Likewise.
12469 (_mm512_mask_mul_ph): Likewise.
12470 (_mm512_maskz_mul_ph): Likewise.
12471 (_mm512_div_ph): Likewise.
12472 (_mm512_mask_div_ph): Likewise.
12473 (_mm512_maskz_div_ph): Likewise.
12474 (_mm512_add_round_ph): Likewise.
12475 (_mm512_mask_add_round_ph): Likewise.
12476 (_mm512_maskz_add_round_ph): Likewise.
12477 (_mm512_sub_round_ph): Likewise.
12478 (_mm512_mask_sub_round_ph): Likewise.
12479 (_mm512_maskz_sub_round_ph): Likewise.
12480 (_mm512_mul_round_ph): Likewise.
12481 (_mm512_mask_mul_round_ph): Likewise.
12482 (_mm512_maskz_mul_round_ph): Likewise.
12483 (_mm512_div_round_ph): Likewise.
12484 (_mm512_mask_div_round_ph): Likewise.
12485 (_mm512_maskz_div_round_ph): Likewise.
12486 * config/i386/avx512fp16vlintrin.h: New header.
12487 * config/i386/i386-builtin-types.def (V16HF, V8HF, V32HF):
12488 Add new builtin types.
12489 * config/i386/i386-builtin.def: Add corresponding builtins.
12490 * config/i386/i386-expand.c
12491 (ix86_expand_args_builtin): Handle new builtin types.
12492 (ix86_expand_round_builtin): Likewise.
12493 * config/i386/immintrin.h: Include avx512fp16vlintrin.h
12494 * config/i386/sse.md (VFH): New mode_iterator.
12496 (avx512fmaskmode): Add HF vector modes.
12497 (avx512fmaskhalfmode): Likewise.
12498 (<plusminus_insn><mode>3<mask_name><round_name>): Adjust to for
12500 (*<plusminus_insn><mode>3<mask_name><round_name>): Likewise.
12501 (mul<mode>3<mask_name><round_name>): Likewise.
12502 (*mul<mode>3<mask_name><round_name>): Likewise.
12503 (div<mode>3): Likewise.
12504 (<sse>_div<mode>3<mask_name><round_name>): Likewise.
12505 * config/i386/subst.md (SUBST_V): Add HF vector modes.
12506 (SUBST_A): Likewise.
12507 (round_mode512bit_condition): Adjust for V32HFmode.
12509 2021-09-09 liuhongt <hongtao.liu@intel.com>
12512 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
12513 (reduc_plus_scal_v4sf): .. this, New define_expand.
12514 (reduc_plus_scal_v2df): .. and this, New define_expand.
12516 2021-09-09 liuhongt <hongtao.liu@intel.com>
12519 * config/i386/sse.md (*vec_extract<mode><ssescalarmodelower>_valign):
12522 2021-09-08 Jonathan Wakely <jwakely@redhat.com>
12525 * doc/trouble.texi (Copy Assignment): Fix description of
12526 behaviour and fix code in example.
12528 2021-09-08 Segher Boessenkool <segher@kernel.crashing.org>
12531 * config/rs6000/rs6000-logue.c (rs6000_emit_epilogue): For ELFv2 use
12532 r11 instead of r12 for restoring CR.
12534 2021-09-08 Jakub Jelinek <jakub@redhat.com>
12535 liuhongt <hongtao.liu@intel.com>
12538 * config/i386/i386.md (@xorsign<mode>3_1): Remove.
12539 * config/i386/i386-expand.c (ix86_expand_xorsign): Expand right away
12540 into AND with mask and XOR, using paradoxical subregs.
12541 (ix86_split_xorsign): Remove.
12542 * config/i386/i386-protos.h (ix86_split_xorsign): Remove.
12544 2021-09-08 Di Zhao <dizhao@os.amperecomputing.com>
12546 * tree-ssa-sccvn.c (vn_nary_op_insert_into): fix result compare
12548 2021-09-08 Jakub Jelinek <jakub@redhat.com>
12551 * config/i386/i386.md (xorsign<mode>3): If operands[1] is equal to
12552 operands[2], emit abs<mode>2 instead.
12553 (@xorsign<mode>3_1): Add early-clobbers for output operand, enable
12554 first alternative even for avx, add another alternative with
12555 =&Yv <- 0, Yv, Yvm constraints.
12556 * config/i386/i386-expand.c (ix86_split_xorsign): If op0 is equal
12557 to op1, emit vpandn instead.
12559 2021-09-08 liuhongt <hongtao.liu@intel.com>
12561 * config/i386/avx512fp16intrin.h (_mm_set_ph): New intrinsic.
12562 (_mm256_set_ph): Likewise.
12563 (_mm512_set_ph): Likewise.
12564 (_mm_setr_ph): Likewise.
12565 (_mm256_setr_ph): Likewise.
12566 (_mm512_setr_ph): Likewise.
12567 (_mm_set1_ph): Likewise.
12568 (_mm256_set1_ph): Likewise.
12569 (_mm512_set1_ph): Likewise.
12570 (_mm_setzero_ph): Likewise.
12571 (_mm256_setzero_ph): Likewise.
12572 (_mm512_setzero_ph): Likewise.
12573 (_mm_set_sh): Likewise.
12574 (_mm_load_sh): Likewise.
12575 (_mm_store_sh): Likewise.
12576 * config/i386/i386-builtin-types.def (V8HF): New type.
12577 (DEF_FUNCTION_TYPE (V8HF, V8HI)): New builtin function type
12578 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
12579 Support vector HFmodes.
12580 (ix86_expand_vector_init_one_nonzero): Likewise.
12581 (ix86_expand_vector_init_one_var): Likewise.
12582 (ix86_expand_vector_init_interleave): Likewise.
12583 (ix86_expand_vector_init_general): Likewise.
12584 (ix86_expand_vector_set): Likewise.
12585 (ix86_expand_vector_extract): Likewise.
12586 (ix86_expand_vector_init_concat): Likewise.
12587 (ix86_expand_sse_movcc): Handle vector HFmodes.
12588 (ix86_expand_vector_set_var): Ditto.
12589 * config/i386/i386-modes.def: Add HF vector modes in comment.
12590 * config/i386/i386.c (classify_argument): Add HF vector modes.
12591 (ix86_hard_regno_mode_ok): Allow HF vector modes for AVX512FP16.
12592 (ix86_vector_mode_supported_p): Likewise.
12593 (ix86_set_reg_reg_cost): Handle vector HFmode.
12594 (ix86_get_ssemov): Handle vector HFmode.
12595 (function_arg_advance_64): Pass unamed V16HFmode and V32HFmode
12597 (function_arg_advance_32): Pass V8HF/V16HF/V32HF by sse reg for 32bit
12599 (function_arg_advance_32): Ditto.
12600 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): New.
12601 (VALID_AVX256_REG_OR_OI_MODE): Rename to ..
12602 (VALID_AVX256_REG_OR_OI_VHF_MODE): .. this, and add V16HF.
12603 (VALID_SSE2_REG_VHF_MODE): New.
12604 (VALID_AVX512VL_128_REG_MODE): Add V8HF and TImode.
12605 (SSE_REG_MODE_P): Add vector HFmode.
12606 * config/i386/i386.md (mode): Add HF vector modes.
12607 (MODE_SIZE): Likewise.
12608 (ssemodesuffix): Add ph suffix for HF vector modes.
12609 * config/i386/sse.md (VFH_128): New mode iterator.
12610 (VMOVE): Adjust for HF vector modes.
12612 (V_256_512): Likewise.
12613 (avx512): Likewise.
12614 (avx512fmaskmode): Likewise.
12615 (shuffletype): Likewise.
12616 (sseinsnmode): Likewise.
12617 (ssedoublevecmode): Likewise.
12618 (ssehalfvecmode): Likewise.
12619 (ssehalfvecmodelower): Likewise.
12620 (ssePScmode): Likewise.
12621 (ssescalarmode): Likewise.
12622 (ssescalarmodelower): Likewise.
12623 (sseintprefix): Likewise.
12625 (bcstscalarsuff): Likewise.
12626 (xtg_mode): Likewise.
12627 (VI12HF_AVX512VL): New mode_iterator.
12628 (VF_AVX512FP16): Likewise.
12630 (VIHF_256): Likewise.
12631 (VIHF_AVX512BW): Likewise.
12632 (V16_256): Likewise.
12633 (V32_512): Likewise.
12634 (sseintmodesuffix): New mode_attr.
12635 (sse): Add scalar and vector HFmodes.
12636 (ssescalarmode): Add vector HFmode mapping.
12637 (ssescalarmodesuffix): Add sh suffix for HFmode.
12638 (*<sse>_vm<insn><mode>3): Use VFH_128.
12639 (*<sse>_vm<multdiv_mnemonic><mode>3): Likewise.
12640 (*ieee_<ieee_maxmin><mode>3): Likewise.
12641 (<avx512>_blendm<mode>): New define_insn.
12642 (vec_setv8hf): New define_expand.
12643 (vec_set<mode>_0): New define_insn for HF vector set.
12644 (*avx512fp16_movsh): Likewise.
12645 (avx512fp16_movsh): Likewise.
12646 (vec_extract_lo_v32hi): Rename to ...
12647 (vec_extract_lo_<mode>): ... this, and adjust to allow HF
12649 (vec_extract_hi_v32hi): Likewise.
12650 (vec_extract_hi_<mode>): Likewise.
12651 (vec_extract_lo_v16hi): Likewise.
12652 (vec_extract_lo_<mode>): Likewise.
12653 (vec_extract_hi_v16hi): Likewise.
12654 (vec_extract_hi_<mode>): Likewise.
12655 (vec_set_hi_v16hi): Likewise.
12656 (vec_set_hi_<mode>): Likewise.
12657 (vec_set_lo_v16hi): Likewise.
12658 (vec_set_lo_<mode>): Likewise.
12659 (*vec_extract<mode>_0): New define_insn_and_split for HF
12661 (*vec_extracthf): New define_insn.
12662 (VEC_EXTRACT_MODE): Add HF vector modes.
12663 (PINSR_MODE): Add V8HF.
12664 (sse2p4_1): Likewise.
12665 (pinsr_evex_isa): Likewise.
12666 (<sse2p4_1>_pinsr<ssemodesuffix>): Adjust to support
12667 insert for V8HFmode.
12668 (pbroadcast_evex_isa): Add HF vector modes.
12669 (AVX2_VEC_DUP_MODE): Likewise.
12670 (VEC_INIT_MODE): Likewise.
12671 (VEC_INIT_HALF_MODE): Likewise.
12672 (avx2_pbroadcast<mode>): Adjust to support HF vector mode
12674 (avx2_pbroadcast<mode>_1): Likewise.
12675 (<avx512>_vec_dup<mode>_1): Likewise.
12676 (<avx512>_vec_dup<mode><mask_name>): Likewise.
12677 (<mask_codefor><avx512>_vec_dup_gpr<mode><mask_name>):
12680 2021-09-08 Guo, Xuepeng <xuepeng.guo@intel.com>
12681 H.J. Lu <hongjiu.lu@intel.com>
12682 Liu Hongtao <hongtao.liu@intel.com>
12683 Wang Hongyu <hongyu.wang@intel.com>
12684 Xu Dianhong <dianhong.xu@intel.com>
12686 * common/config/i386/cpuinfo.h (get_available_features):
12687 Detect FEATURE_AVX512FP16.
12688 * common/config/i386/i386-common.c
12689 (OPTION_MASK_ISA_AVX512FP16_SET,
12690 OPTION_MASK_ISA_AVX512FP16_UNSET,
12691 OPTION_MASK_ISA2_AVX512FP16_SET,
12692 OPTION_MASK_ISA2_AVX512FP16_UNSET): New.
12693 (OPTION_MASK_ISA2_AVX512BW_UNSET,
12694 OPTION_MASK_ISA2_AVX512BF16_UNSET): Add AVX512FP16.
12695 (ix86_handle_option): Handle -mavx512fp16.
12696 * common/config/i386/i386-cpuinfo.h (enum processor_features):
12697 Add FEATURE_AVX512FP16.
12698 * common/config/i386/i386-isas.h: Add entry for AVX512FP16.
12699 * config.gcc: Add avx512fp16intrin.h.
12700 * config/i386/avx512fp16intrin.h: New intrinsic header.
12701 * config/i386/cpuid.h: Add bit_AVX512FP16.
12702 * config/i386/i386-builtin-types.def: (FLOAT16): New primitive type.
12703 * config/i386/i386-builtins.c: Support _Float16 type for i386
12705 (ix86_register_float16_builtin_type): New function.
12706 (ix86_float16_type_node): New.
12707 * config/i386/i386-c.c (ix86_target_macros_internal): Define
12709 * config/i386/i386-expand.c (ix86_expand_branch): Support
12711 (ix86_prepare_fp_compare_args): Adjust TARGET_SSE_MATH &&
12712 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
12713 (ix86_expand_fp_movcc): Ditto.
12714 * config/i386/i386-isa.def: Add PTA define for AVX512FP16.
12715 * config/i386/i386-options.c (isa2_opts): Add -mavx512fp16.
12716 (ix86_valid_target_attribute_inner_p): Add avx512fp16 attribute.
12717 * config/i386/i386.c (ix86_get_ssemov): Use
12718 vmovdqu16/vmovw/vmovsh for HFmode/HImode scalar or vector.
12719 (ix86_get_excess_precision): Use
12720 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when TARGET_AVX512FP16
12722 (sse_store_index): Use SFmode cost for HFmode cost.
12723 (inline_memory_move_cost): Add HFmode, and perfer SSE cost over
12724 GPR cost for HFmode.
12725 (ix86_hard_regno_mode_ok): Allow HImode in sse register.
12726 (ix86_mangle_type): Add manlging for _Float16 type.
12727 (inline_secondary_memory_needed): No memory is needed for
12728 16bit movement between gpr and sse reg under
12730 (ix86_multiplication_cost): Adjust TARGET_SSE_MATH &&
12731 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
12732 (ix86_division_cost): Ditto.
12733 (ix86_rtx_costs): Ditto.
12734 (ix86_add_stmt_cost): Ditto.
12735 (ix86_optab_supported_p): Ditto.
12736 * config/i386/i386.h (VALID_AVX512F_SCALAR_MODE): Add HFmode.
12737 (SSE_FLOAT_MODE_SSEMATH_OR_HF_P): Add HFmode.
12738 (PTA_SAPPHIRERAPIDS): Add PTA_AVX512FP16.
12739 * config/i386/i386.md (mode): Add HFmode.
12740 (MODE_SIZE): Add HFmode.
12741 (isa): Add avx512fp16.
12742 (enabled): Handle avx512fp16.
12743 (ssemodesuffix): Add sh suffix for HFmode.
12744 (comm): Add mult, div.
12745 (plusminusmultdiv): New code iterator.
12746 (insn): Add mult, div.
12747 (*movhf_internal): Adjust for avx512fp16 instruction.
12748 (*movhi_internal): Ditto.
12749 (*cmpi<unord>hf): New define_insn for HFmode.
12750 (*ieee_s<ieee_maxmin>hf3): Likewise.
12751 (extendhf<mode>2): Likewise.
12752 (trunc<mode>hf2): Likewise.
12753 (float<floatunssuffix><mode>hf2): Likewise.
12754 (*<insn>hf): Likewise.
12755 (cbranchhf4): New expander.
12756 (movhfcc): Likewise.
12757 (<insn>hf3): Likewise.
12758 (mulhf3): Likewise.
12759 (divhf3): Likewise.
12760 * config/i386/i386.opt: Add mavx512fp16.
12761 * config/i386/immintrin.h: Include avx512fp16intrin.h.
12762 * doc/invoke.texi: Add mavx512fp16.
12763 * doc/extend.texi: Add avx512fp16 Usage Notes.
12765 2021-09-08 liuhongt <hongtao.liu@intel.com>
12767 * common.opt: Support -fexcess-precision=16.
12768 * config/aarch64/aarch64.c (aarch64_excess_precision): Return
12769 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when
12770 EXCESS_PRECISION_TYPE_FLOAT16.
12771 * config/arm/arm.c (arm_excess_precision): Ditto.
12772 * config/i386/i386.c (ix86_get_excess_precision): Ditto.
12773 * config/m68k/m68k.c (m68k_excess_precision): Issue an error
12774 when EXCESS_PRECISION_TYPE_FLOAT16.
12775 * config/s390/s390.c (s390_excess_precision): Ditto.
12776 * coretypes.h (enum excess_precision_type): Add
12777 EXCESS_PRECISION_TYPE_FLOAT16.
12778 * doc/tm.texi (TARGET_C_EXCESS_PRECISION): Update documents.
12779 * doc/tm.texi.in (TARGET_C_EXCESS_PRECISION): Ditto.
12780 * doc/extend.texi (Half-Precision): Document
12781 -fexcess-precision=16.
12782 * flag-types.h (enum excess_precision): Add
12783 EXCESS_PRECISION_FLOAT16.
12784 * target.def (excess_precision): Update document.
12785 * tree.c (excess_precision_type): Set excess_precision_type to
12786 EXCESS_PRECISION_FLOAT16 when -fexcess-precision=16.
12788 2021-09-08 liuhongt <hongtao.liu@intel.com>
12790 * doc/extend.texi: (@node Floating Types): Adjust the wording.
12791 (@node Half-Precision): Ditto.
12793 2021-09-07 Takayuki 'January June' Suwa <jjsuwa_sys3175@yahoo.co.jp>
12796 * config/xtensa/xtensa.c (xtensa_emit_move_sequence): Add
12797 'CONST_INT_P (src)' to the condition of the block that tries to
12798 eliminate literal when loading integer contant.
12800 2021-09-07 David Faust <david.faust@oracle.com>
12802 * doc/extend.texi (BPF Type Attributes) New node.
12803 Document new preserve_access_index attribute.
12804 Document new preserve_access_index builtin.
12805 * doc/invoke.texi: Document -mco-re and -mno-co-re options.
12807 2021-09-07 David Faust <david.faust@oracle.com>
12809 * config/bpf/bpf.c: Adjust includes.
12810 (bpf_handle_preserve_access_index_attribute): New function.
12811 (bpf_attribute_table): Use it here.
12812 (bpf_builtins): Add BPF_BUILTIN_PRESERVE_ACCESS_INDEX.
12813 (bpf_option_override): Handle "-mco-re" option.
12814 (bpf_asm_init_sections): New.
12815 (TARGET_ASM_INIT_SECTIONS): Redefine.
12816 (bpf_file_end): New.
12817 (TARGET_ASM_FILE_END): Redefine.
12818 (bpf_init_builtins): Add "__builtin_preserve_access_index".
12819 (bpf_core_compute, bpf_core_get_index): New.
12820 (is_attr_preserve_access): New.
12821 (bpf_expand_builtin): Handle new builtins.
12822 (bpf_core_newdecl, bpf_core_is_maybe_aggregate_access): New.
12823 (bpf_core_walk): New.
12824 (bpf_resolve_overloaded_builtin): New.
12825 (TARGET_RESOLVE_OVERLOADED_BUILTIN): Redefine.
12826 (handle_attr): New.
12827 (pass_bpf_core_attr): New RTL pass.
12828 * config/bpf/bpf-passes.def: New file.
12829 * config/bpf/bpf-protos.h (make_pass_bpf_core_attr): New.
12830 * config/bpf/coreout.c: New file.
12831 * config/bpf/coreout.h: Likewise.
12832 * config/bpf/t-bpf (TM_H): Add $(srcdir)/config/bpf/coreout.h.
12833 (coreout.o): New rule.
12834 (PASSES_EXTRA): Add $(srcdir)/config/bpf/bpf-passes.def.
12835 * config.gcc (bpf): Add coreout.h to extra_headers.
12836 Add coreout.o to extra_objs.
12837 Add $(srcdir)/config/bpf/coreout.c to target_gtfiles.
12839 2021-09-07 David Faust <david.faust@oracle.com>
12841 * btfout.c (get_btf_id): Function is no longer static.
12842 * ctfc.h: Expose it here.
12844 2021-09-07 David Faust <david.faust@oracle.com>
12846 * ctfc.c (ctf_lookup_tree_type): New function.
12847 * ctfc.h: Likewise.
12849 2021-09-07 David Faust <david.faust@oracle.com>
12851 * ctfc.c (ctf_dtd_lookup): Function is no longer static.
12852 * ctfc.h: Analogous change.
12854 2021-09-07 David Faust <david.faust@oracle.com>
12856 * dwarf2out.c (lookup_type_die): Function is no longer static.
12857 * dwarf2out.h: Expose it here.
12859 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12861 * dwarf2ctf.c (ctf_debug_finalize): Make it static.
12862 (ctf_debug_early_finish): New definition.
12863 (ctf_debug_finish): Likewise.
12864 * dwarf2ctf.h (ctf_debug_finalize): Remove declaration.
12865 (ctf_debug_early_finish): New declaration.
12866 (ctf_debug_finish): Likewise.
12867 * dwarf2out.c (dwarf2out_finish): Invoke ctf_debug_finish.
12868 (dwarf2out_early_finish): Invoke ctf_debug_early_finish.
12870 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12872 * config/bpf/bpf.c (bpf_option_override): For BPF backend, disable LTO
12873 support when compiling for CO-RE.
12874 * config/bpf/bpf.opt: Add new command line option -mco-re.
12876 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12878 * flag-types.h (enum debug_info_type): Add new enum
12879 DINFO_TYPE_BTF_WITH_CORE.
12880 (BTF_WITH_CORE_DEBUG): New bitmask.
12881 * flags.h (btf_with_core_debuginfo_p): New declaration.
12882 * opts.c (btf_with_core_debuginfo_p): New definition.
12884 2021-09-07 Jason Merrill <jason@redhat.com>
12886 * tree.h (error_operand_p): Change to inline function.
12888 2021-09-07 Aldy Hernandez <aldyh@redhat.com>
12890 * tree-ssa-threadedge.c (forwarder_block_p): Rename to...
12891 (empty_block_with_phis_p): ...this.
12892 (potentially_threadable_block): Same.
12893 (jump_threader::thread_through_normal_block): Same.
12895 2021-09-07 Eric Botcazou <ebotcazou@adacore.com>
12898 * dwarf2out.c (mark_base_types): New overloaded function.
12899 (dwarf2out_early_finish): Invoke it on the COMDAT type list as well
12900 as the compilation unit, and call move_marked_base_types afterward.
12902 2021-09-07 H.J. Lu <hjl.tools@gmail.com>
12905 * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse):
12907 (ix86_expand_vector_convert_uns_vsivsf): Likewise.
12909 2021-09-07 Richard Biener <rguenther@suse.de>
12911 PR tree-optimization/102226
12912 * tree-vect-loop.c (vect_transform_cycle_phi): Record
12913 the converted value for the epilogue PHI use.
12915 2021-09-07 Martin Liska <mliska@suse.cz>
12917 PR gcov-profile/80223
12918 * ipa-inline.c (can_inline_edge_p): Similarly to sanitizer
12919 options, do not inline when no_profile_instrument_function
12920 attributes are different in early inliner. It's fine to inline
12921 it after PGO instrumentation.
12923 2021-09-07 Richard Biener <rguenther@suse.de>
12925 PR tree-optimization/101555
12926 * tree-ssa-pre.c (translate_vuse_through_block): Do not
12927 perform an alias walk to determine the validity of the
12928 mem at the start of the block which is already guaranteed
12929 by means of prune_clobbered_mems.
12930 (phi_translate_1): Pass edge to translate_vuse_through_block.
12932 2021-09-07 Xionghu Luo <luoxhu@linux.ibm.com>
12935 * config/rs6000/rs6000.md (fmod<mode>3): New define_expand.
12936 (remainder<mode>3): Likewise.
12938 2021-09-07 YunQiang Su <yunqiang.su@cipunited.com>
12940 * config/mips/mips.c (mips_file_start): add .module for
12943 2021-09-06 Roger Sayle <roger@nextmovesoftware.com>
12945 * wide-int.cc (wi::clz): Reorder tests to ensure the result
12946 is zero for all negative values.
12948 2021-09-06 Tobias Burnus <tobias@codesourcery.com>
12950 * doc/invoke.texi (-foffload-options): Fix @opindex.
12952 2021-09-06 H.J. Lu <hjl.tools@gmail.com>
12955 * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2].
12956 * config/i386/i386.md (@xorsign<mode>3_1): Add non-destructive
12957 source alternative for AVX.
12959 2021-09-06 liuhongt <hongtao.liu@intel.com>
12961 PR middle-end/102182
12962 * optabs.c (expand_fix): Add from1 to avoid from being
12965 2021-09-06 Eric Botcazou <ebotcazou@adacore.com>
12967 * dwarf2out.c (modified_type_die): Deal with all array types earlier
12968 and use local variable consistently throughout the function.
12970 2021-09-06 Jakub Jelinek <jakub@redhat.com>
12972 PR tree-optimization/102207
12973 * match.pd: Don't demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
12974 were promoted from signed to wider unsigned type.
12976 2021-09-06 Andrew Pinski <apinski@marvell.com>
12978 PR tree-optimization/63184
12979 * match.pd: Add simplification of pointer_diff of two pointer_plus
12980 with addr_expr in the first operand of each pointer_plus.
12981 Add simplificatoin of ne/eq of two pointer_plus with addr_expr
12982 in the first operand of each pointer_plus.
12984 2021-09-06 Richard Biener <rguenther@suse.de>
12986 PR tree-optimization/102176
12987 * tree-vect-slp.c (vect_slp_gather_vectorized_scalar_stmts):
12989 (vect_bb_slp_scalar_cost): Use the computed set of
12990 vectorized scalar stmts instead of relying on the out-of-date
12991 and not accurate PURE_SLP_STMT.
12992 (vect_bb_vectorization_profitable_p): Compute the set
12993 of vectorized scalar stmts.
12995 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12997 * gimple-range-path.cc (path_range_query::range_of_stmt): Remove
12998 GIMPLE_COND special casing.
12999 (path_range_query::range_defined_in_block): Use range_of_stmt
13000 instead of calling fold_range directly.
13002 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
13004 * gimple-range-path.cc (path_range_query::range_of_expr): Set
13005 m_undefined_path when appropriate.
13006 (path_range_query::internal_range_of_expr): Copy from range_of_expr.
13007 (path_range_query::unreachable_path_p): New.
13008 (path_range_query::precompute_ranges): Set m_undefined_path.
13009 * gimple-range-path.h (path_range_query::unreachable_path_p): New.
13010 (path_range_query::internal_range_of_expr): New.
13011 * tree-ssa-threadbackward.c (back_threader::find_taken_edge_cond):
13012 Use unreachable_path_p.
13014 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
13016 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
13017 Remove argument and call find_taken_edge.
13018 (back_threader::resolve_phi): Do not calculate taken edge before
13019 calling maybe_register_path.
13020 (back_threader::find_paths_to_names): Same.
13022 2021-09-05 Jeff Law <jlaw@localhost.localdomain>
13024 * config/h8300/h8300.md (QHSI2 mode iterator): New mode iterator.
13025 * config/h8300/testcompare.md (store_c): Update name, use new
13027 (store_neg_c, store_shifted_c): New patterns.
13029 2021-09-03 Segher Boessenkool <segher@kernel.crashing.org>
13032 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): On ELFv2 use r11
13033 instead of r12 for CR save, in all cases.
13035 2021-09-03 Andrew Pinski <apinski@marvell.com>
13037 * config/aarch64/aarch64-sve-builtins.cc (register_vector_type):
13038 Handle error_mark_node as the type of the type_decl.
13040 2021-09-03 Andrew Pinski <apinski@marvell.com>
13042 * config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
13044 (aarch64_simd_types): Likewise.
13045 (aarch64_simd_intOI_type_node): Likewise.
13046 (aarch64_simd_intCI_type_node): Likewise.
13047 (aarch64_simd_intXI_type_node): Likewise.
13048 * config/aarch64/aarch64.h (aarch64_fp16_type_node): Likewise.
13049 (aarch64_fp16_ptr_type_node): Likewise.
13050 (aarch64_bf16_type_node): Likewise.
13051 (aarch64_bf16_ptr_type_node): Likewise.
13053 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13055 * range-op.cc (operator_minus::op1_op2_relation_effect): Abstract
13057 (minus_op1_op2_relation_effect): ...here.
13058 (class operator_pointer_diff): New.
13059 (operator_pointer_diff::op1_op2_relation_effect): Call
13060 minus_op1_op2_relation_effect.
13061 (integral_table::integral_table): Add entry for POINTER_DIFF_EXPR.
13063 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13065 * tree-ssa-threadbackward.c (back_threader::thread_through_all_blocks):
13066 Add may_peel_loop_headers.
13067 (back_threader_registry::thread_through_all_blocks): Same.
13068 (try_thread_blocks): Pass may_peel_loop_headers argument.
13069 (pass_early_thread_jumps::execute): Same.
13071 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13073 * tree-ssa-threadedge.c (has_phis_p): New.
13074 (forwarder_block_p): New.
13075 (potentially_threadable_block): Call forwarder_block_p.
13076 (jump_threader::thread_around_empty_blocks): Call has_phis_p.
13077 (jump_threader::thread_through_normal_block): Call
13080 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13082 * tree-ssa-threadbackward.c (back_threader::dump): New.
13083 (back_threader::debug): New.
13084 (back_threader_profitability::profitable_path_p): Dump blocks
13085 even if we are bailing early.
13087 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13089 * tree-ssa-threadupdate.c (cancel_thread): New.
13090 (jump_thread_path_registry::thread_block_1): Use cancel_thread.
13091 (jump_thread_path_registry::mark_threaded_blocks): Same.
13092 (jump_thread_path_registry::register_jump_thread): Same.
13094 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13096 * tree-ssa-threadedge.c (jt_state::push): Only call methods for
13097 which objects are available.
13098 (jt_state::pop): Same.
13099 (jt_state::register_equiv): Same.
13100 (jt_state::register_equivs_on_edge): Same.
13102 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13104 * tree-ssa-threadedge.c (jump_threader::thread_across_edge):
13105 Move pop until after a thread is registered.
13107 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13109 * tree-ssa-threadupdate.c (debug): New.
13111 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13113 * gimple-range-trace.cc (push_dump_file::push_dump_file): New.
13114 (push_dump_file::~push_dump_file): New.
13115 (dump_ranger): Change dump_file temporarily while dumping
13117 * gimple-range-trace.h (class push_dump_file): New.
13119 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13121 * gimple-range-trace.cc (debug_seed_ranger): Remove static.
13122 (dump_ranger): Dump function name.
13124 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13126 * gimple-range-path.cc (path_range_query::range_defined_in_block):
13127 Adjust for non-null.
13128 (path_range_query::adjust_for_non_null_uses): New.
13129 (path_range_query::precompute_ranges): Call
13130 adjust_for_non_null_uses.
13131 * gimple-range-path.h: Add m_non_null and
13132 adjust_for_non_null_uses.
13134 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13136 * gimple-range-path.cc (path_range_query::dump): Dump path
13138 (path_range_query::precompute_ranges): Dump entire path.
13140 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13142 * value-relation.cc (relation_oracle::debug): New.
13143 * value-relation.h (relation_oracle::debug): New.
13145 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13147 * tree-ssa-loop-ch.c: Remove unnecessary include file.
13149 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13151 * gimple-range-fold.cc (fold_using_range::postfold_gcond_edges):
13152 Skip statements with no defining BB.
13153 * gimple-range-path.cc (path_range_query::range_defined_in_block):
13154 Do not get confused by statements with no defining BB.
13156 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13158 * gimple-range-fold.cc (adjust_imagpart_expr): Move from
13159 gimple_range_adjustment. Add support for constants.
13160 (adjust_realpart_expr): New.
13161 (gimple_range_adjustment): Move IMAGPART_EXPR code to
13162 adjust_imagpart_expr.
13163 * range-op.cc (integral_table::integral_table): Add entry for
13166 2021-09-03 Jakub Jelinek <jakub@redhat.com>
13168 * omp-expand.c (expand_omp_atomic_pipeline): Use
13169 IFN_ATOMIC_COMPARE_EXCHANGE instead of
13170 BUILT_IN_SYNC_VAL_COMPARE_AND_SWAP_? so that memory order
13173 2021-09-03 Jakub Jelinek <jakub@redhat.com>
13176 * tree.h (DECL_FIELD_ABI_IGNORED): Changed into rvalue only macro
13177 that is false if DECL_BIT_FIELD.
13178 (SET_DECL_FIELD_ABI_IGNORED, DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD,
13179 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD): Define.
13180 * tree-streamer-out.c (pack_ts_decl_common_value_fields): For
13181 DECL_BIT_FIELD stream DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead
13182 of DECL_FIELD_ABI_IGNORED.
13183 * tree-streamer-in.c (unpack_ts_decl_common_value_fields): Use
13184 SET_DECL_FIELD_ABI_IGNORED instead of writing to
13185 DECL_FIELD_ABI_IGNORED and for DECL_BIT_FIELD use
13186 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead.
13187 * lto-streamer-out.c (hash_tree): For DECL_BIT_FIELD hash
13188 DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead of DECL_FIELD_ABI_IGNORED.
13190 2021-09-03 liuhongt <hongtao.liu@intel.com>
13193 * config/i386/amxbf16intrin.h : Remove macro check for __AMX_BF16__.
13194 * config/i386/amxint8intrin.h : Remove macro check for __AMX_INT8__.
13195 * config/i386/amxtileintrin.h : Remove macro check for __AMX_TILE__.
13197 2021-09-02 Martin Sebor <msebor@redhat.com>
13199 PR tree-optimization/17506
13201 * tree-ssa-uninit.c (warn_uninit): Remove conditional guarding note.
13203 2021-09-02 Richard Biener <rguenther@suse.de>
13205 * tree-ssa-loop-im.c (fill_always_executed_in_1): Refine
13206 fix for PR78185 and continue processing when leaving
13207 finite inner loops.
13209 2021-09-02 Jakub Jelinek <jakub@redhat.com>
13211 PR tree-optimization/99591
13212 * match.pd: Demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
13215 2021-09-02 Richard Biener <rguenther@suse.de>
13218 2021-09-02 Richard Biener <rguenther@suse.de>
13220 PR tree-optimization/102155
13221 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
13222 over a part of the RPO array and do not recurse here.
13223 Dump blocks marked as always executed.
13224 (fill_always_executed_in): Walk over the RPO array and
13225 process loops whose header we run into.
13226 (loop_invariant_motion_in_fun): Compute the first RPO
13227 using rev_post_order_and_mark_dfs_back_seme in iteration
13228 order and pass that to fill_always_executed_in.
13230 2021-09-02 liuhongt <hongtao.liu@intel.com>
13232 * config/i386/i386-modes.def (FLOAT_MODE): Define ieee HFmode.
13233 * config/i386/i386.c (enum x86_64_reg_class): Add
13234 X86_64_SSEHF_CLASS.
13235 (merge_classes): Handle X86_64_SSEHF_CLASS.
13236 (examine_argument): Ditto.
13237 (construct_container): Ditto.
13238 (classify_argument): Ditto, and set HFmode/HCmode to
13239 X86_64_SSEHF_CLASS.
13240 (function_value_32): Return _FLoat16/Complex Float16 by
13242 (function_value_64): Return _Float16/Complex Float16 by SSE
13244 (ix86_print_operand): Handle CONST_DOUBLE HFmode.
13245 (ix86_secondary_reload): Require gpr as intermediate register
13246 to store _Float16 from sse register when sse4 is not
13248 (ix86_libgcc_floating_mode_supported_p): Enable _FLoat16 under
13250 (ix86_scalar_mode_supported_p): Ditto.
13251 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Defined.
13252 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add HFmode.
13253 (VALID_INT_MODE_P): Add HFmode and HCmode.
13254 * config/i386/i386.md (*pushhf_rex64): New define_insn.
13256 (*movhf_internal): Ditto.
13257 * doc/extend.texi (Half-Precision Floating Point): Documemt
13260 2021-09-02 Richard Biener <rguenther@suse.de>
13262 PR tree-optimization/102155
13263 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
13264 over a part of the RPO array and do not recurse here.
13265 Dump blocks marked as always executed.
13266 (fill_always_executed_in): Walk over the RPO array and
13267 process loops whose header we run into.
13268 (loop_invariant_motion_in_fun): Compute the first RPO
13269 using rev_post_order_and_mark_dfs_back_seme in iteration
13270 order and pass that to fill_always_executed_in.
13272 2021-09-02 YunQiang Su <syq@debian.org>
13275 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13277 * config/mips/mips.c (mips_module_isa_name): New.
13278 mips_file_start: add .module mipsREV to all asm output
13280 2021-09-01 Jeff Law <jlaw@localhost.localdomain>
13282 PR tree-optimization/102152
13283 * tree-ssa-dom.c (dom_opt_dom_walker::optimize_stmt): Reduce a vector
13284 comparison to a scalar comparison before calling
13285 update_stmt_if_modified.
13287 2021-09-01 Andrew Pinski <apinski@marvell.com>
13290 * config/aarch64/aarch64.c (aarch64_expand_setmem):
13291 Check STRICT_ALIGNMENT before creating an overlapping
13294 2021-09-01 Martin Sebor <msebor@redhat.com>
13296 * gimple-ssa-warn-access.cc (get_size_range): Add argument.
13297 (check_access): Pass additional argument.
13298 (check_memop_access): Remove template and make a member function.
13299 (maybe_check_dealloc_call): Make a pass_waccess member function.
13300 (class pass_waccess): Add, rename, and remove members.
13301 (pass_waccess::pass_waccess): Adjust to name change.
13302 (pass_waccess::~pass_waccess): Same.
13303 (check_alloca): Make a member function.
13304 (check_alloc_size_call): Same.
13305 (check_strcat): Same.
13306 (check_strncat): Same.
13307 (check_stxcpy): Same.
13308 (check_stxncpy): Same.
13309 (check_strncmp): Same.
13310 (maybe_warn_rdwr_sizes): Rename...
13311 (pass_waccess::maybe_check_access_sizes): ...to this.
13312 (pass_waccess::check_call): Adjust to name changes.
13313 (pass_waccess::maybe_check_dealloc_call): Make a pass_waccess member
13315 (pass_waccess::execute): Adjust to name changes.
13316 * gimple-ssa-warn-access.h (check_memop_access): Remove.
13317 * pointer-query.cc (access_ref::phi): Handle null pointer.
13318 (access_ref::inform_access): Same.
13319 (pointer_query::put_ref): Modify a cached value, not a copy of it.
13320 (pointer_query::dump): New function.
13321 (compute_objsize_r): Avoid overwriting access_ref::bndrng. Cache
13323 * pointer-query.h (pointer_query::dump): Declare.
13324 * tree-ssa-strlen.c (get_range): Simplify. Use function query.
13325 (dump_strlen_info): Use function query.
13326 (printf_strlen_execute): Factor code out into pointer_query::put_ref.
13328 2021-09-01 Thomas Schwinge <thomas@codesourcery.com>
13330 * tree.c (walk_tree_1) <OMP_CLAUSE>: Simplify.
13332 2021-09-01 Iain Sandoe <iain@sandoe.co.uk>
13334 * doc/extend.texi: Document unavailable attribute.
13335 * print-tree.c (print_node): Handle unavailable attribute.
13336 * tree-core.h (struct tree_base): Add a bit to carry unavailability.
13337 * tree.c (error_unavailable_use): New.
13338 * tree.h (TREE_UNAVAILABLE): New.
13339 (error_unavailable_use): New.
13341 2021-09-01 Jakub Jelinek <jakub@redhat.com>
13343 PR tree-optimization/102124
13344 * tree-vect-patterns.c (vect_recog_widen_op_pattern): For ORIG_CODE
13345 MINUS_EXPR, if itype is unsigned with smaller precision than type,
13346 add an extra cast to signed variant of itype to ensure sign-extension.
13348 2021-09-01 Martin Liska <mliska@suse.cz>
13350 * graph.c (draw_cfg_node_succ_edges): Do not color fallthru
13351 edges and rather use colors for TRUE and FALSE edges.
13353 2021-09-01 Richard Biener <rguenther@suse.de>
13355 PR tree-optimization/93491
13356 * tree-ssa-pre.c (compute_avail): Set BB_MAY_NOTRETURN
13357 after processing the stmt itself. Do not consider
13358 pure functions possibly not returning. Properly avoid
13359 adding possibly trapping calls to EXP_GEN when there's
13360 a preceeding possibly not returning call.
13361 * tree-ssa-sccvn.c (vn_reference_may_trap): Conservatively
13364 2021-09-01 Richard Biener <rguenther@suse.de>
13366 PR tree-optimization/102139
13367 * tree-vectorizer.h (vec_base_alignments): Adjust hash-map
13368 type to record a std::pair of the stmt-info and the innermost
13370 (dr_vec_info::group): New member.
13371 * tree-vect-data-refs.c (vect_record_base_alignment): Adjust.
13372 (vect_compute_data_ref_alignment): Verify the recorded
13373 base alignment can be used.
13374 (data_ref_pair): Remove.
13375 (dr_group_sort_cmp): Adjust.
13376 (vect_analyze_data_ref_accesses): Store the group-ID in the
13377 dr_vec_info and operate on a vector of dr_vec_infos.
13379 2021-09-01 YunQiang Su <yunqiang.su@cipunited.com>
13381 * read-md.c (md_reader::handle_enum): support value assignation.
13382 * doc/md.texi: record define_c_enum value assignation support.
13384 2021-09-01 Jakub Jelinek <jakub@redhat.com>
13386 PR tree-optimization/102141
13387 * gimple-ssa-store-merging.c (bswap_view_convert): Add BEFORE
13388 argument. If false, emit stmts after gsi instead of before, and
13390 (bswap_replace): Adjust callers. When converting output of bswap,
13391 emit VIEW_CONVERT prepratation stmts after a copy of gsi instead
13394 2021-09-01 liuhongt <hongtao.liu@intel.com>
13396 * emit-rtl.c (validate_subreg): Get rid of all float-int
13399 2021-09-01 liuhongt <hongtao.liu@intel.com>
13402 2021-08-30 liuhongt <hongtao.liu@intel.com>
13404 * expmed.c (extract_bit_field_1): Make sure we're playing with
13405 integral modes before call extract_integral_bit_field.
13406 (extract_integral_bit_field): Add a parameter of type
13407 scalar_int_mode which corresponds to of tmode.
13408 And call extract_and_convert_fixed_bit_field instead of
13409 extract_fixed_bit_field and convert_extracted_bit_field.
13410 (extract_and_convert_fixed_bit_field): New function, it's a
13411 combination of extract_fixed_bit_field and
13412 convert_extracted_bit_field.
13414 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
13416 * tree.c (walk_tree_1) <OMP_CLAUSE_TILE>: Handle three operands.
13418 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
13420 * omp-general.h (omp_is_reference): Rename to...
13421 (omp_privatize_by_reference): ... this. Adjust all users...
13422 * omp-general.c: ... here, ...
13423 * gimplify.c: ... here, ...
13424 * omp-expand.c: ... here, ...
13425 * omp-low.c: ... here.
13427 2021-08-31 Martin Sebor <msebor@redhat.com>
13429 * gimple-ssa-warn-access.cc (maybe_warn_alloc_args_overflow): Test
13430 pointer element for equality to zero, not that of the cotaining
13433 2021-08-31 Martin Sebor <msebor@redhat.com>
13435 * gcc-rich-location.h (gcc_rich_location): Make ctor explicit.
13437 2021-08-31 Martin Sebor <msebor@redhat.com>
13439 * function.h (function): Add comments.
13440 (get_range_query): Same. Add attribute returns nonnull.
13442 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13444 * expr.c (convert_modes): Don't use subreg_promoted_mode on a
13445 SUBREG if it can't be guaranteed to a SUBREG_PROMOTED_VAR_P set.
13446 Instead use the standard (safer) is_a <scalar_int_mode> idiom.
13448 2021-08-31 Jeff Law <jlaw@localhost.localdomain>
13450 * config.gcc (cris-*-elf, cris-*-none): Remove dbxelf.h from
13452 (m32r-*-elf, m32rle-*-elf, m32r-*-linux): Likewise.
13453 (mn10300-*-*, am33_2.0-*-linux*): Likewise.
13454 (xtensa*-*-elf, xtensa*-*-linux, xtensa*-*-uclinux): Likewise.
13455 (m32c-*-elf*, m32c-*-rtems*): Likewise.
13456 * config/cris/cris.h (DBX_NO_XREFS): Remove.
13457 (DBX_CONTIN_LENGTH, DBX_CONTIN_CHAR): Likewise.
13458 * config/m32r/m32r.h (DBXOUT_SOURCE_LINE): Likewise.
13459 (DBX_DEBUGGING_INFO, DBX_CONTIN_LENGTH): Likewise.
13460 * config/mn10300/mn10300.h (DEFAULT_GDB_EXTENSIONS): Likewise.
13461 * config/mn10300/linux.h (DBX_REGISTER_NAMES): Likewise.
13463 2021-08-31 Marcel Vollweiler <marcel@codesourcery.com>
13465 * gimplify.c (gimplify_scan_omp_clauses): Error handling. 'ancestor' only
13466 allowed on target constructs and only with particular other clauses.
13467 * omp-expand.c (expand_omp_target): Output of 'sorry, not supported' if
13468 'ancestor' is used.
13469 * omp-low.c (check_omp_nesting_restrictions): Error handling. No nested OpenMP
13470 structs when 'ancestor' is used.
13471 (scan_omp_1_stmt): No usage of OpenMP runtime routines in a target region when
13472 'ancestor' is used.
13473 * tree-pretty-print.c (dump_omp_clause): Append 'ancestor'.
13474 * tree.h (OMP_CLAUSE_DEVICE_ANCESTOR): Define macro.
13476 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13478 * expr.c (convert_modes): Preserve SUBREG_PROMOTED_VAR_P when
13479 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
13481 * simplify-rtx.c (simplify_unary_operation_1) [SIGN_EXTEND]:
13482 Likewise, preserve SUBREG_PROMOTED_VAR_P when creating a (wider)
13483 partial subreg from a SUBREG_PROMOTED_VAR_P subreg. Generate
13484 SIGN_EXTEND of the SUBREG_REG when a subreg would be paradoxical.
13485 [ZERO_EXTEND]: Likewise, preserve SUBREG_PROMOTED_VAR_P when
13486 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
13487 subreg. Generate ZERO_EXTEND of the SUBREG_REG when a subreg
13488 would be paradoxical.
13490 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13492 * combine.c (combine_simplify_rtx): Avoid converting an explicit
13493 TRUNCATE into a lowpart SUBREG on !TRULY_NOOP_TRUNCATION targets.
13494 * simplify-rtx.c (simplify_unary_operation_1): Likewise.
13496 2021-08-31 Richard Biener <rguenther@suse.de>
13498 PR tree-optimization/102142
13499 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Fix
13500 condition under which to unset the visited flag.
13502 2021-08-31 Richard Biener <rguenther@suse.de>
13504 PR middle-end/102129
13505 * tree-ssa-ter.c (find_replaceable_in_bb): Do not move
13506 possibly trapping expressions across calls.
13508 2021-08-31 Jakub Jelinek <jakub@redhat.com>
13510 PR tree-optimization/102134
13511 * tree-ssa-ccp.c (bit_value_binop) <case RSHIFT_EXPR>: If sgn is
13512 UNSIGNED and r1val | r1mask has MSB set, ensure lzcount doesn't
13515 2021-08-31 Andrew Pinski <apinski@marvell.com>
13518 * collect-utils.c (setup_signals): New declaration.
13519 * collect-utils.h (setup_signals): New function.
13520 * collect2.c (handler): Delete.
13521 (main): Instead of manually setting up the signals,
13522 just call setup_signals.
13523 * lto-wrapper.c (main): Likewise.
13525 2021-08-31 Andrew Pinski <apinski@marvell.com>
13528 * config/i386/i386-protos.h (x86_output_aligned_bss):
13529 Change align argument to unsigned type.
13530 (x86_elf_aligned_decl_common): Likewise.
13531 * config/i386/i386.c (x86_elf_aligned_decl_common): Likewise.
13532 (x86_output_aligned_bss): Likewise.
13534 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13536 * config/mips/mips.c (mips_module_isa_name): New.
13537 mips_file_start: add .module mipsREV to all asm output
13539 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13541 * config/mips/mips.h (struct mips_cpu_info): define enum mips_isa;
13542 use enum instead of int for 'isa' member.
13543 * config.gcc, config/mips/mips.c, config/mips/mips-cpus.def,
13544 config/mips/netbsd.h: replace hardcoded numbers with enum.
13546 2021-08-31 liuhongt <hongtao.liu@intel.com>
13548 * config/i386/sse.md (*<avx512>_ucmp<mode>3_1): Change from
13549 define_split to define_insn_and_split.
13550 (*avx2_eq<mode>3): Removed.
13551 (<avx512>_eq<mode>3<mask_scalar_merge_name>): Adjust pattern
13552 (<avx512>_eq<mode>3<mask_scalar_merge_name>_1): Rename to ..
13553 (*<avx512>_eq<mode>3<mask_scalar_merge_name>_1): .. this, and
13555 (*avx2_gt<mode>3): Removed.
13556 (<avx512>_gt<mode>3<mask_scalar_merge_name>): Change from
13557 define_insn to define_expand, and adjust pattern.
13558 (UNSPEC_MASKED_EQ, UNSPEC_MASKED_GT): Removed.
13560 2021-08-30 David Malcolm <dmalcolm@redhat.com>
13563 * Makefile.in (ANALYZER_OBJS): Add analyzer/call-info.o.
13565 2021-08-30 Jason Merrill <jason@redhat.com>
13567 * doc/invoke.texi: Document -Wmissing-requires.
13569 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13571 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Remove
13572 TARGET_EXTRA_BUILTINS guard.
13574 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13576 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Change
13577 initialization of V2DI_type_node and unsigned_V2DI_type_node.
13579 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13581 * config/rs6000/darwin.h (SUBTARGET_INIT_BUILTINS): Use the new
13582 decl when new_builtins_are_live.
13583 * config/rs6000/rs6000-builtin-new.def (__builtin_cfstring): New
13586 2021-08-30 Pat Haugen <pthaugen@linux.ibm.com>
13588 * config/rs6000/rs6000-cpus.def (ISA_3_1_MASKS_SERVER): Add
13589 OPTION_MASK_P10_FUSION_2STORE.
13590 (POWERPC_MASKS): Likewise.
13591 * config/rs6000/rs6000.c (rs6000_option_override_internal): Enable
13592 store fusion for Power10.
13593 (is_fusable_store): New.
13594 (power10_sched_reorder): Likewise.
13595 (rs6000_sched_reorder): Do Power10 specific reordering.
13596 (rs6000_sched_reorder2): Likewise.
13597 * config/rs6000/rs6000.opt: Add new option.
13599 2021-08-30 Richard Biener <rguenther@suse.de>
13601 PR tree-optimization/102128
13602 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
13603 Move scanning for if-converted scalar code to the caller
13604 and instead delay clearing the visited flag for profitable
13606 (vect_slp_region): Cost all subgraphs before scheduling.
13607 For if-converted BB vectorization scan for scalar COND_EXPRs
13608 and do not vectorize if any found and the cost model is
13611 2021-08-30 Richard Biener <rguenther@suse.de>
13613 * common.opt (fexceptions): Mark
13614 EnabledBy(fnon-call-exceptions).
13615 * doc/invoke.texi (fnon-call-exceptions): Document this
13616 enables -fexceptions.
13618 2021-08-30 Sebastian Huber <sebastian.huber@embedded-brains.de>
13620 * tsystem.h (abort): Define abort() if inhibit_libc is defined and it
13621 is not already defined.
13623 2021-08-30 liuhongt <hongtao.liu@intel.com>
13625 * expmed.c (extract_bit_field_1): Make sure we're playing with
13626 integral modes before call extract_integral_bit_field.
13627 (extract_integral_bit_field): Add a parameter of type
13628 scalar_int_mode which corresponds to of tmode.
13629 And call extract_and_convert_fixed_bit_field instead of
13630 extract_fixed_bit_field and convert_extracted_bit_field.
13631 (extract_and_convert_fixed_bit_field): New function, it's a
13632 combination of extract_fixed_bit_field and
13633 convert_extracted_bit_field.
13635 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
13637 * config/darwin.c (darwin_libc_has_function): Do not run
13638 the checks for x86 or modern Darwin. Make sure that there
13639 is a value set for darwin_macosx_version_min before testing.
13641 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
13643 * config/i386/darwin.h (CLEAR_INSN_CACHE): New.
13645 2021-08-28 Jan Hubicka <hubicka@ucw.cz>
13647 * ipa-modref-tree.h (modref_access_node::merge): Break out
13648 logic combining offsets and logic merging ranges to ...
13649 (modref_access_node::combined_offsets): ... here
13650 (modref_access_node::update2): ... here
13651 (modref_access_node::closer_pair_p): New member function.
13652 (modref_access_node::forced_merge): New member function.
13653 (modre_ref_node::insert): Do merging when table is full.
13655 2021-08-28 YunQiang Su <yunqiang.su@cipunited.com>
13658 * config.gcc: MIPS: use N64 ABI by default if the triple end
13659 with -gnuabi64, which is used by Debian since 2013.
13661 2021-08-28 Alexandre Oliva <oliva@adacore.com>
13663 * ipa-modref.c (analyze_function): Skip debug stmts.
13664 * tree-inline.c (estimate_num_insn): Consider builtins even
13665 without a cgraph_node.
13667 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
13669 * config/h8300/bitfield.md (cstore<mode>4): Remove expander.
13670 * config/h8300/h8300.c (h8300_expand_branch): Remove function.
13671 * config/h8300/h8300-protos.h (h8300_expadn_branch): Remove prototype.
13672 * config/h8300/h8300.md (eqne): New code iterator.
13673 (geultu, geultu_to_c): Similarly.
13674 * config/h8300/testcompare.md (cstore<mode>4): Dummy expander.
13675 (store_c_<mode>, store_c_i_<mode>): New define_insn_and_splits
13676 (cmp<mode>_c): New pattern
13678 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
13680 * tree-ssa-dom.c (reduce_vector_comparison_to_scalar_comparison): New
13682 (dom_opt_dom_walker::optimize_stmt): Use it.
13684 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
13686 * config/darwin.c (finalize_ctors): Add a section-start linker-
13688 (finalize_dtors): Likewise.
13689 * config/darwin.h (MIN_LD64_INIT_TERM_START_LABELS): New.
13691 2021-08-27 Bill Schmidt <wschmidt@linux.ibm.com>
13693 * config/rs6000/rs6000-call.c (rs6000-builtins.h): New #include.
13694 (rs6000_init_builtins): Call rs6000_init_generated_builtins. Skip the
13695 old initialization logic when new builtins are enabled.
13696 * config/rs6000/rs6000-gen-builtins.c (write_decls): Rename
13697 rs6000_autoinit_builtins to rs6000_init_generated_builtins.
13698 (write_init_file): Likewise.
13700 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
13702 * configure.ac (darwin2[[0-9]]* | darwin19*): Alter use of
13703 gcc_GAS_CHECK_FEATURE to remove an extraneous parameter.
13704 (amdgcn-* | gcn-*) Likewise.
13706 2021-08-27 Anthony Sharp <anthonysharp15@gmail.com>
13708 * symbol-summary.h: Added missing template keyword.
13710 2021-08-27 Richard Biener <rguenther@suse.de>
13712 PR tree-optimization/45178
13713 * tree-ssa-dce.c (find_obviously_necessary_stmts): For
13714 infinite loops without exit do not mark control dependent
13715 edges of the latch necessary.
13717 2021-08-27 konglin1 <lingling.kong@intel.com>
13720 * config/i386/sse.md: (<avx512>scattersi<mode>): Add mask operand to
13722 (<avx512>scattersi<mode>): Likewise.
13723 (*avx512f_scattersi<VI48F:mode>): Merge mask operand to set_dest.
13724 (*avx512f_scatterdi<VI48F:mode>): Likewise
13726 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
13728 * config/rs6000/rs6000.c (rs6000_builtin_md_vectorized_function): Add
13729 support for built-in functions MISC_BUILTIN_DIVWE, MISC_BUILTIN_DIVWEU,
13730 MISC_BUILTIN_DIVDE, MISC_BUILTIN_DIVDEU, P10_BUILTIN_CFUGED,
13731 P10_BUILTIN_CNTLZDM, P10_BUILTIN_CNTTZDM, P10_BUILTIN_PDEPD and
13732 P10_BUILTIN_PEXTD on Power10.
13734 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
13736 * config/rs6000/rs6000-call.c (builtin_function_type): Add unsigned
13737 signedness for some Power10 bifs.
13739 2021-08-27 David Edelsohn <dje.gcc@gmail.com>
13742 * config/rs6000/rs6000.c (rs6000_adjust_field_align): Use
13743 computed alignment if the entire struct has attribute packed.
13745 2021-08-27 liuhongt <hongtao.liu@intel.com>
13749 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
13750 IX86_BUILTIN_SHUFPD512, IX86_BUILTIN_SHUFPS512,
13751 IX86_BUILTIN_SHUFPD256, IX86_BUILTIN_SHUFPS,
13752 IX86_BUILTIN_SHUFPS256.
13753 (ix86_masked_all_ones): New function.
13755 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
13757 * config/i386/i386.md (*btr<mode>_1): Call force_reg unconditionally.
13758 (conditional moves with memory inputs splitters): Ditto.
13759 * config/i386/sse.md (one_cmpl<mode>2): Simplify.
13761 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13763 * ipa-modref-tree.h (modref_access_node::try_merge_with): Restart
13764 search after merging.
13766 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13768 * config/rs6000/rs6000-overload.def: Add remaining overloads.
13770 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13772 * config/rs6000/rs6000-builtin-new.def: Add cell stanza.
13774 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13776 * config/rs6000/rs6000-builtin-new.def: Add ieee128-hw, dfp,
13777 crypto, and htm stanzas.
13779 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13781 * config/rs6000/rs6000-builtin-new.def: Add mma stanza.
13783 2021-08-26 Martin Sebor <msebor@redhat.com>
13785 * tree-ssa-uninit.c (warn_uninit): Refactor and simplify.
13786 (warn_uninit_phi_uses): Remove argument from calls to warn_uninit.
13787 (warn_uninitialized_vars): Same. Reduce visibility of locals.
13788 (warn_uninitialized_phi): Same.
13790 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
13792 * tree-ssa-ccp.c (get_individual_bits): Helper function to
13793 extract the individual bits from a widest_int constant (mask).
13794 (gray_code_bit_flips): New read-only table for effiently
13795 enumerating permutations/combinations of bits.
13796 (bit_value_binop) [LROTATE_EXPR, RROTATE_EXPR]: Handle rotates
13797 by unknown counts that are guaranteed less than the target
13798 precision and four or fewer unknown bits by enumeration.
13799 [LSHIFT_EXPR, RSHIFT_EXPR]: Likewise, also handle shifts by
13800 enumeration under the same conditions. Handle remaining
13801 shifts as a mask based upon the minimum possible shift value.
13803 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
13804 Richard Biener <rguenther@suse.de>
13806 * match.pd (shift transformations): Remove a redundant
13807 !POINTER_TYPE_P check.
13809 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
13812 * config/i386/i386.md (cmove reg-reg move elimination peephole2s):
13813 Set all_regs to true in the call to replace_rtx.
13815 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13817 * ipa-modref-tree.c (test_insert_search_collapse): Update test.
13818 * ipa-modref-tree.h (modref_base_node::insert): Be smarter when
13819 hiting --param modref-max-refs limit.
13820 (modref_tree:insert_base): Be smarter when hitting
13821 --param modref-max-bases limit. Add new parameter REF.
13822 (modref_tree:insert): Update.
13823 (modref_tree:merge): Update.
13824 * ipa-modref.c (read_modref_records): Update.
13826 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13828 * params.opt: (modref-max-adjustments): Add full stop.
13830 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13832 * ipa-modref-tree.h (modref_ref_node::verify): New member
13834 (modref_ref_node::insert): Use it.
13835 (modref_ref_node::try_mere_with): Fix off by one error.
13837 2021-08-26 Martin Liska <mliska@suse.cz>
13838 Stefan Kneifel <stefan.kneifel@bluewin.ch>
13840 * cgraph.h (create_version_clone_with_body): Add new parameter.
13841 * cgraphclones.c: Likewise.
13842 * multiple_target.c (create_dispatcher_calls): Do not use
13844 (create_target_clone): Likewise here.
13846 2021-08-26 Jonathan Yong <10walls@gmail.com>
13848 * doc/extend.texi: Add note about reserved priorities
13849 to the constructor attribute.
13851 2021-08-25 Martin Sebor <msebor@redhat.com>
13853 * gimple-range-cache.cc (ssa_global_cache::dump): Avoid printing
13854 range table header alone.
13855 * gimple-range.cc (gimple_ranger::export_global_ranges): Same.
13857 2021-08-25 Jan Hubicka <hubicka@ucw.cz>
13859 * doc/invoke.texi: Document --param modref-max-adjustments.
13860 * ipa-modref-tree.c (test_insert_search_collapse): Update.
13861 (test_merge): Update.
13862 * ipa-modref-tree.h (struct modref_access_node): Add adjustments;
13863 (modref_access_node::operator==): Fix handling of access ranges.
13864 (modref_access_node::contains): Constify parameter; handle also
13865 mismatched parm offsets.
13866 (modref_access_node::update): New function.
13867 (modref_access_node::merge): New function.
13868 (unspecified_modref_access_node): Update constructor.
13869 (modref_ref_node::insert_access): Add record_adjustments parameter;
13871 (modref_ref_node::try_merge_with): New private function.
13872 (modref_tree::insert): New record_adjustments parameter.
13873 (modref_tree::merge): New record_adjustments parameter.
13874 (modref_tree::copy_from): Update.
13875 * ipa-modref.c (dump_access): Dump adjustments field.
13876 (get_access): Update constructor.
13877 (record_access): Update call of insert.
13878 (record_access_lto): Update call of insert.
13879 (merge_call_side_effects): Add record_adjustments parameter.
13880 (get_access_for_fnspec): Update.
13881 (process_fnspec): Update.
13882 (analyze_call): Update.
13883 (analyze_function): Update.
13884 (read_modref_records): Update.
13885 (ipa_merge_modref_summary_after_inlining): Update.
13886 (propagate_unknown_call): Update.
13887 (modref_propagate_in_scc): Update.
13888 * params.opt (param-max-modref-adjustments=): New.
13890 2021-08-25 Michael Meissner <meissner@linux.ibm.com>
13892 * config/rs6000/vsx.md (UNSPEC_XXSPLTIDP): Rename from
13894 (xxspltiw_v4si): Use vecperm type attribute.
13895 (xxspltiw_v4si_inst): Use vecperm type attribute.
13896 (xxspltiw_v4sf_inst): Likewise.
13897 (xxspltidp_v2df): Use vecperm type attribute. Use
13898 UNSPEC_XXSPLTIDP instead of UNSPEC_XXSPLTID.
13899 (xxspltidp_v2df_inst): Likewise.
13900 (xxsplti32dx_v4si): Use vecperm type attribute.
13901 (xxsplti32dx_v4si_inst): Likewise.
13902 (xxsplti32dx_v4sf_inst): Likewise.
13903 (xxblend_<mode>): Likewise.
13904 (xxpermx): Likewise.
13905 (xxpermx_inst): Likewise.
13906 (xxeval): Likewise.
13908 2021-08-25 Lewis Hyatt <lhyatt@gmail.com>
13911 * coretypes.h (typedef diagnostic_input_charset_callback): Declare.
13912 * diagnostic.c (diagnostic_initialize_input_context): New function.
13913 * diagnostic.h (diagnostic_initialize_input_context): Declare.
13914 * input.c (default_charset_callback): New function.
13915 (file_cache::initialize_input_context): New function.
13916 (file_cache_slot::create): Added ability to convert the input
13917 according to the input context.
13918 (file_cache::file_cache): Initialize the new input context.
13919 (class file_cache_slot): Added new m_alloc_offset member.
13920 (file_cache_slot::file_cache_slot): Initialize the new member.
13921 (file_cache_slot::~file_cache_slot): Handle potentially offset buffer.
13922 (file_cache_slot::maybe_grow): Likewise.
13923 (file_cache_slot::needs_read_p): Handle NULL fp, which is now possible.
13924 (file_cache_slot::get_next_line): Likewise.
13925 * input.h (class file_cache): Added input context member.
13927 2021-08-25 Richard Biener <rguenther@suse.de>
13929 PR tree-optimization/102046
13930 * tree-vect-slp.c (vect_build_slp_tree_2): Conservatively
13931 update ->any_pattern when swapping operands.
13933 2021-08-25 Hongyu Wang <hongyu.wang@intel.com>
13936 * config/i386/i386.c (ix86_live_on_entry): Adjust comment.
13937 (ix86_decompose_address): Remove retval check for ASHIFT,
13938 allow non-canonical zero extend if AND mask covers ASHIFT
13940 (ix86_legitimate_address_p): Adjust condition for decompose.
13941 (ix86_rtx_costs): Adjust cost for lea with non-canonical
13943 Co-Authored by: Uros Bizjak <ubizjak@gmail.com>
13945 2021-08-25 Jiufu Guo <guojiufu@linux.ibm.com>
13947 PR tree-optimization/101145
13948 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
13950 (number_of_iterations_lt): Invoke above function.
13951 (adjust_cond_for_loop_until_wrap):
13952 Merge to number_of_iterations_until_wrap.
13953 (number_of_iterations_cond): Update invokes for
13954 adjust_cond_for_loop_until_wrap and number_of_iterations_lt.
13956 2021-08-25 konglin1 <lingling.kong@intel.com>
13959 * config/i386/avx512dqintrin.h (_mm512_fpclass_ps_mask): Fix
13960 macro define in O0.
13961 (_mm512_mask_fpclass_ps_mask): Ditto.
13963 2021-08-25 Kewen Lin <linkw@linux.ibm.com>
13965 * config/rs6000/altivec.md (vec_unpacku_hi_v16qi): Remove.
13966 (vec_unpacku_hi_v8hi): Likewise.
13967 (vec_unpacku_lo_v16qi): Likewise.
13968 (vec_unpacku_lo_v8hi): Likewise.
13969 (vec_unpacku_hi_<VP_small_lc>): New define_expand.
13970 (vec_unpacku_lo_<VP_small_lc>): Likewise.
13972 2021-08-24 David Edelsohn <dje.gcc@gmail.com>
13974 * config/rs6000/aix.h (SYSTEM_IMPLICIT_EXTERN_C): Delete.
13975 * config/rs6000/aix71.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
13976 * config/rs6000/aix72.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
13977 * config/rs6000/aix73.h (TARGET_AIX_VERSION): Increase to 73.
13979 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13981 PR middle-end/102031
13982 * simplify-rtx.c (simplify_truncation): When comparing precisions
13983 use "subreg_prec" variable, not "subreg_mode".
13985 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13987 * config/rs6000/rs6000-builtin-new.def: Add power10 and power10-64
13990 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13992 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Initialize
13993 various pointer type nodes.
13994 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add enum
13995 values for various pointer types.
13996 (ptr_V16QI_type_node): New macro.
13997 (ptr_V1TI_type_node): New macro.
13998 (ptr_V2DI_type_node): New macro.
13999 (ptr_V2DF_type_node): New macro.
14000 (ptr_V4SI_type_node): New macro.
14001 (ptr_V4SF_type_node): New macro.
14002 (ptr_V8HI_type_node): New macro.
14003 (ptr_unsigned_V16QI_type_node): New macro.
14004 (ptr_unsigned_V1TI_type_node): New macro.
14005 (ptr_unsigned_V8HI_type_node): New macro.
14006 (ptr_unsigned_V4SI_type_node): New macro.
14007 (ptr_unsigned_V2DI_type_node): New macro.
14008 (ptr_bool_V16QI_type_node): New macro.
14009 (ptr_bool_V8HI_type_node): New macro.
14010 (ptr_bool_V4SI_type_node): New macro.
14011 (ptr_bool_V2DI_type_node): New macro.
14012 (ptr_bool_V1TI_type_node): New macro.
14013 (ptr_pixel_type_node): New macro.
14014 (ptr_intQI_type_node): New macro.
14015 (ptr_uintQI_type_node): New macro.
14016 (ptr_intHI_type_node): New macro.
14017 (ptr_uintHI_type_node): New macro.
14018 (ptr_intSI_type_node): New macro.
14019 (ptr_uintSI_type_node): New macro.
14020 (ptr_intDI_type_node): New macro.
14021 (ptr_uintDI_type_node): New macro.
14022 (ptr_intTI_type_node): New macro.
14023 (ptr_uintTI_type_node): New macro.
14024 (ptr_long_integer_type_node): New macro.
14025 (ptr_long_unsigned_type_node): New macro.
14026 (ptr_float_type_node): New macro.
14027 (ptr_double_type_node): New macro.
14028 (ptr_long_double_type_node): New macro.
14029 (ptr_dfloat64_type_node): New macro.
14030 (ptr_dfloat128_type_node): New macro.
14031 (ptr_ieee128_type_node): New macro.
14032 (ptr_ibm128_type_node): New macro.
14033 (ptr_vector_pair_type_node): New macro.
14034 (ptr_vector_quad_type_node): New macro.
14035 (ptr_long_long_integer_type_node): New macro.
14036 (ptr_long_long_unsigned_type_node): New macro.
14038 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14040 * config/rs6000/rs6000-builtin-new.def: Add power9-vector, power9,
14041 and power9-64 stanzas.
14043 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14044 Tom de Vries <tdevries@suse.de>
14046 * config.gcc (nvptx-*-*): Define {c,c++}_target_objs.
14047 * config/nvptx/nvptx-protos.h (nvptx_cpu_cpp_builtins): Prototype.
14048 * config/nvptx/nvptx.h (TARGET_CPU_CPP_BUILTINS): Implement with
14049 a call to the new nvptx_cpu_cpp_builtins function in nvptx-c.c.
14050 * config/nvptx/t-nvptx (nvptx-c.o): New rule.
14051 * config/nvptx/nvptx-c.c: New source file.
14052 (nvptx_cpu_cpp_builtins): Move implementation here.
14054 2021-08-24 Martin Sebor <msebor@redhat.com>
14056 PR middle-end/101600
14057 PR middle-end/101977
14058 * gimple-ssa-warn-access.cc (maybe_warn_for_bound): Tighten up
14059 the phrasing of a warning.
14060 (check_access): Use the remaining size after subtracting any offset
14061 rather than the whole object size.
14062 * pointer-query.cc (access_ref::get_ref): Clear BASE0 flag if it's
14063 clear for any nonnull PHI argument.
14064 (compute_objsize): Clear argument.
14066 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14068 * config/rs6000/rs6000-builtin-new.def: Add power8-vector stanza.
14070 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
14072 * config/rs6000/rs6000-builtin-new.def: Add power7 and power7-64
14075 2021-08-24 Andrew MacLeod <amacleod@redhat.com>
14077 * value-relation.cc (rr_transitive_table): New.
14078 (relation_transitive): New.
14079 (value_relation::swap): Remove.
14080 (value_relation::apply_transitive): New.
14081 (relation_oracle::relation_oracle): Allocate a new tmp bitmap.
14082 (relation_oracle::register_relation): Call register_transitives.
14083 (relation_oracle::register_transitives): New.
14084 * value-relation.h (relation_oracle): Add new temporary bitmap and
14087 2021-08-24 H.J. Lu <hjl.tools@gmail.com>
14090 * config/i386/i386-expand.c (ix86_expand_vector_move): Broadcast
14091 from integer to a pseudo vector register.
14093 2021-08-24 Richard Biener <rguenther@suse.de>
14095 PR tree-optimization/100089
14096 * tree-vectorizer.h (vect_slp_bb): Rename to ...
14097 (vect_slp_if_converted_bb): ... this and get the original
14098 loop as new argument.
14099 * tree-vectorizer.c (try_vectorize_loop_1): Revert previous fix,
14100 pass original loop to vect_slp_if_converted_bb.
14101 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
14102 If orig_loop was passed scan the not vectorized stmts
14103 for COND_EXPRs and force not profitable if found.
14104 (vect_slp_region): Pass down all SLP instances to costing
14105 if orig_loop was specified.
14106 (vect_slp_bbs): Pass through orig_loop.
14107 (vect_slp_bb): Rename to ...
14108 (vect_slp_if_converted_bb): ... this and get the original
14109 loop as new argument.
14110 (vect_slp_function): Adjust.
14112 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14115 * config/arm/arm.md (attribute arch): Add fix_vlldm.
14116 (arch_enabled): Use it.
14117 * config/arm/vfp.md (lazy_store_multiple_insn): Add alternative to
14118 use when erratum mitigation is needed.
14120 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14123 * config/arm/arm.opt (mfix-cmse-cve-2021-35465): New option.
14124 * doc/invoke.texi (Arm Options): Document it.
14125 * config/arm/arm-cpus.in (quirk_vlldm): New feature bit.
14126 (ALL_QUIRKS): Add quirk_vlldm.
14127 (cortex-m33): Add quirk_vlldm.
14128 (cortex-m35p, cortex-m55): Likewise.
14129 * config/arm/arm.c (arm_option_override): Enable fix_vlldm if
14130 targetting an affected CPU and not explicitly controlled on
14133 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14135 * config/arm/vfp.md (lazy_store_multiple_insn): Rewrite as valid RTL.
14136 (lazy_load_multiple_insn): Likewise.
14138 2021-08-24 liuhongt <hongtao.liu@intel.com>
14141 * config/i386/sse.md (<avx512>_vternlog<mode><sd_maskz_name>):
14142 Enable avx512 embedded broadcast.
14143 (*<avx512>_vternlog<mode>_all): Ditto.
14144 (<avx512>_vternlog<mode>_mask): Ditto.
14146 2021-08-24 liuhongt <hongtao.liu@intel.com>
14149 * config/i386/i386.c (ix86_rtx_costs): Define cost for
14151 * config/i386/i386.h (STRIP_UNARY): New macro.
14152 * config/i386/predicates.md (reg_or_notreg_operand): New
14154 * config/i386/sse.md (*<avx512>_vternlog<mode>_all): New define_insn.
14155 (*<avx512>_vternlog<mode>_1): New pre_reload
14156 define_insn_and_split.
14157 (*<avx512>_vternlog<mode>_2): Ditto.
14158 (*<avx512>_vternlog<mode>_3): Ditto.
14159 (any_logic1,any_logic2): New code iterator.
14160 (logic_op): New code attribute.
14161 (ternlogsuffix): Extend to VNxDF and VNxSF.
14163 2021-08-24 Richard Biener <rguenther@suse.de>
14165 * doc/invoke.texi (vect-inner-loop-cost-factor): Adjust.
14166 * params.opt (--param vect-inner-loop-cost-factor): Adjust
14168 * tree-vect-loop.c (vect_analyze_loop_form): Initialize
14169 inner_loop_cost_factor to the minimum of the estimated number
14170 of iterations of the inner loop and vect-inner-loop-cost-factor.
14172 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14173 Richard Biener <rguenther@suse.de>
14175 * config/i386/i386-features.c (compute_convert_gain): Provide
14176 more accurate values for CONST_INT, when optimizing for size.
14177 * config/i386/i386.c (COSTS_N_BYTES): Move definition from here...
14178 * config/i386/i386.h (COSTS_N_BYTES): to here.
14180 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14181 Jakub Jelinek <jakub@redhat.com>
14183 PR middle-end/102029
14184 * match.pd (shift transformations): Add an additional check for
14185 !POINTER_TYPE_P in the recently added left shift transformation.
14187 2021-08-24 liuhongt <hongtao.liu@intel.com>
14189 PR tree-optimization/100089
14190 * tree-vectorizer.c (try_vectorize_loop_1): Disable slp in
14191 loop vectorizer when cost model is very-cheap.
14193 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14195 * config/rs6000/rs6000-gen-builtins.c (parse_bif_entry): Don't call
14196 asprintf, which is not available on AIX.
14198 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14200 * config.gcc (target_gtfiles): Add ./rs6000-builtins.h.
14201 * config/rs6000/t-rs6000 (EXTRA_GTYPE_DEPS): Set.
14203 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14205 * config.gcc (powerpc*-*-*): Add rs6000-builtins.o to extra_objs.
14206 * config/rs6000/rs6000-gen-builtins.c (main): Close init_file
14208 * config/rs6000/t-rs6000 (rs6000-gen-builtins.o): New target.
14209 (rbtree.o): Likewise.
14210 (rs6000-gen-builtins): Likewise.
14211 (rs6000-builtins.c): Likewise.
14212 (rs6000-builtins.h): Likewise.
14213 (rs6000.o): Add dependency.
14214 (EXTRA_HEADERS): Add rs6000-vecdefines.h.
14215 (rs6000-vecdefines.h): New target.
14216 (rs6000-builtins.o): Likewise.
14217 (rs6000-call.o): Add rs6000-builtins.h as a dependency.
14218 (rs6000-c.o): Likewise.
14220 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14223 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace):
14224 Diagnose buffer overrun.
14225 (safe_inc_pos): Fix overrun detection.
14226 (match_identifier): Diagnose buffer overrun.
14227 (match_integer): Likewise.
14228 (match_to_right_bracket): Likewise.
14230 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14232 * ipa-modref-tree.h (modref_access_node::range_info_useful_p):
14233 Improve range compare.
14234 (modref_access_node::contains): New member function.
14235 (modref_access_node::search): Remove.
14236 (modref_access_node::insert): Be smarter about subaccesses.
14238 2021-08-23 Thomas Schwinge <thomas@codesourcery.com>
14240 * config/i386/i386-options.c (ix86_omp_device_kind_arch_isa)
14241 <omp_device_arch> [ACCEL_COMPILER]: Match "intel_mic".
14242 * config/i386/t-omp-device (omp-device-properties-i386) <arch>:
14245 2021-08-23 Jeff Law <jlaw@localhost.localdomain>
14247 * config/h8300/h8300-protos.h (h8300_expand_epilogue): Add new
14249 * config/h8300/jumpcall.md (call, call_value): Restrict to
14250 !SIBLING_CALL_P cases.
14251 (subcall, sibcall_value): New patterns & expanders.
14252 * config/h8300/proepi.md (epilogue): Pass new argument to
14253 h8300_expand_epilogue.
14254 (sibcall_epilogue): New expander.
14255 * config/h8300/h8300.c (h8300_expand_epilogue): Handle sibcall
14257 (h8300_ok_for_sibcall_p): New function.
14258 (TARGET_FUNCTION_OK_FOR_SIBCALL): define.
14260 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14262 * simplify-rtx.c (simplify_unary_operation_1): [TRUNCATE]:
14263 Handle case where the operand is already the desired mode.
14265 2021-08-23 Richard Biener <rguenther@suse.de>
14268 * tree-ssa-structalias.c (ipa_pta_execute): Check in_other_partition
14269 in addition to has_gimple_body.
14271 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14273 PR middle-end/101949
14274 * ipa-modref.c (analyze_ssa_name_flags): Fix merging of
14277 2021-08-23 Martin Liska <mliska@suse.cz>
14279 * doc/invoke.texi: Put the option out of -mxl-mode-app-model
14282 2021-08-23 Richard Biener <rguenther@suse.de>
14284 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
14285 Properly scale the inner loop cost only once.
14287 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14289 * tree-ssa-ccp.c (bit_value_binop) [TRUNC_MOD_EXPR, TRUNC_DIV_EXPR]:
14290 Provide bounds for unsigned (and signed with non-negative operands)
14291 division and modulus.
14293 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14295 * simplify-rtx.c (simplify_truncation): Generalize simplification
14296 of (truncate:A (subreg:B X)).
14297 (simplify_unary_operation_1) [FLOAT_TRUNCATE, FLOAT_EXTEND,
14298 SIGN_EXTEND, ZERO_EXTEND]: Handle cases where the operand
14299 already has the desired machine mode.
14300 (test_scalar_int_ops): Add tests that useless extensions and
14301 truncations are optimized away.
14302 (test_scalar_int_ext_ops): New self-test function to confirm
14303 that truncations of extensions are correctly simplified.
14304 (test_scalar_int_ext_ops2): New self-test function to check
14305 truncations of truncations, extensions of extensions, and
14306 truncations of extensions.
14307 (test_scalar_ops): Call the above two functions with a
14308 representative sampling of integer machine modes.
14310 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14312 * match.pd (shift transformations): Change the sign of an
14313 LSHIFT_EXPR if it reduces the number of explicit conversions.
14315 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14317 PR tree-optimization/86723
14318 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Add
14319 cast64_to_32 argument, set *cast64_to_32 to false, unless n is
14320 non-memory permutation of 64-bit src which only has bytes of
14321 0 or [5..8] and n->range is 4.
14322 (find_bswap_or_nop): Add cast64_to_32 and mask arguments, adjust
14323 find_bswap_or_nop_finalize caller, support bswap with some bytes
14324 zeroed, as long as at least two bytes are not zeroed.
14325 (bswap_replace): Add mask argument and handle masking of bswap
14327 (maybe_optimize_vector_constructor): Adjust find_bswap_or_nop
14328 caller, punt if cast64_to_32 or mask is not all ones.
14329 (pass_optimize_bswap::execute): Adjust find_bswap_or_nop_finalize
14330 caller, for now punt if cast64_to_32.
14332 2021-08-23 Richard Biener <rguenther@suse.de>
14334 PR tree-optimization/79334
14335 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Record
14336 a type also for COMPONENT_REFs.
14337 (vn_reference_may_trap): Check ARRAY_REF with constant index
14338 against the array domain.
14340 2021-08-23 liuhongt <hongtao.liu@intel.com>
14343 * config/i386/sse.md (*avx512f_pshufb_truncv8hiv8qi_1): Add
14344 TARGET_AVX512BW to condition.
14346 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14349 * dwarf2out.c (gen_variable_die): Add DW_AT_location for global
14350 register variables already during early_dwarf if possible.
14352 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
14354 * config/arm/arm_mve.h: Fix __arm_vctp16q return type.
14356 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
14359 * config/arm/arm.opt: Fix typo.
14360 * config/arm/t-rmprofile: Fix typo.
14362 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14364 * tree.h (OMP_CLAUSE_GRAINSIZE_STRICT): Define.
14365 (OMP_CLAUSE_NUM_TASKS_STRICT): Define.
14366 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_GRAINSIZE,
14367 case OMP_CLAUSE_NUM_TASKS>: Print strict: modifier.
14368 * omp-expand.c (expand_task_call): Use GOMP_TASK_FLAG_STRICT in iflags
14369 if either grainsize or num_tasks clause has the strict modifier.
14371 2021-08-23 Martin Liska <mliska@suse.cz>
14373 * dbgcnt.def (DEBUG_COUNTER): New counter.
14374 * gimple.c (gimple_call_arg_flags): Use it in IPA PTA.
14376 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14378 * ipa-modref.c (analyze_ssa_name_flags): Improve handling of return slot.
14380 2021-08-23 Xi Ruoyao <xry111@mengyan1223.wang>
14383 * config/mips/mips-protos.h (mips_msa_output_shift_immediate):
14385 * config/mips/mips.c (mips_msa_output_shift_immediate): New
14387 * config/mips/mips-msa.md (vashl<mode>3, vashr<mode>3,
14388 vlshr<mode>3): Call it.
14390 2021-08-22 Jan Hubicka <hubicka@ucw.cz>
14391 Martin Liska <mliska@suse.cz>
14393 PR middle-end/101949
14394 * ipa-modref.c (analyze_ssa_name_flags): Indirect call implies
14397 2021-08-21 Dragan Mladjenovic <OT_Dragan.Mladjenovic@mediatek.com>
14399 * config/mips/mips.c (mips_function_rodata_section,
14400 TARGET_ASM_FUNCTION_RODATA_SECTION): Removed.
14402 2021-08-21 John David Anglin <danglin@gcc.gnu.org>
14404 * config/pa/pa.c (pa_asm_output_aligned_common): Remove warning.
14406 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14408 * configure.ac (thread-local storage support): Remove tls_first_major
14409 and tls_first_minor. Use "$conftest_s" to check support.
14410 * configure: Regenerate.
14412 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14414 * configure.ac: Fixup formatting.
14416 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14418 * acinclude.m4 (gcc_GAS_CHECK_FEATURE): Remove third argument and ...
14419 * configure.ac: ... update all callers.
14421 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14424 * acinclude.m4 (_gcc_COMPUTE_GAS_VERSION, _gcc_GAS_VERSION_GTE_IFELSE)
14425 (gcc_GAS_VERSION_GTE_IFELSE): Remove.
14426 (gcc_GAS_CHECK_FEATURE): Do not handle in-tree case specially.
14427 * configure.ac: Remove gcc_cv_gas_major_version, gcc_cv_gas_minor_version.
14428 Remove remaining checks for in-tree assembler.
14429 * configure: Regenerate.
14431 2021-08-20 Jeff Law <jlaw@localhost.localdomain>
14433 * config/h8300/h8300.c (shift_alg_hi): Improve arithmetic shift right
14434 by 15 bits for H8/300H and H8/S. Improve logical shifts by 12
14436 (shift_alg_si): Improve arithmetic right shift by 28-30 bits for
14437 H8/300H. Improve arithmetic shift right by 15 bits for H8/S.
14438 Improve logical shifts by 27 bits for H8/S.
14439 (get_shift_alg): Corresponding changes.
14440 (h8300_option_override): Revert to loops for -Os when profitable.
14442 2021-08-20 Richard Biener <rguenther@suse.de>
14444 * tree-vect-data-refs.c (dr_group_sort_cmp): Do not compare
14446 (vect_analyze_data_ref_accesses): Likewise. Assign the BB
14447 index as group_id when dataref_groups were not computed.
14448 * tree-vect-slp.c (vect_slp_bbs): Bump current_group when
14449 we advace to the next BB.
14451 2021-08-20 Jakub Jelinek <jakub@redhat.com>
14453 * omp-builtins.def (BUILT_IN_GOMP_WARNING, BUILT_IN_GOMP_ERROR): New
14456 2021-08-20 Martin Liska <mliska@suse.cz>
14458 PR gcov-profile/89961
14459 * gcov.c (make_gcov_file_name): Rewrite using std::string.
14460 (mangle_name): Simplify, do not used the second argument.
14461 (strip_extention): New function.
14462 (get_md5sum): Likewise.
14463 (get_gcov_intermediate_filename): Handle properly -p and -x
14465 (output_gcov_file): Use string type.
14466 (generate_results): Likewise.
14467 (md5sum_to_hex): Remove.
14469 2021-08-20 Michael Meissner <meissner@linux.ibm.com>
14471 * config/rs6000/altivec.md (UNSPEC_XXEVAL): Move to vsx.md.
14472 (UNSPEC_XXSPLTIW): Move to vsx.md.
14473 (UNSPEC_XXSPLTID): Move to vsx.md.
14474 (UNSPEC_XXSPLTI32DX): Move to vsx.md.
14475 (UNSPEC_XXBLEND): Move to vsx.md.
14476 (UNSPEC_XXPERMX): Move to vsx.md.
14477 (VM3): Move to vsx.md.
14478 (VM3_char): Move to vsx.md.
14479 (xxspltiw_v4si): Move to vsx.md.
14480 (xxspltiw_v4sf): Move to vsx.md.
14481 (xxspltiw_v4sf_inst): Move to vsx.md.
14482 (xxspltidp_v2df): Move to vsx.md.
14483 (xxspltidp_v2df_inst): Move to vsx.md.
14484 (xxsplti32dx_v4si_inst): Move to vsx.md.
14485 (xxsplti32dx_v4sf): Move to vsx.md.
14486 (xxsplti32dx_v4sf_inst): Move to vsx.md.
14487 (xxblend_<mode>): Move to vsx.md.
14488 (xxpermx): Move to vsx.md.
14489 (xxpermx_inst): Move to vsx.md.
14490 * config/rs6000/vsx.md (UNSPEC_XXEVAL): Move from altivec.md.
14491 (UNSPEC_XXSPLTIW): Move from altivec.md.
14492 (UNSPEC_XXSPLTID): Move from altivec.md.
14493 (UNSPEC_XXSPLTI32DX): Move from altivec.md.
14494 (UNSPEC_XXBLEND): Move from altivec.md.
14495 (UNSPEC_XXPERMX): Move from altivec.md.
14496 (VM3): Move from altivec.md.
14497 (VM3_char): Move from altivec.md.
14498 (xxspltiw_v4si): Move from altivec.md.
14499 (xxspltiw_v4sf): Move from altivec.md.
14500 (xxspltiw_v4sf_inst): Move from altivec.md.
14501 (xxspltidp_v2df): Move from altivec.md.
14502 (xxspltidp_v2df_inst): Move from altivec.md.
14503 (xxsplti32dx_v4si_inst): Move from altivec.md.
14504 (xxsplti32dx_v4sf): Move from altivec.md.
14505 (xxsplti32dx_v4sf_inst): Move from altivec.md.
14506 (xxblend_<mode>): Move from altivec.md.
14507 (xxpermx): Move from altivec.md.
14508 (xxpermx_inst): Move from altivec.md.
14510 2021-08-19 Roger Sayle <roger@nextmovesoftware.com>
14512 * tree-vect-generic.c (expand_vector_operations_1): Use either
14513 gimplify_build1 or gimplify_build2 instead of gimple_build_assign
14514 when constructing scalar splat expressions.
14516 2021-08-19 Peter Bergner <bergner@linux.ibm.com>
14519 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Cast
14520 pointer to __vector_pair *.
14522 2021-08-19 Martin Sebor <msebor@redhat.com>
14524 * gimple-range.cc: Add comments.
14525 * gimple-range.h: Same.
14527 2021-08-19 Martin Sebor <msebor@redhat.com>
14529 PR middle-end/101984
14530 * gimple-ssa-warn-access.cc (pass_waccess::execute): Also call
14533 2021-08-19 Jeff Law <jlaw@localhost.localdomain>
14535 * config.gcc (h8300-*-elf*): Do not include dbxelf.h.
14536 (h8300-*-linux*, v850-*-rtems*, v850*-elf*): Likewise.
14537 * config/v850/v850.h (DEFAULT_GDB_EXTENSIONS): Remove.
14539 2021-08-19 Jakub Jelinek <jakub@redhat.com>
14541 PR middle-end/101950
14542 * optabs.c (expand_clrsb_using_clz): New function.
14543 (expand_unop): Use it as another clrsb expansion fallback.
14545 2021-08-19 liuhongt <hongtao.liu@intel.com>
14548 2021-07-28 liuhongt <hongtao.liu@intel.com>
14551 * config/i386/i386.h (processor_costs): Add new member
14553 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
14554 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
14555 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
14556 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
14557 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
14558 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
14559 generic_cost, core_cost): Initialize integer_to_sse same value
14561 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
14562 * config/i386/i386.c (ix86_builtin_vectorization_cost):
14563 Use integer_to_sse instead of sse_op to calculate the cost of
14566 2021-08-18 Iain Sandoe <iain@sandoe.co.uk>
14568 * config.gcc: Include rpath.opt for Darwin.
14569 * config/darwin.h (DRIVER_SELF_SPECS): Handle -rpath.
14571 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14573 PR bootstrap/101959
14574 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
14577 2021-08-18 Jonathan Wright <jonathan.wright@arm.com>
14579 * config/aarch64/arm_neon.h (vld3_lane_f64): Use float RTL
14580 pattern and type cast.
14581 (vld4_lane_f32): Use float RTL pattern.
14582 (vld4q_lane_f64): Use float type cast.
14584 2021-08-18 Jan Hubicka <hubicka@ucw.cz>
14586 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Check also
14589 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14591 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor): Extend.
14592 (test_map_of_type_with_ctor_and_dtor_expand): Add function.
14593 (hash_map_tests_c_tests): Call it.
14595 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14597 * ggc.h (enum ggc_collect): New.
14598 (ggc_collect): Use it.
14599 * ggc-page.c: Adjust.
14600 * ggc-common.c: Likewise.
14601 * ggc-tests.c: Likewise.
14602 * read-rtl-function.c: Likewise.
14603 * selftest-run-tests.c: Likewise.
14604 * doc/gty.texi (Invoking the garbage collector): Likewise.
14606 2021-08-18 liuhongt <hongtao.liu@intel.com>
14609 * config/i386/i386.h (TARGET_V2DF_REDUCTION_PREFER_HADDPD):
14611 * config/i386/sse.md (*sse3_haddv2df3_low): Add
14612 TARGET_V2DF_REDUCTION_PREFER_HADDPD.
14613 (*sse3_hsubv2df3_low): Ditto.
14614 * config/i386/x86-tune.def
14615 (X86_TUNE_V2DF_REDUCTION_PREFER_HADDPD): New tune.
14617 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14619 * gimple-range-gori.cc (gori_compute::gori_compute): Enable tracing.
14620 (gori_compute::compute_operand_range): Add tracing.
14621 (gori_compute::logical_combine): Ditto.
14622 (gori_compute::compute_logical_operands): Ditto.
14623 (gori_compute::compute_operand1_range): Ditto.
14624 (gori_compute::compute_operand2_range): Ditto.
14625 (gori_compute::outgoing_edge_range_p): Ditto.
14626 * gimple-range-gori.h (class gori_compute): Add range_tracer.
14628 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14630 * flag-types.h (enum evrp_mode): Adjust evrp-mode values.
14631 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Relocate from.
14632 * gimple-range-trace.h (DEBUG_RANGE_CACHE): Here.
14633 * params.opt (--param=evrp-mode): Adjust options.
14635 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14637 * Makefile.in (OBJS): Add gimple-range-trace.o.
14638 * gimple-range-cache.h (enable_new_values): Remove unused prototype.
14639 * gimple-range-fold.cc: Adjust headers.
14640 * gimple-range-trace.cc: New.
14641 * gimple-range-trace.h: New.
14642 * gimple-range.cc (gimple_ranger::gimple_ranger): Enable tracer.
14643 (gimple_ranger::range_of_expr): Add tracing.
14644 (gimple_ranger::range_on_entry): Ditto.
14645 (gimple_ranger::range_on_exit): Ditto.
14646 (gimple_ranger::range_on_edge): Ditto.
14647 (gimple_ranger::fold_range_internal): Ditto.
14648 (gimple_ranger::dump_bb): Do not calculate edge range twice.
14649 (trace_ranger::*): Remove.
14650 (enable_ranger): Never create a trace_ranger.
14651 (debug_seed_ranger): Move to gimple-range-trace.cc.
14652 (dump_ranger): Ditto.
14653 (debug_ranger): Ditto.
14654 * gimple-range.h: Include gimple-range-trace.h.
14655 (range_on_entry, range_on_exit): No longer virtual.
14656 (class trace_ranger): Remove.
14657 (DEBUG_RANGE_CACHE): Move to gimple-range-trace.h.
14659 2021-08-17 Martin Sebor <msebor@redhat.com>
14661 PR middle-end/101854
14662 * builtins.c (expand_builtin_alloca): Move warning code to check_alloca
14663 in gimple-ssa-warn-access.cc.
14664 * calls.c (alloc_max_size): Move code to check_alloca.
14665 (get_size_range): Move to pointer-query.cc.
14666 (maybe_warn_alloc_args_overflow): Move to gimple-ssa-warn-access.cc.
14667 (get_attr_nonstring_decl): Move to tree.c.
14668 (fntype_argno_type): Move to gimple-ssa-warn-access.cc.
14669 (append_attrname): Same.
14670 (maybe_warn_rdwr_sizes): Same.
14671 (initialize_argument_information): Move code to
14672 gimple-ssa-warn-access.cc.
14673 * calls.h (maybe_warn_alloc_args_overflow): Move to
14674 gimple-ssa-warn-access.h.
14675 (get_attr_nonstring_decl): Move to tree.h.
14676 (maybe_warn_nonstring_arg): Move to gimple-ssa-warn-access.h.
14677 (enum size_range_flags): Move to pointer-query.h.
14678 (get_size_range): Same.
14679 * gimple-ssa-warn-access.cc (has_location): Remove unused overload
14680 to avoid Clang -Wunused-function.
14681 (get_size_range): Declare static.
14682 (maybe_emit_free_warning): Rename...
14683 (maybe_check_dealloc_call): ...to this for consistency.
14684 (class pass_waccess): Add members.
14685 (pass_waccess::~pass_waccess): Defined.
14686 (alloc_max_size): Move here from calls.c.
14687 (maybe_warn_alloc_args_overflow): Same.
14688 (check_alloca): New function.
14689 (check_alloc_size_call): New function.
14690 (check_strncat): Handle another warning flag.
14691 (pass_waccess::check_builtin): Handle alloca.
14692 (fntype_argno_type): Move here from calls.c.
14693 (append_attrname): Same.
14694 (maybe_warn_rdwr_sizes): Same.
14695 (pass_waccess::check_call): Define.
14696 (check_nonstring_args): New function.
14697 (pass_waccess::check): Call new member functions.
14698 (pass_waccess::execute): Enable ranger.
14699 * gimple-ssa-warn-access.h (get_size_range): Move here from calls.h.
14700 (maybe_warn_nonstring_arg): Same.
14701 * gimple-ssa-warn-restrict.c: Remove #include.
14702 * pointer-query.cc (get_size_range): Move here from calls.c.
14703 * pointer-query.h (enum size_range_flags): Same.
14704 (get_size_range): Same.
14705 * tree.c (get_attr_nonstring_decl): Move here from calls.c.
14706 * tree.h (get_attr_nonstring_decl): Move here from calls.h.
14708 2021-08-17 Thomas Schwinge <thomas@codesourcery.com>
14710 * ggc.h (ggc_collect): Add 'force_collect' parameter.
14711 * ggc-page.c (ggc_collect): Use that one instead of global
14712 'ggc_force_collect'. Adjust all users.
14713 * doc/gty.texi (Invoking the garbage collector): Update.
14714 * ggc-internal.h (ggc_force_collect): Remove.
14715 * ggc-common.c (ggc_force_collect): Likewise.
14716 * selftest.h (forcibly_ggc_collect): Remove.
14717 * ggc-tests.c (selftest::forcibly_ggc_collect): Likewise.
14718 * read-rtl-function.c (test_loading_labels): Adjust.
14719 * selftest-run-tests.c (run_tests): Likewise.
14721 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
14723 * config/darwin.c (darwin_file_end): Reset and reclaim the
14724 section names table at the end of compile.
14726 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
14729 * config.in: Regenerate.
14730 * config/i386/darwin.h (EXTRA_ASM_OPTS): New
14731 (ASM_SPEC): Pass options to disable branch shortening where
14733 * configure: Regenerate.
14734 * configure.ac: Detect versions of 'as' that support the
14735 optimisation which has the bug.
14737 2021-08-17 Richard Biener <rguenther@suse.de>
14739 * optabs-query.c (supports_vec_gather_load_p): Also check
14741 (supports_vec_scatter_store_p): Likewise.
14742 * tree-vect-data-refs.c (vect_gather_scatter_fn_p): Fall
14743 back to masked variants if non-masked are not supported.
14744 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
14745 When we need to use masked gather/scatter but do not have
14746 a mask set up a constant true one.
14747 * tree-vect-stmts.c (vect_check_scalar_mask): Also allow
14748 non-SSA_NAME masks.
14750 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
14752 * tree-ssa-ccp.c (bit_value_binop) [MINUS_EXPR]: Use same
14753 algorithm as PLUS_EXPR to improve subtraction bit bounds.
14754 [POINTER_DIFF_EXPR]: Treat as synonymous with MINUS_EXPR.
14756 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
14758 * tree-ssa-ccp.c (bit_value_mult_const): New helper function to
14759 calculate the mask-value pair result of a multiplication by an
14761 (bit_value_binop) [MULT_EXPR]: Call it from here for
14762 multiplications by (sparse) non-negative constants.
14764 2021-08-17 Christophe Lyon <christophe.lyon@foss.st.com>
14767 * config.gcc (gcc_cv_initfini_array): Leave undefined for
14768 uclinuxfdpiceabi targets.
14770 2021-08-17 Alexandre Oliva <oliva@adacore.com>
14772 * tree-inline.c (maybe_move_debug_stmts_to_successors): Don't
14773 reverse debug stmts.
14775 2021-08-17 Alexandre Oliva <oliva@adacore.com>
14777 * tree-cfg.c (dump_function_to_file): Use fun, not cfun.
14779 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14781 * config/aarch64/arm_neon.h (__LD4_LANE_FUNC): Delete.
14782 (__LD4Q_LANE_FUNC): Likewise.
14783 (vld4_lane_u8): Define without macro.
14784 (vld4_lane_u16): Likewise.
14785 (vld4_lane_u32): Likewise.
14786 (vld4_lane_u64): Likewise.
14787 (vld4_lane_s8): Likewise.
14788 (vld4_lane_s16): Likewise.
14789 (vld4_lane_s32): Likewise.
14790 (vld4_lane_s64): Likewise.
14791 (vld4_lane_f16): Likewise.
14792 (vld4_lane_f32): Likewise.
14793 (vld4_lane_f64): Likewise.
14794 (vld4_lane_p8): Likewise.
14795 (vld4_lane_p16): Likewise.
14796 (vld4_lane_p64): Likewise.
14797 (vld4q_lane_u8): Likewise.
14798 (vld4q_lane_u16): Likewise.
14799 (vld4q_lane_u32): Likewise.
14800 (vld4q_lane_u64): Likewise.
14801 (vld4q_lane_s8): Likewise.
14802 (vld4q_lane_s16): Likewise.
14803 (vld4q_lane_s32): Likewise.
14804 (vld4q_lane_s64): Likewise.
14805 (vld4q_lane_f16): Likewise.
14806 (vld4q_lane_f32): Likewise.
14807 (vld4q_lane_f64): Likewise.
14808 (vld4q_lane_p8): Likewise.
14809 (vld4q_lane_p16): Likewise.
14810 (vld4q_lane_p64): Likewise.
14811 (vld4_lane_bf16): Likewise.
14812 (vld4q_lane_bf16): Likewise.
14814 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14816 * config/aarch64/arm_neon.h (__LD3_LANE_FUNC): Delete.
14817 (__LD3Q_LANE_FUNC): Delete.
14818 (vld3_lane_u8): Define without macro.
14819 (vld3_lane_u16): Likewise.
14820 (vld3_lane_u32): Likewise.
14821 (vld3_lane_u64): Likewise.
14822 (vld3_lane_s8): Likewise.
14823 (vld3_lane_s16): Likewise.
14824 (vld3_lane_s32): Likewise.
14825 (vld3_lane_s64): Likewise.
14826 (vld3_lane_f16): Likewise.
14827 (vld3_lane_f32): Likewise.
14828 (vld3_lane_f64): Likewise.
14829 (vld3_lane_p8): Likewise.
14830 (vld3_lane_p16): Likewise.
14831 (vld3_lane_p64): Likewise.
14832 (vld3q_lane_u8): Likewise.
14833 (vld3q_lane_u16): Likewise.
14834 (vld3q_lane_u32): Likewise.
14835 (vld3q_lane_u64): Likewise.
14836 (vld3q_lane_s8): Likewise.
14837 (vld3q_lane_s16): Likewise.
14838 (vld3q_lane_s32): Likewise.
14839 (vld3q_lane_s64): Likewise.
14840 (vld3q_lane_f16): Likewise.
14841 (vld3q_lane_f32): Likewise.
14842 (vld3q_lane_f64): Likewise.
14843 (vld3q_lane_p8): Likewise.
14844 (vld3q_lane_p16): Likewise.
14845 (vld3q_lane_p64): Likewise.
14846 (vld3_lane_bf16): Likewise.
14847 (vld3q_lane_bf16): Likewise.
14849 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14851 * config/aarch64/arm_neon.h (__LD2_LANE_FUNC): Delete.
14852 (__LD2Q_LANE_FUNC): Likewise.
14853 (vld2_lane_u8): Define without macro.
14854 (vld2_lane_u16): Likewise.
14855 (vld2_lane_u32): Likewise.
14856 (vld2_lane_u64): Likewise.
14857 (vld2_lane_s8): Likewise.
14858 (vld2_lane_s16): Likewise.
14859 (vld2_lane_s32): Likewise.
14860 (vld2_lane_s64): Likewise.
14861 (vld2_lane_f16): Likewise.
14862 (vld2_lane_f32): Likewise.
14863 (vld2_lane_f64): Likewise.
14864 (vld2_lane_p8): Likewise.
14865 (vld2_lane_p16): Likewise.
14866 (vld2_lane_p64): Likewise.
14867 (vld2q_lane_u8): Likewise.
14868 (vld2q_lane_u16): Likewise.
14869 (vld2q_lane_u32): Likewise.
14870 (vld2q_lane_u64): Likewise.
14871 (vld2q_lane_s8): Likewise.
14872 (vld2q_lane_s16): Likewise.
14873 (vld2q_lane_s32): Likewise.
14874 (vld2q_lane_s64): Likewise.
14875 (vld2q_lane_f16): Likewise.
14876 (vld2q_lane_f32): Likewise.
14877 (vld2q_lane_f64): Likewise.
14878 (vld2q_lane_p8): Likewise.
14879 (vld2q_lane_p16): Likewise.
14880 (vld2q_lane_p64): Likewise.
14881 (vld2_lane_bf16): Likewise.
14882 (vld2q_lane_bf16): Likewise.
14884 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14886 * haifa-sched.c (advance_one_cycle): Output more context-synchronization
14889 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14891 * haifa-sched.c (enum rfs_decision, rfs_str): Add RFS_AUTOPREF.
14892 (rank_for_schedule): Use it.
14894 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14896 PR rtl-optimization/91598
14897 * haifa-sched.c (autopref_rank_for_schedule): Prioritize "irrelevant"
14898 insns after memory reads and before memory writes.
14900 2021-08-17 Alistair_Lee <alistair.lee@arm.com>
14902 * rtl.h (CONST_VECTOR_P): New macro.
14903 * config/aarch64/aarch64.c (aarch64_get_sve_pred_bits): Use RTL
14904 code testing macros.
14905 (aarch64_ptrue_all_mode): Likewise.
14906 (aarch64_expand_mov_immediate): Likewise.
14907 (aarch64_const_vec_all_in_range_p): Likewise.
14908 (aarch64_rtx_costs): Likewise.
14909 (aarch64_legitimate_constant_p): Likewise.
14910 (aarch64_simd_valid_immediate): Likewise.
14911 (aarch64_simd_make_constant): Likewise.
14912 (aarch64_convert_mult_to_shift): Likewise.
14913 (aarch64_expand_sve_vec_perm): Likewise.
14914 (aarch64_vec_fpconst_pow_of_2): Likewise.
14916 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14918 PR tree-optimization/101938
14919 * range-op.cc (operator_abs::op1_range): Special case
14920 -TYPE_MIN_VALUE for flag_wrapv.
14922 2021-08-17 Kewen Lin <linkw@linux.ibm.com>
14924 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Add the cost for
14927 2021-08-17 Jakub Jelinek <jakub@redhat.com>
14929 * tree.def (OMP_SCOPE): New tree code.
14930 * tree.h (OMP_SCOPE_BODY, OMP_SCOPE_CLAUSES): Define.
14931 * tree-nested.c (convert_nonlocal_reference_stmt,
14932 convert_local_reference_stmt, convert_gimple_call): Handle
14934 * tree-pretty-print.c (dump_generic_node): Handle OMP_SCOPE.
14935 * gimple.def (GIMPLE_OMP_SCOPE): New gimple code.
14936 * gimple.c (gimple_build_omp_scope): New function.
14937 (gimple_copy): Handle GIMPLE_OMP_SCOPE.
14938 * gimple.h (gimple_build_omp_scope): Declare.
14939 (gimple_has_substatements): Handle GIMPLE_OMP_SCOPE.
14940 (gimple_omp_scope_clauses, gimple_omp_scope_clauses_ptr,
14941 gimple_omp_scope_set_clauses): New inline functions.
14942 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_SCOPE.
14943 * gimple-pretty-print.c (dump_gimple_omp_scope): New function.
14944 (pp_gimple_stmt_1): Handle GIMPLE_OMP_SCOPE.
14945 * gimple-walk.c (walk_gimple_stmt): Likewise.
14946 * gimple-low.c (lower_stmt): Likewise.
14947 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
14948 (gimplify_scan_omp_clauses): For task reductions, handle OMP_SCOPE
14949 like ORT_WORKSHARE constructs. Adjust diagnostics for %<scope%>
14950 allowing task reductions. Reject inscan reductions on scope.
14951 (omp_find_stores_stmt): Handle GIMPLE_OMP_SCOPE.
14952 (gimplify_omp_workshare, gimplify_expr): Handle OMP_SCOPE.
14953 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_SCOPE.
14954 (estimate_num_insns): Likewise.
14955 * omp-low.c (build_outer_var_ref): Look through GIMPLE_OMP_SCOPE
14956 contexts if var isn't privatized there.
14957 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_SCOPE.
14958 (scan_omp_1_stmt): Likewise.
14959 (maybe_add_implicit_barrier_cancel): Look through outer
14961 (lower_omp_scope): New function.
14962 (lower_omp_task_reductions): Handle OMP_SCOPE.
14963 (lower_omp_1): Handle GIMPLE_OMP_SCOPE.
14964 (diagnose_sb_1, diagnose_sb_2): Likewise.
14965 * omp-expand.c (expand_omp_single): Support also GIMPLE_OMP_SCOPE.
14966 (expand_omp): Handle GIMPLE_OMP_SCOPE.
14967 (omp_make_gimple_edges): Likewise.
14968 * omp-builtins.def (BUILT_IN_GOMP_SCOPE_START): New built-in.
14970 2021-08-17 Richard Biener <rguenther@suse.de>
14972 PR tree-optimization/101925
14973 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Set
14974 reverse on COMPONENT_REF and ARRAY_REF according to
14975 what reverse_storage_order_for_component_p does.
14976 (vn_reference_eq): Compare reversed on reference ops.
14977 (reverse_storage_order_for_component_p): New overload.
14978 (vn_reference_lookup_3): Check reverse_storage_order_for_component_p
14979 on the reference looked up.
14981 2021-08-17 Jeff Law <jlaw@localhost.localdomain>
14983 * config/h8300/h8300.c (shift_alg_si): Avoid loops for most SImode
14984 shifts on the H8/S.
14985 (h8300_option_override): Use loops on H8/S more often when optimizing
14987 (get_shift_alg): Handle new "special" cases on H8/S. Simplify
14988 accordingly. Handle various arithmetic right shifts with special
14989 sequences that we couldn't handle before.
14991 2021-08-16 Jeff Law <jlaw@localhost.localdomain>
14993 * config.gcc (rl78-*-elf*): Do not include dbxelf.h.
14995 2021-08-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
14997 * config/sparc/rtemself.h (SPARC_GCOV_TYPE_SIZE): Define.
14998 * config/sparc/sparc.c (sparc_gcov_type_size): New.
14999 (TARGET_GCOV_TYPE_SIZE): Redefine if SPARC_GCOV_TYPE_SIZE is defined.
15000 * coverage.c (get_gcov_type): Use targetm.gcov_type_size().
15001 * doc/tm.texi (TARGET_GCOV_TYPE_SIZE): Add hook under "Misc".
15002 * doc/tm.texi.in: Regenerate.
15003 * target.def (gcov_type_size): New target hook.
15004 * targhooks.c (default_gcov_type_size): New.
15005 * targhooks.h (default_gcov_type_size): Declare.
15006 * tree-profile.c (gimple_gen_edge_profiler): Use precision of
15008 (gimple_gen_time_profiler): Likewise.
15010 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
15012 * dwarf2out.c (add_scalar_info): Deal with DW_AT_data_bit_offset.
15014 2021-08-16 Tobias Burnus <tobias@codesourcery.com>
15016 PR middle-end/101931
15017 * omp-low.c (omp_runtime_api_call): Update for routines
15018 added in the meanwhile.
15020 2021-08-16 Martin Liska <mliska@suse.cz>
15022 PR tree-optimization/100393
15023 * tree-switch-conversion.c (group_cluster::dump): Use
15024 get_comparison_count.
15025 (jump_table_cluster::find_jump_tables): Pre-compute number of
15026 comparisons and then decrement it. Cache also max_ratio.
15027 (jump_table_cluster::can_be_handled): Change signature.
15028 * tree-switch-conversion.h (get_comparison_count): New.
15030 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
15032 * dwarf2out.c (add_data_member_location_attribute): Use GNAT
15033 encodings only when -fgnat-encodings=all is specified.
15034 (add_bound_info): Likewise.
15035 (add_byte_size_attribute): Likewise.
15036 (gen_member_die): Likewise.
15038 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
15040 * omp-oacc-neuter-broadcast.cc
15041 (execute_omp_oacc_neuter_broadcast): Plug 'par' memory leak.
15043 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
15045 * omp-oacc-neuter-broadcast.cc
15046 (execute_omp_oacc_neuter_broadcast): Clarify memory management for
15049 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
15051 * omp-oacc-neuter-broadcast.cc (field_map): Move variable into...
15052 (execute_omp_oacc_neuter_broadcast): ... here.
15053 (install_var_field, build_receiver_ref, build_sender_ref): Take
15054 'field_map_t *' parameter. Adjust all users.
15055 (worker_single_copy, neuter_worker_single): Take a
15056 'record_field_map_t *' parameter. Adjust all users.
15058 2021-08-16 liuhongt <hongtao.liu@intel.com>
15061 * config/i386/i386.md (ldexp<mode>3): Force operands[1] to
15064 2021-08-16 Martin Liska <mliska@suse.cz>
15067 * multiple_target.c (create_dispatcher_calls): Make default
15068 function local only if it is a definition.
15070 2021-08-16 Martin Liska <mliska@suse.cz>
15073 * ipa-icf-gimple.c (func_checker::compare_ssa_name): Do not
15074 consider equal SSA_NAMEs when one is a param.
15076 2021-08-16 liuhongt <hongtao.liu@intel.com>
15079 * config/i386/i386-expand.c (ix86_expand_vec_perm_vpermt2):
15080 Support vpermi2b for V32QI/V16QImode.
15081 (ix86_extract_perm_from_pool_constant): New function.
15082 (ix86_expand_vec_one_operand_perm_avx512): Support
15083 vpermw/vpermb under TARGET_AVX512BW/TARGET_AVX512VBMI.
15084 (expand_vec_perm_1): Adjust comments for upper.
15085 * config/i386/i386-protos.h (ix86_extract_perm_from_pool_constant):
15087 * config/i386/predicates.md (permvar_truncate_operand): New predicate.
15088 (pshufb_truncv4siv4hi_operand): Ditto.
15089 (pshufb_truncv8hiv8qi_operand): Ditto.
15090 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1):
15091 New pre_reload define_insn_and_split.
15092 (*avx512f_permvar_truncv8siv8hi_1): Ditto.
15093 (*avx512f_vpermvar_truncv8div8si_1): Ditto.
15094 (*avx512f_permvar_truncv32hiv32qi_1): Ditto.
15095 (*avx512f_permvar_truncv16hiv16qi_1): Ditto.
15096 (*avx512f_permvar_truncv4div4si_1): Ditto.
15097 (*avx512f_pshufb_truncv8hiv8qi_1): Ditto.
15098 (*avx512f_pshufb_truncv4siv4hi_1): Ditto.
15099 (*avx512f_pshufd_truncv2div2si_1): Ditto.
15101 2021-08-16 Kito Cheng <kito.cheng@sifive.com>
15103 * config/riscv/multilib-generator: Support code model option for
15105 * doc/install.texi: Add document of new option for
15106 --with-multilib-generator.
15108 2021-08-15 Clément Chigot <clement.chigot@atos.net>
15110 * config/rs6000/rs6000.c (xcoff_tls_exec_model_detected): New.
15111 (rs6000_legitimize_tls_address_aix): Use it.
15112 (rs6000_xcoff_file_end): Add ".ref __tls_get_addr" when
15113 xcoff_tls_exec_model_detected is true.
15115 2021-08-15 Jeff Law <jlaw@localhost.localdomain>
15117 * config/h8300/h8300.c (shift_alg_si): Retune H8/300H shifts
15118 to allow a bit more code growth, saving many dozens of cycles.
15119 (h8300_option_override): Adjus shift_alg_si if optimizing for
15121 (get_shift_alg): Use special + inline shifts for residuals
15124 2021-08-14 Stafford Horne <shorne@gmail.com>
15127 * config/or1k/or1k-opts.h: New file.
15128 * config/or1k/or1k.c (or1k_legitimize_address_1, print_reloc):
15129 Support generating gotha relocations if -mcmodel=large is
15131 * config/or1k/or1k.h (TARGET_CMODEL_SMALL, TARGET_CMODEL_LARGE):
15133 * config/or1k/or1k.opt (mcmodel=): New option.
15134 * doc/invoke.texi (OpenRISC Options): Document mcmodel.
15136 2021-08-14 Martin Sebor <msebor@redhat.com>
15138 PR middle-end/101791
15139 * gimple-ssa-warn-access.cc (new_delete_mismatch_p): Use new argument
15140 to valid_new_delete_pair_p.
15141 * tree.c (valid_new_delete_pair_p): Add argument.
15142 * tree.h (valid_new_delete_pair_p): Same.
15144 2021-08-14 Jakub Jelinek <jakub@redhat.com>
15147 * config/i386/i386-expand.c (expand_vec_perm_broadcast_1)
15148 <case E_V64QImode>: For this mode assert
15149 !TARGET_AVX512BW || d->perm[0] rather than !TARGET_AVX2 || d->perm[0].
15151 2021-08-13 Michael Meissner <meissner@linux.ibm.com>
15154 * config/rs6000/altivec.md (xxeval): Use register_predicate
15155 instead of altivec_register_predicate.
15157 2021-08-13 Martin Sebor <msebor@redhat.com>
15159 PR middle-end/101734
15160 * tree-ssa-uninit.c (maybe_warn_read_write_only): New function.
15161 (maybe_warn_operand): Call it.
15163 2021-08-13 Martin Liska <mliska@suse.cz>
15166 * attribs.c (decl_attributes): Make naked functions "noipa"
15169 2021-08-13 Martin Liska <mliska@suse.cz>
15172 * symtab.c (symtab_node::noninterposable_alias): Do not create
15173 local aliases for target_clone functions as the clonning pass
15176 2021-08-13 Martin Liska <mliska@suse.cz>
15178 * opts.c (LIVE_PATCHING_OPTION): Define.
15179 (control_options_for_live_patching): Use it in error messages.
15181 2021-08-13 Jan Hubicka <hubicka@ucw.cz>
15183 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOREAD.
15184 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
15185 ignore_stores_eaf_flags): New constants.
15186 (remove_useless_eaf_flags): New function.
15187 (eaf_flags_useful_p): Use it.
15188 (deref_flags): Add EAF_NOT_RETURNED if flag is unused;
15190 (modref_lattice::init): Add EAF_NOREAD.
15191 (modref_lattice::add_escape_point): Do not reacord escape point if
15193 (modref_lattice::merge): EAF_NOESCAPE implies EAF_NODIRECTESCAPE;
15194 use remove_useless_eaf_flags.
15195 (modref_lattice::merge_deref): Use ignore_stores_eaf_flags.
15196 (modref_lattice::merge_direct_load): Add EAF_NOREAD
15197 (analyze_ssa_name_flags): Fix handling EAF_NOT_RETURNED
15198 (analyze_parms): Use remove_useless_eaf_flags.
15199 (ipa_merge_modref_summary_after_inlining): Use ignore_stores_eaf_flags.
15200 (modref_merge_call_site_flags): Add caller and ecf_flags parameter;
15201 use remove_useless_eaf_flags.
15202 (modref_propagate_flags_in_scc): Update.
15203 * ipa-modref.h: Turn eaf_flags_t back to char.
15204 * tree-core.h (EAF_NOT_RETURNED): Fix.
15205 (EAF_NOREAD): New constant
15206 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1): Check for
15208 * tree-ssa-structalias.c (handle_rhs_call): Handle new flags.
15209 (handle_pure_call): Likewise.
15211 2021-08-12 Jakub Jelinek <jakub@redhat.com>
15213 * tree.def (OMP_MASKED): New tree code.
15214 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_FILTER.
15215 * tree.h (OMP_MASKED_BODY, OMP_MASKED_CLAUSES, OMP_MASKED_COMBINED,
15216 OMP_CLAUSE_FILTER_EXPR): Define.
15217 * tree.c (omp_clause_num_ops): Add OMP_CLAUSE_FILTER entry.
15218 (omp_clause_code_name): Likewise.
15219 (walk_tree_1): Handle OMP_CLAUSE_FILTER.
15220 * tree-nested.c (convert_nonlocal_omp_clauses,
15221 convert_local_omp_clauses): Handle OMP_CLAUSE_FILTER.
15222 (convert_nonlocal_reference_stmt, convert_local_reference_stmt,
15223 convert_gimple_call): Handle GIMPLE_OMP_MASTER.
15224 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_FILTER.
15225 (dump_generic_node): Handle OMP_MASTER.
15226 * gimple.def (GIMPLE_OMP_MASKED): New gimple code.
15227 * gimple.c (gimple_build_omp_masked): New function.
15228 (gimple_copy): Handle GIMPLE_OMP_MASKED.
15229 * gimple.h (gimple_build_omp_masked): Declare.
15230 (gimple_has_substatements): Handle GIMPLE_OMP_MASKED.
15231 (gimple_omp_masked_clauses, gimple_omp_masked_clauses_ptr,
15232 gimple_omp_masked_set_clauses): New inline functions.
15233 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_MASKED.
15234 * gimple-pretty-print.c (dump_gimple_omp_masked): New function.
15235 (pp_gimple_stmt_1): Handle GIMPLE_OMP_MASKED.
15236 * gimple-walk.c (walk_gimple_stmt): Likewise.
15237 * gimple-low.c (lower_stmt): Likewise.
15238 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
15239 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_FILTER. For clauses
15240 that take one expression rather than decl or constant, force
15241 gimplification of that into a SSA_NAME or temporary unless min
15243 (gimplify_adjust_omp_clauses): Handle OMP_CLAUSE_FILTER.
15244 (gimplify_expr): Handle OMP_MASKED.
15245 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_MASKED.
15246 (estimate_num_insns): Likewise.
15247 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_FILTER.
15248 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_MASKED. Adjust
15249 diagnostics for existence of masked construct.
15250 (scan_omp_1_stmt, lower_omp_master, lower_omp_1, diagnose_sb_1,
15251 diagnose_sb_2): Handle GIMPLE_OMP_MASKED.
15252 * omp-expand.c (expand_omp_synch, expand_omp, omp_make_gimple_edges):
15255 2021-08-12 Uroš Bizjak <ubizjak@gmail.com>
15258 * config/i386/i386.md (avx512f_scalef<mode>2): New insn pattern.
15259 (ldexp<mode>3): Use avx512f_scalef<mode>2.
15260 (UNSPEC_SCALEF): Move from ...
15261 * config/i386/sse.md (UNSPEC_SCALEF): ... here.
15263 2021-08-12 Jan Hubicka <hubicka@ucw.cz>
15265 * ipa-split.c (consider_split): Fix condition testing void functions.
15267 2021-08-12 Aldy Hernandez <aldyh@redhat.com>
15269 * doc/invoke.texi: Remove docs for threader-mode param.
15270 * flag-types.h (enum threader_mode): Remove.
15271 * params.opt: Remove threader-mode param.
15272 * tree-ssa-threadbackward.c (class back_threader): Remove
15273 path_is_unreachable_p.
15274 Make find_paths private.
15275 Add maybe_thread and thread_through_all_blocks.
15276 Remove reference marker for m_registry.
15277 Remove reference marker for m_profit.
15278 (back_threader::back_threader): Adjust for registry and profit not
15280 (dump_path): Move down.
15281 (debug): Move down.
15282 (class thread_jumps): Remove.
15283 (class back_threader_registry): Remove m_all_paths.
15285 (thread_jumps::thread_through_all_blocks): Move to back_threader
15287 (fsm_find_thread_path): Remove
15288 (back_threader::maybe_thread): New.
15289 (back_threader::thread_through_all_blocks): Move from
15291 (back_threader_registry::back_threader_registry): Remove
15293 (back_threader_registry::~back_threader_registry): Remove.
15294 (thread_jumps::find_taken_edge): Remove.
15295 (thread_jumps::check_subpath_and_update_thread_path): Remove.
15296 (thread_jumps::maybe_register_path): Remove.
15297 (thread_jumps::handle_phi): Remove.
15298 (handle_assignment_p): Remove.
15299 (thread_jumps::handle_assignment): Remove.
15300 (thread_jumps::fsm_find_control_statement_thread_paths): Remove.
15301 (thread_jumps::find_jump_threads_backwards): Remove.
15302 (thread_jumps::find_jump_threads_backwards_with_ranger): Remove.
15303 (try_thread_blocks): Rename find_jump_threads_backwards to
15305 (pass_early_thread_jumps::execute): Same.
15307 2021-08-12 Tobias Burnus <tobias@codesourcery.com>
15309 * tree-core.h (omp_clause_proc_bind_kind): Add
15310 OMP_CLAUSE_PROC_BIND_PRIMARY.
15311 * tree-pretty-print.c (dump_omp_clause): Add TODO comment to
15312 change 'master' to 'primary' in proc_bind for OpenMP 5.1.
15314 2021-08-12 Claudiu Zissulescu <claziss@synopsys.com>
15316 * common/config/arc/arc-common.c (arc_option_init_struct): Remove
15317 fno-common reference.
15318 * config/arc/arc.c (arc_override_options): Remove overriding of
15321 2021-08-12 Jakub Jelinek <jakub@redhat.com>
15324 * config/i386/i386-expand.c (ix86_expand_vec_one_operand_perm_avx512):
15325 If d->testing_p, return true after performing checks instead of
15326 actually expanding the insn.
15327 (expand_vec_perm_broadcast_1): Handle V32HImode - assert
15328 !TARGET_AVX512BW and return false.
15330 2021-08-12 Eric Botcazou <ebotcazou@gcc.gnu.org>
15332 * configure.ac (PE linker --disable-dynamicbase support): New check.
15333 * configure: Regenerate.
15334 * config.in: Likewise.
15335 * config/i386/mingw32.h (LINK_SPEC_DISABLE_DYNAMICBASE): New define.
15336 (LINK_SPEC): Use it.
15337 * config/i386/mingw-w64.h (LINK_SPEC_DISABLE_DYNAMICBASE): Likewise.
15338 (LINK_SPEC): Likewise.
15340 2021-08-12 liuhongt <hongtao.liu@intel.com>
15343 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_2): New
15344 post_reload define_insn_and_split.
15345 (*avx512bw_zero_extendv32qiv32hi2_2): Ditto.
15346 (*sse4_1_zero_extendv8qiv8hi2_4): Ditto.
15347 (*avx512f_zero_extendv16hiv16si2_2): Ditto.
15348 (*avx2_zero_extendv8hiv8si2_2): Ditto.
15349 (*sse4_1_zero_extendv4hiv4si2_4): Ditto.
15350 (*avx512f_zero_extendv8siv8di2_2): Ditto.
15351 (*avx2_zero_extendv4siv4di2_2): Ditto.
15352 (*sse4_1_zero_extendv2siv2di2_4): Ditto.
15353 (VI248_256, VI248_512, VI148_512, VI148_256, VI148_128): New
15356 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15358 * config/rs6000/rs6000-builtin-new.def: Add always, power5, and
15361 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15363 * config/rs6000/rs6000-builtin-new.def: Add vsx stanza.
15365 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15367 * config/rs6000/rs6000-builtin-new.def: Finish altivec stanza.
15368 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
15369 initialization of pcvoid_type_node here...
15370 (altivec_init_builtins): ...from here.
15371 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add
15372 RS6000_BTI_const_ptr_void.
15373 (pcvoid_type_node): New macro.
15375 2021-08-11 Richard Biener <rguenther@suse.de>
15378 * tree-ssa-forwprop.c (pass_forwprop::execute): Do not decompose
15379 hard-register accesses.
15381 2021-08-11 Richard Biener <rguenther@suse.de>
15383 * tree-ssa-operands.c (operands_scanner::get_expr_operands):
15384 Do not look at COMPONENT_REF FIELD_DECLs TREE_THIS_VOLATILE
15385 to determine has_volatile_ops.
15387 2021-08-11 Eric Botcazou <ebotcazou@gcc.gnu.org>
15389 * cfgexpand.c (expand_used_vars): Reuse attribs local variable.
15391 2021-08-11 Jan Hubicka <hubicka@ucw.cz>
15392 Alexandre Oliva <oliva@adacore.com>
15394 * ipa-modref.c (modref_lattice::dump): Fix escape_point's min_flags
15396 (modref_lattice::merge_deref): Fix handling of indirect scape points.
15397 (update_escape_summary_1): Likewise.
15398 (update_escape_summary): Likewise.
15399 (ipa_merge_modref_summary_after_inlining): Likewise.
15401 2021-08-11 Richard Biener <rguenther@suse.de>
15403 PR middle-end/101858
15404 * fold-const.c (fold_binary_loc): Guard simplification
15405 of X < (cast) (1 << Y) to integer types.
15407 2021-08-11 Richard Biener <rguenther@suse.de>
15409 PR tree-optimization/101861
15410 * tree-vect-stmts.c (vectorizable_load): Fix error in
15411 previous change with regard to gather vectorization.
15413 2021-08-11 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
15416 * config/arm/arm_neon.h (vdup_n_s8): Replace call to builtin
15418 (vdup_n_s16): Likewise.
15419 (vdup_n_s32): Likewise.
15420 (vdup_n_s64): Likewise.
15421 (vdup_n_u8): Likewise.
15422 (vdup_n_u16): Likewise.
15423 (vdup_n_u32): Likewise.
15424 (vdup_n_u64): Likewise.
15425 (vdup_n_p8): Likewise.
15426 (vdup_n_p16): Likewise.
15427 (vdup_n_p64): Likewise.
15428 (vdup_n_f16): Likewise.
15429 (vdup_n_f32): Likewise.
15430 (vdupq_n_s8): Likewise.
15431 (vdupq_n_s16): Likewise.
15432 (vdupq_n_s32): Likewise.
15433 (vdupq_n_s64): Likewise.
15434 (vdupq_n_u8): Likewise.
15435 (vdupq_n_u16): Likewise.
15436 (vdupq_n_u32): Likewise.
15437 (vdupq_n_u64): Likewise.
15438 (vdupq_n_p8): Likewise.
15439 (vdupq_n_p16): Likewise.
15440 (vdupq_n_p64): Likewise.
15441 (vdupq_n_f16): Likewise.
15442 (vdupq_n_f32): Likewise.
15443 (vmov_n_s8): Replace call to builtin with call to corresponding
15445 (vmov_n_s16): Likewise.
15446 (vmov_n_s32): Likewise.
15447 (vmov_n_s64): Likewise.
15448 (vmov_n_u8): Likewise.
15449 (vmov_n_u16): Likewise.
15450 (vmov_n_u32): Likewise.
15451 (vmov_n_u64): Likewise.
15452 (vmov_n_p8): Likewise.
15453 (vmov_n_p16): Likewise.
15454 (vmov_n_f16): Likewise.
15455 (vmov_n_f32): Likewise.
15456 (vmovq_n_s8): Likewise.
15457 (vmovq_n_s16): Likewise.
15458 (vmovq_n_s32): Likewise.
15459 (vmovq_n_s64): Likewise.
15460 (vmovq_n_u8): Likewise.
15461 (vmovq_n_u16): Likewise.
15462 (vmovq_n_u32): Likewise.
15463 (vmovq_n_u64): Likewise.
15464 (vmovq_n_p8): Likewise.
15465 (vmovq_n_p16): Likewise.
15466 (vmovq_n_f16): Likewise.
15467 (vmovq_n_f32): Likewise.
15468 * config/arm/arm_neon_builtins.def: Remove entries for vdup_n.
15470 2021-08-11 liuhongt <hongtao.liu@intel.com>
15473 * config/i386/i386.md (ldexp<mode>3): Extend to vscalefs[sd]
15474 when TARGET_AVX512F and TARGET_SSE_MATH.
15476 2021-08-10 Jakub Jelinek <jakub@redhat.com>
15479 * config/i386/i386-expand.c (expand_vec_perm_even_odd): Return false
15480 for V32HImode if !TARGET_AVX512BW.
15481 (ix86_vectorize_vec_perm_const) <case E_V32HImode, case E_V64QImode>:
15482 If !TARGET_AVX512BW and TARGET_AVX512F and d.testing_p, don't fail
15483 early, but actually check the permutation.
15485 2021-08-10 Richard Biener <rguenther@suse.de>
15487 PR tree-optimization/101809
15488 * tree-vect-stmts.c (get_load_store_type): Allow emulated
15489 gathers with offset vector nunits being a constant multiple
15490 of the data vector nunits.
15491 (vect_get_gather_scatter_ops): Use the appropriate nunits
15492 for the offset vector defs.
15493 (vectorizable_store): Adjust call to
15494 vect_get_gather_scatter_ops.
15495 (vectorizable_load): Likewise. Handle the case of less
15496 offset vectors than data vectors.
15498 2021-08-10 Jakub Jelinek <jakub@redhat.com>
15501 * config/i386/sse.md (*avx512f_shuf_<shuffletype>64x2_1<mask_name>_1,
15502 *avx512f_shuf_<shuffletype>32x4_1<mask_name>_1): New define_insn
15505 2021-08-10 Richard Biener <rguenther@suse.de>
15507 PR tree-optimization/101801
15508 PR tree-optimization/101819
15509 * tree-vectorizer.h (vect_emulated_vector_p): Declare.
15510 * tree-vect-loop.c (vect_emulated_vector_p): New function.
15511 (vectorizable_reduction): Re-instantiate a check for emulated
15513 * tree-vect-stmts.c (vectorizable_shift): Likewise.
15514 (vectorizable_operation): Likewise. Cost emulated vector
15515 operations according to the scalar sequence synthesized by
15518 2021-08-10 Richard Biener <rguenther@suse.de>
15520 PR middle-end/101824
15521 * tree-nested.c (get_frame_field): Mark the COMPONENT_REF as
15522 volatile in case the variable was.
15524 2021-08-10 H.J. Lu <hjl.tools@gmail.com>
15527 * config/i386/constraints.md (BC): Document for integer SSE
15528 constant all bits set operand.
15529 (BF): New constraint for const floating-point all bits set
15531 * config/i386/i386.c (standard_sse_constant_p): Likewise.
15532 (standard_sse_constant_opcode): Likewise.
15533 * config/i386/sse.md (sseconstm1): New mode attribute.
15534 (mov<mode>_internal): Replace BC with <sseconstm1>.
15536 2021-08-10 liuhongt <hongtao.liu@intel.com>
15538 * config/i386/sse.md (cond_<insn><mode>): New expander.
15539 (VI248_AVX512VLBW): New mode iterator.
15540 * config/i386/predicates.md
15541 (nonimmediate_or_const_vec_dup_operand): New predicate.
15543 2021-08-09 Andrew MacLeod <amacleod@redhat.com>
15545 PR tree-optimization/101741
15546 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Check
15547 type of parameter for toupper/tolower.
15549 2021-08-09 Martin Jambor <mjambor@suse.cz>
15551 PR testsuite/101654
15552 * ipa-prop.c (propagate_controlled_uses): Removed a spurious space.
15554 2021-08-09 Pat Haugen <pthaugen@linux.ibm.com>
15556 * config/rs6000/rs6000.c (is_load_insn1): Verify destination is a
15558 (is_store_insn1): Verify source is a register.
15560 2021-08-09 Uroš Bizjak <ubizjak@gmail.com>
15563 * config/i386/mmx.md (<any_logic:code>v2sf3):
15564 Rename from *mmx_<any_logic:code>v2sf3
15566 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15568 * config/nvptx/nvptx.c: Cross-reference parts adapted in
15569 'gcc/omp-oacc-neuter-broadcast.cc'.
15570 * omp-low.c: Likewise.
15571 * omp-oacc-neuter-broadcast.cc: Cross-reference parts adapted from
15574 2021-08-09 Julian Brown <julian@codesourcery.com>
15575 Kwok Cheung Yeung <kcy@codesourcery.com>
15576 Thomas Schwinge <thomas@codesourcery.com>
15578 * config/gcn/gcn.c (gcn_init_builtins): Override decls for
15579 BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START,
15580 BUILT_IN_GOACC_SINGLE_COPY_END and BUILT_IN_GOACC_BARRIER.
15581 (gcn_goacc_validate_dims): Turn on worker partitioning unconditionally.
15582 (gcn_fork_join): Update comment.
15583 * config/gcn/gcn.opt (flag_worker_partitioning): Remove.
15584 (macc_experimental_workers): Remove unused option.
15586 2021-08-09 Julian Brown <julian@codesourcery.com>
15587 Nathan Sidwell <nathan@codesourcery.com> (via 'gcc/config/nvptx/nvptx.c' master)
15588 Kwok Cheung Yeung <kcy@codesourcery.com>
15589 Thomas Schwinge <thomas@codesourcery.com>
15591 * Makefile.in (OBJS): Add omp-oacc-neuter-broadcast.o.
15592 * doc/tm.texi.in (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD):
15593 Add documentation hook.
15594 * doc/tm.texi: Regenerate.
15595 * omp-oacc-neuter-broadcast.cc: New file.
15596 * omp-builtins.def (BUILT_IN_GOACC_BARRIER)
15597 (BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START)
15598 (BUILT_IN_GOACC_SINGLE_COPY_END): New builtins.
15599 * passes.def (pass_omp_oacc_neuter_broadcast): Add pass.
15600 * target.def (goacc.create_worker_broadcast_record): Add target
15602 * tree-pass.h (make_pass_omp_oacc_neuter_broadcast): Add
15604 * config/gcn/gcn-protos.h (gcn_goacc_adjust_propagation_record):
15605 Rename prototype to...
15606 (gcn_goacc_create_worker_broadcast_record): ... this.
15607 * config/gcn/gcn-tree.c (gcn_goacc_adjust_propagation_record): Rename
15609 (gcn_goacc_create_worker_broadcast_record): ... this.
15610 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_PROPAGATION_RECORD):
15612 (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD): ... this.
15614 2021-08-09 Tejas Belagod <tejas.belagod@arm.com>
15617 * config/aarch64/aarch64-simd.md (vlshr<mode>3, vashr<mode>3): Use
15618 the right iterator.
15620 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15622 * Makefile.in (GTFILES): Remove '$(srcdir)/omp-offload.c'.
15624 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15626 * builtins.def (DEF_GOACC_BUILTIN, DEF_GOMP_BUILTIN): Don't
15627 consider '-foffload-abi'.
15628 * common.opt (-foffload-abi): Remove 'Var', 'Init'.
15629 * opts.c (common_handle_option) <-foffload-abi> [ACCEL_COMPILER]:
15632 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15634 * optc-gen.awk: Sanity check that 'Init' doesn't appear without
15637 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15639 * omp-builtins.def (BUILT_IN_ACC_GET_DEVICE_TYPE): Remove.
15641 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15643 * doc/gty.texi (Files): Update.
15645 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15647 * doc/gty.texi (Files): Fix GTY header file example.
15649 2021-08-09 Roger Sayle <roger@nextmovesoftware.com>
15651 * tree-ssa-ccp.c (value_mask_to_min_max): Helper function to
15652 determine the upper and lower bounds from a mask-value pair.
15653 (bit_value_unop) [ABS_EXPR, ABSU_EXPR]: Add support for
15654 absolute value and unsigned absolute value expressions.
15655 (bit_value_binop): Initialize *VAL's precision.
15656 [LT_EXPR, LE_EXPR]: Use value_mask_to_min_max to determine
15657 upper and lower bounds of operands. Add LE_EXPR/GE_EXPR
15658 support when the operands are unknown but potentially equal.
15659 [MIN_EXPR, MAX_EXPR]: Support minimum/maximum expressions.
15661 2021-08-09 Bin Cheng <bin.cheng@linux.alibaba.com>
15663 * config/aarch64/aarch64.md
15664 (*extend<SHORT:mode><GPI:mode>2_aarch64): Use %<GPI:w>0.
15666 2021-08-08 Sergei Trofimovich <siarheit@google.com>
15668 * lra-constraints.c: Fix s/otput/output/ typo.
15670 2021-08-06 Martin Sebor <msebor@redhat.com>
15672 * builtins.c (expand_builtin_memchr): Move to gimple-ssa-warn-access.cc.
15673 (expand_builtin_strcat): Same.
15674 (expand_builtin_stpncpy): Same.
15675 (expand_builtin_strncat): Same.
15676 (check_read_access): Same.
15677 (check_memop_access): Same.
15678 (expand_builtin_strlen): Move checks to gimple-ssa-warn-access.cc.
15679 (expand_builtin_strnlen): Same.
15680 (expand_builtin_memcpy): Same.
15681 (expand_builtin_memmove): Same.
15682 (expand_builtin_mempcpy): Same.
15683 (expand_builtin_strcpy): Same.
15684 (expand_builtin_strcpy_args): Same.
15685 (expand_builtin_stpcpy_1): Same.
15686 (expand_builtin_strncpy): Same.
15687 (expand_builtin_memset): Same.
15688 (expand_builtin_bzero): Same.
15689 (expand_builtin_strcmp): Same.
15690 (expand_builtin_strncmp): Same.
15691 (expand_builtin): Remove handlers.
15692 (fold_builtin_strlen): Add a comment.
15693 * builtins.h (check_access): Move to gimple-ssa-warn-access.cc.
15694 * calls.c (maybe_warn_nonstring_arg): Same.
15695 * diagnostic-spec.c (nowarn_spec_t::nowarn_spec_t): Add warning option.
15696 * gimple-fold.c (gimple_fold_builtin_strcpy): Pass argument to callee.
15697 (gimple_fold_builtin_stpcpy): Same.
15698 * gimple-ssa-warn-access.cc (has_location): New function.
15699 (get_location): Same.
15700 (get_callee_fndecl): Same.
15701 (call_nargs): Same.
15703 (warn_string_no_nul): Define.
15704 (unterminated_array): Same.
15705 (check_nul_terminated_array): Same.
15706 (maybe_warn_nonstring_arg): Same.
15707 (maybe_warn_for_bound): Same.
15708 (warn_for_access): Same.
15709 (check_access): Same.
15710 (check_memop_access): Same.
15711 (check_read_access): Same.
15712 (warn_dealloc_offset): Use helper functions.
15713 (maybe_emit_free_warning): Same.
15714 (class pass_waccess): Add members.
15715 (check_strcat): New function.
15716 (check_strncat): New function.
15717 (check_stxcpy): New function.
15718 (check_stxncpy): New function.
15719 (check_strncmp): New function.
15720 (pass_waccess::check_builtin): New function.
15721 (pass_waccess::check): Call it.
15722 * gimple-ssa-warn-access.h (warn_string_no_nul): Move here from
15724 (maybe_warn_for_bound): Same.
15725 (check_access): Same.
15726 (check_memop_access): Same.
15727 (check_read_access): Same.
15728 * pointer-query.h (struct access_data): Define a ctor overload.
15730 2021-08-06 Richard Biener <rguenther@suse.de>
15732 PR tree-optimization/101801
15733 * tree-vectorizer.h (vect_worthwhile_without_simd_p): Rename...
15734 (vect_can_vectorize_without_simd_p): ... to this.
15735 * tree-vect-loop.c (vect_worthwhile_without_simd_p): Rename...
15736 (vect_can_vectorize_without_simd_p): ... to this and fold
15737 in vect_min_worthwhile_factor.
15738 (vect_min_worthwhile_factor): Remove.
15739 (vectorizable_reduction): Adjust and remove the cost part.
15740 * tree-vect-stmts.c (vectorizable_shift): Likewise.
15741 (vectorizable_operation): Likewise.
15743 2021-08-06 Uroš Bizjak <ubizjak@gmail.com>
15746 * config/i386/i386.md (cmove reg-to-reg move elimination peephole2s):
15747 Add general_gr_operand predicate to operand 3.
15749 2021-08-06 Roger Sayle <roger@nextmovesoftware.com>
15751 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern): Use
15752 CFN_BUILT_IN_CLRSB* instead of BUILT_IN_CLRSB* for consistency.
15754 2021-08-06 Tamar Christina <tamar.christina@arm.com>
15756 * config/aarch64/aarch64-sve-builtins.cc (register_svpattern,
15757 register_svprfop): Pass vec<> by pointer.
15758 * langhooks-def.h (lhd_simulate_enum_decl): Likewise.
15759 * langhooks.c (lhd_simulate_enum_decl): Likewise.
15760 * langhooks.h (struct lang_hooks_for_types): Likewise.
15762 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15764 * config/aarch64/arm_neon.h (vst1_bf16_x2): Use
15765 __builtin_memcpy instead of constructing an additional
15766 __builtin_aarch64_simd_oi one vector at a time.
15767 (vst1q_bf16_x2): Likewise.
15768 (vst1_bf16_x3): Use __builtin_memcpy instead of constructing
15769 an additional __builtin_aarch64_simd_ci one vector at a time.
15770 (vst1q_bf16_x3): Likewise.
15771 (vst1_bf16_x4): Use __builtin_memcpy instead of a union.
15772 (vst1q_bf16_x4): Likewise.
15773 (vst2_bf16): Use __builtin_memcpy instead of constructing an
15774 additional __builtin_aarch64_simd_oi one vector at a time.
15775 (vst2q_bf16): Likewise.
15776 (vst3_bf16): Use __builtin_memcpy instead of constructing an
15777 additional __builtin_aarch64_simd_ci mode one vector at a
15779 (vst3q_bf16): Likewise.
15780 (vst4_bf16): Use __builtin_memcpy instead of constructing an
15781 additional __builtin_aarch64_simd_xi one vector at a time.
15782 (vst4q_bf16): Likewise.
15784 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15786 * config/aarch64/arm_neon.h (__ST2_LANE_FUNC): Delete.
15787 (__ST2Q_LANE_FUNC): Delete.
15788 (vst2_lane_f16): Use __builtin_memcpy to copy vector
15789 structure instead of constructing __builtin_aarch64_simd_oi
15790 one vector at a time.
15791 (vst2_lane_f32): Likewise.
15792 (vst2_lane_f64): Likewise.
15793 (vst2_lane_p8): Likewise.
15794 (vst2_lane_p16): Likewise.
15795 (vst2_lane_p64): Likewise.
15796 (vst2_lane_s8): Likewise.
15797 (vst2_lane_s16): Likewise.
15798 (vst2_lane_s32): Likewise.
15799 (vst2_lane_s64): Likewise.
15800 (vst2_lane_u8): Likewise.
15801 (vst2_lane_u16): Likewise.
15802 (vst2_lane_u32): Likewise.
15803 (vst2_lane_u64): Likewise.
15804 (vst2_lane_bf16): Likewise.
15805 (vst2q_lane_f16): Use __builtin_memcpy to copy vector
15806 structure instead of using a union.
15807 (vst2q_lane_f32): Likewise.
15808 (vst2q_lane_f64): Likewise.
15809 (vst2q_lane_p8): Likewise.
15810 (vst2q_lane_p16): Likewise.
15811 (vst2q_lane_p64): Likewise.
15812 (vst2q_lane_s8): Likewise.
15813 (vst2q_lane_s16): Likewise.
15814 (vst2q_lane_s32): Likewise.
15815 (vst2q_lane_s64): Likewise.
15816 (vst2q_lane_u8): Likewise.
15817 (vst2q_lane_u16): Likewise.
15818 (vst2q_lane_u32): Likewise.
15819 (vst2q_lane_u64): Likewise.
15820 (vst2q_lane_bf16): Likewise.
15822 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15824 * config/aarch64/arm_neon.h (__ST3_LANE_FUNC): Delete.
15825 (__ST3Q_LANE_FUNC): Delete.
15826 (vst3_lane_f16): Use __builtin_memcpy to copy vector
15827 structure instead of constructing __builtin_aarch64_simd_ci
15828 one vector at a time.
15829 (vst3_lane_f32): Likewise.
15830 (vst3_lane_f64): Likewise.
15831 (vst3_lane_p8): Likewise.
15832 (vst3_lane_p16): Likewise.
15833 (vst3_lane_p64): Likewise.
15834 (vst3_lane_s8): Likewise.
15835 (vst3_lane_s16): Likewise.
15836 (vst3_lane_s32): Likewise.
15837 (vst3_lane_s64): Likewise.
15838 (vst3_lane_u8): Likewise.
15839 (vst3_lane_u16): Likewise.
15840 (vst3_lane_u32): Likewise.
15841 (vst3_lane_u64): Likewise.
15842 (vst3_lane_bf16): Likewise.
15843 (vst3q_lane_f16): Use __builtin_memcpy to copy vector
15844 structure instead of using a union.
15845 (vst3q_lane_f32): Likewise.
15846 (vst3q_lane_f64): Likewise.
15847 (vst3q_lane_p8): Likewise.
15848 (vst3q_lane_p16): Likewise.
15849 (vst3q_lane_p64): Likewise.
15850 (vst3q_lane_s8): Likewise.
15851 (vst3q_lane_s16): Likewise.
15852 (vst3q_lane_s32): Likewise.
15853 (vst3q_lane_s64): Likewise.
15854 (vst3q_lane_u8): Likewise.
15855 (vst3q_lane_u16): Likewise.
15856 (vst3q_lane_u32): Likewise.
15857 (vst3q_lane_u64): Likewise.
15858 (vst3q_lane_bf16): Likewise.
15860 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15862 * config/aarch64/arm_neon.h (__ST4_LANE_FUNC): Delete.
15863 (__ST4Q_LANE_FUNC): Delete.
15864 (vst4_lane_f16): Use __builtin_memcpy to copy vector
15865 structure instead of constructing __builtin_aarch64_simd_xi
15866 one vector at a time.
15867 (vst4_lane_f32): Likewise.
15868 (vst4_lane_f64): Likewise.
15869 (vst4_lane_p8): Likewise.
15870 (vst4_lane_p16): Likewise.
15871 (vst4_lane_p64): Likewise.
15872 (vst4_lane_s8): Likewise.
15873 (vst4_lane_s16): Likewise.
15874 (vst4_lane_s32): Likewise.
15875 (vst4_lane_s64): Likewise.
15876 (vst4_lane_u8): Likewise.
15877 (vst4_lane_u16): Likewise.
15878 (vst4_lane_u32): Likewise.
15879 (vst4_lane_u64): Likewise.
15880 (vst4_lane_bf16): Likewise.
15881 (vst4q_lane_f16): Use __builtin_memcpy to copy vector
15882 structure instead of using a union.
15883 (vst4q_lane_f32): Likewise.
15884 (vst4q_lane_f64): Likewise.
15885 (vst4q_lane_p8): Likewise.
15886 (vst4q_lane_p16): Likewise.
15887 (vst4q_lane_p64): Likewise.
15888 (vst4q_lane_s8): Likewise.
15889 (vst4q_lane_s16): Likewise.
15890 (vst4q_lane_s32): Likewise.
15891 (vst4q_lane_s64): Likewise.
15892 (vst4q_lane_u8): Likewise.
15893 (vst4q_lane_u16): Likewise.
15894 (vst4q_lane_u32): Likewise.
15895 (vst4q_lane_u64): Likewise.
15896 (vst4q_lane_bf16): Likewise.
15898 2021-08-06 Martin Liska <mliska@suse.cz>
15900 * config/rs6000/rs6000.c (rs6000_option_override_internal): When
15901 a target option is restored, it can have
15902 rs6000_long_double_type_size set to FLOAT_PRECISION_TFmode
15903 and error should not be emitted.
15905 2021-08-06 Sebastian Huber <sebastian.huber@embedded-brains.de>
15907 * gcov-io.h (gcov_write): Declare.
15908 * gcov-io.c (gcov_write): New.
15909 (gcov_write_counter): Remove.
15910 (gcov_write_tag_length): Likewise.
15911 (gcov_write_summary): Replace gcov_write_tag_length() with calls to
15912 gcov_write_unsigned().
15913 * doc/invoke.texi (fprofile-info-section): Mention
15914 __gcov_info_to_gdca().
15916 2021-08-06 Martin Sebor <msebor@redhat.com>
15918 * dominance.c (prune_bbs_to_update_dominators): Adjust by-value vec
15919 arguments to by-reference.
15920 (iterate_fix_dominators): Same.
15921 * dominance.h (iterate_fix_dominators): Same.
15922 * ipa-prop.h: Call auto_vec::to_vec_legacy.
15923 * tree-data-ref.c (dump_data_dependence_relation): Adjust by-value vec
15924 arguments to by-reference.
15925 (debug_data_dependence_relation): Same.
15926 (dump_data_dependence_relations): Same.
15927 * tree-data-ref.h (debug_data_dependence_relation): Same.
15928 (dump_data_dependence_relations): Same.
15929 * tree-predcom.c (dump_chains): Same.
15930 (initialize_root_vars_lm): Same.
15931 (determine_unroll_factor): Same.
15932 (replace_phis_by_defined_names): Same.
15933 (insert_init_seqs): Same.
15934 (pcom_worker::tree_predictive_commoning_loop): Call
15935 auto_vec::to_vec_legacy.
15936 * tree-ssa-pre.c (insert_into_preds_of_block): Adjust by-value vec
15937 arguments to by-reference.
15938 * tree-ssa-threadbackward.c (populate_worklist): Same.
15939 (back_threader::resolve_def): Same.
15940 * tree-vect-data-refs.c (vect_check_nonzero_value): Same.
15941 (vect_enhance_data_refs_alignment): Same.
15942 (vect_check_lower_bound): Same.
15943 (vect_prune_runtime_alias_test_list): Same.
15944 (vect_permute_store_chain): Same.
15945 * tree-vect-slp-patterns.c (vect_normalize_conj_loc): Same.
15946 * tree-vect-stmts.c (vect_create_vectorized_demotion_stmts): Same.
15947 * tree-vectorizer.h (vect_permute_store_chain): Same.
15948 * vec.c (test_init): New function.
15949 (vec_c_tests): Call new function.
15950 * vec.h (vec): Declare ctors, dtor, and assignment.
15951 (auto_vec::vec_to_legacy): New function.
15952 (vec::copy): Adjust initialization.
15954 2021-08-05 H.J. Lu <hjl.tools@gmail.com>
15957 * config/i386/i386.c (ix86_can_inline_p): Ignore MASK_80387 if
15958 callee only uses GPRs.
15959 * config/i386/ia32intrin.h: Revert commit 5463cee2770.
15960 * config/i386/serializeintrin.h: Revert commit 71958f740f1.
15961 * config/i386/x86gprintrin.h: Add
15962 #pragma GCC target("general-regs-only") and #pragma GCC pop_options
15963 to disable non-GPR ISAs.
15965 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
15967 PR middle-end/101787
15968 * doc/md.texi (cond_ashl, cond_ashr, cond_lshr): Document.
15970 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
15972 * tree-vectorizer.h (vect_is_store_elt_extraction, vect_is_reduction)
15973 (vect_reduc_type, vect_embedded_comparison_type, vect_comparison_type)
15974 (vect_is_extending_load, vect_is_integer_truncation): New functions,
15975 moved from aarch64.c but given different names.
15976 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction)
15977 (aarch64_is_reduction, aarch64_reduc_type)
15978 (aarch64_embedded_comparison_type, aarch64_comparison_type)
15979 (aarch64_extending_load_p, aarch64_integer_truncation_p): Delete
15980 in favor of the above. Update callers accordingly.
15982 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15985 * config/arm/arm-cpus.in (generic-armv7-a): Add quirk to suppress
15986 writing .cpu directive in asm output.
15987 * config/arm/arm.c (arm_identify_fpu_from_isa): New variable.
15988 (arm_last_printed_arch_string): Delete.
15989 (arm_last-printed_fpu_string): Delete.
15990 (arm_configure_build_target): If use of floating-point/SIMD is
15991 disabled, remove all fp/simd related features from the target ISA.
15992 (last_arm_targ_options): New variable.
15993 (arm_print_asm_arch_directives): Add new parameters. Change order
15994 of emitted directives and handle all cases here.
15995 (arm_file_start): Always call arm_print_asm_arch_directives, move
15996 all generation of .arch/.arch_extension here.
15997 (arm_file_end): Call arm_print_asm_arch.
15998 (arm_declare_function_name): Call arm_print_asm_arch_directives
15999 instead of printing .arch/.fpu directives directly.
16001 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
16003 * config/arm/arm.c (arm_configure_build_target): Don't call
16004 arm_option_reconfigure_globals.
16005 (arm_option_restore): Call arm_option_reconfigure_globals after
16006 reconfiguring the target.
16007 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
16009 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
16011 * config/arm/arm.c (arm_configure_build_target): Ensure the target's
16012 arch_name is always set.
16014 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
16016 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
16017 of vec_select high-half from being added into Neon subtract
16020 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
16022 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
16023 of vec_select high-half from being added into Neon add cost.
16025 2021-08-05 Kewen Lin <linkw@linux.ibm.com>
16027 * cfgloop.h (loops_list::loops_list): Add one optional argument
16028 root and adjust accordingly, update loop tree walking and factor
16030 * cfgloop.c (loops_list::walk_loop_tree): ... this. New function.
16032 2021-08-05 Eric Botcazou <ebotcazou@gcc.gnu.org>
16034 PR tree-optimization/101626
16035 * tree-sra.c (propagate_subaccesses_from_rhs): Do not set the
16036 reverse scalar storage order on a pointer or vector component.
16038 2021-08-05 liuhongt <hongtao.liu@intel.com>
16040 * config/i386/sse.md (cond_<code><mode>): New expander.
16042 2021-08-05 liuhongt <hongtao.liu@intel.com>
16044 * config/i386/sse.md (cond_<code><mode>): New expander.
16046 2021-08-05 liuhongt <hongtao.liu@intel.com>
16048 * config/i386/sse.md (cond_<code><mode>): New expander.
16050 2021-08-04 David Malcolm <dmalcolm@redhat.com>
16053 * Makefile.in (ANALYZER_OBJS): Add analyzer/region-model-asm.o.
16055 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
16058 * config/i386/i386.h (STORE_MAX_PIECES): Allow 16/32/64 bytes
16059 only if TARGET_INTER_UNIT_MOVES_TO_VEC is true.
16061 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
16064 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
16065 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
16066 data with SSE register from one memory location to another.
16068 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16070 * config/s390/s390.c (expand_perm_with_vpdi): New function.
16071 (vectorize_vec_perm_const_1): Call expand_perm_with_vpdi.
16072 * config/s390/vector.md (*vpdi1<mode>, @vpdi1<mode>): Enable a
16073 parameterized expander.
16074 (*vpdi4<mode>, @vpdi4<mode>): Likewise.
16076 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16078 * config/s390/s390.c (MAX_VECT_LEN): Define macro.
16079 (struct expand_vec_perm_d): Define struct.
16080 (expand_perm_with_merge): New function.
16081 (vectorize_vec_perm_const_1): New function.
16082 (s390_vectorize_vec_perm_const): New function.
16083 (TARGET_VECTORIZE_VEC_PERM_CONST): Define target macro.
16085 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16087 * config/s390/vector.md (V_HW_64): Remove mode iterator.
16088 (*vec_load_pair<mode>): Use V_HW_2 instead of V_HW_64.
16089 * config/s390/vx-builtins.md
16090 (vec_scatter_element<V_HW_2:mode>_SI): Use V_HW_2 instead of
16093 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16095 * config/s390/s390.md (UNSPEC_VEC_PERMI): Remove constant
16097 * config/s390/vector.md (*vpdi1<mode>, *vpdi4<mode>): New pattern
16099 * config/s390/vx-builtins.md (*vec_permi<mode>): Emit generic rtx
16100 instead of an unspec.
16102 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16104 * config/s390/s390-modes.def: Add more vector modes to support
16105 concatenation of two vectors.
16106 * config/s390/s390-protos.h (s390_expand_merge_perm_const): Add
16108 (s390_expand_merge): Likewise.
16109 * config/s390/s390.c (s390_expand_merge_perm_const): New function.
16110 (s390_expand_merge): New function.
16111 * config/s390/s390.md (UNSPEC_VEC_MERGEH, UNSPEC_VEC_MERGEL):
16112 Remove constant definitions.
16113 * config/s390/vector.md (V_HW_2): Add mode iterators.
16114 (VI_HW_4, V_HW_4): Rename VI_HW_4 to V_HW_4.
16115 (vec_2x_nelts, vec_2x_wide): New mode attributes.
16116 (*vmrhb, *vmrlb, *vmrhh, *vmrlh, *vmrhf, *vmrlf, *vmrhg, *vmrlg):
16117 New pattern definitions.
16118 (vec_widen_umult_lo_<mode>, vec_widen_umult_hi_<mode>)
16119 (vec_widen_smult_lo_<mode>, vec_widen_smult_hi_<mode>)
16120 (vec_unpacks_lo_v4sf, vec_unpacks_hi_v4sf, vec_unpacks_lo_v2df)
16121 (vec_unpacks_hi_v2df): Adjust expanders to emit non-unspec RTX for
16123 * config/s390/vx-builtins.md (V_HW_4): Remove mode iterator. Now
16125 (vec_mergeh<mode>, vec_mergel<mode>): Use s390_expand_merge to
16126 emit vec merge pattern.
16128 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
16130 * config/aarch64/aarch64.c (aarch64_strip_extend_vec_half):
16132 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent cost of
16133 vec_select high-half from being added into Neon multiply
16135 * rtlanal.c (vec_series_highpart_p): Define.
16136 * rtlanal.h (vec_series_highpart_p): Declare.
16138 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
16140 * config/aarch64/aarch64.c (aarch64_strip_duplicate_vec_elt):
16142 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent
16143 vec_select cost from being added into Neon multiply cost.
16145 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16147 * tree-vect-loop.c (vect_better_loop_vinfo_p): Detect cases in
16148 which old_loop_vinfo is an epilogue loop that handles a constant
16149 number of iterations.
16151 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16153 * tree-vect-loop.c (vect_analyze_loop): Print a dump message
16154 when a reanalyzed loop fails to be cheaper than the current
16157 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16159 * config/aarch64/aarch64.c: Fix a typo.
16161 2021-08-04 Vincent Lefèvre <vincent-gcc@vinc17.net>
16163 PR gcov-profile/101773
16164 * gcov-io.c (gcov_close): Check return code of a fclose.
16166 2021-08-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
16169 * dwarf2out.c (dwarf2out_assembly_start): Emit a dummy
16170 .file statement when needed.
16172 2021-08-04 Richard Biener <rguenther@suse.de>
16174 * tree-vect-data-refs.c (vect_check_gather_scatter):
16175 Include widening conversions only when the result is
16176 still handed by native gather or the current offset
16177 size not already matches the data size.
16178 Also succeed analysis in case there's no native support,
16179 noted by a IFN_LAST ifn and a NULL decl.
16180 (vect_analyze_data_refs): Always consider gathers.
16181 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
16182 Test for no IFN gather rather than decl gather.
16183 * tree-vect-stmts.c (vect_model_load_cost): Pass in the
16184 gather-scatter info and cost emulated gathers accordingly.
16185 (vect_truncate_gather_scatter_offset): Properly test for
16187 (vect_use_strided_gather_scatters_p): Likewise.
16188 (get_load_store_type): Handle emulated gathers and its
16190 (vectorizable_load): Likewise. Emulate them by extracting
16191 scalar offsets, doing scalar loads and a vector construct.
16193 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
16196 * expr.c (op_by_pieces_d::op_by_pieces_d): Add a max_pieces
16197 argument to set m_max_size.
16198 (move_by_pieces_d): Pass MOVE_MAX_PIECES to op_by_pieces_d.
16199 (store_by_pieces_d): Pass STORE_MAX_PIECES to op_by_pieces_d.
16200 (compare_by_pieces_d): Pass COMPARE_MAX_PIECES to op_by_pieces_d.
16202 2021-08-04 Roger Sayle <roger@nextmovesoftware.com>
16203 Marc Glisse <marc.glisse@inria.fr>
16205 * match.pd (bit_ior, bit_xor): Canonicalize (X*C1)|(X*C2) and
16206 (X*C1)^(X*C2) as X*(C1+C2), and related variants, using
16207 tree_nonzero_bits to ensure that operands are bit-wise disjoint.
16209 2021-08-04 Richard Biener <rguenther@suse.de>
16211 * tree-ssa-forwprop.c (pass_forwprop::execute): Split
16212 out code to decompose vector loads ...
16213 (optimize_vector_load): ... here. Generalize it to
16214 handle intermediate widening and TARGET_MEM_REF loads
16215 and apply it to loads with a supported vector mode as well.
16217 2021-08-04 Richard Biener <rguenther@suse.de>
16219 PR tree-optimization/101756
16220 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Make sure
16221 the result of the reduction epilogue is compatible to the original
16224 2021-08-04 liuhongt <hongtao.liu@intel.com>
16227 * config/i386/i386.md (peephole2): Refine predicate from
16228 register_operand to general_reg_operand.
16230 2021-08-04 Aldy Hernandez <aldyh@redhat.com>
16232 * gimple-range-path.h (path_range_query::dump): Mark override.
16234 2021-08-04 Richard Biener <rguenther@suse.de>
16236 PR tree-optimization/101769
16237 * tree-tailcall.c (eliminate_tail_call): Add the created loop
16238 for the first recursion and return it via the new output parameter.
16239 (optimize_tail_call): Pass through new output param.
16240 (tree_optimize_tail_calls_1): After creating all latches,
16241 add the created loop to the loop tree. Do not mark loops for fixup.
16243 2021-08-04 Martin Liska <mliska@suse.cz>
16245 * doc/invoke.texi: Document threader-mode param.
16247 2021-08-04 liuhongt <hongtao.liu@intel.com>
16249 * config/i386/sse.md (cond_fma<mode>): New expander.
16250 (cond_fms<mode>): Ditto.
16251 (cond_fnma<mode>): Ditto.
16252 (cond_fnms<mode>): Ditto.
16254 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
16256 * config/rs6000/vsx.md (*vsx_le_perm_store_<mode>): Use && instead of &.
16258 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
16260 * config/rs6000/constraints.md: Remove "e" from the list of available
16261 constraint characters.
16263 2021-08-03 Eugene Rozenfeld <erozen@microsoft.com>
16265 PR gcov-profile/71672
16266 * auto-profile.c (afdo_indirect_call): Fix setup of the historgram value for indirect calls.
16268 2021-08-03 Paul A. Clarke <pc@us.ibm.com>
16270 * config/rs6000/smmintrin.h (_mm_minpos_epu16): New.
16272 2021-08-03 H.J. Lu <hjl.tools@gmail.com>
16274 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode,
16275 try XMM31 to avoid vzeroupper.
16277 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16279 * doc/invoke.texi: Document -mtune=neoverse-512tvb and
16280 -mcpu=neoverse-512tvb.
16281 * config/aarch64/aarch64-cores.def (neoverse-512tvb): New entry.
16282 * config/aarch64/aarch64-tune.md: Regenerate.
16283 * config/aarch64/aarch64.c (neoverse512tvb_sve_vector_cost)
16284 (neoverse512tvb_sve_issue_info, neoverse512tvb_vec_issue_info)
16285 (neoverse512tvb_vector_cost, neoverse512tvb_tunings): New structures.
16286 (aarch64_adjust_body_cost_sve): Handle -mtune=neoverse-512tvb.
16287 (aarch64_adjust_body_cost): Likewise.
16289 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16291 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Only
16292 record issue information for operations that occur in the
16295 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16297 * config/aarch64/aarch64.c (aarch64_multiply_add_p): Add a vec_flags
16298 parameter. Detect cases in which an Advanced SIMD MLA would almost
16299 certainly require a MOV.
16300 (aarch64_count_ops): Update accordingly.
16302 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16304 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction): New
16305 function, split out from...
16306 (aarch64_detect_vector_stmt_subtype): ...here.
16307 (aarch64_add_stmt_cost): Treat extracting element 0 as free.
16309 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16311 * config/aarch64/aarch64-protos.h (sve_vec_cost):
16312 Add gather_load_x32_cost and gather_load_x64_cost.
16313 * config/aarch64/aarch64.c (generic_sve_vector_cost)
16314 (a64fx_sve_vector_cost, neoversev1_sve_vector_cost): Update
16315 accordingly, using the values given by the scalar_load * number
16316 of elements calculation that we used previously.
16317 (aarch64_detect_vector_stmt_subtype): Use the new fields.
16319 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16321 * config/aarch64/aarch64.c (aarch64_adjust_body_cost_sve): New
16322 function, split out from...
16323 (aarch64_adjust_body_cost): ...here.
16325 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16327 * config/aarch64/fractional-cost.h: New file.
16328 * config/aarch64/aarch64.c: Include <algorithm> (indirectly)
16329 and cost_fraction.h.
16330 (vec_cost_fraction): New typedef.
16331 (aarch64_detect_scalar_stmt_subtype): Use it for statement costs.
16332 (aarch64_detect_vector_stmt_subtype): Likewise.
16333 (aarch64_sve_adjust_stmt_cost, aarch64_adjust_stmt_cost): Likewise.
16334 (aarch64_estimate_min_cycles_per_iter): Use vec_cost_fraction
16336 (aarch64_adjust_body_cost): Likewise.
16337 (aarch64_test_cost_fraction): New function.
16338 (aarch64_run_selftests): Call it.
16340 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16342 * config/aarch64/aarch64-protos.h (tune_params::sve_width): Turn
16344 * config/aarch64/aarch64.c (aarch64_cmp_autovec_modes): Update
16346 (aarch64_estimated_poly_value): Likewise. Use the least significant
16347 set bit for the minimum and likely values. Use the most significant
16348 set bit for the maximum value.
16350 2021-08-03 liuhongt <hongtao.liu@intel.com>
16352 * config/i386/sse.md (cond_<insn><mode>): New expander.
16353 (cond_mul<mode>): Ditto.
16355 2021-08-03 Kewen Lin <linkw@linux.ibm.com>
16357 * tree-cfg.c (move_sese_region_to_fn): Fix typos on dloop.
16359 2021-08-03 liuhongt <hongtao.liu@intel.com>
16361 * config/i386/sse.md (cond_<insn><mode>):New expander.
16362 (cond_mul<mode>): Ditto.
16363 (cond_div<mode>): Ditto.
16365 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16367 * config/i386/i386.c (ix86_finalize_stack_frame_flags): Also
16368 check stack_realign_needed for stack realignment.
16369 (ix86_legitimate_constant_p): Always allow CONST_WIDE_INT smaller
16370 than the largest integer supported by vector register.
16371 * config/i386/i386.h (MAX_MOVE_MAX): New. Set to 64.
16372 (MOVE_MAX): Set to bytes of the largest integer supported by
16374 (STORE_MAX_PIECES): New.
16376 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16378 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
16379 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
16380 data from one memory location to another.
16382 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16384 PR middle-end/90773
16385 * config/i386/i386.c (TARGET_GEN_MEMSET_SCRATCH_RTX): New.
16387 2021-08-02 Aldy Hernandez <aldyh@redhat.com>
16389 PR tree-optimization/101724
16390 * params.opt: Remove --param=threader-iterative.
16391 * tree-ssa-threadbackward.c (pass_thread_jumps::execute): Remove
16394 2021-08-02 Tom de Vries <tdevries@suse.de>
16396 PR middle-end/101665
16397 * doc/extend.texi (nonnull attribute): Improve documentation.
16399 2021-08-02 Andrew Pinski <apinski@marvell.com>
16401 PR rtl-optimization/101683
16402 * rtlanal.c (may_trap_p_1): Handle UNSIGNED_FIX.
16404 2021-08-02 Roger Sayle <roger@nextmovesoftware.com>
16406 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern):
16407 Renamed from cond_removal_in_popcount_clz_ctz_pattern.
16408 Add support for BSWAP, FFS, PARITY and CLRSB builtins.
16409 (tree_ssa_phiop_worker): Update call to function above.
16411 2021-08-01 H.J. Lu <hjl.tools@gmail.com>
16414 * config/i386/i386.md (bsr_rex64_1_zext): New.
16415 (combine splitter for constant - clzll): Replace gen_bsr_rex64_1
16416 with gen_bsr_rex64_1_zext.
16418 2021-07-31 Jakub Jelinek <jakub@redhat.com>
16421 * config/i386/i386.md (bsr_rex64_1, bsr_1, bsr_zext_1): New
16422 define_insn patterns.
16423 (*bsr_rex64_2, *bsr_2): New define_insn_and_split patterns.
16424 Add combine splitters for constant - clz.
16425 (clz<mode>2): Use a temporary pseudo for bsr result.
16427 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16429 * config/rs6000/smmintrin.h (_mm_floor_pd, _mm_floor_ps,
16430 _mm_floor_sd, _mm_floor_ss): New.
16432 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16434 * config/rs6000/smmintrin.h (_mm_ceil_pd, _mm_ceil_ps,
16435 _mm_ceil_sd, _mm_ceil_ss): New.
16437 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16439 * config/rs6000/smmintrin.h (_mm_blend_pd, _mm_blendv_pd,
16440 _mm_blend_ps, _mm_blendv_ps): New.
16442 2021-07-30 Roger Sayle <roger@nextmovesoftware.com>
16443 Uroš Bizjak <ubizjak@gmail.com>
16445 * config/i386/i386.md (*dec_cmov<mode>): New define_insn_and_split
16446 to generate a conditional move using the carry flag after sub $1.
16447 (peephole2): Eliminate a register-to-register move by inverting
16448 the condition of a conditional move.
16450 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
16452 * config/mmix/mmix.md ("call", "call_value", "*call_real")
16453 ("*call_value_real"): Don't generate rtx mentioning the generic
16454 operands 1 and 2 to "call", and similarly for "call_value".
16455 * config/mmix/mmix.c (mmix_print_operand_punct_valid_p)
16456 (mmix_print_operand): Use '!' instead of 'p'.
16458 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
16460 * doc/md.texi (call): Correct information about operand 2.
16461 * config/mmix/mmix.md ("call", "call_value"): Remove fixed FIXMEs.
16463 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16465 * range-op.cc (operator_trunc_mod::wi_fold): Fold constants.
16467 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16469 * range-op.cc (operator_div::wi_fold): Return UNDEFINED for [0, 0] divisor.
16471 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16473 * gimple-range-cache.cc (*::set_bb_range): Change const basic_block to
16474 const_basic_block..
16475 (*::get_bb_range): Ditto.
16476 (*::bb_range_p): Ditto.
16477 * gimple-range-cache.h: Change prototypes.
16479 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
16481 PR middle-end/90773
16482 * builtins.c (builtin_memcpy_read_str): Change the mode argument
16483 from scalar_int_mode to fixed_size_mode.
16484 (builtin_strncpy_read_str): Likewise.
16485 (gen_memset_value_from_prev): New function.
16486 (builtin_memset_read_str): Change the mode argument from
16487 scalar_int_mode to fixed_size_mode. Use gen_memset_value_from_prev
16488 and support CONST_VECTOR.
16489 (builtin_memset_gen_str): Likewise.
16490 (try_store_by_multiple_pieces): Use by_pieces_constfn to declare
16492 * builtins.h (builtin_strncpy_read_str): Replace scalar_int_mode
16493 with fixed_size_mode.
16494 (builtin_memset_read_str): Likewise.
16495 * expr.c (widest_int_mode_for_size): Renamed to ...
16496 (widest_fixed_size_mode_for_size): Add a bool argument to
16497 indicate if QI vector mode can be used.
16498 (by_pieces_ninsns): Call widest_fixed_size_mode_for_size
16499 instead of widest_int_mode_for_size.
16500 (pieces_addr::adjust): Change the mode argument from
16501 scalar_int_mode to fixed_size_mode.
16502 (op_by_pieces_d): Make m_len read-only. Add a bool member,
16503 m_qi_vector_mode, to indicate that QI vector mode can be used.
16504 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
16505 initialize m_qi_vector_mode. Call widest_fixed_size_mode_for_size
16506 instead of widest_int_mode_for_size.
16507 (op_by_pieces_d::get_usable_mode): Change the mode argument from
16508 scalar_int_mode to fixed_size_mode. Call
16509 widest_fixed_size_mode_for_size instead of
16510 widest_int_mode_for_size.
16511 (op_by_pieces_d::smallest_fixed_size_mode_for_size): New member
16512 function to return the smallest integer or QI vector mode.
16513 (op_by_pieces_d::run): Call widest_fixed_size_mode_for_size
16514 instead of widest_int_mode_for_size. Call
16515 smallest_fixed_size_mode_for_size instead of
16516 smallest_int_mode_for_size.
16517 (store_by_pieces_d::store_by_pieces_d): Add a bool argument to
16518 indicate that QI vector mode can be used and pass it to
16519 op_by_pieces_d::op_by_pieces_d.
16520 (can_store_by_pieces): Call widest_fixed_size_mode_for_size
16521 instead of widest_int_mode_for_size. Pass memsetp to
16522 widest_fixed_size_mode_for_size to support QI vector mode.
16523 Allow all CONST_VECTORs for memset if vec_duplicate is supported.
16524 (store_by_pieces): Pass memsetp to
16525 store_by_pieces_d::store_by_pieces_d.
16526 (clear_by_pieces_1): Removed.
16527 (clear_by_pieces): Replace clear_by_pieces_1 with
16528 builtin_memset_read_str and pass true to store_by_pieces_d to
16529 support vector mode broadcast.
16530 (string_cst_read_str): Change the mode argument from
16531 scalar_int_mode to fixed_size_mode.
16532 * expr.h (by_pieces_constfn): Change scalar_int_mode to
16534 (by_pieces_prev): Likewise.
16535 * rtl.h (lowpart_subreg_regno): New.
16536 * rtlanal.c (lowpart_subreg_regno): New. A wrapper around
16537 simplify_subreg_regno.
16538 * target.def (gen_memset_scratch_rtx): New hook.
16539 * doc/tm.texi.in: Add TARGET_GEN_MEMSET_SCRATCH_RTX.
16540 * doc/tm.texi: Regenerated.
16542 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16545 * config/mips/mips.c (mips_atomic_assign_expand_fenv): Use
16546 TARGET_EXPR instead of MODIFY_EXPR.
16548 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16551 * config/mips/mips-protos.h (mips_expand_vec_cmp_expr): Declare.
16552 * config/mips/mips.c (mips_expand_vec_cmp_expr): New function.
16553 * config/mips/mips-msa.md (vec_cmp<MSA:mode><mode_i>): New
16555 (vec_cmpu<IMSA:mode><mode_i>): New expander.
16557 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
16560 * config/i386/i386-options.c (ix86_option_override_internal):
16561 Don't enable LZCNT/POPCNT if they have been disabled explicitly.
16563 2021-07-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
16566 * config/arm/arm_neon.h (vld1_p64): Replace call to builtin by
16567 explicitly dereferencing __a.
16568 (vld1_s64): Likewise.
16569 (vld1_u64): Likewise.
16570 * config/arm/arm_neon_builtins.def (vld1): Remove entry for di
16571 and change to VAR13.
16573 2021-07-30 Aldy Hernandez <aldyh@redhat.com>
16575 * gimple-loop-versioning.cc (lv_dom_walker::lv_dom_walker): Remove
16576 use of m_range_analyzer.
16577 (loop_versioning::lv_dom_walker::before_dom_children): Same.
16578 (loop_versioning::lv_dom_walker::after_dom_children): Remove.
16579 (loop_versioning::prune_loop_conditions): Replace vr_values use
16580 with range_query interface.
16581 (pass_loop_versioning::execute): Use ranger.
16583 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16586 * ipa-devirt.c (ipa_odr_read_section): Compare the precision of
16587 enum values, and emit a warning if they mismatch.
16589 2021-07-30 Kewen Lin <linkw@linux.ibm.com>
16591 * cfgloop.h (as_const): New function.
16592 (class loop_iterator): Rename to ...
16593 (class loops_list): ... this.
16594 (loop_iterator::next): Rename to ...
16595 (loops_list::Iter::fill_curr_loop): ... this and adjust.
16596 (loop_iterator::loop_iterator): Rename to ...
16597 (loops_list::loops_list): ... this and adjust.
16598 (loops_list::Iter): New class.
16599 (loops_list::iterator): New type.
16600 (loops_list::const_iterator): New type.
16601 (loops_list::begin): New function.
16602 (loops_list::end): Likewise.
16603 (loops_list::begin const): Likewise.
16604 (loops_list::end const): Likewise.
16605 (FOR_EACH_LOOP): Remove.
16606 (FOR_EACH_LOOP_FN): Remove.
16607 * cfgloop.c (flow_loops_dump): Adjust FOR_EACH_LOOP* with range-based
16608 for loop with loops_list instance.
16609 (sort_sibling_loops): Likewise.
16610 (disambiguate_loops_with_multiple_latches): Likewise.
16611 (verify_loop_structure): Likewise.
16612 * cfgloopmanip.c (create_preheaders): Likewise.
16613 (force_single_succ_latches): Likewise.
16614 * config/aarch64/falkor-tag-collision-avoidance.c
16615 (execute_tag_collision_avoidance): Likewise.
16616 * config/mn10300/mn10300.c (mn10300_scan_for_setlb_lcc): Likewise.
16617 * config/s390/s390.c (s390_adjust_loops): Likewise.
16618 * doc/loop.texi: Likewise.
16619 * gimple-loop-interchange.cc (pass_linterchange::execute): Likewise.
16620 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
16621 * gimple-loop-versioning.cc (loop_versioning::analyze_blocks): Likewise.
16622 (loop_versioning::make_versioning_decisions): Likewise.
16623 * gimple-ssa-split-paths.c (split_paths): Likewise.
16624 * graphite-isl-ast-to-gimple.c (graphite_regenerate_ast_isl): Likewise.
16625 * graphite.c (canonicalize_loop_form): Likewise.
16626 (graphite_transform_loops): Likewise.
16627 * ipa-fnsummary.c (analyze_function_body): Likewise.
16628 * ipa-pure-const.c (analyze_function): Likewise.
16629 * loop-doloop.c (doloop_optimize_loops): Likewise.
16630 * loop-init.c (loop_optimizer_finalize): Likewise.
16631 (fix_loop_structure): Likewise.
16632 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
16633 (move_loop_invariants): Likewise.
16634 * loop-unroll.c (decide_unrolling): Likewise.
16635 (unroll_loops): Likewise.
16636 * modulo-sched.c (sms_schedule): Likewise.
16637 * predict.c (predict_loops): Likewise.
16638 (pass_profile::execute): Likewise.
16639 * profile.c (branch_prob): Likewise.
16640 * sel-sched-ir.c (sel_finish_pipelining): Likewise.
16641 (sel_find_rgns): Likewise.
16642 * tree-cfg.c (replace_loop_annotate): Likewise.
16643 (replace_uses_by): Likewise.
16644 (move_sese_region_to_fn): Likewise.
16645 * tree-if-conv.c (pass_if_conversion::execute): Likewise.
16646 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
16647 * tree-parloops.c (parallelize_loops): Likewise.
16648 * tree-predcom.c (tree_predictive_commoning): Likewise.
16649 * tree-scalar-evolution.c (scev_initialize): Likewise.
16650 (scev_reset): Likewise.
16651 * tree-ssa-dce.c (find_obviously_necessary_stmts): Likewise.
16652 * tree-ssa-live.c (remove_unused_locals): Likewise.
16653 * tree-ssa-loop-ch.c (ch_base::copy_headers): Likewise.
16654 * tree-ssa-loop-im.c (analyze_memory_references): Likewise.
16655 (tree_ssa_lim_initialize): Likewise.
16656 * tree-ssa-loop-ivcanon.c (canonicalize_induction_variables): Likewise.
16657 * tree-ssa-loop-ivopts.c (tree_ssa_iv_optimize): Likewise.
16658 * tree-ssa-loop-manip.c (get_loops_exits): Likewise.
16659 * tree-ssa-loop-niter.c (estimate_numbers_of_iterations): Likewise.
16660 (free_numbers_of_iterations_estimates): Likewise.
16661 * tree-ssa-loop-prefetch.c (tree_ssa_prefetch_arrays): Likewise.
16662 * tree-ssa-loop-split.c (tree_ssa_split_loops): Likewise.
16663 * tree-ssa-loop-unswitch.c (tree_ssa_unswitch_loops): Likewise.
16664 * tree-ssa-loop.c (gate_oacc_kernels): Likewise.
16665 (pass_scev_cprop::execute): Likewise.
16666 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
16667 * tree-ssa-sccvn.c (do_rpo_vn): Likewise.
16668 * tree-ssa-threadupdate.c
16669 (jump_thread_path_registry::thread_through_all_blocks): Likewise.
16670 * tree-vectorizer.c (vectorize_loops): Likewise.
16671 * tree-vrp.c (vrp_asserts::find_assert_locations): Likewise.
16673 2021-07-29 Hans-Peter Nilsson <hp@bitrange.com>
16675 * config/mmix/mmix.c (mmix_function_arg_1): Avoid
16676 generating a VOIDmode register for e.g the
16677 function_arg_info::end_marker.
16679 2021-07-29 Jeff Law <jeffreyalaw@gmail.com>
16681 * config/h8300/h8300-modes.def: Add CCZ, CCV and CCC, drop CCZNV.
16682 * config/h8300/h8300.md (H8cc mode iterator): Add CCZ.
16683 (cc mode_attr): Similarly.
16684 (ccz subst_attr): Similarly.
16685 * config/h8300/jumpcall.md: Add new patterns for branch-on-bit.
16686 * config/h8300/testcompare.md: Remove various cc0 based patterns
16687 that had been commented out. Add pattern to set CCZ from a bit
16690 2021-07-29 Thomas Schwinge <thomas@codesourcery.com>
16691 Julian Brown <julian@codesourcery.com>
16692 Kwok Cheung Yeung <kcy@codesourcery.com>
16694 * omp-offload.c (oacc_loop_xform_head_tail, oacc_loop_process):
16695 'update_stmt' after modification.
16696 (pass_oacc_loop_designation): New function, extracted out of...
16697 (pass_oacc_device_lower): ... this.
16698 (pass_data_oacc_loop_designation, pass_oacc_loop_designation)
16699 (make_pass_oacc_loop_designation): New
16700 * passes.def: Add it.
16701 * tree-parloops.c (create_parallel_loop): Adjust.
16702 * tree-pass.h (make_pass_oacc_loop_designation): New.
16704 2021-07-29 Aldy Hernandez <aldyh@redhat.com>
16706 * flag-types.h (enum threader_mode): New.
16707 * params.opt: Add entry for --param=threader-mode.
16708 * tree-ssa-threadbackward.c (THREADER_ITERATIVE_MODE): New.
16709 (class back_threader): New.
16710 (back_threader::back_threader): New.
16711 (back_threader::~back_threader): New.
16712 (back_threader::maybe_register_path): New.
16713 (back_threader::find_taken_edge): New.
16714 (back_threader::find_taken_edge_switch): New.
16715 (back_threader::find_taken_edge_cond): New.
16716 (back_threader::resolve_def): New.
16717 (back_threader::resolve_phi): New.
16718 (back_threader::find_paths_to_names): New.
16719 (back_threader::find_paths): New.
16722 (thread_jumps::find_jump_threads_backwards): Call ranger threader.
16723 (thread_jumps::find_jump_threads_backwards_with_ranger): New.
16724 (pass_thread_jumps::execute): Abstract out code...
16725 (try_thread_blocks): ...here.
16726 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
16727 Abstract out threading candidate code to...
16728 (single_succ_to_potentially_threadable_block): ...here.
16729 * tree-ssa-threadedge.h (single_succ_to_potentially_threadable_block):
16731 * tree-ssa-threadupdate.c (register_jump_thread): Return boolean.
16732 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
16733 Return bool from register_jump_thread.
16735 2021-07-29 Andreas Krebbel <krebbel@linux.ibm.com>
16737 * target.def: in0 and in1 do not need to be registers.
16738 * doc/tm.texi: Regenerate.
16740 2021-07-29 liuhongt <hongtao.liu@intel.com>
16743 * config/i386/i386.c (ix86_widen_mult_cost): New function.
16744 (ix86_add_stmt_cost): Use ix86_widen_mult_cost for
16747 2021-07-29 Jiufu Guo <guojiufu@linux.ibm.com>
16750 * config/rs6000/rs6000.c (TARGET_PREFERRED_DOLOOP_MODE): New hook.
16751 (rs6000_preferred_doloop_mode): New hook.
16752 * doc/tm.texi: Regenerate.
16753 * doc/tm.texi.in: Add hook preferred_doloop_mode.
16754 * target.def (preferred_doloop_mode): New hook.
16755 * targhooks.c (default_preferred_doloop_mode): New hook.
16756 * targhooks.h (default_preferred_doloop_mode): New hook.
16757 * tree-ssa-loop-ivopts.c (compute_doloop_base_on_mode): New function.
16758 (add_iv_candidate_for_doloop): Call targetm.preferred_doloop_mode
16759 and compute_doloop_base_on_mode.
16761 2021-07-28 Martin Sebor <msebor@redhat.com>
16763 PR middle-end/101494
16764 * tree-ssa-uninit.c (maybe_warn_operand): Correct object offset
16765 and size computation.
16767 2021-07-28 Martin Sebor <msebor@redhat.com>
16769 PR middle-end/101601
16770 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Remove
16772 Handle pointers to functions.
16774 2021-07-28 Martin Sebor <msebor@redhat.com>
16776 * Makefile.in (OBJS): Add gimple-ssa-warn-access.o and pointer-query.o.
16777 * attribs.h (fndecl_dealloc_argno): Move fndecl_dealloc_argno to tree.h.
16778 * builtins.c (compute_objsize_r): Move to pointer-query.cc.
16779 (access_ref::access_ref): Same.
16780 (access_ref::phi): Same.
16781 (access_ref::get_ref): Same.
16782 (access_ref::size_remaining): Same.
16783 (access_ref::offset_in_range): Same.
16784 (access_ref::add_offset): Same.
16785 (access_ref::inform_access): Same.
16786 (ssa_name_limit_t::visit_phi): Same.
16787 (ssa_name_limit_t::leave_phi): Same.
16788 (ssa_name_limit_t::next): Same.
16789 (ssa_name_limit_t::next_phi): Same.
16790 (ssa_name_limit_t::~ssa_name_limit_t): Same.
16791 (pointer_query::pointer_query): Same.
16792 (pointer_query::get_ref): Same.
16793 (pointer_query::put_ref): Same.
16794 (pointer_query::flush_cache): Same.
16795 (warn_string_no_nul): Move to gimple-ssa-warn-access.cc.
16796 (check_nul_terminated_array): Same.
16797 (unterminated_array): Same.
16798 (maybe_warn_for_bound): Same.
16799 (check_read_access): Same.
16800 (warn_for_access): Same.
16801 (get_size_range): Same.
16802 (check_access): Same.
16803 (gimple_call_alloc_size): Move to tree.c.
16804 (gimple_parm_array_size): Move to pointer-query.cc.
16805 (get_offset_range): Same.
16806 (gimple_call_return_array): Same.
16807 (handle_min_max_size): Same.
16808 (handle_array_ref): Same.
16809 (handle_mem_ref): Same.
16810 (compute_objsize): Same.
16811 (gimple_call_alloc_p): Move to gimple-ssa-warn-access.cc.
16812 (call_dealloc_argno): Same.
16813 (fndecl_dealloc_argno): Same.
16814 (new_delete_mismatch_p): Same.
16815 (matching_alloc_calls_p): Same.
16816 (warn_dealloc_offset): Same.
16817 (maybe_emit_free_warning): Same.
16818 * builtins.h (check_nul_terminated_array): Move to
16819 gimple-ssa-warn-access.h.
16820 (check_nul_terminated_array): Same.
16821 (warn_string_no_nul): Same.
16822 (unterminated_array): Same.
16823 (class ssa_name_limit_t): Same.
16824 (class pointer_query): Same.
16825 (struct access_ref): Same.
16826 (class range_query): Same.
16827 (struct access_data): Same.
16828 (gimple_call_alloc_size): Same.
16829 (gimple_parm_array_size): Same.
16830 (compute_objsize): Same.
16831 (class access_data): Same.
16832 (maybe_emit_free_warning): Same.
16833 * calls.c (initialize_argument_information): Remove call to
16834 maybe_emit_free_warning.
16835 * gimple-array-bounds.cc: Include new header..
16836 * gimple-fold.c: Same.
16837 * gimple-ssa-sprintf.c: Same.
16838 * gimple-ssa-warn-restrict.c: Same.
16839 * passes.def: Add pass_warn_access.
16840 * tree-pass.h (make_pass_warn_access): Declare.
16841 * tree-ssa-strlen.c: Include new headers.
16842 * tree.c (fndecl_dealloc_argno): Move here from builtins.c.
16843 * tree.h (fndecl_dealloc_argno): Move here from attribs.h.
16844 * gimple-ssa-warn-access.cc: New file.
16845 * gimple-ssa-warn-access.h: New file.
16846 * pointer-query.cc: New file.
16847 * pointer-query.h: New file.
16849 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16851 PR middle-end/101624
16852 * ubsan.c (maybe_instrument_pointer_overflow,
16853 instrument_object_size): Only test DECL_REGISTER on VAR_DECLs,
16854 PARM_DECLs or RESULT_DECLs.
16855 * sanopt.c (maybe_optimize_ubsan_ptr_ifn): Likewise.
16857 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16859 PR middle-end/101642
16860 * match.pd (bswap16 (x) == bswap16 (y)): Cast both operands
16861 to type of bswap16 for comparison.
16862 (bswap16 (x) == cst): Cast bswap16 operand to type of cst.
16864 2021-07-28 Richard Biener <rguenther@suse.de>
16866 PR tree-optimization/101615
16867 * tree-vect-slp.c (vect_optimize_slp): Materialize permutes
16868 at CTOR SLP graph entries.
16870 2021-07-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
16872 * config/aarch64/aarch64.md (*extend<SHORT:mode><GPI:mode>2_aarch64):
16873 Add "r,w" alternative.
16875 2021-07-28 H.J. Lu <hjl.tools@gmail.com>
16878 * config/i386/i386.c (ix86_avx_u128_mode_needed): Don't set
16879 AVX_U128_DIRTY when all bits are zero.
16881 2021-07-28 Richard Biener <rguenther@suse.de>
16883 PR tree-optimization/101615
16884 * tree-vect-slp.c (vect_optimize_slp): Pre-existing vector
16885 external nodes cannot be permuted so make them perm_out 0.
16887 2021-07-28 Andrew Stubbs <ams@codesourcery.com>
16890 * config.in: Regenerate.
16891 * config/gcn/gcn-hsa.h (A_FIJI): New define.
16892 (A_900): New define.
16893 (A_906): New define.
16894 (A_908): New define.
16895 (ASM_SPEC): Use A_FIJI, A_900, A_906 and A_908.
16896 * config/gcn/gcn.c (output_file_start): Adjust attributes according
16897 to the assembler capabilities.
16898 * config/gcn/mkoffload.c (main): Likewise.
16899 * configure: Regenerate.
16900 * configure.ac: Add tests for LLVM assembler attribute features.
16902 2021-07-28 Andrew MacLeod <amacleod@redhat.com>
16904 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Check for
16905 cond_false and cond_true on branches.
16907 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
16909 * config/aarch64/aarch64.c (aarch64_gen_adjusted_ldpstp): use
16912 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
16914 * alias.c (init_alias_analysis): Don't skip prologue/epilogue.
16916 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16919 * config/i386/sse.md (vashr<mode>3): Split into vashrv8di3 expander
16920 and vashrv4di3 expander, where the latter requires just TARGET_AVX2
16921 and has special !TARGET_AVX512VL expansion.
16922 (vashrv2di3<mask_name>): Rename to ...
16923 (vashrv2di3): ... this. Change condition to TARGET_XOP || TARGET_AVX2
16924 and add special !TARGET_XOP && !TARGET_AVX512VL expansion.
16926 2021-07-28 Martin Uecker <muecker@gwdg.de>
16928 * calls.c (maybe_warn_rdwr_sizes): Correct argument
16929 numbers in warning that were switched.
16931 2021-07-28 Kewen Lin <linkw@linux.ibm.com>
16933 PR tree-optimization/101596
16934 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Fix wrong check
16935 by using new_type's precision instead.
16937 2021-07-28 liuhongt <hongtao.liu@intel.com>
16940 * config/i386/i386.h (processor_costs): Add new member
16942 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
16943 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
16944 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
16945 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
16946 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
16947 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
16948 generic_cost, core_cost): Initialize integer_to_sse same value
16950 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
16951 * config/i386/i386.c (ix86_builtin_vectorization_cost):
16952 Use integer_to_sse instead of sse_op to calculate the cost of
16955 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16957 * config/rs6000/rs6000-gen-builtins.c (write_ovld_static_init): New
16959 (write_init_file): Call write_ovld_static_init.
16961 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16963 * config/rs6000/rs6000-gen-builtins.c (write_bif_static_init): New
16965 (write_init_file): Call write_bif_static_init.
16967 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16969 * config/rs6000/rs6000-gen-builtins.c (typemap): New struct.
16970 (TYPE_MAP_SIZE): New macro.
16971 (type_map): New initialized variable.
16972 (typemap_cmp): New function.
16973 (write_type_node): Likewise.
16974 (write_fntype_init): Implement.
16976 2021-07-27 Martin Sebor <msebor@redhat.com>
16978 PR tree-optimization/101584
16979 * tree-ssa-uninit.c (builtin_call_nomodifying_p): New function.
16980 (check_defs): Call it.
16982 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
16984 * tree-ssa-dom.c (dom_jump_threader_simplifier):
16985 Put avail_exprs_stack in the class, instead of passing it to
16986 jump_threader_simplifier.
16987 (dom_jump_threader_simplifier::simplify): Add state argument.
16988 (dom_opt_dom_walker): Add state.
16989 (pass_dominator::execute): Pass state to threader.
16990 (dom_opt_dom_walker::before_dom_children): Use state.
16991 * tree-ssa-threadedge.c (jump_threader::jump_threader): Replace
16992 arguments by state.
16993 (jump_threader::record_temporary_equivalences_from_phis):
16994 Register equivalences through the state variable.
16995 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
16996 Record ranges in a statement through the state variable.
16997 (jump_threader::simplify_control_stmt_condition): Pass state to
16999 (jump_threader::simplify_control_stmt_condition_1): Same.
17000 (jump_threader::thread_around_empty_blocks): Remove obsolete
17002 (jump_threader::thread_through_normal_block): Record equivalences
17003 on edge through the state variable.
17004 (jump_threader::thread_across_edge): Abstract state pushing.
17005 (jt_state::jt_state): New.
17006 (jt_state::push): New.
17007 (jt_state::pop): New.
17008 (jt_state::register_equiv): New.
17009 (jt_state::record_ranges_from_stmt): New.
17010 (jt_state::register_equivs_on_edge): New.
17011 (jump_threader_simplifier::jump_threader_simplifier): Move from
17013 (jump_threader_simplifier::simplify): Add state argument.
17014 * tree-ssa-threadedge.h (class jt_state): New.
17015 (class jump_threader): Add state to constructor.
17016 (class jump_threader_simplifier): Add state to simplify. Remove
17017 avail_exprs_stack from class.
17018 * tree-vrp.c (vrp_jump_threader_simplifier::simplify): Add state
17020 (vrp_jump_threader::vrp_jump_threader): Add state.
17021 (vrp_jump_threader::~vrp_jump_threader): Cleanup state.
17023 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
17025 * Makefile.in (OBJS): Add gimple-range-path.o.
17026 * gimple-range-path.cc: New file.
17027 * gimple-range-path.h: New file.
17029 2021-07-27 Jonathan Wright <jonathan.wright@arm.com>
17031 * config/aarch64/aarch64-simd.md: Push sign/zero-extension
17032 inside vec_duplicate for all patterns.
17033 * simplify-rtx.c (simplify_context::simplify_unary_operation_1):
17034 Push sign/zero-extension inside vec_duplicate.
17036 2021-07-27 Richard Biener <rguenther@suse.de>
17038 PR tree-optimization/101573
17039 * tree-ssa-uninit.c (warn_uninit_phi_uses): New function
17040 looking at uninitialized PHI arg defs in some constrained cases.
17041 (warn_uninitialized_vars): Call it.
17042 (execute_early_warn_uninitialized): Calculate dominators.
17044 2021-07-27 Richard Biener <rguenther@suse.de>
17046 PR tree-optimization/39821
17047 * tree-vect-stmts.c (vect_model_promotion_demotion_cost): Use
17048 vector_stmt for widening arithmetic.
17049 (vectorizable_conversion): Adjust.
17051 2021-07-27 Martin Jambor <mjambor@suse.cz>
17053 * cgraph.h (ipa_replace_map): New field force_load_ref.
17054 * ipa-prop.h (ipa_param_descriptor): Reduce precision of move_cost,
17055 aded new flag load_dereferenced, adjusted comments.
17056 (ipa_get_param_dereferenced): New function.
17057 (ipa_set_param_dereferenced): Likewise.
17058 * cgraphclones.c (cgraph_node::create_virtual_clone): Follow it.
17059 * ipa-cp.c: Include gimple.h.
17060 (ipcp_discover_new_direct_edges): Take into account dereferenced flag.
17061 (get_replacement_map): New parameter force_load_ref, set the
17062 appropriate flag in ipa_replace_map if set.
17063 (struct symbol_and_index_together): New type.
17064 (adjust_refs_in_act_callers): New function.
17065 (adjust_references_in_caller): Likewise.
17066 (create_specialized_node): When appropriate, call
17067 adjust_references_in_caller and force only load references.
17068 * ipa-prop.c (load_from_dereferenced_name): New function.
17069 (ipa_analyze_controlled_uses): Also detect loads from a
17070 dereference, harden testing of call statements.
17071 (ipa_write_node_info): Stream the dereferenced flag.
17072 (ipa_read_node_info): Likewise.
17073 (ipa_set_jf_constant): Also create refdesc when jump function
17074 references a variable.
17075 (cgraph_node_for_jfunc): Rename to symtab_node_for_jfunc, work
17076 also on references of variables and return a symtab_node. Adjust
17078 (propagate_controlled_uses): Also remove references to VAR_DECLs.
17080 2021-07-27 Jakub Jelinek <jakub@redhat.com>
17082 PR middle-end/101586
17083 * gimple-fold.c (clear_padding_type): Ignore FIELD_DECLs with byte
17084 positions above or equal to sz except for diagnostics of flexible
17087 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
17089 PR tree-optimization/78888
17090 * gimple-range-fold.cc (get_letter_range): New.
17091 (fold_using_range::range_of_builtin_call): Call get_letter_range.
17093 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
17095 PR tree-optimization/78888
17096 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Add cases
17097 for CFN_BUILT_IN_TOUPPER and CFN_BUILT_IN_TOLOWER.
17099 2021-07-26 Roger Sayle <roger@nextmovesoftware.com>
17100 Marc Glisse <marc.glisse@inria.fr>
17102 * match.pd (rotate): Simplify equality/inequality of rotations.
17103 (bswap): Simplify equality/inequality tests of byte swapping.
17105 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17107 * range-op.cc (operator_bitwise_xor::op1_op2_relation_effect):
17110 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17112 * range-op.cc (operator_lshift::fold_range): Pass rel to
17113 base class fold_range.
17114 (operator_rshift::fold_range): Same.
17116 2021-07-26 Ashimida <ashimida@linux.alibaba.com>
17119 * toplev.h (min_align_loops_log): Remove declaration.
17120 (min_align_jumps_log, min_align_labels_log): Likewise.
17121 (min_align_functions_log): Likewise.
17123 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17125 * tree-vrp.c (vrp_simplify_cond_using_ranges): Rename vr_values
17127 (execute_vrp): Abstract out simplification of conditionals...
17128 (simplify_casted_conds): ...here.
17130 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17132 * gimple-array-bounds.cc (array_bounds_checker::get_value_range):
17133 Add gimple argument.
17134 (array_bounds_checker::check_array_ref): Same.
17135 (array_bounds_checker::check_addr_expr): Same.
17136 (array_bounds_checker::check_array_bounds): Pass statement to
17137 check_array_bounds and check_addr_expr.
17138 * gimple-array-bounds.h (check_array_bounds): Add gimple argument.
17139 (check_addr_expr): Same.
17140 (get_value_range): Same.
17142 2021-07-26 Tamar Christina <tamar.christina@arm.com>
17144 * config/aarch64/aarch64-simd-builtins.def (sdot, udot): Rename to..
17145 (sdot_prod, udot_prod): ... This.
17146 * config/aarch64/aarch64-simd.md (aarch64_<sur>dot<vsi2qi>): Merged
17148 (<sur>dot_prod<vsi2qi>): ... this.
17149 (aarch64_<sur>dot_lane<vsi2qi>, aarch64_<sur>dot_laneq<vsi2qi>):
17150 Change operands order.
17151 (<sur>sadv16qi): Use new operands order.
17152 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32,
17153 vdotq_s32): Use new RTL ordering.
17155 2021-07-26 Tamar Christina <tamar.christina@arm.com>
17157 * config/aarch64/aarch64-builtins.c (TYPES_TERNOP_SUSS,
17158 aarch64_types_ternop_suss_qualifiers): New.
17159 * config/aarch64/aarch64-simd-builtins.def (usdot_prod): Use it.
17160 * config/aarch64/aarch64-simd.md (usdot_prod<vsi2qi>): Re-organize RTL.
17161 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Use it.
17163 2021-07-23 Jakub Jelinek <jakub@redhat.com>
17165 PR rtl-optimization/101562
17166 * expmed.c (store_integral_bit_field): Only use movstrict_optab
17167 if the operand isn't paradoxical.
17169 2021-07-23 Aldy Hernandez <aldyh@redhat.com>
17171 * gimple-array-bounds.h (class array_bounds_checker): Change
17172 ranges type to range_query.
17174 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17176 * config/aarch64/arm_neon.h (vst1_s64_x2): Use
17177 __builtin_memcpy instead of constructing
17178 __builtin_aarch64_simd_oi one vector at a time.
17179 (vst1_u64_x2): Likewise.
17180 (vst1_f64_x2): Likewise.
17181 (vst1_s8_x2): Likewise.
17182 (vst1_p8_x2): Likewise.
17183 (vst1_s16_x2): Likewise.
17184 (vst1_p16_x2): Likewise.
17185 (vst1_s32_x2): Likewise.
17186 (vst1_u8_x2): Likewise.
17187 (vst1_u16_x2): Likewise.
17188 (vst1_u32_x2): Likewise.
17189 (vst1_f16_x2): Likewise.
17190 (vst1_f32_x2): Likewise.
17191 (vst1_p64_x2): Likewise.
17192 (vst1q_s8_x2): Likewise.
17193 (vst1q_p8_x2): Likewise.
17194 (vst1q_s16_x2): Likewise.
17195 (vst1q_p16_x2): Likewise.
17196 (vst1q_s32_x2): Likewise.
17197 (vst1q_s64_x2): Likewise.
17198 (vst1q_u8_x2): Likewise.
17199 (vst1q_u16_x2): Likewise.
17200 (vst1q_u32_x2): Likewise.
17201 (vst1q_u64_x2): Likewise.
17202 (vst1q_f16_x2): Likewise.
17203 (vst1q_f32_x2): Likewise.
17204 (vst1q_f64_x2): Likewise.
17205 (vst1q_p64_x2): Likewise.
17207 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17209 * config/aarch64/arm_neon.h (vst1_s64_x3): Use
17210 __builtin_memcpy instead of constructing
17211 __builtin_aarch64_simd_ci one vector at a time.
17212 (vst1_u64_x3): Likewise.
17213 (vst1_f64_x3): Likewise.
17214 (vst1_s8_x3): Likewise.
17215 (vst1_p8_x3): Likewise.
17216 (vst1_s16_x3): Likewise.
17217 (vst1_p16_x3): Likewise.
17218 (vst1_s32_x3): Likewise.
17219 (vst1_u8_x3): Likewise.
17220 (vst1_u16_x3): Likewise.
17221 (vst1_u32_x3): Likewise.
17222 (vst1_f16_x3): Likewise.
17223 (vst1_f32_x3): Likewise.
17224 (vst1_p64_x3): Likewise.
17225 (vst1q_s8_x3): Likewise.
17226 (vst1q_p8_x3): Likewise.
17227 (vst1q_s16_x3): Likewise.
17228 (vst1q_p16_x3): Likewise.
17229 (vst1q_s32_x3): Likewise.
17230 (vst1q_s64_x3): Likewise.
17231 (vst1q_u8_x3): Likewise.
17232 (vst1q_u16_x3): Likewise.
17233 (vst1q_u32_x3): Likewise.
17234 (vst1q_u64_x3): Likewise.
17235 (vst1q_f16_x3): Likewise.
17236 (vst1q_f32_x3): Likewise.
17237 (vst1q_f64_x3): Likewise.
17238 (vst1q_p64_x3): Likewise.
17240 2021-07-23 H.J. Lu <hjl.tools@gmail.com>
17243 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): Don't return
17244 hard register when LRA is in progress.
17246 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17248 * config/aarch64/arm_neon.h (vst1_s8_x4): Use
17249 __builtin_memcpy instead of using a union.
17250 (vst1q_s8_x4): Likewise.
17251 (vst1_s16_x4): Likewise.
17252 (vst1q_s16_x4): Likewise.
17253 (vst1_s32_x4): Likewise.
17254 (vst1q_s32_x4): Likewise.
17255 (vst1_u8_x4): Likewise.
17256 (vst1q_u8_x4): Likewise.
17257 (vst1_u16_x4): Likewise.
17258 (vst1q_u16_x4): Likewise.
17259 (vst1_u32_x4): Likewise.
17260 (vst1q_u32_x4): Likewise.
17261 (vst1_f16_x4): Likewise.
17262 (vst1q_f16_x4): Likewise.
17263 (vst1_f32_x4): Likewise.
17264 (vst1q_f32_x4): Likewise.
17265 (vst1_p8_x4): Likewise.
17266 (vst1q_p8_x4): Likewise.
17267 (vst1_p16_x4): Likewise.
17268 (vst1q_p16_x4): Likewise.
17269 (vst1_s64_x4): Likewise.
17270 (vst1_u64_x4): Likewise.
17271 (vst1_p64_x4): Likewise.
17272 (vst1q_s64_x4): Likewise.
17273 (vst1q_u64_x4): Likewise.
17274 (vst1q_p64_x4): Likewise.
17275 (vst1_f64_x4): Likewise.
17276 (vst1q_f64_x4): Likewise.
17278 2021-07-23 Jonathan Wrightt <jonathan.wright@arm.com>
17280 * config/aarch64/arm_neon.h (vst2_s64): Use __builtin_memcpy
17281 instead of constructing __builtin_aarch64_simd_oi one vector
17283 (vst2_u64): Likewise.
17284 (vst2_f64): Likewise.
17285 (vst2_s8): Likewise.
17286 (vst2_p8): Likewise.
17287 (vst2_s16): Likewise.
17288 (vst2_p16): Likewise.
17289 (vst2_s32): Likewise.
17290 (vst2_u8): Likewise.
17291 (vst2_u16): Likewise.
17292 (vst2_u32): Likewise.
17293 (vst2_f16): Likewise.
17294 (vst2_f32): Likewise.
17295 (vst2_p64): Likewise.
17296 (vst2q_s8): Likewise.
17297 (vst2q_p8): Likewise.
17298 (vst2q_s16): Likewise.
17299 (vst2q_p16): Likewise.
17300 (vst2q_s32): Likewise.
17301 (vst2q_s64): Likewise.
17302 (vst2q_u8): Likewise.
17303 (vst2q_u16): Likewise.
17304 (vst2q_u32): Likewise.
17305 (vst2q_u64): Likewise.
17306 (vst2q_f16): Likewise.
17307 (vst2q_f32): Likewise.
17308 (vst2q_f64): Likewise.
17309 (vst2q_p64): Likewise.
17311 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17313 * config/aarch64/arm_neon.h (vst3_s64): Use __builtin_memcpy
17314 instead of constructing __builtin_aarch64_simd_ci one vector
17316 (vst3_u64): Likewise.
17317 (vst3_f64): Likewise.
17318 (vst3_s8): Likewise.
17319 (vst3_p8): Likewise.
17320 (vst3_s16): Likewise.
17321 (vst3_p16): Likewise.
17322 (vst3_s32): Likewise.
17323 (vst3_u8): Likewise.
17324 (vst3_u16): Likewise.
17325 (vst3_u32): Likewise.
17326 (vst3_f16): Likewise.
17327 (vst3_f32): Likewise.
17328 (vst3_p64): Likewise.
17329 (vst3q_s8): Likewise.
17330 (vst3q_p8): Likewise.
17331 (vst3q_s16): Likewise.
17332 (vst3q_p16): Likewise.
17333 (vst3q_s32): Likewise.
17334 (vst3q_s64): Likewise.
17335 (vst3q_u8): Likewise.
17336 (vst3q_u16): Likewise.
17337 (vst3q_u32): Likewise.
17338 (vst3q_u64): Likewise.
17339 (vst3q_f16): Likewise.
17340 (vst3q_f32): Likewise.
17341 (vst3q_f64): Likewise.
17342 (vst3q_p64): Likewise.
17344 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17346 * config/aarch64/arm_neon.h (vst4_s64): Use __builtin_memcpy
17347 instead of constructing __builtin_aarch64_simd_xi one vector
17349 (vst4_u64): Likewise.
17350 (vst4_f64): Likewise.
17351 (vst4_s8): Likewise.
17352 (vst4_p8): Likewise.
17353 (vst4_s16): Likewise.
17354 (vst4_p16): Likewise.
17355 (vst4_s32): Likewise.
17356 (vst4_u8): Likewise.
17357 (vst4_u16): Likewise.
17358 (vst4_u32): Likewise.
17359 (vst4_f16): Likewise.
17360 (vst4_f32): Likewise.
17361 (vst4_p64): Likewise.
17362 (vst4q_s8): Likewise.
17363 (vst4q_p8): Likewise.
17364 (vst4q_s16): Likewise.
17365 (vst4q_p16): Likewise.
17366 (vst4q_s32): Likewise.
17367 (vst4q_s64): Likewise.
17368 (vst4q_u8): Likewise.
17369 (vst4q_u16): Likewise.
17370 (vst4q_u32): Likewise.
17371 (vst4q_u64): Likewise.
17372 (vst4q_f16): Likewise.
17373 (vst4q_f32): Likewise.
17374 (vst4q_f64): Likewise.
17375 (vst4q_p64): Likewise.
17377 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17379 * config/aarch64/arm_neon.h (vtbx4_s8): Use __builtin_memcpy
17380 instead of constructing __builtin_aarch64_simd_oi one vector
17382 (vtbx4_u8): Likewise.
17383 (vtbx4_p8): Likewise.
17385 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17387 * config/aarch64/arm_neon.h (vtbl3_s8): Use __builtin_memcpy
17388 instead of constructing __builtin_aarch64_simd_oi one vector
17390 (vtbl3_u8): Likewise.
17391 (vtbl3_p8): Likewise.
17392 (vtbl4_s8): Likewise.
17393 (vtbl4_u8): Likewise.
17394 (vtbl4_p8): Likewise.
17396 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17398 * config/aarch64/arm_neon.h (vqtbx2_s8): Use __builtin_memcpy
17399 instead of constructing __builtin_aarch64_simd_oi one vector
17401 (vqtbx2_u8): Likewise.
17402 (vqtbx2_p8): Likewise.
17403 (vqtbx2q_s8): Likewise.
17404 (vqtbx2q_u8): Likewise.
17405 (vqtbx2q_p8): Likewise.
17406 (vqtbx3_s8): Use __builtin_memcpy instead of constructing
17407 __builtin_aarch64_simd_ci one vector at a time.
17408 (vqtbx3_u8): Likewise.
17409 (vqtbx3_p8): Likewise.
17410 (vqtbx3q_s8): Likewise.
17411 (vqtbx3q_u8): Likewise.
17412 (vqtbx3q_p8): Likewise.
17413 (vqtbx4_s8): Use __builtin_memcpy instead of constructing
17414 __builtin_aarch64_simd_xi one vector at a time.
17415 (vqtbx4_u8): Likewise.
17416 (vqtbx4_p8): Likewise.
17417 (vqtbx4q_s8): Likewise.
17418 (vqtbx4q_u8): Likewise.
17419 (vqtbx4q_p8): Likewise.
17421 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17423 * config/aarch64/arm_neon.h (vqtbl2_s8): Use __builtin_memcpy
17424 instead of constructing __builtin_aarch64_simd_oi one vector
17426 (vqtbl2_u8): Likewise.
17427 (vqtbl2_p8): Likewise.
17428 (vqtbl2q_s8): Likewise.
17429 (vqtbl2q_u8): Likewise.
17430 (vqtbl2q_p8): Likewise.
17431 (vqtbl3_s8): Use __builtin_memcpy instead of constructing
17432 __builtin_aarch64_simd_ci one vector at a time.
17433 (vqtbl3_u8): Likewise.
17434 (vqtbl3_p8): Likewise.
17435 (vqtbl3q_s8): Likewise.
17436 (vqtbl3q_u8): Likewise.
17437 (vqtbl3q_p8): Likewise.
17438 (vqtbl4_s8): Use __builtin_memcpy instead of constructing
17439 __builtin_aarch64_simd_xi one vector at a time.
17440 (vqtbl4_u8): Likewise.
17441 (vqtbl4_p8): Likewise.
17442 (vqtbl4q_s8): Likewise.
17443 (vqtbl4q_u8): Likewise.
17444 (vqtbl4q_p8): Likewise.
17446 2021-07-23 Haochen Gui <guihaoc@gcc.gnu.org>
17449 * config/rs6000/rs6000.md (cstore<mode>4): Fix wrong fall through.
17451 2021-07-22 Andrew Pinski <apinski@marvell.com>
17453 PR tree-optimization/10153
17454 * tree-tailcall.c (create_tailcall_accumulator):
17455 Don't call fold_convert as the type should be correct already.
17456 (tree_optimize_tail_calls_1): Use build_{one,zero}_cst instead
17457 of integer_{one,zero}_node for the call of create_tailcall_accumulator.
17459 2021-07-22 Aldy Hernandez <aldyh@redhat.com>
17461 * gimple-range-cache.cc (non_null_ref::adjust_range): Replace
17462 varying_p check for null/non-null check.
17464 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17466 PR tree-optimization/101511
17467 * value-relation.cc (relation_oracle::query_relation): Check if ssa1
17468 is in ssa2's equiv set, and don't trap if so.
17470 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17472 PR tree-optimization/101497
17473 * gimple-range-fold.cc (fold_using_range::range_of_cond_expr): Check
17476 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17478 PR tree-optimization/101496
17479 * vr-values.c (simplify_using_ranges::fold_cond): Call range_of_stmt
17480 first, then vrp_visit_cond_Stmt.
17482 2021-07-22 liuhongt <hongtao.liu@intel.com>
17484 * config/i386/i386-expand.c
17485 (ix86_broadcast_from_integer_constant): Rename to ..
17486 (ix86_broadcast_from_constant): .. this, and extend it to
17488 (ix86_expand_vector_move): Extend to float mode.
17489 * config/i386/i386-features.c
17490 (replace_constant_pool_with_broadcast): Remove.
17491 (remove_partial_avx_dependency_gate): Ditto.
17492 (constant_pool_broadcast): Ditto.
17493 (class pass_constant_pool_broadcast): Ditto.
17494 (make_pass_constant_pool_broadcast): Ditto.
17495 (remove_partial_avx_dependency): Adjust gate.
17496 * config/i386/i386-passes.def: Remove pass_constant_pool_broadcast.
17497 * config/i386/i386-protos.h
17498 (make_pass_constant_pool_broadcast): Remove.
17500 2021-07-22 liuhongt <hongtao.liu@intel.com>
17502 * config/i386/constraints.md (Wb): New constraint.
17504 * config/i386/i386.md (*ashlhi3_1): Extend to avx512 mask
17506 (*ashlqi3_1): Ditto.
17507 (*<insn><mode>3_1): Split to ..
17508 (*ashr<mode>3_1): this, ...
17509 (*lshr<mode>3_1): and this, also extend this pattern to avx512
17511 (*<insn><mode>3_1): Split to ..
17512 (*ashr<mode>3_1): this, ...
17513 (*lshrqi3_1): and this, also extend this pattern to avx512
17515 (*lshrhi3_1): And this, also extend this pattern to avx512
17517 * config/i386/sse.md (k<code><mode>): New define_split after
17518 it to convert generic shift pattern to mask shift ones.
17520 2021-07-21 Thomas Schwinge <thomas@codesourcery.com>
17521 Joseph Myers <joseph@codesourcery.com>
17522 Cesar Philippidis <cesar@codesourcery.com>
17524 * tree-core.h (omp_clause_code): Add 'OMP_CLAUSE_NOHOST'.
17525 * tree.c (omp_clause_num_ops, omp_clause_code_name, walk_tree_1):
17527 * tree-pretty-print.c (dump_omp_clause): Likewise.
17528 * omp-general.c (oacc_verify_routine_clauses): Likewise.
17529 * gimplify.c (gimplify_scan_omp_clauses)
17530 (gimplify_adjust_omp_clauses): Likewise.
17531 * tree-nested.c (convert_nonlocal_omp_clauses)
17532 (convert_local_omp_clauses): Likewise.
17533 * omp-low.c (scan_sharing_clauses): Likewise.
17534 * omp-offload.c (execute_oacc_device_lower): Update.
17536 2021-07-21 Martin Sebor <msebor@redhat.com>
17538 * tree-ssa-alias.c (walk_aliased_vdefs_1): Fix typos in a comment.
17540 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17542 * config/rs6000/rs6000-gen-builtins.c (write_init_bif_table):
17545 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17547 * config/rs6000/rs6000-gen-builtins.c (write_fntype): New
17549 (write_fntype_init): New stub function.
17550 (write_init_bif_table): Likewise.
17551 (write_init_ovld_table): New function.
17552 (write_init_file): Implement.
17554 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17556 * config/rs6000/rs6000-gen-builtins.c
17557 (write_autogenerated_header): New function.
17558 (write_decls): Likewise.
17559 (write_extern_fntype): New callback function.
17560 (write_header_file): Implement.
17562 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17564 * config/rs6000/rs6000-gen-builtins.c (write_defines_file):
17567 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17569 * config/rs6000/rs6000-gen-builtins.c (complete_vector_type): New
17571 (complete_base_type): Likewise.
17572 (construct_fntype_id): Likewise.
17573 (parse_bif_entry): Call contruct_fntype_id.
17574 (parse_ovld_entry): Likewise.
17576 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17578 * config/rs6000/rs6000-gen-builtins.c (ovld_stanza): New struct.
17579 (MAXOVLDSTANZAS): New macro.
17580 (ovld_stanzas): New variable.
17581 (curr_ovld_stanza): Likewise.
17582 (MAXOVLDS): New macro.
17583 (ovlddata): New struct.
17584 (ovlds): New variable.
17585 (curr_ovld): Likewise.
17586 (max_ovld_args): Likewise.
17587 (parse_ovld_entry): New function.
17588 (parse_ovld_stanza): Likewise.
17589 (parse_ovld): Implement.
17591 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17593 * config/rs6000/rs6000-gen-builtins.c (parse_bif_attrs):
17596 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17598 * config/rs6000/rs6000-gen-builtins.c (parse_args): New function.
17599 (parse_prototype): Implement.
17601 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17603 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): New enum.
17604 (curr_bif_stanza): New variable.
17605 (stanza_entry): New struct.
17606 (stanza_map): New initialized variable.
17607 (enable_string): Likewise.
17608 (fnkinds): New enum.
17609 (typelist): New struct.
17610 (attrinfo): Likewise.
17611 (MAXRESTROPNDS): New macro.
17612 (prototype): New struct.
17613 (MAXBIFS): New macro.
17614 (bifdata): New struct.
17615 (bifs): New variable.
17616 (curr_bif): Likewise.
17617 (bif_order): Likewise.
17618 (bif_index): Likewise.
17619 (fatal): New function.
17620 (stanza_name_to_stanza): Likewise.
17621 (parse_bif_attrs): New stub function.
17622 (parse_prototype): Likewise.
17623 (parse_bif_entry): New function.
17624 (parse_bif_stanza): Likewise.
17625 (parse_bif): Implement.
17626 (set_bif_order): New function.
17627 (create_bif_order): Implement.
17629 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17631 * config/rs6000/rs6000-gen-builtins.c (rbtree.h): New #include.
17632 (num_bifs): New variable.
17633 (num_ovld_stanzas): Likewise.
17634 (num_ovlds): Likewise.
17635 (parse_codes): New enum.
17636 (bif_rbt): New variable.
17637 (ovld_rbt): Likewise.
17638 (fntype_rbt): Likewise.
17639 (bifo_rbt): Likewise.
17640 (parse_bif): New stub function.
17641 (create_bif_order): Likewise.
17642 (parse_ovld): Likewise.
17643 (write_header_file): Likewise.
17644 (write_init_file): Likewise.
17645 (write_defines_file): Likewise.
17646 (delete_output_files): New function.
17649 2021-07-21 H.J. Lu <hjl.tools@gmail.com>
17652 * config/i386/i386-builtin.def: Remove OPTION_MASK_ISA_SSE4_2
17653 from CRC32 _builtin functions.
17655 2021-07-21 Sebastian Huber <sebastian.huber@embedded-brains.de>
17657 * coverage.c (build_gcov_info_var_registration): Mark the object placed
17658 in the linker set as referenced so that it does not get optimized away.
17660 2021-07-21 Kito Cheng <kito.cheng@sifive.com>
17663 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
17665 * config.gcc (riscv*-*-*): Detect which python is available.
17667 2021-07-21 Jakub Jelinek <jakub@redhat.com>
17669 PR middle-end/101535
17670 * gimplify.c (omp_check_private): Properly skip ORT_TARGET_DATA
17671 contexts in which decl isn't privatized and for ORT_TARGET return
17672 false if decl is mapped.
17674 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
17676 * gimple-loop-jam.c: Include tree-ssa-sccvn.h.
17677 (tree_loop_unroll_and_jam): Run value-numbering on a loop that
17678 has been successfully unrolled.
17680 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
17682 * tree-ssa-loop-manip.c (determine_exit_conditions): Return a null
17683 exit condition if no tail loop is needed, and if the original exit
17684 condition should therefore be kept as-is.
17685 (tree_transform_and_unroll_loop): Handle that case here too.
17687 2021-07-21 Kewen Lin <linkw@linux.ibm.com>
17689 * tree-data-ref.c (free_dependence_relations): Adjust to pass vec
17691 (free_data_refs): Likewise.
17692 * tree-data-ref.h (free_dependence_relations): Likewise.
17693 (free_data_refs): Likewise.
17694 * tree-predcom.c (struct chain): Use auto_vec instead of vec for
17696 (struct component): Likewise.
17697 (pcom_worker::pcom_worker): Adjust for auto_vec and renaming changes.
17698 (pcom_worker::~pcom_worker): Likewise.
17699 (pcom_worker::release_chain): Adjust as auto_vec changes.
17700 (pcom_worker::loop): Rename to ...
17701 (pcom_worker::m_loop): ... this.
17702 (pcom_worker::datarefs): Rename to ...
17703 (pcom_worker::m_datarefs): ... this. Use auto_vec instead of vec.
17704 (pcom_worker::dependences): Rename to ...
17705 (pcom_worker::m_dependences): ... this. Use auto_vec instead of vec.
17706 (pcom_worker::chains): Rename to ...
17707 (pcom_worker::m_chains): ... this. Use auto_vec instead of vec.
17708 (pcom_worker::looparound_phis): Rename to ...
17709 (pcom_worker::m_looparound_phis): ... this. Use auto_vec instead of
17711 (pcom_worker::cache): Rename to ...
17712 (pcom_worker::m_cache): ... this. Use auto_vec instead of vec.
17713 (pcom_worker::release_chain): Adjust for auto_vec changes.
17714 (pcom_worker::release_chains): Adjust for auto_vec and renaming
17716 (release_component): Remove.
17717 (release_components): Adjust for release_component removal.
17718 (component_of): Adjust to use vec.
17719 (merge_comps): Likewise.
17720 (pcom_worker::aff_combination_dr_offset): Adjust for renaming changes.
17721 (pcom_worker::determine_offset): Likewise.
17722 (class comp_ptrs): Remove.
17723 (pcom_worker::split_data_refs_to_components): Adjust for renaming
17724 changes, for comp_ptrs removal with auto_vec.
17725 (pcom_worker::suitable_component_p): Adjust for renaming changes.
17726 (pcom_worker::filter_suitable_components): Adjust for release_component
17728 (pcom_worker::valid_initializer_p): Adjust for renaming changes.
17729 (pcom_worker::find_looparound_phi): Likewise.
17730 (pcom_worker::add_looparound_copies): Likewise.
17731 (pcom_worker::determine_roots_comp): Likewise.
17732 (pcom_worker::single_nonlooparound_use): Likewise.
17733 (pcom_worker::execute_pred_commoning_chain): Likewise.
17734 (pcom_worker::execute_pred_commoning): Likewise.
17735 (pcom_worker::try_combine_chains): Likewise.
17736 (pcom_worker::prepare_initializers_chain): Likewise.
17737 (pcom_worker::prepare_initializers): Likewise.
17738 (pcom_worker::prepare_finalizers_chain): Likewise.
17739 (pcom_worker::prepare_finalizers): Likewise.
17740 (pcom_worker::tree_predictive_commoning_loop): Likewise.
17742 2021-07-20 Martin Sebor <msebor@redhat.com>
17744 PR middle-end/101397
17745 * builtins.c (gimple_call_return_array): Add argument. Correct
17746 offsets for memchr, mempcpy, stpcpy, and stpncpy.
17747 (compute_objsize_r): Adjust offset computation for argument returning
17750 2021-07-20 Martin Sebor <msebor@redhat.com>
17752 PR middle-end/101300
17753 * tree-ssa-uninit.c (check_defs): Handle UBSAN built-ins.
17755 2021-07-20 Jeff Law <jlaw@localhost.localdomain>
17757 * function.c (assign_parm_setup_block): Use adjust_address instead
17758 of change_address to preserve MEM_EXPR and friends.
17760 2021-07-20 Martin Sebor <msebor@redhat.com>
17762 * cfgloop.h (single_likely_exit): Adjust by-value argument to
17763 by-const-reference.
17764 * cfgloopanal.c (single_likely_exit): Same.
17765 * cgraph.h (struct cgraph_node): Same.
17766 * cgraphclones.c (cgraph_node::create_virtual_clone): Same.
17767 * genautomata.c (merge_states): Same.
17768 * genextract.c (VEC_char_to_string): Same.
17769 * genmatch.c (dt_node::gen_kids_1): Same.
17770 (walk_captures): Adjust by-value argument to by-reference.
17771 * gimple-ssa-store-merging.c (check_no_overlap): Adjust by-value argument
17772 to by-const-reference.
17773 * gimple.c (gimple_build_call_vec): Same.
17774 (gimple_build_call_internal_vec): Same.
17775 (gimple_build_switch): Same.
17776 (sort_case_labels): Same.
17777 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
17779 * gimple.h (gimple_build_call_vec): Adjust by-value argument to
17780 by-const-reference.
17781 (gimple_build_call_internal_vec): Same.
17782 (gimple_build_switch): Same.
17783 (sort_case_labels): Same.
17784 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
17786 * haifa-sched.c (calc_priorities): Adjust by-value argument to
17787 by-const-reference.
17788 (sched_init_luids): Same.
17789 (haifa_init_h_i_d): Same.
17790 * ipa-cp.c (ipa_get_indirect_edge_target_1): Same.
17791 (adjust_callers_for_value_intersection): Adjust by-value argument to
17793 (find_more_scalar_values_for_callers_subset): Adjust by-value argument to
17794 by-const-reference.
17795 (find_more_contexts_for_caller_subset): Same.
17796 (find_aggregate_values_for_callers_subset): Same.
17797 (copy_useful_known_contexts): Same.
17798 * ipa-fnsummary.c (remap_edge_summaries): Same.
17799 (remap_freqcounting_predicate): Same.
17800 * ipa-inline.c (add_new_edges_to_heap): Adjust by-value argument to
17802 * ipa-predicate.c (predicate::remap_after_inlining): Adjust by-value argument
17803 to by-const-reference.
17804 * ipa-predicate.h (predicate::remap_after_inlining): Same.
17805 * ipa-prop.c (ipa_find_agg_cst_for_param): Same.
17806 * ipa-prop.h (ipa_find_agg_cst_for_param): Same.
17807 * ira-build.c (ira_loop_tree_body_rev_postorder): Same.
17808 * read-rtl.c (add_overload_instance): Same.
17809 * rtl.h (native_decode_rtx): Same.
17810 (native_decode_vector_rtx): Same.
17811 * sched-int.h (sched_init_luids): Same.
17812 (haifa_init_h_i_d): Same.
17813 * simplify-rtx.c (native_decode_vector_rtx): Same.
17814 (native_decode_rtx): Same.
17815 * tree-call-cdce.c (gen_shrink_wrap_conditions): Same.
17816 (shrink_wrap_one_built_in_call_with_conds): Same.
17817 (shrink_wrap_conditional_dead_built_in_calls): Same.
17818 * tree-data-ref.c (create_runtime_alias_checks): Same.
17819 (compute_all_dependences): Same.
17820 * tree-data-ref.h (compute_all_dependences): Same.
17821 (create_runtime_alias_checks): Same.
17822 (index_in_loop_nest): Same.
17823 * tree-if-conv.c (mask_exists): Same.
17824 * tree-loop-distribution.c (class loop_distribution): Same.
17825 (loop_distribution::create_rdg_vertices): Same.
17826 (dump_rdg_partitions): Same.
17827 (debug_rdg_partitions): Same.
17828 (partition_contains_all_rw): Same.
17829 (loop_distribution::distribute_loop): Same.
17830 * tree-parloops.c (oacc_entry_exit_ok_1): Same.
17831 (oacc_entry_exit_single_gang): Same.
17832 * tree-ssa-loop-im.c (hoist_memory_references): Same.
17833 (loop_suitable_for_sm): Same.
17834 * tree-ssa-loop-niter.c (bound_index): Same.
17835 * tree-ssa-reassoc.c (update_ops): Same.
17836 (swap_ops_for_binary_stmt): Same.
17837 (rewrite_expr_tree): Same.
17838 (rewrite_expr_tree_parallel): Same.
17839 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Same.
17840 * tree-ssa-sccvn.h (ao_ref_init_from_vn_reference): Same.
17841 * tree-ssa-structalias.c (process_all_all_constraints): Same.
17842 (make_constraints_to): Same.
17843 (handle_lhs_call): Same.
17844 (find_func_aliases_for_builtin_call): Same.
17845 (sort_fieldstack): Same.
17846 (check_for_overlaps): Same.
17847 * tree-vect-loop-manip.c (vect_create_cond_for_align_checks): Same.
17848 (vect_create_cond_for_unequal_addrs): Same.
17849 (vect_create_cond_for_lower_bounds): Same.
17850 (vect_create_cond_for_alias_checks): Same.
17851 * tree-vect-slp-patterns.c (vect_validate_multiplication): Same.
17852 * tree-vect-slp.c (vect_analyze_slp_instance): Same.
17853 (vect_make_slp_decision): Same.
17854 (vect_slp_bbs): Same.
17855 (duplicate_and_interleave): Same.
17856 (vect_transform_slp_perm_load): Same.
17857 (vect_schedule_slp): Same.
17858 * tree-vectorizer.h (vect_transform_slp_perm_load): Same.
17859 (vect_schedule_slp): Same.
17860 (duplicate_and_interleave): Same.
17861 * tree.c (build_vector_from_ctor): Same.
17862 (build_vector): Same.
17863 (check_vector_cst): Same.
17864 (check_vector_cst_duplicate): Same.
17865 (check_vector_cst_fill): Same.
17866 (check_vector_cst_stepped): Same.
17867 * tree.h (build_vector_from_ctor): Same.
17869 2021-07-20 Jakub Jelinek <jakub@redhat.com>
17872 * config/rs6000/rs6000-protos.h (easy_altivec_constant): Change return
17873 type from bool to int.
17874 * config/rs6000/rs6000.c (vspltis_constant): Fix up handling the
17875 EASY_VECTOR_MSB case if either step or copies is not 1.
17876 (vspltis_shifted): Fix comment typo.
17877 (easy_altivec_constant): Change return type from bool to int, instead
17878 of returning true return byte size of the element mode that should be
17879 used to synthetize the constant.
17880 * config/rs6000/predicates.md (easy_vector_constant_msb): Require
17881 that vspltis_shifted is 0, handle the case where easy_altivec_constant
17882 assumes using different vector mode from CONST_VECTOR's mode.
17883 * config/rs6000/altivec.md (easy_vector_constant_msb splitter): Use
17884 easy_altivec_constant to determine mode in which -1 >> -1 should be
17885 performed, use rs6000_expand_vector_init instead of gen_vec_initv4sisi.
17887 2021-07-20 Richard Biener <rguenther@suse.de>
17890 * dwarf2out.h (dwarf_file_data): Add key member.
17891 * dwarf2out.c (dwarf_file_hasher::equal): Compare key.
17892 (dwarf_file_hasher::hash): Hash key.
17893 (lookup_filename): Remap the filename and store it in the
17894 filename member of dwarf_file_data when creating a new
17896 (file_name_acquire): Do not remap the filename again.
17897 (maybe_emit_file): Likewise.
17899 2021-07-20 Jonathan Wright <jonathan.wright@arm.com>
17901 * config/aarch64/aarch64-simd-builtins.def: Use two variant
17902 generators for all TBL/TBX intrinsics and rename to
17903 consistent forms: qtbl[1234] or qtbx[1234].
17904 * config/aarch64/aarch64-simd.md (aarch64_tbl1<mode>):
17906 (aarch64_qtbl1<mode>): This.
17907 (aarch64_tbx1<mode>): Rename to...
17908 (aarch64_qtbx1<mode>): This.
17909 (aarch64_tbl2v16qi): Delete.
17910 (aarch64_tbl3<mode>): Rename to...
17911 (aarch64_qtbl2<mode>): This.
17912 (aarch64_tbx4<mode>): Rename to...
17913 (aarch64_qtbx2<mode>): This.
17914 * config/aarch64/aarch64.c (aarch64_expand_vec_perm_1): Use
17915 renamed qtbl1 and qtbl2 RTL patterns.
17916 * config/aarch64/arm_neon.h (vqtbl1_p8): Use renamed qtbl1
17918 (vqtbl1_s8): Likewise.
17919 (vqtbl1_u8): Likewise.
17920 (vqtbl1q_p8): Likewise.
17921 (vqtbl1q_s8): Likewise.
17922 (vqtbl1q_u8): Likewise.
17923 (vqtbx1_s8): Use renamed qtbx1 RTL pattern.
17924 (vqtbx1_u8): Likewise.
17925 (vqtbx1_p8): Likewise.
17926 (vqtbx1q_s8): Likewise.
17927 (vqtbx1q_u8): Likewise.
17928 (vqtbx1q_p8): Likewise.
17929 (vtbl1_s8): Use renamed qtbl1 RTL pattern.
17930 (vtbl1_u8): Likewise.
17931 (vtbl1_p8): Likewise.
17932 (vtbl2_s8): Likewise
17933 (vtbl2_u8): Likewise.
17934 (vtbl2_p8): Likewise.
17935 (vtbl3_s8): Use renamed qtbl2 RTL pattern.
17936 (vtbl3_u8): Likewise.
17937 (vtbl3_p8): Likewise.
17938 (vtbl4_s8): Likewise.
17939 (vtbl4_u8): Likewise.
17940 (vtbl4_p8): Likewise.
17941 (vtbx2_s8): Use renamed qtbx2 RTL pattern.
17942 (vtbx2_u8): Likewise.
17943 (vtbx2_p8): Likewise.
17944 (vqtbl2_s8): Use renamed qtbl2 RTL pattern.
17945 (vqtbl2_u8): Likewise.
17946 (vqtbl2_p8): Likewise.
17947 (vqtbl2q_s8): Likewise.
17948 (vqtbl2q_u8): Likewise.
17949 (vqtbl2q_p8): Likewise.
17950 (vqtbx2_s8): Use renamed qtbx2 RTL pattern.
17951 (vqtbx2_u8): Likewise.
17952 (vqtbx2_p8): Likewise.
17953 (vqtbx2q_s8): Likewise.
17954 (vqtbx2q_u8): Likewise.
17955 (vqtbx2q_p8): Likewise.
17956 (vtbx4_s8): Likewise.
17957 (vtbx4_u8): Likewise.
17958 (vtbx4_p8): Likewise.
17960 2021-07-20 Uroš Bizjak <ubizjak@gmail.com>
17963 * config/i386/sync.md (define_peephole2 atomic_storedi_fpu):
17965 (define_peephole2 atomic_loaddi_fpu): Ditto.
17967 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
17969 * config.gcc (riscv*-*-*): Detect which python is available.
17971 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
17973 * config/rs6000/vsx.md (mulhs_<mode>): Rename to...
17974 (smul<mode>3_highpart): ... this.
17975 (mulhu_<mode>): Rename to...
17976 (umul<mode>3_highpart): ... this.
17977 * config/rs6000/rs6000-builtin.def (MULHS_V2DI, MULHS_V4SI,
17978 MULHU_V2DI, MULHU_V4SI): Adjust.
17980 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
17982 PR tree-optimization/100696
17983 * internal-fn.c (first_commutative_argument): Add info for IFN_MULH.
17984 * internal-fn.def (IFN_MULH): New internal function.
17985 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Add support to
17986 recog normal multiply highpart as IFN_MULH.
17987 * config/i386/i386.c (ix86_add_stmt_cost): Adjust for combined
17990 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
17992 * config/elfos.h (CTF_DEBUGGING_INFO): New definition.
17993 (BTF_DEBUGGING_INFO): Likewise.
17994 * doc/tm.texi.in: Document the new macros.
17995 * doc/tm.texi: Regenerated.
17996 * toplev.c: Guard initialization of debug hooks.
17998 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
18000 * flags.h (ctf_debuginfo_p): New function declaration.
18001 * opts.c (ctf_debuginfo_p): New function definition.
18003 2021-07-19 Andrew Stubbs <ams@codesourcery.com>
18006 * config/gcn/gcn-hsa.h (DRIVER_SELF_SPECS): New.
18007 (ASM_SPEC): Set -mattr for xnack and sram-ecc.
18008 * config/gcn/gcn-opts.h (enum sram_ecc_type): New.
18009 * config/gcn/gcn-valu.md: Add a warning comment.
18010 * config/gcn/gcn.c (gcn_option_override): Add "sorry" for -mxnack.
18011 (output_file_start): Add xnack and sram-ecc state to ".amdgcn_target".
18012 * config/gcn/gcn.md: Add a warning comment.
18013 * config/gcn/gcn.opt: Add -mxnack and -msram-ecc.
18014 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): Remove
18016 (EF_AMDGPU_XNACK): New.
18017 (EF_AMDGPU_SRAM_ECC): New.
18019 (copy_early_debug_info): Use elf_flags.
18020 (main): Handle -mxnack and -msram-ecc options.
18021 * doc/invoke.texi: Document -mxnack and -msram-ecc.
18023 2021-07-19 Andrew Pinski <apinski@marvell.com>
18026 * config/aarch64/aarch64.md (csneg3_uxtw_insn): Rename to ...
18027 (*cs<neg_not_cs>3_uxtw_insn4): and extend to NEG_NOT.
18029 2021-07-19 Richard Biener <rguenther@suse.de>
18031 PR tree-optimization/101505
18032 * tree-vect-patterns.c (vect_determine_precisions): Walk
18033 PHIs also for loop vectorization.
18035 2021-07-19 Richard Biener <rguenther@suse.de>
18037 * gimple.h (gimple_expr_type): Remove.
18038 * doc/gimple.texi: Remove gimple_expr_type documentation.
18040 2021-07-19 Richard Biener <rguenther@suse.de>
18042 * tree-ssa-sccvn.c (vn_reference_eq): Handle NULL vr->type.
18043 (ao_ref_init_from_vn_reference): Likewise.
18044 (fully_constant_reference): Likewise.
18045 (vn_reference_lookup_call): Do not set vr->type to random
18047 * tree-ssa-pre.c (compute_avail): Do not try to PRE calls
18049 * tree-vect-generic.c (expand_vector_piecewise): Pass in
18050 whether we expanded parallel.
18051 (expand_vector_parallel): Adjust.
18052 (expand_vector_addition): Likewise.
18053 (expand_vector_comparison): Likewise.
18054 (expand_vector_operation): Likewise.
18055 (expand_vector_scalar_condition): Likewise.
18056 (expand_vector_conversion): Likewise.
18058 2021-07-19 Richard Biener <rguenther@suse.de>
18060 * tree-vrp.c (register_edge_assert_for_2): Use the
18062 (vrp_folder::fold_predicate_in): Likewise.
18063 * vr-values.c (gimple_assign_nonzero_p): Likewise.
18064 (vr_values::extract_range_from_comparison): Likewise.
18065 (vr_values::extract_range_from_ubsan_builtin): Use the
18066 type of the first operand.
18067 (vr_values::extract_range_basic): Push down type
18068 computation, use the appropriate LHS.
18069 (vr_values::extract_range_from_assignment): Use the
18072 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
18075 * common/config/i386/i386-common.c (ix86_handle_option): For
18076 -mgeneral-regs-only, enable the GPR only instructions which are
18077 enabled implicitly by SSE ISAs unless they have been disabled
18080 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
18083 * config/i386/i386.c (ix86_check_avx_upper_stores): Moved before
18084 ix86_avx_u128_mode_needed.
18085 (ix86_avx_u128_mode_needed): Return AVX_U128_DIRTY if callee
18086 returns AVX register.
18088 2021-07-17 Jan Hubicka <hubicka@ucw.cz>
18090 * tree-ssa-structalias.c (handle_rhs_call): Support EAF_NOT_RETURNED.
18091 (handle_const_call): Liekise
18092 (handle_pure_call): Liekise
18094 2021-07-17 Andrew MacLeod <amacleod@redhat.com>
18096 PR tree-optimization/96542
18097 * range-op.cc (range_operator::wi_fold_in_parts): New.
18098 (range_operator::fold_range): Call wi_fold_in_parts.
18099 (operator_lshift::wi_fold): Fix broken lshift by [0,0].
18100 * range-op.h (wi_fold_in_parts): Add prototype.
18102 2021-07-16 David Malcolm <dmalcolm@redhat.com>
18104 * doc/analyzer.texi: Add __analyzer_dump_state.
18106 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18108 * config/rs6000/rbtree.c: New file.
18109 * config/rs6000/rbtree.h: New file.
18111 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18113 * config/rs6000/rs6000-gen-builtins.c (restriction): New enum.
18114 (typeinfo): Add restr field.
18115 (match_bracketed_pair): New function.
18116 (match_const_restriction): Implement.
18118 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18120 * config/rs6000/rs6000-gen-builtins.c (match_basetype): Implement.
18122 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18124 * config/rs6000/rs6000-gen-builtins.c (void_status): New enum.
18125 (basetype): Likewise.
18126 (typeinfo): Likewise.
18127 (handle_pointer): New function.
18128 (match_basetype): New stub function.
18129 (match_const_restriction): Likewise.
18130 (match_type): New function.
18132 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18134 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace): New
18136 (advance_line): Likewise.
18137 (safe_inc_pos): Likewise.
18138 (match_identifier): Likewise.
18139 (match_integer): Likewise.
18140 (match_to_right_bracket): Likewise.
18142 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18144 * config/rs6000/rs6000-gen-builtins.c (bif_file): New variable.
18145 (ovld_file): Likewise.
18146 (header_file): Likewise.
18147 (init_file): Likewise.
18148 (defines_file): Likewise.
18149 (pgm_path): Likewise.
18150 (bif_path): Likewise.
18151 (ovld_path): Likewise.
18152 (header_path): Likewise.
18153 (init_path): Likewise.
18154 (defines_path): Likewise.
18155 (LINELEN): New macro.
18156 (linebuf): New variable.
18160 (bif_diag): New function.
18161 (ovld_diag): Likewise.
18163 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18165 * config/rs6000/rs6000-builtin-new.def: New.
18166 * config/rs6000/rs6000-overload.def: New.
18168 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18170 * config/rs6000/rs6000-gen-builtins.c: New.
18172 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18174 * Makefile.in (EXTRA_GTYPE_DEPS): New variable.
18175 (s-gtype): Depend on EXTRA_GTYPE_DEPS.
18176 * gengtype-state.c (state_writer::write_state_file_list): Add a
18177 parameter to the fileslist expression for the number of build
18179 (read_state_files_list): Detect build headers and strip the
18180 initial "./" or ".\" from their names.
18181 * gengtype.c (build_headers): New global variable.
18182 (num_build_headers): Likewise.
18183 (open_base_files): Emit #include for each build header.
18184 (main): Detect and count build headers.
18185 * gengtype.h (build_headers): New extern variable.
18186 (num_build_headers): Likewise.
18188 2021-07-16 Richard Biener <rguenther@suse.de>
18190 * gimple-ssa-store-merging.c (verify_symbolic_number_p): Use
18191 the type of the LHS.
18192 (find_bswap_or_nop_1): Likewise.
18193 (find_bswap_or_nop): Likewise.
18194 * tree-vectorizer.h (vect_get_smallest_scalar_type): Adjust
18196 * tree-vect-data-refs.c (vect_get_smallest_scalar_type):
18197 Remove unused parameters, pass in the scalar type. Fix
18198 internal store function handling.
18199 * tree-vect-stmts.c (vect_analyze_stmt): Remove assert.
18200 (vect_get_vector_types_for_stmt): Move down check for
18201 existing vector stmt after we've determined a scalar type.
18202 Pass down the used scalar type to vect_get_smallest_scalar_type.
18203 * tree-vect-generic.c (expand_vector_condition): Use
18204 the type of the LHS.
18205 (expand_vector_scalar_condition): Likewise.
18206 (expand_vector_operations_1): Likewise.
18207 * tree-vect-patterns.c (vect_widened_op_tree): Likewise.
18208 (vect_recog_dot_prod_pattern): Likewise.
18209 (vect_recog_sad_pattern): Likewise.
18210 (vect_recog_widen_op_pattern): Likewise.
18211 (vect_recog_widen_sum_pattern): Likewise.
18212 (vect_recog_mixed_size_cond_pattern): Likewise.
18214 2021-07-16 Jan Hubicka <hubicka@ucw.cz>
18216 * ipa-modref.c (struct escape_entry): Use eaf_fleags_t.
18217 (dump_eaf_flags): Dump EAF_NOT_RETURNED
18218 (eaf_flags_useful_p): Use eaf_fleags_t; handle const functions
18219 and EAF_NOT_RETURNED.
18220 (modref_summary::useful_p): Likewise.
18221 (modref_summary_lto::useful_p): Likewise.
18222 (struct) modref_summary_lto: Use eaf_fleags_t.
18223 (deref_flags): Handle EAF_NOT_RETURNED.
18224 (struct escape_point): Use min_flags.
18225 (modref_lattice::init): Add EAF_NOT_RETURNED.
18226 (merge_call_lhs_flags): Ignore EAF_NOT_RETURNED functions
18227 (analyze_ssa_name_flags): Clear EAF_NOT_RETURNED on return;
18229 (analyze_parms): Also analyze const functions; update conition on
18231 (modref_write): Update streaming.
18232 (read_section): Update streaming.
18233 (remap_arg_flags): Use eaf_flags_t.
18234 (modref_merge_call_site_flags): Hanlde EAF_NOT_RETURNED.
18235 * ipa-modref.h: (eaf_flags_t): New typedef.
18236 (struct modref_summary): Use eaf_flags_t.
18237 * tree-core.h (EAF_NOT_RETURNED): New constant.
18239 2021-07-16 Richard Biener <rguenther@suse.de>
18241 * gimple-fold.c (gimple_fold_stmt_to_constant_1): Use
18242 the type of the LHS.
18243 (gimple_assign_nonnegative_warnv_p): Likewise.
18244 (gimple_call_nonnegative_warnv_p): Likewise. Return false
18245 if the call has no LHS.
18246 * gimple.c (gimple_could_trap_p_1): Use the type of the LHS.
18247 * tree-eh.c (stmt_could_throw_1_p): Likewise.
18248 * tree-inline.c (insert_init_stmt): Likewise.
18249 * tree-ssa-loop-niter.c (get_val_for): Likewise.
18250 * tree-outof-ssa.c (ssa_is_replaceable_p): Use the type of
18252 * tree-ssa-sccvn.c (init_vn_nary_op_from_stmt): Take a
18253 gassign *. Use the type of the lhs.
18254 (vn_nary_op_lookup_stmt): Adjust.
18255 (vn_nary_op_insert_stmt): Likewise.
18257 2021-07-16 Ilya Leoshkevich <iii@linux.ibm.com>
18259 * config/s390/predicates.md (bras_sym_operand): Accept all
18260 functions in 64-bit mode, use UNSPEC_PLT31.
18261 (larl_operand): Use UNSPEC_PLT31.
18262 * config/s390/s390.c (s390_loadrelative_operand_p): Likewise.
18263 (legitimize_pic_address): Likewise.
18264 (s390_emit_tls_call_insn): Mark __tls_get_offset as function,
18266 (s390_delegitimize_address): Use UNSPEC_PLT31.
18267 (s390_output_addr_const_extra): Likewise.
18268 (print_operand): Add @PLT to TLS calls, handle %K.
18269 (s390_function_profiler): Mark __fentry__/_mcount as function,
18270 use %K, use UNSPEC_PLT31.
18271 (s390_output_mi_thunk): Use only UNSPEC_GOT, use %K.
18272 (s390_emit_call): Use UNSPEC_PLT31.
18273 (s390_emit_tpf_eh_return): Mark __tpf_eh_return as function.
18274 * config/s390/s390.md (UNSPEC_PLT31): Rename from UNSPEC_PLT.
18275 (*movdi_64): Use %K.
18276 (reload_base_64): Likewise.
18277 (*sibcall_brc): Likewise.
18278 (*sibcall_brcl): Likewise.
18279 (*sibcall_value_brc): Likewise.
18280 (*sibcall_value_brcl): Likewise.
18282 (*brasl): Likewise.
18283 (*bras_r): Likewise.
18284 (*brasl_r): Likewise.
18285 (*bras_tls): Likewise.
18286 (*brasl_tls): Likewise.
18287 (main_base_64): Likewise.
18288 (reload_base_64): Likewise.
18289 (@split_stack_call<mode>): Likewise.
18291 2021-07-16 Richard Biener <rguenther@suse.de>
18293 PR tree-optimization/101467
18294 * tree-vect-stmts.c (vect_gen_while): Properly guard
18295 make_temp_ssa_name usage.
18297 2021-07-16 Cooper Qu <cooper.qu@linux.alibaba.com>
18299 * config.gcc: Don't use forked print-sysroot-suffix.sh and
18300 t-sysroot-suffix for C-SKY.
18301 * config/csky/print-sysroot-suffix.sh: Delete.
18302 * config/csky/t-csky-linux: Delete.
18303 * config/csky/t-sysroot-suffix: Define MULTILIB_DIRNAMES
18304 instead of CSKY_MULTILIB_DIRNAMES.
18306 2021-07-16 Richard Biener <rguenther@suse.de>
18308 * tree-vect-loop.c (vect_transform_cycle_phi): Correct sign
18309 conversion issues with the partial reduction of the reused
18310 vector accumulator.
18312 2021-07-16 Richard Biener <rguenther@suse.de>
18314 * config/i386/i386-options.c (ix86_option_override_internal): Set
18315 param_vect_partial_vector_usage to zero if not set.
18317 2021-07-15 Uroš Bizjak <ubizjak@gmail.com>
18320 * config/i386/i386.h (VALID_SSE_REG_MODE): Add TDmode.
18321 (VALID_INT_MODE_P): Add SDmode and DDmode.
18322 Add TDmode for TARGET_64BIT.
18323 (VALID_DFP_MODE_P): Remove.
18324 * config/i386/i386.c (ix86_hard_regno_mode_ok):
18325 Do not use VALID_DFP_MODE_P.
18327 2021-07-15 Andrew MacLeod <amacleod@redhat.com>
18329 * gimple-range-fold.cc (adjust_pointer_diff_expr): Use
18331 (fold_using_range::fold_stmt): Ditto.
18332 (fold_using_range::range_of_range_op): Ditto.
18333 (fold_using_range::range_of_phi): Ditto.
18334 (fold_using_range::range_of_call): Ditto.
18335 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
18336 (fold_using_range::range_of_builtin_call): Ditto.
18337 (fold_using_range::range_of_cond_expr): Ditto.
18338 * gimple-range-fold.h (gimple_range_type): New.
18340 2021-07-15 Martin Sebor <msebor@redhat.com>
18342 PR middle-end/97027
18343 * tree-ssa-strlen.c (handle_assign): New function.
18344 (maybe_warn_overflow): Add argument.
18345 (nonzero_bytes_for_type): New function.
18346 (count_nonzero_bytes): Handle more tree types. Call
18347 nonzero_bytes_for_tye.
18348 (count_nonzero_bytes): Handle types.
18349 (handle_store): Handle stores from function calls.
18350 (strlen_check_and_optimize_call): Move code to handle_assign. Call
18351 it for assignments from function calls.
18353 2021-07-15 David Malcolm <dmalcolm@redhat.com>
18358 * doc/invoke.texi: Add -Wanalyzer-use-of-uninitialized-value.
18360 2021-07-15 David Malcolm <dmalcolm@redhat.com>
18362 * doc/invoke.texi (-fdump-analyzer-exploded-paths): New.
18364 2021-07-15 Martin Sebor <msebor@redhat.com>
18368 * fold-const.c (operand_compare::operand_equal_p): Handle OEP_DECL_NAME.
18369 (operand_compare::verify_hash_value): Same.
18370 * tree-core.h (OEP_DECL_NAME): New.
18372 2021-07-15 Martin Jambor <mjambor@suse.cz>
18374 * profile-count.h (profile_count::value): Change the return type to
18376 * gimple-pretty-print.c (dump_gimple_bb_header): Adjust print
18378 * tree-cfg.c (dump_function_to_file): Likewise.
18380 2021-07-15 Bill Schmidt <wschmidt@linux.ibm.com>
18383 * config/rs6000/rs6000-p8swap.c (has_part_mult): New.
18384 (rs6000_analyze_swaps): Insns containing a subreg of a mult are
18387 2021-07-15 Richard Biener <rguenther@suse.de>
18389 * tree-vectorizer.h (vect_gen_while): Match up with
18390 vect_gen_while_not.
18391 * tree-vect-stmts.c (vect_gen_while): Adjust API to that
18392 of vect_gen_while_not.
18393 (vect_gen_while_not): Adjust.
18394 * tree-vect-loop-manip.c (vect_set_loop_controls_directly): Likewise.
18396 2021-07-15 Aldy Hernandez <aldyh@redhat.com>
18398 * gimple-range-cache.cc (non_null_ref::adjust_range): New.
18399 (ranger_cache::range_of_def): Call adjust_range.
18400 (ranger_cache::entry_range): Same.
18401 * gimple-range-cache.h (non_null_ref::adjust_range): New.
18402 * gimple-range.cc (gimple_ranger::range_of_expr): Call
18404 (gimple_ranger::range_on_entry): Same.
18406 2021-07-15 Tamar Christina <tamar.christina@arm.com>
18409 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18411 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
18413 2021-07-15 Tamar Christina <tamar.christina@arm.com>
18416 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18418 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
18419 (sdot_prod, udot_prod): ...These.
18420 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
18421 (aarch64_<sur>dot<vsi2qi>): Rename to...
18422 (<sur>dot_prod<vsi2qi>): ...This.
18423 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
18426 2021-07-15 Jakub Jelinek <jakub@redhat.com>
18428 PR middle-end/101437
18429 * gimplify.c (gimplify_expr): Throw away volatile reads from empty
18430 types even if they have non-BLKmode TYPE_MODE.
18432 2021-07-15 Richard Biener <rguenther@suse.de>
18435 * gcc.c (process_command): Process -gtoggle like process_options
18436 would after parsing options.
18438 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18440 * cfgexpand.c (expand_asm_loc): Adjust.
18441 (expand_asm_stmt): Likewise.
18442 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Likewise.
18443 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
18444 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
18445 * config/avr/avr.c (avr_md_asm_adjust): Likewise.
18446 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
18447 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
18448 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
18449 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
18450 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
18451 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
18452 * config/s390/s390.c (s390_md_asm_adjust): Likewise.
18453 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
18454 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
18455 * doc/tm.texi: Regenerate.
18456 * target.def: Add location argument to md_asm_adjust.
18458 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18460 * tree-diagnostic.c (diagnostic_report_current_function): Use the
18461 diagnostic's location, not input_location.
18463 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18465 * cfgexpand.c (tree_conflicts_with_clobbers_p): Pass location to
18467 (expand_asm_stmt): Likewise.
18469 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
18471 * config/rs6000/rs6000.c (adjacent_mem_locations): Return the lower
18472 addressed memory rtx, if any.
18473 (rs6000_split_multireg_move): Fix code formatting.
18474 Handle MMA build built-ins with operands in adjacent memory locations.
18476 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
18478 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Move to later
18481 2021-07-14 Jason Merrill <jason@redhat.com>
18483 * sel-sched-ir.h (get_all_loop_exits): Use auto_vec.
18485 2021-07-14 Jason Merrill <jason@redhat.com>
18487 * doc/invoke.texi: -fdelete-dead-exceptions is on by default for
18490 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18492 * tree-vect-patterns.c (vect_recog_dot_prod_pattern):
18493 Remove erroneous line.
18495 2021-07-14 Andrew MacLeod <amacleod@redhat.com>
18497 * params.opt (param_evrp_mode): Change default.
18499 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18501 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
18502 (sdot_prod, udot_prod): ...These.
18503 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
18504 (aarch64_<sur>dot<vsi2qi>): Rename to...
18505 (<sur>dot_prod<vsi2qi>): ...This.
18506 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
18509 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18511 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
18513 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18515 * doc/sourcebuild.texi (arm_v8_2a_i8mm_neon_hw): Document.
18517 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18519 * config/arm/neon.md (usdot_prod<vsi2qi>): New.
18521 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18523 * config/aarch64/aarch64-simd.md (aarch64_usdot<vsi2qi>): Rename to...
18524 (usdot_prod<vsi2qi>): ... This.
18525 * config/aarch64/aarch64-simd-builtins.def (usdot): Rename to...
18526 (usdot_prod): ...This.
18527 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Likewise.
18528 * config/aarch64/aarch64-sve.md (@aarch64_<sur>dot_prod<vsi2qi>):
18530 (@<sur>dot_prod<vsi2qi>): ...This.
18531 * config/aarch64/aarch64-sve-builtins-base.cc
18532 (svusdot_impl::expand): Use it.
18534 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18536 * optabs.def (usdot_prod_optab): New.
18537 * doc/md.texi: Document it and clarify other dot prod optabs.
18538 * optabs-tree.h (enum optab_subtype): Add optab_vector_mixed_sign.
18539 * optabs-tree.c (optab_for_tree_code): Support usdot_prod_optab.
18540 * optabs.c (expand_widen_pattern_expr): Likewise.
18541 * tree-cfg.c (verify_gimple_assign_ternary): Likewise.
18542 * tree-vect-loop.c (vectorizable_reduction): Query dot-product kind.
18543 * tree-vect-patterns.c (vect_supportable_direct_optab_p): Take optional
18545 (vect_widened_op_tree): Optionally ignore
18547 (vect_recog_dot_prod_pattern): Support usdot_prod_optab.
18549 2021-07-14 H.J. Lu <hjl.tools@gmail.com>
18552 * config/i386/driver-i386.c (host_detect_local_cpu): Check
18553 "arch [32|64]" and "tune [32|64]" for 32-bit and 64-bit codegen.
18554 Enable UINTR only for 64-bit codegen.
18555 * config/i386/i386-options.c
18556 (ix86_option_override_internal::DEF_PTA): Skip PTA_UINTR if not
18558 * config/i386/i386.h (ARCH_ARG): New.
18559 (CC1_CPU_SPEC): Pass "[arch|tune] 32" for 32-bit codegen and
18560 "[arch|tune] 64" for 64-bit codegen.
18562 2021-07-14 Richard Biener <rguenther@suse.de>
18564 PR tree-optimization/101445
18565 * tree-vect-stmts.c (vectorizable_load): Do the gap adjustment
18566 of the IV in the correct direction for negative stride
18569 2021-07-14 Jakub Jelinek <jakub@redhat.com>
18572 * godump.c (godump_str_hash): New type.
18573 (godump_container::pot_dummy_types): Use string_hash instead of
18574 ptr_hash in the hash_set.
18576 2021-07-14 Richard Biener <rguenther@suse.de>
18578 * tree-vect-loop.c (vect_find_reusable_accumulator): Handle
18579 vector types where the old vector type has a multiple of
18580 the new vector type elements.
18581 (vect_create_partial_epilog): New function, split out from...
18582 (vect_create_epilog_for_reduction): ... here.
18583 (vect_transform_cycle_phi): Reduce the re-used accumulator
18584 to the new vector type.
18586 2021-07-14 Alexandre Oliva <oliva@adacore.com>
18588 * tree-ssa-alias.c (attr_fnspec::verify): Fix index in
18589 non-'t'-sized arg check.
18591 2021-07-14 Alexandre Oliva <oliva@adacore.com>
18593 * tree-cfg.c (cleanup_dead_labels_eh): Update
18594 post_landing_pad label upon change of landing pad block's
18596 (cleanup_dead_labels): Check that a removed label is not that
18599 2021-07-13 Jonathan Wright <jonathan.wright@arm.com>
18601 * combine.c (combine_simplify_rtx): Add vec_select -> subreg
18603 * config/aarch64/aarch64.md (*zero_extend<SHORT:mode><GPI:mode>2_aarch64):
18604 Add Neon to general purpose register case for zero-extend
18606 * config/arm/vfp.md (*arm_movsi_vfp): Remove "*" from *t -> r
18607 case to prevent some cases opting to go through memory.
18608 * cse.c (fold_rtx): Add vec_select -> subreg simplification.
18609 * rtl.c (rtvec_series_p): Define predicate to determine
18610 whether a vector contains a linear series of integers.
18611 * rtl.h (rtvec_series_p): Define.
18612 * rtlanal.c (vec_series_lowpart_p): Define predicate to
18613 determine if a vector selection is equivalent to the low part
18615 * rtlanal.h (vec_series_lowpart_p): Define.
18616 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
18617 Add vec_select -> subreg simplification.
18619 2021-07-13 Paul A. Clarke <pc@us.ibm.com>
18621 * config/rs6000/smmintrin.h (_mm_testz_si128, _mm_testc_si128,
18622 _mm_testnzc_si128, _mm_test_all_ones, _mm_test_all_zeros,
18623 _mm_test_mix_ones_zeros): New.
18625 2021-07-13 Roger Sayle <roger@nextmovesoftware.com>
18626 Richard Biener <rguenther@suse.de>
18628 * gimple.c (gimple_could_trap_p_1): Make S argument a
18629 "const gimple*". Preserve constness in call to
18630 gimple_asm_volatile_p.
18631 (gimple_could_trap_p): Make S argument a "const gimple*".
18632 * gimple.h (gimple_could_trap_p_1, gimple_could_trap_p):
18633 Update function prototypes.
18635 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18637 * tree-vectorizer.h (vect_reusable_accumulator): New structure.
18638 (_loop_vec_info::main_loop_edge): New field.
18639 (_loop_vec_info::skip_main_loop_edge): Likewise.
18640 (_loop_vec_info::skip_this_loop_edge): Likewise.
18641 (_loop_vec_info::reusable_accumulators): Likewise.
18642 (_stmt_vec_info::reduc_scalar_results): Likewise.
18643 (_stmt_vec_info::reused_accumulator): Likewise.
18644 (vect_get_main_loop_result): Declare.
18645 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize
18646 reduc_scalar_inputs.
18647 (vec_info::free_stmt_vec_info): Free reduc_scalar_inputs.
18648 * tree-vect-loop-manip.c (vect_get_main_loop_result): New function.
18649 (vect_do_peeling): Fill an epilogue loop's main_loop_edge,
18650 skip_main_loop_edge and skip_this_loop_edge fields.
18651 * tree-vect-loop.c (INCLUDE_ALGORITHM): Define.
18652 (vect_emit_reduction_init_stmts): New function.
18653 (get_initial_def_for_reduction): Use it.
18654 (get_initial_defs_for_reduction): Likewise. Change the vinfo
18655 parameter to a loop_vec_info.
18656 (vect_create_epilog_for_reduction): Store the scalar results
18657 in the reduc_info. If an epilogue loop is reusing an accumulator
18658 from the main loop, and if the epilogue loop can also be skipped,
18659 try to place the reduction code in the join block. Record
18660 accumulators that could potentially be reused by epilogue loops.
18661 (vect_transform_cycle_phi): When vectorizing epilogue loops,
18662 try to reuse accumulators from the main loop. Record the initial
18663 value in reduc_info for non-SLP reductions too.
18665 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18667 * tree-vect-loop.c (get_initial_def_for_reduction): Remove
18668 adjustment handling. Take the neutral value as an argument,
18669 in place of the code argument.
18670 (vect_transform_cycle_phi): Update accordingly. Handle the
18671 initial values of cond reductions separately from code reductions.
18672 Choose the adjustment here rather than in
18673 get_initial_def_for_reduction. Sink the splat of vec_initial_def.
18675 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18677 * tree-vect-loop.c (neutral_op_for_slp_reduction): Replace with...
18678 (neutral_op_for_reduction): ...this, providing a more general
18680 (vect_create_epilog_for_reduction): Update accordingly.
18681 (vectorizable_reduction): Likewise.
18682 (vect_transform_cycle_phi): Likewise.
18684 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18686 * tree-vect-loop.c (get_initial_def_for_reduction): Take the
18687 reduc_info instead of the original stmt_vec_info.
18688 (vect_transform_cycle_phi): Update accordingly.
18690 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18692 * tree-vect-loop.c (get_initial_defs_for_reduction): Take the
18693 reduc_info as an additional parameter.
18694 (vect_transform_cycle_phi): Update accordingly.
18696 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18698 * tree-vectorizer.h: Include tree-ssa-operands.h.
18699 (vect_phi_initial_value): New function.
18700 * tree-vect-loop.c (neutral_op_for_slp_reduction): Use it.
18701 (get_initial_defs_for_reduction, info_for_reduction): Likewise.
18702 (vect_create_epilog_for_reduction, vectorizable_reduction): Likewise.
18703 (vect_transform_cycle_phi, vectorizable_induction): Likewise.
18705 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18707 * tree-vect-loop.c (vect_create_epilog_for_reduction): Convert
18708 the phi results to vectype after creating them. Remove later
18709 conversion code that thus becomes redundant.
18711 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18713 * tree-vect-loop.c (vect_create_epilog_for_reduction): Replace
18714 the new_phis vector with a reduc_inputs vector. Combine handling
18715 of reduction chains and ncopies > 1.
18717 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18719 * tree-vect-loop.c (vect_create_epilog_for_reduction): Truncate
18720 scalar_results to group_size elements after reducing down from
18721 N*group_size elements. Construct an array_slice of the live-out
18722 stmts and assert that there is one stmt per scalar result.
18724 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18726 * tree-vect-loop.c (vect_create_epilog_for_reduction): Remove
18727 nested_in_vect_loop and use double_reduc everywhere. Remove dead
18728 assignment to "loop".
18730 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18732 * internal-fn.c (vectorized_internal_fn_supported_p): Handle
18733 vector types first. For scalar types, consider both the preferred
18734 vector mode and the alternative vector modes.
18735 * optabs-query.c (can_vec_mask_load_store_p): Use the same
18736 structure as above, in particular using related_vector_mode
18737 for modes provided by autovectorize_vector_modes.
18739 2021-07-13 Jakub Jelinek <jakub@redhat.com>
18740 Richard Biener <rguenther@suse.de>
18742 PR tree-optimization/101419
18743 * tree-pass.h (PROP_objsz): Define.
18744 (make_pass_early_object_sizes): Declare.
18745 * passes.def (pass_all_early_optimizations): Rename pass_object_sizes
18746 there to pass_early_object_sizes, drop parameter.
18747 (pass_all_optimizations): Move pass_object_sizes right after pass_ccp,
18748 drop parameter, move pass_post_ipa_warn right after that.
18749 * tree-object-size.c (pass_object_sizes::execute): Rename to...
18750 (object_sizes_execute): ... this. Add insert_min_max_p argument.
18751 (pass_data_object_sizes): Move after object_sizes_execute.
18752 (pass_object_sizes): Likewise. In execute method call
18753 object_sizes_execute, drop set_pass_param method and insert_min_max_p
18754 non-static data member and its initializer in the ctor.
18755 (pass_data_early_object_sizes, pass_early_object_sizes,
18756 make_pass_early_object_sizes): New.
18757 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Use
18758 (cfun->curr_properties & PROP_objsz) instead of cfun->after_inlining.
18760 2021-07-13 Kito Cheng <kito.cheng@sifive.com>
18763 * config/riscv/constraints.md ("S"): Update description and remove
18765 * doc/md.texi (Machine Constraints): Document the 'S' constraints
18768 2021-07-13 Richard Biener <rguenther@suse.de>
18771 2021-07-12 Richard Biener <rguenther@suse.de>
18773 * tree-vect-slp.c (vect_slp_region): Show the number of
18774 SLP graph entries in the optimization message.
18776 2021-07-13 Michael Meissner <meissner@linux.ibm.com>
18778 * config/rs6000/altivec.md (xxspltiw_v4sf): Change local variable
18780 * config/rs6000/rs6000-protos.h (rs6000_const_f32_to_i32): Change
18781 return type to long.
18782 * config/rs6000/rs6000.c (rs6000_const_f32_to_i32): Change return
18785 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
18787 * gimple-range-fold.cc (fold_using_range::range_of_builtin_ubsan_call):
18788 Query relation between the 2 operands and use it.
18790 2021-07-12 Sergei Trofimovich <siarheit@google.com>
18792 * doc/cfg.texi: Fix s/ei_safe_safe/ei_safe_edge/ typo.
18794 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
18797 * config/i386/predicates.md (vec_setm_sse41_operand):
18798 Rename from vec_setm_operand.
18799 (vec_setm_avx2_operand): New predicate.
18800 * config/i386/sse.md (vec_set<V_128:mode>): Use V_128 mode iterator.
18801 Use vec_setm_sse41_operand as operand 2 predicate.
18802 (vec_set<V_256_512:mode): New expander.
18803 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_sse41_operand
18804 as operand 2 predicate.
18806 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
18808 PR tree-optimization/101335
18809 * range-op.cc (operator_cast::lhs_op1_relation): Delete.
18811 2021-07-12 Andrew Pinski <apinski@marvell.com>
18813 * tree-ssa-phiopt.c (match_simplify_replacement): Move
18814 insert of the sequence before the movement of the
18815 statement. Check if to see if the statement is used
18816 outside of the original phi to see if we should move it.
18818 2021-07-12 Richard Biener <rguenther@suse.de>
18820 * dump-context.h (debug_dump_context::debug_dump_context):
18821 Add FILE * parameter defaulted to stderr.
18822 * dumpfile.c (debug_dump_context::debug_dump_context): Adjust.
18823 * tree-vect-slp.c (dot_slp_tree): New functions.
18825 2021-07-12 Richard Biener <rguenther@suse.de>
18827 PR tree-optimization/101373
18828 * tree-ssa-pre.c (prune_clobbered_mems): Also prune trapping
18829 references when the BB may not return.
18830 (compute_avail): Pass in the function we're working on and
18831 replace cfun references with it. Externally throwing
18832 const calls also possibly terminate the function.
18833 (pass_pre::execute): Pass down the function we're working on.
18834 * gcse.c (compute_hash_table_work): Externally throwing
18835 const/pure calls also need record_last_mem_set_info.
18836 * postreload-gcse.c (record_opr_changes): Looping or externally
18837 throwing const/pure calls also need record_last_mem_set_info.
18839 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
18841 * recog.c (memory_address_addr_space_p): Change the type to bool.
18842 Return true/false instead of 1/0.
18843 (offsettable_memref_p): Ditto.
18844 (offsettable_nonstrict_memref_p): Ditto.
18845 (offsettable_address_addr_space_p): Ditto.
18846 Change the type of addressp indirect function to bool.
18847 * recog.h (memory_address_addr_space_p): Change the type to bool.
18848 (strict_memory_address_addr_space_p): Ditto.
18849 (offsettable_memref_p): Ditto.
18850 (offsettable_nonstrict_memref_p): Ditto.
18851 (offsettable_address_addr_space_p): Ditto.
18852 * reload.c (maybe_memory_address_addr_space_p): Ditto.
18853 (strict_memory_address_addr_space_p): Change the type to bool.
18854 Return true/false instead of 1/0.
18855 (maybe_memory_address_addr_space_p): Change the type to bool.
18857 2021-07-12 Richard Biener <rguenther@suse.de>
18859 * tree-vect-slp.c (vect_slp_region): Show the number of
18860 SLP graph entries in the optimization message.
18862 2021-07-12 Richard Biener <rguenther@suse.de>
18864 PR tree-optimization/101394
18865 * tree-ssa-pre.c (do_pre_regular_insertion): Avoid inserting
18866 copies from abnormals for a full redundancy.
18868 2021-07-12 Richard Biener <rguenther@suse.de>
18870 PR middle-end/101423
18871 * gimple.c (gimple_could_trap_p_1): Internal function calls
18873 * tree-eh.c (tree_could_trap_p): Likewise.
18875 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
18878 * config/arm/arm_neon.h (vmul_n_u32): Replace call to builtin with
18880 (vmulq_n_u32): Likewise.
18881 (vmul_n_f32): Gate __a * __b on __FAST_MATH__.
18882 (vmulq_n_f32): Likewise.
18883 (vmul_n_f16): Likewise.
18884 (vmulq_n_f16): Likewise.
18886 2021-07-12 Martin Liska <mliska@suse.cz>
18888 PR sanitizer/101425
18889 * gcc.c (check_offload_target_name): Call
18890 candidates_list_and_hint only if we have a candidate.
18892 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
18895 * config/arm/neon.md (vec_init): Move to ...
18896 * config/arm/vec-common.md (vec_init): ... here.
18897 Change the pattern's mode to VDQX and gate it on VALID_MVE_MODE.
18899 2021-07-12 Roger Sayle <roger@nextmovesoftware.com>
18901 PR tree-optimization/101403
18902 * match.pd ((T)bswap(X)>>C): Correctly handle cases where
18903 signedness of the shift is not the same as the signedness of
18904 the type extension.
18906 2021-07-09 Roger Sayle <roger@nextmovesoftware.com>
18907 Uroš Bizjak <ubizjak@gmail.com>
18909 * config/i386/i386.md (*divmodsi4_const): Optimize SImode
18910 divmod of a constant numerator with new define_insn_and_split.
18912 2021-07-09 Iain Sandoe <iain@sandoe.co.uk>
18915 * config/i386/i386-expand.c (ix86_expand_call): If a call is
18916 to a non-local-binding, or local but to a public symbol, then
18917 assume that it might be indirected via the lazy symbol binder.
18918 Mark R10 and R10 as clobbered in that case.
18920 2021-07-09 Eric Botcazou <ebotcazou@adacore.com>
18923 * gcc.c (ASM_DEBUG_DWARF_OPTION): Set again to --gdwarf2 in
18924 the case where HAVE_AS_WORKING_DWARF_N_FLAG is not defined
18925 and HAVE_LD_BROKEN_PE_DWARF5 is defined.
18927 2021-07-09 Uroš Bizjak <ubizjak@gmail.com>
18929 * config/i386/i386.md (*udivmodsi4_pow2_zext_1): Limit the
18930 log2 range of operands[3] to [1,31].
18931 (*udivmodsi4_pow2_zext_2): Ditto. Correct insn RTX pattern.
18933 2021-07-09 Sergei Trofimovich <siarheit@google.com>
18935 * doc/md.texi: Don't split @smallexample in multiple @groups.
18937 2021-07-09 Sergei Trofimovich <siarheit@google.com>
18939 * doc/md.texi: Add missing 'see' word.
18941 2021-07-09 Andrew Pinski <apinski@marvell.com>
18943 * tree-ssa-phiopt.c (phiopt_early_allow): Change arguments
18944 to take sequence and gimple_match_op. Accept the case where
18945 op is a SSA_NAME and one statement in the sequence.
18946 Also allow constants.
18947 (gimple_simplify_phiopt): Always pass a sequence to resimplify.
18948 Update call to phiopt_early_allow. Discard the sequence if not
18951 2021-07-09 Xi Ruoyao <xry111@mengyan1223.wang>
18956 * config/mips/mips.c (mips_const_insns): Use MSA_SUPPORTED_MODE_P
18957 instead of ISA_HAS_MSA.
18958 (mips_expand_vec_unpack): Likewise.
18959 (mips_expand_vector_init): Likewise.
18961 2021-07-09 Kewen Lin <linkw@linux.ibm.com>
18963 * config/rs6000/vsx.md (mods_<mode>): Rename to...
18964 (mod<mode>3): ... this.
18965 (modu_<mode>): Rename to...
18966 (umod<mode>3): ... this.
18967 * config/rs6000/rs6000-builtin.def (MODS_V2DI, MODS_V4SI, MODU_V2DI,
18968 MODU_V4SI): Adjust.
18970 2021-07-08 Jeff Law <jeffreyalaw@gmail.com>
18972 * config/h8300/shiftrotate.md (variable shifts): Expose condition
18973 code handling for the test before the loop.
18975 2021-07-08 Martin Jambor <mjambor@suse.cz>
18978 * ipa-sra.c (class isra_call_summary): New member
18979 m_before_any_store, initialize it in the constructor.
18980 (isra_call_summary::dump): Dump the new field.
18981 (ipa_sra_call_summaries::duplicate): Copy it.
18982 (process_scan_results): Set it.
18983 (isra_write_edge_summary): Stream it.
18984 (isra_read_edge_summary): Likewise.
18985 (param_splitting_across_edge): Only override
18986 safe_to_import_accesses if m_before_any_store is set.
18988 2021-07-08 Martin Sebor <msebor@redhat.com>
18990 PR bootstrap/101374
18991 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref):
18992 Use Object Size Type 0 instead of 1.
18994 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
18996 * tree-vect-loop.c (vectorizable_reduction): Remove always-true
18999 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
19001 * match.pd: Simplify an extend-operate-truncate sequence involving
19004 2021-07-08 Roger Sayle <roger@nextmovesoftware.com>
19005 Richard Biener <rguenther@suse.de>
19007 PR tree-optimization/40210
19008 * match.pd (bswap optimizations): Simplify (bswap(x)>>C1)&C2 as
19009 (x>>C3)&C2 when possible. Simplify bswap(x)>>C1 as ((T)x)>>C2
19010 when possible. Simplify bswap(x)&C1 as (x>>C2)&C1 when 0<=C1<=255.
19012 2021-07-08 Uroš Bizjak <ubizjak@gmail.com>
19015 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
19017 * config/i386/mmx.md (V_32): New mode iterator.
19018 (mov<V_32:mode>): Use V_32 mode iterator.
19019 (*mov<V_32:mode>_internal): Ditto.
19020 (*push<V_32:mode>2_rex64): Ditto.
19021 (*push<V_32:mode>2): Ditto.
19022 (movmisalign<V_32:mode>): Ditto.
19023 (mmx_<any_shiftrt:insn>v1si3): New insn pattern.
19024 (sse4_1_<any_extend:code>v2qiv2hi2): Ditto.
19025 (vec_unpacks_lo_v4qi): New expander.
19026 (vec_unpacks_hi_v4qi): Ditto.
19027 (vec_unpacku_lo_v4qi): Ditto.
19028 (vec_unpacku_hi_v4qi): Ditto.
19029 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add V1SImode.
19030 (VALID_INT_MODE_P): Ditto.
19032 2021-07-08 Michael Meissner <meissner@linux.ibm.com>
19035 * config/rs6000/rs6000.md (udivti3): New insn.
19036 (divti3): New insn.
19037 (umodti3): New insn.
19038 (modti3): New insn.
19040 2021-07-07 Martin Sebor <msebor@redhat.com>
19042 PR tree-optimization/100137
19043 PR tree-optimization/99121
19044 PR tree-optimization/97027
19045 * builtins.c (access_ref::access_ref): Also set offmax.
19046 (access_ref::offset_in_range): Define new function.
19047 (access_ref::add_offset): Set offmax.
19048 (access_ref::inform_access): Handle access_none.
19049 (handle_mem_ref): Clear ostype.
19050 (compute_objsize_r): Handle ASSERT_EXPR.
19051 * builtins.h (struct access_ref): Add offmax member.
19052 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Use
19053 compute_objsize() and simplify.
19055 2021-07-07 Peter Bergner <bergner@linux.ibm.com>
19057 * config/rs6000/rs6000-call.c (mma_init_builtins): Use VSX_BUILTIN_LXVP
19058 and VSX_BUILTIN_STXVP.
19060 2021-07-07 Martin Sebor <msebor@redhat.com>
19063 * config/aarch64/aarch64.c (aarch64_simd_lane_bounds): Remove
19064 a stray %K from error_at() missed in r12-2088.
19066 2021-07-07 Richard Biener <rguenther@suse.de>
19068 PR tree-optimization/99728
19069 * tree-ssa-loop-im.c (gather_mem_refs_stmt): Record
19071 (mem_refs_may_alias_p): Add assert we handled aggregate
19073 (sm_seq_valid_bb): Give up when running into aggregate copies.
19074 (ref_indep_loop_p): Handle aggregate copies as never
19075 being invariant themselves but allow other refs to be
19076 disambiguated against them.
19077 (can_sm_ref_p): Do not try to apply store-motion to aggregate
19080 2021-07-06 Indu Bhagat <indu.bhagat@oracle.com>
19083 * dwarf2ctf.c (ctf_get_AT_data_member_location): Multiply by 8 to get
19086 2021-07-06 Martin Sebor <msebor@redhat.com>
19088 * gimple-pretty-print.c (percent_G_format): Remove.
19089 * tree-diagnostic.c (default_tree_printer): Remove calls.
19090 * tree-pretty-print.c (percent_K_format): Remove.
19091 * tree-pretty-print.h (percent_K_format): Remove.
19093 2021-07-06 Martin Sebor <msebor@redhat.com>
19095 * config/aarch64/aarch64-builtins.c (aarch64_simd_expand_builtin):
19096 Remove %K and use error_at.
19097 (aarch64_expand_fcmla_builtin): Same.
19098 (aarch64_expand_builtin_tme): Same.
19099 (aarch64_expand_builtin_memtag): Same.
19100 * config/arm/arm-builtins.c (arm_expand_acle_builtin): Same.
19101 (arm_expand_builtin): Same.
19102 * config/arm/arm.c (bounds_check): Same.
19104 2021-07-06 Martin Sebor <msebor@redhat.com>
19106 * builtins.c (warn_string_no_nul): Remove %G.
19107 (maybe_warn_for_bound): Same.
19108 (warn_for_access): Same.
19109 (check_access): Same.
19110 (check_strncat_sizes): Same.
19111 (expand_builtin_strncat): Same.
19112 (expand_builtin_strncmp): Same.
19113 (expand_builtin): Same.
19114 (expand_builtin_object_size): Same.
19115 (warn_dealloc_offset): Same.
19116 (maybe_emit_free_warning): Same.
19117 * calls.c (maybe_warn_alloc_args_overflow): Same.
19118 (maybe_warn_nonstring_arg): Same.
19119 (maybe_warn_rdwr_sizes): Same.
19120 * expr.c (expand_expr_real_1): Remove %K.
19121 * gimple-fold.c (gimple_fold_builtin_strncpy): Remove %G.
19122 (gimple_fold_builtin_strncat): Same.
19123 * gimple-ssa-sprintf.c (format_directive): Same.
19124 (handle_printf_call): Same.
19125 * gimple-ssa-warn-alloca.c (pass_walloca::execute): Same.
19126 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
19127 (maybe_diag_access_bounds): Same. Call gimple_location.
19128 (check_bounds_or_overlap): Same.
19129 * trans-mem.c (ipa_tm_scan_irr_block): Remove %K. Simplify.
19130 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Remove %G.
19131 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
19132 (maybe_diag_stxncpy_trunc): Same.
19133 (handle_builtin_stxncpy_strncat): Same.
19134 (maybe_warn_pointless_strcmp): Same.
19135 * tree-ssa-uninit.c (maybe_warn_operand): Same.
19137 2021-07-06 Uroš Bizjak <ubizjak@gmail.com>
19140 * config/i386/predicates.md (vec_setm_operand): Enable
19141 register_operand for TARGET_SSE4_1.
19142 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_operand
19143 as operand 2 predicate. Call ix86_expand_vector_set_var
19144 for non-constant index operand.
19145 (vec_setv4qi): Use vec_setm_mmx_operand as operand 2 predicate.
19146 Call ix86_expand_vector_set_var for non-constant index operand.
19148 2021-07-06 Jeff Law <jeffreyalaw@gmail.com>
19150 * config/h8300/jumpcall.md (*branch): When possible, generate
19151 the comparison in CCZN mode.
19152 * config/h8300/predicates.md (simple_memory_operand): Reject all
19153 auto-increment addressing modes.
19155 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
19157 PR bootstrap/100246
19158 * config/i386/i386.h (struct stringop_algs): Define a CTOR for
19161 2021-07-06 Richard Biener <rguenther@suse.de>
19163 * doc/md.texi (vec_fmaddsub<mode>4): Document.
19164 (vec_fmsubadd<mode>4): Likewise.
19165 * optabs.def (vec_fmaddsub$a4): Add.
19166 (vec_fmsubadd$a4): Likewise.
19167 * internal-fn.def (IFN_VEC_FMADDSUB): Add.
19168 (IFN_VEC_FMSUBADD): Likewise.
19169 * tree-vect-slp-patterns.c (addsub_pattern::recognize):
19170 Refactor to handle IFN_VEC_FMADDSUB and IFN_VEC_FMSUBADD.
19171 (addsub_pattern::build): Likewise.
19172 * tree-vect-slp.c (vect_optimize_slp): CFN_VEC_FMADDSUB
19173 and CFN_VEC_FMSUBADD are not transparent for permutes.
19174 * config/i386/sse.md (vec_fmaddsub<mode>4): New expander.
19175 (vec_fmsubadd<mode>4): Likewise.
19177 2021-07-06 Richard Biener <rguenther@suse.de>
19179 * doc/invoke.texi (fmove-loop-stores): Document.
19180 * common.opt (fmove-loop-stores): New option.
19181 * opts.c (default_options_table): Enable -fmove-loop-stores
19182 at -O1 but not -Og.
19183 * tree-ssa-loop-im.c (pass_lim::execute): Pass
19184 flag_move_loop_stores instead of true to
19185 loop_invariant_motion_in_fun.
19187 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
19189 * doc/install.texi: Document --with-dsymutil.
19191 2021-07-06 Andrew Pinski <apinski@marvell.com>
19193 PR tree-optimization/101256
19194 * dbgcnt.def (phiopt_edge_range): New counter.
19195 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
19196 Check to make sure the new name is defined in the same
19197 bb as the conditional before duplicating range info.
19198 Also add debug counter.
19200 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
19202 PR rtl-optimization/100328
19203 * config/i386/i386-options.c (ix86_option_override_internal):
19204 Set param_ira_consider_dup_in_all_alts to 0.
19206 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
19208 PR rtl-optimization/100328
19209 * doc/invoke.texi (ira-consider-dup-in-all-alts): Document new
19211 * ira.c (ira_get_dup_out_num): Adjust as parameter
19212 param_ira_consider_dup_in_all_alts.
19213 * params.opt (ira-consider-dup-in-all-alts): New.
19214 * ira-conflicts.c (process_regs_for_copy): Add one parameter
19215 single_input_op_has_cstr_p.
19216 (get_freq_for_shuffle_copy): New function.
19217 (add_insn_allocno_copies): Adjust as single_input_op_has_cstr_p.
19218 * ira-int.h (ira_get_dup_out_num): Add one bool parameter.
19220 2021-07-05 Jeff Law <jeffreyalaw@gmail.com>
19222 * config/h8300/shiftrotate.md (shift-by-variable patterns): Update to
19223 generate condition code aware RTL directly.
19225 2021-07-05 Andrew Pinski <apinski@marvell.com>
19227 PR tree-optimization/101039
19228 * match.pd (A CMP 0 ? A : -A): New patterns.
19229 * tree-ssa-phiopt.c (abs_replacement): Delete function.
19230 (tree_ssa_phiopt_worker): Don't call abs_replacement.
19231 Update comment about abs_replacement.
19233 2021-07-05 Andrew Pinski <apinski@marvell.com>
19235 * tree-ssa-phiopt.c (gimple_simplify_phiopt):
19236 If "A ? B : C" fails to simplify, try "(!A) ? C : B".
19238 2021-07-05 Andrew Pinski <apinski@marvell.com>
19240 * tree-ssa-phiopt.c (match_simplify_replacement):
19241 Add early_p argument. Call gimple_simplify_phiopt
19242 instead of gimple_simplify.
19243 (tree_ssa_phiopt_worker): Update call to
19244 match_simplify_replacement and allow unconditionally.
19245 (phiopt_early_allow): New function.
19246 (gimple_simplify_phiopt): New function.
19248 2021-07-05 Andrew Pinski <apinski@marvell.com>
19250 PR middle-end/101237
19251 * fold-const.c (negate_expr_p): Remove call to element_mode
19252 and TREE_MODE/TREE_TYPE when calling HONOR_SIGNED_ZEROS,
19253 HONOR_SIGN_DEPENDENT_ROUNDING, and HONOR_SNANS.
19254 (fold_negate_expr_1): Likewise.
19255 (const_unop): Likewise.
19256 (fold_cond_expr_with_comparison): Likewise.
19257 (fold_binary_loc): Likewise.
19258 (fold_ternary_loc): Likewise.
19259 (tree_call_nonnegative_warnv_p): Likewise.
19260 * match.pd (-(A + B) -> (-B) - A): Likewise.
19262 2021-07-05 Iain Sandoe <iain@sandoe.co.uk>
19264 * configure.ac: Handle --with-dsymutil in the same way as we
19265 do for the assembler and linker. (DEFAULT_DSYMUTIL): New.
19266 Extract the type and version for the dsymutil configured or
19267 found by the default searches.
19268 * config.in: Regenerated.
19269 * configure: Regenerated.
19270 * collect2.c (do_dsymutil): Handle locating dsymutil in the
19271 same way as for the assembler and linker.
19272 * config/darwin.h (DSYMUTIL): Delete.
19273 * gcc.c: Report a configured dsymutil correctly.
19274 * exec-tool.in: Allow for dsymutil.
19276 2021-07-05 Uroš Bizjak <ubizjak@gmail.com>
19278 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
19279 Handle V4QI and V2HI modes.
19280 (expand_vec_perm_blend): Allow 4-byte vector modes with TARGET_SSE4_1.
19281 Handle V4QI mode. Emit mmx_pblendvb32 for 4-byte modes.
19282 (expand_vec_perm_pshufb): Rewrite to use switch statemets.
19283 Handle 4-byte dual operands with TARGET_XOP and single operands
19284 with TARGET_SSSE3. Emit mmx_ppermv32 for TARGET_XOP and
19285 mmx_pshufbv4qi3 for TARGET_SSSE3.
19286 (expand_vec_perm_pblendv): Allow 4-byte vector modes with TARGET_SSE4_1.
19287 (expand_vec_perm_interleave2): Allow 4-byte vector modes.
19288 (expand_vec_perm_pshufb2): Allow 4-byte vector modes with TARGET_SSSE3.
19289 (expand_vec_perm_even_odd_1): Handle V4QI mode.
19290 (expand_vec_perm_broadcast_1): Handle V4QI mode.
19291 (ix86_vectorize_vec_perm_const): Handle V4QI mode.
19292 * config/i386/mmx.md (mmx_ppermv32): New insn pattern.
19293 (mmx_pshufbv4qi3): Ditto.
19294 (*mmx_pblendw32): Ditto.
19295 (*mmx_pblendw64): Rename from *mmx_pblendw.
19296 (mmx_punpckhbw_low): New insn_and_split pattern.
19297 (mmx_punpcklbw_low): Ditto.
19299 2021-07-05 Richard Biener <rguenther@suse.de>
19301 * tree-vect-loop-manip.c (vect_loop_versioning): Do not
19302 set LOOP_C_INFINITE on the vectorized loop.
19304 2021-07-05 Richard Biener <rguenther@suse.de>
19306 PR middle-end/101291
19307 * cfgloopmanip.c (loop_version): Set the loop copy of the
19308 versioned loop to the new loop.
19310 2021-07-04 Iain Sandoe <iain@sandoe.co.uk>
19313 * config.gcc: Ensure that Darwin biarch definitions are
19314 added before i386.h.
19315 * config/i386/darwin.h (TARGET_64BIT): Remove.
19316 (PR80556_WORKAROUND): New.
19317 (REAL_LIBGCC_SPEC): Amend to use PR80556_WORKAROUND.
19318 (DARWIN_SUBARCH_SPEC): New.
19319 * config/i386/darwin32-biarch.h (TARGET_64BIT_DEFAULT,
19320 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
19321 (REAL_LIBGCC_SPEC): Remove.
19322 * config/i386/darwin64-biarch.h (TARGET_64BIT_DEFAULT,
19323 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
19324 (REAL_LIBGCC_SPEC): Remove.
19326 2021-07-03 H.J. Lu <hjl.tools@gmail.com>
19328 PR middle-end/101294
19329 * expr.c (store_constructor): Don't use vec_duplicate on vector.
19331 2021-07-02 Martin Sebor <msebor@redhat.com>
19333 PR middle-end/98871
19334 PR middle-end/98512
19335 * diagnostic.c (get_any_inlining_info): New.
19336 (update_effective_level_from_pragmas): Handle inlining context.
19337 (diagnostic_enabled): Same.
19338 (diagnostic_report_diagnostic): Same.
19339 * diagnostic.h (struct diagnostic_info): Add ctor.
19340 (struct diagnostic_context): Add new member.
19341 * tree-diagnostic.c (set_inlining_locations): New.
19342 (tree_diagnostics_defaults): Set new callback pointer.
19344 2021-07-02 Peter Bergner <bergner@linux.ibm.com>
19346 * config/rs6000/rs6000-builtin.def (BU_MMA_PAIR_LD, BU_MMA_PAIR_ST):
19348 (__builtin_vsx_lxvp, __builtin_vsx_stxvp): New built-ins.
19349 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Expand
19350 lxvp and stxvp built-ins.
19351 (mma_init_builtins): Handle lxvp and stxvp built-ins.
19352 (builtin_function_type): Likewise.
19353 * doc/extend.texi (__builtin_vsx_lxvp, __builtin_mma_stxvp): Document.
19355 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19357 * config/h8300/h8300-protos.h (compute_a_shift_cc): Accept
19358 additional argument for the code.
19359 * config/h8300/h8300.c (compute_a_shift_cc): Accept additional
19360 argument for the code. Just return if the ZN bits are useful or
19361 not rather than the old style CC_* enums.
19362 * config/h8300/shiftrotate.md (shiftqi_noscratch): Move before
19363 more generic shiftqi patterns.
19364 (shifthi_noscratch, shiftsi_noscratch): Similarly.
19365 (shiftqi_noscratch_set_flags): New pattern.
19366 (shifthi_noscratch_set_flags, shiftsi_noscratch_set_flags): Likewise.
19368 2021-07-02 Andrew MacLeod <amacleod@redhat.com>
19370 PR tree-optimization/101223
19371 * range-op.cc (build_lt): Add -1 for signed values.
19372 (built_gt): Subtract -1 for signed values.
19374 2021-07-02 David Faust <david.faust@oracle.com>
19376 * btfout.c (get_btf_kind): Support BTF_KIND_FLOAT.
19377 (btf_asm_type): Likewise.
19379 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19381 * config/h8300/h8300-protos.h (output_a_shift): Make first argument
19382 an array of rtx rather than a pointer to rtx. Add code argument.
19383 (compute_a_shift_length): Similarly.
19384 * config/h8300/h8300.c (h8300_shift_costs): Adjust now that the
19385 shift itself isn't an operand. Create dummy operand[0] to carry
19386 a mode and pass a suitable rtx code to compute_a_shift_length.
19387 (get_shift_alg): Adjust operand number of clobber in output templates.
19388 (output_a_shift): Make first argument an array of rtx rather than
19389 a pointer to rtx. Add code argument for the type of shift.
19390 Adjust now that the shift itself is no longer an operand.
19391 (compute_a_shift_length): Similarly.
19392 * config/h8300/shiftrotate.md (shiftqi, shifthi, shiftsi): Use an
19393 iterator rather than nshift_operator.
19394 (shiftqi_noscratch, shifthi_noscratch, shiftsi_noscratch): Likewise.
19395 (shiftqi_clobber_flags): Adjust to API changes in output_a_shift
19396 and compute_a_shift_length.
19397 (shiftqi_noscratch_clobber_flags): Likewise.
19398 (shifthi_noscratch_clobber_flags): Likewise.
19399 (shiftsi_noscratch_clobber_flags): Likewise.
19401 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
19404 * config/darwin.h (DSYMUTIL_SPEC): Do not try to run
19405 dsymutil for BTF/CTF.
19407 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
19410 * config/darwin.h (CTF_INFO_SECTION_NAME): Update the
19411 segment to include BTF.
19412 (BTF_INFO_SECTION_NAME): New.
19414 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19416 * config/m32r/m32r-protos.h (call_operand): Adjust return type.
19417 (small_data_operand, memreg_operand, small_insn_p): Likewise.
19418 * config/m32r/m32r.c (call_operand): Adjust return type.
19419 (small_data_operand, memreg_operand): Likewise.
19421 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19423 * config/frv/frv-protos.h (integer_register_operand): Adjust return
19425 (frv_load_operand, gpr_or_fpr_operand, gpr_no_subreg_operand): Likewise.
19426 (fpr_or_int6_operand, gpr_or_int_operand); Likewise.
19427 (gpr_or_int12_operand, gpr_or_int10_operand); Likewise.
19428 (move_source_operand, move_destination_operand): Likewise.
19429 (condexec_source_operand, condexec_dest_operand): Likewise.
19430 (lr_operand, gpr_or_memory_operand, fpr_or_memory_operand): Likewise.
19431 (reg_or_0_operand, fcc_operand, icc_operand, cc_operand): Likewise.
19432 (fcr_operand, icr_operand, cr_operand, call_operand): Likewise.
19433 (fpr_operand, even_reg_operand, odd_reg_operand): Likewise.
19434 (even_gpr_operand, odd_gpr_operand, quad_fpr_operand): Likewise.
19435 (even_fpr_operand, odd_fpr_operand): Likewise.
19436 (dbl_memory_one_insn_operand, dbl_memory_two_insn_operand): Likewise.
19437 (int12_operand, int6_operand, int5_operand, uint5_operand): Likewise.
19438 (uint4_operand, uint1_operand, int_2word_operand): Likewise
19439 (upper_int16_operand, uint16_operand, symbolic_operand): Likewise.
19440 (relational_operator, float_relational_operator): Likewise.
19441 (ccr_eqne_operator, minmax_operator): Likewise.
19442 (condexec_si_binary_operator, condexec_si_media_operator): Likewise.
19443 (condexec_si_divide_operator, condexec_si_unary_operator): Likewise.
19444 (condexec_sf_conv_operator, condexec_sf_add_operator): Likewise.
19445 (intop_compare_operator, acc_operand, even_acc_operand): Likewise.
19446 (quad_acc_operand, accg_operand): Likewise.
19448 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19450 * config/stormy16/stormy16-protos.h (xstormy16_below_100_symbol): Change
19451 return type to a bool.
19452 (nonimmediate_nonstack_operand): Likewise.
19453 (xstormy16_splittable_below100_operand): Likewise.
19454 * config/stormy16/stormy16.c (xstormy16_below_100_symbol): Fix
19456 (xstormy16_splittable_below100_operand): Likewise.
19458 2021-07-02 Richard Biener <rguenther@suse.de>
19460 PR tree-optimization/101293
19461 * tree-ssa-loop-im.c (mem_ref_hasher::equal): Compare MEM_REF bases
19462 with combined offsets.
19463 (gather_mem_refs_stmt): Hash MEM_REFs as if their offset were
19464 combined with the rest of the offset.
19466 2021-07-02 Eric Botcazou <ebotcazou@adacore.com>
19468 * config/i386/i386.c (asm_preferred_eh_data_format): Always use the
19469 PIC encodings for PE-COFF targets.
19471 2021-07-02 Jakub Jelinek <jakub@redhat.com>
19474 * config/i386/i386-expand.c (ix86_broadcast_from_integer_constant):
19475 Return nullptr for TImode inner mode.
19477 2021-07-02 Richard Biener <rguenther@suse.de>
19479 PR tree-optimization/101280
19480 PR tree-optimization/101173
19481 * gimple-loop-interchange.cc
19482 (tree_loop_interchange::valid_data_dependences): Properly
19483 guard all dependence checks with DDR_REVERSED_P or its
19486 2021-07-02 Hongyu Wang <hongyu.wang@intel.com>
19488 * config/i386/i386-expand.c (ix86_expand_builtin):
19489 Add branch to clear odata when ZF is set for asedecenc_expand
19490 and wideaesdecenc_expand.
19492 2021-07-02 Eugene Rozenfeld <erozen@microsoft.com>
19494 * config/i386/gcc-auto-profile: regenerate
19496 2021-07-02 liuhongt <hongtao.liu@intel.com>
19498 * config/i386/sse.md (trunc<mode><pmov_dst_4>2): Refined to ..
19499 (trunc<mode><pmov_dst_4_lower>2): this.
19501 2021-07-01 David Malcolm <dmalcolm@redhat.com>
19503 * diagnostic.h (diagnostic_context::m_file_cache): New field.
19504 * input.c (class fcache): Rename to...
19505 (class file_cache_slot): ...this, making most members private and
19506 prefixing fields with "m_".
19507 (file_cache_slot::get_file_path): New accessor.
19508 (file_cache_slot::get_use_count): New accessor.
19509 (file_cache_slot::missing_trailing_newline_p): New accessor.
19510 (file_cache_slot::inc_use_count): New.
19511 (fcache_buffer_size): Move to...
19512 (file_cache_slot::buffer_size): ...here.
19513 (fcache_line_record_size): Move to...
19514 (file_cache_slot::line_record_size): ...here.
19515 (fcache_tab): Delete, in favor of global_dc->m_file_cache.
19516 (fcache_tab_size): Move to file_cache::num_file_slots.
19517 (diagnostic_file_cache_init): Update for move of fcache_tab
19518 to global_dc->m_file_cache.
19519 (diagnostic_file_cache_fini): Likewise.
19520 (lookup_file_in_cache_tab): Convert to...
19521 (file_cache::lookup_file): ...this.
19522 (diagnostics_file_cache_forcibly_evict_file): Update for move of
19523 fcache_tab to global_dc->m_file_cache, moving most of
19524 implementation to...
19525 (file_cache::forcibly_evict_file): ...this new function and...
19526 (file_cache_slot::evict): ...this new function.
19527 (evicted_cache_tab_entry): Convert to...
19528 (file_cache::evicted_cache_tab_entry): ...this.
19529 (add_file_to_cache_tab): Convert to...
19530 (file_cache::add_file): ...this, moving bulk of implementation
19532 (file_cache_slot::create): ..this new function.
19533 (file_cache::file_cache): New.
19534 (file_cache::~file_cache): New.
19535 (lookup_or_add_file_to_cache_tab): Convert to...
19536 (file_cache::lookup_or_add_file): ..this new function.
19537 (fcache::fcache): Rename to...
19538 (file_cache_slot::file_cache_slot): ...this, adding "m_" prefixes
19540 (fcache::~fcache): Rename to...
19541 (file_cache_slot::~file_cache_slot): ...this, adding "m_" prefixes
19543 (needs_read): Convert to...
19544 (file_cache_slot::needs_read_p): ...this.
19545 (needs_grow): Convert to...
19546 (file_cache_slot::needs_grow_p): ...this.
19547 (maybe_grow): Convert to...
19548 (file_cache_slot::maybe_grow): ...this.
19549 (read_data): Convert to...
19550 (file_cache_slot::read_data): ...this.
19551 (maybe_read_data): Convert to...
19552 (file_cache_slot::maybe_read_data): ...this.
19553 (get_next_line): Convert to...
19554 (file_cache_slot::get_next_line): ...this.
19555 (goto_next_line): Convert to...
19556 (file_cache_slot::goto_next_line): ...this.
19557 (read_line_num): Convert to...
19558 (file_cache_slot::read_line_num): ...this.
19559 (location_get_source_line): Update for moving of globals to
19560 global_dc->m_file_cache.
19561 (location_missing_trailing_newline): Likewise.
19562 * input.h (class file_cache_slot): New forward decl.
19563 (class file_cache): New.
19565 2021-07-01 Michael Meissner <meissner@linux.ibm.com>
19567 * config/rs6000/rs6000.c (rs6000_maybe_emit_fp_cmove): Add IEEE
19568 128-bit floating point conditional move support.
19569 (have_compare_and_set_mask): Add IEEE 128-bit floating point
19571 * config/rs6000/rs6000.md (mov<mode>cc, IEEE128 iterator): New insn.
19572 (mov<mode>cc_p10, IEEE128 iterator): New insn.
19573 (mov<mode>cc_invert_p10, IEEE128 iterator): New insn.
19574 (fpmask<mode>, IEEE128 iterator): New insn.
19575 (xxsel<mode>, IEEE128 iterator): New insn.
19577 2021-07-01 Iain Sandoe <iain@sandoe.co.uk>
19580 * config/darwin.h (CTF_INFO_SECTION_NAME): New.
19582 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
19584 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
19586 * config/i386/i386-protos.h (ix86_expand_vector_init_duplicate):
19588 * config/i386/sse.md (INT_BROADCAST_MODE): New mode iterator.
19589 (vec_duplicate<mode>): New expander.
19591 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
19594 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
19596 (ix86_byte_broadcast): New function.
19597 (ix86_convert_const_wide_int_to_broadcast): Likewise.
19598 (ix86_expand_move): Convert CONST_WIDE_INT to broadcast if mode
19599 size is 16 bytes or bigger.
19600 (ix86_broadcast_from_integer_constant): New function.
19601 (ix86_expand_vector_move): Convert CONST_WIDE_INT and CONST_VECTOR
19602 to broadcast if mode size is 16 bytes or bigger.
19603 * config/i386/i386-protos.h (ix86_gen_scratch_sse_rtx): New
19605 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): New function.
19607 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19609 * config/i386/predicates.md (ix86_endbr_immediate_operand):
19610 Return true/false instead of 1/0.
19611 (movq_parallel): Ditto.
19613 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19615 * recog.c (general_operand): Return true/false instead of 1/0.
19616 (register_operand): Ditto.
19617 (immediate_operand): Ditto.
19618 (const_int_operand): Ditto.
19619 (const_scalar_int_operand): Ditto.
19620 (const_double_operand): Ditto.
19621 (push_operand): Ditto.
19622 (pop_operand): Ditto.
19623 (memory_operand): Ditto.
19624 (indirect_operand): Ditto.
19626 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19628 * genpreds.c (write_predicate_subfunction):
19629 Change the type of written subfunction to bool.
19630 (write_one_predicate_function):
19631 Change the type of written function to bool.
19632 (write_tm_preds_h): Ditto.
19633 * recog.h (*insn_operand_predicate_fn): Change the type to bool.
19634 * recog.c (general_operand): Change the type to bool.
19635 (address_operand): Ditto.
19636 (register_operand): Ditto.
19637 (pmode_register_operand): Ditto.
19638 (scratch_operand): Ditto.
19639 (immediate_operand): Ditto.
19640 (const_int_operand): Ditto.
19641 (const_scalar_int_operand): Ditto.
19642 (const_double_operand): Ditto.
19643 (nonimmediate_operand): Ditto.
19644 (nonmemory_operand): Ditto.
19645 (push_operand): Ditto.
19646 (pop_operand): Ditto.
19647 (memory_operand): Ditto.
19648 (indirect_operand): Ditto.
19649 (ordered_comparison_operator): Ditto.
19650 (comparison_operator): Ditto.
19651 * config/i386/i386-expand.c (ix86_expand_sse_cmp):
19652 Change the type of indirect predicate function to bool.
19653 * config/rs6000/rs6000.c (easy_vector_constant):
19654 Change the type to bool.
19655 * config/mips/mips-protos.h (m16_based_address_p):
19656 Change the type of operand 3 to bool.
19658 2021-07-01 Richard Biener <rguenther@suse.de>
19660 PR tree-optimization/101280
19661 PR tree-optimization/101173
19662 * gimple-loop-interchange.cc
19663 (tree_loop_interchange::valid_data_dependences): Revert
19664 previous change and instead correctly handle DDR_REVERSED_P
19667 2021-07-01 Richard Biener <rguenther@suse.de>
19669 PR tree-optimization/101278
19670 * tree-ssa-dse.c (dse_classify_store): First check for
19671 uses, then ignore stmt for chaining purposes.
19673 2021-07-01 Richard Biener <rguenther@suse.de>
19675 PR tree-optimization/100778
19676 * tree-vect-slp.c (vect_schedule_slp_node): Do not place trapping
19677 vectorized ops ahead of their scalar BB.
19679 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19682 * config/i386/i386.md (*nabs<dwi>2_doubleword):
19683 New insn_and_split pattern.
19684 (*nabs<dwi>2_1): Ditto.
19685 * config/i386/i386-features.c
19686 (general_scalar_chain::compute_convert_gain):
19687 Handle (NEG (ABS (...))) RTX. Rewrite src code
19688 scanner as switch statement.
19689 (general_scalar_chain::convert_insn):
19690 Handle (NEG (ABS (...))) RTX.
19691 (general_scalar_to_vector_candidate_p):
19692 Detect (NEG (ABS (...))) RTX. Reorder case statements
19693 for (AND (NOT (...) ...)) fallthrough.
19695 2021-07-01 Richard Biener <rguenther@suse.de>
19697 PR tree-optimization/101178
19698 * tree-vect-slp.c (slpg_vertex::materialize): Remove.
19699 (slpg::perm_in): Add.
19700 (slpg::get_perm_in): Remove.
19701 (slpg::get_perm_materialized): Add.
19702 (vect_optimize_slp): Handle VEC_PERM nodes more optimally
19703 during permute propagation and materialization.
19705 2021-07-01 Jakub Jelinek <jakub@redhat.com>
19708 * dwarf2out.c (loc_list_from_tree_1): Handle COMPOUND_LITERAL_EXPR.
19710 2021-07-01 Jakub Jelinek <jakub@redhat.com>
19712 PR middle-end/94366
19713 * omp-low.c (lower_rec_input_clauses): Rename is_fp_and_or to
19714 is_truth_op, set it for TRUTH_*IF_EXPR regardless of new_var's type,
19715 use boolean_type_node instead of integer_type_node as NE_EXPR type.
19716 (lower_reduction_clauses): Likewise.
19718 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19720 * config/gcn/gcn.c: Include dwarf2.h.
19721 (gcn_addr_space_debug): New function.
19722 (TARGET_ADDR_SPACE_DEBUG): New hook.
19724 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19726 * common/config/gcn/gcn-common.c
19727 (gcn_option_optimization_table): Change OPT_fomit_frame_pointer to -O3.
19728 * config/gcn/gcn.c (gcn_expand_prologue): Prefer the frame pointer
19730 (gcn_expand_prologue): Prefer the frame pointer when emitting CFI.
19731 (gcn_frame_pointer_rqd): New function.
19732 (TARGET_FRAME_POINTER_REQUIRED): New hook.
19734 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19736 * config/gcn/gcn.c (move_callee_saved_registers): Emit CFI notes for
19737 prologue register saves.
19738 (gcn_debug_unwind_info): Use UI_DWARF2.
19739 (gcn_dwarf_register_number): Map DWARF_LINK_REGISTER to DWARF PC.
19740 (gcn_dwarf_register_span): DWARF_LINK_REGISTER doesn't span.
19741 * config/gcn/gcn.h: (DWARF_FRAME_RETURN_COLUMN): New define.
19742 (DWARF_LINK_REGISTER): New define.
19743 (FIRST_PSEUDO_REGISTER): Increment.
19744 (FIXED_REGISTERS): Add entry for DWARF_LINK_REGISTER.
19745 (CALL_USED_REGISTERS): Likewise.
19746 (REGISTER_NAMES): Likewise.
19748 2021-06-30 Richard Biener <rguenther@suse.de>
19750 PR tree-optimization/101267
19751 * tree-vect-stmts.c (vect_check_scalar_mask): Adjust
19752 API and use SLP compatible interface of vect_is_simple_use.
19753 Reject not vectorized SLP defs for callers that do not support
19755 (vect_check_store_rhs): Handle masked stores and pass down
19756 the appropriate operator index.
19757 (vectorizable_call): Adjust.
19758 (vectorizable_store): Likewise.
19759 (vectorizable_load): Likewise. Handle SLP pecularity of
19761 (vect_is_simple_use): Remove special-casing of masked stores.
19763 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
19765 * common.opt (foffload): Remove help as Driver only.
19766 * gcc.c (display_help): Add -foffload.
19768 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
19770 * gcc.c (close_at_file, execute): Replace alloca by XALLOCAVEC.
19771 (check_offload_target_name): Fix splitting OFFLOAD_TARGETS into
19772 a candidate list; better inform no offload target is configured
19773 and fix hint extraction when passed target is not '\0' at [len].
19774 * common.opt (foffload): Add tailing '.'.
19775 (foffload-options): Likewise; fix flag name in the help string.
19777 2021-06-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
19780 * config/arm/arm_neon.h: Move vabs intrinsics before vcage_f32.
19781 (vcage_f32): Gate comparison on __FAST_MATH__.
19782 (vcageq_f32): Likewise.
19783 (vcale_f32): Likewise.
19784 (vcaleq_f32): Likewise.
19785 (vcagt_f32): Likewise.
19786 (vcagtq_f32): Likewise.
19787 (vcalt_f32): Likewise.
19788 (vcaltq_f32): Likewise.
19789 (vcage_f16): Likewise.
19790 (vcageq_f16): Likewise.
19791 (vcale_f16): Likewise.
19792 (vcaleq_f16): Likewise.
19793 (vcagt_f16): Likewise.
19794 (vcagtq_f16): Likewise.
19795 (vcalt_f16): Likewise.
19796 (vcaltq_f16): Likewise.
19798 2021-06-30 Richard Biener <rguenther@suse.de>
19800 PR tree-optimization/101264
19801 * tree-vect-slp.c (vect_optimize_slp): Propagate the
19802 computed perm_in to all "any" permute successors
19803 we cannot de-duplicate immediately.
19805 2021-06-30 liuhongt <hongtao.liu@intel.com>
19808 * config/i386/sse.md
19809 (avx512f_sfixupimm<mode><sd_maskz_name><round_saeonly_name>):
19811 (avx512f_sfixupimm<mode><maskz_scalar_name><round_saeonly_name>):
19813 (avx512f_sfixupimm<mode>_mask<round_saeonly_name>"): Refined.
19814 * config/i386/subst.md (maskz_scalar): New define_subst.
19815 (maskz_scalar_name): New subst_attr.
19816 (maskz_scalar_op5): Ditto.
19817 (round_saeonly_maskz_scalar_op5): Ditto.
19818 (round_saeonly_maskz_scalar_operand5): Ditto.
19820 2021-06-30 David Edelsohn <dje.gcc@gmail.com>
19822 * config/rs6000/rs6000.c (rs6000_xcoff_section_type_flags):
19823 Increase code CSECT alignment to at least 32 bytes.
19824 * config/rs6000/xcoff.h (TEXT_SECTION_ASM_OP): Add 32 byte
19825 alignment designation.
19827 2021-06-29 Sergei Trofimovich <siarheit@google.com>
19829 * doc/generic.texi: Fix s/net yet/not yet/ typo.
19831 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19833 PR tree-optimization/101254
19834 * range-op.cc (operator_minus::op1_op2_relation_effect): Check for
19835 wrapping/non-wrapping when setting the result range.
19837 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19839 * value-query.cc (gimple_range_global): Allow phis.
19841 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19843 * vr-values.c (vr_values::vrp_stmt_computes_nonzero): Use stmt.
19844 (simplify_using_ranges::op_with_boolean_value_range_p): Add a
19845 statement for location context.
19846 (check_for_binary_op_overflow): Ditto.
19847 (simplify_using_ranges::get_vr_for_comparison): Ditto.
19848 (simplify_using_ranges::compare_name_with_value): Ditto.
19849 (simplify_using_ranges::compare_names): Ditto.
19850 (vrp_evaluate_conditional_warnv_with_ops_using_ranges): Ditto.
19851 (simplify_using_ranges::simplify_truth_ops_using_ranges): Ditto.
19852 (simplify_using_ranges::simplify_min_or_max_using_ranges): Ditto.
19853 (simplify_using_ranges::simplify_internal_call_using_ranges): Ditto.
19854 (simplify_using_ranges::two_valued_val_range_p): Ditto.
19855 (simplify_using_ranges::simplify): Ditto.
19856 * vr-values.h: Adjust prototypes.
19858 2021-06-29 Uroš Bizjak <ubizjak@gmail.com>
19861 * config/i386/mmx.md (vec_addsubv2sf3): New insn pattern.
19863 2021-06-29 Julian Brown <julian@codesourcery.com>
19865 * config/gcn/gcn.c (gcn_init_libfuncs): New function.
19866 (TARGET_INIT_LIBFUNCS): Define target hook using above function.
19867 * config/gcn/gcn.h (UNITS_PER_WORD): Define to 8 for IN_LIBGCC2, 4
19869 (LIBGCC2_UNITS_PER_WORD, BITS_PER_WORD): Remove definitions.
19870 (MAX_FIXED_MODE_SIZE): Change to 128.
19872 2021-06-29 Julian Brown <julian@codesourcery.com>
19874 * config/gcn/gcn.md (UNSPEC_FLBIT_INT): New unspec constant.
19875 (s_mnemonic): Add clrsb.
19876 (gcn_flbit<mode>_int): Add insn pattern for SImode/DImode.
19877 (clrsb<mode>2): Add expander for SImode/DImode.
19879 2021-06-29 Julian Brown <julian@codesourcery.com>
19881 * config/gcn/gcn.md (<su>mulsidi3, <su>mulsidi3_reg, <su>mulsidi3_imm,
19882 muldi3): Add patterns.
19884 2021-06-29 Julian Brown <julian@codesourcery.com>
19886 * config/gcn/gcn.md (<su>mulsi3_highpart): Change to expander.
19887 (<su>mulsi3_highpart_reg, <su>mulsi3_highpart_imm): New patterns.
19889 2021-06-29 Julian Brown <julian@codesourcery.com>
19891 * config/gcn/gcn.md (mulsi3): Make s_mulk_i32 variant clobber SCC.
19893 2021-06-29 Joseph Myers <joseph@codesourcery.com>
19895 * btfout.c, ctfout.c: Include "memmodel.h".
19897 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19899 * gcc.c (check_offload_target_name): Cast len argument to
19900 %q.*s to 'int'; avoid -Wstringop-truncation warning.
19902 2021-06-29 Richard Biener <rguenther@suse.de>
19904 * tree-vect-slp.c (vect_optimize_slp): Forward propagate
19905 to "any" permute nodes and relax "any" permute proapgation
19906 during iterative backward propagation.
19908 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19911 * common.opt (-foffload=): Update description.
19912 (-foffload-options=): New.
19913 * doc/invoke.texi (C Language Options): Document
19914 -foffload and -foffload-options.
19915 * gcc.c (check_offload_target_name): New, split off from
19916 handle_foffload_option.
19917 (check_foffload_target_names): New.
19918 (handle_foffload_option): Handle -foffload=default.
19919 (driver_handle_option): Update for -foffload-options.
19920 * lto-opts.c (lto_write_options): Use -foffload-options
19921 instead of -foffload.
19922 * lto-wrapper.c (merge_and_complain, append_offload_options):
19924 * opts.c (common_handle_option): Likewise.
19926 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19928 * doc/invoke.texi (C Language Options): Sort options
19929 alphabetically in optlist and also the description itself.
19930 Remove leftover -fallow-single-precision from and add missing
19931 -fgnu-tm to the optlist.
19933 2021-06-29 Richard Biener <rguenther@suse.de>
19935 * tree-vect-slp.c (slpg_vertex::visited): Remove.
19936 (vect_slp_perms_eq): Handle -1 permutes.
19937 (vect_optimize_slp): Rewrite permute propagation.
19939 2021-06-29 Jakub Jelinek <jakub@redhat.com>
19942 * match.pd ((intptr_t)x eq/ne CST to x eq/ne (typeof x) CST): Don't
19943 perform the optimization in GENERIC when sanitizing and x has a
19946 2021-06-29 Richard Biener <rguenther@suse.de>
19948 PR tree-optimization/101242
19949 * tree-vect-slp.c (vect_slp_build_vertices): Force-add
19950 PHIs with not represented initial values as leafs.
19952 2021-06-29 Jan-Benedict Glaw <jbglaw@getslash.de>
19954 * config/pdp11/pdp11.h (ASM_OUTPUT_SKIP): Fix signedness warning.
19955 * config/pdp11/pdp11.c (pdp11_asm_print_operand_punct_valid_p): Remove
19956 "register" keyword.
19957 (pdp11_initial_elimination_offset) Remove unused variable.
19958 (pdp11_cmp_length) Ditto.
19959 (pdp11_insn_cost): Ditto, and fix signedness warning.
19961 2021-06-29 David Edelsohn <dje.gcc@gmail.com>
19963 * btfout.c: Include tm_p.h.
19966 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19968 * config/bpf/bpf.c (bpf_expand_prologue): Do not mark insns as
19970 (bpf_expand_epilogue): Likewise.
19971 * config/bpf/bpf.h (DWARF2_FRAME_INFO): Define to 0.
19972 Do not define DBX_DEBUGGING_INFO.
19974 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19976 * doc/invoke.texi: Document the CTF and BTF debug info options.
19978 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19979 David Faust <david.faust@oracle.com>
19980 Jose E. Marchesi <jose.marchesi@oracle.com>
19981 Weimin Pan <weimin.pan@oracle.com>
19983 * Makefile.in: Add ctfc.*, ctfout.c and btfout.c files to
19984 GTFILES. Add new object files.
19985 * common.opt: Add CTF and BTF debug info options.
19986 * btfout.c: New file.
19987 * ctfc.c: Likewise.
19988 * ctfc.h: Likewise.
19989 * ctfout.c: Likewise.
19990 * dwarf2ctf.c: Likewise.
19991 * dwarf2ctf.h: Likewise.
19992 * dwarf2cfi.c (dwarf2out_do_frame): Acknowledge CTF_DEBUG and
19994 * dwarf2out.c (dwarf2out_source_line): Likewise.
19995 (dwarf2out_finish): Skip emitting DWARF if CTF or BTF are to
19997 (debug_format_do_cu): New function.
19998 (dwarf2out_early_finish): Traverse DIEs and emit CTF/BTF for
20000 Include dwarf2ctf.c.
20001 * final.c (dwarf2_debug_info_emitted_p): Acknowledge DWARF-based debug
20003 * flag-types.h (enum debug_info_type): Add CTF_DEBUG and BTF_DEBUG.
20004 (CTF_DEBUG): New bitmask.
20005 (BTF_DEBUG): Likewise.
20006 (enum ctf_debug_info_levels): New enum.
20007 * gengtype.c (open_base_files): Handle ctfc.h.
20008 (main): Handle uint32_t type.
20009 * flags.h (btf_debuginfo_p): New definition.
20010 (dwarf_based_debuginfo_p): Likewise.
20011 * opts.c (debug_type_names): Add entries for CTF and BTF.
20012 (btf_debuginfo_p): New function.
20013 (dwarf_based_debuginfo_p): Likewise.
20014 (common_handle_option): Handle -gctfN and -gbtf options.
20015 (set_debug_level): Set CTF_DEBUG, BTF_DEBUG whenever appropriate.
20016 * toplev.c (process_options): Inform the user and ignore -gctfLEVEL if
20019 2021-06-28 Jose E. Marchesi <jose.marchesi@oracle.com>
20021 * dwarf2out.c (AT_class): Function is no longer static.
20022 (AT_int): Likewise.
20023 (AT_unsigned): Likewise.
20024 (AT_loc): Likewise.
20025 (get_AT): Likewise.
20026 (get_AT_string): Likewise.
20027 (get_AT_flag): Likewise.
20028 (get_AT_unsigned): Likewise.
20029 (get_AT_ref): Likewise.
20030 (new_die_raw): Likewise.
20031 (lookup_decl_die): Likewise.
20032 (base_type_die): Likewise.
20033 (add_name_attribute): Likewise.
20034 (add_AT_int): Likewise.
20035 (add_AT_unsigned): Likewise.
20036 (add_AT_loc): Likewise.
20037 (dw_get_die_tag): New function.
20038 (dw_get_die_child): Likewise.
20039 (dw_get_die_sib): Likewise.
20040 (struct dwarf_file_data): Move from here to dwarf2out.h
20041 (struct dw_attr_struct): Likewise.
20042 * dwarf2out.h: Analogous changes.
20044 2021-06-28 Martin Jambor <mjambor@suse.cz>
20047 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
20048 members m_dead_stmts and m_dead_ssas.
20049 * ipa-param-manipulation.c
20050 (ipa_param_body_adjustments::mark_dead_statements): New function.
20051 (ipa_param_body_adjustments::common_initialization): Call it on
20052 all removed but not split parameters.
20053 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
20055 (ipa_param_body_adjustments::modify_call_stmt): Remove arguments that
20057 * tree-inline.c (remap_gimple_stmt): Do not copy dead statements, reset
20058 dead debug statements.
20059 (copy_phis_for_bb): Do not copy dead PHI nodes.
20061 2021-06-28 Martin Jambor <mjambor@suse.cz>
20064 * symtab-clones.h (clone_info): Removed member param_adjustments.
20065 * ipa-param-manipulation.h: Adjust initial comment to reflect how we
20066 deal with pass-through splits now.
20067 (ipa_param_performed_split): Removed.
20068 (ipa_param_adjustments::modify_call): Adjusted parameters.
20069 (class ipa_param_body_adjustments): Adjusted parameters of
20070 register_replacement, modify_gimple_stmt and modify_call_stmt.
20071 (ipa_verify_edge_has_no_modifications): Declare.
20072 (ipa_edge_modifications_finalize): Declare.
20073 * cgraph.c (cgraph_edge::redirect_call_stmt_to_callee): Remove
20074 performed_splits processing, pas only edge to padjs->modify_call,
20075 check that call arguments were not modified if they should not have
20077 * cgraphclones.c (cgraph_node::create_clone): Do not copy performed
20079 * ipa-param-manipulation.c (struct pass_through_split_map): New type.
20080 (ipa_edge_modification_info): Likewise.
20081 (ipa_edge_modification_sum): Likewise.
20082 (ipa_edge_modifications): New edge summary.
20083 (ipa_verify_edge_has_no_modifications): New function.
20084 (transitive_split_p): Removed.
20085 (transitive_split_map): Likewise.
20086 (init_transitive_splits): Likewise.
20087 (ipa_param_adjustments::modify_call): Adjusted to use the new edge
20088 summary instead of performed_splits.
20089 (ipa_param_body_adjustments::register_replacement): Drop dummy
20090 parameter, set base_index of the created ipa_param_body_replacement.
20091 (phi_arg_will_live_p): New function.
20092 (ipa_param_body_adjustments::common_initialization): Do not create
20093 IPA_SRA dummy decls.
20094 (simple_tree_swap_info): Removed.
20095 (remap_split_decl_to_dummy): Likewise.
20096 (record_argument_state_1): New function.
20097 (record_argument_state): Likewise.
20098 (ipa_param_body_adjustments::modify_call_stmt): New parameter
20099 orig_stmt. Do not work with dummy decls, save necessary info about
20100 changes to ipa_edge_modifications.
20101 (ipa_param_body_adjustments::modify_gimple_stmt): New parameter
20102 orig_stmt, pass it to modify_call_stmt.
20103 (ipa_param_body_adjustments::modify_cfun_body): Adjust call to
20104 modify_gimple_stmt.
20105 (ipa_edge_modifications_finalize): New function.
20106 * tree-inline.c (remap_gimple_stmt): Pass original statement to
20107 modify_gimple_stmt.
20108 (copy_phis_for_bb): Do not copy dead PHI nodes.
20109 (expand_call_inline): Do not remap performed_splits.
20110 (update_clone_info): Likewise.
20111 * toplev.c: Include ipa-param-manipulation.h.
20112 (toplev::finalize): Call ipa_edge_modifications_finalize.
20114 2021-06-28 Andrew Pinski <apinski@marvell.com>
20116 * tree-ssa-phiopt.c (replace_phi_edge_with_variable): Duplicate range
20117 info if we're the only things setting the target PHI.
20118 (value_replacement): Don't duplicate range here.
20119 (minmax_replacement): Likewise.
20121 2021-06-28 Richard Biener <rguenther@suse.de>
20123 PR tree-optimization/101229
20124 * gimple-walk.c (gimple_walk_op): Handle PHIs.
20126 2021-06-28 Martin Liska <mliska@suse.cz>
20128 * config/v850/v850.c (construct_dispose_instruction): Allocate
20130 (construct_prepare_instruction): Likewise.
20132 2021-06-28 Martin Liska <mliska@suse.cz>
20134 * config/v850/v850.c (v850_option_override): Build default
20136 (v850_can_inline_p): New. Allow MASK_PROLOG_FUNCTION to be
20137 ignored for inlining.
20138 (TARGET_CAN_INLINE_P): New.
20140 2021-06-28 Richard Biener <rguenther@suse.de>
20142 PR tree-optimization/101207
20143 * tree-vect-slp.c (vect_optimize_slp): Do BB reduction
20144 permute eliding for load permutations properly.
20146 2021-06-28 Richard Biener <rguenther@suse.de>
20148 PR tree-optimization/101173
20149 * gimple-loop-interchange.cc
20150 (tree_loop_interchange::valid_data_dependences): Disallow outer
20151 loop dependence distance of zero.
20153 2021-06-28 liuhongt <hongtao.liu@intel.com>
20156 * config/i386/sse.md (*avx_cmp<mode>3_lt): New
20157 define_insn_and_split.
20158 (*avx_cmp<mode>3_ltint): Ditto.
20159 (*avx2_pcmp<mode>3_3): Ditto.
20160 (*avx2_pcmp<mode>3_4): Ditto.
20161 (*avx2_pcmp<mode>3_5): Ditto.
20163 2021-06-28 liuhongt <hongtao.liu@intel.com>
20165 * config/i386/i386-builtin.def (IX86_BUILTIN_BLENDVPD256,
20166 IX86_BUILTIN_BLENDVPS256, IX86_BUILTIN_PBLENDVB256,
20167 IX86_BUILTIN_BLENDVPD, IX86_BUILTIN_BLENDVPS,
20168 IX86_BUILTIN_PBLENDVB128): Replace icode with
20170 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold blendv
20172 * config/i386/sse.md (*<sse4_1_avx2>_pblendvb_lt_subreg_not):
20173 New pre_reload splitter.
20175 2021-06-27 Andrew Pinski <apinski@marvell.com>
20177 PR middle-end/101230
20178 * fold-const.c (fold_ternary_loc): Check
20179 the return value of invert_tree_comparison.
20181 2021-06-27 David Edelsohn <dje.gcc@gmail.com>
20183 * config.gcc: Add SPDX License Identifier.
20184 (powerpc-ibm-aix789): Default to aix73.h.
20185 (powerpc-ibm-aix7.2.*.*): New stanza.
20186 * config/rs6000/aix72.h: Add SPDX License Identifier.
20187 * config/rs6000/aix73.h: New file.
20189 2021-06-26 Jason Merrill <jason@redhat.com>
20191 * except.c: #include "dwarf2.h" instead of "dwarf2out.h".
20193 2021-06-26 Andrew Pinski <apinski@marvell.com>
20195 * genmatch.c (lower_cond): Copy for_subst_vec
20196 for the simplify also.
20197 (lower): Swap the order for lower_for and lower_cond.
20199 2021-06-26 Andrew Pinski <apinski@marvell.com>
20201 * tree-ssa-phiopt.c (match_simplify_replacement): Reset
20202 flow senatitive info on the moved ssa set.
20204 2021-06-26 Andrew Pinski <apinski@marvell.com>
20206 * fold-const.c (fold_cond_expr_with_comparison):
20207 Exand arg0 into comp_code, arg00, and arg01.
20208 (fold_ternary_loc): Use invert_tree_comparison
20209 instead of fold_invert_truthvalue for the case
20210 where we have A CMP B ? C : A.
20212 2021-06-25 Martin Sebor <msebor@redhat.com>
20214 PR middle-end/101216
20215 * calls.c (maybe_warn_rdwr_sizes): Use the no_warning constant.
20217 2021-06-25 Jeff Law <jeffreyalaw@gmail.com>
20219 * config/h8300/h8300.c (select_cc_mode): Handle ASHIFTRT and LSHIFTRT.
20221 2021-06-25 Richard Biener <rguenther@suse.de>
20223 PR tree-optimization/101202
20224 * tree-vect-slp.c (vect_optimize_slp): Explicitely handle
20227 2021-06-25 Richard Biener <rguenther@suse.de>
20229 * tree-vect-slp-patterns.c (addsub_pattern::build): Copy
20230 STMT_VINFO_REDUC_DEF from the original representative.
20232 2021-06-25 Martin Sebor <msebor@redhat.com>
20234 * builtins.c (warn_string_no_nul): Replace uses of TREE_NO_WARNING,
20235 gimple_no_warning_p and gimple_set_no_warning with
20236 warning_suppressed_p, and suppress_warning.
20238 (maybe_warn_for_bound): Same.
20239 (warn_for_access): Same.
20240 (check_access): Same.
20241 (expand_builtin_strncmp): Same.
20242 (fold_builtin_varargs): Same.
20243 * calls.c (maybe_warn_nonstring_arg): Same.
20244 (maybe_warn_rdwr_sizes): Same.
20245 * cfgexpand.c (expand_call_stmt): Same.
20246 * cgraphunit.c (check_global_declaration): Same.
20247 * fold-const.c (fold_undefer_overflow_warnings): Same.
20248 (fold_truth_not_expr): Same.
20249 (fold_unary_loc): Same.
20250 (fold_checksum_tree): Same.
20251 * gimple-array-bounds.cc (array_bounds_checker::check_array_ref): Same.
20252 (array_bounds_checker::check_mem_ref): Same.
20253 (array_bounds_checker::check_addr_expr): Same.
20254 (array_bounds_checker::check_array_bounds): Same.
20255 * gimple-expr.c (copy_var_decl): Same.
20256 * gimple-fold.c (gimple_fold_builtin_strcpy): Same.
20257 (gimple_fold_builtin_strncat): Same.
20258 (gimple_fold_builtin_stxcpy_chk): Same.
20259 (gimple_fold_builtin_stpcpy): Same.
20260 (gimple_fold_builtin_sprintf): Same.
20261 (fold_stmt_1): Same.
20262 * gimple-ssa-isolate-paths.c (diag_returned_locals): Same.
20263 * gimple-ssa-nonnull-compare.c (do_warn_nonnull_compare): Same.
20264 * gimple-ssa-sprintf.c (handle_printf_call): Same.
20265 * gimple-ssa-store-merging.c (imm_store_chain_info::output_merged_store): Same.
20266 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
20267 * gimple-ssa-warn-restrict.h: Adjust declarations.
20268 (maybe_diag_access_bounds): Replace uses of TREE_NO_WARNING,
20269 gimple_no_warning_p and gimple_set_no_warning with
20270 warning_suppressed_p, and suppress_warning.
20271 (check_call): Same.
20272 (check_bounds_or_overlap): Same.
20273 * gimple.c (gimple_build_call_from_tree): Same.
20274 * gimplify.c (gimplify_return_expr): Same.
20275 (gimplify_cond_expr): Same.
20276 (gimplify_modify_expr_complex_part): Same.
20277 (gimplify_modify_expr): Same.
20278 (gimple_push_cleanup): Same.
20279 (gimplify_expr): Same.
20280 * omp-expand.c (expand_omp_for_generic): Same.
20281 (expand_omp_taskloop_for_outer): Same.
20282 * omp-low.c (lower_rec_input_clauses): Same.
20283 (lower_lastprivate_clauses): Same.
20284 (lower_send_clauses): Same.
20285 (lower_omp_target): Same.
20286 * tree-cfg.c (pass_warn_function_return::execute): Same.
20287 * tree-complex.c (create_one_component_var): Same.
20288 * tree-inline.c (remap_gimple_op_r): Same.
20289 (copy_tree_body_r): Same.
20290 (declare_return_variable): Same.
20291 (expand_call_inline): Same.
20292 * tree-nested.c (lookup_field_for_decl): Same.
20293 * tree-sra.c (create_access_replacement): Same.
20294 (generate_subtree_copies): Same.
20295 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Same.
20296 * tree-ssa-forwprop.c (combine_cond_expr_cond): Same.
20297 * tree-ssa-loop-ch.c (ch_base::copy_headers): Same.
20298 * tree-ssa-loop-im.c (execute_sm): Same.
20299 * tree-ssa-phiopt.c (cond_store_replacement): Same.
20300 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
20301 (handle_builtin_strcpy): Same.
20302 (maybe_diag_stxncpy_trunc): Same.
20303 (handle_builtin_stxncpy_strncat): Same.
20304 (handle_builtin_strcat): Same.
20305 * tree-ssa-uninit.c (get_no_uninit_warning): Same.
20306 (set_no_uninit_warning): Same.
20307 (uninit_undefined_value_p): Same.
20308 (warn_uninit): Same.
20309 (maybe_warn_operand): Same.
20310 * tree-vrp.c (compare_values_warnv): Same.
20311 * vr-values.c (vr_values::extract_range_for_var_from_comparison_expr): Same.
20312 (test_for_singularity): Same.
20313 * gimple.h (warning_suppressed_p): New function.
20314 (suppress_warning): Same.
20315 (copy_no_warning): Same.
20316 (gimple_set_block): Call gimple_set_location.
20317 (gimple_set_location): Call copy_warning.
20319 2021-06-25 Martin Sebor <msebor@redhat.com>
20321 * tree.h (warning_suppressed_at, copy_warning,
20322 warning_suppressed_p, suppress_warning): New functions.
20324 2021-06-25 Martin Sebor <msebor@redhat.com>
20326 * Makefile.in (OBJS-libcommon): Add diagnostic-spec.o.
20327 * gengtype.c (open_base_files): Add diagnostic-spec.h.
20328 * diagnostic-spec.c: New file.
20329 * diagnostic-spec.h: New file.
20330 * tree.h (no_warning, all_warnings, suppress_warning_at): New
20332 * warning-control.cc: New file.
20334 2021-06-25 liuhongt <hongtao.liu@intel.com>
20337 * config/i386/i386.c (x86_order_regs_for_local_alloc):
20340 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20342 PR tree-optimization/101189
20343 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Pass
20344 LHS range of condition to postfold routine.
20345 (fold_using_range::postfold_gcond_edges): Only process the TRUE or
20346 FALSE edge if the LHS range supports it being taken.
20347 * gimple-range-fold.h (postfold_gcond_edges): Add range parameter.
20349 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20351 * value-relation.cc (equiv_oracle::dump): Do not dump NULL blocks.
20352 (relation_oracle::find_relation_block): Check correct bitmap.
20353 (relation_oracle::dump): Do not dump NULL blocks.
20355 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20357 * gimple-range-cache.cc (ranger_cache::propagate_cache): Call
20358 range_on_edge instead of manually calculating.
20360 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20362 * range-op.cc: Fix comment.
20364 2021-06-24 Uroš Bizjak <ubizjak@gmail.com>
20367 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
20368 Handle V8QI and V4HI modes.
20369 * config/i386/mmx.md (sse4_1_<any_extend:code>v4qiv4hi2):
20371 (sse4_1_<any_extend:code>v4qiv4hi2): Ditto.
20372 (mmxpackmode): New mode attribute.
20373 (vec_pack_trunc_<mmxpackmode:mode>): New expander.
20374 (mmxunpackmode): New mode attribute.
20375 (vec_unpacks_lo_<mmxunpackmode:mode>): New expander.
20376 (vec_unpacks_hi_<mmxunpackmode:mode>): Ditto.
20377 (vec_unpacku_lo_<mmxunpackmode:mode>): Ditto.
20378 (vec_unpacku_hi_<mmxunpackmode:mode>): Ditto.
20379 * config/i386/i386.md (extsuffix): Move from ...
20380 * config/i386/sse.md: ... here.
20382 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
20384 * dwarf2out.c (dwarf2out_assembly_start): Emit .file 0 marker here..
20385 (dwarf2out_finish): ...instead of here.
20387 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
20389 * configure.ac (--gdwarf-5 option): Use objdump instead of readelf.
20390 (working --gdwarf-4/--gdwarf-5 for all sources): Likewise.
20391 (--gdwarf-4 not refusing generated .debug_line): Adjust for Windows.
20392 * configure: Regenerate.
20394 2021-06-24 Richard Biener <rguenther@suse.de>
20396 * config/i386/sse.md (vec_addsubv4df3, vec_addsubv2df3,
20397 vec_addsubv8sf3, vec_addsubv4sf3): Merge into ...
20398 (vec_addsub<mode>3): ... using a new addsub_cst mode attribute.
20400 2021-06-24 Richard Biener <rguenther@suse.de>
20402 * config/i386/sse.md (avx_addsubv4df3): Rename to
20404 (avx_addsubv8sf3): Rename to vec_addsubv8sf3.
20405 (sse3_addsubv2df3): Rename to vec_addsubv2df3.
20406 (sse3_addsubv4sf3): Rename to vec_addsubv4sf3.
20407 * config/i386/i386-builtin.def: Adjust.
20408 * internal-fn.def (VEC_ADDSUB): New internal optab fn.
20409 * optabs.def (vec_addsub_optab): New optab.
20410 * tree-vect-slp-patterns.c (class addsub_pattern): New.
20411 (slp_patterns): Add addsub_pattern.
20412 * tree-vect-slp.c (vect_optimize_slp): Disable propagation
20413 across CFN_VEC_ADDSUB.
20414 * tree-vectorizer.h (vect_pattern::vect_pattern): Make
20416 * doc/md.texi (vec_addsub<mode>3): Document.
20418 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20420 PR middle-end/101170
20421 * df-scan.c (df_ref_record): For paradoxical big-endian SUBREGs
20422 where regno + subreg_regno_offset wraps around use 0 as starting
20425 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20427 PR middle-end/101172
20428 * stor-layout.c (finish_bitfield_representative): If nextf has
20429 error_mark_node type, set repr type to error_mark_node too.
20431 2021-06-24 Ilya Leoshkevich <iii@linux.ibm.com>
20433 * config/s390/s390.c (s390_function_profiler): Ignore labelno
20435 * config/s390/s390.h (NO_PROFILE_COUNTERS): Define.
20437 2021-06-24 Richard Biener <rguenther@suse.de>
20439 * tree-vect-slp.c (vect_optimize_slp): Do not propagate
20440 across operations that have different semantics on different
20443 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20445 * tree.h (OMP_CLAUSE_MAP_IN_REDUCTION): Document meaning for OpenMP.
20446 * gimplify.c (gimplify_scan_omp_clauses): For OpenMP map clauses
20447 with OMP_CLAUSE_MAP_IN_REDUCTION flag partially defer gimplification
20448 of non-decl OMP_CLAUSE_DECL. For OMP_CLAUSE_IN_REDUCTION on
20449 OMP_TARGET user outer_ctx instead of ctx for placeholders and
20450 initializer/combiner gimplification.
20451 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_MAP_IN_REDUCTION
20452 on target constructs.
20453 (lower_rec_input_clauses): Likewise.
20454 (lower_omp_target): Likewise.
20455 * omp-expand.c (expand_omp_target): Temporarily ignore nowait clause
20456 on target if in_reduction is present.
20458 2021-06-24 Kewen Lin <linkw@linux.ibm.com>
20460 * tree-predcom.c (class pcom_worker): New class.
20461 (release_chain): Renamed to...
20462 (pcom_worker::release_chain): ...this.
20463 (release_chains): Renamed to...
20464 (pcom_worker::release_chains): ...this.
20465 (aff_combination_dr_offset): Renamed to...
20466 (pcom_worker::aff_combination_dr_offset): ...this.
20467 (determine_offset): Renamed to...
20468 (pcom_worker::determine_offset): ...this.
20469 (class comp_ptrs): New class.
20470 (split_data_refs_to_components): Renamed to...
20471 (pcom_worker::split_data_refs_to_components): ...this,
20472 and update with class comp_ptrs.
20473 (suitable_component_p): Renamed to...
20474 (pcom_worker::suitable_component_p): ...this.
20475 (filter_suitable_components): Renamed to...
20476 (pcom_worker::filter_suitable_components): ...this.
20477 (valid_initializer_p): Renamed to...
20478 (pcom_worker::valid_initializer_p): ...this.
20479 (find_looparound_phi): Renamed to...
20480 (pcom_worker::find_looparound_phi): ...this.
20481 (add_looparound_copies): Renamed to...
20482 (pcom_worker::add_looparound_copies): ...this.
20483 (determine_roots_comp): Renamed to...
20484 (pcom_worker::determine_roots_comp): ...this.
20485 (determine_roots): Renamed to...
20486 (pcom_worker::determine_roots): ...this.
20487 (single_nonlooparound_use): Renamed to...
20488 (pcom_worker::single_nonlooparound_use): ...this.
20489 (remove_stmt): Renamed to...
20490 (pcom_worker::remove_stmt): ...this.
20491 (execute_pred_commoning_chain): Renamed to...
20492 (pcom_worker::execute_pred_commoning_chain): ...this.
20493 (execute_pred_commoning): Renamed to...
20494 (pcom_worker::execute_pred_commoning): ...this.
20495 (struct epcc_data): New member worker.
20496 (execute_pred_commoning_cbck): Call execute_pred_commoning
20497 with pcom_worker pointer.
20498 (find_use_stmt): Renamed to...
20499 (pcom_worker::find_use_stmt): ...this.
20500 (find_associative_operation_root): Renamed to...
20501 (pcom_worker::find_associative_operation_root): ...this.
20502 (find_common_use_stmt): Renamed to...
20503 (pcom_worker::find_common_use_stmt): ...this.
20504 (combinable_refs_p): Renamed to...
20505 (pcom_worker::combinable_refs_p): ...this.
20506 (reassociate_to_the_same_stmt): Renamed to...
20507 (pcom_worker::reassociate_to_the_same_stmt): ...this.
20508 (stmt_combining_refs): Renamed to...
20509 (pcom_worker::stmt_combining_refs): ...this.
20510 (combine_chains): Renamed to...
20511 (pcom_worker::combine_chains): ...this.
20512 (try_combine_chains): Renamed to...
20513 (pcom_worker::try_combine_chains): ...this.
20514 (prepare_initializers_chain): Renamed to...
20515 (pcom_worker::prepare_initializers_chain): ...this.
20516 (prepare_initializers): Renamed to...
20517 (pcom_worker::prepare_initializers): ...this.
20518 (prepare_finalizers_chain): Renamed to...
20519 (pcom_worker::prepare_finalizers_chain): ...this.
20520 (prepare_finalizers): Renamed to...
20521 (pcom_worker::prepare_finalizers): ...this.
20522 (tree_predictive_commoning_loop): Renamed to...
20523 (pcom_worker::tree_predictive_commoning_loop): ...this, adjust
20524 some calls and remove some cleanup code.
20525 (tree_predictive_commoning): Adjusted to use pcom_worker instance.
20526 (static variable looparound_phis): Remove.
20527 (static variable name_expansions): Remove.
20529 2021-06-24 Richard Biener <rguenther@suse.de>
20531 * tree-vect-slp.c (slpg_vertex): New struct.
20532 (vect_slp_build_vertices): Adjust.
20533 (vect_optimize_slp): Likewise. Maintain an outgoing permute
20534 and a materialized one.
20536 2021-06-24 Richard Biener <rguenther@suse.de>
20538 PR tree-optimization/101105
20539 * tree-vect-data-refs.c (vect_prune_runtime_alias_test_list):
20540 Only ignore steps when they are equal or scalar order is preserved.
20542 2021-06-24 liuhongt <hongtao.liu@intel.com>
20545 * config/i386/i386-expand.c (ix86_expand_vec_interleave):
20546 Adjust comments for ix86_expand_vecop_qihi2.
20547 (ix86_expand_vecmul_qihi): Renamed to ..
20548 (ix86_expand_vecop_qihi2): Adjust function prototype to
20549 support shift operation, add static to definition.
20550 (ix86_expand_vec_shift_qihi_constant): Add static to definition.
20551 (ix86_expand_vecop_qihi): Call ix86_expand_vecop_qihi2 and
20552 ix86_expand_vec_shift_qihi_constant.
20553 * config/i386/i386-protos.h (ix86_expand_vecmul_qihi): Deleted.
20554 (ix86_expand_vec_shift_qihi_constant): Deleted.
20555 * config/i386/sse.md (VI12_256_512_AVX512VL): New mode
20557 (mulv8qi3): Call ix86_expand_vecop_qihi directly, add
20558 condition TARGET_64BIT.
20559 (mul<mode>3): Ditto.
20560 (<insn><mode>3): Ditto.
20561 (vlshr<mode>3): Extend to support avx512 vlshr.
20562 (v<insn><mode>3): New expander for
20564 (v<insn>v8qi3): Ditto.
20565 (vashrv8hi3<mask_name>): Renamed to ..
20566 (vashr<mode>3): And extend to support V16QImode for avx512.
20567 (vashrv16qi3): Deleted.
20568 (vashrv2di3<mask_name>): Extend expander to support avx512
20571 2021-06-23 Dimitar Dimitrov <dimitar@dinux.eu>
20573 * doc/lto.texi (Design Overview): Update that slim objects are
20576 2021-06-23 Aaron Sawdey <acsawdey@linux.ibm.com>
20578 * config/rs6000/rs6000-cpus.def: Take OPTION_MASK_PCREL_OPT out
20579 of OTHER_POWER10_MASKS so it will not be enabled by default.
20581 2021-06-23 Richard Biener <rguenther@suse.de>
20582 Martin Jambor <mjambor@suse.cz>
20584 * tree-inline.c (setup_one_parameter): Set TREE_READONLY of the
20585 param replacement unconditionally. Adjust comment.
20587 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20589 * Makefile.in (OBJS): Add gimple-range-fold.o
20590 * gimple-range-fold.cc: New.
20591 * gimple-range-fold.h: New.
20592 * gimple-range-gori.cc (gimple_range_calc_op1): Move to here.
20593 (gimple_range_calc_op2): Ditto.
20594 * gimple-range-gori.h: Move prototypes to here.
20595 * gimple-range.cc: Adjust include files.
20596 (fur_source:fur_source): Relocate to gimple-range-fold.cc.
20597 (fur_source::get_operand): Ditto.
20598 (fur_source::get_phi_operand): Ditto.
20599 (fur_source::query_relation): Ditto.
20600 (fur_source::register_relation): Ditto.
20601 (class fur_edge): Ditto.
20602 (fur_edge::fur_edge): Ditto.
20603 (fur_edge::get_operand): Ditto.
20604 (fur_edge::get_phi_operand): Ditto.
20605 (fur_stmt::fur_stmt): Ditto.
20606 (fur_stmt::get_operand): Ditto.
20607 (fur_stmt::get_phi_operand): Ditto.
20608 (fur_stmt::query_relation): Ditto.
20609 (class fur_depend): Relocate to gimple-range-fold.h.
20610 (fur_depend::fur_depend): Relocate to gimple-range-fold.cc.
20611 (fur_depend::register_relation): Ditto.
20612 (fur_depend::register_relation): Ditto.
20613 (class fur_list): Ditto.
20614 (fur_list::fur_list): Ditto.
20615 (fur_list::get_operand): Ditto.
20616 (fur_list::get_phi_operand): Ditto.
20617 (fold_range): Ditto.
20618 (adjust_pointer_diff_expr): Ditto.
20619 (gimple_range_adjustment): Ditto.
20620 (gimple_range_base_of_assignment): Ditto.
20621 (gimple_range_operand1): Ditto.
20622 (gimple_range_operand2): Ditto.
20623 (gimple_range_calc_op1): Relocate to gimple-range-gori.cc.
20624 (gimple_range_calc_op2): Ditto.
20625 (fold_using_range::fold_stmt): Relocate to gimple-range-fold.cc.
20626 (fold_using_range::range_of_range_op): Ditto.
20627 (fold_using_range::range_of_address): Ditto.
20628 (fold_using_range::range_of_phi): Ditto.
20629 (fold_using_range::range_of_call): Ditto.
20630 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
20631 (fold_using_range::range_of_builtin_call): Ditto.
20632 (fold_using_range::range_of_cond_expr): Ditto.
20633 (fold_using_range::range_of_ssa_name_with_loop_info): Ditto.
20634 (fold_using_range::relation_fold_and_or): Ditto.
20635 (fold_using_range::postfold_gcond_edges): Ditto.
20636 * gimple-range.h: Add gimple-range-fold.h to include files. Change
20637 GIMPLE_RANGE_STMT_H to GIMPLE_RANGE_H.
20638 (gimple_range_handler): Relocate to gimple-range-fold.h.
20639 (gimple_range_ssa_p): Ditto.
20640 (range_compatible_p): Ditto.
20641 (class fur_source): Ditto.
20642 (class fur_stmt): Ditto.
20643 (class fold_using_range): Ditto.
20644 (gimple_range_calc_op1): Relocate to gimple-range-gori.h
20645 (gimple_range_calc_op2): Ditto.
20647 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20649 PR tree-optimization/101148
20650 PR tree-optimization/101014
20651 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
20652 (ranger_cache::~ranger_cache): Adjust.
20653 (ranger_cache::block_range): Check if propagation disallowed.
20654 (ranger_cache::propagate_cache): Disallow propagation if new value
20655 can't be stored properly.
20656 * gimple-range-cache.h (ranger_cache::m_propfail): New member.
20658 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20660 * gimple-range-cache.cc (class ssa_block_ranges): Adjust prototype.
20661 (sbr_vector::set_bb_range): Return true.
20662 (class sbr_sparse_bitmap): Adjust.
20663 (sbr_sparse_bitmap::set_bb_range): Return value.
20664 (block_range_cache::set_bb_range): Return value.
20665 (ranger_cache::propagate_cache): Use return value to print msg.
20666 * gimple-range-cache.h (class block_range_cache): Adjust.
20668 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20670 * gimple-range.cc (dump_bb): Use range_on_edge from the cache.
20672 2021-06-23 Jeff Law <jeffreyalaw@gmail.com>
20674 * config/h8300/logical.md (<code><mode>3<ccnz>): Use <cczn>
20675 so this pattern can be used for test/compare removal. Pass
20676 current insn to compute_logical_op_length and output_logical_op.
20677 * config/h8300/h8300.c (compute_logical_op_cc): Remove.
20678 (h8300_and_costs): Add argument to compute_logical_op_length.
20679 (output_logical_op): Add new argument. Use it to determine if the
20680 condition codes are used and adjust the output accordingly.
20681 (compute_logical_op_length): Add new argument and update length
20682 computations when condition codes are used.
20683 * config/h8300/h8300-protos.h (compute_logical_op_length): Update
20685 (output_logical_op): Likewise.
20687 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
20690 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
20691 Handle 64bit modes for TARGET_XOP. Use indirect gen_* functions.
20692 * config/i386/mmx.md (mmx_ppermv64): New insn pattern.
20693 * config/i386/i386.md (unspec): Move UNSPEC_XOP_PERMUTE from ...
20694 * config/i386/sse.md (unspec): ... here.
20696 2021-06-23 Martin Liska <mliska@suse.cz>
20699 * optc-save-gen.awk: Put back arm_fp16_format to
20702 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
20705 * config/i386/i386.md (bsr_rex64): Add zero-flag setting RTX.
20708 (clz<mode>2): Update RTX pattern for additions.
20710 2021-06-23 Jakub Jelinek <jakub@redhat.com>
20712 PR middle-end/101167
20713 * omp-low.c (lower_omp_regimplify_p): Regimplify also PARM_DECLs
20714 and RESULT_DECLs that have DECL_HAS_VALUE_EXPR_P set.
20716 2021-06-22 Sergei Trofimovich <siarheit@google.com>
20718 * doc/rtl.texi: drop unbalanced parenthesis.
20720 2021-06-22 Richard Biener <rguenther@suse.de>
20722 PR middle-end/101156
20723 * gimplify.c (gimplify_expr): Remove premature incorrect
20726 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20728 PR tree-optimization/101159
20729 * tree-vect-patterns.c (vect_recog_popcount_pattern): Fix some
20732 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20734 PR middle-end/101160
20735 * function.c (assign_parms): For decl_result with TYPE_EMPTY_P type
20736 clear crtl->return_rtx instead of keeping it referencing a pseudo.
20738 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20739 Andrew Pinski <apinski@marvell.com>
20741 PR tree-optimization/101162
20742 * fold-const.c (range_check_type): Handle OFFSET_TYPE like pointer
20745 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20747 * range-op.cc (range_relational_tests): New.
20748 (range_op_tests): Call range_relational_tests.
20750 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20752 * range-op.cc (operator_cast::lhs_op1_relation): New.
20753 (operator_identity::lhs_op1_relation): Mew.
20755 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20757 * range-op.cc (operator_minus::op1_op2_relation_effect): New.
20759 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20761 * range-op.cc (operator_plus::lhs_op1_relation): New.
20762 (operator_plus::lhs_op2_relation): New.
20764 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20766 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a
20767 relation_oracle if dominators exist.
20768 (ranger_cache::~ranger_cache): Dispose of oracle.
20769 (ranger_cache::dump_bb): Dump oracle.
20770 * gimple-range.cc (fur_source::fur_source): New.
20771 (fur_source::get_operand): Use mmeber query.
20772 (fur_source::get_phi_operand): Use member_query.
20773 (fur_source::query_relation): New.
20774 (fur_source::register_dependency): Delete.
20775 (fur_source::register_relation): New.
20776 (fur_edge::fur_edge): Adjust.
20777 (fur_edge::get_phi_operand): Fix comment.
20778 (fur_edge::query): Delete.
20779 (fur_stmt::fur_stmt): Adjust.
20780 (fur_stmt::query): Delete.
20781 (fur_depend::fur_depend): Adjust.
20782 (fur_depend::register_relation): New.
20783 (fur_depend::register_relation): New.
20784 (fur_list::fur_list): Adjust.
20785 (fur_list::get_operand): Use member query.
20786 (fold_using_range::range_of_range_op): Process and query relations.
20787 (fold_using_range::range_of_address): Adjust dependency call.
20788 (fold_using_range::range_of_phi): Ditto.
20789 (gimple_ranger::gimple_ranger): New. Use ranger_ache oracle.
20790 (fold_using_range::relation_fold_and_or): New.
20791 (fold_using_range::postfold_gcond_edges): New.
20792 * gimple-range.h (class gimple_ranger): Adjust.
20793 (class fur_source): Adjust members.
20794 (class fur_stmt): Ditto.
20795 (class fold_using_range): Ditto.
20797 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20799 * range-op.cc (range_operator::wi_fold): Apply relation effect.
20800 (range_operator::fold_range): Adjust and apply relation effect.
20801 (*::fold_range): Add relation parameters.
20802 (*::op1_range): Ditto.
20803 (*::op2_range): Ditto.
20804 (range_operator::lhs_op1_relation): New.
20805 (range_operator::lhs_op2_relation): New.
20806 (range_operator::op1_op2_relation): New.
20807 (range_operator::op1_op2_relation_effect): New.
20808 (relop_early_resolve): New.
20809 (operator_equal::op1_op2_relation): New.
20810 (operator_equal::fold_range): Call relop_early_resolve.
20811 (operator_not_equal::op1_op2_relation): New.
20812 (operator_not_equal::fold_range): Call relop_early_resolve.
20813 (operator_lt::op1_op2_relation): New.
20814 (operator_lt::fold_range): Call relop_early_resolve.
20815 (operator_le::op1_op2_relation): New.
20816 (operator_le::fold_range): Call relop_early_resolve.
20817 (operator_gt::op1_op2_relation): New.
20818 (operator_gt::fold_range): Call relop_early_resolve.
20819 (operator_ge::op1_op2_relation): New.
20820 (operator_ge::fold_range): Call relop_early_resolve.
20821 * range-op.h (class range_operator): Adjust parameters and methods.
20823 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20825 * Makefile.in (OBJS): Add value-relation.o.
20826 * gimple-range.h: Adjust include files.
20827 * tree-data-ref.c: Adjust include file order.
20828 * value-query.cc (range_query::get_value_range): Default to no oracle.
20829 (range_query::query_relation): New.
20830 (range_query::query_relation): New.
20831 * value-query.h (class range_query): Adjust.
20832 * value-relation.cc: New.
20833 * value-relation.h: New.
20835 2021-06-22 Richard Biener <rguenther@suse.de>
20837 PR tree-optimization/101151
20838 * tree-ssa-sink.c (statement_sink_location): Expand irreducible
20841 2021-06-22 Jojo R <rjiejie@linux.alibaba.com>
20843 * config/riscv/riscv.c (thead_c906_tune_info): New.
20844 (riscv_tune_info_table): Use new tune.
20846 2021-06-22 Richard Biener <rguenther@suse.de>
20848 PR tree-optimization/101158
20849 * tree-vect-slp.c (vect_build_slp_tree_1): Move same operand
20850 checking after checking for matching operation.
20852 2021-06-22 Richard Biener <rguenther@suse.de>
20854 PR tree-optimization/101159
20855 * tree-vect-patterns.c (vect_recog_popcount_pattern): Add
20856 missing NULL vectype check.
20858 2021-06-22 Richard Biener <rguenther@suse.de>
20860 PR tree-optimization/101154
20861 * tree-vect-slp.c (vect_build_slp_tree_2): Fix out-of-bound access.
20863 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20866 * config/i386/i386-protos.h (ix86_last_zero_store_uid): Declare.
20867 * config/i386/i386-expand.c (ix86_last_zero_store_uid): New variable.
20868 * config/i386/i386.c (ix86_expand_prologue): Clear it.
20869 * config/i386/i386.md (peephole2s for 1/2/4 stores of const0_rtx):
20870 Remove "" from match_operand. Emit new insns using emit_move_insn and
20871 set ix86_last_zero_store_uid to INSN_UID of the last store.
20872 Add peephole2s for 1/2/4 stores of const0_rtx following previous
20875 2021-06-22 Martin Liska <mliska@suse.cz>
20877 * auto-profile.c (AUTO_PROFILE_VERSION): Bump as string format
20880 2021-06-22 Martin Liska <mliska@suse.cz>
20882 * gcov-io.h: Remove padding entries.
20884 2021-06-22 liuhongt <hongtao.liu@intel.com>
20886 PR tree-optimization/97770
20887 * tree-vect-patterns.c (vect_recog_popcount_pattern):
20889 (vect_recog_func vect_vect_recog_func_ptrs): Add new pattern.
20891 2021-06-22 liuhongt <hongtao.liu@intel.com>
20894 * config/i386/i386-builtin.def (BDESC): Adjust builtin name.
20895 * config/i386/sse.md (<avx512>_expand<mode>_mask): Rename to ..
20896 (expand<mode>_mask): this ..
20897 (*expand<mode>_mask): New pre_reload splitter to transform
20898 v{,p}expand* to vmov* when mask is zero, all ones, or has all
20899 ones in it's lower part, otherwise still generate
20902 2021-06-22 liuhongt <hongtao.liu@intel.com>
20905 * config/i386/i386-expand.c
20906 (ix86_expand_special_args_builtin): Keep constm1_operand only
20907 if it satisfies insn's operand predicate.
20909 2021-06-21 Jason Merrill <jason@redhat.com>
20912 * df-scan.c (df_ref_record): Check that regno < endregno.
20913 * function.c (assign_parms, expand_function_end): Do nothing with a
20914 TYPE_EMPTY_P result.
20916 2021-06-21 Richard Biener <rguenther@suse.de>
20918 PR tree-optimization/101120
20919 * tree-vect-data-refs.c (bump_vector_ptr): Fold the
20921 * tree-vect-slp.c (vect_transform_slp_perm_load): Add
20922 DR chain DCE capability.
20923 * tree-vectorizer.h (vect_transform_slp_perm_load): Adjust.
20924 * tree-vect-stmts.c (vectorizable_load): Remove unused
20925 loads in the DR chain for SLP.
20927 2021-06-21 Jakub Jelinek <jakub@redhat.com>
20929 PR inline-asm/100785
20930 * gimplify.c (gimplify_asm_expr): Don't diagnose errors if
20931 output or input operands were already error_mark_node.
20932 * cfgexpand.c (expand_asm_stmt): If errors are emitted,
20933 remove all inputs, outputs and clobbers from the asm and
20934 set template to "".
20936 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20938 * config/arm/arm_neon.h (vceq_s8): Replace builtin with __a == __b.
20939 (vceq_s16): Likewise.
20940 (vceq_s32): Likewise.
20941 (vceq_u8): Likewise.
20942 (vceq_u16): Likewise.
20943 (vceq_u32): Likewise.
20944 (vceq_p8): Likewise.
20945 (vceqq_s8): Likewise.
20946 (vceqq_s16): Likewise.
20947 (vceqq_s32): Likewise.
20948 (vceqq_u8): Likewise.
20949 (vceqq_u16): Likewise.
20950 (vceqq_u32): Likewise.
20951 (vceqq_p8): Likewise.
20952 (vceq_f32): Gate __a == __b on __FAST_MATH__.
20953 (vceqq_f32): Likewise.
20954 (vceq_f16): Likewise.
20955 (vceqq_f16): Likewise.
20957 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20960 * config/arm/iterators.md (NEON_VACMP): Remove.
20961 * config/arm/neon.md (neon_vca<cmp_op><mode>): Use GLTE instead of GTGE
20963 (neon_vca<cmp_op><mode>_insn): Likewise.
20964 (neon_vca<cmp_op_unsp><mode>_insn_unspec): Use NEON_VAGLTE instead of
20967 2021-06-21 Richard Biener <rguenther@suse.de>
20969 PR tree-optimization/101121
20970 * tree-vect-slp.c (vect_build_slp_tree_2): To not fail fatally
20971 when we just lack a stmt with the desired op when doing permutation.
20972 (vect_build_slp_tree): When caching a failed SLP build attempt
20973 assert that at least one lane is marked as not matching.
20975 2021-06-21 liuhongt <hongtao.liu@intel.com>
20978 * config/i386/i386.md: (*anddi_1): Disparage slightly the mask
20979 register alternative.
20980 (*and<mode>_1): Ditto.
20982 (*andn<mode>_1): Ditto.
20983 (*<code><mode>_1): Ditto.
20984 (*<code>qi_1): Ditto.
20985 (*one_cmpl<mode>2_1): Ditto.
20986 (*one_cmplsi2_1_zext): Ditto.
20987 (*one_cmplqi2_1): Ditto.
20988 * config/i386/i386.c (x86_order_regs_for_local_alloc): Change
20989 the order of mask registers to be before general registers.
20991 2021-06-21 Roger Sayle <roger@nextmovesoftware.com>
20994 * config/i386/i386.md: New define_peephole2s to shrink writing
20995 1, 2 or 4 consecutive zeros to memory when optimizing for size.
20997 2021-06-18 Jeff Law <jeffreyalaw@gmail.com>
20999 * config/h8300/h8300.c (h8300_select_cc_mode): Handle SYMBOL_REF.
21000 * config/h8300/logical.md (<code><mode>3 logcial expander): Generate
21001 more efficient code when the source can be trivially simplified.
21003 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
21005 * gimple-range-cache.cc (ranger_cache::range_of_def): Calculate
21006 a range if global is not available.
21007 (ranger_cache::entry_range): Fallback to range_of_def.
21008 * gimple-range-cache.h (range_of_def): Adjust prototype.
21010 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
21012 PR tree-optimization/101014
21013 * gimple-range-cache.cc (ranger_cache::ranger_cache): Remove poor
21015 (ranger_cache::~ranger_cache): Ditto.
21016 (ranger_cache::enable_new_values): Delete.
21017 (ranger_cache::push_poor_value): Delete.
21018 (ranger_cache::range_of_def): Remove poor value processing.
21019 (ranger_cache::entry_range): Ditto.
21020 (ranger_cache::fill_block_cache): Ditto.
21021 * gimple-range-cache.h (class ranger_cache): Remove poor value members.
21022 * gimple-range.cc (gimple_ranger::range_of_expr): Remove call.
21023 * gimple-range.h (class gimple_ranger): Adjust.
21025 2021-06-18 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
21028 * common/config/arm/arm-common.c (arm_canon_arch_option_1): New function
21029 derived from arm_canon_arch.
21030 (arm_canon_arch_option): Call it.
21031 (arm_canon_arch_multilib_option): New function.
21032 * config/arm/arm-cpus.in (IGNORE_FOR_MULTILIB): New fgroup.
21033 * config/arm/arm.h (arm_canon_arch_multilib_option): New prototype.
21034 (CANON_ARCH_MULTILIB_SPEC_FUNCTION): New macro.
21035 (MULTILIB_ARCH_CANONICAL_SPECS): New macro.
21036 (DRIVER_SELF_SPECS): Add MULTILIB_ARCH_CANONICAL_SPECS.
21037 * config/arm/arm.opt (mlibarch): New option.
21038 * config/arm/t-rmprofile (MULTILIB_MATCHES): For armv8*-m, replace use
21039 of march on RHS with mlibarch.
21041 2021-06-18 Marcel Vollweiler <marcel@codesourcery.com>
21043 * config.in: Regenerate.
21044 * config/gcn/gcn.c (print_operand_address): Fix for global_load assembler
21046 * configure: Regenerate.
21047 * configure.ac: Fix for global_load assembler functions.
21049 2021-06-18 Richard Biener <rguenther@suse.de>
21051 PR tree-optimization/101112
21052 * tree-vect-slp.c (vect_slp_linearize_chain): Fix condition
21053 to lookup a pattern stmt def.
21055 2021-06-18 Jakub Jelinek <jakub@redhat.com>
21057 PR middle-end/101062
21058 * stor-layout.c (finish_bitfield_layout): Don't add bitfield
21059 representatives in QUAL_UNION_TYPE.
21061 2021-06-18 Andrew Pinski <apinski@marvell.com>
21063 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
21064 Add counting of how many times it is done.
21065 (factor_out_conditional_conversion): Likewise.
21066 (match_simplify_replacement): Likewise.
21067 (value_replacement): Likewise.
21068 (spaceship_replacement): Likewise.
21069 (cond_store_replacement): Likewise.
21070 (cond_if_else_store_replacement_1): Likewise.
21071 (hoist_adjacent_loads): Likewise.
21073 2021-06-18 Andrew Pinski <apinski@marvell.com>
21075 * tree-cfg.c (verify_gimple_assign_unary): Reject point and offset
21076 types on NEGATE_EXPR, ABS_EXPR, BIT_NOT_EXPR, PAREN_EXPR and CNONJ_EXPR.
21077 (verify_gimple_assign_binary): Reject point and offset types on
21078 MULT_EXPR, MULT_HIGHPART_EXPR, TRUNC_DIV_EXPR, CEIL_DIV_EXPR,
21079 FLOOR_DIV_EXPR, ROUND_DIV_EXPR, TRUNC_MOD_EXPR, CEIL_MOD_EXPR,
21080 FLOOR_MOD_EXPR, ROUND_MOD_EXPR, RDIV_EXPR, and EXACT_DIV_EXPR.
21082 2021-06-18 Michael Meissner <meissner@linux.ibm.com>
21084 * config/rs6000/rs6000.c (rs6000_emit_minmax): Add support for ISA
21085 3.1 IEEE 128-bit floating point xsmaxcqp/xsmincqp instructions.
21086 * config/rs6000/rs6000.md (s<minmax><mode>3, IEEE128 iterator):
21089 2021-06-17 Aaron Sawdey <acsawdey@linux.ibm.com>
21091 * config/rs6000/genfusion.pl (gen_logical_addsubf): Add
21092 earlyclobber to alts 0/1.
21093 (gen_addadd): Add earlyclobber to alts 0/1.
21094 * config/rs6000/fusion.md: Regenerate file.
21096 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21098 * cfgloopanal.c (get_loop_hot_path): Make path an auto_vec.
21100 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21102 * gimple-range-cache.cc: Comment cleanups.
21103 * gimple-range-gori.cc: Comment cleanups.
21104 * gimple-range.cc: Comment/spacing cleanups
21105 * value-range.h: Comment cleanups.
21107 2021-06-17 H.J. Lu <hjl.tools@gmail.com>
21110 * calls.c (expand_call): Replace PUSH_ARGS with
21111 targetm.calls.push_argument (0).
21112 (emit_library_call_value_1): Likewise.
21113 * defaults.h (PUSH_ARGS): Removed.
21114 (PUSH_ARGS_REVERSED): Replace PUSH_ARGS with
21115 targetm.calls.push_argument (0).
21116 * expr.c (block_move_libcall_safe_for_call_parm): Likewise.
21117 (emit_push_insn): Pass the number bytes to push to
21118 targetm.calls.push_argument and pass 0 if ARGS_ADDR is 0.
21119 * hooks.c (hook_bool_uint_true): New.
21120 * hooks.h (hook_bool_uint_true): Likewise.
21121 * rtlanal.c (nonzero_bits1): Replace PUSH_ARGS with
21122 targetm.calls.push_argument (0).
21123 * target.def (push_argument): Add a targetm.calls hook.
21124 * targhooks.c (default_push_argument): New.
21125 * targhooks.h (default_push_argument): Likewise.
21126 * config/bpf/bpf.h (PUSH_ARGS): Removed.
21127 * config/cr16/cr16.c (TARGET_PUSH_ARGUMENT): New.
21128 * config/cr16/cr16.h (PUSH_ARGS): Removed.
21129 * config/i386/i386.c (ix86_push_argument): New.
21130 (TARGET_PUSH_ARGUMENT): Likewise.
21131 * config/i386/i386.h (PUSH_ARGS): Removed.
21132 * config/m32c/m32c.c (TARGET_PUSH_ARGUMENT): New.
21133 * config/m32c/m32c.h (PUSH_ARGS): Removed.
21134 * config/nios2/nios2.h (PUSH_ARGS): Likewise.
21135 * config/pru/pru.h (PUSH_ARGS): Likewise.
21136 * doc/tm.texi.in: Remove PUSH_ARGS documentation. Add
21137 TARGET_PUSH_ARGUMENT hook.
21138 * doc/tm.texi: Regenerated.
21140 2021-06-17 Uroš Bizjak <ubizjak@gmail.com>
21143 * config/i386/i386-expand.c (expand_vector_set_var):
21144 Handle V2FS mode remapping. Pass TARGET_MMX_WITH_SSE to
21145 ix86_expand_vector_init_duplicate.
21146 (ix86_expand_vector_init_duplicate): Emit insv_1 for
21147 QImode for !TARGET_PARTIAL_REG_STALL.
21148 * config/i386/predicates.md (vec_setm_mmx_operand): New predicate.
21149 * config/i386/mmx.md (vec_setv2sf): Use vec_setm_mmx_operand
21150 as operand 2 predicate. Call ix86_expand_vector_set_var
21151 for non-constant index operand.
21152 (vec_setv2si): Ditto.
21153 (vec_setv4hi): Ditto.
21154 (vec_setv8qi): ditto.
21156 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
21158 PR tree-optimization/100790
21159 * gimple-range.cc (range_of_builtin_call): Cleanup clz and ctz
21162 2021-06-17 Martin Liska <mliska@suse.cz>
21164 * doc/invoke.texi: Use consistently -O1 instead of -O.
21166 2021-06-17 Martin Liska <mliska@suse.cz>
21168 * gcov-io.h: Update documentation entry about string format.
21170 2021-06-17 Marius Hillenbrand <mhillen@linux.ibm.com>
21173 * config/s390/vecintrin.h (vec_doublee): Fix to use
21174 __builtin_s390_vflls.
21175 (vec_floate): Fix to use __builtin_s390_vflrd.
21177 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21179 * dominance.c (get_dominated_to_depth): Return auto_vec<basic_block>.
21180 * dominance.h (get_dominated_to_depth): Likewise.
21181 (get_all_dominated_blocks): Likewise.
21182 * cfgcleanup.c (delete_unreachable_blocks): Adjust.
21183 * gcse.c (hoist_code): Likewise.
21184 * tree-cfg.c (remove_edge_and_dominated_blocks): Likewise.
21185 * tree-parloops.c (oacc_entry_exit_ok): Likewise.
21186 * tree-ssa-dce.c (eliminate_unnecessary_stmts): Likewise.
21187 * tree-ssa-phiprop.c (pass_phiprop::execute): Likewise.
21189 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21191 * dominance.c (get_dominated_by_region): Return auto_vec<basic_block>.
21192 * dominance.h (get_dominated_by_region): Likewise.
21193 * tree-cfg.c (gimple_duplicate_sese_region): Adjust.
21194 (gimple_duplicate_sese_tail): Likewise.
21195 (move_sese_region_to_fn): Likewise.
21197 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21199 * dominance.c (get_dominated_by): Return auto_vec<basic_block>.
21200 * dominance.h (get_dominated_by): Likewise.
21201 * auto-profile.c (afdo_find_equiv_class): Adjust.
21202 * cfgloopmanip.c (duplicate_loop_to_header_edge): Likewise.
21203 * loop-unroll.c (unroll_loop_runtime_iterations): Likewise.
21204 * tree-cfg.c (test_linear_chain): Likewise.
21205 (test_diamond): Likewise.
21207 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21209 * cfgloop.h (get_loop_hot_path): Return auto_vec<basic_block>.
21210 * cfgloopanal.c (get_loop_hot_path): Likewise.
21211 * tree-ssa-loop-ivcanon.c (tree_estimate_loop_size): Likewise.
21213 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21215 * cgraph.c (cgraph_node::collect_callers): Return
21216 auto_vec<cgraph_edge *>.
21217 * cgraph.h (cgraph_node::collect_callers): Likewise.
21218 * ipa-cp.c (create_specialized_node): Adjust.
21219 (decide_about_value): Likewise.
21220 (decide_whether_version_node): Likewise.
21221 * ipa-sra.c (process_isra_node_results): Likewise.
21223 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21225 * vec.h (vl_ptr>::using_auto_storage): Handle null m_vec.
21226 (auto_vec<T, 0>::auto_vec): Define move constructor, and delete copy
21228 (auto_vec<T, 0>::operator=): Define move assignment and delete copy
21231 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
21233 * gimple-range.cc (debug_seed_ranger): New.
21234 (dump_ranger): New.
21235 (debug_ranger): New.
21237 2021-06-17 Richard Biener <rguenther@suse.de>
21239 PR tree-optimization/54400
21240 * tree-vectorizer.h (enum slp_instance_kind): Add
21241 slp_inst_kind_bb_reduc.
21242 (reduction_fn_for_scalar_code): Declare.
21243 * tree-vect-data-refs.c (vect_slp_analyze_instance_dependence):
21244 Check SLP_INSTANCE_KIND instead of looking at the
21246 (vect_slp_analyze_instance_alignment): Likewise.
21247 * tree-vect-loop.c (reduction_fn_for_scalar_code): Export.
21248 * tree-vect-slp.c (vect_slp_linearize_chain): Split out
21249 chain linearization from vect_build_slp_tree_2 and generalize
21250 for the use of BB reduction vectorization.
21251 (vect_build_slp_tree_2): Adjust accordingly.
21252 (vect_optimize_slp): Elide permutes at the root of BB reduction
21254 (vectorizable_bb_reduc_epilogue): New function.
21255 (vect_slp_prune_covered_roots): Likewise.
21256 (vect_slp_analyze_operations): Use them.
21257 (vect_slp_check_for_constructors): Recognize associatable
21258 chains for BB reduction vectorization.
21259 (vectorize_slp_instance_root_stmt): Generate code for the
21260 BB reduction epilogue.
21262 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21264 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Check with
21266 (gori_compute::may_recompute_p): New.
21267 (gori_compute::outgoing_edge_range_p): Perform recomputations.
21268 * gimple-range-gori.h (class gori_compute): Add prototype.
21270 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21272 * gimple-range-cache.cc (ranger_cache::range_on_edge): Always return
21273 true when a range can be calculated.
21274 * gimple-range.cc (gimple_ranger::dump_bb): Check has_edge_range_p.
21276 2021-06-16 Martin Sebor <msebor@redhat.com>
21278 * doc/invoke.texi (-Wmismatched-dealloc, -Wmismatched-new-delete):
21279 Correct documented defaults.
21281 2021-06-16 Andrew MacLeod <amacleod@redhat.com>
21283 * gimple-range-cache.cc (ranger_cache::ranger_cache): Initialize
21284 m_new_value_p directly.
21286 2021-06-16 Uroš Bizjak <ubizjak@gmail.com>
21289 * config/i386/i386-expand.c (expand_vec_perm_2perm_pblendv):
21290 Handle 64bit modes for TARGET_SSE4_1.
21291 (expand_vec_perm_pshufb2): Handle 64bit modes for TARGET_SSSE3.
21292 (expand_vec_perm_even_odd_pack): Handle V4HI mode.
21293 (expand_vec_perm_even_odd_1) <case E_V4HImode>: Expand via
21294 expand_vec_perm_pshufb2 for TARGET_SSSE3 and via
21295 expand_vec_perm_even_odd_pack for TARGET_SSE4_1.
21296 * config/i386/mmx.md (mmx_packusdw): New insn pattern.
21298 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21300 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn<mode>):
21301 Change to an expander that emits the correct instruction
21302 depending on endianness.
21303 (aarch64_<sur><addsub>hn<mode>_insn_le): Define.
21304 (aarch64_<sur><addsub>hn<mode>_insn_be): Define.
21306 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21308 * config/aarch64/aarch64-simd-builtins.def: Split generator
21309 for aarch64_<su>qmovn builtins into scalar and vector
21311 * config/aarch64/aarch64-simd.md (aarch64_<su>qmovn<mode>_insn_le):
21313 (aarch64_<su>qmovn<mode>_insn_be): Define.
21314 (aarch64_<su>qmovn<mode>): Split into scalar and vector
21315 variants. Change vector variant to an expander that emits the
21316 correct instruction depending on endianness.
21318 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21320 * config/aarch64/aarch64-simd-builtins.def: Split generator
21321 for aarch64_sqmovun builtins into scalar and vector variants.
21322 * config/aarch64/aarch64-simd.md (aarch64_sqmovun<mode>):
21323 Split into scalar and vector variants. Change vector variant
21324 to an expander that emits the correct instruction depending
21326 (aarch64_sqmovun<mode>_insn_le): Define.
21327 (aarch64_sqmovun<mode>_insn_be): Define.
21329 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21331 * config/aarch64/aarch64-simd.md (aarch64_xtn<mode>_insn_le):
21332 Define - modeling zero-high-half semantics.
21333 (aarch64_xtn<mode>): Change to an expander that emits the
21334 appropriate instruction depending on endianness.
21335 (aarch64_xtn<mode>_insn_be): Define - modeling zero-high-half
21337 (aarch64_xtn2<mode>_le): Rename to...
21338 (aarch64_xtn2<mode>_insn_le): This.
21339 (aarch64_xtn2<mode>_be): Rename to...
21340 (aarch64_xtn2<mode>_insn_be): This.
21341 (vec_pack_trunc_<mode>): Emit truncation instruction instead
21343 * config/aarch64/iterators.md (Vnarrowd): Add Vnarrowd mode
21344 attribute iterator.
21346 2021-06-16 Martin Jambor <mjambor@suse.cz>
21348 PR tree-optimization/100453
21349 * tree-sra.c (create_access): Disqualify any const candidates
21350 which are written to.
21351 (sra_modify_expr): Do not store sub-replacements back to a const base.
21352 (handle_unscalarized_data_in_subtree): Likewise.
21353 (sra_modify_assign): Likewise. Earlier, use TREE_READONLy test
21354 instead of constant_decl_p.
21356 2021-06-16 Jakub Jelinek <jakub@redhat.com>
21358 PR middle-end/101062
21359 * stor-layout.c (finish_bitfield_representative): For fields in unions
21360 assume nextf is always NULL.
21361 (finish_bitfield_layout): Compute bit field representatives also in
21362 unions, but handle it as if each bitfield was the only field in the
21365 2021-06-16 Richard Biener <rguenther@suse.de>
21367 PR tree-optimization/101088
21368 * tree-ssa-loop-im.c (sm_seq_valid_bb): Only look for
21369 supported refs on edges. Do not assert same ref but
21370 different kind stores are unsuported but mark them so.
21371 (hoist_memory_references): Only look for supported refs
21374 2021-06-16 Roger Sayle <roger@nextmovesoftware.com>
21376 PR rtl-optimization/46235
21377 * config/i386/i386.md: New define_split for bt followed by cmov.
21378 (*bt<mode>_setcqi): New define_insn_and_split for bt followed by setc.
21379 (*bt<mode>_setncqi): New define_insn_and_split for bt then setnc.
21380 (*bt<mode>_setnc<mode>): New define_insn_and_split for bt followed
21381 by setnc with zero extension.
21383 2021-06-16 Richard Biener <rguenther@suse.de>
21385 PR tree-optimization/101083
21386 * tree-vect-slp.c (vect_slp_build_two_operator_nodes): Get
21387 vectype as argument.
21388 (vect_build_slp_tree_2): Adjust.
21390 2021-06-15 Martin Sebor <msebor@redhat.com>
21392 PR middle-end/100876
21393 * builtins.c: (gimple_call_return_array): Account for size_t
21394 mangling as either unsigned int or unsigned long
21396 2021-06-15 Jeff Law <jeffreyalaw@gmail.com>
21398 * compare-elim.c (try_eliminate_compare): Run DCE to clean things
21399 up before eliminating comparisons.
21401 2021-06-15 Aldy Hernandez <aldyh@redhat.com>
21403 * range-op.cc (operator_bitwise_or::wi_fold): Make sure
21404 nonzero|X is nonzero.
21405 (range_op_bitwise_and_tests): Add tests for above.
21407 2021-06-15 Carl Love <cel@us.ibm.com>
21410 * config/rs6000/rs6000-builtin.def (VCMPEQUT): Fix the ICODE for the
21412 (VRLQ, VSLQ, VSRQ, VSRAQ): Remove unused BU_P10_OVERLOAD_2
21415 2021-06-15 Tobias Burnus <tobias@codesourcery.com>
21418 * gimplify.c (enum gimplify_defaultmap_kind): Add GDMK_SCALAR_TARGET.
21419 (struct gimplify_omp_ctx): Extend defaultmap array by one.
21420 (new_omp_context): Init defaultmap[GDMK_SCALAR_TARGET].
21421 (omp_notice_variable): Update type classification for Fortran.
21422 (gimplify_scan_omp_clauses): Update calls for new argument; handle
21423 GDMK_SCALAR_TARGET; for Fortran, GDMK_POINTER avoid GOVD_MAP_0LEN_ARRAY.
21424 * langhooks-def.h (lhd_omp_scalar_p): Add 'ptr_ok' argument.
21425 * langhooks.c (lhd_omp_scalar_p): Likewise.
21426 (LANG_HOOKS_OMP_ALLOCATABLE_P, LANG_HOOKS_OMP_SCALAR_TARGET_P): New.
21427 (LANG_HOOKS_DECLS): Add them.
21428 * langhooks.h (struct lang_hooks_for_decls): Add new hooks, update
21429 omp_scalar_p pointer type to include the new bool argument.
21431 2021-06-15 David Malcolm <dmalcolm@redhat.com>
21433 * doc/analyzer.texi
21434 (Special Functions for Debugging the Analyzer): Add
21435 __analyzer_dump_capacity.
21437 2021-06-15 Jakub Jelinek <jakub@redhat.com>
21440 * expr.c (expand_expr_real_2) <case VEC_PACK_FIX_TRUNC_EXPR,
21441 case VEC_PACK_TRUNC_EXPR>: Clear subtarget when changing mode.
21443 2021-06-15 Richard Biener <rguenther@suse.de>
21445 * cfgloopanal.c (mark_irreducible_loops): Use a dominance
21446 check to identify loop latches.
21447 * cfgloop.c (verify_loop_structure): Likewise.
21448 * loop-init.c (apply_loop_flags): Allow marked irreducible
21449 regions even with multiple latches.
21450 * predict.c (rebuild_frequencies): Simplify.
21452 2021-06-15 Richard Biener <rguenther@suse.de>
21454 * tree-ssa-threadupdate.c
21455 (jump_thread_path_registry::mark_threaded_blocks): Assert we
21456 have marked irreducible regions.
21458 2021-06-14 Martin Sebor <msebor@redhat.com>
21461 * builtins.c (gimple_call_return_array): Check for attribute fn spec.
21462 Handle calls to placement new.
21463 (ndecl_dealloc_argno): Avoid placement delete.
21465 2021-06-14 Peter Bergner <bergner@linux.ibm.com>
21468 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Use
21469 create_tmp_reg_or_ssa_name().
21471 2021-06-14 Andrew MacLeod <amacleod@redhat.com>
21473 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
21474 (ranger_cache::enable_new_values): Set to specified value and
21475 return the old value.
21476 (ranger_cache::disable_new_values): Delete.
21477 (ranger_cache::fill_block_cache): Disable non 1st order derived
21479 * gimple-range-cache.h (ranger_cache): Adjust prototypes.
21480 * gimple-range.cc (gimple_ranger::range_of_expr): Adjust.
21482 2021-06-14 Uroš Bizjak <ubizjak@gmail.com>
21485 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const):
21486 Return true early when testing with V2HImode.
21487 * config/i386/mmx.md (*punpckwd): Split to sse2_pshuflw_1.
21489 2021-06-14 Christophe Lyon <christophe.lyon@linaro.org>
21491 * config/arm/mve.md (mve_vec_unpack<US>_lo_<mode>): New pattern.
21492 (mve_vec_unpack<US>_hi_<mode>): New pattern.
21493 (@mve_vec_pack_trunc_lo_<mode>): New pattern.
21494 (mve_vmovntq_<supf><mode>): Prefix with '@'.
21495 * config/arm/neon.md (vec_unpack<US>_hi_<mode>): Move to
21497 (vec_unpack<US>_lo_<mode>): Likewise.
21498 (vec_pack_trunc_<mode>): Rename to
21499 neon_quad_vec_pack_trunc_<mode>.
21500 * config/arm/vec-common.md (vec_unpack<US>_hi_<mode>): New
21502 (vec_unpack<US>_lo_<mode>): New.
21503 (vec_pack_trunc_<mode>): New.
21505 2021-06-14 Richard Biener <rguenther@suse.de>
21507 PR tree-optimization/100934
21508 * tree-ssa-dom.c (pass_dominator::execute): Properly
21509 mark irreducible regions.
21511 2021-06-14 Martin Liska <mliska@suse.cz>
21513 * doc/invoke.texi: Put r{...} on the same line as @item.
21515 2021-06-14 Martin Liska <mliska@suse.cz>
21517 * doc/invoke.texi: Add missing newline.
21519 2021-06-14 Martin Liska <mliska@suse.cz>
21521 * doc/invoke.texi: Remove '+' charasters.
21523 2021-06-14 Claudiu Zissulescu <claziss@synopsys.com>
21525 * config.gcc (arc): Add support for with_cpu option.
21526 * config/arc/arc.h (OPTION_DEFAULT_SPECS): Add fpu.
21528 2021-06-14 Richard Biener <rguenther@suse.de>
21530 PR tree-optimization/101031
21531 * tree-ssa-strlen.c (maybe_invalidate): Increment max_size
21532 instead of size when accounting for a possibly string
21535 2021-06-14 Martin Liska <mliska@suse.cz>
21537 * gimple-ssa-evrp.c (pointer_equiv_analyzer::~pointer_equiv_analyzer): Use delete[].
21539 2021-06-14 Aldy Hernandez <aldyh@redhat.com>
21541 * value-query.cc (gimple_range_global): Call get_range_global
21542 if called after inlining.
21544 2021-06-13 Uroš Bizjak <ubizjak@gmail.com>
21547 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
21548 Emit constant permutation insn directly from here.
21550 2021-06-13 Trevor Saunders <tbsaunde@tbsaunde.org>
21552 * attribs.c (find_attribute_namespace): Iterate over vec<> with
21554 * auto-profile.c (afdo_find_equiv_class): Likewise.
21555 * gcc.c (do_specs_vec): Likewise.
21556 (do_spec_1): Likewise.
21557 (driver::set_up_specs): Likewise.
21558 * gimple-loop-jam.c (any_access_function_variant_p): Likewise.
21559 * gimple-ssa-store-merging.c (compatible_load_p): Likewise.
21560 (imm_store_chain_info::try_coalesce_bswap): Likewise.
21561 (imm_store_chain_info::coalesce_immediate_stores): Likewise.
21562 (get_location_for_stmts): Likewise.
21563 * graphite-poly.c (print_iteration_domains): Likewise.
21564 (free_poly_bb): Likewise.
21565 (remove_gbbs_in_scop): Likewise.
21566 (free_scop): Likewise.
21567 (dump_gbb_cases): Likewise.
21568 (dump_gbb_conditions): Likewise.
21569 (print_pdrs): Likewise.
21570 (print_scop): Likewise.
21571 * ifcvt.c (cond_move_process_if_block): Likewise.
21572 * lower-subreg.c (decompose_multiword_subregs): Likewise.
21573 * regcprop.c (pass_cprop_hardreg::execute): Likewise.
21574 * sanopt.c (sanitize_rewrite_addressable_params): Likewise.
21575 * sel-sched-dump.c (dump_insn_vector): Likewise.
21576 * store-motion.c (store_ops_ok): Likewise.
21577 (store_killed_in_insn): Likewise.
21578 * timevar.c (timer::named_items::print): Likewise.
21579 * tree-cfgcleanup.c (cleanup_control_flow_pre): Likewise.
21580 (cleanup_tree_cfg_noloop): Likewise.
21581 * tree-data-ref.c (dump_data_references): Likewise.
21582 (print_dir_vectors): Likewise.
21583 (print_dist_vectors): Likewise.
21584 (dump_data_dependence_relations): Likewise.
21585 (dump_dist_dir_vectors): Likewise.
21586 (dump_ddrs): Likewise.
21587 (create_runtime_alias_checks): Likewise.
21588 (free_subscripts): Likewise.
21589 (save_dist_v): Likewise.
21590 (save_dir_v): Likewise.
21591 (invariant_access_functions): Likewise.
21592 (same_access_functions): Likewise.
21593 (access_functions_are_affine_or_constant_p): Likewise.
21594 (find_data_references_in_stmt): Likewise.
21595 (graphite_find_data_references_in_stmt): Likewise.
21596 (free_dependence_relations): Likewise.
21597 (free_data_refs): Likewise.
21598 * tree-inline.c (copy_debug_stmts): Likewise.
21599 * tree-into-ssa.c (dump_currdefs): Likewise.
21600 (rewrite_update_phi_arguments): Likewise.
21601 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
21602 * tree-vect-data-refs.c (vect_analyze_possibly_independent_ddr):
21604 (vect_slp_analyze_node_dependences): Likewise.
21605 (vect_slp_analyze_instance_dependence): Likewise.
21606 (vect_record_base_alignments): Likewise.
21607 (vect_get_peeling_costs_all_drs): Likewise.
21608 (vect_peeling_supportable): Likewise.
21609 * tree-vectorizer.c (vec_info::~vec_info): Likewise.
21610 (vec_info::free_stmt_vec_infos): Likewise.
21612 2021-06-13 Jeff Law <jeffreyalaw@gmail.com>
21614 * config/h8300/logical.md (<code>qi3_1<cczn>): New pattern.
21615 (andqi3_1<cczn>): Removed.
21616 (<ors>qi3_1): Do not split for IOR/XOR a single bit.
21617 (H8/SX bit logicals): Split out from other patterns.
21618 * config/h8300/multiply.md (mulqihi3_const<cczn>): Renamed from
21619 mulqihi3_const_clobber_flags.
21620 (mulqihi3<cczn>, mulhisi3_const<cczn>, mulhisi3<cczn>): Similarly
21622 2021-06-13 H.J. Lu <hjl.tools@gmail.com>
21625 * config/i386/i386.c (ix86_expand_prologue): Set red_zone_used
21626 to true if red zone is used.
21627 (ix86_output_indirect_jmp): Replace ix86_red_zone_size with
21628 ix86_red_zone_used.
21629 * config/i386/i386.h (machine_function): Add red_zone_used.
21630 (ix86_red_zone_size): Removed.
21631 (ix86_red_zone_used): New.
21632 * config/i386/i386.md (peephole2 patterns): Replace
21633 ix86_red_zone_size with ix86_red_zone_used.
21635 2021-06-12 Jason Merrill <jason@redhat.com>
21637 * doc/extend.texi (unused variable attribute): Applies to
21638 structure fields as well.
21640 2021-06-12 Eugene Rozenfeld <erozen@microsoft.com>
21642 * auto-profile.c (read_profile): fix a typo in an error string
21644 2021-06-11 Thomas Schwinge <thomas@codesourcery.com>
21646 * tree-pretty-print.h (dump_omp_clauses): Add 'bool = true'
21648 * tree-pretty-print.c (dump_omp_clauses): Update.
21649 (dump_generic_node) <OMP_CLAUSE>: Use it.
21651 2021-06-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
21654 * config/arm/arm_mve.h (__arm_vld1q): Change __ARM_mve_coerce(p0,
21655 int8_t const *) to __ARM_mve_coerce1(p0, int8_t *) in the argument for
21656 the polymorphic variants matching code.
21657 (__arm_vld1q_z): Likewise.
21658 (__arm_vld2q): Likewise.
21659 (__arm_vld4q): Likewise.
21660 (__arm_vldrbq_gather_offset): Likewise.
21661 (__arm_vldrbq_gather_offset_z): Likewise.
21663 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
21665 PR tree-optimization/96392
21666 * fold-const.h (tree_expr_maybe_real_minus_zero_p): Fix prototype.
21668 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
21670 PR tree-optimization/96392
21671 * fold-const.c (fold_real_zero_addition_p): Take both arguments
21672 of the addition or subtraction, not just the zero. Use this
21673 other argument in tests for signaling NaNs and signed zeros.
21674 (tree_expr_maybe_real_minus_zero_p): New predicate.
21675 * fold-const.h (fold_real_zero_addition_p): Update prototype.
21676 (tree_expr_maybe_real_minus_zero_p): New function prototype.
21677 * match.pd: Update calls to fold_real_zero_addition_p.
21678 Replace HONOR_NANS with tree_expr_maybe_nan_p.
21679 Replace HONOR_SIGNED_ZEROS with tree_expr_maybe_real_minus_zero_p.
21680 Replace HONOR_SNANS with tree_expr_maybe_signaling_nan_p.
21681 * tree-ssa-reassoc.c (eliminate_using_constants): Update
21682 call to fold_real_zero_addition_p.
21684 2021-06-11 Richard Biener <rguenther@suse.de>
21686 PR tree-optimization/101025
21687 * tree-ssa-loop-im.c (sm_seq_valid_bb): Make sure to process
21688 all refs that require dependence checking.
21690 2021-06-11 Richard Biener <rguenther@suse.de>
21692 PR tree-optimization/101028
21693 * tree-vect-slp.c (vect_build_slp_tree_2): When SLP
21694 reassoc discovery fails fatally, mark appropriate lanes
21697 2021-06-11 Richard Biener <rguenther@suse.de>
21699 PR tree-optimization/101026
21700 * tree-vect-slp.c (vect_build_slp_tree_2): Make sure we
21701 have a representative for the associated chain nodes.
21703 2021-06-11 Jakub Jelinek <jakub@redhat.com>
21705 PR rtl-optimization/101008
21706 * simplify-rtx.c (relational_result): New function.
21707 (simplify_logical_relational_operation,
21708 simplify_relational_operation): Use it.
21710 2021-06-11 Jakub Jelinek <jakub@redhat.com>
21713 * config/i386/sse.md (*vec_concat<mode>_0_1): Require TARGET_SSE2.
21715 2021-06-11 Uroš Bizjak <ubizjak@gmail.com>
21718 * config/i386/i386-expand.c (expand_vec_perm_pshufb): Return
21719 false if the permutation can be implemented with constant
21720 permutation instruction in wider mode.
21721 (canonicalize_vector_int_perm): Move above expand_vec_perm_pshufb.
21722 Handle V8QImode and V4HImode.
21724 2021-06-11 Martin Liska <mliska@suse.cz>
21726 PR gcov-profile/100788
21727 * common.opt: Add new option.
21728 * coverage.c (coverage_begin_function): Emit warning instead on
21729 the internal compiler error.
21730 * doc/invoke.texi: Document the option.
21731 * toplev.c (process_options): Enable it by default.
21733 2021-06-11 Richard Biener <rguenther@suse.de>
21735 PR middle-end/101009
21736 * tree-data-ref.c (build_classic_dist_vector_1): Make sure
21737 to set *init_b to true when we encounter a constant equal
21739 (compute_affine_dependence): Also dump the actual DR_REF.
21741 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
21743 PR tree-optimization/100984
21744 * gimple-ssa-evrp.c (ssa_equiv_stack): Use auto_vec for
21745 replacements table.
21746 (ssa_equiv_stack::~ssa_equiv_stack): Remove.
21748 2021-06-11 Kewen Lin <linkw@linux.ibm.com>
21750 * config/rs6000/rs6000.md
21751 (floatsi<SFDF:mode>2_lfiwax_<QHI:mode>_mem_zext): New
21752 define_insn_and_split.
21754 2021-06-11 Richard Biener <rguenther@suse.de>
21756 * tree-vect-slp.c (vect_build_slp_tree_2): Use stablesort
21757 to sort operands of the associative chain.
21759 2021-06-11 Richard Biener <rguenther@suse.de>
21761 * system.h (gcc_stablesort_r): Declare.
21762 * sort.cc (gcc_sort_r): Support stable sort.
21763 (gcc_stablesort_r): Define.
21764 * vec.h (vec<>::stablesort): Add.
21766 2021-06-10 Uroš Bizjak <ubizjak@gmail.com>
21769 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
21770 Handle V2SF mode. Emit SHUFPS to fixup unpack-high for V2SF mode.
21771 (expand_vec_perm_blend): Handle 64bit modes for TARGET_SSE4_1.
21772 (expand_vec_perm_pshufb): Handle 64bit modes for TARGET_SSSE3.
21773 (expand_vec_perm_pblendv): Handle 64bit modes for TARGET_SSE4_1.
21774 (expand_vec_perm_interleave2): Handle 64bit modes.
21775 (expand_vec_perm_even_odd_pack): Handle V8QI mode.
21776 (expand_vec_perm_even_odd_1): Ditto.
21777 (ix86_vectorize_vec_perm_const): Ditto.
21778 * config/i386/i386.md (UNSPEC_PSHUFB): Move from ...
21779 * config/i386/sse.md: ... here.
21780 * config/i386/mmx.md (*vec_interleave_lowv2sf):
21781 New insn_and_split pattern.
21782 (*vec_interleave_highv2sf): Ditto.
21783 (mmx_pshufbv8qi3): New insn pattern.
21784 (*mmx_pblendw): Ditto.
21786 2021-06-10 Peter Bergner <bergner@linux.ibm.com>
21788 * config/rs6000/rs6000-builtin.def (build_pair): New built-in.
21789 (build_acc): Likewise.
21790 * config/rs6000/rs6000-call.c (mma_expand_builtin): Swap assemble
21791 source operands in little-endian mode.
21792 (rs6000_gimple_fold_mma_builtin): Handle VSX_BUILTIN_BUILD_PAIR.
21793 (mma_init_builtins): Likewise.
21794 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Handle endianness
21795 ordering for the MMA assemble and build source operands.
21796 * doc/extend.texi (__builtin_vsx_build_acc, __builtin_mma_build_pair):
21798 (__builtin_mma_assemble_acc, __builtin_mma_assemble_pair): Remove
21801 2021-06-10 Jeff Law <jeffreyalaw@gmail.com>
21803 * config/h8300/h8300.c (select_cc_mode): Handle MEM. Use
21805 * config/h8300/extensions.md: Replace _clobber_flags patterns
21808 2021-06-10 Robin Dapp <rdapp@linux.ibm.com>
21810 * config/s390/vector.md (vcond_mask_<mode><mode>): Change to
21811 (vcond_mask_<mode><tointvec>): this.
21813 2021-06-10 Andrew Stubbs <ams@codesourcery.com>
21814 Thomas Schwinge <thomas@codesourcery.com>
21816 * omp-builtins.def (BUILT_IN_GOACC_ENTER_EXIT_DATA): Split into...
21817 (BUILT_IN_GOACC_ENTER_DATA, BUILT_IN_GOACC_EXIT_DATA): ... these.
21818 * gimple.h (enum gf_mask): Split
21819 'GF_OMP_TARGET_KIND_OACC_ENTER_EXIT_DATA' into
21820 'GF_OMP_TARGET_KIND_OACC_ENTER_DATA' and
21821 'GF_OMP_TARGET_KIND_OACC_EXIT_DATA'.
21822 (is_gimple_omp_oacc): Update.
21823 * gimple-pretty-print.c (dump_gimple_omp_target): Likewise.
21824 * gimplify.c (gimplify_omp_target_update): Likewise.
21825 * omp-expand.c (expand_omp_target, build_omp_regions_1)
21826 (omp_make_gimple_edges): Likewise.
21827 * omp-low.c (check_omp_nesting_restrictions, lower_omp_target):
21830 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
21832 * value-query.cc (value_query::value_on_edge): Rename name to
21834 (range_query::range_on_edge): Same.
21835 (range_query::value_of_expr): Same.
21836 (range_query::value_on_edge): Same.
21837 * value-query.h (class value_query): Same.
21838 (class range_query): Same.
21840 2021-06-10 Richard Biener <rguenther@suse.de>
21842 PR tree-optimization/101003
21843 * tree-vect-slp.c (vect_build_slp_tree_2): Appropriately
21844 use the pattern stmt defs when linearizing a chain.
21846 2021-06-10 Jakub Jelinek <jakub@redhat.com>
21849 * ifcvt.c (noce_get_alt_condition, noce_try_abs): Use
21850 prev_nonnote_nondebug_insn instead of prev_nonnote_insn.
21852 2021-06-10 Clement Chigot <clement.chigot@atos.net>
21854 * config/rs6000/aix71.h (ASM_CPU_SPEC): Add Power10 directive.
21855 * config/rs6000/aix72.h (ASM_CPU_SPEC): Likewise.
21857 2021-06-09 Andrew Pinski <apinski@marvell.com>
21859 PR tree-optimization/100925
21860 * match.pd (a ? CST1 : CST2): Limit transformations
21861 that would produce a negative to integeral types only.
21862 Change !POINTER_TYPE_P to INTEGRAL_TYPE_P also.
21864 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21867 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21869 * doc/tm.texi: Correctly update.
21871 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21873 * doc/tm.texi: Correctly update.
21875 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
21878 * doc/tm.texi.in (Trampolines): Add a missing blank line.
21880 2021-06-09 Paul Eggert <eggert@cs.ucla.edu>
21883 * doc/invoke.texi (Code Gen Options); Document that -fno-trampolines
21884 and -ftrampolines work only with Ada.
21885 * doc/tm.texi.in (Trampolines): Likewise.
21886 * doc/tm.texi: Regenerated.
21888 2021-06-09 Carl Love <cel@us.ibm.com>
21890 * config/rs6000/altivec.h (vec_signextll, vec_signexti, vec_signextq):
21891 Add define for new builtins.
21892 * config/rs6000/altivec.md(altivec_vreveti2): Add define_expand.
21893 * config/rs6000/rs6000-builtin.def (VSIGNEXTI, VSIGNEXTLL): Add
21894 overloaded builtin definitions.
21895 (VSIGNEXTSB2W, VSIGNEXTSH2W, VSIGNEXTSB2D, VSIGNEXTSH2D,VSIGNEXTSW2D,
21896 VSIGNEXTSD2Q): Add builtin expansions.
21897 (SIGNEXT): Add P10 overload definition.
21898 * config/rs6000/rs6000-call.c (P9V_BUILTIN_VEC_VSIGNEXTI, P9V_BUILTIN_VEC_VSIGNEXTLL,
21899 P10_BUILTIN_VEC_SIGNEXT): Add overloaded argument definitions.
21900 * config/rs6000/vsx.md (vsx_sign_extend_v2di_v1ti): Add define_insn.
21901 (vsignextend_v2di_v1ti, vsignextend_qi_<mode>, vsignextend_hi_<mode>,
21902 vsignextend_si_v2di)[VIlong]: Add define_expand.
21903 Make define_insn vsx_sign_extend_si_v2di visible.
21904 * doc/extend.texi: Add documentation for the vec_signexti,
21905 vec_signextll builtins and vec_signextq.
21907 2021-06-09 Carl Love <cel@us.ibm.com>
21909 * config/rs6000/rs6000.c (__fixkfti, __fixunskfti, __floattikf,
21910 __floatuntikf): Names changed to __fixkfti_sw, __fixunskfti_sw,
21911 __floattikf_sw, __floatuntikf_sw respectively.
21912 * config/rs6000/rs6000.md (floatti<mode>2, floatunsti<mode>2,
21913 fix_trunc<mode>ti2, fixuns_trunc<mode>ti2): Add
21914 define_insn for mode IEEE 128.
21916 2021-06-09 Carl Love <cel@us.ibm.com>
21918 * config/rs6000/altivec.md (altivec_vslq, altivec_vsrq):
21919 Rename to altivec_vslq_<mode>, altivec_vsrq_<mode>, mode VEC_TI.
21920 * config/rs6000/vector.md (VEC_TI): Was named VSX_TI in vsx.md.
21921 (vashlv1ti3): Change to vashl<mode>3, mode VEC_TI.
21922 (vlshrv1ti3): Change to vlshr<mode>3, mode VEC_TI.
21923 * config/rs6000/vsx.md (VSX_TI): Remove define_mode_iterator. Update
21924 uses of VSX_TI to VEC_TI.
21926 2021-06-09 Carl Love <cel@us.ibm.com>
21928 * config/rs6000/dfp.md (floattitd2, fixtdti2): New define_insns.
21930 2021-06-09 Carl Love <cel@us.ibm.com>
21932 * config/rs6000/altivec.h (vec_dive, vec_mod): Add define for new
21934 * config/rs6000/altivec.md (UNSPEC_VMULEUD, UNSPEC_VMULESD,
21935 UNSPEC_VMULOUD, UNSPEC_VMULOSD): New unspecs.
21936 (altivec_eqv1ti, altivec_gtv1ti, altivec_gtuv1ti, altivec_vmuleud,
21937 altivec_vmuloud, altivec_vmulesd, altivec_vmulosd, altivec_vrlq,
21938 altivec_vrlqmi, altivec_vrlqmi_inst, altivec_vrlqnm,
21939 altivec_vrlqnm_inst, altivec_vslq, altivec_vsrq, altivec_vsraq,
21940 altivec_vcmpequt_p, altivec_vcmpgtst_p, altivec_vcmpgtut_p): New
21942 (vec_widen_umult_even_v2di, vec_widen_smult_even_v2di,
21943 vec_widen_umult_odd_v2di, vec_widen_smult_odd_v2di, altivec_vrlqmi,
21944 altivec_vrlqnm): New define_expands.
21945 * config/rs6000/rs6000-builtin.def (VCMPEQUT_P, VCMPGTST_P,
21946 VCMPGTUT_P): Add macro expansions.
21947 (BU_P10V_AV_P): Add builtin predicate definition.
21948 (VCMPGTUT, VCMPGTST, VCMPEQUT, CMPNET, CMPGE_1TI,
21949 CMPGE_U1TI, CMPLE_1TI, CMPLE_U1TI, VNOR_V1TI_UNS, VNOR_V1TI, VCMPNET_P,
21950 VCMPAET_P, VMULEUD, VMULESD, VMULOUD, VMULOSD, VRLQ,
21951 VSLQ, VSRQ, VSRAQ, VRLQNM, DIV_V1TI, UDIV_V1TI, DIVES_V1TI, DIVEU_V1TI,
21952 MODS_V1TI, MODU_V1TI, VRLQMI): New macro expansions.
21953 (VRLQ, VSLQ, VSRQ, VSRAQ, DIVE, MOD): New overload expansions.
21954 * config/rs6000/rs6000-call.c (P10_BUILTIN_VCMPEQUT,
21955 P10V_BUILTIN_CMPGE_1TI, P10V_BUILTIN_CMPGE_U1TI,
21956 P10V_BUILTIN_VCMPGTUT, P10V_BUILTIN_VCMPGTST,
21957 P10V_BUILTIN_CMPLE_1TI, P10V_BUILTIN_VCMPLE_U1TI,
21958 P10V_BUILTIN_DIV_V1TI, P10V_BUILTIN_UDIV_V1TI,
21959 P10V_BUILTIN_VMULESD, P10V_BUILTIN_VMULEUD,
21960 P10V_BUILTIN_VMULOSD, P10V_BUILTIN_VMULOUD,
21961 P10V_BUILTIN_VNOR_V1TI, P10V_BUILTIN_VNOR_V1TI_UNS,
21962 P10V_BUILTIN_VRLQ, P10V_BUILTIN_VRLQMI,
21963 P10V_BUILTIN_VRLQNM, P10V_BUILTIN_VSLQ,
21964 P10V_BUILTIN_VSRQ, P10V_BUILTIN_VSRAQ,
21965 P10V_BUILTIN_VCMPGTUT_P, P10V_BUILTIN_VCMPGTST_P,
21966 P10V_BUILTIN_VCMPEQUT_P, P10V_BUILTIN_VCMPGTUT_P,
21967 P10V_BUILTIN_VCMPGTST_P, P10V_BUILTIN_CMPNET,
21968 P10V_BUILTIN_VCMPNET_P, P10V_BUILTIN_VCMPAET_P,
21969 P10V_BUILTIN_DIVES_V1TI, P10V_BUILTIN_MODS_V1TI,
21970 P10V_BUILTIN_MODU_V1TI):
21971 New overloaded definitions.
21972 (rs6000_gimple_fold_builtin) [P10V_BUILTIN_VCMPEQUT,
21973 P10V_BUILTIN_CMPNET, P10V_BUILTIN_CMPGE_1TI,
21974 P10V_BUILTIN_CMPGE_U1TI, P10V_BUILTIN_VCMPGTUT,
21975 P10V_BUILTIN_VCMPGTST, P10V_BUILTIN_CMPLE_1TI,
21976 P10V_BUILTIN_CMPLE_U1TI]: New case statements.
21977 (rs6000_init_builtins) [bool_V1TI_type_node, int_ftype_int_v1ti_v1ti]:
21979 (altivec_init_builtins): New E_V1TImode case statement.
21980 (builtin_function_type)[P10_BUILTIN_128BIT_VMULEUD,
21981 P10_BUILTIN_128BIT_VMULOUD, P10_BUILTIN_128BIT_DIVEU_V1TI,
21982 P10_BUILTIN_128BIT_MODU_V1TI, P10_BUILTIN_CMPGE_U1TI,
21983 P10_BUILTIN_VCMPGTUT, P10_BUILTIN_VCMPEQUT]: New case statements.
21984 * config/rs6000/rs6000.c (rs6000_handle_altivec_attribute) [E_TImode,
21985 E_V1TImode]: New case statements.
21986 * config/rs6000/rs6000.h (rs6000_builtin_type_index): New enum
21987 value RS6000_BTI_bool_V1TI.
21988 * config/rs6000/vector.md (vector_gtv1ti,vector_nltv1ti,
21989 vector_gtuv1ti, vector_nltuv1ti, vector_ngtv1ti, vector_ngtuv1ti,
21990 vector_eq_v1ti_p, vector_ne_v1ti_p, vector_ae_v1ti_p,
21991 vector_gt_v1ti_p, vector_gtu_v1ti_p, vrotlv1ti3, vashlv1ti3,
21992 vlshrv1ti3, vashrv1ti3): New define_expands.
21993 * config/rs6000/vsx.md (UNSPEC_VSX_DIVSQ, UNSPEC_VSX_DIVUQ,
21994 UNSPEC_VSX_DIVESQ, UNSPEC_VSX_DIVEUQ, UNSPEC_VSX_MODSQ,
21995 UNSPEC_VSX_MODUQ): New unspecs.
21996 (mulv2di3, vsx_div_v1ti, vsx_udiv_v1ti, vsx_dives_v1ti,
21997 vsx_diveu_v1ti, vsx_mods_v1ti, vsx_modu_v1ti, xxswapd_v1ti): New
21999 (vcmpnet): New define_expand.
22000 * doc/extend.texi: Add documentation for the new builtins vec_rl,
22001 vec_rlmi, vec_rlnm, vec_sl, vec_sr, vec_sra, vec_mule, vec_mulo,
22002 vec_div, vec_dive, vec_mod, vec_cmpeq, vec_cmpne, vec_cmpgt, vec_cmplt,
22003 vec_cmpge, vec_cmple, vec_all_eq, vec_all_ne, vec_all_gt, vec_all_lt,
22004 vec_all_ge, vec_all_le, vec_any_eq, vec_any_ne, vec_any_gt, vec_any_lt,
22005 vec_any_ge, vec_any_le.
22007 2021-06-09 Carl Love <cel@us.ibm.com>
22009 * config/rs6000/altivec.md (altivec_vrl<VI_char>mi): Fix
22010 bug in argument generation.
22012 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
22014 * config/arm/iterators.md (<supf>): Remove VCLZQ_U, VCLZQ_S.
22016 * config/arm/mve.md (mve_vclzq_<supf><mode>): Add '@' prefix,
22017 remove <supf> iterator.
22018 (mve_vclzq_u<mode>): New.
22019 * config/arm/neon.md (clz<mode>2): Rename to neon_vclz<mode>.
22020 (neon_vclz<mode): Move to ...
22021 * config/arm/unspecs.md (VCLZQ_U, VCLZQ_S): Remove.
22022 * config/arm/vec-common.md: ... here. Add support for MVE.
22024 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
22026 * config/arm/mve.md (mve_vhaddq_<supf><mode>): Prefix with '@'.
22027 (@mve_vrhaddq_<supf><mode): Likewise.
22028 * config/arm/neon.md (neon_v<r>hadd<sup><mode>): Likewise.
22029 * config/arm/vec-common.md (avg<mode>3_floor, uavg<mode>3_floor)
22030 (avg<mode>3_ceil", uavg<mode>3_ceil): New patterns.
22032 2021-06-09 imba-tjd <109224573@qq.com>
22034 * doc/invoke.texi: Fix typo.
22036 2021-06-09 Roger Sayle <roger@nextmovesoftware.com>
22038 PR middle-end/53267
22039 * fold-const-call.c (fold_const_call_sss) [CASE_CFN_FMOD]:
22040 Support evaluation of fmod/fmodf/fmodl at compile-time.
22042 2021-06-09 Richard Biener <rguenther@suse.de>
22044 PR tree-optimization/100981
22045 * tree-vect-loop.c (vect_create_epilog_for_reduction): Use
22046 gimple_get_lhs to also handle calls.
22047 * tree-vect-slp-patterns.c (complex_pattern::build): Transfer
22050 2021-06-09 Richard Biener <rguenther@suse.de>
22052 PR tree-optimization/97832
22053 * tree-vectorizer.h (_slp_tree::failed): New.
22054 * tree-vect-slp.c (_slp_tree::_slp_tree): Initialize
22056 (_slp_tree::~_slp_tree): Free failed.
22057 (vect_build_slp_tree): Retain failed nodes and record
22058 matches in them, copying that back out when running
22059 into a cached fail. Dump start and end of discovery.
22060 (dt_sort_cmp): New.
22061 (vect_build_slp_tree_2): Handle associatable chains
22062 together doing more aggressive operand swapping.
22064 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
22067 * config.gcc (gcc_cv_initfini_array): Set to yes for Linux and
22069 * doc/install.texi: Require glibc 2.1 and binutils 2.12 for
22070 Linux and GNU targets.
22072 2021-06-09 Richard Biener <rguenther@suse.de>
22074 * tree-vect-stmts.c (vect_is_simple_use): Always get dt
22077 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
22079 * config/arc/arc.md (loop_end): Change it to
22080 define_insn_and_split.
22082 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
22084 * config/arc/arc.md (maddhisi4): Use VMAC2H instruction.
22085 (machi): New pattern.
22086 (umaddhisi4): Use VMAC2HU instruction.
22087 (umachi): New pattern.
22089 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
22091 * config/arc/arc-protos.h (arc_split_move_p): New prototype.
22092 * config/arc/arc.c (arc_split_move_p): New function.
22093 (arc_split_move): Clean up.
22094 * config/arc/arc.md (movdi_insn): Clean up, use arc_split_move_p.
22095 (movdf_insn): Likewise.
22096 * config/arc/simdext.md (mov<VWH>_insn): Likewise.
22098 2021-06-09 Uroš Bizjak <ubizjak@gmail.com>
22101 * config/i386/i386.c (print_operand_address_as): Rename "no_rip"
22102 argument to "raw". Do not emit segment overrides when "raw" is true.
22104 2021-06-09 Martin Liska <mliska@suse.cz>
22106 * doc/gcov.texi: Create a proper JSON files.
22107 * doc/invoke.texi: Remove dots in order to make it a valid
22110 2021-06-09 Xionghu Luo <luoxhu@linux.ibm.com>
22112 * config/rs6000/rs6000-p8swap.c (pattern_is_rotate64): New.
22113 (insn_is_load_p): Use pattern_is_rotate64.
22114 (insn_is_swap_p): Likewise.
22115 (quad_aligned_load_p): Likewise.
22116 (const_load_sequence_p): Likewise.
22117 (replace_swapped_aligned_load): Likewise.
22118 (recombine_lvx_pattern): Likewise.
22119 (recombine_stvx_pattern): Likewise.
22121 2021-06-09 Andrew MacLeod <amacleod@redhat.com>
22123 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Use a
22124 fur_stmt source record.
22125 * gimple-range.cc (fur_source::get_operand): Generic range query.
22126 (fur_source::get_phi_operand): New.
22127 (fur_source::register_dependency): New.
22128 (fur_source::query): New.
22129 (class fur_edge): New. Edge source for operands.
22130 (fur_edge::fur_edge): New.
22131 (fur_edge::get_operand): New.
22132 (fur_edge::get_phi_operand): New.
22133 (fur_edge::query): New.
22134 (fur_stmt::fur_stmt): New.
22135 (fur_stmt::get_operand): New.
22136 (fur_stmt::get_phi_operand): New.
22137 (fur_stmt::query): New.
22138 (class fur_depend): New. Statement source and process dependencies.
22139 (fur_depend::fur_depend): New.
22140 (fur_depend::register_dependency): New.
22141 (class fur_list): New. List source for operands.
22142 (fur_list::fur_list): New.
22143 (fur_list::get_operand): New.
22144 (fur_list::get_phi_operand): New.
22145 (fold_range): New. Instantiate appropriate fur_source class and fold.
22146 (fold_using_range::range_of_range_op): Use new API.
22147 (fold_using_range::range_of_address): Ditto.
22148 (fold_using_range::range_of_phi): Ditto.
22149 (imple_ranger::fold_range_internal): Use fur_depend class.
22150 (fold_using_range::range_of_ssa_name_with_loop_info): Use new API.
22151 * gimple-range.h (class fur_source): Now a base class.
22152 (class fur_stmt): New.
22153 (fold_range): New prototypes.
22154 (fur_source::fur_source): Delete.
22156 2021-06-08 Andrew Pinski <apinski@marvell.com>
22158 PR tree-optimization/25290
22159 * tree-ssa-phiopt.c (xor_replacement): Delete.
22160 (tree_ssa_phiopt_worker): Delete use of xor_replacement.
22161 (match_simplify_replacement): Allow one cheap preparation
22162 statement that can be moved to before the if.
22164 2021-06-08 Pat Haugen <pthaugen@linux.ibm.com>
22166 * config/rs6000/power10.md (power10-fused-load, power10-fused-store,
22167 power10-fused_alu, power10-fused-vec, power10-fused-branch): New.
22169 2021-06-08 Jeff Law <jeffreyalaw@gmail.com>
22171 * config/h8300/logical.md (andqi3_1): Move BCLR case into define_insn_and_split.
22172 Create length attribute on define_insn_and_split. Only split for cases which we
22174 (andqi3_1<cczn>): Renamed from andqi3_1_clobber_flags. Only handle AND here and
22175 fix length computation.
22176 (b<code><mode>msx): Combine QImode and HImode H8/SX patterns using iterator.
22178 2021-06-08 Richard Biener <rguenther@suse.de>
22180 PR tree-optimization/100923
22181 * tree-ssa-sccvn.c (valueize_refs_1): Take a pointer to
22182 the operand vector to be valueized.
22183 (valueize_refs): Likewise.
22184 (valueize_shared_reference_ops_from_ref): Adjust.
22185 (valueize_shared_reference_ops_from_call): Likewise.
22186 (vn_reference_lookup_3): Likewise.
22187 (vn_reference_lookup_pieces): Likewise. Re-valueize
22188 with honoring availability when we are about to create
22189 the ao_ref and valueized before.
22190 (vn_reference_lookup): Likewise.
22191 (vn_reference_insert_pieces): Adjust.
22193 2021-06-08 Richard Biener <rguenther@suse.de>
22195 * tree-vectorizer.h (_slp_instance::root_stmt): Change to...
22196 (_slp_instance::root_stmts): ... a vector.
22197 (SLP_INSTANCE_ROOT_STMT): Rename to ...
22198 (SLP_INSTANCE_ROOT_STMTS): ... this.
22199 (slp_root::root): Change to...
22200 (slp_root::roots): ... a vector.
22201 (slp_root::slp_root): Adjust.
22202 * tree-vect-slp.c (_slp_instance::location): Adjust.
22203 (vect_free_slp_instance): Release the root stmt vector.
22204 (vect_build_slp_instance): Adjust.
22205 (vect_analyze_slp): Likewise.
22206 (_bb_vec_info::~_bb_vec_info): Likewise.
22207 (vect_slp_analyze_operations): Likewise.
22208 (vect_bb_vectorization_profitable_p): Likewise. Adjust
22209 costs for the root stmt.
22210 (vect_slp_check_for_constructors): Gather all BIT_INSERT_EXPRs
22212 (vect_slp_analyze_bb_1): Simplify by marking all root stmts
22214 (vectorize_slp_instance_root_stmt): Adjust.
22215 (vect_schedule_slp): Likewise.
22217 2021-06-08 Aldy Hernandez <aldyh@redhat.com>
22219 * gimple-ssa-evrp.c (class ssa_equiv_stack): New.
22220 (ssa_equiv_stack::ssa_equiv_stack): New.
22221 (ssa_equiv_stack::~ssa_equiv_stack): New.
22222 (ssa_equiv_stack::enter): New.
22223 (ssa_equiv_stack::leave): New.
22224 (ssa_equiv_stack::push_replacement): New.
22225 (ssa_equiv_stack::get_replacement): New.
22226 (is_pointer_ssa): New.
22227 (class pointer_equiv_analyzer): New.
22228 (pointer_equiv_analyzer::pointer_equiv_analyzer): New.
22229 (pointer_equiv_analyzer::~pointer_equiv_analyzer): New.
22230 (pointer_equiv_analyzer::set_global_equiv): New.
22231 (pointer_equiv_analyzer::set_cond_equiv): New.
22232 (pointer_equiv_analyzer::get_equiv): New.
22233 (pointer_equiv_analyzer::enter): New.
22234 (pointer_equiv_analyzer::leave): New.
22235 (pointer_equiv_analyzer::get_equiv_expr): New.
22236 (pta_valueize): New.
22237 (pointer_equiv_analyzer::visit_stmt): New.
22238 (pointer_equiv_analyzer::visit_edge): New.
22239 (hybrid_folder::value_of_expr): Call PTA.
22240 (hybrid_folder::value_on_edge): Same.
22241 (hybrid_folder::pre_fold_bb): New.
22242 (hybrid_folder::post_fold_bb): New.
22243 (hybrid_folder::pre_fold_stmt): New.
22244 (rvrp_folder::pre_fold_bb): New.
22245 (rvrp_folder::post_fold_bb): New.
22246 (rvrp_folder::pre_fold_stmt): New.
22247 (rvrp_folder::value_of_expr): Call PTA.
22248 (rvrp_folder::value_on_edge): Same.
22250 2021-06-08 Jakub Jelinek <jakub@redhat.com>
22253 * tree-inline.c (copy_tree_body_r): For OMP_CLAUSE_DEPEND don't
22254 check TREE_CODE if OMP_CLAUSE_DECL is NULL.
22256 2021-06-08 Richard Biener <rguenther@suse.de>
22258 PR middle-end/100951
22259 * tree-vect-generic.c (expand_vector_piecewise): Build a
22260 VECTOR_CST if all elements are constant.
22261 (expand_vector_condition): Likewise.
22262 (lower_vec_perm): Likewise.
22263 (expand_vector_conversion): Likewise.
22265 2021-06-08 Martin Liska <mliska@suse.cz>
22267 * doc/invoke.texi: Document new param evrp-sparse-threshold.
22269 2021-06-08 Martin Liska <mliska@suse.cz>
22271 * genautomata.c (create_automata): Fix typo.
22273 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
22275 PR tree-optimization/100794
22276 * tree-predcom.c (tree_predictive_commoning_loop): Add parameter
22277 allow_unroll_p and only allow unrolling when it's true.
22278 (tree_predictive_commoning): Add parameter allow_unroll_p and
22280 (run_tree_predictive_commoning): Likewise.
22281 (pass_predcom::gate): Check flag_tree_loop_vectorize and
22282 global_options_set.x_flag_predictive_commoning.
22283 (pass_predcom::execute): Adjust for allow_unroll_p.
22285 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
22287 * tree-predcom.c (execute_pred_commoning): Remove update_ssa call.
22288 (tree_predictive_commoning_loop): Factor some cleanup stuffs into
22289 lambda function cleanup, remove scev_reset call, and adjust return
22291 (tree_predictive_commoning): Adjust for different changed values,
22292 only set flag TODO_update_ssa_only_virtuals if changed.
22293 (pass_data pass_data_predcom): Remove TODO_update_ssa_only_virtuals
22294 from todo_flags_finish.
22296 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
22298 * gimple-range-cache.cc (class sbr_sparse_bitmap): New.
22299 (sbr_sparse_bitmap::sbr_sparse_bitmap): New.
22300 (sbr_sparse_bitmap::bitmap_set_quad): New.
22301 (sbr_sparse_bitmap::bitmap_get_quad): New.
22302 (sbr_sparse_bitmap::set_bb_range): New.
22303 (sbr_sparse_bitmap::get_bb_range): New.
22304 (sbr_sparse_bitmap::bb_range_p): New.
22305 (block_range_cache::block_range_cache): initialize bitmap obstack.
22306 (block_range_cache::~block_range_cache): Destruct obstack.
22307 (block_range_cache::set_bb_range): Decide when to utilze the
22308 sparse on entry cache.
22309 * gimple-range-cache.h (block_range_cache): Add bitmap obstack.
22310 * params.opt (-param=evrp-sparse-threshold): New.
22312 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
22314 * bitmap.c (bitmap_set_aligned_chunk): New.
22315 (bitmap_get_aligned_chunk): New.
22316 (test_aligned_chunk): New.
22317 (bitmap_c_tests): Call test_aligned_chunk.
22318 * bitmap.h (bitmap_set_aligned_chunk, bitmap_get_aligned_chunk): New.
22320 2021-06-07 Uroš Bizjak <ubizjak@gmail.com>
22323 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
22325 (ix86_expand_vector_init_one_nonzero): Ditto.
22326 (ix86_expand_vector_init_one_var): Ditto.
22327 (ix86_expand_vector_init_general): Ditto.
22328 * config/i386/mmx.md (vec_initv4qiqi): New expander.
22330 2021-06-07 Jeff Law <jeffreyalaw@gmail.com>
22332 * config/h8300/movepush.md: Change most _clobber_flags
22333 patterns to instead use <cczn> subst.
22334 (movsi_cczn): New pattern with usable CC cases split out.
22335 (movsi_h8sx_cczn): Likewise.
22337 2021-06-07 Martin Liska <mliska@suse.cz>
22339 * common/common-target.def: Split long lines and replace them
22341 * target.def: Likewise.
22342 * doc/tm.texi: Re-generated.
22344 2021-06-07 Jakub Jelinek <jakub@redhat.com>
22347 * fold-const.c (fold_read_from_vector): Return NULL if trying to
22348 read from a CONSTRUCTOR with vector type elements.
22350 2021-06-07 Jakub Jelinek <jakub@redhat.com>
22352 PR middle-end/100898
22353 * tree-inline.c (copy_bb): Only use gimple_call_arg_ptr if memcpy
22354 should copy any arguments. Don't call gimple_call_num_args
22355 on id->call_stmt or call_stmt more than once.
22357 2021-06-07 liuhongt <hongtao.liu@intel.com>
22360 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): Refine
22362 (<insn>v4siv4di2): Delete constraints for define_expand.
22364 2021-06-07 liuhongt <hongtao.liu@intel.com>
22367 * config/i386/i386-expand.c (ix86_expand_builtin): Remove
22368 assignment of cfun->machine->has_explicit_vzeroupper.
22369 * config/i386/i386-features.c
22370 (ix86_add_reg_usage_to_vzerouppers): Delete.
22371 (ix86_add_reg_usage_to_vzeroupper): Ditto.
22372 (rest_of_handle_insert_vzeroupper): Remove
22373 ix86_add_reg_usage_to_vzerouppers, add df_analyze at the end
22375 (gate): Remove cfun->machine->has_explicit_vzeroupper.
22376 * config/i386/i386-protos.h (ix86_expand_avx_vzeroupper):
22378 * config/i386/i386.c (ix86_insn_callee_abi): New function.
22379 (ix86_initialize_callee_abi): Ditto.
22380 (ix86_expand_avx_vzeroupper): Ditto.
22381 (ix86_hard_regno_call_part_clobbered): Adjust for vzeroupper
22383 (TARGET_INSN_CALLEE_ABI): Define as ix86_insn_callee_abi.
22384 (ix86_emit_mode_set): Call ix86_expand_avx_vzeroupper
22386 * config/i386/i386.h (struct GTY(()) machine_function): Delete
22387 has_explicit_vzeroupper.
22388 * config/i386/i386.md (enum unspec): New member
22390 (ABI_DEFAULT,ABI_VZEROUPPER,ABI_UNKNOWN): New
22391 define_constants for insn callee abi index.
22392 * config/i386/predicates.md (vzeroupper_pattern): Adjust.
22393 * config/i386/sse.md (UNSPECV_VZEROUPPER): Deleted.
22394 (avx_vzeroupper): Call ix86_expand_avx_vzeroupper.
22395 (*avx_vzeroupper): Rename to ..
22396 (avx_vzeroupper_callee_abi): .. this, and adjust pattern as
22397 call_insn which has a special vzeroupper ABI.
22398 (*avx_vzeroupper_1): Deleted.
22400 2021-06-07 liuhongt <hongtao.liu@intel.com>
22403 * df-scan.c (df_get_call_refs): When call_insn is a fake call,
22404 it won't use stack pointer reg.
22405 * final.c (leaf_function_p): When call_insn is a fake call, it
22406 won't affect caller as a leaf function.
22407 * reg-stack.c (callee_clobbers_any_stack_reg): New.
22408 (subst_stack_regs): When call_insn doesn't clobber any stack
22409 reg, don't clear the arguments.
22410 * rtl.c (shallow_copy_rtx): Don't clear flag used when orig is
22412 * shrink-wrap.c (requires_stack_frame_p): No need for stack
22413 frame for a fake call.
22414 * rtl.h (FAKE_CALL_P): New macro.
22416 2021-06-06 Eric Botcazou <ebotcazou@adacore.com>
22418 * config/sparc/sparc-protos.h (order_regs_for_local_alloc): Rename
22420 (sparc_order_regs_for_local_alloc): ...this.
22421 (sparc_leaf_reg_remap): Declare.
22422 * config/sparc/sparc.h (ADJUST_REG_ALLOC_ORDER): Adjust.
22423 (LEAF_REG_REMAP): Reimplement as call to sparc_leaf_reg_remap.
22424 * config/sparc/sparc.c (leaf_reg_remap): Delete.
22425 (order_regs_for_local_alloc): Rename to...
22426 (sparc_order_regs_for_local_alloc): ...this.
22427 (sparc_leaf_reg_remap): New function.
22428 (sparc_conditional_register_usage): Do not modify leaf_reg_remap.
22430 2021-06-06 David Edelsohn <dje.gcc@gmail.com>
22432 * config/rs6000/rs6000.c (rs6000_xcoff_asm_output_aligned_decl_common):
22433 Use assemble_name to output BSS section name.
22435 2021-06-06 Uroš Bizjak <ubizjak@gmail.com>
22437 * config/i386/constraints.md (Bs):
22438 Remove boolean operators from match_test RTX.
22441 (M): Use "mode" variable instead of GET_MODE (op) in match_test RTX.
22444 2021-06-06 Martin Liska <mliska@suse.cz>
22446 * doc/extend.texi: Add missing @headitem.
22447 * doc/invoke.texi: Likewise.
22448 * doc/objc.texi: Likewise.
22450 2021-06-06 Martin Liska <mliska@suse.cz>
22452 * genhooks.c (emit_findices): Remove unused function.
22453 (emit_documentation): Do not call emit_findices
22454 and do not search for @Fcode directives.
22456 2021-06-06 Martin Liska <mliska@suse.cz>
22458 * doc/invoke.texi: Remove extra character.
22460 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22462 * config/sh/sh.md (doloop_end_split): Fix empty split condition.
22464 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22466 * config/sparc/sparc.md (*snedi<W:mode>_zero_vis3,
22467 *neg_snedi<W:mode>_zero_subxc, *plus_snedi<W:mode>_zero,
22468 *plus_plus_snedi<W:mode>_zero, *minus_snedi<W:mode>_zero,
22469 *minus_minus_snedi<W:mode>_zero): Fix empty split condition.
22471 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22473 * config/or1k/or1k.md (*movdi): Fix empty split condition.
22475 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22477 * config/mips/mips.md (<anonymous>, bswapsi2, bswapdi2): Fix empty
22480 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22482 * config/m68k/m68k.md (*zero_extend_inc, *zero_extend_dec,
22483 *zero_extendsidi2): Fix empty split condition.
22485 2021-06-05 Jeff Law <jeffreyalaw@gmail.com>
22487 * config/h8300/addsub.md: Fix split condition in define_insn_and_split
22489 * config/h8300/bitfield.md: Likewise.
22490 * config/h8300/combiner.md: Likewise.
22491 * config/h8300/divmod.md: Likewise.
22492 * config/h8300/extensions.md: Likewise.
22493 * config/h8300/jumpcall.md: Likewise.
22494 * config/h8300/movepush.md: Likewise.
22495 * config/h8300/multiply.md: Likewise.
22496 * config/h8300/other.md: Likewise.
22497 * config/h8300/shiftrotate.md: Likewise.
22498 * config/h8300/logical.md: Likewise. Fix split pattern to use
22499 code iterator that somehow slipped through.
22501 2021-06-04 Tobias Burnus <tobias@codesourcery.com>
22503 PR middle-end/100905
22504 * tree-nested.c (convert_nonlocal_omp_clauses,
22505 convert_local_omp_clauses): Handle OMP_CLAUSE_BIND.
22507 2021-06-04 Martin Sebor <msebor@redhat.com>
22509 PR middle-end/100732
22510 * gimple-fold.c (gimple_fold_builtin_sprintf): Avoid folding calls
22511 with either source or destination argument of invalid type.
22512 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Avoid checking
22513 calls with arguments of invalid type.
22515 2021-06-04 Martin Sebor <msebor@redhat.com>
22517 * attribs.c (init_attr_rdwr_indices): Use VLA bounds in the expected
22519 (attr_access::vla_bounds): Also handle VLA bounds.
22521 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
22523 * config/i386/predicates.md (GOT_memory_operand):
22524 Implement using match_code RTXes.
22525 (GOT32_symbol_operand): Ditto.
22527 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
22530 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
22532 (ix86_expand_vector_init_general): Ditto.
22533 Use SImode instead of word_mode for logic operations
22534 when GET_MODE_SIZE (mode) < UNITS_PER_WORD.
22535 (expand_vec_perm_even_odd_1): Assert that V2HI mode should be
22536 implemented by expand_vec_perm_1.
22537 (expand_vec_perm_broadcast_1): Assert that V2HI and V4HI modes
22538 should be implemented using standard shuffle patterns.
22539 (ix86_vectorize_vec_perm_const): Handle V2HImode. Add V4HI and
22540 V2HI modes to modes, implementable with shuffle for one operand.
22541 * config/i386/mmx.md (*punpckwd): New insn_and_split pattern.
22542 (*pshufw_1): New insn pattern.
22543 (*vec_dupv2hi): Ditto.
22544 (vec_initv2hihi): New expander.
22546 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
22548 * config/arm/vfp.md (no_literal_pool_df_immediate,
22549 no_literal_pool_sf_immediate): Fix empty split condition.
22551 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
22553 * config/i386/i386.md (*load_tp_x32_zext, *add_tp_x32_zext,
22554 *tls_dynamic_gnu2_combine_32): Fix empty split condition.
22555 * config/i386/sse.md (*<sse2_avx2>_pmovmskb_lt,
22556 *<sse2_avx2>_pmovmskb_zext_lt, *sse2_pmovmskb_ext_lt,
22557 *<sse4_1_avx2>_pblendvb_lt): Likewise.
22559 2021-06-04 Jakub Jelinek <jakub@redhat.com>
22562 * config/i386/i386-expand.c (ix86_expand_vector_init): Handle
22563 concatenation from half-sized modes with TImode elements.
22565 2021-06-04 Claudiu Zissulescu <claziss@synopsys.com>
22567 * config/arc/arc.c (arc_override_options): Disable millicode
22568 thunks when RF16 is on.
22570 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
22572 * config/rs6000/rs6000.h (PROMOTE_MODE): Remove.
22574 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
22576 * config/rs6000/rs6000-call.c (rs6000_promote_function_mode):
22577 Replace PROMOTE_MODE marco with its content.
22579 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
22581 * config/cris/cris.md (*addi_reload): Fix empty split condition.
22583 2021-06-03 Jim Wilson <jimw@sifive.com>
22585 * config.gcc (riscv*-*-*): If --with-riscv-attribute not used,
22586 turn it on for all riscv targets.
22588 2021-06-03 Uroš Bizjak <ubizjak@gmail.com>
22591 * config/i386/i386-expand.c (ix86_expand_vector_set):
22592 Handle V2HI and V4QI modes.
22593 (ix86_expand_vector_extract): Ditto.
22594 * config/i386/mmx.md (*pinsrw): New insn pattern.
22597 (*pextrw_zext): Ditto.
22599 (*pextrb_zext): Ditto.
22600 (vec_setv2hi): New expander.
22601 (vec_extractv2hihi): Ditto.
22602 (vec_setv4qi): Ditto.
22603 (vec_extractv4qiqi): Ditto.
22604 (vec_setv8qi): Enable only for TARGET_SSE4_1.
22605 (vec_extractv8qiqi): Ditto.
22607 2021-06-03 Aaron Sawdey <acsawdey@linux.ibm.com>
22609 * config/rs6000/genfusion.pl (gen_logical_addsubf): Fix input
22610 order to subf instruction.
22611 * config/rs6000/fusion.md: Regenerate.
22613 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22615 * calls.c (get_size_range): Use range_of_expr instead of
22616 determine_value_range.
22617 * tree-affine.c (expr_to_aff_combination): Same.
22618 * tree-data-ref.c (split_constant_offset): Same.
22619 * tree-vrp.c (determine_value_range_1): Remove.
22620 (determine_value_range): Remove.
22621 * tree-vrp.h (determine_value_range): Remove.
22623 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22625 * function-tests.c (test_ranges): Call gimple_range_tests.
22626 * gimple-range-cache.cc (ranger_cache::range_of_expr): Pass stmt
22628 * gimple-range.cc (fur_source::get_operand): Do not call
22629 get_tree_range or gimple_range_global.
22631 (get_tree_range): Move to value-query.cc.
22632 Call get_arith_expr_range.
22633 (gimple_ranger::range_of_expr): Add argument to get_tree_range.
22634 Include gimple-range-tests.cc.
22635 * gimple-range.h (fold_range): Add argument.
22636 (get_tree_range): Remove.
22637 * selftest.h (gimple_range_tests): New.
22638 * value-query.cc (global_range_query::range_of_expr): Add
22640 (range_query::get_tree_range): Move from gimple-range.cc.
22641 * value-query.h (class range_query): Add get_tree_range and
22642 get_arith_expr_range. Make fur_source a friend.
22643 * vr-values.c (vr_values::range_of_expr): Pass stmt to
22645 * gimple-range-tests.cc: New file.
22647 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22649 * gimple-range.cc (gimple_ranger::export_global_ranges): Call
22650 update_global_range.
22651 * value-query.cc (update_global_range): New.
22652 * value-query.h (update_global_range): New.
22654 2021-06-03 David Malcolm <dmalcolm@redhat.com>
22656 * diagnostic-show-locus.c (diagnostic_show_locus): Don't reject
22657 printing the same location twice if there are fix-it hints,
22658 multiple locations, or a label.
22660 2021-06-03 Andre Vieira <andre.simoesdiasvieira@arm.com>
22662 * tree-vect-loop.c (vect_transform_loop): Use main loop's various'
22663 thresholds to narrow the upper bound on epilogue iterations.
22665 2021-06-03 Christophe Lyon <christophe.lyon@linaro.org>
22667 * config/arm/mve.md (mve_vabsq_f<mode>): Use 'abs' instead of unspec.
22668 (mve_vabsq_s<mode>): Likewise.
22669 * config/arm/neon.md (abs<mode>2): Rename to neon_abs<mode>2.
22670 * config/arm/unspecs.md (VABSQ_F, VABSQ_S): Delete.
22671 * config/arm/vec-common.md (neg<mode>2): Rename to
22672 <absneg_str><mode>2.
22674 2021-06-03 Claudiu Zissulescu <claziss@synopsys.com>
22676 * common/config/arc/arc-common.c (arc_option_optimization_table):
22677 Remove malign-call.
22678 * config/arc/arc.c (arc_unalign_branch_p): Remove unused function.
22679 * config/arc/arc.h (TARGET_MIXED_CODE): Remove macro.
22680 (INDEX_REG_CLASS): Only refer to GENERAL_REGS.
22681 * config/arc/arc.md (abssi2_mixed): Remove pattern.
22682 * config/arc/arc.opt (munalign-prob-threshold): Mark it obsolete.
22683 (malign-call): Likewise.
22684 (mmixed-code): Likewise.
22685 * doc/invoke.texi (ARC): Update doc.
22687 2021-06-03 Martin Liska <mliska@suse.cz>
22689 * common.opt: Use proper Enum values.
22690 * opts.c (COVERAGE_SANITIZER_OPT): Remove.
22691 (parse_sanitizer_options): Handle only sanitizer_opts.
22692 (common_handle_option): Just assign value.
22694 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22697 * tree-inline.c (inline_forbidden_p): Remove test on return type.
22699 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22701 * dwarf2out.c (loc_list_from_tree_1) <FUNCTION_DECL>: Also generate
22702 DW_OP_GNU_variable_value referencing an existing DIE at file scope.
22703 (type_byte_size): Inline into...
22704 (add_byte_size_attribute): ...this and call add_scalar_info.
22706 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22708 * dwarf2out.c (mem_loc_descriptor) <UDIV>: Fix typo.
22709 (typed_binop_from_tree): New function.
22710 (loc_list_from_tree_1) <EXACT_DIV_EXPR>: For an unsigned type,
22711 turn a divide by a power of 2 into a shift.
22712 <CEIL_DIV_EXPR>: For an unsigned type, use a signed divide if the
22713 size of the mode is lower than DWARF2_ADDR_SIZE; otherwise, do a
22714 typed divide by calling typed_binop_from_tree.
22716 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22718 * dwarf2out.c (scompare_loc_descriptor): Fix head comment.
22719 (is_handled_procedure_type): Likewise.
22720 (struct loc_descr_context): Add strict_signedness field.
22721 (resolve_args_picking_1): Deal with DW_OP_[GNU_]deref_type,
22722 DW_OP_[GNU_]convert and DW_OP_[GNU_]reinterpret.
22723 (resolve_args_picking): Minor tweak.
22724 (function_to_dwarf_procedure): Initialize strict_signedness field.
22725 (type_byte_size): Likewise.
22726 (field_byte_offset): Likewise.
22727 (gen_descr_array_type_die): Likewise.
22728 (gen_variant_part): Likewise.
22729 (loc_list_from_tree_1) <CALL_EXPR>: Tidy up and set strict_signedness
22730 to true when a context is present before evaluating the arguments.
22731 <COND_EXPR>: Do not generate a useless comparison with zero.
22732 When dereferencing an address, if strict_signedness is true and the
22733 type is small and signed, use DW_OP_deref_type to do the dereference
22734 and then DW_OP_convert to convert back to the generic type.
22736 2021-06-03 Jakub Jelinek <jakub@redhat.com>
22739 * tree-inline.c (copy_tree_body_r): Handle iterators on
22740 OMP_CLAUSE_AFFINITY or OMP_CLAUSE_DEPEND.
22742 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
22744 * config/arc/arc.md (*bbit_di): Remove.
22746 2021-06-02 Christoph Muellner <cmuellner@gcc.gnu.org>
22748 PR rtl-optimization/100264
22749 * ree.c (get_sub_rtx): Ignore SET expressions without register
22750 destinations and remove assertion, as it is not valid anymore
22751 with this new behaviour.
22752 (merge_def_and_ext): Eliminate destination check for register
22753 as such SET expressions can't occur anymore.
22754 (combine_reaching_defs): Likewise.
22756 2021-06-02 Jakub Jelinek <jakub@redhat.com>
22759 * config/xtensa/xtensa.h (LEAF_REG_REMAP): Cast REGNO to int to avoid
22760 -Wtype-limits warnings.
22761 (DWARF_FRAME_REGISTER): Rewrite into ternary operator with addition
22762 in operands to avoid -Wsign-compare warnings.
22764 2021-06-02 Pat Haugen <pthaugen@linux.ibm.com>
22766 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): Use
22769 2021-06-02 Vineet Gupta <vgupta@synopsys.com>
22771 * config/arc/arc.h (TARGET_CPU_DEFAULT): Change to hs38_linux.
22773 2021-06-02 Ilya Leoshkevich <iii@linux.ibm.com>
22775 * config/s390/s390.md(*ashrdi3_31<setcc><cconly>): Use a single
22777 * config/s390/subst.md(cconly_subst): Use a single constraint
22778 in (match_scratch).
22780 2021-06-02 Martin Liska <mliska@suse.cz>
22782 * ipa-icf.h: Use auto_vec for memory_access_types.
22784 2021-06-02 Jeff Law <jeffreyalaw@gmail.com>
22786 * config/h8300/h8300-protos.h (compute_a_shift_length): Drop unused
22787 argument from prototype.
22788 (output_logical_op): Add rtx_code argument.
22789 (compute_logical_op_length): Likewise.
22790 * config/h8300/h8300.c (h8300_and_costs): Pass additional argument
22791 to compute_a_shift_length.
22792 (output_logical_op); New argument with the rtx code rather than
22793 extracting it from an operand. Handle QImode too.
22794 (compute_logical_op_length): Similary.
22795 (compute_a_shift_length): Drop unused argument.
22796 * config/h8300/h8300.md (logicals): New code iterator.
22797 * config/h8300/logical.md (<code><mode>3 expander): Combine
22798 the "and" expander with the "ior"/"xor" expander.
22799 (bclr<mode>msx): Combine the QI/HI mode patterns.
22800 (<logical><mode>3 insns): Use code iterator rather than match_operator.
22801 Handle QImode as well. Update call to output_logical_op and
22802 compute_logical_op_length to pass in rtx_code
22803 Fix split condition on all define_insn_and_split patterns.
22804 (one_cmpl<mode>2<cczn>): Use <cczn> to support both clobbering
22805 the flags and setting ZN via existing define_subst.
22806 * config/h8300/shiftrotate.md: Drop unused argument from
22807 calls to compute_a_shift_length.
22808 Signed-off-by: Jeff Law <jeffreyalaw@gmail.com>
22810 2021-06-01 Andrew Pinski <apinski@marvell.com>
22812 PR tree-optimization/25290
22813 * tree-ssa-phiopt.c (match_simplify_replacement):
22815 (tree_ssa_phiopt_worker): Use match_simplify_replacement.
22816 (two_value_replacement): Change the comment about
22817 conditional_replacement.
22818 (conditional_replacement): Delete.
22820 2021-06-01 Andrew Pinski <apinski@marvell.com>
22822 PR tree-optimization/95481
22823 * tree-tailcall.c (find_tail_calls): Handle empty typed
22826 2021-06-01 Andrew Pinski <apinski@marvell.com>
22828 * gimplify.c (zero_sized_field_decl): Delete
22829 (zero_sized_type): Delete
22830 (gimplify_init_ctor_eval): Use is_empty_type instead
22831 of zero_sized_field_decl.
22832 (gimplify_modify_expr): Use is_empty_type instead of
22835 2021-06-01 Jason Merrill <jason@redhat.com>
22838 * tree.h (CALL_FROM_NEW_OR_DELETE_P): Adjust comment.
22840 2021-06-01 Jason Merrill <jason@redhat.com>
22843 * diagnostic.h (warning_enabled_at): Declare.
22844 * diagnostic.c (diagnostic_enabled): Factor out from...
22845 (diagnostic_report_diagnostic): ...here.
22846 (warning_enabled_at): New.
22848 2021-06-01 Aldy Hernandez <aldyh@redhat.com>
22850 * gimple-ssa-evrp.c: Enable exporting of global ranges.
22852 2021-06-01 Martin Liska <mliska@suse.cz>
22855 * doc/invoke.texi: Mention that -fgcse-after-reload
22856 is enabled with -O3.
22858 2021-06-01 liuhongt <hongtao.liu@intel.com>
22860 PR tree-optimization/98365
22861 * tree-if-conv.c (strip_nop_cond_scalar_reduction): New function.
22862 (is_cond_scalar_reduction): Handle nop_expr in cond scalar reduction.
22863 (convert_scalar_cond_reduction): Ditto.
22864 (predicate_scalar_phi): Ditto.
22866 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22868 PR tree-optimization/100781
22869 * gimple-range-cache.cc (ranger_cache::ranger_cache): Enable new
22870 value calculation by default.
22871 (ranger_cache::enable_new_values): New.
22872 (ranger_cache::disable_new_values): New.
22873 (ranger_cache::push_poor_value): Check if new values are allowed.
22874 * gimple-range-cache.h (class ranger_cache): New member/methods.
22875 * gimple-range.cc (gimple_ranger::range_of_expr): Check for debug
22876 statement, and disable/renable new value calculation.
22878 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22880 * gimple-range-cache.cc (ranger_cache::ssa_range_in_bb): Delete.
22881 (ranger_cache::range_of_def): New.
22882 (ranger_cache::entry_range): New.
22883 (ranger_cache::exit_range): New.
22884 (ranger_cache::range_of_expr): Adjust.
22885 (ranger_cache::range_on_edge): Adjust.
22886 (ranger_cache::propagate_cache): Call exit_range directly.
22887 * gimple-range-cache.h (class ranger_cache): Adjust.
22889 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22891 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust for
22892 gori_compute being a member rather than base class.
22893 dervied call to member call.
22894 (ranger_cache::dump): No longer dump gori_map.
22895 (ranger_cache::dump_bb): New.
22896 (ranger_cache::get_non_stale_global_range): Adjust for gori_compute
22897 being a member rather than base class.
22898 (ranger_cache::set_global_range): Ditto.
22899 (ranger_cache::ssa_range_in_bb): Ditto.
22900 (ranger_cache::range_of_expr): New.
22901 (ranger_cache::range_on_edge): New.
22902 (ranger_cache::block_range): Adjust for gori_computes. Debug changes.
22903 (ranger_cache::propagate_cache): Adjust debugging output.
22904 (ranger_cache::fill_block_cache): Adjust for gori_computes. Debug
22906 * gimple-range-cache.h (class ranger_cache): Make gori_compute a
22907 member, and inherit from range_query instead.
22908 (ranger_cache::dump_bb): New. split from dump.
22909 * gimple-range-gori.cc (gori_compute::ssa_range_in_bb): Delete.
22910 (gori_compute::expr_range_at_stmt): Delete.
22911 (gori_compute::compute_name_range_op): Delete.
22912 (gori_compute::compute_operand_range_switch): Add fur_source.
22913 (gori_compute::compute_operand_range): Add fur_source param, inline
22914 old compute_name_range_op and optimize_logical_operands.
22915 (struct tf_range): Delete.
22916 (gori_compute::logical_combine): Adjust
22917 (gori_compute::optimize_logical_operands): Delete.
22918 (gori_compute::compute_logical_operands_in_chain): Delete.
22919 (gori_compute::compute_logical_operands): Adjust.
22920 (gori_compute::compute_operand1_range): Adjust to fur_source.
22921 (gori_compute::compute_operand2_range): Ditto.
22922 (gori_compute::compute_operand1_and_operand2_range): Ditto.
22923 (gori_compute::outgoing_edge_range_p): Add range_query parameter,
22924 and adjust to fur_source.
22925 * gimple-range-gori.h (class gori_compute): Simplify and adjust to
22926 range_query and fur_source.
22927 * gimple-range.cc (gimple_ranger::range_on_edge): Query range_on_edge
22928 from the ranger_cache..
22929 (gimple_ranger::fold_range_internal): Adjust to base class change of
22931 (gimple_ranger::dump_bb): Adjust dump.
22932 * gimple-range.h (gimple_ranger):export gori computes object.
22934 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22936 PR tree-optimization/100774
22937 * gimple-range-cache.cc (ranger_cache::get_non_stale_global_range):
22938 Constant values are also not stale.
22939 (ranger_cache::set_global_range): Range invariant values should also
22940 have the correct timestamp.
22942 2021-05-31 Martin Liska <mliska@suse.cz>
22944 * tree-streamer-in.c (unpack_ts_function_decl_value_fields):
22945 Unpack FUNCTION_DECL_DECL_TYPE.
22946 * tree-streamer-out.c (pack_ts_function_decl_value_fields):
22947 Stream FUNCTION_DECL_DECL_TYPE instead of
22948 DECL_IS_OPERATOR_NEW_P.
22949 * tree.h (set_function_decl_type): Use FUNCTION_DECL_DECL_TYPE
22951 (DECL_IS_OPERATOR_NEW_P): Likewise.
22952 (DECL_IS_OPERATOR_DELETE_P): Likewise.
22953 (DECL_LAMBDA_FUNCTION_P): Likewise.
22955 2021-05-31 Richard Biener <rguenther@suse.de>
22958 * internal-fn.c (expand_SHUFFLEVECTOR): Define.
22959 * internal-fn.def (SHUFFLEVECTOR): New.
22960 * internal-fn.h (expand_SHUFFLEVECTOR): Declare.
22961 * doc/extend.texi: Document __builtin_shufflevector.
22963 2021-05-31 Peter Bergner <bergner@linux.ibm.com>
22966 * config/rs6000/predicates.md(mma_assemble_input_operand): Allow
22967 indexed form addresses.
22969 2021-05-29 Jeff Law <jlaw@tachyum.com>
22971 * config/h8300/h8300.c (h8300_emit_stack_adjustment): Drop unused
22972 parameter. Call callers fixed.
22974 (output_plussi): Add FALLTHRU markers.
22975 (h8300_shift_needs_scratch_p): Add gcc_unreachable marker.
22977 2021-05-29 Jakub Jelinek <jakub@redhat.com>
22979 PR middle-end/99928
22980 * gimplify.c (gimplify_scan_omp_clauses): For taskloop simd
22981 combined with parallel, make sure to add shared clause to
22982 parallel for explicit linear clause.
22984 2021-05-29 Aldy Hernandez <aldyh@redhat.com>
22986 PR tree-optimization/100787
22987 * gimple-ssa-evrp.c: Disable exporting of global ranges.
22989 2021-05-28 Jason Merrill <jason@redhat.com>
22991 * tree-iterator.h (struct tree_stmt_iterator): Add operator++,
22992 operator--, operator*, operator==, and operator!=.
22993 (class tsi_range): New.
22995 2021-05-28 Richard Biener <rguenther@suse.de>
22997 PR tree-optimization/100778
22998 * tree-vect-slp.c (vect_build_slp_tree_1): Prevent possibly
22999 trapping ops in different BBs.
23001 2021-05-28 Richard Biener <rguenther@suse.de>
23004 * tree-inline.c (copy_bb): When processing __builtin_va_arg_pack
23005 copy fntype from original call.
23007 2021-05-28 Martin Liska <mliska@suse.cz>
23009 PR gcov-profile/100751
23010 * doc/gcov.texi: Revert partially a hunk that was wrong.
23012 2021-05-28 Cooper Qu <cooper.qu@linux.alibaba.com>
23014 * config/csky/csky-linux-elf.h (HAVE_sync_compare_and_swapqi):
23016 (HAVE_sync_compare_and_swaphi): Likewise.
23017 (HAVE_sync_compare_and_swapsi): Likewise.
23019 2021-05-28 Jakub Jelinek <jakub@redhat.com>
23021 PR middle-end/99928
23022 * tree.h (OMP_CLAUSE_MAP_IMPLICIT): Define.
23024 2021-05-28 Tobias Burnus <tobias@codesourcery.com>
23026 * gimplify.c (gimplify_omp_affinity): New.
23027 (gimplify_scan_omp_clauses): Call it; remove affinity clause afterwards.
23028 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_AFFINITY.
23029 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_AFFINITY.
23030 * tree.c (omp_clause_num_ops, omp_clause_code_name): Add clause.
23031 (walk_tree_1): Handle OMP_CLAUSE_AFFINITY.
23033 2021-05-28 Joern Rennecke <joern.rennecke@riscy-ip.com>
23034 Richard Biener <rguenther@suse.de>
23036 * match.pd <popcount & / + pattern matching>:
23037 When generating popcount directly fails, try doing it in two halves.
23039 2021-05-28 Bernd Edlinger <bernd.edlinger@hotmail.de>
23041 * Makefile.in (generated_files): Add gimple-match.c and
23044 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
23046 * gensupport.c (alter_predicate_for_insn): Handle MATCH_DUP.
23048 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
23050 * gensupport.c (alter_constraints): Add MATCH_SCRATCH case.
23052 2021-05-28 Kewen Lin <linkw@linux.ibm.com>
23054 PR tree-optimization/99398
23055 * tree-ssa-forwprop.c (simplify_permutation): Optimize some cases
23056 where the fed operands are CTOR/CST and propagated through
23057 VIEW_CONVERT_EXPR. Call vec_perm_indices::new_shrunk_vector.
23058 * vec-perm-indices.c (vec_perm_indices::new_shrunk_vector): New
23060 * vec-perm-indices.h (vec_perm_indices::new_shrunk_vector): New
23063 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
23065 * config/i386/mmx.md (addv2sf3): Do not call
23066 ix86_fixup_binary_operands_no_copy.
23069 (<smaxmin:code>v2sf3): Ditto.
23070 (<plusminus:insn><MMXMODEI:mode>3): Ditto.
23071 (<plusminus:insn><VI_32:mode>3): Remove expander.
23072 (<plusminus:insn><VI_32:mode>3): Rename from
23073 "*<plusminus:insn><VI_32:mode>3".
23074 (mulv4hi): Do not call ix86_fixup_binary_operands_no_copy.
23075 (mulv2hi3): Remove expander.
23076 (mulv2hi3): Rename from *mulv2hi3.
23077 (<s>mulv2hi3_highpart): Remove expander.
23078 (<s>mulv2hi3_highpart): Rename from *<s>mulv2hi3_highpart.
23079 (<smaxmin:code><MMXMODE14:mode>3): Rename from
23080 "*mmx_<smaxmin:code><MMXMODE14:mode>3".
23081 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Remove expander.
23082 (SMAXMIN_MMXMODEI): Remove mode iterator.
23083 (<smaxmin:code>v4hi3): New expander.
23084 (<smaxmin:code>v4qi3): Rename from *<smaxmin:code>v4qi3.
23085 (<smaxmin:code>v2hi3): Rename from *<smaxmin:code>v2hi3.
23086 (<smaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
23087 (SMAXMIN_VI_32): Remove mode iterator.
23088 (<umaxmin:code><MMXMODE24:mode>3): Rename from
23089 "*mmx_<umaxmin:code><MMXMODE24:mode>3".
23090 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Remove expander.
23091 (UMAXMIN_MMXMODEI): Remove mode iterator.
23092 (<umaxmin:code>v8qi3): New expander.
23093 (<umaxmin:code>v4qi3): Rename from *<umaxmin:code>v4qi3.
23094 (<umaxmin:code>v2hi3): Rename from *<umaxmin:code>v2hi3.
23095 (<umaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
23096 (UMAXMIN_VI_32): Remove mode iterator.
23097 (<any_shift:insn>v2hi3): Remove expander.
23098 (<any_shift:insn>v2hi3): Rename from *<any_shift:insn>v2hi3.
23099 (<any_logic:code><MMXMODEI:mode>3): Do not call
23100 ix86_fixup_binary_operands_no_copy.
23101 (<any_logic:code><VI_32:mode>3): Remove expander.
23102 (<any_logic:code><VI_32:mode>3): Rename from
23103 "*<any_logic:code><VI_32:mode>3".
23104 (uavg<mode>3_ceil): Do not call ix86_fixup_binary_operands_no_copy.
23105 * config/i386/sse.md (div<VF2:mode>3): Do not call
23106 ix86_fixup_binary_operands_no_copy.
23107 (div<VF1:mode>3): Ditto.
23108 (<maxmin:code><VI8_AVX2_AVX512F:mode>3): Ditto.
23109 (smulhrsv4hi3): Ditto.
23110 (smulhrsv2hi3): Ditto.
23112 2021-05-27 Martin Sebor <msebor@redhat.com>
23114 * ggc.h (gt_ggc_mx): Add overloads for all integers.
23116 * hash-map.h (class hash_map): Add pch_nx_helper overloads for all
23118 (hash_map::operator==): New function.
23120 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
23123 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
23124 For TARGET_XOP bypass SSE comparisons for all supported vector modes.
23125 * config/i386/mmx.md (*xop_maskcmp<MMXMODEI:mode>3): New insn pattern.
23126 (*xop_maskcmp<VI_32:mode>3): Ditto.
23127 (*xop_maskcmp_uns<MMXMODEI:mode>3): Ditto.
23128 (*xop_maskcmp_uns<VI_32:mode>3): Ditto.
23130 2021-05-27 Richard Earnshaw <rearnsha@arm.com>
23133 * config/arm/arm.c (arm_configure_build_target): Remove parameter
23134 opts_set, directly check opts parameters for being non-null.
23135 (arm_option_restore): Update call to arm_configure_build_target.
23136 (arm_option_override): Likewise.
23137 (arm_can_inline_p): Likewise.
23138 (arm_valid_target_attribute_tree): Likewise.
23139 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
23140 * config/arm/arm-protos.h (arm_configure_build_target): Adjust
23143 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
23145 * vr-values.c (simplify_conversion_using_ranges): Use
23146 get_range_query instead of get_global_range_query.
23148 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
23150 * gimple-range.cc (get_range_global): Move to value-query.cc.
23151 (gimple_range_global): Same.
23152 (get_global_range_query): Same.
23153 (global_range_query::range_of_expr): Same.
23154 * gimple-range.h (class global_range_query): Move to
23156 (gimple_range_global): Same.
23157 * tree-ssanames.c (get_range_info): Move to value-query.cc.
23158 (get_ptr_nonnull): Same.
23159 * tree-ssanames.h (get_range_info): Remove.
23160 (get_ptr_nonnull): Remove.
23161 * value-query.cc (get_ssa_name_range_info): Move from
23163 (get_ssa_name_ptr_info_nonnull): Same.
23164 (get_range_global): Move from gimple-range.cc.
23165 (gimple_range_global): Same.
23166 (get_global_range_query): Same.
23167 (global_range_query::range_of_expr): Same.
23168 * value-query.h (class global_range_query): Move from
23170 (gimple_range_global): Same.
23172 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
23175 * config/i386/mmx.md (uavgv4qi3_ceil): New insn pattern.
23176 (uavgv2hi3_ceil): Ditto.
23178 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
23181 * doc/extend.texi (scalar_storage_order): Rephrase slightly.
23183 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23185 * tree-ssanames.c (get_range_info): Merge both copies of
23186 get_range_info into one that works with irange.
23187 * tree-ssanames.h (get_range_info): Remove version that works on
23190 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23192 * builtins.c (check_nul_terminated_array): Convert to get_range_query.
23193 (expand_builtin_strnlen): Same.
23194 (determine_block_size): Same.
23195 * fold-const.c (expr_not_equal_to): Same.
23196 * gimple-fold.c (size_must_be_zero_p): Same.
23197 * gimple-match-head.c: Include gimple-range.h.
23198 * gimple-pretty-print.c (dump_ssaname_info): Convert to get_range_query.
23199 * gimple-ssa-warn-restrict.c
23200 (builtin_memref::extend_offset_range): Same.
23201 * graphite-sese-to-poly.c (add_param_constraints): Same.
23202 * internal-fn.c (get_min_precision): Same.
23203 * ipa-fnsummary.c (set_switch_stmt_execution_predicate): Same.
23204 * ipa-prop.c (ipa_compute_jump_functions_for_edge): Same.
23206 * tree-data-ref.c (split_constant_offset): Same.
23207 (dr_step_indicator): Same.
23208 * tree-dfa.c (get_ref_base_and_extent): Same.
23209 * tree-scalar-evolution.c (iv_can_overflow_p): Same.
23210 * tree-ssa-loop-niter.c (refine_value_range_using_guard): Same.
23211 (determine_value_range): Same.
23212 (record_nonwrapping_iv): Same.
23213 (infer_loop_bounds_from_signedness): Same.
23214 (scev_var_range_cant_overflow): Same.
23215 * tree-ssa-phiopt.c (two_value_replacement): Same.
23216 * tree-ssa-pre.c (insert_into_preds_of_block): Same.
23217 * tree-ssa-reassoc.c (optimize_range_tests_to_bit_test): Same.
23218 * tree-ssa-strlen.c (handle_builtin_stxncpy_strncat): Same.
23220 (dump_strlen_info): Same.
23221 (set_strlen_range): Same.
23222 (maybe_diag_stxncpy_trunc): Same.
23223 (get_len_or_size): Same.
23224 (handle_integral_assign): Same.
23225 * tree-ssa-structalias.c (find_what_p_points_to): Same.
23226 * tree-ssa-uninit.c (find_var_cmp_const): Same.
23227 * tree-switch-conversion.c (bit_test_cluster::emit): Same.
23228 * tree-vect-patterns.c (vect_get_range_info): Same.
23229 (vect_recog_divmod_pattern): Same.
23230 * tree-vrp.c (intersect_range_with_nonzero_bits): Same.
23231 (register_edge_assert_for_2): Same.
23232 (determine_value_range_1): Same.
23233 * tree.c (get_range_pos_neg): Same.
23234 * vr-values.c (vr_values::get_lattice_entry): Same.
23235 (vr_values::update_value_range): Same.
23236 (simplify_conversion_using_ranges): Same.
23238 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23240 * gimple-ssa-warn-alloca.c (alloca_call_type): Use
23241 get_range_query instead of query argument.
23242 (pass_walloca::execute): Enable and disable global ranger.
23244 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23246 * gimple-ssa-evrp.c (rvrp_folder::rvrp_folder): Call
23248 (rvrp_folder::~rvrp_folder): Call disable_ranger.
23249 (hybrid_folder::hybrid_folder): Call enable_ranger.
23250 (hybrid_folder::~hybrid_folder): Call disable_ranger.
23252 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23254 * function.c (allocate_struct_function): Set cfun->x_range_query.
23255 * function.h (struct function): Declare x_range_query.
23256 (get_range_query): New.
23257 (get_global_range_query): New.
23258 * gimple-range-cache.cc (ssa_global_cache::ssa_global_cache):
23259 Remove call to safe_grow_cleared.
23260 * gimple-range.cc (get_range_global): New.
23261 (gimple_range_global): Move from gimple-range.h.
23262 (get_global_range_query): New.
23263 (global_range_query::range_of_expr): New.
23264 (enable_ranger): New.
23265 (disable_ranger): New.
23266 * gimple-range.h (gimple_range_global): Move to gimple-range.cc.
23267 (class global_range_query): New.
23268 (enable_ranger): New.
23269 (disable_ranger): New.
23270 * gimple-ssa-evrp.c (evrp_folder::~evrp_folder): Rename
23271 dump_all_value_ranges to dump.
23272 * tree-vrp.c (vrp_prop::finalize): Same.
23273 * value-query.cc (range_query::dump): New.
23274 * value-query.h (range_query::dump): New.
23275 * vr-values.c (vr_values::dump_all_value_ranges): Rename to...
23276 (vr_values::dump): ...this.
23277 * vr-values.h (class vr_values): Rename dump_all_value_ranges to
23278 dump and make virtual.
23280 2021-05-26 Uroš Bizjak <ubizjak@gmail.com>
23282 * config/i386/i386.c (ix86_autovectorize_vector_modes):
23283 Add V4QImode and V16QImode for TARGET_SSE2.
23284 * doc/sourcebuild.texi (Vector-specific attributes):
23285 Add vect64 and vect32 description.
23287 2021-05-26 Bernd Edlinger <bernd.edlinger@hotmail.de>
23289 * gimple-range-gori.cc (range_def_chain::register_dependency):
23290 Resize m_def_chain when needed.
23292 2021-05-26 Christophe Lyon <christophe.lyon@linaro.org>
23294 * config/arm/mve.md (mve_vaddvq_<supf><mode>): Prefix with '@'.
23295 * config/arm/neon.md (reduc_plus_scal_<mode>): Move to ..
23296 * config/arm/vec-common.md: .. here. Add support for MVE.
23298 2021-05-26 Jakub Jelinek <jakub@redhat.com>
23300 * config/epiphany/epiphany.c (epiphany_print_operand_address): Remove
23302 * config/microblaze/microblaze.c (microblaze_legitimize_address,
23304 microblaze_option_override, print_operand): Likewise.
23305 * config/microblaze/microblaze.md (call_internal_plt,
23306 call_value_intern_plt, call_value_intern): Likewise.
23307 * config/arm/aout.h (ASM_OUTPUT_ALIGN): Likewise.
23308 * config/iq2000/iq2000.md (call_internal1, call_value_internal1,
23309 call_value_multiple_internal1): Likewise.
23310 * config/bfin/bfin.c (symbolic_reference_mentioned_p): Likewise.
23312 2021-05-26 Jan-Benedict Glaw <jbglaw@lug-owl.de>
23314 * config/arc/arc.c (arc_address_cost, arc_print_operand_address,
23315 arc_ccfsm_advance, symbolic_reference_mentioned_p,
23316 arc_raw_symbolic_reference_mentioned_p): Remove register
23319 2021-05-26 Jakub Jelinek <jakub@redhat.com>
23322 * omp-low.c: Include omp-offload.h.
23323 (create_omp_child_function): If current_function_decl has
23324 "omp declare target" attribute and is_gimple_omp_offloaded,
23325 remove that attribute from the copy of attribute list and
23326 add "omp target entrypoint" attribute instead.
23327 (lower_omp_target): Mark .omp_data_sizes.* and .omp_data_kinds.*
23328 variables for offloading if in omp_maybe_offloaded_ctx.
23329 * omp-offload.c (pass_omp_target_link::execute): Nullify second
23330 argument to GOMP_target_data_ext in offloaded code.
23332 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23334 * config/csky/csky.c (csky_can_change_mode_class): Delete.
23335 For csky, HF/SF mode use the low bits of VREGS.
23337 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
23339 * gimplify.c (gimplify_decl_expr): Do not clear TREE_READONLY on a
23340 DECL which is a reference for OMP.
23342 2021-05-26 Martin Liska <mliska@suse.cz>
23344 PR gcov-profile/100751
23345 * doc/gcov.texi: Document that __gcov_dump can be called just
23346 once and that __gcov_reset resets run-time counters.
23348 2021-05-26 Martin Liska <mliska@suse.cz>
23350 * doc/install.texi: Port relevant part from install-old.texi
23351 and re-generate list of CPUs and systems.
23353 2021-05-26 Martin Liska <mliska@suse.cz>
23355 * Makefile.in: Remove it.
23356 * doc/include/fdl.texi: Update next/previous chapters.
23357 * doc/install.texi: Likewise.
23358 * doc/install-old.texi: Removed.
23360 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23362 * config/csky/csky.c (ck810_legitimate_index_p): Support
23363 "base + index" with DF mode.
23364 * config/csky/constraints.md ("Y"): New constraint for memory operands
23365 without index register.
23366 * config/csky/csky_insn_fpuv2.md (fpuv3_movdf): Use "Y" instead of "m"
23367 when mov between memory and general registers, and lower their priority.
23368 * config/csky/csky_insn_fpuv3.md (fpuv2_movdf): Likewise.
23370 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23372 * config/csky/csky.c (TARGET_PROMOTE_PROTOTYPES): Delete.
23374 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23376 * config/csky/csky.md (untyped_call): Emit clobber for return
23377 registers to mark them used.
23379 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23381 * config/csky/csky.md (cskyv2_sextend_ldbs): New.
23383 2021-05-26 Andrew Pinski <apinski@marvell.com>
23385 * match.pd (x < 0 ? ~y : y): New patterns.
23387 2021-05-26 Andrew Pinski <apinski@marvell.com>
23389 * match.pd (A?CST1:CST2): Add simplifcations for A?0:+-1, A?+-1:0,
23390 A?POW2:0 and A?0:POW2.
23392 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23394 * gimple-range-gori.cc (class logical_stmt_cache): Delete
23395 (logical_stmt_cache::logical_stmt_cache ): Delete.
23396 (logical_stmt_cache::~logical_stmt_cache): Delete.
23397 (logical_stmt_cache::cache_entry::dump): Delete.
23398 (logical_stmt_cache::get_range): Delete.
23399 (logical_stmt_cache::cached_name ): Delete.
23400 (logical_stmt_cache::same_cached_name): Delete.
23401 (logical_stmt_cache::cacheable_p): Delete.
23402 (logical_stmt_cache::slot_diagnostics ): Delete.
23403 (logical_stmt_cache::dump): Delete.
23404 (gori_compute_cache::gori_compute_cache): Delete.
23405 (gori_compute_cache::~gori_compute_cache): Delete.
23406 (gori_compute_cache::compute_operand_range): Delete.
23407 (gori_compute_cache::cache_stmt): Delete.
23408 * gimple-range-gori.h (gori_compute::compute_operand_range): Remove
23410 (class gori_compute_cache): Delete.
23412 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23414 * gimple-range.cc (fold_using_range::range_of_range_op): Use m_gori
23416 (fold_using_range::range_of_address): Adjust.
23417 (fold_using_range::range_of_phi): Adjust.
23418 * gimple-range.h (class fur_source): Adjust.
23419 (fur_source::fur_source): Adjust.
23421 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23423 * gimple-range-gori.cc (gori_compute::expr_range_at_stmt): Rename
23424 from expr_range_in_bb and adjust.
23425 (gori_compute::compute_name_range_op): Adjust.
23426 (gori_compute::optimize_logical_operands): Adjust.
23427 (gori_compute::compute_logical_operands_in_chain): Adjust.
23428 (gori_compute::compute_operand1_range): Adjust.
23429 (gori_compute::compute_operand2_range): Adjust.
23430 (ori_compute_cache::cache_stmt): Adjust.
23431 * gimple-range-gori.h (gori_compute): Rename prototype.
23433 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23435 * gimple-range.cc (gimple_ranger::range_of_expr): Non-null should be
23436 checked only after range_of_stmt, not range_on_entry.
23437 (gimple_ranger::range_on_entry): Check for non-null in any
23438 predecessor block, if it is not already non-null.
23439 (gimple_ranger::range_on_exit): DOnt check for non-null after
23440 range on entry call.
23441 (gimple_ranger::dump_bb): New. Split from dump.
23442 (gimple_ranger::dump): Adjust.
23443 * gimple-range.h (class gimple_ranger): Adjust.
23445 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23447 * gimple-range-cache.cc (struct range_timestamp): Delete.
23448 (class temporal_cache): Adjust.
23449 (temporal_cache::get_timestamp): Delete.
23450 (temporal_cache::set_dependency): Delete.
23451 (temporal_cache::temporal_value): Adjust.
23452 (temporal_cache::current_p): Take dependencies as params.
23453 (temporal_cache::set_timestamp): Adjust.
23454 (temporal_cache::set_always_current): Adjust.
23455 (ranger_cache::get_non_stale_global_range): Adjust.
23456 (ranger_cache::register_dependency): Delete.
23457 * gimple-range-cache.h (class range_cache): Adjust.
23459 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23461 * gimple-range-gori.cc (range_def_chain::range_def_chain): init
23463 (range_def_chain::~range_def_chain): Dispose of obstack rather than
23464 each individual bitmap.
23465 (range_def_chain::set_import): New.
23466 (range_def_chain::get_imports): New.
23467 (range_def_chain::chain_import_p): New.
23468 (range_def_chain::register_dependency): Rename from build_def_chain
23470 (range_def_chain::def_chain_in_bitmap_p): New.
23471 (range_def_chain::add_def_chain_to_bitmap): New.
23472 (range_def_chain::has_def_chain): Just check first depenedence.
23473 (range_def_chain::get_def_chain): Process imports, use generic
23474 register_dependency routine.
23475 (range_def_chain::dump): New.
23476 (gori_map::gori_map): Allocate import list.
23477 (gori_map::~gori_map): Release imports.
23478 (gori_map::exports): Check for past allocated block size.
23479 (gori_map::imports): New.
23480 (gori_map::def_chain_in_export_p): Delete.
23481 (gori_map::is_import_p): New.
23482 (gori_map::maybe_add_gori): Handle imports.
23483 (gori_map::dump): Adjust output, add imports.
23484 (gori_compute::has_edge_range_p): Remove def_chain_in_export call.
23485 (gori_export_iterator::gori_export_iterator): New.
23486 (gori_export_iterator::next): New.
23487 (gori_export_iterator::get_name): New.
23488 * gimple-range-gori.h (range_def_chain): Add imports and direct
23489 dependecies via struct rdc.
23490 (range_def_chain::depend1): New.
23491 (range_def_chain::depend2): New.
23492 (class gori_map): Adjust.
23493 (FOR_EACH_GORI_IMPORT_NAME): New.
23494 (FOR_EACH_GORI_EXPORT_NAME): New.
23495 (class gori_export_iterator): New.
23497 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23499 * gimple-range-cache.cc (ranger_cache::ranger_cache): Move initial
23500 export cache filling to here.
23501 * gimple-range-gori.cc (gori_compute::gori_compute) : From Here.
23503 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23505 * gimple-range-gori.cc (range_def_chain): Move to gimple-range-gori.h.
23506 (gori_map): Move to gimple-range-gori.h.
23507 (gori_compute::gori_compute): Adjust.
23508 (gori_compute::~gori_compute): Delete.
23509 (gori_compute::compute_operand_range_switch): Adjust.
23510 (gori_compute::compute_operand_range): Adjust.
23511 (gori_compute::compute_logical_operands): Adjust.
23512 (gori_compute::has_edge_range_p ): Adjust.
23513 (gori_compute::set_range_invariant): Delete.
23514 (gori_compute::dump): Adjust.
23515 (gori_compute::outgoing_edge_range_p): Adjust.
23516 * gimple-range-gori.h (class range_def_chain): Relocate here.
23517 (class gori_map): Relocate here.
23518 (class gori_compute): Inherit from gori_map, and adjust.
23520 2021-05-25 Aldy Hernandez <aldyh@redhat.com>
23522 * value-range.cc (range_tests_legacy): Use
23523 build_nonstandard_integer_type instead of int and short.
23525 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
23527 * gimplify.c (gimplify_decl_expr): Clear TREE_READONLY on the DECL
23528 when really creating an initialization statement for it.
23530 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
23532 * tree-inline.c (setup_one_parameter): Fix thinko in new condition.
23534 2021-05-25 Kito Cheng <kito.cheng@sifive.com>
23536 * config/riscv/riscv.h (ASM_SPEC): Pass -mno-relax.
23538 2021-05-25 Martin Liska <mliska@suse.cz>
23540 PR tree-optimization/92860
23542 * optc-save-gen.awk: Remove exceptions.
23544 2021-05-25 Martin Liska <mliska@suse.cz>
23546 * asan.h (sanitize_coverage_p): New function.
23547 * doc/extend.texi: Document it.
23548 * fold-const.c (fold_range_test): Use sanitize_flags_p
23549 instead of flag_sanitize_coverage.
23550 (fold_truth_andor): Likewise.
23551 * sancov.c: Likewise.
23552 * tree-ssa-ifcombine.c (ifcombine_ifandif): Likewise.
23553 * ipa-inline.c (sanitize_attrs_match_for_inline_p): Handle
23554 -fsanitize-coverage when inlining.
23556 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
23558 * config/csky/csky-modes.def : Fix copyright.
23560 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
23562 * config/csky/csky-modes.def : Amend copyright.
23563 * config/csky/csky_insn_fpuv2.md : Likewise.
23564 * config/csky/csky_insn_fpuv3.md : Likewise.
23566 2021-05-25 Richard Biener <rguenther@suse.de>
23568 PR middle-end/100727
23569 * calls.c (initialize_argument_information): Explicitely test
23570 for WITH_SIZE_EXPR.
23571 * gimple-expr.c (mark_addressable): Skip outer WITH_SIZE_EXPR.
23573 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23575 * config/csky/csky.h (FRAME_POINTER_REGNUM): Use
23576 HARD_FRAME_POINTER_REGNUM and FRAME_POINTER_REGNUM instead of
23577 the signle definition. The signle definition may not work well
23578 at simplify_subreg_regno().
23579 (HARD_FRAME_POINTER_REGNUM): New.
23580 (ELIMINABLE_REGS): Add for HARD_FRAME_POINTER_REGNUM.
23581 * config/csky/csky.c (get_csky_live_regs, csky_can_eliminate,
23582 csky_initial_elimination_offset, csky_expand_prologue,
23583 csky_expand_epilogue): Add for HARD_FRAME_POINTER_REGNUM.
23585 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23587 * config/csky/csky.c (csky_option_override):
23588 Init csky_arch_isa_features[] in advance, so TARGET_DSP
23589 and TARGET_DIV can be set well.
23591 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23593 * config/csky/constraints.md ("l", "h"): Delete.
23594 * config/csky/csky.h (reg_class, REG_CLASS_NAMES,
23595 REG_CLASS_CONTENTS): Delete LO_REGS and HI_REGS.
23596 * config/csky/csky.c (regno_reg_classm,
23597 csky_secondary_reload, csky_register_move_cost):
23598 Use HILO_REGS instead of LO_REGS and HI_REGS.
23600 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23602 * config/csky/constraints.md ("W"): New constriant for mem operand
23603 with base reg, index register.
23604 ("Q"): Renamed and modified "csky_valid_fpuv2_mem_operand" to
23605 "csky_valid_mem_constraint_operand" to deal with both "Q" and "W"
23607 ("Dv"): New constraint for const double value that can be used at
23609 * config/csky/csky-modes.def (HFmode): New mode.
23610 * config/csky/csky-protos.h (csky_valid_fpuv2_mem_operand): Rename
23611 to "csky_valid_mem_constraint_operand" and support new constraint
23613 (csky_get_movedouble_length): New.
23614 (fpuv3_output_move): New.
23615 (fpuv3_const_double): New.
23616 * config/csky/csky.c (csky_option_override): New arch CK860 with fpv3.
23617 (decompose_csky_address): Refine.
23618 (csky_print_operand): New "CONST_DOUBLE" operand.
23619 (csky_output_move): Support fpv3 instructions.
23620 (csky_get_movedouble_length): New.
23621 (fpuv3_output_move): New.
23622 (fpuv3_const_double): New.
23623 (csky_emit_compare): Cover float comparsion.
23624 (csky_emit_compare_float): Refine.
23625 (csky_vaild_fpuv2_mem_operand): Rename to
23626 "csky_valid_mem_constraint_operand" and support new constraint "W".
23627 (ck860_rtx_costs): New.
23628 (csky_rtx_costs): Add the cost calculation of CK860.
23629 (regno_reg_class): New vregs for fpuv3.
23630 (csky_dbx_regno): Likewise.
23631 (csky_cpu_cpp_builtins): New builtin macro for fpuv3.
23632 (csky_conditional_register_usage): Suporrot fpuv3.
23633 (csky_dwarf_register_span): Suporrot fpuv3.
23634 (csky_init_builtins, csky_mangle_type): Support "__fp16" type.
23635 (ck810_legitimate_index_p): Support fp16.
23636 * config/csky/csky.h (TARGET_TLS): ADD CK860.
23637 (CSKY_VREG_P, CSKY_VREG_LO_P, CSKY_VREG_HI_P): Support fpuv3.
23638 (TARGET_SINGLE_FPU): Support fpuv3.
23639 (TARGET_SUPPORT_FPV3): New.
23640 (FIRST_PSEUDO_REGISTER): Change to 202 to hold the new fpuv3 registers.
23641 (FIXED_REGISTERS, CALL_REALLY_USED_REGISTERS, REGISTER_NAMES,
23642 REG_CLASS_CONTENTS): Support fpuv3.
23643 * config/csky/csky.md (movsf): Move to cksy_insn_fpu.md and refine.
23644 (csky_movsf_fpv2): Likewise.
23645 (ck801_movsf): Likewise.
23646 (csky_movsf): Likewise.
23648 (csky_movdf_fpv2): Likewise.
23649 (ck801_movdf): Likewise.
23650 (csky_movdf): Likewise.
23651 (movsicc): Refine. Use "comparison_operatior" instead of
23652 "ordered_comparison_operatior".
23653 (addsicc): Likewise.
23654 (CSKY_FIRST_VFP3_REGNUM, CSKY_LAST_VFP3_REGNUM): New constant.
23655 (call_value_internal_vh): New.
23656 * config/csky/csky_cores.def (CK860): New arch and cpu.
23661 * config/csky/csky_insn_fpu.md: Refactor. Separate all float patterns
23662 into emit-patterns and match-patterns, remain the emit-patterns here,
23663 and move the match-patterns to csky_insn_fpuv2.md or
23664 csky_insn_fpuv3.md.
23665 * config/csky/csky_insn_fpuv2.md: New file for fpuv2 instructions.
23666 * config/csky/csky_insn_fpuv3.md: New file and new patterns for fpuv3
23668 * config/csky/csky_isa.def (fcr): New.
23673 (CK860): New definition for ck860.
23674 * config/csky/csky_tables.opt (ck860): New processors ck860,
23675 ck860f. And new arch ck860.
23680 * config/csky/predicates.md (csky_float_comparsion_operator): Delete
23681 "geu", "gtu", "leu", "ltu", which will never appear at float comparison.
23682 * config/csky/t-csky-elf: Support 860.
23683 * config/csky/t-csky-linux: Likewise.
23684 * doc/md.texi: Add "Q" and "W" constraints for C-SKY.
23686 2021-05-24 Aaron Sawdey <acsawdey@linux.ibm.com>
23688 * config/rs6000/genfusion.pl (gen_logical_addsubf): Refactor to
23689 add generation of logical-add and add-logical fusion pairs.
23690 * config/rs6000/rs6000-cpus.def: Add new fusion to ISA 3.1 mask
23692 * config/rs6000/rs6000.c (rs6000_option_override_internal): Turn on
23693 logical-add and add-logical fusion by default.
23694 * config/rs6000/rs6000.opt: Add -mpower10-fusion-logical-add and
23695 -mpower10-fusion-add-logical options.
23696 * config/rs6000/fusion.md: Regenerate file.
23698 2021-05-24 Aldy Hernandez <aldyh@redhat.com>
23700 * value-range.cc (irange::legacy_equal_p): Check type when
23701 comparing VR_VARYING types.
23702 (range_tests_legacy): Test comparing VARYING ranges of different
23705 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
23707 * config/aarch64/aarch64.c (neoversen1_tunings):
23708 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
23710 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
23712 * config/aarch64/aarch64.c (aarch64_classify_symbol): Use GOT for
23713 extern weak symbols. Limit symbol offsets for non-GOT symbols with
23716 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
23718 * config/arm/neon.md (vec_load_lanesxi<mode>)
23719 (vec_store_lanexoi<mode>): Move ...
23720 * config/arm/vec-common.md: here.
23722 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
23724 * config/arm/neon.md (vec_load_lanesoi<mode>)
23725 (vec_store_lanesoi<mode>): Move ...
23726 * config/arm/vec-common.md: here.
23728 2021-05-24 liuhongt <hongtao.liu@intel.com>
23731 * config/i386/i386.c (ix86_gimple_fold_builtin): Replacing
23732 stmt with GIMPLE_NOP when lhs doesn't exist.
23734 2021-05-23 Uroš Bizjak <ubizjak@gmail.com>
23737 * config/i386/mmx.md (*push<VI_32:mode>2_rex64):
23738 New instruction pattern.
23739 (*push<VI_32:mode>2): Ditto.
23740 (push splitter for SSE registers): New splitter.
23742 2021-05-23 Andrew Pinski <apinski@marvell.com>
23744 * match.pd ((A & C) != 0 ? D : 0): Limit to non pointer types.
23746 2021-05-22 Aaron Sawdey <acsawdey@linux.ibm.com>
23748 * config/rs6000/genfusion.pl (gen_addadd): Fix incorrect attr types.
23749 * config/rs6000/fusion.md: Regenerate file.
23751 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
23753 * config/rs6000/genfusion.pl (gen_addadd): New function.
23754 * config/rs6000/fusion.md: Regenerate file.
23755 * config/rs6000/rs6000-cpus.def: Add
23756 OPTION_MASK_P10_FUSION_2ADD to masks.
23757 * config/rs6000/rs6000.c (rs6000_option_override_internal):
23758 Handle default value of OPTION_MASK_P10_FUSION_2ADD.
23759 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2add.
23761 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23763 PR middle-end/99928
23764 * tree.h (OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET): Define.
23765 * gimplify.c (enum gimplify_omp_var_data): Fix up
23766 GOVD_MAP_HAS_ATTACHMENTS value, add GOVD_FIRSTPRIVATE_IMPLICIT.
23767 (omp_lastprivate_for_combined_outer_constructs): If combined target
23768 has GOVD_FIRSTPRIVATE_IMPLICIT set for the decl, change it to
23769 GOVD_MAP | GOVD_SEEN.
23770 (gimplify_scan_omp_clauses): Set GOVD_FIRSTPRIVATE_IMPLICIT for
23771 firstprivate clauses with OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT.
23772 (gimplify_adjust_omp_clauses): For firstprivate clauses with
23773 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT either clear that bit and
23774 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET too, or remove it and
23775 let it be replaced by implicit map clause.
23777 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23779 PR middle-end/99928
23780 * gimplify.c (omp_lastprivate_for_combined_outer_constructs): New
23782 (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LASTPRIVATE>: Use it.
23783 (gimplify_omp_for): Likewise.
23785 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23787 PR middle-end/90115
23788 * omp-low.c (oacc_privatization_candidate_p): Reject 'static',
23789 'external' in blocks.
23791 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23793 PR middle-end/90115
23794 * flag-types.h (enum openacc_privatization): New.
23795 * params.opt (-param=openacc-privatization): New.
23796 * doc/invoke.texi (openacc-privatization): Document it.
23797 * omp-general.h (get_openacc_privatization_dump_flags): New
23799 * omp-low.c (oacc_privatization_candidate_p): Add diagnostics.
23800 * omp-offload.c (execute_oacc_device_lower)
23801 <IFN_UNIQUE_OACC_PRIVATE>: Re-work diagnostics.
23802 * target.def (goacc.adjust_private_decl): Add 'location_t'
23804 * doc/tm.texi: Regenerate.
23805 * config/gcn/gcn-protos.h (gcn_goacc_adjust_private_decl): Adjust.
23806 * config/gcn/gcn-tree.c (gcn_goacc_adjust_private_decl): Likewise.
23807 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl):
23808 Likewise. Preserve it for...
23809 (nvptx_goacc_expand_var_decl): ... use here.
23811 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23813 * doc/sourcebuild.texi (Other attributes): Document '__OPTIMIZE__'
23816 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23818 PR middle-end/90115
23819 * omp-low.c (oacc_privatization_candidate_p): New function.
23820 (oacc_privatization_scan_clause_chain)
23821 (oacc_privatization_scan_decl_chain): Use it. Also
23822 'gcc_checking_assert' that we're not seeing duplicates.
23824 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23826 PR middle-end/90115
23827 * omp-offload.c (execute_oacc_device_lower): Skip processing if no
23830 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23832 PR middle-end/90115
23833 * omp-offload.c (execute_oacc_device_lower): Explain.
23835 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23837 PR middle-end/90115
23838 * omp-offload.c (execute_oacc_device_lower)
23839 <IFN_UNIQUE_OACC_PRIVATE>: Diagnose and handle for 'level == -1'
23841 * internal-fn.c (expand_UNIQUE): Don't expect
23842 'IFN_UNIQUE_OACC_PRIVATE'.
23844 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23846 PR middle-end/90115
23847 * omp-low.c (lower_omp_for): Don't evaluate OpenMP 'for' clauses.
23849 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23851 PR middle-end/90115
23852 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl)
23853 (nvptx_goacc_expand_var_decl): Tighten.
23855 2021-05-21 Julian Brown <julian@codesourcery.com>
23856 Chung-Lin Tang <cltang@codesourcery.com>
23857 Thomas Schwinge <thomas@codesourcery.com>
23859 PR middle-end/90115
23860 * doc/tm.texi.in (TARGET_GOACC_EXPAND_VAR_DECL)
23861 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Add documentation hooks.
23862 * doc/tm.texi: Regenerate.
23863 * expr.c (expand_expr_real_1): Expand decls using the
23864 expand_var_decl OpenACC hook if defined.
23865 * internal-fn.c (expand_UNIQUE): Handle IFN_UNIQUE_OACC_PRIVATE.
23866 * internal-fn.h (IFN_UNIQUE_CODES): Add OACC_PRIVATE.
23867 * omp-low.c (omp_context): Add oacc_privatization_candidates
23869 (lower_oacc_reductions): Add PRIVATE_MARKER parameter. Insert
23871 (lower_oacc_head_tail): Add PRIVATE_MARKER parameter. Modify
23872 private marker's gimple call arguments, and pass it to
23873 lower_oacc_reductions.
23874 (oacc_privatization_scan_clause_chain)
23875 (oacc_privatization_scan_decl_chain, lower_oacc_private_marker):
23877 (lower_omp_for, lower_omp_target, lower_omp_1): Use these.
23878 * omp-offload.c (convert.h): Include.
23879 (oacc_loop_xform_head_tail): Treat private-variable markers like
23880 fork/join when transforming head/tail sequences.
23881 (struct var_decl_rewrite_info): Add struct.
23882 (oacc_rewrite_var_decl, is_sync_builtin_call): New functions.
23883 (execute_oacc_device_lower): Support rewriting gang-private
23884 variables using target hook, and fix up addr_expr and var_decl
23886 * target.def (adjust_private_decl, expand_var_decl): New hooks.
23887 * config/gcn/gcn-protos.h (gcn_goacc_adjust_gangprivate_decl):
23889 (gcn_goacc_adjust_private_decl): ...this.
23890 * config/gcn/gcn-tree.c (gcn_goacc_adjust_gangprivate_decl):
23892 (gcn_goacc_adjust_private_decl): ...this. Add LEVEL parameter.
23893 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_GANGPRIVATE_DECL): Rename
23894 definition using gcn_goacc_adjust_gangprivate_decl...
23895 (TARGET_GOACC_ADJUST_PRIVATE_DECL): ...to this, using
23896 gcn_goacc_adjust_private_decl.
23897 * config/nvptx/nvptx.c (tree-pretty-print.h): Include.
23898 (gang_private_shared_size): New global variable.
23899 (gang_private_shared_align): Likewise.
23900 (gang_private_shared_sym): Likewise.
23901 (gang_private_shared_hmap): Likewise.
23902 (nvptx_option_override): Initialize these.
23903 (nvptx_file_end): Output gang_private_shared_sym.
23904 (nvptx_goacc_adjust_private_decl, nvptx_goacc_expand_var_decl):
23906 (nvptx_set_current_function): Clear gang_private_shared_hmap.
23907 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Define hook.
23908 (TARGET_GOACC_EXPAND_VAR_DECL): Likewise.
23910 2021-05-21 H.J. Lu <hjl.tools@gmail.com>
23912 * config/i386/i386-modes.def (MAX_BITSIZE_MODE_ANY_INT): Removed.
23914 2021-05-21 Richard Biener <rguenther@suse.de>
23915 H.J. Lu <hjl.tools@gmail.com>
23917 PR middle-end/90773
23918 * expr.c (expand_constructor): Elide expand_constructor if
23919 move by pieces is preferred.
23921 2021-05-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23923 * config/aarch64/aarch64-builtins.c (aarch64_call_properties):
23924 Take a flag and mode value as arguments.
23925 (aarch64_modifies_global_state_p): Likewise.
23926 (aarch64_reads_global_state_p): Likewise.
23927 (aarch64_could_trap_p): Likewise.
23928 (aarch64_get_attributes): Likewise.
23929 (aarch64_init_simd_builtins): Adjust callsite of above.
23930 (aarch64_init_fcmla_laneq_builtins): Use aarch64_get_attributes to get
23931 function attributes to apply to builtins.
23932 (aarch64_init_crc32_builtins): Likewise.
23933 (aarch64_init_builtin_rsqrt): Likewise.
23935 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
23937 * config/rs6000/rs6000.md (define_attr "type"): Add types for fusion.
23938 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10): Use new fusion types.
23939 (gen_2logical): Use new fusion types.
23940 * config/rs6000/fusion.md: Regenerate.
23942 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
23945 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
23946 Handle V4QI and V2HI modes.
23947 (ix86_expand_sse_movcc): Ditto.
23948 * config/i386/mmx.md (*<sat_plusminus:insn><VI_32:mode>3):
23949 New instruction pattern.
23950 (*eq<VI_32:mode>3): Ditto.
23951 (*gt<VI_32:mode>3): Ditto.
23952 (*xop_pcmov_<VI_32:mode>): Ditto.
23953 (mmx_pblendvb32): Ditto.
23954 (mmx_pblendvb64): Rename from mmx_pblendvb.
23955 (vec_cmp<VI_32:mode><VI_32:mode>): New expander.
23956 (vec_cmpu<VI_32:mode><VI_32:mode>): Ditto.
23957 (vcond<VI_32:mode><VI_32:mode>): Ditto.
23958 (vcondu<VI_32:mode><VI_32:mode>): Ditto.
23959 (vcond_mask_<VI_32:mode><VI_32:mode>): Ditto.
23961 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23963 PR tree-optimization/94589
23964 * tree-ssa-phiopt.c (spaceship_replacement): For integral rhs1 and
23965 rhs2, treat x <= 4 equivalently to x < 5 etc. In cmp1 and cmp2 (if
23966 not the same as cmp3) treat <= the same as < and >= the same as >.
23967 Don't require that cond2_phi_edge is true edge, instead take
23968 false/true edges into account based on cmp1/cmp2 comparison kinds.
23970 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
23973 * config/i386/mmx.md (SMAXMIN_MMXMODEI): New mode iterator.
23974 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Macroize expander
23975 from <smaxmin:code>v4hi3> and <smaxmin:code><MMXMODE14:mode>3
23976 using SMAXMIN_MMXMODEI mode iterator.
23977 (*<smaxmin:code>v4qi3): New insn pattern.
23978 (*<smaxmin:code>v2hi3): Ditto.
23979 (SMAXMIN_VI_32): New mode iterator.
23980 (<smaxmin:code><SMAXMIN_VI_32>mode3): New expander.
23981 (UMAXMIN_MMXMODEI): New mode iterator.
23982 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Macroize expander
23983 from <umaxmin:code>v8qi3> and <umaxmin:code><MMXMODE24:mode>3
23984 using UMAXMIN_MMXMODEI mode iterator.
23985 (*<umaxmin:code>v4qi3): New insn pattern.
23986 (*<umaxmin:code>v2hi3): Ditto.
23987 (UMAXMIN_VI_32): New mode iterator.
23988 (<umaxmin:code><UMAXMIN_VI_32>mode3): New expander.
23989 (abs<VI_32:mode>2): New insn pattern.
23990 (ssse3_abs<MMXMODEI:mode>2, abs<MMXMODEI:mode>2): Move from ...
23991 * config/i386/sse.md: ... here.
23993 2021-05-20 Clement Chigot <clement.chigot@atos.net>
23994 David Edelsohn <dje.gcc@gmail.com>
23996 * collect2.c (scan_prog_file): Issue non-fatal warning for
23999 2021-05-20 Jonathan Wakely <jwakely@redhat.com>
24001 * doc/invoke.texi (-Wno-c++11-extensions)
24002 (-Wno-c++14-extensions, -Wno-c++17-extensions)
24003 (-Wno-c++20-extensions, -Wno-c++23-extensions): Document
24006 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
24008 * config/c6x/c6x.c (c6x_output_file_unwind): Use dwarf_debuginfo_p.
24009 * config/darwin.c (darwin_override_options): Likewise.
24010 * config/i386/cygming.h (DBX_REGISTER_NUMBER): Likewise.
24011 * config/i386/darwin.h (DBX_REGISTER_NUMBER): Likewise.
24012 (DWARF2_FRAME_REG_OUT): Likewise.
24013 * config/mips/mips.c (mips_output_filename): Likewise.
24014 * config/rs6000/rs6000.c (rs6000_xcoff_declare_function_name):
24016 (rs6000_dbx_register_number): Likewise.
24017 * dbxout.c: Include flags.h.
24018 * dwarf2cfi.c (cfi_label_required_p): Likewise.
24019 (dwarf2out_do_frame): Likewise.
24020 * except.c: Include flags.h.
24021 * final.c (dwarf2_debug_info_emitted_p): Likewise.
24022 (final_scan_insn_1): Likewise.
24023 * flags.h (dwarf_debuginfo_p): New function declaration.
24024 * opts.c (dwarf_debuginfo_p): New function definition.
24025 * targhooks.c (default_debug_unwind_info): Use dwarf_debuginfo_p.
24026 * toplev.c (process_options): Likewise.
24028 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
24030 * common.opt: Change type to support bitmasks.
24031 * flag-types.h (enum debug_info_type): Rename enumerator constants.
24032 (NO_DEBUG): New bitmask.
24033 (DBX_DEBUG): Likewise.
24034 (DWARF2_DEBUG): Likewise.
24035 (XCOFF_DEBUG): Likewise.
24036 (VMS_DEBUG): Likewise.
24037 (VMS_AND_DWARF2_DEBUG): Likewise.
24038 * flags.h (debug_set_to_format): New function declaration.
24039 (debug_set_count): Likewise.
24040 (debug_set_names): Likewise.
24041 * opts.c (debug_type_masks): Array of bitmasks for debug formats.
24042 (debug_set_to_format): New function definition.
24043 (debug_set_count): Likewise.
24044 (debug_set_names): Likewise.
24045 (set_debug_level): Update access to debug_type_names.
24046 * toplev.c: Likewise.
24048 2021-05-20 Martin Sebor <msebor@redhat.com>
24050 PR middle-end/100684
24051 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Handle C++ lambda.
24053 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
24056 * config/i386/i386.md (isa): Remove x64_bmi.
24057 (enabled): Remove x64_bmi.
24058 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
24059 Remove general register alternative.
24060 (*andnot<VI_32:mode>3): Ditto.
24061 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
24062 (*<any_logic:code><VI_32:mode>3): Ditto.
24064 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
24066 * config/arm/arm.c: Include head files tree-vectorizer.h and
24069 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
24072 * config/i386/mmx.md (Yv_Yw): Revert adding V4QI and V2HI modes.
24073 (*<plusminus:insn><VI_32:mode>3): Use Yw instad of <Yv_Yw> constrint.
24074 (<s>mulv4hi3_highpart): New expander.
24075 (*<s>mulv2hi3_highpart): New insn pattern.
24076 (<s>mulv2hi3_higpart): New expander.
24077 (*<any_shift:insn>v2hi3): New insn pattern.
24078 (<any_shift:insn>v2hi3): New expander.
24079 * config/i386/sse.md (smulhrsv2hi3): New expander.
24080 (*smulhrsv2hi3): New insn pattern.
24082 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
24084 * doc/invoke.texi (vect-inner-loop-cost-factor): Document new
24086 * params.opt (vect-inner-loop-cost-factor): New.
24087 * targhooks.c (default_add_stmt_cost): Replace hardcoded factor
24088 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR, include head file
24089 tree-vectorizer.h and its required ones.
24090 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Replace
24091 hardcoded factor 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR.
24092 * config/arm/arm.c (arm_add_stmt_cost): Likewise.
24093 * config/i386/i386.c (ix86_add_stmt_cost): Likewise.
24094 * config/rs6000/rs6000.c (rs6000_add_stmt_cost): Likewise.
24095 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
24097 (_loop_vec_info::_loop_vec_info): Init inner_loop_cost_factor.
24098 * tree-vectorizer.h (_loop_vec_info): Add inner_loop_cost_factor.
24099 (LOOP_VINFO_INNER_LOOP_COST_FACTOR): New macro.
24101 2021-05-20 Christophe Lyon <christophe.lyon@linaro.org>
24102 Torbjörn Svensson <torbjorn.svensson@st.com>
24105 * doc/cpp.texi (Common Predefined Macros): Document __FILE_NAME__.
24107 2021-05-20 Jakub Jelinek <jakub@redhat.com>
24109 PR middle-end/99928
24110 * gimplify.c (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LINEAR>: For
24111 explicit linear clause when combined with target, make it map(tofrom:)
24112 instead of no clause or firstprivate.
24114 2021-05-20 Jakub Jelinek <jakub@redhat.com>
24116 PR tree-optimization/94589
24117 * match.pd ((X & Y) == X -> (X & ~Y) == 0): Simplify even in presence
24118 of integral conversions.
24120 2021-05-19 Andrew MacLeod <amacleod@redhat.com>
24122 * gimple-range.cc (fur_source::get_operand): New.
24123 (gimple_range_fold): Delete.
24124 (fold_using_range::fold_stmt): Move from gimple_ranger::calc_stmt.
24125 (fold_using_range::range_of_range_op): Move from gimple_ranger.
24126 (fold_using_range::range_of_address): Ditto.
24127 (fold_using_range::range_of_phi): Ditto.
24128 (fold_using_range::range_of_call): Ditto.
24129 (fold_using_range::range_of_builtin_ubsan_call): Move from
24130 range_of_builtin_ubsan_call.
24131 (fold_using_range::range_of_builtin_call): Move from
24132 range_of_builtin_call.
24133 (gimple_ranger::range_of_builtin_call): Delete.
24134 (fold_using_range::range_of_cond_expr): Move from gimple_ranger.
24135 (gimple_ranger::fold_range_internal): New.
24136 (gimple_ranger::range_of_stmt): Use new fold_using_range API.
24137 (fold_using_range::range_of_ssa_name_with_loop_info): Move from
24138 gimple_ranger. Improve ranges of SSA_NAMES when possible.
24139 * gimple-range.h (gimple_ranger): Remove various range_of routines.
24140 (class fur_source): New.
24141 (class fold_using_range): New.
24142 (fur_source::fur_source): New.
24144 * vr-values.c (vr_values::extract_range_basic): Use fold_using_range
24145 instead of range_of_builtin_call.
24147 2021-05-19 Jonathan Wakely <jwakely@redhat.com>
24149 * doc/cpp.texi (Common Predefined Macros): Update documentation
24150 for the __GXX_EXPERIMENTAL_CXX0X__ macro.
24152 2021-05-19 Alex Coplan <alex.coplan@arm.com>
24155 * config/arm/arm.md (nonsecure_call_internal): Always ensure
24156 callee's address is in a register.
24158 2021-05-19 Geng Qi <gengqi@linux.alibaba.com>
24160 * common/config/riscv/riscv-common.c
24161 (riscv_subset_list::parsing_subset_version): Properly parse the letter
24163 (riscv_subset_list::parse_std_ext,
24164 riscv_subset_list::parse_multiletter_ext): To handle errors generated
24165 in riscv_subset_list::parsing_subset_version.
24167 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24169 * config/aarch64/aarch64-simd.md: Use "neon_move_narrow_q"
24170 type attribute in patterns generating XTN(2).
24172 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24174 * config/aarch64/aarch64-simd.md (aarch64_simd_vec_pack_trunc_<mode>):
24175 Remove as duplicate of...
24176 (aarch64_xtn<mode>): This.
24177 (aarch64_xtn2<mode>_le): Move position in file.
24178 (aarch64_xtn2<mode>_be): Move position in file.
24179 (aarch64_xtn2<mode>): Move position in file.
24180 (vec_pack_trunc_<mode>): Define as an expander.
24182 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24184 * config/aarch64/aarch64-simd-builtins.def: Split builtin
24185 generation for aarch64_<sur>q<r>shr<u>n_n<mode> pattern into
24186 separate scalar and vector generators.
24187 * config/aarch64/aarch64-simd.md
24188 (aarch64_<sur>q<r>shr<u>n_n<mode>): Define as an expander and
24190 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_le): This and...
24191 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_be): This.
24192 * config/aarch64/iterators.md: Define SD_HSDI iterator.
24194 2021-05-19 Jonathn Wright <jonathan.wright@arm.com>
24196 * config/aarch64/aarch64-simd.md: Use UNSPEC_SQXTUN instead
24198 * config/aarch64/iterators.md: Remove UNSPEC_SQXTUN2.
24200 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24202 * config/aarch64/aarch64-simd.md (aarch64_<sur>q<r>shr<u>n2_n<mode>):
24203 Implement as an expand emitting a big/little endian
24204 instruction pattern.
24205 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_le): Define.
24206 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_be): Define.
24208 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24210 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn2<mode>):
24211 Implement as an expand emitting a big/little endian
24212 instruction pattern.
24213 (aarch64_<sur><addsub>hn2<mode>_insn_le): Define.
24214 (aarch64_<sur><addsub>hn2<mode>_insn_be): Define.
24215 * config/aarch64/iterators.md: Remove UNSPEC_[R]ADDHN2 and
24216 UNSPEC_[R]SUBHN2 unspecs and ADDSUBHN2 iterator.
24218 2021-05-19 Richard Biener <rguenther@suse.de>
24220 PR middle-end/100672
24221 * fold-const.c (fold_negate_expr_1): Use element_precision.
24222 (negate_expr_p): Likewise.
24224 2021-05-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
24226 * config/aarch64/iterators.md (SVE_PRED_LOAD): New iterator.
24227 (pred_load): New int attribute.
24228 * config/aarch64/aarch64-sve.md
24229 (aarch64_load_<ANY_EXTEND:optab><SVE_HSDI:mode><SVE_PARTIAL_I:mode>): Use
24230 SVE_PRED_LOAD enum iterator and corresponding pred_load attribute.
24231 * config/aarch64/aarch64-sve-builtins-base.cc (expand): Update call to
24232 code_for_aarch64_load.
24234 2021-05-19 Richard Biener <rguenther@suse.de>
24236 * cfgexpand.c (discover_nonconstant_array_refs_r): Make
24237 sure TARGET_MEM_REF bases are expanded as memory.
24238 * tree-ssa-operands.c (operands_scanner::get_tmr_operands):
24239 Do not mark TARGET_MEM_REF bases addressable.
24240 * tree-ssa.c (non_rewritable_mem_ref_base): Handle
24241 TARGET_MEM_REF bases as never rewritable.
24242 * gimple-walk.c (walk_stmt_load_store_addr_ops): Do not
24243 walk TARGET_MEM_REF bases as address-takens.
24244 * tree-ssa-dce.c (ref_may_be_aliased): Handle TARGET_MEM_REF.
24246 2021-05-19 Richard Biener <rguenther@suse.de>
24248 * builtins.c (get_object_alignment_1): Strip outer
24250 * tree-dfa.c (get_ref_base_and_extent): Handle outer
24251 WITH_SIZE_EXPR for size processing and process the
24253 * tree-ssa-alias.c (ao_ref_base_alias_set): Strip
24254 outer WITH_SIZE_EXPR.
24255 (ao_ref_base_alias_ptr_type): Likewise.
24256 (refs_may_alias_p_2): Allow WITH_SIZE_EXPR in ref->ref
24257 and handle that accordingly, stripping it for the
24258 core alias workers.
24259 * tree.c (get_base_address): Handle WITH_SIZE_EXPR by
24260 looking through it instead of returning NULL.
24262 2021-05-19 Jakub Jelinek <jakub@redhat.com>
24264 PR middle-end/100576
24265 * builtins.c (check_read_access): Convert bound to size_type_node if
24268 2021-05-19 Richard Biener <rguenther@suse.de>
24270 * tree-cfg.c (verify_types_in_gimple_min_lval): Inline...
24271 (verify_types_in_gimple_reference): ... here. Sanitize.
24272 (verify_gimple_call): Verify references in LHS and arguments.
24273 (verify_gimple_assign_single): Reject WITH_SIZE_EXPR.
24275 2021-05-19 Uroš Bizjak <ubizjak@gmail.com>
24277 * config/i386/i386.h (VALID_INT_MODE_P):
24278 Add V8QI, V4HI and V2SI modes for TARGET_64BIT.
24279 * config/i386/i386.md (isa): Add x64_bmi.
24280 (enabled): Handle x64_bmi.
24281 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
24282 Add alternative using 64bit general registers.
24283 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
24285 2021-05-19 Jakub Jelinek <jakub@redhat.com>
24287 PR middle-end/99928
24288 * tree.h (OMP_MASTER_COMBINED): Define.
24289 * gimplify.c (gimplify_scan_omp_clauses): Rewrite lastprivate
24290 handling for outer combined/composite constructs to a loop.
24291 Handle lastprivate on combined target.
24292 (gimplify_expr): Formatting fix.
24294 2021-05-19 Xionghu Luo <luoxhu@linux.ibm.com>
24296 * passes.def: Add sink_code pass before store_merging.
24297 * tree-ssa-sink.c (pass_sink_code:clone): New.
24299 2021-05-18 Bill Schmidt <wschmidt@linux.ibm.com>
24301 * config/rs6000/freebsd64.h (ADJUST_FIELD_ALIGN): Remove call to
24302 rs6000_special_adjust_field_align_p.
24303 * config/rs6000/linux64.h (ADJUST_FIELD_ALIGN): Likewise.
24304 * config/rs6000/rs6000-call.c (rs6000_function_arg_boundary):
24305 Remove ABI warning.
24306 (rs6000_function_arg): Likewise.
24307 * config/rs6000/rs6000-protos.h
24308 (rs6000_special_adjust_field_align_p): Remove prototype.
24309 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align_p):
24311 * config/rs6000/sysv4.h (ADJUST_FIELD_ALIGN): Remove call to
24312 rs6000_special_adjust_field_align_p.
24314 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24317 * config/i386/i386.h (VALID_SSE2_REG_MODE):
24318 Add V4QI and V2HI modes.
24319 (VALID_INT_MODE_P): Ditto.
24320 * config/i386/mmx.md (VI_32): New mode iterator.
24321 (mmxvecsize): Handle V4QI and V2HI.
24323 (mov<VI_32:mode>): New expander.
24324 (*mov<mode>_internal): New insn pattern.
24325 (movmisalign<VI_32:mode>): New expander.
24326 (neg<VI_32:mode>): New expander.
24327 (<plusminus:insn><VI_32:mode>3): New expander.
24328 (*<plusminus:insn><VI_32:mode>3): New insn pattern.
24329 (mulv2hi3): New expander.
24330 (*mulv2hi3): New insn pattern.
24331 (one_cmpl<VI_32:mode>2): New expander.
24332 (*andnot<VI_32:mode>3): New insn pattern.
24333 (<any_logic:code><VI_32:mode>3): New expander.
24334 (*<any_logic:code><VI_32:mode>3): New insn pattern.
24336 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24338 * config/i386/sse.md (<any_extend:insn>v4qiv4di2):
24339 Fix a mode mismatch with operand 1.
24341 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24344 * config/i386/i386-expand.c (split_double_mode): Return
24345 temporary register when simplify_gen_subreg fails with
24346 the high half od the paradoxical subreg.
24348 2021-05-18 Richard Biener <rguenther@suse.de>
24350 * cfgexpand.c (expand_one_var): Pass in forced_stack_var
24351 and honor it when expanding.
24352 (expand_used_vars_for_block): Pass through forced_stack_var.
24353 (expand_used_vars): Likewise.
24354 (discover_nonconstant_array_refs_r): Set bits in
24355 forced_stack_vars instead of marking vars TREE_ADDRESSABLE.
24356 (avoid_type_punning_on_regs): Likewise.
24357 (discover_nonconstant_array_refs): Likewise.
24358 (pass_expand::execute): Create and pass down forced_stack_var
24359 bitmap. For parameters and returns temporarily set
24360 TREE_ADDRESSABLE when expand_function_start.
24362 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
24364 * doc/sourcebuild.texi: Document 'dg-note'.
24366 2021-05-18 Tobias Burnus <tobias@codesourcery.com>
24369 * configure: Regenerate.
24370 * configure.ac (BUILD_CFLAG, BUILD_CXXFLAGS): Add $(CFLAGS-$@).
24372 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
24374 * gimple.h (is_gimple_omp_oacc): Tighten.
24375 * omp-low.c (check_omp_nesting_restrictions): Adjust.
24377 2021-05-18 Richard Biener <rguenther@suse.de>
24379 * tree-ssa-operands.c (mark_address_taken): Simplify.
24381 2021-05-18 Martin Liska <mliska@suse.cz>
24383 * config/gcn/mkoffload.c (STR): Redefine.
24384 * config/i386/intelmic-mkoffload.c (STR): Likewise.
24385 * config/nvptx/mkoffload.c (STR): Likewise.
24387 2021-05-18 Martin Liska <mliska@suse.cz>
24389 * common/config/aarch64/aarch64-common.c (aarch64_parse_extension):
24390 Use startswith function instead of strncmp.
24391 * common/config/bfin/bfin-common.c (bfin_handle_option): Likewise.
24392 * common/config/riscv/riscv-common.c (riscv_subset_list::parse): Likewise.
24393 * config/aarch64/aarch64-sve-builtins-shapes.cc (parse_type): Likewise.
24394 * config/aarch64/aarch64.c (aarch64_process_one_target_attr): Likewise.
24395 * config/alpha/alpha.c (alpha_elf_section_type_flags): Likewise.
24396 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
24397 * config/arm/arm.c (arm_file_start): Likewise.
24398 (arm_valid_target_attribute_rec): Likewise.
24399 (thumb1_md_asm_adjust): Likewise.
24400 * config/arm/driver-arm.c (host_detect_local_cpu): Likewise.
24401 * config/avr/avr.c (STR_PREFIX_P): Likewise.
24402 (avr_set_current_function): Likewise.
24403 (avr_handle_addr_attribute): Likewise.
24404 (avr_asm_output_aligned_decl_common): Likewise.
24405 (avr_asm_named_section): Likewise.
24406 (avr_section_type_flags): Likewise.
24407 (avr_asm_select_section): Likewise.
24408 * config/c6x/c6x.c (c6x_in_small_data_p): Likewise.
24409 (c6x_section_type_flags): Likewise.
24410 * config/darwin-c.c (darwin_cfstring_ref_p): Likewise.
24411 (darwin_objc_declare_unresolved_class_reference): Likewise.
24412 (darwin_objc_declare_class_definition): Likewise.
24413 * config/darwin.c (indirect_data): Likewise.
24414 (darwin_encode_section_info): Likewise.
24415 (darwin_objc2_section): Likewise.
24416 (darwin_objc1_section): Likewise.
24417 (machopic_select_section): Likewise.
24418 (darwin_globalize_label): Likewise.
24419 (darwin_label_is_anonymous_local_objc_name): Likewise.
24420 (darwin_asm_named_section): Likewise.
24421 (darwin_asm_output_dwarf_offset): Likewise.
24422 * config/frv/frv.c (frv_string_begins_with): Likewise.
24423 (frv_in_small_data_p): Likewise.
24424 * config/gcn/mkoffload.c (STR): Likewise.
24426 * config/i386/i386-builtins.c (get_builtin_code_for_version): Likewise.
24427 * config/i386/i386-options.c (ix86_option_override_internal): Likewise.
24428 * config/i386/i386.c (x86_64_elf_section_type_flags): Likewise.
24429 (ix86_md_asm_adjust): Likewise.
24430 * config/i386/intelmic-mkoffload.c (STR): Likewise.
24431 * config/i386/winnt.c (i386_pe_asm_named_section): Likewise.
24432 (i386_pe_file_end): Likewise.
24433 * config/ia64/ia64.c (ia64_in_small_data_p): Likewise.
24434 (ia64_section_type_flags): Likewise.
24435 * config/mips/driver-native.c (host_detect_local_cpu): Likewise.
24436 * config/mips/mips.c (mips_handle_interrupt_attr): Likewise.
24437 (mips16_stub_function_p): Likewise.
24438 (mips_function_rodata_section): Likewise.
24439 * config/msp430/msp430.c (msp430_mcu_name): Likewise.
24440 (msp430_function_section): Likewise.
24441 (msp430_section_type_flags): Likewise.
24442 (msp430_expand_helper): Likewise.
24443 * config/nios2/nios2.c (nios2_small_section_name_p): Likewise.
24444 (nios2_valid_target_attribute_rec): Likewise.
24445 * config/nvptx/mkoffload.c (process): Likewise.
24447 * config/pa/som.h: Likewise.
24448 * config/pdp11/pdp11.c (pdp11_output_ident): Likewise.
24449 * config/riscv/riscv.c (riscv_elf_select_rtx_section): Likewise.
24450 * config/rs6000/rs6000.c (VTABLE_NAME_P): Likewise.
24451 (rs6000_inner_target_options): Likewise.
24452 * config/s390/driver-native.c (s390_host_detect_local_cpu): Likewise.
24453 * config/sparc/driver-sparc.c (host_detect_local_cpu): Likewise.
24454 * config/vax/vax.c (vax_output_int_move): Likewise.
24455 * config/vms/vms-ld.c (startswith): Likewise.
24456 (process_args): Likewise.
24458 * config/vms/vms.c: Likewise.
24460 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24462 PR rtl-optimization/100590
24463 * regcprop.c (copyprop_hardreg_forward_1): Only DCE dead sets if
24464 they are NONJUMP_INSN_P.
24466 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24469 * function.c (push_dummy_function): Set DECL_ARTIFICIAL and
24470 DECL_ASSEMBLER_NAME on the fn_decl.
24472 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24474 PR tree-optimization/94589
24475 * tree-ssa-phiopt.c (spaceship_replacement): Pattern match
24476 phi result used in (res & ~1) == 0 comparison as res >= 0 as
24477 res == 2 would be UB with -ffinite-math-only.
24479 2021-05-18 Martin Liska <mliska@suse.cz>
24481 * Makefile.in: genversion.o should depend on DATESTAMP.
24483 2021-05-18 Claudiu Zissulescu <claziss@synopsys.com>
24485 * config/arc/simdext.md (negv2si2): Remove round bracket.
24487 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
24489 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Define
24490 _Bool as macro expanding to _Bool.
24492 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
24495 * tree.c (build_reference_type_for_mode)
24496 (build_pointer_type_for_mode): Pick pointer mode if MODE argument
24498 (build_reference_type, build_pointer_type): Invoke
24499 build_*_type_for_mode with VOIDmode.
24501 2021-05-17 Andrew MacLeod <amacleod@redhat.com>
24503 PR tree-optimization/100512
24504 * gimple-range-cache.cc (ranger_cache::set_global_range): Mark const
24505 and non-zero pointer ranges as invariant.
24506 * gimple-range.cc (gimple_ranger::range_of_stmt): Remove pointer
24507 processing from here.
24509 2021-05-17 Tom de Vries <tdevries@suse.de>
24512 * config/nvptx/nvptx-protos.h (nvptx_output_atomic_insn): Declare
24513 * config/nvptx/nvptx.c (nvptx_output_barrier)
24514 (nvptx_output_atomic_insn): New function.
24515 (nvptx_print_operand): Add support for 'B'.
24516 * config/nvptx/nvptx.md: Use nvptx_output_atomic_insn for atomic
24519 2021-05-17 Aldy Hernandez <aldyh@redhat.com>
24521 PR tree-optimization/100349
24522 * vr-values.c (bounds_of_var_in_loop): Bail if scev returns
24525 2021-05-17 Tamar Christina <tamar.christina@arm.com>
24527 * config/aarch64/driver-aarch64.c (DEFAULT_ARCH): New.
24528 (host_detect_local_cpu): Use it.
24530 2021-05-17 Martin Liska <mliska@suse.cz>
24532 * doc/invoke.texi: Add 2 missing dots.
24534 2021-05-17 Marius Hillenbrand <mhillen@linux.ibm.com>
24536 PR bootstrap/100552
24537 * configure.ac: Replace pattern substitution with call to sed.
24538 * configure: Regenerate.
24540 2021-05-17 Richard Biener <rguenther@suse.de>
24542 PR middle-end/100582
24543 * tree.c (array_at_struct_end_p): Get to the base of the
24544 reference before looking for the underlying decl.
24546 2021-05-17 Joern Rennecke <joern.rennecke@embecosm.com>
24548 * genoutput.c (validate_insn_alternatives) Make "wrong number of
24549 alternatives" message more specific, and remove assumption on where
24552 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24554 * config/arm/iterators.md (V16): New iterator.
24555 (VH_cvtto): New iterator.
24556 (v_cmp_result): Added V4HF and V8HF support.
24557 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result>): Use VDQWH.
24558 (vcond<mode><mode>): Likewise.
24559 (vcond_mask_<mode><v_cmp_result>): Likewise.
24560 (vcond<VH_cvtto><mode>): New expander.
24562 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24564 * config/arm/arm-protos.h (arm_expand_vector_compare): Update
24566 * config/arm/arm.c (arm_expand_vector_compare): Add support for
24568 (arm_expand_vcond): Likewise.
24569 * config/arm/iterators.md (supf): Remove VCMPNEQ_S, VCMPEQQ_S,
24570 VCMPEQQ_N_S, VCMPNEQ_N_S.
24571 (VCMPNEQ, VCMPEQQ, VCMPEQQ_N, VCMPNEQ_N): Remove.
24572 * config/arm/mve.md (@mve_vcmp<mve_cmp_op>q_<mode>): Add '@' prefix.
24573 (@mve_vcmp<mve_cmp_op>q_f<mode>): Likewise.
24574 (@mve_vcmp<mve_cmp_op>q_n_f<mode>): Likewise.
24575 (@mve_vpselq_<supf><mode>): Likewise.
24576 (@mve_vpselq_f<mode>"): Likewise.
24577 * config/arm/neon.md (vec_cmp<mode><v_cmp_result): Enable for MVE
24578 and move to vec-common.md.
24579 (vec_cmpu<mode><mode>): Likewise.
24580 (vcond<mode><mode>): Likewise.
24581 (vcond<V_cvtto><mode>): Likewise.
24582 (vcondu<mode><v_cmp_result>): Likewise.
24583 (vcond_mask_<mode><v_cmp_result>): Likewise.
24584 * config/arm/unspecs.md (VCMPNEQ_U, VCMPNEQ_S, VCMPEQQ_S)
24585 (VCMPEQQ_N_S, VCMPNEQ_N_S, VCMPEQQ_U, CMPEQQ_N_U, VCMPNEQ_N_U)
24586 (VCMPGEQ_N_S, VCMPGEQ_S, VCMPGTQ_N_S, VCMPGTQ_S, VCMPLEQ_N_S)
24587 (VCMPLEQ_S, VCMPLTQ_N_S, VCMPLTQ_S, VCMPCSQ_N_U, VCMPCSQ_U)
24588 (VCMPHIQ_N_U, VCMPHIQ_U): Remove.
24589 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result): Moved
24591 (vec_cmpu<mode><mode>): Likewise.
24592 (vcond<mode><mode>): Likewise.
24593 (vcond<V_cvtto><mode>): Likewise.
24594 (vcondu<mode><v_cmp_result>): Likewise.
24595 (vcond_mask_<mode><v_cmp_result>): Likewise. Added unsafe math
24598 2021-05-17 liuhongt <hongtao.liu@intel.com>
24601 * config/i386/i386.c (ix86_gimple_fold_builtin): Use
24602 gsi_insert_seq_before instead.
24604 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24606 * doc/sourcebuild.texi (arm_qbit_ok): Rename into...
24607 (arm_sat_ok): ...this.
24609 2021-05-17 Martin Liska <mliska@suse.cz>
24611 * lto-wrapper.c (merge_flto_options): Factor out a new function.
24612 (merge_and_complain): Use it.
24613 (run_gcc): Merge also linker command line -flto=foo argument
24616 2021-05-16 Christophe Lyon <christophe.lyon@linaro.org>
24618 * config/arm/arm.h (CPP_SPEC): Remove error message about
24619 -mlittle-endian/-mbig-endian conflict.
24621 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24623 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
24624 __ROP_PROTECT__ if -mrop-protect is selected.
24626 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24628 * config/rs6000/rs6000-internal.h (rs6000_stack): Add
24629 rop_hash_save_offset and rop_hash_size.
24630 * config/rs6000/rs6000-logue.c (rs6000_stack_info): Compute
24631 rop_hash_size and rop_hash_save_offset.
24632 (debug_stack_info): Dump rop_hash_save_offset and rop_hash_size.
24633 (rs6000_emit_prologue): Emit hashst[p] in prologue.
24634 (rs6000_emit_epilogue): Emit hashchk[p] in epilogue.
24635 * config/rs6000/rs6000.md (unspec): Add UNSPEC_HASHST and
24637 (hashst): New define_insn.
24638 (hashchk): Likewise.
24640 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24642 * config/rs6000/rs6000.c (rs6000_option_override_internal):
24643 Disable shrink wrap when inserting ROP-protect instructions.
24644 * config/rs6000/rs6000.opt (mrop-protect): New option.
24645 (mprivileged): Likewise.
24646 * doc/invoke.texi: Document mrop-protect and mprivileged.
24648 2021-05-15 Hans-Peter Nilsson <hp@axis.com>
24650 * reorg.c (fill_slots_from_thread): Reinstate code typoed out in
24653 2021-05-15 Martin Jambor <mjambor@suse.cz>
24656 2021-05-13 Martin Jambor <mjambor@suse.cz>
24658 PR tree-optimization/100453
24659 * tree-sra.c (sra_modify_assign): All const base accesses do not
24660 need refreshing, not just those from decl_pool.
24661 (sra_modify_assign): Do not refresh into a const base decl.
24663 2021-05-15 Jakub Jelinek <jakub@redhat.com>
24665 PR rtl-optimization/100342
24666 * regcprop.c (copy_value): When copying a source reg in a wider
24667 mode than it has recorded for the value, adjust recorded destination
24668 mode too or punt if !REG_CAN_CHANGE_MODE_P.
24670 2021-05-14 Jason Merrill <jason@redhat.com>
24672 * intl.h: Add comments.
24674 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24676 * config/aarch64/aarch64-simd.md
24677 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>_internal): Split into...
24678 (aarch64_sqdmlsl2_lane<mode>_internal): ... This...
24679 (aarch64_sqdmlal2_lane<mode>_internal): ... And this.
24680 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>_internal): Split into ...
24681 (aarch64_sqdmlsl2_laneq<mode>_internal): ... This...
24682 (aarch64_sqdmlal2_laneq<mode>_internal): ... And this.
24683 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>_internal): Split into...
24684 (aarch64_sqdmlsl2_n<mode>_internal): ... This...
24685 (aarch64_sqdmlal2_n<mode>_internal): ... And this.
24687 2021-05-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
24690 * config/arm/arm_neon.h (vtst_s8): Replace call to vtst builtin with it's
24691 boolean logic equivalent.
24692 (vtst_s16): Likewise.
24693 (vtst_s32): Likewise.
24694 (vtst_u8): Likewise.
24695 (vtst_u16): Likewise.
24696 (vtst_u32): Likewise.
24697 (vtst_p8): Likewise.
24698 (vtst_p16): Likewise.
24699 (vtstq_s8): Likewise.
24700 (vtstq_s16): Likewise.
24701 (vtstq_s32): Likewise.
24702 (vtstq_u8): Likewise.
24703 (vtstq_u16): Likewise.
24704 (vtstq_u32): Likewise.
24705 (vtstq_p8): Likewise.
24706 (vtstq_p16): Likewise.
24707 * config/arm/arm_neon_builtins.def: Remove entry for vtst.
24708 * config/arm/neon.md (neon_vtst<mode>): Remove pattern.
24710 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24712 * config/aarch64/aarch64-simd.md (aarch64_sqdmlal2<mode>): Merge into...
24713 (aarch64_sqdml<SBINQOPS:as>l2<mode>): ... This.
24714 (aarch64_sqdmlsl2<mode>): Delete.
24715 (aarch64_sqdmlal2_lane<mode>): Merge this...
24716 (aarch64_sqdmlsl2_lane<mode>): ... And this...
24717 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>): ... Into this.
24718 (aarch64_sqdmlal2_laneq<mode>): Merge this...
24719 (aarch64_sqdmlsl2_laneq<mode>): ... And this...
24720 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>): ... Into this.
24721 (aarch64_sqdmlal2_n<mode>): Merge this...
24722 (aarch64_sqdmlsl2_n<mode>): ... And this...
24723 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>): ... Into this.
24725 2021-05-13 Martin Sebor <msebor@redhat.com>
24727 PR middle-end/100574
24728 * builtins.c (access_ref::get_ref): Improve detection of PHIs with
24729 all null arguments.
24731 2021-05-13 Martin Sebor <msebor@redhat.com>
24733 PR tree-optimization/93100
24734 PR middle-end/98583
24735 * tree-ssa-uninit.c (check_defs): Exclude intrinsic functions that
24736 don't modify referenced objects.
24738 2021-05-13 Martin Jambor <mjambor@suse.cz>
24740 PR tree-optimization/100453
24741 * tree-sra.c (sra_modify_assign): All const base accesses do not
24742 need refreshing, not just those from decl_pool.
24743 (sra_modify_assign): Do not refresh into a const base decl.
24745 2021-05-13 Martin Liska <mliska@suse.cz>
24747 * tree-ssa-dom.c: Remove m_simplifier.
24749 2021-05-13 Richard Earnshaw <rearnsha@arm.com>
24752 * config/arm/arm.c (arm_canonicalize_comparison): Correctly
24753 canonicalize DImode inequality comparisons against the
24754 maximum integral value.
24756 2021-05-13 Jakub Jelinek <jakub@redhat.com>
24758 PR tree-optimization/98856
24759 * config/i386/i386.c (ix86_shift_rotate_cost): Add CODE argument.
24760 Expect V2DI and V4DI arithmetic right shifts to be emulated.
24761 (ix86_rtx_costs, ix86_add_stmt_cost): Adjust ix86_shift_rotate_cost
24763 * config/i386/i386-expand.c (expand_vec_perm_2perm_interleave,
24764 expand_vec_perm_2perm_pblendv): New functions.
24765 (ix86_expand_vec_perm_const_1): Use them.
24766 * config/i386/sse.md (ashr<mode>3<mask_name>): Rename to ...
24767 (<mask_codefor>ashr<mode>3<mask_name>): ... this.
24768 (ashr<mode>3): New define_expand with VI248_AVX512BW iterator.
24769 (ashrv4di3): New define_expand.
24770 (ashrv2di3): Change condition to TARGET_SSE2, handle !TARGET_XOP
24771 and !TARGET_AVX512VL expansion.
24773 2021-05-13 Uroš Bizjak <ubizjak@gmail.com>
24776 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Force mode
24777 sizes < 16 to a register when constructing vpcmov pattern.
24778 * config/i386/mmx.md (*xop_pcmov_<mode>): Use MMXMODE124 mode.
24780 2021-05-13 Martin Liska <mliska@suse.cz>
24782 * gcov-io.c (gcov_write_block): Remove.
24783 (gcov_write_words): Likewise.
24784 (gcov_read_words): Re-implement using gcov_read_bytes.
24785 (gcov_allocate): Remove.
24786 (GCOV_BLOCK_SIZE): Likewise.
24787 (struct gcov_var): Remove most of the fields.
24788 (gcov_position): Implement with ftell.
24789 (gcov_rewrite): Remove setting of start and offset fields.
24790 (from_file): Re-format.
24791 (gcov_open): Remove setbuf call. It should not be needed.
24792 (gcov_close): Remove internal buffer handling.
24793 (gcov_magic): Use __builtin_bswap32.
24794 (gcov_write_counter): Use directly gcov_write_unsigned.
24795 (gcov_write_string): Use direct fwrite and do not round
24797 (gcov_seek): Use directly fseek.
24798 (gcov_write_tag): Use gcov_write_unsigned directly.
24799 (gcov_write_length): Likewise.
24800 (gcov_write_tag_length): Likewise.
24801 (gcov_read_bytes): Use directly fread.
24802 (gcov_read_unsigned): Use gcov_read_words.
24803 (gcov_read_counter): Likewise.
24804 (gcov_read_string): Use gcov_read_bytes.
24805 * gcov-io.h (GCOV_WORD_SIZE): Adjust to reflect
24806 that size is not in bytes, but words (4B).
24807 (GCOV_TAG_FUNCTION_LENGTH): Likewise.
24808 (GCOV_TAG_ARCS_LENGTH): Likewise.
24809 (GCOV_TAG_ARCS_NUM): Likewise.
24810 (GCOV_TAG_COUNTER_LENGTH): Likewise.
24811 (GCOV_TAG_COUNTER_NUM): Likewise.
24812 (GCOV_TAG_SUMMARY_LENGTH): Likewise.
24814 2021-05-13 liuhongt <hongtao.liu@intel.com>
24817 * config/i386/sse.md (ssedoublevecmode): Add attribute for
24818 V64QI/V32HI/V16SI/V4DI.
24819 (ssehalfvecmode): Add attribute for V2DI/V2DF.
24820 (*vec_concatv4si_0): Extend to VI124_128.
24821 (*vec_concat<mode>_0): New pre-reload splitter.
24822 * config/i386/predicates.md (movq_parallel): New predicate.
24824 2021-05-13 Alexandre Oliva <oliva@adacore.com>
24826 * targhooks.c (default_zero_call_used_regs): Retry using
24827 successfully-zeroed registers as sources.
24829 2021-05-12 Tobias Burnus <tobias@codesourcery.com>
24831 * omp-low.c (finish_taskreg_scan): Use the proper detach decl.
24833 2021-05-12 Aldy Hernandez <aldyh@redhat.com>
24836 * gimple-range.cc (range_of_builtin_call): Skip out on
24837 processing __builtin_clz when varying.
24839 2021-05-12 Tom de Vries <tdevries@suse.de>
24842 * config/nvptx/nvptx-opts.h (enum ptx_version): New enum.
24843 * config/nvptx/nvptx.c (nvptx_file_start): Print .version according
24844 to ptx_version_option.
24845 * config/nvptx/nvptx.h (TARGET_PTX_6_3): Define.
24846 * config/nvptx/nvptx.md (define_insn "nvptx_shuffle<mode>")
24847 (define_insn "nvptx_vote_ballot"): Use sync variant for
24849 * config/nvptx/nvptx.opt (ptx_version): Add enum.
24850 (mptx): Add option.
24851 * doc/invoke.texi (Nvidia PTX Options): Add mptx item.
24853 2021-05-12 Richard Biener <rguenther@suse.de>
24855 PR tree-optimization/100566
24856 * tree-ssa-sccvn.c (dominated_by_p_w_unex): Properly handle
24857 allow_back for all edge queries.
24859 2021-05-12 liuhongt <hongtao.liu@intel.com>
24862 * config/i386/sse.md (<sse4_1_avx2>_pblendvb): Add
24863 splitters for pblendvb of NOT mask register.
24865 2021-05-12 Richard Biener <rguenther@suse.de>
24867 PR tree-optimization/100519
24868 * tree-ssa-reassoc.c (can_associate_p): Split into...
24869 (can_associate_op_p): ... this
24870 (can_associate_type_p): ... and this.
24871 (is_reassociable_op): Call can_associate_op_p.
24872 (break_up_subtract_bb): Call the appropriate predicates.
24873 (reassociate_bb): Likewise.
24875 2021-05-12 Martin Liska <mliska@suse.cz>
24877 * lto-wrapper.c (merge_and_complain): Merge -flto=arg options.
24878 (run_gcc): Use -flto argument detection for merged
24881 2021-05-12 Martin Liska <mliska@suse.cz>
24883 * lto-wrapper.c (print_lto_docs_link): New function.
24884 (run_gcc): Print warning about missing job server detection
24885 after we know NR of partitions. Do the same for -flto{,=1}.
24886 * opts.c (get_option_html_page): Support -flto option.
24888 2021-05-12 Martin Liska <mliska@suse.cz>
24890 * lto-wrapper.c (get_options_from_collect_gcc_options): Change
24892 (append_option): Remove.
24893 (find_option): Rework to use the vector type.
24894 (remove_option): Remove.
24895 (merge_and_complain): Use vectors for cl_decoded_option data
24897 (append_compiler_options): Likewise.
24898 (append_diag_options): Likewise.
24899 (append_linker_options): Likewise.
24900 (append_offload_options): Likewise.
24901 (compile_offload_image): Likewise.
24902 (compile_images_for_offload_targets): Likewise.
24903 (find_and_merge_options): Likewise.
24904 (run_gcc): Likewise.
24906 2021-05-12 Bernd Edlinger <bernd.edlinger@hotmail.de>
24909 * dwarf2out.c (dwarf2out_finish): Set
24910 have_multiple_function_sections with multi-range text_section.
24912 2021-05-12 Martin Liska <mliska@suse.cz>
24914 PR bootstrap/100560
24915 * Makefile.in: Remove version.h from linker command line.
24917 2021-05-12 Richard Biener <rguenther@suse.de>
24919 PR middle-end/100547
24920 * rtl.h (rtvec_alloc): Make argument size_t.
24921 * rtl.c (rtvec_alloc): Verify the count is less than INT_MAX.
24923 2021-05-12 Jakub Jelinek <jakub@redhat.com>
24925 PR middle-end/100508
24926 * cfgexpand.c (expand_debug_expr): For DEBUG_EXPR_DECL with vector
24927 type, don't reuse DECL_RTL if it has different mode, instead force
24928 creation of a new DEBUG_EXPR.
24930 2021-05-12 Jakub Jelinek <jakub@redhat.com>
24931 Marc Glisse <marc.glisse@inria.fr>
24933 PR tree-optimization/94589
24934 * match.pd ((X & Y) == X -> (X & ~Y) == 0,
24935 (X | Y) == Y -> (X & ~Y) == 0): New GIMPLE simplifications.
24937 2021-05-12 Uroš Bizjak <ubizjak@gmail.com>
24940 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Handle V2SF mode.
24941 * config/i386/mmx.md (MMXMODE124): New mode iterator.
24943 (mmxintvecmode): New mode attribute.
24944 (mmxintvecmodelower): Ditto.
24945 (*mmx_maskcmpv2sf3_comm): New insn pattern.
24946 (*mmx_maskcmpv2sf3): Ditto.
24947 (vec_cmpv2sfv2si): New expander.
24948 (vcond<V2FI:mode>v2si): Ditto.
24949 (mmx_vlendvps): New insn pattern.
24950 (vcond<MMXMODE124:mode><MMXMODEI:mode>): Also handle V2SFmode.
24951 (vcondu<MMXMODE124:mode><MMXMODEI:mode>): Ditto.
24952 (vcond_mask_<mode><mmxintvecmodelower>): Ditto.
24954 2021-05-11 Martin Sebor <msebor@redhat.com>
24956 PR middle-end/21433
24957 * expr.c (expand_expr_real_1): Replace unreachable code with an assert.
24959 2021-05-11 Richard Biener <rguenther@suse.de>
24961 * gimple-fold.c (gimple_fold_call): Do not call
24962 maybe_fold_reference on call arguments or the static chain.
24963 (fold_stmt_1): Do not call maybe_fold_reference on GIMPLE_ASM
24966 2021-05-11 Martin Liska <mliska@suse.cz>
24968 * builtins.def (DEF_HSAIL_BUILTIN): Remove.
24969 (DEF_HSAIL_ATOMIC_BUILTIN): Likewise.
24970 (DEF_HSAIL_SAT_BUILTIN): Likewise.
24971 (DEF_HSAIL_INTR_BUILTIN): Likewise.
24972 (DEF_HSAIL_CVT_ZEROI_SAT_BUILTIN): Likewise.
24973 * doc/frontends.texi: Remove BRIG.
24974 * doc/install.texi: Likewise.
24975 * doc/invoke.texi: Likewise.
24976 * doc/standards.texi: Likewise.
24977 * brig-builtins.def: Removed.
24978 * brig/ChangeLog: Removed.
24979 * brig/Make-lang.in: Removed.
24980 * brig/brig-builtins.h: Removed.
24981 * brig/brig-c.h: Removed.
24982 * brig/brig-lang.c: Removed.
24983 * brig/brigfrontend/brig-arg-block-handler.cc: Removed.
24984 * brig/brigfrontend/brig-atomic-inst-handler.cc: Removed.
24985 * brig/brigfrontend/brig-basic-inst-handler.cc: Removed.
24986 * brig/brigfrontend/brig-branch-inst-handler.cc: Removed.
24987 * brig/brigfrontend/brig-cmp-inst-handler.cc: Removed.
24988 * brig/brigfrontend/brig-code-entry-handler.cc: Removed.
24989 * brig/brigfrontend/brig-code-entry-handler.h: Removed.
24990 * brig/brigfrontend/brig-comment-handler.cc: Removed.
24991 * brig/brigfrontend/brig-control-handler.cc: Removed.
24992 * brig/brigfrontend/brig-copy-move-inst-handler.cc: Removed.
24993 * brig/brigfrontend/brig-cvt-inst-handler.cc: Removed.
24994 * brig/brigfrontend/brig-fbarrier-handler.cc: Removed.
24995 * brig/brigfrontend/brig-function-handler.cc: Removed.
24996 * brig/brigfrontend/brig-function.cc: Removed.
24997 * brig/brigfrontend/brig-function.h: Removed.
24998 * brig/brigfrontend/brig-inst-mod-handler.cc: Removed.
24999 * brig/brigfrontend/brig-label-handler.cc: Removed.
25000 * brig/brigfrontend/brig-lane-inst-handler.cc: Removed.
25001 * brig/brigfrontend/brig-machine.c: Removed.
25002 * brig/brigfrontend/brig-machine.h: Removed.
25003 * brig/brigfrontend/brig-mem-inst-handler.cc: Removed.
25004 * brig/brigfrontend/brig-module-handler.cc: Removed.
25005 * brig/brigfrontend/brig-queue-inst-handler.cc: Removed.
25006 * brig/brigfrontend/brig-seg-inst-handler.cc: Removed.
25007 * brig/brigfrontend/brig-signal-inst-handler.cc: Removed.
25008 * brig/brigfrontend/brig-to-generic.cc: Removed.
25009 * brig/brigfrontend/brig-to-generic.h: Removed.
25010 * brig/brigfrontend/brig-util.cc: Removed.
25011 * brig/brigfrontend/brig-util.h: Removed.
25012 * brig/brigfrontend/brig-variable-handler.cc: Removed.
25013 * brig/brigfrontend/hsa-brig-format.h: Removed.
25014 * brig/brigfrontend/phsa.h: Removed.
25015 * brig/brigspec.c: Removed.
25016 * brig/config-lang.in: Removed.
25017 * brig/gccbrig.texi: Removed.
25018 * brig/lang-specs.h: Removed.
25019 * brig/lang.opt: Removed.
25021 2021-05-11 Richard Biener <rguenther@suse.de>
25024 * ipa-param-manipulation.c
25025 (ipa_param_body_adjustments::modify_call_stmt): Avoid
25026 altering SSA_NAME_DEF_STMT by adjusting the calls LHS
25027 via gimple_call_lhs_ptr.
25029 2021-05-11 Alex Coplan <alex.coplan@arm.com>
25032 * config/arm/arm.c (cmse_nonsecure_call_inline_register_clear):
25033 Avoid emitting CFA adjusts on the sp if we have the fp.
25035 2021-05-11 Richard Sandiford <richard.sandiford@arm.com>
25037 * config/aarch64/iterators.md (VMUL_CHANGE_NLANES): Delete.
25038 (VMULD): New iterator.
25039 (VCOND): Handle V4HF and V8HF.
25040 (VCONQ): Fix entry for V2SF.
25041 * config/aarch64/aarch64-simd.md (mul_lane<mode>3): Use VMULD
25042 instead of VMUL. Use a 64-bit vector mode for the indexed operand.
25043 (*aarch64_mul3_elt_<vswap_width_name><mode>): Merge with...
25044 (mul_laneq<mode>3): ...this define_insn. Use VMUL instead of VDQSF.
25045 Use a 128-bit vector mode for the indexed operand. Use stype for
25046 the scheduling type.
25048 2021-05-11 Richard Biener <rguenther@suse.de>
25050 * gimple-fold.c (maybe_fold_reference): Only return
25051 is_gimple_min_invariant values.
25053 2021-05-11 Richard Biener <rguenther@suse.de>
25055 PR middle-end/100509
25056 * gimple-fold.c (fold_gimple_assign): Only call
25057 get_symbol_constant_value on register type symbols.
25059 2021-05-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
25060 Joe Ramsay <joe.ramsay@arm.com>
25063 * config/arm/arm_mve.h (__arm_vstrwq_scatter_offset): Fix wrong arguments.
25064 (__arm_vcmpneq): Remove duplicate definition.
25065 (__arm_vstrwq_scatter_offset_p): Likewise.
25066 (__arm_vmaxq_x): Likewise.
25067 (__arm_vmlsdavaq): Likewise.
25068 (__arm_vmlsdavaxq): Likewise.
25069 (__arm_vmlsdavq_p): Likewise.
25070 (__arm_vmlsdavxq_p): Likewise.
25071 (__arm_vrmlaldavhaq): Likewise.
25072 (__arm_vstrbq_p): Likewise.
25073 (__arm_vstrbq_scatter_offset): Likewise.
25074 (__arm_vstrbq_scatter_offset_p): Likewise.
25075 (__arm_vstrdq_scatter_offset): Likewise.
25076 (__arm_vstrdq_scatter_offset_p): Likewise.
25077 (__arm_vstrdq_scatter_shifted_offset): Likewise.
25078 (__arm_vstrdq_scatter_shifted_offset_p): Likewise.
25080 2021-05-11 Jakub Jelinek <jakub@redhat.com>
25082 PR middle-end/100471
25083 * omp-low.c (lower_omp_task_reductions): For OMP_TASKLOOP, if data
25084 is 0, bypass the reduction loop including
25085 GOMP_taskgroup_reduction_unregister call.
25087 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
25089 * config/rs6000/rs6000.c (struct rs6000_cost_data): New member
25090 costing_for_scalar.
25091 (rs6000_density_test): Early return if costing_for_scalar is true.
25092 (rs6000_init_cost): Init costing_for_scalar of rs6000_cost_data.
25094 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
25096 * doc/tm.texi: Regenerated.
25097 * target.def (init_cost): Add new parameter costing_for_scalar.
25098 * targhooks.c (default_init_cost): Adjust for new parameter.
25099 * targhooks.h (default_init_cost): Likewise.
25100 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Likewise.
25101 (vect_compute_single_scalar_iteration_cost): Likewise.
25102 (vect_analyze_loop_2): Likewise.
25103 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Likewise.
25104 (vect_bb_vectorization_profitable_p): Likewise.
25105 * tree-vectorizer.h (init_cost): Likewise.
25106 * config/aarch64/aarch64.c (aarch64_init_cost): Likewise.
25107 * config/i386/i386.c (ix86_init_cost): Likewise.
25108 * config/rs6000/rs6000.c (rs6000_init_cost): Likewise.
25110 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
25112 * config/rs6000/rs6000.c (rs6000_vect_nonmem): Renamed to
25113 vect_nonmem and moved into...
25114 (struct rs6000_cost_data): ...here.
25115 (rs6000_init_cost): Use vect_nonmem of cost_data instead.
25116 (rs6000_add_stmt_cost): Likewise.
25117 (rs6000_finish_cost): Likewise.
25119 2021-05-10 Eric Botcazou <ebotcazou@adacore.com>
25121 * range-op.cc (get_bool_state): Adjust head comment.
25122 (operator_not_equal::op1_range): Fix comment.
25123 (operator_bitwise_xor::op1_range): Remove call to gcc_unreachable.
25125 2021-05-10 Martin Sebor <msebor@redhat.com>
25127 PR middle-end/100425
25128 PR middle-end/100510
25129 * gimple-ssa-warn-alloca.c (pass_walloca::firast_time_p): Rename...
25130 (pass_walloca::xlimit_certain_p): ...to this.
25131 (pass_walloca::gate): Execute for any kind of handled warning.
25132 (pass_walloca::execute): Avoid issuing "maybe" and "unbounded"
25133 warnings when xlimit_certain_p is set.
25135 2021-05-10 Pat Haugen <pthaugen@linux.ibm.com>
25137 * config/rs6000/rs6000.c (rs6000_ira_change_pseudo_allocno_class):
25138 Return ALTIVEC_REGS if that is best_class.
25139 (rs6000_compute_pressure_classes): Add ALTIVEC_REGS.
25141 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25143 * config/arm/arm.h (CPP_SPEC): Remove error message about
25146 2021-05-10 Martin Jambor <mjambor@suse.cz>
25148 * ipa-prop.h (IPA_NODE_REF): Removed.
25149 (IPA_NODE_REF_GET_CREATE): Likewise.
25150 (IPA_EDGE_REF): Likewise.
25151 (IPA_EDGE_REF_GET_CREATE): Likewise.
25152 (IS_VALID_JUMP_FUNC_INDEX): Likewise.
25153 * ipa-cp.c (print_all_lattices): Replaced IPA_NODE_REF with a direct
25154 use of ipa_node_params_sum.
25155 (ipcp_versionable_function_p): Likewise.
25156 (push_node_to_stack): Likewise.
25157 (pop_node_from_stack): Likewise.
25158 (set_single_call_flag): Replaced two IPA_NODE_REF with one single
25159 direct use of ipa_node_params_sum.
25160 (initialize_node_lattices): Replaced IPA_NODE_REF with a direct use of
25161 ipa_node_params_sum.
25162 (ipa_context_from_jfunc): Replaced IPA_EDGE_REF with a direct use of
25164 (ipcp_verify_propagated_values): Replaced IPA_NODE_REF with a direct
25165 use of ipa_node_params_sum.
25166 (self_recursively_generated_p): Likewise.
25167 (propagate_scalar_across_jump_function): Likewise.
25168 (propagate_context_across_jump_function): Replaced IPA_EDGE_REF with a
25169 direct use of ipa_edge_args_sum, moved the lookup after the early
25170 exit. Replaced IPA_NODE_REF with a direct use of ipa_node_params_sum.
25171 (propagate_bits_across_jump_function): Replaced IPA_NODE_REF with
25172 direct uses of ipa_node_params_sum.
25173 (propagate_vr_across_jump_function): Likewise.
25174 (propagate_aggregate_lattice): Likewise.
25175 (propagate_aggs_across_jump_function): Likewise.
25176 (propagate_constants_across_call): Likewise, also replaced
25177 IPA_EDGE_REF with a direct use of ipa_edge_args_sum.
25178 (good_cloning_opportunity_p): Replaced IPA_NODE_REF with a direct use
25179 of ipa_node_params_sum.
25180 (estimate_local_effects): Likewise.
25181 (add_all_node_vals_to_toposort): Likewise.
25182 (propagate_constants_topo): Likewise.
25183 (ipcp_propagate_stage): Likewise.
25184 (ipcp_discover_new_direct_edges): Likewise.
25185 (calls_same_node_or_its_all_contexts_clone_p): Likewise.
25186 (cgraph_edge_brings_value_p): Likewise (in both overloaded functions).
25187 (get_info_about_necessary_edges): Likewise.
25188 (want_remove_some_param_p): Likewise.
25189 (create_specialized_node): Likewise.
25190 (self_recursive_pass_through_p): Likewise.
25191 (self_recursive_agg_pass_through_p): Likewise.
25192 (find_more_scalar_values_for_callers_subset): Likewise and also
25193 replaced IPA_EDGE_REF with direct uses of ipa_edge_args_sum, in one
25194 case replacing two of those with a single query.
25195 (find_more_contexts_for_caller_subset): Likewise for the
25196 ipa_polymorphic_call_context overload.
25197 (intersect_aggregates_with_edge): Replaced IPA_EDGE_REF with a direct
25198 use of ipa_edge_args_sum. Replaced IPA_NODE_REF with direct uses of
25199 ipa_node_params_sum.
25200 (find_aggregate_values_for_callers_subset): Likewise, also reusing
25201 results of ipa_edge_args_sum->get.
25202 (cgraph_edge_brings_all_scalars_for_node): Replaced IPA_NODE_REF with
25203 direct uses of ipa_node_params_sum, replaced IPA_EDGE_REF with a
25204 direct use of ipa_edge_args_sum.
25205 (cgraph_edge_brings_all_agg_vals_for_node): Likewise, moved node
25206 summary query after the early exit and reused the result later.
25207 (decide_about_value): Replaced IPA_NODE_REF with a direct use of
25208 ipa_node_params_sum.
25209 (decide_whether_version_node): Likewise. Removed re-querying for
25210 summaries after cloning.
25211 (spread_undeadness): Replaced IPA_NODE_REF with a direct use of
25212 ipa_node_params_sum.
25213 (has_undead_caller_from_outside_scc_p): Likewise, reusing results of
25215 (identify_dead_nodes): Likewise.
25216 (ipcp_store_bits_results): Replaced IPA_NODE_REF with direct uses of
25217 ipa_node_params_sum.
25218 (ipcp_store_vr_results): Likewise.
25219 * ipa-fnsummary.c (evaluate_properties_for_edge): Likewise.
25220 (ipa_fn_summary_t::duplicate): Likewise.
25221 (analyze_function_body): Likewise.
25222 (estimate_calls_size_and_time): Likewise.
25223 (ipa_cached_call_context::duplicate_from): Likewise.
25224 (ipa_call_context::equal_to): Likewise.
25225 (remap_edge_params): Likewise.
25226 (ipa_merge_fn_summary_after_inlining): Likewise.
25227 (inline_read_section): Likewise.
25228 * ipa-icf.c (sem_function::param_used_p): Likewise.
25229 * ipa-modref.c (compute_parm_map): Likewise.
25230 (compute_parm_map): Replaced IPA_EDGE_REF with a direct use of
25232 (get_access_for_fnspec): Replaced IPA_NODE_REF with a direct use of
25233 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
25235 * ipa-profile.c (check_argument_count): Likewise.
25236 * ipa-prop.c (ipa_alloc_node_params): Replaced IPA_NODE_REF_GET_CREATE
25237 with a direct use of ipa_node_params_sum.
25238 (ipa_initialize_node_params): Likewise.
25239 (ipa_print_node_jump_functions_for_edge): Replaced IPA_EDGE_REF with a
25240 direct use of ipa_edge_args_sum and reused the query result.
25241 (ipa_compute_jump_functions_for_edge): Replaced IPA_NODE_REF with a
25242 direct use of ipa_node_params_sum and replaced IPA_EDGE_REF with a
25243 direct use of ipa_edge_args_sum.
25244 (ipa_note_param_call): Replaced IPA_NODE_REF with a direct use of
25245 ipa_node_params_sum and reused the result of the query.
25246 (ipa_analyze_node): Likewise.
25247 (ipa_analyze_controlled_uses): Replaced IPA_NODE_REF with a direct use
25248 of ipa_node_params_sum.
25249 (update_jump_functions_after_inlining): Replaced IPA_EDGE_REF with
25250 direct uses of ipa_edge_args_sum.
25251 (update_indirect_edges_after_inlining): Replaced IPA_NODE_REF with
25252 direct uses of ipa_node_params_sum and replaced IPA_EDGE_REF with a
25253 direct use of ipa_edge_args_sum. Removed superficial re-querying the
25255 (propagate_controlled_uses): Replaced IPA_NODE_REF with direct uses of
25256 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
25258 (ipa_propagate_indirect_call_infos): Replaced IPA_EDGE_REF with a
25259 direct use of ipa_edge_args_sum.
25260 (ipa_edge_args_sum_t::duplicate): Replaced IPA_NODE_REF with a direct
25261 use of ipa_node_params_sum.
25262 (ipa_print_node_params): Likewise.
25263 (ipa_write_node_info): Likewise and also replaced IPA_EDGE_REF with
25264 direct uses of ipa_edge_args_sum.
25265 (ipa_read_edge_info): Replaced IPA_EDGE_REF with a direct use of
25267 (ipa_read_node_info): Replaced IPA_NODE_REF with a direct use of
25268 ipa_node_params_sum.
25269 (ipa_prop_write_jump_functions): Likewise. Move variable node to the
25270 scopes where it is used.
25272 2021-05-10 Uroš Bizjak <ubizjak@gmail.com>
25274 * config/i386/i386-expand.c (ix86_expand_sse_movcc)
25275 <case E_V2SImode>: Force op_true to register.
25277 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25279 * config/arm/iterators.md (MVE_FP_COMPARISONS): New.
25280 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_f<mode>)
25281 (mve_vcmp<mve_cmp_op>q_n_f<mode>): New, merge all vcmp_*f*
25283 (mve_vcmpeqq_f<mode>, mve_vcmpeqq_n_f<mode>, mve_vcmpgeq_f<mode>)
25284 (mve_vcmpgeq_n_f<mode>, mve_vcmpgtq_f<mode>)
25285 (mve_vcmpgtq_n_f<mode>, mve_vcmpleq_f<mode>)
25286 (mve_vcmpleq_n_f<mode>, mve_vcmpltq_f<mode>)
25287 (mve_vcmpltq_n_f<mode>, mve_vcmpneq_f<mode>)
25288 (mve_vcmpneq_n_f<mode>): Remove.
25289 * config/arm/unspecs.md (VCMPEQQ_F, VCMPEQQ_N_F, VCMPGEQ_F)
25290 (VCMPGEQ_N_F, VCMPGTQ_F, VCMPGTQ_N_F, VCMPLEQ_F, VCMPLEQ_N_F)
25291 (VCMPLTQ_F, VCMPLTQ_N_F, VCMPNEQ_F, VCMPNEQ_N_F): Remove.
25293 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25295 * config/arm/iterators.md (MVE_COMPARISONS): New.
25297 (mve_cmp_type): New.
25298 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_<mode>): New, merge all
25300 (mve_vcmpneq_<mode>, mve_vcmpcsq_n_<mode>, mve_vcmpcsq_<mode>)
25301 (mve_vcmpeqq_n_<mode>, mve_vcmpeqq_<mode>, mve_vcmpgeq_n_<mode>)
25302 (mve_vcmpgeq_<mode>, mve_vcmpgtq_n_<mode>, mve_vcmpgtq_<mode>)
25303 (mve_vcmphiq_n_<mode>, mve_vcmphiq_<mode>, mve_vcmpleq_n_<mode>)
25304 (mve_vcmpleq_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
25305 (mve_vcmpneq_n_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
25306 (mve_vcmpneq_n_<mode>): Remove.
25308 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25310 * config/arm/arm_mve.h (__arm_vcmp*): Remove 's' suffix.
25311 * config/arm/arm_mve_builtins.def (vcmp*): Remove 's' suffix.
25312 * config/arm/mve.md (mve_vcmp*): Remove 's' suffix in pattern
25315 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25317 * config/arm/arm_mve_builtins.def (vcmpneq_u): Remove.
25318 (vcmpneq_n_u): Likewise.
25319 (vcmpeqq_u,): Likewise.
25320 (vcmpeqq_n_u): Likewise.
25321 * config/arm/iterators.md (supf): Remove VCMPNEQ_U, VCMPEQQ_U,
25322 VCMPEQQ_N_U and VCMPNEQ_N_U.
25323 * config/arm/mve.md (mve_vcmpneq): Remove <supf> iteration.
25324 (mve_vcmpeqq_n): Likewise.
25325 (mve_vcmpeqq): Likewise.
25326 (mve_vcmpneq_n): Likewise.
25328 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25330 * config/arm/arm_mve.h (__arm_vcmpeq*u*, __arm_vcmpne*u*): Call
25331 the 's' version of the builtin.
25333 2021-05-10 Richard Biener <rguenther@suse.de>
25335 PR tree-optimization/100492
25336 * tree-loop-distribution.c (find_seed_stmts_for_distribution):
25337 Find nothing when the loop contains an irreducible region.
25339 2021-05-10 Richard Biener <rguenther@suse.de>
25341 PR middle-end/100464
25343 * gimple-fold.c (canonicalize_constructor_val): Do not set
25346 2021-05-10 Richard Biener <rguenther@suse.de>
25348 PR tree-optimization/100434
25349 * tree-ssa-dse.c (initialize_ao_ref_for_dse): Handle
25351 (dse_optimize_stmt): Handle call LHS by dropping the
25352 LHS or the whole call if it doesn't have other
25354 (pass_dse::execute): Adjust.
25356 2021-05-10 Martin Liska <mliska@suse.cz>
25358 * Makefile.in: Add missing genversion rule.
25360 2021-05-10 Alex Coplan <alex.coplan@arm.com>
25363 * config/arm/mve.md (*mve_mov<mode>): Simplify output code. Use
25364 vldrw.u32 and vstrw.32 for V2D[IF]mode loads and stores.
25366 2021-05-10 Martin Liska <mliska@suse.cz>
25368 * builtins.c (is_builtin_name): Use startswith
25369 function instead of strncmp.
25370 * collect2.c (main): Likewise.
25371 (has_lto_section): Likewise.
25372 (scan_libraries): Likewise.
25373 * coverage.c (coverage_checksum_string): Likewise.
25374 (coverage_init): Likewise.
25375 * dwarf2out.c (is_cxx): Likewise.
25376 (gen_compile_unit_die): Likewise.
25377 * gcc-ar.c (main): Likewise.
25378 * gcc.c (init_spec): Likewise.
25379 (read_specs): Likewise.
25380 (execute): Likewise.
25381 (check_live_switch): Likewise.
25382 * genattrtab.c (write_attr_case): Likewise.
25383 (IS_ATTR_GROUP): Likewise.
25384 * gencfn-macros.c (main): Likewise.
25385 * gengtype.c (type_for_name): Likewise.
25386 (gen_rtx_next): Likewise.
25387 (get_file_langdir): Likewise.
25388 (write_local): Likewise.
25389 * genmatch.c (get_operator): Likewise.
25390 (get_operand_type): Likewise.
25391 (expr::gen_transform): Likewise.
25392 * genoutput.c (validate_optab_operands): Likewise.
25393 * incpath.c (add_sysroot_to_chain): Likewise.
25394 * langhooks.c (lang_GNU_C): Likewise.
25395 (lang_GNU_CXX): Likewise.
25396 (lang_GNU_Fortran): Likewise.
25397 (lang_GNU_OBJC): Likewise.
25398 * lto-wrapper.c (run_gcc): Likewise.
25399 * omp-general.c (omp_max_simt_vf): Likewise.
25400 * omp-low.c (omp_runtime_api_call): Likewise.
25401 * opts-common.c (parse_options_from_collect_gcc_options): Likewise.
25402 * read-rtl-function.c (function_reader::read_rtx_operand_r): Likewise.
25403 * real.c (real_from_string): Likewise.
25404 * selftest.c (assert_str_startswith): Likewise.
25405 * timevar.c (timer::validate_phases): Likewise.
25406 * tree.c (get_file_function_name): Likewise.
25407 * ubsan.c (ubsan_use_new_style_p): Likewise.
25408 * varasm.c (default_function_rodata_section): Likewise.
25409 (incorporeal_function_p): Likewise.
25410 (default_section_type_flags): Likewise.
25411 * system.h (startswith): Define startswith.
25413 2021-05-10 Martin Liska <mliska@suse.cz>
25415 * bitmap.h (class auto_bitmap): Remove
25416 __cplusplus >= 201103.
25417 * config/aarch64/aarch64.c: Likewise.
25418 * gimple-ssa-store-merging.c (store_immediate_info::store_immediate_info):
25420 * sbitmap.h: Likewise.
25422 2021-05-10 Martin Liska <mliska@suse.cz>
25424 * Makefile.in: Rename gcov-iov to genversion and depend
25425 on version.h (instead of gcov-iov.h).
25426 * gcov-io.h: Include version.h instread of gcov-iov.h.
25427 * gengtype-state.c (read_state_version): Likewise.
25428 * gcov-iov.c: Moved to...
25429 * genversion.c: ...here.
25430 * lto-streamer.h (LTO_major_version): Define it with
25432 * version.c: Removed.
25433 * version.h: Removed.
25435 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25437 * config/arc/arc.md (UNSPEC_ARC_DMPYWH): Define.
25438 * config/arc/simdext.md (VCT): Add predicates for iterator
25441 (voptab): Likewise.
25442 (vec_widen_<V_US>mult_hi_v4hi): Change pattern predicate.
25443 (<voptab>v2si3): New patterns.
25445 (reduc_plus_scal_v4hi): Likewise.
25446 (reduc_plus_scal_v2si): Likewise.
25447 (vec_duplicatev2si): Likewise.
25448 (vec_duplicatev4hi): Likewise.
25450 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25452 * config/arc/simdext.md: Format and cleanup file.
25454 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25456 * config/arc/simdext.md (movmisalignv2hi): Allow misaligned access
25457 only when munaligned-access option is on.
25458 (movmisalign<mode>): Likewise.
25460 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25462 * common/config/arc/arc-common.c (arc_handle_option): Remove dot
25464 * config/arc/arc.c (arc_reorg): Remove underscore from string.
25466 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25468 * config/arc/arc.h (CLZ_DEFINED_VALUE_AT_ZERO): Define.
25469 (CTZ_DEFINED_VALUE_AT_ZERO): Likewise.
25470 * config/arc/arc.md (clrsbsi2): Cleanup pattern.
25471 (norm_f): Likewise.
25474 (clzsi2): Use fls instruction when available.
25475 (arc_clzsi2): Likewise.
25477 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25479 * config/arc/arc.h (ADDITIONAL_REGISTER_NAMES): Add r26 and r27.
25481 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25483 * doc/extend.texi (__builtin_arc_sr): Swap arguments.
25485 2021-05-10 Bernd Edlinger <bernd.edlinger@hotmail.de>
25487 PR middle-end/100467
25488 * toplev.c (compile_file): Call insn_locations_init before
25489 targetm.asm_out.code_end.
25491 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25494 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25496 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
25498 2021-05-07 Jakub Jelinek <jakub@redhat.com>
25499 Andrew Stubbs <amd@codesourcery.com>
25502 * builtins.c (try_store_by_multiple_pieces): Use force_operand for
25503 emit_move_insn operands.
25505 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
25507 * cfgexpand.c (expand_gimple_basic_block): Do not inherit a current
25508 location for the outgoing edges of an empty block.
25509 * dwarf2out.c (add_subscript_info): Retrieve the bounds and index
25510 type by means of the get_array_descr_info langhook, if it is set and
25511 returns true. Remove obsolete code dealing with unnamed subtypes.
25513 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25515 * gimple-range-cache.cc (ssa_block_ranges): Virtualize.
25516 (sbr_vector): Renamed from ssa_block_cache.
25517 (sbr_vector::sbr_vector): Allocate from obstack abd initialize.
25518 (ssa_block_ranges::~ssa_block_ranges): Remove.
25519 (sbr_vector::set_bb_range): Use varying and undefined cached values.
25520 (ssa_block_ranges::set_bb_varying): Remove.
25521 (sbr_vector::get_bb_range): Adjust assert.
25522 (sbr_vector::bb_range_p): Adjust assert.
25523 (~block_range_cache): No freeing loop required.
25524 (block_range_cache::get_block_ranges): Remove.
25525 (block_range_cache::set_bb_range): Inline get_block_ranges.
25526 (block_range_cache::set_bb_varying): Remove.
25527 * gimple-range-cache.h (set_bb_varying): Remove prototype.
25528 * value-range.h (irange_allocator::get_memory): New.
25530 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25532 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Search
25533 dominator tree is available and requested.
25534 (ranger_cache::ssa_range_in_bb): Don't search dom tree here.
25535 (ranger_cache::fill_block_cache): Don't search dom tree here either.
25536 * gimple-range-cache.h (non_null_deref_p): Add dom_search param.
25538 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25540 * gimple-range.cc (gimple_ranger::range_on_exit): Handle block with
25541 only PHI nodes better.
25543 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25545 * gimple-range-edge.h (gimple_outgoing_range): Rename from
25547 (gcond_edge_range): Export prototype.
25548 * gimple-range-edge.cc (gcond_edge_range): New.
25549 (gimple_outgoing_range::edge_range_p): Use gcond_edge_range.
25550 * gimple-range-gori.h (gori_compute): Use gimple_outgoing_range.
25552 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25554 * gimple-range-edge.cc (outgoing_range::calc_switch_ranges): Compute
25555 default range into a temp and allocate only what is needed.
25557 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25559 * range-op.cc (operator_trunc_mod::wi_fold): x % 0 is UNDEFINED.
25561 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25563 * gimple-range.h (gimple_range_global): Pick up parameter initial
25564 values, and use-before defined locals are UNDEFINED.
25566 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
25568 * doc/extend.texi (scalar_storage_order): Mention effect on pointer
25570 * tree.h (reverse_storage_order_for_component_p): Return false if
25571 the type is a pointer.
25573 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25575 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
25577 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
25580 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
25581 Handle V8QI, V4HI and V2SI modes.
25582 * config/i386/mmx.md (mmx_pblendvb): New insn pattern.
25583 * config/i386/sse.md (unspec): Move UNSPEC_BLENDV ...
25584 * config/i386/i386.md (unspec): ... here.
25586 2021-05-07 Tobias Burnus <tobias@codesourcery.com>
25587 Tom de Vries <tdevries@suse.de>
25589 * omp-low.c (lower_rec_simd_input_clauses): Set max_vf = 1 if
25590 a truth_value_p reduction variable is nonintegral.
25592 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
25595 * config/i386/i386-expand.c (ix86_use_mask_cmp_p):
25596 Return false for mode sizes < 16.
25598 2021-05-07 Jakub Jelinek <jakub@redhat.com>
25601 * config/i386/mmx.md (*xop_pcmov_<mode>): New define_insn.
25603 2021-05-06 Martin Jambor <mjambor@suse.cz>
25605 * ipa-sra.c (ipa_sra_dump_all_summaries): Dump edge summaries even
25606 when there is no function summary.
25607 (ipa_sra_summarize_function): produce edge summaries even when
25610 2021-05-06 Tom Tromey <tom@tromey.com>
25612 * godump.c (string_hash_eq): Remove.
25613 (go_finish): Use htab_eq_string.
25615 2021-05-06 Tom Tromey <tom@tromey.com>
25617 * gengtype-state.c (read_state): Use htab_eq_string.
25618 (string_eq): Remove.
25620 2021-05-06 Tom Tromey <tom@tromey.com>
25622 * gensupport.c (htab_eq_string): Remove.
25624 2021-05-06 Bernd Edlinger <bernd.edlinger@hotmail.de>
25627 * debug.h (gcc_debug_hooks): Add set_ignored_loc function pointer.
25628 * dwarf2out.h (dw_fde_node::ignored_debug): New data item.
25629 * dbxout.c (dbx_debug_hooks, xcoff_debug_hooks): Add dummy
25630 set_ignored_loc callbacks.
25631 * debug.c (do_nothing_debug_hooks): Likewise.
25632 * vmsdbgout.c (vmsdbg_debug_hooks): Likewise.
25633 * dwarf2out.c (text_section_used, cold_text_section_used): Remove.
25634 (in_text_section_p, last_text_label, last_cold_label,
25635 switch_text_ranges, switch_cold_ranges): New data items.
25636 (dwarf2out_note_section_used): Remove.
25637 (dwarf2out_begin_prologue): Set fde->ignored_debug and
25639 (mark_ignored_debug_section): New helper function.
25640 (dwarf2out_end_epilogue, dwarf2out_switch_text_section): Call
25641 mark_ignored_debug_section.
25642 (dwarf2_debug_hooks): Use dwarf2out_set_ignored_loc.
25643 (dwarf2_lineno_debug_hooks): Use dummy for set_ignored_loc.
25644 (size_of_aranges): Adjust formula for multi-part text ranges size.
25645 (output_aranges): Output multi-part text ranges.
25646 (dwarf2out_set_ignored_loc): New callback function.
25647 (dwarf2out_finish): Output multi-part text ranges.
25648 (dwarf2out_c_finalize): Clear new data items.
25649 * final.c (final_start_function_1): Call set_ignored_loc callback.
25650 (final_scan_insn_1): Likewise.
25651 * ggc-page.c (gt_ggc_mx): New helper function.
25652 * stringpool.c (gt_pch_nx): Likewise.
25654 2021-05-06 Richard Biener <rguenther@suse.de>
25656 * timevar.def (TV_TREE_INSERT_PHI_NODES): Remove.
25657 (TV_TREE_SSA_REWRITE_BLOCKS): Likewise.
25658 (TV_TREE_INTO_SSA): New.
25659 * tree-into-ssa.c (insert_phi_nodes): Do not account separately.
25660 (rewrite_blocks): Likewise.
25661 (pass_data_build_ssa): Account to TV_TREE_INTO_SSA.
25663 2021-05-06 Jakub Jelinek <jakub@redhat.com>
25665 * tree-ssa-phiopt.c (value_replacement, minmax_replacement,
25666 abs_replacement, xor_replacement,
25667 cond_removal_in_popcount_clz_ctz_pattern,
25668 replace_phi_edge_with_variable): Change type of phi argument from
25669 gimple * to gphi *.
25671 2021-05-06 Richard Biener <rguenther@suse.de>
25673 * tree-ssa-loop-split.c (split_loop): Delay updating SSA form.
25674 Output an opt-info message.
25675 (do_split_loop_on_cond): Likewise.
25676 (tree_ssa_split_loops): Update SSA form here.
25678 2021-05-06 Richard Biener <rguenther@suse.de>
25680 * tree-inline.c (tree_function_versioning): Fix DECL_BY_REFERENCE
25681 return variable removal.
25683 2021-05-06 Marius Hillenbrand <mhillen@linux.ibm.com>
25685 * config/s390/s390-builtins.def (O_M5, O1_M5, ...): Remove unused macros.
25686 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
25687 (s390_vec_permi_dbl, s390_vpdi): Use the O3_U2 type for the immediate
25689 * config/s390/s390.c (s390_const_operand_ok): Remove unused
25692 2021-05-06 Jakub Jelinek <jakub@redhat.com>
25694 PR tree-optimization/94589
25695 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Call
25696 spaceship_replacement.
25697 (cond_only_block_p, spaceship_replacement): New functions.
25699 2021-05-06 Richard Biener <rguenther@suse.de>
25702 * tree-emutls.c (gen_emutls_addr): Pass in whether we're
25703 dealing with a debug use and only query existing addresses
25705 (lower_emutls_1): Avoid splitting out addresses for debug
25706 stmts, reset the debug stmt when we fail to find existing
25708 (lower_emutls_phi_arg): Set wi.stmt.
25710 2021-05-06 Christoph Muellner <cmuellner@gcc.gnu.org>
25713 * config/riscv/riscv.c (riscv_block_move_loop): Use cbranch helper.
25714 * config/riscv/riscv.md (cbranch<mode>4): Generate helpers.
25715 (stack_protect_test): Use cbranch helper.
25717 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25720 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
25721 always return the establisher frame for __builtin_frame_address (0).
25723 2021-05-05 Ivan Sorokin <vanyacpp@gmail.com>
25726 * config/i386/i386-builtins.c (ix86_cpu_model_type_node): New.
25727 (ix86_cpu_model_var): Likewise.
25728 (ix86_cpu_features2_type_node): Likewise.
25729 (ix86_cpu_features2_var): Likewise.
25730 (fold_builtin_cpu): Cache __cpu_model and __cpu_features2 with
25733 2021-05-05 Martin Sebor <msebor@redhat.com>
25735 * passes.def (pass_warn_printf): Run after SSA.
25737 2021-05-05 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25739 * config/arm/neon.md (neon_vtst_combine<mode>): New pattern.
25740 * config/arm/predicates.md (minus_one_operand): New predicate.
25742 2021-05-05 Jeff Law <jlaw@tachyum.com>
25744 * config/avr/avr.md: Remove references to CC_STATUS_INIT.
25746 2021-05-05 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
25748 PR rtl-optimization/100263
25749 * postreload.c (move2add_valid_value_p): Ensure register can
25752 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25754 PR rtl-optimization/100411
25755 * cfgcleanup.c (try_crossjump_to_edge): Also skip end of prologue
25756 and beginning of function markers.
25758 2021-05-05 Jeff Law <jlaw@tachyum.com>
25760 * config/cr16/cr16.h (NOTICE_UPDATE_CC): Remove.
25761 * config/cr16/cr16.c (notice_update_cc): Remove.
25762 * config/cr16/cr16-protos.h (notice_update_cc): Remove.
25764 2021-05-05 Uroš Bizjak <ubizjak@gmail.com>
25767 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
25768 Handle V8QI, V4HI and V2SI modes.
25769 * config/i386/i386.c (ix86_build_const_vector): Handle V2SImode.
25770 (ix86_build_signbit_mask): Ditto.
25771 * config/i386/mmx.md (MMXMODE14): New mode iterator.
25772 (<smaxmin:code><MMXMODE14:mode>3): New expander.
25773 (*mmx_<smaxmin:code><MMXMODE14:mode>3): New insn pattern.
25774 (<umaxmin:code><MMXMODE24:mode>3): New expander.
25775 (*mmx_<umaxmin:code><MMXMODE24:mode>3): New insn pattern.
25776 (vec_cmp<MMXMODEI:mode><MMXMODEI:mode>): New expander.
25777 (vec_cmpu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25778 (vcond<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25779 (vcondu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25780 (vcond_mask_<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25782 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25784 * dwarf2out.c (loc_list_from_tree_1) <DECL>: During early DWARF, do
25785 not expand the VALUE_EXPR of variables put in the non-local frame.
25786 * gimplify.c (gimplify_type_sizes) <RECORD_TYPE>: If the type is not
25787 to be ignored for debug info, ensure its variable offsets are not.
25789 2021-05-05 Richard Biener <rguenther@suse.de>
25791 PR tree-optimization/79333
25792 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
25793 Fold stmt following SSA edges.
25795 2021-05-05 Richard Biener <rguenther@suse.de>
25797 PR middle-end/100394
25798 * calls.c (expand_call): Preserve possibly throwing calls.
25799 * cfgexpand.c (expand_call_stmt): When a call can throw signal
25800 RTL expansion there are side-effects.
25801 * tree-ssa-dce.c (mark_stmt_if_obviously_necessary): Simplify,
25802 mark all possibly throwing stmts necessary unless we can elide
25804 * tree-ssa-dse.c (pass_dse::execute): Preserve exceptions unless
25805 -fdelete-dead-exceptions.
25806 * tree.h (DECL_PURE_P): Add note about exceptions.
25808 2021-05-05 Alexandre Oliva <oliva@adacore.com>
25810 * config/i386/vxworks.h (DBX_REGISTER_NUMBER): Make it
25813 2021-05-04 David Edelsohn <dje.gcc@gmail.com>
25815 * config/rs6000/rs6000-call.c (rs6000_output_mi_thunk): Use
25816 get_fnname_from_decl for name of thunk.
25817 * config/rs6000/rs6000.c (rs6000_declare_alias): Use assemble_name
25818 and ASM_OUTPUT_LABEL.
25819 (rs6000_xcoff_declare_function_name): Use assemble_name and
25821 (rs6000_xcoff_declare_object_name): Use ASM_OUTPUT_LABEL.
25822 (rs6000_xcoff_encode_section_info): Don't add mapping class
25823 for aliases. Always add [DS] mapping class to primary
25825 (rs6000_asm_weaken_decl): Don't explicitly add [DS].
25827 2021-05-04 Martin Sebor <msebor@redhat.com>
25829 PR middle-end/100307
25830 * builtins.c (compute_objsize_r): Clear base0 for pointers.
25832 2021-05-04 Jeff Law <jlaw@tachyum.com>
25834 * config/bfin/bfin.h (NOTICE_UPDATE_CC): Remove.
25836 2021-05-04 Segher Boessenkool <segher@kernel.crashing.org>
25838 * caller-save.c: Remove CC0.
25839 * cfgcleanup.c: Remove CC0.
25840 * cfgrtl.c: Remove CC0.
25841 * combine.c: Remove CC0.
25842 * compare-elim.c: Remove CC0.
25843 * conditions.h: Remove CC0.
25844 * config/h8300/h8300.h: Remove CC0.
25845 * config/h8300/h8300-protos.h: Remove CC0.
25846 * config/h8300/peepholes.md: Remove CC0.
25847 * config/i386/x86-tune-sched.c: Remove CC0.
25848 * config/m68k/m68k.c: Remove CC0.
25849 * config/rl78/rl78.c: Remove CC0.
25850 * config/sparc/sparc.c: Remove CC0.
25851 * config/xtensa/xtensa.c: Remove CC0.
25852 (gen_conditional_move): Use pc_rtx instead of cc0_rtx in a piece of
25853 RTL where that is used as a placeholder only.
25854 * cprop.c: Remove CC0.
25855 * cse.c: Remove CC0.
25856 * cselib.c: Remove CC0.
25857 * df-problems.c: Remove CC0.
25858 * df-scan.c: Remove CC0.
25859 * doc/md.texi: Remove CC0. Adjust an example.
25860 * doc/rtl.texi: Remove CC0. Adjust an example.
25861 * doc/tm.texi: Regenerate.
25862 * doc/tm.texi.in: Remove CC0.
25863 * emit-rtl.c: Remove CC0.
25864 * final.c: Remove CC0.
25865 * fwprop.c: Remove CC0.
25866 * gcse-common.c: Remove CC0.
25867 * gcse.c: Remove CC0.
25868 * genattrtab.c: Remove CC0.
25869 * genconfig.c: Remove CC0.
25870 * genemit.c: Remove CC0.
25871 * genextract.c: Remove CC0.
25872 * gengenrtl.c: Remove CC0.
25873 * genrecog.c: Remove CC0.
25874 * haifa-sched.c: Remove CC0.
25875 * ifcvt.c: Remove CC0.
25876 * ira-costs.c: Remove CC0.
25877 * ira.c: Remove CC0.
25878 * jump.c: Remove CC0.
25879 * loop-invariant.c: Remove CC0.
25880 * lra-constraints.c: Remove CC0.
25881 * lra-eliminations.c: Remove CC0.
25882 * optabs.c: Remove CC0.
25883 * postreload-gcse.c: Remove CC0.
25884 * postreload.c: Remove CC0.
25885 * print-rtl.c: Remove CC0.
25886 * read-rtl-function.c: Remove CC0.
25887 * reg-notes.def: Remove CC0.
25888 * reg-stack.c: Remove CC0.
25889 * reginfo.c: Remove CC0.
25890 * regrename.c: Remove CC0.
25891 * reload.c: Remove CC0.
25892 * reload1.c: Remove CC0.
25893 * reorg.c: Remove CC0.
25894 * resource.c: Remove CC0.
25895 * rtl.c: Remove CC0.
25896 * rtl.def: Remove CC0.
25897 * rtl.h: Remove CC0.
25898 * rtlanal.c: Remove CC0.
25899 * sched-deps.c: Remove CC0.
25900 * sched-rgn.c: Remove CC0.
25901 * shrink-wrap.c: Remove CC0.
25902 * simplify-rtx.c: Remove CC0.
25903 * system.h: Remove CC0. Poison NOTICE_UPDATE_CC, CC_STATUS_MDEP_INIT,
25904 CC_STATUS_MDEP, and CC_STATUS.
25905 * target.def: Remove CC0.
25906 * valtrack.c: Remove CC0.
25907 * var-tracking.c: Remove CC0.
25909 2021-05-04 Richard Biener <rguenther@suse.de>
25911 PR tree-optimization/100414
25912 * tree-ssa-phiopt.c (get_non_trapping): Do not compute dominance
25914 (tree_ssa_phiopt_worker): But unconditionally here.
25916 2021-05-04 Tobias Burnus <tobias@codesourcery.com>
25918 * omp-low.c (lower_rec_input_clauses, lower_reduction_clauses): Handle
25919 && and || with floating-point and complex arguments.
25921 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
25923 * tree-inline.c (insert_debug_decl_map): Delete.
25924 (copy_debug_stmt): Minor tweak.
25925 (setup_one_parameter): Do not use a variable if the value is either
25926 a read-only DECL or a non-addressable local variable in the caller.
25927 In this case, insert the debug-only variable in the map manually.
25928 (expand_call_inline): Do not generate a CLOBBER for these values.
25929 * tree-inline.h (debug_map): Minor tweak.
25931 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
25933 * builtins.c (builtin_with_linkage_p): Return true for stp[n]cpy.
25934 * symtab.c (symtab_node::output_to_lto_symbol_table_p): Tidy up.
25936 2021-05-04 Richard Biener <rguenther@suse.de>
25938 PR tree-optimization/100329
25939 * tree-ssa-reassoc.c (can_reassociate_p): Do not reassociate
25941 (insert_stmt_after): Assert we're not running into asm goto.
25943 2021-05-04 Richard Biener <rguenther@suse.de>
25945 PR tree-optimization/100398
25946 * tree-ssa-dse.c (pass_dse::execute): Preserve control
25949 2021-05-04 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25951 * builtins.c (try_store_by_multiple_pieces): Fix constfun's prototype.
25953 2021-05-04 Alexandre Oliva <oliva@adacore.com>
25955 * builtins.c (try_store_by_multiple_pieces): New.
25956 (expand_builtin_memset_args): Use it. If target_char_cast
25957 fails, proceed as for non-constant val. Pass len's ctz to...
25958 * expr.c (clear_storage_hints): ... this. Try store by
25959 multiple pieces after setmem.
25960 (clear_storage): Adjust.
25961 * expr.h (clear_storage_hints): Likewise.
25962 (try_store_by_multiple_pieces): Declare.
25963 * passes.def: Replace the last copy_prop with ccp.
25965 2021-05-03 Tom de Vries <tdevries@suse.de>
25968 * omp-low.c (lower_rec_input_clauses): Disable SIMT for user-defined
25971 2021-05-03 Richard Biener <rguenther@suse.de>
25973 * tree-ssa-dse.c (dse_classify_store): Track two PHI defs.
25975 2021-05-03 Richard Biener <rguenther@suse.de>
25977 * tree-ssa-dse.c: Do not include domwalk.h but cfganal.h.
25978 (dse_dom_walker): Remove.
25979 (dse_dom_walker::dse_optimize_stmt): Rename...
25980 (dse_optimize_stmt): ... to this, pass in live_bytes sbitmap.
25981 (dse_dom_walker::before_dom_children): Inline ...
25982 (pass_dse::execute): ... here. Perform a reverse program
25985 2021-05-03 H.J. Lu <hjl.tools@gmail.com>
25988 * configure: Regenerated.
25990 2021-05-03 Ilya Leoshkevich <iii@linux.ibm.com>
25993 * config/s390/s390.c (s390_hard_fp_reg_p): New function.
25994 (s390_md_asm_adjust): Handle hard registers.
25996 2021-05-03 Jakub Jelinek <jakub@redhat.com>
25998 PR tree-optimization/100382
25999 * tree-ssa-dse.c: Include tree-eh.h.
26000 (dse_dom_walker::before_dom_children): Don't remove stmts if
26001 stmt_unremovable_because_of_non_call_eh_p is true.
26003 2021-05-02 David Edelsohn <dje.gcc@gmail.com>
26005 * varasm.c (compute_reloc_for_var): Split out from...
26006 (get_variable_section): Use it.
26007 * output.h (compute_reloc_for_var): Declare.
26008 * config/rs6000/rs6000-protos.h
26009 (rs6000_xcoff_asm_output_aligned_decl_common): Change alignment to
26011 * config/rs6000/rs6000.c (rs6000_legitimize_tls_address_aix):
26012 Don't append storage mapping class to symbol.
26013 (rs6000_xcoff_asm_named_section): Add BS and UL mapping classes.
26014 Don't convert TLS BSS to common.
26015 (rs6000_xcoff_unique_section): Don't fall back to select_secton.
26016 (rs6000_xcoff_section_type_flags): Add SECTION_BSS if DECL is
26018 (rs6000_xcoff_asm_globalize_decl_name): Don't strip storage
26020 (rs6000_xcoff_asm_output_aligned_decl_common): Align is unsigned int.
26021 If align is 0 from TLS class, use the same rules as varasm.c
26022 If not common, switch to BSS section manually.
26023 If common, emit appropriate comm or lcomm directive.
26024 (rs6000_xcoff_encode_section_info): Add logic to append all
26025 storage mapping classes.
26026 (rs6000_asm_weaken_decl): Adjust for qualname symbols.
26027 * config/rs6000/xcoff.h (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
26028 rs6000_xcoff_asm_output_aligned_decl_common.
26029 (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
26030 rs6000_xcoff_asm_output_aligned_decl_common.
26031 (ASM_OUTPUT_TLS_COMMON): Use
26032 rs6000_xcoff_asm_output_aligned_decl_common.
26034 2021-05-02 Jakub Jelinek <jakub@redhat.com>
26037 * config/nvptx/nvptx.c (nvptx_sese_pseudo): Use nullptr instead of 0
26038 as first argument of pseudo_node_t constructors.
26040 2021-05-02 Jakub Jelinek <jakub@redhat.com>
26043 * config/i386/t-i386 (TM_H): Add $(srcdir)/config/i386/i386-isa.def.
26045 2021-05-01 Aldy Hernandez <aldyh@redhat.com>
26047 * value-range.cc (DEFINE_INT_RANGE_GC_STUBS): Remove.
26048 (gt_pch_nx (int_range<1> *&)): New.
26049 (gt_ggc_mx (int_range<1> *&)): New.
26050 * value-range.h (class irange): Add GTY support for
26053 2021-05-01 Geng Qi <gengqi@linux.alibaba.com>
26055 * doc/options.texi (Negative): Change either or to both and.
26057 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26059 * config/aarch64/aarch64-simd-builtins.def: Add
26060 float_ml[as][q]_laneq builtin generator macros.
26061 * config/aarch64/aarch64-simd.md (mul_laneq<mode>3): Define.
26062 (aarch64_float_mla_laneq<mode>): Define.
26063 (aarch64_float_mls_laneq<mode>): Define.
26064 * config/aarch64/arm_neon.h (vmla_laneq_f32): Use RTL builtin
26065 instead of GCC vector extensions.
26066 (vmlaq_laneq_f32): Likewise.
26067 (vmls_laneq_f32): Likewise.
26068 (vmlsq_laneq_f32): Likewise.
26070 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26072 * config/aarch64/aarch64-simd-builtins.def: Add
26073 float_ml[as]_lane builtin generator macros.
26074 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt<mode>):
26076 (mul_lane<mode>3): This, and re-order arguments.
26077 (aarch64_float_mla_lane<mode>): Define.
26078 (aarch64_float_mls_lane<mode>): Define.
26079 * config/aarch64/arm_neon.h (vmla_lane_f32): Use RTL builtin
26080 instead of GCC vector extensions.
26081 (vmlaq_lane_f32): Likewise.
26082 (vmls_lane_f32): Likewise.
26083 (vmlsq_lane_f32): Likewise.
26085 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26087 * config/aarch64/aarch64-simd-builtins.def: Add float_ml[as]
26088 builtin generator macros.
26089 * config/aarch64/aarch64-simd.md (aarch64_float_mla<mode>):
26091 (aarch64_float_mls<mode>): Define.
26092 * config/aarch64/arm_neon.h (vmla_f32): Use RTL builtin
26093 instead of relying on GCC vector extensions.
26094 (vmla_f64): Likewise.
26095 (vmlaq_f32): Likewise.
26096 (vmlaq_f64): Likewise.
26097 (vmls_f32): Likewise.
26098 (vmls_f64): Likewise.
26099 (vmlsq_f32): Likewise.
26100 (vmlsq_f64): Likewise.
26101 * config/aarch64/iterators.md: Define VDQF_DF mode iterator.
26103 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26105 * config/aarch64/aarch64-simd-builtins.def: Add
26106 float_ml[as]_n_builtin generator macros.
26107 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt_from_dup<mode>):
26109 (mul_n<mode>3): This, and re-order arguments.
26110 (aarch64_float_mla_n<mode>): Define.
26111 (aarch64_float_mls_n<mode>): Define.
26112 * config/aarch64/arm_neon.h (vmla_n_f32): Use RTL builtin
26113 instead of inline asm.
26114 (vmlaq_n_f32): Likewise.
26115 (vmls_n_f32): Likewise.
26116 (vmlsq_n_f32): Likewise.
26118 2021-04-30 Jonathan Wright <joanthan.wright@arm.com>
26120 * config/aarch64/aarch64-simd-builtins.def: Add pmull[2]
26121 builtin generator macros.
26122 * config/aarch64/aarch64-simd.md (aarch64_pmullv8qi): Define.
26123 (aarch64_pmull_hiv16qi_insn): Define.
26124 (aarch64_pmull_hiv16qi): Define.
26125 * config/aarch64/arm_neon.h (vmull_high_p8): Use RTL builtin
26126 instead of inline asm.
26127 (vmull_p8): Likewise.
26129 2021-04-30 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
26131 * config/avr/avr.md: Adjust peepholes to match and
26132 generate parallels with clobber of REG_CC.
26133 (mov<mode>_insn): Rename to mov<mode>_insn_split.
26134 (*mov<mode>_insn): Rename to mov<mode>_insn.
26136 2021-04-30 David Edelsohn <dje.gcc@gmail.com>
26138 * varasm.c (use_blocks_for_decl_p): Don't use section anchors
26139 for VAR_DECLs if -fdata-sections enabled.
26141 2021-04-30 Michael Meissner <meissner@linux.ibm.com>
26143 PR bootstrap/100327
26144 * config/rs6000/rs6000.c
26145 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Define.
26146 (rs6000_libgcc_floating_mode_supported_p): New target hook.
26148 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
26150 * tree-ssa-threadbackward.c (class thread_jumps): Split out code
26152 (class back_threader_registry): ...to here...
26153 (class back_threader_profitability): ...and here...
26154 (thread_jumps::thread_through_all_blocks): Remove argument.
26155 (back_threader_registry::back_threader_registry): New.
26156 (back_threader_registry::~back_threader_registry): New.
26157 (back_threader_registry::thread_through_all_blocks): New.
26158 (thread_jumps::profitable_jump_thread_path): Move from here...
26159 (back_threader_profitability::profitable_path_p): ...to here.
26160 (thread_jumps::find_taken_edge): New.
26161 (thread_jumps::convert_and_register_current_path): Move...
26162 (back_threader_registry::register_path): ...to here.
26163 (thread_jumps::register_jump_thread_path_if_profitable): Move...
26164 (thread_jumps::maybe_register_path): ...to here.
26165 (thread_jumps::handle_phi): Call find_taken_edge and
26166 maybe_register_path.
26167 (thread_jumps::handle_assignment): Same.
26168 (thread_jumps::fsm_find_control_statement_thread_paths): Remove
26169 tree argument to handle_phi and handle_assignment.
26170 (thread_jumps::find_jump_threads_backwards): Set m_name. Remove
26171 set of m_speed_p and m_max_threaded_paths.
26172 (pass_thread_jumps::execute): Remove second argument from
26173 find_jump_threads_backwards.
26174 (pass_early_thread_jumps::execute): Same.
26176 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
26178 * tree-ssa-dom.c (class dom_jump_threader_simplifier): New.
26179 (class dom_opt_dom_walker): Initialize some class variables.
26180 (pass_dominator::execute): Pass evrp_range_analyzer and
26181 dom_jump_threader_simplifier to dom_opt_dom_walker.
26182 Adjust for some functions moving into classes.
26183 (simplify_stmt_for_jump_threading): Adjust and move to...
26184 (jump_threader_simplifier::simplify): ...here.
26185 (dom_opt_dom_walker::before_dom_children): Adjust for
26186 m_evrp_range_analyzer.
26187 (dom_opt_dom_walker::after_dom_children): Remove x_vr_values hack.
26188 (test_for_singularity): Place in dom_opt_dom_walker class.
26189 (dom_opt_dom_walker::optimize_stmt): The argument
26190 evrp_range_analyzer is now a class field.
26191 * tree-ssa-threadbackward.c (class thread_jumps): Add m_registry.
26192 (thread_jumps::thread_through_all_blocks): New.
26193 (thread_jumps::convert_and_register_current_path): Use m_registry.
26194 (pass_thread_jumps::execute): Adjust for thread_through_all_blocks
26195 being in the threader class.
26196 (pass_early_thread_jumps::execute): Same.
26197 * tree-ssa-threadedge.c (threadedge_initialize_values): Move...
26198 (jump_threader::jump_threader): ...here.
26199 (threadedge_finalize_values): Move...
26200 (jump_threader::~jump_threader): ...here.
26201 (jump_threader::remove_jump_threads_including): New.
26202 (jump_threader::thread_through_all_blocks): New.
26203 (record_temporary_equivalences_from_phis): Move...
26204 (jump_threader::record_temporary_equivalences_from_phis): ...here.
26205 (record_temporary_equivalences_from_stmts_at_dest): Move...
26206 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
26208 (simplify_control_stmt_condition_1): Move to jump_threader class.
26209 (simplify_control_stmt_condition): Move...
26210 (jump_threader::simplify_control_stmt_condition): ...here.
26211 (thread_around_empty_blocks): Move...
26212 (jump_threader::thread_around_empty_blocks): ...here.
26213 (thread_through_normal_block): Move...
26214 (jump_threader::thread_through_normal_block): ...here.
26215 (thread_across_edge): Move...
26216 (jump_threader::thread_across_edge): ...here.
26217 (thread_outgoing_edges): Move...
26218 (jump_threader::thread_outgoing_edges): ...here.
26219 * tree-ssa-threadedge.h: Move externally facing functings...
26220 (class jump_threader): ...here...
26221 (class jump_threader_simplifier): ...and here.
26222 * tree-ssa-threadupdate.c (struct redirection_data): Remove comment.
26223 (jump_thread_path_allocator::jump_thread_path_allocator): New.
26224 (jump_thread_path_allocator::~jump_thread_path_allocator): New.
26225 (jump_thread_path_allocator::allocate_thread_edge): New.
26226 (jump_thread_path_allocator::allocate_thread_path): New.
26227 (jump_thread_path_registry::jump_thread_path_registry): New.
26228 (jump_thread_path_registry::~jump_thread_path_registry): New.
26229 (jump_thread_path_registry::allocate_thread_edge): New.
26230 (jump_thread_path_registry::allocate_thread_path): New.
26231 (dump_jump_thread_path): Make extern.
26232 (debug (const vec<jump_thread_edge *> &path)): New.
26233 (struct removed_edges): Move to tree-ssa-threadupdate.h.
26234 (struct thread_stats_d): Remove.
26235 (remove_ctrl_stmt_and_useless_edges): Make static.
26236 (lookup_redirection_data): Move...
26237 (jump_thread_path_registry::lookup_redirection_data): ...here.
26238 (ssa_redirect_edges): Make static.
26239 (thread_block_1): Move...
26240 (jump_thread_path_registry::thread_block_1): ...here.
26241 (thread_block): Move...
26242 (jump_thread_path_registry::thread_block): ...here.
26243 (thread_through_loop_header): Move...
26244 (jump_thread_path_registry::thread_through_loop_header): ...here.
26245 (mark_threaded_blocks): Move...
26246 (jump_thread_path_registry::mark_threaded_blocks): ...here.
26247 (debug_path): Move...
26248 (jump_thread_path_registry::debug_path): ...here.
26249 (debug_all_paths): Move...
26250 (jump_thread_path_registry::dump): ..here.
26251 (rewire_first_differing_edge): Move...
26252 (jump_thread_path_registry::rewire_first_differing_edge): ...here.
26253 (adjust_paths_after_duplication): Move...
26254 (jump_thread_path_registry::adjust_paths_after_duplication): ...here.
26255 (duplicate_thread_path): Move...
26256 (jump_thread_path_registry::duplicate_thread_path): ..here.
26257 (remove_jump_threads_including): Move...
26258 (jump_thread_path_registry::remove_jump_threads_including): ...here.
26259 (thread_through_all_blocks): Move to...
26260 (jump_thread_path_registry::thread_through_all_blocks): ...here.
26261 (delete_jump_thread_path): Remove.
26262 (register_jump_thread): Move...
26263 (jump_thread_path_registry::register_jump_thread): ...here.
26264 * tree-ssa-threadupdate.h: Move externally facing functions...
26265 (class jump_thread_path_allocator): ...here...
26266 (class jump_thread_path_registry): ...and here.
26267 (thread_through_all_blocks): Remove.
26268 (struct removed_edges): New.
26269 (register_jump_thread): Remove.
26270 (remove_jump_threads_including): Remove.
26271 (delete_jump_thread_path): Remove.
26272 (remove_ctrl_stmt_and_useless_edges): Remove.
26273 (free_dom_edge_info): New prototype.
26274 * tree-vrp.c: Remove x_vr_values hack.
26275 (class vrp_jump_threader_simplifier): New.
26276 (vrp_jump_threader_simplifier::simplify): New.
26277 (vrp_jump_threader::vrp_jump_threader): Adjust method signature.
26278 Remove m_dummy_cond.
26279 Instantiate m_simplifier and m_threader.
26280 (vrp_jump_threader::thread_through_all_blocks): New.
26281 (vrp_jump_threader::simplify_stmt): Remove.
26282 (vrp_jump_threader::after_dom_children): Do not set m_dummy_cond.
26283 Remove x_vr_values hack.
26284 (execute_vrp): Adjust for thread_through_all_blocks being in a
26287 2021-04-30 Christophe Lyon <christophe.lyon@linaro.org>
26289 * genflags.c (gen_insn): Print failed expansion string.
26291 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
26293 * expr.c (alignment_for_piecewise_move): Call mode_for_size
26294 without limit to MAX_FIXED_MODE_SIZE.
26296 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
26298 PR middle-end/90773
26299 * builtins.c (builtin_memset_gen_str): Don't use return from
26300 simplify_gen_subreg.
26302 2021-04-30 Uroš Bizjak <ubizjak@gmail.com>
26305 * config/i386/i386.md (*add<mode>3_carry_0r): New insn pattern.
26306 (*addsi3_carry_zext_0r): Ditto.
26307 (*sub<mode>3_carry_0): Ditto.
26308 (*subsi3_carry_zext_0r): Ditto.
26309 * config/i386/predicates.md (ix86_carry_flag_unset_operator):
26311 * config/i386/i386.c (ix86_rtx_costs) <case PLUS, case MINUS>:
26312 Also consider ix86_carry_flag_unset_operator to calculate
26313 the cost of adc/sbb insn.
26315 2021-04-30 Roman Zhuykov <zhroma@ispras.ru>
26317 PR rtl-optimization/100225
26318 PR rtl-optimization/84878
26319 * modulo-sched.c (sms_schedule): Use note_stores to skip loops
26320 where we have an instruction which touches (writes) any hard
26321 register from df->regular_block_artificial_uses set.
26322 Allow not-single-set instruction only right before basic block
26325 2021-04-30 Geng Qi <gengqi@linux.alibaba.com>
26327 * config/riscv/riscv.opt (march=,mabi=): Negative itself.
26329 2021-04-30 LevyHsu <admin@levyhsu.com>
26331 * config/riscv/riscv.c (riscv_min_arithmetic_precision): New.
26332 * config/riscv/riscv.h (TARGET_MIN_ARITHMETIC_PRECISION): New.
26333 * config/riscv/riscv.md (addv<mode>4, uaddv<mode>4): New.
26334 (subv<mode>4, usubv<mode>4, mulv<mode>4, umulv<mode>4): New.
26336 2021-04-29 Alexandre Oliva <oliva@adacore.com>
26338 * config.gcc: Merged x86 and x86_64 cpu_type-setting cases.
26340 2021-04-29 Alexandre Oliva <oliva@adacore.com>
26342 * config/i386/i386.h (ASM_OUTPUT_MAX_SKIP_PAD): Rename to...
26343 (ASM_OUTPUT_MAX_SKIP_ALIGN): ... this. Enclose in do/while(0).
26344 * config/i386/i386.c: Adjust.
26345 * config/i386/i386.md: Adjust.
26346 * config/i386/darwin.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Drop.
26347 * config/i386/dragonfly.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26348 * config/i386/freebsd.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26349 * config/i386/gas.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26350 * config/i386/gnu-user.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26351 * config/i386/iamcu.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26352 * config/i386/lynx.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26353 * config/i386/netbsd-elf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26354 * config/i386/openbsdelf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26355 * config/i386/x86-64.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26356 (ASM_OUTPUT_MAX_SKIP_PAD): Likewise.
26358 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26360 * config/i386/i386-expand.c (ix86_expand_int_compare):
26361 Swap operands of GTU and LEU comparison to emit carry flag comparison.
26362 * config/i386/i386.md (*add<mode>3_carry_0): Change insn
26363 predicate to allow more combine opportunities with memory operands.
26364 (*sub<mode>3_carry_0): Ditto.
26366 2021-04-29 Richard Sandiford <richard.sandiford@arm.com>
26368 PR rtl-optimization/100303
26369 * rtl-ssa/accesses.cc (function_info::make_use_available): Take a
26370 boolean that indicates whether the use will only be used in
26371 debug instructions. Treat it in the same way that existing
26372 cross-EBB debug references would be handled if so.
26373 (function_info::make_uses_available): Likewise.
26374 * rtl-ssa/functions.h (function_info::make_uses_available): Update
26375 prototype accordingly.
26376 (function_info::make_uses_available): Likewise.
26377 * fwprop.c (try_fwprop_subst): Update call accordingly.
26379 2021-04-29 Jeff Law <jlaw@tachyum.com>
26381 * config/nios2/nios2-protos.h (nios2_fpu_insn_enabled): Move outside
26384 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26385 Richard Biener <rguenther@suse.de>
26388 * config/i386/i386-builtin.def (IX86_BUILTIN_MASKLOADPD)
26389 (IX86_BUILTIN_MASKLOADPS, IX86_BUILTIN_MASKLOADPD256)
26390 (IX86_BUILTIN_MASKLOADPS256, IX86_BUILTIN_MASKLOADD)
26391 (IX86_BUILTIN_MASKLOADQ, IX86_BUILTIN_MASKLOADD256)
26392 (IX86_BUILTIN_MASKLOADQ256): Move from SPECIAL_ARGS
26393 to PURE_ARGS category.
26394 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
26395 Handle PURE_ARGS category.
26396 * config/i386/i386-expand.c (ix86_expand_builtin): Ditto.
26398 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
26400 * configure.ac: Check for the presence of sys/locking.h header and
26401 for whether _LK_LOCK is supported by _locking.
26402 * configure: Regenerate.
26403 * config.in: Likewise.
26404 * gcov-io.h: Define GCOV_LOCKED_WITH_LOCKING if HOST_HAS_LK_LOCK.
26405 * gcov-io.c (gcov_open): Add support for GCOV_LOCKED_WITH_LOCKING.
26406 * system.h: Include <sys/locking.h> if HAVE_SYS_LOCKING_H.
26408 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26410 * config/i386/predicates.md (fcmov_comparison_operator):
26411 Do not check for trivial FP comparison operator.
26412 <case GEU, case LTU>: Allow CCGZmode.
26413 <case GTU, case LEU>: Do not allow CCCmode.
26414 (ix86_comparison_operator) <case GTU, case LEU>: Allow only CCmode.
26415 (ix86_carry_flag_operator): Match only LTU and UNLT code.
26416 Do not check for trivial FP comparison operator. Allow CCGZmode.
26418 2021-04-29 Tom de Vries <tdevries@suse.de>
26420 * omp-expand.c (expand_omp_simd): Add step_orig, and replace uses of
26421 fd->loop.step by either step or orig_step.
26423 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
26425 * config/sparc/sparc.c (gen_load_pcrel_sym): Delete.
26426 (load_got_register): Do the PIC dance here.
26427 (sparc_legitimize_tls_address): Simplify.
26428 (sparc_emit_probe_stack_range): Likewise.
26429 (sparc32_initialize_trampoline): Likewise.
26430 (sparc64_initialize_trampoline): Likewise.
26431 * config/sparc/sparc.md (load_pcrel_sym<P:mode>): Add @ marker.
26432 (probe_stack_range<P:mode>): Likewise.
26433 (flush<P:mode>): Likewise.
26434 (tgd_hi22<P:mode>): Likewise.
26435 (tgd_lo10<P:mode>): Likewise.
26436 (tgd_add<P:mode>): Likewise.
26437 (tgd_call<P:mode>): Likewise.
26438 (tldm_hi22<P:mode>): Likewise.
26439 (tldm_lo10<P:mode>): Likewise.
26440 (tldm_add<P:mode>): Likewise.
26441 (tldm_call<P:mode>): Likewise.
26442 (tldo_hix22<P:mode>): Likewise.
26443 (tldo_lox10<P:mode>): Likewise.
26444 (tldo_add<P:mode>): Likewise.
26445 (tie_hi22<P:mode>): Likewise.
26446 (tie_lo10<P:mode>): Likewise.
26447 (tie_add<P:mode>): Likewise.
26448 (tle_hix22<P:mode>): Likewise.
26449 (tle_lox10<P:mode>): Likewise.
26450 (stack_protect_setsi): Rename to...
26451 (stack_protect_set32): ...this.
26452 (stack_protect_setdi): Rename to...
26453 (stack_protect_set64): ...this.
26454 (stack_protect_set): Adjust calls to above.
26455 (stack_protect_testsi): Rename to...
26456 (stack_protect_test32): ...this.
26457 (stack_protect_testdi): Rename to...
26458 (stack_protect_test64): ...this.
26459 (stack_protect_test): Adjust calls to above.
26461 2021-04-29 H.J. Lu <hjl.tools@gmail.com>
26463 PR middle-end/90773
26464 * builtins.c (builtin_memcpy_read_str): Add a dummy argument.
26465 (builtin_strncpy_read_str): Likewise.
26466 (builtin_memset_read_str): Add an argument for the previous RTL
26467 information and generate the new RTL from the previous RTL info.
26468 (builtin_memset_gen_str): Likewise.
26469 * builtins.h (builtin_strncpy_read_str): Update the prototype.
26470 (builtin_memset_read_str): Likewise.
26471 * expr.c (by_pieces_ninsns): If targetm.overlap_op_by_pieces_p()
26472 returns true, round up size and alignment to the widest integer
26473 mode for maximum size.
26474 (pieces_addr::adjust): Add a pointer to by_pieces_prev argument
26475 and pass it to m_constfn.
26476 (op_by_pieces_d): Add m_push and m_overlap_op_by_pieces.
26477 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
26478 initialize m_push. Initialize m_overlap_op_by_pieces with
26479 targetm.overlap_op_by_pieces_p ().
26480 (op_by_pieces_d::run): Pass the previous RTL information to
26481 pieces_addr::adjust and generate overlapping operations if
26482 m_overlap_op_by_pieces is true.
26484 (move_by_pieces_d::move_by_pieces_d): Updated for op_by_pieces_d
26486 (store_by_pieces_d::store_by_pieces_d): Updated for op_by_pieces_d
26488 (can_store_by_pieces): Use by_pieces_constfn on constfun.
26489 (store_by_pieces): Use by_pieces_constfn on constfun. Updated
26490 for op_by_pieces_d change.
26491 (clear_by_pieces_1): Add a dummy argument.
26492 (clear_by_pieces): Updated for op_by_pieces_d change.
26493 (compare_by_pieces_d::compare_by_pieces_d): Likewise.
26494 (string_cst_read_str): Add a dummy argument.
26495 * expr.h (by_pieces_constfn): Add a dummy argument.
26496 (by_pieces_prev): New.
26497 * target.def (overlap_op_by_pieces_p): New target hook.
26498 * config/i386/i386.c (TARGET_OVERLAP_OP_BY_PIECES_P): New.
26499 * doc/tm.texi.in: Add TARGET_OVERLAP_OP_BY_PIECES_P.
26500 * doc/tm.texi: Regenerated.
26502 2021-04-29 Richard Biener <rguenther@suse.de>
26504 PR tree-optimization/100253
26505 * tree-vect-stmts.c (vectorizable_load): Do not assume
26506 element alignment when DR_MISALIGNMENT is -1.
26507 (vectorizable_store): Likewise.
26509 2021-04-29 Jakub Jelinek <jakub@redhat.com>
26512 * config/aarch64/aarch64.c (aarch64_add_offset_1_temporaries): Use
26513 absu_hwi instead of abs_hwi.
26515 2021-04-29 Richard Biener <rguenther@suse.de>
26517 PR middle-end/38474
26518 * tree-ssa-structalias.c (add_graph_edge): Avoid direct
26519 forwarding when indirect forwarding through ESCAPED
26522 2021-04-29 Tom de Vries <tdevries@suse.de>
26525 * internal-fn.c (expand_GOMP_SIMT_ENTER_ALLOC)
26526 (expand_GOMP_SIMT_LAST_LANE, expand_GOMP_SIMT_ORDERED_PRED)
26527 (expand_GOMP_SIMT_VOTE_ANY, expand_GOMP_SIMT_XCHG_BFLY)
26528 (expand_GOMP_SIMT_XCHG_IDX): Ensure target is assigned to.
26530 2021-04-29 Richard Biener <rguenther@suse.de>
26532 PR tree-optimization/99912
26533 * tree-ssa-dse.c (dse_dom_walker::m_need_cfg_cleanup): New.
26534 (dse_dom_walker::todo): Likewise.
26535 (dse_dom_walker::dse_optimize_stmt): Move VDEF check to the
26537 (dse_dom_walker::before_dom_children): Remove trivially
26538 dead SSA defs and schedule CFG cleanup if we removed all
26540 (pass_dse::execute): Get TODO as computed by the DOM walker
26541 and return it. Wipe dominator info earlier.
26543 2021-04-29 Richard Biener <rguenther@suse.de>
26546 * ipa-prop.c (ipcp_modif_dom_walker::before_dom_children):
26547 Track blocks to cleanup EH in new m_need_eh_cleanup.
26548 (ipcp_modif_dom_walker::cleanup_eh): New.
26549 (ipcp_transform_function): Release dominator info before
26552 2021-04-29 Martin Sebor <msebor@redhat.com>
26554 PR middle-end/100250
26555 * attribs.c (attr_access::array_as_string): Avoid dereferencing
26556 a pointer when it's null.
26558 2021-04-29 Martin Sebor <msebor@redhat.com>
26560 * Makefile.in (OBJS): Add ipa-free-lang-data.o.
26561 * ipa-free-lang-data.cc: New file.
26562 * tree.c: Move pass free_lang_data to file above.
26563 (build_array_type_1): Declare extern.
26564 * tree.h (build_array_type_1): Declare.
26566 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26568 * config/aarch64/aarch64-simd-builtins.def: Modify comment to
26569 make consistent with updated RTL pattern.
26570 * config/aarch64/aarch64-simd.md (aarch64_<sur>qmovn<mode>):
26571 Implement using ss_truncate and us_truncate rather than
26573 * config/aarch64/iterators.md: Remove redundant unspecs and
26574 iterator: UNSPEC_[SU]QXTN and SUQMOVN respectively.
26576 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26578 * config/aarch64/arm_acle.h (__attribute__): Make intrinsic
26579 attributes consistent with those defined in arm_neon.h.
26581 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26583 * config/aarch64/arm_fp16.h (__attribute__): Make intrinsic
26584 attributes consistent with those defined in arm_neon.h.
26586 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26588 * config/aarch64/aarch64-simd-builtins.def: Add
26589 float_trunc_rodd builtin generator macros.
26590 * config/aarch64/aarch64-simd.md (aarch64_float_trunc_rodd_df):
26592 (aarch64_float_trunc_rodd_lo_v2sf): Define.
26593 (aarch64_float_trunc_rodd_hi_v4sf_le): Define.
26594 (aarch64_float_trunc_rodd_hi_v4sf_be): Define.
26595 (aarch64_float_trunc_rodd_hi_v4sf): Define.
26596 * config/aarch64/arm_neon.h (vcvtx_f32_f64): Use RTL builtin
26597 instead of inline asm.
26598 (vcvtx_high_f32_f64): Likewise.
26599 (vcvtxd_f32_f64): Likewise.
26600 * config/aarch64/iterators.md: Add FCVTXN unspec.
26602 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26604 * config/aarch64/aarch64-simd-builtins.def: Add tbx1 builtin
26606 * config/aarch64/aarch64-simd.md (aarch64_tbx1<mode>):
26608 * config/aarch64/arm_neon.h (vqtbx1_s8): USE RTL builtin
26609 instead of inline asm.
26610 (vqtbx1_u8): Likewise.
26611 (vqtbx1_p8): Likewise.
26612 (vqtbx1q_s8): Likewise.
26613 (vqtbx1q_u8): Likewise.
26614 (vqtbx1q_p8): Likewise.
26615 (vtbx2_s8): Likewise.
26616 (vtbx2_u8): Likewise.
26617 (vtbx2_p8): Likewise.
26619 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26621 * config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
26623 * config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
26624 instead of inline asm.
26625 (vqtbl1_s8): Likewise.
26626 (vqtbl1_u8): Likewise.
26627 (vqtbl1q_p8): Likewise.
26628 (vqtbl1q_s8): Likewise.
26629 (vqtbl1q_u8): Likewise.
26630 (vtbl1_s8): Likewise.
26631 (vtbl1_u8): Likewise.
26632 (vtbl1_p8): Likewise.
26633 (vtbl2_s8): Likewise.
26634 (vtbl2_u8): Likewise.
26635 (vtbl2_p8): Likewise.
26637 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26639 * config/aarch64/aarch64-simd-builtins.def: Add polynomial
26640 ssri_n buitin generator macro.
26641 * config/aarch64/arm_neon.h (vsri_n_p8): Use RTL builtin
26642 instead of inline asm.
26643 (vsri_n_p16): Likewise.
26644 (vsri_n_p64): Likewise.
26645 (vsriq_n_p8): Likewise.
26646 (vsriq_n_p16): Likewise.
26647 (vsriq_n_p64): Likewise.
26649 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26651 * config/aarch64/aarch64-simd-builtins.def: Use VALLP mode
26652 iterator for polynomial ssli_n builtin generator macro.
26653 * config/aarch64/arm_neon.h (vsli_n_p8): Use RTL builtin
26654 instead of inline asm.
26655 (vsli_n_p16): Likewise.
26656 (vsliq_n_p8): Likewise.
26657 (vsliq_n_p16): Likewise.
26658 * config/aarch64/iterators.md: Define VALLP mode iterator.
26660 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26662 * config/aarch64/aarch64-simd-builtins.def: Use VDQV_L
26663 iterator to generate [su]adalp RTL builtins.
26664 * config/aarch64/aarch64-simd.md: Use VDQV_L iterator in
26665 [su]adalp RTL pattern.
26666 * config/aarch64/arm_neon.h (vpadal_s32): Use RTL builtin
26667 instead of inline asm.
26668 (vpadal_u32): Likewise.
26670 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26672 * config/aarch64/aarch64-simd-builtins.def: Add [su]addlp
26673 builtin generator macros.
26674 * config/aarch64/aarch64-simd.md (aarch64_<su>addlp<mode>):
26676 * config/aarch64/arm_neon.h (vpaddl_s8): Use RTL builtin
26677 instead of inline asm.
26678 (vpaddl_s16): Likewise.
26679 (vpaddl_s32): Likewise.
26680 (vpaddl_u8): Likewise.
26681 (vpaddl_u16): Likewise.
26682 (vpaddl_u32): Likewise.
26683 (vpaddlq_s8): Likewise.
26684 (vpaddlq_s16): Likewise.
26685 (vpaddlq_s32): Likewise.
26686 (vpaddlq_u8): Likewise.
26687 (vpaddlq_u16): Likewise.
26688 (vpaddlq_u32): Liwewise.
26689 * config/aarch64/iterators.md: Define [SU]ADDLP unspecs with
26690 appropriate attributes.
26692 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26694 * config/aarch64/aarch64-simd-builtins.def: Use VDQ_I iterator
26695 for aarch64_addp<mode> builtin macro generator.
26696 * config/aarch64/aarch64-simd.md: Use VDQ_I iterator in
26697 aarch64_addp<mode> RTL pattern.
26698 * config/aarch64/arm_neon.h (vpaddq_s8): Use RTL builtin
26699 instead of inline asm.
26700 (vpaddq_s16): Likewise.
26701 (vpaddq_s32): Likewise.
26702 (vpaddq_s64): Likewise.
26703 (vpaddq_u8): Likewise.
26704 (vpaddq_u16): Likewise.
26705 (vpaddq_u32): Likewise.
26706 (vpaddq_u64): Likewise.
26708 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26710 * config/aarch64/aarch64-simd-builtins.def: Add sq[r]dmulh_n
26711 builtin generator macros.
26712 * config/aarch64/aarch64-simd.md (aarch64_sq<r>dmulh_n<mode>):
26714 * config/aarch64/arm_neon.h (vqdmulh_n_s16): Use RTL builtin
26715 instead of inline asm.
26716 (vqdmulh_n_s32): Likewise.
26717 (vqdmulhq_n_s16): Likewise.
26718 (vqdmulhq_n_s32): Likewise.
26719 (vqrdmulh_n_s16): Likewise.
26720 (vqrdmulh_n_s32): Likewise.
26721 (vqrdmulhq_n_s16): Likewise.
26722 (vqrdmulhq_n_s32): Likewise.
26724 2021-04-28 Tobias Burnus <tobias@codesourcery.com>
26726 * doc/install.texi (--enable-offload-defaulted): Document.
26728 2021-04-28 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
26730 * config/avr/avr-dimode.md: Turn existing patterns into
26731 define_insn_and_split style patterns where the splitter
26732 adds a clobber of the condition code register. Drop "cc"
26733 attribute. Add new patterns to match output of
26735 * config/avr/avr-fixed.md: Likewise.
26736 * config/avr/avr.c (cc_reg_rtx): New.
26737 (avr_parallel_insn_from_insns): Adjust insn count
26738 for removal of set of cc0.
26739 (avr_is_casesi_sequence): Likewise.
26740 (avr_casei_sequence_check_operands): Likewise.
26741 (avr_optimize_casesi): Likewise. Also insert
26742 new insns after jump_insn.
26743 (avr_pass_casesi::avr_rest_of_handle_casesi): Adjust
26744 for removal of set of cc0.
26745 (avr_init_expanders): Initialize cc_reg_rtx.
26746 (avr_regno_reg_class): Handle REG_CC.
26747 (cond_string): Remove usage of CC_OVERFLOW_UNUSABLE.
26748 (avr_notice_update_cc): Remove function.
26749 (ret_cond_branch): Remove usage of CC_OVERFLOW_UNUSABLE.
26750 (compare_condition): Adjust for PARALLEL with
26752 (out_shift_with_cnt): Likewise.
26753 (ashlhi3_out): Likewise.
26754 (ashrhi3_out): Likewise.
26755 (lshrhi3_out): Likewise.
26756 (avr_class_max_nregs): Return single reg for REG_CC.
26757 (avr_compare_pattern): Check for REG_CC instead
26759 (avr_reorg_remove_redundant_compare): Likewise.
26760 (avr_reorg):Adjust for PARALLEL with REG_CC clobber.
26761 (avr_hard_regno_nregs): Return single reg for REG_CC.
26762 (avr_hard_regno_mode_ok): Allow only CCmode for REG_CC.
26763 (avr_md_asm_adjust): Clobber REG_CC.
26764 (TARGET_HARD_REGNO_NREGS): Define.
26765 (TARGET_CLASS_MAX_NREGS): Define.
26766 (TARGET_MD_ASM_ADJUST): Define.
26767 * config/avr/avr.h (FIRST_PSEUDO_REGISTER): Adjust
26769 (enum reg_class): Add CC_REG class.
26770 (NOTICE_UPDATE_CC): Remove.
26771 (CC_OVERFLOW_UNUSABLE): Remove.
26772 (CC_NO_CARRY): Remove.
26773 * config/avr/avr.md: Turn existing patterns into
26774 define_insn_and_split style patterns where the splitter
26775 adds a clobber of the condition code register. Drop "cc"
26776 attribute. Add new patterns to match output of
26778 (sez): Remove unused pattern.
26780 2021-04-28 Richard Earnshaw <rearnsha@arm.com>
26783 * config/arm/arm.c (arm_hard_regno_mode_ok): Only allow VPR to be
26786 2021-04-28 Richard Sandiford <richard.sandiford@arm.com>
26789 * config/aarch64/constraints.md (Utq): Require the address to
26790 be valid for both the element mode and for V2DImode.
26792 2021-04-28 Jakub Jelinek <jakub@redhat.com>
26793 Tobias Burnus <tobias@codesourcery.com>
26795 * configure.ac (OFFLOAD_DEFAULTED): AC_DEFINE if offload-defaulted.
26796 * gcc.c (process_command): New variable.
26797 (driver::maybe_putenv_OFFLOAD_TARGETS): If OFFLOAD_DEFAULTED,
26798 set it if -foffload is defaulted.
26799 * lto-wrapper.c (OFFLOAD_TARGET_DEFAULT_ENV): Define.
26800 (compile_offload_image): If OFFLOAD_DEFAULTED and
26801 OFFLOAD_TARGET_DEFAULT is in the environment, don't fail
26802 if corresponding mkoffload can't be found.
26803 (compile_images_for_offload_targets): Likewise. Free and clear
26804 offload_names if no valid offload is found.
26805 * config.in: Regenerate.
26806 * configure: Regenerate.
26808 2021-04-28 Richard Biener <rguenther@suse.de>
26810 PR tree-optimization/100292
26811 * tree-vect-generic.c (expand_vector_condition): Do not fold
26814 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26816 * config/rs6000/aix.h (SUBTARGET_DRIVER_SELF_SPECS): New.
26817 * config/rs6000/aix64.opt (m64): New.
26820 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26822 * config/vax/vax.c (print_operand_address, vax_address_cost_1)
26823 (index_term_p): Handle ASHIFT too.
26825 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26827 * config/vax/builtins.md (jbb<ccss>i<mode>): Remove operand #3.
26828 (sync_lock_test_and_set<mode>): Adjust accordingly.
26829 (sync_lock_release<mode>): Likewise.
26831 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26833 * config/vax/vax-protos.h (adjacent_operands_p): Remove
26835 * config/vax/vax.c (adjacent_operands_p): Remove.
26837 2021-04-27 Maciej W. Rozycki <macro@linux-mips.org>
26839 * ifcvt.c (dead_or_predicable) [!IFCVT_MODIFY_TESTS]: Fall
26840 through to the non-conditional execution case if getting the
26841 condition for conditional execution has failed.
26843 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
26845 PR middle-end/100284
26846 * gimple.c (gimple_could_trap_p_1): Remove VEC_COND_EXPR test.
26847 * tree-eh.c (operation_could_trap_p): Handle VEC_COND_EXPR rather
26848 than asserting on it.
26850 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26852 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): Protect
26853 with TARGET_AIX_OS.
26855 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26858 * calls.c (precompute_register_parameters): Additionally test
26859 targetm.precompute_tls_p to pre-compute argument.
26860 * config/rs6000/aix.h (TARGET_PRECOMPUTE_TLS_P): Define.
26861 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): New.
26862 * target.def (precompute_tls_p): New.
26863 * doc/tm.texi.in (TARGET_PRECOMPUTE_TLS_P): Add hook documentation.
26864 * doc/tm.texi: Regenerated.
26866 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26869 * config/aarch64/aarch64.c (aarch64_print_operand): Cast -UINTVAL
26870 back to HOST_WIDE_INT.
26872 2021-04-27 Bernd Edlinger <bernd.edlinger@hotmail.de>
26875 * simplify-rtx.c (simplify_context::simplify_subreg): Check the
26876 memory alignment for the outer mode.
26878 2021-04-27 H.J. Lu <hjl.tools@gmail.com>
26880 PR middle-end/90773
26881 * expr.c (op_by_pieces_d::get_usable_mode): New member function.
26882 (op_by_pieces_d::run): Cange a while loop to a do-while loop.
26884 2021-04-27 Alex Coplan <alex.coplan@arm.com>
26887 * config/arm/arm.c (arm_split_compare_and_swap): Fix up codegen
26888 with negative immediates: ensure we expand cbranchsi4_scratch
26889 correctly and ensure we satisfy its constraints.
26890 * config/arm/sync.md
26891 (@atomic_compare_and_swap<CCSI:arch><NARROW:mode>_1): Don't
26892 attempt to tie two output operands together with constraints;
26893 collapse two alternatives.
26894 (@atomic_compare_and_swap<CCSI:arch><SIDI:mode>_1): Likewise.
26895 * config/arm/thumb1.md (cbranchsi4_neg_late): New.
26897 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26900 * config/aarch64/predicates.md (aarch64_sub_immediate,
26901 aarch64_plus_immediate): Use -UINTVAL instead of -INTVAL.
26902 * config/aarch64/aarch64.md (casesi, rotl<mode>3): Likewise.
26903 * config/aarch64/aarch64.c (aarch64_print_operand,
26904 aarch64_split_atomic_op, aarch64_expand_subvti): Likewise.
26906 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26908 PR tree-optimization/100239
26909 * tree-vect-generic.c (lower_vec_perm): Don't accept constant
26910 permutations with all indices from the first zero element as vec_shl.
26912 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26914 PR rtl-optimization/100254
26915 * cfgcleanup.c (outgoing_edges_match): Check REG_EH_REGION on
26916 last1 and last2 insns rather than BB_END (bb1) and BB_END (bb2) insns.
26918 2021-04-27 Richard Biener <rguenther@suse.de>
26920 PR tree-optimization/99912
26921 * passes.def: Add comment about new TODO_remove_unused_locals.
26922 * tree-stdarg.c (pass_data_stdarg): Run TODO_remove_unused_locals
26925 2021-04-27 Richard Biener <rguenther@suse.de>
26927 PR tree-optimization/99912
26928 * passes.def (pass_all_optimizations): Add pass_dse before
26929 the first pass_dce, move the first pass_dse before the
26930 pass_dce following pass_pre.
26932 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26934 PR tree-optimization/95527
26935 * generic-match-head.c: Include tm.h.
26936 * gimple-match-head.c: Include tm.h.
26937 * match.pd (CLZ == INTEGER_CST): Don't use
26938 #ifdef CLZ_DEFINED_VALUE_AT_ZERO, only test CLZ_DEFINED_VALUE_AT_ZERO
26939 if clz == CFN_CLZ. Add missing val declaration.
26940 (CTZ cmp CST): New simplifications.
26942 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26944 PR tree-optimization/96696
26945 * expr.c (expand_expr_divmod): New function.
26946 (expand_expr_real_2) <case TRUNC_DIV_EXPR>: Use it for truncations and
26947 divisions. Formatting fixes.
26948 <case MULT_EXPR>: Optimize x / y * y as x - x % y if the latter is
26951 2021-04-27 Martin Jambor <mjambor@suse.cz>
26954 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
26955 If removing a call statement LHS SSA name, release it.
26957 2021-04-27 Richard Earnshaw <rearnsha@arm.com>
26960 * config/arm/arm.c (THUMB2_WORK_REGS): Check PIC_OFFSET_TABLE_REGNUM
26961 is valid before including it in the mask.
26963 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
26966 * config/aarch64/aarch64.c (aarch64_comp_type_attributes): Handle
26969 2021-04-27 Richard Biener <rguenther@suse.de>
26971 PR tree-optimization/100051
26972 * tree-ssa-alias.c (indirect_ref_may_alias_decl_p): Add
26973 disambiguator based on access size vs. decl size.
26975 2021-04-27 Richard Biener <rguenther@suse.de>
26977 PR tree-optimization/100278
26978 * tree-ssa-pre.c (compute_avail): Give up when we cannot
26979 adjust TBAA beacuse of mismatching bases.
26981 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26984 * config/i386/i386.md (*<insn><mode>3_mask, *<insn><mode>3_mask_1):
26985 For any_rotate define_insn_split and following splitters, use
26986 SWI iterator instead of SWI48.
26988 2021-04-27 Richard Biener <rguenther@suse.de>
26990 PR tree-optimization/99776
26991 * match.pd (bit_field_ref (ctor)): Relax element extract
26992 type compatibility checks.
26994 2021-04-27 Cui,Lili <lili.cui@intel.com>
26996 * common/config/i386/i386-common.c (processor_names):
26997 Sync processor_names with processor_type.
26998 * config/i386/i386-options.c (processor_cost_table):
26999 Sync processor_cost_table with processor_type.
27001 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27003 * value-range.cc (irange::irange_set_1bit_anti_range): Add assert.
27004 (irange::set): Call irange_set_1bit_anti_range for handling all
27005 1-bit ranges. Fall through on ~[MIN,MAX].
27007 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27009 * value-range.cc (irange::legacy_num_pairs): Remove.
27010 (irange::invert): Change gcc_assert to gcc_checking_assert.
27011 * value-range.h (irange::num_pairs): Adjust for a cached
27012 num_pairs(). Also, rename all gcc_assert's to
27013 gcc_checking_assert's.
27015 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27017 * value-range.cc (irange::operator=): Set m_kind.
27018 (irange::copy_to_legacy): Handle varying and undefined sources
27019 as a legacy copy since they can be easily copied.
27020 (irange::irange_set): Set m_kind.
27021 (irange::irange_set_anti_range): Same.
27022 (irange::set): Rename normalize_min_max to normalize_kind.
27023 (irange::verify_range): Adjust for multi-ranges having the
27025 (irange::irange_union): Set m_kind.
27026 (irange::irange_intersect): Same.
27027 (irange::invert): Same.
27028 * value-range.h (irange::kind): Always return m_kind.
27029 (irange::varying_p): Rename to...
27030 (irange::varying_comptaible_p): ...this.
27031 (irange::undefined_p): Only look at m_kind.
27032 (irange::irange): Always set VR_UNDEFINED if applicable.
27033 (irange::set_undefined): Always set VR_UNDEFINED.
27034 (irange::set_varying): Always set m_kind to VR_VARYING.
27035 (irange::normalize_min_max): Rename to...
27036 (irange::normalize_kind): ...this.
27038 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27040 * gimple-ssa-evrp-analyze.c (evrp_range_analyzer::set_ssa_range_info):
27041 Adjust for constant_p including varying_p.
27042 * tree-vrp.c (vrp_prop::finalize): Same.
27043 (determine_value_range): Same.
27044 * vr-values.c (vr_values::range_of_expr): Same.
27045 * value-range.cc (irange::symbolic_p): Do not check varying_p.
27046 (irange::constant_p): Same.
27048 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
27050 * value-range.cc (irange::legacy_lower_bound): Replace
27051 !undefined_p check with num_ranges > 0.
27052 (irange::legacy_upper_bound): Same.
27053 * value-range.h (irange::type): Same.
27054 (irange::lower_bound): Same.
27055 (irange::upper_bound): Same.
27057 2021-04-26 Richard Biener <rguenther@suse.de>
27059 PR tree-optimization/99956
27060 * gimple-loop-interchange.cc (compute_access_stride):
27061 Try instantiating the access in a shallower loop nest
27062 if instantiating failed.
27063 (compute_access_strides): Pass adjustable loop_nest
27064 to compute_access_stride.
27066 2021-04-26 Christophe Lyon <christophe.lyon@linaro.org>
27068 * doc/sourcebuild.texi (arm_cmse_hw): Document.
27070 2021-04-26 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27072 * config/aarch64/iterators.md (vwcore): Handle V4BF, V8BF.
27074 2021-04-26 Thomas Schwinge <thomas@codesourcery.com>
27075 Nathan Sidwell <nathan@codesourcery.com>
27076 Tom de Vries <vries@codesourcery.com>
27077 Julian Brown <julian@codesourcery.com>
27078 Kwok Cheung Yeung <kcy@codesourcery.com>
27080 * omp-offload.c (oacc_validate_dims): Implement
27081 '-Wopenacc-parallelism'.
27082 * doc/invoke.texi (-Wopenacc-parallelism): Document.
27084 2021-04-26 Richard Biener <rguenther@suse.de>
27086 * tree-cfg.h (gimplify_build1): Remove.
27087 (gimplify_build2): Likewise.
27088 (gimplify_build3): Likewise.
27089 * tree-cfg.c (gimplify_build1): Move to tree-vect-generic.c.
27090 (gimplify_build2): Likewise.
27091 (gimplify_build3): Likewise.
27092 * tree-vect-generic.c (gimplify_build1): Move from tree-cfg.c.
27094 (gimplify_build2): Likewise.
27095 (gimplify_build3): Likewise.
27096 (tree_vec_extract): Use resimplify with following SSA edges.
27097 (expand_vector_parallel): Avoid passing NULL size/bitpos
27098 to tree_vec_extract.
27099 * expr.c (store_constructor): Deal with zero-element CTORs.
27100 * match.pd (bit_field_ref <vector CTOR>): Make sure to
27101 produce vector constants when possible.
27103 2021-04-26 Richard Biener <rguenther@suse.de>
27105 * tree-complex.c: Include gimple-fold.h.
27106 (expand_complex_addition): Use gimple_build.
27107 (expand_complex_multiplication_components): Likewise.
27108 (expand_complex_multiplication): Likewise.
27109 (expand_complex_div_straight): Likewise.
27110 (expand_complex_div_wide): Likewise.
27111 (expand_complex_division): Likewise.
27112 (expand_complex_conjugate): Likewise.
27113 (expand_complex_comparison): Likewise.
27115 2021-04-26 Richard Biener <rguenther@suse.de>
27117 * tree-ssa-phiopt.c (two_value_replacement): Remove use
27118 of legacy gimplify_buildN API.
27120 2021-04-26 Richard Biener <rguenther@suse.de>
27122 PR tree-optimization/99473
27123 * tree-ssa-phiopt.c (cond_store_replacement): Handle all
27126 2021-04-26 Richard Biener <rguenther@suse.de>
27128 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin):
27129 Use replace_call_with_value.
27131 2021-04-26 Richard Biener <rguenther@suse.de>
27133 * tree-ssa-propagate.h (valid_gimple_rhs_p): Remove.
27134 (update_gimple_call): Likewise.
27135 (update_call_from_tree): Likewise.
27136 * tree-ssa-propagate.c (valid_gimple_rhs_p): Remove.
27137 (valid_gimple_call_p): Likewise.
27138 (move_ssa_defining_stmt_for_defs): Likewise.
27139 (finish_update_gimple_call): Likewise.
27140 (update_gimple_call): Likewise.
27141 (update_call_from_tree): Likewise.
27142 (propagate_tree_value_into_stmt): Use replace_call_with_value.
27143 * gimple-fold.h (update_gimple_call): Declare.
27144 * gimple-fold.c (valid_gimple_rhs_p): Move here from
27145 tree-ssa-propagate.c.
27146 (update_gimple_call): Likewise.
27147 (valid_gimple_call_p): Likewise.
27148 (finish_update_gimple_call): Likewise, and simplify.
27149 (gimplify_and_update_call_from_tree): Implement
27150 update_call_from_tree functionality, avoid excessive
27151 push/pop_gimplify_context.
27152 (gimple_fold_builtin): Use only gimplify_and_update_call_from_tree.
27153 (gimple_fold_call): Likewise.
27154 * gimple-ssa-sprintf.c (try_substitute_return_value): Likewise.
27155 * tree-ssa-ccp.c (ccp_folder::fold_stmt): Likewise.
27156 (pass_fold_builtins::execute): Likewise.
27157 (optimize_stack_restore): Use replace_call_with_value.
27158 * tree-cfg.c (fold_loop_internal_call): Likewise.
27159 * tree-ssa-dce.c (maybe_optimize_arith_overflow): Use
27160 only gimplify_and_update_call_from_tree.
27161 * tree-ssa-strlen.c (handle_builtin_strlen): Likewise.
27162 (handle_builtin_strchr): Likewise.
27163 * tsan.c: Include gimple-fold.h instead of tree-ssa-propagate.h.
27165 2021-04-26 Jakub Jelinek <jakub@redhat.com>
27168 * vmsdbgout.c (ASM_OUTPUT_DEBUG_STRING, vmsdbgout_begin_block,
27169 vmsdbgout_end_block, lookup_filename, vmsdbgout_source_line): Remove
27172 2021-04-25 liuhongt <hongtao.liu@intel.com>
27175 * config/i386/i386-builtin.def (BDESC): Change the icode of
27176 the following builtins to CODE_FOR_nothing.
27177 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
27178 IX86_BUILTIN_PCMPEQB128, IX86_BUILTIN_PCMPEQW128,
27179 IX86_BUILTIN_PCMPEQD128, IX86_BUILTIN_PCMPEQQ,
27180 IX86_BUILTIN_PCMPEQB256, IX86_BUILTIN_PCMPEQW256,
27181 IX86_BUILTIN_PCMPEQD256, IX86_BUILTIN_PCMPEQQ256,
27182 IX86_BUILTIN_PCMPGTB128, IX86_BUILTIN_PCMPGTW128,
27183 IX86_BUILTIN_PCMPGTD128, IX86_BUILTIN_PCMPGTQ,
27184 IX86_BUILTIN_PCMPGTB256, IX86_BUILTIN_PCMPGTW256,
27185 IX86_BUILTIN_PCMPGTD256, IX86_BUILTIN_PCMPGTQ256.
27186 * config/i386/sse.md (avx2_eq<mode>3): Deleted.
27187 (sse2_eq<mode>3): Ditto.
27188 (sse4_1_eqv2di3): Ditto.
27189 (sse2_gt<mode>3): Rename to ..
27190 (*sse2_gt<mode>3): .. this.
27192 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27195 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27198 * config/darwin.c (darwin_binds_local_p): Assume that any
27199 public symbol might be interposed for PIC code. Update function
27200 header comment to reflect current Darwin capability.
27202 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27205 * config/darwin.c (darwin_binds_local_p): Assume that any
27206 public symbol might be interposed for PIC code. Update function
27207 header comment to reflect current Darwin capability.
27209 2021-04-24 Richard Sandiford <richard.sandiford@arm.com>
27211 * doc/sourcebuild.texi: Document no-opts and any-opts target
27214 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
27216 * config/bpf/bpf.h (ASM_OUTPUT_ALIGNED_BSS): Use .type and .lcomm.
27218 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
27220 * config/bpf/bpf.h (FUNCTION_BOUNDARY): Set to 64.
27222 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
27225 * config/i386/i386-options.c (ix86_option_override_internal):
27226 Error out when -m96bit-long-double is used with 64bit targets.
27227 * config/i386/i386.md (*pushxf_rounded): Remove pattern.
27229 2021-04-23 Martin Liska <mliska@suse.cz>
27231 * lto-wrapper.c: Remove FIXME about usage of
27232 hardware_concurrency. The function is not on par with
27235 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
27238 * config/i386/sync.md (FILD_ATOMIC/FIST_ATOMIC FP load peephole2):
27239 Copy operand 3 to operand 4. Use sse_reg_operand
27240 as operand 3 predicate.
27241 (FILD_ATOMIC/FIST_ATOMIC FP load peephole2 with mem blockage): Ditto.
27242 (LDX_ATOMIC/STX_ATOMIC FP load peephole2): Ditto.
27243 (LDX_ATOMIC/LDX_ATOMIC FP load peephole2 with mem blockage): Ditto.
27244 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2):
27245 Copy operand 1 to operand 0.
27246 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2 with mem blockage): Ditto.
27247 (LDX_ATOMIC/STX_ATOMIC FP store peephole2): Ditto.
27248 (LDX_ATOMIC/LDX_ATOMIC FP store peephole2 with mem blockage): Ditto.
27250 2021-04-23 Alex Coplan <alex.coplan@arm.com>
27252 PR rtl-optimization/100230
27253 * early-remat.c (early_remat::sort_candidates): Use delete[]
27254 instead of delete for array allocated with new[].
27256 2021-04-23 Richard Biener <rguenther@suse.de>
27258 * genmatch.c (lower_cond): Remove VEC_COND_EXPR special-casing.
27259 (capture_info::capture_info): Likewise.
27260 (capture_info::walk_match): Likewise.
27261 (expr::gen_transform): Likewise.
27262 (dt_simplify::gen_1): Likewise.
27263 * gimple-match-head.c (maybe_resimplify_conditional_op):
27264 Remove VEC_COND_EXPR special-casing.
27265 (gimple_simplify): Likewise.
27266 * gimple.c (gimple_could_trap_p_1): Adjust.
27267 * tree-ssa-pre.c (compute_avail): Allow VEC_COND_EXPR
27268 to participate in PRE.
27270 2021-04-23 Richard Biener <rguenther@suse.de>
27272 * cfganal.c (connect_infinite_loops_to_exit): First call
27273 add_noreturn_fake_exit_edges.
27274 * ipa-sra.c (process_scan_results): Do not call the now redundant
27275 add_noreturn_fake_exit_edges.
27276 * predict.c (tree_estimate_probability): Likewise.
27277 (rebuild_frequencies): Likewise.
27278 * store-motion.c (one_store_motion_pass): Likewise.
27280 2021-04-23 Richard Biener <rguenther@suse.de>
27282 PR tree-optimization/100222
27283 * predict.c (pass_profile::execute): Remove redundant call to
27284 mark_irreducible_loops.
27285 (report_predictor_hitrates): Likewise.
27287 2021-04-23 Richard Biener <rguenther@suse.de>
27289 * tree-ssa-loop-ivopts.c (rewrite_use_nonlinear_expr): Avoid
27290 valid_gimple_rhs_p by instead gimplifying to one.
27292 2021-04-23 Richard Biener <rguenther@suse.de>
27294 PR tree-optimization/99971
27295 * tree-vect-data-refs.c (vect_slp_analyze_node_dependences):
27296 Always use TBAA for loads.
27298 2021-04-23 liuhongt <hongtao.liu@intel.com>
27301 * config/i386/i386-options.c (ix86_option_override_internal):
27302 Clear MASK_AVX256_SPLIT_UNALIGNED_LOAD/STORE in x_target_flags
27303 when X86_TUNE_AVX256_UNALIGNED_LOAD/STORE_OPTIMAL is enabled
27304 by target attribute.
27306 2021-04-23 David Edelsohn <dje.gcc@gmail.com>
27308 * config/rs6000/aix71.h (PREFERRED_DEBUGGING_TYPE): Change to
27310 * config/rs6000/aix72.h (PREFERRED_DEBUGGING_TYPE): Same.
27312 2021-04-22 David Edelsohn <dje.gcc@gmail.com>
27314 * config.gcc (powerpc-ibm-aix6.*): Remove.
27315 * config/rs6000/aix61.h: Delete.
27317 2021-04-22 Martin Liska <mliska@suse.cz>
27319 PR testsuite/100159
27320 PR testsuite/100192
27321 * builtins.c (expand_builtin): Fix typos and missing comments.
27322 * dwarf2out.c (gen_subprogram_die): Likewise.
27323 (gen_struct_or_union_type_die): Likewise.
27325 2021-04-22 Uroš Bizjak <ubizjak@gmail.com>
27328 * config/i386/i386-expand.c (ix86_expand_convert_uns_sidf_sse):
27329 Remove the sign with FE_DOWNWARD, where x - x = -0.0.
27331 2021-04-21 Iain Sandoe <iain@sandoe.co.uk>
27333 * config/i386/darwin.h (TARGET_64BIT): Remove definition
27334 based on TARGET_ISA_64BIT.
27335 (TARGET_64BIT_P): Remove definition based on
27336 TARGET_ISA_64BIT_P().
27338 2021-04-21 Martin Liska <mliska@suse.cz>
27341 2021-04-21 Martin Liska <mliska@suse.cz>
27343 * lto-wrapper.c (cpuset_popcount): Remove.
27344 (init_num_threads): Remove and use hardware_concurrency.
27346 2021-04-21 Martin Liska <mliska@suse.cz>
27349 * main.c (main): Call toplev::finalize in CHECKING_P mode.
27350 * ipa-modref.c (ipa_modref_c_finalize): summaries are NULL
27351 when incremental LTO linking happens.
27353 2021-04-21 Martin Liska <mliska@suse.cz>
27355 * lto-wrapper.c (run_gcc): When -flto=jobserver is used, but the
27356 makeserver cannot be detected, then use -flto=N fallback.
27358 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
27360 * acinclude.m4 (gcc_AC_INITFINI_ARRAY): When cross-compiling,
27361 default to yes for aarch64-linux-gnu.
27362 * configure: Regenerate.
27364 2021-04-21 Martin Liska <mliska@suse.cz>
27366 * lto-wrapper.c (cpuset_popcount): Remove.
27367 (init_num_threads): Remove and use hardware_concurrency.
27369 2021-04-21 Martin Liska <mliska@suse.cz>
27371 * config/i386/i386.c: Remove superfluous || TARGET_MACHO
27372 which remains to be '(... || 0)' and clang complains about it.
27373 * dwarf2out.c (AT_vms_delta): Declare conditionally.
27374 (add_AT_vms_delta): Likewise.
27375 * tree.c (fld_simplified_type): Use rather more common pattern
27376 for disabling of something (#if 0).
27377 (get_tree_code_name): Likewise.
27378 (verify_type_variant): Likewise.
27380 2021-04-21 Martin Liska <mliska@suse.cz>
27382 * config/i386/i386-expand.c (decide_alignment): Use newly named
27383 macro TARGET_CPU_P.
27384 * config/i386/i386.c (ix86_decompose_address): Likewise.
27385 (ix86_address_cost): Likewise.
27386 (ix86_lea_outperforms): Likewise.
27387 (ix86_avoid_lea_for_addr): Likewise.
27388 (ix86_add_stmt_cost): Likewise.
27389 * config/i386/i386.h (TARGET_*): Remove.
27390 (TARGET_CPU_P): New macro.
27391 * config/i386/i386.md: Use newly named macro TARGET_CPU_P.
27392 * config/i386/x86-tune-sched-atom.c (do_reorder_for_imul): Likewise.
27393 (swap_top_of_ready_list): Likewise.
27394 (ix86_atom_sched_reorder): Likewise.
27395 * config/i386/x86-tune-sched-bd.c (ix86_bd_has_dispatch): Likewise.
27396 * config/i386/x86-tune-sched.c (ix86_adjust_cost): Likewise.
27398 2021-04-21 Martin Liska <mliska@suse.cz>
27400 * config/i386/i386-options.c (TARGET_EXPLICIT_NO_SAHF_P):
27402 (SET_TARGET_NO_SAHF): Likewise.
27403 (TARGET_EXPLICIT_PREFETCH_SSE_P): Likewise.
27404 (SET_TARGET_PREFETCH_SSE): Likewise.
27405 (TARGET_EXPLICIT_NO_TUNE_P): Likewise.
27406 (SET_TARGET_NO_TUNE): Likewise.
27407 (TARGET_EXPLICIT_NO_80387_P): Likewise.
27408 (SET_TARGET_NO_80387): Likewise.
27410 * config/i386/i386.h (TARGET_*): Remove.
27411 * opth-gen.awk: Generate new used macros.
27413 2021-04-21 Martin Liska <mliska@suse.cz>
27415 * config/i386/i386.h (PTA_*): Remove.
27416 (enum pta_flag): New.
27417 (DEF_PTA): Generate PTA_* values from i386-isa.def.
27418 * config/i386/i386-isa.def: New file.
27420 2021-04-21 Alex Coplan <alex.coplan@arm.com>
27423 * config/aarch64/aarch64-bti-insert.c (aarch64_bti_j_insn_p): New.
27424 (rest_of_insert_bti): Avoid inserting duplicate bti j insns for
27425 jump table targets.
27427 2021-04-21 H.J. Lu <hjl.tools@gmail.com>
27429 * config.gcc: Install mwaitintrin.h for i[34567]86-*-* and
27430 x86_64-*-* targets.
27431 * common/config/i386/i386-common.c (OPTION_MASK_ISA2_MWAIT_SET):
27433 (OPTION_MASK_ISA2_MWAIT_UNSET): Likewise.
27434 (ix86_handle_option): Handle -mmwait.
27435 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
27436 Replace OPTION_MASK_ISA_SSE3 with OPTION_MASK_ISA2_MWAIT on
27437 __builtin_ia32_monitor and __builtin_ia32_mwait.
27438 * config/i386/i386-options.c (isa2_opts): Add -mmwait.
27439 (ix86_valid_target_attribute_inner_p): Likewise.
27440 (ix86_option_override_internal): Enable mwait/monitor
27441 instructions for -msse3.
27442 * config/i386/i386.h (TARGET_MWAIT): New.
27443 (TARGET_MWAIT_P): Likewise.
27444 * config/i386/i386.opt: Add -mmwait.
27445 * config/i386/mwaitintrin.h: New file.
27446 * config/i386/pmmintrin.h: Include <mwaitintrin.h>.
27447 * config/i386/sse.md (sse3_mwait): Replace TARGET_SSE3 with
27449 (@sse3_monitor_<mode>): Likewise.
27450 * config/i386/x86gprintrin.h: Include <mwaitintrin.h>.
27451 * doc/extend.texi: Document mwait target attribute.
27452 * doc/invoke.texi: Document -mmwait.
27454 2021-04-21 Martin Liska <mliska@suse.cz>
27456 * config/i386/i386-options.c (DEF_ENUM): Remove it.
27457 * config/i386/i386-opts.h (DEF_ENUM): Likewise.
27458 * config/i386/stringop.def (DEF_ENUM): Likewise.
27460 2021-04-21 Martin Liska <mliska@suse.cz>
27462 * tree-cfg.c (gimple_verify_flow_info): Use qD instead
27463 of print_generic_expr.
27465 2021-04-21 Jakub Jelinek <jakub@redhat.com>
27467 PR rtl-optimization/100148
27468 * cprop.c (constprop_register): Use next_nondebug_insn instead of
27471 2021-04-21 Martin Liska <mliska@suse.cz>
27474 * cgraphunit.c (cgraph_node::analyze): Remove duplicate
27475 free_dominance_info calls.
27477 2021-04-21 Richard Biener <rguenther@suse.de>
27479 * gimple-fold.c (maybe_fold_reference): Remove is_lhs
27480 parameter (and assume it to be false).
27481 (fold_gimple_assign): Adjust, remove all callers of
27482 maybe_fold_reference calling it with is_lhs true.
27483 (gimple_fold_call): Likewise.
27484 (fold_stmt_1): Likewise.
27486 2021-04-21 Richard Biener <rguenther@suse.de>
27488 * fold-const.c (pedantic_non_lvalue_loc): Remove.
27489 (fold_binary_loc): Adjust.
27490 (fold_ternary_loc): Likewise.
27492 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
27494 PR middle-end/100130
27495 * varasm.c (get_block_for_decl): Make sure that any use of the
27496 retain attribute matches the section's retain flag.
27497 (switch_to_section): Check for retain mismatches even when
27498 changing sections, but do not warn if the given decl is the
27499 section's named.decl.
27500 (output_object_block): Pass the first decl in the block (if any)
27501 to switch_to_section.
27503 2021-04-20 H.J. Lu <hjl.tools@gmail.com>
27505 * config/i386/i386-c.c (ix86_target_macros_internal): Define
27506 __CRC32__ for -mcrc32.
27507 * config/i386/i386-options.c (ix86_option_override_internal):
27508 Enable crc32 instruction for -msse4.2.
27509 * config/i386/i386.md (sse4_2_crc32<mode>): Remove TARGET_SSE4_2
27511 (sse4_2_crc32di): Likewise.
27512 * config/i386/ia32intrin.h: Use crc32 target option for CRC32
27515 2021-04-20 Segher Boessenkool <segher@kernel.crashing.org>
27518 * config/rs6000/rs6000.c (rs6000_machine_from_flags): Do not consider
27521 2021-04-20 Martin Liska <mliska@suse.cz>
27523 * doc/invoke.texi: Fix typo.
27524 * params.opt: Likewise.
27526 2021-04-20 Martin Liska <mliska@suse.cz>
27528 * doc/invoke.texi: Document new param.
27530 2021-04-19 Andrew MacLeod <amacleod@redhat.com>
27532 PR tree-optimization/100081
27533 * gimple-range-cache.h (ranger_cache): Inherit from gori_compute
27534 rather than gori_compute_cache.
27535 * gimple-range-gori.cc (is_gimple_logical_p): Move to top of file.
27536 (range_def_chain::m_logical_depth): New member.
27537 (range_def_chain::range_def_chain): Initialize m_logical_depth.
27538 (range_def_chain::get_def_chain): Don't build defchains through more
27539 than LOGICAL_LIMIT logical expressions.
27540 * params.opt (param_ranger_logical_depth): New.
27542 2021-04-19 Richard Earnshaw <rearnsha@arm.com>
27545 * config/arm/arm.c (arm_configure_build_target): Do not strip
27546 extended FPU/SIMD feature bits from the target ISA when -mfpu
27547 is specified (partial revert of r11-8168).
27549 2021-04-19 Thomas Schwinge <thomas@codesourcery.com>
27551 * params.opt (-param=openacc-kernels=): Add.
27552 * omp-oacc-kernels-decompose.cc
27553 (pass_omp_oacc_kernels_decompose::gate): Use it.
27554 * doc/invoke.texi (-fopenacc-kernels=@var{mode}): Move...
27555 (--param): ... here, 'openacc-kernels'.
27557 2021-04-19 Martin Liska <mliska@suse.cz>
27560 * gengtype.c (finish_root_table): Align function arguments
27561 in between declaration and definition.
27563 2021-04-19 Eric Botcazou <ebotcazou@adacore.com>
27565 * config/i386/winnt.c (i386_pe_seh_cold_init): Properly deal with
27566 frames larger than the SEH maximum frame size.
27568 2021-04-18 Segher Boessenkool <segher@kernel.crashing.org>
27570 PR rtl-optimization/99927
27571 * combine.c (distribute_notes) [REG_UNUSED]: If the register already
27572 is dead, just drop it.
27574 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
27577 * config/i386/winnt-d.c (TARGET_D_TEMPLATES_ALWAYS_COMDAT): Define.
27578 * doc/tm.texi: Regenerate.
27579 * doc/tm.texi.in (D language and ABI): Add @hook for
27580 TARGET_D_TEMPLATES_ALWAYS_COMDAT.
27582 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
27584 * config/darwin-d.c (darwin_d_handle_target_object_format): New
27586 (darwin_d_register_target_info): New function.
27587 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27588 * config/dragonfly-d.c (dragonfly_d_handle_target_object_format): New
27590 (dragonfly_d_register_target_info): New function.
27591 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27592 * config/freebsd-d.c (freebsd_d_handle_target_object_format): New
27594 (freebsd_d_register_target_info): New function.
27595 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27596 * config/glibc-d.c (glibc_d_handle_target_object_format): New
27598 (glibc_d_register_target_info): New function.
27599 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27600 * config/i386/i386-d.c (ix86_d_handle_target_object_format): New
27602 (ix86_d_register_target_info): Add ix86_d_handle_target_object_format
27603 as handler for objectFormat key.
27604 * config/i386/winnt-d.c (winnt_d_handle_target_object_format): New
27606 (winnt_d_register_target_info): New function.
27607 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27608 * config/netbsd-d.c (netbsd_d_handle_target_object_format): New
27610 (netbsd_d_register_target_info): New function.
27611 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27612 * config/openbsd-d.c (openbsd_d_handle_target_object_format): New
27614 (openbsd_d_register_target_info): New function.
27615 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27616 * config/pa/pa-d.c (pa_d_handle_target_object_format): New function.
27617 (pa_d_register_target_info): Add pa_d_handle_target_object_format as
27618 handler for objectFormat key.
27619 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_object_format): New
27621 (rs6000_d_register_target_info): Add
27622 rs6000_d_handle_target_object_format as handler for objectFormat key.
27623 * config/sol2-d.c (solaris_d_handle_target_object_format): New
27625 (solaris_d_register_target_info): New function.
27626 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27628 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27631 * config/aarch64/aarch64.c (aarch64_function_arg_alignment): Change
27632 abi_break argument from bool * to unsigned *, store there the pre-GCC 9
27634 (aarch64_layout_arg, aarch64_gimplify_va_arg_expr): Adjust callers.
27635 (aarch64_function_arg_regno_p): Likewise. Only emit -Wpsabi note if
27636 the old and new alignment after applying MIN/MAX to it is different.
27638 2021-04-16 Tamar Christina <tamar.christina@arm.com>
27641 * config/aarch64/aarch64-sve.md (@aarch64_sve_trn1_conv<mode>): New.
27642 * config/aarch64/aarch64.c (aarch64_expand_sve_const_pred_trn): Use new
27644 * config/aarch64/iterators.md (UNSPEC_TRN1_CONV): New.
27646 2021-04-16 Bill Schmidt <wschmidt@linux.ibm.com>
27648 * doc/extend.texi (PowerPC AltiVec/VSX Built-in Functions): Revise
27649 this section and its subsections.
27651 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27654 * config/aarch64/aarch64.md (*neg_asr_si2_extr, *extrsi5_insn_di): New
27655 define_insn patterns.
27657 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
27659 PR rtl-optimization/98689
27660 * reg-notes.def (UNTYPED_CALL): New note.
27661 * combine.c (distribute_notes): Handle it.
27662 * emit-rtl.c (try_split): Likewise.
27663 * rtlanal.c (rtx_properties::try_to_add_insn): Likewise. Assume
27664 that calls with the note implicitly set all return value registers.
27665 * builtins.c (expand_builtin_apply): Add a REG_UNTYPED_CALL
27668 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
27670 PR rtl-optimization/99596
27671 * rtlanal.c (rtx_properties::try_to_add_insn): Don't add global
27672 register accesses for const calls. Assume that pure functions
27673 can only read from global registers. Ignore cases in which
27674 the stack pointer has been marked global.
27676 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27679 * tree-vect-loop.c (vect_transform_loop): Don't remove just
27680 dead scalar .MASK_LOAD calls, but also dead .COND_* calls - replace
27681 them by their last argument.
27683 2021-04-15 Martin Liska <mliska@suse.cz>
27685 * doc/invoke.texi: Other params don't use it, remove it.
27687 2021-04-15 Richard Biener <rguenther@suse.de>
27689 * gimple-builder.h: Add deprecation note.
27691 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
27694 * attribs.h (restrict_type_identity_attributes_to): Declare.
27695 * attribs.c (restrict_type_identity_attributes_to): New function.
27697 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
27700 * attribs.h (affects_type_identity_attributes): Declare.
27701 * attribs.c (remove_attributes_matching): New function.
27702 (affects_type_identity_attributes): Likewise.
27704 2021-04-15 Jakub Jelinek <jakub@redhat.com>
27707 * config/aarch64/aarch64.md (*<LOGICAL:optab>_<SHIFT:optab><mode>3):
27708 Add combine splitters for *<LOGICAL:optab>_ashl<mode>3 with
27709 ZERO_EXTEND, SIGN_EXTEND or AND.
27711 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27713 PR rtl-optimization/99929
27714 * rtl.h (same_vector_encodings_p): New function.
27715 * cse.c (exp_equiv_p): Check that CONST_VECTORs have the same encoding.
27716 * cselib.c (rtx_equal_for_cselib_1): Likewise.
27717 * jump.c (rtx_renumbered_equal_p): Likewise.
27718 * lra-constraints.c (operands_match_p): Likewise.
27719 * reload.c (operands_match_p): Likewise.
27720 * rtl.c (rtx_equal_p_cb, rtx_equal_p): Likewise.
27722 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27724 * print-rtl.c (rtx_writer::print_rtx_operand_codes_E_and_V): Print
27725 more information about variable-length CONST_VECTORs.
27727 2021-04-14 Vladimir N. Makarov <vmakarov@redhat.com>
27729 PR rtl-optimization/100066
27730 * lra-constraints.c (split_reg): Check paradoxical_subreg_p for
27731 ordered modes when choosing splitting mode for hard reg.
27733 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27736 * config/aarch64/aarch64.c (aarch64_expand_sve_const_vector_sel):
27738 (aarch64_expand_sve_const_vector): Use it for nelts_per_pattern==2.
27740 2021-04-14 Andreas Krebbel <krebbel@linux.ibm.com>
27742 * config/s390/s390-builtins.def (O_M5, O_M12, ...): Add new macros
27743 for mask operand types.
27744 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
27745 (s390_vec_permi_dbl, s390_vpdi): Use the M5 type for the immediate
27747 (s390_vec_msum_u128, s390_vmslg): Use the M12 type for the
27749 * config/s390/s390.c (s390_const_operand_ok): Check the new
27750 operand types and generate a list of valid values.
27752 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27754 * doc/tm.texi: Regenerate.
27755 * doc/tm.texi.in (D language and ABI): Add @hook for
27756 TARGET_D_REGISTER_OS_TARGET_INFO.
27758 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27760 * config/aarch64/aarch64-d.c (aarch64_d_handle_target_float_abi): New
27762 (aarch64_d_register_target_info): New function.
27763 * config/aarch64/aarch64-protos.h (aarch64_d_register_target_info):
27765 * config/aarch64/aarch64.h (TARGET_D_REGISTER_CPU_TARGET_INFO):
27767 * config/arm/arm-d.c (arm_d_handle_target_float_abi): New function.
27768 (arm_d_register_target_info): New function.
27769 * config/arm/arm-protos.h (arm_d_register_target_info): Declare.
27770 * config/arm/arm.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27771 * config/i386/i386-d.c (ix86_d_handle_target_float_abi): New function.
27772 (ix86_d_register_target_info): New function.
27773 * config/i386/i386-protos.h (ix86_d_register_target_info): Declare.
27774 * config/i386/i386.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27775 * config/mips/mips-d.c (mips_d_handle_target_float_abi): New function.
27776 (mips_d_register_target_info): New function.
27777 * config/mips/mips-protos.h (mips_d_register_target_info): Declare.
27778 * config/mips/mips.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27779 * config/pa/pa-d.c (pa_d_handle_target_float_abi): New function.
27780 (pa_d_register_target_info): New function.
27781 * config/pa/pa-protos.h (pa_d_register_target_info): Declare.
27782 * config/pa/pa.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27783 * config/riscv/riscv-d.c (riscv_d_handle_target_float_abi): New
27785 (riscv_d_register_target_info): New function.
27786 * config/riscv/riscv-protos.h (riscv_d_register_target_info): Declare.
27787 * config/riscv/riscv.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27788 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_float_abi): New
27790 (rs6000_d_register_target_info): New function.
27791 * config/rs6000/rs6000-protos.h (rs6000_d_register_target_info):
27793 * config/rs6000/rs6000.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27794 * config/s390/s390-d.c (s390_d_handle_target_float_abi): New function.
27795 (s390_d_register_target_info): New function.
27796 * config/s390/s390-protos.h (s390_d_register_target_info): Declare.
27797 * config/s390/s390.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27798 * config/sparc/sparc-d.c (sparc_d_handle_target_float_abi): New
27800 (sparc_d_register_target_info): New function.
27801 * config/sparc/sparc-protos.h (sparc_d_register_target_info): Declare.
27802 * config/sparc/sparc.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27803 * doc/tm.texi: Regenerate.
27804 * doc/tm.texi.in (D language and ABI): Add @hook for
27805 TARGET_D_REGISTER_CPU_TARGET_INFO.
27807 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27809 * config/i386/i386-d.c (ix86_d_has_stdcall_convention): New function.
27810 * config/i386/i386-protos.h (ix86_d_has_stdcall_convention): Declare.
27811 * config/i386/i386.h (TARGET_D_HAS_STDCALL_CONVENTION): Define.
27812 * doc/tm.texi: Regenerate.
27813 * doc/tm.texi.in (D language and ABI): Add @hook for
27814 TARGET_D_HAS_STDCALL_CONVENTION.
27816 2021-04-14 Richard Biener <rguenther@suse.de>
27818 * tree-cfg.c (verify_gimple_assign_ternary): Verify that
27819 VEC_COND_EXPRs have a gimple_val condition.
27820 * tree-ssa-propagate.c (valid_gimple_rhs_p): VEC_COND_EXPR
27821 can no longer have a GENERIC condition.
27823 2021-04-14 Richard Earnshaw <rearnsha@arm.com>
27826 * config/arm/arm.c (arm_configure_build_target): Strip isa_all_fpbits
27827 from the isa_delta when -mfpu has been used.
27828 (arm_options_perform_arch_sanity_checks): It's the architecture that
27829 lacks an FPU not the processor.
27831 2021-04-13 Richard Biener <rguenther@suse.de>
27833 PR tree-optimization/100053
27834 * tree-ssa-sccvn.c (vn_nary_op_get_predicated_value): Do
27835 not use optimistic dominance queries for backedges to validate
27837 (dominated_by_p_w_unex): Add parameter to ignore executable
27838 state on backedges.
27839 (rpo_elim::eliminate_avail): Adjust.
27841 2021-04-13 Jakub Jelinek <jakub@redhat.com>
27844 * config/aarch64/aarch64.md (*aarch64_bfxil<mode>_extr,
27845 *aarch64_bfxilsi_extrdi): New define_insn patterns.
27847 2021-04-13 Jakub Jelinek <jakub@redhat.com>
27850 * simplify-rtx.c (simplify_immed_subreg): For MODE_COMPOSITE_P
27851 outermode, return NULL if the result doesn't encode back to the
27852 original byte sequence.
27853 (simplify_gen_subreg): Don't create SUBREGs from constants to
27854 MODE_COMPOSITE_P outermode.
27856 2021-04-12 Jakub Jelinek <jakub@redhat.com>
27858 PR rtl-optimization/99905
27859 * combine.c (expand_compound_operation): If pos + len > modewidth,
27860 perform the right shift by pos in inner_mode and then convert to mode,
27861 instead of trying to simplify a shift of rtx with inner_mode by pos
27862 as if it was a shift in mode.
27864 2021-04-12 Jakub Jelinek <jakub@redhat.com>
27867 * combine.c (simplify_and_const_int_1): Don't optimize varop
27868 away if it has side-effects.
27870 2021-04-12 Martin Liska <mliska@suse.cz>
27872 * doc/extend.texi: Escape @smallexample content.
27874 2021-04-12 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
27876 * config/s390/s390.md ("*movdi_31", "*movdi_64"): Add
27877 alternative in order to load a DFP zero.
27879 2021-04-12 Martin Liska <mliska@suse.cz>
27881 * doc/extend.texi: Be more precise in documentation
27882 of symver attribute.
27884 2021-04-12 Martin Liska <mliska@suse.cz>
27887 * gimplify.c (gimplify_expr): Right now, we unpoison all
27888 variables before a goto <dest>. We should not do it if we are
27891 2021-04-12 Cui,Lili <lili.cui@intel.com>
27893 * common/config/i386/cpuinfo.h (get_intel_cpu): Handle
27895 * common/config/i386/i386-common.c (processor_names): Add
27897 (processor_alias_table): Add rocketlake.
27898 * common/config/i386/i386-cpuinfo.h (processor_subtypes): Add
27899 INTEL_COREI7_ROCKETLAKE.
27900 * config.gcc: Add -march=rocketlake.
27901 * config/i386/i386-c.c (ix86_target_macros_internal): Handle
27903 * config/i386/i386-options.c (m_ROCKETLAKE) : Define.
27904 (processor_cost_table): Add rocketlake cost.
27905 * config/i386/i386.h (ix86_size_cost) : Define
27907 (processor_type) : Add PROCESSOR_ROCKETLAKE.
27908 (PTA_ROCKETLAKE): Ditto.
27909 * doc/extend.texi: Add rocketlake.
27910 * doc/invoke.texi: Add rocketlake.
27912 2021-04-12 Cui,Lili <lili.cui@intel.com>
27914 * config/i386/i386.h (PTA_ALDERLAKE): Change alderlake ISA list.
27915 * config/i386/i386-options.c (m_CORE_AVX2): Add m_ALDERLAKE.
27916 * common/config/i386/cpuinfo.h (get_intel_cpu): Add AlderLake model.
27917 * doc/invoke.texi: Change alderlake ISA list.
27919 2021-04-11 Hafiz Abid Qadeer <abidh@codesourcery.com>
27921 PR middle-end/98088
27922 * omp-expand.c (expand_oacc_collapse_init): Update condition in
27925 2021-04-10 H.J. Lu <hjl.tools@gmail.com>
27928 * config/i386/serializeintrin.h (_serialize): Defined as macro.
27930 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27933 * expr.c (expand_expr_addr_expr_1): Test is_global_var rather than
27934 just TREE_STATIC on COMPOUND_LITERAL_EXPR_DECLs.
27936 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27938 PR middle-end/99989
27939 * gimple-ssa-warn-alloca.c
27940 (alloca_type_and_limit::alloca_type_and_limit): Initialize limit to
27941 0 with integer precision unconditionally.
27943 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27945 PR rtl-optimization/98601
27946 * rtlanal.c (rtx_addr_can_trap_p_1): Allow in assert unknown size
27947 not just for BLKmode, but also for VOIDmode. For STRICT_ALIGNMENT
27948 unaligned_mems handle VOIDmode like BLKmode.
27950 2021-04-10 Jan Hubicka <hubicka@ucw.cz>
27953 * tree.c (free_lang_data_in_decl): Do not release body of
27954 declare_variant_alt.
27956 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
27958 * config/aarch64/aarch64.c (aarch64_option_restore): If the
27959 architecture was specified explicitly and the tuning wasn't,
27960 tune for the architecture rather than the configured default CPU.
27962 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
27964 * config/aarch64/aarch64.md (tlsdesc_small_sve_<mode>): Use X30
27965 as the temporary register.
27967 2021-04-09 Martin Liska <mliska@suse.cz>
27969 * doc/extend.texi: Move non-target attributes on the top level.
27971 2021-04-09 Martin Liska <mliska@suse.cz>
27973 * doc/invoke.texi: Document minimum and maximum value of the
27974 argument for both supported compression algorithms.
27976 2021-04-08 David Edelsohn <dje.gcc@gmail.com>
27978 * config/rs6000/rs6000.c (rs6000_xcoff_select_section): Select
27979 TLS BSS before TLS data.
27980 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .comm.
27982 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
27984 * doc/sourcebuild.texi (stdint_types_mbig_endian): Document.
27986 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
27988 * match.pd: Extend vec_cond folds to handle shifts.
27990 2021-04-08 Maciej W. Rozycki <macro@orcam.me.uk>
27992 * config/vax/vax.md: Fix comment for `*bit<mode>' pattern's
27995 2021-04-08 Alex Coplan <alex.coplan@arm.com>
27998 * config/arm/iterators.md (MVE_vecs): New.
27999 (V_elem): Also handle V2DF.
28000 * config/arm/mve.md (*mve_mov<mode>): Rename to ...
28001 (*mve_vdup<mode>): ... this. Remove second alternative since
28002 vec_duplicate of const_int is not canonical RTL, and we don't
28003 want to match symbol_refs.
28004 (*mve_vec_duplicate<mode>): Delete (pattern is redundant).
28006 2021-04-08 Xionghu Luo <luoxhu@linux.ibm.com>
28008 * fold-const.c (fold_single_bit_test): Fix typo.
28009 * print-rtl.c (print_rtx_insn_vec): Call print_rtl_single
28012 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
28014 PR tree-optimization/97513
28015 * tree-vect-slp.c (vect_add_slp_permutation): New function,
28017 (vectorizable_slp_permutation): ...here. Detect cases in which
28018 all VEC_PERM_EXPRs are guaranteed to have the same stepped
28019 permute vector and only generate one permute vector for that case.
28020 Extend that case to handle variable-length vectors.
28022 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
28024 PR tree-optimization/99873
28025 * tree-vect-slp.c (vect_slp_prefer_store_lanes_p): New function.
28026 (vect_build_slp_instance): Don't split store groups that could
28027 use IFN_STORE_LANES.
28029 2021-04-07 Jakub Jelinek <jakub@redhat.com>
28032 * varasm.c (output_constant_pool_contents): Don't strip name encoding
28033 from XSTR (desc->sym, 0) or from label before passing those to
28036 2021-04-07 Richard Biener <rguenther@suse.de>
28038 PR tree-optimization/99954
28039 * tree-loop-distribution.c: Include tree-affine.h.
28040 (generate_memcpy_builtin): Try using tree-affine to prove
28042 (loop_distribution::classify_builtin_ldst): Always classify
28045 2021-04-07 Richard Biener <rguenther@suse.de>
28047 PR tree-optimization/99947
28048 * tree-vect-loop.c (vectorizable_induction): Pre-allocate
28049 steps vector to avoid pushing elements from the reallocated
28052 2021-04-07 Richard Biener <rguenther@suse.de>
28054 * tree-ssa-sccvn.h (print_vn_reference_ops): Declare.
28055 * tree-ssa-pre.c (print_pre_expr): Factor out VN reference operand
28057 * tree-ssa-sccvn.c (print_vn_reference_ops): ... into this new
28059 (debug_vn_reference_ops): New.
28061 2021-04-07 Bin Cheng <bin.cheng@linux.alibaba.com>
28063 PR tree-optimization/98736
28064 * tree-loop-distribution.c
28065 * (loop_distribution::bb_top_order_init):
28066 Compute RPO with programing order preserved by calling function
28067 rev_post_order_and_mark_dfs_back_seme.
28069 2021-04-06 Vladimir N. Makarov <vmakarov@redhat.com>
28072 * lra-constraints.c (split_reg): Don't check paradoxical_subreg_p.
28073 * lra-lives.c (clear_sparseset_regnos, regnos_in_sparseset_p): New
28075 (process_bb_lives): Don't update biggest mode of hard reg for
28076 implicit in multi-register group. Use the new functions for
28077 updating dead_set and unused_set by register notes.
28079 2021-04-06 Xianmiao Qu <xianmiao_qu@c-sky.com>
28081 * config/csky/csky_pipeline_ck802.md : Use insn reservation name
28084 2021-04-06 H.J. Lu <hjl.tools@gmail.com>
28086 * config/i386/x86-tune-costs.h (skylake_memcpy): Updated.
28087 (skylake_memset): Likewise.
28088 (skylake_cost): Change CLEAR_RATIO to 17.
28089 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
28090 Replace m_CANNONLAKE, m_ICELAKE_CLIENT, m_ICELAKE_SERVER,
28091 m_TIGERLAKE and m_SAPPHIRERAPIDS with m_SKYLAKE and m_CORE_AVX512.
28093 2021-04-06 Richard Biener <rguenther@suse.de>
28095 PR tree-optimization/99880
28096 * tree-vect-loop.c (maybe_set_vectorized_backedge_value): Only
28097 set vectorized defs of relevant PHIs.
28099 2021-04-06 Richard Biener <rguenther@suse.de>
28101 PR tree-optimization/99924
28102 * tree-vect-slp.c (vect_bb_partition_graph_r): Do not mark
28103 nodes w/o scalar stmts as visited.
28105 2021-04-06 Alex Coplan <alex.coplan@arm.com>
28108 * config/arm/arm.c (arm_libcall_uses_aapcs_base): Also use base
28109 PCS for [su]fix_optab.
28111 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
28113 * config/darwin.c (machopic_legitimize_pic_address): Check
28114 that the current pic register is one of the hard reg set
28115 before setting liveness.
28117 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
28119 * config/darwin.c (machopic_legitimize_pic_address): Fix
28120 whitespace, remove unused code.
28122 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28124 PR tree-optimization/99882
28125 * gimple-ssa-store-merging.c (bswap_view_convert): Handle val with
28128 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28130 PR rtl-optimization/99863
28131 * dse.c (replace_read): Drop regs_live argument. Instead of
28132 regs_live, use store_insn->fixed_regs_live if non-NULL,
28133 otherwise punt if insns sequence clobbers or sets any hard
28136 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28139 * targhooks.h (default_print_patchable_function_entry_1): Declare.
28140 * targhooks.c (default_print_patchable_function_entry_1): New function,
28141 copied from default_print_patchable_function_entry with an added flags
28143 (default_print_patchable_function_entry): Rewritten into a small
28144 wrapper around default_print_patchable_function_entry_1.
28145 * config/rs6000/rs6000.c (TARGET_ASM_PRINT_PATCHABLE_FUNCTION_ENTRY):
28147 (rs6000_print_patchable_function_entry): New function.
28149 2021-04-02 Eric Botcazou <ebotcazou@adacore.com>
28151 * doc/invoke.texi (fdelete-dead-exceptions): Minor tweak.
28153 2021-04-01 Jason Merrill <jason@redhat.com>
28156 * common.opt: Document v15 and v16.
28158 2021-04-01 Richard Biener <rguenther@suse.de>
28160 PR tree-optimization/99863
28161 * gimplify.c (gimplify_init_constructor): Recompute vector
28164 2021-04-01 Jakub Jelinek <jakub@redhat.com>
28166 * doc/extend.texi (symver attribute): Fix up syntax errors
28169 2021-04-01 Jakub Jelinek <jakub@redhat.com>
28171 PR tree-optimization/96573
28172 * gimple-ssa-store-merging.c (init_symbolic_number): Handle
28173 also pointer types.
28175 2021-04-01 Richard Biener <rguenther@suse.de>
28177 PR tree-optimization/99856
28178 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Promote
28179 precision to vector element precision.
28181 2021-04-01 Martin Jambor <mjambor@suse.cz>
28183 PR tree-optimization/97009
28184 * tree-sra.c (access_or_its_child_written): New function.
28185 (propagate_subaccesses_from_rhs): Use it instead of a simple grp_write
28188 2021-03-31 Jan Hubicka <hubicka@ucw.cz>
28191 * cif-code.def (USES_COMDAT_LOCAL): Make CIF_FINAL_NORMAL.
28193 2021-03-31 Pat Haugen <pthaugen@linux.ibm.com>
28196 * config/rs6000/altivec.md (xxspltiw_v4si, xxspltiw_v4sf_inst,
28197 xxspltidp_v2df_inst, xxsplti32dx_v4si_inst, xxsplti32dx_v4sf_inst,
28198 xxblend_<mode>, xxpermx_inst, xxeval): Mark prefixed.
28199 * config/rs6000/mma.md (mma_<vvi4i4i8>, mma_<avvi4i4i8>,
28200 mma_<vvi4i4i2>, mma_<avvi4i4i2>, mma_<vvi4i4>, mma_<avvi4i4>,
28201 mma_<pvi4i2>, mma_<apvi4i2>, mma_<vvi4i4i4>, mma_<avvi4i4i4>):
28203 * config/rs6000/rs6000.c (rs6000_final_prescan_insn): Adjust test.
28204 * config/rs6000/rs6000.md (define_attr "maybe_prefixed"): New.
28205 (define_attr "prefixed"): Update initializer.
28207 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28210 * dwarf2out.c (debug_ranges_dwo_section): New variable.
28211 (DW_RANGES_IDX_SKELETON): Define.
28212 (struct dw_ranges): Add begin_entry and end_entry members.
28213 (DEBUG_DWO_RNGLISTS_SECTION): Define.
28214 (add_ranges_num): Adjust r initializer for addition of *_entry
28216 (add_ranges_by_labels): For -gsplit-dwarf and force_direct,
28217 set idx to DW_RANGES_IDX_SKELETON.
28218 (use_distinct_base_address_for_range): New function.
28219 (index_rnglists): Don't set r->idx if it is equal to
28220 DW_RANGES_IDX_SKELETON. Initialize r->begin_entry and
28221 r->end_entry for -gsplit-dwarf if those will be needed by
28223 (output_rnglists): Add DWO argument. If true, switch to
28224 debug_ranges_dwo_section rather than debug_ranges_section.
28225 Adjust l1/l2 label indexes. Only output the offset table when
28226 dwo is true and don't include in there the skeleton range
28227 entry if present. For -gsplit-dwarf, skip ranges that belong
28228 to the other rnglists section. Change return type from void
28229 to bool and return true if there are any range entries for
28230 the other section. For dwarf_split_debug_info use
28231 DW_RLE_startx_endx, DW_RLE_startx_length and DW_RLE_base_addressx
28232 entries instead of DW_RLE_start_end, DW_RLE_start_length and
28233 DW_RLE_base_address. Use use_distinct_base_address_for_range.
28234 (init_sections_and_labels): Initialize debug_ranges_dwo_section
28235 if -gsplit-dwarf and DWARF >= 5. Adjust ranges_section_label
28236 and range_base_label indexes.
28237 (dwarf2out_finish): Call index_rnglists earlier before finalizing
28238 .debug_addr. Never emit DW_AT_rnglists_base attribute. For
28239 -gsplit-dwarf and DWARF >= 5 call output_rnglists up to twice
28240 with different dwo arguments.
28241 (dwarf2out_c_finalize): Clear debug_ranges_dwo_section.
28243 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28245 PR tree-optimization/98268
28246 * gimple-fold.c (maybe_canonicalize_mem_ref_addr): Call
28247 recompute_tree_invariant_for_addr_expr after successfully
28248 folding a TARGET_MEM_REF that occurs inside an ADDR_EXPR.
28250 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28252 PR tree-optimization/99726
28253 * tree-data-ref.c (create_intersect_range_checks_index): Bail
28254 out if there is more than one access function SCEV for the loop
28257 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28259 PR rtl-optimization/97141
28260 PR rtl-optimization/98726
28261 * emit-rtl.c (valid_for_const_vector_p): Return true for
28263 * rtx-vector-builder.h (rtx_vector_builder::step): Return a
28264 poly_wide_int instead of a wide_int.
28265 (rtx_vector_builder::apply_set): Take a poly_wide_int instead
28267 * rtx-vector-builder.c (rtx_vector_builder::apply_set): Likewise.
28268 * config/aarch64/aarch64.c (aarch64_legitimate_constant_p): Return
28269 false for CONST_VECTORs that cannot be forced to memory.
28270 * config/aarch64/aarch64-simd.md (mov<mode>): If a CONST_VECTOR
28271 is too complex to force to memory, build it up from individual
28274 2021-03-31 Jan Hubicka <jh@suse.cz>
28277 * cgraph.c (cgraph_node::release_body): Fix overactive check.
28279 2021-03-31 Christophe Lyon <christophe.lyon@linaro.org>
28282 * config/arm/vec-common.md (mul<mode>3): Disable on iwMMXT, expect
28285 2021-03-31 H.J. Lu <hjl.tools@gmail.com>
28287 * config/i386/i386-expand.c (expand_set_or_cpymem_via_rep):
28288 For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, don't convert QImode
28290 (decide_alg): For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, use
28291 "rep movsb/stosb" only for known sizes.
28292 * config/i386/i386-options.c (processor_cost_table): Use Ice
28293 Lake cost for Cannon Lake, Ice Lake, Tiger Lake, Sapphire
28294 Rapids and Alder Lake.
28295 * config/i386/i386.h (TARGET_PREFER_KNOWN_REP_MOVSB_STOSB): New.
28296 * config/i386/x86-tune-costs.h (icelake_memcpy): New.
28297 (icelake_memset): Likewise.
28298 (icelake_cost): Likewise.
28299 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
28302 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28305 * config/aarch64/aarch64.c
28306 (aarch64_vectorize_preferred_vector_alignment): Query the size
28307 of the provided SVE vector; do not assume that all SVE vectors
28308 have the same size.
28310 2021-03-31 Jan Hubicka <jh@suse.cz>
28313 * cgraph.c (cgraph_node::release_body): Remove all callers and
28315 * cgraphclones.c (cgraph_node::materialize_clone): Do not do it here.
28316 * cgraphunit.c (cgraph_node::expand): And here.
28318 2021-03-31 Martin Liska <mliska@suse.cz>
28320 * ipa-modref.c (analyze_ssa_name_flags): Fix coding style
28321 and one negated condition.
28323 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28324 Richard Sandiford <richard.sandiford@arm.com>
28327 * config/aarch64/aarch64.md (*add<mode>3_poly_1): Swap Uai and Uav
28328 constraints on operands[2] and similarly 0 and rk constraints
28329 on operands[1] corresponding to that.
28331 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28334 * configure.ac (HAVE_LD_BROKEN_PE_DWARF5): New AC_DEFINE if PECOFF
28335 linker doesn't support DWARF sections new in DWARF5.
28336 * config/i386/i386-options.c (ix86_option_override_internal): Default
28337 to dwarf_version 4 if HAVE_LD_BROKEN_PE_DWARF5 for TARGET_PECOFF
28339 * config.in: Regenerated.
28340 * configure: Regenerated.
28342 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28345 * config/aarch64/aarch64.c (aarch64_analyze_loop_vinfo): Check for
28346 available issue_info before using it.
28348 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28351 * config/aarch64/aarch64.md (sub<mode>3_compare1_imm): Do not allow zero
28354 2021-03-30 Xionghu Luo <luoxhu@linux.ibm.com>
28357 * config/rs6000/altivec.md (altivec_lvsl_reg): Change to ...
28358 (altivec_lvsl_reg_<mode>): ... this.
28359 (altivec_lvsr_reg): Change to ...
28360 (altivec_lvsr_reg_<mode>): ... this.
28361 * config/rs6000/predicates.md (vec_set_index_operand): New.
28362 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
28363 Enable 32bit variable vec_insert for all TARGET_VSX.
28364 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
28365 Enable 32bit variable vec_insert for p9 and above.
28366 (rs6000_expand_vector_set_var_p8): Rename to ...
28367 (rs6000_expand_vector_set_var_p7): ... this.
28368 (rs6000_expand_vector_set): Use TARGET_VSX and adjust assert
28370 * config/rs6000/vector.md (vec_set<mode>): Use vec_set_index_operand.
28371 * config/rs6000/vsx.md (xl_len_r): Use gen_altivec_lvsl_reg_di and
28372 gen_altivec_lvsr_reg_di.
28374 2021-03-30 H.J. Lu <hjl.tools@gmail.com>
28377 * config/i386/ia32intrin.h (__rdtsc): Defined as macro.
28378 (__rdtscp): Likewise.
28380 2021-03-30 Tamar Christina <tamar.christina@arm.com>
28382 PR tree-optimization/99825
28383 * tree-vect-slp-patterns.c (vect_check_evenodd_blend):
28384 Reject non-mult 2 lanes.
28386 2021-03-30 Richard Earnshaw <rearnsha@arm.com>
28389 * config/arm/arm.c (arm_file_start): Fix emission of
28390 Tag_ABI_VFP_args attribute.
28392 2021-03-30 Richard Biener <rguenther@suse.de>
28394 PR tree-optimization/99824
28395 * stor-layout.c (set_min_and_max_values_for_integral_type):
28396 Assert the precision is within the bounds of
28397 WIDE_INT_MAX_PRECISION.
28398 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Use
28399 the outermost component ref only to lower the access size
28400 and initialize that from the access type.
28402 2021-03-30 Richard Sandiford <richard.sandiford@arm.com>
28405 * config/aarch64/aarch64.md (mov<mode>): Pass multi-instruction
28406 CONST_INTs to aarch64_expand_mov_immediate when called after RA.
28408 2021-03-30 Mihailo Stojanovic <mihailo.stojanovic@typhoon-hil.com>
28410 * config/aarch64/aarch64.md
28411 (<optab>_trunc<fcvt_target><GPI:mode>2): Set the "arch"
28412 attribute to disambiguate between SIMD and FP variants of the
28415 2021-03-29 Jan Hubicka <hubicka@ucw.cz>
28417 * ipa-modref.c (merge_call_lhs_flags): Correct handling of deref.
28418 (analyze_ssa_name_flags): Fix typo in comment.
28420 2021-03-29 Alex Coplan <alex.coplan@arm.com>
28423 * config/aarch64/aarch64-sve-builtins.cc
28424 (function_builder::add_function): Add placeholder_p argument, use
28425 placeholder decls if this is set.
28426 (function_builder::add_unique_function): Instead of conditionally adding
28427 direct overloads, unconditionally add either a direct overload or a
28429 (function_builder::add_overloaded_function): Set placeholder_p if we're
28430 using C++ overloads. Use the obstack for string storage instead
28431 of relying on the tree nodes.
28432 (function_builder::add_overloaded_functions): Don't return early for
28433 m_direct_overloads: we need to add placeholders.
28434 * config/aarch64/aarch64-sve-builtins.h
28435 (function_builder::add_function): Add placeholder_p argument.
28437 2021-03-29 Richard Biener <rguenther@suse.de>
28439 PR tree-optimization/99807
28440 * tree-vect-slp.c (vect_slp_analyze_node_operations_1): Move
28441 assert below VEC_PERM handling.
28443 2021-03-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28446 * config/aarch64/aarch64-simd.md (move_lo_quad_internal_<mode>): Use
28447 aarch64_simd_or_scalar_imm_zero to match zeroes. Remove pattern
28448 matching const_int 0.
28449 (move_lo_quad_internal_be_<mode>): Likewise.
28450 (move_lo_quad_<mode>): Update for the above.
28451 * config/aarch64/iterators.md (VQ_2E): Delete.
28453 2021-03-29 Jakub Jelinek <jakub@redhat.com>
28455 PR tree-optimization/99777
28456 * fold-const.c (extract_muldiv_1): For conversions, punt on casts from
28457 types other than scalar integral types.
28459 2021-03-28 David Edelsohn <dje.gcc@gmail.com>
28461 * config/rs6000/rs6000.c (rs6000_output_dwarf_dtprel): Do not add
28462 XCOFF TLS reloc decorations.
28464 2021-03-28 Gerald Pfeifer <gerald@pfeifer.com>
28466 * doc/analyzer.texi (Analyzer Internals): Update link to
28467 "A Memory Model for Static Analysis of C Programs".
28469 2021-03-26 David Edelsohn <dje.gcc@gmail.com>
28471 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Call function.
28472 * config/rs6000/rs6000-protos.h (rs6000_special_adjust_field_align):
28474 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align): New.
28475 (rs6000_special_round_type_align): Recursively check innermost first
28478 2021-03-26 Jakub Jelinek <jakub@redhat.com>
28481 * dwarf2out.h (struct dw_fde_node): Add rule18 member.
28482 * dwarf2cfi.c (dwarf2out_frame_debug_expr): When handling (set hfp sp)
28483 assignment with drap_reg active, queue reg save for hfp with offset 0
28484 and flush queued reg saves. When handling a push with rule18,
28485 defer queueing reg save for hfp and just assert the offset is 0.
28486 (scan_trace): Assert that fde->rule18 is false.
28488 2021-03-26 Vladimir Makarov <vmakarov@redhat.com>
28491 * ira-costs.c (record_reg_classes): Put case with
28492 CT_RELAXED_MEMORY adjacent to one with CT_MEMORY.
28493 * ira.c (ira_setup_alts): Ditto.
28494 * lra-constraints.c (process_alt_operands): Ditto.
28495 * recog.c (asm_operand_ok): Ditto.
28496 * reload.c (find_reloads): Ditto.
28498 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28500 * config/aarch64/aarch64-protos.h
28501 (cpu_addrcost_table::post_modify_ld3_st3): New member variable.
28502 (cpu_addrcost_table::post_modify_ld4_st4): Likewise.
28503 * config/aarch64/aarch64.c (generic_addrcost_table): Update
28504 accordingly, using the same costs as for post_modify.
28505 (exynosm1_addrcost_table, xgene1_addrcost_table): Likewise.
28506 (thunderx2t99_addrcost_table, thunderx3t110_addrcost_table):
28507 (tsv110_addrcost_table, qdf24xx_addrcost_table): Likewise.
28508 (a64fx_addrcost_table): Likewise.
28509 (neoversev1_addrcost_table): New.
28510 (neoversev1_tunings): Use neoversev1_addrcost_table.
28511 (aarch64_address_cost): Use the new post_modify costs for CImode
28514 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28516 * config/aarch64/aarch64.opt
28517 (-param=aarch64-loop-vect-issue-rate-niters=): New parameter.
28518 * doc/invoke.texi: Document it.
28519 * config/aarch64/aarch64-protos.h (aarch64_base_vec_issue_info)
28520 (aarch64_scalar_vec_issue_info, aarch64_simd_vec_issue_info)
28521 (aarch64_advsimd_vec_issue_info, aarch64_sve_vec_issue_info)
28522 (aarch64_vec_issue_info): New structures.
28523 (cpu_vector_cost): Write comments above the variables rather
28525 (cpu_vector_cost::issue_info): New member variable.
28526 * config/aarch64/aarch64.c: Include gimple-pretty-print.h
28527 and tree-ssa-loop-niter.h.
28528 (generic_vector_cost, a64fx_vector_cost, qdf24xx_vector_cost)
28529 (thunderx_vector_cost, tsv110_vector_cost, cortexa57_vector_cost)
28530 (exynosm1_vector_cost, xgene1_vector_cost, thunderx2t99_vector_cost)
28531 (thunderx3t110_vector_cost): Initialize issue_info to null.
28532 (neoversev1_scalar_issue_info, neoversev1_advsimd_issue_info)
28533 (neoversev1_sve_issue_info, neoversev1_vec_issue_info): New structures.
28534 (neoversev1_vector_cost): Use them.
28535 (aarch64_vec_op_count, aarch64_sve_op_count): New structures.
28536 (aarch64_vector_costs::saw_sve_only_op): New member variable.
28537 (aarch64_vector_costs::num_vector_iterations): Likewise.
28538 (aarch64_vector_costs::scalar_ops): Likewise.
28539 (aarch64_vector_costs::advsimd_ops): Likewise.
28540 (aarch64_vector_costs::sve_ops): Likewise.
28541 (aarch64_vector_costs::seen_loads): Likewise.
28542 (aarch64_simd_vec_costs_for_flags): New function.
28543 (aarch64_analyze_loop_vinfo): Initialize num_vector_iterations.
28544 Count the number of predicate operations required by SVE WHILE
28546 (aarch64_comparison_type, aarch64_multiply_add_p): New functions.
28547 (aarch64_sve_only_stmt_p, aarch64_in_loop_reduction_latency): Likewise.
28548 (aarch64_count_ops): Likewise.
28549 (aarch64_add_stmt_cost): Record whether see an SVE operation
28550 that cannot currently be implementing using Advanced SIMD.
28551 Record issue information about the scalar, Advanced SIMD
28552 and (where relevant) SVE versions of a loop.
28553 (aarch64_vec_op_count::dump): New function.
28554 (aarch64_sve_op_count::dump): Likewise.
28555 (aarch64_estimate_min_cycles_per_iter): Likewise.
28556 (aarch64_adjust_body_cost): If issue information is available,
28557 try to compare the issue rates of the various loop implementations
28558 and increase or decrease the vector body cost accordingly.
28560 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28562 * config/aarch64/aarch64.c (aarch64_detect_vector_stmt_subtype):
28563 Assume a zero cost for induction phis.
28565 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28567 * config/aarch64/aarch64.c (aarch64_embedded_comparison_type): New
28569 (aarch64_adjust_stmt_cost): Add the costs of embedded scalar and
28570 vector comparisons.
28572 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28574 * config/aarch64/aarch64.c (aarch64_detect_scalar_stmt_subtype):
28576 (aarch64_add_stmt_cost): Call it.
28578 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28580 * config/aarch64/aarch64-tuning-flags.def (matched_vector_throughput):
28581 New tuning parameter.
28582 * config/aarch64/aarch64.c (neoversev1_tunings): Use it.
28583 (aarch64_estimated_sve_vq): New function.
28584 (aarch64_vector_costs::analyzed_vinfo): New member variable.
28585 (aarch64_vector_costs::is_loop): Likewise.
28586 (aarch64_vector_costs::unrolled_advsimd_niters): Likewise.
28587 (aarch64_vector_costs::unrolled_advsimd_stmts): Likewise.
28588 (aarch64_record_potential_advsimd_unrolling): New function.
28589 (aarch64_analyze_loop_vinfo, aarch64_analyze_bb_vinfo): Likewise.
28590 (aarch64_add_stmt_cost): Call aarch64_analyze_loop_vinfo or
28591 aarch64_analyze_bb_vinfo on the first use of a costs structure.
28592 Detect whether we're vectorizing a loop for SVE that might be
28593 completely unrolled if it used Advanced SIMD instead.
28594 (aarch64_adjust_body_cost_for_latency): New function.
28595 (aarch64_finish_cost): Call it.
28597 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28599 * config/aarch64/aarch64.c (aarch64_vector_costs): New structure.
28600 (aarch64_init_cost): New function.
28601 (aarch64_add_stmt_cost): Use aarch64_vector_costs instead of
28602 the default unsigned[3].
28603 (aarch64_finish_cost, aarch64_destroy_cost_data): New functions.
28604 (TARGET_VECTORIZE_INIT_COST): Override.
28605 (TARGET_VECTORIZE_FINISH_COST): Likewise.
28606 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
28608 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28610 * config/aarch64/aarch64.c (neoversev1_advsimd_vector_cost)
28611 (neoversev1_sve_vector_cost): New cost structures.
28612 (neoversev1_vector_cost): Likewise.
28613 (neoversev1_tunings): Use them. Enable use_new_vector_costs.
28615 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28617 * config/aarch64/aarch64-protos.h
28618 (sve_vec_cost::scatter_store_elt_cost): New member variable.
28619 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
28620 accordingly, taking the cost from the cost of a scalar_store.
28621 (a64fx_sve_vector_cost): Likewise.
28622 (aarch64_detect_vector_stmt_subtype): Detect scatter stores.
28624 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28626 * config/aarch64/aarch64-protos.h
28627 (simd_vec_cost::store_elt_extra_cost): New member variable.
28628 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28629 accordingly, using the vec_to_scalar cost for the new field.
28630 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28631 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28632 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28633 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28634 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28635 (thunderx3t110_advsimd_vector_cost): Likewise.
28636 (aarch64_detect_vector_stmt_subtype): Detect single-element stores.
28638 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28640 * config/aarch64/aarch64-protos.h (simd_vec_cost::ld2_st2_permute_cost)
28641 (simd_vec_cost::ld3_st3_permute_cost): New member variables.
28642 (simd_vec_cost::ld4_st4_permute_cost): Likewise.
28643 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28644 accordingly, using zero for the new costs.
28645 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28646 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28647 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28648 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28649 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28650 (thunderx3t110_advsimd_vector_cost): Likewise.
28651 (aarch64_ld234_st234_vectors): New function.
28652 (aarch64_adjust_stmt_cost): Likewise.
28653 (aarch64_add_stmt_cost): Call aarch64_adjust_stmt_cost if using
28654 the new vector costs.
28656 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28658 * config/aarch64/aarch64-protos.h (sve_vec_cost): Turn into a
28659 derived class of simd_vec_cost. Add information about CLAST[AB]
28660 and FADDA instructions.
28661 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
28662 accordingly, using the vec_to_scalar costs for the new fields.
28663 (a64fx_sve_vector_cost): Likewise.
28664 (aarch64_reduc_type): New function.
28665 (aarch64_sve_in_loop_reduction_latency): Likewise.
28666 (aarch64_detect_vector_stmt_subtype): Take a vinfo parameter.
28667 Use aarch64_sve_in_loop_reduction_latency to handle SVE reductions
28668 that occur in the loop body.
28669 (aarch64_add_stmt_cost): Update call accordingly.
28671 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28673 * config/aarch64/aarch64-tuning-flags.def (use_new_vector_costs):
28675 * config/aarch64/aarch64-protos.h (simd_vec_cost): Put comments
28676 above the fields rather than to the right.
28677 (simd_vec_cost::reduc_i8_cost): New member variable.
28678 (simd_vec_cost::reduc_i16_cost): Likewise.
28679 (simd_vec_cost::reduc_i32_cost): Likewise.
28680 (simd_vec_cost::reduc_i64_cost): Likewise.
28681 (simd_vec_cost::reduc_f16_cost): Likewise.
28682 (simd_vec_cost::reduc_f32_cost): Likewise.
28683 (simd_vec_cost::reduc_f64_cost): Likewise.
28684 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28685 accordingly, using the vec_to_scalar_cost for the new fields.
28686 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28687 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28688 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28689 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28690 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28691 (thunderx3t110_advsimd_vector_cost): Likewise.
28692 (aarch64_use_new_vector_costs_p): New function.
28693 (aarch64_simd_vec_costs): New function, split out from...
28694 (aarch64_builtin_vectorization_cost): ...here.
28695 (aarch64_is_reduction): New function.
28696 (aarch64_detect_vector_stmt_subtype): Likewise.
28697 (aarch64_add_stmt_cost): Call aarch64_detect_vector_stmt_subtype if
28698 using the new vector costs.
28700 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28703 * tree-emutls.c (get_emutls_init_templ_addr): Mark initializer of weak
28704 TLS declarations as public.
28706 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28708 * config/aarch64/aarch64-d.c (IN_TARGET_CODE): Define.
28709 * config/arm/arm-d.c (IN_TARGET_CODE): Likewise.
28710 * config/i386/i386-d.c (IN_TARGET_CODE): Likewise.
28711 * config/mips/mips-d.c (IN_TARGET_CODE): Likewise.
28712 * config/pa/pa-d.c (IN_TARGET_CODE): Likewise.
28713 * config/riscv/riscv-d.c (IN_TARGET_CODE): Likewise.
28714 * config/rs6000/rs6000-d.c (IN_TARGET_CODE): Likewise.
28715 * config/s390/s390-d.c (IN_TARGET_CODE): Likewise.
28716 * config/sparc/sparc-d.c (IN_TARGET_CODE): Likewise.
28718 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28721 * config.gcc (*-*-cygwin*): Add winnt-d.o
28722 (*-*-mingw*): Likewise.
28723 * config/i386/cygwin.h (EXTRA_TARGET_D_OS_VERSIONS): New macro.
28724 * config/i386/mingw32.h (EXTRA_TARGET_D_OS_VERSIONS): Likewise.
28725 * config/i386/t-cygming: Add winnt-d.o.
28726 * config/i386/winnt-d.c: New file.
28728 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28730 * config/freebsd-d.c: Include memmodel.h.
28732 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28735 * config.gcc (*-*-openbsd*): Add openbsd-d.o.
28736 * config/t-openbsd: Add openbsd-d.o.
28737 * config/openbsd-d.c: New file.
28739 2021-03-25 Stam Markianos-Wright <stam.markianos-wright@arm.com>
28741 PR tree-optimization/96974
28742 * tree-vect-stmts.c (vect_get_vector_types_for_stmt): Replace assert
28743 with graceful exit.
28745 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28748 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28752 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
28753 always_inline in system headers.
28755 2021-03-25 Kewen Lin <linkw@linux.ibm.com>
28757 * tree-vect-loop.c (vect_model_reduction_cost): Init inside_cost.
28759 2021-03-25 Jakub Jelinek <jakub@redhat.com>
28762 * tree-core.h (enum operand_equal_flag): Add OEP_ADDRESS_OF_SAME_FIELD.
28763 * fold-const.c (operand_compare::operand_equal_p): Don't compare
28764 field offsets if OEP_ADDRESS_OF_SAME_FIELD.
28766 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28770 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
28771 always_inline in system headers.
28773 2021-03-25 Richard Biener <rguenther@suse.de>
28775 PR tree-optimization/99746
28776 * tree-vect-slp-patterns.c (complex_pattern::build): Do not mark
28777 the scalar stmt as patterned. Instead set up required things
28780 2021-03-25 Xionghu Luo <luoxhu@linux.ibm.com>
28782 * config/rs6000/rs6000.c (power8_costs): Change l2 cache
28785 2021-03-24 Martin Liska <mliska@suse.cz>
28788 * common/config/i386/i386-common.c (ARRAY_SIZE): Fix off-by-one
28790 * config/i386/i386-options.c (ix86_option_override_internal):
28791 Add run-time assert.
28793 2021-03-24 Martin Jambor <mjambor@suse.cz>
28796 * ipa-cp.c (initialize_node_lattices): Mark as bottom all
28797 parameters with unknown type.
28798 (ipacp_value_safe_for_type): New function.
28799 (propagate_vals_across_arith_jfunc): Verify that the constant type
28800 can be used for a type of the formal parameter.
28801 (propagate_vals_across_ancestor): Likewise.
28802 (propagate_scalar_across_jump_function): Likewise. Pass the type
28803 also to propagate_vals_across_ancestor.
28805 2021-03-24 Christophe Lyon <christophe.lyon@linaro.org>
28808 * config/arm/mve.md (movmisalign<mode>_mve_store): Use Ux
28810 (movmisalign<mode>_mve_load): Likewise.
28812 2021-03-24 Jakub Jelinek <jakub@redhat.com>
28815 * config/arm/vec-common.md (one_cmpl<mode>2, neg<mode>2,
28816 movmisalign<mode>): Disable expanders for TARGET_REALLY_IWMMXT.
28818 2021-03-24 Alexandre Oliva <oliva@adacore.com>
28820 * doc/sourcebuild.texi (sysconf): New effective target.
28822 2021-03-24 Alexandre Oliva <oliva@adacore.com>
28824 * config/i386/predicates.md (reg_or_const_vec_operand): New.
28825 * config/i386/sse.md (ssse3_pshufbv8qi3): Add an expander for
28826 the now *-prefixed insn_and_split, turn the splitter const vec
28827 into an input for the insn, making it an ignored immediate for
28828 non-split cases, and loaded into the scratch register
28831 2021-03-23 Vladimir N. Makarov <vmakarov@redhat.com>
28834 * config/aarch64/constraints.md (Utq, UOb, UOh, UOw, UOd, UOty):
28835 Use define_relaxed_memory_constraint for them.
28837 2021-03-23 Iain Sandoe <iain@sandoe.co.uk>
28840 * config/host-darwin.c (darwin_gt_pch_use_address): Add a
28841 colon to the diagnostic message.
28843 2021-03-23 Ilya Leoshkevich <iii@linux.ibm.com>
28845 * fwprop.c (fwprop_propagation::fwprop_propagation): Look at
28847 (try_fwprop_subst_note): Use set_info instead of insn_info.
28848 (try_fwprop_subst_pattern): Likewise.
28849 (try_fwprop_subst_notes): Likewise.
28850 (try_fwprop_subst): Likewise.
28851 (forward_propagate_subreg): Likewise.
28852 (forward_propagate_and_simplify): Likewise.
28853 (forward_propagate_into): Likewise.
28854 * rtl-ssa/accesses.h (set_info::single_nondebug_use) New
28856 (set_info::single_nondebug_insn_use): Likewise.
28857 (set_info::single_phi_use): Likewise.
28858 * rtl-ssa/member-fns.inl (set_info::single_nondebug_use) New
28860 (set_info::single_nondebug_insn_use): Likewise.
28861 (set_info::single_phi_use): Likewise.
28863 2021-03-23 Christophe Lyon <christophe.lyon@linaro.org>
28865 * doc/sourcebuild.texi (arm_dsp_ok, arm_dsp): Document.
28867 2021-03-23 Jakub Jelinek <jakub@redhat.com>
28870 * config/aarch64/aarch64.c (aarch64_add_offset): Tell
28871 expand_mult to perform an unsigned rather than a signed
28874 2021-03-23 H.J. Lu <hjl.tools@gmail.com>
28877 * config/i386/cpuid.h (__cpuid): Add __volatile__.
28878 (__cpuid_count): Likewise.
28880 2021-03-23 Richard Biener <rguenther@suse.de>
28882 PR tree-optimization/99721
28883 * tree-vect-slp.c (vect_slp_analyze_node_operations):
28884 Make sure we can schedule the node.
28886 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28888 * config/riscv/riscv.c (riscv_subword): Take endianness into
28889 account when calculating the byte offset.
28891 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28893 * config/riscv/predicates.md (subreg_lowpart_operator): New predicate
28894 * config/riscv/riscv.md (*addsi3_extended2, *subsi3_extended2)
28895 (*negsi2_extended2, *mulsi3_extended2, *<optab>si3_mask)
28896 (*<optab>si3_mask_1, *<optab>di3_mask, *<optab>di3_mask_1)
28897 (*<optab>si3_extend_mask, *<optab>si3_extend_mask_1): Use
28898 new predicate "subreg_lowpart_operator"
28900 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28902 * config/riscv/riscv.c (riscv_swap_instruction): New function
28903 to byteswap an SImode rtx containing an instruction.
28904 (riscv_trampoline_init): Byteswap the generated instructions
28907 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28909 * common/config/riscv/riscv-common.c
28910 (TARGET_DEFAULT_TARGET_FLAGS): Set default endianness.
28911 * config.gcc (riscv32be-*, riscv64be-*): Set
28912 TARGET_BIG_ENDIAN_DEFAULT to 1.
28913 * config/riscv/elf.h (LINK_SPEC): Change -melf* value
28914 depending on default endianness.
28915 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
28916 * config/riscv/linux.h (LINK_SPEC): Likewise.
28917 * config/riscv/riscv.c (TARGET_DEFAULT_TARGET_FLAGS): Set
28918 default endianness.
28919 * config/riscv/riscv.h (DEFAULT_ENDIAN_SPEC): New macro.
28921 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28923 * config/riscv/elf.h (LINK_SPEC): Pass linker endianness flag.
28924 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
28925 * config/riscv/linux.h (LINK_SPEC): Likewise.
28926 * config/riscv/riscv.h (ASM_SPEC): Pass -mbig-endian and
28928 (BYTES_BIG_ENDIAN): Handle big endian.
28929 (WORDS_BIG_ENDIAN): Define to BYTES_BIG_ENDIAN.
28930 * config/riscv/riscv.opt (-mbig-endian, -mlittle-endian): New
28932 * doc/invoke.texi (-mbig-endian, -mlittle-endian): Document.
28934 2021-03-23 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
28936 * regcprop.c (find_oldest_value_reg): Ask target whether
28937 different mode is fine for replacement register.
28939 2021-03-23 Aldy Hernandez <aldyh@redhat.com>
28941 PR tree-optimization/99296
28942 * value-range.cc (irange::irange_set_1bit_anti_range): New.
28943 (irange::irange_set_anti_range): Call irange_set_1bit_anti_range
28944 * value-range.h (irange::irange_set_1bit_anti_range): New.
28946 2021-03-22 Vladimir N. Makarov <vmakarov@redhat.com>
28949 * config/aarch64/constraints.md (UtQ): Use
28950 define_relaxed_memory_constraint for it.
28951 * doc/md.texi (define_relaxed_memory_constraint): Describe it.
28952 * genoutput.c (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28953 * genpreds.c (constraint_data): Add bitfield is_relaxed_memory.
28954 (have_relaxed_memory_constraints): New static var.
28955 (relaxed_memory_start, relaxed_memory_end): Ditto.
28956 (add_constraint): Add arg is_relaxed_memory. Check name for
28957 relaxed memory. Set up is_relaxed_memory in constraint_data and
28958 have_relaxed_memory_constraints. Adjust calls.
28959 (choose_enum_order): Process relaxed memory.
28960 (write_tm_preds_h): Ditto.
28961 (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28962 * gensupport.c (process_rtx): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28963 * ira-costs.c (record_reg_classes): Process CT_RELAXED_MEMORY.
28964 * ira-lives.c (single_reg_class): Use
28965 insn_extra_relaxed_memory_constraint.
28966 * ira.c (ira_setup_alts): CT_RELAXED_MEMORY.
28967 * lra-constraints.c (valid_address_p): Use
28968 insn_extra_relaxed_memory_constraint instead of other memory
28970 (process_alt_operands): Process CT_RELAXED_MEMORY.
28971 (curr_insn_transform): Use insn_extra_relaxed_memory_constraint.
28972 * recog.c (asm_operand_ok, preprocess_constraints): Process
28974 * reload.c (find_reloads): Ditto.
28975 * rtl.def (DEFINE_RELAXED_MEMORY_CONSTRAINT): New.
28976 * stmt.c (parse_input_constraint): Use
28977 insn_extra_relaxed_memory_constraint.
28979 2021-03-22 Segher Boessenkool <segher@kernel.crashing.org>
28982 * ubsan.c (ubsan_instrument_float_cast): Don't test for unordered if
28985 2021-03-22 Alex Coplan <alex.coplan@arm.com>
28988 * config/arm/arm-protos.h (neon_make_constant): Add generate
28989 argument to guard emitting insns, default to true.
28990 * config/arm/arm.c (arm_legitimate_constant_p_1): Reject
28991 CONST_VECTORs which neon_make_constant can't handle.
28992 (neon_vdup_constant): Add generate argument, avoid emitting
28993 insns if it's not set.
28994 (neon_make_constant): Plumb new generate argument through.
28995 * config/arm/constraints.md (Ui): New. Use it...
28996 * config/arm/mve.md (*mve_mov<mode>): ... here.
28997 * config/arm/vec-common.md (movv8hf): Use neon_make_constant to
28998 synthesize constants.
29000 2021-03-22 Richard Biener <rguenther@suse.de>
29002 * debug.h: Add deprecation warning.
29004 2021-03-22 Richard Biener <rguenther@suse.de>
29006 PR tree-optimization/99694
29007 * tree-ssa-sccvn.c (visit_phi): Ignore edges with the
29010 2021-03-22 Kito Cheng <kito.cheng@sifive.com>
29013 * config/riscv/riscv.c (riscv_expand_block_move): Get RTL value
29014 after type checking.
29016 2021-03-22 Jakub Jelinek <jakub@redhat.com>
29020 * dwarf2out.c (get_full_len): Use get_precision rather than
29022 (add_const_value_attribute): Make sure add_AT_wide argument has
29023 precision prec rather than some very wide one.
29025 2021-03-22 Kewen Lin <linkw@linux.ibm.com>
29027 * config/rs6000/rs6000.md (*rotldi3_insert_sf,
29028 *mov<SFDF:mode><SFDF2:mode>cc_p9, floatsi<mode>2_lfiwax,
29029 floatsi<mode>2_lfiwax_mem, floatunssi<mode>2_lfiwzx,
29030 floatunssi<mode>2_lfiwzx_mem, *floatsidf2_internal,
29031 *floatunssidf2_internal, fix_trunc<mode>si2_stfiwx,
29032 fix_trunc<mode>si2_internal, fixuns_trunc<mode>si2_stfiwx,
29033 *round32<mode>2_fprs, *roundu32<mode>2_fprs,
29034 *fix_trunc<mode>si2_internal): Fix empty split condition.
29035 * config/rs6000/vsx.md (*vsx_le_undo_permute_<mode>,
29036 vsx_reduc_<VEC_reduc_name>_v2df, vsx_reduc_<VEC_reduc_name>_v4sf,
29037 *vsx_reduc_<VEC_reduc_name>_v2df_scalar,
29038 *vsx_reduc_<VEC_reduc_name>_v4sf_scalar): Likewise.
29040 2021-03-22 Xionghu Luo <luoxhu@linux.ibm.com>
29043 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
29044 Convert idx to DImode.
29045 (rs6000_expand_vector_set_var_p8): Likewise.
29047 2021-03-21 Jakub Jelinek <jakub@redhat.com>
29050 * dwarf2out.c (insert_float): Change return type from void to
29051 unsigned, handle GET_MODE_SIZE (mode) == 2 and return element size.
29052 (mem_loc_descriptor, loc_descriptor, add_const_value_attribute):
29055 2021-03-20 H.J. Lu <hjl.tools@gmail.com>
29058 * config/i386/i386.c (construct_container): Check cfun != NULL
29059 before accessing silent_p.
29061 2021-03-20 Ahamed Husni <ahamedhusni73@gmail.com>
29063 * asan.c: Fix typos in comments.
29065 2021-03-20 Vladimir N. Makarov <vmakarov@redhat.com>
29067 PR rtl-optimization/99680
29068 * lra-constraints.c (skip_contraint_modifiers): Rename to skip_constraint_modifiers.
29069 (process_address_1): Check empty constraint before using
29072 2021-03-19 Pat Haugen <pthaugen@linux.ibm.com>
29074 * config/rs6000/rs6000.c (power10_cost): New.
29075 (rs6000_option_override_internal): Set Power10 costs.
29076 (rs6000_issue_rate): Set Power10 issue rate.
29077 * config/rs6000/power10.md: Rewrite for Power10.
29079 2021-03-19 Vladimir N. Makarov <vmakarov@redhat.com>
29082 * lra-constraints.c (process_address_1): Don't use unknown
29083 constraint for address constraint.
29085 2021-03-19 Iain Sandoe <iain@sandoe.co.uk>
29088 * config.gcc (powerpc-*-darwin8): Delete the reference to
29089 the now removed darwin8.h.
29091 2021-03-19 Olivier Hainque <hainque@adacore.com>
29094 * config/vxworksae.h (VX_CPU_PREFIX): Define.
29096 2021-03-19 John David Anglin <danglin@gcc.gnu.org>
29098 * config/pa/pa.c (import_milli): Use memcpy instead of strncpy.
29100 2021-03-19 Tamar Christina <tamar.christina@arm.com>
29102 PR tree-optimization/99656
29103 * tree-vect-slp-patterns.c (linear_loads_p,
29104 complex_add_pattern::matches, is_eq_or_top,
29105 vect_validate_multiplication, complex_mul_pattern::matches,
29106 complex_fms_pattern::matches): Remove complex_perm_kinds_t.
29107 * tree-vectorizer.h: (complex_load_perm_t): Removed.
29108 (slp_tree_to_load_perm_map_t): Use complex_perm_kinds_t instead of
29109 complex_load_perm_t.
29111 2021-03-19 H.J. Lu <hjl.tools@gmail.com>
29114 * config/i386/i386-options.c (ix86_init_machine_status): Set
29116 * config/i386/i386.c (init_cumulative_args): Set silent_p to
29118 (construct_container): Return early for return and argument
29119 errors if silent_p is true.
29120 * config/i386/i386.h (machine_function): Add silent_p.
29122 2021-03-19 Jakub Jelinek <jakub@redhat.com>
29125 * config/arm/constraints.md (Ds): New constraint.
29126 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): Use w,Ds
29127 constraint instead of w,Dm.
29129 2021-03-19 Andrew Stubbs <ams@codesourcery.com>
29131 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Fix quotes
29134 2021-03-19 Eric Botcazou <ebotcazou@adacore.com>
29136 PR middle-end/99641
29137 * fold-const.c (native_encode_initializer) <CONSTRUCTOR>: For an
29138 array type, do the computation of the current position in sizetype.
29140 2021-03-18 Vladimir N. Makarov <vmakarov@redhat.com>
29143 * lra-constraints.c (process_address_1): Use lookup_constraint
29144 only for a single constraint.
29146 2021-03-18 Martin Sebor <msebor@redhat.com>
29148 PR middle-end/99502
29149 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): Rename...
29150 (inbounds_memaccess_p): ...to this. Check the ending offset of
29151 the accessed member.
29153 2021-03-18 Andrew Stubbs <ams@codesourcery.com>
29155 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Add %< and
29156 %> quote markers to error messages.
29157 (gcn_goacc_validate_dims): Likewise.
29158 (gcn_conditional_register_usage): Remove exclaimation mark from error
29160 (gcn_vectorize_vec_perm_const): Ensure perm is fully uninitialized.
29162 2021-03-18 Jan Hubicka <hubicka@ucw.cz>
29164 * config/i386/x86-tune-costs.h (struct processor_costs): Fix costs of
29167 2021-03-18 Sinan Lin <sinan@isrc.iscas.ac.cn>
29168 Kito Cheng <kito.cheng@sifive.com>
29170 * config/riscv/riscv.c (riscv_block_move_straight): Change type
29171 to unsigned HOST_WIDE_INT for parameter and local variable with
29172 HOST_WIDE_INT type.
29173 (riscv_adjust_block_mem): Ditto.
29174 (riscv_block_move_loop): Ditto.
29175 (riscv_expand_block_move): Ditto.
29177 2021-03-18 Nick Clifton <nickc@redhat.com>
29179 * config/v850/v850.c (construct_restore_jr): Increase static
29181 (construct_save_jarl): Likewise.
29182 * config/v850/v850.h (DWARF2_DEBUGGING_INFO): Define.
29184 2021-03-18 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29186 * config/aarch64/aarch64.c (aarch64_adjust_generic_arch_tuning): Define.
29187 (aarch64_override_options_internal): Use it.
29188 (generic_tunings): Add AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS to
29191 2021-03-17 Sandra Loosemore <sandra@codesourcery.com>
29193 * config/nios2/nios2.c (nios2_custom_check_insns): Clean up
29194 error message format issues.
29195 (nios2_option_override): Likewise.
29196 (nios2_expand_fpu_builtin): Likewise.
29197 (nios2_init_custom_builtins): Adjust to avoid bogus strncpy
29198 truncation warning.
29199 (nios2_expand_custom_builtin): More error message format fixes.
29200 (nios2_expand_rdwrctl_builtin): Likewise.
29201 (nios2_expand_rdprs_builtin): Likewise.
29202 (nios2_expand_eni_builtin): Likewise.
29203 (nios2_expand_builtin): Likewise.
29204 (nios2_register_custom_code): Likewise.
29205 (nios2_valid_target_attribute_rec): Likewise.
29206 (nios2_add_insn_asm): Fix uninitialized variable warning.
29208 2021-03-17 Jan Hubicka <jh@suse.cz>
29210 * config/i386/x86-tune-costs.h (struct processor_costs): Update costs
29211 of gather to match reality.
29212 * config/i386/x86-tune.def (X86_TUNE_USE_GATHER): Enable for znver3.
29214 2021-03-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29216 * config/aarch64/aarch64-builtins.c (aarch64_expand_rng_builtin): Use EQ
29217 to compare against CC_REG rather than NE.
29219 2021-03-17 H.J. Lu <hjl.tools@gmail.com>
29222 * config/i386/i386.c (ix86_force_load_from_GOT_p): Support
29223 inline assembly statements.
29224 (ix86_print_operand): Update 'P' handling for -fno-plt.
29226 2021-03-17 Tamar Christina <tamar.christina@arm.com>
29229 * config/aarch64/aarch64.c
29230 (aarch64_simd_clone_compute_vecsize_and_simdlen): Remove unused var.
29232 2021-03-16 Segher Boessenkool <segher@kernel.crashing.org>
29235 * config/rs6000/predicates.md (branch_comparison_operator): Allow
29236 ordered and unordered for CCFPmode, if flag_finite_math_only.
29238 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29241 * config/i386/i386-expand.c (ix86_split_lea_for_addr): Emit a MULT
29242 rather than ASHIFT.
29243 * config/i386/i386.md (mult by 1248 into ashift): New splitter.
29245 2021-03-16 Martin Liska <mliska@suse.cz>
29248 * optc-save-gen.awk: Add flag_ipa_ra to exceptions for
29249 cl_optimization_compare function.
29251 2021-03-16 Ilya Leoshkevich <iii@linux.ibm.com>
29253 * config/s390/s390.c (f_constraint_p): Treat "fv" constraints
29256 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29259 * config/i386/i386.h (struct machine_function): Add
29260 has_explicit_vzeroupper bitfield.
29261 * config/i386/i386-expand.c (ix86_expand_builtin): Set
29262 cfun->machine->has_explicit_vzeroupper when expanding
29263 IX86_BUILTIN_VZEROUPPER.
29264 * config/i386/i386-features.c (rest_of_handle_insert_vzeroupper):
29265 Do the mode switching only when TARGET_VZEROUPPER, expensive
29266 optimizations turned on and not optimizing for size.
29267 (pass_insert_vzeroupper::gate): Enable even when
29268 cfun->machine->has_explicit_vzeroupper is set.
29270 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29273 * config/aarch64/aarch64.c
29274 (aarch64_simd_clone_compute_vecsize_and_simdlen): If not a function
29275 definition, walk TYPE_ARG_TYPES list if non-NULL for argument types
29276 instead of DECL_ARGUMENTS. Ignore types for uniform arguments.
29278 2021-03-15 Richard Biener <rguenther@suse.de>
29280 PR tree-optimization/98834
29281 * tree-ssa-sccvn.c (vn_reference_lookup_3): Handle missing
29282 subsetting by truncating the access size.
29284 2021-03-15 Jan Hubicka <hubicka@ucw.cz>
29286 * config/i386/i386-options.c (processor_cost_table): Add znver3_cost.
29287 * config/i386/x86-tune-costs.h (znver3_cost): New gobal variable; copy
29290 2021-03-15 Martin Liska <mliska@suse.cz>
29292 * spellcheck.c: Add missing comma in initialization.
29294 2021-03-14 Uroš Bizjak <ubizjak@gmail.com>
29296 * config/i386/sse.md (*vec_extract<mode>): Merge alternative 0 with
29297 alternative 2 and alternative 1 with alternative 3 using
29298 YW register constraint.
29299 (*vec_extract<PEXTR_MODE12:mode>_zext): Merge alternatives
29300 using YW register constraint.
29301 (*vec_extractv16qi_zext): Ditto.
29302 (*vec_extractv4si): Merge alternatives 4 and 5
29303 using Yw register constraint.
29304 (*ssse3_palignr<mode>_perm): Use Yw instead of v for alternative 3.
29306 2021-03-13 Martin Sebor <msebor@redhat.com>
29308 PR tree-optimization/99489
29309 * builtins.c (gimple_call_alloc_size): Fail gracefully when argument
29310 is not a call statement.
29312 2021-03-13 Jakub Jelinek <jakub@redhat.com>
29314 PR tree-optimization/99544
29315 * match.pd (X + (X << C) -> X * (1 + (1 << C))): Don't simplify
29316 if for vector types multiplication can't be done in type's mode.
29318 2021-03-12 Eric Botcazou <ebotcazou@adacore.com>
29321 * config/sparc/constraints.md (w): Rename to...
29322 (W): ... this and ditch previous implementation.
29323 * config/sparc/sparc.md (*movdi_insn_sp64): Replace W with m.
29324 (*movdf_insn_sp64): Likewise.
29325 (*mov<VM64:mode>_insn_sp64): Likewise.
29326 * config/sparc/sync.md (*atomic_compare_and_swap<mode>_1): Replace
29328 (atomic_compare_and_swap_leon3_1): Likewise.
29329 (*atomic_compare_and_swapdi_v8plus): Likewise.
29330 * config/sparc/sparc.c (memory_ok_for_ldd): Remove useless test on
29331 architecture and add missing address validity check during LRA.
29333 2021-03-12 Tobias Burnus <tobias@codesourcery.com>
29336 * gimplify.c (omp_add_variable): Handle NULL_TREE as size
29337 occuring for assumed-size arrays in use_device_{ptr,addr}.
29339 2021-03-12 Jakub Jelinek <jakub@redhat.com>
29342 * config/i386/constraints.md (YW): New internal constraint.
29343 * config/i386/sse.md (v_Yw): Add V4TI, V2TI, V1TI and TI cases.
29344 (*<sse2_avx2>_<insn><mode>3<mask_name>,
29345 *<sse2_avx2>_uavg<mode>3<mask_name>, *abs<mode>2,
29346 *<s>mul<mode>3_highpart<mask_name>): Use <v_Yw> instead of v in
29348 (<sse2_avx2>_psadbw): Use YW instead of v in constraints.
29349 (*avx2_pmaddwd, *sse2_pmaddwd, *<code>v8hi3, *<code>v16qi3,
29350 avx2_pmaddubsw256, ssse3_pmaddubsw128): Merge last two alternatives
29351 into one, use Yw instead of former x,v.
29352 (ashr<mode>3, <insn><mode>3): Use <v_Yw> instead of x in constraints of
29353 the last alternative.
29354 (<sse2_avx2>_packsswb<mask_name>, <sse2_avx2>_packssdw<mask_name>,
29355 <sse2_avx2>_packuswb<mask_name>, <sse4_1_avx2>_packusdw<mask_name>,
29356 *<ssse3_avx2>_pmulhrsw<mode>3<mask_name>, <ssse3_avx2>_palignr<mode>,
29357 <ssse3_avx2>_pshufb<mode>3<mask_name>): Merge last two alternatives
29358 into one, use <v_Yw> instead of former x,v.
29359 (avx2_interleave_highv32qi<mask_name>,
29360 vec_interleave_highv16qi<mask_name>): Use Yw instead of v in
29361 constraints. Add && <mask_avx512bw_condition> to condition.
29362 (avx2_interleave_lowv32qi<mask_name>,
29363 vec_interleave_lowv16qi<mask_name>,
29364 avx2_interleave_highv16hi<mask_name>,
29365 vec_interleave_highv8hi<mask_name>,
29366 avx2_interleave_lowv16hi<mask_name>, vec_interleave_lowv8hi<mask_name>,
29367 avx2_pshuflw_1<mask_name>, sse2_pshuflw_1<mask_name>,
29368 avx2_pshufhw_1<mask_name>, sse2_pshufhw_1<mask_name>,
29369 avx2_<code>v16qiv16hi2<mask_name>, sse4_1_<code>v8qiv8hi2<mask_name>,
29370 *sse4_1_<code>v8qiv8hi2<mask_name>_1, <sse2_avx2>_<insn><mode>3): Use
29371 Yw instead of v in constraints.
29372 * config/i386/mmx.md (Yv_Yw): New define_mode_attr.
29373 (*mmx_<insn><mode>3, mmx_ashr<mode>3, mmx_<insn><mode>3): Use <Yv_Yw>
29374 instead of Yv in constraints.
29375 (*mmx_<insn><mode>3, *mmx_mulv4hi3, *mmx_smulv4hi3_highpart,
29376 *mmx_umulv4hi3_highpart, *mmx_pmaddwd, *mmx_<code>v4hi3,
29377 *mmx_<code>v8qi3, mmx_pack<s_trunsuffix>swb, mmx_packssdw,
29378 mmx_punpckhbw, mmx_punpcklbw, mmx_punpckhwd, mmx_punpcklwd,
29379 *mmx_uavgv8qi3, *mmx_uavgv4hi3, mmx_psadbw): Use Yw instead of Yv in
29381 (*mmx_pinsrw, *mmx_pinsrb, *mmx_pextrw, *mmx_pextrw_zext, *mmx_pextrb,
29382 *mmx_pextrb_zext): Use YW instead of Yv in constraints.
29383 (*mmx_eq<mode>3, mmx_gt<mode>3): Use x instead of Yv in constraints.
29384 (mmx_andnot<mode>3, *mmx_<code><mode>3): Split last alternative into
29385 two, one with just x, another isa avx512vl with v.
29387 2021-03-12 Martin Liska <mliska@suse.cz>
29389 * doc/invoke.texi: Add missing param documentation.
29391 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29394 * Makefile.in (ANALYZER_OBJS): Add analyzer/feasible-graph.o and
29395 analyzer/trimmed-graph.o.
29396 * doc/analyzer.texi (Analyzer Paths): Rewrite description of
29397 feasibility checking to reflect new implementation.
29398 * doc/invoke.texi (-fdump-analyzer-feasibility): Document new
29400 * shortest-paths.h (shortest_paths::get_shortest_distance): New.
29402 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29404 * digraph.cc (selftest::test_shortest_paths): Update
29405 shortest_paths init for new param. Add test of
29406 SPS_TO_GIVEN_TARGET.
29407 * shortest-paths.h (enum shortest_path_sense): New.
29408 (shortest_paths::shortest_paths): Add "sense" param.
29409 Update for renamings. Generalize to use "sense" param.
29410 (shortest_paths::get_shortest_path): Rename param.
29411 (shortest_paths::m_sense): New field.
29412 (shortest_paths::m_prev): Rename...
29413 (shortest_paths::m_best_edge): ...to this.
29414 (shortest_paths::get_shortest_path): Update for renamings.
29415 Conditionalize flipping of path on sense of traversal.
29417 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29419 * digraph.cc (selftest::test_shortest_paths): Add test coverage
29420 for paths from B and C.
29421 * shortest-paths.h (shortest_paths::shortest_paths): Handle
29422 unreachable nodes, rather than asserting.
29424 2021-03-11 David Edelsohn <dje.gcc@gmail.com>
29427 * config/rs6000/rs6000.c (rs6000_xcoff_file_start): Don't create
29428 xcoff_tbss_section_name.
29429 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .lcomm.
29430 * xcoffout.c (xcoff_tbss_section_name): Delete.
29431 * xcoffout.h (xcoff_tbss_section_name): Delete.
29433 2021-03-11 Richard Biener <rguenther@suse.de>
29435 PR tree-optimization/99523
29436 * tree-cfg.c (dump_function_to_file): Dump SSA names
29437 w/o identifier to the decls section as well, not only those
29438 without a VAR_DECL.
29440 2021-03-11 Jakub Jelinek <jakub@redhat.com>
29443 * ipa-icf-gimple.c (func_checker::compare_gimple_call): For internal
29444 function calls with lhs fail if the lhs don't have compatible types.
29446 2021-03-11 Hans-Peter Nilsson <hp@axis.com>
29448 * config/cris/cris.h (HARD_FRAME_POINTER_REGNUM): Define.
29449 Change FRAME_POINTER_REGNUM to correspond to a new faked
29450 register faked_fp, part of GENNONACR_REGS like faked_ap.
29451 (CRIS_FAKED_REGS_CONTENTS): New helper macro.
29452 (FIRST_PSEUDO_REGISTER, FIXED_REGISTERS, CALL_USED_REGISTERS):
29453 (REG_ALLOC_ORDER, REG_CLASS_CONTENTS, REGNO_OK_FOR_BASE_P)
29454 (ELIMINABLE_REGS, REGISTER_NAMES): Adjust accordingly.
29455 * config/cris/cris.md (CRIS_FP_REGNUM): Renumber to new faked
29457 (CRIS_REAL_FP_REGNUM): New constant.
29458 * config/cris/cris.c (cris_reg_saved_in_regsave_area): Check
29459 for HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM.
29460 (cris_initial_elimination_offset): Handle elimination changes
29461 to HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM
29462 and add one from FRAME_POINTER_REGNUM to
29463 HARD_FRAME_POINTER_REGNUM.
29464 (cris_expand_prologue, cris_expand_epilogue): Emit code for
29465 hard_frame_pointer_rtx instead of frame_pointer_rtx.
29467 2021-03-10 David Edelsohn <dje.gcc@gmail.com>
29470 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Add check for DCmode.
29471 * config/rs6000/rs6000.c (rs6000_special_round_type_align): Same.
29473 2021-03-10 Vladimir N. Makarov <vmakarov@redhat.com>
29476 * lra-constraints.c (process_address_1): Don't check unknown
29477 constraint, use X for empty constraint.
29479 2021-03-10 Alex Coplan <alex.coplan@arm.com>
29481 * config/aarch64/aarch64.c (aarch64_vfp_is_call_or_return_candidate):
29482 Fix typo in comment describing "is_ha" argument.
29484 2021-03-10 John David Anglin <danglin@gcc.gnu.org>
29486 * doc/sourcebuild.texi: Document LRA target selector.
29488 2021-03-10 David Malcolm <dmalcolm@redhat.com>
29490 * doc/ux.texi: Add subsection contrasting interactive versus
29491 batch usage of GCC.
29493 2021-03-10 Joel Hutton <joel.hutton@arm.com>
29496 * tree-vect-stmts.c (vectorizable_store): Fix scatter store mask
29498 (vectorizable_load): Fix gather load mask check condition.
29500 2021-03-10 Richard Biener <rguenther@suse.de>
29502 PR tree-optimization/99510
29503 * tree.c (check_aligned_type): Check that the candidate
29504 has TYPE_USER_ALIGN set instead of matching with the
29507 2021-03-10 Eric Botcazou <ebotcazou@adacore.com>
29509 * config/sparc/sparc.c (sparc_regmode_natural_size): Return 4 for
29510 float and vector integer modes only if the mode is not larger.
29512 2021-03-10 Hans-Peter Nilsson <hp@axis.com>
29514 * config/cris/cris.h (DWARF_FRAME_REGISTERS): Define.
29516 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29518 * ira.c (ira_setup_alts, ira_get_dup_out_num): Process digital
29520 * ira-lives.c (single_reg_class): Ditto.
29522 2021-03-09 Sebastian Huber <sebastian.huber@embedded-brains.de>
29524 * config.gcc (aarch64-*-rtems*): Include general rtems.h after
29525 the architecture-specific rtems.h.
29526 (aarch64-*-rtems*): Likewise.
29527 (arm*-*-rtems*): Likewise.
29528 (epiphany-*-rtems*): Likewise.
29529 (riscv*-*-rtems*): Likewise.
29531 2021-03-09 Jakub Jelinek <jakub@redhat.com>
29533 PR tree-optimization/99305
29534 * tree-ssa-phiopt.c (conditional_replacement): Test integer_pow2p
29535 before integer_all_onesp instead of vice versa.
29537 2021-03-09 Richard Earnshaw <rearnsha@arm.com>
29539 * common/config/arm/arm-common.c (arm_config_default): Change type
29540 of 'i' to unsigned.
29542 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29545 * lra-constraints.c (process_address_1): Process constraint 'g'
29546 separately and digital constraints containing more one digit.
29548 2021-03-09 Nick Clifton <nickc@redhat.com>
29550 * config/rx/rx.h (DBX_DEBUGGING_INFO): Define.
29551 (DWARF"_DEBUGGING_INFO): Define.
29553 2021-03-09 Eric Botcazou <ebotcazou@adacore.com>
29556 * calls.c (initialize_argument_information): When the argument
29557 is passed by reference, do not make a copy in a thunk only if
29558 the argument is already in memory. Remove redundant test for
29559 the case of callee copy.
29561 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29564 * lra-constraints.c (process_address_1): Process 0..9 constraints
29565 in process_address_1.
29567 2021-03-09 Andreas Krebbel <krebbel@linux.ibm.com>
29569 * config/s390/s390.c (struct s390_processor processor_table):
29570 Binutils name string must not be empty.
29572 2021-03-09 Claudiu Zissulescu <claziss@synopsys.com>
29574 * config/arc/arc.c (arc_attr_type): Remove function.
29576 2021-03-09 Martin Liska <mliska@suse.cz>
29579 * config/i386/i386-options.c (ix86_option_override_internal):
29580 Set isa_flags for OPTS argument and not for the global
29583 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
29585 * config/rs6000/predicates.md (ds_form_mem_operand): Check
29588 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
29591 * config/rs6000/predicates.md (ds_form_mem_operand) New
29593 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10) Use
29594 ds_form_mem_operand in ld/lwa patterns.
29595 * config/rs6000/fusion.md: Regenerate file.
29597 2021-03-08 Martin Sebor <msebor@redhat.com>
29599 PR middle-end/98266
29600 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): New function.
29601 (array_bounds_checker::check_array_bounds): Call it.
29603 2021-03-08 Martin Sebor <msebor@redhat.com>
29605 PR middle-end/97631
29606 * tree-ssa-strlen.c (maybe_warn_overflow): Test rawmem.
29607 (handle_builtin_stxncpy_strncat): Rename locals. Determine
29608 destination size from allocation calls. Issue a more appropriate
29610 (handle_builtin_memcpy): Pass true as rawmem to maybe_warn_overflow.
29611 (handle_builtin_memset): Same.
29613 2021-03-08 Peter Bergner <bergner@linux.ibm.com>
29616 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_permute): Add an assert
29617 to ensure we do not have an Altivec style address.
29618 * config/rs6000/vsx.md (*vsx_le_perm_load_<mode>): Disable if passed
29619 an Altivec style address.
29620 (*vsx_le_perm_store_<mode>): Likewise.
29621 (splitters after *vsx_le_perm_store_<mode>): Likewise.
29622 (vsx_load_<mode>): Disable special expander if passed an Altivec
29624 (vsx_store_<mode>): Likewise.
29626 2021-03-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29629 * config/aarch64/predicates.md (aarch64_simd_shift_imm_vec_qi): Define.
29630 (aarch64_simd_shift_imm_vec_hi): Likewise.
29631 (aarch64_simd_shift_imm_vec_si): Likewise.
29632 (aarch64_simd_shift_imm_vec_di): Likewise.
29633 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le): Use
29634 predicate from above.
29635 (aarch64_shrn<mode>_insn_be): Likewise.
29636 (aarch64_rshrn<mode>_insn_le): Likewise.
29637 (aarch64_rshrn<mode>_insn_be): Likewise.
29638 (aarch64_shrn2<mode>_insn_le): Likewise.
29639 (aarch64_shrn2<mode>_insn_be): Likewise.
29640 (aarch64_rshrn2<mode>_insn_le): Likewise.
29641 (aarch64_rshrn2<mode>_insn_be): Likewise.
29643 2021-03-08 Vladimir N. Makarov <vmakarov@redhat.com>
29646 * lra-constraints.c (skip_contraint_modifiers): New function.
29647 (process_address_1): Use it before lookup_constraint call.
29649 2021-03-08 Martin Liska <mliska@suse.cz>
29652 * config/i386/i386-options.c (ix86_option_override_internal):
29653 Enable UINTR and HRESET for -march that supports it.
29655 2021-03-08 Ilya Leoshkevich <iii@linux.ibm.com>
29657 * config/s390/s390.c (f_constraint_p): New function.
29658 (s390_md_asm_adjust): Implement TARGET_MD_ASM_ADJUST.
29659 (TARGET_MD_ASM_ADJUST): Likewise.
29661 2021-03-08 Tobias Burnus <tobias@codesourcery.com>
29664 * tree-nested.c (convert_local_reference_stmt): Avoid calling
29665 lookup_field_for_decl for Fortran module (= namespace context).
29667 2021-03-08 Andreas Krebbel <krebbel@linux.ibm.com>
29669 * config/s390/s390.c (s390_expand_vec_compare): Implement <0
29670 comparison with arithmetic right shift.
29671 (s390_expand_vcond): No need for a force_reg anymore.
29672 s390_vec_compare will do it.
29673 * config/s390/vector.md ("vec_cmp<mode><tointvec>"): Accept also
29674 immediate operands.
29676 2021-03-07 Jakub Jelinek <jakub@redhat.com>
29679 * config/i386/constraints.md (Yw): Use SSE_REGS if TARGET_SSE
29680 but TARGET_AVX512BW or TARGET_AVX512VL is not set. Adjust description
29682 * config/i386/sse.md (v_Yw): New define_mode_attr.
29683 (*<insn><mode>3, *mul<mode>3<mask_name>, *avx2_<code><mode>3,
29684 *sse4_1_<code><mode>3<mask_name>): Use <v_Yw> instead of v
29686 * config/i386/mmx.md (mmx_pshufw_1, *vec_dupv4hi): Use Yw instead of
29687 xYw in constraints.
29689 2021-03-06 Julian Brown <julian@codesourcery.com>
29691 * tree-pretty-print.c (dump_generic_node): Emit non-generic
29692 address space info for aggregates.
29694 2021-03-06 Hans-Peter Nilsson <hp@axis.com>
29696 * config/cris/cris.h (MAX_FIXED_MODE_SIZE): Don't define.
29698 2021-03-05 Jakub Jelinek <jakub@redhat.com>
29700 PR middle-end/99322
29701 * tree-cfg.c (bb_to_omp_idx): New variable.
29702 (execute_build_cfg): Release the bb_to_omp_idx vector after
29703 cleanup_tree_cfg returns.
29704 (handle_abnormal_edges): Remove bb_to_omp_idx argument, adjust
29705 for bb_to_omp_idx being a vec<int> instead of pointer to array
29707 (make_edges): Remove bb_to_omp_idx local variable, don't pass
29708 it to handle_abnormal_edges, adjust for bb_to_omp_idx being a
29709 vec<int> instead of pointer to array of ints and don't free/release
29711 (remove_bb): When removing a bb and placing forced label somewhere
29712 else, ensure it is put into the same OpenMP region during cfg
29713 pass if possible or to entry successor as fallback. Unregister
29714 bb from bb_to_omp_idx.
29716 2021-03-05 Vladimir N. Makarov <vmakarov@redhat.com>
29719 * lra-constraints.c (process_address_1): Skip decomposing address
29720 for asm insn operand with unknown constraint.
29722 2021-03-05 Martin Jambor <mjambor@suse.cz>
29725 * cgraph.c (cgraph_edge::set_call_stmt): Do not update all
29726 corresponding speculative edges if we are about to resolve
29727 sepculation. Make edge direct (and so resolve speculations) before
29728 removing it from call_site_hash.
29729 (cgraph_edge::make_direct): Relax the initial assert to allow calling
29730 the function on speculative direct edges.
29732 2021-03-05 Eric Botcazou <ebotcazou@adacore.com>
29734 PR rtl-optimization/99376
29735 * rtlanal.c (nonzero_bits1) <arithmetic operators>: If the number
29736 of low-order zero bits is too large, set the result to 0 directly.
29738 2021-03-04 Jakub Jelinek <jakub@redhat.com>
29740 PR middle-end/93235
29741 * expmed.c (store_bit_field_using_insv): Return false of xop0 is a
29742 SUBREG and a SUBREG to op_mode can't be created.
29744 2021-03-04 Alex Coplan <alex.coplan@arm.com>
29747 * config/aarch64/aarch64-sve-builtins.cc
29748 (function_resolver::require_vector_type): Handle error_mark_node.
29750 2021-03-04 Ilya Leoshkevich <iii@linux.ibm.com>
29752 * cfgexpand.c (expand_asm_loc): Pass new parameter.
29753 (expand_asm_stmt): Likewise.
29754 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Add new
29756 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
29757 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
29758 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
29759 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
29760 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
29761 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
29762 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
29763 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
29764 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
29765 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
29766 * doc/tm.texi (md_asm_adjust): Likewise.
29767 * target.def (md_asm_adjust): Likewise.
29769 2021-03-04 Richard Biener <rguenther@suse.de>
29771 PR middle-end/97855
29772 * tree-pretty-print.c: Poison pp_printf.
29773 (dump_decl_name): Avoid use of pp_printf.
29774 (dump_block_node): Likewise.
29775 (dump_generic_node): Likewise.
29777 2021-03-04 Martin Sebor <msebor@redhat.com>
29779 PR middle-end/96963
29780 PR middle-end/94655
29781 * builtins.c (handle_array_ref): New helper.
29782 (handle_mem_ref): New helper.
29783 (compute_objsize_r): Factor out ARRAY_REF and MEM_REF handling
29784 into new helper functions. Correct a workaround for vectorized
29787 2021-03-03 Pat Haugen <pthaugen@linux.ibm.com>
29789 * config/rs6000/dfp.md (extendddtd2, trunctddd2, *cmp<mode>_internal1,
29790 floatditd2, ftrunc<mode>2, fix<mode>di2, dfp_ddedpd_<mode>,
29791 dfp_denbcd_<mode>, dfp_dxex_<mode>, dfp_diex_<mode>,
29792 *dfp_sgnfcnc_<mode>, dfp_dscli_<mode>, dfp_dscri_<mode>): Update size
29793 attribute for Power10.
29794 * config/rs6000/mma.md (*movoo): Likewise.
29795 * config/rs6000/rs6000.md (define_attr "size"): Add 256.
29796 (define_mode_attr bits): Add DD/TD modes.
29797 * config/rs6000/sync.md (load_quadpti, store_quadpti, load_lockedpti,
29798 store_conditionalpti): Update size attribute for Power10.
29800 2021-03-03 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
29803 * config/sparc/t-sparc (tree-ssanames.o-warn): Don't error for
29804 -Wuninitialized, -Wmaybe-uninitialized.
29805 (wide-int.o-warn): Likewise.
29807 2021-03-03 Richard Earnshaw <rearnsha@arm.com>
29809 * common/config/arm/arm-common.c: Include configargs.h.
29810 (arm_config_default): New function.
29811 (arm_target_mode): Renamed from arm_target_thumb_only. Handle
29812 processors that do not support Thumb. Take into account the
29813 --with-mode configuration setting for selecting the default.
29814 * config/arm/arm.h (OPTION_DEFAULT_SPECS): Remove entry for 'mode'.
29815 (TARGET_MODE_SPEC_FUNCTIONS): Update for function name change.
29817 2021-03-03 Martin Liska <mliska@suse.cz>
29819 PR gcov-profile/97461
29820 * gcov-io.h (GCOV_PREALLOCATED_KVP): Remove.
29822 2021-03-03 Eric Botcazou <ebotcazou@adacore.com>
29825 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
29826 point back the hard frame pointer to its default location when the
29827 frame is larger than SEH_MAX_FRAME_SIZE.
29829 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29832 * config/i386/predicates.md (logic_operator): New define_predicate.
29833 * config/i386/i386.md (mov + mem using comm arith peephole2):
29834 Punt if operands[1] is EXT_REX_SSE_REGNO_P, AVX512BW is not enabled
29835 and the inner mode is [QH]Imode.
29837 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29840 * dwarf2out.c (dw_loc_list_struct): Add end_entry member.
29841 (new_loc_list): Clear end_entry.
29842 (output_loc_list): Only use DW_LLE_startx_length for -gsplit-dwarf
29843 if HAVE_AS_LEB128, otherwise use DW_LLE_startx_endx. Fix comment
29845 (index_location_lists): For dwarf_version >= 5 without HAVE_AS_LEB128,
29846 initialize also end_entry.
29848 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29851 * cfgrtl.c (fixup_partitions): When changing some bbs from hot to cold
29852 partitions, if in non-layout mode after reorder_blocks also move
29853 affected blocks to ensure a single partition transition.
29855 2021-03-03 Jason Merrill <jason@redhat.com>
29858 * cgraphunit.c (process_function_and_variable_attributes): Don't
29859 warn about flatten on an alias if the target also has it.
29860 * cgraph.h (symtab_node::get_alias_target_tree): New.
29862 2021-03-02 David Edelsohn <dje.gcc@gmail.com>
29864 * config/rs6000/rs6000.md (tls_get_tpointer_internal): Prepend
29865 period to symbol name.
29866 (tls_get_addr_internal<mode>): Same.
29868 2021-03-02 David Malcolm <dmalcolm@redhat.com>
29871 * diagnostic-show-locus.c
29872 (selftest::test_one_liner_many_fixits_2): Fix accidental usage of
29875 2021-03-02 Martin Sebor <msebor@redhat.com>
29877 PR middle-end/99276
29878 * builtins.c (warn_for_access): Remove stray warning text.
29880 2021-03-02 Martin Sebor <msebor@redhat.com>
29882 PR middle-end/99295
29883 * doc/extend.texi (attribute malloc): Reword and clarify nonaliasing
29886 2021-03-02 Jakub Jelinek <jakub@redhat.com>
29889 * dwarf2out.c (output_macinfo_op): Use DW_MACRO_*_str* even with
29890 -gdwarf-5 -gstrict-dwarf. For -gsplit-dwarf -gdwarf-5 use
29891 DW_MACRO_*_strx instead of DW_MACRO_*_strp. Handle
29892 DW_MACRO_define_strx and DW_MACRO_undef_strx.
29893 (save_macinfo_strings): Use DW_MACRO_*_str* even with
29894 -gdwarf-5 -gstrict-dwarf. Handle DW_MACRO_define_strx and
29895 DW_MACRO_undef_strx.
29897 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
29899 * config/s390/s390-builtin-types.def (BT_FN_V4SF_V8HI_UINT): New
29901 (BT_FN_V8HI_V8HI_UINT): Likewise.
29902 (BT_FN_V8HI_V4SF_V4SF_UINT): Likewise.
29903 * config/s390/s390-builtins.def (B_NNPA): New macro definition.
29904 (s390_vclfnhs, s390_vclfnls, s390_vcrnfs, s390_vcfn, s390_vcnf):
29905 New builtin definitions.
29906 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Bump
29907 vector extension version.
29908 * config/s390/s390.c (s390_expand_builtin): Check if builtins are
29909 available with current -march level.
29910 * config/s390/s390.md (UNSPEC_NNPA_VCLFNHS_V8HI)
29911 (UNSPEC_NNPA_VCLFNLS_V8HI, UNSPEC_NNPA_VCRNFS_V8HI)
29912 (UNSPEC_NNPA_VCFN_V8HI, UNSPEC_NNPA_VCNF_V8HI): New constants.
29913 * config/s390/vecintrin.h (vec_extend_to_fp32_hi): New macro.
29914 (vec_extend_to_fp32_lo): Likewise.
29915 (vec_round_from_fp32): Likewise.
29916 (vec_convert_to_fp16): Likewise.
29917 (vec_convert_from_fp16): Likewise.
29918 * config/s390/vx-builtins.md (vclfnhs_v8hi): New insn pattern.
29919 (vclfnls_v8hi): Likewise.
29920 (vcrnfs_v8hi): Likewise.
29921 (vcfn_v8hi): Likewise.
29922 (vcnf_v8hi): Likewise.
29924 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
29926 * common/config/s390/s390-common.c (processor_flags_table): New entry.
29927 * config.gcc: Enable arch14 for --with-arch and --with-tune.
29928 * config/s390/driver-native.c (s390_host_detect_local_cpu): Pick
29929 arch14 for unknown CPU models.
29930 * config/s390/s390-opts.h (enum processor_type): Add PROCESSOR_ARCH14.
29931 * config/s390/s390.c (s390_issue_rate): Add case for PROCESSOR_ARCH14.
29932 (s390_get_sched_attrmask): Likewise.
29933 (s390_get_unit_mask): Likewise.
29934 * config/s390/s390.h (enum processor_flags): Add PF_NNPA and PF_ARCH14.
29935 (TARGET_CPU_ARCH14, TARGET_CPU_ARCH14_P, TARGET_CPU_NNPA)
29936 (TARGET_CPU_NNPA_P, TARGET_ARCH14, TARGET_ARCH14_P, TARGET_NNPA)
29937 (TARGET_NNPA_P): New macro definitions.
29938 * config/s390/s390.md ("cpu_facility", "enabled"): Add arch14 and nnpa.
29939 * config/s390/s390.opt: Add PROCESSOR_ARCH14.
29941 2021-03-02 Jakub Jelinek <jakub@redhat.com>
29943 PR middle-end/95757
29944 * tree-vrp.c (register_edge_assert_for): Remove superfluous ()s around
29945 condition. Call register_edge_assert_for_1 for == 0, != 0, == 1 and
29946 != 1 comparisons if name is lhs of a comparison.
29948 2021-03-01 Iain Sandoe <iain@sandoe.co.uk>
29952 * config/darwin-protos.h (darwin_should_restore_cfa_state): New.
29953 * config/darwin.c (darwin_should_restore_cfa_state): New.
29954 * config/darwin.h (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New.
29955 * doc/tm.texi: Regenerated.
29956 * doc/tm.texi.in: Document TARGET_ASM_SHOULD_RESTORE_CFA_STATE.
29957 * dwarf2cfi.c (connect_traces): If the target requests, restore
29958 the CFA expression after a DW_CFA_restore.
29959 * target.def (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New hook.
29961 2021-03-01 Martin Liska <mliska@suse.cz>
29964 * optc-save-gen.awk: Add 4 more exceptions.
29966 2021-03-01 Nathan Sidwell <nathan@acm.org>
29969 * tree.h (TYPE_ALIGN_RAW): New accessor.
29970 (TYPE_ALIGN): Use it.
29972 2021-03-01 Jan Hubicka <jh@suse.cz>
29975 * ipa-fnsummary.c (compute_fn_summary): Fix sanity check.
29977 2021-03-01 Eric Botcazou <ebotcazou@adacore.com>
29980 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
29981 point the hard frame pointer to the SSE register save area instead
29982 of the general register save area. Perform only minimal adjustment
29983 for small frames if it is initially not correctly aligned.
29984 (ix86_expand_prologue): Remove early saves for a SEH target.
29985 * config/i386/winnt.c (struct seh_frame_state): Document constraint.
29987 2021-02-28 Jakub Jelinek <jakub@redhat.com>
29990 * ipa.c (symbol_table::remove_unreachable_nodes): Fix a comment
29991 typo - referneced -> referenced.
29992 * tree.c (component_ref_size): Fix comment typo -
29993 refernce -> reference.
29994 * tree-ssa-alias.c (access_path_may_continue_p): Fix comment typo -
29995 traling -> trailing.
29996 (aliasing_component_refs_p): Fix comment typos -
29997 refernce -> reference and refernece -> reference and
29998 traling -> trailing.
29999 (nonoverlapping_refs_since_match_p): Fix comment typo -
30000 referneces -> references.
30001 * doc/invoke.texi (--param modref-max-bases): Fix a typo -
30002 referneces -> references.
30004 2021-02-27 Iain Sandoe <iain@sandoe.co.uk>
30006 * config/host-darwin.c (darwin_gt_pch_use_address): Modify
30007 diagnostic message to avoid use of a contraction and format
30010 2021-02-27 Jakub Jelinek <jakub@redhat.com>
30013 * gcse.c (gcse_or_cprop_is_too_expensive): Use %wu instead of
30014 HOST_WIDE_INT_PRINT_UNSIGNED in warning format string.
30015 * ipa-devirt.c (ipa_odr_read_section): Use %wd instead of
30016 HOST_WIDE_INT_PRINT_DEC in inform format string. Fix comment
30019 2021-02-26 Richard Biener <rguenther@suse.de>
30021 PR middle-end/99281
30022 * expr.c (store_field): For calls with return-slot optimization
30023 and addressable return type expand the store directly.
30025 2021-02-26 Richard Biener <rguenther@suse.de>
30028 * builtins.c (warn_string_no_nul): Fix diagnostic formatting.
30030 2021-02-26 Peter Bergner <bergner@linux.ibm.com>
30033 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Replace assert
30036 2021-02-26 Aaron Sawdey <acsawdey@linux.ibm.com>
30038 * config.gcc: Add rs6000-pcrel-opt.o.
30039 * config/rs6000/rs6000-pcrel-opt.c: New file.
30040 * config/rs6000/pcrel-opt.md: New file.
30041 * config/rs6000/predicates.md: Add d_form_memory predicate.
30042 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_PCREL_OPT.
30043 * config/rs6000/rs6000-passes.def: Add pass_pcrel_opt.
30044 * config/rs6000/rs6000-protos.h: Add reg_to_non_prefixed(),
30045 pcrel_opt_valid_mem_p(), output_pcrel_opt_reloc(),
30046 and make_pass_pcrel_opt().
30047 * config/rs6000/rs6000.c (reg_to_non_prefixed): Make global.
30048 (rs6000_option_override_internal): Add pcrel-opt.
30049 (rs6000_delegitimize_address): Support pcrel-opt.
30050 (rs6000_opt_masks): Add pcrel-opt.
30051 (pcrel_opt_valid_mem_p): New function.
30052 (reg_to_non_prefixed): Make global.
30053 (rs6000_asm_output_opcode): Reset prepend_p_to_next_insn.
30054 (output_pcrel_opt_reloc): New function.
30055 * config/rs6000/rs6000.md (loads_extern_addr): New attr.
30056 (pcrel_extern_addr): Set loads_extern_addr.
30057 Add include for pcrel-opt.md.
30058 * config/rs6000/rs6000.opt: Add -mpcrel-opt.
30059 * config/rs6000/t-rs6000: Add rules for pcrel-opt.c and
30062 2021-02-26 YunQiang Su <yunqiang.su@cipunited.com>
30065 * config/mips/mips.c (mips_expand_ext_as_unaligned_load):
30066 If TARGET_64BIT and dest is SUBREG, we check the width, if it
30067 equal to SImode, we use SImode operation, just like what we are
30070 2021-02-26 Marek Polacek <polacek@redhat.com>
30072 * builtins.c (warn_for_access): Fix typos.
30074 2021-02-25 Iain Sandoe <iain@sandoe.co.uk>
30076 * config/aarch64/aarch64.md (<optab>_rol<mode>3): Add a '#'
30077 mark in front of the immediate quantity.
30078 (<optab>_rolsi3_uxtw): Likewise.
30080 2021-02-25 Richard Earnshaw <rearnsha@arm.com>
30083 * config/arm/thumb2.md (nonsecure_call_reg_thumb2_fpcxt): New pattern.
30084 (nonsecure_call_value_reg_thumb2_fpcxt): Likewise.
30085 (nonsecure_call_reg_thumb2): Restrict to using r4 for the callee
30086 address and disable when the FPCXT is not available.
30087 (nonsecure_call_value_reg_thumb2): Likewise.
30089 2021-02-25 Nathan Sidwell <nathan@acm.org>
30092 * doc/invoke.texi (flang-info-module-cmi): Renamed option.
30094 2021-02-25 Tamar Christina <tamar.christina@arm.com>
30096 * tree-vect-slp.c (optimize_load_redistribution_1): Abort on NULL nodes.
30098 2021-02-25 Richard Biener <rguenther@suse.de>
30100 PR tree-optimization/99253
30101 * tree-vect-loop.c (check_reduction_path): First compute
30102 code, then verify out-of-loop uses.
30104 2021-02-25 Jakub Jelinek <jakub@redhat.com>
30107 * match.pd ((T)(A) + CST -> (T)(A + CST)): Add :s to convert.
30109 2021-02-25 Jakub Jelinek <jakub@redhat.com>
30111 PR tree-optimization/80635
30112 * tree-vrp.c (vrp_simplify_cond_using_ranges): Also handle
30113 VIEW_CONVERT_EXPR if modes are the same, innerop is integral and
30114 has mode precision.
30116 2021-02-25 Richard Biener <rguenther@suse.de>
30118 * tree-vect-slp.c (optimize_load_redistribution_1): Delay
30119 load_map population.
30120 (vect_match_slp_patterns_2): Revert part of last change.
30121 (vect_analyze_slp): Do not interleave optimize_load_redistribution
30122 with pattern detection but do it afterwards. Dump the
30123 whole SLP graph after pattern recognition and load
30124 redistribution optimization finished.
30126 2021-02-24 Jakub Jelinek <jakub@redhat.com>
30129 * omp-low.c (struct omp_context): Add teams_nested_p and
30130 nonteams_nested_p members.
30131 (scan_omp_target): Diagnose teams nested inside of target with other
30132 directives strictly nested inside of the same target.
30133 (check_omp_nesting_restrictions): Set ctx->teams_nested_p or
30134 ctx->nonteams_nested_p as needed.
30136 2021-02-24 Vladimir N. Makarov <vmakarov@redhat.com>
30138 PR inline-asm/99123
30139 * lra-constraints.c (uses_hard_regs_p): Don't use decompose_mem_address.
30141 2021-02-24 Hans-Peter Nilsson <hp@axis.com>
30143 * config/cris/cris.c (cris_expand_prologue): Set
30144 current_function_static_stack_size, if flag_stack_usage_info.
30146 2021-02-24 Pat Haugen <pthaugen@linux.ibm.com>
30148 * config/rs6000/rs6000.c (next_insn_prefixed_p): Rename.
30149 (rs6000_final_prescan_insn): Adjust.
30150 (rs6000_asm_output_opcode): Likewise.
30152 2021-02-24 Martin Sebor <msebor@redhat.com>
30154 PR middle-end/97172
30155 * attribs.c (attr_access::free_lang_data): Clear attribute arg spec
30156 from function arguments.
30158 2021-02-24 Tamar Christina <tamar.christina@arm.com>
30160 PR tree-optimization/99220
30161 * tree-vect-slp.c (optimize_load_redistribution_1): Remove
30162 node from cache when it's about to be deleted.
30164 2021-02-24 Jakub Jelinek <jakub@redhat.com>
30166 PR tree-optimization/99225
30167 * fold-const.c (fold_binary_loc) <case NE_EXPR>: In (x & (1 << y)) != 0
30168 to ((x >> y) & 1) != 0 simplifications use build_one_cst instead of
30169 build_int_cst (..., 1). Formatting fixes.
30171 2021-02-24 Tamar Christina <tamar.christina@arm.com>
30173 PR tree-optimization/99149
30174 * tree-vect-slp-patterns.c (vect_detect_pair_op): Don't recreate the
30176 (vect_slp_reset_pattern): Remove.
30177 (complex_fma_pattern::matches): Remove call to vect_slp_reset_pattern.
30178 (complex_mul_pattern::build, complex_fma_pattern::build,
30179 complex_fms_pattern::build): Fix ref counts.
30180 * tree-vect-slp.c (vect_free_slp_tree): Undo SLP only pattern relevancy
30181 when node is being deleted.
30182 (vect_match_slp_patterns_2): Correct result of cache hit on patterns.
30183 (vect_schedule_slp): Invalidate SLP_TREE_REPRESENTATIVE of removed
30185 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize value.
30187 2021-02-24 Matthias Klose <doko@ubuntu.com>
30190 2020-12-07 Matthias Klose <doko@ubuntu.com>
30192 * genextract.c (print_header): Undefine ENABLE_RTL_CHECKING
30193 and ENABLE_RTL_FLAG_CHECKING.
30195 2021-02-24 Richard Biener <rguenther@suse.de>
30198 * builtins.c (fold_builtin_next_arg): Avoid NULL arg.
30200 2021-02-23 Peter Bergner <bergner@linux.ibm.com>
30202 * config/rs6000/mma.md (mma_assemble_pair): Rename from this...
30203 (vsx_assemble_pair): ...to this.
30204 (*mma_assemble_pair): Rename from this...
30205 (*vsx_assemble_pair): ...to this.
30206 (mma_disassemble_pair): Rename from this...
30207 (vsx_disassemble_pair): ...to this.
30208 (*mma_disassemble_pair): Rename from this...
30209 (*vsx_disassemble_pair): ...to this.
30210 * config/rs6000/rs6000-builtin.def (BU_MMA_V2, BU_MMA_V3,
30211 BU_COMPAT): New macros.
30212 (mma_assemble_pair): Rename from this...
30213 (vsx_assemble_pair): ...to this.
30214 (mma_disassemble_pair): Rename from this...
30215 (vsx_disassemble_pair): ...to this.
30216 (mma_assemble_pair): New compatibility built-in.
30217 (mma_disassemble_pair): Likewise.
30218 * config/rs6000/rs6000-call.c (struct builtin_compatibility): New.
30219 (RS6000_BUILTIN_COMPAT): Define.
30220 (bdesc_compat): New.
30221 (mma_expand_builtin): Use VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
30222 (rs6000_gimple_fold_mma_builtin): Use MMA_BUILTIN_DISASSEMBLE_PAIR
30223 and VSX_BUILTIN_ASSEMBLE_PAIR.
30224 (rs6000_init_builtins): Register compatibility built-ins.
30225 (mma_init_builtins): Use VSX_BUILTIN_ASSEMBLE_PAIR,
30226 VSX_BUILTIN_ASSEMBLE_PAIR_INTERNAL, VSX_BUILTIN_DISASSEMBLE_PAIR and
30227 VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
30228 * doc/extend.texi (__builtin_mma_assemble_pair): Rename from this...
30229 (__builtin_vsx_assemble_pair): ...to this.
30230 (__builtin_mma_disassemble_pair): Rename from this...
30231 (__builtin_vsx_disassemble_pair): ...to this.
30233 2021-02-23 Martin Liska <mliska@suse.cz>
30236 * ipa-icf.c (sem_variable::merge): Do not merge 2 variables
30237 with different alignment. That leads to an invalid red zone
30238 size allocated in runtime.
30240 2021-02-23 Jakub Jelinek <jakub@redhat.com>
30242 PR tree-optimization/99204
30243 * fold-const.c (fold_read_from_constant_string): Check that
30244 tree_fits_uhwi_p (index) rather than just that index is INTEGER_CST.
30246 2021-02-23 Segher Boessenkool <segher@kernel.crashing.org>
30247 Kewen Lin <linkw@gcc.gnu.org>
30249 * config/rs6000/rs6000.md (*rotl<mode>3_insert_3): Renamed to...
30250 (rotl<mode>3_insert_3): ...this.
30251 (plus_ior_xor): New code_iterator.
30252 (define_split for GPR rl*imi): New splitter.
30253 * config/rs6000/vsx.md (vsx_init_v4si): Use gen_rotldi3_insert_3
30254 for integer merging.
30256 2021-02-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30258 * config/aarch64/aarch64-tuning-flags.def (cse_sve_vl_constants):
30260 * config/aarch64/aarch64.md (add<mode>3): Force CONST_POLY_INT immediates
30261 into a register when the above is enabled.
30262 * config/aarch64/aarch64.c (neoversev1_tunings):
30263 AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
30264 (aarch64_rtx_costs): Use AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
30266 2021-02-22 Hans-Peter Nilsson <hp@axis.com>
30268 * config/cris/cris.c (cris_print_operand) <'T'>: Change
30269 valid operand from is now an addi mult-value to shift-value.
30270 * config/cris/cris.md (*addi): Change expression of scaled
30271 operand from mult to ashift.
30272 * config/cris/cris.md (*addi_reload): New insn_and_split.
30274 2021-02-22 John David Anglin <danglin@gcc.gnu.org>
30277 * config/pa/pa.c (TARGET_ASM_CAN_OUTPUT_MI_THUNK): Define as
30278 hook_bool_const_tree_hwi_hwi_const_tree_true.
30279 (pa_asm_output_mi_thunk): Add support for nonzero vcall_offset.
30281 2021-02-22 Andre Vieira <andre.simoesdiasvieira@arm.com>
30283 PR rtl-optimization/98791
30284 * ira-conflicts.c (process_regs_for_copy): Don't create allocno copies
30285 for unordered modes.
30287 2021-02-22 Martin Liska <mliska@suse.cz>
30289 * tree-inline.c (inline_forbidden_p): Set
30290 inline_forbidden_reason.
30292 2021-02-22 Richard Biener <rguenther@suse.de>
30294 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Dump
30297 2021-02-22 Richard Biener <rguenther@suse.de>
30299 PR tree-optimization/99165
30300 * gimple-ssa-store-merging.c (pass_store_merging::process_store):
30301 Accumulate changed to ret.
30303 2021-02-21 Uros Bizjak <ubizjak@gmail.com>
30306 2020-12-09 Uroš Bizjak <ubizjak@gmail.com>
30308 * config/i386/i386.h (REG_ALLOC_ORDER): Remove
30310 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
30313 * config/s390/vector.md (trunctf<DFP_ALL:mode>2_vr): New
30315 (trunctf<DFP_ALL:mode>2): Likewise.
30316 (trunctdtf2_vr): Likewise.
30317 (trunctdtf2): Likewise.
30318 (extend<DFP_ALL:mode>tf2_vr): Likewise.
30319 (extend<DFP_ALL:mode>tf2): Likewise.
30320 (extendtftd2_vr): Likewise.
30321 (extendtftd2): Likewise.
30323 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
30325 * config/s390/vector.md (*fprx2_to_tf): Rename to fprx2_to_tf,
30326 add memory alternative.
30327 (tf_to_fprx2): New pattern.
30329 2021-02-19 Martin Sebor <msebor@redhat.com>
30332 * attribs.c (init_attr_rdwr_indices): Guard vblist use.
30333 (attr_access::free_lang_data): Remove a spurious test.
30335 2021-02-19 Nathan Sidwell <nathan@acm.org>
30337 * doc/invoke.texi (flang-info-module-read): Document.
30339 2021-02-19 Martin Liska <mliska@suse.cz>
30341 PR translation/99167
30342 * params.opt: Fix typo.
30344 2021-02-19 Richard Biener <rguenther@suse.de>
30346 PR middle-end/99122
30347 * tree-inline.c (inline_forbidden_p): Do not inline functions
30348 with VLA arguments or return value.
30350 2021-02-19 Jakub Jelinek <jakub@redhat.com>
30353 * config/arm/arm.md (*stack_protect_combined_set_insn,
30354 *stack_protect_combined_test_insn): If force_const_mem result
30355 is not valid general operand, force its address into the destination
30358 2021-02-19 Jakub Jelinek <jakub@redhat.com>
30361 * tree-cfg.c (gimple_merge_blocks): If bb a starts with eh landing
30362 pad or non-local label, put FORCED_LABELs from bb b after that label
30363 rather than before it.
30365 2021-02-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
30368 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3): Use
30369 expand_vector_broadcast' to emit the vec_duplicate operand.
30371 2021-02-18 Vladimir N. Makarov <vmakarov@redhat.com>
30373 PR rtl-optimization/96264
30374 * lra-remat.c (reg_overlap_for_remat_p): Check also output insn
30377 2021-02-18 H.J. Lu <hjl.tools@gmail.com>
30380 * varasm.c (get_section): Replace SUPPORTS_SHF_GNU_RETAIN with
30381 looking up the retain attribute.
30382 (resolve_unique_section): Likewise.
30383 (get_variable_section): Likewise.
30384 (switch_to_section): Likewise. Warn when a symbol without the
30385 retain attribute and a symbol with the retain attribute are
30386 placed in the section with the same name, instead of the used
30388 * doc/extend.texi: Document the "retain" attribute.
30390 2021-02-18 Nathan Sidwell <nathan@acm.org>
30393 * doc/invoke.texi (flang-info-include-translate): Document header
30396 2021-02-18 Richard Biener <rguenther@suse.de>
30398 PR middle-end/99122
30399 * ipa-fnsummary.c (analyze_function_body): Set
30400 CIF_FUNCTION_NOT_INLINABLE for VLA parameter calls.
30401 * tree-inline.c (insert_init_debug_bind): Pass NULL for
30402 error_mark_node values.
30403 (force_value_to_type): Do not build V_C_Es for WITH_SIZE_EXPR
30405 (setup_one_parameter): Delay force_value_to_type until when
30408 2021-02-18 Hans-Peter Nilsson <hp@axis.com>
30410 PR tree-optimization/99142
30411 * match.pd (clz cmp 0): Gate replacement on single_use of clz result.
30413 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30415 * wide-int-bitmask.h (wide_int_bitmask::wide_int_bitmask (),
30416 wide_int_bitmask::wide_int_bitmask (uint64_t),
30417 wide_int_bitmask::wide_int_bitmask (uint64_t, uint64_t),
30418 wide_int_bitmask::operator ~ () const,
30419 wide_int_bitmask::operator | (wide_int_bitmask) const,
30420 wide_int_bitmask::operator & (wide_int_bitmask) const): Use constexpr
30422 * config/i386/i386.h (PTA_3DNOW, PTA_3DNOW_A, PTA_64BIT, PTA_ABM,
30423 PTA_AES, PTA_AVX, PTA_BMI, PTA_CX16, PTA_F16C, PTA_FMA, PTA_FMA4,
30424 PTA_FSGSBASE, PTA_LWP, PTA_LZCNT, PTA_MMX, PTA_MOVBE, PTA_NO_SAHF,
30425 PTA_PCLMUL, PTA_POPCNT, PTA_PREFETCH_SSE, PTA_RDRND, PTA_SSE, PTA_SSE2,
30426 PTA_SSE3, PTA_SSE4_1, PTA_SSE4_2, PTA_SSE4A, PTA_SSSE3, PTA_TBM,
30427 PTA_XOP, PTA_AVX2, PTA_BMI2, PTA_RTM, PTA_HLE, PTA_PRFCHW, PTA_RDSEED,
30428 PTA_ADX, PTA_FXSR, PTA_XSAVE, PTA_XSAVEOPT, PTA_AVX512F, PTA_AVX512ER,
30429 PTA_AVX512PF, PTA_AVX512CD, PTA_NO_TUNE, PTA_SHA, PTA_PREFETCHWT1,
30430 PTA_CLFLUSHOPT, PTA_XSAVEC, PTA_XSAVES, PTA_AVX512DQ, PTA_AVX512BW,
30431 PTA_AVX512VL, PTA_AVX512IFMA, PTA_AVX512VBMI, PTA_CLWB, PTA_MWAITX,
30432 PTA_CLZERO, PTA_NO_80387, PTA_PKU, PTA_AVX5124VNNIW, PTA_AVX5124FMAPS,
30433 PTA_AVX512VPOPCNTDQ, PTA_SGX, PTA_AVX512VNNI, PTA_GFNI, PTA_VAES,
30434 PTA_AVX512VBMI2, PTA_VPCLMULQDQ, PTA_AVX512BITALG, PTA_RDPID,
30435 PTA_PCONFIG, PTA_WBNOINVD, PTA_AVX512VP2INTERSECT, PTA_PTWRITE,
30436 PTA_AVX512BF16, PTA_WAITPKG, PTA_MOVDIRI, PTA_MOVDIR64B, PTA_ENQCMD,
30437 PTA_CLDEMOTE, PTA_SERIALIZE, PTA_TSXLDTRK, PTA_AMX_TILE, PTA_AMX_INT8,
30438 PTA_AMX_BF16, PTA_UINTR, PTA_HRESET, PTA_KL, PTA_WIDEKL, PTA_AVXVNNI,
30439 PTA_X86_64_BASELINE, PTA_X86_64_V2, PTA_X86_64_V3, PTA_X86_64_V4,
30440 PTA_CORE2, PTA_NEHALEM, PTA_WESTMERE, PTA_SANDYBRIDGE, PTA_IVYBRIDGE,
30441 PTA_HASWELL, PTA_BROADWELL, PTA_SKYLAKE, PTA_SKYLAKE_AVX512,
30442 PTA_CASCADELAKE, PTA_COOPERLAKE, PTA_CANNONLAKE, PTA_ICELAKE_CLIENT,
30443 PTA_ICELAKE_SERVER, PTA_TIGERLAKE, PTA_SAPPHIRERAPIDS, PTA_ALDERLAKE,
30444 PTA_KNL, PTA_BONNELL, PTA_SILVERMONT, PTA_GOLDMONT, PTA_GOLDMONT_PLUS,
30445 PTA_TREMONT, PTA_KNM): Use constexpr instead of const.
30447 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30449 PR middle-end/99109
30450 * gimple-array-bounds.cc (build_zero_elt_array_type): Rename to ...
30451 (build_printable_array_type): ... this. Add nelts argument. For
30452 overaligned eltype, use TYPE_MAIN_VARIANT (eltype) instead. If
30453 nelts, call build_array_type_nelts.
30454 (array_bounds_checker::check_mem_ref): Use build_printable_array_type
30455 instead of build_zero_elt_array_type and build_array_type_nelts.
30457 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30460 * config/i386/i386.c (distance_non_agu_define): Don't call
30461 extract_insn_cached here.
30462 (ix86_lea_outperforms): Save and restore recog_data around call
30463 to distance_non_agu_define and distance_agu_use.
30464 (ix86_ok_to_clobber_flags): Remove.
30465 (ix86_avoid_lea_for_add): Don't call ix86_ok_to_clobber_flags.
30466 (ix86_avoid_lea_for_addr): Likewise. Adjust function comment.
30467 * config/i386/i386.md (*lea<mode>): Change from define_insn_and_split
30468 into define_insn. Move the splitting to define_peephole2 and
30469 check there using peep2_regno_dead_p if FLAGS_REG is dead.
30471 2021-02-17 Julian Brown <julian@codesourcery.com>
30473 * gimplify.c (gimplify_scan_omp_clauses): Handle ATTACH_DETACH
30476 2021-02-17 Xi Ruoyao <xry111@mengyan1223.wang>
30479 * config/mips/mips.c (mips_symbol_insns): Do not use
30480 MSA_SUPPORTED_MODE_P if mode is MAX_MACHINE_MODE.
30482 2021-02-16 Vladimir N. Makarov <vmakarov@redhat.com>
30484 PR inline-asm/98096
30485 * stmt.c (resolve_operand_name_1): Take inout operands into account
30486 for access to labels by names.
30487 * doc/extend.texi: Describe counting operands for accessing labels.
30489 2021-02-16 Richard Biener <rguenther@suse.de>
30491 PR tree-optimization/38474
30492 * tree-ssa-structalias.c (variable_info::address_taken): New.
30493 (new_var_info): Initialize address_taken.
30494 (process_constraint): Set address_taken.
30495 (solve_constraints): Use the new address_taken flag rather
30496 than is_reg_var for sorting variables.
30497 (dump_constraint): Dump the variable number if the name
30500 2021-02-16 Jakub Jelinek <jakub@redhat.com>
30503 * tree-vect-stmts.c (vectorizable_simd_clone_call): For num_calls != 1
30504 multiply by 4096 and for inbranch by 8192.
30505 * config/i386/i386.c (ix86_simd_clone_usable): For TARGET_AVX512F,
30506 return 3, 2 or 1 for mangle letters 'b', 'c' or 'd'.
30508 2021-02-15 Maya Rashish <coypu@sdf.org>
30510 * config/aarch64/aarch64.c (aarch64_init_builtins):
30511 Call SUBTARGET_INIT_BUILTINS.
30513 2021-02-15 Peter Bergner <bergner@linux.ibm.com>
30515 PR rtl-optimization/98872
30516 * init-regs.c (initialize_uninitialized_regs): Skip initialization
30517 if CONST0_RTX is NULL.
30519 2021-02-15 Richard Sandiford <richard.sandiford@arm.com>
30521 PR rtl-optimization/98863
30522 * rtl-ssa/functions.h (function_info::bb_live_out_info): Delete.
30523 (function_info::build_info): Turn into a declaration, moving the
30524 definition to internals.h.
30525 (function_info::bb_walker): Declare.
30526 (function_info::create_reg_use): Likewise.
30527 (function_info::calculate_potential_phi_regs): Take a build_info
30529 (function_info::place_phis, function_info::create_ebbs): Declare.
30530 (function_info::calculate_ebb_live_in_for_debug): Likewise.
30531 (function_info::populate_backedge_phis): Delete.
30532 (function_info::start_block, function_info::end_block): Declare.
30533 (function_info::populate_phi_inputs): Delete.
30534 (function_info::m_potential_phi_regs): Move information to build_info.
30535 * rtl-ssa/internals.h: New file.
30536 (function_info::bb_phi_info): New class.
30537 (function_info::build_info): Moved from functions.h.
30538 Add a constructor and destructor.
30539 (function_info::build_info::ebb_use): Delete.
30540 (function_info::build_info::ebb_def): Likewise.
30541 (function_info::build_info::bb_live_out): Likewise.
30542 (function_info::build_info::tmp_ebb_live_in_for_debug): New variable.
30543 (function_info::build_info::potential_phi_regs): Likewise.
30544 (function_info::build_info::potential_phi_regs_for_debug): Likewise.
30545 (function_info::build_info::ebb_def_regs): Likewise.
30546 (function_info::build_info::bb_phis): Likewise.
30547 (function_info::build_info::bb_mem_live_out): Likewise.
30548 (function_info::build_info::bb_to_rpo): Likewise.
30549 (function_info::build_info::def_stack): Likewise.
30550 (function_info::build_info::old_def_stack_limit): Likewise.
30551 * rtl-ssa/internals.inl (function_info::build_info::record_reg_def):
30552 Remove the regno argument. Push the previous definition onto the
30553 definition stack where necessary.
30554 * rtl-ssa/accesses.cc: Include internals.h.
30555 * rtl-ssa/changes.cc: Likewise.
30556 * rtl-ssa/blocks.cc: Likewise.
30557 (function_info::build_info::build_info): Define.
30558 (function_info::build_info::~build_info): Likewise.
30559 (function_info::bb_walker): New class.
30560 (function_info::bb_walker::bb_walker): Define.
30561 (function_info::add_live_out_use): Convert a logarithmic-complexity
30562 test into a linear one. Allow the same definition to be passed
30564 (function_info::calculate_potential_phi_regs): Moved from
30565 functions.cc. Take a build_info parameter and store the
30566 information there instead.
30567 (function_info::place_phis): New function.
30568 (function_info::add_entry_block_defs): Update call to record_reg_def.
30569 (function_info::calculate_ebb_live_in_for_debug): New function.
30570 (function_info::add_phi_nodes): Use bb_phis to decide which
30571 registers need phi nodes and initialize ebb_def_regs accordingly.
30572 Do not add degenerate phis here.
30573 (function_info::add_artificial_accesses): Use create_reg_use.
30574 Assert that all definitions are listed in the DF LR sets.
30575 Update call to record_reg_def.
30576 (function_info::record_block_live_out): Record live-out register
30577 values in the phis of successor blocks. Use the live-out set
30578 when processing the last block in an EBB, instead of always
30579 using the live-in sets of successor blocks. AND the live sets
30580 with the set of registers that have been defined in the EBB,
30581 rather than with all potential phi registers. Cope correctly
30582 with branches back to the start of the current EBB.
30583 (function_info::start_block): New function.
30584 (function_info::end_block): Likewise.
30585 (function_info::populate_phi_inputs): Likewise.
30586 (function_info::create_ebbs): Likewise.
30587 (function_info::process_all_blocks): Rewrite into a multi-phase
30589 * rtl-ssa/functions.cc: Include internals.h.
30590 (function_info::calculate_potential_phi_regs): Move to blocks.cc.
30591 (function_info::init_function_data): Remove caller.
30592 * rtl-ssa/insns.cc: Include internals.h
30593 (function_info::create_reg_use): New function. Lazily any
30594 degenerate phis needed by the linear RPO view.
30595 (function_info::record_use): Use create_reg_use. When processing
30596 debug uses, use potential_phi_regs and test it before checking
30597 whether the register is live on entry to the current EBB. Lazily
30598 calculate ebb_live_in_for_debug.
30599 (function_info::record_call_clobbers): Update call to record_reg_def.
30600 (function_info::record_def): Likewise.
30602 2021-02-15 Martin Liska <mliska@suse.cz>
30604 * toplev.c (init_asm_output): Free output of
30605 gen_command_line_string function.
30606 (process_options): Likewise.
30608 2021-02-15 Martin Liska <mliska@suse.cz>
30610 * params.opt: Add 2 missing Param keywords.
30612 2021-02-15 Eric Botcazou <ebotcazou@adacore.com>
30614 * df-core.c (df_worklist_dataflow_doublequeue): Use proper cast.
30616 2021-02-15 Jakub Jelinek <jakub@redhat.com>
30618 PR tree-optimization/99079
30619 * match.pd (A % (pow2pcst << N) -> A & ((pow2pcst << N) - 1)): Remove
30620 useless tree_nop_conversion_p (type, TREE_TYPE (@3)) check. Instead
30621 require both type and TREE_TYPE (@1) to be integral types and either
30622 type having smaller or equal precision, or TREE_TYPE (@1) being
30623 unsigned type, or type being signed type. If TREE_TYPE (@1)
30624 doesn't have wrapping overflow, perform the subtraction of one in
30627 2021-02-14 Jan Hubicka <hubicka@ucw.cz>
30628 Richard Biener <rguether@suse.de>
30631 * ipa-reference.c (ipa_init): Only conditinally initialize
30632 reference_vars_to_consider.
30633 (propagate): Conditionally deninitialize reference_vars_to_consider.
30634 (ipa_reference_write_optimization_summary): Sanity check that
30635 reference_vars_to_consider is not allocated.
30637 2021-02-13 Levy Hsu <admin@levyhsu.com>
30640 * config/riscv/riscv-shorten-memrefs.c (pass_shorten_memrefs): Add
30641 extend parameter to get_si_mem_base_reg declaration.
30642 (get_si_mem_base_reg): Add extend parameter. Set it.
30643 (analyze): Pass extend arg to get_si_mem_base_reg.
30644 (transform): Likewise. Use it when rewriting mems.
30645 * config/riscv/riscv.c (riscv_legitimize_move): Check for subword
30646 loads and emit sign/zero extending load followed by subreg move.
30648 2021-02-13 Jim Wilson <jimw@sifive.com>
30651 * config/riscv/riscv.c (riscv_compressed_lw_address_p): Drop early
30652 exit when !reload_completed. Only perform check for compressed reg
30653 if reload_completed.
30654 (riscv_rtx_costs): In MEM case, when optimizing for size and
30655 shorten memrefs, if not compressible, then increase cost.
30657 2021-02-13 Jakub Jelinek <jakub@redhat.com>
30659 PR rtl-optimization/98439
30660 * recog.c (pass_split_before_regstack::gate): Enable even when
30661 pass_split_before_sched2 is enabled if -fselective-scheduling2 is
30664 2021-02-13 Jakub Jelinek <jakub@redhat.com>
30667 * config/i386/mmx.md (*mmx_pshufd_1): Add a combine splitter for
30668 swap of V2SImode elements in memory into DImode memory rotate by 32.
30670 2021-02-12 Martin Sebor <msebor@redhat.com>
30672 * tree-pretty-print.c (print_generic_expr_to_str): Update comment.
30674 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
30676 * rtl-ssa/accesses.cc (function_info::make_use_available): Use
30677 m_temp_obstack rather than m_obstack to allocate the temporary use.
30679 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
30681 * df-problems.c (df_lr_bb_local_compute): Treat partial definitions
30682 as read-modify operations.
30684 2021-02-12 Richard Biener <rguenther@suse.de>
30686 PR middle-end/38474
30687 * ipa-fnsummary.c (unmodified_parm_1): Only walk when
30688 fbi->aa_walk_budget is bigger than zero. Update
30689 fbi->aa_walk_budget.
30690 (param_change_prob): Likewise.
30691 * ipa-prop.c (detect_type_change_from_memory_writes):
30692 Properly account walk_aliased_vdefs.
30693 (parm_preserved_before_stmt_p): Canonicalize updates.
30694 (parm_ref_data_preserved_p): Likewise.
30695 (parm_ref_data_pass_through_p): Likewise.
30696 (determine_known_aggregate_parts): Account own alias queries.
30698 2021-02-12 Martin Liska <mliska@suse.cz>
30700 * opts-common.c (decode_cmdline_option): Release werror_arg.
30701 * opts.c (gen_producer_string): Release output of
30702 gen_command_line_string.
30704 2021-02-12 Richard Biener <rguenther@suse.de>
30706 PR tree-optimization/38474
30707 * params.opt (-param=max-store-chains-to-track=): New param.
30708 (-param=max-stores-to-track=): Likewise.
30709 * doc/invoke.texi (max-store-chains-to-track): Document.
30710 (max-stores-to-track): Likewise.
30711 * gimple-ssa-store-merging.c (pass_store_merging::m_n_chains):
30713 (pass_store_merging::m_n_stores): Likewise.
30714 (pass_store_merging::terminate_and_process_chain): Update
30715 m_n_stores and m_n_chains.
30716 (pass_store_merging::process_store): Likewise. Terminate
30717 oldest chains if the number of stores or chains get too large.
30718 (imm_store_chain_info::terminate_and_process_chain): Dump
30721 2021-02-11 Eric Botcazou <ebotcazou@adacore.com>
30723 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
30724 the cold section, emit a nop before the directive if the previous
30725 active instruction can throw.
30727 2021-02-11 Peter Bergner <bergner@linux.ibm.com>
30730 * config/rs6000/predicates.md (mma_assemble_input_operand): Restrict
30731 memory addresses that are legal for quad word accesses.
30733 2021-02-11 Andrea Corallo <andrea.corallo@arm.com>
30736 * config/arm/thumb2.md (*doloop_end_internal): Generate
30737 alternative sequence to handle long range branches.
30739 2021-02-11 Joel Hutton <joel.hutton@arm.com>
30741 PR tree-optimization/98772
30742 * optabs-tree.c (supportable_half_widening_operation): New function
30743 to check for supportable V8QI->V8HI widening patterns.
30744 * optabs-tree.h (supportable_half_widening_operation): New function.
30745 * tree-vect-stmts.c (vect_create_half_widening_stmts): New function
30746 to create promotion stmts for V8QI->V8HI widening patterns.
30747 (vectorizable_conversion): Add case for V8QI->V8HI.
30749 2021-02-11 Richard Biener <rguenther@suse.de>
30751 * sparseset.h (SPARSESET_ELT_BITS): Remove.
30752 (SPARSESET_ELT_TYPE): Use unsigned int.
30753 * fwprop.c: Do not include sparseset.h.
30755 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30758 * varasm.c (declare_weak): For -fsyntax-only, allow even
30759 TREE_ASM_WRITTEN function decls.
30761 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30764 * config/i386/sse.md (fix<fixunssuffix>_truncv2sfv2di2,
30765 <insn>v8qiv8hi2, <insn>v8qiv8si2, <insn>v4qiv4si2, <insn>v4hiv4si2,
30766 <insn>v8qiv8di2, <insn>v4qiv4di2, <insn>v2qiv2di2, <insn>v4hiv4di2,
30767 <insn>v2hiv2di2, <insn>v2siv2di2): Force operands[1] into REG before
30768 calling simplify_gen_subreg on it.
30770 2021-02-10 Martin Liska <mliska@suse.cz>
30772 * config/nvptx/nvptx.c (nvptx_option_override): Use
30773 flag_patchable_function_entry instead of the removed
30774 function_entry_patch_area_size.
30776 2021-02-10 Martin Liska <mliska@suse.cz>
30778 PR tree-optimization/99002
30779 PR tree-optimization/99026
30780 * gimple-if-to-switch.cc (if_chain::is_beneficial): Fix memory
30781 leak when adjacent cases are merged.
30782 * tree-switch-conversion.c (switch_decision_tree::analyze_switch_statement): Use
30784 (make_pass_lower_switch): Remove trailing whitespace.
30785 * tree-switch-conversion.h (release_clusters): New.
30787 2021-02-10 Richard Biener <rguenther@suse.de>
30789 PR rtl-optimization/99054
30790 * cfgrtl.c (rtl-optimization/99054): Return an auto_vec.
30791 (fixup_partitions): Adjust.
30792 (rtl_verify_edges): Likewise.
30794 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30796 PR middle-end/99007
30797 * gimplify.c (gimplify_scan_omp_clauses): For MEM_REF on reductions,
30798 temporarily disable gimplify_ctxp->into_ssa around gimplify_expr
30801 2021-02-10 Richard Biener <rguenther@suse.de>
30804 * ipa-pure-const.c (propagate_malloc): Use an auto_vec<>
30807 2021-02-10 Richard Biener <rguenther@suse.de>
30809 PR tree-optimization/99024
30810 * tree-vect-loop.c (_loop_vec_info::~_loop_vec_info): Only
30811 clear loop->aux if it is associated with the destroyed loop_vinfo.
30813 2021-02-10 Martin Liska <mliska@suse.cz>
30815 PR tree-optimization/99002
30816 * gimple-if-to-switch.cc (find_conditions): Fix memory leak
30819 2021-02-10 Martin Liska <mliska@suse.cz>
30822 * ipa-icf.c (sem_item::add_reference): Fix memory leak when
30823 a reference exists.
30825 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30828 * dwarf2out.c (prune_unused_types_walk): Mark DW_TAG_variable DIEs
30829 at class scope for DWARF5+.
30831 2021-02-09 Eric Botcazou <ebotcazou@adacore.com>
30833 PR rtl-optimization/96015
30834 * reorg.c (skip_consecutive_labels): Minor comment tweaks.
30835 (relax_delay_slots): When deleting a jump to the next active
30836 instruction over a barrier, first delete the barrier if the
30837 jump is the only way to reach the target label.
30839 2021-02-09 Andre Vieira <andre.simoesdiasvieira@arm.com>
30841 * config/aarch64/aarch64-cost-tables.h: Add entries for vect.mul.
30842 * config/aarch64/aarch64.c (aarch64_rtx_mult_cost): Use vect.mul for
30843 vector multiplies and vect.alu for SSRA.
30844 * config/arm/aarch-common-protos.h (struct vector_cost_table): Define
30845 vect.mul cost field.
30846 * config/arm/aarch-cost-tables.h: Add entries for vect.mul.
30847 * config/arm/arm.c: Likewise.
30849 2021-02-09 Richard Biener <rguenther@suse.de>
30851 PR tree-optimization/98863
30852 * tree-ssa-sccvn.h (vn_avail::next_undo): Add.
30853 * tree-ssa-sccvn.c (last_pushed_avail): New global.
30854 (rpo_elim::eliminate_push_avail): Chain pushed avails.
30855 (unwind_state::avail_top): Add.
30856 (do_unwind): Rewrite unwinding of avail entries.
30857 (do_rpo_vn): Initialize last_pushed_avail and
30858 avail_top of the undo state.
30860 2021-02-09 Jakub Jelinek <jakub@redhat.com>
30862 PR middle-end/99004
30863 * calls.c (maybe_warn_rdwr_sizes): Change s0 and s1 type from
30864 const char * to char * and free those pointers after use.
30866 2021-02-09 Richard Biener <rguenther@suse.de>
30868 PR tree-optimization/99017
30869 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Allow
30870 zero vector cost entries.
30872 2021-02-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
30874 PR middle-end/98974
30875 * tree-vect-stmts.c (vectorizable_condition): Remove shadow vec_num
30876 parameter in vectorizable_condition.
30878 2021-02-08 Richard Biener <rguenther@suse.de>
30881 * tree.c (walk_tree_1): Walk VECTOR_CST elements.
30883 2021-02-08 Martin Liska <mliska@suse.cz>
30886 * cfgexpand.c (pass_expand::execute): Parse per-function option
30887 flag_patchable_function_entry and use it.
30888 * common.opt: Remove function_entry_patch_area_size and
30889 function_entry_patch_area_start global variables.
30890 * opts.c (parse_and_check_patch_area): New function.
30891 (common_handle_option): Use it.
30892 * opts.h (parse_and_check_patch_area): New function.
30893 * toplev.c (process_options): Parse and use
30894 function_entry_patch_area_size.
30896 2021-02-08 Martin Sebor <msebor@redhat.com>
30898 * doc/extend.texi (attribute malloc): Correct typos.
30900 2021-02-05 Nathan Sidwell <nathan@acm.org>
30903 * gcc.c (driver::maybe_run_linker): Check for input file
30904 accessibility if not linking.
30906 2021-02-05 Richard Biener <rguenther@suse.de>
30908 PR tree-optimization/98855
30909 * tree-vectorizer.h (add_stmt_cost): New overload.
30910 * tree-vect-slp.c (li_cost_vec_cmp): New.
30911 (vect_bb_slp_scalar_cost): Cost individual loop regions
30912 separately. Account for the scalar instance root stmt.
30914 2021-02-05 Tom de Vries <tdevries@suse.de>
30917 * tree-switch-conversion.c (jump_table_cluster::emit): Add loc
30919 (bit_test_cluster::emit): Reuse location_t for newly created
30921 (switch_decision_tree::try_switch_expansion): Preserve
30923 * tree-switch-conversion.h: Change function signatures.
30925 2021-02-05 Jakub Jelinek <jakub@redhat.com>
30928 * config/i386/i386-options.c (m_NONE, m_ALL): Define.
30929 * config/i386/x86-tune.def (X86_TUNE_BRANCH_PREDICTION_HINTS,
30930 X86_TUNE_PROMOTE_QI_REGS): Use m_NONE instead of 0U.
30931 (X86_TUNE_QIMODE_MATH): Use m_ALL instead of ~0U.
30933 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30935 * config/aarch64/aarch64-simd-builtins.def (get_high): Define builtin.
30936 * config/aarch64/aarch64-simd.md (aarch64_get_high<mode>): Define.
30937 * config/aarch64/arm_neon.h (__GET_HIGH): Delete.
30938 (vget_high_f16): Reimplement using new builtin.
30939 (vget_high_f32): Likewise.
30940 (vget_high_f64): Likewise.
30941 (vget_high_p8): Likewise.
30942 (vget_high_p16): Likewise.
30943 (vget_high_p64): Likewise.
30944 (vget_high_s8): Likewise.
30945 (vget_high_s16): Likewise.
30946 (vget_high_s32): Likewise.
30947 (vget_high_s64): Likewise.
30948 (vget_high_u8): Likewise.
30949 (vget_high_u16): Likewise.
30950 (vget_high_u32): Likewise.
30951 (vget_high_u64): Likewise.
30953 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30955 * config/aarch64/aarch64-simd-builtins.def (get_low): Define builtin.
30956 * config/aarch64/aarch64-simd.md (aarch64_get_low<mode>): Define.
30957 * config/aarch64/arm_neon.h (__GET_LOW): Delete.
30958 (vget_low_f16): Reimplement using new builtin.
30959 (vget_low_f32): Likewise.
30960 (vget_low_f64): Likewise.
30961 (vget_low_p8): Likewise.
30962 (vget_low_p16): Likewise.
30963 (vget_low_p64): Likewise.
30964 (vget_low_s8): Likewise.
30965 (vget_low_s16): Likewise.
30966 (vget_low_s32): Likewise.
30967 (vget_low_s64): Likewise.
30968 (vget_low_u8): Likewise.
30969 (vget_low_u16): Likewise.
30970 (vget_low_u32): Likewise.
30971 (vget_low_u64): Likewise.
30973 2021-02-05 Kito Cheng <kito.cheng@sifive.com>
30975 * gcc.c (print_multilib_info): Check all required argument is provided
30978 2021-02-05 liuhongt <hongtao.liu@intel.com>
30981 * config/i386/i386-expand.c (ix86_expand_sse_cmp): Don't
30982 generate integer mask comparison for 128/256-bits vector when
30983 op_true/op_false is NULL_RTX or CONSTM1_RTX/CONST0_RTX. Also
30984 delete redundant !maskcmp condition.
30985 (ix86_expand_int_vec_cmp): Ditto but no redundant deletion
30987 (ix86_expand_sse_movcc): Delete definition of maskcmp, add the
30988 condition directly to if (maskcmp), add extra check for
30989 cmpmode, it should be MODE_INT.
30990 (ix86_expand_fp_vec_cmp): Pass NULL to ix86_expand_sse_cmp's
30991 parameters op_true/op_false.
30992 (ix86_use_mask_cmp_p): New.
30994 2021-02-05 liuhongt <hongtao.liu@intel.com>
30997 * config/i386/x86-tune.def (X86_TUNE_AVX256_UNALIGNED_LOAD_OPTIMAL):
30998 Remove m_GENERIC from ~list.
30999 (X86_TUNE_AVX256_UNALIGNED_STORE_OPTIMAL): Ditto.
31001 2021-02-04 David Malcolm <dmalcolm@redhat.com>
31004 * diagnostic-show-locus.c (compatible_locations_p): Require
31005 locations in the same macro map to be either both from the
31006 macro definition, or both from the macro arguments.
31008 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
31010 * config/aarch64/aarch64-simd-builtins.def: Add
31011 [su]mull_hi_lane[q] builtin generator macros.
31012 * config/aarch64/aarch64-simd.md
31013 (aarch64_<su>mull_hi_lane<mode>_insn): Define.
31014 (aarch64_<su>mull_hi_lane<mode>): Define.
31015 (aarch64_<su>mull_hi_laneq<mode>_insn): Define.
31016 (aarch64_<su>mull_hi_laneq<mode>): Define.
31017 * config/aarch64/arm_neon.h (vmull_high_lane_s16): Use RTL
31018 builtin instead of inline asm.
31019 (vmull_high_lane_s32): Likewise.
31020 (vmull_high_lane_u16): Likewise.
31021 (vmull_high_lane_u32): Likewise.
31022 (vmull_high_laneq_s16): Likewise.
31023 (vmull_high_laneq_s32): Likewise.
31024 (vmull_high_laneq_u16): Likewise.
31025 (vmull_high_laneq_u32): Liekwise.
31027 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
31029 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_hi_n
31030 builtin generator macros.
31031 * config/aarch64/aarch64-simd.md
31032 (aarch64_<su>mull_hi_n<mode>_insn): Define.
31033 (aarch64_<su>mull_hi_n<mode>): Define.
31034 * config/aarch64/arm_neon.h (vmull_high_n_s16): Use RTL builtin
31035 instead of inline asm.
31036 (vmull_high_n_s32): Likewise.
31037 (vmull_high_n_u16): Likewise.
31038 (vmull_high_n_u32): Likewise.
31040 2021-02-04 Richard Biener <rguenther@suse.de>
31042 PR tree-optimization/98855
31043 * tree-vect-loop.c (vectorizable_phi): Do not cost
31044 single-argument PHIs.
31045 * tree-vect-slp.c (vect_bb_slp_scalar_cost): Likewise.
31046 * tree-vect-stmts.c (vectorizable_bswap): Also perform
31047 costing for SLP operation.
31049 2021-02-04 Martin Liska <mliska@suse.cz>
31051 * doc/extend.texi: Mention -mprefer-vector-width in target
31054 2021-02-03 Martin Sebor <msebor@redhat.com>
31056 PR tree-optimization/98937
31057 * tree-ssa-strlen.c (strlen_dom_walker::~strlen_dom_walker): Define.
31058 Flush pointer_query cache.
31060 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
31062 * config/rs6000/genfusion.pl (gen_2logical): Add missing
31063 fixes based on patch review.
31064 * config/rs6000/fusion.md: Regenerate file.
31066 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
31068 * config/rs6000/t-rs6000: Comment out auto generation of
31071 2021-02-03 Andrew Stubbs <ams@codesourcery.com>
31073 * config/gcn/gcn-opts.h (enum processor_type): Add PROCESSOR_GFX908.
31074 * config/gcn/gcn.c (gcn_omp_device_kind_arch_isa): Add gfx908.
31075 (output_file_start): Add gfx908.
31076 * config/gcn/gcn.opt (gpu_type): Add gfx908.
31077 * config/gcn/t-gcn-hsa (MULTILIB_OPTIONS): Add march=gfx908.
31078 (MULTILIB_DIRNAMES): Add gfx908.
31079 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): New define.
31080 (main): Recognize gfx908.
31081 * config/gcn/t-omp-device: Add gfx908.
31083 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31085 * config/aarch64/aarch64-simd-builtins.def: Add
31086 [su]mlsl_hi_lane[q] builtin macro generators.
31087 * config/aarch64/aarch64-simd.md
31088 (aarch64_<su>mlsl_hi_lane<mode>_insn): Define.
31089 (aarch64_<su>mlsl_hi_lane<mode>): Define.
31090 (aarch64_<su>mlsl_hi_laneq<mode>_insn): Define.
31091 (aarch64_<su>mlsl_hi_laneq<mode>): Define.
31092 * config/aarch64/arm_neon.h (vmlsl_high_lane_s16): Use RTL
31093 builtin instead of inline asm.
31094 (vmlsl_high_lane_s32): Likewise.
31095 (vmlsl_high_lane_u16): Likewise.
31096 (vmlsl_high_lane_u32): Likewise.
31097 (vmlsl_high_laneq_s16): Likewise.
31098 (vmlsl_high_laneq_s32): Likewise.
31099 (vmlsl_high_laneq_u16): Likewise.
31100 (vmlsl_high_laneq_u32): Likewise.
31101 (vmlal_high_laneq_u32): Likewise.
31103 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31105 * config/aarch64/aarch64-simd-builtins.def: Add
31106 [su]mlal_hi_lane[q] builtin generator macros.
31107 * config/aarch64/aarch64-simd.md
31108 (aarch64_<su>mlal_hi_lane<mode>_insn): Define.
31109 (aarch64_<su>mlal_hi_lane<mode>): Define.
31110 (aarch64_<su>mlal_hi_laneq<mode>_insn): Define.
31111 (aarch64_<su>mlal_hi_laneq<mode>): Define.
31112 * config/aarch64/arm_neon.h (vmlal_high_lane_s16): Use RTL
31113 builtin instead of inline asm.
31114 (vmlal_high_lane_s32): Likewise.
31115 (vmlal_high_lane_u16): Likewise.
31116 (vmlal_high_lane_u32): Likewise.
31117 (vmlal_high_laneq_s16): Likewise.
31118 (vmlal_high_laneq_s32): Likewise.
31119 (vmlal_high_laneq_u16): Likewise.
31120 (vmlal_high_laneq_u32): Likewise.
31122 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31124 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_hi_n
31125 builtin generator macros.
31126 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_hi_n<mode>_insn):
31128 (aarch64_<su>mlsl_hi_n<mode>): Define.
31129 * config/aarch64/arm_neon.h (vmlsl_high_n_s16): Use RTL builtin
31130 instead of inline asm.
31131 (vmlsl_high_n_s32): Likewise.
31132 (vmlsl_high_n_u16): Likewise.
31133 (vmlsl_high_n_u32): Likewise.
31135 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31137 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_hi_n
31138 builtin generator macros.
31139 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_hi_n<mode>_insn):
31141 (aarch64_<su>mlal_hi_n<mode>): Define.
31142 * config/aarch64/arm_neon.h (vmlal_high_n_s16): Use RTL builtin
31143 instead of inline asm.
31144 (vmlal_high_n_s32): Likewise.
31145 (vmlal_high_n_u16): Likewise.
31146 (vmlal_high_n_u32): Likewise.
31148 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31150 * config/aarch64/aarch64-simd-builtins.def: Add RTL builtin
31152 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal_hi<mode>):
31154 (aarch64_<su>mlal_hi<mode>_insn): This.
31155 (aarch64_<su>mlal_hi<mode>): Define.
31156 * config/aarch64/arm_neon.h (vmlal_high_s8): Use RTL builtin
31157 instead of inline asm.
31158 (vmlal_high_s16): Likewise.
31159 (vmlal_high_s32): Likewise.
31160 (vmlal_high_u8): Likewise.
31161 (vmlal_high_u16): Likewise.
31162 (vmlal_high_u32): Likewise.
31164 2021-02-03 Ilya Leoshkevich <iii@linux.ibm.com>
31166 * lra-spills.c (remove_pseudos): Call lra_update_insn_recog_data()
31167 after calling alter_subreg() on a (mem).
31169 2021-02-03 Martin Liska <mliska@suse.cz>
31172 * lto-streamer-out.c (produce_lto_section): Fill up missing
31174 * lto-streamer.h (struct lto_section): Add _padding field.
31176 2021-02-03 Richard Biener <rguenther@suse.de>
31178 * lto-streamer.c (lto_get_section_name): Free temporary
31180 * tree-loop-distribution.c
31181 (loop_distribution::merge_dep_scc_partitions): Free edge data.
31183 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31185 PR middle-end/97487
31186 * ifcvt.c (noce_can_force_operand): New function.
31187 (noce_emit_move_insn): Use it.
31188 (noce_try_sign_mask): Likewise. Formatting fix.
31190 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31192 PR middle-end/97971
31193 * lra-constraints.c (process_alt_operands): For inline asm, don't call
31194 fatal_insn, but instead return false.
31196 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31198 PR tree-optimization/98287
31199 * config/i386/mmx.md (<insn><mode>3): For shifts don't enable expander
31202 2021-02-03 Tamar Christina <tamar.christina@arm.com>
31204 PR tree-optimization/98928
31205 * tree-vect-loop.c (vect_analyze_loop_2): Change
31206 STMT_VINFO_SLP_VECT_ONLY to STMT_VINFO_SLP_VECT_ONLY_PATTERN.
31207 * tree-vect-slp-patterns.c (complex_pattern::build): Likewise.
31208 * tree-vectorizer.h (STMT_VINFO_SLP_VECT_ONLY_PATTERN): New.
31209 (class _stmt_vec_info): Add slp_vect_pattern_only_p.
31211 2021-02-02 Richard Biener <rguenther@suse.de>
31213 * gimple-loop-interchange.cc (prepare_data_references):
31215 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
31216 * tree-ssa-loop-im.c (hoist_memory_references): Likewise.
31217 * tree-vect-stmts.c (vectorizable_condition): Do not
31219 (vectorizable_comparison): Likewise.
31221 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31223 * config/aarch64/aarch64-simd-builtins.def (ursqrte): Define builtin.
31224 * config/aarch64/aarch64-simd.md (aarch64_ursqrte<mode>): New pattern.
31225 * config/aarch64/arm_neon.h (vrsqrte_u32): Reimplement using builtin.
31226 (vrsqrteq_u32): Likewise.
31228 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31230 * config/aarch64/aarch64-simd-builtins.def (sqxtun2): Define builtin.
31231 * config/aarch64/aarch64-simd.md (aarch64_sqxtun2<mode>_le): Define.
31232 (aarch64_sqxtun2<mode>_be): Likewise.
31233 (aarch64_sqxtun2<mode>): Likewise.
31234 * config/aarch64/arm_neon.h (vqmovun_high_s16): Reimplement using builtin.
31235 (vqmovun_high_s32): Likewise.
31236 (vqmovun_high_s64): Likewise.
31237 * config/aarch64/iterators.md (UNSPEC_SQXTUN2): Define.
31239 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31241 * config/aarch64/aarch64-simd-builtins.def (bfdot_lane, bfdot_laneq): Use
31243 (bfmlalb_lane, bfmlalt_lane, bfmlalb_lane_q, bfmlalt_lane_q): Use FP flags.
31245 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31247 * config/aarch64/aarch64-simd-builtins.def (fcmla_lane0, fcmla_lane90,
31248 fcmla_lane180, fcmla_lane270, fcmlaq_lane0, fcmlaq_lane90, fcmlaq_lane180,
31249 fcmlaq_lane270, scvtf, ucvtf, fcvtzs, fcvtzu, scvtfsi, scvtfdi, ucvtfsi,
31250 ucvtfdi, fcvtzshf, fcvtzuhf, fmlal_lane_low, fmlsl_lane_low,
31251 fmlal_laneq_low, fmlsl_laneq_low, fmlalq_lane_low, fmlslq_lane_low,
31252 fmlalq_laneq_low, fmlslq_laneq_low, fmlal_lane_high, fmlsl_lane_high,
31253 fmlal_laneq_high, fmlsl_laneq_high, fmlalq_lane_high, fmlslq_lane_high,
31254 fmlalq_laneq_high, fmlslq_laneq_high): Use FP flags.
31256 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31258 * config/aarch64/aarch64-builtins.c (FLAG_LOAD): Define.
31259 * config/aarch64/aarch64-simd-builtins.def (ld1x2, ld2, ld3, ld4, ld2r,
31260 ld3r, ld4r, ld1, ld1x3, ld1x4): Use LOAD flags.
31262 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31264 * config/aarch64/aarch64-simd-builtins.def (combine, zip1, zip2,
31265 uzp1, uzp2, trn1, trn2, simd_bsl): Use AUTO_FP flags.
31267 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31269 * config/aarch64/aarch64-simd-builtins.def (clrsb, clz, ctz, popcount,
31270 vec_smult_lane_, vec_smlal_lane_, vec_smult_laneq_, vec_smlal_laneq_,
31271 vec_umult_lane_, vec_umlal_lane_, vec_umult_laneq_, vec_umlal_laneq_,
31272 ashl, sshl, ushl, srshl, urshl, sdot_lane, udot_lane, sdot_laneq,
31273 udot_laneq, usdot_lane, usdot_laneq, sudot_lane, sudot_laneq, ashr,
31274 ashr_simd, lshr, lshr_simd, srshr_n, urshr_n, ssra_n, usra_n, srsra_n,
31275 ursra_n, sshll_n, ushll_n, sshll2_n, ushll2_n, ssri_n, usri_n, ssli_n,
31276 ssli_n, usli_n, bswap, rbit, simd_bsl, eor3q, rax1q, xarq, bcaxq): Use
31277 NONE builtin flags.
31279 2021-02-02 Jakub Jelinek <jakub@redhat.com>
31281 PR tree-optimization/98848
31282 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Punt if
31283 STMT_VINFO_DEF_TYPE (last_stmt_info) is vect_reduction_def.
31285 2021-02-02 Kito Cheng <kito.cheng@sifive.com>
31288 * expr.c: Check mode before calling store_expr.
31290 2021-02-02 Christophe Lyon <christophe.lyon@linaro.org>
31292 * config/arm/iterators.md (supf): Remove VORNQ_S and VORNQ_U.
31294 * config/arm/mve.md (mve_vornq_s<mode>): New entry for vorn
31295 instruction using expression ior.
31296 (mve_vornq_u<mode>): New expander.
31297 (mve_vornq_f<mode>): Use ior code instead of unspec.
31298 * config/arm/unspecs.md (VORNQ_S, VORNQ_U, VORNQ_F): Remove.
31300 2021-02-02 Alexandre Oliva <oliva@adacore.com>
31302 * tree-nested.c (convert_nonlocal_reference_op): Move
31303 current_function_decl restore after re-gimplification.
31304 (convert_local_reference_op): Likewise.
31306 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31308 * config/aarch64/aarch64-simd-builtins.def (rshrn, rshrn2):
31310 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>_insn_le):
31312 (aarch64_rshrn<mode>_insn_be): Likewise.
31313 (aarch64_rshrn<mode>): Likewise.
31314 (aarch64_rshrn2<mode>_insn_le): Likewise.
31315 (aarch64_rshrn2<mode>_insn_be): Likewise.
31316 (aarch64_rshrn2<mode>): Likewise.
31317 * config/aarch64/aarch64.md (unspec): Add UNSPEC_RSHRN.
31318 * config/aarch64/arm_neon.h (vrshrn_high_n_s16): Reimplement
31320 (vrshrn_high_n_s32): Likewise.
31321 (vrshrn_high_n_s64): Likewise.
31322 (vrshrn_high_n_u16): Likewise.
31323 (vrshrn_high_n_u32): Likewise.
31324 (vrshrn_high_n_u64): Likewise.
31325 (vrshrn_n_s16): Likewise.
31326 (vrshrn_n_s32): Likewise.
31327 (vrshrn_n_s64): Likewise.
31328 (vrshrn_n_u16): Likewise.
31329 (vrshrn_n_u32): Likewise.
31330 (vrshrn_n_u64): Likewise.
31332 2021-02-01 Sergei Trofimovich <siarheit@google.com>
31334 PR tree-optimization/98499
31335 * ipa-modref.c (analyze_ssa_name_flags): treat RVO
31336 conservatively and assume all possible side-effects.
31338 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31340 * config/aarch64/aarch64-simd-builtins.def (vec_unpacks_hi,
31341 vec_unpacku_hi_): Define builtins.
31342 * config/aarch64/arm_neon.h (vmovl_high_s8): Reimplement using
31344 (vmovl_high_s16): Likewise.
31345 (vmovl_high_s32): Likewise.
31346 (vmovl_high_u8): Likewise.
31347 (vmovl_high_u16): Likewise.
31348 (vmovl_high_u32): Likewise.
31350 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31352 * config/aarch64/aarch64-simd-builtins.def (sabdl, uabdl):
31354 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl<mode>): New
31356 * config/aarch64/aarch64.md (unspec): Define UNSPEC_SABDL,
31358 * config/aarch64/arm_neon.h (vabdl_s8): Reimplemet using
31360 (vabdl_s16): Likewise.
31361 (vabdl_s32): Likewise.
31362 (vabdl_u8): Likewise.
31363 (vabdl_u16): Likewise.
31364 (vabdl_u32): Likewise.
31365 * config/aarch64/iterators.md (ABDL): New int iterator.
31366 (sur): Handle UNSPEC_SABDL, UNSPEC_UABDL.
31368 2021-02-01 Martin Sebor <msebor@redhat.com>
31370 * tree.h (BLOCK_VARS): Add comment.
31371 (BLOCK_SUBBLOCKS): Same.
31372 (BLOCK_SUPERCONTEXT): Same.
31373 (BLOCK_ABSTRACT_ORIGIN): Same.
31374 (inlined_function_outer_scope_p): Same.
31376 2021-02-01 Martin Sebor <msebor@redhat.com>
31378 PR middle-end/97172
31379 * attribs.c (attr_access::free_lang_data): Define new function.
31380 * attribs.h (attr_access::free_lang_data): Declare new function.
31382 2021-02-01 Richard Biener <rguenther@suse.de>
31384 * vec.h (auto_vec::auto_vec): Add memory stat parameters
31386 * bitmap.h (auto_bitmap::auto_bitmap): Likewise.
31388 2021-02-01 Tamar Christina <tamar.christina@arm.com>
31390 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>,
31391 aarch64_<su>mlsl<mode>, aarch64_<su>mlsl_n<mode>): Flip mult operands.
31393 2021-02-01 Richard Biener <rguenther@suse.de>
31395 PR rtl-optimization/98863
31396 * config/i386/i386-features.c (convert_scalars_to_vector):
31397 Set DF_RD_PRUNE_DEAD_DEFS.
31399 2021-01-31 Eric Botcazou <ebotcazou@adacore.com>
31401 * system.h (SIZE_MAX): Define if not already defined.
31403 2021-01-30 Aaron Sawdey <acsawdey@linux.ibm.com>
31405 * config/rs6000/genfusion.pl (gen_2logical): New function to
31406 generate patterns for logical-logical fusion.
31407 * config/rs6000/fusion.md: Regenerated patterns.
31408 * config/rs6000/rs6000-cpus.def: Add
31409 OPTION_MASK_P10_FUSION_2LOGICAL.
31410 * config/rs6000/rs6000.c (rs6000_option_override_internal):
31411 Enable logical-logical fusion for p10.
31412 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2logical.
31414 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
31416 * config/rs6000/rs6000.opt: Add periods to new AIX options.
31418 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
31420 * config/rs6000/rs6000.opt (mabi=vec-extabi): New.
31421 (mabi=vec-default): New.
31422 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
31423 __EXTABI__ for AIX Vector extended ABI.
31424 * config/rs6000/rs6000.c (rs6000_debug_reg_global): Print AIX Vector
31426 (conditional_register_usage): If AIX vec_extabi enabled, vs20-vs31
31428 * doc/invoke.texi (PowerPC mabi): Add AIX vec-extabi and vec-default.
31430 2021-01-30 Jakub Jelinek <jakub@redhat.com>
31432 * config/i386/i386-features.c (remove_partial_avx_dependency): Clear
31433 DF_DEFER_INSN_RESCAN after calling df_process_deferred_rescans.
31435 2021-01-29 Vladimir N. Makarov <vmakarov@redhat.com>
31438 * lra-constraints.c (in_class_p): Don't narrow class only for REG
31441 2021-01-29 Will Schmidt <will_schmidt@vnet.ibm.com>
31443 * config/rs6000/rs6000-call.c (rs6000_expand_binup_builtin): Add
31444 clauses for CODE_FOR_vsx_xvcvuxddp_scale and
31445 CODE_FOR_vsx_xvcvsxddp_scale to the parameter checking code.
31447 2021-01-29 Andrew MacLeod <amacleod@redhat.com>
31449 PR tree-optimization/98866
31450 * gimple-range-gori.h (gori_compute:set_range_invariant): New.
31451 * gimple-range-gori.cc (gori_map::set_range_invariant): New.
31452 (gori_map::m_maybe_invariant): Rename from all_outgoing.
31453 (gori_map::gori_map): Rename all_outgoing to m_maybe_invariant.
31454 (gori_map::is_export_p): Ditto.
31455 (gori_map::calculate_gori): Ditto.
31456 (gori_compute::set_range_invariant): New.
31457 * gimple-range.cc (gimple_ranger::range_of_stmt): Set range
31458 invariant for pointers evaluating to [1, +INF].
31460 2021-01-29 Richard Biener <rguenther@suse.de>
31462 PR rtl-optimization/98863
31463 * config/i386/i386-features.c (remove_partial_avx_dependency):
31464 Do not perform DF analysis.
31465 (pass_data_remove_partial_avx_dependency): Remove
31468 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
31470 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_n
31471 builtin generator macros.
31472 * config/aarch64/aarch64-simd.md (aarch64_<su>mull_n<mode>):
31474 * config/aarch64/arm_neon.h (vmull_n_s16): Use RTL builtin
31475 instead of inline asm.
31476 (vmull_n_s32): Likewise.
31477 (vmull_n_u16): Likewise.
31478 (vmull_n_u32): Likewise.
31480 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31482 * config/aarch64/aarch64-simd-builtins.def (sabdl2, uabdl2):
31484 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl2<mode>_3):
31486 (aarch64_<sur>abdl2<mode>): ... This.
31487 (<sur>sadv16qi): Adjust use of above.
31488 * config/aarch64/arm_neon.h (vabdl_high_s8): Reimplement using
31490 (vabdl_high_s16): Likewise.
31491 (vabdl_high_s32): Likewise.
31492 (vabdl_high_u8): Likewise.
31493 (vabdl_high_u16): Likewise.
31494 (vabdl_high_u32): Likewise.
31496 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31498 * config/aarch64/aarch64-simd-builtins.def (sabal2): Define
31500 (uabal2): Likewise.
31501 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal2<mode>): New
31503 * config/aarch64/aarch64.md (unspec): Add UNSPEC_SABAL2 and
31505 * config/aarch64/arm_neon.h (vabal_high_s8): Reimplement using
31507 (vabal_high_s16): Likewise.
31508 (vabal_high_s32): Likewise.
31509 (vabal_high_u8): Likewise.
31510 (vabal_high_u16): Likewise.
31511 (vabal_high_u32): Likewise.
31512 * config/aarch64/iterators.md (ABAL2): New mode iterator.
31513 (sur): Handle UNSPEC_SABAL2, UNSPEC_UABAL2.
31515 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31517 * config/aarch64/aarch64-simd-builtins.def (sabal): Define
31520 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal<mode>_4):
31522 (aarch64_<sur>abal<mode>): ... This
31523 (<sur>sadv16qi): Adust use of the above.
31524 * config/aarch64/arm_neon.h (vabal_s8): Reimplement using
31526 (vabal_s16): Likewise.
31527 (vabal_s32): Likewise.
31528 (vabal_u8): Likewise.
31529 (vabal_u16): Likewise.
31530 (vabal_u32): Likewise.
31532 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31534 * config/aarch64/aarch64-simd-builtins.def (saddlv, uaddlv):
31536 * config/aarch64/aarch64-simd.md (aarch64_<su>addlv<mode>):
31538 * config/aarch64/arm_neon.h (vaddlv_s8): Reimplement using
31540 (vaddlv_s16): Likewise.
31541 (vaddlv_u8): Likewise.
31542 (vaddlv_u16): Likewise.
31543 (vaddlvq_s8): Likewise.
31544 (vaddlvq_s16): Likewise.
31545 (vaddlvq_s32): Likewise.
31546 (vaddlvq_u8): Likewise.
31547 (vaddlvq_u16): Likewise.
31548 (vaddlvq_u32): Likewise.
31549 (vaddlv_s32): Likewise.
31550 (vaddlv_u32): Likewise.
31551 * config/aarch64/iterators.md (VDQV_L): New mode iterator.
31552 (unspec): Add UNSPEC_SADDLV, UNSPEC_UADDLV.
31553 (Vwstype): New mode attribute.
31555 (VWIDE_S): Likewise.
31556 (USADDLV): New int iterator.
31557 (su): Handle UNSPEC_SADDLV, UNSPEC_UADDLV.
31559 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
31561 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_lane[q]
31562 builtin generator macros.
31563 * config/aarch64/aarch64-simd.md (aarch64_vec_<su>mlsl_lane<Qlane>):
31565 * config/aarch64/arm_neon.h (vmlsl_lane_s16): Use RTL builtin
31566 instead of inline asm.
31567 (vmlsl_lane_s32): Likewise.
31568 (vmlsl_lane_u16): Likewise.
31569 (vmlsl_lane_u32): Likewise.
31570 (vmlsl_laneq_s16): Likewise.
31571 (vmlsl_laneq_s32): Likewise.
31572 (vmlsl_laneq_u16): Likewise.
31573 (vmlsl_laneq_u32): Likewise.
31575 2021-01-29 Richard Biener <rguenther@suse.de>
31577 * doc/invoke.texi (--param max-gcse-memory): Document unit
31579 * gcse.c (gcse_or_cprop_is_too_expensive): Adjust.
31580 * params.opt (--param max-gcse-memory): Adjust default and
31581 document unit of size.
31583 2021-01-29 Richard Biener <rguenther@suse.de>
31585 PR rtl-optimization/98863
31586 * gcse.c (gcse_or_cprop_is_too_expensive): Use unsigned
31587 HOST_WIDE_INT for the memory estimate.
31589 2021-01-29 Bin Cheng <bin.cheng@linux.alibaba.com>
31590 Richard Biener <rguenther@suse.de>
31592 PR tree-optimization/97627
31593 * tree-ssa-loop-niter.c (number_of_iterations_exit_assumptions):
31594 Do not analyze fake edges.
31596 2021-01-29 Richard Biener <rguenther@suse.de>
31598 PR rtl-optimization/98144
31599 * df.h (df_mir_bb_info): Add con_visited member.
31600 * df-problems.c (df_mir_alloc): Initialize con_visited,
31601 do not fully populate IN and OUT.
31602 (df_mir_reset): Likewise.
31603 (df_mir_confluence_0): Set con_visited.
31604 (df_mir_confluence_n): Properly handle implicitely
31605 fully populated IN and OUT as designated by con_visited
31606 and update con_visited accordingly.
31608 2021-01-29 Jakub Jelinek <jakub@redhat.com>
31611 * config/arm/vec-common.md (mve_vshlq_<supf><mode>,
31612 vashl<mode>3, vashr<mode>3, vlshr<mode>3): Add
31613 && !TARGET_REALLY_IWMMXT to conditions.
31615 2021-01-29 Jakub Jelinek <jakub@redhat.com>
31618 * cfgbuild.c (find_bb_boundaries): Reset debug_insn when seeing
31621 2021-01-28 Marek Polacek <polacek@redhat.com>
31624 * stor-layout.c (finalize_type_size): If we reset TYPE_USER_ALIGN in
31625 the main variant, maybe reset it in its variants too.
31626 * tree.c (check_base_type): Return true only if TYPE_USER_ALIGN match.
31627 (check_aligned_type): Check if TYPE_USER_ALIGN match.
31629 2021-01-28 Christophe Lyon <christophe.lyon@linaro.org>
31632 * config/arm/arm.c (arm_rtx_costs_internal): Adjust cost of vector
31633 of constant zero for comparisons.
31635 2021-01-28 Michael Meissner <meissner@linux.ibm.com>
31637 * config/rs6000/rs6000.c (rs6000_mangle_decl_assembler_name): Add
31638 support for mapping built-in function names for long double
31639 built-in functions if long double is IEEE 128-bit.
31641 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
31643 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_n
31644 builtin generator macros.
31645 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_n<mode>):
31647 * config/aarch64/arm_neon.h (vmlsl_n_s16): Use RTL builtin
31648 instead of inline asm.
31649 (vmlsl_n_s32): Likewise.
31650 (vmlsl_n_u16): Likewise.
31651 (vmlsl_n_u32): Likewise.
31653 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
31655 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_n
31656 builtin generator macros.
31657 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>):
31659 * config/aarch64/arm_neon.h (vmlal_n_s16): Use RTL builtin
31660 instead of inline asm.
31661 (vmlal_n_s32): Likewise.
31662 (vmlal_n_u16): Likewise.
31663 (vmlal_n_u32): Likewise.
31665 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31667 * config/aarch64/aarch64-simd-builtins.def (shrn2): Define
31669 * config/aarch64/aarch64-simd.md (aarch64_shrn2<mode>_insn_le):
31671 (aarch64_shrn2<mode>_insn_be): Likewise.
31672 (aarch64_shrn2<mode>): Likewise.
31673 * config/aarch64/arm_neon.h (vshrn_high_n_s16): Reimlplement
31675 (vshrn_high_n_s32): Likewise.
31676 (vshrn_high_n_s64): Likewise.
31677 (vshrn_high_n_u16): Likewise.
31678 (vshrn_high_n_u32): Likewise.
31679 (vshrn_high_n_u64): Likewise.
31681 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31683 * config/aarch64/aarch64-simd-builtins.def (shrn): Define
31685 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le):
31687 (aarch64_shrn<mode>_insn_be): Likewise.
31688 (aarch64_shrn<mode>): Likewise.
31689 * config/aarch64/arm_neon.h (vshrn_n_s16): Reimplement using
31691 (vshrn_n_s32): Likewise.
31692 (vshrn_n_s64): Likewise.
31693 (vshrn_n_u16): Likewise.
31694 (vshrn_n_u32): Likewise.
31695 (vshrn_n_u64): Likewise.
31696 * config/aarch64/iterators.md (vn_mode): New mode attribute.
31698 2021-01-28 Richard Biener <rguenther@suse.de>
31700 PR rtl-optimization/80960
31701 * dse.c (check_mem_read_rtx): Call get_addr on the
31704 2021-01-28 Xionghu Luo <luoxhu@linux.ibm.com>
31705 David Edelsohn <dje.gcc@gmail.com>
31708 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31709 Don't generate VIEW_CONVERT_EXPR for fcode ALTIVEC_BUILTIN_VEC_INSERT
31711 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
31713 * config/rs6000/rs6000.c (rs6000_expand_vector_set): Remove the
31714 wrapper call rs6000_expand_vector_set_var for cleanup. Call
31715 rs6000_expand_vector_set_var_p9 and rs6000_expand_vector_set_var_p8
31717 (rs6000_expand_vector_set_var): Delete.
31718 (rs6000_expand_vector_set_var_p9): Make static.
31719 (rs6000_expand_vector_set_var_p8): Make static.
31721 2021-01-28 Xing GUO <higuoxing@gmail.com>
31723 * common/config/riscv/riscv-common.c
31724 (riscv_subset_list::parsing_subset_version): Fix -march option parsing
31725 when `p` extension exists.
31727 2021-01-27 Vladimir N. Makarov <vmakarov@redhat.com>
31729 PR rtl-optimization/97684
31730 * ira.c (ira): Call ira_set_pseudo_classes before
31731 update_equiv_regs when it is necessary.
31733 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31736 * config/aarch64/aarch64.md (*aarch64_bfxilsi_uxtw): Use
31737 %w0, %w1 and %2 instead of %0, %1 and %2.
31739 2021-01-27 Aaron Sawdey <acsawdey@linux.ibm.com>
31741 * config/rs6000/genfusion.pl: New script to generate
31742 define_insn_and_split patterns so combine can arrange fused
31743 instructions next to each other.
31744 * config/rs6000/fusion.md: New file, generated fused instruction
31745 patterns for combine.
31746 * config/rs6000/predicates.md (const_m1_to_1_operand): New predicate.
31747 (non_update_memory_operand): New predicate.
31748 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_P10_FUSION and
31749 OPTION_MASK_P10_FUSION_LD_CMPI to ISA_3_1_MASKS_SERVER and
31751 * config/rs6000/rs6000-protos.h (address_is_non_pfx_d_or_x): Add
31753 * config/rs6000/rs6000.c (rs6000_option_override_internal):
31754 Automatically set OPTION_MASK_P10_FUSION and
31755 OPTION_MASK_P10_FUSION_LD_CMPI if target is power10.
31756 (rs600_opt_masks): Allow -mpower10-fusion
31757 in function attributes.
31758 (address_is_non_pfx_d_or_x): New function.
31759 * config/rs6000/rs6000.h: Add MASK_P10_FUSION.
31760 * config/rs6000/rs6000.md: Include fusion.md.
31761 * config/rs6000/rs6000.opt: Add -mpower10-fusion
31762 and -mpower10-fusion-ld-cmpi.
31763 * config/rs6000/t-rs6000: Add dependencies involving fusion.md.
31765 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31767 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal
31768 builtin generator macros.
31769 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal<mode>):
31771 (aarch64_<su>mlal<mode>): This.
31772 * config/aarch64/arm_neon.h (vmlal_s8): Use RTL builtin
31773 instead of inline asm.
31774 (vmlal_s16): Likewise.
31775 (vmlal_s32): Likewise.
31776 (vmlal_u8): Likewise.
31777 (vmlal_u16): Likewise.
31778 (vmlal_u32): Likewise.
31780 2021-01-27 Richard Biener <rguenther@suse.de>
31782 PR tree-optimization/98854
31783 * tree-vect-slp.c (vect_build_slp_tree_2): Also build
31784 PHIs from scalars when the number of CTORs matches the
31785 number of children.
31787 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31789 * config/aarch64/aarch64-simd-builtins.def: Add mls_n builtin
31791 * config/aarch64/aarch64-simd.md (*aarch64_mls_elt_merge<mode>):
31793 (aarch64_mls_n<mode>): This.
31794 * config/aarch64/arm_neon.h (vmls_n_s16): Use RTL builtin
31796 (vmls_n_s32): Likewise.
31797 (vmls_n_u16): Likewise.
31798 (vmls_n_u32): Likewise.
31799 (vmlsq_n_s16): Likewise.
31800 (vmlsq_n_s32): Likewise.
31801 (vmlsq_n_u16): Likewise.
31802 (vmlsq_n_u32): Likewise.
31804 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31806 * config/aarch64/aarch64-simd-builtins.def: Add mls builtin
31808 * config/aarch64/arm_neon.h (vmls_s8): Use RTL builtin rather
31810 (vmls_s16): Likewise.
31811 (vmls_s32): Likewise.
31812 (vmls_u8): Likewise.
31813 (vmls_u16): Likewise.
31814 (vmls_u32): Likewise.
31815 (vmlsq_s8): Likewise.
31816 (vmlsq_s16): Likewise.
31817 (vmlsq_s32): Likewise.
31818 (vmlsq_u8): Likewise.
31819 (vmlsq_u16): Likewise.
31820 (vmlsq_u32): Likewise.
31822 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31824 * config/aarch64/aarch64-simd-builtins.def: Add mla_n builtin
31826 * config/aarch64/aarch64-simd.md (*aarch64_mla_elt_merge<mode>):
31828 (aarch64_mla_n<mode>): This.
31829 * config/aarch64/arm_neon.h (vmla_n_s16): Use RTL builtin
31831 (vmla_n_s32): Likewise.
31832 (vmla_n_u16): Likewise.
31833 (vmla_n_u32): Likewise.
31834 (vmlaq_n_s16): Likewise.
31835 (vmlaq_n_s32): Likewise.
31836 (vmlaq_n_u16): Likewise.
31837 (vmlaq_n_u32): Likewise.
31839 2021-01-27 liuhongt <hongtao.liu@intel.com>
31842 * config/i386/sse.md (sse2_gt<mode>3): Drop !TARGET_XOP in condition.
31843 (*sse2_eq<mode>3): Ditto.
31845 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31847 * tree-pass.h (PROP_trees): Rename to ...
31848 (PROP_gimple): ... this.
31849 * cfgexpand.c (pass_data_expand): Replace PROP_trees with PROP_gimple.
31850 * passes.c (execute_function_dump, execute_function_todo,
31851 execute_one_ipa_transform_pass, execute_one_pass): Likewise.
31852 * varpool.c (ctor_for_folding): Likewise.
31854 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31856 PR tree-optimization/97260
31857 * varpool.c: Include tree-pass.h.
31858 (ctor_for_folding): In GENERIC return DECL_INITIAL for TREE_READONLY
31859 non-TREE_SIDE_EFFECTS automatic variables.
31861 2021-01-26 Paul Fee <paul.f.fee@gmail.com>
31863 * doc/cpp.texi (__cplusplus): Document value for -std=c++23
31865 * doc/invoke.texi: Document -std=c++23 and -std=gnu++23.
31866 * dwarf2out.c (highest_c_language): Recognise C++20 and C++23.
31867 (gen_compile_unit_die): Recognise C++23.
31869 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31872 * dwarf2asm.c (dw2_assemble_integer): Cast DWARF2_ADDR_SIZE to int
31875 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31878 * config/aarch64/aarch64.c (aarch64_mask_and_shift_for_ubfiz_p):
31879 Use UINTVAL (shft_amnt) and UINTVAL (mask) instead of INTVAL (shft_amnt)
31880 and INTVAL (mask). Add && INTVAL (mask) > 0 condition.
31882 2021-01-26 Richard Biener <rguenther@suse.de>
31884 * gimple-pretty-print.c (dump_binary_rhs): Handle
31885 VEC_WIDEN_{PLUS,MINUS}_{LO,HI}_EXPR.
31887 2021-01-26 Richard Biener <rguenther@suse.de>
31889 PR middle-end/98726
31890 * tree.h (vector_cst_int_elt): Remove.
31891 * tree.c (vector_cst_int_elt): Use poly_wide_int for computations,
31894 2021-01-26 Andrew Stubbs <ams@codesourcery.com>
31896 * config/gcn/gcn.c (gcn_expand_reduc_scalar): Use move instructions
31897 for V64DFmode min/max reductions.
31899 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31901 * dwarf2asm.c (dw2_assemble_integer): Handle size twice as large
31902 as DWARF2_ADDR_SIZE if x is not a scalar int by emitting it as
31903 two halves, one with x and the other with const0_rtx, ordered
31904 depending on endianity.
31906 2021-01-26 Alexandre Oliva <oliva@adacore.com>
31908 * gimplify.c (gimplify_decl_expr): Skip asan marking calls for
31909 temporaries not seen in binding block, and not about to be
31910 added as gimple variables.
31912 2021-01-25 Martin Sebor <msebor@redhat.com>
31915 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Adjust warning text.
31917 2021-01-25 Martin Liska <mliska@suse.cz>
31919 * value-prof.c (get_nth_most_common_value): Use %s instead
31922 2021-01-25 Jakub Jelinek <jakub@redhat.com>
31925 * configure.ac (HAVE_AS_GDWARF_5_DEBUG_FLAG): Only define if
31926 readelf -wi is able to read the emitted .debug_info back.
31927 * configure: Regenerated.
31929 2021-01-25 Martin Liska <mliska@suse.cz>
31931 PR gcov-profile/98739
31932 * common.opt: Add missing sign symbol.
31933 * value-prof.c (get_nth_most_common_value): Restore handling
31934 of PROFILE_REPRODUCIBILITY_PARALLEL_RUNS and
31935 PROFILE_REPRODUCIBILITY_MULTITHREADED.
31937 2021-01-25 Richard Biener <rguenther@suse.de>
31939 PR middle-end/98807
31940 * tree.c (vector_element_bits): Always use precision of
31941 the element type for boolean vectors.
31943 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
31945 * config/rtems.h (STARTFILE_SPEC): Remove qnolinkcmds.
31946 (ENDFILE_SPEC): Evaluate qnolinkcmds.
31948 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
31950 * config/rtems.h (STARTFILE_SPEC): Remove nostdlib and
31951 nostartfiles handling since this is already done by
31952 LINK_COMMAND_SPEC. Evaluate qnolinkcmds.
31953 (ENDFILE_SPEC): Remove nostdlib and nostartfiles handling since this
31954 is already done by LINK_COMMAND_SPEC.
31955 (LIB_SPECS): Remove nostdlib and nodefaultlibs handling since
31956 this is already done by LINK_COMMAND_SPEC. Remove qnolinkcmds
31959 2021-01-25 Jakub Jelinek <jakub@redhat.com>
31962 * fold-const-call.c (host_size_t_cst_p): Renamed to ...
31963 (size_t_cst_p): ... this. Check and store unsigned HOST_WIDE_INT
31964 value rather than host size_t.
31965 (fold_const_call): Change type of s2 from size_t to
31966 unsigned HOST_WIDE_INT. Use size_t_cst_p instead of
31967 host_size_t_cst_p. For strncmp calls, pass MIN (s2, SIZE_MAX)
31968 instead of s2 as last argument.
31970 2021-01-25 Tamar Christina <tamar.christina@arm.com>
31972 * config/arm/iterators.md (rotsplit1, rotsplit2, conj_op, fcmac1,
31973 VCMLA_OP, VCMUL_OP): New.
31974 * config/arm/mve.md (mve_vcmlaq<mve_rot><mode>): Support vec_dup 0.
31975 * config/arm/neon.md (cmul<conj_op><mode>3): New.
31976 * config/arm/unspecs.md (UNSPEC_VCMLA_CONJ, UNSPEC_VCMLA180_CONJ,
31977 UNSPEC_VCMUL_CONJ): New.
31978 * config/arm/vec-common.md (cmul<conj_op><mode>3, arm_vcmla<rot><mode>,
31979 cml<fcmac1><conj_op><mode>4): New.
31981 2021-01-23 Jakub Jelinek <jakub@redhat.com>
31984 * config/rs6000/mmintrin.h (__m64): Add __may_alias__ attribute.
31986 2021-01-22 Jonathan Wright <jonathan.wright@arm.com>
31988 * config/aarch64/aarch64-simd-builtins.def: Add mla builtin
31990 * config/aarch64/arm_neon.h (vmla_s8): Use RTL builtin rather
31992 (vmla_s16): Likewise.
31993 (vmla_s32): Likewise.
31994 (vmla_u8): Likewise.
31995 (vmla_u16): Likewise.
31996 (vmla_u32): Likewise.
31997 (vmlaq_s8): Likewise.
31998 (vmlaq_s16): Likewise.
31999 (vmlaq_s32): Likewise.
32000 (vmlaq_u8): Likewise.
32001 (vmlaq_u16): Likewise.
32002 (vmlaq_u32): Likewise.
32004 2021-01-22 David Malcolm <dmalcolm@redhat.com>
32006 * doc/invoke.texi (GCC_EXTRA_DIAGNOSTIC_OUTPUT): Add @findex
32009 2021-01-22 Jakub Jelinek <jakub@redhat.com>
32012 * dwarf2out.c (output_file_names): For -gdwarf-5, if there are no
32013 filenames to emit, still emit the required 0 index directory and
32014 filename entries that match DW_AT_comp_dir and DW_AT_name of the
32017 2021-01-22 Marek Polacek <polacek@redhat.com>
32020 * doc/invoke.texi: Update C++ ABI Version 15 description.
32022 2021-01-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32024 PR tree-optimization/98766
32025 * tree-ssa-math-opts.c (convert_mult_to_fma): Use maybe_le when
32026 comparing against type size with param_avoid_fma_max_bits.
32028 2021-01-22 Richard Biener <rguenther@suse.de>
32030 PR middle-end/98793
32031 * tree.c (vector_element_bits): Key single-bit bool vector on
32032 integer mode rather than not vector mode.
32034 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
32037 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
32038 Generate ARRAY_REF(VIEW_CONVERT_EXPR) for P8 and later
32040 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): Update
32041 to call different path for P8 and P9.
32042 (rs6000_expand_vector_set_var_p9): New function.
32043 (rs6000_expand_vector_set_var_p8): New function.
32045 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
32049 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
32050 Ajdust variable index vec_insert from address dereference to
32051 ARRAY_REF(VIEW_CONVERT_EXPR) tree expression.
32052 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
32054 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): New function.
32056 2021-01-22 Martin Liska <mliska@suse.cz>
32058 PR gcov-profile/98739
32059 * profile.c (compute_value_histograms): Drop time profile for
32060 -fprofile-reproducible=multithreaded.
32062 2021-01-22 Nathan Sidwell <nathan@acm.org>
32064 * gcc.c (process_command): Don't check OPT_SPECIAL_input_file
32067 2021-01-22 Richard Biener <rguenther@suse.de>
32069 PR middle-end/98773
32070 * tree-data-ref.c (initalize_matrix_A): Revert previous
32071 change, retaining failing on HOST_WIDE_INT_MIN CHREC_RIGHT.
32073 2021-01-22 Jakub Jelinek <jakub@redhat.com>
32075 PR tree-optimization/90248
32076 * match.pd (X cmp 0.0 ? 1.0 : -1.0 -> copysign(1, +-X),
32077 X cmp 0.0 ? -1.0 : +1.0 -> copysign(1, -+X)): Remove
32079 (X * (X cmp 0.0 ? 1.0 : -1.0) -> +-abs(X),
32080 X * (X cmp 0.0 ? -1.0 : 1.0) -> +-abs(X)): New simplifications.
32082 2021-01-22 Jakub Jelinek <jakub@redhat.com>
32084 PR tree-optimization/98255
32085 * tree-dfa.c (get_ref_base_and_extent): For ARRAY_REFs, sign
32086 extend index - low_bound from sizetype's precision rather than index
32088 (get_addr_base_and_unit_offset_1): Likewise.
32089 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Likewise.
32090 * gimple-fold.c (fold_const_aggregate_ref_1): Likewise.
32092 2021-01-22 Richard Biener <rguenther@suse.de>
32094 PR tree-optimization/98786
32095 * tree-ssa-phiopt.c (factor_out_conditional_conversion): Avoid
32096 adding new uses of abnormals. Verify we deal with a conditional
32099 2021-01-22 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
32102 * optc-save-gen.awk: Add arm_fp16_format to checked_options.
32104 2021-01-22 liuhongt <hongtao.liu@intel.com>
32108 * config/i386/sse.md (VI_128_256): New mode iterator.
32109 (*avx_cmp<mode>3_1, *avx_cmp<mode>3_2, *avx_cmp<mode>3_3,
32110 *avx_cmp<mode>3_4, *avx2_eq<mode>3, *avx2_pcmp<mode>3_1,
32111 *avx2_pcmp<mode>3_2, *avx2_gt<mode>3): New
32112 define_insn_and_split to lower avx512 vector comparison to avx
32113 version when dest is vector.
32114 (*<avx512>_cmp<mode>3,*<avx512>_cmp<mode>3,*<avx512>_ucmp<mode>3):
32115 define_insn_and_split for negating the comparison result.
32116 * config/i386/predicates.md (float_vector_all_ones_operand):
32118 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Use
32119 general NOT operator without UNSPEC_MASKOP.
32121 2021-01-21 Vladimir N. Makarov <vmakarov@redhat.com>
32123 PR rtl-optimization/98777
32124 * lra-int.h (lra_pmode_pseudo): New extern.
32125 * lra.c (lra_pmode_pseudo): New global.
32127 * lra-eliminations.c (eliminate_regs_in_insn): Use it.
32129 2021-01-21 Ilya Leoshkevich <iii@linux.ibm.com>
32131 * fwprop.c (fwprop_propagation::classify_result): Allow
32132 (subreg (mem)) simplifications.
32134 2021-01-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32136 * config/aarch64/aarch64-simd.md (aarch64_sqdml<SBINQOPS:as>l<mode>):
32138 (aarch64_sqdmlal<mode>): ... This...
32139 (aarch64_sqdmlsl<mode>): ... And this.
32140 (aarch64_sqdml<SBINQOPS:as>l_lane<mode>): Split into...
32141 (aarch64_sqdmlal_lane<mode>): ... This...
32142 (aarch64_sqdmlsl_lane<mode>): ... And this.
32143 (aarch64_sqdml<SBINQOPS:as>l_laneq<mode>): Split into...
32144 (aarch64_sqdmlsl_laneq<mode>): ... This...
32145 (aarch64_sqdmlal_laneq<mode>): ... And this.
32146 (aarch64_sqdml<SBINQOPS:as>l_n<mode>): Split into...
32147 (aarch64_sqdmlsl_n<mode>): ... This...
32148 (aarch64_sqdmlal_n<mode>): ... And this.
32149 (aarch64_sqdml<SBINQOPS:as>l2<mode>_internal): Split into...
32150 (aarch64_sqdmlal2<mode>_internal): ... This...
32151 (aarch64_sqdmlsl2<mode>_internal): ... And this.
32153 2021-01-21 Christophe Lyon <christophe.lyon@linaro.org>
32155 * config/arm/arm_mve.h (__arm_vcmpneq_s8): Fix return type.
32157 2021-01-21 Andrea Corallo <andrea.corallo@arm.com>
32160 * doc/sourcebuild.texi (arm_thumb2_no_arm_v8_1_lob): Document.
32162 2021-01-21 liuhongt <hongtao.liu@intel.com>
32164 PR rtl-optimization/98694
32165 * regcprop.c (copy_value): If SRC had been assigned a mode
32166 narrower than the copy, we can't link DEST into the chain even
32167 they have same hard_regno_nregs(i.e. HImode/SImode in i386
32170 2021-01-20 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32172 * config/aarch64/aarch64-simd.md (aarch64_get_lane<mode>):
32173 Convert to define_insn_and_split. Split into simple move when moving
32176 2021-01-20 Segher Boessenkool <segher@kernel.crashing.org>
32178 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_store): Change assert.
32179 Adjust comment. Simplify code.
32181 2021-01-20 Jakub Jelinek <jakub@redhat.com>
32184 * dwarf2out.c (reset_indirect_string): Also reset indirect strings
32185 with DW_FORM_line_strp form.
32186 (prune_unused_types_update_strings): Don't add into debug_str_hash
32187 indirect strings with DW_FORM_line_strp form.
32188 (adjust_name_comp_dir): New function.
32189 (dwarf2out_finish): Call it on CU DIEs after resetting
32190 debug_line_str_hash.
32192 2021-01-20 Vladimir N. Makarov <vmakarov@redhat.com>
32194 PR rtl-optimization/98722
32195 * lra-eliminations.c (eliminate_regs_in_insn): Check that target
32196 has no 3-op add insn to transform insns containing two pluses.
32198 2021-01-20 Richard Biener <rguenther@suse.de>
32200 * hwint.h (add_hwi): New function.
32201 (mul_hwi): Likewise.
32202 * tree-data-ref.c (initialize_matrix_A): Properly translate
32203 tree constants and avoid HOST_WIDE_INT_MIN.
32204 (lambda_matrix_row_add): Avoid undefined integer overflow
32205 and return true on such overflow.
32206 (lambda_matrix_right_hermite): Handle overflow from
32207 lambda_matrix_row_add gracefully. Simplify previous fix.
32208 (analyze_subscript_affine_affine): Likewise.
32210 2021-01-20 Eugene Rozenfeld <erozen@microsoft.com>
32212 PR tree-optimization/96674
32213 * match.pd: New patterns: x < y || y == XXX_MIN --> x <= y - 1
32214 x >= y && y != XXX_MIN --> x > y - 1
32216 2021-01-20 Richard Sandiford <richard.sandiford@arm.com>
32218 PR tree-optimization/98535
32219 * tree-vect-slp.c (duplicate_and_interleave): Use quick_grow_cleared.
32220 If the high and low permutes are the same, remove the high permutes
32221 from the working set and only continue with the low ones.
32223 2021-01-20 Jakub Jelinek <jakub@redhat.com>
32225 PR tree-optimization/98721
32226 * builtins.c (access_ref::inform_access): Don't assume
32227 SSA_NAME_IDENTIFIER must be non-NULL. Print messages about
32228 object whenever allocfn is NULL, rather than only when DECL_P
32229 is true. Use %qE instead of %qD for that. Formatting fixes.
32231 2021-01-20 Richard Biener <rguenther@suse.de>
32233 PR tree-optimization/98758
32234 * tree-data-ref.c (int_divides_p): Use lambda_int arguments.
32235 (lambda_matrix_right_hermite): Avoid undefinedness with
32236 signed integer abs and multiplication.
32237 (analyze_subscript_affine_affine): Use lambda_int.
32239 2021-01-20 David Malcolm <dmalcolm@redhat.com>
32242 * dwarf2out.c (output_line_info): Rename static variable
32243 "generation", moving it out of the function to...
32244 (output_line_info_generation): New.
32245 (init_sections_and_labels): Likewise, renaming the variable to...
32246 (init_sections_and_labels_generation): New.
32247 (dwarf2out_c_finalize): Reset the new variables.
32249 2021-01-19 Martin Sebor <msebor@redhat.com>
32251 PR middle-end/98664
32252 * tree-ssa-live.c (remove_unused_scope_block_p): Keep scopes for
32253 all functions, even if they're not declared artificial or inline.
32254 * tree.c (tree_inlined_location): Use macro expansion location
32255 only if scope traversal fails to expose one.
32257 2021-01-19 Richard Sandiford <richard.sandiford@arm.com>
32259 PR rtl-optimization/92294
32260 * alias.c (compare_base_symbol_refs): Take an extra parameter
32261 and add the distance between two symbols to it. Enshrine in
32262 comments that -1 means "either 0 or 1, but we can't tell
32263 which at compile time".
32264 (memrefs_conflict_p): Update call accordingly.
32265 (rtx_equal_for_memref_p): Likewise. Take the distance between symbols
32268 2021-01-19 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32270 * config/aarch64/aarch64-simd-builtins.def (sqshl, uqshl,
32271 sqrshl, uqrshl, sqadd, uqadd, sqsub, uqsub, suqadd, usqadd, sqmovn,
32272 uqmovn, sqxtn2, uqxtn2, sqabs, sqneg, sqdmlal, sqdmlsl, sqdmlal_lane,
32273 sqdmlsl_lane, sqdmlal_laneq, sqdmlsl_laneq, sqdmlal_n, sqdmlsl_n,
32274 sqdmlal2, sqdmlsl2, sqdmlal2_lane, sqdmlsl2_lane, sqdmlal2_laneq,
32275 sqdmlsl2_laneq, sqdmlal2_n, sqdmlsl2_n, sqdmull, sqdmull_lane,
32276 sqdmull_laneq, sqdmull_n, sqdmull2, sqdmull2_lane, sqdmull2_laneq,
32277 sqdmull2_n, sqdmulh, sqrdmulh, sqdmulh_lane, sqdmulh_laneq,
32278 sqrdmulh_lane, sqrdmulh_laneq, sqshrun_n, sqrshrun_n, sqshrn_n,
32279 uqshrn_n, sqrshrn_n, uqrshrn_n, sqshlu_n, sqshl_n, uqshl_n, sqrdmlah,
32280 sqrdmlsh, sqrdmlah_lane, sqrdmlsh_lane, sqrdmlah_laneq, sqrdmlsh_laneq,
32281 sqmovun): Use NONE flags.
32283 2021-01-19 Richard Biener <rguenther@suse.de>
32286 * ipa-modref.c (analyze_stmt): Only record a summary for a
32289 2021-01-19 Richard Biener <rguenther@suse.de>
32291 PR middle-end/98638
32292 * tree-ssanames.c (fini_ssanames): Zero SSA_NAME_DEF_STMT.
32294 2021-01-19 Daniel Hellstrom <daniel@gaisler.com>
32296 * config/sparc/rtemself.h (TARGET_OS_CPP_BUILTINS): Add
32297 built-in define __FIX_LEON3FT_TN0018.
32299 2021-01-19 Richard Biener <rguenther@suse.de>
32302 * tree-inline.c (tree_function_versioning): Set input_location
32303 to UNKNOWN_LOCATION throughout the function.
32305 2021-01-19 Tobias Burnus <tobias@codesourcery.com>
32308 * omp-low.c (lower_omp_target): Handle nonpointer is_device_ptr.
32310 2021-01-19 Martin Jambor <mjambor@suse.cz>
32313 * ipa-sra.c (ssa_name_only_returned_p): New parameter fun. Check
32314 whether non-call exceptions allow removal of a statement.
32315 (isra_analyze_call): Pass the appropriate function to
32316 ssa_name_only_returned_p.
32318 2021-01-19 Geng Qi <gengqi@linux.alibaba.com>
32320 * config/riscv/arch-canonicalize (longext_sort): New function for
32321 sorting 'multi-letter'.
32322 * config/riscv/multilib-generator: Adjusting the loop of 'alt' in
32323 'alts'. The 'arch' may not be the first of 'alts'.
32324 (_expand_combination): Add underline for the 'ext' without '*'.
32325 This is because, a single-letter extension can always be treated well
32326 with a '_' prefix, but it cannot be separated out if it is appended
32329 2021-01-18 Vladimir N. Makarov <vmakarov@redhat.com>
32332 * ira.c (ira): Skip abnormal critical edge splitting.
32334 2021-01-18 Jakub Jelinek <jakub@redhat.com>
32336 PR tree-optimization/98727
32337 * tree-ssa-math-opts.c (match_arith_overflow): Fix up computation of
32338 second .MUL_OVERFLOW operand for signed multiplication with overflow
32339 checking if the second operand of multiplication is not constant.
32341 2021-01-18 David Edelsohn <dje.gcc@gmail.com>
32343 * doc/invoke.texi (-gdwarf): TPF defaults to version 2 and AIX
32344 defaults to version 4.
32346 2021-01-18 David Malcolm <dmalcolm@redhat.com>
32348 * attribs.h (fndecl_dealloc_argno): New decl.
32349 * builtins.c (call_dealloc_argno): Split out second half of
32351 (fndecl_dealloc_argno): New.
32352 * doc/extend.texi (Common Function Attributes): Document the
32353 interaction between the analyzer and the malloc attribute.
32354 * doc/invoke.texi (Static Analyzer Options): Likewise.
32356 2021-01-17 David Edelsohn <dje.gcc@gmail.com>
32358 * config/rs6000/aix71.h (SUBTARGET_OVERRIDE_OPTIONS): Override
32359 dwarf_version to 4.
32360 * config/rs6000/aix72.h (SUBTARGET_OVERRIDE_OPTIONS): Same.
32362 2021-01-17 Martin Jambor <mjambor@suse.cz>
32365 * cgraph.c (clone_of_p): Check also former_clone_of as we climb
32368 2021-01-17 Mark Wielaard <mark@klomp.org>
32370 * common.opt (gdwarf-): Init(5).
32371 * doc/invoke.texi (-gdwarf): Document default to 5.
32373 2021-01-16 Kwok Cheung Yeung <kcy@codesourcery.com>
32375 * builtin-types.def
32376 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT): Rename
32378 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR):
32379 ...this. Add extra argument.
32380 * gimplify.c (omp_default_clause): Ensure that event handle is
32381 firstprivate in a task region.
32382 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_DETACH.
32383 (gimplify_adjust_omp_clauses): Likewise.
32384 * omp-builtins.def (BUILT_IN_GOMP_TASK): Change function type to
32385 BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR.
32386 * omp-expand.c (expand_task_call): Add GOMP_TASK_FLAG_DETACH to flags
32387 if detach clause specified. Add detach argument when generating
32389 * omp-low.c (scan_sharing_clauses): Setup data environment for detach
32391 (finish_taskreg_scan): Move field for variable containing the event
32392 handle to the front of the struct.
32393 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_DETACH. Fix
32395 * tree-nested.c (convert_nonlocal_omp_clauses): Handle
32396 OMP_CLAUSE_DETACH clause.
32397 (convert_local_omp_clauses): Handle OMP_CLAUSE_DETACH clause.
32398 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_DETACH.
32399 * tree.c (omp_clause_num_ops): Add entry for OMP_CLAUSE_DETACH.
32401 (omp_clause_code_name): Add entry for OMP_CLAUSE_DETACH. Fix
32403 (walk_tree_1): Handle OMP_CLAUSE_DETACH.
32405 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32407 * config/nios2/t-rtems: Reset all MULTILIB_* variables. Shorten
32408 multilib directory names. Use MULTILIB_REQUIRED instead of
32409 MULTILIB_EXCEPTIONS. Add -mhw-mul -mhw-mulx -mhw-div
32410 -mcustom-fpu-cfg=fph2 multilib.
32412 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32414 * config/nios2/nios2.c (NIOS2_FPU_CONFIG_NUM): Adjust value.
32415 (nios2_init_fpu_configs): Provide register values for new
32416 -mcustom-fpu-cfg=fph2 option variant.
32417 * doc/invoke.texi (-mcustom-fpu-cfg=fph2): Document new option
32420 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32422 * config/nios2/nios2.c (nios2_custom_check_insns): Remove
32423 custom instruction warnings.
32425 2021-01-16 Jakub Jelinek <jakub@redhat.com>
32427 PR tree-optimization/96669
32428 * match.pd ((CST << x) & 1 -> x == 0): New simplification.
32430 2021-01-16 Jakub Jelinek <jakub@redhat.com>
32432 PR tree-optimization/96271
32433 * passes.def: Pass false argument to first two pass_cd_dce
32434 instances and true to last instance. Add comment that
32435 last instance rewrites no longer addressed locals.
32436 * tree-ssa-dce.c (pass_cd_dce): Add update_address_taken_p member and
32438 (pass_cd_dce::set_pass_param): New method.
32439 (pass_cd_dce::execute): Return TODO_update_address_taken from
32440 last cd_dce instance.
32442 2021-01-15 Carl Love <cel@us.ibm.com>
32444 * config/rs6000/altivec.h (vec_mulh, vec_div, vec_dive, vec_mod):
32446 * config/rs6000/altivec.md (VIlong): Move define to file vsx.md.
32447 * config/rs6000/rs6000-builtin.def (DIVES_V4SI, DIVES_V2DI,
32448 DIVEU_V4SI, DIVEU_V2DI, DIVS_V4SI, DIVS_V2DI, DIVU_V4SI,
32449 DIVU_V2DI, MODS_V2DI, MODS_V4SI, MODU_V2DI, MODU_V4SI,
32450 MULHS_V2DI, MULHS_V4SI, MULHU_V2DI, MULHU_V4SI, MULLD_V2DI):
32451 Add builtin define.
32452 (MULH, DIVE, MOD): Add new BU_P10_OVERLOAD_2 definitions.
32453 * config/rs6000/rs6000-call.c (VSX_BUILTIN_VEC_DIV,
32454 VSX_BUILTIN_VEC_DIVE, P10_BUILTIN_VEC_MOD, P10_BUILTIN_VEC_MULH):
32455 New overloaded definitions.
32456 (builtin_function_type) [P10V_BUILTIN_DIVEU_V4SI,
32457 P10V_BUILTIN_DIVEU_V2DI, P10V_BUILTIN_DIVU_V4SI,
32458 P10V_BUILTIN_DIVU_V2DI, P10V_BUILTIN_MODU_V2DI,
32459 P10V_BUILTIN_MODU_V4SI, P10V_BUILTIN_MULHU_V2DI,
32460 P10V_BUILTIN_MULHU_V4SI]: Add case
32461 statement for builtins.
32462 * config/rs6000/rs6000.md (bits): Add new attribute sizes V4SI, V2DI.
32463 * config/rs6000/vsx.md (VIlong): Moved from config/rs6000/altivec.md.
32464 (UNSPEC_VDIVES, UNSPEC_VDIVEU): New unspec definitions.
32465 (vsx_mul_v2di): Add if TARGET_POWER10 statement.
32466 (vsx_udiv_v2di): Add if TARGET_POWER10 statement.
32467 (dives_<mode>, diveu_<mode>, div<mode>3, uvdiv<mode>3,
32468 mods_<mode>, modu_<mode>, mulhs_<mode>, mulhu_<mode>, mulv2di3):
32469 Add define_insn, mode is VIlong.
32470 * doc/extend.texi (vec_mulh, vec_mul, vec_div, vec_dive, vec_mod):
32471 Add builtin descriptions.
32473 2021-01-15 Eric Botcazou <ebotcazou@adacore.com>
32475 * final.c (final_start_function_1): Reset force_source_line.
32477 2021-01-15 Jakub Jelinek <jakub@redhat.com>
32479 PR tree-optimization/96669
32480 * match.pd (((1 << A) & 1) != 0 -> A == 0,
32481 ((1 << A) & 1) == 0 -> A != 0): Generalize for 1s replaced by
32482 possibly different power of two constants and to right shift too.
32484 2021-01-15 Jakub Jelinek <jakub@redhat.com>
32486 PR tree-optimization/96681
32487 * match.pd ((x < 0) ^ (y < 0) to (x ^ y) < 0): New simplification.
32488 ((x >= 0) ^ (y >= 0) to (x ^ y) < 0): Likewise.
32489 ((x < 0) ^ (y >= 0) to (x ^ y) >= 0): Likewise.
32490 ((x >= 0) ^ (y < 0) to (x ^ y) >= 0): Likewise.
32492 2021-01-15 Alexandre Oliva <oliva@adacore.com>
32494 * opts.c (gen_command_line_string): Exclude -dumpbase-ext.
32496 2021-01-15 Tamar Christina <tamar.christina@arm.com>
32498 * config/aarch64/aarch64-simd.md (cml<fcmac1><conj_op><mode>4,
32499 cmul<conj_op><mode>3): New.
32500 * config/aarch64/iterators.md (UNSPEC_FCMUL,
32501 UNSPEC_FCMUL180, UNSPEC_FCMLA_CONJ, UNSPEC_FCMLA180_CONJ,
32502 UNSPEC_CMLA_CONJ, UNSPEC_CMLA180_CONJ, UNSPEC_CMUL, UNSPEC_CMUL180,
32503 FCMLA_OP, FCMUL_OP, conj_op, rotsplit1, rotsplit2, fcmac1, sve_rot1,
32504 sve_rot2, SVE2_INT_CMLA_OP, SVE2_INT_CMUL_OP, SVE2_INT_CADD_OP): New.
32505 (rot): Add UNSPEC_FCMUL, UNSPEC_FCMUL180.
32506 (rot_op): Renamed to conj_op.
32507 * config/aarch64/aarch64-sve.md (cml<fcmac1><conj_op><mode>4,
32508 cmul<conj_op><mode>3): New.
32509 * config/aarch64/aarch64-sve2.md (cml<fcmac1><conj_op><mode>4,
32510 cmul<conj_op><mode>3): New.
32512 2021-01-15 David Malcolm <dmalcolm@redhat.com>
32516 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
32517 Escape the tempfile name when constructing the expected output.
32519 2021-01-15 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32521 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlsl_hi<mode>):
32523 (aarch64_<su>mlsl_hi<mode>): ... This.
32524 (aarch64_<su>mlsl_hi<mode>): Define.
32525 (*aarch64_<su>mlsl<mode): Rename to...
32526 (aarch64_<su>mlsl<mode): ... This.
32527 * config/aarch64/aarch64-simd-builtins.def (smlsl, umlsl,
32528 smlsl_hi, umlsl_hi): Define builtins.
32529 * config/aarch64/arm_neon.h (vmlsl_high_s8, vmlsl_high_s16,
32530 vmlsl_high_s32, vmlsl_high_u8, vmlsl_high_u16, vmlsl_high_u32,
32531 vmlsl_s8, vmlsl_s16, vmlsl_s32, vmlsl_u8,
32532 vmlsl_u16, vmlsl_u32): Reimplement with builtins.
32534 2021-01-15 Uroš Bizjak <ubizjak@gmail.com>
32536 * config/i386/i386-c.c (ix86_target_macros):
32537 Use cpp_define_formatted for __SIZEOF_FLOAT80__ definition.
32539 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32542 * config.gcc (aarch64*-*-*): Add aarch64-cc-fusion.o to extra_objs.
32543 * Makefile.in (RTL_SSA_H): New variable.
32544 * config/aarch64/t-aarch64 (aarch64-cc-fusion.o): New rule.
32545 * config/aarch64/aarch64-protos.h (make_pass_cc_fusion): Declare.
32546 * config/aarch64/aarch64-passes.def: Add pass_cc_fusion after
32548 * config/aarch64/aarch64-cc-fusion.cc: New file.
32550 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32552 * recog.h (insn_change_watermark::~insn_change_watermark): Avoid
32553 calling cancel_changes for changes that no longer exist.
32555 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32557 * rtl-ssa/functions.h (function_info::ref_defs): Rename to...
32558 (function_info::reg_defs): ...this.
32559 * rtl-ssa/member-fns.inl (function_info::ref_defs): Rename to...
32560 (function_info::reg_defs): ...this.
32562 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32565 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32567 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32570 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32573 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32575 2021-01-15 Richard Biener <rguenther@suse.de>
32577 PR tree-optimization/96376
32578 * tree-vect-stmts.c (get_load_store_type): Disregard alignment
32579 for VMAT_INVARIANT.
32581 2021-01-15 Martin Liska <mliska@suse.cz>
32583 * doc/install.texi: Document that some tests need pytest module.
32584 * doc/sourcebuild.texi: Likewise.
32586 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32589 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32591 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32593 * config/arm/mve.md (mve_vshrq_n_s<mode>_imm): New entry.
32594 (mve_vshrq_n_u<mode>_imm): Likewise.
32595 * config/arm/neon.md (vashr<mode>3, vlshr<mode>3): Move to ...
32596 * config/arm/vec-common.md: ... here.
32598 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32600 * config/arm/mve.md (mve_vshlq_<supf><mode>): Move to
32602 * config/arm/neon.md (vashl<mode>3): Delete.
32603 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): New.
32604 (vasl<mode>3): New expander.
32606 2021-01-15 Richard Biener <rguenther@suse.de>
32608 PR tree-optimization/98685
32609 * tree-vect-slp.c (vect_schedule_slp_node): Refactor handling
32610 of vector extern defs.
32612 2021-01-14 David Malcolm <dmalcolm@redhat.com>
32615 * diagnostic.c (diagnostic_kind_text): Break out this array
32617 (diagnostic_build_prefix): ...here.
32618 (fancy_abort): Detect when diagnostic_initialize has not yet been
32619 called and fall back to a minimal implementation of printing the
32620 ICE, rather than segfaulting in internal_error.
32622 2021-01-14 David Malcolm <dmalcolm@redhat.com>
32624 * diagnostic.c (diagnostic_initialize): Eliminate
32625 parseable_fixits_p in favor of initializing extra_output_kind from
32626 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
32627 (convert_column_unit): New function, split out from...
32628 (diagnostic_converted_column): ...this.
32629 (print_parseable_fixits): Add "column_unit" and "tabstop" params.
32630 Use them to call convert_column_unit on the column values.
32631 (diagnostic_report_diagnostic): Eliminate conditional on
32632 parseable_fixits_p in favor of a switch statement on
32633 extra_output_kind, passing the appropriate values to the new
32634 params of print_parseable_fixits.
32635 (selftest::test_print_parseable_fixits_none): Update for new
32636 params of print_parseable_fixits.
32637 (selftest::test_print_parseable_fixits_insert): Likewise.
32638 (selftest::test_print_parseable_fixits_remove): Likewise.
32639 (selftest::test_print_parseable_fixits_replace): Likewise.
32640 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
32642 (selftest::diagnostic_c_tests): Call it.
32643 * diagnostic.h (enum diagnostics_extra_output_kind): New.
32644 (diagnostic_context::parseable_fixits_p): Delete field in favor
32646 (diagnostic_context::extra_output_kind): ...this new field.
32647 * doc/invoke.texi (Environment Variables): Add
32648 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
32649 * opts.c (common_handle_option): Update handling of
32650 OPT_fdiagnostics_parseable_fixits for change to diagnostic_context
32653 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32655 * tree-vect-slp-patterns.c (class complex_operations_pattern,
32656 complex_operations_pattern::matches,
32657 complex_operations_pattern::recognize,
32658 complex_operations_pattern::build): New.
32659 (slp_patterns): Use it.
32661 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32663 * internal-fn.def (COMPLEX_FMS, COMPLEX_FMS_CONJ): New.
32664 * optabs.def (cmls_optab, cmls_conj_optab): New.
32665 * doc/md.texi: Document them.
32666 * tree-vect-slp-patterns.c (class complex_fms_pattern,
32667 complex_fms_pattern::matches, complex_fms_pattern::recognize,
32668 complex_fms_pattern::build): New.
32670 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32672 * internal-fn.def (COMPLEX_FMA, COMPLEX_FMA_CONJ): New.
32673 * optabs.def (cmla_optab, cmla_conj_optab): New.
32674 * doc/md.texi: Document them.
32675 * tree-vect-slp-patterns.c (vect_match_call_p,
32676 class complex_fma_pattern, vect_slp_reset_pattern,
32677 complex_fma_pattern::matches, complex_fma_pattern::recognize,
32678 complex_fma_pattern::build): New.
32680 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32682 * internal-fn.def (COMPLEX_MUL, COMPLEX_MUL_CONJ): New.
32683 * optabs.def (cmul_optab, cmul_conj_optab): New.
32684 * doc/md.texi: Document them.
32685 * tree-vect-slp-patterns.c (vect_match_call_complex_mla,
32686 vect_normalize_conj_loc, is_eq_or_top, vect_validate_multiplication,
32687 vect_build_combine_node, class complex_mul_pattern,
32688 complex_mul_pattern::matches, complex_mul_pattern::recognize,
32689 complex_mul_pattern::build): New.
32691 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32693 * tree-vect-slp.c (optimize_load_redistribution_1): New.
32694 (optimize_load_redistribution, vect_is_slp_load_node): New.
32695 (vect_match_slp_patterns): Use it.
32697 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32699 * tree-vect-slp-patterns.c (complex_add_pattern::build):
32702 2021-01-14 Thomas Schwinge <thomas@codesourcery.com>
32704 * config/gcn/mkoffload.c (main): Create an offload image only in
32705 64-bit configurations.
32707 2021-01-14 H.J. Lu <hjl.tools@gmail.com>
32710 * config/i386/i386-options.c (ix86_option_override_internal):
32711 Issue an error for -fcf-protection with CF_BRANCH when compiling
32712 for 32-bit non-TARGET_CMOV targets.
32714 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
32717 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p):
32718 Remove declaration and initialization of shadow variable "ret".
32719 (ix86_option_override_internal): Remove delcaration of
32720 shadow variable "i". Redeclare shadowed variable to unsigned.
32721 * common/config/i386/i386-common.c (pta_size): Redeclare to unsigned.
32722 * config/i386/i386-builtins.c (get_builtin_code_for_version):
32723 Update for redeclaration.
32724 * config/i386/i386.h (pta_size): Ditto.
32726 2021-01-14 Richard Biener <rguenther@suse.de>
32728 PR tree-optimization/98674
32729 * tree-data-ref.c (base_supports_access_fn_components_p): New.
32730 (initialize_data_dependence_relation): For two bases without
32731 possible access fns resort to type size equality when determining
32732 shape compatibility.
32734 2021-01-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
32737 * config/arm/arm_neon.h: Replace calls to __builtin_vcge* by
32738 <=, >= operators in vcle and vcge intrinsics respectively.
32739 * config/arm/arm_neon_builtins.def: Remove entry for
32742 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
32745 * config/i386/i386-options.c (ix86_function_specific_save):
32746 Remove redundant assignment to opts->x_ix86_branch_cost.
32747 * config/i386/i386.c (ix86_prefetch_sse):
32748 Rename from x86_prefetch_sse. Update all uses.
32749 * config/i386/i386.h: Update for rename.
32750 * config/i386/i386-options.h: Ditto.
32752 2021-01-14 Jakub Jelinek <jakub@redhat.com>
32755 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3,
32756 *sse4_1_zero_extendv4hiv4si2_3, *sse4_1_zero_extendv2siv2di2_3):
32757 Use Bm instead of m for non-avx. Add isa attribute.
32759 2021-01-14 Jakub Jelinek <jakub@redhat.com>
32761 PR tree-optimization/96688
32762 * match.pd (~(X >> Y) -> ~X >> Y): New simplification if
32763 ~X can be simplified.
32765 2021-01-14 Richard Sandiford <richard.sandiford@arm.com>
32767 * tree-vect-stmts.c (vect_model_load_cost): Account for unused
32768 IFN_LOAD_LANES results.
32770 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32772 * config/aarch64/aarch64-simd.md (aarch64_<su>xtl<mode>):
32774 (aarch64_xtn<mode>): Likewise.
32775 * config/aarch64/aarch64-simd-builtins.def (sxtl, uxtl, xtn):
32778 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
32780 (vmovl_s16): Likewise.
32781 (vmovl_s32): Likewise.
32782 (vmovl_u8): Likewise.
32783 (vmovl_u16): Likewise.
32784 (vmovl_u32): Likewise.
32785 (vmovn_s16): Likewise.
32786 (vmovn_s32): Likewise.
32787 (vmovn_s64): Likewise.
32788 (vmovn_u16): Likewise.
32789 (vmovn_u32): Likewise.
32790 (vmovn_u64): Likewise.
32792 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32794 * config/aarch64/aarch64-simd.md (aarch64_<su>qxtn2<mode>_le):
32796 (aarch64_<su>qxtn2<mode>_be): Likewise.
32797 (aarch64_<su>qxtn2<mode>): Likewise.
32798 * config/aarch64/aarch64-simd-builtins.def (sqxtn2, uqxtn2):
32800 * config/aarch64/iterators.md (SAT_TRUNC): Define code_iterator.
32801 (su): Handle ss_truncate and us_truncate.
32802 * config/aarch64/arm_neon.h (vqmovn_high_s16): Reimplement using
32804 (vqmovn_high_s32): Likewise.
32805 (vqmovn_high_s64): Likewise.
32806 (vqmovn_high_u16): Likewise.
32807 (vqmovn_high_u32): Likewise.
32808 (vqmovn_high_u64): Likewise.
32810 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32812 * config/aarch64/aarch64-simd.md (aarch64_xtn2<mode>_le):
32814 (aarch64_xtn2<mode>_be): Likewise.
32815 (aarch64_xtn2<mode>): Likewise.
32816 * config/aarch64/aarch64-simd-builtins.def (xtn2): Define
32818 * config/aarch64/arm_neon.h (vmovn_high_s16): Reimplement using
32820 (vmovn_high_s32): Likewise.
32821 (vmovn_high_s64): Likewise.
32822 (vmovn_high_u16): Likewise.
32823 (vmovn_high_u32): Likewise.
32824 (vmovn_high_u64): Likewise.
32826 2021-01-13 Stafford Horne <shorne@gmail.com>
32828 * config/or1k/or1k.h (ASM_PREFERRED_EH_DATA_FORMAT): New macro.
32830 2021-01-13 Stafford Horne <shorne@gmail.com>
32832 * config/or1k/linux.h (TARGET_ASM_FILE_END): Define macro.
32834 2021-01-13 Stafford Horne <shorne@gmail.com>
32836 * config/or1k/or1k.h (TARGET_CPU_CPP_BUILTINS): Add builtin
32837 define for __or1k_hard_float__.
32839 2021-01-13 Stafford Horne <shorne@gmail.com>
32841 * config/or1k/or1k.h (NO_PROFILE_COUNTERS): Define as 1.
32842 (PROFILE_HOOK): Define to call _mcount.
32843 (FUNCTION_PROFILER): Change from abort to no-op.
32845 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32847 PR tree-optimization/96691
32848 * match.pd ((~X | C) ^ D -> (X | C) ^ (~D ^ C),
32849 (~X & C) ^ D -> (X & C) ^ (D ^ C)): New simplifications if
32850 (~D ^ C) or (D ^ C) can be simplified.
32852 2021-01-13 Richard Biener <rguenther@suse.de>
32854 PR tree-optimization/92645
32855 * match.pd (BIT_FIELD_REF to conversion): Delay canonicalization
32856 until after vector lowering.
32858 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32860 * config/aarch64/aarch64-sve.md (fnma<mode>4): Extend from SVE_FULL_I
32862 (@aarch64_pred_fnma<mode>, cond_fnma<mode>, *cond_fnma<mode>_2)
32863 (*cond_fnma<mode>_4, *cond_fnma<mode>_any): Likewise.
32865 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32867 * config/aarch64/aarch64-sve.md (fma<mode>4): Extend from SVE_FULL_I
32869 (@aarch64_pred_fma<mode>, cond_fma<mode>, *cond_fma<mode>_2)
32870 (*cond_fma<mode>_4, *cond_fma<mode>_any): Likewise.
32872 2021-01-13 Richard Biener <rguenther@suse.de>
32874 PR tree-optimization/92645
32875 * tree-vect-slp.c (vect_build_slp_tree_1): Relax supported
32876 BIT_FIELD_REF argument.
32877 (vect_build_slp_tree_2): Record the desired vector type
32878 on the external vector def.
32879 (vectorizable_slp_permutation): Handle required punning
32880 of existing vector defs.
32882 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32884 * rtl-ssa/accesses.h (def_lookup): Fix order of comparison results.
32886 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32888 * config/sh/sh.md (movsf_ie): Remove operands[2] test.
32890 2021-01-13 Samuel Thibault <samuel.thibault@ens-lyon.org>
32892 * config.gcc [$target == *-*-gnu*]: Enable
32893 'default_gnu_indirect_function'.
32895 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32898 * optabs.c (expand_vec_perm_const): Don't force v0 and v1 into
32899 registers before calling targetm.vectorize.vec_perm_const, only after
32901 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const): Handle
32902 two argument permutation when one operand is zero vector and only
32903 after that force operands into registers.
32904 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_1): New
32905 define_insn_and_split pattern.
32906 (*avx512bw_zero_extendv32qiv32hi2_1): Likewise.
32907 (*avx512f_zero_extendv16hiv16si2_1): Likewise.
32908 (*avx2_zero_extendv8hiv8si2_1): Likewise.
32909 (*avx512f_zero_extendv8siv8di2_1): Likewise.
32910 (*avx2_zero_extendv4siv4di2_1): Likewise.
32911 * config/mips/mips.c (mips_vectorize_vec_perm_const): Force operands
32913 * config/arm/arm.c (arm_vectorize_vec_perm_const): Likewise.
32914 * config/sparc/sparc.c (sparc_vectorize_vec_perm_const): Likewise.
32915 * config/ia64/ia64.c (ia64_vectorize_vec_perm_const): Likewise.
32916 * config/aarch64/aarch64.c (aarch64_vectorize_vec_perm_const): Likewise.
32917 * config/rs6000/rs6000.c (rs6000_vectorize_vec_perm_const): Likewise.
32918 * config/gcn/gcn.c (gcn_vectorize_vec_perm_const): Likewise. Use std::swap.
32920 2021-01-13 Martin Liska <mliska@suse.cz>
32922 PR tree-optimization/98455
32923 * gimple-if-to-switch.cc (condition_info::record_phi_mapping):
32924 Record also virtual PHIs.
32925 (pass_if_to_switch::execute): Return TODO_cleanup_cfg only
32928 2021-01-13 Jonathan Wakely <jwakely@redhat.com>
32930 * doc/invoke.texi (C++ Modules): Fix typos.
32932 2021-01-13 Richard Biener <rguenther@suse.de>
32934 PR tree-optimization/98640
32935 * tree-ssa-sccvn.c (visit_nary_op): Do not try to
32936 handle plus or minus from a truncated operand to be
32939 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32942 * config/i386/i386.md (*btr<mode>_1, *btr<mode>_2): New
32943 define_insn_and_split patterns.
32944 (splitter after *btr<mode>_2): New splitter.
32946 2021-01-13 Martin Liska <mliska@suse.cz>
32949 * cgraphunit.c (analyze_functions): Remove dead code.
32951 2021-01-13 Qian Jianhua <qianjh@cn.fujitsu.com>
32953 * config/aarch64/aarch64-cost-tables.h (a64fx_extra_costs): New.
32954 * config/aarch64/aarch64.c (a64fx_addrcost_table): New.
32955 (a64fx_regmove_cost, a64fx_vector_cost): New.
32956 (a64fx_tunings): Use the new added cost tables.
32958 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32961 * config/i386/predicates.md (pmovzx_parallel): New predicate.
32962 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): New
32963 define_insn_and_split pattern.
32964 (*sse4_1_zero_extendv4hiv4si2_3): Likewise.
32965 (*sse4_1_zero_extendv2siv2di2_3): Likewise.
32967 2021-01-13 Julian Brown <julian@codesourcery.com>
32969 * config/gcn/gcn.c (gcn_conditional_register_usage): Remove dead code
32970 to fix v0 register.
32972 2021-01-13 Julian Brown <julian@codesourcery.com>
32974 * config/gcn/gcn.c (gcn_md_reorg): Fix case where EXEC reg is live
32977 2021-01-13 Julian Brown <julian@codesourcery.com>
32979 * config/gcn/gcn-valu.md (recip<mode>2<exec>, recip<mode>2): Use unspec
32980 for reciprocal-approximation instructions.
32981 (div<mode>3): Use fused multiply-accumulate operations for reciprocal
32982 refinement and division result.
32983 * config/gcn/gcn.md (UNSPEC_RCP): New unspec constant.
32985 2021-01-13 Julian Brown <julian@codesourcery.com>
32987 * config/gcn/gcn-valu.md (subdf): Rename to...
32990 2021-01-12 Martin Liska <mliska@suse.cz>
32992 * gcov.c (source_info::debug): Fix printf format for 32-bit hosts.
32994 2021-01-12 Andrea Corallo <andrea.corallo@arm.com>
32996 * function-abi.h: Fix typo.
32998 2021-01-12 Christophe Lyon <christophe.lyon@linaro.org>
33002 * config/arm/arm.h (ARM_HAVE_NEON_V8QI_LDST): New macro.
33003 (ARM_HAVE_NEON_V16QI_LDST, ARM_HAVE_NEON_V4HI_LDST): Likewise.
33004 (ARM_HAVE_NEON_V8HI_LDST, ARM_HAVE_NEON_V2SI_LDST): Likewise.
33005 (ARM_HAVE_NEON_V4SI_LDST, ARM_HAVE_NEON_V4HF_LDST): Likewise.
33006 (ARM_HAVE_NEON_V8HF_LDST, ARM_HAVE_NEON_V4BF_LDST): Likewise.
33007 (ARM_HAVE_NEON_V8BF_LDST, ARM_HAVE_NEON_V2SF_LDST): Likewise.
33008 (ARM_HAVE_NEON_V4SF_LDST, ARM_HAVE_NEON_DI_LDST): Likewise.
33009 (ARM_HAVE_NEON_V2DI_LDST): Likewise.
33010 (ARM_HAVE_V8QI_LDST, ARM_HAVE_V16QI_LDST): Likewise.
33011 (ARM_HAVE_V4HI_LDST, ARM_HAVE_V8HI_LDST): Likewise.
33012 (ARM_HAVE_V2SI_LDST, ARM_HAVE_V4SI_LDST, ARM_HAVE_V4HF_LDST): Likewise.
33013 (ARM_HAVE_V8HF_LDST, ARM_HAVE_V4BF_LDST, ARM_HAVE_V8BF_LDST): Likewise.
33014 (ARM_HAVE_V2SF_LDST, ARM_HAVE_V4SF_LDST, ARM_HAVE_DI_LDST): Likewise.
33015 (ARM_HAVE_V2DI_LDST): Likewise.
33016 * config/arm/mve.md (*movmisalign<mode>_mve_store): New pattern.
33017 (*movmisalign<mode>_mve_load): New pattern.
33018 * config/arm/neon.md (movmisalign<mode>): Move to ...
33019 * config/arm/vec-common.md: ... here.
33021 2021-01-12 Vladimir N. Makarov <vmakarov@redhat.com>
33024 * lra-eliminations.c (eliminate_regs_in_insn): Add transformation
33025 of pattern 'plus (plus (hard reg, const), pseudo)'.
33027 2021-01-12 Richard Biener <rguenther@suse.de>
33029 PR tree-optimization/98550
33030 * tree-vect-slp.c (vect_record_max_nunits): Check whether
33031 the group size is a multiple of the vector element count.
33032 (vect_build_slp_tree_1): When we need to fail because
33033 the vector type choosen causes unrolling do so lazily
33034 without affecting matches only at the end to guide group splitting.
33036 2021-01-12 Martin Liska <mliska@suse.cz>
33039 * optc-save-gen.awk: Compare also n_target_save vars with
33042 2021-01-12 Martin Liska <mliska@suse.cz>
33044 * gcov.c (source_info::debug): New.
33045 (print_usage): Add --debug (-D) option.
33046 (process_args): Likewise.
33047 (generate_results): Call src->debug after
33048 accumulate_line_counts.
33049 (read_graph_file): Properly assign id for EXIT_BLOCK.
33050 * profile.c (branch_prob): Dump function body before it is
33053 2021-01-12 Jakub Jelinek <jakub@redhat.com>
33055 PR tree-optimization/98629
33056 * tree-ssa-math-opts.c (arith_overflow_check_p): Don't update use_stmt
33057 unless returning non-zero.
33059 2021-01-12 Jakub Jelinek <jakub@redhat.com>
33061 PR tree-optimization/95731
33062 * tree-ssa-reassoc.c (optimize_range_tests_cmp_bitwise): Also optimize
33063 x < 0 && y < 0 && z < 0 into (x | y | z) < 0 for signed x, y, z.
33064 (optimize_range_tests): Call optimize_range_tests_cmp_bitwise
33065 only after optimize_range_tests_var_bound.
33067 2021-01-12 Jakub Jelinek <jakub@redhat.com>
33069 * configure.ac: Ensure c/Make-lang.in comes first in @all_lang_makefrags@.
33070 * configure: Regenerated.
33072 2021-01-12 liuhongt <hongtao.liu@intel.com>
33075 * config/i386/i386-builtins.h (BUILTIN_DESC_SWAP_OPERANDS):
33077 * config/i386/i386-expand.c (ix86_expand_sse_comi): Delete
33080 2021-01-12 Alexandre Oliva <oliva@adacore.com>
33082 * ssa-iterators.h (end_imm_use_stmt_traverse): Forward
33084 (auto_end_imm_use_stmt_traverse): New struct.
33085 (FOR_EACH_IMM_USE_STMT): Use it.
33086 (BREAK_FROM_IMM_USE_STMT, RETURN_FROM_IMM_USE_STMT): Remove,
33088 * gimple-ssa-strength-reduction.c: ... here, ...
33089 * graphite-scop-detection.c: ... here, ...
33090 * ipa-modref.c, ipa-pure-const.c, ipa-sra.c: ... here, ...
33091 * tree-predcom.c, tree-ssa-ccp.c: ... here, ...
33092 * tree-ssa-dce.c, tree-ssa-dse.c: ... here, ...
33093 * tree-ssa-loop-ivopts.c, tree-ssa-math-opts.c: ... here, ...
33094 * tree-ssa-phiprop.c, tree-ssa.c: ... here, ...
33095 * tree-vect-slp.c: ... and here, ...
33096 * doc/tree-ssa.texi: ... and the example here.
33098 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33100 * config/aarch64/aarch64-sve.md (sdiv_pow2<mode>3): Extend from
33101 SVE_FULL_I to SVE_I. Generate an UNSPEC_PRED_X.
33102 (*sdiv_pow2<mode>3): New pattern.
33103 (@cond_<sve_int_op><mode>): Extend from SVE_FULL_I to SVE_I.
33104 Wrap the ASRD in an UNSPEC_PRED_X.
33105 (*cond_<sve_int_op><mode>_2): Likewise. Replace the UNSPEC_PRED_X
33106 predicate with a constant PTRUE, if it isn't already.
33107 (*cond_<sve_int_op><mode>_z): Replace with...
33108 (*cond_<sve_int_op><mode>_any): ...this new pattern.
33110 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33112 * config/aarch64/aarch64-sve.md (*cond_bic<mode>_2): Extend from
33113 SVE_FULL_I to SVE_I.
33114 (*cond_bic<mode>_any): Likewise.
33116 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33118 * config/aarch64/aarch64-sve.md (<su>mul<mode>3_highpart)
33119 (@aarch64_pred_<MUL_HIGHPART:optab><mode>): Extend from SVE_FULL_I
33122 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33124 * config/aarch64/aarch64-sve.md (<su>abd<mode>_3): Extend from
33125 SVE_FULL_I to SVE_I.
33126 (*aarch64_cond_<su>abd<mode>_2): Likewise.
33127 (*aarch64_cond_<su>abd<mode>_any): Likewise.
33128 (@aarch64_pred_<su>abd<mode>): Likewise. Use UNSPEC_PRED_X
33129 for the max and min but not for the minus.
33130 (*aarch64_cond_<su>abd<mode>_3): New pattern.
33132 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33134 * config/aarch64/iterators.md (SVE_24I): New iterator.
33135 * config/aarch64/aarch64-sve.md (*aarch64_adr<mode>_shift): Extend from
33136 SVE_FULL_SDI to SVE_24I. Use containers rather than elements.
33138 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33140 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_BINARY:optab><mode>)
33141 (*cond_<SVE_INT_BINARY:optab><mode>_2): Extend from SVE_FULL_I
33143 (*cond_<SVE_INT_BINARY:optab><mode>_3): Likewise.
33144 (*cond_<SVE_INT_BINARY:optab><mode>_any): Likewise.
33145 (*cond_<SVE_INT_BINARY:optab><mode>_2_const): Likewise.
33146 (*cond_<SVE_INT_BINARY:optab><mode>_any_const): Likewise.
33148 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33150 * config/aarch64/aarch64-sve.md (<SVE_INT_BINARY_IMM:optab><mode>3)
33151 (@aarch64_pred_<SVE_INT_BINARY_IMM:optab><mode>)
33152 (*post_ra_<SVE_INT_BINARY_IMM:optab><mode>3): Extend from SVE_FULL_I
33155 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33157 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3)
33158 (v<ASHIFT:optab><mode>3, @aarch64_pred_<optab><mode>)
33159 (*post_ra_v<ASHIFT:optab><mode>3): Extend from SVE_FULL_I to SVE_I.
33161 2021-01-11 Martin Liska <mliska@suse.cz>
33164 * symtab-clones.h (clone_info::release): Release
33165 symtab::m_clones with ggc_delete as it's a GGC memory.
33167 2021-01-11 Matthias Klose <doko@ubuntu.com>
33169 * Makefile.in (LINK_PROGRESS): Show the link target.
33171 2021-01-11 Richard Biener <rguenther@suse.de>
33173 PR tree-optimization/91403
33174 * tree-vect-data-refs.c (vect_analyze_group_access_1): Cap
33175 single-element interleaving group size at 4096 elements.
33177 2021-01-11 Richard Biener <rguenther@suse.de>
33179 PR tree-optimization/98526
33180 * tree-vect-loop.c (vect_model_reduction_cost): Remove costing
33181 of the actual reduction op for the regular case.
33182 (vectorizable_reduction): Cost the stmts
33183 vect_transform_reduction produces here.
33185 2021-01-11 Andreas Krebbel <krebbel@linux.ibm.com>
33187 * tree-ssa-forwprop.c (simplify_vector_constructor): For
33188 big-endian, use UNPACK[_FLOAT]_HI.
33190 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33192 * tree-vect-slp-patterns.c (class complex_pattern,
33193 class complex_add_pattern): Add parameters to matches.
33194 (complex_add_pattern::build): Free memory.
33195 (complex_add_pattern::matches): Move validation end of match.
33196 (complex_add_pattern::recognize): Likewise.
33198 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33200 * tree-vect-slp-patterns.c (linear_loads_p): Fix externals.
33202 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33204 * tree-vect-slp-patterns.c (is_linear_load_p): Fix ambiguity.
33206 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33208 PR tree-optimization/95867
33209 * tree-ssa-math-opts.h: New header.
33210 * tree-ssa-math-opts.c: Include tree-ssa-math-opts.h.
33211 (powi_as_mults): No longer static. Use build_one_cst instead of
33212 build_real. Formatting fix.
33213 * tree-ssa-reassoc.c: Include tree-ssa-math-opts.h.
33214 (attempt_builtin_powi): Handle multiplication reassociation without
33215 powi_fndecl using powi_as_mults.
33216 (reassociate_bb): For integral types don't require
33217 -funsafe-math-optimizations to call attempt_builtin_powi.
33219 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33221 PR tree-optimization/95852
33222 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): Change
33223 mul_stmts parameter type to vec<gimple *> &. Before cond_stmt
33224 allow in the bb any of the stmts in that vector, div_stmt and
33225 up to 3 cast stmts.
33226 (arith_cast_equal_p): New function.
33227 (arith_overflow_check_p): Add cast_stmt argument, handle signed
33228 multiply overflow checks.
33229 (match_arith_overflow): Adjust caller. Handle signed multiply
33232 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33234 PR tree-optimization/95852
33235 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): New function.
33236 (uaddsub_overflow_check_p): Renamed to ...
33237 (arith_overflow_check_p): ... this. Handle also multiplication
33238 with overflow check.
33239 (match_uaddsub_overflow): Renamed to ...
33240 (match_arith_overflow): ... this. Add cfg_changed argument. Handle
33241 also multiplication with overflow check. Adjust function comment.
33242 (math_opts_dom_walker::after_dom_children): Adjust callers. Call
33243 match_arith_overflow also for MULT_EXPR.
33245 2021-01-11 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33247 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
33248 __builtin_convertvector.
33249 (vmovl_s16): Likewise.
33250 (vmovl_s32): Likewise.
33251 (vmovl_u8): Likewise.
33252 (vmovl_u16): Likewise.
33253 (vmovl_u32): Likewise.
33254 (vmovn_s16): Likewise.
33255 (vmovn_s32): Likewise.
33256 (vmovn_s64): Likewise.
33257 (vmovn_u16): Likewise.
33258 (vmovn_u32): Likewise.
33259 (vmovn_u64): Likewise.
33261 2021-01-11 Martin Liska <mliska@suse.cz>
33263 * gimple-if-to-switch.cc (struct condition_info): Use auto_var.
33264 (if_chain::is_beneficial): Delete clusters
33265 (find_conditions): Make second argument of conditions_in_bbs a
33266 pointer so that we control over it's lifetime.
33267 (pass_if_to_switch::execute): Delete them.
33269 2021-01-11 Kewen Lin <linkw@linux.ibm.com>
33271 * ira.c (move_unallocated_pseudos): Check other_reg and skip if
33274 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33276 * config/vax/vax.md (cc): Remove mode attribute.
33277 (subst_<cc>, subst_f<cc>): Rename to...
33278 (subst_<mode>, subst_f<VAXccnz:mode>): ... these respectively.
33279 (*cbranch<VAXint:mode>4_<VAXcc:mode>): Update for `cc' removal.
33280 (*cbranch<VAXfp:mode>4_<VAXccnz:mode>): Likewise.
33281 (*branch_<mode>, *branch_<mode>_reversed): Likewise.
33283 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33285 * config/vax/vax.md (subst_f<cc>): Add mode to operands and
33286 `const_double_zero'.
33288 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33290 * config/pdp11/pdp11.md (PDPfp): New mode iterator.
33291 (fcc_cc, fcc_ccnz): Use it. Add mode to `const_double_zero' and
33294 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33296 * genemit.c (gen_exp) <CONST_DOUBLE>: Handle `const_double_zero'
33298 * read-rtl.c (rtx_reader::read_rtx_code): Handle machine mode
33299 with `const_double_zero'.
33300 * doc/rtl.texi (Constant Expression Types): Document it.
33302 2021-01-09 Jakub Jelinek <jakub@redhat.com>
33305 * tree-cfg.c (verify_gimple_assign_binary): Allow lhs of
33306 POINTER_DIFF_EXPR to be any integral type.
33308 2021-01-09 Jakub Jelinek <jakub@redhat.com>
33310 PR rtl-optimization/98603
33311 * function.c (instantiate_virtual_regs_in_insn): For asm goto
33312 with impossible constraints, drop all SETs, CLOBBERs, drop PARALLEL
33313 if any, set ASM_OPERANDS mode to VOIDmode and change
33314 ASM_OPERANDS_OUTPUT_CONSTRAINT and ASM_OPERANDS_OUTPUT_IDX.
33316 2021-01-09 Alexandre Oliva <oliva@gnu.org>
33319 * final.c (notice_source_line): Narrow down the condition to
33320 skip a line-0 marker.
33322 2021-01-08 Sergei Trofimovich <siarheit@google.com>
33324 * ipa-modref.c (merge_call_side_effects): Fix
33325 linebreak split by reordering two print calls.
33327 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
33329 * config/s390/vector.md (*tf_to_fprx2_0): Rename from
33330 "*mov_tf_to_fprx2_0" for consistency, fix constraint.
33331 (*tf_to_fprx2_1): Rename from "*mov_tf_to_fprx2_1" for
33332 consistency, fix constraint.
33334 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
33336 * config/s390/s390-c.c (s390_def_or_undef_macro): Accept
33337 callables instead of mask values.
33338 (struct target_flag_set_p): New predicate.
33339 (s390_cpu_cpp_builtins_internal): Define or undefine
33340 __LONG_DOUBLE_VX__ macro.
33342 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
33345 * config/i386/i386.c (x86_function_profiler): Use R10 and R11
33346 to call mcount in large model with PIC for NO_PROFILE_COUNTERS
33349 2021-01-08 Richard Biener <rguenther@suse.de>
33351 * tree-ssa-sccvn.c (pass_fre::execute): Reset the SCEV hash table.
33353 2021-01-08 Richard Biener <rguenther@suse.de>
33355 * tree-vect-slp.c (scalar_stmts_to_slp_tree_map_t): Fix.
33356 (vect_build_slp_tree): On cache hit release the matched
33357 scalar stmts vector.
33358 * tree-vect-stmts.c (vectorizable_store): Properly free
33359 vec_oprnds before possibly gathering them again.
33361 2021-01-08 Richard Biener <rguenther@suse.de>
33363 PR tree-optimization/98544
33364 * tree-vect-slp.c (vect_optimize_slp): Always materialize
33365 permutes at a permute node.
33367 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
33370 * config/i386/i386.c (x86_function_profiler): Use R10 to call
33371 mcount in large model. Sorry for large model with PIC.
33373 2021-01-08 Jakub Jelinek <jakub@redhat.com>
33376 * config/i386/i386.opt (ix86_cmodel, ix86_incoming_stack_boundary_arg,
33377 ix86_pmode, ix86_preferred_stack_boundary_arg, ix86_regparm,
33378 ix86_veclibabi_type): Remove x_ prefix, use TargetVariable instead of
33379 TargetSave and initialize for variables with enum types.
33380 (mfentry, mstack-protector-guard-reg=, mstack-protector-guard-offset=,
33381 mstack-protector-guard-symbol=): Add Save.
33382 * config/i386/i386-options.c (ix86_function_specific_save,
33383 ix86_function_specific_restore): Don't save or restore x_ix86_cmodel,
33384 x_ix86_incoming_stack_boundary_arg, x_ix86_pmode,
33385 x_ix86_preferred_stack_boundary_arg, x_ix86_regparm,
33386 x_ix86_veclibabi_type.
33388 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
33390 * config/aarch64/aarch64-sve.md (*cnot<mode>): Extend from
33391 SVE_FULL_I to SVE_I.
33392 (*cond_cnot<mode>_2, *cond_cnot<mode>_any): Likewise.
33394 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
33396 * config/aarch64/aarch64-sve.md (*cond_uxt<mode>_2): Extend from
33397 SVE_FULL_I to SVE_I.
33398 (*cond_uxt<mode>_any): Likewise.
33400 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33402 * config/aarch64/iterators.md (Vwhalf): New iterator.
33403 * config/aarch64/aarch64-simd.md (aarch64_<sur>adalp<mode>_3):
33405 (aarch64_<sur>adalp<mode>): ... This. Make more
33407 (<sur>sadv16qi): Adjust callsite of the above.
33408 * config/aarch64/aarch64-simd-builtins.def (sadalp, uadalp): New
33410 * config/aarch64/arm_neon.h (vpadal_s8): Reimplement using
33412 (vpadal_s16): Likewise.
33413 (vpadal_u8): Likewise.
33414 (vpadal_u16): Likewise.
33415 (vpadalq_s8): Likewise.
33416 (vpadalq_s16): Likewise.
33417 (vpadalq_s32): Likewise.
33418 (vpadalq_u8): Likewise.
33419 (vpadalq_u16): Likewise.
33420 (vpadalq_u32): Likewise.
33422 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33424 * config/aarch64/aarch64-simd.md (aarch64_<su>abd<mode>_3):
33426 (aarch64_<su>abd<mode>): ... This.
33427 (<sur>sadv16qi): Adjust callsite of the above.
33428 * config/aarch64/aarch64-simd-builtins.def (sabd, uabd): Define
33430 * config/aarch64/arm_neon.h (vabd_s8): Reimplement using
33432 (vabd_s16): Likewise.
33433 (vabd_s32): Likewise.
33434 (vabd_u8): Likewise.
33435 (vabd_u16): Likewise.
33436 (vabd_u32): Likewise.
33437 (vabdq_s8): Likewise.
33438 (vabdq_s16): Likewise.
33439 (vabdq_s32): Likewise.
33440 (vabdq_u8): Likewise.
33441 (vabdq_u16): Likewise.
33442 (vabdq_u32): Likewise.
33444 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33446 * config/aarch64/aarch64-simd-builtins.def (saba, uaba): Define
33448 * config/aarch64/arm_neon.h (vaba_s8): Implement using builtin.
33449 (vaba_s16): Likewise.
33450 (vaba_s32): Likewise.
33451 (vaba_u8): Likewise.
33452 (vaba_u16): Likewise.
33453 (vaba_u32): Likewise.
33454 (vabaq_s8): Likewise.
33455 (vabaq_s16): Likewise.
33456 (vabaq_s32): Likewise.
33457 (vabaq_u8): Likewise.
33458 (vabaq_u16): Likewise.
33459 (vabaq_u32): Likewise.
33461 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33463 * config/aarch64/aarch64-simd.md (aba<mode>_3): Rename to...
33464 (aarch64_<su>aba<mode>): ... This. Handle uaba as well.
33465 Change RTL pattern to match.
33467 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
33469 * common/config/riscv/riscv-common.c (riscv_current_subset_list): New.
33470 * config/riscv/riscv-c.c (riscv-subset.h): New.
33471 (INCLUDE_STRING): Define.
33472 (riscv_cpu_cpp_builtins): Add new style architecture extension
33474 * config/riscv/riscv-subset.h (riscv_subset_list::begin): New.
33475 (riscv_subset_list::end): New.
33476 (riscv_current_subset_list): New.
33478 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
33480 * common/config/riscv/riscv-common.c (RISCV_DONT_CARE_VERSION):
33481 Move to riscv-subset.h.
33482 (struct riscv_subset_t): Ditto.
33483 (class riscv_subset_list): Ditto.
33484 * config/riscv/riscv-subset.h (RISCV_DONT_CARE_VERSION): Move
33485 from riscv-common.c.
33486 (struct riscv_subset_t): Ditto.
33487 (class riscv_subset_list): Ditto.
33488 * config/riscv/t-riscv ($(common_out_file)): Add file
33491 2021-01-07 Jakub Jelinek <jakub@redhat.com>
33494 * config/i386/i386.md (*bmi_blsi_<mode>_cmp, *bmi_blsi_<mode>_ccno):
33495 New define_insn patterns.
33497 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33499 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_UNARY:optab><mode>)
33500 (*cond_<SVE_INT_UNARY:optab><mode>_2): Extend from SVE_FULL_I to SVE_I.
33501 (*cond_<SVE_INT_UNARY:optab><mode>_any): Likewise.
33503 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33505 PR tree-optimization/98560
33506 * internal-fn.def (IFN_VCONDU, IFN_VCONDEQ): Use type vec_cond.
33507 * internal-fn.c (vec_cond_mask_direct): Get the data mode from
33509 (vec_cond_direct): Likewise argument 2.
33510 (vec_condu_direct, vec_condeq_direct): Delete.
33511 (expand_vect_cond_optab_fn): Rename to...
33512 (expand_vec_cond_optab_fn): ...this, replacing old macro.
33513 (expand_vec_condu_optab_fn, expand_vec_condeq_optab_fn): Delete.
33514 (expand_vect_cond_mask_optab_fn): Rename to...
33515 (expand_vec_cond_mask_optab_fn): ...this, replacing old macro.
33516 (direct_vec_cond_mask_optab_supported_p): Treat the optab as a
33518 (direct_vec_cond_optab_supported_p): Likewise.
33519 (direct_vec_condu_optab_supported_p): Delete.
33520 (direct_vec_condeq_optab_supported_p): Delete.
33521 * gimple-isel.cc: Include internal-fn.h.
33522 (gimple_expand_vec_cond_expr): Check that IFN_VCONDEQ is supported
33525 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33527 PR tree-optimization/98560
33528 * gimple-isel.cc (gimple_expand_vec_cond_expr): If we fail to use
33529 IFN_VCOND{,U,EQ}, fall back on IFN_VCOND_MASK.
33531 2021-01-07 Uroš Bizjak <ubizjak@gmail.com>
33533 * config/i386/i386.md (insn): Merge from plusminus_insn, shift_insn,
33534 rotate_insn and optab code attributes.
33535 Update all uses to merged code attribute.
33536 * config/i386/sse.md: Update all uses to merged code attribute.
33537 * config/i386/mmx.md: Update all uses to merged code attribute.
33539 2021-01-07 Jakub Jelinek <jakub@redhat.com>
33541 PR tree-optimization/98568
33542 * gimple-ssa-store-merging.c (bswap_view_convert): New function.
33543 (bswap_replace): Use it.
33545 2021-01-06 Vladimir N. Makarov <vmakarov@redhat.com>
33547 PR rtl-optimization/97978
33548 * lra-int.h (lra_hard_reg_split_p): New external.
33549 * lra.c (lra_hard_reg_split_p): New global.
33550 (lra): Set up lra_hard_reg_split_p after splitting a hard reg.
33551 * lra-assigns.c (lra_assign): Don't check allocation correctness
33552 after hard reg splitting.
33554 2021-01-06 Martin Sebor <msebor@redhat.com>
33557 * builtins.c (new_delete_mismatch_p): New overload.
33558 (new_delete_mismatch_p (tree, tree)): Call it.
33560 2021-01-06 Alexandre Oliva <oliva@adacore.com>
33562 * Makefile.in (T_GLIMITS_H): New.
33563 (stmp-int-hdrs): Depend on it, use it.
33564 * config/t-vxworks (T_GLIMITS_H): Override it.
33565 (vxw-glimits.h): New.
33567 2021-01-06 Richard Biener <rguenther@suse.de>
33569 PR tree-optimization/98513
33570 * value-range.cc (intersect_ranges): Compare the upper bounds
33571 for the expected relation.
33573 2021-01-06 Gerald Pfeifer <gerald@pfeifer.com>
33576 2020-12-28 Gerald Pfeifer <gerald@pfeifer.com>
33578 * doc/standards.texi (HSAIL): Remove section.
33580 2021-01-05 Samuel Thibault <samuel.thibault@ens-lyon.org>
33582 * configure: Re-generate.
33584 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33586 * doc/invoke.texi (-std=c++20): Adjust for the publication of
33587 ISO 14882:2020 standard.
33588 * doc/standards.texi: Likewise.
33590 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33592 PR tree-optimization/94802
33593 * expr.h (maybe_optimize_sub_cmp_0): Declare.
33594 * expr.c: Include tree-pretty-print.h and flags.h.
33595 (maybe_optimize_sub_cmp_0): New function.
33596 (do_store_flag): Use it.
33597 * cfgexpand.c (expand_gimple_cond): Likewise.
33599 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33601 * mux-utils.h (pointer_mux::m_ptr): Tweak description of contents.
33602 * rtlanal.c (simple_regno_set): Tweak description to clarify the
33605 2021-01-05 Richard Biener <rguenther@suse.de>
33607 PR tree-optimization/98516
33608 * tree-vect-slp.c (vect_optimize_slp): Permute the incoming
33609 lanes when materializing on a VEC_PERM node.
33610 (vectorizable_slp_permutation): Dump the permute properly.
33612 2021-01-05 Richard Biener <rguenther@suse.de>
33614 * tree-vect-slp.c (vect_slp_region): Move debug counter
33615 to cover individual subgraphs.
33617 2021-01-05 Richard Biener <rguenther@suse.de>
33619 PR tree-optimization/98428
33620 * tree-vect-slp.c (vect_build_slp_tree_1): Properly reject
33621 vector lane extracts for loop vectorization.
33623 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33625 PR tree-optimization/98514
33626 * tree-ssa-reassoc.c (bb_rank): Change type from long * to
33628 (operand_rank): Change type from hash_map<tree, long> to
33629 hash_map<tree, int64_t>.
33630 (phi_rank): Change return type from long to int64_t.
33631 (loop_carried_phi): Change block_rank variable type from long to
33633 (propagate_rank): Change return type, rank parameter type and
33634 op_rank variable type from long to int64_t.
33635 (find_operand_rank): Change return type from long to int64_t
33636 and change slot variable type from long * to int64_t *.
33637 (insert_operand_rank): Change rank parameter type from long to
33639 (get_rank): Change return type and rank variable type from long to
33640 int64_t. Use PRId64 instead of ld to print the rank.
33641 (init_reassoc): Change rank variable type from long to int64_t
33642 and adjust correspondingly bb_rank and operand_rank initialization.
33644 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33646 PR tree-optimization/96928
33647 * tree-ssa-phiopt.c (xor_replacement): New function.
33648 (tree_ssa_phiopt_worker): Call it.
33650 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33652 PR tree-optimization/96930
33653 * match.pd ((A / (1 << B)) -> (A >> B)): If A is extended
33654 from narrower value which has the same type as 1 << B, perform
33655 the right shift on the narrower value followed by extension.
33657 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33659 PR tree-optimization/96239
33660 * gimple-ssa-store-merging.c (maybe_optimize_vector_constructor): New
33662 (get_status_for_store_merging): Don't return BB_INVALID for blocks
33663 with potential bswap optimizable CONSTRUCTORs.
33664 (pass_store_merging::execute): Optimize vector CONSTRUCTORs with bswap
33667 2021-01-05 Richard Biener <rguenther@suse.de>
33669 PR tree-optimization/98381
33670 * tree.c (vector_element_bits): Properly compute bool vector
33672 * tree-vect-loop.c (vectorizable_live_operation): Properly
33673 compute the last lane bit offset.
33675 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
33678 * config/i386/sse.md (sse_cvtps2pi): Redefine as define_insn_and_split.
33679 Clear the top 64 bytes of the input XMM register.
33680 (sse_cvttps2pi): Ditto.
33682 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
33685 * config/i386/xopintrin.h (_mm256_cmov_si256): New.
33687 2021-01-05 H.J. Lu <hjl.tools@gmail.com>
33690 * config/i386/xmmintrin.h (_mm_extract_pi16): Cast to unsigned
33693 2021-01-05 Claudiu Zissulescu <claziss@synopsys.com>
33695 * config/arc/arc.md (maddsidi4_split): Use ACC_REG_FIRST.
33696 (umaddsidi4_split): Likewise.
33698 2021-01-05 liuhongt <hongtao.liu@intel.com>
33701 * config/i386/sse.md (*sse2_pmovskb_zexthisi): New
33702 define_insn_and_split for zero_extend of subreg HI of pmovskb
33704 (*sse2_pmovskb_zexthisi): Add new combine splitters for
33705 zero_extend of not of subreg HI of pmovskb result.
33707 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33710 * explow.c (convert_memory_address_addr_space_1): Handle UNSPECs
33712 * config/aarch64/aarch64.c (aarch64_expand_mov_immediate): Use
33713 convert_memory_address to convert symbolic immediates to ptr_mode
33714 before forcing them to memory.
33716 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33718 PR rtl-optimization/97144
33719 * recog.c (constrain_operands): Initialize matching_operand
33720 for each alternative, rather than only doing it once.
33722 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33724 PR rtl-optimization/98403
33725 * rtl-ssa/changes.cc (function_info::finalize_new_accesses): Explain
33726 why we don't remove call clobbers.
33727 (function_info::apply_changes_to_insn): Don't attempt to add
33728 call clobbers here.
33730 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33732 PR tree-optimization/98371
33733 * tree-vect-loop.c (vect_reanalyze_as_main_loop): New function.
33734 (vect_analyze_loop): If an epilogue loop appears to be cheaper
33735 than the main loop, re-analyze it as a main loop before adopting
33738 2021-01-05 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
33741 * configure.ac (NETLIBS): Determine using AX_LIB_SOCKET_NSL.
33742 * aclocal.m4, configure: Regenerate.
33743 * Makefile.in (NETLIBS): Define.
33744 (BACKEND): Remove $(CODYLIB).
33746 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33748 PR rtl-optimization/98334
33749 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
33750 Optimize (X - 1) * Y + Y to X * Y or (X + 1) * Y - Y to X * Y.
33752 2021-01-05 Bernd Edlinger <bernd.edlinger@hotmail.de>
33754 * tree-inline.c (expand_call_inline): Restore input_location.
33755 Return result from recursive call.
33757 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33759 PR tree-optimization/95401
33760 * config/aarch64/aarch64-sve-builtins.cc
33761 (gimple_folder::load_store_cookie): Use bits rather than bytes
33762 for the alignment argument to IFN_MASK_LOAD and IFN_MASK_STORE.
33763 * gimple-fold.c (gimple_fold_mask_load_store_mem_ref): Likewise.
33764 * tree-vect-stmts.c (vectorizable_store): Likewise.
33765 (vectorizable_load): Likewise.
33767 2021-01-04 Richard Biener <rguenther@suse.de>
33769 PR tree-optimization/98308
33770 * tree-vect-stmts.c (vectorizable_load): Set invariant mask
33773 2021-01-04 Jakub Jelinek <jakub@redhat.com>
33775 PR tree-optimization/95771
33776 * tree-ssa-loop-niter.c (number_of_iterations_popcount): Handle types
33777 with precision smaller than int's precision and types with precision
33778 twice as large as long long. Formatting fixes.
33780 2021-01-04 Richard Biener <rguenther@suse.de>
33782 PR tree-optimization/98464
33783 * tree-ssa-sccvn.c (vn_valueize_for_srt): Rename from ...
33784 (vn_valueize_wrapper): ... this. Temporarily adjust vn_context_bb.
33785 (process_bb): Adjust.
33787 2021-01-04 Matthew Malcomson <matthew.malcomson@arm.com>
33790 * doc/invoke.texi (-fsanitize=address): Fix wording describing
33791 clash with -fsanitize=hwaddress.
33793 2021-01-04 Richard Biener <rguenther@suse.de>
33795 PR tree-optimization/98282
33796 * tree-ssa-sccvn.c (vn_get_stmt_kind): Classify tcc_reference on
33797 invariants as VN_NARY.
33799 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33802 * config/aarch64/aarch64-simd.md (aarch64_combine<mode>): Accept
33803 aarch64_simd_reg_or_zero for operand 2. Use the combinez patterns
33804 to handle zero operands.
33806 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33808 * config/aarch64/aarch64.c (offset_6bit_signed_scaled_p): New function.
33809 (offset_6bit_unsigned_scaled_p): Fix typo in comment.
33810 (aarch64_sve_prefetch_operand_p): Accept MUL VLs in the range
33813 2021-01-04 Richard Biener <rguenther@suse.de>
33815 PR tree-optimization/98393
33816 * tree-vect-slp.c (vect_build_slp_tree): Properly zero matches
33817 when hitting the limit.
33819 2021-01-04 Richard Biener <rguenther@suse.de>
33821 PR tree-optimization/98291
33822 * tree-vect-loop.c (vectorizable_reduction): Bypass
33823 associativity check for SLP reductions with VF 1.
33825 2021-01-04 Jakub Jelinek <jakub@redhat.com>
33827 PR tree-optimization/96782
33828 * match.pd (x == ~x -> false, x != ~x -> true): New simplifications.
33830 2021-01-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
33832 * collect-utils.c (collect_execute): Check dumppfx.
33833 * collect2.c (maybe_run_lto_and_relink, do_link): Pass atsuffix
33834 to collect_execute.
33835 (do_link): Add new parameter atsuffix.
33836 (main): Handle -dumpdir option. Skip one argument for
33837 -o, -isystem and -B options.
33838 * gcc.c (make_at_file): New helper function.
33839 (close_at_file): Use it.
33841 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33843 * config/darwin.h (MIN_LD64_NO_COAL_SECTS): Adjust.
33844 Amend handling for LD64_VERSION fallback defaults.
33846 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33848 * config.gcc: Compute default version information
33849 from the configured target. Likewise defaults for
33851 * config/darwin10.h: Removed.
33852 * config/darwin12.h: Removed.
33853 * config/darwin9.h: Removed.
33854 * config/rs6000/darwin8.h: Removed.
33856 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33858 * config/darwin9.h (ASM_OUTPUT_ALIGNED_COMMON): Delete.
33860 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33862 * config/darwin9.h (STACK_CHECK_STATIC_BUILTIN): Move from here..
33863 * config/darwin.h (STACK_CHECK_STATIC_BUILTIN): .. to here.
33865 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33867 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move from
33869 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): ... to here.
33871 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33873 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move the spec
33874 for the Darwin10 unwinder stub from here ...
33875 * config/darwin.h (LINK_COMMAND_SPEC_A): ... to here.
33877 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33879 * config/darwin.h (DSYMUTIL_SPEC): Default to DWARF
33880 (ASM_DEBUG_SPEC):Only define if the assembler supports
33882 (PREFERRED_DEBUGGING_TYPE): Default to DWARF.
33883 (DARWIN_PREFER_DWARF): Define.
33884 * config/darwin9.h (PREFERRED_DEBUGGING_TYPE): Remove.
33885 (DARWIN_PREFER_DWARF): Likewise
33886 (DSYMUTIL_SPEC): Likewise.
33887 (COLLECT_RUN_DSYMUTIL): Likewise.
33888 (ASM_DEBUG_SPEC): Likewise.
33889 (ASM_DEBUG_OPTION_SPEC): Likewise.
33891 2021-01-02 Jan Hubicka <jh@suse.cz>
33893 * cfg.c (free_block): ggc_free bb.
33895 2021-01-01 Jakub Jelinek <jakub@redhat.com>
33897 * gcc.c (process_command): Update copyright notice dates.
33898 * gcov-dump.c (print_version): Ditto.
33899 * gcov.c (print_version): Ditto.
33900 * gcov-tool.c (print_version): Ditto.
33901 * gengtype.c (create_file): Ditto.
33902 * doc/cpp.texi: Bump @copying's copyright year.
33903 * doc/cppinternals.texi: Ditto.
33904 * doc/gcc.texi: Ditto.
33905 * doc/gccint.texi: Ditto.
33906 * doc/gcov.texi: Ditto.
33907 * doc/install.texi: Ditto.
33908 * doc/invoke.texi: Ditto.
33910 2021-01-01 Jakub Jelinek <jakub@redhat.com>
33912 * ChangeLog-2020: Rotate ChangeLog. New file.
33915 Copyright (C) 2021 Free Software Foundation, Inc.
33917 Copying and distribution of this file, with or without modification,
33918 are permitted in any medium without royalty provided the copyright
33919 notice and this notice are preserved.