1 2021-12-03 Martin Liska <mliska@suse.cz>
2 Peter Bergner <bergner@linux.ibm.com>
5 * config/rs6000/rs6000.c (rs6000_option_override_internal): Move the
6 disabling of shrink-wrapping when using -mrop-protect from here...
7 (rs6000_override_options_after_change): ...to here.
9 2021-12-03 H.J. Lu <hjl.tools@gmail.com>
12 * config/i386/i386-expand.c (ix86_expand_builtin): Pass PVW_NONE
13 and PVW_NONE to ix86_target_string.
14 * config/i386/i386-options.c (ix86_target_string): Add arguments
15 for move_max and store_max.
16 (ix86_target_string::add_vector_width): New lambda.
17 (ix86_debug_options): Pass ix86_move_max and ix86_store_max to
19 (ix86_function_specific_print): Pass ptr->x_ix86_move_max and
20 ptr->x_ix86_store_max to ix86_target_string.
21 (ix86_valid_target_attribute_tree): Handle x_ix86_move_max and
23 (ix86_option_override_internal): Set the default x_ix86_move_max
25 * config/i386/i386-options.h (ix86_target_string): Add
26 prefer_vector_width and prefer_vector_width.
27 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): Removed.
28 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
29 (MOVE_MAX): Use 64 if ix86_move_max or ix86_store_max ==
30 PVW_AVX512. Use 32 if ix86_move_max or ix86_store_max >=
32 (STORE_MAX_PIECES): Use 64 if ix86_store_max == PVW_AVX512.
33 Use 32 if ix86_store_max >= PVW_AVX256.
34 * config/i386/i386.opt: Add -mmove-max=bits and -mstore-max=bits.
35 * config/i386/x86-tune.def (X86_TUNE_AVX512_MOVE_BY_PIECES): New.
36 (X86_TUNE_AVX512_STORE_BY_PIECES): Likewise.
37 * doc/invoke.texi: Document -mmove-max=bits and -mstore-max=bits.
39 2021-12-03 Bill Schmidt <wschmidt@linux.ibm.com>
41 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Fix builtin
44 2021-12-03 SiYu Wu <siyu@isrc.iscas.ac.cn>
46 * common/config/riscv/riscv-common.c (riscv_implied_info):
47 Add K-ext related entry.
48 (riscv_supported_std_ext): Add 'k'.
49 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add 'k'.
50 (IMPLIED_EXT): Add K-ext related entry.
52 2021-12-03 SiYu Wu <siyu@isrc.iscas.ac.cn>
54 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
56 * config/riscv/riscv-opts.h (MASK_ZBKB): New.
73 (TARGET_ZKSED): Ditto.
76 * config/riscv/riscv.opt (riscv_zk_subext): New.
78 2021-12-03 Tamar Christina <tamar.christina@arm.com>
80 * tree-vect-stmts.c (prepare_load_store_mask): Rename to...
81 (prepare_vec_mask): ...This and record operations that have already been
83 (vectorizable_call): Use it.
84 (vectorizable_operation): Likewise.
85 (vectorizable_store): Likewise.
86 (vectorizable_load): Likewise.
87 * tree-vectorizer.h (class _loop_vec_info): Add vec_cond_masked_set.
88 (vec_cond_masked_set_type, tree_cond_mask_hash): New.
90 2021-12-03 H.J. Lu <hjl.tools@gmail.com>
94 * target.def (ifunc_ref_local_ok): Add a target hook.
95 * varasm.c (default_binds_local_p_3): Force indirect function
96 resolver non-local only if targetm.ifunc_ref_local_ok returns
98 * config/i386/i386-expand.c (ix86_expand_call): Call
99 ix86_call_use_plt_p to check if PLT should be used.
100 * config/i386/i386-protos.h (ix86_call_use_plt_p): New.
101 * config/i386/i386.c (output_pic_addr_const): Call
102 ix86_call_use_plt_p to check if "@PLT" is needed.
103 (ix86_call_use_plt_p): New.
104 (TARGET_IFUNC_REF_LOCAL_OK): New.
105 * doc/tm.texi.in: Add TARGET_IFUNC_REF_LOCAL_OK.
106 * doc/tm.texi: Regenerated.
108 2021-12-03 Jakub Jelinek <jakub@redhat.com>
110 * attribs.h (simple_cst_list_equal): Declare.
111 * attribs.c (simple_cst_list_equal): No longer static.
112 * config/i386/i386-options.c (target_attribute_cache): New variable.
113 (ix86_valid_target_attribute_p): Cache DECL_FUNCTION_SPECIFIC_TARGET
114 and DECL_FUNCTION_SPECIFIC_OPTIMIZATION based on args.
116 2021-12-03 Jakub Jelinek <jakub@redhat.com>
119 * ggc.h (gt_pch_note_callback): Declare.
120 * gengtype.h (enum typekind): Add TYPE_CALLBACK.
121 (callback_type): Declare.
122 * gengtype.c (dbgprint_count_type_at): Handle TYPE_CALLBACK.
123 (callback_type): New variable.
124 (process_gc_options): Add CALLBACK argument, handle callback
126 (set_gc_used_type): Adjust process_gc_options caller, if callback,
127 set type to &callback_type.
128 (output_mangled_typename): Handle TYPE_CALLBACK.
129 (walk_type): Likewise. Handle callback option.
130 (write_types_process_field): Handle TYPE_CALLBACK.
131 (write_types_local_user_process_field): Likewise.
132 (write_types_local_process_field): Likewise.
133 (write_root): Likewise.
134 (dump_typekind): Likewise.
135 (dump_type): Likewise.
136 * gengtype-state.c (type_lineloc): Handle TYPE_CALLBACK.
137 (state_writer::write_state_callback_type): New method.
138 (state_writer::write_state_type): Handle TYPE_CALLBACK.
139 (read_state_callback_type): New function.
140 (read_state_type): Handle TYPE_CALLBACK.
141 * ggc-common.c (callback_vec): New variable.
142 (gt_pch_note_callback): New function.
143 (gt_pch_save): Stream out gt_pch_save function address and relocation
145 (gt_pch_restore): Stream in saved gt_pch_save function address and
146 relocation table and apply relocations if needed.
147 * doc/gty.texi (callback): Document new GTY option.
148 * varasm.c (get_unnamed_section): Change callback argument's type and
149 last argument's type from const void * to const char *.
150 (output_section_asm_op): Change argument's type from const void *
151 to const char *, remove unnecessary cast.
152 * tree-core.h (struct tree_translation_unit_decl): Drop GTY((skip))
153 from language member.
154 * output.h (unnamed_section_callback): Change argument type from
155 const void * to const char *.
156 (struct unnamed_section): Use GTY((callback)) instead of GTY((skip))
157 for callback member. Change data member type from const void *
159 (struct noswitch_section): Use GTY((callback)) instead of GTY((skip))
161 (get_unnamed_section): Change callback argument's type and
162 last argument's type from const void * to const char *.
163 (output_section_asm_op): Change argument's type from const void *
165 * config/avr/avr.c (avr_output_progmem_section_asm_op): Likewise.
166 Remove unneeded cast.
167 * config/darwin.c (output_objc_section_asm_op): Change argument's type
168 from const void * to const char *.
169 * config/pa/pa.c (som_output_text_section_asm_op): Likewise.
170 (som_output_comdat_data_section_asm_op): Likewise.
171 * config/rs6000/rs6000.c (rs6000_elf_output_toc_section_asm_op):
173 (rs6000_xcoff_output_readonly_section_asm_op): Likewise. Instead
174 of dereferencing directive hardcode variable names and decide based on
175 whether directive is NULL or not.
176 (rs6000_xcoff_output_readwrite_section_asm_op): Change argument's type
177 from const void * to const char *.
178 (rs6000_xcoff_output_tls_section_asm_op): Likewise. Instead
179 of dereferencing directive hardcode variable names and decide based on
180 whether directive is NULL or not.
181 (rs6000_xcoff_output_toc_section_asm_op): Change argument's type
182 from const void * to const char *.
183 (rs6000_xcoff_asm_init_sections): Adjust get_unnamed_section callers.
185 2021-12-03 Richard Purdie <richard.purdie@linuxfoundation.org>
187 * Makefile.in: Fix "argument list too long" from install-plugins.
189 2021-12-03 Matt Jacobson <mhjacobson@me.com>
191 * config.gcc: For the AVR target, populate TM_MULTILIB_CONFIG.
192 * config/avr/genmultilib.awk: Add ability to filter generated multilib
194 * config/avr/t-avr: Pass TM_MULTILIB_CONFIG to genmultilib.awk.
195 * configure.ac: Update help string for --with-multilib-list.
196 * configure: Regenerate.
198 2021-12-02 Ilya Lipnitskiy <ilya.lipnitskiy@gmail.com>
200 * configure: Regenerate.
201 * configure.ac: Define TARGET_LIBC_GNUSTACK on musl.
203 2021-12-02 Iain Sandoe <iain@sandoe.co.uk>
206 * config/host-darwin.c (TRY_EMPTY_VM_SPACE,
207 SAFE_ALLOC_SIZE): New.
208 (darwin_gt_pch_get_address): Rewrite to use nominated
209 memory segments rather than part of the compiler __DATA
211 (darwin_gt_pch_use_address): Likewise.
213 2021-12-02 Gerald Pfeifer <gerald@pfeifer.com>
215 * doc/install.texi (*-*-freebsd*): Remove references to
216 FreeBSD 1 and FreeBSD 2.
218 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
220 PR rtl-optimization/103437
221 * ira-color.c (setup_allocno_priorities): Switch off backup code
222 for overflow if compiler has __builtin_smul_overflow. Use <
223 for comparison with -INT_MAX.
225 2021-12-02 Bill Schmidt <wschmidt@linux.ibm.com>
227 * config/rs6000/altivec.h: Delete a number of #defines that are now
228 superfluous. Alphabetize. Include rs6000-vecdefines.h. Include some
230 * config/rs6000/rs6000-builtin-new.def (CMPB): Flag as no32bit.
231 (BPERMD): Flag as 32bit (needing special handling for 32-bit).
232 (UNPACK_TD): Return unsigned long long instead of unsigned long.
233 (GET_TEXASR): Return unsigned long instead of unsigned long long.
234 (GET_TEXASRU): Likewise.
235 (GET_TFHAR): Likewise.
236 (GET_TFIAR): Likewise.
237 (SET_TEXASR): Pass unsigned long instead of unsigned long long.
238 (SET_TEXASRU): Likewise.
239 (SET_TFHAR): Likewise.
240 (SET_TFIAR): Likewise.
241 (TABORTDC): Likewise.
242 (TABORTDCI): Likewise.
243 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): Fix error
244 handling for no32bit. Add 32bit handling for RS6000_BIF_BPERMD.
245 * config/rs6000/rs6000-gen-builtins.c (write_init_file): Initialize
246 new_builtins_are_live to 1.
248 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
250 PR rtl-optimization/103437
251 * ira-color.c (setup_allocno_priorities): Use long long
252 multiplication as backup for overflow processing.
254 2021-12-02 Tamar Christina <tamar.christina@arm.com>
256 * config/aarch64/aarch64-protos.h (aarch64_gen_shareable_zero): New.
257 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>,
258 aarch64_rshrn2<mode>): Generate rounding half-ing add when appropriate.
259 * config/aarch64/aarch64.c (aarch64_gen_shareable_zero): New.
261 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
263 PR rtl-optimization/103437
264 * ira-color.c (setup_allocno_priorities): Process multiplication
267 2021-12-02 Martin Liska <mliska@suse.cz>
269 PR gcov-profile/96092
270 * common.opt: New option.
271 * coverage.c (coverage_begin_function): Emit filename with
272 remap_profile_filename.
273 * doc/invoke.texi: Document the new option.
274 * file-prefix-map.c (add_profile_prefix_map): New.
275 (remap_profile_filename): Likewise.
276 * file-prefix-map.h (add_profile_prefix_map): Likewise.
277 (remap_profile_filename): Likewise.
278 * lto-opts.c (lto_write_options): Handle
279 OPT_fprofile_prefix_map_.
280 * opts-global.c (handle_common_deferred_options): Likewise.
281 * opts.c (common_handle_option): Likewise.
282 (gen_command_line_string): Likewise.
283 * profile.c (output_location): Emit filename with
284 remap_profile_filename.
286 2021-12-02 Richard Biener <rguenther@suse.de>
289 * internal-fn.c (expand_DEFERRED_INIT): When the base
290 of the LHS is a decl with matching constant size use
291 that as the initialization target instead of an
292 eventual VLA typed one.
294 2021-12-02 Richard Biener <rguenther@suse.de>
296 PR tree-optimization/103527
297 * tree-vect-stmts.c (vect_build_gather_load_calls): Always
298 use a truth type for building the vector mask.
300 2021-12-02 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
302 * config/vxworks.h (LTO_PLUGIN_SPEC): Adapt to corrected
303 spelling of -linker-output-auto-nolto-rel.
305 2021-12-01 Uroš Bizjak <ubizjak@gmail.com>
308 * config/i386/sse.md (VI2F): Remove mode iterator.
309 (VI2F_256_512): New mode iterator.
310 (vec_set<V8_128:mode>_0): New insn pattern.
311 (vec_set<VI2F_256_512:mode>_0>): Rename from vec_set<VI2F:mode>mode.
312 Use VI2F_256_512 mode iterator instead of VI2F.
313 (*axv512fp16_movsh): Remove.
314 (<sse2p4_1>_pinsr<ssemodesuffix>): Add (x,x,x) AVX2 alternative.
315 Do not disable V8HF mode insn on AVX2 targets.
316 (pinsrw -> pbroadcast + pblendw peephole2): New peephole.
317 (pinsrw -> pbroadcast + pblendw splitter): New post-reload splitter.
318 * config/i386/i386.md (extendhfsf): Call gen_vec_setv8hf_0.
319 * config/i386/i386-expand.c (ix86_expand_vector_set)
320 <case E_V8HFmode>: Use vec_merge path for TARGET_AVX2.
322 2021-12-01 Roger Sayle <roger@nextmovesoftware.com>
323 Richard Biener <rguenther@suse.de>
325 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
326 Check if simplify_using_initial_conditions allows us to
327 simplify the expression for may_be_zero.
328 * match.pd (X != C ? -X : -C -> -X): New transform.
329 (X != C ? ~X : ~C -> ~X): Likewise.
330 ((X+1) > Y ? -X : 1 -> X >= Y ? -X : 1): Likewise.
332 2021-12-01 Peter Bergner <bergner@linux.ibm.com>
335 * gimplify.c (is_var_need_auto_init): Handle opaque types.
337 2021-12-01 Iain Buclaw <ibuclaw@gdcproject.org>
339 * common.opt (static-libphobos): Add option.
340 * config/darwin.h (LINK_SPEC): Substitute -lgphobos with libgphobos.a
341 when linking statically.
342 * gcc.c (driver_handle_option): Set -static-libphobos as always valid.
344 2021-12-01 Jason Merrill <jason@redhat.com>
347 * fold-const.c (maybe_nonzero_address): Use get_create or get
348 depending on folding_initializer.
349 (fold_binary_initializer_loc): New.
350 * fold-const.h (fold_binary_initializer_loc): Declare.
351 * varasm.c (mark_weak): Don't use the decl location.
352 * doc/invoke.texi: Document -fconstexpr-fp-except.
354 2021-12-01 Bill Schmidt <wschmidt@linux.ibm.com>
357 * config/rs6000/rs6000-c.c (rs6000-builtins.h): Stop including.
358 (rs6000-internal.h): Include.
359 (altivec_resolve_new_overloaded_builtin): Move call to
360 rs6000_invalid_new_builtin here from rs6000_new_builtin_decl.
361 * config/rs6000/rs6000-call.c (rs6000-builtins.h): Stop including.
362 (rs6000_invalid_new_builtin): Remove static qualifier.
363 (rs6000_new_builtin_decl): Remove test for supported builtin.
364 * config/rs6000/rs6000-internal.h (rs6000-builtins.h): Include.
365 (rs6000_invalid_new_builtin): Declare.
366 * config/rs6000/rs6000.c (rs6000-builtins.h): Don't include.
368 2021-12-01 Aldy Hernandez <aldyh@redhat.com>
370 PR tree-optimization/103409
371 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
372 Do all the work with just one ssa_global_cache.
373 * gimple-range-path.h: Remove m_tmp_phi_cache.
375 2021-12-01 Richard Sandiford <richard.sandiford@arm.com>
377 PR tree-optimization/103517
378 * tree-vect-slp.c (vect_build_slp_tree_1): When allowing two
379 different component references, check the codes of both them,
380 rather than just the first.
382 2021-12-01 Andrew MacLeod <amacleod@redhat.com>
384 PR tree-optimization/103464
385 * gimple-range.cc (gimple_ranger::prefill_name): Process phis also.
386 (gimple_ranger::prefill_stmt_dependencies): Ditto.
388 2021-12-01 Iain Buclaw <ibuclaw@gdcproject.org>
390 * doc/install.texi (Prerequisites): Add note that D front end now
391 requires GDC installed in order to bootstrap.
392 (Building): Add D compiler section, referencing prerequisites.
394 2021-12-01 Jakub Jelinek <jakub@redhat.com>
396 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE,
397 X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY,
398 X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Formatting fixes.
399 (X86_TUNE_USE_GATHER): Put m_GENERIC last for consistency.
401 2021-12-01 Jakub Jelinek <jakub@redhat.com>
403 PR rtl-optimization/102356
404 * rtl.h (simplify_context): Add assoc_count member and
405 max_assoc_count static member.
406 * simplify-rtx.c (simplify_associative_operation): Don't reassociate
407 more than max_assoc_count times within one outermost simplify_* call.
408 * dwarf2out.c (mem_loc_descriptor): Optimize binary operation
409 with both operands the same using DW_OP_dup.
411 2021-12-01 Tamar Christina <tamar.christina@arm.com>
413 * match.pd: Move below pattern that rewrites to EQ, NE.
414 * tree.c (bitmask_inv_cst_vector_p): Correct do .. while indentation.
416 2021-12-01 Siddhesh Poyarekar <siddhesh@gotplt.org>
417 Martin Liška <mliska@suse.cz>
419 PR tree-optimization/103456
420 * tree-object-size.c (merge_object_sizes): Update osi->changed
421 only if object_sizes_set succeeded.
423 2021-11-30 liuhongt <hongtao.liu@intel.com>
427 * config/i386/i386.md (*x86_64_shld_1): Set_attr
428 length_immediate to 1.
429 (*x86_shld_1): Ditto.
430 (*x86_64_shrd_1): Ditto.
431 (*x86_shrd_1): Ditto.
433 2021-11-30 Bill Schmidt <wschmidt@linux.ibm.com>
435 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
437 (altivec_resolve_new_overloaded_builtin): Likewise.
439 2021-11-30 Martin Jambor <mjambor@suse.cz>
442 * ipa-sra.c (scan_function): Also check ECF_LOOPING_CONST_OR_PURE flag.
444 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
446 PR tree-optimization/103494
447 * tree-vect-stmts.c (vect_get_gather_scatter_ops): Remove ncopies
448 argument and calculate ncopies from gs_info->offset_vectype
450 (vectorizable_store, vectorizable_load): Update accordingly.
452 2021-11-30 Iain Buclaw <ibuclaw@gdcproject.org>
454 * Makefile.in (GDC): New variable.
455 (GDCFLAGS): New variable.
456 * configure: Regenerate.
457 * configure.ac: Add call to ACX_PROG_GDC. Substitute GDCFLAGS.
459 2021-11-30 Martin Liska <mliska@suse.cz>
460 Martin Jambor <mjambor@suse.cz>
463 * ipa-param-manipulation.c
464 (ipa_param_body_adjustments::prepare_debug_expressions): Be
465 careful about hash_map reallocating itself. Simpify a return
466 which always returns true.
468 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
470 PR tree-optimization/103440
471 * gimple-range-fold.cc (fold_using_range::range_of_phi): Continue
472 normal param processing for equiv params.
474 2021-11-30 Richard Biener <rguenther@suse.de>
476 * config/i386/i386.c (ix86_shift_rotate_cost): Remove
478 * tree-chrec.c (evolution_function_is_invariant_rec_p):
480 * tree-if-conv.c (if_convertible_stmt_p): Likewise.
481 * tree-ssa-pre.c (fully_constant_expression): Likewise.
482 * tree-vrp.c (operand_less_p): Likewise.
483 * reload.c (reg_overlap_mentioned_for_reload_p): Remove
484 unreachable gcc_unreachable ().
485 * sel-sched-ir.h (bb_next_bb): Likewise.
486 * varasm.c (compare_constant): Likewise.
488 2021-11-30 Richard Biener <rguenther@suse.de>
490 PR tree-optimization/103489
491 * tree-vect-loop.c (vectorizable_phi): Verify argument
492 vector type compatibility to mitigate bool pattern recog
495 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
497 * omp-offload.c (oacc_loop_auto_partitions): Remove erroneous
498 "Orphan reductions cannot have gang partitioning" handling.
500 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
502 * omp-offload.c (oacc_loop_process): Implement "gang reduction on
503 an orphan loop" checking.
505 2021-11-30 Cesar Philippidis <cesar@codesourcery.com>
506 Thomas Schwinge <thomas@codesourcery.com>
508 * omp-general.h (enum oacc_loop_flags): Add OLF_REDUCTION enum.
509 * omp-low.c (lower_oacc_head_mark): Use it to mark OpenACC
511 * omp-offload.c (oacc_loop_auto_partitions): Don't assign gang
512 level parallelism to orphan reductions.
514 2021-11-30 Aldy Hernandez <aldyh@redhat.com>
516 PR tree-optimization/103451
517 * range-op.cc (operator_div::wi_fold): Remove
518 can_throw_non_call_exceptions special case.
519 * tree-ssa-sink.c (sink_code_in_bb): Same.
521 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
523 * tree-vect-slp.c (arg1_arg4_map): New variable.
524 (vect_get_operand_map): Handle IFN_MASK_GATHER_LOAD.
525 (vect_build_slp_tree_1): Likewise.
526 (vect_build_slp_tree_2): Likewise.
527 * tree-vect-stmts.c (vectorizable_load): Expect the mask to be
528 the last SLP child node rather than the first.
530 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
532 * tree-if-conv.c: Include tree-eh.h.
533 (predicate_statements): Remove pe argument. Don't hoist
535 (combine_blocks): Remove pe argument.
536 (ifcvt_available_on_edge_p, ifcvt_can_hoist): New functions.
537 (ifcvt_hoist_invariants): Likewise.
538 (tree_if_conversion): Update call to combine_blocks. Call
539 ifcvt_hoist_invariants after VN.
541 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
543 PR tree-optimization/102467
544 * doc/sourcebuild.texi (vect_gather_load_ifn): Document.
545 * tree-vect-data-refs.c (vect_analyze_data_ref_dependence):
546 Commonize safelen handling. Punt for anything involving
547 gathers and scatters unless safelen says otherwise.
548 * tree-vect-slp.c (arg1_map): New variable.
549 (vect_get_operand_map): Handle IFN_GATHER_LOAD.
550 (vect_build_slp_tree_1): Likewise.
551 (vect_build_slp_tree_2): Likewise.
552 (compatible_calls_p): If vect_get_operand_map returns nonnull,
553 check that any skipped arguments are equal.
554 (vect_slp_analyze_node_operations_1): Tighten reduction check.
555 * tree-vect-stmts.c (check_load_store_for_partial_vectors): Take
557 (vect_get_gather_scatter_ops): Take slp_node and ncopies arguments.
559 (vectorizable_store, vectorizable_load): Adjust accordingly.
561 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
563 * gimple.h (gimple_num_args, gimple_arg, gimple_arg_ptr): New
565 * tree-vect-slp.c (cond_expr_maps, arg2_map): New variables.
566 (vect_get_operand_map): New function.
567 (vect_get_and_check_slp_defs): Fix outdated comment.
568 Use vect_get_operand_map and new gimple argument accessors.
569 (vect_build_slp_tree_2): Likewise.
571 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
573 * tree-vect-slp.c (vect_build_slp_tree_1): Use code_helper
574 to record the operations performed by statements, only using
575 CALL_EXPR for things that don't map to built-in or internal
576 functions. For shifts, require all shift amounts to be equal
577 if optab_vector is not supported but optab_scalar is.
579 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
581 * tree-vect-data-refs.c (vect_check_gather_scatter): Continue
582 processing conversions if the current offset is a pointer.
584 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
586 * tree-vectorizer.h (vect_is_reduction): Use STMT_VINFO_REDUC_IDX.
588 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
590 * genopinit.c (main): Turn supports_vec_gather_load and
591 supports_vec_scatter_store into signed char arrays and remove
592 supports_vec_gather_load_cached and supports_vec_scatter_store_cached.
593 * optabs-query.c (supports_vec_convert_optab_p): Add a mode parameter.
594 If the mode is not VOIDmode, test only for that mode.
595 (supports_vec_gather_load_p): Likewise.
596 (supports_vec_scatter_store_p): Likewise.
597 * optabs-query.h (supports_vec_gather_load_p): Likewise.
598 (supports_vec_scatter_store_p): Likewise.
599 * tree-vect-data-refs.c (vect_check_gather_scatter): Pass the
600 vector mode to supports_vec_gather_load_p and
601 supports_vec_scatter_store_p.
603 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
605 * internal-fn.c (commutative_binary_fn_p): Handle IFN_ADD_OVERFLOW
606 and IFN_MUL_OVERFLOW.
608 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
610 * internal-fn.c (commutative_binary_fn_p): Handle IFN_UBSAN_CHECK_ADD
611 and IFN_UBSAN_CHECK_MUL.
613 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
615 * internal-fn.c (commutative_binary_fn_p): Handle IFN_COMPLEX_MUL.
617 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
619 * gimple-fold.c: Include internal-fn.h.
620 (fold_stmt_1): If a function maps to an internal one, use
621 first_commutative_argument to canonicalize the order of
622 commutative arguments.
623 * gimple-match-head.c (gimple_resimplify2, gimple_resimplify3)
624 (gimple_resimplify4, gimple_resimplify5): Extend commutativity
627 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
629 * doc/md.texi (reduc_fmin_scal_@var{m}): Document.
630 (reduc_fmax_scal_@var{m}): Likewise.
631 * optabs.def (reduc_fmax_scal_optab): New optab.
632 (reduc_fmin_scal_optab): Likewise
633 * internal-fn.def (REDUC_FMAX, REDUC_FMIN): New functions.
634 * tree-vect-loop.c (reduction_fn_for_scalar_code): Handle
635 CASE_CFN_FMAX and CASE_CFN_FMIN.
636 (neutral_op_for_reduction): Likewise.
637 (needs_fold_left_reduction_p): Likewise.
638 * config/aarch64/iterators.md (FMAXMINV): New iterator.
639 (fmaxmin): Handle UNSPEC_FMAXNMV and UNSPEC_FMINNMV.
640 * config/aarch64/aarch64-simd.md (reduc_<optab>_scal_<mode>): Fix
642 (reduc_<fmaxmin>_scal_<mode>): New pattern.
643 * config/aarch64/aarch64-sve.md (reduc_<fmaxmin>_scal_<mode>):
646 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
648 * builtins.h (associated_internal_fn): Declare overload that
649 takes a (combined_cfn, return type) pair.
650 * builtins.c (associated_internal_fn): Split new overload out
651 of original fndecl version. Also provide an overload that takes
652 a (combined_cfn, return type) pair.
653 * internal-fn.h (commutative_binary_fn_p): Declare.
654 (commutative_ternary_fn_p): Likewise.
655 (associative_binary_fn_p): Likewise.
656 * internal-fn.c (commutative_binary_fn_p, commutative_ternary_fn_p):
657 New functions, split out from...
658 (first_commutative_argument): ...here.
659 (associative_binary_fn_p): New function.
660 * gimple-match.h (code_helper): Add a constructor that takes
662 (commutative_binary_op_p): Declare.
663 (commutative_ternary_op_p): Likewise.
664 (first_commutative_argument): Likewise.
665 (associative_binary_op_p): Likewise.
666 (canonicalize_code): Likewise.
667 (directly_supported_p): Likewise.
668 (get_conditional_internal_fn): Likewise.
669 (gimple_build): New overloads that takes a code_helper.
670 * gimple-fold.c (gimple_build): Likewise.
671 * gimple-match-head.c (commutative_binary_op_p): New function.
672 (commutative_ternary_op_p): Likewise.
673 (first_commutative_argument): Likewise.
674 (associative_binary_op_p): Likewise.
675 (canonicalize_code): Likewise.
676 (directly_supported_p): Likewise.
677 (get_conditional_internal_fn): Likewise.
678 * tree-vectorizer.h: Include gimple-match.h.
679 (neutral_op_for_reduction): Take a code_helper instead of a tree_code.
680 (needs_fold_left_reduction_p): Likewise.
681 (reduction_fn_for_scalar_code): Likewise.
682 (vect_can_vectorize_without_simd_p): Declare a nNew overload that takes
684 * tree-vect-loop.c: Include case-cfn-macros.h.
685 (fold_left_reduction_fn): Take a code_helper instead of a tree_code.
686 (reduction_fn_for_scalar_code): Likewise.
687 (neutral_op_for_reduction): Likewise.
688 (needs_fold_left_reduction_p): Likewise.
689 (use_mask_by_cond_expr_p): Likewise.
690 (build_vect_cond_expr): Likewise.
691 (vect_create_partial_epilog): Likewise. Use gimple_build rather
692 than gimple_build_assign.
693 (check_reduction_path): Handle calls and operate on code_helpers
694 rather than tree_codes.
695 (vect_is_simple_reduction): Likewise.
696 (vect_model_reduction_cost): Likewise.
697 (vect_find_reusable_accumulator): Likewise.
698 (vect_create_epilog_for_reduction): Likewise.
699 (vect_transform_cycle_phi): Likewise.
700 (vectorizable_reduction): Likewise. Make more use of
702 (vect_transform_reduction): Use gimple_extract_op but expect
704 (vect_can_vectorize_without_simd_p): New overload that takes
706 * tree-vect-stmts.c (vectorizable_call): Handle reductions in
708 * tree-vect-patterns.c (vect_mark_pattern_stmts): Use
709 gimple_extract_op when updating STMT_VINFO_REDUC_IDX.
711 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
713 * gimple-match.h (code_helper): Provide == and != overloads.
714 (code_helper::operator tree_code): Make explicit.
715 (code_helper::operator combined_fn): Likewise.
716 * gimple-match-head.c (convert_conditional_op): Use explicit
717 conversions where necessary.
718 (gimple_resimplify1, gimple_resimplify2, gimple_resimplify3): Likewise.
719 (maybe_push_res_to_seq, gimple_simplify): Likewise.
720 * gimple-fold.c (replace_stmt_with_simplification): Likewise.
722 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
724 * gimple-match.h (code_helper): Add functions for querying whether
725 the code represents an internal_fn or a built_in_function.
726 Provide explicit conversion operators for both cases.
727 (gimple_extract_op): Declare.
728 * gimple-match-head.c (gimple_extract): New function, extracted from...
729 (gimple_simplify): ...here.
730 (gimple_extract_op): New function.
732 2021-11-30 Eric Botcazou <ebotcazou@adacore.com>
735 * config/i386/i386.c (ix86_output_call_insn): Beef up comment about
736 nops emitted with SEH.
737 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
738 the cold section, emit a nop before the directive if the previous
739 active instruction is a call.
741 2021-11-30 Roger Sayle <roger@nextmovesoftware.com>
742 Uroš Bizjak <ubizjak@gmail.com>
745 * config/i386/sse.md (define_split): New splitters to simplify
746 not;vec_duplicate;and as vec_duplicate;andn.
748 2021-11-30 Richard Biener <rguenther@suse.de>
751 * match.pd (-((int)x >> (prec - 1)) to (unsigned)x >> (prec - 1)):
752 Use VIEW_CONVERT_EXPR for vectors.
754 2021-11-30 Richard Biener <rguenther@suse.de>
756 * cfgrtl.c (skip_insns_after_block): Refactor code to
757 be more easily readable.
758 * expr.c (op_by_pieces_d::run): Remove unreachable
760 * sched-deps.c (sched_analyze): Remove unreachable
762 * sel-sched-ir.c (in_same_ebb_p): Likewise.
763 * tree-ssa-alias.c (nonoverlapping_refs_since_match_p):
764 Remove unreachable code.
765 * tree-vect-slp.c (vectorize_slp_instance_root_stmt):
766 Refactor to avoid unreachable loop iteration.
767 * tree.c (walk_tree_1): Remove unreachable break.
768 * vec-perm-indices.c (vec_perm_indices::series_p): Remove
771 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
774 * config/rs6000/rs6000-call.c (rs6000_builtin_decl): Remove builtin mask
777 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
779 * config/rs6000/rs6000.c
780 (rs6000_cost_data::update_target_cost_per_stmt): Adjust the way to
781 compute extra penalized cost. Remove useless parameter.
782 (rs6000_cost_data::rs6000_add_stmt_cost): Adjust the call to function
783 update_target_cost_per_stmt.
785 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
788 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
790 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
791 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
792 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
793 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
794 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
795 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
796 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
797 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
799 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
801 PR tree-optimization/103467
802 * gimple-range-gori.cc (range_def_chain::register_dependency): Don't
803 use an object reference after a potential resize.
805 2021-11-29 Uroš Bizjak <ubizjak@gmail.com>
808 * config/i386/i386.md (*movhi_internal): Introduce (*v,C) alternative.
809 Do not allocate non-GPR registers. Optimize xmm->xmm moves when
810 optimizing for size. Fix vpinsrw insn template.
811 (*movhf_internal): Fix pinsrw and pextrw insn templates for
812 AVX targets. Use sselog1 type instead of sselog. Optimize GPR moves.
813 Optimize xmm->xmm moves for TARGET_SSE_PARTIAL_REG_DEPENDENCY
814 and TARGET_SSE_SPLIT_REGS targets.
816 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
818 * configure: Re-regenerate.
820 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
823 * Makefile.in: Substitute CTAGS, ETAGS, and CSCOPE
824 variables. Use ETAGS variable in TAGS target.
825 * configure: Regenerate.
826 * configure.ac: Allow CTAGS, ETAGS, and CSCOPE
827 variables to be overridden.
829 2021-11-29 Paul A. Clarke <pc@us.ibm.com>
831 * config/rs6000/xmmintrin.h (_mm_movemask_ps): Use vec_extractm
833 * config/rs6000/emmintrin.h (_mm_movemask_pd): Likewise.
834 (_mm_movemask_epi8): Likewise.
836 2021-11-29 Richard Biener <rguenther@suse.de>
838 * read-rtl-function.c (function_reader::read_rtx_operand):
839 Return only after resetting m_in_call_function_usage.
841 2021-11-29 Roger Sayle <roger@nextmovesoftware.com>
843 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti): Perform the
844 conversion via V2DImode using vec_extractv2didi on TARGET_SSE2.
845 * config/i386/sse.md (rotlv1ti3, rotrv1ti3): Change constraint
846 on QImode shift amounts from const_int_operand to general_operand.
848 2021-11-29 Richard Biener <rguenther@suse.de>
850 * tree.h (reverse_storage_order_for_component_p): Remove
851 spurious gcc_unreachable.
852 * cfganal.c (dfs_find_deadend): Likewise.
853 * fold-const-call.c (fold_const_logb): Likewise.
854 (fold_const_significand): Likewise.
855 * gimple-ssa-store-merging.c (lhs_valid_for_store_merging_p):
858 2021-11-29 Richard Biener <rguenther@suse.de>
860 * vec.c (qsort_chk): Do not return the void return value
861 from the noreturn qsort_chk_error.
862 * ccmp.c (expand_ccmp_expr_1): Remove unreachable return.
863 * df-scan.c (df_ref_equal_p): Likewise.
864 * dwarf2out.c (is_base_type): Likewise.
865 (add_const_value_attribute): Likewise.
866 * fixed-value.c (fixed_arithmetic): Likewise.
867 * gimple-fold.c (gimple_fold_builtin_fputs): Likewise.
868 * gimple-ssa-strength-reduction.c (stmt_cost): Likewise.
869 * graphite-isl-ast-to-gimple.c
870 (gcc_expression_from_isl_expr_op): Likewise.
871 (gcc_expression_from_isl_expression): Likewise.
872 * ipa-fnsummary.c (will_be_nonconstant_expr_predicate):
874 * lto-streamer-in.c (lto_input_mode_table): Likewise.
876 2021-11-29 liuhongt <hongtao.liu@intel.com>
879 * config/i386/i386.c (inline_secondary_memory_needed): HImode
880 move between GPR and SSE registers is supported under
881 TARGET_SSE2 and above.
882 * config/i386/i386.md (extendhfsf2): Optimize expander.
884 * config/i386/sse.md (sse2p4_1): Adjust attr for V8HFmode to
887 2021-11-29 liuhongt <hongtao.liu@intel.com>
891 * config/i386/i386.c (ix86_secondary_reload): Without
892 TARGET_SSE4_1, General register is needed to move HImode from
893 sse register to memory.
894 * config/i386/sse.md (*vec_extrachf): Use %vpextrw instead of
895 pextrw in output templates.
896 * config/i386/i386.md (movhi_internal): Ditto, also fix typo of
897 MEM_P (operands[1]) and adjust mode/prefix/type attribute for
898 alternatives related to sse register.
900 2021-11-29 Richard Biener <rguenther@suse.de>
902 PR tree-optimization/103458
903 * tree-ssa-dce.c (make_forwarders_with_degenerate_phis): Do not
904 create forwarders for blocks with abnormal predecessors.
906 2021-11-29 Richard Biener <rguenther@suse.de>
908 * gimple-predicate-analysis.cc (can_be_invalidated_p):
909 Restore semantics to the one before the split from
912 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
914 * profile.c: Include sreal.h
915 (struct bb_stats): New.
916 (cmp_stats): New function.
917 (compute_branch_probabilities): Output bb stats.
919 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
921 * cfghooks.c: Include sreal.h, profile.h.
922 (profile_record_check_consistency): Fix checking of count counsistency;
923 record also dynamic mismatches.
924 * cfgrtl.c (rtl_account_profile_record): Similarly.
925 * tree-cfg.c (gimple_account_profile_record): Likewise.
926 * cfghooks.h (struct profile_record): Remove num_mismatched_freq_in,
927 num_mismatched_freq_out, turn time to double, add
928 dyn_mismatched_prob_out, dyn_mismatched_count_in,
929 num_mismatched_prob_out; remove num_mismatched_count_out.
930 * passes.c (account_profile_1): New function.
931 (account_profile_in_list): New function.
932 (pass_manager::dump_profile_report): Rewrite.
933 (execute_one_ipa_transform_pass): Check profile consistency after
935 (execute_all_ipa_transforms): Remove cfun test; record all transform
937 (execute_one_pass): Fix collecting of profile stats.
939 2021-11-27 Jakub Jelinek <jakub@redhat.com>
942 * config/i386/i386.md (x86_64_shld, x86_shld, x86_64_shrd, x86_shrd):
943 Change insn pattern to accurately describe the instructions.
944 (*x86_64_shld_1, *x86_shld_1, *x86_64_shrd_1, *x86_shrd_1): New
945 define_insn patterns.
946 (*x86_64_shld_2, *x86_shld_2, *x86_64_shrd_2, *x86_shrd_2): New
947 define_insn_and_split patterns.
948 (*ashl<dwi>3_doubleword_mask, *ashl<dwi>3_doubleword_mask_1,
949 *<insn><dwi>3_doubleword_mask, *<insn><dwi>3_doubleword_mask_1,
950 ix86_rotl<dwi>3_doubleword, ix86_rotr<dwi>3_doubleword): Adjust
951 splitters for x86_{,64_}sh{l,r}d pattern changes.
953 2021-11-27 Jakub Jelinek <jakub@redhat.com>
955 PR tree-optimization/103435
956 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Avoid UB if
957 n->range - rsize == 8, just clear both *cmpnop and *cmpxchg in that
960 2021-11-27 Martin Jambor <mjambor@suse.cz>
963 * ipa-prop.c (ipcp_transform_function): Call
964 delete_unreachable_blocks_update_callgraph instead of returning
967 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
969 * tree-object-size.c (object_sizes_grow, object_sizes_release,
970 object_sizes_unknown_p, object_sizes_get, object_size_set_force,
971 object_sizes_set): New functions.
972 (addr_object_size, compute_builtin_object_size,
973 expr_object_size, call_object_size, unknown_object_size,
974 merge_object_sizes, plus_stmt_object_size,
975 cond_expr_object_size, collect_object_sizes_for,
976 check_for_plus_in_loops_1, init_object_sizes,
977 fini_object_sizes): Adjust.
979 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
981 * tree-object-size.c: New enum.
982 (object_sizes, computed, addr_object_size,
983 compute_builtin_object_size, expr_object_size, call_object_size,
984 merge_object_sizes, plus_stmt_object_size,
985 collect_object_sizes_for, init_object_sizes, fini_object_sizes,
986 object_sizes_execute): Replace magic numbers with enums.
988 2021-11-26 Roger Sayle <roger@nextmovesoftware.com>
990 * tree-ssa-loop-ivopts.c (cand_value_at): Take a class
991 tree_niter_desc* argument instead of just a tree for NITER.
992 If we require the iv candidate value at the end of the final
993 loop iteration, try using the original loop bound as the
994 NITER for sufficiently simple loops.
995 (may_eliminate_iv): Update (only) call to cand_value_at.
997 2021-11-26 Jan Hubicka <hubicka@ucw.cz>
999 * ipa-modref.c (analyze_function): Drop parameter F and dump
1000 cgraph node name rather than cfun name.
1001 (modref_generate): Update.
1002 (modref_summaries::insert):Update.
1003 (modref_summaries_lto::insert):Update.
1004 (pass_modref::execute):Update.
1005 (ipa_merge_modref_summary_after_inlining): Improve combining of
1008 2021-11-26 Jan Hubicka <jh@suse.cz>
1011 * ipa-modref.c (update_escape_summary_1): Fix handling of min_flags.
1013 2021-11-26 konglin1 <lingling.kong@intel.com>
1016 * config/i386/i386.c (ix86_can_change_mode_class): Allow 16 bit data in XMM register
1018 * config/i386/i386.md (extendhfsf2): Add extenndhfsf2 for TARGET_F16C.
1019 (extendhfdf2): Restrict extendhfdf for TARGET_AVX512FP16 only.
1020 (*extendhf<mode>2): Rename from extendhf<mode>2.
1021 (truncsfhf2): Likewise.
1022 (truncdfhf2): Likewise.
1023 (*trunc<mode>2): Likewise.
1025 2021-11-26 liuhongt <hongtao.liu@intel.com>
1027 PR middle-end/103419
1028 * match.pd: Fix typo, use the type of second parameter, not
1031 2021-11-25 Jan Hubicka <jh@suse.cz>
1033 * ipa-cp.c (ipa_value_range_from_jfunc): Remove forgotten early return.
1035 2021-11-25 Roger Sayle <roger@nextmovesoftware.com>
1037 PR middle-end/103406
1038 * match.pd (minus @0 @0): Check tree_expr_maybe_infinite_p.
1040 2021-11-25 Martin Jambor <mjambor@suse.cz>
1043 * ipa-prop.h (ipa_get_param): New overload. Move bits of the existing
1045 * ipa-param-manipulation.h (ipa_param_adjustments): New member
1046 function get_updated_index_or_split.
1047 * ipa-param-manipulation.c
1048 (ipa_param_adjustments::get_updated_index_or_split): New function.
1049 * ipa-prop.c (adjust_agg_replacement_values): Reimplement, add
1050 capability to identify scalarized parameters and perform substitution
1052 (ipcp_transform_function): Create descriptors earlier, handle new
1053 return values of adjust_agg_replacement_values.
1055 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1057 * gimple-range-path.cc (path_range_query::compute_ranges_defined): Remove.
1058 (path_range_query::compute_ranges_in_block): Revert to bitmap order.
1059 * gimple-range-path.h: Remove compute_ranges_defined.
1061 2021-11-25 Andrew Stubbs <ams@codesourcery.com>
1064 * config/gcn/gcn.c (move_callee_saved_registers): Ensure that the
1065 number of spilled registers is counted correctly.
1067 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
1069 * ipa-prop.h (ipa_node_params::ipa_node_params): Initialize
1072 2021-11-25 Andrew MacLeod <amacleod@redhat.com>
1074 PR tree-optimization/103359
1075 * gimple-range-fold.cc (fold_using_range::range_of_phi): If arg is
1076 equivalent to def, don't initially include it's range.
1078 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
1080 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not check
1081 gimple_static_chain.
1083 2021-11-25 Richard Biener <rguenther@suse.de>
1085 * cgraphunit.c (symbol_table::output_weakrefs): Remove
1087 (get_alias_symbol): Remove now unused function.
1089 2021-11-25 Richard Biener <rguenther@suse.de>
1091 * cfgrtl.c (rtl_verify_fallthru): Do not stop verifying
1093 (skip_insns_after_block): Remove unreachable break and continue.
1095 2021-11-25 Richard Biener <rguenther@suse.de>
1097 * cfgexpand.c (label_rtx_for_bb): Remove dead loop construct.
1099 2021-11-25 Richard Biener <rguenther@suse.de>
1101 * regset.h (REG_SET_EMPTY_P): New macro.
1102 * cfgcleanup.c (thread_jump): Use REG_SET_EMPTY_P.
1104 2021-11-25 Martin Liska <mliska@suse.cz>
1106 * doc/invoke.texi: Use @option for -Wuninitialized.
1108 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1110 PR tree-optimization/103254
1111 * gimple-range-path.cc (path_range_query::compute_ranges): Move
1112 exported boolean code...
1113 (path_range_query::compute_imports): ...here.
1115 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
1117 PR tree-optimization/103254
1118 * gimple-range-path.cc (path_range_query::compute_ranges_defined): New
1119 (path_range_query::compute_ranges_in_block): Move to
1120 compute_ranges_defined.
1121 * gimple-range-path.h (compute_ranges_defined): New.
1123 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1125 PR tree-optimization/103417
1126 * match.pd ((X & Y) CMP 0): Only handle eq and ne. Commonalize
1129 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1131 PR tree-optimization/103376
1132 * gimple-ssa-store-merging.c (perform_symbolic_merge): For
1133 BIT_IOR_EXPR, if masked1 && masked2 && masked1 != masked2, don't
1134 punt, but set the corresponding result byte to MARKER_BYTE_UNKNOWN.
1135 For BIT_XOR_EXPR similarly and if masked1 == masked2 and the
1136 byte isn't MARKER_BYTE_UNKNOWN, set the corresponding result byte to
1139 2021-11-25 Jakub Jelinek <jakub@redhat.com>
1142 * doc/invoke.texi (-Wcomma-subscript): Document that for
1143 -std=c++20 the option isn't enabled by default with -Wno-deprecated
1144 but for -std=c++23 it is.
1146 2021-11-24 Jan Hubicka <hubicka@ucw.cz>
1148 * ipa-modref.c (implicit_eaf_flags_for_edge_and_arg): Break out from...
1149 (modref_merge_call_site_flags): ... here.
1150 (ipa_merge_modref_summary_after_inlining): Use it.
1152 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
1154 * cfgloop.c (verify_loop_structure): Reduce scope of
1155 'class loop *loop' variable.
1156 * ipa-fnsummary.c (analyze_function_body): Likewise.
1157 * loop-init.c (fix_loop_structure): Likewise.
1158 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
1159 * predict.c (predict_loops): Likewise.
1160 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
1161 * tree-vectorizer.c (pass_vectorize::execute): Likewise.
1163 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1165 PR tree-optimization/103231
1166 * gimple-range.cc (gimple_ranger::gimple_ranger): Create stmt stack.
1167 (gimple_ranger::gimple_ranger): Delete stmt stack.
1168 (gimple_ranger::range_of_stmt): Process depenedencies if they have no
1170 (gimple_ranger::prefill_name): New.
1171 (gimple_ranger::prefill_stmt_dependencies): New.
1172 * gimple-range.h (class gimple_ranger): Add prototypes.
1174 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1176 * gimple-range-cache.cc (ranger_cache::get_global_range): Always
1177 return a range, return if it came from the cache or not.
1178 (get_non_stale_global_range): Rename to get_global_range, and return
1179 the temporal state in a flag.
1180 * gimple-range-cache.h (get_non_stale_global_range): Rename and adjust.
1181 * gimple-range.cc (gimple_ranger::range_of_expr): No need to query
1183 (gimple_ranger::range_of_stmt): Adjust for global cache temporal state
1186 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
1188 * gimple-range.cc (gimple_ranger::range_on_edge): Call trailer when
1189 a constant is encountered to terminate the trace.
1191 2021-11-24 Maciej W. Rozycki <macro@embecosm.com>
1193 PR middle-end/103059
1194 * reload.c (find_reloads_address_1): Also accept the ASHIFT form
1195 of indexed addressing.
1196 (find_reloads): Adjust accordingly.
1198 2021-11-24 Richard Biener <rguenther@suse.de>
1199 Jan Hubicka <jh@suse.cz>
1201 PR tree-optimization/103168
1202 * ipa-modref.h (struct modref_summary): Add load_accesses.
1203 * ipa-modref.c (modref_summary::finalize): Initialize load_accesses.
1204 * tree-ssa-sccvn.c (visit_reference_op_call): Use modref
1205 info to walk the virtual use->def chain to CSE const/pure
1206 function calls possibly reading from memory.
1208 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
1210 PR middle-end/103244
1211 * gimplify.c (gimplify_adjust_omp_clauses): Restore previous
1214 2021-11-24 Richard Biener <rguenther@suse.de>
1216 PR middle-end/103193
1217 * match.pd: Avoid canonicalizing (le/ge @0 @0) to (eq @0 @0)
1218 with NaNs and -ftrapping-math.
1220 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1222 PR middle-end/103384
1223 * omp-general.c (omp_context_selector_matches): For ACCEL_COMPILER,
1224 return 0 for kind(host) and continue for kind(nohost).
1226 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1228 PR middle-end/103365
1229 * attribs.h (lookup_attribute): Allow attr_name to start with
1230 underscore, as long as canonicalize_attr_name returns false.
1231 (lookup_attribute_by_prefix): Don't call get_attribute_name twice.
1232 * attribs.c (extract_attribute_substring): Reimplement using
1233 canonicalize_attr_name.
1234 (register_scoped_attribute): Change gcc_assert into
1235 gcc_checking_assert, verify !canonicalize_attr_name rather than
1236 that str.str doesn't start with '_'.
1238 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1240 PR tree-optimization/103376
1241 * gimple-ssa-store-merging.c (perform_symbolic_merge): Add CODE
1242 argument. If CODE is not BIT_IOR_EXPR, ensure that one of masked1
1244 (find_bswap_or_nop_1, find_bswap_or_nop,
1245 imm_store_chain_info::try_coalesce_bswap): Adjust
1246 perform_symbolic_merge callers.
1248 2021-11-24 Richard Biener <rguenther@suse.de>
1250 * tree-ssa-loop-ivopts.c (find_givs): Take loop body as
1251 argument instead of re-computing it.
1252 (find_interesting_uses): Likewise.
1253 (find_induction_variables): Pass through loop body.
1254 (tree_ssa_iv_optimize_loop): Pass down loop body.
1256 2021-11-24 Tamar Christina <tamar.christina@arm.com>
1258 * tree-ssa-phiopt.c (spaceship_replacement): Handle new canonical
1261 2021-11-24 Tamar Christina <tamar.christina@arm.com>
1263 * tree.c (bitmask_inv_cst_vector_p): New.
1264 * tree.h (bitmask_inv_cst_vector_p): New.
1265 * match.pd: Use it in new bitmask compare pattern.
1267 2021-11-24 Jason Merrill <jason@redhat.com>
1269 * timevar.h (class auto_cond_timevar): New.
1271 2021-11-24 Hongtao Liu <hongtao.liu@intel.com>
1272 H.J. Lu <hongjiu.lu@intel.com>
1274 PR tree-optimization/103194
1275 * match.pd (gimple_nop_atomic_bit_test_and_p): Extended to
1277 * tree-ssa-ccp.c (gimple_nop_convert): Declare.
1278 (optimize_atomic_bit_test_and): Enhance
1279 optimize_atomic_bit_test_and to handle truncation.
1281 2021-11-23 Martin Sebor <msebor@redhat.com>
1284 * Makefile.in (OBJS): Add gimple-warn-recursion.o.
1285 * common.opt: Add -Winfinite-recursion.
1286 * doc/invoke.texi (-Winfinite-recursion): Document.
1287 * passes.def (pass_warn_recursion): Schedule a new pass.
1288 * tree-pass.h (make_pass_warn_recursion): Declare.
1289 * gimple-warn-recursion.c: New file.
1291 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
1293 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Change
1294 error messages for ENB_P8V and ENB_P9V.
1296 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
1298 * config/rs6000/rs6000-builtin-new.def: Add power6-64 stanza. Move
1299 CMPB to power6-64 stanza.
1300 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Handle
1302 (rs6000_new_builtin_is_supported): Likewise.
1303 (rs6000_expand_new_builtin): Likewise. Clean up formatting.
1304 (rs6000_init_builtins): Handle ENB_P6_64 case.
1305 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): Add BSTZ_P6_64.
1306 (stanza_map): Add entry mapping power6-64 to BSTZ_P6_64.
1307 (enable_string): Add "ENB_P6_64".
1308 (write_decls): Add ENB_P6_64 to bif_enable enum.
1310 2021-11-23 Jan Hubicka <jh@suse.cz>
1312 * ipa-modref-tree.h (struct modref_tree): Remove max_bases, max_refs
1314 (modref_tree::modref_tree): Remove parametr.
1315 (modref_tree::insert_base): Add max_bases parameter.
1316 (modref_tree::insert): Add max_bases, max_refs, max_accesses
1318 (modref_tree::insert): New member function.
1319 (modref_tree::merge): Add max_bases, max_refs, max_accesses
1321 (modref_tree::insert): New member function.
1322 * ipa-modref-tree.c (test_insert_search_collapse): Update.
1323 (test_merge): Update.
1324 * ipa-modref.c (dump_records): Don't dump max_refs and max_bases.
1325 (dump_lto_records): Likewise.
1326 (modref_summary::finalize): Fix whitespace.
1327 (get_modref_function_summary): Likewise.
1328 (modref_access_analysis::record_access): Update.
1329 (modref_access_analysis::record_access_lto): Update.
1330 (modref_access_analysis::process_fnspec): Update.
1331 (analyze_function): Update.
1332 (modref_summaries::duplicate): Update.
1333 (modref_summaries_lto::duplicate): Update.
1334 (write_modref_records): Update.
1335 (read_modref_records): Update.
1336 (read_section): Update.
1337 (propagate_unknown_call): Update.
1338 (modref_propagate_in_scc): Update.
1339 (ipa_merge_modref_summary_after_inlining): Update.
1341 2021-11-23 Martin Liska <mliska@suse.cz>
1343 * doc/invoke.texi: Remove 2 more duplicite param descriptions.
1345 2021-11-23 Richard Biener <rguenther@suse.de>
1347 PR tree-optimization/103361
1348 * gimple-loop-jam.c (adjust_unroll_factor): Use lambda_int
1349 for the dependence distance.
1350 * tree-data-ref.c (print_lambda_vector): Properly print a lambda_int.
1352 2021-11-23 Jakub Jelinek <jakub@redhat.com>
1354 * tree-inline.h (struct copy_body_data): Remove
1355 transform_lang_insert_block member.
1356 * tree-inline.c (remap_block): Don't call
1357 id->transform_lang_insert_block.
1358 (optimize_inline_calls, copy_gimple_seq_and_replace_locals,
1359 tree_function_versioning, maybe_inline_call_in_expr,
1360 copy_fn): Don't initialize id.transform_lang_insert_block.
1361 * gimplify.c (gimplify_omp_loop): Likewise.
1363 2021-11-23 Jan Hubicka <hubicka@ucw.cz>
1365 PR tree-optimization/103335
1366 * tree-ssa-dse.c (valid_ao_ref_for_dse): Rename to ...
1367 (valid_ao_ref_kill_for_dse): ... this; do not check that boundaries
1368 are divisible by BITS_PER_UNIT.
1369 (get_byte_aligned_range_containing_ref): New function.
1370 (get_byte_aligned_range_contained_in_ref): New function.
1371 (normalize_ref): Rename to ...
1372 (get_byte_range): ... this one; handle accesses not aligned to byte
1373 boundary; return range in bytes rater than updating ao_ref.
1374 (clear_live_bytes_for_ref): Take write ref by reference; simplify using
1376 (setup_live_bytes_from_ref): Likewise.
1377 (clear_bytes_written_by): Update.
1378 (live_bytes_read): Update.
1379 (dse_classify_store): Simplify tech before live_bytes_read checks.
1381 2021-11-23 Andrew Pinski <apinski@marvell.com>
1383 PR tree-optimization/102216
1384 * gimple-fold.c (fold_stmt_1): Add canonicalization
1385 of "&MEM[ssa_n, CST]" to "ssa_n p+ CST", note this
1386 can only be done if !in_place.
1388 2021-11-23 Jakub Jelinek <jakub@redhat.com>
1390 PR middle-end/102431
1391 * gimplify.c (replace_reduction_placeholders): Remove.
1392 (note_no_context_vars): New function.
1393 (gimplify_omp_loop): For OMP_PARALLEL's BIND_EXPR create a new
1394 BLOCK. Use copy_tree_body_r with walk_tree instead of unshare_expr
1395 and replace_reduction_placeholders for duplication of
1396 OMP_CLAUSE_REDUCTION_{INIT,MERGE} expressions. Ensure all mentioned
1397 automatic vars have DECL_CONTEXT set to non-NULL before doing so
1398 and reset it afterwards for those vars and their corresponding
1401 2021-11-23 Haochen Gui <guihaoc@gcc.gnu.org>
1404 * config/rs6000/altivec.md (altivec_vreve<mode>2 for VEC_K): Use
1405 xxbrq for v16qi, xxbrq + xxbrh for v8hi and xxbrq + xxbrw for v4si
1406 or v4sf when p9_vector is set.
1407 (altivec_vreve<mode>2 for VEC_64): Defined. Implemented by xxswapd.
1409 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
1411 PR tree-optimization/102232
1412 * match.pd (x * (1 + y / x) - y) -> (x - y % x): New optimization.
1414 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
1416 PR tree-optimization/96779
1417 * match.pd (-x == x) -> (x == 0): New optimization.
1419 2021-11-22 Roger Sayle <roger@nextmovesoftware.com>
1421 PR tree-optimization/98953
1422 PR tree-optimization/103345
1423 * gimple-ssa-store-merging.c (find_bswap_or_nop_1): Handle
1424 BIT_XOR_EXPR and PLUS_EXPR the same as BIT_IOR_EXPR.
1425 (pass_optimize_bswap::execute): Likewise.
1427 2021-11-22 Martin Liska <mliska@suse.cz>
1429 * doc/invoke.texi: Remove duplicate documentation for 3 params.
1431 2021-11-22 Richard Biener <rguenther@suse.de>
1433 PR tree-optimization/103351
1434 * tree-ssa-dce.c (sort_phi_args): Sort after e->dest_idx as
1437 2021-11-22 liuhongt <hongtao.liu@intel.com>
1440 * config/i386/constraints.md (Bk): New
1441 define_memory_constraint.
1442 * config/i386/i386-protos.h (ix86_gpr_tls_address_pattern_p):
1444 * config/i386/i386.c (ix86_gpr_tls_address_pattern_p): New
1446 * config/i386/i386.md (*movsi_internal): Don't allow
1447 mask/sse/mmx move in TLS code sequences.
1448 (*movdi_internal): Ditto.
1450 2021-11-22 Kewen Lin <linkw@linux.ibm.com>
1452 * config/xtensa/xtensa.md (movdi_internal, movdf_internal): Fix split
1455 2021-11-21 Jakub Jelinek <jakub@redhat.com>
1458 * config/i386/i386-options.c (ix86_valid_target_attribute_p): If
1459 fndecl already has DECL_FUNCTION_SPECIFIC_TARGET, use that as base
1460 instead of target_option_default_node.
1462 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
1465 * ipa-modref.c (parm_map_for_arg): Rename to ...
1466 (parm_map_for_ptr): .. this one; handle static chain and calls to
1468 (modref_access_analysis::get_access): Use parm_map_for_ptr.
1469 (modref_access_analysis::process_fnspec): Update.
1470 (modref_access_analysis::analyze_load): Update.
1471 (modref_access_analysis::analyze_store): Update.
1473 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
1475 * ipa-modref.c (ignore_nondeterminism_p): Move earlier in source
1477 (ignore_retval_p): Likewise.
1478 (ignore_stores_p): Likewise.
1479 (parm_map_for_arg): Likewise.
1480 (class modref_access_analysis): New class.
1481 (modref_access_analysis::set_side_effects): New member function.
1482 (modref_access_analysis::set_nondeterministic): New member function.
1483 (get_access): Turn to ...
1484 (modref_access_analysis::get_access): ... this one.
1485 (record_access): Turn to ...
1486 (modref_access_analysis::record_access): ... this one.
1487 (record_access_lto): Turn to ...
1488 (modref_access_analysis::record_access_lto): ... This one.
1489 (record_access_p): Turn to ...
1490 (modref_access_analysis::record_access_p): ... This one
1491 (modref_access_analysis::record_unknown_load): New member function.
1492 (modref_access_analysis::record_unknown_store): New member function.
1493 (get_access_for_fnspec): Turn to ...
1494 (modref_access_analysis::get_access_for_fnspec): ... this one.
1495 (merge_call_side_effects): Turn to ...
1496 (moderf_access_analysis::merge_call_side_effects): Turn to ...
1497 (collapse_loads): Move later in source code.
1498 (collapse_stores): Move later in source code.
1499 (process_fnspec): Turn to ...
1500 (modref_access_analysis::process_fnspec): ... this one.
1501 (analyze_call): Turn to ...
1502 (modref_access_analysis::analyze_call): ... this one.
1503 (struct summary_ptrs): Remove.
1504 (analyze_load): Turn to ...
1505 (modref_access_analysis::analyze_load): ... this one.
1506 (analyze_store): Turn to ...
1507 (modref_access_analysis::analyze_store): ... this one.
1508 (analyze_stmt): Turn to ...
1509 (modref_access_analysis::analyze_stmt): ... This one.
1510 (remove_summary): Remove.
1511 (modref_access_analysis::propagate): Break out from ...
1512 (modref_access_analysis::analyze): Break out from ...
1513 (analyze_function): ... here.
1515 2021-11-21 Roger Sayle <roger@nextmovesoftware.com>
1516 Robin Dapp <rdapp@linux.ibm.com>
1519 * tree-ssa-math-opts.c (convert_mult_to_widen): Recognize
1520 signed WIDEN_MULT_EXPR if the target supports umul_widen_optab.
1522 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
1525 * ipa-modref.c (ignore_nondeterminism_p): Allow looping pure/cont.
1526 (merge_call_side_effects): Improve debug output.
1528 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
1531 * ipa-pure-const.c (propagate_pure_const): Fix merging of loping flag.
1533 2021-11-20 Jeff Law <jeffreyalaw@gmail.com>
1535 PR tree-optimization/103226
1536 * config/bfin/bfin.md (doloop pattern, splitter and expander): Clobber
1539 2021-11-20 Andrew Pinski <apinski@marvell.com>
1541 PR tree-optimization/103220
1542 * match.pd ((type) X bitop CST): Don't check if CST
1543 fits into the type if only the sign changes.
1545 2021-11-20 Alexandre Oliva <oliva@adacore.com>
1547 PR tree-optimization/102988
1548 * gimple-harden-conditionals.cc (detach_value): Copy SSA_NAME
1549 without decl sharing.
1551 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
1553 * doc/invoke.texi: Remove whitespace after an @option.
1555 2021-11-19 Paul A. Clarke <pc@us.ibm.com>
1557 * config/rs6000/emmintrin.h (_mm_sad_epu8): Use vec_absd when
1558 _ARCH_PWR9, optimize vec_sum2s when LE.
1560 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
1563 * config/darwin-driver.c (darwin_driver_init): Handle exported
1564 symbols and symbol lists (suppress automatic export of the TLS
1566 * config/darwin.c (darwin_rename_builtins): Remove workaround.
1567 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): Likewise.
1568 (REAL_LIBGCC_SPEC): Handle revised library uses.
1569 * config/darwin.opt (nodefaultexport): New.
1570 * config/i386/darwin.h (PR80556_WORKAROUND): Remove.
1571 * config/i386/darwin32-biarch.h (PR80556_WORKAROUND): Likewise.
1572 * config/i386/darwin64-biarch.h (PR80556_WORKAROUND): Likewise.
1574 2021-11-19 Martin Jambor <mjambor@suse.cz>
1576 * opts.c (default_options_table): Switch off
1577 flag_semantic_interposition at Ofast.
1578 * doc/invoke.texi (Optimize Options): Document that Ofast switches off
1579 -fsemantic-interposition.
1581 2021-11-19 Jan Hubicka <hubicka@ucw.cz>
1583 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not guard modref
1584 by !gimple_call_chain.
1586 2021-11-19 Martin Sebor <msebor@redhat.com>
1590 * doc/invoke.texi (-Waddress): Update.
1592 2021-11-19 Andrew MacLeod <amacleod@redhat.com>
1594 PR tree-optimization/103254
1595 * gimple-range-gori.cc (range_def_chain::get_def_chain): Limit the
1596 depth for all statements with multple ssa names.
1598 2021-11-19 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
1600 * config/s390/s390.md (define_peephole2): Variable insn points
1601 to the first matched insn. Use peep2_next_insn(1) to refer to
1602 the second matched insn.
1604 2021-11-19 Tamar Christina <tamar.christina@arm.com>
1606 PR tree-optimization/103311
1608 * tree-vect-slp-patterns.c (vect_validate_multiplication): Fix CONJ
1609 test to new codegen.
1610 (complex_mul_pattern::matches): Move check downwards.
1612 2021-11-19 Martin Liska <mliska@suse.cz>
1615 2021-11-19 Martin Liska <mliska@suse.cz>
1617 * cfgexpand.c (pass_expand::execute): Use option directly.
1618 * function.c (allocate_struct_function): Likewise.
1619 * gimple-low.c (lower_function_body): Likewise.
1620 (lower_stmt): Likewise.
1621 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
1622 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
1623 * ipa-split.c (split_function): Likewise.
1624 * lto-streamer-in.c (input_function): Likewise.
1625 * sese.c (sese_insert_phis_for_liveouts): Likewise.
1626 * ssa-iterators.h (num_imm_uses): Likewise.
1627 * tree-cfg.c (make_blocks): Likewise.
1628 (gimple_merge_blocks): Likewise.
1629 * tree-inline.c (tree_function_versioning): Likewise.
1630 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
1631 * tree-sra.c (analyze_access_subtree): Likewise.
1632 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
1633 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1634 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
1635 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
1636 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
1637 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
1638 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
1639 (target_for_debug_bind): Likewise.
1640 (insert_debug_temp_for_var_def): Likewise.
1641 (insert_debug_temps_for_defs): Likewise.
1642 (reset_debug_uses): Likewise.
1643 * tree-ssanames.c (release_ssa_name_fn): Likewise.
1644 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
1645 (adjust_debug_stmts): Likewise.
1646 (adjust_phi_and_debug_stmts): Likewise.
1647 (vect_do_peeling): Likewise.
1648 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
1649 (vect_transform_loop): Likewise.
1650 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
1651 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
1652 (MAY_HAVE_DEBUG_STMTS): Use options directly.
1654 2021-11-19 Giuliano Belinassi <gbelinassi@suse.de>
1656 * gcc.c (process_command): Skip dumpdir override if file is a
1658 * doc/invoke.texi: Update -dumpdir documentation.
1660 2021-11-19 Andrew Pinski <apinski@marvell.com>
1662 PR tree-optimization/103314
1663 * match.pd ((type) X op CST): Restrict the equal
1664 TYPE_PRECISION case to GIMPLE only.
1666 2021-11-19 Martin Liska <mliska@suse.cz>
1669 * ipa-modref-tree.h (struct modref_parm_map): Add default
1671 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Use it.
1673 2021-11-19 Richard Biener <rguenther@suse.de>
1675 PR middle-end/103248
1676 * tree-eh.c (operation_could_trap_helper_p): Properly handle
1677 fixed-point RDIV_EXPR.
1679 2021-11-19 Richard Biener <rguenther@suse.de>
1681 PR tree-optimization/102436
1682 * tree-ssa-loop-im.c (execute_sm_if_changed): Add mode
1683 to just create the if structure and return the then block.
1684 (execute_sm): Add flag to indicate the var will re-use
1686 (hoist_memory_references): Support a single conditional
1687 block with all stores as special case.
1689 2021-11-19 Andrew Pinski <apinski@marvell.com>
1691 PR tree-optimization/103317
1692 * tree-ssa-phiopt.c (minmax_replacement): For the non empty
1693 middle bb case, check to make sure it has a single predecessor.
1695 2021-11-19 Andrew Pinski <apinski@marvell.com>
1697 PR tree-optimization/103257
1699 ((m1 >/</>=/<= m2) * d -> (m1 >/</>=/<= m2) ? d : 0):
1700 Disable until !canonicalize_math_p ().
1702 2021-11-19 Marek Polacek <polacek@redhat.com>
1706 * doc/invoke.texi: Update documentation for -Wuninitialized.
1707 * tree.c (stabilize_reference): Set location.
1709 2021-11-19 liuhongt <hongtao.liu@intel.com>
1712 * config/i386/x86-tune-costs.h (skylake_cost): Reduce cost of
1713 storing 256/512-bit SSE register to be equal to cost of
1714 unaligned store to avoid odd alignment peeling.
1715 (icelake_cost): Ditto.
1717 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
1719 * config/rs6000/predicates.md (current_file_function_operand):
1720 Add flag_semantic_interposition to call of decl_replaceable_p.
1722 2021-11-18 Martin Liska <mliska@suse.cz>
1724 * ipa-modref.c (analyze_function): Do not execute the code
1725 only if dump_file != NULL.
1727 2021-11-18 Martin Liska <mliska@suse.cz>
1729 * ipa-modref.c (analyze_function): Use fnode instead of repeated
1730 cgraph_node::get (current_function_decl).
1732 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
1734 * cgraph.c (cgraph_node::get_availability): Update call of
1736 (cgraph_node::verify_node): Verify that semantic_interposition flag
1738 * cgraph.h: (symtab_node): Add semantic_interposition flag.
1739 * cgraphclones.c (set_new_clone_decl_and_node_flags): Clear
1740 semantic_interposition flag.
1741 * cgraphunit.c (cgraph_node::finalize_function): Set
1742 semantic_interposition flag.
1743 (cgraph_node::add_new_function): Likewise.
1744 (varpool_node::finalize_decl): Likewise.
1745 (cgraph_node::create_wrapper): Likewise.
1746 * common.opt (fsemantic-interposition): Turn to optimization node.
1747 * lto-cgraph.c (lto_output_node): Stream semantic_interposition.
1748 (lto_output_varpool_node): Likewise.
1749 (input_overwrite_node): Likewise.
1750 (input_varpool_node): Likewise.
1751 * symtab.c (symtab_node::dump_base): Dump new flag.
1752 * varasm.c (decl_replaceable_p): Add semantic_interposition_p
1754 * varasm.h (decl_replaceable_p): Update declaration.
1755 * varpool.c (varpool_node::ctor_useable_for_folding_p):
1756 Use semantic_interposition flag.
1757 (varpool_node::get_availability): Likewise.
1758 (varpool_node::create_alias): Copy semantic_interposition flag.
1760 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
1763 * ipa-modref.c (modref_eaf_analysis::merge_call_lhs_flags): Unused
1764 parameter may still be returned.
1765 (modref_eaf_analysis::analyze_ssa_name): Call merge_call_lhs_flags
1766 even for unused function args.
1768 2021-11-18 Tamar Christina <tamar.christina@arm.com>
1770 PR tree-optimization/103311
1771 * tree-vect-slp-patterns.c (complex_mul_pattern::matches,
1772 complex_fms_pattern::matches): Check for multiplications.
1774 2021-11-18 H.J. Lu <hjl.tools@gmail.com>
1777 * config/i386/i386.c (ix86_output_jmp_thunk_or_indirect): Emit
1778 CS prefix for -mindirect-branch-cs-prefix.
1779 (ix86_output_indirect_branch_via_reg): Likewise.
1780 * config/i386/i386.opt: Add -mindirect-branch-cs-prefix.
1781 * doc/invoke.texi: Document -mindirect-branch-cs-prefix.
1783 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
1785 * config/rs6000/driver-rs6000.c (detect_processor_aix): Add
1788 2021-11-18 Tamar Christina <tamar.christina@arm.com>
1790 PR tree-optimization/103253
1791 * tree-ssa-math-opts.c (convert_mult_to_fma): Check for LHS.
1793 2021-11-18 Matthias Kretz <m.kretz@gsi.de>
1795 * doc/extend.texi: Document __builtin_assoc_barrier.
1797 2021-11-18 Martin Liska <mliska@suse.cz>
1799 * cfgexpand.c (pass_expand::execute): Use option directly.
1800 * function.c (allocate_struct_function): Likewise.
1801 * gimple-low.c (lower_function_body): Likewise.
1802 (lower_stmt): Likewise.
1803 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
1804 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
1805 * ipa-split.c (split_function): Likewise.
1806 * lto-streamer-in.c (input_function): Likewise.
1807 * sese.c (sese_insert_phis_for_liveouts): Likewise.
1808 * ssa-iterators.h (num_imm_uses): Likewise.
1809 * tree-cfg.c (make_blocks): Likewise.
1810 (gimple_merge_blocks): Likewise.
1811 * tree-inline.c (tree_function_versioning): Likewise.
1812 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
1813 * tree-sra.c (analyze_access_subtree): Likewise.
1814 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
1815 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1816 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
1817 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
1818 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
1819 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
1820 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
1821 (target_for_debug_bind): Likewise.
1822 (insert_debug_temp_for_var_def): Likewise.
1823 (insert_debug_temps_for_defs): Likewise.
1824 (reset_debug_uses): Likewise.
1825 * tree-ssanames.c (release_ssa_name_fn): Likewise.
1826 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
1827 (adjust_debug_stmts): Likewise.
1828 (adjust_phi_and_debug_stmts): Likewise.
1829 (vect_do_peeling): Likewise.
1830 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
1831 (vect_transform_loop): Likewise.
1832 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
1833 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
1834 (MAY_HAVE_DEBUG_STMTS): Use options directly.
1836 2021-11-18 Richard Biener <rguenther@suse.de>
1838 PR tree-optimization/103277
1839 * tree-ssa-dse.c (need_ab_cleanup): New.
1840 (dse_optimize_redundant_stores): Adjust.
1841 (delete_dead_or_redundant_assignment): Get extra
1842 need_ab_cleanup argument and set when abnormal cleanup is
1844 (dse_optimize_call): Adjust.
1845 (dse_optimize_stmt): Likewise.
1846 (pass_dse::execute): Allocate and deallocate need_ab_cleanup.
1847 Perform abnormal cleanup.
1848 * tree-ssa-dse.h (delete_dead_or_redundant_assignment): Adjust.
1850 2021-11-18 Hongyu Wang <hongyu.wang@intel.com>
1852 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
1853 Adjust generated cfg to avoid infinite loop.
1855 2021-11-18 konglin1 <lingling.kong@intel.com>
1857 * config/i386/avx512fp16intrin.h (_mm512_mul_pch): Add alias for _mm512_fmul_pch.
1858 (_mm512_mask_mul_pch): Likewise.
1859 (_mm512_maskz_mul_pch): Likewise.
1860 (_mm512_mul_round_pch): Likewise.
1861 (_mm512_mask_mul_round_pch): Likewise.
1862 (_mm512_maskz_mul_round_pch): Likewise.
1863 (_mm512_cmul_pch): Likewise.
1864 (_mm512_mask_cmul_pch): Likewise.
1865 (_mm512_maskz_cmul_pch): Likewise.
1866 (_mm512_cmul_round_pch): Likewise.
1867 (_mm512_mask_cmul_round_pch): Likewise.
1868 (_mm512_maskz_cmul_round_pch): Likewise.
1869 (_mm_mul_sch): Likewise.
1870 (_mm_mask_mul_sch): Likewise.
1871 (_mm_maskz_mul_sch): Likewise.
1872 (_mm_mul_round_sch): Likewise.
1873 (_mm_mask_mul_round_sch): Likewise.
1874 (_mm_maskz_mul_round_sch): Likewise.
1875 (_mm_cmul_sch): Likewise.
1876 (_mm_mask_cmul_sch): Likewise.
1877 (_mm_maskz_cmul_sch): Likewise.
1878 (_mm_cmul_round_sch): Likewise.
1879 (_mm_mask_cmul_round_sch): Likewise.
1880 (_mm_maskz_cmul_round_sch): Likewise.
1881 * config/i386/avx512fp16vlintrin.h (_mm_mul_pch): Likewise.
1882 (_mm_mask_mul_pch): Likewise.
1883 (_mm_maskz_mul_pch): Likewise.
1884 (_mm256_mul_pch): Likewise.
1885 (_mm256_mask_mul_pch): Likewise.
1886 (_mm256_maskz_mul_pch): Likewise.
1887 (_mm_cmul_pch): Likewise.
1888 (_mm_mask_cmul_pch): Likewise.
1889 (_mm_maskz_cmul_pch): Likewise.
1890 (_mm256_cmul_pch): Likewise.
1891 (_mm256_mask_cmul_pch): Likewise.
1892 (_mm256_maskz_cmul_pch): Likewise.
1894 2021-11-17 Andrew Pinski <apinski@marvell.com>
1896 PR tree-optimization/103228
1897 PR tree-optimization/55177
1898 * match.pd ((type) X bitop CST): Also do this
1899 transformation for nop conversions.
1901 2021-11-17 Martin Sebor <msebor@redhat.com>
1903 PR tree-optimization/102759
1904 * gimple-array-bounds.cc (build_printable_array_type): Move...
1905 * gimple-ssa-warn-access.cc (build_printable_array_type): Avoid
1906 pathological function redeclarations that remove a previously
1908 Improve formatting of function arguments in informational notes.
1909 * pointer-query.cc (build_printable_array_type): ...to here.
1910 * pointer-query.h (build_printable_array_type): Declared.
1912 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
1915 * config/i386/i386-opts.h (harden_sls): New enum.
1916 * config/i386/i386.c (output_indirect_thunk): Mitigate against
1917 SLS for function return.
1918 (ix86_output_function_return): Likewise.
1919 (ix86_output_jmp_thunk_or_indirect): Mitigate against indirect
1921 (ix86_output_indirect_jmp): Likewise.
1922 (ix86_output_call_insn): Likewise.
1923 * config/i386/i386.opt: Add -mharden-sls=.
1924 * doc/invoke.texi: Document -mharden-sls=.
1926 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
1929 * config/i386/i386.c (ix86_code_end): Remove "%!" before ret.
1930 (ix86_output_function_return): Likewise.
1931 * config/i386/i386.md (simple_return_pop_internal): Likewise.
1933 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
1936 * ipa-modref.c (read_modref_records): Fix streaminig in of every_access
1939 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
1941 * config/i386/i386.c (indirect_thunks_used): Redefine as HARD_REG_SET.
1942 (ix86_code_end): Use TEST_HARD_REG_BIT on indirect_thunks_used.
1943 (ix86_output_indirect_branch_via_reg): Use SET_HARD_REG_BIT
1944 on indirect_thunks_used.
1945 (ix86_output_indirect_function_return): Ditto.
1947 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
1949 * ipa-modref-tree.c: Include cgraph.h and tree-streamer.h.
1950 (modref_access_node::stream_out): New member function.
1951 (modref_access_node::stream_in): New member function.
1952 * ipa-modref-tree.h (modref_access_node::stream_out,
1953 modref_access_node::stream_in): Declare.
1954 * ipa-modref.c (modref_summary_lto::useful_p): Free useless kills.
1955 (modref_summary_lto::dump): Dump kills.
1956 (analyze_store): Record kills for LTO
1957 (analyze_stmt): Likewise.
1958 (modref_summaries_lto::duplicate): Duplicate kills.
1959 (write_modref_records): Use new stream_out member function.
1960 (read_modref_records): Likewise.
1961 (modref_write): Stream out kills.
1962 (read_section): Stream in kills
1963 (remap_kills): New function.
1964 (update_signature): Use it.
1966 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
1968 * config/i386/i386.h (LEGACY_SSE_REGNO_P): New predicate.
1969 (SSE_REGNO_P): Use LEGACY_SSE_REGNO_P predicate.
1970 * config/i386/i386.c (zero_all_vector_registers):
1971 Use LEGACY_SSE_REGNO_P predicate.
1972 (ix86_register_priority): Use REX_INT_REGNO_P, REX_SSE_REGNO_P
1973 and EXT_REG_SSE_REGNO_P predicates.
1974 (ix86_hard_regno_call_part_clobbered): Use REX_SSE_REGNO_P
1975 and LEGACY_SSE_REGNO_P predicates.
1977 2021-11-17 Jason Merrill <jason@redhat.com>
1979 * doc/invoke.texi (C++ Dialect Options): Document
1980 -fimplicit-constexpr.
1982 2021-11-17 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
1984 * config/aarch64/aarch64-modes.def (VECTOR_MODE): New V8DI mode.
1985 * config/aarch64/aarch64.c (aarch64_hard_regno_mode_ok): Handle
1987 * config/aarch64/iterators.md (define_mode_attr nunits): Add entry
1990 2021-11-17 Martin Uecker <uecker@gcc.gnu.org>
1994 * gimplify.c (gimplify_var_or_parm_decl): Update comment.
1995 (gimplify_compound_lval): Gimplify base expression first.
1996 (gimplify_target_expr): Add comment.
1998 2021-11-17 Jakub Jelinek <jakub@redhat.com>
2000 PR tree-optimization/103192
2001 * tree-ssa-loop-im.c (move_computations_worker): Use
2002 reset_flow_sensitive_info instead of manually clearing
2003 SSA_NAME_RANGE_INFO and do it for all SSA_NAMEs, not just ones
2004 with integral types.
2006 2021-11-17 Jakub Jelinek <jakub@redhat.com>
2008 PR tree-optimization/103255
2009 * gimple-range-fold.cc (fold_using_range::range_of_address): Return
2010 range_nonzero rather than unadjusted base's range. Formatting fixes.
2012 2021-11-17 Richard Sandiford <richard.sandiford@arm.com>
2014 * doc/md.texi (cond_fmin@var{mode}, cond_fmax@var{mode}): Document.
2015 * optabs.def (cond_fmin_optab, cond_fmax_optab): New optabs.
2016 * internal-fn.def (COND_FMIN, COND_FMAX): New functions.
2017 * internal-fn.c (first_commutative_argument): Handle them.
2018 (FOR_EACH_COND_FN_PAIR): Likewise.
2019 * match.pd (UNCOND_BINARY, COND_BINARY): Likewise.
2020 * config/aarch64/aarch64-sve.md (cond_<fmaxmin><mode>): New
2023 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
2025 * config/i386/i386.md (*add<dwi>3_doubleword, *addv<dwi>4_doubleword,
2026 *addv<dwi>4_doubleword_1, *sub<dwi>3_doubleword,
2027 *subv<dwi>4_doubleword, *subv<dwi>4_doubleword_1,
2028 *add<dwi>3_doubleword_cc_overflow_1, *divmodsi4_const,
2029 *neg<dwi>2_doubleword, *tls_dynamic_gnu2_combine_64_<mode>): Fix split
2032 2021-11-17 Andrew Pinski <apinski@marvell.com>
2034 PR tree-optimization/103288
2035 * tree-ssa-phiopt.c (value_replacement): Return early if middle
2036 block has more than one pred.
2038 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
2040 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
2041 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
2042 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
2043 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
2044 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
2045 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
2046 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
2047 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
2049 2021-11-17 Marek Polacek <polacek@redhat.com>
2051 PR preprocessor/103026
2052 * doc/invoke.texi: Document -Wbidi-chars.
2054 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
2057 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Fix clearing
2060 2021-11-16 Jan Hubicka <jh@suse.cz>
2062 * ipa-modref.c (get_modref_function_summary): Declare.
2063 * ipa-modref.h (get_modref_function_summary): New function.
2064 * tree-ssa-dse.c (clear_live_bytes_for_ref): Break out from ...
2065 (clear_bytes_written_by): ... here; also clear memory killed by
2068 2021-11-16 Iain Sandoe <iain@sandoe.co.uk>
2070 * ggc-common.c (gt_pch_save): If we cannot find a suitable
2071 memory segment for save, then error-out, do not try to
2073 (gt_pch_restore): Save the existing line table, and when
2074 the replacement is being read, use that when constructing
2077 2021-11-16 Peter Bergner <bergner@linux.ibm.com>
2080 * config/rs6000/mma.md (*vsx_assemble_pair): Add early-clobber for
2082 (*mma_assemble_acc): Likewise.
2084 2021-11-16 Martin Sebor <msebor@redhat.com>
2086 PR tree-optimization/102960
2087 * gimple-fold.c (get_range_strlen): Take bitmap as an argument rather
2088 than a pointer to it.
2089 (get_range_strlen_tree): Same. Remove bitmap allocation. Use
2091 (get_maxval_strlen): Use an auto_bitmap.
2092 * tree-ssa-strlen.c (get_range_strlen_dynamic): Factor out PHI
2094 (get_range_strlen_phi): ...into this function.
2095 Avoid assuming maximum string length is constant
2096 (printf_strlen_execute): Dump pointer query cache contents when
2097 details are requisted.
2099 2021-11-16 Jason Merrill <jason@redhat.com>
2101 * langhooks.h (struct lang_hooks): Adjust comment.
2102 * print-tree.c (print_node): Also call print_xnode hook for
2105 2021-11-16 Andrew Pinski <apinski@marvell.com>
2107 PR tree-optimization/103218
2108 * match.pd: New pattern for "((type)(a<0)) << SIGNBITOFA".
2110 2021-11-16 Claudiu Zissulescu <claziss@synopsys.com>
2112 * config/arc/arc.md (maddhisi4): Use a single move to accumulator.
2113 (umaddhisi4): Likewise.
2114 (machi): Update pattern.
2117 2021-11-16 Richard Biener <rguenther@suse.de>
2119 PR tree-optimization/102880
2120 * tree-ssa-dce.c (sort_phi_args): New function.
2121 (make_forwarders_with_degenerate_phis): Likewise.
2122 (perform_tree_ssa_dce): Call
2123 make_forwarders_with_degenerate_phis.
2125 2021-11-16 Richard Biener <rguenther@suse.de>
2127 PR tree-optimization/102880
2128 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Push
2129 single_pred (bb1) condition to places that really need it.
2130 (match_simplify_replacement): Likewise.
2131 (value_replacement): Likewise.
2132 (replace_phi_edge_with_variable): Deal with extra edges
2135 2021-11-16 Martin Jambor <mjambor@suse.cz>
2137 * cfgexpand.c (expand_gimple_basic_block): Use build_debug_expr_decl,
2138 add a fixme note about the mode assignment perhaps being unnecessary.
2139 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
2141 (ipa_param_body_adjustments::mark_dead_statements): Likewise.
2142 (ipa_param_body_adjustments::reset_debug_stmts): Likewise.
2143 * tree-inline.c (remap_ssa_name): Likewise.
2144 (tree_function_versioning): Likewise.
2145 * tree-into-ssa.c (rewrite_debug_stmt_uses): Likewise.
2146 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
2147 * tree-ssa.c (insert_debug_temp_for_var_def): Likewise.
2149 2021-11-16 Jakub Jelinek <jakub@redhat.com>
2151 PR tree-optimization/103208
2152 * omp-expand.c (expand_omp_build_cond): New function.
2153 (expand_omp_for_init_counts, expand_omp_for_init_vars,
2154 expand_omp_for_static_nochunk, expand_omp_for_static_chunk): Use it.
2156 2021-11-16 Jakub Jelinek <jakub@redhat.com>
2158 PR tree-optimization/102009
2159 * gimple-ssa-warn-access.cc (pass_waccess::check_alloc_size_call):
2160 Punt if any of alloc_size arguments is out of bounds vs. number of
2163 2021-11-16 Roger Sayle <roger@nextmovesoftware.com>
2165 * config/i386/i386.md (*bmi2_rorx<mode3>_1): Make conditional
2166 on !optimize_function_for_size_p.
2167 (*<any_rotate><mode>3_1): Add preferred_for_size attribute.
2168 (define_splits): Conditionalize on !optimize_function_for_size_p.
2169 (*bmi2_rorxsi3_1_zext): Likewise.
2170 (*<any_rotate>si2_1_zext): Add preferred_for_size attribute.
2171 (define_splits): Conditionalize on !optimize_function_for_size_p.
2173 2021-11-16 Jan Hubicka <jh@suse.cz>
2176 * ipa-modref.c (merge_call_side_effects): Fix uninitialized
2179 2021-11-16 Andrew Pinski <apinski@marvell.com>
2181 PR tree-optimization/103245
2182 * match.pd: Combine the abs pattern matching using multiplication.
2183 Adding optional nop_convert too.
2185 2021-11-16 H.J. Lu <hjl.tools@gmail.com>
2187 PR middle-end/103268
2188 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Add a missing
2191 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2193 * gimple-fold.c (gimple_fold_builtin_strncat): Use ranges to
2194 determine if it is safe to transform to strcat.
2195 (gimple_fold_builtin_snprintf): Likewise.
2197 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2199 * gimple-fold.c (known_lower): New function.
2200 (gimple_fold_builtin_strncat_chk,
2201 gimple_fold_builtin_memory_chk, gimple_fold_builtin_stxcpy_chk,
2202 gimple_fold_builtin_stxncpy_chk,
2203 gimple_fold_builtin_snprintf_chk,
2204 gimple_fold_builtin_sprintf_chk): Use it.
2206 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2208 * gimple-fold.c (dump_transformation): New function.
2209 (gimple_fold_builtin_stxcpy_chk,
2210 gimple_fold_builtin_stxncpy_chk): Use it. Simplify to
2211 BUILT_IN_STRNCPY if return value is not used.
2213 2021-11-15 H.J. Lu <hjl.tools@gmail.com>
2215 PR middle-end/103184
2216 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Check optab
2217 before transforming equivalent, but slighly different cases to
2218 their canonical forms.
2220 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
2223 * config/darwin.h (TARGET_DTORS_FROM_CXA_ATEXIT): New.
2224 * doc/tm.texi: Regenerated.
2225 * doc/tm.texi.in: Add TARGET_DTORS_FROM_CXA_ATEXIT hook.
2226 * ipa.c (cgraph_build_static_cdtor_1): Return the built
2228 (build_cxa_atexit_decl): New.
2229 (build_dso_handle_decl): New.
2230 (build_cxa_dtor_registrations): New.
2231 (compare_cdtor_tu_order): New.
2232 (build_cxa_atexit_fns): New.
2233 (ipa_cdtor_merge): If dtors_from_cxa_atexit is set,
2234 process the DTORs/CTORs accordingly.
2235 (pass_ipa_cdtor_merge::gate): Also run if
2236 dtors_from_cxa_atexit is set.
2237 * target.def (dtors_from_cxa_atexit): New hook.
2239 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
2241 * config.in: Regenerate.
2242 * configure: Regenerate.
2243 * configure.ac: Test ld64 for -platform-version support.
2245 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
2247 * diagnostic-spec.h (typedef xint_hash_t)
2248 (typedef xint_hash_map_t): Replace with...
2249 (typedef nowarn_map_t): ... this.
2250 (nowarn_map): Adjust.
2251 * diagnostic-spec.c (nowarn_map, suppress_warning_at): Likewise.
2253 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
2255 * profile.c (branch_prob): Use 'location_hash' for
2258 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2260 PR tree-optimization/103207
2261 * value-range.cc (irange::set): Drop overflow.
2263 2021-11-15 Jakub Jelinek <jakub@redhat.com>
2265 * gimplify.c (optimize_target_teams): Only add OMP_CLAUSE_THREAD_LIMIT
2266 to OMP_TARGET_CLAUSES if it isn't there already.
2268 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2270 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
2271 Remove useless code.
2272 (path_range_query::ssa_defined_in_bb): New.
2273 (path_range_query::ssa_range_in_phi): Avoid fold_range call that
2274 could trigger additional lookups.
2275 Do not use the cache for ARGs defined in this block.
2276 (path_range_query::compute_ranges_in_block): Use ssa_defined_in_bb.
2277 (path_range_query::maybe_register_phi_relation): Same.
2278 (path_range_query::range_of_stmt): Adjust comment.
2279 * gimple-range-path.h (ssa_defined_in_bb): New.
2281 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2283 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
2284 Default to global range if nothing found.
2286 2021-11-15 Richard Biener <rguenther@suse.de>
2288 PR tree-optimization/103237
2289 * tree-vect-loop.c (vect_is_simple_reduction): Fail for
2290 double reductions with multiple inner loop LC PHI nodes.
2292 2021-11-15 Hongyu Wang <hongyu.wang@intel.com>
2295 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
2296 New expand function.
2297 * config/i386/i386-options.c (ix86_target_string): Add
2298 -mrelax-cmpxchg-loop flag.
2299 (ix86_valid_target_attribute_inner_p): Likewise.
2300 * config/i386/i386-protos.h (ix86_expand_atomic_fetch_op_loop):
2301 New expand function prototype.
2302 * config/i386/i386.opt: Add -mrelax-cmpxchg-loop.
2303 * config/i386/sync.md (atomic_fetch_<logic><mode>): New expander
2305 (atomic_<logic>_fetch<mode>): Likewise.
2306 (atomic_fetch_nand<mode>): Likewise.
2307 (atomic_nand_fetch<mode>): Likewise.
2308 (atomic_fetch_<logic><mode>): New expander for DI,TI modes.
2309 (atomic_<logic>_fetch<mode>): Likewise.
2310 (atomic_fetch_nand<mode>): Likewise.
2311 (atomic_nand_fetch<mode>): Likewise.
2312 * doc/invoke.texi: Document -mrelax-cmpxchg-loop.
2314 2021-11-15 Richard Biener <rguenther@suse.de>
2316 PR tree-optimization/103219
2317 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Use single_exit
2318 to determine the exit for the VN region.
2320 2021-11-15 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
2322 * tree-ssa-loop.c (pass_vectorize): Move to tree-vectorizer.c.
2323 (pass_data_vectorize): Likewise.
2324 (make_pass_vectorize): Likewise.
2325 * tree-vectorizer.c (vectorize_loops): Merge with
2326 pass_vectorize::execute and replace cfun occurences with fun param.
2327 (adjust_simduid_builtins): Add fun param, replace cfun occurences with
2328 fun, and adjust callers approrpiately.
2329 (note_simd_array_uses): Likewise.
2330 (vect_loop_dist_alias_call): Likewise.
2331 (set_uid_loop_bbs): Likewise.
2332 (vect_transform_loops): Likewise.
2333 (try_vectorize_loop_1): Likewise.
2334 (try_vectorize_loop): Likewise.
2336 2021-11-15 Jakub Jelinek <jakub@redhat.com>
2339 * config/i386/sync.md (atomic_bit_test_and_set<mode>,
2340 atomic_bit_test_and_complement<mode>,
2341 atomic_bit_test_and_reset<mode>): Use OPTAB_WIDEN instead of
2344 2021-11-15 Maciej W. Rozycki <macro@embecosm.com>
2346 * config/vax/vax.c (vax_lra_p): New prototype and function.
2347 (TARGET_LRA_P): Wire it.
2348 * config/vax/vax.opt (mlra): New option.
2349 * doc/invoke.texi (Option Summary, VAX Options): Document the
2352 2021-11-14 Jan Hubicka <jh@suse.cz>
2354 * ipa-modref.h (struct modref_summary): Add nondeterministic
2355 and calls_interposable flags.
2356 * ipa-modref.c (modref_summary::modref_summary): Initialize new flags.
2357 (modref_summary::useful_p): Check new flags.
2358 (struct modref_summary_lto): Add nondeterministic and
2359 calls_interposable flags.
2360 (modref_summary_lto::modref_summary_lto): Initialize new flags.
2361 (modref_summary_lto::useful_p): Check new flags.
2362 (modref_summary::dump): Dump new flags.
2363 (modref_summary_lto::dump): Dump new flags.
2364 (ignore_nondeterminism_p): New function.
2365 (merge_call_side_effects): Merge new flags.
2366 (process_fnspec): Likewise.
2367 (analyze_load): Volatile access is nondeterministic.
2368 (analyze_store): Liekwise.
2369 (analyze_stmt): Volatile ASM is nondeterministic.
2370 (analyze_function): Clear new flags.
2371 (modref_summaries::duplicate): Duplicate new flags.
2372 (modref_summaries_lto::duplicate): Duplicate new flags.
2373 (modref_write): Stream new flags.
2374 (read_section): Stream new flags.
2375 (propagate_unknown_call): Update new flags.
2376 (modref_propagate_in_scc): Propagate new flags.
2377 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Check
2379 * tree-ssa-structalias.c (determine_global_memory_access):
2382 2021-11-14 Maciej W. Rozycki <macro@embecosm.com>
2384 * config/vax/vax.h (SET_RATIO): New macro.
2385 * config/vax/vax.md (UNSPEC_SETMEM_FILL): New constant.
2386 (setmemhi): New expander.
2387 (setmemhi1): New insn and splitter.
2388 (*setmemhi1): New insn.
2390 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
2392 * ipa-modref-tree.c (modref_access_node::update_for_kills): New
2394 (modref_access_node::merge_for_kills): Likewise.
2395 (modref_access_node::insert_kill): Likewise.
2396 * ipa-modref-tree.h (modref_access_node::update_for_kills,
2397 modref_access_node::merge_for_kills, modref_access_node::insert_kill):
2399 (modref_access_node::useful_for_kill): New member function.
2400 * ipa-modref.c (modref_summary::useful_p): Release useless kills.
2401 (lto_modref_summary): Add kills.
2402 (modref_summary::dump): Dump kills.
2403 (record_access): Add mdoref_access_node parameter.
2404 (record_access_lto): Likewise.
2405 (merge_call_side_effects): Merge kills.
2406 (analyze_call): Add ALWAYS_EXECUTED param and pass it around.
2407 (struct summary_ptrs): Add always_executed filed.
2408 (analyze_load): Update.
2409 (analyze_store): Update; record kills.
2410 (analyze_stmt): Add always_executed; record kills in clobbers.
2411 (analyze_function): Track always_executed.
2412 (modref_summaries::duplicate): Duplicate kills.
2413 (update_signature): Release kills.
2414 * ipa-modref.h (struct modref_summary): Add kills.
2415 * tree-ssa-alias.c (alias_stats): Add kill stats.
2416 (dump_alias_stats): Dump kill stats.
2417 (store_kills_ref_p): Break out from ...
2418 (stmt_kills_ref_p): Use it; handle modref info based kills.
2420 2021-11-14 Aldy Hernandez <aldyh@redhat.com>
2422 PR tree-optimization/103229
2423 * gimple-range-cache.cc (ssa_global_cache::clear): Do not pass
2424 null value to memset.
2426 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
2428 * ipa-modref-tree.c (modref_access_node::get_call_arg): New member
2430 (modref_access_node::get_ao_ref): Likewise.
2431 * ipa-modref-tree.h (modref_access_node::get_call_arg): Declare.
2432 (modref_access_node::get_ao_ref): Declare.
2433 * tree-ssa-alias.c (modref_may_conflict): Use new accessors.
2434 * tree-ssa-dse.c (dse_optimize_call): Use new accessors.
2436 2021-11-13 Jan Hubicka <jh@suse.cz>
2439 * dbgcnt.def (ipa_attr): New counters.
2440 * ipa-pure-const.c: Include dbgcnt.c
2441 (ipa_make_function_const): Use debug counter.
2442 (ipa_make_function_pure): Likewise.
2443 (propagate_pure_const): Fix bug in my previous change.
2445 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2447 * ipa-modref-tree.c (modref_access_node::range_info_useful_p):
2448 Offline from ipa-modref-tree.h.
2449 (modref_access_node::dump): Move from ipa-modref.c; make member
2451 * ipa-modref-tree.h (modref_access_node::range_info_useful_p.
2452 modref_access_node::dump): Declare.
2453 * ipa-modref.c (dump_access): Remove.
2454 (dump_records): Update.
2455 (dump_lto_records): Update.
2456 (record_access): Update.
2457 (record_access_lto): Update.
2459 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2461 * ipa-modref.c (modref_summary::modref_summary): Clear new flags.
2462 (modref_summary::dump): Dump try_dse.
2463 (modref_summary::finalize): Add FUN attribute; compute try-dse.
2464 (analyze_function): Update.
2465 (read_section): Update.
2466 (update_signature): Update.
2467 (pass_ipa_modref::execute): Update.
2468 * ipa-modref.h (struct modref_summary):
2469 * tree-ssa-alias.c (ao_ref_init_from_ptr_and_range): Export.
2470 * tree-ssa-alias.h (ao_ref_init_from_ptr_and_range): Declare.
2471 * tree-ssa-dse.c (dse_optimize_call): New function.
2472 (dse_optimize_stmt): Use it.
2474 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2476 * ipa-modref-tree.c: Move #if CHECKING_P to proper place.
2478 2021-11-13 Jan Hubicka <jh@suse.cz>
2481 (struct modref_access_node): Move longer member functions to
2483 (modref_ref_node::try_merge_with): Turn into modreef_acces_node member
2485 * ipa-modref-tree.c (modref_access_node::contains): Move here
2486 from ipa-modref-tree.h.
2487 (modref_access_node::update): Likewise.
2488 (modref_access_node::merge): Likewise.
2489 (modref_access_node::closer_pair_p): Likewise.
2490 (modref_access_node::forced_merge): Likewise.
2491 (modref_access_node::update2): Likewise.
2492 (modref_access_node::combined_offsets): Likewise.
2493 (modref_access_node::try_merge_with): Likewise.
2494 (modref_access_node::insert): Likewise.
2496 2021-11-13 Jan Hubicka <jh@suse.cz>
2498 * ipa-modref.c (modref_summary::global_memory_read_p): Remove.
2499 (modref_summary::global_memory_written_p): Remove.
2500 (modref_summary::dump): Dump new flags.
2501 (modref_summary::finalize): New member function.
2502 (analyze_function): Call it.
2503 (read_section): Call it.
2504 (update_signature): Call it.
2505 (pass_ipa_modref::execute): Call it.
2506 * ipa-modref.h (struct modref_summary): Remove
2507 global_memory_read_p and global_memory_written_p.
2508 Add global_memory_read, global_memory_written.
2509 * tree-ssa-structalias.c (determine_global_memory_access):
2512 2021-11-13 Jan Hubicka <jh@suse.cz>
2514 * ipa-fnsummary.c (compute_fn_summary): Use type_attribut_allowed_p
2515 * ipa-param-manipulation.c
2516 (ipa_param_adjustments::type_attribute_allowed_p):
2517 New member function.
2518 (drop_type_attribute_if_params_changed_p): New function.
2519 (build_adjusted_function_type): Use it.
2520 * ipa-param-manipulation.h: Add type_attribute_allowed_p.
2522 2021-11-13 David Malcolm <dmalcolm@redhat.com>
2524 * doc/invoke.texi (Static Analyzer Options): Add
2525 -Wno-analyzer-tainted-allocation-size,
2526 -Wno-analyzer-tainted-divisor, -Wno-analyzer-tainted-offset, and
2527 -Wno-analyzer-tainted-size to list. Add
2528 -Wanalyzer-tainted-allocation-size, -Wanalyzer-tainted-divisor,
2529 -Wanalyzer-tainted-offset, and -Wanalyzer-tainted-size to list
2530 of options effectively enabled by -fanalyzer.
2531 (-Wanalyzer-tainted-allocation-size): New.
2532 (-Wanalyzer-tainted-array-index): Tweak wording; add link to CWE.
2533 (-Wanalyzer-tainted-divisor): New.
2534 (-Wanalyzer-tainted-offset): New.
2535 (-Wanalyzer-tainted-size): New.
2537 2021-11-13 Jan Hubicka <jh@suse.cz>
2539 * attr-fnspec.h (attr_fnspec::arg_eaf_flags): Break out from ...
2540 * gimple.c (gimple_call_arg_flags): ... here.
2541 * ipa-modref.c (analyze_parms): Record flags known from fnspec.
2542 (modref_merge_call_site_flags): Use arg_eaf_flags.
2544 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
2546 PR tree-optimization/103222
2547 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
2549 (path_range_query::compute_ranges_in_block): Call
2550 compute_ranges_in_phis.
2551 * gimple-range-path.h (path_range_query::compute_ranges_in_phis):
2554 2021-11-13 Jan Hubicka <jh@suse.cz>
2556 * ipa-fnsummary.c (compute_fn_summary): Do not give up on signature
2557 changes on "fn spec" attribute; give up on varadic types.
2558 * ipa-param-manipulation.c: Include attribs.h.
2559 (build_adjusted_function_type): New parameter ARG_MODIFIED; if it is
2560 true remove "fn spec" attribute.
2561 (ipa_param_adjustments::build_new_function_type): Update.
2562 (ipa_param_body_adjustments::modify_formal_parameters): update.
2563 * ipa-sra.c: Include attribs.h.
2564 (ipa_sra_preliminary_function_checks): Do not check for TYPE_ATTRIBUTES.
2566 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
2568 * gimple-range-path.cc (path_range_query::path_range_query): Merge
2570 (path_range_query::import_p): Move from header file.
2571 (path_range_query::~path_range_query): Adjust for combined ctors.
2572 * gimple-range-path.h: Merge ctors.
2573 (path_range_query::import_p): Move to .cc file.
2575 2021-11-13 Jan Hubicka <jh@suse.cz>
2577 PR tree-optimization/103182
2578 * ipa-modref.c (callee_to_caller_flags): Fix merging of flags.
2579 (modref_eaf_analysis::analyze_ssa_name): Fix merging of flags.
2581 2021-11-12 Stafford Horne <shorne@gmail.com>
2583 * config/or1k/or1k-protos.h (or1k_profile_hook): New function.
2584 * config/or1k/or1k.h (PROFILE_HOOK): Change macro to reference
2585 new function or1k_profile_hook.
2586 * config/or1k/or1k.c (struct machine_function): Add new field
2587 set_mcount_arg_insn.
2588 (or1k_profile_hook): New function.
2589 (or1k_init_pic_reg): Update to inject pic rtx after _mcount arg
2591 (or1k_frame_pointer_required): Frame pointer no longer needed
2594 2021-11-12 Jan Hubicka <jh@suse.cz>
2596 PR tree-optimization/103209
2597 * tree-ssa-structalias.c (find_func_aliases_for_call): Fix
2598 use of handle_rhs_call
2600 2021-11-12 Aldy Hernandez <aldyh@redhat.com>
2602 PR tree-optimization/103202
2603 * gimple-range-path.cc
2604 (path_range_query::compute_ranges_in_block): Solve PHI imports first.
2606 2021-11-12 Jan Hubicka <jh@suse.cz>
2608 * ipa-pure-const.c (propagate_pure_const): Remove redundant check;
2609 fix call of ipa_make_function_const and ipa_make_function_pure.
2611 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2613 * config/aarch64/aarch64.c (aarch64_vector_op::n_advsimd_ops): Delete.
2614 (aarch64_vector_op::m_seen_loads): Likewise.
2615 (aarch64_vector_costs::aarch64_vector_costs): Don't push to
2617 (aarch64_vector_op::count_ops): Remove vectype and factor parameters.
2618 Remove code that tries to predict different vec_flags from the
2620 (aarch64_vector_costs::add_stmt_cost): Update accordingly.
2621 Remove m_advsimd_ops handling.
2623 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2625 * config/aarch64/aarch64.c (aarch64_vector_costs::m_saw_sve_only_op)
2626 (aarch64_sve_only_stmt_p): Delete.
2627 (aarch64_vector_costs::prefer_unrolled_loop): New function,
2628 extracted from adjust_body_cost.
2629 (aarch64_vector_costs::better_main_loop_than_p): New function,
2630 using heuristics extracted from adjust_body_cost and
2631 adjust_body_cost_sve.
2632 (aarch64_vector_costs::adjust_body_cost_sve): Remove
2633 advsimd_cycles_per_iter and could_use_advsimd parameters.
2634 Update after changes above.
2635 (aarch64_vector_costs::adjust_body_cost): Update after changes above.
2637 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2639 * config/aarch64/aarch64.c (aarch64_vec_op_count::m_vf_factor):
2640 New member variable.
2641 (aarch64_vec_op_count::aarch64_vec_op_count): Add a parameter for it.
2642 (aarch64_vec_op_count::vf_factor): New function.
2643 (aarch64_vector_costs::aarch64_vector_costs): When costing for
2644 neoverse-512tvb, pass a vf_factor of 2 for the Neoverse V1 version
2646 (aarch64_vector_costs::adjust_body_cost): Read the vf factor
2647 instead of hard-coding 2.
2649 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2651 * config/aarch64/aarch64.c
2652 (aarch64_vec_op_count::rename_cycles_per_iter): New function.
2653 (aarch64_vec_op_count::min_nonpred_cycles_per_iter): Likewise.
2654 (aarch64_vec_op_count::min_pred_cycles_per_iter): Likewise.
2655 (aarch64_vec_op_count::min_cycles_per_iter): Likewise.
2656 (aarch64_vec_op_count::dump): Move earlier in file. Dump the
2657 above properties too.
2658 (aarch64_estimate_min_cycles_per_iter): Delete.
2659 (adjust_body_cost): Use aarch64_vec_op_count::min_cycles_per_iter
2660 instead of aarch64_estimate_min_cycles_per_iter. Rely on the dump
2661 routine to print CPI estimates.
2662 (adjust_body_cost_sve): Likewise. Use the other functions above
2663 instead of doing the work inline.
2665 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2667 * config/aarch64/aarch64.c (aarch64_vec_op_count): Allow default
2669 (aarch64_vec_op_count::base_issue_info): Remove handling of null
2671 (aarch64_vec_op_count::simd_issue_info): Likewise.
2672 (aarch64_vec_op_count::sve_issue_info): Likewise.
2673 (aarch64_vector_costs::m_ops): Turn into a vector.
2674 (aarch64_vector_costs::m_advsimd_ops): Likewise.
2675 (aarch64_vector_costs::aarch64_vector_costs): Add entries to
2676 the vectors based on aarch64_tune_params.
2677 (aarch64_vector_costs::analyze_loop_vinfo): Update the pred_ops
2678 of all entries in m_ops.
2679 (aarch64_vector_costs::add_stmt_cost): Call count_ops for all
2681 (aarch64_estimate_min_cycles_per_iter): Remove issue_info
2682 parameter and get the information from the ops instead.
2683 (aarch64_vector_costs::adjust_body_cost_sve): Take a
2684 aarch64_vec_issue_info instead of a aarch64_vec_op_count.
2685 (aarch64_vector_costs::adjust_body_cost): Update call accordingly.
2686 Exit earlier if m_ops is empty for either cost structure.
2688 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2690 * config/aarch64/aarch64.c (aarch64_vector_costs::m_scalar_ops)
2691 (aarch64_vector_costs::m_sve_ops): Replace with...
2692 (aarch64_vector_costs::m_ops): ...this.
2693 (aarch64_vector_costs::analyze_loop_vinfo): Update accordingly.
2694 (aarch64_vector_costs::adjust_body_cost_sve): Likewise.
2695 (aarch64_vector_costs::aarch64_vector_costs): Likewise.
2696 Initialize m_vec_flags here rather than in add_stmt_cost.
2697 (aarch64_vector_costs::count_ops): Test for scalar reductions too.
2698 Allow vectype to be null.
2699 (aarch64_vector_costs::add_stmt_cost): Call count_ops for scalar
2700 code too. Don't require vectype to be nonnull.
2701 (aarch64_vector_costs::adjust_body_cost): Take the loop_vec_info
2702 and scalar costs as parameters. Use the scalar costs to determine
2703 the cycles per iteration of the scalar loop, then multiply it
2704 by the estimated VF.
2705 (aarch64_vector_costs::finish_cost): Update call accordingly.
2707 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2709 * config/aarch64/aarch64.c (aarch64_dr_type): New function.
2710 (aarch64_vector_costs::count_ops): Use it rather than the
2711 vectype to determine floatness.
2713 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2715 * config/aarch64/aarch64.c (aarch64_sve_in_loop_reduction_latency):
2716 Remove vectype parameter and get floatness from the type of the
2718 (arch64_in_loop_reduction_latency): Likewise.
2719 (aarch64_detect_vector_stmt_subtype): Update caller.
2720 (aarch64_vector_costs::count_ops): Likewise.
2722 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2724 * config/aarch64/aarch64.c (aarch64_sve_op_count): Fold into...
2725 (aarch64_vec_op_count): ...this. Add a constructor.
2726 (aarch64_vec_op_count::vec_flags): New function.
2727 (aarch64_vec_op_count::base_issue_info): Likewise.
2728 (aarch64_vec_op_count::simd_issue_info): Likewise.
2729 (aarch64_vec_op_count::sve_issue_info): Likewise.
2730 (aarch64_vec_op_count::m_issue_info): New member variable.
2731 (aarch64_vec_op_count::m_vec_flags): Likewise.
2732 (aarch64_vector_costs): Add a constructor.
2733 (aarch64_vector_costs::m_sve_ops): Change type to aarch64_vec_op_count.
2734 (aarch64_vector_costs::aarch64_vector_costs): New function.
2735 Initialize m_scalar_ops, m_advsimd_ops and m_sve_ops.
2736 (aarch64_vector_costs::count_ops): Remove vec_flags and
2737 issue_info parameters, using the new aarch64_vec_op_count
2739 (aarch64_vector_costs::add_stmt_cost): Update call accordingly.
2740 (aarch64_sve_op_count::dump): Fold into...
2741 (aarch64_vec_op_count::dump): ..here.
2743 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2745 * config/aarch64/aarch64.c: Include tree-dfa.h.
2746 (aarch64_check_consecutive_mems): New function that takes MEM_EXPR
2747 and MEM_OFFSET into account.
2748 (aarch64_swap_ldrstr_operands): Use it.
2749 (aarch64_operands_ok_for_ldpstp): Likewise. Check that the
2750 address of the second memory doesn't depend on the result of
2753 2021-11-12 Jan Hubicka <hubicka@ucw.cz>
2755 * ipa-pure-const.c (ipa_make_function_pure): Fix exit condition.
2757 2021-11-12 Jan Hubicka <jh@suse.cz>
2759 PR tree-optimization/103175
2760 * ipa-modref.c (modref_lattice::merge): Add sanity check.
2761 (callee_to_caller_flags): Make flags adjustment sane.
2762 (modref_eaf_analysis::analyze_ssa_name): Likewise.
2764 2021-11-12 Jan Hubicka <jh@suse.cz>
2767 * ipa-modref.c (analyze_function, modref_propagate_in_scc): Do
2768 not mark pure/const function if there are side-effects.
2770 2021-11-12 Chung-Lin Tang <cltang@codesourcery.com>
2772 * tree.h (OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P): New access macro for
2773 'implicit' bit, using 'base.deprecated_flag' field of tree_node.
2774 * tree-pretty-print.c (dump_omp_clause): Add support for printing
2775 implicit attribute in tree dumping.
2776 * gimplify.c (gimplify_adjust_omp_clauses_1):
2777 Set OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P to 1 if map clause is implicitly
2779 (gimplify_adjust_omp_clauses): Adjust place of adding implicitly created
2780 clauses, from simple append, to starting of list, after non-map clauses.
2781 * omp-low.c (lower_omp_target): Add GOMP_MAP_IMPLICIT bits into kind
2782 values passed to libgomp for implicit maps.
2784 2021-11-12 Jakub Jelinek <jakub@redhat.com>
2786 * omp-builtins.def (BUILT_IN_GOMP_TEAMS): Remove.
2787 (BUILT_IN_GOMP_TEAMS4): New.
2788 * builtin-types.def (BT_FN_VOID_UINT_UINT): Remove.
2789 (BT_FN_BOOL_UINT_UINT_UINT_BOOL): New.
2790 * omp-low.c (lower_omp_teams): Use GOMP_teams4 instead of
2791 GOMP_teams, pass to it also num_teams lower-bound expression
2792 or a dup of upper-bound if it is missing and a flag whether
2793 it is the first call or not.
2795 2021-11-12 Martin Liska <mliska@suse.cz>
2797 PR tree-optimization/102497
2798 * gimple-predicate-analysis.cc (add_pred): Remove unused
2801 2021-11-12 Richard Biener <rguenther@suse.de>
2803 PR tree-optimization/103204
2804 * tree-ssa-sccvn.c (valueize_refs_1): Re-valueize the
2805 top operand after folding in an address.
2807 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2809 * gimple-range-path.cc (path_range_query::path_range_query): New
2810 ctor without a ranger.
2811 (path_range_query::~path_range_query): Free ranger if necessary.
2812 (path_range_query::range_on_path_entry): Adjust m_ranger for pointer.
2813 (path_range_query::ssa_range_in_phi): Same.
2814 (path_range_query::compute_ranges_in_block): Same.
2815 (path_range_query::compute_imports): Same.
2816 (path_range_query::compute_ranges): Same.
2817 (path_range_query::range_of_stmt): Same.
2818 (path_range_query::compute_outgoing_relations): Same.
2819 * gimple-range-path.h (class path_range_query): New ctor.
2820 * tree-ssa-loop-ch.c (ch_base::copy_headers): Remove gimple_ranger
2821 as path_range_query allocates one.
2822 * tree-ssa-threadbackward.c (class back_threader): Remove m_ranger.
2823 (back_threader::~back_threader): Same.
2825 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2827 * tree-ssa-threadbackward.c
2828 (back_threader_profitability::profitable_path_p): Remove loop
2829 crossing restriction.
2831 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2833 * ipa-modref.c (analyze_function): Do pure/const discovery, return
2835 (pass_modref::execute): If pure/const is discovered fixup cfg.
2836 (ignore_edge): Do not ignore pure/const edges.
2837 (modref_propagate_in_scc): Do pure/const discovery, return true if
2838 cdtor was promoted pure/const.
2839 (pass_ipa_modref::execute): If needed remove unreachable functions.
2840 * ipa-pure-const.c (warn_function_noreturn): Fix whitespace.
2841 (warn_function_cold): Likewise.
2842 (skip_function_for_local_pure_const): Move earlier.
2843 (ipa_make_function_const): Break out from ...
2844 (ipa_make_function_pure): Break out from ...
2845 (propagate_pure_const): ... here.
2846 (pass_local_pure_const::execute): Use it.
2847 * ipa-utils.h (ipa_make_function_const): Declare.
2848 (ipa_make_function_pure): Declare.
2849 * passes.def: Move early modref after pure-const.
2851 2021-11-11 David Malcolm <dmalcolm@redhat.com>
2854 * diagnostic-show-locus.c (def_policy): Use def_tabstop.
2856 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2858 * config/aarch64/aarch64-builtins.c (TYPES_COMBINE): Delete.
2859 (TYPES_COMBINEP): Delete.
2860 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2861 qualified builtins for vcombine_* intrinsics.
2862 * config/aarch64/arm_neon.h (vcombine_s8): Remove unnecessary
2864 (vcombine_s16): Likewise.
2865 (vcombine_s32): Likewise.
2866 (vcombine_f32): Likewise.
2867 (vcombine_u8): Use type-qualified builtin and remove casts.
2868 (vcombine_u16): Likewise.
2869 (vcombine_u32): Likewise.
2870 (vcombine_u64): Likewise.
2871 (vcombine_p8): Likewise.
2872 (vcombine_p16): Likewise.
2873 (vcombine_p64): Likewise.
2874 (vcombine_bf16): Remove unnecessary cast.
2875 * config/aarch64/iterators.md (VD_I): New mode iterator.
2876 (VDC_P): New mode iterator.
2878 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2880 * config/aarch64/aarch64-builtins.c (TYPES_LOAD1_U): Define.
2881 (TYPES_LOAD1_P): Define.
2882 (TYPES_STORE1_U): Define.
2883 (TYPES_STORE1P): Rename to...
2884 (TYPES_STORE1_P): This.
2885 (get_mem_type_for_load_store): Add unsigned and poly types.
2886 (aarch64_general_gimple_fold_builtin): Add unsigned and poly
2887 type-qualified builtin declarations.
2888 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2889 qualified builtins for LD1/ST1.
2890 * config/aarch64/arm_neon.h (vld1_p8): Use type-qualified
2891 builtin and remove cast.
2892 (vld1_p16): Likewise.
2893 (vld1_u8): Likewise.
2894 (vld1_u16): Likewise.
2895 (vld1_u32): Likewise.
2896 (vld1q_p8): Likewise.
2897 (vld1q_p16): Likewise.
2898 (vld1q_p64): Likewise.
2899 (vld1q_u8): Likewise.
2900 (vld1q_u16): Likewise.
2901 (vld1q_u32): Likewise.
2902 (vld1q_u64): Likewise.
2903 (vst1_p8): Likewise.
2904 (vst1_p16): Likewise.
2905 (vst1_u8): Likewise.
2906 (vst1_u16): Likewise.
2907 (vst1_u32): Likewise.
2908 (vst1q_p8): Likewise.
2909 (vst1q_p16): Likewise.
2910 (vst1q_p64): Likewise.
2911 (vst1q_u8): Likewise.
2912 (vst1q_u16): Likewise.
2913 (vst1q_u32): Likewise.
2914 (vst1q_u64): Likewise.
2915 * config/aarch64/iterators.md (VALLP_NO_DI): New iterator.
2917 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2919 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2920 builtins for vector reduction.
2921 * config/aarch64/arm_neon.h (vaddv_u8): Use type-qualified
2922 builtin and remove casts.
2923 (vaddv_u16): Likewise.
2924 (vaddv_u32): Likewise.
2925 (vaddvq_u8): Likewise.
2926 (vaddvq_u16): Likewise.
2927 (vaddvq_u32): Likewise.
2928 (vaddvq_u64): Likewise.
2930 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2932 * config/aarch64/aarch64-simd-builtins.def:
2933 * config/aarch64/arm_neon.h (vpaddq_u8): Use type-qualified
2934 builtin and remove casts.
2935 (vpaddq_u16): Likewise.
2936 (vpaddq_u32): Likewise.
2937 (vpaddq_u64): Likewise.
2938 (vpadd_u8): Likewise.
2939 (vpadd_u16): Likewise.
2940 (vpadd_u32): Likewise.
2941 (vpaddd_u64): Likewise.
2943 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2945 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2946 builtins for [r]subhn[2].
2947 * config/aarch64/arm_neon.h (vsubhn_s16): Remove unnecessary
2949 (vsubhn_s32): Likewise.
2950 (vsubhn_s64): Likewise.
2951 (vsubhn_u16): Use type-qualified builtin and remove casts.
2952 (vsubhn_u32): Likewise.
2953 (vsubhn_u64): Likewise.
2954 (vrsubhn_s16): Remove unnecessary cast.
2955 (vrsubhn_s32): Likewise.
2956 (vrsubhn_s64): Likewise.
2957 (vrsubhn_u16): Use type-qualified builtin and remove casts.
2958 (vrsubhn_u32): Likewise.
2959 (vrsubhn_u64): Likewise.
2960 (vrsubhn_high_s16): Remove unnecessary cast.
2961 (vrsubhn_high_s32): Likewise.
2962 (vrsubhn_high_s64): Likewise.
2963 (vrsubhn_high_u16): Use type-qualified builtin and remove
2965 (vrsubhn_high_u32): Likewise.
2966 (vrsubhn_high_u64): Likewise.
2967 (vsubhn_high_s16): Remove unnecessary cast.
2968 (vsubhn_high_s32): Likewise.
2969 (vsubhn_high_s64): Likewise.
2970 (vsubhn_high_u16): Use type-qualified builtin and remove
2972 (vsubhn_high_u32): Likewise.
2973 (vsubhn_high_u64): Likewise.
2975 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2977 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2978 builtins for [r]addhn[2].
2979 * config/aarch64/arm_neon.h (vaddhn_s16): Remove unnecessary
2981 (vaddhn_s32): Likewise.
2982 (vaddhn_s64): Likewise.
2983 (vaddhn_u16): Use type-qualified builtin and remove casts.
2984 (vaddhn_u32): Likewise.
2985 (vaddhn_u64): Likewise.
2986 (vraddhn_s16): Remove unnecessary cast.
2987 (vraddhn_s32): Likewise.
2988 (vraddhn_s64): Likewise.
2989 (vraddhn_u16): Use type-qualified builtin and remove casts.
2990 (vraddhn_u32): Likewise.
2991 (vraddhn_u64): Likewise.
2992 (vaddhn_high_s16): Remove unnecessary cast.
2993 (vaddhn_high_s32): Likewise.
2994 (vaddhn_high_s64): Likewise.
2995 (vaddhn_high_u16): Use type-qualified builtin and remove
2997 (vaddhn_high_u32): Likewise.
2998 (vaddhn_high_u64): Likewise.
2999 (vraddhn_high_s16): Remove unnecessary cast.
3000 (vraddhn_high_s32): Likewise.
3001 (vraddhn_high_s64): Likewise.
3002 (vraddhn_high_u16): Use type-qualified builtin and remove
3004 (vraddhn_high_u32): Likewise.
3005 (vraddhn_high_u64): Likewise.
3007 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3009 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3010 qualifiers in generator macros for uhsub builtins.
3011 * config/aarch64/arm_neon.h (vhsub_s8): Remove unnecessary
3013 (vhsub_s16): Likewise.
3014 (vhsub_s32): Likewise.
3015 (vhsub_u8): Use type-qualified builtin and remove casts.
3016 (vhsub_u16): Likewise.
3017 (vhsub_u32): Likewise.
3018 (vhsubq_s8): Remove unnecessary cast.
3019 (vhsubq_s16): Likewise.
3020 (vhsubq_s32): Likewise.
3021 (vhsubq_u8): Use type-qualified builtin and remove casts.
3022 (vhsubq_u16): Likewise.
3023 (vhsubq_u32): Likewise.
3025 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3027 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3028 qualifiers in generator macros for u[r]hadd builtins.
3029 * config/aarch64/arm_neon.h (vhadd_s8): Remove unnecessary
3031 (vhadd_s16): Likewise.
3032 (vhadd_s32): Likewise.
3033 (vhadd_u8): Use type-qualified builtin and remove casts.
3034 (vhadd_u16): Likewise.
3035 (vhadd_u32): Likewise.
3036 (vhaddq_s8): Remove unnecessary cast.
3037 (vhaddq_s16): Likewise.
3038 (vhaddq_s32): Likewise.
3039 (vhaddq_u8): Use type-qualified builtin and remove casts.
3040 (vhaddq_u16): Likewise.
3041 (vhaddq_u32): Likewise.
3042 (vrhadd_s8): Remove unnecessary cast.
3043 (vrhadd_s16): Likewise.
3044 (vrhadd_s32): Likewise.
3045 (vrhadd_u8): Use type-qualified builtin and remove casts.
3046 (vrhadd_u16): Likewise.
3047 (vrhadd_u32): Likewise.
3048 (vrhaddq_s8): Remove unnecessary cast.
3049 (vrhaddq_s16): Likewise.
3050 (vrhaddq_s32): Likewise.
3051 (vrhaddq_u8): Use type-wualified builtin and remove casts.
3052 (vrhaddq_u16): Likewise.
3053 (vrhaddq_u32): Likewise.
3055 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3057 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3058 qualifiers in generator macros for usub[lw][2] builtins.
3059 * config/aarch64/arm_neon.h (vsubl_s8): Remove unnecessary
3061 (vsubl_s16): Likewise.
3062 (vsubl_s32): Likewise.
3063 (vsubl_u8): Use type-qualified builtin and remove casts.
3064 (vsubl_u16): Likewise.
3065 (vsubl_u32): Likewise.
3066 (vsubl_high_s8): Remove unnecessary cast.
3067 (vsubl_high_s16): Likewise.
3068 (vsubl_high_s32): Likewise.
3069 (vsubl_high_u8): Use type-qualified builtin and remove casts.
3070 (vsubl_high_u16): Likewise.
3071 (vsubl_high_u32): Likewise.
3072 (vsubw_s8): Remove unnecessary casts.
3073 (vsubw_s16): Likewise.
3074 (vsubw_s32): Likewise.
3075 (vsubw_u8): Use type-qualified builtin and remove casts.
3076 (vsubw_u16): Likewise.
3077 (vsubw_u32): Likewise.
3078 (vsubw_high_s8): Remove unnecessary cast.
3079 (vsubw_high_s16): Likewise.
3080 (vsubw_high_s32): Likewise.
3081 (vsubw_high_u8): Use type-qualified builtin and remove casts.
3082 (vsubw_high_u16): Likewise.
3083 (vsubw_high_u32): Likewise.
3085 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3087 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
3088 qualifiers in generator macros for uadd[lw][2] builtins.
3089 * config/aarch64/arm_neon.h (vaddl_s8): Remove unnecessary
3091 (vaddl_s16): Likewise.
3092 (vaddl_s32): Likewise.
3093 (vaddl_u8): Use type-qualified builtin and remove casts.
3094 (vaddl_u16): Likewise.
3095 (vaddl_u32): Likewise.
3096 (vaddl_high_s8): Remove unnecessary cast.
3097 (vaddl_high_s16): Likewise.
3098 (vaddl_high_s32): Likewise.
3099 (vaddl_high_u8): Use type-qualified builtin and remove casts.
3100 (vaddl_high_u16): Likewise.
3101 (vaddl_high_u32): Likewise.
3102 (vaddw_s8): Remove unnecessary cast.
3103 (vaddw_s16): Likewise.
3104 (vaddw_s32): Likewise.
3105 (vaddw_u8): Use type-qualified builtin and remove casts.
3106 (vaddw_u16): Likewise.
3107 (vaddw_u32): Likewise.
3108 (vaddw_high_s8): Remove unnecessary cast.
3109 (vaddw_high_s16): Likewise.
3110 (vaddw_high_s32): Likewise.
3111 (vaddw_high_u8): Use type-qualified builtin and remove casts.
3112 (vaddw_high_u16): Likewise.
3113 (vaddw_high_u32): Likewise.
3115 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3117 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3118 qualified builtins for [R]SHRN[2].
3119 * config/aarch64/arm_neon.h (vshrn_n_u16): Use type-qualified
3120 builtin and remove casts.
3121 (vshrn_n_u32): Likewise.
3122 (vshrn_n_u64): Likewise.
3123 (vrshrn_high_n_u16): Likewise.
3124 (vrshrn_high_n_u32): Likewise.
3125 (vrshrn_high_n_u64): Likewise.
3126 (vrshrn_n_u16): Likewise.
3127 (vrshrn_n_u32): Likewise.
3128 (vrshrn_n_u64): Likewise.
3129 (vshrn_high_n_u16): Likewise.
3130 (vshrn_high_n_u32): Likewise.
3131 (vshrn_high_n_u64): Likewise.
3133 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3135 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
3136 type-qualified builtins for XTN[2].
3137 * config/aarch64/arm_neon.h (vmovn_high_u16): Use type-
3138 qualified builtin and remove casts.
3139 (vmovn_high_u32): Likewise.
3140 (vmovn_high_u64): Likewise.
3141 (vmovn_u16): Likewise.
3142 (vmovn_u32): Likewise.
3143 (vmovn_u64): Likewise.
3145 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3147 * config/aarch64/aarch64-simd-builtins.def: Use poly type
3148 qualifier in builtin generator macros.
3149 * config/aarch64/arm_neon.h (vmul_p8): Use type-qualified
3150 builtin and remove casts.
3151 (vmulq_p8): Likewise.
3152 (vmull_high_p8): Likewise.
3153 (vmull_p8): Likewise.
3155 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
3157 * config/aarch64/aarch64-simd-builtins.def: Declare type-
3158 qualified builtin generators for unsigned MLA/MLS intrinsics.
3159 * config/aarch64/arm_neon.h (vmla_n_u16): Use type-qualified
3161 (vmla_n_u32): Likewise.
3162 (vmla_u8): Likewise.
3163 (vmla_u16): Likewise.
3164 (vmla_u32): Likewise.
3165 (vmlaq_n_u16): Likewise.
3166 (vmlaq_n_u32): Likewise.
3167 (vmlaq_u8): Likewise.
3168 (vmlaq_u16): Likewise.
3169 (vmlaq_u32): Likewise.
3170 (vmls_n_u16): Likewise.
3171 (vmls_n_u32): Likewise.
3172 (vmls_u8): Likewise.
3173 (vmls_u16): Likewise.
3174 (vmls_u32): Likewise.
3175 (vmlsq_n_u16): Likewise.
3176 (vmlsq_n_u32): Likewise.
3177 (vmlsq_u8): Likewise.
3178 (vmlsq_u16): Likewise.
3179 (vmlsq_u32): Likewise.
3181 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3183 * ipa-modref.c (modref_summary::useful_p): Check also for side-effects
3184 with looping const/pure.
3185 (modref_summary_lto::useful_p): Likewise.
3186 (merge_call_side_effects): Merge side effects before early exit
3188 (process_fnspec): Also handle pure functions.
3189 (analyze_call): Do not early exit on looping pure const.
3190 (propagate_unknown_call): Also handle nontrivial SCC as side-effect.
3191 (modref_propagate_in_scc): Update.
3193 2021-11-11 Richard Biener <rguenther@suse.de>
3195 PR tree-optimization/103190
3196 * tree-ssa-reassoc.c (insert_stmt_after): Only assert on asm goto.
3198 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3200 * gimple-range-path.cc (path_range_query::add_copies_to_imports):
3202 (path_range_query::compute_imports): ...this. Adapt it so it can
3203 be passed the imports bitmap instead of working on m_imports.
3204 (path_range_query::compute_ranges): Call compute_imports in all
3205 cases unless an imports bitmap is passed.
3206 * gimple-range-path.h (path_range_query::compute_imports): New.
3207 (path_range_query::add_copies_to_imports): Remove.
3208 * tree-ssa-threadbackward.c (back_threader::resolve_def): Remove.
3209 (back_threader::find_paths_to_names): Inline resolve_def.
3210 (back_threader::find_paths): Call compute_imports.
3211 (back_threader::resolve_phi): Adjust comment.
3213 2021-11-11 Richard Biener <rguenther@suse.de>
3215 PR tree-optimization/103188
3216 * tree-ssa-loop-ch.c (should_duplicate_loop_header_p):
3217 Remove query parameter, split out check for size
3219 (ch_base::m_ranger, cb_base::m_query): Remove.
3220 (ch_base::copy_headers): Split processing loop into
3221 analysis around which we allocate and use ranger and
3222 transform where we do not.
3223 (pass_ch::execute): Do not allocate/free ranger here.
3224 (pass_ch_vect::execute): Likewise.
3226 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3228 * ipa-pure-const.c (propagate_pure_const): Self recursion is
3231 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3233 * cgraph.c (set_noreturn_flag_1): New function.
3234 (cgraph_node::set_noreturn_flag): New member function
3235 * cgraph.h (cgraph_node::set_noreturn_flags): Declare.
3236 * ipa-pure-const.c (pass_local_pure_const::execute): Use it.
3238 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3239 Richard Biener <rguenther@suse.de>
3241 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): Resolve
3242 statically to the edge remaining in the loop.
3244 2021-11-11 Richard Biener <rguenther@suse.de>
3246 PR middle-end/103181
3247 * tree-eh.c (operation_could_trap_helper_p): Properly
3248 check vector constants for a zero element for integer
3249 division. Separate floating point and integer division code.
3251 2021-11-11 Jakub Jelinek <jakub@redhat.com>
3254 * dwarf2out.c (field_byte_offset): Do the PCC_BITFIELD_TYPE_MATTERS
3255 handling only for DECL_BIT_FIELD_TYPE decls.
3257 2021-11-11 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
3260 * config/aarch64/aarch64.c (aarch64_process_target_attr): Check if
3261 token is arch extension without leading '+' and emit appropriate
3262 diagnostic for the same.
3264 2021-11-11 Jakub Jelinek <jakub@redhat.com>
3266 * tree.h (OMP_CLAUSE_NUM_TEAMS_EXPR): Rename to ...
3267 (OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR): ... this.
3268 (OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR): Define.
3269 * tree.c (omp_clause_num_ops): Increase num ops for
3270 OMP_CLAUSE_NUM_TEAMS to 2.
3271 * tree-pretty-print.c (dump_omp_clause): Print optional lower bound
3272 for OMP_CLAUSE_NUM_TEAMS.
3273 * gimplify.c (gimplify_scan_omp_clauses): Gimplify
3274 OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR if non-NULL.
3275 (optimize_target_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR instead
3276 of OMP_CLAUSE_NUM_TEAMS_EXPR. Handle OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR.
3277 * omp-low.c (lower_omp_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR
3278 instead of OMP_CLAUSE_NUM_TEAMS_EXPR.
3279 * omp-expand.c (expand_teams_call, get_target_arguments): Likewise.
3281 2021-11-11 Richard Biener <rguenther@suse.de>
3283 * cfganal.c (find_pdom): Remove.
3284 (control_dependences::find_control_dependence): Remove
3285 special-casing of entry block, call get_immediate_dominator
3287 * gimple-predicate-analysis.cc (find_pdom): Remove.
3288 (find_dom): Likewise.
3289 (find_control_equiv_block): Call get_immediate_dominator
3291 (compute_control_dep_chain): Likewise.
3292 (predicate::init_from_phi_def): Likewise.
3294 2021-11-11 Richard Biener <rguenther@suse.de>
3296 * cfganal.h (control_dependences::control_dependence_map):
3298 (control_dependences::m_bitmaps): New.
3299 * cfganal.c (control_dependences::set_control_dependence_map_bit):
3301 (control_dependences::clear_control_dependence_bitmap):
3303 (control_dependences::find_control_dependence): Do not
3304 find_edge for the abnormal edge test.
3305 (control_dependences::control_dependences): Instead do not
3306 add abnormal edges to the edge list. Adjust.
3307 (control_dependences::~control_dependences): Likewise.
3308 (control_dependences::get_edges_dependent_on): Likewise.
3309 * function-tests.c: Include bitmap.h.
3311 2021-11-11 Kewen Lin <linkw@linux.ibm.com>
3313 * doc/invoke.texi: Change references to "future cpu" to "power10",
3314 "-mcpu=future" to "-mcpu=power10". Adjust words for float128.
3316 2021-11-11 Cui,Lili <lili.cui@intel.com>
3318 * config/i386/i386-options.c (m_CORE_AVX2): Remove Alderlake
3320 (processor_cost_table): Use alderlake_cost for Alderlake.
3321 * config/i386/i386.c (ix86_sched_init_global): Handle Alderlake.
3322 * config/i386/x86-tune-costs.h (struct processor_costs): Add alderlake
3324 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Alderlake
3326 (ix86_adjust_cost): Handle Alderlake.
3327 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE): Enable for Alderlake.
3328 (X86_TUNE_PARTIAL_REG_DEPENDENCY): Likewise.
3329 (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY): Likewise.
3330 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): Likewise.
3331 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
3332 (X86_TUNE_MEMORY_MISMATCH_STALL): Likewise.
3333 (X86_TUNE_USE_LEAVE): Likewise.
3334 (X86_TUNE_PUSH_MEMORY): Likewise.
3335 (X86_TUNE_USE_INCDEC): Likewise.
3336 (X86_TUNE_INTEGER_DFMODE_MOVES): Likewise.
3337 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
3338 (X86_TUNE_USE_SAHF): Likewise.
3339 (X86_TUNE_USE_BT): Likewise.
3340 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
3341 (X86_TUNE_ONE_IF_CONV_INSN): Likewise.
3342 (X86_TUNE_AVOID_MFENCE): Likewise.
3343 (X86_TUNE_USE_SIMODE_FIOP): Likewise.
3344 (X86_TUNE_EXT_80387_CONSTANTS): Likewise.
3345 (X86_TUNE_SSE_UNALIGNED_LOAD_OPTIMAL): Likewise.
3346 (X86_TUNE_SSE_UNALIGNED_STORE_OPTIMAL): Likewise.
3347 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
3348 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
3349 (X86_TUNE_AVOID_4BYTE_PREFIXES): Likewise.
3350 (X86_TUNE_USE_GATHER): Disable for Alderlake.
3351 (X86_TUNE_AVX256_MOVE_BY_PIECES): Likewise.
3352 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
3354 2021-11-11 liuhongt <hongtao.liu@intel.com>
3357 * config/i386/sse.md (V_128_256): Extend to V8HF/V16HF.
3358 (avxsizesuffix): Ditto.
3360 2021-11-11 Kito Cheng <kito.cheng@sifive.com>
3362 * common/config/riscv/riscv-common.c (riscv_subset_list::to_string): Fix
3363 wrong marco checking.
3365 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3367 PR tree-optimization/102906
3368 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): New.
3369 (should_duplicate_loop_header_p): Call entry_loop_condition_is_static.
3370 (class ch_base): Add m_ranger and m_query.
3371 (ch_base::copy_headers): Pass m_query to
3372 entry_loop_condition_is_static.
3373 (pass_ch::execute): Allocate and deallocate m_ranger and
3375 (pass_ch_vect::execute): Same.
3377 2021-11-10 Andrew Pinski <apinski@marvell.com>
3380 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>):
3381 Use vwcore iterator for the r constraint output string.
3383 2021-11-10 qing zhao <qing.zhao@oracle.com>
3385 * internal-fn.c (expand_DEFERRED_INIT): Apply pattern initialization
3386 only when have_insn_for return true for the mode. Fix a memory leak.
3388 2021-11-10 Christophe Lyon <christophe.lyon@foss.st.com>
3390 * config/arm/arm.c (cortexa9_extra_costs, cortexa8_extra_costs,
3391 cortexa5_extra_costs, cortexa7_extra_costs,
3392 cortexa12_extra_costs, cortexa15_extra_costs, v7m_extra_costs):
3393 Initialize movi, dup and extract costing fields.
3395 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3397 * gimple-range-path.cc (path_range_query::path_range_query): Do
3399 (path_range_query::dump): Change m_path uses to non-pointer.
3400 (path_range_query::defined_outside_path): Same.
3401 (path_range_query::set_path): Same.
3402 (path_range_query::add_copies_to_imports): Same.
3403 (path_range_query::range_of_stmt): Same.
3404 (path_range_query::compute_outgoing_relations): Same.
3405 (path_range_query::compute_ranges): Imports are now optional.
3406 Implement overload that takes an edge.
3407 * gimple-range-path.h (class path_range_query): Make imports
3408 optional for compute_ranges. Add compute_ranges(edge) overload.
3409 Make m_path an auto_vec instead of a pointer and adjust
3412 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3414 * tree-vectorizer.h (struct scalar_cond_masked_key): Add inverted_p.
3415 (default_hash_traits<scalar_conf_masked_key>): Likewise.
3416 * tree-vect-stmts.c (vectorizable_condition): Check if inverse of mask
3418 * tree-vectorizer.c (scalar_cond_masked_key::get_cond_ops_from_tree):
3419 Register mask inverses.
3421 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3423 * tree-vectorizer.c (vectorize_loops): Do local CSE through RPVN upon
3424 successful vectorization.
3426 2021-11-10 Andrew MacLeod <amacleod@redhat.com>
3428 * gimple-range-cache.cc (sbr_vector::grow): New.
3429 (sbr_vector::set_bb_range): Call grow.
3430 (sbr_vector::get_bb_range): Same.
3431 (sbr_vector::bb_range_p): Remove assert.
3433 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3435 * config/aarch64/aarch64-simd.md (*aarch64_topbits_shuffle<mode>_le
3436 ,*aarch64_topbits_shuffle<mode>_be): Remove.
3438 2021-11-10 Jan Hubicka <jh@suse.cz>
3440 * ipa-modref.c: Include tree-eh.h
3441 (modref_summary::modref_summary): Initialize side_effects.
3442 (struct modref_summary_lto): New bool field side_effects.
3443 (modref_summary_lto::modref_summary_lto): Initialize side_effects.
3444 (modref_summary::dump): Dump side_effects.
3445 (modref_summary_lto::dump): Dump side_effects.
3446 (merge_call_side_effects): Merge side effects.
3447 (process_fnspec): Calls to non-const/pure or looping
3448 function is a side effect.
3449 (analyze_call): Self-recursion is a side-effect; handle
3451 (analyze_load): Watch for volatile and throwing memory.
3452 (analyze_store): Likewise.
3453 (analyze_stmt): Watch for volatitle asm.
3454 (analyze_function): Handle side_effects.
3455 (modref_summaries::duplicate): Duplicate side_effects.
3456 (modref_summaries_lto::duplicate): Likewise.
3457 (modref_write): Stream side_effects.
3458 (read_section): Likewise.
3459 (update_signature): Update.
3460 (propagate_unknown_call): Handle side_effects.
3461 (modref_propagate_in_scc): Likewise.
3462 * ipa-modref.h (struct modref_summary): Add side_effects.
3463 * ipa-pure-const.c (special_builtin_state): Rename to ...
3464 (builtin_safe_for_const_function_p): ... this one.
3465 (check_call): Update.
3466 (finite_function_p): Break out from ...
3467 (propagate_pure_const): ... here
3468 * ipa-utils.h (finite_function): Declare.
3470 2021-11-10 Lucas A. M. Magalhães <lamm@linux.ibm.com>
3472 * config.gcc (powerpc*-*-*): Remove -rpath from
3473 --with-advance-toolchain.
3475 2021-11-10 Marek Polacek <polacek@redhat.com>
3478 * attribs.c (struct scoped_attributes): Add a bool member.
3479 (lookup_scoped_attribute_spec): Forward declare.
3480 (register_scoped_attributes): New bool parameter, defaulted to
3482 (handle_ignored_attributes_option): New function.
3483 (free_attr_data): New function.
3484 (init_attributes): Call handle_ignored_attributes_option.
3485 (attr_namespace_ignored_p): New function.
3486 (decl_attributes): Check attr_namespace_ignored_p before
3488 * attribs.h (free_attr_data): Declare.
3489 (register_scoped_attributes): Adjust declaration.
3490 (handle_ignored_attributes_option): Declare.
3491 (canonicalize_attr_name): New function template.
3492 (canonicalize_attr_name): Use it.
3493 * common.opt (Wattributes=): New option with a variable.
3494 * doc/extend.texi: Document #pragma GCC diagnostic ignored_attributes.
3495 * doc/invoke.texi: Document -Wno-attributes=.
3496 * opts.c (common_handle_option) <case OPT_Wattributes_>: Handle.
3497 * plugin.h (register_scoped_attributes): Adjust declaration.
3498 * toplev.c (compile_file): Call free_attr_data.
3500 2021-11-10 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
3502 * config/arm/arm-cpus.in (cortex-a710): New CPU.
3503 * config/arm/arm-tables.opt: Regenerate.
3504 * config/arm/arm-tune.md: Regenerate.
3505 * doc/invoke.texi: Update docs.
3507 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
3509 * config/aarch64/aarch64-builtins.c
3510 (aarch64_general_gimple_fold_builtin): Mark argument as unused.
3512 2021-11-10 Martin Liska <mliska@suse.cz>
3514 * lto-wrapper.c (merge_and_complain): Make the first argument
3517 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3519 * config/aarch64/iterators.md (optab): Use fmax_nan instead of
3520 smax_nan and fmin_nan instead of smin_nan.
3521 (maxmin_uns): Rename to...
3522 (fmaxmin): ...this and make the same changes. Remove entries
3523 unrelated to fmax* and fmin*.
3524 * config/aarch64/aarch64.md (<maxmin_uns><mode>3): Rename to...
3525 (<fmaxmin><mode>3): ...this.
3526 * config/aarch64/aarch64-simd.md (aarch64_<maxmin_uns>p<mode>):
3528 (aarch64_<optab>p<mode>): ...this.
3529 (<maxmin_uns><mode>3): Rename to...
3530 (<fmaxmin><mode>3): ...this.
3531 (reduc_<maxmin_uns>_scal_<mode>): Rename to...
3532 (reduc_<optab>_scal_<mode>): ...this and update gen* call.
3533 (aarch64_reduc_<maxmin_uns>_internal<mode>): Rename to...
3534 (aarch64_reduc_<optab>_internal<mode>): ...this.
3535 (aarch64_reduc_<maxmin_uns>_internalv2si): Rename to...
3536 (aarch64_reduc_<optab>_internalv2si): ...this.
3537 * config/aarch64/aarch64-sve.md (<maxmin_uns><mode>3): Rename to...
3538 (<fmaxmin><mode>3): ...this.
3539 * config/aarch64/aarch64-simd-builtins.def (smax_nan, smin_nan)
3541 (fmax_nan, fmin_nan): ...this.
3542 * config/aarch64/arm_neon.h (vmax_f32, vmax_f64, vmaxq_f32, vmaxq_f64)
3543 (vmin_f32, vmin_f64, vminq_f32, vminq_f64, vmax_f16, vmaxq_f16)
3544 (vmin_f16, vminq_f16): Update accordingly.
3546 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3548 * tree-vectorizer.h (vector_costs::finish_cost): Take the
3549 corresponding scalar costs as a parameter.
3550 (finish_cost): Likewise.
3551 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost)
3552 (vect_estimate_min_profitable_iters): Update accordingly.
3553 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Likewise.
3554 * tree-vectorizer.c (vector_costs::finish_cost): Likewise.
3555 * config/aarch64/aarch64.c (aarch64_vector_costs::finish_cost):
3557 * config/rs6000/rs6000.c (rs6000_cost_data::finish_cost): Likewise.
3559 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3561 * tree-vectorizer.h (_loop_vec_info::scalar_costs): New member
3563 (_loop_vec_info::single_scalar_iteration_cost): Delete.
3564 (LOOP_VINFO_SINGLE_SCALAR_ITERATION_COST): Delete.
3565 (vector_costs::total_cost): New function.
3566 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
3567 after above changes.
3568 (_loop_vec_info::~_loop_vec_info): Delete scalar_costs.
3569 (vect_compute_single_scalar_iteration_cost): Store the costs
3570 in loop_vinfo->scalar_costs.
3571 (vect_estimate_min_profitable_iters): Get the scalar cost from
3572 loop_vinfo->scalar_costs.
3574 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3576 * tree-vectorizer.h (vector_costs::better_main_loop_than_p)
3577 (vector_costs::better_epilogue_loop_than_p)
3578 (vector_costs::compare_inside_loop_cost)
3579 (vector_costs::compare_outside_loop_cost): Likewise.
3580 * tree-vectorizer.c (vector_costs::better_main_loop_than_p)
3581 (vector_costs::better_epilogue_loop_than_p)
3582 (vector_costs::compare_inside_loop_cost)
3583 (vector_costs::compare_outside_loop_cost): New functions,
3584 containing code moved from...
3585 * tree-vect-loop.c (vect_better_loop_vinfo_p): ...here.
3587 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3589 * tree-vectorizer.h (_loop_vec_info): Remove vec_outside_cost
3590 and vec_inside_cost.
3591 (vector_costs::outside_cost): New function.
3592 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
3594 (vect_estimate_min_profitable_iters): Likewise.
3595 (vect_better_loop_vinfo_p): Get the inside and outside costs
3596 from the loop_vec_infos' vector_costs.
3598 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3600 * tree-vectorizer.h (vec_info::target_cost_data): Replace with...
3601 (_loop_vec_info::vector_costs): ...this.
3602 (LOOP_VINFO_TARGET_COST_DATA): Delete.
3603 * tree-vectorizer.c (vec_info::vec_info): Remove target_cost_data
3605 (vec_info::~vec_info): Remove corresponding delete.
3606 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Initialize
3607 vector_costs to null.
3608 (_loop_vec_info::~_loop_vec_info): Delete vector_costs.
3609 (vect_analyze_loop_operations): Update after above changes.
3610 (vect_analyze_loop_2): Likewise.
3611 (vect_estimate_min_profitable_iters): Likewise.
3612 * tree-vect-slp.c (vect_slp_analyze_operations): Likewise.
3614 2021-11-10 Jan Hubicka <hubicka@ucw.cz>
3616 * tree-core.h (EAF_DIRECT): Remove.
3617 (EAF_NOCLOBBER): Remove.
3618 (EAF_UNUSED): Remove.
3619 (EAF_NOESCAPE): Remove.
3620 (EAF_NO_DIRECT_CLOBBER): New.
3621 (EAF_NO_INDIRECT_CLOBBER): New.
3622 (EAF_NODIRECTESCAPE): Remove.
3623 (EAF_NO_DIRECT_ESCAPE): New.
3624 (EAF_NO_INDIRECT_ESCAPE): New.
3625 (EAF_NOT_RETURNED): Remove.
3626 (EAF_NOT_RETURNED_INDIRECTLY): New.
3627 (EAF_NOREAD): Remove.
3628 (EAF_NO_DIRECT_READ): New.
3629 (EAF_NO_INDIRECT_READ): New.
3630 * gimple.c (gimple_call_arg_flags): Update for new flags.
3631 (gimple_call_retslot_flags): Update for new flags.
3632 * ipa-modref.c (dump_eaf_flags): Likewise.
3633 (remove_useless_eaf_flags): Likewise.
3634 (deref_flags): Likewise.
3635 (modref_lattice::init): Likewise.
3636 (modref_lattice::merge): Likewise.
3637 (modref_lattice::merge_direct_load): Likewise.
3638 (modref_lattice::merge_direct_store): Likewise.
3639 (modref_eaf_analysis::merge_call_lhs_flags): Likewise.
3640 (callee_to_caller_flags): Likewise.
3641 (modref_eaf_analysis::analyze_ssa_name): Likewise.
3642 (modref_eaf_analysis::propagate): Likewise.
3643 (modref_merge_call_site_flags): Likewise.
3644 * ipa-modref.h (interposable_eaf_flags): Likewise.
3645 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1) Likewise.
3646 * tree-ssa-structalias.c (handle_call_arg): Likewise.
3647 (handle_rhs_call): Likewise.
3648 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Likewise.
3650 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3652 * tree-vect-slp-patterns.c (complex_mul_pattern::matches): Remove l1node.
3654 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
3656 * config/aarch64/aarch64-builtins.c
3657 (aarch64_general_gimple_fold_builtin): Change pointer alignment and
3660 2021-11-10 Jan Hubicka <jh@suse.cz>
3662 * ipa-modref-tree.h (modref_tree::remap_params): Fix off-by-one error.
3664 2021-11-10 H.J. Lu <hongjiu.lu@intel.com>
3665 Hongtao Liu <hongtao.liu@intel.com>
3667 PR middle-end/102566
3668 * match.pd (nop_atomic_bit_test_and_p): New match.
3669 * tree-ssa-ccp.c (convert_atomic_bit_not): New function.
3670 (gimple_nop_atomic_bit_test_and_p): New prototype.
3671 (optimize_atomic_bit_test_and): Transform equivalent, but slighly
3672 different cases to their canonical forms.
3674 2021-11-10 liuhongt <hongtao.liu@intel.com>
3676 PR tree-optimization/103126
3677 * tree-vect-loop.c (neutral_op_for_reduction): Remove static.
3678 * tree-vectorizer.h (neutral_op_for_reduction): Declare.
3679 * tree-if-conv.c : Include tree-vectorizer.h.
3680 (is_cond_scalar_reduction): Handle
3681 BIT_XOR_EXPR/BIT_IOR_EXPR/BIT_AND_EXPR.
3682 (convert_scalar_cond_reduction): Ditto.
3684 2021-11-10 konglin1 <lingling.kong@intel.com>
3686 * config/i386/sse.md (cmul<conj_op><mode>3): add new define_expand.
3687 (cmla<conj_op><mode>4): Likewise
3689 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3691 * tree-ssa-threadedge.c: Do not include
3692 gimple-ssa-evrp-analyze.h.
3693 * value-pointer-equiv.cc: Same.
3695 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3697 * tree-ssa-threadbackward.c
3698 (back_threader::maybe_register_path_dump): Abstract path dumping...
3699 (dump_path): ...here.
3700 (back_threader::resolve_phi): Call dump_path.
3703 2021-11-10 konglin1 <lingling.kong@intel.com>
3705 * config/i386/sse.md (fma_<complexpairopname>_<mode>_pair):
3706 Add new define_insn.
3707 (fma_<mode>_fmaddc_bcst): Add new define_insn_and_split.
3708 (fma_<mode>_fcmaddc_bcst): Likewise
3710 2021-11-10 liuhongt <hongtao.liu@intel.com>
3713 * match.pd: Simplify (trunc)fmax/fmin((extend)a, (extend)b) to
3716 2021-11-10 Andrew Pinski <apinski@marvell.com>
3719 * config/aarch64/aarch64.c (aarch64_evpc_ins): Don't use target
3720 as an input, use original one.
3722 2021-11-10 Sandra Loosemore <sandra@codesourcery.com>
3724 * config/nios2/nios2.c (nios2_can_inline_p): New.
3725 (TARGET_CAN_INLINE_P): Define.
3727 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
3729 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin):
3730 Disable gimple fold for RS6000_BIF_{XVMINDP,XVMINSP,VMINFP} and
3731 RS6000_BIF_{XVMAXDP,XVMAXSP,VMAXFP} when fast-math is not set.
3732 (lxvrse_expand_builtin): Modify the expansion for sign extension.
3733 All extensions are done within VSX registers.
3735 2021-11-09 Andrew Pinski <apinski@marvell.com>
3737 PR tree-optimization/10352
3738 * match.pd: Remove check of TYPE_PRECISION for
3741 2021-11-09 Andrew MacLeod <amacleod@redhat.com>
3743 * function.c (allocate_struct_function): Don't set x_range_query.
3744 * function.h (get_range_query): Move to value-query.h.
3745 * gimple-range.cc (enable_ranger): Check that query is currently NULL.
3746 (disable_ranger): Clear function current query field.
3747 * value-query.cc (get_global_range_query): Relocate to:
3748 * value-query.h (get_global_range_query): Here and inline.
3749 (get_range_query): Relocate here from function.h.
3751 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3753 * tree-ssa-threadbackward.c
3754 (back_threader::maybe_register_path_dump): New.
3755 (back_threader::maybe_register_path): Call maybe_register_path_dump.
3757 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3759 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
3760 Return NULL when unprofitable.
3762 2021-11-09 Martin Jambor <mjambor@suse.cz>
3764 * tree.h (build_debug_expr_decl): Declare.
3765 * tree.c (build_debug_expr_decl): New function.
3766 * cfgexpand.c (avoid_deep_ter_for_debug): Use build_debug_expr_decl
3767 instead of building a DEBUG_EXPR_DECL.
3768 * ipa-param-manipulation.c
3769 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
3770 * omp-simd-clone.c (ipa_simd_modify_stmt_ops): Likewise.
3771 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Likewise.
3772 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
3773 * tree-ssa-reassoc.c (make_new_ssa_for_def): Likewise.
3775 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
3777 * config/rs6000/rs6000-call.c (rs6000_type_string): New function.
3778 (def_builtin): Change debug formatting for easier parsing and
3779 include more information.
3780 (rs6000_init_builtins): Add dump of autogenerated builtins.
3781 (altivec_init_builtins): Dump __builtin_altivec_mask_for_load for
3784 2021-11-09 Marek Polacek <polacek@redhat.com>
3786 * ipa-modref.c (analyze_parms): Fix a typo.
3788 2021-11-09 Kito Cheng <kito.cheng@sifive.com>
3791 * common/config/riscv/riscv-common.c (multi_letter_subset_rank): Remove
3792 assertion for Z*-ext.
3794 2021-11-09 Jan Hubicka <hubicka@ucw.cz>
3796 * ipa-modref.c (analyze_parms): Add past_flags, past_retslot_flags
3797 and past_static_chain; merge past summary with current one.
3798 (analyze_function): Update.
3800 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3802 * doc/invoke.texi (Invoking GCC): Document --param=threader-debug.
3804 2021-11-09 Martin Liska <mliska@suse.cz>
3806 * print-rtl.c (rtx_writer::rtx_writer): Make the compilation
3807 conditional based on
3808 * print-rtl.h (class rtx_writer): Likewise.
3810 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
3812 * input.h (location_hash): Use 'BUILTINS_LOCATION' as spare value
3813 for 'Deleted'. Turn into a '#define'.
3815 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3817 * gimple-range-path.cc (path_range_query::dump): Clean up.
3818 (path_range_query::compute_ranges): Same.
3819 * value-relation.cc (path_oracle::dump): Same.
3821 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3823 * dumpfile.c (dump_options): Remove TDF_THREADING entry.
3824 * dumpfile.h (enum dump_flag): Remove TDF_THREADING and adjust
3826 * flag-types.h (enum threader_debug): New.
3827 * gimple-range-path.cc (DEBUG_SOLVER): Use param_threader_debug.
3828 * params.opt: Add entry for --param=threader-debug=.
3830 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
3832 PR middle-end/101204
3834 * diagnostic-spec.h (typedef xint_hash_t): Turn into...
3835 (struct xint_hash_t): ... this.
3836 * doc/gty.texi: Update.
3838 2021-11-09 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
3840 * config/arm/arm-cpus.in (armv9): New define.
3841 (ARMv9a): New group.
3842 (armv9-a): New arch definition.
3843 * config/arm/arm-tables.opt: Regenerate.
3844 * config/arm/arm.h (BASE_ARCH_9A): New arch enum value.
3845 * config/arm/t-aprofile: Added armv9-a and armv9+simd.
3846 * config/arm/t-arm-elf: Added arm9-a, v9_fps and all_v9_archs
3847 to MULTILIB_MATCHES.
3848 * config/arm/t-multilib: Added v9_a_nosimd_variants and
3849 v9_a_simd_variants to MULTILIB_MATCHES.
3850 * doc/invoke.texi: Update docs.
3852 2021-11-09 Martin Jambor <mjambor@suse.cz>
3855 * ipa-param-manipulation.c (replace_with_mapped_expr): Early
3856 return with error_mark_mode when part of expression is mapped to
3858 (ipa_param_body_adjustments::remap_with_debug_expressions): Set
3859 mapped value to NULL if walk_tree returns error_mark_mode.
3861 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3863 * tree-ssa-strlen.c (compare_nonzero_chars): Pass statement
3865 (get_addr_stridx): Same.
3867 (get_range_strlen_dynamic): Same.
3868 (handle_builtin_strlen): Same.
3869 (handle_builtin_strchr): Same.
3870 (handle_builtin_strcpy): Same.
3871 (maybe_diag_stxncpy_trunc): Same.
3872 (handle_builtin_stxncpy_strncat): Same.
3873 (handle_builtin_memcpy): Same.
3874 (handle_builtin_strcat): Same.
3875 (handle_alloc_call): Same.
3876 (handle_builtin_memset): Same.
3877 (handle_builtin_string_cmp): Same.
3878 (handle_pointer_plus): Same.
3879 (count_nonzero_bytes_addr): Same.
3880 (count_nonzero_bytes): Same.
3881 (handle_store): Same.
3882 (fold_strstr_to_strncmp): Same.
3883 (handle_integral_assign): Same.
3884 (check_and_optimize_stmt): Same.
3885 (class strlen_dom_walker): Replace evrp with ranger.
3886 (strlen_dom_walker::before_dom_children): Remove evrp.
3887 (strlen_dom_walker::after_dom_children): Remove evrp.
3888 * gimple-ssa-warn-access.cc (maybe_check_access_sizes):
3889 Restrict sprintf output.
3891 2021-11-09 Martin Liska <mliska@suse.cz>
3893 * genconditions.c (write_one_condition): Add const qualifier
3896 2021-11-09 Jeff Law <jeffreyalaw@gmail.com>
3898 * match.pd: New pattern to simplify (1 << n) & M ==/!= 0 for M
3901 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
3903 * config/rs6000/rs6000-builtin-new.def (VEC_INIT_V16QI): Use
3904 escape-newline support.
3905 (VEC_INIT_V4SI): Likewise.
3906 (VEC_INIT_V8HI): Likewise.
3907 (PACK_V1TI): Likewise.
3909 (VFIRSTMISMATCHOREOSINDEX_V16QI): Likewise.
3910 (VFIRSTMISMATCHOREOSINDEX_V8HI): Likewise.
3911 (VFIRSTMISMATCHOREOSINDEX_V4SI): Likewise.
3915 (FMAF128_ODD): Likewise.
3916 (VSCEQPUO): Likewise.
3918 (VSIEQPF): Likewise.
3919 (VSTDCQP): Likewise.
3920 (PACK_TD): Likewise.
3921 (TABORTDC): Likewise.
3922 (TABORTDCI): Likewise.
3923 (SE_LXVRBX): Likewise.
3924 (SE_LXVRHX): Likewise.
3925 (SE_LXVRWX): Likewise.
3926 (SE_LXVRDX): Likewise.
3927 (VREPLACE_UN_UV2DI): Likewise.
3928 (VREPLACE_UN_UV4SI): Likewise.
3929 (VREPLACE_UN_V2DI): Likewise.
3930 (VREPLACE_ELT_UV2DI): Likewise.
3931 (VREPLACE_ELT_V2DI): Likewise.
3932 (ZE_LXVRBX): Likewise.
3933 (ZE_LXVRHX): Likewise.
3934 (ZE_LXVRWX): Likewise.
3935 (ZE_LXVRDX): Likewise.
3937 (CNTLZDM): Likewise.
3938 (CNTTZDM): Likewise.
3941 (PMXVBF16GER2): Likewise.
3942 (PMXVBF16GER2_INTERNAL): Likewise.
3943 (PMXVBF16GER2NN): Likewise.
3944 (PMXVBF16GER2NN_INTERNAL): Likewise.
3945 (PMXVBF16GER2NP): Likewise.
3946 (PMXVBF16GER2NP_INTERNAL): Likewise.
3947 (PMXVBF16GER2PN): Likewise.
3948 (PMXVBF16GER2PN_INTERNAL): Likewise.
3949 (PMXVBF16GER2PP): Likewise.
3950 (PMXVBF16GER2PP_INTERNAL): Likewise.
3951 (PMXVF16GER2): Likewise.
3952 (PMXVF16GER2_INTERNAL): Likewise.
3953 (PMXVF16GER2NN): Likewise.
3954 (PMXVF16GER2NN_INTERNAL): Likewise.
3955 (PMXVF16GER2NP): Likewise.
3956 (PMXVF16GER2NP_INTERNAL): Likewise.
3957 (PMXVF16GER2PN): Likewise.
3958 (PMXVF16GER2PN_INTERNAL): Likewise.
3959 (PMXVF16GER2PP): Likewise.
3960 (PMXVF16GER2PP_INTERNAL): Likewise.
3961 (PMXVF32GER_INTERNAL): Likewise.
3962 (PMXVF32GERNN): Likewise.
3963 (PMXVF32GERNN_INTERNAL): Likewise.
3964 (PMXVF32GERNP): Likewise.
3965 (PMXVF32GERNP_INTERNAL): Likewise.
3966 (PMXVF32GERPN): Likewise.
3967 (PMXVF32GERPN_INTERNAL): Likewise.
3968 (PMXVF32GERPP): Likewise.
3969 (PMXVF32GERPP_INTERNAL): Likewise.
3970 (PMXVF64GER): Likewise.
3971 (PMXVF64GER_INTERNAL): Likewise.
3972 (PMXVF64GERNN): Likewise.
3973 (PMXVF64GERNN_INTERNAL): Likewise.
3974 (PMXVF64GERNP): Likewise.
3975 (PMXVF64GERNP_INTERNAL): Likewise.
3976 (PMXVF64GERPN): Likewise.
3977 (PMXVF64GERPN_INTERNAL): Likewise.
3978 (PMXVF64GERPP): Likewise.
3979 (PMXVF64GERPP_INTERNAL): Likewise.
3980 (PMXVI16GER2): Likewise.
3981 (PMXVI16GER2_INTERNAL): Likewise.
3982 (PMXVI16GER2PP): Likewise.
3983 (PMXVI16GER2PP_INTERNAL): Likewise.
3984 (PMXVI16GER2S): Likewise.
3985 (PMXVI16GER2S_INTERNAL): Likewise.
3986 (PMXVI16GER2SPP): Likewise.
3987 (PMXVI16GER2SPP_INTERNAL): Likewise.
3988 (PMXVI4GER8): Likewise.
3989 (PMXVI4GER8_INTERNAL): Likewise.
3990 (PMXVI4GER8PP): Likewise.
3991 (PMXVI4GER8PP_INTERNAL): Likewise.
3992 (PMXVI8GER4): Likewise.
3993 (PMXVI8GER4_INTERNAL): Likewise.
3994 (PMXVI8GER4PP): Likewise.
3995 (PMXVI8GER4PP_INTERNAL): Likewise.
3996 (PMXVI8GER4SPP): Likewise.
3997 (PMXVI8GER4SPP_INTERNAL): Likewise.
3998 * config/rs6000/rs6000-gen-builtins.c (MAXLINES): New macro.
3999 (linebuf): Increase size.
4000 (lines): New variable.
4001 (lastline): Likewise.
4002 (real_line_pos): New function.
4003 (diag): Change signature.
4004 (bif_diag): Change signature; support escape-newline handling.
4005 (ovld_diag): Likewise.
4006 (fatal): Move earlier.
4007 (consume_whitespace): Adjust diag call.
4008 (advance_line): Add escape-newline handling; call fatal.
4009 (safe_inc_pos): Adjust diag call.
4010 (match_identifier): Likewise.
4011 (match_integer): Likewise.
4012 (match_to_right_bracket): Call fatal instead of diag; adjust diag
4014 (match_basetype): Adjust diag calls.
4015 (match_bracketed_pair): Likewise.
4016 (match_const_restriction): Likewise.
4017 (match_type): Likewise.
4018 (parse_args): Likewise.
4019 (parse_bif_attrs): Likewise.
4020 (complete_vector_type): Likewise.
4021 (complete_base_type): Likewise.
4022 (parse_prototype): Likewise.
4023 (parse_bif_entry): Likewise.
4024 (parse_bif_stanza): Likewise.
4025 (parse_ovld_entry): Likewise.
4026 (parse_ovld_stanza): Likewise.
4027 (main): Allocate buffers for lines[].
4029 2021-11-08 Eric Botcazou <ebotcazou@adacore.com>
4031 * config/i386/i386.c (legitimize_pic_address): Adjust comment and
4032 use the REG argument on the CM_LARGE_PIC code path as well.
4033 * config/i386/predicates.md (gotoff_operand): Do not treat VxWorks
4034 specially with the large code models.
4036 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4038 * gimple.c (gimple_call_static_chain_flags): Revert the workaround
4039 allowing interposition since issues with binds_to_local_def were
4042 2021-11-08 Andrew MacLeod <amacleod@redhat.com>
4044 PR tree-optimization/103122
4045 * gimple-range.cc (gimple_ranger::range_of_expr): Request the cache
4046 entry with "calulate new values" set to false.
4048 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4050 * builtins.c (is_simple_builtin): Add builitin_dwarf_cfa
4051 and builtin_return_address.
4052 (builtin_fnspec): Annotate builtin_return,
4053 bulitin_eh_pointer, builtin_eh_filter, builtin_unwind_resume,
4054 builtin_cxa_end_cleanup, builtin_eh_copy_values,
4055 builtin_frame_address, builtin_apply_args,
4056 builtin_asan_before_dynamic_init, builtin_asan_after_dynamic_init,
4057 builtin_prefetch, builtin_dwarf_cfa, builtin_return_addrss
4059 * ipa-pure-const.c (special_builtin_state): Add builtin_dwarf_cfa
4060 and builtin_return_address.
4062 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4064 PR tree-optimization/103177
4065 * passes.def: Move uncprop after pure/const and modref.
4067 2021-11-08 Martin Jambor <mjambor@suse.cz>
4071 * tree-inline.c (remap_gimple_stmt): Unshare the expression without
4072 location before invoking remap_with_debug_expressions on it.
4073 * ipa-param-manipulation.c
4074 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
4076 2021-11-08 David Edelsohn <dje.gcc@gmail.com>
4078 * config/rs6000/vsx.md (vsx_splat_v4si_di): Revert "wa"
4081 2021-11-08 Richard Biener <rguenther@suse.de>
4083 * cfgloop.c (verify_loop_structure): Use a temporary BB flag
4084 instead of an sbitmap to cache irreducible state.
4086 2021-11-08 Aldy Hernandez <aldyh@redhat.com>
4088 PR tree-optimization/103120
4089 * gimple-range-path.cc (path_range_query::range_defined_in_block):
4090 Bail if there's a cache entry.
4092 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
4094 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Use
4095 rs6000_builtin_decls_x when appropriate.
4096 (add_condition_to_bb): Likewise.
4097 (rs6000_atomic_assign_expand_fenv): Likewise.
4099 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
4101 * config/rs6000/rs6000-call.c (rs6000_new_builtin_decl): New function.
4102 (rs6000_builtin_decl): Call it.
4104 2021-11-08 Martin Liska <mliska@suse.cz>
4106 * cgraph.c (cgraph_node::dump): Dump it from decl.
4108 2021-11-08 Martin Liska <mliska@suse.cz>
4110 PR gcov-profile/100520
4111 * coverage.c (coverage_compute_profile_id): Strip .gk when
4112 compare debug is used.
4113 * system.h (endswith): New function.
4115 2021-11-08 Martin Liska <mliska@suse.cz>
4117 * cgraph.c (cgraph_node::dump): Dump static_chain_decl.
4119 2021-11-08 Thomas Schwinge <thomas@codesourcery.com>
4121 * config/rs6000/rbtree.c: Fix 'Copyright (C) 2020-21' into '2020-2021'
4122 * config/rs6000/rbtree.h: Likewise.
4123 * config/rs6000/rs6000-builtin-new.def: Likewise.
4124 * config/rs6000/rs6000-gen-builtins.c: Likewise.
4125 * config/rs6000/rs6000-overload.def: Likewise.
4127 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
4129 * config/aarch64/aarch64.c (aarch64_ldn_stn_vectors): New function.
4130 (aarch64_address_cost): Use it instead of testing for CImode and
4133 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
4135 * genmodes.c (emit_insn_modes_h): Define NUM_MODE_* macros.
4136 * expmed.h (NUM_MODE_INT): Delete in favor of genmodes definitions.
4137 (NUM_MODE_PARTIAL_INT, NUM_MODE_VECTOR_INT): Likewise.
4138 * real.h (real_format_for_mode): Use NUM_MODE_FLOAT and
4139 NUM_MODE_DECIMAL_FLOAT.
4140 (REAL_MODE_FORMAT): Likewise.
4142 2021-11-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
4144 * tree-vect-loop.c (vect_better_loop_vinfo_p): Change how epilogue loop
4147 2021-11-08 Richard Biener <rguenther@suse.de>
4149 * tree-vectorizer.h (vect_create_loop_vinfo): Add main_loop_info
4151 * tree-vect-loop.c (vect_create_loop_vinfo): Likewise. Set
4152 LOOP_VINFO_ORIG_LOOP_INFO and conditionalize set of
4153 LOOP_VINFO_NITERS_ASSUMPTIONS.
4154 (vect_analyze_loop_1): Adjust.
4155 (vect_analyze_loop): Move loop constraint setting and
4156 SCEV/niter reset here from vect_create_loop_vinfo to perform
4158 (vect_analyze_loop_form): Move dumping of symbolic niters
4159 here from vect_create_loop_vinfo.
4161 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
4163 * ipa-modref-tree.h (enum modref_special_parms): New enum.
4164 (struct modref_access_node): update for special parms.
4165 (struct modref_ref_node): Likewise.
4166 (struct modref_parm_map): Likewise.
4167 (struct modref_tree): Likewise.
4168 * ipa-modref.c (dump_access): Likewise.
4169 (get_access): Detect static chain.
4170 (parm_map_for_arg): Take tree as arg instead of
4172 (merge_call_side_effects): Compute map for static chain.
4173 (process_fnspec): Update.
4174 (struct escape_point): Remove retslot_arg and static_chain_arg.
4175 (analyze_parms): Update.
4176 (compute_parm_map): Update.
4177 (propagate_unknown_call): Update.
4178 (modref_propagate_in_scc): Update.
4179 (modref_merge_call_site_flags): Update.
4180 (ipa_merge_modref_summary_after_inlining): Update.
4181 * tree-ssa-alias.c (modref_may_conflict): Handle static chain.
4182 * ipa-modref-tree.c (test_merge): Update.
4184 2021-11-08 Haochen Gui <guihaoc@gcc.gnu.org>
4186 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin): Disable
4187 gimple fold for VSX_BUILTIN_XVMINDP, ALTIVEC_BUILTIN_VMINFP,
4188 VSX_BUILTIN_XVMAXDP, ALTIVEC_BUILTIN_VMAXFP when fast-math is not
4191 2021-11-08 liuhongt <hongtao.liu@intel.com>
4193 PR tree-optimization/103077
4194 * doc/invoke.texi (Options That Control Optimization):
4195 Update documentation for -ftree-loop-vectorize and
4196 -ftree-slp-vectorize which are enabled by default at -02.
4198 2021-11-08 liuhongt <hongtao.liu@intel.com>
4201 * match.pd (Simplifcation (trunc)copysign((extend)a, (extend)b)
4202 to .COPYSIGN (a, b)): Add !HONOR_SNANS.
4204 2021-11-08 liuhongt <hongtao.liu@intel.com>
4207 * match.pd: Simplify
4208 (trunc)fma ((extend)a, (extend)b, (extend)c) to IFN_FMA (a, b,
4209 c) under flag_unsafe_math_optimizations.
4211 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4214 * params.opt (modref-max-depth): Add range.
4215 (modref-max-adjustments): Fix range.
4217 2021-11-07 Aldy Hernandez <aldyh@redhat.com>
4219 * tree-pass.h (make_pass_vrp_threader): Remove.
4220 * tree-ssa-threadbackward.c
4221 (back_threader_profitability::profitable_path_p): Remove
4222 ASSERT_EXPR references.
4223 * tree-ssa-threadedge.c (jt_state::register_equivs_stmt): Same.
4224 * tree-vrp.c (vrp_folder::simplify_casted_conds): Same.
4225 (execute_vrp): Same.
4226 (class hybrid_threader): Remove.
4227 (hybrid_threader::hybrid_threader): Remove.
4228 (hybrid_threader::~hybrid_threader): Remove.
4229 (hybrid_threader::before_dom_children): Remove.
4230 (hybrid_threader::after_dom_children): Remove.
4231 (execute_vrp_threader): Remove.
4232 (class pass_vrp_threader): Remove.
4233 (make_pass_vrp_threader): Remove.
4235 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4237 * gimple.c (gimple_call_arg_flags): Use interposable_eaf_flags.
4238 (gimple_call_retslot_flags): Likewise.
4239 (gimple_call_static_chain_flags): Likewise.
4240 * ipa-modref.c (remove_useless_eaf_flags): Do not remove everything for
4242 (modref_summary::useful_p): Likewise.
4243 (modref_summary_lto::useful_p): Likewise.
4244 (analyze_parms): Do not give up on NOVOPS.
4245 (analyze_function): When dumping report chnages in EAF flags
4246 between IPA and local pass.
4247 (modref_merge_call_site_flags): Compute implicit eaf flags
4248 based on callee ecf_flags and fnspec; if the function does not
4249 bind to current defs use interposable_eaf_flags.
4250 (modref_propagate_flags_in_scc): Update.
4251 * ipa-modref.h (interposable_eaf_flags): New function.
4253 2021-11-07 Bill Schmidt <wschmidt@linux.ibm.com>
4255 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): New
4257 (rs6000_invalid_new_builtin): New function.
4258 (rs6000_expand_builtin): Call rs6000_expand_new_builtin.
4259 (rs6000_expand_ldst_mask): New function.
4260 (new_cpu_expand_builtin): Likewise.
4261 (elemrev_icode): Likewise.
4262 (ldv_expand_builtin): Likewise.
4263 (lxvrse_expand_builtin): Likewise.
4264 (lxvrze_expand_builtin): Likewise.
4265 (stv_expand_builtin): Likewise.
4266 (new_mma_expand_builtin): Likewise.
4267 (new_htm_spr_num): Likewise.
4268 (new_htm_expand_builtin): Likewise.
4269 (rs6000_expand_new_builtin): Likewise.
4270 (rs6000_init_builtins): Initialize altivec_builtin_mask_for_load.
4272 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4274 * ipa-modref.c (modref_lattice): Add do_dataflow,
4275 changed and propagate_to fields.
4276 (modref_lattice::release): Free propagate_to
4277 (modref_lattice::merge): Do not give up early on unknown
4279 (modref_lattice::merge_deref): Likewise.
4280 (modref_eaf_analysis): Update toplevel comment.
4281 (modref_eaf_analysis::analyze_ssa_name): Record postponned ssa names;
4282 do optimistic dataflow initialization.
4283 (modref_eaf_analysis::merge_with_ssa_name): Build dataflow graph.
4284 (modref_eaf_analysis::propagate): New member function.
4285 (analyze_parms): Update to new API of modref_eaf_analysis.
4287 2021-11-06 Jan Hubicka <hubicka@ucw.cz>
4289 * cgraph.h (cgraph_node::can_be_discarded_p): Do not
4290 return true on functions from other partition.
4292 2021-11-06 David Edelsohn <dje.gcc@gmail.com>
4294 * config/rs6000/vsx.md (vsx_splat_v4si): Change constraints to "wa".
4295 (vsx_splat_v4si_di): Change constraint to "wa".
4297 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
4299 PR tree-optimization/103061
4300 * value-relation.cc (path_oracle::path_oracle): Initialize
4302 (path_oracle::killing_def): Set m_killed_defs.
4303 (path_oracle::query_relation): Do not look at the root oracle for
4305 * value-relation.h (class path_oracle): Add m_killed_defs.
4307 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
4309 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
4310 Remove gotos and other cleanups.
4312 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
4315 * ipa-modref-tree.h (modref_tree::insert): Do nothing for
4316 paradoxical and zero sized accesses.
4318 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
4321 * ipa-modref-tree.h (struct modref_access_node): Avoid left shift
4324 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
4326 * config/darwin.h (ASM_GENERATE_INTERNAL_LABEL): Add LTRAMP
4327 to the list of symbol prefixes that must be made linker-
4330 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
4332 * config.host: Add support for aarch64-*-darwin.
4333 * config/aarch64/host-aarch64-darwin.c: New file.
4334 * config/aarch64/x-darwin: New file.
4336 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
4338 PR tree-optimization/103093
4339 * gimple-range-gori.cc (range_def_chain::get_imports): Remove assert.
4341 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
4343 PR tree-optimization/102943
4344 * gimple-range-cache.cc (class update_list): New.
4345 (update_list::add): Replace add_to_update.
4346 (update_list::pop): New.
4347 (ranger_cache::ranger_cache): Adjust.
4348 (ranger_cache::~ranger_cache): Adjust.
4349 (ranger_cache::add_to_update): Delete.
4350 (ranger_cache::propagate_cache): Adjust to new class.
4351 (ranger_cache::propagate_updated_value): Ditto.
4352 (ranger_cache::fill_block_cache): Ditto.
4353 * gimple-range-cache.h (class ranger_cache): Adjust to update class.
4355 2021-11-05 Richard Biener <rguenther@suse.de>
4357 * tree-vect-loop.c (vect_analyze_loop): Remove obsolete
4358 comment and expand on another one. Combine nested if.
4360 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
4363 * config/pa/pa-modes.def: Add OImode integer type.
4364 * config/pa/pa.c (pa_scalar_mode_supported_p): Allow TImode
4366 * config/pa/pa.h (MIN_UNITS_PER_WORD) Define to MIN_UNITS_PER_WORD
4367 to UNITS_PER_WORD if IN_LIBGCC2.
4368 * config/pa/pa.md (addti3, addvti3, subti3, subvti3, negti2,
4369 negvti2, ashlti3, shrpd_internal): New patterns.
4370 Change some multi instruction types to multi.
4372 2021-11-05 Jakub Jelinek <jakub@redhat.com>
4375 * config/i386/i386.h
4376 (stringop_algs::stringop_strategy::stringop_strategy): Make the ctor
4379 2021-11-05 Wilco Dijkstra <wdijkstr@arm.com>
4382 * config/aarch64/aarch64.c (aarch64_mov_operand_p): Strip the salt
4384 * config/aarch64/constraints.md: Support const in Usw.
4386 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
4388 * config/pa/pa.h (PREFERRED_DEBUGGING_TYPE): Define to DWARF2_DEBUG.
4389 * config/pa/pa64-hpux.h (PREFERRED_DEBUGGING_TYPE): Remove define.
4391 2021-11-05 Richard Biener <rguenther@suse.de>
4393 * tree-vectorizer.h (vec_info_shared::n_stmts): Add.
4394 (LOOP_VINFO_N_STMTS): Likewise.
4395 (vec_info_for_bb): Remove unused function.
4396 * tree-vectorizer.c (vec_info_shared::vec_info_shared):
4397 Initialize n_stmts member.
4398 * tree-vect-loop.c: Remove INCLUDE_FUNCTIONAL.
4399 (vect_create_loop_vinfo): Do not set loop->aux.
4400 (vect_analyze_loop_2): Do not get n_stmts as argument,
4401 instead use LOOP_VINFO_N_STMTS. Set LOOP_VINFO_VECTORIZABLE_P
4403 (vect_analyze_loop_1): Remove callback, get the mode iterator
4404 and autodetected_vector_mode as argument, advancing the
4405 iterator and initializing autodetected_vector_mode here.
4406 (vect_analyze_loop): Split analysis loop into two, first
4407 processing main loops only and then epilogues.
4409 2021-11-05 Martin Jambor <mjambor@suse.cz>
4411 * ipa-prop.c (compute_complex_assign_jump_func): Remove
4412 unnecessary check for RECORD_TYPE.
4414 2021-11-05 Gerald Pfeifer <gerald@pfeifer.com>
4416 * doc/install.texi2html: Do not generate old.html any longer.
4418 2021-11-05 Martin Liska <mliska@suse.cz>
4421 * opts.c (finish_options): Reset flag_gtoggle when it is used.
4423 2021-11-05 Jakub Jelinek <jakub@redhat.com>
4426 * dwarf2out.c (add_const_value_attribute): Add MODE argument, use it
4427 in CONST_WIDE_INT handling. Adjust recursive calls.
4428 (add_location_or_const_value_attribute): Pass DECL_MODE (decl) to
4429 new add_const_value_attribute argument.
4430 (tree_add_const_value_attribute): Pass TYPE_MODE (type) to new
4431 add_const_value_attribute argument.
4433 2021-11-05 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
4435 * config/vx-common.h: Test value of TARGET_VXWORKS7 rather
4438 2021-11-05 Richard Biener <rguenther@suse.de>
4440 * tree-vectorizer.h (struct vect_loop_form_info): New.
4441 (vect_analyze_loop_form): Adjust.
4442 (vect_create_loop_vinfo): New.
4443 * tree-parloops.c (gather_scalar_reductions): Adjust for
4444 vect_analyze_loop_form API change.
4445 * tree-vect-loop.c: Include <functional>.
4446 (vect_analyze_loop_form_1): Rename to vect_analyze_loop_form,
4447 take struct vect_loop_form_info as output parameter and adjust.
4448 (vect_analyze_loop_form): Rename to vect_create_loop_vinfo and
4449 split out call to the original vect_analyze_loop_form_1.
4450 (vect_reanalyze_as_main_loop): Rename to...
4451 (vect_analyze_loop_1): ... this, factor out the call to
4452 vect_analyze_loop_form and generalize to be able to use it twice ...
4453 (vect_analyze_loop): ... here. Perform vect_analyze_loop_form
4456 2021-11-05 Xionghu Luo <luoxhu@linux.ibm.com>
4459 * config/rs6000/fusion.md: Regenerate.
4460 * config/rs6000/genfusion.pl: Fix incorrect clobber constraint.
4462 2021-11-04 Andreas Krebbel <krebbel@linux.ibm.com>
4464 * config/s390/s390.h (STACK_CHECK_MOVING_SP): New macro
4467 2021-11-04 Tamar Christina <tamar.christina@arm.com>
4469 * config/aarch64/aarch64-builtins.c
4470 (aarch64_general_gimple_fold_builtin): Add ashl, sshl, ushl, ashr,
4471 ashr_simd, lshr, lshr_simd.
4472 * config/aarch64/aarch64-simd-builtins.def (lshr): Use USHIFTIMM.
4473 * config/aarch64/arm_neon.h (vshr_n_u8, vshr_n_u16, vshr_n_u32,
4474 vshrq_n_u8, vshrq_n_u16, vshrq_n_u32, vshrq_n_u64): Fix type hack.
4476 2021-11-04 Tamar Christina <tamar.christina@arm.com>
4478 * match.pd: New negate+shift pattern.
4480 2021-11-04 Andrew MacLeod <amacleod@redhat.com>
4482 PR tree-optimization/103079
4483 * gimple-range-gori.cc (gimple_range_calc_op1): Treat undefined as
4485 (gimple_range_calc_op2): Ditto.
4487 2021-11-04 Martin Jambor <mjambor@suse.cz>
4490 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
4491 members remap_with_debug_expressions, m_dead_ssa_debug_equiv,
4492 m_dead_stmt_debug_equiv and prepare_debug_expressions. Added
4493 parameter to mark_dead_statements.
4494 * ipa-param-manipulation.c: Include tree-phinodes.h and cfgexpand.h.
4495 (ipa_param_body_adjustments::mark_dead_statements): New parameter
4496 debugstack, push into it all SSA names used in debug statements,
4497 produce m_dead_ssa_debug_equiv mapping for the removed param.
4498 (replace_with_mapped_expr): New function.
4499 (ipa_param_body_adjustments::remap_with_debug_expressions): Likewise.
4500 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
4501 (ipa_param_body_adjustments::common_initialization): Gather and
4502 procecc SSA which will be removed but are in debug statements. Simplify.
4503 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
4505 * tree-inline.c (remap_gimple_stmt): Create a debug bind when possible
4506 when avoiding a copy of an unnecessary statement. Remap removed SSA
4507 names in existing debug statements.
4508 (tree_function_versioning): Do not create DEBUG_EXPR_DECL for removed
4509 parameters if we have already done so.
4511 2021-11-04 Jan Hubicka <hubicka@ucw.cz>
4514 * gimple.c (gimple_call_static_chain_flags): Handle case when
4515 nested function does not bind locally.
4517 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4519 * config/aarch64/aarch64.c (aarch64_function_value): Generate
4520 a register rtx for Neon vector-tuple modes.
4521 (aarch64_layout_arg): Likewise.
4523 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4525 * lower-subreg.c (simple_move): Prevent decomposition if
4526 modes are not tieable.
4528 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4529 Richard Sandiford <richard.sandiford@arm.com>
4531 * config/aarch64/aarch64-builtins.c (v2x8qi_UP): Define.
4532 (v2x4hi_UP): Likewise.
4533 (v2x4hf_UP): Likewise.
4534 (v2x4bf_UP): Likewise.
4535 (v2x2si_UP): Likewise.
4536 (v2x2sf_UP): Likewise.
4537 (v2x1di_UP): Likewise.
4538 (v2x1df_UP): Likewise.
4539 (v2x16qi_UP): Likewise.
4540 (v2x8hi_UP): Likewise.
4541 (v2x8hf_UP): Likewise.
4542 (v2x8bf_UP): Likewise.
4543 (v2x4si_UP): Likewise.
4544 (v2x4sf_UP): Likewise.
4545 (v2x2di_UP): Likewise.
4546 (v2x2df_UP): Likewise.
4547 (v3x8qi_UP): Likewise.
4548 (v3x4hi_UP): Likewise.
4549 (v3x4hf_UP): Likewise.
4550 (v3x4bf_UP): Likewise.
4551 (v3x2si_UP): Likewise.
4552 (v3x2sf_UP): Likewise.
4553 (v3x1di_UP): Likewise.
4554 (v3x1df_UP): Likewise.
4555 (v3x16qi_UP): Likewise.
4556 (v3x8hi_UP): Likewise.
4557 (v3x8hf_UP): Likewise.
4558 (v3x8bf_UP): Likewise.
4559 (v3x4si_UP): Likewise.
4560 (v3x4sf_UP): Likewise.
4561 (v3x2di_UP): Likewise.
4562 (v3x2df_UP): Likewise.
4563 (v4x8qi_UP): Likewise.
4564 (v4x4hi_UP): Likewise.
4565 (v4x4hf_UP): Likewise.
4566 (v4x4bf_UP): Likewise.
4567 (v4x2si_UP): Likewise.
4568 (v4x2sf_UP): Likewise.
4569 (v4x1di_UP): Likewise.
4570 (v4x1df_UP): Likewise.
4571 (v4x16qi_UP): Likewise.
4572 (v4x8hi_UP): Likewise.
4573 (v4x8hf_UP): Likewise.
4574 (v4x8bf_UP): Likewise.
4575 (v4x4si_UP): Likewise.
4576 (v4x4sf_UP): Likewise.
4577 (v4x2di_UP): Likewise.
4578 (v4x2df_UP): Likewise.
4579 (TYPES_GETREGP): Delete.
4580 (TYPES_SETREGP): Likewise.
4581 (TYPES_LOADSTRUCT_U): Define.
4582 (TYPES_LOADSTRUCT_P): Likewise.
4583 (TYPES_LOADSTRUCT_LANE_U): Likewise.
4584 (TYPES_LOADSTRUCT_LANE_P): Likewise.
4585 (TYPES_STORE1P): Move for consistency.
4586 (TYPES_STORESTRUCT_U): Define.
4587 (TYPES_STORESTRUCT_P): Likewise.
4588 (TYPES_STORESTRUCT_LANE_U): Likewise.
4589 (TYPES_STORESTRUCT_LANE_P): Likewise.
4590 (aarch64_simd_tuple_types): Define.
4591 (aarch64_lookup_simd_builtin_type): Handle tuple type lookup.
4592 (aarch64_init_simd_builtin_functions): Update frontend lookup
4593 for builtin functions after handling arm_neon.h pragma.
4594 (register_tuple_type): Manually set modes of single-integer
4595 tuple types. Record tuple types.
4596 * config/aarch64/aarch64-modes.def
4597 (ADV_SIMD_D_REG_STRUCT_MODES): Define D-register tuple modes.
4598 (ADV_SIMD_Q_REG_STRUCT_MODES): Define Q-register tuple modes.
4599 (SVE_MODES): Give single-vector modes priority over vector-
4601 (VECTOR_MODES_WITH_PREFIX): Set partial-vector mode order to
4602 be after all single-vector modes.
4603 * config/aarch64/aarch64-simd-builtins.def: Update builtin
4604 generator macros to reflect modifications to the backend
4606 * config/aarch64/aarch64-simd.md (aarch64_simd_ld2<mode>):
4607 Use vector-tuple mode iterator and rename to...
4608 (aarch64_simd_ld2<vstruct_elt>): This.
4609 (aarch64_simd_ld2r<mode>): Use vector-tuple mode iterator and
4611 (aarch64_simd_ld2r<vstruct_elt>): This.
4612 (aarch64_vec_load_lanesoi_lane<mode>): Use vector-tuple mode
4613 iterator and rename to...
4614 (aarch64_vec_load_lanes<mode>_lane<vstruct_elt>): This.
4615 (vec_load_lanesoi<mode>): Use vector-tuple mode iterator and
4617 (vec_load_lanes<mode><vstruct_elt>): This.
4618 (aarch64_simd_st2<mode>): Use vector-tuple mode iterator and
4620 (aarch64_simd_st2<vstruct_elt>): This.
4621 (aarch64_vec_store_lanesoi_lane<mode>): Use vector-tuple mode
4622 iterator and rename to...
4623 (aarch64_vec_store_lanes<mode>_lane<vstruct_elt>): This.
4624 (vec_store_lanesoi<mode>): Use vector-tuple mode iterator and
4626 (vec_store_lanes<mode><vstruct_elt>): This.
4627 (aarch64_simd_ld3<mode>): Use vector-tuple mode iterator and
4629 (aarch64_simd_ld3<vstruct_elt>): This.
4630 (aarch64_simd_ld3r<mode>): Use vector-tuple mode iterator and
4632 (aarch64_simd_ld3r<vstruct_elt>): This.
4633 (aarch64_vec_load_lanesci_lane<mode>): Use vector-tuple mode
4634 iterator and rename to...
4635 (vec_load_lanesci<mode>): This.
4636 (aarch64_simd_st3<mode>): Use vector-tuple mode iterator and
4638 (aarch64_simd_st3<vstruct_elt>): This.
4639 (aarch64_vec_store_lanesci_lane<mode>): Use vector-tuple mode
4640 iterator and rename to...
4641 (vec_store_lanesci<mode>): This.
4642 (aarch64_simd_ld4<mode>): Use vector-tuple mode iterator and
4644 (aarch64_simd_ld4<vstruct_elt>): This.
4645 (aarch64_simd_ld4r<mode>): Use vector-tuple mode iterator and
4647 (aarch64_simd_ld4r<vstruct_elt>): This.
4648 (aarch64_vec_load_lanesxi_lane<mode>): Use vector-tuple mode
4649 iterator and rename to...
4650 (vec_load_lanesxi<mode>): This.
4651 (aarch64_simd_st4<mode>): Use vector-tuple mode iterator and
4653 (aarch64_simd_st4<vstruct_elt>): This.
4654 (aarch64_vec_store_lanesxi_lane<mode>): Use vector-tuple mode
4655 iterator and rename to...
4656 (vec_store_lanesxi<mode>): This.
4657 (mov<mode>): Define for Neon vector-tuple modes.
4658 (aarch64_ld1x3<VALLDIF:mode>): Use vector-tuple mode iterator
4660 (aarch64_ld1x3<vstruct_elt>): This.
4661 (aarch64_ld1_x3_<mode>): Use vector-tuple mode iterator and
4663 (aarch64_ld1_x3_<vstruct_elt>): This.
4664 (aarch64_ld1x4<VALLDIF:mode>): Use vector-tuple mode iterator
4666 (aarch64_ld1x4<vstruct_elt>): This.
4667 (aarch64_ld1_x4_<mode>): Use vector-tuple mode iterator and
4669 (aarch64_ld1_x4_<vstruct_elt>): This.
4670 (aarch64_st1x2<VALLDIF:mode>): Use vector-tuple mode iterator
4672 (aarch64_st1x2<vstruct_elt>): This.
4673 (aarch64_st1_x2_<mode>): Use vector-tuple mode iterator and
4675 (aarch64_st1_x2_<vstruct_elt>): This.
4676 (aarch64_st1x3<VALLDIF:mode>): Use vector-tuple mode iterator
4678 (aarch64_st1x3<vstruct_elt>): This.
4679 (aarch64_st1_x3_<mode>): Use vector-tuple mode iterator and
4681 (aarch64_st1_x3_<vstruct_elt>): This.
4682 (aarch64_st1x4<VALLDIF:mode>): Use vector-tuple mode iterator
4684 (aarch64_st1x4<vstruct_elt>): This.
4685 (aarch64_st1_x4_<mode>): Use vector-tuple mode iterator and
4687 (aarch64_st1_x4_<vstruct_elt>): This.
4688 (*aarch64_mov<mode>): Define for vector-tuple modes.
4689 (*aarch64_be_mov<mode>): Likewise.
4690 (aarch64_ld<VSTRUCT:nregs>r<VALLDIF:mode>): Use vector-tuple
4691 mode iterator and rename to...
4692 (aarch64_ld<nregs>r<vstruct_elt>): This.
4693 (aarch64_ld2<mode>_dreg): Use vector-tuple mode iterator and
4695 (aarch64_ld2<vstruct_elt>_dreg): This.
4696 (aarch64_ld3<mode>_dreg): Use vector-tuple mode iterator and
4698 (aarch64_ld3<vstruct_elt>_dreg): This.
4699 (aarch64_ld4<mode>_dreg): Use vector-tuple mode iterator and
4701 (aarch64_ld4<vstruct_elt>_dreg): This.
4702 (aarch64_ld<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
4703 iterator and rename to...
4704 (aarch64_ld<nregs><vstruct_elt>): Use vector-tuple mode
4705 iterator and rename to...
4706 (aarch64_ld<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
4707 (aarch64_ld1x2<VQ:mode>): Delete.
4708 (aarch64_ld1x2<VDC:mode>): Use vector-tuple mode iterator and
4710 (aarch64_ld1x2<vstruct_elt>): This.
4711 (aarch64_ld<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
4712 tuple mode iterator and rename to...
4713 (aarch64_ld<nregs>_lane<vstruct_elt>): This.
4714 (aarch64_get_dreg<VSTRUCT:mode><VDC:mode>): Delete.
4715 (aarch64_get_qreg<VSTRUCT:mode><VQ:mode>): Likewise.
4716 (aarch64_st2<mode>_dreg): Use vector-tuple mode iterator and
4718 (aarch64_st2<vstruct_elt>_dreg): This.
4719 (aarch64_st3<mode>_dreg): Use vector-tuple mode iterator and
4721 (aarch64_st3<vstruct_elt>_dreg): This.
4722 (aarch64_st4<mode>_dreg): Use vector-tuple mode iterator and
4724 (aarch64_st4<vstruct_elt>_dreg): This.
4725 (aarch64_st<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
4726 iterator and rename to...
4727 (aarch64_st<nregs><vstruct_elt>): This.
4728 (aarch64_st<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
4729 iterator and rename to aarch64_st<nregs><vstruct_elt>.
4730 (aarch64_st<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
4731 tuple mode iterator and rename to...
4732 (aarch64_st<nregs>_lane<vstruct_elt>): This.
4733 (aarch64_set_qreg<VSTRUCT:mode><VQ:mode>): Delete.
4734 (aarch64_simd_ld1<mode>_x2): Use vector-tuple mode iterator
4736 (aarch64_simd_ld1<vstruct_elt>_x2): This.
4737 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
4738 Refactor to include new vector-tuple modes.
4739 (aarch64_classify_vector_mode): Add cases for new vector-
4741 (aarch64_advsimd_partial_struct_mode_p): Define.
4742 (aarch64_advsimd_full_struct_mode_p): Likewise.
4743 (aarch64_advsimd_vector_array_mode): Likewise.
4744 (aarch64_sve_data_mode): Change location in file.
4745 (aarch64_array_mode): Handle case of Neon vector-tuple modes.
4746 (aarch64_hard_regno_nregs): Handle case of partial Neon
4748 (aarch64_classify_address): Refactor to include handling of
4749 Neon vector-tuple modes.
4750 (aarch64_print_operand): Print "d" for "%R" for a partial
4751 Neon vector structure.
4752 (aarch64_expand_vec_perm_1): Use new vector-tuple mode.
4753 (aarch64_modes_tieable_p): Prevent tieing Neon partial struct
4754 modes with scalar machines modes larger than 8 bytes.
4755 (aarch64_can_change_mode_class): Don't allow changes between
4756 partial and full Neon vector-structure modes.
4757 * config/aarch64/arm_neon.h (vst2_lane_f16): Use updated
4758 builtin and remove boiler-plate code for opaque mode.
4759 (vst2_lane_f32): Likewise.
4760 (vst2_lane_f64): Likewise.
4761 (vst2_lane_p8): Likewise.
4762 (vst2_lane_p16): Likewise.
4763 (vst2_lane_p64): Likewise.
4764 (vst2_lane_s8): Likewise.
4765 (vst2_lane_s16): Likewise.
4766 (vst2_lane_s32): Likewise.
4767 (vst2_lane_s64): Likewise.
4768 (vst2_lane_u8): Likewise.
4769 (vst2_lane_u16): Likewise.
4770 (vst2_lane_u32): Likewise.
4771 (vst2_lane_u64): Likewise.
4772 (vst2q_lane_f16): Likewise.
4773 (vst2q_lane_f32): Likewise.
4774 (vst2q_lane_f64): Likewise.
4775 (vst2q_lane_p8): Likewise.
4776 (vst2q_lane_p16): Likewise.
4777 (vst2q_lane_p64): Likewise.
4778 (vst2q_lane_s8): Likewise.
4779 (vst2q_lane_s16): Likewise.
4780 (vst2q_lane_s32): Likewise.
4781 (vst2q_lane_s64): Likewise.
4782 (vst2q_lane_u8): Likewise.
4783 (vst2q_lane_u16): Likewise.
4784 (vst2q_lane_u32): Likewise.
4785 (vst2q_lane_u64): Likewise.
4786 (vst3_lane_f16): Likewise.
4787 (vst3_lane_f32): Likewise.
4788 (vst3_lane_f64): Likewise.
4789 (vst3_lane_p8): Likewise.
4790 (vst3_lane_p16): Likewise.
4791 (vst3_lane_p64): Likewise.
4792 (vst3_lane_s8): Likewise.
4793 (vst3_lane_s16): Likewise.
4794 (vst3_lane_s32): Likewise.
4795 (vst3_lane_s64): Likewise.
4796 (vst3_lane_u8): Likewise.
4797 (vst3_lane_u16): Likewise.
4798 (vst3_lane_u32): Likewise.
4799 (vst3_lane_u64): Likewise.
4800 (vst3q_lane_f16): Likewise.
4801 (vst3q_lane_f32): Likewise.
4802 (vst3q_lane_f64): Likewise.
4803 (vst3q_lane_p8): Likewise.
4804 (vst3q_lane_p16): Likewise.
4805 (vst3q_lane_p64): Likewise.
4806 (vst3q_lane_s8): Likewise.
4807 (vst3q_lane_s16): Likewise.
4808 (vst3q_lane_s32): Likewise.
4809 (vst3q_lane_s64): Likewise.
4810 (vst3q_lane_u8): Likewise.
4811 (vst3q_lane_u16): Likewise.
4812 (vst3q_lane_u32): Likewise.
4813 (vst3q_lane_u64): Likewise.
4814 (vst4_lane_f16): Likewise.
4815 (vst4_lane_f32): Likewise.
4816 (vst4_lane_f64): Likewise.
4817 (vst4_lane_p8): Likewise.
4818 (vst4_lane_p16): Likewise.
4819 (vst4_lane_p64): Likewise.
4820 (vst4_lane_s8): Likewise.
4821 (vst4_lane_s16): Likewise.
4822 (vst4_lane_s32): Likewise.
4823 (vst4_lane_s64): Likewise.
4824 (vst4_lane_u8): Likewise.
4825 (vst4_lane_u16): Likewise.
4826 (vst4_lane_u32): Likewise.
4827 (vst4_lane_u64): Likewise.
4828 (vst4q_lane_f16): Likewise.
4829 (vst4q_lane_f32): Likewise.
4830 (vst4q_lane_f64): Likewise.
4831 (vst4q_lane_p8): Likewise.
4832 (vst4q_lane_p16): Likewise.
4833 (vst4q_lane_p64): Likewise.
4834 (vst4q_lane_s8): Likewise.
4835 (vst4q_lane_s16): Likewise.
4836 (vst4q_lane_s32): Likewise.
4837 (vst4q_lane_s64): Likewise.
4838 (vst4q_lane_u8): Likewise.
4839 (vst4q_lane_u16): Likewise.
4840 (vst4q_lane_u32): Likewise.
4841 (vst4q_lane_u64): Likewise.
4842 (vtbl3_s8): Likewise.
4843 (vtbl3_u8): Likewise.
4844 (vtbl3_p8): Likewise.
4845 (vtbl4_s8): Likewise.
4846 (vtbl4_u8): Likewise.
4847 (vtbl4_p8): Likewise.
4848 (vld1_u8_x3): Likewise.
4849 (vld1_s8_x3): Likewise.
4850 (vld1_u16_x3): Likewise.
4851 (vld1_s16_x3): Likewise.
4852 (vld1_u32_x3): Likewise.
4853 (vld1_s32_x3): Likewise.
4854 (vld1_u64_x3): Likewise.
4855 (vld1_s64_x3): Likewise.
4856 (vld1_f16_x3): Likewise.
4857 (vld1_f32_x3): Likewise.
4858 (vld1_f64_x3): Likewise.
4859 (vld1_p8_x3): Likewise.
4860 (vld1_p16_x3): Likewise.
4861 (vld1_p64_x3): Likewise.
4862 (vld1q_u8_x3): Likewise.
4863 (vld1q_s8_x3): Likewise.
4864 (vld1q_u16_x3): Likewise.
4865 (vld1q_s16_x3): Likewise.
4866 (vld1q_u32_x3): Likewise.
4867 (vld1q_s32_x3): Likewise.
4868 (vld1q_u64_x3): Likewise.
4869 (vld1q_s64_x3): Likewise.
4870 (vld1q_f16_x3): Likewise.
4871 (vld1q_f32_x3): Likewise.
4872 (vld1q_f64_x3): Likewise.
4873 (vld1q_p8_x3): Likewise.
4874 (vld1q_p16_x3): Likewise.
4875 (vld1q_p64_x3): Likewise.
4876 (vld1_u8_x2): Likewise.
4877 (vld1_s8_x2): Likewise.
4878 (vld1_u16_x2): Likewise.
4879 (vld1_s16_x2): Likewise.
4880 (vld1_u32_x2): Likewise.
4881 (vld1_s32_x2): Likewise.
4882 (vld1_u64_x2): Likewise.
4883 (vld1_s64_x2): Likewise.
4884 (vld1_f16_x2): Likewise.
4885 (vld1_f32_x2): Likewise.
4886 (vld1_f64_x2): Likewise.
4887 (vld1_p8_x2): Likewise.
4888 (vld1_p16_x2): Likewise.
4889 (vld1_p64_x2): Likewise.
4890 (vld1q_u8_x2): Likewise.
4891 (vld1q_s8_x2): Likewise.
4892 (vld1q_u16_x2): Likewise.
4893 (vld1q_s16_x2): Likewise.
4894 (vld1q_u32_x2): Likewise.
4895 (vld1q_s32_x2): Likewise.
4896 (vld1q_u64_x2): Likewise.
4897 (vld1q_s64_x2): Likewise.
4898 (vld1q_f16_x2): Likewise.
4899 (vld1q_f32_x2): Likewise.
4900 (vld1q_f64_x2): Likewise.
4901 (vld1q_p8_x2): Likewise.
4902 (vld1q_p16_x2): Likewise.
4903 (vld1q_p64_x2): Likewise.
4904 (vld1_s8_x4): Likewise.
4905 (vld1q_s8_x4): Likewise.
4906 (vld1_s16_x4): Likewise.
4907 (vld1q_s16_x4): Likewise.
4908 (vld1_s32_x4): Likewise.
4909 (vld1q_s32_x4): Likewise.
4910 (vld1_u8_x4): Likewise.
4911 (vld1q_u8_x4): Likewise.
4912 (vld1_u16_x4): Likewise.
4913 (vld1q_u16_x4): Likewise.
4914 (vld1_u32_x4): Likewise.
4915 (vld1q_u32_x4): Likewise.
4916 (vld1_f16_x4): Likewise.
4917 (vld1q_f16_x4): Likewise.
4918 (vld1_f32_x4): Likewise.
4919 (vld1q_f32_x4): Likewise.
4920 (vld1_p8_x4): Likewise.
4921 (vld1q_p8_x4): Likewise.
4922 (vld1_p16_x4): Likewise.
4923 (vld1q_p16_x4): Likewise.
4924 (vld1_s64_x4): Likewise.
4925 (vld1_u64_x4): Likewise.
4926 (vld1_p64_x4): Likewise.
4927 (vld1q_s64_x4): Likewise.
4928 (vld1q_u64_x4): Likewise.
4929 (vld1q_p64_x4): Likewise.
4930 (vld1_f64_x4): Likewise.
4931 (vld1q_f64_x4): Likewise.
4932 (vld2_s64): Likewise.
4933 (vld2_u64): Likewise.
4934 (vld2_f64): Likewise.
4935 (vld2_s8): Likewise.
4936 (vld2_p8): Likewise.
4937 (vld2_p64): Likewise.
4938 (vld2_s16): Likewise.
4939 (vld2_p16): Likewise.
4940 (vld2_s32): Likewise.
4941 (vld2_u8): Likewise.
4942 (vld2_u16): Likewise.
4943 (vld2_u32): Likewise.
4944 (vld2_f16): Likewise.
4945 (vld2_f32): Likewise.
4946 (vld2q_s8): Likewise.
4947 (vld2q_p8): Likewise.
4948 (vld2q_s16): Likewise.
4949 (vld2q_p16): Likewise.
4950 (vld2q_p64): Likewise.
4951 (vld2q_s32): Likewise.
4952 (vld2q_s64): Likewise.
4953 (vld2q_u8): Likewise.
4954 (vld2q_u16): Likewise.
4955 (vld2q_u32): Likewise.
4956 (vld2q_u64): Likewise.
4957 (vld2q_f16): Likewise.
4958 (vld2q_f32): Likewise.
4959 (vld2q_f64): Likewise.
4960 (vld3_s64): Likewise.
4961 (vld3_u64): Likewise.
4962 (vld3_f64): Likewise.
4963 (vld3_s8): Likewise.
4964 (vld3_p8): Likewise.
4965 (vld3_s16): Likewise.
4966 (vld3_p16): Likewise.
4967 (vld3_s32): Likewise.
4968 (vld3_u8): Likewise.
4969 (vld3_u16): Likewise.
4970 (vld3_u32): Likewise.
4971 (vld3_f16): Likewise.
4972 (vld3_f32): Likewise.
4973 (vld3_p64): Likewise.
4974 (vld3q_s8): Likewise.
4975 (vld3q_p8): Likewise.
4976 (vld3q_s16): Likewise.
4977 (vld3q_p16): Likewise.
4978 (vld3q_s32): Likewise.
4979 (vld3q_s64): Likewise.
4980 (vld3q_u8): Likewise.
4981 (vld3q_u16): Likewise.
4982 (vld3q_u32): Likewise.
4983 (vld3q_u64): Likewise.
4984 (vld3q_f16): Likewise.
4985 (vld3q_f32): Likewise.
4986 (vld3q_f64): Likewise.
4987 (vld3q_p64): Likewise.
4988 (vld4_s64): Likewise.
4989 (vld4_u64): Likewise.
4990 (vld4_f64): Likewise.
4991 (vld4_s8): Likewise.
4992 (vld4_p8): Likewise.
4993 (vld4_s16): Likewise.
4994 (vld4_p16): Likewise.
4995 (vld4_s32): Likewise.
4996 (vld4_u8): Likewise.
4997 (vld4_u16): Likewise.
4998 (vld4_u32): Likewise.
4999 (vld4_f16): Likewise.
5000 (vld4_f32): Likewise.
5001 (vld4_p64): Likewise.
5002 (vld4q_s8): Likewise.
5003 (vld4q_p8): Likewise.
5004 (vld4q_s16): Likewise.
5005 (vld4q_p16): Likewise.
5006 (vld4q_s32): Likewise.
5007 (vld4q_s64): Likewise.
5008 (vld4q_u8): Likewise.
5009 (vld4q_u16): Likewise.
5010 (vld4q_u32): Likewise.
5011 (vld4q_u64): Likewise.
5012 (vld4q_f16): Likewise.
5013 (vld4q_f32): Likewise.
5014 (vld4q_f64): Likewise.
5015 (vld4q_p64): Likewise.
5016 (vld2_dup_s8): Likewise.
5017 (vld2_dup_s16): Likewise.
5018 (vld2_dup_s32): Likewise.
5019 (vld2_dup_f16): Likewise.
5020 (vld2_dup_f32): Likewise.
5021 (vld2_dup_f64): Likewise.
5022 (vld2_dup_u8): Likewise.
5023 (vld2_dup_u16): Likewise.
5024 (vld2_dup_u32): Likewise.
5025 (vld2_dup_p8): Likewise.
5026 (vld2_dup_p16): Likewise.
5027 (vld2_dup_p64): Likewise.
5028 (vld2_dup_s64): Likewise.
5029 (vld2_dup_u64): Likewise.
5030 (vld2q_dup_s8): Likewise.
5031 (vld2q_dup_p8): Likewise.
5032 (vld2q_dup_s16): Likewise.
5033 (vld2q_dup_p16): Likewise.
5034 (vld2q_dup_s32): Likewise.
5035 (vld2q_dup_s64): Likewise.
5036 (vld2q_dup_u8): Likewise.
5037 (vld2q_dup_u16): Likewise.
5038 (vld2q_dup_u32): Likewise.
5039 (vld2q_dup_u64): Likewise.
5040 (vld2q_dup_f16): Likewise.
5041 (vld2q_dup_f32): Likewise.
5042 (vld2q_dup_f64): Likewise.
5043 (vld2q_dup_p64): Likewise.
5044 (vld3_dup_s64): Likewise.
5045 (vld3_dup_u64): Likewise.
5046 (vld3_dup_f64): Likewise.
5047 (vld3_dup_s8): Likewise.
5048 (vld3_dup_p8): Likewise.
5049 (vld3_dup_s16): Likewise.
5050 (vld3_dup_p16): Likewise.
5051 (vld3_dup_s32): Likewise.
5052 (vld3_dup_u8): Likewise.
5053 (vld3_dup_u16): Likewise.
5054 (vld3_dup_u32): Likewise.
5055 (vld3_dup_f16): Likewise.
5056 (vld3_dup_f32): Likewise.
5057 (vld3_dup_p64): Likewise.
5058 (vld3q_dup_s8): Likewise.
5059 (vld3q_dup_p8): Likewise.
5060 (vld3q_dup_s16): Likewise.
5061 (vld3q_dup_p16): Likewise.
5062 (vld3q_dup_s32): Likewise.
5063 (vld3q_dup_s64): Likewise.
5064 (vld3q_dup_u8): Likewise.
5065 (vld3q_dup_u16): Likewise.
5066 (vld3q_dup_u32): Likewise.
5067 (vld3q_dup_u64): Likewise.
5068 (vld3q_dup_f16): Likewise.
5069 (vld3q_dup_f32): Likewise.
5070 (vld3q_dup_f64): Likewise.
5071 (vld3q_dup_p64): Likewise.
5072 (vld4_dup_s64): Likewise.
5073 (vld4_dup_u64): Likewise.
5074 (vld4_dup_f64): Likewise.
5075 (vld4_dup_s8): Likewise.
5076 (vld4_dup_p8): Likewise.
5077 (vld4_dup_s16): Likewise.
5078 (vld4_dup_p16): Likewise.
5079 (vld4_dup_s32): Likewise.
5080 (vld4_dup_u8): Likewise.
5081 (vld4_dup_u16): Likewise.
5082 (vld4_dup_u32): Likewise.
5083 (vld4_dup_f16): Likewise.
5084 (vld4_dup_f32): Likewise.
5085 (vld4_dup_p64): Likewise.
5086 (vld4q_dup_s8): Likewise.
5087 (vld4q_dup_p8): Likewise.
5088 (vld4q_dup_s16): Likewise.
5089 (vld4q_dup_p16): Likewise.
5090 (vld4q_dup_s32): Likewise.
5091 (vld4q_dup_s64): Likewise.
5092 (vld4q_dup_u8): Likewise.
5093 (vld4q_dup_u16): Likewise.
5094 (vld4q_dup_u32): Likewise.
5095 (vld4q_dup_u64): Likewise.
5096 (vld4q_dup_f16): Likewise.
5097 (vld4q_dup_f32): Likewise.
5098 (vld4q_dup_f64): Likewise.
5099 (vld4q_dup_p64): Likewise.
5100 (vld2_lane_u8): Likewise.
5101 (vld2_lane_u16): Likewise.
5102 (vld2_lane_u32): Likewise.
5103 (vld2_lane_u64): Likewise.
5104 (vld2_lane_s8): Likewise.
5105 (vld2_lane_s16): Likewise.
5106 (vld2_lane_s32): Likewise.
5107 (vld2_lane_s64): Likewise.
5108 (vld2_lane_f16): Likewise.
5109 (vld2_lane_f32): Likewise.
5110 (vld2_lane_f64): Likewise.
5111 (vld2_lane_p8): Likewise.
5112 (vld2_lane_p16): Likewise.
5113 (vld2_lane_p64): Likewise.
5114 (vld2q_lane_u8): Likewise.
5115 (vld2q_lane_u16): Likewise.
5116 (vld2q_lane_u32): Likewise.
5117 (vld2q_lane_u64): Likewise.
5118 (vld2q_lane_s8): Likewise.
5119 (vld2q_lane_s16): Likewise.
5120 (vld2q_lane_s32): Likewise.
5121 (vld2q_lane_s64): Likewise.
5122 (vld2q_lane_f16): Likewise.
5123 (vld2q_lane_f32): Likewise.
5124 (vld2q_lane_f64): Likewise.
5125 (vld2q_lane_p8): Likewise.
5126 (vld2q_lane_p16): Likewise.
5127 (vld2q_lane_p64): Likewise.
5128 (vld3_lane_u8): Likewise.
5129 (vld3_lane_u16): Likewise.
5130 (vld3_lane_u32): Likewise.
5131 (vld3_lane_u64): Likewise.
5132 (vld3_lane_s8): Likewise.
5133 (vld3_lane_s16): Likewise.
5134 (vld3_lane_s32): Likewise.
5135 (vld3_lane_s64): Likewise.
5136 (vld3_lane_f16): Likewise.
5137 (vld3_lane_f32): Likewise.
5138 (vld3_lane_f64): Likewise.
5139 (vld3_lane_p8): Likewise.
5140 (vld3_lane_p16): Likewise.
5141 (vld3_lane_p64): Likewise.
5142 (vld3q_lane_u8): Likewise.
5143 (vld3q_lane_u16): Likewise.
5144 (vld3q_lane_u32): Likewise.
5145 (vld3q_lane_u64): Likewise.
5146 (vld3q_lane_s8): Likewise.
5147 (vld3q_lane_s16): Likewise.
5148 (vld3q_lane_s32): Likewise.
5149 (vld3q_lane_s64): Likewise.
5150 (vld3q_lane_f16): Likewise.
5151 (vld3q_lane_f32): Likewise.
5152 (vld3q_lane_f64): Likewise.
5153 (vld3q_lane_p8): Likewise.
5154 (vld3q_lane_p16): Likewise.
5155 (vld3q_lane_p64): Likewise.
5156 (vld4_lane_u8): Likewise.
5157 (vld4_lane_u16): Likewise.
5158 (vld4_lane_u32): Likewise.
5159 (vld4_lane_u64): Likewise.
5160 (vld4_lane_s8): Likewise.
5161 (vld4_lane_s16): Likewise.
5162 (vld4_lane_s32): Likewise.
5163 (vld4_lane_s64): Likewise.
5164 (vld4_lane_f16): Likewise.
5165 (vld4_lane_f32): Likewise.
5166 (vld4_lane_f64): Likewise.
5167 (vld4_lane_p8): Likewise.
5168 (vld4_lane_p16): Likewise.
5169 (vld4_lane_p64): Likewise.
5170 (vld4q_lane_u8): Likewise.
5171 (vld4q_lane_u16): Likewise.
5172 (vld4q_lane_u32): Likewise.
5173 (vld4q_lane_u64): Likewise.
5174 (vld4q_lane_s8): Likewise.
5175 (vld4q_lane_s16): Likewise.
5176 (vld4q_lane_s32): Likewise.
5177 (vld4q_lane_s64): Likewise.
5178 (vld4q_lane_f16): Likewise.
5179 (vld4q_lane_f32): Likewise.
5180 (vld4q_lane_f64): Likewise.
5181 (vld4q_lane_p8): Likewise.
5182 (vld4q_lane_p16): Likewise.
5183 (vld4q_lane_p64): Likewise.
5184 (vqtbl2_s8): Likewise.
5185 (vqtbl2_u8): Likewise.
5186 (vqtbl2_p8): Likewise.
5187 (vqtbl2q_s8): Likewise.
5188 (vqtbl2q_u8): Likewise.
5189 (vqtbl2q_p8): Likewise.
5190 (vqtbl3_s8): Likewise.
5191 (vqtbl3_u8): Likewise.
5192 (vqtbl3_p8): Likewise.
5193 (vqtbl3q_s8): Likewise.
5194 (vqtbl3q_u8): Likewise.
5195 (vqtbl3q_p8): Likewise.
5196 (vqtbl4_s8): Likewise.
5197 (vqtbl4_u8): Likewise.
5198 (vqtbl4_p8): Likewise.
5199 (vqtbl4q_s8): Likewise.
5200 (vqtbl4q_u8): Likewise.
5201 (vqtbl4q_p8): Likewise.
5202 (vqtbx2_s8): Likewise.
5203 (vqtbx2_u8): Likewise.
5204 (vqtbx2_p8): Likewise.
5205 (vqtbx2q_s8): Likewise.
5206 (vqtbx2q_u8): Likewise.
5207 (vqtbx2q_p8): Likewise.
5208 (vqtbx3_s8): Likewise.
5209 (vqtbx3_u8): Likewise.
5210 (vqtbx3_p8): Likewise.
5211 (vqtbx3q_s8): Likewise.
5212 (vqtbx3q_u8): Likewise.
5213 (vqtbx3q_p8): Likewise.
5214 (vqtbx4_s8): Likewise.
5215 (vqtbx4_u8): Likewise.
5216 (vqtbx4_p8): Likewise.
5217 (vqtbx4q_s8): Likewise.
5218 (vqtbx4q_u8): Likewise.
5219 (vqtbx4q_p8): Likewise.
5220 (vst1_s64_x2): Likewise.
5221 (vst1_u64_x2): Likewise.
5222 (vst1_f64_x2): Likewise.
5223 (vst1_s8_x2): Likewise.
5224 (vst1_p8_x2): Likewise.
5225 (vst1_s16_x2): Likewise.
5226 (vst1_p16_x2): Likewise.
5227 (vst1_s32_x2): Likewise.
5228 (vst1_u8_x2): Likewise.
5229 (vst1_u16_x2): Likewise.
5230 (vst1_u32_x2): Likewise.
5231 (vst1_f16_x2): Likewise.
5232 (vst1_f32_x2): Likewise.
5233 (vst1_p64_x2): Likewise.
5234 (vst1q_s8_x2): Likewise.
5235 (vst1q_p8_x2): Likewise.
5236 (vst1q_s16_x2): Likewise.
5237 (vst1q_p16_x2): Likewise.
5238 (vst1q_s32_x2): Likewise.
5239 (vst1q_s64_x2): Likewise.
5240 (vst1q_u8_x2): Likewise.
5241 (vst1q_u16_x2): Likewise.
5242 (vst1q_u32_x2): Likewise.
5243 (vst1q_u64_x2): Likewise.
5244 (vst1q_f16_x2): Likewise.
5245 (vst1q_f32_x2): Likewise.
5246 (vst1q_f64_x2): Likewise.
5247 (vst1q_p64_x2): Likewise.
5248 (vst1_s64_x3): Likewise.
5249 (vst1_u64_x3): Likewise.
5250 (vst1_f64_x3): Likewise.
5251 (vst1_s8_x3): Likewise.
5252 (vst1_p8_x3): Likewise.
5253 (vst1_s16_x3): Likewise.
5254 (vst1_p16_x3): Likewise.
5255 (vst1_s32_x3): Likewise.
5256 (vst1_u8_x3): Likewise.
5257 (vst1_u16_x3): Likewise.
5258 (vst1_u32_x3): Likewise.
5259 (vst1_f16_x3): Likewise.
5260 (vst1_f32_x3): Likewise.
5261 (vst1_p64_x3): Likewise.
5262 (vst1q_s8_x3): Likewise.
5263 (vst1q_p8_x3): Likewise.
5264 (vst1q_s16_x3): Likewise.
5265 (vst1q_p16_x3): Likewise.
5266 (vst1q_s32_x3): Likewise.
5267 (vst1q_s64_x3): Likewise.
5268 (vst1q_u8_x3): Likewise.
5269 (vst1q_u16_x3): Likewise.
5270 (vst1q_u32_x3): Likewise.
5271 (vst1q_u64_x3): Likewise.
5272 (vst1q_f16_x3): Likewise.
5273 (vst1q_f32_x3): Likewise.
5274 (vst1q_f64_x3): Likewise.
5275 (vst1q_p64_x3): Likewise.
5276 (vst1_s8_x4): Likewise.
5277 (vst1q_s8_x4): Likewise.
5278 (vst1_s16_x4): Likewise.
5279 (vst1q_s16_x4): Likewise.
5280 (vst1_s32_x4): Likewise.
5281 (vst1q_s32_x4): Likewise.
5282 (vst1_u8_x4): Likewise.
5283 (vst1q_u8_x4): Likewise.
5284 (vst1_u16_x4): Likewise.
5285 (vst1q_u16_x4): Likewise.
5286 (vst1_u32_x4): Likewise.
5287 (vst1q_u32_x4): Likewise.
5288 (vst1_f16_x4): Likewise.
5289 (vst1q_f16_x4): Likewise.
5290 (vst1_f32_x4): Likewise.
5291 (vst1q_f32_x4): Likewise.
5292 (vst1_p8_x4): Likewise.
5293 (vst1q_p8_x4): Likewise.
5294 (vst1_p16_x4): Likewise.
5295 (vst1q_p16_x4): Likewise.
5296 (vst1_s64_x4): Likewise.
5297 (vst1_u64_x4): Likewise.
5298 (vst1_p64_x4): Likewise.
5299 (vst1q_s64_x4): Likewise.
5300 (vst1q_u64_x4): Likewise.
5301 (vst1q_p64_x4): Likewise.
5302 (vst1_f64_x4): Likewise.
5303 (vst1q_f64_x4): Likewise.
5304 (vst2_s64): Likewise.
5305 (vst2_u64): Likewise.
5306 (vst2_f64): Likewise.
5307 (vst2_s8): Likewise.
5308 (vst2_p8): Likewise.
5309 (vst2_s16): Likewise.
5310 (vst2_p16): Likewise.
5311 (vst2_s32): Likewise.
5312 (vst2_u8): Likewise.
5313 (vst2_u16): Likewise.
5314 (vst2_u32): Likewise.
5315 (vst2_f16): Likewise.
5316 (vst2_f32): Likewise.
5317 (vst2_p64): Likewise.
5318 (vst2q_s8): Likewise.
5319 (vst2q_p8): Likewise.
5320 (vst2q_s16): Likewise.
5321 (vst2q_p16): Likewise.
5322 (vst2q_s32): Likewise.
5323 (vst2q_s64): Likewise.
5324 (vst2q_u8): Likewise.
5325 (vst2q_u16): Likewise.
5326 (vst2q_u32): Likewise.
5327 (vst2q_u64): Likewise.
5328 (vst2q_f16): Likewise.
5329 (vst2q_f32): Likewise.
5330 (vst2q_f64): Likewise.
5331 (vst2q_p64): Likewise.
5332 (vst3_s64): Likewise.
5333 (vst3_u64): Likewise.
5334 (vst3_f64): Likewise.
5335 (vst3_s8): Likewise.
5336 (vst3_p8): Likewise.
5337 (vst3_s16): Likewise.
5338 (vst3_p16): Likewise.
5339 (vst3_s32): Likewise.
5340 (vst3_u8): Likewise.
5341 (vst3_u16): Likewise.
5342 (vst3_u32): Likewise.
5343 (vst3_f16): Likewise.
5344 (vst3_f32): Likewise.
5345 (vst3_p64): Likewise.
5346 (vst3q_s8): Likewise.
5347 (vst3q_p8): Likewise.
5348 (vst3q_s16): Likewise.
5349 (vst3q_p16): Likewise.
5350 (vst3q_s32): Likewise.
5351 (vst3q_s64): Likewise.
5352 (vst3q_u8): Likewise.
5353 (vst3q_u16): Likewise.
5354 (vst3q_u32): Likewise.
5355 (vst3q_u64): Likewise.
5356 (vst3q_f16): Likewise.
5357 (vst3q_f32): Likewise.
5358 (vst3q_f64): Likewise.
5359 (vst3q_p64): Likewise.
5360 (vst4_s64): Likewise.
5361 (vst4_u64): Likewise.
5362 (vst4_f64): Likewise.
5363 (vst4_s8): Likewise.
5364 (vst4_p8): Likewise.
5365 (vst4_s16): Likewise.
5366 (vst4_p16): Likewise.
5367 (vst4_s32): Likewise.
5368 (vst4_u8): Likewise.
5369 (vst4_u16): Likewise.
5370 (vst4_u32): Likewise.
5371 (vst4_f16): Likewise.
5372 (vst4_f32): Likewise.
5373 (vst4_p64): Likewise.
5374 (vst4q_s8): Likewise.
5375 (vst4q_p8): Likewise.
5376 (vst4q_s16): Likewise.
5377 (vst4q_p16): Likewise.
5378 (vst4q_s32): Likewise.
5379 (vst4q_s64): Likewise.
5380 (vst4q_u8): Likewise.
5381 (vst4q_u16): Likewise.
5382 (vst4q_u32): Likewise.
5383 (vst4q_u64): Likewise.
5384 (vst4q_f16): Likewise.
5385 (vst4q_f32): Likewise.
5386 (vst4q_f64): Likewise.
5387 (vst4q_p64): Likewise.
5388 (vtbx4_s8): Likewise.
5389 (vtbx4_u8): Likewise.
5390 (vtbx4_p8): Likewise.
5391 (vld1_bf16_x2): Likewise.
5392 (vld1q_bf16_x2): Likewise.
5393 (vld1_bf16_x3): Likewise.
5394 (vld1q_bf16_x3): Likewise.
5395 (vld1_bf16_x4): Likewise.
5396 (vld1q_bf16_x4): Likewise.
5397 (vld2_bf16): Likewise.
5398 (vld2q_bf16): Likewise.
5399 (vld2_dup_bf16): Likewise.
5400 (vld2q_dup_bf16): Likewise.
5401 (vld3_bf16): Likewise.
5402 (vld3q_bf16): Likewise.
5403 (vld3_dup_bf16): Likewise.
5404 (vld3q_dup_bf16): Likewise.
5405 (vld4_bf16): Likewise.
5406 (vld4q_bf16): Likewise.
5407 (vld4_dup_bf16): Likewise.
5408 (vld4q_dup_bf16): Likewise.
5409 (vst1_bf16_x2): Likewise.
5410 (vst1q_bf16_x2): Likewise.
5411 (vst1_bf16_x3): Likewise.
5412 (vst1q_bf16_x3): Likewise.
5413 (vst1_bf16_x4): Likewise.
5414 (vst1q_bf16_x4): Likewise.
5415 (vst2_bf16): Likewise.
5416 (vst2q_bf16): Likewise.
5417 (vst3_bf16): Likewise.
5418 (vst3q_bf16): Likewise.
5419 (vst4_bf16): Likewise.
5420 (vst4q_bf16): Likewise.
5421 (vld2_lane_bf16): Likewise.
5422 (vld2q_lane_bf16): Likewise.
5423 (vld3_lane_bf16): Likewise.
5424 (vld3q_lane_bf16): Likewise.
5425 (vld4_lane_bf16): Likewise.
5426 (vld4q_lane_bf16): Likewise.
5427 (vst2_lane_bf16): Likewise.
5428 (vst2q_lane_bf16): Likewise.
5429 (vst3_lane_bf16): Likewise.
5430 (vst3q_lane_bf16): Likewise.
5431 (vst4_lane_bf16): Likewise.
5432 (vst4q_lane_bf16): Likewise.
5433 * config/aarch64/geniterators.sh: Modify iterator regex to
5434 match new vector-tuple modes.
5435 * config/aarch64/iterators.md (insn_count): Extend mode
5436 attribute with vector-tuple type information.
5438 (Vendreg): Likewise.
5441 (VSTRUCT_2D): New mode iterator.
5442 (VSTRUCT_2DNX): Likewise.
5443 (VSTRUCT_2DX): Likewise.
5444 (VSTRUCT_2Q): Likewise.
5445 (VSTRUCT_2QD): Likewise.
5446 (VSTRUCT_3D): Likewise.
5447 (VSTRUCT_3DNX): Likewise.
5448 (VSTRUCT_3DX): Likewise.
5449 (VSTRUCT_3Q): Likewise.
5450 (VSTRUCT_3QD): Likewise.
5451 (VSTRUCT_4D): Likewise.
5452 (VSTRUCT_4DNX): Likewise.
5453 (VSTRUCT_4DX): Likewise.
5454 (VSTRUCT_4Q): Likewise.
5455 (VSTRUCT_4QD): Likewise.
5456 (VSTRUCT_D): Likewise.
5457 (VSTRUCT_Q): Likewise.
5458 (VSTRUCT_QD): Likewise.
5459 (VSTRUCT_ELT): New mode attribute.
5460 (vstruct_elt): Likewise.
5461 * genmodes.c (VECTOR_MODE): Add default prefix and order
5463 (VECTOR_MODE_WITH_PREFIX): Define.
5464 (make_vector_mode): Add mode prefix and order parameters.
5466 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5468 * expmed.c (extract_bit_field_1): Ensure modes are tieable.
5470 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5472 * expr.c (emit_group_load_1): Remove historic workaround.
5474 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5476 * config/aarch64/aarch64-builtins.c (aarch64_init_simd_builtins):
5477 Factor out main loop to...
5478 (aarch64_init_simd_builtin_functions): This new function.
5479 (register_tuple_type): Define.
5480 (aarch64_scalar_builtin_type_p): Define.
5481 (handle_arm_neon_h): Define.
5482 * config/aarch64/aarch64-c.c (aarch64_pragma_aarch64): Handle
5483 pragma for arm_neon.h.
5484 * config/aarch64/aarch64-protos.h (aarch64_advsimd_struct_mode_p):
5486 (handle_arm_neon_h): Likewise.
5487 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
5488 Remove static modifier.
5489 * config/aarch64/arm_neon.h (target): Remove Neon vector
5490 structure type definitions.
5492 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5494 PR tree-optimization/102943
5495 * gimple-range-path.cc (path_range_query::range_on_path_entry):
5496 Prefer range_of_expr unless there are no statements in the BB.
5498 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5500 PR tree-optimization/102943
5501 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
5502 Avoid duplicate calculation of paths.
5504 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5506 PR tree-optimization/102943
5507 * gimple-range-path.cc (path_range_query::compute_phi_relations):
5508 Only compute relations for SSA names in the import list.
5509 (path_range_query::compute_outgoing_relations): Same.
5510 * gimple-range-path.h (path_range_query::import_p): New.
5512 2021-11-04 Richard Biener <rguenther@suse.de>
5514 PR rtl-optimization/103075
5515 * simplify-rtx.c (exact_int_to_float_conversion_p): Return
5516 false for a VOIDmode operand.
5518 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5520 * config/aarch64/aarch64.c (aarch64_vector_costs): Make member
5521 variables private and add "m_" to their names. Remove is_loop.
5522 (aarch64_record_potential_advsimd_unrolling): Replace with...
5523 (aarch64_vector_costs::record_potential_advsimd_unrolling): ...this.
5524 (aarch64_analyze_loop_vinfo): Replace with...
5525 (aarch64_vector_costs::analyze_loop_vinfo): ...this.
5526 Move initialization of (m_)vec_flags to add_stmt_cost.
5527 (aarch64_analyze_bb_vinfo): Delete.
5528 (aarch64_count_ops): Replace with...
5529 (aarch64_vector_costs::count_ops): ...this.
5530 (aarch64_vector_costs::add_stmt_cost): Set m_vec_flags,
5531 using m_costing_for_scalar to test whether we're costing
5532 scalar or vector code.
5533 (aarch64_adjust_body_cost_sve): Replace with...
5534 (aarch64_vector_costs::adjust_body_cost_sve): ...this.
5535 (aarch64_adjust_body_cost): Replace with...
5536 (aarch64_vector_costs::adjust_body_cost): ...this.
5537 (aarch64_vector_costs::finish_cost): Use m_vinfo instead of is_loop.
5539 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5541 * target.def (targetm.vectorize.init_cost): Replace with...
5542 (targetm.vectorize.create_costs): ...this.
5543 (targetm.vectorize.add_stmt_cost): Delete.
5544 (targetm.vectorize.finish_cost): Likewise.
5545 (targetm.vectorize.destroy_cost_data): Likewise.
5546 * doc/tm.texi.in (TARGET_VECTORIZE_INIT_COST): Replace with...
5547 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5548 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5549 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5550 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5551 * doc/tm.texi: Regenerate.
5552 * tree-vectorizer.h (vec_info::vec_info): Remove target_cost_data
5554 (vec_info::target_cost_data): Change from a void * to a vector_costs *.
5555 (vector_costs): New class.
5556 (init_cost): Take a vec_info and return a vector_costs.
5557 (dump_stmt_cost): Remove data parameter.
5558 (add_stmt_cost): Replace vinfo and data parameters with a vector_costs.
5559 (add_stmt_costs): Likewise.
5560 (finish_cost): Replace data parameter with a vector_costs.
5561 (destroy_cost_data): Delete.
5562 * tree-vectorizer.c (dump_stmt_cost): Remove data argument and
5564 (vec_info::vec_info): Remove the target_cost_data parameter and
5565 initialize the member variable to null instead.
5566 (vec_info::~vec_info): Delete target_cost_data instead of calling
5568 (vector_costs::add_stmt_cost): New function.
5569 (vector_costs::finish_cost): Likewise.
5570 (vector_costs::record_stmt_cost): Likewise.
5571 (vector_costs::adjust_cost_for_freq): Likewise.
5572 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
5573 call to vec_info::vec_info.
5574 (vect_compute_single_scalar_iteration_cost): Update after above
5575 changes to costing interface.
5576 (vect_analyze_loop_operations): Likewise.
5577 (vect_estimate_min_profitable_iters): Likewise.
5578 (vect_analyze_loop_2): Initialize LOOP_VINFO_TARGET_COST_DATA
5579 at the start_over point, where it needs to be recreated after
5580 trying without slp. Update retry code accordingly.
5581 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Update call
5582 to vec_info::vec_info.
5583 (vect_slp_analyze_operation): Update after above changes to costing
5585 (vect_bb_vectorization_profitable_p): Likewise.
5586 * targhooks.h (default_init_cost): Replace with...
5587 (default_vectorize_create_costs): ...this.
5588 (default_add_stmt_cost): Delete.
5589 (default_finish_cost, default_destroy_cost_data): Likewise.
5590 * targhooks.c (default_init_cost): Replace with...
5591 (default_vectorize_create_costs): ...this.
5592 (default_add_stmt_cost): Delete, moving logic to vector_costs instead.
5593 (default_finish_cost, default_destroy_cost_data): Delete.
5594 * config/aarch64/aarch64.c (aarch64_vector_costs): Inherit from
5595 vector_costs. Add a constructor.
5596 (aarch64_init_cost): Replace with...
5597 (aarch64_vectorize_create_costs): ...this.
5598 (aarch64_add_stmt_cost): Replace with...
5599 (aarch64_vector_costs::add_stmt_cost): ...this. Use record_stmt_cost
5600 to adjust the cost for inner loops.
5601 (aarch64_finish_cost): Replace with...
5602 (aarch64_vector_costs::finish_cost): ...this.
5603 (aarch64_destroy_cost_data): Delete.
5604 (TARGET_VECTORIZE_INIT_COST): Replace with...
5605 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5606 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5607 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5608 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5609 * config/i386/i386.c (ix86_vector_costs): New structure.
5610 (ix86_init_cost): Replace with...
5611 (ix86_vectorize_create_costs): ...this.
5612 (ix86_add_stmt_cost): Replace with...
5613 (ix86_vector_costs::add_stmt_cost): ...this. Use adjust_cost_for_freq
5614 to adjust the cost for inner loops.
5615 (ix86_finish_cost, ix86_destroy_cost_data): Delete.
5616 (TARGET_VECTORIZE_INIT_COST): Replace with...
5617 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5618 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5619 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5620 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5621 * config/rs6000/rs6000.c (TARGET_VECTORIZE_INIT_COST): Replace with...
5622 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5623 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5624 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5625 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5626 (rs6000_cost_data): Inherit from vector_costs.
5627 Add a constructor. Drop loop_info, cost and costing_for_scalar
5628 in favor of the corresponding vector_costs member variables.
5629 Add "m_" to the names of the remaining member variables and
5631 (rs6000_density_test): Replace with...
5632 (rs6000_cost_data::density_test): ...this.
5633 (rs6000_init_cost): Replace with...
5634 (rs6000_vectorize_create_costs): ...this.
5635 (rs6000_update_target_cost_per_stmt): Replace with...
5636 (rs6000_cost_data::update_target_cost_per_stmt): ...this.
5637 (rs6000_add_stmt_cost): Replace with...
5638 (rs6000_cost_data::add_stmt_cost): ...this. Use adjust_cost_for_freq
5639 to adjust the cost for inner loops.
5640 (rs6000_adjust_vect_cost_per_loop): Replace with...
5641 (rs6000_cost_data::adjust_vect_cost_per_loop): ...this.
5642 (rs6000_finish_cost): Replace with...
5643 (rs6000_cost_data::finish_cost): ...this. Group loop code
5644 into a single if statement and pass the loop_vinfo down to
5646 (rs6000_destroy_cost_data): Delete.
5648 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5650 PR tree-optimization/103062
5651 PR tree-optimization/103062
5652 * value-pointer-equiv.cc (ssa_equiv_stack::ssa_equiv_stack):
5653 Increase size of allocation by 1.
5654 (ssa_equiv_stack::push_replacement): Grow as needed.
5655 (ssa_equiv_stack::get_replacement): Same.
5656 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
5657 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Remove delete.
5658 (pointer_equiv_analyzer::set_global_equiv): Grow as needed.
5659 (pointer_equiv_analyzer::get_equiv): Same.
5660 (pointer_equiv_analyzer::get_equiv_expr): Remove const.
5661 * value-pointer-equiv.h (class pointer_equiv_analyzer): Remove
5662 const markers. Use auto_vec instead of tree *.
5664 2021-11-04 Richard Biener <rguenther@suse.de>
5666 * tree-ssa-sccvn.c (vn_nary_op_insert_into): Remove always
5667 true parameter and inline valueization.
5668 (vn_nary_op_lookup_1): Inline valueization from ...
5669 (vn_nary_op_compute_hash): ... here and remove it here.
5670 * tree-ssa-pre.c (phi_translate_1): Do not valueize
5671 before vn_nary_lookup_pieces.
5672 (get_representative_for): Mark created SSA representatives
5675 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5677 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): Assert
5678 that the operand has a vector mode. Use subreg_lowpart_offset
5679 to test whether an index corresponds to the low part.
5681 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5683 * read-rtl.c: Remove dead !GENERATOR_FILE block.
5684 * read-rtl-function.c (function_reader::consolidate_singletons):
5685 Generate canonical CONST_VECTORs.
5687 2021-11-04 liuhongt <hongtao.liu@intel.com>
5690 * config/i386/predicates.md (reg_or_notreg_operand): Rename to ..
5691 (regmem_or_bitnot_regmem_operand): .. and extend to handle
5693 * config/i386/sse.md (*<avx512>_vpternlog<mode>_1): Force_reg
5694 the operands which are required to be register_operand.
5695 (*<avx512>_vpternlog<mode>_2): Ditto.
5696 (*<avx512>_vpternlog<mode>_3): Ditto.
5697 (*<avx512>_vternlog<mode>_all): Disallow embeded broadcast for
5698 vector HFmodes since it's not a real AVX512FP16 instruction.
5700 2021-11-04 liuhongt <hongtao.liu@intel.com>
5703 * match.pd: simplify (trunc)copysign((extend)a, (extend)b) to
5704 .COPYSIGN (a,b) when a and b are same type as the truncation
5705 type and has less precision than extend type.
5707 2021-11-04 Richard Biener <rguenther@suse.de>
5709 * doc/generic.texi: Update TARGET_MEM_REF and MEM_REF
5712 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
5714 * config/i386/sse.md (VI2_AVX512VNNIBW): New mode iterator.
5715 (VI1_AVX512VNNI): Likewise.
5716 (SDOT_VPDP_SUF): New mode_attr.
5719 (sdot_prod<mode>): Use VI2_AVX512F iterator, expand to
5720 vpdpwssd when VNNI targets available.
5721 (usdot_prod<mode>): New expander for vector QImode.
5723 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
5725 * config/i386/amxtileintrin.h (_tile_loadd_internal): Add
5726 parentheses to base and stride.
5727 (_tile_stream_loadd_internal): Likewise.
5728 (_tile_stored_internal): Likewise.
5730 2021-11-03 Maciej W. Rozycki <macro@embecosm.com>
5732 * config/riscv/riscv.c (riscv_class_max_nregs): Swap the
5733 arguments to `reg_class_subset_p'.
5735 2021-11-03 Joseph Myers <joseph@codesourcery.com>
5738 * fold-const.c (fold_init): New function.
5739 * fold-const.h (fold_init): New prototype.
5741 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5743 * params.opt (param_vrp2_mode): Make ranger the default for VRP2.
5745 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5747 * gimple-range-fold.cc (fold_using_range::range_of_phi): Don't import
5748 a range from edge if arg == phidef.
5750 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5752 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Test
5753 for constant before any other processing.
5755 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5757 * flag-types.h (RANGER_DEBUG_ALL): Fix values.
5759 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5761 * gimple-range.cc (gimple_ranger::gimple_ranger): Initialize current_bb.
5762 (gimple_ranger::range_of_expr): Pick up range_on_entry when there is
5763 no explcit context and current_bb is set.
5764 (gimple_ranger::fold_stmt): New.
5765 * gimple-range.h (current_bb, fold_stmt): New.
5766 * tree-vrp.c (rvrp_folder::fold_stmt): Call ranger's fold_stmt.
5768 2021-11-03 Richard Biener <rguenther@suse.de>
5770 PR tree-optimization/102970
5771 * tree-ssa-pre.c (phi_translate_1): Drop clique and base
5772 when translating a MEM_REF over a backedge.
5774 2021-11-03 Philipp Tomsich <philipp.tomsich@vrull.eu>
5776 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New Ampere-1 core.
5777 * config/aarch64/aarch64-tune.md: Regenerate.
5778 * config/aarch64/aarch64-cost-tables.h: Add extra costs for Ampere-1.
5779 * config/aarch64/aarch64.c: Add tuning structures for Ampere-1.
5780 * doc/invoke.texi: Add documentation for Ampere-1 core.
5782 2021-11-03 Wilco Dijkstra <wdijkstr@arm.com>
5784 * config/aarch64/aarch64.md (movsi): Add alternative for GOT accesses.
5786 (ldr_got_small_<mode>): Remove pattern.
5787 (ldr_got_small_sidi): Likewise.
5788 * config/aarch64/aarch64.c (aarch64_load_symref_appropriately): Keep
5789 GOT accesses as moves.
5790 (aarch64_print_operand): Correctly print got_lo12 in L specifier.
5791 (aarch64_mov_operand_p): Make GOT accesses valid move operands.
5792 * config/aarch64/constraints.md: Add new constraint Usw for GOT access.
5794 2021-11-03 Martin Liska <mliska@suse.cz>
5796 * gcov.c (read_line): Remove dead variable.
5798 2021-11-03 Martin Liska <mliska@suse.cz>
5801 * ipa-fnsummary.c (edge_predicate_pool): Rename predicate class to ipa_predicate.
5802 (ipa_fn_summary::account_size_time): Likewise.
5803 (edge_set_predicate): Likewise.
5804 (set_hint_predicate): Likewise.
5805 (add_freqcounting_predicate): Likewise.
5806 (evaluate_conditions_for_known_args): Likewise.
5807 (evaluate_properties_for_edge): Likewise.
5808 (remap_freqcounting_preds_after_dup): Likewise.
5809 (ipa_fn_summary_t::duplicate): Likewise.
5810 (set_cond_stmt_execution_predicate): Likewise.
5811 (set_switch_stmt_execution_predicate): Likewise.
5812 (compute_bb_predicates): Likewise.
5813 (will_be_nonconstant_expr_predicate): Likewise.
5814 (will_be_nonconstant_predicate): Likewise.
5815 (phi_result_unknown_predicate): Likewise.
5816 (predicate_for_phi_result): Likewise.
5817 (analyze_function_body): Likewise.
5818 (compute_fn_summary): Likewise.
5819 (summarize_calls_size_and_time): Likewise.
5820 (estimate_calls_size_and_time): Likewise.
5821 (ipa_call_context::estimate_size_and_time): Likewise.
5822 (remap_edge_summaries): Likewise.
5823 (remap_freqcounting_predicate): Likewise.
5824 (ipa_merge_fn_summary_after_inlining): Likewise.
5825 (ipa_update_overall_fn_summary): Likewise.
5826 (read_ipa_call_summary): Likewise.
5827 (inline_read_section): Likewise.
5828 * ipa-fnsummary.h (struct ipa_freqcounting_predicate): Likewise.
5829 * ipa-predicate.c (predicate::add_clause): Likewise.
5830 (ipa_predicate::add_clause): Likewise.
5831 (predicate::or_with): Likewise.
5832 (ipa_predicate::or_with): Likewise.
5833 (predicate::evaluate): Likewise.
5834 (ipa_predicate::evaluate): Likewise.
5835 (predicate::probability): Likewise.
5836 (ipa_predicate::probability): Likewise.
5837 (dump_condition): Likewise.
5838 (dump_clause): Likewise.
5839 (predicate::dump): Likewise.
5840 (ipa_predicate::dump): Likewise.
5841 (predicate::debug): Likewise.
5842 (ipa_predicate::debug): Likewise.
5843 (predicate::remap_after_duplication): Likewise.
5844 (ipa_predicate::remap_after_duplication): Likewise.
5845 (predicate::remap_after_inlining): Likewise.
5846 (ipa_predicate::remap_after_inlining): Likewise.
5847 (predicate::stream_in): Likewise.
5848 (ipa_predicate::stream_in): Likewise.
5849 (predicate::stream_out): Likewise.
5850 (ipa_predicate::stream_out): Likewise.
5851 (add_condition): Likewise.
5852 * ipa-predicate.h (class predicate): Likewise.
5853 (class ipa_predicate): Likewise.
5854 (add_condition): Likewise.
5856 2021-11-03 Richard Biener <rguenther@suse.de>
5858 * bitmap.h (bitmap_bit_p): Change the return type to bool.
5859 * bitmap.c (bitmap_bit_p): Likewise.
5860 * sbitmap.h (bitmap_bit_p): Likewise.
5861 (bitmap_set_bit): Return whether the bit changed.
5862 (bitmap_clear_bit): Likewise.
5863 * tree-ssa.c (verify_vssa): Make use of the changed state
5864 from bitmap_set_bit.
5866 2021-11-03 Richard Biener <rguenther@suse.de>
5868 PR middle-end/103033
5869 * internal-fn.c (expand_DEFERRED_INIT): Elide the
5870 native_interpret_expr path in favor of folding the
5871 VIEW_CONVERT_EXPR generated when punning the RHS.
5873 2021-11-03 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
5875 * config/s390/s390.c (s390_loop_unroll_adjust): In case of early
5878 2021-11-03 Jan Hubicka <hubicka@ucw.cz>
5881 * ipa-modref.c (callee_to_caller_flags): New function.
5882 (modref_eaf_analysis::analyze_ssa_name): Use it.
5883 (ipa_merge_modref_summary_after_inlining): Fix whitespace.
5885 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5887 * ipa-modref.c (modref_eaf_analysis::analyze_ssa_name): Revert
5890 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
5891 Uroš Bizjak <ubizjak@gmail.com>
5893 * config/i386/i386.md (<any_rotate>ti3): Provide expansion for
5894 rotations by non-constant amounts.
5896 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5898 * ipa-modref.c: Fix anonymous namespace placement.
5899 (class modref_eaf_analysis): New class.
5900 (analyze_ssa_name_flags): Turn to ...
5901 (modref_eaf_analysis::analyze_ssa_name): ... this one.
5902 (merge_call_lhs_flags): Turn to ...
5903 (modref_eaf_analysis::merge_call_lhs_flags): .. this one
5904 (modref_eaf_analysis::merge_with_ssa_name): New member function.
5905 (record_escape_points): Turn to ...
5906 (modref_eaf_analysis::record_escape_points): ... this one.
5907 (analyze_parms): Updat
5908 (ipa_merge_modref_summary_after_inlining): Move to the end of file.
5910 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5912 * gimple.c (gimple_call_static_chain_flags): New function.
5913 * gimple.h (gimple_call_static_chain_flags): Declare
5914 * ipa-modref.c (modref_summary::modref_summary): Initialize
5916 (modref_summary_lto::modref_summary_lto): Likewise.
5917 (modref_summary::useful_p): Test static_chain_flags.
5918 (modref_summary_lto::useful_p): Likewise.
5919 (struct modref_summary_lto): Add static_chain_flags.
5920 (modref_summary::dump): Dump static_chain_flags.
5921 (modref_summary_lto::dump): Likewise.
5922 (struct escape_point): Add static_cahin_arg.
5923 (analyze_ssa_name_flags): Use gimple_call_static_chain_flags.
5924 (analyze_parms): Handle static chains.
5925 (modref_summaries::duplicate): Duplicate static_chain_flags.
5926 (modref_summaries_lto::duplicate): Likewise.
5927 (modref_write): Stream static_chain_flags.
5928 (read_section): Likewise.
5929 (modref_merge_call_site_flags): Handle static_chain_flags.
5930 * ipa-modref.h (struct modref_summary): Add static_chain_flags.
5931 * tree-ssa-structalias.c (handle_rhs_call): Use
5932 gimple_static_chain_flags.
5934 2021-11-02 Richard Biener <rguenther@suse.de>
5936 PR tree-optimization/103029
5937 * tree-vect-loop-manip.c (vect_loop_versioning): Ensure
5938 the PHI nodes in the loop maintain their original operand
5941 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5943 * tree-core.h (EAF_NOT_RETURNED_DIRECTLY): New flag.
5944 (EAF_NOREAD): Renumber.
5945 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOT_RETURNED_DIRECTLY.
5946 (remove_useless_eaf_flags): Handle EAF_NOT_RETURNED_DIRECTLY
5947 (deref_flags): Likewise.
5948 (modref_lattice::init): Likewise.
5949 (modref_lattice::merge): Likewise.
5950 (merge_call_lhs_flags): Likewise.
5951 (analyze_ssa_name_flags): Likewise.
5952 (modref_merge_call_site_flags): Likewise.
5953 * tree-ssa-structalias.c (handle_call_arg): Likewise.
5955 2021-11-02 Maciej W. Rozycki <macro@embecosm.com>
5957 * config/riscv/riscv.c (riscv_rtx_costs): Correct a CONST_INT_P
5958 check and remove an unused local variable with shNadd/shNadd.uw
5961 2021-11-02 Tamar Christina <tamar.christina@arm.com>
5963 PR tree-optimization/103007
5964 * tree-vect-slp-patterns.c (complex_fms_pattern::matches): Add elem
5967 2021-11-02 Richard Biener <rguenther@suse.de>
5969 PR middle-end/103038
5970 * fold-const.c (native_interpret_expr): Handle OFFSET_TYPE.
5971 (can_native_interpret_type_p): Likewise.
5972 * internal-fn.c (expand_DEFERRED_INIT): View-convert the
5973 RHS if the LHS is an SSA name.
5975 2021-11-02 Richard Sandiford <richard.sandiford@arm.com>
5977 * langhooks.h (lang_hooks_for_types::simulate_record_decl): New hook.
5978 * langhooks-def.h (lhd_simulate_record_decl): Declare.
5979 (LANG_HOOKS_SIMULATE_RECORD_DECL): Define.
5980 (LANG_HOOKS_FOR_TYPES_INITIALIZER): Include it.
5981 * langhooks.c (lhd_simulate_record_decl): New function.
5983 2021-11-02 Jakub Jelinek <jakub@redhat.com>
5986 * config/i386/i386.c (ix86_vector_mode_supported_p): Reject vector
5987 modes with TImode inner mode if 32-bit.
5989 2021-11-02 liuhongt <hongtao.liu@intel.com>
5991 * doc/sourcebuild.texi (vect_slp_v4qi_store_unalign,
5992 vect_slp_v2hi_store_unalign, vect_slp_v4hi_store_unalign,
5993 vect_slp_v4si_store_unalign): Document efficient target.
5994 (vect_slp_v4qi_store_unalign_1, vect_slp_v8qi_store_unalign_1,
5995 vect_slp_v16qi_store_unalign_1): Ditto.
5996 (vect_slp_v2hi_store_align,vect_slp_v2qi_store_align,
5997 vect_slp_v2si_store_align, vect_slp_v4qi_store_align): Ditto.
5998 (struct_4char_block_move, struct_8char_block_move,
5999 struct_16char_block_move): Ditto.
6001 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
6002 Jakub Jelinek <jakub@redhat.com>
6005 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti,
6006 ix86_expand_ti_to_v1ti): New helper functions.
6007 (ix86_expand_v1ti_shift): Check if the amount operand is an
6008 integer constant, and expand as a TImode shift if it isn't.
6009 (ix86_expand_v1ti_rotate): Check if the amount operand is an
6010 integer constant, and expand as a TImode rotate if it isn't.
6011 (ix86_expand_v1ti_ashiftrt): New function to expand arithmetic
6012 right shifts of V1TImode quantities.
6013 * config/i386/i386-protos.h (ix86_expand_v1ti_ashift): Prototype.
6014 * config/i386/sse.md (ashlv1ti3, lshrv1ti3): Change constraints
6015 to QImode general_operand, and let the helper functions lower
6016 shifts by non-constant operands, as TImode shifts. Make
6017 conditional on TARGET_64BIT.
6018 (ashrv1ti3): New expander calling ix86_expand_v1ti_ashiftrt.
6019 (rotlv1ti3, rotrv1ti3): Change shift operand to QImode.
6020 Make conditional on TARGET_64BIT.
6022 2021-11-02 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
6024 * config/s390/s390.md ("*cc_to_int", "tabort", "*tabort_1",
6025 "*tabort_1_plus"): Remove operands non-null check.
6027 2021-11-01 Martin Liska <mliska@suse.cz>
6029 * opt-functions.awk: Add new sanity checking.
6030 * optc-gen.awk: Add new argument to integer_range_info.
6031 * params.opt: Update 2 params which have negative IntegerRange.
6033 2021-11-01 qing zhao <qing.zhao@oracle.com>
6035 * gimplify.c (gimplify_decl_expr): Do not add call to
6036 __builtin_clear_padding when a variable is a gimple register
6037 or it might not have padding.
6038 (gimplify_init_constructor): Likewise.
6040 2021-11-01 Tamar Christina <tamar.christina@arm.com>
6042 * config/arm/aarch-common-protos.h (struct vector_cost_table): Add
6043 movi, dup and extract costing fields.
6044 * config/aarch64/aarch64-cost-tables.h (qdf24xx_extra_costs,
6045 thunderx_extra_costs, thunderx2t99_extra_costs,
6046 thunderx3t110_extra_costs, tsv110_extra_costs, a64fx_extra_costs): Use
6048 * config/arm/aarch-cost-tables.h (generic_extra_costs,
6049 cortexa53_extra_costs, cortexa57_extra_costs, cortexa76_extra_costs,
6050 exynosm1_extra_costs, xgene1_extra_costs): Likewise
6051 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>): Add r->w dup.
6052 * config/aarch64/aarch64.c (aarch64_rtx_costs): Add extra costs.
6054 2021-11-01 Tamar Christina <tamar.christina@arm.com>
6056 * cse.c (add_to_set): New.
6057 (find_sets_in_insn): Register constants in sets.
6058 (canonicalize_insn): Use auto_vec instead.
6059 (cse_insn): Try materializing using vec_dup.
6060 * rtl.h (simplify_context::simplify_gen_vec_select,
6061 simplify_gen_vec_select): New.
6062 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): New.
6064 2021-11-01 David Malcolm <dmalcolm@redhat.com>
6066 * common.opt (fdiagnostics-escape-format=): New.
6067 (diagnostics_escape_format): New enum.
6068 (DIAGNOSTICS_ESCAPE_FORMAT_UNICODE): New enum value.
6069 (DIAGNOSTICS_ESCAPE_FORMAT_BYTES): Likewise.
6070 * diagnostic-format-json.cc (json_end_diagnostic): Add
6071 "escape-source" attribute.
6072 * diagnostic-show-locus.c
6073 (exploc_with_display_col::exploc_with_display_col): Replace
6074 "tabstop" param with a cpp_char_column_policy and add an "aspect"
6075 param. Use these to compute m_display_col accordingly.
6076 (struct char_display_policy): New struct.
6077 (layout::m_policy): New field.
6078 (layout::m_escape_on_output): New field.
6079 (def_policy): New function.
6080 (make_range): Update for changes to exploc_with_display_col ctor.
6081 (default_print_decoded_ch): New.
6082 (width_per_escaped_byte): New.
6083 (escape_as_bytes_width): New.
6084 (escape_as_bytes_print): New.
6085 (escape_as_unicode_width): New.
6086 (escape_as_unicode_print): New.
6088 (layout::layout): Initialize new fields. Update m_exploc ctor
6089 call for above change to ctor.
6090 (layout::maybe_add_location_range): Update for changes to
6091 exploc_with_display_col ctor.
6092 (layout::calculate_x_offset_display): Update for change to
6094 (layout::print_source_line): Pass policy
6095 to cpp_display_width_computation. Capture cpp_decoded_char when
6096 calling process_next_codepoint. Move printing of source code to
6097 m_policy.m_print_cb.
6098 (line_label::line_label): Pass in policy rather than context.
6099 (layout::print_any_labels): Update for change to line_label ctor.
6100 (get_affected_range): Pass in policy rather than context, updating
6101 calls to location_compute_display_column accordingly.
6102 (get_printed_columns): Likewise, also for cpp_display_width.
6103 (correction::correction): Pass in policy rather than tabstop.
6104 (correction::compute_display_cols): Pass m_policy rather than
6105 m_tabstop to cpp_display_width.
6106 (correction::m_tabstop): Replace with...
6107 (correction::m_policy): ...this.
6108 (line_corrections::line_corrections): Pass in policy rather than
6110 (line_corrections::m_context): Replace with...
6111 (line_corrections::m_policy): ...this.
6112 (line_corrections::add_hint): Update to use m_policy rather than
6114 (line_corrections::add_hint): Likewise.
6115 (layout::print_trailing_fixits): Likewise.
6116 (selftest::test_display_widths): New.
6117 (selftest::test_layout_x_offset_display_utf8): Update to use
6118 policy rather than tabstop.
6119 (selftest::test_one_liner_labels_utf8): Add test of escaping
6121 (selftest::test_diagnostic_show_locus_one_liner_utf8): Update to
6122 use policy rather than tabstop.
6123 (selftest::test_overlapped_fixit_printing): Likewise.
6124 (selftest::test_overlapped_fixit_printing_utf8): Likewise.
6125 (selftest::test_overlapped_fixit_printing_2): Likewise.
6126 (selftest::test_tab_expansion): Likewise.
6127 (selftest::test_escaping_bytes_1): New.
6128 (selftest::test_escaping_bytes_2): New.
6129 (selftest::diagnostic_show_locus_c_tests): Call the new tests.
6130 * diagnostic.c (diagnostic_initialize): Initialize
6131 context->escape_format.
6132 (convert_column_unit): Update to use default character width policy.
6133 (selftest::test_diagnostic_get_location_text): Likewise.
6134 * diagnostic.h (enum diagnostics_escape_format): New enum.
6135 (diagnostic_context::escape_format): New field.
6136 * doc/invoke.texi (-fdiagnostics-escape-format=): New option.
6137 (-fdiagnostics-format=): Add "escape-source" attribute to examples
6138 of JSON output, and document it.
6139 * input.c (location_compute_display_column): Pass in "policy"
6140 rather than "tabstop", passing to
6141 cpp_byte_column_to_display_column.
6142 (selftest::test_cpp_utf8): Update to use cpp_char_column_policy.
6143 * input.h (class cpp_char_column_policy): New forward decl.
6144 (location_compute_display_column): Pass in "policy" rather than
6146 * opts.c (common_handle_option): Handle
6147 OPT_fdiagnostics_escape_format_.
6148 * selftest.c (temp_source_file::temp_source_file): New ctor
6149 overload taking a size_t.
6150 * selftest.h (temp_source_file::temp_source_file): Likewise.
6152 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
6154 * dbgcnt.def: Add debug counter for back_thread[12] and
6155 back_threadfull[12].
6156 * passes.def: Pass "first" argument to each back threading pass.
6157 * tree-ssa-threadbackward.c (back_threader::back_threader): Add
6159 (back_threader::debug_counter): New.
6160 (back_threader::maybe_register_path): Call debug_counter.
6162 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
6164 * tree-ssa-threadbackward.c (BT_NONE): New.
6167 (back_threader::back_threader): Add flags.
6168 Move loop initialization here.
6169 (back_threader::~back_threader): New.
6170 (back_threader::find_taken_edge_switch): Change solver and ranger
6172 (back_threader::find_taken_edge_cond): Same.
6173 (back_threader::find_paths_to_names): Same.
6174 (back_threader::find_paths): Same.
6175 (back_threader::dump): Same.
6176 (try_thread_blocks): Merge into thread_blocks.
6177 (back_threader::thread_blocks): New.
6178 (do_early_thread_jumps): Merge into thread_blocks.
6179 (do_thread_jumps): Merge into thread_blocks.
6180 (back_threader::thread_through_all_blocks): Remove.
6182 2021-11-01 Andrew MacLeod <amacleod@redhat.com>
6184 PR tree-optimization/103003
6185 * value-relation.cc (dom_oracle::register_relation): If the 2
6186 ssa names are the same, don't register any relation.
6188 2021-11-01 Dan Li <ashimida@linux.alibaba.com>
6190 * config/aarch64/aarch64.c (aarch64_expand_epilogue): Remove
6191 redundant check for calls_eh_return.
6192 * config/aarch64/aarch64.md (*do_return): Likewise.
6194 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
6196 * cfghooks.c (cfg_hook_duplicate_loop_to_header_edge): Rename
6197 duplicate_loop_to_header_edge to
6198 duplicate_loop_body_to_header_edge.
6199 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
6200 * cfghooks.h (struct cfg_hooks): Likewise.
6201 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
6202 * cfgloopmanip.c (duplicate_loop_body_to_header_edge): Likewise.
6203 (clone_loop_to_header_edge): Likewise.
6204 * cfgloopmanip.h (duplicate_loop_body_to_header_edge): Likewise.
6205 * cfgrtl.c (struct cfg_hooks): Likewise.
6206 * doc/loop.texi: Likewise.
6207 * loop-unroll.c (unroll_loop_constant_iterations): Likewise.
6208 (unroll_loop_runtime_iterations): Likewise.
6209 (unroll_loop_stupid): Likewise.
6210 (apply_opt_in_copies): Likewise.
6211 * tree-cfg.c (struct cfg_hooks): Likewise.
6212 * tree-ssa-loop-ivcanon.c (try_unroll_loop_completely): Likewise.
6213 (try_peel_loop): Likewise.
6214 * tree-ssa-loop-manip.c (copy_phi_node_args): Likewise.
6215 (gimple_duplicate_loop_body_to_header_edge): Likewise.
6216 (tree_transform_and_unroll_loop): Likewise.
6217 * tree-ssa-loop-manip.h (gimple_duplicate_loop_body_to_header_edge):
6220 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
6222 * cfgloopmanip.c (loop_version): Refactor loopify to
6223 loop_version. Move condition generation after loopify.
6225 * cfgloopmanip.h (loopify): Delete.
6227 2021-10-31 Jan Hubicka <hubicka@ucw.cz>
6229 * ipa-fnsummary.c: Include tree-dfa.h.
6230 (points_to_local_or_readonly_memory_p): Return true on return
6232 * ipa-modref.c (analyze_ssa_name_flags): Fix handling of copy
6235 2021-10-30 Tobias Burnus <tobias@codesourcery.com>
6237 PR middle-end/102972
6238 * omp-low.c (omp_runtime_api_call): Use DECL_ASSEMBLER_NAME to get
6239 internal Fortran name; new permit_num_teams arg to permit
6240 omp_get_num_teams and omp_get_team_num.
6241 (scan_omp_1_stmt): Update call to it, add missing call for
6242 reverse offload, and check for strictly nested API calls in teams.
6244 2021-10-30 Jakub Jelinek <jakub@redhat.com>
6246 * gimplify.c (gimplify_omp_for): Diagnose threadprivate iterators.
6248 2021-10-29 David Malcolm <dmalcolm@redhat.com>
6250 * selftest.c (assert_streq): Add newlines when emitting non-equal
6253 2021-10-29 David Malcolm <dmalcolm@redhat.com>
6255 * Makefile.in: Fix syntax for reference to LIBDEPS in
6258 2021-10-29 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
6260 * doc/install.texi: Bump required minimum DejaGnu version.
6262 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
6263 Andrew MacLeod <amacleod@redhat.com>
6265 * value-relation.cc (path_oracle::killing_def): Add a
6266 self-equivalence so we don't look to the root oracle.
6268 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
6270 * passes.def: Replace the pass_thread_jumps before VRP* with
6271 pass_thread_jumps_full. Remove all pass_vrp_threader instances.
6272 * tree-ssa-threadbackward.c (pass_data_thread_jumps_full):
6273 Remove hyphen from "thread-full" name.
6275 2021-10-29 Andrew MacLeod <amacleod@redhat.com>
6277 PR tree-optimization/102983
6278 * gimple-range-cache.h (propagate_updated_value): Make public.
6279 * gimple-range.cc (gimple_ranger::range_of_stmt): Propagate exports
6280 when processing gcond stmts.
6282 2021-10-29 Jan Hubicka <hubicka@ucw.cz>
6284 * gimple.c (gimple_call_retslot_flags): New function.
6285 * gimple.h (gimple_call_retslot_flags): Declare.
6286 * ipa-modref.c: Include tree-cfg.h.
6287 (struct escape_entry): Turn parm_index to signed.
6288 (modref_summary_lto::modref_summary_lto): Add retslot_flags.
6289 (modref_summary::modref_summary): Initialize retslot_flags.
6290 (struct modref_summary_lto): Likewise.
6291 (modref_summary::useful_p): Check retslot_flags.
6292 (modref_summary_lto::useful_p): Likewise.
6293 (modref_summary::dump): Dump retslot_flags.
6294 (modref_summary_lto::dump): Likewise.
6295 (struct escape_point): Add hidden_args enum.
6296 (analyze_ssa_name_flags): Ignore return slot return;
6297 use gimple_call_retslot_flags.
6298 (record_escape_points): Break out from ...
6299 (analyze_parms): ... here; handle retslot_flags.
6300 (modref_summaries::duplicate): Duplicate retslot_flags.
6301 (modref_summaries_lto::duplicate): Likewise.
6302 (modref_write_escape_summary): Stream parm_index as signed.
6303 (modref_read_escape_summary): Likewise.
6304 (modref_write): Stream retslot_flags.
6305 (read_section): Likewise.
6306 (struct escape_map): Fix typo in comment.
6307 (update_escape_summary_1): Fix whitespace.
6308 (ipa_merge_modref_summary_after_inlining): Drop retslot_flags.
6309 (modref_merge_call_site_flags): Merge retslot_flags.
6310 * ipa-modref.h (struct modref_summary): Add retslot_flags.
6311 * tree-ssa-structalias.c (handle_rhs_call): Handle retslot_flags.
6313 2021-10-29 Tamar Christina <tamar.christina@arm.com>
6315 PR tree-optimization/102977
6316 * tree-vect-slp-patterns.c (vect_match_call_p): Remove.
6317 (vect_detect_pair_op): Add crosslane check.
6318 (vect_match_call_complex_mla): Remove.
6319 (class complex_mul_pattern): Update comment.
6320 (complex_mul_pattern::matches): Update detection.
6321 (class complex_fma_pattern): Remove.
6322 (complex_fma_pattern::matches): Remove.
6323 (complex_fma_pattern::recognize): Remove.
6324 (complex_fma_pattern::build): Remove.
6325 (class complex_fms_pattern): Update comment.
6326 (complex_fms_pattern::matches): Remove.
6327 (complex_operations_pattern::recognize): Remove complex_fma_pattern
6329 2021-10-29 Jakub Jelinek <jakub@redhat.com>
6331 * gimple-fold.c (gimple_fold_builtin_memset): Copy over location from
6334 2021-10-29 Xionghu Luo <luoxhu@linux.ibm.com>
6337 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const): Add
6338 patterns match and emit for VSX xxpermdi.
6340 2021-10-29 liuhongt <hongtao.liu@intel.com>
6343 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF): New
6345 (V16HF_FTYPE_V16HF): Ditto.
6346 (V32HF_FTYPE_V32HF): Ditto.
6347 (V8HF_FTYPE_V8HF_ROUND): Ditto.
6348 (V16HF_FTYPE_V16HF_ROUND): Ditto.
6349 (V32HF_FTYPE_V32HF_ROUND): Ditto.
6350 * config/i386/i386-builtin.def ( IX86_BUILTIN_FLOORPH,
6351 IX86_BUILTIN_CEILPH, IX86_BUILTIN_TRUNCPH,
6352 IX86_BUILTIN_FLOORPH256, IX86_BUILTIN_CEILPH256,
6353 IX86_BUILTIN_TRUNCPH256, IX86_BUILTIN_FLOORPH512,
6354 IX86_BUILTIN_CEILPH512, IX86_BUILTIN_TRUNCPH512): New builtin.
6355 * config/i386/i386-builtins.c
6356 (ix86_builtin_vectorized_function): Enable vectorization for
6357 HFmode FLOOR/CEIL/TRUNC operation.
6358 * config/i386/i386-expand.c (ix86_expand_args_builtin): Handle
6360 * config/i386/sse.md (rint<mode>2, nearbyint<mode>2): Extend
6363 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6364 Andrew MacLeod <amacleod@redhat.com>
6366 * value-relation.cc (path_oracle::killing_def): Walk the
6367 equivalency list and remove SSA from any equivalencies.
6369 2021-10-28 Stafford Horne <shorne@gmail.com>
6371 * config/or1k/or1k.h (PROFILE_HOOK): Add return address argument
6374 2021-10-28 Jakub Jelinek <jakub@redhat.com>
6376 PR tree-optimization/102951
6377 * fold-const.h (address_compare): Declare.
6378 * fold-const.c (address_compare): New function.
6379 * match.pd (cmp (convert1?@2 addr@0) (convert2? addr@1)): Use
6380 address_compare helper.
6381 (minmax cmp (convert1?@2 addr@0) (convert2?@3 addr@1)): New
6384 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
6386 * vr-values.c (simplify_using_ranges::fold_cond): Change fold message.
6388 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
6390 PR tree-optimization/102940
6391 * tree-vrp.c (execute_ranger_vrp): Reset scev.
6393 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
6395 * config/nios2/linux.h (MUSL_DYNAMIC_LINKER): Add musl linker
6397 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
6399 * configure: Regenerate.
6400 * configure.ac: Use CPPFLAGS_FOR_BUILD for GMPINC
6402 2021-10-28 Eric Botcazou <ebotcazou@adacore.com>
6404 * doc/invoke.texi (%X): Remove obsolete reference to -Wl.
6406 2021-10-28 Richard Biener <rguenther@suse.de>
6409 * fold-const.c (fold_convert_const): Avoid int to float
6410 constant folding with -frounding-math and inexact result.
6411 * simplify-rtx.c (simplify_const_unary_operation): Likewise
6412 for both float and unsigned_float.
6414 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6416 * tree-ssa-threadbackward.c
6417 (back_threader::find_taken_edge_switch): Use find_case_label_range
6418 instead of find_taken_edge.
6420 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6422 * tree-ssa-threadbackward.c (class back_threader_registry):
6423 Inherit from back_jt_path_registry.
6424 (back_threader_registry::thread_through_all_blocks): Remove.
6425 (back_threader_registry::register_path): Remove
6426 m_lowlevel_registry prefix.
6428 2021-10-28 Richard Biener <rguenther@suse.de>
6431 * fold-const.c (fold_convert_const_real_from_real): Honor
6432 -frounding-math if the conversion is not exact.
6433 * simplify-rtx.c (simplify_const_unary_operation): Do not
6434 simplify FLOAT_TRUNCATE with sign dependent rounding.
6436 2021-10-28 Richard Biener <rguenther@suse.de>
6438 PR tree-optimization/102949
6439 * tree-vect-stmts.c (ensure_base_align): Look at the
6440 dr_info of a group leader and assert we are looking at
6441 one with analyzed alignment.
6443 2021-10-28 Kewen Lin <linkw@linux.ibm.com>
6446 * config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Consider
6447 V1T1 mode for unaligned load and store.
6449 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
6451 * config/riscv/riscv.md (zero_extendsidi2_internal): Allow ZBB
6454 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
6456 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add `i` to
6459 2021-10-28 Alexandre Oliva <oliva@adacore.com>
6461 * common.opt (fharden-compares): New.
6462 (fharden-conditional-branches): New.
6463 * doc/invoke.texi: Document new options.
6464 * gimple-harden-conditionals.cc: New.
6465 * Makefile.in (OBJS): Build it.
6466 * passes.def: Add new passes.
6467 * tree-pass.h (make_pass_harden_compares): Declare.
6468 (make_pass_harden_conditional_branches): Declare.
6470 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
6473 * config/rs6000/altivec.md: Add vsx register constraints.
6474 * config/rs6000/vsx.md (vsx_xxsel<mode>): Delete.
6475 (vsx_xxsel<mode>2): Likewise.
6476 (vsx_xxsel<mode>3): Likewise.
6477 (vsx_xxsel<mode>4): Likewise.
6479 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
6482 * config/rs6000/altivec.md (*altivec_vsel<mode>): Change to ...
6483 (altivec_vsel<mode>): ... this and update define.
6484 (*altivec_vsel<mode>_uns): Delete.
6485 (altivec_vsel<mode>2): New define_insn.
6486 (altivec_vsel<mode>3): Likewise.
6487 (altivec_vsel<mode>4): Likewise.
6488 * config/rs6000/rs6000-call.c (altivec_expand_vec_sel_builtin): New.
6489 (altivec_expand_builtin): Call altivec_expand_vec_sel_builtin to expand
6491 * config/rs6000/rs6000.c (rs6000_emit_vector_cond_expr): Use bit-wise
6492 selection instead of per element.
6493 * config/rs6000/vector.md:
6494 * config/rs6000/vsx.md (*vsx_xxsel<mode>): Change to ...
6495 (vsx_xxsel<mode>): ... this and update define.
6496 (*vsx_xxsel<mode>_uns): Delete.
6497 (vsx_xxsel<mode>2): New define_insn.
6498 (vsx_xxsel<mode>3): Likewise.
6499 (vsx_xxsel<mode>4): Likewise.
6501 2021-10-28 Hongyu Wang <hongyu.wang@intel.com>
6503 * config/i386/i386.c (use_rsqrt_p): Add mode parameter, enable
6504 HFmode rsqrt without TARGET_SSE_MATH.
6505 (ix86_optab_supported_p): Refactor rint, adjust floor, ceil,
6506 btrunc condition to be restricted by -ftrapping-math, adjust
6507 use_rsqrt_p function call.
6508 * config/i386/i386.md (rcphf2): New define_insn.
6509 (rsqrthf2): Likewise.
6510 * config/i386/sse.md (div<mode>3): Change VF2H to VF2.
6511 (div<mode>3): New expander for HF mode.
6512 (rsqrt<mode>2): Likewise.
6513 (*avx512fp16_vmrcpv8hf2): New define_insn for rpad pass.
6514 (*avx512fp16_vmrsqrtv8hf2): Likewise.
6516 2021-10-27 Saagar Jha <saagar@saagarjha.com>
6518 * config.gcc: Adjust for Darwin21.
6519 * config/darwin-c.c (macosx_version_as_macro): Likewise.
6520 * config/darwin-driver.c (validate_macosx_version_min):
6522 (darwin_find_version_from_kernel): Likewise.
6524 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6526 * gimple-range-path.cc
6527 (path_range_query::range_defined_in_block): Call killing_def.
6529 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6531 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Dump
6532 operands as well as relation.
6533 * gimple-range-path.cc
6534 (path_range_query::compute_ranges_in_block): Compute PHI relations
6535 first. Compute outgoing relations at the end.
6536 (path_range_query::compute_ranges): Remove call to compute_relations.
6537 (path_range_query::compute_relations): Remove.
6538 (path_range_query::maybe_register_phi_relation): New.
6539 (path_range_query::compute_phi_relations): Abstract out
6540 registering one PHI relation to...
6541 (path_range_query::compute_outgoing_relations): ...here.
6542 * gimple-range-path.h (class path_range_query): Remove
6544 Add maybe_register_phi_relation.
6546 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6547 Andrew MacLeod <amacleod@redhat.com>
6549 * value-relation.cc (path_oracle::killing_def): Kill second
6552 2021-10-27 Martin Jambor <mjambor@suse.cz>
6554 * ipa-cp.c (good_cloning_opportunity_p): Decide whether to use
6555 profile feedback depending on their local availability.
6557 2021-10-27 Martin Jambor <mjambor@suse.cz>
6559 * params.opt (param_ipa_cp_profile_count_base): New parameter.
6560 * doc/invoke.texi (Optimize Options): Add entry for
6561 ipa-cp-profile-count-base.
6562 * ipa-cp.c (max_count): Replace with base_count, replace all
6563 occurrences too, unless otherwise stated.
6564 (ipcp_cloning_candidate_p): identify mostly-directly called
6565 functions based on their counts, not max_count.
6566 (compare_edge_profile_counts): New function.
6567 (ipcp_propagate_stage): Instead of setting max_count, find the
6568 appropriate edge count in a sorted vector of counts of eligible
6569 edges and make it the base_count.
6571 2021-10-27 Martin Jambor <mjambor@suse.cz>
6573 * ipa-cp.c (struct caller_statistics): New fields rec_count_sum,
6574 n_nonrec_calls and itself, document all fields.
6575 (init_caller_stats): Initialize the above new fields.
6576 (gather_caller_stats): Gather self-recursive counts and calls number.
6577 (get_info_about_necessary_edges): Gather counts of self-recursive and
6578 other edges bringing in the requested value separately.
6579 (dump_profile_updates): Rework to dump info about a single node only.
6580 (lenient_count_portion_handling): New function.
6581 (struct gather_other_count_struct): New type.
6582 (gather_count_of_non_rec_edges): New function.
6583 (struct desc_incoming_count_struct): New type.
6584 (analyze_clone_icoming_counts): New function.
6585 (adjust_clone_incoming_counts): Likewise.
6586 (update_counts_for_self_gen_clones): Likewise.
6587 (update_profiling_info): Rewritten.
6588 (update_specialized_profile): Adjust call to dump_profile_updates.
6589 (create_specialized_node): Do not update profiling info.
6590 (decide_about_value): New parameter self_gen_clones, either push new
6591 clones into it or updat their profile counts. For self-recursively
6592 generated values, use a portion of the node count instead of count
6593 from self-recursive edges to estimate goodness.
6594 (decide_whether_version_node): Gather clones for self-generated values
6595 in a new vector, update their profiles at once at the end.
6597 2021-10-27 Richard Biener <rguenther@suse.de>
6599 * tree-vectorizer.c (vect_transform_loops): New function,
6601 (try_vectorize_loop_1): ... here. Simplify as epilogues
6602 are now fully handled in the split part.
6604 2021-10-27 Jakub Jelinek <jakub@redhat.com>
6606 * omp-expand.c (expand_omp_for_init_counts): Handle non-rectangular
6607 iterators with pointer types.
6608 (expand_omp_for_init_vars, extract_omp_for_update_vars): Likewise.
6610 2021-10-26 Martin Sebor <msebor@redhat.com>
6612 PR tree-optimization/102238
6613 PR tree-optimization/102919
6614 * gimple-ssa-sprintf.c (get_string_length): Add an argument.
6615 (array_elt_at_offset): Move to pointer-query.
6616 (set_aggregate_size_and_offset): New function.
6617 (field_at_offset): Move to pointer-query.
6618 (get_origin_and_offset): Rename...
6619 (get_origin_and_offset_r): this. Add an argument. Make aggregate
6620 handling more robust.
6621 (get_origin_and_offset): New.
6622 (alias_offset): Add an argument.
6623 (format_string): Use subobject size determined by get_origin_and_offset.
6624 * pointer-query.cc (field_at_offset): Move from gimple-ssa-sprintf.c.
6625 Improve/correct handling of aggregates.
6626 (array_elt_at_offset): Same.
6627 * pointer-query.h (field_at_offset): Declare.
6628 (array_elt_at_offset): Declare.
6630 2021-10-26 Martin Sebor <msebor@redhat.com>
6632 * builtins.c (check_strncat_sizes): Pass access_data ctor additional
6634 (expand_builtin_memcmp): Move code to gimple-ssa-warn-access.cc.
6635 (expand_builtin_fork_or_exec): Same.
6636 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Pass
6637 compute_objsize additional arguments.
6638 (inbounds_memaccess_p): Same.
6639 (array_bounds_checker::check_array_bounds): Add an assert. Stash
6640 statement in a member.
6641 (check_array_bounds_dom_walker::before_dom_children): Same.
6642 * gimple-array-bounds.h (array_bounds_checker::m_stmt): New member.
6643 * gimple-ssa-sprintf.c (get_destination_size): Add an argument.
6644 (handle_printf_call): Pass a new argument.
6645 * gimple-ssa-warn-access.cc (get_size_range): Add an argument.
6646 (check_access): Add an argument and pass it along to callees.
6647 (check_read_access): Make a member function.
6648 (pass_waccess::check_strcat): Pass access_data ctor additional
6650 (pass_waccess::check_strncat): Same.
6651 (pass_waccess::check_stxcpy): Same.
6652 (pass_waccess::check_stxncpy): Same.
6653 (pass_waccess::check_strncmp): Same.
6654 (pass_waccess::check_read_access): Same.
6655 (pass_waccess::check_builtin): Same.
6656 (pass_waccess::maybe_check_access_sizes): Same.
6657 (pass_waccess::maybe_check_dealloc_call): Same.
6658 * gimple-ssa-warn-access.h (check_read_access): Declare a new
6660 * pointer-query.cc (compute_objsize_r): Add an argument.
6661 (gimple_call_return_array): Same.
6662 (gimple_call_alloc_size): Same.
6663 (access_ref::access_ref): Same.
6664 (access_ref::get_ref): Same.
6665 (pointer_query::get_ref): Same.
6666 (handle_min_max_size): Pass an arguments to callees.
6667 (handle_array_ref): Add an argument.
6668 (handle_mem_ref): Same.
6669 (compute_objsize): Same.
6670 * pointer-query.h (struct access_ref): Adjust signatures.
6671 (struct access_data): Same.
6672 (gimple_call_alloc_size): Add an argument.
6673 (gimple_parm_array_size): Same.
6674 (compute_objsize): Same.
6675 * tree-ssa-strlen.c (strlen_pass::adjust_last_stmt): Pass an additional
6676 argument to compute_objsize.
6677 (strlen_pass::maybe_warn_overflow): Same.
6678 (maybe_diag_stxncpy_trunc): Same.
6680 2021-10-26 Martin Sebor <msebor@redhat.com>
6682 PR middle-end/102453
6683 * gimple-ssa-warn-access.cc (pass_waccess::check_atomic_builtin): New.
6684 (pass_waccess::check_atomic_builtin): Call it.
6686 2021-10-26 Vladimir N. Makarov <vmakarov@redhat.com>
6688 PR rtl-optimization/102842
6689 * lra-constraints.c (match_reload): Ignore out in checking values
6691 (curr_insn_transform): Collect outputs before doing reloads of operands.
6693 2021-10-26 Paul A. Clarke <pc@us.ibm.com>
6696 * config/rs6000/x86intrin.h: Move some included headers to new
6697 headers. Include new immintrin.h instead of those headers.
6698 * config/rs6000/immintrin.h: New.
6699 * config/rs6000/x86gprintrin.h: New.
6700 * config.gcc (powerpc*-*-*): Add new headers to extra_headers.
6702 2021-10-26 Richard Biener <rguenther@suse.de>
6704 * tree-vectorizer.h (vect_create_addr_base_for_vector_ref):
6705 Remove byte_offset parameter.
6706 (vect_create_data_ref_ptr): Likewise.
6707 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
6709 (vect_create_data_ref_ptr): Likewise.
6710 * tree-vect-stmts.c (vectorizable_store): Adjust.
6711 (vectorizable_load): Likewise.
6713 2021-10-26 Richard Biener <rguenther@suse.de>
6715 PR tree-optimization/96109
6716 * tree-vectorizer.h (dr_misalignment): Add optional offset
6718 * tree-vect-data-refs.c (dr_misalignment): Likewise. Remove
6719 offset applied for negative stride accesses.
6720 (vect_enhance_data_refs_alignment): Compute negative stride
6721 access offset and pass it to dr_misalignment.
6722 * tree-vect-stmts.c (get_negative_load_store_type): Pass
6723 negative offset to dr_misalignment.
6724 (get_group_load_store_type): Likewise.
6725 (get_load_store_type): Likewise.
6726 (vectorizable_store): Remove asserts about alignment.
6727 (vectorizable_load): Likewise.
6729 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
6731 PR tree-optimization/102897
6732 * tree-ssa-forwprop.c (simplify_permutation): Remove a wrong assertion.
6734 2021-10-26 Richard Biener <rguenther@suse.de>
6736 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
6737 Take offset in bytes.
6738 (vect_create_data_ref_ptr): Likewise.
6739 * tree-vect-loop-manip.c (get_misalign_in_elems): Multiply
6740 offset by element size.
6741 (vect_create_cond_for_align_checks): Likewise.
6742 * tree-vect-stmts.c (get_negative_load_store_type): Likewise.
6743 (vectorizable_load): Remove duplicate leftover from merge
6746 2021-10-26 Roger Sayle <roger@nextmovesoftware.com>
6748 * config/i386/i386-expand.c (ix86_expand_v1ti_shift): New helper
6749 function to expand V1TI mode logical shifts by integer constants.
6750 (ix86_expand_v1ti_rotate): New helper function to expand V1TI
6751 mode rotations by integer constants.
6752 * config/i386/i386-protos.h (ix86_expand_v1ti_shift,
6753 ix86_expand_v1ti_rotate): Prototype new functions here.
6754 * config/i386/sse.md (ashlv1ti3, lshrv1ti3, rotlv1ti3, rotrv1ti3):
6755 New TARGET_SSE2 expanders to implement V1TI shifts and rotations.
6757 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6759 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
6760 Avoid threading circular paths.
6762 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6764 * tree-ssa-threadbackward.c (back_threader::resolve_phi):
6765 Attempt to resolve all incoming paths to a PHI.
6766 (back_threader::resolve_def): Always return true for PHIs.
6768 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6770 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
6771 Always try to resolve path without looking back.
6772 * tree-ssa-threadupdate.c (dump_jump_thread): Indidicate whether
6773 edge is a back edge.
6775 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
6777 PR tree-optimization/102789
6778 * tree-vect-loop-manip.c (vect_update_inits_of_drs): Do not
6779 update inits of simd_lane_access.
6781 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6783 * tree-vrp.c (vrp_simplify_cond_using_ranges): Add return type and
6784 move to vr-values.c.
6785 (simplify_casted_conds): Move to vrp_folder class.
6786 (execute_vrp): Call via vrp_folder now.
6787 * vr-values.c (simplify_cond_using_ranges_1): Call simplify_casted_cond.
6788 (simplify_using_ranges::simplify_casted_cond): Relocate from tree-vrp.c.
6789 * vr-values.h (simplify_casted_cond): Add prototype.
6791 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6793 * tree-vrp.c (rvrp_folder::fold_stmt): If simplification fails, try
6796 2021-10-25 Paul A. Clarke <pc@us.ibm.com>
6798 * config/rs6000/smmintrin.h (_mm_testz_si128): Add "extern" to
6800 (_mm_testc_si128): Likewise.
6801 (_mm_testnzc_si128): Likewise.
6802 (_mm_blend_ps): Likewise.
6803 (_mm_blendv_ps): Likewise.
6804 (_mm_blend_pd): Likewise.
6805 (_mm_blendv_pd): Likewise.
6806 (_mm_ceil_pd): Likewise.
6807 (_mm_ceil_sd): Likewise.
6808 (_mm_ceil_ps): Likewise.
6809 (_mm_ceil_ss): Likewise.
6810 (_mm_floor_pd): Likewise.
6811 (_mm_floor_sd): Likewise.
6812 (_mm_floor_ps): Likewise.
6813 (_mm_floor_ss): Likewise.
6814 (_mm_minpos_epu16): Likewise.
6815 (_mm_mul_epi32): Likewise.
6816 (_mm_cvtepi8_epi16): Likewise.
6817 (_mm_packus_epi32): Likewise.
6818 (_mm_cmpgt_epi64): Likewise.
6820 2021-10-25 Roger Sayle <roger@nextmovesoftware.com>
6822 * simplify-rtx.c (simplify_binary_operation_1) [SS_ASHIFT]: Simplify
6823 shifts of the mode's smin_value and smax_value when the bit count
6824 operand doesn't have side-effects.
6825 [US_ASHIFT]: Likewise, simplify shifts of the mode's umax_value
6826 when the bit count operand doesn't have side-effects.
6827 (simplify_const_binary_operation) [SS_ASHIFT, US_ASHIFT]: Perform
6828 compile-time evaluation of saturating left shifts with constant
6831 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6833 * gimple-range.cc (gimple_ranger::export_global_ranges): Remove check
6836 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6838 * flag-types.h (enum ranger_debug): Adjust values.
6839 * params.opt (ranger_debug): Ditto.
6841 2021-10-25 Martin Jambor <mjambor@suse.cz>
6843 PR tree-optimization/102886
6844 * tree-sra.c (totally_scalarize_subtree): Fix the out of
6847 2021-10-25 Andrew Pinski <apinski@marvell.com>
6849 * tree-ssa-dce.c (simple_dce_from_worklist):
6850 Check stmt_unremovable_because_of_non_call_eh_p also
6851 before removing the statement.
6853 2021-10-25 Richard Biener <rguenther@suse.de>
6855 PR tree-optimization/102905
6856 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
6857 Use vect_supportable_dr_alignment again to determine whether
6858 an access is supported when not aligned.
6860 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6862 * config/riscv/riscv.c (riscv_rtx_costs): Handle cost model
6865 2021-10-25 Jim Wilson <jimw@sifive.com>
6866 Kito Cheng <kito.cheng@sifive.com>
6867 Jia-Wei Chen <jiawei@iscas.ac.cn>
6868 Shi-Hua Liao <shihua@iscas.ac.cn>
6870 * config/riscv/bitmanip.md (shiftm1): New.
6871 (*bset<mode>): Ditto.
6872 (*bset<mode>_mask): Ditto.
6873 (*bset<mode>_1): Ditto.
6874 (*bset<mode>_1_mask): Ditto.
6875 (*bseti<mode>): Ditto.
6876 (*bclr<mode>): Ditto.
6877 (*bclri<mode>): Ditto.
6878 (*binv<mode>): Ditto.
6879 (*binvi<mode>): Ditto.
6880 (*bext<mode>): Ditto.
6882 * config/riscv/predicates.md (splittable_const_int_operand):
6884 (single_bit_mask_operand): New.
6885 (not_single_bit_mask_operand): Ditto.
6886 (const31_operand): Ditto.
6887 (const63_operand): Ditto.
6888 * config/riscv/riscv.c (riscv_build_integer_1): Handle bseti.
6889 (riscv_output_move): Ditto.
6890 (riscv_print_operand): Handle new operand type: T and S.
6891 * config/riscv/riscv.h (SINGLE_BIT_MASK_OPERAND): New.
6893 2021-10-25 Jim Wilson <jimw@sifive.com>
6895 * config/riscv/riscv.c (riscv_build_integer_1): Build integer
6898 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6900 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
6902 (riscv_rtx_costs): Ditto.
6904 2021-10-25 Jim Wilson <jimw@sifive.com>
6905 Kito Cheng <kito.cheng@sifive.com>
6906 Jia-Wei Chen <jiawei@iscas.ac.cn>
6908 * config/riscv/bitmanip.md (bitmanip_bitwise): New.
6909 (bitmanip_minmax): New.
6910 (clz_ctz_pcnt): New.
6911 (bitmanip_optab): New.
6912 (bitmanip_insn): New.
6913 (*<optab>_not<mode>): New.
6914 (*xor_not<mode>): New.
6915 (<bitmanip_optab>si2): New.
6916 (*<bitmanip_optab>disi2): New.
6917 (<bitmanip_optab>di2): New.
6918 (*zero_extendhi<GPR:mode>2_bitmanip): New.
6919 (*extend<SHORT:mode><SUPERQI:mode>2_zbb): New.
6920 (*zero_extendhi<GPR:mode>2_zbb): New.
6923 (rotrsi3_sext): New.
6926 (rotlsi3_sext): New.
6927 (bswap<mode>2): New.
6928 (<bitmanip_optab><mode>3): New.
6929 * config/riscv/riscv.md (type): Add rotate.
6930 (zero_extendhi<GPR:mode>2): Change to define_expand pattern.
6931 (*zero_extendhi<GPR:mode>2): New.
6932 (extend<SHORT:mode><SUPERQI:mode>2): Change to define_expand pattern.
6933 (*extend<SHORT:mode><SUPERQI:mode>2): New.
6935 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6937 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
6939 (riscv_rtx_costs): Ditto.
6941 2021-10-25 Jim Wilson <jimw@sifive.com>
6942 Kito Cheng <kito.cheng@sifive.com>
6943 Jia-Wei Chen <jiawei@iscas.ac.cn>
6945 * config/riscv/bitmanip.md (*zero_extendsidi2_bitmanip): New.
6950 (riscv_rtx_costs): Ditto.
6951 * config/riscv/riscv.md: Include bitmanip.md
6952 (type): Add bitmanip bype.
6953 (zero_extendsidi2): Change to define_expand pattern.
6954 (*zero_extendsidi2_internal): New.
6955 (zero_extendsidi2_shifted): Disable for ZBA.
6957 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6959 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
6960 Add zba, zbb, zbc and zbs.
6961 (riscv_ext_flag_table): Ditto.
6962 * config/riscv/riscv-opts.h (MASK_ZBA): New.
6966 (TARGET_ZBA): Ditto.
6967 (TARGET_ZBB): Ditto.
6968 (TARGET_ZBC): Ditto.
6969 (TARGET_ZBS): Ditto.
6970 * config/riscv/riscv.opt (riscv_zb_subext): New.
6972 2021-10-25 liuhongt <hongtao.liu@intel.com>
6975 * match.pd: Simplify (_Float16) sqrtf((float) a) to .SQRT(a)
6976 when direct_internal_fn_supported_p, similar for sqrt/sqrtl.
6978 2021-10-25 Richard Biener <rguenther@suse.de>
6980 PR tree-optimization/102920
6981 * tree-ssa-sccvn.h (expressions_equal_p): Add argument
6982 controlling VN_TOP matching behavior.
6983 * tree-ssa-sccvn.c (expressions_equal_p): Likewise.
6984 (vn_phi_eq): Do not optimistically match VN_TOP.
6986 2021-10-25 konglin1 <lingling.kong@intel.com>
6988 * config/i386/sse.md (fma_<mode>_fadd_fmul): Add new
6989 define_insn_and_split.
6990 (fma_<mode>_fadd_fcmul):Likewise
6991 (fma_<complexopname>_<mode>_fma_zero):Likewise
6993 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
6995 * config/pa/pa-d.c (pa_d_handle_target_float_abi): Don't check
6996 TARGET_DISABLE_FPREGS.
6997 * config/pa/pa.c (fix_range): Use MASK_SOFT_FLOAT instead of
6998 MASK_DISABLE_FPREGS.
6999 (hppa_rtx_costs): Don't check TARGET_DISABLE_FPREGS. Adjust
7000 cost of hardware integer multiplication.
7001 (pa_conditional_register_usage): Don't check TARGET_DISABLE_FPREGS.
7002 * config/pa/pa.h (INT14_OK_STRICT): Likewise.
7003 * config/pa/pa.md: Don't check TARGET_DISABLE_FPREGS. Check
7004 TARGET_SOFT_FLOAT in patterns that use xmpyu instruction.
7005 * config/pa/pa.opt (mdisable-fpregs): Change target mask to
7006 SOFT_FLOAT. Revise comment.
7007 (msoft-float): New option.
7009 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
7011 * config/pa/pa.md: Don't use 'G' constraint in integer move patterns.
7013 2021-10-24 Gerald Pfeifer <gerald@pfeifer.com>
7015 * doc/install.texi (Specific): Remove obsolete details
7016 around GNU/Linux on Itanium.
7017 (Specific): Remove reference to Windows for Itanium.
7019 2021-10-23 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
7021 * config/i386/x86-tune-sched-bd.c (dispatch_group): Commentary
7024 2021-10-23 Jan Hubicka <hubicka@ucw.cz>
7026 * tree-ssa-structalias.c (compute_points_to_sets): Cleanup.
7028 2021-10-23 Roger Sayle <roger@nextmovesoftware.com>
7030 * config/i386/sse.md (<any_logic>v1ti3): New define_insn to
7031 implement V1TImode AND, IOR and XOR on TARGET_SSE2 (and above).
7032 (one_cmplv1ti2): New define expand.
7034 2021-10-22 Eric Gallager <egallager@gcc.gnu.org>
7037 * Makefile.in: Handle dvidir and install-dvi target.
7038 * configure: Regenerate.
7039 * configure.ac: Add install-dvi to target_list.
7041 2021-10-22 Gerald Pfeifer <gerald@pfeifer.com>
7043 * doc/install.texi (Binaries): Convert mingw-w64.org to https.
7046 2021-10-22 Richard Biener <rguenther@suse.de>
7048 PR tree-optimization/102893
7049 * tree-ssa-dce.c (find_obviously_necessary_stmts): Fix the
7050 test for an exit edge.
7052 2021-10-22 Aldy Hernandez <aldyh@redhat.com>
7053 Andrew MacLeod <amacleod@redhat.com>
7055 * gimple-range-path.cc (path_range_query::compute_phi_relations):
7056 Kill any global relations we may know before registering a new
7058 * value-relation.cc (path_oracle::killing_def): New.
7059 * value-relation.h (path_oracle::killing_def): New.
7061 2021-10-22 Richard Biener <rguenther@suse.de>
7064 * tree-ssa-sccvn.c (vn_phi_insert): For undefined SSA args
7066 (vn_phi_lookup): Likewise.
7068 2021-10-21 H.J. Lu <hjl.tools@gmail.com>
7071 * doc/invoke.texi: Document -fcf-protection requires i686 or
7074 2021-10-21 Eric Botcazou <ebotcazou@adacore.com>
7076 PR middle-end/102764
7077 * cfgexpand.c (expand_gimple_basic_block): Robustify latest change.
7079 2021-10-21 Jonathan Wright <jonathan.wright@arm.com>
7081 * config/aarch64/arm_neon.h (__STRUCTN): Delete function
7082 macro and all invocations.
7084 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7086 * doc/invoke.texi (ranger-debug): Document.
7087 * flag-types.h (enum ranger_debug): New.
7088 (enum evrp_mode): Remove debug values.
7089 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Use new debug flag.
7090 * gimple-range-gori.cc (gori_compute::gori_compute): Ditto.
7091 * gimple-range.cc (gimple_ranger::gimple_ranger): Ditto.
7092 * gimple-ssa-evrp.c (hybrid_folder::choose_value): Ditto.
7093 (execute_early_vrp): Use evrp-mode directly.
7094 * params.opt (enum evrp_mode): Remove debug values.
7095 (ranger-debug): New.
7096 (ranger-logical-depth): Relocate to be in alphabetical order.
7098 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7100 * doc/invoke.texi: (vrp1-mode, vrp2-mode): Document.
7101 * flag-types.h: (enum vrp_mode): New.
7102 * params.opt: (vrp1-mode, vrp2-mode): New.
7103 * tree-vrp.c (vrp_pass_num): New.
7104 (pass_vrp::pass_vrp): Set pass number.
7105 (pass_vrp::execute): Choose which VRP mode to execute.
7107 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
7109 * gimple-ssa-evrp.c (class rvrp_folder): Move to tree-vrp.c.
7110 (execute_early_vrp): For ranger only mode, invoke ranger_vrp.
7111 * tree-vrp.c (class rvrp_folder): Relocate here.
7112 (execute_ranger_vrp): New.
7113 * tree-vrp.h (execute_ranger_vrp): Export.
7115 2021-10-21 Martin Liska <mliska@suse.cz>
7119 * opts.c (finish_options): Process flag_var_tracking* options
7120 here as they can be adjusted by optimize attribute.
7121 Process also flag_syntax_only and flag_gtoggle.
7122 * toplev.c (process_options): Remove it here.
7123 * common.opt: Make debug_nonbind_markers_p as PerFunction
7124 attribute as it depends on optimization level.
7126 2021-10-21 Martin Jambor <mjambor@suse.cz>
7128 PR tree-optimization/102505
7129 * tree-sra.c (totally_scalarize_subtree): Check that the
7130 encountered field fits within the acces we would like to put it
7133 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
7135 * tree-ssa-threadbackward.c
7136 (back_threader::maybe_register_path): Remove circular paths check.
7138 2021-10-21 Richard Biener <rguenther@suse.de>
7140 * toplev.c (process_options): Move the initial debug_hooks
7142 (toplev::main): ... before the call of the post_options
7145 2021-10-21 Richard Biener <rguenther@suse.de>
7147 PR tree-optimization/102847
7148 * tree-vect-stmts.c (vect_model_load_cost): Add the scalar
7149 load cost in the prologue for VMAT_INVARIANT.
7151 2021-10-21 Richard Biener <rguenther@suse.de>
7153 PR tree-optimization/102847
7154 * tree-vect-stmts.c (vect_model_load_cost): Explicitely
7155 handle VMAT_INVARIANT as a splat in the prologue.
7157 2021-10-21 Hongyu Wang <hongyu.wang@intel.com>
7160 * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector
7161 move to use the same logic as HImode.
7163 2021-10-21 Richard Biener <rguenther@suse.de>
7165 * tree-vect-slp.c (vect_build_slp_tree_1): Remove
7166 superfluous gimple_call_nothrow_p check.
7168 2021-10-21 Andrew Pinski <apinski@marvell.com>
7170 * tree-cfg.c (maybe_remove_writeonly_store): Add dce_ssa_names argument.
7171 Mark the ssa-name of the rhs as one to be removed.
7172 (execute_fixup_cfg): Update call to maybe_remove_writeonly_store.
7173 Call simple_dce_from_worklist at the end to a simple dce.
7175 2021-10-21 Andrew Pinski <apinski@marvell.com>
7177 * tree-cfg.c (maybe_remove_writeonly_store): New function
7178 factored out from ...
7179 (execute_fixup_cfg): Here. Call maybe_remove_writeonly_store.
7181 2021-10-21 Andrew Pinski <apinski@marvell.com>
7183 * tree-cfg.c (execute_fixup_cfg): Remove comment
7184 about standalone pass.
7186 2021-10-21 Andrew Pinski <apinski@marvell.com>
7188 * tree-cfg.c (execute_fixup_cfg): Output when the statement
7189 is removed when it is a write only var.
7191 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
7193 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
7194 Avoid threading circular paths.
7196 2021-10-20 Alex Coplan <alex.coplan@arm.com>
7198 * calls.c (initialize_argument_information): Remove some dead
7199 code, remove handling for function_arg returning const_int.
7200 * doc/tm.texi: Delete documentation for unused target hooks.
7201 * doc/tm.texi.in: Likewise.
7202 * target.def (load_bounds_for_arg): Delete.
7203 (store_bounds_for_arg): Delete.
7204 (load_returned_bounds): Delete.
7205 (store_returned_bounds): Delete.
7206 * targhooks.c (default_load_bounds_for_arg): Delete.
7207 (default_store_bounds_for_arg): Delete.
7208 (default_load_returned_bounds): Delete.
7209 (default_store_returned_bounds): Delete.
7210 * targhooks.h (default_load_bounds_for_arg): Delete.
7211 (default_store_bounds_for_arg): Delete.
7212 (default_load_returned_bounds): Delete.
7213 (default_store_returned_bounds): Delete.
7215 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
7217 * doc/extend.texi (Basic Asm): Clarify that asm is not an
7219 * doc/invoke.texi (-fno-asm): Fix description for C++.
7221 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
7223 * doc/install.texi: Remove link to old.html
7225 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7227 * config/aarch64/aarch64-simd.md (*aarch64_cmtst_same_<mode>): New.
7229 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7231 * config/aarch64/aarch64-simd.md (*aarch64_narrow_trunc<mode>): New.
7233 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7235 * config/aarch64/aarch64-simd.md (aarch64_simd_ashr<mode>): Add case cmp
7237 * config/aarch64/constraints.md (D1): New.
7239 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7241 * config/aarch64/aarch64-simd.md
7242 (*aarch64_<srn_op>topbits_shuffle<mode>_le): New.
7243 (*aarch64_topbits_shuffle<mode>_le): New.
7244 (*aarch64_<srn_op>topbits_shuffle<mode>_be): New.
7245 (*aarch64_topbits_shuffle<mode>_be): New.
7246 * config/aarch64/predicates.md
7247 (aarch64_simd_shift_imm_vec_exact_top): New.
7249 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7251 * config/aarch64/aarch64-simd.md (*aarch64_<srn_op>shrn<mode>_vect,
7252 *aarch64_<srn_op>shrn<mode>2_vect_le,
7253 *aarch64_<srn_op>shrn<mode>2_vect_be): New.
7254 * config/aarch64/iterators.md (srn_op): New.
7256 2021-10-20 Chung-Lin Tang <cltang@codesourcery.com>
7258 * omp-low.c (omp_copy_decl_2): For !ctx, use record_vars to add new copy
7260 (scan_sharing_clauses): Place copy of OMP_CLAUSE_IN_REDUCTION decl in
7261 ctx->outer instead of ctx.
7263 2021-10-20 Martin Liska <mliska@suse.cz>
7266 2021-10-19 Martin Liska <mliska@suse.cz>
7269 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
7270 * system.h (strip_whilespaces): New function.
7272 2021-10-20 Martin Liska <mliska@suse.cz>
7275 2021-10-19 Martin Liska <mliska@suse.cz>
7278 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
7281 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7283 * config/aarch64/aarch64.c (aarch64_case_values_threshold):
7284 Change to 8 with -Os, 11 otherwise.
7286 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7288 * config/aarch64/aarch64.c (neoversev1_tunings):
7289 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
7290 (neoversen2_tunings): Likewise.
7292 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7295 * config/aarch64/aarch64.md (frint_pattern): Update comment.
7296 * config/aarch64/aarch64-simd-builtins.def: Change frintn to roundeven.
7297 * config/aarch64/arm_fp16.h: Change frintn to roundeven.
7298 * config/aarch64/arm_neon.h: Likewise.
7299 * config/aarch64/iterators.md (frint_pattern): Use roundeven for FRINTN.
7301 2021-10-20 Martin Liska <mliska@suse.cz>
7303 * config/arm/arm.c (arm_unwind_emit_sequence): Do not declare
7304 already declared global variable.
7305 (arm_unwind_emit_set): Use out_file as function argument.
7306 (arm_unwind_emit): Likewise.
7307 * config/darwin.c (machopic_output_data_section_indirection): Likewise.
7308 (machopic_output_stub_indirection): Likewise.
7309 (machopic_output_indirection): Likewise.
7310 (machopic_finish): Likewise.
7311 * config/i386/i386.c (ix86_asm_output_function_label): Likewise.
7312 * config/i386/winnt.c (i386_pe_seh_unwind_emit): Likewise.
7313 * config/ia64/ia64.c (process_epilogue): Likewise.
7314 (process_cfa_adjust_cfa): Likewise.
7315 (process_cfa_register): Likewise.
7316 (process_cfa_offset): Likewise.
7317 (ia64_asm_unwind_emit): Likewise.
7318 * config/s390/s390.c (s390_asm_output_function_label): Likewise.
7320 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
7322 * config/aarch64/aarch64-builtins.c
7323 (aarch64_general_gimple_fold_builtin):
7324 lower vld1 and vst1 variants of the neon builtins
7325 * config/aarch64/aarch64-protos.h:
7326 (aarch64_general_gimple_fold_builtin): Add gsi parameter.
7327 * config/aarch64/aarch64.c (aarch64_general_gimple_fold_builtin):
7330 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
7332 * match.pd: Generate IFN_TRUNC.
7334 2021-10-20 Richard Biener <rguenther@suse.de>
7336 PR tree-optimization/102853
7337 * tree-data-ref.c (split_constant_offset_1): Bail out
7338 immediately if the expression traps on overflow.
7340 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7342 * tree-ssa-threadbackward.c (back_threader::~back_threader): Remove.
7344 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7346 * tree-ssa-threadupdate.c (back_jt_path_registry::adjust_paths_after_duplication):
7347 Remove superflous debugging message.
7348 (back_jt_path_registry::duplicate_thread_path): Same.
7350 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7352 * tree-ssa-threadbackward.c (back_threader_registry::back_threader_registry):
7354 (back_threader_registry::register_path): Remove m_threaded_paths.
7356 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7358 PR tree-optimization/102814
7359 * doc/invoke.texi: Document --param=max-fsm-thread-length.
7360 * params.opt: Add --param=max-fsm-thread-length.
7361 * tree-ssa-threadbackward.c
7362 (back_threader_profitability::profitable_path_p): Fail on paths
7363 longer than max-fsm-thread-length.
7365 2021-10-20 Eric Botcazou <ebotcazou@adacore.com>
7367 PR middle-end/102764
7368 * cfgexpand.c (expand_gimple_basic_block): Disregard a final debug
7369 statement to reset the current location for the outgoing edges.
7371 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7372 Richard Biener <rguenther@suse.de>
7374 * tree-ssa-threadupdate.c (cancel_thread): Dump threading reason
7375 on the same line as the threading cancellation.
7376 (jt_path_registry::cancel_invalid_paths): Avoid rotating loops.
7377 Avoid threading through loop headers where the path remains in the
7380 2021-10-20 Siddhesh Poyarekar <siddhesh@gotplt.org>
7382 * tree-object-size.c (unknown): Make into a function. Adjust
7384 (unknown_object_size): Simplify implementation.
7386 2021-10-20 Hongtao Liu <hongtao.liu@intel.com>
7387 Kewen Lin <linkw@linux.ibm.com>
7389 * doc/sourcebuild.texi (Effective-Target Keywords): Document
7390 vect_slp_v2qi_store, vect_slp_v4qi_store, vect_slp_v8qi_store,
7391 vect_slp_v16qi_store, vect_slp_v2hi_store,
7392 vect_slp_v4hi_store, vect_slp_v2si_store, vect_slp_v4si_store.
7394 2021-10-19 Jonathan Wakely <jwakely@redhat.com>
7396 * doc/extend.texi (Basic PowerPC Built-in Functions): Fix typo.
7398 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
7402 * config/rs6000/emmintrin.h: Guard POWER8 intrinsics.
7403 * config/rs6000/pmmintrin.h: Same.
7404 * config/rs6000/smmintrin.h: Same.
7405 * config/rs6000/tmmintrin.h: Same.
7407 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
7409 * config.gcc (extra_headers): Add nmmintrin.h.
7411 2021-10-19 Richard Biener <rguenther@suse.de>
7413 * tree-vectorizer.h (vect_supportable_dr_alignment): Add
7414 misalignment parameter.
7415 * tree-vect-data-refs.c (vect_get_peeling_costs_all_drs):
7416 Do not change DR_MISALIGNMENT in place, instead pass the
7417 adjusted misalignment to vect_supportable_dr_alignment.
7418 (vect_peeling_supportable): Likewise.
7419 (vect_peeling_hash_get_lowest_cost): Adjust.
7420 (vect_enhance_data_refs_alignment): Likewise.
7421 (vect_vfa_access_size): Likewise.
7422 (vect_supportable_dr_alignment): Add misalignment
7423 parameter and simplify.
7424 * tree-vect-stmts.c (get_negative_load_store_type): Adjust.
7425 (get_group_load_store_type): Likewise.
7426 (get_load_store_type): Likewise.
7428 2021-10-19 Clément Chigot <clement.chigot@atos.net>
7430 * config/rs6000/rs6000.c (rs6000_xcoff_file_end): Move
7431 __tls_get_addr reference to .text csect.
7433 2021-10-19 Martin Liska <mliska@suse.cz>
7436 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
7439 2021-10-19 Richard Biener <rguenther@suse.de>
7441 * tree-vectorizer.h (vect_get_store_cost): Adjust signature.
7442 (vect_get_load_cost): Likewise.
7443 * tree-vect-data-refs.c (vect_get_data_access_cost): Get
7444 alignment support scheme and misalignment as arguments
7446 (vect_get_peeling_costs_all_drs): Compute that info here
7447 and note that we shouldn't need to.
7448 * tree-vect-stmts.c (vect_model_store_cost): Get
7449 alignment support scheme and misalignment as arguments.
7450 (vect_get_store_cost): Likewise.
7451 (vect_model_load_cost): Likewise.
7452 (vect_get_load_cost): Likewise.
7453 (vectorizable_store): Pass down alignment support scheme
7454 and misalignment to costing.
7455 (vectorizable_load): Likewise.
7457 2021-10-19 Richard Biener <rguenther@suse.de>
7459 * tree-vect-stmts.c (get_negative_load_store_type): Add
7460 offset output parameter and initialize it.
7461 (get_group_load_store_type): Likewise.
7462 (get_load_store_type): Likewise.
7463 (vectorizable_store): Use offset as computed by
7464 get_load_store_type.
7465 (vectorizable_load): Likewise.
7467 2021-10-19 Richard Biener <rguenther@suse.de>
7469 PR tree-optimization/102827
7470 * tree-if-conv.c (predicate_statements): Add pe parameter
7471 and use that edge to insert invariant stmts on.
7472 (combine_blocks): Pass through pe.
7473 (tree_if_conversion): Compute the edge to insert invariant
7474 stmts on and pass it along.
7476 2021-10-19 Roger Sayle <roger@nextmovesoftware.com>
7479 * config/bfin/bfin.md (addsubv2hi3, subaddv2hi3, ssaddsubv2hi3,
7480 sssubaddv2hi3): Swap the order of operators in vec_concat.
7482 2021-10-19 Xionghu Luo <luoxhu@linux.ibm.com>
7484 * config/rs6000/altivec.md (*altivec_vmrghb_internal): Delete.
7485 (altivec_vmrghb_direct): New.
7486 (*altivec_vmrghh_internal): Delete.
7487 (altivec_vmrghh_direct): New.
7488 (*altivec_vmrghw_internal): Delete.
7489 (altivec_vmrghw_direct_<mode>): New.
7490 (altivec_vmrghw_direct): Delete.
7491 (*altivec_vmrglb_internal): Delete.
7492 (altivec_vmrglb_direct): New.
7493 (*altivec_vmrglh_internal): Delete.
7494 (altivec_vmrglh_direct): New.
7495 (*altivec_vmrglw_internal): Delete.
7496 (altivec_vmrglw_direct_<mode>): New.
7497 (altivec_vmrglw_direct): Delete.
7498 * config/rs6000/rs6000-p8swap.c (rtx_is_swappable_p): Adjust.
7499 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const):
7501 * config/rs6000/vsx.md (vsx_xxmrghw_<mode>): Adjust.
7502 (vsx_xxmrglw_<mode>): Adjust.
7504 2021-10-19 Aldy Hernandez <aldyh@redhat.com>
7506 * passes.def: Change threading comment before pass_ccp pass.
7508 2021-10-19 Haochen Gui <guihaoc@gcc.gnu.org>
7510 * config/rs6000/rs6000-call.c (altivec_expand_lxvr_builtin):
7511 Modify the expansion for sign extension. All extensions are done
7512 within VSX registers.
7514 2021-10-19 Richard Biener <rguenther@suse.de>
7516 * tree-vect-stmts.c (get_group_load_store_type): Add
7517 misalignment output parameter and initialize it.
7518 (get_group_load_store_type): Likewise.
7519 (vectorizable_store): Remove now redundant queries.
7520 (vectorizable_load): Likewise.
7522 2021-10-19 Richard Biener <rguenther@suse.de>
7524 * tree-vectorizer.h (vect_supportable_dr_alignment): Remove
7525 check_aligned argument.
7526 * tree-vect-data-refs.c (vect_supportable_dr_alignment):
7528 (vect_peeling_hash_insert): Add supportable_if_not_aligned
7529 argument and do not call vect_supportable_dr_alignment here.
7530 (vect_peeling_supportable): Adjust.
7531 (vect_enhance_data_refs_alignment): Compute whether the
7532 access is supported with different alignment here and
7533 pass that down to vect_peeling_hash_insert.
7534 (vect_vfa_access_size): Adjust.
7535 * tree-vect-stmts.c (vect_get_store_cost): Likewise.
7536 (vect_get_load_cost): Likewise.
7537 (get_negative_load_store_type): Likewise.
7538 (get_group_load_store_type): Likewise.
7539 (get_load_store_type): Likewise.
7541 2021-10-19 Martin Liska <mliska@suse.cz>
7544 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
7545 * system.h (strip_whilespaces): New function.
7547 2021-10-19 dianhong xu <dianhong.xu@intel.com>
7549 * config/i386/avx512fp16intrin.h:
7550 (_mm512_set1_pch): New intrinsic.
7551 * config/i386/avx512fp16vlintrin.h:
7552 (_mm256_set1_pch): New intrinsic.
7553 (_mm_set1_pch): Ditto.
7555 2021-10-18 Andrew MacLeod <amacleod@redhat.com>
7557 PR tree-optimization/102796
7558 * gimple-range.cc (gimple_ranger::range_on_edge): Process EH edges
7559 normally. Return get_tree_range for non gimple_range_ssa_p names.
7560 (gimple_ranger::range_of_stmt): Use get_tree_range for non
7561 gimple_range_ssa_p names.
7563 2021-10-18 Uroš Bizjak <ubizjak@gmail.com>
7566 * config/i386/i386.c (ix86_print_operand_address):
7567 Error out for non-address_operand asm operands.
7569 2021-10-18 Richard Biener <rguenther@suse.de>
7571 * tree-vect-data-refs.c (vect_peeling_hash_insert): Do
7572 not auto-convert dr_alignment_support to bool.
7573 (vect_peeling_supportable): Likewise.
7574 (vect_enhance_data_refs_alignment): Likewise.
7575 (vect_supportable_dr_alignment): Commonize read/write case.
7576 * tree-vect-stmts.c (vect_get_store_cost): Use
7577 dr_alignment_support, not int, for the vect_supportable_dr_alignment
7579 (vect_get_load_cost): Likewise.
7581 2021-10-18 Siddhesh Poyarekar <siddhesh@gotplt.org>
7583 * tree-object-size.c (object_sizes_execute): Consolidate LHS
7584 null check and do it early.
7586 2021-10-18 Richard Biener <rguenther@suse.de>
7588 * tree-vect-stmts.c (vectorizable_store): Use the
7589 computed alignment scheme instead of querying
7592 2021-10-18 Richard Biener <rguenther@suse.de>
7594 * tree-vect-stmts.c (vectorizable_store): Do not recompute
7595 alignment scheme already determined by get_load_store_type.
7597 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
7599 * tree-ssa-threadbackward.c (class pass_thread_jumps_full):
7600 Clone corresponding pass.
7602 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7604 * combine.c (recog_for_combine): For an unrecognized move/set of
7605 a constant, try force_const_mem to place it in the constant pool.
7606 * cse.c (constant_pool_entries_cost, constant_pool_entries_regcost):
7607 Delete global variables (that are no longer assigned a cost value).
7608 (cse_insn): Simplify logic for deciding whether to place a folded
7609 constant in the constant pool using force_const_mem.
7610 (cse_main): Remove zero initialization of constant_pool_entries_cost
7611 and constant_pool_entries_regcost.
7612 * config/i386/i386.c (ix86_rtx_costs): Make memory accesses
7613 fractionally more expensive, when optimizing for speed.
7615 2021-10-18 Martin Liska <mliska@suse.cz>
7617 PR gcov-profile/102746
7618 PR gcov-profile/102747
7619 * gcov.c (main): Return return_code.
7620 (output_gcov_file): Mark return_code when error happens.
7621 (generate_results): Likewise.
7622 (read_graph_file): Likewise.
7623 (read_count_file): Likewise.
7625 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7627 * config/bfin/bfin.md (define_constants): Remove UNSPEC_ONES.
7628 (define_insn "ones"): Replace UNSPEC_ONES with a truncate of
7629 a popcount, allowing compile-time evaluation/simplification.
7630 (popcountsi2, popcounthi2): New expanders using a "ones" insn.
7632 2021-10-18 Richard Biener <rguenther@suse.de>
7634 PR tree-optimization/102788
7635 * tree-vect-patterns.c (vect_init_pattern_stmt): Allow
7637 (vect_pattern_recog_1): Likewise.
7638 (vect_recog_bool_pattern): Continue matching the pattern
7639 even if we do not have a vector type for a conversion
7642 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7644 * simplify-rtx.c (simplify_const_unary_operation) [SS_NEG, SS_ABS]:
7645 Evalute SS_NEG and SS_ABS of a constant argument.
7647 2021-10-18 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
7650 * gimple-match-head.c (try_conditional_simplification): Add case for single operand.
7651 * internal-fn.def: Add entry for COND_NEG internal function.
7652 * internal-fn.c (FOR_EACH_CODE_MAPPING): Add entry for
7653 NEGATE_EXPR, COND_NEG mapping.
7654 * optabs.def: Add entry for cond_neg_optab.
7655 * match.pd (UNCOND_UNARY, COND_UNARY): New operator lists.
7656 (vec_cond COND (foo A) B) -> (IFN_COND_FOO COND A B): New pattern.
7657 (vec_cond COND B (foo A)) -> (IFN_COND_FOO ~COND A B): Likewise.
7659 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
7661 * tree-ssa-strlen.c (class strlen_pass): Rename from
7663 (handle_builtin_stxncpy_strncat): Move to strlen_pass.
7664 (handle_assign): Same.
7665 (adjust_last_stmt): Same.
7666 (maybe_warn_overflow): Same.
7667 (handle_builtin_strlen): Same.
7668 (handle_builtin_strchr): Same.
7669 (handle_builtin_strcpy): Same.
7670 (handle_builtin_strncat): Same.
7671 (handle_builtin_stxncpy_strncat): Same.
7672 (handle_builtin_memcpy): Same.
7673 (handle_builtin_strcat): Same.
7674 (handle_alloc_call): Same.
7675 (handle_builtin_memset): Same.
7676 (handle_builtin_memcmp): Same.
7677 (get_len_or_size): Same.
7678 (strxcmp_eqz_result): Same.
7679 (handle_builtin_string_cmp): Same.
7680 (handle_pointer_plus): Same.
7681 (count_nonzero_bytes_addr): Same.
7682 (count_nonzero_bytes): Same.
7683 (handle_store): Same.
7684 (strlen_check_and_optimize_call): Same.
7685 (handle_integral_assign): Same.
7686 (check_and_optimize_stmt): Same.
7687 (printf_strlen_execute): Rename strlen_dom_walker to strlen_pass.
7689 2021-10-18 Richard Biener <rguenther@suse.de>
7691 PR tree-optimization/102798
7692 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
7693 Only copy points-to info to newly generated SSA names.
7695 2021-10-18 Martin Liska <mliska@suse.cz>
7697 * dbgcnt.c (dbg_cnt_process_opt): Remove unused but set variable.
7698 * gcov.c (get_cycles_count): Likewise.
7699 * lto-compress.c (lto_compression_zlib): Likewise.
7700 (lto_uncompression_zlib): Likewise.
7701 * targhooks.c (default_pch_valid_p): Likewise.
7703 2021-10-17 Aldy Hernandez <aldyh@redhat.com>
7705 * tree-pass.h (make_pass_thread_jumps_full): New.
7706 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Inline.
7707 (try_thread_blocks): Add resolve and speed arguments.
7708 (pass_thread_jumps::execute): Inline.
7709 (do_early_thread_jumps): New.
7710 (do_thread_jumps): New.
7711 (make_pass_thread_jumps): Move.
7712 (pass_early_thread_jumps::gate): Inline.
7713 (pass_early_thread_jumps::execute): Inline.
7714 (class pass_thread_jumps_full): New.
7716 2021-10-16 Piotr Kubaj <pkubaj@FreeBSD.org>
7718 * configure.ac: Treat powerpc64*-*-freebsd* the same as
7719 powerpc64-*-freebsd*.
7720 * configure: Regenerate.
7722 2021-10-16 H.J. Lu <hjl.tools@gmail.com>
7724 * value-query.cc (get_ssa_name_ptr_info_nonnull): Change
7725 set_ptr_nonull to set_ptr_nonnull in comments.
7727 2021-10-16 Jan Hubicka <hubicka@ucw.cz>
7729 PR tree-optimization/102720
7730 * tree-ssa-structalias.c (compute_points_to_sets): Fix producing
7731 of call used and clobbered sets.
7733 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7735 * config/darwin.h (LINK_COMMAND_SPEC_A): Update 'r' handling to
7736 skip gomp and itm when r or nodefaultlibs is given.
7737 (DSYMUTIL_SPEC): Do not call dsymutil for '-r' link lines.
7738 Update ordering of exclusions, remove duplicate 'v' addition
7739 (collect2 will add this from the main command line).
7741 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7743 * config/darwin-driver.c (darwin_driver_init): Revise comments, handle
7744 filelist and framework options in specs instead of code.
7745 * config/darwin.h (SUBTARGET_DRIVER_SELF_SPECS): Update to handle link
7746 specs that are really driver ones.
7747 (DARWIN_CC1_SPEC): Likewise.
7748 (CPP_SPEC): Likewise.
7749 (SYSROOT_SPEC): Append space.
7750 (LINK_SYSROOT_SPEC): Remove most driver link specs.
7751 (STANDARD_STARTFILE_PREFIX_2): Update link-related specs.
7752 (STARTFILE_SPEC): Likewise.
7753 (ASM_MMACOSX_VERSION_MIN_SPEC): Fix line wrap.
7754 (ASM_SPEC): Update driver-related specs.
7755 (ASM_FINAL_SPEC): Likewise.
7756 * config/darwin.opt: Remove now unused option aliases.
7757 * config/i386/darwin.h (EXTRA_ASM_OPTS): Ensure space after opt.
7758 (ASM_SPEC): Update driver-related specs.
7760 2021-10-15 Roger Sayle <roger@nextmovesoftware.com>
7762 * config/i386/i386.c (ix86_hardreg_mov_ok): For vector modes,
7763 allow standard_sse_constant_p immediate constants.
7765 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7767 * config.gcc: Add tm-dwarf2.h to tm_d-file.
7769 2021-10-15 Andrew MacLeod <amacleod@redhat.com>
7771 * gimple-range-fold.h (gimple_range_ssa_p): Don't process names
7772 that occur in abnormal phis.
7773 * gimple-range.cc (gimple_ranger::range_on_edge): Return false for
7774 abnormal and EH edges.
7775 * gimple-ssa-evrp.c (rvrp_folder::value_of_expr): Ditto.
7776 (rvrp_folder::value_on_edge): Ditto.
7777 (rvrp_folder::value_of_stmt): Ditto.
7778 (hybrid_folder::value_of_expr): Ditto for ranger queries.
7779 (hybrid_folder::value_on_edge): Ditto.
7780 (hybrid_folder::value_of_stmt): Ditto.
7781 * value-query.cc (gimple_range_global): Always return a range if
7782 the type is supported.
7784 2021-10-15 John David Anglin <danglin@gcc.gnu.org>
7786 * config/pa/pa.md: Consistently use "rG" constraint for copy
7787 instruction in move patterns.
7789 2021-10-15 Andrew Stubbs <ams@codesourcery.com>
7791 * config/gcn/gcn-hsa.h (S_FIJI): Set unconditionally.
7794 * config/gcn/gcn.c: Hard code SRAM ECC settings for old architectures.
7795 * config/gcn/mkoffload.c (ELFABIVERSION_AMDGPU_HSA): Rename to ...
7796 (ELFABIVERSION_AMDGPU_HSA_V3): ... this.
7797 (ELFABIVERSION_AMDGPU_HSA_V4): New.
7798 (SET_SRAM_ECC_UNSUPPORTED): New.
7799 (copy_early_debug_info): Create elf flags to match the other objects.
7800 (main): Just let the attribute flags pass through.
7802 2021-10-15 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
7804 * tree-loop-distribution.c (reduction_var_overflows_first):
7805 Pass the type of reduction_var as first argument as it is also
7806 done for the load type.
7807 (loop_distribution::transform_reduction_loop): Add missing
7808 TREE_TYPE while determining precission of reduction_var.
7810 2021-10-15 Richard Biener <rguenther@suse.de>
7812 * defaults.h (PREFERRED_DEBUGGING_TYPE): Choose DWARF2_DEBUG
7814 * toplev.c (process_options): Warn when STABS debugging is
7815 enabled but not the preferred format.
7816 * config/pa/som.h (PREFERRED_DEBUGGING_TYPE): Define to
7818 * config/pdp11/pdp11.h (PREFERRED_DEBUGGING_TYPE): Likewise.
7820 2021-10-15 Richard Biener <rguenther@suse.de>
7823 * tree-inline.c (copy_bb): Avoid underflowing nargs.
7825 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
7827 * config/i386/i386-expand.c (ix86_expand_vec_perm): Convert
7828 HFmode input operand to HImode.
7829 (ix86_vectorize_vec_perm_const): Likewise.
7830 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1_hf):
7832 (*avx512f_permvar_truncv8siv8hi_1_hf):
7835 2021-10-15 Richard Biener <rguenther@suse.de>
7837 PR middle-end/102682
7838 * expmed.c (store_bit_field_1): Ensure a LHS subreg would
7839 not create a paradoxical subreg.
7841 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
7843 * config/i386/i386-expand.c (ix86_expand_vector_init):
7844 For half_vector concat for HFmode, handle them like HImode.
7846 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7848 * tree-ssa-threadbackward.c (class back_threader): Add m_resolve.
7849 (back_threader::back_threader): Same.
7850 (back_threader::resolve_phi): Try to solve without looking back if
7852 (back_threader::find_paths_to_names): Same.
7853 (try_thread_blocks): Pass resolve argument to back threader.
7854 (pass_early_thread_jumps::execute): Same.
7856 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7858 * doc/invoke.texi: Remove max-fsm-thread-length,
7859 max-fsm-thread-paths, and fsm-maximum-phi-arguments.
7861 * tree-ssa-threadbackward.c (back_threader::back_threader): Remove
7863 (back_threader_registry::back_threader_registry): Same.
7864 (back_threader_profitability::profitable_path_p): Remove
7865 param_max_fsm_thread-length.
7866 (back_threader_registry::register_path): Remove
7867 m_max_allowable_paths.
7869 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7871 * tree-ssa-threadbackward.c (class back_threader): Make m_imports
7873 (back_threader::~back_threader): Do not release m_path.
7875 2021-10-14 Andrew MacLeod <amacleod@redhat.com>
7877 PR tree-optimization/102738
7878 * vr-values.c (simplify_using_ranges::simplify): Handle RSHIFT_EXPR.
7880 2021-10-14 Kwok Cheung Yeung <kcy@codesourcery.com>
7882 * omp-general.c (omp_check_context_selector): Move from c-omp.c.
7883 (omp_mark_declare_variant): Move from c-omp.c.
7884 (omp_context_name_list_prop): Update for Fortran strings.
7885 * omp-general.h (omp_check_context_selector): New prototype.
7886 (omp_mark_declare_variant): New prototype.
7888 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7890 * config/rs6000/rs6000.c (rs6000_density_test): Move early
7891 exit test further up the function.
7893 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7895 * config/arm/arm.c (arm_add_stmt_cost): Delete.
7896 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
7898 2021-10-14 Martin Jambor <mjambor@suse.cz>
7900 * doc/invoke.texi (Optimize Options): Add entry for
7901 ipa-cp-recursive-freq-factor.
7903 2021-10-14 Tamar Christina <tamar.christina@arm.com>
7905 * match.pd: New rule.
7907 2021-10-14 Jan Hubicka <hubicka@ucw.cz>
7910 * ipa-modref-tree.h (modref_access_node::update2):
7911 Also check that parm_offset is unchanged.
7912 (modref_ref_node::insert_access): Fix updating of
7915 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7917 * tree-ssa-threadbackward.c (back_threader::resolve_phi): Add
7920 2021-10-14 Richard Biener <rguenther@suse.de>
7922 PR tree-optimization/102659
7923 * tree-if-conv.c (if_convertible_gimple_assign_stmt_p): Also
7924 rewrite pointer typed undefined overflow operations.
7925 (predicate_statements): Likewise. Make sure to emit invariant
7926 conversions in the preheader.
7927 * tree-vectorizer.c (vect_loop_vectorized_call): Look through
7928 non-empty preheaders.
7929 * tree-data-ref.c (dr_analyze_indices): Strip useless
7930 conversions to the MEM_REF base type.
7932 2021-10-14 Martin Liska <mliska@suse.cz>
7934 * common.opt: Stop using AUTODETECT_VALUE
7935 and use EnabledBy where possible.
7936 * opts.c: Enable OPT_fvar_tracking with optimize >= 1.
7937 * toplev.c (AUTODETECT_VALUE): Remove macro.
7938 (process_options): Simplify by using EnabledBy and
7939 OPT_fvar_tracking. Use OPTION_SET_P macro instead of
7942 2021-10-14 Jonathan Wright <jonathan.wright@arm.com>
7944 * config/aarch64/arm_neon.h (vld1_s8_x3): Use signed type for
7946 (vld1_s32_x3): Likewise.
7948 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7950 PR tree-optimization/102736
7951 PR tree-optimization/102736
7952 * gimple-range-path.cc (path_range_query::range_on_path_entry):
7953 Assert that the requested range is defined outside the path.
7954 (path_range_query::ssa_range_in_phi): Do not call
7955 range_on_path_entry for SSA names that are defined within the
7958 2021-10-14 Iain Sandoe <iain@sandoe.co.uk>
7960 * config/darwin-driver.c (darwin_find_version_from_kernel):
7961 Quote internal identifiers and avoid contractions in
7963 (darwin_default_min_version): Likewise.
7964 (darwin_driver_init): Likewise.
7966 2021-10-14 Martin Jambor <mjambor@suse.cz>
7968 * params.opt (ipa-cp-recursive-freq-factor): New.
7969 * ipa-cp.c (ipcp_value): Switch to inline initialization. New members
7970 scc_no, self_recursion_generated_level, same_scc and
7971 self_recursion_generated_p.
7972 (ipcp_lattice::add_value): Replaced parameter unlimited with
7973 same_lat_gen_level, usit it determine limit of values and store it to
7975 (ipcp_lattice<valtype>::print): Dump the new fileds.
7976 (allocate_and_init_ipcp_value): Take same_lat_gen_level as a new
7977 parameter and store it to the new value.
7978 (self_recursively_generated_p): Removed.
7979 (propagate_vals_across_arith_jfunc): Use self_recursion_generated_p
7980 instead of self_recursively_generated_p, store self generation level
7982 (value_topo_info<valtype>::add_val): Set scc_no.
7983 (value_topo_info<valtype>::propagate_effects): Multiply frequencies of
7984 recursively feeding values and self generated values by appropriate
7987 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7989 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Remove
7990 redundant test for flag_vect_cost_model.
7992 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7994 * bitmap.c (debug): New overloaded function for auto_bitmaps.
7995 * bitmap.h (debug): Same.
7997 2021-10-14 Hongyu Wang <hongyu.wang@intel.com>
7999 * config/i386/avx512fp16intrin.h (_mm512_mask_fcmadd_pch):
8000 Adjust builtin call.
8001 (_mm512_mask3_fcmadd_pch): Likewise.
8002 (_mm512_mask_fmadd_pch): Likewise
8003 (_mm512_mask3_fmadd_pch): Likewise
8004 (_mm512_mask_fcmadd_round_pch): Likewise
8005 (_mm512_mask3_fcmadd_round_pch): Likewise
8006 (_mm512_mask_fmadd_round_pch): Likewise
8007 (_mm512_mask3_fmadd_round_pch): Likewise
8008 (_mm_mask_fcmadd_sch): Likewise
8009 (_mm_mask3_fcmadd_sch): Likewise
8010 (_mm_mask_fmadd_sch): Likewise
8011 (_mm_mask3_fmadd_sch): Likewise
8012 (_mm_mask_fcmadd_round_sch): Likewise
8013 (_mm_mask3_fcmadd_round_sch): Likewise
8014 (_mm_mask_fmadd_round_sch): Likewise
8015 (_mm_mask3_fmadd_round_sch): Likewise
8016 (_mm_fcmadd_round_sch): Likewise
8017 * config/i386/avx512fp16vlintrin.h (_mm_mask_fmadd_pch):
8018 Adjust builtin call.
8019 (_mm_mask3_fmadd_pch): Likewise
8020 (_mm256_mask_fmadd_pch): Likewise
8021 (_mm256_mask3_fmadd_pch): Likewise
8022 (_mm_mask_fcmadd_pch): Likewise
8023 (_mm_mask3_fcmadd_pch): Likewise
8024 (_mm256_mask_fcmadd_pch): Likewise
8025 (_mm256_mask3_fcmadd_pch): Likewise
8026 * config/i386/i386-builtin.def: Add mask3 builtin for complex
8027 fma, and adjust mask_builtin to corresponding expander.
8028 * config/i386/i386-expand.c (ix86_expand_round_builtin):
8029 Skip eraseing embedded rounding for expanders that emits
8031 * config/i386/sse.md (complexmove): New mode_attr.
8032 (<avx512>_fmaddc_<mode>_mask1<round_expand_name>): New expander.
8033 (<avx512>_fcmaddc_<mode>_mask1<round_expand_name>): Likewise.
8034 (avx512fp16_fmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
8035 (avx512fp16_fcmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
8036 (avx512fp16_fcmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
8037 (avx512fp16_fmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
8038 * config/i386/subst.md (round_embedded_complex): New subst.
8040 2021-10-14 John David Anglin <danglin@gcc.gnu.org>
8042 * config/pa/pa.md (cbranchsf4): Disable if TARGET_SOFT_FLOAT.
8043 (cbranchdf4): Likewise.
8044 Add missing move patterns for TARGET_SOFT_FLOAT.
8046 2021-10-13 Roger Sayle <roger@nextmovesoftware.com>
8048 * config/i386/i386-expand.c (ix86_expand_vector_move): Use a
8049 pseudo intermediate when moving a SUBREG into a hard register,
8050 by checking ix86_hardreg_mov_ok.
8051 (ix86_expand_vector_extract): Store zero-extended SImode
8052 intermediate in a pseudo, then set target using a SUBREG_PROMOTED
8054 * config/i386/sse.md (mov<VMOVE>_internal): Prevent CSE creating
8055 complex (SUBREG) sets of (vector) hard registers before reload, by
8056 checking ix86_hardreg_mov_ok.
8058 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
8060 * ctfc.h (enum ctf_dtu_d_union_enum): Remove redundant comma.
8062 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
8064 * dwarf2ctf.c (gen_ctf_array_type): Fix typo in comment.
8066 2021-10-13 Martin Sebor <msebor@redhat.com>
8068 PR middle-end/102630
8069 * pointer-query.cc (compute_objsize_r): Handle named address spaces.
8071 2021-10-13 Iain Sandoe <iain@sandoe.co.uk>
8073 * collect2.c (is_lto_object_file): Release simple-object
8074 resources, close files.
8076 2021-10-13 Andre Vieira <andre.simoesdiasvieira@arm.com>
8078 * config/arm/arm.c (thumb2_legitimate_address_p): Use VALID_MVE_MODE
8079 when checking mve addressing modes.
8080 (mve_vector_mem_operand): Fix the way we handle pre, post and offset
8082 (arm_print_operand): Fix printing of POST_ and PRE_MODIFY.
8083 * config/arm/mve.md: Use mve_memory_operand predicate everywhere where
8084 there is a single Ux constraint.
8086 2021-10-13 John David Anglin <danglin@gcc.gnu.org>
8088 * config/pa/pa.md (muldi3): Add support for inlining 64-bit
8089 multiplication on 32-bit PA 1.1 and 2.0 targets.
8091 2021-10-13 Martin Liska <mliska@suse.cz>
8093 PR gcov-profile/90364
8094 * coverage.c (build_info): Emit checksum to the global variable.
8095 (build_info_type): Add new field for checksum.
8096 (coverage_obj_finish): Pass object_checksum.
8097 (coverage_init): Use 0 as checksum for .gcno files.
8098 * gcov-dump.c (dump_gcov_file): Dump also new checksum field.
8099 * gcov.c (read_graph_file): Read also checksum.
8100 * doc/invoke.texi: Document the behaviour change.
8102 2021-10-13 Richard Biener <rguenther@suse.de>
8104 * gimple-iterator.h (gsi_iterator_update): Add GSI_LAST_NEW_STMT,
8105 start at integer value 2.
8106 * gimple-iterator.c (gsi_insert_seq_nodes_before): Update
8107 the iterator for GSI_LAST_NEW_STMT.
8108 (gsi_insert_seq_nodes_after): Likewise.
8109 * tree-if-conv.c (predicate_statements): Use GSI_LAST_NEW_STMT.
8110 * tree-ssa.c (execute_update_addresses_taken): Correct bogus
8111 arguments to gsi_replace.
8113 2021-10-13 Martin Liska <mliska@suse.cz>
8116 * common.opt: Use EnabledBy instead of detection in
8117 finish_options and process_options.
8118 * opts.c (finish_options): Remove handling of
8119 x_flag_unroll_all_loops.
8120 * toplev.c (process_options): Likewise for flag_web and
8121 flag_rename_registers.
8123 2021-10-13 Richard Biener <rguenther@suse.de>
8125 PR tree-optimization/102659
8126 * tree-if-conv.c (need_to_rewrite_undefined): New flag.
8127 (if_convertible_gimple_assign_stmt_p): Mark the loop for
8128 rewrite when stmts with undefined behavior on integer
8130 (combine_blocks): Predicate also when we need to rewrite stmts.
8131 (predicate_statements): Rewrite affected stmts to something
8132 with well-defined behavior on overflow.
8133 (tree_if_conversion): Initialize need_to_rewrite_undefined.
8135 2021-10-13 Richard Biener <rguenther@suse.de>
8138 * ipa-sra.c (ptr_parm_has_nonarg_uses): Fix volatileness
8141 2021-10-13 Jose E. Marchesi <jose.marchesi@oracle.com>
8143 * dwarf2ctf.c: Fix typo in comment.
8145 2021-10-12 Bill Schmidt <wschmidt@linux.ibm.com>
8148 * config/rs6000/altivec.h (vec_cpsgn): Swap operand order.
8149 * config/rs6000/rs6000-overload.def (VEC_COPYSIGN): Use SKIP to
8150 avoid generating an automatic #define of vec_cpsgn. Use the
8151 correct built-in for V4SFmode that doesn't depend on VSX.
8153 2021-10-12 Uroš Bizjak <ubizjak@gmail.com>
8157 * config/i386/i386.md (*add<mode>_1_slp): Rewrite as
8158 define_insn_and_split pattern. Add alternative 1 and split it
8159 post reload to insert operand 1 into the low part of operand 0.
8160 (*sub<mode>_1_slp): Ditto.
8161 (*and<mode>_1_slp): Ditto.
8162 (*<any_or:code><mode>_1_slp): Ditto.
8163 (*ashl<mode>3_1_slp): Ditto.
8164 (*<any_shiftrt:insn><mode>3_1_slp): Ditto.
8165 (*<any_rotate:insn><mode>3_1_slp): Ditto.
8166 (*neg<mode>_1_slp): New insn_and_split pattern.
8167 (*one_cmpl<mode>_1_slp): Ditto.
8169 2021-10-12 David Edelsohn <dje.gcc@gmail.com>
8171 * doc/install.texi: Update MinGW and mingw-64 Binaries
8174 2021-10-12 Daniel Le Duc Khoi Nguyen <greenrecyclebin@gmail.com>
8176 * doc/extend.texi (Common Variable Attributes): Fix typos in
8177 alloc_size documentation.
8179 2021-10-12 Richard Biener <rguenther@suse.de>
8181 PR tree-optimization/102696
8182 * tree-vect-slp.c (vect_build_slp_tree_2): Properly mark
8183 the tree fatally failed when we reject a BIT_FIELD_REF.
8185 2021-10-12 Richard Biener <rguenther@suse.de>
8187 PR tree-optimization/102572
8188 * tree-vect-stmts.c (vect_build_gather_load_calls): When
8189 gathering the vectorized defs for the mask pass in the
8190 desired mask vector type so invariants will be handled
8193 2021-10-12 Tamar Christina <tamar.christina@arm.com>
8195 * config/aarch64/aarch64-sve.md (*fcm<cmp_op><mode>_bic_combine,
8196 *fcm<cmp_op><mode>_nor_combine, *fcmuo<mode>_bic_combine,
8197 *fcmuo<mode>_nor_combine): New.
8199 2021-10-12 Eric Botcazou <ebotcazou@adacore.com>
8202 * config/sparc/sparc-modes.def (OI): New integer mode.
8204 2021-10-12 Jakub Jelinek <jakub@redhat.com>
8206 * gimple-fold.h (clear_padding_type_may_have_padding_p): Declare.
8207 * gimple-fold.c (clear_padding_type_may_have_padding_p): No longer
8210 2021-10-12 Jakub Jelinek <jakub@redhat.com>
8212 * tree-vectorizer.h (loop_cost_model): New function.
8213 (unlimited_cost_model): Use it.
8214 * tree-vect-loop.c (vect_analyze_loop_costing): Use loop_cost_model
8215 call instead of flag_vect_cost_model.
8216 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): Likewise.
8217 (vect_prune_runtime_alias_test_list): Likewise. Also use it instead
8218 of flag_simd_cost_model.
8220 2021-10-12 liuhongt <hongtao.liu@intel.com>
8223 * config/i386/i386-expand.c (emit_reduc_half): Handle
8225 * config/i386/mmx.md (reduc_<code>_scal_v4qi): New expander.
8226 (reduc_plus_scal_v4qi): Ditto.
8228 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8230 * config/rs6000/smmintrin.h (_mm_cmpeq_epi64, _mm_cmpgt_epi64,
8231 _mm_mullo_epi32, _mm_mul_epi32, _mm_packus_epi32): New.
8232 * config/rs6000/nmmintrin.h: Copy from i386, tweak to suit.
8234 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8236 * config/rs6000/smmintrin.h (_mm_cvtepi8_epi16, _mm_cvtepi8_epi32,
8237 _mm_cvtepi8_epi64, _mm_cvtepi16_epi32, _mm_cvtepi16_epi64,
8238 _mm_cvtepi32_epi64, _mm_cvtepu8_epi16, _mm_cvtepu8_epi32,
8239 _mm_cvtepu8_epi64, _mm_cvtepu16_epi32, _mm_cvtepu16_epi64,
8240 _mm_cvtepu32_epi64): New.
8242 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8244 * config/rs6000/smmintrin.h (_mm_test_all_zeros,
8245 _mm_test_all_ones, _mm_test_mix_ones_zeros): Rewrite as macro.
8247 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8249 * config/rs6000/smmintrin.h (_mm_min_epi8, _mm_min_epu16,
8250 _mm_min_epi32, _mm_min_epu32, _mm_max_epi8, _mm_max_epu16,
8251 _mm_max_epi32, _mm_max_epu32): New.
8253 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
8255 * ipa-modref-tree.h (struct modref_access_node): Revert
8257 (struct modref_ref_node): Likewise.
8259 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
8261 * ipa-modref-tree.h (modref_tree::global_access_p): New member
8264 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
8265 ignore_stores_eaf_flags): Move to ipa-modref.h
8266 (remove_useless_eaf_flags): Remove early exit on NOCLOBBER.
8267 (modref_summary::global_memory_read_p): New member function.
8268 (modref_summary::global_memory_written_p): New member function.
8269 * ipa-modref.h (modref_summary::global_memory_read_p,
8270 modref_summary::global_memory_written_p): Declare.
8271 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
8272 ignore_stores_eaf_flags): move here.
8273 * tree-ssa-structalias.c: Include ipa-modref-tree.h, ipa-modref.h
8275 (handle_rhs_call): Rewrite.
8276 (handle_call_arg): New function.
8277 (determine_global_memory_access): New function.
8278 (handle_const_call): Remove
8279 (handle_pure_call): Remove
8280 (find_func_aliases_for_call): Update use of handle_rhs_call.
8281 (compute_points_to_sets): Handle global memory acccesses
8284 2021-10-11 Diane Meirowitz <diane.meirowitz@oracle.com>
8286 * doc/invoke.texi: Add link to UndefinedBehaviorSanitizer
8287 documentation, mention UBSAN_OPTIONS, similar to what is done
8288 for AddressSanitizer.
8290 2021-10-11 Richard Biener <rguenther@suse.de>
8292 PR middle-end/102683
8293 * internal-fn.c (expand_DEFERRED_INIT): Check for mode
8294 availability before building an integer type for storage
8297 2021-10-11 Richard Biener <rguenther@suse.de>
8299 PR middle-end/101480
8300 * gimple.c (gimple_call_fnspec): Do not mark operator new/delete
8303 2021-10-11 Martin Liska <mliska@suse.cz>
8305 * common.opt: Remove Init(2) for some options.
8306 * toplev.c (process_options): Do not use AUTODETECT_VALUE, but
8307 use rather OPTION_SET_P.
8309 2021-10-11 Martin Liska <mliska@suse.cz>
8311 * common.opt: Remove usage of IRA_REGION_AUTODETECT.
8312 * flag-types.h (enum ira_region): Likewise.
8313 * toplev.c (process_options): Use OPTION_SET_P instead of
8314 IRA_REGION_AUTODETECT.
8316 2021-10-11 Jakub Jelinek <jakub@redhat.com>
8318 * omp-low.c (omp_runtime_api_call): Handle omp_get_max_teams,
8319 omp_[sg]et_teams_thread_limit and omp_set_num_teams.
8321 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8323 * config/s390/s390-protos.h (s390_rawmemchr): Add prototype.
8324 * config/s390/s390.c (s390_rawmemchr): New function.
8325 * config/s390/s390.md (rawmemchr<SINT:mode>): New expander.
8326 * config/s390/vector.md (@vec_vfees<mode>): Basically a copy of
8327 the pattern vfees<mode> from vx-builtins.md.
8328 * config/s390/vx-builtins.md (*vfees<mode>): Remove.
8330 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8332 * builtins.c (get_memory_rtx): Change to external linkage.
8333 * builtins.h (get_memory_rtx): Add function prototype.
8334 * doc/md.texi (rawmemchr<mode>): Document.
8335 * internal-fn.c (expand_RAWMEMCHR): Define.
8336 * internal-fn.def (RAWMEMCHR): Add.
8337 * optabs.def (rawmemchr_optab): Add.
8338 * tree-loop-distribution.c (find_single_drs): Change return code
8339 behaviour by also returning true if no single store was found
8341 (loop_distribution::classify_partition): Respect the new return
8342 code behaviour of function find_single_drs.
8343 (loop_distribution::execute): Call new function
8344 transform_reduction_loop in order to replace rawmemchr or strlen
8345 like loops by calls into builtins.
8346 (generate_reduction_builtin_1): New function.
8347 (generate_rawmemchr_builtin): New function.
8348 (generate_strlen_builtin_1): New function.
8349 (generate_strlen_builtin): New function.
8350 (generate_strlen_builtin_using_rawmemchr): New function.
8351 (reduction_var_overflows_first): New function.
8352 (determine_reduction_stmt_1): New function.
8353 (determine_reduction_stmt): New function.
8354 (loop_distribution::transform_reduction_loop): New function.
8356 2021-10-11 Martin Liska <mliska@suse.cz>
8358 * tree.c (cl_option_hasher::hash): Use cl_optimization_hash
8359 and remove legacy hashing code.
8361 2021-10-11 Kito Cheng <kito.cheng@sifive.com>
8364 * builtins.c (maybe_emit_call_builtin___clear_cache): Allow
8365 CONST_INT for BEGIN and END, and use gcc_assert rather than
8368 2021-10-10 Jakub Jelinek <jakub@redhat.com>
8371 * var-tracking.c (add_stores): For cselib_sp_derived_value_p values
8372 use MO_VAL_SET if loc is not sp.
8374 2021-10-10 Andrew Pinski <apinski@marvell.com>
8376 PR tree-optimization/102622
8377 * match.pd: Swap the order of a?pow2cst:0 and a?-1:0 transformations.
8378 Swap the order of a?0:pow2cst and a?0:-1 transformations.
8380 2021-10-09 liuhongt <hongtao.liu@intel.com>
8383 * config/i386/i386-expand.c (ix86_valid_mask_cmp_mode): Handle
8385 (ix86_use_mask_cmp_p): Ditto.
8386 (ix86_expand_sse_movcc): Ditto.
8387 * config/i386/i386.md (setcc_hf_mask): New define_insn.
8388 (movhf_mask): Ditto.
8389 (UNSPEC_MOVCC_MASK): New unspec.
8390 * config/i386/sse.md (UNSPEC_PCMP): Move to i386.md.
8392 2021-10-08 Vladimir N. Makarov <vmakarov@redhat.com>
8394 PR rtl-optimization/102627
8395 * lra-constraints.c (split_reg): Use at least natural mode of hard reg.
8397 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
8399 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Grow
8402 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
8404 * value-range.cc (irange::debug): New.
8405 * value-range.h (irange::debug): New.
8407 2021-10-08 Richard Sandiford <richard.sandiford@arm.com>
8409 PR tree-optimization/102385
8410 * predict.h (change_edge_frequency): Declare.
8411 * predict.c (change_edge_frequency): New function.
8412 * tree-ssa-loop-manip.h (tree_transform_and_unroll_loop): Remove
8414 (tree_unroll_loop): Likewise.
8415 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Update accordingly.
8416 * tree-predcom.c (pcom_worker::tree_predictive_commoning_loop):
8418 * tree-ssa-loop-prefetch.c (loop_prefetch_arrays): Likewise.
8419 * tree-ssa-loop-manip.c (tree_unroll_loop): Likewise.
8420 (tree_transform_and_unroll_loop): Likewise. Use single_dom_exit
8421 to retrieve the exit edges. Make all the old profile update code
8422 conditional on !single_loop_p -- the case it was written for --
8423 and use a different approach for the single-loop case.
8425 2021-10-08 Martin Liska <mliska@suse.cz>
8427 * config/alpha/alpha.c (alpha_option_override): Use new macro
8429 * config/arc/arc.c (arc_override_options): Likewise.
8430 * config/arm/arm.c (arm_option_override): Likewise.
8431 * config/bfin/bfin.c (bfin_load_pic_reg): Likewise.
8432 * config/c6x/c6x.c (c6x_option_override): Likewise.
8433 * config/csky/csky.c: Likewise.
8434 * config/darwin.c (darwin_override_options): Likewise.
8435 * config/frv/frv.c (frv_option_override): Likewise.
8436 * config/i386/djgpp.h: Likewise.
8437 * config/i386/i386.c (ix86_stack_protect_guard): Likewise.
8438 (ix86_max_noce_ifcvt_seq_cost): Likewise.
8439 * config/ia64/ia64.c (ia64_option_override): Likewise.
8440 (ia64_override_options_after_change): Likewise.
8441 * config/m32c/m32c.c (m32c_option_override): Likewise.
8442 * config/m32r/m32r.c (m32r_init): Likewise.
8443 * config/m68k/m68k.c (m68k_option_override): Likewise.
8444 * config/microblaze/microblaze.c (microblaze_option_override): Likewise.
8445 * config/mips/mips.c (mips_option_override): Likewise.
8446 * config/nios2/nios2.c (nios2_option_override): Likewise.
8447 * config/nvptx/nvptx.c (nvptx_option_override): Likewise.
8448 * config/pa/pa.c (pa_option_override): Likewise.
8449 * config/riscv/riscv.c (riscv_option_override): Likewise.
8450 * config/rs6000/aix71.h: Likewise.
8451 * config/rs6000/aix72.h: Likewise.
8452 * config/rs6000/aix73.h: Likewise.
8453 * config/rs6000/rs6000.c (darwin_rs6000_override_options): Likewise.
8454 (rs6000_override_options_after_change): Likewise.
8455 (rs6000_linux64_override_options): Likewise.
8456 (glibc_supports_ieee_128bit): Likewise.
8457 (rs6000_option_override_internal): Likewise.
8458 (rs6000_file_start): Likewise.
8459 (rs6000_darwin_file_start): Likewise.
8460 * config/rs6000/rtems.h: Likewise.
8461 * config/rs6000/sysv4.h: Likewise.
8462 * config/rs6000/vxworks.h (SUB3TARGET_OVERRIDE_OPTIONS): Likewise.
8463 * config/s390/s390.c (s390_option_override): Likewise.
8464 * config/sh/linux.h: Likewise.
8465 * config/sh/netbsd-elf.h (while): Likewise.
8466 * config/sh/sh.c (sh_option_override): Likewise.
8467 * config/sol2.c (solaris_override_options): Likewise.
8468 * config/sparc/sparc.c (sparc_option_override): Likewise.
8469 * config/tilegx/tilegx.c (tilegx_option_override): Likewise.
8470 * config/visium/visium.c (visium_option_override): Likewise.
8471 * config/vxworks.c (vxworks_override_options): Likewise.
8472 * lto-opts.c (lto_write_options): Likewise.
8473 * omp-expand.c (expand_omp_simd): Likewise.
8474 * omp-general.c (omp_max_vf): Likewise.
8475 * omp-offload.c (oacc_xform_loop): Likewise.
8476 * opts.h (OPTION_SET_P): Likewise.
8477 * targhooks.c (default_max_noce_ifcvt_seq_cost): Likewise.
8478 * toplev.c (process_options): Likewise.
8479 * tree-predcom.c: Likewise.
8480 * tree-sra.c (analyze_all_variable_accesses): Likewise.
8482 2021-10-08 liuhongt <hongtao.liu@intel.com>
8485 * config/i386/i386.c (ix86_optab_supported_p):
8486 Return true for HFmode.
8487 * match.pd: Simplify (_Float16) ceil ((double) x) to
8488 __builtin_ceilf16 (a) when a is _Float16 type and
8489 direct_internal_fn_supported_p.
8491 2021-10-08 liuhongt <hongtao.liu@intel.com>
8494 * config/i386/i386-expand.c (emit_reduc_half): Hanlde V4HImode.
8495 * config/i386/mmx.md (reduc_plus_scal_v4hi): New.
8496 (reduc_<code>_scal_v4hi): New.
8498 2021-10-08 liuhongt <hongtao.liu@intel.com>
8500 * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize).
8501 * doc/invoke.texi (Options That Control Optimization): Update
8503 * opts.c (default_options_table): Enable auto-vectorization at
8504 O2 with very-cheap cost model.
8505 (finish_options): Use cheap cost model for
8506 explicit -ftree{,-loop}-vectorize.
8508 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
8510 * ctfc.c (ctfc_delete_container): Free hash table contents.
8512 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
8514 * toplev.c (process_options): Do not warn for GNU GIMPLE.
8516 2021-10-07 Siddhesh Poyarekar <siddhesh@gotplt.org>
8518 * tree-object-size.c (addr_object_size,
8519 compute_builtin_object_size): Drop PDECL and POFF arguments.
8520 (addr_object_size): Adjust calls.
8521 * tree-object-size.h (compute_builtin_object_size): Drop PDECL
8524 2021-10-07 Roger Sayle <roger@nextmovesoftware.com>
8526 * rtl.def (SMUL_HIGHPART, UMUL_HIGHPART): New RTX codes for
8527 representing signed and unsigned high-part multiplication resp.
8528 * simplify-rtx.c (simplify_binary_operation_1) [SMUL_HIGHPART,
8529 UMUL_HIGHPART]: Simplify high-part multiplications by zero.
8530 [SS_PLUS, US_PLUS, SS_MINUS, US_MINUS, SS_MULT, US_MULT,
8531 SS_DIV, US_DIV]: Similar simplifications for saturating
8533 (simplify_const_binary_operation) [SS_PLUS, US_PLUS, SS_MINUS,
8534 US_MINUS, SS_MULT, US_MULT, SMUL_HIGHPART, UMUL_HIGHPART]:
8535 Implement compile-time evaluation for constant operands.
8536 * dwarf2out.c (mem_loc_descriptor): Skip SMUL_HIGHPART and
8538 * doc/rtl.texi (smul_highpart, umul_highpart): Document RTX codes.
8539 * doc/md.texi (smul@var{m}3_highpart, umul@var{m3}_highpart):
8540 Mention the new smul_highpart and umul_highpart RTX codes.
8541 * doc/invoke.texi: Silence @xref "compilation" warnings.
8543 2021-10-07 Martin Jambor <mjambor@suse.cz>
8546 * ipa-prop.c (ipa_edge_args_sum_t::duplicate): Also handle the
8547 case when the source reference description corresponds to a
8548 referance taken in a function src->caller is inlined to.
8550 2021-10-07 Jan Hubicka <hubicka@ucw.cz>
8553 * ipa-modref-tree.h (modref_access_node::contains_p): Handle offsets
8555 (modref_access_node::try_merge_with): Add sanity check that there
8556 are no redundant entries in the list.
8558 2021-10-07 Richard Biener <rguenther@suse.de>
8560 PR tree-optimization/102608
8561 * tree-ssa-sccvn.c (visit_stmt): Drop .DEFERRED_INIT to
8564 2021-10-07 Martin Liska <mliska@suse.cz>
8566 * toplev.c (toplev::main): Make
8567 save_opt_decoded_options a pointer type
8568 * toplev.h: Likewise.
8570 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8572 * config/gcn/gcn-valu.md (gather<mode>_insn_2offsets<exec>): Apply
8573 HAVE_GCN_ASM_GLOBAL_LOAD_FIXED.
8574 (scatter<mode>_insn_2offsets<exec_scatter>): Likewise.
8576 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8578 * config/gcn/gcn-hsa.h (SRAMOPT): Include the whole option string.
8579 Adjust for new -msram-ecc=any behaviour.
8580 (ASM_SPEC): Adjust -mxnack and -msram-ecc usage.
8581 * config/gcn/gcn.c (output_file_start): Implement -msram-ecc=any.
8582 * config/gcn/mkoffload.c (EF_AMDGPU_XNACK): Rename to ...
8583 (EF_AMDGPU_XNACK_V3): ... this.
8584 (EF_AMDGPU_SRAM_ECC): Rename to ...
8585 (EF_AMDGPU_SRAM_ECC_V3): ... this.
8586 (EF_AMDGPU_FEATURE_XNACK_V4): New.
8587 (EF_AMDGPU_FEATURE_XNACK_UNSUPPORTED_V4): New.
8588 (EF_AMDGPU_FEATURE_XNACK_ANY_V4): New.
8589 (EF_AMDGPU_FEATURE_XNACK_OFF_V4): New.
8590 (EF_AMDGPU_FEATURE_XNACK_ON_V4): New.
8591 (EF_AMDGPU_FEATURE_SRAMECC_V4): New.
8592 (EF_AMDGPU_FEATURE_SRAMECC_UNSUPPORTED_V4): New.
8593 (EF_AMDGPU_FEATURE_SRAMECC_ANY_V4): New.
8594 (EF_AMDGPU_FEATURE_SRAMECC_OFF_V4): New.
8595 (EF_AMDGPU_FEATURE_SRAMECC_ON_V4): New.
8596 (SET_XNACK_ON): New.
8597 (SET_XNACK_OFF): New.
8599 (SET_SRAM_ECC_ON): New.
8600 (SET_SRAM_ECC_ANY): New.
8601 (SET_SRAM_ECC_OFF): New.
8602 (TEST_SRAM_ECC_ANY): New.
8603 (TEST_SRAM_ECC_ON): New.
8604 (main): Implement HSACOv4 and -msram-ecc=any.
8606 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8608 * config.in: Regenerate.
8609 * config/gcn/gcn-hsa.h (X_FIJI): New macro.
8613 (A_FIJI): Rename to ...
8615 (A_900): Rename to ...
8617 (A_906): Rename to ...
8619 (A_908): Rename to ...
8621 (SRAMOPT): New macro.
8622 (ASM_SPEC): Adjust xnack option usage.
8623 * config/gcn/gcn.c (output_file_start): Adjust amdgcn_target usage.
8624 * configure: Regenerate.
8625 * configure.ac: Detect LLVM assembler dialect.
8627 2021-10-07 Richard Biener <rguenther@suse.de>
8629 * tree-pretty-print.c (dump_generic_node): Do not elide
8630 printing '&' when dumping with -gimple.
8632 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8634 * gimple-range-cache.cc (non_null_ref::adjust_range): Call new
8636 * gimple-range-fold.cc (adjust_pointer_diff_expr): Ditto.
8637 (adjust_imagpart_expr): Ditto.
8638 * value-range.cc (irange::irange_intersect): Call new routine if
8639 RHS is a single pair.
8640 (irange::intersect): New wide_int version.
8641 * value-range.h (class irange): New prototype.
8643 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8645 * gimple-range-edge.cc (gimple_outgoing_range::gimple_outgoing_range):
8646 Add parameter to limit size when recognizing switches.
8647 (gimple_outgoing_range::edge_range_p): Check size limit.
8648 * gimple-range-edge.h (gimple_outgoing_range): Add size field.
8649 * gimple-range-gori.cc (gori_map::calculate_gori): Ignore switches
8650 that exceed the size limit.
8651 (gori_compute::gori_compute): Add initializer.
8652 * params.opt (evrp-switch-limit): New.
8653 * doc/invoke.texi: Update docs.
8655 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8657 * value-range.h (irange::set_varying): Use TYPE_MIN_VALUE and
8658 TYPE_MAX_VALUE instead of creating new trees when possible.
8660 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8662 * gimple-range-cache.cc (non_null_ref::adjust_range): Check for
8663 zero and non-zero more efficently.
8665 2021-10-06 Richard Biener <rguenther@suse.de>
8668 * dumpfile.h (TDF_GIMPLE_VAL): New.
8669 (dump_flag): Re-order and adjust TDF_* flags. Make
8670 the enum uint32_t. Use std::underlying_type in the
8672 (optgroup_flag): Likewise for the operator overloads.
8673 * tree-pretty-print.c (dump_generic_node): Wrap ADDR_EXPR
8674 in _Literal if TDF_GIMPLE_VAL.
8675 * gimple-pretty-print.c (dump_gimple_assign): Add
8676 TDF_GIMPLE_VAL to flags when dumping operands where only
8677 is_gimple_val are allowed.
8678 (dump_gimple_cond): Likewise.
8680 2021-10-06 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
8682 * gimple-isel.cc (gimple_expand_vec_cond_expr): Remove redundant if
8685 2021-10-05 qing zhao <qing.zhao@oracle.com>
8687 PR middle-end/102359
8688 * gimplify.c (gimplify_decl_expr): Not add initialization for an
8689 auto variable when it has been initialized by frontend.
8691 2021-10-05 Aldy Hernandez <aldyh@redhat.com>
8693 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
8696 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
8698 * common/config/avr/avr-common.c (avr_handle_option): Mark
8699 argument as ATTRIBUTE_UNUSED.
8701 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
8703 * config/lm32/uclinux-elf.h (LINK_GCC_C_SEQUENCE_SPEC):
8704 Undefine before redefinition.
8706 2021-10-05 Richard Biener <rguenther@suse.de>
8708 * toplev.c (no_backend): Remove global var.
8709 (process_options): Pass in no_backend, move post_options
8710 langhook call to toplev::main.
8711 (do_compile): Pass in no_backend, move process_options call
8713 (toplev::run_self_tests): Check no_backend at the caller.
8714 (toplev::main): Call post_options and process_options
8715 split out from do_compile, do self-tests only if
8716 no_backend is initialized.
8718 2021-10-05 Richard Biener <rguenther@suse.de>
8720 * tree-cfg.c (dump_function_to_file): Dump the UID of the
8721 function as part of the name when requested.
8722 * tree-pretty-print.c (dump_function_name): Dump the UID when
8723 requested and the langhook produced the actual name.
8725 2021-10-05 Richard Biener <rguenther@suse.de>
8727 PR middle-end/102587
8728 PR middle-end/102285
8729 * internal-fn.c (expand_DEFERRED_INIT): Fall back to
8730 zero-initialization as last resort, use the constant
8731 size as given by the DEFERRED_INIT argument to build
8734 2021-10-04 Marek Polacek <polacek@redhat.com>
8737 * doc/invoke.texi: Document -Warray-compare.
8739 2021-10-04 Richard Biener <rguenther@suse.de>
8741 * gimplify.c (is_var_need_auto_init): DECL_HARD_REGISTER
8742 variables are not to be initialized.
8744 2021-10-04 Richard Biener <rguenther@suse.de>
8746 * expr.h (non_mem_decl_p): Declare.
8747 (mem_ref_refers_to_non_mem_p): Likewise.
8748 * expr.c (non_mem_decl_p): Export.
8749 (mem_ref_refers_to_non_mem_p): Likewise.
8750 * internal-fn.c (expand_DEFERRED_INIT): Do not expand the LHS
8751 but check the base with mem_ref_refers_to_non_mem_p
8754 2021-10-04 Richard Biener <rguenther@suse.de>
8756 PR tree-optimization/102570
8757 * tree-ssa-sccvn.h (vn_reference_op_struct): Document
8758 we are using clique for the internal function code.
8759 * tree-ssa-sccvn.c (vn_reference_op_eq): Compare the
8760 internal function code.
8761 (print_vn_reference_ops): Print the internal function code.
8762 (vn_reference_op_compute_hash): Hash it.
8763 (copy_reference_ops_from_call): Record it.
8764 (visit_stmt): Remove the restriction around internal function
8766 (fully_constant_vn_reference_p): Use fold_const_call and handle
8768 (vn_reference_eq): Compare call return types.
8769 * tree-ssa-pre.c (create_expression_by_pieces): Handle
8770 generating calls to internal functions.
8771 (compute_avail): Remove the restriction around internal function
8774 2021-10-04 Aldy Hernandez <aldyh@redhat.com>
8776 PR tree-optimization/102560
8777 * gimple-ssa-warn-alloca.c (alloca_call_type): Remove static
8778 marker for invalid_range.
8780 2021-10-04 Richard Biener <rguenther@suse.de>
8782 PR middle-end/102587
8783 * internal-fn.c (expand_DEFERRED_INIT): Guard register
8784 initialization path an avoid initializing VLA registers
8787 2021-10-04 Eric Botcazou <ebotcazou@adacore.com>
8789 * config/rs6000/vxworks.h (TARGET_INIT_LIBFUNCS): Delete.
8791 2021-10-03 Martin Liska <mliska@suse.cz>
8793 * toplev.c (toplev::main): Check opt_index if it is a part
8796 2021-10-02 Aldy Hernandez <aldyh@redhat.com>
8798 PR tree-optimization/102563
8799 * range-op.cc (operator_lshift::op1_range): Do not clobber
8802 2021-10-02 Martin Liska <mliska@suse.cz>
8804 * toplev.c (toplev::main): save_decoded_options[0] is program
8805 name and so it should be skipped.
8807 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
8809 PR tree-optimization/102546
8810 * range-op.cc (operator_lshift::op1_range): Teach range-ops that
8811 X << Y is non-zero implies X is also non-zero.
8813 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8815 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8817 * config/aarch64/aarch64-tune.md: Regenerate.
8818 * doc/invoke.texi: Update docs.
8820 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8822 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8824 * config/aarch64/aarch64-tune.md: Regenerate.
8825 * doc/invoke.texi: Update docs.
8827 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8829 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8831 * config/aarch64/aarch64-tune.md: Regenerate.
8832 * doc/invoke.texi: Update docs.
8834 2021-10-01 Martin Sebor <msebor@redhat.com>
8837 * doc/invoke.texi (-Waddress): Update.
8838 * gengtype.c (write_types): Avoid -Waddress.
8839 * poly-int.h (POLY_SET_COEFF): Avoid using null.
8841 2021-10-01 John David Anglin <danglin@gcc.gnu.org>
8844 * config/pa/pa.c (pa_option_override): Default to dwarf version 4
8847 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8849 * config/aarch64/aarch64.h (AARCH64_FL_V9): Update value.
8851 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
8853 * gimple-range-path.cc (path_range_query::compute_ranges): Use
8855 * gimple-range-path.h (class path_range_query): Remove shadowed
8857 (path_range_query::get_path_oracle): New.
8859 2021-10-01 Jakub Jelinek <jakub@redhat.com>
8860 Richard Biener <rguenther@suse.de>
8863 * doc/invoke.texi (-fsanitize=integer-divide-by-zero): Remove
8864 INT_MIN / -1 division detection from here ...
8865 (-fsanitize=signed-integer-overflow): ... and add it here.
8867 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8869 * config/aarch64/aarch64-arches.def (AARCH64_ARCH): Added
8871 * config/aarch64/aarch64.h (AARCH64_FL_V9): New.
8872 (AARCH64_FL_FOR_ARCH9): New flags for Armv9-A.
8873 (AARCH64_ISA_V9): New ISA flag.
8874 * doc/invoke.texi: Update docs.
8876 2021-10-01 Martin Liska <mliska@suse.cz>
8878 * toplev.c (toplev::main): Save decoded optimization options.
8879 * toplev.h (save_opt_decoded_options): New.
8880 * doc/extend.texi: Be more clear about optimize and target
8883 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
8885 * explow.c: Include langhooks.h.
8886 (set_stack_check_libfunc): Build a proper function type.
8888 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
8891 * config/i386/i386.c (legitimate_pic_address_disp_p): For PE-COFF do
8892 not return true for external weak function symbols in medium model.
8894 2021-10-01 Jakub Jelinek <jakub@redhat.com>
8896 * tree.h (OMP_CLAUSE_ORDER_REPRODUCIBLE): Define.
8897 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_ORDER>: Print
8898 reproducible: for OMP_CLAUSE_ORDER_REPRODUCIBLE.
8899 * omp-general.c (omp_extract_for_data): If OMP_CLAUSE_ORDER is seen
8900 without OMP_CLAUSE_ORDER_UNCONSTRAINED, overwrite sched_kind to
8901 OMP_CLAUSE_SCHEDULE_STATIC.
8903 2021-10-01 Richard Biener <rguenther@suse.de>
8905 PR middle-end/102518
8906 * tree-inline.c (setup_one_parameter): Avoid substituting
8907 an invariant into contexts where a GIMPLE register is not valid.
8909 2021-09-30 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8911 * config/arm/arm-cpus.in: Add Cortex-R52+ CPU.
8912 * config/arm/arm-tables.opt: Regenerate.
8913 * config/arm/arm-tune.md: Regenerate.
8914 * doc/invoke.texi: Update docs.
8916 2021-09-30 Uroš Bizjak <ubizjak@gmail.com>
8919 * config/i386/i386.md
8920 (sign_extend:WIDE (any_logic:NARROW (memory, immediate)) splitters):
8923 2021-09-30 Tobias Burnus <tobias@codesourcery.com>
8925 * omp-low.c (omp_runtime_api_call): Add omp_aligned_{,c}alloc and
8926 omp_{c,re}alloc, fix omp_alloc/omp_free.
8928 2021-09-30 Martin Liska <mliska@suse.cz>
8930 * defaults.h (ASM_OUTPUT_ASCII): Do not hide global variable
8931 asm_out_file and stream directly to MYFILE.
8933 2021-09-30 Richard Biener <rguenther@suse.de>
8935 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
8936 Restore and fix condition under which we apply npeel to
8937 the DRs misalignment value.
8939 2021-09-30 Richard Biener <rguenther@suse.de>
8941 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
8942 Fix npeel check for variable amount of peeling.
8944 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8946 * lto-wrapper.c (run_gcc): Plug snprintf overflow.
8948 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8950 * gimple-range.cc (gimple_ranger::debug): New.
8951 * gimple-range.h (class gimple_ranger): Add debug.
8953 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8955 PR middle-end/102519
8956 * tree-vrp.c (hybrid_threader::~hybrid_threader): Free m_query.
8958 2021-09-29 Indu Bhagat <indu.bhagat@oracle.com>
8961 * btfout.c (GTY): Add GTY (()) albeit for cosmetic only purpose.
8962 (btf_finalize): Empty the hash_map btf_var_ids.
8964 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
8966 * tree-vrp.c (thread_through_all_blocks): Return bool.
8967 (execute_vrp_threader): Return TODO_* flags.
8968 (pass_data_vrp_threader): Set todo_flags_finish to 0.
8970 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
8972 * timevar.def (TV_TREE_VRP_THREADER): New.
8973 * tree-vrp.c: Use TV_TREE_VRP_THREADER for VRP threader pass.
8975 2021-09-29 David Faust <david.faust@oracle.com>
8977 * config.gcc (bpf-*-*): Do not overwrite extra_headers.
8979 2021-09-29 Jonathan Wright <jonathan.wright@arm.com>
8981 * config/aarch64/aarch64-builtins.c (TYPES_BINOP_PPU): Define
8982 new type qualifier enum.
8983 (TYPES_TERNOP_SSSU): Likewise.
8984 (TYPES_TERNOP_PPPU): Likewise.
8985 * config/aarch64/aarch64-simd-builtins.def: Define PPU, SSU,
8986 PPPU and SSSU builtin generator macros for qtbl1 and qtbx1
8988 * config/aarch64/arm_neon.h (vqtbl1_p8): Use type-qualified
8989 builtin and remove casts.
8990 (vqtbl1_s8): Likewise.
8991 (vqtbl1q_p8): Likewise.
8992 (vqtbl1q_s8): Likewise.
8993 (vqtbx1_s8): Likewise.
8994 (vqtbx1_p8): Likewise.
8995 (vqtbx1q_s8): Likewise.
8996 (vqtbx1q_p8): Likewise.
8997 (vtbl1_p8): Likewise.
8998 (vtbl2_p8): Likewise.
8999 (vtbx2_p8): Likewise.
9001 2021-09-29 Richard Biener <rguenther@suse.de>
9003 * tree-vect-data-refs.c (vect_dr_misalign_for_aligned_access):
9005 (vect_update_misalignment_for_peel): Use it to update
9006 misaligned to the value necessary for an aligned access.
9007 (vect_get_peeling_costs_all_drs): Likewise.
9008 (vect_enhance_data_refs_alignment): Likewise.
9010 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9012 * config/aarch64/aarch64.c (aarch64_expand_cpymem): Count number of
9013 emitted operations and adjust heuristic for code size.
9015 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9017 * config/aarch64/aarch64.c (aarch64_expand_setmem): Count number of
9018 emitted operations and adjust heuristic for code size.
9020 2021-09-29 Jakub Jelinek <jakub@redhat.com>
9022 PR middle-end/102504
9023 * gimplify.c (gimplify_scan_omp_clauses): Use omp_check_private even
9024 in OMP_SCOPE clauses, not just on worksharing construct clauses.
9026 2021-09-28 Geng Qi <gengqi@linux.alibaba.com>
9028 * config/riscv/riscv.md (mulv<mode>4): Call gen_smul<mode>3_highpart.
9029 (<u>mulditi3): Call <su>muldi3_highpart.
9030 (<u>muldi3_highpart): Rename to <su>muldi3_highpart.
9031 (<u>mulsidi3): Call <su>mulsi3_highpart.
9032 (<u>mulsi3_highpart): Rename to <su>mulsi3_highpart.
9034 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
9036 * config/darwin.h (DSYMUTIL_SPEC): Recognize D sources.
9038 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
9040 * config/rs6000/darwin.h (FIXED_R13): Add for PPC64.
9041 (FIRST_SAVED_GP_REGNO): Save from R13 even when it is one
9044 2021-09-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
9046 * config/aarch64/aarch64.h (AARCH64_FL_LS64): Define
9047 (AARCH64_FL_V8_7): Likewise.
9048 (AARCH64_FL_FOR_ARCH8_7): Likewise.
9049 * config/aarch64/aarch64-arches.def (armv8.7-a): Define.
9050 * config/aarch64/aarch64-option-extensions.def (ls64): Define.
9051 * doc/invoke.texi: Document the above.
9053 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9055 * dbgcnt.c (dbg_cnt_counter): New.
9056 * dbgcnt.h (dbg_cnt_counter): New.
9057 * dumpfile.c (dump_options): Add entry for TDF_THREADING.
9058 * dumpfile.h (enum dump_flag): Add TDF_THREADING.
9059 * gimple-range-path.cc (DEBUG_SOLVER): Use TDF_THREADING.
9060 * tree-ssa-threadupdate.c (dump_jump_thread_path): Dump out
9063 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9065 * cfgcleanup.c (pass_jump::execute): Check
9066 flag_expensive_optimizations.
9067 (pass_jump_after_combine::gate): Same.
9068 * doc/invoke.texi (-fthread-jumps): Enable for -O1.
9069 * opts.c (default_options_table): Enable -fthread-jumps at -O1.
9070 * tree-ssa-threadupdate.c
9071 (fwd_jt_path_registry::remove_jump_threads_including): Bail unless
9074 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
9076 * tree-ssa-reassoc.c (biased_names): New global.
9077 (propagate_bias_p): New function.
9078 (loop_carried_phi): Remove.
9079 (propagate_rank): Propagate bias along single uses.
9080 (get_rank): Update biased_names when needed.
9082 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
9084 * passes.def (pass_reassoc): Rename parameter to early_p.
9085 * tree-ssa-reassoc.c (reassoc_bias_loop_carried_phi_ranks_p):
9087 (phi_rank): Don't bias loop-carried phi ranks
9088 before vectorization pass.
9089 (execute_reassoc): Add bias_loop_carried_phi_ranks_p parameter.
9090 (pass_reassoc::pass_reassoc): Add bias_loop_carried_phi_ranks_p
9092 (pass_reassoc::set_param): Set bias_loop_carried_phi_ranks_p
9094 (pass_reassoc::execute): Pass bias_loop_carried_phi_ranks_p to
9096 (pass_reassoc::bias_loop_carried_phi_ranks_p): New member.
9098 2021-09-28 Jakub Jelinek <jakub@redhat.com>
9101 * config/i386/i386.c (standard_80387_constant_p): Don't recognize
9102 special 80387 instruction XFmode constants if flag_rounding_math.
9104 2021-09-28 Richard Biener <rguenther@suse.de>
9106 PR tree-optimization/100112
9107 * tree-ssa-sccvn.c (visit_reference_op_load): Record the
9108 referece into the hashtable twice in case last_vuse is
9109 different from the original vuse on the stmt.
9111 2021-09-28 Jakub Jelinek <jakub@redhat.com>
9113 PR middle-end/102492
9114 * gimplify.c (gimplify_adjust_omp_clauses_1): Don't call the
9115 omp_finish_clause langhook on implicitly added OMP_CLAUSE_PRIVATE
9116 clauses on SIMD constructs.
9118 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9120 PR tree-optimization/102511
9121 * gimple-range-path.cc (path_range_query::range_on_path_entry):
9122 Return VARYING when nothing found.
9124 2021-09-28 Hongyu Wang <hongyu.wang@intel.com>
9127 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): Add
9129 (VALID_SSE2_REG_VHF_MODE): Add V4HFmode and V2HFmode.
9130 (VALID_MMX_REG_MODE): Add V4HFmode.
9131 (SSE_REG_MODE_P): Replace VALID_AVX512FP16_REG_MODE with
9132 vector mode condition.
9133 * config/i386/i386.c (classify_argument): Parse V4HF/V2HF
9135 (function_arg_32): Add V4HFmode.
9136 (function_arg_advance_32): Likewise.
9137 * config/i386/i386.md (mode): Add V4HF/V2HF.
9138 (MODE_SIZE): Likewise.
9139 * config/i386/mmx.md (MMXMODE): Add V4HF mode.
9140 (V_32): Add V2HF mode.
9141 (VHF_32_64): New mode iterator.
9142 (*mov<mode>_internal): Adjust sse alternatives to support
9144 (*mov<mode>_internal): Adjust sse alternatives to support
9146 (<insn><mode>3): New define_insn for add/sub/mul/div.
9148 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
9150 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Check
9152 (pass_early_thread_jumps::gate): Same.
9153 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
9154 Return if !flag_thread_jumps.
9155 * tree-ssa-threadupdate.c
9156 (jt_path_registry::register_jump_thread): Assert that
9157 flag_thread_jumps is true.
9159 2021-09-28 liuhongt <hongtao.liu@intel.com>
9162 (simplify_context::simplify_binary_operation_1): Relax
9163 condition of simplifying (vec_concat:M (vec_select op0
9164 index0)(vec_select op1 index1)) to allow different modes
9165 between op0 and M, but have same inner mode.
9167 2021-09-28 liuhongt <hongtao.liu@intel.com>
9169 * config/i386/i386-expand.c (emit_reduc_half): Handle
9170 V8HF/V16HF/V32HFmode.
9171 * config/i386/sse.md (REDUC_SSE_PLUS_MODE): Add V8HF.
9172 (REDUC_SSE_SMINMAX_MODE): Ditto.
9173 (REDUC_PLUS_MODE): Add V16HF and V32HF.
9174 (REDUC_SMINMAX_MODE): Ditto.
9176 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9178 * gimple-range-path.cc
9179 (path_range_query::precompute_ranges_in_block): Rename to...
9180 (path_range_query::compute_ranges_in_block): ...this.
9181 (path_range_query::precompute_ranges): Rename to...
9182 (path_range_query::compute_ranges): ...this.
9183 (path_range_query::precompute_relations): Rename to...
9184 (path_range_query::compute_relations): ...this.
9185 (path_range_query::precompute_phi_relations): Rename to...
9186 (path_range_query::compute_phi_relations): ...this.
9187 * gimple-range-path.h: Rename precompute* to compute*.
9188 * tree-ssa-threadbackward.c
9189 (back_threader::find_taken_edge_switch): Same.
9190 (back_threader::find_taken_edge_cond): Same.
9191 * tree-ssa-threadedge.c
9192 (hybrid_jt_simplifier::compute_ranges_from_state): Same.
9193 (hybrid_jt_state::register_equivs_stmt): Inline...
9194 * tree-ssa-threadedge.h: ...here.
9196 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9198 * tree-vrp.c (lhs_of_dominating_assert): Remove.
9199 (class vrp_jt_state): Remove.
9200 (class vrp_jt_simplifier): Remove.
9201 (vrp_jt_simplifier::simplify): Remove.
9202 (class vrp_jump_threader): Remove.
9203 (vrp_jump_threader::vrp_jump_threader): Remove.
9204 (vrp_jump_threader::~vrp_jump_threader): Remove.
9205 (vrp_jump_threader::before_dom_children): Remove.
9206 (vrp_jump_threader::after_dom_children): Remove.
9208 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9210 * passes.def (pass_vrp_threader): New.
9211 * tree-pass.h (make_pass_vrp_threader): Add make_pass_vrp_threader.
9212 * tree-ssa-threadedge.c (hybrid_jt_state::register_equivs_stmt): New.
9213 (hybrid_jt_simplifier::hybrid_jt_simplifier): New.
9214 (hybrid_jt_simplifier::simplify): New.
9215 (hybrid_jt_simplifier::compute_ranges_from_state): New.
9216 * tree-ssa-threadedge.h (class hybrid_jt_state): New.
9217 (class hybrid_jt_simplifier): New.
9218 * tree-vrp.c (execute_vrp): Remove ASSERT_EXPR based jump
9220 (class hybrid_threader): New.
9221 (hybrid_threader::hybrid_threader): New.
9222 (hybrid_threader::~hybrid_threader): New.
9223 (hybrid_threader::before_dom_children): New.
9224 (hybrid_threader::after_dom_children): New.
9225 (execute_vrp_threader): New.
9226 (class pass_vrp_threader): New.
9227 (make_pass_vrp_threader): New.
9229 2021-09-27 Martin Liska <mliska@suse.cz>
9231 * output.h (enum section_flag): New.
9232 (SECTION_FORGET): Remove.
9233 (SECTION_ENTSIZE): Make it (1UL << 8) - 1.
9234 (SECTION_STYLE_MASK): Define it based on other enum
9236 * varasm.c (switch_to_section): Remove unused handling of
9239 2021-09-27 Martin Liska <mliska@suse.cz>
9241 * common.opt: Add new variable flag_default_complex_method.
9242 * opts.c (finish_options): Handle flags related to
9243 x_flag_complex_method.
9244 * toplev.c (process_options): Remove option handling related
9245 to flag_complex_method.
9247 2021-09-27 Richard Biener <rguenther@suse.de>
9249 PR middle-end/102450
9250 * gimple-fold.c (gimple_fold_builtin_memory_op): Avoid using
9251 type_for_size, instead use int_mode_for_size.
9253 2021-09-27 Andrew Pinski <apinski@marvell.com>
9256 * gimplify.c (gimplify_save_expr): Return early
9257 if the type of val is error_mark_node.
9259 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9261 * tree-ssanames.c (ssa_name_has_boolean_range): Use
9264 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9266 * gimple-ssa-evrp-analyze.h (class evrp_range_analyzer): Remove
9267 vrp_visit_cond_stmt.
9268 * tree-ssa-dom.c (cprop_operand): Convert to range_query API.
9269 (cprop_into_stmt): Same.
9270 (dom_opt_dom_walker::optimize_stmt): Same.
9272 2021-09-27 Richard Biener <rguenther@suse.de>
9274 PR tree-optimization/97351
9275 PR tree-optimization/97352
9276 PR tree-optimization/82426
9277 * tree-vectorizer.h (dr_misalignment): Add vector type
9279 (aligned_access_p): Likewise.
9280 (known_alignment_for_access_p): Likewise.
9281 (vect_supportable_dr_alignment): Likewise.
9282 (vect_known_alignment_in_bytes): Likewise. Refactor.
9283 (DR_MISALIGNMENT): Remove.
9284 (vect_update_shared_vectype): Likewise.
9285 * tree-vect-data-refs.c (dr_misalignment): Refactor, handle
9286 a vector type with larger alignment requirement and apply
9287 the negative step adjustment here.
9288 (vect_calculate_target_alignment): Remove.
9289 (vect_compute_data_ref_alignment): Get explicit vector type
9290 argument, do not apply a negative step alignment adjustment
9292 (vect_slp_analyze_node_alignment): Re-analyze alignment
9293 when we re-visit the DR with a bigger desired alignment but
9294 keep more precise results from smaller alignments.
9295 * tree-vect-slp.c (vect_update_shared_vectype): Remove.
9296 (vect_slp_analyze_node_operations_1): Do not update the
9297 shared vector type on stmts.
9298 * tree-vect-stmts.c (vect_analyze_stmt): Push/pop the
9299 vector type of an SLP node to the representative stmt-info.
9300 (vect_transform_stmt): Likewise.
9302 2021-09-27 liuhongt <hongtao.liu@intel.com>
9305 2021-09-09 liuhongt <hongtao.liu@intel.com>
9308 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
9309 (reduc_plus_scal_v4sf): .. this, New define_expand.
9310 (reduc_plus_scal_v2df): .. and this, New define_expand.
9312 2021-09-26 liuhongt <hongtao.liu@intel.com>
9314 * doc/extend.texi (Half-Precision): Remove storage only
9315 description for _Float16 w/o avx512fp16.
9317 2021-09-25 Dimitar Dimitrov <dimitar@dinux.eu>
9319 * config/pru/constraints.md (Rrio): New constraint.
9320 * config/pru/predicates.md (regio_operand): New predicate.
9321 * config/pru/pru-pragma.c (pru_register_pragmas): Register
9322 the __regio_symbol address space.
9323 * config/pru/pru-protos.h (pru_symref2ioregno): Declaration.
9324 * config/pru/pru.c (pru_symref2ioregno): New helper function.
9325 (pru_legitimate_address_p): Remove.
9326 (pru_addr_space_legitimate_address_p): Use the address space
9328 (pru_nongeneric_pointer_addrspace): New helper function.
9329 (pru_insert_attributes): New function to validate __regio_symbol
9331 (TARGET_INSERT_ATTRIBUTES): New macro.
9332 (TARGET_LEGITIMATE_ADDRESS_P): Remove.
9333 (TARGET_ADDR_SPACE_LEGITIMATE_ADDRESS_P): New macro.
9334 * config/pru/pru.h (enum reg_class): Add REGIO_REGS class.
9335 * config/pru/pru.md (*regio_readsi): New pattern to read I/O
9337 (*regio_nozext_writesi): New pattern to write to I/O registers.
9338 (*regio_zext_write_r30<EQS0:mode>): Ditto.
9339 * doc/extend.texi: Document the new PRU Named Address Space.
9341 2021-09-24 Patrick Palka <ppalka@redhat.com>
9345 * real.c (encode_ieee_double): Avoid unwanted sign extension.
9346 (encode_ieee_quad): Likewise.
9348 2021-09-24 Vladimir Makarov <vmakarov@redhat.com>
9350 PR rtl-optimization/102147
9351 * ira-build.c (ira_conflict_vector_profitable_p): Make
9352 profitability calculation independent of host compiler pointer and
9355 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
9357 * gimple-range-path.cc (path_range_query::path_range_query):
9358 Move debugging header...
9359 (path_range_query::precompute_ranges): ...here.
9360 (path_range_query::internal_range_of_expr): Do not call
9361 range_on_path_entry if NAME is defined in the current block.
9363 2021-09-24 Richard Biener <rguenther@suse.de>
9365 * cfghooks.c (verify_flow_info): Verify unallocated BB and
9366 edge flags are not set.
9368 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
9370 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
9372 (jt_path_registry::register_jump_thread): Call
9373 cancel_invalid_paths.
9374 * tree-ssa-threadupdate.h (class jt_path_registry): Add
9375 cancel_invalid_paths.
9377 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
9379 PR tree-optimization/102400
9380 * tree-ssa-sccvn.c (vn_reference_insert_pieces): Initialize
9381 result_vdef to zero value.
9383 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
9385 PR tree-optimization/102451
9386 * tree-ssa-dse.c (delete_dead_or_redundant_call): Record bb of stmt
9389 2021-09-24 Hongyu Wang <hongyu.wang@intel.com>
9391 * config/i386/sse.md (cond_<insn><mode>): Extend to support
9393 (cond_mul<mode>): Likewise.
9394 (cond_div<mode>): Likewise.
9395 (cond_<code><mode>): Likewise.
9396 (cond_fma<mode>): Likewise.
9397 (cond_fms<mode>): Likewise.
9398 (cond_fnma<mode>): Likewise.
9399 (cond_fnms<mode>): Likewise.
9401 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
9403 PR tree-optimization/102463
9404 * gimple-range-fold.cc (fold_using_range::relation_fold_and_or): If
9405 there is no range-ops handler, don't look for a relation.
9407 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
9409 * gimple-range-cache.cc (ranger_cache::ranger_cache): Take
9410 non-executable_edge flag as parameter.
9411 * gimple-range-cache.h (ranger_cache): Adjust prototype.
9412 * gimple-range-gori.cc (gori_compute::gori_compute): Take
9413 non-executable_edge flag as parameter.
9414 (gori_compute::outgoing_edge_range_p): Check new flag.
9415 * gimple-range-gori.h (gori_compute): Adjust prototype.
9416 * gimple-range.cc (gimple_ranger::gimple_ranger): Create new flag.
9417 (gimple_ranger::range_on_edge): Check new flag.
9418 * gimple-range.h (gimple_ranger::non_executable_edge_flag): New.
9419 * gimple-ssa-evrp.c (rvrp_folder): Pass ranger flag to simplifer.
9420 (hybrid_folder::hybrid_folder): Set ranger non-executable flag value.
9421 (hybrid_folder::fold_stmt): Set flag value in the simplifer.
9422 * vr-values.c (simplify_using_ranges::set_and_propagate_unexecutable):
9423 Use not_executable flag if provided inmstead of EDGE_EXECUTABLE.
9424 (simplify_using_ranges::simplify_switch_using_ranges): Clear
9425 EDGE_EXECUTABLE like it originally did.
9426 (simplify_using_ranges::cleanup_edges_and_switches): Clear any
9427 NON_EXECUTABLE flags.
9428 (simplify_using_ranges::simplify_using_ranges): Adjust.
9429 * vr-values.h (class simplify_using_ranges): Adjust.
9430 (simplify_using_ranges::set_range_query): Add non-executable flag param.
9432 2021-09-23 Bill Schmidt <wschmidt@linux.ibm.com>
9435 * config/rs6000/rs6000-call.c (rs6000_aggregate_candidate): Detect
9436 zero-width bit fields and return indicator.
9437 (rs6000_discover_homogeneous_aggregate): Diagnose when the
9438 presence of a zero-width bit field changes parameter passing in
9441 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9443 * gimple-range-fold.cc (fold_using_range::range_of_phi):
9444 Remove dominator check.
9446 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9448 * gimple-range-path.cc (path_range_query::precompute_relations):
9449 Hoist edge calculations before using EDGE_SUCC.
9451 2021-09-23 Jonathan Wakely <jwakely@redhat.com>
9453 * configure.ac: Fix --with-multilib-list description.
9454 * configure: Regenerate.
9456 2021-09-23 Richard Biener <rguenther@suse.de>
9458 PR tree-optimization/102448
9459 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
9460 Clear alignment info copied from DR_PTR_INFO.
9462 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9464 * config/i386/i386-expand.c (ix86_use_mask_cmp_p): Enable
9466 * config/i386/sse.md (sseintvecmodelower): Add HF vector modes.
9467 (<avx512>_store<mode>_mask): Extend to support HF vector modes.
9468 (vec_cmp<mode><avx512fmaskmodelower>): Likewise.
9469 (vcond_mask_<mode><avx512fmaskmodelower>): Likewise.
9470 (vcond<mode><mode>): New expander.
9471 (vcond<mode><sseintvecmodelower>): Likewise.
9472 (vcond<sseintvecmodelower><mode>): Likewise.
9473 (vcondu<mode><sseintvecmodelower>): Likewise.
9475 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9477 * config/i386/sse.md (extend<ssePHmodelower><mode>2):
9479 (extendv4hf<mode>2): Likewise.
9480 (extendv2hfv2df2): Likewise.
9481 (trunc<mode><ssePHmodelower>2): Likewise.
9482 (avx512fp16_vcvt<castmode>2ph_<mode>): Rename to ...
9483 (trunc<mode>v4hf2): ... this, and drop constraints.
9484 (avx512fp16_vcvtpd2ph_v2df): Rename to ...
9485 (truncv2dfv2hf2): ... this, and likewise.
9487 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9489 * config/i386/sse.md (float<floatunssuffix><mode><ssePHmodelower>2):
9491 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>):
9493 (float<floatunssuffix><mode>v4hf2): ... this, and drop constraints.
9494 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Rename to ...
9495 (float<floatunssuffix>v2div2hf2): ... this, and likewise.
9497 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9499 * config/i386/i386.md (fix<fixunssuffix>_trunchf<mode>2): New expander.
9500 (fixuns_trunchfhi2): Likewise.
9501 (*fixuns_trunchfsi2zext): New define_insn.
9502 * config/i386/sse.md (ssePHmodelower): New mode_attr.
9503 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
9504 New expander for same element vector fix_truncate.
9505 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
9506 Likewise for V4HF to V4SI/V4DI fix_truncate.
9507 (fix<fixunssuffix>_truncv2hfv2di2):
9508 Likeise for V2HF to V2DI fix_truncate.
9510 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9512 * config/i386/i386.md (<code>hf3): New expander.
9514 2021-09-23 liuhongt <hongtao.liu@intel.com>
9516 * config/i386/sse.md (FMAMODEM): extend to handle FP16.
9517 (VFH_SF_AVX512VL): Extend to handle HFmode.
9518 (VF_SF_AVX512VL): Deleted.
9520 2021-09-23 liuhongt <hongtao.liu@intel.com>
9522 * config/i386/i386.md (rinthf2): New expander.
9523 (nearbyinthf2): New expander.
9525 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9527 * tree-ssa-dom.c (class dom_jump_threader_simplifier): Rename...
9528 (class dom_jt_state): ...this and provide virtual overrides.
9529 (dom_jt_state::register_equiv): New.
9530 (class dom_jt_simplifier): Rename from
9531 dom_jump_threader_simplifier.
9532 (dom_jump_threader_simplifier::simplify): Rename...
9533 (dom_jt_simplifier::simplify): ...to this.
9534 (pass_dominator::execute): Use dom_jt_simplifier and
9536 * tree-ssa-threadedge.c (jump_threader::jump_threader):
9538 (jt_state::register_equivs_stmt): Abstract out...
9539 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
9541 (jump_threader::thread_around_empty_blocks): Update state.
9542 (jump_threader::thread_through_normal_block): Same.
9543 (jt_state::jt_state): Remove.
9544 (jt_state::push): Remove pass specific bits. Keep block vector
9546 (jt_state::append_path): New.
9547 (jt_state::pop): Remove pass specific bits.
9548 (jt_state::register_equiv): Same.
9549 (jt_state::record_ranges_from_stmt): Same.
9550 (jt_state::register_equivs_on_edge): Same. Rename...
9551 (jt_state::register_equivs_edge): ...to this.
9552 (jt_state::dump): New.
9553 (jt_state::debug): New.
9554 (jump_threader_simplifier::simplify): Remove.
9555 (jt_state::get_path): New.
9556 * tree-ssa-threadedge.h (class jt_simplifier): Make into a base
9557 class. Expose common functionality as virtual methods.
9558 (class jump_threader_simplifier): Same. Rename...
9559 (class jt_simplifier): ...to this.
9560 * tree-vrp.c (class vrp_jump_threader_simplifier): Rename...
9561 (class vrp_jt_simplifier): ...to this. Provide pass specific
9563 (class vrp_jt_state): New.
9564 (vrp_jump_threader_simplifier::simplify): Rename...
9565 (vrp_jt_simplifier::simplify): ...to this. Inline code from
9566 what used to be the base class.
9567 (vrp_jump_threader::vrp_jump_threader): Use vrp_jt_state and
9570 2021-09-22 Tobias Burnus <tobias@codesourcery.com>
9573 * doc/invoke.texi (-Wno-missing-include-dirs.): Document Fortran
9576 2021-09-22 Roger Sayle <roger@nextmovesoftware.com>
9577 Richard Biener <rguenther@suse.de>
9579 * match.pd (negation simplifications): Implement some negation
9580 folding transformations from fold-const.c's fold_negate_expr.
9581 * tree-ssa-sccvn.c (vn_nary_build_or_lookup_1): Add a SIMPLIFY
9582 argument, to control whether the op should be simplified prior
9583 to looking up/assigning a value number.
9584 (vn_nary_build_or_lookup): Update call to vn_nary_build_or_lookup_1.
9585 (vn_nary_simplify): Likewise.
9586 (visit_nary_op): Likewise, but when constructing a NEGATE_EXPR
9587 now call vn_nary_build_or_lookup_1 disabling simplification.
9589 2021-09-22 Jiufu Guo <guojiufu@linux.ibm.com>
9591 PR tree-optimization/102087
9592 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
9593 Update bound/cmp/control for niter.
9595 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
9597 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
9598 Move check for non-empty BB here.
9599 (fur_source::register_outgoing_edges): ...from here.
9601 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
9603 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
9604 Remove call to improve_range_with_equivs.
9605 (path_range_query::improve_range_with_equivs): Remove
9606 * gimple-range-path.h: Remove improve_range_with_equivs.
9608 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9610 * config/i386/avx512fp16intrin.h:
9611 (_mm512_mask_blend_ph): New intrinsic.
9612 (_mm512_permutex2var_ph): Ditto.
9613 (_mm512_permutexvar_ph): Ditto.
9614 * config/i386/avx512fp16vlintrin.h:
9615 (_mm256_mask_blend_ph): New intrinsic.
9616 (_mm256_permutex2var_ph): Ditto.
9617 (_mm256_permutexvar_ph): Ditto.
9618 (_mm_mask_blend_ph): Ditto.
9619 (_mm_permutex2var_ph): Ditto.
9620 (_mm_permutexvar_ph): Ditto.
9622 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9624 * config/i386/avx512fp16intrin.h: Add new intrinsics.
9625 (_mm512_conj_pch): New intrinsic.
9626 (_mm512_mask_conj_pch): Ditto.
9627 (_mm512_maskz_conj_pch): Ditto.
9628 * config/i386/avx512fp16vlintrin.h: Add new intrinsics.
9629 (_mm256_conj_pch): New intrinsic.
9630 (_mm256_mask_conj_pch): Ditto.
9631 (_mm256_maskz_conj_pch): Ditto.
9632 (_mm_conj_pch): Ditto.
9633 (_mm_mask_conj_pch): Ditto.
9634 (_mm_maskz_conj_pch): Ditto.
9636 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9638 * config/i386/avx512fp16intrin.h (_MM512_REDUCE_OP): New macro
9639 (_mm512_reduce_add_ph): New intrinsic.
9640 (_mm512_reduce_mul_ph): Ditto.
9641 (_mm512_reduce_min_ph): Ditto.
9642 (_mm512_reduce_max_ph): Ditto.
9643 * config/i386/avx512fp16vlintrin.h
9644 (_MM256_REDUCE_OP/_MM_REDUCE_OP): New macro.
9645 (_mm256_reduce_add_ph): New intrinsic.
9646 (_mm256_reduce_mul_ph): Ditto.
9647 (_mm256_reduce_min_ph): Ditto.
9648 (_mm256_reduce_max_ph): Ditto.
9649 (_mm_reduce_add_ph): Ditto.
9650 (_mm_reduce_mul_ph): Ditto.
9651 (_mm_reduce_min_ph): Ditto.
9652 (_mm_reduce_max_ph): Ditto.
9654 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9656 * config/i386/avx512fp16intrin.h (__m512h_u, __m256h_u,
9657 __m128h_u): New typedef.
9658 (_mm512_load_ph): New intrinsic.
9659 (_mm256_load_ph): Ditto.
9660 (_mm_load_ph): Ditto.
9661 (_mm512_loadu_ph): Ditto.
9662 (_mm256_loadu_ph): Ditto.
9663 (_mm_loadu_ph): Ditto.
9664 (_mm512_store_ph): Ditto.
9665 (_mm256_store_ph): Ditto.
9666 (_mm_store_ph): Ditto.
9667 (_mm512_storeu_ph): Ditto.
9668 (_mm256_storeu_ph): Ditto.
9669 (_mm_storeu_ph): Ditto.
9670 (_mm512_abs_ph): Ditto.
9671 * config/i386/avx512fp16vlintrin.h
9672 (_mm_abs_ph): Ditto.
9673 (_mm256_abs_ph): Ditto.
9675 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
9677 * config/s390/tpf.md (prologue_tpf, epilogue_tpf): Add cc clobber.
9679 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
9682 * config/s390/s390.c (s390_expand_insv): Emit a normal move if it
9683 is actually a full copy of the source operand into the target.
9684 Don't emit a strict low part move if source and target mode match.
9686 2021-09-22 Jakub Jelinek <jakub@redhat.com>
9688 PR middle-end/102415
9689 * omp-expand.c (expand_omp_single): If region->exit is NULL,
9690 assert region->entry is GIMPLE_OMP_SCOPE region and return.
9692 2021-09-22 Jakub Jelinek <jakub@redhat.com>
9694 * tree.h (OMP_CLAUSE_ALLOCATE_ALIGN): Define.
9695 * tree.c (omp_clause_num_ops): Change number of OMP_CLAUSE_ALLOCATE
9696 arguments from 2 to 3.
9697 * tree-pretty-print.c (dump_omp_clause): Print allocator() around
9698 allocate clause allocator and print align if present.
9699 * omp-low.c (scan_sharing_clauses): Force allocate_map entry even
9700 for omp_default_mem_alloc if align modifier is present. If align
9701 modifier is present, use TREE_LIST to encode both allocator and
9703 (lower_private_allocate, lower_rec_input_clauses, create_task_copyfn):
9704 Handle align modifier on allocator clause if present.
9706 2021-09-22 liuhongt <hongtao.liu@intel.com>
9708 * config/i386/i386.md (define_attr "isa"): Add
9710 (define_attr "enabled"): Correspond fma_or_avx512vl to
9711 TARGET_FMA || TARGET_AVX512VL.
9712 * config/i386/mmx.md (fmav2sf4): Extend to AVX512 fma.
9717 2021-09-22 liuhongt <hongtao.liu@intel.com>
9719 * config/i386/i386.md (cstorehf3): New define_expand.
9721 2021-09-22 liuhongt <hongtao.liu@intel.com>
9723 * config/i386/i386.md (<rounding_insn>hf2): New expander.
9724 (sse4_1_round<mode>2): Extend from MODEF to MODEFH.
9725 * config/i386/sse.md (*sse4_1_round<ssescalarmodesuffix>):
9726 Extend from VF_128 to VFH_128.
9728 2021-09-22 liuhongt <hongtao.liu@intel.com>
9730 * config/i386/i386-features.c (i386-features.c): Handle
9732 * config/i386/i386.md (sqrthf2): New expander.
9733 (*sqrthf2): New define_insn.
9734 * config/i386/sse.md
9735 (*<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
9738 2021-09-22 liuhongt <hongtao.liu@intel.com>
9740 * config/i386/avx512fp16intrin.h (_mm_mask_fcmadd_sch):
9742 (_mm_mask3_fcmadd_sch): Likewise.
9743 (_mm_maskz_fcmadd_sch): Likewise.
9744 (_mm_fcmadd_sch): Likewise.
9745 (_mm_mask_fmadd_sch): Likewise.
9746 (_mm_mask3_fmadd_sch): Likewise.
9747 (_mm_maskz_fmadd_sch): Likewise.
9748 (_mm_fmadd_sch): Likewise.
9749 (_mm_mask_fcmadd_round_sch): Likewise.
9750 (_mm_mask3_fcmadd_round_sch): Likewise.
9751 (_mm_maskz_fcmadd_round_sch): Likewise.
9752 (_mm_fcmadd_round_sch): Likewise.
9753 (_mm_mask_fmadd_round_sch): Likewise.
9754 (_mm_mask3_fmadd_round_sch): Likewise.
9755 (_mm_maskz_fmadd_round_sch): Likewise.
9756 (_mm_fmadd_round_sch): Likewise.
9757 (_mm_fcmul_sch): Likewise.
9758 (_mm_mask_fcmul_sch): Likewise.
9759 (_mm_maskz_fcmul_sch): Likewise.
9760 (_mm_fmul_sch): Likewise.
9761 (_mm_mask_fmul_sch): Likewise.
9762 (_mm_maskz_fmul_sch): Likewise.
9763 (_mm_fcmul_round_sch): Likewise.
9764 (_mm_mask_fcmul_round_sch): Likewise.
9765 (_mm_maskz_fcmul_round_sch): Likewise.
9766 (_mm_fmul_round_sch): Likewise.
9767 (_mm_mask_fmul_round_sch): Likewise.
9768 (_mm_maskz_fmul_round_sch): Likewise.
9769 * config/i386/i386-builtin.def: Add corresponding new builtins.
9770 * config/i386/sse.md
9771 (avx512fp16_fmaddcsh_v8hf_maskz<round_expand_name>): New expander.
9772 (avx512fp16_fcmaddcsh_v8hf_maskz<round_expand_name>): Ditto.
9773 (avx512fp16_fma_<complexopname>sh_v8hf<mask_scalarcz_name><round_scalarcz_name>):
9775 (avx512fp16_<complexopname>sh_v8hf_mask<round_name>): Ditto.
9776 (avx512fp16_<complexopname>sh_v8hf<mask_scalarc_name><round_scalarcz_name>):
9778 * config/i386/subst.md (mask_scalarcz_name): New.
9779 (mask_scalarc_name): Ditto.
9780 (mask_scalarc_operand3): Ditto.
9781 (mask_scalarcz_operand4): Ditto.
9782 (round_scalarcz_name): Ditto.
9783 (round_scalarc_mask_operand3): Ditto.
9784 (round_scalarcz_mask_operand4): Ditto.
9785 (round_scalarc_mask_op3): Ditto.
9786 (round_scalarcz_mask_op4): Ditto.
9787 (round_scalarcz_constraint): Ditto.
9788 (round_scalarcz_nimm_predicate): Ditto.
9789 (mask_scalarcz): Ditto.
9790 (mask_scalarc): Ditto.
9791 (round_scalarcz): Ditto.
9793 2021-09-22 liuhongt <hongtao.liu@intel.com>
9795 * config/i386/avx512fp16intrin.h (_mm512_fcmadd_pch):
9797 (_mm512_mask_fcmadd_pch): Likewise.
9798 (_mm512_mask3_fcmadd_pch): Likewise.
9799 (_mm512_maskz_fcmadd_pch): Likewise.
9800 (_mm512_fmadd_pch): Likewise.
9801 (_mm512_mask_fmadd_pch): Likewise.
9802 (_mm512_mask3_fmadd_pch): Likewise.
9803 (_mm512_maskz_fmadd_pch): Likewise.
9804 (_mm512_fcmadd_round_pch): Likewise.
9805 (_mm512_mask_fcmadd_round_pch): Likewise.
9806 (_mm512_mask3_fcmadd_round_pch): Likewise.
9807 (_mm512_maskz_fcmadd_round_pch): Likewise.
9808 (_mm512_fmadd_round_pch): Likewise.
9809 (_mm512_mask_fmadd_round_pch): Likewise.
9810 (_mm512_mask3_fmadd_round_pch): Likewise.
9811 (_mm512_maskz_fmadd_round_pch): Likewise.
9812 (_mm512_fcmul_pch): Likewise.
9813 (_mm512_mask_fcmul_pch): Likewise.
9814 (_mm512_maskz_fcmul_pch): Likewise.
9815 (_mm512_fmul_pch): Likewise.
9816 (_mm512_mask_fmul_pch): Likewise.
9817 (_mm512_maskz_fmul_pch): Likewise.
9818 (_mm512_fcmul_round_pch): Likewise.
9819 (_mm512_mask_fcmul_round_pch): Likewise.
9820 (_mm512_maskz_fcmul_round_pch): Likewise.
9821 (_mm512_fmul_round_pch): Likewise.
9822 (_mm512_mask_fmul_round_pch): Likewise.
9823 (_mm512_maskz_fmul_round_pch): Likewise.
9824 * config/i386/avx512fp16vlintrin.h (_mm_fmadd_pch):
9826 (_mm_mask_fmadd_pch): Likewise.
9827 (_mm_mask3_fmadd_pch): Likewise.
9828 (_mm_maskz_fmadd_pch): Likewise.
9829 (_mm256_fmadd_pch): Likewise.
9830 (_mm256_mask_fmadd_pch): Likewise.
9831 (_mm256_mask3_fmadd_pch): Likewise.
9832 (_mm256_maskz_fmadd_pch): Likewise.
9833 (_mm_fcmadd_pch): Likewise.
9834 (_mm_mask_fcmadd_pch): Likewise.
9835 (_mm_mask3_fcmadd_pch): Likewise.
9836 (_mm_maskz_fcmadd_pch): Likewise.
9837 (_mm256_fcmadd_pch): Likewise.
9838 (_mm256_mask_fcmadd_pch): Likewise.
9839 (_mm256_mask3_fcmadd_pch): Likewise.
9840 (_mm256_maskz_fcmadd_pch): Likewise.
9841 (_mm_fmul_pch): Likewise.
9842 (_mm_mask_fmul_pch): Likewise.
9843 (_mm_maskz_fmul_pch): Likewise.
9844 (_mm256_fmul_pch): Likewise.
9845 (_mm256_mask_fmul_pch): Likewise.
9846 (_mm256_maskz_fmul_pch): Likewise.
9847 (_mm_fcmul_pch): Likewise.
9848 (_mm_mask_fcmul_pch): Likewise.
9849 (_mm_maskz_fcmul_pch): Likewise.
9850 (_mm256_fcmul_pch): Likewise.
9851 (_mm256_mask_fcmul_pch): Likewise.
9852 (_mm256_maskz_fcmul_pch): Likewise.
9853 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF_V8HF_V8HF,
9854 V8HF_FTYPE_V16HF_V16HF_V16HF, V16HF_FTYPE_V16HF_V16HF_V16HF_UQI,
9855 V32HF_FTYPE_V32HF_V32HF_V32HF_INT,
9856 V32HF_FTYPE_V32HF_V32HF_V32HF_UHI_INT): Add new builtin types.
9857 * config/i386/i386-builtin.def: Add new builtins.
9858 * config/i386/i386-expand.c: Handle new builtin types.
9859 * config/i386/subst.md (SUBST_CV): New.
9860 (maskc_name): Ditto.
9861 (maskc_operand3): Ditto.
9863 (sdc_maskz_name): Ditto.
9864 (sdc_mask_op4): Ditto.
9865 (sdc_mask_op5): Ditto.
9866 (sdc_mask_mode512bit_condition): Ditto.
9868 (round_maskc_operand3): Ditto.
9869 (round_sdc_mask_operand4): Ditto.
9870 (round_maskc_op3): Ditto.
9871 (round_sdc_mask_op4): Ditto.
9872 (round_saeonly_sdc_mask_operand5): Ditto.
9873 * config/i386/sse.md (unspec): Add complex fma unspecs.
9874 (avx512fmaskcmode): New.
9875 (UNSPEC_COMPLEX_F_C_MA): Ditto.
9876 (UNSPEC_COMPLEX_F_C_MUL): Ditto.
9877 (complexopname): Ditto.
9878 (<avx512>_fmaddc_<mode>_maskz<round_expand_name>): New expander.
9879 (<avx512>_fcmaddc_<mode>_maskz<round_expand_name>): Ditto.
9880 (fma_<complexopname>_<mode><sdc_maskz_name><round_name>): New
9882 (<avx512>_<complexopname>_<mode>_mask<round_name>): Ditto.
9883 (<avx512>_<complexopname>_<mode><maskc_name><round_name>): Ditto.
9885 2021-09-22 Kewen Lin <linkw@linux.ibm.com>
9887 * config/rs6000/rs6000.opt (rs6000-density-pct-threshold,
9888 rs6000-density-size-threshold, rs6000-density-penalty,
9889 rs6000-density-load-pct-threshold,
9890 rs6000-density-load-num-threshold): New parameter.
9891 * config/rs6000/rs6000.c (rs6000_density_test): Adjust with
9892 corresponding parameters.
9894 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9896 * gimple-range-path.cc (path_range_query::defined_outside_path):
9898 (path_range_query::range_on_path_entry): New.
9899 (path_range_query::internal_range_of_expr): Resolve unknowns
9901 (path_range_query::improve_range_with_equivs): New.
9902 (path_range_query::ssa_range_in_phi): Resolve unknowns with
9904 * gimple-range-path.h (class path_range_query): Add
9905 defined_outside_path, range_on_path_entry, and
9906 improve_range_with_equivs.
9908 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9910 * gimple-range-path.cc (path_range_query::add_to_imports): New.
9911 (path_range_query::add_copies_to_imports): New.
9912 (path_range_query::precompute_ranges): Call
9913 add_copies_to_imports.
9914 * gimple-range-path.h (class path_range_query): Add prototypes
9915 for add_copies_to_imports and add_to_imports.
9917 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9919 * gimple-range-path.cc (path_range_query::range_defined_in_block):
9920 Remove useless code.
9922 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9924 * gimple-range-fold.h (class fur_source): Make oracle protected.
9925 * gimple-range-path.cc (path_range_query::path_range_query): Add
9926 resolve argument. Initialize oracle.
9927 (path_range_query::~path_range_query): Delete oracle.
9928 (path_range_query::range_of_stmt): Adapt to use relations.
9929 (path_range_query::precompute_ranges): Pre-compute relations.
9930 (class jt_fur_source): New
9931 (jt_fur_source::jt_fur_source): New.
9932 (jt_fur_source::register_relation): New.
9933 (jt_fur_source::query_relation): New.
9934 (path_range_query::precompute_relations): New.
9935 (path_range_query::precompute_phi_relations): New.
9936 * gimple-range-path.h (path_range_query): Add resolve argument.
9937 Add oracle, precompute_relations, precompute_phi_relations.
9938 * tree-ssa-threadbackward.c (back_threader::back_threader): Pass
9939 resolve argument to solver.
9941 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9943 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
9944 Rename postfold_gcond_edges to register_outgoing_edges and
9946 (fold_using_range::postfold_gcond_edges): Rename...
9947 (fur_source::register_outgoing_edges): ...to this.
9948 * gimple-range-fold.h (postfold_gcond_edges): Rename to
9949 register_outgoing_edges and move to fur_source.
9951 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9953 * gimple-range-fold.cc (fold_using_range::range_of_phi): Check
9954 dom_info_available_p.
9956 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9958 * gimple-range-cache.cc (non_null_ref::non_null_ref): Use create
9959 and quick_grow_cleared instead of safe_grow_cleared.
9961 2021-09-21 Thomas Schwinge <thomas@codesourcery.com>
9964 * omp-oacc-neuter-broadcast.cc (oacc_do_neutering): Evaluate
9967 2021-09-21 Richard Earnshaw <rearnsha@arm.com>
9969 * configure.ac: Detect when the assembler supports new-style
9970 architecture extensions.
9971 * common/config/arm/arm-common.c (arm_rewrite_mcpu): Return
9972 the full CPU string if the assembler can grok it.
9973 (arm_rewrite_march): Likewise but for the architecture.
9974 * config.in: Regenerate.
9975 * configure: Regenerate.
9977 2021-09-21 Richard Biener <rguenther@suse.de>
9979 PR tree-optimization/102421
9980 * tree-vect-loop.c (vect_dissolve_slp_only_groups): Copy and
9981 adjust alignment info.
9983 2021-09-21 Kewen Lin <linkw@linux.ibm.com>
9985 * ipa-fnsummary.c (ipa_fn_summary_write): Remove inconsistent
9986 bitfield stream out.
9988 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
9990 * gimple-range-fold.cc (fold_using_range::range_of_phi): Ignore
9991 undefined edges, apply an equivalence if appropriate.
9992 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Return
9993 UNDEFINED if EDGE_EXECUTABLE is not set.
9994 * gimple-range.cc (gimple_ranger::gimple_ranger): Set all edges
9995 as EXECUTABLE upon startup.
9996 (gimple_ranger::range_on_edge): Return UNDEFINED for edges without
9997 EDGE_EXECUTABLE set.
9998 * vr-values.c (set_and_propagate_unexecutable): New.
9999 (simplify_using_ranges::fold_cond): Call set_and_propagate.
10000 (simplify_using_ranges::simplify_switch_using_ranges): Ditto.
10001 * vr-values.h: Add prototype.
10003 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
10005 * value-relation.cc (equiv_oracle::register_initial_def): New.
10006 (equiv_oracle::register_relation): Call register_initial_def.
10007 (equiv_oracle::add_equiv_to_block): New. Split register_relation.
10008 (relation_oracle::register_stmt): Check def block of PHI arguments.
10009 * value-relation.h (equiv_oracle): Add new prototypes.
10011 2021-09-20 Matthias Kretz <m.kretz@gsi.de>
10013 * cppbuiltin.c (define_builtin_macros_for_compilation_flags):
10014 Define __RECIPROCAL_MATH__, __NO_SIGNED_ZEROS__,
10015 __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, and
10016 __ROUNDING_MATH__ according to their corresponding flags.
10017 * doc/cpp.texi: Document __RECIPROCAL_MATH__,
10018 __NO_SIGNED_ZEROS__, __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__,
10019 and __ROUNDING_MATH__.
10021 2021-09-20 Richard Biener <rguenther@suse.de>
10023 * tree-vect-stmts.c (vectorizable_load): Use the vectype
10026 2021-09-20 Richard Biener <rguenther@suse.de>
10028 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
10029 Do not compute alignment of the vectorized access here.
10031 2021-09-20 Richard Biener <rguenther@suse.de>
10033 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
10034 Store -1 for runtime alias peeling iterations.
10036 2021-09-20 Richard Biener <rguenther@suse.de>
10038 * config.gcc: Obsolete hppa[12]*-*-hpux10* and hppa[12]*-*-hpux11*.
10040 2021-09-20 Thomas Schwinge <thomas@codesourcery.com>
10042 * input.c (string_concat_db::record_string_concatenation)
10043 (string_concat_db::get_string_concatenation): Skip for
10044 'RESERVED_LOCATION_P'.
10046 2021-09-20 Richard Biener <rguenther@suse.de>
10048 PR tree-optimization/65206
10049 * tree-data-ref.h (struct data_reference): Add alt_indices,
10051 * tree-data-ref.c (free_data_ref): Release alt_indices.
10052 (dr_analyze_indices): Work on struct indices and get DR_REF as tree.
10053 (create_data_ref): Adjust.
10054 (initialize_data_dependence_relation): Split into head
10055 and tail. When the base objects fail to match up try
10056 again with pointer-based analysis of indices.
10057 * tree-vectorizer.c (vec_info_shared::check_datarefs): Do
10058 not compare the lazily computed alternate set of indices.
10060 2021-09-20 Iain Sandoe <iain@sandoe.co.uk>
10062 * gcc.c: Test for execute OK when we find the
10063 programs for assembler linker and dsymutil and those
10064 were specified at configure-time.
10066 2021-09-19 Martin Sebor <msebor@redhat.com>
10068 PR middle-end/102403
10069 * gimple-predicate-analysis.cc (predicate::init_from_control_deps):
10070 Correct a function pre/postcondition.
10072 2021-09-19 Martin Sebor <msebor@redhat.com>
10074 PR middle-end/102243
10075 * tree-ssa-strlen.c (get_range): Handle null cfun.
10077 2021-09-19 Iain Sandoe <iain@sandoe.co.uk>
10079 * config/darwin.h (LINK_COMMAND_SPEC_A): Use Darwin10
10080 unwinder shim as a convenience library.
10082 2021-09-19 Andrew Pinski <apinski@marvell.com>
10084 * doc/install.texi: Add note about
10085 binutils 2.35 is required for LTO usage.
10087 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
10089 * tree-ssa-threadbackward.c
10090 (back_threader_registry::register_path): Use push_edge.
10091 * tree-ssa-threadedge.c
10092 (jump_threader::thread_around_empty_blocks): Same.
10093 (jump_threader::thread_through_normal_block): Same.
10094 (jump_threader::thread_across_edge): Same. Also, use auto_bitmap.
10096 * tree-ssa-threadupdate.c
10097 (jt_path_registry::allocate_thread_edge): Remove.
10098 (jt_path_registry::push_edge): New.
10099 (dump_jump_thread_path): Make static.
10100 * tree-ssa-threadupdate.h (allocate_thread_edge): Remove.
10103 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
10105 * gimple-range-path.cc (path_range_query::path_range_query): Add
10107 (path_range_query::dump): Remove extern declaration of dump_ranger.
10108 * gimple-range-trace.cc (dump_ranger): Add DEBUG_FUNCTION marker.
10109 * gimple-range-trace.h (dump_ranger): Add prototype.
10111 2021-09-19 John Ericson <git@JohnEricson.me>
10113 * gcc.c (find_a_program): New function, factored out of...
10114 (find_a_file): Here.
10115 (execute): Use find_a_program when looking for programs rather
10118 2021-09-19 Matwey V. Kornilov <matwey.kornilov@gmail.com>
10120 * config/avr/avr-mcus.def: Add atmega324pb.
10121 * doc/avr-mmcu.texi: Corresponding changes.
10123 2021-09-19 Roger Sayle <roger@nextmovesoftware.com>
10125 PR middle-end/88173
10126 * match.pd (cmp @0 REAL_CST@1): When @0 is also REAL_CST, apply
10127 the same transformations as to @1. For comparisons against NaN,
10128 don't check HONOR_SNANS but confirm that neither operand is a
10131 2021-09-19 Benjamin Peterson <benjamin@locrian.net>
10133 * attribs.c (make_unique_name): Delete.
10134 * attribs.h (make_unique_name): Delete.
10136 2021-09-19 Andrew Pinski <apinski@marvell.com>
10138 * lra-constraints.c (check_and_process_move): Assert
10139 that dclass and sclass are greater than or equal to NO_REGS.
10141 2021-09-18 Jakub Jelinek <jakub@redhat.com>
10143 * tree.h (OMP_CLAUSE_ORDER_UNCONSTRAINED): Define.
10144 * tree-pretty-print.c (dump_omp_clause): Print unconstrained:
10145 for OMP_CLAUSE_ORDER_UNCONSTRAINED.
10147 2021-09-18 liuhongt <hongtao.liu@intel.com>
10149 * config/i386/i386-features.c (remove_partial_avx_dependency):
10150 Restrict TARGET_USE_VECTOR_FP_CONVERTS and
10151 TARGET_USE_VECTOR_CONVERTS to conversion instructions only.
10153 2021-09-18 Jakub Jelinek <jakub@redhat.com>
10155 * gimplify.c (omp_default_clause): For C/C++ default({,first}private),
10156 if file/namespace scope variable doesn't have predetermined sharing,
10157 treat it as if there was default(none).
10159 2021-09-18 liuhongt <hongtao.liu@intel.com>
10161 * config/i386/avx512fp16intrin.h (_mm_fmadd_sh):
10163 (_mm_mask_fmadd_sh): Likewise.
10164 (_mm_mask3_fmadd_sh): Likewise.
10165 (_mm_maskz_fmadd_sh): Likewise.
10166 (_mm_fmadd_round_sh): Likewise.
10167 (_mm_mask_fmadd_round_sh): Likewise.
10168 (_mm_mask3_fmadd_round_sh): Likewise.
10169 (_mm_maskz_fmadd_round_sh): Likewise.
10170 (_mm_fnmadd_sh): Likewise.
10171 (_mm_mask_fnmadd_sh): Likewise.
10172 (_mm_mask3_fnmadd_sh): Likewise.
10173 (_mm_maskz_fnmadd_sh): Likewise.
10174 (_mm_fnmadd_round_sh): Likewise.
10175 (_mm_mask_fnmadd_round_sh): Likewise.
10176 (_mm_mask3_fnmadd_round_sh): Likewise.
10177 (_mm_maskz_fnmadd_round_sh): Likewise.
10178 (_mm_fmsub_sh): Likewise.
10179 (_mm_mask_fmsub_sh): Likewise.
10180 (_mm_mask3_fmsub_sh): Likewise.
10181 (_mm_maskz_fmsub_sh): Likewise.
10182 (_mm_fmsub_round_sh): Likewise.
10183 (_mm_mask_fmsub_round_sh): Likewise.
10184 (_mm_mask3_fmsub_round_sh): Likewise.
10185 (_mm_maskz_fmsub_round_sh): Likewise.
10186 (_mm_fnmsub_sh): Likewise.
10187 (_mm_mask_fnmsub_sh): Likewise.
10188 (_mm_mask3_fnmsub_sh): Likewise.
10189 (_mm_maskz_fnmsub_sh): Likewise.
10190 (_mm_fnmsub_round_sh): Likewise.
10191 (_mm_mask_fnmsub_round_sh): Likewise.
10192 (_mm_mask3_fnmsub_round_sh): Likewise.
10193 (_mm_maskz_fnmsub_round_sh): Likewise.
10194 * config/i386/i386-builtin-types.def
10195 (V8HF_FTYPE_V8HF_V8HF_V8HF_UQI_INT): New builtin type.
10196 * config/i386/i386-builtin.def: Add new builtins.
10197 * config/i386/i386-expand.c: Handle new builtin type.
10198 * config/i386/sse.md (fmai_vmfmadd_<mode><round_name>):
10199 Ajdust to support FP16.
10200 (fmai_vmfmsub_<mode><round_name>): Ditto.
10201 (fmai_vmfnmadd_<mode><round_name>): Ditto.
10202 (fmai_vmfnmsub_<mode><round_name>): Ditto.
10203 (*fmai_fmadd_<mode>): Ditto.
10204 (*fmai_fmsub_<mode>): Ditto.
10205 (*fmai_fnmadd_<mode><round_name>): Ditto.
10206 (*fmai_fnmsub_<mode><round_name>): Ditto.
10207 (avx512f_vmfmadd_<mode>_mask<round_name>): Ditto.
10208 (avx512f_vmfmadd_<mode>_mask3<round_name>): Ditto.
10209 (avx512f_vmfmadd_<mode>_maskz<round_expand_name>): Ditto.
10210 (avx512f_vmfmadd_<mode>_maskz_1<round_name>): Ditto.
10211 (*avx512f_vmfmsub_<mode>_mask<round_name>): Ditto.
10212 (avx512f_vmfmsub_<mode>_mask3<round_name>): Ditto.
10213 (*avx512f_vmfmsub_<mode>_maskz_1<round_name>): Ditto.
10214 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
10215 (*avx512f_vmfnmsub_<mode>_mask3<round_name>): Ditto.
10216 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
10217 (*avx512f_vmfnmadd_<mode>_mask<round_name>): Renamed to ...
10218 (avx512f_vmfnmadd_<mode>_mask<round_name>) ... this, and
10219 adjust to support FP16.
10220 (avx512f_vmfnmadd_<mode>_mask3<round_name>): Ditto.
10221 (avx512f_vmfnmadd_<mode>_maskz_1<round_name>): Ditto.
10222 (avx512f_vmfnmadd_<mode>_maskz<round_expand_name>): New
10225 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
10227 * config/i386/sse.md (avx512fmaskmodelower): Extend to support
10229 (maskload<mode><avx512fmaskmodelower>): Ditto.
10230 (maskstore<mode><avx512fmaskmodelower>): Ditto.
10232 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
10234 * config/i386/i386-expand.c (ix86_expand_fp_absneg_operator):
10236 (ix86_expand_copysign): Ditto.
10237 (ix86_expand_xorsign): Ditto.
10238 * config/i386/i386.c (ix86_build_const_vector): Handle HF vector
10240 (ix86_build_signbit_mask): Ditto.
10241 (ix86_can_change_mode_class): Ditto.
10242 * config/i386/i386.md
10243 (SSEMODEF): Add HFmode.
10244 (ssevecmodef): Ditto.
10245 (<code>hf2): New define_expand.
10246 (*<code>hf2_1): New define_insn_and_split.
10247 (copysign<mode>): Extend to support HFmode under AVX512FP16.
10248 (xorsign<mode>): Ditto.
10249 * config/i386/sse.md (VFB): New mode iterator.
10250 (VFB_128_256): Ditto.
10252 (sseintvecmode2): Support HF vector mode.
10253 (<code><mode>2): Use new mode iterator.
10254 (*<code><mode>2): Ditto.
10255 (copysign<mode>3): Ditto.
10256 (xorsign<mode>3): Ditto.
10257 (<code><mode>3<mask_name>): Ditto.
10258 (<code><mode>3<mask_name>): Ditto.
10259 (<sse>_andnot<mode>3<mask_name>): Adjust for HF vector mode.
10260 (<sse>_andnot<mode>3<mask_name>): Ditto.
10261 (*<code><mode>3<mask_name>): Ditto.
10262 (*<code><mode>3<mask_name>): Ditto.
10264 2021-09-18 liuhongt <hongtao.liu@intel.com>
10266 * config/i386/avx512fp16intrin.h (_mm512_mask_fmadd_ph):
10268 (_mm512_mask3_fmadd_ph): Likewise.
10269 (_mm512_maskz_fmadd_ph): Likewise.
10270 (_mm512_fmadd_round_ph): Likewise.
10271 (_mm512_mask_fmadd_round_ph): Likewise.
10272 (_mm512_mask3_fmadd_round_ph): Likewise.
10273 (_mm512_maskz_fmadd_round_ph): Likewise.
10274 (_mm512_fnmadd_ph): Likewise.
10275 (_mm512_mask_fnmadd_ph): Likewise.
10276 (_mm512_mask3_fnmadd_ph): Likewise.
10277 (_mm512_maskz_fnmadd_ph): Likewise.
10278 (_mm512_fnmadd_round_ph): Likewise.
10279 (_mm512_mask_fnmadd_round_ph): Likewise.
10280 (_mm512_mask3_fnmadd_round_ph): Likewise.
10281 (_mm512_maskz_fnmadd_round_ph): Likewise.
10282 (_mm512_fmsub_ph): Likewise.
10283 (_mm512_mask_fmsub_ph): Likewise.
10284 (_mm512_mask3_fmsub_ph): Likewise.
10285 (_mm512_maskz_fmsub_ph): Likewise.
10286 (_mm512_fmsub_round_ph): Likewise.
10287 (_mm512_mask_fmsub_round_ph): Likewise.
10288 (_mm512_mask3_fmsub_round_ph): Likewise.
10289 (_mm512_maskz_fmsub_round_ph): Likewise.
10290 (_mm512_fnmsub_ph): Likewise.
10291 (_mm512_mask_fnmsub_ph): Likewise.
10292 (_mm512_mask3_fnmsub_ph): Likewise.
10293 (_mm512_maskz_fnmsub_ph): Likewise.
10294 (_mm512_fnmsub_round_ph): Likewise.
10295 (_mm512_mask_fnmsub_round_ph): Likewise.
10296 (_mm512_mask3_fnmsub_round_ph): Likewise.
10297 (_mm512_maskz_fnmsub_round_ph): Likewise.
10298 * config/i386/avx512fp16vlintrin.h (_mm256_fmadd_ph):
10300 (_mm256_mask_fmadd_ph): Likewise.
10301 (_mm256_mask3_fmadd_ph): Likewise.
10302 (_mm256_maskz_fmadd_ph): Likewise.
10303 (_mm_fmadd_ph): Likewise.
10304 (_mm_mask_fmadd_ph): Likewise.
10305 (_mm_mask3_fmadd_ph): Likewise.
10306 (_mm_maskz_fmadd_ph): Likewise.
10307 (_mm256_fnmadd_ph): Likewise.
10308 (_mm256_mask_fnmadd_ph): Likewise.
10309 (_mm256_mask3_fnmadd_ph): Likewise.
10310 (_mm256_maskz_fnmadd_ph): Likewise.
10311 (_mm_fnmadd_ph): Likewise.
10312 (_mm_mask_fnmadd_ph): Likewise.
10313 (_mm_mask3_fnmadd_ph): Likewise.
10314 (_mm_maskz_fnmadd_ph): Likewise.
10315 (_mm256_fmsub_ph): Likewise.
10316 (_mm256_mask_fmsub_ph): Likewise.
10317 (_mm256_mask3_fmsub_ph): Likewise.
10318 (_mm256_maskz_fmsub_ph): Likewise.
10319 (_mm_fmsub_ph): Likewise.
10320 (_mm_mask_fmsub_ph): Likewise.
10321 (_mm_mask3_fmsub_ph): Likewise.
10322 (_mm_maskz_fmsub_ph): Likewise.
10323 (_mm256_fnmsub_ph): Likewise.
10324 (_mm256_mask_fnmsub_ph): Likewise.
10325 (_mm256_mask3_fnmsub_ph): Likewise.
10326 (_mm256_maskz_fnmsub_ph): Likewise.
10327 (_mm_fnmsub_ph): Likewise.
10328 (_mm_mask_fnmsub_ph): Likewise.
10329 (_mm_mask3_fnmsub_ph): Likewise.
10330 (_mm_maskz_fnmsub_ph): Likewise.
10331 * config/i386/i386-builtin.def: Add corresponding new builtins.
10332 * config/i386/sse.md
10333 (<avx512>_fmadd_<mode>_maskz<round_expand_name>): Adjust to
10334 support HF vector modes.
10335 (<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name><round_name>):
10337 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
10338 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
10339 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
10340 (<avx512>_fmadd_<mode>_mask<round_name>): Ditto.
10341 (<avx512>_fmadd_<mode>_mask3<round_name>): Ditto.
10342 (<avx512>_fmsub_<mode>_maskz<round_expand_name>): Ditto.
10343 (<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name><round_name>):
10345 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
10346 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
10347 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
10348 (<avx512>_fmsub_<mode>_mask<round_name>): Ditto.
10349 (<avx512>_fmsub_<mode>_mask3<round_name>): Ditto.
10350 (<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name><round_name>):
10352 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
10353 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
10354 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
10355 (<avx512>_fnmadd_<mode>_mask<round_name>): Ditto.
10356 (<avx512>_fnmadd_<mode>_mask3<round_name>): Ditto.
10357 (<avx512>_fnmsub_<mode>_maskz<round_expand_name>): Ditto.
10358 (<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name><round_name>):
10360 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
10361 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
10362 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
10363 (<avx512>_fnmsub_<mode>_mask<round_name>): Ditto.
10364 (<avx512>_fnmsub_<mode>_mask3<round_name>): Ditto.
10366 2021-09-18 liuhongt <hongtao.liu@intel.com>
10368 * config/i386/avx512fp16intrin.h (_mm512_fmaddsub_ph):
10370 (_mm512_mask_fmaddsub_ph): Likewise.
10371 (_mm512_mask3_fmaddsub_ph): Likewise.
10372 (_mm512_maskz_fmaddsub_ph): Likewise.
10373 (_mm512_fmaddsub_round_ph): Likewise.
10374 (_mm512_mask_fmaddsub_round_ph): Likewise.
10375 (_mm512_mask3_fmaddsub_round_ph): Likewise.
10376 (_mm512_maskz_fmaddsub_round_ph): Likewise.
10377 (_mm512_mask_fmsubadd_ph): Likewise.
10378 (_mm512_mask3_fmsubadd_ph): Likewise.
10379 (_mm512_maskz_fmsubadd_ph): Likewise.
10380 (_mm512_fmsubadd_round_ph): Likewise.
10381 (_mm512_mask_fmsubadd_round_ph): Likewise.
10382 (_mm512_mask3_fmsubadd_round_ph): Likewise.
10383 (_mm512_maskz_fmsubadd_round_ph): Likewise.
10384 * config/i386/avx512fp16vlintrin.h (_mm256_fmaddsub_ph):
10386 (_mm256_mask_fmaddsub_ph): Likewise.
10387 (_mm256_mask3_fmaddsub_ph): Likewise.
10388 (_mm256_maskz_fmaddsub_ph): Likewise.
10389 (_mm_fmaddsub_ph): Likewise.
10390 (_mm_mask_fmaddsub_ph): Likewise.
10391 (_mm_mask3_fmaddsub_ph): Likewise.
10392 (_mm_maskz_fmaddsub_ph): Likewise.
10393 (_mm256_fmsubadd_ph): Likewise.
10394 (_mm256_mask_fmsubadd_ph): Likewise.
10395 (_mm256_mask3_fmsubadd_ph): Likewise.
10396 (_mm256_maskz_fmsubadd_ph): Likewise.
10397 (_mm_fmsubadd_ph): Likewise.
10398 (_mm_mask_fmsubadd_ph): Likewise.
10399 (_mm_mask3_fmsubadd_ph): Likewise.
10400 (_mm_maskz_fmsubadd_ph): Likewise.
10401 * config/i386/i386-builtin.def: Add corresponding new builtins.
10402 * config/i386/sse.md (VFH_SF_AVX512VL): New mode iterator.
10403 * (<avx512>_fmsubadd_<mode>_maskz<round_expand_name>): New expander.
10404 * (<avx512>_fmaddsub_<mode>_maskz<round_expand_name>): Use
10406 * (<sd_mask_codefor>fma_fmaddsub_<mode><sd_maskz_name><round_name>):
10408 * (<avx512>_fmaddsub_<mode>_mask<round_name>): Ditto.
10409 * (<avx512>_fmaddsub_<mode>_mask3<round_name>): Ditto.
10410 * (<sd_mask_codefor>fma_fmsubadd_<mode><sd_maskz_name><round_name>):
10412 * (<avx512>_fmsubadd_<mode>_mask<round_name>): Ditto.
10413 * (<avx512>_fmsubadd_<mode>_mask3<round_name>): Ditto.
10415 2021-09-18 liuhongt <hongtao.liu@intel.com>
10418 * config/i386/i386.c (ix86_print_operand): Handle
10419 V8HF/V16HF/V32HFmode.
10420 * config/i386/i386.h (VALID_BCST_MODE_P): Add HFmode.
10421 * config/i386/sse.md (avx512bcst): Remove.
10423 2021-09-17 Martin Sebor <msebor@redhat.com>
10425 * Makefile.in (OBJS): Add gimple-predicate-analysis.o.
10426 * tree-ssa-uninit.c (max_phi_args): Move to gimple-predicate-analysis.
10427 (MASK_SET_BIT, MASK_TEST_BIT, MASK_EMPTY): Same.
10428 (check_defs): Add comment.
10429 (can_skip_redundant_opnd): Update comment.
10430 (compute_uninit_opnds_pos): Adjust to namespace change.
10431 (find_pdom): Move to gimple-predicate-analysis.cc.
10433 (struct uninit_undef_val_t): New.
10434 (is_non_loop_exit_postdominating): Move to gimple-predicate-analysis.cc.
10435 (find_control_equiv_block): Same.
10436 (MAX_NUM_CHAINS, MAX_CHAIN_LEN, MAX_POSTDOM_CHECK): Same.
10437 (MAX_SWITCH_CASES): Same.
10438 (compute_control_dep_chain): Same.
10439 (find_uninit_use): Use predicate analyzer.
10440 (struct pred_info): Move to gimple-predicate-analysis.
10441 (convert_control_dep_chain_into_preds): Same.
10442 (find_predicates): Same.
10443 (collect_phi_def_edges): Same.
10444 (warn_uninitialized_phi): Use predicate analyzer.
10445 (find_def_preds): Move to gimple-predicate-analysis.
10446 (dump_pred_info): Same.
10447 (dump_pred_chain): Same.
10448 (dump_predicates): Same.
10449 (destroy_predicate_vecs): Remove.
10450 (execute_late_warn_uninitialized): New.
10451 (get_cmp_code): Move to gimple-predicate-analysis.
10452 (is_value_included_in): Same.
10453 (value_sat_pred_p): Same.
10454 (find_matching_predicate_in_rest_chains): Same.
10455 (is_use_properly_guarded): Same.
10456 (prune_uninit_phi_opnds): Same.
10457 (find_var_cmp_const): Same.
10458 (use_pred_not_overlap_with_undef_path_pred): Same.
10459 (pred_equal_p): Same.
10460 (is_neq_relop_p): Same.
10461 (is_neq_zero_form_p): Same.
10462 (pred_expr_equal_p): Same.
10463 (is_pred_expr_subset_of): Same.
10464 (is_pred_chain_subset_of): Same.
10465 (is_included_in): Same.
10466 (is_superset_of): Same.
10467 (pred_neg_p): Same.
10468 (simplify_pred): Same.
10469 (simplify_preds_2): Same.
10470 (simplify_preds_3): Same.
10471 (simplify_preds_4): Same.
10472 (simplify_preds): Same.
10474 (push_to_worklist): Same.
10475 (get_pred_info_from_cmp): Same.
10476 (is_degenerated_phi): Same.
10477 (normalize_one_pred_1): Same.
10478 (normalize_one_pred): Same.
10479 (normalize_one_pred_chain): Same.
10480 (normalize_preds): Same.
10481 (can_one_predicate_be_invalidated_p): Same.
10482 (can_chain_union_be_invalidated_p): Same.
10483 (uninit_uses_cannot_happen): Same.
10484 (pass_late_warn_uninitialized::execute): Define.
10485 * gimple-predicate-analysis.cc: New file.
10486 * gimple-predicate-analysis.h: New file.
10488 2021-09-17 Julian Brown <julian@codesourcery.com>
10490 * config/gcn/gcn.c (gimple.h): Include.
10491 (gcn_fork_join): Emit barrier for worker-level joins.
10492 * omp-oacc-neuter-broadcast.cc (find_local_vars_to_propagate): Add
10493 writes_gang_private bitmap parameter. Set bit for blocks
10494 containing gang-private variable writes.
10495 (worker_single_simple): Don't emit barrier after predicated block.
10496 (worker_single_copy): Don't emit barrier if we're not broadcasting
10497 anything and the block contains no gang-private writes.
10498 (neuter_worker_single): Don't predicate blocks that only contain
10499 NOPs or internal marker functions. Pass has_gang_private_write
10500 argument to worker_single_copy.
10501 (oacc_do_neutering): Add writes_gang_private bitmap handling.
10503 2021-09-17 Julian Brown <julian@codesourcery.com>
10505 * config/gcn/gcn-protos.h
10506 (gcn_goacc_create_worker_broadcast_record): Update prototype.
10507 * config/gcn/gcn-tree.c (gcn_goacc_get_worker_red_decl): Use
10508 preallocated block of LDS memory. Do not cache/share decls for
10509 reduction temporaries between invocations.
10510 (gcn_goacc_reduction_teardown): Unshare VAR on second use.
10511 (gcn_goacc_create_worker_broadcast_record): Add OFFSET parameter
10512 and return temporary LDS space at that offset. Return pointer in
10514 * config/gcn/gcn.c (acc_lds_size, gang_private_hwm, lds_allocs):
10516 (ACC_LDS_SIZE): Define as acc_lds_size.
10517 (gcn_init_machine_status): Don't initialise lds_allocated,
10518 lds_allocs, reduc_decls fields of machine function struct.
10519 (gcn_option_override): Handle default size for gang-private
10520 variables and -mgang-private-size option.
10521 (gcn_expand_prologue): Use LDS_SIZE instead of LDS_SIZE-1 when
10522 initialising M0_REG.
10523 (gcn_shared_mem_layout): New function.
10524 (gcn_print_lds_decl): Update comment. Use global lds_allocs map and
10525 gang_private_hwm variable.
10526 (TARGET_GOACC_SHARED_MEM_LAYOUT): Define target hook.
10527 * config/gcn/gcn.h (machine_function): Remove lds_allocated,
10528 lds_allocs, reduc_decls. Add reduction_base, reduction_limit.
10529 * config/gcn/gcn.opt (gang_private_size_opt): New global.
10530 (mgang-private-size=): New option.
10531 * doc/tm.texi.in (TARGET_GOACC_SHARED_MEM_LAYOUT): Place
10532 documentation hook.
10533 * doc/tm.texi: Regenerate.
10534 * omp-oacc-neuter-broadcast.cc (targhooks.h, diagnostic-core.h):
10536 (build_sender_ref): Handle sender_decl being pointer.
10537 (worker_single_copy): Add PLACEMENT and ISOLATE_BROADCASTS
10538 parameters. Pass placement argument to
10539 create_worker_broadcast_record hook invocations. Handle
10540 sender_decl being pointer and isolate_broadcasts inserting extra
10542 (blk_offset_map_t): Add typedef.
10543 (neuter_worker_single): Add BLK_OFFSET_MAP parameter. Pass
10544 preallocated range to worker_single_copy call.
10545 (dfs_broadcast_reachable_1): New function.
10546 (idx_decl_pair_t, used_range_vec_t): New typedefs.
10547 (sort_size_descending): New function.
10548 (addr_range): New class.
10549 (splay_tree_compare_addr_range, splay_tree_free_key)
10550 (first_fit_range, merge_ranges_1, merge_ranges): New functions.
10551 (execute_omp_oacc_neuter_broadcast): Rename to...
10552 (oacc_do_neutering): ... this. Add BOUNDS_LO, BOUNDS_HI
10553 parameters. Arrange layout of shared memory for broadcast
10555 (execute_omp_oacc_neuter_broadcast): New function.
10556 (pass_omp_oacc_neuter_broadcast::gate): Remove num_workers==1
10557 handling from here. Enable pass for all OpenACC routines in order
10558 to call shared memory-layout hook.
10559 * target.def (create_worker_broadcast_record): Add OFFSET
10561 (shared_mem_layout): New hook.
10563 2021-09-17 Julian Brown <julian@codesourcery.com>
10564 Thomas Schwinge <thomas@codesourcery.com>
10566 * omp-oacc-neuter-broadcast.cc
10567 (pass_omp_oacc_neuter_broadcast::gate): Disable if num_workers is
10569 (execute_omp_oacc_neuter_broadcast): Adjust.
10571 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
10573 * value-relation.cc (class equiv_chain): Move to header file.
10574 (path_oracle::path_oracle): New.
10575 (path_oracle::~path_oracle): New.
10576 (path_oracle::register_relation): New.
10577 (path_oracle::query_relation): New.
10578 (path_oracle::reset_path): New.
10579 (path_oracle::dump): New.
10580 * value-relation.h (class equiv_chain): Move to here.
10581 (class path_oracle): New.
10583 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
10585 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a DOM
10587 * gimple-range-fold.cc (fur_depend::register_relation): Use
10588 register_stmt/edge routines.
10589 * value-relation.cc (equiv_chain::find): Relocate from equiv_oracle.
10590 (equiv_oracle::equiv_oracle): Create self equivalence cache.
10591 (equiv_oracle::~equiv_oracle): Release same.
10592 (equiv_oracle::equiv_set): Return entry from self equiv cache if there
10593 are no equivalences.
10594 (equiv_oracle::find_equiv_block): Move list find to equiv_chain.
10595 (equiv_oracle::register_relation): Rename from register_equiv.
10596 (relation_chain_head::find_relation): Relocate from dom_oracle.
10597 (relation_oracle::register_stmt): New.
10598 (relation_oracle::register_edge): New.
10599 (dom_oracle::*): Rename from relation_oracle.
10600 (dom_oracle::register_relation): Adjust to call equiv_oracle.
10601 (dom_oracle::set_one_relation): Split from register_relation.
10602 (dom_oracle::register_transitives): Consolidate 2 methods.
10603 (dom_oracle::find_relation_block): Move core to relation_chain.
10604 (dom_oracle::query_relation): Rename from find_relation_dom and adjust.
10605 * value-relation.h (class relation_oracle): New pure virtual base.
10606 (class equiv_oracle): Inherit from relation_oracle and adjust.
10607 (class dom_oracle): Rename from old relation_oracle and adjust.
10609 2021-09-17 Martin Sebor <msebor@redhat.com>
10611 PR middle-end/102200
10612 * pointer-query.cc (access_ref::inform_access): Handle MIN/MAX_EXPR.
10613 (handle_min_max_size): Change argument. Store original SSA_NAME for
10614 operands to potentially distinct (sub)objects.
10615 (compute_objsize_r): Adjust call to the above.
10617 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10619 * config/rs6000/rs6000.c (rs6000-builtins.h): New include.
10620 (rs6000_new_builtin_vectorized_function): New function.
10621 (rs6000_new_builtin_md_vectorized_function): Likewise.
10622 (rs6000_builtin_vectorized_function): Call
10623 rs6000_new_builtin_vectorized_function.
10624 (rs6000_builtin_md_vectorized_function): Call
10625 rs6000_new_builtin_md_vectorized_function.
10627 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10629 * config/rs6000/rs6000-builtin-new.def (ASSEMBLE_ACC): Add mmaint flag.
10630 (ASSEMBLE_PAIR): Likewise.
10631 (BUILD_ACC): Likewise.
10632 (DISASSEMBLE_ACC): Likewise.
10633 (DISASSEMBLE_PAIR): Likewise.
10634 (PMXVBF16GER2): Likewise.
10635 (PMXVBF16GER2NN): Likewise.
10636 (PMXVBF16GER2NP): Likewise.
10637 (PMXVBF16GER2PN): Likewise.
10638 (PMXVBF16GER2PP): Likewise.
10639 (PMXVF16GER2): Likewise.
10640 (PMXVF16GER2NN): Likewise.
10641 (PMXVF16GER2NP): Likewise.
10642 (PMXVF16GER2PN): Likewise.
10643 (PMXVF16GER2PP): Likewise.
10644 (PMXVF32GER): Likewise.
10645 (PMXVF32GERNN): Likewise.
10646 (PMXVF32GERNP): Likewise.
10647 (PMXVF32GERPN): Likewise.
10648 (PMXVF32GERPP): Likewise.
10649 (PMXVF64GER): Likewise.
10650 (PMXVF64GERNN): Likewise.
10651 (PMXVF64GERNP): Likewise.
10652 (PMXVF64GERPN): Likewise.
10653 (PMXVF64GERPP): Likewise.
10654 (PMXVI16GER2): Likewise.
10655 (PMXVI16GER2PP): Likewise.
10656 (PMXVI16GER2S): Likewise.
10657 (PMXVI16GER2SPP): Likewise.
10658 (PMXVI4GER8): Likewise.
10659 (PMXVI4GER8PP): Likewise.
10660 (PMXVI8GER4): Likewise.
10661 (PMXVI8GER4PP): Likewise.
10662 (PMXVI8GER4SPP): Likewise.
10663 (XVBF16GER2): Likewise.
10664 (XVBF16GER2NN): Likewise.
10665 (XVBF16GER2NP): Likewise.
10666 (XVBF16GER2PN): Likewise.
10667 (XVBF16GER2PP): Likewise.
10668 (XVF16GER2): Likewise.
10669 (XVF16GER2NN): Likewise.
10670 (XVF16GER2NP): Likewise.
10671 (XVF16GER2PN): Likewise.
10672 (XVF16GER2PP): Likewise.
10673 (XVF32GER): Likewise.
10674 (XVF32GERNN): Likewise.
10675 (XVF32GERNP): Likewise.
10676 (XVF32GERPN): Likewise.
10677 (XVF32GERPP): Likewise.
10678 (XVF64GER): Likewise.
10679 (XVF64GERNN): Likewise.
10680 (XVF64GERNP): Likewise.
10681 (XVF64GERPN): Likewise.
10682 (XVF64GERPP): Likewise.
10683 (XVI16GER2): Likewise.
10684 (XVI16GER2PP): Likewise.
10685 (XVI16GER2S): Likewise.
10686 (XVI16GER2SPP): Likewise.
10687 (XVI4GER8): Likewise.
10688 (XVI4GER8PP): Likewise.
10689 (XVI8GER4): Likewise.
10690 (XVI8GER4PP): Likewise.
10691 (XVI8GER4SPP): Likewise.
10692 (XXMFACC): Likewise.
10693 (XXMTACC): Likewise.
10694 (XXSETACCZ): Likewise.
10695 (ASSEMBLE_PAIR_V): Likewise.
10696 (BUILD_PAIR): Likewise.
10697 (DISASSEMBLE_PAIR_V): Likewise.
10700 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_mma_builtin):
10701 Handle RS6000_BIF_LXVP and RS6000_BIF_STXVP.
10702 * config/rs6000/rs6000-gen-builtins.c (attrinfo): Add ismmaint.
10703 (parse_bif_attrs): Handle ismmaint.
10704 (write_decls): Add bif_mmaint_bit and bif_is_mmaint.
10705 (write_bif_static_init): Handle ismmaint.
10707 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10709 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin): New
10711 (rs6000_gimple_fold_builtin): Call rs6000_gimple_fold_new_builtin.
10712 (rs6000_new_builtin_valid_without_lhs): New function.
10713 (rs6000_gimple_fold_new_mma_builtin): Likewise.
10714 (rs6000_gimple_fold_new_builtin): Likewise.
10716 2021-09-17 Thomas Schwinge <thomas@codesourcery.com>
10718 * hash-table.h (hash_table<Descriptor, Lazy, Allocator>::expand):
10719 Destruct stale Value objects.
10720 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
10723 2021-09-17 Roger Sayle <roger@nextmovesoftware.com>
10726 * match.pd (shift optimizations): Disable recent sign-changing
10727 optimization for shifts by zero, these will be folded later.
10729 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10731 * config/rs6000/rs6000-builtin-new.def (__builtin_mffsl): Move from
10732 [power9] to [always].
10734 2021-09-17 Richard Biener <rguenther@suse.de>
10736 * tree-vect-stmts.c (vectorizable_load): Do not frob
10739 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10741 * config/i386/i386-features.c (remove_partial_avx_dependency):
10742 Also check TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY and
10743 and TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY before generating
10745 * config/i386/i386.h (TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY):
10747 (TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
10748 * config/i386/i386.md (SSE FP to FP splitters): Replace
10749 TARGET_SSE_PARTIAL_REG_DEPENDENCY with
10750 TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY.
10751 (SSE INT to FP splitter): Replace TARGET_SSE_PARTIAL_REG_DEPENDENCY
10752 with TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY.
10753 * config/i386/x86-tune.def
10754 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): New.
10755 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
10757 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10760 * config/i386/i386-features.c (remove_partial_avx_dependency):
10761 Check TARGET_USE_VECTOR_FP_CONVERTS and TARGET_USE_VECTOR_CONVERTS
10762 before generating vxorps.
10764 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10766 * config/i386/i386-options.c (processor_cost_table): Use
10767 tremont_cost for Tremont.
10768 * config/i386/x86-tune-costs.h (tremont_memcpy): New.
10769 (tremont_memset): Likewise.
10770 (tremont_cost): Likewise.
10771 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
10772 Enable for Tremont.
10774 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10776 * common/config/i386/i386-common.c: Use Haswell scheduling model
10778 * config/i386/i386.c (ix86_sched_init_global): Prepare for Tremont
10780 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Tremont
10782 (ix86_adjust_cost): Handle Tremont.
10783 * config/i386/x86-tune.def (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY):
10784 Enable for Tremont.
10785 (X86_TUNE_USE_LEAVE): Likewise.
10786 (X86_TUNE_PUSH_MEMORY): Likewise.
10787 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
10788 (X86_TUNE_USE_CLTD): Likewise.
10789 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
10790 (X86_TUNE_AVOID_MFENCE): Likewise.
10791 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
10792 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
10793 (X86_TUNE_ACCUMULATE_OUTGOING_ARGS): Disable for Tremont.
10794 (X86_TUNE_FOUR_JUMP_LIMIT): Likewise.
10795 (X86_TUNE_OPT_AGU): Likewise.
10796 (X86_TUNE_AVOID_LEA_FOR_ADDR): Likewise.
10797 (X86_TUNE_AVOID_MEM_OPND_FOR_CMOVE): Likewise.
10798 (X86_TUNE_EXPAND_ABS): Likewise.
10799 (X86_TUNE_SPLIT_MEM_OPND_FOR_FP_CONVERTS): Likewise.
10800 (X86_TUNE_SLOW_PSHUFB): Likewise.
10802 2021-09-17 Eric Botcazou <ebotcazou@adacore.com>
10804 PR rtl-optimization/102306
10805 * combine.c (try_combine): Abort the combination if we are about to
10806 duplicate volatile references.
10808 2021-09-17 liuhongt <hongtao.liu@intel.com>
10810 * config/i386/avx512fp16intrin.h (_mm_undefined_ph):
10812 (_mm256_undefined_ph): Likewise.
10813 (_mm512_undefined_ph): Likewise.
10814 (_mm_cvtsh_h): Likewise.
10815 (_mm256_cvtsh_h): Likewise.
10816 (_mm512_cvtsh_h): Likewise.
10817 (_mm512_castph_ps): Likewise.
10818 (_mm512_castph_pd): Likewise.
10819 (_mm512_castph_si512): Likewise.
10820 (_mm512_castph512_ph128): Likewise.
10821 (_mm512_castph512_ph256): Likewise.
10822 (_mm512_castph128_ph512): Likewise.
10823 (_mm512_castph256_ph512): Likewise.
10824 (_mm512_zextph128_ph512): Likewise.
10825 (_mm512_zextph256_ph512): Likewise.
10826 (_mm512_castps_ph): Likewise.
10827 (_mm512_castpd_ph): Likewise.
10828 (_mm512_castsi512_ph): Likewise.
10829 * config/i386/avx512fp16vlintrin.h (_mm_castph_ps):
10831 (_mm256_castph_ps): Likewise.
10832 (_mm_castph_pd): Likewise.
10833 (_mm256_castph_pd): Likewise.
10834 (_mm_castph_si128): Likewise.
10835 (_mm256_castph_si256): Likewise.
10836 (_mm_castps_ph): Likewise.
10837 (_mm256_castps_ph): Likewise.
10838 (_mm_castpd_ph): Likewise.
10839 (_mm256_castpd_ph): Likewise.
10840 (_mm_castsi128_ph): Likewise.
10841 (_mm256_castsi256_ph): Likewise.
10842 (_mm256_castph256_ph128): Likewise.
10843 (_mm256_castph128_ph256): Likewise.
10844 (_mm256_zextph128_ph256): Likewise.
10846 2021-09-17 liuhongt <hongtao.liu@intel.com>
10848 * config/i386/avx512fp16intrin.h (_mm_cvtsh_ss):
10850 (_mm_mask_cvtsh_ss): Likewise.
10851 (_mm_maskz_cvtsh_ss): Likewise.
10852 (_mm_cvtsh_sd): Likewise.
10853 (_mm_mask_cvtsh_sd): Likewise.
10854 (_mm_maskz_cvtsh_sd): Likewise.
10855 (_mm_cvt_roundsh_ss): Likewise.
10856 (_mm_mask_cvt_roundsh_ss): Likewise.
10857 (_mm_maskz_cvt_roundsh_ss): Likewise.
10858 (_mm_cvt_roundsh_sd): Likewise.
10859 (_mm_mask_cvt_roundsh_sd): Likewise.
10860 (_mm_maskz_cvt_roundsh_sd): Likewise.
10861 (_mm_cvtss_sh): Likewise.
10862 (_mm_mask_cvtss_sh): Likewise.
10863 (_mm_maskz_cvtss_sh): Likewise.
10864 (_mm_cvtsd_sh): Likewise.
10865 (_mm_mask_cvtsd_sh): Likewise.
10866 (_mm_maskz_cvtsd_sh): Likewise.
10867 (_mm_cvt_roundss_sh): Likewise.
10868 (_mm_mask_cvt_roundss_sh): Likewise.
10869 (_mm_maskz_cvt_roundss_sh): Likewise.
10870 (_mm_cvt_roundsd_sh): Likewise.
10871 (_mm_mask_cvt_roundsd_sh): Likewise.
10872 (_mm_maskz_cvt_roundsd_sh): Likewise.
10873 * config/i386/i386-builtin-types.def
10874 (V8HF_FTYPE_V2DF_V8HF_V8HF_UQI_INT,
10875 V8HF_FTYPE_V4SF_V8HF_V8HF_UQI_INT,
10876 V2DF_FTYPE_V8HF_V2DF_V2DF_UQI_INT,
10877 V4SF_FTYPE_V8HF_V4SF_V4SF_UQI_INT): Add new builtin types.
10878 * config/i386/i386-builtin.def: Add corrresponding new builtins.
10879 * config/i386/i386-expand.c: Handle new builtin types.
10880 * config/i386/sse.md (VF48_128): New mode iterator.
10881 (avx512fp16_vcvtsh2<ssescalarmodesuffix><mask_scalar_name><round_saeonly_scalar_name>):
10883 (avx512fp16_vcvt<ssescalarmodesuffix>2sh<mask_scalar_name><round_scalar_name>):
10886 2021-09-17 liuhongt <hongtao.liu@intel.com>
10888 * config/i386/avx512fp16intrin.h (_mm512_cvtph_pd):
10890 (_mm512_mask_cvtph_pd): Likewise.
10891 (_mm512_maskz_cvtph_pd): Likewise.
10892 (_mm512_cvt_roundph_pd): Likewise.
10893 (_mm512_mask_cvt_roundph_pd): Likewise.
10894 (_mm512_maskz_cvt_roundph_pd): Likewise.
10895 (_mm512_cvtxph_ps): Likewise.
10896 (_mm512_mask_cvtxph_ps): Likewise.
10897 (_mm512_maskz_cvtxph_ps): Likewise.
10898 (_mm512_cvtx_roundph_ps): Likewise.
10899 (_mm512_mask_cvtx_roundph_ps): Likewise.
10900 (_mm512_maskz_cvtx_roundph_ps): Likewise.
10901 (_mm512_cvtxps_ph): Likewise.
10902 (_mm512_mask_cvtxps_ph): Likewise.
10903 (_mm512_maskz_cvtxps_ph): Likewise.
10904 (_mm512_cvtx_roundps_ph): Likewise.
10905 (_mm512_mask_cvtx_roundps_ph): Likewise.
10906 (_mm512_maskz_cvtx_roundps_ph): Likewise.
10907 (_mm512_cvtpd_ph): Likewise.
10908 (_mm512_mask_cvtpd_ph): Likewise.
10909 (_mm512_maskz_cvtpd_ph): Likewise.
10910 (_mm512_cvt_roundpd_ph): Likewise.
10911 (_mm512_mask_cvt_roundpd_ph): Likewise.
10912 (_mm512_maskz_cvt_roundpd_ph): Likewise.
10913 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_pd):
10915 (_mm_mask_cvtph_pd): Likewise.
10916 (_mm_maskz_cvtph_pd): Likewise.
10917 (_mm256_cvtph_pd): Likewise.
10918 (_mm256_mask_cvtph_pd): Likewise.
10919 (_mm256_maskz_cvtph_pd): Likewise.
10920 (_mm_cvtxph_ps): Likewise.
10921 (_mm_mask_cvtxph_ps): Likewise.
10922 (_mm_maskz_cvtxph_ps): Likewise.
10923 (_mm256_cvtxph_ps): Likewise.
10924 (_mm256_mask_cvtxph_ps): Likewise.
10925 (_mm256_maskz_cvtxph_ps): Likewise.
10926 (_mm_cvtxps_ph): Likewise.
10927 (_mm_mask_cvtxps_ph): Likewise.
10928 (_mm_maskz_cvtxps_ph): Likewise.
10929 (_mm256_cvtxps_ph): Likewise.
10930 (_mm256_mask_cvtxps_ph): Likewise.
10931 (_mm256_maskz_cvtxps_ph): Likewise.
10932 (_mm_cvtpd_ph): Likewise.
10933 (_mm_mask_cvtpd_ph): Likewise.
10934 (_mm_maskz_cvtpd_ph): Likewise.
10935 (_mm256_cvtpd_ph): Likewise.
10936 (_mm256_mask_cvtpd_ph): Likewise.
10937 (_mm256_maskz_cvtpd_ph): Likewise.
10938 * config/i386/i386-builtin.def: Add corresponding new builtins.
10939 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10940 * config/i386/i386-expand.c: Handle new builtin types.
10941 * config/i386/sse.md
10942 (VF4_128_8_256): New.
10943 (VF48H_AVX512VL): Ditto.
10944 (ssePHmode): Add HF vector modes.
10945 (castmode): Add new convertable modes.
10946 (qq2phsuff): Ditto.
10947 (ph2pssuffix): New.
10948 (avx512fp16_vcvt<castmode>2ph_<mode><mask_name><round_name>): Ditto.
10949 (avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
10950 (*avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
10951 (avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
10952 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
10953 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask_1): Ditto.
10954 (avx512fp16_float_extend_ph<mode>2<mask_name><round_saeonly_name>):
10956 (avx512fp16_float_extend_ph<mode>2<mask_name>): Ditto.
10957 (*avx512fp16_float_extend_ph<mode>2_load<mask_name>): Ditto.
10958 (avx512fp16_float_extend_phv2df2<mask_name>): Ditto.
10959 (*avx512fp16_float_extend_phv2df2_load<mask_name>): Ditto.
10961 2021-09-17 liuhongt <hongtao.liu@intel.com>
10963 * config/i386/avx512fp16intrin.h (_mm_cvttsh_i32):
10965 (_mm_cvttsh_u32): Likewise.
10966 (_mm_cvtt_roundsh_i32): Likewise.
10967 (_mm_cvtt_roundsh_u32): Likewise.
10968 (_mm_cvttsh_i64): Likewise.
10969 (_mm_cvttsh_u64): Likewise.
10970 (_mm_cvtt_roundsh_i64): Likewise.
10971 (_mm_cvtt_roundsh_u64): Likewise.
10972 * config/i386/i386-builtin.def: Add corresponding new builtins.
10973 * config/i386/sse.md
10974 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<round_saeonly_name>):
10977 2021-09-17 liuhongt <hongtao.liu@intel.com>
10979 * config/i386/avx512fp16intrin.h (_mm512_cvttph_epi32):
10981 (_mm512_mask_cvttph_epi32): Likewise.
10982 (_mm512_maskz_cvttph_epi32): Likewise.
10983 (_mm512_cvtt_roundph_epi32): Likewise.
10984 (_mm512_mask_cvtt_roundph_epi32): Likewise.
10985 (_mm512_maskz_cvtt_roundph_epi32): Likewise.
10986 (_mm512_cvttph_epu32): Likewise.
10987 (_mm512_mask_cvttph_epu32): Likewise.
10988 (_mm512_maskz_cvttph_epu32): Likewise.
10989 (_mm512_cvtt_roundph_epu32): Likewise.
10990 (_mm512_mask_cvtt_roundph_epu32): Likewise.
10991 (_mm512_maskz_cvtt_roundph_epu32): Likewise.
10992 (_mm512_cvttph_epi64): Likewise.
10993 (_mm512_mask_cvttph_epi64): Likewise.
10994 (_mm512_maskz_cvttph_epi64): Likewise.
10995 (_mm512_cvtt_roundph_epi64): Likewise.
10996 (_mm512_mask_cvtt_roundph_epi64): Likewise.
10997 (_mm512_maskz_cvtt_roundph_epi64): Likewise.
10998 (_mm512_cvttph_epu64): Likewise.
10999 (_mm512_mask_cvttph_epu64): Likewise.
11000 (_mm512_maskz_cvttph_epu64): Likewise.
11001 (_mm512_cvtt_roundph_epu64): Likewise.
11002 (_mm512_mask_cvtt_roundph_epu64): Likewise.
11003 (_mm512_maskz_cvtt_roundph_epu64): Likewise.
11004 (_mm512_cvttph_epi16): Likewise.
11005 (_mm512_mask_cvttph_epi16): Likewise.
11006 (_mm512_maskz_cvttph_epi16): Likewise.
11007 (_mm512_cvtt_roundph_epi16): Likewise.
11008 (_mm512_mask_cvtt_roundph_epi16): Likewise.
11009 (_mm512_maskz_cvtt_roundph_epi16): Likewise.
11010 (_mm512_cvttph_epu16): Likewise.
11011 (_mm512_mask_cvttph_epu16): Likewise.
11012 (_mm512_maskz_cvttph_epu16): Likewise.
11013 (_mm512_cvtt_roundph_epu16): Likewise.
11014 (_mm512_mask_cvtt_roundph_epu16): Likewise.
11015 (_mm512_maskz_cvtt_roundph_epu16): Likewise.
11016 * config/i386/avx512fp16vlintrin.h (_mm_cvttph_epi32):
11018 (_mm_mask_cvttph_epi32): Likewise.
11019 (_mm_maskz_cvttph_epi32): Likewise.
11020 (_mm256_cvttph_epi32): Likewise.
11021 (_mm256_mask_cvttph_epi32): Likewise.
11022 (_mm256_maskz_cvttph_epi32): Likewise.
11023 (_mm_cvttph_epu32): Likewise.
11024 (_mm_mask_cvttph_epu32): Likewise.
11025 (_mm_maskz_cvttph_epu32): Likewise.
11026 (_mm256_cvttph_epu32): Likewise.
11027 (_mm256_mask_cvttph_epu32): Likewise.
11028 (_mm256_maskz_cvttph_epu32): Likewise.
11029 (_mm_cvttph_epi64): Likewise.
11030 (_mm_mask_cvttph_epi64): Likewise.
11031 (_mm_maskz_cvttph_epi64): Likewise.
11032 (_mm256_cvttph_epi64): Likewise.
11033 (_mm256_mask_cvttph_epi64): Likewise.
11034 (_mm256_maskz_cvttph_epi64): Likewise.
11035 (_mm_cvttph_epu64): Likewise.
11036 (_mm_mask_cvttph_epu64): Likewise.
11037 (_mm_maskz_cvttph_epu64): Likewise.
11038 (_mm256_cvttph_epu64): Likewise.
11039 (_mm256_mask_cvttph_epu64): Likewise.
11040 (_mm256_maskz_cvttph_epu64): Likewise.
11041 (_mm_cvttph_epi16): Likewise.
11042 (_mm_mask_cvttph_epi16): Likewise.
11043 (_mm_maskz_cvttph_epi16): Likewise.
11044 (_mm256_cvttph_epi16): Likewise.
11045 (_mm256_mask_cvttph_epi16): Likewise.
11046 (_mm256_maskz_cvttph_epi16): Likewise.
11047 (_mm_cvttph_epu16): Likewise.
11048 (_mm_mask_cvttph_epu16): Likewise.
11049 (_mm_maskz_cvttph_epu16): Likewise.
11050 (_mm256_cvttph_epu16): Likewise.
11051 (_mm256_mask_cvttph_epu16): Likewise.
11052 (_mm256_maskz_cvttph_epu16): Likewise.
11053 * config/i386/i386-builtin.def: Add new builtins.
11054 * config/i386/sse.md
11055 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name><round_saeonly_name>):
11057 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name>): Ditto.
11058 (*avx512fp16_fix<fixunssuffix>_trunc<mode>2_load<mask_name>): Ditto.
11059 (avx512fp16_fix<fixunssuffix>_truncv2di2<mask_name>): Ditto.
11060 (avx512fp16_fix<fixunssuffix>_truncv2di2_load<mask_name>): Ditto.
11062 2021-09-17 liuhongt <hongtao.liu@intel.com>
11064 * config/i386/avx512fp16intrin.h (_mm_cvtsh_i32): New intrinsic.
11065 (_mm_cvtsh_u32): Likewise.
11066 (_mm_cvt_roundsh_i32): Likewise.
11067 (_mm_cvt_roundsh_u32): Likewise.
11068 (_mm_cvtsh_i64): Likewise.
11069 (_mm_cvtsh_u64): Likewise.
11070 (_mm_cvt_roundsh_i64): Likewise.
11071 (_mm_cvt_roundsh_u64): Likewise.
11072 (_mm_cvti32_sh): Likewise.
11073 (_mm_cvtu32_sh): Likewise.
11074 (_mm_cvt_roundi32_sh): Likewise.
11075 (_mm_cvt_roundu32_sh): Likewise.
11076 (_mm_cvti64_sh): Likewise.
11077 (_mm_cvtu64_sh): Likewise.
11078 (_mm_cvt_roundi64_sh): Likewise.
11079 (_mm_cvt_roundu64_sh): Likewise.
11080 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11081 * config/i386/i386-builtin.def: Add corresponding new builtins.
11082 * config/i386/i386-expand.c (ix86_expand_round_builtin):
11083 Handle new builtin types.
11084 * config/i386/sse.md
11085 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix><round_name>):
11087 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix>_2): Likewise.
11088 (avx512fp16_vcvt<floatsuffix>si2sh<rex64namesuffix><round_name>): Likewise.
11090 2021-09-16 Bill Schmidt <wschmidt@linux.ibm.com>
11092 * config/rs6000/rs6000-c.c (rs6000-builtins.h): New include.
11093 (altivec_resolve_new_overloaded_builtin): New forward decl.
11094 (rs6000_new_builtin_type_compatible): New function.
11095 (altivec_resolve_overloaded_builtin): Call
11096 altivec_resolve_new_overloaded_builtin.
11097 (altivec_build_new_resolved_builtin): New function.
11098 (altivec_resolve_new_overloaded_builtin): Likewise.
11099 * config/rs6000/rs6000-call.c (rs6000_new_builtin_is_supported):
11101 * config/rs6000/rs6000-gen-builtins.c (write_decls): Remove _p from
11102 name of rs6000_new_builtin_is_supported.
11104 2021-09-16 Uroš Bizjak <ubizjak@gmail.com>
11106 * config/i386/i386-protos.h (ix86_decompose_address):
11107 Change return type to bool.
11108 * config/i386/i386.c (ix86_decompose_address): Ditto.
11110 2021-09-16 Tobias Burnus <tobias@codesourcery.com>
11113 * config/rs6000/t-rs6000 (build/rs6000-gen-builtins.o, build/rbtree.o):
11114 Added 'build/' to target, use build/%.o rule.
11115 (build/rs6000-gen-builtins$(build_exeext)): Add 'build/' and
11116 '$(build_exeext)' to target and 'build/' for the *.o files.
11117 (rs6000-builtins.c): Update for those changes; run rs6000-gen-builtins
11120 2021-09-16 Martin Jambor <mjambor@suse.cz>
11122 * cgraph.c (cgraph_node::dump): Do not check caller count sums if
11123 the body has been removed. Remove trailing whitespace.
11125 2021-09-16 Richard Biener <rguenther@suse.de>
11127 PR middle-end/102360
11128 * internal-fn.c (expand_DEFERRED_INIT): Make pattern-init
11129 of non-memory more robust.
11131 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11133 * config/sparc/sparc-opts.h (enum sparc_processor_type): Add LEON5
11134 * config/sparc/sparc.c (struct processor_costs): Add LEON5 costs
11135 (leon5_adjust_cost): Increase cost of store with data dependency
11136 on ALU instruction and FPU anti-dependencies.
11137 (sparc_option_override): Add LEON5 costs
11138 (sparc_adjust_cost): Add LEON5 cost adjustments
11139 * config/sparc/sparc.h: Add LEON5
11140 * config/sparc/sparc.md: Include LEON5 scheduling information
11141 * config/sparc/sparc.opt: Add LEON5
11142 * doc/invoke.texi: Add LEON5
11143 * config/sparc/leon5.md: New file.
11145 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11147 * config/sparc/sparc.md (stack_protect_set32): Add NOP to prevent
11148 sensitive sequence for B2BST errata workaround.
11150 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11152 * config/sparc/sparc.c (sparc_do_work_around_errata): Do not begin
11153 functions with atomic instruction in the UT700 errata workaround.
11155 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11157 * config/sparc/sparc.c (next_active_non_empty_insn): New function
11158 that returns next active non empty assembly instruction.
11159 (sparc_do_work_around_errata): Use new function.
11161 2021-09-16 Daniel Cederman <cederman@gaisler.com>
11163 * config/sparc/sparc.c (store_insn_p): Add predicate for store
11165 (load_insn_p): Add predicate for load attributes.
11166 (sparc_do_work_around_errata): Use new predicates.
11168 2021-09-16 Andreas Larsson <andreas@gaisler.com>
11170 * config/sparc/sparc.c (dump_target_flag_bits): Print bit names for
11173 2021-09-16 Martin Liska <mliska@suse.cz>
11175 * config/mips/netbsd.h: Fix typo in name of a macro.
11177 2021-09-16 liuhongt <hongtao.liu@intel.com>
11179 PR middle-end/102080
11180 * match.pd: Check mask type when doing cond_op related gimple
11182 * tree.c (is_truth_type_for): New function.
11183 * tree.h (is_truth_type_for): New declaration.
11185 2021-09-16 liuhongt <hongtao.liu@intel.com>
11187 * config/i386/avx512fp16intrin.h (_mm512_cvtepi32_ph): New
11189 (_mm512_mask_cvtepi32_ph): Likewise.
11190 (_mm512_maskz_cvtepi32_ph): Likewise.
11191 (_mm512_cvt_roundepi32_ph): Likewise.
11192 (_mm512_mask_cvt_roundepi32_ph): Likewise.
11193 (_mm512_maskz_cvt_roundepi32_ph): Likewise.
11194 (_mm512_cvtepu32_ph): Likewise.
11195 (_mm512_mask_cvtepu32_ph): Likewise.
11196 (_mm512_maskz_cvtepu32_ph): Likewise.
11197 (_mm512_cvt_roundepu32_ph): Likewise.
11198 (_mm512_mask_cvt_roundepu32_ph): Likewise.
11199 (_mm512_maskz_cvt_roundepu32_ph): Likewise.
11200 (_mm512_cvtepi64_ph): Likewise.
11201 (_mm512_mask_cvtepi64_ph): Likewise.
11202 (_mm512_maskz_cvtepi64_ph): Likewise.
11203 (_mm512_cvt_roundepi64_ph): Likewise.
11204 (_mm512_mask_cvt_roundepi64_ph): Likewise.
11205 (_mm512_maskz_cvt_roundepi64_ph): Likewise.
11206 (_mm512_cvtepu64_ph): Likewise.
11207 (_mm512_mask_cvtepu64_ph): Likewise.
11208 (_mm512_maskz_cvtepu64_ph): Likewise.
11209 (_mm512_cvt_roundepu64_ph): Likewise.
11210 (_mm512_mask_cvt_roundepu64_ph): Likewise.
11211 (_mm512_maskz_cvt_roundepu64_ph): Likewise.
11212 (_mm512_cvtepi16_ph): Likewise.
11213 (_mm512_mask_cvtepi16_ph): Likewise.
11214 (_mm512_maskz_cvtepi16_ph): Likewise.
11215 (_mm512_cvt_roundepi16_ph): Likewise.
11216 (_mm512_mask_cvt_roundepi16_ph): Likewise.
11217 (_mm512_maskz_cvt_roundepi16_ph): Likewise.
11218 (_mm512_cvtepu16_ph): Likewise.
11219 (_mm512_mask_cvtepu16_ph): Likewise.
11220 (_mm512_maskz_cvtepu16_ph): Likewise.
11221 (_mm512_cvt_roundepu16_ph): Likewise.
11222 (_mm512_mask_cvt_roundepu16_ph): Likewise.
11223 (_mm512_maskz_cvt_roundepu16_ph): Likewise.
11224 * config/i386/avx512fp16vlintrin.h (_mm_cvtepi32_ph): New
11226 (_mm_mask_cvtepi32_ph): Likewise.
11227 (_mm_maskz_cvtepi32_ph): Likewise.
11228 (_mm256_cvtepi32_ph): Likewise.
11229 (_mm256_mask_cvtepi32_ph): Likewise.
11230 (_mm256_maskz_cvtepi32_ph): Likewise.
11231 (_mm_cvtepu32_ph): Likewise.
11232 (_mm_mask_cvtepu32_ph): Likewise.
11233 (_mm_maskz_cvtepu32_ph): Likewise.
11234 (_mm256_cvtepu32_ph): Likewise.
11235 (_mm256_mask_cvtepu32_ph): Likewise.
11236 (_mm256_maskz_cvtepu32_ph): Likewise.
11237 (_mm_cvtepi64_ph): Likewise.
11238 (_mm_mask_cvtepi64_ph): Likewise.
11239 (_mm_maskz_cvtepi64_ph): Likewise.
11240 (_mm256_cvtepi64_ph): Likewise.
11241 (_mm256_mask_cvtepi64_ph): Likewise.
11242 (_mm256_maskz_cvtepi64_ph): Likewise.
11243 (_mm_cvtepu64_ph): Likewise.
11244 (_mm_mask_cvtepu64_ph): Likewise.
11245 (_mm_maskz_cvtepu64_ph): Likewise.
11246 (_mm256_cvtepu64_ph): Likewise.
11247 (_mm256_mask_cvtepu64_ph): Likewise.
11248 (_mm256_maskz_cvtepu64_ph): Likewise.
11249 (_mm_cvtepi16_ph): Likewise.
11250 (_mm_mask_cvtepi16_ph): Likewise.
11251 (_mm_maskz_cvtepi16_ph): Likewise.
11252 (_mm256_cvtepi16_ph): Likewise.
11253 (_mm256_mask_cvtepi16_ph): Likewise.
11254 (_mm256_maskz_cvtepi16_ph): Likewise.
11255 (_mm_cvtepu16_ph): Likewise.
11256 (_mm_mask_cvtepu16_ph): Likewise.
11257 (_mm_maskz_cvtepu16_ph): Likewise.
11258 (_mm256_cvtepu16_ph): Likewise.
11259 (_mm256_mask_cvtepu16_ph): Likewise.
11260 (_mm256_maskz_cvtepu16_ph): Likewise.
11261 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11262 * config/i386/i386-builtin.def: Add corresponding new builtins.
11263 * config/i386/i386-expand.c
11264 (ix86_expand_args_builtin): Handle new builtin types.
11265 (ix86_expand_round_builtin): Ditto.
11266 * config/i386/i386-modes.def: Declare V2HF and V6HF.
11267 * config/i386/sse.md (VI2H_AVX512VL): New.
11268 (qq2phsuff): Ditto.
11269 (sseintvecmode): Add HF vector modes.
11270 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode><mask_name><round_name>):
11272 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
11273 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
11274 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
11275 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
11276 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask_1): Ditto.
11277 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
11278 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
11279 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
11280 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
11281 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask_1): Ditto.
11282 * config/i386/subst.md (round_qq2phsuff): New subst_attr.
11284 2021-09-16 liuhongt <hongtao.liu@intel.com>
11286 * config/i386/avx512fp16intrin.h (_mm512_cvtph_epi32):
11288 (_mm512_mask_cvtph_epi32): Likewise.
11289 (_mm512_maskz_cvtph_epi32): Likewise.
11290 (_mm512_cvt_roundph_epi32): Likewise.
11291 (_mm512_mask_cvt_roundph_epi32): Likewise.
11292 (_mm512_maskz_cvt_roundph_epi32): Likewise.
11293 (_mm512_cvtph_epu32): Likewise.
11294 (_mm512_mask_cvtph_epu32): Likewise.
11295 (_mm512_maskz_cvtph_epu32): Likewise.
11296 (_mm512_cvt_roundph_epu32): Likewise.
11297 (_mm512_mask_cvt_roundph_epu32): Likewise.
11298 (_mm512_maskz_cvt_roundph_epu32): Likewise.
11299 (_mm512_cvtph_epi64): Likewise.
11300 (_mm512_mask_cvtph_epi64): Likewise.
11301 (_mm512_maskz_cvtph_epi64): Likewise.
11302 (_mm512_cvt_roundph_epi64): Likewise.
11303 (_mm512_mask_cvt_roundph_epi64): Likewise.
11304 (_mm512_maskz_cvt_roundph_epi64): Likewise.
11305 (_mm512_cvtph_epu64): Likewise.
11306 (_mm512_mask_cvtph_epu64): Likewise.
11307 (_mm512_maskz_cvtph_epu64): Likewise.
11308 (_mm512_cvt_roundph_epu64): Likewise.
11309 (_mm512_mask_cvt_roundph_epu64): Likewise.
11310 (_mm512_maskz_cvt_roundph_epu64): Likewise.
11311 (_mm512_cvtph_epi16): Likewise.
11312 (_mm512_mask_cvtph_epi16): Likewise.
11313 (_mm512_maskz_cvtph_epi16): Likewise.
11314 (_mm512_cvt_roundph_epi16): Likewise.
11315 (_mm512_mask_cvt_roundph_epi16): Likewise.
11316 (_mm512_maskz_cvt_roundph_epi16): Likewise.
11317 (_mm512_cvtph_epu16): Likewise.
11318 (_mm512_mask_cvtph_epu16): Likewise.
11319 (_mm512_maskz_cvtph_epu16): Likewise.
11320 (_mm512_cvt_roundph_epu16): Likewise.
11321 (_mm512_mask_cvt_roundph_epu16): Likewise.
11322 (_mm512_maskz_cvt_roundph_epu16): Likewise.
11323 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_epi32):
11325 (_mm_mask_cvtph_epi32): Likewise.
11326 (_mm_maskz_cvtph_epi32): Likewise.
11327 (_mm256_cvtph_epi32): Likewise.
11328 (_mm256_mask_cvtph_epi32): Likewise.
11329 (_mm256_maskz_cvtph_epi32): Likewise.
11330 (_mm_cvtph_epu32): Likewise.
11331 (_mm_mask_cvtph_epu32): Likewise.
11332 (_mm_maskz_cvtph_epu32): Likewise.
11333 (_mm256_cvtph_epu32): Likewise.
11334 (_mm256_mask_cvtph_epu32): Likewise.
11335 (_mm256_maskz_cvtph_epu32): Likewise.
11336 (_mm_cvtph_epi64): Likewise.
11337 (_mm_mask_cvtph_epi64): Likewise.
11338 (_mm_maskz_cvtph_epi64): Likewise.
11339 (_mm256_cvtph_epi64): Likewise.
11340 (_mm256_mask_cvtph_epi64): Likewise.
11341 (_mm256_maskz_cvtph_epi64): Likewise.
11342 (_mm_cvtph_epu64): Likewise.
11343 (_mm_mask_cvtph_epu64): Likewise.
11344 (_mm_maskz_cvtph_epu64): Likewise.
11345 (_mm256_cvtph_epu64): Likewise.
11346 (_mm256_mask_cvtph_epu64): Likewise.
11347 (_mm256_maskz_cvtph_epu64): Likewise.
11348 (_mm_cvtph_epi16): Likewise.
11349 (_mm_mask_cvtph_epi16): Likewise.
11350 (_mm_maskz_cvtph_epi16): Likewise.
11351 (_mm256_cvtph_epi16): Likewise.
11352 (_mm256_mask_cvtph_epi16): Likewise.
11353 (_mm256_maskz_cvtph_epi16): Likewise.
11354 (_mm_cvtph_epu16): Likewise.
11355 (_mm_mask_cvtph_epu16): Likewise.
11356 (_mm_maskz_cvtph_epu16): Likewise.
11357 (_mm256_cvtph_epu16): Likewise.
11358 (_mm256_mask_cvtph_epu16): Likewise.
11359 (_mm256_maskz_cvtph_epu16): Likewise.
11360 * config/i386/i386-builtin-types.def: Add new builtin types.
11361 * config/i386/i386-builtin.def: Add new builtins.
11362 * config/i386/i386-expand.c
11363 (ix86_expand_args_builtin): Handle new builtin types.
11364 (ix86_expand_round_builtin): Ditto.
11365 * config/i386/sse.md (sseintconvert): New.
11366 (ssePHmode): Ditto.
11367 (UNSPEC_US_FIX_NOTRUNC): Ditto.
11368 (sseintconvertsignprefix): Ditto.
11369 (avx512fp16_vcvtph2<sseintconvertsignprefix><sseintconvert>_<mode><mask_name><round_name>):
11372 2021-09-16 liuhongt <hongtao.liu@intel.com>
11374 * config/i386/avx512fp16intrin.h: (_mm_cvtsi16_si128):
11376 (_mm_cvtsi128_si16): Likewise.
11377 (_mm_mask_load_sh): Likewise.
11378 (_mm_maskz_load_sh): Likewise.
11379 (_mm_mask_store_sh): Likewise.
11380 (_mm_move_sh): Likewise.
11381 (_mm_mask_move_sh): Likewise.
11382 (_mm_maskz_move_sh): Likewise.
11383 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11384 * config/i386/i386-builtin.def: Add corresponding new builtins.
11385 * config/i386/i386-expand.c
11386 (ix86_expand_special_args_builtin): Handle new builtin types.
11387 (ix86_expand_vector_init_one_nonzero): Adjust for FP16 target.
11388 * config/i386/sse.md (VI2F): New mode iterator.
11389 (vec_set<mode>_0): Use new mode iterator.
11390 (avx512f_mov<ssescalarmodelower>_mask): Adjust for HF vector mode.
11391 (avx512f_store<mode>_mask): Ditto.
11393 2021-09-16 Kewen Lin <linkw@linux.ibm.com>
11395 * config/rs6000/rs6000.opt (-mtoc-fusion): Remove.
11397 2021-09-15 David Edelsohn <dje.gcc@gmail.com>
11399 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
11400 Proceed if no symbol summary or the symbol alias flag is false.
11402 2021-09-15 Jakub Jelinek <jakub@redhat.com>
11406 * varasm.c (output_constructor_regular_field): Instead of assertion
11407 that array_size_for_constructor result is equal to size of
11408 TREE_TYPE (local->val) in bytes, assert that the type size is greater
11409 or equal to array_size_for_constructor result and use type size as
11412 2021-09-15 Martin Liska <mliska@suse.cz>
11415 * config/i386/vxworks.h: Use new macro TARGET_CPU_P.
11417 2021-09-15 Martin Liska <mliska@suse.cz>
11420 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
11421 Check that we have a symbol summary for a symbol.
11423 2021-09-15 Richard Biener <rguenther@suse.de>
11426 * config/rs6000/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
11427 to inherit from elfos.h
11429 2021-09-15 liuhongt <hongtao.liu@intel.com>
11432 * config/i386/i386-expand.c
11433 (ix86_expand_vector_init_interleave): Use puncklwd to pack 2
11435 (ix86_expand_vector_set): Use blendw instead of pinsrw.
11436 * config/i386/i386.c (ix86_can_change_mode_class): Adjust for
11437 AVX512FP16 which supports 16bit vector load.
11438 * config/i386/sse.md (avx512bw_interleave_highv32hi<mask_name>):
11440 (avx512bw_interleave_high<mode><mask_name>): .. this, and
11441 extend to V32HFmode.
11442 (avx2_interleave_highv16hi<mask_name>): Rename to ..
11443 (avx2_interleave_high<mode><mask_name>): .. this, and extend
11445 (vec_interleave_highv8hi<mask_name>): Rename to ..
11446 (vec_interleave_high<mode><mask_name>): .. this, and extend to V8HFmode.
11447 (<mask_codefor>avx512bw_interleave_lowv32hi<mask_name>):
11449 (<mask_codefor>avx512bw_interleave_low<mode><mask_name>):
11450 this, and extend to V32HFmode.
11451 (avx2_interleave_lowv16hi<mask_name>): Rename to ..
11452 (avx2_interleave_low<mode><mask_name>): .. this, and extend to V16HFmode.
11453 (vec_interleave_lowv8hi<mask_name>): Rename to ..
11454 (vec_interleave_low<mode><mask_name>): .. this, and extend to V8HFmode.
11455 (sse4_1_pblendw): Rename to ..
11456 (sse4_1_pblend<blendsuf>): .. this, and extend to V8HFmode.
11457 (avx2_pblendph): New define_expand.
11458 (<sse2p4_1>_pinsr<ssemodesuffix>): Refactor, use
11459 sseintmodesuffix instead of ssemodesuffix.
11460 (blendsuf): New mode attr.
11462 2021-09-15 Richard Biener <rguenther@suse.de>
11464 * tree-vectorizer.h (dr_misalignment): Move out of line.
11465 (dr_target_alignment): New.
11466 (DR_TARGET_ALIGNMENT): Wrap dr_target_alignment.
11467 (set_dr_target_alignment): New.
11468 (SET_DR_TARGET_ALIGNMENT): Wrap set_dr_target_alignment.
11469 * tree-vect-data-refs.c (dr_misalignment): Compute and
11470 return the group members misalignment.
11471 (vect_compute_data_ref_alignment): Use SET_DR_TARGET_ALIGNMENT.
11472 (vect_analyze_data_refs_alignment): Compute alignment only
11473 for the first element of a DR group.
11474 (vect_slp_analyze_node_alignment): Likewise.
11476 2021-09-15 Hongyu Wang <hongyu.wang@intel.com>
11478 * config/i386/avx512fp16intrin.h: Adjust all builtin calls.
11479 * config/i386/avx512fp16vlintrin.h: Likewise.
11480 * config/i386/i386-builtin.def: Adjust builtin name and
11481 enumeration to match AVX512F style.
11483 2021-09-15 Richard Biener <rguenther@suse.de>
11485 PR tree-optimization/102318
11486 * tree-vect-loop.c (vect_transform_cycle_phi): Revert
11487 previous change and do the mode conversion separately from
11488 the sign conversion.
11490 2021-09-15 Hongtao Liu <hongtao.liu@intel.com>
11491 Peter Cordes <peter@cordes.ca>
11494 * config/i386/sse.md (extract_suf): Add V8SF/V8SI/V4DF/V4DI.
11495 (*vec_extract<mode><ssescalarmodelower>_valign): Output
11496 vextract{i,f}{32x4,64x2} instruction when byte_offset % 16 ==
11499 2021-09-15 Richard Biener <rguenther@suse.de>
11501 * config.gcc: Remove vax-*-openbsd* configuration.
11503 2021-09-15 Richard Biener <rguenther@suse.de>
11505 * config.gcc: Remove m68k-openbsd.
11507 2021-09-15 Max Filippov <jcmvbkbc@gmail.com>
11510 * config/xtensa/t-xtensa (TM_H): Add include/xtensa-config.h.
11512 2021-09-14 Peter Bergner <bergner@linux.ibm.com>
11514 * config/rs6000/mma.md (unspec): Delete UNSPEC_MMA_XXSETACCZ.
11515 (unspecv): Add UNSPECV_MMA_XXSETACCZ.
11516 (*mma_xxsetaccz): Delete.
11517 (mma_xxsetaccz): Change to define_insn. Remove operand 1.
11518 Use UNSPECV_MMA_XXSETACCZ. Update comment.
11519 * config/rs6000/rs6000.c (rs6000_rtx_costs): Use UNSPECV_MMA_XXSETACCZ.
11521 2021-09-14 Iain Sandoe <iain@sandoe.co.uk>
11523 * Makefile.in: Remove variables related to applying no-PIE
11524 to the exes on $build.
11525 * configure: Regenerate.
11526 * configure.ac: Remove configuration related to applying
11527 no-PIE to the exes on $build.
11529 2021-09-14 Claudiu Zissulescu <claziss@synopsys.com>
11531 * config/arc/arc.md (doloop_end): Add missing mode.
11532 (loop_end): Likewise.
11534 2021-09-14 Jakub Jelinek <jakub@redhat.com>
11536 * gimplify.c (goa_stabilize_expr): Add depth argument, propagate
11537 it to recursive calls, for depth above 7 just gimplify or return.
11538 Perform a test even for MODIFY_EXPR, ADDR_EXPR, COMPOUND_EXPR with
11539 __builtin_clear_padding and TARGET_EXPR.
11540 (gimplify_omp_atomic): Adjust goa_stabilize_expr callers.
11542 2021-09-14 liuhongt <hongtao.liu@intel.com>
11544 * config/i386/avx512fp16intrin.h (_mm_fpclass_sh_mask):
11546 (_mm_mask_fpclass_sh_mask): Likewise.
11547 (_mm512_mask_fpclass_ph_mask): Likewise.
11548 (_mm512_fpclass_ph_mask): Likewise.
11549 (_mm_getexp_sh): Likewise.
11550 (_mm_mask_getexp_sh): Likewise.
11551 (_mm_maskz_getexp_sh): Likewise.
11552 (_mm512_getexp_ph): Likewise.
11553 (_mm512_mask_getexp_ph): Likewise.
11554 (_mm512_maskz_getexp_ph): Likewise.
11555 (_mm_getexp_round_sh): Likewise.
11556 (_mm_mask_getexp_round_sh): Likewise.
11557 (_mm_maskz_getexp_round_sh): Likewise.
11558 (_mm512_getexp_round_ph): Likewise.
11559 (_mm512_mask_getexp_round_ph): Likewise.
11560 (_mm512_maskz_getexp_round_ph): Likewise.
11561 (_mm_getmant_sh): Likewise.
11562 (_mm_mask_getmant_sh): Likewise.
11563 (_mm_maskz_getmant_sh): Likewise.
11564 (_mm512_getmant_ph): Likewise.
11565 (_mm512_mask_getmant_ph): Likewise.
11566 (_mm512_maskz_getmant_ph): Likewise.
11567 (_mm_getmant_round_sh): Likewise.
11568 (_mm_mask_getmant_round_sh): Likewise.
11569 (_mm_maskz_getmant_round_sh): Likewise.
11570 (_mm512_getmant_round_ph): Likewise.
11571 (_mm512_mask_getmant_round_ph): Likewise.
11572 (_mm512_maskz_getmant_round_ph): Likewise.
11573 * config/i386/avx512fp16vlintrin.h (_mm_mask_fpclass_ph_mask):
11575 (_mm_fpclass_ph_mask): Likewise.
11576 (_mm256_mask_fpclass_ph_mask): Likewise.
11577 (_mm256_fpclass_ph_mask): Likewise.
11578 (_mm256_getexp_ph): Likewise.
11579 (_mm256_mask_getexp_ph): Likewise.
11580 (_mm256_maskz_getexp_ph): Likewise.
11581 (_mm_getexp_ph): Likewise.
11582 (_mm_mask_getexp_ph): Likewise.
11583 (_mm_maskz_getexp_ph): Likewise.
11584 (_mm256_getmant_ph): Likewise.
11585 (_mm256_mask_getmant_ph): Likewise.
11586 (_mm256_maskz_getmant_ph): Likewise.
11587 (_mm_getmant_ph): Likewise.
11588 (_mm_mask_getmant_ph): Likewise.
11589 (_mm_maskz_getmant_ph): Likewise.
11590 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11591 * config/i386/i386-builtin.def: Add corresponding new builtins.
11592 * config/i386/i386-expand.c
11593 (ix86_expand_args_builtin): Handle new builtin types.
11594 (ix86_expand_round_builtin): Ditto.
11595 * config/i386/sse.md (vecmemsuffix): Add HF vector modes.
11596 (<avx512>_getexp<mode><mask_name><round_saeonly_name>): Adjust
11597 to support HF vector modes.
11598 (avx512f_sgetexp<mode><mask_scalar_name><round_saeonly_scalar_name):
11600 (avx512dq_fpclass<mode><mask_scalar_merge_name>): Ditto.
11601 (avx512dq_vmfpclass<mode><mask_scalar_merge_name>): Ditto.
11602 (<avx512>_getmant<mode><mask_name><round_saeonly_name>): Ditto.
11603 (avx512f_vgetmant<mode><mask_scalar_name><round_saeonly_scalar_name>):
11606 2021-09-14 liuhongt <hongtao.liu@intel.com>
11608 * config/i386/avx512fp16intrin.h (_mm512_reduce_ph):
11610 (_mm512_mask_reduce_ph): Likewise.
11611 (_mm512_maskz_reduce_ph): Likewise.
11612 (_mm512_reduce_round_ph): Likewise.
11613 (_mm512_mask_reduce_round_ph): Likewise.
11614 (_mm512_maskz_reduce_round_ph): Likewise.
11615 (_mm_reduce_sh): Likewise.
11616 (_mm_mask_reduce_sh): Likewise.
11617 (_mm_maskz_reduce_sh): Likewise.
11618 (_mm_reduce_round_sh): Likewise.
11619 (_mm_mask_reduce_round_sh): Likewise.
11620 (_mm_maskz_reduce_round_sh): Likewise.
11621 (_mm512_roundscale_ph): Likewise.
11622 (_mm512_mask_roundscale_ph): Likewise.
11623 (_mm512_maskz_roundscale_ph): Likewise.
11624 (_mm512_roundscale_round_ph): Likewise.
11625 (_mm512_mask_roundscale_round_ph): Likewise.
11626 (_mm512_maskz_roundscale_round_ph): Likewise.
11627 (_mm_roundscale_sh): Likewise.
11628 (_mm_mask_roundscale_sh): Likewise.
11629 (_mm_maskz_roundscale_sh): Likewise.
11630 (_mm_roundscale_round_sh): Likewise.
11631 (_mm_mask_roundscale_round_sh): Likewise.
11632 (_mm_maskz_roundscale_round_sh): Likewise.
11633 * config/i386/avx512fp16vlintrin.h: (_mm_reduce_ph):
11635 (_mm_mask_reduce_ph): Likewise.
11636 (_mm_maskz_reduce_ph): Likewise.
11637 (_mm256_reduce_ph): Likewise.
11638 (_mm256_mask_reduce_ph): Likewise.
11639 (_mm256_maskz_reduce_ph): Likewise.
11640 (_mm_roundscale_ph): Likewise.
11641 (_mm_mask_roundscale_ph): Likewise.
11642 (_mm_maskz_roundscale_ph): Likewise.
11643 (_mm256_roundscale_ph): Likewise.
11644 (_mm256_mask_roundscale_ph): Likewise.
11645 (_mm256_maskz_roundscale_ph): Likewise.
11646 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11647 * config/i386/i386-builtin.def: Add corresponding new builtins.
11648 * config/i386/i386-expand.c
11649 (ix86_expand_args_builtin): Handle new builtin types.
11650 (ix86_expand_round_builtin): Ditto.
11651 * config/i386/sse.md (<mask_codefor>reducep<mode><mask_name>):
11653 (<mask_codefor>reducep<mode><mask_name><round_saeonly_name>):
11654 ... this, and adjust for round operands.
11655 (reduces<mode><mask_scalar_name>): Likewise, with ...
11656 (reduces<mode><mask_scalar_name><round_saeonly_scalar_name):
11658 (<avx512>_rndscale<mode><mask_name><round_saeonly_name>):
11659 Adjust for HF vector modes.
11660 (avx512f_rndscale<mode><mask_scalar_name><round_saeonly_scalar_name>):
11662 (*avx512f_rndscale<mode><round_saeonly_name>): Ditto.
11664 2021-09-14 liuhongt <hongtao.liu@intel.com>
11666 * config/i386/avx512fp16intrin.h: (_mm512_rcp_ph):
11668 (_mm512_mask_rcp_ph): Likewise.
11669 (_mm512_maskz_rcp_ph): Likewise.
11670 (_mm_rcp_sh): Likewise.
11671 (_mm_mask_rcp_sh): Likewise.
11672 (_mm_maskz_rcp_sh): Likewise.
11673 (_mm512_scalef_ph): Likewise.
11674 (_mm512_mask_scalef_ph): Likewise.
11675 (_mm512_maskz_scalef_ph): Likewise.
11676 (_mm512_scalef_round_ph): Likewise.
11677 (_mm512_mask_scalef_round_ph): Likewise.
11678 (_mm512_maskz_scalef_round_ph): Likewise.
11679 (_mm_scalef_sh): Likewise.
11680 (_mm_mask_scalef_sh): Likewise.
11681 (_mm_maskz_scalef_sh): Likewise.
11682 (_mm_scalef_round_sh): Likewise.
11683 (_mm_mask_scalef_round_sh): Likewise.
11684 (_mm_maskz_scalef_round_sh): Likewise.
11685 * config/i386/avx512fp16vlintrin.h (_mm_rcp_ph):
11687 (_mm256_rcp_ph): Likewise.
11688 (_mm_mask_rcp_ph): Likewise.
11689 (_mm256_mask_rcp_ph): Likewise.
11690 (_mm_maskz_rcp_ph): Likewise.
11691 (_mm256_maskz_rcp_ph): Likewise.
11692 (_mm_scalef_ph): Likewise.
11693 (_mm256_scalef_ph): Likewise.
11694 (_mm_mask_scalef_ph): Likewise.
11695 (_mm256_mask_scalef_ph): Likewise.
11696 (_mm_maskz_scalef_ph): Likewise.
11697 (_mm256_maskz_scalef_ph): Likewise.
11698 * config/i386/i386-builtin.def: Add new builtins.
11699 * config/i386/sse.md (VFH_AVX512VL): New.
11700 (avx512fp16_rcp<mode>2<mask_name>): Ditto.
11701 (avx512fp16_vmrcpv8hf2<mask_scalar_name>): Ditto.
11702 (avx512f_vmscalef<mode><mask_scalar_name><round_scalar_name>):
11703 Adjust to support HF vector modes.
11704 (<avx512>_scalef<mode><mask_name><round_name>): Ditto.
11706 2021-09-14 liuhongt <hongtao.liu@intel.com>
11708 * config/i386/avx512fp16intrin.h: (_mm512_sqrt_ph):
11710 (_mm512_mask_sqrt_ph): Likewise.
11711 (_mm512_maskz_sqrt_ph): Likewise.
11712 (_mm512_sqrt_round_ph): Likewise.
11713 (_mm512_mask_sqrt_round_ph): Likewise.
11714 (_mm512_maskz_sqrt_round_ph): Likewise.
11715 (_mm512_rsqrt_ph): Likewise.
11716 (_mm512_mask_rsqrt_ph): Likewise.
11717 (_mm512_maskz_rsqrt_ph): Likewise.
11718 (_mm_rsqrt_sh): Likewise.
11719 (_mm_mask_rsqrt_sh): Likewise.
11720 (_mm_maskz_rsqrt_sh): Likewise.
11721 (_mm_sqrt_sh): Likewise.
11722 (_mm_mask_sqrt_sh): Likewise.
11723 (_mm_maskz_sqrt_sh): Likewise.
11724 (_mm_sqrt_round_sh): Likewise.
11725 (_mm_mask_sqrt_round_sh): Likewise.
11726 (_mm_maskz_sqrt_round_sh): Likewise.
11727 * config/i386/avx512fp16vlintrin.h (_mm_sqrt_ph): New intrinsic.
11728 (_mm256_sqrt_ph): Likewise.
11729 (_mm_mask_sqrt_ph): Likewise.
11730 (_mm256_mask_sqrt_ph): Likewise.
11731 (_mm_maskz_sqrt_ph): Likewise.
11732 (_mm256_maskz_sqrt_ph): Likewise.
11733 (_mm_rsqrt_ph): Likewise.
11734 (_mm256_rsqrt_ph): Likewise.
11735 (_mm_mask_rsqrt_ph): Likewise.
11736 (_mm256_mask_rsqrt_ph): Likewise.
11737 (_mm_maskz_rsqrt_ph): Likewise.
11738 (_mm256_maskz_rsqrt_ph): Likewise.
11739 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11740 * config/i386/i386-builtin.def: Add corresponding new builtins.
11741 * config/i386/i386-expand.c
11742 (ix86_expand_args_builtin): Handle new builtins.
11743 (ix86_expand_round_builtin): Ditto.
11744 * config/i386/sse.md (VF_AVX512FP16VL): New.
11745 (sqrt<mode>2): Adjust for HF vector modes.
11746 (<sse>_sqrt<mode>2<mask_name><round_name>): Likewise.
11747 (<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
11749 (<sse>_rsqrt<mode>2<mask_name>): New.
11750 (avx512fp16_vmrsqrtv8hf2<mask_scalar_name>): Likewise.
11752 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11754 PR bootstrap/101574
11755 * diagnostic-spec.c (warning_suppressed_at, copy_warning): Handle
11756 'RESERVED_LOCATION_P' locations.
11757 * warning-control.cc (get_nowarn_spec, suppress_warning)
11758 (copy_warning): Likewise.
11760 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11762 * diagnostic-spec.h (typedef xint_hash_t): Use 'location_t' instead of...
11763 (typedef key_type_t): ... this. Remove.
11764 (nowarn_map): Document.
11765 * diagnostic-spec.c (nowarn_map): Likewise.
11766 * warning-control.cc (convert_to_key): Evolve functions into...
11767 (get_location): ... these. Adjust all users.
11769 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11771 * warning-control.cc (copy_warning): Remove 'nowarn_map' setup.
11773 2021-09-13 Jason Merrill <jason@redhat.com>
11775 * params.opt: Add destructive-interference-size and
11776 constructive-interference-size.
11777 * doc/invoke.texi: Document them.
11778 * config/aarch64/aarch64.c (aarch64_override_options_internal):
11780 * config/arm/arm.c (arm_option_override): Set them.
11781 * config/i386/i386-options.c (ix86_option_override_internal):
11784 2021-09-13 Martin Liska <mliska@suse.cz>
11785 H.J. Lu <hjl.tools@gmail.com>
11788 * common/config/i386/cpuinfo.h (cpu_indicator_init): Add support
11789 for x86-64 micro levels for __builtin_cpu_supports.
11790 * common/config/i386/i386-cpuinfo.h (enum feature_priority):
11791 Add priorities for the micro-arch levels.
11792 (enum processor_features): Add new features.
11793 * common/config/i386/i386-isas.h: Add micro-arch features.
11794 * config/i386/i386-builtins.c (get_builtin_code_for_version):
11795 Support the micro-arch levels by callsing
11796 __builtin_cpu_supports.
11797 * doc/extend.texi: Document that the levels are support by
11798 __builtin_cpu_supports.
11800 2021-09-13 Andrew Pinski <apinski@marvell.com>
11803 * config/aarch64/aarch64-builtins.c (aarch64_fold_builtin_lane_check):
11805 (aarch64_general_fold_builtin): Handle AARCH64_SIMD_BUILTIN_LANE_CHECK.
11806 (aarch64_general_gimple_fold_builtin): Likewise.
11808 2021-09-13 Andrew Pinski <apinski@marvell.com>
11810 * config.gcc: Add m32r-*-linux* and m32rle-*-linux*
11811 to the Unsupported targets list.
11812 Remove support for m32r-*-linux* and m32rle-*-linux*.
11813 * config/m32r/linux.h: Removed.
11814 * config/m32r/t-linux: Removed.
11816 2021-09-13 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
11819 * config/aarch64/aarch64.c (aarch64_classify_address): Don't allow
11820 register index for SVE predicate modes.
11822 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
11824 * tree-ssa-threadbackward.c
11825 (back_threader_profitability::profitable_path_p): Remove FSM
11827 (back_threader_registry::register_path): Same.
11828 * tree-ssa-threadedge.c
11829 (jump_threader::simplify_control_stmt_condition): Same.
11830 * tree-ssa-threadupdate.c (jt_path_registry::jt_path_registry):
11831 Add backedge_threads argument.
11832 (fwd_jt_path_registry::fwd_jt_path_registry): Pass
11833 backedge_threads argument.
11834 (back_jt_path_registry::back_jt_path_registry): Same.
11835 (dump_jump_thread_path): Adjust for FSM removal.
11836 (back_jt_path_registry::rewire_first_differing_edge): Same.
11837 (back_jt_path_registry::adjust_paths_after_duplication): Same.
11838 (back_jt_path_registry::update_cfg): Same.
11839 (jt_path_registry::register_jump_thread): Same.
11840 * tree-ssa-threadupdate.h (enum jump_thread_edge_type): Remove
11842 (class back_jt_path_registry): Add backedge_threads to
11845 2021-09-13 Martin Liska <mliska@suse.cz>
11848 * asan.h (sanitize_coverage_p): Handle when fn == NULL.
11850 2021-09-13 H.J. Lu <hjl.tools@gmail.com>
11853 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): New.
11854 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
11855 (MOVE_MAX): Check TARGET_AVX256_MOVE_BY_PIECES and
11856 TARGET_AVX256_STORE_BY_PIECES instead of
11857 TARGET_AVX256_SPLIT_UNALIGNED_LOAD and
11858 TARGET_AVX256_SPLIT_UNALIGNED_STORE.
11859 (STORE_MAX_PIECES): Check TARGET_AVX256_STORE_BY_PIECES instead
11860 of TARGET_AVX256_SPLIT_UNALIGNED_STORE.
11861 * config/i386/x86-tune.def (X86_TUNE_AVX256_MOVE_BY_PIECES): New.
11862 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
11864 2021-09-13 liuhongt <hongtao.liu@intel.com>
11866 PR bootstrap/102302
11867 * expmed.c (extract_bit_field_using_extv): Use
11868 gen_lowpart_if_possible instead of gen_lowpart to avoid ICE.
11870 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
11872 * Makefile.in (OBJS): Add value-pointer-equiv.o.
11873 * gimple-ssa-evrp.c (class ssa_equiv_stack): Move to
11874 value-pointer-equiv.*.
11875 (ssa_equiv_stack::ssa_equiv_stack): Same.
11876 (ssa_equiv_stack::enter): Same.
11877 (ssa_equiv_stack::leave): Same.
11878 (ssa_equiv_stack::push_replacement): Same.
11879 (ssa_equiv_stack::get_replacement): Same.
11880 (is_pointer_ssa): Same.
11881 (class pointer_equiv_analyzer): Same.
11882 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
11883 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Same.
11884 (pointer_equiv_analyzer::set_global_equiv): Same.
11885 (pointer_equiv_analyzer::set_cond_equiv): Same.
11886 (pointer_equiv_analyzer::get_equiv): Same.
11887 (pointer_equiv_analyzer::enter): Same.
11888 (pointer_equiv_analyzer::leave): Same.
11889 (pointer_equiv_analyzer::get_equiv_expr): Same.
11890 (pta_valueize): Same.
11891 (pointer_equiv_analyzer::visit_stmt): Same.
11892 (pointer_equiv_analyzer::visit_edge): Same.
11893 (hybrid_folder::value_of_expr): Same.
11894 (hybrid_folder::value_on_edge): Same.
11895 * value-pointer-equiv.cc: New file.
11896 * value-pointer-equiv.h: New file.
11898 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11901 * gimple-fold.c (gimple_fold_builtin_memory_op): Allow folding
11902 memcpy if the size is not more than MOVE_MAX * MOVE_RATIO.
11904 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11907 * config/arm/arm.md (movmisaligndi): New define_expand.
11908 * config/arm/vec-common.md (movmisalign<mode>): Iterate over VDQ mode.
11910 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11913 * emit-rtl.c (gen_highpart): Use adjust_address to handle
11914 MEM rather than calling simplify_gen_subreg.
11916 2021-09-13 Jan-Benedict Glaw <jbglaw@ług-owl.de>
11918 * config/alpha/vms.h (INIT_CUMULATIVE_ARGS): Wrap multi-statment
11919 define into a block.
11921 2021-09-13 Richard Biener <rguenther@suse.de>
11923 * config/darwin.h (DARWIN_PREFER_DWARF): Do not define.
11924 * config/i386/darwin.h (PREFERRED_DEBUGGING_TYPE): Do not
11925 change based on DARWIN_PREFER_DWARF not being defined.
11927 2021-09-13 Richard Biener <rguenther@suse.de>
11929 * config/i386/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
11930 to inherit from elfos.h
11932 2021-09-13 Richard Biener <rguenther@suse.de>
11934 * config.gcc: Add cr16-*-* to the list of obsoleted targets.
11936 2021-09-13 Richard Biener <rguenther@suse.de>
11938 * config/avr/elf.h (PREFERRED_DEBUGGING_TYPE): Remove
11939 override, pick up DWARF2_DEBUG define from elfos.h
11941 2021-09-13 Richard Biener <rguenther@suse.de>
11943 * config/rx/rx.h (PREFERRED_DEBUGGING_TYPE): Always define to
11946 2021-09-13 Richard Biener <rguenther@suse.de>
11948 * config/alpha/vms.h (PREFERRED_DEBUGGING_TYPE): Define to
11951 2021-09-13 Richard Biener <rguenther@suse.de>
11953 * config/i386/cygming.h: Always default to DWARF2 debugging.
11954 Do not define DBX_DEBUGGING_INFO, that's done via dbxcoff.h
11956 * doc/install.texi: Document binutils 2.16 as minimum
11957 requirement for mingw.
11959 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
11961 * config/rs6000/rs6000.c (struct rs6000_cost_data): New members
11962 nstmts, nloads and extra_ctor_cost.
11963 (rs6000_density_test): Add load density related heuristics. Do
11964 extra costing on vector construction statements if need.
11965 (rs6000_init_cost): Init new members.
11966 (rs6000_update_target_cost_per_stmt): New function.
11967 (rs6000_add_stmt_cost): Factor vect_nonmem hunk out to function
11968 rs6000_update_target_cost_per_stmt and call it.
11970 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
11972 * config/rs6000/rs6000.c (struct rs6000_cost_data): Remove typedef.
11973 (rs6000_init_cost): Adjust.
11975 2021-09-13 liuhongt <hongtao.liu@intel.com>
11977 * config/i386/i386.md: (UNSPEC_COPYSIGN): Remove.
11978 (UNSPEC_XORSIGN): Ditto.
11980 2021-09-12 Roger Sayle <roger@nextmovesoftware.com>
11982 * expr.c (convert_move): Preserve SUBREG_PROMOTED_VAR_P when
11983 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
11986 2021-09-11 Aldy Hernandez <aldyh@redhat.com>
11988 * tree-ssa-threadbackward.c (class back_threader_registry): Use
11989 back_jt_path_registry.
11990 * tree-ssa-threadedge.c (jump_threader::jump_threader): Use
11991 fwd_jt_path_registry.
11992 * tree-ssa-threadedge.h (class jump_threader): Same..
11993 * tree-ssa-threadupdate.c
11994 (jump_thread_path_registry::jump_thread_path_registry): Rename...
11995 (jt_path_registry::jt_path_registry): ...to this.
11996 (jump_thread_path_registry::~jump_thread_path_registry): Rename...
11997 (jt_path_registry::~jt_path_registry): ...this.
11998 (fwd_jt_path_registry::fwd_jt_path_registry): New.
11999 (fwd_jt_path_registry::~fwd_jt_path_registry): New.
12000 (jump_thread_path_registry::allocate_thread_edge): Rename...
12001 (jt_path_registry::allocate_thread_edge): ...to this.
12002 (jump_thread_path_registry::allocate_thread_path): Rename...
12003 (jt_path_registry::allocate_thread_path): ...to this.
12004 (jump_thread_path_registry::lookup_redirection_data): Rename...
12005 (fwd_jt_path_registry::lookup_redirection_data): ...to this.
12006 (jump_thread_path_registry::thread_block_1): Rename...
12007 (fwd_jt_path_registry::thread_block_1): ...to this.
12008 (jump_thread_path_registry::thread_block): Rename...
12009 (fwd_jt_path_registry::thread_block): ...to this.
12010 (jt_path_registry::thread_through_loop_header): Rename...
12011 (fwd_jt_path_registry::thread_through_loop_header): ...to this.
12012 (jump_thread_path_registry::mark_threaded_blocks): Rename...
12013 (fwd_jt_path_registry::mark_threaded_blocks): ...to this.
12014 (jump_thread_path_registry::debug_path): Rename...
12015 (jt_path_registry::debug_path): ...to this.
12016 (jump_thread_path_registry::dump): Rename...
12017 (jt_path_registry::debug): ...to this.
12018 (jump_thread_path_registry::rewire_first_differing_edge): Rename...
12019 (back_jt_path_registry::rewire_first_differing_edge): ...to this.
12020 (jump_thread_path_registry::adjust_paths_after_duplication): Rename...
12021 (back_jt_path_registry::adjust_paths_after_duplication): ...to this.
12022 (jump_thread_path_registry::duplicate_thread_path): Rename...
12023 (back_jt_path_registry::duplicate_thread_path): ...to this. Also,
12024 drop ill-formed candidates.
12025 (jump_thread_path_registry::remove_jump_threads_including): Rename...
12026 (fwd_jt_path_registry::remove_jump_threads_including): ...to this.
12027 (jt_path_registry::thread_through_all_blocks): New.
12028 (back_jt_path_registry::update_cfg): New.
12029 (fwd_jt_path_registry::update_cfg): New.
12030 (jump_thread_path_registry::register_jump_thread): Rename...
12031 (jt_path_registry::register_jump_thread): ...to this.
12032 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
12034 (class jt_path_registry): ...here.
12035 (class fwd_jt_path_registry): New.
12036 (class back_jt_path_registry): New.
12038 2021-09-10 liuhongt <hongtao.liu@intel.com>
12041 2021-09-01 liuhongt <hongtao.liu@intel.com>
12043 * emit-rtl.c (validate_subreg): Get rid of all float-int
12046 2021-09-10 Jakub Jelinek <jakub@redhat.com>
12048 * tree-core.h (enum omp_memory_order): Add OMP_MEMORY_ORDER_MASK,
12049 OMP_FAIL_MEMORY_ORDER_UNSPECIFIED, OMP_FAIL_MEMORY_ORDER_RELAXED,
12050 OMP_FAIL_MEMORY_ORDER_ACQUIRE, OMP_FAIL_MEMORY_ORDER_RELEASE,
12051 OMP_FAIL_MEMORY_ORDER_ACQ_REL, OMP_FAIL_MEMORY_ORDER_SEQ_CST and
12052 OMP_FAIL_MEMORY_ORDER_MASK enumerators.
12053 (OMP_FAIL_MEMORY_ORDER_SHIFT): Define.
12054 * gimple-pretty-print.c (dump_gimple_omp_atomic_load,
12055 dump_gimple_omp_atomic_store): Print [weak] for weak atomic
12057 * gimple.h (enum gf_mask): Change GF_OMP_ATOMIC_MEMORY_ORDER
12058 to 6-bit mask, adjust GF_OMP_ATOMIC_NEED_VALUE value and add
12059 GF_OMP_ATOMIC_WEAK.
12060 (gimple_omp_atomic_weak_p, gimple_omp_atomic_set_weak): New inline
12062 * tree.h (OMP_ATOMIC_WEAK): Define.
12063 * tree-pretty-print.c (dump_omp_atomic_memory_order): Adjust for
12064 fail memory order being encoded in the same enum and also print
12065 fail clause if present.
12066 (dump_generic_node): Print weak clause if OMP_ATOMIC_WEAK.
12067 * gimplify.c (goa_stabilize_expr): Add target_expr and rhs arguments,
12068 handle pre_p == NULL case as a test mode that only returns value
12069 but doesn't change gimplify nor change anything otherwise, adjust
12070 recursive calls, add MODIFY_EXPR, ADDR_EXPR, COND_EXPR, TARGET_EXPR
12071 and CALL_EXPR handling, adjust COMPOUND_EXPR handling for
12072 __builtin_clear_padding calls, for !rhs gimplify as lvalue rather
12074 (gimplify_omp_atomic): Adjust goa_stabilize_expr caller. Handle
12075 COND_EXPR rhs. Set weak flag on gimple load/store for
12077 * omp-expand.c (omp_memory_order_to_fail_memmodel): New function.
12078 (omp_memory_order_to_memmodel): Adjust for fail clause encoded
12080 (expand_omp_atomic_cas): New function.
12081 (expand_omp_atomic_pipeline): Use omp_memory_order_to_fail_memmodel
12083 (expand_omp_atomic): Attempt to optimize atomic compare and exchange
12084 using expand_omp_atomic_cas.
12086 2021-09-10 Aldy Hernandez <aldyh@redhat.com>
12087 Michael Matz <matz@suse.de>
12089 * tree-pass.h (PROP_loop_opts_done): New.
12090 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
12091 Intersect with global range.
12092 * tree-ssa-loop.c (tree_ssa_loop_done): Set PROP_loop_opts_done.
12093 * tree-ssa-threadbackward.c
12094 (back_threader_profitability::profitable_path_p): Disable
12095 threading through latches until after loop optimizations have run.
12097 2021-09-10 David Faust <david.faust@oracle.com>
12099 * doc/invoke.texi: Document BPF -mcpu, -mjmpext, -mjmp32 and -malu32
12102 2021-09-10 David Faust <david.faust@oracle.com>
12104 * config/bpf/bpf-opts.h (bpf_isa_version): New enum.
12105 * config/bpf/bpf-protos.h (bpf_expand_cbranch): New.
12106 * config/bpf/bpf.c (bpf_option_override): Handle -mcpu option.
12107 (bpf_expand_cbranch): New function.
12108 * config/bpf/bpf.md (AM mode iterator): Conditionalize support for SI
12110 (zero_extendsidi2): Only use mov32 instruction if it is available.
12111 (SIM mode iterator): Conditionalize support for SI mode.
12112 (JM mode iterator): New.
12113 (cbranchdi4): Update name, use new JM iterator. Use bpf_expand_cbranch.
12114 (*branch_on_di): Update name, use new JM iterator.
12115 * config/bpf/bpf.opt: (mjmpext): New option.
12116 (malu32): Likewise.
12117 (mjmp32): Likewise.
12119 (bpf_isa): New enum.
12121 2021-09-10 David Faust <david.faust@oracle.com>
12123 * config/bpf/bpf.md (zero_extendhidi2): Add new output template
12124 for register-to-register extensions.
12125 (zero_extendqidi2): Likewise.
12127 2021-09-10 Richard Biener <rguenther@suse.de>
12129 PR middle-end/102273
12130 * internal-fn.c (expand_DEFERRED_INIT): Always expand non-SSA vars.
12132 2021-09-10 Richard Biener <rguenther@suse.de>
12134 PR middle-end/102269
12135 * gimplify.c (is_var_need_auto_init): Empty types do not need
12138 2021-09-10 Richard Biener <rguenther@suse.de>
12140 * configure.ac (--with-stabs): Remove.
12141 * configure: Regenerate.
12142 * doc/install.texi: Remove --with-stabs documentation.
12144 2021-09-10 liuhongt <hongtao.liu@intel.com>
12146 * config/i386/avx512fp16intrin.h: (_mm512_cmp_ph_mask):
12148 (_mm512_mask_cmp_ph_mask): Likewise.
12149 (_mm512_cmp_round_ph_mask): Likewise.
12150 (_mm512_mask_cmp_round_ph_mask): Likewise.
12151 (_mm_cmp_sh_mask): Likewise.
12152 (_mm_mask_cmp_sh_mask): Likewise.
12153 (_mm_cmp_round_sh_mask): Likewise.
12154 (_mm_mask_cmp_round_sh_mask): Likewise.
12155 (_mm_comieq_sh): Likewise.
12156 (_mm_comilt_sh): Likewise.
12157 (_mm_comile_sh): Likewise.
12158 (_mm_comigt_sh): Likewise.
12159 (_mm_comige_sh): Likewise.
12160 (_mm_comineq_sh): Likewise.
12161 (_mm_ucomieq_sh): Likewise.
12162 (_mm_ucomilt_sh): Likewise.
12163 (_mm_ucomile_sh): Likewise.
12164 (_mm_ucomigt_sh): Likewise.
12165 (_mm_ucomige_sh): Likewise.
12166 (_mm_ucomineq_sh): Likewise.
12167 (_mm_comi_round_sh): Likewise.
12168 (_mm_comi_sh): Likewise.
12169 * config/i386/avx512fp16vlintrin.h (_mm_cmp_ph_mask): New intrinsic.
12170 (_mm_mask_cmp_ph_mask): Likewise.
12171 (_mm256_cmp_ph_mask): Likewise.
12172 (_mm256_mask_cmp_ph_mask): Likewise.
12173 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12174 * config/i386/i386-builtin.def: Add corresponding new builtins.
12175 * config/i386/i386-expand.c
12176 (ix86_expand_args_builtin): Handle new builtin types.
12177 (ix86_expand_round_builtin): Ditto.
12178 * config/i386/i386.md (ssevecmode): Add HF mode.
12179 (MODEFH): New mode iterator.
12180 * config/i386/sse.md
12181 (V48H_AVX512VL): New mode iterator to support HF vector modes.
12182 Ajdust corresponding description.
12183 (ssecmpintprefix): New.
12184 (VI12_AVX512VL): Adjust to support HF vector modes.
12185 (cmp_imm_predicate): Likewise.
12186 (<avx512>_cmp<mode>3<mask_scalar_merge_name><round_saeonly_name>):
12188 (avx512f_vmcmp<mode>3<round_saeonly_name>): Likewise.
12189 (avx512f_vmcmp<mode>3_mask<round_saeonly_name>): Likewise.
12190 (<sse>_<unord>comi<round_saeonly_name>): Likewise.
12192 2021-09-10 liuhongt <hongtao.liu@intel.com>
12194 * config/i386/avx512fp16intrin.h: (_mm512_max_ph): New intrinsic.
12195 (_mm512_mask_max_ph): Likewise.
12196 (_mm512_maskz_max_ph): Likewise.
12197 (_mm512_min_ph): Likewise.
12198 (_mm512_mask_min_ph): Likewise.
12199 (_mm512_maskz_min_ph): Likewise.
12200 (_mm512_max_round_ph): Likewise.
12201 (_mm512_mask_max_round_ph): Likewise.
12202 (_mm512_maskz_max_round_ph): Likewise.
12203 (_mm512_min_round_ph): Likewise.
12204 (_mm512_mask_min_round_ph): Likewise.
12205 (_mm512_maskz_min_round_ph): Likewise.
12206 (_mm_max_sh): Likewise.
12207 (_mm_mask_max_sh): Likewise.
12208 (_mm_maskz_max_sh): Likewise.
12209 (_mm_min_sh): Likewise.
12210 (_mm_mask_min_sh): Likewise.
12211 (_mm_maskz_min_sh): Likewise.
12212 (_mm_max_round_sh): Likewise.
12213 (_mm_mask_max_round_sh): Likewise.
12214 (_mm_maskz_max_round_sh): Likewise.
12215 (_mm_min_round_sh): Likewise.
12216 (_mm_mask_min_round_sh): Likewise.
12217 (_mm_maskz_min_round_sh): Likewise.
12218 * config/i386/avx512fp16vlintrin.h (_mm_max_ph): New intrinsic.
12219 (_mm256_max_ph): Likewise.
12220 (_mm_mask_max_ph): Likewise.
12221 (_mm256_mask_max_ph): Likewise.
12222 (_mm_maskz_max_ph): Likewise.
12223 (_mm256_maskz_max_ph): Likewise.
12224 (_mm_min_ph): Likewise.
12225 (_mm256_min_ph): Likewise.
12226 (_mm_mask_min_ph): Likewise.
12227 (_mm256_mask_min_ph): Likewise.
12228 (_mm_maskz_min_ph): Likewise.
12229 (_mm256_maskz_min_ph): Likewise.
12230 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12231 * config/i386/i386-builtin.def: Add corresponding new builtins.
12232 * config/i386/i386-expand.c
12233 (ix86_expand_args_builtin): Handle new builtin types.
12234 * config/i386/sse.md
12235 (<code><mode>3<mask_name><round_saeonly_name>): Adjust to
12236 support HF vector modes.
12237 (*<code><mode>3<mask_name><round_saeonly_name>): Likewise.
12238 (ieee_<ieee_maxmin><mode>3<mask_name><round_saeonly_name>):
12240 (<sse>_vm<code><mode>3<mask_scalar_name><round_saeonly_scalar_name>):
12242 * config/i386/subst.md (round_saeonly_mode512bit_condition):
12243 Adjust for HF vector modes.
12245 2021-09-10 Liu, Hongtao <hongtao.liu@intel.com>
12247 * config/i386/avx512fp16intrin.h (_mm_add_sh): New intrinsic.
12248 (_mm_mask_add_sh): Likewise.
12249 (_mm_maskz_add_sh): Likewise.
12250 (_mm_sub_sh): Likewise.
12251 (_mm_mask_sub_sh): Likewise.
12252 (_mm_maskz_sub_sh): Likewise.
12253 (_mm_mul_sh): Likewise.
12254 (_mm_mask_mul_sh): Likewise.
12255 (_mm_maskz_mul_sh): Likewise.
12256 (_mm_div_sh): Likewise.
12257 (_mm_mask_div_sh): Likewise.
12258 (_mm_maskz_div_sh): Likewise.
12259 (_mm_add_round_sh): Likewise.
12260 (_mm_mask_add_round_sh): Likewise.
12261 (_mm_maskz_add_round_sh): Likewise.
12262 (_mm_sub_round_sh): Likewise.
12263 (_mm_mask_sub_round_sh): Likewise.
12264 (_mm_maskz_sub_round_sh): Likewise.
12265 (_mm_mul_round_sh): Likewise.
12266 (_mm_mask_mul_round_sh): Likewise.
12267 (_mm_maskz_mul_round_sh): Likewise.
12268 (_mm_div_round_sh): Likewise.
12269 (_mm_mask_div_round_sh): Likewise.
12270 (_mm_maskz_div_round_sh): Likewise.
12271 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12272 * config/i386/i386-builtin.def: Add corresponding new builtins.
12273 * config/i386/i386-expand.c
12274 (ix86_expand_round_builtin): Handle new builtins.
12275 * config/i386/sse.md (VF_128): Change description.
12276 (<sse>_vm<plusminus_insn><mode>3<mask_scalar_name><round_scalar_name>):
12277 Adjust to support HF vector modes.
12278 (<sse>_vm<multdiv_mnemonic><mode>3<mask_scalar_name><round_scalar_name>):
12281 2021-09-10 H.J. Lu <hjl.tools@gmail.com>
12283 * config/i386/i386-expand.c
12284 (ix86_avx256_split_vector_move_misalign): Handle V16HF mode.
12285 * config/i386/i386.c
12286 (ix86_preferred_simd_mode): Handle HF mode.
12287 * config/i386/sse.md (V_256H): New mode iterator.
12288 (avx_vextractf128<mode>): Use it.
12289 (VEC_INIT_MODE): Align vector HFmode condition to vector
12290 HImodes since there're no real HF instruction used.
12291 (VEC_INIT_HALF_MODE): Ditto.
12293 (VIHF_AVX512BW): Ditto.
12294 (*vec_extracthf): Ditto.
12295 (VEC_EXTRACT_MODE): Ditto.
12297 2021-09-10 Richard Biener <rguenther@suse.de>
12300 * config/dbx.h: Remove.
12301 * config/dbxcoff.h: Do not define PREFERRED_DEBUGGING_TYPE.
12302 * config/lynx.h: Likewise.
12304 2021-09-10 liuhongt <hongtao.liu@intel.com>
12306 * config/i386/i386-expand.c (ix86_expand_copysign): Expand
12307 right into ANDNOT + AND + IOR, using paradoxical subregs.
12308 (ix86_split_copysign_const): Remove.
12309 (ix86_split_copysign_var): Ditto.
12310 * config/i386/i386-protos.h (ix86_split_copysign_const): Dotto.
12311 (ix86_split_copysign_var): Ditto.
12312 * config/i386/i386.md (@copysign<mode>3_const): Ditto.
12313 (@copysign<mode>3_var): Ditto.
12315 2021-09-09 qing zhao <qing.zhao@oracle.com>
12317 * builtins.c (expand_builtin_memset): Make external visible.
12318 * builtins.h (expand_builtin_memset): Declare extern.
12319 * common.opt (ftrivial-auto-var-init=): New option.
12320 * doc/extend.texi: Document the uninitialized attribute.
12321 * doc/invoke.texi: Document -ftrivial-auto-var-init.
12322 * flag-types.h (enum auto_init_type): New enumerated type
12324 * gimple-fold.c (clear_padding_type): Add one new parameter.
12325 (clear_padding_union): Likewise.
12326 (clear_padding_emit_loop): Likewise.
12327 (clear_type_padding_in_mask): Likewise.
12328 (gimple_fold_builtin_clear_padding): Handle this new parameter.
12329 * gimplify.c (gimple_add_init_for_auto_var): New function.
12330 (gimple_add_padding_init_for_auto_var): New function.
12331 (is_var_need_auto_init): New function.
12332 (gimplify_decl_expr): Add initialization to automatic variables per
12334 (gimplify_call_expr): Add one new parameter for call to
12335 __builtin_clear_padding.
12336 (gimplify_init_constructor): Add padding initialization in the end.
12337 * internal-fn.c (INIT_PATTERN_VALUE): New macro.
12338 (expand_DEFERRED_INIT): New function.
12339 * internal-fn.def (DEFERRED_INIT): New internal function.
12340 * tree-cfg.c (verify_gimple_call): Verify calls to .DEFERRED_INIT.
12341 * tree-sra.c (generate_subtree_deferred_init): New function.
12342 (scan_function): Avoid setting cannot_scalarize_away_bitmap for
12343 calls to .DEFERRED_INIT.
12344 (sra_modify_deferred_init): New function.
12345 (sra_modify_function_body): Handle calls to DEFERRED_INIT specially.
12346 * tree-ssa-structalias.c (find_func_aliases_for_call): Likewise.
12347 * tree-ssa-uninit.c (warn_uninit): Handle calls to DEFERRED_INIT
12349 (check_defs): Likewise.
12350 (warn_uninitialized_vars): Likewise.
12351 * tree-ssa.c (ssa_undefined_value_p): Likewise.
12352 * tree.c (build_common_builtin_nodes): Build tree node for
12353 BUILT_IN_CLEAR_PADDING when needed.
12355 2021-09-09 Richard Biener <rguenther@suse.de>
12357 * tree-ssa-loop-im.c (fill_always_executed_in_1): Walk
12360 2021-09-09 Richard Biener <rguenther@suse.de>
12362 * tree-ssa-loop-im.c (fill_always_executed_in_1): Integrate
12363 DOM walk from get_loop_body_in_dom_order using a worklist
12366 2021-09-09 liuhongt <hongtao.liu@intel.com>
12368 * config.gcc: Add avx512fp16vlintrin.h.
12369 * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic.
12370 (_mm512_mask_add_ph): Likewise.
12371 (_mm512_maskz_add_ph): Likewise.
12372 (_mm512_sub_ph): Likewise.
12373 (_mm512_mask_sub_ph): Likewise.
12374 (_mm512_maskz_sub_ph): Likewise.
12375 (_mm512_mul_ph): Likewise.
12376 (_mm512_mask_mul_ph): Likewise.
12377 (_mm512_maskz_mul_ph): Likewise.
12378 (_mm512_div_ph): Likewise.
12379 (_mm512_mask_div_ph): Likewise.
12380 (_mm512_maskz_div_ph): Likewise.
12381 (_mm512_add_round_ph): Likewise.
12382 (_mm512_mask_add_round_ph): Likewise.
12383 (_mm512_maskz_add_round_ph): Likewise.
12384 (_mm512_sub_round_ph): Likewise.
12385 (_mm512_mask_sub_round_ph): Likewise.
12386 (_mm512_maskz_sub_round_ph): Likewise.
12387 (_mm512_mul_round_ph): Likewise.
12388 (_mm512_mask_mul_round_ph): Likewise.
12389 (_mm512_maskz_mul_round_ph): Likewise.
12390 (_mm512_div_round_ph): Likewise.
12391 (_mm512_mask_div_round_ph): Likewise.
12392 (_mm512_maskz_div_round_ph): Likewise.
12393 * config/i386/avx512fp16vlintrin.h: New header.
12394 * config/i386/i386-builtin-types.def (V16HF, V8HF, V32HF):
12395 Add new builtin types.
12396 * config/i386/i386-builtin.def: Add corresponding builtins.
12397 * config/i386/i386-expand.c
12398 (ix86_expand_args_builtin): Handle new builtin types.
12399 (ix86_expand_round_builtin): Likewise.
12400 * config/i386/immintrin.h: Include avx512fp16vlintrin.h
12401 * config/i386/sse.md (VFH): New mode_iterator.
12403 (avx512fmaskmode): Add HF vector modes.
12404 (avx512fmaskhalfmode): Likewise.
12405 (<plusminus_insn><mode>3<mask_name><round_name>): Adjust to for
12407 (*<plusminus_insn><mode>3<mask_name><round_name>): Likewise.
12408 (mul<mode>3<mask_name><round_name>): Likewise.
12409 (*mul<mode>3<mask_name><round_name>): Likewise.
12410 (div<mode>3): Likewise.
12411 (<sse>_div<mode>3<mask_name><round_name>): Likewise.
12412 * config/i386/subst.md (SUBST_V): Add HF vector modes.
12413 (SUBST_A): Likewise.
12414 (round_mode512bit_condition): Adjust for V32HFmode.
12416 2021-09-09 liuhongt <hongtao.liu@intel.com>
12419 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
12420 (reduc_plus_scal_v4sf): .. this, New define_expand.
12421 (reduc_plus_scal_v2df): .. and this, New define_expand.
12423 2021-09-09 liuhongt <hongtao.liu@intel.com>
12426 * config/i386/sse.md (*vec_extract<mode><ssescalarmodelower>_valign):
12429 2021-09-08 Jonathan Wakely <jwakely@redhat.com>
12432 * doc/trouble.texi (Copy Assignment): Fix description of
12433 behaviour and fix code in example.
12435 2021-09-08 Segher Boessenkool <segher@kernel.crashing.org>
12438 * config/rs6000/rs6000-logue.c (rs6000_emit_epilogue): For ELFv2 use
12439 r11 instead of r12 for restoring CR.
12441 2021-09-08 Jakub Jelinek <jakub@redhat.com>
12442 liuhongt <hongtao.liu@intel.com>
12445 * config/i386/i386.md (@xorsign<mode>3_1): Remove.
12446 * config/i386/i386-expand.c (ix86_expand_xorsign): Expand right away
12447 into AND with mask and XOR, using paradoxical subregs.
12448 (ix86_split_xorsign): Remove.
12449 * config/i386/i386-protos.h (ix86_split_xorsign): Remove.
12451 2021-09-08 Di Zhao <dizhao@os.amperecomputing.com>
12453 * tree-ssa-sccvn.c (vn_nary_op_insert_into): fix result compare
12455 2021-09-08 Jakub Jelinek <jakub@redhat.com>
12458 * config/i386/i386.md (xorsign<mode>3): If operands[1] is equal to
12459 operands[2], emit abs<mode>2 instead.
12460 (@xorsign<mode>3_1): Add early-clobbers for output operand, enable
12461 first alternative even for avx, add another alternative with
12462 =&Yv <- 0, Yv, Yvm constraints.
12463 * config/i386/i386-expand.c (ix86_split_xorsign): If op0 is equal
12464 to op1, emit vpandn instead.
12466 2021-09-08 liuhongt <hongtao.liu@intel.com>
12468 * config/i386/avx512fp16intrin.h (_mm_set_ph): New intrinsic.
12469 (_mm256_set_ph): Likewise.
12470 (_mm512_set_ph): Likewise.
12471 (_mm_setr_ph): Likewise.
12472 (_mm256_setr_ph): Likewise.
12473 (_mm512_setr_ph): Likewise.
12474 (_mm_set1_ph): Likewise.
12475 (_mm256_set1_ph): Likewise.
12476 (_mm512_set1_ph): Likewise.
12477 (_mm_setzero_ph): Likewise.
12478 (_mm256_setzero_ph): Likewise.
12479 (_mm512_setzero_ph): Likewise.
12480 (_mm_set_sh): Likewise.
12481 (_mm_load_sh): Likewise.
12482 (_mm_store_sh): Likewise.
12483 * config/i386/i386-builtin-types.def (V8HF): New type.
12484 (DEF_FUNCTION_TYPE (V8HF, V8HI)): New builtin function type
12485 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
12486 Support vector HFmodes.
12487 (ix86_expand_vector_init_one_nonzero): Likewise.
12488 (ix86_expand_vector_init_one_var): Likewise.
12489 (ix86_expand_vector_init_interleave): Likewise.
12490 (ix86_expand_vector_init_general): Likewise.
12491 (ix86_expand_vector_set): Likewise.
12492 (ix86_expand_vector_extract): Likewise.
12493 (ix86_expand_vector_init_concat): Likewise.
12494 (ix86_expand_sse_movcc): Handle vector HFmodes.
12495 (ix86_expand_vector_set_var): Ditto.
12496 * config/i386/i386-modes.def: Add HF vector modes in comment.
12497 * config/i386/i386.c (classify_argument): Add HF vector modes.
12498 (ix86_hard_regno_mode_ok): Allow HF vector modes for AVX512FP16.
12499 (ix86_vector_mode_supported_p): Likewise.
12500 (ix86_set_reg_reg_cost): Handle vector HFmode.
12501 (ix86_get_ssemov): Handle vector HFmode.
12502 (function_arg_advance_64): Pass unamed V16HFmode and V32HFmode
12504 (function_arg_advance_32): Pass V8HF/V16HF/V32HF by sse reg for 32bit
12506 (function_arg_advance_32): Ditto.
12507 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): New.
12508 (VALID_AVX256_REG_OR_OI_MODE): Rename to ..
12509 (VALID_AVX256_REG_OR_OI_VHF_MODE): .. this, and add V16HF.
12510 (VALID_SSE2_REG_VHF_MODE): New.
12511 (VALID_AVX512VL_128_REG_MODE): Add V8HF and TImode.
12512 (SSE_REG_MODE_P): Add vector HFmode.
12513 * config/i386/i386.md (mode): Add HF vector modes.
12514 (MODE_SIZE): Likewise.
12515 (ssemodesuffix): Add ph suffix for HF vector modes.
12516 * config/i386/sse.md (VFH_128): New mode iterator.
12517 (VMOVE): Adjust for HF vector modes.
12519 (V_256_512): Likewise.
12520 (avx512): Likewise.
12521 (avx512fmaskmode): Likewise.
12522 (shuffletype): Likewise.
12523 (sseinsnmode): Likewise.
12524 (ssedoublevecmode): Likewise.
12525 (ssehalfvecmode): Likewise.
12526 (ssehalfvecmodelower): Likewise.
12527 (ssePScmode): Likewise.
12528 (ssescalarmode): Likewise.
12529 (ssescalarmodelower): Likewise.
12530 (sseintprefix): Likewise.
12532 (bcstscalarsuff): Likewise.
12533 (xtg_mode): Likewise.
12534 (VI12HF_AVX512VL): New mode_iterator.
12535 (VF_AVX512FP16): Likewise.
12537 (VIHF_256): Likewise.
12538 (VIHF_AVX512BW): Likewise.
12539 (V16_256): Likewise.
12540 (V32_512): Likewise.
12541 (sseintmodesuffix): New mode_attr.
12542 (sse): Add scalar and vector HFmodes.
12543 (ssescalarmode): Add vector HFmode mapping.
12544 (ssescalarmodesuffix): Add sh suffix for HFmode.
12545 (*<sse>_vm<insn><mode>3): Use VFH_128.
12546 (*<sse>_vm<multdiv_mnemonic><mode>3): Likewise.
12547 (*ieee_<ieee_maxmin><mode>3): Likewise.
12548 (<avx512>_blendm<mode>): New define_insn.
12549 (vec_setv8hf): New define_expand.
12550 (vec_set<mode>_0): New define_insn for HF vector set.
12551 (*avx512fp16_movsh): Likewise.
12552 (avx512fp16_movsh): Likewise.
12553 (vec_extract_lo_v32hi): Rename to ...
12554 (vec_extract_lo_<mode>): ... this, and adjust to allow HF
12556 (vec_extract_hi_v32hi): Likewise.
12557 (vec_extract_hi_<mode>): Likewise.
12558 (vec_extract_lo_v16hi): Likewise.
12559 (vec_extract_lo_<mode>): Likewise.
12560 (vec_extract_hi_v16hi): Likewise.
12561 (vec_extract_hi_<mode>): Likewise.
12562 (vec_set_hi_v16hi): Likewise.
12563 (vec_set_hi_<mode>): Likewise.
12564 (vec_set_lo_v16hi): Likewise.
12565 (vec_set_lo_<mode>): Likewise.
12566 (*vec_extract<mode>_0): New define_insn_and_split for HF
12568 (*vec_extracthf): New define_insn.
12569 (VEC_EXTRACT_MODE): Add HF vector modes.
12570 (PINSR_MODE): Add V8HF.
12571 (sse2p4_1): Likewise.
12572 (pinsr_evex_isa): Likewise.
12573 (<sse2p4_1>_pinsr<ssemodesuffix>): Adjust to support
12574 insert for V8HFmode.
12575 (pbroadcast_evex_isa): Add HF vector modes.
12576 (AVX2_VEC_DUP_MODE): Likewise.
12577 (VEC_INIT_MODE): Likewise.
12578 (VEC_INIT_HALF_MODE): Likewise.
12579 (avx2_pbroadcast<mode>): Adjust to support HF vector mode
12581 (avx2_pbroadcast<mode>_1): Likewise.
12582 (<avx512>_vec_dup<mode>_1): Likewise.
12583 (<avx512>_vec_dup<mode><mask_name>): Likewise.
12584 (<mask_codefor><avx512>_vec_dup_gpr<mode><mask_name>):
12587 2021-09-08 Guo, Xuepeng <xuepeng.guo@intel.com>
12588 H.J. Lu <hongjiu.lu@intel.com>
12589 Liu Hongtao <hongtao.liu@intel.com>
12590 Wang Hongyu <hongyu.wang@intel.com>
12591 Xu Dianhong <dianhong.xu@intel.com>
12593 * common/config/i386/cpuinfo.h (get_available_features):
12594 Detect FEATURE_AVX512FP16.
12595 * common/config/i386/i386-common.c
12596 (OPTION_MASK_ISA_AVX512FP16_SET,
12597 OPTION_MASK_ISA_AVX512FP16_UNSET,
12598 OPTION_MASK_ISA2_AVX512FP16_SET,
12599 OPTION_MASK_ISA2_AVX512FP16_UNSET): New.
12600 (OPTION_MASK_ISA2_AVX512BW_UNSET,
12601 OPTION_MASK_ISA2_AVX512BF16_UNSET): Add AVX512FP16.
12602 (ix86_handle_option): Handle -mavx512fp16.
12603 * common/config/i386/i386-cpuinfo.h (enum processor_features):
12604 Add FEATURE_AVX512FP16.
12605 * common/config/i386/i386-isas.h: Add entry for AVX512FP16.
12606 * config.gcc: Add avx512fp16intrin.h.
12607 * config/i386/avx512fp16intrin.h: New intrinsic header.
12608 * config/i386/cpuid.h: Add bit_AVX512FP16.
12609 * config/i386/i386-builtin-types.def: (FLOAT16): New primitive type.
12610 * config/i386/i386-builtins.c: Support _Float16 type for i386
12612 (ix86_register_float16_builtin_type): New function.
12613 (ix86_float16_type_node): New.
12614 * config/i386/i386-c.c (ix86_target_macros_internal): Define
12616 * config/i386/i386-expand.c (ix86_expand_branch): Support
12618 (ix86_prepare_fp_compare_args): Adjust TARGET_SSE_MATH &&
12619 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
12620 (ix86_expand_fp_movcc): Ditto.
12621 * config/i386/i386-isa.def: Add PTA define for AVX512FP16.
12622 * config/i386/i386-options.c (isa2_opts): Add -mavx512fp16.
12623 (ix86_valid_target_attribute_inner_p): Add avx512fp16 attribute.
12624 * config/i386/i386.c (ix86_get_ssemov): Use
12625 vmovdqu16/vmovw/vmovsh for HFmode/HImode scalar or vector.
12626 (ix86_get_excess_precision): Use
12627 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when TARGET_AVX512FP16
12629 (sse_store_index): Use SFmode cost for HFmode cost.
12630 (inline_memory_move_cost): Add HFmode, and perfer SSE cost over
12631 GPR cost for HFmode.
12632 (ix86_hard_regno_mode_ok): Allow HImode in sse register.
12633 (ix86_mangle_type): Add manlging for _Float16 type.
12634 (inline_secondary_memory_needed): No memory is needed for
12635 16bit movement between gpr and sse reg under
12637 (ix86_multiplication_cost): Adjust TARGET_SSE_MATH &&
12638 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
12639 (ix86_division_cost): Ditto.
12640 (ix86_rtx_costs): Ditto.
12641 (ix86_add_stmt_cost): Ditto.
12642 (ix86_optab_supported_p): Ditto.
12643 * config/i386/i386.h (VALID_AVX512F_SCALAR_MODE): Add HFmode.
12644 (SSE_FLOAT_MODE_SSEMATH_OR_HF_P): Add HFmode.
12645 (PTA_SAPPHIRERAPIDS): Add PTA_AVX512FP16.
12646 * config/i386/i386.md (mode): Add HFmode.
12647 (MODE_SIZE): Add HFmode.
12648 (isa): Add avx512fp16.
12649 (enabled): Handle avx512fp16.
12650 (ssemodesuffix): Add sh suffix for HFmode.
12651 (comm): Add mult, div.
12652 (plusminusmultdiv): New code iterator.
12653 (insn): Add mult, div.
12654 (*movhf_internal): Adjust for avx512fp16 instruction.
12655 (*movhi_internal): Ditto.
12656 (*cmpi<unord>hf): New define_insn for HFmode.
12657 (*ieee_s<ieee_maxmin>hf3): Likewise.
12658 (extendhf<mode>2): Likewise.
12659 (trunc<mode>hf2): Likewise.
12660 (float<floatunssuffix><mode>hf2): Likewise.
12661 (*<insn>hf): Likewise.
12662 (cbranchhf4): New expander.
12663 (movhfcc): Likewise.
12664 (<insn>hf3): Likewise.
12665 (mulhf3): Likewise.
12666 (divhf3): Likewise.
12667 * config/i386/i386.opt: Add mavx512fp16.
12668 * config/i386/immintrin.h: Include avx512fp16intrin.h.
12669 * doc/invoke.texi: Add mavx512fp16.
12670 * doc/extend.texi: Add avx512fp16 Usage Notes.
12672 2021-09-08 liuhongt <hongtao.liu@intel.com>
12674 * common.opt: Support -fexcess-precision=16.
12675 * config/aarch64/aarch64.c (aarch64_excess_precision): Return
12676 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when
12677 EXCESS_PRECISION_TYPE_FLOAT16.
12678 * config/arm/arm.c (arm_excess_precision): Ditto.
12679 * config/i386/i386.c (ix86_get_excess_precision): Ditto.
12680 * config/m68k/m68k.c (m68k_excess_precision): Issue an error
12681 when EXCESS_PRECISION_TYPE_FLOAT16.
12682 * config/s390/s390.c (s390_excess_precision): Ditto.
12683 * coretypes.h (enum excess_precision_type): Add
12684 EXCESS_PRECISION_TYPE_FLOAT16.
12685 * doc/tm.texi (TARGET_C_EXCESS_PRECISION): Update documents.
12686 * doc/tm.texi.in (TARGET_C_EXCESS_PRECISION): Ditto.
12687 * doc/extend.texi (Half-Precision): Document
12688 -fexcess-precision=16.
12689 * flag-types.h (enum excess_precision): Add
12690 EXCESS_PRECISION_FLOAT16.
12691 * target.def (excess_precision): Update document.
12692 * tree.c (excess_precision_type): Set excess_precision_type to
12693 EXCESS_PRECISION_FLOAT16 when -fexcess-precision=16.
12695 2021-09-08 liuhongt <hongtao.liu@intel.com>
12697 * doc/extend.texi: (@node Floating Types): Adjust the wording.
12698 (@node Half-Precision): Ditto.
12700 2021-09-07 Takayuki 'January June' Suwa <jjsuwa_sys3175@yahoo.co.jp>
12703 * config/xtensa/xtensa.c (xtensa_emit_move_sequence): Add
12704 'CONST_INT_P (src)' to the condition of the block that tries to
12705 eliminate literal when loading integer contant.
12707 2021-09-07 David Faust <david.faust@oracle.com>
12709 * doc/extend.texi (BPF Type Attributes) New node.
12710 Document new preserve_access_index attribute.
12711 Document new preserve_access_index builtin.
12712 * doc/invoke.texi: Document -mco-re and -mno-co-re options.
12714 2021-09-07 David Faust <david.faust@oracle.com>
12716 * config/bpf/bpf.c: Adjust includes.
12717 (bpf_handle_preserve_access_index_attribute): New function.
12718 (bpf_attribute_table): Use it here.
12719 (bpf_builtins): Add BPF_BUILTIN_PRESERVE_ACCESS_INDEX.
12720 (bpf_option_override): Handle "-mco-re" option.
12721 (bpf_asm_init_sections): New.
12722 (TARGET_ASM_INIT_SECTIONS): Redefine.
12723 (bpf_file_end): New.
12724 (TARGET_ASM_FILE_END): Redefine.
12725 (bpf_init_builtins): Add "__builtin_preserve_access_index".
12726 (bpf_core_compute, bpf_core_get_index): New.
12727 (is_attr_preserve_access): New.
12728 (bpf_expand_builtin): Handle new builtins.
12729 (bpf_core_newdecl, bpf_core_is_maybe_aggregate_access): New.
12730 (bpf_core_walk): New.
12731 (bpf_resolve_overloaded_builtin): New.
12732 (TARGET_RESOLVE_OVERLOADED_BUILTIN): Redefine.
12733 (handle_attr): New.
12734 (pass_bpf_core_attr): New RTL pass.
12735 * config/bpf/bpf-passes.def: New file.
12736 * config/bpf/bpf-protos.h (make_pass_bpf_core_attr): New.
12737 * config/bpf/coreout.c: New file.
12738 * config/bpf/coreout.h: Likewise.
12739 * config/bpf/t-bpf (TM_H): Add $(srcdir)/config/bpf/coreout.h.
12740 (coreout.o): New rule.
12741 (PASSES_EXTRA): Add $(srcdir)/config/bpf/bpf-passes.def.
12742 * config.gcc (bpf): Add coreout.h to extra_headers.
12743 Add coreout.o to extra_objs.
12744 Add $(srcdir)/config/bpf/coreout.c to target_gtfiles.
12746 2021-09-07 David Faust <david.faust@oracle.com>
12748 * btfout.c (get_btf_id): Function is no longer static.
12749 * ctfc.h: Expose it here.
12751 2021-09-07 David Faust <david.faust@oracle.com>
12753 * ctfc.c (ctf_lookup_tree_type): New function.
12754 * ctfc.h: Likewise.
12756 2021-09-07 David Faust <david.faust@oracle.com>
12758 * ctfc.c (ctf_dtd_lookup): Function is no longer static.
12759 * ctfc.h: Analogous change.
12761 2021-09-07 David Faust <david.faust@oracle.com>
12763 * dwarf2out.c (lookup_type_die): Function is no longer static.
12764 * dwarf2out.h: Expose it here.
12766 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12768 * dwarf2ctf.c (ctf_debug_finalize): Make it static.
12769 (ctf_debug_early_finish): New definition.
12770 (ctf_debug_finish): Likewise.
12771 * dwarf2ctf.h (ctf_debug_finalize): Remove declaration.
12772 (ctf_debug_early_finish): New declaration.
12773 (ctf_debug_finish): Likewise.
12774 * dwarf2out.c (dwarf2out_finish): Invoke ctf_debug_finish.
12775 (dwarf2out_early_finish): Invoke ctf_debug_early_finish.
12777 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12779 * config/bpf/bpf.c (bpf_option_override): For BPF backend, disable LTO
12780 support when compiling for CO-RE.
12781 * config/bpf/bpf.opt: Add new command line option -mco-re.
12783 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12785 * flag-types.h (enum debug_info_type): Add new enum
12786 DINFO_TYPE_BTF_WITH_CORE.
12787 (BTF_WITH_CORE_DEBUG): New bitmask.
12788 * flags.h (btf_with_core_debuginfo_p): New declaration.
12789 * opts.c (btf_with_core_debuginfo_p): New definition.
12791 2021-09-07 Jason Merrill <jason@redhat.com>
12793 * tree.h (error_operand_p): Change to inline function.
12795 2021-09-07 Aldy Hernandez <aldyh@redhat.com>
12797 * tree-ssa-threadedge.c (forwarder_block_p): Rename to...
12798 (empty_block_with_phis_p): ...this.
12799 (potentially_threadable_block): Same.
12800 (jump_threader::thread_through_normal_block): Same.
12802 2021-09-07 Eric Botcazou <ebotcazou@adacore.com>
12805 * dwarf2out.c (mark_base_types): New overloaded function.
12806 (dwarf2out_early_finish): Invoke it on the COMDAT type list as well
12807 as the compilation unit, and call move_marked_base_types afterward.
12809 2021-09-07 H.J. Lu <hjl.tools@gmail.com>
12812 * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse):
12814 (ix86_expand_vector_convert_uns_vsivsf): Likewise.
12816 2021-09-07 Richard Biener <rguenther@suse.de>
12818 PR tree-optimization/102226
12819 * tree-vect-loop.c (vect_transform_cycle_phi): Record
12820 the converted value for the epilogue PHI use.
12822 2021-09-07 Martin Liska <mliska@suse.cz>
12824 PR gcov-profile/80223
12825 * ipa-inline.c (can_inline_edge_p): Similarly to sanitizer
12826 options, do not inline when no_profile_instrument_function
12827 attributes are different in early inliner. It's fine to inline
12828 it after PGO instrumentation.
12830 2021-09-07 Richard Biener <rguenther@suse.de>
12832 PR tree-optimization/101555
12833 * tree-ssa-pre.c (translate_vuse_through_block): Do not
12834 perform an alias walk to determine the validity of the
12835 mem at the start of the block which is already guaranteed
12836 by means of prune_clobbered_mems.
12837 (phi_translate_1): Pass edge to translate_vuse_through_block.
12839 2021-09-07 Xionghu Luo <luoxhu@linux.ibm.com>
12842 * config/rs6000/rs6000.md (fmod<mode>3): New define_expand.
12843 (remainder<mode>3): Likewise.
12845 2021-09-07 YunQiang Su <yunqiang.su@cipunited.com>
12847 * config/mips/mips.c (mips_file_start): add .module for
12850 2021-09-06 Roger Sayle <roger@nextmovesoftware.com>
12852 * wide-int.cc (wi::clz): Reorder tests to ensure the result
12853 is zero for all negative values.
12855 2021-09-06 Tobias Burnus <tobias@codesourcery.com>
12857 * doc/invoke.texi (-foffload-options): Fix @opindex.
12859 2021-09-06 H.J. Lu <hjl.tools@gmail.com>
12862 * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2].
12863 * config/i386/i386.md (@xorsign<mode>3_1): Add non-destructive
12864 source alternative for AVX.
12866 2021-09-06 liuhongt <hongtao.liu@intel.com>
12868 PR middle-end/102182
12869 * optabs.c (expand_fix): Add from1 to avoid from being
12872 2021-09-06 Eric Botcazou <ebotcazou@adacore.com>
12874 * dwarf2out.c (modified_type_die): Deal with all array types earlier
12875 and use local variable consistently throughout the function.
12877 2021-09-06 Jakub Jelinek <jakub@redhat.com>
12879 PR tree-optimization/102207
12880 * match.pd: Don't demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
12881 were promoted from signed to wider unsigned type.
12883 2021-09-06 Andrew Pinski <apinski@marvell.com>
12885 PR tree-optimization/63184
12886 * match.pd: Add simplification of pointer_diff of two pointer_plus
12887 with addr_expr in the first operand of each pointer_plus.
12888 Add simplificatoin of ne/eq of two pointer_plus with addr_expr
12889 in the first operand of each pointer_plus.
12891 2021-09-06 Richard Biener <rguenther@suse.de>
12893 PR tree-optimization/102176
12894 * tree-vect-slp.c (vect_slp_gather_vectorized_scalar_stmts):
12896 (vect_bb_slp_scalar_cost): Use the computed set of
12897 vectorized scalar stmts instead of relying on the out-of-date
12898 and not accurate PURE_SLP_STMT.
12899 (vect_bb_vectorization_profitable_p): Compute the set
12900 of vectorized scalar stmts.
12902 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12904 * gimple-range-path.cc (path_range_query::range_of_stmt): Remove
12905 GIMPLE_COND special casing.
12906 (path_range_query::range_defined_in_block): Use range_of_stmt
12907 instead of calling fold_range directly.
12909 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12911 * gimple-range-path.cc (path_range_query::range_of_expr): Set
12912 m_undefined_path when appropriate.
12913 (path_range_query::internal_range_of_expr): Copy from range_of_expr.
12914 (path_range_query::unreachable_path_p): New.
12915 (path_range_query::precompute_ranges): Set m_undefined_path.
12916 * gimple-range-path.h (path_range_query::unreachable_path_p): New.
12917 (path_range_query::internal_range_of_expr): New.
12918 * tree-ssa-threadbackward.c (back_threader::find_taken_edge_cond):
12919 Use unreachable_path_p.
12921 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12923 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
12924 Remove argument and call find_taken_edge.
12925 (back_threader::resolve_phi): Do not calculate taken edge before
12926 calling maybe_register_path.
12927 (back_threader::find_paths_to_names): Same.
12929 2021-09-05 Jeff Law <jlaw@localhost.localdomain>
12931 * config/h8300/h8300.md (QHSI2 mode iterator): New mode iterator.
12932 * config/h8300/testcompare.md (store_c): Update name, use new
12934 (store_neg_c, store_shifted_c): New patterns.
12936 2021-09-03 Segher Boessenkool <segher@kernel.crashing.org>
12939 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): On ELFv2 use r11
12940 instead of r12 for CR save, in all cases.
12942 2021-09-03 Andrew Pinski <apinski@marvell.com>
12944 * config/aarch64/aarch64-sve-builtins.cc (register_vector_type):
12945 Handle error_mark_node as the type of the type_decl.
12947 2021-09-03 Andrew Pinski <apinski@marvell.com>
12949 * config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
12951 (aarch64_simd_types): Likewise.
12952 (aarch64_simd_intOI_type_node): Likewise.
12953 (aarch64_simd_intCI_type_node): Likewise.
12954 (aarch64_simd_intXI_type_node): Likewise.
12955 * config/aarch64/aarch64.h (aarch64_fp16_type_node): Likewise.
12956 (aarch64_fp16_ptr_type_node): Likewise.
12957 (aarch64_bf16_type_node): Likewise.
12958 (aarch64_bf16_ptr_type_node): Likewise.
12960 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12962 * range-op.cc (operator_minus::op1_op2_relation_effect): Abstract
12964 (minus_op1_op2_relation_effect): ...here.
12965 (class operator_pointer_diff): New.
12966 (operator_pointer_diff::op1_op2_relation_effect): Call
12967 minus_op1_op2_relation_effect.
12968 (integral_table::integral_table): Add entry for POINTER_DIFF_EXPR.
12970 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12972 * tree-ssa-threadbackward.c (back_threader::thread_through_all_blocks):
12973 Add may_peel_loop_headers.
12974 (back_threader_registry::thread_through_all_blocks): Same.
12975 (try_thread_blocks): Pass may_peel_loop_headers argument.
12976 (pass_early_thread_jumps::execute): Same.
12978 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12980 * tree-ssa-threadedge.c (has_phis_p): New.
12981 (forwarder_block_p): New.
12982 (potentially_threadable_block): Call forwarder_block_p.
12983 (jump_threader::thread_around_empty_blocks): Call has_phis_p.
12984 (jump_threader::thread_through_normal_block): Call
12987 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12989 * tree-ssa-threadbackward.c (back_threader::dump): New.
12990 (back_threader::debug): New.
12991 (back_threader_profitability::profitable_path_p): Dump blocks
12992 even if we are bailing early.
12994 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12996 * tree-ssa-threadupdate.c (cancel_thread): New.
12997 (jump_thread_path_registry::thread_block_1): Use cancel_thread.
12998 (jump_thread_path_registry::mark_threaded_blocks): Same.
12999 (jump_thread_path_registry::register_jump_thread): Same.
13001 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13003 * tree-ssa-threadedge.c (jt_state::push): Only call methods for
13004 which objects are available.
13005 (jt_state::pop): Same.
13006 (jt_state::register_equiv): Same.
13007 (jt_state::register_equivs_on_edge): Same.
13009 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13011 * tree-ssa-threadedge.c (jump_threader::thread_across_edge):
13012 Move pop until after a thread is registered.
13014 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13016 * tree-ssa-threadupdate.c (debug): New.
13018 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13020 * gimple-range-trace.cc (push_dump_file::push_dump_file): New.
13021 (push_dump_file::~push_dump_file): New.
13022 (dump_ranger): Change dump_file temporarily while dumping
13024 * gimple-range-trace.h (class push_dump_file): New.
13026 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13028 * gimple-range-trace.cc (debug_seed_ranger): Remove static.
13029 (dump_ranger): Dump function name.
13031 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13033 * gimple-range-path.cc (path_range_query::range_defined_in_block):
13034 Adjust for non-null.
13035 (path_range_query::adjust_for_non_null_uses): New.
13036 (path_range_query::precompute_ranges): Call
13037 adjust_for_non_null_uses.
13038 * gimple-range-path.h: Add m_non_null and
13039 adjust_for_non_null_uses.
13041 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13043 * gimple-range-path.cc (path_range_query::dump): Dump path
13045 (path_range_query::precompute_ranges): Dump entire path.
13047 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13049 * value-relation.cc (relation_oracle::debug): New.
13050 * value-relation.h (relation_oracle::debug): New.
13052 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13054 * tree-ssa-loop-ch.c: Remove unnecessary include file.
13056 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13058 * gimple-range-fold.cc (fold_using_range::postfold_gcond_edges):
13059 Skip statements with no defining BB.
13060 * gimple-range-path.cc (path_range_query::range_defined_in_block):
13061 Do not get confused by statements with no defining BB.
13063 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
13065 * gimple-range-fold.cc (adjust_imagpart_expr): Move from
13066 gimple_range_adjustment. Add support for constants.
13067 (adjust_realpart_expr): New.
13068 (gimple_range_adjustment): Move IMAGPART_EXPR code to
13069 adjust_imagpart_expr.
13070 * range-op.cc (integral_table::integral_table): Add entry for
13073 2021-09-03 Jakub Jelinek <jakub@redhat.com>
13075 * omp-expand.c (expand_omp_atomic_pipeline): Use
13076 IFN_ATOMIC_COMPARE_EXCHANGE instead of
13077 BUILT_IN_SYNC_VAL_COMPARE_AND_SWAP_? so that memory order
13080 2021-09-03 Jakub Jelinek <jakub@redhat.com>
13083 * tree.h (DECL_FIELD_ABI_IGNORED): Changed into rvalue only macro
13084 that is false if DECL_BIT_FIELD.
13085 (SET_DECL_FIELD_ABI_IGNORED, DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD,
13086 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD): Define.
13087 * tree-streamer-out.c (pack_ts_decl_common_value_fields): For
13088 DECL_BIT_FIELD stream DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead
13089 of DECL_FIELD_ABI_IGNORED.
13090 * tree-streamer-in.c (unpack_ts_decl_common_value_fields): Use
13091 SET_DECL_FIELD_ABI_IGNORED instead of writing to
13092 DECL_FIELD_ABI_IGNORED and for DECL_BIT_FIELD use
13093 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead.
13094 * lto-streamer-out.c (hash_tree): For DECL_BIT_FIELD hash
13095 DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead of DECL_FIELD_ABI_IGNORED.
13097 2021-09-03 liuhongt <hongtao.liu@intel.com>
13100 * config/i386/amxbf16intrin.h : Remove macro check for __AMX_BF16__.
13101 * config/i386/amxint8intrin.h : Remove macro check for __AMX_INT8__.
13102 * config/i386/amxtileintrin.h : Remove macro check for __AMX_TILE__.
13104 2021-09-02 Martin Sebor <msebor@redhat.com>
13106 PR tree-optimization/17506
13108 * tree-ssa-uninit.c (warn_uninit): Remove conditional guarding note.
13110 2021-09-02 Richard Biener <rguenther@suse.de>
13112 * tree-ssa-loop-im.c (fill_always_executed_in_1): Refine
13113 fix for PR78185 and continue processing when leaving
13114 finite inner loops.
13116 2021-09-02 Jakub Jelinek <jakub@redhat.com>
13118 PR tree-optimization/99591
13119 * match.pd: Demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
13122 2021-09-02 Richard Biener <rguenther@suse.de>
13125 2021-09-02 Richard Biener <rguenther@suse.de>
13127 PR tree-optimization/102155
13128 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
13129 over a part of the RPO array and do not recurse here.
13130 Dump blocks marked as always executed.
13131 (fill_always_executed_in): Walk over the RPO array and
13132 process loops whose header we run into.
13133 (loop_invariant_motion_in_fun): Compute the first RPO
13134 using rev_post_order_and_mark_dfs_back_seme in iteration
13135 order and pass that to fill_always_executed_in.
13137 2021-09-02 liuhongt <hongtao.liu@intel.com>
13139 * config/i386/i386-modes.def (FLOAT_MODE): Define ieee HFmode.
13140 * config/i386/i386.c (enum x86_64_reg_class): Add
13141 X86_64_SSEHF_CLASS.
13142 (merge_classes): Handle X86_64_SSEHF_CLASS.
13143 (examine_argument): Ditto.
13144 (construct_container): Ditto.
13145 (classify_argument): Ditto, and set HFmode/HCmode to
13146 X86_64_SSEHF_CLASS.
13147 (function_value_32): Return _FLoat16/Complex Float16 by
13149 (function_value_64): Return _Float16/Complex Float16 by SSE
13151 (ix86_print_operand): Handle CONST_DOUBLE HFmode.
13152 (ix86_secondary_reload): Require gpr as intermediate register
13153 to store _Float16 from sse register when sse4 is not
13155 (ix86_libgcc_floating_mode_supported_p): Enable _FLoat16 under
13157 (ix86_scalar_mode_supported_p): Ditto.
13158 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Defined.
13159 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add HFmode.
13160 (VALID_INT_MODE_P): Add HFmode and HCmode.
13161 * config/i386/i386.md (*pushhf_rex64): New define_insn.
13163 (*movhf_internal): Ditto.
13164 * doc/extend.texi (Half-Precision Floating Point): Documemt
13167 2021-09-02 Richard Biener <rguenther@suse.de>
13169 PR tree-optimization/102155
13170 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
13171 over a part of the RPO array and do not recurse here.
13172 Dump blocks marked as always executed.
13173 (fill_always_executed_in): Walk over the RPO array and
13174 process loops whose header we run into.
13175 (loop_invariant_motion_in_fun): Compute the first RPO
13176 using rev_post_order_and_mark_dfs_back_seme in iteration
13177 order and pass that to fill_always_executed_in.
13179 2021-09-02 YunQiang Su <syq@debian.org>
13182 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13184 * config/mips/mips.c (mips_module_isa_name): New.
13185 mips_file_start: add .module mipsREV to all asm output
13187 2021-09-01 Jeff Law <jlaw@localhost.localdomain>
13189 PR tree-optimization/102152
13190 * tree-ssa-dom.c (dom_opt_dom_walker::optimize_stmt): Reduce a vector
13191 comparison to a scalar comparison before calling
13192 update_stmt_if_modified.
13194 2021-09-01 Andrew Pinski <apinski@marvell.com>
13197 * config/aarch64/aarch64.c (aarch64_expand_setmem):
13198 Check STRICT_ALIGNMENT before creating an overlapping
13201 2021-09-01 Martin Sebor <msebor@redhat.com>
13203 * gimple-ssa-warn-access.cc (get_size_range): Add argument.
13204 (check_access): Pass additional argument.
13205 (check_memop_access): Remove template and make a member function.
13206 (maybe_check_dealloc_call): Make a pass_waccess member function.
13207 (class pass_waccess): Add, rename, and remove members.
13208 (pass_waccess::pass_waccess): Adjust to name change.
13209 (pass_waccess::~pass_waccess): Same.
13210 (check_alloca): Make a member function.
13211 (check_alloc_size_call): Same.
13212 (check_strcat): Same.
13213 (check_strncat): Same.
13214 (check_stxcpy): Same.
13215 (check_stxncpy): Same.
13216 (check_strncmp): Same.
13217 (maybe_warn_rdwr_sizes): Rename...
13218 (pass_waccess::maybe_check_access_sizes): ...to this.
13219 (pass_waccess::check_call): Adjust to name changes.
13220 (pass_waccess::maybe_check_dealloc_call): Make a pass_waccess member
13222 (pass_waccess::execute): Adjust to name changes.
13223 * gimple-ssa-warn-access.h (check_memop_access): Remove.
13224 * pointer-query.cc (access_ref::phi): Handle null pointer.
13225 (access_ref::inform_access): Same.
13226 (pointer_query::put_ref): Modify a cached value, not a copy of it.
13227 (pointer_query::dump): New function.
13228 (compute_objsize_r): Avoid overwriting access_ref::bndrng. Cache
13230 * pointer-query.h (pointer_query::dump): Declare.
13231 * tree-ssa-strlen.c (get_range): Simplify. Use function query.
13232 (dump_strlen_info): Use function query.
13233 (printf_strlen_execute): Factor code out into pointer_query::put_ref.
13235 2021-09-01 Thomas Schwinge <thomas@codesourcery.com>
13237 * tree.c (walk_tree_1) <OMP_CLAUSE>: Simplify.
13239 2021-09-01 Iain Sandoe <iain@sandoe.co.uk>
13241 * doc/extend.texi: Document unavailable attribute.
13242 * print-tree.c (print_node): Handle unavailable attribute.
13243 * tree-core.h (struct tree_base): Add a bit to carry unavailability.
13244 * tree.c (error_unavailable_use): New.
13245 * tree.h (TREE_UNAVAILABLE): New.
13246 (error_unavailable_use): New.
13248 2021-09-01 Jakub Jelinek <jakub@redhat.com>
13250 PR tree-optimization/102124
13251 * tree-vect-patterns.c (vect_recog_widen_op_pattern): For ORIG_CODE
13252 MINUS_EXPR, if itype is unsigned with smaller precision than type,
13253 add an extra cast to signed variant of itype to ensure sign-extension.
13255 2021-09-01 Martin Liska <mliska@suse.cz>
13257 * graph.c (draw_cfg_node_succ_edges): Do not color fallthru
13258 edges and rather use colors for TRUE and FALSE edges.
13260 2021-09-01 Richard Biener <rguenther@suse.de>
13262 PR tree-optimization/93491
13263 * tree-ssa-pre.c (compute_avail): Set BB_MAY_NOTRETURN
13264 after processing the stmt itself. Do not consider
13265 pure functions possibly not returning. Properly avoid
13266 adding possibly trapping calls to EXP_GEN when there's
13267 a preceeding possibly not returning call.
13268 * tree-ssa-sccvn.c (vn_reference_may_trap): Conservatively
13271 2021-09-01 Richard Biener <rguenther@suse.de>
13273 PR tree-optimization/102139
13274 * tree-vectorizer.h (vec_base_alignments): Adjust hash-map
13275 type to record a std::pair of the stmt-info and the innermost
13277 (dr_vec_info::group): New member.
13278 * tree-vect-data-refs.c (vect_record_base_alignment): Adjust.
13279 (vect_compute_data_ref_alignment): Verify the recorded
13280 base alignment can be used.
13281 (data_ref_pair): Remove.
13282 (dr_group_sort_cmp): Adjust.
13283 (vect_analyze_data_ref_accesses): Store the group-ID in the
13284 dr_vec_info and operate on a vector of dr_vec_infos.
13286 2021-09-01 YunQiang Su <yunqiang.su@cipunited.com>
13288 * read-md.c (md_reader::handle_enum): support value assignation.
13289 * doc/md.texi: record define_c_enum value assignation support.
13291 2021-09-01 Jakub Jelinek <jakub@redhat.com>
13293 PR tree-optimization/102141
13294 * gimple-ssa-store-merging.c (bswap_view_convert): Add BEFORE
13295 argument. If false, emit stmts after gsi instead of before, and
13297 (bswap_replace): Adjust callers. When converting output of bswap,
13298 emit VIEW_CONVERT prepratation stmts after a copy of gsi instead
13301 2021-09-01 liuhongt <hongtao.liu@intel.com>
13303 * emit-rtl.c (validate_subreg): Get rid of all float-int
13306 2021-09-01 liuhongt <hongtao.liu@intel.com>
13309 2021-08-30 liuhongt <hongtao.liu@intel.com>
13311 * expmed.c (extract_bit_field_1): Make sure we're playing with
13312 integral modes before call extract_integral_bit_field.
13313 (extract_integral_bit_field): Add a parameter of type
13314 scalar_int_mode which corresponds to of tmode.
13315 And call extract_and_convert_fixed_bit_field instead of
13316 extract_fixed_bit_field and convert_extracted_bit_field.
13317 (extract_and_convert_fixed_bit_field): New function, it's a
13318 combination of extract_fixed_bit_field and
13319 convert_extracted_bit_field.
13321 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
13323 * tree.c (walk_tree_1) <OMP_CLAUSE_TILE>: Handle three operands.
13325 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
13327 * omp-general.h (omp_is_reference): Rename to...
13328 (omp_privatize_by_reference): ... this. Adjust all users...
13329 * omp-general.c: ... here, ...
13330 * gimplify.c: ... here, ...
13331 * omp-expand.c: ... here, ...
13332 * omp-low.c: ... here.
13334 2021-08-31 Martin Sebor <msebor@redhat.com>
13336 * gimple-ssa-warn-access.cc (maybe_warn_alloc_args_overflow): Test
13337 pointer element for equality to zero, not that of the cotaining
13340 2021-08-31 Martin Sebor <msebor@redhat.com>
13342 * gcc-rich-location.h (gcc_rich_location): Make ctor explicit.
13344 2021-08-31 Martin Sebor <msebor@redhat.com>
13346 * function.h (function): Add comments.
13347 (get_range_query): Same. Add attribute returns nonnull.
13349 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13351 * expr.c (convert_modes): Don't use subreg_promoted_mode on a
13352 SUBREG if it can't be guaranteed to a SUBREG_PROMOTED_VAR_P set.
13353 Instead use the standard (safer) is_a <scalar_int_mode> idiom.
13355 2021-08-31 Jeff Law <jlaw@localhost.localdomain>
13357 * config.gcc (cris-*-elf, cris-*-none): Remove dbxelf.h from
13359 (m32r-*-elf, m32rle-*-elf, m32r-*-linux): Likewise.
13360 (mn10300-*-*, am33_2.0-*-linux*): Likewise.
13361 (xtensa*-*-elf, xtensa*-*-linux, xtensa*-*-uclinux): Likewise.
13362 (m32c-*-elf*, m32c-*-rtems*): Likewise.
13363 * config/cris/cris.h (DBX_NO_XREFS): Remove.
13364 (DBX_CONTIN_LENGTH, DBX_CONTIN_CHAR): Likewise.
13365 * config/m32r/m32r.h (DBXOUT_SOURCE_LINE): Likewise.
13366 (DBX_DEBUGGING_INFO, DBX_CONTIN_LENGTH): Likewise.
13367 * config/mn10300/mn10300.h (DEFAULT_GDB_EXTENSIONS): Likewise.
13368 * config/mn10300/linux.h (DBX_REGISTER_NAMES): Likewise.
13370 2021-08-31 Marcel Vollweiler <marcel@codesourcery.com>
13372 * gimplify.c (gimplify_scan_omp_clauses): Error handling. 'ancestor' only
13373 allowed on target constructs and only with particular other clauses.
13374 * omp-expand.c (expand_omp_target): Output of 'sorry, not supported' if
13375 'ancestor' is used.
13376 * omp-low.c (check_omp_nesting_restrictions): Error handling. No nested OpenMP
13377 structs when 'ancestor' is used.
13378 (scan_omp_1_stmt): No usage of OpenMP runtime routines in a target region when
13379 'ancestor' is used.
13380 * tree-pretty-print.c (dump_omp_clause): Append 'ancestor'.
13381 * tree.h (OMP_CLAUSE_DEVICE_ANCESTOR): Define macro.
13383 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13385 * expr.c (convert_modes): Preserve SUBREG_PROMOTED_VAR_P when
13386 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
13388 * simplify-rtx.c (simplify_unary_operation_1) [SIGN_EXTEND]:
13389 Likewise, preserve SUBREG_PROMOTED_VAR_P when creating a (wider)
13390 partial subreg from a SUBREG_PROMOTED_VAR_P subreg. Generate
13391 SIGN_EXTEND of the SUBREG_REG when a subreg would be paradoxical.
13392 [ZERO_EXTEND]: Likewise, preserve SUBREG_PROMOTED_VAR_P when
13393 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
13394 subreg. Generate ZERO_EXTEND of the SUBREG_REG when a subreg
13395 would be paradoxical.
13397 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13399 * combine.c (combine_simplify_rtx): Avoid converting an explicit
13400 TRUNCATE into a lowpart SUBREG on !TRULY_NOOP_TRUNCATION targets.
13401 * simplify-rtx.c (simplify_unary_operation_1): Likewise.
13403 2021-08-31 Richard Biener <rguenther@suse.de>
13405 PR tree-optimization/102142
13406 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Fix
13407 condition under which to unset the visited flag.
13409 2021-08-31 Richard Biener <rguenther@suse.de>
13411 PR middle-end/102129
13412 * tree-ssa-ter.c (find_replaceable_in_bb): Do not move
13413 possibly trapping expressions across calls.
13415 2021-08-31 Jakub Jelinek <jakub@redhat.com>
13417 PR tree-optimization/102134
13418 * tree-ssa-ccp.c (bit_value_binop) <case RSHIFT_EXPR>: If sgn is
13419 UNSIGNED and r1val | r1mask has MSB set, ensure lzcount doesn't
13422 2021-08-31 Andrew Pinski <apinski@marvell.com>
13425 * collect-utils.c (setup_signals): New declaration.
13426 * collect-utils.h (setup_signals): New function.
13427 * collect2.c (handler): Delete.
13428 (main): Instead of manually setting up the signals,
13429 just call setup_signals.
13430 * lto-wrapper.c (main): Likewise.
13432 2021-08-31 Andrew Pinski <apinski@marvell.com>
13435 * config/i386/i386-protos.h (x86_output_aligned_bss):
13436 Change align argument to unsigned type.
13437 (x86_elf_aligned_decl_common): Likewise.
13438 * config/i386/i386.c (x86_elf_aligned_decl_common): Likewise.
13439 (x86_output_aligned_bss): Likewise.
13441 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13443 * config/mips/mips.c (mips_module_isa_name): New.
13444 mips_file_start: add .module mipsREV to all asm output
13446 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13448 * config/mips/mips.h (struct mips_cpu_info): define enum mips_isa;
13449 use enum instead of int for 'isa' member.
13450 * config.gcc, config/mips/mips.c, config/mips/mips-cpus.def,
13451 config/mips/netbsd.h: replace hardcoded numbers with enum.
13453 2021-08-31 liuhongt <hongtao.liu@intel.com>
13455 * config/i386/sse.md (*<avx512>_ucmp<mode>3_1): Change from
13456 define_split to define_insn_and_split.
13457 (*avx2_eq<mode>3): Removed.
13458 (<avx512>_eq<mode>3<mask_scalar_merge_name>): Adjust pattern
13459 (<avx512>_eq<mode>3<mask_scalar_merge_name>_1): Rename to ..
13460 (*<avx512>_eq<mode>3<mask_scalar_merge_name>_1): .. this, and
13462 (*avx2_gt<mode>3): Removed.
13463 (<avx512>_gt<mode>3<mask_scalar_merge_name>): Change from
13464 define_insn to define_expand, and adjust pattern.
13465 (UNSPEC_MASKED_EQ, UNSPEC_MASKED_GT): Removed.
13467 2021-08-30 David Malcolm <dmalcolm@redhat.com>
13470 * Makefile.in (ANALYZER_OBJS): Add analyzer/call-info.o.
13472 2021-08-30 Jason Merrill <jason@redhat.com>
13474 * doc/invoke.texi: Document -Wmissing-requires.
13476 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13478 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Remove
13479 TARGET_EXTRA_BUILTINS guard.
13481 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13483 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Change
13484 initialization of V2DI_type_node and unsigned_V2DI_type_node.
13486 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13488 * config/rs6000/darwin.h (SUBTARGET_INIT_BUILTINS): Use the new
13489 decl when new_builtins_are_live.
13490 * config/rs6000/rs6000-builtin-new.def (__builtin_cfstring): New
13493 2021-08-30 Pat Haugen <pthaugen@linux.ibm.com>
13495 * config/rs6000/rs6000-cpus.def (ISA_3_1_MASKS_SERVER): Add
13496 OPTION_MASK_P10_FUSION_2STORE.
13497 (POWERPC_MASKS): Likewise.
13498 * config/rs6000/rs6000.c (rs6000_option_override_internal): Enable
13499 store fusion for Power10.
13500 (is_fusable_store): New.
13501 (power10_sched_reorder): Likewise.
13502 (rs6000_sched_reorder): Do Power10 specific reordering.
13503 (rs6000_sched_reorder2): Likewise.
13504 * config/rs6000/rs6000.opt: Add new option.
13506 2021-08-30 Richard Biener <rguenther@suse.de>
13508 PR tree-optimization/102128
13509 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
13510 Move scanning for if-converted scalar code to the caller
13511 and instead delay clearing the visited flag for profitable
13513 (vect_slp_region): Cost all subgraphs before scheduling.
13514 For if-converted BB vectorization scan for scalar COND_EXPRs
13515 and do not vectorize if any found and the cost model is
13518 2021-08-30 Richard Biener <rguenther@suse.de>
13520 * common.opt (fexceptions): Mark
13521 EnabledBy(fnon-call-exceptions).
13522 * doc/invoke.texi (fnon-call-exceptions): Document this
13523 enables -fexceptions.
13525 2021-08-30 Sebastian Huber <sebastian.huber@embedded-brains.de>
13527 * tsystem.h (abort): Define abort() if inhibit_libc is defined and it
13528 is not already defined.
13530 2021-08-30 liuhongt <hongtao.liu@intel.com>
13532 * expmed.c (extract_bit_field_1): Make sure we're playing with
13533 integral modes before call extract_integral_bit_field.
13534 (extract_integral_bit_field): Add a parameter of type
13535 scalar_int_mode which corresponds to of tmode.
13536 And call extract_and_convert_fixed_bit_field instead of
13537 extract_fixed_bit_field and convert_extracted_bit_field.
13538 (extract_and_convert_fixed_bit_field): New function, it's a
13539 combination of extract_fixed_bit_field and
13540 convert_extracted_bit_field.
13542 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
13544 * config/darwin.c (darwin_libc_has_function): Do not run
13545 the checks for x86 or modern Darwin. Make sure that there
13546 is a value set for darwin_macosx_version_min before testing.
13548 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
13550 * config/i386/darwin.h (CLEAR_INSN_CACHE): New.
13552 2021-08-28 Jan Hubicka <hubicka@ucw.cz>
13554 * ipa-modref-tree.h (modref_access_node::merge): Break out
13555 logic combining offsets and logic merging ranges to ...
13556 (modref_access_node::combined_offsets): ... here
13557 (modref_access_node::update2): ... here
13558 (modref_access_node::closer_pair_p): New member function.
13559 (modref_access_node::forced_merge): New member function.
13560 (modre_ref_node::insert): Do merging when table is full.
13562 2021-08-28 YunQiang Su <yunqiang.su@cipunited.com>
13565 * config.gcc: MIPS: use N64 ABI by default if the triple end
13566 with -gnuabi64, which is used by Debian since 2013.
13568 2021-08-28 Alexandre Oliva <oliva@adacore.com>
13570 * ipa-modref.c (analyze_function): Skip debug stmts.
13571 * tree-inline.c (estimate_num_insn): Consider builtins even
13572 without a cgraph_node.
13574 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
13576 * config/h8300/bitfield.md (cstore<mode>4): Remove expander.
13577 * config/h8300/h8300.c (h8300_expand_branch): Remove function.
13578 * config/h8300/h8300-protos.h (h8300_expadn_branch): Remove prototype.
13579 * config/h8300/h8300.md (eqne): New code iterator.
13580 (geultu, geultu_to_c): Similarly.
13581 * config/h8300/testcompare.md (cstore<mode>4): Dummy expander.
13582 (store_c_<mode>, store_c_i_<mode>): New define_insn_and_splits
13583 (cmp<mode>_c): New pattern
13585 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
13587 * tree-ssa-dom.c (reduce_vector_comparison_to_scalar_comparison): New
13589 (dom_opt_dom_walker::optimize_stmt): Use it.
13591 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
13593 * config/darwin.c (finalize_ctors): Add a section-start linker-
13595 (finalize_dtors): Likewise.
13596 * config/darwin.h (MIN_LD64_INIT_TERM_START_LABELS): New.
13598 2021-08-27 Bill Schmidt <wschmidt@linux.ibm.com>
13600 * config/rs6000/rs6000-call.c (rs6000-builtins.h): New #include.
13601 (rs6000_init_builtins): Call rs6000_init_generated_builtins. Skip the
13602 old initialization logic when new builtins are enabled.
13603 * config/rs6000/rs6000-gen-builtins.c (write_decls): Rename
13604 rs6000_autoinit_builtins to rs6000_init_generated_builtins.
13605 (write_init_file): Likewise.
13607 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
13609 * configure.ac (darwin2[[0-9]]* | darwin19*): Alter use of
13610 gcc_GAS_CHECK_FEATURE to remove an extraneous parameter.
13611 (amdgcn-* | gcn-*) Likewise.
13613 2021-08-27 Anthony Sharp <anthonysharp15@gmail.com>
13615 * symbol-summary.h: Added missing template keyword.
13617 2021-08-27 Richard Biener <rguenther@suse.de>
13619 PR tree-optimization/45178
13620 * tree-ssa-dce.c (find_obviously_necessary_stmts): For
13621 infinite loops without exit do not mark control dependent
13622 edges of the latch necessary.
13624 2021-08-27 konglin1 <lingling.kong@intel.com>
13627 * config/i386/sse.md: (<avx512>scattersi<mode>): Add mask operand to
13629 (<avx512>scattersi<mode>): Likewise.
13630 (*avx512f_scattersi<VI48F:mode>): Merge mask operand to set_dest.
13631 (*avx512f_scatterdi<VI48F:mode>): Likewise
13633 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
13635 * config/rs6000/rs6000.c (rs6000_builtin_md_vectorized_function): Add
13636 support for built-in functions MISC_BUILTIN_DIVWE, MISC_BUILTIN_DIVWEU,
13637 MISC_BUILTIN_DIVDE, MISC_BUILTIN_DIVDEU, P10_BUILTIN_CFUGED,
13638 P10_BUILTIN_CNTLZDM, P10_BUILTIN_CNTTZDM, P10_BUILTIN_PDEPD and
13639 P10_BUILTIN_PEXTD on Power10.
13641 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
13643 * config/rs6000/rs6000-call.c (builtin_function_type): Add unsigned
13644 signedness for some Power10 bifs.
13646 2021-08-27 David Edelsohn <dje.gcc@gmail.com>
13649 * config/rs6000/rs6000.c (rs6000_adjust_field_align): Use
13650 computed alignment if the entire struct has attribute packed.
13652 2021-08-27 liuhongt <hongtao.liu@intel.com>
13656 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
13657 IX86_BUILTIN_SHUFPD512, IX86_BUILTIN_SHUFPS512,
13658 IX86_BUILTIN_SHUFPD256, IX86_BUILTIN_SHUFPS,
13659 IX86_BUILTIN_SHUFPS256.
13660 (ix86_masked_all_ones): New function.
13662 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
13664 * config/i386/i386.md (*btr<mode>_1): Call force_reg unconditionally.
13665 (conditional moves with memory inputs splitters): Ditto.
13666 * config/i386/sse.md (one_cmpl<mode>2): Simplify.
13668 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13670 * ipa-modref-tree.h (modref_access_node::try_merge_with): Restart
13671 search after merging.
13673 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13675 * config/rs6000/rs6000-overload.def: Add remaining overloads.
13677 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13679 * config/rs6000/rs6000-builtin-new.def: Add cell stanza.
13681 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13683 * config/rs6000/rs6000-builtin-new.def: Add ieee128-hw, dfp,
13684 crypto, and htm stanzas.
13686 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13688 * config/rs6000/rs6000-builtin-new.def: Add mma stanza.
13690 2021-08-26 Martin Sebor <msebor@redhat.com>
13692 * tree-ssa-uninit.c (warn_uninit): Refactor and simplify.
13693 (warn_uninit_phi_uses): Remove argument from calls to warn_uninit.
13694 (warn_uninitialized_vars): Same. Reduce visibility of locals.
13695 (warn_uninitialized_phi): Same.
13697 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
13699 * tree-ssa-ccp.c (get_individual_bits): Helper function to
13700 extract the individual bits from a widest_int constant (mask).
13701 (gray_code_bit_flips): New read-only table for effiently
13702 enumerating permutations/combinations of bits.
13703 (bit_value_binop) [LROTATE_EXPR, RROTATE_EXPR]: Handle rotates
13704 by unknown counts that are guaranteed less than the target
13705 precision and four or fewer unknown bits by enumeration.
13706 [LSHIFT_EXPR, RSHIFT_EXPR]: Likewise, also handle shifts by
13707 enumeration under the same conditions. Handle remaining
13708 shifts as a mask based upon the minimum possible shift value.
13710 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
13711 Richard Biener <rguenther@suse.de>
13713 * match.pd (shift transformations): Remove a redundant
13714 !POINTER_TYPE_P check.
13716 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
13719 * config/i386/i386.md (cmove reg-reg move elimination peephole2s):
13720 Set all_regs to true in the call to replace_rtx.
13722 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13724 * ipa-modref-tree.c (test_insert_search_collapse): Update test.
13725 * ipa-modref-tree.h (modref_base_node::insert): Be smarter when
13726 hiting --param modref-max-refs limit.
13727 (modref_tree:insert_base): Be smarter when hitting
13728 --param modref-max-bases limit. Add new parameter REF.
13729 (modref_tree:insert): Update.
13730 (modref_tree:merge): Update.
13731 * ipa-modref.c (read_modref_records): Update.
13733 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13735 * params.opt: (modref-max-adjustments): Add full stop.
13737 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13739 * ipa-modref-tree.h (modref_ref_node::verify): New member
13741 (modref_ref_node::insert): Use it.
13742 (modref_ref_node::try_mere_with): Fix off by one error.
13744 2021-08-26 Martin Liska <mliska@suse.cz>
13745 Stefan Kneifel <stefan.kneifel@bluewin.ch>
13747 * cgraph.h (create_version_clone_with_body): Add new parameter.
13748 * cgraphclones.c: Likewise.
13749 * multiple_target.c (create_dispatcher_calls): Do not use
13751 (create_target_clone): Likewise here.
13753 2021-08-26 Jonathan Yong <10walls@gmail.com>
13755 * doc/extend.texi: Add note about reserved priorities
13756 to the constructor attribute.
13758 2021-08-25 Martin Sebor <msebor@redhat.com>
13760 * gimple-range-cache.cc (ssa_global_cache::dump): Avoid printing
13761 range table header alone.
13762 * gimple-range.cc (gimple_ranger::export_global_ranges): Same.
13764 2021-08-25 Jan Hubicka <hubicka@ucw.cz>
13766 * doc/invoke.texi: Document --param modref-max-adjustments.
13767 * ipa-modref-tree.c (test_insert_search_collapse): Update.
13768 (test_merge): Update.
13769 * ipa-modref-tree.h (struct modref_access_node): Add adjustments;
13770 (modref_access_node::operator==): Fix handling of access ranges.
13771 (modref_access_node::contains): Constify parameter; handle also
13772 mismatched parm offsets.
13773 (modref_access_node::update): New function.
13774 (modref_access_node::merge): New function.
13775 (unspecified_modref_access_node): Update constructor.
13776 (modref_ref_node::insert_access): Add record_adjustments parameter;
13778 (modref_ref_node::try_merge_with): New private function.
13779 (modref_tree::insert): New record_adjustments parameter.
13780 (modref_tree::merge): New record_adjustments parameter.
13781 (modref_tree::copy_from): Update.
13782 * ipa-modref.c (dump_access): Dump adjustments field.
13783 (get_access): Update constructor.
13784 (record_access): Update call of insert.
13785 (record_access_lto): Update call of insert.
13786 (merge_call_side_effects): Add record_adjustments parameter.
13787 (get_access_for_fnspec): Update.
13788 (process_fnspec): Update.
13789 (analyze_call): Update.
13790 (analyze_function): Update.
13791 (read_modref_records): Update.
13792 (ipa_merge_modref_summary_after_inlining): Update.
13793 (propagate_unknown_call): Update.
13794 (modref_propagate_in_scc): Update.
13795 * params.opt (param-max-modref-adjustments=): New.
13797 2021-08-25 Michael Meissner <meissner@linux.ibm.com>
13799 * config/rs6000/vsx.md (UNSPEC_XXSPLTIDP): Rename from
13801 (xxspltiw_v4si): Use vecperm type attribute.
13802 (xxspltiw_v4si_inst): Use vecperm type attribute.
13803 (xxspltiw_v4sf_inst): Likewise.
13804 (xxspltidp_v2df): Use vecperm type attribute. Use
13805 UNSPEC_XXSPLTIDP instead of UNSPEC_XXSPLTID.
13806 (xxspltidp_v2df_inst): Likewise.
13807 (xxsplti32dx_v4si): Use vecperm type attribute.
13808 (xxsplti32dx_v4si_inst): Likewise.
13809 (xxsplti32dx_v4sf_inst): Likewise.
13810 (xxblend_<mode>): Likewise.
13811 (xxpermx): Likewise.
13812 (xxpermx_inst): Likewise.
13813 (xxeval): Likewise.
13815 2021-08-25 Lewis Hyatt <lhyatt@gmail.com>
13818 * coretypes.h (typedef diagnostic_input_charset_callback): Declare.
13819 * diagnostic.c (diagnostic_initialize_input_context): New function.
13820 * diagnostic.h (diagnostic_initialize_input_context): Declare.
13821 * input.c (default_charset_callback): New function.
13822 (file_cache::initialize_input_context): New function.
13823 (file_cache_slot::create): Added ability to convert the input
13824 according to the input context.
13825 (file_cache::file_cache): Initialize the new input context.
13826 (class file_cache_slot): Added new m_alloc_offset member.
13827 (file_cache_slot::file_cache_slot): Initialize the new member.
13828 (file_cache_slot::~file_cache_slot): Handle potentially offset buffer.
13829 (file_cache_slot::maybe_grow): Likewise.
13830 (file_cache_slot::needs_read_p): Handle NULL fp, which is now possible.
13831 (file_cache_slot::get_next_line): Likewise.
13832 * input.h (class file_cache): Added input context member.
13834 2021-08-25 Richard Biener <rguenther@suse.de>
13836 PR tree-optimization/102046
13837 * tree-vect-slp.c (vect_build_slp_tree_2): Conservatively
13838 update ->any_pattern when swapping operands.
13840 2021-08-25 Hongyu Wang <hongyu.wang@intel.com>
13843 * config/i386/i386.c (ix86_live_on_entry): Adjust comment.
13844 (ix86_decompose_address): Remove retval check for ASHIFT,
13845 allow non-canonical zero extend if AND mask covers ASHIFT
13847 (ix86_legitimate_address_p): Adjust condition for decompose.
13848 (ix86_rtx_costs): Adjust cost for lea with non-canonical
13850 Co-Authored by: Uros Bizjak <ubizjak@gmail.com>
13852 2021-08-25 Jiufu Guo <guojiufu@linux.ibm.com>
13854 PR tree-optimization/101145
13855 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
13857 (number_of_iterations_lt): Invoke above function.
13858 (adjust_cond_for_loop_until_wrap):
13859 Merge to number_of_iterations_until_wrap.
13860 (number_of_iterations_cond): Update invokes for
13861 adjust_cond_for_loop_until_wrap and number_of_iterations_lt.
13863 2021-08-25 konglin1 <lingling.kong@intel.com>
13866 * config/i386/avx512dqintrin.h (_mm512_fpclass_ps_mask): Fix
13867 macro define in O0.
13868 (_mm512_mask_fpclass_ps_mask): Ditto.
13870 2021-08-25 Kewen Lin <linkw@linux.ibm.com>
13872 * config/rs6000/altivec.md (vec_unpacku_hi_v16qi): Remove.
13873 (vec_unpacku_hi_v8hi): Likewise.
13874 (vec_unpacku_lo_v16qi): Likewise.
13875 (vec_unpacku_lo_v8hi): Likewise.
13876 (vec_unpacku_hi_<VP_small_lc>): New define_expand.
13877 (vec_unpacku_lo_<VP_small_lc>): Likewise.
13879 2021-08-24 David Edelsohn <dje.gcc@gmail.com>
13881 * config/rs6000/aix.h (SYSTEM_IMPLICIT_EXTERN_C): Delete.
13882 * config/rs6000/aix71.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
13883 * config/rs6000/aix72.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
13884 * config/rs6000/aix73.h (TARGET_AIX_VERSION): Increase to 73.
13886 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13888 PR middle-end/102031
13889 * simplify-rtx.c (simplify_truncation): When comparing precisions
13890 use "subreg_prec" variable, not "subreg_mode".
13892 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13894 * config/rs6000/rs6000-builtin-new.def: Add power10 and power10-64
13897 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13899 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Initialize
13900 various pointer type nodes.
13901 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add enum
13902 values for various pointer types.
13903 (ptr_V16QI_type_node): New macro.
13904 (ptr_V1TI_type_node): New macro.
13905 (ptr_V2DI_type_node): New macro.
13906 (ptr_V2DF_type_node): New macro.
13907 (ptr_V4SI_type_node): New macro.
13908 (ptr_V4SF_type_node): New macro.
13909 (ptr_V8HI_type_node): New macro.
13910 (ptr_unsigned_V16QI_type_node): New macro.
13911 (ptr_unsigned_V1TI_type_node): New macro.
13912 (ptr_unsigned_V8HI_type_node): New macro.
13913 (ptr_unsigned_V4SI_type_node): New macro.
13914 (ptr_unsigned_V2DI_type_node): New macro.
13915 (ptr_bool_V16QI_type_node): New macro.
13916 (ptr_bool_V8HI_type_node): New macro.
13917 (ptr_bool_V4SI_type_node): New macro.
13918 (ptr_bool_V2DI_type_node): New macro.
13919 (ptr_bool_V1TI_type_node): New macro.
13920 (ptr_pixel_type_node): New macro.
13921 (ptr_intQI_type_node): New macro.
13922 (ptr_uintQI_type_node): New macro.
13923 (ptr_intHI_type_node): New macro.
13924 (ptr_uintHI_type_node): New macro.
13925 (ptr_intSI_type_node): New macro.
13926 (ptr_uintSI_type_node): New macro.
13927 (ptr_intDI_type_node): New macro.
13928 (ptr_uintDI_type_node): New macro.
13929 (ptr_intTI_type_node): New macro.
13930 (ptr_uintTI_type_node): New macro.
13931 (ptr_long_integer_type_node): New macro.
13932 (ptr_long_unsigned_type_node): New macro.
13933 (ptr_float_type_node): New macro.
13934 (ptr_double_type_node): New macro.
13935 (ptr_long_double_type_node): New macro.
13936 (ptr_dfloat64_type_node): New macro.
13937 (ptr_dfloat128_type_node): New macro.
13938 (ptr_ieee128_type_node): New macro.
13939 (ptr_ibm128_type_node): New macro.
13940 (ptr_vector_pair_type_node): New macro.
13941 (ptr_vector_quad_type_node): New macro.
13942 (ptr_long_long_integer_type_node): New macro.
13943 (ptr_long_long_unsigned_type_node): New macro.
13945 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13947 * config/rs6000/rs6000-builtin-new.def: Add power9-vector, power9,
13948 and power9-64 stanzas.
13950 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13951 Tom de Vries <tdevries@suse.de>
13953 * config.gcc (nvptx-*-*): Define {c,c++}_target_objs.
13954 * config/nvptx/nvptx-protos.h (nvptx_cpu_cpp_builtins): Prototype.
13955 * config/nvptx/nvptx.h (TARGET_CPU_CPP_BUILTINS): Implement with
13956 a call to the new nvptx_cpu_cpp_builtins function in nvptx-c.c.
13957 * config/nvptx/t-nvptx (nvptx-c.o): New rule.
13958 * config/nvptx/nvptx-c.c: New source file.
13959 (nvptx_cpu_cpp_builtins): Move implementation here.
13961 2021-08-24 Martin Sebor <msebor@redhat.com>
13963 PR middle-end/101600
13964 PR middle-end/101977
13965 * gimple-ssa-warn-access.cc (maybe_warn_for_bound): Tighten up
13966 the phrasing of a warning.
13967 (check_access): Use the remaining size after subtracting any offset
13968 rather than the whole object size.
13969 * pointer-query.cc (access_ref::get_ref): Clear BASE0 flag if it's
13970 clear for any nonnull PHI argument.
13971 (compute_objsize): Clear argument.
13973 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13975 * config/rs6000/rs6000-builtin-new.def: Add power8-vector stanza.
13977 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13979 * config/rs6000/rs6000-builtin-new.def: Add power7 and power7-64
13982 2021-08-24 Andrew MacLeod <amacleod@redhat.com>
13984 * value-relation.cc (rr_transitive_table): New.
13985 (relation_transitive): New.
13986 (value_relation::swap): Remove.
13987 (value_relation::apply_transitive): New.
13988 (relation_oracle::relation_oracle): Allocate a new tmp bitmap.
13989 (relation_oracle::register_relation): Call register_transitives.
13990 (relation_oracle::register_transitives): New.
13991 * value-relation.h (relation_oracle): Add new temporary bitmap and
13994 2021-08-24 H.J. Lu <hjl.tools@gmail.com>
13997 * config/i386/i386-expand.c (ix86_expand_vector_move): Broadcast
13998 from integer to a pseudo vector register.
14000 2021-08-24 Richard Biener <rguenther@suse.de>
14002 PR tree-optimization/100089
14003 * tree-vectorizer.h (vect_slp_bb): Rename to ...
14004 (vect_slp_if_converted_bb): ... this and get the original
14005 loop as new argument.
14006 * tree-vectorizer.c (try_vectorize_loop_1): Revert previous fix,
14007 pass original loop to vect_slp_if_converted_bb.
14008 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
14009 If orig_loop was passed scan the not vectorized stmts
14010 for COND_EXPRs and force not profitable if found.
14011 (vect_slp_region): Pass down all SLP instances to costing
14012 if orig_loop was specified.
14013 (vect_slp_bbs): Pass through orig_loop.
14014 (vect_slp_bb): Rename to ...
14015 (vect_slp_if_converted_bb): ... this and get the original
14016 loop as new argument.
14017 (vect_slp_function): Adjust.
14019 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14022 * config/arm/arm.md (attribute arch): Add fix_vlldm.
14023 (arch_enabled): Use it.
14024 * config/arm/vfp.md (lazy_store_multiple_insn): Add alternative to
14025 use when erratum mitigation is needed.
14027 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14030 * config/arm/arm.opt (mfix-cmse-cve-2021-35465): New option.
14031 * doc/invoke.texi (Arm Options): Document it.
14032 * config/arm/arm-cpus.in (quirk_vlldm): New feature bit.
14033 (ALL_QUIRKS): Add quirk_vlldm.
14034 (cortex-m33): Add quirk_vlldm.
14035 (cortex-m35p, cortex-m55): Likewise.
14036 * config/arm/arm.c (arm_option_override): Enable fix_vlldm if
14037 targetting an affected CPU and not explicitly controlled on
14040 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
14042 * config/arm/vfp.md (lazy_store_multiple_insn): Rewrite as valid RTL.
14043 (lazy_load_multiple_insn): Likewise.
14045 2021-08-24 liuhongt <hongtao.liu@intel.com>
14048 * config/i386/sse.md (<avx512>_vternlog<mode><sd_maskz_name>):
14049 Enable avx512 embedded broadcast.
14050 (*<avx512>_vternlog<mode>_all): Ditto.
14051 (<avx512>_vternlog<mode>_mask): Ditto.
14053 2021-08-24 liuhongt <hongtao.liu@intel.com>
14056 * config/i386/i386.c (ix86_rtx_costs): Define cost for
14058 * config/i386/i386.h (STRIP_UNARY): New macro.
14059 * config/i386/predicates.md (reg_or_notreg_operand): New
14061 * config/i386/sse.md (*<avx512>_vternlog<mode>_all): New define_insn.
14062 (*<avx512>_vternlog<mode>_1): New pre_reload
14063 define_insn_and_split.
14064 (*<avx512>_vternlog<mode>_2): Ditto.
14065 (*<avx512>_vternlog<mode>_3): Ditto.
14066 (any_logic1,any_logic2): New code iterator.
14067 (logic_op): New code attribute.
14068 (ternlogsuffix): Extend to VNxDF and VNxSF.
14070 2021-08-24 Richard Biener <rguenther@suse.de>
14072 * doc/invoke.texi (vect-inner-loop-cost-factor): Adjust.
14073 * params.opt (--param vect-inner-loop-cost-factor): Adjust
14075 * tree-vect-loop.c (vect_analyze_loop_form): Initialize
14076 inner_loop_cost_factor to the minimum of the estimated number
14077 of iterations of the inner loop and vect-inner-loop-cost-factor.
14079 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14080 Richard Biener <rguenther@suse.de>
14082 * config/i386/i386-features.c (compute_convert_gain): Provide
14083 more accurate values for CONST_INT, when optimizing for size.
14084 * config/i386/i386.c (COSTS_N_BYTES): Move definition from here...
14085 * config/i386/i386.h (COSTS_N_BYTES): to here.
14087 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
14088 Jakub Jelinek <jakub@redhat.com>
14090 PR middle-end/102029
14091 * match.pd (shift transformations): Add an additional check for
14092 !POINTER_TYPE_P in the recently added left shift transformation.
14094 2021-08-24 liuhongt <hongtao.liu@intel.com>
14096 PR tree-optimization/100089
14097 * tree-vectorizer.c (try_vectorize_loop_1): Disable slp in
14098 loop vectorizer when cost model is very-cheap.
14100 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14102 * config/rs6000/rs6000-gen-builtins.c (parse_bif_entry): Don't call
14103 asprintf, which is not available on AIX.
14105 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14107 * config.gcc (target_gtfiles): Add ./rs6000-builtins.h.
14108 * config/rs6000/t-rs6000 (EXTRA_GTYPE_DEPS): Set.
14110 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14112 * config.gcc (powerpc*-*-*): Add rs6000-builtins.o to extra_objs.
14113 * config/rs6000/rs6000-gen-builtins.c (main): Close init_file
14115 * config/rs6000/t-rs6000 (rs6000-gen-builtins.o): New target.
14116 (rbtree.o): Likewise.
14117 (rs6000-gen-builtins): Likewise.
14118 (rs6000-builtins.c): Likewise.
14119 (rs6000-builtins.h): Likewise.
14120 (rs6000.o): Add dependency.
14121 (EXTRA_HEADERS): Add rs6000-vecdefines.h.
14122 (rs6000-vecdefines.h): New target.
14123 (rs6000-builtins.o): Likewise.
14124 (rs6000-call.o): Add rs6000-builtins.h as a dependency.
14125 (rs6000-c.o): Likewise.
14127 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
14130 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace):
14131 Diagnose buffer overrun.
14132 (safe_inc_pos): Fix overrun detection.
14133 (match_identifier): Diagnose buffer overrun.
14134 (match_integer): Likewise.
14135 (match_to_right_bracket): Likewise.
14137 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14139 * ipa-modref-tree.h (modref_access_node::range_info_useful_p):
14140 Improve range compare.
14141 (modref_access_node::contains): New member function.
14142 (modref_access_node::search): Remove.
14143 (modref_access_node::insert): Be smarter about subaccesses.
14145 2021-08-23 Thomas Schwinge <thomas@codesourcery.com>
14147 * config/i386/i386-options.c (ix86_omp_device_kind_arch_isa)
14148 <omp_device_arch> [ACCEL_COMPILER]: Match "intel_mic".
14149 * config/i386/t-omp-device (omp-device-properties-i386) <arch>:
14152 2021-08-23 Jeff Law <jlaw@localhost.localdomain>
14154 * config/h8300/h8300-protos.h (h8300_expand_epilogue): Add new
14156 * config/h8300/jumpcall.md (call, call_value): Restrict to
14157 !SIBLING_CALL_P cases.
14158 (subcall, sibcall_value): New patterns & expanders.
14159 * config/h8300/proepi.md (epilogue): Pass new argument to
14160 h8300_expand_epilogue.
14161 (sibcall_epilogue): New expander.
14162 * config/h8300/h8300.c (h8300_expand_epilogue): Handle sibcall
14164 (h8300_ok_for_sibcall_p): New function.
14165 (TARGET_FUNCTION_OK_FOR_SIBCALL): define.
14167 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14169 * simplify-rtx.c (simplify_unary_operation_1): [TRUNCATE]:
14170 Handle case where the operand is already the desired mode.
14172 2021-08-23 Richard Biener <rguenther@suse.de>
14175 * tree-ssa-structalias.c (ipa_pta_execute): Check in_other_partition
14176 in addition to has_gimple_body.
14178 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14180 PR middle-end/101949
14181 * ipa-modref.c (analyze_ssa_name_flags): Fix merging of
14184 2021-08-23 Martin Liska <mliska@suse.cz>
14186 * doc/invoke.texi: Put the option out of -mxl-mode-app-model
14189 2021-08-23 Richard Biener <rguenther@suse.de>
14191 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
14192 Properly scale the inner loop cost only once.
14194 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14196 * tree-ssa-ccp.c (bit_value_binop) [TRUNC_MOD_EXPR, TRUNC_DIV_EXPR]:
14197 Provide bounds for unsigned (and signed with non-negative operands)
14198 division and modulus.
14200 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14202 * simplify-rtx.c (simplify_truncation): Generalize simplification
14203 of (truncate:A (subreg:B X)).
14204 (simplify_unary_operation_1) [FLOAT_TRUNCATE, FLOAT_EXTEND,
14205 SIGN_EXTEND, ZERO_EXTEND]: Handle cases where the operand
14206 already has the desired machine mode.
14207 (test_scalar_int_ops): Add tests that useless extensions and
14208 truncations are optimized away.
14209 (test_scalar_int_ext_ops): New self-test function to confirm
14210 that truncations of extensions are correctly simplified.
14211 (test_scalar_int_ext_ops2): New self-test function to check
14212 truncations of truncations, extensions of extensions, and
14213 truncations of extensions.
14214 (test_scalar_ops): Call the above two functions with a
14215 representative sampling of integer machine modes.
14217 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14219 * match.pd (shift transformations): Change the sign of an
14220 LSHIFT_EXPR if it reduces the number of explicit conversions.
14222 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14224 PR tree-optimization/86723
14225 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Add
14226 cast64_to_32 argument, set *cast64_to_32 to false, unless n is
14227 non-memory permutation of 64-bit src which only has bytes of
14228 0 or [5..8] and n->range is 4.
14229 (find_bswap_or_nop): Add cast64_to_32 and mask arguments, adjust
14230 find_bswap_or_nop_finalize caller, support bswap with some bytes
14231 zeroed, as long as at least two bytes are not zeroed.
14232 (bswap_replace): Add mask argument and handle masking of bswap
14234 (maybe_optimize_vector_constructor): Adjust find_bswap_or_nop
14235 caller, punt if cast64_to_32 or mask is not all ones.
14236 (pass_optimize_bswap::execute): Adjust find_bswap_or_nop_finalize
14237 caller, for now punt if cast64_to_32.
14239 2021-08-23 Richard Biener <rguenther@suse.de>
14241 PR tree-optimization/79334
14242 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Record
14243 a type also for COMPONENT_REFs.
14244 (vn_reference_may_trap): Check ARRAY_REF with constant index
14245 against the array domain.
14247 2021-08-23 liuhongt <hongtao.liu@intel.com>
14250 * config/i386/sse.md (*avx512f_pshufb_truncv8hiv8qi_1): Add
14251 TARGET_AVX512BW to condition.
14253 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14256 * dwarf2out.c (gen_variable_die): Add DW_AT_location for global
14257 register variables already during early_dwarf if possible.
14259 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
14261 * config/arm/arm_mve.h: Fix __arm_vctp16q return type.
14263 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
14266 * config/arm/arm.opt: Fix typo.
14267 * config/arm/t-rmprofile: Fix typo.
14269 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14271 * tree.h (OMP_CLAUSE_GRAINSIZE_STRICT): Define.
14272 (OMP_CLAUSE_NUM_TASKS_STRICT): Define.
14273 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_GRAINSIZE,
14274 case OMP_CLAUSE_NUM_TASKS>: Print strict: modifier.
14275 * omp-expand.c (expand_task_call): Use GOMP_TASK_FLAG_STRICT in iflags
14276 if either grainsize or num_tasks clause has the strict modifier.
14278 2021-08-23 Martin Liska <mliska@suse.cz>
14280 * dbgcnt.def (DEBUG_COUNTER): New counter.
14281 * gimple.c (gimple_call_arg_flags): Use it in IPA PTA.
14283 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14285 * ipa-modref.c (analyze_ssa_name_flags): Improve handling of return slot.
14287 2021-08-23 Xi Ruoyao <xry111@mengyan1223.wang>
14290 * config/mips/mips-protos.h (mips_msa_output_shift_immediate):
14292 * config/mips/mips.c (mips_msa_output_shift_immediate): New
14294 * config/mips/mips-msa.md (vashl<mode>3, vashr<mode>3,
14295 vlshr<mode>3): Call it.
14297 2021-08-22 Jan Hubicka <hubicka@ucw.cz>
14298 Martin Liska <mliska@suse.cz>
14300 PR middle-end/101949
14301 * ipa-modref.c (analyze_ssa_name_flags): Indirect call implies
14304 2021-08-21 Dragan Mladjenovic <OT_Dragan.Mladjenovic@mediatek.com>
14306 * config/mips/mips.c (mips_function_rodata_section,
14307 TARGET_ASM_FUNCTION_RODATA_SECTION): Removed.
14309 2021-08-21 John David Anglin <danglin@gcc.gnu.org>
14311 * config/pa/pa.c (pa_asm_output_aligned_common): Remove warning.
14313 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14315 * configure.ac (thread-local storage support): Remove tls_first_major
14316 and tls_first_minor. Use "$conftest_s" to check support.
14317 * configure: Regenerate.
14319 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14321 * configure.ac: Fixup formatting.
14323 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14325 * acinclude.m4 (gcc_GAS_CHECK_FEATURE): Remove third argument and ...
14326 * configure.ac: ... update all callers.
14328 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14331 * acinclude.m4 (_gcc_COMPUTE_GAS_VERSION, _gcc_GAS_VERSION_GTE_IFELSE)
14332 (gcc_GAS_VERSION_GTE_IFELSE): Remove.
14333 (gcc_GAS_CHECK_FEATURE): Do not handle in-tree case specially.
14334 * configure.ac: Remove gcc_cv_gas_major_version, gcc_cv_gas_minor_version.
14335 Remove remaining checks for in-tree assembler.
14336 * configure: Regenerate.
14338 2021-08-20 Jeff Law <jlaw@localhost.localdomain>
14340 * config/h8300/h8300.c (shift_alg_hi): Improve arithmetic shift right
14341 by 15 bits for H8/300H and H8/S. Improve logical shifts by 12
14343 (shift_alg_si): Improve arithmetic right shift by 28-30 bits for
14344 H8/300H. Improve arithmetic shift right by 15 bits for H8/S.
14345 Improve logical shifts by 27 bits for H8/S.
14346 (get_shift_alg): Corresponding changes.
14347 (h8300_option_override): Revert to loops for -Os when profitable.
14349 2021-08-20 Richard Biener <rguenther@suse.de>
14351 * tree-vect-data-refs.c (dr_group_sort_cmp): Do not compare
14353 (vect_analyze_data_ref_accesses): Likewise. Assign the BB
14354 index as group_id when dataref_groups were not computed.
14355 * tree-vect-slp.c (vect_slp_bbs): Bump current_group when
14356 we advace to the next BB.
14358 2021-08-20 Jakub Jelinek <jakub@redhat.com>
14360 * omp-builtins.def (BUILT_IN_GOMP_WARNING, BUILT_IN_GOMP_ERROR): New
14363 2021-08-20 Martin Liska <mliska@suse.cz>
14365 PR gcov-profile/89961
14366 * gcov.c (make_gcov_file_name): Rewrite using std::string.
14367 (mangle_name): Simplify, do not used the second argument.
14368 (strip_extention): New function.
14369 (get_md5sum): Likewise.
14370 (get_gcov_intermediate_filename): Handle properly -p and -x
14372 (output_gcov_file): Use string type.
14373 (generate_results): Likewise.
14374 (md5sum_to_hex): Remove.
14376 2021-08-20 Michael Meissner <meissner@linux.ibm.com>
14378 * config/rs6000/altivec.md (UNSPEC_XXEVAL): Move to vsx.md.
14379 (UNSPEC_XXSPLTIW): Move to vsx.md.
14380 (UNSPEC_XXSPLTID): Move to vsx.md.
14381 (UNSPEC_XXSPLTI32DX): Move to vsx.md.
14382 (UNSPEC_XXBLEND): Move to vsx.md.
14383 (UNSPEC_XXPERMX): Move to vsx.md.
14384 (VM3): Move to vsx.md.
14385 (VM3_char): Move to vsx.md.
14386 (xxspltiw_v4si): Move to vsx.md.
14387 (xxspltiw_v4sf): Move to vsx.md.
14388 (xxspltiw_v4sf_inst): Move to vsx.md.
14389 (xxspltidp_v2df): Move to vsx.md.
14390 (xxspltidp_v2df_inst): Move to vsx.md.
14391 (xxsplti32dx_v4si_inst): Move to vsx.md.
14392 (xxsplti32dx_v4sf): Move to vsx.md.
14393 (xxsplti32dx_v4sf_inst): Move to vsx.md.
14394 (xxblend_<mode>): Move to vsx.md.
14395 (xxpermx): Move to vsx.md.
14396 (xxpermx_inst): Move to vsx.md.
14397 * config/rs6000/vsx.md (UNSPEC_XXEVAL): Move from altivec.md.
14398 (UNSPEC_XXSPLTIW): Move from altivec.md.
14399 (UNSPEC_XXSPLTID): Move from altivec.md.
14400 (UNSPEC_XXSPLTI32DX): Move from altivec.md.
14401 (UNSPEC_XXBLEND): Move from altivec.md.
14402 (UNSPEC_XXPERMX): Move from altivec.md.
14403 (VM3): Move from altivec.md.
14404 (VM3_char): Move from altivec.md.
14405 (xxspltiw_v4si): Move from altivec.md.
14406 (xxspltiw_v4sf): Move from altivec.md.
14407 (xxspltiw_v4sf_inst): Move from altivec.md.
14408 (xxspltidp_v2df): Move from altivec.md.
14409 (xxspltidp_v2df_inst): Move from altivec.md.
14410 (xxsplti32dx_v4si_inst): Move from altivec.md.
14411 (xxsplti32dx_v4sf): Move from altivec.md.
14412 (xxsplti32dx_v4sf_inst): Move from altivec.md.
14413 (xxblend_<mode>): Move from altivec.md.
14414 (xxpermx): Move from altivec.md.
14415 (xxpermx_inst): Move from altivec.md.
14417 2021-08-19 Roger Sayle <roger@nextmovesoftware.com>
14419 * tree-vect-generic.c (expand_vector_operations_1): Use either
14420 gimplify_build1 or gimplify_build2 instead of gimple_build_assign
14421 when constructing scalar splat expressions.
14423 2021-08-19 Peter Bergner <bergner@linux.ibm.com>
14426 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Cast
14427 pointer to __vector_pair *.
14429 2021-08-19 Martin Sebor <msebor@redhat.com>
14431 * gimple-range.cc: Add comments.
14432 * gimple-range.h: Same.
14434 2021-08-19 Martin Sebor <msebor@redhat.com>
14436 PR middle-end/101984
14437 * gimple-ssa-warn-access.cc (pass_waccess::execute): Also call
14440 2021-08-19 Jeff Law <jlaw@localhost.localdomain>
14442 * config.gcc (h8300-*-elf*): Do not include dbxelf.h.
14443 (h8300-*-linux*, v850-*-rtems*, v850*-elf*): Likewise.
14444 * config/v850/v850.h (DEFAULT_GDB_EXTENSIONS): Remove.
14446 2021-08-19 Jakub Jelinek <jakub@redhat.com>
14448 PR middle-end/101950
14449 * optabs.c (expand_clrsb_using_clz): New function.
14450 (expand_unop): Use it as another clrsb expansion fallback.
14452 2021-08-19 liuhongt <hongtao.liu@intel.com>
14455 2021-07-28 liuhongt <hongtao.liu@intel.com>
14458 * config/i386/i386.h (processor_costs): Add new member
14460 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
14461 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
14462 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
14463 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
14464 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
14465 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
14466 generic_cost, core_cost): Initialize integer_to_sse same value
14468 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
14469 * config/i386/i386.c (ix86_builtin_vectorization_cost):
14470 Use integer_to_sse instead of sse_op to calculate the cost of
14473 2021-08-18 Iain Sandoe <iain@sandoe.co.uk>
14475 * config.gcc: Include rpath.opt for Darwin.
14476 * config/darwin.h (DRIVER_SELF_SPECS): Handle -rpath.
14478 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14480 PR bootstrap/101959
14481 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
14484 2021-08-18 Jonathan Wright <jonathan.wright@arm.com>
14486 * config/aarch64/arm_neon.h (vld3_lane_f64): Use float RTL
14487 pattern and type cast.
14488 (vld4_lane_f32): Use float RTL pattern.
14489 (vld4q_lane_f64): Use float type cast.
14491 2021-08-18 Jan Hubicka <hubicka@ucw.cz>
14493 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Check also
14496 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14498 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor): Extend.
14499 (test_map_of_type_with_ctor_and_dtor_expand): Add function.
14500 (hash_map_tests_c_tests): Call it.
14502 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14504 * ggc.h (enum ggc_collect): New.
14505 (ggc_collect): Use it.
14506 * ggc-page.c: Adjust.
14507 * ggc-common.c: Likewise.
14508 * ggc-tests.c: Likewise.
14509 * read-rtl-function.c: Likewise.
14510 * selftest-run-tests.c: Likewise.
14511 * doc/gty.texi (Invoking the garbage collector): Likewise.
14513 2021-08-18 liuhongt <hongtao.liu@intel.com>
14516 * config/i386/i386.h (TARGET_V2DF_REDUCTION_PREFER_HADDPD):
14518 * config/i386/sse.md (*sse3_haddv2df3_low): Add
14519 TARGET_V2DF_REDUCTION_PREFER_HADDPD.
14520 (*sse3_hsubv2df3_low): Ditto.
14521 * config/i386/x86-tune.def
14522 (X86_TUNE_V2DF_REDUCTION_PREFER_HADDPD): New tune.
14524 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14526 * gimple-range-gori.cc (gori_compute::gori_compute): Enable tracing.
14527 (gori_compute::compute_operand_range): Add tracing.
14528 (gori_compute::logical_combine): Ditto.
14529 (gori_compute::compute_logical_operands): Ditto.
14530 (gori_compute::compute_operand1_range): Ditto.
14531 (gori_compute::compute_operand2_range): Ditto.
14532 (gori_compute::outgoing_edge_range_p): Ditto.
14533 * gimple-range-gori.h (class gori_compute): Add range_tracer.
14535 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14537 * flag-types.h (enum evrp_mode): Adjust evrp-mode values.
14538 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Relocate from.
14539 * gimple-range-trace.h (DEBUG_RANGE_CACHE): Here.
14540 * params.opt (--param=evrp-mode): Adjust options.
14542 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14544 * Makefile.in (OBJS): Add gimple-range-trace.o.
14545 * gimple-range-cache.h (enable_new_values): Remove unused prototype.
14546 * gimple-range-fold.cc: Adjust headers.
14547 * gimple-range-trace.cc: New.
14548 * gimple-range-trace.h: New.
14549 * gimple-range.cc (gimple_ranger::gimple_ranger): Enable tracer.
14550 (gimple_ranger::range_of_expr): Add tracing.
14551 (gimple_ranger::range_on_entry): Ditto.
14552 (gimple_ranger::range_on_exit): Ditto.
14553 (gimple_ranger::range_on_edge): Ditto.
14554 (gimple_ranger::fold_range_internal): Ditto.
14555 (gimple_ranger::dump_bb): Do not calculate edge range twice.
14556 (trace_ranger::*): Remove.
14557 (enable_ranger): Never create a trace_ranger.
14558 (debug_seed_ranger): Move to gimple-range-trace.cc.
14559 (dump_ranger): Ditto.
14560 (debug_ranger): Ditto.
14561 * gimple-range.h: Include gimple-range-trace.h.
14562 (range_on_entry, range_on_exit): No longer virtual.
14563 (class trace_ranger): Remove.
14564 (DEBUG_RANGE_CACHE): Move to gimple-range-trace.h.
14566 2021-08-17 Martin Sebor <msebor@redhat.com>
14568 PR middle-end/101854
14569 * builtins.c (expand_builtin_alloca): Move warning code to check_alloca
14570 in gimple-ssa-warn-access.cc.
14571 * calls.c (alloc_max_size): Move code to check_alloca.
14572 (get_size_range): Move to pointer-query.cc.
14573 (maybe_warn_alloc_args_overflow): Move to gimple-ssa-warn-access.cc.
14574 (get_attr_nonstring_decl): Move to tree.c.
14575 (fntype_argno_type): Move to gimple-ssa-warn-access.cc.
14576 (append_attrname): Same.
14577 (maybe_warn_rdwr_sizes): Same.
14578 (initialize_argument_information): Move code to
14579 gimple-ssa-warn-access.cc.
14580 * calls.h (maybe_warn_alloc_args_overflow): Move to
14581 gimple-ssa-warn-access.h.
14582 (get_attr_nonstring_decl): Move to tree.h.
14583 (maybe_warn_nonstring_arg): Move to gimple-ssa-warn-access.h.
14584 (enum size_range_flags): Move to pointer-query.h.
14585 (get_size_range): Same.
14586 * gimple-ssa-warn-access.cc (has_location): Remove unused overload
14587 to avoid Clang -Wunused-function.
14588 (get_size_range): Declare static.
14589 (maybe_emit_free_warning): Rename...
14590 (maybe_check_dealloc_call): ...to this for consistency.
14591 (class pass_waccess): Add members.
14592 (pass_waccess::~pass_waccess): Defined.
14593 (alloc_max_size): Move here from calls.c.
14594 (maybe_warn_alloc_args_overflow): Same.
14595 (check_alloca): New function.
14596 (check_alloc_size_call): New function.
14597 (check_strncat): Handle another warning flag.
14598 (pass_waccess::check_builtin): Handle alloca.
14599 (fntype_argno_type): Move here from calls.c.
14600 (append_attrname): Same.
14601 (maybe_warn_rdwr_sizes): Same.
14602 (pass_waccess::check_call): Define.
14603 (check_nonstring_args): New function.
14604 (pass_waccess::check): Call new member functions.
14605 (pass_waccess::execute): Enable ranger.
14606 * gimple-ssa-warn-access.h (get_size_range): Move here from calls.h.
14607 (maybe_warn_nonstring_arg): Same.
14608 * gimple-ssa-warn-restrict.c: Remove #include.
14609 * pointer-query.cc (get_size_range): Move here from calls.c.
14610 * pointer-query.h (enum size_range_flags): Same.
14611 (get_size_range): Same.
14612 * tree.c (get_attr_nonstring_decl): Move here from calls.c.
14613 * tree.h (get_attr_nonstring_decl): Move here from calls.h.
14615 2021-08-17 Thomas Schwinge <thomas@codesourcery.com>
14617 * ggc.h (ggc_collect): Add 'force_collect' parameter.
14618 * ggc-page.c (ggc_collect): Use that one instead of global
14619 'ggc_force_collect'. Adjust all users.
14620 * doc/gty.texi (Invoking the garbage collector): Update.
14621 * ggc-internal.h (ggc_force_collect): Remove.
14622 * ggc-common.c (ggc_force_collect): Likewise.
14623 * selftest.h (forcibly_ggc_collect): Remove.
14624 * ggc-tests.c (selftest::forcibly_ggc_collect): Likewise.
14625 * read-rtl-function.c (test_loading_labels): Adjust.
14626 * selftest-run-tests.c (run_tests): Likewise.
14628 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
14630 * config/darwin.c (darwin_file_end): Reset and reclaim the
14631 section names table at the end of compile.
14633 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
14636 * config.in: Regenerate.
14637 * config/i386/darwin.h (EXTRA_ASM_OPTS): New
14638 (ASM_SPEC): Pass options to disable branch shortening where
14640 * configure: Regenerate.
14641 * configure.ac: Detect versions of 'as' that support the
14642 optimisation which has the bug.
14644 2021-08-17 Richard Biener <rguenther@suse.de>
14646 * optabs-query.c (supports_vec_gather_load_p): Also check
14648 (supports_vec_scatter_store_p): Likewise.
14649 * tree-vect-data-refs.c (vect_gather_scatter_fn_p): Fall
14650 back to masked variants if non-masked are not supported.
14651 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
14652 When we need to use masked gather/scatter but do not have
14653 a mask set up a constant true one.
14654 * tree-vect-stmts.c (vect_check_scalar_mask): Also allow
14655 non-SSA_NAME masks.
14657 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
14659 * tree-ssa-ccp.c (bit_value_binop) [MINUS_EXPR]: Use same
14660 algorithm as PLUS_EXPR to improve subtraction bit bounds.
14661 [POINTER_DIFF_EXPR]: Treat as synonymous with MINUS_EXPR.
14663 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
14665 * tree-ssa-ccp.c (bit_value_mult_const): New helper function to
14666 calculate the mask-value pair result of a multiplication by an
14668 (bit_value_binop) [MULT_EXPR]: Call it from here for
14669 multiplications by (sparse) non-negative constants.
14671 2021-08-17 Christophe Lyon <christophe.lyon@foss.st.com>
14674 * config.gcc (gcc_cv_initfini_array): Leave undefined for
14675 uclinuxfdpiceabi targets.
14677 2021-08-17 Alexandre Oliva <oliva@adacore.com>
14679 * tree-inline.c (maybe_move_debug_stmts_to_successors): Don't
14680 reverse debug stmts.
14682 2021-08-17 Alexandre Oliva <oliva@adacore.com>
14684 * tree-cfg.c (dump_function_to_file): Use fun, not cfun.
14686 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14688 * config/aarch64/arm_neon.h (__LD4_LANE_FUNC): Delete.
14689 (__LD4Q_LANE_FUNC): Likewise.
14690 (vld4_lane_u8): Define without macro.
14691 (vld4_lane_u16): Likewise.
14692 (vld4_lane_u32): Likewise.
14693 (vld4_lane_u64): Likewise.
14694 (vld4_lane_s8): Likewise.
14695 (vld4_lane_s16): Likewise.
14696 (vld4_lane_s32): Likewise.
14697 (vld4_lane_s64): Likewise.
14698 (vld4_lane_f16): Likewise.
14699 (vld4_lane_f32): Likewise.
14700 (vld4_lane_f64): Likewise.
14701 (vld4_lane_p8): Likewise.
14702 (vld4_lane_p16): Likewise.
14703 (vld4_lane_p64): Likewise.
14704 (vld4q_lane_u8): Likewise.
14705 (vld4q_lane_u16): Likewise.
14706 (vld4q_lane_u32): Likewise.
14707 (vld4q_lane_u64): Likewise.
14708 (vld4q_lane_s8): Likewise.
14709 (vld4q_lane_s16): Likewise.
14710 (vld4q_lane_s32): Likewise.
14711 (vld4q_lane_s64): Likewise.
14712 (vld4q_lane_f16): Likewise.
14713 (vld4q_lane_f32): Likewise.
14714 (vld4q_lane_f64): Likewise.
14715 (vld4q_lane_p8): Likewise.
14716 (vld4q_lane_p16): Likewise.
14717 (vld4q_lane_p64): Likewise.
14718 (vld4_lane_bf16): Likewise.
14719 (vld4q_lane_bf16): Likewise.
14721 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14723 * config/aarch64/arm_neon.h (__LD3_LANE_FUNC): Delete.
14724 (__LD3Q_LANE_FUNC): Delete.
14725 (vld3_lane_u8): Define without macro.
14726 (vld3_lane_u16): Likewise.
14727 (vld3_lane_u32): Likewise.
14728 (vld3_lane_u64): Likewise.
14729 (vld3_lane_s8): Likewise.
14730 (vld3_lane_s16): Likewise.
14731 (vld3_lane_s32): Likewise.
14732 (vld3_lane_s64): Likewise.
14733 (vld3_lane_f16): Likewise.
14734 (vld3_lane_f32): Likewise.
14735 (vld3_lane_f64): Likewise.
14736 (vld3_lane_p8): Likewise.
14737 (vld3_lane_p16): Likewise.
14738 (vld3_lane_p64): Likewise.
14739 (vld3q_lane_u8): Likewise.
14740 (vld3q_lane_u16): Likewise.
14741 (vld3q_lane_u32): Likewise.
14742 (vld3q_lane_u64): Likewise.
14743 (vld3q_lane_s8): Likewise.
14744 (vld3q_lane_s16): Likewise.
14745 (vld3q_lane_s32): Likewise.
14746 (vld3q_lane_s64): Likewise.
14747 (vld3q_lane_f16): Likewise.
14748 (vld3q_lane_f32): Likewise.
14749 (vld3q_lane_f64): Likewise.
14750 (vld3q_lane_p8): Likewise.
14751 (vld3q_lane_p16): Likewise.
14752 (vld3q_lane_p64): Likewise.
14753 (vld3_lane_bf16): Likewise.
14754 (vld3q_lane_bf16): Likewise.
14756 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14758 * config/aarch64/arm_neon.h (__LD2_LANE_FUNC): Delete.
14759 (__LD2Q_LANE_FUNC): Likewise.
14760 (vld2_lane_u8): Define without macro.
14761 (vld2_lane_u16): Likewise.
14762 (vld2_lane_u32): Likewise.
14763 (vld2_lane_u64): Likewise.
14764 (vld2_lane_s8): Likewise.
14765 (vld2_lane_s16): Likewise.
14766 (vld2_lane_s32): Likewise.
14767 (vld2_lane_s64): Likewise.
14768 (vld2_lane_f16): Likewise.
14769 (vld2_lane_f32): Likewise.
14770 (vld2_lane_f64): Likewise.
14771 (vld2_lane_p8): Likewise.
14772 (vld2_lane_p16): Likewise.
14773 (vld2_lane_p64): Likewise.
14774 (vld2q_lane_u8): Likewise.
14775 (vld2q_lane_u16): Likewise.
14776 (vld2q_lane_u32): Likewise.
14777 (vld2q_lane_u64): Likewise.
14778 (vld2q_lane_s8): Likewise.
14779 (vld2q_lane_s16): Likewise.
14780 (vld2q_lane_s32): Likewise.
14781 (vld2q_lane_s64): Likewise.
14782 (vld2q_lane_f16): Likewise.
14783 (vld2q_lane_f32): Likewise.
14784 (vld2q_lane_f64): Likewise.
14785 (vld2q_lane_p8): Likewise.
14786 (vld2q_lane_p16): Likewise.
14787 (vld2q_lane_p64): Likewise.
14788 (vld2_lane_bf16): Likewise.
14789 (vld2q_lane_bf16): Likewise.
14791 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14793 * haifa-sched.c (advance_one_cycle): Output more context-synchronization
14796 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14798 * haifa-sched.c (enum rfs_decision, rfs_str): Add RFS_AUTOPREF.
14799 (rank_for_schedule): Use it.
14801 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14803 PR rtl-optimization/91598
14804 * haifa-sched.c (autopref_rank_for_schedule): Prioritize "irrelevant"
14805 insns after memory reads and before memory writes.
14807 2021-08-17 Alistair_Lee <alistair.lee@arm.com>
14809 * rtl.h (CONST_VECTOR_P): New macro.
14810 * config/aarch64/aarch64.c (aarch64_get_sve_pred_bits): Use RTL
14811 code testing macros.
14812 (aarch64_ptrue_all_mode): Likewise.
14813 (aarch64_expand_mov_immediate): Likewise.
14814 (aarch64_const_vec_all_in_range_p): Likewise.
14815 (aarch64_rtx_costs): Likewise.
14816 (aarch64_legitimate_constant_p): Likewise.
14817 (aarch64_simd_valid_immediate): Likewise.
14818 (aarch64_simd_make_constant): Likewise.
14819 (aarch64_convert_mult_to_shift): Likewise.
14820 (aarch64_expand_sve_vec_perm): Likewise.
14821 (aarch64_vec_fpconst_pow_of_2): Likewise.
14823 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14825 PR tree-optimization/101938
14826 * range-op.cc (operator_abs::op1_range): Special case
14827 -TYPE_MIN_VALUE for flag_wrapv.
14829 2021-08-17 Kewen Lin <linkw@linux.ibm.com>
14831 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Add the cost for
14834 2021-08-17 Jakub Jelinek <jakub@redhat.com>
14836 * tree.def (OMP_SCOPE): New tree code.
14837 * tree.h (OMP_SCOPE_BODY, OMP_SCOPE_CLAUSES): Define.
14838 * tree-nested.c (convert_nonlocal_reference_stmt,
14839 convert_local_reference_stmt, convert_gimple_call): Handle
14841 * tree-pretty-print.c (dump_generic_node): Handle OMP_SCOPE.
14842 * gimple.def (GIMPLE_OMP_SCOPE): New gimple code.
14843 * gimple.c (gimple_build_omp_scope): New function.
14844 (gimple_copy): Handle GIMPLE_OMP_SCOPE.
14845 * gimple.h (gimple_build_omp_scope): Declare.
14846 (gimple_has_substatements): Handle GIMPLE_OMP_SCOPE.
14847 (gimple_omp_scope_clauses, gimple_omp_scope_clauses_ptr,
14848 gimple_omp_scope_set_clauses): New inline functions.
14849 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_SCOPE.
14850 * gimple-pretty-print.c (dump_gimple_omp_scope): New function.
14851 (pp_gimple_stmt_1): Handle GIMPLE_OMP_SCOPE.
14852 * gimple-walk.c (walk_gimple_stmt): Likewise.
14853 * gimple-low.c (lower_stmt): Likewise.
14854 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
14855 (gimplify_scan_omp_clauses): For task reductions, handle OMP_SCOPE
14856 like ORT_WORKSHARE constructs. Adjust diagnostics for %<scope%>
14857 allowing task reductions. Reject inscan reductions on scope.
14858 (omp_find_stores_stmt): Handle GIMPLE_OMP_SCOPE.
14859 (gimplify_omp_workshare, gimplify_expr): Handle OMP_SCOPE.
14860 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_SCOPE.
14861 (estimate_num_insns): Likewise.
14862 * omp-low.c (build_outer_var_ref): Look through GIMPLE_OMP_SCOPE
14863 contexts if var isn't privatized there.
14864 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_SCOPE.
14865 (scan_omp_1_stmt): Likewise.
14866 (maybe_add_implicit_barrier_cancel): Look through outer
14868 (lower_omp_scope): New function.
14869 (lower_omp_task_reductions): Handle OMP_SCOPE.
14870 (lower_omp_1): Handle GIMPLE_OMP_SCOPE.
14871 (diagnose_sb_1, diagnose_sb_2): Likewise.
14872 * omp-expand.c (expand_omp_single): Support also GIMPLE_OMP_SCOPE.
14873 (expand_omp): Handle GIMPLE_OMP_SCOPE.
14874 (omp_make_gimple_edges): Likewise.
14875 * omp-builtins.def (BUILT_IN_GOMP_SCOPE_START): New built-in.
14877 2021-08-17 Richard Biener <rguenther@suse.de>
14879 PR tree-optimization/101925
14880 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Set
14881 reverse on COMPONENT_REF and ARRAY_REF according to
14882 what reverse_storage_order_for_component_p does.
14883 (vn_reference_eq): Compare reversed on reference ops.
14884 (reverse_storage_order_for_component_p): New overload.
14885 (vn_reference_lookup_3): Check reverse_storage_order_for_component_p
14886 on the reference looked up.
14888 2021-08-17 Jeff Law <jlaw@localhost.localdomain>
14890 * config/h8300/h8300.c (shift_alg_si): Avoid loops for most SImode
14891 shifts on the H8/S.
14892 (h8300_option_override): Use loops on H8/S more often when optimizing
14894 (get_shift_alg): Handle new "special" cases on H8/S. Simplify
14895 accordingly. Handle various arithmetic right shifts with special
14896 sequences that we couldn't handle before.
14898 2021-08-16 Jeff Law <jlaw@localhost.localdomain>
14900 * config.gcc (rl78-*-elf*): Do not include dbxelf.h.
14902 2021-08-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
14904 * config/sparc/rtemself.h (SPARC_GCOV_TYPE_SIZE): Define.
14905 * config/sparc/sparc.c (sparc_gcov_type_size): New.
14906 (TARGET_GCOV_TYPE_SIZE): Redefine if SPARC_GCOV_TYPE_SIZE is defined.
14907 * coverage.c (get_gcov_type): Use targetm.gcov_type_size().
14908 * doc/tm.texi (TARGET_GCOV_TYPE_SIZE): Add hook under "Misc".
14909 * doc/tm.texi.in: Regenerate.
14910 * target.def (gcov_type_size): New target hook.
14911 * targhooks.c (default_gcov_type_size): New.
14912 * targhooks.h (default_gcov_type_size): Declare.
14913 * tree-profile.c (gimple_gen_edge_profiler): Use precision of
14915 (gimple_gen_time_profiler): Likewise.
14917 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
14919 * dwarf2out.c (add_scalar_info): Deal with DW_AT_data_bit_offset.
14921 2021-08-16 Tobias Burnus <tobias@codesourcery.com>
14923 PR middle-end/101931
14924 * omp-low.c (omp_runtime_api_call): Update for routines
14925 added in the meanwhile.
14927 2021-08-16 Martin Liska <mliska@suse.cz>
14929 PR tree-optimization/100393
14930 * tree-switch-conversion.c (group_cluster::dump): Use
14931 get_comparison_count.
14932 (jump_table_cluster::find_jump_tables): Pre-compute number of
14933 comparisons and then decrement it. Cache also max_ratio.
14934 (jump_table_cluster::can_be_handled): Change signature.
14935 * tree-switch-conversion.h (get_comparison_count): New.
14937 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
14939 * dwarf2out.c (add_data_member_location_attribute): Use GNAT
14940 encodings only when -fgnat-encodings=all is specified.
14941 (add_bound_info): Likewise.
14942 (add_byte_size_attribute): Likewise.
14943 (gen_member_die): Likewise.
14945 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14947 * omp-oacc-neuter-broadcast.cc
14948 (execute_omp_oacc_neuter_broadcast): Plug 'par' memory leak.
14950 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14952 * omp-oacc-neuter-broadcast.cc
14953 (execute_omp_oacc_neuter_broadcast): Clarify memory management for
14956 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14958 * omp-oacc-neuter-broadcast.cc (field_map): Move variable into...
14959 (execute_omp_oacc_neuter_broadcast): ... here.
14960 (install_var_field, build_receiver_ref, build_sender_ref): Take
14961 'field_map_t *' parameter. Adjust all users.
14962 (worker_single_copy, neuter_worker_single): Take a
14963 'record_field_map_t *' parameter. Adjust all users.
14965 2021-08-16 liuhongt <hongtao.liu@intel.com>
14968 * config/i386/i386.md (ldexp<mode>3): Force operands[1] to
14971 2021-08-16 Martin Liska <mliska@suse.cz>
14974 * multiple_target.c (create_dispatcher_calls): Make default
14975 function local only if it is a definition.
14977 2021-08-16 Martin Liska <mliska@suse.cz>
14980 * ipa-icf-gimple.c (func_checker::compare_ssa_name): Do not
14981 consider equal SSA_NAMEs when one is a param.
14983 2021-08-16 liuhongt <hongtao.liu@intel.com>
14986 * config/i386/i386-expand.c (ix86_expand_vec_perm_vpermt2):
14987 Support vpermi2b for V32QI/V16QImode.
14988 (ix86_extract_perm_from_pool_constant): New function.
14989 (ix86_expand_vec_one_operand_perm_avx512): Support
14990 vpermw/vpermb under TARGET_AVX512BW/TARGET_AVX512VBMI.
14991 (expand_vec_perm_1): Adjust comments for upper.
14992 * config/i386/i386-protos.h (ix86_extract_perm_from_pool_constant):
14994 * config/i386/predicates.md (permvar_truncate_operand): New predicate.
14995 (pshufb_truncv4siv4hi_operand): Ditto.
14996 (pshufb_truncv8hiv8qi_operand): Ditto.
14997 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1):
14998 New pre_reload define_insn_and_split.
14999 (*avx512f_permvar_truncv8siv8hi_1): Ditto.
15000 (*avx512f_vpermvar_truncv8div8si_1): Ditto.
15001 (*avx512f_permvar_truncv32hiv32qi_1): Ditto.
15002 (*avx512f_permvar_truncv16hiv16qi_1): Ditto.
15003 (*avx512f_permvar_truncv4div4si_1): Ditto.
15004 (*avx512f_pshufb_truncv8hiv8qi_1): Ditto.
15005 (*avx512f_pshufb_truncv4siv4hi_1): Ditto.
15006 (*avx512f_pshufd_truncv2div2si_1): Ditto.
15008 2021-08-16 Kito Cheng <kito.cheng@sifive.com>
15010 * config/riscv/multilib-generator: Support code model option for
15012 * doc/install.texi: Add document of new option for
15013 --with-multilib-generator.
15015 2021-08-15 Clément Chigot <clement.chigot@atos.net>
15017 * config/rs6000/rs6000.c (xcoff_tls_exec_model_detected): New.
15018 (rs6000_legitimize_tls_address_aix): Use it.
15019 (rs6000_xcoff_file_end): Add ".ref __tls_get_addr" when
15020 xcoff_tls_exec_model_detected is true.
15022 2021-08-15 Jeff Law <jlaw@localhost.localdomain>
15024 * config/h8300/h8300.c (shift_alg_si): Retune H8/300H shifts
15025 to allow a bit more code growth, saving many dozens of cycles.
15026 (h8300_option_override): Adjus shift_alg_si if optimizing for
15028 (get_shift_alg): Use special + inline shifts for residuals
15031 2021-08-14 Stafford Horne <shorne@gmail.com>
15034 * config/or1k/or1k-opts.h: New file.
15035 * config/or1k/or1k.c (or1k_legitimize_address_1, print_reloc):
15036 Support generating gotha relocations if -mcmodel=large is
15038 * config/or1k/or1k.h (TARGET_CMODEL_SMALL, TARGET_CMODEL_LARGE):
15040 * config/or1k/or1k.opt (mcmodel=): New option.
15041 * doc/invoke.texi (OpenRISC Options): Document mcmodel.
15043 2021-08-14 Martin Sebor <msebor@redhat.com>
15045 PR middle-end/101791
15046 * gimple-ssa-warn-access.cc (new_delete_mismatch_p): Use new argument
15047 to valid_new_delete_pair_p.
15048 * tree.c (valid_new_delete_pair_p): Add argument.
15049 * tree.h (valid_new_delete_pair_p): Same.
15051 2021-08-14 Jakub Jelinek <jakub@redhat.com>
15054 * config/i386/i386-expand.c (expand_vec_perm_broadcast_1)
15055 <case E_V64QImode>: For this mode assert
15056 !TARGET_AVX512BW || d->perm[0] rather than !TARGET_AVX2 || d->perm[0].
15058 2021-08-13 Michael Meissner <meissner@linux.ibm.com>
15061 * config/rs6000/altivec.md (xxeval): Use register_predicate
15062 instead of altivec_register_predicate.
15064 2021-08-13 Martin Sebor <msebor@redhat.com>
15066 PR middle-end/101734
15067 * tree-ssa-uninit.c (maybe_warn_read_write_only): New function.
15068 (maybe_warn_operand): Call it.
15070 2021-08-13 Martin Liska <mliska@suse.cz>
15073 * attribs.c (decl_attributes): Make naked functions "noipa"
15076 2021-08-13 Martin Liska <mliska@suse.cz>
15079 * symtab.c (symtab_node::noninterposable_alias): Do not create
15080 local aliases for target_clone functions as the clonning pass
15083 2021-08-13 Martin Liska <mliska@suse.cz>
15085 * opts.c (LIVE_PATCHING_OPTION): Define.
15086 (control_options_for_live_patching): Use it in error messages.
15088 2021-08-13 Jan Hubicka <hubicka@ucw.cz>
15090 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOREAD.
15091 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
15092 ignore_stores_eaf_flags): New constants.
15093 (remove_useless_eaf_flags): New function.
15094 (eaf_flags_useful_p): Use it.
15095 (deref_flags): Add EAF_NOT_RETURNED if flag is unused;
15097 (modref_lattice::init): Add EAF_NOREAD.
15098 (modref_lattice::add_escape_point): Do not reacord escape point if
15100 (modref_lattice::merge): EAF_NOESCAPE implies EAF_NODIRECTESCAPE;
15101 use remove_useless_eaf_flags.
15102 (modref_lattice::merge_deref): Use ignore_stores_eaf_flags.
15103 (modref_lattice::merge_direct_load): Add EAF_NOREAD
15104 (analyze_ssa_name_flags): Fix handling EAF_NOT_RETURNED
15105 (analyze_parms): Use remove_useless_eaf_flags.
15106 (ipa_merge_modref_summary_after_inlining): Use ignore_stores_eaf_flags.
15107 (modref_merge_call_site_flags): Add caller and ecf_flags parameter;
15108 use remove_useless_eaf_flags.
15109 (modref_propagate_flags_in_scc): Update.
15110 * ipa-modref.h: Turn eaf_flags_t back to char.
15111 * tree-core.h (EAF_NOT_RETURNED): Fix.
15112 (EAF_NOREAD): New constant
15113 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1): Check for
15115 * tree-ssa-structalias.c (handle_rhs_call): Handle new flags.
15116 (handle_pure_call): Likewise.
15118 2021-08-12 Jakub Jelinek <jakub@redhat.com>
15120 * tree.def (OMP_MASKED): New tree code.
15121 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_FILTER.
15122 * tree.h (OMP_MASKED_BODY, OMP_MASKED_CLAUSES, OMP_MASKED_COMBINED,
15123 OMP_CLAUSE_FILTER_EXPR): Define.
15124 * tree.c (omp_clause_num_ops): Add OMP_CLAUSE_FILTER entry.
15125 (omp_clause_code_name): Likewise.
15126 (walk_tree_1): Handle OMP_CLAUSE_FILTER.
15127 * tree-nested.c (convert_nonlocal_omp_clauses,
15128 convert_local_omp_clauses): Handle OMP_CLAUSE_FILTER.
15129 (convert_nonlocal_reference_stmt, convert_local_reference_stmt,
15130 convert_gimple_call): Handle GIMPLE_OMP_MASTER.
15131 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_FILTER.
15132 (dump_generic_node): Handle OMP_MASTER.
15133 * gimple.def (GIMPLE_OMP_MASKED): New gimple code.
15134 * gimple.c (gimple_build_omp_masked): New function.
15135 (gimple_copy): Handle GIMPLE_OMP_MASKED.
15136 * gimple.h (gimple_build_omp_masked): Declare.
15137 (gimple_has_substatements): Handle GIMPLE_OMP_MASKED.
15138 (gimple_omp_masked_clauses, gimple_omp_masked_clauses_ptr,
15139 gimple_omp_masked_set_clauses): New inline functions.
15140 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_MASKED.
15141 * gimple-pretty-print.c (dump_gimple_omp_masked): New function.
15142 (pp_gimple_stmt_1): Handle GIMPLE_OMP_MASKED.
15143 * gimple-walk.c (walk_gimple_stmt): Likewise.
15144 * gimple-low.c (lower_stmt): Likewise.
15145 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
15146 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_FILTER. For clauses
15147 that take one expression rather than decl or constant, force
15148 gimplification of that into a SSA_NAME or temporary unless min
15150 (gimplify_adjust_omp_clauses): Handle OMP_CLAUSE_FILTER.
15151 (gimplify_expr): Handle OMP_MASKED.
15152 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_MASKED.
15153 (estimate_num_insns): Likewise.
15154 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_FILTER.
15155 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_MASKED. Adjust
15156 diagnostics for existence of masked construct.
15157 (scan_omp_1_stmt, lower_omp_master, lower_omp_1, diagnose_sb_1,
15158 diagnose_sb_2): Handle GIMPLE_OMP_MASKED.
15159 * omp-expand.c (expand_omp_synch, expand_omp, omp_make_gimple_edges):
15162 2021-08-12 Uroš Bizjak <ubizjak@gmail.com>
15165 * config/i386/i386.md (avx512f_scalef<mode>2): New insn pattern.
15166 (ldexp<mode>3): Use avx512f_scalef<mode>2.
15167 (UNSPEC_SCALEF): Move from ...
15168 * config/i386/sse.md (UNSPEC_SCALEF): ... here.
15170 2021-08-12 Jan Hubicka <hubicka@ucw.cz>
15172 * ipa-split.c (consider_split): Fix condition testing void functions.
15174 2021-08-12 Aldy Hernandez <aldyh@redhat.com>
15176 * doc/invoke.texi: Remove docs for threader-mode param.
15177 * flag-types.h (enum threader_mode): Remove.
15178 * params.opt: Remove threader-mode param.
15179 * tree-ssa-threadbackward.c (class back_threader): Remove
15180 path_is_unreachable_p.
15181 Make find_paths private.
15182 Add maybe_thread and thread_through_all_blocks.
15183 Remove reference marker for m_registry.
15184 Remove reference marker for m_profit.
15185 (back_threader::back_threader): Adjust for registry and profit not
15187 (dump_path): Move down.
15188 (debug): Move down.
15189 (class thread_jumps): Remove.
15190 (class back_threader_registry): Remove m_all_paths.
15192 (thread_jumps::thread_through_all_blocks): Move to back_threader
15194 (fsm_find_thread_path): Remove
15195 (back_threader::maybe_thread): New.
15196 (back_threader::thread_through_all_blocks): Move from
15198 (back_threader_registry::back_threader_registry): Remove
15200 (back_threader_registry::~back_threader_registry): Remove.
15201 (thread_jumps::find_taken_edge): Remove.
15202 (thread_jumps::check_subpath_and_update_thread_path): Remove.
15203 (thread_jumps::maybe_register_path): Remove.
15204 (thread_jumps::handle_phi): Remove.
15205 (handle_assignment_p): Remove.
15206 (thread_jumps::handle_assignment): Remove.
15207 (thread_jumps::fsm_find_control_statement_thread_paths): Remove.
15208 (thread_jumps::find_jump_threads_backwards): Remove.
15209 (thread_jumps::find_jump_threads_backwards_with_ranger): Remove.
15210 (try_thread_blocks): Rename find_jump_threads_backwards to
15212 (pass_early_thread_jumps::execute): Same.
15214 2021-08-12 Tobias Burnus <tobias@codesourcery.com>
15216 * tree-core.h (omp_clause_proc_bind_kind): Add
15217 OMP_CLAUSE_PROC_BIND_PRIMARY.
15218 * tree-pretty-print.c (dump_omp_clause): Add TODO comment to
15219 change 'master' to 'primary' in proc_bind for OpenMP 5.1.
15221 2021-08-12 Claudiu Zissulescu <claziss@synopsys.com>
15223 * common/config/arc/arc-common.c (arc_option_init_struct): Remove
15224 fno-common reference.
15225 * config/arc/arc.c (arc_override_options): Remove overriding of
15228 2021-08-12 Jakub Jelinek <jakub@redhat.com>
15231 * config/i386/i386-expand.c (ix86_expand_vec_one_operand_perm_avx512):
15232 If d->testing_p, return true after performing checks instead of
15233 actually expanding the insn.
15234 (expand_vec_perm_broadcast_1): Handle V32HImode - assert
15235 !TARGET_AVX512BW and return false.
15237 2021-08-12 Eric Botcazou <ebotcazou@gcc.gnu.org>
15239 * configure.ac (PE linker --disable-dynamicbase support): New check.
15240 * configure: Regenerate.
15241 * config.in: Likewise.
15242 * config/i386/mingw32.h (LINK_SPEC_DISABLE_DYNAMICBASE): New define.
15243 (LINK_SPEC): Use it.
15244 * config/i386/mingw-w64.h (LINK_SPEC_DISABLE_DYNAMICBASE): Likewise.
15245 (LINK_SPEC): Likewise.
15247 2021-08-12 liuhongt <hongtao.liu@intel.com>
15250 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_2): New
15251 post_reload define_insn_and_split.
15252 (*avx512bw_zero_extendv32qiv32hi2_2): Ditto.
15253 (*sse4_1_zero_extendv8qiv8hi2_4): Ditto.
15254 (*avx512f_zero_extendv16hiv16si2_2): Ditto.
15255 (*avx2_zero_extendv8hiv8si2_2): Ditto.
15256 (*sse4_1_zero_extendv4hiv4si2_4): Ditto.
15257 (*avx512f_zero_extendv8siv8di2_2): Ditto.
15258 (*avx2_zero_extendv4siv4di2_2): Ditto.
15259 (*sse4_1_zero_extendv2siv2di2_4): Ditto.
15260 (VI248_256, VI248_512, VI148_512, VI148_256, VI148_128): New
15263 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15265 * config/rs6000/rs6000-builtin-new.def: Add always, power5, and
15268 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15270 * config/rs6000/rs6000-builtin-new.def: Add vsx stanza.
15272 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15274 * config/rs6000/rs6000-builtin-new.def: Finish altivec stanza.
15275 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
15276 initialization of pcvoid_type_node here...
15277 (altivec_init_builtins): ...from here.
15278 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add
15279 RS6000_BTI_const_ptr_void.
15280 (pcvoid_type_node): New macro.
15282 2021-08-11 Richard Biener <rguenther@suse.de>
15285 * tree-ssa-forwprop.c (pass_forwprop::execute): Do not decompose
15286 hard-register accesses.
15288 2021-08-11 Richard Biener <rguenther@suse.de>
15290 * tree-ssa-operands.c (operands_scanner::get_expr_operands):
15291 Do not look at COMPONENT_REF FIELD_DECLs TREE_THIS_VOLATILE
15292 to determine has_volatile_ops.
15294 2021-08-11 Eric Botcazou <ebotcazou@gcc.gnu.org>
15296 * cfgexpand.c (expand_used_vars): Reuse attribs local variable.
15298 2021-08-11 Jan Hubicka <hubicka@ucw.cz>
15299 Alexandre Oliva <oliva@adacore.com>
15301 * ipa-modref.c (modref_lattice::dump): Fix escape_point's min_flags
15303 (modref_lattice::merge_deref): Fix handling of indirect scape points.
15304 (update_escape_summary_1): Likewise.
15305 (update_escape_summary): Likewise.
15306 (ipa_merge_modref_summary_after_inlining): Likewise.
15308 2021-08-11 Richard Biener <rguenther@suse.de>
15310 PR middle-end/101858
15311 * fold-const.c (fold_binary_loc): Guard simplification
15312 of X < (cast) (1 << Y) to integer types.
15314 2021-08-11 Richard Biener <rguenther@suse.de>
15316 PR tree-optimization/101861
15317 * tree-vect-stmts.c (vectorizable_load): Fix error in
15318 previous change with regard to gather vectorization.
15320 2021-08-11 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
15323 * config/arm/arm_neon.h (vdup_n_s8): Replace call to builtin
15325 (vdup_n_s16): Likewise.
15326 (vdup_n_s32): Likewise.
15327 (vdup_n_s64): Likewise.
15328 (vdup_n_u8): Likewise.
15329 (vdup_n_u16): Likewise.
15330 (vdup_n_u32): Likewise.
15331 (vdup_n_u64): Likewise.
15332 (vdup_n_p8): Likewise.
15333 (vdup_n_p16): Likewise.
15334 (vdup_n_p64): Likewise.
15335 (vdup_n_f16): Likewise.
15336 (vdup_n_f32): Likewise.
15337 (vdupq_n_s8): Likewise.
15338 (vdupq_n_s16): Likewise.
15339 (vdupq_n_s32): Likewise.
15340 (vdupq_n_s64): Likewise.
15341 (vdupq_n_u8): Likewise.
15342 (vdupq_n_u16): Likewise.
15343 (vdupq_n_u32): Likewise.
15344 (vdupq_n_u64): Likewise.
15345 (vdupq_n_p8): Likewise.
15346 (vdupq_n_p16): Likewise.
15347 (vdupq_n_p64): Likewise.
15348 (vdupq_n_f16): Likewise.
15349 (vdupq_n_f32): Likewise.
15350 (vmov_n_s8): Replace call to builtin with call to corresponding
15352 (vmov_n_s16): Likewise.
15353 (vmov_n_s32): Likewise.
15354 (vmov_n_s64): Likewise.
15355 (vmov_n_u8): Likewise.
15356 (vmov_n_u16): Likewise.
15357 (vmov_n_u32): Likewise.
15358 (vmov_n_u64): Likewise.
15359 (vmov_n_p8): Likewise.
15360 (vmov_n_p16): Likewise.
15361 (vmov_n_f16): Likewise.
15362 (vmov_n_f32): Likewise.
15363 (vmovq_n_s8): Likewise.
15364 (vmovq_n_s16): Likewise.
15365 (vmovq_n_s32): Likewise.
15366 (vmovq_n_s64): Likewise.
15367 (vmovq_n_u8): Likewise.
15368 (vmovq_n_u16): Likewise.
15369 (vmovq_n_u32): Likewise.
15370 (vmovq_n_u64): Likewise.
15371 (vmovq_n_p8): Likewise.
15372 (vmovq_n_p16): Likewise.
15373 (vmovq_n_f16): Likewise.
15374 (vmovq_n_f32): Likewise.
15375 * config/arm/arm_neon_builtins.def: Remove entries for vdup_n.
15377 2021-08-11 liuhongt <hongtao.liu@intel.com>
15380 * config/i386/i386.md (ldexp<mode>3): Extend to vscalefs[sd]
15381 when TARGET_AVX512F and TARGET_SSE_MATH.
15383 2021-08-10 Jakub Jelinek <jakub@redhat.com>
15386 * config/i386/i386-expand.c (expand_vec_perm_even_odd): Return false
15387 for V32HImode if !TARGET_AVX512BW.
15388 (ix86_vectorize_vec_perm_const) <case E_V32HImode, case E_V64QImode>:
15389 If !TARGET_AVX512BW and TARGET_AVX512F and d.testing_p, don't fail
15390 early, but actually check the permutation.
15392 2021-08-10 Richard Biener <rguenther@suse.de>
15394 PR tree-optimization/101809
15395 * tree-vect-stmts.c (get_load_store_type): Allow emulated
15396 gathers with offset vector nunits being a constant multiple
15397 of the data vector nunits.
15398 (vect_get_gather_scatter_ops): Use the appropriate nunits
15399 for the offset vector defs.
15400 (vectorizable_store): Adjust call to
15401 vect_get_gather_scatter_ops.
15402 (vectorizable_load): Likewise. Handle the case of less
15403 offset vectors than data vectors.
15405 2021-08-10 Jakub Jelinek <jakub@redhat.com>
15408 * config/i386/sse.md (*avx512f_shuf_<shuffletype>64x2_1<mask_name>_1,
15409 *avx512f_shuf_<shuffletype>32x4_1<mask_name>_1): New define_insn
15412 2021-08-10 Richard Biener <rguenther@suse.de>
15414 PR tree-optimization/101801
15415 PR tree-optimization/101819
15416 * tree-vectorizer.h (vect_emulated_vector_p): Declare.
15417 * tree-vect-loop.c (vect_emulated_vector_p): New function.
15418 (vectorizable_reduction): Re-instantiate a check for emulated
15420 * tree-vect-stmts.c (vectorizable_shift): Likewise.
15421 (vectorizable_operation): Likewise. Cost emulated vector
15422 operations according to the scalar sequence synthesized by
15425 2021-08-10 Richard Biener <rguenther@suse.de>
15427 PR middle-end/101824
15428 * tree-nested.c (get_frame_field): Mark the COMPONENT_REF as
15429 volatile in case the variable was.
15431 2021-08-10 H.J. Lu <hjl.tools@gmail.com>
15434 * config/i386/constraints.md (BC): Document for integer SSE
15435 constant all bits set operand.
15436 (BF): New constraint for const floating-point all bits set
15438 * config/i386/i386.c (standard_sse_constant_p): Likewise.
15439 (standard_sse_constant_opcode): Likewise.
15440 * config/i386/sse.md (sseconstm1): New mode attribute.
15441 (mov<mode>_internal): Replace BC with <sseconstm1>.
15443 2021-08-10 liuhongt <hongtao.liu@intel.com>
15445 * config/i386/sse.md (cond_<insn><mode>): New expander.
15446 (VI248_AVX512VLBW): New mode iterator.
15447 * config/i386/predicates.md
15448 (nonimmediate_or_const_vec_dup_operand): New predicate.
15450 2021-08-09 Andrew MacLeod <amacleod@redhat.com>
15452 PR tree-optimization/101741
15453 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Check
15454 type of parameter for toupper/tolower.
15456 2021-08-09 Martin Jambor <mjambor@suse.cz>
15458 PR testsuite/101654
15459 * ipa-prop.c (propagate_controlled_uses): Removed a spurious space.
15461 2021-08-09 Pat Haugen <pthaugen@linux.ibm.com>
15463 * config/rs6000/rs6000.c (is_load_insn1): Verify destination is a
15465 (is_store_insn1): Verify source is a register.
15467 2021-08-09 Uroš Bizjak <ubizjak@gmail.com>
15470 * config/i386/mmx.md (<any_logic:code>v2sf3):
15471 Rename from *mmx_<any_logic:code>v2sf3
15473 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15475 * config/nvptx/nvptx.c: Cross-reference parts adapted in
15476 'gcc/omp-oacc-neuter-broadcast.cc'.
15477 * omp-low.c: Likewise.
15478 * omp-oacc-neuter-broadcast.cc: Cross-reference parts adapted from
15481 2021-08-09 Julian Brown <julian@codesourcery.com>
15482 Kwok Cheung Yeung <kcy@codesourcery.com>
15483 Thomas Schwinge <thomas@codesourcery.com>
15485 * config/gcn/gcn.c (gcn_init_builtins): Override decls for
15486 BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START,
15487 BUILT_IN_GOACC_SINGLE_COPY_END and BUILT_IN_GOACC_BARRIER.
15488 (gcn_goacc_validate_dims): Turn on worker partitioning unconditionally.
15489 (gcn_fork_join): Update comment.
15490 * config/gcn/gcn.opt (flag_worker_partitioning): Remove.
15491 (macc_experimental_workers): Remove unused option.
15493 2021-08-09 Julian Brown <julian@codesourcery.com>
15494 Nathan Sidwell <nathan@codesourcery.com> (via 'gcc/config/nvptx/nvptx.c' master)
15495 Kwok Cheung Yeung <kcy@codesourcery.com>
15496 Thomas Schwinge <thomas@codesourcery.com>
15498 * Makefile.in (OBJS): Add omp-oacc-neuter-broadcast.o.
15499 * doc/tm.texi.in (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD):
15500 Add documentation hook.
15501 * doc/tm.texi: Regenerate.
15502 * omp-oacc-neuter-broadcast.cc: New file.
15503 * omp-builtins.def (BUILT_IN_GOACC_BARRIER)
15504 (BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START)
15505 (BUILT_IN_GOACC_SINGLE_COPY_END): New builtins.
15506 * passes.def (pass_omp_oacc_neuter_broadcast): Add pass.
15507 * target.def (goacc.create_worker_broadcast_record): Add target
15509 * tree-pass.h (make_pass_omp_oacc_neuter_broadcast): Add
15511 * config/gcn/gcn-protos.h (gcn_goacc_adjust_propagation_record):
15512 Rename prototype to...
15513 (gcn_goacc_create_worker_broadcast_record): ... this.
15514 * config/gcn/gcn-tree.c (gcn_goacc_adjust_propagation_record): Rename
15516 (gcn_goacc_create_worker_broadcast_record): ... this.
15517 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_PROPAGATION_RECORD):
15519 (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD): ... this.
15521 2021-08-09 Tejas Belagod <tejas.belagod@arm.com>
15524 * config/aarch64/aarch64-simd.md (vlshr<mode>3, vashr<mode>3): Use
15525 the right iterator.
15527 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15529 * Makefile.in (GTFILES): Remove '$(srcdir)/omp-offload.c'.
15531 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15533 * builtins.def (DEF_GOACC_BUILTIN, DEF_GOMP_BUILTIN): Don't
15534 consider '-foffload-abi'.
15535 * common.opt (-foffload-abi): Remove 'Var', 'Init'.
15536 * opts.c (common_handle_option) <-foffload-abi> [ACCEL_COMPILER]:
15539 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15541 * optc-gen.awk: Sanity check that 'Init' doesn't appear without
15544 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15546 * omp-builtins.def (BUILT_IN_ACC_GET_DEVICE_TYPE): Remove.
15548 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15550 * doc/gty.texi (Files): Update.
15552 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15554 * doc/gty.texi (Files): Fix GTY header file example.
15556 2021-08-09 Roger Sayle <roger@nextmovesoftware.com>
15558 * tree-ssa-ccp.c (value_mask_to_min_max): Helper function to
15559 determine the upper and lower bounds from a mask-value pair.
15560 (bit_value_unop) [ABS_EXPR, ABSU_EXPR]: Add support for
15561 absolute value and unsigned absolute value expressions.
15562 (bit_value_binop): Initialize *VAL's precision.
15563 [LT_EXPR, LE_EXPR]: Use value_mask_to_min_max to determine
15564 upper and lower bounds of operands. Add LE_EXPR/GE_EXPR
15565 support when the operands are unknown but potentially equal.
15566 [MIN_EXPR, MAX_EXPR]: Support minimum/maximum expressions.
15568 2021-08-09 Bin Cheng <bin.cheng@linux.alibaba.com>
15570 * config/aarch64/aarch64.md
15571 (*extend<SHORT:mode><GPI:mode>2_aarch64): Use %<GPI:w>0.
15573 2021-08-08 Sergei Trofimovich <siarheit@google.com>
15575 * lra-constraints.c: Fix s/otput/output/ typo.
15577 2021-08-06 Martin Sebor <msebor@redhat.com>
15579 * builtins.c (expand_builtin_memchr): Move to gimple-ssa-warn-access.cc.
15580 (expand_builtin_strcat): Same.
15581 (expand_builtin_stpncpy): Same.
15582 (expand_builtin_strncat): Same.
15583 (check_read_access): Same.
15584 (check_memop_access): Same.
15585 (expand_builtin_strlen): Move checks to gimple-ssa-warn-access.cc.
15586 (expand_builtin_strnlen): Same.
15587 (expand_builtin_memcpy): Same.
15588 (expand_builtin_memmove): Same.
15589 (expand_builtin_mempcpy): Same.
15590 (expand_builtin_strcpy): Same.
15591 (expand_builtin_strcpy_args): Same.
15592 (expand_builtin_stpcpy_1): Same.
15593 (expand_builtin_strncpy): Same.
15594 (expand_builtin_memset): Same.
15595 (expand_builtin_bzero): Same.
15596 (expand_builtin_strcmp): Same.
15597 (expand_builtin_strncmp): Same.
15598 (expand_builtin): Remove handlers.
15599 (fold_builtin_strlen): Add a comment.
15600 * builtins.h (check_access): Move to gimple-ssa-warn-access.cc.
15601 * calls.c (maybe_warn_nonstring_arg): Same.
15602 * diagnostic-spec.c (nowarn_spec_t::nowarn_spec_t): Add warning option.
15603 * gimple-fold.c (gimple_fold_builtin_strcpy): Pass argument to callee.
15604 (gimple_fold_builtin_stpcpy): Same.
15605 * gimple-ssa-warn-access.cc (has_location): New function.
15606 (get_location): Same.
15607 (get_callee_fndecl): Same.
15608 (call_nargs): Same.
15610 (warn_string_no_nul): Define.
15611 (unterminated_array): Same.
15612 (check_nul_terminated_array): Same.
15613 (maybe_warn_nonstring_arg): Same.
15614 (maybe_warn_for_bound): Same.
15615 (warn_for_access): Same.
15616 (check_access): Same.
15617 (check_memop_access): Same.
15618 (check_read_access): Same.
15619 (warn_dealloc_offset): Use helper functions.
15620 (maybe_emit_free_warning): Same.
15621 (class pass_waccess): Add members.
15622 (check_strcat): New function.
15623 (check_strncat): New function.
15624 (check_stxcpy): New function.
15625 (check_stxncpy): New function.
15626 (check_strncmp): New function.
15627 (pass_waccess::check_builtin): New function.
15628 (pass_waccess::check): Call it.
15629 * gimple-ssa-warn-access.h (warn_string_no_nul): Move here from
15631 (maybe_warn_for_bound): Same.
15632 (check_access): Same.
15633 (check_memop_access): Same.
15634 (check_read_access): Same.
15635 * pointer-query.h (struct access_data): Define a ctor overload.
15637 2021-08-06 Richard Biener <rguenther@suse.de>
15639 PR tree-optimization/101801
15640 * tree-vectorizer.h (vect_worthwhile_without_simd_p): Rename...
15641 (vect_can_vectorize_without_simd_p): ... to this.
15642 * tree-vect-loop.c (vect_worthwhile_without_simd_p): Rename...
15643 (vect_can_vectorize_without_simd_p): ... to this and fold
15644 in vect_min_worthwhile_factor.
15645 (vect_min_worthwhile_factor): Remove.
15646 (vectorizable_reduction): Adjust and remove the cost part.
15647 * tree-vect-stmts.c (vectorizable_shift): Likewise.
15648 (vectorizable_operation): Likewise.
15650 2021-08-06 Uroš Bizjak <ubizjak@gmail.com>
15653 * config/i386/i386.md (cmove reg-to-reg move elimination peephole2s):
15654 Add general_gr_operand predicate to operand 3.
15656 2021-08-06 Roger Sayle <roger@nextmovesoftware.com>
15658 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern): Use
15659 CFN_BUILT_IN_CLRSB* instead of BUILT_IN_CLRSB* for consistency.
15661 2021-08-06 Tamar Christina <tamar.christina@arm.com>
15663 * config/aarch64/aarch64-sve-builtins.cc (register_svpattern,
15664 register_svprfop): Pass vec<> by pointer.
15665 * langhooks-def.h (lhd_simulate_enum_decl): Likewise.
15666 * langhooks.c (lhd_simulate_enum_decl): Likewise.
15667 * langhooks.h (struct lang_hooks_for_types): Likewise.
15669 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15671 * config/aarch64/arm_neon.h (vst1_bf16_x2): Use
15672 __builtin_memcpy instead of constructing an additional
15673 __builtin_aarch64_simd_oi one vector at a time.
15674 (vst1q_bf16_x2): Likewise.
15675 (vst1_bf16_x3): Use __builtin_memcpy instead of constructing
15676 an additional __builtin_aarch64_simd_ci one vector at a time.
15677 (vst1q_bf16_x3): Likewise.
15678 (vst1_bf16_x4): Use __builtin_memcpy instead of a union.
15679 (vst1q_bf16_x4): Likewise.
15680 (vst2_bf16): Use __builtin_memcpy instead of constructing an
15681 additional __builtin_aarch64_simd_oi one vector at a time.
15682 (vst2q_bf16): Likewise.
15683 (vst3_bf16): Use __builtin_memcpy instead of constructing an
15684 additional __builtin_aarch64_simd_ci mode one vector at a
15686 (vst3q_bf16): Likewise.
15687 (vst4_bf16): Use __builtin_memcpy instead of constructing an
15688 additional __builtin_aarch64_simd_xi one vector at a time.
15689 (vst4q_bf16): Likewise.
15691 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15693 * config/aarch64/arm_neon.h (__ST2_LANE_FUNC): Delete.
15694 (__ST2Q_LANE_FUNC): Delete.
15695 (vst2_lane_f16): Use __builtin_memcpy to copy vector
15696 structure instead of constructing __builtin_aarch64_simd_oi
15697 one vector at a time.
15698 (vst2_lane_f32): Likewise.
15699 (vst2_lane_f64): Likewise.
15700 (vst2_lane_p8): Likewise.
15701 (vst2_lane_p16): Likewise.
15702 (vst2_lane_p64): Likewise.
15703 (vst2_lane_s8): Likewise.
15704 (vst2_lane_s16): Likewise.
15705 (vst2_lane_s32): Likewise.
15706 (vst2_lane_s64): Likewise.
15707 (vst2_lane_u8): Likewise.
15708 (vst2_lane_u16): Likewise.
15709 (vst2_lane_u32): Likewise.
15710 (vst2_lane_u64): Likewise.
15711 (vst2_lane_bf16): Likewise.
15712 (vst2q_lane_f16): Use __builtin_memcpy to copy vector
15713 structure instead of using a union.
15714 (vst2q_lane_f32): Likewise.
15715 (vst2q_lane_f64): Likewise.
15716 (vst2q_lane_p8): Likewise.
15717 (vst2q_lane_p16): Likewise.
15718 (vst2q_lane_p64): Likewise.
15719 (vst2q_lane_s8): Likewise.
15720 (vst2q_lane_s16): Likewise.
15721 (vst2q_lane_s32): Likewise.
15722 (vst2q_lane_s64): Likewise.
15723 (vst2q_lane_u8): Likewise.
15724 (vst2q_lane_u16): Likewise.
15725 (vst2q_lane_u32): Likewise.
15726 (vst2q_lane_u64): Likewise.
15727 (vst2q_lane_bf16): Likewise.
15729 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15731 * config/aarch64/arm_neon.h (__ST3_LANE_FUNC): Delete.
15732 (__ST3Q_LANE_FUNC): Delete.
15733 (vst3_lane_f16): Use __builtin_memcpy to copy vector
15734 structure instead of constructing __builtin_aarch64_simd_ci
15735 one vector at a time.
15736 (vst3_lane_f32): Likewise.
15737 (vst3_lane_f64): Likewise.
15738 (vst3_lane_p8): Likewise.
15739 (vst3_lane_p16): Likewise.
15740 (vst3_lane_p64): Likewise.
15741 (vst3_lane_s8): Likewise.
15742 (vst3_lane_s16): Likewise.
15743 (vst3_lane_s32): Likewise.
15744 (vst3_lane_s64): Likewise.
15745 (vst3_lane_u8): Likewise.
15746 (vst3_lane_u16): Likewise.
15747 (vst3_lane_u32): Likewise.
15748 (vst3_lane_u64): Likewise.
15749 (vst3_lane_bf16): Likewise.
15750 (vst3q_lane_f16): Use __builtin_memcpy to copy vector
15751 structure instead of using a union.
15752 (vst3q_lane_f32): Likewise.
15753 (vst3q_lane_f64): Likewise.
15754 (vst3q_lane_p8): Likewise.
15755 (vst3q_lane_p16): Likewise.
15756 (vst3q_lane_p64): Likewise.
15757 (vst3q_lane_s8): Likewise.
15758 (vst3q_lane_s16): Likewise.
15759 (vst3q_lane_s32): Likewise.
15760 (vst3q_lane_s64): Likewise.
15761 (vst3q_lane_u8): Likewise.
15762 (vst3q_lane_u16): Likewise.
15763 (vst3q_lane_u32): Likewise.
15764 (vst3q_lane_u64): Likewise.
15765 (vst3q_lane_bf16): Likewise.
15767 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15769 * config/aarch64/arm_neon.h (__ST4_LANE_FUNC): Delete.
15770 (__ST4Q_LANE_FUNC): Delete.
15771 (vst4_lane_f16): Use __builtin_memcpy to copy vector
15772 structure instead of constructing __builtin_aarch64_simd_xi
15773 one vector at a time.
15774 (vst4_lane_f32): Likewise.
15775 (vst4_lane_f64): Likewise.
15776 (vst4_lane_p8): Likewise.
15777 (vst4_lane_p16): Likewise.
15778 (vst4_lane_p64): Likewise.
15779 (vst4_lane_s8): Likewise.
15780 (vst4_lane_s16): Likewise.
15781 (vst4_lane_s32): Likewise.
15782 (vst4_lane_s64): Likewise.
15783 (vst4_lane_u8): Likewise.
15784 (vst4_lane_u16): Likewise.
15785 (vst4_lane_u32): Likewise.
15786 (vst4_lane_u64): Likewise.
15787 (vst4_lane_bf16): Likewise.
15788 (vst4q_lane_f16): Use __builtin_memcpy to copy vector
15789 structure instead of using a union.
15790 (vst4q_lane_f32): Likewise.
15791 (vst4q_lane_f64): Likewise.
15792 (vst4q_lane_p8): Likewise.
15793 (vst4q_lane_p16): Likewise.
15794 (vst4q_lane_p64): Likewise.
15795 (vst4q_lane_s8): Likewise.
15796 (vst4q_lane_s16): Likewise.
15797 (vst4q_lane_s32): Likewise.
15798 (vst4q_lane_s64): Likewise.
15799 (vst4q_lane_u8): Likewise.
15800 (vst4q_lane_u16): Likewise.
15801 (vst4q_lane_u32): Likewise.
15802 (vst4q_lane_u64): Likewise.
15803 (vst4q_lane_bf16): Likewise.
15805 2021-08-06 Martin Liska <mliska@suse.cz>
15807 * config/rs6000/rs6000.c (rs6000_option_override_internal): When
15808 a target option is restored, it can have
15809 rs6000_long_double_type_size set to FLOAT_PRECISION_TFmode
15810 and error should not be emitted.
15812 2021-08-06 Sebastian Huber <sebastian.huber@embedded-brains.de>
15814 * gcov-io.h (gcov_write): Declare.
15815 * gcov-io.c (gcov_write): New.
15816 (gcov_write_counter): Remove.
15817 (gcov_write_tag_length): Likewise.
15818 (gcov_write_summary): Replace gcov_write_tag_length() with calls to
15819 gcov_write_unsigned().
15820 * doc/invoke.texi (fprofile-info-section): Mention
15821 __gcov_info_to_gdca().
15823 2021-08-06 Martin Sebor <msebor@redhat.com>
15825 * dominance.c (prune_bbs_to_update_dominators): Adjust by-value vec
15826 arguments to by-reference.
15827 (iterate_fix_dominators): Same.
15828 * dominance.h (iterate_fix_dominators): Same.
15829 * ipa-prop.h: Call auto_vec::to_vec_legacy.
15830 * tree-data-ref.c (dump_data_dependence_relation): Adjust by-value vec
15831 arguments to by-reference.
15832 (debug_data_dependence_relation): Same.
15833 (dump_data_dependence_relations): Same.
15834 * tree-data-ref.h (debug_data_dependence_relation): Same.
15835 (dump_data_dependence_relations): Same.
15836 * tree-predcom.c (dump_chains): Same.
15837 (initialize_root_vars_lm): Same.
15838 (determine_unroll_factor): Same.
15839 (replace_phis_by_defined_names): Same.
15840 (insert_init_seqs): Same.
15841 (pcom_worker::tree_predictive_commoning_loop): Call
15842 auto_vec::to_vec_legacy.
15843 * tree-ssa-pre.c (insert_into_preds_of_block): Adjust by-value vec
15844 arguments to by-reference.
15845 * tree-ssa-threadbackward.c (populate_worklist): Same.
15846 (back_threader::resolve_def): Same.
15847 * tree-vect-data-refs.c (vect_check_nonzero_value): Same.
15848 (vect_enhance_data_refs_alignment): Same.
15849 (vect_check_lower_bound): Same.
15850 (vect_prune_runtime_alias_test_list): Same.
15851 (vect_permute_store_chain): Same.
15852 * tree-vect-slp-patterns.c (vect_normalize_conj_loc): Same.
15853 * tree-vect-stmts.c (vect_create_vectorized_demotion_stmts): Same.
15854 * tree-vectorizer.h (vect_permute_store_chain): Same.
15855 * vec.c (test_init): New function.
15856 (vec_c_tests): Call new function.
15857 * vec.h (vec): Declare ctors, dtor, and assignment.
15858 (auto_vec::vec_to_legacy): New function.
15859 (vec::copy): Adjust initialization.
15861 2021-08-05 H.J. Lu <hjl.tools@gmail.com>
15864 * config/i386/i386.c (ix86_can_inline_p): Ignore MASK_80387 if
15865 callee only uses GPRs.
15866 * config/i386/ia32intrin.h: Revert commit 5463cee2770.
15867 * config/i386/serializeintrin.h: Revert commit 71958f740f1.
15868 * config/i386/x86gprintrin.h: Add
15869 #pragma GCC target("general-regs-only") and #pragma GCC pop_options
15870 to disable non-GPR ISAs.
15872 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
15874 PR middle-end/101787
15875 * doc/md.texi (cond_ashl, cond_ashr, cond_lshr): Document.
15877 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
15879 * tree-vectorizer.h (vect_is_store_elt_extraction, vect_is_reduction)
15880 (vect_reduc_type, vect_embedded_comparison_type, vect_comparison_type)
15881 (vect_is_extending_load, vect_is_integer_truncation): New functions,
15882 moved from aarch64.c but given different names.
15883 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction)
15884 (aarch64_is_reduction, aarch64_reduc_type)
15885 (aarch64_embedded_comparison_type, aarch64_comparison_type)
15886 (aarch64_extending_load_p, aarch64_integer_truncation_p): Delete
15887 in favor of the above. Update callers accordingly.
15889 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15892 * config/arm/arm-cpus.in (generic-armv7-a): Add quirk to suppress
15893 writing .cpu directive in asm output.
15894 * config/arm/arm.c (arm_identify_fpu_from_isa): New variable.
15895 (arm_last_printed_arch_string): Delete.
15896 (arm_last-printed_fpu_string): Delete.
15897 (arm_configure_build_target): If use of floating-point/SIMD is
15898 disabled, remove all fp/simd related features from the target ISA.
15899 (last_arm_targ_options): New variable.
15900 (arm_print_asm_arch_directives): Add new parameters. Change order
15901 of emitted directives and handle all cases here.
15902 (arm_file_start): Always call arm_print_asm_arch_directives, move
15903 all generation of .arch/.arch_extension here.
15904 (arm_file_end): Call arm_print_asm_arch.
15905 (arm_declare_function_name): Call arm_print_asm_arch_directives
15906 instead of printing .arch/.fpu directives directly.
15908 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15910 * config/arm/arm.c (arm_configure_build_target): Don't call
15911 arm_option_reconfigure_globals.
15912 (arm_option_restore): Call arm_option_reconfigure_globals after
15913 reconfiguring the target.
15914 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
15916 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15918 * config/arm/arm.c (arm_configure_build_target): Ensure the target's
15919 arch_name is always set.
15921 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
15923 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
15924 of vec_select high-half from being added into Neon subtract
15927 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
15929 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
15930 of vec_select high-half from being added into Neon add cost.
15932 2021-08-05 Kewen Lin <linkw@linux.ibm.com>
15934 * cfgloop.h (loops_list::loops_list): Add one optional argument
15935 root and adjust accordingly, update loop tree walking and factor
15937 * cfgloop.c (loops_list::walk_loop_tree): ... this. New function.
15939 2021-08-05 Eric Botcazou <ebotcazou@gcc.gnu.org>
15941 PR tree-optimization/101626
15942 * tree-sra.c (propagate_subaccesses_from_rhs): Do not set the
15943 reverse scalar storage order on a pointer or vector component.
15945 2021-08-05 liuhongt <hongtao.liu@intel.com>
15947 * config/i386/sse.md (cond_<code><mode>): New expander.
15949 2021-08-05 liuhongt <hongtao.liu@intel.com>
15951 * config/i386/sse.md (cond_<code><mode>): New expander.
15953 2021-08-05 liuhongt <hongtao.liu@intel.com>
15955 * config/i386/sse.md (cond_<code><mode>): New expander.
15957 2021-08-04 David Malcolm <dmalcolm@redhat.com>
15960 * Makefile.in (ANALYZER_OBJS): Add analyzer/region-model-asm.o.
15962 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
15965 * config/i386/i386.h (STORE_MAX_PIECES): Allow 16/32/64 bytes
15966 only if TARGET_INTER_UNIT_MOVES_TO_VEC is true.
15968 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
15971 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
15972 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
15973 data with SSE register from one memory location to another.
15975 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15977 * config/s390/s390.c (expand_perm_with_vpdi): New function.
15978 (vectorize_vec_perm_const_1): Call expand_perm_with_vpdi.
15979 * config/s390/vector.md (*vpdi1<mode>, @vpdi1<mode>): Enable a
15980 parameterized expander.
15981 (*vpdi4<mode>, @vpdi4<mode>): Likewise.
15983 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15985 * config/s390/s390.c (MAX_VECT_LEN): Define macro.
15986 (struct expand_vec_perm_d): Define struct.
15987 (expand_perm_with_merge): New function.
15988 (vectorize_vec_perm_const_1): New function.
15989 (s390_vectorize_vec_perm_const): New function.
15990 (TARGET_VECTORIZE_VEC_PERM_CONST): Define target macro.
15992 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15994 * config/s390/vector.md (V_HW_64): Remove mode iterator.
15995 (*vec_load_pair<mode>): Use V_HW_2 instead of V_HW_64.
15996 * config/s390/vx-builtins.md
15997 (vec_scatter_element<V_HW_2:mode>_SI): Use V_HW_2 instead of
16000 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16002 * config/s390/s390.md (UNSPEC_VEC_PERMI): Remove constant
16004 * config/s390/vector.md (*vpdi1<mode>, *vpdi4<mode>): New pattern
16006 * config/s390/vx-builtins.md (*vec_permi<mode>): Emit generic rtx
16007 instead of an unspec.
16009 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
16011 * config/s390/s390-modes.def: Add more vector modes to support
16012 concatenation of two vectors.
16013 * config/s390/s390-protos.h (s390_expand_merge_perm_const): Add
16015 (s390_expand_merge): Likewise.
16016 * config/s390/s390.c (s390_expand_merge_perm_const): New function.
16017 (s390_expand_merge): New function.
16018 * config/s390/s390.md (UNSPEC_VEC_MERGEH, UNSPEC_VEC_MERGEL):
16019 Remove constant definitions.
16020 * config/s390/vector.md (V_HW_2): Add mode iterators.
16021 (VI_HW_4, V_HW_4): Rename VI_HW_4 to V_HW_4.
16022 (vec_2x_nelts, vec_2x_wide): New mode attributes.
16023 (*vmrhb, *vmrlb, *vmrhh, *vmrlh, *vmrhf, *vmrlf, *vmrhg, *vmrlg):
16024 New pattern definitions.
16025 (vec_widen_umult_lo_<mode>, vec_widen_umult_hi_<mode>)
16026 (vec_widen_smult_lo_<mode>, vec_widen_smult_hi_<mode>)
16027 (vec_unpacks_lo_v4sf, vec_unpacks_hi_v4sf, vec_unpacks_lo_v2df)
16028 (vec_unpacks_hi_v2df): Adjust expanders to emit non-unspec RTX for
16030 * config/s390/vx-builtins.md (V_HW_4): Remove mode iterator. Now
16032 (vec_mergeh<mode>, vec_mergel<mode>): Use s390_expand_merge to
16033 emit vec merge pattern.
16035 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
16037 * config/aarch64/aarch64.c (aarch64_strip_extend_vec_half):
16039 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent cost of
16040 vec_select high-half from being added into Neon multiply
16042 * rtlanal.c (vec_series_highpart_p): Define.
16043 * rtlanal.h (vec_series_highpart_p): Declare.
16045 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
16047 * config/aarch64/aarch64.c (aarch64_strip_duplicate_vec_elt):
16049 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent
16050 vec_select cost from being added into Neon multiply cost.
16052 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16054 * tree-vect-loop.c (vect_better_loop_vinfo_p): Detect cases in
16055 which old_loop_vinfo is an epilogue loop that handles a constant
16056 number of iterations.
16058 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16060 * tree-vect-loop.c (vect_analyze_loop): Print a dump message
16061 when a reanalyzed loop fails to be cheaper than the current
16064 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
16066 * config/aarch64/aarch64.c: Fix a typo.
16068 2021-08-04 Vincent Lefèvre <vincent-gcc@vinc17.net>
16070 PR gcov-profile/101773
16071 * gcov-io.c (gcov_close): Check return code of a fclose.
16073 2021-08-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
16076 * dwarf2out.c (dwarf2out_assembly_start): Emit a dummy
16077 .file statement when needed.
16079 2021-08-04 Richard Biener <rguenther@suse.de>
16081 * tree-vect-data-refs.c (vect_check_gather_scatter):
16082 Include widening conversions only when the result is
16083 still handed by native gather or the current offset
16084 size not already matches the data size.
16085 Also succeed analysis in case there's no native support,
16086 noted by a IFN_LAST ifn and a NULL decl.
16087 (vect_analyze_data_refs): Always consider gathers.
16088 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
16089 Test for no IFN gather rather than decl gather.
16090 * tree-vect-stmts.c (vect_model_load_cost): Pass in the
16091 gather-scatter info and cost emulated gathers accordingly.
16092 (vect_truncate_gather_scatter_offset): Properly test for
16094 (vect_use_strided_gather_scatters_p): Likewise.
16095 (get_load_store_type): Handle emulated gathers and its
16097 (vectorizable_load): Likewise. Emulate them by extracting
16098 scalar offsets, doing scalar loads and a vector construct.
16100 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
16103 * expr.c (op_by_pieces_d::op_by_pieces_d): Add a max_pieces
16104 argument to set m_max_size.
16105 (move_by_pieces_d): Pass MOVE_MAX_PIECES to op_by_pieces_d.
16106 (store_by_pieces_d): Pass STORE_MAX_PIECES to op_by_pieces_d.
16107 (compare_by_pieces_d): Pass COMPARE_MAX_PIECES to op_by_pieces_d.
16109 2021-08-04 Roger Sayle <roger@nextmovesoftware.com>
16110 Marc Glisse <marc.glisse@inria.fr>
16112 * match.pd (bit_ior, bit_xor): Canonicalize (X*C1)|(X*C2) and
16113 (X*C1)^(X*C2) as X*(C1+C2), and related variants, using
16114 tree_nonzero_bits to ensure that operands are bit-wise disjoint.
16116 2021-08-04 Richard Biener <rguenther@suse.de>
16118 * tree-ssa-forwprop.c (pass_forwprop::execute): Split
16119 out code to decompose vector loads ...
16120 (optimize_vector_load): ... here. Generalize it to
16121 handle intermediate widening and TARGET_MEM_REF loads
16122 and apply it to loads with a supported vector mode as well.
16124 2021-08-04 Richard Biener <rguenther@suse.de>
16126 PR tree-optimization/101756
16127 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Make sure
16128 the result of the reduction epilogue is compatible to the original
16131 2021-08-04 liuhongt <hongtao.liu@intel.com>
16134 * config/i386/i386.md (peephole2): Refine predicate from
16135 register_operand to general_reg_operand.
16137 2021-08-04 Aldy Hernandez <aldyh@redhat.com>
16139 * gimple-range-path.h (path_range_query::dump): Mark override.
16141 2021-08-04 Richard Biener <rguenther@suse.de>
16143 PR tree-optimization/101769
16144 * tree-tailcall.c (eliminate_tail_call): Add the created loop
16145 for the first recursion and return it via the new output parameter.
16146 (optimize_tail_call): Pass through new output param.
16147 (tree_optimize_tail_calls_1): After creating all latches,
16148 add the created loop to the loop tree. Do not mark loops for fixup.
16150 2021-08-04 Martin Liska <mliska@suse.cz>
16152 * doc/invoke.texi: Document threader-mode param.
16154 2021-08-04 liuhongt <hongtao.liu@intel.com>
16156 * config/i386/sse.md (cond_fma<mode>): New expander.
16157 (cond_fms<mode>): Ditto.
16158 (cond_fnma<mode>): Ditto.
16159 (cond_fnms<mode>): Ditto.
16161 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
16163 * config/rs6000/vsx.md (*vsx_le_perm_store_<mode>): Use && instead of &.
16165 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
16167 * config/rs6000/constraints.md: Remove "e" from the list of available
16168 constraint characters.
16170 2021-08-03 Eugene Rozenfeld <erozen@microsoft.com>
16172 PR gcov-profile/71672
16173 * auto-profile.c (afdo_indirect_call): Fix setup of the historgram value for indirect calls.
16175 2021-08-03 Paul A. Clarke <pc@us.ibm.com>
16177 * config/rs6000/smmintrin.h (_mm_minpos_epu16): New.
16179 2021-08-03 H.J. Lu <hjl.tools@gmail.com>
16181 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode,
16182 try XMM31 to avoid vzeroupper.
16184 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16186 * doc/invoke.texi: Document -mtune=neoverse-512tvb and
16187 -mcpu=neoverse-512tvb.
16188 * config/aarch64/aarch64-cores.def (neoverse-512tvb): New entry.
16189 * config/aarch64/aarch64-tune.md: Regenerate.
16190 * config/aarch64/aarch64.c (neoverse512tvb_sve_vector_cost)
16191 (neoverse512tvb_sve_issue_info, neoverse512tvb_vec_issue_info)
16192 (neoverse512tvb_vector_cost, neoverse512tvb_tunings): New structures.
16193 (aarch64_adjust_body_cost_sve): Handle -mtune=neoverse-512tvb.
16194 (aarch64_adjust_body_cost): Likewise.
16196 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16198 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Only
16199 record issue information for operations that occur in the
16202 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16204 * config/aarch64/aarch64.c (aarch64_multiply_add_p): Add a vec_flags
16205 parameter. Detect cases in which an Advanced SIMD MLA would almost
16206 certainly require a MOV.
16207 (aarch64_count_ops): Update accordingly.
16209 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16211 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction): New
16212 function, split out from...
16213 (aarch64_detect_vector_stmt_subtype): ...here.
16214 (aarch64_add_stmt_cost): Treat extracting element 0 as free.
16216 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16218 * config/aarch64/aarch64-protos.h (sve_vec_cost):
16219 Add gather_load_x32_cost and gather_load_x64_cost.
16220 * config/aarch64/aarch64.c (generic_sve_vector_cost)
16221 (a64fx_sve_vector_cost, neoversev1_sve_vector_cost): Update
16222 accordingly, using the values given by the scalar_load * number
16223 of elements calculation that we used previously.
16224 (aarch64_detect_vector_stmt_subtype): Use the new fields.
16226 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16228 * config/aarch64/aarch64.c (aarch64_adjust_body_cost_sve): New
16229 function, split out from...
16230 (aarch64_adjust_body_cost): ...here.
16232 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16234 * config/aarch64/fractional-cost.h: New file.
16235 * config/aarch64/aarch64.c: Include <algorithm> (indirectly)
16236 and cost_fraction.h.
16237 (vec_cost_fraction): New typedef.
16238 (aarch64_detect_scalar_stmt_subtype): Use it for statement costs.
16239 (aarch64_detect_vector_stmt_subtype): Likewise.
16240 (aarch64_sve_adjust_stmt_cost, aarch64_adjust_stmt_cost): Likewise.
16241 (aarch64_estimate_min_cycles_per_iter): Use vec_cost_fraction
16243 (aarch64_adjust_body_cost): Likewise.
16244 (aarch64_test_cost_fraction): New function.
16245 (aarch64_run_selftests): Call it.
16247 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16249 * config/aarch64/aarch64-protos.h (tune_params::sve_width): Turn
16251 * config/aarch64/aarch64.c (aarch64_cmp_autovec_modes): Update
16253 (aarch64_estimated_poly_value): Likewise. Use the least significant
16254 set bit for the minimum and likely values. Use the most significant
16255 set bit for the maximum value.
16257 2021-08-03 liuhongt <hongtao.liu@intel.com>
16259 * config/i386/sse.md (cond_<insn><mode>): New expander.
16260 (cond_mul<mode>): Ditto.
16262 2021-08-03 Kewen Lin <linkw@linux.ibm.com>
16264 * tree-cfg.c (move_sese_region_to_fn): Fix typos on dloop.
16266 2021-08-03 liuhongt <hongtao.liu@intel.com>
16268 * config/i386/sse.md (cond_<insn><mode>):New expander.
16269 (cond_mul<mode>): Ditto.
16270 (cond_div<mode>): Ditto.
16272 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16274 * config/i386/i386.c (ix86_finalize_stack_frame_flags): Also
16275 check stack_realign_needed for stack realignment.
16276 (ix86_legitimate_constant_p): Always allow CONST_WIDE_INT smaller
16277 than the largest integer supported by vector register.
16278 * config/i386/i386.h (MAX_MOVE_MAX): New. Set to 64.
16279 (MOVE_MAX): Set to bytes of the largest integer supported by
16281 (STORE_MAX_PIECES): New.
16283 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16285 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
16286 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
16287 data from one memory location to another.
16289 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16291 PR middle-end/90773
16292 * config/i386/i386.c (TARGET_GEN_MEMSET_SCRATCH_RTX): New.
16294 2021-08-02 Aldy Hernandez <aldyh@redhat.com>
16296 PR tree-optimization/101724
16297 * params.opt: Remove --param=threader-iterative.
16298 * tree-ssa-threadbackward.c (pass_thread_jumps::execute): Remove
16301 2021-08-02 Tom de Vries <tdevries@suse.de>
16303 PR middle-end/101665
16304 * doc/extend.texi (nonnull attribute): Improve documentation.
16306 2021-08-02 Andrew Pinski <apinski@marvell.com>
16308 PR rtl-optimization/101683
16309 * rtlanal.c (may_trap_p_1): Handle UNSIGNED_FIX.
16311 2021-08-02 Roger Sayle <roger@nextmovesoftware.com>
16313 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern):
16314 Renamed from cond_removal_in_popcount_clz_ctz_pattern.
16315 Add support for BSWAP, FFS, PARITY and CLRSB builtins.
16316 (tree_ssa_phiop_worker): Update call to function above.
16318 2021-08-01 H.J. Lu <hjl.tools@gmail.com>
16321 * config/i386/i386.md (bsr_rex64_1_zext): New.
16322 (combine splitter for constant - clzll): Replace gen_bsr_rex64_1
16323 with gen_bsr_rex64_1_zext.
16325 2021-07-31 Jakub Jelinek <jakub@redhat.com>
16328 * config/i386/i386.md (bsr_rex64_1, bsr_1, bsr_zext_1): New
16329 define_insn patterns.
16330 (*bsr_rex64_2, *bsr_2): New define_insn_and_split patterns.
16331 Add combine splitters for constant - clz.
16332 (clz<mode>2): Use a temporary pseudo for bsr result.
16334 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16336 * config/rs6000/smmintrin.h (_mm_floor_pd, _mm_floor_ps,
16337 _mm_floor_sd, _mm_floor_ss): New.
16339 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16341 * config/rs6000/smmintrin.h (_mm_ceil_pd, _mm_ceil_ps,
16342 _mm_ceil_sd, _mm_ceil_ss): New.
16344 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16346 * config/rs6000/smmintrin.h (_mm_blend_pd, _mm_blendv_pd,
16347 _mm_blend_ps, _mm_blendv_ps): New.
16349 2021-07-30 Roger Sayle <roger@nextmovesoftware.com>
16350 Uroš Bizjak <ubizjak@gmail.com>
16352 * config/i386/i386.md (*dec_cmov<mode>): New define_insn_and_split
16353 to generate a conditional move using the carry flag after sub $1.
16354 (peephole2): Eliminate a register-to-register move by inverting
16355 the condition of a conditional move.
16357 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
16359 * config/mmix/mmix.md ("call", "call_value", "*call_real")
16360 ("*call_value_real"): Don't generate rtx mentioning the generic
16361 operands 1 and 2 to "call", and similarly for "call_value".
16362 * config/mmix/mmix.c (mmix_print_operand_punct_valid_p)
16363 (mmix_print_operand): Use '!' instead of 'p'.
16365 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
16367 * doc/md.texi (call): Correct information about operand 2.
16368 * config/mmix/mmix.md ("call", "call_value"): Remove fixed FIXMEs.
16370 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16372 * range-op.cc (operator_trunc_mod::wi_fold): Fold constants.
16374 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16376 * range-op.cc (operator_div::wi_fold): Return UNDEFINED for [0, 0] divisor.
16378 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16380 * gimple-range-cache.cc (*::set_bb_range): Change const basic_block to
16381 const_basic_block..
16382 (*::get_bb_range): Ditto.
16383 (*::bb_range_p): Ditto.
16384 * gimple-range-cache.h: Change prototypes.
16386 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
16388 PR middle-end/90773
16389 * builtins.c (builtin_memcpy_read_str): Change the mode argument
16390 from scalar_int_mode to fixed_size_mode.
16391 (builtin_strncpy_read_str): Likewise.
16392 (gen_memset_value_from_prev): New function.
16393 (builtin_memset_read_str): Change the mode argument from
16394 scalar_int_mode to fixed_size_mode. Use gen_memset_value_from_prev
16395 and support CONST_VECTOR.
16396 (builtin_memset_gen_str): Likewise.
16397 (try_store_by_multiple_pieces): Use by_pieces_constfn to declare
16399 * builtins.h (builtin_strncpy_read_str): Replace scalar_int_mode
16400 with fixed_size_mode.
16401 (builtin_memset_read_str): Likewise.
16402 * expr.c (widest_int_mode_for_size): Renamed to ...
16403 (widest_fixed_size_mode_for_size): Add a bool argument to
16404 indicate if QI vector mode can be used.
16405 (by_pieces_ninsns): Call widest_fixed_size_mode_for_size
16406 instead of widest_int_mode_for_size.
16407 (pieces_addr::adjust): Change the mode argument from
16408 scalar_int_mode to fixed_size_mode.
16409 (op_by_pieces_d): Make m_len read-only. Add a bool member,
16410 m_qi_vector_mode, to indicate that QI vector mode can be used.
16411 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
16412 initialize m_qi_vector_mode. Call widest_fixed_size_mode_for_size
16413 instead of widest_int_mode_for_size.
16414 (op_by_pieces_d::get_usable_mode): Change the mode argument from
16415 scalar_int_mode to fixed_size_mode. Call
16416 widest_fixed_size_mode_for_size instead of
16417 widest_int_mode_for_size.
16418 (op_by_pieces_d::smallest_fixed_size_mode_for_size): New member
16419 function to return the smallest integer or QI vector mode.
16420 (op_by_pieces_d::run): Call widest_fixed_size_mode_for_size
16421 instead of widest_int_mode_for_size. Call
16422 smallest_fixed_size_mode_for_size instead of
16423 smallest_int_mode_for_size.
16424 (store_by_pieces_d::store_by_pieces_d): Add a bool argument to
16425 indicate that QI vector mode can be used and pass it to
16426 op_by_pieces_d::op_by_pieces_d.
16427 (can_store_by_pieces): Call widest_fixed_size_mode_for_size
16428 instead of widest_int_mode_for_size. Pass memsetp to
16429 widest_fixed_size_mode_for_size to support QI vector mode.
16430 Allow all CONST_VECTORs for memset if vec_duplicate is supported.
16431 (store_by_pieces): Pass memsetp to
16432 store_by_pieces_d::store_by_pieces_d.
16433 (clear_by_pieces_1): Removed.
16434 (clear_by_pieces): Replace clear_by_pieces_1 with
16435 builtin_memset_read_str and pass true to store_by_pieces_d to
16436 support vector mode broadcast.
16437 (string_cst_read_str): Change the mode argument from
16438 scalar_int_mode to fixed_size_mode.
16439 * expr.h (by_pieces_constfn): Change scalar_int_mode to
16441 (by_pieces_prev): Likewise.
16442 * rtl.h (lowpart_subreg_regno): New.
16443 * rtlanal.c (lowpart_subreg_regno): New. A wrapper around
16444 simplify_subreg_regno.
16445 * target.def (gen_memset_scratch_rtx): New hook.
16446 * doc/tm.texi.in: Add TARGET_GEN_MEMSET_SCRATCH_RTX.
16447 * doc/tm.texi: Regenerated.
16449 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16452 * config/mips/mips.c (mips_atomic_assign_expand_fenv): Use
16453 TARGET_EXPR instead of MODIFY_EXPR.
16455 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16458 * config/mips/mips-protos.h (mips_expand_vec_cmp_expr): Declare.
16459 * config/mips/mips.c (mips_expand_vec_cmp_expr): New function.
16460 * config/mips/mips-msa.md (vec_cmp<MSA:mode><mode_i>): New
16462 (vec_cmpu<IMSA:mode><mode_i>): New expander.
16464 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
16467 * config/i386/i386-options.c (ix86_option_override_internal):
16468 Don't enable LZCNT/POPCNT if they have been disabled explicitly.
16470 2021-07-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
16473 * config/arm/arm_neon.h (vld1_p64): Replace call to builtin by
16474 explicitly dereferencing __a.
16475 (vld1_s64): Likewise.
16476 (vld1_u64): Likewise.
16477 * config/arm/arm_neon_builtins.def (vld1): Remove entry for di
16478 and change to VAR13.
16480 2021-07-30 Aldy Hernandez <aldyh@redhat.com>
16482 * gimple-loop-versioning.cc (lv_dom_walker::lv_dom_walker): Remove
16483 use of m_range_analyzer.
16484 (loop_versioning::lv_dom_walker::before_dom_children): Same.
16485 (loop_versioning::lv_dom_walker::after_dom_children): Remove.
16486 (loop_versioning::prune_loop_conditions): Replace vr_values use
16487 with range_query interface.
16488 (pass_loop_versioning::execute): Use ranger.
16490 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16493 * ipa-devirt.c (ipa_odr_read_section): Compare the precision of
16494 enum values, and emit a warning if they mismatch.
16496 2021-07-30 Kewen Lin <linkw@linux.ibm.com>
16498 * cfgloop.h (as_const): New function.
16499 (class loop_iterator): Rename to ...
16500 (class loops_list): ... this.
16501 (loop_iterator::next): Rename to ...
16502 (loops_list::Iter::fill_curr_loop): ... this and adjust.
16503 (loop_iterator::loop_iterator): Rename to ...
16504 (loops_list::loops_list): ... this and adjust.
16505 (loops_list::Iter): New class.
16506 (loops_list::iterator): New type.
16507 (loops_list::const_iterator): New type.
16508 (loops_list::begin): New function.
16509 (loops_list::end): Likewise.
16510 (loops_list::begin const): Likewise.
16511 (loops_list::end const): Likewise.
16512 (FOR_EACH_LOOP): Remove.
16513 (FOR_EACH_LOOP_FN): Remove.
16514 * cfgloop.c (flow_loops_dump): Adjust FOR_EACH_LOOP* with range-based
16515 for loop with loops_list instance.
16516 (sort_sibling_loops): Likewise.
16517 (disambiguate_loops_with_multiple_latches): Likewise.
16518 (verify_loop_structure): Likewise.
16519 * cfgloopmanip.c (create_preheaders): Likewise.
16520 (force_single_succ_latches): Likewise.
16521 * config/aarch64/falkor-tag-collision-avoidance.c
16522 (execute_tag_collision_avoidance): Likewise.
16523 * config/mn10300/mn10300.c (mn10300_scan_for_setlb_lcc): Likewise.
16524 * config/s390/s390.c (s390_adjust_loops): Likewise.
16525 * doc/loop.texi: Likewise.
16526 * gimple-loop-interchange.cc (pass_linterchange::execute): Likewise.
16527 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
16528 * gimple-loop-versioning.cc (loop_versioning::analyze_blocks): Likewise.
16529 (loop_versioning::make_versioning_decisions): Likewise.
16530 * gimple-ssa-split-paths.c (split_paths): Likewise.
16531 * graphite-isl-ast-to-gimple.c (graphite_regenerate_ast_isl): Likewise.
16532 * graphite.c (canonicalize_loop_form): Likewise.
16533 (graphite_transform_loops): Likewise.
16534 * ipa-fnsummary.c (analyze_function_body): Likewise.
16535 * ipa-pure-const.c (analyze_function): Likewise.
16536 * loop-doloop.c (doloop_optimize_loops): Likewise.
16537 * loop-init.c (loop_optimizer_finalize): Likewise.
16538 (fix_loop_structure): Likewise.
16539 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
16540 (move_loop_invariants): Likewise.
16541 * loop-unroll.c (decide_unrolling): Likewise.
16542 (unroll_loops): Likewise.
16543 * modulo-sched.c (sms_schedule): Likewise.
16544 * predict.c (predict_loops): Likewise.
16545 (pass_profile::execute): Likewise.
16546 * profile.c (branch_prob): Likewise.
16547 * sel-sched-ir.c (sel_finish_pipelining): Likewise.
16548 (sel_find_rgns): Likewise.
16549 * tree-cfg.c (replace_loop_annotate): Likewise.
16550 (replace_uses_by): Likewise.
16551 (move_sese_region_to_fn): Likewise.
16552 * tree-if-conv.c (pass_if_conversion::execute): Likewise.
16553 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
16554 * tree-parloops.c (parallelize_loops): Likewise.
16555 * tree-predcom.c (tree_predictive_commoning): Likewise.
16556 * tree-scalar-evolution.c (scev_initialize): Likewise.
16557 (scev_reset): Likewise.
16558 * tree-ssa-dce.c (find_obviously_necessary_stmts): Likewise.
16559 * tree-ssa-live.c (remove_unused_locals): Likewise.
16560 * tree-ssa-loop-ch.c (ch_base::copy_headers): Likewise.
16561 * tree-ssa-loop-im.c (analyze_memory_references): Likewise.
16562 (tree_ssa_lim_initialize): Likewise.
16563 * tree-ssa-loop-ivcanon.c (canonicalize_induction_variables): Likewise.
16564 * tree-ssa-loop-ivopts.c (tree_ssa_iv_optimize): Likewise.
16565 * tree-ssa-loop-manip.c (get_loops_exits): Likewise.
16566 * tree-ssa-loop-niter.c (estimate_numbers_of_iterations): Likewise.
16567 (free_numbers_of_iterations_estimates): Likewise.
16568 * tree-ssa-loop-prefetch.c (tree_ssa_prefetch_arrays): Likewise.
16569 * tree-ssa-loop-split.c (tree_ssa_split_loops): Likewise.
16570 * tree-ssa-loop-unswitch.c (tree_ssa_unswitch_loops): Likewise.
16571 * tree-ssa-loop.c (gate_oacc_kernels): Likewise.
16572 (pass_scev_cprop::execute): Likewise.
16573 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
16574 * tree-ssa-sccvn.c (do_rpo_vn): Likewise.
16575 * tree-ssa-threadupdate.c
16576 (jump_thread_path_registry::thread_through_all_blocks): Likewise.
16577 * tree-vectorizer.c (vectorize_loops): Likewise.
16578 * tree-vrp.c (vrp_asserts::find_assert_locations): Likewise.
16580 2021-07-29 Hans-Peter Nilsson <hp@bitrange.com>
16582 * config/mmix/mmix.c (mmix_function_arg_1): Avoid
16583 generating a VOIDmode register for e.g the
16584 function_arg_info::end_marker.
16586 2021-07-29 Jeff Law <jeffreyalaw@gmail.com>
16588 * config/h8300/h8300-modes.def: Add CCZ, CCV and CCC, drop CCZNV.
16589 * config/h8300/h8300.md (H8cc mode iterator): Add CCZ.
16590 (cc mode_attr): Similarly.
16591 (ccz subst_attr): Similarly.
16592 * config/h8300/jumpcall.md: Add new patterns for branch-on-bit.
16593 * config/h8300/testcompare.md: Remove various cc0 based patterns
16594 that had been commented out. Add pattern to set CCZ from a bit
16597 2021-07-29 Thomas Schwinge <thomas@codesourcery.com>
16598 Julian Brown <julian@codesourcery.com>
16599 Kwok Cheung Yeung <kcy@codesourcery.com>
16601 * omp-offload.c (oacc_loop_xform_head_tail, oacc_loop_process):
16602 'update_stmt' after modification.
16603 (pass_oacc_loop_designation): New function, extracted out of...
16604 (pass_oacc_device_lower): ... this.
16605 (pass_data_oacc_loop_designation, pass_oacc_loop_designation)
16606 (make_pass_oacc_loop_designation): New
16607 * passes.def: Add it.
16608 * tree-parloops.c (create_parallel_loop): Adjust.
16609 * tree-pass.h (make_pass_oacc_loop_designation): New.
16611 2021-07-29 Aldy Hernandez <aldyh@redhat.com>
16613 * flag-types.h (enum threader_mode): New.
16614 * params.opt: Add entry for --param=threader-mode.
16615 * tree-ssa-threadbackward.c (THREADER_ITERATIVE_MODE): New.
16616 (class back_threader): New.
16617 (back_threader::back_threader): New.
16618 (back_threader::~back_threader): New.
16619 (back_threader::maybe_register_path): New.
16620 (back_threader::find_taken_edge): New.
16621 (back_threader::find_taken_edge_switch): New.
16622 (back_threader::find_taken_edge_cond): New.
16623 (back_threader::resolve_def): New.
16624 (back_threader::resolve_phi): New.
16625 (back_threader::find_paths_to_names): New.
16626 (back_threader::find_paths): New.
16629 (thread_jumps::find_jump_threads_backwards): Call ranger threader.
16630 (thread_jumps::find_jump_threads_backwards_with_ranger): New.
16631 (pass_thread_jumps::execute): Abstract out code...
16632 (try_thread_blocks): ...here.
16633 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
16634 Abstract out threading candidate code to...
16635 (single_succ_to_potentially_threadable_block): ...here.
16636 * tree-ssa-threadedge.h (single_succ_to_potentially_threadable_block):
16638 * tree-ssa-threadupdate.c (register_jump_thread): Return boolean.
16639 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
16640 Return bool from register_jump_thread.
16642 2021-07-29 Andreas Krebbel <krebbel@linux.ibm.com>
16644 * target.def: in0 and in1 do not need to be registers.
16645 * doc/tm.texi: Regenerate.
16647 2021-07-29 liuhongt <hongtao.liu@intel.com>
16650 * config/i386/i386.c (ix86_widen_mult_cost): New function.
16651 (ix86_add_stmt_cost): Use ix86_widen_mult_cost for
16654 2021-07-29 Jiufu Guo <guojiufu@linux.ibm.com>
16657 * config/rs6000/rs6000.c (TARGET_PREFERRED_DOLOOP_MODE): New hook.
16658 (rs6000_preferred_doloop_mode): New hook.
16659 * doc/tm.texi: Regenerate.
16660 * doc/tm.texi.in: Add hook preferred_doloop_mode.
16661 * target.def (preferred_doloop_mode): New hook.
16662 * targhooks.c (default_preferred_doloop_mode): New hook.
16663 * targhooks.h (default_preferred_doloop_mode): New hook.
16664 * tree-ssa-loop-ivopts.c (compute_doloop_base_on_mode): New function.
16665 (add_iv_candidate_for_doloop): Call targetm.preferred_doloop_mode
16666 and compute_doloop_base_on_mode.
16668 2021-07-28 Martin Sebor <msebor@redhat.com>
16670 PR middle-end/101494
16671 * tree-ssa-uninit.c (maybe_warn_operand): Correct object offset
16672 and size computation.
16674 2021-07-28 Martin Sebor <msebor@redhat.com>
16676 PR middle-end/101601
16677 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Remove
16679 Handle pointers to functions.
16681 2021-07-28 Martin Sebor <msebor@redhat.com>
16683 * Makefile.in (OBJS): Add gimple-ssa-warn-access.o and pointer-query.o.
16684 * attribs.h (fndecl_dealloc_argno): Move fndecl_dealloc_argno to tree.h.
16685 * builtins.c (compute_objsize_r): Move to pointer-query.cc.
16686 (access_ref::access_ref): Same.
16687 (access_ref::phi): Same.
16688 (access_ref::get_ref): Same.
16689 (access_ref::size_remaining): Same.
16690 (access_ref::offset_in_range): Same.
16691 (access_ref::add_offset): Same.
16692 (access_ref::inform_access): Same.
16693 (ssa_name_limit_t::visit_phi): Same.
16694 (ssa_name_limit_t::leave_phi): Same.
16695 (ssa_name_limit_t::next): Same.
16696 (ssa_name_limit_t::next_phi): Same.
16697 (ssa_name_limit_t::~ssa_name_limit_t): Same.
16698 (pointer_query::pointer_query): Same.
16699 (pointer_query::get_ref): Same.
16700 (pointer_query::put_ref): Same.
16701 (pointer_query::flush_cache): Same.
16702 (warn_string_no_nul): Move to gimple-ssa-warn-access.cc.
16703 (check_nul_terminated_array): Same.
16704 (unterminated_array): Same.
16705 (maybe_warn_for_bound): Same.
16706 (check_read_access): Same.
16707 (warn_for_access): Same.
16708 (get_size_range): Same.
16709 (check_access): Same.
16710 (gimple_call_alloc_size): Move to tree.c.
16711 (gimple_parm_array_size): Move to pointer-query.cc.
16712 (get_offset_range): Same.
16713 (gimple_call_return_array): Same.
16714 (handle_min_max_size): Same.
16715 (handle_array_ref): Same.
16716 (handle_mem_ref): Same.
16717 (compute_objsize): Same.
16718 (gimple_call_alloc_p): Move to gimple-ssa-warn-access.cc.
16719 (call_dealloc_argno): Same.
16720 (fndecl_dealloc_argno): Same.
16721 (new_delete_mismatch_p): Same.
16722 (matching_alloc_calls_p): Same.
16723 (warn_dealloc_offset): Same.
16724 (maybe_emit_free_warning): Same.
16725 * builtins.h (check_nul_terminated_array): Move to
16726 gimple-ssa-warn-access.h.
16727 (check_nul_terminated_array): Same.
16728 (warn_string_no_nul): Same.
16729 (unterminated_array): Same.
16730 (class ssa_name_limit_t): Same.
16731 (class pointer_query): Same.
16732 (struct access_ref): Same.
16733 (class range_query): Same.
16734 (struct access_data): Same.
16735 (gimple_call_alloc_size): Same.
16736 (gimple_parm_array_size): Same.
16737 (compute_objsize): Same.
16738 (class access_data): Same.
16739 (maybe_emit_free_warning): Same.
16740 * calls.c (initialize_argument_information): Remove call to
16741 maybe_emit_free_warning.
16742 * gimple-array-bounds.cc: Include new header..
16743 * gimple-fold.c: Same.
16744 * gimple-ssa-sprintf.c: Same.
16745 * gimple-ssa-warn-restrict.c: Same.
16746 * passes.def: Add pass_warn_access.
16747 * tree-pass.h (make_pass_warn_access): Declare.
16748 * tree-ssa-strlen.c: Include new headers.
16749 * tree.c (fndecl_dealloc_argno): Move here from builtins.c.
16750 * tree.h (fndecl_dealloc_argno): Move here from attribs.h.
16751 * gimple-ssa-warn-access.cc: New file.
16752 * gimple-ssa-warn-access.h: New file.
16753 * pointer-query.cc: New file.
16754 * pointer-query.h: New file.
16756 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16758 PR middle-end/101624
16759 * ubsan.c (maybe_instrument_pointer_overflow,
16760 instrument_object_size): Only test DECL_REGISTER on VAR_DECLs,
16761 PARM_DECLs or RESULT_DECLs.
16762 * sanopt.c (maybe_optimize_ubsan_ptr_ifn): Likewise.
16764 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16766 PR middle-end/101642
16767 * match.pd (bswap16 (x) == bswap16 (y)): Cast both operands
16768 to type of bswap16 for comparison.
16769 (bswap16 (x) == cst): Cast bswap16 operand to type of cst.
16771 2021-07-28 Richard Biener <rguenther@suse.de>
16773 PR tree-optimization/101615
16774 * tree-vect-slp.c (vect_optimize_slp): Materialize permutes
16775 at CTOR SLP graph entries.
16777 2021-07-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
16779 * config/aarch64/aarch64.md (*extend<SHORT:mode><GPI:mode>2_aarch64):
16780 Add "r,w" alternative.
16782 2021-07-28 H.J. Lu <hjl.tools@gmail.com>
16785 * config/i386/i386.c (ix86_avx_u128_mode_needed): Don't set
16786 AVX_U128_DIRTY when all bits are zero.
16788 2021-07-28 Richard Biener <rguenther@suse.de>
16790 PR tree-optimization/101615
16791 * tree-vect-slp.c (vect_optimize_slp): Pre-existing vector
16792 external nodes cannot be permuted so make them perm_out 0.
16794 2021-07-28 Andrew Stubbs <ams@codesourcery.com>
16797 * config.in: Regenerate.
16798 * config/gcn/gcn-hsa.h (A_FIJI): New define.
16799 (A_900): New define.
16800 (A_906): New define.
16801 (A_908): New define.
16802 (ASM_SPEC): Use A_FIJI, A_900, A_906 and A_908.
16803 * config/gcn/gcn.c (output_file_start): Adjust attributes according
16804 to the assembler capabilities.
16805 * config/gcn/mkoffload.c (main): Likewise.
16806 * configure: Regenerate.
16807 * configure.ac: Add tests for LLVM assembler attribute features.
16809 2021-07-28 Andrew MacLeod <amacleod@redhat.com>
16811 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Check for
16812 cond_false and cond_true on branches.
16814 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
16816 * config/aarch64/aarch64.c (aarch64_gen_adjusted_ldpstp): use
16819 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
16821 * alias.c (init_alias_analysis): Don't skip prologue/epilogue.
16823 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16826 * config/i386/sse.md (vashr<mode>3): Split into vashrv8di3 expander
16827 and vashrv4di3 expander, where the latter requires just TARGET_AVX2
16828 and has special !TARGET_AVX512VL expansion.
16829 (vashrv2di3<mask_name>): Rename to ...
16830 (vashrv2di3): ... this. Change condition to TARGET_XOP || TARGET_AVX2
16831 and add special !TARGET_XOP && !TARGET_AVX512VL expansion.
16833 2021-07-28 Martin Uecker <muecker@gwdg.de>
16835 * calls.c (maybe_warn_rdwr_sizes): Correct argument
16836 numbers in warning that were switched.
16838 2021-07-28 Kewen Lin <linkw@linux.ibm.com>
16840 PR tree-optimization/101596
16841 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Fix wrong check
16842 by using new_type's precision instead.
16844 2021-07-28 liuhongt <hongtao.liu@intel.com>
16847 * config/i386/i386.h (processor_costs): Add new member
16849 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
16850 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
16851 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
16852 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
16853 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
16854 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
16855 generic_cost, core_cost): Initialize integer_to_sse same value
16857 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
16858 * config/i386/i386.c (ix86_builtin_vectorization_cost):
16859 Use integer_to_sse instead of sse_op to calculate the cost of
16862 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16864 * config/rs6000/rs6000-gen-builtins.c (write_ovld_static_init): New
16866 (write_init_file): Call write_ovld_static_init.
16868 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16870 * config/rs6000/rs6000-gen-builtins.c (write_bif_static_init): New
16872 (write_init_file): Call write_bif_static_init.
16874 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16876 * config/rs6000/rs6000-gen-builtins.c (typemap): New struct.
16877 (TYPE_MAP_SIZE): New macro.
16878 (type_map): New initialized variable.
16879 (typemap_cmp): New function.
16880 (write_type_node): Likewise.
16881 (write_fntype_init): Implement.
16883 2021-07-27 Martin Sebor <msebor@redhat.com>
16885 PR tree-optimization/101584
16886 * tree-ssa-uninit.c (builtin_call_nomodifying_p): New function.
16887 (check_defs): Call it.
16889 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
16891 * tree-ssa-dom.c (dom_jump_threader_simplifier):
16892 Put avail_exprs_stack in the class, instead of passing it to
16893 jump_threader_simplifier.
16894 (dom_jump_threader_simplifier::simplify): Add state argument.
16895 (dom_opt_dom_walker): Add state.
16896 (pass_dominator::execute): Pass state to threader.
16897 (dom_opt_dom_walker::before_dom_children): Use state.
16898 * tree-ssa-threadedge.c (jump_threader::jump_threader): Replace
16899 arguments by state.
16900 (jump_threader::record_temporary_equivalences_from_phis):
16901 Register equivalences through the state variable.
16902 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
16903 Record ranges in a statement through the state variable.
16904 (jump_threader::simplify_control_stmt_condition): Pass state to
16906 (jump_threader::simplify_control_stmt_condition_1): Same.
16907 (jump_threader::thread_around_empty_blocks): Remove obsolete
16909 (jump_threader::thread_through_normal_block): Record equivalences
16910 on edge through the state variable.
16911 (jump_threader::thread_across_edge): Abstract state pushing.
16912 (jt_state::jt_state): New.
16913 (jt_state::push): New.
16914 (jt_state::pop): New.
16915 (jt_state::register_equiv): New.
16916 (jt_state::record_ranges_from_stmt): New.
16917 (jt_state::register_equivs_on_edge): New.
16918 (jump_threader_simplifier::jump_threader_simplifier): Move from
16920 (jump_threader_simplifier::simplify): Add state argument.
16921 * tree-ssa-threadedge.h (class jt_state): New.
16922 (class jump_threader): Add state to constructor.
16923 (class jump_threader_simplifier): Add state to simplify. Remove
16924 avail_exprs_stack from class.
16925 * tree-vrp.c (vrp_jump_threader_simplifier::simplify): Add state
16927 (vrp_jump_threader::vrp_jump_threader): Add state.
16928 (vrp_jump_threader::~vrp_jump_threader): Cleanup state.
16930 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
16932 * Makefile.in (OBJS): Add gimple-range-path.o.
16933 * gimple-range-path.cc: New file.
16934 * gimple-range-path.h: New file.
16936 2021-07-27 Jonathan Wright <jonathan.wright@arm.com>
16938 * config/aarch64/aarch64-simd.md: Push sign/zero-extension
16939 inside vec_duplicate for all patterns.
16940 * simplify-rtx.c (simplify_context::simplify_unary_operation_1):
16941 Push sign/zero-extension inside vec_duplicate.
16943 2021-07-27 Richard Biener <rguenther@suse.de>
16945 PR tree-optimization/101573
16946 * tree-ssa-uninit.c (warn_uninit_phi_uses): New function
16947 looking at uninitialized PHI arg defs in some constrained cases.
16948 (warn_uninitialized_vars): Call it.
16949 (execute_early_warn_uninitialized): Calculate dominators.
16951 2021-07-27 Richard Biener <rguenther@suse.de>
16953 PR tree-optimization/39821
16954 * tree-vect-stmts.c (vect_model_promotion_demotion_cost): Use
16955 vector_stmt for widening arithmetic.
16956 (vectorizable_conversion): Adjust.
16958 2021-07-27 Martin Jambor <mjambor@suse.cz>
16960 * cgraph.h (ipa_replace_map): New field force_load_ref.
16961 * ipa-prop.h (ipa_param_descriptor): Reduce precision of move_cost,
16962 aded new flag load_dereferenced, adjusted comments.
16963 (ipa_get_param_dereferenced): New function.
16964 (ipa_set_param_dereferenced): Likewise.
16965 * cgraphclones.c (cgraph_node::create_virtual_clone): Follow it.
16966 * ipa-cp.c: Include gimple.h.
16967 (ipcp_discover_new_direct_edges): Take into account dereferenced flag.
16968 (get_replacement_map): New parameter force_load_ref, set the
16969 appropriate flag in ipa_replace_map if set.
16970 (struct symbol_and_index_together): New type.
16971 (adjust_refs_in_act_callers): New function.
16972 (adjust_references_in_caller): Likewise.
16973 (create_specialized_node): When appropriate, call
16974 adjust_references_in_caller and force only load references.
16975 * ipa-prop.c (load_from_dereferenced_name): New function.
16976 (ipa_analyze_controlled_uses): Also detect loads from a
16977 dereference, harden testing of call statements.
16978 (ipa_write_node_info): Stream the dereferenced flag.
16979 (ipa_read_node_info): Likewise.
16980 (ipa_set_jf_constant): Also create refdesc when jump function
16981 references a variable.
16982 (cgraph_node_for_jfunc): Rename to symtab_node_for_jfunc, work
16983 also on references of variables and return a symtab_node. Adjust
16985 (propagate_controlled_uses): Also remove references to VAR_DECLs.
16987 2021-07-27 Jakub Jelinek <jakub@redhat.com>
16989 PR middle-end/101586
16990 * gimple-fold.c (clear_padding_type): Ignore FIELD_DECLs with byte
16991 positions above or equal to sz except for diagnostics of flexible
16994 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
16996 PR tree-optimization/78888
16997 * gimple-range-fold.cc (get_letter_range): New.
16998 (fold_using_range::range_of_builtin_call): Call get_letter_range.
17000 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
17002 PR tree-optimization/78888
17003 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Add cases
17004 for CFN_BUILT_IN_TOUPPER and CFN_BUILT_IN_TOLOWER.
17006 2021-07-26 Roger Sayle <roger@nextmovesoftware.com>
17007 Marc Glisse <marc.glisse@inria.fr>
17009 * match.pd (rotate): Simplify equality/inequality of rotations.
17010 (bswap): Simplify equality/inequality tests of byte swapping.
17012 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17014 * range-op.cc (operator_bitwise_xor::op1_op2_relation_effect):
17017 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17019 * range-op.cc (operator_lshift::fold_range): Pass rel to
17020 base class fold_range.
17021 (operator_rshift::fold_range): Same.
17023 2021-07-26 Ashimida <ashimida@linux.alibaba.com>
17026 * toplev.h (min_align_loops_log): Remove declaration.
17027 (min_align_jumps_log, min_align_labels_log): Likewise.
17028 (min_align_functions_log): Likewise.
17030 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17032 * tree-vrp.c (vrp_simplify_cond_using_ranges): Rename vr_values
17034 (execute_vrp): Abstract out simplification of conditionals...
17035 (simplify_casted_conds): ...here.
17037 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
17039 * gimple-array-bounds.cc (array_bounds_checker::get_value_range):
17040 Add gimple argument.
17041 (array_bounds_checker::check_array_ref): Same.
17042 (array_bounds_checker::check_addr_expr): Same.
17043 (array_bounds_checker::check_array_bounds): Pass statement to
17044 check_array_bounds and check_addr_expr.
17045 * gimple-array-bounds.h (check_array_bounds): Add gimple argument.
17046 (check_addr_expr): Same.
17047 (get_value_range): Same.
17049 2021-07-26 Tamar Christina <tamar.christina@arm.com>
17051 * config/aarch64/aarch64-simd-builtins.def (sdot, udot): Rename to..
17052 (sdot_prod, udot_prod): ... This.
17053 * config/aarch64/aarch64-simd.md (aarch64_<sur>dot<vsi2qi>): Merged
17055 (<sur>dot_prod<vsi2qi>): ... this.
17056 (aarch64_<sur>dot_lane<vsi2qi>, aarch64_<sur>dot_laneq<vsi2qi>):
17057 Change operands order.
17058 (<sur>sadv16qi): Use new operands order.
17059 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32,
17060 vdotq_s32): Use new RTL ordering.
17062 2021-07-26 Tamar Christina <tamar.christina@arm.com>
17064 * config/aarch64/aarch64-builtins.c (TYPES_TERNOP_SUSS,
17065 aarch64_types_ternop_suss_qualifiers): New.
17066 * config/aarch64/aarch64-simd-builtins.def (usdot_prod): Use it.
17067 * config/aarch64/aarch64-simd.md (usdot_prod<vsi2qi>): Re-organize RTL.
17068 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Use it.
17070 2021-07-23 Jakub Jelinek <jakub@redhat.com>
17072 PR rtl-optimization/101562
17073 * expmed.c (store_integral_bit_field): Only use movstrict_optab
17074 if the operand isn't paradoxical.
17076 2021-07-23 Aldy Hernandez <aldyh@redhat.com>
17078 * gimple-array-bounds.h (class array_bounds_checker): Change
17079 ranges type to range_query.
17081 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17083 * config/aarch64/arm_neon.h (vst1_s64_x2): Use
17084 __builtin_memcpy instead of constructing
17085 __builtin_aarch64_simd_oi one vector at a time.
17086 (vst1_u64_x2): Likewise.
17087 (vst1_f64_x2): Likewise.
17088 (vst1_s8_x2): Likewise.
17089 (vst1_p8_x2): Likewise.
17090 (vst1_s16_x2): Likewise.
17091 (vst1_p16_x2): Likewise.
17092 (vst1_s32_x2): Likewise.
17093 (vst1_u8_x2): Likewise.
17094 (vst1_u16_x2): Likewise.
17095 (vst1_u32_x2): Likewise.
17096 (vst1_f16_x2): Likewise.
17097 (vst1_f32_x2): Likewise.
17098 (vst1_p64_x2): Likewise.
17099 (vst1q_s8_x2): Likewise.
17100 (vst1q_p8_x2): Likewise.
17101 (vst1q_s16_x2): Likewise.
17102 (vst1q_p16_x2): Likewise.
17103 (vst1q_s32_x2): Likewise.
17104 (vst1q_s64_x2): Likewise.
17105 (vst1q_u8_x2): Likewise.
17106 (vst1q_u16_x2): Likewise.
17107 (vst1q_u32_x2): Likewise.
17108 (vst1q_u64_x2): Likewise.
17109 (vst1q_f16_x2): Likewise.
17110 (vst1q_f32_x2): Likewise.
17111 (vst1q_f64_x2): Likewise.
17112 (vst1q_p64_x2): Likewise.
17114 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17116 * config/aarch64/arm_neon.h (vst1_s64_x3): Use
17117 __builtin_memcpy instead of constructing
17118 __builtin_aarch64_simd_ci one vector at a time.
17119 (vst1_u64_x3): Likewise.
17120 (vst1_f64_x3): Likewise.
17121 (vst1_s8_x3): Likewise.
17122 (vst1_p8_x3): Likewise.
17123 (vst1_s16_x3): Likewise.
17124 (vst1_p16_x3): Likewise.
17125 (vst1_s32_x3): Likewise.
17126 (vst1_u8_x3): Likewise.
17127 (vst1_u16_x3): Likewise.
17128 (vst1_u32_x3): Likewise.
17129 (vst1_f16_x3): Likewise.
17130 (vst1_f32_x3): Likewise.
17131 (vst1_p64_x3): Likewise.
17132 (vst1q_s8_x3): Likewise.
17133 (vst1q_p8_x3): Likewise.
17134 (vst1q_s16_x3): Likewise.
17135 (vst1q_p16_x3): Likewise.
17136 (vst1q_s32_x3): Likewise.
17137 (vst1q_s64_x3): Likewise.
17138 (vst1q_u8_x3): Likewise.
17139 (vst1q_u16_x3): Likewise.
17140 (vst1q_u32_x3): Likewise.
17141 (vst1q_u64_x3): Likewise.
17142 (vst1q_f16_x3): Likewise.
17143 (vst1q_f32_x3): Likewise.
17144 (vst1q_f64_x3): Likewise.
17145 (vst1q_p64_x3): Likewise.
17147 2021-07-23 H.J. Lu <hjl.tools@gmail.com>
17150 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): Don't return
17151 hard register when LRA is in progress.
17153 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17155 * config/aarch64/arm_neon.h (vst1_s8_x4): Use
17156 __builtin_memcpy instead of using a union.
17157 (vst1q_s8_x4): Likewise.
17158 (vst1_s16_x4): Likewise.
17159 (vst1q_s16_x4): Likewise.
17160 (vst1_s32_x4): Likewise.
17161 (vst1q_s32_x4): Likewise.
17162 (vst1_u8_x4): Likewise.
17163 (vst1q_u8_x4): Likewise.
17164 (vst1_u16_x4): Likewise.
17165 (vst1q_u16_x4): Likewise.
17166 (vst1_u32_x4): Likewise.
17167 (vst1q_u32_x4): Likewise.
17168 (vst1_f16_x4): Likewise.
17169 (vst1q_f16_x4): Likewise.
17170 (vst1_f32_x4): Likewise.
17171 (vst1q_f32_x4): Likewise.
17172 (vst1_p8_x4): Likewise.
17173 (vst1q_p8_x4): Likewise.
17174 (vst1_p16_x4): Likewise.
17175 (vst1q_p16_x4): Likewise.
17176 (vst1_s64_x4): Likewise.
17177 (vst1_u64_x4): Likewise.
17178 (vst1_p64_x4): Likewise.
17179 (vst1q_s64_x4): Likewise.
17180 (vst1q_u64_x4): Likewise.
17181 (vst1q_p64_x4): Likewise.
17182 (vst1_f64_x4): Likewise.
17183 (vst1q_f64_x4): Likewise.
17185 2021-07-23 Jonathan Wrightt <jonathan.wright@arm.com>
17187 * config/aarch64/arm_neon.h (vst2_s64): Use __builtin_memcpy
17188 instead of constructing __builtin_aarch64_simd_oi one vector
17190 (vst2_u64): Likewise.
17191 (vst2_f64): Likewise.
17192 (vst2_s8): Likewise.
17193 (vst2_p8): Likewise.
17194 (vst2_s16): Likewise.
17195 (vst2_p16): Likewise.
17196 (vst2_s32): Likewise.
17197 (vst2_u8): Likewise.
17198 (vst2_u16): Likewise.
17199 (vst2_u32): Likewise.
17200 (vst2_f16): Likewise.
17201 (vst2_f32): Likewise.
17202 (vst2_p64): Likewise.
17203 (vst2q_s8): Likewise.
17204 (vst2q_p8): Likewise.
17205 (vst2q_s16): Likewise.
17206 (vst2q_p16): Likewise.
17207 (vst2q_s32): Likewise.
17208 (vst2q_s64): Likewise.
17209 (vst2q_u8): Likewise.
17210 (vst2q_u16): Likewise.
17211 (vst2q_u32): Likewise.
17212 (vst2q_u64): Likewise.
17213 (vst2q_f16): Likewise.
17214 (vst2q_f32): Likewise.
17215 (vst2q_f64): Likewise.
17216 (vst2q_p64): Likewise.
17218 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17220 * config/aarch64/arm_neon.h (vst3_s64): Use __builtin_memcpy
17221 instead of constructing __builtin_aarch64_simd_ci one vector
17223 (vst3_u64): Likewise.
17224 (vst3_f64): Likewise.
17225 (vst3_s8): Likewise.
17226 (vst3_p8): Likewise.
17227 (vst3_s16): Likewise.
17228 (vst3_p16): Likewise.
17229 (vst3_s32): Likewise.
17230 (vst3_u8): Likewise.
17231 (vst3_u16): Likewise.
17232 (vst3_u32): Likewise.
17233 (vst3_f16): Likewise.
17234 (vst3_f32): Likewise.
17235 (vst3_p64): Likewise.
17236 (vst3q_s8): Likewise.
17237 (vst3q_p8): Likewise.
17238 (vst3q_s16): Likewise.
17239 (vst3q_p16): Likewise.
17240 (vst3q_s32): Likewise.
17241 (vst3q_s64): Likewise.
17242 (vst3q_u8): Likewise.
17243 (vst3q_u16): Likewise.
17244 (vst3q_u32): Likewise.
17245 (vst3q_u64): Likewise.
17246 (vst3q_f16): Likewise.
17247 (vst3q_f32): Likewise.
17248 (vst3q_f64): Likewise.
17249 (vst3q_p64): Likewise.
17251 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17253 * config/aarch64/arm_neon.h (vst4_s64): Use __builtin_memcpy
17254 instead of constructing __builtin_aarch64_simd_xi one vector
17256 (vst4_u64): Likewise.
17257 (vst4_f64): Likewise.
17258 (vst4_s8): Likewise.
17259 (vst4_p8): Likewise.
17260 (vst4_s16): Likewise.
17261 (vst4_p16): Likewise.
17262 (vst4_s32): Likewise.
17263 (vst4_u8): Likewise.
17264 (vst4_u16): Likewise.
17265 (vst4_u32): Likewise.
17266 (vst4_f16): Likewise.
17267 (vst4_f32): Likewise.
17268 (vst4_p64): Likewise.
17269 (vst4q_s8): Likewise.
17270 (vst4q_p8): Likewise.
17271 (vst4q_s16): Likewise.
17272 (vst4q_p16): Likewise.
17273 (vst4q_s32): Likewise.
17274 (vst4q_s64): Likewise.
17275 (vst4q_u8): Likewise.
17276 (vst4q_u16): Likewise.
17277 (vst4q_u32): Likewise.
17278 (vst4q_u64): Likewise.
17279 (vst4q_f16): Likewise.
17280 (vst4q_f32): Likewise.
17281 (vst4q_f64): Likewise.
17282 (vst4q_p64): Likewise.
17284 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17286 * config/aarch64/arm_neon.h (vtbx4_s8): Use __builtin_memcpy
17287 instead of constructing __builtin_aarch64_simd_oi one vector
17289 (vtbx4_u8): Likewise.
17290 (vtbx4_p8): Likewise.
17292 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17294 * config/aarch64/arm_neon.h (vtbl3_s8): Use __builtin_memcpy
17295 instead of constructing __builtin_aarch64_simd_oi one vector
17297 (vtbl3_u8): Likewise.
17298 (vtbl3_p8): Likewise.
17299 (vtbl4_s8): Likewise.
17300 (vtbl4_u8): Likewise.
17301 (vtbl4_p8): Likewise.
17303 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17305 * config/aarch64/arm_neon.h (vqtbx2_s8): Use __builtin_memcpy
17306 instead of constructing __builtin_aarch64_simd_oi one vector
17308 (vqtbx2_u8): Likewise.
17309 (vqtbx2_p8): Likewise.
17310 (vqtbx2q_s8): Likewise.
17311 (vqtbx2q_u8): Likewise.
17312 (vqtbx2q_p8): Likewise.
17313 (vqtbx3_s8): Use __builtin_memcpy instead of constructing
17314 __builtin_aarch64_simd_ci one vector at a time.
17315 (vqtbx3_u8): Likewise.
17316 (vqtbx3_p8): Likewise.
17317 (vqtbx3q_s8): Likewise.
17318 (vqtbx3q_u8): Likewise.
17319 (vqtbx3q_p8): Likewise.
17320 (vqtbx4_s8): Use __builtin_memcpy instead of constructing
17321 __builtin_aarch64_simd_xi one vector at a time.
17322 (vqtbx4_u8): Likewise.
17323 (vqtbx4_p8): Likewise.
17324 (vqtbx4q_s8): Likewise.
17325 (vqtbx4q_u8): Likewise.
17326 (vqtbx4q_p8): Likewise.
17328 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17330 * config/aarch64/arm_neon.h (vqtbl2_s8): Use __builtin_memcpy
17331 instead of constructing __builtin_aarch64_simd_oi one vector
17333 (vqtbl2_u8): Likewise.
17334 (vqtbl2_p8): Likewise.
17335 (vqtbl2q_s8): Likewise.
17336 (vqtbl2q_u8): Likewise.
17337 (vqtbl2q_p8): Likewise.
17338 (vqtbl3_s8): Use __builtin_memcpy instead of constructing
17339 __builtin_aarch64_simd_ci one vector at a time.
17340 (vqtbl3_u8): Likewise.
17341 (vqtbl3_p8): Likewise.
17342 (vqtbl3q_s8): Likewise.
17343 (vqtbl3q_u8): Likewise.
17344 (vqtbl3q_p8): Likewise.
17345 (vqtbl4_s8): Use __builtin_memcpy instead of constructing
17346 __builtin_aarch64_simd_xi one vector at a time.
17347 (vqtbl4_u8): Likewise.
17348 (vqtbl4_p8): Likewise.
17349 (vqtbl4q_s8): Likewise.
17350 (vqtbl4q_u8): Likewise.
17351 (vqtbl4q_p8): Likewise.
17353 2021-07-23 Haochen Gui <guihaoc@gcc.gnu.org>
17356 * config/rs6000/rs6000.md (cstore<mode>4): Fix wrong fall through.
17358 2021-07-22 Andrew Pinski <apinski@marvell.com>
17360 PR tree-optimization/10153
17361 * tree-tailcall.c (create_tailcall_accumulator):
17362 Don't call fold_convert as the type should be correct already.
17363 (tree_optimize_tail_calls_1): Use build_{one,zero}_cst instead
17364 of integer_{one,zero}_node for the call of create_tailcall_accumulator.
17366 2021-07-22 Aldy Hernandez <aldyh@redhat.com>
17368 * gimple-range-cache.cc (non_null_ref::adjust_range): Replace
17369 varying_p check for null/non-null check.
17371 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17373 PR tree-optimization/101511
17374 * value-relation.cc (relation_oracle::query_relation): Check if ssa1
17375 is in ssa2's equiv set, and don't trap if so.
17377 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17379 PR tree-optimization/101497
17380 * gimple-range-fold.cc (fold_using_range::range_of_cond_expr): Check
17383 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17385 PR tree-optimization/101496
17386 * vr-values.c (simplify_using_ranges::fold_cond): Call range_of_stmt
17387 first, then vrp_visit_cond_Stmt.
17389 2021-07-22 liuhongt <hongtao.liu@intel.com>
17391 * config/i386/i386-expand.c
17392 (ix86_broadcast_from_integer_constant): Rename to ..
17393 (ix86_broadcast_from_constant): .. this, and extend it to
17395 (ix86_expand_vector_move): Extend to float mode.
17396 * config/i386/i386-features.c
17397 (replace_constant_pool_with_broadcast): Remove.
17398 (remove_partial_avx_dependency_gate): Ditto.
17399 (constant_pool_broadcast): Ditto.
17400 (class pass_constant_pool_broadcast): Ditto.
17401 (make_pass_constant_pool_broadcast): Ditto.
17402 (remove_partial_avx_dependency): Adjust gate.
17403 * config/i386/i386-passes.def: Remove pass_constant_pool_broadcast.
17404 * config/i386/i386-protos.h
17405 (make_pass_constant_pool_broadcast): Remove.
17407 2021-07-22 liuhongt <hongtao.liu@intel.com>
17409 * config/i386/constraints.md (Wb): New constraint.
17411 * config/i386/i386.md (*ashlhi3_1): Extend to avx512 mask
17413 (*ashlqi3_1): Ditto.
17414 (*<insn><mode>3_1): Split to ..
17415 (*ashr<mode>3_1): this, ...
17416 (*lshr<mode>3_1): and this, also extend this pattern to avx512
17418 (*<insn><mode>3_1): Split to ..
17419 (*ashr<mode>3_1): this, ...
17420 (*lshrqi3_1): and this, also extend this pattern to avx512
17422 (*lshrhi3_1): And this, also extend this pattern to avx512
17424 * config/i386/sse.md (k<code><mode>): New define_split after
17425 it to convert generic shift pattern to mask shift ones.
17427 2021-07-21 Thomas Schwinge <thomas@codesourcery.com>
17428 Joseph Myers <joseph@codesourcery.com>
17429 Cesar Philippidis <cesar@codesourcery.com>
17431 * tree-core.h (omp_clause_code): Add 'OMP_CLAUSE_NOHOST'.
17432 * tree.c (omp_clause_num_ops, omp_clause_code_name, walk_tree_1):
17434 * tree-pretty-print.c (dump_omp_clause): Likewise.
17435 * omp-general.c (oacc_verify_routine_clauses): Likewise.
17436 * gimplify.c (gimplify_scan_omp_clauses)
17437 (gimplify_adjust_omp_clauses): Likewise.
17438 * tree-nested.c (convert_nonlocal_omp_clauses)
17439 (convert_local_omp_clauses): Likewise.
17440 * omp-low.c (scan_sharing_clauses): Likewise.
17441 * omp-offload.c (execute_oacc_device_lower): Update.
17443 2021-07-21 Martin Sebor <msebor@redhat.com>
17445 * tree-ssa-alias.c (walk_aliased_vdefs_1): Fix typos in a comment.
17447 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17449 * config/rs6000/rs6000-gen-builtins.c (write_init_bif_table):
17452 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17454 * config/rs6000/rs6000-gen-builtins.c (write_fntype): New
17456 (write_fntype_init): New stub function.
17457 (write_init_bif_table): Likewise.
17458 (write_init_ovld_table): New function.
17459 (write_init_file): Implement.
17461 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17463 * config/rs6000/rs6000-gen-builtins.c
17464 (write_autogenerated_header): New function.
17465 (write_decls): Likewise.
17466 (write_extern_fntype): New callback function.
17467 (write_header_file): Implement.
17469 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17471 * config/rs6000/rs6000-gen-builtins.c (write_defines_file):
17474 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17476 * config/rs6000/rs6000-gen-builtins.c (complete_vector_type): New
17478 (complete_base_type): Likewise.
17479 (construct_fntype_id): Likewise.
17480 (parse_bif_entry): Call contruct_fntype_id.
17481 (parse_ovld_entry): Likewise.
17483 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17485 * config/rs6000/rs6000-gen-builtins.c (ovld_stanza): New struct.
17486 (MAXOVLDSTANZAS): New macro.
17487 (ovld_stanzas): New variable.
17488 (curr_ovld_stanza): Likewise.
17489 (MAXOVLDS): New macro.
17490 (ovlddata): New struct.
17491 (ovlds): New variable.
17492 (curr_ovld): Likewise.
17493 (max_ovld_args): Likewise.
17494 (parse_ovld_entry): New function.
17495 (parse_ovld_stanza): Likewise.
17496 (parse_ovld): Implement.
17498 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17500 * config/rs6000/rs6000-gen-builtins.c (parse_bif_attrs):
17503 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17505 * config/rs6000/rs6000-gen-builtins.c (parse_args): New function.
17506 (parse_prototype): Implement.
17508 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17510 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): New enum.
17511 (curr_bif_stanza): New variable.
17512 (stanza_entry): New struct.
17513 (stanza_map): New initialized variable.
17514 (enable_string): Likewise.
17515 (fnkinds): New enum.
17516 (typelist): New struct.
17517 (attrinfo): Likewise.
17518 (MAXRESTROPNDS): New macro.
17519 (prototype): New struct.
17520 (MAXBIFS): New macro.
17521 (bifdata): New struct.
17522 (bifs): New variable.
17523 (curr_bif): Likewise.
17524 (bif_order): Likewise.
17525 (bif_index): Likewise.
17526 (fatal): New function.
17527 (stanza_name_to_stanza): Likewise.
17528 (parse_bif_attrs): New stub function.
17529 (parse_prototype): Likewise.
17530 (parse_bif_entry): New function.
17531 (parse_bif_stanza): Likewise.
17532 (parse_bif): Implement.
17533 (set_bif_order): New function.
17534 (create_bif_order): Implement.
17536 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17538 * config/rs6000/rs6000-gen-builtins.c (rbtree.h): New #include.
17539 (num_bifs): New variable.
17540 (num_ovld_stanzas): Likewise.
17541 (num_ovlds): Likewise.
17542 (parse_codes): New enum.
17543 (bif_rbt): New variable.
17544 (ovld_rbt): Likewise.
17545 (fntype_rbt): Likewise.
17546 (bifo_rbt): Likewise.
17547 (parse_bif): New stub function.
17548 (create_bif_order): Likewise.
17549 (parse_ovld): Likewise.
17550 (write_header_file): Likewise.
17551 (write_init_file): Likewise.
17552 (write_defines_file): Likewise.
17553 (delete_output_files): New function.
17556 2021-07-21 H.J. Lu <hjl.tools@gmail.com>
17559 * config/i386/i386-builtin.def: Remove OPTION_MASK_ISA_SSE4_2
17560 from CRC32 _builtin functions.
17562 2021-07-21 Sebastian Huber <sebastian.huber@embedded-brains.de>
17564 * coverage.c (build_gcov_info_var_registration): Mark the object placed
17565 in the linker set as referenced so that it does not get optimized away.
17567 2021-07-21 Kito Cheng <kito.cheng@sifive.com>
17570 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
17572 * config.gcc (riscv*-*-*): Detect which python is available.
17574 2021-07-21 Jakub Jelinek <jakub@redhat.com>
17576 PR middle-end/101535
17577 * gimplify.c (omp_check_private): Properly skip ORT_TARGET_DATA
17578 contexts in which decl isn't privatized and for ORT_TARGET return
17579 false if decl is mapped.
17581 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
17583 * gimple-loop-jam.c: Include tree-ssa-sccvn.h.
17584 (tree_loop_unroll_and_jam): Run value-numbering on a loop that
17585 has been successfully unrolled.
17587 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
17589 * tree-ssa-loop-manip.c (determine_exit_conditions): Return a null
17590 exit condition if no tail loop is needed, and if the original exit
17591 condition should therefore be kept as-is.
17592 (tree_transform_and_unroll_loop): Handle that case here too.
17594 2021-07-21 Kewen Lin <linkw@linux.ibm.com>
17596 * tree-data-ref.c (free_dependence_relations): Adjust to pass vec
17598 (free_data_refs): Likewise.
17599 * tree-data-ref.h (free_dependence_relations): Likewise.
17600 (free_data_refs): Likewise.
17601 * tree-predcom.c (struct chain): Use auto_vec instead of vec for
17603 (struct component): Likewise.
17604 (pcom_worker::pcom_worker): Adjust for auto_vec and renaming changes.
17605 (pcom_worker::~pcom_worker): Likewise.
17606 (pcom_worker::release_chain): Adjust as auto_vec changes.
17607 (pcom_worker::loop): Rename to ...
17608 (pcom_worker::m_loop): ... this.
17609 (pcom_worker::datarefs): Rename to ...
17610 (pcom_worker::m_datarefs): ... this. Use auto_vec instead of vec.
17611 (pcom_worker::dependences): Rename to ...
17612 (pcom_worker::m_dependences): ... this. Use auto_vec instead of vec.
17613 (pcom_worker::chains): Rename to ...
17614 (pcom_worker::m_chains): ... this. Use auto_vec instead of vec.
17615 (pcom_worker::looparound_phis): Rename to ...
17616 (pcom_worker::m_looparound_phis): ... this. Use auto_vec instead of
17618 (pcom_worker::cache): Rename to ...
17619 (pcom_worker::m_cache): ... this. Use auto_vec instead of vec.
17620 (pcom_worker::release_chain): Adjust for auto_vec changes.
17621 (pcom_worker::release_chains): Adjust for auto_vec and renaming
17623 (release_component): Remove.
17624 (release_components): Adjust for release_component removal.
17625 (component_of): Adjust to use vec.
17626 (merge_comps): Likewise.
17627 (pcom_worker::aff_combination_dr_offset): Adjust for renaming changes.
17628 (pcom_worker::determine_offset): Likewise.
17629 (class comp_ptrs): Remove.
17630 (pcom_worker::split_data_refs_to_components): Adjust for renaming
17631 changes, for comp_ptrs removal with auto_vec.
17632 (pcom_worker::suitable_component_p): Adjust for renaming changes.
17633 (pcom_worker::filter_suitable_components): Adjust for release_component
17635 (pcom_worker::valid_initializer_p): Adjust for renaming changes.
17636 (pcom_worker::find_looparound_phi): Likewise.
17637 (pcom_worker::add_looparound_copies): Likewise.
17638 (pcom_worker::determine_roots_comp): Likewise.
17639 (pcom_worker::single_nonlooparound_use): Likewise.
17640 (pcom_worker::execute_pred_commoning_chain): Likewise.
17641 (pcom_worker::execute_pred_commoning): Likewise.
17642 (pcom_worker::try_combine_chains): Likewise.
17643 (pcom_worker::prepare_initializers_chain): Likewise.
17644 (pcom_worker::prepare_initializers): Likewise.
17645 (pcom_worker::prepare_finalizers_chain): Likewise.
17646 (pcom_worker::prepare_finalizers): Likewise.
17647 (pcom_worker::tree_predictive_commoning_loop): Likewise.
17649 2021-07-20 Martin Sebor <msebor@redhat.com>
17651 PR middle-end/101397
17652 * builtins.c (gimple_call_return_array): Add argument. Correct
17653 offsets for memchr, mempcpy, stpcpy, and stpncpy.
17654 (compute_objsize_r): Adjust offset computation for argument returning
17657 2021-07-20 Martin Sebor <msebor@redhat.com>
17659 PR middle-end/101300
17660 * tree-ssa-uninit.c (check_defs): Handle UBSAN built-ins.
17662 2021-07-20 Jeff Law <jlaw@localhost.localdomain>
17664 * function.c (assign_parm_setup_block): Use adjust_address instead
17665 of change_address to preserve MEM_EXPR and friends.
17667 2021-07-20 Martin Sebor <msebor@redhat.com>
17669 * cfgloop.h (single_likely_exit): Adjust by-value argument to
17670 by-const-reference.
17671 * cfgloopanal.c (single_likely_exit): Same.
17672 * cgraph.h (struct cgraph_node): Same.
17673 * cgraphclones.c (cgraph_node::create_virtual_clone): Same.
17674 * genautomata.c (merge_states): Same.
17675 * genextract.c (VEC_char_to_string): Same.
17676 * genmatch.c (dt_node::gen_kids_1): Same.
17677 (walk_captures): Adjust by-value argument to by-reference.
17678 * gimple-ssa-store-merging.c (check_no_overlap): Adjust by-value argument
17679 to by-const-reference.
17680 * gimple.c (gimple_build_call_vec): Same.
17681 (gimple_build_call_internal_vec): Same.
17682 (gimple_build_switch): Same.
17683 (sort_case_labels): Same.
17684 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
17686 * gimple.h (gimple_build_call_vec): Adjust by-value argument to
17687 by-const-reference.
17688 (gimple_build_call_internal_vec): Same.
17689 (gimple_build_switch): Same.
17690 (sort_case_labels): Same.
17691 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
17693 * haifa-sched.c (calc_priorities): Adjust by-value argument to
17694 by-const-reference.
17695 (sched_init_luids): Same.
17696 (haifa_init_h_i_d): Same.
17697 * ipa-cp.c (ipa_get_indirect_edge_target_1): Same.
17698 (adjust_callers_for_value_intersection): Adjust by-value argument to
17700 (find_more_scalar_values_for_callers_subset): Adjust by-value argument to
17701 by-const-reference.
17702 (find_more_contexts_for_caller_subset): Same.
17703 (find_aggregate_values_for_callers_subset): Same.
17704 (copy_useful_known_contexts): Same.
17705 * ipa-fnsummary.c (remap_edge_summaries): Same.
17706 (remap_freqcounting_predicate): Same.
17707 * ipa-inline.c (add_new_edges_to_heap): Adjust by-value argument to
17709 * ipa-predicate.c (predicate::remap_after_inlining): Adjust by-value argument
17710 to by-const-reference.
17711 * ipa-predicate.h (predicate::remap_after_inlining): Same.
17712 * ipa-prop.c (ipa_find_agg_cst_for_param): Same.
17713 * ipa-prop.h (ipa_find_agg_cst_for_param): Same.
17714 * ira-build.c (ira_loop_tree_body_rev_postorder): Same.
17715 * read-rtl.c (add_overload_instance): Same.
17716 * rtl.h (native_decode_rtx): Same.
17717 (native_decode_vector_rtx): Same.
17718 * sched-int.h (sched_init_luids): Same.
17719 (haifa_init_h_i_d): Same.
17720 * simplify-rtx.c (native_decode_vector_rtx): Same.
17721 (native_decode_rtx): Same.
17722 * tree-call-cdce.c (gen_shrink_wrap_conditions): Same.
17723 (shrink_wrap_one_built_in_call_with_conds): Same.
17724 (shrink_wrap_conditional_dead_built_in_calls): Same.
17725 * tree-data-ref.c (create_runtime_alias_checks): Same.
17726 (compute_all_dependences): Same.
17727 * tree-data-ref.h (compute_all_dependences): Same.
17728 (create_runtime_alias_checks): Same.
17729 (index_in_loop_nest): Same.
17730 * tree-if-conv.c (mask_exists): Same.
17731 * tree-loop-distribution.c (class loop_distribution): Same.
17732 (loop_distribution::create_rdg_vertices): Same.
17733 (dump_rdg_partitions): Same.
17734 (debug_rdg_partitions): Same.
17735 (partition_contains_all_rw): Same.
17736 (loop_distribution::distribute_loop): Same.
17737 * tree-parloops.c (oacc_entry_exit_ok_1): Same.
17738 (oacc_entry_exit_single_gang): Same.
17739 * tree-ssa-loop-im.c (hoist_memory_references): Same.
17740 (loop_suitable_for_sm): Same.
17741 * tree-ssa-loop-niter.c (bound_index): Same.
17742 * tree-ssa-reassoc.c (update_ops): Same.
17743 (swap_ops_for_binary_stmt): Same.
17744 (rewrite_expr_tree): Same.
17745 (rewrite_expr_tree_parallel): Same.
17746 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Same.
17747 * tree-ssa-sccvn.h (ao_ref_init_from_vn_reference): Same.
17748 * tree-ssa-structalias.c (process_all_all_constraints): Same.
17749 (make_constraints_to): Same.
17750 (handle_lhs_call): Same.
17751 (find_func_aliases_for_builtin_call): Same.
17752 (sort_fieldstack): Same.
17753 (check_for_overlaps): Same.
17754 * tree-vect-loop-manip.c (vect_create_cond_for_align_checks): Same.
17755 (vect_create_cond_for_unequal_addrs): Same.
17756 (vect_create_cond_for_lower_bounds): Same.
17757 (vect_create_cond_for_alias_checks): Same.
17758 * tree-vect-slp-patterns.c (vect_validate_multiplication): Same.
17759 * tree-vect-slp.c (vect_analyze_slp_instance): Same.
17760 (vect_make_slp_decision): Same.
17761 (vect_slp_bbs): Same.
17762 (duplicate_and_interleave): Same.
17763 (vect_transform_slp_perm_load): Same.
17764 (vect_schedule_slp): Same.
17765 * tree-vectorizer.h (vect_transform_slp_perm_load): Same.
17766 (vect_schedule_slp): Same.
17767 (duplicate_and_interleave): Same.
17768 * tree.c (build_vector_from_ctor): Same.
17769 (build_vector): Same.
17770 (check_vector_cst): Same.
17771 (check_vector_cst_duplicate): Same.
17772 (check_vector_cst_fill): Same.
17773 (check_vector_cst_stepped): Same.
17774 * tree.h (build_vector_from_ctor): Same.
17776 2021-07-20 Jakub Jelinek <jakub@redhat.com>
17779 * config/rs6000/rs6000-protos.h (easy_altivec_constant): Change return
17780 type from bool to int.
17781 * config/rs6000/rs6000.c (vspltis_constant): Fix up handling the
17782 EASY_VECTOR_MSB case if either step or copies is not 1.
17783 (vspltis_shifted): Fix comment typo.
17784 (easy_altivec_constant): Change return type from bool to int, instead
17785 of returning true return byte size of the element mode that should be
17786 used to synthetize the constant.
17787 * config/rs6000/predicates.md (easy_vector_constant_msb): Require
17788 that vspltis_shifted is 0, handle the case where easy_altivec_constant
17789 assumes using different vector mode from CONST_VECTOR's mode.
17790 * config/rs6000/altivec.md (easy_vector_constant_msb splitter): Use
17791 easy_altivec_constant to determine mode in which -1 >> -1 should be
17792 performed, use rs6000_expand_vector_init instead of gen_vec_initv4sisi.
17794 2021-07-20 Richard Biener <rguenther@suse.de>
17797 * dwarf2out.h (dwarf_file_data): Add key member.
17798 * dwarf2out.c (dwarf_file_hasher::equal): Compare key.
17799 (dwarf_file_hasher::hash): Hash key.
17800 (lookup_filename): Remap the filename and store it in the
17801 filename member of dwarf_file_data when creating a new
17803 (file_name_acquire): Do not remap the filename again.
17804 (maybe_emit_file): Likewise.
17806 2021-07-20 Jonathan Wright <jonathan.wright@arm.com>
17808 * config/aarch64/aarch64-simd-builtins.def: Use two variant
17809 generators for all TBL/TBX intrinsics and rename to
17810 consistent forms: qtbl[1234] or qtbx[1234].
17811 * config/aarch64/aarch64-simd.md (aarch64_tbl1<mode>):
17813 (aarch64_qtbl1<mode>): This.
17814 (aarch64_tbx1<mode>): Rename to...
17815 (aarch64_qtbx1<mode>): This.
17816 (aarch64_tbl2v16qi): Delete.
17817 (aarch64_tbl3<mode>): Rename to...
17818 (aarch64_qtbl2<mode>): This.
17819 (aarch64_tbx4<mode>): Rename to...
17820 (aarch64_qtbx2<mode>): This.
17821 * config/aarch64/aarch64.c (aarch64_expand_vec_perm_1): Use
17822 renamed qtbl1 and qtbl2 RTL patterns.
17823 * config/aarch64/arm_neon.h (vqtbl1_p8): Use renamed qtbl1
17825 (vqtbl1_s8): Likewise.
17826 (vqtbl1_u8): Likewise.
17827 (vqtbl1q_p8): Likewise.
17828 (vqtbl1q_s8): Likewise.
17829 (vqtbl1q_u8): Likewise.
17830 (vqtbx1_s8): Use renamed qtbx1 RTL pattern.
17831 (vqtbx1_u8): Likewise.
17832 (vqtbx1_p8): Likewise.
17833 (vqtbx1q_s8): Likewise.
17834 (vqtbx1q_u8): Likewise.
17835 (vqtbx1q_p8): Likewise.
17836 (vtbl1_s8): Use renamed qtbl1 RTL pattern.
17837 (vtbl1_u8): Likewise.
17838 (vtbl1_p8): Likewise.
17839 (vtbl2_s8): Likewise
17840 (vtbl2_u8): Likewise.
17841 (vtbl2_p8): Likewise.
17842 (vtbl3_s8): Use renamed qtbl2 RTL pattern.
17843 (vtbl3_u8): Likewise.
17844 (vtbl3_p8): Likewise.
17845 (vtbl4_s8): Likewise.
17846 (vtbl4_u8): Likewise.
17847 (vtbl4_p8): Likewise.
17848 (vtbx2_s8): Use renamed qtbx2 RTL pattern.
17849 (vtbx2_u8): Likewise.
17850 (vtbx2_p8): Likewise.
17851 (vqtbl2_s8): Use renamed qtbl2 RTL pattern.
17852 (vqtbl2_u8): Likewise.
17853 (vqtbl2_p8): Likewise.
17854 (vqtbl2q_s8): Likewise.
17855 (vqtbl2q_u8): Likewise.
17856 (vqtbl2q_p8): Likewise.
17857 (vqtbx2_s8): Use renamed qtbx2 RTL pattern.
17858 (vqtbx2_u8): Likewise.
17859 (vqtbx2_p8): Likewise.
17860 (vqtbx2q_s8): Likewise.
17861 (vqtbx2q_u8): Likewise.
17862 (vqtbx2q_p8): Likewise.
17863 (vtbx4_s8): Likewise.
17864 (vtbx4_u8): Likewise.
17865 (vtbx4_p8): Likewise.
17867 2021-07-20 Uroš Bizjak <ubizjak@gmail.com>
17870 * config/i386/sync.md (define_peephole2 atomic_storedi_fpu):
17872 (define_peephole2 atomic_loaddi_fpu): Ditto.
17874 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
17876 * config.gcc (riscv*-*-*): Detect which python is available.
17878 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
17880 * config/rs6000/vsx.md (mulhs_<mode>): Rename to...
17881 (smul<mode>3_highpart): ... this.
17882 (mulhu_<mode>): Rename to...
17883 (umul<mode>3_highpart): ... this.
17884 * config/rs6000/rs6000-builtin.def (MULHS_V2DI, MULHS_V4SI,
17885 MULHU_V2DI, MULHU_V4SI): Adjust.
17887 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
17889 PR tree-optimization/100696
17890 * internal-fn.c (first_commutative_argument): Add info for IFN_MULH.
17891 * internal-fn.def (IFN_MULH): New internal function.
17892 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Add support to
17893 recog normal multiply highpart as IFN_MULH.
17894 * config/i386/i386.c (ix86_add_stmt_cost): Adjust for combined
17897 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
17899 * config/elfos.h (CTF_DEBUGGING_INFO): New definition.
17900 (BTF_DEBUGGING_INFO): Likewise.
17901 * doc/tm.texi.in: Document the new macros.
17902 * doc/tm.texi: Regenerated.
17903 * toplev.c: Guard initialization of debug hooks.
17905 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
17907 * flags.h (ctf_debuginfo_p): New function declaration.
17908 * opts.c (ctf_debuginfo_p): New function definition.
17910 2021-07-19 Andrew Stubbs <ams@codesourcery.com>
17913 * config/gcn/gcn-hsa.h (DRIVER_SELF_SPECS): New.
17914 (ASM_SPEC): Set -mattr for xnack and sram-ecc.
17915 * config/gcn/gcn-opts.h (enum sram_ecc_type): New.
17916 * config/gcn/gcn-valu.md: Add a warning comment.
17917 * config/gcn/gcn.c (gcn_option_override): Add "sorry" for -mxnack.
17918 (output_file_start): Add xnack and sram-ecc state to ".amdgcn_target".
17919 * config/gcn/gcn.md: Add a warning comment.
17920 * config/gcn/gcn.opt: Add -mxnack and -msram-ecc.
17921 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): Remove
17923 (EF_AMDGPU_XNACK): New.
17924 (EF_AMDGPU_SRAM_ECC): New.
17926 (copy_early_debug_info): Use elf_flags.
17927 (main): Handle -mxnack and -msram-ecc options.
17928 * doc/invoke.texi: Document -mxnack and -msram-ecc.
17930 2021-07-19 Andrew Pinski <apinski@marvell.com>
17933 * config/aarch64/aarch64.md (csneg3_uxtw_insn): Rename to ...
17934 (*cs<neg_not_cs>3_uxtw_insn4): and extend to NEG_NOT.
17936 2021-07-19 Richard Biener <rguenther@suse.de>
17938 PR tree-optimization/101505
17939 * tree-vect-patterns.c (vect_determine_precisions): Walk
17940 PHIs also for loop vectorization.
17942 2021-07-19 Richard Biener <rguenther@suse.de>
17944 * gimple.h (gimple_expr_type): Remove.
17945 * doc/gimple.texi: Remove gimple_expr_type documentation.
17947 2021-07-19 Richard Biener <rguenther@suse.de>
17949 * tree-ssa-sccvn.c (vn_reference_eq): Handle NULL vr->type.
17950 (ao_ref_init_from_vn_reference): Likewise.
17951 (fully_constant_reference): Likewise.
17952 (vn_reference_lookup_call): Do not set vr->type to random
17954 * tree-ssa-pre.c (compute_avail): Do not try to PRE calls
17956 * tree-vect-generic.c (expand_vector_piecewise): Pass in
17957 whether we expanded parallel.
17958 (expand_vector_parallel): Adjust.
17959 (expand_vector_addition): Likewise.
17960 (expand_vector_comparison): Likewise.
17961 (expand_vector_operation): Likewise.
17962 (expand_vector_scalar_condition): Likewise.
17963 (expand_vector_conversion): Likewise.
17965 2021-07-19 Richard Biener <rguenther@suse.de>
17967 * tree-vrp.c (register_edge_assert_for_2): Use the
17969 (vrp_folder::fold_predicate_in): Likewise.
17970 * vr-values.c (gimple_assign_nonzero_p): Likewise.
17971 (vr_values::extract_range_from_comparison): Likewise.
17972 (vr_values::extract_range_from_ubsan_builtin): Use the
17973 type of the first operand.
17974 (vr_values::extract_range_basic): Push down type
17975 computation, use the appropriate LHS.
17976 (vr_values::extract_range_from_assignment): Use the
17979 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
17982 * common/config/i386/i386-common.c (ix86_handle_option): For
17983 -mgeneral-regs-only, enable the GPR only instructions which are
17984 enabled implicitly by SSE ISAs unless they have been disabled
17987 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
17990 * config/i386/i386.c (ix86_check_avx_upper_stores): Moved before
17991 ix86_avx_u128_mode_needed.
17992 (ix86_avx_u128_mode_needed): Return AVX_U128_DIRTY if callee
17993 returns AVX register.
17995 2021-07-17 Jan Hubicka <hubicka@ucw.cz>
17997 * tree-ssa-structalias.c (handle_rhs_call): Support EAF_NOT_RETURNED.
17998 (handle_const_call): Liekise
17999 (handle_pure_call): Liekise
18001 2021-07-17 Andrew MacLeod <amacleod@redhat.com>
18003 PR tree-optimization/96542
18004 * range-op.cc (range_operator::wi_fold_in_parts): New.
18005 (range_operator::fold_range): Call wi_fold_in_parts.
18006 (operator_lshift::wi_fold): Fix broken lshift by [0,0].
18007 * range-op.h (wi_fold_in_parts): Add prototype.
18009 2021-07-16 David Malcolm <dmalcolm@redhat.com>
18011 * doc/analyzer.texi: Add __analyzer_dump_state.
18013 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18015 * config/rs6000/rbtree.c: New file.
18016 * config/rs6000/rbtree.h: New file.
18018 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18020 * config/rs6000/rs6000-gen-builtins.c (restriction): New enum.
18021 (typeinfo): Add restr field.
18022 (match_bracketed_pair): New function.
18023 (match_const_restriction): Implement.
18025 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18027 * config/rs6000/rs6000-gen-builtins.c (match_basetype): Implement.
18029 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18031 * config/rs6000/rs6000-gen-builtins.c (void_status): New enum.
18032 (basetype): Likewise.
18033 (typeinfo): Likewise.
18034 (handle_pointer): New function.
18035 (match_basetype): New stub function.
18036 (match_const_restriction): Likewise.
18037 (match_type): New function.
18039 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18041 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace): New
18043 (advance_line): Likewise.
18044 (safe_inc_pos): Likewise.
18045 (match_identifier): Likewise.
18046 (match_integer): Likewise.
18047 (match_to_right_bracket): Likewise.
18049 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18051 * config/rs6000/rs6000-gen-builtins.c (bif_file): New variable.
18052 (ovld_file): Likewise.
18053 (header_file): Likewise.
18054 (init_file): Likewise.
18055 (defines_file): Likewise.
18056 (pgm_path): Likewise.
18057 (bif_path): Likewise.
18058 (ovld_path): Likewise.
18059 (header_path): Likewise.
18060 (init_path): Likewise.
18061 (defines_path): Likewise.
18062 (LINELEN): New macro.
18063 (linebuf): New variable.
18067 (bif_diag): New function.
18068 (ovld_diag): Likewise.
18070 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18072 * config/rs6000/rs6000-builtin-new.def: New.
18073 * config/rs6000/rs6000-overload.def: New.
18075 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18077 * config/rs6000/rs6000-gen-builtins.c: New.
18079 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
18081 * Makefile.in (EXTRA_GTYPE_DEPS): New variable.
18082 (s-gtype): Depend on EXTRA_GTYPE_DEPS.
18083 * gengtype-state.c (state_writer::write_state_file_list): Add a
18084 parameter to the fileslist expression for the number of build
18086 (read_state_files_list): Detect build headers and strip the
18087 initial "./" or ".\" from their names.
18088 * gengtype.c (build_headers): New global variable.
18089 (num_build_headers): Likewise.
18090 (open_base_files): Emit #include for each build header.
18091 (main): Detect and count build headers.
18092 * gengtype.h (build_headers): New extern variable.
18093 (num_build_headers): Likewise.
18095 2021-07-16 Richard Biener <rguenther@suse.de>
18097 * gimple-ssa-store-merging.c (verify_symbolic_number_p): Use
18098 the type of the LHS.
18099 (find_bswap_or_nop_1): Likewise.
18100 (find_bswap_or_nop): Likewise.
18101 * tree-vectorizer.h (vect_get_smallest_scalar_type): Adjust
18103 * tree-vect-data-refs.c (vect_get_smallest_scalar_type):
18104 Remove unused parameters, pass in the scalar type. Fix
18105 internal store function handling.
18106 * tree-vect-stmts.c (vect_analyze_stmt): Remove assert.
18107 (vect_get_vector_types_for_stmt): Move down check for
18108 existing vector stmt after we've determined a scalar type.
18109 Pass down the used scalar type to vect_get_smallest_scalar_type.
18110 * tree-vect-generic.c (expand_vector_condition): Use
18111 the type of the LHS.
18112 (expand_vector_scalar_condition): Likewise.
18113 (expand_vector_operations_1): Likewise.
18114 * tree-vect-patterns.c (vect_widened_op_tree): Likewise.
18115 (vect_recog_dot_prod_pattern): Likewise.
18116 (vect_recog_sad_pattern): Likewise.
18117 (vect_recog_widen_op_pattern): Likewise.
18118 (vect_recog_widen_sum_pattern): Likewise.
18119 (vect_recog_mixed_size_cond_pattern): Likewise.
18121 2021-07-16 Jan Hubicka <hubicka@ucw.cz>
18123 * ipa-modref.c (struct escape_entry): Use eaf_fleags_t.
18124 (dump_eaf_flags): Dump EAF_NOT_RETURNED
18125 (eaf_flags_useful_p): Use eaf_fleags_t; handle const functions
18126 and EAF_NOT_RETURNED.
18127 (modref_summary::useful_p): Likewise.
18128 (modref_summary_lto::useful_p): Likewise.
18129 (struct) modref_summary_lto: Use eaf_fleags_t.
18130 (deref_flags): Handle EAF_NOT_RETURNED.
18131 (struct escape_point): Use min_flags.
18132 (modref_lattice::init): Add EAF_NOT_RETURNED.
18133 (merge_call_lhs_flags): Ignore EAF_NOT_RETURNED functions
18134 (analyze_ssa_name_flags): Clear EAF_NOT_RETURNED on return;
18136 (analyze_parms): Also analyze const functions; update conition on
18138 (modref_write): Update streaming.
18139 (read_section): Update streaming.
18140 (remap_arg_flags): Use eaf_flags_t.
18141 (modref_merge_call_site_flags): Hanlde EAF_NOT_RETURNED.
18142 * ipa-modref.h: (eaf_flags_t): New typedef.
18143 (struct modref_summary): Use eaf_flags_t.
18144 * tree-core.h (EAF_NOT_RETURNED): New constant.
18146 2021-07-16 Richard Biener <rguenther@suse.de>
18148 * gimple-fold.c (gimple_fold_stmt_to_constant_1): Use
18149 the type of the LHS.
18150 (gimple_assign_nonnegative_warnv_p): Likewise.
18151 (gimple_call_nonnegative_warnv_p): Likewise. Return false
18152 if the call has no LHS.
18153 * gimple.c (gimple_could_trap_p_1): Use the type of the LHS.
18154 * tree-eh.c (stmt_could_throw_1_p): Likewise.
18155 * tree-inline.c (insert_init_stmt): Likewise.
18156 * tree-ssa-loop-niter.c (get_val_for): Likewise.
18157 * tree-outof-ssa.c (ssa_is_replaceable_p): Use the type of
18159 * tree-ssa-sccvn.c (init_vn_nary_op_from_stmt): Take a
18160 gassign *. Use the type of the lhs.
18161 (vn_nary_op_lookup_stmt): Adjust.
18162 (vn_nary_op_insert_stmt): Likewise.
18164 2021-07-16 Ilya Leoshkevich <iii@linux.ibm.com>
18166 * config/s390/predicates.md (bras_sym_operand): Accept all
18167 functions in 64-bit mode, use UNSPEC_PLT31.
18168 (larl_operand): Use UNSPEC_PLT31.
18169 * config/s390/s390.c (s390_loadrelative_operand_p): Likewise.
18170 (legitimize_pic_address): Likewise.
18171 (s390_emit_tls_call_insn): Mark __tls_get_offset as function,
18173 (s390_delegitimize_address): Use UNSPEC_PLT31.
18174 (s390_output_addr_const_extra): Likewise.
18175 (print_operand): Add @PLT to TLS calls, handle %K.
18176 (s390_function_profiler): Mark __fentry__/_mcount as function,
18177 use %K, use UNSPEC_PLT31.
18178 (s390_output_mi_thunk): Use only UNSPEC_GOT, use %K.
18179 (s390_emit_call): Use UNSPEC_PLT31.
18180 (s390_emit_tpf_eh_return): Mark __tpf_eh_return as function.
18181 * config/s390/s390.md (UNSPEC_PLT31): Rename from UNSPEC_PLT.
18182 (*movdi_64): Use %K.
18183 (reload_base_64): Likewise.
18184 (*sibcall_brc): Likewise.
18185 (*sibcall_brcl): Likewise.
18186 (*sibcall_value_brc): Likewise.
18187 (*sibcall_value_brcl): Likewise.
18189 (*brasl): Likewise.
18190 (*bras_r): Likewise.
18191 (*brasl_r): Likewise.
18192 (*bras_tls): Likewise.
18193 (*brasl_tls): Likewise.
18194 (main_base_64): Likewise.
18195 (reload_base_64): Likewise.
18196 (@split_stack_call<mode>): Likewise.
18198 2021-07-16 Richard Biener <rguenther@suse.de>
18200 PR tree-optimization/101467
18201 * tree-vect-stmts.c (vect_gen_while): Properly guard
18202 make_temp_ssa_name usage.
18204 2021-07-16 Cooper Qu <cooper.qu@linux.alibaba.com>
18206 * config.gcc: Don't use forked print-sysroot-suffix.sh and
18207 t-sysroot-suffix for C-SKY.
18208 * config/csky/print-sysroot-suffix.sh: Delete.
18209 * config/csky/t-csky-linux: Delete.
18210 * config/csky/t-sysroot-suffix: Define MULTILIB_DIRNAMES
18211 instead of CSKY_MULTILIB_DIRNAMES.
18213 2021-07-16 Richard Biener <rguenther@suse.de>
18215 * tree-vect-loop.c (vect_transform_cycle_phi): Correct sign
18216 conversion issues with the partial reduction of the reused
18217 vector accumulator.
18219 2021-07-16 Richard Biener <rguenther@suse.de>
18221 * config/i386/i386-options.c (ix86_option_override_internal): Set
18222 param_vect_partial_vector_usage to zero if not set.
18224 2021-07-15 Uroš Bizjak <ubizjak@gmail.com>
18227 * config/i386/i386.h (VALID_SSE_REG_MODE): Add TDmode.
18228 (VALID_INT_MODE_P): Add SDmode and DDmode.
18229 Add TDmode for TARGET_64BIT.
18230 (VALID_DFP_MODE_P): Remove.
18231 * config/i386/i386.c (ix86_hard_regno_mode_ok):
18232 Do not use VALID_DFP_MODE_P.
18234 2021-07-15 Andrew MacLeod <amacleod@redhat.com>
18236 * gimple-range-fold.cc (adjust_pointer_diff_expr): Use
18238 (fold_using_range::fold_stmt): Ditto.
18239 (fold_using_range::range_of_range_op): Ditto.
18240 (fold_using_range::range_of_phi): Ditto.
18241 (fold_using_range::range_of_call): Ditto.
18242 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
18243 (fold_using_range::range_of_builtin_call): Ditto.
18244 (fold_using_range::range_of_cond_expr): Ditto.
18245 * gimple-range-fold.h (gimple_range_type): New.
18247 2021-07-15 Martin Sebor <msebor@redhat.com>
18249 PR middle-end/97027
18250 * tree-ssa-strlen.c (handle_assign): New function.
18251 (maybe_warn_overflow): Add argument.
18252 (nonzero_bytes_for_type): New function.
18253 (count_nonzero_bytes): Handle more tree types. Call
18254 nonzero_bytes_for_tye.
18255 (count_nonzero_bytes): Handle types.
18256 (handle_store): Handle stores from function calls.
18257 (strlen_check_and_optimize_call): Move code to handle_assign. Call
18258 it for assignments from function calls.
18260 2021-07-15 David Malcolm <dmalcolm@redhat.com>
18265 * doc/invoke.texi: Add -Wanalyzer-use-of-uninitialized-value.
18267 2021-07-15 David Malcolm <dmalcolm@redhat.com>
18269 * doc/invoke.texi (-fdump-analyzer-exploded-paths): New.
18271 2021-07-15 Martin Sebor <msebor@redhat.com>
18275 * fold-const.c (operand_compare::operand_equal_p): Handle OEP_DECL_NAME.
18276 (operand_compare::verify_hash_value): Same.
18277 * tree-core.h (OEP_DECL_NAME): New.
18279 2021-07-15 Martin Jambor <mjambor@suse.cz>
18281 * profile-count.h (profile_count::value): Change the return type to
18283 * gimple-pretty-print.c (dump_gimple_bb_header): Adjust print
18285 * tree-cfg.c (dump_function_to_file): Likewise.
18287 2021-07-15 Bill Schmidt <wschmidt@linux.ibm.com>
18290 * config/rs6000/rs6000-p8swap.c (has_part_mult): New.
18291 (rs6000_analyze_swaps): Insns containing a subreg of a mult are
18294 2021-07-15 Richard Biener <rguenther@suse.de>
18296 * tree-vectorizer.h (vect_gen_while): Match up with
18297 vect_gen_while_not.
18298 * tree-vect-stmts.c (vect_gen_while): Adjust API to that
18299 of vect_gen_while_not.
18300 (vect_gen_while_not): Adjust.
18301 * tree-vect-loop-manip.c (vect_set_loop_controls_directly): Likewise.
18303 2021-07-15 Aldy Hernandez <aldyh@redhat.com>
18305 * gimple-range-cache.cc (non_null_ref::adjust_range): New.
18306 (ranger_cache::range_of_def): Call adjust_range.
18307 (ranger_cache::entry_range): Same.
18308 * gimple-range-cache.h (non_null_ref::adjust_range): New.
18309 * gimple-range.cc (gimple_ranger::range_of_expr): Call
18311 (gimple_ranger::range_on_entry): Same.
18313 2021-07-15 Tamar Christina <tamar.christina@arm.com>
18316 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18318 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
18320 2021-07-15 Tamar Christina <tamar.christina@arm.com>
18323 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18325 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
18326 (sdot_prod, udot_prod): ...These.
18327 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
18328 (aarch64_<sur>dot<vsi2qi>): Rename to...
18329 (<sur>dot_prod<vsi2qi>): ...This.
18330 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
18333 2021-07-15 Jakub Jelinek <jakub@redhat.com>
18335 PR middle-end/101437
18336 * gimplify.c (gimplify_expr): Throw away volatile reads from empty
18337 types even if they have non-BLKmode TYPE_MODE.
18339 2021-07-15 Richard Biener <rguenther@suse.de>
18342 * gcc.c (process_command): Process -gtoggle like process_options
18343 would after parsing options.
18345 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18347 * cfgexpand.c (expand_asm_loc): Adjust.
18348 (expand_asm_stmt): Likewise.
18349 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Likewise.
18350 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
18351 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
18352 * config/avr/avr.c (avr_md_asm_adjust): Likewise.
18353 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
18354 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
18355 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
18356 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
18357 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
18358 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
18359 * config/s390/s390.c (s390_md_asm_adjust): Likewise.
18360 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
18361 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
18362 * doc/tm.texi: Regenerate.
18363 * target.def: Add location argument to md_asm_adjust.
18365 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18367 * tree-diagnostic.c (diagnostic_report_current_function): Use the
18368 diagnostic's location, not input_location.
18370 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18372 * cfgexpand.c (tree_conflicts_with_clobbers_p): Pass location to
18374 (expand_asm_stmt): Likewise.
18376 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
18378 * config/rs6000/rs6000.c (adjacent_mem_locations): Return the lower
18379 addressed memory rtx, if any.
18380 (rs6000_split_multireg_move): Fix code formatting.
18381 Handle MMA build built-ins with operands in adjacent memory locations.
18383 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
18385 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Move to later
18388 2021-07-14 Jason Merrill <jason@redhat.com>
18390 * sel-sched-ir.h (get_all_loop_exits): Use auto_vec.
18392 2021-07-14 Jason Merrill <jason@redhat.com>
18394 * doc/invoke.texi: -fdelete-dead-exceptions is on by default for
18397 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18399 * tree-vect-patterns.c (vect_recog_dot_prod_pattern):
18400 Remove erroneous line.
18402 2021-07-14 Andrew MacLeod <amacleod@redhat.com>
18404 * params.opt (param_evrp_mode): Change default.
18406 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18408 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
18409 (sdot_prod, udot_prod): ...These.
18410 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
18411 (aarch64_<sur>dot<vsi2qi>): Rename to...
18412 (<sur>dot_prod<vsi2qi>): ...This.
18413 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
18416 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18418 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
18420 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18422 * doc/sourcebuild.texi (arm_v8_2a_i8mm_neon_hw): Document.
18424 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18426 * config/arm/neon.md (usdot_prod<vsi2qi>): New.
18428 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18430 * config/aarch64/aarch64-simd.md (aarch64_usdot<vsi2qi>): Rename to...
18431 (usdot_prod<vsi2qi>): ... This.
18432 * config/aarch64/aarch64-simd-builtins.def (usdot): Rename to...
18433 (usdot_prod): ...This.
18434 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Likewise.
18435 * config/aarch64/aarch64-sve.md (@aarch64_<sur>dot_prod<vsi2qi>):
18437 (@<sur>dot_prod<vsi2qi>): ...This.
18438 * config/aarch64/aarch64-sve-builtins-base.cc
18439 (svusdot_impl::expand): Use it.
18441 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18443 * optabs.def (usdot_prod_optab): New.
18444 * doc/md.texi: Document it and clarify other dot prod optabs.
18445 * optabs-tree.h (enum optab_subtype): Add optab_vector_mixed_sign.
18446 * optabs-tree.c (optab_for_tree_code): Support usdot_prod_optab.
18447 * optabs.c (expand_widen_pattern_expr): Likewise.
18448 * tree-cfg.c (verify_gimple_assign_ternary): Likewise.
18449 * tree-vect-loop.c (vectorizable_reduction): Query dot-product kind.
18450 * tree-vect-patterns.c (vect_supportable_direct_optab_p): Take optional
18452 (vect_widened_op_tree): Optionally ignore
18454 (vect_recog_dot_prod_pattern): Support usdot_prod_optab.
18456 2021-07-14 H.J. Lu <hjl.tools@gmail.com>
18459 * config/i386/driver-i386.c (host_detect_local_cpu): Check
18460 "arch [32|64]" and "tune [32|64]" for 32-bit and 64-bit codegen.
18461 Enable UINTR only for 64-bit codegen.
18462 * config/i386/i386-options.c
18463 (ix86_option_override_internal::DEF_PTA): Skip PTA_UINTR if not
18465 * config/i386/i386.h (ARCH_ARG): New.
18466 (CC1_CPU_SPEC): Pass "[arch|tune] 32" for 32-bit codegen and
18467 "[arch|tune] 64" for 64-bit codegen.
18469 2021-07-14 Richard Biener <rguenther@suse.de>
18471 PR tree-optimization/101445
18472 * tree-vect-stmts.c (vectorizable_load): Do the gap adjustment
18473 of the IV in the correct direction for negative stride
18476 2021-07-14 Jakub Jelinek <jakub@redhat.com>
18479 * godump.c (godump_str_hash): New type.
18480 (godump_container::pot_dummy_types): Use string_hash instead of
18481 ptr_hash in the hash_set.
18483 2021-07-14 Richard Biener <rguenther@suse.de>
18485 * tree-vect-loop.c (vect_find_reusable_accumulator): Handle
18486 vector types where the old vector type has a multiple of
18487 the new vector type elements.
18488 (vect_create_partial_epilog): New function, split out from...
18489 (vect_create_epilog_for_reduction): ... here.
18490 (vect_transform_cycle_phi): Reduce the re-used accumulator
18491 to the new vector type.
18493 2021-07-14 Alexandre Oliva <oliva@adacore.com>
18495 * tree-ssa-alias.c (attr_fnspec::verify): Fix index in
18496 non-'t'-sized arg check.
18498 2021-07-14 Alexandre Oliva <oliva@adacore.com>
18500 * tree-cfg.c (cleanup_dead_labels_eh): Update
18501 post_landing_pad label upon change of landing pad block's
18503 (cleanup_dead_labels): Check that a removed label is not that
18506 2021-07-13 Jonathan Wright <jonathan.wright@arm.com>
18508 * combine.c (combine_simplify_rtx): Add vec_select -> subreg
18510 * config/aarch64/aarch64.md (*zero_extend<SHORT:mode><GPI:mode>2_aarch64):
18511 Add Neon to general purpose register case for zero-extend
18513 * config/arm/vfp.md (*arm_movsi_vfp): Remove "*" from *t -> r
18514 case to prevent some cases opting to go through memory.
18515 * cse.c (fold_rtx): Add vec_select -> subreg simplification.
18516 * rtl.c (rtvec_series_p): Define predicate to determine
18517 whether a vector contains a linear series of integers.
18518 * rtl.h (rtvec_series_p): Define.
18519 * rtlanal.c (vec_series_lowpart_p): Define predicate to
18520 determine if a vector selection is equivalent to the low part
18522 * rtlanal.h (vec_series_lowpart_p): Define.
18523 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
18524 Add vec_select -> subreg simplification.
18526 2021-07-13 Paul A. Clarke <pc@us.ibm.com>
18528 * config/rs6000/smmintrin.h (_mm_testz_si128, _mm_testc_si128,
18529 _mm_testnzc_si128, _mm_test_all_ones, _mm_test_all_zeros,
18530 _mm_test_mix_ones_zeros): New.
18532 2021-07-13 Roger Sayle <roger@nextmovesoftware.com>
18533 Richard Biener <rguenther@suse.de>
18535 * gimple.c (gimple_could_trap_p_1): Make S argument a
18536 "const gimple*". Preserve constness in call to
18537 gimple_asm_volatile_p.
18538 (gimple_could_trap_p): Make S argument a "const gimple*".
18539 * gimple.h (gimple_could_trap_p_1, gimple_could_trap_p):
18540 Update function prototypes.
18542 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18544 * tree-vectorizer.h (vect_reusable_accumulator): New structure.
18545 (_loop_vec_info::main_loop_edge): New field.
18546 (_loop_vec_info::skip_main_loop_edge): Likewise.
18547 (_loop_vec_info::skip_this_loop_edge): Likewise.
18548 (_loop_vec_info::reusable_accumulators): Likewise.
18549 (_stmt_vec_info::reduc_scalar_results): Likewise.
18550 (_stmt_vec_info::reused_accumulator): Likewise.
18551 (vect_get_main_loop_result): Declare.
18552 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize
18553 reduc_scalar_inputs.
18554 (vec_info::free_stmt_vec_info): Free reduc_scalar_inputs.
18555 * tree-vect-loop-manip.c (vect_get_main_loop_result): New function.
18556 (vect_do_peeling): Fill an epilogue loop's main_loop_edge,
18557 skip_main_loop_edge and skip_this_loop_edge fields.
18558 * tree-vect-loop.c (INCLUDE_ALGORITHM): Define.
18559 (vect_emit_reduction_init_stmts): New function.
18560 (get_initial_def_for_reduction): Use it.
18561 (get_initial_defs_for_reduction): Likewise. Change the vinfo
18562 parameter to a loop_vec_info.
18563 (vect_create_epilog_for_reduction): Store the scalar results
18564 in the reduc_info. If an epilogue loop is reusing an accumulator
18565 from the main loop, and if the epilogue loop can also be skipped,
18566 try to place the reduction code in the join block. Record
18567 accumulators that could potentially be reused by epilogue loops.
18568 (vect_transform_cycle_phi): When vectorizing epilogue loops,
18569 try to reuse accumulators from the main loop. Record the initial
18570 value in reduc_info for non-SLP reductions too.
18572 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18574 * tree-vect-loop.c (get_initial_def_for_reduction): Remove
18575 adjustment handling. Take the neutral value as an argument,
18576 in place of the code argument.
18577 (vect_transform_cycle_phi): Update accordingly. Handle the
18578 initial values of cond reductions separately from code reductions.
18579 Choose the adjustment here rather than in
18580 get_initial_def_for_reduction. Sink the splat of vec_initial_def.
18582 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18584 * tree-vect-loop.c (neutral_op_for_slp_reduction): Replace with...
18585 (neutral_op_for_reduction): ...this, providing a more general
18587 (vect_create_epilog_for_reduction): Update accordingly.
18588 (vectorizable_reduction): Likewise.
18589 (vect_transform_cycle_phi): Likewise.
18591 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18593 * tree-vect-loop.c (get_initial_def_for_reduction): Take the
18594 reduc_info instead of the original stmt_vec_info.
18595 (vect_transform_cycle_phi): Update accordingly.
18597 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18599 * tree-vect-loop.c (get_initial_defs_for_reduction): Take the
18600 reduc_info as an additional parameter.
18601 (vect_transform_cycle_phi): Update accordingly.
18603 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18605 * tree-vectorizer.h: Include tree-ssa-operands.h.
18606 (vect_phi_initial_value): New function.
18607 * tree-vect-loop.c (neutral_op_for_slp_reduction): Use it.
18608 (get_initial_defs_for_reduction, info_for_reduction): Likewise.
18609 (vect_create_epilog_for_reduction, vectorizable_reduction): Likewise.
18610 (vect_transform_cycle_phi, vectorizable_induction): Likewise.
18612 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18614 * tree-vect-loop.c (vect_create_epilog_for_reduction): Convert
18615 the phi results to vectype after creating them. Remove later
18616 conversion code that thus becomes redundant.
18618 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18620 * tree-vect-loop.c (vect_create_epilog_for_reduction): Replace
18621 the new_phis vector with a reduc_inputs vector. Combine handling
18622 of reduction chains and ncopies > 1.
18624 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18626 * tree-vect-loop.c (vect_create_epilog_for_reduction): Truncate
18627 scalar_results to group_size elements after reducing down from
18628 N*group_size elements. Construct an array_slice of the live-out
18629 stmts and assert that there is one stmt per scalar result.
18631 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18633 * tree-vect-loop.c (vect_create_epilog_for_reduction): Remove
18634 nested_in_vect_loop and use double_reduc everywhere. Remove dead
18635 assignment to "loop".
18637 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18639 * internal-fn.c (vectorized_internal_fn_supported_p): Handle
18640 vector types first. For scalar types, consider both the preferred
18641 vector mode and the alternative vector modes.
18642 * optabs-query.c (can_vec_mask_load_store_p): Use the same
18643 structure as above, in particular using related_vector_mode
18644 for modes provided by autovectorize_vector_modes.
18646 2021-07-13 Jakub Jelinek <jakub@redhat.com>
18647 Richard Biener <rguenther@suse.de>
18649 PR tree-optimization/101419
18650 * tree-pass.h (PROP_objsz): Define.
18651 (make_pass_early_object_sizes): Declare.
18652 * passes.def (pass_all_early_optimizations): Rename pass_object_sizes
18653 there to pass_early_object_sizes, drop parameter.
18654 (pass_all_optimizations): Move pass_object_sizes right after pass_ccp,
18655 drop parameter, move pass_post_ipa_warn right after that.
18656 * tree-object-size.c (pass_object_sizes::execute): Rename to...
18657 (object_sizes_execute): ... this. Add insert_min_max_p argument.
18658 (pass_data_object_sizes): Move after object_sizes_execute.
18659 (pass_object_sizes): Likewise. In execute method call
18660 object_sizes_execute, drop set_pass_param method and insert_min_max_p
18661 non-static data member and its initializer in the ctor.
18662 (pass_data_early_object_sizes, pass_early_object_sizes,
18663 make_pass_early_object_sizes): New.
18664 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Use
18665 (cfun->curr_properties & PROP_objsz) instead of cfun->after_inlining.
18667 2021-07-13 Kito Cheng <kito.cheng@sifive.com>
18670 * config/riscv/constraints.md ("S"): Update description and remove
18672 * doc/md.texi (Machine Constraints): Document the 'S' constraints
18675 2021-07-13 Richard Biener <rguenther@suse.de>
18678 2021-07-12 Richard Biener <rguenther@suse.de>
18680 * tree-vect-slp.c (vect_slp_region): Show the number of
18681 SLP graph entries in the optimization message.
18683 2021-07-13 Michael Meissner <meissner@linux.ibm.com>
18685 * config/rs6000/altivec.md (xxspltiw_v4sf): Change local variable
18687 * config/rs6000/rs6000-protos.h (rs6000_const_f32_to_i32): Change
18688 return type to long.
18689 * config/rs6000/rs6000.c (rs6000_const_f32_to_i32): Change return
18692 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
18694 * gimple-range-fold.cc (fold_using_range::range_of_builtin_ubsan_call):
18695 Query relation between the 2 operands and use it.
18697 2021-07-12 Sergei Trofimovich <siarheit@google.com>
18699 * doc/cfg.texi: Fix s/ei_safe_safe/ei_safe_edge/ typo.
18701 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
18704 * config/i386/predicates.md (vec_setm_sse41_operand):
18705 Rename from vec_setm_operand.
18706 (vec_setm_avx2_operand): New predicate.
18707 * config/i386/sse.md (vec_set<V_128:mode>): Use V_128 mode iterator.
18708 Use vec_setm_sse41_operand as operand 2 predicate.
18709 (vec_set<V_256_512:mode): New expander.
18710 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_sse41_operand
18711 as operand 2 predicate.
18713 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
18715 PR tree-optimization/101335
18716 * range-op.cc (operator_cast::lhs_op1_relation): Delete.
18718 2021-07-12 Andrew Pinski <apinski@marvell.com>
18720 * tree-ssa-phiopt.c (match_simplify_replacement): Move
18721 insert of the sequence before the movement of the
18722 statement. Check if to see if the statement is used
18723 outside of the original phi to see if we should move it.
18725 2021-07-12 Richard Biener <rguenther@suse.de>
18727 * dump-context.h (debug_dump_context::debug_dump_context):
18728 Add FILE * parameter defaulted to stderr.
18729 * dumpfile.c (debug_dump_context::debug_dump_context): Adjust.
18730 * tree-vect-slp.c (dot_slp_tree): New functions.
18732 2021-07-12 Richard Biener <rguenther@suse.de>
18734 PR tree-optimization/101373
18735 * tree-ssa-pre.c (prune_clobbered_mems): Also prune trapping
18736 references when the BB may not return.
18737 (compute_avail): Pass in the function we're working on and
18738 replace cfun references with it. Externally throwing
18739 const calls also possibly terminate the function.
18740 (pass_pre::execute): Pass down the function we're working on.
18741 * gcse.c (compute_hash_table_work): Externally throwing
18742 const/pure calls also need record_last_mem_set_info.
18743 * postreload-gcse.c (record_opr_changes): Looping or externally
18744 throwing const/pure calls also need record_last_mem_set_info.
18746 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
18748 * recog.c (memory_address_addr_space_p): Change the type to bool.
18749 Return true/false instead of 1/0.
18750 (offsettable_memref_p): Ditto.
18751 (offsettable_nonstrict_memref_p): Ditto.
18752 (offsettable_address_addr_space_p): Ditto.
18753 Change the type of addressp indirect function to bool.
18754 * recog.h (memory_address_addr_space_p): Change the type to bool.
18755 (strict_memory_address_addr_space_p): Ditto.
18756 (offsettable_memref_p): Ditto.
18757 (offsettable_nonstrict_memref_p): Ditto.
18758 (offsettable_address_addr_space_p): Ditto.
18759 * reload.c (maybe_memory_address_addr_space_p): Ditto.
18760 (strict_memory_address_addr_space_p): Change the type to bool.
18761 Return true/false instead of 1/0.
18762 (maybe_memory_address_addr_space_p): Change the type to bool.
18764 2021-07-12 Richard Biener <rguenther@suse.de>
18766 * tree-vect-slp.c (vect_slp_region): Show the number of
18767 SLP graph entries in the optimization message.
18769 2021-07-12 Richard Biener <rguenther@suse.de>
18771 PR tree-optimization/101394
18772 * tree-ssa-pre.c (do_pre_regular_insertion): Avoid inserting
18773 copies from abnormals for a full redundancy.
18775 2021-07-12 Richard Biener <rguenther@suse.de>
18777 PR middle-end/101423
18778 * gimple.c (gimple_could_trap_p_1): Internal function calls
18780 * tree-eh.c (tree_could_trap_p): Likewise.
18782 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
18785 * config/arm/arm_neon.h (vmul_n_u32): Replace call to builtin with
18787 (vmulq_n_u32): Likewise.
18788 (vmul_n_f32): Gate __a * __b on __FAST_MATH__.
18789 (vmulq_n_f32): Likewise.
18790 (vmul_n_f16): Likewise.
18791 (vmulq_n_f16): Likewise.
18793 2021-07-12 Martin Liska <mliska@suse.cz>
18795 PR sanitizer/101425
18796 * gcc.c (check_offload_target_name): Call
18797 candidates_list_and_hint only if we have a candidate.
18799 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
18802 * config/arm/neon.md (vec_init): Move to ...
18803 * config/arm/vec-common.md (vec_init): ... here.
18804 Change the pattern's mode to VDQX and gate it on VALID_MVE_MODE.
18806 2021-07-12 Roger Sayle <roger@nextmovesoftware.com>
18808 PR tree-optimization/101403
18809 * match.pd ((T)bswap(X)>>C): Correctly handle cases where
18810 signedness of the shift is not the same as the signedness of
18811 the type extension.
18813 2021-07-09 Roger Sayle <roger@nextmovesoftware.com>
18814 Uroš Bizjak <ubizjak@gmail.com>
18816 * config/i386/i386.md (*divmodsi4_const): Optimize SImode
18817 divmod of a constant numerator with new define_insn_and_split.
18819 2021-07-09 Iain Sandoe <iain@sandoe.co.uk>
18822 * config/i386/i386-expand.c (ix86_expand_call): If a call is
18823 to a non-local-binding, or local but to a public symbol, then
18824 assume that it might be indirected via the lazy symbol binder.
18825 Mark R10 and R10 as clobbered in that case.
18827 2021-07-09 Eric Botcazou <ebotcazou@adacore.com>
18830 * gcc.c (ASM_DEBUG_DWARF_OPTION): Set again to --gdwarf2 in
18831 the case where HAVE_AS_WORKING_DWARF_N_FLAG is not defined
18832 and HAVE_LD_BROKEN_PE_DWARF5 is defined.
18834 2021-07-09 Uroš Bizjak <ubizjak@gmail.com>
18836 * config/i386/i386.md (*udivmodsi4_pow2_zext_1): Limit the
18837 log2 range of operands[3] to [1,31].
18838 (*udivmodsi4_pow2_zext_2): Ditto. Correct insn RTX pattern.
18840 2021-07-09 Sergei Trofimovich <siarheit@google.com>
18842 * doc/md.texi: Don't split @smallexample in multiple @groups.
18844 2021-07-09 Sergei Trofimovich <siarheit@google.com>
18846 * doc/md.texi: Add missing 'see' word.
18848 2021-07-09 Andrew Pinski <apinski@marvell.com>
18850 * tree-ssa-phiopt.c (phiopt_early_allow): Change arguments
18851 to take sequence and gimple_match_op. Accept the case where
18852 op is a SSA_NAME and one statement in the sequence.
18853 Also allow constants.
18854 (gimple_simplify_phiopt): Always pass a sequence to resimplify.
18855 Update call to phiopt_early_allow. Discard the sequence if not
18858 2021-07-09 Xi Ruoyao <xry111@mengyan1223.wang>
18863 * config/mips/mips.c (mips_const_insns): Use MSA_SUPPORTED_MODE_P
18864 instead of ISA_HAS_MSA.
18865 (mips_expand_vec_unpack): Likewise.
18866 (mips_expand_vector_init): Likewise.
18868 2021-07-09 Kewen Lin <linkw@linux.ibm.com>
18870 * config/rs6000/vsx.md (mods_<mode>): Rename to...
18871 (mod<mode>3): ... this.
18872 (modu_<mode>): Rename to...
18873 (umod<mode>3): ... this.
18874 * config/rs6000/rs6000-builtin.def (MODS_V2DI, MODS_V4SI, MODU_V2DI,
18875 MODU_V4SI): Adjust.
18877 2021-07-08 Jeff Law <jeffreyalaw@gmail.com>
18879 * config/h8300/shiftrotate.md (variable shifts): Expose condition
18880 code handling for the test before the loop.
18882 2021-07-08 Martin Jambor <mjambor@suse.cz>
18885 * ipa-sra.c (class isra_call_summary): New member
18886 m_before_any_store, initialize it in the constructor.
18887 (isra_call_summary::dump): Dump the new field.
18888 (ipa_sra_call_summaries::duplicate): Copy it.
18889 (process_scan_results): Set it.
18890 (isra_write_edge_summary): Stream it.
18891 (isra_read_edge_summary): Likewise.
18892 (param_splitting_across_edge): Only override
18893 safe_to_import_accesses if m_before_any_store is set.
18895 2021-07-08 Martin Sebor <msebor@redhat.com>
18897 PR bootstrap/101374
18898 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref):
18899 Use Object Size Type 0 instead of 1.
18901 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
18903 * tree-vect-loop.c (vectorizable_reduction): Remove always-true
18906 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
18908 * match.pd: Simplify an extend-operate-truncate sequence involving
18911 2021-07-08 Roger Sayle <roger@nextmovesoftware.com>
18912 Richard Biener <rguenther@suse.de>
18914 PR tree-optimization/40210
18915 * match.pd (bswap optimizations): Simplify (bswap(x)>>C1)&C2 as
18916 (x>>C3)&C2 when possible. Simplify bswap(x)>>C1 as ((T)x)>>C2
18917 when possible. Simplify bswap(x)&C1 as (x>>C2)&C1 when 0<=C1<=255.
18919 2021-07-08 Uroš Bizjak <ubizjak@gmail.com>
18922 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
18924 * config/i386/mmx.md (V_32): New mode iterator.
18925 (mov<V_32:mode>): Use V_32 mode iterator.
18926 (*mov<V_32:mode>_internal): Ditto.
18927 (*push<V_32:mode>2_rex64): Ditto.
18928 (*push<V_32:mode>2): Ditto.
18929 (movmisalign<V_32:mode>): Ditto.
18930 (mmx_<any_shiftrt:insn>v1si3): New insn pattern.
18931 (sse4_1_<any_extend:code>v2qiv2hi2): Ditto.
18932 (vec_unpacks_lo_v4qi): New expander.
18933 (vec_unpacks_hi_v4qi): Ditto.
18934 (vec_unpacku_lo_v4qi): Ditto.
18935 (vec_unpacku_hi_v4qi): Ditto.
18936 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add V1SImode.
18937 (VALID_INT_MODE_P): Ditto.
18939 2021-07-08 Michael Meissner <meissner@linux.ibm.com>
18942 * config/rs6000/rs6000.md (udivti3): New insn.
18943 (divti3): New insn.
18944 (umodti3): New insn.
18945 (modti3): New insn.
18947 2021-07-07 Martin Sebor <msebor@redhat.com>
18949 PR tree-optimization/100137
18950 PR tree-optimization/99121
18951 PR tree-optimization/97027
18952 * builtins.c (access_ref::access_ref): Also set offmax.
18953 (access_ref::offset_in_range): Define new function.
18954 (access_ref::add_offset): Set offmax.
18955 (access_ref::inform_access): Handle access_none.
18956 (handle_mem_ref): Clear ostype.
18957 (compute_objsize_r): Handle ASSERT_EXPR.
18958 * builtins.h (struct access_ref): Add offmax member.
18959 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Use
18960 compute_objsize() and simplify.
18962 2021-07-07 Peter Bergner <bergner@linux.ibm.com>
18964 * config/rs6000/rs6000-call.c (mma_init_builtins): Use VSX_BUILTIN_LXVP
18965 and VSX_BUILTIN_STXVP.
18967 2021-07-07 Martin Sebor <msebor@redhat.com>
18970 * config/aarch64/aarch64.c (aarch64_simd_lane_bounds): Remove
18971 a stray %K from error_at() missed in r12-2088.
18973 2021-07-07 Richard Biener <rguenther@suse.de>
18975 PR tree-optimization/99728
18976 * tree-ssa-loop-im.c (gather_mem_refs_stmt): Record
18978 (mem_refs_may_alias_p): Add assert we handled aggregate
18980 (sm_seq_valid_bb): Give up when running into aggregate copies.
18981 (ref_indep_loop_p): Handle aggregate copies as never
18982 being invariant themselves but allow other refs to be
18983 disambiguated against them.
18984 (can_sm_ref_p): Do not try to apply store-motion to aggregate
18987 2021-07-06 Indu Bhagat <indu.bhagat@oracle.com>
18990 * dwarf2ctf.c (ctf_get_AT_data_member_location): Multiply by 8 to get
18993 2021-07-06 Martin Sebor <msebor@redhat.com>
18995 * gimple-pretty-print.c (percent_G_format): Remove.
18996 * tree-diagnostic.c (default_tree_printer): Remove calls.
18997 * tree-pretty-print.c (percent_K_format): Remove.
18998 * tree-pretty-print.h (percent_K_format): Remove.
19000 2021-07-06 Martin Sebor <msebor@redhat.com>
19002 * config/aarch64/aarch64-builtins.c (aarch64_simd_expand_builtin):
19003 Remove %K and use error_at.
19004 (aarch64_expand_fcmla_builtin): Same.
19005 (aarch64_expand_builtin_tme): Same.
19006 (aarch64_expand_builtin_memtag): Same.
19007 * config/arm/arm-builtins.c (arm_expand_acle_builtin): Same.
19008 (arm_expand_builtin): Same.
19009 * config/arm/arm.c (bounds_check): Same.
19011 2021-07-06 Martin Sebor <msebor@redhat.com>
19013 * builtins.c (warn_string_no_nul): Remove %G.
19014 (maybe_warn_for_bound): Same.
19015 (warn_for_access): Same.
19016 (check_access): Same.
19017 (check_strncat_sizes): Same.
19018 (expand_builtin_strncat): Same.
19019 (expand_builtin_strncmp): Same.
19020 (expand_builtin): Same.
19021 (expand_builtin_object_size): Same.
19022 (warn_dealloc_offset): Same.
19023 (maybe_emit_free_warning): Same.
19024 * calls.c (maybe_warn_alloc_args_overflow): Same.
19025 (maybe_warn_nonstring_arg): Same.
19026 (maybe_warn_rdwr_sizes): Same.
19027 * expr.c (expand_expr_real_1): Remove %K.
19028 * gimple-fold.c (gimple_fold_builtin_strncpy): Remove %G.
19029 (gimple_fold_builtin_strncat): Same.
19030 * gimple-ssa-sprintf.c (format_directive): Same.
19031 (handle_printf_call): Same.
19032 * gimple-ssa-warn-alloca.c (pass_walloca::execute): Same.
19033 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
19034 (maybe_diag_access_bounds): Same. Call gimple_location.
19035 (check_bounds_or_overlap): Same.
19036 * trans-mem.c (ipa_tm_scan_irr_block): Remove %K. Simplify.
19037 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Remove %G.
19038 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
19039 (maybe_diag_stxncpy_trunc): Same.
19040 (handle_builtin_stxncpy_strncat): Same.
19041 (maybe_warn_pointless_strcmp): Same.
19042 * tree-ssa-uninit.c (maybe_warn_operand): Same.
19044 2021-07-06 Uroš Bizjak <ubizjak@gmail.com>
19047 * config/i386/predicates.md (vec_setm_operand): Enable
19048 register_operand for TARGET_SSE4_1.
19049 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_operand
19050 as operand 2 predicate. Call ix86_expand_vector_set_var
19051 for non-constant index operand.
19052 (vec_setv4qi): Use vec_setm_mmx_operand as operand 2 predicate.
19053 Call ix86_expand_vector_set_var for non-constant index operand.
19055 2021-07-06 Jeff Law <jeffreyalaw@gmail.com>
19057 * config/h8300/jumpcall.md (*branch): When possible, generate
19058 the comparison in CCZN mode.
19059 * config/h8300/predicates.md (simple_memory_operand): Reject all
19060 auto-increment addressing modes.
19062 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
19064 PR bootstrap/100246
19065 * config/i386/i386.h (struct stringop_algs): Define a CTOR for
19068 2021-07-06 Richard Biener <rguenther@suse.de>
19070 * doc/md.texi (vec_fmaddsub<mode>4): Document.
19071 (vec_fmsubadd<mode>4): Likewise.
19072 * optabs.def (vec_fmaddsub$a4): Add.
19073 (vec_fmsubadd$a4): Likewise.
19074 * internal-fn.def (IFN_VEC_FMADDSUB): Add.
19075 (IFN_VEC_FMSUBADD): Likewise.
19076 * tree-vect-slp-patterns.c (addsub_pattern::recognize):
19077 Refactor to handle IFN_VEC_FMADDSUB and IFN_VEC_FMSUBADD.
19078 (addsub_pattern::build): Likewise.
19079 * tree-vect-slp.c (vect_optimize_slp): CFN_VEC_FMADDSUB
19080 and CFN_VEC_FMSUBADD are not transparent for permutes.
19081 * config/i386/sse.md (vec_fmaddsub<mode>4): New expander.
19082 (vec_fmsubadd<mode>4): Likewise.
19084 2021-07-06 Richard Biener <rguenther@suse.de>
19086 * doc/invoke.texi (fmove-loop-stores): Document.
19087 * common.opt (fmove-loop-stores): New option.
19088 * opts.c (default_options_table): Enable -fmove-loop-stores
19089 at -O1 but not -Og.
19090 * tree-ssa-loop-im.c (pass_lim::execute): Pass
19091 flag_move_loop_stores instead of true to
19092 loop_invariant_motion_in_fun.
19094 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
19096 * doc/install.texi: Document --with-dsymutil.
19098 2021-07-06 Andrew Pinski <apinski@marvell.com>
19100 PR tree-optimization/101256
19101 * dbgcnt.def (phiopt_edge_range): New counter.
19102 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
19103 Check to make sure the new name is defined in the same
19104 bb as the conditional before duplicating range info.
19105 Also add debug counter.
19107 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
19109 PR rtl-optimization/100328
19110 * config/i386/i386-options.c (ix86_option_override_internal):
19111 Set param_ira_consider_dup_in_all_alts to 0.
19113 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
19115 PR rtl-optimization/100328
19116 * doc/invoke.texi (ira-consider-dup-in-all-alts): Document new
19118 * ira.c (ira_get_dup_out_num): Adjust as parameter
19119 param_ira_consider_dup_in_all_alts.
19120 * params.opt (ira-consider-dup-in-all-alts): New.
19121 * ira-conflicts.c (process_regs_for_copy): Add one parameter
19122 single_input_op_has_cstr_p.
19123 (get_freq_for_shuffle_copy): New function.
19124 (add_insn_allocno_copies): Adjust as single_input_op_has_cstr_p.
19125 * ira-int.h (ira_get_dup_out_num): Add one bool parameter.
19127 2021-07-05 Jeff Law <jeffreyalaw@gmail.com>
19129 * config/h8300/shiftrotate.md (shift-by-variable patterns): Update to
19130 generate condition code aware RTL directly.
19132 2021-07-05 Andrew Pinski <apinski@marvell.com>
19134 PR tree-optimization/101039
19135 * match.pd (A CMP 0 ? A : -A): New patterns.
19136 * tree-ssa-phiopt.c (abs_replacement): Delete function.
19137 (tree_ssa_phiopt_worker): Don't call abs_replacement.
19138 Update comment about abs_replacement.
19140 2021-07-05 Andrew Pinski <apinski@marvell.com>
19142 * tree-ssa-phiopt.c (gimple_simplify_phiopt):
19143 If "A ? B : C" fails to simplify, try "(!A) ? C : B".
19145 2021-07-05 Andrew Pinski <apinski@marvell.com>
19147 * tree-ssa-phiopt.c (match_simplify_replacement):
19148 Add early_p argument. Call gimple_simplify_phiopt
19149 instead of gimple_simplify.
19150 (tree_ssa_phiopt_worker): Update call to
19151 match_simplify_replacement and allow unconditionally.
19152 (phiopt_early_allow): New function.
19153 (gimple_simplify_phiopt): New function.
19155 2021-07-05 Andrew Pinski <apinski@marvell.com>
19157 PR middle-end/101237
19158 * fold-const.c (negate_expr_p): Remove call to element_mode
19159 and TREE_MODE/TREE_TYPE when calling HONOR_SIGNED_ZEROS,
19160 HONOR_SIGN_DEPENDENT_ROUNDING, and HONOR_SNANS.
19161 (fold_negate_expr_1): Likewise.
19162 (const_unop): Likewise.
19163 (fold_cond_expr_with_comparison): Likewise.
19164 (fold_binary_loc): Likewise.
19165 (fold_ternary_loc): Likewise.
19166 (tree_call_nonnegative_warnv_p): Likewise.
19167 * match.pd (-(A + B) -> (-B) - A): Likewise.
19169 2021-07-05 Iain Sandoe <iain@sandoe.co.uk>
19171 * configure.ac: Handle --with-dsymutil in the same way as we
19172 do for the assembler and linker. (DEFAULT_DSYMUTIL): New.
19173 Extract the type and version for the dsymutil configured or
19174 found by the default searches.
19175 * config.in: Regenerated.
19176 * configure: Regenerated.
19177 * collect2.c (do_dsymutil): Handle locating dsymutil in the
19178 same way as for the assembler and linker.
19179 * config/darwin.h (DSYMUTIL): Delete.
19180 * gcc.c: Report a configured dsymutil correctly.
19181 * exec-tool.in: Allow for dsymutil.
19183 2021-07-05 Uroš Bizjak <ubizjak@gmail.com>
19185 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
19186 Handle V4QI and V2HI modes.
19187 (expand_vec_perm_blend): Allow 4-byte vector modes with TARGET_SSE4_1.
19188 Handle V4QI mode. Emit mmx_pblendvb32 for 4-byte modes.
19189 (expand_vec_perm_pshufb): Rewrite to use switch statemets.
19190 Handle 4-byte dual operands with TARGET_XOP and single operands
19191 with TARGET_SSSE3. Emit mmx_ppermv32 for TARGET_XOP and
19192 mmx_pshufbv4qi3 for TARGET_SSSE3.
19193 (expand_vec_perm_pblendv): Allow 4-byte vector modes with TARGET_SSE4_1.
19194 (expand_vec_perm_interleave2): Allow 4-byte vector modes.
19195 (expand_vec_perm_pshufb2): Allow 4-byte vector modes with TARGET_SSSE3.
19196 (expand_vec_perm_even_odd_1): Handle V4QI mode.
19197 (expand_vec_perm_broadcast_1): Handle V4QI mode.
19198 (ix86_vectorize_vec_perm_const): Handle V4QI mode.
19199 * config/i386/mmx.md (mmx_ppermv32): New insn pattern.
19200 (mmx_pshufbv4qi3): Ditto.
19201 (*mmx_pblendw32): Ditto.
19202 (*mmx_pblendw64): Rename from *mmx_pblendw.
19203 (mmx_punpckhbw_low): New insn_and_split pattern.
19204 (mmx_punpcklbw_low): Ditto.
19206 2021-07-05 Richard Biener <rguenther@suse.de>
19208 * tree-vect-loop-manip.c (vect_loop_versioning): Do not
19209 set LOOP_C_INFINITE on the vectorized loop.
19211 2021-07-05 Richard Biener <rguenther@suse.de>
19213 PR middle-end/101291
19214 * cfgloopmanip.c (loop_version): Set the loop copy of the
19215 versioned loop to the new loop.
19217 2021-07-04 Iain Sandoe <iain@sandoe.co.uk>
19220 * config.gcc: Ensure that Darwin biarch definitions are
19221 added before i386.h.
19222 * config/i386/darwin.h (TARGET_64BIT): Remove.
19223 (PR80556_WORKAROUND): New.
19224 (REAL_LIBGCC_SPEC): Amend to use PR80556_WORKAROUND.
19225 (DARWIN_SUBARCH_SPEC): New.
19226 * config/i386/darwin32-biarch.h (TARGET_64BIT_DEFAULT,
19227 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
19228 (REAL_LIBGCC_SPEC): Remove.
19229 * config/i386/darwin64-biarch.h (TARGET_64BIT_DEFAULT,
19230 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
19231 (REAL_LIBGCC_SPEC): Remove.
19233 2021-07-03 H.J. Lu <hjl.tools@gmail.com>
19235 PR middle-end/101294
19236 * expr.c (store_constructor): Don't use vec_duplicate on vector.
19238 2021-07-02 Martin Sebor <msebor@redhat.com>
19240 PR middle-end/98871
19241 PR middle-end/98512
19242 * diagnostic.c (get_any_inlining_info): New.
19243 (update_effective_level_from_pragmas): Handle inlining context.
19244 (diagnostic_enabled): Same.
19245 (diagnostic_report_diagnostic): Same.
19246 * diagnostic.h (struct diagnostic_info): Add ctor.
19247 (struct diagnostic_context): Add new member.
19248 * tree-diagnostic.c (set_inlining_locations): New.
19249 (tree_diagnostics_defaults): Set new callback pointer.
19251 2021-07-02 Peter Bergner <bergner@linux.ibm.com>
19253 * config/rs6000/rs6000-builtin.def (BU_MMA_PAIR_LD, BU_MMA_PAIR_ST):
19255 (__builtin_vsx_lxvp, __builtin_vsx_stxvp): New built-ins.
19256 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Expand
19257 lxvp and stxvp built-ins.
19258 (mma_init_builtins): Handle lxvp and stxvp built-ins.
19259 (builtin_function_type): Likewise.
19260 * doc/extend.texi (__builtin_vsx_lxvp, __builtin_mma_stxvp): Document.
19262 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19264 * config/h8300/h8300-protos.h (compute_a_shift_cc): Accept
19265 additional argument for the code.
19266 * config/h8300/h8300.c (compute_a_shift_cc): Accept additional
19267 argument for the code. Just return if the ZN bits are useful or
19268 not rather than the old style CC_* enums.
19269 * config/h8300/shiftrotate.md (shiftqi_noscratch): Move before
19270 more generic shiftqi patterns.
19271 (shifthi_noscratch, shiftsi_noscratch): Similarly.
19272 (shiftqi_noscratch_set_flags): New pattern.
19273 (shifthi_noscratch_set_flags, shiftsi_noscratch_set_flags): Likewise.
19275 2021-07-02 Andrew MacLeod <amacleod@redhat.com>
19277 PR tree-optimization/101223
19278 * range-op.cc (build_lt): Add -1 for signed values.
19279 (built_gt): Subtract -1 for signed values.
19281 2021-07-02 David Faust <david.faust@oracle.com>
19283 * btfout.c (get_btf_kind): Support BTF_KIND_FLOAT.
19284 (btf_asm_type): Likewise.
19286 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19288 * config/h8300/h8300-protos.h (output_a_shift): Make first argument
19289 an array of rtx rather than a pointer to rtx. Add code argument.
19290 (compute_a_shift_length): Similarly.
19291 * config/h8300/h8300.c (h8300_shift_costs): Adjust now that the
19292 shift itself isn't an operand. Create dummy operand[0] to carry
19293 a mode and pass a suitable rtx code to compute_a_shift_length.
19294 (get_shift_alg): Adjust operand number of clobber in output templates.
19295 (output_a_shift): Make first argument an array of rtx rather than
19296 a pointer to rtx. Add code argument for the type of shift.
19297 Adjust now that the shift itself is no longer an operand.
19298 (compute_a_shift_length): Similarly.
19299 * config/h8300/shiftrotate.md (shiftqi, shifthi, shiftsi): Use an
19300 iterator rather than nshift_operator.
19301 (shiftqi_noscratch, shifthi_noscratch, shiftsi_noscratch): Likewise.
19302 (shiftqi_clobber_flags): Adjust to API changes in output_a_shift
19303 and compute_a_shift_length.
19304 (shiftqi_noscratch_clobber_flags): Likewise.
19305 (shifthi_noscratch_clobber_flags): Likewise.
19306 (shiftsi_noscratch_clobber_flags): Likewise.
19308 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
19311 * config/darwin.h (DSYMUTIL_SPEC): Do not try to run
19312 dsymutil for BTF/CTF.
19314 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
19317 * config/darwin.h (CTF_INFO_SECTION_NAME): Update the
19318 segment to include BTF.
19319 (BTF_INFO_SECTION_NAME): New.
19321 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19323 * config/m32r/m32r-protos.h (call_operand): Adjust return type.
19324 (small_data_operand, memreg_operand, small_insn_p): Likewise.
19325 * config/m32r/m32r.c (call_operand): Adjust return type.
19326 (small_data_operand, memreg_operand): Likewise.
19328 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19330 * config/frv/frv-protos.h (integer_register_operand): Adjust return
19332 (frv_load_operand, gpr_or_fpr_operand, gpr_no_subreg_operand): Likewise.
19333 (fpr_or_int6_operand, gpr_or_int_operand); Likewise.
19334 (gpr_or_int12_operand, gpr_or_int10_operand); Likewise.
19335 (move_source_operand, move_destination_operand): Likewise.
19336 (condexec_source_operand, condexec_dest_operand): Likewise.
19337 (lr_operand, gpr_or_memory_operand, fpr_or_memory_operand): Likewise.
19338 (reg_or_0_operand, fcc_operand, icc_operand, cc_operand): Likewise.
19339 (fcr_operand, icr_operand, cr_operand, call_operand): Likewise.
19340 (fpr_operand, even_reg_operand, odd_reg_operand): Likewise.
19341 (even_gpr_operand, odd_gpr_operand, quad_fpr_operand): Likewise.
19342 (even_fpr_operand, odd_fpr_operand): Likewise.
19343 (dbl_memory_one_insn_operand, dbl_memory_two_insn_operand): Likewise.
19344 (int12_operand, int6_operand, int5_operand, uint5_operand): Likewise.
19345 (uint4_operand, uint1_operand, int_2word_operand): Likewise
19346 (upper_int16_operand, uint16_operand, symbolic_operand): Likewise.
19347 (relational_operator, float_relational_operator): Likewise.
19348 (ccr_eqne_operator, minmax_operator): Likewise.
19349 (condexec_si_binary_operator, condexec_si_media_operator): Likewise.
19350 (condexec_si_divide_operator, condexec_si_unary_operator): Likewise.
19351 (condexec_sf_conv_operator, condexec_sf_add_operator): Likewise.
19352 (intop_compare_operator, acc_operand, even_acc_operand): Likewise.
19353 (quad_acc_operand, accg_operand): Likewise.
19355 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19357 * config/stormy16/stormy16-protos.h (xstormy16_below_100_symbol): Change
19358 return type to a bool.
19359 (nonimmediate_nonstack_operand): Likewise.
19360 (xstormy16_splittable_below100_operand): Likewise.
19361 * config/stormy16/stormy16.c (xstormy16_below_100_symbol): Fix
19363 (xstormy16_splittable_below100_operand): Likewise.
19365 2021-07-02 Richard Biener <rguenther@suse.de>
19367 PR tree-optimization/101293
19368 * tree-ssa-loop-im.c (mem_ref_hasher::equal): Compare MEM_REF bases
19369 with combined offsets.
19370 (gather_mem_refs_stmt): Hash MEM_REFs as if their offset were
19371 combined with the rest of the offset.
19373 2021-07-02 Eric Botcazou <ebotcazou@adacore.com>
19375 * config/i386/i386.c (asm_preferred_eh_data_format): Always use the
19376 PIC encodings for PE-COFF targets.
19378 2021-07-02 Jakub Jelinek <jakub@redhat.com>
19381 * config/i386/i386-expand.c (ix86_broadcast_from_integer_constant):
19382 Return nullptr for TImode inner mode.
19384 2021-07-02 Richard Biener <rguenther@suse.de>
19386 PR tree-optimization/101280
19387 PR tree-optimization/101173
19388 * gimple-loop-interchange.cc
19389 (tree_loop_interchange::valid_data_dependences): Properly
19390 guard all dependence checks with DDR_REVERSED_P or its
19393 2021-07-02 Hongyu Wang <hongyu.wang@intel.com>
19395 * config/i386/i386-expand.c (ix86_expand_builtin):
19396 Add branch to clear odata when ZF is set for asedecenc_expand
19397 and wideaesdecenc_expand.
19399 2021-07-02 Eugene Rozenfeld <erozen@microsoft.com>
19401 * config/i386/gcc-auto-profile: regenerate
19403 2021-07-02 liuhongt <hongtao.liu@intel.com>
19405 * config/i386/sse.md (trunc<mode><pmov_dst_4>2): Refined to ..
19406 (trunc<mode><pmov_dst_4_lower>2): this.
19408 2021-07-01 David Malcolm <dmalcolm@redhat.com>
19410 * diagnostic.h (diagnostic_context::m_file_cache): New field.
19411 * input.c (class fcache): Rename to...
19412 (class file_cache_slot): ...this, making most members private and
19413 prefixing fields with "m_".
19414 (file_cache_slot::get_file_path): New accessor.
19415 (file_cache_slot::get_use_count): New accessor.
19416 (file_cache_slot::missing_trailing_newline_p): New accessor.
19417 (file_cache_slot::inc_use_count): New.
19418 (fcache_buffer_size): Move to...
19419 (file_cache_slot::buffer_size): ...here.
19420 (fcache_line_record_size): Move to...
19421 (file_cache_slot::line_record_size): ...here.
19422 (fcache_tab): Delete, in favor of global_dc->m_file_cache.
19423 (fcache_tab_size): Move to file_cache::num_file_slots.
19424 (diagnostic_file_cache_init): Update for move of fcache_tab
19425 to global_dc->m_file_cache.
19426 (diagnostic_file_cache_fini): Likewise.
19427 (lookup_file_in_cache_tab): Convert to...
19428 (file_cache::lookup_file): ...this.
19429 (diagnostics_file_cache_forcibly_evict_file): Update for move of
19430 fcache_tab to global_dc->m_file_cache, moving most of
19431 implementation to...
19432 (file_cache::forcibly_evict_file): ...this new function and...
19433 (file_cache_slot::evict): ...this new function.
19434 (evicted_cache_tab_entry): Convert to...
19435 (file_cache::evicted_cache_tab_entry): ...this.
19436 (add_file_to_cache_tab): Convert to...
19437 (file_cache::add_file): ...this, moving bulk of implementation
19439 (file_cache_slot::create): ..this new function.
19440 (file_cache::file_cache): New.
19441 (file_cache::~file_cache): New.
19442 (lookup_or_add_file_to_cache_tab): Convert to...
19443 (file_cache::lookup_or_add_file): ..this new function.
19444 (fcache::fcache): Rename to...
19445 (file_cache_slot::file_cache_slot): ...this, adding "m_" prefixes
19447 (fcache::~fcache): Rename to...
19448 (file_cache_slot::~file_cache_slot): ...this, adding "m_" prefixes
19450 (needs_read): Convert to...
19451 (file_cache_slot::needs_read_p): ...this.
19452 (needs_grow): Convert to...
19453 (file_cache_slot::needs_grow_p): ...this.
19454 (maybe_grow): Convert to...
19455 (file_cache_slot::maybe_grow): ...this.
19456 (read_data): Convert to...
19457 (file_cache_slot::read_data): ...this.
19458 (maybe_read_data): Convert to...
19459 (file_cache_slot::maybe_read_data): ...this.
19460 (get_next_line): Convert to...
19461 (file_cache_slot::get_next_line): ...this.
19462 (goto_next_line): Convert to...
19463 (file_cache_slot::goto_next_line): ...this.
19464 (read_line_num): Convert to...
19465 (file_cache_slot::read_line_num): ...this.
19466 (location_get_source_line): Update for moving of globals to
19467 global_dc->m_file_cache.
19468 (location_missing_trailing_newline): Likewise.
19469 * input.h (class file_cache_slot): New forward decl.
19470 (class file_cache): New.
19472 2021-07-01 Michael Meissner <meissner@linux.ibm.com>
19474 * config/rs6000/rs6000.c (rs6000_maybe_emit_fp_cmove): Add IEEE
19475 128-bit floating point conditional move support.
19476 (have_compare_and_set_mask): Add IEEE 128-bit floating point
19478 * config/rs6000/rs6000.md (mov<mode>cc, IEEE128 iterator): New insn.
19479 (mov<mode>cc_p10, IEEE128 iterator): New insn.
19480 (mov<mode>cc_invert_p10, IEEE128 iterator): New insn.
19481 (fpmask<mode>, IEEE128 iterator): New insn.
19482 (xxsel<mode>, IEEE128 iterator): New insn.
19484 2021-07-01 Iain Sandoe <iain@sandoe.co.uk>
19487 * config/darwin.h (CTF_INFO_SECTION_NAME): New.
19489 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
19491 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
19493 * config/i386/i386-protos.h (ix86_expand_vector_init_duplicate):
19495 * config/i386/sse.md (INT_BROADCAST_MODE): New mode iterator.
19496 (vec_duplicate<mode>): New expander.
19498 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
19501 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
19503 (ix86_byte_broadcast): New function.
19504 (ix86_convert_const_wide_int_to_broadcast): Likewise.
19505 (ix86_expand_move): Convert CONST_WIDE_INT to broadcast if mode
19506 size is 16 bytes or bigger.
19507 (ix86_broadcast_from_integer_constant): New function.
19508 (ix86_expand_vector_move): Convert CONST_WIDE_INT and CONST_VECTOR
19509 to broadcast if mode size is 16 bytes or bigger.
19510 * config/i386/i386-protos.h (ix86_gen_scratch_sse_rtx): New
19512 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): New function.
19514 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19516 * config/i386/predicates.md (ix86_endbr_immediate_operand):
19517 Return true/false instead of 1/0.
19518 (movq_parallel): Ditto.
19520 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19522 * recog.c (general_operand): Return true/false instead of 1/0.
19523 (register_operand): Ditto.
19524 (immediate_operand): Ditto.
19525 (const_int_operand): Ditto.
19526 (const_scalar_int_operand): Ditto.
19527 (const_double_operand): Ditto.
19528 (push_operand): Ditto.
19529 (pop_operand): Ditto.
19530 (memory_operand): Ditto.
19531 (indirect_operand): Ditto.
19533 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19535 * genpreds.c (write_predicate_subfunction):
19536 Change the type of written subfunction to bool.
19537 (write_one_predicate_function):
19538 Change the type of written function to bool.
19539 (write_tm_preds_h): Ditto.
19540 * recog.h (*insn_operand_predicate_fn): Change the type to bool.
19541 * recog.c (general_operand): Change the type to bool.
19542 (address_operand): Ditto.
19543 (register_operand): Ditto.
19544 (pmode_register_operand): Ditto.
19545 (scratch_operand): Ditto.
19546 (immediate_operand): Ditto.
19547 (const_int_operand): Ditto.
19548 (const_scalar_int_operand): Ditto.
19549 (const_double_operand): Ditto.
19550 (nonimmediate_operand): Ditto.
19551 (nonmemory_operand): Ditto.
19552 (push_operand): Ditto.
19553 (pop_operand): Ditto.
19554 (memory_operand): Ditto.
19555 (indirect_operand): Ditto.
19556 (ordered_comparison_operator): Ditto.
19557 (comparison_operator): Ditto.
19558 * config/i386/i386-expand.c (ix86_expand_sse_cmp):
19559 Change the type of indirect predicate function to bool.
19560 * config/rs6000/rs6000.c (easy_vector_constant):
19561 Change the type to bool.
19562 * config/mips/mips-protos.h (m16_based_address_p):
19563 Change the type of operand 3 to bool.
19565 2021-07-01 Richard Biener <rguenther@suse.de>
19567 PR tree-optimization/101280
19568 PR tree-optimization/101173
19569 * gimple-loop-interchange.cc
19570 (tree_loop_interchange::valid_data_dependences): Revert
19571 previous change and instead correctly handle DDR_REVERSED_P
19574 2021-07-01 Richard Biener <rguenther@suse.de>
19576 PR tree-optimization/101278
19577 * tree-ssa-dse.c (dse_classify_store): First check for
19578 uses, then ignore stmt for chaining purposes.
19580 2021-07-01 Richard Biener <rguenther@suse.de>
19582 PR tree-optimization/100778
19583 * tree-vect-slp.c (vect_schedule_slp_node): Do not place trapping
19584 vectorized ops ahead of their scalar BB.
19586 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19589 * config/i386/i386.md (*nabs<dwi>2_doubleword):
19590 New insn_and_split pattern.
19591 (*nabs<dwi>2_1): Ditto.
19592 * config/i386/i386-features.c
19593 (general_scalar_chain::compute_convert_gain):
19594 Handle (NEG (ABS (...))) RTX. Rewrite src code
19595 scanner as switch statement.
19596 (general_scalar_chain::convert_insn):
19597 Handle (NEG (ABS (...))) RTX.
19598 (general_scalar_to_vector_candidate_p):
19599 Detect (NEG (ABS (...))) RTX. Reorder case statements
19600 for (AND (NOT (...) ...)) fallthrough.
19602 2021-07-01 Richard Biener <rguenther@suse.de>
19604 PR tree-optimization/101178
19605 * tree-vect-slp.c (slpg_vertex::materialize): Remove.
19606 (slpg::perm_in): Add.
19607 (slpg::get_perm_in): Remove.
19608 (slpg::get_perm_materialized): Add.
19609 (vect_optimize_slp): Handle VEC_PERM nodes more optimally
19610 during permute propagation and materialization.
19612 2021-07-01 Jakub Jelinek <jakub@redhat.com>
19615 * dwarf2out.c (loc_list_from_tree_1): Handle COMPOUND_LITERAL_EXPR.
19617 2021-07-01 Jakub Jelinek <jakub@redhat.com>
19619 PR middle-end/94366
19620 * omp-low.c (lower_rec_input_clauses): Rename is_fp_and_or to
19621 is_truth_op, set it for TRUTH_*IF_EXPR regardless of new_var's type,
19622 use boolean_type_node instead of integer_type_node as NE_EXPR type.
19623 (lower_reduction_clauses): Likewise.
19625 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19627 * config/gcn/gcn.c: Include dwarf2.h.
19628 (gcn_addr_space_debug): New function.
19629 (TARGET_ADDR_SPACE_DEBUG): New hook.
19631 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19633 * common/config/gcn/gcn-common.c
19634 (gcn_option_optimization_table): Change OPT_fomit_frame_pointer to -O3.
19635 * config/gcn/gcn.c (gcn_expand_prologue): Prefer the frame pointer
19637 (gcn_expand_prologue): Prefer the frame pointer when emitting CFI.
19638 (gcn_frame_pointer_rqd): New function.
19639 (TARGET_FRAME_POINTER_REQUIRED): New hook.
19641 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19643 * config/gcn/gcn.c (move_callee_saved_registers): Emit CFI notes for
19644 prologue register saves.
19645 (gcn_debug_unwind_info): Use UI_DWARF2.
19646 (gcn_dwarf_register_number): Map DWARF_LINK_REGISTER to DWARF PC.
19647 (gcn_dwarf_register_span): DWARF_LINK_REGISTER doesn't span.
19648 * config/gcn/gcn.h: (DWARF_FRAME_RETURN_COLUMN): New define.
19649 (DWARF_LINK_REGISTER): New define.
19650 (FIRST_PSEUDO_REGISTER): Increment.
19651 (FIXED_REGISTERS): Add entry for DWARF_LINK_REGISTER.
19652 (CALL_USED_REGISTERS): Likewise.
19653 (REGISTER_NAMES): Likewise.
19655 2021-06-30 Richard Biener <rguenther@suse.de>
19657 PR tree-optimization/101267
19658 * tree-vect-stmts.c (vect_check_scalar_mask): Adjust
19659 API and use SLP compatible interface of vect_is_simple_use.
19660 Reject not vectorized SLP defs for callers that do not support
19662 (vect_check_store_rhs): Handle masked stores and pass down
19663 the appropriate operator index.
19664 (vectorizable_call): Adjust.
19665 (vectorizable_store): Likewise.
19666 (vectorizable_load): Likewise. Handle SLP pecularity of
19668 (vect_is_simple_use): Remove special-casing of masked stores.
19670 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
19672 * common.opt (foffload): Remove help as Driver only.
19673 * gcc.c (display_help): Add -foffload.
19675 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
19677 * gcc.c (close_at_file, execute): Replace alloca by XALLOCAVEC.
19678 (check_offload_target_name): Fix splitting OFFLOAD_TARGETS into
19679 a candidate list; better inform no offload target is configured
19680 and fix hint extraction when passed target is not '\0' at [len].
19681 * common.opt (foffload): Add tailing '.'.
19682 (foffload-options): Likewise; fix flag name in the help string.
19684 2021-06-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
19687 * config/arm/arm_neon.h: Move vabs intrinsics before vcage_f32.
19688 (vcage_f32): Gate comparison on __FAST_MATH__.
19689 (vcageq_f32): Likewise.
19690 (vcale_f32): Likewise.
19691 (vcaleq_f32): Likewise.
19692 (vcagt_f32): Likewise.
19693 (vcagtq_f32): Likewise.
19694 (vcalt_f32): Likewise.
19695 (vcaltq_f32): Likewise.
19696 (vcage_f16): Likewise.
19697 (vcageq_f16): Likewise.
19698 (vcale_f16): Likewise.
19699 (vcaleq_f16): Likewise.
19700 (vcagt_f16): Likewise.
19701 (vcagtq_f16): Likewise.
19702 (vcalt_f16): Likewise.
19703 (vcaltq_f16): Likewise.
19705 2021-06-30 Richard Biener <rguenther@suse.de>
19707 PR tree-optimization/101264
19708 * tree-vect-slp.c (vect_optimize_slp): Propagate the
19709 computed perm_in to all "any" permute successors
19710 we cannot de-duplicate immediately.
19712 2021-06-30 liuhongt <hongtao.liu@intel.com>
19715 * config/i386/sse.md
19716 (avx512f_sfixupimm<mode><sd_maskz_name><round_saeonly_name>):
19718 (avx512f_sfixupimm<mode><maskz_scalar_name><round_saeonly_name>):
19720 (avx512f_sfixupimm<mode>_mask<round_saeonly_name>"): Refined.
19721 * config/i386/subst.md (maskz_scalar): New define_subst.
19722 (maskz_scalar_name): New subst_attr.
19723 (maskz_scalar_op5): Ditto.
19724 (round_saeonly_maskz_scalar_op5): Ditto.
19725 (round_saeonly_maskz_scalar_operand5): Ditto.
19727 2021-06-30 David Edelsohn <dje.gcc@gmail.com>
19729 * config/rs6000/rs6000.c (rs6000_xcoff_section_type_flags):
19730 Increase code CSECT alignment to at least 32 bytes.
19731 * config/rs6000/xcoff.h (TEXT_SECTION_ASM_OP): Add 32 byte
19732 alignment designation.
19734 2021-06-29 Sergei Trofimovich <siarheit@google.com>
19736 * doc/generic.texi: Fix s/net yet/not yet/ typo.
19738 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19740 PR tree-optimization/101254
19741 * range-op.cc (operator_minus::op1_op2_relation_effect): Check for
19742 wrapping/non-wrapping when setting the result range.
19744 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19746 * value-query.cc (gimple_range_global): Allow phis.
19748 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19750 * vr-values.c (vr_values::vrp_stmt_computes_nonzero): Use stmt.
19751 (simplify_using_ranges::op_with_boolean_value_range_p): Add a
19752 statement for location context.
19753 (check_for_binary_op_overflow): Ditto.
19754 (simplify_using_ranges::get_vr_for_comparison): Ditto.
19755 (simplify_using_ranges::compare_name_with_value): Ditto.
19756 (simplify_using_ranges::compare_names): Ditto.
19757 (vrp_evaluate_conditional_warnv_with_ops_using_ranges): Ditto.
19758 (simplify_using_ranges::simplify_truth_ops_using_ranges): Ditto.
19759 (simplify_using_ranges::simplify_min_or_max_using_ranges): Ditto.
19760 (simplify_using_ranges::simplify_internal_call_using_ranges): Ditto.
19761 (simplify_using_ranges::two_valued_val_range_p): Ditto.
19762 (simplify_using_ranges::simplify): Ditto.
19763 * vr-values.h: Adjust prototypes.
19765 2021-06-29 Uroš Bizjak <ubizjak@gmail.com>
19768 * config/i386/mmx.md (vec_addsubv2sf3): New insn pattern.
19770 2021-06-29 Julian Brown <julian@codesourcery.com>
19772 * config/gcn/gcn.c (gcn_init_libfuncs): New function.
19773 (TARGET_INIT_LIBFUNCS): Define target hook using above function.
19774 * config/gcn/gcn.h (UNITS_PER_WORD): Define to 8 for IN_LIBGCC2, 4
19776 (LIBGCC2_UNITS_PER_WORD, BITS_PER_WORD): Remove definitions.
19777 (MAX_FIXED_MODE_SIZE): Change to 128.
19779 2021-06-29 Julian Brown <julian@codesourcery.com>
19781 * config/gcn/gcn.md (UNSPEC_FLBIT_INT): New unspec constant.
19782 (s_mnemonic): Add clrsb.
19783 (gcn_flbit<mode>_int): Add insn pattern for SImode/DImode.
19784 (clrsb<mode>2): Add expander for SImode/DImode.
19786 2021-06-29 Julian Brown <julian@codesourcery.com>
19788 * config/gcn/gcn.md (<su>mulsidi3, <su>mulsidi3_reg, <su>mulsidi3_imm,
19789 muldi3): Add patterns.
19791 2021-06-29 Julian Brown <julian@codesourcery.com>
19793 * config/gcn/gcn.md (<su>mulsi3_highpart): Change to expander.
19794 (<su>mulsi3_highpart_reg, <su>mulsi3_highpart_imm): New patterns.
19796 2021-06-29 Julian Brown <julian@codesourcery.com>
19798 * config/gcn/gcn.md (mulsi3): Make s_mulk_i32 variant clobber SCC.
19800 2021-06-29 Joseph Myers <joseph@codesourcery.com>
19802 * btfout.c, ctfout.c: Include "memmodel.h".
19804 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19806 * gcc.c (check_offload_target_name): Cast len argument to
19807 %q.*s to 'int'; avoid -Wstringop-truncation warning.
19809 2021-06-29 Richard Biener <rguenther@suse.de>
19811 * tree-vect-slp.c (vect_optimize_slp): Forward propagate
19812 to "any" permute nodes and relax "any" permute proapgation
19813 during iterative backward propagation.
19815 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19818 * common.opt (-foffload=): Update description.
19819 (-foffload-options=): New.
19820 * doc/invoke.texi (C Language Options): Document
19821 -foffload and -foffload-options.
19822 * gcc.c (check_offload_target_name): New, split off from
19823 handle_foffload_option.
19824 (check_foffload_target_names): New.
19825 (handle_foffload_option): Handle -foffload=default.
19826 (driver_handle_option): Update for -foffload-options.
19827 * lto-opts.c (lto_write_options): Use -foffload-options
19828 instead of -foffload.
19829 * lto-wrapper.c (merge_and_complain, append_offload_options):
19831 * opts.c (common_handle_option): Likewise.
19833 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19835 * doc/invoke.texi (C Language Options): Sort options
19836 alphabetically in optlist and also the description itself.
19837 Remove leftover -fallow-single-precision from and add missing
19838 -fgnu-tm to the optlist.
19840 2021-06-29 Richard Biener <rguenther@suse.de>
19842 * tree-vect-slp.c (slpg_vertex::visited): Remove.
19843 (vect_slp_perms_eq): Handle -1 permutes.
19844 (vect_optimize_slp): Rewrite permute propagation.
19846 2021-06-29 Jakub Jelinek <jakub@redhat.com>
19849 * match.pd ((intptr_t)x eq/ne CST to x eq/ne (typeof x) CST): Don't
19850 perform the optimization in GENERIC when sanitizing and x has a
19853 2021-06-29 Richard Biener <rguenther@suse.de>
19855 PR tree-optimization/101242
19856 * tree-vect-slp.c (vect_slp_build_vertices): Force-add
19857 PHIs with not represented initial values as leafs.
19859 2021-06-29 Jan-Benedict Glaw <jbglaw@getslash.de>
19861 * config/pdp11/pdp11.h (ASM_OUTPUT_SKIP): Fix signedness warning.
19862 * config/pdp11/pdp11.c (pdp11_asm_print_operand_punct_valid_p): Remove
19863 "register" keyword.
19864 (pdp11_initial_elimination_offset) Remove unused variable.
19865 (pdp11_cmp_length) Ditto.
19866 (pdp11_insn_cost): Ditto, and fix signedness warning.
19868 2021-06-29 David Edelsohn <dje.gcc@gmail.com>
19870 * btfout.c: Include tm_p.h.
19873 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19875 * config/bpf/bpf.c (bpf_expand_prologue): Do not mark insns as
19877 (bpf_expand_epilogue): Likewise.
19878 * config/bpf/bpf.h (DWARF2_FRAME_INFO): Define to 0.
19879 Do not define DBX_DEBUGGING_INFO.
19881 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19883 * doc/invoke.texi: Document the CTF and BTF debug info options.
19885 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19886 David Faust <david.faust@oracle.com>
19887 Jose E. Marchesi <jose.marchesi@oracle.com>
19888 Weimin Pan <weimin.pan@oracle.com>
19890 * Makefile.in: Add ctfc.*, ctfout.c and btfout.c files to
19891 GTFILES. Add new object files.
19892 * common.opt: Add CTF and BTF debug info options.
19893 * btfout.c: New file.
19894 * ctfc.c: Likewise.
19895 * ctfc.h: Likewise.
19896 * ctfout.c: Likewise.
19897 * dwarf2ctf.c: Likewise.
19898 * dwarf2ctf.h: Likewise.
19899 * dwarf2cfi.c (dwarf2out_do_frame): Acknowledge CTF_DEBUG and
19901 * dwarf2out.c (dwarf2out_source_line): Likewise.
19902 (dwarf2out_finish): Skip emitting DWARF if CTF or BTF are to
19904 (debug_format_do_cu): New function.
19905 (dwarf2out_early_finish): Traverse DIEs and emit CTF/BTF for
19907 Include dwarf2ctf.c.
19908 * final.c (dwarf2_debug_info_emitted_p): Acknowledge DWARF-based debug
19910 * flag-types.h (enum debug_info_type): Add CTF_DEBUG and BTF_DEBUG.
19911 (CTF_DEBUG): New bitmask.
19912 (BTF_DEBUG): Likewise.
19913 (enum ctf_debug_info_levels): New enum.
19914 * gengtype.c (open_base_files): Handle ctfc.h.
19915 (main): Handle uint32_t type.
19916 * flags.h (btf_debuginfo_p): New definition.
19917 (dwarf_based_debuginfo_p): Likewise.
19918 * opts.c (debug_type_names): Add entries for CTF and BTF.
19919 (btf_debuginfo_p): New function.
19920 (dwarf_based_debuginfo_p): Likewise.
19921 (common_handle_option): Handle -gctfN and -gbtf options.
19922 (set_debug_level): Set CTF_DEBUG, BTF_DEBUG whenever appropriate.
19923 * toplev.c (process_options): Inform the user and ignore -gctfLEVEL if
19926 2021-06-28 Jose E. Marchesi <jose.marchesi@oracle.com>
19928 * dwarf2out.c (AT_class): Function is no longer static.
19929 (AT_int): Likewise.
19930 (AT_unsigned): Likewise.
19931 (AT_loc): Likewise.
19932 (get_AT): Likewise.
19933 (get_AT_string): Likewise.
19934 (get_AT_flag): Likewise.
19935 (get_AT_unsigned): Likewise.
19936 (get_AT_ref): Likewise.
19937 (new_die_raw): Likewise.
19938 (lookup_decl_die): Likewise.
19939 (base_type_die): Likewise.
19940 (add_name_attribute): Likewise.
19941 (add_AT_int): Likewise.
19942 (add_AT_unsigned): Likewise.
19943 (add_AT_loc): Likewise.
19944 (dw_get_die_tag): New function.
19945 (dw_get_die_child): Likewise.
19946 (dw_get_die_sib): Likewise.
19947 (struct dwarf_file_data): Move from here to dwarf2out.h
19948 (struct dw_attr_struct): Likewise.
19949 * dwarf2out.h: Analogous changes.
19951 2021-06-28 Martin Jambor <mjambor@suse.cz>
19954 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
19955 members m_dead_stmts and m_dead_ssas.
19956 * ipa-param-manipulation.c
19957 (ipa_param_body_adjustments::mark_dead_statements): New function.
19958 (ipa_param_body_adjustments::common_initialization): Call it on
19959 all removed but not split parameters.
19960 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
19962 (ipa_param_body_adjustments::modify_call_stmt): Remove arguments that
19964 * tree-inline.c (remap_gimple_stmt): Do not copy dead statements, reset
19965 dead debug statements.
19966 (copy_phis_for_bb): Do not copy dead PHI nodes.
19968 2021-06-28 Martin Jambor <mjambor@suse.cz>
19971 * symtab-clones.h (clone_info): Removed member param_adjustments.
19972 * ipa-param-manipulation.h: Adjust initial comment to reflect how we
19973 deal with pass-through splits now.
19974 (ipa_param_performed_split): Removed.
19975 (ipa_param_adjustments::modify_call): Adjusted parameters.
19976 (class ipa_param_body_adjustments): Adjusted parameters of
19977 register_replacement, modify_gimple_stmt and modify_call_stmt.
19978 (ipa_verify_edge_has_no_modifications): Declare.
19979 (ipa_edge_modifications_finalize): Declare.
19980 * cgraph.c (cgraph_edge::redirect_call_stmt_to_callee): Remove
19981 performed_splits processing, pas only edge to padjs->modify_call,
19982 check that call arguments were not modified if they should not have
19984 * cgraphclones.c (cgraph_node::create_clone): Do not copy performed
19986 * ipa-param-manipulation.c (struct pass_through_split_map): New type.
19987 (ipa_edge_modification_info): Likewise.
19988 (ipa_edge_modification_sum): Likewise.
19989 (ipa_edge_modifications): New edge summary.
19990 (ipa_verify_edge_has_no_modifications): New function.
19991 (transitive_split_p): Removed.
19992 (transitive_split_map): Likewise.
19993 (init_transitive_splits): Likewise.
19994 (ipa_param_adjustments::modify_call): Adjusted to use the new edge
19995 summary instead of performed_splits.
19996 (ipa_param_body_adjustments::register_replacement): Drop dummy
19997 parameter, set base_index of the created ipa_param_body_replacement.
19998 (phi_arg_will_live_p): New function.
19999 (ipa_param_body_adjustments::common_initialization): Do not create
20000 IPA_SRA dummy decls.
20001 (simple_tree_swap_info): Removed.
20002 (remap_split_decl_to_dummy): Likewise.
20003 (record_argument_state_1): New function.
20004 (record_argument_state): Likewise.
20005 (ipa_param_body_adjustments::modify_call_stmt): New parameter
20006 orig_stmt. Do not work with dummy decls, save necessary info about
20007 changes to ipa_edge_modifications.
20008 (ipa_param_body_adjustments::modify_gimple_stmt): New parameter
20009 orig_stmt, pass it to modify_call_stmt.
20010 (ipa_param_body_adjustments::modify_cfun_body): Adjust call to
20011 modify_gimple_stmt.
20012 (ipa_edge_modifications_finalize): New function.
20013 * tree-inline.c (remap_gimple_stmt): Pass original statement to
20014 modify_gimple_stmt.
20015 (copy_phis_for_bb): Do not copy dead PHI nodes.
20016 (expand_call_inline): Do not remap performed_splits.
20017 (update_clone_info): Likewise.
20018 * toplev.c: Include ipa-param-manipulation.h.
20019 (toplev::finalize): Call ipa_edge_modifications_finalize.
20021 2021-06-28 Andrew Pinski <apinski@marvell.com>
20023 * tree-ssa-phiopt.c (replace_phi_edge_with_variable): Duplicate range
20024 info if we're the only things setting the target PHI.
20025 (value_replacement): Don't duplicate range here.
20026 (minmax_replacement): Likewise.
20028 2021-06-28 Richard Biener <rguenther@suse.de>
20030 PR tree-optimization/101229
20031 * gimple-walk.c (gimple_walk_op): Handle PHIs.
20033 2021-06-28 Martin Liska <mliska@suse.cz>
20035 * config/v850/v850.c (construct_dispose_instruction): Allocate
20037 (construct_prepare_instruction): Likewise.
20039 2021-06-28 Martin Liska <mliska@suse.cz>
20041 * config/v850/v850.c (v850_option_override): Build default
20043 (v850_can_inline_p): New. Allow MASK_PROLOG_FUNCTION to be
20044 ignored for inlining.
20045 (TARGET_CAN_INLINE_P): New.
20047 2021-06-28 Richard Biener <rguenther@suse.de>
20049 PR tree-optimization/101207
20050 * tree-vect-slp.c (vect_optimize_slp): Do BB reduction
20051 permute eliding for load permutations properly.
20053 2021-06-28 Richard Biener <rguenther@suse.de>
20055 PR tree-optimization/101173
20056 * gimple-loop-interchange.cc
20057 (tree_loop_interchange::valid_data_dependences): Disallow outer
20058 loop dependence distance of zero.
20060 2021-06-28 liuhongt <hongtao.liu@intel.com>
20063 * config/i386/sse.md (*avx_cmp<mode>3_lt): New
20064 define_insn_and_split.
20065 (*avx_cmp<mode>3_ltint): Ditto.
20066 (*avx2_pcmp<mode>3_3): Ditto.
20067 (*avx2_pcmp<mode>3_4): Ditto.
20068 (*avx2_pcmp<mode>3_5): Ditto.
20070 2021-06-28 liuhongt <hongtao.liu@intel.com>
20072 * config/i386/i386-builtin.def (IX86_BUILTIN_BLENDVPD256,
20073 IX86_BUILTIN_BLENDVPS256, IX86_BUILTIN_PBLENDVB256,
20074 IX86_BUILTIN_BLENDVPD, IX86_BUILTIN_BLENDVPS,
20075 IX86_BUILTIN_PBLENDVB128): Replace icode with
20077 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold blendv
20079 * config/i386/sse.md (*<sse4_1_avx2>_pblendvb_lt_subreg_not):
20080 New pre_reload splitter.
20082 2021-06-27 Andrew Pinski <apinski@marvell.com>
20084 PR middle-end/101230
20085 * fold-const.c (fold_ternary_loc): Check
20086 the return value of invert_tree_comparison.
20088 2021-06-27 David Edelsohn <dje.gcc@gmail.com>
20090 * config.gcc: Add SPDX License Identifier.
20091 (powerpc-ibm-aix789): Default to aix73.h.
20092 (powerpc-ibm-aix7.2.*.*): New stanza.
20093 * config/rs6000/aix72.h: Add SPDX License Identifier.
20094 * config/rs6000/aix73.h: New file.
20096 2021-06-26 Jason Merrill <jason@redhat.com>
20098 * except.c: #include "dwarf2.h" instead of "dwarf2out.h".
20100 2021-06-26 Andrew Pinski <apinski@marvell.com>
20102 * genmatch.c (lower_cond): Copy for_subst_vec
20103 for the simplify also.
20104 (lower): Swap the order for lower_for and lower_cond.
20106 2021-06-26 Andrew Pinski <apinski@marvell.com>
20108 * tree-ssa-phiopt.c (match_simplify_replacement): Reset
20109 flow senatitive info on the moved ssa set.
20111 2021-06-26 Andrew Pinski <apinski@marvell.com>
20113 * fold-const.c (fold_cond_expr_with_comparison):
20114 Exand arg0 into comp_code, arg00, and arg01.
20115 (fold_ternary_loc): Use invert_tree_comparison
20116 instead of fold_invert_truthvalue for the case
20117 where we have A CMP B ? C : A.
20119 2021-06-25 Martin Sebor <msebor@redhat.com>
20121 PR middle-end/101216
20122 * calls.c (maybe_warn_rdwr_sizes): Use the no_warning constant.
20124 2021-06-25 Jeff Law <jeffreyalaw@gmail.com>
20126 * config/h8300/h8300.c (select_cc_mode): Handle ASHIFTRT and LSHIFTRT.
20128 2021-06-25 Richard Biener <rguenther@suse.de>
20130 PR tree-optimization/101202
20131 * tree-vect-slp.c (vect_optimize_slp): Explicitely handle
20134 2021-06-25 Richard Biener <rguenther@suse.de>
20136 * tree-vect-slp-patterns.c (addsub_pattern::build): Copy
20137 STMT_VINFO_REDUC_DEF from the original representative.
20139 2021-06-25 Martin Sebor <msebor@redhat.com>
20141 * builtins.c (warn_string_no_nul): Replace uses of TREE_NO_WARNING,
20142 gimple_no_warning_p and gimple_set_no_warning with
20143 warning_suppressed_p, and suppress_warning.
20145 (maybe_warn_for_bound): Same.
20146 (warn_for_access): Same.
20147 (check_access): Same.
20148 (expand_builtin_strncmp): Same.
20149 (fold_builtin_varargs): Same.
20150 * calls.c (maybe_warn_nonstring_arg): Same.
20151 (maybe_warn_rdwr_sizes): Same.
20152 * cfgexpand.c (expand_call_stmt): Same.
20153 * cgraphunit.c (check_global_declaration): Same.
20154 * fold-const.c (fold_undefer_overflow_warnings): Same.
20155 (fold_truth_not_expr): Same.
20156 (fold_unary_loc): Same.
20157 (fold_checksum_tree): Same.
20158 * gimple-array-bounds.cc (array_bounds_checker::check_array_ref): Same.
20159 (array_bounds_checker::check_mem_ref): Same.
20160 (array_bounds_checker::check_addr_expr): Same.
20161 (array_bounds_checker::check_array_bounds): Same.
20162 * gimple-expr.c (copy_var_decl): Same.
20163 * gimple-fold.c (gimple_fold_builtin_strcpy): Same.
20164 (gimple_fold_builtin_strncat): Same.
20165 (gimple_fold_builtin_stxcpy_chk): Same.
20166 (gimple_fold_builtin_stpcpy): Same.
20167 (gimple_fold_builtin_sprintf): Same.
20168 (fold_stmt_1): Same.
20169 * gimple-ssa-isolate-paths.c (diag_returned_locals): Same.
20170 * gimple-ssa-nonnull-compare.c (do_warn_nonnull_compare): Same.
20171 * gimple-ssa-sprintf.c (handle_printf_call): Same.
20172 * gimple-ssa-store-merging.c (imm_store_chain_info::output_merged_store): Same.
20173 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
20174 * gimple-ssa-warn-restrict.h: Adjust declarations.
20175 (maybe_diag_access_bounds): Replace uses of TREE_NO_WARNING,
20176 gimple_no_warning_p and gimple_set_no_warning with
20177 warning_suppressed_p, and suppress_warning.
20178 (check_call): Same.
20179 (check_bounds_or_overlap): Same.
20180 * gimple.c (gimple_build_call_from_tree): Same.
20181 * gimplify.c (gimplify_return_expr): Same.
20182 (gimplify_cond_expr): Same.
20183 (gimplify_modify_expr_complex_part): Same.
20184 (gimplify_modify_expr): Same.
20185 (gimple_push_cleanup): Same.
20186 (gimplify_expr): Same.
20187 * omp-expand.c (expand_omp_for_generic): Same.
20188 (expand_omp_taskloop_for_outer): Same.
20189 * omp-low.c (lower_rec_input_clauses): Same.
20190 (lower_lastprivate_clauses): Same.
20191 (lower_send_clauses): Same.
20192 (lower_omp_target): Same.
20193 * tree-cfg.c (pass_warn_function_return::execute): Same.
20194 * tree-complex.c (create_one_component_var): Same.
20195 * tree-inline.c (remap_gimple_op_r): Same.
20196 (copy_tree_body_r): Same.
20197 (declare_return_variable): Same.
20198 (expand_call_inline): Same.
20199 * tree-nested.c (lookup_field_for_decl): Same.
20200 * tree-sra.c (create_access_replacement): Same.
20201 (generate_subtree_copies): Same.
20202 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Same.
20203 * tree-ssa-forwprop.c (combine_cond_expr_cond): Same.
20204 * tree-ssa-loop-ch.c (ch_base::copy_headers): Same.
20205 * tree-ssa-loop-im.c (execute_sm): Same.
20206 * tree-ssa-phiopt.c (cond_store_replacement): Same.
20207 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
20208 (handle_builtin_strcpy): Same.
20209 (maybe_diag_stxncpy_trunc): Same.
20210 (handle_builtin_stxncpy_strncat): Same.
20211 (handle_builtin_strcat): Same.
20212 * tree-ssa-uninit.c (get_no_uninit_warning): Same.
20213 (set_no_uninit_warning): Same.
20214 (uninit_undefined_value_p): Same.
20215 (warn_uninit): Same.
20216 (maybe_warn_operand): Same.
20217 * tree-vrp.c (compare_values_warnv): Same.
20218 * vr-values.c (vr_values::extract_range_for_var_from_comparison_expr): Same.
20219 (test_for_singularity): Same.
20220 * gimple.h (warning_suppressed_p): New function.
20221 (suppress_warning): Same.
20222 (copy_no_warning): Same.
20223 (gimple_set_block): Call gimple_set_location.
20224 (gimple_set_location): Call copy_warning.
20226 2021-06-25 Martin Sebor <msebor@redhat.com>
20228 * tree.h (warning_suppressed_at, copy_warning,
20229 warning_suppressed_p, suppress_warning): New functions.
20231 2021-06-25 Martin Sebor <msebor@redhat.com>
20233 * Makefile.in (OBJS-libcommon): Add diagnostic-spec.o.
20234 * gengtype.c (open_base_files): Add diagnostic-spec.h.
20235 * diagnostic-spec.c: New file.
20236 * diagnostic-spec.h: New file.
20237 * tree.h (no_warning, all_warnings, suppress_warning_at): New
20239 * warning-control.cc: New file.
20241 2021-06-25 liuhongt <hongtao.liu@intel.com>
20244 * config/i386/i386.c (x86_order_regs_for_local_alloc):
20247 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20249 PR tree-optimization/101189
20250 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Pass
20251 LHS range of condition to postfold routine.
20252 (fold_using_range::postfold_gcond_edges): Only process the TRUE or
20253 FALSE edge if the LHS range supports it being taken.
20254 * gimple-range-fold.h (postfold_gcond_edges): Add range parameter.
20256 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20258 * value-relation.cc (equiv_oracle::dump): Do not dump NULL blocks.
20259 (relation_oracle::find_relation_block): Check correct bitmap.
20260 (relation_oracle::dump): Do not dump NULL blocks.
20262 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20264 * gimple-range-cache.cc (ranger_cache::propagate_cache): Call
20265 range_on_edge instead of manually calculating.
20267 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20269 * range-op.cc: Fix comment.
20271 2021-06-24 Uroš Bizjak <ubizjak@gmail.com>
20274 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
20275 Handle V8QI and V4HI modes.
20276 * config/i386/mmx.md (sse4_1_<any_extend:code>v4qiv4hi2):
20278 (sse4_1_<any_extend:code>v4qiv4hi2): Ditto.
20279 (mmxpackmode): New mode attribute.
20280 (vec_pack_trunc_<mmxpackmode:mode>): New expander.
20281 (mmxunpackmode): New mode attribute.
20282 (vec_unpacks_lo_<mmxunpackmode:mode>): New expander.
20283 (vec_unpacks_hi_<mmxunpackmode:mode>): Ditto.
20284 (vec_unpacku_lo_<mmxunpackmode:mode>): Ditto.
20285 (vec_unpacku_hi_<mmxunpackmode:mode>): Ditto.
20286 * config/i386/i386.md (extsuffix): Move from ...
20287 * config/i386/sse.md: ... here.
20289 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
20291 * dwarf2out.c (dwarf2out_assembly_start): Emit .file 0 marker here..
20292 (dwarf2out_finish): ...instead of here.
20294 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
20296 * configure.ac (--gdwarf-5 option): Use objdump instead of readelf.
20297 (working --gdwarf-4/--gdwarf-5 for all sources): Likewise.
20298 (--gdwarf-4 not refusing generated .debug_line): Adjust for Windows.
20299 * configure: Regenerate.
20301 2021-06-24 Richard Biener <rguenther@suse.de>
20303 * config/i386/sse.md (vec_addsubv4df3, vec_addsubv2df3,
20304 vec_addsubv8sf3, vec_addsubv4sf3): Merge into ...
20305 (vec_addsub<mode>3): ... using a new addsub_cst mode attribute.
20307 2021-06-24 Richard Biener <rguenther@suse.de>
20309 * config/i386/sse.md (avx_addsubv4df3): Rename to
20311 (avx_addsubv8sf3): Rename to vec_addsubv8sf3.
20312 (sse3_addsubv2df3): Rename to vec_addsubv2df3.
20313 (sse3_addsubv4sf3): Rename to vec_addsubv4sf3.
20314 * config/i386/i386-builtin.def: Adjust.
20315 * internal-fn.def (VEC_ADDSUB): New internal optab fn.
20316 * optabs.def (vec_addsub_optab): New optab.
20317 * tree-vect-slp-patterns.c (class addsub_pattern): New.
20318 (slp_patterns): Add addsub_pattern.
20319 * tree-vect-slp.c (vect_optimize_slp): Disable propagation
20320 across CFN_VEC_ADDSUB.
20321 * tree-vectorizer.h (vect_pattern::vect_pattern): Make
20323 * doc/md.texi (vec_addsub<mode>3): Document.
20325 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20327 PR middle-end/101170
20328 * df-scan.c (df_ref_record): For paradoxical big-endian SUBREGs
20329 where regno + subreg_regno_offset wraps around use 0 as starting
20332 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20334 PR middle-end/101172
20335 * stor-layout.c (finish_bitfield_representative): If nextf has
20336 error_mark_node type, set repr type to error_mark_node too.
20338 2021-06-24 Ilya Leoshkevich <iii@linux.ibm.com>
20340 * config/s390/s390.c (s390_function_profiler): Ignore labelno
20342 * config/s390/s390.h (NO_PROFILE_COUNTERS): Define.
20344 2021-06-24 Richard Biener <rguenther@suse.de>
20346 * tree-vect-slp.c (vect_optimize_slp): Do not propagate
20347 across operations that have different semantics on different
20350 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20352 * tree.h (OMP_CLAUSE_MAP_IN_REDUCTION): Document meaning for OpenMP.
20353 * gimplify.c (gimplify_scan_omp_clauses): For OpenMP map clauses
20354 with OMP_CLAUSE_MAP_IN_REDUCTION flag partially defer gimplification
20355 of non-decl OMP_CLAUSE_DECL. For OMP_CLAUSE_IN_REDUCTION on
20356 OMP_TARGET user outer_ctx instead of ctx for placeholders and
20357 initializer/combiner gimplification.
20358 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_MAP_IN_REDUCTION
20359 on target constructs.
20360 (lower_rec_input_clauses): Likewise.
20361 (lower_omp_target): Likewise.
20362 * omp-expand.c (expand_omp_target): Temporarily ignore nowait clause
20363 on target if in_reduction is present.
20365 2021-06-24 Kewen Lin <linkw@linux.ibm.com>
20367 * tree-predcom.c (class pcom_worker): New class.
20368 (release_chain): Renamed to...
20369 (pcom_worker::release_chain): ...this.
20370 (release_chains): Renamed to...
20371 (pcom_worker::release_chains): ...this.
20372 (aff_combination_dr_offset): Renamed to...
20373 (pcom_worker::aff_combination_dr_offset): ...this.
20374 (determine_offset): Renamed to...
20375 (pcom_worker::determine_offset): ...this.
20376 (class comp_ptrs): New class.
20377 (split_data_refs_to_components): Renamed to...
20378 (pcom_worker::split_data_refs_to_components): ...this,
20379 and update with class comp_ptrs.
20380 (suitable_component_p): Renamed to...
20381 (pcom_worker::suitable_component_p): ...this.
20382 (filter_suitable_components): Renamed to...
20383 (pcom_worker::filter_suitable_components): ...this.
20384 (valid_initializer_p): Renamed to...
20385 (pcom_worker::valid_initializer_p): ...this.
20386 (find_looparound_phi): Renamed to...
20387 (pcom_worker::find_looparound_phi): ...this.
20388 (add_looparound_copies): Renamed to...
20389 (pcom_worker::add_looparound_copies): ...this.
20390 (determine_roots_comp): Renamed to...
20391 (pcom_worker::determine_roots_comp): ...this.
20392 (determine_roots): Renamed to...
20393 (pcom_worker::determine_roots): ...this.
20394 (single_nonlooparound_use): Renamed to...
20395 (pcom_worker::single_nonlooparound_use): ...this.
20396 (remove_stmt): Renamed to...
20397 (pcom_worker::remove_stmt): ...this.
20398 (execute_pred_commoning_chain): Renamed to...
20399 (pcom_worker::execute_pred_commoning_chain): ...this.
20400 (execute_pred_commoning): Renamed to...
20401 (pcom_worker::execute_pred_commoning): ...this.
20402 (struct epcc_data): New member worker.
20403 (execute_pred_commoning_cbck): Call execute_pred_commoning
20404 with pcom_worker pointer.
20405 (find_use_stmt): Renamed to...
20406 (pcom_worker::find_use_stmt): ...this.
20407 (find_associative_operation_root): Renamed to...
20408 (pcom_worker::find_associative_operation_root): ...this.
20409 (find_common_use_stmt): Renamed to...
20410 (pcom_worker::find_common_use_stmt): ...this.
20411 (combinable_refs_p): Renamed to...
20412 (pcom_worker::combinable_refs_p): ...this.
20413 (reassociate_to_the_same_stmt): Renamed to...
20414 (pcom_worker::reassociate_to_the_same_stmt): ...this.
20415 (stmt_combining_refs): Renamed to...
20416 (pcom_worker::stmt_combining_refs): ...this.
20417 (combine_chains): Renamed to...
20418 (pcom_worker::combine_chains): ...this.
20419 (try_combine_chains): Renamed to...
20420 (pcom_worker::try_combine_chains): ...this.
20421 (prepare_initializers_chain): Renamed to...
20422 (pcom_worker::prepare_initializers_chain): ...this.
20423 (prepare_initializers): Renamed to...
20424 (pcom_worker::prepare_initializers): ...this.
20425 (prepare_finalizers_chain): Renamed to...
20426 (pcom_worker::prepare_finalizers_chain): ...this.
20427 (prepare_finalizers): Renamed to...
20428 (pcom_worker::prepare_finalizers): ...this.
20429 (tree_predictive_commoning_loop): Renamed to...
20430 (pcom_worker::tree_predictive_commoning_loop): ...this, adjust
20431 some calls and remove some cleanup code.
20432 (tree_predictive_commoning): Adjusted to use pcom_worker instance.
20433 (static variable looparound_phis): Remove.
20434 (static variable name_expansions): Remove.
20436 2021-06-24 Richard Biener <rguenther@suse.de>
20438 * tree-vect-slp.c (slpg_vertex): New struct.
20439 (vect_slp_build_vertices): Adjust.
20440 (vect_optimize_slp): Likewise. Maintain an outgoing permute
20441 and a materialized one.
20443 2021-06-24 Richard Biener <rguenther@suse.de>
20445 PR tree-optimization/101105
20446 * tree-vect-data-refs.c (vect_prune_runtime_alias_test_list):
20447 Only ignore steps when they are equal or scalar order is preserved.
20449 2021-06-24 liuhongt <hongtao.liu@intel.com>
20452 * config/i386/i386-expand.c (ix86_expand_vec_interleave):
20453 Adjust comments for ix86_expand_vecop_qihi2.
20454 (ix86_expand_vecmul_qihi): Renamed to ..
20455 (ix86_expand_vecop_qihi2): Adjust function prototype to
20456 support shift operation, add static to definition.
20457 (ix86_expand_vec_shift_qihi_constant): Add static to definition.
20458 (ix86_expand_vecop_qihi): Call ix86_expand_vecop_qihi2 and
20459 ix86_expand_vec_shift_qihi_constant.
20460 * config/i386/i386-protos.h (ix86_expand_vecmul_qihi): Deleted.
20461 (ix86_expand_vec_shift_qihi_constant): Deleted.
20462 * config/i386/sse.md (VI12_256_512_AVX512VL): New mode
20464 (mulv8qi3): Call ix86_expand_vecop_qihi directly, add
20465 condition TARGET_64BIT.
20466 (mul<mode>3): Ditto.
20467 (<insn><mode>3): Ditto.
20468 (vlshr<mode>3): Extend to support avx512 vlshr.
20469 (v<insn><mode>3): New expander for
20471 (v<insn>v8qi3): Ditto.
20472 (vashrv8hi3<mask_name>): Renamed to ..
20473 (vashr<mode>3): And extend to support V16QImode for avx512.
20474 (vashrv16qi3): Deleted.
20475 (vashrv2di3<mask_name>): Extend expander to support avx512
20478 2021-06-23 Dimitar Dimitrov <dimitar@dinux.eu>
20480 * doc/lto.texi (Design Overview): Update that slim objects are
20483 2021-06-23 Aaron Sawdey <acsawdey@linux.ibm.com>
20485 * config/rs6000/rs6000-cpus.def: Take OPTION_MASK_PCREL_OPT out
20486 of OTHER_POWER10_MASKS so it will not be enabled by default.
20488 2021-06-23 Richard Biener <rguenther@suse.de>
20489 Martin Jambor <mjambor@suse.cz>
20491 * tree-inline.c (setup_one_parameter): Set TREE_READONLY of the
20492 param replacement unconditionally. Adjust comment.
20494 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20496 * Makefile.in (OBJS): Add gimple-range-fold.o
20497 * gimple-range-fold.cc: New.
20498 * gimple-range-fold.h: New.
20499 * gimple-range-gori.cc (gimple_range_calc_op1): Move to here.
20500 (gimple_range_calc_op2): Ditto.
20501 * gimple-range-gori.h: Move prototypes to here.
20502 * gimple-range.cc: Adjust include files.
20503 (fur_source:fur_source): Relocate to gimple-range-fold.cc.
20504 (fur_source::get_operand): Ditto.
20505 (fur_source::get_phi_operand): Ditto.
20506 (fur_source::query_relation): Ditto.
20507 (fur_source::register_relation): Ditto.
20508 (class fur_edge): Ditto.
20509 (fur_edge::fur_edge): Ditto.
20510 (fur_edge::get_operand): Ditto.
20511 (fur_edge::get_phi_operand): Ditto.
20512 (fur_stmt::fur_stmt): Ditto.
20513 (fur_stmt::get_operand): Ditto.
20514 (fur_stmt::get_phi_operand): Ditto.
20515 (fur_stmt::query_relation): Ditto.
20516 (class fur_depend): Relocate to gimple-range-fold.h.
20517 (fur_depend::fur_depend): Relocate to gimple-range-fold.cc.
20518 (fur_depend::register_relation): Ditto.
20519 (fur_depend::register_relation): Ditto.
20520 (class fur_list): Ditto.
20521 (fur_list::fur_list): Ditto.
20522 (fur_list::get_operand): Ditto.
20523 (fur_list::get_phi_operand): Ditto.
20524 (fold_range): Ditto.
20525 (adjust_pointer_diff_expr): Ditto.
20526 (gimple_range_adjustment): Ditto.
20527 (gimple_range_base_of_assignment): Ditto.
20528 (gimple_range_operand1): Ditto.
20529 (gimple_range_operand2): Ditto.
20530 (gimple_range_calc_op1): Relocate to gimple-range-gori.cc.
20531 (gimple_range_calc_op2): Ditto.
20532 (fold_using_range::fold_stmt): Relocate to gimple-range-fold.cc.
20533 (fold_using_range::range_of_range_op): Ditto.
20534 (fold_using_range::range_of_address): Ditto.
20535 (fold_using_range::range_of_phi): Ditto.
20536 (fold_using_range::range_of_call): Ditto.
20537 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
20538 (fold_using_range::range_of_builtin_call): Ditto.
20539 (fold_using_range::range_of_cond_expr): Ditto.
20540 (fold_using_range::range_of_ssa_name_with_loop_info): Ditto.
20541 (fold_using_range::relation_fold_and_or): Ditto.
20542 (fold_using_range::postfold_gcond_edges): Ditto.
20543 * gimple-range.h: Add gimple-range-fold.h to include files. Change
20544 GIMPLE_RANGE_STMT_H to GIMPLE_RANGE_H.
20545 (gimple_range_handler): Relocate to gimple-range-fold.h.
20546 (gimple_range_ssa_p): Ditto.
20547 (range_compatible_p): Ditto.
20548 (class fur_source): Ditto.
20549 (class fur_stmt): Ditto.
20550 (class fold_using_range): Ditto.
20551 (gimple_range_calc_op1): Relocate to gimple-range-gori.h
20552 (gimple_range_calc_op2): Ditto.
20554 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20556 PR tree-optimization/101148
20557 PR tree-optimization/101014
20558 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
20559 (ranger_cache::~ranger_cache): Adjust.
20560 (ranger_cache::block_range): Check if propagation disallowed.
20561 (ranger_cache::propagate_cache): Disallow propagation if new value
20562 can't be stored properly.
20563 * gimple-range-cache.h (ranger_cache::m_propfail): New member.
20565 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20567 * gimple-range-cache.cc (class ssa_block_ranges): Adjust prototype.
20568 (sbr_vector::set_bb_range): Return true.
20569 (class sbr_sparse_bitmap): Adjust.
20570 (sbr_sparse_bitmap::set_bb_range): Return value.
20571 (block_range_cache::set_bb_range): Return value.
20572 (ranger_cache::propagate_cache): Use return value to print msg.
20573 * gimple-range-cache.h (class block_range_cache): Adjust.
20575 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20577 * gimple-range.cc (dump_bb): Use range_on_edge from the cache.
20579 2021-06-23 Jeff Law <jeffreyalaw@gmail.com>
20581 * config/h8300/logical.md (<code><mode>3<ccnz>): Use <cczn>
20582 so this pattern can be used for test/compare removal. Pass
20583 current insn to compute_logical_op_length and output_logical_op.
20584 * config/h8300/h8300.c (compute_logical_op_cc): Remove.
20585 (h8300_and_costs): Add argument to compute_logical_op_length.
20586 (output_logical_op): Add new argument. Use it to determine if the
20587 condition codes are used and adjust the output accordingly.
20588 (compute_logical_op_length): Add new argument and update length
20589 computations when condition codes are used.
20590 * config/h8300/h8300-protos.h (compute_logical_op_length): Update
20592 (output_logical_op): Likewise.
20594 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
20597 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
20598 Handle 64bit modes for TARGET_XOP. Use indirect gen_* functions.
20599 * config/i386/mmx.md (mmx_ppermv64): New insn pattern.
20600 * config/i386/i386.md (unspec): Move UNSPEC_XOP_PERMUTE from ...
20601 * config/i386/sse.md (unspec): ... here.
20603 2021-06-23 Martin Liska <mliska@suse.cz>
20606 * optc-save-gen.awk: Put back arm_fp16_format to
20609 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
20612 * config/i386/i386.md (bsr_rex64): Add zero-flag setting RTX.
20615 (clz<mode>2): Update RTX pattern for additions.
20617 2021-06-23 Jakub Jelinek <jakub@redhat.com>
20619 PR middle-end/101167
20620 * omp-low.c (lower_omp_regimplify_p): Regimplify also PARM_DECLs
20621 and RESULT_DECLs that have DECL_HAS_VALUE_EXPR_P set.
20623 2021-06-22 Sergei Trofimovich <siarheit@google.com>
20625 * doc/rtl.texi: drop unbalanced parenthesis.
20627 2021-06-22 Richard Biener <rguenther@suse.de>
20629 PR middle-end/101156
20630 * gimplify.c (gimplify_expr): Remove premature incorrect
20633 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20635 PR tree-optimization/101159
20636 * tree-vect-patterns.c (vect_recog_popcount_pattern): Fix some
20639 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20641 PR middle-end/101160
20642 * function.c (assign_parms): For decl_result with TYPE_EMPTY_P type
20643 clear crtl->return_rtx instead of keeping it referencing a pseudo.
20645 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20646 Andrew Pinski <apinski@marvell.com>
20648 PR tree-optimization/101162
20649 * fold-const.c (range_check_type): Handle OFFSET_TYPE like pointer
20652 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20654 * range-op.cc (range_relational_tests): New.
20655 (range_op_tests): Call range_relational_tests.
20657 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20659 * range-op.cc (operator_cast::lhs_op1_relation): New.
20660 (operator_identity::lhs_op1_relation): Mew.
20662 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20664 * range-op.cc (operator_minus::op1_op2_relation_effect): New.
20666 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20668 * range-op.cc (operator_plus::lhs_op1_relation): New.
20669 (operator_plus::lhs_op2_relation): New.
20671 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20673 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a
20674 relation_oracle if dominators exist.
20675 (ranger_cache::~ranger_cache): Dispose of oracle.
20676 (ranger_cache::dump_bb): Dump oracle.
20677 * gimple-range.cc (fur_source::fur_source): New.
20678 (fur_source::get_operand): Use mmeber query.
20679 (fur_source::get_phi_operand): Use member_query.
20680 (fur_source::query_relation): New.
20681 (fur_source::register_dependency): Delete.
20682 (fur_source::register_relation): New.
20683 (fur_edge::fur_edge): Adjust.
20684 (fur_edge::get_phi_operand): Fix comment.
20685 (fur_edge::query): Delete.
20686 (fur_stmt::fur_stmt): Adjust.
20687 (fur_stmt::query): Delete.
20688 (fur_depend::fur_depend): Adjust.
20689 (fur_depend::register_relation): New.
20690 (fur_depend::register_relation): New.
20691 (fur_list::fur_list): Adjust.
20692 (fur_list::get_operand): Use member query.
20693 (fold_using_range::range_of_range_op): Process and query relations.
20694 (fold_using_range::range_of_address): Adjust dependency call.
20695 (fold_using_range::range_of_phi): Ditto.
20696 (gimple_ranger::gimple_ranger): New. Use ranger_ache oracle.
20697 (fold_using_range::relation_fold_and_or): New.
20698 (fold_using_range::postfold_gcond_edges): New.
20699 * gimple-range.h (class gimple_ranger): Adjust.
20700 (class fur_source): Adjust members.
20701 (class fur_stmt): Ditto.
20702 (class fold_using_range): Ditto.
20704 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20706 * range-op.cc (range_operator::wi_fold): Apply relation effect.
20707 (range_operator::fold_range): Adjust and apply relation effect.
20708 (*::fold_range): Add relation parameters.
20709 (*::op1_range): Ditto.
20710 (*::op2_range): Ditto.
20711 (range_operator::lhs_op1_relation): New.
20712 (range_operator::lhs_op2_relation): New.
20713 (range_operator::op1_op2_relation): New.
20714 (range_operator::op1_op2_relation_effect): New.
20715 (relop_early_resolve): New.
20716 (operator_equal::op1_op2_relation): New.
20717 (operator_equal::fold_range): Call relop_early_resolve.
20718 (operator_not_equal::op1_op2_relation): New.
20719 (operator_not_equal::fold_range): Call relop_early_resolve.
20720 (operator_lt::op1_op2_relation): New.
20721 (operator_lt::fold_range): Call relop_early_resolve.
20722 (operator_le::op1_op2_relation): New.
20723 (operator_le::fold_range): Call relop_early_resolve.
20724 (operator_gt::op1_op2_relation): New.
20725 (operator_gt::fold_range): Call relop_early_resolve.
20726 (operator_ge::op1_op2_relation): New.
20727 (operator_ge::fold_range): Call relop_early_resolve.
20728 * range-op.h (class range_operator): Adjust parameters and methods.
20730 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20732 * Makefile.in (OBJS): Add value-relation.o.
20733 * gimple-range.h: Adjust include files.
20734 * tree-data-ref.c: Adjust include file order.
20735 * value-query.cc (range_query::get_value_range): Default to no oracle.
20736 (range_query::query_relation): New.
20737 (range_query::query_relation): New.
20738 * value-query.h (class range_query): Adjust.
20739 * value-relation.cc: New.
20740 * value-relation.h: New.
20742 2021-06-22 Richard Biener <rguenther@suse.de>
20744 PR tree-optimization/101151
20745 * tree-ssa-sink.c (statement_sink_location): Expand irreducible
20748 2021-06-22 Jojo R <rjiejie@linux.alibaba.com>
20750 * config/riscv/riscv.c (thead_c906_tune_info): New.
20751 (riscv_tune_info_table): Use new tune.
20753 2021-06-22 Richard Biener <rguenther@suse.de>
20755 PR tree-optimization/101158
20756 * tree-vect-slp.c (vect_build_slp_tree_1): Move same operand
20757 checking after checking for matching operation.
20759 2021-06-22 Richard Biener <rguenther@suse.de>
20761 PR tree-optimization/101159
20762 * tree-vect-patterns.c (vect_recog_popcount_pattern): Add
20763 missing NULL vectype check.
20765 2021-06-22 Richard Biener <rguenther@suse.de>
20767 PR tree-optimization/101154
20768 * tree-vect-slp.c (vect_build_slp_tree_2): Fix out-of-bound access.
20770 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20773 * config/i386/i386-protos.h (ix86_last_zero_store_uid): Declare.
20774 * config/i386/i386-expand.c (ix86_last_zero_store_uid): New variable.
20775 * config/i386/i386.c (ix86_expand_prologue): Clear it.
20776 * config/i386/i386.md (peephole2s for 1/2/4 stores of const0_rtx):
20777 Remove "" from match_operand. Emit new insns using emit_move_insn and
20778 set ix86_last_zero_store_uid to INSN_UID of the last store.
20779 Add peephole2s for 1/2/4 stores of const0_rtx following previous
20782 2021-06-22 Martin Liska <mliska@suse.cz>
20784 * auto-profile.c (AUTO_PROFILE_VERSION): Bump as string format
20787 2021-06-22 Martin Liska <mliska@suse.cz>
20789 * gcov-io.h: Remove padding entries.
20791 2021-06-22 liuhongt <hongtao.liu@intel.com>
20793 PR tree-optimization/97770
20794 * tree-vect-patterns.c (vect_recog_popcount_pattern):
20796 (vect_recog_func vect_vect_recog_func_ptrs): Add new pattern.
20798 2021-06-22 liuhongt <hongtao.liu@intel.com>
20801 * config/i386/i386-builtin.def (BDESC): Adjust builtin name.
20802 * config/i386/sse.md (<avx512>_expand<mode>_mask): Rename to ..
20803 (expand<mode>_mask): this ..
20804 (*expand<mode>_mask): New pre_reload splitter to transform
20805 v{,p}expand* to vmov* when mask is zero, all ones, or has all
20806 ones in it's lower part, otherwise still generate
20809 2021-06-22 liuhongt <hongtao.liu@intel.com>
20812 * config/i386/i386-expand.c
20813 (ix86_expand_special_args_builtin): Keep constm1_operand only
20814 if it satisfies insn's operand predicate.
20816 2021-06-21 Jason Merrill <jason@redhat.com>
20819 * df-scan.c (df_ref_record): Check that regno < endregno.
20820 * function.c (assign_parms, expand_function_end): Do nothing with a
20821 TYPE_EMPTY_P result.
20823 2021-06-21 Richard Biener <rguenther@suse.de>
20825 PR tree-optimization/101120
20826 * tree-vect-data-refs.c (bump_vector_ptr): Fold the
20828 * tree-vect-slp.c (vect_transform_slp_perm_load): Add
20829 DR chain DCE capability.
20830 * tree-vectorizer.h (vect_transform_slp_perm_load): Adjust.
20831 * tree-vect-stmts.c (vectorizable_load): Remove unused
20832 loads in the DR chain for SLP.
20834 2021-06-21 Jakub Jelinek <jakub@redhat.com>
20836 PR inline-asm/100785
20837 * gimplify.c (gimplify_asm_expr): Don't diagnose errors if
20838 output or input operands were already error_mark_node.
20839 * cfgexpand.c (expand_asm_stmt): If errors are emitted,
20840 remove all inputs, outputs and clobbers from the asm and
20841 set template to "".
20843 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20845 * config/arm/arm_neon.h (vceq_s8): Replace builtin with __a == __b.
20846 (vceq_s16): Likewise.
20847 (vceq_s32): Likewise.
20848 (vceq_u8): Likewise.
20849 (vceq_u16): Likewise.
20850 (vceq_u32): Likewise.
20851 (vceq_p8): Likewise.
20852 (vceqq_s8): Likewise.
20853 (vceqq_s16): Likewise.
20854 (vceqq_s32): Likewise.
20855 (vceqq_u8): Likewise.
20856 (vceqq_u16): Likewise.
20857 (vceqq_u32): Likewise.
20858 (vceqq_p8): Likewise.
20859 (vceq_f32): Gate __a == __b on __FAST_MATH__.
20860 (vceqq_f32): Likewise.
20861 (vceq_f16): Likewise.
20862 (vceqq_f16): Likewise.
20864 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20867 * config/arm/iterators.md (NEON_VACMP): Remove.
20868 * config/arm/neon.md (neon_vca<cmp_op><mode>): Use GLTE instead of GTGE
20870 (neon_vca<cmp_op><mode>_insn): Likewise.
20871 (neon_vca<cmp_op_unsp><mode>_insn_unspec): Use NEON_VAGLTE instead of
20874 2021-06-21 Richard Biener <rguenther@suse.de>
20876 PR tree-optimization/101121
20877 * tree-vect-slp.c (vect_build_slp_tree_2): To not fail fatally
20878 when we just lack a stmt with the desired op when doing permutation.
20879 (vect_build_slp_tree): When caching a failed SLP build attempt
20880 assert that at least one lane is marked as not matching.
20882 2021-06-21 liuhongt <hongtao.liu@intel.com>
20885 * config/i386/i386.md: (*anddi_1): Disparage slightly the mask
20886 register alternative.
20887 (*and<mode>_1): Ditto.
20889 (*andn<mode>_1): Ditto.
20890 (*<code><mode>_1): Ditto.
20891 (*<code>qi_1): Ditto.
20892 (*one_cmpl<mode>2_1): Ditto.
20893 (*one_cmplsi2_1_zext): Ditto.
20894 (*one_cmplqi2_1): Ditto.
20895 * config/i386/i386.c (x86_order_regs_for_local_alloc): Change
20896 the order of mask registers to be before general registers.
20898 2021-06-21 Roger Sayle <roger@nextmovesoftware.com>
20901 * config/i386/i386.md: New define_peephole2s to shrink writing
20902 1, 2 or 4 consecutive zeros to memory when optimizing for size.
20904 2021-06-18 Jeff Law <jeffreyalaw@gmail.com>
20906 * config/h8300/h8300.c (h8300_select_cc_mode): Handle SYMBOL_REF.
20907 * config/h8300/logical.md (<code><mode>3 logcial expander): Generate
20908 more efficient code when the source can be trivially simplified.
20910 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
20912 * gimple-range-cache.cc (ranger_cache::range_of_def): Calculate
20913 a range if global is not available.
20914 (ranger_cache::entry_range): Fallback to range_of_def.
20915 * gimple-range-cache.h (range_of_def): Adjust prototype.
20917 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
20919 PR tree-optimization/101014
20920 * gimple-range-cache.cc (ranger_cache::ranger_cache): Remove poor
20922 (ranger_cache::~ranger_cache): Ditto.
20923 (ranger_cache::enable_new_values): Delete.
20924 (ranger_cache::push_poor_value): Delete.
20925 (ranger_cache::range_of_def): Remove poor value processing.
20926 (ranger_cache::entry_range): Ditto.
20927 (ranger_cache::fill_block_cache): Ditto.
20928 * gimple-range-cache.h (class ranger_cache): Remove poor value members.
20929 * gimple-range.cc (gimple_ranger::range_of_expr): Remove call.
20930 * gimple-range.h (class gimple_ranger): Adjust.
20932 2021-06-18 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
20935 * common/config/arm/arm-common.c (arm_canon_arch_option_1): New function
20936 derived from arm_canon_arch.
20937 (arm_canon_arch_option): Call it.
20938 (arm_canon_arch_multilib_option): New function.
20939 * config/arm/arm-cpus.in (IGNORE_FOR_MULTILIB): New fgroup.
20940 * config/arm/arm.h (arm_canon_arch_multilib_option): New prototype.
20941 (CANON_ARCH_MULTILIB_SPEC_FUNCTION): New macro.
20942 (MULTILIB_ARCH_CANONICAL_SPECS): New macro.
20943 (DRIVER_SELF_SPECS): Add MULTILIB_ARCH_CANONICAL_SPECS.
20944 * config/arm/arm.opt (mlibarch): New option.
20945 * config/arm/t-rmprofile (MULTILIB_MATCHES): For armv8*-m, replace use
20946 of march on RHS with mlibarch.
20948 2021-06-18 Marcel Vollweiler <marcel@codesourcery.com>
20950 * config.in: Regenerate.
20951 * config/gcn/gcn.c (print_operand_address): Fix for global_load assembler
20953 * configure: Regenerate.
20954 * configure.ac: Fix for global_load assembler functions.
20956 2021-06-18 Richard Biener <rguenther@suse.de>
20958 PR tree-optimization/101112
20959 * tree-vect-slp.c (vect_slp_linearize_chain): Fix condition
20960 to lookup a pattern stmt def.
20962 2021-06-18 Jakub Jelinek <jakub@redhat.com>
20964 PR middle-end/101062
20965 * stor-layout.c (finish_bitfield_layout): Don't add bitfield
20966 representatives in QUAL_UNION_TYPE.
20968 2021-06-18 Andrew Pinski <apinski@marvell.com>
20970 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
20971 Add counting of how many times it is done.
20972 (factor_out_conditional_conversion): Likewise.
20973 (match_simplify_replacement): Likewise.
20974 (value_replacement): Likewise.
20975 (spaceship_replacement): Likewise.
20976 (cond_store_replacement): Likewise.
20977 (cond_if_else_store_replacement_1): Likewise.
20978 (hoist_adjacent_loads): Likewise.
20980 2021-06-18 Andrew Pinski <apinski@marvell.com>
20982 * tree-cfg.c (verify_gimple_assign_unary): Reject point and offset
20983 types on NEGATE_EXPR, ABS_EXPR, BIT_NOT_EXPR, PAREN_EXPR and CNONJ_EXPR.
20984 (verify_gimple_assign_binary): Reject point and offset types on
20985 MULT_EXPR, MULT_HIGHPART_EXPR, TRUNC_DIV_EXPR, CEIL_DIV_EXPR,
20986 FLOOR_DIV_EXPR, ROUND_DIV_EXPR, TRUNC_MOD_EXPR, CEIL_MOD_EXPR,
20987 FLOOR_MOD_EXPR, ROUND_MOD_EXPR, RDIV_EXPR, and EXACT_DIV_EXPR.
20989 2021-06-18 Michael Meissner <meissner@linux.ibm.com>
20991 * config/rs6000/rs6000.c (rs6000_emit_minmax): Add support for ISA
20992 3.1 IEEE 128-bit floating point xsmaxcqp/xsmincqp instructions.
20993 * config/rs6000/rs6000.md (s<minmax><mode>3, IEEE128 iterator):
20996 2021-06-17 Aaron Sawdey <acsawdey@linux.ibm.com>
20998 * config/rs6000/genfusion.pl (gen_logical_addsubf): Add
20999 earlyclobber to alts 0/1.
21000 (gen_addadd): Add earlyclobber to alts 0/1.
21001 * config/rs6000/fusion.md: Regenerate file.
21003 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21005 * cfgloopanal.c (get_loop_hot_path): Make path an auto_vec.
21007 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21009 * gimple-range-cache.cc: Comment cleanups.
21010 * gimple-range-gori.cc: Comment cleanups.
21011 * gimple-range.cc: Comment/spacing cleanups
21012 * value-range.h: Comment cleanups.
21014 2021-06-17 H.J. Lu <hjl.tools@gmail.com>
21017 * calls.c (expand_call): Replace PUSH_ARGS with
21018 targetm.calls.push_argument (0).
21019 (emit_library_call_value_1): Likewise.
21020 * defaults.h (PUSH_ARGS): Removed.
21021 (PUSH_ARGS_REVERSED): Replace PUSH_ARGS with
21022 targetm.calls.push_argument (0).
21023 * expr.c (block_move_libcall_safe_for_call_parm): Likewise.
21024 (emit_push_insn): Pass the number bytes to push to
21025 targetm.calls.push_argument and pass 0 if ARGS_ADDR is 0.
21026 * hooks.c (hook_bool_uint_true): New.
21027 * hooks.h (hook_bool_uint_true): Likewise.
21028 * rtlanal.c (nonzero_bits1): Replace PUSH_ARGS with
21029 targetm.calls.push_argument (0).
21030 * target.def (push_argument): Add a targetm.calls hook.
21031 * targhooks.c (default_push_argument): New.
21032 * targhooks.h (default_push_argument): Likewise.
21033 * config/bpf/bpf.h (PUSH_ARGS): Removed.
21034 * config/cr16/cr16.c (TARGET_PUSH_ARGUMENT): New.
21035 * config/cr16/cr16.h (PUSH_ARGS): Removed.
21036 * config/i386/i386.c (ix86_push_argument): New.
21037 (TARGET_PUSH_ARGUMENT): Likewise.
21038 * config/i386/i386.h (PUSH_ARGS): Removed.
21039 * config/m32c/m32c.c (TARGET_PUSH_ARGUMENT): New.
21040 * config/m32c/m32c.h (PUSH_ARGS): Removed.
21041 * config/nios2/nios2.h (PUSH_ARGS): Likewise.
21042 * config/pru/pru.h (PUSH_ARGS): Likewise.
21043 * doc/tm.texi.in: Remove PUSH_ARGS documentation. Add
21044 TARGET_PUSH_ARGUMENT hook.
21045 * doc/tm.texi: Regenerated.
21047 2021-06-17 Uroš Bizjak <ubizjak@gmail.com>
21050 * config/i386/i386-expand.c (expand_vector_set_var):
21051 Handle V2FS mode remapping. Pass TARGET_MMX_WITH_SSE to
21052 ix86_expand_vector_init_duplicate.
21053 (ix86_expand_vector_init_duplicate): Emit insv_1 for
21054 QImode for !TARGET_PARTIAL_REG_STALL.
21055 * config/i386/predicates.md (vec_setm_mmx_operand): New predicate.
21056 * config/i386/mmx.md (vec_setv2sf): Use vec_setm_mmx_operand
21057 as operand 2 predicate. Call ix86_expand_vector_set_var
21058 for non-constant index operand.
21059 (vec_setv2si): Ditto.
21060 (vec_setv4hi): Ditto.
21061 (vec_setv8qi): ditto.
21063 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
21065 PR tree-optimization/100790
21066 * gimple-range.cc (range_of_builtin_call): Cleanup clz and ctz
21069 2021-06-17 Martin Liska <mliska@suse.cz>
21071 * doc/invoke.texi: Use consistently -O1 instead of -O.
21073 2021-06-17 Martin Liska <mliska@suse.cz>
21075 * gcov-io.h: Update documentation entry about string format.
21077 2021-06-17 Marius Hillenbrand <mhillen@linux.ibm.com>
21080 * config/s390/vecintrin.h (vec_doublee): Fix to use
21081 __builtin_s390_vflls.
21082 (vec_floate): Fix to use __builtin_s390_vflrd.
21084 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21086 * dominance.c (get_dominated_to_depth): Return auto_vec<basic_block>.
21087 * dominance.h (get_dominated_to_depth): Likewise.
21088 (get_all_dominated_blocks): Likewise.
21089 * cfgcleanup.c (delete_unreachable_blocks): Adjust.
21090 * gcse.c (hoist_code): Likewise.
21091 * tree-cfg.c (remove_edge_and_dominated_blocks): Likewise.
21092 * tree-parloops.c (oacc_entry_exit_ok): Likewise.
21093 * tree-ssa-dce.c (eliminate_unnecessary_stmts): Likewise.
21094 * tree-ssa-phiprop.c (pass_phiprop::execute): Likewise.
21096 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21098 * dominance.c (get_dominated_by_region): Return auto_vec<basic_block>.
21099 * dominance.h (get_dominated_by_region): Likewise.
21100 * tree-cfg.c (gimple_duplicate_sese_region): Adjust.
21101 (gimple_duplicate_sese_tail): Likewise.
21102 (move_sese_region_to_fn): Likewise.
21104 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21106 * dominance.c (get_dominated_by): Return auto_vec<basic_block>.
21107 * dominance.h (get_dominated_by): Likewise.
21108 * auto-profile.c (afdo_find_equiv_class): Adjust.
21109 * cfgloopmanip.c (duplicate_loop_to_header_edge): Likewise.
21110 * loop-unroll.c (unroll_loop_runtime_iterations): Likewise.
21111 * tree-cfg.c (test_linear_chain): Likewise.
21112 (test_diamond): Likewise.
21114 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21116 * cfgloop.h (get_loop_hot_path): Return auto_vec<basic_block>.
21117 * cfgloopanal.c (get_loop_hot_path): Likewise.
21118 * tree-ssa-loop-ivcanon.c (tree_estimate_loop_size): Likewise.
21120 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21122 * cgraph.c (cgraph_node::collect_callers): Return
21123 auto_vec<cgraph_edge *>.
21124 * cgraph.h (cgraph_node::collect_callers): Likewise.
21125 * ipa-cp.c (create_specialized_node): Adjust.
21126 (decide_about_value): Likewise.
21127 (decide_whether_version_node): Likewise.
21128 * ipa-sra.c (process_isra_node_results): Likewise.
21130 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
21132 * vec.h (vl_ptr>::using_auto_storage): Handle null m_vec.
21133 (auto_vec<T, 0>::auto_vec): Define move constructor, and delete copy
21135 (auto_vec<T, 0>::operator=): Define move assignment and delete copy
21138 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
21140 * gimple-range.cc (debug_seed_ranger): New.
21141 (dump_ranger): New.
21142 (debug_ranger): New.
21144 2021-06-17 Richard Biener <rguenther@suse.de>
21146 PR tree-optimization/54400
21147 * tree-vectorizer.h (enum slp_instance_kind): Add
21148 slp_inst_kind_bb_reduc.
21149 (reduction_fn_for_scalar_code): Declare.
21150 * tree-vect-data-refs.c (vect_slp_analyze_instance_dependence):
21151 Check SLP_INSTANCE_KIND instead of looking at the
21153 (vect_slp_analyze_instance_alignment): Likewise.
21154 * tree-vect-loop.c (reduction_fn_for_scalar_code): Export.
21155 * tree-vect-slp.c (vect_slp_linearize_chain): Split out
21156 chain linearization from vect_build_slp_tree_2 and generalize
21157 for the use of BB reduction vectorization.
21158 (vect_build_slp_tree_2): Adjust accordingly.
21159 (vect_optimize_slp): Elide permutes at the root of BB reduction
21161 (vectorizable_bb_reduc_epilogue): New function.
21162 (vect_slp_prune_covered_roots): Likewise.
21163 (vect_slp_analyze_operations): Use them.
21164 (vect_slp_check_for_constructors): Recognize associatable
21165 chains for BB reduction vectorization.
21166 (vectorize_slp_instance_root_stmt): Generate code for the
21167 BB reduction epilogue.
21169 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21171 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Check with
21173 (gori_compute::may_recompute_p): New.
21174 (gori_compute::outgoing_edge_range_p): Perform recomputations.
21175 * gimple-range-gori.h (class gori_compute): Add prototype.
21177 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
21179 * gimple-range-cache.cc (ranger_cache::range_on_edge): Always return
21180 true when a range can be calculated.
21181 * gimple-range.cc (gimple_ranger::dump_bb): Check has_edge_range_p.
21183 2021-06-16 Martin Sebor <msebor@redhat.com>
21185 * doc/invoke.texi (-Wmismatched-dealloc, -Wmismatched-new-delete):
21186 Correct documented defaults.
21188 2021-06-16 Andrew MacLeod <amacleod@redhat.com>
21190 * gimple-range-cache.cc (ranger_cache::ranger_cache): Initialize
21191 m_new_value_p directly.
21193 2021-06-16 Uroš Bizjak <ubizjak@gmail.com>
21196 * config/i386/i386-expand.c (expand_vec_perm_2perm_pblendv):
21197 Handle 64bit modes for TARGET_SSE4_1.
21198 (expand_vec_perm_pshufb2): Handle 64bit modes for TARGET_SSSE3.
21199 (expand_vec_perm_even_odd_pack): Handle V4HI mode.
21200 (expand_vec_perm_even_odd_1) <case E_V4HImode>: Expand via
21201 expand_vec_perm_pshufb2 for TARGET_SSSE3 and via
21202 expand_vec_perm_even_odd_pack for TARGET_SSE4_1.
21203 * config/i386/mmx.md (mmx_packusdw): New insn pattern.
21205 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21207 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn<mode>):
21208 Change to an expander that emits the correct instruction
21209 depending on endianness.
21210 (aarch64_<sur><addsub>hn<mode>_insn_le): Define.
21211 (aarch64_<sur><addsub>hn<mode>_insn_be): Define.
21213 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21215 * config/aarch64/aarch64-simd-builtins.def: Split generator
21216 for aarch64_<su>qmovn builtins into scalar and vector
21218 * config/aarch64/aarch64-simd.md (aarch64_<su>qmovn<mode>_insn_le):
21220 (aarch64_<su>qmovn<mode>_insn_be): Define.
21221 (aarch64_<su>qmovn<mode>): Split into scalar and vector
21222 variants. Change vector variant to an expander that emits the
21223 correct instruction depending on endianness.
21225 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21227 * config/aarch64/aarch64-simd-builtins.def: Split generator
21228 for aarch64_sqmovun builtins into scalar and vector variants.
21229 * config/aarch64/aarch64-simd.md (aarch64_sqmovun<mode>):
21230 Split into scalar and vector variants. Change vector variant
21231 to an expander that emits the correct instruction depending
21233 (aarch64_sqmovun<mode>_insn_le): Define.
21234 (aarch64_sqmovun<mode>_insn_be): Define.
21236 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21238 * config/aarch64/aarch64-simd.md (aarch64_xtn<mode>_insn_le):
21239 Define - modeling zero-high-half semantics.
21240 (aarch64_xtn<mode>): Change to an expander that emits the
21241 appropriate instruction depending on endianness.
21242 (aarch64_xtn<mode>_insn_be): Define - modeling zero-high-half
21244 (aarch64_xtn2<mode>_le): Rename to...
21245 (aarch64_xtn2<mode>_insn_le): This.
21246 (aarch64_xtn2<mode>_be): Rename to...
21247 (aarch64_xtn2<mode>_insn_be): This.
21248 (vec_pack_trunc_<mode>): Emit truncation instruction instead
21250 * config/aarch64/iterators.md (Vnarrowd): Add Vnarrowd mode
21251 attribute iterator.
21253 2021-06-16 Martin Jambor <mjambor@suse.cz>
21255 PR tree-optimization/100453
21256 * tree-sra.c (create_access): Disqualify any const candidates
21257 which are written to.
21258 (sra_modify_expr): Do not store sub-replacements back to a const base.
21259 (handle_unscalarized_data_in_subtree): Likewise.
21260 (sra_modify_assign): Likewise. Earlier, use TREE_READONLy test
21261 instead of constant_decl_p.
21263 2021-06-16 Jakub Jelinek <jakub@redhat.com>
21265 PR middle-end/101062
21266 * stor-layout.c (finish_bitfield_representative): For fields in unions
21267 assume nextf is always NULL.
21268 (finish_bitfield_layout): Compute bit field representatives also in
21269 unions, but handle it as if each bitfield was the only field in the
21272 2021-06-16 Richard Biener <rguenther@suse.de>
21274 PR tree-optimization/101088
21275 * tree-ssa-loop-im.c (sm_seq_valid_bb): Only look for
21276 supported refs on edges. Do not assert same ref but
21277 different kind stores are unsuported but mark them so.
21278 (hoist_memory_references): Only look for supported refs
21281 2021-06-16 Roger Sayle <roger@nextmovesoftware.com>
21283 PR rtl-optimization/46235
21284 * config/i386/i386.md: New define_split for bt followed by cmov.
21285 (*bt<mode>_setcqi): New define_insn_and_split for bt followed by setc.
21286 (*bt<mode>_setncqi): New define_insn_and_split for bt then setnc.
21287 (*bt<mode>_setnc<mode>): New define_insn_and_split for bt followed
21288 by setnc with zero extension.
21290 2021-06-16 Richard Biener <rguenther@suse.de>
21292 PR tree-optimization/101083
21293 * tree-vect-slp.c (vect_slp_build_two_operator_nodes): Get
21294 vectype as argument.
21295 (vect_build_slp_tree_2): Adjust.
21297 2021-06-15 Martin Sebor <msebor@redhat.com>
21299 PR middle-end/100876
21300 * builtins.c: (gimple_call_return_array): Account for size_t
21301 mangling as either unsigned int or unsigned long
21303 2021-06-15 Jeff Law <jeffreyalaw@gmail.com>
21305 * compare-elim.c (try_eliminate_compare): Run DCE to clean things
21306 up before eliminating comparisons.
21308 2021-06-15 Aldy Hernandez <aldyh@redhat.com>
21310 * range-op.cc (operator_bitwise_or::wi_fold): Make sure
21311 nonzero|X is nonzero.
21312 (range_op_bitwise_and_tests): Add tests for above.
21314 2021-06-15 Carl Love <cel@us.ibm.com>
21317 * config/rs6000/rs6000-builtin.def (VCMPEQUT): Fix the ICODE for the
21319 (VRLQ, VSLQ, VSRQ, VSRAQ): Remove unused BU_P10_OVERLOAD_2
21322 2021-06-15 Tobias Burnus <tobias@codesourcery.com>
21325 * gimplify.c (enum gimplify_defaultmap_kind): Add GDMK_SCALAR_TARGET.
21326 (struct gimplify_omp_ctx): Extend defaultmap array by one.
21327 (new_omp_context): Init defaultmap[GDMK_SCALAR_TARGET].
21328 (omp_notice_variable): Update type classification for Fortran.
21329 (gimplify_scan_omp_clauses): Update calls for new argument; handle
21330 GDMK_SCALAR_TARGET; for Fortran, GDMK_POINTER avoid GOVD_MAP_0LEN_ARRAY.
21331 * langhooks-def.h (lhd_omp_scalar_p): Add 'ptr_ok' argument.
21332 * langhooks.c (lhd_omp_scalar_p): Likewise.
21333 (LANG_HOOKS_OMP_ALLOCATABLE_P, LANG_HOOKS_OMP_SCALAR_TARGET_P): New.
21334 (LANG_HOOKS_DECLS): Add them.
21335 * langhooks.h (struct lang_hooks_for_decls): Add new hooks, update
21336 omp_scalar_p pointer type to include the new bool argument.
21338 2021-06-15 David Malcolm <dmalcolm@redhat.com>
21340 * doc/analyzer.texi
21341 (Special Functions for Debugging the Analyzer): Add
21342 __analyzer_dump_capacity.
21344 2021-06-15 Jakub Jelinek <jakub@redhat.com>
21347 * expr.c (expand_expr_real_2) <case VEC_PACK_FIX_TRUNC_EXPR,
21348 case VEC_PACK_TRUNC_EXPR>: Clear subtarget when changing mode.
21350 2021-06-15 Richard Biener <rguenther@suse.de>
21352 * cfgloopanal.c (mark_irreducible_loops): Use a dominance
21353 check to identify loop latches.
21354 * cfgloop.c (verify_loop_structure): Likewise.
21355 * loop-init.c (apply_loop_flags): Allow marked irreducible
21356 regions even with multiple latches.
21357 * predict.c (rebuild_frequencies): Simplify.
21359 2021-06-15 Richard Biener <rguenther@suse.de>
21361 * tree-ssa-threadupdate.c
21362 (jump_thread_path_registry::mark_threaded_blocks): Assert we
21363 have marked irreducible regions.
21365 2021-06-14 Martin Sebor <msebor@redhat.com>
21368 * builtins.c (gimple_call_return_array): Check for attribute fn spec.
21369 Handle calls to placement new.
21370 (ndecl_dealloc_argno): Avoid placement delete.
21372 2021-06-14 Peter Bergner <bergner@linux.ibm.com>
21375 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Use
21376 create_tmp_reg_or_ssa_name().
21378 2021-06-14 Andrew MacLeod <amacleod@redhat.com>
21380 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
21381 (ranger_cache::enable_new_values): Set to specified value and
21382 return the old value.
21383 (ranger_cache::disable_new_values): Delete.
21384 (ranger_cache::fill_block_cache): Disable non 1st order derived
21386 * gimple-range-cache.h (ranger_cache): Adjust prototypes.
21387 * gimple-range.cc (gimple_ranger::range_of_expr): Adjust.
21389 2021-06-14 Uroš Bizjak <ubizjak@gmail.com>
21392 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const):
21393 Return true early when testing with V2HImode.
21394 * config/i386/mmx.md (*punpckwd): Split to sse2_pshuflw_1.
21396 2021-06-14 Christophe Lyon <christophe.lyon@linaro.org>
21398 * config/arm/mve.md (mve_vec_unpack<US>_lo_<mode>): New pattern.
21399 (mve_vec_unpack<US>_hi_<mode>): New pattern.
21400 (@mve_vec_pack_trunc_lo_<mode>): New pattern.
21401 (mve_vmovntq_<supf><mode>): Prefix with '@'.
21402 * config/arm/neon.md (vec_unpack<US>_hi_<mode>): Move to
21404 (vec_unpack<US>_lo_<mode>): Likewise.
21405 (vec_pack_trunc_<mode>): Rename to
21406 neon_quad_vec_pack_trunc_<mode>.
21407 * config/arm/vec-common.md (vec_unpack<US>_hi_<mode>): New
21409 (vec_unpack<US>_lo_<mode>): New.
21410 (vec_pack_trunc_<mode>): New.
21412 2021-06-14 Richard Biener <rguenther@suse.de>
21414 PR tree-optimization/100934
21415 * tree-ssa-dom.c (pass_dominator::execute): Properly
21416 mark irreducible regions.
21418 2021-06-14 Martin Liska <mliska@suse.cz>
21420 * doc/invoke.texi: Put r{...} on the same line as @item.
21422 2021-06-14 Martin Liska <mliska@suse.cz>
21424 * doc/invoke.texi: Add missing newline.
21426 2021-06-14 Martin Liska <mliska@suse.cz>
21428 * doc/invoke.texi: Remove '+' charasters.
21430 2021-06-14 Claudiu Zissulescu <claziss@synopsys.com>
21432 * config.gcc (arc): Add support for with_cpu option.
21433 * config/arc/arc.h (OPTION_DEFAULT_SPECS): Add fpu.
21435 2021-06-14 Richard Biener <rguenther@suse.de>
21437 PR tree-optimization/101031
21438 * tree-ssa-strlen.c (maybe_invalidate): Increment max_size
21439 instead of size when accounting for a possibly string
21442 2021-06-14 Martin Liska <mliska@suse.cz>
21444 * gimple-ssa-evrp.c (pointer_equiv_analyzer::~pointer_equiv_analyzer): Use delete[].
21446 2021-06-14 Aldy Hernandez <aldyh@redhat.com>
21448 * value-query.cc (gimple_range_global): Call get_range_global
21449 if called after inlining.
21451 2021-06-13 Uroš Bizjak <ubizjak@gmail.com>
21454 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
21455 Emit constant permutation insn directly from here.
21457 2021-06-13 Trevor Saunders <tbsaunde@tbsaunde.org>
21459 * attribs.c (find_attribute_namespace): Iterate over vec<> with
21461 * auto-profile.c (afdo_find_equiv_class): Likewise.
21462 * gcc.c (do_specs_vec): Likewise.
21463 (do_spec_1): Likewise.
21464 (driver::set_up_specs): Likewise.
21465 * gimple-loop-jam.c (any_access_function_variant_p): Likewise.
21466 * gimple-ssa-store-merging.c (compatible_load_p): Likewise.
21467 (imm_store_chain_info::try_coalesce_bswap): Likewise.
21468 (imm_store_chain_info::coalesce_immediate_stores): Likewise.
21469 (get_location_for_stmts): Likewise.
21470 * graphite-poly.c (print_iteration_domains): Likewise.
21471 (free_poly_bb): Likewise.
21472 (remove_gbbs_in_scop): Likewise.
21473 (free_scop): Likewise.
21474 (dump_gbb_cases): Likewise.
21475 (dump_gbb_conditions): Likewise.
21476 (print_pdrs): Likewise.
21477 (print_scop): Likewise.
21478 * ifcvt.c (cond_move_process_if_block): Likewise.
21479 * lower-subreg.c (decompose_multiword_subregs): Likewise.
21480 * regcprop.c (pass_cprop_hardreg::execute): Likewise.
21481 * sanopt.c (sanitize_rewrite_addressable_params): Likewise.
21482 * sel-sched-dump.c (dump_insn_vector): Likewise.
21483 * store-motion.c (store_ops_ok): Likewise.
21484 (store_killed_in_insn): Likewise.
21485 * timevar.c (timer::named_items::print): Likewise.
21486 * tree-cfgcleanup.c (cleanup_control_flow_pre): Likewise.
21487 (cleanup_tree_cfg_noloop): Likewise.
21488 * tree-data-ref.c (dump_data_references): Likewise.
21489 (print_dir_vectors): Likewise.
21490 (print_dist_vectors): Likewise.
21491 (dump_data_dependence_relations): Likewise.
21492 (dump_dist_dir_vectors): Likewise.
21493 (dump_ddrs): Likewise.
21494 (create_runtime_alias_checks): Likewise.
21495 (free_subscripts): Likewise.
21496 (save_dist_v): Likewise.
21497 (save_dir_v): Likewise.
21498 (invariant_access_functions): Likewise.
21499 (same_access_functions): Likewise.
21500 (access_functions_are_affine_or_constant_p): Likewise.
21501 (find_data_references_in_stmt): Likewise.
21502 (graphite_find_data_references_in_stmt): Likewise.
21503 (free_dependence_relations): Likewise.
21504 (free_data_refs): Likewise.
21505 * tree-inline.c (copy_debug_stmts): Likewise.
21506 * tree-into-ssa.c (dump_currdefs): Likewise.
21507 (rewrite_update_phi_arguments): Likewise.
21508 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
21509 * tree-vect-data-refs.c (vect_analyze_possibly_independent_ddr):
21511 (vect_slp_analyze_node_dependences): Likewise.
21512 (vect_slp_analyze_instance_dependence): Likewise.
21513 (vect_record_base_alignments): Likewise.
21514 (vect_get_peeling_costs_all_drs): Likewise.
21515 (vect_peeling_supportable): Likewise.
21516 * tree-vectorizer.c (vec_info::~vec_info): Likewise.
21517 (vec_info::free_stmt_vec_infos): Likewise.
21519 2021-06-13 Jeff Law <jeffreyalaw@gmail.com>
21521 * config/h8300/logical.md (<code>qi3_1<cczn>): New pattern.
21522 (andqi3_1<cczn>): Removed.
21523 (<ors>qi3_1): Do not split for IOR/XOR a single bit.
21524 (H8/SX bit logicals): Split out from other patterns.
21525 * config/h8300/multiply.md (mulqihi3_const<cczn>): Renamed from
21526 mulqihi3_const_clobber_flags.
21527 (mulqihi3<cczn>, mulhisi3_const<cczn>, mulhisi3<cczn>): Similarly
21529 2021-06-13 H.J. Lu <hjl.tools@gmail.com>
21532 * config/i386/i386.c (ix86_expand_prologue): Set red_zone_used
21533 to true if red zone is used.
21534 (ix86_output_indirect_jmp): Replace ix86_red_zone_size with
21535 ix86_red_zone_used.
21536 * config/i386/i386.h (machine_function): Add red_zone_used.
21537 (ix86_red_zone_size): Removed.
21538 (ix86_red_zone_used): New.
21539 * config/i386/i386.md (peephole2 patterns): Replace
21540 ix86_red_zone_size with ix86_red_zone_used.
21542 2021-06-12 Jason Merrill <jason@redhat.com>
21544 * doc/extend.texi (unused variable attribute): Applies to
21545 structure fields as well.
21547 2021-06-12 Eugene Rozenfeld <erozen@microsoft.com>
21549 * auto-profile.c (read_profile): fix a typo in an error string
21551 2021-06-11 Thomas Schwinge <thomas@codesourcery.com>
21553 * tree-pretty-print.h (dump_omp_clauses): Add 'bool = true'
21555 * tree-pretty-print.c (dump_omp_clauses): Update.
21556 (dump_generic_node) <OMP_CLAUSE>: Use it.
21558 2021-06-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
21561 * config/arm/arm_mve.h (__arm_vld1q): Change __ARM_mve_coerce(p0,
21562 int8_t const *) to __ARM_mve_coerce1(p0, int8_t *) in the argument for
21563 the polymorphic variants matching code.
21564 (__arm_vld1q_z): Likewise.
21565 (__arm_vld2q): Likewise.
21566 (__arm_vld4q): Likewise.
21567 (__arm_vldrbq_gather_offset): Likewise.
21568 (__arm_vldrbq_gather_offset_z): Likewise.
21570 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
21572 PR tree-optimization/96392
21573 * fold-const.h (tree_expr_maybe_real_minus_zero_p): Fix prototype.
21575 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
21577 PR tree-optimization/96392
21578 * fold-const.c (fold_real_zero_addition_p): Take both arguments
21579 of the addition or subtraction, not just the zero. Use this
21580 other argument in tests for signaling NaNs and signed zeros.
21581 (tree_expr_maybe_real_minus_zero_p): New predicate.
21582 * fold-const.h (fold_real_zero_addition_p): Update prototype.
21583 (tree_expr_maybe_real_minus_zero_p): New function prototype.
21584 * match.pd: Update calls to fold_real_zero_addition_p.
21585 Replace HONOR_NANS with tree_expr_maybe_nan_p.
21586 Replace HONOR_SIGNED_ZEROS with tree_expr_maybe_real_minus_zero_p.
21587 Replace HONOR_SNANS with tree_expr_maybe_signaling_nan_p.
21588 * tree-ssa-reassoc.c (eliminate_using_constants): Update
21589 call to fold_real_zero_addition_p.
21591 2021-06-11 Richard Biener <rguenther@suse.de>
21593 PR tree-optimization/101025
21594 * tree-ssa-loop-im.c (sm_seq_valid_bb): Make sure to process
21595 all refs that require dependence checking.
21597 2021-06-11 Richard Biener <rguenther@suse.de>
21599 PR tree-optimization/101028
21600 * tree-vect-slp.c (vect_build_slp_tree_2): When SLP
21601 reassoc discovery fails fatally, mark appropriate lanes
21604 2021-06-11 Richard Biener <rguenther@suse.de>
21606 PR tree-optimization/101026
21607 * tree-vect-slp.c (vect_build_slp_tree_2): Make sure we
21608 have a representative for the associated chain nodes.
21610 2021-06-11 Jakub Jelinek <jakub@redhat.com>
21612 PR rtl-optimization/101008
21613 * simplify-rtx.c (relational_result): New function.
21614 (simplify_logical_relational_operation,
21615 simplify_relational_operation): Use it.
21617 2021-06-11 Jakub Jelinek <jakub@redhat.com>
21620 * config/i386/sse.md (*vec_concat<mode>_0_1): Require TARGET_SSE2.
21622 2021-06-11 Uroš Bizjak <ubizjak@gmail.com>
21625 * config/i386/i386-expand.c (expand_vec_perm_pshufb): Return
21626 false if the permutation can be implemented with constant
21627 permutation instruction in wider mode.
21628 (canonicalize_vector_int_perm): Move above expand_vec_perm_pshufb.
21629 Handle V8QImode and V4HImode.
21631 2021-06-11 Martin Liska <mliska@suse.cz>
21633 PR gcov-profile/100788
21634 * common.opt: Add new option.
21635 * coverage.c (coverage_begin_function): Emit warning instead on
21636 the internal compiler error.
21637 * doc/invoke.texi: Document the option.
21638 * toplev.c (process_options): Enable it by default.
21640 2021-06-11 Richard Biener <rguenther@suse.de>
21642 PR middle-end/101009
21643 * tree-data-ref.c (build_classic_dist_vector_1): Make sure
21644 to set *init_b to true when we encounter a constant equal
21646 (compute_affine_dependence): Also dump the actual DR_REF.
21648 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
21650 PR tree-optimization/100984
21651 * gimple-ssa-evrp.c (ssa_equiv_stack): Use auto_vec for
21652 replacements table.
21653 (ssa_equiv_stack::~ssa_equiv_stack): Remove.
21655 2021-06-11 Kewen Lin <linkw@linux.ibm.com>
21657 * config/rs6000/rs6000.md
21658 (floatsi<SFDF:mode>2_lfiwax_<QHI:mode>_mem_zext): New
21659 define_insn_and_split.
21661 2021-06-11 Richard Biener <rguenther@suse.de>
21663 * tree-vect-slp.c (vect_build_slp_tree_2): Use stablesort
21664 to sort operands of the associative chain.
21666 2021-06-11 Richard Biener <rguenther@suse.de>
21668 * system.h (gcc_stablesort_r): Declare.
21669 * sort.cc (gcc_sort_r): Support stable sort.
21670 (gcc_stablesort_r): Define.
21671 * vec.h (vec<>::stablesort): Add.
21673 2021-06-10 Uroš Bizjak <ubizjak@gmail.com>
21676 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
21677 Handle V2SF mode. Emit SHUFPS to fixup unpack-high for V2SF mode.
21678 (expand_vec_perm_blend): Handle 64bit modes for TARGET_SSE4_1.
21679 (expand_vec_perm_pshufb): Handle 64bit modes for TARGET_SSSE3.
21680 (expand_vec_perm_pblendv): Handle 64bit modes for TARGET_SSE4_1.
21681 (expand_vec_perm_interleave2): Handle 64bit modes.
21682 (expand_vec_perm_even_odd_pack): Handle V8QI mode.
21683 (expand_vec_perm_even_odd_1): Ditto.
21684 (ix86_vectorize_vec_perm_const): Ditto.
21685 * config/i386/i386.md (UNSPEC_PSHUFB): Move from ...
21686 * config/i386/sse.md: ... here.
21687 * config/i386/mmx.md (*vec_interleave_lowv2sf):
21688 New insn_and_split pattern.
21689 (*vec_interleave_highv2sf): Ditto.
21690 (mmx_pshufbv8qi3): New insn pattern.
21691 (*mmx_pblendw): Ditto.
21693 2021-06-10 Peter Bergner <bergner@linux.ibm.com>
21695 * config/rs6000/rs6000-builtin.def (build_pair): New built-in.
21696 (build_acc): Likewise.
21697 * config/rs6000/rs6000-call.c (mma_expand_builtin): Swap assemble
21698 source operands in little-endian mode.
21699 (rs6000_gimple_fold_mma_builtin): Handle VSX_BUILTIN_BUILD_PAIR.
21700 (mma_init_builtins): Likewise.
21701 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Handle endianness
21702 ordering for the MMA assemble and build source operands.
21703 * doc/extend.texi (__builtin_vsx_build_acc, __builtin_mma_build_pair):
21705 (__builtin_mma_assemble_acc, __builtin_mma_assemble_pair): Remove
21708 2021-06-10 Jeff Law <jeffreyalaw@gmail.com>
21710 * config/h8300/h8300.c (select_cc_mode): Handle MEM. Use
21712 * config/h8300/extensions.md: Replace _clobber_flags patterns
21715 2021-06-10 Robin Dapp <rdapp@linux.ibm.com>
21717 * config/s390/vector.md (vcond_mask_<mode><mode>): Change to
21718 (vcond_mask_<mode><tointvec>): this.
21720 2021-06-10 Andrew Stubbs <ams@codesourcery.com>
21721 Thomas Schwinge <thomas@codesourcery.com>
21723 * omp-builtins.def (BUILT_IN_GOACC_ENTER_EXIT_DATA): Split into...
21724 (BUILT_IN_GOACC_ENTER_DATA, BUILT_IN_GOACC_EXIT_DATA): ... these.
21725 * gimple.h (enum gf_mask): Split
21726 'GF_OMP_TARGET_KIND_OACC_ENTER_EXIT_DATA' into
21727 'GF_OMP_TARGET_KIND_OACC_ENTER_DATA' and
21728 'GF_OMP_TARGET_KIND_OACC_EXIT_DATA'.
21729 (is_gimple_omp_oacc): Update.
21730 * gimple-pretty-print.c (dump_gimple_omp_target): Likewise.
21731 * gimplify.c (gimplify_omp_target_update): Likewise.
21732 * omp-expand.c (expand_omp_target, build_omp_regions_1)
21733 (omp_make_gimple_edges): Likewise.
21734 * omp-low.c (check_omp_nesting_restrictions, lower_omp_target):
21737 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
21739 * value-query.cc (value_query::value_on_edge): Rename name to
21741 (range_query::range_on_edge): Same.
21742 (range_query::value_of_expr): Same.
21743 (range_query::value_on_edge): Same.
21744 * value-query.h (class value_query): Same.
21745 (class range_query): Same.
21747 2021-06-10 Richard Biener <rguenther@suse.de>
21749 PR tree-optimization/101003
21750 * tree-vect-slp.c (vect_build_slp_tree_2): Appropriately
21751 use the pattern stmt defs when linearizing a chain.
21753 2021-06-10 Jakub Jelinek <jakub@redhat.com>
21756 * ifcvt.c (noce_get_alt_condition, noce_try_abs): Use
21757 prev_nonnote_nondebug_insn instead of prev_nonnote_insn.
21759 2021-06-10 Clement Chigot <clement.chigot@atos.net>
21761 * config/rs6000/aix71.h (ASM_CPU_SPEC): Add Power10 directive.
21762 * config/rs6000/aix72.h (ASM_CPU_SPEC): Likewise.
21764 2021-06-09 Andrew Pinski <apinski@marvell.com>
21766 PR tree-optimization/100925
21767 * match.pd (a ? CST1 : CST2): Limit transformations
21768 that would produce a negative to integeral types only.
21769 Change !POINTER_TYPE_P to INTEGRAL_TYPE_P also.
21771 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21774 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21776 * doc/tm.texi: Correctly update.
21778 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21780 * doc/tm.texi: Correctly update.
21782 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
21785 * doc/tm.texi.in (Trampolines): Add a missing blank line.
21787 2021-06-09 Paul Eggert <eggert@cs.ucla.edu>
21790 * doc/invoke.texi (Code Gen Options); Document that -fno-trampolines
21791 and -ftrampolines work only with Ada.
21792 * doc/tm.texi.in (Trampolines): Likewise.
21793 * doc/tm.texi: Regenerated.
21795 2021-06-09 Carl Love <cel@us.ibm.com>
21797 * config/rs6000/altivec.h (vec_signextll, vec_signexti, vec_signextq):
21798 Add define for new builtins.
21799 * config/rs6000/altivec.md(altivec_vreveti2): Add define_expand.
21800 * config/rs6000/rs6000-builtin.def (VSIGNEXTI, VSIGNEXTLL): Add
21801 overloaded builtin definitions.
21802 (VSIGNEXTSB2W, VSIGNEXTSH2W, VSIGNEXTSB2D, VSIGNEXTSH2D,VSIGNEXTSW2D,
21803 VSIGNEXTSD2Q): Add builtin expansions.
21804 (SIGNEXT): Add P10 overload definition.
21805 * config/rs6000/rs6000-call.c (P9V_BUILTIN_VEC_VSIGNEXTI, P9V_BUILTIN_VEC_VSIGNEXTLL,
21806 P10_BUILTIN_VEC_SIGNEXT): Add overloaded argument definitions.
21807 * config/rs6000/vsx.md (vsx_sign_extend_v2di_v1ti): Add define_insn.
21808 (vsignextend_v2di_v1ti, vsignextend_qi_<mode>, vsignextend_hi_<mode>,
21809 vsignextend_si_v2di)[VIlong]: Add define_expand.
21810 Make define_insn vsx_sign_extend_si_v2di visible.
21811 * doc/extend.texi: Add documentation for the vec_signexti,
21812 vec_signextll builtins and vec_signextq.
21814 2021-06-09 Carl Love <cel@us.ibm.com>
21816 * config/rs6000/rs6000.c (__fixkfti, __fixunskfti, __floattikf,
21817 __floatuntikf): Names changed to __fixkfti_sw, __fixunskfti_sw,
21818 __floattikf_sw, __floatuntikf_sw respectively.
21819 * config/rs6000/rs6000.md (floatti<mode>2, floatunsti<mode>2,
21820 fix_trunc<mode>ti2, fixuns_trunc<mode>ti2): Add
21821 define_insn for mode IEEE 128.
21823 2021-06-09 Carl Love <cel@us.ibm.com>
21825 * config/rs6000/altivec.md (altivec_vslq, altivec_vsrq):
21826 Rename to altivec_vslq_<mode>, altivec_vsrq_<mode>, mode VEC_TI.
21827 * config/rs6000/vector.md (VEC_TI): Was named VSX_TI in vsx.md.
21828 (vashlv1ti3): Change to vashl<mode>3, mode VEC_TI.
21829 (vlshrv1ti3): Change to vlshr<mode>3, mode VEC_TI.
21830 * config/rs6000/vsx.md (VSX_TI): Remove define_mode_iterator. Update
21831 uses of VSX_TI to VEC_TI.
21833 2021-06-09 Carl Love <cel@us.ibm.com>
21835 * config/rs6000/dfp.md (floattitd2, fixtdti2): New define_insns.
21837 2021-06-09 Carl Love <cel@us.ibm.com>
21839 * config/rs6000/altivec.h (vec_dive, vec_mod): Add define for new
21841 * config/rs6000/altivec.md (UNSPEC_VMULEUD, UNSPEC_VMULESD,
21842 UNSPEC_VMULOUD, UNSPEC_VMULOSD): New unspecs.
21843 (altivec_eqv1ti, altivec_gtv1ti, altivec_gtuv1ti, altivec_vmuleud,
21844 altivec_vmuloud, altivec_vmulesd, altivec_vmulosd, altivec_vrlq,
21845 altivec_vrlqmi, altivec_vrlqmi_inst, altivec_vrlqnm,
21846 altivec_vrlqnm_inst, altivec_vslq, altivec_vsrq, altivec_vsraq,
21847 altivec_vcmpequt_p, altivec_vcmpgtst_p, altivec_vcmpgtut_p): New
21849 (vec_widen_umult_even_v2di, vec_widen_smult_even_v2di,
21850 vec_widen_umult_odd_v2di, vec_widen_smult_odd_v2di, altivec_vrlqmi,
21851 altivec_vrlqnm): New define_expands.
21852 * config/rs6000/rs6000-builtin.def (VCMPEQUT_P, VCMPGTST_P,
21853 VCMPGTUT_P): Add macro expansions.
21854 (BU_P10V_AV_P): Add builtin predicate definition.
21855 (VCMPGTUT, VCMPGTST, VCMPEQUT, CMPNET, CMPGE_1TI,
21856 CMPGE_U1TI, CMPLE_1TI, CMPLE_U1TI, VNOR_V1TI_UNS, VNOR_V1TI, VCMPNET_P,
21857 VCMPAET_P, VMULEUD, VMULESD, VMULOUD, VMULOSD, VRLQ,
21858 VSLQ, VSRQ, VSRAQ, VRLQNM, DIV_V1TI, UDIV_V1TI, DIVES_V1TI, DIVEU_V1TI,
21859 MODS_V1TI, MODU_V1TI, VRLQMI): New macro expansions.
21860 (VRLQ, VSLQ, VSRQ, VSRAQ, DIVE, MOD): New overload expansions.
21861 * config/rs6000/rs6000-call.c (P10_BUILTIN_VCMPEQUT,
21862 P10V_BUILTIN_CMPGE_1TI, P10V_BUILTIN_CMPGE_U1TI,
21863 P10V_BUILTIN_VCMPGTUT, P10V_BUILTIN_VCMPGTST,
21864 P10V_BUILTIN_CMPLE_1TI, P10V_BUILTIN_VCMPLE_U1TI,
21865 P10V_BUILTIN_DIV_V1TI, P10V_BUILTIN_UDIV_V1TI,
21866 P10V_BUILTIN_VMULESD, P10V_BUILTIN_VMULEUD,
21867 P10V_BUILTIN_VMULOSD, P10V_BUILTIN_VMULOUD,
21868 P10V_BUILTIN_VNOR_V1TI, P10V_BUILTIN_VNOR_V1TI_UNS,
21869 P10V_BUILTIN_VRLQ, P10V_BUILTIN_VRLQMI,
21870 P10V_BUILTIN_VRLQNM, P10V_BUILTIN_VSLQ,
21871 P10V_BUILTIN_VSRQ, P10V_BUILTIN_VSRAQ,
21872 P10V_BUILTIN_VCMPGTUT_P, P10V_BUILTIN_VCMPGTST_P,
21873 P10V_BUILTIN_VCMPEQUT_P, P10V_BUILTIN_VCMPGTUT_P,
21874 P10V_BUILTIN_VCMPGTST_P, P10V_BUILTIN_CMPNET,
21875 P10V_BUILTIN_VCMPNET_P, P10V_BUILTIN_VCMPAET_P,
21876 P10V_BUILTIN_DIVES_V1TI, P10V_BUILTIN_MODS_V1TI,
21877 P10V_BUILTIN_MODU_V1TI):
21878 New overloaded definitions.
21879 (rs6000_gimple_fold_builtin) [P10V_BUILTIN_VCMPEQUT,
21880 P10V_BUILTIN_CMPNET, P10V_BUILTIN_CMPGE_1TI,
21881 P10V_BUILTIN_CMPGE_U1TI, P10V_BUILTIN_VCMPGTUT,
21882 P10V_BUILTIN_VCMPGTST, P10V_BUILTIN_CMPLE_1TI,
21883 P10V_BUILTIN_CMPLE_U1TI]: New case statements.
21884 (rs6000_init_builtins) [bool_V1TI_type_node, int_ftype_int_v1ti_v1ti]:
21886 (altivec_init_builtins): New E_V1TImode case statement.
21887 (builtin_function_type)[P10_BUILTIN_128BIT_VMULEUD,
21888 P10_BUILTIN_128BIT_VMULOUD, P10_BUILTIN_128BIT_DIVEU_V1TI,
21889 P10_BUILTIN_128BIT_MODU_V1TI, P10_BUILTIN_CMPGE_U1TI,
21890 P10_BUILTIN_VCMPGTUT, P10_BUILTIN_VCMPEQUT]: New case statements.
21891 * config/rs6000/rs6000.c (rs6000_handle_altivec_attribute) [E_TImode,
21892 E_V1TImode]: New case statements.
21893 * config/rs6000/rs6000.h (rs6000_builtin_type_index): New enum
21894 value RS6000_BTI_bool_V1TI.
21895 * config/rs6000/vector.md (vector_gtv1ti,vector_nltv1ti,
21896 vector_gtuv1ti, vector_nltuv1ti, vector_ngtv1ti, vector_ngtuv1ti,
21897 vector_eq_v1ti_p, vector_ne_v1ti_p, vector_ae_v1ti_p,
21898 vector_gt_v1ti_p, vector_gtu_v1ti_p, vrotlv1ti3, vashlv1ti3,
21899 vlshrv1ti3, vashrv1ti3): New define_expands.
21900 * config/rs6000/vsx.md (UNSPEC_VSX_DIVSQ, UNSPEC_VSX_DIVUQ,
21901 UNSPEC_VSX_DIVESQ, UNSPEC_VSX_DIVEUQ, UNSPEC_VSX_MODSQ,
21902 UNSPEC_VSX_MODUQ): New unspecs.
21903 (mulv2di3, vsx_div_v1ti, vsx_udiv_v1ti, vsx_dives_v1ti,
21904 vsx_diveu_v1ti, vsx_mods_v1ti, vsx_modu_v1ti, xxswapd_v1ti): New
21906 (vcmpnet): New define_expand.
21907 * doc/extend.texi: Add documentation for the new builtins vec_rl,
21908 vec_rlmi, vec_rlnm, vec_sl, vec_sr, vec_sra, vec_mule, vec_mulo,
21909 vec_div, vec_dive, vec_mod, vec_cmpeq, vec_cmpne, vec_cmpgt, vec_cmplt,
21910 vec_cmpge, vec_cmple, vec_all_eq, vec_all_ne, vec_all_gt, vec_all_lt,
21911 vec_all_ge, vec_all_le, vec_any_eq, vec_any_ne, vec_any_gt, vec_any_lt,
21912 vec_any_ge, vec_any_le.
21914 2021-06-09 Carl Love <cel@us.ibm.com>
21916 * config/rs6000/altivec.md (altivec_vrl<VI_char>mi): Fix
21917 bug in argument generation.
21919 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
21921 * config/arm/iterators.md (<supf>): Remove VCLZQ_U, VCLZQ_S.
21923 * config/arm/mve.md (mve_vclzq_<supf><mode>): Add '@' prefix,
21924 remove <supf> iterator.
21925 (mve_vclzq_u<mode>): New.
21926 * config/arm/neon.md (clz<mode>2): Rename to neon_vclz<mode>.
21927 (neon_vclz<mode): Move to ...
21928 * config/arm/unspecs.md (VCLZQ_U, VCLZQ_S): Remove.
21929 * config/arm/vec-common.md: ... here. Add support for MVE.
21931 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
21933 * config/arm/mve.md (mve_vhaddq_<supf><mode>): Prefix with '@'.
21934 (@mve_vrhaddq_<supf><mode): Likewise.
21935 * config/arm/neon.md (neon_v<r>hadd<sup><mode>): Likewise.
21936 * config/arm/vec-common.md (avg<mode>3_floor, uavg<mode>3_floor)
21937 (avg<mode>3_ceil", uavg<mode>3_ceil): New patterns.
21939 2021-06-09 imba-tjd <109224573@qq.com>
21941 * doc/invoke.texi: Fix typo.
21943 2021-06-09 Roger Sayle <roger@nextmovesoftware.com>
21945 PR middle-end/53267
21946 * fold-const-call.c (fold_const_call_sss) [CASE_CFN_FMOD]:
21947 Support evaluation of fmod/fmodf/fmodl at compile-time.
21949 2021-06-09 Richard Biener <rguenther@suse.de>
21951 PR tree-optimization/100981
21952 * tree-vect-loop.c (vect_create_epilog_for_reduction): Use
21953 gimple_get_lhs to also handle calls.
21954 * tree-vect-slp-patterns.c (complex_pattern::build): Transfer
21957 2021-06-09 Richard Biener <rguenther@suse.de>
21959 PR tree-optimization/97832
21960 * tree-vectorizer.h (_slp_tree::failed): New.
21961 * tree-vect-slp.c (_slp_tree::_slp_tree): Initialize
21963 (_slp_tree::~_slp_tree): Free failed.
21964 (vect_build_slp_tree): Retain failed nodes and record
21965 matches in them, copying that back out when running
21966 into a cached fail. Dump start and end of discovery.
21967 (dt_sort_cmp): New.
21968 (vect_build_slp_tree_2): Handle associatable chains
21969 together doing more aggressive operand swapping.
21971 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
21974 * config.gcc (gcc_cv_initfini_array): Set to yes for Linux and
21976 * doc/install.texi: Require glibc 2.1 and binutils 2.12 for
21977 Linux and GNU targets.
21979 2021-06-09 Richard Biener <rguenther@suse.de>
21981 * tree-vect-stmts.c (vect_is_simple_use): Always get dt
21984 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21986 * config/arc/arc.md (loop_end): Change it to
21987 define_insn_and_split.
21989 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21991 * config/arc/arc.md (maddhisi4): Use VMAC2H instruction.
21992 (machi): New pattern.
21993 (umaddhisi4): Use VMAC2HU instruction.
21994 (umachi): New pattern.
21996 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21998 * config/arc/arc-protos.h (arc_split_move_p): New prototype.
21999 * config/arc/arc.c (arc_split_move_p): New function.
22000 (arc_split_move): Clean up.
22001 * config/arc/arc.md (movdi_insn): Clean up, use arc_split_move_p.
22002 (movdf_insn): Likewise.
22003 * config/arc/simdext.md (mov<VWH>_insn): Likewise.
22005 2021-06-09 Uroš Bizjak <ubizjak@gmail.com>
22008 * config/i386/i386.c (print_operand_address_as): Rename "no_rip"
22009 argument to "raw". Do not emit segment overrides when "raw" is true.
22011 2021-06-09 Martin Liska <mliska@suse.cz>
22013 * doc/gcov.texi: Create a proper JSON files.
22014 * doc/invoke.texi: Remove dots in order to make it a valid
22017 2021-06-09 Xionghu Luo <luoxhu@linux.ibm.com>
22019 * config/rs6000/rs6000-p8swap.c (pattern_is_rotate64): New.
22020 (insn_is_load_p): Use pattern_is_rotate64.
22021 (insn_is_swap_p): Likewise.
22022 (quad_aligned_load_p): Likewise.
22023 (const_load_sequence_p): Likewise.
22024 (replace_swapped_aligned_load): Likewise.
22025 (recombine_lvx_pattern): Likewise.
22026 (recombine_stvx_pattern): Likewise.
22028 2021-06-09 Andrew MacLeod <amacleod@redhat.com>
22030 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Use a
22031 fur_stmt source record.
22032 * gimple-range.cc (fur_source::get_operand): Generic range query.
22033 (fur_source::get_phi_operand): New.
22034 (fur_source::register_dependency): New.
22035 (fur_source::query): New.
22036 (class fur_edge): New. Edge source for operands.
22037 (fur_edge::fur_edge): New.
22038 (fur_edge::get_operand): New.
22039 (fur_edge::get_phi_operand): New.
22040 (fur_edge::query): New.
22041 (fur_stmt::fur_stmt): New.
22042 (fur_stmt::get_operand): New.
22043 (fur_stmt::get_phi_operand): New.
22044 (fur_stmt::query): New.
22045 (class fur_depend): New. Statement source and process dependencies.
22046 (fur_depend::fur_depend): New.
22047 (fur_depend::register_dependency): New.
22048 (class fur_list): New. List source for operands.
22049 (fur_list::fur_list): New.
22050 (fur_list::get_operand): New.
22051 (fur_list::get_phi_operand): New.
22052 (fold_range): New. Instantiate appropriate fur_source class and fold.
22053 (fold_using_range::range_of_range_op): Use new API.
22054 (fold_using_range::range_of_address): Ditto.
22055 (fold_using_range::range_of_phi): Ditto.
22056 (imple_ranger::fold_range_internal): Use fur_depend class.
22057 (fold_using_range::range_of_ssa_name_with_loop_info): Use new API.
22058 * gimple-range.h (class fur_source): Now a base class.
22059 (class fur_stmt): New.
22060 (fold_range): New prototypes.
22061 (fur_source::fur_source): Delete.
22063 2021-06-08 Andrew Pinski <apinski@marvell.com>
22065 PR tree-optimization/25290
22066 * tree-ssa-phiopt.c (xor_replacement): Delete.
22067 (tree_ssa_phiopt_worker): Delete use of xor_replacement.
22068 (match_simplify_replacement): Allow one cheap preparation
22069 statement that can be moved to before the if.
22071 2021-06-08 Pat Haugen <pthaugen@linux.ibm.com>
22073 * config/rs6000/power10.md (power10-fused-load, power10-fused-store,
22074 power10-fused_alu, power10-fused-vec, power10-fused-branch): New.
22076 2021-06-08 Jeff Law <jeffreyalaw@gmail.com>
22078 * config/h8300/logical.md (andqi3_1): Move BCLR case into define_insn_and_split.
22079 Create length attribute on define_insn_and_split. Only split for cases which we
22081 (andqi3_1<cczn>): Renamed from andqi3_1_clobber_flags. Only handle AND here and
22082 fix length computation.
22083 (b<code><mode>msx): Combine QImode and HImode H8/SX patterns using iterator.
22085 2021-06-08 Richard Biener <rguenther@suse.de>
22087 PR tree-optimization/100923
22088 * tree-ssa-sccvn.c (valueize_refs_1): Take a pointer to
22089 the operand vector to be valueized.
22090 (valueize_refs): Likewise.
22091 (valueize_shared_reference_ops_from_ref): Adjust.
22092 (valueize_shared_reference_ops_from_call): Likewise.
22093 (vn_reference_lookup_3): Likewise.
22094 (vn_reference_lookup_pieces): Likewise. Re-valueize
22095 with honoring availability when we are about to create
22096 the ao_ref and valueized before.
22097 (vn_reference_lookup): Likewise.
22098 (vn_reference_insert_pieces): Adjust.
22100 2021-06-08 Richard Biener <rguenther@suse.de>
22102 * tree-vectorizer.h (_slp_instance::root_stmt): Change to...
22103 (_slp_instance::root_stmts): ... a vector.
22104 (SLP_INSTANCE_ROOT_STMT): Rename to ...
22105 (SLP_INSTANCE_ROOT_STMTS): ... this.
22106 (slp_root::root): Change to...
22107 (slp_root::roots): ... a vector.
22108 (slp_root::slp_root): Adjust.
22109 * tree-vect-slp.c (_slp_instance::location): Adjust.
22110 (vect_free_slp_instance): Release the root stmt vector.
22111 (vect_build_slp_instance): Adjust.
22112 (vect_analyze_slp): Likewise.
22113 (_bb_vec_info::~_bb_vec_info): Likewise.
22114 (vect_slp_analyze_operations): Likewise.
22115 (vect_bb_vectorization_profitable_p): Likewise. Adjust
22116 costs for the root stmt.
22117 (vect_slp_check_for_constructors): Gather all BIT_INSERT_EXPRs
22119 (vect_slp_analyze_bb_1): Simplify by marking all root stmts
22121 (vectorize_slp_instance_root_stmt): Adjust.
22122 (vect_schedule_slp): Likewise.
22124 2021-06-08 Aldy Hernandez <aldyh@redhat.com>
22126 * gimple-ssa-evrp.c (class ssa_equiv_stack): New.
22127 (ssa_equiv_stack::ssa_equiv_stack): New.
22128 (ssa_equiv_stack::~ssa_equiv_stack): New.
22129 (ssa_equiv_stack::enter): New.
22130 (ssa_equiv_stack::leave): New.
22131 (ssa_equiv_stack::push_replacement): New.
22132 (ssa_equiv_stack::get_replacement): New.
22133 (is_pointer_ssa): New.
22134 (class pointer_equiv_analyzer): New.
22135 (pointer_equiv_analyzer::pointer_equiv_analyzer): New.
22136 (pointer_equiv_analyzer::~pointer_equiv_analyzer): New.
22137 (pointer_equiv_analyzer::set_global_equiv): New.
22138 (pointer_equiv_analyzer::set_cond_equiv): New.
22139 (pointer_equiv_analyzer::get_equiv): New.
22140 (pointer_equiv_analyzer::enter): New.
22141 (pointer_equiv_analyzer::leave): New.
22142 (pointer_equiv_analyzer::get_equiv_expr): New.
22143 (pta_valueize): New.
22144 (pointer_equiv_analyzer::visit_stmt): New.
22145 (pointer_equiv_analyzer::visit_edge): New.
22146 (hybrid_folder::value_of_expr): Call PTA.
22147 (hybrid_folder::value_on_edge): Same.
22148 (hybrid_folder::pre_fold_bb): New.
22149 (hybrid_folder::post_fold_bb): New.
22150 (hybrid_folder::pre_fold_stmt): New.
22151 (rvrp_folder::pre_fold_bb): New.
22152 (rvrp_folder::post_fold_bb): New.
22153 (rvrp_folder::pre_fold_stmt): New.
22154 (rvrp_folder::value_of_expr): Call PTA.
22155 (rvrp_folder::value_on_edge): Same.
22157 2021-06-08 Jakub Jelinek <jakub@redhat.com>
22160 * tree-inline.c (copy_tree_body_r): For OMP_CLAUSE_DEPEND don't
22161 check TREE_CODE if OMP_CLAUSE_DECL is NULL.
22163 2021-06-08 Richard Biener <rguenther@suse.de>
22165 PR middle-end/100951
22166 * tree-vect-generic.c (expand_vector_piecewise): Build a
22167 VECTOR_CST if all elements are constant.
22168 (expand_vector_condition): Likewise.
22169 (lower_vec_perm): Likewise.
22170 (expand_vector_conversion): Likewise.
22172 2021-06-08 Martin Liska <mliska@suse.cz>
22174 * doc/invoke.texi: Document new param evrp-sparse-threshold.
22176 2021-06-08 Martin Liska <mliska@suse.cz>
22178 * genautomata.c (create_automata): Fix typo.
22180 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
22182 PR tree-optimization/100794
22183 * tree-predcom.c (tree_predictive_commoning_loop): Add parameter
22184 allow_unroll_p and only allow unrolling when it's true.
22185 (tree_predictive_commoning): Add parameter allow_unroll_p and
22187 (run_tree_predictive_commoning): Likewise.
22188 (pass_predcom::gate): Check flag_tree_loop_vectorize and
22189 global_options_set.x_flag_predictive_commoning.
22190 (pass_predcom::execute): Adjust for allow_unroll_p.
22192 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
22194 * tree-predcom.c (execute_pred_commoning): Remove update_ssa call.
22195 (tree_predictive_commoning_loop): Factor some cleanup stuffs into
22196 lambda function cleanup, remove scev_reset call, and adjust return
22198 (tree_predictive_commoning): Adjust for different changed values,
22199 only set flag TODO_update_ssa_only_virtuals if changed.
22200 (pass_data pass_data_predcom): Remove TODO_update_ssa_only_virtuals
22201 from todo_flags_finish.
22203 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
22205 * gimple-range-cache.cc (class sbr_sparse_bitmap): New.
22206 (sbr_sparse_bitmap::sbr_sparse_bitmap): New.
22207 (sbr_sparse_bitmap::bitmap_set_quad): New.
22208 (sbr_sparse_bitmap::bitmap_get_quad): New.
22209 (sbr_sparse_bitmap::set_bb_range): New.
22210 (sbr_sparse_bitmap::get_bb_range): New.
22211 (sbr_sparse_bitmap::bb_range_p): New.
22212 (block_range_cache::block_range_cache): initialize bitmap obstack.
22213 (block_range_cache::~block_range_cache): Destruct obstack.
22214 (block_range_cache::set_bb_range): Decide when to utilze the
22215 sparse on entry cache.
22216 * gimple-range-cache.h (block_range_cache): Add bitmap obstack.
22217 * params.opt (-param=evrp-sparse-threshold): New.
22219 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
22221 * bitmap.c (bitmap_set_aligned_chunk): New.
22222 (bitmap_get_aligned_chunk): New.
22223 (test_aligned_chunk): New.
22224 (bitmap_c_tests): Call test_aligned_chunk.
22225 * bitmap.h (bitmap_set_aligned_chunk, bitmap_get_aligned_chunk): New.
22227 2021-06-07 Uroš Bizjak <ubizjak@gmail.com>
22230 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
22232 (ix86_expand_vector_init_one_nonzero): Ditto.
22233 (ix86_expand_vector_init_one_var): Ditto.
22234 (ix86_expand_vector_init_general): Ditto.
22235 * config/i386/mmx.md (vec_initv4qiqi): New expander.
22237 2021-06-07 Jeff Law <jeffreyalaw@gmail.com>
22239 * config/h8300/movepush.md: Change most _clobber_flags
22240 patterns to instead use <cczn> subst.
22241 (movsi_cczn): New pattern with usable CC cases split out.
22242 (movsi_h8sx_cczn): Likewise.
22244 2021-06-07 Martin Liska <mliska@suse.cz>
22246 * common/common-target.def: Split long lines and replace them
22248 * target.def: Likewise.
22249 * doc/tm.texi: Re-generated.
22251 2021-06-07 Jakub Jelinek <jakub@redhat.com>
22254 * fold-const.c (fold_read_from_vector): Return NULL if trying to
22255 read from a CONSTRUCTOR with vector type elements.
22257 2021-06-07 Jakub Jelinek <jakub@redhat.com>
22259 PR middle-end/100898
22260 * tree-inline.c (copy_bb): Only use gimple_call_arg_ptr if memcpy
22261 should copy any arguments. Don't call gimple_call_num_args
22262 on id->call_stmt or call_stmt more than once.
22264 2021-06-07 liuhongt <hongtao.liu@intel.com>
22267 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): Refine
22269 (<insn>v4siv4di2): Delete constraints for define_expand.
22271 2021-06-07 liuhongt <hongtao.liu@intel.com>
22274 * config/i386/i386-expand.c (ix86_expand_builtin): Remove
22275 assignment of cfun->machine->has_explicit_vzeroupper.
22276 * config/i386/i386-features.c
22277 (ix86_add_reg_usage_to_vzerouppers): Delete.
22278 (ix86_add_reg_usage_to_vzeroupper): Ditto.
22279 (rest_of_handle_insert_vzeroupper): Remove
22280 ix86_add_reg_usage_to_vzerouppers, add df_analyze at the end
22282 (gate): Remove cfun->machine->has_explicit_vzeroupper.
22283 * config/i386/i386-protos.h (ix86_expand_avx_vzeroupper):
22285 * config/i386/i386.c (ix86_insn_callee_abi): New function.
22286 (ix86_initialize_callee_abi): Ditto.
22287 (ix86_expand_avx_vzeroupper): Ditto.
22288 (ix86_hard_regno_call_part_clobbered): Adjust for vzeroupper
22290 (TARGET_INSN_CALLEE_ABI): Define as ix86_insn_callee_abi.
22291 (ix86_emit_mode_set): Call ix86_expand_avx_vzeroupper
22293 * config/i386/i386.h (struct GTY(()) machine_function): Delete
22294 has_explicit_vzeroupper.
22295 * config/i386/i386.md (enum unspec): New member
22297 (ABI_DEFAULT,ABI_VZEROUPPER,ABI_UNKNOWN): New
22298 define_constants for insn callee abi index.
22299 * config/i386/predicates.md (vzeroupper_pattern): Adjust.
22300 * config/i386/sse.md (UNSPECV_VZEROUPPER): Deleted.
22301 (avx_vzeroupper): Call ix86_expand_avx_vzeroupper.
22302 (*avx_vzeroupper): Rename to ..
22303 (avx_vzeroupper_callee_abi): .. this, and adjust pattern as
22304 call_insn which has a special vzeroupper ABI.
22305 (*avx_vzeroupper_1): Deleted.
22307 2021-06-07 liuhongt <hongtao.liu@intel.com>
22310 * df-scan.c (df_get_call_refs): When call_insn is a fake call,
22311 it won't use stack pointer reg.
22312 * final.c (leaf_function_p): When call_insn is a fake call, it
22313 won't affect caller as a leaf function.
22314 * reg-stack.c (callee_clobbers_any_stack_reg): New.
22315 (subst_stack_regs): When call_insn doesn't clobber any stack
22316 reg, don't clear the arguments.
22317 * rtl.c (shallow_copy_rtx): Don't clear flag used when orig is
22319 * shrink-wrap.c (requires_stack_frame_p): No need for stack
22320 frame for a fake call.
22321 * rtl.h (FAKE_CALL_P): New macro.
22323 2021-06-06 Eric Botcazou <ebotcazou@adacore.com>
22325 * config/sparc/sparc-protos.h (order_regs_for_local_alloc): Rename
22327 (sparc_order_regs_for_local_alloc): ...this.
22328 (sparc_leaf_reg_remap): Declare.
22329 * config/sparc/sparc.h (ADJUST_REG_ALLOC_ORDER): Adjust.
22330 (LEAF_REG_REMAP): Reimplement as call to sparc_leaf_reg_remap.
22331 * config/sparc/sparc.c (leaf_reg_remap): Delete.
22332 (order_regs_for_local_alloc): Rename to...
22333 (sparc_order_regs_for_local_alloc): ...this.
22334 (sparc_leaf_reg_remap): New function.
22335 (sparc_conditional_register_usage): Do not modify leaf_reg_remap.
22337 2021-06-06 David Edelsohn <dje.gcc@gmail.com>
22339 * config/rs6000/rs6000.c (rs6000_xcoff_asm_output_aligned_decl_common):
22340 Use assemble_name to output BSS section name.
22342 2021-06-06 Uroš Bizjak <ubizjak@gmail.com>
22344 * config/i386/constraints.md (Bs):
22345 Remove boolean operators from match_test RTX.
22348 (M): Use "mode" variable instead of GET_MODE (op) in match_test RTX.
22351 2021-06-06 Martin Liska <mliska@suse.cz>
22353 * doc/extend.texi: Add missing @headitem.
22354 * doc/invoke.texi: Likewise.
22355 * doc/objc.texi: Likewise.
22357 2021-06-06 Martin Liska <mliska@suse.cz>
22359 * genhooks.c (emit_findices): Remove unused function.
22360 (emit_documentation): Do not call emit_findices
22361 and do not search for @Fcode directives.
22363 2021-06-06 Martin Liska <mliska@suse.cz>
22365 * doc/invoke.texi: Remove extra character.
22367 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22369 * config/sh/sh.md (doloop_end_split): Fix empty split condition.
22371 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22373 * config/sparc/sparc.md (*snedi<W:mode>_zero_vis3,
22374 *neg_snedi<W:mode>_zero_subxc, *plus_snedi<W:mode>_zero,
22375 *plus_plus_snedi<W:mode>_zero, *minus_snedi<W:mode>_zero,
22376 *minus_minus_snedi<W:mode>_zero): Fix empty split condition.
22378 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22380 * config/or1k/or1k.md (*movdi): Fix empty split condition.
22382 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22384 * config/mips/mips.md (<anonymous>, bswapsi2, bswapdi2): Fix empty
22387 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22389 * config/m68k/m68k.md (*zero_extend_inc, *zero_extend_dec,
22390 *zero_extendsidi2): Fix empty split condition.
22392 2021-06-05 Jeff Law <jeffreyalaw@gmail.com>
22394 * config/h8300/addsub.md: Fix split condition in define_insn_and_split
22396 * config/h8300/bitfield.md: Likewise.
22397 * config/h8300/combiner.md: Likewise.
22398 * config/h8300/divmod.md: Likewise.
22399 * config/h8300/extensions.md: Likewise.
22400 * config/h8300/jumpcall.md: Likewise.
22401 * config/h8300/movepush.md: Likewise.
22402 * config/h8300/multiply.md: Likewise.
22403 * config/h8300/other.md: Likewise.
22404 * config/h8300/shiftrotate.md: Likewise.
22405 * config/h8300/logical.md: Likewise. Fix split pattern to use
22406 code iterator that somehow slipped through.
22408 2021-06-04 Tobias Burnus <tobias@codesourcery.com>
22410 PR middle-end/100905
22411 * tree-nested.c (convert_nonlocal_omp_clauses,
22412 convert_local_omp_clauses): Handle OMP_CLAUSE_BIND.
22414 2021-06-04 Martin Sebor <msebor@redhat.com>
22416 PR middle-end/100732
22417 * gimple-fold.c (gimple_fold_builtin_sprintf): Avoid folding calls
22418 with either source or destination argument of invalid type.
22419 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Avoid checking
22420 calls with arguments of invalid type.
22422 2021-06-04 Martin Sebor <msebor@redhat.com>
22424 * attribs.c (init_attr_rdwr_indices): Use VLA bounds in the expected
22426 (attr_access::vla_bounds): Also handle VLA bounds.
22428 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
22430 * config/i386/predicates.md (GOT_memory_operand):
22431 Implement using match_code RTXes.
22432 (GOT32_symbol_operand): Ditto.
22434 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
22437 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
22439 (ix86_expand_vector_init_general): Ditto.
22440 Use SImode instead of word_mode for logic operations
22441 when GET_MODE_SIZE (mode) < UNITS_PER_WORD.
22442 (expand_vec_perm_even_odd_1): Assert that V2HI mode should be
22443 implemented by expand_vec_perm_1.
22444 (expand_vec_perm_broadcast_1): Assert that V2HI and V4HI modes
22445 should be implemented using standard shuffle patterns.
22446 (ix86_vectorize_vec_perm_const): Handle V2HImode. Add V4HI and
22447 V2HI modes to modes, implementable with shuffle for one operand.
22448 * config/i386/mmx.md (*punpckwd): New insn_and_split pattern.
22449 (*pshufw_1): New insn pattern.
22450 (*vec_dupv2hi): Ditto.
22451 (vec_initv2hihi): New expander.
22453 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
22455 * config/arm/vfp.md (no_literal_pool_df_immediate,
22456 no_literal_pool_sf_immediate): Fix empty split condition.
22458 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
22460 * config/i386/i386.md (*load_tp_x32_zext, *add_tp_x32_zext,
22461 *tls_dynamic_gnu2_combine_32): Fix empty split condition.
22462 * config/i386/sse.md (*<sse2_avx2>_pmovmskb_lt,
22463 *<sse2_avx2>_pmovmskb_zext_lt, *sse2_pmovmskb_ext_lt,
22464 *<sse4_1_avx2>_pblendvb_lt): Likewise.
22466 2021-06-04 Jakub Jelinek <jakub@redhat.com>
22469 * config/i386/i386-expand.c (ix86_expand_vector_init): Handle
22470 concatenation from half-sized modes with TImode elements.
22472 2021-06-04 Claudiu Zissulescu <claziss@synopsys.com>
22474 * config/arc/arc.c (arc_override_options): Disable millicode
22475 thunks when RF16 is on.
22477 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
22479 * config/rs6000/rs6000.h (PROMOTE_MODE): Remove.
22481 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
22483 * config/rs6000/rs6000-call.c (rs6000_promote_function_mode):
22484 Replace PROMOTE_MODE marco with its content.
22486 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
22488 * config/cris/cris.md (*addi_reload): Fix empty split condition.
22490 2021-06-03 Jim Wilson <jimw@sifive.com>
22492 * config.gcc (riscv*-*-*): If --with-riscv-attribute not used,
22493 turn it on for all riscv targets.
22495 2021-06-03 Uroš Bizjak <ubizjak@gmail.com>
22498 * config/i386/i386-expand.c (ix86_expand_vector_set):
22499 Handle V2HI and V4QI modes.
22500 (ix86_expand_vector_extract): Ditto.
22501 * config/i386/mmx.md (*pinsrw): New insn pattern.
22504 (*pextrw_zext): Ditto.
22506 (*pextrb_zext): Ditto.
22507 (vec_setv2hi): New expander.
22508 (vec_extractv2hihi): Ditto.
22509 (vec_setv4qi): Ditto.
22510 (vec_extractv4qiqi): Ditto.
22511 (vec_setv8qi): Enable only for TARGET_SSE4_1.
22512 (vec_extractv8qiqi): Ditto.
22514 2021-06-03 Aaron Sawdey <acsawdey@linux.ibm.com>
22516 * config/rs6000/genfusion.pl (gen_logical_addsubf): Fix input
22517 order to subf instruction.
22518 * config/rs6000/fusion.md: Regenerate.
22520 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22522 * calls.c (get_size_range): Use range_of_expr instead of
22523 determine_value_range.
22524 * tree-affine.c (expr_to_aff_combination): Same.
22525 * tree-data-ref.c (split_constant_offset): Same.
22526 * tree-vrp.c (determine_value_range_1): Remove.
22527 (determine_value_range): Remove.
22528 * tree-vrp.h (determine_value_range): Remove.
22530 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22532 * function-tests.c (test_ranges): Call gimple_range_tests.
22533 * gimple-range-cache.cc (ranger_cache::range_of_expr): Pass stmt
22535 * gimple-range.cc (fur_source::get_operand): Do not call
22536 get_tree_range or gimple_range_global.
22538 (get_tree_range): Move to value-query.cc.
22539 Call get_arith_expr_range.
22540 (gimple_ranger::range_of_expr): Add argument to get_tree_range.
22541 Include gimple-range-tests.cc.
22542 * gimple-range.h (fold_range): Add argument.
22543 (get_tree_range): Remove.
22544 * selftest.h (gimple_range_tests): New.
22545 * value-query.cc (global_range_query::range_of_expr): Add
22547 (range_query::get_tree_range): Move from gimple-range.cc.
22548 * value-query.h (class range_query): Add get_tree_range and
22549 get_arith_expr_range. Make fur_source a friend.
22550 * vr-values.c (vr_values::range_of_expr): Pass stmt to
22552 * gimple-range-tests.cc: New file.
22554 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22556 * gimple-range.cc (gimple_ranger::export_global_ranges): Call
22557 update_global_range.
22558 * value-query.cc (update_global_range): New.
22559 * value-query.h (update_global_range): New.
22561 2021-06-03 David Malcolm <dmalcolm@redhat.com>
22563 * diagnostic-show-locus.c (diagnostic_show_locus): Don't reject
22564 printing the same location twice if there are fix-it hints,
22565 multiple locations, or a label.
22567 2021-06-03 Andre Vieira <andre.simoesdiasvieira@arm.com>
22569 * tree-vect-loop.c (vect_transform_loop): Use main loop's various'
22570 thresholds to narrow the upper bound on epilogue iterations.
22572 2021-06-03 Christophe Lyon <christophe.lyon@linaro.org>
22574 * config/arm/mve.md (mve_vabsq_f<mode>): Use 'abs' instead of unspec.
22575 (mve_vabsq_s<mode>): Likewise.
22576 * config/arm/neon.md (abs<mode>2): Rename to neon_abs<mode>2.
22577 * config/arm/unspecs.md (VABSQ_F, VABSQ_S): Delete.
22578 * config/arm/vec-common.md (neg<mode>2): Rename to
22579 <absneg_str><mode>2.
22581 2021-06-03 Claudiu Zissulescu <claziss@synopsys.com>
22583 * common/config/arc/arc-common.c (arc_option_optimization_table):
22584 Remove malign-call.
22585 * config/arc/arc.c (arc_unalign_branch_p): Remove unused function.
22586 * config/arc/arc.h (TARGET_MIXED_CODE): Remove macro.
22587 (INDEX_REG_CLASS): Only refer to GENERAL_REGS.
22588 * config/arc/arc.md (abssi2_mixed): Remove pattern.
22589 * config/arc/arc.opt (munalign-prob-threshold): Mark it obsolete.
22590 (malign-call): Likewise.
22591 (mmixed-code): Likewise.
22592 * doc/invoke.texi (ARC): Update doc.
22594 2021-06-03 Martin Liska <mliska@suse.cz>
22596 * common.opt: Use proper Enum values.
22597 * opts.c (COVERAGE_SANITIZER_OPT): Remove.
22598 (parse_sanitizer_options): Handle only sanitizer_opts.
22599 (common_handle_option): Just assign value.
22601 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22604 * tree-inline.c (inline_forbidden_p): Remove test on return type.
22606 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22608 * dwarf2out.c (loc_list_from_tree_1) <FUNCTION_DECL>: Also generate
22609 DW_OP_GNU_variable_value referencing an existing DIE at file scope.
22610 (type_byte_size): Inline into...
22611 (add_byte_size_attribute): ...this and call add_scalar_info.
22613 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22615 * dwarf2out.c (mem_loc_descriptor) <UDIV>: Fix typo.
22616 (typed_binop_from_tree): New function.
22617 (loc_list_from_tree_1) <EXACT_DIV_EXPR>: For an unsigned type,
22618 turn a divide by a power of 2 into a shift.
22619 <CEIL_DIV_EXPR>: For an unsigned type, use a signed divide if the
22620 size of the mode is lower than DWARF2_ADDR_SIZE; otherwise, do a
22621 typed divide by calling typed_binop_from_tree.
22623 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22625 * dwarf2out.c (scompare_loc_descriptor): Fix head comment.
22626 (is_handled_procedure_type): Likewise.
22627 (struct loc_descr_context): Add strict_signedness field.
22628 (resolve_args_picking_1): Deal with DW_OP_[GNU_]deref_type,
22629 DW_OP_[GNU_]convert and DW_OP_[GNU_]reinterpret.
22630 (resolve_args_picking): Minor tweak.
22631 (function_to_dwarf_procedure): Initialize strict_signedness field.
22632 (type_byte_size): Likewise.
22633 (field_byte_offset): Likewise.
22634 (gen_descr_array_type_die): Likewise.
22635 (gen_variant_part): Likewise.
22636 (loc_list_from_tree_1) <CALL_EXPR>: Tidy up and set strict_signedness
22637 to true when a context is present before evaluating the arguments.
22638 <COND_EXPR>: Do not generate a useless comparison with zero.
22639 When dereferencing an address, if strict_signedness is true and the
22640 type is small and signed, use DW_OP_deref_type to do the dereference
22641 and then DW_OP_convert to convert back to the generic type.
22643 2021-06-03 Jakub Jelinek <jakub@redhat.com>
22646 * tree-inline.c (copy_tree_body_r): Handle iterators on
22647 OMP_CLAUSE_AFFINITY or OMP_CLAUSE_DEPEND.
22649 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
22651 * config/arc/arc.md (*bbit_di): Remove.
22653 2021-06-02 Christoph Muellner <cmuellner@gcc.gnu.org>
22655 PR rtl-optimization/100264
22656 * ree.c (get_sub_rtx): Ignore SET expressions without register
22657 destinations and remove assertion, as it is not valid anymore
22658 with this new behaviour.
22659 (merge_def_and_ext): Eliminate destination check for register
22660 as such SET expressions can't occur anymore.
22661 (combine_reaching_defs): Likewise.
22663 2021-06-02 Jakub Jelinek <jakub@redhat.com>
22666 * config/xtensa/xtensa.h (LEAF_REG_REMAP): Cast REGNO to int to avoid
22667 -Wtype-limits warnings.
22668 (DWARF_FRAME_REGISTER): Rewrite into ternary operator with addition
22669 in operands to avoid -Wsign-compare warnings.
22671 2021-06-02 Pat Haugen <pthaugen@linux.ibm.com>
22673 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): Use
22676 2021-06-02 Vineet Gupta <vgupta@synopsys.com>
22678 * config/arc/arc.h (TARGET_CPU_DEFAULT): Change to hs38_linux.
22680 2021-06-02 Ilya Leoshkevich <iii@linux.ibm.com>
22682 * config/s390/s390.md(*ashrdi3_31<setcc><cconly>): Use a single
22684 * config/s390/subst.md(cconly_subst): Use a single constraint
22685 in (match_scratch).
22687 2021-06-02 Martin Liska <mliska@suse.cz>
22689 * ipa-icf.h: Use auto_vec for memory_access_types.
22691 2021-06-02 Jeff Law <jeffreyalaw@gmail.com>
22693 * config/h8300/h8300-protos.h (compute_a_shift_length): Drop unused
22694 argument from prototype.
22695 (output_logical_op): Add rtx_code argument.
22696 (compute_logical_op_length): Likewise.
22697 * config/h8300/h8300.c (h8300_and_costs): Pass additional argument
22698 to compute_a_shift_length.
22699 (output_logical_op); New argument with the rtx code rather than
22700 extracting it from an operand. Handle QImode too.
22701 (compute_logical_op_length): Similary.
22702 (compute_a_shift_length): Drop unused argument.
22703 * config/h8300/h8300.md (logicals): New code iterator.
22704 * config/h8300/logical.md (<code><mode>3 expander): Combine
22705 the "and" expander with the "ior"/"xor" expander.
22706 (bclr<mode>msx): Combine the QI/HI mode patterns.
22707 (<logical><mode>3 insns): Use code iterator rather than match_operator.
22708 Handle QImode as well. Update call to output_logical_op and
22709 compute_logical_op_length to pass in rtx_code
22710 Fix split condition on all define_insn_and_split patterns.
22711 (one_cmpl<mode>2<cczn>): Use <cczn> to support both clobbering
22712 the flags and setting ZN via existing define_subst.
22713 * config/h8300/shiftrotate.md: Drop unused argument from
22714 calls to compute_a_shift_length.
22715 Signed-off-by: Jeff Law <jeffreyalaw@gmail.com>
22717 2021-06-01 Andrew Pinski <apinski@marvell.com>
22719 PR tree-optimization/25290
22720 * tree-ssa-phiopt.c (match_simplify_replacement):
22722 (tree_ssa_phiopt_worker): Use match_simplify_replacement.
22723 (two_value_replacement): Change the comment about
22724 conditional_replacement.
22725 (conditional_replacement): Delete.
22727 2021-06-01 Andrew Pinski <apinski@marvell.com>
22729 PR tree-optimization/95481
22730 * tree-tailcall.c (find_tail_calls): Handle empty typed
22733 2021-06-01 Andrew Pinski <apinski@marvell.com>
22735 * gimplify.c (zero_sized_field_decl): Delete
22736 (zero_sized_type): Delete
22737 (gimplify_init_ctor_eval): Use is_empty_type instead
22738 of zero_sized_field_decl.
22739 (gimplify_modify_expr): Use is_empty_type instead of
22742 2021-06-01 Jason Merrill <jason@redhat.com>
22745 * tree.h (CALL_FROM_NEW_OR_DELETE_P): Adjust comment.
22747 2021-06-01 Jason Merrill <jason@redhat.com>
22750 * diagnostic.h (warning_enabled_at): Declare.
22751 * diagnostic.c (diagnostic_enabled): Factor out from...
22752 (diagnostic_report_diagnostic): ...here.
22753 (warning_enabled_at): New.
22755 2021-06-01 Aldy Hernandez <aldyh@redhat.com>
22757 * gimple-ssa-evrp.c: Enable exporting of global ranges.
22759 2021-06-01 Martin Liska <mliska@suse.cz>
22762 * doc/invoke.texi: Mention that -fgcse-after-reload
22763 is enabled with -O3.
22765 2021-06-01 liuhongt <hongtao.liu@intel.com>
22767 PR tree-optimization/98365
22768 * tree-if-conv.c (strip_nop_cond_scalar_reduction): New function.
22769 (is_cond_scalar_reduction): Handle nop_expr in cond scalar reduction.
22770 (convert_scalar_cond_reduction): Ditto.
22771 (predicate_scalar_phi): Ditto.
22773 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22775 PR tree-optimization/100781
22776 * gimple-range-cache.cc (ranger_cache::ranger_cache): Enable new
22777 value calculation by default.
22778 (ranger_cache::enable_new_values): New.
22779 (ranger_cache::disable_new_values): New.
22780 (ranger_cache::push_poor_value): Check if new values are allowed.
22781 * gimple-range-cache.h (class ranger_cache): New member/methods.
22782 * gimple-range.cc (gimple_ranger::range_of_expr): Check for debug
22783 statement, and disable/renable new value calculation.
22785 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22787 * gimple-range-cache.cc (ranger_cache::ssa_range_in_bb): Delete.
22788 (ranger_cache::range_of_def): New.
22789 (ranger_cache::entry_range): New.
22790 (ranger_cache::exit_range): New.
22791 (ranger_cache::range_of_expr): Adjust.
22792 (ranger_cache::range_on_edge): Adjust.
22793 (ranger_cache::propagate_cache): Call exit_range directly.
22794 * gimple-range-cache.h (class ranger_cache): Adjust.
22796 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22798 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust for
22799 gori_compute being a member rather than base class.
22800 dervied call to member call.
22801 (ranger_cache::dump): No longer dump gori_map.
22802 (ranger_cache::dump_bb): New.
22803 (ranger_cache::get_non_stale_global_range): Adjust for gori_compute
22804 being a member rather than base class.
22805 (ranger_cache::set_global_range): Ditto.
22806 (ranger_cache::ssa_range_in_bb): Ditto.
22807 (ranger_cache::range_of_expr): New.
22808 (ranger_cache::range_on_edge): New.
22809 (ranger_cache::block_range): Adjust for gori_computes. Debug changes.
22810 (ranger_cache::propagate_cache): Adjust debugging output.
22811 (ranger_cache::fill_block_cache): Adjust for gori_computes. Debug
22813 * gimple-range-cache.h (class ranger_cache): Make gori_compute a
22814 member, and inherit from range_query instead.
22815 (ranger_cache::dump_bb): New. split from dump.
22816 * gimple-range-gori.cc (gori_compute::ssa_range_in_bb): Delete.
22817 (gori_compute::expr_range_at_stmt): Delete.
22818 (gori_compute::compute_name_range_op): Delete.
22819 (gori_compute::compute_operand_range_switch): Add fur_source.
22820 (gori_compute::compute_operand_range): Add fur_source param, inline
22821 old compute_name_range_op and optimize_logical_operands.
22822 (struct tf_range): Delete.
22823 (gori_compute::logical_combine): Adjust
22824 (gori_compute::optimize_logical_operands): Delete.
22825 (gori_compute::compute_logical_operands_in_chain): Delete.
22826 (gori_compute::compute_logical_operands): Adjust.
22827 (gori_compute::compute_operand1_range): Adjust to fur_source.
22828 (gori_compute::compute_operand2_range): Ditto.
22829 (gori_compute::compute_operand1_and_operand2_range): Ditto.
22830 (gori_compute::outgoing_edge_range_p): Add range_query parameter,
22831 and adjust to fur_source.
22832 * gimple-range-gori.h (class gori_compute): Simplify and adjust to
22833 range_query and fur_source.
22834 * gimple-range.cc (gimple_ranger::range_on_edge): Query range_on_edge
22835 from the ranger_cache..
22836 (gimple_ranger::fold_range_internal): Adjust to base class change of
22838 (gimple_ranger::dump_bb): Adjust dump.
22839 * gimple-range.h (gimple_ranger):export gori computes object.
22841 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22843 PR tree-optimization/100774
22844 * gimple-range-cache.cc (ranger_cache::get_non_stale_global_range):
22845 Constant values are also not stale.
22846 (ranger_cache::set_global_range): Range invariant values should also
22847 have the correct timestamp.
22849 2021-05-31 Martin Liska <mliska@suse.cz>
22851 * tree-streamer-in.c (unpack_ts_function_decl_value_fields):
22852 Unpack FUNCTION_DECL_DECL_TYPE.
22853 * tree-streamer-out.c (pack_ts_function_decl_value_fields):
22854 Stream FUNCTION_DECL_DECL_TYPE instead of
22855 DECL_IS_OPERATOR_NEW_P.
22856 * tree.h (set_function_decl_type): Use FUNCTION_DECL_DECL_TYPE
22858 (DECL_IS_OPERATOR_NEW_P): Likewise.
22859 (DECL_IS_OPERATOR_DELETE_P): Likewise.
22860 (DECL_LAMBDA_FUNCTION_P): Likewise.
22862 2021-05-31 Richard Biener <rguenther@suse.de>
22865 * internal-fn.c (expand_SHUFFLEVECTOR): Define.
22866 * internal-fn.def (SHUFFLEVECTOR): New.
22867 * internal-fn.h (expand_SHUFFLEVECTOR): Declare.
22868 * doc/extend.texi: Document __builtin_shufflevector.
22870 2021-05-31 Peter Bergner <bergner@linux.ibm.com>
22873 * config/rs6000/predicates.md(mma_assemble_input_operand): Allow
22874 indexed form addresses.
22876 2021-05-29 Jeff Law <jlaw@tachyum.com>
22878 * config/h8300/h8300.c (h8300_emit_stack_adjustment): Drop unused
22879 parameter. Call callers fixed.
22881 (output_plussi): Add FALLTHRU markers.
22882 (h8300_shift_needs_scratch_p): Add gcc_unreachable marker.
22884 2021-05-29 Jakub Jelinek <jakub@redhat.com>
22886 PR middle-end/99928
22887 * gimplify.c (gimplify_scan_omp_clauses): For taskloop simd
22888 combined with parallel, make sure to add shared clause to
22889 parallel for explicit linear clause.
22891 2021-05-29 Aldy Hernandez <aldyh@redhat.com>
22893 PR tree-optimization/100787
22894 * gimple-ssa-evrp.c: Disable exporting of global ranges.
22896 2021-05-28 Jason Merrill <jason@redhat.com>
22898 * tree-iterator.h (struct tree_stmt_iterator): Add operator++,
22899 operator--, operator*, operator==, and operator!=.
22900 (class tsi_range): New.
22902 2021-05-28 Richard Biener <rguenther@suse.de>
22904 PR tree-optimization/100778
22905 * tree-vect-slp.c (vect_build_slp_tree_1): Prevent possibly
22906 trapping ops in different BBs.
22908 2021-05-28 Richard Biener <rguenther@suse.de>
22911 * tree-inline.c (copy_bb): When processing __builtin_va_arg_pack
22912 copy fntype from original call.
22914 2021-05-28 Martin Liska <mliska@suse.cz>
22916 PR gcov-profile/100751
22917 * doc/gcov.texi: Revert partially a hunk that was wrong.
22919 2021-05-28 Cooper Qu <cooper.qu@linux.alibaba.com>
22921 * config/csky/csky-linux-elf.h (HAVE_sync_compare_and_swapqi):
22923 (HAVE_sync_compare_and_swaphi): Likewise.
22924 (HAVE_sync_compare_and_swapsi): Likewise.
22926 2021-05-28 Jakub Jelinek <jakub@redhat.com>
22928 PR middle-end/99928
22929 * tree.h (OMP_CLAUSE_MAP_IMPLICIT): Define.
22931 2021-05-28 Tobias Burnus <tobias@codesourcery.com>
22933 * gimplify.c (gimplify_omp_affinity): New.
22934 (gimplify_scan_omp_clauses): Call it; remove affinity clause afterwards.
22935 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_AFFINITY.
22936 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_AFFINITY.
22937 * tree.c (omp_clause_num_ops, omp_clause_code_name): Add clause.
22938 (walk_tree_1): Handle OMP_CLAUSE_AFFINITY.
22940 2021-05-28 Joern Rennecke <joern.rennecke@riscy-ip.com>
22941 Richard Biener <rguenther@suse.de>
22943 * match.pd <popcount & / + pattern matching>:
22944 When generating popcount directly fails, try doing it in two halves.
22946 2021-05-28 Bernd Edlinger <bernd.edlinger@hotmail.de>
22948 * Makefile.in (generated_files): Add gimple-match.c and
22951 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
22953 * gensupport.c (alter_predicate_for_insn): Handle MATCH_DUP.
22955 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
22957 * gensupport.c (alter_constraints): Add MATCH_SCRATCH case.
22959 2021-05-28 Kewen Lin <linkw@linux.ibm.com>
22961 PR tree-optimization/99398
22962 * tree-ssa-forwprop.c (simplify_permutation): Optimize some cases
22963 where the fed operands are CTOR/CST and propagated through
22964 VIEW_CONVERT_EXPR. Call vec_perm_indices::new_shrunk_vector.
22965 * vec-perm-indices.c (vec_perm_indices::new_shrunk_vector): New
22967 * vec-perm-indices.h (vec_perm_indices::new_shrunk_vector): New
22970 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
22972 * config/i386/mmx.md (addv2sf3): Do not call
22973 ix86_fixup_binary_operands_no_copy.
22976 (<smaxmin:code>v2sf3): Ditto.
22977 (<plusminus:insn><MMXMODEI:mode>3): Ditto.
22978 (<plusminus:insn><VI_32:mode>3): Remove expander.
22979 (<plusminus:insn><VI_32:mode>3): Rename from
22980 "*<plusminus:insn><VI_32:mode>3".
22981 (mulv4hi): Do not call ix86_fixup_binary_operands_no_copy.
22982 (mulv2hi3): Remove expander.
22983 (mulv2hi3): Rename from *mulv2hi3.
22984 (<s>mulv2hi3_highpart): Remove expander.
22985 (<s>mulv2hi3_highpart): Rename from *<s>mulv2hi3_highpart.
22986 (<smaxmin:code><MMXMODE14:mode>3): Rename from
22987 "*mmx_<smaxmin:code><MMXMODE14:mode>3".
22988 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Remove expander.
22989 (SMAXMIN_MMXMODEI): Remove mode iterator.
22990 (<smaxmin:code>v4hi3): New expander.
22991 (<smaxmin:code>v4qi3): Rename from *<smaxmin:code>v4qi3.
22992 (<smaxmin:code>v2hi3): Rename from *<smaxmin:code>v2hi3.
22993 (<smaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
22994 (SMAXMIN_VI_32): Remove mode iterator.
22995 (<umaxmin:code><MMXMODE24:mode>3): Rename from
22996 "*mmx_<umaxmin:code><MMXMODE24:mode>3".
22997 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Remove expander.
22998 (UMAXMIN_MMXMODEI): Remove mode iterator.
22999 (<umaxmin:code>v8qi3): New expander.
23000 (<umaxmin:code>v4qi3): Rename from *<umaxmin:code>v4qi3.
23001 (<umaxmin:code>v2hi3): Rename from *<umaxmin:code>v2hi3.
23002 (<umaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
23003 (UMAXMIN_VI_32): Remove mode iterator.
23004 (<any_shift:insn>v2hi3): Remove expander.
23005 (<any_shift:insn>v2hi3): Rename from *<any_shift:insn>v2hi3.
23006 (<any_logic:code><MMXMODEI:mode>3): Do not call
23007 ix86_fixup_binary_operands_no_copy.
23008 (<any_logic:code><VI_32:mode>3): Remove expander.
23009 (<any_logic:code><VI_32:mode>3): Rename from
23010 "*<any_logic:code><VI_32:mode>3".
23011 (uavg<mode>3_ceil): Do not call ix86_fixup_binary_operands_no_copy.
23012 * config/i386/sse.md (div<VF2:mode>3): Do not call
23013 ix86_fixup_binary_operands_no_copy.
23014 (div<VF1:mode>3): Ditto.
23015 (<maxmin:code><VI8_AVX2_AVX512F:mode>3): Ditto.
23016 (smulhrsv4hi3): Ditto.
23017 (smulhrsv2hi3): Ditto.
23019 2021-05-27 Martin Sebor <msebor@redhat.com>
23021 * ggc.h (gt_ggc_mx): Add overloads for all integers.
23023 * hash-map.h (class hash_map): Add pch_nx_helper overloads for all
23025 (hash_map::operator==): New function.
23027 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
23030 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
23031 For TARGET_XOP bypass SSE comparisons for all supported vector modes.
23032 * config/i386/mmx.md (*xop_maskcmp<MMXMODEI:mode>3): New insn pattern.
23033 (*xop_maskcmp<VI_32:mode>3): Ditto.
23034 (*xop_maskcmp_uns<MMXMODEI:mode>3): Ditto.
23035 (*xop_maskcmp_uns<VI_32:mode>3): Ditto.
23037 2021-05-27 Richard Earnshaw <rearnsha@arm.com>
23040 * config/arm/arm.c (arm_configure_build_target): Remove parameter
23041 opts_set, directly check opts parameters for being non-null.
23042 (arm_option_restore): Update call to arm_configure_build_target.
23043 (arm_option_override): Likewise.
23044 (arm_can_inline_p): Likewise.
23045 (arm_valid_target_attribute_tree): Likewise.
23046 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
23047 * config/arm/arm-protos.h (arm_configure_build_target): Adjust
23050 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
23052 * vr-values.c (simplify_conversion_using_ranges): Use
23053 get_range_query instead of get_global_range_query.
23055 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
23057 * gimple-range.cc (get_range_global): Move to value-query.cc.
23058 (gimple_range_global): Same.
23059 (get_global_range_query): Same.
23060 (global_range_query::range_of_expr): Same.
23061 * gimple-range.h (class global_range_query): Move to
23063 (gimple_range_global): Same.
23064 * tree-ssanames.c (get_range_info): Move to value-query.cc.
23065 (get_ptr_nonnull): Same.
23066 * tree-ssanames.h (get_range_info): Remove.
23067 (get_ptr_nonnull): Remove.
23068 * value-query.cc (get_ssa_name_range_info): Move from
23070 (get_ssa_name_ptr_info_nonnull): Same.
23071 (get_range_global): Move from gimple-range.cc.
23072 (gimple_range_global): Same.
23073 (get_global_range_query): Same.
23074 (global_range_query::range_of_expr): Same.
23075 * value-query.h (class global_range_query): Move from
23077 (gimple_range_global): Same.
23079 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
23082 * config/i386/mmx.md (uavgv4qi3_ceil): New insn pattern.
23083 (uavgv2hi3_ceil): Ditto.
23085 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
23088 * doc/extend.texi (scalar_storage_order): Rephrase slightly.
23090 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23092 * tree-ssanames.c (get_range_info): Merge both copies of
23093 get_range_info into one that works with irange.
23094 * tree-ssanames.h (get_range_info): Remove version that works on
23097 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23099 * builtins.c (check_nul_terminated_array): Convert to get_range_query.
23100 (expand_builtin_strnlen): Same.
23101 (determine_block_size): Same.
23102 * fold-const.c (expr_not_equal_to): Same.
23103 * gimple-fold.c (size_must_be_zero_p): Same.
23104 * gimple-match-head.c: Include gimple-range.h.
23105 * gimple-pretty-print.c (dump_ssaname_info): Convert to get_range_query.
23106 * gimple-ssa-warn-restrict.c
23107 (builtin_memref::extend_offset_range): Same.
23108 * graphite-sese-to-poly.c (add_param_constraints): Same.
23109 * internal-fn.c (get_min_precision): Same.
23110 * ipa-fnsummary.c (set_switch_stmt_execution_predicate): Same.
23111 * ipa-prop.c (ipa_compute_jump_functions_for_edge): Same.
23113 * tree-data-ref.c (split_constant_offset): Same.
23114 (dr_step_indicator): Same.
23115 * tree-dfa.c (get_ref_base_and_extent): Same.
23116 * tree-scalar-evolution.c (iv_can_overflow_p): Same.
23117 * tree-ssa-loop-niter.c (refine_value_range_using_guard): Same.
23118 (determine_value_range): Same.
23119 (record_nonwrapping_iv): Same.
23120 (infer_loop_bounds_from_signedness): Same.
23121 (scev_var_range_cant_overflow): Same.
23122 * tree-ssa-phiopt.c (two_value_replacement): Same.
23123 * tree-ssa-pre.c (insert_into_preds_of_block): Same.
23124 * tree-ssa-reassoc.c (optimize_range_tests_to_bit_test): Same.
23125 * tree-ssa-strlen.c (handle_builtin_stxncpy_strncat): Same.
23127 (dump_strlen_info): Same.
23128 (set_strlen_range): Same.
23129 (maybe_diag_stxncpy_trunc): Same.
23130 (get_len_or_size): Same.
23131 (handle_integral_assign): Same.
23132 * tree-ssa-structalias.c (find_what_p_points_to): Same.
23133 * tree-ssa-uninit.c (find_var_cmp_const): Same.
23134 * tree-switch-conversion.c (bit_test_cluster::emit): Same.
23135 * tree-vect-patterns.c (vect_get_range_info): Same.
23136 (vect_recog_divmod_pattern): Same.
23137 * tree-vrp.c (intersect_range_with_nonzero_bits): Same.
23138 (register_edge_assert_for_2): Same.
23139 (determine_value_range_1): Same.
23140 * tree.c (get_range_pos_neg): Same.
23141 * vr-values.c (vr_values::get_lattice_entry): Same.
23142 (vr_values::update_value_range): Same.
23143 (simplify_conversion_using_ranges): Same.
23145 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23147 * gimple-ssa-warn-alloca.c (alloca_call_type): Use
23148 get_range_query instead of query argument.
23149 (pass_walloca::execute): Enable and disable global ranger.
23151 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23153 * gimple-ssa-evrp.c (rvrp_folder::rvrp_folder): Call
23155 (rvrp_folder::~rvrp_folder): Call disable_ranger.
23156 (hybrid_folder::hybrid_folder): Call enable_ranger.
23157 (hybrid_folder::~hybrid_folder): Call disable_ranger.
23159 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
23161 * function.c (allocate_struct_function): Set cfun->x_range_query.
23162 * function.h (struct function): Declare x_range_query.
23163 (get_range_query): New.
23164 (get_global_range_query): New.
23165 * gimple-range-cache.cc (ssa_global_cache::ssa_global_cache):
23166 Remove call to safe_grow_cleared.
23167 * gimple-range.cc (get_range_global): New.
23168 (gimple_range_global): Move from gimple-range.h.
23169 (get_global_range_query): New.
23170 (global_range_query::range_of_expr): New.
23171 (enable_ranger): New.
23172 (disable_ranger): New.
23173 * gimple-range.h (gimple_range_global): Move to gimple-range.cc.
23174 (class global_range_query): New.
23175 (enable_ranger): New.
23176 (disable_ranger): New.
23177 * gimple-ssa-evrp.c (evrp_folder::~evrp_folder): Rename
23178 dump_all_value_ranges to dump.
23179 * tree-vrp.c (vrp_prop::finalize): Same.
23180 * value-query.cc (range_query::dump): New.
23181 * value-query.h (range_query::dump): New.
23182 * vr-values.c (vr_values::dump_all_value_ranges): Rename to...
23183 (vr_values::dump): ...this.
23184 * vr-values.h (class vr_values): Rename dump_all_value_ranges to
23185 dump and make virtual.
23187 2021-05-26 Uroš Bizjak <ubizjak@gmail.com>
23189 * config/i386/i386.c (ix86_autovectorize_vector_modes):
23190 Add V4QImode and V16QImode for TARGET_SSE2.
23191 * doc/sourcebuild.texi (Vector-specific attributes):
23192 Add vect64 and vect32 description.
23194 2021-05-26 Bernd Edlinger <bernd.edlinger@hotmail.de>
23196 * gimple-range-gori.cc (range_def_chain::register_dependency):
23197 Resize m_def_chain when needed.
23199 2021-05-26 Christophe Lyon <christophe.lyon@linaro.org>
23201 * config/arm/mve.md (mve_vaddvq_<supf><mode>): Prefix with '@'.
23202 * config/arm/neon.md (reduc_plus_scal_<mode>): Move to ..
23203 * config/arm/vec-common.md: .. here. Add support for MVE.
23205 2021-05-26 Jakub Jelinek <jakub@redhat.com>
23207 * config/epiphany/epiphany.c (epiphany_print_operand_address): Remove
23209 * config/microblaze/microblaze.c (microblaze_legitimize_address,
23211 microblaze_option_override, print_operand): Likewise.
23212 * config/microblaze/microblaze.md (call_internal_plt,
23213 call_value_intern_plt, call_value_intern): Likewise.
23214 * config/arm/aout.h (ASM_OUTPUT_ALIGN): Likewise.
23215 * config/iq2000/iq2000.md (call_internal1, call_value_internal1,
23216 call_value_multiple_internal1): Likewise.
23217 * config/bfin/bfin.c (symbolic_reference_mentioned_p): Likewise.
23219 2021-05-26 Jan-Benedict Glaw <jbglaw@lug-owl.de>
23221 * config/arc/arc.c (arc_address_cost, arc_print_operand_address,
23222 arc_ccfsm_advance, symbolic_reference_mentioned_p,
23223 arc_raw_symbolic_reference_mentioned_p): Remove register
23226 2021-05-26 Jakub Jelinek <jakub@redhat.com>
23229 * omp-low.c: Include omp-offload.h.
23230 (create_omp_child_function): If current_function_decl has
23231 "omp declare target" attribute and is_gimple_omp_offloaded,
23232 remove that attribute from the copy of attribute list and
23233 add "omp target entrypoint" attribute instead.
23234 (lower_omp_target): Mark .omp_data_sizes.* and .omp_data_kinds.*
23235 variables for offloading if in omp_maybe_offloaded_ctx.
23236 * omp-offload.c (pass_omp_target_link::execute): Nullify second
23237 argument to GOMP_target_data_ext in offloaded code.
23239 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23241 * config/csky/csky.c (csky_can_change_mode_class): Delete.
23242 For csky, HF/SF mode use the low bits of VREGS.
23244 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
23246 * gimplify.c (gimplify_decl_expr): Do not clear TREE_READONLY on a
23247 DECL which is a reference for OMP.
23249 2021-05-26 Martin Liska <mliska@suse.cz>
23251 PR gcov-profile/100751
23252 * doc/gcov.texi: Document that __gcov_dump can be called just
23253 once and that __gcov_reset resets run-time counters.
23255 2021-05-26 Martin Liska <mliska@suse.cz>
23257 * doc/install.texi: Port relevant part from install-old.texi
23258 and re-generate list of CPUs and systems.
23260 2021-05-26 Martin Liska <mliska@suse.cz>
23262 * Makefile.in: Remove it.
23263 * doc/include/fdl.texi: Update next/previous chapters.
23264 * doc/install.texi: Likewise.
23265 * doc/install-old.texi: Removed.
23267 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23269 * config/csky/csky.c (ck810_legitimate_index_p): Support
23270 "base + index" with DF mode.
23271 * config/csky/constraints.md ("Y"): New constraint for memory operands
23272 without index register.
23273 * config/csky/csky_insn_fpuv2.md (fpuv3_movdf): Use "Y" instead of "m"
23274 when mov between memory and general registers, and lower their priority.
23275 * config/csky/csky_insn_fpuv3.md (fpuv2_movdf): Likewise.
23277 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23279 * config/csky/csky.c (TARGET_PROMOTE_PROTOTYPES): Delete.
23281 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23283 * config/csky/csky.md (untyped_call): Emit clobber for return
23284 registers to mark them used.
23286 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23288 * config/csky/csky.md (cskyv2_sextend_ldbs): New.
23290 2021-05-26 Andrew Pinski <apinski@marvell.com>
23292 * match.pd (x < 0 ? ~y : y): New patterns.
23294 2021-05-26 Andrew Pinski <apinski@marvell.com>
23296 * match.pd (A?CST1:CST2): Add simplifcations for A?0:+-1, A?+-1:0,
23297 A?POW2:0 and A?0:POW2.
23299 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23301 * gimple-range-gori.cc (class logical_stmt_cache): Delete
23302 (logical_stmt_cache::logical_stmt_cache ): Delete.
23303 (logical_stmt_cache::~logical_stmt_cache): Delete.
23304 (logical_stmt_cache::cache_entry::dump): Delete.
23305 (logical_stmt_cache::get_range): Delete.
23306 (logical_stmt_cache::cached_name ): Delete.
23307 (logical_stmt_cache::same_cached_name): Delete.
23308 (logical_stmt_cache::cacheable_p): Delete.
23309 (logical_stmt_cache::slot_diagnostics ): Delete.
23310 (logical_stmt_cache::dump): Delete.
23311 (gori_compute_cache::gori_compute_cache): Delete.
23312 (gori_compute_cache::~gori_compute_cache): Delete.
23313 (gori_compute_cache::compute_operand_range): Delete.
23314 (gori_compute_cache::cache_stmt): Delete.
23315 * gimple-range-gori.h (gori_compute::compute_operand_range): Remove
23317 (class gori_compute_cache): Delete.
23319 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23321 * gimple-range.cc (fold_using_range::range_of_range_op): Use m_gori
23323 (fold_using_range::range_of_address): Adjust.
23324 (fold_using_range::range_of_phi): Adjust.
23325 * gimple-range.h (class fur_source): Adjust.
23326 (fur_source::fur_source): Adjust.
23328 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23330 * gimple-range-gori.cc (gori_compute::expr_range_at_stmt): Rename
23331 from expr_range_in_bb and adjust.
23332 (gori_compute::compute_name_range_op): Adjust.
23333 (gori_compute::optimize_logical_operands): Adjust.
23334 (gori_compute::compute_logical_operands_in_chain): Adjust.
23335 (gori_compute::compute_operand1_range): Adjust.
23336 (gori_compute::compute_operand2_range): Adjust.
23337 (ori_compute_cache::cache_stmt): Adjust.
23338 * gimple-range-gori.h (gori_compute): Rename prototype.
23340 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23342 * gimple-range.cc (gimple_ranger::range_of_expr): Non-null should be
23343 checked only after range_of_stmt, not range_on_entry.
23344 (gimple_ranger::range_on_entry): Check for non-null in any
23345 predecessor block, if it is not already non-null.
23346 (gimple_ranger::range_on_exit): DOnt check for non-null after
23347 range on entry call.
23348 (gimple_ranger::dump_bb): New. Split from dump.
23349 (gimple_ranger::dump): Adjust.
23350 * gimple-range.h (class gimple_ranger): Adjust.
23352 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23354 * gimple-range-cache.cc (struct range_timestamp): Delete.
23355 (class temporal_cache): Adjust.
23356 (temporal_cache::get_timestamp): Delete.
23357 (temporal_cache::set_dependency): Delete.
23358 (temporal_cache::temporal_value): Adjust.
23359 (temporal_cache::current_p): Take dependencies as params.
23360 (temporal_cache::set_timestamp): Adjust.
23361 (temporal_cache::set_always_current): Adjust.
23362 (ranger_cache::get_non_stale_global_range): Adjust.
23363 (ranger_cache::register_dependency): Delete.
23364 * gimple-range-cache.h (class range_cache): Adjust.
23366 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23368 * gimple-range-gori.cc (range_def_chain::range_def_chain): init
23370 (range_def_chain::~range_def_chain): Dispose of obstack rather than
23371 each individual bitmap.
23372 (range_def_chain::set_import): New.
23373 (range_def_chain::get_imports): New.
23374 (range_def_chain::chain_import_p): New.
23375 (range_def_chain::register_dependency): Rename from build_def_chain
23377 (range_def_chain::def_chain_in_bitmap_p): New.
23378 (range_def_chain::add_def_chain_to_bitmap): New.
23379 (range_def_chain::has_def_chain): Just check first depenedence.
23380 (range_def_chain::get_def_chain): Process imports, use generic
23381 register_dependency routine.
23382 (range_def_chain::dump): New.
23383 (gori_map::gori_map): Allocate import list.
23384 (gori_map::~gori_map): Release imports.
23385 (gori_map::exports): Check for past allocated block size.
23386 (gori_map::imports): New.
23387 (gori_map::def_chain_in_export_p): Delete.
23388 (gori_map::is_import_p): New.
23389 (gori_map::maybe_add_gori): Handle imports.
23390 (gori_map::dump): Adjust output, add imports.
23391 (gori_compute::has_edge_range_p): Remove def_chain_in_export call.
23392 (gori_export_iterator::gori_export_iterator): New.
23393 (gori_export_iterator::next): New.
23394 (gori_export_iterator::get_name): New.
23395 * gimple-range-gori.h (range_def_chain): Add imports and direct
23396 dependecies via struct rdc.
23397 (range_def_chain::depend1): New.
23398 (range_def_chain::depend2): New.
23399 (class gori_map): Adjust.
23400 (FOR_EACH_GORI_IMPORT_NAME): New.
23401 (FOR_EACH_GORI_EXPORT_NAME): New.
23402 (class gori_export_iterator): New.
23404 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23406 * gimple-range-cache.cc (ranger_cache::ranger_cache): Move initial
23407 export cache filling to here.
23408 * gimple-range-gori.cc (gori_compute::gori_compute) : From Here.
23410 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23412 * gimple-range-gori.cc (range_def_chain): Move to gimple-range-gori.h.
23413 (gori_map): Move to gimple-range-gori.h.
23414 (gori_compute::gori_compute): Adjust.
23415 (gori_compute::~gori_compute): Delete.
23416 (gori_compute::compute_operand_range_switch): Adjust.
23417 (gori_compute::compute_operand_range): Adjust.
23418 (gori_compute::compute_logical_operands): Adjust.
23419 (gori_compute::has_edge_range_p ): Adjust.
23420 (gori_compute::set_range_invariant): Delete.
23421 (gori_compute::dump): Adjust.
23422 (gori_compute::outgoing_edge_range_p): Adjust.
23423 * gimple-range-gori.h (class range_def_chain): Relocate here.
23424 (class gori_map): Relocate here.
23425 (class gori_compute): Inherit from gori_map, and adjust.
23427 2021-05-25 Aldy Hernandez <aldyh@redhat.com>
23429 * value-range.cc (range_tests_legacy): Use
23430 build_nonstandard_integer_type instead of int and short.
23432 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
23434 * gimplify.c (gimplify_decl_expr): Clear TREE_READONLY on the DECL
23435 when really creating an initialization statement for it.
23437 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
23439 * tree-inline.c (setup_one_parameter): Fix thinko in new condition.
23441 2021-05-25 Kito Cheng <kito.cheng@sifive.com>
23443 * config/riscv/riscv.h (ASM_SPEC): Pass -mno-relax.
23445 2021-05-25 Martin Liska <mliska@suse.cz>
23447 PR tree-optimization/92860
23449 * optc-save-gen.awk: Remove exceptions.
23451 2021-05-25 Martin Liska <mliska@suse.cz>
23453 * asan.h (sanitize_coverage_p): New function.
23454 * doc/extend.texi: Document it.
23455 * fold-const.c (fold_range_test): Use sanitize_flags_p
23456 instead of flag_sanitize_coverage.
23457 (fold_truth_andor): Likewise.
23458 * sancov.c: Likewise.
23459 * tree-ssa-ifcombine.c (ifcombine_ifandif): Likewise.
23460 * ipa-inline.c (sanitize_attrs_match_for_inline_p): Handle
23461 -fsanitize-coverage when inlining.
23463 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
23465 * config/csky/csky-modes.def : Fix copyright.
23467 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
23469 * config/csky/csky-modes.def : Amend copyright.
23470 * config/csky/csky_insn_fpuv2.md : Likewise.
23471 * config/csky/csky_insn_fpuv3.md : Likewise.
23473 2021-05-25 Richard Biener <rguenther@suse.de>
23475 PR middle-end/100727
23476 * calls.c (initialize_argument_information): Explicitely test
23477 for WITH_SIZE_EXPR.
23478 * gimple-expr.c (mark_addressable): Skip outer WITH_SIZE_EXPR.
23480 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23482 * config/csky/csky.h (FRAME_POINTER_REGNUM): Use
23483 HARD_FRAME_POINTER_REGNUM and FRAME_POINTER_REGNUM instead of
23484 the signle definition. The signle definition may not work well
23485 at simplify_subreg_regno().
23486 (HARD_FRAME_POINTER_REGNUM): New.
23487 (ELIMINABLE_REGS): Add for HARD_FRAME_POINTER_REGNUM.
23488 * config/csky/csky.c (get_csky_live_regs, csky_can_eliminate,
23489 csky_initial_elimination_offset, csky_expand_prologue,
23490 csky_expand_epilogue): Add for HARD_FRAME_POINTER_REGNUM.
23492 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23494 * config/csky/csky.c (csky_option_override):
23495 Init csky_arch_isa_features[] in advance, so TARGET_DSP
23496 and TARGET_DIV can be set well.
23498 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23500 * config/csky/constraints.md ("l", "h"): Delete.
23501 * config/csky/csky.h (reg_class, REG_CLASS_NAMES,
23502 REG_CLASS_CONTENTS): Delete LO_REGS and HI_REGS.
23503 * config/csky/csky.c (regno_reg_classm,
23504 csky_secondary_reload, csky_register_move_cost):
23505 Use HILO_REGS instead of LO_REGS and HI_REGS.
23507 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23509 * config/csky/constraints.md ("W"): New constriant for mem operand
23510 with base reg, index register.
23511 ("Q"): Renamed and modified "csky_valid_fpuv2_mem_operand" to
23512 "csky_valid_mem_constraint_operand" to deal with both "Q" and "W"
23514 ("Dv"): New constraint for const double value that can be used at
23516 * config/csky/csky-modes.def (HFmode): New mode.
23517 * config/csky/csky-protos.h (csky_valid_fpuv2_mem_operand): Rename
23518 to "csky_valid_mem_constraint_operand" and support new constraint
23520 (csky_get_movedouble_length): New.
23521 (fpuv3_output_move): New.
23522 (fpuv3_const_double): New.
23523 * config/csky/csky.c (csky_option_override): New arch CK860 with fpv3.
23524 (decompose_csky_address): Refine.
23525 (csky_print_operand): New "CONST_DOUBLE" operand.
23526 (csky_output_move): Support fpv3 instructions.
23527 (csky_get_movedouble_length): New.
23528 (fpuv3_output_move): New.
23529 (fpuv3_const_double): New.
23530 (csky_emit_compare): Cover float comparsion.
23531 (csky_emit_compare_float): Refine.
23532 (csky_vaild_fpuv2_mem_operand): Rename to
23533 "csky_valid_mem_constraint_operand" and support new constraint "W".
23534 (ck860_rtx_costs): New.
23535 (csky_rtx_costs): Add the cost calculation of CK860.
23536 (regno_reg_class): New vregs for fpuv3.
23537 (csky_dbx_regno): Likewise.
23538 (csky_cpu_cpp_builtins): New builtin macro for fpuv3.
23539 (csky_conditional_register_usage): Suporrot fpuv3.
23540 (csky_dwarf_register_span): Suporrot fpuv3.
23541 (csky_init_builtins, csky_mangle_type): Support "__fp16" type.
23542 (ck810_legitimate_index_p): Support fp16.
23543 * config/csky/csky.h (TARGET_TLS): ADD CK860.
23544 (CSKY_VREG_P, CSKY_VREG_LO_P, CSKY_VREG_HI_P): Support fpuv3.
23545 (TARGET_SINGLE_FPU): Support fpuv3.
23546 (TARGET_SUPPORT_FPV3): New.
23547 (FIRST_PSEUDO_REGISTER): Change to 202 to hold the new fpuv3 registers.
23548 (FIXED_REGISTERS, CALL_REALLY_USED_REGISTERS, REGISTER_NAMES,
23549 REG_CLASS_CONTENTS): Support fpuv3.
23550 * config/csky/csky.md (movsf): Move to cksy_insn_fpu.md and refine.
23551 (csky_movsf_fpv2): Likewise.
23552 (ck801_movsf): Likewise.
23553 (csky_movsf): Likewise.
23555 (csky_movdf_fpv2): Likewise.
23556 (ck801_movdf): Likewise.
23557 (csky_movdf): Likewise.
23558 (movsicc): Refine. Use "comparison_operatior" instead of
23559 "ordered_comparison_operatior".
23560 (addsicc): Likewise.
23561 (CSKY_FIRST_VFP3_REGNUM, CSKY_LAST_VFP3_REGNUM): New constant.
23562 (call_value_internal_vh): New.
23563 * config/csky/csky_cores.def (CK860): New arch and cpu.
23568 * config/csky/csky_insn_fpu.md: Refactor. Separate all float patterns
23569 into emit-patterns and match-patterns, remain the emit-patterns here,
23570 and move the match-patterns to csky_insn_fpuv2.md or
23571 csky_insn_fpuv3.md.
23572 * config/csky/csky_insn_fpuv2.md: New file for fpuv2 instructions.
23573 * config/csky/csky_insn_fpuv3.md: New file and new patterns for fpuv3
23575 * config/csky/csky_isa.def (fcr): New.
23580 (CK860): New definition for ck860.
23581 * config/csky/csky_tables.opt (ck860): New processors ck860,
23582 ck860f. And new arch ck860.
23587 * config/csky/predicates.md (csky_float_comparsion_operator): Delete
23588 "geu", "gtu", "leu", "ltu", which will never appear at float comparison.
23589 * config/csky/t-csky-elf: Support 860.
23590 * config/csky/t-csky-linux: Likewise.
23591 * doc/md.texi: Add "Q" and "W" constraints for C-SKY.
23593 2021-05-24 Aaron Sawdey <acsawdey@linux.ibm.com>
23595 * config/rs6000/genfusion.pl (gen_logical_addsubf): Refactor to
23596 add generation of logical-add and add-logical fusion pairs.
23597 * config/rs6000/rs6000-cpus.def: Add new fusion to ISA 3.1 mask
23599 * config/rs6000/rs6000.c (rs6000_option_override_internal): Turn on
23600 logical-add and add-logical fusion by default.
23601 * config/rs6000/rs6000.opt: Add -mpower10-fusion-logical-add and
23602 -mpower10-fusion-add-logical options.
23603 * config/rs6000/fusion.md: Regenerate file.
23605 2021-05-24 Aldy Hernandez <aldyh@redhat.com>
23607 * value-range.cc (irange::legacy_equal_p): Check type when
23608 comparing VR_VARYING types.
23609 (range_tests_legacy): Test comparing VARYING ranges of different
23612 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
23614 * config/aarch64/aarch64.c (neoversen1_tunings):
23615 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
23617 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
23619 * config/aarch64/aarch64.c (aarch64_classify_symbol): Use GOT for
23620 extern weak symbols. Limit symbol offsets for non-GOT symbols with
23623 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
23625 * config/arm/neon.md (vec_load_lanesxi<mode>)
23626 (vec_store_lanexoi<mode>): Move ...
23627 * config/arm/vec-common.md: here.
23629 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
23631 * config/arm/neon.md (vec_load_lanesoi<mode>)
23632 (vec_store_lanesoi<mode>): Move ...
23633 * config/arm/vec-common.md: here.
23635 2021-05-24 liuhongt <hongtao.liu@intel.com>
23638 * config/i386/i386.c (ix86_gimple_fold_builtin): Replacing
23639 stmt with GIMPLE_NOP when lhs doesn't exist.
23641 2021-05-23 Uroš Bizjak <ubizjak@gmail.com>
23644 * config/i386/mmx.md (*push<VI_32:mode>2_rex64):
23645 New instruction pattern.
23646 (*push<VI_32:mode>2): Ditto.
23647 (push splitter for SSE registers): New splitter.
23649 2021-05-23 Andrew Pinski <apinski@marvell.com>
23651 * match.pd ((A & C) != 0 ? D : 0): Limit to non pointer types.
23653 2021-05-22 Aaron Sawdey <acsawdey@linux.ibm.com>
23655 * config/rs6000/genfusion.pl (gen_addadd): Fix incorrect attr types.
23656 * config/rs6000/fusion.md: Regenerate file.
23658 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
23660 * config/rs6000/genfusion.pl (gen_addadd): New function.
23661 * config/rs6000/fusion.md: Regenerate file.
23662 * config/rs6000/rs6000-cpus.def: Add
23663 OPTION_MASK_P10_FUSION_2ADD to masks.
23664 * config/rs6000/rs6000.c (rs6000_option_override_internal):
23665 Handle default value of OPTION_MASK_P10_FUSION_2ADD.
23666 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2add.
23668 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23670 PR middle-end/99928
23671 * tree.h (OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET): Define.
23672 * gimplify.c (enum gimplify_omp_var_data): Fix up
23673 GOVD_MAP_HAS_ATTACHMENTS value, add GOVD_FIRSTPRIVATE_IMPLICIT.
23674 (omp_lastprivate_for_combined_outer_constructs): If combined target
23675 has GOVD_FIRSTPRIVATE_IMPLICIT set for the decl, change it to
23676 GOVD_MAP | GOVD_SEEN.
23677 (gimplify_scan_omp_clauses): Set GOVD_FIRSTPRIVATE_IMPLICIT for
23678 firstprivate clauses with OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT.
23679 (gimplify_adjust_omp_clauses): For firstprivate clauses with
23680 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT either clear that bit and
23681 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET too, or remove it and
23682 let it be replaced by implicit map clause.
23684 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23686 PR middle-end/99928
23687 * gimplify.c (omp_lastprivate_for_combined_outer_constructs): New
23689 (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LASTPRIVATE>: Use it.
23690 (gimplify_omp_for): Likewise.
23692 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23694 PR middle-end/90115
23695 * omp-low.c (oacc_privatization_candidate_p): Reject 'static',
23696 'external' in blocks.
23698 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23700 PR middle-end/90115
23701 * flag-types.h (enum openacc_privatization): New.
23702 * params.opt (-param=openacc-privatization): New.
23703 * doc/invoke.texi (openacc-privatization): Document it.
23704 * omp-general.h (get_openacc_privatization_dump_flags): New
23706 * omp-low.c (oacc_privatization_candidate_p): Add diagnostics.
23707 * omp-offload.c (execute_oacc_device_lower)
23708 <IFN_UNIQUE_OACC_PRIVATE>: Re-work diagnostics.
23709 * target.def (goacc.adjust_private_decl): Add 'location_t'
23711 * doc/tm.texi: Regenerate.
23712 * config/gcn/gcn-protos.h (gcn_goacc_adjust_private_decl): Adjust.
23713 * config/gcn/gcn-tree.c (gcn_goacc_adjust_private_decl): Likewise.
23714 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl):
23715 Likewise. Preserve it for...
23716 (nvptx_goacc_expand_var_decl): ... use here.
23718 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23720 * doc/sourcebuild.texi (Other attributes): Document '__OPTIMIZE__'
23723 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23725 PR middle-end/90115
23726 * omp-low.c (oacc_privatization_candidate_p): New function.
23727 (oacc_privatization_scan_clause_chain)
23728 (oacc_privatization_scan_decl_chain): Use it. Also
23729 'gcc_checking_assert' that we're not seeing duplicates.
23731 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23733 PR middle-end/90115
23734 * omp-offload.c (execute_oacc_device_lower): Skip processing if no
23737 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23739 PR middle-end/90115
23740 * omp-offload.c (execute_oacc_device_lower): Explain.
23742 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23744 PR middle-end/90115
23745 * omp-offload.c (execute_oacc_device_lower)
23746 <IFN_UNIQUE_OACC_PRIVATE>: Diagnose and handle for 'level == -1'
23748 * internal-fn.c (expand_UNIQUE): Don't expect
23749 'IFN_UNIQUE_OACC_PRIVATE'.
23751 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23753 PR middle-end/90115
23754 * omp-low.c (lower_omp_for): Don't evaluate OpenMP 'for' clauses.
23756 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23758 PR middle-end/90115
23759 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl)
23760 (nvptx_goacc_expand_var_decl): Tighten.
23762 2021-05-21 Julian Brown <julian@codesourcery.com>
23763 Chung-Lin Tang <cltang@codesourcery.com>
23764 Thomas Schwinge <thomas@codesourcery.com>
23766 PR middle-end/90115
23767 * doc/tm.texi.in (TARGET_GOACC_EXPAND_VAR_DECL)
23768 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Add documentation hooks.
23769 * doc/tm.texi: Regenerate.
23770 * expr.c (expand_expr_real_1): Expand decls using the
23771 expand_var_decl OpenACC hook if defined.
23772 * internal-fn.c (expand_UNIQUE): Handle IFN_UNIQUE_OACC_PRIVATE.
23773 * internal-fn.h (IFN_UNIQUE_CODES): Add OACC_PRIVATE.
23774 * omp-low.c (omp_context): Add oacc_privatization_candidates
23776 (lower_oacc_reductions): Add PRIVATE_MARKER parameter. Insert
23778 (lower_oacc_head_tail): Add PRIVATE_MARKER parameter. Modify
23779 private marker's gimple call arguments, and pass it to
23780 lower_oacc_reductions.
23781 (oacc_privatization_scan_clause_chain)
23782 (oacc_privatization_scan_decl_chain, lower_oacc_private_marker):
23784 (lower_omp_for, lower_omp_target, lower_omp_1): Use these.
23785 * omp-offload.c (convert.h): Include.
23786 (oacc_loop_xform_head_tail): Treat private-variable markers like
23787 fork/join when transforming head/tail sequences.
23788 (struct var_decl_rewrite_info): Add struct.
23789 (oacc_rewrite_var_decl, is_sync_builtin_call): New functions.
23790 (execute_oacc_device_lower): Support rewriting gang-private
23791 variables using target hook, and fix up addr_expr and var_decl
23793 * target.def (adjust_private_decl, expand_var_decl): New hooks.
23794 * config/gcn/gcn-protos.h (gcn_goacc_adjust_gangprivate_decl):
23796 (gcn_goacc_adjust_private_decl): ...this.
23797 * config/gcn/gcn-tree.c (gcn_goacc_adjust_gangprivate_decl):
23799 (gcn_goacc_adjust_private_decl): ...this. Add LEVEL parameter.
23800 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_GANGPRIVATE_DECL): Rename
23801 definition using gcn_goacc_adjust_gangprivate_decl...
23802 (TARGET_GOACC_ADJUST_PRIVATE_DECL): ...to this, using
23803 gcn_goacc_adjust_private_decl.
23804 * config/nvptx/nvptx.c (tree-pretty-print.h): Include.
23805 (gang_private_shared_size): New global variable.
23806 (gang_private_shared_align): Likewise.
23807 (gang_private_shared_sym): Likewise.
23808 (gang_private_shared_hmap): Likewise.
23809 (nvptx_option_override): Initialize these.
23810 (nvptx_file_end): Output gang_private_shared_sym.
23811 (nvptx_goacc_adjust_private_decl, nvptx_goacc_expand_var_decl):
23813 (nvptx_set_current_function): Clear gang_private_shared_hmap.
23814 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Define hook.
23815 (TARGET_GOACC_EXPAND_VAR_DECL): Likewise.
23817 2021-05-21 H.J. Lu <hjl.tools@gmail.com>
23819 * config/i386/i386-modes.def (MAX_BITSIZE_MODE_ANY_INT): Removed.
23821 2021-05-21 Richard Biener <rguenther@suse.de>
23822 H.J. Lu <hjl.tools@gmail.com>
23824 PR middle-end/90773
23825 * expr.c (expand_constructor): Elide expand_constructor if
23826 move by pieces is preferred.
23828 2021-05-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23830 * config/aarch64/aarch64-builtins.c (aarch64_call_properties):
23831 Take a flag and mode value as arguments.
23832 (aarch64_modifies_global_state_p): Likewise.
23833 (aarch64_reads_global_state_p): Likewise.
23834 (aarch64_could_trap_p): Likewise.
23835 (aarch64_get_attributes): Likewise.
23836 (aarch64_init_simd_builtins): Adjust callsite of above.
23837 (aarch64_init_fcmla_laneq_builtins): Use aarch64_get_attributes to get
23838 function attributes to apply to builtins.
23839 (aarch64_init_crc32_builtins): Likewise.
23840 (aarch64_init_builtin_rsqrt): Likewise.
23842 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
23844 * config/rs6000/rs6000.md (define_attr "type"): Add types for fusion.
23845 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10): Use new fusion types.
23846 (gen_2logical): Use new fusion types.
23847 * config/rs6000/fusion.md: Regenerate.
23849 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
23852 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
23853 Handle V4QI and V2HI modes.
23854 (ix86_expand_sse_movcc): Ditto.
23855 * config/i386/mmx.md (*<sat_plusminus:insn><VI_32:mode>3):
23856 New instruction pattern.
23857 (*eq<VI_32:mode>3): Ditto.
23858 (*gt<VI_32:mode>3): Ditto.
23859 (*xop_pcmov_<VI_32:mode>): Ditto.
23860 (mmx_pblendvb32): Ditto.
23861 (mmx_pblendvb64): Rename from mmx_pblendvb.
23862 (vec_cmp<VI_32:mode><VI_32:mode>): New expander.
23863 (vec_cmpu<VI_32:mode><VI_32:mode>): Ditto.
23864 (vcond<VI_32:mode><VI_32:mode>): Ditto.
23865 (vcondu<VI_32:mode><VI_32:mode>): Ditto.
23866 (vcond_mask_<VI_32:mode><VI_32:mode>): Ditto.
23868 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23870 PR tree-optimization/94589
23871 * tree-ssa-phiopt.c (spaceship_replacement): For integral rhs1 and
23872 rhs2, treat x <= 4 equivalently to x < 5 etc. In cmp1 and cmp2 (if
23873 not the same as cmp3) treat <= the same as < and >= the same as >.
23874 Don't require that cond2_phi_edge is true edge, instead take
23875 false/true edges into account based on cmp1/cmp2 comparison kinds.
23877 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
23880 * config/i386/mmx.md (SMAXMIN_MMXMODEI): New mode iterator.
23881 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Macroize expander
23882 from <smaxmin:code>v4hi3> and <smaxmin:code><MMXMODE14:mode>3
23883 using SMAXMIN_MMXMODEI mode iterator.
23884 (*<smaxmin:code>v4qi3): New insn pattern.
23885 (*<smaxmin:code>v2hi3): Ditto.
23886 (SMAXMIN_VI_32): New mode iterator.
23887 (<smaxmin:code><SMAXMIN_VI_32>mode3): New expander.
23888 (UMAXMIN_MMXMODEI): New mode iterator.
23889 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Macroize expander
23890 from <umaxmin:code>v8qi3> and <umaxmin:code><MMXMODE24:mode>3
23891 using UMAXMIN_MMXMODEI mode iterator.
23892 (*<umaxmin:code>v4qi3): New insn pattern.
23893 (*<umaxmin:code>v2hi3): Ditto.
23894 (UMAXMIN_VI_32): New mode iterator.
23895 (<umaxmin:code><UMAXMIN_VI_32>mode3): New expander.
23896 (abs<VI_32:mode>2): New insn pattern.
23897 (ssse3_abs<MMXMODEI:mode>2, abs<MMXMODEI:mode>2): Move from ...
23898 * config/i386/sse.md: ... here.
23900 2021-05-20 Clement Chigot <clement.chigot@atos.net>
23901 David Edelsohn <dje.gcc@gmail.com>
23903 * collect2.c (scan_prog_file): Issue non-fatal warning for
23906 2021-05-20 Jonathan Wakely <jwakely@redhat.com>
23908 * doc/invoke.texi (-Wno-c++11-extensions)
23909 (-Wno-c++14-extensions, -Wno-c++17-extensions)
23910 (-Wno-c++20-extensions, -Wno-c++23-extensions): Document
23913 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
23915 * config/c6x/c6x.c (c6x_output_file_unwind): Use dwarf_debuginfo_p.
23916 * config/darwin.c (darwin_override_options): Likewise.
23917 * config/i386/cygming.h (DBX_REGISTER_NUMBER): Likewise.
23918 * config/i386/darwin.h (DBX_REGISTER_NUMBER): Likewise.
23919 (DWARF2_FRAME_REG_OUT): Likewise.
23920 * config/mips/mips.c (mips_output_filename): Likewise.
23921 * config/rs6000/rs6000.c (rs6000_xcoff_declare_function_name):
23923 (rs6000_dbx_register_number): Likewise.
23924 * dbxout.c: Include flags.h.
23925 * dwarf2cfi.c (cfi_label_required_p): Likewise.
23926 (dwarf2out_do_frame): Likewise.
23927 * except.c: Include flags.h.
23928 * final.c (dwarf2_debug_info_emitted_p): Likewise.
23929 (final_scan_insn_1): Likewise.
23930 * flags.h (dwarf_debuginfo_p): New function declaration.
23931 * opts.c (dwarf_debuginfo_p): New function definition.
23932 * targhooks.c (default_debug_unwind_info): Use dwarf_debuginfo_p.
23933 * toplev.c (process_options): Likewise.
23935 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
23937 * common.opt: Change type to support bitmasks.
23938 * flag-types.h (enum debug_info_type): Rename enumerator constants.
23939 (NO_DEBUG): New bitmask.
23940 (DBX_DEBUG): Likewise.
23941 (DWARF2_DEBUG): Likewise.
23942 (XCOFF_DEBUG): Likewise.
23943 (VMS_DEBUG): Likewise.
23944 (VMS_AND_DWARF2_DEBUG): Likewise.
23945 * flags.h (debug_set_to_format): New function declaration.
23946 (debug_set_count): Likewise.
23947 (debug_set_names): Likewise.
23948 * opts.c (debug_type_masks): Array of bitmasks for debug formats.
23949 (debug_set_to_format): New function definition.
23950 (debug_set_count): Likewise.
23951 (debug_set_names): Likewise.
23952 (set_debug_level): Update access to debug_type_names.
23953 * toplev.c: Likewise.
23955 2021-05-20 Martin Sebor <msebor@redhat.com>
23957 PR middle-end/100684
23958 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Handle C++ lambda.
23960 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
23963 * config/i386/i386.md (isa): Remove x64_bmi.
23964 (enabled): Remove x64_bmi.
23965 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
23966 Remove general register alternative.
23967 (*andnot<VI_32:mode>3): Ditto.
23968 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
23969 (*<any_logic:code><VI_32:mode>3): Ditto.
23971 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
23973 * config/arm/arm.c: Include head files tree-vectorizer.h and
23976 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
23979 * config/i386/mmx.md (Yv_Yw): Revert adding V4QI and V2HI modes.
23980 (*<plusminus:insn><VI_32:mode>3): Use Yw instad of <Yv_Yw> constrint.
23981 (<s>mulv4hi3_highpart): New expander.
23982 (*<s>mulv2hi3_highpart): New insn pattern.
23983 (<s>mulv2hi3_higpart): New expander.
23984 (*<any_shift:insn>v2hi3): New insn pattern.
23985 (<any_shift:insn>v2hi3): New expander.
23986 * config/i386/sse.md (smulhrsv2hi3): New expander.
23987 (*smulhrsv2hi3): New insn pattern.
23989 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
23991 * doc/invoke.texi (vect-inner-loop-cost-factor): Document new
23993 * params.opt (vect-inner-loop-cost-factor): New.
23994 * targhooks.c (default_add_stmt_cost): Replace hardcoded factor
23995 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR, include head file
23996 tree-vectorizer.h and its required ones.
23997 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Replace
23998 hardcoded factor 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR.
23999 * config/arm/arm.c (arm_add_stmt_cost): Likewise.
24000 * config/i386/i386.c (ix86_add_stmt_cost): Likewise.
24001 * config/rs6000/rs6000.c (rs6000_add_stmt_cost): Likewise.
24002 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
24004 (_loop_vec_info::_loop_vec_info): Init inner_loop_cost_factor.
24005 * tree-vectorizer.h (_loop_vec_info): Add inner_loop_cost_factor.
24006 (LOOP_VINFO_INNER_LOOP_COST_FACTOR): New macro.
24008 2021-05-20 Christophe Lyon <christophe.lyon@linaro.org>
24009 Torbjörn Svensson <torbjorn.svensson@st.com>
24012 * doc/cpp.texi (Common Predefined Macros): Document __FILE_NAME__.
24014 2021-05-20 Jakub Jelinek <jakub@redhat.com>
24016 PR middle-end/99928
24017 * gimplify.c (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LINEAR>: For
24018 explicit linear clause when combined with target, make it map(tofrom:)
24019 instead of no clause or firstprivate.
24021 2021-05-20 Jakub Jelinek <jakub@redhat.com>
24023 PR tree-optimization/94589
24024 * match.pd ((X & Y) == X -> (X & ~Y) == 0): Simplify even in presence
24025 of integral conversions.
24027 2021-05-19 Andrew MacLeod <amacleod@redhat.com>
24029 * gimple-range.cc (fur_source::get_operand): New.
24030 (gimple_range_fold): Delete.
24031 (fold_using_range::fold_stmt): Move from gimple_ranger::calc_stmt.
24032 (fold_using_range::range_of_range_op): Move from gimple_ranger.
24033 (fold_using_range::range_of_address): Ditto.
24034 (fold_using_range::range_of_phi): Ditto.
24035 (fold_using_range::range_of_call): Ditto.
24036 (fold_using_range::range_of_builtin_ubsan_call): Move from
24037 range_of_builtin_ubsan_call.
24038 (fold_using_range::range_of_builtin_call): Move from
24039 range_of_builtin_call.
24040 (gimple_ranger::range_of_builtin_call): Delete.
24041 (fold_using_range::range_of_cond_expr): Move from gimple_ranger.
24042 (gimple_ranger::fold_range_internal): New.
24043 (gimple_ranger::range_of_stmt): Use new fold_using_range API.
24044 (fold_using_range::range_of_ssa_name_with_loop_info): Move from
24045 gimple_ranger. Improve ranges of SSA_NAMES when possible.
24046 * gimple-range.h (gimple_ranger): Remove various range_of routines.
24047 (class fur_source): New.
24048 (class fold_using_range): New.
24049 (fur_source::fur_source): New.
24051 * vr-values.c (vr_values::extract_range_basic): Use fold_using_range
24052 instead of range_of_builtin_call.
24054 2021-05-19 Jonathan Wakely <jwakely@redhat.com>
24056 * doc/cpp.texi (Common Predefined Macros): Update documentation
24057 for the __GXX_EXPERIMENTAL_CXX0X__ macro.
24059 2021-05-19 Alex Coplan <alex.coplan@arm.com>
24062 * config/arm/arm.md (nonsecure_call_internal): Always ensure
24063 callee's address is in a register.
24065 2021-05-19 Geng Qi <gengqi@linux.alibaba.com>
24067 * common/config/riscv/riscv-common.c
24068 (riscv_subset_list::parsing_subset_version): Properly parse the letter
24070 (riscv_subset_list::parse_std_ext,
24071 riscv_subset_list::parse_multiletter_ext): To handle errors generated
24072 in riscv_subset_list::parsing_subset_version.
24074 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24076 * config/aarch64/aarch64-simd.md: Use "neon_move_narrow_q"
24077 type attribute in patterns generating XTN(2).
24079 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24081 * config/aarch64/aarch64-simd.md (aarch64_simd_vec_pack_trunc_<mode>):
24082 Remove as duplicate of...
24083 (aarch64_xtn<mode>): This.
24084 (aarch64_xtn2<mode>_le): Move position in file.
24085 (aarch64_xtn2<mode>_be): Move position in file.
24086 (aarch64_xtn2<mode>): Move position in file.
24087 (vec_pack_trunc_<mode>): Define as an expander.
24089 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24091 * config/aarch64/aarch64-simd-builtins.def: Split builtin
24092 generation for aarch64_<sur>q<r>shr<u>n_n<mode> pattern into
24093 separate scalar and vector generators.
24094 * config/aarch64/aarch64-simd.md
24095 (aarch64_<sur>q<r>shr<u>n_n<mode>): Define as an expander and
24097 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_le): This and...
24098 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_be): This.
24099 * config/aarch64/iterators.md: Define SD_HSDI iterator.
24101 2021-05-19 Jonathn Wright <jonathan.wright@arm.com>
24103 * config/aarch64/aarch64-simd.md: Use UNSPEC_SQXTUN instead
24105 * config/aarch64/iterators.md: Remove UNSPEC_SQXTUN2.
24107 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24109 * config/aarch64/aarch64-simd.md (aarch64_<sur>q<r>shr<u>n2_n<mode>):
24110 Implement as an expand emitting a big/little endian
24111 instruction pattern.
24112 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_le): Define.
24113 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_be): Define.
24115 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
24117 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn2<mode>):
24118 Implement as an expand emitting a big/little endian
24119 instruction pattern.
24120 (aarch64_<sur><addsub>hn2<mode>_insn_le): Define.
24121 (aarch64_<sur><addsub>hn2<mode>_insn_be): Define.
24122 * config/aarch64/iterators.md: Remove UNSPEC_[R]ADDHN2 and
24123 UNSPEC_[R]SUBHN2 unspecs and ADDSUBHN2 iterator.
24125 2021-05-19 Richard Biener <rguenther@suse.de>
24127 PR middle-end/100672
24128 * fold-const.c (fold_negate_expr_1): Use element_precision.
24129 (negate_expr_p): Likewise.
24131 2021-05-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
24133 * config/aarch64/iterators.md (SVE_PRED_LOAD): New iterator.
24134 (pred_load): New int attribute.
24135 * config/aarch64/aarch64-sve.md
24136 (aarch64_load_<ANY_EXTEND:optab><SVE_HSDI:mode><SVE_PARTIAL_I:mode>): Use
24137 SVE_PRED_LOAD enum iterator and corresponding pred_load attribute.
24138 * config/aarch64/aarch64-sve-builtins-base.cc (expand): Update call to
24139 code_for_aarch64_load.
24141 2021-05-19 Richard Biener <rguenther@suse.de>
24143 * cfgexpand.c (discover_nonconstant_array_refs_r): Make
24144 sure TARGET_MEM_REF bases are expanded as memory.
24145 * tree-ssa-operands.c (operands_scanner::get_tmr_operands):
24146 Do not mark TARGET_MEM_REF bases addressable.
24147 * tree-ssa.c (non_rewritable_mem_ref_base): Handle
24148 TARGET_MEM_REF bases as never rewritable.
24149 * gimple-walk.c (walk_stmt_load_store_addr_ops): Do not
24150 walk TARGET_MEM_REF bases as address-takens.
24151 * tree-ssa-dce.c (ref_may_be_aliased): Handle TARGET_MEM_REF.
24153 2021-05-19 Richard Biener <rguenther@suse.de>
24155 * builtins.c (get_object_alignment_1): Strip outer
24157 * tree-dfa.c (get_ref_base_and_extent): Handle outer
24158 WITH_SIZE_EXPR for size processing and process the
24160 * tree-ssa-alias.c (ao_ref_base_alias_set): Strip
24161 outer WITH_SIZE_EXPR.
24162 (ao_ref_base_alias_ptr_type): Likewise.
24163 (refs_may_alias_p_2): Allow WITH_SIZE_EXPR in ref->ref
24164 and handle that accordingly, stripping it for the
24165 core alias workers.
24166 * tree.c (get_base_address): Handle WITH_SIZE_EXPR by
24167 looking through it instead of returning NULL.
24169 2021-05-19 Jakub Jelinek <jakub@redhat.com>
24171 PR middle-end/100576
24172 * builtins.c (check_read_access): Convert bound to size_type_node if
24175 2021-05-19 Richard Biener <rguenther@suse.de>
24177 * tree-cfg.c (verify_types_in_gimple_min_lval): Inline...
24178 (verify_types_in_gimple_reference): ... here. Sanitize.
24179 (verify_gimple_call): Verify references in LHS and arguments.
24180 (verify_gimple_assign_single): Reject WITH_SIZE_EXPR.
24182 2021-05-19 Uroš Bizjak <ubizjak@gmail.com>
24184 * config/i386/i386.h (VALID_INT_MODE_P):
24185 Add V8QI, V4HI and V2SI modes for TARGET_64BIT.
24186 * config/i386/i386.md (isa): Add x64_bmi.
24187 (enabled): Handle x64_bmi.
24188 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
24189 Add alternative using 64bit general registers.
24190 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
24192 2021-05-19 Jakub Jelinek <jakub@redhat.com>
24194 PR middle-end/99928
24195 * tree.h (OMP_MASTER_COMBINED): Define.
24196 * gimplify.c (gimplify_scan_omp_clauses): Rewrite lastprivate
24197 handling for outer combined/composite constructs to a loop.
24198 Handle lastprivate on combined target.
24199 (gimplify_expr): Formatting fix.
24201 2021-05-19 Xionghu Luo <luoxhu@linux.ibm.com>
24203 * passes.def: Add sink_code pass before store_merging.
24204 * tree-ssa-sink.c (pass_sink_code:clone): New.
24206 2021-05-18 Bill Schmidt <wschmidt@linux.ibm.com>
24208 * config/rs6000/freebsd64.h (ADJUST_FIELD_ALIGN): Remove call to
24209 rs6000_special_adjust_field_align_p.
24210 * config/rs6000/linux64.h (ADJUST_FIELD_ALIGN): Likewise.
24211 * config/rs6000/rs6000-call.c (rs6000_function_arg_boundary):
24212 Remove ABI warning.
24213 (rs6000_function_arg): Likewise.
24214 * config/rs6000/rs6000-protos.h
24215 (rs6000_special_adjust_field_align_p): Remove prototype.
24216 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align_p):
24218 * config/rs6000/sysv4.h (ADJUST_FIELD_ALIGN): Remove call to
24219 rs6000_special_adjust_field_align_p.
24221 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24224 * config/i386/i386.h (VALID_SSE2_REG_MODE):
24225 Add V4QI and V2HI modes.
24226 (VALID_INT_MODE_P): Ditto.
24227 * config/i386/mmx.md (VI_32): New mode iterator.
24228 (mmxvecsize): Handle V4QI and V2HI.
24230 (mov<VI_32:mode>): New expander.
24231 (*mov<mode>_internal): New insn pattern.
24232 (movmisalign<VI_32:mode>): New expander.
24233 (neg<VI_32:mode>): New expander.
24234 (<plusminus:insn><VI_32:mode>3): New expander.
24235 (*<plusminus:insn><VI_32:mode>3): New insn pattern.
24236 (mulv2hi3): New expander.
24237 (*mulv2hi3): New insn pattern.
24238 (one_cmpl<VI_32:mode>2): New expander.
24239 (*andnot<VI_32:mode>3): New insn pattern.
24240 (<any_logic:code><VI_32:mode>3): New expander.
24241 (*<any_logic:code><VI_32:mode>3): New insn pattern.
24243 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24245 * config/i386/sse.md (<any_extend:insn>v4qiv4di2):
24246 Fix a mode mismatch with operand 1.
24248 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24251 * config/i386/i386-expand.c (split_double_mode): Return
24252 temporary register when simplify_gen_subreg fails with
24253 the high half od the paradoxical subreg.
24255 2021-05-18 Richard Biener <rguenther@suse.de>
24257 * cfgexpand.c (expand_one_var): Pass in forced_stack_var
24258 and honor it when expanding.
24259 (expand_used_vars_for_block): Pass through forced_stack_var.
24260 (expand_used_vars): Likewise.
24261 (discover_nonconstant_array_refs_r): Set bits in
24262 forced_stack_vars instead of marking vars TREE_ADDRESSABLE.
24263 (avoid_type_punning_on_regs): Likewise.
24264 (discover_nonconstant_array_refs): Likewise.
24265 (pass_expand::execute): Create and pass down forced_stack_var
24266 bitmap. For parameters and returns temporarily set
24267 TREE_ADDRESSABLE when expand_function_start.
24269 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
24271 * doc/sourcebuild.texi: Document 'dg-note'.
24273 2021-05-18 Tobias Burnus <tobias@codesourcery.com>
24276 * configure: Regenerate.
24277 * configure.ac (BUILD_CFLAG, BUILD_CXXFLAGS): Add $(CFLAGS-$@).
24279 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
24281 * gimple.h (is_gimple_omp_oacc): Tighten.
24282 * omp-low.c (check_omp_nesting_restrictions): Adjust.
24284 2021-05-18 Richard Biener <rguenther@suse.de>
24286 * tree-ssa-operands.c (mark_address_taken): Simplify.
24288 2021-05-18 Martin Liska <mliska@suse.cz>
24290 * config/gcn/mkoffload.c (STR): Redefine.
24291 * config/i386/intelmic-mkoffload.c (STR): Likewise.
24292 * config/nvptx/mkoffload.c (STR): Likewise.
24294 2021-05-18 Martin Liska <mliska@suse.cz>
24296 * common/config/aarch64/aarch64-common.c (aarch64_parse_extension):
24297 Use startswith function instead of strncmp.
24298 * common/config/bfin/bfin-common.c (bfin_handle_option): Likewise.
24299 * common/config/riscv/riscv-common.c (riscv_subset_list::parse): Likewise.
24300 * config/aarch64/aarch64-sve-builtins-shapes.cc (parse_type): Likewise.
24301 * config/aarch64/aarch64.c (aarch64_process_one_target_attr): Likewise.
24302 * config/alpha/alpha.c (alpha_elf_section_type_flags): Likewise.
24303 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
24304 * config/arm/arm.c (arm_file_start): Likewise.
24305 (arm_valid_target_attribute_rec): Likewise.
24306 (thumb1_md_asm_adjust): Likewise.
24307 * config/arm/driver-arm.c (host_detect_local_cpu): Likewise.
24308 * config/avr/avr.c (STR_PREFIX_P): Likewise.
24309 (avr_set_current_function): Likewise.
24310 (avr_handle_addr_attribute): Likewise.
24311 (avr_asm_output_aligned_decl_common): Likewise.
24312 (avr_asm_named_section): Likewise.
24313 (avr_section_type_flags): Likewise.
24314 (avr_asm_select_section): Likewise.
24315 * config/c6x/c6x.c (c6x_in_small_data_p): Likewise.
24316 (c6x_section_type_flags): Likewise.
24317 * config/darwin-c.c (darwin_cfstring_ref_p): Likewise.
24318 (darwin_objc_declare_unresolved_class_reference): Likewise.
24319 (darwin_objc_declare_class_definition): Likewise.
24320 * config/darwin.c (indirect_data): Likewise.
24321 (darwin_encode_section_info): Likewise.
24322 (darwin_objc2_section): Likewise.
24323 (darwin_objc1_section): Likewise.
24324 (machopic_select_section): Likewise.
24325 (darwin_globalize_label): Likewise.
24326 (darwin_label_is_anonymous_local_objc_name): Likewise.
24327 (darwin_asm_named_section): Likewise.
24328 (darwin_asm_output_dwarf_offset): Likewise.
24329 * config/frv/frv.c (frv_string_begins_with): Likewise.
24330 (frv_in_small_data_p): Likewise.
24331 * config/gcn/mkoffload.c (STR): Likewise.
24333 * config/i386/i386-builtins.c (get_builtin_code_for_version): Likewise.
24334 * config/i386/i386-options.c (ix86_option_override_internal): Likewise.
24335 * config/i386/i386.c (x86_64_elf_section_type_flags): Likewise.
24336 (ix86_md_asm_adjust): Likewise.
24337 * config/i386/intelmic-mkoffload.c (STR): Likewise.
24338 * config/i386/winnt.c (i386_pe_asm_named_section): Likewise.
24339 (i386_pe_file_end): Likewise.
24340 * config/ia64/ia64.c (ia64_in_small_data_p): Likewise.
24341 (ia64_section_type_flags): Likewise.
24342 * config/mips/driver-native.c (host_detect_local_cpu): Likewise.
24343 * config/mips/mips.c (mips_handle_interrupt_attr): Likewise.
24344 (mips16_stub_function_p): Likewise.
24345 (mips_function_rodata_section): Likewise.
24346 * config/msp430/msp430.c (msp430_mcu_name): Likewise.
24347 (msp430_function_section): Likewise.
24348 (msp430_section_type_flags): Likewise.
24349 (msp430_expand_helper): Likewise.
24350 * config/nios2/nios2.c (nios2_small_section_name_p): Likewise.
24351 (nios2_valid_target_attribute_rec): Likewise.
24352 * config/nvptx/mkoffload.c (process): Likewise.
24354 * config/pa/som.h: Likewise.
24355 * config/pdp11/pdp11.c (pdp11_output_ident): Likewise.
24356 * config/riscv/riscv.c (riscv_elf_select_rtx_section): Likewise.
24357 * config/rs6000/rs6000.c (VTABLE_NAME_P): Likewise.
24358 (rs6000_inner_target_options): Likewise.
24359 * config/s390/driver-native.c (s390_host_detect_local_cpu): Likewise.
24360 * config/sparc/driver-sparc.c (host_detect_local_cpu): Likewise.
24361 * config/vax/vax.c (vax_output_int_move): Likewise.
24362 * config/vms/vms-ld.c (startswith): Likewise.
24363 (process_args): Likewise.
24365 * config/vms/vms.c: Likewise.
24367 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24369 PR rtl-optimization/100590
24370 * regcprop.c (copyprop_hardreg_forward_1): Only DCE dead sets if
24371 they are NONJUMP_INSN_P.
24373 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24376 * function.c (push_dummy_function): Set DECL_ARTIFICIAL and
24377 DECL_ASSEMBLER_NAME on the fn_decl.
24379 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24381 PR tree-optimization/94589
24382 * tree-ssa-phiopt.c (spaceship_replacement): Pattern match
24383 phi result used in (res & ~1) == 0 comparison as res >= 0 as
24384 res == 2 would be UB with -ffinite-math-only.
24386 2021-05-18 Martin Liska <mliska@suse.cz>
24388 * Makefile.in: genversion.o should depend on DATESTAMP.
24390 2021-05-18 Claudiu Zissulescu <claziss@synopsys.com>
24392 * config/arc/simdext.md (negv2si2): Remove round bracket.
24394 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
24396 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Define
24397 _Bool as macro expanding to _Bool.
24399 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
24402 * tree.c (build_reference_type_for_mode)
24403 (build_pointer_type_for_mode): Pick pointer mode if MODE argument
24405 (build_reference_type, build_pointer_type): Invoke
24406 build_*_type_for_mode with VOIDmode.
24408 2021-05-17 Andrew MacLeod <amacleod@redhat.com>
24410 PR tree-optimization/100512
24411 * gimple-range-cache.cc (ranger_cache::set_global_range): Mark const
24412 and non-zero pointer ranges as invariant.
24413 * gimple-range.cc (gimple_ranger::range_of_stmt): Remove pointer
24414 processing from here.
24416 2021-05-17 Tom de Vries <tdevries@suse.de>
24419 * config/nvptx/nvptx-protos.h (nvptx_output_atomic_insn): Declare
24420 * config/nvptx/nvptx.c (nvptx_output_barrier)
24421 (nvptx_output_atomic_insn): New function.
24422 (nvptx_print_operand): Add support for 'B'.
24423 * config/nvptx/nvptx.md: Use nvptx_output_atomic_insn for atomic
24426 2021-05-17 Aldy Hernandez <aldyh@redhat.com>
24428 PR tree-optimization/100349
24429 * vr-values.c (bounds_of_var_in_loop): Bail if scev returns
24432 2021-05-17 Tamar Christina <tamar.christina@arm.com>
24434 * config/aarch64/driver-aarch64.c (DEFAULT_ARCH): New.
24435 (host_detect_local_cpu): Use it.
24437 2021-05-17 Martin Liska <mliska@suse.cz>
24439 * doc/invoke.texi: Add 2 missing dots.
24441 2021-05-17 Marius Hillenbrand <mhillen@linux.ibm.com>
24443 PR bootstrap/100552
24444 * configure.ac: Replace pattern substitution with call to sed.
24445 * configure: Regenerate.
24447 2021-05-17 Richard Biener <rguenther@suse.de>
24449 PR middle-end/100582
24450 * tree.c (array_at_struct_end_p): Get to the base of the
24451 reference before looking for the underlying decl.
24453 2021-05-17 Joern Rennecke <joern.rennecke@embecosm.com>
24455 * genoutput.c (validate_insn_alternatives) Make "wrong number of
24456 alternatives" message more specific, and remove assumption on where
24459 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24461 * config/arm/iterators.md (V16): New iterator.
24462 (VH_cvtto): New iterator.
24463 (v_cmp_result): Added V4HF and V8HF support.
24464 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result>): Use VDQWH.
24465 (vcond<mode><mode>): Likewise.
24466 (vcond_mask_<mode><v_cmp_result>): Likewise.
24467 (vcond<VH_cvtto><mode>): New expander.
24469 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24471 * config/arm/arm-protos.h (arm_expand_vector_compare): Update
24473 * config/arm/arm.c (arm_expand_vector_compare): Add support for
24475 (arm_expand_vcond): Likewise.
24476 * config/arm/iterators.md (supf): Remove VCMPNEQ_S, VCMPEQQ_S,
24477 VCMPEQQ_N_S, VCMPNEQ_N_S.
24478 (VCMPNEQ, VCMPEQQ, VCMPEQQ_N, VCMPNEQ_N): Remove.
24479 * config/arm/mve.md (@mve_vcmp<mve_cmp_op>q_<mode>): Add '@' prefix.
24480 (@mve_vcmp<mve_cmp_op>q_f<mode>): Likewise.
24481 (@mve_vcmp<mve_cmp_op>q_n_f<mode>): Likewise.
24482 (@mve_vpselq_<supf><mode>): Likewise.
24483 (@mve_vpselq_f<mode>"): Likewise.
24484 * config/arm/neon.md (vec_cmp<mode><v_cmp_result): Enable for MVE
24485 and move to vec-common.md.
24486 (vec_cmpu<mode><mode>): Likewise.
24487 (vcond<mode><mode>): Likewise.
24488 (vcond<V_cvtto><mode>): Likewise.
24489 (vcondu<mode><v_cmp_result>): Likewise.
24490 (vcond_mask_<mode><v_cmp_result>): Likewise.
24491 * config/arm/unspecs.md (VCMPNEQ_U, VCMPNEQ_S, VCMPEQQ_S)
24492 (VCMPEQQ_N_S, VCMPNEQ_N_S, VCMPEQQ_U, CMPEQQ_N_U, VCMPNEQ_N_U)
24493 (VCMPGEQ_N_S, VCMPGEQ_S, VCMPGTQ_N_S, VCMPGTQ_S, VCMPLEQ_N_S)
24494 (VCMPLEQ_S, VCMPLTQ_N_S, VCMPLTQ_S, VCMPCSQ_N_U, VCMPCSQ_U)
24495 (VCMPHIQ_N_U, VCMPHIQ_U): Remove.
24496 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result): Moved
24498 (vec_cmpu<mode><mode>): Likewise.
24499 (vcond<mode><mode>): Likewise.
24500 (vcond<V_cvtto><mode>): Likewise.
24501 (vcondu<mode><v_cmp_result>): Likewise.
24502 (vcond_mask_<mode><v_cmp_result>): Likewise. Added unsafe math
24505 2021-05-17 liuhongt <hongtao.liu@intel.com>
24508 * config/i386/i386.c (ix86_gimple_fold_builtin): Use
24509 gsi_insert_seq_before instead.
24511 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24513 * doc/sourcebuild.texi (arm_qbit_ok): Rename into...
24514 (arm_sat_ok): ...this.
24516 2021-05-17 Martin Liska <mliska@suse.cz>
24518 * lto-wrapper.c (merge_flto_options): Factor out a new function.
24519 (merge_and_complain): Use it.
24520 (run_gcc): Merge also linker command line -flto=foo argument
24523 2021-05-16 Christophe Lyon <christophe.lyon@linaro.org>
24525 * config/arm/arm.h (CPP_SPEC): Remove error message about
24526 -mlittle-endian/-mbig-endian conflict.
24528 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24530 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
24531 __ROP_PROTECT__ if -mrop-protect is selected.
24533 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24535 * config/rs6000/rs6000-internal.h (rs6000_stack): Add
24536 rop_hash_save_offset and rop_hash_size.
24537 * config/rs6000/rs6000-logue.c (rs6000_stack_info): Compute
24538 rop_hash_size and rop_hash_save_offset.
24539 (debug_stack_info): Dump rop_hash_save_offset and rop_hash_size.
24540 (rs6000_emit_prologue): Emit hashst[p] in prologue.
24541 (rs6000_emit_epilogue): Emit hashchk[p] in epilogue.
24542 * config/rs6000/rs6000.md (unspec): Add UNSPEC_HASHST and
24544 (hashst): New define_insn.
24545 (hashchk): Likewise.
24547 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24549 * config/rs6000/rs6000.c (rs6000_option_override_internal):
24550 Disable shrink wrap when inserting ROP-protect instructions.
24551 * config/rs6000/rs6000.opt (mrop-protect): New option.
24552 (mprivileged): Likewise.
24553 * doc/invoke.texi: Document mrop-protect and mprivileged.
24555 2021-05-15 Hans-Peter Nilsson <hp@axis.com>
24557 * reorg.c (fill_slots_from_thread): Reinstate code typoed out in
24560 2021-05-15 Martin Jambor <mjambor@suse.cz>
24563 2021-05-13 Martin Jambor <mjambor@suse.cz>
24565 PR tree-optimization/100453
24566 * tree-sra.c (sra_modify_assign): All const base accesses do not
24567 need refreshing, not just those from decl_pool.
24568 (sra_modify_assign): Do not refresh into a const base decl.
24570 2021-05-15 Jakub Jelinek <jakub@redhat.com>
24572 PR rtl-optimization/100342
24573 * regcprop.c (copy_value): When copying a source reg in a wider
24574 mode than it has recorded for the value, adjust recorded destination
24575 mode too or punt if !REG_CAN_CHANGE_MODE_P.
24577 2021-05-14 Jason Merrill <jason@redhat.com>
24579 * intl.h: Add comments.
24581 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24583 * config/aarch64/aarch64-simd.md
24584 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>_internal): Split into...
24585 (aarch64_sqdmlsl2_lane<mode>_internal): ... This...
24586 (aarch64_sqdmlal2_lane<mode>_internal): ... And this.
24587 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>_internal): Split into ...
24588 (aarch64_sqdmlsl2_laneq<mode>_internal): ... This...
24589 (aarch64_sqdmlal2_laneq<mode>_internal): ... And this.
24590 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>_internal): Split into...
24591 (aarch64_sqdmlsl2_n<mode>_internal): ... This...
24592 (aarch64_sqdmlal2_n<mode>_internal): ... And this.
24594 2021-05-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
24597 * config/arm/arm_neon.h (vtst_s8): Replace call to vtst builtin with it's
24598 boolean logic equivalent.
24599 (vtst_s16): Likewise.
24600 (vtst_s32): Likewise.
24601 (vtst_u8): Likewise.
24602 (vtst_u16): Likewise.
24603 (vtst_u32): Likewise.
24604 (vtst_p8): Likewise.
24605 (vtst_p16): Likewise.
24606 (vtstq_s8): Likewise.
24607 (vtstq_s16): Likewise.
24608 (vtstq_s32): Likewise.
24609 (vtstq_u8): Likewise.
24610 (vtstq_u16): Likewise.
24611 (vtstq_u32): Likewise.
24612 (vtstq_p8): Likewise.
24613 (vtstq_p16): Likewise.
24614 * config/arm/arm_neon_builtins.def: Remove entry for vtst.
24615 * config/arm/neon.md (neon_vtst<mode>): Remove pattern.
24617 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24619 * config/aarch64/aarch64-simd.md (aarch64_sqdmlal2<mode>): Merge into...
24620 (aarch64_sqdml<SBINQOPS:as>l2<mode>): ... This.
24621 (aarch64_sqdmlsl2<mode>): Delete.
24622 (aarch64_sqdmlal2_lane<mode>): Merge this...
24623 (aarch64_sqdmlsl2_lane<mode>): ... And this...
24624 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>): ... Into this.
24625 (aarch64_sqdmlal2_laneq<mode>): Merge this...
24626 (aarch64_sqdmlsl2_laneq<mode>): ... And this...
24627 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>): ... Into this.
24628 (aarch64_sqdmlal2_n<mode>): Merge this...
24629 (aarch64_sqdmlsl2_n<mode>): ... And this...
24630 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>): ... Into this.
24632 2021-05-13 Martin Sebor <msebor@redhat.com>
24634 PR middle-end/100574
24635 * builtins.c (access_ref::get_ref): Improve detection of PHIs with
24636 all null arguments.
24638 2021-05-13 Martin Sebor <msebor@redhat.com>
24640 PR tree-optimization/93100
24641 PR middle-end/98583
24642 * tree-ssa-uninit.c (check_defs): Exclude intrinsic functions that
24643 don't modify referenced objects.
24645 2021-05-13 Martin Jambor <mjambor@suse.cz>
24647 PR tree-optimization/100453
24648 * tree-sra.c (sra_modify_assign): All const base accesses do not
24649 need refreshing, not just those from decl_pool.
24650 (sra_modify_assign): Do not refresh into a const base decl.
24652 2021-05-13 Martin Liska <mliska@suse.cz>
24654 * tree-ssa-dom.c: Remove m_simplifier.
24656 2021-05-13 Richard Earnshaw <rearnsha@arm.com>
24659 * config/arm/arm.c (arm_canonicalize_comparison): Correctly
24660 canonicalize DImode inequality comparisons against the
24661 maximum integral value.
24663 2021-05-13 Jakub Jelinek <jakub@redhat.com>
24665 PR tree-optimization/98856
24666 * config/i386/i386.c (ix86_shift_rotate_cost): Add CODE argument.
24667 Expect V2DI and V4DI arithmetic right shifts to be emulated.
24668 (ix86_rtx_costs, ix86_add_stmt_cost): Adjust ix86_shift_rotate_cost
24670 * config/i386/i386-expand.c (expand_vec_perm_2perm_interleave,
24671 expand_vec_perm_2perm_pblendv): New functions.
24672 (ix86_expand_vec_perm_const_1): Use them.
24673 * config/i386/sse.md (ashr<mode>3<mask_name>): Rename to ...
24674 (<mask_codefor>ashr<mode>3<mask_name>): ... this.
24675 (ashr<mode>3): New define_expand with VI248_AVX512BW iterator.
24676 (ashrv4di3): New define_expand.
24677 (ashrv2di3): Change condition to TARGET_SSE2, handle !TARGET_XOP
24678 and !TARGET_AVX512VL expansion.
24680 2021-05-13 Uroš Bizjak <ubizjak@gmail.com>
24683 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Force mode
24684 sizes < 16 to a register when constructing vpcmov pattern.
24685 * config/i386/mmx.md (*xop_pcmov_<mode>): Use MMXMODE124 mode.
24687 2021-05-13 Martin Liska <mliska@suse.cz>
24689 * gcov-io.c (gcov_write_block): Remove.
24690 (gcov_write_words): Likewise.
24691 (gcov_read_words): Re-implement using gcov_read_bytes.
24692 (gcov_allocate): Remove.
24693 (GCOV_BLOCK_SIZE): Likewise.
24694 (struct gcov_var): Remove most of the fields.
24695 (gcov_position): Implement with ftell.
24696 (gcov_rewrite): Remove setting of start and offset fields.
24697 (from_file): Re-format.
24698 (gcov_open): Remove setbuf call. It should not be needed.
24699 (gcov_close): Remove internal buffer handling.
24700 (gcov_magic): Use __builtin_bswap32.
24701 (gcov_write_counter): Use directly gcov_write_unsigned.
24702 (gcov_write_string): Use direct fwrite and do not round
24704 (gcov_seek): Use directly fseek.
24705 (gcov_write_tag): Use gcov_write_unsigned directly.
24706 (gcov_write_length): Likewise.
24707 (gcov_write_tag_length): Likewise.
24708 (gcov_read_bytes): Use directly fread.
24709 (gcov_read_unsigned): Use gcov_read_words.
24710 (gcov_read_counter): Likewise.
24711 (gcov_read_string): Use gcov_read_bytes.
24712 * gcov-io.h (GCOV_WORD_SIZE): Adjust to reflect
24713 that size is not in bytes, but words (4B).
24714 (GCOV_TAG_FUNCTION_LENGTH): Likewise.
24715 (GCOV_TAG_ARCS_LENGTH): Likewise.
24716 (GCOV_TAG_ARCS_NUM): Likewise.
24717 (GCOV_TAG_COUNTER_LENGTH): Likewise.
24718 (GCOV_TAG_COUNTER_NUM): Likewise.
24719 (GCOV_TAG_SUMMARY_LENGTH): Likewise.
24721 2021-05-13 liuhongt <hongtao.liu@intel.com>
24724 * config/i386/sse.md (ssedoublevecmode): Add attribute for
24725 V64QI/V32HI/V16SI/V4DI.
24726 (ssehalfvecmode): Add attribute for V2DI/V2DF.
24727 (*vec_concatv4si_0): Extend to VI124_128.
24728 (*vec_concat<mode>_0): New pre-reload splitter.
24729 * config/i386/predicates.md (movq_parallel): New predicate.
24731 2021-05-13 Alexandre Oliva <oliva@adacore.com>
24733 * targhooks.c (default_zero_call_used_regs): Retry using
24734 successfully-zeroed registers as sources.
24736 2021-05-12 Tobias Burnus <tobias@codesourcery.com>
24738 * omp-low.c (finish_taskreg_scan): Use the proper detach decl.
24740 2021-05-12 Aldy Hernandez <aldyh@redhat.com>
24743 * gimple-range.cc (range_of_builtin_call): Skip out on
24744 processing __builtin_clz when varying.
24746 2021-05-12 Tom de Vries <tdevries@suse.de>
24749 * config/nvptx/nvptx-opts.h (enum ptx_version): New enum.
24750 * config/nvptx/nvptx.c (nvptx_file_start): Print .version according
24751 to ptx_version_option.
24752 * config/nvptx/nvptx.h (TARGET_PTX_6_3): Define.
24753 * config/nvptx/nvptx.md (define_insn "nvptx_shuffle<mode>")
24754 (define_insn "nvptx_vote_ballot"): Use sync variant for
24756 * config/nvptx/nvptx.opt (ptx_version): Add enum.
24757 (mptx): Add option.
24758 * doc/invoke.texi (Nvidia PTX Options): Add mptx item.
24760 2021-05-12 Richard Biener <rguenther@suse.de>
24762 PR tree-optimization/100566
24763 * tree-ssa-sccvn.c (dominated_by_p_w_unex): Properly handle
24764 allow_back for all edge queries.
24766 2021-05-12 liuhongt <hongtao.liu@intel.com>
24769 * config/i386/sse.md (<sse4_1_avx2>_pblendvb): Add
24770 splitters for pblendvb of NOT mask register.
24772 2021-05-12 Richard Biener <rguenther@suse.de>
24774 PR tree-optimization/100519
24775 * tree-ssa-reassoc.c (can_associate_p): Split into...
24776 (can_associate_op_p): ... this
24777 (can_associate_type_p): ... and this.
24778 (is_reassociable_op): Call can_associate_op_p.
24779 (break_up_subtract_bb): Call the appropriate predicates.
24780 (reassociate_bb): Likewise.
24782 2021-05-12 Martin Liska <mliska@suse.cz>
24784 * lto-wrapper.c (merge_and_complain): Merge -flto=arg options.
24785 (run_gcc): Use -flto argument detection for merged
24788 2021-05-12 Martin Liska <mliska@suse.cz>
24790 * lto-wrapper.c (print_lto_docs_link): New function.
24791 (run_gcc): Print warning about missing job server detection
24792 after we know NR of partitions. Do the same for -flto{,=1}.
24793 * opts.c (get_option_html_page): Support -flto option.
24795 2021-05-12 Martin Liska <mliska@suse.cz>
24797 * lto-wrapper.c (get_options_from_collect_gcc_options): Change
24799 (append_option): Remove.
24800 (find_option): Rework to use the vector type.
24801 (remove_option): Remove.
24802 (merge_and_complain): Use vectors for cl_decoded_option data
24804 (append_compiler_options): Likewise.
24805 (append_diag_options): Likewise.
24806 (append_linker_options): Likewise.
24807 (append_offload_options): Likewise.
24808 (compile_offload_image): Likewise.
24809 (compile_images_for_offload_targets): Likewise.
24810 (find_and_merge_options): Likewise.
24811 (run_gcc): Likewise.
24813 2021-05-12 Bernd Edlinger <bernd.edlinger@hotmail.de>
24816 * dwarf2out.c (dwarf2out_finish): Set
24817 have_multiple_function_sections with multi-range text_section.
24819 2021-05-12 Martin Liska <mliska@suse.cz>
24821 PR bootstrap/100560
24822 * Makefile.in: Remove version.h from linker command line.
24824 2021-05-12 Richard Biener <rguenther@suse.de>
24826 PR middle-end/100547
24827 * rtl.h (rtvec_alloc): Make argument size_t.
24828 * rtl.c (rtvec_alloc): Verify the count is less than INT_MAX.
24830 2021-05-12 Jakub Jelinek <jakub@redhat.com>
24832 PR middle-end/100508
24833 * cfgexpand.c (expand_debug_expr): For DEBUG_EXPR_DECL with vector
24834 type, don't reuse DECL_RTL if it has different mode, instead force
24835 creation of a new DEBUG_EXPR.
24837 2021-05-12 Jakub Jelinek <jakub@redhat.com>
24838 Marc Glisse <marc.glisse@inria.fr>
24840 PR tree-optimization/94589
24841 * match.pd ((X & Y) == X -> (X & ~Y) == 0,
24842 (X | Y) == Y -> (X & ~Y) == 0): New GIMPLE simplifications.
24844 2021-05-12 Uroš Bizjak <ubizjak@gmail.com>
24847 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Handle V2SF mode.
24848 * config/i386/mmx.md (MMXMODE124): New mode iterator.
24850 (mmxintvecmode): New mode attribute.
24851 (mmxintvecmodelower): Ditto.
24852 (*mmx_maskcmpv2sf3_comm): New insn pattern.
24853 (*mmx_maskcmpv2sf3): Ditto.
24854 (vec_cmpv2sfv2si): New expander.
24855 (vcond<V2FI:mode>v2si): Ditto.
24856 (mmx_vlendvps): New insn pattern.
24857 (vcond<MMXMODE124:mode><MMXMODEI:mode>): Also handle V2SFmode.
24858 (vcondu<MMXMODE124:mode><MMXMODEI:mode>): Ditto.
24859 (vcond_mask_<mode><mmxintvecmodelower>): Ditto.
24861 2021-05-11 Martin Sebor <msebor@redhat.com>
24863 PR middle-end/21433
24864 * expr.c (expand_expr_real_1): Replace unreachable code with an assert.
24866 2021-05-11 Richard Biener <rguenther@suse.de>
24868 * gimple-fold.c (gimple_fold_call): Do not call
24869 maybe_fold_reference on call arguments or the static chain.
24870 (fold_stmt_1): Do not call maybe_fold_reference on GIMPLE_ASM
24873 2021-05-11 Martin Liska <mliska@suse.cz>
24875 * builtins.def (DEF_HSAIL_BUILTIN): Remove.
24876 (DEF_HSAIL_ATOMIC_BUILTIN): Likewise.
24877 (DEF_HSAIL_SAT_BUILTIN): Likewise.
24878 (DEF_HSAIL_INTR_BUILTIN): Likewise.
24879 (DEF_HSAIL_CVT_ZEROI_SAT_BUILTIN): Likewise.
24880 * doc/frontends.texi: Remove BRIG.
24881 * doc/install.texi: Likewise.
24882 * doc/invoke.texi: Likewise.
24883 * doc/standards.texi: Likewise.
24884 * brig-builtins.def: Removed.
24885 * brig/ChangeLog: Removed.
24886 * brig/Make-lang.in: Removed.
24887 * brig/brig-builtins.h: Removed.
24888 * brig/brig-c.h: Removed.
24889 * brig/brig-lang.c: Removed.
24890 * brig/brigfrontend/brig-arg-block-handler.cc: Removed.
24891 * brig/brigfrontend/brig-atomic-inst-handler.cc: Removed.
24892 * brig/brigfrontend/brig-basic-inst-handler.cc: Removed.
24893 * brig/brigfrontend/brig-branch-inst-handler.cc: Removed.
24894 * brig/brigfrontend/brig-cmp-inst-handler.cc: Removed.
24895 * brig/brigfrontend/brig-code-entry-handler.cc: Removed.
24896 * brig/brigfrontend/brig-code-entry-handler.h: Removed.
24897 * brig/brigfrontend/brig-comment-handler.cc: Removed.
24898 * brig/brigfrontend/brig-control-handler.cc: Removed.
24899 * brig/brigfrontend/brig-copy-move-inst-handler.cc: Removed.
24900 * brig/brigfrontend/brig-cvt-inst-handler.cc: Removed.
24901 * brig/brigfrontend/brig-fbarrier-handler.cc: Removed.
24902 * brig/brigfrontend/brig-function-handler.cc: Removed.
24903 * brig/brigfrontend/brig-function.cc: Removed.
24904 * brig/brigfrontend/brig-function.h: Removed.
24905 * brig/brigfrontend/brig-inst-mod-handler.cc: Removed.
24906 * brig/brigfrontend/brig-label-handler.cc: Removed.
24907 * brig/brigfrontend/brig-lane-inst-handler.cc: Removed.
24908 * brig/brigfrontend/brig-machine.c: Removed.
24909 * brig/brigfrontend/brig-machine.h: Removed.
24910 * brig/brigfrontend/brig-mem-inst-handler.cc: Removed.
24911 * brig/brigfrontend/brig-module-handler.cc: Removed.
24912 * brig/brigfrontend/brig-queue-inst-handler.cc: Removed.
24913 * brig/brigfrontend/brig-seg-inst-handler.cc: Removed.
24914 * brig/brigfrontend/brig-signal-inst-handler.cc: Removed.
24915 * brig/brigfrontend/brig-to-generic.cc: Removed.
24916 * brig/brigfrontend/brig-to-generic.h: Removed.
24917 * brig/brigfrontend/brig-util.cc: Removed.
24918 * brig/brigfrontend/brig-util.h: Removed.
24919 * brig/brigfrontend/brig-variable-handler.cc: Removed.
24920 * brig/brigfrontend/hsa-brig-format.h: Removed.
24921 * brig/brigfrontend/phsa.h: Removed.
24922 * brig/brigspec.c: Removed.
24923 * brig/config-lang.in: Removed.
24924 * brig/gccbrig.texi: Removed.
24925 * brig/lang-specs.h: Removed.
24926 * brig/lang.opt: Removed.
24928 2021-05-11 Richard Biener <rguenther@suse.de>
24931 * ipa-param-manipulation.c
24932 (ipa_param_body_adjustments::modify_call_stmt): Avoid
24933 altering SSA_NAME_DEF_STMT by adjusting the calls LHS
24934 via gimple_call_lhs_ptr.
24936 2021-05-11 Alex Coplan <alex.coplan@arm.com>
24939 * config/arm/arm.c (cmse_nonsecure_call_inline_register_clear):
24940 Avoid emitting CFA adjusts on the sp if we have the fp.
24942 2021-05-11 Richard Sandiford <richard.sandiford@arm.com>
24944 * config/aarch64/iterators.md (VMUL_CHANGE_NLANES): Delete.
24945 (VMULD): New iterator.
24946 (VCOND): Handle V4HF and V8HF.
24947 (VCONQ): Fix entry for V2SF.
24948 * config/aarch64/aarch64-simd.md (mul_lane<mode>3): Use VMULD
24949 instead of VMUL. Use a 64-bit vector mode for the indexed operand.
24950 (*aarch64_mul3_elt_<vswap_width_name><mode>): Merge with...
24951 (mul_laneq<mode>3): ...this define_insn. Use VMUL instead of VDQSF.
24952 Use a 128-bit vector mode for the indexed operand. Use stype for
24953 the scheduling type.
24955 2021-05-11 Richard Biener <rguenther@suse.de>
24957 * gimple-fold.c (maybe_fold_reference): Only return
24958 is_gimple_min_invariant values.
24960 2021-05-11 Richard Biener <rguenther@suse.de>
24962 PR middle-end/100509
24963 * gimple-fold.c (fold_gimple_assign): Only call
24964 get_symbol_constant_value on register type symbols.
24966 2021-05-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
24967 Joe Ramsay <joe.ramsay@arm.com>
24970 * config/arm/arm_mve.h (__arm_vstrwq_scatter_offset): Fix wrong arguments.
24971 (__arm_vcmpneq): Remove duplicate definition.
24972 (__arm_vstrwq_scatter_offset_p): Likewise.
24973 (__arm_vmaxq_x): Likewise.
24974 (__arm_vmlsdavaq): Likewise.
24975 (__arm_vmlsdavaxq): Likewise.
24976 (__arm_vmlsdavq_p): Likewise.
24977 (__arm_vmlsdavxq_p): Likewise.
24978 (__arm_vrmlaldavhaq): Likewise.
24979 (__arm_vstrbq_p): Likewise.
24980 (__arm_vstrbq_scatter_offset): Likewise.
24981 (__arm_vstrbq_scatter_offset_p): Likewise.
24982 (__arm_vstrdq_scatter_offset): Likewise.
24983 (__arm_vstrdq_scatter_offset_p): Likewise.
24984 (__arm_vstrdq_scatter_shifted_offset): Likewise.
24985 (__arm_vstrdq_scatter_shifted_offset_p): Likewise.
24987 2021-05-11 Jakub Jelinek <jakub@redhat.com>
24989 PR middle-end/100471
24990 * omp-low.c (lower_omp_task_reductions): For OMP_TASKLOOP, if data
24991 is 0, bypass the reduction loop including
24992 GOMP_taskgroup_reduction_unregister call.
24994 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
24996 * config/rs6000/rs6000.c (struct rs6000_cost_data): New member
24997 costing_for_scalar.
24998 (rs6000_density_test): Early return if costing_for_scalar is true.
24999 (rs6000_init_cost): Init costing_for_scalar of rs6000_cost_data.
25001 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
25003 * doc/tm.texi: Regenerated.
25004 * target.def (init_cost): Add new parameter costing_for_scalar.
25005 * targhooks.c (default_init_cost): Adjust for new parameter.
25006 * targhooks.h (default_init_cost): Likewise.
25007 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Likewise.
25008 (vect_compute_single_scalar_iteration_cost): Likewise.
25009 (vect_analyze_loop_2): Likewise.
25010 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Likewise.
25011 (vect_bb_vectorization_profitable_p): Likewise.
25012 * tree-vectorizer.h (init_cost): Likewise.
25013 * config/aarch64/aarch64.c (aarch64_init_cost): Likewise.
25014 * config/i386/i386.c (ix86_init_cost): Likewise.
25015 * config/rs6000/rs6000.c (rs6000_init_cost): Likewise.
25017 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
25019 * config/rs6000/rs6000.c (rs6000_vect_nonmem): Renamed to
25020 vect_nonmem and moved into...
25021 (struct rs6000_cost_data): ...here.
25022 (rs6000_init_cost): Use vect_nonmem of cost_data instead.
25023 (rs6000_add_stmt_cost): Likewise.
25024 (rs6000_finish_cost): Likewise.
25026 2021-05-10 Eric Botcazou <ebotcazou@adacore.com>
25028 * range-op.cc (get_bool_state): Adjust head comment.
25029 (operator_not_equal::op1_range): Fix comment.
25030 (operator_bitwise_xor::op1_range): Remove call to gcc_unreachable.
25032 2021-05-10 Martin Sebor <msebor@redhat.com>
25034 PR middle-end/100425
25035 PR middle-end/100510
25036 * gimple-ssa-warn-alloca.c (pass_walloca::firast_time_p): Rename...
25037 (pass_walloca::xlimit_certain_p): ...to this.
25038 (pass_walloca::gate): Execute for any kind of handled warning.
25039 (pass_walloca::execute): Avoid issuing "maybe" and "unbounded"
25040 warnings when xlimit_certain_p is set.
25042 2021-05-10 Pat Haugen <pthaugen@linux.ibm.com>
25044 * config/rs6000/rs6000.c (rs6000_ira_change_pseudo_allocno_class):
25045 Return ALTIVEC_REGS if that is best_class.
25046 (rs6000_compute_pressure_classes): Add ALTIVEC_REGS.
25048 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25050 * config/arm/arm.h (CPP_SPEC): Remove error message about
25053 2021-05-10 Martin Jambor <mjambor@suse.cz>
25055 * ipa-prop.h (IPA_NODE_REF): Removed.
25056 (IPA_NODE_REF_GET_CREATE): Likewise.
25057 (IPA_EDGE_REF): Likewise.
25058 (IPA_EDGE_REF_GET_CREATE): Likewise.
25059 (IS_VALID_JUMP_FUNC_INDEX): Likewise.
25060 * ipa-cp.c (print_all_lattices): Replaced IPA_NODE_REF with a direct
25061 use of ipa_node_params_sum.
25062 (ipcp_versionable_function_p): Likewise.
25063 (push_node_to_stack): Likewise.
25064 (pop_node_from_stack): Likewise.
25065 (set_single_call_flag): Replaced two IPA_NODE_REF with one single
25066 direct use of ipa_node_params_sum.
25067 (initialize_node_lattices): Replaced IPA_NODE_REF with a direct use of
25068 ipa_node_params_sum.
25069 (ipa_context_from_jfunc): Replaced IPA_EDGE_REF with a direct use of
25071 (ipcp_verify_propagated_values): Replaced IPA_NODE_REF with a direct
25072 use of ipa_node_params_sum.
25073 (self_recursively_generated_p): Likewise.
25074 (propagate_scalar_across_jump_function): Likewise.
25075 (propagate_context_across_jump_function): Replaced IPA_EDGE_REF with a
25076 direct use of ipa_edge_args_sum, moved the lookup after the early
25077 exit. Replaced IPA_NODE_REF with a direct use of ipa_node_params_sum.
25078 (propagate_bits_across_jump_function): Replaced IPA_NODE_REF with
25079 direct uses of ipa_node_params_sum.
25080 (propagate_vr_across_jump_function): Likewise.
25081 (propagate_aggregate_lattice): Likewise.
25082 (propagate_aggs_across_jump_function): Likewise.
25083 (propagate_constants_across_call): Likewise, also replaced
25084 IPA_EDGE_REF with a direct use of ipa_edge_args_sum.
25085 (good_cloning_opportunity_p): Replaced IPA_NODE_REF with a direct use
25086 of ipa_node_params_sum.
25087 (estimate_local_effects): Likewise.
25088 (add_all_node_vals_to_toposort): Likewise.
25089 (propagate_constants_topo): Likewise.
25090 (ipcp_propagate_stage): Likewise.
25091 (ipcp_discover_new_direct_edges): Likewise.
25092 (calls_same_node_or_its_all_contexts_clone_p): Likewise.
25093 (cgraph_edge_brings_value_p): Likewise (in both overloaded functions).
25094 (get_info_about_necessary_edges): Likewise.
25095 (want_remove_some_param_p): Likewise.
25096 (create_specialized_node): Likewise.
25097 (self_recursive_pass_through_p): Likewise.
25098 (self_recursive_agg_pass_through_p): Likewise.
25099 (find_more_scalar_values_for_callers_subset): Likewise and also
25100 replaced IPA_EDGE_REF with direct uses of ipa_edge_args_sum, in one
25101 case replacing two of those with a single query.
25102 (find_more_contexts_for_caller_subset): Likewise for the
25103 ipa_polymorphic_call_context overload.
25104 (intersect_aggregates_with_edge): Replaced IPA_EDGE_REF with a direct
25105 use of ipa_edge_args_sum. Replaced IPA_NODE_REF with direct uses of
25106 ipa_node_params_sum.
25107 (find_aggregate_values_for_callers_subset): Likewise, also reusing
25108 results of ipa_edge_args_sum->get.
25109 (cgraph_edge_brings_all_scalars_for_node): Replaced IPA_NODE_REF with
25110 direct uses of ipa_node_params_sum, replaced IPA_EDGE_REF with a
25111 direct use of ipa_edge_args_sum.
25112 (cgraph_edge_brings_all_agg_vals_for_node): Likewise, moved node
25113 summary query after the early exit and reused the result later.
25114 (decide_about_value): Replaced IPA_NODE_REF with a direct use of
25115 ipa_node_params_sum.
25116 (decide_whether_version_node): Likewise. Removed re-querying for
25117 summaries after cloning.
25118 (spread_undeadness): Replaced IPA_NODE_REF with a direct use of
25119 ipa_node_params_sum.
25120 (has_undead_caller_from_outside_scc_p): Likewise, reusing results of
25122 (identify_dead_nodes): Likewise.
25123 (ipcp_store_bits_results): Replaced IPA_NODE_REF with direct uses of
25124 ipa_node_params_sum.
25125 (ipcp_store_vr_results): Likewise.
25126 * ipa-fnsummary.c (evaluate_properties_for_edge): Likewise.
25127 (ipa_fn_summary_t::duplicate): Likewise.
25128 (analyze_function_body): Likewise.
25129 (estimate_calls_size_and_time): Likewise.
25130 (ipa_cached_call_context::duplicate_from): Likewise.
25131 (ipa_call_context::equal_to): Likewise.
25132 (remap_edge_params): Likewise.
25133 (ipa_merge_fn_summary_after_inlining): Likewise.
25134 (inline_read_section): Likewise.
25135 * ipa-icf.c (sem_function::param_used_p): Likewise.
25136 * ipa-modref.c (compute_parm_map): Likewise.
25137 (compute_parm_map): Replaced IPA_EDGE_REF with a direct use of
25139 (get_access_for_fnspec): Replaced IPA_NODE_REF with a direct use of
25140 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
25142 * ipa-profile.c (check_argument_count): Likewise.
25143 * ipa-prop.c (ipa_alloc_node_params): Replaced IPA_NODE_REF_GET_CREATE
25144 with a direct use of ipa_node_params_sum.
25145 (ipa_initialize_node_params): Likewise.
25146 (ipa_print_node_jump_functions_for_edge): Replaced IPA_EDGE_REF with a
25147 direct use of ipa_edge_args_sum and reused the query result.
25148 (ipa_compute_jump_functions_for_edge): Replaced IPA_NODE_REF with a
25149 direct use of ipa_node_params_sum and replaced IPA_EDGE_REF with a
25150 direct use of ipa_edge_args_sum.
25151 (ipa_note_param_call): Replaced IPA_NODE_REF with a direct use of
25152 ipa_node_params_sum and reused the result of the query.
25153 (ipa_analyze_node): Likewise.
25154 (ipa_analyze_controlled_uses): Replaced IPA_NODE_REF with a direct use
25155 of ipa_node_params_sum.
25156 (update_jump_functions_after_inlining): Replaced IPA_EDGE_REF with
25157 direct uses of ipa_edge_args_sum.
25158 (update_indirect_edges_after_inlining): Replaced IPA_NODE_REF with
25159 direct uses of ipa_node_params_sum and replaced IPA_EDGE_REF with a
25160 direct use of ipa_edge_args_sum. Removed superficial re-querying the
25162 (propagate_controlled_uses): Replaced IPA_NODE_REF with direct uses of
25163 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
25165 (ipa_propagate_indirect_call_infos): Replaced IPA_EDGE_REF with a
25166 direct use of ipa_edge_args_sum.
25167 (ipa_edge_args_sum_t::duplicate): Replaced IPA_NODE_REF with a direct
25168 use of ipa_node_params_sum.
25169 (ipa_print_node_params): Likewise.
25170 (ipa_write_node_info): Likewise and also replaced IPA_EDGE_REF with
25171 direct uses of ipa_edge_args_sum.
25172 (ipa_read_edge_info): Replaced IPA_EDGE_REF with a direct use of
25174 (ipa_read_node_info): Replaced IPA_NODE_REF with a direct use of
25175 ipa_node_params_sum.
25176 (ipa_prop_write_jump_functions): Likewise. Move variable node to the
25177 scopes where it is used.
25179 2021-05-10 Uroš Bizjak <ubizjak@gmail.com>
25181 * config/i386/i386-expand.c (ix86_expand_sse_movcc)
25182 <case E_V2SImode>: Force op_true to register.
25184 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25186 * config/arm/iterators.md (MVE_FP_COMPARISONS): New.
25187 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_f<mode>)
25188 (mve_vcmp<mve_cmp_op>q_n_f<mode>): New, merge all vcmp_*f*
25190 (mve_vcmpeqq_f<mode>, mve_vcmpeqq_n_f<mode>, mve_vcmpgeq_f<mode>)
25191 (mve_vcmpgeq_n_f<mode>, mve_vcmpgtq_f<mode>)
25192 (mve_vcmpgtq_n_f<mode>, mve_vcmpleq_f<mode>)
25193 (mve_vcmpleq_n_f<mode>, mve_vcmpltq_f<mode>)
25194 (mve_vcmpltq_n_f<mode>, mve_vcmpneq_f<mode>)
25195 (mve_vcmpneq_n_f<mode>): Remove.
25196 * config/arm/unspecs.md (VCMPEQQ_F, VCMPEQQ_N_F, VCMPGEQ_F)
25197 (VCMPGEQ_N_F, VCMPGTQ_F, VCMPGTQ_N_F, VCMPLEQ_F, VCMPLEQ_N_F)
25198 (VCMPLTQ_F, VCMPLTQ_N_F, VCMPNEQ_F, VCMPNEQ_N_F): Remove.
25200 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25202 * config/arm/iterators.md (MVE_COMPARISONS): New.
25204 (mve_cmp_type): New.
25205 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_<mode>): New, merge all
25207 (mve_vcmpneq_<mode>, mve_vcmpcsq_n_<mode>, mve_vcmpcsq_<mode>)
25208 (mve_vcmpeqq_n_<mode>, mve_vcmpeqq_<mode>, mve_vcmpgeq_n_<mode>)
25209 (mve_vcmpgeq_<mode>, mve_vcmpgtq_n_<mode>, mve_vcmpgtq_<mode>)
25210 (mve_vcmphiq_n_<mode>, mve_vcmphiq_<mode>, mve_vcmpleq_n_<mode>)
25211 (mve_vcmpleq_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
25212 (mve_vcmpneq_n_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
25213 (mve_vcmpneq_n_<mode>): Remove.
25215 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25217 * config/arm/arm_mve.h (__arm_vcmp*): Remove 's' suffix.
25218 * config/arm/arm_mve_builtins.def (vcmp*): Remove 's' suffix.
25219 * config/arm/mve.md (mve_vcmp*): Remove 's' suffix in pattern
25222 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25224 * config/arm/arm_mve_builtins.def (vcmpneq_u): Remove.
25225 (vcmpneq_n_u): Likewise.
25226 (vcmpeqq_u,): Likewise.
25227 (vcmpeqq_n_u): Likewise.
25228 * config/arm/iterators.md (supf): Remove VCMPNEQ_U, VCMPEQQ_U,
25229 VCMPEQQ_N_U and VCMPNEQ_N_U.
25230 * config/arm/mve.md (mve_vcmpneq): Remove <supf> iteration.
25231 (mve_vcmpeqq_n): Likewise.
25232 (mve_vcmpeqq): Likewise.
25233 (mve_vcmpneq_n): Likewise.
25235 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25237 * config/arm/arm_mve.h (__arm_vcmpeq*u*, __arm_vcmpne*u*): Call
25238 the 's' version of the builtin.
25240 2021-05-10 Richard Biener <rguenther@suse.de>
25242 PR tree-optimization/100492
25243 * tree-loop-distribution.c (find_seed_stmts_for_distribution):
25244 Find nothing when the loop contains an irreducible region.
25246 2021-05-10 Richard Biener <rguenther@suse.de>
25248 PR middle-end/100464
25250 * gimple-fold.c (canonicalize_constructor_val): Do not set
25253 2021-05-10 Richard Biener <rguenther@suse.de>
25255 PR tree-optimization/100434
25256 * tree-ssa-dse.c (initialize_ao_ref_for_dse): Handle
25258 (dse_optimize_stmt): Handle call LHS by dropping the
25259 LHS or the whole call if it doesn't have other
25261 (pass_dse::execute): Adjust.
25263 2021-05-10 Martin Liska <mliska@suse.cz>
25265 * Makefile.in: Add missing genversion rule.
25267 2021-05-10 Alex Coplan <alex.coplan@arm.com>
25270 * config/arm/mve.md (*mve_mov<mode>): Simplify output code. Use
25271 vldrw.u32 and vstrw.32 for V2D[IF]mode loads and stores.
25273 2021-05-10 Martin Liska <mliska@suse.cz>
25275 * builtins.c (is_builtin_name): Use startswith
25276 function instead of strncmp.
25277 * collect2.c (main): Likewise.
25278 (has_lto_section): Likewise.
25279 (scan_libraries): Likewise.
25280 * coverage.c (coverage_checksum_string): Likewise.
25281 (coverage_init): Likewise.
25282 * dwarf2out.c (is_cxx): Likewise.
25283 (gen_compile_unit_die): Likewise.
25284 * gcc-ar.c (main): Likewise.
25285 * gcc.c (init_spec): Likewise.
25286 (read_specs): Likewise.
25287 (execute): Likewise.
25288 (check_live_switch): Likewise.
25289 * genattrtab.c (write_attr_case): Likewise.
25290 (IS_ATTR_GROUP): Likewise.
25291 * gencfn-macros.c (main): Likewise.
25292 * gengtype.c (type_for_name): Likewise.
25293 (gen_rtx_next): Likewise.
25294 (get_file_langdir): Likewise.
25295 (write_local): Likewise.
25296 * genmatch.c (get_operator): Likewise.
25297 (get_operand_type): Likewise.
25298 (expr::gen_transform): Likewise.
25299 * genoutput.c (validate_optab_operands): Likewise.
25300 * incpath.c (add_sysroot_to_chain): Likewise.
25301 * langhooks.c (lang_GNU_C): Likewise.
25302 (lang_GNU_CXX): Likewise.
25303 (lang_GNU_Fortran): Likewise.
25304 (lang_GNU_OBJC): Likewise.
25305 * lto-wrapper.c (run_gcc): Likewise.
25306 * omp-general.c (omp_max_simt_vf): Likewise.
25307 * omp-low.c (omp_runtime_api_call): Likewise.
25308 * opts-common.c (parse_options_from_collect_gcc_options): Likewise.
25309 * read-rtl-function.c (function_reader::read_rtx_operand_r): Likewise.
25310 * real.c (real_from_string): Likewise.
25311 * selftest.c (assert_str_startswith): Likewise.
25312 * timevar.c (timer::validate_phases): Likewise.
25313 * tree.c (get_file_function_name): Likewise.
25314 * ubsan.c (ubsan_use_new_style_p): Likewise.
25315 * varasm.c (default_function_rodata_section): Likewise.
25316 (incorporeal_function_p): Likewise.
25317 (default_section_type_flags): Likewise.
25318 * system.h (startswith): Define startswith.
25320 2021-05-10 Martin Liska <mliska@suse.cz>
25322 * bitmap.h (class auto_bitmap): Remove
25323 __cplusplus >= 201103.
25324 * config/aarch64/aarch64.c: Likewise.
25325 * gimple-ssa-store-merging.c (store_immediate_info::store_immediate_info):
25327 * sbitmap.h: Likewise.
25329 2021-05-10 Martin Liska <mliska@suse.cz>
25331 * Makefile.in: Rename gcov-iov to genversion and depend
25332 on version.h (instead of gcov-iov.h).
25333 * gcov-io.h: Include version.h instread of gcov-iov.h.
25334 * gengtype-state.c (read_state_version): Likewise.
25335 * gcov-iov.c: Moved to...
25336 * genversion.c: ...here.
25337 * lto-streamer.h (LTO_major_version): Define it with
25339 * version.c: Removed.
25340 * version.h: Removed.
25342 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25344 * config/arc/arc.md (UNSPEC_ARC_DMPYWH): Define.
25345 * config/arc/simdext.md (VCT): Add predicates for iterator
25348 (voptab): Likewise.
25349 (vec_widen_<V_US>mult_hi_v4hi): Change pattern predicate.
25350 (<voptab>v2si3): New patterns.
25352 (reduc_plus_scal_v4hi): Likewise.
25353 (reduc_plus_scal_v2si): Likewise.
25354 (vec_duplicatev2si): Likewise.
25355 (vec_duplicatev4hi): Likewise.
25357 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25359 * config/arc/simdext.md: Format and cleanup file.
25361 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25363 * config/arc/simdext.md (movmisalignv2hi): Allow misaligned access
25364 only when munaligned-access option is on.
25365 (movmisalign<mode>): Likewise.
25367 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25369 * common/config/arc/arc-common.c (arc_handle_option): Remove dot
25371 * config/arc/arc.c (arc_reorg): Remove underscore from string.
25373 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25375 * config/arc/arc.h (CLZ_DEFINED_VALUE_AT_ZERO): Define.
25376 (CTZ_DEFINED_VALUE_AT_ZERO): Likewise.
25377 * config/arc/arc.md (clrsbsi2): Cleanup pattern.
25378 (norm_f): Likewise.
25381 (clzsi2): Use fls instruction when available.
25382 (arc_clzsi2): Likewise.
25384 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25386 * config/arc/arc.h (ADDITIONAL_REGISTER_NAMES): Add r26 and r27.
25388 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25390 * doc/extend.texi (__builtin_arc_sr): Swap arguments.
25392 2021-05-10 Bernd Edlinger <bernd.edlinger@hotmail.de>
25394 PR middle-end/100467
25395 * toplev.c (compile_file): Call insn_locations_init before
25396 targetm.asm_out.code_end.
25398 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25401 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25403 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
25405 2021-05-07 Jakub Jelinek <jakub@redhat.com>
25406 Andrew Stubbs <amd@codesourcery.com>
25409 * builtins.c (try_store_by_multiple_pieces): Use force_operand for
25410 emit_move_insn operands.
25412 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
25414 * cfgexpand.c (expand_gimple_basic_block): Do not inherit a current
25415 location for the outgoing edges of an empty block.
25416 * dwarf2out.c (add_subscript_info): Retrieve the bounds and index
25417 type by means of the get_array_descr_info langhook, if it is set and
25418 returns true. Remove obsolete code dealing with unnamed subtypes.
25420 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25422 * gimple-range-cache.cc (ssa_block_ranges): Virtualize.
25423 (sbr_vector): Renamed from ssa_block_cache.
25424 (sbr_vector::sbr_vector): Allocate from obstack abd initialize.
25425 (ssa_block_ranges::~ssa_block_ranges): Remove.
25426 (sbr_vector::set_bb_range): Use varying and undefined cached values.
25427 (ssa_block_ranges::set_bb_varying): Remove.
25428 (sbr_vector::get_bb_range): Adjust assert.
25429 (sbr_vector::bb_range_p): Adjust assert.
25430 (~block_range_cache): No freeing loop required.
25431 (block_range_cache::get_block_ranges): Remove.
25432 (block_range_cache::set_bb_range): Inline get_block_ranges.
25433 (block_range_cache::set_bb_varying): Remove.
25434 * gimple-range-cache.h (set_bb_varying): Remove prototype.
25435 * value-range.h (irange_allocator::get_memory): New.
25437 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25439 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Search
25440 dominator tree is available and requested.
25441 (ranger_cache::ssa_range_in_bb): Don't search dom tree here.
25442 (ranger_cache::fill_block_cache): Don't search dom tree here either.
25443 * gimple-range-cache.h (non_null_deref_p): Add dom_search param.
25445 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25447 * gimple-range.cc (gimple_ranger::range_on_exit): Handle block with
25448 only PHI nodes better.
25450 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25452 * gimple-range-edge.h (gimple_outgoing_range): Rename from
25454 (gcond_edge_range): Export prototype.
25455 * gimple-range-edge.cc (gcond_edge_range): New.
25456 (gimple_outgoing_range::edge_range_p): Use gcond_edge_range.
25457 * gimple-range-gori.h (gori_compute): Use gimple_outgoing_range.
25459 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25461 * gimple-range-edge.cc (outgoing_range::calc_switch_ranges): Compute
25462 default range into a temp and allocate only what is needed.
25464 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25466 * range-op.cc (operator_trunc_mod::wi_fold): x % 0 is UNDEFINED.
25468 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25470 * gimple-range.h (gimple_range_global): Pick up parameter initial
25471 values, and use-before defined locals are UNDEFINED.
25473 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
25475 * doc/extend.texi (scalar_storage_order): Mention effect on pointer
25477 * tree.h (reverse_storage_order_for_component_p): Return false if
25478 the type is a pointer.
25480 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25482 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
25484 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
25487 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
25488 Handle V8QI, V4HI and V2SI modes.
25489 * config/i386/mmx.md (mmx_pblendvb): New insn pattern.
25490 * config/i386/sse.md (unspec): Move UNSPEC_BLENDV ...
25491 * config/i386/i386.md (unspec): ... here.
25493 2021-05-07 Tobias Burnus <tobias@codesourcery.com>
25494 Tom de Vries <tdevries@suse.de>
25496 * omp-low.c (lower_rec_simd_input_clauses): Set max_vf = 1 if
25497 a truth_value_p reduction variable is nonintegral.
25499 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
25502 * config/i386/i386-expand.c (ix86_use_mask_cmp_p):
25503 Return false for mode sizes < 16.
25505 2021-05-07 Jakub Jelinek <jakub@redhat.com>
25508 * config/i386/mmx.md (*xop_pcmov_<mode>): New define_insn.
25510 2021-05-06 Martin Jambor <mjambor@suse.cz>
25512 * ipa-sra.c (ipa_sra_dump_all_summaries): Dump edge summaries even
25513 when there is no function summary.
25514 (ipa_sra_summarize_function): produce edge summaries even when
25517 2021-05-06 Tom Tromey <tom@tromey.com>
25519 * godump.c (string_hash_eq): Remove.
25520 (go_finish): Use htab_eq_string.
25522 2021-05-06 Tom Tromey <tom@tromey.com>
25524 * gengtype-state.c (read_state): Use htab_eq_string.
25525 (string_eq): Remove.
25527 2021-05-06 Tom Tromey <tom@tromey.com>
25529 * gensupport.c (htab_eq_string): Remove.
25531 2021-05-06 Bernd Edlinger <bernd.edlinger@hotmail.de>
25534 * debug.h (gcc_debug_hooks): Add set_ignored_loc function pointer.
25535 * dwarf2out.h (dw_fde_node::ignored_debug): New data item.
25536 * dbxout.c (dbx_debug_hooks, xcoff_debug_hooks): Add dummy
25537 set_ignored_loc callbacks.
25538 * debug.c (do_nothing_debug_hooks): Likewise.
25539 * vmsdbgout.c (vmsdbg_debug_hooks): Likewise.
25540 * dwarf2out.c (text_section_used, cold_text_section_used): Remove.
25541 (in_text_section_p, last_text_label, last_cold_label,
25542 switch_text_ranges, switch_cold_ranges): New data items.
25543 (dwarf2out_note_section_used): Remove.
25544 (dwarf2out_begin_prologue): Set fde->ignored_debug and
25546 (mark_ignored_debug_section): New helper function.
25547 (dwarf2out_end_epilogue, dwarf2out_switch_text_section): Call
25548 mark_ignored_debug_section.
25549 (dwarf2_debug_hooks): Use dwarf2out_set_ignored_loc.
25550 (dwarf2_lineno_debug_hooks): Use dummy for set_ignored_loc.
25551 (size_of_aranges): Adjust formula for multi-part text ranges size.
25552 (output_aranges): Output multi-part text ranges.
25553 (dwarf2out_set_ignored_loc): New callback function.
25554 (dwarf2out_finish): Output multi-part text ranges.
25555 (dwarf2out_c_finalize): Clear new data items.
25556 * final.c (final_start_function_1): Call set_ignored_loc callback.
25557 (final_scan_insn_1): Likewise.
25558 * ggc-page.c (gt_ggc_mx): New helper function.
25559 * stringpool.c (gt_pch_nx): Likewise.
25561 2021-05-06 Richard Biener <rguenther@suse.de>
25563 * timevar.def (TV_TREE_INSERT_PHI_NODES): Remove.
25564 (TV_TREE_SSA_REWRITE_BLOCKS): Likewise.
25565 (TV_TREE_INTO_SSA): New.
25566 * tree-into-ssa.c (insert_phi_nodes): Do not account separately.
25567 (rewrite_blocks): Likewise.
25568 (pass_data_build_ssa): Account to TV_TREE_INTO_SSA.
25570 2021-05-06 Jakub Jelinek <jakub@redhat.com>
25572 * tree-ssa-phiopt.c (value_replacement, minmax_replacement,
25573 abs_replacement, xor_replacement,
25574 cond_removal_in_popcount_clz_ctz_pattern,
25575 replace_phi_edge_with_variable): Change type of phi argument from
25576 gimple * to gphi *.
25578 2021-05-06 Richard Biener <rguenther@suse.de>
25580 * tree-ssa-loop-split.c (split_loop): Delay updating SSA form.
25581 Output an opt-info message.
25582 (do_split_loop_on_cond): Likewise.
25583 (tree_ssa_split_loops): Update SSA form here.
25585 2021-05-06 Richard Biener <rguenther@suse.de>
25587 * tree-inline.c (tree_function_versioning): Fix DECL_BY_REFERENCE
25588 return variable removal.
25590 2021-05-06 Marius Hillenbrand <mhillen@linux.ibm.com>
25592 * config/s390/s390-builtins.def (O_M5, O1_M5, ...): Remove unused macros.
25593 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
25594 (s390_vec_permi_dbl, s390_vpdi): Use the O3_U2 type for the immediate
25596 * config/s390/s390.c (s390_const_operand_ok): Remove unused
25599 2021-05-06 Jakub Jelinek <jakub@redhat.com>
25601 PR tree-optimization/94589
25602 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Call
25603 spaceship_replacement.
25604 (cond_only_block_p, spaceship_replacement): New functions.
25606 2021-05-06 Richard Biener <rguenther@suse.de>
25609 * tree-emutls.c (gen_emutls_addr): Pass in whether we're
25610 dealing with a debug use and only query existing addresses
25612 (lower_emutls_1): Avoid splitting out addresses for debug
25613 stmts, reset the debug stmt when we fail to find existing
25615 (lower_emutls_phi_arg): Set wi.stmt.
25617 2021-05-06 Christoph Muellner <cmuellner@gcc.gnu.org>
25620 * config/riscv/riscv.c (riscv_block_move_loop): Use cbranch helper.
25621 * config/riscv/riscv.md (cbranch<mode>4): Generate helpers.
25622 (stack_protect_test): Use cbranch helper.
25624 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25627 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
25628 always return the establisher frame for __builtin_frame_address (0).
25630 2021-05-05 Ivan Sorokin <vanyacpp@gmail.com>
25633 * config/i386/i386-builtins.c (ix86_cpu_model_type_node): New.
25634 (ix86_cpu_model_var): Likewise.
25635 (ix86_cpu_features2_type_node): Likewise.
25636 (ix86_cpu_features2_var): Likewise.
25637 (fold_builtin_cpu): Cache __cpu_model and __cpu_features2 with
25640 2021-05-05 Martin Sebor <msebor@redhat.com>
25642 * passes.def (pass_warn_printf): Run after SSA.
25644 2021-05-05 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25646 * config/arm/neon.md (neon_vtst_combine<mode>): New pattern.
25647 * config/arm/predicates.md (minus_one_operand): New predicate.
25649 2021-05-05 Jeff Law <jlaw@tachyum.com>
25651 * config/avr/avr.md: Remove references to CC_STATUS_INIT.
25653 2021-05-05 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
25655 PR rtl-optimization/100263
25656 * postreload.c (move2add_valid_value_p): Ensure register can
25659 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25661 PR rtl-optimization/100411
25662 * cfgcleanup.c (try_crossjump_to_edge): Also skip end of prologue
25663 and beginning of function markers.
25665 2021-05-05 Jeff Law <jlaw@tachyum.com>
25667 * config/cr16/cr16.h (NOTICE_UPDATE_CC): Remove.
25668 * config/cr16/cr16.c (notice_update_cc): Remove.
25669 * config/cr16/cr16-protos.h (notice_update_cc): Remove.
25671 2021-05-05 Uroš Bizjak <ubizjak@gmail.com>
25674 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
25675 Handle V8QI, V4HI and V2SI modes.
25676 * config/i386/i386.c (ix86_build_const_vector): Handle V2SImode.
25677 (ix86_build_signbit_mask): Ditto.
25678 * config/i386/mmx.md (MMXMODE14): New mode iterator.
25679 (<smaxmin:code><MMXMODE14:mode>3): New expander.
25680 (*mmx_<smaxmin:code><MMXMODE14:mode>3): New insn pattern.
25681 (<umaxmin:code><MMXMODE24:mode>3): New expander.
25682 (*mmx_<umaxmin:code><MMXMODE24:mode>3): New insn pattern.
25683 (vec_cmp<MMXMODEI:mode><MMXMODEI:mode>): New expander.
25684 (vec_cmpu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25685 (vcond<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25686 (vcondu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25687 (vcond_mask_<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25689 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25691 * dwarf2out.c (loc_list_from_tree_1) <DECL>: During early DWARF, do
25692 not expand the VALUE_EXPR of variables put in the non-local frame.
25693 * gimplify.c (gimplify_type_sizes) <RECORD_TYPE>: If the type is not
25694 to be ignored for debug info, ensure its variable offsets are not.
25696 2021-05-05 Richard Biener <rguenther@suse.de>
25698 PR tree-optimization/79333
25699 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
25700 Fold stmt following SSA edges.
25702 2021-05-05 Richard Biener <rguenther@suse.de>
25704 PR middle-end/100394
25705 * calls.c (expand_call): Preserve possibly throwing calls.
25706 * cfgexpand.c (expand_call_stmt): When a call can throw signal
25707 RTL expansion there are side-effects.
25708 * tree-ssa-dce.c (mark_stmt_if_obviously_necessary): Simplify,
25709 mark all possibly throwing stmts necessary unless we can elide
25711 * tree-ssa-dse.c (pass_dse::execute): Preserve exceptions unless
25712 -fdelete-dead-exceptions.
25713 * tree.h (DECL_PURE_P): Add note about exceptions.
25715 2021-05-05 Alexandre Oliva <oliva@adacore.com>
25717 * config/i386/vxworks.h (DBX_REGISTER_NUMBER): Make it
25720 2021-05-04 David Edelsohn <dje.gcc@gmail.com>
25722 * config/rs6000/rs6000-call.c (rs6000_output_mi_thunk): Use
25723 get_fnname_from_decl for name of thunk.
25724 * config/rs6000/rs6000.c (rs6000_declare_alias): Use assemble_name
25725 and ASM_OUTPUT_LABEL.
25726 (rs6000_xcoff_declare_function_name): Use assemble_name and
25728 (rs6000_xcoff_declare_object_name): Use ASM_OUTPUT_LABEL.
25729 (rs6000_xcoff_encode_section_info): Don't add mapping class
25730 for aliases. Always add [DS] mapping class to primary
25732 (rs6000_asm_weaken_decl): Don't explicitly add [DS].
25734 2021-05-04 Martin Sebor <msebor@redhat.com>
25736 PR middle-end/100307
25737 * builtins.c (compute_objsize_r): Clear base0 for pointers.
25739 2021-05-04 Jeff Law <jlaw@tachyum.com>
25741 * config/bfin/bfin.h (NOTICE_UPDATE_CC): Remove.
25743 2021-05-04 Segher Boessenkool <segher@kernel.crashing.org>
25745 * caller-save.c: Remove CC0.
25746 * cfgcleanup.c: Remove CC0.
25747 * cfgrtl.c: Remove CC0.
25748 * combine.c: Remove CC0.
25749 * compare-elim.c: Remove CC0.
25750 * conditions.h: Remove CC0.
25751 * config/h8300/h8300.h: Remove CC0.
25752 * config/h8300/h8300-protos.h: Remove CC0.
25753 * config/h8300/peepholes.md: Remove CC0.
25754 * config/i386/x86-tune-sched.c: Remove CC0.
25755 * config/m68k/m68k.c: Remove CC0.
25756 * config/rl78/rl78.c: Remove CC0.
25757 * config/sparc/sparc.c: Remove CC0.
25758 * config/xtensa/xtensa.c: Remove CC0.
25759 (gen_conditional_move): Use pc_rtx instead of cc0_rtx in a piece of
25760 RTL where that is used as a placeholder only.
25761 * cprop.c: Remove CC0.
25762 * cse.c: Remove CC0.
25763 * cselib.c: Remove CC0.
25764 * df-problems.c: Remove CC0.
25765 * df-scan.c: Remove CC0.
25766 * doc/md.texi: Remove CC0. Adjust an example.
25767 * doc/rtl.texi: Remove CC0. Adjust an example.
25768 * doc/tm.texi: Regenerate.
25769 * doc/tm.texi.in: Remove CC0.
25770 * emit-rtl.c: Remove CC0.
25771 * final.c: Remove CC0.
25772 * fwprop.c: Remove CC0.
25773 * gcse-common.c: Remove CC0.
25774 * gcse.c: Remove CC0.
25775 * genattrtab.c: Remove CC0.
25776 * genconfig.c: Remove CC0.
25777 * genemit.c: Remove CC0.
25778 * genextract.c: Remove CC0.
25779 * gengenrtl.c: Remove CC0.
25780 * genrecog.c: Remove CC0.
25781 * haifa-sched.c: Remove CC0.
25782 * ifcvt.c: Remove CC0.
25783 * ira-costs.c: Remove CC0.
25784 * ira.c: Remove CC0.
25785 * jump.c: Remove CC0.
25786 * loop-invariant.c: Remove CC0.
25787 * lra-constraints.c: Remove CC0.
25788 * lra-eliminations.c: Remove CC0.
25789 * optabs.c: Remove CC0.
25790 * postreload-gcse.c: Remove CC0.
25791 * postreload.c: Remove CC0.
25792 * print-rtl.c: Remove CC0.
25793 * read-rtl-function.c: Remove CC0.
25794 * reg-notes.def: Remove CC0.
25795 * reg-stack.c: Remove CC0.
25796 * reginfo.c: Remove CC0.
25797 * regrename.c: Remove CC0.
25798 * reload.c: Remove CC0.
25799 * reload1.c: Remove CC0.
25800 * reorg.c: Remove CC0.
25801 * resource.c: Remove CC0.
25802 * rtl.c: Remove CC0.
25803 * rtl.def: Remove CC0.
25804 * rtl.h: Remove CC0.
25805 * rtlanal.c: Remove CC0.
25806 * sched-deps.c: Remove CC0.
25807 * sched-rgn.c: Remove CC0.
25808 * shrink-wrap.c: Remove CC0.
25809 * simplify-rtx.c: Remove CC0.
25810 * system.h: Remove CC0. Poison NOTICE_UPDATE_CC, CC_STATUS_MDEP_INIT,
25811 CC_STATUS_MDEP, and CC_STATUS.
25812 * target.def: Remove CC0.
25813 * valtrack.c: Remove CC0.
25814 * var-tracking.c: Remove CC0.
25816 2021-05-04 Richard Biener <rguenther@suse.de>
25818 PR tree-optimization/100414
25819 * tree-ssa-phiopt.c (get_non_trapping): Do not compute dominance
25821 (tree_ssa_phiopt_worker): But unconditionally here.
25823 2021-05-04 Tobias Burnus <tobias@codesourcery.com>
25825 * omp-low.c (lower_rec_input_clauses, lower_reduction_clauses): Handle
25826 && and || with floating-point and complex arguments.
25828 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
25830 * tree-inline.c (insert_debug_decl_map): Delete.
25831 (copy_debug_stmt): Minor tweak.
25832 (setup_one_parameter): Do not use a variable if the value is either
25833 a read-only DECL or a non-addressable local variable in the caller.
25834 In this case, insert the debug-only variable in the map manually.
25835 (expand_call_inline): Do not generate a CLOBBER for these values.
25836 * tree-inline.h (debug_map): Minor tweak.
25838 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
25840 * builtins.c (builtin_with_linkage_p): Return true for stp[n]cpy.
25841 * symtab.c (symtab_node::output_to_lto_symbol_table_p): Tidy up.
25843 2021-05-04 Richard Biener <rguenther@suse.de>
25845 PR tree-optimization/100329
25846 * tree-ssa-reassoc.c (can_reassociate_p): Do not reassociate
25848 (insert_stmt_after): Assert we're not running into asm goto.
25850 2021-05-04 Richard Biener <rguenther@suse.de>
25852 PR tree-optimization/100398
25853 * tree-ssa-dse.c (pass_dse::execute): Preserve control
25856 2021-05-04 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25858 * builtins.c (try_store_by_multiple_pieces): Fix constfun's prototype.
25860 2021-05-04 Alexandre Oliva <oliva@adacore.com>
25862 * builtins.c (try_store_by_multiple_pieces): New.
25863 (expand_builtin_memset_args): Use it. If target_char_cast
25864 fails, proceed as for non-constant val. Pass len's ctz to...
25865 * expr.c (clear_storage_hints): ... this. Try store by
25866 multiple pieces after setmem.
25867 (clear_storage): Adjust.
25868 * expr.h (clear_storage_hints): Likewise.
25869 (try_store_by_multiple_pieces): Declare.
25870 * passes.def: Replace the last copy_prop with ccp.
25872 2021-05-03 Tom de Vries <tdevries@suse.de>
25875 * omp-low.c (lower_rec_input_clauses): Disable SIMT for user-defined
25878 2021-05-03 Richard Biener <rguenther@suse.de>
25880 * tree-ssa-dse.c (dse_classify_store): Track two PHI defs.
25882 2021-05-03 Richard Biener <rguenther@suse.de>
25884 * tree-ssa-dse.c: Do not include domwalk.h but cfganal.h.
25885 (dse_dom_walker): Remove.
25886 (dse_dom_walker::dse_optimize_stmt): Rename...
25887 (dse_optimize_stmt): ... to this, pass in live_bytes sbitmap.
25888 (dse_dom_walker::before_dom_children): Inline ...
25889 (pass_dse::execute): ... here. Perform a reverse program
25892 2021-05-03 H.J. Lu <hjl.tools@gmail.com>
25895 * configure: Regenerated.
25897 2021-05-03 Ilya Leoshkevich <iii@linux.ibm.com>
25900 * config/s390/s390.c (s390_hard_fp_reg_p): New function.
25901 (s390_md_asm_adjust): Handle hard registers.
25903 2021-05-03 Jakub Jelinek <jakub@redhat.com>
25905 PR tree-optimization/100382
25906 * tree-ssa-dse.c: Include tree-eh.h.
25907 (dse_dom_walker::before_dom_children): Don't remove stmts if
25908 stmt_unremovable_because_of_non_call_eh_p is true.
25910 2021-05-02 David Edelsohn <dje.gcc@gmail.com>
25912 * varasm.c (compute_reloc_for_var): Split out from...
25913 (get_variable_section): Use it.
25914 * output.h (compute_reloc_for_var): Declare.
25915 * config/rs6000/rs6000-protos.h
25916 (rs6000_xcoff_asm_output_aligned_decl_common): Change alignment to
25918 * config/rs6000/rs6000.c (rs6000_legitimize_tls_address_aix):
25919 Don't append storage mapping class to symbol.
25920 (rs6000_xcoff_asm_named_section): Add BS and UL mapping classes.
25921 Don't convert TLS BSS to common.
25922 (rs6000_xcoff_unique_section): Don't fall back to select_secton.
25923 (rs6000_xcoff_section_type_flags): Add SECTION_BSS if DECL is
25925 (rs6000_xcoff_asm_globalize_decl_name): Don't strip storage
25927 (rs6000_xcoff_asm_output_aligned_decl_common): Align is unsigned int.
25928 If align is 0 from TLS class, use the same rules as varasm.c
25929 If not common, switch to BSS section manually.
25930 If common, emit appropriate comm or lcomm directive.
25931 (rs6000_xcoff_encode_section_info): Add logic to append all
25932 storage mapping classes.
25933 (rs6000_asm_weaken_decl): Adjust for qualname symbols.
25934 * config/rs6000/xcoff.h (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
25935 rs6000_xcoff_asm_output_aligned_decl_common.
25936 (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
25937 rs6000_xcoff_asm_output_aligned_decl_common.
25938 (ASM_OUTPUT_TLS_COMMON): Use
25939 rs6000_xcoff_asm_output_aligned_decl_common.
25941 2021-05-02 Jakub Jelinek <jakub@redhat.com>
25944 * config/nvptx/nvptx.c (nvptx_sese_pseudo): Use nullptr instead of 0
25945 as first argument of pseudo_node_t constructors.
25947 2021-05-02 Jakub Jelinek <jakub@redhat.com>
25950 * config/i386/t-i386 (TM_H): Add $(srcdir)/config/i386/i386-isa.def.
25952 2021-05-01 Aldy Hernandez <aldyh@redhat.com>
25954 * value-range.cc (DEFINE_INT_RANGE_GC_STUBS): Remove.
25955 (gt_pch_nx (int_range<1> *&)): New.
25956 (gt_ggc_mx (int_range<1> *&)): New.
25957 * value-range.h (class irange): Add GTY support for
25960 2021-05-01 Geng Qi <gengqi@linux.alibaba.com>
25962 * doc/options.texi (Negative): Change either or to both and.
25964 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25966 * config/aarch64/aarch64-simd-builtins.def: Add
25967 float_ml[as][q]_laneq builtin generator macros.
25968 * config/aarch64/aarch64-simd.md (mul_laneq<mode>3): Define.
25969 (aarch64_float_mla_laneq<mode>): Define.
25970 (aarch64_float_mls_laneq<mode>): Define.
25971 * config/aarch64/arm_neon.h (vmla_laneq_f32): Use RTL builtin
25972 instead of GCC vector extensions.
25973 (vmlaq_laneq_f32): Likewise.
25974 (vmls_laneq_f32): Likewise.
25975 (vmlsq_laneq_f32): Likewise.
25977 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25979 * config/aarch64/aarch64-simd-builtins.def: Add
25980 float_ml[as]_lane builtin generator macros.
25981 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt<mode>):
25983 (mul_lane<mode>3): This, and re-order arguments.
25984 (aarch64_float_mla_lane<mode>): Define.
25985 (aarch64_float_mls_lane<mode>): Define.
25986 * config/aarch64/arm_neon.h (vmla_lane_f32): Use RTL builtin
25987 instead of GCC vector extensions.
25988 (vmlaq_lane_f32): Likewise.
25989 (vmls_lane_f32): Likewise.
25990 (vmlsq_lane_f32): Likewise.
25992 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25994 * config/aarch64/aarch64-simd-builtins.def: Add float_ml[as]
25995 builtin generator macros.
25996 * config/aarch64/aarch64-simd.md (aarch64_float_mla<mode>):
25998 (aarch64_float_mls<mode>): Define.
25999 * config/aarch64/arm_neon.h (vmla_f32): Use RTL builtin
26000 instead of relying on GCC vector extensions.
26001 (vmla_f64): Likewise.
26002 (vmlaq_f32): Likewise.
26003 (vmlaq_f64): Likewise.
26004 (vmls_f32): Likewise.
26005 (vmls_f64): Likewise.
26006 (vmlsq_f32): Likewise.
26007 (vmlsq_f64): Likewise.
26008 * config/aarch64/iterators.md: Define VDQF_DF mode iterator.
26010 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
26012 * config/aarch64/aarch64-simd-builtins.def: Add
26013 float_ml[as]_n_builtin generator macros.
26014 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt_from_dup<mode>):
26016 (mul_n<mode>3): This, and re-order arguments.
26017 (aarch64_float_mla_n<mode>): Define.
26018 (aarch64_float_mls_n<mode>): Define.
26019 * config/aarch64/arm_neon.h (vmla_n_f32): Use RTL builtin
26020 instead of inline asm.
26021 (vmlaq_n_f32): Likewise.
26022 (vmls_n_f32): Likewise.
26023 (vmlsq_n_f32): Likewise.
26025 2021-04-30 Jonathan Wright <joanthan.wright@arm.com>
26027 * config/aarch64/aarch64-simd-builtins.def: Add pmull[2]
26028 builtin generator macros.
26029 * config/aarch64/aarch64-simd.md (aarch64_pmullv8qi): Define.
26030 (aarch64_pmull_hiv16qi_insn): Define.
26031 (aarch64_pmull_hiv16qi): Define.
26032 * config/aarch64/arm_neon.h (vmull_high_p8): Use RTL builtin
26033 instead of inline asm.
26034 (vmull_p8): Likewise.
26036 2021-04-30 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
26038 * config/avr/avr.md: Adjust peepholes to match and
26039 generate parallels with clobber of REG_CC.
26040 (mov<mode>_insn): Rename to mov<mode>_insn_split.
26041 (*mov<mode>_insn): Rename to mov<mode>_insn.
26043 2021-04-30 David Edelsohn <dje.gcc@gmail.com>
26045 * varasm.c (use_blocks_for_decl_p): Don't use section anchors
26046 for VAR_DECLs if -fdata-sections enabled.
26048 2021-04-30 Michael Meissner <meissner@linux.ibm.com>
26050 PR bootstrap/100327
26051 * config/rs6000/rs6000.c
26052 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Define.
26053 (rs6000_libgcc_floating_mode_supported_p): New target hook.
26055 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
26057 * tree-ssa-threadbackward.c (class thread_jumps): Split out code
26059 (class back_threader_registry): ...to here...
26060 (class back_threader_profitability): ...and here...
26061 (thread_jumps::thread_through_all_blocks): Remove argument.
26062 (back_threader_registry::back_threader_registry): New.
26063 (back_threader_registry::~back_threader_registry): New.
26064 (back_threader_registry::thread_through_all_blocks): New.
26065 (thread_jumps::profitable_jump_thread_path): Move from here...
26066 (back_threader_profitability::profitable_path_p): ...to here.
26067 (thread_jumps::find_taken_edge): New.
26068 (thread_jumps::convert_and_register_current_path): Move...
26069 (back_threader_registry::register_path): ...to here.
26070 (thread_jumps::register_jump_thread_path_if_profitable): Move...
26071 (thread_jumps::maybe_register_path): ...to here.
26072 (thread_jumps::handle_phi): Call find_taken_edge and
26073 maybe_register_path.
26074 (thread_jumps::handle_assignment): Same.
26075 (thread_jumps::fsm_find_control_statement_thread_paths): Remove
26076 tree argument to handle_phi and handle_assignment.
26077 (thread_jumps::find_jump_threads_backwards): Set m_name. Remove
26078 set of m_speed_p and m_max_threaded_paths.
26079 (pass_thread_jumps::execute): Remove second argument from
26080 find_jump_threads_backwards.
26081 (pass_early_thread_jumps::execute): Same.
26083 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
26085 * tree-ssa-dom.c (class dom_jump_threader_simplifier): New.
26086 (class dom_opt_dom_walker): Initialize some class variables.
26087 (pass_dominator::execute): Pass evrp_range_analyzer and
26088 dom_jump_threader_simplifier to dom_opt_dom_walker.
26089 Adjust for some functions moving into classes.
26090 (simplify_stmt_for_jump_threading): Adjust and move to...
26091 (jump_threader_simplifier::simplify): ...here.
26092 (dom_opt_dom_walker::before_dom_children): Adjust for
26093 m_evrp_range_analyzer.
26094 (dom_opt_dom_walker::after_dom_children): Remove x_vr_values hack.
26095 (test_for_singularity): Place in dom_opt_dom_walker class.
26096 (dom_opt_dom_walker::optimize_stmt): The argument
26097 evrp_range_analyzer is now a class field.
26098 * tree-ssa-threadbackward.c (class thread_jumps): Add m_registry.
26099 (thread_jumps::thread_through_all_blocks): New.
26100 (thread_jumps::convert_and_register_current_path): Use m_registry.
26101 (pass_thread_jumps::execute): Adjust for thread_through_all_blocks
26102 being in the threader class.
26103 (pass_early_thread_jumps::execute): Same.
26104 * tree-ssa-threadedge.c (threadedge_initialize_values): Move...
26105 (jump_threader::jump_threader): ...here.
26106 (threadedge_finalize_values): Move...
26107 (jump_threader::~jump_threader): ...here.
26108 (jump_threader::remove_jump_threads_including): New.
26109 (jump_threader::thread_through_all_blocks): New.
26110 (record_temporary_equivalences_from_phis): Move...
26111 (jump_threader::record_temporary_equivalences_from_phis): ...here.
26112 (record_temporary_equivalences_from_stmts_at_dest): Move...
26113 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
26115 (simplify_control_stmt_condition_1): Move to jump_threader class.
26116 (simplify_control_stmt_condition): Move...
26117 (jump_threader::simplify_control_stmt_condition): ...here.
26118 (thread_around_empty_blocks): Move...
26119 (jump_threader::thread_around_empty_blocks): ...here.
26120 (thread_through_normal_block): Move...
26121 (jump_threader::thread_through_normal_block): ...here.
26122 (thread_across_edge): Move...
26123 (jump_threader::thread_across_edge): ...here.
26124 (thread_outgoing_edges): Move...
26125 (jump_threader::thread_outgoing_edges): ...here.
26126 * tree-ssa-threadedge.h: Move externally facing functings...
26127 (class jump_threader): ...here...
26128 (class jump_threader_simplifier): ...and here.
26129 * tree-ssa-threadupdate.c (struct redirection_data): Remove comment.
26130 (jump_thread_path_allocator::jump_thread_path_allocator): New.
26131 (jump_thread_path_allocator::~jump_thread_path_allocator): New.
26132 (jump_thread_path_allocator::allocate_thread_edge): New.
26133 (jump_thread_path_allocator::allocate_thread_path): New.
26134 (jump_thread_path_registry::jump_thread_path_registry): New.
26135 (jump_thread_path_registry::~jump_thread_path_registry): New.
26136 (jump_thread_path_registry::allocate_thread_edge): New.
26137 (jump_thread_path_registry::allocate_thread_path): New.
26138 (dump_jump_thread_path): Make extern.
26139 (debug (const vec<jump_thread_edge *> &path)): New.
26140 (struct removed_edges): Move to tree-ssa-threadupdate.h.
26141 (struct thread_stats_d): Remove.
26142 (remove_ctrl_stmt_and_useless_edges): Make static.
26143 (lookup_redirection_data): Move...
26144 (jump_thread_path_registry::lookup_redirection_data): ...here.
26145 (ssa_redirect_edges): Make static.
26146 (thread_block_1): Move...
26147 (jump_thread_path_registry::thread_block_1): ...here.
26148 (thread_block): Move...
26149 (jump_thread_path_registry::thread_block): ...here.
26150 (thread_through_loop_header): Move...
26151 (jump_thread_path_registry::thread_through_loop_header): ...here.
26152 (mark_threaded_blocks): Move...
26153 (jump_thread_path_registry::mark_threaded_blocks): ...here.
26154 (debug_path): Move...
26155 (jump_thread_path_registry::debug_path): ...here.
26156 (debug_all_paths): Move...
26157 (jump_thread_path_registry::dump): ..here.
26158 (rewire_first_differing_edge): Move...
26159 (jump_thread_path_registry::rewire_first_differing_edge): ...here.
26160 (adjust_paths_after_duplication): Move...
26161 (jump_thread_path_registry::adjust_paths_after_duplication): ...here.
26162 (duplicate_thread_path): Move...
26163 (jump_thread_path_registry::duplicate_thread_path): ..here.
26164 (remove_jump_threads_including): Move...
26165 (jump_thread_path_registry::remove_jump_threads_including): ...here.
26166 (thread_through_all_blocks): Move to...
26167 (jump_thread_path_registry::thread_through_all_blocks): ...here.
26168 (delete_jump_thread_path): Remove.
26169 (register_jump_thread): Move...
26170 (jump_thread_path_registry::register_jump_thread): ...here.
26171 * tree-ssa-threadupdate.h: Move externally facing functions...
26172 (class jump_thread_path_allocator): ...here...
26173 (class jump_thread_path_registry): ...and here.
26174 (thread_through_all_blocks): Remove.
26175 (struct removed_edges): New.
26176 (register_jump_thread): Remove.
26177 (remove_jump_threads_including): Remove.
26178 (delete_jump_thread_path): Remove.
26179 (remove_ctrl_stmt_and_useless_edges): Remove.
26180 (free_dom_edge_info): New prototype.
26181 * tree-vrp.c: Remove x_vr_values hack.
26182 (class vrp_jump_threader_simplifier): New.
26183 (vrp_jump_threader_simplifier::simplify): New.
26184 (vrp_jump_threader::vrp_jump_threader): Adjust method signature.
26185 Remove m_dummy_cond.
26186 Instantiate m_simplifier and m_threader.
26187 (vrp_jump_threader::thread_through_all_blocks): New.
26188 (vrp_jump_threader::simplify_stmt): Remove.
26189 (vrp_jump_threader::after_dom_children): Do not set m_dummy_cond.
26190 Remove x_vr_values hack.
26191 (execute_vrp): Adjust for thread_through_all_blocks being in a
26194 2021-04-30 Christophe Lyon <christophe.lyon@linaro.org>
26196 * genflags.c (gen_insn): Print failed expansion string.
26198 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
26200 * expr.c (alignment_for_piecewise_move): Call mode_for_size
26201 without limit to MAX_FIXED_MODE_SIZE.
26203 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
26205 PR middle-end/90773
26206 * builtins.c (builtin_memset_gen_str): Don't use return from
26207 simplify_gen_subreg.
26209 2021-04-30 Uroš Bizjak <ubizjak@gmail.com>
26212 * config/i386/i386.md (*add<mode>3_carry_0r): New insn pattern.
26213 (*addsi3_carry_zext_0r): Ditto.
26214 (*sub<mode>3_carry_0): Ditto.
26215 (*subsi3_carry_zext_0r): Ditto.
26216 * config/i386/predicates.md (ix86_carry_flag_unset_operator):
26218 * config/i386/i386.c (ix86_rtx_costs) <case PLUS, case MINUS>:
26219 Also consider ix86_carry_flag_unset_operator to calculate
26220 the cost of adc/sbb insn.
26222 2021-04-30 Roman Zhuykov <zhroma@ispras.ru>
26224 PR rtl-optimization/100225
26225 PR rtl-optimization/84878
26226 * modulo-sched.c (sms_schedule): Use note_stores to skip loops
26227 where we have an instruction which touches (writes) any hard
26228 register from df->regular_block_artificial_uses set.
26229 Allow not-single-set instruction only right before basic block
26232 2021-04-30 Geng Qi <gengqi@linux.alibaba.com>
26234 * config/riscv/riscv.opt (march=,mabi=): Negative itself.
26236 2021-04-30 LevyHsu <admin@levyhsu.com>
26238 * config/riscv/riscv.c (riscv_min_arithmetic_precision): New.
26239 * config/riscv/riscv.h (TARGET_MIN_ARITHMETIC_PRECISION): New.
26240 * config/riscv/riscv.md (addv<mode>4, uaddv<mode>4): New.
26241 (subv<mode>4, usubv<mode>4, mulv<mode>4, umulv<mode>4): New.
26243 2021-04-29 Alexandre Oliva <oliva@adacore.com>
26245 * config.gcc: Merged x86 and x86_64 cpu_type-setting cases.
26247 2021-04-29 Alexandre Oliva <oliva@adacore.com>
26249 * config/i386/i386.h (ASM_OUTPUT_MAX_SKIP_PAD): Rename to...
26250 (ASM_OUTPUT_MAX_SKIP_ALIGN): ... this. Enclose in do/while(0).
26251 * config/i386/i386.c: Adjust.
26252 * config/i386/i386.md: Adjust.
26253 * config/i386/darwin.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Drop.
26254 * config/i386/dragonfly.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26255 * config/i386/freebsd.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26256 * config/i386/gas.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26257 * config/i386/gnu-user.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26258 * config/i386/iamcu.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26259 * config/i386/lynx.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26260 * config/i386/netbsd-elf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26261 * config/i386/openbsdelf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26262 * config/i386/x86-64.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26263 (ASM_OUTPUT_MAX_SKIP_PAD): Likewise.
26265 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26267 * config/i386/i386-expand.c (ix86_expand_int_compare):
26268 Swap operands of GTU and LEU comparison to emit carry flag comparison.
26269 * config/i386/i386.md (*add<mode>3_carry_0): Change insn
26270 predicate to allow more combine opportunities with memory operands.
26271 (*sub<mode>3_carry_0): Ditto.
26273 2021-04-29 Richard Sandiford <richard.sandiford@arm.com>
26275 PR rtl-optimization/100303
26276 * rtl-ssa/accesses.cc (function_info::make_use_available): Take a
26277 boolean that indicates whether the use will only be used in
26278 debug instructions. Treat it in the same way that existing
26279 cross-EBB debug references would be handled if so.
26280 (function_info::make_uses_available): Likewise.
26281 * rtl-ssa/functions.h (function_info::make_uses_available): Update
26282 prototype accordingly.
26283 (function_info::make_uses_available): Likewise.
26284 * fwprop.c (try_fwprop_subst): Update call accordingly.
26286 2021-04-29 Jeff Law <jlaw@tachyum.com>
26288 * config/nios2/nios2-protos.h (nios2_fpu_insn_enabled): Move outside
26291 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26292 Richard Biener <rguenther@suse.de>
26295 * config/i386/i386-builtin.def (IX86_BUILTIN_MASKLOADPD)
26296 (IX86_BUILTIN_MASKLOADPS, IX86_BUILTIN_MASKLOADPD256)
26297 (IX86_BUILTIN_MASKLOADPS256, IX86_BUILTIN_MASKLOADD)
26298 (IX86_BUILTIN_MASKLOADQ, IX86_BUILTIN_MASKLOADD256)
26299 (IX86_BUILTIN_MASKLOADQ256): Move from SPECIAL_ARGS
26300 to PURE_ARGS category.
26301 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
26302 Handle PURE_ARGS category.
26303 * config/i386/i386-expand.c (ix86_expand_builtin): Ditto.
26305 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
26307 * configure.ac: Check for the presence of sys/locking.h header and
26308 for whether _LK_LOCK is supported by _locking.
26309 * configure: Regenerate.
26310 * config.in: Likewise.
26311 * gcov-io.h: Define GCOV_LOCKED_WITH_LOCKING if HOST_HAS_LK_LOCK.
26312 * gcov-io.c (gcov_open): Add support for GCOV_LOCKED_WITH_LOCKING.
26313 * system.h: Include <sys/locking.h> if HAVE_SYS_LOCKING_H.
26315 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26317 * config/i386/predicates.md (fcmov_comparison_operator):
26318 Do not check for trivial FP comparison operator.
26319 <case GEU, case LTU>: Allow CCGZmode.
26320 <case GTU, case LEU>: Do not allow CCCmode.
26321 (ix86_comparison_operator) <case GTU, case LEU>: Allow only CCmode.
26322 (ix86_carry_flag_operator): Match only LTU and UNLT code.
26323 Do not check for trivial FP comparison operator. Allow CCGZmode.
26325 2021-04-29 Tom de Vries <tdevries@suse.de>
26327 * omp-expand.c (expand_omp_simd): Add step_orig, and replace uses of
26328 fd->loop.step by either step or orig_step.
26330 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
26332 * config/sparc/sparc.c (gen_load_pcrel_sym): Delete.
26333 (load_got_register): Do the PIC dance here.
26334 (sparc_legitimize_tls_address): Simplify.
26335 (sparc_emit_probe_stack_range): Likewise.
26336 (sparc32_initialize_trampoline): Likewise.
26337 (sparc64_initialize_trampoline): Likewise.
26338 * config/sparc/sparc.md (load_pcrel_sym<P:mode>): Add @ marker.
26339 (probe_stack_range<P:mode>): Likewise.
26340 (flush<P:mode>): Likewise.
26341 (tgd_hi22<P:mode>): Likewise.
26342 (tgd_lo10<P:mode>): Likewise.
26343 (tgd_add<P:mode>): Likewise.
26344 (tgd_call<P:mode>): Likewise.
26345 (tldm_hi22<P:mode>): Likewise.
26346 (tldm_lo10<P:mode>): Likewise.
26347 (tldm_add<P:mode>): Likewise.
26348 (tldm_call<P:mode>): Likewise.
26349 (tldo_hix22<P:mode>): Likewise.
26350 (tldo_lox10<P:mode>): Likewise.
26351 (tldo_add<P:mode>): Likewise.
26352 (tie_hi22<P:mode>): Likewise.
26353 (tie_lo10<P:mode>): Likewise.
26354 (tie_add<P:mode>): Likewise.
26355 (tle_hix22<P:mode>): Likewise.
26356 (tle_lox10<P:mode>): Likewise.
26357 (stack_protect_setsi): Rename to...
26358 (stack_protect_set32): ...this.
26359 (stack_protect_setdi): Rename to...
26360 (stack_protect_set64): ...this.
26361 (stack_protect_set): Adjust calls to above.
26362 (stack_protect_testsi): Rename to...
26363 (stack_protect_test32): ...this.
26364 (stack_protect_testdi): Rename to...
26365 (stack_protect_test64): ...this.
26366 (stack_protect_test): Adjust calls to above.
26368 2021-04-29 H.J. Lu <hjl.tools@gmail.com>
26370 PR middle-end/90773
26371 * builtins.c (builtin_memcpy_read_str): Add a dummy argument.
26372 (builtin_strncpy_read_str): Likewise.
26373 (builtin_memset_read_str): Add an argument for the previous RTL
26374 information and generate the new RTL from the previous RTL info.
26375 (builtin_memset_gen_str): Likewise.
26376 * builtins.h (builtin_strncpy_read_str): Update the prototype.
26377 (builtin_memset_read_str): Likewise.
26378 * expr.c (by_pieces_ninsns): If targetm.overlap_op_by_pieces_p()
26379 returns true, round up size and alignment to the widest integer
26380 mode for maximum size.
26381 (pieces_addr::adjust): Add a pointer to by_pieces_prev argument
26382 and pass it to m_constfn.
26383 (op_by_pieces_d): Add m_push and m_overlap_op_by_pieces.
26384 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
26385 initialize m_push. Initialize m_overlap_op_by_pieces with
26386 targetm.overlap_op_by_pieces_p ().
26387 (op_by_pieces_d::run): Pass the previous RTL information to
26388 pieces_addr::adjust and generate overlapping operations if
26389 m_overlap_op_by_pieces is true.
26391 (move_by_pieces_d::move_by_pieces_d): Updated for op_by_pieces_d
26393 (store_by_pieces_d::store_by_pieces_d): Updated for op_by_pieces_d
26395 (can_store_by_pieces): Use by_pieces_constfn on constfun.
26396 (store_by_pieces): Use by_pieces_constfn on constfun. Updated
26397 for op_by_pieces_d change.
26398 (clear_by_pieces_1): Add a dummy argument.
26399 (clear_by_pieces): Updated for op_by_pieces_d change.
26400 (compare_by_pieces_d::compare_by_pieces_d): Likewise.
26401 (string_cst_read_str): Add a dummy argument.
26402 * expr.h (by_pieces_constfn): Add a dummy argument.
26403 (by_pieces_prev): New.
26404 * target.def (overlap_op_by_pieces_p): New target hook.
26405 * config/i386/i386.c (TARGET_OVERLAP_OP_BY_PIECES_P): New.
26406 * doc/tm.texi.in: Add TARGET_OVERLAP_OP_BY_PIECES_P.
26407 * doc/tm.texi: Regenerated.
26409 2021-04-29 Richard Biener <rguenther@suse.de>
26411 PR tree-optimization/100253
26412 * tree-vect-stmts.c (vectorizable_load): Do not assume
26413 element alignment when DR_MISALIGNMENT is -1.
26414 (vectorizable_store): Likewise.
26416 2021-04-29 Jakub Jelinek <jakub@redhat.com>
26419 * config/aarch64/aarch64.c (aarch64_add_offset_1_temporaries): Use
26420 absu_hwi instead of abs_hwi.
26422 2021-04-29 Richard Biener <rguenther@suse.de>
26424 PR middle-end/38474
26425 * tree-ssa-structalias.c (add_graph_edge): Avoid direct
26426 forwarding when indirect forwarding through ESCAPED
26429 2021-04-29 Tom de Vries <tdevries@suse.de>
26432 * internal-fn.c (expand_GOMP_SIMT_ENTER_ALLOC)
26433 (expand_GOMP_SIMT_LAST_LANE, expand_GOMP_SIMT_ORDERED_PRED)
26434 (expand_GOMP_SIMT_VOTE_ANY, expand_GOMP_SIMT_XCHG_BFLY)
26435 (expand_GOMP_SIMT_XCHG_IDX): Ensure target is assigned to.
26437 2021-04-29 Richard Biener <rguenther@suse.de>
26439 PR tree-optimization/99912
26440 * tree-ssa-dse.c (dse_dom_walker::m_need_cfg_cleanup): New.
26441 (dse_dom_walker::todo): Likewise.
26442 (dse_dom_walker::dse_optimize_stmt): Move VDEF check to the
26444 (dse_dom_walker::before_dom_children): Remove trivially
26445 dead SSA defs and schedule CFG cleanup if we removed all
26447 (pass_dse::execute): Get TODO as computed by the DOM walker
26448 and return it. Wipe dominator info earlier.
26450 2021-04-29 Richard Biener <rguenther@suse.de>
26453 * ipa-prop.c (ipcp_modif_dom_walker::before_dom_children):
26454 Track blocks to cleanup EH in new m_need_eh_cleanup.
26455 (ipcp_modif_dom_walker::cleanup_eh): New.
26456 (ipcp_transform_function): Release dominator info before
26459 2021-04-29 Martin Sebor <msebor@redhat.com>
26461 PR middle-end/100250
26462 * attribs.c (attr_access::array_as_string): Avoid dereferencing
26463 a pointer when it's null.
26465 2021-04-29 Martin Sebor <msebor@redhat.com>
26467 * Makefile.in (OBJS): Add ipa-free-lang-data.o.
26468 * ipa-free-lang-data.cc: New file.
26469 * tree.c: Move pass free_lang_data to file above.
26470 (build_array_type_1): Declare extern.
26471 * tree.h (build_array_type_1): Declare.
26473 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26475 * config/aarch64/aarch64-simd-builtins.def: Modify comment to
26476 make consistent with updated RTL pattern.
26477 * config/aarch64/aarch64-simd.md (aarch64_<sur>qmovn<mode>):
26478 Implement using ss_truncate and us_truncate rather than
26480 * config/aarch64/iterators.md: Remove redundant unspecs and
26481 iterator: UNSPEC_[SU]QXTN and SUQMOVN respectively.
26483 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26485 * config/aarch64/arm_acle.h (__attribute__): Make intrinsic
26486 attributes consistent with those defined in arm_neon.h.
26488 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26490 * config/aarch64/arm_fp16.h (__attribute__): Make intrinsic
26491 attributes consistent with those defined in arm_neon.h.
26493 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26495 * config/aarch64/aarch64-simd-builtins.def: Add
26496 float_trunc_rodd builtin generator macros.
26497 * config/aarch64/aarch64-simd.md (aarch64_float_trunc_rodd_df):
26499 (aarch64_float_trunc_rodd_lo_v2sf): Define.
26500 (aarch64_float_trunc_rodd_hi_v4sf_le): Define.
26501 (aarch64_float_trunc_rodd_hi_v4sf_be): Define.
26502 (aarch64_float_trunc_rodd_hi_v4sf): Define.
26503 * config/aarch64/arm_neon.h (vcvtx_f32_f64): Use RTL builtin
26504 instead of inline asm.
26505 (vcvtx_high_f32_f64): Likewise.
26506 (vcvtxd_f32_f64): Likewise.
26507 * config/aarch64/iterators.md: Add FCVTXN unspec.
26509 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26511 * config/aarch64/aarch64-simd-builtins.def: Add tbx1 builtin
26513 * config/aarch64/aarch64-simd.md (aarch64_tbx1<mode>):
26515 * config/aarch64/arm_neon.h (vqtbx1_s8): USE RTL builtin
26516 instead of inline asm.
26517 (vqtbx1_u8): Likewise.
26518 (vqtbx1_p8): Likewise.
26519 (vqtbx1q_s8): Likewise.
26520 (vqtbx1q_u8): Likewise.
26521 (vqtbx1q_p8): Likewise.
26522 (vtbx2_s8): Likewise.
26523 (vtbx2_u8): Likewise.
26524 (vtbx2_p8): Likewise.
26526 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26528 * config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
26530 * config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
26531 instead of inline asm.
26532 (vqtbl1_s8): Likewise.
26533 (vqtbl1_u8): Likewise.
26534 (vqtbl1q_p8): Likewise.
26535 (vqtbl1q_s8): Likewise.
26536 (vqtbl1q_u8): Likewise.
26537 (vtbl1_s8): Likewise.
26538 (vtbl1_u8): Likewise.
26539 (vtbl1_p8): Likewise.
26540 (vtbl2_s8): Likewise.
26541 (vtbl2_u8): Likewise.
26542 (vtbl2_p8): Likewise.
26544 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26546 * config/aarch64/aarch64-simd-builtins.def: Add polynomial
26547 ssri_n buitin generator macro.
26548 * config/aarch64/arm_neon.h (vsri_n_p8): Use RTL builtin
26549 instead of inline asm.
26550 (vsri_n_p16): Likewise.
26551 (vsri_n_p64): Likewise.
26552 (vsriq_n_p8): Likewise.
26553 (vsriq_n_p16): Likewise.
26554 (vsriq_n_p64): Likewise.
26556 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26558 * config/aarch64/aarch64-simd-builtins.def: Use VALLP mode
26559 iterator for polynomial ssli_n builtin generator macro.
26560 * config/aarch64/arm_neon.h (vsli_n_p8): Use RTL builtin
26561 instead of inline asm.
26562 (vsli_n_p16): Likewise.
26563 (vsliq_n_p8): Likewise.
26564 (vsliq_n_p16): Likewise.
26565 * config/aarch64/iterators.md: Define VALLP mode iterator.
26567 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26569 * config/aarch64/aarch64-simd-builtins.def: Use VDQV_L
26570 iterator to generate [su]adalp RTL builtins.
26571 * config/aarch64/aarch64-simd.md: Use VDQV_L iterator in
26572 [su]adalp RTL pattern.
26573 * config/aarch64/arm_neon.h (vpadal_s32): Use RTL builtin
26574 instead of inline asm.
26575 (vpadal_u32): Likewise.
26577 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26579 * config/aarch64/aarch64-simd-builtins.def: Add [su]addlp
26580 builtin generator macros.
26581 * config/aarch64/aarch64-simd.md (aarch64_<su>addlp<mode>):
26583 * config/aarch64/arm_neon.h (vpaddl_s8): Use RTL builtin
26584 instead of inline asm.
26585 (vpaddl_s16): Likewise.
26586 (vpaddl_s32): Likewise.
26587 (vpaddl_u8): Likewise.
26588 (vpaddl_u16): Likewise.
26589 (vpaddl_u32): Likewise.
26590 (vpaddlq_s8): Likewise.
26591 (vpaddlq_s16): Likewise.
26592 (vpaddlq_s32): Likewise.
26593 (vpaddlq_u8): Likewise.
26594 (vpaddlq_u16): Likewise.
26595 (vpaddlq_u32): Liwewise.
26596 * config/aarch64/iterators.md: Define [SU]ADDLP unspecs with
26597 appropriate attributes.
26599 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26601 * config/aarch64/aarch64-simd-builtins.def: Use VDQ_I iterator
26602 for aarch64_addp<mode> builtin macro generator.
26603 * config/aarch64/aarch64-simd.md: Use VDQ_I iterator in
26604 aarch64_addp<mode> RTL pattern.
26605 * config/aarch64/arm_neon.h (vpaddq_s8): Use RTL builtin
26606 instead of inline asm.
26607 (vpaddq_s16): Likewise.
26608 (vpaddq_s32): Likewise.
26609 (vpaddq_s64): Likewise.
26610 (vpaddq_u8): Likewise.
26611 (vpaddq_u16): Likewise.
26612 (vpaddq_u32): Likewise.
26613 (vpaddq_u64): Likewise.
26615 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26617 * config/aarch64/aarch64-simd-builtins.def: Add sq[r]dmulh_n
26618 builtin generator macros.
26619 * config/aarch64/aarch64-simd.md (aarch64_sq<r>dmulh_n<mode>):
26621 * config/aarch64/arm_neon.h (vqdmulh_n_s16): Use RTL builtin
26622 instead of inline asm.
26623 (vqdmulh_n_s32): Likewise.
26624 (vqdmulhq_n_s16): Likewise.
26625 (vqdmulhq_n_s32): Likewise.
26626 (vqrdmulh_n_s16): Likewise.
26627 (vqrdmulh_n_s32): Likewise.
26628 (vqrdmulhq_n_s16): Likewise.
26629 (vqrdmulhq_n_s32): Likewise.
26631 2021-04-28 Tobias Burnus <tobias@codesourcery.com>
26633 * doc/install.texi (--enable-offload-defaulted): Document.
26635 2021-04-28 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
26637 * config/avr/avr-dimode.md: Turn existing patterns into
26638 define_insn_and_split style patterns where the splitter
26639 adds a clobber of the condition code register. Drop "cc"
26640 attribute. Add new patterns to match output of
26642 * config/avr/avr-fixed.md: Likewise.
26643 * config/avr/avr.c (cc_reg_rtx): New.
26644 (avr_parallel_insn_from_insns): Adjust insn count
26645 for removal of set of cc0.
26646 (avr_is_casesi_sequence): Likewise.
26647 (avr_casei_sequence_check_operands): Likewise.
26648 (avr_optimize_casesi): Likewise. Also insert
26649 new insns after jump_insn.
26650 (avr_pass_casesi::avr_rest_of_handle_casesi): Adjust
26651 for removal of set of cc0.
26652 (avr_init_expanders): Initialize cc_reg_rtx.
26653 (avr_regno_reg_class): Handle REG_CC.
26654 (cond_string): Remove usage of CC_OVERFLOW_UNUSABLE.
26655 (avr_notice_update_cc): Remove function.
26656 (ret_cond_branch): Remove usage of CC_OVERFLOW_UNUSABLE.
26657 (compare_condition): Adjust for PARALLEL with
26659 (out_shift_with_cnt): Likewise.
26660 (ashlhi3_out): Likewise.
26661 (ashrhi3_out): Likewise.
26662 (lshrhi3_out): Likewise.
26663 (avr_class_max_nregs): Return single reg for REG_CC.
26664 (avr_compare_pattern): Check for REG_CC instead
26666 (avr_reorg_remove_redundant_compare): Likewise.
26667 (avr_reorg):Adjust for PARALLEL with REG_CC clobber.
26668 (avr_hard_regno_nregs): Return single reg for REG_CC.
26669 (avr_hard_regno_mode_ok): Allow only CCmode for REG_CC.
26670 (avr_md_asm_adjust): Clobber REG_CC.
26671 (TARGET_HARD_REGNO_NREGS): Define.
26672 (TARGET_CLASS_MAX_NREGS): Define.
26673 (TARGET_MD_ASM_ADJUST): Define.
26674 * config/avr/avr.h (FIRST_PSEUDO_REGISTER): Adjust
26676 (enum reg_class): Add CC_REG class.
26677 (NOTICE_UPDATE_CC): Remove.
26678 (CC_OVERFLOW_UNUSABLE): Remove.
26679 (CC_NO_CARRY): Remove.
26680 * config/avr/avr.md: Turn existing patterns into
26681 define_insn_and_split style patterns where the splitter
26682 adds a clobber of the condition code register. Drop "cc"
26683 attribute. Add new patterns to match output of
26685 (sez): Remove unused pattern.
26687 2021-04-28 Richard Earnshaw <rearnsha@arm.com>
26690 * config/arm/arm.c (arm_hard_regno_mode_ok): Only allow VPR to be
26693 2021-04-28 Richard Sandiford <richard.sandiford@arm.com>
26696 * config/aarch64/constraints.md (Utq): Require the address to
26697 be valid for both the element mode and for V2DImode.
26699 2021-04-28 Jakub Jelinek <jakub@redhat.com>
26700 Tobias Burnus <tobias@codesourcery.com>
26702 * configure.ac (OFFLOAD_DEFAULTED): AC_DEFINE if offload-defaulted.
26703 * gcc.c (process_command): New variable.
26704 (driver::maybe_putenv_OFFLOAD_TARGETS): If OFFLOAD_DEFAULTED,
26705 set it if -foffload is defaulted.
26706 * lto-wrapper.c (OFFLOAD_TARGET_DEFAULT_ENV): Define.
26707 (compile_offload_image): If OFFLOAD_DEFAULTED and
26708 OFFLOAD_TARGET_DEFAULT is in the environment, don't fail
26709 if corresponding mkoffload can't be found.
26710 (compile_images_for_offload_targets): Likewise. Free and clear
26711 offload_names if no valid offload is found.
26712 * config.in: Regenerate.
26713 * configure: Regenerate.
26715 2021-04-28 Richard Biener <rguenther@suse.de>
26717 PR tree-optimization/100292
26718 * tree-vect-generic.c (expand_vector_condition): Do not fold
26721 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26723 * config/rs6000/aix.h (SUBTARGET_DRIVER_SELF_SPECS): New.
26724 * config/rs6000/aix64.opt (m64): New.
26727 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26729 * config/vax/vax.c (print_operand_address, vax_address_cost_1)
26730 (index_term_p): Handle ASHIFT too.
26732 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26734 * config/vax/builtins.md (jbb<ccss>i<mode>): Remove operand #3.
26735 (sync_lock_test_and_set<mode>): Adjust accordingly.
26736 (sync_lock_release<mode>): Likewise.
26738 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26740 * config/vax/vax-protos.h (adjacent_operands_p): Remove
26742 * config/vax/vax.c (adjacent_operands_p): Remove.
26744 2021-04-27 Maciej W. Rozycki <macro@linux-mips.org>
26746 * ifcvt.c (dead_or_predicable) [!IFCVT_MODIFY_TESTS]: Fall
26747 through to the non-conditional execution case if getting the
26748 condition for conditional execution has failed.
26750 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
26752 PR middle-end/100284
26753 * gimple.c (gimple_could_trap_p_1): Remove VEC_COND_EXPR test.
26754 * tree-eh.c (operation_could_trap_p): Handle VEC_COND_EXPR rather
26755 than asserting on it.
26757 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26759 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): Protect
26760 with TARGET_AIX_OS.
26762 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26765 * calls.c (precompute_register_parameters): Additionally test
26766 targetm.precompute_tls_p to pre-compute argument.
26767 * config/rs6000/aix.h (TARGET_PRECOMPUTE_TLS_P): Define.
26768 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): New.
26769 * target.def (precompute_tls_p): New.
26770 * doc/tm.texi.in (TARGET_PRECOMPUTE_TLS_P): Add hook documentation.
26771 * doc/tm.texi: Regenerated.
26773 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26776 * config/aarch64/aarch64.c (aarch64_print_operand): Cast -UINTVAL
26777 back to HOST_WIDE_INT.
26779 2021-04-27 Bernd Edlinger <bernd.edlinger@hotmail.de>
26782 * simplify-rtx.c (simplify_context::simplify_subreg): Check the
26783 memory alignment for the outer mode.
26785 2021-04-27 H.J. Lu <hjl.tools@gmail.com>
26787 PR middle-end/90773
26788 * expr.c (op_by_pieces_d::get_usable_mode): New member function.
26789 (op_by_pieces_d::run): Cange a while loop to a do-while loop.
26791 2021-04-27 Alex Coplan <alex.coplan@arm.com>
26794 * config/arm/arm.c (arm_split_compare_and_swap): Fix up codegen
26795 with negative immediates: ensure we expand cbranchsi4_scratch
26796 correctly and ensure we satisfy its constraints.
26797 * config/arm/sync.md
26798 (@atomic_compare_and_swap<CCSI:arch><NARROW:mode>_1): Don't
26799 attempt to tie two output operands together with constraints;
26800 collapse two alternatives.
26801 (@atomic_compare_and_swap<CCSI:arch><SIDI:mode>_1): Likewise.
26802 * config/arm/thumb1.md (cbranchsi4_neg_late): New.
26804 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26807 * config/aarch64/predicates.md (aarch64_sub_immediate,
26808 aarch64_plus_immediate): Use -UINTVAL instead of -INTVAL.
26809 * config/aarch64/aarch64.md (casesi, rotl<mode>3): Likewise.
26810 * config/aarch64/aarch64.c (aarch64_print_operand,
26811 aarch64_split_atomic_op, aarch64_expand_subvti): Likewise.
26813 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26815 PR tree-optimization/100239
26816 * tree-vect-generic.c (lower_vec_perm): Don't accept constant
26817 permutations with all indices from the first zero element as vec_shl.
26819 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26821 PR rtl-optimization/100254
26822 * cfgcleanup.c (outgoing_edges_match): Check REG_EH_REGION on
26823 last1 and last2 insns rather than BB_END (bb1) and BB_END (bb2) insns.
26825 2021-04-27 Richard Biener <rguenther@suse.de>
26827 PR tree-optimization/99912
26828 * passes.def: Add comment about new TODO_remove_unused_locals.
26829 * tree-stdarg.c (pass_data_stdarg): Run TODO_remove_unused_locals
26832 2021-04-27 Richard Biener <rguenther@suse.de>
26834 PR tree-optimization/99912
26835 * passes.def (pass_all_optimizations): Add pass_dse before
26836 the first pass_dce, move the first pass_dse before the
26837 pass_dce following pass_pre.
26839 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26841 PR tree-optimization/95527
26842 * generic-match-head.c: Include tm.h.
26843 * gimple-match-head.c: Include tm.h.
26844 * match.pd (CLZ == INTEGER_CST): Don't use
26845 #ifdef CLZ_DEFINED_VALUE_AT_ZERO, only test CLZ_DEFINED_VALUE_AT_ZERO
26846 if clz == CFN_CLZ. Add missing val declaration.
26847 (CTZ cmp CST): New simplifications.
26849 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26851 PR tree-optimization/96696
26852 * expr.c (expand_expr_divmod): New function.
26853 (expand_expr_real_2) <case TRUNC_DIV_EXPR>: Use it for truncations and
26854 divisions. Formatting fixes.
26855 <case MULT_EXPR>: Optimize x / y * y as x - x % y if the latter is
26858 2021-04-27 Martin Jambor <mjambor@suse.cz>
26861 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
26862 If removing a call statement LHS SSA name, release it.
26864 2021-04-27 Richard Earnshaw <rearnsha@arm.com>
26867 * config/arm/arm.c (THUMB2_WORK_REGS): Check PIC_OFFSET_TABLE_REGNUM
26868 is valid before including it in the mask.
26870 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
26873 * config/aarch64/aarch64.c (aarch64_comp_type_attributes): Handle
26876 2021-04-27 Richard Biener <rguenther@suse.de>
26878 PR tree-optimization/100051
26879 * tree-ssa-alias.c (indirect_ref_may_alias_decl_p): Add
26880 disambiguator based on access size vs. decl size.
26882 2021-04-27 Richard Biener <rguenther@suse.de>
26884 PR tree-optimization/100278
26885 * tree-ssa-pre.c (compute_avail): Give up when we cannot
26886 adjust TBAA beacuse of mismatching bases.
26888 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26891 * config/i386/i386.md (*<insn><mode>3_mask, *<insn><mode>3_mask_1):
26892 For any_rotate define_insn_split and following splitters, use
26893 SWI iterator instead of SWI48.
26895 2021-04-27 Richard Biener <rguenther@suse.de>
26897 PR tree-optimization/99776
26898 * match.pd (bit_field_ref (ctor)): Relax element extract
26899 type compatibility checks.
26901 2021-04-27 Cui,Lili <lili.cui@intel.com>
26903 * common/config/i386/i386-common.c (processor_names):
26904 Sync processor_names with processor_type.
26905 * config/i386/i386-options.c (processor_cost_table):
26906 Sync processor_cost_table with processor_type.
26908 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26910 * value-range.cc (irange::irange_set_1bit_anti_range): Add assert.
26911 (irange::set): Call irange_set_1bit_anti_range for handling all
26912 1-bit ranges. Fall through on ~[MIN,MAX].
26914 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26916 * value-range.cc (irange::legacy_num_pairs): Remove.
26917 (irange::invert): Change gcc_assert to gcc_checking_assert.
26918 * value-range.h (irange::num_pairs): Adjust for a cached
26919 num_pairs(). Also, rename all gcc_assert's to
26920 gcc_checking_assert's.
26922 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26924 * value-range.cc (irange::operator=): Set m_kind.
26925 (irange::copy_to_legacy): Handle varying and undefined sources
26926 as a legacy copy since they can be easily copied.
26927 (irange::irange_set): Set m_kind.
26928 (irange::irange_set_anti_range): Same.
26929 (irange::set): Rename normalize_min_max to normalize_kind.
26930 (irange::verify_range): Adjust for multi-ranges having the
26932 (irange::irange_union): Set m_kind.
26933 (irange::irange_intersect): Same.
26934 (irange::invert): Same.
26935 * value-range.h (irange::kind): Always return m_kind.
26936 (irange::varying_p): Rename to...
26937 (irange::varying_comptaible_p): ...this.
26938 (irange::undefined_p): Only look at m_kind.
26939 (irange::irange): Always set VR_UNDEFINED if applicable.
26940 (irange::set_undefined): Always set VR_UNDEFINED.
26941 (irange::set_varying): Always set m_kind to VR_VARYING.
26942 (irange::normalize_min_max): Rename to...
26943 (irange::normalize_kind): ...this.
26945 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26947 * gimple-ssa-evrp-analyze.c (evrp_range_analyzer::set_ssa_range_info):
26948 Adjust for constant_p including varying_p.
26949 * tree-vrp.c (vrp_prop::finalize): Same.
26950 (determine_value_range): Same.
26951 * vr-values.c (vr_values::range_of_expr): Same.
26952 * value-range.cc (irange::symbolic_p): Do not check varying_p.
26953 (irange::constant_p): Same.
26955 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26957 * value-range.cc (irange::legacy_lower_bound): Replace
26958 !undefined_p check with num_ranges > 0.
26959 (irange::legacy_upper_bound): Same.
26960 * value-range.h (irange::type): Same.
26961 (irange::lower_bound): Same.
26962 (irange::upper_bound): Same.
26964 2021-04-26 Richard Biener <rguenther@suse.de>
26966 PR tree-optimization/99956
26967 * gimple-loop-interchange.cc (compute_access_stride):
26968 Try instantiating the access in a shallower loop nest
26969 if instantiating failed.
26970 (compute_access_strides): Pass adjustable loop_nest
26971 to compute_access_stride.
26973 2021-04-26 Christophe Lyon <christophe.lyon@linaro.org>
26975 * doc/sourcebuild.texi (arm_cmse_hw): Document.
26977 2021-04-26 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26979 * config/aarch64/iterators.md (vwcore): Handle V4BF, V8BF.
26981 2021-04-26 Thomas Schwinge <thomas@codesourcery.com>
26982 Nathan Sidwell <nathan@codesourcery.com>
26983 Tom de Vries <vries@codesourcery.com>
26984 Julian Brown <julian@codesourcery.com>
26985 Kwok Cheung Yeung <kcy@codesourcery.com>
26987 * omp-offload.c (oacc_validate_dims): Implement
26988 '-Wopenacc-parallelism'.
26989 * doc/invoke.texi (-Wopenacc-parallelism): Document.
26991 2021-04-26 Richard Biener <rguenther@suse.de>
26993 * tree-cfg.h (gimplify_build1): Remove.
26994 (gimplify_build2): Likewise.
26995 (gimplify_build3): Likewise.
26996 * tree-cfg.c (gimplify_build1): Move to tree-vect-generic.c.
26997 (gimplify_build2): Likewise.
26998 (gimplify_build3): Likewise.
26999 * tree-vect-generic.c (gimplify_build1): Move from tree-cfg.c.
27001 (gimplify_build2): Likewise.
27002 (gimplify_build3): Likewise.
27003 (tree_vec_extract): Use resimplify with following SSA edges.
27004 (expand_vector_parallel): Avoid passing NULL size/bitpos
27005 to tree_vec_extract.
27006 * expr.c (store_constructor): Deal with zero-element CTORs.
27007 * match.pd (bit_field_ref <vector CTOR>): Make sure to
27008 produce vector constants when possible.
27010 2021-04-26 Richard Biener <rguenther@suse.de>
27012 * tree-complex.c: Include gimple-fold.h.
27013 (expand_complex_addition): Use gimple_build.
27014 (expand_complex_multiplication_components): Likewise.
27015 (expand_complex_multiplication): Likewise.
27016 (expand_complex_div_straight): Likewise.
27017 (expand_complex_div_wide): Likewise.
27018 (expand_complex_division): Likewise.
27019 (expand_complex_conjugate): Likewise.
27020 (expand_complex_comparison): Likewise.
27022 2021-04-26 Richard Biener <rguenther@suse.de>
27024 * tree-ssa-phiopt.c (two_value_replacement): Remove use
27025 of legacy gimplify_buildN API.
27027 2021-04-26 Richard Biener <rguenther@suse.de>
27029 PR tree-optimization/99473
27030 * tree-ssa-phiopt.c (cond_store_replacement): Handle all
27033 2021-04-26 Richard Biener <rguenther@suse.de>
27035 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin):
27036 Use replace_call_with_value.
27038 2021-04-26 Richard Biener <rguenther@suse.de>
27040 * tree-ssa-propagate.h (valid_gimple_rhs_p): Remove.
27041 (update_gimple_call): Likewise.
27042 (update_call_from_tree): Likewise.
27043 * tree-ssa-propagate.c (valid_gimple_rhs_p): Remove.
27044 (valid_gimple_call_p): Likewise.
27045 (move_ssa_defining_stmt_for_defs): Likewise.
27046 (finish_update_gimple_call): Likewise.
27047 (update_gimple_call): Likewise.
27048 (update_call_from_tree): Likewise.
27049 (propagate_tree_value_into_stmt): Use replace_call_with_value.
27050 * gimple-fold.h (update_gimple_call): Declare.
27051 * gimple-fold.c (valid_gimple_rhs_p): Move here from
27052 tree-ssa-propagate.c.
27053 (update_gimple_call): Likewise.
27054 (valid_gimple_call_p): Likewise.
27055 (finish_update_gimple_call): Likewise, and simplify.
27056 (gimplify_and_update_call_from_tree): Implement
27057 update_call_from_tree functionality, avoid excessive
27058 push/pop_gimplify_context.
27059 (gimple_fold_builtin): Use only gimplify_and_update_call_from_tree.
27060 (gimple_fold_call): Likewise.
27061 * gimple-ssa-sprintf.c (try_substitute_return_value): Likewise.
27062 * tree-ssa-ccp.c (ccp_folder::fold_stmt): Likewise.
27063 (pass_fold_builtins::execute): Likewise.
27064 (optimize_stack_restore): Use replace_call_with_value.
27065 * tree-cfg.c (fold_loop_internal_call): Likewise.
27066 * tree-ssa-dce.c (maybe_optimize_arith_overflow): Use
27067 only gimplify_and_update_call_from_tree.
27068 * tree-ssa-strlen.c (handle_builtin_strlen): Likewise.
27069 (handle_builtin_strchr): Likewise.
27070 * tsan.c: Include gimple-fold.h instead of tree-ssa-propagate.h.
27072 2021-04-26 Jakub Jelinek <jakub@redhat.com>
27075 * vmsdbgout.c (ASM_OUTPUT_DEBUG_STRING, vmsdbgout_begin_block,
27076 vmsdbgout_end_block, lookup_filename, vmsdbgout_source_line): Remove
27079 2021-04-25 liuhongt <hongtao.liu@intel.com>
27082 * config/i386/i386-builtin.def (BDESC): Change the icode of
27083 the following builtins to CODE_FOR_nothing.
27084 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
27085 IX86_BUILTIN_PCMPEQB128, IX86_BUILTIN_PCMPEQW128,
27086 IX86_BUILTIN_PCMPEQD128, IX86_BUILTIN_PCMPEQQ,
27087 IX86_BUILTIN_PCMPEQB256, IX86_BUILTIN_PCMPEQW256,
27088 IX86_BUILTIN_PCMPEQD256, IX86_BUILTIN_PCMPEQQ256,
27089 IX86_BUILTIN_PCMPGTB128, IX86_BUILTIN_PCMPGTW128,
27090 IX86_BUILTIN_PCMPGTD128, IX86_BUILTIN_PCMPGTQ,
27091 IX86_BUILTIN_PCMPGTB256, IX86_BUILTIN_PCMPGTW256,
27092 IX86_BUILTIN_PCMPGTD256, IX86_BUILTIN_PCMPGTQ256.
27093 * config/i386/sse.md (avx2_eq<mode>3): Deleted.
27094 (sse2_eq<mode>3): Ditto.
27095 (sse4_1_eqv2di3): Ditto.
27096 (sse2_gt<mode>3): Rename to ..
27097 (*sse2_gt<mode>3): .. this.
27099 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27102 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27105 * config/darwin.c (darwin_binds_local_p): Assume that any
27106 public symbol might be interposed for PIC code. Update function
27107 header comment to reflect current Darwin capability.
27109 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
27112 * config/darwin.c (darwin_binds_local_p): Assume that any
27113 public symbol might be interposed for PIC code. Update function
27114 header comment to reflect current Darwin capability.
27116 2021-04-24 Richard Sandiford <richard.sandiford@arm.com>
27118 * doc/sourcebuild.texi: Document no-opts and any-opts target
27121 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
27123 * config/bpf/bpf.h (ASM_OUTPUT_ALIGNED_BSS): Use .type and .lcomm.
27125 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
27127 * config/bpf/bpf.h (FUNCTION_BOUNDARY): Set to 64.
27129 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
27132 * config/i386/i386-options.c (ix86_option_override_internal):
27133 Error out when -m96bit-long-double is used with 64bit targets.
27134 * config/i386/i386.md (*pushxf_rounded): Remove pattern.
27136 2021-04-23 Martin Liska <mliska@suse.cz>
27138 * lto-wrapper.c: Remove FIXME about usage of
27139 hardware_concurrency. The function is not on par with
27142 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
27145 * config/i386/sync.md (FILD_ATOMIC/FIST_ATOMIC FP load peephole2):
27146 Copy operand 3 to operand 4. Use sse_reg_operand
27147 as operand 3 predicate.
27148 (FILD_ATOMIC/FIST_ATOMIC FP load peephole2 with mem blockage): Ditto.
27149 (LDX_ATOMIC/STX_ATOMIC FP load peephole2): Ditto.
27150 (LDX_ATOMIC/LDX_ATOMIC FP load peephole2 with mem blockage): Ditto.
27151 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2):
27152 Copy operand 1 to operand 0.
27153 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2 with mem blockage): Ditto.
27154 (LDX_ATOMIC/STX_ATOMIC FP store peephole2): Ditto.
27155 (LDX_ATOMIC/LDX_ATOMIC FP store peephole2 with mem blockage): Ditto.
27157 2021-04-23 Alex Coplan <alex.coplan@arm.com>
27159 PR rtl-optimization/100230
27160 * early-remat.c (early_remat::sort_candidates): Use delete[]
27161 instead of delete for array allocated with new[].
27163 2021-04-23 Richard Biener <rguenther@suse.de>
27165 * genmatch.c (lower_cond): Remove VEC_COND_EXPR special-casing.
27166 (capture_info::capture_info): Likewise.
27167 (capture_info::walk_match): Likewise.
27168 (expr::gen_transform): Likewise.
27169 (dt_simplify::gen_1): Likewise.
27170 * gimple-match-head.c (maybe_resimplify_conditional_op):
27171 Remove VEC_COND_EXPR special-casing.
27172 (gimple_simplify): Likewise.
27173 * gimple.c (gimple_could_trap_p_1): Adjust.
27174 * tree-ssa-pre.c (compute_avail): Allow VEC_COND_EXPR
27175 to participate in PRE.
27177 2021-04-23 Richard Biener <rguenther@suse.de>
27179 * cfganal.c (connect_infinite_loops_to_exit): First call
27180 add_noreturn_fake_exit_edges.
27181 * ipa-sra.c (process_scan_results): Do not call the now redundant
27182 add_noreturn_fake_exit_edges.
27183 * predict.c (tree_estimate_probability): Likewise.
27184 (rebuild_frequencies): Likewise.
27185 * store-motion.c (one_store_motion_pass): Likewise.
27187 2021-04-23 Richard Biener <rguenther@suse.de>
27189 PR tree-optimization/100222
27190 * predict.c (pass_profile::execute): Remove redundant call to
27191 mark_irreducible_loops.
27192 (report_predictor_hitrates): Likewise.
27194 2021-04-23 Richard Biener <rguenther@suse.de>
27196 * tree-ssa-loop-ivopts.c (rewrite_use_nonlinear_expr): Avoid
27197 valid_gimple_rhs_p by instead gimplifying to one.
27199 2021-04-23 Richard Biener <rguenther@suse.de>
27201 PR tree-optimization/99971
27202 * tree-vect-data-refs.c (vect_slp_analyze_node_dependences):
27203 Always use TBAA for loads.
27205 2021-04-23 liuhongt <hongtao.liu@intel.com>
27208 * config/i386/i386-options.c (ix86_option_override_internal):
27209 Clear MASK_AVX256_SPLIT_UNALIGNED_LOAD/STORE in x_target_flags
27210 when X86_TUNE_AVX256_UNALIGNED_LOAD/STORE_OPTIMAL is enabled
27211 by target attribute.
27213 2021-04-23 David Edelsohn <dje.gcc@gmail.com>
27215 * config/rs6000/aix71.h (PREFERRED_DEBUGGING_TYPE): Change to
27217 * config/rs6000/aix72.h (PREFERRED_DEBUGGING_TYPE): Same.
27219 2021-04-22 David Edelsohn <dje.gcc@gmail.com>
27221 * config.gcc (powerpc-ibm-aix6.*): Remove.
27222 * config/rs6000/aix61.h: Delete.
27224 2021-04-22 Martin Liska <mliska@suse.cz>
27226 PR testsuite/100159
27227 PR testsuite/100192
27228 * builtins.c (expand_builtin): Fix typos and missing comments.
27229 * dwarf2out.c (gen_subprogram_die): Likewise.
27230 (gen_struct_or_union_type_die): Likewise.
27232 2021-04-22 Uroš Bizjak <ubizjak@gmail.com>
27235 * config/i386/i386-expand.c (ix86_expand_convert_uns_sidf_sse):
27236 Remove the sign with FE_DOWNWARD, where x - x = -0.0.
27238 2021-04-21 Iain Sandoe <iain@sandoe.co.uk>
27240 * config/i386/darwin.h (TARGET_64BIT): Remove definition
27241 based on TARGET_ISA_64BIT.
27242 (TARGET_64BIT_P): Remove definition based on
27243 TARGET_ISA_64BIT_P().
27245 2021-04-21 Martin Liska <mliska@suse.cz>
27248 2021-04-21 Martin Liska <mliska@suse.cz>
27250 * lto-wrapper.c (cpuset_popcount): Remove.
27251 (init_num_threads): Remove and use hardware_concurrency.
27253 2021-04-21 Martin Liska <mliska@suse.cz>
27256 * main.c (main): Call toplev::finalize in CHECKING_P mode.
27257 * ipa-modref.c (ipa_modref_c_finalize): summaries are NULL
27258 when incremental LTO linking happens.
27260 2021-04-21 Martin Liska <mliska@suse.cz>
27262 * lto-wrapper.c (run_gcc): When -flto=jobserver is used, but the
27263 makeserver cannot be detected, then use -flto=N fallback.
27265 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
27267 * acinclude.m4 (gcc_AC_INITFINI_ARRAY): When cross-compiling,
27268 default to yes for aarch64-linux-gnu.
27269 * configure: Regenerate.
27271 2021-04-21 Martin Liska <mliska@suse.cz>
27273 * lto-wrapper.c (cpuset_popcount): Remove.
27274 (init_num_threads): Remove and use hardware_concurrency.
27276 2021-04-21 Martin Liska <mliska@suse.cz>
27278 * config/i386/i386.c: Remove superfluous || TARGET_MACHO
27279 which remains to be '(... || 0)' and clang complains about it.
27280 * dwarf2out.c (AT_vms_delta): Declare conditionally.
27281 (add_AT_vms_delta): Likewise.
27282 * tree.c (fld_simplified_type): Use rather more common pattern
27283 for disabling of something (#if 0).
27284 (get_tree_code_name): Likewise.
27285 (verify_type_variant): Likewise.
27287 2021-04-21 Martin Liska <mliska@suse.cz>
27289 * config/i386/i386-expand.c (decide_alignment): Use newly named
27290 macro TARGET_CPU_P.
27291 * config/i386/i386.c (ix86_decompose_address): Likewise.
27292 (ix86_address_cost): Likewise.
27293 (ix86_lea_outperforms): Likewise.
27294 (ix86_avoid_lea_for_addr): Likewise.
27295 (ix86_add_stmt_cost): Likewise.
27296 * config/i386/i386.h (TARGET_*): Remove.
27297 (TARGET_CPU_P): New macro.
27298 * config/i386/i386.md: Use newly named macro TARGET_CPU_P.
27299 * config/i386/x86-tune-sched-atom.c (do_reorder_for_imul): Likewise.
27300 (swap_top_of_ready_list): Likewise.
27301 (ix86_atom_sched_reorder): Likewise.
27302 * config/i386/x86-tune-sched-bd.c (ix86_bd_has_dispatch): Likewise.
27303 * config/i386/x86-tune-sched.c (ix86_adjust_cost): Likewise.
27305 2021-04-21 Martin Liska <mliska@suse.cz>
27307 * config/i386/i386-options.c (TARGET_EXPLICIT_NO_SAHF_P):
27309 (SET_TARGET_NO_SAHF): Likewise.
27310 (TARGET_EXPLICIT_PREFETCH_SSE_P): Likewise.
27311 (SET_TARGET_PREFETCH_SSE): Likewise.
27312 (TARGET_EXPLICIT_NO_TUNE_P): Likewise.
27313 (SET_TARGET_NO_TUNE): Likewise.
27314 (TARGET_EXPLICIT_NO_80387_P): Likewise.
27315 (SET_TARGET_NO_80387): Likewise.
27317 * config/i386/i386.h (TARGET_*): Remove.
27318 * opth-gen.awk: Generate new used macros.
27320 2021-04-21 Martin Liska <mliska@suse.cz>
27322 * config/i386/i386.h (PTA_*): Remove.
27323 (enum pta_flag): New.
27324 (DEF_PTA): Generate PTA_* values from i386-isa.def.
27325 * config/i386/i386-isa.def: New file.
27327 2021-04-21 Alex Coplan <alex.coplan@arm.com>
27330 * config/aarch64/aarch64-bti-insert.c (aarch64_bti_j_insn_p): New.
27331 (rest_of_insert_bti): Avoid inserting duplicate bti j insns for
27332 jump table targets.
27334 2021-04-21 H.J. Lu <hjl.tools@gmail.com>
27336 * config.gcc: Install mwaitintrin.h for i[34567]86-*-* and
27337 x86_64-*-* targets.
27338 * common/config/i386/i386-common.c (OPTION_MASK_ISA2_MWAIT_SET):
27340 (OPTION_MASK_ISA2_MWAIT_UNSET): Likewise.
27341 (ix86_handle_option): Handle -mmwait.
27342 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
27343 Replace OPTION_MASK_ISA_SSE3 with OPTION_MASK_ISA2_MWAIT on
27344 __builtin_ia32_monitor and __builtin_ia32_mwait.
27345 * config/i386/i386-options.c (isa2_opts): Add -mmwait.
27346 (ix86_valid_target_attribute_inner_p): Likewise.
27347 (ix86_option_override_internal): Enable mwait/monitor
27348 instructions for -msse3.
27349 * config/i386/i386.h (TARGET_MWAIT): New.
27350 (TARGET_MWAIT_P): Likewise.
27351 * config/i386/i386.opt: Add -mmwait.
27352 * config/i386/mwaitintrin.h: New file.
27353 * config/i386/pmmintrin.h: Include <mwaitintrin.h>.
27354 * config/i386/sse.md (sse3_mwait): Replace TARGET_SSE3 with
27356 (@sse3_monitor_<mode>): Likewise.
27357 * config/i386/x86gprintrin.h: Include <mwaitintrin.h>.
27358 * doc/extend.texi: Document mwait target attribute.
27359 * doc/invoke.texi: Document -mmwait.
27361 2021-04-21 Martin Liska <mliska@suse.cz>
27363 * config/i386/i386-options.c (DEF_ENUM): Remove it.
27364 * config/i386/i386-opts.h (DEF_ENUM): Likewise.
27365 * config/i386/stringop.def (DEF_ENUM): Likewise.
27367 2021-04-21 Martin Liska <mliska@suse.cz>
27369 * tree-cfg.c (gimple_verify_flow_info): Use qD instead
27370 of print_generic_expr.
27372 2021-04-21 Jakub Jelinek <jakub@redhat.com>
27374 PR rtl-optimization/100148
27375 * cprop.c (constprop_register): Use next_nondebug_insn instead of
27378 2021-04-21 Martin Liska <mliska@suse.cz>
27381 * cgraphunit.c (cgraph_node::analyze): Remove duplicate
27382 free_dominance_info calls.
27384 2021-04-21 Richard Biener <rguenther@suse.de>
27386 * gimple-fold.c (maybe_fold_reference): Remove is_lhs
27387 parameter (and assume it to be false).
27388 (fold_gimple_assign): Adjust, remove all callers of
27389 maybe_fold_reference calling it with is_lhs true.
27390 (gimple_fold_call): Likewise.
27391 (fold_stmt_1): Likewise.
27393 2021-04-21 Richard Biener <rguenther@suse.de>
27395 * fold-const.c (pedantic_non_lvalue_loc): Remove.
27396 (fold_binary_loc): Adjust.
27397 (fold_ternary_loc): Likewise.
27399 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
27401 PR middle-end/100130
27402 * varasm.c (get_block_for_decl): Make sure that any use of the
27403 retain attribute matches the section's retain flag.
27404 (switch_to_section): Check for retain mismatches even when
27405 changing sections, but do not warn if the given decl is the
27406 section's named.decl.
27407 (output_object_block): Pass the first decl in the block (if any)
27408 to switch_to_section.
27410 2021-04-20 H.J. Lu <hjl.tools@gmail.com>
27412 * config/i386/i386-c.c (ix86_target_macros_internal): Define
27413 __CRC32__ for -mcrc32.
27414 * config/i386/i386-options.c (ix86_option_override_internal):
27415 Enable crc32 instruction for -msse4.2.
27416 * config/i386/i386.md (sse4_2_crc32<mode>): Remove TARGET_SSE4_2
27418 (sse4_2_crc32di): Likewise.
27419 * config/i386/ia32intrin.h: Use crc32 target option for CRC32
27422 2021-04-20 Segher Boessenkool <segher@kernel.crashing.org>
27425 * config/rs6000/rs6000.c (rs6000_machine_from_flags): Do not consider
27428 2021-04-20 Martin Liska <mliska@suse.cz>
27430 * doc/invoke.texi: Fix typo.
27431 * params.opt: Likewise.
27433 2021-04-20 Martin Liska <mliska@suse.cz>
27435 * doc/invoke.texi: Document new param.
27437 2021-04-19 Andrew MacLeod <amacleod@redhat.com>
27439 PR tree-optimization/100081
27440 * gimple-range-cache.h (ranger_cache): Inherit from gori_compute
27441 rather than gori_compute_cache.
27442 * gimple-range-gori.cc (is_gimple_logical_p): Move to top of file.
27443 (range_def_chain::m_logical_depth): New member.
27444 (range_def_chain::range_def_chain): Initialize m_logical_depth.
27445 (range_def_chain::get_def_chain): Don't build defchains through more
27446 than LOGICAL_LIMIT logical expressions.
27447 * params.opt (param_ranger_logical_depth): New.
27449 2021-04-19 Richard Earnshaw <rearnsha@arm.com>
27452 * config/arm/arm.c (arm_configure_build_target): Do not strip
27453 extended FPU/SIMD feature bits from the target ISA when -mfpu
27454 is specified (partial revert of r11-8168).
27456 2021-04-19 Thomas Schwinge <thomas@codesourcery.com>
27458 * params.opt (-param=openacc-kernels=): Add.
27459 * omp-oacc-kernels-decompose.cc
27460 (pass_omp_oacc_kernels_decompose::gate): Use it.
27461 * doc/invoke.texi (-fopenacc-kernels=@var{mode}): Move...
27462 (--param): ... here, 'openacc-kernels'.
27464 2021-04-19 Martin Liska <mliska@suse.cz>
27467 * gengtype.c (finish_root_table): Align function arguments
27468 in between declaration and definition.
27470 2021-04-19 Eric Botcazou <ebotcazou@adacore.com>
27472 * config/i386/winnt.c (i386_pe_seh_cold_init): Properly deal with
27473 frames larger than the SEH maximum frame size.
27475 2021-04-18 Segher Boessenkool <segher@kernel.crashing.org>
27477 PR rtl-optimization/99927
27478 * combine.c (distribute_notes) [REG_UNUSED]: If the register already
27479 is dead, just drop it.
27481 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
27484 * config/i386/winnt-d.c (TARGET_D_TEMPLATES_ALWAYS_COMDAT): Define.
27485 * doc/tm.texi: Regenerate.
27486 * doc/tm.texi.in (D language and ABI): Add @hook for
27487 TARGET_D_TEMPLATES_ALWAYS_COMDAT.
27489 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
27491 * config/darwin-d.c (darwin_d_handle_target_object_format): New
27493 (darwin_d_register_target_info): New function.
27494 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27495 * config/dragonfly-d.c (dragonfly_d_handle_target_object_format): New
27497 (dragonfly_d_register_target_info): New function.
27498 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27499 * config/freebsd-d.c (freebsd_d_handle_target_object_format): New
27501 (freebsd_d_register_target_info): New function.
27502 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27503 * config/glibc-d.c (glibc_d_handle_target_object_format): New
27505 (glibc_d_register_target_info): New function.
27506 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27507 * config/i386/i386-d.c (ix86_d_handle_target_object_format): New
27509 (ix86_d_register_target_info): Add ix86_d_handle_target_object_format
27510 as handler for objectFormat key.
27511 * config/i386/winnt-d.c (winnt_d_handle_target_object_format): New
27513 (winnt_d_register_target_info): New function.
27514 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27515 * config/netbsd-d.c (netbsd_d_handle_target_object_format): New
27517 (netbsd_d_register_target_info): New function.
27518 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27519 * config/openbsd-d.c (openbsd_d_handle_target_object_format): New
27521 (openbsd_d_register_target_info): New function.
27522 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27523 * config/pa/pa-d.c (pa_d_handle_target_object_format): New function.
27524 (pa_d_register_target_info): Add pa_d_handle_target_object_format as
27525 handler for objectFormat key.
27526 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_object_format): New
27528 (rs6000_d_register_target_info): Add
27529 rs6000_d_handle_target_object_format as handler for objectFormat key.
27530 * config/sol2-d.c (solaris_d_handle_target_object_format): New
27532 (solaris_d_register_target_info): New function.
27533 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27535 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27538 * config/aarch64/aarch64.c (aarch64_function_arg_alignment): Change
27539 abi_break argument from bool * to unsigned *, store there the pre-GCC 9
27541 (aarch64_layout_arg, aarch64_gimplify_va_arg_expr): Adjust callers.
27542 (aarch64_function_arg_regno_p): Likewise. Only emit -Wpsabi note if
27543 the old and new alignment after applying MIN/MAX to it is different.
27545 2021-04-16 Tamar Christina <tamar.christina@arm.com>
27548 * config/aarch64/aarch64-sve.md (@aarch64_sve_trn1_conv<mode>): New.
27549 * config/aarch64/aarch64.c (aarch64_expand_sve_const_pred_trn): Use new
27551 * config/aarch64/iterators.md (UNSPEC_TRN1_CONV): New.
27553 2021-04-16 Bill Schmidt <wschmidt@linux.ibm.com>
27555 * doc/extend.texi (PowerPC AltiVec/VSX Built-in Functions): Revise
27556 this section and its subsections.
27558 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27561 * config/aarch64/aarch64.md (*neg_asr_si2_extr, *extrsi5_insn_di): New
27562 define_insn patterns.
27564 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
27566 PR rtl-optimization/98689
27567 * reg-notes.def (UNTYPED_CALL): New note.
27568 * combine.c (distribute_notes): Handle it.
27569 * emit-rtl.c (try_split): Likewise.
27570 * rtlanal.c (rtx_properties::try_to_add_insn): Likewise. Assume
27571 that calls with the note implicitly set all return value registers.
27572 * builtins.c (expand_builtin_apply): Add a REG_UNTYPED_CALL
27575 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
27577 PR rtl-optimization/99596
27578 * rtlanal.c (rtx_properties::try_to_add_insn): Don't add global
27579 register accesses for const calls. Assume that pure functions
27580 can only read from global registers. Ignore cases in which
27581 the stack pointer has been marked global.
27583 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27586 * tree-vect-loop.c (vect_transform_loop): Don't remove just
27587 dead scalar .MASK_LOAD calls, but also dead .COND_* calls - replace
27588 them by their last argument.
27590 2021-04-15 Martin Liska <mliska@suse.cz>
27592 * doc/invoke.texi: Other params don't use it, remove it.
27594 2021-04-15 Richard Biener <rguenther@suse.de>
27596 * gimple-builder.h: Add deprecation note.
27598 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
27601 * attribs.h (restrict_type_identity_attributes_to): Declare.
27602 * attribs.c (restrict_type_identity_attributes_to): New function.
27604 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
27607 * attribs.h (affects_type_identity_attributes): Declare.
27608 * attribs.c (remove_attributes_matching): New function.
27609 (affects_type_identity_attributes): Likewise.
27611 2021-04-15 Jakub Jelinek <jakub@redhat.com>
27614 * config/aarch64/aarch64.md (*<LOGICAL:optab>_<SHIFT:optab><mode>3):
27615 Add combine splitters for *<LOGICAL:optab>_ashl<mode>3 with
27616 ZERO_EXTEND, SIGN_EXTEND or AND.
27618 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27620 PR rtl-optimization/99929
27621 * rtl.h (same_vector_encodings_p): New function.
27622 * cse.c (exp_equiv_p): Check that CONST_VECTORs have the same encoding.
27623 * cselib.c (rtx_equal_for_cselib_1): Likewise.
27624 * jump.c (rtx_renumbered_equal_p): Likewise.
27625 * lra-constraints.c (operands_match_p): Likewise.
27626 * reload.c (operands_match_p): Likewise.
27627 * rtl.c (rtx_equal_p_cb, rtx_equal_p): Likewise.
27629 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27631 * print-rtl.c (rtx_writer::print_rtx_operand_codes_E_and_V): Print
27632 more information about variable-length CONST_VECTORs.
27634 2021-04-14 Vladimir N. Makarov <vmakarov@redhat.com>
27636 PR rtl-optimization/100066
27637 * lra-constraints.c (split_reg): Check paradoxical_subreg_p for
27638 ordered modes when choosing splitting mode for hard reg.
27640 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27643 * config/aarch64/aarch64.c (aarch64_expand_sve_const_vector_sel):
27645 (aarch64_expand_sve_const_vector): Use it for nelts_per_pattern==2.
27647 2021-04-14 Andreas Krebbel <krebbel@linux.ibm.com>
27649 * config/s390/s390-builtins.def (O_M5, O_M12, ...): Add new macros
27650 for mask operand types.
27651 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
27652 (s390_vec_permi_dbl, s390_vpdi): Use the M5 type for the immediate
27654 (s390_vec_msum_u128, s390_vmslg): Use the M12 type for the
27656 * config/s390/s390.c (s390_const_operand_ok): Check the new
27657 operand types and generate a list of valid values.
27659 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27661 * doc/tm.texi: Regenerate.
27662 * doc/tm.texi.in (D language and ABI): Add @hook for
27663 TARGET_D_REGISTER_OS_TARGET_INFO.
27665 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27667 * config/aarch64/aarch64-d.c (aarch64_d_handle_target_float_abi): New
27669 (aarch64_d_register_target_info): New function.
27670 * config/aarch64/aarch64-protos.h (aarch64_d_register_target_info):
27672 * config/aarch64/aarch64.h (TARGET_D_REGISTER_CPU_TARGET_INFO):
27674 * config/arm/arm-d.c (arm_d_handle_target_float_abi): New function.
27675 (arm_d_register_target_info): New function.
27676 * config/arm/arm-protos.h (arm_d_register_target_info): Declare.
27677 * config/arm/arm.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27678 * config/i386/i386-d.c (ix86_d_handle_target_float_abi): New function.
27679 (ix86_d_register_target_info): New function.
27680 * config/i386/i386-protos.h (ix86_d_register_target_info): Declare.
27681 * config/i386/i386.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27682 * config/mips/mips-d.c (mips_d_handle_target_float_abi): New function.
27683 (mips_d_register_target_info): New function.
27684 * config/mips/mips-protos.h (mips_d_register_target_info): Declare.
27685 * config/mips/mips.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27686 * config/pa/pa-d.c (pa_d_handle_target_float_abi): New function.
27687 (pa_d_register_target_info): New function.
27688 * config/pa/pa-protos.h (pa_d_register_target_info): Declare.
27689 * config/pa/pa.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27690 * config/riscv/riscv-d.c (riscv_d_handle_target_float_abi): New
27692 (riscv_d_register_target_info): New function.
27693 * config/riscv/riscv-protos.h (riscv_d_register_target_info): Declare.
27694 * config/riscv/riscv.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27695 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_float_abi): New
27697 (rs6000_d_register_target_info): New function.
27698 * config/rs6000/rs6000-protos.h (rs6000_d_register_target_info):
27700 * config/rs6000/rs6000.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27701 * config/s390/s390-d.c (s390_d_handle_target_float_abi): New function.
27702 (s390_d_register_target_info): New function.
27703 * config/s390/s390-protos.h (s390_d_register_target_info): Declare.
27704 * config/s390/s390.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27705 * config/sparc/sparc-d.c (sparc_d_handle_target_float_abi): New
27707 (sparc_d_register_target_info): New function.
27708 * config/sparc/sparc-protos.h (sparc_d_register_target_info): Declare.
27709 * config/sparc/sparc.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27710 * doc/tm.texi: Regenerate.
27711 * doc/tm.texi.in (D language and ABI): Add @hook for
27712 TARGET_D_REGISTER_CPU_TARGET_INFO.
27714 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27716 * config/i386/i386-d.c (ix86_d_has_stdcall_convention): New function.
27717 * config/i386/i386-protos.h (ix86_d_has_stdcall_convention): Declare.
27718 * config/i386/i386.h (TARGET_D_HAS_STDCALL_CONVENTION): Define.
27719 * doc/tm.texi: Regenerate.
27720 * doc/tm.texi.in (D language and ABI): Add @hook for
27721 TARGET_D_HAS_STDCALL_CONVENTION.
27723 2021-04-14 Richard Biener <rguenther@suse.de>
27725 * tree-cfg.c (verify_gimple_assign_ternary): Verify that
27726 VEC_COND_EXPRs have a gimple_val condition.
27727 * tree-ssa-propagate.c (valid_gimple_rhs_p): VEC_COND_EXPR
27728 can no longer have a GENERIC condition.
27730 2021-04-14 Richard Earnshaw <rearnsha@arm.com>
27733 * config/arm/arm.c (arm_configure_build_target): Strip isa_all_fpbits
27734 from the isa_delta when -mfpu has been used.
27735 (arm_options_perform_arch_sanity_checks): It's the architecture that
27736 lacks an FPU not the processor.
27738 2021-04-13 Richard Biener <rguenther@suse.de>
27740 PR tree-optimization/100053
27741 * tree-ssa-sccvn.c (vn_nary_op_get_predicated_value): Do
27742 not use optimistic dominance queries for backedges to validate
27744 (dominated_by_p_w_unex): Add parameter to ignore executable
27745 state on backedges.
27746 (rpo_elim::eliminate_avail): Adjust.
27748 2021-04-13 Jakub Jelinek <jakub@redhat.com>
27751 * config/aarch64/aarch64.md (*aarch64_bfxil<mode>_extr,
27752 *aarch64_bfxilsi_extrdi): New define_insn patterns.
27754 2021-04-13 Jakub Jelinek <jakub@redhat.com>
27757 * simplify-rtx.c (simplify_immed_subreg): For MODE_COMPOSITE_P
27758 outermode, return NULL if the result doesn't encode back to the
27759 original byte sequence.
27760 (simplify_gen_subreg): Don't create SUBREGs from constants to
27761 MODE_COMPOSITE_P outermode.
27763 2021-04-12 Jakub Jelinek <jakub@redhat.com>
27765 PR rtl-optimization/99905
27766 * combine.c (expand_compound_operation): If pos + len > modewidth,
27767 perform the right shift by pos in inner_mode and then convert to mode,
27768 instead of trying to simplify a shift of rtx with inner_mode by pos
27769 as if it was a shift in mode.
27771 2021-04-12 Jakub Jelinek <jakub@redhat.com>
27774 * combine.c (simplify_and_const_int_1): Don't optimize varop
27775 away if it has side-effects.
27777 2021-04-12 Martin Liska <mliska@suse.cz>
27779 * doc/extend.texi: Escape @smallexample content.
27781 2021-04-12 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
27783 * config/s390/s390.md ("*movdi_31", "*movdi_64"): Add
27784 alternative in order to load a DFP zero.
27786 2021-04-12 Martin Liska <mliska@suse.cz>
27788 * doc/extend.texi: Be more precise in documentation
27789 of symver attribute.
27791 2021-04-12 Martin Liska <mliska@suse.cz>
27794 * gimplify.c (gimplify_expr): Right now, we unpoison all
27795 variables before a goto <dest>. We should not do it if we are
27798 2021-04-12 Cui,Lili <lili.cui@intel.com>
27800 * common/config/i386/cpuinfo.h (get_intel_cpu): Handle
27802 * common/config/i386/i386-common.c (processor_names): Add
27804 (processor_alias_table): Add rocketlake.
27805 * common/config/i386/i386-cpuinfo.h (processor_subtypes): Add
27806 INTEL_COREI7_ROCKETLAKE.
27807 * config.gcc: Add -march=rocketlake.
27808 * config/i386/i386-c.c (ix86_target_macros_internal): Handle
27810 * config/i386/i386-options.c (m_ROCKETLAKE) : Define.
27811 (processor_cost_table): Add rocketlake cost.
27812 * config/i386/i386.h (ix86_size_cost) : Define
27814 (processor_type) : Add PROCESSOR_ROCKETLAKE.
27815 (PTA_ROCKETLAKE): Ditto.
27816 * doc/extend.texi: Add rocketlake.
27817 * doc/invoke.texi: Add rocketlake.
27819 2021-04-12 Cui,Lili <lili.cui@intel.com>
27821 * config/i386/i386.h (PTA_ALDERLAKE): Change alderlake ISA list.
27822 * config/i386/i386-options.c (m_CORE_AVX2): Add m_ALDERLAKE.
27823 * common/config/i386/cpuinfo.h (get_intel_cpu): Add AlderLake model.
27824 * doc/invoke.texi: Change alderlake ISA list.
27826 2021-04-11 Hafiz Abid Qadeer <abidh@codesourcery.com>
27828 PR middle-end/98088
27829 * omp-expand.c (expand_oacc_collapse_init): Update condition in
27832 2021-04-10 H.J. Lu <hjl.tools@gmail.com>
27835 * config/i386/serializeintrin.h (_serialize): Defined as macro.
27837 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27840 * expr.c (expand_expr_addr_expr_1): Test is_global_var rather than
27841 just TREE_STATIC on COMPOUND_LITERAL_EXPR_DECLs.
27843 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27845 PR middle-end/99989
27846 * gimple-ssa-warn-alloca.c
27847 (alloca_type_and_limit::alloca_type_and_limit): Initialize limit to
27848 0 with integer precision unconditionally.
27850 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27852 PR rtl-optimization/98601
27853 * rtlanal.c (rtx_addr_can_trap_p_1): Allow in assert unknown size
27854 not just for BLKmode, but also for VOIDmode. For STRICT_ALIGNMENT
27855 unaligned_mems handle VOIDmode like BLKmode.
27857 2021-04-10 Jan Hubicka <hubicka@ucw.cz>
27860 * tree.c (free_lang_data_in_decl): Do not release body of
27861 declare_variant_alt.
27863 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
27865 * config/aarch64/aarch64.c (aarch64_option_restore): If the
27866 architecture was specified explicitly and the tuning wasn't,
27867 tune for the architecture rather than the configured default CPU.
27869 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
27871 * config/aarch64/aarch64.md (tlsdesc_small_sve_<mode>): Use X30
27872 as the temporary register.
27874 2021-04-09 Martin Liska <mliska@suse.cz>
27876 * doc/extend.texi: Move non-target attributes on the top level.
27878 2021-04-09 Martin Liska <mliska@suse.cz>
27880 * doc/invoke.texi: Document minimum and maximum value of the
27881 argument for both supported compression algorithms.
27883 2021-04-08 David Edelsohn <dje.gcc@gmail.com>
27885 * config/rs6000/rs6000.c (rs6000_xcoff_select_section): Select
27886 TLS BSS before TLS data.
27887 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .comm.
27889 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
27891 * doc/sourcebuild.texi (stdint_types_mbig_endian): Document.
27893 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
27895 * match.pd: Extend vec_cond folds to handle shifts.
27897 2021-04-08 Maciej W. Rozycki <macro@orcam.me.uk>
27899 * config/vax/vax.md: Fix comment for `*bit<mode>' pattern's
27902 2021-04-08 Alex Coplan <alex.coplan@arm.com>
27905 * config/arm/iterators.md (MVE_vecs): New.
27906 (V_elem): Also handle V2DF.
27907 * config/arm/mve.md (*mve_mov<mode>): Rename to ...
27908 (*mve_vdup<mode>): ... this. Remove second alternative since
27909 vec_duplicate of const_int is not canonical RTL, and we don't
27910 want to match symbol_refs.
27911 (*mve_vec_duplicate<mode>): Delete (pattern is redundant).
27913 2021-04-08 Xionghu Luo <luoxhu@linux.ibm.com>
27915 * fold-const.c (fold_single_bit_test): Fix typo.
27916 * print-rtl.c (print_rtx_insn_vec): Call print_rtl_single
27919 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
27921 PR tree-optimization/97513
27922 * tree-vect-slp.c (vect_add_slp_permutation): New function,
27924 (vectorizable_slp_permutation): ...here. Detect cases in which
27925 all VEC_PERM_EXPRs are guaranteed to have the same stepped
27926 permute vector and only generate one permute vector for that case.
27927 Extend that case to handle variable-length vectors.
27929 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
27931 PR tree-optimization/99873
27932 * tree-vect-slp.c (vect_slp_prefer_store_lanes_p): New function.
27933 (vect_build_slp_instance): Don't split store groups that could
27934 use IFN_STORE_LANES.
27936 2021-04-07 Jakub Jelinek <jakub@redhat.com>
27939 * varasm.c (output_constant_pool_contents): Don't strip name encoding
27940 from XSTR (desc->sym, 0) or from label before passing those to
27943 2021-04-07 Richard Biener <rguenther@suse.de>
27945 PR tree-optimization/99954
27946 * tree-loop-distribution.c: Include tree-affine.h.
27947 (generate_memcpy_builtin): Try using tree-affine to prove
27949 (loop_distribution::classify_builtin_ldst): Always classify
27952 2021-04-07 Richard Biener <rguenther@suse.de>
27954 PR tree-optimization/99947
27955 * tree-vect-loop.c (vectorizable_induction): Pre-allocate
27956 steps vector to avoid pushing elements from the reallocated
27959 2021-04-07 Richard Biener <rguenther@suse.de>
27961 * tree-ssa-sccvn.h (print_vn_reference_ops): Declare.
27962 * tree-ssa-pre.c (print_pre_expr): Factor out VN reference operand
27964 * tree-ssa-sccvn.c (print_vn_reference_ops): ... into this new
27966 (debug_vn_reference_ops): New.
27968 2021-04-07 Bin Cheng <bin.cheng@linux.alibaba.com>
27970 PR tree-optimization/98736
27971 * tree-loop-distribution.c
27972 * (loop_distribution::bb_top_order_init):
27973 Compute RPO with programing order preserved by calling function
27974 rev_post_order_and_mark_dfs_back_seme.
27976 2021-04-06 Vladimir N. Makarov <vmakarov@redhat.com>
27979 * lra-constraints.c (split_reg): Don't check paradoxical_subreg_p.
27980 * lra-lives.c (clear_sparseset_regnos, regnos_in_sparseset_p): New
27982 (process_bb_lives): Don't update biggest mode of hard reg for
27983 implicit in multi-register group. Use the new functions for
27984 updating dead_set and unused_set by register notes.
27986 2021-04-06 Xianmiao Qu <xianmiao_qu@c-sky.com>
27988 * config/csky/csky_pipeline_ck802.md : Use insn reservation name
27991 2021-04-06 H.J. Lu <hjl.tools@gmail.com>
27993 * config/i386/x86-tune-costs.h (skylake_memcpy): Updated.
27994 (skylake_memset): Likewise.
27995 (skylake_cost): Change CLEAR_RATIO to 17.
27996 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
27997 Replace m_CANNONLAKE, m_ICELAKE_CLIENT, m_ICELAKE_SERVER,
27998 m_TIGERLAKE and m_SAPPHIRERAPIDS with m_SKYLAKE and m_CORE_AVX512.
28000 2021-04-06 Richard Biener <rguenther@suse.de>
28002 PR tree-optimization/99880
28003 * tree-vect-loop.c (maybe_set_vectorized_backedge_value): Only
28004 set vectorized defs of relevant PHIs.
28006 2021-04-06 Richard Biener <rguenther@suse.de>
28008 PR tree-optimization/99924
28009 * tree-vect-slp.c (vect_bb_partition_graph_r): Do not mark
28010 nodes w/o scalar stmts as visited.
28012 2021-04-06 Alex Coplan <alex.coplan@arm.com>
28015 * config/arm/arm.c (arm_libcall_uses_aapcs_base): Also use base
28016 PCS for [su]fix_optab.
28018 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
28020 * config/darwin.c (machopic_legitimize_pic_address): Check
28021 that the current pic register is one of the hard reg set
28022 before setting liveness.
28024 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
28026 * config/darwin.c (machopic_legitimize_pic_address): Fix
28027 whitespace, remove unused code.
28029 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28031 PR tree-optimization/99882
28032 * gimple-ssa-store-merging.c (bswap_view_convert): Handle val with
28035 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28037 PR rtl-optimization/99863
28038 * dse.c (replace_read): Drop regs_live argument. Instead of
28039 regs_live, use store_insn->fixed_regs_live if non-NULL,
28040 otherwise punt if insns sequence clobbers or sets any hard
28043 2021-04-03 Jakub Jelinek <jakub@redhat.com>
28046 * targhooks.h (default_print_patchable_function_entry_1): Declare.
28047 * targhooks.c (default_print_patchable_function_entry_1): New function,
28048 copied from default_print_patchable_function_entry with an added flags
28050 (default_print_patchable_function_entry): Rewritten into a small
28051 wrapper around default_print_patchable_function_entry_1.
28052 * config/rs6000/rs6000.c (TARGET_ASM_PRINT_PATCHABLE_FUNCTION_ENTRY):
28054 (rs6000_print_patchable_function_entry): New function.
28056 2021-04-02 Eric Botcazou <ebotcazou@adacore.com>
28058 * doc/invoke.texi (fdelete-dead-exceptions): Minor tweak.
28060 2021-04-01 Jason Merrill <jason@redhat.com>
28063 * common.opt: Document v15 and v16.
28065 2021-04-01 Richard Biener <rguenther@suse.de>
28067 PR tree-optimization/99863
28068 * gimplify.c (gimplify_init_constructor): Recompute vector
28071 2021-04-01 Jakub Jelinek <jakub@redhat.com>
28073 * doc/extend.texi (symver attribute): Fix up syntax errors
28076 2021-04-01 Jakub Jelinek <jakub@redhat.com>
28078 PR tree-optimization/96573
28079 * gimple-ssa-store-merging.c (init_symbolic_number): Handle
28080 also pointer types.
28082 2021-04-01 Richard Biener <rguenther@suse.de>
28084 PR tree-optimization/99856
28085 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Promote
28086 precision to vector element precision.
28088 2021-04-01 Martin Jambor <mjambor@suse.cz>
28090 PR tree-optimization/97009
28091 * tree-sra.c (access_or_its_child_written): New function.
28092 (propagate_subaccesses_from_rhs): Use it instead of a simple grp_write
28095 2021-03-31 Jan Hubicka <hubicka@ucw.cz>
28098 * cif-code.def (USES_COMDAT_LOCAL): Make CIF_FINAL_NORMAL.
28100 2021-03-31 Pat Haugen <pthaugen@linux.ibm.com>
28103 * config/rs6000/altivec.md (xxspltiw_v4si, xxspltiw_v4sf_inst,
28104 xxspltidp_v2df_inst, xxsplti32dx_v4si_inst, xxsplti32dx_v4sf_inst,
28105 xxblend_<mode>, xxpermx_inst, xxeval): Mark prefixed.
28106 * config/rs6000/mma.md (mma_<vvi4i4i8>, mma_<avvi4i4i8>,
28107 mma_<vvi4i4i2>, mma_<avvi4i4i2>, mma_<vvi4i4>, mma_<avvi4i4>,
28108 mma_<pvi4i2>, mma_<apvi4i2>, mma_<vvi4i4i4>, mma_<avvi4i4i4>):
28110 * config/rs6000/rs6000.c (rs6000_final_prescan_insn): Adjust test.
28111 * config/rs6000/rs6000.md (define_attr "maybe_prefixed"): New.
28112 (define_attr "prefixed"): Update initializer.
28114 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28117 * dwarf2out.c (debug_ranges_dwo_section): New variable.
28118 (DW_RANGES_IDX_SKELETON): Define.
28119 (struct dw_ranges): Add begin_entry and end_entry members.
28120 (DEBUG_DWO_RNGLISTS_SECTION): Define.
28121 (add_ranges_num): Adjust r initializer for addition of *_entry
28123 (add_ranges_by_labels): For -gsplit-dwarf and force_direct,
28124 set idx to DW_RANGES_IDX_SKELETON.
28125 (use_distinct_base_address_for_range): New function.
28126 (index_rnglists): Don't set r->idx if it is equal to
28127 DW_RANGES_IDX_SKELETON. Initialize r->begin_entry and
28128 r->end_entry for -gsplit-dwarf if those will be needed by
28130 (output_rnglists): Add DWO argument. If true, switch to
28131 debug_ranges_dwo_section rather than debug_ranges_section.
28132 Adjust l1/l2 label indexes. Only output the offset table when
28133 dwo is true and don't include in there the skeleton range
28134 entry if present. For -gsplit-dwarf, skip ranges that belong
28135 to the other rnglists section. Change return type from void
28136 to bool and return true if there are any range entries for
28137 the other section. For dwarf_split_debug_info use
28138 DW_RLE_startx_endx, DW_RLE_startx_length and DW_RLE_base_addressx
28139 entries instead of DW_RLE_start_end, DW_RLE_start_length and
28140 DW_RLE_base_address. Use use_distinct_base_address_for_range.
28141 (init_sections_and_labels): Initialize debug_ranges_dwo_section
28142 if -gsplit-dwarf and DWARF >= 5. Adjust ranges_section_label
28143 and range_base_label indexes.
28144 (dwarf2out_finish): Call index_rnglists earlier before finalizing
28145 .debug_addr. Never emit DW_AT_rnglists_base attribute. For
28146 -gsplit-dwarf and DWARF >= 5 call output_rnglists up to twice
28147 with different dwo arguments.
28148 (dwarf2out_c_finalize): Clear debug_ranges_dwo_section.
28150 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28152 PR tree-optimization/98268
28153 * gimple-fold.c (maybe_canonicalize_mem_ref_addr): Call
28154 recompute_tree_invariant_for_addr_expr after successfully
28155 folding a TARGET_MEM_REF that occurs inside an ADDR_EXPR.
28157 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28159 PR tree-optimization/99726
28160 * tree-data-ref.c (create_intersect_range_checks_index): Bail
28161 out if there is more than one access function SCEV for the loop
28164 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28166 PR rtl-optimization/97141
28167 PR rtl-optimization/98726
28168 * emit-rtl.c (valid_for_const_vector_p): Return true for
28170 * rtx-vector-builder.h (rtx_vector_builder::step): Return a
28171 poly_wide_int instead of a wide_int.
28172 (rtx_vector_builder::apply_set): Take a poly_wide_int instead
28174 * rtx-vector-builder.c (rtx_vector_builder::apply_set): Likewise.
28175 * config/aarch64/aarch64.c (aarch64_legitimate_constant_p): Return
28176 false for CONST_VECTORs that cannot be forced to memory.
28177 * config/aarch64/aarch64-simd.md (mov<mode>): If a CONST_VECTOR
28178 is too complex to force to memory, build it up from individual
28181 2021-03-31 Jan Hubicka <jh@suse.cz>
28184 * cgraph.c (cgraph_node::release_body): Fix overactive check.
28186 2021-03-31 Christophe Lyon <christophe.lyon@linaro.org>
28189 * config/arm/vec-common.md (mul<mode>3): Disable on iwMMXT, expect
28192 2021-03-31 H.J. Lu <hjl.tools@gmail.com>
28194 * config/i386/i386-expand.c (expand_set_or_cpymem_via_rep):
28195 For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, don't convert QImode
28197 (decide_alg): For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, use
28198 "rep movsb/stosb" only for known sizes.
28199 * config/i386/i386-options.c (processor_cost_table): Use Ice
28200 Lake cost for Cannon Lake, Ice Lake, Tiger Lake, Sapphire
28201 Rapids and Alder Lake.
28202 * config/i386/i386.h (TARGET_PREFER_KNOWN_REP_MOVSB_STOSB): New.
28203 * config/i386/x86-tune-costs.h (icelake_memcpy): New.
28204 (icelake_memset): Likewise.
28205 (icelake_cost): Likewise.
28206 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
28209 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28212 * config/aarch64/aarch64.c
28213 (aarch64_vectorize_preferred_vector_alignment): Query the size
28214 of the provided SVE vector; do not assume that all SVE vectors
28215 have the same size.
28217 2021-03-31 Jan Hubicka <jh@suse.cz>
28220 * cgraph.c (cgraph_node::release_body): Remove all callers and
28222 * cgraphclones.c (cgraph_node::materialize_clone): Do not do it here.
28223 * cgraphunit.c (cgraph_node::expand): And here.
28225 2021-03-31 Martin Liska <mliska@suse.cz>
28227 * ipa-modref.c (analyze_ssa_name_flags): Fix coding style
28228 and one negated condition.
28230 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28231 Richard Sandiford <richard.sandiford@arm.com>
28234 * config/aarch64/aarch64.md (*add<mode>3_poly_1): Swap Uai and Uav
28235 constraints on operands[2] and similarly 0 and rk constraints
28236 on operands[1] corresponding to that.
28238 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28241 * configure.ac (HAVE_LD_BROKEN_PE_DWARF5): New AC_DEFINE if PECOFF
28242 linker doesn't support DWARF sections new in DWARF5.
28243 * config/i386/i386-options.c (ix86_option_override_internal): Default
28244 to dwarf_version 4 if HAVE_LD_BROKEN_PE_DWARF5 for TARGET_PECOFF
28246 * config.in: Regenerated.
28247 * configure: Regenerated.
28249 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28252 * config/aarch64/aarch64.c (aarch64_analyze_loop_vinfo): Check for
28253 available issue_info before using it.
28255 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28258 * config/aarch64/aarch64.md (sub<mode>3_compare1_imm): Do not allow zero
28261 2021-03-30 Xionghu Luo <luoxhu@linux.ibm.com>
28264 * config/rs6000/altivec.md (altivec_lvsl_reg): Change to ...
28265 (altivec_lvsl_reg_<mode>): ... this.
28266 (altivec_lvsr_reg): Change to ...
28267 (altivec_lvsr_reg_<mode>): ... this.
28268 * config/rs6000/predicates.md (vec_set_index_operand): New.
28269 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
28270 Enable 32bit variable vec_insert for all TARGET_VSX.
28271 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
28272 Enable 32bit variable vec_insert for p9 and above.
28273 (rs6000_expand_vector_set_var_p8): Rename to ...
28274 (rs6000_expand_vector_set_var_p7): ... this.
28275 (rs6000_expand_vector_set): Use TARGET_VSX and adjust assert
28277 * config/rs6000/vector.md (vec_set<mode>): Use vec_set_index_operand.
28278 * config/rs6000/vsx.md (xl_len_r): Use gen_altivec_lvsl_reg_di and
28279 gen_altivec_lvsr_reg_di.
28281 2021-03-30 H.J. Lu <hjl.tools@gmail.com>
28284 * config/i386/ia32intrin.h (__rdtsc): Defined as macro.
28285 (__rdtscp): Likewise.
28287 2021-03-30 Tamar Christina <tamar.christina@arm.com>
28289 PR tree-optimization/99825
28290 * tree-vect-slp-patterns.c (vect_check_evenodd_blend):
28291 Reject non-mult 2 lanes.
28293 2021-03-30 Richard Earnshaw <rearnsha@arm.com>
28296 * config/arm/arm.c (arm_file_start): Fix emission of
28297 Tag_ABI_VFP_args attribute.
28299 2021-03-30 Richard Biener <rguenther@suse.de>
28301 PR tree-optimization/99824
28302 * stor-layout.c (set_min_and_max_values_for_integral_type):
28303 Assert the precision is within the bounds of
28304 WIDE_INT_MAX_PRECISION.
28305 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Use
28306 the outermost component ref only to lower the access size
28307 and initialize that from the access type.
28309 2021-03-30 Richard Sandiford <richard.sandiford@arm.com>
28312 * config/aarch64/aarch64.md (mov<mode>): Pass multi-instruction
28313 CONST_INTs to aarch64_expand_mov_immediate when called after RA.
28315 2021-03-30 Mihailo Stojanovic <mihailo.stojanovic@typhoon-hil.com>
28317 * config/aarch64/aarch64.md
28318 (<optab>_trunc<fcvt_target><GPI:mode>2): Set the "arch"
28319 attribute to disambiguate between SIMD and FP variants of the
28322 2021-03-29 Jan Hubicka <hubicka@ucw.cz>
28324 * ipa-modref.c (merge_call_lhs_flags): Correct handling of deref.
28325 (analyze_ssa_name_flags): Fix typo in comment.
28327 2021-03-29 Alex Coplan <alex.coplan@arm.com>
28330 * config/aarch64/aarch64-sve-builtins.cc
28331 (function_builder::add_function): Add placeholder_p argument, use
28332 placeholder decls if this is set.
28333 (function_builder::add_unique_function): Instead of conditionally adding
28334 direct overloads, unconditionally add either a direct overload or a
28336 (function_builder::add_overloaded_function): Set placeholder_p if we're
28337 using C++ overloads. Use the obstack for string storage instead
28338 of relying on the tree nodes.
28339 (function_builder::add_overloaded_functions): Don't return early for
28340 m_direct_overloads: we need to add placeholders.
28341 * config/aarch64/aarch64-sve-builtins.h
28342 (function_builder::add_function): Add placeholder_p argument.
28344 2021-03-29 Richard Biener <rguenther@suse.de>
28346 PR tree-optimization/99807
28347 * tree-vect-slp.c (vect_slp_analyze_node_operations_1): Move
28348 assert below VEC_PERM handling.
28350 2021-03-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28353 * config/aarch64/aarch64-simd.md (move_lo_quad_internal_<mode>): Use
28354 aarch64_simd_or_scalar_imm_zero to match zeroes. Remove pattern
28355 matching const_int 0.
28356 (move_lo_quad_internal_be_<mode>): Likewise.
28357 (move_lo_quad_<mode>): Update for the above.
28358 * config/aarch64/iterators.md (VQ_2E): Delete.
28360 2021-03-29 Jakub Jelinek <jakub@redhat.com>
28362 PR tree-optimization/99777
28363 * fold-const.c (extract_muldiv_1): For conversions, punt on casts from
28364 types other than scalar integral types.
28366 2021-03-28 David Edelsohn <dje.gcc@gmail.com>
28368 * config/rs6000/rs6000.c (rs6000_output_dwarf_dtprel): Do not add
28369 XCOFF TLS reloc decorations.
28371 2021-03-28 Gerald Pfeifer <gerald@pfeifer.com>
28373 * doc/analyzer.texi (Analyzer Internals): Update link to
28374 "A Memory Model for Static Analysis of C Programs".
28376 2021-03-26 David Edelsohn <dje.gcc@gmail.com>
28378 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Call function.
28379 * config/rs6000/rs6000-protos.h (rs6000_special_adjust_field_align):
28381 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align): New.
28382 (rs6000_special_round_type_align): Recursively check innermost first
28385 2021-03-26 Jakub Jelinek <jakub@redhat.com>
28388 * dwarf2out.h (struct dw_fde_node): Add rule18 member.
28389 * dwarf2cfi.c (dwarf2out_frame_debug_expr): When handling (set hfp sp)
28390 assignment with drap_reg active, queue reg save for hfp with offset 0
28391 and flush queued reg saves. When handling a push with rule18,
28392 defer queueing reg save for hfp and just assert the offset is 0.
28393 (scan_trace): Assert that fde->rule18 is false.
28395 2021-03-26 Vladimir Makarov <vmakarov@redhat.com>
28398 * ira-costs.c (record_reg_classes): Put case with
28399 CT_RELAXED_MEMORY adjacent to one with CT_MEMORY.
28400 * ira.c (ira_setup_alts): Ditto.
28401 * lra-constraints.c (process_alt_operands): Ditto.
28402 * recog.c (asm_operand_ok): Ditto.
28403 * reload.c (find_reloads): Ditto.
28405 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28407 * config/aarch64/aarch64-protos.h
28408 (cpu_addrcost_table::post_modify_ld3_st3): New member variable.
28409 (cpu_addrcost_table::post_modify_ld4_st4): Likewise.
28410 * config/aarch64/aarch64.c (generic_addrcost_table): Update
28411 accordingly, using the same costs as for post_modify.
28412 (exynosm1_addrcost_table, xgene1_addrcost_table): Likewise.
28413 (thunderx2t99_addrcost_table, thunderx3t110_addrcost_table):
28414 (tsv110_addrcost_table, qdf24xx_addrcost_table): Likewise.
28415 (a64fx_addrcost_table): Likewise.
28416 (neoversev1_addrcost_table): New.
28417 (neoversev1_tunings): Use neoversev1_addrcost_table.
28418 (aarch64_address_cost): Use the new post_modify costs for CImode
28421 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28423 * config/aarch64/aarch64.opt
28424 (-param=aarch64-loop-vect-issue-rate-niters=): New parameter.
28425 * doc/invoke.texi: Document it.
28426 * config/aarch64/aarch64-protos.h (aarch64_base_vec_issue_info)
28427 (aarch64_scalar_vec_issue_info, aarch64_simd_vec_issue_info)
28428 (aarch64_advsimd_vec_issue_info, aarch64_sve_vec_issue_info)
28429 (aarch64_vec_issue_info): New structures.
28430 (cpu_vector_cost): Write comments above the variables rather
28432 (cpu_vector_cost::issue_info): New member variable.
28433 * config/aarch64/aarch64.c: Include gimple-pretty-print.h
28434 and tree-ssa-loop-niter.h.
28435 (generic_vector_cost, a64fx_vector_cost, qdf24xx_vector_cost)
28436 (thunderx_vector_cost, tsv110_vector_cost, cortexa57_vector_cost)
28437 (exynosm1_vector_cost, xgene1_vector_cost, thunderx2t99_vector_cost)
28438 (thunderx3t110_vector_cost): Initialize issue_info to null.
28439 (neoversev1_scalar_issue_info, neoversev1_advsimd_issue_info)
28440 (neoversev1_sve_issue_info, neoversev1_vec_issue_info): New structures.
28441 (neoversev1_vector_cost): Use them.
28442 (aarch64_vec_op_count, aarch64_sve_op_count): New structures.
28443 (aarch64_vector_costs::saw_sve_only_op): New member variable.
28444 (aarch64_vector_costs::num_vector_iterations): Likewise.
28445 (aarch64_vector_costs::scalar_ops): Likewise.
28446 (aarch64_vector_costs::advsimd_ops): Likewise.
28447 (aarch64_vector_costs::sve_ops): Likewise.
28448 (aarch64_vector_costs::seen_loads): Likewise.
28449 (aarch64_simd_vec_costs_for_flags): New function.
28450 (aarch64_analyze_loop_vinfo): Initialize num_vector_iterations.
28451 Count the number of predicate operations required by SVE WHILE
28453 (aarch64_comparison_type, aarch64_multiply_add_p): New functions.
28454 (aarch64_sve_only_stmt_p, aarch64_in_loop_reduction_latency): Likewise.
28455 (aarch64_count_ops): Likewise.
28456 (aarch64_add_stmt_cost): Record whether see an SVE operation
28457 that cannot currently be implementing using Advanced SIMD.
28458 Record issue information about the scalar, Advanced SIMD
28459 and (where relevant) SVE versions of a loop.
28460 (aarch64_vec_op_count::dump): New function.
28461 (aarch64_sve_op_count::dump): Likewise.
28462 (aarch64_estimate_min_cycles_per_iter): Likewise.
28463 (aarch64_adjust_body_cost): If issue information is available,
28464 try to compare the issue rates of the various loop implementations
28465 and increase or decrease the vector body cost accordingly.
28467 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28469 * config/aarch64/aarch64.c (aarch64_detect_vector_stmt_subtype):
28470 Assume a zero cost for induction phis.
28472 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28474 * config/aarch64/aarch64.c (aarch64_embedded_comparison_type): New
28476 (aarch64_adjust_stmt_cost): Add the costs of embedded scalar and
28477 vector comparisons.
28479 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28481 * config/aarch64/aarch64.c (aarch64_detect_scalar_stmt_subtype):
28483 (aarch64_add_stmt_cost): Call it.
28485 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28487 * config/aarch64/aarch64-tuning-flags.def (matched_vector_throughput):
28488 New tuning parameter.
28489 * config/aarch64/aarch64.c (neoversev1_tunings): Use it.
28490 (aarch64_estimated_sve_vq): New function.
28491 (aarch64_vector_costs::analyzed_vinfo): New member variable.
28492 (aarch64_vector_costs::is_loop): Likewise.
28493 (aarch64_vector_costs::unrolled_advsimd_niters): Likewise.
28494 (aarch64_vector_costs::unrolled_advsimd_stmts): Likewise.
28495 (aarch64_record_potential_advsimd_unrolling): New function.
28496 (aarch64_analyze_loop_vinfo, aarch64_analyze_bb_vinfo): Likewise.
28497 (aarch64_add_stmt_cost): Call aarch64_analyze_loop_vinfo or
28498 aarch64_analyze_bb_vinfo on the first use of a costs structure.
28499 Detect whether we're vectorizing a loop for SVE that might be
28500 completely unrolled if it used Advanced SIMD instead.
28501 (aarch64_adjust_body_cost_for_latency): New function.
28502 (aarch64_finish_cost): Call it.
28504 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28506 * config/aarch64/aarch64.c (aarch64_vector_costs): New structure.
28507 (aarch64_init_cost): New function.
28508 (aarch64_add_stmt_cost): Use aarch64_vector_costs instead of
28509 the default unsigned[3].
28510 (aarch64_finish_cost, aarch64_destroy_cost_data): New functions.
28511 (TARGET_VECTORIZE_INIT_COST): Override.
28512 (TARGET_VECTORIZE_FINISH_COST): Likewise.
28513 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
28515 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28517 * config/aarch64/aarch64.c (neoversev1_advsimd_vector_cost)
28518 (neoversev1_sve_vector_cost): New cost structures.
28519 (neoversev1_vector_cost): Likewise.
28520 (neoversev1_tunings): Use them. Enable use_new_vector_costs.
28522 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28524 * config/aarch64/aarch64-protos.h
28525 (sve_vec_cost::scatter_store_elt_cost): New member variable.
28526 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
28527 accordingly, taking the cost from the cost of a scalar_store.
28528 (a64fx_sve_vector_cost): Likewise.
28529 (aarch64_detect_vector_stmt_subtype): Detect scatter stores.
28531 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28533 * config/aarch64/aarch64-protos.h
28534 (simd_vec_cost::store_elt_extra_cost): New member variable.
28535 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28536 accordingly, using the vec_to_scalar cost for the new field.
28537 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28538 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28539 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28540 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28541 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28542 (thunderx3t110_advsimd_vector_cost): Likewise.
28543 (aarch64_detect_vector_stmt_subtype): Detect single-element stores.
28545 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28547 * config/aarch64/aarch64-protos.h (simd_vec_cost::ld2_st2_permute_cost)
28548 (simd_vec_cost::ld3_st3_permute_cost): New member variables.
28549 (simd_vec_cost::ld4_st4_permute_cost): Likewise.
28550 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28551 accordingly, using zero for the new costs.
28552 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28553 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28554 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28555 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28556 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28557 (thunderx3t110_advsimd_vector_cost): Likewise.
28558 (aarch64_ld234_st234_vectors): New function.
28559 (aarch64_adjust_stmt_cost): Likewise.
28560 (aarch64_add_stmt_cost): Call aarch64_adjust_stmt_cost if using
28561 the new vector costs.
28563 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28565 * config/aarch64/aarch64-protos.h (sve_vec_cost): Turn into a
28566 derived class of simd_vec_cost. Add information about CLAST[AB]
28567 and FADDA instructions.
28568 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
28569 accordingly, using the vec_to_scalar costs for the new fields.
28570 (a64fx_sve_vector_cost): Likewise.
28571 (aarch64_reduc_type): New function.
28572 (aarch64_sve_in_loop_reduction_latency): Likewise.
28573 (aarch64_detect_vector_stmt_subtype): Take a vinfo parameter.
28574 Use aarch64_sve_in_loop_reduction_latency to handle SVE reductions
28575 that occur in the loop body.
28576 (aarch64_add_stmt_cost): Update call accordingly.
28578 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28580 * config/aarch64/aarch64-tuning-flags.def (use_new_vector_costs):
28582 * config/aarch64/aarch64-protos.h (simd_vec_cost): Put comments
28583 above the fields rather than to the right.
28584 (simd_vec_cost::reduc_i8_cost): New member variable.
28585 (simd_vec_cost::reduc_i16_cost): Likewise.
28586 (simd_vec_cost::reduc_i32_cost): Likewise.
28587 (simd_vec_cost::reduc_i64_cost): Likewise.
28588 (simd_vec_cost::reduc_f16_cost): Likewise.
28589 (simd_vec_cost::reduc_f32_cost): Likewise.
28590 (simd_vec_cost::reduc_f64_cost): Likewise.
28591 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28592 accordingly, using the vec_to_scalar_cost for the new fields.
28593 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28594 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28595 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28596 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28597 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28598 (thunderx3t110_advsimd_vector_cost): Likewise.
28599 (aarch64_use_new_vector_costs_p): New function.
28600 (aarch64_simd_vec_costs): New function, split out from...
28601 (aarch64_builtin_vectorization_cost): ...here.
28602 (aarch64_is_reduction): New function.
28603 (aarch64_detect_vector_stmt_subtype): Likewise.
28604 (aarch64_add_stmt_cost): Call aarch64_detect_vector_stmt_subtype if
28605 using the new vector costs.
28607 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28610 * tree-emutls.c (get_emutls_init_templ_addr): Mark initializer of weak
28611 TLS declarations as public.
28613 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28615 * config/aarch64/aarch64-d.c (IN_TARGET_CODE): Define.
28616 * config/arm/arm-d.c (IN_TARGET_CODE): Likewise.
28617 * config/i386/i386-d.c (IN_TARGET_CODE): Likewise.
28618 * config/mips/mips-d.c (IN_TARGET_CODE): Likewise.
28619 * config/pa/pa-d.c (IN_TARGET_CODE): Likewise.
28620 * config/riscv/riscv-d.c (IN_TARGET_CODE): Likewise.
28621 * config/rs6000/rs6000-d.c (IN_TARGET_CODE): Likewise.
28622 * config/s390/s390-d.c (IN_TARGET_CODE): Likewise.
28623 * config/sparc/sparc-d.c (IN_TARGET_CODE): Likewise.
28625 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28628 * config.gcc (*-*-cygwin*): Add winnt-d.o
28629 (*-*-mingw*): Likewise.
28630 * config/i386/cygwin.h (EXTRA_TARGET_D_OS_VERSIONS): New macro.
28631 * config/i386/mingw32.h (EXTRA_TARGET_D_OS_VERSIONS): Likewise.
28632 * config/i386/t-cygming: Add winnt-d.o.
28633 * config/i386/winnt-d.c: New file.
28635 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28637 * config/freebsd-d.c: Include memmodel.h.
28639 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28642 * config.gcc (*-*-openbsd*): Add openbsd-d.o.
28643 * config/t-openbsd: Add openbsd-d.o.
28644 * config/openbsd-d.c: New file.
28646 2021-03-25 Stam Markianos-Wright <stam.markianos-wright@arm.com>
28648 PR tree-optimization/96974
28649 * tree-vect-stmts.c (vect_get_vector_types_for_stmt): Replace assert
28650 with graceful exit.
28652 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28655 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28659 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
28660 always_inline in system headers.
28662 2021-03-25 Kewen Lin <linkw@linux.ibm.com>
28664 * tree-vect-loop.c (vect_model_reduction_cost): Init inside_cost.
28666 2021-03-25 Jakub Jelinek <jakub@redhat.com>
28669 * tree-core.h (enum operand_equal_flag): Add OEP_ADDRESS_OF_SAME_FIELD.
28670 * fold-const.c (operand_compare::operand_equal_p): Don't compare
28671 field offsets if OEP_ADDRESS_OF_SAME_FIELD.
28673 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28677 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
28678 always_inline in system headers.
28680 2021-03-25 Richard Biener <rguenther@suse.de>
28682 PR tree-optimization/99746
28683 * tree-vect-slp-patterns.c (complex_pattern::build): Do not mark
28684 the scalar stmt as patterned. Instead set up required things
28687 2021-03-25 Xionghu Luo <luoxhu@linux.ibm.com>
28689 * config/rs6000/rs6000.c (power8_costs): Change l2 cache
28692 2021-03-24 Martin Liska <mliska@suse.cz>
28695 * common/config/i386/i386-common.c (ARRAY_SIZE): Fix off-by-one
28697 * config/i386/i386-options.c (ix86_option_override_internal):
28698 Add run-time assert.
28700 2021-03-24 Martin Jambor <mjambor@suse.cz>
28703 * ipa-cp.c (initialize_node_lattices): Mark as bottom all
28704 parameters with unknown type.
28705 (ipacp_value_safe_for_type): New function.
28706 (propagate_vals_across_arith_jfunc): Verify that the constant type
28707 can be used for a type of the formal parameter.
28708 (propagate_vals_across_ancestor): Likewise.
28709 (propagate_scalar_across_jump_function): Likewise. Pass the type
28710 also to propagate_vals_across_ancestor.
28712 2021-03-24 Christophe Lyon <christophe.lyon@linaro.org>
28715 * config/arm/mve.md (movmisalign<mode>_mve_store): Use Ux
28717 (movmisalign<mode>_mve_load): Likewise.
28719 2021-03-24 Jakub Jelinek <jakub@redhat.com>
28722 * config/arm/vec-common.md (one_cmpl<mode>2, neg<mode>2,
28723 movmisalign<mode>): Disable expanders for TARGET_REALLY_IWMMXT.
28725 2021-03-24 Alexandre Oliva <oliva@adacore.com>
28727 * doc/sourcebuild.texi (sysconf): New effective target.
28729 2021-03-24 Alexandre Oliva <oliva@adacore.com>
28731 * config/i386/predicates.md (reg_or_const_vec_operand): New.
28732 * config/i386/sse.md (ssse3_pshufbv8qi3): Add an expander for
28733 the now *-prefixed insn_and_split, turn the splitter const vec
28734 into an input for the insn, making it an ignored immediate for
28735 non-split cases, and loaded into the scratch register
28738 2021-03-23 Vladimir N. Makarov <vmakarov@redhat.com>
28741 * config/aarch64/constraints.md (Utq, UOb, UOh, UOw, UOd, UOty):
28742 Use define_relaxed_memory_constraint for them.
28744 2021-03-23 Iain Sandoe <iain@sandoe.co.uk>
28747 * config/host-darwin.c (darwin_gt_pch_use_address): Add a
28748 colon to the diagnostic message.
28750 2021-03-23 Ilya Leoshkevich <iii@linux.ibm.com>
28752 * fwprop.c (fwprop_propagation::fwprop_propagation): Look at
28754 (try_fwprop_subst_note): Use set_info instead of insn_info.
28755 (try_fwprop_subst_pattern): Likewise.
28756 (try_fwprop_subst_notes): Likewise.
28757 (try_fwprop_subst): Likewise.
28758 (forward_propagate_subreg): Likewise.
28759 (forward_propagate_and_simplify): Likewise.
28760 (forward_propagate_into): Likewise.
28761 * rtl-ssa/accesses.h (set_info::single_nondebug_use) New
28763 (set_info::single_nondebug_insn_use): Likewise.
28764 (set_info::single_phi_use): Likewise.
28765 * rtl-ssa/member-fns.inl (set_info::single_nondebug_use) New
28767 (set_info::single_nondebug_insn_use): Likewise.
28768 (set_info::single_phi_use): Likewise.
28770 2021-03-23 Christophe Lyon <christophe.lyon@linaro.org>
28772 * doc/sourcebuild.texi (arm_dsp_ok, arm_dsp): Document.
28774 2021-03-23 Jakub Jelinek <jakub@redhat.com>
28777 * config/aarch64/aarch64.c (aarch64_add_offset): Tell
28778 expand_mult to perform an unsigned rather than a signed
28781 2021-03-23 H.J. Lu <hjl.tools@gmail.com>
28784 * config/i386/cpuid.h (__cpuid): Add __volatile__.
28785 (__cpuid_count): Likewise.
28787 2021-03-23 Richard Biener <rguenther@suse.de>
28789 PR tree-optimization/99721
28790 * tree-vect-slp.c (vect_slp_analyze_node_operations):
28791 Make sure we can schedule the node.
28793 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28795 * config/riscv/riscv.c (riscv_subword): Take endianness into
28796 account when calculating the byte offset.
28798 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28800 * config/riscv/predicates.md (subreg_lowpart_operator): New predicate
28801 * config/riscv/riscv.md (*addsi3_extended2, *subsi3_extended2)
28802 (*negsi2_extended2, *mulsi3_extended2, *<optab>si3_mask)
28803 (*<optab>si3_mask_1, *<optab>di3_mask, *<optab>di3_mask_1)
28804 (*<optab>si3_extend_mask, *<optab>si3_extend_mask_1): Use
28805 new predicate "subreg_lowpart_operator"
28807 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28809 * config/riscv/riscv.c (riscv_swap_instruction): New function
28810 to byteswap an SImode rtx containing an instruction.
28811 (riscv_trampoline_init): Byteswap the generated instructions
28814 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28816 * common/config/riscv/riscv-common.c
28817 (TARGET_DEFAULT_TARGET_FLAGS): Set default endianness.
28818 * config.gcc (riscv32be-*, riscv64be-*): Set
28819 TARGET_BIG_ENDIAN_DEFAULT to 1.
28820 * config/riscv/elf.h (LINK_SPEC): Change -melf* value
28821 depending on default endianness.
28822 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
28823 * config/riscv/linux.h (LINK_SPEC): Likewise.
28824 * config/riscv/riscv.c (TARGET_DEFAULT_TARGET_FLAGS): Set
28825 default endianness.
28826 * config/riscv/riscv.h (DEFAULT_ENDIAN_SPEC): New macro.
28828 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28830 * config/riscv/elf.h (LINK_SPEC): Pass linker endianness flag.
28831 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
28832 * config/riscv/linux.h (LINK_SPEC): Likewise.
28833 * config/riscv/riscv.h (ASM_SPEC): Pass -mbig-endian and
28835 (BYTES_BIG_ENDIAN): Handle big endian.
28836 (WORDS_BIG_ENDIAN): Define to BYTES_BIG_ENDIAN.
28837 * config/riscv/riscv.opt (-mbig-endian, -mlittle-endian): New
28839 * doc/invoke.texi (-mbig-endian, -mlittle-endian): Document.
28841 2021-03-23 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
28843 * regcprop.c (find_oldest_value_reg): Ask target whether
28844 different mode is fine for replacement register.
28846 2021-03-23 Aldy Hernandez <aldyh@redhat.com>
28848 PR tree-optimization/99296
28849 * value-range.cc (irange::irange_set_1bit_anti_range): New.
28850 (irange::irange_set_anti_range): Call irange_set_1bit_anti_range
28851 * value-range.h (irange::irange_set_1bit_anti_range): New.
28853 2021-03-22 Vladimir N. Makarov <vmakarov@redhat.com>
28856 * config/aarch64/constraints.md (UtQ): Use
28857 define_relaxed_memory_constraint for it.
28858 * doc/md.texi (define_relaxed_memory_constraint): Describe it.
28859 * genoutput.c (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28860 * genpreds.c (constraint_data): Add bitfield is_relaxed_memory.
28861 (have_relaxed_memory_constraints): New static var.
28862 (relaxed_memory_start, relaxed_memory_end): Ditto.
28863 (add_constraint): Add arg is_relaxed_memory. Check name for
28864 relaxed memory. Set up is_relaxed_memory in constraint_data and
28865 have_relaxed_memory_constraints. Adjust calls.
28866 (choose_enum_order): Process relaxed memory.
28867 (write_tm_preds_h): Ditto.
28868 (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28869 * gensupport.c (process_rtx): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28870 * ira-costs.c (record_reg_classes): Process CT_RELAXED_MEMORY.
28871 * ira-lives.c (single_reg_class): Use
28872 insn_extra_relaxed_memory_constraint.
28873 * ira.c (ira_setup_alts): CT_RELAXED_MEMORY.
28874 * lra-constraints.c (valid_address_p): Use
28875 insn_extra_relaxed_memory_constraint instead of other memory
28877 (process_alt_operands): Process CT_RELAXED_MEMORY.
28878 (curr_insn_transform): Use insn_extra_relaxed_memory_constraint.
28879 * recog.c (asm_operand_ok, preprocess_constraints): Process
28881 * reload.c (find_reloads): Ditto.
28882 * rtl.def (DEFINE_RELAXED_MEMORY_CONSTRAINT): New.
28883 * stmt.c (parse_input_constraint): Use
28884 insn_extra_relaxed_memory_constraint.
28886 2021-03-22 Segher Boessenkool <segher@kernel.crashing.org>
28889 * ubsan.c (ubsan_instrument_float_cast): Don't test for unordered if
28892 2021-03-22 Alex Coplan <alex.coplan@arm.com>
28895 * config/arm/arm-protos.h (neon_make_constant): Add generate
28896 argument to guard emitting insns, default to true.
28897 * config/arm/arm.c (arm_legitimate_constant_p_1): Reject
28898 CONST_VECTORs which neon_make_constant can't handle.
28899 (neon_vdup_constant): Add generate argument, avoid emitting
28900 insns if it's not set.
28901 (neon_make_constant): Plumb new generate argument through.
28902 * config/arm/constraints.md (Ui): New. Use it...
28903 * config/arm/mve.md (*mve_mov<mode>): ... here.
28904 * config/arm/vec-common.md (movv8hf): Use neon_make_constant to
28905 synthesize constants.
28907 2021-03-22 Richard Biener <rguenther@suse.de>
28909 * debug.h: Add deprecation warning.
28911 2021-03-22 Richard Biener <rguenther@suse.de>
28913 PR tree-optimization/99694
28914 * tree-ssa-sccvn.c (visit_phi): Ignore edges with the
28917 2021-03-22 Kito Cheng <kito.cheng@sifive.com>
28920 * config/riscv/riscv.c (riscv_expand_block_move): Get RTL value
28921 after type checking.
28923 2021-03-22 Jakub Jelinek <jakub@redhat.com>
28927 * dwarf2out.c (get_full_len): Use get_precision rather than
28929 (add_const_value_attribute): Make sure add_AT_wide argument has
28930 precision prec rather than some very wide one.
28932 2021-03-22 Kewen Lin <linkw@linux.ibm.com>
28934 * config/rs6000/rs6000.md (*rotldi3_insert_sf,
28935 *mov<SFDF:mode><SFDF2:mode>cc_p9, floatsi<mode>2_lfiwax,
28936 floatsi<mode>2_lfiwax_mem, floatunssi<mode>2_lfiwzx,
28937 floatunssi<mode>2_lfiwzx_mem, *floatsidf2_internal,
28938 *floatunssidf2_internal, fix_trunc<mode>si2_stfiwx,
28939 fix_trunc<mode>si2_internal, fixuns_trunc<mode>si2_stfiwx,
28940 *round32<mode>2_fprs, *roundu32<mode>2_fprs,
28941 *fix_trunc<mode>si2_internal): Fix empty split condition.
28942 * config/rs6000/vsx.md (*vsx_le_undo_permute_<mode>,
28943 vsx_reduc_<VEC_reduc_name>_v2df, vsx_reduc_<VEC_reduc_name>_v4sf,
28944 *vsx_reduc_<VEC_reduc_name>_v2df_scalar,
28945 *vsx_reduc_<VEC_reduc_name>_v4sf_scalar): Likewise.
28947 2021-03-22 Xionghu Luo <luoxhu@linux.ibm.com>
28950 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
28951 Convert idx to DImode.
28952 (rs6000_expand_vector_set_var_p8): Likewise.
28954 2021-03-21 Jakub Jelinek <jakub@redhat.com>
28957 * dwarf2out.c (insert_float): Change return type from void to
28958 unsigned, handle GET_MODE_SIZE (mode) == 2 and return element size.
28959 (mem_loc_descriptor, loc_descriptor, add_const_value_attribute):
28962 2021-03-20 H.J. Lu <hjl.tools@gmail.com>
28965 * config/i386/i386.c (construct_container): Check cfun != NULL
28966 before accessing silent_p.
28968 2021-03-20 Ahamed Husni <ahamedhusni73@gmail.com>
28970 * asan.c: Fix typos in comments.
28972 2021-03-20 Vladimir N. Makarov <vmakarov@redhat.com>
28974 PR rtl-optimization/99680
28975 * lra-constraints.c (skip_contraint_modifiers): Rename to skip_constraint_modifiers.
28976 (process_address_1): Check empty constraint before using
28979 2021-03-19 Pat Haugen <pthaugen@linux.ibm.com>
28981 * config/rs6000/rs6000.c (power10_cost): New.
28982 (rs6000_option_override_internal): Set Power10 costs.
28983 (rs6000_issue_rate): Set Power10 issue rate.
28984 * config/rs6000/power10.md: Rewrite for Power10.
28986 2021-03-19 Vladimir N. Makarov <vmakarov@redhat.com>
28989 * lra-constraints.c (process_address_1): Don't use unknown
28990 constraint for address constraint.
28992 2021-03-19 Iain Sandoe <iain@sandoe.co.uk>
28995 * config.gcc (powerpc-*-darwin8): Delete the reference to
28996 the now removed darwin8.h.
28998 2021-03-19 Olivier Hainque <hainque@adacore.com>
29001 * config/vxworksae.h (VX_CPU_PREFIX): Define.
29003 2021-03-19 John David Anglin <danglin@gcc.gnu.org>
29005 * config/pa/pa.c (import_milli): Use memcpy instead of strncpy.
29007 2021-03-19 Tamar Christina <tamar.christina@arm.com>
29009 PR tree-optimization/99656
29010 * tree-vect-slp-patterns.c (linear_loads_p,
29011 complex_add_pattern::matches, is_eq_or_top,
29012 vect_validate_multiplication, complex_mul_pattern::matches,
29013 complex_fms_pattern::matches): Remove complex_perm_kinds_t.
29014 * tree-vectorizer.h: (complex_load_perm_t): Removed.
29015 (slp_tree_to_load_perm_map_t): Use complex_perm_kinds_t instead of
29016 complex_load_perm_t.
29018 2021-03-19 H.J. Lu <hjl.tools@gmail.com>
29021 * config/i386/i386-options.c (ix86_init_machine_status): Set
29023 * config/i386/i386.c (init_cumulative_args): Set silent_p to
29025 (construct_container): Return early for return and argument
29026 errors if silent_p is true.
29027 * config/i386/i386.h (machine_function): Add silent_p.
29029 2021-03-19 Jakub Jelinek <jakub@redhat.com>
29032 * config/arm/constraints.md (Ds): New constraint.
29033 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): Use w,Ds
29034 constraint instead of w,Dm.
29036 2021-03-19 Andrew Stubbs <ams@codesourcery.com>
29038 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Fix quotes
29041 2021-03-19 Eric Botcazou <ebotcazou@adacore.com>
29043 PR middle-end/99641
29044 * fold-const.c (native_encode_initializer) <CONSTRUCTOR>: For an
29045 array type, do the computation of the current position in sizetype.
29047 2021-03-18 Vladimir N. Makarov <vmakarov@redhat.com>
29050 * lra-constraints.c (process_address_1): Use lookup_constraint
29051 only for a single constraint.
29053 2021-03-18 Martin Sebor <msebor@redhat.com>
29055 PR middle-end/99502
29056 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): Rename...
29057 (inbounds_memaccess_p): ...to this. Check the ending offset of
29058 the accessed member.
29060 2021-03-18 Andrew Stubbs <ams@codesourcery.com>
29062 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Add %< and
29063 %> quote markers to error messages.
29064 (gcn_goacc_validate_dims): Likewise.
29065 (gcn_conditional_register_usage): Remove exclaimation mark from error
29067 (gcn_vectorize_vec_perm_const): Ensure perm is fully uninitialized.
29069 2021-03-18 Jan Hubicka <hubicka@ucw.cz>
29071 * config/i386/x86-tune-costs.h (struct processor_costs): Fix costs of
29074 2021-03-18 Sinan Lin <sinan@isrc.iscas.ac.cn>
29075 Kito Cheng <kito.cheng@sifive.com>
29077 * config/riscv/riscv.c (riscv_block_move_straight): Change type
29078 to unsigned HOST_WIDE_INT for parameter and local variable with
29079 HOST_WIDE_INT type.
29080 (riscv_adjust_block_mem): Ditto.
29081 (riscv_block_move_loop): Ditto.
29082 (riscv_expand_block_move): Ditto.
29084 2021-03-18 Nick Clifton <nickc@redhat.com>
29086 * config/v850/v850.c (construct_restore_jr): Increase static
29088 (construct_save_jarl): Likewise.
29089 * config/v850/v850.h (DWARF2_DEBUGGING_INFO): Define.
29091 2021-03-18 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29093 * config/aarch64/aarch64.c (aarch64_adjust_generic_arch_tuning): Define.
29094 (aarch64_override_options_internal): Use it.
29095 (generic_tunings): Add AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS to
29098 2021-03-17 Sandra Loosemore <sandra@codesourcery.com>
29100 * config/nios2/nios2.c (nios2_custom_check_insns): Clean up
29101 error message format issues.
29102 (nios2_option_override): Likewise.
29103 (nios2_expand_fpu_builtin): Likewise.
29104 (nios2_init_custom_builtins): Adjust to avoid bogus strncpy
29105 truncation warning.
29106 (nios2_expand_custom_builtin): More error message format fixes.
29107 (nios2_expand_rdwrctl_builtin): Likewise.
29108 (nios2_expand_rdprs_builtin): Likewise.
29109 (nios2_expand_eni_builtin): Likewise.
29110 (nios2_expand_builtin): Likewise.
29111 (nios2_register_custom_code): Likewise.
29112 (nios2_valid_target_attribute_rec): Likewise.
29113 (nios2_add_insn_asm): Fix uninitialized variable warning.
29115 2021-03-17 Jan Hubicka <jh@suse.cz>
29117 * config/i386/x86-tune-costs.h (struct processor_costs): Update costs
29118 of gather to match reality.
29119 * config/i386/x86-tune.def (X86_TUNE_USE_GATHER): Enable for znver3.
29121 2021-03-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29123 * config/aarch64/aarch64-builtins.c (aarch64_expand_rng_builtin): Use EQ
29124 to compare against CC_REG rather than NE.
29126 2021-03-17 H.J. Lu <hjl.tools@gmail.com>
29129 * config/i386/i386.c (ix86_force_load_from_GOT_p): Support
29130 inline assembly statements.
29131 (ix86_print_operand): Update 'P' handling for -fno-plt.
29133 2021-03-17 Tamar Christina <tamar.christina@arm.com>
29136 * config/aarch64/aarch64.c
29137 (aarch64_simd_clone_compute_vecsize_and_simdlen): Remove unused var.
29139 2021-03-16 Segher Boessenkool <segher@kernel.crashing.org>
29142 * config/rs6000/predicates.md (branch_comparison_operator): Allow
29143 ordered and unordered for CCFPmode, if flag_finite_math_only.
29145 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29148 * config/i386/i386-expand.c (ix86_split_lea_for_addr): Emit a MULT
29149 rather than ASHIFT.
29150 * config/i386/i386.md (mult by 1248 into ashift): New splitter.
29152 2021-03-16 Martin Liska <mliska@suse.cz>
29155 * optc-save-gen.awk: Add flag_ipa_ra to exceptions for
29156 cl_optimization_compare function.
29158 2021-03-16 Ilya Leoshkevich <iii@linux.ibm.com>
29160 * config/s390/s390.c (f_constraint_p): Treat "fv" constraints
29163 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29166 * config/i386/i386.h (struct machine_function): Add
29167 has_explicit_vzeroupper bitfield.
29168 * config/i386/i386-expand.c (ix86_expand_builtin): Set
29169 cfun->machine->has_explicit_vzeroupper when expanding
29170 IX86_BUILTIN_VZEROUPPER.
29171 * config/i386/i386-features.c (rest_of_handle_insert_vzeroupper):
29172 Do the mode switching only when TARGET_VZEROUPPER, expensive
29173 optimizations turned on and not optimizing for size.
29174 (pass_insert_vzeroupper::gate): Enable even when
29175 cfun->machine->has_explicit_vzeroupper is set.
29177 2021-03-16 Jakub Jelinek <jakub@redhat.com>
29180 * config/aarch64/aarch64.c
29181 (aarch64_simd_clone_compute_vecsize_and_simdlen): If not a function
29182 definition, walk TYPE_ARG_TYPES list if non-NULL for argument types
29183 instead of DECL_ARGUMENTS. Ignore types for uniform arguments.
29185 2021-03-15 Richard Biener <rguenther@suse.de>
29187 PR tree-optimization/98834
29188 * tree-ssa-sccvn.c (vn_reference_lookup_3): Handle missing
29189 subsetting by truncating the access size.
29191 2021-03-15 Jan Hubicka <hubicka@ucw.cz>
29193 * config/i386/i386-options.c (processor_cost_table): Add znver3_cost.
29194 * config/i386/x86-tune-costs.h (znver3_cost): New gobal variable; copy
29197 2021-03-15 Martin Liska <mliska@suse.cz>
29199 * spellcheck.c: Add missing comma in initialization.
29201 2021-03-14 Uroš Bizjak <ubizjak@gmail.com>
29203 * config/i386/sse.md (*vec_extract<mode>): Merge alternative 0 with
29204 alternative 2 and alternative 1 with alternative 3 using
29205 YW register constraint.
29206 (*vec_extract<PEXTR_MODE12:mode>_zext): Merge alternatives
29207 using YW register constraint.
29208 (*vec_extractv16qi_zext): Ditto.
29209 (*vec_extractv4si): Merge alternatives 4 and 5
29210 using Yw register constraint.
29211 (*ssse3_palignr<mode>_perm): Use Yw instead of v for alternative 3.
29213 2021-03-13 Martin Sebor <msebor@redhat.com>
29215 PR tree-optimization/99489
29216 * builtins.c (gimple_call_alloc_size): Fail gracefully when argument
29217 is not a call statement.
29219 2021-03-13 Jakub Jelinek <jakub@redhat.com>
29221 PR tree-optimization/99544
29222 * match.pd (X + (X << C) -> X * (1 + (1 << C))): Don't simplify
29223 if for vector types multiplication can't be done in type's mode.
29225 2021-03-12 Eric Botcazou <ebotcazou@adacore.com>
29228 * config/sparc/constraints.md (w): Rename to...
29229 (W): ... this and ditch previous implementation.
29230 * config/sparc/sparc.md (*movdi_insn_sp64): Replace W with m.
29231 (*movdf_insn_sp64): Likewise.
29232 (*mov<VM64:mode>_insn_sp64): Likewise.
29233 * config/sparc/sync.md (*atomic_compare_and_swap<mode>_1): Replace
29235 (atomic_compare_and_swap_leon3_1): Likewise.
29236 (*atomic_compare_and_swapdi_v8plus): Likewise.
29237 * config/sparc/sparc.c (memory_ok_for_ldd): Remove useless test on
29238 architecture and add missing address validity check during LRA.
29240 2021-03-12 Tobias Burnus <tobias@codesourcery.com>
29243 * gimplify.c (omp_add_variable): Handle NULL_TREE as size
29244 occuring for assumed-size arrays in use_device_{ptr,addr}.
29246 2021-03-12 Jakub Jelinek <jakub@redhat.com>
29249 * config/i386/constraints.md (YW): New internal constraint.
29250 * config/i386/sse.md (v_Yw): Add V4TI, V2TI, V1TI and TI cases.
29251 (*<sse2_avx2>_<insn><mode>3<mask_name>,
29252 *<sse2_avx2>_uavg<mode>3<mask_name>, *abs<mode>2,
29253 *<s>mul<mode>3_highpart<mask_name>): Use <v_Yw> instead of v in
29255 (<sse2_avx2>_psadbw): Use YW instead of v in constraints.
29256 (*avx2_pmaddwd, *sse2_pmaddwd, *<code>v8hi3, *<code>v16qi3,
29257 avx2_pmaddubsw256, ssse3_pmaddubsw128): Merge last two alternatives
29258 into one, use Yw instead of former x,v.
29259 (ashr<mode>3, <insn><mode>3): Use <v_Yw> instead of x in constraints of
29260 the last alternative.
29261 (<sse2_avx2>_packsswb<mask_name>, <sse2_avx2>_packssdw<mask_name>,
29262 <sse2_avx2>_packuswb<mask_name>, <sse4_1_avx2>_packusdw<mask_name>,
29263 *<ssse3_avx2>_pmulhrsw<mode>3<mask_name>, <ssse3_avx2>_palignr<mode>,
29264 <ssse3_avx2>_pshufb<mode>3<mask_name>): Merge last two alternatives
29265 into one, use <v_Yw> instead of former x,v.
29266 (avx2_interleave_highv32qi<mask_name>,
29267 vec_interleave_highv16qi<mask_name>): Use Yw instead of v in
29268 constraints. Add && <mask_avx512bw_condition> to condition.
29269 (avx2_interleave_lowv32qi<mask_name>,
29270 vec_interleave_lowv16qi<mask_name>,
29271 avx2_interleave_highv16hi<mask_name>,
29272 vec_interleave_highv8hi<mask_name>,
29273 avx2_interleave_lowv16hi<mask_name>, vec_interleave_lowv8hi<mask_name>,
29274 avx2_pshuflw_1<mask_name>, sse2_pshuflw_1<mask_name>,
29275 avx2_pshufhw_1<mask_name>, sse2_pshufhw_1<mask_name>,
29276 avx2_<code>v16qiv16hi2<mask_name>, sse4_1_<code>v8qiv8hi2<mask_name>,
29277 *sse4_1_<code>v8qiv8hi2<mask_name>_1, <sse2_avx2>_<insn><mode>3): Use
29278 Yw instead of v in constraints.
29279 * config/i386/mmx.md (Yv_Yw): New define_mode_attr.
29280 (*mmx_<insn><mode>3, mmx_ashr<mode>3, mmx_<insn><mode>3): Use <Yv_Yw>
29281 instead of Yv in constraints.
29282 (*mmx_<insn><mode>3, *mmx_mulv4hi3, *mmx_smulv4hi3_highpart,
29283 *mmx_umulv4hi3_highpart, *mmx_pmaddwd, *mmx_<code>v4hi3,
29284 *mmx_<code>v8qi3, mmx_pack<s_trunsuffix>swb, mmx_packssdw,
29285 mmx_punpckhbw, mmx_punpcklbw, mmx_punpckhwd, mmx_punpcklwd,
29286 *mmx_uavgv8qi3, *mmx_uavgv4hi3, mmx_psadbw): Use Yw instead of Yv in
29288 (*mmx_pinsrw, *mmx_pinsrb, *mmx_pextrw, *mmx_pextrw_zext, *mmx_pextrb,
29289 *mmx_pextrb_zext): Use YW instead of Yv in constraints.
29290 (*mmx_eq<mode>3, mmx_gt<mode>3): Use x instead of Yv in constraints.
29291 (mmx_andnot<mode>3, *mmx_<code><mode>3): Split last alternative into
29292 two, one with just x, another isa avx512vl with v.
29294 2021-03-12 Martin Liska <mliska@suse.cz>
29296 * doc/invoke.texi: Add missing param documentation.
29298 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29301 * Makefile.in (ANALYZER_OBJS): Add analyzer/feasible-graph.o and
29302 analyzer/trimmed-graph.o.
29303 * doc/analyzer.texi (Analyzer Paths): Rewrite description of
29304 feasibility checking to reflect new implementation.
29305 * doc/invoke.texi (-fdump-analyzer-feasibility): Document new
29307 * shortest-paths.h (shortest_paths::get_shortest_distance): New.
29309 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29311 * digraph.cc (selftest::test_shortest_paths): Update
29312 shortest_paths init for new param. Add test of
29313 SPS_TO_GIVEN_TARGET.
29314 * shortest-paths.h (enum shortest_path_sense): New.
29315 (shortest_paths::shortest_paths): Add "sense" param.
29316 Update for renamings. Generalize to use "sense" param.
29317 (shortest_paths::get_shortest_path): Rename param.
29318 (shortest_paths::m_sense): New field.
29319 (shortest_paths::m_prev): Rename...
29320 (shortest_paths::m_best_edge): ...to this.
29321 (shortest_paths::get_shortest_path): Update for renamings.
29322 Conditionalize flipping of path on sense of traversal.
29324 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29326 * digraph.cc (selftest::test_shortest_paths): Add test coverage
29327 for paths from B and C.
29328 * shortest-paths.h (shortest_paths::shortest_paths): Handle
29329 unreachable nodes, rather than asserting.
29331 2021-03-11 David Edelsohn <dje.gcc@gmail.com>
29334 * config/rs6000/rs6000.c (rs6000_xcoff_file_start): Don't create
29335 xcoff_tbss_section_name.
29336 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .lcomm.
29337 * xcoffout.c (xcoff_tbss_section_name): Delete.
29338 * xcoffout.h (xcoff_tbss_section_name): Delete.
29340 2021-03-11 Richard Biener <rguenther@suse.de>
29342 PR tree-optimization/99523
29343 * tree-cfg.c (dump_function_to_file): Dump SSA names
29344 w/o identifier to the decls section as well, not only those
29345 without a VAR_DECL.
29347 2021-03-11 Jakub Jelinek <jakub@redhat.com>
29350 * ipa-icf-gimple.c (func_checker::compare_gimple_call): For internal
29351 function calls with lhs fail if the lhs don't have compatible types.
29353 2021-03-11 Hans-Peter Nilsson <hp@axis.com>
29355 * config/cris/cris.h (HARD_FRAME_POINTER_REGNUM): Define.
29356 Change FRAME_POINTER_REGNUM to correspond to a new faked
29357 register faked_fp, part of GENNONACR_REGS like faked_ap.
29358 (CRIS_FAKED_REGS_CONTENTS): New helper macro.
29359 (FIRST_PSEUDO_REGISTER, FIXED_REGISTERS, CALL_USED_REGISTERS):
29360 (REG_ALLOC_ORDER, REG_CLASS_CONTENTS, REGNO_OK_FOR_BASE_P)
29361 (ELIMINABLE_REGS, REGISTER_NAMES): Adjust accordingly.
29362 * config/cris/cris.md (CRIS_FP_REGNUM): Renumber to new faked
29364 (CRIS_REAL_FP_REGNUM): New constant.
29365 * config/cris/cris.c (cris_reg_saved_in_regsave_area): Check
29366 for HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM.
29367 (cris_initial_elimination_offset): Handle elimination changes
29368 to HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM
29369 and add one from FRAME_POINTER_REGNUM to
29370 HARD_FRAME_POINTER_REGNUM.
29371 (cris_expand_prologue, cris_expand_epilogue): Emit code for
29372 hard_frame_pointer_rtx instead of frame_pointer_rtx.
29374 2021-03-10 David Edelsohn <dje.gcc@gmail.com>
29377 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Add check for DCmode.
29378 * config/rs6000/rs6000.c (rs6000_special_round_type_align): Same.
29380 2021-03-10 Vladimir N. Makarov <vmakarov@redhat.com>
29383 * lra-constraints.c (process_address_1): Don't check unknown
29384 constraint, use X for empty constraint.
29386 2021-03-10 Alex Coplan <alex.coplan@arm.com>
29388 * config/aarch64/aarch64.c (aarch64_vfp_is_call_or_return_candidate):
29389 Fix typo in comment describing "is_ha" argument.
29391 2021-03-10 John David Anglin <danglin@gcc.gnu.org>
29393 * doc/sourcebuild.texi: Document LRA target selector.
29395 2021-03-10 David Malcolm <dmalcolm@redhat.com>
29397 * doc/ux.texi: Add subsection contrasting interactive versus
29398 batch usage of GCC.
29400 2021-03-10 Joel Hutton <joel.hutton@arm.com>
29403 * tree-vect-stmts.c (vectorizable_store): Fix scatter store mask
29405 (vectorizable_load): Fix gather load mask check condition.
29407 2021-03-10 Richard Biener <rguenther@suse.de>
29409 PR tree-optimization/99510
29410 * tree.c (check_aligned_type): Check that the candidate
29411 has TYPE_USER_ALIGN set instead of matching with the
29414 2021-03-10 Eric Botcazou <ebotcazou@adacore.com>
29416 * config/sparc/sparc.c (sparc_regmode_natural_size): Return 4 for
29417 float and vector integer modes only if the mode is not larger.
29419 2021-03-10 Hans-Peter Nilsson <hp@axis.com>
29421 * config/cris/cris.h (DWARF_FRAME_REGISTERS): Define.
29423 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29425 * ira.c (ira_setup_alts, ira_get_dup_out_num): Process digital
29427 * ira-lives.c (single_reg_class): Ditto.
29429 2021-03-09 Sebastian Huber <sebastian.huber@embedded-brains.de>
29431 * config.gcc (aarch64-*-rtems*): Include general rtems.h after
29432 the architecture-specific rtems.h.
29433 (aarch64-*-rtems*): Likewise.
29434 (arm*-*-rtems*): Likewise.
29435 (epiphany-*-rtems*): Likewise.
29436 (riscv*-*-rtems*): Likewise.
29438 2021-03-09 Jakub Jelinek <jakub@redhat.com>
29440 PR tree-optimization/99305
29441 * tree-ssa-phiopt.c (conditional_replacement): Test integer_pow2p
29442 before integer_all_onesp instead of vice versa.
29444 2021-03-09 Richard Earnshaw <rearnsha@arm.com>
29446 * common/config/arm/arm-common.c (arm_config_default): Change type
29447 of 'i' to unsigned.
29449 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29452 * lra-constraints.c (process_address_1): Process constraint 'g'
29453 separately and digital constraints containing more one digit.
29455 2021-03-09 Nick Clifton <nickc@redhat.com>
29457 * config/rx/rx.h (DBX_DEBUGGING_INFO): Define.
29458 (DWARF"_DEBUGGING_INFO): Define.
29460 2021-03-09 Eric Botcazou <ebotcazou@adacore.com>
29463 * calls.c (initialize_argument_information): When the argument
29464 is passed by reference, do not make a copy in a thunk only if
29465 the argument is already in memory. Remove redundant test for
29466 the case of callee copy.
29468 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29471 * lra-constraints.c (process_address_1): Process 0..9 constraints
29472 in process_address_1.
29474 2021-03-09 Andreas Krebbel <krebbel@linux.ibm.com>
29476 * config/s390/s390.c (struct s390_processor processor_table):
29477 Binutils name string must not be empty.
29479 2021-03-09 Claudiu Zissulescu <claziss@synopsys.com>
29481 * config/arc/arc.c (arc_attr_type): Remove function.
29483 2021-03-09 Martin Liska <mliska@suse.cz>
29486 * config/i386/i386-options.c (ix86_option_override_internal):
29487 Set isa_flags for OPTS argument and not for the global
29490 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
29492 * config/rs6000/predicates.md (ds_form_mem_operand): Check
29495 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
29498 * config/rs6000/predicates.md (ds_form_mem_operand) New
29500 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10) Use
29501 ds_form_mem_operand in ld/lwa patterns.
29502 * config/rs6000/fusion.md: Regenerate file.
29504 2021-03-08 Martin Sebor <msebor@redhat.com>
29506 PR middle-end/98266
29507 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): New function.
29508 (array_bounds_checker::check_array_bounds): Call it.
29510 2021-03-08 Martin Sebor <msebor@redhat.com>
29512 PR middle-end/97631
29513 * tree-ssa-strlen.c (maybe_warn_overflow): Test rawmem.
29514 (handle_builtin_stxncpy_strncat): Rename locals. Determine
29515 destination size from allocation calls. Issue a more appropriate
29517 (handle_builtin_memcpy): Pass true as rawmem to maybe_warn_overflow.
29518 (handle_builtin_memset): Same.
29520 2021-03-08 Peter Bergner <bergner@linux.ibm.com>
29523 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_permute): Add an assert
29524 to ensure we do not have an Altivec style address.
29525 * config/rs6000/vsx.md (*vsx_le_perm_load_<mode>): Disable if passed
29526 an Altivec style address.
29527 (*vsx_le_perm_store_<mode>): Likewise.
29528 (splitters after *vsx_le_perm_store_<mode>): Likewise.
29529 (vsx_load_<mode>): Disable special expander if passed an Altivec
29531 (vsx_store_<mode>): Likewise.
29533 2021-03-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29536 * config/aarch64/predicates.md (aarch64_simd_shift_imm_vec_qi): Define.
29537 (aarch64_simd_shift_imm_vec_hi): Likewise.
29538 (aarch64_simd_shift_imm_vec_si): Likewise.
29539 (aarch64_simd_shift_imm_vec_di): Likewise.
29540 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le): Use
29541 predicate from above.
29542 (aarch64_shrn<mode>_insn_be): Likewise.
29543 (aarch64_rshrn<mode>_insn_le): Likewise.
29544 (aarch64_rshrn<mode>_insn_be): Likewise.
29545 (aarch64_shrn2<mode>_insn_le): Likewise.
29546 (aarch64_shrn2<mode>_insn_be): Likewise.
29547 (aarch64_rshrn2<mode>_insn_le): Likewise.
29548 (aarch64_rshrn2<mode>_insn_be): Likewise.
29550 2021-03-08 Vladimir N. Makarov <vmakarov@redhat.com>
29553 * lra-constraints.c (skip_contraint_modifiers): New function.
29554 (process_address_1): Use it before lookup_constraint call.
29556 2021-03-08 Martin Liska <mliska@suse.cz>
29559 * config/i386/i386-options.c (ix86_option_override_internal):
29560 Enable UINTR and HRESET for -march that supports it.
29562 2021-03-08 Ilya Leoshkevich <iii@linux.ibm.com>
29564 * config/s390/s390.c (f_constraint_p): New function.
29565 (s390_md_asm_adjust): Implement TARGET_MD_ASM_ADJUST.
29566 (TARGET_MD_ASM_ADJUST): Likewise.
29568 2021-03-08 Tobias Burnus <tobias@codesourcery.com>
29571 * tree-nested.c (convert_local_reference_stmt): Avoid calling
29572 lookup_field_for_decl for Fortran module (= namespace context).
29574 2021-03-08 Andreas Krebbel <krebbel@linux.ibm.com>
29576 * config/s390/s390.c (s390_expand_vec_compare): Implement <0
29577 comparison with arithmetic right shift.
29578 (s390_expand_vcond): No need for a force_reg anymore.
29579 s390_vec_compare will do it.
29580 * config/s390/vector.md ("vec_cmp<mode><tointvec>"): Accept also
29581 immediate operands.
29583 2021-03-07 Jakub Jelinek <jakub@redhat.com>
29586 * config/i386/constraints.md (Yw): Use SSE_REGS if TARGET_SSE
29587 but TARGET_AVX512BW or TARGET_AVX512VL is not set. Adjust description
29589 * config/i386/sse.md (v_Yw): New define_mode_attr.
29590 (*<insn><mode>3, *mul<mode>3<mask_name>, *avx2_<code><mode>3,
29591 *sse4_1_<code><mode>3<mask_name>): Use <v_Yw> instead of v
29593 * config/i386/mmx.md (mmx_pshufw_1, *vec_dupv4hi): Use Yw instead of
29594 xYw in constraints.
29596 2021-03-06 Julian Brown <julian@codesourcery.com>
29598 * tree-pretty-print.c (dump_generic_node): Emit non-generic
29599 address space info for aggregates.
29601 2021-03-06 Hans-Peter Nilsson <hp@axis.com>
29603 * config/cris/cris.h (MAX_FIXED_MODE_SIZE): Don't define.
29605 2021-03-05 Jakub Jelinek <jakub@redhat.com>
29607 PR middle-end/99322
29608 * tree-cfg.c (bb_to_omp_idx): New variable.
29609 (execute_build_cfg): Release the bb_to_omp_idx vector after
29610 cleanup_tree_cfg returns.
29611 (handle_abnormal_edges): Remove bb_to_omp_idx argument, adjust
29612 for bb_to_omp_idx being a vec<int> instead of pointer to array
29614 (make_edges): Remove bb_to_omp_idx local variable, don't pass
29615 it to handle_abnormal_edges, adjust for bb_to_omp_idx being a
29616 vec<int> instead of pointer to array of ints and don't free/release
29618 (remove_bb): When removing a bb and placing forced label somewhere
29619 else, ensure it is put into the same OpenMP region during cfg
29620 pass if possible or to entry successor as fallback. Unregister
29621 bb from bb_to_omp_idx.
29623 2021-03-05 Vladimir N. Makarov <vmakarov@redhat.com>
29626 * lra-constraints.c (process_address_1): Skip decomposing address
29627 for asm insn operand with unknown constraint.
29629 2021-03-05 Martin Jambor <mjambor@suse.cz>
29632 * cgraph.c (cgraph_edge::set_call_stmt): Do not update all
29633 corresponding speculative edges if we are about to resolve
29634 sepculation. Make edge direct (and so resolve speculations) before
29635 removing it from call_site_hash.
29636 (cgraph_edge::make_direct): Relax the initial assert to allow calling
29637 the function on speculative direct edges.
29639 2021-03-05 Eric Botcazou <ebotcazou@adacore.com>
29641 PR rtl-optimization/99376
29642 * rtlanal.c (nonzero_bits1) <arithmetic operators>: If the number
29643 of low-order zero bits is too large, set the result to 0 directly.
29645 2021-03-04 Jakub Jelinek <jakub@redhat.com>
29647 PR middle-end/93235
29648 * expmed.c (store_bit_field_using_insv): Return false of xop0 is a
29649 SUBREG and a SUBREG to op_mode can't be created.
29651 2021-03-04 Alex Coplan <alex.coplan@arm.com>
29654 * config/aarch64/aarch64-sve-builtins.cc
29655 (function_resolver::require_vector_type): Handle error_mark_node.
29657 2021-03-04 Ilya Leoshkevich <iii@linux.ibm.com>
29659 * cfgexpand.c (expand_asm_loc): Pass new parameter.
29660 (expand_asm_stmt): Likewise.
29661 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Add new
29663 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
29664 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
29665 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
29666 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
29667 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
29668 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
29669 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
29670 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
29671 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
29672 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
29673 * doc/tm.texi (md_asm_adjust): Likewise.
29674 * target.def (md_asm_adjust): Likewise.
29676 2021-03-04 Richard Biener <rguenther@suse.de>
29678 PR middle-end/97855
29679 * tree-pretty-print.c: Poison pp_printf.
29680 (dump_decl_name): Avoid use of pp_printf.
29681 (dump_block_node): Likewise.
29682 (dump_generic_node): Likewise.
29684 2021-03-04 Martin Sebor <msebor@redhat.com>
29686 PR middle-end/96963
29687 PR middle-end/94655
29688 * builtins.c (handle_array_ref): New helper.
29689 (handle_mem_ref): New helper.
29690 (compute_objsize_r): Factor out ARRAY_REF and MEM_REF handling
29691 into new helper functions. Correct a workaround for vectorized
29694 2021-03-03 Pat Haugen <pthaugen@linux.ibm.com>
29696 * config/rs6000/dfp.md (extendddtd2, trunctddd2, *cmp<mode>_internal1,
29697 floatditd2, ftrunc<mode>2, fix<mode>di2, dfp_ddedpd_<mode>,
29698 dfp_denbcd_<mode>, dfp_dxex_<mode>, dfp_diex_<mode>,
29699 *dfp_sgnfcnc_<mode>, dfp_dscli_<mode>, dfp_dscri_<mode>): Update size
29700 attribute for Power10.
29701 * config/rs6000/mma.md (*movoo): Likewise.
29702 * config/rs6000/rs6000.md (define_attr "size"): Add 256.
29703 (define_mode_attr bits): Add DD/TD modes.
29704 * config/rs6000/sync.md (load_quadpti, store_quadpti, load_lockedpti,
29705 store_conditionalpti): Update size attribute for Power10.
29707 2021-03-03 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
29710 * config/sparc/t-sparc (tree-ssanames.o-warn): Don't error for
29711 -Wuninitialized, -Wmaybe-uninitialized.
29712 (wide-int.o-warn): Likewise.
29714 2021-03-03 Richard Earnshaw <rearnsha@arm.com>
29716 * common/config/arm/arm-common.c: Include configargs.h.
29717 (arm_config_default): New function.
29718 (arm_target_mode): Renamed from arm_target_thumb_only. Handle
29719 processors that do not support Thumb. Take into account the
29720 --with-mode configuration setting for selecting the default.
29721 * config/arm/arm.h (OPTION_DEFAULT_SPECS): Remove entry for 'mode'.
29722 (TARGET_MODE_SPEC_FUNCTIONS): Update for function name change.
29724 2021-03-03 Martin Liska <mliska@suse.cz>
29726 PR gcov-profile/97461
29727 * gcov-io.h (GCOV_PREALLOCATED_KVP): Remove.
29729 2021-03-03 Eric Botcazou <ebotcazou@adacore.com>
29732 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
29733 point back the hard frame pointer to its default location when the
29734 frame is larger than SEH_MAX_FRAME_SIZE.
29736 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29739 * config/i386/predicates.md (logic_operator): New define_predicate.
29740 * config/i386/i386.md (mov + mem using comm arith peephole2):
29741 Punt if operands[1] is EXT_REX_SSE_REGNO_P, AVX512BW is not enabled
29742 and the inner mode is [QH]Imode.
29744 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29747 * dwarf2out.c (dw_loc_list_struct): Add end_entry member.
29748 (new_loc_list): Clear end_entry.
29749 (output_loc_list): Only use DW_LLE_startx_length for -gsplit-dwarf
29750 if HAVE_AS_LEB128, otherwise use DW_LLE_startx_endx. Fix comment
29752 (index_location_lists): For dwarf_version >= 5 without HAVE_AS_LEB128,
29753 initialize also end_entry.
29755 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29758 * cfgrtl.c (fixup_partitions): When changing some bbs from hot to cold
29759 partitions, if in non-layout mode after reorder_blocks also move
29760 affected blocks to ensure a single partition transition.
29762 2021-03-03 Jason Merrill <jason@redhat.com>
29765 * cgraphunit.c (process_function_and_variable_attributes): Don't
29766 warn about flatten on an alias if the target also has it.
29767 * cgraph.h (symtab_node::get_alias_target_tree): New.
29769 2021-03-02 David Edelsohn <dje.gcc@gmail.com>
29771 * config/rs6000/rs6000.md (tls_get_tpointer_internal): Prepend
29772 period to symbol name.
29773 (tls_get_addr_internal<mode>): Same.
29775 2021-03-02 David Malcolm <dmalcolm@redhat.com>
29778 * diagnostic-show-locus.c
29779 (selftest::test_one_liner_many_fixits_2): Fix accidental usage of
29782 2021-03-02 Martin Sebor <msebor@redhat.com>
29784 PR middle-end/99276
29785 * builtins.c (warn_for_access): Remove stray warning text.
29787 2021-03-02 Martin Sebor <msebor@redhat.com>
29789 PR middle-end/99295
29790 * doc/extend.texi (attribute malloc): Reword and clarify nonaliasing
29793 2021-03-02 Jakub Jelinek <jakub@redhat.com>
29796 * dwarf2out.c (output_macinfo_op): Use DW_MACRO_*_str* even with
29797 -gdwarf-5 -gstrict-dwarf. For -gsplit-dwarf -gdwarf-5 use
29798 DW_MACRO_*_strx instead of DW_MACRO_*_strp. Handle
29799 DW_MACRO_define_strx and DW_MACRO_undef_strx.
29800 (save_macinfo_strings): Use DW_MACRO_*_str* even with
29801 -gdwarf-5 -gstrict-dwarf. Handle DW_MACRO_define_strx and
29802 DW_MACRO_undef_strx.
29804 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
29806 * config/s390/s390-builtin-types.def (BT_FN_V4SF_V8HI_UINT): New
29808 (BT_FN_V8HI_V8HI_UINT): Likewise.
29809 (BT_FN_V8HI_V4SF_V4SF_UINT): Likewise.
29810 * config/s390/s390-builtins.def (B_NNPA): New macro definition.
29811 (s390_vclfnhs, s390_vclfnls, s390_vcrnfs, s390_vcfn, s390_vcnf):
29812 New builtin definitions.
29813 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Bump
29814 vector extension version.
29815 * config/s390/s390.c (s390_expand_builtin): Check if builtins are
29816 available with current -march level.
29817 * config/s390/s390.md (UNSPEC_NNPA_VCLFNHS_V8HI)
29818 (UNSPEC_NNPA_VCLFNLS_V8HI, UNSPEC_NNPA_VCRNFS_V8HI)
29819 (UNSPEC_NNPA_VCFN_V8HI, UNSPEC_NNPA_VCNF_V8HI): New constants.
29820 * config/s390/vecintrin.h (vec_extend_to_fp32_hi): New macro.
29821 (vec_extend_to_fp32_lo): Likewise.
29822 (vec_round_from_fp32): Likewise.
29823 (vec_convert_to_fp16): Likewise.
29824 (vec_convert_from_fp16): Likewise.
29825 * config/s390/vx-builtins.md (vclfnhs_v8hi): New insn pattern.
29826 (vclfnls_v8hi): Likewise.
29827 (vcrnfs_v8hi): Likewise.
29828 (vcfn_v8hi): Likewise.
29829 (vcnf_v8hi): Likewise.
29831 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
29833 * common/config/s390/s390-common.c (processor_flags_table): New entry.
29834 * config.gcc: Enable arch14 for --with-arch and --with-tune.
29835 * config/s390/driver-native.c (s390_host_detect_local_cpu): Pick
29836 arch14 for unknown CPU models.
29837 * config/s390/s390-opts.h (enum processor_type): Add PROCESSOR_ARCH14.
29838 * config/s390/s390.c (s390_issue_rate): Add case for PROCESSOR_ARCH14.
29839 (s390_get_sched_attrmask): Likewise.
29840 (s390_get_unit_mask): Likewise.
29841 * config/s390/s390.h (enum processor_flags): Add PF_NNPA and PF_ARCH14.
29842 (TARGET_CPU_ARCH14, TARGET_CPU_ARCH14_P, TARGET_CPU_NNPA)
29843 (TARGET_CPU_NNPA_P, TARGET_ARCH14, TARGET_ARCH14_P, TARGET_NNPA)
29844 (TARGET_NNPA_P): New macro definitions.
29845 * config/s390/s390.md ("cpu_facility", "enabled"): Add arch14 and nnpa.
29846 * config/s390/s390.opt: Add PROCESSOR_ARCH14.
29848 2021-03-02 Jakub Jelinek <jakub@redhat.com>
29850 PR middle-end/95757
29851 * tree-vrp.c (register_edge_assert_for): Remove superfluous ()s around
29852 condition. Call register_edge_assert_for_1 for == 0, != 0, == 1 and
29853 != 1 comparisons if name is lhs of a comparison.
29855 2021-03-01 Iain Sandoe <iain@sandoe.co.uk>
29859 * config/darwin-protos.h (darwin_should_restore_cfa_state): New.
29860 * config/darwin.c (darwin_should_restore_cfa_state): New.
29861 * config/darwin.h (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New.
29862 * doc/tm.texi: Regenerated.
29863 * doc/tm.texi.in: Document TARGET_ASM_SHOULD_RESTORE_CFA_STATE.
29864 * dwarf2cfi.c (connect_traces): If the target requests, restore
29865 the CFA expression after a DW_CFA_restore.
29866 * target.def (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New hook.
29868 2021-03-01 Martin Liska <mliska@suse.cz>
29871 * optc-save-gen.awk: Add 4 more exceptions.
29873 2021-03-01 Nathan Sidwell <nathan@acm.org>
29876 * tree.h (TYPE_ALIGN_RAW): New accessor.
29877 (TYPE_ALIGN): Use it.
29879 2021-03-01 Jan Hubicka <jh@suse.cz>
29882 * ipa-fnsummary.c (compute_fn_summary): Fix sanity check.
29884 2021-03-01 Eric Botcazou <ebotcazou@adacore.com>
29887 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
29888 point the hard frame pointer to the SSE register save area instead
29889 of the general register save area. Perform only minimal adjustment
29890 for small frames if it is initially not correctly aligned.
29891 (ix86_expand_prologue): Remove early saves for a SEH target.
29892 * config/i386/winnt.c (struct seh_frame_state): Document constraint.
29894 2021-02-28 Jakub Jelinek <jakub@redhat.com>
29897 * ipa.c (symbol_table::remove_unreachable_nodes): Fix a comment
29898 typo - referneced -> referenced.
29899 * tree.c (component_ref_size): Fix comment typo -
29900 refernce -> reference.
29901 * tree-ssa-alias.c (access_path_may_continue_p): Fix comment typo -
29902 traling -> trailing.
29903 (aliasing_component_refs_p): Fix comment typos -
29904 refernce -> reference and refernece -> reference and
29905 traling -> trailing.
29906 (nonoverlapping_refs_since_match_p): Fix comment typo -
29907 referneces -> references.
29908 * doc/invoke.texi (--param modref-max-bases): Fix a typo -
29909 referneces -> references.
29911 2021-02-27 Iain Sandoe <iain@sandoe.co.uk>
29913 * config/host-darwin.c (darwin_gt_pch_use_address): Modify
29914 diagnostic message to avoid use of a contraction and format
29917 2021-02-27 Jakub Jelinek <jakub@redhat.com>
29920 * gcse.c (gcse_or_cprop_is_too_expensive): Use %wu instead of
29921 HOST_WIDE_INT_PRINT_UNSIGNED in warning format string.
29922 * ipa-devirt.c (ipa_odr_read_section): Use %wd instead of
29923 HOST_WIDE_INT_PRINT_DEC in inform format string. Fix comment
29926 2021-02-26 Richard Biener <rguenther@suse.de>
29928 PR middle-end/99281
29929 * expr.c (store_field): For calls with return-slot optimization
29930 and addressable return type expand the store directly.
29932 2021-02-26 Richard Biener <rguenther@suse.de>
29935 * builtins.c (warn_string_no_nul): Fix diagnostic formatting.
29937 2021-02-26 Peter Bergner <bergner@linux.ibm.com>
29940 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Replace assert
29943 2021-02-26 Aaron Sawdey <acsawdey@linux.ibm.com>
29945 * config.gcc: Add rs6000-pcrel-opt.o.
29946 * config/rs6000/rs6000-pcrel-opt.c: New file.
29947 * config/rs6000/pcrel-opt.md: New file.
29948 * config/rs6000/predicates.md: Add d_form_memory predicate.
29949 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_PCREL_OPT.
29950 * config/rs6000/rs6000-passes.def: Add pass_pcrel_opt.
29951 * config/rs6000/rs6000-protos.h: Add reg_to_non_prefixed(),
29952 pcrel_opt_valid_mem_p(), output_pcrel_opt_reloc(),
29953 and make_pass_pcrel_opt().
29954 * config/rs6000/rs6000.c (reg_to_non_prefixed): Make global.
29955 (rs6000_option_override_internal): Add pcrel-opt.
29956 (rs6000_delegitimize_address): Support pcrel-opt.
29957 (rs6000_opt_masks): Add pcrel-opt.
29958 (pcrel_opt_valid_mem_p): New function.
29959 (reg_to_non_prefixed): Make global.
29960 (rs6000_asm_output_opcode): Reset prepend_p_to_next_insn.
29961 (output_pcrel_opt_reloc): New function.
29962 * config/rs6000/rs6000.md (loads_extern_addr): New attr.
29963 (pcrel_extern_addr): Set loads_extern_addr.
29964 Add include for pcrel-opt.md.
29965 * config/rs6000/rs6000.opt: Add -mpcrel-opt.
29966 * config/rs6000/t-rs6000: Add rules for pcrel-opt.c and
29969 2021-02-26 YunQiang Su <yunqiang.su@cipunited.com>
29972 * config/mips/mips.c (mips_expand_ext_as_unaligned_load):
29973 If TARGET_64BIT and dest is SUBREG, we check the width, if it
29974 equal to SImode, we use SImode operation, just like what we are
29977 2021-02-26 Marek Polacek <polacek@redhat.com>
29979 * builtins.c (warn_for_access): Fix typos.
29981 2021-02-25 Iain Sandoe <iain@sandoe.co.uk>
29983 * config/aarch64/aarch64.md (<optab>_rol<mode>3): Add a '#'
29984 mark in front of the immediate quantity.
29985 (<optab>_rolsi3_uxtw): Likewise.
29987 2021-02-25 Richard Earnshaw <rearnsha@arm.com>
29990 * config/arm/thumb2.md (nonsecure_call_reg_thumb2_fpcxt): New pattern.
29991 (nonsecure_call_value_reg_thumb2_fpcxt): Likewise.
29992 (nonsecure_call_reg_thumb2): Restrict to using r4 for the callee
29993 address and disable when the FPCXT is not available.
29994 (nonsecure_call_value_reg_thumb2): Likewise.
29996 2021-02-25 Nathan Sidwell <nathan@acm.org>
29999 * doc/invoke.texi (flang-info-module-cmi): Renamed option.
30001 2021-02-25 Tamar Christina <tamar.christina@arm.com>
30003 * tree-vect-slp.c (optimize_load_redistribution_1): Abort on NULL nodes.
30005 2021-02-25 Richard Biener <rguenther@suse.de>
30007 PR tree-optimization/99253
30008 * tree-vect-loop.c (check_reduction_path): First compute
30009 code, then verify out-of-loop uses.
30011 2021-02-25 Jakub Jelinek <jakub@redhat.com>
30014 * match.pd ((T)(A) + CST -> (T)(A + CST)): Add :s to convert.
30016 2021-02-25 Jakub Jelinek <jakub@redhat.com>
30018 PR tree-optimization/80635
30019 * tree-vrp.c (vrp_simplify_cond_using_ranges): Also handle
30020 VIEW_CONVERT_EXPR if modes are the same, innerop is integral and
30021 has mode precision.
30023 2021-02-25 Richard Biener <rguenther@suse.de>
30025 * tree-vect-slp.c (optimize_load_redistribution_1): Delay
30026 load_map population.
30027 (vect_match_slp_patterns_2): Revert part of last change.
30028 (vect_analyze_slp): Do not interleave optimize_load_redistribution
30029 with pattern detection but do it afterwards. Dump the
30030 whole SLP graph after pattern recognition and load
30031 redistribution optimization finished.
30033 2021-02-24 Jakub Jelinek <jakub@redhat.com>
30036 * omp-low.c (struct omp_context): Add teams_nested_p and
30037 nonteams_nested_p members.
30038 (scan_omp_target): Diagnose teams nested inside of target with other
30039 directives strictly nested inside of the same target.
30040 (check_omp_nesting_restrictions): Set ctx->teams_nested_p or
30041 ctx->nonteams_nested_p as needed.
30043 2021-02-24 Vladimir N. Makarov <vmakarov@redhat.com>
30045 PR inline-asm/99123
30046 * lra-constraints.c (uses_hard_regs_p): Don't use decompose_mem_address.
30048 2021-02-24 Hans-Peter Nilsson <hp@axis.com>
30050 * config/cris/cris.c (cris_expand_prologue): Set
30051 current_function_static_stack_size, if flag_stack_usage_info.
30053 2021-02-24 Pat Haugen <pthaugen@linux.ibm.com>
30055 * config/rs6000/rs6000.c (next_insn_prefixed_p): Rename.
30056 (rs6000_final_prescan_insn): Adjust.
30057 (rs6000_asm_output_opcode): Likewise.
30059 2021-02-24 Martin Sebor <msebor@redhat.com>
30061 PR middle-end/97172
30062 * attribs.c (attr_access::free_lang_data): Clear attribute arg spec
30063 from function arguments.
30065 2021-02-24 Tamar Christina <tamar.christina@arm.com>
30067 PR tree-optimization/99220
30068 * tree-vect-slp.c (optimize_load_redistribution_1): Remove
30069 node from cache when it's about to be deleted.
30071 2021-02-24 Jakub Jelinek <jakub@redhat.com>
30073 PR tree-optimization/99225
30074 * fold-const.c (fold_binary_loc) <case NE_EXPR>: In (x & (1 << y)) != 0
30075 to ((x >> y) & 1) != 0 simplifications use build_one_cst instead of
30076 build_int_cst (..., 1). Formatting fixes.
30078 2021-02-24 Tamar Christina <tamar.christina@arm.com>
30080 PR tree-optimization/99149
30081 * tree-vect-slp-patterns.c (vect_detect_pair_op): Don't recreate the
30083 (vect_slp_reset_pattern): Remove.
30084 (complex_fma_pattern::matches): Remove call to vect_slp_reset_pattern.
30085 (complex_mul_pattern::build, complex_fma_pattern::build,
30086 complex_fms_pattern::build): Fix ref counts.
30087 * tree-vect-slp.c (vect_free_slp_tree): Undo SLP only pattern relevancy
30088 when node is being deleted.
30089 (vect_match_slp_patterns_2): Correct result of cache hit on patterns.
30090 (vect_schedule_slp): Invalidate SLP_TREE_REPRESENTATIVE of removed
30092 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize value.
30094 2021-02-24 Matthias Klose <doko@ubuntu.com>
30097 2020-12-07 Matthias Klose <doko@ubuntu.com>
30099 * genextract.c (print_header): Undefine ENABLE_RTL_CHECKING
30100 and ENABLE_RTL_FLAG_CHECKING.
30102 2021-02-24 Richard Biener <rguenther@suse.de>
30105 * builtins.c (fold_builtin_next_arg): Avoid NULL arg.
30107 2021-02-23 Peter Bergner <bergner@linux.ibm.com>
30109 * config/rs6000/mma.md (mma_assemble_pair): Rename from this...
30110 (vsx_assemble_pair): ...to this.
30111 (*mma_assemble_pair): Rename from this...
30112 (*vsx_assemble_pair): ...to this.
30113 (mma_disassemble_pair): Rename from this...
30114 (vsx_disassemble_pair): ...to this.
30115 (*mma_disassemble_pair): Rename from this...
30116 (*vsx_disassemble_pair): ...to this.
30117 * config/rs6000/rs6000-builtin.def (BU_MMA_V2, BU_MMA_V3,
30118 BU_COMPAT): New macros.
30119 (mma_assemble_pair): Rename from this...
30120 (vsx_assemble_pair): ...to this.
30121 (mma_disassemble_pair): Rename from this...
30122 (vsx_disassemble_pair): ...to this.
30123 (mma_assemble_pair): New compatibility built-in.
30124 (mma_disassemble_pair): Likewise.
30125 * config/rs6000/rs6000-call.c (struct builtin_compatibility): New.
30126 (RS6000_BUILTIN_COMPAT): Define.
30127 (bdesc_compat): New.
30128 (mma_expand_builtin): Use VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
30129 (rs6000_gimple_fold_mma_builtin): Use MMA_BUILTIN_DISASSEMBLE_PAIR
30130 and VSX_BUILTIN_ASSEMBLE_PAIR.
30131 (rs6000_init_builtins): Register compatibility built-ins.
30132 (mma_init_builtins): Use VSX_BUILTIN_ASSEMBLE_PAIR,
30133 VSX_BUILTIN_ASSEMBLE_PAIR_INTERNAL, VSX_BUILTIN_DISASSEMBLE_PAIR and
30134 VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
30135 * doc/extend.texi (__builtin_mma_assemble_pair): Rename from this...
30136 (__builtin_vsx_assemble_pair): ...to this.
30137 (__builtin_mma_disassemble_pair): Rename from this...
30138 (__builtin_vsx_disassemble_pair): ...to this.
30140 2021-02-23 Martin Liska <mliska@suse.cz>
30143 * ipa-icf.c (sem_variable::merge): Do not merge 2 variables
30144 with different alignment. That leads to an invalid red zone
30145 size allocated in runtime.
30147 2021-02-23 Jakub Jelinek <jakub@redhat.com>
30149 PR tree-optimization/99204
30150 * fold-const.c (fold_read_from_constant_string): Check that
30151 tree_fits_uhwi_p (index) rather than just that index is INTEGER_CST.
30153 2021-02-23 Segher Boessenkool <segher@kernel.crashing.org>
30154 Kewen Lin <linkw@gcc.gnu.org>
30156 * config/rs6000/rs6000.md (*rotl<mode>3_insert_3): Renamed to...
30157 (rotl<mode>3_insert_3): ...this.
30158 (plus_ior_xor): New code_iterator.
30159 (define_split for GPR rl*imi): New splitter.
30160 * config/rs6000/vsx.md (vsx_init_v4si): Use gen_rotldi3_insert_3
30161 for integer merging.
30163 2021-02-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30165 * config/aarch64/aarch64-tuning-flags.def (cse_sve_vl_constants):
30167 * config/aarch64/aarch64.md (add<mode>3): Force CONST_POLY_INT immediates
30168 into a register when the above is enabled.
30169 * config/aarch64/aarch64.c (neoversev1_tunings):
30170 AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
30171 (aarch64_rtx_costs): Use AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
30173 2021-02-22 Hans-Peter Nilsson <hp@axis.com>
30175 * config/cris/cris.c (cris_print_operand) <'T'>: Change
30176 valid operand from is now an addi mult-value to shift-value.
30177 * config/cris/cris.md (*addi): Change expression of scaled
30178 operand from mult to ashift.
30179 * config/cris/cris.md (*addi_reload): New insn_and_split.
30181 2021-02-22 John David Anglin <danglin@gcc.gnu.org>
30184 * config/pa/pa.c (TARGET_ASM_CAN_OUTPUT_MI_THUNK): Define as
30185 hook_bool_const_tree_hwi_hwi_const_tree_true.
30186 (pa_asm_output_mi_thunk): Add support for nonzero vcall_offset.
30188 2021-02-22 Andre Vieira <andre.simoesdiasvieira@arm.com>
30190 PR rtl-optimization/98791
30191 * ira-conflicts.c (process_regs_for_copy): Don't create allocno copies
30192 for unordered modes.
30194 2021-02-22 Martin Liska <mliska@suse.cz>
30196 * tree-inline.c (inline_forbidden_p): Set
30197 inline_forbidden_reason.
30199 2021-02-22 Richard Biener <rguenther@suse.de>
30201 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Dump
30204 2021-02-22 Richard Biener <rguenther@suse.de>
30206 PR tree-optimization/99165
30207 * gimple-ssa-store-merging.c (pass_store_merging::process_store):
30208 Accumulate changed to ret.
30210 2021-02-21 Uros Bizjak <ubizjak@gmail.com>
30213 2020-12-09 Uroš Bizjak <ubizjak@gmail.com>
30215 * config/i386/i386.h (REG_ALLOC_ORDER): Remove
30217 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
30220 * config/s390/vector.md (trunctf<DFP_ALL:mode>2_vr): New
30222 (trunctf<DFP_ALL:mode>2): Likewise.
30223 (trunctdtf2_vr): Likewise.
30224 (trunctdtf2): Likewise.
30225 (extend<DFP_ALL:mode>tf2_vr): Likewise.
30226 (extend<DFP_ALL:mode>tf2): Likewise.
30227 (extendtftd2_vr): Likewise.
30228 (extendtftd2): Likewise.
30230 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
30232 * config/s390/vector.md (*fprx2_to_tf): Rename to fprx2_to_tf,
30233 add memory alternative.
30234 (tf_to_fprx2): New pattern.
30236 2021-02-19 Martin Sebor <msebor@redhat.com>
30239 * attribs.c (init_attr_rdwr_indices): Guard vblist use.
30240 (attr_access::free_lang_data): Remove a spurious test.
30242 2021-02-19 Nathan Sidwell <nathan@acm.org>
30244 * doc/invoke.texi (flang-info-module-read): Document.
30246 2021-02-19 Martin Liska <mliska@suse.cz>
30248 PR translation/99167
30249 * params.opt: Fix typo.
30251 2021-02-19 Richard Biener <rguenther@suse.de>
30253 PR middle-end/99122
30254 * tree-inline.c (inline_forbidden_p): Do not inline functions
30255 with VLA arguments or return value.
30257 2021-02-19 Jakub Jelinek <jakub@redhat.com>
30260 * config/arm/arm.md (*stack_protect_combined_set_insn,
30261 *stack_protect_combined_test_insn): If force_const_mem result
30262 is not valid general operand, force its address into the destination
30265 2021-02-19 Jakub Jelinek <jakub@redhat.com>
30268 * tree-cfg.c (gimple_merge_blocks): If bb a starts with eh landing
30269 pad or non-local label, put FORCED_LABELs from bb b after that label
30270 rather than before it.
30272 2021-02-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
30275 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3): Use
30276 expand_vector_broadcast' to emit the vec_duplicate operand.
30278 2021-02-18 Vladimir N. Makarov <vmakarov@redhat.com>
30280 PR rtl-optimization/96264
30281 * lra-remat.c (reg_overlap_for_remat_p): Check also output insn
30284 2021-02-18 H.J. Lu <hjl.tools@gmail.com>
30287 * varasm.c (get_section): Replace SUPPORTS_SHF_GNU_RETAIN with
30288 looking up the retain attribute.
30289 (resolve_unique_section): Likewise.
30290 (get_variable_section): Likewise.
30291 (switch_to_section): Likewise. Warn when a symbol without the
30292 retain attribute and a symbol with the retain attribute are
30293 placed in the section with the same name, instead of the used
30295 * doc/extend.texi: Document the "retain" attribute.
30297 2021-02-18 Nathan Sidwell <nathan@acm.org>
30300 * doc/invoke.texi (flang-info-include-translate): Document header
30303 2021-02-18 Richard Biener <rguenther@suse.de>
30305 PR middle-end/99122
30306 * ipa-fnsummary.c (analyze_function_body): Set
30307 CIF_FUNCTION_NOT_INLINABLE for VLA parameter calls.
30308 * tree-inline.c (insert_init_debug_bind): Pass NULL for
30309 error_mark_node values.
30310 (force_value_to_type): Do not build V_C_Es for WITH_SIZE_EXPR
30312 (setup_one_parameter): Delay force_value_to_type until when
30315 2021-02-18 Hans-Peter Nilsson <hp@axis.com>
30317 PR tree-optimization/99142
30318 * match.pd (clz cmp 0): Gate replacement on single_use of clz result.
30320 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30322 * wide-int-bitmask.h (wide_int_bitmask::wide_int_bitmask (),
30323 wide_int_bitmask::wide_int_bitmask (uint64_t),
30324 wide_int_bitmask::wide_int_bitmask (uint64_t, uint64_t),
30325 wide_int_bitmask::operator ~ () const,
30326 wide_int_bitmask::operator | (wide_int_bitmask) const,
30327 wide_int_bitmask::operator & (wide_int_bitmask) const): Use constexpr
30329 * config/i386/i386.h (PTA_3DNOW, PTA_3DNOW_A, PTA_64BIT, PTA_ABM,
30330 PTA_AES, PTA_AVX, PTA_BMI, PTA_CX16, PTA_F16C, PTA_FMA, PTA_FMA4,
30331 PTA_FSGSBASE, PTA_LWP, PTA_LZCNT, PTA_MMX, PTA_MOVBE, PTA_NO_SAHF,
30332 PTA_PCLMUL, PTA_POPCNT, PTA_PREFETCH_SSE, PTA_RDRND, PTA_SSE, PTA_SSE2,
30333 PTA_SSE3, PTA_SSE4_1, PTA_SSE4_2, PTA_SSE4A, PTA_SSSE3, PTA_TBM,
30334 PTA_XOP, PTA_AVX2, PTA_BMI2, PTA_RTM, PTA_HLE, PTA_PRFCHW, PTA_RDSEED,
30335 PTA_ADX, PTA_FXSR, PTA_XSAVE, PTA_XSAVEOPT, PTA_AVX512F, PTA_AVX512ER,
30336 PTA_AVX512PF, PTA_AVX512CD, PTA_NO_TUNE, PTA_SHA, PTA_PREFETCHWT1,
30337 PTA_CLFLUSHOPT, PTA_XSAVEC, PTA_XSAVES, PTA_AVX512DQ, PTA_AVX512BW,
30338 PTA_AVX512VL, PTA_AVX512IFMA, PTA_AVX512VBMI, PTA_CLWB, PTA_MWAITX,
30339 PTA_CLZERO, PTA_NO_80387, PTA_PKU, PTA_AVX5124VNNIW, PTA_AVX5124FMAPS,
30340 PTA_AVX512VPOPCNTDQ, PTA_SGX, PTA_AVX512VNNI, PTA_GFNI, PTA_VAES,
30341 PTA_AVX512VBMI2, PTA_VPCLMULQDQ, PTA_AVX512BITALG, PTA_RDPID,
30342 PTA_PCONFIG, PTA_WBNOINVD, PTA_AVX512VP2INTERSECT, PTA_PTWRITE,
30343 PTA_AVX512BF16, PTA_WAITPKG, PTA_MOVDIRI, PTA_MOVDIR64B, PTA_ENQCMD,
30344 PTA_CLDEMOTE, PTA_SERIALIZE, PTA_TSXLDTRK, PTA_AMX_TILE, PTA_AMX_INT8,
30345 PTA_AMX_BF16, PTA_UINTR, PTA_HRESET, PTA_KL, PTA_WIDEKL, PTA_AVXVNNI,
30346 PTA_X86_64_BASELINE, PTA_X86_64_V2, PTA_X86_64_V3, PTA_X86_64_V4,
30347 PTA_CORE2, PTA_NEHALEM, PTA_WESTMERE, PTA_SANDYBRIDGE, PTA_IVYBRIDGE,
30348 PTA_HASWELL, PTA_BROADWELL, PTA_SKYLAKE, PTA_SKYLAKE_AVX512,
30349 PTA_CASCADELAKE, PTA_COOPERLAKE, PTA_CANNONLAKE, PTA_ICELAKE_CLIENT,
30350 PTA_ICELAKE_SERVER, PTA_TIGERLAKE, PTA_SAPPHIRERAPIDS, PTA_ALDERLAKE,
30351 PTA_KNL, PTA_BONNELL, PTA_SILVERMONT, PTA_GOLDMONT, PTA_GOLDMONT_PLUS,
30352 PTA_TREMONT, PTA_KNM): Use constexpr instead of const.
30354 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30356 PR middle-end/99109
30357 * gimple-array-bounds.cc (build_zero_elt_array_type): Rename to ...
30358 (build_printable_array_type): ... this. Add nelts argument. For
30359 overaligned eltype, use TYPE_MAIN_VARIANT (eltype) instead. If
30360 nelts, call build_array_type_nelts.
30361 (array_bounds_checker::check_mem_ref): Use build_printable_array_type
30362 instead of build_zero_elt_array_type and build_array_type_nelts.
30364 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30367 * config/i386/i386.c (distance_non_agu_define): Don't call
30368 extract_insn_cached here.
30369 (ix86_lea_outperforms): Save and restore recog_data around call
30370 to distance_non_agu_define and distance_agu_use.
30371 (ix86_ok_to_clobber_flags): Remove.
30372 (ix86_avoid_lea_for_add): Don't call ix86_ok_to_clobber_flags.
30373 (ix86_avoid_lea_for_addr): Likewise. Adjust function comment.
30374 * config/i386/i386.md (*lea<mode>): Change from define_insn_and_split
30375 into define_insn. Move the splitting to define_peephole2 and
30376 check there using peep2_regno_dead_p if FLAGS_REG is dead.
30378 2021-02-17 Julian Brown <julian@codesourcery.com>
30380 * gimplify.c (gimplify_scan_omp_clauses): Handle ATTACH_DETACH
30383 2021-02-17 Xi Ruoyao <xry111@mengyan1223.wang>
30386 * config/mips/mips.c (mips_symbol_insns): Do not use
30387 MSA_SUPPORTED_MODE_P if mode is MAX_MACHINE_MODE.
30389 2021-02-16 Vladimir N. Makarov <vmakarov@redhat.com>
30391 PR inline-asm/98096
30392 * stmt.c (resolve_operand_name_1): Take inout operands into account
30393 for access to labels by names.
30394 * doc/extend.texi: Describe counting operands for accessing labels.
30396 2021-02-16 Richard Biener <rguenther@suse.de>
30398 PR tree-optimization/38474
30399 * tree-ssa-structalias.c (variable_info::address_taken): New.
30400 (new_var_info): Initialize address_taken.
30401 (process_constraint): Set address_taken.
30402 (solve_constraints): Use the new address_taken flag rather
30403 than is_reg_var for sorting variables.
30404 (dump_constraint): Dump the variable number if the name
30407 2021-02-16 Jakub Jelinek <jakub@redhat.com>
30410 * tree-vect-stmts.c (vectorizable_simd_clone_call): For num_calls != 1
30411 multiply by 4096 and for inbranch by 8192.
30412 * config/i386/i386.c (ix86_simd_clone_usable): For TARGET_AVX512F,
30413 return 3, 2 or 1 for mangle letters 'b', 'c' or 'd'.
30415 2021-02-15 Maya Rashish <coypu@sdf.org>
30417 * config/aarch64/aarch64.c (aarch64_init_builtins):
30418 Call SUBTARGET_INIT_BUILTINS.
30420 2021-02-15 Peter Bergner <bergner@linux.ibm.com>
30422 PR rtl-optimization/98872
30423 * init-regs.c (initialize_uninitialized_regs): Skip initialization
30424 if CONST0_RTX is NULL.
30426 2021-02-15 Richard Sandiford <richard.sandiford@arm.com>
30428 PR rtl-optimization/98863
30429 * rtl-ssa/functions.h (function_info::bb_live_out_info): Delete.
30430 (function_info::build_info): Turn into a declaration, moving the
30431 definition to internals.h.
30432 (function_info::bb_walker): Declare.
30433 (function_info::create_reg_use): Likewise.
30434 (function_info::calculate_potential_phi_regs): Take a build_info
30436 (function_info::place_phis, function_info::create_ebbs): Declare.
30437 (function_info::calculate_ebb_live_in_for_debug): Likewise.
30438 (function_info::populate_backedge_phis): Delete.
30439 (function_info::start_block, function_info::end_block): Declare.
30440 (function_info::populate_phi_inputs): Delete.
30441 (function_info::m_potential_phi_regs): Move information to build_info.
30442 * rtl-ssa/internals.h: New file.
30443 (function_info::bb_phi_info): New class.
30444 (function_info::build_info): Moved from functions.h.
30445 Add a constructor and destructor.
30446 (function_info::build_info::ebb_use): Delete.
30447 (function_info::build_info::ebb_def): Likewise.
30448 (function_info::build_info::bb_live_out): Likewise.
30449 (function_info::build_info::tmp_ebb_live_in_for_debug): New variable.
30450 (function_info::build_info::potential_phi_regs): Likewise.
30451 (function_info::build_info::potential_phi_regs_for_debug): Likewise.
30452 (function_info::build_info::ebb_def_regs): Likewise.
30453 (function_info::build_info::bb_phis): Likewise.
30454 (function_info::build_info::bb_mem_live_out): Likewise.
30455 (function_info::build_info::bb_to_rpo): Likewise.
30456 (function_info::build_info::def_stack): Likewise.
30457 (function_info::build_info::old_def_stack_limit): Likewise.
30458 * rtl-ssa/internals.inl (function_info::build_info::record_reg_def):
30459 Remove the regno argument. Push the previous definition onto the
30460 definition stack where necessary.
30461 * rtl-ssa/accesses.cc: Include internals.h.
30462 * rtl-ssa/changes.cc: Likewise.
30463 * rtl-ssa/blocks.cc: Likewise.
30464 (function_info::build_info::build_info): Define.
30465 (function_info::build_info::~build_info): Likewise.
30466 (function_info::bb_walker): New class.
30467 (function_info::bb_walker::bb_walker): Define.
30468 (function_info::add_live_out_use): Convert a logarithmic-complexity
30469 test into a linear one. Allow the same definition to be passed
30471 (function_info::calculate_potential_phi_regs): Moved from
30472 functions.cc. Take a build_info parameter and store the
30473 information there instead.
30474 (function_info::place_phis): New function.
30475 (function_info::add_entry_block_defs): Update call to record_reg_def.
30476 (function_info::calculate_ebb_live_in_for_debug): New function.
30477 (function_info::add_phi_nodes): Use bb_phis to decide which
30478 registers need phi nodes and initialize ebb_def_regs accordingly.
30479 Do not add degenerate phis here.
30480 (function_info::add_artificial_accesses): Use create_reg_use.
30481 Assert that all definitions are listed in the DF LR sets.
30482 Update call to record_reg_def.
30483 (function_info::record_block_live_out): Record live-out register
30484 values in the phis of successor blocks. Use the live-out set
30485 when processing the last block in an EBB, instead of always
30486 using the live-in sets of successor blocks. AND the live sets
30487 with the set of registers that have been defined in the EBB,
30488 rather than with all potential phi registers. Cope correctly
30489 with branches back to the start of the current EBB.
30490 (function_info::start_block): New function.
30491 (function_info::end_block): Likewise.
30492 (function_info::populate_phi_inputs): Likewise.
30493 (function_info::create_ebbs): Likewise.
30494 (function_info::process_all_blocks): Rewrite into a multi-phase
30496 * rtl-ssa/functions.cc: Include internals.h.
30497 (function_info::calculate_potential_phi_regs): Move to blocks.cc.
30498 (function_info::init_function_data): Remove caller.
30499 * rtl-ssa/insns.cc: Include internals.h
30500 (function_info::create_reg_use): New function. Lazily any
30501 degenerate phis needed by the linear RPO view.
30502 (function_info::record_use): Use create_reg_use. When processing
30503 debug uses, use potential_phi_regs and test it before checking
30504 whether the register is live on entry to the current EBB. Lazily
30505 calculate ebb_live_in_for_debug.
30506 (function_info::record_call_clobbers): Update call to record_reg_def.
30507 (function_info::record_def): Likewise.
30509 2021-02-15 Martin Liska <mliska@suse.cz>
30511 * toplev.c (init_asm_output): Free output of
30512 gen_command_line_string function.
30513 (process_options): Likewise.
30515 2021-02-15 Martin Liska <mliska@suse.cz>
30517 * params.opt: Add 2 missing Param keywords.
30519 2021-02-15 Eric Botcazou <ebotcazou@adacore.com>
30521 * df-core.c (df_worklist_dataflow_doublequeue): Use proper cast.
30523 2021-02-15 Jakub Jelinek <jakub@redhat.com>
30525 PR tree-optimization/99079
30526 * match.pd (A % (pow2pcst << N) -> A & ((pow2pcst << N) - 1)): Remove
30527 useless tree_nop_conversion_p (type, TREE_TYPE (@3)) check. Instead
30528 require both type and TREE_TYPE (@1) to be integral types and either
30529 type having smaller or equal precision, or TREE_TYPE (@1) being
30530 unsigned type, or type being signed type. If TREE_TYPE (@1)
30531 doesn't have wrapping overflow, perform the subtraction of one in
30534 2021-02-14 Jan Hubicka <hubicka@ucw.cz>
30535 Richard Biener <rguether@suse.de>
30538 * ipa-reference.c (ipa_init): Only conditinally initialize
30539 reference_vars_to_consider.
30540 (propagate): Conditionally deninitialize reference_vars_to_consider.
30541 (ipa_reference_write_optimization_summary): Sanity check that
30542 reference_vars_to_consider is not allocated.
30544 2021-02-13 Levy Hsu <admin@levyhsu.com>
30547 * config/riscv/riscv-shorten-memrefs.c (pass_shorten_memrefs): Add
30548 extend parameter to get_si_mem_base_reg declaration.
30549 (get_si_mem_base_reg): Add extend parameter. Set it.
30550 (analyze): Pass extend arg to get_si_mem_base_reg.
30551 (transform): Likewise. Use it when rewriting mems.
30552 * config/riscv/riscv.c (riscv_legitimize_move): Check for subword
30553 loads and emit sign/zero extending load followed by subreg move.
30555 2021-02-13 Jim Wilson <jimw@sifive.com>
30558 * config/riscv/riscv.c (riscv_compressed_lw_address_p): Drop early
30559 exit when !reload_completed. Only perform check for compressed reg
30560 if reload_completed.
30561 (riscv_rtx_costs): In MEM case, when optimizing for size and
30562 shorten memrefs, if not compressible, then increase cost.
30564 2021-02-13 Jakub Jelinek <jakub@redhat.com>
30566 PR rtl-optimization/98439
30567 * recog.c (pass_split_before_regstack::gate): Enable even when
30568 pass_split_before_sched2 is enabled if -fselective-scheduling2 is
30571 2021-02-13 Jakub Jelinek <jakub@redhat.com>
30574 * config/i386/mmx.md (*mmx_pshufd_1): Add a combine splitter for
30575 swap of V2SImode elements in memory into DImode memory rotate by 32.
30577 2021-02-12 Martin Sebor <msebor@redhat.com>
30579 * tree-pretty-print.c (print_generic_expr_to_str): Update comment.
30581 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
30583 * rtl-ssa/accesses.cc (function_info::make_use_available): Use
30584 m_temp_obstack rather than m_obstack to allocate the temporary use.
30586 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
30588 * df-problems.c (df_lr_bb_local_compute): Treat partial definitions
30589 as read-modify operations.
30591 2021-02-12 Richard Biener <rguenther@suse.de>
30593 PR middle-end/38474
30594 * ipa-fnsummary.c (unmodified_parm_1): Only walk when
30595 fbi->aa_walk_budget is bigger than zero. Update
30596 fbi->aa_walk_budget.
30597 (param_change_prob): Likewise.
30598 * ipa-prop.c (detect_type_change_from_memory_writes):
30599 Properly account walk_aliased_vdefs.
30600 (parm_preserved_before_stmt_p): Canonicalize updates.
30601 (parm_ref_data_preserved_p): Likewise.
30602 (parm_ref_data_pass_through_p): Likewise.
30603 (determine_known_aggregate_parts): Account own alias queries.
30605 2021-02-12 Martin Liska <mliska@suse.cz>
30607 * opts-common.c (decode_cmdline_option): Release werror_arg.
30608 * opts.c (gen_producer_string): Release output of
30609 gen_command_line_string.
30611 2021-02-12 Richard Biener <rguenther@suse.de>
30613 PR tree-optimization/38474
30614 * params.opt (-param=max-store-chains-to-track=): New param.
30615 (-param=max-stores-to-track=): Likewise.
30616 * doc/invoke.texi (max-store-chains-to-track): Document.
30617 (max-stores-to-track): Likewise.
30618 * gimple-ssa-store-merging.c (pass_store_merging::m_n_chains):
30620 (pass_store_merging::m_n_stores): Likewise.
30621 (pass_store_merging::terminate_and_process_chain): Update
30622 m_n_stores and m_n_chains.
30623 (pass_store_merging::process_store): Likewise. Terminate
30624 oldest chains if the number of stores or chains get too large.
30625 (imm_store_chain_info::terminate_and_process_chain): Dump
30628 2021-02-11 Eric Botcazou <ebotcazou@adacore.com>
30630 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
30631 the cold section, emit a nop before the directive if the previous
30632 active instruction can throw.
30634 2021-02-11 Peter Bergner <bergner@linux.ibm.com>
30637 * config/rs6000/predicates.md (mma_assemble_input_operand): Restrict
30638 memory addresses that are legal for quad word accesses.
30640 2021-02-11 Andrea Corallo <andrea.corallo@arm.com>
30643 * config/arm/thumb2.md (*doloop_end_internal): Generate
30644 alternative sequence to handle long range branches.
30646 2021-02-11 Joel Hutton <joel.hutton@arm.com>
30648 PR tree-optimization/98772
30649 * optabs-tree.c (supportable_half_widening_operation): New function
30650 to check for supportable V8QI->V8HI widening patterns.
30651 * optabs-tree.h (supportable_half_widening_operation): New function.
30652 * tree-vect-stmts.c (vect_create_half_widening_stmts): New function
30653 to create promotion stmts for V8QI->V8HI widening patterns.
30654 (vectorizable_conversion): Add case for V8QI->V8HI.
30656 2021-02-11 Richard Biener <rguenther@suse.de>
30658 * sparseset.h (SPARSESET_ELT_BITS): Remove.
30659 (SPARSESET_ELT_TYPE): Use unsigned int.
30660 * fwprop.c: Do not include sparseset.h.
30662 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30665 * varasm.c (declare_weak): For -fsyntax-only, allow even
30666 TREE_ASM_WRITTEN function decls.
30668 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30671 * config/i386/sse.md (fix<fixunssuffix>_truncv2sfv2di2,
30672 <insn>v8qiv8hi2, <insn>v8qiv8si2, <insn>v4qiv4si2, <insn>v4hiv4si2,
30673 <insn>v8qiv8di2, <insn>v4qiv4di2, <insn>v2qiv2di2, <insn>v4hiv4di2,
30674 <insn>v2hiv2di2, <insn>v2siv2di2): Force operands[1] into REG before
30675 calling simplify_gen_subreg on it.
30677 2021-02-10 Martin Liska <mliska@suse.cz>
30679 * config/nvptx/nvptx.c (nvptx_option_override): Use
30680 flag_patchable_function_entry instead of the removed
30681 function_entry_patch_area_size.
30683 2021-02-10 Martin Liska <mliska@suse.cz>
30685 PR tree-optimization/99002
30686 PR tree-optimization/99026
30687 * gimple-if-to-switch.cc (if_chain::is_beneficial): Fix memory
30688 leak when adjacent cases are merged.
30689 * tree-switch-conversion.c (switch_decision_tree::analyze_switch_statement): Use
30691 (make_pass_lower_switch): Remove trailing whitespace.
30692 * tree-switch-conversion.h (release_clusters): New.
30694 2021-02-10 Richard Biener <rguenther@suse.de>
30696 PR rtl-optimization/99054
30697 * cfgrtl.c (rtl-optimization/99054): Return an auto_vec.
30698 (fixup_partitions): Adjust.
30699 (rtl_verify_edges): Likewise.
30701 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30703 PR middle-end/99007
30704 * gimplify.c (gimplify_scan_omp_clauses): For MEM_REF on reductions,
30705 temporarily disable gimplify_ctxp->into_ssa around gimplify_expr
30708 2021-02-10 Richard Biener <rguenther@suse.de>
30711 * ipa-pure-const.c (propagate_malloc): Use an auto_vec<>
30714 2021-02-10 Richard Biener <rguenther@suse.de>
30716 PR tree-optimization/99024
30717 * tree-vect-loop.c (_loop_vec_info::~_loop_vec_info): Only
30718 clear loop->aux if it is associated with the destroyed loop_vinfo.
30720 2021-02-10 Martin Liska <mliska@suse.cz>
30722 PR tree-optimization/99002
30723 * gimple-if-to-switch.cc (find_conditions): Fix memory leak
30726 2021-02-10 Martin Liska <mliska@suse.cz>
30729 * ipa-icf.c (sem_item::add_reference): Fix memory leak when
30730 a reference exists.
30732 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30735 * dwarf2out.c (prune_unused_types_walk): Mark DW_TAG_variable DIEs
30736 at class scope for DWARF5+.
30738 2021-02-09 Eric Botcazou <ebotcazou@adacore.com>
30740 PR rtl-optimization/96015
30741 * reorg.c (skip_consecutive_labels): Minor comment tweaks.
30742 (relax_delay_slots): When deleting a jump to the next active
30743 instruction over a barrier, first delete the barrier if the
30744 jump is the only way to reach the target label.
30746 2021-02-09 Andre Vieira <andre.simoesdiasvieira@arm.com>
30748 * config/aarch64/aarch64-cost-tables.h: Add entries for vect.mul.
30749 * config/aarch64/aarch64.c (aarch64_rtx_mult_cost): Use vect.mul for
30750 vector multiplies and vect.alu for SSRA.
30751 * config/arm/aarch-common-protos.h (struct vector_cost_table): Define
30752 vect.mul cost field.
30753 * config/arm/aarch-cost-tables.h: Add entries for vect.mul.
30754 * config/arm/arm.c: Likewise.
30756 2021-02-09 Richard Biener <rguenther@suse.de>
30758 PR tree-optimization/98863
30759 * tree-ssa-sccvn.h (vn_avail::next_undo): Add.
30760 * tree-ssa-sccvn.c (last_pushed_avail): New global.
30761 (rpo_elim::eliminate_push_avail): Chain pushed avails.
30762 (unwind_state::avail_top): Add.
30763 (do_unwind): Rewrite unwinding of avail entries.
30764 (do_rpo_vn): Initialize last_pushed_avail and
30765 avail_top of the undo state.
30767 2021-02-09 Jakub Jelinek <jakub@redhat.com>
30769 PR middle-end/99004
30770 * calls.c (maybe_warn_rdwr_sizes): Change s0 and s1 type from
30771 const char * to char * and free those pointers after use.
30773 2021-02-09 Richard Biener <rguenther@suse.de>
30775 PR tree-optimization/99017
30776 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Allow
30777 zero vector cost entries.
30779 2021-02-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
30781 PR middle-end/98974
30782 * tree-vect-stmts.c (vectorizable_condition): Remove shadow vec_num
30783 parameter in vectorizable_condition.
30785 2021-02-08 Richard Biener <rguenther@suse.de>
30788 * tree.c (walk_tree_1): Walk VECTOR_CST elements.
30790 2021-02-08 Martin Liska <mliska@suse.cz>
30793 * cfgexpand.c (pass_expand::execute): Parse per-function option
30794 flag_patchable_function_entry and use it.
30795 * common.opt: Remove function_entry_patch_area_size and
30796 function_entry_patch_area_start global variables.
30797 * opts.c (parse_and_check_patch_area): New function.
30798 (common_handle_option): Use it.
30799 * opts.h (parse_and_check_patch_area): New function.
30800 * toplev.c (process_options): Parse and use
30801 function_entry_patch_area_size.
30803 2021-02-08 Martin Sebor <msebor@redhat.com>
30805 * doc/extend.texi (attribute malloc): Correct typos.
30807 2021-02-05 Nathan Sidwell <nathan@acm.org>
30810 * gcc.c (driver::maybe_run_linker): Check for input file
30811 accessibility if not linking.
30813 2021-02-05 Richard Biener <rguenther@suse.de>
30815 PR tree-optimization/98855
30816 * tree-vectorizer.h (add_stmt_cost): New overload.
30817 * tree-vect-slp.c (li_cost_vec_cmp): New.
30818 (vect_bb_slp_scalar_cost): Cost individual loop regions
30819 separately. Account for the scalar instance root stmt.
30821 2021-02-05 Tom de Vries <tdevries@suse.de>
30824 * tree-switch-conversion.c (jump_table_cluster::emit): Add loc
30826 (bit_test_cluster::emit): Reuse location_t for newly created
30828 (switch_decision_tree::try_switch_expansion): Preserve
30830 * tree-switch-conversion.h: Change function signatures.
30832 2021-02-05 Jakub Jelinek <jakub@redhat.com>
30835 * config/i386/i386-options.c (m_NONE, m_ALL): Define.
30836 * config/i386/x86-tune.def (X86_TUNE_BRANCH_PREDICTION_HINTS,
30837 X86_TUNE_PROMOTE_QI_REGS): Use m_NONE instead of 0U.
30838 (X86_TUNE_QIMODE_MATH): Use m_ALL instead of ~0U.
30840 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30842 * config/aarch64/aarch64-simd-builtins.def (get_high): Define builtin.
30843 * config/aarch64/aarch64-simd.md (aarch64_get_high<mode>): Define.
30844 * config/aarch64/arm_neon.h (__GET_HIGH): Delete.
30845 (vget_high_f16): Reimplement using new builtin.
30846 (vget_high_f32): Likewise.
30847 (vget_high_f64): Likewise.
30848 (vget_high_p8): Likewise.
30849 (vget_high_p16): Likewise.
30850 (vget_high_p64): Likewise.
30851 (vget_high_s8): Likewise.
30852 (vget_high_s16): Likewise.
30853 (vget_high_s32): Likewise.
30854 (vget_high_s64): Likewise.
30855 (vget_high_u8): Likewise.
30856 (vget_high_u16): Likewise.
30857 (vget_high_u32): Likewise.
30858 (vget_high_u64): Likewise.
30860 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30862 * config/aarch64/aarch64-simd-builtins.def (get_low): Define builtin.
30863 * config/aarch64/aarch64-simd.md (aarch64_get_low<mode>): Define.
30864 * config/aarch64/arm_neon.h (__GET_LOW): Delete.
30865 (vget_low_f16): Reimplement using new builtin.
30866 (vget_low_f32): Likewise.
30867 (vget_low_f64): Likewise.
30868 (vget_low_p8): Likewise.
30869 (vget_low_p16): Likewise.
30870 (vget_low_p64): Likewise.
30871 (vget_low_s8): Likewise.
30872 (vget_low_s16): Likewise.
30873 (vget_low_s32): Likewise.
30874 (vget_low_s64): Likewise.
30875 (vget_low_u8): Likewise.
30876 (vget_low_u16): Likewise.
30877 (vget_low_u32): Likewise.
30878 (vget_low_u64): Likewise.
30880 2021-02-05 Kito Cheng <kito.cheng@sifive.com>
30882 * gcc.c (print_multilib_info): Check all required argument is provided
30885 2021-02-05 liuhongt <hongtao.liu@intel.com>
30888 * config/i386/i386-expand.c (ix86_expand_sse_cmp): Don't
30889 generate integer mask comparison for 128/256-bits vector when
30890 op_true/op_false is NULL_RTX or CONSTM1_RTX/CONST0_RTX. Also
30891 delete redundant !maskcmp condition.
30892 (ix86_expand_int_vec_cmp): Ditto but no redundant deletion
30894 (ix86_expand_sse_movcc): Delete definition of maskcmp, add the
30895 condition directly to if (maskcmp), add extra check for
30896 cmpmode, it should be MODE_INT.
30897 (ix86_expand_fp_vec_cmp): Pass NULL to ix86_expand_sse_cmp's
30898 parameters op_true/op_false.
30899 (ix86_use_mask_cmp_p): New.
30901 2021-02-05 liuhongt <hongtao.liu@intel.com>
30904 * config/i386/x86-tune.def (X86_TUNE_AVX256_UNALIGNED_LOAD_OPTIMAL):
30905 Remove m_GENERIC from ~list.
30906 (X86_TUNE_AVX256_UNALIGNED_STORE_OPTIMAL): Ditto.
30908 2021-02-04 David Malcolm <dmalcolm@redhat.com>
30911 * diagnostic-show-locus.c (compatible_locations_p): Require
30912 locations in the same macro map to be either both from the
30913 macro definition, or both from the macro arguments.
30915 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
30917 * config/aarch64/aarch64-simd-builtins.def: Add
30918 [su]mull_hi_lane[q] builtin generator macros.
30919 * config/aarch64/aarch64-simd.md
30920 (aarch64_<su>mull_hi_lane<mode>_insn): Define.
30921 (aarch64_<su>mull_hi_lane<mode>): Define.
30922 (aarch64_<su>mull_hi_laneq<mode>_insn): Define.
30923 (aarch64_<su>mull_hi_laneq<mode>): Define.
30924 * config/aarch64/arm_neon.h (vmull_high_lane_s16): Use RTL
30925 builtin instead of inline asm.
30926 (vmull_high_lane_s32): Likewise.
30927 (vmull_high_lane_u16): Likewise.
30928 (vmull_high_lane_u32): Likewise.
30929 (vmull_high_laneq_s16): Likewise.
30930 (vmull_high_laneq_s32): Likewise.
30931 (vmull_high_laneq_u16): Likewise.
30932 (vmull_high_laneq_u32): Liekwise.
30934 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
30936 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_hi_n
30937 builtin generator macros.
30938 * config/aarch64/aarch64-simd.md
30939 (aarch64_<su>mull_hi_n<mode>_insn): Define.
30940 (aarch64_<su>mull_hi_n<mode>): Define.
30941 * config/aarch64/arm_neon.h (vmull_high_n_s16): Use RTL builtin
30942 instead of inline asm.
30943 (vmull_high_n_s32): Likewise.
30944 (vmull_high_n_u16): Likewise.
30945 (vmull_high_n_u32): Likewise.
30947 2021-02-04 Richard Biener <rguenther@suse.de>
30949 PR tree-optimization/98855
30950 * tree-vect-loop.c (vectorizable_phi): Do not cost
30951 single-argument PHIs.
30952 * tree-vect-slp.c (vect_bb_slp_scalar_cost): Likewise.
30953 * tree-vect-stmts.c (vectorizable_bswap): Also perform
30954 costing for SLP operation.
30956 2021-02-04 Martin Liska <mliska@suse.cz>
30958 * doc/extend.texi: Mention -mprefer-vector-width in target
30961 2021-02-03 Martin Sebor <msebor@redhat.com>
30963 PR tree-optimization/98937
30964 * tree-ssa-strlen.c (strlen_dom_walker::~strlen_dom_walker): Define.
30965 Flush pointer_query cache.
30967 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
30969 * config/rs6000/genfusion.pl (gen_2logical): Add missing
30970 fixes based on patch review.
30971 * config/rs6000/fusion.md: Regenerate file.
30973 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
30975 * config/rs6000/t-rs6000: Comment out auto generation of
30978 2021-02-03 Andrew Stubbs <ams@codesourcery.com>
30980 * config/gcn/gcn-opts.h (enum processor_type): Add PROCESSOR_GFX908.
30981 * config/gcn/gcn.c (gcn_omp_device_kind_arch_isa): Add gfx908.
30982 (output_file_start): Add gfx908.
30983 * config/gcn/gcn.opt (gpu_type): Add gfx908.
30984 * config/gcn/t-gcn-hsa (MULTILIB_OPTIONS): Add march=gfx908.
30985 (MULTILIB_DIRNAMES): Add gfx908.
30986 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): New define.
30987 (main): Recognize gfx908.
30988 * config/gcn/t-omp-device: Add gfx908.
30990 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30992 * config/aarch64/aarch64-simd-builtins.def: Add
30993 [su]mlsl_hi_lane[q] builtin macro generators.
30994 * config/aarch64/aarch64-simd.md
30995 (aarch64_<su>mlsl_hi_lane<mode>_insn): Define.
30996 (aarch64_<su>mlsl_hi_lane<mode>): Define.
30997 (aarch64_<su>mlsl_hi_laneq<mode>_insn): Define.
30998 (aarch64_<su>mlsl_hi_laneq<mode>): Define.
30999 * config/aarch64/arm_neon.h (vmlsl_high_lane_s16): Use RTL
31000 builtin instead of inline asm.
31001 (vmlsl_high_lane_s32): Likewise.
31002 (vmlsl_high_lane_u16): Likewise.
31003 (vmlsl_high_lane_u32): Likewise.
31004 (vmlsl_high_laneq_s16): Likewise.
31005 (vmlsl_high_laneq_s32): Likewise.
31006 (vmlsl_high_laneq_u16): Likewise.
31007 (vmlsl_high_laneq_u32): Likewise.
31008 (vmlal_high_laneq_u32): Likewise.
31010 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31012 * config/aarch64/aarch64-simd-builtins.def: Add
31013 [su]mlal_hi_lane[q] builtin generator macros.
31014 * config/aarch64/aarch64-simd.md
31015 (aarch64_<su>mlal_hi_lane<mode>_insn): Define.
31016 (aarch64_<su>mlal_hi_lane<mode>): Define.
31017 (aarch64_<su>mlal_hi_laneq<mode>_insn): Define.
31018 (aarch64_<su>mlal_hi_laneq<mode>): Define.
31019 * config/aarch64/arm_neon.h (vmlal_high_lane_s16): Use RTL
31020 builtin instead of inline asm.
31021 (vmlal_high_lane_s32): Likewise.
31022 (vmlal_high_lane_u16): Likewise.
31023 (vmlal_high_lane_u32): Likewise.
31024 (vmlal_high_laneq_s16): Likewise.
31025 (vmlal_high_laneq_s32): Likewise.
31026 (vmlal_high_laneq_u16): Likewise.
31027 (vmlal_high_laneq_u32): Likewise.
31029 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31031 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_hi_n
31032 builtin generator macros.
31033 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_hi_n<mode>_insn):
31035 (aarch64_<su>mlsl_hi_n<mode>): Define.
31036 * config/aarch64/arm_neon.h (vmlsl_high_n_s16): Use RTL builtin
31037 instead of inline asm.
31038 (vmlsl_high_n_s32): Likewise.
31039 (vmlsl_high_n_u16): Likewise.
31040 (vmlsl_high_n_u32): Likewise.
31042 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31044 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_hi_n
31045 builtin generator macros.
31046 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_hi_n<mode>_insn):
31048 (aarch64_<su>mlal_hi_n<mode>): Define.
31049 * config/aarch64/arm_neon.h (vmlal_high_n_s16): Use RTL builtin
31050 instead of inline asm.
31051 (vmlal_high_n_s32): Likewise.
31052 (vmlal_high_n_u16): Likewise.
31053 (vmlal_high_n_u32): Likewise.
31055 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
31057 * config/aarch64/aarch64-simd-builtins.def: Add RTL builtin
31059 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal_hi<mode>):
31061 (aarch64_<su>mlal_hi<mode>_insn): This.
31062 (aarch64_<su>mlal_hi<mode>): Define.
31063 * config/aarch64/arm_neon.h (vmlal_high_s8): Use RTL builtin
31064 instead of inline asm.
31065 (vmlal_high_s16): Likewise.
31066 (vmlal_high_s32): Likewise.
31067 (vmlal_high_u8): Likewise.
31068 (vmlal_high_u16): Likewise.
31069 (vmlal_high_u32): Likewise.
31071 2021-02-03 Ilya Leoshkevich <iii@linux.ibm.com>
31073 * lra-spills.c (remove_pseudos): Call lra_update_insn_recog_data()
31074 after calling alter_subreg() on a (mem).
31076 2021-02-03 Martin Liska <mliska@suse.cz>
31079 * lto-streamer-out.c (produce_lto_section): Fill up missing
31081 * lto-streamer.h (struct lto_section): Add _padding field.
31083 2021-02-03 Richard Biener <rguenther@suse.de>
31085 * lto-streamer.c (lto_get_section_name): Free temporary
31087 * tree-loop-distribution.c
31088 (loop_distribution::merge_dep_scc_partitions): Free edge data.
31090 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31092 PR middle-end/97487
31093 * ifcvt.c (noce_can_force_operand): New function.
31094 (noce_emit_move_insn): Use it.
31095 (noce_try_sign_mask): Likewise. Formatting fix.
31097 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31099 PR middle-end/97971
31100 * lra-constraints.c (process_alt_operands): For inline asm, don't call
31101 fatal_insn, but instead return false.
31103 2021-02-03 Jakub Jelinek <jakub@redhat.com>
31105 PR tree-optimization/98287
31106 * config/i386/mmx.md (<insn><mode>3): For shifts don't enable expander
31109 2021-02-03 Tamar Christina <tamar.christina@arm.com>
31111 PR tree-optimization/98928
31112 * tree-vect-loop.c (vect_analyze_loop_2): Change
31113 STMT_VINFO_SLP_VECT_ONLY to STMT_VINFO_SLP_VECT_ONLY_PATTERN.
31114 * tree-vect-slp-patterns.c (complex_pattern::build): Likewise.
31115 * tree-vectorizer.h (STMT_VINFO_SLP_VECT_ONLY_PATTERN): New.
31116 (class _stmt_vec_info): Add slp_vect_pattern_only_p.
31118 2021-02-02 Richard Biener <rguenther@suse.de>
31120 * gimple-loop-interchange.cc (prepare_data_references):
31122 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
31123 * tree-ssa-loop-im.c (hoist_memory_references): Likewise.
31124 * tree-vect-stmts.c (vectorizable_condition): Do not
31126 (vectorizable_comparison): Likewise.
31128 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31130 * config/aarch64/aarch64-simd-builtins.def (ursqrte): Define builtin.
31131 * config/aarch64/aarch64-simd.md (aarch64_ursqrte<mode>): New pattern.
31132 * config/aarch64/arm_neon.h (vrsqrte_u32): Reimplement using builtin.
31133 (vrsqrteq_u32): Likewise.
31135 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31137 * config/aarch64/aarch64-simd-builtins.def (sqxtun2): Define builtin.
31138 * config/aarch64/aarch64-simd.md (aarch64_sqxtun2<mode>_le): Define.
31139 (aarch64_sqxtun2<mode>_be): Likewise.
31140 (aarch64_sqxtun2<mode>): Likewise.
31141 * config/aarch64/arm_neon.h (vqmovun_high_s16): Reimplement using builtin.
31142 (vqmovun_high_s32): Likewise.
31143 (vqmovun_high_s64): Likewise.
31144 * config/aarch64/iterators.md (UNSPEC_SQXTUN2): Define.
31146 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31148 * config/aarch64/aarch64-simd-builtins.def (bfdot_lane, bfdot_laneq): Use
31150 (bfmlalb_lane, bfmlalt_lane, bfmlalb_lane_q, bfmlalt_lane_q): Use FP flags.
31152 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31154 * config/aarch64/aarch64-simd-builtins.def (fcmla_lane0, fcmla_lane90,
31155 fcmla_lane180, fcmla_lane270, fcmlaq_lane0, fcmlaq_lane90, fcmlaq_lane180,
31156 fcmlaq_lane270, scvtf, ucvtf, fcvtzs, fcvtzu, scvtfsi, scvtfdi, ucvtfsi,
31157 ucvtfdi, fcvtzshf, fcvtzuhf, fmlal_lane_low, fmlsl_lane_low,
31158 fmlal_laneq_low, fmlsl_laneq_low, fmlalq_lane_low, fmlslq_lane_low,
31159 fmlalq_laneq_low, fmlslq_laneq_low, fmlal_lane_high, fmlsl_lane_high,
31160 fmlal_laneq_high, fmlsl_laneq_high, fmlalq_lane_high, fmlslq_lane_high,
31161 fmlalq_laneq_high, fmlslq_laneq_high): Use FP flags.
31163 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31165 * config/aarch64/aarch64-builtins.c (FLAG_LOAD): Define.
31166 * config/aarch64/aarch64-simd-builtins.def (ld1x2, ld2, ld3, ld4, ld2r,
31167 ld3r, ld4r, ld1, ld1x3, ld1x4): Use LOAD flags.
31169 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31171 * config/aarch64/aarch64-simd-builtins.def (combine, zip1, zip2,
31172 uzp1, uzp2, trn1, trn2, simd_bsl): Use AUTO_FP flags.
31174 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31176 * config/aarch64/aarch64-simd-builtins.def (clrsb, clz, ctz, popcount,
31177 vec_smult_lane_, vec_smlal_lane_, vec_smult_laneq_, vec_smlal_laneq_,
31178 vec_umult_lane_, vec_umlal_lane_, vec_umult_laneq_, vec_umlal_laneq_,
31179 ashl, sshl, ushl, srshl, urshl, sdot_lane, udot_lane, sdot_laneq,
31180 udot_laneq, usdot_lane, usdot_laneq, sudot_lane, sudot_laneq, ashr,
31181 ashr_simd, lshr, lshr_simd, srshr_n, urshr_n, ssra_n, usra_n, srsra_n,
31182 ursra_n, sshll_n, ushll_n, sshll2_n, ushll2_n, ssri_n, usri_n, ssli_n,
31183 ssli_n, usli_n, bswap, rbit, simd_bsl, eor3q, rax1q, xarq, bcaxq): Use
31184 NONE builtin flags.
31186 2021-02-02 Jakub Jelinek <jakub@redhat.com>
31188 PR tree-optimization/98848
31189 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Punt if
31190 STMT_VINFO_DEF_TYPE (last_stmt_info) is vect_reduction_def.
31192 2021-02-02 Kito Cheng <kito.cheng@sifive.com>
31195 * expr.c: Check mode before calling store_expr.
31197 2021-02-02 Christophe Lyon <christophe.lyon@linaro.org>
31199 * config/arm/iterators.md (supf): Remove VORNQ_S and VORNQ_U.
31201 * config/arm/mve.md (mve_vornq_s<mode>): New entry for vorn
31202 instruction using expression ior.
31203 (mve_vornq_u<mode>): New expander.
31204 (mve_vornq_f<mode>): Use ior code instead of unspec.
31205 * config/arm/unspecs.md (VORNQ_S, VORNQ_U, VORNQ_F): Remove.
31207 2021-02-02 Alexandre Oliva <oliva@adacore.com>
31209 * tree-nested.c (convert_nonlocal_reference_op): Move
31210 current_function_decl restore after re-gimplification.
31211 (convert_local_reference_op): Likewise.
31213 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31215 * config/aarch64/aarch64-simd-builtins.def (rshrn, rshrn2):
31217 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>_insn_le):
31219 (aarch64_rshrn<mode>_insn_be): Likewise.
31220 (aarch64_rshrn<mode>): Likewise.
31221 (aarch64_rshrn2<mode>_insn_le): Likewise.
31222 (aarch64_rshrn2<mode>_insn_be): Likewise.
31223 (aarch64_rshrn2<mode>): Likewise.
31224 * config/aarch64/aarch64.md (unspec): Add UNSPEC_RSHRN.
31225 * config/aarch64/arm_neon.h (vrshrn_high_n_s16): Reimplement
31227 (vrshrn_high_n_s32): Likewise.
31228 (vrshrn_high_n_s64): Likewise.
31229 (vrshrn_high_n_u16): Likewise.
31230 (vrshrn_high_n_u32): Likewise.
31231 (vrshrn_high_n_u64): Likewise.
31232 (vrshrn_n_s16): Likewise.
31233 (vrshrn_n_s32): Likewise.
31234 (vrshrn_n_s64): Likewise.
31235 (vrshrn_n_u16): Likewise.
31236 (vrshrn_n_u32): Likewise.
31237 (vrshrn_n_u64): Likewise.
31239 2021-02-01 Sergei Trofimovich <siarheit@google.com>
31241 PR tree-optimization/98499
31242 * ipa-modref.c (analyze_ssa_name_flags): treat RVO
31243 conservatively and assume all possible side-effects.
31245 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31247 * config/aarch64/aarch64-simd-builtins.def (vec_unpacks_hi,
31248 vec_unpacku_hi_): Define builtins.
31249 * config/aarch64/arm_neon.h (vmovl_high_s8): Reimplement using
31251 (vmovl_high_s16): Likewise.
31252 (vmovl_high_s32): Likewise.
31253 (vmovl_high_u8): Likewise.
31254 (vmovl_high_u16): Likewise.
31255 (vmovl_high_u32): Likewise.
31257 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31259 * config/aarch64/aarch64-simd-builtins.def (sabdl, uabdl):
31261 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl<mode>): New
31263 * config/aarch64/aarch64.md (unspec): Define UNSPEC_SABDL,
31265 * config/aarch64/arm_neon.h (vabdl_s8): Reimplemet using
31267 (vabdl_s16): Likewise.
31268 (vabdl_s32): Likewise.
31269 (vabdl_u8): Likewise.
31270 (vabdl_u16): Likewise.
31271 (vabdl_u32): Likewise.
31272 * config/aarch64/iterators.md (ABDL): New int iterator.
31273 (sur): Handle UNSPEC_SABDL, UNSPEC_UABDL.
31275 2021-02-01 Martin Sebor <msebor@redhat.com>
31277 * tree.h (BLOCK_VARS): Add comment.
31278 (BLOCK_SUBBLOCKS): Same.
31279 (BLOCK_SUPERCONTEXT): Same.
31280 (BLOCK_ABSTRACT_ORIGIN): Same.
31281 (inlined_function_outer_scope_p): Same.
31283 2021-02-01 Martin Sebor <msebor@redhat.com>
31285 PR middle-end/97172
31286 * attribs.c (attr_access::free_lang_data): Define new function.
31287 * attribs.h (attr_access::free_lang_data): Declare new function.
31289 2021-02-01 Richard Biener <rguenther@suse.de>
31291 * vec.h (auto_vec::auto_vec): Add memory stat parameters
31293 * bitmap.h (auto_bitmap::auto_bitmap): Likewise.
31295 2021-02-01 Tamar Christina <tamar.christina@arm.com>
31297 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>,
31298 aarch64_<su>mlsl<mode>, aarch64_<su>mlsl_n<mode>): Flip mult operands.
31300 2021-02-01 Richard Biener <rguenther@suse.de>
31302 PR rtl-optimization/98863
31303 * config/i386/i386-features.c (convert_scalars_to_vector):
31304 Set DF_RD_PRUNE_DEAD_DEFS.
31306 2021-01-31 Eric Botcazou <ebotcazou@adacore.com>
31308 * system.h (SIZE_MAX): Define if not already defined.
31310 2021-01-30 Aaron Sawdey <acsawdey@linux.ibm.com>
31312 * config/rs6000/genfusion.pl (gen_2logical): New function to
31313 generate patterns for logical-logical fusion.
31314 * config/rs6000/fusion.md: Regenerated patterns.
31315 * config/rs6000/rs6000-cpus.def: Add
31316 OPTION_MASK_P10_FUSION_2LOGICAL.
31317 * config/rs6000/rs6000.c (rs6000_option_override_internal):
31318 Enable logical-logical fusion for p10.
31319 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2logical.
31321 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
31323 * config/rs6000/rs6000.opt: Add periods to new AIX options.
31325 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
31327 * config/rs6000/rs6000.opt (mabi=vec-extabi): New.
31328 (mabi=vec-default): New.
31329 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
31330 __EXTABI__ for AIX Vector extended ABI.
31331 * config/rs6000/rs6000.c (rs6000_debug_reg_global): Print AIX Vector
31333 (conditional_register_usage): If AIX vec_extabi enabled, vs20-vs31
31335 * doc/invoke.texi (PowerPC mabi): Add AIX vec-extabi and vec-default.
31337 2021-01-30 Jakub Jelinek <jakub@redhat.com>
31339 * config/i386/i386-features.c (remove_partial_avx_dependency): Clear
31340 DF_DEFER_INSN_RESCAN after calling df_process_deferred_rescans.
31342 2021-01-29 Vladimir N. Makarov <vmakarov@redhat.com>
31345 * lra-constraints.c (in_class_p): Don't narrow class only for REG
31348 2021-01-29 Will Schmidt <will_schmidt@vnet.ibm.com>
31350 * config/rs6000/rs6000-call.c (rs6000_expand_binup_builtin): Add
31351 clauses for CODE_FOR_vsx_xvcvuxddp_scale and
31352 CODE_FOR_vsx_xvcvsxddp_scale to the parameter checking code.
31354 2021-01-29 Andrew MacLeod <amacleod@redhat.com>
31356 PR tree-optimization/98866
31357 * gimple-range-gori.h (gori_compute:set_range_invariant): New.
31358 * gimple-range-gori.cc (gori_map::set_range_invariant): New.
31359 (gori_map::m_maybe_invariant): Rename from all_outgoing.
31360 (gori_map::gori_map): Rename all_outgoing to m_maybe_invariant.
31361 (gori_map::is_export_p): Ditto.
31362 (gori_map::calculate_gori): Ditto.
31363 (gori_compute::set_range_invariant): New.
31364 * gimple-range.cc (gimple_ranger::range_of_stmt): Set range
31365 invariant for pointers evaluating to [1, +INF].
31367 2021-01-29 Richard Biener <rguenther@suse.de>
31369 PR rtl-optimization/98863
31370 * config/i386/i386-features.c (remove_partial_avx_dependency):
31371 Do not perform DF analysis.
31372 (pass_data_remove_partial_avx_dependency): Remove
31375 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
31377 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_n
31378 builtin generator macros.
31379 * config/aarch64/aarch64-simd.md (aarch64_<su>mull_n<mode>):
31381 * config/aarch64/arm_neon.h (vmull_n_s16): Use RTL builtin
31382 instead of inline asm.
31383 (vmull_n_s32): Likewise.
31384 (vmull_n_u16): Likewise.
31385 (vmull_n_u32): Likewise.
31387 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31389 * config/aarch64/aarch64-simd-builtins.def (sabdl2, uabdl2):
31391 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl2<mode>_3):
31393 (aarch64_<sur>abdl2<mode>): ... This.
31394 (<sur>sadv16qi): Adjust use of above.
31395 * config/aarch64/arm_neon.h (vabdl_high_s8): Reimplement using
31397 (vabdl_high_s16): Likewise.
31398 (vabdl_high_s32): Likewise.
31399 (vabdl_high_u8): Likewise.
31400 (vabdl_high_u16): Likewise.
31401 (vabdl_high_u32): Likewise.
31403 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31405 * config/aarch64/aarch64-simd-builtins.def (sabal2): Define
31407 (uabal2): Likewise.
31408 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal2<mode>): New
31410 * config/aarch64/aarch64.md (unspec): Add UNSPEC_SABAL2 and
31412 * config/aarch64/arm_neon.h (vabal_high_s8): Reimplement using
31414 (vabal_high_s16): Likewise.
31415 (vabal_high_s32): Likewise.
31416 (vabal_high_u8): Likewise.
31417 (vabal_high_u16): Likewise.
31418 (vabal_high_u32): Likewise.
31419 * config/aarch64/iterators.md (ABAL2): New mode iterator.
31420 (sur): Handle UNSPEC_SABAL2, UNSPEC_UABAL2.
31422 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31424 * config/aarch64/aarch64-simd-builtins.def (sabal): Define
31427 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal<mode>_4):
31429 (aarch64_<sur>abal<mode>): ... This
31430 (<sur>sadv16qi): Adust use of the above.
31431 * config/aarch64/arm_neon.h (vabal_s8): Reimplement using
31433 (vabal_s16): Likewise.
31434 (vabal_s32): Likewise.
31435 (vabal_u8): Likewise.
31436 (vabal_u16): Likewise.
31437 (vabal_u32): Likewise.
31439 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31441 * config/aarch64/aarch64-simd-builtins.def (saddlv, uaddlv):
31443 * config/aarch64/aarch64-simd.md (aarch64_<su>addlv<mode>):
31445 * config/aarch64/arm_neon.h (vaddlv_s8): Reimplement using
31447 (vaddlv_s16): Likewise.
31448 (vaddlv_u8): Likewise.
31449 (vaddlv_u16): Likewise.
31450 (vaddlvq_s8): Likewise.
31451 (vaddlvq_s16): Likewise.
31452 (vaddlvq_s32): Likewise.
31453 (vaddlvq_u8): Likewise.
31454 (vaddlvq_u16): Likewise.
31455 (vaddlvq_u32): Likewise.
31456 (vaddlv_s32): Likewise.
31457 (vaddlv_u32): Likewise.
31458 * config/aarch64/iterators.md (VDQV_L): New mode iterator.
31459 (unspec): Add UNSPEC_SADDLV, UNSPEC_UADDLV.
31460 (Vwstype): New mode attribute.
31462 (VWIDE_S): Likewise.
31463 (USADDLV): New int iterator.
31464 (su): Handle UNSPEC_SADDLV, UNSPEC_UADDLV.
31466 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
31468 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_lane[q]
31469 builtin generator macros.
31470 * config/aarch64/aarch64-simd.md (aarch64_vec_<su>mlsl_lane<Qlane>):
31472 * config/aarch64/arm_neon.h (vmlsl_lane_s16): Use RTL builtin
31473 instead of inline asm.
31474 (vmlsl_lane_s32): Likewise.
31475 (vmlsl_lane_u16): Likewise.
31476 (vmlsl_lane_u32): Likewise.
31477 (vmlsl_laneq_s16): Likewise.
31478 (vmlsl_laneq_s32): Likewise.
31479 (vmlsl_laneq_u16): Likewise.
31480 (vmlsl_laneq_u32): Likewise.
31482 2021-01-29 Richard Biener <rguenther@suse.de>
31484 * doc/invoke.texi (--param max-gcse-memory): Document unit
31486 * gcse.c (gcse_or_cprop_is_too_expensive): Adjust.
31487 * params.opt (--param max-gcse-memory): Adjust default and
31488 document unit of size.
31490 2021-01-29 Richard Biener <rguenther@suse.de>
31492 PR rtl-optimization/98863
31493 * gcse.c (gcse_or_cprop_is_too_expensive): Use unsigned
31494 HOST_WIDE_INT for the memory estimate.
31496 2021-01-29 Bin Cheng <bin.cheng@linux.alibaba.com>
31497 Richard Biener <rguenther@suse.de>
31499 PR tree-optimization/97627
31500 * tree-ssa-loop-niter.c (number_of_iterations_exit_assumptions):
31501 Do not analyze fake edges.
31503 2021-01-29 Richard Biener <rguenther@suse.de>
31505 PR rtl-optimization/98144
31506 * df.h (df_mir_bb_info): Add con_visited member.
31507 * df-problems.c (df_mir_alloc): Initialize con_visited,
31508 do not fully populate IN and OUT.
31509 (df_mir_reset): Likewise.
31510 (df_mir_confluence_0): Set con_visited.
31511 (df_mir_confluence_n): Properly handle implicitely
31512 fully populated IN and OUT as designated by con_visited
31513 and update con_visited accordingly.
31515 2021-01-29 Jakub Jelinek <jakub@redhat.com>
31518 * config/arm/vec-common.md (mve_vshlq_<supf><mode>,
31519 vashl<mode>3, vashr<mode>3, vlshr<mode>3): Add
31520 && !TARGET_REALLY_IWMMXT to conditions.
31522 2021-01-29 Jakub Jelinek <jakub@redhat.com>
31525 * cfgbuild.c (find_bb_boundaries): Reset debug_insn when seeing
31528 2021-01-28 Marek Polacek <polacek@redhat.com>
31531 * stor-layout.c (finalize_type_size): If we reset TYPE_USER_ALIGN in
31532 the main variant, maybe reset it in its variants too.
31533 * tree.c (check_base_type): Return true only if TYPE_USER_ALIGN match.
31534 (check_aligned_type): Check if TYPE_USER_ALIGN match.
31536 2021-01-28 Christophe Lyon <christophe.lyon@linaro.org>
31539 * config/arm/arm.c (arm_rtx_costs_internal): Adjust cost of vector
31540 of constant zero for comparisons.
31542 2021-01-28 Michael Meissner <meissner@linux.ibm.com>
31544 * config/rs6000/rs6000.c (rs6000_mangle_decl_assembler_name): Add
31545 support for mapping built-in function names for long double
31546 built-in functions if long double is IEEE 128-bit.
31548 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
31550 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_n
31551 builtin generator macros.
31552 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_n<mode>):
31554 * config/aarch64/arm_neon.h (vmlsl_n_s16): Use RTL builtin
31555 instead of inline asm.
31556 (vmlsl_n_s32): Likewise.
31557 (vmlsl_n_u16): Likewise.
31558 (vmlsl_n_u32): Likewise.
31560 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
31562 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_n
31563 builtin generator macros.
31564 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>):
31566 * config/aarch64/arm_neon.h (vmlal_n_s16): Use RTL builtin
31567 instead of inline asm.
31568 (vmlal_n_s32): Likewise.
31569 (vmlal_n_u16): Likewise.
31570 (vmlal_n_u32): Likewise.
31572 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31574 * config/aarch64/aarch64-simd-builtins.def (shrn2): Define
31576 * config/aarch64/aarch64-simd.md (aarch64_shrn2<mode>_insn_le):
31578 (aarch64_shrn2<mode>_insn_be): Likewise.
31579 (aarch64_shrn2<mode>): Likewise.
31580 * config/aarch64/arm_neon.h (vshrn_high_n_s16): Reimlplement
31582 (vshrn_high_n_s32): Likewise.
31583 (vshrn_high_n_s64): Likewise.
31584 (vshrn_high_n_u16): Likewise.
31585 (vshrn_high_n_u32): Likewise.
31586 (vshrn_high_n_u64): Likewise.
31588 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31590 * config/aarch64/aarch64-simd-builtins.def (shrn): Define
31592 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le):
31594 (aarch64_shrn<mode>_insn_be): Likewise.
31595 (aarch64_shrn<mode>): Likewise.
31596 * config/aarch64/arm_neon.h (vshrn_n_s16): Reimplement using
31598 (vshrn_n_s32): Likewise.
31599 (vshrn_n_s64): Likewise.
31600 (vshrn_n_u16): Likewise.
31601 (vshrn_n_u32): Likewise.
31602 (vshrn_n_u64): Likewise.
31603 * config/aarch64/iterators.md (vn_mode): New mode attribute.
31605 2021-01-28 Richard Biener <rguenther@suse.de>
31607 PR rtl-optimization/80960
31608 * dse.c (check_mem_read_rtx): Call get_addr on the
31611 2021-01-28 Xionghu Luo <luoxhu@linux.ibm.com>
31612 David Edelsohn <dje.gcc@gmail.com>
31615 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31616 Don't generate VIEW_CONVERT_EXPR for fcode ALTIVEC_BUILTIN_VEC_INSERT
31618 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
31620 * config/rs6000/rs6000.c (rs6000_expand_vector_set): Remove the
31621 wrapper call rs6000_expand_vector_set_var for cleanup. Call
31622 rs6000_expand_vector_set_var_p9 and rs6000_expand_vector_set_var_p8
31624 (rs6000_expand_vector_set_var): Delete.
31625 (rs6000_expand_vector_set_var_p9): Make static.
31626 (rs6000_expand_vector_set_var_p8): Make static.
31628 2021-01-28 Xing GUO <higuoxing@gmail.com>
31630 * common/config/riscv/riscv-common.c
31631 (riscv_subset_list::parsing_subset_version): Fix -march option parsing
31632 when `p` extension exists.
31634 2021-01-27 Vladimir N. Makarov <vmakarov@redhat.com>
31636 PR rtl-optimization/97684
31637 * ira.c (ira): Call ira_set_pseudo_classes before
31638 update_equiv_regs when it is necessary.
31640 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31643 * config/aarch64/aarch64.md (*aarch64_bfxilsi_uxtw): Use
31644 %w0, %w1 and %2 instead of %0, %1 and %2.
31646 2021-01-27 Aaron Sawdey <acsawdey@linux.ibm.com>
31648 * config/rs6000/genfusion.pl: New script to generate
31649 define_insn_and_split patterns so combine can arrange fused
31650 instructions next to each other.
31651 * config/rs6000/fusion.md: New file, generated fused instruction
31652 patterns for combine.
31653 * config/rs6000/predicates.md (const_m1_to_1_operand): New predicate.
31654 (non_update_memory_operand): New predicate.
31655 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_P10_FUSION and
31656 OPTION_MASK_P10_FUSION_LD_CMPI to ISA_3_1_MASKS_SERVER and
31658 * config/rs6000/rs6000-protos.h (address_is_non_pfx_d_or_x): Add
31660 * config/rs6000/rs6000.c (rs6000_option_override_internal):
31661 Automatically set OPTION_MASK_P10_FUSION and
31662 OPTION_MASK_P10_FUSION_LD_CMPI if target is power10.
31663 (rs600_opt_masks): Allow -mpower10-fusion
31664 in function attributes.
31665 (address_is_non_pfx_d_or_x): New function.
31666 * config/rs6000/rs6000.h: Add MASK_P10_FUSION.
31667 * config/rs6000/rs6000.md: Include fusion.md.
31668 * config/rs6000/rs6000.opt: Add -mpower10-fusion
31669 and -mpower10-fusion-ld-cmpi.
31670 * config/rs6000/t-rs6000: Add dependencies involving fusion.md.
31672 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31674 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal
31675 builtin generator macros.
31676 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal<mode>):
31678 (aarch64_<su>mlal<mode>): This.
31679 * config/aarch64/arm_neon.h (vmlal_s8): Use RTL builtin
31680 instead of inline asm.
31681 (vmlal_s16): Likewise.
31682 (vmlal_s32): Likewise.
31683 (vmlal_u8): Likewise.
31684 (vmlal_u16): Likewise.
31685 (vmlal_u32): Likewise.
31687 2021-01-27 Richard Biener <rguenther@suse.de>
31689 PR tree-optimization/98854
31690 * tree-vect-slp.c (vect_build_slp_tree_2): Also build
31691 PHIs from scalars when the number of CTORs matches the
31692 number of children.
31694 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31696 * config/aarch64/aarch64-simd-builtins.def: Add mls_n builtin
31698 * config/aarch64/aarch64-simd.md (*aarch64_mls_elt_merge<mode>):
31700 (aarch64_mls_n<mode>): This.
31701 * config/aarch64/arm_neon.h (vmls_n_s16): Use RTL builtin
31703 (vmls_n_s32): Likewise.
31704 (vmls_n_u16): Likewise.
31705 (vmls_n_u32): Likewise.
31706 (vmlsq_n_s16): Likewise.
31707 (vmlsq_n_s32): Likewise.
31708 (vmlsq_n_u16): Likewise.
31709 (vmlsq_n_u32): Likewise.
31711 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31713 * config/aarch64/aarch64-simd-builtins.def: Add mls builtin
31715 * config/aarch64/arm_neon.h (vmls_s8): Use RTL builtin rather
31717 (vmls_s16): Likewise.
31718 (vmls_s32): Likewise.
31719 (vmls_u8): Likewise.
31720 (vmls_u16): Likewise.
31721 (vmls_u32): Likewise.
31722 (vmlsq_s8): Likewise.
31723 (vmlsq_s16): Likewise.
31724 (vmlsq_s32): Likewise.
31725 (vmlsq_u8): Likewise.
31726 (vmlsq_u16): Likewise.
31727 (vmlsq_u32): Likewise.
31729 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31731 * config/aarch64/aarch64-simd-builtins.def: Add mla_n builtin
31733 * config/aarch64/aarch64-simd.md (*aarch64_mla_elt_merge<mode>):
31735 (aarch64_mla_n<mode>): This.
31736 * config/aarch64/arm_neon.h (vmla_n_s16): Use RTL builtin
31738 (vmla_n_s32): Likewise.
31739 (vmla_n_u16): Likewise.
31740 (vmla_n_u32): Likewise.
31741 (vmlaq_n_s16): Likewise.
31742 (vmlaq_n_s32): Likewise.
31743 (vmlaq_n_u16): Likewise.
31744 (vmlaq_n_u32): Likewise.
31746 2021-01-27 liuhongt <hongtao.liu@intel.com>
31749 * config/i386/sse.md (sse2_gt<mode>3): Drop !TARGET_XOP in condition.
31750 (*sse2_eq<mode>3): Ditto.
31752 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31754 * tree-pass.h (PROP_trees): Rename to ...
31755 (PROP_gimple): ... this.
31756 * cfgexpand.c (pass_data_expand): Replace PROP_trees with PROP_gimple.
31757 * passes.c (execute_function_dump, execute_function_todo,
31758 execute_one_ipa_transform_pass, execute_one_pass): Likewise.
31759 * varpool.c (ctor_for_folding): Likewise.
31761 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31763 PR tree-optimization/97260
31764 * varpool.c: Include tree-pass.h.
31765 (ctor_for_folding): In GENERIC return DECL_INITIAL for TREE_READONLY
31766 non-TREE_SIDE_EFFECTS automatic variables.
31768 2021-01-26 Paul Fee <paul.f.fee@gmail.com>
31770 * doc/cpp.texi (__cplusplus): Document value for -std=c++23
31772 * doc/invoke.texi: Document -std=c++23 and -std=gnu++23.
31773 * dwarf2out.c (highest_c_language): Recognise C++20 and C++23.
31774 (gen_compile_unit_die): Recognise C++23.
31776 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31779 * dwarf2asm.c (dw2_assemble_integer): Cast DWARF2_ADDR_SIZE to int
31782 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31785 * config/aarch64/aarch64.c (aarch64_mask_and_shift_for_ubfiz_p):
31786 Use UINTVAL (shft_amnt) and UINTVAL (mask) instead of INTVAL (shft_amnt)
31787 and INTVAL (mask). Add && INTVAL (mask) > 0 condition.
31789 2021-01-26 Richard Biener <rguenther@suse.de>
31791 * gimple-pretty-print.c (dump_binary_rhs): Handle
31792 VEC_WIDEN_{PLUS,MINUS}_{LO,HI}_EXPR.
31794 2021-01-26 Richard Biener <rguenther@suse.de>
31796 PR middle-end/98726
31797 * tree.h (vector_cst_int_elt): Remove.
31798 * tree.c (vector_cst_int_elt): Use poly_wide_int for computations,
31801 2021-01-26 Andrew Stubbs <ams@codesourcery.com>
31803 * config/gcn/gcn.c (gcn_expand_reduc_scalar): Use move instructions
31804 for V64DFmode min/max reductions.
31806 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31808 * dwarf2asm.c (dw2_assemble_integer): Handle size twice as large
31809 as DWARF2_ADDR_SIZE if x is not a scalar int by emitting it as
31810 two halves, one with x and the other with const0_rtx, ordered
31811 depending on endianity.
31813 2021-01-26 Alexandre Oliva <oliva@adacore.com>
31815 * gimplify.c (gimplify_decl_expr): Skip asan marking calls for
31816 temporaries not seen in binding block, and not about to be
31817 added as gimple variables.
31819 2021-01-25 Martin Sebor <msebor@redhat.com>
31822 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Adjust warning text.
31824 2021-01-25 Martin Liska <mliska@suse.cz>
31826 * value-prof.c (get_nth_most_common_value): Use %s instead
31829 2021-01-25 Jakub Jelinek <jakub@redhat.com>
31832 * configure.ac (HAVE_AS_GDWARF_5_DEBUG_FLAG): Only define if
31833 readelf -wi is able to read the emitted .debug_info back.
31834 * configure: Regenerated.
31836 2021-01-25 Martin Liska <mliska@suse.cz>
31838 PR gcov-profile/98739
31839 * common.opt: Add missing sign symbol.
31840 * value-prof.c (get_nth_most_common_value): Restore handling
31841 of PROFILE_REPRODUCIBILITY_PARALLEL_RUNS and
31842 PROFILE_REPRODUCIBILITY_MULTITHREADED.
31844 2021-01-25 Richard Biener <rguenther@suse.de>
31846 PR middle-end/98807
31847 * tree.c (vector_element_bits): Always use precision of
31848 the element type for boolean vectors.
31850 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
31852 * config/rtems.h (STARTFILE_SPEC): Remove qnolinkcmds.
31853 (ENDFILE_SPEC): Evaluate qnolinkcmds.
31855 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
31857 * config/rtems.h (STARTFILE_SPEC): Remove nostdlib and
31858 nostartfiles handling since this is already done by
31859 LINK_COMMAND_SPEC. Evaluate qnolinkcmds.
31860 (ENDFILE_SPEC): Remove nostdlib and nostartfiles handling since this
31861 is already done by LINK_COMMAND_SPEC.
31862 (LIB_SPECS): Remove nostdlib and nodefaultlibs handling since
31863 this is already done by LINK_COMMAND_SPEC. Remove qnolinkcmds
31866 2021-01-25 Jakub Jelinek <jakub@redhat.com>
31869 * fold-const-call.c (host_size_t_cst_p): Renamed to ...
31870 (size_t_cst_p): ... this. Check and store unsigned HOST_WIDE_INT
31871 value rather than host size_t.
31872 (fold_const_call): Change type of s2 from size_t to
31873 unsigned HOST_WIDE_INT. Use size_t_cst_p instead of
31874 host_size_t_cst_p. For strncmp calls, pass MIN (s2, SIZE_MAX)
31875 instead of s2 as last argument.
31877 2021-01-25 Tamar Christina <tamar.christina@arm.com>
31879 * config/arm/iterators.md (rotsplit1, rotsplit2, conj_op, fcmac1,
31880 VCMLA_OP, VCMUL_OP): New.
31881 * config/arm/mve.md (mve_vcmlaq<mve_rot><mode>): Support vec_dup 0.
31882 * config/arm/neon.md (cmul<conj_op><mode>3): New.
31883 * config/arm/unspecs.md (UNSPEC_VCMLA_CONJ, UNSPEC_VCMLA180_CONJ,
31884 UNSPEC_VCMUL_CONJ): New.
31885 * config/arm/vec-common.md (cmul<conj_op><mode>3, arm_vcmla<rot><mode>,
31886 cml<fcmac1><conj_op><mode>4): New.
31888 2021-01-23 Jakub Jelinek <jakub@redhat.com>
31891 * config/rs6000/mmintrin.h (__m64): Add __may_alias__ attribute.
31893 2021-01-22 Jonathan Wright <jonathan.wright@arm.com>
31895 * config/aarch64/aarch64-simd-builtins.def: Add mla builtin
31897 * config/aarch64/arm_neon.h (vmla_s8): Use RTL builtin rather
31899 (vmla_s16): Likewise.
31900 (vmla_s32): Likewise.
31901 (vmla_u8): Likewise.
31902 (vmla_u16): Likewise.
31903 (vmla_u32): Likewise.
31904 (vmlaq_s8): Likewise.
31905 (vmlaq_s16): Likewise.
31906 (vmlaq_s32): Likewise.
31907 (vmlaq_u8): Likewise.
31908 (vmlaq_u16): Likewise.
31909 (vmlaq_u32): Likewise.
31911 2021-01-22 David Malcolm <dmalcolm@redhat.com>
31913 * doc/invoke.texi (GCC_EXTRA_DIAGNOSTIC_OUTPUT): Add @findex
31916 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31919 * dwarf2out.c (output_file_names): For -gdwarf-5, if there are no
31920 filenames to emit, still emit the required 0 index directory and
31921 filename entries that match DW_AT_comp_dir and DW_AT_name of the
31924 2021-01-22 Marek Polacek <polacek@redhat.com>
31927 * doc/invoke.texi: Update C++ ABI Version 15 description.
31929 2021-01-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31931 PR tree-optimization/98766
31932 * tree-ssa-math-opts.c (convert_mult_to_fma): Use maybe_le when
31933 comparing against type size with param_avoid_fma_max_bits.
31935 2021-01-22 Richard Biener <rguenther@suse.de>
31937 PR middle-end/98793
31938 * tree.c (vector_element_bits): Key single-bit bool vector on
31939 integer mode rather than not vector mode.
31941 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
31944 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31945 Generate ARRAY_REF(VIEW_CONVERT_EXPR) for P8 and later
31947 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): Update
31948 to call different path for P8 and P9.
31949 (rs6000_expand_vector_set_var_p9): New function.
31950 (rs6000_expand_vector_set_var_p8): New function.
31952 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
31956 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31957 Ajdust variable index vec_insert from address dereference to
31958 ARRAY_REF(VIEW_CONVERT_EXPR) tree expression.
31959 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
31961 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): New function.
31963 2021-01-22 Martin Liska <mliska@suse.cz>
31965 PR gcov-profile/98739
31966 * profile.c (compute_value_histograms): Drop time profile for
31967 -fprofile-reproducible=multithreaded.
31969 2021-01-22 Nathan Sidwell <nathan@acm.org>
31971 * gcc.c (process_command): Don't check OPT_SPECIAL_input_file
31974 2021-01-22 Richard Biener <rguenther@suse.de>
31976 PR middle-end/98773
31977 * tree-data-ref.c (initalize_matrix_A): Revert previous
31978 change, retaining failing on HOST_WIDE_INT_MIN CHREC_RIGHT.
31980 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31982 PR tree-optimization/90248
31983 * match.pd (X cmp 0.0 ? 1.0 : -1.0 -> copysign(1, +-X),
31984 X cmp 0.0 ? -1.0 : +1.0 -> copysign(1, -+X)): Remove
31986 (X * (X cmp 0.0 ? 1.0 : -1.0) -> +-abs(X),
31987 X * (X cmp 0.0 ? -1.0 : 1.0) -> +-abs(X)): New simplifications.
31989 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31991 PR tree-optimization/98255
31992 * tree-dfa.c (get_ref_base_and_extent): For ARRAY_REFs, sign
31993 extend index - low_bound from sizetype's precision rather than index
31995 (get_addr_base_and_unit_offset_1): Likewise.
31996 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Likewise.
31997 * gimple-fold.c (fold_const_aggregate_ref_1): Likewise.
31999 2021-01-22 Richard Biener <rguenther@suse.de>
32001 PR tree-optimization/98786
32002 * tree-ssa-phiopt.c (factor_out_conditional_conversion): Avoid
32003 adding new uses of abnormals. Verify we deal with a conditional
32006 2021-01-22 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
32009 * optc-save-gen.awk: Add arm_fp16_format to checked_options.
32011 2021-01-22 liuhongt <hongtao.liu@intel.com>
32015 * config/i386/sse.md (VI_128_256): New mode iterator.
32016 (*avx_cmp<mode>3_1, *avx_cmp<mode>3_2, *avx_cmp<mode>3_3,
32017 *avx_cmp<mode>3_4, *avx2_eq<mode>3, *avx2_pcmp<mode>3_1,
32018 *avx2_pcmp<mode>3_2, *avx2_gt<mode>3): New
32019 define_insn_and_split to lower avx512 vector comparison to avx
32020 version when dest is vector.
32021 (*<avx512>_cmp<mode>3,*<avx512>_cmp<mode>3,*<avx512>_ucmp<mode>3):
32022 define_insn_and_split for negating the comparison result.
32023 * config/i386/predicates.md (float_vector_all_ones_operand):
32025 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Use
32026 general NOT operator without UNSPEC_MASKOP.
32028 2021-01-21 Vladimir N. Makarov <vmakarov@redhat.com>
32030 PR rtl-optimization/98777
32031 * lra-int.h (lra_pmode_pseudo): New extern.
32032 * lra.c (lra_pmode_pseudo): New global.
32034 * lra-eliminations.c (eliminate_regs_in_insn): Use it.
32036 2021-01-21 Ilya Leoshkevich <iii@linux.ibm.com>
32038 * fwprop.c (fwprop_propagation::classify_result): Allow
32039 (subreg (mem)) simplifications.
32041 2021-01-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32043 * config/aarch64/aarch64-simd.md (aarch64_sqdml<SBINQOPS:as>l<mode>):
32045 (aarch64_sqdmlal<mode>): ... This...
32046 (aarch64_sqdmlsl<mode>): ... And this.
32047 (aarch64_sqdml<SBINQOPS:as>l_lane<mode>): Split into...
32048 (aarch64_sqdmlal_lane<mode>): ... This...
32049 (aarch64_sqdmlsl_lane<mode>): ... And this.
32050 (aarch64_sqdml<SBINQOPS:as>l_laneq<mode>): Split into...
32051 (aarch64_sqdmlsl_laneq<mode>): ... This...
32052 (aarch64_sqdmlal_laneq<mode>): ... And this.
32053 (aarch64_sqdml<SBINQOPS:as>l_n<mode>): Split into...
32054 (aarch64_sqdmlsl_n<mode>): ... This...
32055 (aarch64_sqdmlal_n<mode>): ... And this.
32056 (aarch64_sqdml<SBINQOPS:as>l2<mode>_internal): Split into...
32057 (aarch64_sqdmlal2<mode>_internal): ... This...
32058 (aarch64_sqdmlsl2<mode>_internal): ... And this.
32060 2021-01-21 Christophe Lyon <christophe.lyon@linaro.org>
32062 * config/arm/arm_mve.h (__arm_vcmpneq_s8): Fix return type.
32064 2021-01-21 Andrea Corallo <andrea.corallo@arm.com>
32067 * doc/sourcebuild.texi (arm_thumb2_no_arm_v8_1_lob): Document.
32069 2021-01-21 liuhongt <hongtao.liu@intel.com>
32071 PR rtl-optimization/98694
32072 * regcprop.c (copy_value): If SRC had been assigned a mode
32073 narrower than the copy, we can't link DEST into the chain even
32074 they have same hard_regno_nregs(i.e. HImode/SImode in i386
32077 2021-01-20 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32079 * config/aarch64/aarch64-simd.md (aarch64_get_lane<mode>):
32080 Convert to define_insn_and_split. Split into simple move when moving
32083 2021-01-20 Segher Boessenkool <segher@kernel.crashing.org>
32085 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_store): Change assert.
32086 Adjust comment. Simplify code.
32088 2021-01-20 Jakub Jelinek <jakub@redhat.com>
32091 * dwarf2out.c (reset_indirect_string): Also reset indirect strings
32092 with DW_FORM_line_strp form.
32093 (prune_unused_types_update_strings): Don't add into debug_str_hash
32094 indirect strings with DW_FORM_line_strp form.
32095 (adjust_name_comp_dir): New function.
32096 (dwarf2out_finish): Call it on CU DIEs after resetting
32097 debug_line_str_hash.
32099 2021-01-20 Vladimir N. Makarov <vmakarov@redhat.com>
32101 PR rtl-optimization/98722
32102 * lra-eliminations.c (eliminate_regs_in_insn): Check that target
32103 has no 3-op add insn to transform insns containing two pluses.
32105 2021-01-20 Richard Biener <rguenther@suse.de>
32107 * hwint.h (add_hwi): New function.
32108 (mul_hwi): Likewise.
32109 * tree-data-ref.c (initialize_matrix_A): Properly translate
32110 tree constants and avoid HOST_WIDE_INT_MIN.
32111 (lambda_matrix_row_add): Avoid undefined integer overflow
32112 and return true on such overflow.
32113 (lambda_matrix_right_hermite): Handle overflow from
32114 lambda_matrix_row_add gracefully. Simplify previous fix.
32115 (analyze_subscript_affine_affine): Likewise.
32117 2021-01-20 Eugene Rozenfeld <erozen@microsoft.com>
32119 PR tree-optimization/96674
32120 * match.pd: New patterns: x < y || y == XXX_MIN --> x <= y - 1
32121 x >= y && y != XXX_MIN --> x > y - 1
32123 2021-01-20 Richard Sandiford <richard.sandiford@arm.com>
32125 PR tree-optimization/98535
32126 * tree-vect-slp.c (duplicate_and_interleave): Use quick_grow_cleared.
32127 If the high and low permutes are the same, remove the high permutes
32128 from the working set and only continue with the low ones.
32130 2021-01-20 Jakub Jelinek <jakub@redhat.com>
32132 PR tree-optimization/98721
32133 * builtins.c (access_ref::inform_access): Don't assume
32134 SSA_NAME_IDENTIFIER must be non-NULL. Print messages about
32135 object whenever allocfn is NULL, rather than only when DECL_P
32136 is true. Use %qE instead of %qD for that. Formatting fixes.
32138 2021-01-20 Richard Biener <rguenther@suse.de>
32140 PR tree-optimization/98758
32141 * tree-data-ref.c (int_divides_p): Use lambda_int arguments.
32142 (lambda_matrix_right_hermite): Avoid undefinedness with
32143 signed integer abs and multiplication.
32144 (analyze_subscript_affine_affine): Use lambda_int.
32146 2021-01-20 David Malcolm <dmalcolm@redhat.com>
32149 * dwarf2out.c (output_line_info): Rename static variable
32150 "generation", moving it out of the function to...
32151 (output_line_info_generation): New.
32152 (init_sections_and_labels): Likewise, renaming the variable to...
32153 (init_sections_and_labels_generation): New.
32154 (dwarf2out_c_finalize): Reset the new variables.
32156 2021-01-19 Martin Sebor <msebor@redhat.com>
32158 PR middle-end/98664
32159 * tree-ssa-live.c (remove_unused_scope_block_p): Keep scopes for
32160 all functions, even if they're not declared artificial or inline.
32161 * tree.c (tree_inlined_location): Use macro expansion location
32162 only if scope traversal fails to expose one.
32164 2021-01-19 Richard Sandiford <richard.sandiford@arm.com>
32166 PR rtl-optimization/92294
32167 * alias.c (compare_base_symbol_refs): Take an extra parameter
32168 and add the distance between two symbols to it. Enshrine in
32169 comments that -1 means "either 0 or 1, but we can't tell
32170 which at compile time".
32171 (memrefs_conflict_p): Update call accordingly.
32172 (rtx_equal_for_memref_p): Likewise. Take the distance between symbols
32175 2021-01-19 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32177 * config/aarch64/aarch64-simd-builtins.def (sqshl, uqshl,
32178 sqrshl, uqrshl, sqadd, uqadd, sqsub, uqsub, suqadd, usqadd, sqmovn,
32179 uqmovn, sqxtn2, uqxtn2, sqabs, sqneg, sqdmlal, sqdmlsl, sqdmlal_lane,
32180 sqdmlsl_lane, sqdmlal_laneq, sqdmlsl_laneq, sqdmlal_n, sqdmlsl_n,
32181 sqdmlal2, sqdmlsl2, sqdmlal2_lane, sqdmlsl2_lane, sqdmlal2_laneq,
32182 sqdmlsl2_laneq, sqdmlal2_n, sqdmlsl2_n, sqdmull, sqdmull_lane,
32183 sqdmull_laneq, sqdmull_n, sqdmull2, sqdmull2_lane, sqdmull2_laneq,
32184 sqdmull2_n, sqdmulh, sqrdmulh, sqdmulh_lane, sqdmulh_laneq,
32185 sqrdmulh_lane, sqrdmulh_laneq, sqshrun_n, sqrshrun_n, sqshrn_n,
32186 uqshrn_n, sqrshrn_n, uqrshrn_n, sqshlu_n, sqshl_n, uqshl_n, sqrdmlah,
32187 sqrdmlsh, sqrdmlah_lane, sqrdmlsh_lane, sqrdmlah_laneq, sqrdmlsh_laneq,
32188 sqmovun): Use NONE flags.
32190 2021-01-19 Richard Biener <rguenther@suse.de>
32193 * ipa-modref.c (analyze_stmt): Only record a summary for a
32196 2021-01-19 Richard Biener <rguenther@suse.de>
32198 PR middle-end/98638
32199 * tree-ssanames.c (fini_ssanames): Zero SSA_NAME_DEF_STMT.
32201 2021-01-19 Daniel Hellstrom <daniel@gaisler.com>
32203 * config/sparc/rtemself.h (TARGET_OS_CPP_BUILTINS): Add
32204 built-in define __FIX_LEON3FT_TN0018.
32206 2021-01-19 Richard Biener <rguenther@suse.de>
32209 * tree-inline.c (tree_function_versioning): Set input_location
32210 to UNKNOWN_LOCATION throughout the function.
32212 2021-01-19 Tobias Burnus <tobias@codesourcery.com>
32215 * omp-low.c (lower_omp_target): Handle nonpointer is_device_ptr.
32217 2021-01-19 Martin Jambor <mjambor@suse.cz>
32220 * ipa-sra.c (ssa_name_only_returned_p): New parameter fun. Check
32221 whether non-call exceptions allow removal of a statement.
32222 (isra_analyze_call): Pass the appropriate function to
32223 ssa_name_only_returned_p.
32225 2021-01-19 Geng Qi <gengqi@linux.alibaba.com>
32227 * config/riscv/arch-canonicalize (longext_sort): New function for
32228 sorting 'multi-letter'.
32229 * config/riscv/multilib-generator: Adjusting the loop of 'alt' in
32230 'alts'. The 'arch' may not be the first of 'alts'.
32231 (_expand_combination): Add underline for the 'ext' without '*'.
32232 This is because, a single-letter extension can always be treated well
32233 with a '_' prefix, but it cannot be separated out if it is appended
32236 2021-01-18 Vladimir N. Makarov <vmakarov@redhat.com>
32239 * ira.c (ira): Skip abnormal critical edge splitting.
32241 2021-01-18 Jakub Jelinek <jakub@redhat.com>
32243 PR tree-optimization/98727
32244 * tree-ssa-math-opts.c (match_arith_overflow): Fix up computation of
32245 second .MUL_OVERFLOW operand for signed multiplication with overflow
32246 checking if the second operand of multiplication is not constant.
32248 2021-01-18 David Edelsohn <dje.gcc@gmail.com>
32250 * doc/invoke.texi (-gdwarf): TPF defaults to version 2 and AIX
32251 defaults to version 4.
32253 2021-01-18 David Malcolm <dmalcolm@redhat.com>
32255 * attribs.h (fndecl_dealloc_argno): New decl.
32256 * builtins.c (call_dealloc_argno): Split out second half of
32258 (fndecl_dealloc_argno): New.
32259 * doc/extend.texi (Common Function Attributes): Document the
32260 interaction between the analyzer and the malloc attribute.
32261 * doc/invoke.texi (Static Analyzer Options): Likewise.
32263 2021-01-17 David Edelsohn <dje.gcc@gmail.com>
32265 * config/rs6000/aix71.h (SUBTARGET_OVERRIDE_OPTIONS): Override
32266 dwarf_version to 4.
32267 * config/rs6000/aix72.h (SUBTARGET_OVERRIDE_OPTIONS): Same.
32269 2021-01-17 Martin Jambor <mjambor@suse.cz>
32272 * cgraph.c (clone_of_p): Check also former_clone_of as we climb
32275 2021-01-17 Mark Wielaard <mark@klomp.org>
32277 * common.opt (gdwarf-): Init(5).
32278 * doc/invoke.texi (-gdwarf): Document default to 5.
32280 2021-01-16 Kwok Cheung Yeung <kcy@codesourcery.com>
32282 * builtin-types.def
32283 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT): Rename
32285 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR):
32286 ...this. Add extra argument.
32287 * gimplify.c (omp_default_clause): Ensure that event handle is
32288 firstprivate in a task region.
32289 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_DETACH.
32290 (gimplify_adjust_omp_clauses): Likewise.
32291 * omp-builtins.def (BUILT_IN_GOMP_TASK): Change function type to
32292 BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR.
32293 * omp-expand.c (expand_task_call): Add GOMP_TASK_FLAG_DETACH to flags
32294 if detach clause specified. Add detach argument when generating
32296 * omp-low.c (scan_sharing_clauses): Setup data environment for detach
32298 (finish_taskreg_scan): Move field for variable containing the event
32299 handle to the front of the struct.
32300 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_DETACH. Fix
32302 * tree-nested.c (convert_nonlocal_omp_clauses): Handle
32303 OMP_CLAUSE_DETACH clause.
32304 (convert_local_omp_clauses): Handle OMP_CLAUSE_DETACH clause.
32305 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_DETACH.
32306 * tree.c (omp_clause_num_ops): Add entry for OMP_CLAUSE_DETACH.
32308 (omp_clause_code_name): Add entry for OMP_CLAUSE_DETACH. Fix
32310 (walk_tree_1): Handle OMP_CLAUSE_DETACH.
32312 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32314 * config/nios2/t-rtems: Reset all MULTILIB_* variables. Shorten
32315 multilib directory names. Use MULTILIB_REQUIRED instead of
32316 MULTILIB_EXCEPTIONS. Add -mhw-mul -mhw-mulx -mhw-div
32317 -mcustom-fpu-cfg=fph2 multilib.
32319 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32321 * config/nios2/nios2.c (NIOS2_FPU_CONFIG_NUM): Adjust value.
32322 (nios2_init_fpu_configs): Provide register values for new
32323 -mcustom-fpu-cfg=fph2 option variant.
32324 * doc/invoke.texi (-mcustom-fpu-cfg=fph2): Document new option
32327 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32329 * config/nios2/nios2.c (nios2_custom_check_insns): Remove
32330 custom instruction warnings.
32332 2021-01-16 Jakub Jelinek <jakub@redhat.com>
32334 PR tree-optimization/96669
32335 * match.pd ((CST << x) & 1 -> x == 0): New simplification.
32337 2021-01-16 Jakub Jelinek <jakub@redhat.com>
32339 PR tree-optimization/96271
32340 * passes.def: Pass false argument to first two pass_cd_dce
32341 instances and true to last instance. Add comment that
32342 last instance rewrites no longer addressed locals.
32343 * tree-ssa-dce.c (pass_cd_dce): Add update_address_taken_p member and
32345 (pass_cd_dce::set_pass_param): New method.
32346 (pass_cd_dce::execute): Return TODO_update_address_taken from
32347 last cd_dce instance.
32349 2021-01-15 Carl Love <cel@us.ibm.com>
32351 * config/rs6000/altivec.h (vec_mulh, vec_div, vec_dive, vec_mod):
32353 * config/rs6000/altivec.md (VIlong): Move define to file vsx.md.
32354 * config/rs6000/rs6000-builtin.def (DIVES_V4SI, DIVES_V2DI,
32355 DIVEU_V4SI, DIVEU_V2DI, DIVS_V4SI, DIVS_V2DI, DIVU_V4SI,
32356 DIVU_V2DI, MODS_V2DI, MODS_V4SI, MODU_V2DI, MODU_V4SI,
32357 MULHS_V2DI, MULHS_V4SI, MULHU_V2DI, MULHU_V4SI, MULLD_V2DI):
32358 Add builtin define.
32359 (MULH, DIVE, MOD): Add new BU_P10_OVERLOAD_2 definitions.
32360 * config/rs6000/rs6000-call.c (VSX_BUILTIN_VEC_DIV,
32361 VSX_BUILTIN_VEC_DIVE, P10_BUILTIN_VEC_MOD, P10_BUILTIN_VEC_MULH):
32362 New overloaded definitions.
32363 (builtin_function_type) [P10V_BUILTIN_DIVEU_V4SI,
32364 P10V_BUILTIN_DIVEU_V2DI, P10V_BUILTIN_DIVU_V4SI,
32365 P10V_BUILTIN_DIVU_V2DI, P10V_BUILTIN_MODU_V2DI,
32366 P10V_BUILTIN_MODU_V4SI, P10V_BUILTIN_MULHU_V2DI,
32367 P10V_BUILTIN_MULHU_V4SI]: Add case
32368 statement for builtins.
32369 * config/rs6000/rs6000.md (bits): Add new attribute sizes V4SI, V2DI.
32370 * config/rs6000/vsx.md (VIlong): Moved from config/rs6000/altivec.md.
32371 (UNSPEC_VDIVES, UNSPEC_VDIVEU): New unspec definitions.
32372 (vsx_mul_v2di): Add if TARGET_POWER10 statement.
32373 (vsx_udiv_v2di): Add if TARGET_POWER10 statement.
32374 (dives_<mode>, diveu_<mode>, div<mode>3, uvdiv<mode>3,
32375 mods_<mode>, modu_<mode>, mulhs_<mode>, mulhu_<mode>, mulv2di3):
32376 Add define_insn, mode is VIlong.
32377 * doc/extend.texi (vec_mulh, vec_mul, vec_div, vec_dive, vec_mod):
32378 Add builtin descriptions.
32380 2021-01-15 Eric Botcazou <ebotcazou@adacore.com>
32382 * final.c (final_start_function_1): Reset force_source_line.
32384 2021-01-15 Jakub Jelinek <jakub@redhat.com>
32386 PR tree-optimization/96669
32387 * match.pd (((1 << A) & 1) != 0 -> A == 0,
32388 ((1 << A) & 1) == 0 -> A != 0): Generalize for 1s replaced by
32389 possibly different power of two constants and to right shift too.
32391 2021-01-15 Jakub Jelinek <jakub@redhat.com>
32393 PR tree-optimization/96681
32394 * match.pd ((x < 0) ^ (y < 0) to (x ^ y) < 0): New simplification.
32395 ((x >= 0) ^ (y >= 0) to (x ^ y) < 0): Likewise.
32396 ((x < 0) ^ (y >= 0) to (x ^ y) >= 0): Likewise.
32397 ((x >= 0) ^ (y < 0) to (x ^ y) >= 0): Likewise.
32399 2021-01-15 Alexandre Oliva <oliva@adacore.com>
32401 * opts.c (gen_command_line_string): Exclude -dumpbase-ext.
32403 2021-01-15 Tamar Christina <tamar.christina@arm.com>
32405 * config/aarch64/aarch64-simd.md (cml<fcmac1><conj_op><mode>4,
32406 cmul<conj_op><mode>3): New.
32407 * config/aarch64/iterators.md (UNSPEC_FCMUL,
32408 UNSPEC_FCMUL180, UNSPEC_FCMLA_CONJ, UNSPEC_FCMLA180_CONJ,
32409 UNSPEC_CMLA_CONJ, UNSPEC_CMLA180_CONJ, UNSPEC_CMUL, UNSPEC_CMUL180,
32410 FCMLA_OP, FCMUL_OP, conj_op, rotsplit1, rotsplit2, fcmac1, sve_rot1,
32411 sve_rot2, SVE2_INT_CMLA_OP, SVE2_INT_CMUL_OP, SVE2_INT_CADD_OP): New.
32412 (rot): Add UNSPEC_FCMUL, UNSPEC_FCMUL180.
32413 (rot_op): Renamed to conj_op.
32414 * config/aarch64/aarch64-sve.md (cml<fcmac1><conj_op><mode>4,
32415 cmul<conj_op><mode>3): New.
32416 * config/aarch64/aarch64-sve2.md (cml<fcmac1><conj_op><mode>4,
32417 cmul<conj_op><mode>3): New.
32419 2021-01-15 David Malcolm <dmalcolm@redhat.com>
32423 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
32424 Escape the tempfile name when constructing the expected output.
32426 2021-01-15 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32428 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlsl_hi<mode>):
32430 (aarch64_<su>mlsl_hi<mode>): ... This.
32431 (aarch64_<su>mlsl_hi<mode>): Define.
32432 (*aarch64_<su>mlsl<mode): Rename to...
32433 (aarch64_<su>mlsl<mode): ... This.
32434 * config/aarch64/aarch64-simd-builtins.def (smlsl, umlsl,
32435 smlsl_hi, umlsl_hi): Define builtins.
32436 * config/aarch64/arm_neon.h (vmlsl_high_s8, vmlsl_high_s16,
32437 vmlsl_high_s32, vmlsl_high_u8, vmlsl_high_u16, vmlsl_high_u32,
32438 vmlsl_s8, vmlsl_s16, vmlsl_s32, vmlsl_u8,
32439 vmlsl_u16, vmlsl_u32): Reimplement with builtins.
32441 2021-01-15 Uroš Bizjak <ubizjak@gmail.com>
32443 * config/i386/i386-c.c (ix86_target_macros):
32444 Use cpp_define_formatted for __SIZEOF_FLOAT80__ definition.
32446 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32449 * config.gcc (aarch64*-*-*): Add aarch64-cc-fusion.o to extra_objs.
32450 * Makefile.in (RTL_SSA_H): New variable.
32451 * config/aarch64/t-aarch64 (aarch64-cc-fusion.o): New rule.
32452 * config/aarch64/aarch64-protos.h (make_pass_cc_fusion): Declare.
32453 * config/aarch64/aarch64-passes.def: Add pass_cc_fusion after
32455 * config/aarch64/aarch64-cc-fusion.cc: New file.
32457 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32459 * recog.h (insn_change_watermark::~insn_change_watermark): Avoid
32460 calling cancel_changes for changes that no longer exist.
32462 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32464 * rtl-ssa/functions.h (function_info::ref_defs): Rename to...
32465 (function_info::reg_defs): ...this.
32466 * rtl-ssa/member-fns.inl (function_info::ref_defs): Rename to...
32467 (function_info::reg_defs): ...this.
32469 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32472 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32474 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32477 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32480 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32482 2021-01-15 Richard Biener <rguenther@suse.de>
32484 PR tree-optimization/96376
32485 * tree-vect-stmts.c (get_load_store_type): Disregard alignment
32486 for VMAT_INVARIANT.
32488 2021-01-15 Martin Liska <mliska@suse.cz>
32490 * doc/install.texi: Document that some tests need pytest module.
32491 * doc/sourcebuild.texi: Likewise.
32493 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32496 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32498 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32500 * config/arm/mve.md (mve_vshrq_n_s<mode>_imm): New entry.
32501 (mve_vshrq_n_u<mode>_imm): Likewise.
32502 * config/arm/neon.md (vashr<mode>3, vlshr<mode>3): Move to ...
32503 * config/arm/vec-common.md: ... here.
32505 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32507 * config/arm/mve.md (mve_vshlq_<supf><mode>): Move to
32509 * config/arm/neon.md (vashl<mode>3): Delete.
32510 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): New.
32511 (vasl<mode>3): New expander.
32513 2021-01-15 Richard Biener <rguenther@suse.de>
32515 PR tree-optimization/98685
32516 * tree-vect-slp.c (vect_schedule_slp_node): Refactor handling
32517 of vector extern defs.
32519 2021-01-14 David Malcolm <dmalcolm@redhat.com>
32522 * diagnostic.c (diagnostic_kind_text): Break out this array
32524 (diagnostic_build_prefix): ...here.
32525 (fancy_abort): Detect when diagnostic_initialize has not yet been
32526 called and fall back to a minimal implementation of printing the
32527 ICE, rather than segfaulting in internal_error.
32529 2021-01-14 David Malcolm <dmalcolm@redhat.com>
32531 * diagnostic.c (diagnostic_initialize): Eliminate
32532 parseable_fixits_p in favor of initializing extra_output_kind from
32533 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
32534 (convert_column_unit): New function, split out from...
32535 (diagnostic_converted_column): ...this.
32536 (print_parseable_fixits): Add "column_unit" and "tabstop" params.
32537 Use them to call convert_column_unit on the column values.
32538 (diagnostic_report_diagnostic): Eliminate conditional on
32539 parseable_fixits_p in favor of a switch statement on
32540 extra_output_kind, passing the appropriate values to the new
32541 params of print_parseable_fixits.
32542 (selftest::test_print_parseable_fixits_none): Update for new
32543 params of print_parseable_fixits.
32544 (selftest::test_print_parseable_fixits_insert): Likewise.
32545 (selftest::test_print_parseable_fixits_remove): Likewise.
32546 (selftest::test_print_parseable_fixits_replace): Likewise.
32547 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
32549 (selftest::diagnostic_c_tests): Call it.
32550 * diagnostic.h (enum diagnostics_extra_output_kind): New.
32551 (diagnostic_context::parseable_fixits_p): Delete field in favor
32553 (diagnostic_context::extra_output_kind): ...this new field.
32554 * doc/invoke.texi (Environment Variables): Add
32555 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
32556 * opts.c (common_handle_option): Update handling of
32557 OPT_fdiagnostics_parseable_fixits for change to diagnostic_context
32560 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32562 * tree-vect-slp-patterns.c (class complex_operations_pattern,
32563 complex_operations_pattern::matches,
32564 complex_operations_pattern::recognize,
32565 complex_operations_pattern::build): New.
32566 (slp_patterns): Use it.
32568 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32570 * internal-fn.def (COMPLEX_FMS, COMPLEX_FMS_CONJ): New.
32571 * optabs.def (cmls_optab, cmls_conj_optab): New.
32572 * doc/md.texi: Document them.
32573 * tree-vect-slp-patterns.c (class complex_fms_pattern,
32574 complex_fms_pattern::matches, complex_fms_pattern::recognize,
32575 complex_fms_pattern::build): New.
32577 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32579 * internal-fn.def (COMPLEX_FMA, COMPLEX_FMA_CONJ): New.
32580 * optabs.def (cmla_optab, cmla_conj_optab): New.
32581 * doc/md.texi: Document them.
32582 * tree-vect-slp-patterns.c (vect_match_call_p,
32583 class complex_fma_pattern, vect_slp_reset_pattern,
32584 complex_fma_pattern::matches, complex_fma_pattern::recognize,
32585 complex_fma_pattern::build): New.
32587 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32589 * internal-fn.def (COMPLEX_MUL, COMPLEX_MUL_CONJ): New.
32590 * optabs.def (cmul_optab, cmul_conj_optab): New.
32591 * doc/md.texi: Document them.
32592 * tree-vect-slp-patterns.c (vect_match_call_complex_mla,
32593 vect_normalize_conj_loc, is_eq_or_top, vect_validate_multiplication,
32594 vect_build_combine_node, class complex_mul_pattern,
32595 complex_mul_pattern::matches, complex_mul_pattern::recognize,
32596 complex_mul_pattern::build): New.
32598 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32600 * tree-vect-slp.c (optimize_load_redistribution_1): New.
32601 (optimize_load_redistribution, vect_is_slp_load_node): New.
32602 (vect_match_slp_patterns): Use it.
32604 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32606 * tree-vect-slp-patterns.c (complex_add_pattern::build):
32609 2021-01-14 Thomas Schwinge <thomas@codesourcery.com>
32611 * config/gcn/mkoffload.c (main): Create an offload image only in
32612 64-bit configurations.
32614 2021-01-14 H.J. Lu <hjl.tools@gmail.com>
32617 * config/i386/i386-options.c (ix86_option_override_internal):
32618 Issue an error for -fcf-protection with CF_BRANCH when compiling
32619 for 32-bit non-TARGET_CMOV targets.
32621 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
32624 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p):
32625 Remove declaration and initialization of shadow variable "ret".
32626 (ix86_option_override_internal): Remove delcaration of
32627 shadow variable "i". Redeclare shadowed variable to unsigned.
32628 * common/config/i386/i386-common.c (pta_size): Redeclare to unsigned.
32629 * config/i386/i386-builtins.c (get_builtin_code_for_version):
32630 Update for redeclaration.
32631 * config/i386/i386.h (pta_size): Ditto.
32633 2021-01-14 Richard Biener <rguenther@suse.de>
32635 PR tree-optimization/98674
32636 * tree-data-ref.c (base_supports_access_fn_components_p): New.
32637 (initialize_data_dependence_relation): For two bases without
32638 possible access fns resort to type size equality when determining
32639 shape compatibility.
32641 2021-01-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
32644 * config/arm/arm_neon.h: Replace calls to __builtin_vcge* by
32645 <=, >= operators in vcle and vcge intrinsics respectively.
32646 * config/arm/arm_neon_builtins.def: Remove entry for
32649 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
32652 * config/i386/i386-options.c (ix86_function_specific_save):
32653 Remove redundant assignment to opts->x_ix86_branch_cost.
32654 * config/i386/i386.c (ix86_prefetch_sse):
32655 Rename from x86_prefetch_sse. Update all uses.
32656 * config/i386/i386.h: Update for rename.
32657 * config/i386/i386-options.h: Ditto.
32659 2021-01-14 Jakub Jelinek <jakub@redhat.com>
32662 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3,
32663 *sse4_1_zero_extendv4hiv4si2_3, *sse4_1_zero_extendv2siv2di2_3):
32664 Use Bm instead of m for non-avx. Add isa attribute.
32666 2021-01-14 Jakub Jelinek <jakub@redhat.com>
32668 PR tree-optimization/96688
32669 * match.pd (~(X >> Y) -> ~X >> Y): New simplification if
32670 ~X can be simplified.
32672 2021-01-14 Richard Sandiford <richard.sandiford@arm.com>
32674 * tree-vect-stmts.c (vect_model_load_cost): Account for unused
32675 IFN_LOAD_LANES results.
32677 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32679 * config/aarch64/aarch64-simd.md (aarch64_<su>xtl<mode>):
32681 (aarch64_xtn<mode>): Likewise.
32682 * config/aarch64/aarch64-simd-builtins.def (sxtl, uxtl, xtn):
32685 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
32687 (vmovl_s16): Likewise.
32688 (vmovl_s32): Likewise.
32689 (vmovl_u8): Likewise.
32690 (vmovl_u16): Likewise.
32691 (vmovl_u32): Likewise.
32692 (vmovn_s16): Likewise.
32693 (vmovn_s32): Likewise.
32694 (vmovn_s64): Likewise.
32695 (vmovn_u16): Likewise.
32696 (vmovn_u32): Likewise.
32697 (vmovn_u64): Likewise.
32699 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32701 * config/aarch64/aarch64-simd.md (aarch64_<su>qxtn2<mode>_le):
32703 (aarch64_<su>qxtn2<mode>_be): Likewise.
32704 (aarch64_<su>qxtn2<mode>): Likewise.
32705 * config/aarch64/aarch64-simd-builtins.def (sqxtn2, uqxtn2):
32707 * config/aarch64/iterators.md (SAT_TRUNC): Define code_iterator.
32708 (su): Handle ss_truncate and us_truncate.
32709 * config/aarch64/arm_neon.h (vqmovn_high_s16): Reimplement using
32711 (vqmovn_high_s32): Likewise.
32712 (vqmovn_high_s64): Likewise.
32713 (vqmovn_high_u16): Likewise.
32714 (vqmovn_high_u32): Likewise.
32715 (vqmovn_high_u64): Likewise.
32717 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32719 * config/aarch64/aarch64-simd.md (aarch64_xtn2<mode>_le):
32721 (aarch64_xtn2<mode>_be): Likewise.
32722 (aarch64_xtn2<mode>): Likewise.
32723 * config/aarch64/aarch64-simd-builtins.def (xtn2): Define
32725 * config/aarch64/arm_neon.h (vmovn_high_s16): Reimplement using
32727 (vmovn_high_s32): Likewise.
32728 (vmovn_high_s64): Likewise.
32729 (vmovn_high_u16): Likewise.
32730 (vmovn_high_u32): Likewise.
32731 (vmovn_high_u64): Likewise.
32733 2021-01-13 Stafford Horne <shorne@gmail.com>
32735 * config/or1k/or1k.h (ASM_PREFERRED_EH_DATA_FORMAT): New macro.
32737 2021-01-13 Stafford Horne <shorne@gmail.com>
32739 * config/or1k/linux.h (TARGET_ASM_FILE_END): Define macro.
32741 2021-01-13 Stafford Horne <shorne@gmail.com>
32743 * config/or1k/or1k.h (TARGET_CPU_CPP_BUILTINS): Add builtin
32744 define for __or1k_hard_float__.
32746 2021-01-13 Stafford Horne <shorne@gmail.com>
32748 * config/or1k/or1k.h (NO_PROFILE_COUNTERS): Define as 1.
32749 (PROFILE_HOOK): Define to call _mcount.
32750 (FUNCTION_PROFILER): Change from abort to no-op.
32752 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32754 PR tree-optimization/96691
32755 * match.pd ((~X | C) ^ D -> (X | C) ^ (~D ^ C),
32756 (~X & C) ^ D -> (X & C) ^ (D ^ C)): New simplifications if
32757 (~D ^ C) or (D ^ C) can be simplified.
32759 2021-01-13 Richard Biener <rguenther@suse.de>
32761 PR tree-optimization/92645
32762 * match.pd (BIT_FIELD_REF to conversion): Delay canonicalization
32763 until after vector lowering.
32765 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32767 * config/aarch64/aarch64-sve.md (fnma<mode>4): Extend from SVE_FULL_I
32769 (@aarch64_pred_fnma<mode>, cond_fnma<mode>, *cond_fnma<mode>_2)
32770 (*cond_fnma<mode>_4, *cond_fnma<mode>_any): Likewise.
32772 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32774 * config/aarch64/aarch64-sve.md (fma<mode>4): Extend from SVE_FULL_I
32776 (@aarch64_pred_fma<mode>, cond_fma<mode>, *cond_fma<mode>_2)
32777 (*cond_fma<mode>_4, *cond_fma<mode>_any): Likewise.
32779 2021-01-13 Richard Biener <rguenther@suse.de>
32781 PR tree-optimization/92645
32782 * tree-vect-slp.c (vect_build_slp_tree_1): Relax supported
32783 BIT_FIELD_REF argument.
32784 (vect_build_slp_tree_2): Record the desired vector type
32785 on the external vector def.
32786 (vectorizable_slp_permutation): Handle required punning
32787 of existing vector defs.
32789 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32791 * rtl-ssa/accesses.h (def_lookup): Fix order of comparison results.
32793 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32795 * config/sh/sh.md (movsf_ie): Remove operands[2] test.
32797 2021-01-13 Samuel Thibault <samuel.thibault@ens-lyon.org>
32799 * config.gcc [$target == *-*-gnu*]: Enable
32800 'default_gnu_indirect_function'.
32802 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32805 * optabs.c (expand_vec_perm_const): Don't force v0 and v1 into
32806 registers before calling targetm.vectorize.vec_perm_const, only after
32808 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const): Handle
32809 two argument permutation when one operand is zero vector and only
32810 after that force operands into registers.
32811 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_1): New
32812 define_insn_and_split pattern.
32813 (*avx512bw_zero_extendv32qiv32hi2_1): Likewise.
32814 (*avx512f_zero_extendv16hiv16si2_1): Likewise.
32815 (*avx2_zero_extendv8hiv8si2_1): Likewise.
32816 (*avx512f_zero_extendv8siv8di2_1): Likewise.
32817 (*avx2_zero_extendv4siv4di2_1): Likewise.
32818 * config/mips/mips.c (mips_vectorize_vec_perm_const): Force operands
32820 * config/arm/arm.c (arm_vectorize_vec_perm_const): Likewise.
32821 * config/sparc/sparc.c (sparc_vectorize_vec_perm_const): Likewise.
32822 * config/ia64/ia64.c (ia64_vectorize_vec_perm_const): Likewise.
32823 * config/aarch64/aarch64.c (aarch64_vectorize_vec_perm_const): Likewise.
32824 * config/rs6000/rs6000.c (rs6000_vectorize_vec_perm_const): Likewise.
32825 * config/gcn/gcn.c (gcn_vectorize_vec_perm_const): Likewise. Use std::swap.
32827 2021-01-13 Martin Liska <mliska@suse.cz>
32829 PR tree-optimization/98455
32830 * gimple-if-to-switch.cc (condition_info::record_phi_mapping):
32831 Record also virtual PHIs.
32832 (pass_if_to_switch::execute): Return TODO_cleanup_cfg only
32835 2021-01-13 Jonathan Wakely <jwakely@redhat.com>
32837 * doc/invoke.texi (C++ Modules): Fix typos.
32839 2021-01-13 Richard Biener <rguenther@suse.de>
32841 PR tree-optimization/98640
32842 * tree-ssa-sccvn.c (visit_nary_op): Do not try to
32843 handle plus or minus from a truncated operand to be
32846 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32849 * config/i386/i386.md (*btr<mode>_1, *btr<mode>_2): New
32850 define_insn_and_split patterns.
32851 (splitter after *btr<mode>_2): New splitter.
32853 2021-01-13 Martin Liska <mliska@suse.cz>
32856 * cgraphunit.c (analyze_functions): Remove dead code.
32858 2021-01-13 Qian Jianhua <qianjh@cn.fujitsu.com>
32860 * config/aarch64/aarch64-cost-tables.h (a64fx_extra_costs): New.
32861 * config/aarch64/aarch64.c (a64fx_addrcost_table): New.
32862 (a64fx_regmove_cost, a64fx_vector_cost): New.
32863 (a64fx_tunings): Use the new added cost tables.
32865 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32868 * config/i386/predicates.md (pmovzx_parallel): New predicate.
32869 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): New
32870 define_insn_and_split pattern.
32871 (*sse4_1_zero_extendv4hiv4si2_3): Likewise.
32872 (*sse4_1_zero_extendv2siv2di2_3): Likewise.
32874 2021-01-13 Julian Brown <julian@codesourcery.com>
32876 * config/gcn/gcn.c (gcn_conditional_register_usage): Remove dead code
32877 to fix v0 register.
32879 2021-01-13 Julian Brown <julian@codesourcery.com>
32881 * config/gcn/gcn.c (gcn_md_reorg): Fix case where EXEC reg is live
32884 2021-01-13 Julian Brown <julian@codesourcery.com>
32886 * config/gcn/gcn-valu.md (recip<mode>2<exec>, recip<mode>2): Use unspec
32887 for reciprocal-approximation instructions.
32888 (div<mode>3): Use fused multiply-accumulate operations for reciprocal
32889 refinement and division result.
32890 * config/gcn/gcn.md (UNSPEC_RCP): New unspec constant.
32892 2021-01-13 Julian Brown <julian@codesourcery.com>
32894 * config/gcn/gcn-valu.md (subdf): Rename to...
32897 2021-01-12 Martin Liska <mliska@suse.cz>
32899 * gcov.c (source_info::debug): Fix printf format for 32-bit hosts.
32901 2021-01-12 Andrea Corallo <andrea.corallo@arm.com>
32903 * function-abi.h: Fix typo.
32905 2021-01-12 Christophe Lyon <christophe.lyon@linaro.org>
32909 * config/arm/arm.h (ARM_HAVE_NEON_V8QI_LDST): New macro.
32910 (ARM_HAVE_NEON_V16QI_LDST, ARM_HAVE_NEON_V4HI_LDST): Likewise.
32911 (ARM_HAVE_NEON_V8HI_LDST, ARM_HAVE_NEON_V2SI_LDST): Likewise.
32912 (ARM_HAVE_NEON_V4SI_LDST, ARM_HAVE_NEON_V4HF_LDST): Likewise.
32913 (ARM_HAVE_NEON_V8HF_LDST, ARM_HAVE_NEON_V4BF_LDST): Likewise.
32914 (ARM_HAVE_NEON_V8BF_LDST, ARM_HAVE_NEON_V2SF_LDST): Likewise.
32915 (ARM_HAVE_NEON_V4SF_LDST, ARM_HAVE_NEON_DI_LDST): Likewise.
32916 (ARM_HAVE_NEON_V2DI_LDST): Likewise.
32917 (ARM_HAVE_V8QI_LDST, ARM_HAVE_V16QI_LDST): Likewise.
32918 (ARM_HAVE_V4HI_LDST, ARM_HAVE_V8HI_LDST): Likewise.
32919 (ARM_HAVE_V2SI_LDST, ARM_HAVE_V4SI_LDST, ARM_HAVE_V4HF_LDST): Likewise.
32920 (ARM_HAVE_V8HF_LDST, ARM_HAVE_V4BF_LDST, ARM_HAVE_V8BF_LDST): Likewise.
32921 (ARM_HAVE_V2SF_LDST, ARM_HAVE_V4SF_LDST, ARM_HAVE_DI_LDST): Likewise.
32922 (ARM_HAVE_V2DI_LDST): Likewise.
32923 * config/arm/mve.md (*movmisalign<mode>_mve_store): New pattern.
32924 (*movmisalign<mode>_mve_load): New pattern.
32925 * config/arm/neon.md (movmisalign<mode>): Move to ...
32926 * config/arm/vec-common.md: ... here.
32928 2021-01-12 Vladimir N. Makarov <vmakarov@redhat.com>
32931 * lra-eliminations.c (eliminate_regs_in_insn): Add transformation
32932 of pattern 'plus (plus (hard reg, const), pseudo)'.
32934 2021-01-12 Richard Biener <rguenther@suse.de>
32936 PR tree-optimization/98550
32937 * tree-vect-slp.c (vect_record_max_nunits): Check whether
32938 the group size is a multiple of the vector element count.
32939 (vect_build_slp_tree_1): When we need to fail because
32940 the vector type choosen causes unrolling do so lazily
32941 without affecting matches only at the end to guide group splitting.
32943 2021-01-12 Martin Liska <mliska@suse.cz>
32946 * optc-save-gen.awk: Compare also n_target_save vars with
32949 2021-01-12 Martin Liska <mliska@suse.cz>
32951 * gcov.c (source_info::debug): New.
32952 (print_usage): Add --debug (-D) option.
32953 (process_args): Likewise.
32954 (generate_results): Call src->debug after
32955 accumulate_line_counts.
32956 (read_graph_file): Properly assign id for EXIT_BLOCK.
32957 * profile.c (branch_prob): Dump function body before it is
32960 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32962 PR tree-optimization/98629
32963 * tree-ssa-math-opts.c (arith_overflow_check_p): Don't update use_stmt
32964 unless returning non-zero.
32966 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32968 PR tree-optimization/95731
32969 * tree-ssa-reassoc.c (optimize_range_tests_cmp_bitwise): Also optimize
32970 x < 0 && y < 0 && z < 0 into (x | y | z) < 0 for signed x, y, z.
32971 (optimize_range_tests): Call optimize_range_tests_cmp_bitwise
32972 only after optimize_range_tests_var_bound.
32974 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32976 * configure.ac: Ensure c/Make-lang.in comes first in @all_lang_makefrags@.
32977 * configure: Regenerated.
32979 2021-01-12 liuhongt <hongtao.liu@intel.com>
32982 * config/i386/i386-builtins.h (BUILTIN_DESC_SWAP_OPERANDS):
32984 * config/i386/i386-expand.c (ix86_expand_sse_comi): Delete
32987 2021-01-12 Alexandre Oliva <oliva@adacore.com>
32989 * ssa-iterators.h (end_imm_use_stmt_traverse): Forward
32991 (auto_end_imm_use_stmt_traverse): New struct.
32992 (FOR_EACH_IMM_USE_STMT): Use it.
32993 (BREAK_FROM_IMM_USE_STMT, RETURN_FROM_IMM_USE_STMT): Remove,
32995 * gimple-ssa-strength-reduction.c: ... here, ...
32996 * graphite-scop-detection.c: ... here, ...
32997 * ipa-modref.c, ipa-pure-const.c, ipa-sra.c: ... here, ...
32998 * tree-predcom.c, tree-ssa-ccp.c: ... here, ...
32999 * tree-ssa-dce.c, tree-ssa-dse.c: ... here, ...
33000 * tree-ssa-loop-ivopts.c, tree-ssa-math-opts.c: ... here, ...
33001 * tree-ssa-phiprop.c, tree-ssa.c: ... here, ...
33002 * tree-vect-slp.c: ... and here, ...
33003 * doc/tree-ssa.texi: ... and the example here.
33005 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33007 * config/aarch64/aarch64-sve.md (sdiv_pow2<mode>3): Extend from
33008 SVE_FULL_I to SVE_I. Generate an UNSPEC_PRED_X.
33009 (*sdiv_pow2<mode>3): New pattern.
33010 (@cond_<sve_int_op><mode>): Extend from SVE_FULL_I to SVE_I.
33011 Wrap the ASRD in an UNSPEC_PRED_X.
33012 (*cond_<sve_int_op><mode>_2): Likewise. Replace the UNSPEC_PRED_X
33013 predicate with a constant PTRUE, if it isn't already.
33014 (*cond_<sve_int_op><mode>_z): Replace with...
33015 (*cond_<sve_int_op><mode>_any): ...this new pattern.
33017 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33019 * config/aarch64/aarch64-sve.md (*cond_bic<mode>_2): Extend from
33020 SVE_FULL_I to SVE_I.
33021 (*cond_bic<mode>_any): Likewise.
33023 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33025 * config/aarch64/aarch64-sve.md (<su>mul<mode>3_highpart)
33026 (@aarch64_pred_<MUL_HIGHPART:optab><mode>): Extend from SVE_FULL_I
33029 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33031 * config/aarch64/aarch64-sve.md (<su>abd<mode>_3): Extend from
33032 SVE_FULL_I to SVE_I.
33033 (*aarch64_cond_<su>abd<mode>_2): Likewise.
33034 (*aarch64_cond_<su>abd<mode>_any): Likewise.
33035 (@aarch64_pred_<su>abd<mode>): Likewise. Use UNSPEC_PRED_X
33036 for the max and min but not for the minus.
33037 (*aarch64_cond_<su>abd<mode>_3): New pattern.
33039 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33041 * config/aarch64/iterators.md (SVE_24I): New iterator.
33042 * config/aarch64/aarch64-sve.md (*aarch64_adr<mode>_shift): Extend from
33043 SVE_FULL_SDI to SVE_24I. Use containers rather than elements.
33045 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33047 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_BINARY:optab><mode>)
33048 (*cond_<SVE_INT_BINARY:optab><mode>_2): Extend from SVE_FULL_I
33050 (*cond_<SVE_INT_BINARY:optab><mode>_3): Likewise.
33051 (*cond_<SVE_INT_BINARY:optab><mode>_any): Likewise.
33052 (*cond_<SVE_INT_BINARY:optab><mode>_2_const): Likewise.
33053 (*cond_<SVE_INT_BINARY:optab><mode>_any_const): Likewise.
33055 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33057 * config/aarch64/aarch64-sve.md (<SVE_INT_BINARY_IMM:optab><mode>3)
33058 (@aarch64_pred_<SVE_INT_BINARY_IMM:optab><mode>)
33059 (*post_ra_<SVE_INT_BINARY_IMM:optab><mode>3): Extend from SVE_FULL_I
33062 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
33064 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3)
33065 (v<ASHIFT:optab><mode>3, @aarch64_pred_<optab><mode>)
33066 (*post_ra_v<ASHIFT:optab><mode>3): Extend from SVE_FULL_I to SVE_I.
33068 2021-01-11 Martin Liska <mliska@suse.cz>
33071 * symtab-clones.h (clone_info::release): Release
33072 symtab::m_clones with ggc_delete as it's a GGC memory.
33074 2021-01-11 Matthias Klose <doko@ubuntu.com>
33076 * Makefile.in (LINK_PROGRESS): Show the link target.
33078 2021-01-11 Richard Biener <rguenther@suse.de>
33080 PR tree-optimization/91403
33081 * tree-vect-data-refs.c (vect_analyze_group_access_1): Cap
33082 single-element interleaving group size at 4096 elements.
33084 2021-01-11 Richard Biener <rguenther@suse.de>
33086 PR tree-optimization/98526
33087 * tree-vect-loop.c (vect_model_reduction_cost): Remove costing
33088 of the actual reduction op for the regular case.
33089 (vectorizable_reduction): Cost the stmts
33090 vect_transform_reduction produces here.
33092 2021-01-11 Andreas Krebbel <krebbel@linux.ibm.com>
33094 * tree-ssa-forwprop.c (simplify_vector_constructor): For
33095 big-endian, use UNPACK[_FLOAT]_HI.
33097 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33099 * tree-vect-slp-patterns.c (class complex_pattern,
33100 class complex_add_pattern): Add parameters to matches.
33101 (complex_add_pattern::build): Free memory.
33102 (complex_add_pattern::matches): Move validation end of match.
33103 (complex_add_pattern::recognize): Likewise.
33105 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33107 * tree-vect-slp-patterns.c (linear_loads_p): Fix externals.
33109 2021-01-11 Tamar Christina <tamar.christina@arm.com>
33111 * tree-vect-slp-patterns.c (is_linear_load_p): Fix ambiguity.
33113 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33115 PR tree-optimization/95867
33116 * tree-ssa-math-opts.h: New header.
33117 * tree-ssa-math-opts.c: Include tree-ssa-math-opts.h.
33118 (powi_as_mults): No longer static. Use build_one_cst instead of
33119 build_real. Formatting fix.
33120 * tree-ssa-reassoc.c: Include tree-ssa-math-opts.h.
33121 (attempt_builtin_powi): Handle multiplication reassociation without
33122 powi_fndecl using powi_as_mults.
33123 (reassociate_bb): For integral types don't require
33124 -funsafe-math-optimizations to call attempt_builtin_powi.
33126 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33128 PR tree-optimization/95852
33129 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): Change
33130 mul_stmts parameter type to vec<gimple *> &. Before cond_stmt
33131 allow in the bb any of the stmts in that vector, div_stmt and
33132 up to 3 cast stmts.
33133 (arith_cast_equal_p): New function.
33134 (arith_overflow_check_p): Add cast_stmt argument, handle signed
33135 multiply overflow checks.
33136 (match_arith_overflow): Adjust caller. Handle signed multiply
33139 2021-01-11 Jakub Jelinek <jakub@redhat.com>
33141 PR tree-optimization/95852
33142 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): New function.
33143 (uaddsub_overflow_check_p): Renamed to ...
33144 (arith_overflow_check_p): ... this. Handle also multiplication
33145 with overflow check.
33146 (match_uaddsub_overflow): Renamed to ...
33147 (match_arith_overflow): ... this. Add cfg_changed argument. Handle
33148 also multiplication with overflow check. Adjust function comment.
33149 (math_opts_dom_walker::after_dom_children): Adjust callers. Call
33150 match_arith_overflow also for MULT_EXPR.
33152 2021-01-11 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33154 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
33155 __builtin_convertvector.
33156 (vmovl_s16): Likewise.
33157 (vmovl_s32): Likewise.
33158 (vmovl_u8): Likewise.
33159 (vmovl_u16): Likewise.
33160 (vmovl_u32): Likewise.
33161 (vmovn_s16): Likewise.
33162 (vmovn_s32): Likewise.
33163 (vmovn_s64): Likewise.
33164 (vmovn_u16): Likewise.
33165 (vmovn_u32): Likewise.
33166 (vmovn_u64): Likewise.
33168 2021-01-11 Martin Liska <mliska@suse.cz>
33170 * gimple-if-to-switch.cc (struct condition_info): Use auto_var.
33171 (if_chain::is_beneficial): Delete clusters
33172 (find_conditions): Make second argument of conditions_in_bbs a
33173 pointer so that we control over it's lifetime.
33174 (pass_if_to_switch::execute): Delete them.
33176 2021-01-11 Kewen Lin <linkw@linux.ibm.com>
33178 * ira.c (move_unallocated_pseudos): Check other_reg and skip if
33181 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33183 * config/vax/vax.md (cc): Remove mode attribute.
33184 (subst_<cc>, subst_f<cc>): Rename to...
33185 (subst_<mode>, subst_f<VAXccnz:mode>): ... these respectively.
33186 (*cbranch<VAXint:mode>4_<VAXcc:mode>): Update for `cc' removal.
33187 (*cbranch<VAXfp:mode>4_<VAXccnz:mode>): Likewise.
33188 (*branch_<mode>, *branch_<mode>_reversed): Likewise.
33190 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33192 * config/vax/vax.md (subst_f<cc>): Add mode to operands and
33193 `const_double_zero'.
33195 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33197 * config/pdp11/pdp11.md (PDPfp): New mode iterator.
33198 (fcc_cc, fcc_ccnz): Use it. Add mode to `const_double_zero' and
33201 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33203 * genemit.c (gen_exp) <CONST_DOUBLE>: Handle `const_double_zero'
33205 * read-rtl.c (rtx_reader::read_rtx_code): Handle machine mode
33206 with `const_double_zero'.
33207 * doc/rtl.texi (Constant Expression Types): Document it.
33209 2021-01-09 Jakub Jelinek <jakub@redhat.com>
33212 * tree-cfg.c (verify_gimple_assign_binary): Allow lhs of
33213 POINTER_DIFF_EXPR to be any integral type.
33215 2021-01-09 Jakub Jelinek <jakub@redhat.com>
33217 PR rtl-optimization/98603
33218 * function.c (instantiate_virtual_regs_in_insn): For asm goto
33219 with impossible constraints, drop all SETs, CLOBBERs, drop PARALLEL
33220 if any, set ASM_OPERANDS mode to VOIDmode and change
33221 ASM_OPERANDS_OUTPUT_CONSTRAINT and ASM_OPERANDS_OUTPUT_IDX.
33223 2021-01-09 Alexandre Oliva <oliva@gnu.org>
33226 * final.c (notice_source_line): Narrow down the condition to
33227 skip a line-0 marker.
33229 2021-01-08 Sergei Trofimovich <siarheit@google.com>
33231 * ipa-modref.c (merge_call_side_effects): Fix
33232 linebreak split by reordering two print calls.
33234 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
33236 * config/s390/vector.md (*tf_to_fprx2_0): Rename from
33237 "*mov_tf_to_fprx2_0" for consistency, fix constraint.
33238 (*tf_to_fprx2_1): Rename from "*mov_tf_to_fprx2_1" for
33239 consistency, fix constraint.
33241 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
33243 * config/s390/s390-c.c (s390_def_or_undef_macro): Accept
33244 callables instead of mask values.
33245 (struct target_flag_set_p): New predicate.
33246 (s390_cpu_cpp_builtins_internal): Define or undefine
33247 __LONG_DOUBLE_VX__ macro.
33249 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
33252 * config/i386/i386.c (x86_function_profiler): Use R10 and R11
33253 to call mcount in large model with PIC for NO_PROFILE_COUNTERS
33256 2021-01-08 Richard Biener <rguenther@suse.de>
33258 * tree-ssa-sccvn.c (pass_fre::execute): Reset the SCEV hash table.
33260 2021-01-08 Richard Biener <rguenther@suse.de>
33262 * tree-vect-slp.c (scalar_stmts_to_slp_tree_map_t): Fix.
33263 (vect_build_slp_tree): On cache hit release the matched
33264 scalar stmts vector.
33265 * tree-vect-stmts.c (vectorizable_store): Properly free
33266 vec_oprnds before possibly gathering them again.
33268 2021-01-08 Richard Biener <rguenther@suse.de>
33270 PR tree-optimization/98544
33271 * tree-vect-slp.c (vect_optimize_slp): Always materialize
33272 permutes at a permute node.
33274 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
33277 * config/i386/i386.c (x86_function_profiler): Use R10 to call
33278 mcount in large model. Sorry for large model with PIC.
33280 2021-01-08 Jakub Jelinek <jakub@redhat.com>
33283 * config/i386/i386.opt (ix86_cmodel, ix86_incoming_stack_boundary_arg,
33284 ix86_pmode, ix86_preferred_stack_boundary_arg, ix86_regparm,
33285 ix86_veclibabi_type): Remove x_ prefix, use TargetVariable instead of
33286 TargetSave and initialize for variables with enum types.
33287 (mfentry, mstack-protector-guard-reg=, mstack-protector-guard-offset=,
33288 mstack-protector-guard-symbol=): Add Save.
33289 * config/i386/i386-options.c (ix86_function_specific_save,
33290 ix86_function_specific_restore): Don't save or restore x_ix86_cmodel,
33291 x_ix86_incoming_stack_boundary_arg, x_ix86_pmode,
33292 x_ix86_preferred_stack_boundary_arg, x_ix86_regparm,
33293 x_ix86_veclibabi_type.
33295 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
33297 * config/aarch64/aarch64-sve.md (*cnot<mode>): Extend from
33298 SVE_FULL_I to SVE_I.
33299 (*cond_cnot<mode>_2, *cond_cnot<mode>_any): Likewise.
33301 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
33303 * config/aarch64/aarch64-sve.md (*cond_uxt<mode>_2): Extend from
33304 SVE_FULL_I to SVE_I.
33305 (*cond_uxt<mode>_any): Likewise.
33307 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33309 * config/aarch64/iterators.md (Vwhalf): New iterator.
33310 * config/aarch64/aarch64-simd.md (aarch64_<sur>adalp<mode>_3):
33312 (aarch64_<sur>adalp<mode>): ... This. Make more
33314 (<sur>sadv16qi): Adjust callsite of the above.
33315 * config/aarch64/aarch64-simd-builtins.def (sadalp, uadalp): New
33317 * config/aarch64/arm_neon.h (vpadal_s8): Reimplement using
33319 (vpadal_s16): Likewise.
33320 (vpadal_u8): Likewise.
33321 (vpadal_u16): Likewise.
33322 (vpadalq_s8): Likewise.
33323 (vpadalq_s16): Likewise.
33324 (vpadalq_s32): Likewise.
33325 (vpadalq_u8): Likewise.
33326 (vpadalq_u16): Likewise.
33327 (vpadalq_u32): Likewise.
33329 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33331 * config/aarch64/aarch64-simd.md (aarch64_<su>abd<mode>_3):
33333 (aarch64_<su>abd<mode>): ... This.
33334 (<sur>sadv16qi): Adjust callsite of the above.
33335 * config/aarch64/aarch64-simd-builtins.def (sabd, uabd): Define
33337 * config/aarch64/arm_neon.h (vabd_s8): Reimplement using
33339 (vabd_s16): Likewise.
33340 (vabd_s32): Likewise.
33341 (vabd_u8): Likewise.
33342 (vabd_u16): Likewise.
33343 (vabd_u32): Likewise.
33344 (vabdq_s8): Likewise.
33345 (vabdq_s16): Likewise.
33346 (vabdq_s32): Likewise.
33347 (vabdq_u8): Likewise.
33348 (vabdq_u16): Likewise.
33349 (vabdq_u32): Likewise.
33351 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33353 * config/aarch64/aarch64-simd-builtins.def (saba, uaba): Define
33355 * config/aarch64/arm_neon.h (vaba_s8): Implement using builtin.
33356 (vaba_s16): Likewise.
33357 (vaba_s32): Likewise.
33358 (vaba_u8): Likewise.
33359 (vaba_u16): Likewise.
33360 (vaba_u32): Likewise.
33361 (vabaq_s8): Likewise.
33362 (vabaq_s16): Likewise.
33363 (vabaq_s32): Likewise.
33364 (vabaq_u8): Likewise.
33365 (vabaq_u16): Likewise.
33366 (vabaq_u32): Likewise.
33368 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33370 * config/aarch64/aarch64-simd.md (aba<mode>_3): Rename to...
33371 (aarch64_<su>aba<mode>): ... This. Handle uaba as well.
33372 Change RTL pattern to match.
33374 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
33376 * common/config/riscv/riscv-common.c (riscv_current_subset_list): New.
33377 * config/riscv/riscv-c.c (riscv-subset.h): New.
33378 (INCLUDE_STRING): Define.
33379 (riscv_cpu_cpp_builtins): Add new style architecture extension
33381 * config/riscv/riscv-subset.h (riscv_subset_list::begin): New.
33382 (riscv_subset_list::end): New.
33383 (riscv_current_subset_list): New.
33385 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
33387 * common/config/riscv/riscv-common.c (RISCV_DONT_CARE_VERSION):
33388 Move to riscv-subset.h.
33389 (struct riscv_subset_t): Ditto.
33390 (class riscv_subset_list): Ditto.
33391 * config/riscv/riscv-subset.h (RISCV_DONT_CARE_VERSION): Move
33392 from riscv-common.c.
33393 (struct riscv_subset_t): Ditto.
33394 (class riscv_subset_list): Ditto.
33395 * config/riscv/t-riscv ($(common_out_file)): Add file
33398 2021-01-07 Jakub Jelinek <jakub@redhat.com>
33401 * config/i386/i386.md (*bmi_blsi_<mode>_cmp, *bmi_blsi_<mode>_ccno):
33402 New define_insn patterns.
33404 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33406 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_UNARY:optab><mode>)
33407 (*cond_<SVE_INT_UNARY:optab><mode>_2): Extend from SVE_FULL_I to SVE_I.
33408 (*cond_<SVE_INT_UNARY:optab><mode>_any): Likewise.
33410 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33412 PR tree-optimization/98560
33413 * internal-fn.def (IFN_VCONDU, IFN_VCONDEQ): Use type vec_cond.
33414 * internal-fn.c (vec_cond_mask_direct): Get the data mode from
33416 (vec_cond_direct): Likewise argument 2.
33417 (vec_condu_direct, vec_condeq_direct): Delete.
33418 (expand_vect_cond_optab_fn): Rename to...
33419 (expand_vec_cond_optab_fn): ...this, replacing old macro.
33420 (expand_vec_condu_optab_fn, expand_vec_condeq_optab_fn): Delete.
33421 (expand_vect_cond_mask_optab_fn): Rename to...
33422 (expand_vec_cond_mask_optab_fn): ...this, replacing old macro.
33423 (direct_vec_cond_mask_optab_supported_p): Treat the optab as a
33425 (direct_vec_cond_optab_supported_p): Likewise.
33426 (direct_vec_condu_optab_supported_p): Delete.
33427 (direct_vec_condeq_optab_supported_p): Delete.
33428 * gimple-isel.cc: Include internal-fn.h.
33429 (gimple_expand_vec_cond_expr): Check that IFN_VCONDEQ is supported
33432 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33434 PR tree-optimization/98560
33435 * gimple-isel.cc (gimple_expand_vec_cond_expr): If we fail to use
33436 IFN_VCOND{,U,EQ}, fall back on IFN_VCOND_MASK.
33438 2021-01-07 Uroš Bizjak <ubizjak@gmail.com>
33440 * config/i386/i386.md (insn): Merge from plusminus_insn, shift_insn,
33441 rotate_insn and optab code attributes.
33442 Update all uses to merged code attribute.
33443 * config/i386/sse.md: Update all uses to merged code attribute.
33444 * config/i386/mmx.md: Update all uses to merged code attribute.
33446 2021-01-07 Jakub Jelinek <jakub@redhat.com>
33448 PR tree-optimization/98568
33449 * gimple-ssa-store-merging.c (bswap_view_convert): New function.
33450 (bswap_replace): Use it.
33452 2021-01-06 Vladimir N. Makarov <vmakarov@redhat.com>
33454 PR rtl-optimization/97978
33455 * lra-int.h (lra_hard_reg_split_p): New external.
33456 * lra.c (lra_hard_reg_split_p): New global.
33457 (lra): Set up lra_hard_reg_split_p after splitting a hard reg.
33458 * lra-assigns.c (lra_assign): Don't check allocation correctness
33459 after hard reg splitting.
33461 2021-01-06 Martin Sebor <msebor@redhat.com>
33464 * builtins.c (new_delete_mismatch_p): New overload.
33465 (new_delete_mismatch_p (tree, tree)): Call it.
33467 2021-01-06 Alexandre Oliva <oliva@adacore.com>
33469 * Makefile.in (T_GLIMITS_H): New.
33470 (stmp-int-hdrs): Depend on it, use it.
33471 * config/t-vxworks (T_GLIMITS_H): Override it.
33472 (vxw-glimits.h): New.
33474 2021-01-06 Richard Biener <rguenther@suse.de>
33476 PR tree-optimization/98513
33477 * value-range.cc (intersect_ranges): Compare the upper bounds
33478 for the expected relation.
33480 2021-01-06 Gerald Pfeifer <gerald@pfeifer.com>
33483 2020-12-28 Gerald Pfeifer <gerald@pfeifer.com>
33485 * doc/standards.texi (HSAIL): Remove section.
33487 2021-01-05 Samuel Thibault <samuel.thibault@ens-lyon.org>
33489 * configure: Re-generate.
33491 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33493 * doc/invoke.texi (-std=c++20): Adjust for the publication of
33494 ISO 14882:2020 standard.
33495 * doc/standards.texi: Likewise.
33497 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33499 PR tree-optimization/94802
33500 * expr.h (maybe_optimize_sub_cmp_0): Declare.
33501 * expr.c: Include tree-pretty-print.h and flags.h.
33502 (maybe_optimize_sub_cmp_0): New function.
33503 (do_store_flag): Use it.
33504 * cfgexpand.c (expand_gimple_cond): Likewise.
33506 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33508 * mux-utils.h (pointer_mux::m_ptr): Tweak description of contents.
33509 * rtlanal.c (simple_regno_set): Tweak description to clarify the
33512 2021-01-05 Richard Biener <rguenther@suse.de>
33514 PR tree-optimization/98516
33515 * tree-vect-slp.c (vect_optimize_slp): Permute the incoming
33516 lanes when materializing on a VEC_PERM node.
33517 (vectorizable_slp_permutation): Dump the permute properly.
33519 2021-01-05 Richard Biener <rguenther@suse.de>
33521 * tree-vect-slp.c (vect_slp_region): Move debug counter
33522 to cover individual subgraphs.
33524 2021-01-05 Richard Biener <rguenther@suse.de>
33526 PR tree-optimization/98428
33527 * tree-vect-slp.c (vect_build_slp_tree_1): Properly reject
33528 vector lane extracts for loop vectorization.
33530 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33532 PR tree-optimization/98514
33533 * tree-ssa-reassoc.c (bb_rank): Change type from long * to
33535 (operand_rank): Change type from hash_map<tree, long> to
33536 hash_map<tree, int64_t>.
33537 (phi_rank): Change return type from long to int64_t.
33538 (loop_carried_phi): Change block_rank variable type from long to
33540 (propagate_rank): Change return type, rank parameter type and
33541 op_rank variable type from long to int64_t.
33542 (find_operand_rank): Change return type from long to int64_t
33543 and change slot variable type from long * to int64_t *.
33544 (insert_operand_rank): Change rank parameter type from long to
33546 (get_rank): Change return type and rank variable type from long to
33547 int64_t. Use PRId64 instead of ld to print the rank.
33548 (init_reassoc): Change rank variable type from long to int64_t
33549 and adjust correspondingly bb_rank and operand_rank initialization.
33551 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33553 PR tree-optimization/96928
33554 * tree-ssa-phiopt.c (xor_replacement): New function.
33555 (tree_ssa_phiopt_worker): Call it.
33557 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33559 PR tree-optimization/96930
33560 * match.pd ((A / (1 << B)) -> (A >> B)): If A is extended
33561 from narrower value which has the same type as 1 << B, perform
33562 the right shift on the narrower value followed by extension.
33564 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33566 PR tree-optimization/96239
33567 * gimple-ssa-store-merging.c (maybe_optimize_vector_constructor): New
33569 (get_status_for_store_merging): Don't return BB_INVALID for blocks
33570 with potential bswap optimizable CONSTRUCTORs.
33571 (pass_store_merging::execute): Optimize vector CONSTRUCTORs with bswap
33574 2021-01-05 Richard Biener <rguenther@suse.de>
33576 PR tree-optimization/98381
33577 * tree.c (vector_element_bits): Properly compute bool vector
33579 * tree-vect-loop.c (vectorizable_live_operation): Properly
33580 compute the last lane bit offset.
33582 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
33585 * config/i386/sse.md (sse_cvtps2pi): Redefine as define_insn_and_split.
33586 Clear the top 64 bytes of the input XMM register.
33587 (sse_cvttps2pi): Ditto.
33589 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
33592 * config/i386/xopintrin.h (_mm256_cmov_si256): New.
33594 2021-01-05 H.J. Lu <hjl.tools@gmail.com>
33597 * config/i386/xmmintrin.h (_mm_extract_pi16): Cast to unsigned
33600 2021-01-05 Claudiu Zissulescu <claziss@synopsys.com>
33602 * config/arc/arc.md (maddsidi4_split): Use ACC_REG_FIRST.
33603 (umaddsidi4_split): Likewise.
33605 2021-01-05 liuhongt <hongtao.liu@intel.com>
33608 * config/i386/sse.md (*sse2_pmovskb_zexthisi): New
33609 define_insn_and_split for zero_extend of subreg HI of pmovskb
33611 (*sse2_pmovskb_zexthisi): Add new combine splitters for
33612 zero_extend of not of subreg HI of pmovskb result.
33614 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33617 * explow.c (convert_memory_address_addr_space_1): Handle UNSPECs
33619 * config/aarch64/aarch64.c (aarch64_expand_mov_immediate): Use
33620 convert_memory_address to convert symbolic immediates to ptr_mode
33621 before forcing them to memory.
33623 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33625 PR rtl-optimization/97144
33626 * recog.c (constrain_operands): Initialize matching_operand
33627 for each alternative, rather than only doing it once.
33629 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33631 PR rtl-optimization/98403
33632 * rtl-ssa/changes.cc (function_info::finalize_new_accesses): Explain
33633 why we don't remove call clobbers.
33634 (function_info::apply_changes_to_insn): Don't attempt to add
33635 call clobbers here.
33637 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33639 PR tree-optimization/98371
33640 * tree-vect-loop.c (vect_reanalyze_as_main_loop): New function.
33641 (vect_analyze_loop): If an epilogue loop appears to be cheaper
33642 than the main loop, re-analyze it as a main loop before adopting
33645 2021-01-05 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
33648 * configure.ac (NETLIBS): Determine using AX_LIB_SOCKET_NSL.
33649 * aclocal.m4, configure: Regenerate.
33650 * Makefile.in (NETLIBS): Define.
33651 (BACKEND): Remove $(CODYLIB).
33653 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33655 PR rtl-optimization/98334
33656 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
33657 Optimize (X - 1) * Y + Y to X * Y or (X + 1) * Y - Y to X * Y.
33659 2021-01-05 Bernd Edlinger <bernd.edlinger@hotmail.de>
33661 * tree-inline.c (expand_call_inline): Restore input_location.
33662 Return result from recursive call.
33664 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33666 PR tree-optimization/95401
33667 * config/aarch64/aarch64-sve-builtins.cc
33668 (gimple_folder::load_store_cookie): Use bits rather than bytes
33669 for the alignment argument to IFN_MASK_LOAD and IFN_MASK_STORE.
33670 * gimple-fold.c (gimple_fold_mask_load_store_mem_ref): Likewise.
33671 * tree-vect-stmts.c (vectorizable_store): Likewise.
33672 (vectorizable_load): Likewise.
33674 2021-01-04 Richard Biener <rguenther@suse.de>
33676 PR tree-optimization/98308
33677 * tree-vect-stmts.c (vectorizable_load): Set invariant mask
33680 2021-01-04 Jakub Jelinek <jakub@redhat.com>
33682 PR tree-optimization/95771
33683 * tree-ssa-loop-niter.c (number_of_iterations_popcount): Handle types
33684 with precision smaller than int's precision and types with precision
33685 twice as large as long long. Formatting fixes.
33687 2021-01-04 Richard Biener <rguenther@suse.de>
33689 PR tree-optimization/98464
33690 * tree-ssa-sccvn.c (vn_valueize_for_srt): Rename from ...
33691 (vn_valueize_wrapper): ... this. Temporarily adjust vn_context_bb.
33692 (process_bb): Adjust.
33694 2021-01-04 Matthew Malcomson <matthew.malcomson@arm.com>
33697 * doc/invoke.texi (-fsanitize=address): Fix wording describing
33698 clash with -fsanitize=hwaddress.
33700 2021-01-04 Richard Biener <rguenther@suse.de>
33702 PR tree-optimization/98282
33703 * tree-ssa-sccvn.c (vn_get_stmt_kind): Classify tcc_reference on
33704 invariants as VN_NARY.
33706 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33709 * config/aarch64/aarch64-simd.md (aarch64_combine<mode>): Accept
33710 aarch64_simd_reg_or_zero for operand 2. Use the combinez patterns
33711 to handle zero operands.
33713 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33715 * config/aarch64/aarch64.c (offset_6bit_signed_scaled_p): New function.
33716 (offset_6bit_unsigned_scaled_p): Fix typo in comment.
33717 (aarch64_sve_prefetch_operand_p): Accept MUL VLs in the range
33720 2021-01-04 Richard Biener <rguenther@suse.de>
33722 PR tree-optimization/98393
33723 * tree-vect-slp.c (vect_build_slp_tree): Properly zero matches
33724 when hitting the limit.
33726 2021-01-04 Richard Biener <rguenther@suse.de>
33728 PR tree-optimization/98291
33729 * tree-vect-loop.c (vectorizable_reduction): Bypass
33730 associativity check for SLP reductions with VF 1.
33732 2021-01-04 Jakub Jelinek <jakub@redhat.com>
33734 PR tree-optimization/96782
33735 * match.pd (x == ~x -> false, x != ~x -> true): New simplifications.
33737 2021-01-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
33739 * collect-utils.c (collect_execute): Check dumppfx.
33740 * collect2.c (maybe_run_lto_and_relink, do_link): Pass atsuffix
33741 to collect_execute.
33742 (do_link): Add new parameter atsuffix.
33743 (main): Handle -dumpdir option. Skip one argument for
33744 -o, -isystem and -B options.
33745 * gcc.c (make_at_file): New helper function.
33746 (close_at_file): Use it.
33748 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33750 * config/darwin.h (MIN_LD64_NO_COAL_SECTS): Adjust.
33751 Amend handling for LD64_VERSION fallback defaults.
33753 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33755 * config.gcc: Compute default version information
33756 from the configured target. Likewise defaults for
33758 * config/darwin10.h: Removed.
33759 * config/darwin12.h: Removed.
33760 * config/darwin9.h: Removed.
33761 * config/rs6000/darwin8.h: Removed.
33763 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33765 * config/darwin9.h (ASM_OUTPUT_ALIGNED_COMMON): Delete.
33767 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33769 * config/darwin9.h (STACK_CHECK_STATIC_BUILTIN): Move from here..
33770 * config/darwin.h (STACK_CHECK_STATIC_BUILTIN): .. to here.
33772 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33774 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move from
33776 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): ... to here.
33778 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33780 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move the spec
33781 for the Darwin10 unwinder stub from here ...
33782 * config/darwin.h (LINK_COMMAND_SPEC_A): ... to here.
33784 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33786 * config/darwin.h (DSYMUTIL_SPEC): Default to DWARF
33787 (ASM_DEBUG_SPEC):Only define if the assembler supports
33789 (PREFERRED_DEBUGGING_TYPE): Default to DWARF.
33790 (DARWIN_PREFER_DWARF): Define.
33791 * config/darwin9.h (PREFERRED_DEBUGGING_TYPE): Remove.
33792 (DARWIN_PREFER_DWARF): Likewise
33793 (DSYMUTIL_SPEC): Likewise.
33794 (COLLECT_RUN_DSYMUTIL): Likewise.
33795 (ASM_DEBUG_SPEC): Likewise.
33796 (ASM_DEBUG_OPTION_SPEC): Likewise.
33798 2021-01-02 Jan Hubicka <jh@suse.cz>
33800 * cfg.c (free_block): ggc_free bb.
33802 2021-01-01 Jakub Jelinek <jakub@redhat.com>
33804 * gcc.c (process_command): Update copyright notice dates.
33805 * gcov-dump.c (print_version): Ditto.
33806 * gcov.c (print_version): Ditto.
33807 * gcov-tool.c (print_version): Ditto.
33808 * gengtype.c (create_file): Ditto.
33809 * doc/cpp.texi: Bump @copying's copyright year.
33810 * doc/cppinternals.texi: Ditto.
33811 * doc/gcc.texi: Ditto.
33812 * doc/gccint.texi: Ditto.
33813 * doc/gcov.texi: Ditto.
33814 * doc/install.texi: Ditto.
33815 * doc/invoke.texi: Ditto.
33817 2021-01-01 Jakub Jelinek <jakub@redhat.com>
33819 * ChangeLog-2020: Rotate ChangeLog. New file.
33822 Copyright (C) 2021 Free Software Foundation, Inc.
33824 Copying and distribution of this file, with or without modification,
33825 are permitted in any medium without royalty provided the copyright
33826 notice and this notice are preserved.